CsGy1G005280.1 (mRNA) Cucumber (Gy14) v2

Name: CsGy1G005280.1
Type: mRNA
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Chr1 : 3505690 .. 3508455 (-)
Sequence length: 2406
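
The location and sequence length above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this) and that the difference between the genomic span and the mRNA length is intronic sequence. The variable names are illustrative only.

```python
# Coordinates and length copied from the record above.
chrom_start = 3505690
chrom_end = 3508455
mrna_length = 2406

# Assumption: 1-based, inclusive coordinates, so the span includes both ends.
genomic_span = chrom_end - chrom_start + 1

# Assumption: whatever is in the genomic span but not in the mRNA is intron.
intronic_bases = genomic_span - mrna_length

print(f"genomic span: {genomic_span} bp")
print(f"intronic bases: {intronic_bases} bp")
```

Under those assumptions the feature spans 2766 bp on the minus strand of Chr1, of which 360 bp would be intronic.
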
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGACTTCAGGGCAGTGGCTGGAGAAGGCGTTGGATGATCTCTGCAAGAAGATGGAAACTGGTTGGGGTCTCGATAAGGATATGATTTCTGGCTTGGTTTCATACTGTGAGCTCGCCCAGCCCCAAGACGCTAAAGAGTATCTTGATGTAAATTTCTTAGCTCGGCTTCCTTTTTTTGTTTTTTTTTTTTTGTGCAATAATCTGAATGATTGAGATATGTTTGTTCCTATCGATGAACTTCGCTTCTGGCTTTAGATATTGTCGCTGTATTAGTAGTCTTCGCACCGTCGCTTTGAGAGCTTTATATTTCTAATTCTTTCACCGACCTTTTTCTTCCGTTATCTTCCCACTCTAGGCAATCAAATGAATTTTATCGCAATTCAAAACTTTGTTAACAAAACGCTGATATCCCCACGTAGATTGGTTTCCTCTGTCGCGACTGTGGACAATGTGTCCAATTTTTCCTTCACCAAAATTGGAACTTTCGCTCCTTTCAATCCTGTTCAGTTGCTTAATGATTTTGTTAAATTGGGAAAATTCTCTTTGAGAAACACGAAAGTTCTACACGCTAAGTTGCTCCGAGAAACTCTTCGTTTCGATATCTATGTTTCAAATTCTTTGCTACATTTGTACTCCAAATCTAACGCTATGGACCATGCAATCAAACTTTTTGATACAATCCTATACCCAAATGTTATTTCTTGGAATACCATTATCACGGGTTTGAACAACAATTTCTTACATTTGGACTCGTTGAGAACATTTTGCTGGATGCATTTCCTGGGTTTTAAACCTAATGAGGTAACATGTGGGAGTGTTTTATCTGCTTGTGCTGCCATTCAAGCCTCAATGTTTGGCAAGCAGGTTTATTCACTTGCTGTGAGAAATGGGTTCTTCGATAACGGTTATGTTCGAACCGTAATGATTGATTTATTTGCAAAAGATTCTAAATTTTTGGATGCTCTAAGGGTGTTTCATGATGTTGATTGTGCGAATGTGGTGTGTTGGAATGCTATTGTCTCTGCAGCGGTAACAAATGGGGAGAATCTGATGGCTTTGGATCTTTTCAACAGAATGTGTAGTAAATTTCTGGAGCCTAATAGTTTCACCTTTTCTAGTGTTCTAACTGCGTGTTCTGCACTTCAAGATCTTGAATTTGGGAAAAAGGTTCAAGGGAGAGTGATTAAATGTGGTGGAGGAGACGTTTTTGTTGAGACAGCCCTTGTTAGTTTGTACGCTAAGTGTGGAGACATGGATGAGGCTGTTAAGACATTCTTGCAGATGCCCATTCGCAATGTGGTCTCGTGGACAGTTATAATGTCTGGCTTTGTGCAAAATAATGATTATTTAATGGTCATCAAGTTTTTTGAAGATTTGAGAAAAGTAGGAGAGGAAATTAATAGCTACACAGTTACTACCCTGTTAAGGGCATGTGCTAATCCAGCCATGAGAAAAGAGGCAACCCAACTTCACTCCTGGATTCTAAAAGCTGGTTTTTCTTCACATTCAGAGGTGGCGGCTGCTTTAATTATTATGTATTCAAAAATAGGAGCAGTTGATCTTTCATTGATGATTTTTAGAGAGATGGATAATCATAGGAATCTCAGTTCTTGGACAGCCATGATATTGTCGTTTGCAAAAAATAATGATAAAGAGGAAGCAAGTGATTTGTTCCGAAAAATGTTAAGGGAAAGAATGGGACCAGATTCAGTATGTACTTCCGCCCTCTTGAGTTTGACTGACTGTATTACTTTTGGGAGGCAGATACACTGCTACGCACTTAAAACTGAATTAATATTTAATGTTCATGTTGGGAGTTCTCTTCTTACAATGTATTCTAAATGTGGCCATCTAAAGGAAGCTTTTCAAGTTTTTGAAAACATGCCAGAGAAAGACAATGTTTCCTGGACCTTAATGATTTCCTGCTTCTCAGAACATGGCTATGCAAAAGATGCCATTCAATTATTTAGAGAAATGTTGTTAGAATGTGTACCTGA
TGGTACGTCTTTGAGTGCAGTCCTAACTGCATGCTACGCCCTTCCTTCTATTCAATTAGGTAGAGAAATTCATGGTTACTCAGTTCGTGTCGGACTTAATGAAAACGTAGCTTTGGGAAGTTCGCTTGTGACTATGTACTCAAAATGTGGTAACCTGGCATTGGCTAGGAGGGTGTTTGAAACATTGCCCCAGAAAGATGATATTGTGTGTTCTTCATTGGTTTCAGGATATGCTCAACAAAAGTGCATCAAAGAAGCTCTTTTGCTATTTCGCAGTCTACTAGTGGCTGGCTTAGCCATTGATCCCTTCTCAATCTCGTCCATATTGGGAGCTATTGCACTTTTAAATAGGCCTGCAATTGGGACTCAAATCCATGCACTCATTGTGAAAGTAGGCTTGGAGAAAGATGTATCTGTTGGGAGTTCGCTAGTAATGGTATACTCCAGATGTGGAAGTATAGAAGACTGCTGCAAAGCATTTGGGCAGATTGGAAAGCCCGATTTGATAGGTTGGACATCCATGATTGTCAGTTATGCTCAGCATGGGAAAGGTGCTGAAGCTTTATGTGCCTATGAACTTATGAAGAAAGAAGGATTCAAGCCTGATCCAGTCACCTTTGTTGGGGTTTTGTCTGCTTGTAGCCATAATGGTTTGGTCGATGAAGCCTATTTCCACCTCAATTCAATGGTGGAAGACTATGGTATACAACCAGGATATCGACATTATGTATGTATGGTAGATCTTCTTGGCCGGTGTGGAAACTGA

mRNA sequence

ATGGCGACTTCAGGGCAGTGGCTGGAGAAGGCGTTGGATGATCTCTGCAAGAAGATGGAAACTGGTTGGGGTCTCGATAAGGATATGATTTCTGGCTTGGTTTCATACTGTGAGCTCGCCCAGCCCCAAGACGCTAAAGAGTATCTTGATTTGCTTAATGATTTTGTTAAATTGGGAAAATTCTCTTTGAGAAACACGAAAGTTCTACACGCTAAGTTGCTCCGAGAAACTCTTCGTTTCGATATCTATGTTTCAAATTCTTTGCTACATTTGTACTCCAAATCTAACGCTATGGACCATGCAATCAAACTTTTTGATACAATCCTATACCCAAATGTTATTTCTTGGAATACCATTATCACGGGTTTGAACAACAATTTCTTACATTTGGACTCGTTGAGAACATTTTGCTGGATGCATTTCCTGGGTTTTAAACCTAATGAGGTAACATGTGGGAGTGTTTTATCTGCTTGTGCTGCCATTCAAGCCTCAATGTTTGGCAAGCAGGTTTATTCACTTGCTGTGAGAAATGGGTTCTTCGATAACGGTTATGTTCGAACCGTAATGATTGATTTATTTGCAAAAGATTCTAAATTTTTGGATGCTCTAAGGGTGTTTCATGATGTTGATTGTGCGAATGTGGTGTGTTGGAATGCTATTGTCTCTGCAGCGGTAACAAATGGGGAGAATCTGATGGCTTTGGATCTTTTCAACAGAATGTGTAGTAAATTTCTGGAGCCTAATAGTTTCACCTTTTCTAGTGTTCTAACTGCGTGTTCTGCACTTCAAGATCTTGAATTTGGGAAAAAGGTTCAAGGGAGAGTGATTAAATGTGGTGGAGGAGACGTTTTTGTTGAGACAGCCCTTGTTAGTTTGTACGCTAAGTGTGGAGACATGGATGAGGCTGTTAAGACATTCTTGCAGATGCCCATTCGCAATGTGGTCTCGTGGACAGTTATAATGTCTGGCTTTGTGCAAAATAATGATTATTTAATGGTCATCAAGTTTTTTGAAGATTTGAGAAAAGTAGGAGAGGAAATTAATAGCTACACAGTTACTACCCTGTTAAGGGCATGTGCTAATCCAGCCATGAGAAAAGAGGCAACCCAACTTCACTCCTGGATTCTAAAAGCTGGTTTTTCTTCACATTCAGAGGTGGCGGCTGCTTTAATTATTATGTATTCAAAAATAGGAGCAGTTGATCTTTCATTGATGATTTTTAGAGAGATGGATAATCATAGGAATCTCAGTTCTTGGACAGCCATGATATTGTCGTTTGCAAAAAATAATGATAAAGAGGAAGCAAGTGATTTGTTCCGAAAAATGTTAAGGGAAAGAATGGGACCAGATTCAGTATGTACTTCCGCCCTCTTGAGTTTGACTGACTGTATTACTTTTGGGAGGCAGATACACTGCTACGCACTTAAAACTGAATTAATATTTAATGTTCATGTTGGGAGTTCTCTTCTTACAATGTATTCTAAATGTGGCCATCTAAAGGAAGCTTTTCAAGTTTTTGAAAACATGCCAGAGAAAGACAATGTTTCCTGGACCTTAATGATTTCCTGCTTCTCAGAACATGGCTATGCAAAAGATGCCATTCAATTATTTAGAGAAATGTTGTTAGAATGTGTACCTGATGGTACGTCTTTGAGTGCAGTCCTAACTGCATGCTACGCCCTTCCTTCTATTCAATTAGGTAGAGAAATTCATGGTTACTCAGTTCGTGTCGGACTTAATGAAAACGTAGCTTTGGGAAGTTCGCTTGTGACTATGTACTCAAAATGTGGTAACCTGGCATTGGCTAGGAGGGTGTTTGAAACATTGCCCCAGAAAGATGATATTGTGTGTTCTTCATTGGTTTCAGGATATGCTCAACAAAAGTGCATCAAAGAAGCTCTTTTGCTATTTCGCAGTCTACTAGTGGCTGGCTTAGCCATTGATCCCTTCTCAATCTCGTCCATATTGGGAGCTATTGCACTTTTAAATAGGCCTGCAAT
TGGGACTCAAATCCATGCACTCATTGTGAAAGTAGGCTTGGAGAAAGATGTATCTGTTGGGAGTTCGCTAGTAATGGTATACTCCAGATGTGGAAGTATAGAAGACTGCTGCAAAGCATTTGGGCAGATTGGAAAGCCCGATTTGATAGGTTGGACATCCATGATTGTCAGTTATGCTCAGCATGGGAAAGGTGCTGAAGCTTTATGTGCCTATGAACTTATGAAGAAAGAAGGATTCAAGCCTGATCCAGTCACCTTTGTTGGGGTTTTGTCTGCTTGTAGCCATAATGGTTTGGTCGATGAAGCCTATTTCCACCTCAATTCAATGGTGGAAGACTATGGTATACAACCAGGATATCGACATTATGTATGTATGGTAGATCTTCTTGGCCGGTGTGGAAACTGA

Coding sequence (CDS)

ATGGCGACTTCAGGGCAGTGGCTGGAGAAGGCGTTGGATGATCTCTGCAAGAAGATGGAAACTGGTTGGGGTCTCGATAAGGATATGATTTCTGGCTTGGTTTCATACTGTGAGCTCGCCCAGCCCCAAGACGCTAAAGAGTATCTTGATTTGCTTAATGATTTTGTTAAATTGGGAAAATTCTCTTTGAGAAACACGAAAGTTCTACACGCTAAGTTGCTCCGAGAAACTCTTCGTTTCGATATCTATGTTTCAAATTCTTTGCTACATTTGTACTCCAAATCTAACGCTATGGACCATGCAATCAAACTTTTTGATACAATCCTATACCCAAATGTTATTTCTTGGAATACCATTATCACGGGTTTGAACAACAATTTCTTACATTTGGACTCGTTGAGAACATTTTGCTGGATGCATTTCCTGGGTTTTAAACCTAATGAGGTAACATGTGGGAGTGTTTTATCTGCTTGTGCTGCCATTCAAGCCTCAATGTTTGGCAAGCAGGTTTATTCACTTGCTGTGAGAAATGGGTTCTTCGATAACGGTTATGTTCGAACCGTAATGATTGATTTATTTGCAAAAGATTCTAAATTTTTGGATGCTCTAAGGGTGTTTCATGATGTTGATTGTGCGAATGTGGTGTGTTGGAATGCTATTGTCTCTGCAGCGGTAACAAATGGGGAGAATCTGATGGCTTTGGATCTTTTCAACAGAATGTGTAGTAAATTTCTGGAGCCTAATAGTTTCACCTTTTCTAGTGTTCTAACTGCGTGTTCTGCACTTCAAGATCTTGAATTTGGGAAAAAGGTTCAAGGGAGAGTGATTAAATGTGGTGGAGGAGACGTTTTTGTTGAGACAGCCCTTGTTAGTTTGTACGCTAAGTGTGGAGACATGGATGAGGCTGTTAAGACATTCTTGCAGATGCCCATTCGCAATGTGGTCTCGTGGACAGTTATAATGTCTGGCTTTGTGCAAAATAATGATTATTTAATGGTCATCAAGTTTTTTGAAGATTTGAGAAAAGTAGGAGAGGAAATTAATAGCTACACAGTTACTACCCTGTTAAGGGCATGTGCTAATCCAGCCATGAGAAAAGAGGCAACCCAACTTCACTCCTGGATTCTAAAAGCTGGTTTTTCTTCACATTCAGAGGTGGCGGCTGCTTTAATTATTATGTATTCAAAAATAGGAGCAGTTGATCTTTCATTGATGATTTTTAGAGAGATGGATAATCATAGGAATCTCAGTTCTTGGACAGCCATGATATTGTCGTTTGCAAAAAATAATGATAAAGAGGAAGCAAGTGATTTGTTCCGAAAAATGTTAAGGGAAAGAATGGGACCAGATTCAGTATGTACTTCCGCCCTCTTGAGTTTGACTGACTGTATTACTTTTGGGAGGCAGATACACTGCTACGCACTTAAAACTGAATTAATATTTAATGTTCATGTTGGGAGTTCTCTTCTTACAATGTATTCTAAATGTGGCCATCTAAAGGAAGCTTTTCAAGTTTTTGAAAACATGCCAGAGAAAGACAATGTTTCCTGGACCTTAATGATTTCCTGCTTCTCAGAACATGGCTATGCAAAAGATGCCATTCAATTATTTAGAGAAATGTTGTTAGAATGTGTACCTGATGGTACGTCTTTGAGTGCAGTCCTAACTGCATGCTACGCCCTTCCTTCTATTCAATTAGGTAGAGAAATTCATGGTTACTCAGTTCGTGTCGGACTTAATGAAAACGTAGCTTTGGGAAGTTCGCTTGTGACTATGTACTCAAAATGTGGTAACCTGGCATTGGCTAGGAGGGTGTTTGAAACATTGCCCCAGAAAGATGATATTGTGTGTTCTTCATTGGTTTCAGGATATGCTCAACAAAAGTGCATCAAAGAAGCTCTTTTGCTATTTCGCAGTCTACTAGTGGCTGGCTTAGCCATTGATCCCTTCTCAATCTCGTCCATATTGGGAGCTATTGCACTTTTAAATAGGCCTGCAAT
TGGGACTCAAATCCATGCACTCATTGTGAAAGTAGGCTTGGAGAAAGATGTATCTGTTGGGAGTTCGCTAGTAATGGTATACTCCAGATGTGGAAGTATAGAAGACTGCTGCAAAGCATTTGGGCAGATTGGAAAGCCCGATTTGATAGGTTGGACATCCATGATTGTCAGTTATGCTCAGCATGGGAAAGGTGCTGAAGCTTTATGTGCCTATGAACTTATGAAGAAAGAAGGATTCAAGCCTGATCCAGTCACCTTTGTTGGGGTTTTGTCTGCTTGTAGCCATAATGGTTTGGTCGATGAAGCCTATTTCCACCTCAATTCAATGGTGGAAGACTATGGTATACAACCAGGATATCGACATTATGTATGTATGGTAGATCTTCTTGGCCGGTGTGGAAACTGA

Protein sequence

MATSGQWLEKALDDLCKKMETGWGLDKDMISGLVSYCELAQPQDAKEYLDLLNDFVKLGKFSLRNTKVLHAKLLRETLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTILYPNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASMFGKQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGENLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCGGGDVFVETALVSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSYTVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFREMDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCITFGRQIHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCFSEHGYAKDAIQLFREMLLECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSSLVTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDPFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCGN
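
The relationship between the coding sequence and the protein sequence above can be spot-checked by translating the first and last few codons of the CDS. This is a minimal sketch that assumes the standard (nuclear) genetic code; `translate` and `CODON_TABLE` are illustrative names, not part of any database tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 15 nucleotides of the CDS listed above.
cds_start = "ATGGCGACTTCAGGGCAGTGG"
cds_end = "CGGTGTGGAAACTGA"

print(translate(cds_start))  # MATSGQW
print(translate(cds_end))    # RCGN
```

Both fragments agree with the ends of the protein sequence (`MATSGQW...` and `...RCGN`). If the CDS covers the full 2406 nt reported for this feature, that is 802 codons, i.e. 801 residues plus the terminal stop codon.
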
BLAST of CsGy1G005280.1 vs. NCBI nr
Match: XP_011649738.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 1411.7 bits (3653), Expect = 0.0e+00
Identity = 702/703 (99.86%), Positives = 703/703 (100.00%), Query Frame = 0

Query: 98  MDHAIKLFDTILYPNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSA 157
           MDHAIKLFDTILYPNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSA
Sbjct: 1   MDHAIKLFDTILYPNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSA 60

Query: 158 CAAIQASMFGKQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCW 217
           CAAIQASMFGKQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCW
Sbjct: 61  CAAIQASMFGKQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCW 120

Query: 218 NAIVSAAVTNGENLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIK 277
           NAIVSAAVTNGENLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIK
Sbjct: 121 NAIVSAAVTNGENLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIK 180

Query: 278 CGGGDVFVETALVSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFF 337
           CGGGDVFVETALVSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQ+NDYLMVIKFF
Sbjct: 181 CGGGDVFVETALVSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQSNDYLMVIKFF 240

Query: 338 EDLRKVGEEINSYTVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKI 397
           EDLRKVGEEINSYTVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKI
Sbjct: 241 EDLRKVGEEINSYTVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKI 300

Query: 398 GAVDLSLMIFREMDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSAL 457
           GAVDLSLMIFREMDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSAL
Sbjct: 301 GAVDLSLMIFREMDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSAL 360

Query: 458 LSLTDCITFGRQIHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSW 517
           LSLTDCITFGRQIHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSW
Sbjct: 361 LSLTDCITFGRQIHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSW 420

Query: 518 TLMISCFSEHGYAKDAIQLFREMLLECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRV 577
           TLMISCFSEHGYAKDAIQLFREMLLECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRV
Sbjct: 421 TLMISCFSEHGYAKDAIQLFREMLLECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRV 480

Query: 578 GLNENVALGSSLVTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLF 637
           GLNENVALGSSLVTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLF
Sbjct: 481 GLNENVALGSSLVTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLF 540

Query: 638 RSLLVAGLAIDPFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRC 697
           RSLLVAGLAIDPFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRC
Sbjct: 541 RSLLVAGLAIDPFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRC 600

Query: 698 GSIEDCCKAFGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVL 757
           GSIEDCCKAFGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVL
Sbjct: 601 GSIEDCCKAFGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVL 660

Query: 758 SACSHNGLVDEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
           SACSHNGLVDEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG
Sbjct: 661 SACSHNGLVDEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 703
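
The percentages in an HSP summary line are simply the matching-column counts divided by the alignment length. The sketch below recomputes them from the fractions; `hsp_fractions` is a hypothetical helper written for this illustration, and the pattern matches any `Label = n/m` pair regardless of how the label is spelled.

```python
import re


def hsp_fractions(line):
    """Return {label: percent} for every 'Label = n/m' pair on an HSP summary line."""
    return {
        label: round(100 * int(num) / int(den), 2)
        for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line)
    }


# Summary line of the first hit (XP_011649738.1); "Query Frame = 0" has no
# fraction, so the pattern skips it.
line = "Identity = 702/703 (99.86%), Positives = 703/703 (100.00%), Query Frame = 0"
print(hsp_fractions(line))  # {'Identity': 99.86, 'Positives': 100.0}
```

The single non-identical column in this hit is the conservative N/S substitution marked `+` in the alignment, which is why positives reach 100% while identity does not.
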

BLAST of CsGy1G005280.1 vs. NCBI nr
Match: XP_008441907.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucumis melo])

HSP 1 Score: 1317.4 bits (3408), Expect = 0.0e+00
Identity = 665/751 (88.55%), Positives = 677/751 (90.15%), Query Frame = 0

Query: 51  LLNDFVKLGKFSLRNTKVLHAKLLRETLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTILY 110
           LLNDFVKLG FSLRNTKVLHAK LR T R DIYVSNSLLH YSKSNAMDHA+KLFDTIL 
Sbjct: 49  LLNDFVKLGNFSLRNTKVLHAKFLRVTPRIDIYVSNSLLHCYSKSNAMDHALKLFDTILN 108

Query: 111 PNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASMFGKQV 170
           PNVISWNTIITG NNNFLHLDSLR FCWMH+LGFKPNEVTCGSVLSACAAIQA+MFGKQV
Sbjct: 109 PNVISWNTIITGFNNNFLHLDSLRIFCWMHYLGFKPNEVTCGSVLSACAAIQATMFGKQV 168

Query: 171 YSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGEN 230
           YSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGE 
Sbjct: 169 YSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGEY 228

Query: 231 LMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCGGGDVFVETALV 290
           LMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGK VQGRVIKCGGGDVFVETALV
Sbjct: 229 LMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKSVQGRVIKCGGGDVFVETALV 288

Query: 291 SLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSY 350
           SLYAKCGDMDEAVK F QMPIRNVVSWTVIMSGFVQNNDYLMVIK FEDLRK+GEEINSY
Sbjct: 289 SLYAKCGDMDEAVKIFFQMPIRNVVSWTVIMSGFVQNNDYLMVIKIFEDLRKIGEEINSY 348

Query: 351 TVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFREM 410
           TVTTLLRACANP MRKEATQLHSWILKAGFSS +EV AALIIMYSKIGA+DLSLM+FREM
Sbjct: 349 TVTTLLRACANPGMRKEATQLHSWILKAGFSSQAEVVAALIIMYSKIGAIDLSLMVFREM 408

Query: 411 DNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCITFGRQI 470
           DNHRNLSSWTAMILS AKNNDKEEASDLFRKMLRE+M PDSVCTS LLSLTDCITFGRQI
Sbjct: 409 DNHRNLSSWTAMILSLAKNNDKEEASDLFRKMLREKMEPDSVCTSTLLSLTDCITFGRQI 468

Query: 471 HCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMIS-CFSEHGY 530
           HCY LKTELIFNV VGSSL TMYSKCGHLKEAFQVFENMPEKDNVSWTLMIS        
Sbjct: 469 HCYTLKTELIFNVSVGSSLFTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISXXXXXXXX 528

Query: 531 AKDAIQLFREMLLECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSSL 590
                                                 REIHGYS+RVGL+ENV+ GSSL
Sbjct: 529 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXREIHGYSIRVGLSENVSFGSSL 588

Query: 591 VTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDP 650
           VTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDP
Sbjct: 589 VTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDP 648

Query: 651 FSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQ 710
           FSISSILG IALL RPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQ
Sbjct: 649 FSISSILGGIALLKRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQ 708

Query: 711 IGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEA 770
           IGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEA
Sbjct: 709 IGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEA 768

Query: 771 YFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
           YFHLNSMVEDYGIQPG RHY C+VDLLGRCG
Sbjct: 769 YFHLNSMVEDYGIQPGCRHYACLVDLLGRCG 799

BLAST of CsGy1G005280.1 vs. NCBI nr
Match: XP_023519257.1 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1149.0 bits (2971), Expect = 0.0e+00
Identity = 573/752 (76.20%), Positives = 640/752 (85.11%), Query Frame = 0

Query: 51  LLNDFVKLGKFSLRNTKVL-HAKLLRETLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTIL 110
           LL+D+VK  K SLRNTKVL              YVSNSLL  YSKSN+            
Sbjct: 49  LLSDYVKSRKCSLRNTKVLXXXXXXXXXXXXXXYVSNSLLDCYSKSNSXXXXXXXXXXXX 108

Query: 111 YPNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASMFGKQ 170
                      +  N+NF++LDS RTFC MHFLGF+P+E+T GSVLSACAAIQA MFGKQ
Sbjct: 109 XXXXXXXXXXXSSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 168

Query: 171 VYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGE 230
           VYSLAVRNGFF NGYVR  MIDLFAKDS FLDALRVFHDVDC NVVCWNAIVSAAV NGE
Sbjct: 169 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 228

Query: 231 NLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCGGGDVFVETAL 290
           N MALDL+N MC  FLEPNSFTFSSVLTAC+AL+  EFGK+VQG+VIKCGG DVFVETAL
Sbjct: 229 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 288

Query: 291 VSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINS 350
           + LY+KCG+MDEAVK FL+MPIRNVVSWT I+SGFVQ NDYLM +KFF+D+RK+GEEINS
Sbjct: 289 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 348

Query: 351 YTVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFRE 410
           YTVT++L ACANPAM KEA QLHSWIL+AGFSSH+ V AALI MYSKIGA+DLS+ +F E
Sbjct: 349 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 408

Query: 411 MDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCITFGRQ 470
           MDN RNLSSWTAMI SFA+NNDKE+AS+LF+KMLRE MGPD+ CTS++LS+TDCITFGRQ
Sbjct: 409 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 468

Query: 471 IHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCFSEHGY 530
           IHC+  KT LIF++ VGS+L TMYSKCG+L+EAF VF+NMP+KDN+SW  M+SCFSEHGY
Sbjct: 469 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHVFKNMPKKDNISWASMMSCFSEHGY 528

Query: 531 AKDAIQLFREMLL-ECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSS 590
           AK+ IQLFREML  E VPD   LS VL AC  L SIQ+GREIH YSVR+GL+++VA+G S
Sbjct: 529 AKEGIQLFREMLFEEYVPDYMILSTVLNACSVLHSIQIGREIHCYSVRLGLDKDVAIGGS 588

Query: 591 LVTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAID 650
           LVTMYSKCGNL +ARRVFETLP+KD+I CSSLVSGYAQ KCIKE +LLF+ LL AGLAID
Sbjct: 589 LVTMYSKCGNLEMARRVFETLPEKDNIACSSLVSGYAQHKCIKETILLFQDLLEAGLAID 648

Query: 651 PFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFG 710
           PFSISSILGAIALLNRP IGTQ+HA+I KVGLEKDVSVGSSLVMVYS+CGSIEDCCKAF 
Sbjct: 649 PFSISSILGAIALLNRPGIGTQLHAIITKVGLEKDVSVGSSLVMVYSKCGSIEDCCKAFE 708

Query: 711 QIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDE 770
           QIGKPDLIGWT+MIVSYAQHGKGAEALC YELMKKEG KPDPVTFVGVLSACSHNGLVDE
Sbjct: 709 QIGKPDLIGWTAMIVSYAQHGKGAEALCVYELMKKEGIKPDPVTFVGVLSACSHNGLVDE 768

Query: 771 AYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
           AYFHLNSMV+DYGIQPG+RHY CMVDLLGRCG
Sbjct: 769 AYFHLNSMVKDYGIQPGHRHYACMVDLLGRCG 800

BLAST of CsGy1G005280.1 vs. NCBI nr
Match: XP_022137435.1 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Momordica charantia]; identical entries: XP_022137436.1, XP_022137437.1, XP_022137439.1)

HSP 1 Score: 1143.3 bits (2956), Expect = 0.0e+00
Identity = 574/754 (76.13%), Positives = 642/754 (85.15%), Query Frame = 0

Query: 49  LDLLNDFVKLGKFSLRNTKVLHAKLLRET-LRFDIYVSNSLLHLYSKSNAMDHAIKLFDT 108
           L LLND+VK  K SL+NTKV+HAKLLR T L   IYV+NSLL  YSKS AMD+A+KLFD 
Sbjct: 47  LQLLNDYVKSRKCSLKNTKVMHAKLLRATLLHSSIYVTNSLLDCYSKSGAMDNALKLFDK 106

Query: 109 ILYPNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASMFG 168
           +L+ NVISWN +I+G N NFL L+S RTFC MHFLGF+P+E+T GSVLSACAA+QA MFG
Sbjct: 107 MLHLNVISWNIMISGFNQNFLFLESWRTFCRMHFLGFEPSEITYGSVLSACAAMQAPMFG 166

Query: 169 KQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTN 228
           KQ+YSL VRNG F NGYVR  MIDLFAKDS F DALRVF+DVDC NVVCWNAIVSAAV N
Sbjct: 167 KQIYSLVVRNGSFVNGYVRAGMIDLFAKDSSFPDALRVFNDVDCENVVCWNAIVSAAVRN 226

Query: 229 GENLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCGGGDVFVET 288
           GEN +ALDLFN MCS FLEPNSFTFSSVLTAC+A++DLEFGK+VQGRVIKCGG DVFVET
Sbjct: 227 GENSVALDLFNTMCSGFLEPNSFTFSSVLTACAAVEDLEFGKRVQGRVIKCGGEDVFVET 286

Query: 289 ALVSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEI 348
           AL+ LYAKCGD+DEAVKTFLQMPIRNVVSWT I+SGFVQ ND  M +K F+D+R +GEEI
Sbjct: 287 ALIDLYAKCGDIDEAVKTFLQMPIRNVVSWTAIISGFVQKNDCFMALKVFKDMRNLGEEI 346

Query: 349 NSYTVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIF 408
           NSYTVT++L ACANPAMRKEA QLHSWILKAGF S++ V +ALI MYSKIG +DLS+M+F
Sbjct: 347 NSYTVTSVLTACANPAMRKEAIQLHSWILKAGFLSYAVVVSALINMYSKIGTIDLSMMVF 406

Query: 409 REMDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCITFG 468
           RE+D+ RNLSSW AMI SFA+N DKE+A +LF+KML+E +GPD+ CTS++LS+TDCITFG
Sbjct: 407 REIDDQRNLSSWAAMITSFAQNMDKEKAIELFQKMLQESIGPDTFCTSSVLSVTDCITFG 466

Query: 469 RQIHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCFSEH 528
           RQIHCY LKT LIF+V VGSSL TMYSKCG+L+EAFQ FENMP+KD+VSW          
Sbjct: 467 RQIHCYTLKTGLIFDVSVGSSLFTMYSKCGYLEEAFQFFENMPKKDSVSWASXXXXXXXX 526

Query: 529 GYAKDAIQLFREMLL-ECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALG 588
                        L  E VPD  +LSAVLT C  L SIQ+GREIHGYSVRVGL ++VA+G
Sbjct: 527 XXXXXXXXXXXXXLFEEYVPDHITLSAVLTVCSVLHSIQIGREIHGYSVRVGLGKDVAIG 586

Query: 589 SSLVTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLA 648
             LVTMYSKCGNL LARRVFETLPQKD I CSSLVSGYAQ K I+EAL LF  LLV GLA
Sbjct: 587 GPLVTMYSKCGNLELARRVFETLPQKDQIACSSLVSGYAQHKRIQEALSLFCDLLVPGLA 646

Query: 649 IDPFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKA 708
           IDPFS+SSILGAIA+L+RP IG Q+HALI+KVGLEKDVSVGSSLVMVYS+CGSIEDCCKA
Sbjct: 647 IDPFSVSSILGAIAVLDRPGIGAQLHALIMKVGLEKDVSVGSSLVMVYSKCGSIEDCCKA 706

Query: 709 FGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLV 768
           F QIGKPDLIGWT+MIVSYAQHGKGAEALC YELMKKEG KPDPVTFVGVLSACSHNGLV
Sbjct: 707 FEQIGKPDLIGWTAMIVSYAQHGKGAEALCVYELMKKEGIKPDPVTFVGVLSACSHNGLV 766

Query: 769 DEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
           DEAYFHLNSMV+DYGIQPGYRHY CMVDLLGRCG
Sbjct: 767 DEAYFHLNSMVKDYGIQPGYRHYACMVDLLGRCG 800

BLAST of CsGy1G005280.1 vs. NCBI nr
Match: XP_023001341.1 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1108.6 bits (2866), Expect = 0.0e+00
Identity = 552/752 (73.40%), Positives = 624/752 (82.98%), Query Frame = 0

Query: 51  LLNDFVKLGKFSLR-NTKVLHAKLLRETLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTIL 110
           LL+D+VK  K SLR                       SLL  YSKSN++DHA+KLF    
Sbjct: 49  LLSDYVKSRKCSLRXXXXXXXXXXXXXXXXXXXXXXXSLLDCYSKSNSLDHALKLFXXXX 108

Query: 111 YPNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASMFGKQ 170
               ISWN +I+  N+NFL+LDS RTFC MHFLGF+P+E+T GSVLSACAAIQA MFGKQ
Sbjct: 109 XXXXISWNILISSFNHNFLYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 168

Query: 171 VYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGE 230
           VYSLAVRNGFF NGYVR  MIDLFAK+S FLDALRVF DVDC NVVCWNAIVSAAV NGE
Sbjct: 169 VYSLAVRNGFFVNGYVRAGMIDLFAKESSFLDALRVFQDVDCENVVCWNAIVSAAVRNGE 228

Query: 231 NLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCGGGDVFVETAL 290
           N MALDL+N MC  FLEPNSFTFSSVLTAC+AL+  EFGK+VQG+VIKCGG DVFVETAL
Sbjct: 229 NFMALDLYNTMCRGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 288

Query: 291 VSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINS 350
           + LY+KCG+MDEAVK FL+MPIRNVVSWT I+SGFVQ NDYLM +KFF+D+RK+GEEINS
Sbjct: 289 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 348

Query: 351 YTVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFRE 410
           YTVT++L ACANPAM KEA QLHSWIL+AGFSSH+ V AALI MYSKIGA+DLS+ +F E
Sbjct: 349 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 408

Query: 411 MDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCITFGRQ 470
           MDN RNLSSWTAMI SFA+NNDKE+AS+LF+KMLRE MGPD+ CTS++LS+TDCITFGRQ
Sbjct: 409 MDNQRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 468

Query: 471 IHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMIS-CFSEHG 530
           IHC+  KT L+F + VGS+L TMYSKCG+L+EAF VF+NMP+KD++SW  M+S       
Sbjct: 469 IHCFTHKTGLVFGISVGSALFTMYSKCGYLEEAFHVFKNMPKKDHISWASMMSXXXXXXX 528

Query: 531 YAKDAIQLFREMLLECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSS 590
                           VPD   L+ VL AC  L SIQ+GREIH YSVR+GL+++VA+G S
Sbjct: 529 XXXXXXXXXXXXXXXXVPDSMILNTVLNACSVLHSIQIGREIHSYSVRLGLDKDVAIGGS 588

Query: 591 LVTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAID 650
           LVTMYSKCGNL +ARRVFETLP+KD+I CSSLVSGYAQ KCIKE +LLF+ LL AGLAID
Sbjct: 589 LVTMYSKCGNLEMARRVFETLPEKDNIACSSLVSGYAQHKCIKETILLFQDLLEAGLAID 648

Query: 651 PFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFG 710
           PFSISSILGAIALLNRP IGTQ+HA+I KVGLEKDVS+GSSLVMVYS+CGSIEDCCKAF 
Sbjct: 649 PFSISSILGAIALLNRPGIGTQLHAIITKVGLEKDVSIGSSLVMVYSKCGSIEDCCKAFE 708

Query: 711 QIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDE 770
           QIGKPDLIGWT+MIVSYAQHGKGAEALC YELMKKEG KPDPVTFVGVLSACSHNGLVDE
Sbjct: 709 QIGKPDLIGWTAMIVSYAQHGKGAEALCVYELMKKEGIKPDPVTFVGVLSACSHNGLVDE 768

Query: 771 AYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
           AYFHLNSMV+DYGIQPG+RHY CMVDLLGRCG
Sbjct: 769 AYFHLNSMVKDYGIQPGHRHYACMVDLLGRCG 800

BLAST of CsGy1G005280.1 vs. TAIR10
Match: AT1G74600.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 730.7 bits (1885), Expect = 9.6e-211
Identity = 366/718 (50.97%), Positives = 515/718 (71.73%), Query Frame = 0

Query: 62  SLRNTKVLHAKLLRE-TLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTILYPNVISWNTII 121
           +LR TK+L A LLR   L FD++++ SLL  YS S +M  A KLFDTI  P+V+S N +I
Sbjct: 63  NLRTTKILQAHLLRRYLLPFDVFLTKSLLSWYSNSGSMADAAKLFDTIPQPDVVSCNIMI 122

Query: 122 TGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASMFGKQVYSLAVRNGFF 181
           +G   + L  +SLR F  MHFLGF+ NE++ GSV+SAC+A+QA +F + V    ++ G+F
Sbjct: 123 SGYKQHRLFEESLRFFSKMHFLGFEANEISYGSVISACSALQAPLFSELVCCHTIKMGYF 182

Query: 182 DNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGENLMALDLFNRM 241
               V + +ID+F+K+ +F DA +VF D   ANV CWN I++ A+ N       DLF+ M
Sbjct: 183 FYEVVESALIDVFSKNLRFEDAYKVFRDSLSANVYCWNTIIAGALRNQNYGAVFDLFHEM 242

Query: 242 CSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCGGGDVFVETALVSLYAKCGDMD 301
           C  F +P+S+T+SSVL AC++L+ L FGK VQ RVIKCG  DVFV TA+V LYAKCG M 
Sbjct: 243 CVGFQKPDSYTYSSVLAACASLEKLRFGKVVQARVIKCGAEDVFVCTAIVDLYAKCGHMA 302

Query: 302 EAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSYTVTTLLRACA 361
           EA++ F ++P  +VVSWTV++SG+ ++ND    ++ F+++R  G EIN+ TVT+++ AC 
Sbjct: 303 EAMEVFSRIPNPSVVSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISACG 362

Query: 362 NPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFREMDNHRNLSSWT 421
            P+M  EA+Q+H+W+ K+GF   S VAAALI MYSK G +DLS  +F ++D+ +  +   
Sbjct: 363 RPSMVCEASQVHAWVFKSGFYLDSSVAAALISMYSKSGDIDLSEQVFEDLDDIQRQNIVN 422

Query: 422 AMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCITFGRQIHCYALKTELI 481
            MI SF+++    +A  LF +ML+E +  D     +LLS+ DC+  G+Q+H Y LK+ L+
Sbjct: 423 VMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVLDCLNLGKQVHGYTLKSGLV 482

Query: 482 FNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCFSEHGYAKDAIQLFREM 541
            ++ VGSSL T+YSKCG L+E++++F+ +P KDN  W  MIS F+E+GY ++AI LF EM
Sbjct: 483 LDLTVGSSLFTLYSKCGSLEESYKLFQGIPFKDNACWASMISGFNEYGYLREAIGLFSEM 542

Query: 542 LLE-CVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSSLVTMYSKCGNL 601
           L +   PD ++L+AVLT C + PS+  G+EIHGY++R G+++ + LGS+LV MYSKCG+L
Sbjct: 543 LDDGTSPDESTLAAVLTVCSSHPSLPRGKEIHGYTLRAGIDKGMDLGSALVNMYSKCGSL 602

Query: 602 ALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDPFSISSILGAI 661
            LAR+V++ LP+ D + CSSL+SGY+Q   I++  LLFR ++++G  +D F+ISSIL A 
Sbjct: 603 KLARQVYDRLPELDPVSCSSLISGYSQHGLIQDGFLLFRDMVMSGFTMDSFAISSILKAA 662

Query: 662 ALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQIGKPDLIGWT 721
           AL +  ++G Q+HA I K+GL  + SVGSSL+ +YS+ GSI+DCCKAF QI  PDLI WT
Sbjct: 663 ALSDESSLGAQVHAYITKIGLCTEPSVGSSLLTMYSKFGSIDDCCKAFSQINGPDLIAWT 722

Query: 722 SMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEAYFHLNSMV 778
           ++I SYAQHGK  EAL  Y LMK++GFKPD VTFVGVLSACSH GLV+E+YFHLNSMV
Sbjct: 723 ALIASYAQHGKANEALQVYNLMKEKGFKPDKVTFVGVLSACSHGGLVEESYFHLNSMV 780

BLAST of CsGy1G005280.1 vs. TAIR10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 366.7 bits (940), Expect = 3.7e-101
Identity = 233/747 (31.19%), Positives = 397/747 (53.15%), Query Frame = 0

Query: 62  SLRNTKVLHAKLLRETLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTILYPNVISWNTIIT 121
           SL   + LH+++L+  L  +  +S  L   Y     +  A K+FD +    + +WN +I 
Sbjct: 100 SLDEGRKLHSQILKLGLDSNGCLSEKLFDFYLFKGDLYGAFKVFDEMPERTIFTWNKMIK 159

Query: 122 GLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASM-FGKQVYSLAVRNGFF 181
            L +  L  +    F  M      PNE T   VL AC     +    +Q+++  +  G  
Sbjct: 160 ELASRNLIGEVFGLFVRMVSENVTPNEGTFSGVLEACRGGSVAFDVVEQIHARILYQGLR 219

Query: 182 DNGYVRTVMIDLFAKDSKFLD-ALRVFHDVDCANVVCWNAIVSAAVTNGENLMALDLFNR 241
           D+  V   +IDL++++  F+D A RVF  +   +   W A++S    N     A+ LF  
Sbjct: 220 DSTVVCNPLIDLYSRNG-FVDLARRVFDGLRLKDHSSWVAMISGLSKNECEAEAIRLFCD 279

Query: 242 MCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCG-GGDVFVETALVSLYAKCGD 301
           M    + P  + FSSVL+AC  ++ LE G+++ G V+K G   D +V  ALVSLY   G+
Sbjct: 280 MYVLGIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLGFSSDTYVCNALVSLYFHLGN 339

Query: 302 MDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSYTVTTLLRA 361
           +  A   F  M  R+ V++  +++G  Q       ++ F+ +   G E +S T+ +L+ A
Sbjct: 340 LISAEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEPDSNTLASLVVA 399

Query: 362 CANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFREMDNHRNLSS 421
           C+         QLH++  K GF+S++++  AL+ +Y+K   ++ +L  F E +   N+  
Sbjct: 400 CSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETE-VENVVL 459

Query: 422 WTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLS----LTDCITFGRQIHCYA 481
           W  M++++   +D   +  +FR+M  E + P+     ++L     L D +  G QIH   
Sbjct: 460 WNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGD-LELGEQIHSQI 519

Query: 482 LKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCFSEHGYAKDAI 541
           +KT    N +V S L+ MY+K G L  A+ +      KD VSWT MI+ ++++ +   A+
Sbjct: 520 IKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFDDKAL 579

Query: 542 QLFREMLLECV-PDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSSLVTMY 601
             FR+ML   +  D   L+  ++AC  L +++ G++IH  +   G + ++   ++LVT+Y
Sbjct: 580 TTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNALVTLY 639

Query: 602 SKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDPFSIS 661
           S+CG +  +   FE     D+I  ++LVSG+ Q    +EAL +F  +   G+  + F+  
Sbjct: 640 SRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFG 699

Query: 662 SILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQIGKP 721
           S + A +       G Q+HA+I K G + +  V ++L+ +Y++CGSI D  K F ++   
Sbjct: 700 SAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTK 759

Query: 722 DLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEAYFHL 781
           + + W ++I +Y++HG G+EAL +++ M     +P+ VT VGVLSACSH GLVD+   + 
Sbjct: 760 NEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYF 819

Query: 782 NSMVEDYGIQPGYRHYVCMVDLLGRCG 801
            SM  +YG+ P   HYVC+VD+L R G
Sbjct: 820 ESMNSEYGLSPKPEHYVCVVDMLTRAG 843

BLAST of CsGy1G005280.1 vs. TAIR10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 356.3 bits (913), Expect = 4.9e-98
Identity = 235/755 (31.13%), Positives = 394/755 (52.19%), Query Frame = 0

Query: 67  KVLHAKLLRETLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTILYPNVISWNTIITGLNNN 126
           +  H++L +  L  D+Y+ N+L++ Y ++     A K+FD +   N +SW  I++G + N
Sbjct: 21  RFFHSRLYKNRLDKDVYLCNNLINAYLETGDSVSARKVFDEMPLRNCVSWACIVSGYSRN 80

Query: 127 FLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQA--SMFGKQVYSLAVRNGFFDNGY 186
             H ++L     M   G   N+    SVL AC  I +   +FG+Q++ L  +  +  +  
Sbjct: 81  GEHKEALVFLRDMVKEGIFSNQYAFVSVLRACQEIGSVGILFGRQIHGLMFKLSYAVDAV 140

Query: 187 VRTVMIDLFAKDSKFLD-ALRVFHDVDCANVVCWNAIVSAAVTNGENLMALDLFNRMCSK 246
           V  V+I ++ K    +  AL  F D++  N V WN+I+S     G+   A  +F+ M   
Sbjct: 141 VSNVLISMYWKCIGSVGYALCAFGDIEVKNSVSWNSIISVYSQAGDQRSAFRIFSSMQYD 200

Query: 247 FLEPNSFTFSS-VLTACSALQ-DLEFGKKVQGRVIKCG-GGDVFVETALVSLYAKCGDMD 306
              P  +TF S V TACS  + D+   +++   + K G   D+FV + LVS +AK G + 
Sbjct: 201 GSRPTEYTFGSLVTTACSLTEPDVRLLEQIMCTIQKSGLLTDLFVGSGLVSAFAKSGSLS 260

Query: 307 EAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSYTVTTLLR--- 366
            A K F QM  RN V+   +M G V+        K F D+  +  +++  +   LL    
Sbjct: 261 YARKVFNQMETRNAVTLNGLMVGLVRQKWGEEATKLFMDMNSM-IDVSPESYVILLSSFP 320

Query: 367 --ACANPAMRKEATQLHSWILKAGFSSHS-EVAAALIIMYSKIGAVDLSLMIFREMDNHR 426
             + A     K+  ++H  ++  G       +   L+ MY+K G++  +  +F  M + +
Sbjct: 321 EYSLAEEVGLKKGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTD-K 380

Query: 427 NLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCIT-----FGRQ 486
           +  SW +MI    +N    EA + ++ M R  + P S   + + SL+ C +      G+Q
Sbjct: 381 DSVSWNSMITGLDQNGCFIEAVERYKSMRRHDILPGSF--TLISSLSSCASLKWAKLGQQ 440

Query: 487 IHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCF--SEH 546
           IH  +LK  +  NV V ++L+T+Y++ G+L E  ++F +MPE D VSW  +I     SE 
Sbjct: 441 IHGESLKLGIDLNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNSIIGALARSER 500

Query: 547 GYAKDAIQLFREMLLECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGS 606
              +  +            +  + S+VL+A  +L   +LG++IHG +++  + +     +
Sbjct: 501 SLPEAVVCFLNAQRAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTEN 560

Query: 607 SLVTMYSKCGNLALARRVFETLPQ-KDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLA 666
           +L+  Y KCG +    ++F  + + +D++  +S++SGY   + + +AL L   +L  G  
Sbjct: 561 ALIACYGKCGEMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQR 620

Query: 667 IDPFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKA 726
           +D F  +++L A A +     G ++HA  V+  LE DV VGS+LV +YS+CG ++   + 
Sbjct: 621 LDSFMYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRF 680

Query: 727 FGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEG-FKPDPVTFVGVLSACSHNGL 786
           F  +   +   W SMI  YA+HG+G EAL  +E MK +G   PD VTFVGVLSACSH GL
Sbjct: 681 FNTMPVRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGL 740

Query: 787 VDEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
           ++E + H  SM + YG+ P   H+ CM D+LGR G
Sbjct: 741 LEEGFKHFESMSDSYGLAPRIEHFSCMADVLGRAG 771
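
The percentages in each HSP summary can be recomputed from the counts printed beside them (for the hit above, 235 identical and 394 positive residues over 755 aligned columns). Below is a minimal sketch of reading the two summary lines, assuming they always follow the layout shown in this report. The function name `parse_hsp` and the dictionary keys are hypothetical and not part of any BLAST tool.

```python
import re

# Patterns for the two HSP summary lines as they appear in this report.
# "Pos\w*" tolerates spelling variants of the "Positives" label.
SCORE_RE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
IDENT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract scores and recompute percentages from one HSP summary."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("line does not match the expected HSP layout")

    identical, aligned = int(ident.group(1)), int(ident.group(2))
    positives = int(ident.group(3))
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "identical": identical,
        "positives": positives,
        "aligned_length": aligned,
        # Percentages are recomputed from the counts, not copied from the text.
        "pct_identity": round(100.0 * identical / aligned, 2),
        "pct_positive": round(100.0 * positives / aligned, 2),
    }
```

Calling `parse_hsp` on the "HSP 1 Score" line and the "Identity" line of any hit in this report returns the bit score, the E-value and both percentages, so hits can be sorted or filtered without re-running the search.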

BLAST of CsGy1G005280.1 vs. TAIR10
Match: AT1G16480.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 350.5 bits (898), Expect = 2.7e-96
Identity = 218/719 (30.32%), Positives = 376/719 (52.29%), Query Frame = 0

Query: 91  LYSKSNAMDHAIKLFDTILYPNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVT 150
           +Y+K   +  A  LFD +   N +SWNT+++G+    L+L+ +  F  M  LG KP+   
Sbjct: 1   MYTKFGRVKPARHLFDIMPVRNEVSWNTMMSGIVRVGLYLEGMEFFRKMCDLGIKPSSFV 60

Query: 151 CGSVLSACAAIQASMF--GKQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHD 210
             S+++AC     SMF  G QV+    ++G   + YV T ++ L+        + +VF +
Sbjct: 61  IASLVTACGR-SGSMFREGVQVHGFVAKSGLLSDVYVSTAILHLYGVYGLVSCSRKVFEE 120

Query: 211 VDCANVVCWNAIVSAAVTNGENLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFG 270
           +   NVV W +++      GE    +D++  M  + +  N  + S V+++C  L+D   G
Sbjct: 121 MPDRNVVSWTSLMVGYSDKGEPEEVIDIYKGMRGEGVGCNENSMSLVISSCGLLKDESLG 180

Query: 271 KKVQGRVIKCG-GGDVFVETALVSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQN 330
           +++ G+V+K G    + VE +L+S+    G++D A   F QM  R+ +SW  I + + QN
Sbjct: 181 RQIIGQVVKSGLESKLAVENSLISMLGSMGNVDYANYIFDQMSERDTISWNSIAAAYAQN 240

Query: 331 NDYLMVIKFFEDLRKVGEEINSYTVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVA 390
                  + F  +R+  +E+NS TV+TLL    +   +K    +H  ++K GF S   V 
Sbjct: 241 GHIEESFRIFSLMRRFHDEVNSTTVSTLLSVLGHVDHQKWGRGIHGLVVKMGFDSVVCVC 300

Query: 391 AALIIMYSKIGAVDLSLMIFREMDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERM 450
             L+ MY+  G    + ++F++M   ++L SW +++ SF  +    +A  L   M+    
Sbjct: 301 NTLLRMYAGAGRSVEANLVFKQMPT-KDLISWNSLMASFVNDGRSLDALGLLCSMISSGK 360

Query: 451 GPDSVC-TSALLS--LTDCITFGRQIHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQ 510
             + V  TSAL +    D    GR +H   + + L +N  +G++L++MY K G + E+ +
Sbjct: 361 SVNYVTFTSALAACFTPDFFEKGRILHGLVVVSGLFYNQIIGNALVSMYGKIGEMSESRR 420

Query: 511 VFENMPEKDNVSWTLMISCFSEHGYAKDAIQLFREMLLECVPDG-TSLSAVLTACYALPS 570
           V   MP +D V+W  +I  ++E      A+  F+ M +E V     ++ +VL+AC  LP 
Sbjct: 421 VLLQMPRRDVVAWNALIGGYAEDEDPDKALAAFQTMRVEGVSSNYITVVSVLSAC-LLPG 480

Query: 571 --IQLGREIHGYSVRVGLNENVALGSSLVTMYSKCGNLALARRVFETLPQKDDIVCSSLV 630
             ++ G+ +H Y V  G   +  + +SL+TMY+KCG+L+ ++ +F  L  ++ I  ++++
Sbjct: 481 DLLERGKPLHAYIVSAGFESDEHVKNSLITMYAKCGDLSSSQDLFNGLDNRNIITWNAML 540

Query: 631 SGYAQQKCIKEALLLFRSLLVAGLAIDPFSISSILGAIALLNRPAIGTQIHALIVKVGLE 690
           +  A     +E L L   +   G+++D FS S  L A A L     G Q+H L VK+G E
Sbjct: 541 AANAHHGHGEEVLKLVSKMRSFGVSLDQFSFSEGLSAAAKLAVLEEGQQLHGLAVKLGFE 600

Query: 691 KDVSVGSSLVMVYSRCGSIEDCCKAFGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYELM 750
            D  + ++   +YS+CG I +  K         L  W  +I +  +HG   E    +  M
Sbjct: 601 HDSFIFNAAADMYSKCGEIGEVVKMLPPSVNRSLPSWNILISALGRHGYFEEVCATFHEM 660

Query: 751 KKEGFKPDPVTFVGVLSACSHNGLVDEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
            + G KP  VTFV +L+ACSH GLVD+   + + +  D+G++P   H +C++DLLGR G
Sbjct: 661 LEMGIKPGHVTFVSLLTACSHGGLVDKGLAYYDMIARDFGLEPAIEHCICVIDLLGRSG 716

BLAST of CsGy1G005280.1 vs. TAIR10
Match: AT2G39620.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 346.7 bits (888), Expect = 3.9e-95
Identity = 205/721 (28.43%), Positives = 374/721 (51.87%), Query Frame = 0

Query: 86  NSLLHLYSKSNAMDHAIKLFDTILYPNVISWNTIITGLNNNFLHLDSLRTFCWM-HFLGF 145
           N L++ YS     D +  +FD++  P V+ WN++I G     LH ++L  F +M    G 
Sbjct: 37  NQLINAYSLFQRQDLSRVIFDSVRDPGVVLWNSMIRGYTRAGLHREALGFFGYMSEEKGI 96

Query: 146 KPNEVTCGSVLSACAAIQASMFGKQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALR 205
            P++ +    L ACA       G +++ L    G   + Y+ T +++++ K    + A +
Sbjct: 97  DPDKYSFTFALKACAGSMDFKKGLRIHDLIAEMGLESDVYIGTALVEMYCKARDLVSARQ 156

Query: 206 VFHDVDCANVVCWNAIVSAAVTNGENLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQD 265
           VF  +   +VV WN +VS    NG +  AL LF+ M S  ++ +  +  +++ A S L+ 
Sbjct: 157 VFDKMHVKDVVTWNTMVSGLAQNGCSSAALLLFHDMRSCCVDIDHVSLYNLIPAVSKLEK 216

Query: 266 LEFGKKVQGRVIKCGGGDVFVETALVSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGF 325
            +  + + G VIK G    F  + L+ +Y  C D+  A   F ++  ++  SW  +M+ +
Sbjct: 217 SDVCRCLHGLVIKKGFIFAF-SSGLIDMYCNCADLYAAESVFEEVWRKDESSWGTMMAAY 276

Query: 326 VQNNDYLMVIKFFEDLRKVGEEINSYTVTTLLRACANPAMRKEATQLHSWILKAGFSSHS 385
             N  +  V++ F+ +R     +N     + L+A A      +   +H + ++ G     
Sbjct: 277 AHNGFFEEVLELFDLMRNYDVRMNKVAAASALQAAAYVGDLVKGIAIHDYAVQQGLIGDV 336

Query: 386 EVAAALIIMYSKIGAVDLSLMIFREMDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLR 445
            VA +L+ MYSK G ++++  +F  +++ R++ SW+AMI S+ +    +EA  LFR M+R
Sbjct: 337 SVATSLMSMYSKCGELEIAEQLFINIED-RDVVSWSAMIASYEQAGQHDEAISLFRDMMR 396

Query: 446 ERMGPDSVCTSALLSLTDCIT---FGRQIHCYALKTELIFNVHVGSSLLTMYSKCGHLKE 505
             + P++V  +++L     +     G+ IHCYA+K ++   +   +++++MY+KCG    
Sbjct: 397 IHIKPNAVTLTSVLQGCAGVAASRLGKSIHCYAIKADIESELETATAVISMYAKCGRFSP 456

Query: 506 AFQVFENMPEKDNVSWTLMISCFSEHGYAKDAIQLFREMLLECV-PDGTSLSAVLTACYA 565
           A + FE +P KD V++  +   +++ G A  A  +++ M L  V PD  ++  +L  C  
Sbjct: 457 ALKAFERLPIKDAVAFNALAQGYTQIGDANKAFDVYKNMKLHGVCPDSRTMVGMLQTCAF 516

Query: 566 LPSIQLGREIHGYSVRVGLNENVALGSSLVTMYSKCGNLALARRVFETLP-QKDDIVCSS 625
                 G  ++G  ++ G +    +  +L+ M++KC  LA A  +F+    +K  +  + 
Sbjct: 517 CSDYARGSCVYGQIIKHGFDSECHVAHALINMFTKCDALAAAIVLFDKCGFEKSTVSWNI 576

Query: 626 LVSGYAQQKCIKEALLLFRSLLVAGLAIDPFSISSILGAIALLNRPAIGTQIHALIVKVG 685
           +++GY      +EA+  FR + V     +  +  +I+ A A L+   +G  +H+ +++ G
Sbjct: 577 MMNGYLLHGQAEEAVATFRQMKVEKFQPNAVTFVNIVRAAAELSALRVGMSVHSSLIQCG 636

Query: 686 LEKDVSVGSSLVMVYSRCGSIEDCCKAFGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYE 745
                 VG+SLV +Y++CG IE   K F +I    ++ W +M+ +YA HG  + A+  + 
Sbjct: 637 FCSQTPVGNSLVDMYAKCGMIESSEKCFIEISNKYIVSWNTMLSAYAAHGLASCAVSLFL 696

Query: 746 LMKKEGFKPDPVTFVGVLSACSHNGLVDEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRC 801
            M++   KPD V+F+ VLSAC H GLV+E       M E + I+    HY CMVDLLG+ 
Sbjct: 697 SMQENELKPDSVSFLSVLSACRHAGLVEEGKRIFEEMGERHKIEAEVEHYACMVDLLGKA 755

BLAST of CsGy1G005280.1 vs. Swiss-Prot
Match: sp|Q9CA56|PP121_ARATH (Pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-E69 PE=3 SV=1)

HSP 1 Score: 730.7 bits (1885), Expect = 1.7e-209
Identity = 366/718 (50.97%), Positives = 515/718 (71.73%), Query Frame = 0

Query: 62  SLRNTKVLHAKLLRE-TLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTILYPNVISWNTII 121
           +LR TK+L A LLR   L FD++++ SLL  YS S +M  A KLFDTI  P+V+S N +I
Sbjct: 63  NLRTTKILQAHLLRRYLLPFDVFLTKSLLSWYSNSGSMADAAKLFDTIPQPDVVSCNIMI 122

Query: 122 TGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASMFGKQVYSLAVRNGFF 181
           +G   + L  +SLR F  MHFLGF+ NE++ GSV+SAC+A+QA +F + V    ++ G+F
Sbjct: 123 SGYKQHRLFEESLRFFSKMHFLGFEANEISYGSVISACSALQAPLFSELVCCHTIKMGYF 182

Query: 182 DNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGENLMALDLFNRM 241
               V + +ID+F+K+ +F DA +VF D   ANV CWN I++ A+ N       DLF+ M
Sbjct: 183 FYEVVESALIDVFSKNLRFEDAYKVFRDSLSANVYCWNTIIAGALRNQNYGAVFDLFHEM 242

Query: 242 CSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCGGGDVFVETALVSLYAKCGDMD 301
           C  F +P+S+T+SSVL AC++L+ L FGK VQ RVIKCG  DVFV TA+V LYAKCG M 
Sbjct: 243 CVGFQKPDSYTYSSVLAACASLEKLRFGKVVQARVIKCGAEDVFVCTAIVDLYAKCGHMA 302

Query: 302 EAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSYTVTTLLRACA 361
           EA++ F ++P  +VVSWTV++SG+ ++ND    ++ F+++R  G EIN+ TVT+++ AC 
Sbjct: 303 EAMEVFSRIPNPSVVSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCTVTSVISACG 362

Query: 362 NPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFREMDNHRNLSSWT 421
            P+M  EA+Q+H+W+ K+GF   S VAAALI MYSK G +DLS  +F ++D+ +  +   
Sbjct: 363 RPSMVCEASQVHAWVFKSGFYLDSSVAAALISMYSKSGDIDLSEQVFEDLDDIQRQNIVN 422

Query: 422 AMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCITFGRQIHCYALKTELI 481
            MI SF+++    +A  LF +ML+E +  D     +LLS+ DC+  G+Q+H Y LK+ L+
Sbjct: 423 VMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVLDCLNLGKQVHGYTLKSGLV 482

Query: 482 FNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCFSEHGYAKDAIQLFREM 541
            ++ VGSSL T+YSKCG L+E++++F+ +P KDN  W  MIS F+E+GY ++AI LF EM
Sbjct: 483 LDLTVGSSLFTLYSKCGSLEESYKLFQGIPFKDNACWASMISGFNEYGYLREAIGLFSEM 542

Query: 542 LLE-CVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSSLVTMYSKCGNL 601
           L +   PD ++L+AVLT C + PS+  G+EIHGY++R G+++ + LGS+LV MYSKCG+L
Sbjct: 543 LDDGTSPDESTLAAVLTVCSSHPSLPRGKEIHGYTLRAGIDKGMDLGSALVNMYSKCGSL 602

Query: 602 ALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDPFSISSILGAI 661
            LAR+V++ LP+ D + CSSL+SGY+Q   I++  LLFR ++++G  +D F+ISSIL A 
Sbjct: 603 KLARQVYDRLPELDPVSCSSLISGYSQHGLIQDGFLLFRDMVMSGFTMDSFAISSILKAA 662

Query: 662 ALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQIGKPDLIGWT 721
           AL +  ++G Q+HA I K+GL  + SVGSSL+ +YS+ GSI+DCCKAF QI  PDLI WT
Sbjct: 663 ALSDESSLGAQVHAYITKIGLCTEPSVGSSLLTMYSKFGSIDDCCKAFSQINGPDLIAWT 722

Query: 722 SMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEAYFHLNSMV 778
           ++I SYAQHGK  EAL  Y LMK++GFKPD VTFVGVLSACSH GLV+E+YFHLNSMV
Sbjct: 723 ALIASYAQHGKANEALQVYNLMKEKGFKPDKVTFVGVLSACSHGGLVEESYFHLNSMV 780

BLAST of CsGy1G005280.1 vs. Swiss-Prot
Match: sp|Q9SVP7|PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 366.7 bits (940), Expect = 6.6e-100
Identity = 233/747 (31.19%), Positives = 397/747 (53.15%), Query Frame = 0

Query: 62  SLRNTKVLHAKLLRETLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTILYPNVISWNTIIT 121
           SL   + LH+++L+  L  +  +S  L   Y     +  A K+FD +    + +WN +I 
Sbjct: 100 SLDEGRKLHSQILKLGLDSNGCLSEKLFDFYLFKGDLYGAFKVFDEMPERTIFTWNKMIK 159

Query: 122 GLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASM-FGKQVYSLAVRNGFF 181
            L +  L  +    F  M      PNE T   VL AC     +    +Q+++  +  G  
Sbjct: 160 ELASRNLIGEVFGLFVRMVSENVTPNEGTFSGVLEACRGGSVAFDVVEQIHARILYQGLR 219

Query: 182 DNGYVRTVMIDLFAKDSKFLD-ALRVFHDVDCANVVCWNAIVSAAVTNGENLMALDLFNR 241
           D+  V   +IDL++++  F+D A RVF  +   +   W A++S    N     A+ LF  
Sbjct: 220 DSTVVCNPLIDLYSRNG-FVDLARRVFDGLRLKDHSSWVAMISGLSKNECEAEAIRLFCD 279

Query: 242 MCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCG-GGDVFVETALVSLYAKCGD 301
           M    + P  + FSSVL+AC  ++ LE G+++ G V+K G   D +V  ALVSLY   G+
Sbjct: 280 MYVLGIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLGFSSDTYVCNALVSLYFHLGN 339

Query: 302 MDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSYTVTTLLRA 361
           +  A   F  M  R+ V++  +++G  Q       ++ F+ +   G E +S T+ +L+ A
Sbjct: 340 LISAEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEPDSNTLASLVVA 399

Query: 362 CANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFREMDNHRNLSS 421
           C+         QLH++  K GF+S++++  AL+ +Y+K   ++ +L  F E +   N+  
Sbjct: 400 CSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETE-VENVVL 459

Query: 422 WTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLS----LTDCITFGRQIHCYA 481
           W  M++++   +D   +  +FR+M  E + P+     ++L     L D +  G QIH   
Sbjct: 460 WNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGD-LELGEQIHSQI 519

Query: 482 LKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCFSEHGYAKDAI 541
           +KT    N +V S L+ MY+K G L  A+ +      KD VSWT MI+ ++++ +   A+
Sbjct: 520 IKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFDDKAL 579

Query: 542 QLFREMLLECV-PDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSSLVTMY 601
             FR+ML   +  D   L+  ++AC  L +++ G++IH  +   G + ++   ++LVT+Y
Sbjct: 580 TTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNALVTLY 639

Query: 602 SKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDPFSIS 661
           S+CG +  +   FE     D+I  ++LVSG+ Q    +EAL +F  +   G+  + F+  
Sbjct: 640 SRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFG 699

Query: 662 SILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQIGKP 721
           S + A +       G Q+HA+I K G + +  V ++L+ +Y++CGSI D  K F ++   
Sbjct: 700 SAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTK 759

Query: 722 DLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEAYFHL 781
           + + W ++I +Y++HG G+EAL +++ M     +P+ VT VGVLSACSH GLVD+   + 
Sbjct: 760 NEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYF 819

Query: 782 NSMVEDYGIQPGYRHYVCMVDLLGRCG 801
            SM  +YG+ P   HYVC+VD+L R G
Sbjct: 820 ESMNSEYGLSPKPEHYVCVVDMLTRAG 843

BLAST of CsGy1G005280.1 vs. Swiss-Prot
Match: sp|Q9FIB2|PP373_ARATH (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 356.3 bits (913), Expect = 8.9e-97
Identity = 235/755 (31.13%), Positives = 394/755 (52.19%), Query Frame = 0

Query: 67  KVLHAKLLRETLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTILYPNVISWNTIITGLNNN 126
           +  H++L +  L  D+Y+ N+L++ Y ++     A K+FD +   N +SW  I++G + N
Sbjct: 21  RFFHSRLYKNRLDKDVYLCNNLINAYLETGDSVSARKVFDEMPLRNCVSWACIVSGYSRN 80

Query: 127 FLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQA--SMFGKQVYSLAVRNGFFDNGY 186
             H ++L     M   G   N+    SVL AC  I +   +FG+Q++ L  +  +  +  
Sbjct: 81  GEHKEALVFLRDMVKEGIFSNQYAFVSVLRACQEIGSVGILFGRQIHGLMFKLSYAVDAV 140

Query: 187 VRTVMIDLFAKDSKFLD-ALRVFHDVDCANVVCWNAIVSAAVTNGENLMALDLFNRMCSK 246
           V  V+I ++ K    +  AL  F D++  N V WN+I+S     G+   A  +F+ M   
Sbjct: 141 VSNVLISMYWKCIGSVGYALCAFGDIEVKNSVSWNSIISVYSQAGDQRSAFRIFSSMQYD 200

Query: 247 FLEPNSFTFSS-VLTACSALQ-DLEFGKKVQGRVIKCG-GGDVFVETALVSLYAKCGDMD 306
              P  +TF S V TACS  + D+   +++   + K G   D+FV + LVS +AK G + 
Sbjct: 201 GSRPTEYTFGSLVTTACSLTEPDVRLLEQIMCTIQKSGLLTDLFVGSGLVSAFAKSGSLS 260

Query: 307 EAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSYTVTTLLR--- 366
            A K F QM  RN V+   +M G V+        K F D+  +  +++  +   LL    
Sbjct: 261 YARKVFNQMETRNAVTLNGLMVGLVRQKWGEEATKLFMDMNSM-IDVSPESYVILLSSFP 320

Query: 367 --ACANPAMRKEATQLHSWILKAGFSSHS-EVAAALIIMYSKIGAVDLSLMIFREMDNHR 426
             + A     K+  ++H  ++  G       +   L+ MY+K G++  +  +F  M + +
Sbjct: 321 EYSLAEEVGLKKGREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTD-K 380

Query: 427 NLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCIT-----FGRQ 486
           +  SW +MI    +N    EA + ++ M R  + P S   + + SL+ C +      G+Q
Sbjct: 381 DSVSWNSMITGLDQNGCFIEAVERYKSMRRHDILPGSF--TLISSLSSCASLKWAKLGQQ 440

Query: 487 IHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCF--SEH 546
           IH  +LK  +  NV V ++L+T+Y++ G+L E  ++F +MPE D VSW  +I     SE 
Sbjct: 441 IHGESLKLGIDLNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNSIIGALARSER 500

Query: 547 GYAKDAIQLFREMLLECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGS 606
              +  +            +  + S+VL+A  +L   +LG++IHG +++  + +     +
Sbjct: 501 SLPEAVVCFLNAQRAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTEN 560

Query: 607 SLVTMYSKCGNLALARRVFETLPQ-KDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLA 666
           +L+  Y KCG +    ++F  + + +D++  +S++SGY   + + +AL L   +L  G  
Sbjct: 561 ALIACYGKCGEMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQR 620

Query: 667 IDPFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKA 726
           +D F  +++L A A +     G ++HA  V+  LE DV VGS+LV +YS+CG ++   + 
Sbjct: 621 LDSFMYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRF 680

Query: 727 FGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEG-FKPDPVTFVGVLSACSHNGL 786
           F  +   +   W SMI  YA+HG+G EAL  +E MK +G   PD VTFVGVLSACSH GL
Sbjct: 681 FNTMPVRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGL 740

Query: 787 VDEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
           ++E + H  SM + YG+ P   H+ CM D+LGR G
Sbjct: 741 LEEGFKHFESMSDSYGLAPRIEHFSCMADVLGRAG 771

BLAST of CsGy1G005280.1 vs. Swiss-Prot
Match: sp|O80647|PP195_ARATH (Pentatricopeptide repeat-containing protein At2g39620 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E33 PE=3 SV=1)

HSP 1 Score: 346.7 bits (888), Expect = 7.1e-94
Identity = 205/721 (28.43%), Positives = 374/721 (51.87%), Query Frame = 0

Query: 86  NSLLHLYSKSNAMDHAIKLFDTILYPNVISWNTIITGLNNNFLHLDSLRTFCWM-HFLGF 145
           N L++ YS     D +  +FD++  P V+ WN++I G     LH ++L  F +M    G 
Sbjct: 37  NQLINAYSLFQRQDLSRVIFDSVRDPGVVLWNSMIRGYTRAGLHREALGFFGYMSEEKGI 96

Query: 146 KPNEVTCGSVLSACAAIQASMFGKQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALR 205
            P++ +    L ACA       G +++ L    G   + Y+ T +++++ K    + A +
Sbjct: 97  DPDKYSFTFALKACAGSMDFKKGLRIHDLIAEMGLESDVYIGTALVEMYCKARDLVSARQ 156

Query: 206 VFHDVDCANVVCWNAIVSAAVTNGENLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQD 265
           VF  +   +VV WN +VS    NG +  AL LF+ M S  ++ +  +  +++ A S L+ 
Sbjct: 157 VFDKMHVKDVVTWNTMVSGLAQNGCSSAALLLFHDMRSCCVDIDHVSLYNLIPAVSKLEK 216

Query: 266 LEFGKKVQGRVIKCGGGDVFVETALVSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGF 325
            +  + + G VIK G    F  + L+ +Y  C D+  A   F ++  ++  SW  +M+ +
Sbjct: 217 SDVCRCLHGLVIKKGFIFAF-SSGLIDMYCNCADLYAAESVFEEVWRKDESSWGTMMAAY 276

Query: 326 VQNNDYLMVIKFFEDLRKVGEEINSYTVTTLLRACANPAMRKEATQLHSWILKAGFSSHS 385
             N  +  V++ F+ +R     +N     + L+A A      +   +H + ++ G     
Sbjct: 277 AHNGFFEEVLELFDLMRNYDVRMNKVAAASALQAAAYVGDLVKGIAIHDYAVQQGLIGDV 336

Query: 386 EVAAALIIMYSKIGAVDLSLMIFREMDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLR 445
            VA +L+ MYSK G ++++  +F  +++ R++ SW+AMI S+ +    +EA  LFR M+R
Sbjct: 337 SVATSLMSMYSKCGELEIAEQLFINIED-RDVVSWSAMIASYEQAGQHDEAISLFRDMMR 396

Query: 446 ERMGPDSVCTSALLSLTDCIT---FGRQIHCYALKTELIFNVHVGSSLLTMYSKCGHLKE 505
             + P++V  +++L     +     G+ IHCYA+K ++   +   +++++MY+KCG    
Sbjct: 397 IHIKPNAVTLTSVLQGCAGVAASRLGKSIHCYAIKADIESELETATAVISMYAKCGRFSP 456

Query: 506 AFQVFENMPEKDNVSWTLMISCFSEHGYAKDAIQLFREMLLECV-PDGTSLSAVLTACYA 565
           A + FE +P KD V++  +   +++ G A  A  +++ M L  V PD  ++  +L  C  
Sbjct: 457 ALKAFERLPIKDAVAFNALAQGYTQIGDANKAFDVYKNMKLHGVCPDSRTMVGMLQTCAF 516

Query: 566 LPSIQLGREIHGYSVRVGLNENVALGSSLVTMYSKCGNLALARRVFETLP-QKDDIVCSS 625
                 G  ++G  ++ G +    +  +L+ M++KC  LA A  +F+    +K  +  + 
Sbjct: 517 CSDYARGSCVYGQIIKHGFDSECHVAHALINMFTKCDALAAAIVLFDKCGFEKSTVSWNI 576

Query: 626 LVSGYAQQKCIKEALLLFRSLLVAGLAIDPFSISSILGAIALLNRPAIGTQIHALIVKVG 685
           +++GY      +EA+  FR + V     +  +  +I+ A A L+   +G  +H+ +++ G
Sbjct: 577 MMNGYLLHGQAEEAVATFRQMKVEKFQPNAVTFVNIVRAAAELSALRVGMSVHSSLIQCG 636

Query: 686 LEKDVSVGSSLVMVYSRCGSIEDCCKAFGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYE 745
                 VG+SLV +Y++CG IE   K F +I    ++ W +M+ +YA HG  + A+  + 
Sbjct: 637 FCSQTPVGNSLVDMYAKCGMIESSEKCFIEISNKYIVSWNTMLSAYAAHGLASCAVSLFL 696

Query: 746 LMKKEGFKPDPVTFVGVLSACSHNGLVDEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRC 801
            M++   KPD V+F+ VLSAC H GLV+E       M E + I+    HY CMVDLLG+ 
Sbjct: 697 SMQENELKPDSVSFLSVLSACRHAGLVEEGKRIFEEMGERHKIEAEVEHYACMVDLLGKA 755

BLAST of CsGy1G005280.1 vs. Swiss-Prot
Match: sp|Q9FWA6|PP207_ARATH (Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E90 PE=2 SV=2)

HSP 1 Score: 333.2 bits (853), Expect = 8.1e-90
Identity = 208/703 (29.59%), Positives = 351/703 (49.93%), Query Frame = 0

Query: 154 VLSACAAIQASMFGKQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCAN 213
           V   CA   A   GKQ ++  + +GF    +V   ++ ++     F+ A  VF  +   +
Sbjct: 54  VFKECAKQGALELGKQAHAHMIISGFRPTTFVLNCLLQVYTNSRDFVSASMVFDKMPLRD 113

Query: 214 VVCWNAIVSAAVTNGENLMALDLFNR-------------------------------MCS 273
           VV W                                                     M  
Sbjct: 114 VVSWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVDMGR 173

Query: 274 KFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCG-GGDVFVETALVSLYAKCGDMDE 333
           + +E +  TF+ +L  CS L+D   G ++ G V++ G   DV   +AL+ +YAK     E
Sbjct: 174 EGIEFDGRTFAIILKVCSFLEDTSLGMQIHGIVVRVGCDTDVVAASALLDMYAKGKRFVE 233

Query: 334 AVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSYTVTTLLRACAN 393
           +++ F  +P +N VSW+ I++G VQNN   + +KFF++++KV   ++     ++LR+CA 
Sbjct: 234 SLRVFQGIPEKNSVSWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQSIYASVLRSCAA 293

Query: 394 PAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFREMDNHRNLSSWTA 453
            +  +   QLH+  LK+ F++   V  A + MY+K   +  + ++F   +N  N  S+ A
Sbjct: 294 LSELRLGGQLHAHALKSDFAADGIVRTATLDMYAKCDNMQDAQILFDNSEN-LNRQSYNA 353

Query: 454 MILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALL---SLTDCITFGRQIHCYALKTE 513
           MI  +++     +A  LF +++   +G D +  S +    +L   ++ G QI+  A+K+ 
Sbjct: 354 MITGYSQEEHGFKALLLFHRLMSSGLGFDEISLSGVFRACALVKGLSEGLQIYGLAIKSS 413

Query: 514 LIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCFSEHGYAKDAIQLFR 573
           L  +V V ++ + MY KC  L EAF+VF+ M  +D VSW  +I+   ++G   + + LF 
Sbjct: 414 LSLDVCVANAAIDMYGKCQALAEAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETLFLFV 473

Query: 574 EMLLECV-PDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSSLVTMYSKCG 633
            ML   + PD  +  ++L AC    S+  G EIH   V+ G+  N ++G SL+ MYSKCG
Sbjct: 474 SMLRSRIEPDEFTFGSILKACTG-GSLGYGMEIHSSIVKSGMASNSSVGCSLIDMYSKCG 533

Query: 634 NLALARRVFETLPQKDDIVC--------------------SSLVSGYAQQKCIKEALLLF 693
            +  A ++     Q+ ++                      +S++SGY  ++  ++A +LF
Sbjct: 534 MIEEAEKIHSRFFQRANVXXXXXXXXXXXXXXXXXXXVSWNSIISGYVMKEQSEDAQMLF 593

Query: 694 RSLLVAGLAIDPFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRC 753
             ++  G+  D F+ +++L   A L    +G QIHA ++K  L+ DV + S+LV +YS+C
Sbjct: 594 TRMMEMGITPDKFTYATVLDTCANLASAGLGKQIHAQVIKKELQSDVYICSTLVDMYSKC 653

Query: 754 GSIEDCCKAFGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVL 801
           G + D    F +  + D + W +MI  YA HGKG EA+  +E M  E  KP+ VTF+ +L
Sbjct: 654 GDLHDSRLMFEKSLRRDFVTWNAMICGYAHHGKGEEAIQLFERMILENIKPNHVTFISIL 713

BLAST of CsGy1G005280.1 vs. TrEMBL
Match: tr|A0A1S3B4I2|A0A1S3B4I2_CUCME (pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103485891 PE=4 SV=1)

HSP 1 Score: 1317.4 bits (3408), Expect = 0.0e+00
Identity = 665/751 (88.55%), Positives = 677/751 (90.15%), Query Frame = 0

Query: 51  LLNDFVKLGKFSLRNTKVLHAKLLRETLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTILY 110
           LLNDFVKLG FSLRNTKVLHAK LR T R DIYVSNSLLH YSKSNAMDHA+KLFDTIL 
Sbjct: 49  LLNDFVKLGNFSLRNTKVLHAKFLRVTPRIDIYVSNSLLHCYSKSNAMDHALKLFDTILN 108

Query: 111 PNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASMFGKQV 170
           PNVISWNTIITG NNNFLHLDSLR FCWMH+LGFKPNEVTCGSVLSACAAIQA+MFGKQV
Sbjct: 109 PNVISWNTIITGFNNNFLHLDSLRIFCWMHYLGFKPNEVTCGSVLSACAAIQATMFGKQV 168

Query: 171 YSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGEN 230
           YSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGE 
Sbjct: 169 YSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGEY 228

Query: 231 LMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCGGGDVFVETALV 290
           LMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGK VQGRVIKCGGGDVFVETALV
Sbjct: 229 LMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKSVQGRVIKCGGGDVFVETALV 288

Query: 291 SLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSY 350
           SLYAKCGDMDEAVK F QMPIRNVVSWTVIMSGFVQNNDYLMVIK FEDLRK+GEEINSY
Sbjct: 289 SLYAKCGDMDEAVKIFFQMPIRNVVSWTVIMSGFVQNNDYLMVIKIFEDLRKIGEEINSY 348

Query: 351 TVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFREM 410
           TVTTLLRACANP MRKEATQLHSWILKAGFSS +EV AALIIMYSKIGA+DLSLM+FREM
Sbjct: 349 TVTTLLRACANPGMRKEATQLHSWILKAGFSSQAEVVAALIIMYSKIGAIDLSLMVFREM 408

Query: 411 DNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCITFGRQI 470
           DNHRNLSSWTAMILS AKNNDKEEASDLFRKMLRE+M PDSVCTS LLSLTDCITFGRQI
Sbjct: 409 DNHRNLSSWTAMILSLAKNNDKEEASDLFRKMLREKMEPDSVCTSTLLSLTDCITFGRQI 468

Query: 471 HCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMIS-CFSEHGY 530
           HCY LKTELIFNV VGSSL TMYSKCGHLKEAFQVFENMPEKDNVSWTLMIS        
Sbjct: 469 HCYTLKTELIFNVSVGSSLFTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISXXXXXXXX 528

Query: 531 AKDAIQLFREMLLECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSSL 590
                                                 REIHGYS+RVGL+ENV+ GSSL
Sbjct: 529 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXREIHGYSIRVGLSENVSFGSSL 588

Query: 591 VTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDP 650
           VTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDP
Sbjct: 589 VTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDP 648

Query: 651 FSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQ 710
           FSISSILG IALL RPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQ
Sbjct: 649 FSISSILGGIALLKRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQ 708

Query: 711 IGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEA 770
           IGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEA
Sbjct: 709 IGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEA 768

Query: 771 YFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
           YFHLNSMVEDYGIQPG RHY C+VDLLGRCG
Sbjct: 769 YFHLNSMVEDYGIQPGCRHYACLVDLLGRCG 799

BLAST of CsGy1G005280.1 vs. TrEMBL
Match: tr|A0A2I4F4H3|A0A2I4F4H3_9ROSI (pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Juglans regia OX=51240 GN=LOC108995418 PE=4 SV=1)

HSP 1 Score: 941.8 bits (2433), Expect = 1.0e-270
Identity = 471/746 (63.14%), Positives = 584/746 (78.28%), Query Frame = 0

Query: 57  KLGKFSLRNTKVLHAKLLRE-TLRFDIYVSNSLLHLYSKSNAMDHAIKLFDTILYPNVIS 116
           K GK  +R  K +H +LL+   L  +I+V+NSLL  Y K   M  A+ LFDT+  PNVIS
Sbjct: 55  KSGKCGVRVAKFIHTQLLKSAALHSNIFVANSLLDWYCKYAGMVDALLLFDTMARPNVIS 114

Query: 117 WNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASMFGKQVYSLAV 176
           WN +I+G N + L  DS + FC MH LGF+PNE+T GSVLSAC A QA +FGKQVYSLA+
Sbjct: 115 WNILISGYNQDHLFEDSWKIFCRMHSLGFEPNEITYGSVLSACTAFQAPIFGKQVYSLAM 174

Query: 177 RNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGENLMALD 236
           +NGFF NGYVRT MIDLF+K+  F DAL VFHDV C NVVCWNAI+S AV NGEN +ALD
Sbjct: 175 KNGFFSNGYVRTGMIDLFSKNFSFEDALGVFHDVFCENVVCWNAIISGAVKNGENRVALD 234

Query: 237 LFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCGGGDVFVETALVSLYAK 296
           LF  MCS    PNS+TFSS+L AC+AL++L+ GK VQG VIKCG GDVFVETA+V LYAK
Sbjct: 235 LFREMCSGSFLPNSYTFSSILGACAALEELDVGKGVQGWVIKCGAGDVFVETAIVDLYAK 294

Query: 297 CGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSYTVTTL 356
           CG M+EAV+ FLQMPIRNVVSWT ++SGFV+  D +  +KFF+D+R++G EIN+YTVT++
Sbjct: 295 CGLMEEAVEEFLQMPIRNVVSWTTVISGFVKKEDSICALKFFKDMRELGVEINNYTVTSV 354

Query: 357 LRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFREMDNHRN 416
           + ACA PAM +EA Q+HSWILKAGF     V AALI MYSKIG  DLS+++F+++ + +N
Sbjct: 355 VTACAKPAMIEEAIQVHSWILKAGFYLDEAVGAALINMYSKIGEFDLSVLVFKDIGSLKN 414

Query: 417 LSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCITFGRQIHCYAL 476
              W AMI + A+N +  EA ++FRKML+E +  D  C S+LLS+  C+  GRQIHCY+L
Sbjct: 415 PGVWVAMISASAQNQNPGEALEIFRKMLQESVRLDKFCISSLLSVIGCLNLGRQIHCYSL 474

Query: 477 KTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCFSEHGYAKDAIQ 536
           KT L+ +V VGS+LLTMYSK G LKE+ +VFE + E+DNVSW  MI+ F+EHG A  AI+
Sbjct: 475 KTGLVSDVSVGSALLTMYSKSGSLKESHKVFEQILERDNVSWASMIAGFAEHGCADQAIK 534

Query: 537 LFREMLL-ECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSSLVTMYS 596
           LF EMLL E VPD  +L+A LTAC AL S++ G+EIHGY++R+G+ ++V +G +LVT+YS
Sbjct: 535 LFGEMLLEEIVPDQMTLTATLTACSALRSLRKGKEIHGYALRIGVGKDVVVGGALVTLYS 594

Query: 597 KCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDPFSISS 656
           KCG L LA+ VF+ LPQKD + CSSL+S YAQ   I++AL+LF  +L+A LAID F++SS
Sbjct: 595 KCGTLELAKSVFDMLPQKDQVACSSLISSYAQNGYIEKALMLFFDMLMADLAIDSFTVSS 654

Query: 657 ILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQIGKPD 716
           +LGA+ALLNR  IGTQ+HALI K+GL+ DVSVGSSLV +YS+CGSIE C KAF QI KPD
Sbjct: 655 VLGAVALLNRSDIGTQMHALITKMGLDSDVSVGSSLVTMYSKCGSIEGCRKAFDQIEKPD 714

Query: 717 LIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEAYFHLN 776
           LIGWT+MIVSYAQHGKG EAL  YELM+KEG KPD VTFVGVLSACSHNGLV+EAY HLN
Sbjct: 715 LIGWTAMIVSYAQHGKGTEALSLYELMRKEGIKPDAVTFVGVLSACSHNGLVEEAYIHLN 774

Query: 777 SMVEDYGIQPGYRHYVCMVDLLGRCG 801
           SM +D+GI+PGYRHY CMVDLLGR G
Sbjct: 775 SMAKDHGIEPGYRHYACMVDLLGRSG 800

BLAST of CsGy1G005280.1 vs. TrEMBL
Match: tr|A0A1Q3AQ16|A0A1Q3AQ16_CEPFO (PPR domain-containing protein/PPR_2 domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_01375 PE=4 SV=1)

HSP 1 Score: 930.2 bits (2403), Expect = 3.0e-267
Identity = 461/750 (61.47%), Positives = 578/750 (77.07%), Query Frame = 0

Query: 53  NDFVKLGKFSLRNTKVLHAKLLRET-LRFDIYVSNSLLHLYSKSNAMDHAIKLFDTILYP 112
           ND  K    ++++T ++H  LL++  L+ DI+V+NSLL  Y KS +MD A++LFDTI +P
Sbjct: 51  NDHKKSRHQTIKSTTIIHTHLLKKALLQSDIFVANSLLDWYCKSGSMDGALQLFDTIPHP 110

Query: 113 NVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASMFGKQVY 172
           N ISWN +I+G N+NFL  DS RTFC MH LGF+PN  T GSVLSAC A+QA  FGK VY
Sbjct: 111 NEISWNIMISGYNHNFLFEDSWRTFCRMHLLGFEPNGFTYGSVLSACTALQAPSFGKLVY 170

Query: 173 SLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGENL 232
           SLA++NGF  +GYVR   ID FAK+S F DALRVF+DV C NVVCWNA++S AV N EN 
Sbjct: 171 SLAIKNGFSLDGYVRAGTIDFFAKNSSFEDALRVFYDVSCDNVVCWNALISGAVKNRENW 230

Query: 233 MALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCGGGDVFVETALVS 292
           +ALDLF RMC   L PNSFTFSSVLTAC+ L++L++GK VQG VIKC   DVFV TA+V 
Sbjct: 231 LALDLFIRMCRLSLLPNSFTFSSVLTACATLEELQYGKGVQGWVIKCTAKDVFVGTAIVD 290

Query: 293 LYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEINSYT 352
           LYAKCGD+DEAVK F +MP RNVVSWT I++GFVQ ND +  +KF++++R +  EIN+YT
Sbjct: 291 LYAKCGDIDEAVKEFSRMPDRNVVSWTAIIAGFVQKNDCVTALKFYKEMRNMRVEINNYT 350

Query: 353 VTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIFREMD 412
           VT+++ ACANP M +EA Q+HSWILK+GF     V +ALI MYSK+ A+DLS M+FREM+
Sbjct: 351 VTSVITACANPDMIEEAKQIHSWILKSGFYMDQVVGSALINMYSKLRAIDLSEMVFREME 410

Query: 413 NHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCITFGRQIH 472
           N      W AMI SFA+N + + A  L+R ML E + PD  CTS++LS+ + +  GRQIH
Sbjct: 411 NFNIPGKWAAMISSFAQNQNSQRAIQLYRSMLEEGLRPDKYCTSSVLSVVNSLKLGRQIH 470

Query: 473 CYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCFSEHGYAK 532
           CY LKT+L   + VGSSL TMYSKCG L+++++VF+ +P +DNVSW  MIS F+EHG A 
Sbjct: 471 CYTLKTDLAIELLVGSSLFTMYSKCGCLEDSYKVFKQIPVRDNVSWASMISGFAEHGCAD 530

Query: 533 DAIQLFREMLLECV-PDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALGSSLV 592
            A+QLFREML E   PD  +L+A+LTAC AL S+Q GREIHGY++R G+ E   LG +LV
Sbjct: 531 QAVQLFREMLSEKTRPDQMTLTAILTACSALFSLQRGREIHGYALRTGIGEKQLLGGALV 590

Query: 593 TMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLAIDPF 652
            MYSKCG + LARRVF+ LP+K+ + CSSLVSGYAQ   +++A++LF+ +L++GL  D F
Sbjct: 591 NMYSKCGAVELARRVFDMLPEKNQVCCSSLVSGYAQNGLLEDAVVLFQEMLMSGLEQDSF 650

Query: 653 SISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKAFGQI 712
           + SS++GAIALLNR  IGTQ+HALI+K+GL  DV VGSSLV +YSRCGS+EDCCKAF +I
Sbjct: 651 TFSSVIGAIALLNRSGIGTQLHALIIKMGLGSDVCVGSSLVTMYSRCGSMEDCCKAFDEI 710

Query: 713 GKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLVDEAY 772
            KPDLIGW++MI SYAQHGKGAEAL AY+LM K G KPD VTFVGVLSACSHNGLV+E Y
Sbjct: 711 DKPDLIGWSAMITSYAQHGKGAEALKAYDLMVKGGIKPDSVTFVGVLSACSHNGLVEEGY 770

Query: 773 FHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
           FH N+M +DYGI+P Y HY CMVDLLGR G
Sbjct: 771 FHFNAMAKDYGIRPNYHHYACMVDLLGRSG 800

BLAST of CsGy1G005280.1 vs. TrEMBL
Match: tr|A0A2N9J493|A0A2N9J493_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS59540 PE=4 SV=1)

HSP 1 Score: 928.7 bits (2399), Expect = 8.8e-267
Identity = 452/704 (64.20%), Positives = 559/704 (79.40%), Query Frame = 0

Query: 98  MDHAIKLFDTILYPNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSA 157
           M  A+KLFDT+  PNV SWN II+G N  +L  DS + FC MH LGF+PNEVT GSVL A
Sbjct: 1   MVDALKLFDTMAQPNVTSWNIIISGYNQIYLFEDSWKIFCMMHSLGFEPNEVTYGSVLPA 60

Query: 158 CAAIQASMFGKQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCW 217
           C A+QA + GKQVYSLA++NGFF NGYV+  MIDLF K+  F DAL  F DV C NVVCW
Sbjct: 61  CTALQAPILGKQVYSLAMKNGFFSNGYVQCGMIDLFTKNCSFKDALTAFRDVSCENVVCW 120

Query: 218 NAIVSAAVTNGENLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIK 277
           N I++ AV NGEN +AL LF +MC    +PNSFTFSSVLTAC+AL++L+ GK VQG VIK
Sbjct: 121 NTIITGAVRNGENSVALGLFRQMCGGPFQPNSFTFSSVLTACAALEELDIGKGVQGWVIK 180

Query: 278 CGGGDVFVETALVSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFF 337
           CG GD+FV TA+V LYAKCG+M+EAVK F +MPIRNVVSWT ++SGFVQ +D +  +KFF
Sbjct: 181 CGAGDIFVGTAIVDLYAKCGNMEEAVKEFSRMPIRNVVSWTTVISGFVQKDDSVSALKFF 240

Query: 338 EDLRKVGEEINSYTVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKI 397
           +D+R++  EIN+YTVT++L ACA PAM +EA Q+HSWILK G+   + V AALI MYSKI
Sbjct: 241 KDMRELEVEINNYTVTSVLTACAKPAMMEEAIQIHSWILKTGYYLDAAVGAALISMYSKI 300

Query: 398 GAVDLSLMIFREMDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSAL 457
           GA+DLS + F+E+ N +NL +W AMI +FA+N +   A ++F+ ML+E M PD  CTS+L
Sbjct: 301 GALDLSELAFKEVGNIKNLGTWAAMISAFAQNQNSRGAIEMFQGMLQESMTPDKFCTSSL 360

Query: 458 LSLTDCITFGRQIHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSW 517
            S+  C+  GRQIHCY LKT L+ NV VGSSL TMYSKCG L+++++VFE + +KDNVSW
Sbjct: 361 FSVIGCLNLGRQIHCYTLKTGLVSNVLVGSSLFTMYSKCGSLEQSYKVFEQILDKDNVSW 420

Query: 518 TLMISCFSEHGYAKDAIQLFREMLL-ECVPDGTSLSAVLTACYALPSIQLGREIHGYSVR 577
             MI+ F+EHG +  AIQLFREML  E +PD T+L+A LTAC AL S+Q G+EIHGY++R
Sbjct: 421 ASMIAGFAEHGCSDQAIQLFREMLFEEIIPDQTTLTATLTACSALRSLQKGKEIHGYALR 480

Query: 578 VGLNENVALGSSLVTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLL 637
            G+ +++ +G +LVTMYSKCG+L LA RVF+ LP KD + CSSLVSGYAQ   I+++L+L
Sbjct: 481 FGVGKDIVVGGALVTMYSKCGSLELASRVFDMLPDKDPVACSSLVSGYAQNGYIEKSLML 540

Query: 638 FRSLLVAGLAIDPFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSR 697
           FR +L+A LAID F++SS+LGA+A+LNR  IGTQ+HA I K+GL+ DVSVGSSL+ +YS+
Sbjct: 541 FRDMLMADLAIDCFTVSSVLGAVAILNRSGIGTQLHAHIAKLGLDSDVSVGSSLMTMYSK 600

Query: 698 CGSIEDCCKAFGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGV 757
            GSIEDCCKAF QI KPDLIGWT+MIVSYAQHGKGAEAL AYELM+K G KPD VTFVGV
Sbjct: 601 AGSIEDCCKAFDQIEKPDLIGWTTMIVSYAQHGKGAEALSAYELMRKVGIKPDSVTFVGV 660

Query: 758 LSACSHNGLVDEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
           LSACSHNGLV+EAY HLNSM +DYGI+PGYRHY CMVDLLGR G
Sbjct: 661 LSACSHNGLVEEAYIHLNSMAKDYGIEPGYRHYACMVDLLGRSG 704

BLAST of CsGy1G005280.1 vs. TrEMBL
Match: tr|A0A251PAR2|A0A251PAR2_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G192400 PE=4 SV=1)

HSP 1 Score: 897.5 bits (2318), Expect = 2.2e-257
Identity = 461/754 (61.14%), Positives = 565/754 (74.93%), Query Frame = 0

Query: 49  LDLLNDFVKLGKFSLRNTKVL-HAKLLRETLRFDIYVSNSLLHLYSKSNAMDHAIKLFDT 108
           L   ND+ K  + + RNTK+L                  SLL  Y KS+AM  A+KLFD 
Sbjct: 47  LQFFNDYTKSRQCTTRNTKILXXXXXXXXXXXXXXXXXXSLLDSYCKSSAMVDALKLFDF 106

Query: 109 ILYPNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSACAAIQASMFG 168
           I    VISWN +I+G N N L   S   FC MH  GF+PNE T GS LSAC A+QA  FG
Sbjct: 107 IADRTVISWNMMISGYNQNSLFEKSWEIFCRMHSSGFEPNEFTYGSTLSACTALQAPTFG 166

Query: 169 KQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTN 228
           KQVYSLA++NGFF NGYV+  MIDLFAK+  F DALRVF+DV C NVV WNAI+S AV N
Sbjct: 167 KQVYSLAIKNGFFPNGYVQAGMIDLFAKNFSFDDALRVFNDVSCQNVVSWNAIISGAVRN 226

Query: 229 GENLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIKCGGGDVFVET 288
           GEN+ AL LF +MC     PNSFTFSSVLTACSAL+++  GK+VQG VIK G  DVFV T
Sbjct: 227 GENMAALYLFRQMCRGVFLPNSFTFSSVLTACSALEEVGVGKEVQGWVIKRGAEDVFVGT 286

Query: 289 ALVSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQNNDYLMVIKFFEDLRKVGEEI 348
            +V LYAKCG M+EAVK F +MP RNVVSWT I+SGFV  +D +  +K F ++RK+GE++
Sbjct: 287 TIVDLYAKCGKMNEAVKKFSRMPTRNVVSWTAIISGFVHKDDSVSALKAFREMRKMGEQM 346

Query: 349 NSYTVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKIGAVDLSLMIF 408
           N YTVT++L ACA  +M +EATQ+HS ILKAGF S + V +ALI  YSKIGAVDLS M+F
Sbjct: 347 NKYTVTSILTACAKTSMAEEATQIHSLILKAGFYSAAVVGSALINAYSKIGAVDLSEMVF 406

Query: 409 REMDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSALLSLTDCITFG 468
           REM+N ++L +W AMI SFA+N +   A +LF++ML   + PD  CTS++LS+ DC+  G
Sbjct: 407 REMENIKDLGTWAAMISSFAQNQNSGRAIELFQRMLEGSVRPDKFCTSSVLSIVDCLNLG 466

Query: 469 RQIHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENMPEKDNVSWTLMISCFSEH 528
           RQIH Y LK  L+  V VGSSL TMYSKC  L+E+++VF+ +P+KDNVSW  MIS F EH
Sbjct: 467 RQIHSYTLKIGLVSVVSVGSSLFTMYSKCDSLEESYKVFQQIPDKDNVSWASMISGFVEH 526

Query: 529 GYAKDAIQLFREMLL-ECVPDGTSLSAVLTACYALPSIQLGREIHGYSVRVGLNENVALG 588
           G A  A+QL REML  E +PD  +L+A+LTAC A  S+Q G+EIHG+++R G+ ++V LG
Sbjct: 527 GCADQALQLCREMLSEEVIPDQITLTAILTACSASRSLQTGKEIHGHALRKGVQQDV-LG 586

Query: 589 SSLVTMYSKCGNLALARRVFETLPQKDDIVCSSLVSGYAQQKCIKEALLLFRSLLVAGLA 648
            ++VTMYSKC    LAR VF+ LPQKD++ CSSLVSGYAQ   I+EALLLF  +L+A L 
Sbjct: 587 GAIVTMYSKCSAQKLARTVFDMLPQKDEVACSSLVSGYAQNGYIEEALLLFHDILMADLT 646

Query: 649 IDPFSISSILGAIALLNRPAIGTQIHALIVKVGLEKDVSVGSSLVMVYSRCGSIEDCCKA 708
           ID F+ISSI+GAIALLNR +IGTQ+HA I+KVG   DVSVGSSL+ +YS+CGSIEDCCKA
Sbjct: 647 IDSFTISSIIGAIALLNRLSIGTQLHAHIMKVGFNSDVSVGSSLLTMYSKCGSIEDCCKA 706

Query: 709 FGQIGKPDLIGWTSMIVSYAQHGKGAEALCAYELMKKEGFKPDPVTFVGVLSACSHNGLV 768
           F QI KPDLI WT+MIVSYAQHGKGAEAL AYEL++++G +PD VTFVG+LSACSHNGLV
Sbjct: 707 FVQIEKPDLISWTAMIVSYAQHGKGAEALRAYELLREQGIRPDSVTFVGLLSACSHNGLV 766

Query: 769 DEAYFHLNSMVEDYGIQPGYRHYVCMVDLLGRCG 801
           +EAYF+ NSMV DYG++PGYRHY CMVDLLGR G
Sbjct: 767 EEAYFYFNSMVNDYGLEPGYRHYACMVDLLGRSG 799
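
The percentages in each HSP header are the identical (or positive-scoring) column counts divided by the alignment length. A minimal Python sketch, assuming the header line keeps the layout shown above, parses such a line and recomputes the percentages. The names `parse_hsp_stats` and `percent` are illustrative and not part of any BLAST tool.

```python
import re

# Matches the statistics line of an HSP header as it appears on this page.
# "Posi?tives" accepts both the correct spelling and the "Postives" variant.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, _pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of alignment columns, rounded to two decimals as in the report."""
    return round(100.0 * count / length, 2)


stats = parse_hsp_stats(
    "Identity = 461/754 (61.14%), Postives = 565/754 (74.93%), Query Frame = 0"
)
print(percent(stats["identical"], stats["length"]))  # recomputed identity
print(percent(stats["positives"], stats["length"]))  # recomputed positives
```

Recomputing the values this way is a quick consistency check when copying HSP statistics into a summary table.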

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_011649738.1 | 0.0e+00 | 99.86 | PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic ...
XP_008441907.1 | 0.0e+00 | 88.55 | PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic ...
XP_023519257.1 | 0.0e+00 | 76.20 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita ...
XP_022137435.1 | 0.0e+00 | 76.13 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Momordica ...
XP_023001341.1 | 0.0e+00 | 73.40 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita ...
Match Name | E-value | Identity | Description
AT1G74600.1 | 9.6e-211 | 50.97 | pentatricopeptide (PPR) repeat-containing protein
AT4G13650.1 | 3.7e-101 | 31.19 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G09950.1 | 4.9e-98 | 31.13 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G16480.1 | 2.7e-96 | 30.32 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G39620.1 | 3.9e-95 | 28.43 | Pentatricopeptide repeat (PPR) superfamily protein
Match Name | E-value | Identity | Description
sp|Q9CA56|PP121_ARATH | 1.7e-209 | 50.97 | Pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Arabidop...
sp|Q9SVP7|PP307_ARATH | 6.6e-100 | 31.19 | Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
sp|Q9FIB2|PP373_ARATH | 8.9e-97 | 31.13 | Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th...
sp|O80647|PP195_ARATH | 7.1e-94 | 28.43 | Pentatricopeptide repeat-containing protein At2g39620 OS=Arabidopsis thaliana OX...
sp|Q9FWA6|PP207_ARATH | 8.1e-90 | 29.59 | Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidop...
Match Name | E-value | Identity | Description
tr|A0A1S3B4I2|A0A1S3B4I2_CUCME | 0.0e+00 | 88.55 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Cucumis ...
tr|A0A2I4F4H3|A0A2I4F4H3_9ROSI | 1.0e-270 | 63.14 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Juglans ...
tr|A0A1Q3AQ16|A0A1Q3AQ16_CEPFO | 3.0e-267 | 61.47 | PPR domain-containing protein/PPR_2 domain-containing protein OS=Cephalotus foll...
tr|A0A2N9J493|A0A2N9J493_FAGSY | 8.8e-267 | 64.20 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS59540 PE=4 SV=1
tr|A0A251PAR2|A0A251PAR2_PRUPE | 2.2e-257 | 61.14 | Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G192400 PE=4 SV=1
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
Term | Definition
GO:0005515 | protein binding
Vocabulary: INTERPRO
Term | Definition
IPR011990 | TPR-like_helical_dom_sf
IPR002885 | Pentatricopeptide_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0008150 | biological_process
cellular_component | GO:0005575 | cellular_component
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0003674 | molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
CsGy1G005280 | CsGy1G005280 | gene


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CsGy1G005280.1.CDS.2 | CsGy1G005280.1.CDS.2 | CDS
CsGy1G005280.1.CDS.1 | CsGy1G005280.1.CDS.1 | CDS


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CsGy1G005280.1.exon.2 | CsGy1G005280.1.exon.2 | exon
CsGy1G005280.1.exon.1 | CsGy1G005280.1.exon.1 | exon


The following polypeptide feature(s) derive from this mRNA:

Feature Name | Unique Name | Type
CsGy1G005280.1 | CsGy1G005280.1-protein | polypeptide


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 86..106, e-value: 0.37, score: 11.1
    coord: 616..644, e-value: 0.031, score: 14.4
    coord: 487..512, e-value: 0.0016, score: 18.4
    coord: 418..445, e-value: 2.1E-5, score: 24.4
    coord: 515..541, e-value: 6.8E-6, score: 25.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 515..541, e-value: 9.5E-5, score: 20.3
    coord: 418..450, e-value: 4.3E-5, score: 21.4
    coord: 616..648, e-value: 0.0017, score: 16.4
    coord: 718..749, e-value: 2.5E-6, score: 25.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 213..259, e-value: 6.0E-11, score: 42.2
    coord: 312..361, e-value: 2.2E-10, score: 40.4
    coord: 713..761, e-value: 4.4E-9, score: 36.3
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 348..382, score: 7.629
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 714..748, score: 10.896
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 482..512, score: 8.835
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 613..647, score: 8.364
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 182..212, score: 5.393
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 415..449, score: 10.819
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 582..612, score: 6.785
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 147..181, score: 5.24
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 749..784, score: 8.364
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 248..278, score: 5.415
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 683..713, score: 5.207
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 213..247, score: 8.824
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 317..347, score: 5.031
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 648..682, score: 5.13
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 383..413, score: 5.788
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 81..111, score: 6.675
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 112..146, score: 7.991
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 282..316, score: 9.197
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 513..547, score: 9.613
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 | | coord: 563..665, e-value: 4.1E-12, score: 48.3
    coord: 666..801, e-value: 2.8E-22, score: 81.6
    coord: 8..166, e-value: 2.7E-14, score: 55.4
    coord: 366..461, e-value: 6.3E-15, score: 57.5
    coord: 281..365, e-value: 1.7E-15, score: 59.4
    coord: 167..280, e-value: 2.0E-14, score: 55.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 | | coord: 474..562, e-value: 3.2E-18, score: 67.7
None | No IPR available | PANTHER | PTHR24015:SF878 | SUBFAMILY NOT NAMED | coord: 50..125
    coord: 461..541
    coord: 500..599
None | No IPR available | PANTHER | PTHR24015:SF878 | SUBFAMILY NOT NAMED | coord: 63..159
None | No IPR available | PANTHER | PTHR24015:SF878 | SUBFAMILY NOT NAMED | coord: 171..362
    coord: 601..800
    coord: 336..459
None | No IPR available | PANTHER | PTHR24015 | FAMILY NOT NAMED | coord: 171..362
    coord: 601..800
    coord: 336..459
None | No IPR available | PANTHER | PTHR24015 | FAMILY NOT NAMED | coord: 50..125
    coord: 461..541
    coord: 500..599
None | No IPR available | PANTHER | PTHR24015 | FAMILY NOT NAMED | coord: 63..159
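
The PROSITE PS51375 hits above are listed in no particular order, which makes it hard to see how the predicted PPR repeats tile the protein. The following Python sketch sorts the 19 coordinate pairs, copied from the table, and merges repeats that directly abut or overlap into contiguous blocks. The function name `merge_adjacent` is illustrative, and treating coordinates as 1-based inclusive residue positions is an assumption.

```python
def merge_adjacent(intervals):
    """Merge inclusive (start, end) intervals that overlap or directly abut."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extends the current block: keep the larger end position.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(block) for block in merged]


# PROSITE PS51375 (PPR) coordinates as listed in the InterPro table above.
PROSITE_PPR = [
    (348, 382), (714, 748), (482, 512), (613, 647), (182, 212),
    (415, 449), (582, 612), (147, 181), (749, 784), (248, 278),
    (683, 713), (213, 247), (317, 347), (648, 682), (383, 413),
    (81, 111), (112, 146), (282, 316), (513, 547),
]

blocks = merge_adjacent(PROSITE_PPR)
covered = sum(end - start + 1 for start, end in blocks)

print(blocks)   # [(81, 278), (282, 413), (415, 449), (482, 547), (582, 784)]
print(covered)  # 634 residues fall inside a predicted PPR repeat
```

Under these assumptions the 19 repeats collapse into five blocks, with the longest uninterrupted runs at residues 81..278 and 582..784.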