CsGy1G032220 (gene) Cucumber (Gy14) v2

NameCsGy1G032220
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v2)
DescriptionPentatricopeptide repeat
LocationChr1 : 31489207 .. 31490967 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAGTTTGTGGGAGTGAAGCTCAGTGGTGTTGCTTTGATTAGCTTGATTGCTGTATTTGGAAATCTCTTGGATATGAAATCGGGGAGGGCTGTTCATGGTTACATCGTGAGAAATGTTGGTGATGAGAAGATGGAAGTTTCAATGACTACTGCATTGATCGATATGTATTGCAAAGGTGGATGTTTAGCATCAGCACAGAGGCTTTTTGACAGGTTATCTAAAAGAAGTGTTGTCTCATGGACGGTGATGATAGCAGGTTGTATTCGCAGTTGCAGATTAGATGAAGGGGCAAAGAACTTTAATAGAATGCTCGAAGAAAAATTATTCCCCAATGAGATTACACTACTAAGTTTGATTACTGAATGTGGTTTCGTGGGAACCTTGGATTTGGGCAAATGGTTTCATGCGTATCTCTTAAGAAATGGGTTTGGTATGTCTTTGGCTTTGGTCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGGATATGCAAGAGCTCTTTTCAACGGTGTCAAGAAAAAAGATGTCAAGATTTGGAGTGTTTTAATATCGGCTTATGCACATGTGAGTTGCATGGATCAAGTTTTTAACCTCTTCGTCGAGATGTTGAACAATGACGTGAAACCAAACAACGTGACAATGGTTAGCCTTCTTTCTTTGTGTGCAGAGGCTGGAGCCCTTGACCTTGGCAAGTGGACTCATGCATACATAAACCGTCATGGTCTTGAAGTAGATGTCATTCTAGAAACAGCTTTAATCAACATGTATGCAAAATGTGGAGATGTAACAATTGCTCGTAGCCTGTTCAATGAAGCTATGCAACGGGACATTCGCATGTGGAACACAATGATGGCTGGATTCTCGATGCATGGTTGTGGAAAAGAAGCTTTGGAACTCTTTTCAGAGATGGAGAGCCATGGTGTTGAACCTAATGATATCACATTCGTTTCCATTTTCCATGCTTGTAGTCATTCCGGATTGGTAGTAGAAGGAAAAAAGTATTTCAACAAAATGGTTCACGACTTTGGAATTGTTCCAAAGATGGAGCACTATGGATGCTTGGTGGATCTTCTTGGTCGAGCTGGACATCTTGACGAAGCTCACAACATCATTGAAAACATGCCCATGAGGCCTAACACAATTATATGGGGTGCTCTGCTTGCTGCATGTAAGCTGCATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGATCCACAAAACTGTGGGTATAGTGTTCTTAAGTCAAACATCTATGCATCAGCAAAGCGATGGAATGATGTAACAAGCGTTAGAGAAGCAATGAGCCATTCAGGAATGAAGAAAGAACCAGGCCTCAGCTGGATTGAAGTAAGTGGTTCAGTTCACCACTTCAAATCTGGAGACAAGGCATGCACACAAACTACAAAAGTATATGAAATGGTGACCGAAATGTGCATCAAATTGAGAGAGTCGGGATACACACCGAACACAGCAGCAGTTTTGTTAAACATAGATGAGGAAGAGAAGGAATCTGCACTCAGTTACCATAGCGAGAAACTTGCTACAGCATTTGGACTCATAAGCACAGCTCCTGGTACACCCATCCGAATCGTTAAGAATTTGAGGATTTGTGATGATTGTCATGCTGCAACGAAGCTATTATCAAAAATCTATGGACGAACAATAATAGTTAGAGATAGAAACCGATTTCACCACTTCAGTGAAGGATATTGTTCTTGTATGGGTTATTGGTAA

mRNA sequence

ATGCAGTTTGTGGGAGTGAAGCTCAGTGGTGTTGCTTTGATTAGCTTGATTGCTGTATTTGGAAATCTCTTGGATATGAAATCGGGGAGGGCTGTTCATGGTTACATCGTGAGAAATGTTGGTGATGAGAAGATGGAAGTTTCAATGACTACTGCATTGATCGATATGTATTGCAAAGGTGGATGTTTAGCATCAGCACAGAGGCTTTTTGACAGGTTATCTAAAAGAAGTGTTGTCTCATGGACGGTGATGATAGCAGGTTGTATTCGCAGTTGCAGATTAGATGAAGGGGCAAAGAACTTTAATAGAATGCTCGAAGAAAAATTATTCCCCAATGAGATTACACTACTAAGTTTGATTACTGAATGTGGTTTCGTGGGAACCTTGGATTTGGGCAAATGGTTTCATGCGTATCTCTTAAGAAATGGGTTTGGTATGTCTTTGGCTTTGGTCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGGATATGCAAGAGCTCTTTTCAACGGTGTCAAGAAAAAAGATGTCAAGATTTGGAGTGTTTTAATATCGGCTTATGCACATGTGAGTTGCATGGATCAAGTTTTTAACCTCTTCGTCGAGATGTTGAACAATGACGTGAAACCAAACAACGTGACAATGGTTAGCCTTCTTTCTTTGTGTGCAGAGGCTGGAGCCCTTGACCTTGGCAAGTGGACTCATGCATACATAAACCGTCATGGTCTTGAAGTAGATGTCATTCTAGAAACAGCTTTAATCAACATGTATGCAAAATGTGGAGATGTAACAATTGCTCGTAGCCTGTTCAATGAAGCTATGCAACGGGACATTCGCATGTGGAACACAATGATGGCTGGATTCTCGATGCATGGTTGTGGAAAAGAAGCTTTGGAACTCTTTTCAGAGATGGAGAGCCATGGTGTTGAACCTAATGATATCACATTCGTTTCCATTTTCCATGCTTGTAGTCATTCCGGATTGGTAGTAGAAGGAAAAAAGTATTTCAACAAAATGGTTCACGACTTTGGAATTGTTCCAAAGATGGAGCACTATGGATGCTTGGTGGATCTTCTTGGTCGAGCTGGACATCTTGACGAAGCTCACAACATCATTGAAAACATGCCCATGAGGCCTAACACAATTATATGGGGTGCTCTGCTTGCTGCATGTAAGCTGCATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGATCCACAAAACTGTGGGTATAGTGTTCTTAAGTCAAACATCTATGCATCAGCAAAGCGATGGAATGATGTAACAAGCGTTAGAGAAGCAATGAGCCATTCAGGAATGAAGAAAGAACCAGGCCTCAGCTGGATTGAAGTAAGTGGTTCAGTTCACCACTTCAAATCTGGAGACAAGGCATGCACACAAACTACAAAAGTATATGAAATGGTGACCGAAATGTGCATCAAATTGAGAGAGTCGGGATACACACCGAACACAGCAGCAGTTTTGTTAAACATAGATGAGGAAGAGAAGGAATCTGCACTCAGTTACCATAGCGAGAAACTTGCTACAGCATTTGGACTCATAAGCACAGCTCCTGGTACACCCATCCGAATCGTTAAGAATTTGAGGATTTGTGATGATTGTCATGCTGCAACGAAGCTATTATCAAAAATCTATGGACGAACAATAATAGTTAGAGATAGAAACCGATTTCACCACTTCAGTGAAGGATATTGTTCTTGTATGGGTTATTGGTAA

Coding sequence (CDS)

ATGCAGTTTGTGGGAGTGAAGCTCAGTGGTGTTGCTTTGATTAGCTTGATTGCTGTATTTGGAAATCTCTTGGATATGAAATCGGGGAGGGCTGTTCATGGTTACATCGTGAGAAATGTTGGTGATGAGAAGATGGAAGTTTCAATGACTACTGCATTGATCGATATGTATTGCAAAGGTGGATGTTTAGCATCAGCACAGAGGCTTTTTGACAGGTTATCTAAAAGAAGTGTTGTCTCATGGACGGTGATGATAGCAGGTTGTATTCGCAGTTGCAGATTAGATGAAGGGGCAAAGAACTTTAATAGAATGCTCGAAGAAAAATTATTCCCCAATGAGATTACACTACTAAGTTTGATTACTGAATGTGGTTTCGTGGGAACCTTGGATTTGGGCAAATGGTTTCATGCGTATCTCTTAAGAAATGGGTTTGGTATGTCTTTGGCTTTGGTCACTGCTCTCATAGACATGTATGGAAAGTGTGGGCAAGTTGGATATGCAAGAGCTCTTTTCAACGGTGTCAAGAAAAAAGATGTCAAGATTTGGAGTGTTTTAATATCGGCTTATGCACATGTGAGTTGCATGGATCAAGTTTTTAACCTCTTCGTCGAGATGTTGAACAATGACGTGAAACCAAACAACGTGACAATGGTTAGCCTTCTTTCTTTGTGTGCAGAGGCTGGAGCCCTTGACCTTGGCAAGTGGACTCATGCATACATAAACCGTCATGGTCTTGAAGTAGATGTCATTCTAGAAACAGCTTTAATCAACATGTATGCAAAATGTGGAGATGTAACAATTGCTCGTAGCCTGTTCAATGAAGCTATGCAACGGGACATTCGCATGTGGAACACAATGATGGCTGGATTCTCGATGCATGGTTGTGGAAAAGAAGCTTTGGAACTCTTTTCAGAGATGGAGAGCCATGGTGTTGAACCTAATGATATCACATTCGTTTCCATTTTCCATGCTTGTAGTCATTCCGGATTGGTAGTAGAAGGAAAAAAGTATTTCAACAAAATGGTTCACGACTTTGGAATTGTTCCAAAGATGGAGCACTATGGATGCTTGGTGGATCTTCTTGGTCGAGCTGGACATCTTGACGAAGCTCACAACATCATTGAAAACATGCCCATGAGGCCTAACACAATTATATGGGGTGCTCTGCTTGCTGCATGTAAGCTGCATAAGAATCTGGCCTTGGGGGAGGTGGCTGCAAGAAAGATTCTTGAATTGGATCCACAAAACTGTGGGTATAGTGTTCTTAAGTCAAACATCTATGCATCAGCAAAGCGATGGAATGATGTAACAAGCGTTAGAGAAGCAATGAGCCATTCAGGAATGAAGAAAGAACCAGGCCTCAGCTGGATTGAAGTAAGTGGTTCAGTTCACCACTTCAAATCTGGAGACAAGGCATGCACACAAACTACAAAAGTATATGAAATGGTGACCGAAATGTGCATCAAATTGAGAGAGTCGGGATACACACCGAACACAGCAGCAGTTTTGTTAAACATAGATGAGGAAGAGAAGGAATCTGCACTCAGTTACCATAGCGAGAAACTTGCTACAGCATTTGGACTCATAAGCACAGCTCCTGGTACACCCATCCGAATCGTTAAGAATTTGAGGATTTGTGATGATTGTCATGCTGCAACGAAGCTATTATCAAAAATCTATGGACGAACAATAATAGTTAGAGATAGAAACCGATTTCACCACTTCAGTGAAGGATATTGTTCTTGTATGGGTTATTGGTAA

Protein sequence

MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGGCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKIWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYINRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW
BLAST of CsGy1G032220 vs. NCBI nr
Match: XP_011660280.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Cucumis sativus] >KGN66788.1 hypothetical protein Csa_1G690260 [Cucumis sativus])

HSP 1 Score: 1198.7 bits (3100), Expect = 0.0e+00
Identity = 586/586 (100.00%), Postives = 586/586 (100.00%), Query Frame = 0

Query: 1   MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 60
           MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG
Sbjct: 180 MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 239

Query: 61  GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 120
           GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI
Sbjct: 240 GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 299

Query: 121 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 180
           TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK
Sbjct: 300 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 359

Query: 181 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 240
           IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI
Sbjct: 360 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 419

Query: 241 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 300
           NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL
Sbjct: 420 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 479

Query: 301 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 360
           ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL
Sbjct: 480 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 539

Query: 361 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 420
           LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 540 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 599

Query: 421 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 480
           VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY
Sbjct: 600 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 659

Query: 481 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 540
           EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV
Sbjct: 660 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 719

Query: 541 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW
Sbjct: 720 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 765

BLAST of CsGy1G032220 vs. NCBI nr
Match: XP_008462708.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Cucumis melo])

HSP 1 Score: 1131.7 bits (2926), Expect = 0.0e+00
Identity = 549/586 (93.69%), Postives = 565/586 (96.42%), Query Frame = 0

Query: 1   MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 60
           MQFVGVKLSGVALISLI VFGNLLDMKSGRAVHGYI+RNVGDEKMEVS+TTALIDMYCK 
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIMRNVGDEKMEVSLTTALIDMYCKC 240

Query: 61  GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 120
            CLASAQRLFDRLSKRSVVSWTVMI GCIRSCRL EGAKNFNRMLEEKLFPNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 121 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 180
           TECGFV TLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV+KKDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 181 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 240
           IWS LISAYAHVSCMDQVFNLF+EML+N+VKPN VTMVSLLSLCAEAG LDLGKWTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 241 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 300
           NRHGLEVDVILETALINMY KCGDVTIARSLF+EA QRDI MWN MMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 301 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 360
           ELFSEMESHGVEPNDITF+SIFHACSHSGLVVEGKK+FN+MVH+FGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHNFGIVPKMEHYGCLVDL 540

Query: 361 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 420
           LGRAGHL+EAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 421 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 480
           VLKSNIYASAKRWNDVTSVRE MSH GMKKEPGLSWIEV+GSVHHFKSGDK CTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 481 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 540
           EMV EMCIKLRE+GYTPNTA VLLNIDEEEKESALSYHSEKLA AFGLISTAPGTPIRI+
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 541 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766

BLAST of CsGy1G032220 vs. NCBI nr
Match: XP_023533718.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1038.9 bits (2685), Expect = 6.7e-300
Identity = 492/586 (83.96%), Postives = 539/586 (91.98%), Query Frame = 0

Query: 1   MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 60
           M FVGV+LS VALIS+I VFG L DMKSGRA+HGY+VRNVG+E+MEV +TTALIDMYCKG
Sbjct: 179 MHFVGVRLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERMEVPLTTALIDMYCKG 238

Query: 61  GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 120
             LASA RLFD LS+R+VVSWT +IAGCIRSCR DEGAKNF+RMLEE + PNEITLLSLI
Sbjct: 239 DNLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFDEGAKNFSRMLEENIVPNEITLLSLI 298

Query: 121 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 180
           TECGFVG LDLGKW HAYLLRNGFGMSLAL TALIDMYGKCGQV YARALFNGV++KDVK
Sbjct: 299 TECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVK 358

Query: 181 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 240
           IWS LISAYAH SC+DQ F LF++ML+++VKPN VTMVSLLSLCAE GALDLG+WTHAYI
Sbjct: 359 IWSALISAYAHASCIDQAFGLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYI 418

Query: 241 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 300
           NRHGLEVDV+LETALINMYAKCGD+  ARSLF+EA +RDI MWN MMAGFS+HGCGKEAL
Sbjct: 419 NRHGLEVDVVLETALINMYAKCGDLKTARSLFDEATRRDIHMWNAMMAGFSIHGCGKEAL 478

Query: 301 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 360
           ELF +ME HGVEPNDITF+S+FHACSHSGLV EG K+F++MVH+FGIVPK+EHYGCLVDL
Sbjct: 479 ELFLDMECHGVEPNDITFISLFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDL 538

Query: 361 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 420
           LGRA  LD AH+IIENMPMRPNTI+WGALLAACKLHKNL LGEVAARKILELDP+NCGY 
Sbjct: 539 LGRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLTLGEVAARKILELDPENCGYR 598

Query: 421 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 480
           VLKSNIYAS KRW DVTSVRE MSH GMKKEPGLSWIEV+GSVHHF+SGDK CTQT KV+
Sbjct: 599 VLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVH 658

Query: 481 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 540
           EMVTEMCIKLRE+GY PNT+AVLLN+++EEKESALSYHSEKLA AFGLISTAPGTPIRI+
Sbjct: 659 EMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRII 718

Query: 541 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHF EGYCSC+GYW
Sbjct: 719 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFREGYCSCLGYW 764

BLAST of CsGy1G032220 vs. NCBI nr
Match: XP_022960858.1 (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Cucurbita moschata])

HSP 1 Score: 1034.2 bits (2673), Expect = 1.6e-298
Identity = 490/586 (83.62%), Postives = 540/586 (92.15%), Query Frame = 0

Query: 1   MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 60
           M FVGVKLS VALIS+I VFG L DMKSGRA+HGY+VRNVG+E++E+ +TTALIDMYCKG
Sbjct: 179 MHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGNERIELPLTTALIDMYCKG 238

Query: 61  GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 120
             LASA RLFD LS+R+VVSWT +IAGCIRSCR  EGAKNF+RMLEE + PNEITLLSLI
Sbjct: 239 DKLASAMRLFDGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIAPNEITLLSLI 298

Query: 121 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 180
           TECGFVG LDLGKW HAYLLRNGFGMSLAL TALIDMYGKCGQV YARALFNGV++KDVK
Sbjct: 299 TECGFVGALDLGKWLHAYLLRNGFGMSLALATALIDMYGKCGQVAYARALFNGVEEKDVK 358

Query: 181 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 240
           IWS LISAYAH SC+DQ F+LF++ML+++VKPN VTMVSLLSLCAE GALDLG+WTHAYI
Sbjct: 359 IWSALISAYAHASCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYI 418

Query: 241 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 300
           NRHG+EVDV+LETALINMYAKCGD+  AR LF+EA +RDI MWN MMAGFS+HGCGKEAL
Sbjct: 419 NRHGVEVDVVLETALINMYAKCGDLKTARCLFDEATRRDIHMWNAMMAGFSIHGCGKEAL 478

Query: 301 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 360
           ELFS+M  HGVEPNDITF+S+FHACSHSGLV EG K+F++MVH+FGIVPK+EHYGCLVDL
Sbjct: 479 ELFSDMVCHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDL 538

Query: 361 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 420
           LGRA  LD AH+IIENMPMRPNTI+WGALLAACKLHKNLALGEVAARKILELDP+NCGY 
Sbjct: 539 LGRAKRLDAAHSIIENMPMRPNTIVWGALLAACKLHKNLALGEVAARKILELDPENCGYR 598

Query: 421 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 480
           VLKSNIYAS KRW DVTSVRE MSH GMKKEPGLSWIEV+GSVHHF+SGDK CTQT KV+
Sbjct: 599 VLKSNIYASEKRWTDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFRSGDKTCTQTRKVH 658

Query: 481 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 540
           EMVTEMCIKLRE+GY PNT+AVLLN+++EEKESALSYHSEKLA AFGLISTAPGTPIRI+
Sbjct: 659 EMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRII 718

Query: 541 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 719 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 764

BLAST of CsGy1G032220 vs. NCBI nr
Match: XP_022988029.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima])

HSP 1 Score: 1016.5 bits (2627), Expect = 3.5e-293
Identity = 481/586 (82.08%), Postives = 534/586 (91.13%), Query Frame = 0

Query: 1   MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 60
           M FVGVKLS VALIS+I VFG L DMKSGRA+HGY+VRNVG E++E+ +TTALIDMYCKG
Sbjct: 179 MHFVGVKLSDVALISMIGVFGELSDMKSGRAIHGYVVRNVGKERIELPLTTALIDMYCKG 238

Query: 61  GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 120
             LASA RLF+ LS+R+VVSWT +IAGCIRSCR  EGAKNF+RMLEE + PNEITLLSLI
Sbjct: 239 DNLASAMRLFNGLSQRNVVSWTALIAGCIRSCRFVEGAKNFSRMLEENIVPNEITLLSLI 298

Query: 121 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 180
           TECGFVG LDLGKW H+YLLRNGFGMSL L TALIDMYGKCGQV YARALFN V +KDVK
Sbjct: 299 TECGFVGALDLGKWLHSYLLRNGFGMSLTLTTALIDMYGKCGQVAYARALFNVVDEKDVK 358

Query: 181 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 240
           IWS LISAYAH SC+DQ F+LF++ML+++VKPN VTMVSLLSLCAE GALDLG+WTHAYI
Sbjct: 359 IWSALISAYAHTSCIDQAFSLFLKMLDSEVKPNKVTMVSLLSLCAEVGALDLGRWTHAYI 418

Query: 241 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 300
             HG+EVD++LETALINMYAKCGD+  ARSLF+EA QRDI MWN MMAGFS+HGCGKEAL
Sbjct: 419 IHHGVEVDIVLETALINMYAKCGDLKTARSLFDEATQRDIHMWNAMMAGFSIHGCGKEAL 478

Query: 301 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 360
           ELFS+ME HGVEPNDITF+S+FHACSHSGLV EG K+F++MVH+FGIVPK+EHYGCLVDL
Sbjct: 479 ELFSDMECHGVEPNDITFISVFHACSHSGLVGEGMKHFDRMVHEFGIVPKIEHYGCLVDL 538

Query: 361 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 420
           LGRA  LD AH+IIENMPMRPNTIIWGALLAACKLHKNL LG+VAARKILELDP+NCGY 
Sbjct: 539 LGRAKRLDAAHSIIENMPMRPNTIIWGALLAACKLHKNLPLGKVAARKILELDPENCGYR 598

Query: 421 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 480
           VLKSNIYAS KRW +VTS+RE+MSH GMKKEPGLSW EV+GSVHHF+SGDK CTQ  KV+
Sbjct: 599 VLKSNIYASEKRWTNVTSIRESMSHLGMKKEPGLSWTEVNGSVHHFRSGDKTCTQARKVH 658

Query: 481 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 540
           EMVTEMCIKLRE+GY PNT+AVLLN+++EEKESALSYHSEKLA AFGLISTAPGTPIRI+
Sbjct: 659 EMVTEMCIKLREAGYAPNTSAVLLNVEDEEKESALSYHSEKLAMAFGLISTAPGTPIRII 718

Query: 541 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 719 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCLGYW 764

BLAST of CsGy1G032220 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 489.2 bits (1258), Expect = 3.6e-138
Identity = 243/593 (40.98%), Postives = 352/593 (59.36%), Query Frame = 0

Query: 27  KSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGGCLASAQRLFDRLSKR---------- 86
           K G+ +HG++++   D  +++ + T+LI MY + G L  A ++FD+   R          
Sbjct: 151 KEGQQIHGHVLKLGCD--LDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRXXXXXXXXXX 210

Query: 87  ---------------------SVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEIT 146
                                                             +  + P+E T
Sbjct: 211 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTNVRPDEST 270

Query: 147 LLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVK 206
           ++++++ C   G+++LG+  H ++  +GFG +L +V ALID+Y KCG++  A  LF  + 
Sbjct: 271 MVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLP 330

Query: 207 KKDVKIWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKW 266
            KDV  W+ LI  Y H++   +   LF EML +   PN+VTM+S+L  CA  GA+D+G+W
Sbjct: 331 YKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRW 390

Query: 267 THAYINRH--GLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMH 326
            H YI++   G+     L T+LI+MYAKCGD+  A  +FN  + + +  WN M+ GF+MH
Sbjct: 391 IHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMH 450

Query: 327 GCGKEALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEH 386
           G    + +LFS M   G++P+DITFV +  ACSHSG++  G+  F  M  D+ + PK+EH
Sbjct: 451 GRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEH 510

Query: 387 YGCLVDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELD 446
           YGC++DLLG +G   EA  +I  M M P+ +IW +LL ACK+H N+ LGE  A  +++++
Sbjct: 511 YGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIE 570

Query: 447 PQNCGYSVLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKAC 506
           P+N G  VL SNIYASA RWN+V   R  ++  GMKK PG S IE+   VH F  GDK  
Sbjct: 571 PENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFH 630

Query: 507 TQTTKVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAP 566
            +  ++Y M+ EM + L ++G+ P+T+ VL  ++EE KE AL +HSEKLA AFGLIST P
Sbjct: 631 PRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKP 690

Query: 567 GTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           GT + IVKNLR+C +CH ATKL+SKIY R II RDR RFHHF +G CSC  YW
Sbjct: 691 GTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CsGy1G032220 vs. TAIR10
Match: AT3G26782.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 488.8 bits (1257), Expect = 4.7e-138
Identity = 246/579 (42.49%), Postives = 363/579 (62.69%), Query Frame = 0

Query: 17  IAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGGCLASAQRLFDRLSKR 76
           I    +L D+ SG+  H      V   + ++ +++ALI MY   G L  A+++FD + KR
Sbjct: 83  IKACSSLFDIFSGKQTHQQAF--VFGYQSDIFVSSALIVMYSTCGKLEDARKVFDEIPKR 142

Query: 77  SVVSWTVMIAGCIRSCRLDEGAKNFNRML------EEKLFPNEITLLSLITECGFVGTLD 136
           ++VSWT MI G   +    +    F  +L      ++ +F + + L+S+I+ C  V    
Sbjct: 143 NIVSWTSMIRGYDLNGNALDAVSLFKDLLVDENDDDDAMFLDSMGLVSVISACSRVPAKG 202

Query: 137 LGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQ--VGYARALFNGVKKKDVKIWSVLISA 196
           L +  H+++++ GF   +++   L+D Y K G+  V  AR +F+ +  KD   ++ ++S 
Sbjct: 203 LTESIHSFVIKRGFDRGVSVGNTLLDAYAKGGEGGVAVARKIFDQIVDKDRVSYNSIMSV 262

Query: 197 YAHVSCMDQVFNLFVEMLNNDVKP-NNVTMVSLLSLCAEAGALDLGKWTHAYINRHGLEV 256
           YA     ++ F +F  ++ N V   N +T+ ++L   + +GAL +GK  H  + R GLE 
Sbjct: 263 YAQSGMSNEAFEVFRRLVKNKVVTFNAITLSTVLLAVSHSGALRIGKCIHDQVIRMGLED 322

Query: 257 DVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALELFSEME 316
           DVI+ T++I+MY KCG V  AR  F+    +++R W  M+AG+ MHG   +ALELF  M 
Sbjct: 323 DVIVGTSIIDMYCKCGRVETARKAFDRMKNKNVRSWTAMIAGYGMHGHAAKALELFPAMI 382

Query: 317 SHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLLGRAGHL 376
             GV PN ITFVS+  ACSH+GL VEG ++FN M   FG+ P +EHYGC+VDLLGRAG L
Sbjct: 383 DSGVRPNYITFVSVLAACSHAGLHVEGWRWFNAMKGRFGVEPGLEHYGCMVDLLGRAGFL 442

Query: 377 DEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIY 436
            +A+++I+ M M+P++IIW +LLAAC++HKN+ L E++  ++ ELD  NCGY +L S+IY
Sbjct: 443 QKAYDLIQRMKMKPDSIIWSSLLAACRIHKNVELAEISVARLFELDSSNCGYYMLLSHIY 502

Query: 437 ASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVYEMVTEMC 496
           A A RW DV  VR  M + G+ K PG S +E++G VH F  GD+   Q  K+YE + E+ 
Sbjct: 503 ADAGRWKDVERVRMIMKNRGLVKPPGFSLLELNGEVHVFLIGDEEHPQREKIYEFLAELN 562

Query: 497 IKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVKNLRICD 556
            KL E+GY  NT++V  ++DEEEKE  L  HSEKLA AFG+++T PG+ + +VKNLR+C 
Sbjct: 563 RKLLEAGYVSNTSSVCHDVDEEEKEMTLRVHSEKLAIAFGIMNTVPGSTVNVVKNLRVCS 622

Query: 557 DCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           DCH   KL+SKI  R  +VRD  RFHHF +G CSC  YW
Sbjct: 623 DCHNVIKLISKIVDREFVVRDAKRFHHFKDGGCSCGDYW 659

BLAST of CsGy1G032220 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 477.2 bits (1227), Expect = 1.4e-134
Identity = 233/581 (40.10%), Postives = 355/581 (61.10%), Query Frame = 0

Query: 6   VKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGGCLAS 65
           +K S + ++S++     L  +  G+ +HGY +R+  D  + +S  TAL+DMY K G L +
Sbjct: 232 LKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS--TALVDMYAKCGSLET 291

Query: 66  AQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLITECGF 125
           A++LFD + +R+VVSW  MI   +++    E    F +ML+E + P +++++  +  C  
Sbjct: 292 ARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACAD 351

Query: 126 VGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKIWSVL 185
           +G L+ G++ H   +  G   ++++V +LI MY KC +V  A ++F  ++ + +  W+ +
Sbjct: 352 LGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAM 411

Query: 186 ISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYINRHGL 245
           I  +A         N F +M +  VKP+  T VS+++  AE       KW H  + R  L
Sbjct: 412 ILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCL 471

Query: 246 EVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALELFSE 305
           + +V + TAL++MYAKCG + IAR +F+   +R +  WN M+ G+  HG GK ALELF E
Sbjct: 472 DKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEE 531

Query: 306 MESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLLGRAG 365
           M+   ++PN +TF+S+  ACSHSGLV  G K F  M  ++ I   M+HYG +VDLLGRAG
Sbjct: 532 MQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAG 591

Query: 366 HLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSN 425
            L+EA + I  MP++P   ++GA+L AC++HKN+   E AA ++ EL+P + GY VL +N
Sbjct: 592 RLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLAN 651

Query: 426 IYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVYEMVTE 485
           IY +A  W  V  VR +M   G++K PG S +E+   VH F SG  A   + K+Y  + +
Sbjct: 652 IYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEK 711

Query: 486 MCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVKNLRI 545
           +   ++E+GY P+T  V L ++ + KE  LS HSEKLA +FGL++T  GT I + KNLR+
Sbjct: 712 LICHIKEAGYVPDTNLV-LGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRV 771

Query: 546 CDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           C DCH ATK +S + GR I+VRD  RFHHF  G CSC  YW
Sbjct: 772 CADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of CsGy1G032220 vs. TAIR10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 457.2 bits (1175), Expect = 1.5e-128
Identity = 231/605 (38.18%), Postives = 356/605 (58.84%), Query Frame = 0

Query: 16  LIAVFGNLLDMKSGRAVHGYIVRN-VGDEKMEVSMTTALIDMYCKGGCLASAQRLFDRLS 75
           LI     +  +  G+++HG  V++ VG    +V +  +LI  Y   G L SA ++F  + 
Sbjct: 137 LIKAAAEVSSLSLGQSLHGMAVKSAVGS---DVFVANSLIHCYFSCGDLDSACKVFTTIK 196

Query: 76  KRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLITECGFVGTLDLGKW 135
           ++                                     +T++ +++ C  +  L+ G+ 
Sbjct: 197 EKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTMVGVLSACAKIRNLEFGRQ 256

Query: 136 FHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKIWSVLISAYAHVSC 195
             +Y+  N   ++L L  A++DMY KCG +  A+ LF+ +++KD   W+           
Sbjct: 257 VCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTXXXXXXXXXXX 316

Query: 196 M-------------------------------DQVFNLFVEM-LNNDVKPNNVTMVSLLS 255
                                           ++   +F E+ L  ++K N +T+VS LS
Sbjct: 317 XXXXXXXXXXXXXXXXXXXXXXXXXXXXNGKPNEALIVFHELQLQKNMKLNQITLVSTLS 376

Query: 256 LCAEAGALDLGKWTHAYINRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRM 315
            CA+ GAL+LG+W H+YI +HG+ ++  + +ALI+MY+KCGD+  +R +FN   +RD+ +
Sbjct: 377 ACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFV 436

Query: 316 WNTMMAGFSMHGCGKEALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMV 375
           W+ M+ G +MHGCG EA+++F +M+   V+PN +TF ++F ACSH+GLV E +  F++M 
Sbjct: 437 WSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQME 496

Query: 376 HDFGIVPKMEHYGCLVDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALG 435
            ++GIVP+ +HY C+VD+LGR+G+L++A   IE MP+ P+T +WGALL ACK+H NL L 
Sbjct: 497 SNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLA 556

Query: 436 EVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGS 495
           E+A  ++LEL+P+N G  VL SNIYA   +W +V+ +R+ M  +G+KKEPG S IE+ G 
Sbjct: 557 EMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGM 616

Query: 496 VHHFKSGDKACTQTTKVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEE-KESALSYHSEK 555
           +H F SGD A   + KVY  + E+  KL+ +GY P  + VL  I+EEE KE +L+ HSEK
Sbjct: 617 IHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEK 676

Query: 556 LATAFGLISTAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCS 587
           LA  +GLIST     IR++KNLR+C DCH+  KL+S++Y R IIVRDR RFHHF  G CS
Sbjct: 677 LAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCS 736

BLAST of CsGy1G032220 vs. TAIR10
Match: AT4G30700.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 454.1 bits (1167), Expect = 1.3e-127
Identity = 225/580 (38.79%), Postives = 334/580 (57.59%), Query Frame = 0

Query: 7   KLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGGCLASA 66
           +L    L+ ++     L +++ G  +H  +    G    +  + T  I +Y K G +   
Sbjct: 218 RLDTTTLLDILPAVAELQELRLGMQIHS-LATKTGCYSHDY-VLTGFISLYSKCGKIKMG 277

Query: 67  QRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLITECGFV 126
             LF    K  +V++  MI G   +   +     F  ++         TL+SL+   G  
Sbjct: 278 SALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVSGH- 337

Query: 127 GTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKIWSVLI 186
             L L    H Y L++ F    ++ TAL  +Y K  ++  AR LF+   +K +  W+ +I
Sbjct: 338 --LMLIYAIHGYCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWNAMI 397

Query: 187 SAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYINRHGLE 246
           S Y      +   +LF EM  ++  PN VT+  +LS CA+ GAL LGKW H  +     E
Sbjct: 398 SGYTQNGLTEDAISLFREMQKSEFSPNPVTITCILSACAQLGALSLGKWVHDLVRSTDFE 457

Query: 247 VDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALELFSEM 306
             + + TALI MYAKCG +  AR LF+   +++   WNTM++G+ +HG G+EAL +F EM
Sbjct: 458 SSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNIFYEM 517

Query: 307 ESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLLGRAGH 366
            + G+ P  +TF+ + +ACSH+GLV EG + FN M+H +G  P ++HY C+VD+LGRAGH
Sbjct: 518 LNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILGRAGH 577

Query: 367 LDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNI 426
           L  A   IE M + P + +W  LL AC++HK+  L    + K+ ELDP N GY VL SNI
Sbjct: 578 LQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLLSNI 637

Query: 427 YASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVYEMVTEM 486
           +++ + +    +VR+      + K PG + IE+  + H F SGD++  Q  ++YE + ++
Sbjct: 638 HSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEKLEKL 697

Query: 487 CIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVKNLRIC 546
             K+RE+GY P T   L +++EEE+E  +  HSE+LA AFGLI+T PGT IRI+KNLR+C
Sbjct: 698 EGKMREAGYQPETELALHDVEEEERELMVKVHSERLAIAFGLIATEPGTEIRIIKNLRVC 757

Query: 547 DDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
            DCH  TKL+SKI  R I+VRD NRFHHF +G CSC  YW
Sbjct: 758 LDCHTVTKLISKITERVIVVRDANRFHHFKDGVCSCGDYW 792

BLAST of CsGy1G032220 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 6.4e-137
Identity = 243/593 (40.98%), Postives = 352/593 (59.36%), Query Frame = 0

Query: 27  KSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGGCLASAQRLFDRLSKR---------- 86
           K G+ +HG++++   D  +++ + T+LI MY + G L  A ++FD+   R          
Sbjct: 151 KEGQQIHGHVLKLGCD--LDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRXXXXXXXXXX 210

Query: 87  ---------------------SVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEIT 146
                                                             +  + P+E T
Sbjct: 211 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTNVRPDEST 270

Query: 147 LLSLITECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVK 206
           ++++++ C   G+++LG+  H ++  +GFG +L +V ALID+Y KCG++  A  LF  + 
Sbjct: 271 MVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLP 330

Query: 207 KKDVKIWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKW 266
            KDV  W+ LI  Y H++   +   LF EML +   PN+VTM+S+L  CA  GA+D+G+W
Sbjct: 331 YKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRW 390

Query: 267 THAYINRH--GLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMH 326
            H YI++   G+     L T+LI+MYAKCGD+  A  +FN  + + +  WN M+ GF+MH
Sbjct: 391 IHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMH 450

Query: 327 GCGKEALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEH 386
           G    + +LFS M   G++P+DITFV +  ACSHSG++  G+  F  M  D+ + PK+EH
Sbjct: 451 GRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEH 510

Query: 387 YGCLVDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELD 446
           YGC++DLLG +G   EA  +I  M M P+ +IW +LL ACK+H N+ LGE  A  +++++
Sbjct: 511 YGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIE 570

Query: 447 PQNCGYSVLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKAC 506
           P+N G  VL SNIYASA RWN+V   R  ++  GMKK PG S IE+   VH F  GDK  
Sbjct: 571 PENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFH 630

Query: 507 TQTTKVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAP 566
            +  ++Y M+ EM + L ++G+ P+T+ VL  ++EE KE AL +HSEKLA AFGLIST P
Sbjct: 631 PRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKP 690

Query: 567 GTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           GT + IVKNLR+C +CH ATKL+SKIY R II RDR RFHHF +G CSC  YW
Sbjct: 691 GTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CsGy1G032220 vs. Swiss-Prot
Match: sp|Q9LW32|PP258_ARATH (Pentatricopeptide repeat-containing protein At3g26782, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H34 PE=2 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 8.4e-137
Identity = 246/579 (42.49%), Postives = 363/579 (62.69%), Query Frame = 0

Query: 17  IAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGGCLASAQRLFDRLSKR 76
           I    +L D+ SG+  H      V   + ++ +++ALI MY   G L  A+++FD + KR
Sbjct: 83  IKACSSLFDIFSGKQTHQQAF--VFGYQSDIFVSSALIVMYSTCGKLEDARKVFDEIPKR 142

Query: 77  SVVSWTVMIAGCIRSCRLDEGAKNFNRML------EEKLFPNEITLLSLITECGFVGTLD 136
           ++VSWT MI G   +    +    F  +L      ++ +F + + L+S+I+ C  V    
Sbjct: 143 NIVSWTSMIRGYDLNGNALDAVSLFKDLLVDENDDDDAMFLDSMGLVSVISACSRVPAKG 202

Query: 137 LGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQ--VGYARALFNGVKKKDVKIWSVLISA 196
           L +  H+++++ GF   +++   L+D Y K G+  V  AR +F+ +  KD   ++ ++S 
Sbjct: 203 LTESIHSFVIKRGFDRGVSVGNTLLDAYAKGGEGGVAVARKIFDQIVDKDRVSYNSIMSV 262

Query: 197 YAHVSCMDQVFNLFVEMLNNDVKP-NNVTMVSLLSLCAEAGALDLGKWTHAYINRHGLEV 256
           YA     ++ F +F  ++ N V   N +T+ ++L   + +GAL +GK  H  + R GLE 
Sbjct: 263 YAQSGMSNEAFEVFRRLVKNKVVTFNAITLSTVLLAVSHSGALRIGKCIHDQVIRMGLED 322

Query: 257 DVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALELFSEME 316
           DVI+ T++I+MY KCG V  AR  F+    +++R W  M+AG+ MHG   +ALELF  M 
Sbjct: 323 DVIVGTSIIDMYCKCGRVETARKAFDRMKNKNVRSWTAMIAGYGMHGHAAKALELFPAMI 382

Query: 317 SHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLLGRAGHL 376
             GV PN ITFVS+  ACSH+GL VEG ++FN M   FG+ P +EHYGC+VDLLGRAG L
Sbjct: 383 DSGVRPNYITFVSVLAACSHAGLHVEGWRWFNAMKGRFGVEPGLEHYGCMVDLLGRAGFL 442

Query: 377 DEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNIY 436
            +A+++I+ M M+P++IIW +LLAAC++HKN+ L E++  ++ ELD  NCGY +L S+IY
Sbjct: 443 QKAYDLIQRMKMKPDSIIWSSLLAACRIHKNVELAEISVARLFELDSSNCGYYMLLSHIY 502

Query: 437 ASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVYEMVTEMC 496
           A A RW DV  VR  M + G+ K PG S +E++G VH F  GD+   Q  K+YE + E+ 
Sbjct: 503 ADAGRWKDVERVRMIMKNRGLVKPPGFSLLELNGEVHVFLIGDEEHPQREKIYEFLAELN 562

Query: 497 IKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVKNLRICD 556
            KL E+GY  NT++V  ++DEEEKE  L  HSEKLA AFG+++T PG+ + +VKNLR+C 
Sbjct: 563 RKLLEAGYVSNTSSVCHDVDEEEKEMTLRVHSEKLAIAFGIMNTVPGSTVNVVKNLRVCS 622

Query: 557 DCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           DCH   KL+SKI  R  +VRD  RFHHF +G CSC  YW
Sbjct: 623 DCHNVIKLISKIVDREFVVRDAKRFHHFKDGGCSCGDYW 659

BLAST of CsGy1G032220 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 477.2 bits (1227), Expect = 2.5e-133
Identity = 233/581 (40.10%), Postives = 355/581 (61.10%), Query Frame = 0

Query: 6   VKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGGCLAS 65
           +K S + ++S++     L  +  G+ +HGY +R+  D  + +S  TAL+DMY K G L +
Sbjct: 232 LKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS--TALVDMYAKCGSLET 291

Query: 66  AQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLITECGF 125
           A++LFD + +R+VVSW  MI   +++    E    F +ML+E + P +++++  +  C  
Sbjct: 292 ARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACAD 351

Query: 126 VGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKIWSVL 185
           +G L+ G++ H   +  G   ++++V +LI MY KC +V  A ++F  ++ + +  W+ +
Sbjct: 352 LGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAM 411

Query: 186 ISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYINRHGL 245
           I  +A         N F +M +  VKP+  T VS+++  AE       KW H  + R  L
Sbjct: 412 ILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCL 471

Query: 246 EVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALELFSE 305
           + +V + TAL++MYAKCG + IAR +F+   +R +  WN M+ G+  HG GK ALELF E
Sbjct: 472 DKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEE 531

Query: 306 MESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLLGRAG 365
           M+   ++PN +TF+S+  ACSHSGLV  G K F  M  ++ I   M+HYG +VDLLGRAG
Sbjct: 532 MQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAG 591

Query: 366 HLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSN 425
            L+EA + I  MP++P   ++GA+L AC++HKN+   E AA ++ EL+P + GY VL +N
Sbjct: 592 RLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLAN 651

Query: 426 IYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVYEMVTE 485
           IY +A  W  V  VR +M   G++K PG S +E+   VH F SG  A   + K+Y  + +
Sbjct: 652 IYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEK 711

Query: 486 MCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVKNLRI 545
           +   ++E+GY P+T  V L ++ + KE  LS HSEKLA +FGL++T  GT I + KNLR+
Sbjct: 712 LICHIKEAGYVPDTNLV-LGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRV 771

Query: 546 CDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           C DCH ATK +S + GR I+VRD  RFHHF  G CSC  YW
Sbjct: 772 CADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of CsGy1G032220 vs. Swiss-Prot
Match: sp|O82380|PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 2.7e-127
Identity = 231/605 (38.18%), Postives = 356/605 (58.84%), Query Frame = 0

Query: 16  LIAVFGNLLDMKSGRAVHGYIVRN-VGDEKMEVSMTTALIDMYCKGGCLASAQRLFDRLS 75
           LI     +  +  G+++HG  V++ VG    +V +  +LI  Y   G L SA ++F  + 
Sbjct: 137 LIKAAAEVSSLSLGQSLHGMAVKSAVGS---DVFVANSLIHCYFSCGDLDSACKVFTTIK 196

Query: 76  KRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLITECGFVGTLDLGKW 135
           ++                                     +T++ +++ C  +  L+ G+ 
Sbjct: 197 EKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTMVGVLSACAKIRNLEFGRQ 256

Query: 136 FHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKIWSVLISAYAHVSC 195
             +Y+  N   ++L L  A++DMY KCG +  A+ LF+ +++KD   W+           
Sbjct: 257 VCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTXXXXXXXXXXX 316

Query: 196 M-------------------------------DQVFNLFVEM-LNNDVKPNNVTMVSLLS 255
                                           ++   +F E+ L  ++K N +T+VS LS
Sbjct: 317 XXXXXXXXXXXXXXXXXXXXXXXXXXXXNGKPNEALIVFHELQLQKNMKLNQITLVSTLS 376

Query: 256 LCAEAGALDLGKWTHAYINRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRM 315
            CA+ GAL+LG+W H+YI +HG+ ++  + +ALI+MY+KCGD+  +R +FN   +RD+ +
Sbjct: 377 ACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFV 436

Query: 316 WNTMMAGFSMHGCGKEALELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMV 375
           W+ M+ G +MHGCG EA+++F +M+   V+PN +TF ++F ACSH+GLV E +  F++M 
Sbjct: 437 WSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQME 496

Query: 376 HDFGIVPKMEHYGCLVDLLGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALG 435
            ++GIVP+ +HY C+VD+LGR+G+L++A   IE MP+ P+T +WGALL ACK+H NL L 
Sbjct: 497 SNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLA 556

Query: 436 EVAARKILELDPQNCGYSVLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGS 495
           E+A  ++LEL+P+N G  VL SNIYA   +W +V+ +R+ M  +G+KKEPG S IE+ G 
Sbjct: 557 EMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGM 616

Query: 496 VHHFKSGDKACTQTTKVYEMVTEMCIKLRESGYTPNTAAVLLNIDEEE-KESALSYHSEK 555
           +H F SGD A   + KVY  + E+  KL+ +GY P  + VL  I+EEE KE +L+ HSEK
Sbjct: 617 IHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEK 676

Query: 556 LATAFGLISTAPGTPIRIVKNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCS 587
           LA  +GLIST     IR++KNLR+C DCH+  KL+S++Y R IIVRDR RFHHF  G CS
Sbjct: 677 LAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCS 736

BLAST of CsGy1G032220 vs. Swiss-Prot
Match: sp|Q9SUH6|PP341_ARATH (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)

HSP 1 Score: 454.1 bits (1167), Expect = 2.3e-126
Identity = 225/580 (38.79%), Postives = 334/580 (57.59%), Query Frame = 0

Query: 7   KLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKGGCLASA 66
           +L    L+ ++     L +++ G  +H  +    G    +  + T  I +Y K G +   
Sbjct: 218 RLDTTTLLDILPAVAELQELRLGMQIHS-LATKTGCYSHDY-VLTGFISLYSKCGKIKMG 277

Query: 67  QRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLITECGFV 126
             LF    K  +V++  MI G   +   +     F  ++         TL+SL+   G  
Sbjct: 278 SALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVSGH- 337

Query: 127 GTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVKIWSVLI 186
             L L    H Y L++ F    ++ TAL  +Y K  ++  AR LF+   +K +  W+ +I
Sbjct: 338 --LMLIYAIHGYCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWNAMI 397

Query: 187 SAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYINRHGLE 246
           S Y      +   +LF EM  ++  PN VT+  +LS CA+ GAL LGKW H  +     E
Sbjct: 398 SGYTQNGLTEDAISLFREMQKSEFSPNPVTITCILSACAQLGALSLGKWVHDLVRSTDFE 457

Query: 247 VDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEALELFSEM 306
             + + TALI MYAKCG +  AR LF+   +++   WNTM++G+ +HG G+EAL +F EM
Sbjct: 458 SSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNIFYEM 517

Query: 307 ESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDLLGRAGH 366
            + G+ P  +TF+ + +ACSH+GLV EG + FN M+H +G  P ++HY C+VD+LGRAGH
Sbjct: 518 LNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILGRAGH 577

Query: 367 LDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYSVLKSNI 426
           L  A   IE M + P + +W  LL AC++HK+  L    + K+ ELDP N GY VL SNI
Sbjct: 578 LQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLLSNI 637

Query: 427 YASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVYEMVTEM 486
           +++ + +    +VR+      + K PG + IE+  + H F SGD++  Q  ++YE + ++
Sbjct: 638 HSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEKLEKL 697

Query: 487 CIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIVKNLRIC 546
             K+RE+GY P T   L +++EEE+E  +  HSE+LA AFGLI+T PGT IRI+KNLR+C
Sbjct: 698 EGKMREAGYQPETELALHDVEEEERELMVKVHSERLAIAFGLIATEPGTEIRIIKNLRVC 757

Query: 547 DDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
            DCH  TKL+SKI  R I+VRD NRFHHF +G CSC  YW
Sbjct: 758 LDCHTVTKLISKITERVIVVRDANRFHHFKDGVCSCGDYW 792

BLAST of CsGy1G032220 vs. TrEMBL
Match: tr|A0A0A0LYC2|A0A0A0LYC2_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G690260 PE=4 SV=1)

HSP 1 Score: 1198.7 bits (3100), Expect = 0.0e+00
Identity = 586/586 (100.00%), Postives = 586/586 (100.00%), Query Frame = 0

Query: 1   MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 60
           MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG
Sbjct: 180 MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 239

Query: 61  GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 120
           GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI
Sbjct: 240 GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 299

Query: 121 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 180
           TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK
Sbjct: 300 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 359

Query: 181 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 240
           IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI
Sbjct: 360 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 419

Query: 241 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 300
           NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL
Sbjct: 420 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 479

Query: 301 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 360
           ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL
Sbjct: 480 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 539

Query: 361 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 420
           LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 540 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 599

Query: 421 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 480
           VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY
Sbjct: 600 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 659

Query: 481 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 540
           EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV
Sbjct: 660 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 719

Query: 541 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW
Sbjct: 720 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 765

BLAST of CsGy1G032220 vs. TrEMBL
Match: tr|A0A1S3CJ58|A0A1S3CJ58_CUCME (pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103501009 PE=4 SV=1)

HSP 1 Score: 1131.7 bits (2926), Expect = 0.0e+00
Identity = 549/586 (93.69%), Postives = 565/586 (96.42%), Query Frame = 0

Query: 1   MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 60
           MQFVGVKLSGVALISLI VFGNLLDMKSGRAVHGYI+RNVGDEKMEVS+TTALIDMYCK 
Sbjct: 181 MQFVGVKLSGVALISLIGVFGNLLDMKSGRAVHGYIMRNVGDEKMEVSLTTALIDMYCKC 240

Query: 61  GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 120
            CLASAQRLFDRLSKRSVVSWTVMI GCIRSCRL EGAKNFNRMLEEKLFPNEITLLSLI
Sbjct: 241 ECLASAQRLFDRLSKRSVVSWTVMIKGCIRSCRLVEGAKNFNRMLEEKLFPNEITLLSLI 300

Query: 121 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 180
           TECGFV TLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGV+KKDVK
Sbjct: 301 TECGFVKTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVEKKDVK 360

Query: 181 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 240
           IWS LISAYAHVSCMDQVFNLF+EML+N+VKPN VTMVSLLSLCAEAG LDLGKWTHAYI
Sbjct: 361 IWSALISAYAHVSCMDQVFNLFLEMLDNEVKPNKVTMVSLLSLCAEAGTLDLGKWTHAYI 420

Query: 241 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 300
           NRHGLEVDVILETALINMY KCGDVTIARSLF+EA QRDI MWN MMAGFSMHGCGKEAL
Sbjct: 421 NRHGLEVDVILETALINMYVKCGDVTIARSLFDEATQRDIHMWNAMMAGFSMHGCGKEAL 480

Query: 301 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 360
           ELFSEMESHGVEPNDITF+SIFHACSHSGLVVEGKK+FN+MVH+FGIVPKMEHYGCLVDL
Sbjct: 481 ELFSEMESHGVEPNDITFISIFHACSHSGLVVEGKKHFNRMVHNFGIVPKMEHYGCLVDL 540

Query: 361 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 420
           LGRAGHL+EAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS
Sbjct: 541 LGRAGHLEEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 600

Query: 421 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 480
           VLKSNIYASAKRWNDVTSVRE MSH GMKKEPGLSWIEV+GSVHHFKSGDK CTQTTKVY
Sbjct: 601 VLKSNIYASAKRWNDVTSVRETMSHLGMKKEPGLSWIEVNGSVHHFKSGDKTCTQTTKVY 660

Query: 481 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 540
           EMV EMCIKLRE+GYTPNTA VLLNIDEEEKESALSYHSEKLA AFGLISTAPGTPIRI+
Sbjct: 661 EMVAEMCIKLREAGYTPNTAEVLLNIDEEEKESALSYHSEKLAMAFGLISTAPGTPIRII 720

Query: 541 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           KNLRICDDCHAA KLLSKIY RTIIVRDRNRFHHFSEGYCSC+GYW
Sbjct: 721 KNLRICDDCHAAMKLLSKIYARTIIVRDRNRFHHFSEGYCSCLGYW 766

BLAST of CsGy1G032220 vs. TrEMBL
Match: tr|A0A2P6R5R0|A0A2P6R5R0_ROSCH (Putative tetratricopeptide-like helical domain, DYW domain-containing protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr3g0450051 PE=4 SV=1)

HSP 1 Score: 823.2 bits (2125), Expect = 3.8e-235
Identity = 387/586 (66.04%), Postives = 473/586 (80.72%), Query Frame = 0

Query: 1   MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 60
           M  + VK S VA+IS++++F ++ D+K G+A+H Y+ RN  +E+M V +TTAL+DMY K 
Sbjct: 221 MHCLQVKPSEVAMISMVSLFADIADVKMGKAMHAYVARNSSNERMVVHVTTALVDMYVKC 280

Query: 61  GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 120
           G LA  +RLFD L+++SVVSWT MIAGCIR   ++EG K F RMLEE+ FPNEIT+LSL+
Sbjct: 281 GNLAYGRRLFDGLAQKSVVSWTAMIAGCIRCNEVEEGVKLFKRMLEERKFPNEITMLSLV 340

Query: 121 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 180
            E G VG L+LGKW HAY+LRNGF MSLAL TAL+DMYGKCG+  YARA+F+ ++KKDV 
Sbjct: 341 IESGSVGALELGKWLHAYVLRNGFVMSLALATALVDMYGKCGEAEYARAVFDSMEKKDVM 400

Query: 181 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 240
           IWS +ISAYA  +C +Q   LF  M ++ ++PN VTMVSL+SLCAE GALDLGKW H+YI
Sbjct: 401 IWSAMISAYARSNCTNQASELFARMKDSGIRPNQVTMVSLISLCAEVGALDLGKWVHSYI 460

Query: 241 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 300
           N+  +EVDVIL TAL++MYAKCG++  A  LF+EA  RD RMWN M+ GFSMHGCGK+AL
Sbjct: 461 NQQRIEVDVILRTALVDMYAKCGEIDAALRLFSEARYRDSRMWNAMITGFSMHGCGKQAL 520

Query: 301 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 360
           ELF EM+  GVEPNDITF+ + HACSH+GLV +GKK F KMV DFG+ PK+EHYGC+VDL
Sbjct: 521 ELFEEMQRAGVEPNDITFIGLLHACSHAGLVADGKKVFEKMVLDFGLAPKVEHYGCMVDL 580

Query: 361 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 420
           LGRAG LDEAH +I++MP+ PN I+WG+LLAACK+HK+  L EVAAR++LEL+PQNCGY+
Sbjct: 581 LGRAGKLDEAHELIKSMPVEPNPIVWGSLLAACKIHKSPNLAEVAARQLLELEPQNCGYN 640

Query: 421 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 480
           VL SNIYA++ RW DV  VR+AM   G KKEPGLS IEV+G+VH F  GDK   QT K+Y
Sbjct: 641 VLMSNIYAASNRWIDVAGVRKAMEDKGTKKEPGLSSIEVNGAVHDFMMGDKTHPQTRKIY 700

Query: 481 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 540
           EM+ EM  KL+E+GYTPNT+ VL NIDEEEKE+A++YHSEKLA AFGLISTA GTPIRIV
Sbjct: 701 EMLAEMIKKLKEAGYTPNTSVVLQNIDEEEKETAVNYHSEKLAMAFGLISTAAGTPIRIV 760

Query: 541 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           KNLR+CDDCH ATKLLSKIYGR + VRDRNRFHHF EG CSC  YW
Sbjct: 761 KNLRVCDDCHTATKLLSKIYGRVMTVRDRNRFHHFIEGSCSCGDYW 806

BLAST of CsGy1G032220 vs. TrEMBL
Match: tr|A0A251P036|A0A251P036_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G348400 PE=4 SV=1)

HSP 1 Score: 821.2 bits (2120), Expect = 1.4e-234
Identity = 385/586 (65.70%), Postives = 478/586 (81.57%), Query Frame = 0

Query: 1   MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 60
           M  + VK S +A++S++ +F ++ D + G+A+H Y+VRN  +EK+ VS++TALIDMY K 
Sbjct: 237 MHCMQVKPSEIAMVSMVNLFADVADREMGKAMHAYVVRNSTNEKLGVSISTALIDMYVKC 296

Query: 61  GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 120
           G LA A+R+FD L+++++VSWT MIAG I    L EGAK FNRML E+ +PNEIT+LSL+
Sbjct: 297 GNLAYARRVFDGLAQKNIVSWTAMIAGYIHCRNLQEGAKLFNRMLMERNYPNEITMLSLV 356

Query: 121 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 180
            E G VG L+LGKW HAY+LRNGF MSLAL TAL+DMYGKC ++ YARALF+ V  KDV 
Sbjct: 357 IESGSVGALELGKWLHAYILRNGFIMSLALATALVDMYGKCKEITYARALFDSVDNKDVM 416

Query: 181 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 240
           IWS LISAYAH +C +Q  +LF +M ++ V+P+ VTMVSL+SLCAE GALDLGKW H+YI
Sbjct: 417 IWSALISAYAHANCTNQASDLFAQMKDSGVRPSQVTMVSLISLCAEVGALDLGKWVHSYI 476

Query: 241 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 300
           N+  +EVDVIL TAL++MYAKCGD+ +A  LF+EA  RD  MWN MM GF+MHGCGK+AL
Sbjct: 477 NQQRMEVDVILRTALVDMYAKCGDMDMALRLFSEASNRDSCMWNAMMTGFAMHGCGKQAL 536

Query: 301 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 360
           ELF +M+  GVEPNDITF+ + HACSH+GLV +GK  F KMVH +G+ PK+EHYGC+VDL
Sbjct: 537 ELFEQMDRQGVEPNDITFIGVLHACSHAGLVADGKLLFEKMVHVYGLAPKVEHYGCMVDL 596

Query: 361 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 420
           LGRAG+LDEAH +I++MPM+PNTI+WGALLAACK+HKN  L EVAAR++LEL+PQNCGY+
Sbjct: 597 LGRAGNLDEAHKLIKSMPMQPNTIVWGALLAACKIHKNPNLAEVAARELLELEPQNCGYN 656

Query: 421 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 480
           +L SNIYA++ RWN+V  VR+ M   G KKEPGLS IEV+GSVH F  GDKA  QT K+Y
Sbjct: 657 ILMSNIYAASNRWNEVDGVRKYMKDRGTKKEPGLSSIEVNGSVHDFIMGDKAHPQTRKIY 716

Query: 481 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 540
           EM+ EM  KL+E+GYTPNT+ VL NIDEEEKE+A++YHSE+LA AFGLISTA GTPIRIV
Sbjct: 717 EMLAEMTKKLKEAGYTPNTSVVLQNIDEEEKETAVNYHSERLAMAFGLISTAAGTPIRIV 776

Query: 541 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           KNLR+C+DCH ATKLLSKIYGR +IVRDRNRFHHF +GYCSC  YW
Sbjct: 777 KNLRVCEDCHTATKLLSKIYGRVMIVRDRNRFHHFRDGYCSCGDYW 822

BLAST of CsGy1G032220 vs. TrEMBL
Match: tr|A0A2P5C362|A0A2P5C362_PARAD (DYW domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_187660 PE=4 SV=1)

HSP 1 Score: 814.7 bits (2103), Expect = 1.4e-232
Identity = 382/586 (65.19%), Postives = 475/586 (81.06%), Query Frame = 0

Query: 1   MQFVGVKLSGVALISLIAVFGNLLDMKSGRAVHGYIVRNVGDEKMEVSMTTALIDMYCKG 60
           M  +G+K SG+A+IS++ +F +L  +K G+A+HGY+ RN   EK+ V +TT+LIDMY K 
Sbjct: 218 MHSLGIKPSGIAMISMVNLFADLSHVKPGKAMHGYVTRNGRTEKLGVPITTSLIDMYAKC 277

Query: 61  GCLASAQRLFDRLSKRSVVSWTVMIAGCIRSCRLDEGAKNFNRMLEEKLFPNEITLLSLI 120
           G  A A+RLF+ L+++SVVSW+ MIAG IR   L++GA+ FN MLEE +FPNEIT+LSLI
Sbjct: 278 GNSAYAERLFNGLTQKSVVSWSAMIAGYIRCKELEKGARLFNEMLEESIFPNEITVLSLI 337

Query: 121 TECGFVGTLDLGKWFHAYLLRNGFGMSLALVTALIDMYGKCGQVGYARALFNGVKKKDVK 180
            ECGFVG L++GKW H+Y+LRNGF MSLAL TAL+DMYGKCG +  ARA+F+ +  KDV 
Sbjct: 338 IECGFVGALEVGKWLHSYILRNGFVMSLALATALVDMYGKCGDLRKARAVFDYMIDKDVM 397

Query: 181 IWSVLISAYAHVSCMDQVFNLFVEMLNNDVKPNNVTMVSLLSLCAEAGALDLGKWTHAYI 240
           IWS +ISAYA  +C  QV +LF  M    +KPN VTMVSL+SLCA+ GALDLGKW H YI
Sbjct: 398 IWSAMISAYAQANCSSQVCDLFARMREKGLKPNKVTMVSLISLCAKVGALDLGKWLHLYI 457

Query: 241 NRHGLEVDVILETALINMYAKCGDVTIARSLFNEAMQRDIRMWNTMMAGFSMHGCGKEAL 300
           N+ GLEVD++L+TAL++MYAKCGD+  A  LF EA  RD+ MWN MMAGF+MHGCG EAL
Sbjct: 458 NQQGLEVDLVLKTALVDMYAKCGDIDGAHRLFIEATDRDLCMWNAMMAGFAMHGCGNEAL 517

Query: 301 ELFSEMESHGVEPNDITFVSIFHACSHSGLVVEGKKYFNKMVHDFGIVPKMEHYGCLVDL 360
           +L  EME HG++PNDITF+++ HACSH+GLV EGK+ F KM  DFG+VPK+EHYGC+VDL
Sbjct: 518 KLVEEMERHGIKPNDITFIAVLHACSHAGLVTEGKRLFEKMGLDFGLVPKIEHYGCMVDL 577

Query: 361 LGRAGHLDEAHNIIENMPMRPNTIIWGALLAACKLHKNLALGEVAARKILELDPQNCGYS 420
           LGRA  LDEAH +I++MP+ PNT+IWGALLAACKLHKN +LGE+AA+++LEL+PQNCGYS
Sbjct: 578 LGRARELDEAHELIKSMPVEPNTVIWGALLAACKLHKNPSLGELAAKQLLELEPQNCGYS 637

Query: 421 VLKSNIYASAKRWNDVTSVREAMSHSGMKKEPGLSWIEVSGSVHHFKSGDKACTQTTKVY 480
           +L SNIYA++ RWNDV +VR A+  +GMKK+PGLS IEV+G VH F  GD++  Q  K+Y
Sbjct: 638 ILLSNIYAASNRWNDVAAVRTAVKDTGMKKQPGLSSIEVNGLVHDFVMGDQSHPQAGKIY 697

Query: 481 EMVTEMCIKLRESGYTPNTAAVLLNIDEEEKESALSYHSEKLATAFGLISTAPGTPIRIV 540
           EM+ EM  KL E+GY+PNT+ +L NIDEEEKE AL+YHSEKLA AFGLISTA GTP+RIV
Sbjct: 698 EMLAEMRTKLTEAGYSPNTSVILQNIDEEEKEIALNYHSEKLAMAFGLISTAAGTPVRIV 757

Query: 541 KNLRICDDCHAATKLLSKIYGRTIIVRDRNRFHHFSEGYCSCMGYW 587
           KNLR+C DCHAATKL+SK+YGR IIVRDRNRFHHF EG CSC  YW
Sbjct: 758 KNLRVCHDCHAATKLVSKLYGRVIIVRDRNRFHHFKEGSCSCGDYW 803

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011660280.10.0e+00100.00PREDICTED: pentatricopeptide repeat-containing protein At3g26782, mitochondrial-... [more]
XP_008462708.10.0e+0093.69PREDICTED: pentatricopeptide repeat-containing protein At3g26782, mitochondrial-... [more]
XP_023533718.16.7e-30083.96pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp... [more]
XP_022960858.11.6e-29883.62pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Cucur... [more]
XP_022988029.13.5e-29382.08pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT1G08070.13.6e-13840.98Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G26782.14.7e-13842.49Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G11290.11.4e-13440.10Pentatricopeptide repeat (PPR) superfamily protein[more]
AT2G29760.11.5e-12838.18Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G30700.11.3e-12738.79Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9LN01|PPR21_ARATH6.4e-13740.98Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
sp|Q9LW32|PP258_ARATH8.4e-13742.49Pentatricopeptide repeat-containing protein At3g26782, mitochondrial OS=Arabidop... [more]
sp|Q3E6Q1|PPR32_ARATH2.5e-13340.10Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
sp|O82380|PP175_ARATH2.7e-12738.18Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
sp|Q9SUH6|PP341_ARATH2.3e-12638.79Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LYC2|A0A0A0LYC2_CUCSA0.0e+00100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G690260 PE=4 SV=1[more]
tr|A0A1S3CJ58|A0A1S3CJ58_CUCME0.0e+0093.69pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like OS=Cuc... [more]
tr|A0A2P6R5R0|A0A2P6R5R0_ROSCH3.8e-23566.04Putative tetratricopeptide-like helical domain, DYW domain-containing protein OS... [more]
tr|A0A251P036|A0A251P036_PRUPE1.4e-23465.70Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G348400 PE=4 SV=1[more]
tr|A0A2P5C362|A0A2P5C362_PARAD1.4e-23265.19DYW domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_187... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0008270zinc ion binding
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR032867DYW_dom
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy1G032220.1CsGy1G032220.1mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 127..250
e-value: 2.6E-21
score: 78.3
coord: 253..497
e-value: 8.6E-41
score: 142.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 4..126
e-value: 8.8E-17
score: 63.0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 279..326
e-value: 3.6E-12
score: 46.1
coord: 177..224
e-value: 6.2E-8
score: 32.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 282..314
e-value: 1.0E-8
score: 32.8
coord: 181..213
e-value: 8.2E-5
score: 20.5
coord: 79..113
e-value: 7.5E-5
score: 20.7
coord: 316..349
e-value: 0.0021
score: 16.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 253..278
e-value: 0.014
score: 15.5
coord: 51..76
e-value: 0.0088
score: 16.1
coord: 353..378
e-value: 0.19
score: 12.0
coord: 79..107
e-value: 0.026
score: 14.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 213..247
score: 6.697
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 112..146
score: 5.568
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 248..278
score: 7.509
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 147..177
score: 6.588
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 178..212
score: 10.084
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 77..111
score: 9.953
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 279..313
score: 12.441
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 46..76
score: 7.618
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 314..349
score: 7.903
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 350..380
score: 7.267
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 452..575
e-value: 8.1E-38
score: 129.1
NoneNo IPR availablePANTHERPTHR24015:SF1086SUBFAMILY NOT NAMEDcoord: 208..502
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 108..213
coord: 208..502
NoneNo IPR availablePANTHERPTHR24015:SF1086SUBFAMILY NOT NAMEDcoord: 33..104
NoneNo IPR availablePANTHERPTHR24015:SF1086SUBFAMILY NOT NAMEDcoord: 108..213
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 33..104