CsGy1G030810 (gene) Cucumber (Gy14) v2

Name: CsGy1G030810
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: pentatricopeptide repeat-containing protein At4g04370
Location: Chr1 : 29366069 .. 29368522 (+)
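The Location field can be read as an interval on the plus strand of Chr1. The following is a minimal sketch of how the span length follows from the two coordinates, assuming they are 1-based and inclusive (a common convention for genome annotations, but not stated on this page). The function name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates copied from the Location field: Chr1 : 29366069 .. 29368522 (+)
gene_span = span_length(29366069, 29368522)
print(gene_span)  # 2454 under the 1-based inclusive assumption
```

If the coordinates were instead 0-based and half-open, the `+ 1` would be dropped, so the convention is worth confirming before comparing against the sequence length.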
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, three_prime_UTR
TTGGTGTCGGGATTGGCGAACTTGGCATACAAGGTTGCATTTCGTTGTTGGACTGTTATGAGTGGATTGATCCATGAGTCAATAGCCCATGGCTGCACCAAATCATTTAACTCTCTCGTAAGTCGTCTTTCTTATCAAGGTGCTCACCATCAAGTTTTGCAAACATATATTTCTATGCAAAAAACCCACACCCAATTAGATGCTTACACTTTTCCCAGCCTCTTCAAAGCTTGTACCAATTTGAACTTATTTTCACATGGCCTCTCACTTCATCAATCTGTCGTCGTTAATGGGCTCTCTCATGATTCCTATATTGGGTCTTCGCTTATCAGTTTTTATGCGAAATTCGGGTGCATTCATCTTGGTCGCAAGGTGTTTGATACAATGCTCAAAAGAAATGTTGTTCCTTGGACTACCATAATTGGGTCTTATTCACGGGAAGGGGACATCGATATTGCTTTCTCTATGTTCAAACAAATGCGGGAGAGTGGTATTCAGCCCACTTCTGTTACCTTGTTGAGTCTCCTTCCTGGTATTTCAAAGCTTCCCCTTCTTCTTTGTTTGCATTGTTTGATTATTTTACATGGTTTTGAGTCAGACTTAGCTTTATCGAATTCCATGGTGAATATGTATGGTAAATGTGGCAGAATTGCTGATGCAAGAAGGTTGTTTGAGTCAATTGGTTGCAGAGACATAGTTTCTTGGAATTCACTATTGTCTGCCTATTCGAAAATTGGAGCCACAGAAGAAATATTGCAGCTTCTACAAGCAATGAAGATTGAAGATATCAAACCTGACAAGCAGACTTTTTGCTCTGCTTTGTCTGCTTCTGCTATAAAGGGTGATCTTCGACTTGGTAAGTTAGTGCATGGTCTGATGCTCAAAGATGGGTTAAATATAGATCAACATGTAGAGTCGGCACTCGTAGTTTTATACTTGAGATGTAGATGTTTGGATCCCGCCTATAAAGTTTTCAAATCAACTACTGAAAAGGACGTGGTCATGTGGACAGCAATGATATCAGGACTTGTTCAGAACGATTGTGCTGACAAGGCATTGGGGGTCTTTTATCAAATGATCGAATCAAATGTCAAGCCGAGTACTGCTACCTTAGCTAGTGGTCTTGCAGCCTGTGCTCAACTTGGTTGTTGTGATATTGGTGCCTCAATTCATGGTTACGTATTAAGGCAAGGAATAATGCTAGACATCCCTGCTCAAAATTCTCTTGTCACCATGTATGCAAAGTGTAATAAGCTGCAGCAAAGTTGTTCAATTTTTAATAAGATGGTTGAAAAGGACTTAGTTTCTTGGAATGCAATTGTGGCTGGACATGCTAAAAATGGTTATTTAAGCAAAGGTATCTTTTTCTTCAATGAAATGAGAAAGAGCTTTCTAAGGCCCGACTCGATAACAGTGACCTCACTTCTTCAGGCTTGTGGTTCTGCTGGTGCACTTTGCCAGGGAAAGTGGATTCACAACTTTGTTCTTAGAAGTTCTCTTATTCCATGCATTATGACTGAAACAGCTCTAGTTGACATGTACTTCAAGTGCGGAAATTTAGAGAATGCTCAGAAGTGTTTTGATTGTATGTTACAACGAGATCTTGTAGCATGGAGCACCCTTATTGTTGGATATGGTTTTAATGGAAAAGGTGAAATCGCTTTGAGAAAATATTCAGAGTTTCTTGGCACAGGGATGGAACCAAATCATGTTATTTTCATTTCAGTTCTTTCTGCTTGTAGTCATGGTGGGCTTATTAGCAAAGGTTTGAGCATATATGAGTCAATGACTAAAGATTTTAGAATGTCACCAAATCTTGAGCATCGAGCTTGCGTCGTCGACCTTCTAAGTCGAGCTGGAAAGGTTGATGAGGCATATAGCTTCTATAAAATGATGTTTAAAGAACCCTCAATAGTTGTTTTAGGCATGCTCCTTGATGCTTGTCGGGTGAATGGTAGGGTCGAACTTGGAAAGGTTATTGCTAGAGATATGTTTGAATT
AAAGCCTGTGGATCCTGGAAACTTTGTGCAACTTGCCAATAGTTATGCATCCATGAGTAGATGGGATGGAGTGGAGAAGGCATGGACTCAAATGAGATCTCTTGGTCTGAAAAAGTATCCAGGATGGAGTTCTATTGAAGTTCATGGAACCACTTTTACATTTTTTGCATCTCACAATTCACATCCTAAGATTGAAAAAATAATCTTGACAGTGAAAGCTTTGAGCAAGAATATCAGAAATTTGTATGTTAAAAATGAGATTTGTGAGGATTTTGTTGAATATTCATGAGATGCAGTTTCTCCTCCCTTTTGTTAAAATAAAATACATAGAATTGCACGAGCTACCAGAAGAATGTGTTTGGAATGGTATTGGTTGTGTACTACTTACCATGGTTTTCTAGTGCATTTTATGTTTATCCAACAGGGAATAAAAAGTGAATCTATTTCTTTACCA

mRNA sequence

TTGGTGTCGGGATTGGCGAACTTGGCATACAAGGTTGCATTTCGTTGTTGGACTGTTATGAGTGGATTGATCCATGAGTCAATAGCCCATGGCTGCACCAAATCATTTAACTCTCTCGTAAGTCGTCTTTCTTATCAAGGTGCTCACCATCAAGTTTTGCAAACATATATTTCTATGCAAAAAACCCACACCCAATTAGATGCTTACACTTTTCCCAGCCTCTTCAAAGCTTGTACCAATTTGAACTTATTTTCACATGGCCTCTCACTTCATCAATCTGTCGTCGTTAATGGGCTCTCTCATGATTCCTATATTGGGTCTTCGCTTATCAGTTTTTATGCGAAATTCGGGTGCATTCATCTTGGTCGCAAGGTGTTTGATACAATGCTCAAAAGAAATGTTGTTCCTTGGACTACCATAATTGGGTCTTATTCACGGGAAGGGGACATCGATATTGCTTTCTCTATGTTCAAACAAATGCGGGAGAGTGGTATTCAGCCCACTTCTGTTACCTTAATTGCTGATGCAAGAAGGTTGTTTGAGTCAATTGGTTGCAGAGACATAGTTTCTTGGAATTCACTATTGTCTGCCTATTCGAAAATTGGAGCCACAGAAGAAATATTGCAGCTTCTACAAGCAATGAAGATTGAAGATATCAAACCTGACAAGCAGACTTTTTGCTCTGCTTTGTCTGCTTCTGCTATAAAGGGTGATCTTCGACTTGGTAAGTTAGTGCATGGTCTGATGCTCAAAGATGGGTTAAATATAGATCAACATGTAGAGTCGGCACTCGTAGTTTTATACTTGAGATGTAGATGTTTGGATCCCGCCTATAAAGTTTTCAAATCAACTACTGAAAAGGACGTGGTCATGTGGACAGCAATGATATCAGGACTTGTTCAGAACGATTGTGCTGACAAGGCATTGGGGGTCTTTTATCAAATGATCGAATCAAATGTCAAGCCGAGTACTGCTACCTTAGCTAGTGGTCTTGCAGCCTGTGCTCAACTTGGTTGTTGTGATATTGGTGCCTCAATTCATGGTTACGTATTAAGGCAAGGAATAATGCTAGACATCCCTGCTCAAAATTCTCTTGTCACCATGTATGCAAAGTGTAATAAGCTGCAGCAAAGTTGTTCAATTTTTAATAAGATGGTTGAAAAGGACTTAGTTTCTTGGAATGCAATTGTGGCTGGACATGCTAAAAATGGTTATTTAAGCAAAGGTATCTTTTTCTTCAATGAAATGAGAAAGAGCTTTCTAAGGCCCGACTCGATAACAGTGACCTCACTTCTTCAGGCTTGTGGTTCTGCTGGTGCACTTTGCCAGGGAAAGTGGATTCACAACTTTGTTCTTAGAAGTTCTCTTATTCCATGCATTATGACTGAAACAGCTCTAGTTGACATGTACTTCAAGTGCGGAAATTTAGAGAATGCTCAGAAGTGTTTTGATTGTATGTTACAACGAGATCTTGTAGCATGGAGCACCCTTATTGTTGGATATGGTTTTAATGGAAAAGGTGAAATCGCTTTGAGAAAATATTCAGAGTTTCTTGGCACAGGGATGGAACCAAATCATGTTATTTTCATTTCAGTTCTTTCTGCTTGTAGTCATGGTGGGCTTATTAGCAAAGGTTTGAGCATATATGAGTCAATGACTAAAGATTTTAGAATGTCACCAAATCTTGAGCATCGAGCTTGCGTCGTCGACCTTCTAAGTCGAGCTGGAAAGGTTGATGAGGCATATAGCTTCTATAAAATGATGTTTAAAGAACCCTCAATAGTTGTTTTAGGCATGCTCCTTGATGCTTGTCGGGTGAATGGTAGGGTCGAACTTGGAAAGGTTATTGCTAGAGATATGTTTGAATTAAAGCCTGTGGATCCTGGAAACTTTGTGCAACTTGCCAATAGTTATGCATCCATGAGTAGATGGGATGGAGTGGAGAAGGCATGGACTCAAATGAGATCTCTTGGTCTGAAAAAGTATCCAGGATGGAGTTC
TATTGAAGTTCATGGAACCACTTTTACATTTTTTGCATCTCACAATTCACATCCTAAGATTGAAAAAATAATCTTGACAGTGAAAGCTTTGAGCAAGAATATCAGAAATTTGTATGTTAAAAATGAGATTTGTGAGGATTTTGTTGAATATTCATGAGATGCAGTTTCTCCTCCCTTTTGTTAAAATAAAATACATAGAATTGCACGAGCTACCAGAAGAATGTGTTTGGAATGGTATTGGTTGTGTACTACTTACCATGGTTTTCTAGTGCATTTTATGTTTATCCAACAGGGAATAAAAAGTGAATCTATTTCTTTACCA

Coding sequence (CDS)

TTGGTGTCGGGATTGGCGAACTTGGCATACAAGGTTGCATTTCGTTGTTGGACTGTTATGAGTGGATTGATCCATGAGTCAATAGCCCATGGCTGCACCAAATCATTTAACTCTCTCGTAAGTCGTCTTTCTTATCAAGGTGCTCACCATCAAGTTTTGCAAACATATATTTCTATGCAAAAAACCCACACCCAATTAGATGCTTACACTTTTCCCAGCCTCTTCAAAGCTTGTACCAATTTGAACTTATTTTCACATGGCCTCTCACTTCATCAATCTGTCGTCGTTAATGGGCTCTCTCATGATTCCTATATTGGGTCTTCGCTTATCAGTTTTTATGCGAAATTCGGGTGCATTCATCTTGGTCGCAAGGTGTTTGATACAATGCTCAAAAGAAATGTTGTTCCTTGGACTACCATAATTGGGTCTTATTCACGGGAAGGGGACATCGATATTGCTTTCTCTATGTTCAAACAAATGCGGGAGAGTGGTATTCAGCCCACTTCTGTTACCTTAATTGCTGATGCAAGAAGGTTGTTTGAGTCAATTGGTTGCAGAGACATAGTTTCTTGGAATTCACTATTGTCTGCCTATTCGAAAATTGGAGCCACAGAAGAAATATTGCAGCTTCTACAAGCAATGAAGATTGAAGATATCAAACCTGACAAGCAGACTTTTTGCTCTGCTTTGTCTGCTTCTGCTATAAAGGGTGATCTTCGACTTGGTAAGTTAGTGCATGGTCTGATGCTCAAAGATGGGTTAAATATAGATCAACATGTAGAGTCGGCACTCGTAGTTTTATACTTGAGATGTAGATGTTTGGATCCCGCCTATAAAGTTTTCAAATCAACTACTGAAAAGGACGTGGTCATGTGGACAGCAATGATATCAGGACTTGTTCAGAACGATTGTGCTGACAAGGCATTGGGGGTCTTTTATCAAATGATCGAATCAAATGTCAAGCCGAGTACTGCTACCTTAGCTAGTGGTCTTGCAGCCTGTGCTCAACTTGGTTGTTGTGATATTGGTGCCTCAATTCATGGTTACGTATTAAGGCAAGGAATAATGCTAGACATCCCTGCTCAAAATTCTCTTGTCACCATGTATGCAAAGTGTAATAAGCTGCAGCAAAGTTGTTCAATTTTTAATAAGATGGTTGAAAAGGACTTAGTTTCTTGGAATGCAATTGTGGCTGGACATGCTAAAAATGGTTATTTAAGCAAAGGTATCTTTTTCTTCAATGAAATGAGAAAGAGCTTTCTAAGGCCCGACTCGATAACAGTGACCTCACTTCTTCAGGCTTGTGGTTCTGCTGGTGCACTTTGCCAGGGAAAGTGGATTCACAACTTTGTTCTTAGAAGTTCTCTTATTCCATGCATTATGACTGAAACAGCTCTAGTTGACATGTACTTCAAGTGCGGAAATTTAGAGAATGCTCAGAAGTGTTTTGATTGTATGTTACAACGAGATCTTGTAGCATGGAGCACCCTTATTGTTGGATATGGTTTTAATGGAAAAGGTGAAATCGCTTTGAGAAAATATTCAGAGTTTCTTGGCACAGGGATGGAACCAAATCATGTTATTTTCATTTCAGTTCTTTCTGCTTGTAGTCATGGTGGGCTTATTAGCAAAGGTTTGAGCATATATGAGTCAATGACTAAAGATTTTAGAATGTCACCAAATCTTGAGCATCGAGCTTGCGTCGTCGACCTTCTAAGTCGAGCTGGAAAGGTTGATGAGGCATATAGCTTCTATAAAATGATGTTTAAAGAACCCTCAATAGTTGTTTTAGGCATGCTCCTTGATGCTTGTCGGGTGAATGGTAGGGTCGAACTTGGAAAGGTTATTGCTAGAGATATGTTTGAATTAAAGCCTGTGGATCCTGGAAACTTTGTGCAACTTGCCAATAGTTATGCATCCATGAGTAGATGGGATGGAGTGGAGAAGGCATGGACTCAAATGAGATCTCTTGGTCTGAAAAAGTATCCAGGATGGAGTTC
TATTGAAGTTCATGGAACCACTTTTACATTTTTTGCATCTCACAATTCACATCCTAAGATTGAAAAAATAATCTTGACAGTGAAAGCTTTGAGCAAGAATATCAGAAATTTGTATGTTAAAAATGAGATTTGTGAGGATTTTGTTGAATATTCATGA

Protein sequence

LVSGLANLAYKVAFRCWTVMSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLIADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALSKNIRNLYVKNEICEDFVEYS
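The protein sequence above corresponds to a codon-by-codon reading of the coding sequence. The sketch below illustrates that relationship with the standard genetic code. It assumes the CDS is read in frame from its first base, and the names `CODON_TABLE` and `translate` are illustrative rather than part of any tool used to build this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 0.

    Unknown codons become 'X'. With to_stop=True, translation ends
    at the first stop codon and the stop is not included.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases and last 12 bases of the CDS listed above.
print(translate("TTGGTGTCGGGATTGGCGAACTTGGCATACAAG"))
print(translate("GAATATTCATGA"))
```

The two fragments give `LVSGLANLAYK` and `EYS`, matching the start and end of the listed protein. Note that the listed CDS begins with TTG rather than ATG, so a translator that insists on a start codon would reject it.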
BLAST of CsGy1G030810 vs. NCBI nr
Match: XP_004139152.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g04370 [Cucumis sativus] >KGN66617.1 hypothetical protein Csa_1G650050 [Cucumis sativus])

HSP 1 Score: 1322.4 bits (3421), Expect = 0.0e+00
Identity = 668/743 (89.91%), Positives = 669/743 (90.04%), Query Frame = 0
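The percentages in an HSP summary line are the match counts divided by the alignment length. A small sketch of that arithmetic, using the counts from the line above; the helper name `percent` is hypothetical.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned positions, as a percentage rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts from the first HSP against XP_004139152.1.
print(percent(668, 743))  # identical positions
print(percent(669, 743))  # identical plus conservative substitutions
```

The same calculation applies to every HSP summary line in the reports that follow.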

Query: 20  MSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 79
           MSG IHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT
Sbjct: 1   MSGFIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 60

Query: 80  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 139
           NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT
Sbjct: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120

Query: 140 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTL--------------------------- 199
           IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTL                           
Sbjct: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFES 180

Query: 200 -----------------IADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 259
                            IADARRLF+SI CRDIVS                         
Sbjct: 181 DLALSNSMVNMYGKCGRIADARRLFQSIDCRDIVSXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 260 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 319
              IKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD
Sbjct: 241 XXXIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300

Query: 320 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 379
           PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA
Sbjct: 301 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 360

Query: 380 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 439
           QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA
Sbjct: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 420

Query: 440 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 499
           IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS
Sbjct: 421 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480

Query: 500 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 559
           LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS
Sbjct: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 540

Query: 560 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 619
           EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA
Sbjct: 541 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 600

Query: 620 GKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 679
           GKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA
Sbjct: 601 GKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 660

Query: 680 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 719
           NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK
Sbjct: 661 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 720

BLAST of CsGy1G030810 vs. NCBI nr
Match: XP_016899786.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g04370 [Cucumis melo])

HSP 1 Score: 1319.7 bits (3414), Expect = 0.0e+00
Identity = 665/741 (89.74%), Positives = 679/741 (91.63%), Query Frame = 0

Query: 20  MSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 79
           MS LIHESIAHG TKSFNSLVSRLS QGAHHQVLQTYISMQKTHT  DAYTFPSLFKACT
Sbjct: 1   MSRLIHESIAHGSTKSFNSLVSRLSSQGAHHQVLQTYISMQKTHTPSDAYTFPSLFKACT 60

Query: 80  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 139
           NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT
Sbjct: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120

Query: 140 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTL--------------------------- 199
           IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTL                           
Sbjct: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIFLYGFES 180

Query: 200 -----------------IADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 259
                            IADAR LFESI  RDIVSWNSLLSAYSKIGATEEILQL+QAMK
Sbjct: 181 DLALSNSMVNMYGKCGRIADARSLFESIDYRDIVSWNSLLSAYSKIGATEEILQLVQAMK 240

Query: 260 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 319
           IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD
Sbjct: 241 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300

Query: 320 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 379
            A+KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNV+PSTATLAS LAACA
Sbjct: 301 LAHKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVQPSTATLASALAACA 360

Query: 380 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 439
           QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKD+VSWNA
Sbjct: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDVVSWNA 420

Query: 440 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 499
           IVAG+AKNGYLSK IFFFNEMR SF RPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS
Sbjct: 421 IVAGNAKNGYLSKAIFFFNEMRTSFQRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480

Query: 500 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 559
           LIPCIMTETALVDMYFKCGNLENAQKCFDCM QRDLVAWSTLIVGYGFNGKGEIALRKYS
Sbjct: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMSQRDLVAWSTLIVGYGFNGKGEIALRKYS 540

Query: 560 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 619
           EFLGTGMEPNHVIFISVLSACSH GLIS+GLSIYESMTKDFRM PNLEHRAC+VDLLSRA
Sbjct: 541 EFLGTGMEPNHVIFISVLSACSHSGLISQGLSIYESMTKDFRMPPNLEHRACIVDLLSRA 600

Query: 620 GKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 679
           GKVDEAYSFYKMMFKEPS+VVLG LLDACRVNG VELGKVIARDMFELKPVDPGNFVQLA
Sbjct: 601 GKVDEAYSFYKMMFKEPSMVVLGTLLDACRVNGSVELGKVIARDMFELKPVDPGNFVQLA 660

Query: 680 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 717
           NSYASM+RWDGVEKAWTQMRSLGLKK+PGWSSIE+HGTTFTFFA+HNSHPKIEKIILTVK
Sbjct: 661 NSYASMNRWDGVEKAWTQMRSLGLKKFPGWSSIELHGTTFTFFAAHNSHPKIEKIILTVK 720

BLAST of CsGy1G030810 vs. NCBI nr
Match: XP_022159804.1 (pentatricopeptide repeat-containing protein At4g04370 [Momordica charantia] >XP_022159805.1 pentatricopeptide repeat-containing protein At4g04370 [Momordica charantia] >XP_022159806.1 pentatricopeptide repeat-containing protein At4g04370 [Momordica charantia] >XP_022159807.1 pentatricopeptide repeat-containing protein At4g04370 [Momordica charantia])

HSP 1 Score: 1124.0 bits (2906), Expect = 0.0e+00
Identity = 559/731 (76.47%), Positives = 620/731 (84.82%), Query Frame = 0

Query: 28  IAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHG 87
           IA+G TKSFNS+++RLS QGAHHQVLQTY SMQKT T  DAYTFPSL KACT LNLF  G
Sbjct: 16  IANGSTKSFNSIINRLSSQGAHHQVLQTYASMQKTSTPPDAYTFPSLLKACTILNLFLDG 75

Query: 88  LSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSRE 147
           LSLHQS++VNG S DSYIGSSLISFYAKFGCI +GRKVFD M +RNVVPWTTIIG YSRE
Sbjct: 76  LSLHQSIIVNGFSLDSYIGSSLISFYAKFGCIDIGRKVFDIMPERNVVPWTTIIGCYSRE 135

Query: 148 GDIDIAFSMFKQMRESGIQPTSVTL----------------------------------- 207
           G+ID+AFSMFKQMR +GIQPTSVTL                                   
Sbjct: 136 GEIDVAFSMFKQMRATGIQPTSVTLLSLLPSISELPLLQCLHCWIILYGFESNLSLSNSM 195

Query: 208 ---------IADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDK 267
                    I DAR LFES+  RDIVSWNSLLSAYSKIG  EEILQL+  M+ EDIKPDK
Sbjct: 196 VNVYGRCGSIEDARSLFESMDYRDIVSWNSLLSAYSKIGVIEEILQLVLGMRTEDIKPDK 255

Query: 268 QTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAYKVFKS 327
           QTFCSALSASAIKGD+RLGKLVHGL++KDGL IDQ VE+AL+VLYLRC+ LD A KVFKS
Sbjct: 256 QTFCSALSASAIKGDIRLGKLVHGLIIKDGLGIDQQVETALMVLYLRCKSLDLALKVFKS 315

Query: 328 TTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIG 387
           TTEKD+V+WTAMISGLVQNDCADKAL VFYQM+ESN++P TATLAS LAACAQLGC DIG
Sbjct: 316 TTEKDMVLWTAMISGLVQNDCADKALRVFYQMLESNMEPGTATLASALAACAQLGCYDIG 375

Query: 388 ASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKN 447
             IHGY+LRQGIMLDIPAQN+LVTMYAKCN+L+QSC IFNKMVE+DLVSWNAIVAGHAKN
Sbjct: 376 TLIHGYILRQGIMLDIPAQNALVTMYAKCNRLEQSCGIFNKMVERDLVSWNAIVAGHAKN 435

Query: 448 GYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTE 507
           GYLSK I FFNEMR S  RPDSITVTSLLQACGSAGAL QGKWIHNFV RSSL+PCIM E
Sbjct: 436 GYLSKAILFFNEMRTSLQRPDSITVTSLLQACGSAGALWQGKWIHNFVFRSSLMPCIMIE 495

Query: 508 TALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGME 567
           TAL+DMYFKCGNLE AQKCFD M  +DLV WSTLI GYGFNG GEIALRKYSEFLGTG+E
Sbjct: 496 TALIDMYFKCGNLEIAQKCFDYMPHQDLVTWSTLISGYGFNGNGEIALRKYSEFLGTGLE 555

Query: 568 PNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYS 627
           PNHVIF+SVLSACSH GL+++GL IYESMT+DF M PNLEHRAC+VDLLSRAGKV+EAYS
Sbjct: 556 PNHVIFLSVLSACSHSGLVNQGLRIYESMTRDFLMPPNLEHRACIVDLLSRAGKVEEAYS 615

Query: 628 FYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSR 687
           FYKMMF+EPSI VLG+LLDACRVNG VELG+ IARD+F LKPVDPGN+VQLA+SYASM R
Sbjct: 616 FYKMMFQEPSIDVLGILLDACRVNGSVELGEAIARDIFALKPVDPGNYVQLAHSYASMGR 675

Query: 688 WDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALSKNIRN 715
           WDGVE+AWTQMRSLGLKK PGWSSIEVHGT+F+F++ HNSHPKIE+I+LTVK+LS +IR 
Sbjct: 676 WDGVEEAWTQMRSLGLKKLPGWSSIEVHGTSFSFYSVHNSHPKIEEIMLTVKSLSNDIRK 735

BLAST of CsGy1G030810 vs. NCBI nr
Match: XP_022931568.1 (pentatricopeptide repeat-containing protein At4g04370 [Cucurbita moschata] >XP_022931569.1 pentatricopeptide repeat-containing protein At4g04370 [Cucurbita moschata])

HSP 1 Score: 1019.2 bits (2634), Expect = 6.7e-294
Identity = 517/739 (69.96%), Positives = 581/739 (78.62%), Query Frame = 0

Query: 20  MSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 79
           M+ LIHESI HG TKSFN+L++RLS Q AHHQVLQTYISM  T+T  DAYTFPSL KACT
Sbjct: 1   MNRLIHESITHGSTKSFNALINRLSSQAAHHQVLQTYISMLNTNTPPDAYTFPSLLKACT 60

Query: 80  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 139
            LN FS+GLS+HQSV+VNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTM +RNVVPWT 
Sbjct: 61  LLNSFSNGLSIHQSVIVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMPERNVVPWTA 120

Query: 140 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLIA------------------------- 199
           IIG YSR+GD+ IAF+MFKQMRE+ I PTSVT ++                         
Sbjct: 121 IIGCYSRQGDVGIAFTMFKQMRENEIHPTSVTFLSLLPGISELPLLQGLHCLIVLYGFGS 180

Query: 200 -------------------DARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 259
                               A  LFES+  RDIVSWNSLLSAYSKI   EEILQL+ +M+
Sbjct: 181 DLALSNSMVSMYGRCGSVDYATSLFESMDSRDIVSWNSLLSAYSKIVGIEEILQLVHSMR 240

Query: 260 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 319
           IE IKPDK+TFCS LSASA K D+RLGKLVHGL+LK GL++DQ VE+ LVVLYL+C CLD
Sbjct: 241 IEGIKPDKRTFCSVLSASATKCDIRLGKLVHGLVLKHGLDMDQQVETTLVVLYLKCSCLD 300

Query: 320 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 379
            A KVF+ST E                                            LAACA
Sbjct: 301 FALKVFESTIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAACA 360

Query: 380 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 439
           QLGC  IG SIHGY+LRQGIM DIPAQNSLVTMYAKCN+L+QSC+IFNK+VEK+LVSWNA
Sbjct: 361 QLGCYHIGTSIHGYILRQGIMFDIPAQNSLVTMYAKCNRLEQSCAIFNKIVEKNLVSWNA 420

Query: 440 IVAGHAKNGYLSKGIFFFNEMR-KSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRS 499
           I+AGHAKNGYLSK IFFF+EMR  SF RPDSITVTSLLQACGSAGALCQGKWIH+F+ RS
Sbjct: 421 IIAGHAKNGYLSKAIFFFSEMRATSFQRPDSITVTSLLQACGSAGALCQGKWIHDFIFRS 480

Query: 500 SLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKY 559
           SLIPCIMTETALVDMYFKCG++ENAQKCFD MLQ+DLV WSTLI GYGFNG+GEIALRKY
Sbjct: 481 SLIPCIMTETALVDMYFKCGDVENAQKCFDYMLQKDLVTWSTLIAGYGFNGQGEIALRKY 540

Query: 560 SEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSR 619
           SEFLGTGMEPNHVIF+SVLSACSH GLI++GLSIYESMTKDF M  NLEHRAC++DLLSR
Sbjct: 541 SEFLGTGMEPNHVIFLSVLSACSHSGLINQGLSIYESMTKDFSMPTNLEHRACIIDLLSR 600

Query: 620 AGKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQL 679
           AGKV+EAYSFY  MF+EPSI VLG+LLDACRVNG VELG+VI+RDMFELKPVD GN+VQL
Sbjct: 601 AGKVEEAYSFYNRMFEEPSIDVLGILLDACRVNGNVELGEVISRDMFELKPVDAGNYVQL 660

Query: 680 ANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTV 714
           A+SYAS SRWDGVE AWTQMRSLGLKK PGWS IEV GT+FTFF+ HNSHPKIE+I+ TV
Sbjct: 661 AHSYASTSRWDGVEVAWTQMRSLGLKKLPGWSCIEVDGTSFTFFSVHNSHPKIEEIVWTV 720

BLAST of CsGy1G030810 vs. NCBI nr
Match: XP_023520788.1 (pentatricopeptide repeat-containing protein At4g04370 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023520789.1 pentatricopeptide repeat-containing protein At4g04370 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1011.1 bits (2613), Expect = 1.8e-291
Identity = 514/739 (69.55%), Positives = 576/739 (77.94%), Query Frame = 0

Query: 20  MSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 79
           M+ LIHESI HG TKSFN+L++RLS Q AHHQVLQTYISM  T+T  DAYTFPSL KACT
Sbjct: 1   MNRLIHESITHGSTKSFNALINRLSSQAAHHQVLQTYISMLNTNTPPDAYTFPSLLKACT 60

Query: 80  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 139
            LN FS+GLS+HQSV+VNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTM +RNVVPWT 
Sbjct: 61  LLNSFSNGLSIHQSVIVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMPERNVVPWTA 120

Query: 140 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLIA------------------------- 199
           IIG YSR+GD+ IAF+MFKQMRE+ I PTSVT ++                         
Sbjct: 121 IIGCYSRQGDVGIAFTMFKQMRENEIHPTSVTFLSLLPGISELPLLQGLHCLIILYGFGS 180

Query: 200 -------------------DARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 259
                               A  LFES+  RDIVSWNSLLS YSKI   EEILQL+ +M+
Sbjct: 181 DLSLSNSMVSMYGRCGSVDYATSLFESMDSRDIVSWNSLLSVYSKIVCIEEILQLVHSMR 240

Query: 260 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 319
           IE IKPDK+TFCS LS SA K D+RLGKLVHGL+LK GL++DQ VE+ LVVLYL+C CLD
Sbjct: 241 IEGIKPDKRTFCSVLSTSATKCDIRLGKLVHGLVLKHGLDMDQQVETTLVVLYLKCSCLD 300

Query: 320 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 379
            A KVF+ST E                                            LAACA
Sbjct: 301 FALKVFESTIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTLAACA 360

Query: 380 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 439
           QLGC  IG SIHGY+LR GIM DIPAQNSLVTMYAKCN+L+QSC+IFNK+VEK+LVSWNA
Sbjct: 361 QLGCYHIGTSIHGYILRHGIMFDIPAQNSLVTMYAKCNRLEQSCAIFNKIVEKNLVSWNA 420

Query: 440 IVAGHAKNGYLSKGIFFFNEMR-KSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRS 499
           I+AGHAKNGYLSK IFFF+EMR  SF RPDSITVTSLLQACGSAGALCQGKWIHN + RS
Sbjct: 421 IIAGHAKNGYLSKAIFFFSEMRATSFQRPDSITVTSLLQACGSAGALCQGKWIHNIIFRS 480

Query: 500 SLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKY 559
           SLIPCIMTETALVDMYFKCG++ENAQKCFD MLQ+DLV WSTLI GYG NGKGEIALRKY
Sbjct: 481 SLIPCIMTETALVDMYFKCGDVENAQKCFDYMLQKDLVTWSTLIAGYGINGKGEIALRKY 540

Query: 560 SEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSR 619
           SEFLGT MEPNHVIF+SVLSACSH GLI++GLSIYESMTKDF M  NLEHRAC++DLLSR
Sbjct: 541 SEFLGTRMEPNHVIFLSVLSACSHSGLINQGLSIYESMTKDFSMPTNLEHRACIIDLLSR 600

Query: 620 AGKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQL 679
           AGKV+EAYSFY  MF+EPSI VLG+LLDACRVNG VELG+VIARDMFELKPVD GN+VQL
Sbjct: 601 AGKVEEAYSFYDRMFEEPSIDVLGILLDACRVNGSVELGEVIARDMFELKPVDAGNYVQL 660

Query: 680 ANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTV 714
           A+SYAS SRWDGVE AWTQMRSLGLKK PGWS IEV GT+FTFF+ HNSHPKIE+I+ TV
Sbjct: 661 AHSYASTSRWDGVEVAWTQMRSLGLKKLPGWSCIEVDGTSFTFFSVHNSHPKIEEIVWTV 720

BLAST of CsGy1G030810 vs. TAIR10
Match: AT4G04370.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 656.8 bits (1693), Expect = 1.6e-188
Identity = 340/724 (46.96%), Positives = 461/724 (63.67%), Query Frame = 0

Query: 23  LIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLN 82
           +I  S     TK FNS ++ LS  G H QVL T+ SM       D +TFPSL KAC +L 
Sbjct: 1   MIRTSSVLNSTKYFNSHINHLSSHGDHKQVLSTFSSMLANKLLPDTFTFPSLLKACASLQ 60

Query: 83  LFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIG 142
             S GLS+HQ V+VNG S D YI SSL++ YAKFG +   RKVF+ M +R+VV WT +IG
Sbjct: 61  RLSFGLSIHQQVLVNGFSSDFYISSSLVNLYAKFGLLAHARKVFEEMRERDVVHWTAMIG 120

Query: 143 SYSREGDIDIAFSMFKQMRESGIQPTSVTL------------------------------ 202
            YSR G +  A S+  +MR  GI+P  VTL                              
Sbjct: 121 CYSRAGIVGEACSLVNEMRFQGIKPGPVTLLEMLSGVLEITQLQCLHDFAVIYGFDCDIA 180

Query: 203 --------------IADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIED 262
                         + DA+ LF+ +  RD+VSWN+++S Y+ +G   EIL+LL  M+ + 
Sbjct: 181 VMNSMLNLYCKCDHVGDAKDLFDQMEQRDMVSWNTMISGYASVGNMSEILKLLYRMRGDG 240

Query: 263 IKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAY 322
           ++PD+QTF ++LS S    DL +G+++H  ++K G ++D H+++AL+ +YL+C   + +Y
Sbjct: 241 LRPDQQTFGASLSVSGTMCDLEMGRMLHCQIVKTGFDVDMHLKTALITMYLKCGKEEASY 300

Query: 323 KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLG 382
           +V ++   KDVV WT MISGL++   A+KAL VF +M++S    S+  +AS +A+CAQLG
Sbjct: 301 RVLETIPNKDVVCWTVMISGLMRLGRAEKALIVFSEMLQSGSDLSSEAIASVVASCAQLG 360

Query: 383 CCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVA 442
             D+GAS+HGYVLR G  LD PA NSL+TMYAKC  L +S  IF +M E+DLV       
Sbjct: 361 SFDLGASVHGYVLRHGYTLDTPALNSLITMYAKCGHLDKSLVIFERMNERDLVXXXXXXX 420

Query: 443 GHAKNGYLSKGIFFFNEMR-KSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLI 502
                               K+  + DS TV SLLQAC SAGAL  GK IH  V+RS + 
Sbjct: 421 XXXXXXXXXXXXXXXXXXXFKTVQQVDSFTVVSLLQACSSAGALPVGKLIHCIVIRSFIR 480

Query: 503 PCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEF 562
           PC + +TALVDMY KCG LE AQ+CFD +  +D+V+W  LI GYGF+GKG+IAL  YSEF
Sbjct: 481 PCSLVDTALVDMYSKCGYLEAAQRCFDSISWKDVVSWGILIAGYGFHGKGDIALEIYSEF 540

Query: 563 LGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGK 622
           L +GMEPNHVIF++VLS+CSH G++ +GL I+ SM +DF + PN EH ACVVDLL RA +
Sbjct: 541 LHSGMEPNHVIFLAVLSSCSHNGMVQQGLKIFSSMVRDFGVEPNHEHLACVVDLLCRAKR 600

Query: 623 VDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANS 682
           +++A+ FYK  F  PSI VLG++LDACR NG+ E+  +I  DM ELKP D G++V+L +S
Sbjct: 601 IEDAFKFYKENFTRPSIDVLGIILDACRANGKTEVEDIICEDMIELKPGDAGHYVKLGHS 660

Query: 683 YASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKAL 702
           +A+M RWD V ++W QMRSLGLKK PGWS IE++G T TFF +H SH   +  +  +K L
Sbjct: 661 FAAMKRWDDVSESWNQMRSLGLKKLPGWSKIEMNGKTTTFFMNHTSHS--DDTVSLLKLL 720

BLAST of CsGy1G030810 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 393.3 bits (1009), Expect = 3.3e-109
Identity = 214/692 (30.92%), Positives = 368/692 (53.18%), Query Frame = 0

Query: 60  QKTHTQLDAYTFPS--LFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFG 119
           ++ +   + Y  P+  L + C++L      L L   V  NGL  + +  + L+S + ++G
Sbjct: 27  ERNYIPANVYEHPAALLLERCSSLKELRQILPL---VFKNGLYQEHFFQTKLVSLFCRYG 86

Query: 120 CIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPT--------- 179
            +    +VF+ +  +  V + T++  +++  D+D A   F +MR   ++P          
Sbjct: 87  SVDEAARVFEPIDSKLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLK 146

Query: 180 -----------------------SVTLIA---------------DARRLFESIGCRDIVS 239
                                  S+ L A               +AR++F+ +  RD+VS
Sbjct: 147 VCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVS 206

Query: 240 WNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLML 299
           WN++++ YS+ G     L+++++M  E++KP   T  S L A +    + +GK +HG  +
Sbjct: 207 WNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAM 266

Query: 300 KDGLNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALG 359
           + G +   ++ +ALV +Y +C  L+ A ++F    E++VV W +MI   VQN+   +A+ 
Sbjct: 267 RSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAML 326

Query: 360 VFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYA 419
           +F +M++  VKP+  ++   L ACA LG  + G  IH   +  G+  ++   NSL++MY 
Sbjct: 327 IFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYC 386

Query: 420 KCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTS 479
           KC ++  + S+F K+  + LVSWNA++ G A+NG     + +F++MR   ++PD+ T  S
Sbjct: 387 KCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVS 446

Query: 480 LLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRD 539
           ++ A          KWIH  V+RS L   +   TALVDMY KCG +  A+  FD M +R 
Sbjct: 447 VITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERH 506

Query: 540 LVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYE 599
           +  W+ +I GYG +G G+ AL  + E     ++PN V F+SV+SACSH GL+  GL  + 
Sbjct: 507 VTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFY 566

Query: 600 SMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRV 659
            M +++ +  +++H   +VDLL RAG+++EA+ F   M  +P++ V G +L AC+++  V
Sbjct: 567 MMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNV 626

Query: 660 ELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEV 703
              +  A  +FEL P D G  V LAN Y + S W+ V +    M   GL+K PG S +E+
Sbjct: 627 NFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEI 686

BLAST of CsGy1G030810 vs. TAIR10
Match: AT4G33990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 387.1 bits (993), Expect = 2.3e-107
Identity = 223/679 (32.84%), Positives = 353/679 (51.99%), Query Frame = 0

Query: 73  SLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKR 132
           +LF+ CTNL        LH  +VV+    +  I + L++ Y   G + L R  FD +  R
Sbjct: 59  TLFRYCTNL---QSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTFDHIQNR 118

Query: 133 NVVPWTTIIGSYSREG---DIDIAFSMFKQMRESGIQPTSVTL----------------- 192
           +V  W  +I  Y R G   ++   FS+F  M  SG+ P   T                  
Sbjct: 119 DVYAWNLMISGYGRAGNSSEVIRCFSLF--MLSSGLTPDYRTFPSVLKACRTVIDGNKIH 178

Query: 193 ---------------------------IADARRLFESIGCRDIVSWNSLLSAYSKIGATE 252
                                      + +AR LF+ +  RD+ SWN+++S Y + G  +
Sbjct: 179 CLALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAK 238

Query: 253 EILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALV 312
           E L L   ++      D  T  S LSA    GD   G  +H   +K GL  +  V + L+
Sbjct: 239 EALTLSNGLR----AMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLI 298

Query: 313 VLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTA 372
            LY     L    KVF     +D++ W ++I     N+   +A+ +F +M  S ++P   
Sbjct: 299 DLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRIQPDCL 358

Query: 373 TLASGLAACAQLGCCDIGASIHGYVLRQGIML-DIPAQNSLVTMYAKCNKLQQSCSIFNK 432
           TL S  +  +QLG      S+ G+ LR+G  L DI   N++V MYAK   +  + ++FN 
Sbjct: 359 TLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSARAVFNW 418

Query: 433 MVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEM-RKSFLRPDSITVTSLLQACGSAGALCQ 492
           +   D++SWN I++G+A+NG+ S+ I  +N M  +  +  +  T  S+L AC  AGAL Q
Sbjct: 419 LPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQ 478

Query: 493 GKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGF 552
           G  +H  +L++ L   +   T+L DMY KCG LE+A   F  + + + V W+TLI  +GF
Sbjct: 479 GMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIACHGF 538

Query: 553 NGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLE 612
           +G GE A+  + E L  G++P+H+ F+++LSACSH GL+ +G   +E M  D+ ++P+L+
Sbjct: 539 HGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLK 598

Query: 613 HRACVVDLLSRAGKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFEL 672
           H  C+VD+  RAG+++ A  F K M  +P   + G LL ACRV+G V+LGK+ +  +FE+
Sbjct: 599 HYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEV 658

Query: 673 KPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNS 703
           +P   G  V L+N YAS  +W+GV++  +     GL+K PGWSS+EV      F+  + +
Sbjct: 659 EPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQT 718

BLAST of CsGy1G030810 vs. TAIR10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 380.9 bits (977), Expect = 1.7e-105
Identity = 226/726 (31.13%), Positives = 383/726 (52.75%), Query Frame = 0

Query: 51  QVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSY-IGSSL 110
           + + TY+ M     + D Y FP+L KA  +L     G  +H  V   G   DS  + ++L
Sbjct: 80  EAVLTYVDMIVLGIKPDNYAFPALLKAVADLQDMELGKQIHAHVYKFGYGVDSVTVANTL 139

Query: 111 ISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTS 170
           ++ Y K G      KVFD + +RN V W ++I S       ++A   F+ M +  ++P+S
Sbjct: 140 VNLYRKCGDFGAVYKVFDRISERNQVSWNSLISSLCSFEKWEMALEAFRCMLDENVEPSS 199

Query: 171 VTLI-------------------------------------------------ADARRLF 230
            TL+                                                 A ++ L 
Sbjct: 200 FTLVSVVTACSNLPMPEGLMMGKQVHAYGLRKGELNSFIINTLVAMYGKLGKLASSKVLL 259

Query: 231 ESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLR 290
            S G RD+V+WN++LS+  +     E L+ L+ M +E ++PD+ T  S L A +    LR
Sbjct: 260 GSFGGRDLVTWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEFTISSVLPACSHLEMLR 319

Query: 291 LGKLVHGLMLKDG-LNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGL 350
            GK +H   LK+G L+ +  V SALV +Y  C+ +    +VF    ++ + +W AMI+G 
Sbjct: 320 TGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDRKIGLWNAMIAGY 379

Query: 351 VQNDCADKALGVFYQMIES-NVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLD 410
            QN+   +AL +F  M ES  +  ++ T+A  + AC + G      +IHG+V+++G+  D
Sbjct: 380 SQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVVKRGLDRD 439

Query: 411 IPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMR- 470
              QN+L+ MY++  K+  +  IF KM ++DLV+WN ++ G+  + +    +   ++M+ 
Sbjct: 440 RFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKMQN 499

Query: 471 ----------KSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALV 530
                     +  L+P+SIT+ ++L +C +  AL +GK IH + ++++L   +   +ALV
Sbjct: 500 LERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSALV 559

Query: 531 DMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHV 590
           DMY KCG L+ ++K FD + Q++++ W+ +I+ YG +G G+ A+      +  G++PN V
Sbjct: 560 DMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEV 619

Query: 591 IFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKM 650
            FISV +ACSH G++ +GL I+  M  D+ + P+ +H ACVVDLL RAG++ EAY    M
Sbjct: 620 TFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNM 679

Query: 651 MFKE-PSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDG 710
           M ++         LL A R++  +E+G++ A+++ +L+P    ++V LAN Y+S   WD 
Sbjct: 680 MPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDK 739

Query: 711 VEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALSKNIRNL-Y 712
             +    M+  G++K PG S IE       F A  +SHP+ EK+   ++ L + +R   Y
Sbjct: 740 ATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGY 799

BLAST of CsGy1G030810 vs. TAIR10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 375.9 bits (964), Expect = 5.4e-104
Identity = 218/717 (30.40%), Positives = 362/717 (50.49%), Query Frame = 0

Query: 22  GLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNL 81
           GL   S +   T   NS +  L   G   + ++   SMQ+    +D   F +L + C   
Sbjct: 48  GLSVLSSSSSSTHFSNSQLHGLCANGKLEEAMKLLNSMQELRVAVDEDVFVALVRLCEWK 107

Query: 82  NLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTII 141
                G  ++   + +  S    +G++ ++ + +FG +     VF  M +RN+  W  ++
Sbjct: 108 RAQEEGSKVYSIALSSMSSLGVELGNAFLAMFVRFGNLVDAWYVFGKMSERNLFSWNVLV 167

Query: 142 GSYSREGDIDIAFSMFKQMR-ESGIQPTSVTL---------------------------- 201
           G Y+++G  D A  ++ +M    G++P   T                             
Sbjct: 168 GGYAKQGYFDEAMCLYHRMLWVGGVKPDVYTFPCVLRTCGGIPDLARGKEVHVHVVRYGY 227

Query: 202 -------------------IADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQA 261
                              +  AR LF+ +  RDI+SWN+++S Y + G   E L+L  A
Sbjct: 228 ELDIDVVNALITMYVKCGDVKSARLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFA 287

Query: 262 MKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRC 321
           M+   + PD  T  S +SA  + GD RLG+ +H  ++  G  +D  V ++L  +YL    
Sbjct: 288 MRGLSVDPDLMTLTSVISACELLGDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGS 347

Query: 322 LDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAA 381
              A K+F     KD+V WT MISG   N   DKA+  +  M + +VKP   T+A+ L+A
Sbjct: 348 WREAEKLFSRMERKDIVSWTTMISGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSA 407

Query: 382 CAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSW 441
           CA LG  D G  +H   ++  ++  +   N+L+ MY+KC  + ++  IF+ +  K+++SW
Sbjct: 408 CATLGDLDTGVELHKLAIKARLISYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISW 467

Query: 442 NAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLR 501
            +I+AG   N    + + F  +M K  L+P++IT+T+ L AC   GAL  GK IH  VLR
Sbjct: 468 TSIIAGLRLNNRCFEALIFLRQM-KMTLQPNAITLTAALAACARIGALMCGKEIHAHVLR 527

Query: 502 SSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRK 561
           + +        AL+DMY +CG +  A   F+   ++D+ +W+ L+ GY   G+G + +  
Sbjct: 528 TGVGLDDFLPNALLDMYVRCGRMNTAWSQFNSQ-KKDVTSWNILLTGYSERGQGSMVVEL 587

Query: 562 YSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLS 621
           +   + + + P+ + FIS+L  CS   ++ +GL +Y S  +D+ ++PNL+H ACVVDLL 
Sbjct: 588 FDRMVKSRVRPDEITFISLLCGCSKSQMVRQGL-MYFSKMEDYGVTPNLKHYACVVDLLG 647

Query: 622 RAGKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQ 681
           RAG++ EA+ F + M   P   V G LL+ACR++ +++LG++ A+ +FEL     G ++ 
Sbjct: 648 RAGELQEAHKFIQKMPVTPDPAVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYIL 707

Query: 682 LANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKI 691
           L N YA   +W  V K    M+  GL    G S +EV G    F +    HP+ ++I
Sbjct: 708 LCNLYADCGKWREVAKVRRMMKENGLTVDAGCSWVEVKGKVHAFLSDDKYHPQTKEI 761

BLAST of CsGy1G030810 vs. Swiss-Prot
Match: sp|Q9XE98|PP303_ARATH (Pentatricopeptide repeat-containing protein At4g04370 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E99 PE=3 SV=1)

HSP 1 Score: 656.8 bits (1693), Expect = 2.9e-187
Identity = 340/724 (46.96%), Positives = 461/724 (63.67%), Query Frame = 0

Query: 23  LIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLN 82
           +I  S     TK FNS ++ LS  G H QVL T+ SM       D +TFPSL KAC +L 
Sbjct: 1   MIRTSSVLNSTKYFNSHINHLSSHGDHKQVLSTFSSMLANKLLPDTFTFPSLLKACASLQ 60

Query: 83  LFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIG 142
             S GLS+HQ V+VNG S D YI SSL++ YAKFG +   RKVF+ M +R+VV WT +IG
Sbjct: 61  RLSFGLSIHQQVLVNGFSSDFYISSSLVNLYAKFGLLAHARKVFEEMRERDVVHWTAMIG 120

Query: 143 SYSREGDIDIAFSMFKQMRESGIQPTSVTL------------------------------ 202
            YSR G +  A S+  +MR  GI+P  VTL                              
Sbjct: 121 CYSRAGIVGEACSLVNEMRFQGIKPGPVTLLEMLSGVLEITQLQCLHDFAVIYGFDCDIA 180

Query: 203 --------------IADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIED 262
                         + DA+ LF+ +  RD+VSWN+++S Y+ +G   EIL+LL  M+ + 
Sbjct: 181 VMNSMLNLYCKCDHVGDAKDLFDQMEQRDMVSWNTMISGYASVGNMSEILKLLYRMRGDG 240

Query: 263 IKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAY 322
           ++PD+QTF ++LS S    DL +G+++H  ++K G ++D H+++AL+ +YL+C   + +Y
Sbjct: 241 LRPDQQTFGASLSVSGTMCDLEMGRMLHCQIVKTGFDVDMHLKTALITMYLKCGKEEASY 300

Query: 323 KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLG 382
           +V ++   KDVV WT MISGL++   A+KAL VF +M++S    S+  +AS +A+CAQLG
Sbjct: 301 RVLETIPNKDVVCWTVMISGLMRLGRAEKALIVFSEMLQSGSDLSSEAIASVVASCAQLG 360

Query: 383 CCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVA 442
             D+GAS+HGYVLR G  LD PA NSL+TMYAKC  L +S  IF +M E+DLV       
Sbjct: 361 SFDLGASVHGYVLRHGYTLDTPALNSLITMYAKCGHLDKSLVIFERMNERDLVXXXXXXX 420

Query: 443 GHAKNGYLSKGIFFFNEMR-KSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLI 502
                               K+  + DS TV SLLQAC SAGAL  GK IH  V+RS + 
Sbjct: 421 XXXXXXXXXXXXXXXXXXXFKTVQQVDSFTVVSLLQACSSAGALPVGKLIHCIVIRSFIR 480

Query: 503 PCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEF 562
           PC + +TALVDMY KCG LE AQ+CFD +  +D+V+W  LI GYGF+GKG+IAL  YSEF
Sbjct: 481 PCSLVDTALVDMYSKCGYLEAAQRCFDSISWKDVVSWGILIAGYGFHGKGDIALEIYSEF 540

Query: 563 LGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGK 622
           L +GMEPNHVIF++VLS+CSH G++ +GL I+ SM +DF + PN EH ACVVDLL RA +
Sbjct: 541 LHSGMEPNHVIFLAVLSSCSHNGMVQQGLKIFSSMVRDFGVEPNHEHLACVVDLLCRAKR 600

Query: 623 VDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANS 682
           +++A+ FYK  F  PSI VLG++LDACR NG+ E+  +I  DM ELKP D G++V+L +S
Sbjct: 601 IEDAFKFYKENFTRPSIDVLGIILDACRANGKTEVEDIICEDMIELKPGDAGHYVKLGHS 660

Query: 683 YASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKAL 702
           +A+M RWD V ++W QMRSLGLKK PGWS IE++G T TFF +H SH   +  +  +K L
Sbjct: 661 FAAMKRWDDVSESWNQMRSLGLKKLPGWSKIEMNGKTTTFFMNHTSHS--DDTVSLLKLL 720

BLAST of CsGy1G030810 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 5.9e-108
Identity = 214/692 (30.92%), Positives = 368/692 (53.18%), Query Frame = 0

Query: 60  QKTHTQLDAYTFPS--LFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFG 119
           ++ +   + Y  P+  L + C++L      L L   V  NGL  + +  + L+S + ++G
Sbjct: 27  ERNYIPANVYEHPAALLLERCSSLKELRQILPL---VFKNGLYQEHFFQTKLVSLFCRYG 86

Query: 120 CIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPT--------- 179
            +    +VF+ +  +  V + T++  +++  D+D A   F +MR   ++P          
Sbjct: 87  SVDEAARVFEPIDSKLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLK 146

Query: 180 -----------------------SVTLIA---------------DARRLFESIGCRDIVS 239
                                  S+ L A               +AR++F+ +  RD+VS
Sbjct: 147 VCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVS 206

Query: 240 WNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLML 299
           WN++++ YS+ G     L+++++M  E++KP   T  S L A +    + +GK +HG  +
Sbjct: 207 WNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAM 266

Query: 300 KDGLNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALG 359
           + G +   ++ +ALV +Y +C  L+ A ++F    E++VV W +MI   VQN+   +A+ 
Sbjct: 267 RSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAML 326

Query: 360 VFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYA 419
           +F +M++  VKP+  ++   L ACA LG  + G  IH   +  G+  ++   NSL++MY 
Sbjct: 327 IFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYC 386

Query: 420 KCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTS 479
           KC ++  + S+F K+  + LVSWNA++ G A+NG     + +F++MR   ++PD+ T  S
Sbjct: 387 KCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVS 446

Query: 480 LLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRD 539
           ++ A          KWIH  V+RS L   +   TALVDMY KCG +  A+  FD M +R 
Sbjct: 447 VITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERH 506

Query: 540 LVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYE 599
           +  W+ +I GYG +G G+ AL  + E     ++PN V F+SV+SACSH GL+  GL  + 
Sbjct: 507 VTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFY 566

Query: 600 SMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRV 659
            M +++ +  +++H   +VDLL RAG+++EA+ F   M  +P++ V G +L AC+++  V
Sbjct: 567 MMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNV 626

Query: 660 ELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEV 703
              +  A  +FEL P D G  V LAN Y + S W+ V +    M   GL+K PG S +E+
Sbjct: 627 NFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEI 686

BLAST of CsGy1G030810 vs. Swiss-Prot
Match: sp|O81767|PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX=3702 GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 387.1 bits (993), Expect = 4.2e-106
Identity = 223/679 (32.84%), Positives = 353/679 (51.99%), Query Frame = 0

Query: 73  SLFKACTNLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKR 132
           +LF+ CTNL        LH  +VV+    +  I + L++ Y   G + L R  FD +  R
Sbjct: 59  TLFRYCTNL---QSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTFDHIQNR 118

Query: 133 NVVPWTTIIGSYSREG---DIDIAFSMFKQMRESGIQPTSVTL----------------- 192
           +V  W  +I  Y R G   ++   FS+F  M  SG+ P   T                  
Sbjct: 119 DVYAWNLMISGYGRAGNSSEVIRCFSLF--MLSSGLTPDYRTFPSVLKACRTVIDGNKIH 178

Query: 193 ---------------------------IADARRLFESIGCRDIVSWNSLLSAYSKIGATE 252
                                      + +AR LF+ +  RD+ SWN+++S Y + G  +
Sbjct: 179 CLALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAK 238

Query: 253 EILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALV 312
           E L L   ++      D  T  S LSA    GD   G  +H   +K GL  +  V + L+
Sbjct: 239 EALTLSNGLR----AMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLI 298

Query: 313 VLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTA 372
            LY     L    KVF     +D++ W ++I     N+   +A+ +F +M  S ++P   
Sbjct: 299 DLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRIQPDCL 358

Query: 373 TLASGLAACAQLGCCDIGASIHGYVLRQGIML-DIPAQNSLVTMYAKCNKLQQSCSIFNK 432
           TL S  +  +QLG      S+ G+ LR+G  L DI   N++V MYAK   +  + ++FN 
Sbjct: 359 TLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSARAVFNW 418

Query: 433 MVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEM-RKSFLRPDSITVTSLLQACGSAGALCQ 492
           +   D++SWN I++G+A+NG+ S+ I  +N M  +  +  +  T  S+L AC  AGAL Q
Sbjct: 419 LPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQ 478

Query: 493 GKWIHNFVLRSSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGF 552
           G  +H  +L++ L   +   T+L DMY KCG LE+A   F  + + + V W+TLI  +GF
Sbjct: 479 GMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIACHGF 538

Query: 553 NGKGEIALRKYSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLE 612
           +G GE A+  + E L  G++P+H+ F+++LSACSH GL+ +G   +E M  D+ ++P+L+
Sbjct: 539 HGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLK 598

Query: 613 HRACVVDLLSRAGKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFEL 672
           H  C+VD+  RAG+++ A  F K M  +P   + G LL ACRV+G V+LGK+ +  +FE+
Sbjct: 599 HYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEV 658

Query: 673 KPVDPGNFVQLANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNS 703
           +P   G  V L+N YAS  +W+GV++  +     GL+K PGWSS+EV      F+  + +
Sbjct: 659 EPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQT 718

BLAST of CsGy1G030810 vs. Swiss-Prot
Match: sp|Q7Y211|PP285_ARATH (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 380.9 bits (977), Expect = 3.0e-104
Identity = 226/726 (31.13%), Positives = 383/726 (52.75%), Query Frame = 0

Query: 51  QVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQSVVVNGLSHDSY-IGSSL 110
           + + TY+ M     + D Y FP+L KA  +L     G  +H  V   G   DS  + ++L
Sbjct: 80  EAVLTYVDMIVLGIKPDNYAFPALLKAVADLQDMELGKQIHAHVYKFGYGVDSVTVANTL 139

Query: 111 ISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIAFSMFKQMRESGIQPTS 170
           ++ Y K G      KVFD + +RN V W ++I S       ++A   F+ M +  ++P+S
Sbjct: 140 VNLYRKCGDFGAVYKVFDRISERNQVSWNSLISSLCSFEKWEMALEAFRCMLDENVEPSS 199

Query: 171 VTLI-------------------------------------------------ADARRLF 230
            TL+                                                 A ++ L 
Sbjct: 200 FTLVSVVTACSNLPMPEGLMMGKQVHAYGLRKGELNSFIINTLVAMYGKLGKLASSKVLL 259

Query: 231 ESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSALSASAIKGDLR 290
            S G RD+V+WN++LS+  +     E L+ L+ M +E ++PD+ T  S L A +    LR
Sbjct: 260 GSFGGRDLVTWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEFTISSVLPACSHLEMLR 319

Query: 291 LGKLVHGLMLKDG-LNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDVVMWTAMISGL 350
            GK +H   LK+G L+ +  V SALV +Y  C+ +    +VF    ++ + +W AMI+G 
Sbjct: 320 TGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDRKIGLWNAMIAGY 379

Query: 351 VQNDCADKALGVFYQMIES-NVKPSTATLASGLAACAQLGCCDIGASIHGYVLRQGIMLD 410
            QN+   +AL +F  M ES  +  ++ T+A  + AC + G      +IHG+V+++G+  D
Sbjct: 380 SQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVVKRGLDRD 439

Query: 411 IPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKGIFFFNEMR- 470
              QN+L+ MY++  K+  +  IF KM ++DLV+WN ++ G+  + +    +   ++M+ 
Sbjct: 440 RFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKMQN 499

Query: 471 ----------KSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALV 530
                     +  L+P+SIT+ ++L +C +  AL +GK IH + ++++L   +   +ALV
Sbjct: 500 LERKVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSALV 559

Query: 531 DMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHV 590
           DMY KCG L+ ++K FD + Q++++ W+ +I+ YG +G G+ A+      +  G++PN V
Sbjct: 560 DMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEV 619

Query: 591 IFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKM 650
            FISV +ACSH G++ +GL I+  M  D+ + P+ +H ACVVDLL RAG++ EAY    M
Sbjct: 620 TFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNM 679

Query: 651 MFKE-PSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDG 710
           M ++         LL A R++  +E+G++ A+++ +L+P    ++V LAN Y+S   WD 
Sbjct: 680 MPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDK 739

Query: 711 VEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALSKNIRNL-Y 712
             +    M+  G++K PG S IE       F A  +SHP+ EK+   ++ L + +R   Y
Sbjct: 740 ATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGY 799

BLAST of CsGy1G030810 vs. Swiss-Prot
Match: sp|Q9M9E2|PPR45_ARATH (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 375.9 bits (964), Expect = 9.7e-103
Identity = 218/717 (30.40%), Positives = 362/717 (50.49%), Query Frame = 0

Query: 22  GLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNL 81
           GL   S +   T   NS +  L   G   + ++   SMQ+    +D   F +L + C   
Sbjct: 48  GLSVLSSSSSSTHFSNSQLHGLCANGKLEEAMKLLNSMQELRVAVDEDVFVALVRLCEWK 107

Query: 82  NLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTII 141
                G  ++   + +  S    +G++ ++ + +FG +     VF  M +RN+  W  ++
Sbjct: 108 RAQEEGSKVYSIALSSMSSLGVELGNAFLAMFVRFGNLVDAWYVFGKMSERNLFSWNVLV 167

Query: 142 GSYSREGDIDIAFSMFKQMR-ESGIQPTSVTL---------------------------- 201
           G Y+++G  D A  ++ +M    G++P   T                             
Sbjct: 168 GGYAKQGYFDEAMCLYHRMLWVGGVKPDVYTFPCVLRTCGGIPDLARGKEVHVHVVRYGY 227

Query: 202 -------------------IADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQA 261
                              +  AR LF+ +  RDI+SWN+++S Y + G   E L+L  A
Sbjct: 228 ELDIDVVNALITMYVKCGDVKSARLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFA 287

Query: 262 MKIEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRC 321
           M+   + PD  T  S +SA  + GD RLG+ +H  ++  G  +D  V ++L  +YL    
Sbjct: 288 MRGLSVDPDLMTLTSVISACELLGDRRLGRDIHAYVITTGFAVDISVCNSLTQMYLNAGS 347

Query: 322 LDPAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAA 381
              A K+F     KD+V WT MISG   N   DKA+  +  M + +VKP   T+A+ L+A
Sbjct: 348 WREAEKLFSRMERKDIVSWTTMISGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSA 407

Query: 382 CAQLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSW 441
           CA LG  D G  +H   ++  ++  +   N+L+ MY+KC  + ++  IF+ +  K+++SW
Sbjct: 408 CATLGDLDTGVELHKLAIKARLISYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISW 467

Query: 442 NAIVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLR 501
            +I+AG   N    + + F  +M K  L+P++IT+T+ L AC   GAL  GK IH  VLR
Sbjct: 468 TSIIAGLRLNNRCFEALIFLRQM-KMTLQPNAITLTAALAACARIGALMCGKEIHAHVLR 527

Query: 502 SSLIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRK 561
           + +        AL+DMY +CG +  A   F+   ++D+ +W+ L+ GY   G+G + +  
Sbjct: 528 TGVGLDDFLPNALLDMYVRCGRMNTAWSQFNSQ-KKDVTSWNILLTGYSERGQGSMVVEL 587

Query: 562 YSEFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLS 621
           +   + + + P+ + FIS+L  CS   ++ +GL +Y S  +D+ ++PNL+H ACVVDLL 
Sbjct: 588 FDRMVKSRVRPDEITFISLLCGCSKSQMVRQGL-MYFSKMEDYGVTPNLKHYACVVDLLG 647

Query: 622 RAGKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQ 681
           RAG++ EA+ F + M   P   V G LL+ACR++ +++LG++ A+ +FEL     G ++ 
Sbjct: 648 RAGELQEAHKFIQKMPVTPDPAVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYIL 707

Query: 682 LANSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKI 691
           L N YA   +W  V K    M+  GL    G S +EV G    F +    HP+ ++I
Sbjct: 708 LCNLYADCGKWREVAKVRRMMKENGLTVDAGCSWVEVKGKVHAFLSDDKYHPQTKEI 761

BLAST of CsGy1G030810 vs. TrEMBL
Match: tr|A0A0A0M0F8|A0A0A0M0F8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G650050 PE=4 SV=1)

HSP 1 Score: 1322.4 bits (3421), Expect = 0.0e+00
Identity = 668/743 (89.91%), Positives = 669/743 (90.04%), Query Frame = 0

Query: 20  MSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 79
           MSG IHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT
Sbjct: 1   MSGFIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 60

Query: 80  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 139
           NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT
Sbjct: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120

Query: 140 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTL--------------------------- 199
           IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTL                           
Sbjct: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIILHGFES 180

Query: 200 -----------------IADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 259
                            IADARRLF+SI CRDIVS                         
Sbjct: 181 DLALSNSMVNMYGKCGRIADARRLFQSIDCRDIVSXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 260 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 319
              IKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD
Sbjct: 241 XXXIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300

Query: 320 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 379
           PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA
Sbjct: 301 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 360

Query: 380 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 439
           QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA
Sbjct: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 420

Query: 440 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 499
           IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS
Sbjct: 421 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480

Query: 500 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 559
           LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS
Sbjct: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 540

Query: 560 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 619
           EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA
Sbjct: 541 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 600

Query: 620 GKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 679
           GKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA
Sbjct: 601 GKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 660

Query: 680 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 719
           NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK
Sbjct: 661 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 720

BLAST of CsGy1G030810 vs. TrEMBL
Match: tr|A0A1S4DUX5|A0A1S4DUX5_CUCME (pentatricopeptide repeat-containing protein At4g04370 OS=Cucumis melo OX=3656 GN=LOC103487188 PE=4 SV=1)

HSP 1 Score: 1319.7 bits (3414), Expect = 0.0e+00
Identity = 665/741 (89.74%), Positives = 679/741 (91.63%), Query Frame = 0

Query: 20  MSGLIHESIAHGCTKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACT 79
           MS LIHESIAHG TKSFNSLVSRLS QGAHHQVLQTYISMQKTHT  DAYTFPSLFKACT
Sbjct: 1   MSRLIHESIAHGSTKSFNSLVSRLSSQGAHHQVLQTYISMQKTHTPSDAYTFPSLFKACT 60

Query: 80  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 139
           NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT
Sbjct: 61  NLNLFSHGLSLHQSVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTT 120

Query: 140 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTL--------------------------- 199
           IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTL                           
Sbjct: 121 IIGSYSREGDIDIAFSMFKQMRESGIQPTSVTLLSLLPGISKLPLLLCLHCLIFLYGFES 180

Query: 200 -----------------IADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMK 259
                            IADAR LFESI  RDIVSWNSLLSAYSKIGATEEILQL+QAMK
Sbjct: 181 DLALSNSMVNMYGKCGRIADARSLFESIDYRDIVSWNSLLSAYSKIGATEEILQLVQAMK 240

Query: 260 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 319
           IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD
Sbjct: 241 IEDIKPDKQTFCSALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLD 300

Query: 320 PAYKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACA 379
            A+KVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNV+PSTATLAS LAACA
Sbjct: 301 LAHKVFKSTTEKDVVMWTAMISGLVQNDCADKALGVFYQMIESNVQPSTATLASALAACA 360

Query: 380 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNA 439
           QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKD+VSWNA
Sbjct: 361 QLGCCDIGASIHGYVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDVVSWNA 420

Query: 440 IVAGHAKNGYLSKGIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 499
           IVAG+AKNGYLSK IFFFNEMR SF RPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS
Sbjct: 421 IVAGNAKNGYLSKAIFFFNEMRTSFQRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSS 480

Query: 500 LIPCIMTETALVDMYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYS 559
           LIPCIMTETALVDMYFKCGNLENAQKCFDCM QRDLVAWSTLIVGYGFNGKGEIALRKYS
Sbjct: 481 LIPCIMTETALVDMYFKCGNLENAQKCFDCMSQRDLVAWSTLIVGYGFNGKGEIALRKYS 540

Query: 560 EFLGTGMEPNHVIFISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRA 619
           EFLGTGMEPNHVIFISVLSACSH GLIS+GLSIYESMTKDFRM PNLEHRAC+VDLLSRA
Sbjct: 541 EFLGTGMEPNHVIFISVLSACSHSGLISQGLSIYESMTKDFRMPPNLEHRACIVDLLSRA 600

Query: 620 GKVDEAYSFYKMMFKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLA 679
           GKVDEAYSFYKMMFKEPS+VVLG LLDACRVNG VELGKVIARDMFELKPVDPGNFVQLA
Sbjct: 601 GKVDEAYSFYKMMFKEPSMVVLGTLLDACRVNGSVELGKVIARDMFELKPVDPGNFVQLA 660

Query: 680 NSYASMSRWDGVEKAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVK 717
           NSYASM+RWDGVEKAWTQMRSLGLKK+PGWSSIE+HGTTFTFFA+HNSHPKIEKIILTVK
Sbjct: 661 NSYASMNRWDGVEKAWTQMRSLGLKKFPGWSSIELHGTTFTFFAAHNSHPKIEKIILTVK 720

BLAST of CsGy1G030810 vs. TrEMBL
Match: tr|A0A2P4HYI0|A0A2P4HYI0_QUESU (Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_48904 PE=4 SV=1)

HSP 1 Score: 892.5 bits (2305), Expect = 6.3e-256
Identity = 431/721 (59.78%), Positives = 548/721 (76.01%), Query Frame = 0

Query: 33  TKSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQ 92
           TKSFN+++++LS QGAHH+VL TY SM  T+T  DA+TFPSL KACT+L L SHGLS HQ
Sbjct: 23  TKSFNAIINQLSSQGAHHEVLLTYSSMLNTNTPPDAFTFPSLLKACTSLELSSHGLSFHQ 82

Query: 93  SVVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDI 152
            V+VNG S DSYI SSLI+FYAKF C+   +KVFD M  RNVVPWT IIG YSR GD ++
Sbjct: 83  LVIVNGYSSDSYIASSLINFYAKFRCVSSAQKVFDIMPDRNVVPWTAIIGCYSRLGDANM 142

Query: 153 AFSMFKQMRESGIQPTSVTL---------------------------------------- 212
            FS++ +MR  GI+P+SVTL                                        
Sbjct: 143 VFSVYNEMRRQGIRPSSVTLLSMLSGAPELNHVQCLHGCTVLYGFESDIALANSILNVYG 202

Query: 213 ----IADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCS 272
               I DA+ LF+ +  RD++SWNSL+S Y++IG   E+LQLL  M+IE ++PD+QT+ S
Sbjct: 203 KCRSIEDAKDLFKFMDHRDMISWNSLISGYAQIGDINEVLQLLDRMRIEGMEPDQQTYGS 262

Query: 273 ALSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKD 332
            +S +A + +L+ GKLVHG ++K G  +D HVE++L+ +YL+C  +D A+++F+ TT+KD
Sbjct: 263 LVSVTATQSNLKWGKLVHGKVIKAGFYLDAHVETSLIAMYLKCGNIDNAFRIFEQTTDKD 322

Query: 333 VVMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHG 392
           VV+WTAMISGLVQNDCAD+AL VF++M++S VKPSTAT+AS LAACAQ    D+G SIHG
Sbjct: 323 VVLWTAMISGLVQNDCADEALTVFFEMLKSRVKPSTATIASALAACAQQDSYDLGTSIHG 382

Query: 393 YVLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSK 452
           Y+LRQG+ LDIPAQNSLV MY+KC+ L QSC++F++M ++DLVSWNAIVAGHA+NG++ K
Sbjct: 383 YILRQGMTLDIPAQNSLVNMYSKCSHLDQSCAVFDRMAKRDLVSWNAIVAGHAQNGHIFK 442

Query: 453 GIFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVD 512
            +  FNEMR +  RPDSITV SLLQ C S GAL QGKWIHNFV+RS L PCI+ +TALVD
Sbjct: 443 ALSLFNEMRTTLQRPDSITVVSLLQGCASTGALNQGKWIHNFVIRSCLRPCILVDTALVD 502

Query: 513 MYFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVI 572
           MY KCG+L+ AQKCFD M Q DLV+WS +I GYG +GKGE ALR YSEFL TG+EPNHVI
Sbjct: 503 MYSKCGDLDTAQKCFDGMAQHDLVSWSIIISGYGCHGKGETALRMYSEFLRTGIEPNHVI 562

Query: 573 FISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKMM 632
           F+SVLS CSH GL+  GLSI++SMT+DF ++PNLEHRAC+VDLLSRAG+V+EAY+FY+ M
Sbjct: 563 FLSVLSTCSHNGLVQHGLSIFQSMTEDFGIAPNLEHRACIVDLLSRAGRVEEAYNFYRRM 622

Query: 633 FKEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVE 692
           F+EP+I VLG+LLDACRVNG  ELG VIARD+  L+P + GN+VQLA+SYASM+RWDGV 
Sbjct: 623 FQEPTIDVLGILLDACRVNGNDELGDVIARDILMLRPANAGNYVQLAHSYASMNRWDGVG 682

Query: 693 KAWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALSKNIRNLYVKN 710
           + WTQMRSLGLKK PGWS IE+HG   TFF+ HNSHP+ E ++  +K LS  +R + + N
Sbjct: 683 EVWTQMRSLGLKKLPGWSFIELHGAITTFFSDHNSHPQSEDMVSVLKTLSWEMRKMDINN 742

BLAST of CsGy1G030810 vs. TrEMBL
Match: tr|A0A2I4FZU0|A0A2I4FZU0_9ROSI (pentatricopeptide repeat-containing protein At4g04370 OS=Juglans regia OX=51240 GN=LOC109003478 PE=4 SV=1)

HSP 1 Score: 860.1 bits (2221), Expect = 3.4e-246
Identity = 420/715 (58.74%), Positives = 532/715 (74.41%), Query Frame = 0

Query: 34  KSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQS 93
           KSFN++++RLS QG+H +VL TY SM  T+T  DA+TFPSL KA T L+L SHG SLHQ 
Sbjct: 24  KSFNAVINRLSSQGSHREVLLTYSSMLSTNTPPDAFTFPSLLKAFTFLDLLSHGFSLHQR 83

Query: 94  VVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIA 153
           V+VNG S D+YI SSLI+FYAKFG      KVFD M +RNVVPWT +IG YSR GD+  A
Sbjct: 84  VIVNGYSSDAYIASSLINFYAKFGFTSTAHKVFDVMPERNVVPWTAVIGCYSRMGDVCTA 143

Query: 154 FSMFKQMRESGIQPTSVTL----------------------------------------- 213
           FSM+ +MR  GIQPTSVTL                                         
Sbjct: 144 FSMYNEMRGQGIQPTSVTLLTMLSGAPELTHLQCLHSCAVLYGFESDIALANSILNVYGK 203

Query: 214 ---IADARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSA 273
              + DA+  FE +  RDIVSWNSL+S Y++IG   E+   L  M+IE ++PD+ TF S 
Sbjct: 204 CGSVEDAKEFFEFMDSRDIVSWNSLISGYAQIGNIREVFHNLDRMRIEGMEPDQLTFGSL 263

Query: 274 LSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDV 333
           LS +A + +L+LGK+VHG +L+ G  +D HVE++LVV+YL+C  +D A+K+F+    +DV
Sbjct: 264 LSVTATQSNLKLGKMVHGKILRAGFYLDAHVETSLVVMYLKCGNIDIAFKIFEQIPSRDV 323

Query: 334 VMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHGY 393
           V+WTAMISGLVQNDCAD+AL VF QM++S V+PSTAT+AS LAACAQLG  D+G S+HG+
Sbjct: 324 VLWTAMISGLVQNDCADEALKVFCQMLKSRVEPSTATIASALAACAQLGSFDLGTSVHGF 383

Query: 394 VLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKG 453
           +LRQG+ LDIPAQNSLVTMYAKC  L QSC++F++M  +DLVSWNAIVAG+A+NG + K 
Sbjct: 384 ILRQGMTLDIPAQNSLVTMYAKCGHLDQSCAVFDRMARRDLVSWNAIVAGYAQNGLVCKA 443

Query: 454 IFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDM 513
           +  F  MR +  RPDS+TV SLLQ C S GAL QGK +HNFV+RS L PCI+ +TALVDM
Sbjct: 444 LILFGRMRTALQRPDSVTVVSLLQGCASIGALHQGKRMHNFVIRSCLTPCILVDTALVDM 503

Query: 514 YFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIF 573
           Y KCG L+ AQKCFD M Q+DLV WST+I GYG +GKGE ALR YS+F+ TG+EPNHVIF
Sbjct: 504 YSKCGYLDTAQKCFDGMSQQDLVTWSTIIAGYGSHGKGETALRMYSDFIRTGIEPNHVIF 563

Query: 574 ISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKMMF 633
           +SVLSACSH GL+ +GLSI++SMT DF ++PNLEHRAC+VDLLSRAG+V+EAY++YK +F
Sbjct: 564 LSVLSACSHNGLVDQGLSIFQSMTDDFGIAPNLEHRACIVDLLSRAGRVEEAYNYYKRLF 623

Query: 634 KEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVEK 693
            EPS+ VLG+LLDACR NG+ E+ ++IARD+  L+PV+ GN+VQLA+SYASM+RWDGV +
Sbjct: 624 PEPSVDVLGILLDACRANGKDEICEIIARDVLMLRPVNAGNYVQLAHSYASMNRWDGVGE 683

Query: 694 AWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALSKNIRNL 705
            WTQMRSLGLKK PGWS IE++GT  TFF+ HNSHP+ E I+  +  LS  +R +
Sbjct: 684 VWTQMRSLGLKKLPGWSFIELYGTITTFFSDHNSHPQSEDIVSILGTLSWEMRKM 738

BLAST of CsGy1G030810 vs. TrEMBL
Match: tr|A0A2P6Q2F9|A0A2P6Q2F9_ROSCH (Putative tetratricopeptide-like helical domain-containing protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr5g0002401 PE=4 SV=1)

HSP 1 Score: 852.0 bits (2200), Expect = 9.4e-244
Identity = 410/713 (57.50%), Positives = 532/713 (74.61%), Query Frame = 0

Query: 34  KSFNSLVSRLSYQGAHHQVLQTYISMQKTHTQLDAYTFPSLFKACTNLNLFSHGLSLHQS 93
           ++FN++++RLS QG+HH+VL TY SM KTH   D +TFP+L KACT+LNLF +G+S HQ 
Sbjct: 23  RAFNAIINRLSSQGSHHEVLATYSSMLKTHIAPDTHTFPNLLKACTSLNLFCYGVSCHQC 82

Query: 94  VVVNGLSHDSYIGSSLISFYAKFGCIHLGRKVFDTMLKRNVVPWTTIIGSYSREGDIDIA 153
           +VVNG S D+YI SSLI+FYAKFG     RKVFDTM +RNVVPWT+IIG YSR   + +A
Sbjct: 83  IVVNGFSSDAYIASSLINFYAKFGYAQNARKVFDTMAERNVVPWTSIIGCYSRAASVGVA 142

Query: 154 FSMFKQMRESGIQPTSVTLIA--------------------------------------- 213
           F MF  MR  G+QP+SVTL++                                       
Sbjct: 143 FEMFGDMRREGVQPSSVTLLSLLSGALELAHVQCLHGCAVLYGFESDMSLMNSMLNVYCK 202

Query: 214 -----DARRLFESIGCRDIVSWNSLLSAYSKIGATEEILQLLQAMKIEDIKPDKQTFCSA 273
                DAR LFE +  RDIVSWNSL+S Y++ G   E+ QLL  M++E ++PDKQT+ +A
Sbjct: 203 CGRVEDARDLFEYLNRRDIVSWNSLISGYAQSGNIREVFQLLFKMRVEGVEPDKQTYATA 262

Query: 274 LSASAIKGDLRLGKLVHGLMLKDGLNIDQHVESALVVLYLRCRCLDPAYKVFKSTTEKDV 333
           +S +A + +L+LGK VHG +L+ G  +D HVE+AL+V+YL+CR +D A++VF+ T +KDV
Sbjct: 263 VSVAATQSNLKLGKSVHGQILRTGFELDSHVETALIVMYLKCRNIDLAFRVFERTIQKDV 322

Query: 334 VMWTAMISGLVQNDCADKALGVFYQMIESNVKPSTATLASGLAACAQLGCCDIGASIHGY 393
           V+WTAMISGL QND AD+AL VF QM+ES  +PS++T+AS LAACAQLG  D+G SIHGY
Sbjct: 323 VLWTAMISGLAQNDSADRALMVFSQMLESRTEPSSSTIASALAACAQLGSIDLGTSIHGY 382

Query: 394 VLRQGIMLDIPAQNSLVTMYAKCNKLQQSCSIFNKMVEKDLVSWNAIVAGHAKNGYLSKG 453
           VLRQG+ LDIPAQNSLVTMYAKC +L    ++F  M ++DLVSWNAIVAG+A+NG++ + 
Sbjct: 383 VLRQGMRLDIPAQNSLVTMYAKCARLDHCRAVFENMSKRDLVSWNAIVAGYAQNGHICEA 442

Query: 454 IFFFNEMRKSFLRPDSITVTSLLQACGSAGALCQGKWIHNFVLRSSLIPCIMTETALVDM 513
           +  F+EMR +  +PDS+TV SLLQAC S GAL QGKWIHNF++RS L PCI+ +TALVDM
Sbjct: 443 LVLFSEMRATLQKPDSLTVVSLLQACASTGALHQGKWIHNFIIRSCLRPCILVDTALVDM 502

Query: 514 YFKCGNLENAQKCFDCMLQRDLVAWSTLIVGYGFNGKGEIALRKYSEFLGTGMEPNHVIF 573
           Y KCG+++ A KCF  M  RDLV+WST+I GYG +GK E AL  Y+E L TG++PNHVIF
Sbjct: 503 YSKCGDIDKAHKCFVEMSDRDLVSWSTIISGYGCHGKTETALELYAELLQTGIKPNHVIF 562

Query: 574 ISVLSACSHGGLISKGLSIYESMTKDFRMSPNLEHRACVVDLLSRAGKVDEAYSFYKMMF 633
           +S+LSACSH GL+ KGLSIY+SMT+DF ++P+LEHRAC+VDLLSRAG+V++AY FYK +F
Sbjct: 563 LSILSACSHNGLVDKGLSIYQSMTEDFGIAPSLEHRACIVDLLSRAGRVEKAYDFYKRVF 622

Query: 634 KEPSIVVLGMLLDACRVNGRVELGKVIARDMFELKPVDPGNFVQLANSYASMSRWDGVEK 693
            EP++ VLG+LLDACR  G   L  +IA ++  L+PVD GN+VQLA++YASM+RWDGV +
Sbjct: 623 PEPAVDVLGILLDACRTKGNEFLVDIIAGEILRLRPVDAGNYVQLAHTYASMNRWDGVGE 682

Query: 694 AWTQMRSLGLKKYPGWSSIEVHGTTFTFFASHNSHPKIEKIILTVKALSKNIR 703
           AW QM+SLGLKK PGWS IE+HGT  TFF  HNS+P+I+ I+  +K LSK +R
Sbjct: 683 AWNQMKSLGLKKLPGWSFIELHGTITTFFTDHNSNPQIDDIVSLLKILSKEMR 735
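
The percentages in the HSP summary line above follow directly from the match counts over the alignment length (410 and 532 out of 713 aligned columns). A minimal sketch of that arithmetic, assuming the summary line is available as plain text; the function name `hsp_percentages` and the constant `HSP_LINE` are illustrative and not part of any BLAST tool:

```python
import re

# Summary line copied from the Rosa chinensis HSP above.
HSP_LINE = "Identity = 410/713 (57.50%), Positives = 532/713 (74.61%), Query Frame = 0"


def hsp_percentages(line):
    """Recompute each 'label = matches/length' fraction as a percentage."""
    result = {}
    # Only fields written as 'matches/length' are captured; 'Query Frame = 0' is skipped.
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result


print(hsp_percentages(HSP_LINE))
```

The recomputed values agree with the percentages printed in the summary line, and the identity figure is the same number reported for this match in the TrEMBL results table.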

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_004139152.1  0.0e+00   89.91         PREDICTED: pentatricopeptide repeat-containing protein At4g04370 [Cucumis sativu...
XP_016899786.1  0.0e+00   89.74         PREDICTED: pentatricopeptide repeat-containing protein At4g04370 [Cucumis melo]
XP_022159804.1  0.0e+00   76.47         pentatricopeptide repeat-containing protein At4g04370 [Momordica charantia] >XP_...
XP_022931568.1  6.7e-294  69.96         pentatricopeptide repeat-containing protein At4g04370 [Cucurbita moschata] >XP_0...
XP_023520788.1  1.8e-291  69.55         pentatricopeptide repeat-containing protein At4g04370 isoform X1 [Cucurbita pepo...

Match Name   E-value   Identity (%)  Description
AT4G04370.1  1.6e-188  46.96         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1  3.3e-109  30.92         Pentatricopeptide repeat (PPR) superfamily protein
AT4G33990.1  2.3e-107  32.84         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G57430.1  1.7e-105  31.13         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G15510.1  5.4e-104  30.40         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name             E-value   Identity (%)  Description
sp|Q9XE98|PP303_ARATH  2.9e-187  46.96         Pentatricopeptide repeat-containing protein At4g04370 OS=Arabidopsis thaliana OX...
sp|Q3E6Q1|PPR32_ARATH  5.9e-108  30.92         Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
sp|O81767|PP348_ARATH  4.2e-106  32.84         Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX...
sp|Q7Y211|PP285_ARATH  3.0e-104  31.13         Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop...
sp|Q9M9E2|PPR45_ARATH  9.7e-103  30.40         Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop...

Match Name                     E-value   Identity (%)  Description
tr|A0A0A0M0F8|A0A0A0M0F8_CUCSA  0.0e+00   89.91         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G650050 PE=4 SV=1
tr|A0A1S4DUX5|A0A1S4DUX5_CUCME  0.0e+00   89.74         pentatricopeptide repeat-containing protein At4g04370 OS=Cucumis melo OX=3656 GN...
tr|A0A2P4HYI0|A0A2P4HYI0_QUESU  6.3e-256  59.78         Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_4...
tr|A0A2I4FZU0|A0A2I4FZU0_9ROSI  3.4e-246  58.74         pentatricopeptide repeat-containing protein At4g04370 OS=Juglans regia OX=51240 ...
tr|A0A2P6Q2F9|A0A2P6Q2F9_ROSCH  9.4e-244  57.50         Putative tetratricopeptide-like helical domain-containing protein OS=Rosa chinen...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy1G030810.1  CsGy1G030810.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 290..320  e-value: 9.8E-7  score: 28.5
    coord: 568..590  e-value: 0.079  score: 13.1
    coord: 492..516  e-value: 0.44  score: 10.8
    coord: 464..490  e-value: 0.0066  score: 16.5
    coord: 527..554  e-value: 0.42  score: 10.9

IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 391..424  e-value: 2.7E-5  score: 22.1
    coord: 108..137  e-value: 0.002  score: 16.2
    coord: 290..323  e-value: 1.7E-7  score: 29.0
    coord: 137..168  e-value: 2.9E-8  score: 31.4
    coord: 189..222  e-value: 1.5E-4  score: 19.7

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 388..435  e-value: 2.2E-9  score: 37.3
    coord: 187..232  e-value: 1.3E-8  score: 34.8
    coord: 132..171  e-value: 1.8E-9  score: 37.5

IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 389..423  score: 10.578
    coord: 593..623  score: 5.13
    coord: 102..132  score: 6.971
    coord: 67..101  score: 6.643
    coord: 323..357  score: 5.042
    coord: 358..388  score: 7.783
    coord: 525..555  score: 7.048
    coord: 222..256  score: 6.248
    coord: 490..524  score: 8.988
    coord: 424..458  score: 7.18
    coord: 187..221  score: 11.509
    coord: 257..287  score: 5.218
    coord: 561..591  score: 7.092
    coord: 627..661  score: 7.509
    coord: 288..322  score: 11.192
    coord: 459..489  score: 6.993
    coord: 32..66  score: 5.886
    coord: 133..167  score: 12.759

IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  G3DSA:1.25.40.10
    coord: 290..449  e-value: 3.3E-32  score: 114.1
    coord: 174..289  e-value: 1.2E-18  score: 69.5
    coord: 544..689  e-value: 2.1E-12  score: 49.2
    coord: 84..173  e-value: 1.1E-15  score: 59.4
    coord: 450..543  e-value: 5.3E-15  score: 57.2

None  No IPR available  PANTHER  PTHR24015:SF653  SUBFAMILY NOT NAMED
    coord: 165..695
    coord: 23..148

None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 165..695
    coord: 23..148