CsGy1G002340 (gene) Cucumber (Gy14) v2

Name: CsGy1G002340
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: pentatricopeptide repeat-containing protein At3g47530
Location: Chr1 : 1471872 .. 1473680 (+)
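
As a quick consistency check, the feature length implied by the location line can be computed from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive and that the line follows the `Chr : start .. end (strand)` layout shown above; the helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr1 : 1471872 .. 1473680 (+)'.

    Returns (chromosome, start, end, strand). The layout is an assumption
    based on the location line of this record.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr1 : 1471872 .. 1473680 (+)")
length = end - start + 1  # inclusive coordinates, so add 1
print(chrom, strand, length)  # Chr1 + 1809
```

A length of 1809 nt corresponds to 603 codons, that is, 602 residues plus a stop codon, which is consistent with the 602-residue alignment length reported for the top BLAST hit.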
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGTGTCATCTTCCGCTCTCCTTCTATCCTCTCTTTGAAATACCGCCATCATTCTATTTCTTTTTCTCACTTCGAAAGAGAGCCTCTGATTTCTCTCATAAAATCATGCACCCACAAATCCCAATTGCTCCAAATCCATGCCCACATCATCACAACTTCTTCCGTTCAAGACCCCATTGTTTCCCTCCGCTTCTTGACGCGTACCGCCTCCGCCCCTTTTCGCGATTTGGGCTATTCTCGACGACTCTTTGATCTCCTGACTAACCCATTTGTTTCTCATTATAATGCGATGTTAAGAGCTTACTCTTTGAGCCGTTCACCACTAGAAGGATTGTACATGTACAGAGACATGGAGAGGCAAGGAGTTCGTGCTGATCCTTTGTCTTCTTCTTTTGCTGTTAAGTCGTGTATAAAATTGCTCTCGTTACTTTTCGGGATTCAGATTCACGCGAGGATTTTTAGAAATGGGCATCAAGCGGATAGTCTATTGCTCACTTCCATGATGGACCTGTATTCTCATTGTGGCAAACCTGAGGAAGCGTGCAAATTGTTCGATGAAGTTCCTCAAAAAGATGTTGTTGCTTGGAACGTTTTGATTTCTTGTTTAACTCGCAATAAACGAACTAGGGATGCTTTGGGTCTGTTTGAGATCATGCAGAGTCCAACGTATCTCTGCCAACCTGATAAAGTTACTTGTTTACTCCTCCTCCAAGCCTGTGCAGACTTGAATGCATTGGAGTTCGGTGAAAGGATTCATGGTTATATTCAACAACACGGTTATAATACTGAGAGTAATTTGTGTAATTCGCTGATATCAATGTATTCGCGGTGTGGGCGTATGGATAAGGCTTATGAGGTGTTTGATAAAATGACAGAGAAGAATGTTGTTTCATGGAGTGCGATGATTTCCGGGTTATCGATGAATGGGCACGGGAGGGAAGCTATTGAAGCATTTTGGGAGATGCAAAAGAATGGTGTCGAACCTGGTGATCATACTTTCACTGCTGTTCTTTCTGCTTGTAGCCACTGTGGCCTGGTTGATGAAGGAATGGCATTTTTTGATCGTATGAGACAGGAGTTCATGATAGCTCCCAACGTCCATCACTATGGATGTATAGTTGATCTTTTGGGTCGTGCTGGAATGCTCGATCAAGCCTACGAGCTCATAATGTCAATGGAGGTAAGACCAGATGCGACAATGTGGAGAACCCTTCTTGGAGCTTGCAGAATTCACGGTCATGGAAACCTTGGGGAGCGCATAGTTGAACATTTGATTGAACTCAAATCTCAAGAAGCAGGAGATTATGTGTTGTTGCTGAACATTTATTCTTCGGCTGGCAACTGGGACAAGGTAACTGAATTGAGGAAACTTATGAAGGAGAAGGGTATTTATACCACACCTTGCTGCACCACAATAGAACTGAACGGGGTGGTGCATCAATTTGCTGTGGATGATATTTCACATCCTATGAAAGACAAGATCTACAAGCAGTTGGATGAGATCAACAAGCAGCTAAAGATTGCAGGTTATGAAGCTGAAATGTCATCTGAATTACATAGATTAGAGCCAAAAGATAAAGGGTATGCACTTTCTAACCATAGTGAGAAATTGGCCATAGCCTTTGGGGTTCTTGCAACTCCGCCAGGAAGAACCATCAGAATCGCAAATAATATTCGTACTTGCATGGATTGTCATAACTTTGCTAAGTACATATCAAGTGTTTATAACAGAAAAGTGGTTGTTAGAGACCGAAGTCGGTTCCATCATTTCCAAGAGGGTCGGTGTTCCTGCAACGATTTTTGGTAA

mRNA sequence

ATGTGTGTCATCTTCCGCTCTCCTTCTATCCTCTCTTTGAAATACCGCCATCATTCTATTTCTTTTTCTCACTTCGAAAGAGAGCCTCTGATTTCTCTCATAAAATCATGCACCCACAAATCCCAATTGCTCCAAATCCATGCCCACATCATCACAACTTCTTCCGTTCAAGACCCCATTGTTTCCCTCCGCTTCTTGACGCGTACCGCCTCCGCCCCTTTTCGCGATTTGGGCTATTCTCGACGACTCTTTGATCTCCTGACTAACCCATTTGTTTCTCATTATAATGCGATGTTAAGAGCTTACTCTTTGAGCCGTTCACCACTAGAAGGATTGTACATGTACAGAGACATGGAGAGGCAAGGAGTTCGTGCTGATCCTTTGTCTTCTTCTTTTGCTGTTAAGTCGTGTATAAAATTGCTCTCGTTACTTTTCGGGATTCAGATTCACGCGAGGATTTTTAGAAATGGGCATCAAGCGGATAGTCTATTGCTCACTTCCATGATGGACCTGTATTCTCATTGTGGCAAACCTGAGGAAGCGTGCAAATTGTTCGATGAAGTTCCTCAAAAAGATGTTGTTGCTTGGAACGTTTTGATTTCTTGTTTAACTCGCAATAAACGAACTAGGGATGCTTTGGGTCTGTTTGAGATCATGCAGAGTCCAACGTATCTCTGCCAACCTGATAAAGTTACTTGTTTACTCCTCCTCCAAGCCTGTGCAGACTTGAATGCATTGGAGTTCGGTGAAAGGATTCATGGTTATATTCAACAACACGGTTATAATACTGAGAGTAATTTGTGTAATTCGCTGATATCAATGTATTCGCGGTGTGGGCGTATGGATAAGGCTTATGAGGTGTTTGATAAAATGACAGAGAAGAATGTTGTTTCATGGAGTGCGATGATTTCCGGGTTATCGATGAATGGGCACGGGAGGGAAGCTATTGAAGCATTTTGGGAGATGCAAAAGAATGGTGTCGAACCTGGTGATCATACTTTCACTGCTGTTCTTTCTGCTTGTAGCCACTGTGGCCTGGTTGATGAAGGAATGGCATTTTTTGATCGTATGAGACAGGAGTTCATGATAGCTCCCAACGTCCATCACTATGGATGTATAGTTGATCTTTTGGGTCGTGCTGGAATGCTCGATCAAGCCTACGAGCTCATAATGTCAATGGAGGTAAGACCAGATGCGACAATGTGGAGAACCCTTCTTGGAGCTTGCAGAATTCACGGTCATGGAAACCTTGGGGAGCGCATAGTTGAACATTTGATTGAACTCAAATCTCAAGAAGCAGGAGATTATGTGTTGTTGCTGAACATTTATTCTTCGGCTGGCAACTGGGACAAGGTAACTGAATTGAGGAAACTTATGAAGGAGAAGGGTATTTATACCACACCTTGCTGCACCACAATAGAACTGAACGGGGTGGTGCATCAATTTGCTGTGGATGATATTTCACATCCTATGAAAGACAAGATCTACAAGCAGTTGGATGAGATCAACAAGCAGCTAAAGATTGCAGGTTATGAAGCTGAAATGTCATCTGAATTACATAGATTAGAGCCAAAAGATAAAGGGTATGCACTTTCTAACCATAGTGAGAAATTGGCCATAGCCTTTGGGGTTCTTGCAACTCCGCCAGGAAGAACCATCAGAATCGCAAATAATATTCGTACTTGCATGGATTGTCATAACTTTGCTAAGTACATATCAAGTGTTTATAACAGAAAAGTGGTTGTTAGAGACCGAAGTCGGTTCCATCATTTCCAAGAGGGTCGGTGTTCCTGCAACGATTTTTGGTAA

Coding sequence (CDS)

ATGTGTGTCATCTTCCGCTCTCCTTCTATCCTCTCTTTGAAATACCGCCATCATTCTATTTCTTTTTCTCACTTCGAAAGAGAGCCTCTGATTTCTCTCATAAAATCATGCACCCACAAATCCCAATTGCTCCAAATCCATGCCCACATCATCACAACTTCTTCCGTTCAAGACCCCATTGTTTCCCTCCGCTTCTTGACGCGTACCGCCTCCGCCCCTTTTCGCGATTTGGGCTATTCTCGACGACTCTTTGATCTCCTGACTAACCCATTTGTTTCTCATTATAATGCGATGTTAAGAGCTTACTCTTTGAGCCGTTCACCACTAGAAGGATTGTACATGTACAGAGACATGGAGAGGCAAGGAGTTCGTGCTGATCCTTTGTCTTCTTCTTTTGCTGTTAAGTCGTGTATAAAATTGCTCTCGTTACTTTTCGGGATTCAGATTCACGCGAGGATTTTTAGAAATGGGCATCAAGCGGATAGTCTATTGCTCACTTCCATGATGGACCTGTATTCTCATTGTGGCAAACCTGAGGAAGCGTGCAAATTGTTCGATGAAGTTCCTCAAAAAGATGTTGTTGCTTGGAACGTTTTGATTTCTTGTTTAACTCGCAATAAACGAACTAGGGATGCTTTGGGTCTGTTTGAGATCATGCAGAGTCCAACGTATCTCTGCCAACCTGATAAAGTTACTTGTTTACTCCTCCTCCAAGCCTGTGCAGACTTGAATGCATTGGAGTTCGGTGAAAGGATTCATGGTTATATTCAACAACACGGTTATAATACTGAGAGTAATTTGTGTAATTCGCTGATATCAATGTATTCGCGGTGTGGGCGTATGGATAAGGCTTATGAGGTGTTTGATAAAATGACAGAGAAGAATGTTGTTTCATGGAGTGCGATGATTTCCGGGTTATCGATGAATGGGCACGGGAGGGAAGCTATTGAAGCATTTTGGGAGATGCAAAAGAATGGTGTCGAACCTGGTGATCATACTTTCACTGCTGTTCTTTCTGCTTGTAGCCACTGTGGCCTGGTTGATGAAGGAATGGCATTTTTTGATCGTATGAGACAGGAGTTCATGATAGCTCCCAACGTCCATCACTATGGATGTATAGTTGATCTTTTGGGTCGTGCTGGAATGCTCGATCAAGCCTACGAGCTCATAATGTCAATGGAGGTAAGACCAGATGCGACAATGTGGAGAACCCTTCTTGGAGCTTGCAGAATTCACGGTCATGGAAACCTTGGGGAGCGCATAGTTGAACATTTGATTGAACTCAAATCTCAAGAAGCAGGAGATTATGTGTTGTTGCTGAACATTTATTCTTCGGCTGGCAACTGGGACAAGGTAACTGAATTGAGGAAACTTATGAAGGAGAAGGGTATTTATACCACACCTTGCTGCACCACAATAGAACTGAACGGGGTGGTGCATCAATTTGCTGTGGATGATATTTCACATCCTATGAAAGACAAGATCTACAAGCAGTTGGATGAGATCAACAAGCAGCTAAAGATTGCAGGTTATGAAGCTGAAATGTCATCTGAATTACATAGATTAGAGCCAAAAGATAAAGGGTATGCACTTTCTAACCATAGTGAGAAATTGGCCATAGCCTTTGGGGTTCTTGCAACTCCGCCAGGAAGAACCATCAGAATCGCAAATAATATTCGTACTTGCATGGATTGTCATAACTTTGCTAAGTACATATCAAGTGTTTATAACAGAAAAGTGGTTGTTAGAGACCGAAGTCGGTTCCATCATTTCCAAGAGGGTCGGTGTTCCTGCAACGATTTTTGGTAA

Protein sequence

MCVIFRSPSILSLKYRHHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSVQDPIVSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAIAFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW
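
The protein sequence above is what the coding sequence gives when read codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative, and only short fragments of the CDS are used here rather than the full sequence.

```python
# Standard genetic code, written compactly with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First six codons and last five codons of the CDS listed above.
print(translate("ATGTGTGTCATCTTCCGC"))  # MCVIFR
print(translate("AACGATTTTTGGTAA"))     # NDFW*
```

The two fragments reproduce the start (`MCVIFR`) and end (`NDFW`) of the protein sequence, with the final `TAA` read as the stop codon.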
BLAST of CsGy1G002340 vs. NCBI nr
Match: XP_011660092.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g47530 [Cucumis sativus] >KGN63671.1 hypothetical protein Csa_1G009750 [Cucumis sativus])

HSP 1 Score: 1233.8 bits (3191), Expect = 0.0e+00
Identity = 599/602 (99.50%), Positives = 600/602 (99.67%), Query Frame = 0
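
The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the `a/b` fields; it assumes the `Label = a/b (p%)` layout used in this report, and the function name `hsp_percentages` is hypothetical.

```python
import re

def hsp_percentages(line):
    """Return {label: (matches, length, percent)} for each 'Label = a/b' field."""
    out = {}
    for label, a, b in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        out[label] = (int(a), int(b), round(100 * int(a) / int(b), 2))
    return out

stats = hsp_percentages(
    "Identity = 599/602 (99.50%), Positives = 600/602 (99.67%), Query Frame = 0"
)
for label, (matches, length, pct) in stats.items():
    print(f"{label}: {matches}/{length} = {pct:.2f}%")
```

Fields without a slash, such as the query frame, are ignored by the pattern, so only the identity and positive counts are returned.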

Query: 1   MCVIFRSPSILSLKYRHHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSVQDPI 60
           MCVIFRSPSILSLKY HHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSS+QDPI
Sbjct: 1   MCVIFRSPSILSLKYHHHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSIQDPI 60

Query: 61  VSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMER 120
           VSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMER
Sbjct: 61  VSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMER 120

Query: 121 QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEE 180
           QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIF NGHQADSLLLTSMMDLYSHCGKPEE
Sbjct: 121 QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFINGHQADSLLLTSMMDLYSHCGKPEE 180

Query: 181 ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC 240
           ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC
Sbjct: 181 ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC 240

Query: 241 ADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWS 300
           ADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWS
Sbjct: 241 ADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWS 300

Query: 301 AMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQE 360
           AMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQE
Sbjct: 301 AMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQE 360

Query: 361 FMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGER 420
           FMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGER
Sbjct: 361 FMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGER 420

Query: 421 IVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVH 480
           IVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVH
Sbjct: 421 IVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVH 480

Query: 481 QFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAI 540
           QFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAI
Sbjct: 481 QFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAI 540

Query: 541 AFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCND 600
           AFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCND
Sbjct: 541 AFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCND 600

Query: 601 FW 603
           FW
Sbjct: 601 FW 602
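
The unlabeled middle line of each alignment block encodes the comparison: a letter marks an identical residue, `+` marks a conservative (positive-scoring) substitution, and a blank marks any other mismatch. A minimal sketch of counting identities and positives from such a midline follows; the function name `midline_counts` is illustrative, and the fragment is taken from the end of the first alignment row above.

```python
def midline_counts(midline):
    """Count identities (letters) and positives (letters plus '+') in a midline."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives

# Last nine columns of the first alignment row (query / midline / subject).
query   = "TTSSVQDPI"
midline = "TTSS+QDPI"
sbjct   = "TTSSIQDPI"

print(midline_counts(midline))  # (8, 9)
```

Over the whole alignment above, the midline contains two blanks and one `+`, which agrees with the reported 599/602 identities and 600/602 positives.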

BLAST of CsGy1G002340 vs. NCBI nr
Match: XP_008453206.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g47530 [Cucumis melo])

HSP 1 Score: 1143.6 bits (2957), Expect = 0.0e+00
Identity = 561/602 (93.19%), Positives = 570/602 (94.68%), Query Frame = 0

Query: 1   MCVIFRSPSILSLKYRHHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSVQDPI 60
           M VIF  PSILS                 LISLIKSCTHKSQLLQIHAHII TSS+QDPI
Sbjct: 1   MYVIFHRPSILS-----------------LISLIKSCTHKSQLLQIHAHIIRTSSIQDPI 60

Query: 61  VSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMER 120
           VSLRFLTRTASAPFRDLGYSRR  DLLTNP VSHYNAMLRAYS+SRSPLEGLY+YRDMER
Sbjct: 61  VSLRFLTRTASAPFRDLGYSRRFLDLLTNPLVSHYNAMLRAYSVSRSPLEGLYVYRDMER 120

Query: 121 QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEE 180
           QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIF  GHQADSLLLTSMMDLYSHCGKPEE
Sbjct: 121 QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFIYGHQADSLLLTSMMDLYSHCGKPEE 180

Query: 181 ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC 240
           ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC
Sbjct: 181 ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC 240

Query: 241 ADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWS 300
           ADLNALEFGERIHGYIQQH YNTESNLCNSLISMYSRCGR+DKAYEVFDKM EKNVVSWS
Sbjct: 241 ADLNALEFGERIHGYIQQHCYNTESNLCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWS 300

Query: 301 AMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQE 360
           AMISGLSMNGHGREAIEAFWEMQKNGVEP DHTFTAVLSACSHCGLVDEGMAFFDRMRQE
Sbjct: 301 AMISGLSMNGHGREAIEAFWEMQKNGVEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQE 360

Query: 361 FMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGER 420
            MIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGH NLGER
Sbjct: 361 LMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHANLGER 420

Query: 421 IVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVH 480
           IVEHLIELKSQEAGDYVLLLNIYSSAG WDKVTELRKLMKEKGIYTTPCCTTIELNGVVH
Sbjct: 421 IVEHLIELKSQEAGDYVLLLNIYSSAGKWDKVTELRKLMKEKGIYTTPCCTTIELNGVVH 480

Query: 481 QFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAI 540
           +FAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRL+P+DKGYALSNHSEKLAI
Sbjct: 481 EFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLKPEDKGYALSNHSEKLAI 540

Query: 541 AFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCND 600
           AFGVLATPPGRTIR+ANNIRTCMDCHNFAKYISSVYNRKVV+RDRSRFHHFQEGRCSCND
Sbjct: 541 AFGVLATPPGRTIRVANNIRTCMDCHNFAKYISSVYNRKVVLRDRSRFHHFQEGRCSCND 585

Query: 601 FW 603
           FW
Sbjct: 601 FW 585

BLAST of CsGy1G002340 vs. NCBI nr
Match: XP_023515406.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1100.1 bits (2844), Expect = 0.0e+00
Identity = 530/600 (88.33%), Positives = 563/600 (93.83%), Query Frame = 0

Query: 4   IFRSP-SILSLKYRHHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSVQDPIVS 63
           +  SP S+LS K+R  + S   F+REPLISLIKSCTHKSQLLQIHAH+I TS +QDPIVS
Sbjct: 30  LLHSPISLLSSKFREQN-STLRFDREPLISLIKSCTHKSQLLQIHAHMIRTSFIQDPIVS 89

Query: 64  LRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMERQG 123
           LRFLTR  SAPFR+LGYSRR F  LTNPFVSHYN +LRAYSLSRSPLEGLYMYRDMERQG
Sbjct: 90  LRFLTRIVSAPFRELGYSRRFFSQLTNPFVSHYNTLLRAYSLSRSPLEGLYMYRDMERQG 149

Query: 124 VRADPLSSSFAVKSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEAC 183
           V ADPLSSSFAVKSCI++LSL  GIQIHARIFRNGHQ+DSLLLTSMMDLYSHCGK E+AC
Sbjct: 150 VHADPLSSSFAVKSCIRMLSLFSGIQIHARIFRNGHQSDSLLLTSMMDLYSHCGKLEDAC 209

Query: 184 KLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACAD 243
           KLFDE+PQ+DVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLC+PDKVTCLLLLQACAD
Sbjct: 210 KLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCKPDKVTCLLLLQACAD 269

Query: 244 LNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAM 303
           LNALEFGERIH YIQQ+ YNTESNLCNSLISMYSRCGR+DKAYEVFDKM EKNVVSWSAM
Sbjct: 270 LNALEFGERIHSYIQQNDYNTESNLCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAM 329

Query: 304 ISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFM 363
           ISGLSMNGHGREAIEAFW MQK GVEP DHTFTAVLSACSHCGLVDEGMAFFDRMRQEFM
Sbjct: 330 ISGLSMNGHGREAIEAFWAMQKKGVEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFM 389

Query: 364 IAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIV 423
           I P VHHYGC+VDLLGRAGMLDQAY+L+MSMEV PDATMWRTLLGACRIHGH NLGERI+
Sbjct: 390 IVPTVHHYGCMVDLLGRAGMLDQAYQLVMSMEVNPDATMWRTLLGACRIHGHANLGERII 449

Query: 424 EHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQF 483
           EHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRK MKE+GIYTTP CTTIELNGVVH+F
Sbjct: 450 EHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKERGIYTTPGCTTIELNGVVHEF 509

Query: 484 AVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAIAF 543
           AVDDISHPMKDKIYKQLDEIN+QLKIAGYEAE+SSELH L+ +DKGYALS HSEKLAIAF
Sbjct: 510 AVDDISHPMKDKIYKQLDEINQQLKIAGYEAEISSELHNLKAEDKGYALSYHSEKLAIAF 569

Query: 544 GVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
           GVLATPPGRTIR+ANN+RTCMDCHNFAKY+SSVYNRKVVVRDRSRFHHF+EGRCSCND+W
Sbjct: 570 GVLATPPGRTIRVANNLRTCMDCHNFAKYVSSVYNRKVVVRDRSRFHHFREGRCSCNDYW 628

BLAST of CsGy1G002340 vs. NCBI nr
Match: XP_022921651.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita moschata])

HSP 1 Score: 1097.8 bits (2838), Expect = 0.0e+00
Identity = 528/600 (88.00%), Positives = 563/600 (93.83%), Query Frame = 0

Query: 4   IFRSP-SILSLKYRHHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSVQDPIVS 63
           +  SP S+LS K+R  + S   F+REPLISLIKSCTHKSQLLQIHAH+I TS +QDPIVS
Sbjct: 30  LLHSPISLLSSKFREQN-STLRFDREPLISLIKSCTHKSQLLQIHAHMIRTSFIQDPIVS 89

Query: 64  LRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMERQG 123
           LRFLTR  SAPFR+LGYSRR F  LTNPFVSHYN +LRAYSLSRSPLEGLYMYRDMER+G
Sbjct: 90  LRFLTRIVSAPFRELGYSRRFFSQLTNPFVSHYNTLLRAYSLSRSPLEGLYMYRDMERRG 149

Query: 124 VRADPLSSSFAVKSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEAC 183
           V ADPLSSSFAVKSCI++LSL  G+QIHARIFRNGHQ+DSLLLTSMMDLYSHCGK E+AC
Sbjct: 150 VHADPLSSSFAVKSCIRMLSLFSGVQIHARIFRNGHQSDSLLLTSMMDLYSHCGKLEDAC 209

Query: 184 KLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACAD 243
           KLFDE+PQ+DVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLC+PDKVTCLLLLQACAD
Sbjct: 210 KLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCKPDKVTCLLLLQACAD 269

Query: 244 LNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAM 303
           LNALEFGERIH +IQQHGYNTESNLCNSLISMYSRCGR+DKAYEVFDKM EKNVVSWSAM
Sbjct: 270 LNALEFGERIHSHIQQHGYNTESNLCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAM 329

Query: 304 ISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFM 363
           ISGLSMNGHGREAIEAFW MQK GVEP DHTFTAVLSACSHCGLVDEGMAFFDRMRQEFM
Sbjct: 330 ISGLSMNGHGREAIEAFWAMQKKGVEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFM 389

Query: 364 IAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIV 423
           I P VHHYGC+VDLLGRAGMLDQAY+L+MSMEV PDATMWRTLLGACRIHGH NLGERI+
Sbjct: 390 IVPTVHHYGCMVDLLGRAGMLDQAYQLVMSMEVNPDATMWRTLLGACRIHGHANLGERII 449

Query: 424 EHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQF 483
           EHLIELKSQEAGDYVLLLNIYSSAGNW KVTELRK MKE+GIYTTP CTTIELNGVVH+F
Sbjct: 450 EHLIELKSQEAGDYVLLLNIYSSAGNWVKVTELRKFMKERGIYTTPGCTTIELNGVVHEF 509

Query: 484 AVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAIAF 543
           AVDDISHPMKDKIY+QLDEINKQLKIAGYEAE+SSELH L+ +DKGYALS HSEKLAIAF
Sbjct: 510 AVDDISHPMKDKIYEQLDEINKQLKIAGYEAEISSELHNLKAEDKGYALSFHSEKLAIAF 569

Query: 544 GVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
           GVLATPPGRTIR+ANN+RTCMDCHNFAKY+SSVYNRKVVVRDRSRFHHF+EGRCSCND+W
Sbjct: 570 GVLATPPGRTIRVANNLRTCMDCHNFAKYVSSVYNRKVVVRDRSRFHHFREGRCSCNDYW 628

BLAST of CsGy1G002340 vs. NCBI nr
Match: XP_022987181.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita maxima])

HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 525/600 (87.50%), Positives = 564/600 (94.00%), Query Frame = 0

Query: 4   IFRSP-SILSLKYRHHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSVQDPIVS 63
           +  SP S+LS K+R  + S  HF+REPLISLIKSCTHKSQLLQIHAH+I TS +QDPIVS
Sbjct: 30  LLHSPISLLSSKFRQQN-STLHFDREPLISLIKSCTHKSQLLQIHAHMIRTSFIQDPIVS 89

Query: 64  LRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMERQG 123
           LRFLTR  SAPFR+LGYSRR F  LTNPFVSHYN +LRAYSLSRSPLEGLYMYRDMERQG
Sbjct: 90  LRFLTRIVSAPFRELGYSRRFFSQLTNPFVSHYNTLLRAYSLSRSPLEGLYMYRDMERQG 149

Query: 124 VRADPLSSSFAVKSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEAC 183
           V ADPLSSSFA+KSCI++LSL  GIQIHARIFRNGHQ+DSLLLTSMMDLYSHCGK ++AC
Sbjct: 150 VHADPLSSSFALKSCIRMLSLFSGIQIHARIFRNGHQSDSLLLTSMMDLYSHCGKLKDAC 209

Query: 184 KLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACAD 243
           KLFDE+PQ+DVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLC+PDKVTCLLLLQACAD
Sbjct: 210 KLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCKPDKVTCLLLLQACAD 269

Query: 244 LNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAM 303
           LNALEFGERIH +IQQHGYNTESNLCNSLISMYSRCGR+DKAYEVFDKM EKNVVSWSAM
Sbjct: 270 LNALEFGERIHSHIQQHGYNTESNLCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAM 329

Query: 304 ISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFM 363
           ISGLSMNGHGREAIEAFW MQK GVEP DHTFTAVLSACSHCGLVDEGMAFFDRMRQEFM
Sbjct: 330 ISGLSMNGHGREAIEAFWAMQKKGVEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFM 389

Query: 364 IAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIV 423
           I P VHHYGC+VDLLGRAGMLDQAY+L+MSMEV PDATMWRTLLGACRIHGH NLGER++
Sbjct: 390 IVPTVHHYGCMVDLLGRAGMLDQAYQLVMSMEVNPDATMWRTLLGACRIHGHANLGERVI 449

Query: 424 EHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQF 483
           EHL+ELKSQEAGDYVLLLNIYSSAGNWDKVTELRK MKE+GIYTTP CTTIELNGVVH+F
Sbjct: 450 EHLVELKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKERGIYTTPGCTTIELNGVVHEF 509

Query: 484 AVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAIAF 543
           AVDDISHPMKDKIYKQLDEIN+QLKIAGYEAE+SSELH L+ +DKGYALS HSEKLAIAF
Sbjct: 510 AVDDISHPMKDKIYKQLDEINQQLKIAGYEAEISSELHNLKAEDKGYALSFHSEKLAIAF 569

Query: 544 GVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
           GVLATPPG TIR+ANN+RTC+DCHNFAKY+SSVYNRKVVVRDRS+FHHF+EGRCSCND+W
Sbjct: 570 GVLATPPGGTIRVANNLRTCLDCHNFAKYVSSVYNRKVVVRDRSQFHHFREGRCSCNDYW 628

BLAST of CsGy1G002340 vs. TAIR10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 733.0 bits (1891), Expect = 1.5e-211
Identity = 358/578 (61.94%), Positives = 450/578 (77.85%), Query Frame = 0

Query: 30  LISLIKSCTHKSQLLQIHAHIITTSSVQDPIVSLRFLTRTA-SAPFRDLGYSRRLFDLLT 89
           L+SLI S T K  L QIHA ++ TS +++  V   FL+R A S   RD+ YS R+F    
Sbjct: 14  LLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRL 73

Query: 90  NPFVSHYNAMLRAYSLSRSPLEGLYMYRDMER-QGVRADPLSSSFAVKSCIKLLSLLFGI 149
           NP +SH N M+RA+SLS++P EG  ++R + R   + A+PLSSSFA+K CIK   LL G+
Sbjct: 74  NPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGL 133

Query: 150 QIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNK 209
           QIH +IF +G  +DSLL+T++MDLYS C    +ACK+FDE+P++D V+WNVL SC  RNK
Sbjct: 134 QIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYLRNK 193

Query: 210 RTRDALGLFEIMQSPTYLC-QPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYNTESN 269
           RTRD L LF+ M++    C +PD VTCLL LQACA+L AL+FG+++H +I ++G +   N
Sbjct: 194 RTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSGALN 253

Query: 270 LCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNG 329
           L N+L+SMYSRCG MDKAY+VF  M E+NVVSW+A+ISGL+MNG G+EAIEAF EM K G
Sbjct: 254 LSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEMLKFG 313

Query: 330 VEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQ-EFMIAPNVHHYGCIVDLLGRAGMLDQ 389
           + P + T T +LSACSH GLV EGM FFDRMR  EF I PN+HHYGC+VDLLGRA +LD+
Sbjct: 314 ISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARLLDK 373

Query: 390 AYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNIYSS 449
           AY LI SME++PD+T+WRTLLGACR+HG   LGER++ HLIELK++EAGDYVLLLN YS+
Sbjct: 374 AYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNTYST 433

Query: 450 AGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEINKQ 509
            G W+KVTELR LMKEK I+T P C+ IEL G VH+F VDD+SHP K++IYK L EIN+Q
Sbjct: 434 VGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSHPRKEEIYKMLAEINQQ 493

Query: 510 LKIAGYEAEMSSELHRLE-PKDKGYALSNHSEKLAIAFGVLATPPGRTIRIANNIRTCMD 569
           LKIAGY AE++SELH LE  ++KGYAL  HSEKLAIAFG+L TPPG TIR+  N+RTC+D
Sbjct: 494 LKIAGYVAEITSELHNLESEEEKGYALRYHSEKLAIAFGILVTPPGTTIRVTKNLRTCVD 553

Query: 570 CHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
           CHNFAK++S VY+R V+VRDRSRFHHF+ G CSCNDFW
Sbjct: 554 CHNFAKFVSDVYDRIVIVRDRSRFHHFKGGSCSCNDFW 591

BLAST of CsGy1G002340 vs. TAIR10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 437.6 bits (1124), Expect = 1.3e-122
Identity = 220/580 (37.93%), Positives = 351/580 (60.52%), Query Frame = 0

Query: 33  LIKSCTHKSQL---LQIHAHIITTSSVQDPIVSLRFLTRTASAPFRDLG---YSRRLFDL 92
           LI  C H+S L   L++H HI+   S QDP     FL       + DLG   Y+R++FD 
Sbjct: 83  LILCCGHRSSLSDALRVHRHILDNGSDQDP-----FLATKLIGMYSDLGSVDYARKVFDK 142

Query: 93  LTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCI----KLLS 152
                +  +NA+ RA +L+    E L +Y  M R GV +D  + ++ +K+C+     +  
Sbjct: 143 TRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVASECTVNH 202

Query: 153 LLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISC 212
           L+ G +IHA + R G+ +   ++T+++D+Y+  G  + A  +F  +P ++VV+W+ +I+C
Sbjct: 203 LMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIAC 262

Query: 213 LTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYN 272
             +N +  +AL  F  M   T    P+ VT + +LQACA L ALE G+ IHGYI + G +
Sbjct: 263 YAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRGLD 322

Query: 273 TESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAMISGLSMNGHGREAIEAFWEM 332
           +   + ++L++MY RCG+++    VFD+M +++VVSW+++IS   ++G+G++AI+ F EM
Sbjct: 323 SILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQIFEEM 382

Query: 333 QKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIAPNVHHYGCIVDLLGRAGM 392
             NG  P   TF +VL ACSH GLV+EG   F+ M ++  I P + HY C+VDLLGRA  
Sbjct: 383 LANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANR 442

Query: 393 LDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNI 452
           LD+A +++  M   P   +W +LLG+CRIHG+  L ER    L  L+ + AG+YVLL +I
Sbjct: 443 LDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADI 502

Query: 453 YSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEI 512
           Y+ A  WD+V  ++KL++ +G+   P    +E+   ++ F   D  +P+ ++I+  L ++
Sbjct: 503 YAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKL 562

Query: 513 NKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAIAFGVLATPPGRTIRIANNIRTC 572
            + +K  GY  +    L+ LE ++K   +  HSEKLA+AFG++ T  G  IRI  N+R C
Sbjct: 563 AEDMKEKGYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITKNLRLC 622

Query: 573 MDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
            DCH F K+IS    ++++VRD +RFH F+ G CSC D+W
Sbjct: 623 EDCHLFTKFISKFMEKEILVRDVNRFHRFKNGVCSCGDYW 657

BLAST of CsGy1G002340 vs. TAIR10
Match: AT3G56550.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 436.8 bits (1122), Expect = 2.2e-122
Identity = 223/576 (38.72%), Positives = 348/576 (60.42%), Query Frame = 0

Query: 30  LISLIKSCTHKSQLLQIHAHIITTSSVQDPIVSLRFLTRTASAPFRDLGYSRRLFDLL-T 89
           ++ +++ C    +L +IH+H+I       P +    L   A +    L +++ LFD   +
Sbjct: 8   IVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFDHFDS 67

Query: 90  NPFVSHYNAMLRAYSLSRSPLEGLYMYRDMERQGV-RADPLSSSFAVKSCIKLLSLLFGI 149
           +P  S +N ++R +S S SPL  +  Y  M    V R D  + +FA+KSC ++ S+   +
Sbjct: 68  DPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSIPKCL 127

Query: 150 QIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNK 209
           +IH  + R+G   D+++ TS++  YS  G  E A K+FDE+P +D+V+WNV+I C +   
Sbjct: 128 EIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCFSHVG 187

Query: 210 RTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYNTESNL 269
               AL +++ M +   +C  D  T + LL +CA ++AL  G  +H         +   +
Sbjct: 188 LHNQALSMYKRMGNEG-VC-GDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCVFV 247

Query: 270 CNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNGV 329
            N+LI MY++CG ++ A  VF+ M +++V++W++MI G  ++GHG EAI  F +M  +GV
Sbjct: 248 SNALIDMYAKCGSLENAIGVFNGMRKRDVLTWNSMIIGYGVHGHGVEAISFFRKMVASGV 307

Query: 330 EPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIAPNVHHYGCIVDLLGRAGMLDQAY 389
            P   TF  +L  CSH GLV EG+  F+ M  +F + PNV HYGC+VDL GRAG L+ + 
Sbjct: 308 RPNAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLENSL 367

Query: 390 ELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNIYSSAG 449
           E+I +     D  +WRTLLG+C+IH +  LGE  ++ L++L++  AGDYVL+ +IYS+A 
Sbjct: 368 EMIYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSAAN 427

Query: 450 NWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEINKQLK 509
           +      +RKL++   + T P  + IE+   VH+F VDD  HP    IY +L E+  +  
Sbjct: 428 DAQAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINRAI 487

Query: 510 IAGYEAEMSSE-LHRLEPKDKGYALSNHSEKLAIAFGVLATPPGRTIRIANNIRTCMDCH 569
           +AGY+ E S+     L  +  G A ++HSEKLAIA+G++ T  G T+RI  N+R C DCH
Sbjct: 488 LAGYKPEDSNRTAPTLSDRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCRDCH 547

Query: 570 NFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
           +F KY+S  +NR+++VRDR RFHHF +G CSCND+W
Sbjct: 548 SFTKYVSKAFNREIIVRDRVRFHHFADGICSCNDYW 581

BLAST of CsGy1G002340 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 429.9 bits (1104), Expect = 2.6e-120
Identity = 204/486 (41.98%), Positives = 312/486 (64.20%), Query Frame = 0

Query: 120 RQGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPE 179
           +  VR D  +    V +C +  S+  G Q+H  I  +G  ++  ++ +++DLYS CG+ E
Sbjct: 259 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 318

Query: 180 EACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLF-EIMQSPTYLCQPDKVTCLLLLQ 239
            AC LF+ +P KDV++WN LI   T     ++AL LF E+++S      P+ VT L +L 
Sbjct: 319 TACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGE---TPNDVTMLSILP 378

Query: 240 ACADLNALEFGERIHGYIQQH--GYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNV 299
           ACA L A++ G  IH YI +   G    S+L  SLI MY++CG ++ A++VF+ +  K++
Sbjct: 379 ACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSL 438

Query: 300 VSWSAMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDR 359
            SW+AMI G +M+G    + + F  M+K G++P D TF  +LSACSH G++D G   F  
Sbjct: 439 SSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRT 498

Query: 360 MRQEFMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGN 419
           M Q++ + P + HYGC++DLLG +G+  +A E+I  ME+ PD  +W +LL AC++HG+  
Sbjct: 499 MTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVE 558

Query: 420 LGERIVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELN 479
           LGE   E+LI+++ +  G YVLL NIY+SAG W++V + R L+ +KG+   P C++IE++
Sbjct: 559 LGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEID 618

Query: 480 GVVHQFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSE 539
            VVH+F + D  HP   +IY  L+E+   L+ AG+  + S  L  +E + K  AL +HSE
Sbjct: 619 SVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSE 678

Query: 540 KLAIAFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRC 599
           KLAIAFG+++T PG  + I  N+R C +CH   K IS +Y R+++ RDR+RFHHF++G C
Sbjct: 679 KLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVC 738

Query: 600 SCNDFW 603
           SCND+W
Sbjct: 739 SCNDYW 741

BLAST of CsGy1G002340 vs. TAIR10
Match: AT1G34160.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 422.5 bits (1085), Expect = 4.2e-118
Identity = 219/582 (37.63%), Positives = 346/582 (59.45%), Query Frame = 0

Query: 32  SLIKSCTHKSQLLQIHAHIITTSSVQDPIVSLRFLTRTASAPFRDLGYSRRLFDLLTNPF 91
           ++I+ C   SQ+ Q+ +H +T    Q   +  R L R A +PF DL ++ ++F  +  P 
Sbjct: 8   TMIQKCVSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIFRYIPKPL 67

Query: 92  VSHYNAMLRAYSLSRSPLEGLYMYRDMERQG------VRADPLSSSFAVKSCIKLLSLLF 151
            + +NA++R ++ S  P      YR M +Q        R D L+ SF +K+C + L    
Sbjct: 68  TNDWNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACARALCSSA 127

Query: 152 GIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTR 211
             Q+H +I R G  ADSLL T+++D YS  G    A KLFDE+P +DV +WN LI+ L  
Sbjct: 128 MDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAGLVS 187

Query: 212 NKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYNTES 271
             R  +A+ L++ M+  T   +  +VT +  L AC+ L  ++ GE I      HGY+ ++
Sbjct: 188 GNRASEAMELYKRME--TEGIRRSEVTVVAALGACSHLGDVKEGENIF-----HGYSNDN 247

Query: 272 NL-CNSLISMYSRCGRMDKAYEVFDKMT-EKNVVSWSAMISGLSMNGHGREAIEAFWEMQ 331
            +  N+ I MYS+CG +DKAY+VF++ T +K+VV+W+ MI+G +++G    A+E F +++
Sbjct: 248 VIVSNAAIDMYSKCGFVDKAYQVFEQFTGKKSVVTWNTMITGFAVHGEAHRALEIFDKLE 307

Query: 332 KNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIAPNVHHYGCIVDLLGRAGML 391
            NG++P D ++ A L+AC H GLV+ G++ F+ M  +  +  N+ HYGC+VDLL RAG L
Sbjct: 308 DNGIKPDDVSYLAALTACRHAGLVEYGLSVFNNMACK-GVERNMKHYGCVVDLLSRAGRL 367

Query: 392 DQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNIY 451
            +A+++I SM + PD  +W++LLGA  I+    + E     + E+     GD+VLL N+Y
Sbjct: 368 REAHDIICSMSMIPDPVLWQSLLGASEIYSDVEMAEIASREIKEMGVNNDGDFVLLSNVY 427

Query: 452 SSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEIN 511
           ++ G W  V  +R  M+ K +   P  + IE  G +H+F   D SH    +IY+++DEI 
Sbjct: 428 AAQGRWKDVGRVRDDMESKQVKKIPGLSYIEAKGTIHEFYNSDKSHEQWREIYEKIDEIR 487

Query: 512 KQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAIAFGVL---ATPPGRTIRIANNIR 571
            +++  GY A+    LH +  ++K  AL  HSEKLA+A+G++          +R+ NN+R
Sbjct: 488 FKIREDGYVAQTGLVLHDIGEEEKENALCYHSEKLAVAYGLMMMDGADEESPVRVINNLR 547

Query: 572 TCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
            C DCH   K+IS +Y R+++VRDR RFH F++G CSC DFW
Sbjct: 548 ICGDCHVVFKHISKIYKREIIVRDRVRFHRFKDGSCSCRDFW 581

BLAST of CsGy1G002340 vs. Swiss-Prot
Match: sp|Q9SN85|PP267_ARATH (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 733.0 bits (1891), Expect = 2.6e-210
Identity = 358/578 (61.94%), Positives = 450/578 (77.85%), Query Frame = 0

Query: 30  LISLIKSCTHKSQLLQIHAHIITTSSVQDPIVSLRFLTRTA-SAPFRDLGYSRRLFDLLT 89
           L+SLI S T K  L QIHA ++ TS +++  V   FL+R A S   RD+ YS R+F    
Sbjct: 14  LLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRL 73

Query: 90  NPFVSHYNAMLRAYSLSRSPLEGLYMYRDMER-QGVRADPLSSSFAVKSCIKLLSLLFGI 149
           NP +SH N M+RA+SLS++P EG  ++R + R   + A+PLSSSFA+K CIK   LL G+
Sbjct: 74  NPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGL 133

Query: 150 QIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNK 209
           QIH +IF +G  +DSLL+T++MDLYS C    +ACK+FDE+P++D V+WNVL SC  RNK
Sbjct: 134 QIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYLRNK 193

Query: 210 RTRDALGLFEIMQSPTYLC-QPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYNTESN 269
           RTRD L LF+ M++    C +PD VTCLL LQACA+L AL+FG+++H +I ++G +   N
Sbjct: 194 RTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSGALN 253

Query: 270 LCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNG 329
           L N+L+SMYSRCG MDKAY+VF  M E+NVVSW+A+ISGL+MNG G+EAIEAF EM K G
Sbjct: 254 LSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEMLKFG 313

Query: 330 VEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQ-EFMIAPNVHHYGCIVDLLGRAGMLDQ 389
           + P + T T +LSACSH GLV EGM FFDRMR  EF I PN+HHYGC+VDLLGRA +LD+
Sbjct: 314 ISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARLLDK 373

Query: 390 AYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNIYSS 449
           AY LI SME++PD+T+WRTLLGACR+HG   LGER++ HLIELK++EAGDYVLLLN YS+
Sbjct: 374 AYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNTYST 433

Query: 450 AGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEINKQ 509
            G W+KVTELR LMKEK I+T P C+ IEL G VH+F VDD+SHP K++IYK L EIN+Q
Sbjct: 434 VGKWEKVTELRSLMKEKRIHTKPGCSAIELQGTVHEFIVDDVSHPRKEEIYKMLAEINQQ 493

Query: 510 LKIAGYEAEMSSELHRLE-PKDKGYALSNHSEKLAIAFGVLATPPGRTIRIANNIRTCMD 569
           LKIAGY AE++SELH LE  ++KGYAL  HSEKLAIAFG+L TPPG TIR+  N+RTC+D
Sbjct: 494 LKIAGYVAEITSELHNLESEEEKGYALRYHSEKLAIAFGILVTPPGTTIRVTKNLRTCVD 553

Query: 570 CHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
           CHNFAK++S VY+R V+VRDRSRFHHF+ G CSCNDFW
Sbjct: 554 CHNFAKFVSDVYDRIVIVRDRSRFHHFKGGSCSCNDFW 591
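
The percentages in an HSP summary line are the match count divided by the aligned length. As a minimal sketch (the helper name `check_hsp_line` and the regular expression are illustrative assumptions, not part of any BLAST tool), the figures reported for the At3g47530 hit above can be recomputed like this:

```python
import re

# Matches "<label> = <matches>/<aligned length> (<percent>%)" pairs
# in an HSP summary line; the label itself is not interpreted.
PAIR = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_line(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    out = {}
    for label, num, den, pct in PAIR.findall(line):
        num, den = int(num), int(den)
        out[label] = (num, den, float(pct), round(100 * num / den, 2))
    return out

# Summary line of the Swiss-Prot hit Q9SN85 (At3g47530).
line = "Identity = 358/578 (61.94%), Positives = 450/578 (77.85%), Query Frame = 0"
stats = check_hsp_line(line)
for label, (num, den, reported, recomputed) in stats.items():
    print(label, num, den, reported, recomputed)
```

Comparing the reported and recomputed values is a quick sanity check when HSP lines are copied by hand or scraped from a page.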

BLAST of CsGy1G002340 vs. Swiss-Prot
Match: sp|Q9STF3|PP265_ARATH (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR2 PE=2 SV=1)

HSP 1 Score: 437.6 bits (1124), Expect = 2.3e-121
Identity = 220/580 (37.93%), Positives = 351/580 (60.52%), Query Frame = 0

Query: 33  LIKSCTHKSQL---LQIHAHIITTSSVQDPIVSLRFLTRTASAPFRDLG---YSRRLFDL 92
           LI  C H+S L   L++H HI+   S QDP     FL       + DLG   Y+R++FD 
Sbjct: 83  LILCCGHRSSLSDALRVHRHILDNGSDQDP-----FLATKLIGMYSDLGSVDYARKVFDK 142

Query: 93  LTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCI----KLLS 152
                +  +NA+ RA +L+    E L +Y  M R GV +D  + ++ +K+C+     +  
Sbjct: 143 TRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVASECTVNH 202

Query: 153 LLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISC 212
           L+ G +IHA + R G+ +   ++T+++D+Y+  G  + A  +F  +P ++VV+W+ +I+C
Sbjct: 203 LMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIAC 262

Query: 213 LTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYN 272
             +N +  +AL  F  M   T    P+ VT + +LQACA L ALE G+ IHGYI + G +
Sbjct: 263 YAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRGLD 322

Query: 273 TESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAMISGLSMNGHGREAIEAFWEM 332
           +   + ++L++MY RCG+++    VFD+M +++VVSW+++IS   ++G+G++AI+ F EM
Sbjct: 323 SILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQIFEEM 382

Query: 333 QKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIAPNVHHYGCIVDLLGRAGM 392
             NG  P   TF +VL ACSH GLV+EG   F+ M ++  I P + HY C+VDLLGRA  
Sbjct: 383 LANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANR 442

Query: 393 LDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNI 452
           LD+A +++  M   P   +W +LLG+CRIHG+  L ER    L  L+ + AG+YVLL +I
Sbjct: 443 LDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADI 502

Query: 453 YSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEI 512
           Y+ A  WD+V  ++KL++ +G+   P    +E+   ++ F   D  +P+ ++I+  L ++
Sbjct: 503 YAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKL 562

Query: 513 NKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAIAFGVLATPPGRTIRIANNIRTC 572
            + +K  GY  +    L+ LE ++K   +  HSEKLA+AFG++ T  G  IRI  N+R C
Sbjct: 563 AEDMKEKGYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITKNLRLC 622

Query: 573 MDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
            DCH F K+IS    ++++VRD +RFH F+ G CSC D+W
Sbjct: 623 EDCHLFTKFISKFMEKEILVRDVNRFHRFKNGVCSCGDYW 657

BLAST of CsGy1G002340 vs. Swiss-Prot
Match: sp|Q9LXY5|PP284_ARATH (Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H80 PE=2 SV=1)

HSP 1 Score: 436.8 bits (1122), Expect = 3.9e-121
Identity = 223/576 (38.72%), Positives = 348/576 (60.42%), Query Frame = 0

Query: 30  LISLIKSCTHKSQLLQIHAHIITTSSVQDPIVSLRFLTRTASAPFRDLGYSRRLFDLL-T 89
           ++ +++ C    +L +IH+H+I       P +    L   A +    L +++ LFD   +
Sbjct: 8   IVRMLQGCNSMKKLRKIHSHVIINGLQHHPSIFNHLLRFCAVSVTGSLSHAQLLFDHFDS 67

Query: 90  NPFVSHYNAMLRAYSLSRSPLEGLYMYRDMERQGV-RADPLSSSFAVKSCIKLLSLLFGI 149
           +P  S +N ++R +S S SPL  +  Y  M    V R D  + +FA+KSC ++ S+   +
Sbjct: 68  DPSTSDWNYLIRGFSNSSSPLNSILFYNRMLLSSVSRPDLFTFNFALKSCERIKSIPKCL 127

Query: 150 QIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNK 209
           +IH  + R+G   D+++ TS++  YS  G  E A K+FDE+P +D+V+WNV+I C +   
Sbjct: 128 EIHGSVIRSGFLDDAIVATSLVRCYSANGSVEIASKVFDEMPVRDLVSWNVMICCFSHVG 187

Query: 210 RTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYNTESNL 269
               AL +++ M +   +C  D  T + LL +CA ++AL  G  +H         +   +
Sbjct: 188 LHNQALSMYKRMGNEG-VC-GDSYTLVALLSSCAHVSALNMGVMLHRIACDIRCESCVFV 247

Query: 270 CNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNGV 329
            N+LI MY++CG ++ A  VF+ M +++V++W++MI G  ++GHG EAI  F +M  +GV
Sbjct: 248 SNALIDMYAKCGSLENAIGVFNGMRKRDVLTWNSMIIGYGVHGHGVEAISFFRKMVASGV 307

Query: 330 EPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIAPNVHHYGCIVDLLGRAGMLDQAY 389
            P   TF  +L  CSH GLV EG+  F+ M  +F + PNV HYGC+VDL GRAG L+ + 
Sbjct: 308 RPNAITFLGLLLGCSHQGLVKEGVEHFEIMSSQFHLTPNVKHYGCMVDLYGRAGQLENSL 367

Query: 390 ELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNIYSSAG 449
           E+I +     D  +WRTLLG+C+IH +  LGE  ++ L++L++  AGDYVL+ +IYS+A 
Sbjct: 368 EMIYASSCHEDPVLWRTLLGSCKIHRNLELGEVAMKKLVQLEAFNAGDYVLMTSIYSAAN 427

Query: 450 NWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEINKQLK 509
           +      +RKL++   + T P  + IE+   VH+F VDD  HP    IY +L E+  +  
Sbjct: 428 DAQAFASMRKLIRSHDLQTVPGWSWIEIGDQVHKFVVDDKMHPESAVIYSELGEVINRAI 487

Query: 510 IAGYEAEMSSE-LHRLEPKDKGYALSNHSEKLAIAFGVLATPPGRTIRIANNIRTCMDCH 569
           +AGY+ E S+     L  +  G A ++HSEKLAIA+G++ T  G T+RI  N+R C DCH
Sbjct: 488 LAGYKPEDSNRTAPTLSDRCLGSADTSHSEKLAIAYGLMRTTAGTTLRITKNLRVCRDCH 547

Query: 570 NFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
           +F KY+S  +NR+++VRDR RFHHF +G CSCND+W
Sbjct: 548 SFTKYVSKAFNREIIVRDRVRFHHFADGICSCNDYW 581

BLAST of CsGy1G002340 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 429.9 bits (1104), Expect = 4.8e-119
Identity = 204/486 (41.98%), Positives = 312/486 (64.20%), Query Frame = 0

Query: 120 RQGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPE 179
           +  VR D  +    V +C +  S+  G Q+H  I  +G  ++  ++ +++DLYS CG+ E
Sbjct: 259 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 318

Query: 180 EACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLF-EIMQSPTYLCQPDKVTCLLLLQ 239
            AC LF+ +P KDV++WN LI   T     ++AL LF E+++S      P+ VT L +L 
Sbjct: 319 TACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGE---TPNDVTMLSILP 378

Query: 240 ACADLNALEFGERIHGYIQQH--GYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNV 299
           ACA L A++ G  IH YI +   G    S+L  SLI MY++CG ++ A++VF+ +  K++
Sbjct: 379 ACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSL 438

Query: 300 VSWSAMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDR 359
            SW+AMI G +M+G    + + F  M+K G++P D TF  +LSACSH G++D G   F  
Sbjct: 439 SSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRT 498

Query: 360 MRQEFMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGN 419
           M Q++ + P + HYGC++DLLG +G+  +A E+I  ME+ PD  +W +LL AC++HG+  
Sbjct: 499 MTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVE 558

Query: 420 LGERIVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELN 479
           LGE   E+LI+++ +  G YVLL NIY+SAG W++V + R L+ +KG+   P C++IE++
Sbjct: 559 LGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEID 618

Query: 480 GVVHQFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSE 539
            VVH+F + D  HP   +IY  L+E+   L+ AG+  + S  L  +E + K  AL +HSE
Sbjct: 619 SVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSE 678

Query: 540 KLAIAFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRC 599
           KLAIAFG+++T PG  + I  N+R C +CH   K IS +Y R+++ RDR+RFHHF++G C
Sbjct: 679 KLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVC 738

Query: 600 SCNDFW 603
           SCND+W
Sbjct: 739 SCNDYW 741

BLAST of CsGy1G002340 vs. Swiss-Prot
Match: sp|B8YEK4|OGR1_ORYSJ (Pentatricopeptide repeat-containing protein OGR1, mitochondrial OS=Oryza sativa subsp. japonica OX=39947 GN=OGR1 PE=2 SV=1)

HSP 1 Score: 425.6 bits (1093), Expect = 9.0e-118
Identity = 237/594 (39.90%), Positives = 342/594 (57.58%), Query Frame = 0

Query: 19  SISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSV-QDPIVSLRFLTRTASAPF-RD 78
           S+S +    E L+  + S  H    LQ HA ++T+  +   P +  RFL R A +P    
Sbjct: 2   SVSAAARHLESLLPRLASLRH---YLQFHARLLTSGHLGAHPGLRARFLDRLALSPHPAA 61

Query: 79  LGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMY--RDMERQGVRADPLSSSFAV 138
           L ++  L   L  P  +  NA LR  + S  P   L +   R +     R D LS SFA+
Sbjct: 62  LPHALLLLRSLPTPATNDLNAALRGLAASPHPARSLLLLAGRLLPALLPRPDALSLSFAL 121

Query: 139 KSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVV 198
           K+  +       +Q+HA + R G  AD  LLT+++D Y+ CG    A K+FDE+  +DV 
Sbjct: 122 KASARCSDAHTTVQLHALVLRLGVAADVRLLTTLLDSYAKCGDLASARKVFDEMTVRDVA 181

Query: 199 AWNVLISCLTRNKRTRDALGLFEIM----QSPTYLCQPDKVTCLLLLQACADLNALEFGE 258
            WN L++ L +      AL LF  +    Q      +P++VT +  L ACA +  L+ G 
Sbjct: 182 TWNSLLAGLAQGTEPNLALALFHRLANSFQELPSREEPNEVTIVAALSACAQIGLLKDGM 241

Query: 259 RIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFD--KMTEKNVVSWSAMISGLSM 318
            +H + ++ G +    +CNSLI MYS+CG + +A +VF   K  ++ +VS++A I   SM
Sbjct: 242 YVHEFAKRFGLDRNVRVCNSLIDMYSKCGSLSRALDVFHSIKPEDQTLVSYNAAIQAHSM 301

Query: 319 NGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIAPNVH 378
           +GHG +A+  F EM    +EP   T+ AVL  C+H GLVD+G+  F+ MR    +APN+ 
Sbjct: 302 HGHGGDALRLFDEMPTR-IEPDGVTYLAVLCGCNHSGLVDDGLRVFNSMR----VAPNMK 361

Query: 379 HYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEHLIEL 438
           HYG IVDLLGRAG L +AY+ ++SM    D  +W+TLLGA ++HG   L E     L EL
Sbjct: 362 HYGTIVDLLGRAGRLTEAYDTVISMPFPADIVLWQTLLGAAKMHGVVELAELAANKLAEL 421

Query: 439 KSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQFAVDDIS 498
            S   GDYVLL N+Y+S   W  V  +R  M+   +   P  +  E++GV+H+F   D  
Sbjct: 422 GSNVDGDYVLLSNVYASKARWMDVGRVRDTMRSNDVRKVPGFSYTEIDGVMHKFINGDKE 481

Query: 499 HPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAIAFGVLATP 558
           HP   +IY+ L++I  ++   GYE E S+ LH +  ++K YAL  HSEKLAIAFG++ATP
Sbjct: 482 HPRWQEIYRALEDIVSRISELGYEPETSNVLHDIGEEEKQYALCYHSEKLAIAFGLIATP 541

Query: 559 PGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
           PG T+R+  N+R C DCH  AK IS  Y R +V+RDR+RFH F++G+CSC D+W
Sbjct: 542 PGETLRVIKNLRICGDCHVVAKLISKAYGRVIVIRDRARFHRFEDGQCSCRDYW 587

BLAST of CsGy1G002340 vs. TrEMBL
Match: tr|A0A0A0LUH9|A0A0A0LUH9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G009750 PE=4 SV=1)

HSP 1 Score: 1233.8 bits (3191), Expect = 0.0e+00
Identity = 599/602 (99.50%), Positives = 600/602 (99.67%), Query Frame = 0

Query: 1   MCVIFRSPSILSLKYRHHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSVQDPI 60
           MCVIFRSPSILSLKY HHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSS+QDPI
Sbjct: 1   MCVIFRSPSILSLKYHHHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSIQDPI 60

Query: 61  VSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMER 120
           VSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMER
Sbjct: 61  VSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMER 120

Query: 121 QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEE 180
           QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIF NGHQADSLLLTSMMDLYSHCGKPEE
Sbjct: 121 QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFINGHQADSLLLTSMMDLYSHCGKPEE 180

Query: 181 ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC 240
           ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC
Sbjct: 181 ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC 240

Query: 241 ADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWS 300
           ADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWS
Sbjct: 241 ADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWS 300

Query: 301 AMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQE 360
           AMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQE
Sbjct: 301 AMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQE 360

Query: 361 FMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGER 420
           FMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGER
Sbjct: 361 FMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGER 420

Query: 421 IVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVH 480
           IVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVH
Sbjct: 421 IVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVH 480

Query: 481 QFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAI 540
           QFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAI
Sbjct: 481 QFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAI 540

Query: 541 AFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCND 600
           AFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCND
Sbjct: 541 AFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCND 600

Query: 601 FW 603
           FW
Sbjct: 601 FW 602

BLAST of CsGy1G002340 vs. TrEMBL
Match: tr|A0A1S3BV40|A0A1S3BV40_CUCME (pentatricopeptide repeat-containing protein At3g47530 OS=Cucumis melo OX=3656 GN=LOC103493993 PE=4 SV=1)

HSP 1 Score: 1143.6 bits (2957), Expect = 0.0e+00
Identity = 561/602 (93.19%), Positives = 570/602 (94.68%), Query Frame = 0

Query: 1   MCVIFRSPSILSLKYRHHSISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSVQDPI 60
           M VIF  PSILS                 LISLIKSCTHKSQLLQIHAHII TSS+QDPI
Sbjct: 1   MYVIFHRPSILS-----------------LISLIKSCTHKSQLLQIHAHIIRTSSIQDPI 60

Query: 61  VSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMER 120
           VSLRFLTRTASAPFRDLGYSRR  DLLTNP VSHYNAMLRAYS+SRSPLEGLY+YRDMER
Sbjct: 61  VSLRFLTRTASAPFRDLGYSRRFLDLLTNPLVSHYNAMLRAYSVSRSPLEGLYVYRDMER 120

Query: 121 QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEE 180
           QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIF  GHQADSLLLTSMMDLYSHCGKPEE
Sbjct: 121 QGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFIYGHQADSLLLTSMMDLYSHCGKPEE 180

Query: 181 ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC 240
           ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC
Sbjct: 181 ACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQAC 240

Query: 241 ADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWS 300
           ADLNALEFGERIHGYIQQH YNTESNLCNSLISMYSRCGR+DKAYEVFDKM EKNVVSWS
Sbjct: 241 ADLNALEFGERIHGYIQQHCYNTESNLCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWS 300

Query: 301 AMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQE 360
           AMISGLSMNGHGREAIEAFWEMQKNGVEP DHTFTAVLSACSHCGLVDEGMAFFDRMRQE
Sbjct: 301 AMISGLSMNGHGREAIEAFWEMQKNGVEPDDHTFTAVLSACSHCGLVDEGMAFFDRMRQE 360

Query: 361 FMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGER 420
            MIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGH NLGER
Sbjct: 361 LMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHANLGER 420

Query: 421 IVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVH 480
           IVEHLIELKSQEAGDYVLLLNIYSSAG WDKVTELRKLMKEKGIYTTPCCTTIELNGVVH
Sbjct: 421 IVEHLIELKSQEAGDYVLLLNIYSSAGKWDKVTELRKLMKEKGIYTTPCCTTIELNGVVH 480

Query: 481 QFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAI 540
           +FAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRL+P+DKGYALSNHSEKLAI
Sbjct: 481 EFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLKPEDKGYALSNHSEKLAI 540

Query: 541 AFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCND 600
           AFGVLATPPGRTIR+ANNIRTCMDCHNFAKYISSVYNRKVV+RDRSRFHHFQEGRCSCND
Sbjct: 541 AFGVLATPPGRTIRVANNIRTCMDCHNFAKYISSVYNRKVVLRDRSRFHHFQEGRCSCND 585

Query: 601 FW 603
           FW
Sbjct: 601 FW 585

BLAST of CsGy1G002340 vs. TrEMBL
Match: tr|A0A2I4G1Y4|A0A2I4G1Y4_9ROSI (pentatricopeptide repeat-containing protein At3g47530 OS=Juglans regia OX=51240 GN=LOC109004001 PE=4 SV=1)

HSP 1 Score: 876.7 bits (2264), Expect = 3.0e-251
Identity = 412/577 (71.40%), Positives = 484/577 (83.88%), Query Frame = 0

Query: 26  EREPLISLIKSCTHKSQLLQIHAHIITTSSVQDPIVSLRFLTRTASAPFRDLGYSRRLFD 85
           ER  L SLIKSCT K+ LLQIHAH++ T  +QDP +SL FL+R A +P RD+ YSR+ F 
Sbjct: 30  ERRQLPSLIKSCTQKTHLLQIHAHLVCTGLLQDPTISLIFLSRLALSPARDVDYSRQFFT 89

Query: 86  LLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIKLLSLLF 145
            +++P   HYN M+RAYS+S SPLEG YMYR+M+RQ VR +PLSSSFA+KSCIKL SLL 
Sbjct: 90  QISDPLTFHYNTMIRAYSMSNSPLEGFYMYREMKRQSVRVNPLSSSFAIKSCIKLSSLLG 149

Query: 146 GIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTR 205
           G+Q+HARI  +GHQ+DSLLLT++MDLYS C + +EACK+FD++  +D VAWNVLISC  R
Sbjct: 150 GVQVHARILTDGHQSDSLLLTNLMDLYSCCERCDEACKVFDDIRDRDTVAWNVLISCCMR 209

Query: 206 NKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYNTES 265
           N RTRDA+GLF+IMQS +  C+PD VTCLLLLQACA LNALEFGE+IH YI QHGY T  
Sbjct: 210 NNRTRDAMGLFDIMQSGSDGCEPDDVTCLLLLQACAHLNALEFGEKIHAYIGQHGYGTAG 269

Query: 266 NLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKN 325
            LCNSLI+MYSRCG ++KAY VF  M  KNVVSWSAMISGL+MNGHGREAIEAFWEMQK 
Sbjct: 270 KLCNSLIAMYSRCGCLEKAYGVFKGMRNKNVVSWSAMISGLAMNGHGREAIEAFWEMQKL 329

Query: 326 GVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIAPNVHHYGCIVDLLGRAGMLDQ 385
           G+ P D TFT VLSACSHCGLVDEGM FFD M +EF IAPN  HYGC+VDLLGRAG+LDQ
Sbjct: 330 GIPPDDQTFTGVLSACSHCGLVDEGMMFFDLMSKEFGIAPNTRHYGCVVDLLGRAGLLDQ 389

Query: 386 AYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNIYSS 445
           AY+LI+SM V+PD+TMWRTLLGACRIHGH  LGER+V HLIELK+QEAGDY LLLNIYSS
Sbjct: 390 AYQLILSMSVKPDSTMWRTLLGACRIHGHVTLGERVVGHLIELKAQEAGDYALLLNIYSS 449

Query: 446 AGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEINKQ 505
           AG WDKV E+RK M+EK I TTP C+TI L GVVH+F VDD+SHP K +IY+ L+EIN+Q
Sbjct: 450 AGKWDKVMEVRKFMQEKAIQTTPGCSTIVLKGVVHEFVVDDVSHPRKGEIYEMLNEINQQ 509

Query: 506 LKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLAIAFGVLATPPGRTIRIANNIRTCMDC 565
           LKIAGY AE+S+ELH L  ++KG ALS HSEKLAIAFGVL+TPPG TIR+A N+RTC+DC
Sbjct: 510 LKIAGYVAEVSAELHNLGAEEKGDALSYHSEKLAIAFGVLSTPPGTTIRVAKNLRTCIDC 569

Query: 566 HNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
           HNFAK +S VYNR+V+VRDR+RFHHF+EGRCSCND+W
Sbjct: 570 HNFAKVLSGVYNREVIVRDRTRFHHFKEGRCSCNDYW 606

BLAST of CsGy1G002340 vs. TrEMBL
Match: tr|A0A251PAT0|A0A251PAT0_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G194000 PE=4 SV=1)

HSP 1 Score: 857.4 bits (2214), Expect = 1.9e-245
Identity = 400/603 (66.33%), Positives = 490/603 (81.26%), Query Frame = 0

Query: 9   SILSLKYRHH---------SISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSVQDP 68
           S LS    HH         +IS +   ++ L+ LIKSCT +S LLQIHAHI+ TS V +P
Sbjct: 11  SSLSSSQSHHPNVPVCFTTNISHTQTPKQSLLDLIKSCTRRSHLLQIHAHIVRTSLVLEP 70

Query: 69  IVSLRFLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDME 128
            + L+FL+    +P + + YSRR FD +  P    YN M+RAYS+S SP EG  MYRD+ 
Sbjct: 71  TICLQFLSLVGLSPLKSISYSRRFFDQIAKPTAFQYNTMVRAYSISDSPEEGFSMYRDLL 130

Query: 129 RQGVRADPLSSSFAVKSCIKLLSLLFGIQIHARIFRNGHQADSLLLTSMMDLYSHCGKPE 188
           R+G+RAD L+SSF +KSCI++ SLL GIQ+HARI R GH++DS LLT++MDLYS CGK +
Sbjct: 131 RRGLRADALASSFVIKSCIRVSSLLGGIQVHARILRGGHESDSRLLTTLMDLYSICGKCD 190

Query: 189 EACKLFDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQA 248
           EACKLFDE+P++DVVAWNVLISC   N RTRDA+ LF+IM+S T+ C+PD+VTCLL+LQA
Sbjct: 191 EACKLFDEMPKRDVVAWNVLISCCLHNNRTRDAVSLFDIMRSETHRCEPDEVTCLLMLQA 250

Query: 249 CADLNALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSW 308
           C++LNALEFGER+H YI++HGY+  SNLCNSLI+MYSRCG +DKAYEVF  M +KNVVSW
Sbjct: 251 CSNLNALEFGERVHKYIEEHGYDGASNLCNSLIAMYSRCGCLDKAYEVFKGMKDKNVVSW 310

Query: 309 SAMISGLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQ 368
           SAMISGL++NG+GREAIEAF EMQK GV P D TFT VL ACSHCGLVDEGM FFDRM +
Sbjct: 311 SAMISGLAVNGYGREAIEAFGEMQKMGVLPDDQTFTGVLCACSHCGLVDEGMVFFDRMSK 370

Query: 369 EFMIAPNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGE 428
           +F + PN+HHYGC+VDLLGRAG LDQAY+LI+SM+++PD+T+WRTLLG CRIHGH  L E
Sbjct: 371 DFGVVPNIHHYGCMVDLLGRAGRLDQAYQLILSMDIKPDSTIWRTLLGGCRIHGHDALAE 430

Query: 429 RIVEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKGIYTTPCCTTIELNGVV 488
            ++ HLIELK+QEAGDYVLL+NIYSSAGNW+K+TE+RK MKEK I TTP C+TIEL GV 
Sbjct: 431 SVIGHLIELKAQEAGDYVLLMNIYSSAGNWEKLTEVRKFMKEKAIQTTPGCSTIELKGVA 490

Query: 489 HQFAVDDISHPMKDKIYKQLDEINKQLKIAGYEAEMSSELHRLEPKDKGYALSNHSEKLA 548
           H+F VDD+SHP KD+IY  LDEIN QLKIAGY A++SSELH L  ++KG+ALS HSEKLA
Sbjct: 491 HEFVVDDVSHPRKDEIYNMLDEINSQLKIAGYVADVSSELHNLGTEEKGHALSYHSEKLA 550

Query: 549 IAFGVLATPPGRTIRIANNIRTCMDCHNFAKYISSVYNRKVVVRDRSRFHHFQEGRCSCN 603
           IAFGVLATPPG  IR+A N+R C+DCHNFA  +S VYNR+V++RDR+RFHHF+EGRCSCN
Sbjct: 551 IAFGVLATPPGTPIRVAKNLRICVDCHNFAMVLSGVYNREVIIRDRTRFHHFREGRCSCN 610

BLAST of CsGy1G002340 vs. TrEMBL
Match: tr|A0A067K583|A0A067K583_JATCU (Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_11633 PE=4 SV=1)

HSP 1 Score: 846.7 bits (2186), Expect = 3.3e-242
Identity = 391/573 (68.24%), Positives = 476/573 (83.07%), Query Frame = 0

Query: 30  LISLIKSCTHKSQLLQIHAHIITTSSVQDPIVSLRFLTRTASAPFRDLGYSRRLFDLLTN 89
           LISLIKSCT KS LLQIHA +I T   Q P +S  FL+R A +PF+D+ YSR++F    N
Sbjct: 31  LISLIKSCTQKSHLLQIHAQLIRTFLFQKPTISFPFLSRVALSPFQDVSYSRQIFSQTPN 90

Query: 90  PFVSHYNAMLRAYSLSRSPLEGLYMYRDMERQGVRADPLSSSFAVKSCIKLLSLLFGIQI 149
           P V HYN M+R YS S SP+EG YMY+ M ++G+RADP+S SFAVK  +K+ SL+ G+Q+
Sbjct: 91  PSVFHYNTMIRVYSSSNSPIEGFYMYQQMRKRGLRADPISLSFAVKCFVKVCSLVGGVQV 150

Query: 150 HARIFRNGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNKRT 209
           HARI R+GHQ+DSLLLT++MDLYS C K  EA K+FD++PQ+D VAWNVLISC  RN RT
Sbjct: 151 HARILRDGHQSDSLLLTNLMDLYSICQKGNEAYKVFDDIPQRDTVAWNVLISCYIRNHRT 210

Query: 210 RDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHGYIQQHGYNTESNLCN 269
           +D LG+F+ + S  + C+PD VTCLLLLQAC++L ALEFGE++HGY+++HGY    NLCN
Sbjct: 211 KDVLGVFDHLASGDFGCEPDDVTCLLLLQACSNLGALEFGEKVHGYVEEHGYGDAMNLCN 270

Query: 270 SLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNGVEP 329
           SLI+MYSRCG +DKAY VF +M +KNVV+WSAMISG +MNGHGREAI+AF EMQ+ G+ P
Sbjct: 271 SLIAMYSRCGCLDKAYNVFKRMPKKNVVTWSAMISGFAMNGHGREAIQAFEEMQRIGILP 330

Query: 330 GDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIAPNVHHYGCIVDLLGRAGMLDQAYEL 389
            D TFT VLSACSHCGLVDEGM FF+RM +EF IAPNVHHYGCIVDLLGRA  LD+AY+L
Sbjct: 331 DDQTFTGVLSACSHCGLVDEGMLFFERMNREFGIAPNVHHYGCIVDLLGRASQLDRAYQL 390

Query: 390 IMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEHLIELKSQEAGDYVLLLNIYSSAGNW 449
           IMSM+V+PD+T+WRTLLGACRIH H  L ER+V HLIELK+QEAGDY+LLLNIYSS GNW
Sbjct: 391 IMSMKVKPDSTIWRTLLGACRIHRHVMLAERVVGHLIELKAQEAGDYILLLNIYSSVGNW 450

Query: 450 DKVTELRKLMKEKGIYTTPCCTTIELNGVVHQFAVDDISHPMKDKIYKQLDEINKQLKIA 509
           +KVTELR+ MKEKGI TTP C++IELNG VH+F VDD+SHP KD+IY+ LDEIN QLKIA
Sbjct: 451 EKVTELRRFMKEKGIQTTPGCSSIELNGQVHEFVVDDVSHPRKDEIYEMLDEINTQLKIA 510

Query: 510 GYEAEMSSELHRLEPKDKGYALSNHSEKLAIAFGVLATPPGRTIRIANNIRTCMDCHNFA 569
           GY  E++SELH L  ++K Y LS HSEKLAIAFG+L+TPPG TIR+A N+RTC+DCHNFA
Sbjct: 511 GYVVEITSELHNLGAEEKQYVLSYHSEKLAIAFGILSTPPGTTIRVAKNLRTCVDCHNFA 570

Query: 570 KYISSVYNRKVVVRDRSRFHHFQEGRCSCNDFW 603
           K++S VYNR+V +RDR+RFHHF+EG CSCND+W
Sbjct: 571 KFVSGVYNRQVNIRDRTRFHHFREGSCSCNDYW 603

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_011660092.1  0.0e+00   99.50         PREDICTED: pentatricopeptide repeat-containing protein At3g47530 [Cucumis sativu...
XP_008453206.1  0.0e+00   93.19         PREDICTED: pentatricopeptide repeat-containing protein At3g47530 [Cucumis melo]
XP_023515406.1  0.0e+00   88.33         pentatricopeptide repeat-containing protein At3g47530 [Cucurbita pepo subsp. pep...
XP_022921651.1  0.0e+00   88.00         pentatricopeptide repeat-containing protein At3g47530 [Cucurbita moschata]
XP_022987181.1  0.0e+00   87.50         pentatricopeptide repeat-containing protein At3g47530 [Cucurbita maxima]

Match Name   E-value   Identity (%)  Description
AT3G47530.1  1.5e-211  61.94         Pentatricopeptide repeat (PPR) superfamily protein
AT3G46790.1  1.3e-122  37.93         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G56550.1  2.2e-122  38.72         Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1  2.6e-120  41.98         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G34160.1  4.2e-118  37.63         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name             E-value   Identity (%)  Description
sp|Q9SN85|PP267_ARATH  2.6e-210  61.94         Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX...
sp|Q9STF3|PP265_ARATH  2.3e-121  37.93         Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop...
sp|Q9LXY5|PP284_ARATH  3.9e-121  38.72         Pentatricopeptide repeat-containing protein At3g56550 OS=Arabidopsis thaliana OX...
sp|Q9LN01|PPR21_ARATH  4.8e-119  41.98         Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
sp|B8YEK4|OGR1_ORYSJ   9.0e-118  39.90         Pentatricopeptide repeat-containing protein OGR1, mitochondrial OS=Oryza sativa ...

Match Name                      E-value   Identity (%)  Description
tr|A0A0A0LUH9|A0A0A0LUH9_CUCSA  0.0e+00   99.50         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G009750 PE=4 SV=1
tr|A0A1S3BV40|A0A1S3BV40_CUCME  0.0e+00   93.19         pentatricopeptide repeat-containing protein At3g47530 OS=Cucumis melo OX=3656 GN...
tr|A0A2I4G1Y4|A0A2I4G1Y4_9ROSI  3.0e-251  71.40         pentatricopeptide repeat-containing protein At3g47530 OS=Juglans regia OX=51240 ...
tr|A0A251PAT0|A0A251PAT0_PRUPE  1.9e-245  66.33         Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_5G194000 PE=4 SV=1
tr|A0A067K583|A0A067K583_JATCU  3.3e-242  68.24         Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_11633 PE=4 SV=1

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO:0008270  zinc ion binding

Vocabulary: INTERPRO
Term       Definition
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
IPR032867  DYW_dom

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy1G002340.1  CsGy1G002340.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR032867  DYW domain                                         PFAM     PF14432           DYW_deaminase        coord: 469..592; e-value: 3.4E-34; score: 117.4
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 191..241; e-value: 2.2E-8; score: 34.0
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 294..342; e-value: 4.8E-13; score: 49.0
IPR002885  Pentatricopeptide repeat                           PFAM     PF12854           PPR_1                coord: 269..292; e-value: 4.1E-6; score: 26.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 95..126; e-value: 1.8E-5; score: 22.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 194..221; e-value: 9.3E-5; score: 20.4
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 267..296; e-value: 4.1E-7; score: 27.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 436..465; e-value: 0.0025; score: 15.9
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 332..360; e-value: 2.1E-4; score: 19.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 297..330; e-value: 1.0E-5; score: 23.4
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 166..193; e-value: 0.0026; score: 15.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 94..123; e-value: 0.0025; score: 17.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 436..464; e-value: 0.0019; score: 18.2
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 369..394; e-value: 0.17; score: 12.1
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 161..195; score: 9.668
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 229..263; score: 6.467
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 264..294; score: 10.326
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 398..428; score: 5.338
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 366..396; score: 7.783
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 330..360; score: 8.802
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 432..466; score: 8.32
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 91..125; score: 8.802
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 295..329; score: 12.047
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 196..226; score: 5.831
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 93..243; e-value: 2.2E-25; score: 91.6
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 361..536; e-value: 1.7E-16; score: 62.7
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 244..360; e-value: 7.1E-31; score: 109.9
None       No IPR available                                   PANTHER  PTHR24015:SF1054  SUBFAMILY NOT NAMED  coord: 58..519
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 58..519
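
Each alignment entry gives a start and end position on the protein. The sketch below turns a `coord:` entry into a span length. It assumes the coordinates are 1-based and inclusive (the usual convention for protein domain hits, but not stated on this page), and the helper name `span_length` is hypothetical.

```python
import re

# Extracts "start..end" from an alignment entry such as "coord: 469..592".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def span_length(entry):
    """Length in residues of one alignment, assuming 1-based inclusive coordinates."""
    m = COORD.search(entry)
    if m is None:
        raise ValueError(f"no coordinates found in {entry!r}")
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1

# DYW domain hit (PF14432) and the PANTHER family hit from the table above.
dyw_len = span_length("coord: 469..592; e-value: 3.4E-34; score: 117.4")
panther_len = span_length("coord: 58..519")
print(dyw_len, panther_len)
```

Under that assumption the DYW domain hit covers 124 residues near the C-terminus, and the PANTHER family hit covers 462 residues.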

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene          Organism                      Block
CsGy1G002340  Cucumber (Chinese Long) v3    cgybcucB004
CsGy1G002340  Cucumber (Chinese Long) v3    cgybcucB039
CsGy1G002340  Watermelon (97103) v2         cgybwmbB054
CsGy1G002340  Watermelon (97103) v2         cgybwmbB058
CsGy1G002340  Watermelon (97103) v2         cgybwmbB088
CsGy1G002340  Wax gourd                     cgybwgoB009
CsGy1G002340  Wax gourd                     cgybwgoB050
CsGy1G002340  Wax gourd                     cgybwgoB067
CsGy1G002340  Melon (DHL92) v3.6.1          cgybmedB043
CsGy1G002340  Cucumber (Gy14) v2            cgybcgybB002
CsGy1G002340  Cucumber (Gy14) v2            cgybcgybB029
CsGy1G002340  Cucurbita maxima (Rimu)       cgybcmaB040
CsGy1G002340  Cucurbita maxima (Rimu)       cgybcmaB103
CsGy1G002340  Cucurbita maxima (Rimu)       cgybcmaB106
CsGy1G002340  Cucurbita maxima (Rimu)       cgybcmaB130
CsGy1G002340  Cucurbita moschata (Rifu)     cgybcmoB028
CsGy1G002340  Cucurbita moschata (Rifu)     cgybcmoB100
CsGy1G002340  Cucurbita moschata (Rifu)     cgybcmoB119
CsGy1G002340  Cucurbita pepo (Zucchini)     cgybcpeB014
CsGy1G002340  Cucurbita pepo (Zucchini)     cgybcpeB061
CsGy1G002340  Cucurbita pepo (Zucchini)     cgybcpeB101
CsGy1G002340  Cucurbita pepo (Zucchini)     cgybcpeB108
CsGy1G002340  Cucurbita pepo (Zucchini)     cgybcpeB126
CsGy1G002340  Cucumber (Chinese Long) v2    cgybcuB004
CsGy1G002340  Bottle gourd (USVL1VR-Ls)     cgyblsiB021
CsGy1G002340  Bottle gourd (USVL1VR-Ls)     cgyblsiB040
CsGy1G002340  Melon (DHL92) v3.5.1          cgybmeB007
CsGy1G002340  Melon (DHL92) v3.5.1          cgybmeB024
CsGy1G002340  Melon (DHL92) v3.5.1          cgybmeB048
CsGy1G002340  Melon (DHL92) v3.6.1          cgybmedB005
CsGy1G002340  Melon (DHL92) v3.6.1          cgybmedB024
CsGy1G002340  Watermelon (Charleston Gray)  cgybwcgB003
CsGy1G002340  Watermelon (Charleston Gray)  cgybwcgB054
CsGy1G002340  Watermelon (Charleston Gray)  cgybwcgB060
CsGy1G002340  Watermelon (97103) v1         cgybwmB004
CsGy1G002340  Watermelon (97103) v1         cgybwmB065
CsGy1G002340  Wild cucumber (PI 183967)     cgybcpiB003
CsGy1G002340  Wild cucumber (PI 183967)     cgybcpiB031
CsGy1G002340  Silver-seed gourd             carcgybB0053
CsGy1G002340  Silver-seed gourd             carcgybB0611