CsGy1G031790 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy1G031790
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Pentatricopeptide repeat-containing protein
Location: Gy14Chr1: 30826625 .. 30828858 (+)
RNA-Seq Expression: CsGy1G031790
Synteny: CsGy1G031790
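
The location field can be split into chromosome, coordinates, and strand programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for a location such as "Gy14Chr1: 30826625 .. 30828858 (+)".
LOCATION = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into chromosome, start, end, strand, and length."""
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Gy14Chr1: 30826625 .. 30828858 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the feature spans 2234 bp on the plus strand.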
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GCGATATCTCTCGGTTCTTCTTAAGAGAGAAAAGGTATTGGATTATCTATGAGGTGTCCAAGTGCGGACATACCTCGTGGATTTCTTATCCAAAGGGCTAATGCTTTTACAATTGAATCCTTTCAATCTTTTGTTGCTCGTTTCTCTCAGCACTAGATAGACAAAAGGAGCGAAACTCGTTTGCAAATTCTGTTCAAAGTTTAAACAAAGGACGATGCTCCATCTCCAACACCACCTAATGGCATAGCAAGCGTCATCAACGAGCATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAATCCATTAGCGAAGCCAACAATTCCCCAATCACGTTCAGATTCCATCCCCGCTTGCAGATTTTCAAACAAAACCCATCTCAGAAATGTCACTTCTTCTGCTGAATTTAGACAACCCCATTTCCCCAATCTCGATAACAGAGATGCTCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAGTCCCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACTAAACTCATTAAAGGGTTTTTTAATTCGAGGAATTTAAAGAAAGCTATGAGAGTTATGGAGATTTTGGAAACCTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATCAGTGGGTTTAGTAAAGCCAACCAAATTGATTCTGCAAACCAGGTGTTTGATAGAATGCGCAGCAGGGGTTTTTCTCCTGATGTCGTTACTTACAATATAATGATTGGGAGTTTGTGTAGTAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAGTTGGTGTCGAGGGGCCTCCGTCCTGACTTGTATACATACAATGCCATCATTCGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCTTGGATTTTGTTCGACATTTATCAGCTAGAGGGTGTAATCCAGATGTGGTATCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCCGGTGGGAAGATGGGGAGAGGCTTATGAAAGACATGGTCCTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTTTGTCGCGAAGGGAGAGTAAGGGAAGCCGTGAATGTGTTGGAGGTGATGAAGGAGAAAGGGTTAACACCAGATTCATATAGCTATGATCCACTGATTTCCGCCTTCTGCAAAGAAGGGAGATTGGATTTAGCAATTGAGTATTTGGAAAAAATGGTTTCTGATGGTTGTTTGCCCGATATTGTAAACTACAATACAATTTTGGCTACACTTTGTAAATTTGGTTGTGCTGATCTTGCTTTAGACGTCTTTGAGAAGCTGGATGAAGTGGGTTGCCCTCCAACTGTGAGGGCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAACAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCCGATGAGATAACATACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTTGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGGTTCCAGCCGACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAATGAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCCGGTTGGCGAGCAGAGGCTATGGAGTTAGCCAACAGTCTGTACAGATTGGGAGTTATTTCTGGAGATTCTTCCAAGCGTTTGAACAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCT
TGCAAAGCTGATGTTGACTATAGAATTTTCAACTTTTGGCTATTTTGTTTTGTTTTCTTTGTTTTTTTTTTTTTTTTTTTTGTGCAATTTCATTAACCCACCAATTGACTTTTAATTTTTCTTTTATTTTTCCTGAAAAAAACAAAGTCTGCTAATAAATAAGACTAGGGAAAGAAGAAAAAAACATTATTAAATTAATTTATTGAATGCCAAAGCAGCTTTTAGTTGGGGATC

mRNA sequence

GCGATATCTCTCGGTTCTTCTTAAGAGAGAAAAGGTATTGGATTATCTATGAGGTGTCCAAGTGCGGACATACCTCGTGGATTTCTTATCCAAAGGGCTAATGCTTTTACAATTGAATCCTTTCAATCTTTTGTTGCTCGTTTCTCTCAGCACTAGATAGACAAAAGGAGCGAAACTCGTTTGCAAATTCTGTTCAAAGTTTAAACAAAGGACGATGCTCCATCTCCAACACCACCTAATGGCATAGCAAGCGTCATCAACGAGCATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAATCCATTAGCGAAGCCAACAATTCCCCAATCACGTTCAGATTCCATCCCCGCTTGCAGATTTTCAAACAAAACCCATCTCAGAAATGTCACTTCTTCTGCTGAATTTAGACAACCCCATTTCCCCAATCTCGATAACAGAGATGCTCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAGTCCCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACTAAACTCATTAAAGGGTTTTTTAATTCGAGGAATTTAAAGAAAGCTATGAGAGTTATGGAGATTTTGGAAACCTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATCAGTGGGTTTAGTAAAGCCAACCAAATTGATTCTGCAAACCAGGTGTTTGATAGAATGCGCAGCAGGGGTTTTTCTCCTGATGTCGTTACTTACAATATAATGATTGGGAGTTTGTGTAGTAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAGTTGGTGTCGAGGGGCCTCCGTCCTGACTTGTATACATACAATGCCATCATTCGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCTTGGATTTTGTTCGACATTTATCAGCTAGAGGGTGTAATCCAGATGTGGTATCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCCGGTGGGAAGATGGGGAGAGGCTTATGAAAGACATGGTCCTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTTTGTCGCGAAGGGAGAGTAAGGGAAGCCGTGAATGTGTTGGAGGTGATGAAGGAGAAAGGGTTAACACCAGATTCATATAGCTATGATCCACTGATTTCCGCCTTCTGCAAAGAAGGGAGATTGGATTTAGCAATTGAGTATTTGGAAAAAATGGTTTCTGATGGTTGTTTGCCCGATATTGTAAACTACAATACAATTTTGGCTACACTTTGTAAATTTGGTTGTGCTGATCTTGCTTTAGACGTCTTTGAGAAGCTGGATGAAGTGGGTTGCCCTCCAACTGTGAGGGCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAACAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCCGATGAGATAACATACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTTGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGGTTCCAGCCGACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAATGAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCCGGTTGGCGAGCAGAGGCTATGGAGTTAGCCAACAGTCTGTACAGATTGGGAGTTATTTCTGGAGATTCTTCCAAGCGTTTGAACAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCT
TGCAAAGCTGATGTTGACTATAGAATTTTCAACTTTTGGCTATTTTGTTTTGTTTTCTTTGTTTTTTTTTTTTTTTTTTTTGTGCAATTTCATTAACCCACCAATTGACTTTTAATTTTTCTTTTATTTTTCCTGAAAAAAACAAAGTCTGCTAATAAATAAGACTAGGGAAAGAAGAAAAAAACATTATTAAATTAATTTATTGAATGCCAAAGCAGCTTTTAGTTGGGGATC

Coding sequence (CDS)

ATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAATCCATTAGCGAAGCCAACAATTCCCCAATCACGTTCAGATTCCATCCCCGCTTGCAGATTTTCAAACAAAACCCATCTCAGAAATGTCACTTCTTCTGCTGAATTTAGACAACCCCATTTCCCCAATCTCGATAACAGAGATGCTCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAGTCCCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACTAAACTCATTAAAGGGTTTTTTAATTCGAGGAATTTAAAGAAAGCTATGAGAGTTATGGAGATTTTGGAAACCTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATCAGTGGGTTTAGTAAAGCCAACCAAATTGATTCTGCAAACCAGGTGTTTGATAGAATGCGCAGCAGGGGTTTTTCTCCTGATGTCGTTACTTACAATATAATGATTGGGAGTTTGTGTAGTAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAGTTGGTGTCGAGGGGCCTCCGTCCTGACTTGTATACATACAATGCCATCATTCGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCTTGGATTTTGTTCGACATTTATCAGCTAGAGGGTGTAATCCAGATGTGGTATCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCCGGTGGGAAGATGGGGAGAGGCTTATGAAAGACATGGTCCTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTTTGTCGCGAAGGGAGAGTAAGGGAAGCCGTGAATGTGTTGGAGGTGATGAAGGAGAAAGGGTTAACACCAGATTCATATAGCTATGATCCACTGATTTCCGCCTTCTGCAAAGAAGGGAGATTGGATTTAGCAATTGAGTATTTGGAAAAAATGGTTTCTGATGGTTGTTTGCCCGATATTGTAAACTACAATACAATTTTGGCTACACTTTGTAAATTTGGTTGTGCTGATCTTGCTTTAGACGTCTTTGAGAAGCTGGATGAAGTGGGTTGCCCTCCAACTGTGAGGGCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAACAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCCGATGAGATAACATACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTTGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGGTTCCAGCCGACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAATGAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCCGGTTGGCGAGCAGAGGCTATGGAGTTAGCCAACAGTCTGTACAGATTGGGAGTTATTTCTGGAGATTCTTCCAAGCGTTTGAACAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCTTGCAAAGCTGA

Protein sequence

MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS*
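
The protein sequence above corresponds to a translation of the CDS. As a rough consistency check, the sketch below translates short excerpts from the two ends of the CDS using the standard genetic code. The helper `translate` is a hypothetical name, and the use of the standard code is an assumption, though a reasonable one for a nuclear-encoded plant gene.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 and last 18 nucleotides of the CDS listed above.
cds_start = "ATGTTTTCATCGGAATTTCTCCCTCAGAGC"
cds_end = "CAACTCTTGCAAAGCTGA"
print(translate(cds_start))  # MFSSEFLPQS
print(translate(cds_end))    # QLLQS*
```

Both excerpts agree with the start (`MFSSEFLPQS`) and end (`QLLQS*`) of the listed protein sequence.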
Homology
BLAST of CsGy1G031790 vs. ExPASy Swiss-Prot
Match: Q9SR00 (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 734.6 bits (1895), Expect = 8.9e-211
Identity = 348/555 (62.70%), Positives = 444/555 (80.00%), Query Frame = 0

Query: 11  LHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDNRDAHLMKLLN 70
           L F+N  + P     RS S    R    T   + T   E RQ H  +L  RD  ++K+ +
Sbjct: 40  LTFSN--SNPNNDNGRSFSSSGARNLQTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFH 99

Query: 71  RSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDPD 130
           RSCR+G + ESL+ LE++V KG+ PDV+LCTKLIKGFF  RN+ KA+RVMEILE +G PD
Sbjct: 100 RSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPD 159

Query: 131 VYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMD 190
           V++YNA+I+GF K N+ID A +V DRMRS+ FSPD VTYNIMIGSLCSRGKL+LA +V++
Sbjct: 160 VFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLN 219

Query: 191 ELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEG 250
           +LL D C+P+VITYTILIEAT+LEG ++EAL+L DE++SRGL+PD++TYN IIRG+CKEG
Sbjct: 220 QLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEG 279

Query: 251 MEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHS 310
           M DRA + VR+L  +GC PDV+SYNILLR+ LN+ +WE+GE+LM  M    C+PNVVT+S
Sbjct: 280 MVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYS 339

Query: 311 ILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSD 370
           ILI++ CR+G++ EA+N+L++MKEKGLTPD+YSYDPLI+AFC+EGRLD+AIE+LE M+SD
Sbjct: 340 ILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISD 399

Query: 371 GCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKA 430
           GCLPDIVNYNT+LATLCK G AD AL++F KL EVGC P   +YNTMFSALWS G+KI+A
Sbjct: 400 GCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRA 459

Query: 431 LEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIVLLG 490
           L MI EM+  GIDPDEITYNS+ISCLCR+G+VDEA  LLVDM +  F P+V+++NIVLLG
Sbjct: 460 LHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLG 519

Query: 491 MCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGVISG 550
            CKAHR+ + I +L +MV  GC PNET+Y +LIEGI +AG+RAEAMELAN L R+  IS 
Sbjct: 520 FCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISE 579

Query: 551 DSSKRLNKTFPMLDV 566
            S KRL++TFP+L+V
Sbjct: 580 YSFKRLHRTFPLLNV 592
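
The percentages in each HSP summary line are the identical or positive-scoring positions divided by the alignment length. The sketch below recomputes them from a summary line; `hsp_percentages` is a hypothetical helper, and the pattern matches the numeric fractions rather than the label text.

```python
import re

# Captures the first two "= matches/length" fractions on an HSP summary line.
STATS = re.compile(r"Identity = (\d+)/(\d+).*?= (\d+)/(\d+)")

def hsp_percentages(stats_line):
    """Return (identity %, positives %) recomputed from the raw fractions."""
    match = STATS.search(stats_line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {stats_line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))

line = "Identity = 348/555 (62.70%), Positives = 444/555 (80.00%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Q9SR00 hit this reproduces the reported 62.70% identity and 80.00% positives over 555 aligned positions.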

BLAST of CsGy1G031790 vs. ExPASy Swiss-Prot
Match: Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 464.2 bits (1193), Expect = 2.2e-129
Identity = 223/489 (45.60%), Positives = 319/489 (65.24%), Query Frame = 0

Query: 69  LNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGD 128
           L +  R G+  E   FLE++V  G  PD++ CT LI+GF      +KA +++EILE  G 
Sbjct: 109 LRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGA 168

Query: 129 -PDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFE 188
            PDV +YN MISG+ KA +I++A  V DRM     SPDVVTYN ++ SLC  GKL+ A E
Sbjct: 169 VPDVITYNVMISGYCKAGEINNALSVLDRM---SVSPDVVTYNTILRSLCDSGKLKQAME 228

Query: 189 VMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGIC 248
           V+D +L+  C P VITYTILIEAT  +  +  A++L DE+  RG  PD+ TYN ++ GIC
Sbjct: 229 VLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGIC 288

Query: 249 KEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVV 308
           KEG  D A+ F+  + + GC P+V+++NI+LRS  +  RW D E+L+ DM+  G  P+VV
Sbjct: 289 KEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVV 348

Query: 309 THSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKM 368
           T +ILI+  CR+G +  A+++LE M + G  P+S SY+PL+  FCKE ++D AIEYLE+M
Sbjct: 349 TFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERM 408

Query: 369 VSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNK 428
           VS GC PDIV YNT+L  LCK G  + A+++  +L   GC P +  YNT+   L   G  
Sbjct: 409 VSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKT 468

Query: 429 IKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIV 488
            KA++++ EM  K + PD ITY+SL+  L R+G VDEAI    + E    +P  ++FN +
Sbjct: 469 GKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSI 528

Query: 489 LLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGV 548
           +LG+CK+ +    I+ L+ M+ +GC PNETSY +LIEG+AY G   EA+EL N L   G+
Sbjct: 529 MLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGL 588

Query: 549 ISGDSSKRL 557
           +   S++++
Sbjct: 589 MKKSSAEQV 594

BLAST of CsGy1G031790 vs. ExPASy Swiss-Prot
Match: A3KPF8 (Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g79080 PE=2 SV=1)

HSP 1 Score: 304.3 bits (778), Expect = 3.0e-81
Identity = 170/487 (34.91%), Positives = 275/487 (56.47%), Query Frame = 0

Query: 79  NESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG-DPDVYSYNAM 138
           ++S   LES+V+ G KP+V   T+L+     +  LKKA+RV+E++ + G  PD  +Y  +
Sbjct: 88  SDSFSHLESLVTGGHKPNVAHSTQLLYDLCKANRLKKAIRVIELMVSSGIIPDASAYTYL 147

Query: 139 ISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGC 198
           ++   K   +  A Q+ ++M   G+  + VTYN ++  LC  G L  + + ++ L++ G 
Sbjct: 148 VNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTYNALVRGLCMLGSLNQSLQFVERLMQKGL 207

Query: 199 KPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALD 258
            P+  TY+ L+EA   E   +EA++L DE++ +G  P+L +YN ++ G CKEG  D A+ 
Sbjct: 208 APNAFTYSFLLEAAYKERGTDEAVKLLDEIIVKGGEPNLVSYNVLLTGFCKEGRTDDAMA 267

Query: 259 FVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFC 318
             R L A+G   +VVSYNILLR      RWE+   L+ +M      P+VVT++ILI+S  
Sbjct: 268 LFRELPAKGFKANVVSYNILLRCLCCDGRWEEANSLLAEMDGGDRAPSVVTYNILINSLA 327

Query: 319 REGRVREAVNVLEVMKEKG--LTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPD 378
             GR  +A+ VL+ M +        + SY+P+I+  CKEG++DL ++ L++M+   C P+
Sbjct: 328 FHGRTEQALQVLKEMSKGNHQFRVTATSYNPVIARLCKEGKVDLVVKCLDEMIYRRCKPN 387

Query: 379 IVNYNTILATLCKFGC-ADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMI 438
              YN I  +LC+       A  + + L       T   Y ++ ++L   GN   A +++
Sbjct: 388 EGTYNAI-GSLCEHNSKVQEAFYIIQSLSNKQKCCTHDFYKSVITSLCRKGNTFAAFQLL 447

Query: 439 SEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM-EATRFQPTVISFNIVLLGMCK 498
            EM R G DPD  TY++LI  LC +G+   A+ +L  M E+   +PTV +FN ++LG+CK
Sbjct: 448 YEMTRCGFDPDAHTYSALIRGLCLEGMFTGAMEVLSIMEESENCKPTVDNFNAMILGLCK 507

Query: 499 AHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGVISGDSS 558
             R    +E+   MVEK  +PNET+Y +L+EGIA+      A E+ + L    VI  ++ 
Sbjct: 508 IRRTDLAMEVFEMMVEKKRMPNETTYAILVEGIAHEDELELAKEVLDELRLRKVIGQNAV 567

Query: 559 KRLNKTF 561
            R+   F
Sbjct: 568 DRIVMQF 573

BLAST of CsGy1G031790 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 287.7 bits (735), Expect = 2.9e-76
Identity = 150/512 (29.30%), Positives = 259/512 (50.59%), Query Frame = 0

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG 127
           L+   CRA +   ++  LE + S G  PD    T +++G+    +L  A+R+ E +  +G
Sbjct: 195 LIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFG 254

Query: 128 -------------------------------------DPDVYSYNAMISGFSKANQIDSA 187
                                                 PD Y++N +++G  KA  +  A
Sbjct: 255 CSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHA 314

Query: 188 NQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEA 247
            ++ D M   G+ PDV TYN +I  LC  G+++ A EV+D+++   C P+ +TY  LI  
Sbjct: 315 IEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLIST 374

Query: 248 TILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPD 307
              E ++ EA EL   L S+G+ PD+ T+N++I+G+C       A++    + ++GC PD
Sbjct: 375 LCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPD 434

Query: 308 VVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLE 367
             +YN+L+ S  +K + ++   ++K M LSGC  +V+T++ LI  FC+  + REA  + +
Sbjct: 435 EFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFD 494

Query: 368 VMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFG 427
            M+  G++ +S +Y+ LI   CK  R++ A + +++M+ +G  PD   YN++L   C+ G
Sbjct: 495 EMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGG 554

Query: 428 CADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYN 487
               A D+ + +   GC P +  Y T+ S L   G    A +++  +  KGI+     YN
Sbjct: 555 DIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYN 614

Query: 488 SLISCLCRDGLVDEAIGLLVDM-EATRFQPTVISFNIVLLGMCK-AHRVFEGIELLITMV 541
            +I  L R     EAI L  +M E     P  +S+ IV  G+C     + E ++ L+ ++
Sbjct: 615 PVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGGGPIREAVDFLVELL 674

BLAST of CsGy1G031790 vs. ExPASy Swiss-Prot
Match: Q0WKV3 (Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12300 PE=2 SV=1)

HSP 1 Score: 280.8 bits (717), Expect = 3.5e-74
Identity = 141/470 (30.00%), Positives = 254/470 (54.04%), Query Frame = 0

Query: 67  KLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM-EILET 126
           +L +   +  +++  L   + +  KG   ++   + +I  F   R L  A   M +I++ 
Sbjct: 93  RLFSAIAKTKQYDLVLALCKQMELKGIAHNLYTLSIMINCFCRCRKLCLAFSAMGKIIKL 152

Query: 127 YGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELA 186
             +P+  +++ +I+G     ++  A ++ DRM   G  PD++T N ++  LC  GK   A
Sbjct: 153 GYEPNTITFSTLINGLCLEGRVSEALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEA 212

Query: 187 FEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRG 246
             ++D++++ GC+P+ +TY  ++      G+   A+EL  ++  R ++ D   Y+ II G
Sbjct: 213 MLLIDKMVEYGCQPNAVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDG 272

Query: 247 ICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPN 306
           +CK G  D A +    +  +G   ++++YNIL+  F N  RW+DG +L++DM+     PN
Sbjct: 273 LCKHGSLDNAFNLFNEMEMKGITTNIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPN 332

Query: 307 VVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLE 366
           VVT S+LI SF +EG++REA  + + M  +G+ PD+ +Y  LI  FCKE  LD A + ++
Sbjct: 333 VVTFSVLIDSFVKEGKLREAEELHKEMIHRGIAPDTITYTSLIDGFCKENHLDKANQMVD 392

Query: 367 KMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCG 426
            MVS GC P+I  +N ++   CK    D  L++F K+   G       YNT+       G
Sbjct: 393 LMVSKGCDPNIRTFNILINGYCKANRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELG 452

Query: 427 NKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFN 486
               A E+  EM+ + + P+ +TY  L+  LC +G  ++A+ +   +E ++ +  +  +N
Sbjct: 453 KLNVAKELFQEMVSRKVPPNIVTYKILLDGLCDNGESEKALEIFEKIEKSKMELDIGIYN 512

Query: 487 IVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEA 536
           I++ GMC A +V +  +L  ++  KG  P   +Y ++I G+   G  +EA
Sbjct: 513 IIIHGMCNASKVDDAWDLFCSLPLKGVKPGVKTYNIMIGGLCKKGPLSEA 562

BLAST of CsGy1G031790 vs. NCBI nr
Match: XP_004142590.1 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis sativus] >KGN66736.1 hypothetical protein Csa_007448 [Cucumis sativus])

HSP 1 Score: 1162 bits (3006), Expect = 0.0
Identity = 581/581 (100.00%), Positives = 581/581 (100.00%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA
Sbjct: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581
           SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of CsGy1G031790 vs. NCBI nr
Match: KAA0038402.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1121 bits (2900), Expect = 0.0
Identity = 559/581 (96.21%), Positives = 571/581 (98.28%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPL+KPTIPQS SDSIP  RFSNKT+LRNVTSSAE RQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDEL+SRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRA+DFVR LSARGCNPDVVSYNILLRSFLNKSRWEDGE+LMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPD+YSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYL+KMVSDGCLPDIVNYNTILATLCKFGCADLALD+FEKLD+VGCPP V +YNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDQVGCPPNVSSYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT FQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581
           SLYRLGVIS DSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of CsGy1G031790 vs. NCBI nr
Match: TYJ96990.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1121 bits (2899), Expect = 0.0
Identity = 559/581 (96.21%), Positives = 570/581 (98.11%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQS HFTNPL+KPTIPQS SDSIP  RFSNKT+LRNVTSSAE RQPHFPNLDN
Sbjct: 1   MFSSEFLPQSFHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDEL+SRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRA+DFVR LSARGCNPDVVSYNILLRSFLNKSRWEDGE+LMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPD+YSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYL+KMVSDGCLPDIVNYNTILATLCKFGCADLALD+FEKLDEVGCPP V +YNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT FQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581
           SLYRLGVIS DSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of CsGy1G031790 vs. NCBI nr
Match: XP_008443759.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis melo])

HSP 1 Score: 1120 bits (2897), Expect = 0.0
Identity = 559/581 (96.21%), Positives = 570/581 (98.11%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPL+KPTIPQS SDSIP  RFSNKT+LRNVTSSAE RQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KL LAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDEL+SRGLRPDLYTYN
Sbjct: 181 KLALAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRA+DFVR LSARGCNPDVVSYNILLRSFLNKSRWEDGE+LMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPD+YSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYL+KMVSDGCLPDIVNYNTILATLCKFGCADLALD+FEKLDEVGCPP V +YNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT FQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581
           SLYRLGVIS DSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of CsGy1G031790 vs. NCBI nr
Match: XP_038880759.1 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Benincasa hispida])

HSP 1 Score: 1077 bits (2786), Expect = 0.0
Identity = 537/581 (92.43%), Positives = 558/581 (96.04%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPL+KPTIP+S SDS+   +FSNKTHLRN  SSAE R+PHF NL N
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPRSHSDSLVTRKFSNKTHLRNGASSAESREPHFSNLHN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQI+SAN+VFDRMRSRGFSPDVVTYNIMIG LCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRSRGFSPDVVTYNIMIGCLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDEL+SRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRA++FV+ LSARGCNPDV+SYNILLRSFLNKSRW DGE+LMKDMVL 
Sbjct: 241 AIIRGICKEGMEDRAVEFVQGLSARGCNPDVISYNILLRSFLNKSRWADGEKLMKDMVLI 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISS CREGRV EAVNVL+VMKEKGLTPD+YSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSLCREGRVGEAVNVLKVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYL KMVSDGCLPDIVNYNTILATLCKFG ADLALD+FEKLDEVGCPP V +YNTMFSA
Sbjct: 361 IEYLHKMVSDGCLPDIVNYNTILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCG KIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT FQPT
Sbjct: 421 LWSCGKKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATNFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGC+PN+TSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCVPNKTSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581
           +LYRLGVI  DSSKRLNKTFPMLDVYKGLSLSESKNQLLQ+
Sbjct: 541 ALYRLGVICEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQT 581

BLAST of CsGy1G031790 vs. ExPASy TrEMBL
Match: A0A0A0M3C6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G666460 PE=4 SV=1)

HSP 1 Score: 1162 bits (3006), Expect = 0.0
Identity = 581/581 (100.00%), Positives = 581/581 (100.00%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA
Sbjct: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581
           SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of CsGy1G031790 vs. ExPASy TrEMBL
Match: A0A5A7T4J1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold270G002930 PE=4 SV=1)

HSP 1 Score: 1121 bits (2900), Expect = 0.0
Identity = 559/581 (96.21%), Positives = 571/581 (98.28%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPL+KPTIPQS SDSIP  RFSNKT+LRNVTSSAE RQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDEL+SRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRA+DFVR LSARGCNPDVVSYNILLRSFLNKSRWEDGE+LMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPD+YSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYL+KMVSDGCLPDIVNYNTILATLCKFGCADLALD+FEKLD+VGCPP V +YNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDQVGCPPNVSSYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT FQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581
           SLYRLGVIS DSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581
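
The Identity and Positives figures in each hit header can be reproduced from the alignment itself. The sketch below is illustrative only and not part of the database pipeline; the function names `midline_counts` and `percent` are hypothetical. It assumes the usual BLAST midline convention, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap.

```python
from typing import Tuple


def midline_counts(midline: str) -> Tuple[int, int]:
    """Count identities and positives in one BLAST midline string.

    Assumes the usual BLAST convention: a letter marks an identical
    residue, '+' a conservative substitution, a space a mismatch or gap.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


def percent(count: int, aligned_length: int) -> float:
    """Percentage as reported in the 'Identity = n/len (x%)' lines."""
    return round(100.0 * count / aligned_length, 2)


# Totals reported for the A0A5A7T4J1 hit: 559 identities and
# 571 positives over 581 aligned columns.
print(percent(559, 581))
print(percent(571, 581))

# Midline of the final block (query positions 541-581) of the same hit.
print(midline_counts("SLYRLGVIS DSSKRLNKTFPMLDVYKGLSLSESKNQLLQS"))
```

Summing the per-block counts over all midlines of a hit should give the totals in its header, provided the leading whitespace of each midline is trimmed to the alignment columns first.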

BLAST of CsGy1G031790 vs. ExPASy TrEMBL
Match: A0A5D3BAX6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold506G00380 PE=4 SV=1)

HSP 1 Score: 1121 bits (2899), Expect = 0.0
Identity = 559/581 (96.21%), Positives = 570/581 (98.11%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQS HFTNPL+KPTIPQS SDSIP  RFSNKT+LRNVTSSAE RQPHFPNLDN
Sbjct: 1   MFSSEFLPQSFHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDEL+SRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRA+DFVR LSARGCNPDVVSYNILLRSFLNKSRWEDGE+LMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPD+YSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYL+KMVSDGCLPDIVNYNTILATLCKFGCADLALD+FEKLDEVGCPP V +YNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT FQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581
           SLYRLGVIS DSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of CsGy1G031790 vs. ExPASy TrEMBL
Match: A0A1S3B9K3 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487275 PE=4 SV=1)

HSP 1 Score: 1120 bits (2897), Expect = 0.0
Identity = 559/581 (96.21%), Positives = 570/581 (98.11%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPL+KPTIPQS SDSIP  RFSNKT+LRNVTSSAE RQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KL LAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDEL+SRGLRPDLYTYN
Sbjct: 181 KLALAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRA+DFVR LSARGCNPDVVSYNILLRSFLNKSRWEDGE+LMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPD+YSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYL+KMVSDGCLPDIVNYNTILATLCKFGCADLALD+FEKLDEVGCPP V +YNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT FQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581
           SLYRLGVIS DSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of CsGy1G031790 vs. ExPASy TrEMBL
Match: A0A6J1H8M7 (pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111461503 PE=4 SV=1)

HSP 1 Score: 1008 bits (2607), Expect = 0.0
Identity = 504/571 (88.27%), Positives = 531/571 (92.99%), Query Frame = 0

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSE L QSLHF NPL+ PTIPQS S S    RF NKTHLRN  SSAE R+PH P LDN
Sbjct: 1   MFSSELLSQSLHFINPLSNPTIPQSHSSSFTR-RFPNKTHLRNGASSAETREPHDPILDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           R+ HLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RETHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQI+SAN+VFDRMR RGFSPDVVTYNI+IGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIESANKVFDRMRRRGFSPDVVTYNILIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KLELA+EV+DELLKDGC+PSVITYTILIEATIL+GRI EAL+L DEL+SRGLRPD YTYN
Sbjct: 181 KLELAYEVLDELLKDGCEPSVITYTILIEATILDGRIREALKLLDELLSRGLRPDRYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMED+A++FVR L ARGCNPDV+SYNILLRS LNKSRW DGERLMKDMV S
Sbjct: 241 AIIRGICKEGMEDQAVEFVRDLLARGCNPDVISYNILLRSLLNKSRWGDGERLMKDMVSS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISS CREGRV EAVNVL+VMK+KGLTPD+YSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSLCREGRVEEAVNVLKVMKQKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYL KMVSDGCLPDIVNYN+ILATLCKFG ADLALD+FEKLDEVGCPP V +YNTMFSA
Sbjct: 361 IEYLHKMVSDGCLPDIVNYNSILATLCKFGSADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMI KGID DEITYNSLISCLCRDGLVDEAIGLLVDMEAT FQPT
Sbjct: 421 LWSCGNKIKALEMISEMIGKGIDADEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLG+CKAHRVFEGIELL TMVEKGC PNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGLCKAHRVFEGIELLTTMVEKGCQPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSL 571
           +LYR+GVI  +SSKRLNK FPML+VYKGLSL
Sbjct: 541 ALYRMGVICEESSKRLNKIFPMLEVYKGLSL 570

BLAST of CsGy1G031790 vs. TAIR 10
Match: AT3G04760.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 734.6 bits (1895), Expect = 6.3e-212
Identity = 348/555 (62.70%), Positives = 444/555 (80.00%), Query Frame = 0

Query: 11  LHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDNRDAHLMKLLN 70
           L F+N  + P     RS S    R    T   + T   E RQ H  +L  RD  ++K+ +
Sbjct: 40  LTFSN--SNPNNDNGRSFSSSGARNLQTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFH 99

Query: 71  RSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDPD 130
           RSCR+G + ESL+ LE++V KG+ PDV+LCTKLIKGFF  RN+ KA+RVMEILE +G PD
Sbjct: 100 RSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPD 159

Query: 131 VYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMD 190
           V++YNA+I+GF K N+ID A +V DRMRS+ FSPD VTYNIMIGSLCSRGKL+LA +V++
Sbjct: 160 VFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLN 219

Query: 191 ELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEG 250
           +LL D C+P+VITYTILIEAT+LEG ++EAL+L DE++SRGL+PD++TYN IIRG+CKEG
Sbjct: 220 QLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEG 279

Query: 251 MEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHS 310
           M DRA + VR+L  +GC PDV+SYNILLR+ LN+ +WE+GE+LM  M    C+PNVVT+S
Sbjct: 280 MVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYS 339

Query: 311 ILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSD 370
           ILI++ CR+G++ EA+N+L++MKEKGLTPD+YSYDPLI+AFC+EGRLD+AIE+LE M+SD
Sbjct: 340 ILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISD 399

Query: 371 GCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKA 430
           GCLPDIVNYNT+LATLCK G AD AL++F KL EVGC P   +YNTMFSALWS G+KI+A
Sbjct: 400 GCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRA 459

Query: 431 LEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIVLLG 490
           L MI EM+  GIDPDEITYNS+ISCLCR+G+VDEA  LLVDM +  F P+V+++NIVLLG
Sbjct: 460 LHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLG 519

Query: 491 MCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGVISG 550
            CKAHR+ + I +L +MV  GC PNET+Y +LIEGI +AG+RAEAMELAN L R+  IS 
Sbjct: 520 FCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISE 579

Query: 551 DSSKRLNKTFPMLDV 566
            S KRL++TFP+L+V
Sbjct: 580 YSFKRLHRTFPLLNV 592

BLAST of CsGy1G031790 vs. TAIR 10
Match: AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 464.2 bits (1193), Expect = 1.6e-130
Identity = 223/489 (45.60%), Positives = 319/489 (65.24%), Query Frame = 0

Query: 69  LNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGD 128
           L +  R G+  E   FLE++V  G  PD++ CT LI+GF      +KA +++EILE  G 
Sbjct: 109 LRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGA 168

Query: 129 -PDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFE 188
            PDV +YN MISG+ KA +I++A  V DRM     SPDVVTYN ++ SLC  GKL+ A E
Sbjct: 169 VPDVITYNVMISGYCKAGEINNALSVLDRM---SVSPDVVTYNTILRSLCDSGKLKQAME 228

Query: 189 VMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGIC 248
           V+D +L+  C P VITYTILIEAT  +  +  A++L DE+  RG  PD+ TYN ++ GIC
Sbjct: 229 VLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGIC 288

Query: 249 KEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVV 308
           KEG  D A+ F+  + + GC P+V+++NI+LRS  +  RW D E+L+ DM+  G  P+VV
Sbjct: 289 KEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVV 348

Query: 309 THSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKM 368
           T +ILI+  CR+G +  A+++LE M + G  P+S SY+PL+  FCKE ++D AIEYLE+M
Sbjct: 349 TFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERM 408

Query: 369 VSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNK 428
           VS GC PDIV YNT+L  LCK G  + A+++  +L   GC P +  YNT+   L   G  
Sbjct: 409 VSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKT 468

Query: 429 IKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIV 488
            KA++++ EM  K + PD ITY+SL+  L R+G VDEAI    + E    +P  ++FN +
Sbjct: 469 GKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSI 528

Query: 489 LLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGV 548
           +LG+CK+ +    I+ L+ M+ +GC PNETSY +LIEG+AY G   EA+EL N L   G+
Sbjct: 529 MLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGL 588

Query: 549 ISGDSSKRL 557
           +   S++++
Sbjct: 589 MKKSSAEQV 594

BLAST of CsGy1G031790 vs. TAIR 10
Match: AT1G79080.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 304.3 bits (778), Expect = 2.1e-82
Identity = 170/487 (34.91%), Positives = 275/487 (56.47%), Query Frame = 0

Query: 79  NESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG-DPDVYSYNAM 138
           ++S   LES+V+ G KP+V   T+L+     +  LKKA+RV+E++ + G  PD  +Y  +
Sbjct: 88  SDSFSHLESLVTGGHKPNVAHSTQLLYDLCKANRLKKAIRVIELMVSSGIIPDASAYTYL 147

Query: 139 ISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGC 198
           ++   K   +  A Q+ ++M   G+  + VTYN ++  LC  G L  + + ++ L++ G 
Sbjct: 148 VNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTYNALVRGLCMLGSLNQSLQFVERLMQKGL 207

Query: 199 KPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALD 258
            P+  TY+ L+EA   E   +EA++L DE++ +G  P+L +YN ++ G CKEG  D A+ 
Sbjct: 208 APNAFTYSFLLEAAYKERGTDEAVKLLDEIIVKGGEPNLVSYNVLLTGFCKEGRTDDAMA 267

Query: 259 FVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFC 318
             R L A+G   +VVSYNILLR      RWE+   L+ +M      P+VVT++ILI+S  
Sbjct: 268 LFRELPAKGFKANVVSYNILLRCLCCDGRWEEANSLLAEMDGGDRAPSVVTYNILINSLA 327

Query: 319 REGRVREAVNVLEVMKEKG--LTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPD 378
             GR  +A+ VL+ M +        + SY+P+I+  CKEG++DL ++ L++M+   C P+
Sbjct: 328 FHGRTEQALQVLKEMSKGNHQFRVTATSYNPVIARLCKEGKVDLVVKCLDEMIYRRCKPN 387

Query: 379 IVNYNTILATLCKFGC-ADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMI 438
              YN I  +LC+       A  + + L       T   Y ++ ++L   GN   A +++
Sbjct: 388 EGTYNAI-GSLCEHNSKVQEAFYIIQSLSNKQKCCTHDFYKSVITSLCRKGNTFAAFQLL 447

Query: 439 SEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM-EATRFQPTVISFNIVLLGMCK 498
            EM R G DPD  TY++LI  LC +G+   A+ +L  M E+   +PTV +FN ++LG+CK
Sbjct: 448 YEMTRCGFDPDAHTYSALIRGLCLEGMFTGAMEVLSIMEESENCKPTVDNFNAMILGLCK 507

Query: 499 AHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGVISGDSS 558
             R    +E+   MVEK  +PNET+Y +L+EGIA+      A E+ + L    VI  ++ 
Sbjct: 508 IRRTDLAMEVFEMMVEKKRMPNETTYAILVEGIAHEDELELAKEVLDELRLRKVIGQNAV 567

Query: 559 KRLNKTF 561
            R+   F
Sbjct: 568 DRIVMQF 573

BLAST of CsGy1G031790 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 287.7 bits (735), Expect = 2.0e-77
Identity = 150/512 (29.30%), Positives = 259/512 (50.59%), Query Frame = 0

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG 127
           L+   CRA +   ++  LE + S G  PD    T +++G+    +L  A+R+ E +  +G
Sbjct: 195 LIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFG 254

Query: 128 -------------------------------------DPDVYSYNAMISGFSKANQIDSA 187
                                                 PD Y++N +++G  KA  +  A
Sbjct: 255 CSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHA 314

Query: 188 NQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEA 247
            ++ D M   G+ PDV TYN +I  LC  G+++ A EV+D+++   C P+ +TY  LI  
Sbjct: 315 IEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLIST 374

Query: 248 TILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPD 307
              E ++ EA EL   L S+G+ PD+ T+N++I+G+C       A++    + ++GC PD
Sbjct: 375 LCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPD 434

Query: 308 VVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLE 367
             +YN+L+ S  +K + ++   ++K M LSGC  +V+T++ LI  FC+  + REA  + +
Sbjct: 435 EFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFD 494

Query: 368 VMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFG 427
            M+  G++ +S +Y+ LI   CK  R++ A + +++M+ +G  PD   YN++L   C+ G
Sbjct: 495 EMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGG 554

Query: 428 CADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYN 487
               A D+ + +   GC P +  Y T+ S L   G    A +++  +  KGI+     YN
Sbjct: 555 DIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYN 614

Query: 488 SLISCLCRDGLVDEAIGLLVDM-EATRFQPTVISFNIVLLGMCK-AHRVFEGIELLITMV 541
            +I  L R     EAI L  +M E     P  +S+ IV  G+C     + E ++ L+ ++
Sbjct: 615 PVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGGGPIREAVDFLVELL 674

BLAST of CsGy1G031790 vs. TAIR 10
Match: AT1G62670.1 (rna processing factor 2 )

HSP 1 Score: 280.8 bits (717), Expect = 2.5e-75
Identity = 142/483 (29.40%), Positives = 257/483 (53.21%), Query Frame = 0

Query: 67  KLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETY 126
           KLL+   +  K +  +   E + + G   +    + LI  F     L  A+ V+  +   
Sbjct: 86  KLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYSILINCFCRRSQLPLALAVLGKMMKL 145

Query: 127 G-DPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELA 186
           G +P++ + +++++G+  + +I  A  + D+M   G+ P+ VT+N +I  L    K   A
Sbjct: 146 GYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTGYQPNTVTFNTLIHGLFLHNKASEA 205

Query: 187 FEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRG 246
             ++D ++  GC+P ++TY +++      G  + A  L +++    L P +  YN II G
Sbjct: 206 MALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNTIIDG 265

Query: 247 ICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPN 306
           +CK    D AL+  + +  +G  P+VV+Y+ L+    N  RW D  RL+ DM+     P+
Sbjct: 266 LCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWSDASRLLSDMIERKINPD 325

Query: 307 VVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLE 366
           V T S LI +F +EG++ EA  + + M ++ + P   +Y  LI+ FC   RLD A +  E
Sbjct: 326 VFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCMHDRLDEAKQMFE 385

Query: 367 KMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCG 426
            MVS  C PD+V YNT++   CK+   +  ++VF ++ + G       YN +   L+  G
Sbjct: 386 FMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFREMSQRGLVGNTVTYNILIQGLFQAG 445

Query: 427 NKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFN 486
           +   A E+  EM+  G+ P+ +TYN+L+  LC++G +++A+ +   ++ ++ +PT+ ++N
Sbjct: 446 DCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGKLEKAMVVFEYLQRSKMEPTIYTYN 505

Query: 487 IVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRL 546
           I++ GMCKA +V +G +L   +  KG  P+  +Y  +I G    G + EA  L   +   
Sbjct: 506 IMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNTMISGFCRKGSKEEADALFKEMKED 565

Query: 547 GVI 549
           G +
Sbjct: 566 GTL 568

The following BLAST results are available for this feature:
Match Name    E-value     Identity    Description
Q9SR00        8.9e-211    62.70       Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop...
Q3EDF8        2.2e-129    45.60       Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX...
A3KPF8        3.0e-81     34.91       Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidop...
Q9LFF1        2.9e-76     29.30       Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...
Q0WKV3        3.5e-74     30.00       Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidop...

Match Name        E-value    Identity    Description
XP_004142590.1    0.0        100.00      pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis sa...
KAA0038402.1      0.0        96.21       pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
TYJ96990.1        0.0        96.21       pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_008443759.1    0.0        96.21       PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic ...
XP_038880759.1    0.0        92.43       pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Benincasa ...

Match Name    E-value    Identity    Description
A0A0A0M3C6    0.0        100.00      Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G666460 PE=4 SV=1
A0A5A7T4J1    0.0        96.21       Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A5D3BAX6    0.0        96.21       Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3B9K3    0.0        96.21       pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucumis ...
A0A6J1H8M7    0.0        88.27       pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Cucurbit...

Match Name     E-value     Identity    Description
AT3G04760.1    6.3e-212    62.70       Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G09900.1    1.6e-130    45.60       Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G79080.1    2.1e-82     34.91       Pentatricopeptide repeat (PPR) superfamily protein
AT3G53700.1    2.0e-77     29.30       Pentatricopeptide repeat (PPR) superfamily protein
AT1G62670.1    2.5e-75     29.40       rna processing factor 2

InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D    1.25.40.10    Tetratricopeptide repeat domain
    coord: 36..194; e-value: 3.8E-33; score: 116.6
    coord: 403..475; e-value: 5.2E-20; score: 73.8
    coord: 261..331; e-value: 1.6E-20; score: 75.4
    coord: 195..260; e-value: 1.5E-19; score: 72.2
    coord: 476..544; e-value: 1.1E-10; score: 43.4
    coord: 332..402; e-value: 8.9E-20; score: 73.0
IPR002885    Pentatricopeptide repeat    PFAM    PF12854    PPR_1
    coord: 440..473; e-value: 1.0E-13; score: 50.7
    coord: 335..368; e-value: 1.4E-9; score: 37.5
IPR002885    Pentatricopeptide repeat    TIGRFAM    TIGR00756    TIGR00756
    coord: 307..341; e-value: 7.0E-9; score: 33.3
    coord: 167..201; e-value: 1.0E-9; score: 36.0
    coord: 379..411; e-value: 3.1E-5; score: 21.9
    coord: 132..166; e-value: 5.3E-11; score: 40.0
    coord: 342..376; e-value: 3.0E-9; score: 34.5
    coord: 202..235; e-value: 1.1E-6; score: 26.4
    coord: 237..271; e-value: 1.6E-7; score: 29.0
    coord: 482..516; e-value: 7.7E-5; score: 20.6
    coord: 413..446; e-value: 4.7E-4; score: 18.1
    coord: 272..306; e-value: 4.1E-8; score: 30.9
    coord: 447..480; e-value: 2.2E-8; score: 31.7
IPR002885    Pentatricopeptide repeat    PFAM    PF13041    PPR_2
    coord: 479..526; e-value: 9.9E-8; score: 32.1
    coord: 129..178; e-value: 1.9E-18; score: 66.4
    coord: 269..318; e-value: 7.6E-15; score: 54.9
    coord: 374..421; e-value: 2.5E-9; score: 37.2
    coord: 214..248; e-value: 3.8E-9; score: 36.6
IPR002885    Pentatricopeptide repeat    PFAM    PF13812    PPR_3
    coord: 85..122; e-value: 5.4E-6; score: 26.4
    coord: 179..210; e-value: 2.4E-5; score: 24.3
IPR002885    Pentatricopeptide repeat    PROSITE    PS51375    PPR
    coord: 165..199; score: 13.361882
    coord: 235..269; score: 12.057487
    coord: 305..339; score: 13.208424
    coord: 445..479; score: 12.539784
    coord: 480..514; score: 9.941957
    coord: 410..444; score: 10.522905
    coord: 200..234; score: 11.640958
    coord: 130..164; score: 13.942831
    coord: 270..304; score: 11.936913
    coord: 375..409; score: 11.180584
    coord: 340..374; score: 12.967276
None    No IPR available    PANTHER    PTHR47932    ATPASE EXPRESSION PROTEIN 3
    coord: 1..566
None    No IPR available    PANTHER    PTHR47932:SF16    PENTATRICOPEPTIDE (PPR) REPEAT PROTEIN
    coord: 1..566
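
The PROSITE PS51375 matches above are listed out of order, which makes it hard to see how the PPR motifs are laid out along the protein. The sketch below is a hypothetical helper, not part of the InterPro analysis; the names `parse_coords` and `is_contiguous` are assumptions. It pulls the `coord: start..end` ranges out of the text, sorts them, and checks whether each repeat starts right after the previous one ends.

```python
import re
from typing import List, Tuple

# Matches the "coord: start..end" fragments used in the InterPro table.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(lines: List[str]) -> List[Tuple[int, int]]:
    """Extract (start, end) pairs from lines and return them sorted."""
    spans = []
    for line in lines:
        match = COORD_RE.search(line)
        if match:
            spans.append((int(match.group(1)), int(match.group(2))))
    return sorted(spans)


def is_contiguous(spans: List[Tuple[int, int]]) -> bool:
    """True if every span starts one residue after the previous span ends."""
    return all(
        prev_end + 1 == start
        for (_, prev_end), (start, _) in zip(spans, spans[1:])
    )


# The eleven PROSITE PS51375 coordinates, in the order the table lists them.
prosite_lines = [
    "coord: 165..199", "coord: 235..269", "coord: 305..339",
    "coord: 445..479", "coord: 480..514", "coord: 410..444",
    "coord: 200..234", "coord: 130..164", "coord: 270..304",
    "coord: 375..409", "coord: 340..374",
]

spans = parse_coords(prosite_lines)
print(spans[0], spans[-1], len(spans))
print(is_contiguous(spans))
```

The same helper can be pointed at the TIGRFAM or Pfam rows, where the ranges overlap or leave gaps, so `is_contiguous` would not be expected to hold there.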

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsGy1G031790.1    CsGy1G031790.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
molecular_function    GO:0005515        protein binding