CsGy1G024257 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy1G024257
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPentatricopeptide repeat
LocationGy14Chr1: 23126264 .. 23128261 (+)
RNA-Seq ExpressionCsGy1G024257
SyntenyCsGy1G024257
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATATGGAGATATGGCTGTAATATCTTCCAAATAGCATCTCGTCAAATCATCAGAACATTTGCCCATCAGTCATATCGTCGATGTACAACCCCACATATAAATTTAACCAAAATCCTCCACAACCGACTAGACGAAGACATTGAACAGAAATTTTCTGAGAACAGGCAAGTGATTCTCACTAATGATCTCGTGTACACCACTCTGTTGAATTGTTCTTCTGATTTAATCGCTTTGAGCTTTTTCATGTGGTGTGCTAAACAGCCCAATTTCTTCCACAACCCTGCGGCGTTTGATTATATGGTGGGTGTGGTTTCCCGTCTCATGAAACAATATGAAACTGTGAATGGGATTCTTGGGGGGTTGGAGAGTGTTGGAAATGTGACTAAGGCACAAACGTTTCTGCTTCTTTTGAGGATTTATTGGCGTGGAGGAATGTACGATTTGGTTTTTGAAGCATTTGACCATATGGATCGCTATGGATTTACACCAAACACATTTGCACGTAATGTGATTATGGATGTGTTGTTCAAGGTTGGACGTGCTGATGTTGCTTTGAAGGTCTTTAAAGAGACACTGCTACCAAATTTTTTGACATTCAACATTGTATTGTGTAATTTATCCAAAACAAAGGATTTGATAGGTATTGGAGATACTTTTAGATGTATGTTGAGAATGGGGTATTGTCCTAATCCTGGGACATTTGAAGTGGTTTTGAATGGTTTATGCAAATTAGGTAGGCTGGCAGAAGCATATCAAGTATGGGGTATTATGACAACTTTTGGAATATCCATGTCCGTAAACATTTGGACTATAATGATCGATGGATTCTGTAGATTGCGCAGAACCGAAGAAGCTTCTTCTTTGGTGAAAAAGATGAAAAAATCTGGTTGTTCTCCAAACATTGTGACATATACTACCTTGATTAAGGGATATATATATGCACAACGGATTAGTGACGCATTTGATGTTTTAAGTATTATTGAATCAGAAGGGCCTTCGCCTGACCTGATTCTTTATAACGTATTGATTGATAGCCTTGCTAAAAATGAGAGGTACAATGATGCTCTCAGTATTTTTCTTAGTTTGCATAAACGAAACATACTTCCTGACTGTTATACTTTCAGTTCGTTGTTGAATACCATATGTTTATCCAAAAGGCTTTTTCTTCTACCCAAGTTGGTTGATGGATTTTTAGTTGAAGTTGACTTGGTGGCATGTAACTCTCTGCTTAGTTATCTTGGTAAGGCTGGGTTTGCTGCACTTGCCTTAGAACTATATAATAACATGGTGAATGGAGGTTTAATGCCAGATAAGTATAGTGTTCTTGGAGTACTGACTGGTCTTTGTGAATCAAGGAGAATCGGTGAAGCAGTTCGTCTGTACAATGGCATTCTCTTGAACTATACAGGAGTTGATGCTCACATCCACACTGTAATTATAGATGGACTTATAAAAGCTGGTAAATTTCATTCAGCCATTAGGATATTCAGAAGAAATTTATTGGAGGAAAATTCTTTAGATGTTGTATCATATTCTGTTGCCATTCGTGGGCTTCTTCTGGTCGGTAGAAATACAGAGGCTTCTAACTTGTATAACTACATGAAGGAGGCTGGAATAAATCCAAACGGGCATGTTTGCAACGTAATGCTCTCCACTTTTTGTAAAGAAAAAAAATTTGCATTAGTGAAGCAGATGCTGCAAGAGATGATTGACCTGGGAATAAAAATGAGCAGGAATAATTTCTTTAGGTTATACAATGCCATTTGTAGATCATCAAATGGTTCTCATCTGGTTATCTACTTATTAATTGAGATGAAAGTTTTGGGATTATTACCTAGGAAACGAGATTGTGAAACATTGGTTCATAATCCCCCCAAAGATGTGAATATTTCTGAAAAACATTATAAATTACTGAATGGCTGTCTAGAATACTGTCTATGTGGTGATACATCTAGCTCCGAAGAATATACTGATGTGGCTGCTTTTGTGGGCTGA

mRNA sequence

ATGATATGGAGATATGGCTGTAATATCTTCCAAATAGCATCTCGTCAAATCATCAGAACATTTGCCCATCAGTCATATCGTCGATGTACAACCCCACATATAAATTTAACCAAAATCCTCCACAACCGACTAGACGAAGACATTGAACAGAAATTTTCTGAGAACAGGCAAGTGATTCTCACTAATGATCTCGTGTACACCACTCTGTTGAATTGTTCTTCTGATTTAATCGCTTTGAGCTTTTTCATGTGGTGTGCTAAACAGCCCAATTTCTTCCACAACCCTGCGGCGTTTGATTATATGGTGGGTGTGGTTTCCCGTCTCATGAAACAATATGAAACTGTGAATGGGATTCTTGGGGGGTTGGAGAGTGTTGGAAATGTGACTAAGGCACAAACGTTTCTGCTTCTTTTGAGGATTTATTGGCGTGGAGGAATGTACGATTTGGTTTTTGAAGCATTTGACCATATGGATCGCTATGGATTTACACCAAACACATTTGCACGTAATGTGATTATGGATGTGTTGTTCAAGGTTGGACGTGCTGATGTTGCTTTGAAGGTCTTTAAAGAGACACTGCTACCAAATTTTTTGACATTCAACATTGTATTGTGTAATTTATCCAAAACAAAGGATTTGATAGGTATTGGAGATACTTTTAGATGTATGTTGAGAATGGGGTATTGTCCTAATCCTGGGACATTTGAAGTGGTTTTGAATGGTTTATGCAAATTAGGTAGGCTGGCAGAAGCATATCAAGTATGGGGTATTATGACAACTTTTGGAATATCCATGTCCGTAAACATTTGGACTATAATGATCGATGGATTCTGTAGATTGCGCAGAACCGAAGAAGCTTCTTCTTTGGTGAAAAAGATGAAAAAATCTGGTTGTTCTCCAAACATTGTGACATATACTACCTTGATTAAGGGATATATATATGCACAACGGATTAGTGACGCATTTGATGTTTTAAGTATTATTGAATCAGAAGGGCCTTCGCCTGACCTGATTCTTTATAACGTATTGATTGATAGCCTTGCTAAAAATGAGAGGTACAATGATGCTCTCAGTATTTTTCTTAGTTTGCATAAACGAAACATACTTCCTGACTGTTATACTTTCAGTTCGTTGTTGAATACCATATGTTTATCCAAAAGGCTTTTTCTTCTACCCAAGTTGGTTGATGGATTTTTAGTTGAAGTTGACTTGGTGGCATGTAACTCTCTGCTTAGTTATCTTGGTAAGGCTGGGTTTGCTGCACTTGCCTTAGAACTATATAATAACATGGTGAATGGAGGTTTAATGCCAGATAAGTATAGTGTTCTTGGAGTACTGACTGGTCTTTGTGAATCAAGGAGAATCGGTGAAGCAGTTCGTCTGTACAATGGCATTCTCTTGAACTATACAGGAGTTGATGCTCACATCCACACTGTAATTATAGATGGACTTATAAAAGCTGGTAAATTTCATTCAGCCATTAGGATATTCAGAAGAAATTTATTGGAGGAAAATTCTTTAGATGTTGTATCATATTCTGTTGCCATTCGTGGGCTTCTTCTGGTCGGTAGAAATACAGAGGCTTCTAACTTGTATAACTACATGAAGGAGGCTGGAATAAATCCAAACGGGCATGTTTGCAACGTAATGCTCTCCACTTTTTGTAAAGAAAAAAAATTTGCATTAGTGAAGCAGATGCTGCAAGAGATGATTGACCTGGGAATAAAAATGAGCAGGAATAATTTCTTTAGGTTATACAATGCCATTTGTAGATCATCAAATGGTTCTCATCTGGTTATCTACTTATTAATTGAGATGAAAGTTTTGGGATTATTACCTAGGAAACGAGATTGTGAAACATTGGTTCATAATCCCCCCAAAGATGTGAATATTTCTGAAAAACATTATAAATTACTGAATGGCTGTCTAGAATACTGTCTATGTGGTGATACATCTAGCTCCGAAGAATATACTGATGTGGCTGCTTTTGTGGGCTGA

Coding sequence (CDS)

ATGATATGGAGATATGGCTGTAATATCTTCCAAATAGCATCTCGTCAAATCATCAGAACATTTGCCCATCAGTCATATCGTCGATGTACAACCCCACATATAAATTTAACCAAAATCCTCCACAACCGACTAGACGAAGACATTGAACAGAAATTTTCTGAGAACAGGCAAGTGATTCTCACTAATGATCTCGTGTACACCACTCTGTTGAATTGTTCTTCTGATTTAATCGCTTTGAGCTTTTTCATGTGGTGTGCTAAACAGCCCAATTTCTTCCACAACCCTGCGGCGTTTGATTATATGGTGGGTGTGGTTTCCCGTCTCATGAAACAATATGAAACTGTGAATGGGATTCTTGGGGGGTTGGAGAGTGTTGGAAATGTGACTAAGGCACAAACGTTTCTGCTTCTTTTGAGGATTTATTGGCGTGGAGGAATGTACGATTTGGTTTTTGAAGCATTTGACCATATGGATCGCTATGGATTTACACCAAACACATTTGCACGTAATGTGATTATGGATGTGTTGTTCAAGGTTGGACGTGCTGATGTTGCTTTGAAGGTCTTTAAAGAGACACTGCTACCAAATTTTTTGACATTCAACATTGTATTGTGTAATTTATCCAAAACAAAGGATTTGATAGGTATTGGAGATACTTTTAGATGTATGTTGAGAATGGGGTATTGTCCTAATCCTGGGACATTTGAAGTGGTTTTGAATGGTTTATGCAAATTAGGTAGGCTGGCAGAAGCATATCAAGTATGGGGTATTATGACAACTTTTGGAATATCCATGTCCGTAAACATTTGGACTATAATGATCGATGGATTCTGTAGATTGCGCAGAACCGAAGAAGCTTCTTCTTTGGTGAAAAAGATGAAAAAATCTGGTTGTTCTCCAAACATTGTGACATATACTACCTTGATTAAGGGATATATATATGCACAACGGATTAGTGACGCATTTGATGTTTTAAGTATTATTGAATCAGAAGGGCCTTCGCCTGACCTGATTCTTTATAACGTATTGATTGATAGCCTTGCTAAAAATGAGAGGTACAATGATGCTCTCAGTATTTTTCTTAGTTTGCATAAACGAAACATACTTCCTGACTGTTATACTTTCAGTTCGTTGTTGAATACCATATGTTTATCCAAAAGGCTTTTTCTTCTACCCAAGTTGGTTGATGGATTTTTAGTTGAAGTTGACTTGGTGGCATGTAACTCTCTGCTTAGTTATCTTGGTAAGGCTGGGTTTGCTGCACTTGCCTTAGAACTATATAATAACATGGTGAATGGAGGTTTAATGCCAGATAAGTATAGTGTTCTTGGAGTACTGACTGGTCTTTGTGAATCAAGGAGAATCGGTGAAGCAGTTCGTCTGTACAATGGCATTCTCTTGAACTATACAGGAGTTGATGCTCACATCCACACTGTAATTATAGATGGACTTATAAAAGCTGGTAAATTTCATTCAGCCATTAGGATATTCAGAAGAAATTTATTGGAGGAAAATTCTTTAGATGTTGTATCATATTCTGTTGCCATTCGTGGGCTTCTTCTGGTCGGTAGAAATACAGAGGCTTCTAACTTGTATAACTACATGAAGGAGGCTGGAATAAATCCAAACGGGCATGTTTGCAACGTAATGCTCTCCACTTTTTGTAAAGAAAAAAAATTTGCATTAGTGAAGCAGATGCTGCAAGAGATGATTGACCTGGGAATAAAAATGAGCAGGAATAATTTCTTTAGGTTATACAATGCCATTTGTAGATCATCAAATGGTTCTCATCTGGTTATCTACTTATTAATTGAGATGAAAGTTTTGGGATTATTACCTAGGAAACGAGATTGTGAAACATTGGTTCATAATCCCCCCAAAGATGTGAATATTTCTGAAAAACATTATAAATTACTGAATGGCTGTCTAGAATACTGTCTATGTGGTGATACATCTAGCTCCGAAGAATATACTGATGTGGCTGCTTTTGTGGGCTGA

Protein sequence

MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVILTNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILGGLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVGRADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIFLSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFAALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVIIDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGINPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVIYLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSSEEYTDVAAFVG*
Homology
BLAST of CsGy1G024257 vs. ExPASy Swiss-Prot
Match: Q3EDA9 (Putative pentatricopeptide repeat-containing protein At1g16830 OS=Arabidopsis thaliana OX=3702 GN=At1g16830 PE=3 SV=2)

HSP 1 Score: 412.5 bits (1059), Expect = 8.9e-114
Identity = 214/535 (40.00%), Postives = 335/535 (62.62%), Query Frame = 0

Query: 60  LTNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGIL 119
           LT+D VY+ L    +DL  L+FF WCAKQ N+FH+  AFD+MVGVV +L ++Y +++ I+
Sbjct: 37  LTHDNVYSCLRESPADLKTLNFFFWCAKQNNYFHDDRAFDHMVGVVEKLTREYYSIDRII 96

Query: 120 GGLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKV 179
             L+  G   K + FLLLL I+WRG +YD   E +  M  +GF PNT A N++MDV FK+
Sbjct: 97  ERLKISGCEIKPRVFLLLLEIFWRGHIYDKAIEVYTGMSSFGFVPNTRAMNMMMDVNFKL 156

Query: 180 GRADVALKVFKETLLPNFLTFNIVL---CNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFE 239
              + AL++F+     NF +F+I L   C+     DL+G+    + M+  G+ PN   F 
Sbjct: 157 NVVNGALEIFEGIRFRNFFSFDIALSHFCSRGGRGDLVGVKIVLKRMIGEGFYPNRERFG 216

Query: 240 VVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKS 299
            +L   C+ G ++EA+QV G+M   GIS+SVN+W++++ GF R    ++A  L  KM + 
Sbjct: 217 QILRLCCRTGCVSEAFQVVGLMICSGISVSVNVWSMLVSGFFRSGEPQKAVDLFNKMIQI 276

Query: 300 GCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDA 359
           GCSPN+VTYT+LIKG++    + +AF VLS ++SEG +PD++L N++I +  +  R+ +A
Sbjct: 277 GCSPNLVTYTSLIKGFVDLGMVDEAFTVLSKVQSEGLAPDIVLCNLMIHTYTRLGRFEEA 336

Query: 360 LSIFLSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGK 419
             +F SL KR ++PD YTF+S+L+++CLS +  L+P++  G   + DLV  N L +   K
Sbjct: 337 RKVFTSLEKRKLVPDQYTFASILSSLCLSGKFDLVPRITHGIGTDFDLVTGNLLSNCFSK 396

Query: 420 AGFAALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHI 479
            G+ + AL++ + M       D Y+    L+ LC       A+++Y  I+     +DAH 
Sbjct: 397 IGYNSYALKVLSIMSYKDFALDCYTYTVYLSALCRGGAPRAAIKMYKIIIKEKKHLDAHF 456

Query: 480 HTVIIDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMK 539
           H+ IID LI+ GK+++A+ +F+R +LE+  LDVVSY+VAI+GL+   R  EA +L   MK
Sbjct: 457 HSAIIDSLIELGKYNTAVHLFKRCILEKYPLDVVSYTVAIKGLVRAKRIEEAYSLCCDMK 516

Query: 540 EAGINPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICR 592
           E GI PN      ++S  CKEK+   V+++L+E I  G+++  N  F++Y+ + R
Sbjct: 517 EGGIYPNRRTYRTIISGLCKEKETEKVRKILRECIQEGVELDPNTKFQVYSLLSR 571

BLAST of CsGy1G024257 vs. ExPASy Swiss-Prot
Match: Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 1.3e-43
Identity = 139/546 (25.46%), Postives = 253/546 (46.34%), Query Frame = 0

Query: 78  ALSFFMWCAKQPNFFHNPAAFDYM---------VGVVSR----LMKQYETVNGILGGLES 137
           AL+F  W ++ P + H+  ++  +         VGVV +    ++K  ++V   L  L+ 
Sbjct: 106 ALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVGVVFKIRLLMIKSCDSVGDALYVLDL 165

Query: 138 VGNVTKAQTFLL-----------LLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIM 197
              + K + F L           LL    R G+ D + + +  M      PN +  N ++
Sbjct: 166 CRKMNKDERFELKYKLIIGCYNTLLNSLARFGLVDEMKQVYMEMLEDKVCPNIYTYNKMV 225

Query: 198 DVLFKVGRADVA----LKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYC 257
           +   K+G  + A     K+ +  L P+F T+  ++    + KDL      F  M   G  
Sbjct: 226 NGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCR 285

Query: 258 PNPGTFEVVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSL 317
            N   +  +++GLC   R+ EA  ++  M       +V  +T++I   C   R  EA +L
Sbjct: 286 RNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNL 345

Query: 318 VKKMKKSGCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAK 377
           VK+M+++G  PNI TYT LI       +   A ++L  +  +G  P++I YN LI+   K
Sbjct: 346 VKEMEETGIKPNIHTYTVLIDSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCK 405

Query: 378 NERYNDALSIFLSLHKRNILPDCYTFSSLLNTICLS---KRLFLLPKLVDGFLVEVDLVA 437
                DA+ +   +  R + P+  T++ L+   C S   K + +L K+++  ++  D+V 
Sbjct: 406 RGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCKSNVHKAMGVLNKMLERKVLP-DVVT 465

Query: 438 CNSLLSYLGKAGFAALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGIL 497
            NSL+    ++G    A  L + M + GL+PD+++   ++  LC+S+R+ EA  L++ + 
Sbjct: 466 YNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLE 525

Query: 498 LNYTGVDAHIHTVIIDGLIKAGKFHSAIRIFRRNLLEENSL-DVVSYSVAIRGLLLVGRN 557
                 +  ++T +IDG  KAGK   A  +    +L +N L + ++++  I GL   G+ 
Sbjct: 526 QKGVNPNVVMYTALIDGYCKAGKVDEA-HLMLEKMLSKNCLPNSLTFNALIHGLCADGKL 585

Query: 558 TEASNLYNYMKEAGINPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRL 592
            EA+ L   M + G+ P      +++    K+  F       Q+M+  G K   + +   
Sbjct: 586 KEATLLEEKMVKIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTF 645

BLAST of CsGy1G024257 vs. ExPASy Swiss-Prot
Match: Q9C8T7 (Pentatricopeptide repeat-containing protein At1g63330 OS=Arabidopsis thaliana OX=3702 GN=At1g63330 PE=2 SV=2)

HSP 1 Score: 177.9 bits (450), Expect = 3.7e-43
Identity = 123/487 (25.26%), Postives = 239/487 (49.08%), Query Frame = 0

Query: 98  FDYMVGVVSRLMKQYETVNGILGGLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHM 157
           F+ ++  +++ MK+++ V  +   ++ +G      T+ +L+  + R     L       M
Sbjct: 13  FNKLLSAIAK-MKKFDLVISLGEKMQRLGISHNLYTYNILINCFCRRSQISLALALLGKM 72

Query: 158 DRYGFTPNTFARNVIMDVLFKVGRADVALKVFKETL----LPNFLTFNIVLCNL---SKT 217
            + G+ P+    + +++      R   A+ +  + +     P+ +TF  ++  L   +K 
Sbjct: 73  MKLGYEPSIVTLSSLLNGYCHGKRISDAVALVDQMVEMGYRPDTITFTTLIHGLFLHNKA 132

Query: 218 KDLIGIGDTFRCMLRMGYCPNPGTFEVVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIW 277
            + + + D    M++ G  PN  T+ VV+NGLCK G +  A+ +   M    I   V I+
Sbjct: 133 SEAVALVDR---MVQRGCQPNLVTYGVVVNGLCKRGDIDLAFNLLNKMEAAKIEADVVIF 192

Query: 278 TIMIDGFCRLRRTEEASSLVKKMKKSGCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIES 337
             +ID  C+ R  ++A +L K+M+  G  PN+VTY++LI       R SDA  +LS +  
Sbjct: 193 NTIIDSLCKYRHVDDALNLFKEMETKGIRPNVVTYSSLISCLCSYGRWSDASQLLSDMIE 252

Query: 338 EGPSPDLILYNVLIDSLAKNERYNDALSIFLSLHKRNILPDCYTFSSLLNTICLSKRLFL 397
           +  +P+L+ +N LID+  K  ++ +A  +   + KR+I PD +T++SL+N  C+  RL  
Sbjct: 253 KKINPNLVTFNALIDAFVKEGKFVEAEKLHDDMIKRSIDPDIFTYNSLINGFCMHDRLDK 312

Query: 398 LPKLVDGFLVE----VDLVACNSLLSYLGKAGFAALALELYNNMVNGGLMPDKYSVLGVL 457
             ++ + F+V      DL   N+L+    K+       EL+  M + GL+ D  +   ++
Sbjct: 313 AKQMFE-FMVSKDCFPDLDTYNTLIKGFCKSKRVEDGTELFREMSHRGLVGDTVTYTTLI 372

Query: 458 TGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVIIDGLIKAGKFHSAIRIFRRNLLEENS 517
            GL        A +++  ++ +    D   +++++DGL   GK   A+ +F      E  
Sbjct: 373 QGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSILLDGLCNNGKLEKALEVFDYMQKSEIK 432

Query: 518 LDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGINPNGHVCNVMLSTFCK----EKKFAL 570
           LD+  Y+  I G+   G+  +  +L+  +   G+ PN    N M+S  C     ++ +AL
Sbjct: 433 LDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKGVKPNVVTYNTMISGLCSKRLLQEAYAL 492

BLAST of CsGy1G024257 vs. ExPASy Swiss-Prot
Match: Q9SXD8 (Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana OX=3702 GN=At1g62590 PE=2 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 8.2e-43
Identity = 123/487 (25.26%), Postives = 239/487 (49.08%), Query Frame = 0

Query: 98  FDYMVGVVSRLMKQYETVNGILGGLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHM 157
           F+ ++  +++ MK+++ V  +   ++ +  V    T+ +L+  + R     L       M
Sbjct: 88  FNKLLSAIAK-MKKFDVVISLGEKMQRLEIVHGLYTYNILINCFCRRSQISLALALLGKM 147

Query: 158 DRYGFTPNTFARNVIMDVLFKVGRADVALKVFKETL----LPNFLTFNIVLCNL---SKT 217
            + G+ P+    + +++      R   A+ +  + +     P+ +TF  ++  L   +K 
Sbjct: 148 MKLGYEPSIVTLSSLLNGYCHGKRISDAVALVDQMVEMGYRPDTITFTTLIHGLFLHNKA 207

Query: 218 KDLIGIGDTFRCMLRMGYCPNPGTFEVVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIW 277
            + + + D    M++ G  PN  T+ VV+NGLCK G    A  +   M    I   V I+
Sbjct: 208 SEAVALVDR---MVQRGCQPNLVTYGVVVNGLCKRGDTDLALNLLNKMEAAKIEADVVIF 267

Query: 278 TIMIDGFCRLRRTEEASSLVKKMKKSGCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIES 337
             +ID  C+ R  ++A +L K+M+  G  PN+VTY++LI       R SDA  +LS +  
Sbjct: 268 NTIIDSLCKYRHVDDALNLFKEMETKGIRPNVVTYSSLISCLCSYGRWSDASQLLSDMIE 327

Query: 338 EGPSPDLILYNVLIDSLAKNERYNDALSIFLSLHKRNILPDCYTFSSLLNTICLSKRLFL 397
           +  +P+L+ +N LID+  K  ++ +A  ++  + KR+I PD +T++SL+N  C+  RL  
Sbjct: 328 KKINPNLVTFNALIDAFVKEGKFVEAEKLYDDMIKRSIDPDIFTYNSLVNGFCMHDRLDK 387

Query: 398 LPKLVDGFLVE----VDLVACNSLLSYLGKAGFAALALELYNNMVNGGLMPDKYSVLGVL 457
             ++ + F+V      D+V  N+L+    K+       EL+  M + GL+ D  +   ++
Sbjct: 388 AKQMFE-FMVSKDCFPDVVTYNTLIKGFCKSKRVEDGTELFREMSHRGLVGDTVTYTTLI 447

Query: 458 TGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVIIDGLIKAGKFHSAIRIFRRNLLEENS 517
            GL        A +++  ++ +    D   +++++DGL   GK   A+ +F      E  
Sbjct: 448 QGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSILLDGLCNNGKLEKALEVFDYMQKSEIK 507

Query: 518 LDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGINPNGHVCNVMLSTFCK----EKKFAL 570
           LD+  Y+  I G+   G+  +  +L+  +   G+ PN    N M+S  C     ++ +AL
Sbjct: 508 LDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKGVKPNVVTYNTMISGLCSKRLLQEAYAL 567

BLAST of CsGy1G024257 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 174.1 bits (440), Expect = 5.3e-42
Identity = 136/550 (24.73%), Postives = 253/550 (46.00%), Query Frame = 0

Query: 75  DLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILGGLESVGNVTKAQTF 134
           D  AL  F   +K+PNF   PA ++ ++  + R    ++ +  IL  ++S        TF
Sbjct: 63  DSAALRLFNLASKKPNFSPEPALYEEILLRLGR-SGSFDDMKKILEDMKSSRCEMGTSTF 122

Query: 135 LLLLRIYWRGGMYDLVFEAFDHM-DRYGFTPNTFARNVIMDVLFKVGRADVA----LKVF 194
           L+L+  Y +  + D +    D M D +G  P+T   N ++++L       +      K+ 
Sbjct: 123 LILIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLVDGNSLKLVEISHAKMS 182

Query: 195 KETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLNGLCKLGRLA 254
              + P+  TFN+++  L +   L         M   G  P+  TF  V+ G  + G L 
Sbjct: 183 VWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLD 242

Query: 255 EAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKM-KKSGCSPNIVTYTTL 314
            A ++   M  FG S S     +++ GFC+  R E+A + +++M  + G  P+  T+ TL
Sbjct: 243 GALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTL 302

Query: 315 IKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIFLSLHKRNI 374
           + G   A  +  A +++ ++  EG  PD+  YN +I  L K     +A+ +   +  R+ 
Sbjct: 303 VNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDC 362

Query: 375 LPDCYTFSSLLNTICLSKRL---FLLPKLVDGFLVEVDLVACNSLLSYLGKAGFAALALE 434
            P+  T+++L++T+C   ++     L +++    +  D+   NSL+  L       +A+E
Sbjct: 363 SPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAME 422

Query: 435 LYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVIIDGLI 494
           L+  M + G  PD+++   ++  LC   ++ EA+ +   + L+        +  +IDG  
Sbjct: 423 LFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFC 482

Query: 495 KAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGINPNGH 554
           KA K   A  IF    +   S + V+Y+  I GL    R  +A+ L + M   G  P+ +
Sbjct: 483 KANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKY 542

Query: 555 VCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVIYLL-- 614
             N +L+ FC+         ++Q M   G +     +  L + +C++     +   LL  
Sbjct: 543 TYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGR-VEVASKLLRS 602

BLAST of CsGy1G024257 vs. NCBI nr
Match: XP_011659004.1 (putative pentatricopeptide repeat-containing protein At1g16830 isoform X1 [Cucumis sativus] >KGN65906.1 hypothetical protein Csa_023183 [Cucumis sativus])

HSP 1 Score: 1342 bits (3474), Expect = 0.0
Identity = 665/665 (100.00%), Postives = 665/665 (100.00%), Query Frame = 0

Query: 1   MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL 60
           MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL
Sbjct: 1   MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL 60

Query: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG 120
           TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG
Sbjct: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG 120

Query: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG 180
           GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG
Sbjct: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG 180

Query: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN 240
           RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN
Sbjct: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN 240

Query: 241 GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP 300
           GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP
Sbjct: 241 GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP 300

Query: 301 NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF 360
           NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF
Sbjct: 301 NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF 360

Query: 361 LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA 420
           LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA
Sbjct: 361 LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA 420

Query: 421 ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480
           ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI
Sbjct: 421 ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480

Query: 481 IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI 540
           IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI
Sbjct: 481 IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI 540

Query: 541 NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI 600
           NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI
Sbjct: 541 NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI 600

Query: 601 YLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSSEEYTDV 660
           YLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSSEEYTDV
Sbjct: 601 YLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSSEEYTDV 660

Query: 661 AAFVG 665
           AAFVG
Sbjct: 661 AAFVG 665

BLAST of CsGy1G024257 vs. NCBI nr
Match: XP_038894703.1 (LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g16830 [Benincasa hispida])

HSP 1 Score: 1148 bits (2969), Expect = 0.0
Identity = 570/665 (85.71%), Postives = 611/665 (91.88%), Query Frame = 0

Query: 1   MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL 60
           MIWR+GCNIFQIA RQI+RTFAHQS+R+ + P +NLTK LHNRLDEDIEQKFSENRQVIL
Sbjct: 1   MIWRHGCNIFQIAPRQILRTFAHQSHRKWSAPFLNLTKNLHNRLDEDIEQKFSENRQVIL 60

Query: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG 120
           TND V TTLLNCSSDL+ALSFFMWCAKQPNFFHN AAFDYMVGVVSRLMK+YETVNGI+G
Sbjct: 61  TNDRVLTTLLNCSSDLVALSFFMWCAKQPNFFHNTAAFDYMVGVVSRLMKRYETVNGIIG 120

Query: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG 180
           GLESVGNVTKAQTFLLLLRIYWRGGMY+LVFEAFDHMD  G+TPNTFA NVI+D+LFKVG
Sbjct: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYNLVFEAFDHMDHCGYTPNTFAHNVIIDMLFKVG 180

Query: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN 240
           RADVALKVFKETL+PNFLTFNIVLCNLSK KDLIGI D FRCM RMGYCPNPGTFEVVLN
Sbjct: 181 RADVALKVFKETLMPNFLTFNIVLCNLSKIKDLIGIRDVFRCMWRMGYCPNPGTFEVVLN 240

Query: 241 GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP 300
           GLCKLGRLAEA+QV G+M TFG+SMSVNIWTIMIDG CRL RT+ AS LVKKM++SGCSP
Sbjct: 241 GLCKLGRLAEAHQVLGVMITFGLSMSVNIWTIMIDGLCRLHRTD-ASFLVKKMERSGCSP 300

Query: 301 NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF 360
           NIVT+TTLIKGYIYA+R+SDAFDVLSIIESEGPSPD ILYNVLIDSLAKNER+NDALSIF
Sbjct: 301 NIVTHTTLIKGYIYAKRVSDAFDVLSIIESEGPSPDQILYNVLIDSLAKNERFNDALSIF 360

Query: 361 LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA 420
           L++ KRNILPDCYTFSSLLNTICLSKR FLLPKLVD F VEVDLVACNSLLSYLGKAGF 
Sbjct: 361 LNMDKRNILPDCYTFSSLLNTICLSKRFFLLPKLVDEFFVEVDLVACNSLLSYLGKAGFP 420

Query: 421 ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480
            LALE+Y+ MV+ GL PDKYSVLGVL GLCESRRIGEAVRLYN ILLNYTGVDAHIHTV 
Sbjct: 421 LLALEVYDEMVDKGLTPDKYSVLGVLIGLCESRRIGEAVRLYNDILLNYTGVDAHIHTVT 480

Query: 481 IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI 540
           IDGLIKAGK+HSAIR+FR NLLEENSLDVVSYSVAI GLL+VGR+TEA N+YNYMKE GI
Sbjct: 481 IDGLIKAGKYHSAIRVFRTNLLEENSLDVVSYSVAICGLLMVGRDTEACNMYNYMKEVGI 540

Query: 541 NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI 600
           NPN H+CN+MLS+FCKEK F  +KQMLQEMIDLGI+MSRNNFFRLYNAICRSSN  HLVI
Sbjct: 541 NPNRHICNIMLSSFCKEKNFESMKQMLQEMIDLGIEMSRNNFFRLYNAICRSSNDPHLVI 600

Query: 601 YLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSSEEYTDV 660
            LLIEMKVLGLLP K  C+TLV N PK VNI+EKHYKLLNGCLEYCL  DTSSSEEYTDV
Sbjct: 601 CLLIEMKVLGLLPGKXACKTLVDNSPKAVNITEKHYKLLNGCLEYCLFCDTSSSEEYTDV 660

Query: 661 AAFVG 665
           AA VG
Sbjct: 661 AASVG 664

BLAST of CsGy1G024257 vs. NCBI nr
Match: XP_011659015.1 (putative pentatricopeptide repeat-containing protein At1g16830 isoform X2 [Cucumis sativus])

HSP 1 Score: 1132 bits (2929), Expect = 0.0
Identity = 565/565 (100.00%), Postives = 565/565 (100.00%), Query Frame = 0

Query: 101 MVGVVSRLMKQYETVNGILGGLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRY 160
           MVGVVSRLMKQYETVNGILGGLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRY
Sbjct: 1   MVGVVSRLMKQYETVNGILGGLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRY 60

Query: 161 GFTPNTFARNVIMDVLFKVGRADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTF 220
           GFTPNTFARNVIMDVLFKVGRADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTF
Sbjct: 61  GFTPNTFARNVIMDVLFKVGRADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTF 120

Query: 221 RCMLRMGYCPNPGTFEVVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRL 280
           RCMLRMGYCPNPGTFEVVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRL
Sbjct: 121 RCMLRMGYCPNPGTFEVVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRL 180

Query: 281 RRTEEASSLVKKMKKSGCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILY 340
           RRTEEASSLVKKMKKSGCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILY
Sbjct: 181 RRTEEASSLVKKMKKSGCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILY 240

Query: 341 NVLIDSLAKNERYNDALSIFLSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLV 400
           NVLIDSLAKNERYNDALSIFLSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLV
Sbjct: 241 NVLIDSLAKNERYNDALSIFLSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLV 300

Query: 401 EVDLVACNSLLSYLGKAGFAALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVR 460
           EVDLVACNSLLSYLGKAGFAALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVR
Sbjct: 301 EVDLVACNSLLSYLGKAGFAALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVR 360

Query: 461 LYNGILLNYTGVDAHIHTVIIDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLL 520
           LYNGILLNYTGVDAHIHTVIIDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLL
Sbjct: 361 LYNGILLNYTGVDAHIHTVIIDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLL 420

Query: 521 LVGRNTEASNLYNYMKEAGINPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRN 580
           LVGRNTEASNLYNYMKEAGINPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRN
Sbjct: 421 LVGRNTEASNLYNYMKEAGINPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRN 480

Query: 581 NFFRLYNAICRSSNGSHLVIYLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLN 640
           NFFRLYNAICRSSNGSHLVIYLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLN
Sbjct: 481 NFFRLYNAICRSSNGSHLVIYLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLN 540

Query: 641 GCLEYCLCGDTSSSEEYTDVAAFVG 665
           GCLEYCLCGDTSSSEEYTDVAAFVG
Sbjct: 541 GCLEYCLCGDTSSSEEYTDVAAFVG 565

BLAST of CsGy1G024257 vs. NCBI nr
Match: XP_023001025.1 (putative pentatricopeptide repeat-containing protein At1g16830 [Cucurbita maxima] >XP_023001026.1 putative pentatricopeptide repeat-containing protein At1g16830 [Cucurbita maxima] >XP_023001027.1 putative pentatricopeptide repeat-containing protein At1g16830 [Cucurbita maxima] >XP_023001028.1 putative pentatricopeptide repeat-containing protein At1g16830 [Cucurbita maxima] >XP_023001029.1 putative pentatricopeptide repeat-containing protein At1g16830 [Cucurbita maxima])

HSP 1 Score: 1076 bits (2782), Expect = 0.0
Identity = 542/665 (81.50%), Postives = 594/665 (89.32%), Query Frame = 0

Query: 1   MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL 60
           MIWR+GCNIF+IASRQI+RTFAHQ YR+C+   INLTK LHN+L+EDIEQKFSENRQVIL
Sbjct: 1   MIWRHGCNIFRIASRQILRTFAHQPYRKCSASLINLTKNLHNQLNEDIEQKFSENRQVIL 60

Query: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG 120
           TN+LV+TTLL CSSDL+ LSFFMWCAKQPNFFHN  AF+YMVGVVSRLM++YETV+GILG
Sbjct: 61  TNELVHTTLLTCSSDLVTLSFFMWCAKQPNFFHNTTAFEYMVGVVSRLMERYETVSGILG 120

Query: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG 180
            LESVG VTKAQTFLLLLRIYWRGGMY+LVFEAFDHMDR GFTPNTFARNVIMDVLFKVG
Sbjct: 121 ALESVGIVTKAQTFLLLLRIYWRGGMYELVFEAFDHMDRCGFTPNTFARNVIMDVLFKVG 180

Query: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN 240
           RADVALKVFKETLLPNFLTFNIVLCNLSK +DLIGI D FRCMLRMGY PNPGTFEVVLN
Sbjct: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKIRDLIGIRDAFRCMLRMGYSPNPGTFEVVLN 240

Query: 241 GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP 300
            LCKLGRL EAYQV GIMTT GISMSVNIWTIMIDG  RL RT +ASSLVKKM+KSGCSP
Sbjct: 241 DLCKLGRLVEAYQVLGIMTTIGISMSVNIWTIMIDGLRRLNRTSDASSLVKKMEKSGCSP 300

Query: 301 NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF 360
           NIVTYTTLIKGY+ AQ +SDAFDVLSII  EG  PD +LYNVLID LAK  RY++AL IF
Sbjct: 301 NIVTYTTLIKGYMDAQLVSDAFDVLSIIRLEGLPPDRVLYNVLIDGLAKIGRYDEALCIF 360

Query: 361 LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA 420
           LS+ K+NILPDCYTFSSLLNTICLSKR+FLLPKLVDGF VEVDLVACNSLLSYL KAGF 
Sbjct: 361 LSMDKQNILPDCYTFSSLLNTICLSKRVFLLPKLVDGFFVEVDLVACNSLLSYLAKAGFP 420

Query: 421 ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480
           +LALELY+ M++ GLMPDKYSVLGVLTGLC+++RI EAV LYNGILLNYTGVDAHIHTVI
Sbjct: 421 SLALELYDEMLDKGLMPDKYSVLGVLTGLCQAKRIDEAVNLYNGILLNYTGVDAHIHTVI 480

Query: 481 IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI 540
           I+GLIKAGK HSAIR+FR+ LLE++SLD VSYSVAIRGLL+VGR TEA NLYNYMKEAGI
Sbjct: 481 INGLIKAGKCHSAIRVFRKKLLEKSSLDAVSYSVAIRGLLMVGRGTEAYNLYNYMKEAGI 540

Query: 541 NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI 600
           NPNG +CNVMLS+FCKEK F  VKQMLQEMIDLGI++S NNFFRL NAI  SS   +LV 
Sbjct: 541 NPNGDMCNVMLSSFCKEKDFESVKQMLQEMIDLGIEISWNNFFRLCNAIYSSSYNFYLVS 600

Query: 601 YLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSSEEYTDV 660
           +LL+EMK LGLLP K  C+TLV+   K VNISE+H+KLLNG LEYCL GDTSSS+EYT+V
Sbjct: 601 HLLVEMKGLGLLPDKLACKTLVYKRLKAVNISEEHHKLLNGHLEYCLFGDTSSSDEYTNV 660

Query: 661 AAFVG 665
           AA VG
Sbjct: 661 AASVG 665

BLAST of CsGy1G024257 vs. NCBI nr
Match: XP_023519013.1 (putative pentatricopeptide repeat-containing protein At1g16830 [Cucurbita pepo subsp. pepo] >XP_023519014.1 putative pentatricopeptide repeat-containing protein At1g16830 [Cucurbita pepo subsp. pepo] >XP_023519015.1 putative pentatricopeptide repeat-containing protein At1g16830 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1054 bits (2726), Expect = 0.0
Identity = 531/654 (81.19%), Postives = 583/654 (89.14%), Query Frame = 0

Query: 1   MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL 60
           MIWR+GCNIF+IA  +I+RTFAHQ +R+C+ P INLTK LHN+L+ED+EQKFSENRQVIL
Sbjct: 1   MIWRHGCNIFRIAPCKILRTFAHQPHRKCSAPLINLTKNLHNQLNEDMEQKFSENRQVIL 60

Query: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG 120
           TN+LV+TTLLNCSSDL+ LSFFMWCAKQPNFFHN  AF+YMVGVVSRLMK+YETV+GIL 
Sbjct: 61  TNELVHTTLLNCSSDLVTLSFFMWCAKQPNFFHNTTAFEYMVGVVSRLMKRYETVSGILS 120

Query: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG 180
           GLESVG VTKAQTFLLLLRIYWRGGMY+LVFEAFDHMDR GFTPNTFARNVIMDVLFKVG
Sbjct: 121 GLESVGIVTKAQTFLLLLRIYWRGGMYELVFEAFDHMDRRGFTPNTFARNVIMDVLFKVG 180

Query: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN 240
           RADVALKVFKETLLPNFLTFNIVLCNLSK +DLIGI D FRCMLRMGY PNPGTFEVVLN
Sbjct: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKIRDLIGIRDAFRCMLRMGYYPNPGTFEVVLN 240

Query: 241 GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP 300
            LCKLGRL EAYQV GIMTT GISMSVNIWTIMIDGF RL RT +ASSLVKKM+KSGCSP
Sbjct: 241 DLCKLGRLVEAYQVLGIMTTIGISMSVNIWTIMIDGFRRLHRTADASSLVKKMEKSGCSP 300

Query: 301 NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF 360
           NIVTYTTLIKGY+ AQ +SDAFDVLSIIE EG SPD +LYNVLID LAK  RY++AL IF
Sbjct: 301 NIVTYTTLIKGYMDAQLVSDAFDVLSIIELEGLSPDRVLYNVLIDGLAKIGRYDEALCIF 360

Query: 361 LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA 420
           LS+ K+NILPDCYTFSSLLNTICLSKR+FLLPKLVDGF  +VDLVACNSLLSYL KAGF 
Sbjct: 361 LSMDKQNILPDCYTFSSLLNTICLSKRVFLLPKLVDGFFADVDLVACNSLLSYLAKAGFP 420

Query: 421 ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480
           +LALELY+ M++ GLMPDKYSVLGVLTGLC+++RI EAV LYNGILLNYTGVDAHIHTVI
Sbjct: 421 SLALELYDEMLDKGLMPDKYSVLGVLTGLCQAKRIDEAVNLYNGILLNYTGVDAHIHTVI 480

Query: 481 IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI 540
           I+GLIKAGK HSAIR+FR+NLL  +SLD VSYSVAIRGLL+VGR TEA NLYNYMKEAGI
Sbjct: 481 INGLIKAGKCHSAIRVFRKNLLGRSSLDAVSYSVAIRGLLMVGRGTEACNLYNYMKEAGI 540

Query: 541 NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI 600
           NPNG +CNVMLS+FCKEK    VKQMLQEMIDLGI++S NNF RL NAI  SS   +LV 
Sbjct: 541 NPNGDMCNVMLSSFCKEKDLESVKQMLQEMIDLGIEISWNNFCRLCNAIYSSSYNFYLVS 600

Query: 601 YLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSS 654
           +LL+EMK LGLLP K  C+TLV+   K VNISE+H+KLLNG LEYCL GDTSSS
Sbjct: 601 HLLVEMKGLGLLPDKLACKTLVYKRLKAVNISEEHHKLLNGHLEYCLFGDTSSS 654

BLAST of CsGy1G024257 vs. ExPASy TrEMBL
Match: A0A0A0M137 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G537540 PE=4 SV=1)

HSP 1 Score: 1342 bits (3474), Expect = 0.0
Identity = 665/665 (100.00%), Postives = 665/665 (100.00%), Query Frame = 0

Query: 1   MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL 60
           MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL
Sbjct: 1   MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL 60

Query: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG 120
           TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG
Sbjct: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG 120

Query: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG 180
           GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG
Sbjct: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG 180

Query: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN 240
           RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN
Sbjct: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN 240

Query: 241 GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP 300
           GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP
Sbjct: 241 GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP 300

Query: 301 NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF 360
           NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF
Sbjct: 301 NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF 360

Query: 361 LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA 420
           LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA
Sbjct: 361 LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA 420

Query: 421 ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480
           ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI
Sbjct: 421 ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480

Query: 481 IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI 540
           IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI
Sbjct: 481 IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI 540

Query: 541 NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI 600
           NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI
Sbjct: 541 NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI 600

Query: 601 YLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSSEEYTDV 660
           YLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSSEEYTDV
Sbjct: 601 YLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSSEEYTDV 660

Query: 661 AAFVG 665
           AAFVG
Sbjct: 661 AAFVG 665

BLAST of CsGy1G024257 vs. ExPASy TrEMBL
Match: A0A6J1KFA9 (putative pentatricopeptide repeat-containing protein At1g16830 OS=Cucurbita maxima OX=3661 GN=LOC111495287 PE=4 SV=1)

HSP 1 Score: 1076 bits (2782), Expect = 0.0
Identity = 542/665 (81.50%), Postives = 594/665 (89.32%), Query Frame = 0

Query: 1   MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL 60
           MIWR+GCNIF+IASRQI+RTFAHQ YR+C+   INLTK LHN+L+EDIEQKFSENRQVIL
Sbjct: 1   MIWRHGCNIFRIASRQILRTFAHQPYRKCSASLINLTKNLHNQLNEDIEQKFSENRQVIL 60

Query: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG 120
           TN+LV+TTLL CSSDL+ LSFFMWCAKQPNFFHN  AF+YMVGVVSRLM++YETV+GILG
Sbjct: 61  TNELVHTTLLTCSSDLVTLSFFMWCAKQPNFFHNTTAFEYMVGVVSRLMERYETVSGILG 120

Query: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG 180
            LESVG VTKAQTFLLLLRIYWRGGMY+LVFEAFDHMDR GFTPNTFARNVIMDVLFKVG
Sbjct: 121 ALESVGIVTKAQTFLLLLRIYWRGGMYELVFEAFDHMDRCGFTPNTFARNVIMDVLFKVG 180

Query: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN 240
           RADVALKVFKETLLPNFLTFNIVLCNLSK +DLIGI D FRCMLRMGY PNPGTFEVVLN
Sbjct: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKIRDLIGIRDAFRCMLRMGYSPNPGTFEVVLN 240

Query: 241 GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP 300
            LCKLGRL EAYQV GIMTT GISMSVNIWTIMIDG  RL RT +ASSLVKKM+KSGCSP
Sbjct: 241 DLCKLGRLVEAYQVLGIMTTIGISMSVNIWTIMIDGLRRLNRTSDASSLVKKMEKSGCSP 300

Query: 301 NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF 360
           NIVTYTTLIKGY+ AQ +SDAFDVLSII  EG  PD +LYNVLID LAK  RY++AL IF
Sbjct: 301 NIVTYTTLIKGYMDAQLVSDAFDVLSIIRLEGLPPDRVLYNVLIDGLAKIGRYDEALCIF 360

Query: 361 LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA 420
           LS+ K+NILPDCYTFSSLLNTICLSKR+FLLPKLVDGF VEVDLVACNSLLSYL KAGF 
Sbjct: 361 LSMDKQNILPDCYTFSSLLNTICLSKRVFLLPKLVDGFFVEVDLVACNSLLSYLAKAGFP 420

Query: 421 ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480
           +LALELY+ M++ GLMPDKYSVLGVLTGLC+++RI EAV LYNGILLNYTGVDAHIHTVI
Sbjct: 421 SLALELYDEMLDKGLMPDKYSVLGVLTGLCQAKRIDEAVNLYNGILLNYTGVDAHIHTVI 480

Query: 481 IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI 540
           I+GLIKAGK HSAIR+FR+ LLE++SLD VSYSVAIRGLL+VGR TEA NLYNYMKEAGI
Sbjct: 481 INGLIKAGKCHSAIRVFRKKLLEKSSLDAVSYSVAIRGLLMVGRGTEAYNLYNYMKEAGI 540

Query: 541 NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI 600
           NPNG +CNVMLS+FCKEK F  VKQMLQEMIDLGI++S NNFFRL NAI  SS   +LV 
Sbjct: 541 NPNGDMCNVMLSSFCKEKDFESVKQMLQEMIDLGIEISWNNFFRLCNAIYSSSYNFYLVS 600

Query: 601 YLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSSEEYTDV 660
           +LL+EMK LGLLP K  C+TLV+   K VNISE+H+KLLNG LEYCL GDTSSS+EYT+V
Sbjct: 601 HLLVEMKGLGLLPDKLACKTLVYKRLKAVNISEEHHKLLNGHLEYCLFGDTSSSDEYTNV 660

Query: 661 AAFVG 665
           AA VG
Sbjct: 661 AASVG 665

BLAST of CsGy1G024257 vs. ExPASy TrEMBL
Match: A0A6J1CES7 (putative pentatricopeptide repeat-containing protein At1g16830 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111010829 PE=4 SV=1)

HSP 1 Score: 1024 bits (2648), Expect = 0.0
Identity = 514/665 (77.29%), Postives = 573/665 (86.17%), Query Frame = 0

Query: 1   MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL 60
           M WR+G ++FQIA RQ +RT AHQS  +C+TP +NLT+ LHNRL+++ E+K+SE  QVIL
Sbjct: 1   MKWRHGRSLFQIAPRQFLRTLAHQSGPKCSTPLVNLTEKLHNRLNQNAERKYSEKGQVIL 60

Query: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG 120
           TN+LV+TTLLNC SDL+ALSFF+WCAKQP+FFH+  AFDYMVGVVSRLMK+YETVNG++G
Sbjct: 61  TNELVHTTLLNCPSDLVALSFFLWCAKQPDFFHSATAFDYMVGVVSRLMKRYETVNGVVG 120

Query: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG 180
           GLESVGNVTKAQTFLLLLRIYWRGGMY+LVFEAFD M   GFTPNTFARNVI+DVLFKVG
Sbjct: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYELVFEAFDQMGHCGFTPNTFARNVIVDVLFKVG 180

Query: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN 240
           RADVALKVFKET LPNFLTFNIVLCNLSK KD +GIGD  R MLRMGY PNPGTFE VLN
Sbjct: 181 RADVALKVFKETPLPNFLTFNIVLCNLSKIKDFVGIGDAVRRMLRMGYYPNPGTFEGVLN 240

Query: 241 GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP 300
             CKLGRLAEAYQV GIMTT GISMSVNIWTI++DGFCRL RT +ASSLVKKMK++GC P
Sbjct: 241 CFCKLGRLAEAYQVLGIMTTLGISMSVNIWTIIVDGFCRLHRTADASSLVKKMKRTGCFP 300

Query: 301 NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF 360
           NIVTYTTLIK Y+ AQ  SDA  VLSIIESEG SPD +LYNVLIDSLAK  RY+ ALSIF
Sbjct: 301 NIVTYTTLIKAYLEAQLFSDASSVLSIIESEGHSPDRVLYNVLIDSLAKIGRYDKALSIF 360

Query: 361 LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA 420
           LSL K+NI PD YTFSSLLN ICLSK  FLLPKLVDGF VE DLVACNSLL+YL KAGF 
Sbjct: 361 LSLDKQNIYPDSYTFSSLLNVICLSKMFFLLPKLVDGFFVEADLVACNSLLNYLSKAGFP 420

Query: 421 ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480
           +LALELY+ M++ GLMPDKYSVLGVLTGLC SRRI EAV LYNGILLN+TGVDAHIHTVI
Sbjct: 421 SLALELYDEMLDKGLMPDKYSVLGVLTGLCGSRRIDEAVHLYNGILLNHTGVDAHIHTVI 480

Query: 481 IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI 540
           IDGLIKAGK HSAIR+FR+ LLEENSLDVVS++V IRGLL+V R+TEA NLYN+MKE GI
Sbjct: 481 IDGLIKAGKCHSAIRVFRKTLLEENSLDVVSFTVGIRGLLMVSRHTEALNLYNHMKEVGI 540

Query: 541 NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI 600
           NPNGH+CN+ML  FCKE+    V+QMLQEMIDL I+MS NNF RL NAICRSS  S LVI
Sbjct: 541 NPNGHICNLMLFNFCKERNLESVEQMLQEMIDLRIEMSCNNFVRLCNAICRSSYNSRLVI 600

Query: 601 YLLIEMKVLGLLPRKRDCETLVHNPPKDVNISEKHYKLLNGCLEYCLCGDTSSSEEYTDV 660
           YLLIEM+ LGLLP K  C+ LVH PPK VN+ EKH++LLNGC+EY L G TS SEEYTDV
Sbjct: 601 YLLIEMRDLGLLPAKLACKILVHKPPKVVNVFEKHHELLNGCIEYGLFGYTSGSEEYTDV 660

Query: 661 AAFVG 665
           +A VG
Sbjct: 661 SASVG 665

BLAST of CsGy1G024257 vs. ExPASy TrEMBL
Match: A0A6J1EGE1 (putative pentatricopeptide repeat-containing protein At1g16830 OS=Cucurbita moschata OX=3662 GN=LOC111434100 PE=4 SV=1)

HSP 1 Score: 1007 bits (2604), Expect = 0.0
Identity = 506/618 (81.88%), Postives = 554/618 (89.64%), Query Frame = 0

Query: 1   MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL 60
           MIW +GCNIF+IA RQI+RTFAHQ +R+C  P INLTK LHN+L++DIEQKFSENRQVIL
Sbjct: 1   MIWGHGCNIFRIAPRQILRTFAHQPHRKCLAPLINLTKNLHNQLNQDIEQKFSENRQVIL 60

Query: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG 120
           TN+LV+TTLLNCSSDL+ LSFFMWCAKQPNFFHN  AF+YMVGVVSRLM++YETV+GILG
Sbjct: 61  TNELVHTTLLNCSSDLVTLSFFMWCAKQPNFFHNTTAFEYMVGVVSRLMERYETVSGILG 120

Query: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG 180
            LESVG VTKAQTFLLLLRIYWRGGMY+LVFEAFDHMDR GFTPNTFARNVIMDVLFKVG
Sbjct: 121 ALESVGIVTKAQTFLLLLRIYWRGGMYELVFEAFDHMDRCGFTPNTFARNVIMDVLFKVG 180

Query: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN 240
           RADVAL+VFKETLLPNFLTFNIVLCNLSK +DLIGI D FRCMLRMGY PNPGTFEVVLN
Sbjct: 181 RADVALEVFKETLLPNFLTFNIVLCNLSKIRDLIGIRDAFRCMLRMGYYPNPGTFEVVLN 240

Query: 241 GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP 300
            LCKLGRL EAYQV GIMTT GISMSVNIWTIMIDGF RL RT +ASSLVKKM+ SGCSP
Sbjct: 241 DLCKLGRLVEAYQVLGIMTTIGISMSVNIWTIMIDGFRRLHRTADASSLVKKMEISGCSP 300

Query: 301 NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF 360
           NIVTYTTLIKGY+ AQ +SDAFDVLSIIE EG SPD +LYNVLID LAK  RY++AL IF
Sbjct: 301 NIVTYTTLIKGYMDAQLVSDAFDVLSIIELEGLSPDRVLYNVLIDGLAKIGRYDEALCIF 360

Query: 361 LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA 420
           LS+ K+NILPDCYTFSSLLNTICLSKR+FLLPKLVDGF VEVDLVACNSLLSYLGKAGF 
Sbjct: 361 LSMDKQNILPDCYTFSSLLNTICLSKRVFLLPKLVDGFFVEVDLVACNSLLSYLGKAGFP 420

Query: 421 ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480
           +LALELY+ M++ GLMPDKYSVLGVLTGLC+++RI EAV LYNGILLNYTGVDAHIHTVI
Sbjct: 421 SLALELYDEMLDKGLMPDKYSVLGVLTGLCQAKRIDEAVNLYNGILLNYTGVDAHIHTVI 480

Query: 481 IDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGI 540
           I+GLIKAGK HSAIR+FR+ LLE++SLD VSYSVAIRGLL+VGR TEA  LYNYMKEAGI
Sbjct: 481 INGLIKAGKCHSAIRVFRKKLLEKSSLDAVSYSVAIRGLLMVGRGTEACKLYNYMKEAGI 540

Query: 541 NPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVI 600
           NPNG +CNVMLS+FCKEK F  VKQMLQEMIDLGI++S NNFFRL NAI  SS   + V 
Sbjct: 541 NPNGDMCNVMLSSFCKEKDFESVKQMLQEMIDLGIEISWNNFFRLCNAIYSSSYNFYPVS 600

Query: 601 YLLIEMKVLGLLPRKRDC 618
           +LL+EMK LGLLP K  C
Sbjct: 601 HLLVEMKGLGLLPDKLGC 618

BLAST of CsGy1G024257 vs. ExPASy TrEMBL
Match: A0A1S3BD22 (putative pentatricopeptide repeat-containing protein At1g16830 OS=Cucumis melo OX=3656 GN=LOC103488546 PE=4 SV=1)

HSP 1 Score: 912 bits (2356), Expect = 0.0
Identity = 453/487 (93.02%), Postives = 465/487 (95.48%), Query Frame = 0

Query: 1   MIWRYGCNIFQIASRQIIRTFAHQSYRRCTTPHINLTKILHNRLDEDIEQKFSENRQVIL 60
           MIWRYGCNIFQIA RQI R FA  SYR+CTTP INLTKILHNRLDEDIEQK  +NRQVIL
Sbjct: 1   MIWRYGCNIFQIAPRQIFRAFARHSYRQCTTPLINLTKILHNRLDEDIEQKIPKNRQVIL 60

Query: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILG 120
           TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETV+GILG
Sbjct: 61  TNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVDGILG 120

Query: 121 GLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKVG 180
           GLESVGNVTKAQTFLLLLRIYWRGGM+DLVFEAFDHM+  GFTPNTFARNVIMDVLFKVG
Sbjct: 121 GLESVGNVTKAQTFLLLLRIYWRGGMFDLVFEAFDHMNHCGFTPNTFARNVIMDVLFKVG 180

Query: 181 RADVALKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLN 240
           RADVALKVFKET LPNFLTFNIVLCNLSK KDLIGIGD FRCMLRMGY PNPGTFEVVLN
Sbjct: 181 RADVALKVFKETQLPNFLTFNIVLCNLSKIKDLIGIGDAFRCMLRMGYYPNPGTFEVVLN 240

Query: 241 GLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKSGCSP 300
           GLCKLGRLAEAYQV GIMTTFGIS+SVNIWTIMIDGFCRL RT+EA+SLVKKMKKSGCSP
Sbjct: 241 GLCKLGRLAEAYQVLGIMTTFGISISVNIWTIMIDGFCRLHRTDEATSLVKKMKKSGCSP 300

Query: 301 NIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIF 360
           NIVTYTTLIKGYIYAQR+ DAFDVLSIIESEGPSPDL+LYNVLIDSLAKNERYNDALSIF
Sbjct: 301 NIVTYTTLIKGYIYAQRVIDAFDVLSIIESEGPSPDLVLYNVLIDSLAKNERYNDALSIF 360

Query: 361 LSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGKAGFA 420
           LSL KR ILPDCYTFSSLLNTICLSKRLFLLPKLVDGF VEVDLVACNSLLSYLGKAG A
Sbjct: 361 LSLDKRKILPDCYTFSSLLNTICLSKRLFLLPKLVDGFFVEVDLVACNSLLSYLGKAGLA 420

Query: 421 ALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480
           ALALELY+NMVN GLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI
Sbjct: 421 ALALELYDNMVNRGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVI 480

Query: 481 IDGLIKA 487
           +DGLIKA
Sbjct: 481 MDGLIKA 487

BLAST of CsGy1G024257 vs. TAIR 10
Match: AT1G16830.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 412.5 bits (1059), Expect = 6.3e-115
Identity = 214/535 (40.00%), Postives = 335/535 (62.62%), Query Frame = 0

Query: 60  LTNDLVYTTLLNCSSDLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGIL 119
           LT+D VY+ L    +DL  L+FF WCAKQ N+FH+  AFD+MVGVV +L ++Y +++ I+
Sbjct: 37  LTHDNVYSCLRESPADLKTLNFFFWCAKQNNYFHDDRAFDHMVGVVEKLTREYYSIDRII 96

Query: 120 GGLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIMDVLFKV 179
             L+  G   K + FLLLL I+WRG +YD   E +  M  +GF PNT A N++MDV FK+
Sbjct: 97  ERLKISGCEIKPRVFLLLLEIFWRGHIYDKAIEVYTGMSSFGFVPNTRAMNMMMDVNFKL 156

Query: 180 GRADVALKVFKETLLPNFLTFNIVL---CNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFE 239
              + AL++F+     NF +F+I L   C+     DL+G+    + M+  G+ PN   F 
Sbjct: 157 NVVNGALEIFEGIRFRNFFSFDIALSHFCSRGGRGDLVGVKIVLKRMIGEGFYPNRERFG 216

Query: 240 VVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKMKKS 299
            +L   C+ G ++EA+QV G+M   GIS+SVN+W++++ GF R    ++A  L  KM + 
Sbjct: 217 QILRLCCRTGCVSEAFQVVGLMICSGISVSVNVWSMLVSGFFRSGEPQKAVDLFNKMIQI 276

Query: 300 GCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDA 359
           GCSPN+VTYT+LIKG++    + +AF VLS ++SEG +PD++L N++I +  +  R+ +A
Sbjct: 277 GCSPNLVTYTSLIKGFVDLGMVDEAFTVLSKVQSEGLAPDIVLCNLMIHTYTRLGRFEEA 336

Query: 360 LSIFLSLHKRNILPDCYTFSSLLNTICLSKRLFLLPKLVDGFLVEVDLVACNSLLSYLGK 419
             +F SL KR ++PD YTF+S+L+++CLS +  L+P++  G   + DLV  N L +   K
Sbjct: 337 RKVFTSLEKRKLVPDQYTFASILSSLCLSGKFDLVPRITHGIGTDFDLVTGNLLSNCFSK 396

Query: 420 AGFAALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHI 479
            G+ + AL++ + M       D Y+    L+ LC       A+++Y  I+     +DAH 
Sbjct: 397 IGYNSYALKVLSIMSYKDFALDCYTYTVYLSALCRGGAPRAAIKMYKIIIKEKKHLDAHF 456

Query: 480 HTVIIDGLIKAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMK 539
           H+ IID LI+ GK+++A+ +F+R +LE+  LDVVSY+VAI+GL+   R  EA +L   MK
Sbjct: 457 HSAIIDSLIELGKYNTAVHLFKRCILEKYPLDVVSYTVAIKGLVRAKRIEEAYSLCCDMK 516

Query: 540 EAGINPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICR 592
           E GI PN      ++S  CKEK+   V+++L+E I  G+++  N  F++Y+ + R
Sbjct: 517 EGGIYPNRRTYRTIISGLCKEKETEKVRKILRECIQEGVELDPNTKFQVYSLLSR 571

BLAST of CsGy1G024257 vs. TAIR 10
Match: AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 179.5 bits (454), Expect = 9.0e-45
Identity = 139/546 (25.46%), Postives = 253/546 (46.34%), Query Frame = 0

Query: 78  ALSFFMWCAKQPNFFHNPAAFDYM---------VGVVSR----LMKQYETVNGILGGLES 137
           AL+F  W ++ P + H+  ++  +         VGVV +    ++K  ++V   L  L+ 
Sbjct: 106 ALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVGVVFKIRLLMIKSCDSVGDALYVLDL 165

Query: 138 VGNVTKAQTFLL-----------LLRIYWRGGMYDLVFEAFDHMDRYGFTPNTFARNVIM 197
              + K + F L           LL    R G+ D + + +  M      PN +  N ++
Sbjct: 166 CRKMNKDERFELKYKLIIGCYNTLLNSLARFGLVDEMKQVYMEMLEDKVCPNIYTYNKMV 225

Query: 198 DVLFKVGRADVA----LKVFKETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYC 257
           +   K+G  + A     K+ +  L P+F T+  ++    + KDL      F  M   G  
Sbjct: 226 NGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCR 285

Query: 258 PNPGTFEVVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSL 317
            N   +  +++GLC   R+ EA  ++  M       +V  +T++I   C   R  EA +L
Sbjct: 286 RNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNL 345

Query: 318 VKKMKKSGCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAK 377
           VK+M+++G  PNI TYT LI       +   A ++L  +  +G  P++I YN LI+   K
Sbjct: 346 VKEMEETGIKPNIHTYTVLIDSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCK 405

Query: 378 NERYNDALSIFLSLHKRNILPDCYTFSSLLNTICLS---KRLFLLPKLVDGFLVEVDLVA 437
                DA+ +   +  R + P+  T++ L+   C S   K + +L K+++  ++  D+V 
Sbjct: 406 RGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCKSNVHKAMGVLNKMLERKVLP-DVVT 465

Query: 438 CNSLLSYLGKAGFAALALELYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGIL 497
            NSL+    ++G    A  L + M + GL+PD+++   ++  LC+S+R+ EA  L++ + 
Sbjct: 466 YNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLE 525

Query: 498 LNYTGVDAHIHTVIIDGLIKAGKFHSAIRIFRRNLLEENSL-DVVSYSVAIRGLLLVGRN 557
                 +  ++T +IDG  KAGK   A  +    +L +N L + ++++  I GL   G+ 
Sbjct: 526 QKGVNPNVVMYTALIDGYCKAGKVDEA-HLMLEKMLSKNCLPNSLTFNALIHGLCADGKL 585

Query: 558 TEASNLYNYMKEAGINPNGHVCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRL 592
            EA+ L   M + G+ P      +++    K+  F       Q+M+  G K   + +   
Sbjct: 586 KEATLLEEKMVKIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTF 645

BLAST of CsGy1G024257 vs. TAIR 10
Match: AT1G63330.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 177.9 bits (450), Expect = 2.6e-44
Identity = 123/487 (25.26%), Postives = 239/487 (49.08%), Query Frame = 0

Query: 98  FDYMVGVVSRLMKQYETVNGILGGLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHM 157
           F+ ++  +++ MK+++ V  +   ++ +G      T+ +L+  + R     L       M
Sbjct: 13  FNKLLSAIAK-MKKFDLVISLGEKMQRLGISHNLYTYNILINCFCRRSQISLALALLGKM 72

Query: 158 DRYGFTPNTFARNVIMDVLFKVGRADVALKVFKETL----LPNFLTFNIVLCNL---SKT 217
            + G+ P+    + +++      R   A+ +  + +     P+ +TF  ++  L   +K 
Sbjct: 73  MKLGYEPSIVTLSSLLNGYCHGKRISDAVALVDQMVEMGYRPDTITFTTLIHGLFLHNKA 132

Query: 218 KDLIGIGDTFRCMLRMGYCPNPGTFEVVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIW 277
            + + + D    M++ G  PN  T+ VV+NGLCK G +  A+ +   M    I   V I+
Sbjct: 133 SEAVALVDR---MVQRGCQPNLVTYGVVVNGLCKRGDIDLAFNLLNKMEAAKIEADVVIF 192

Query: 278 TIMIDGFCRLRRTEEASSLVKKMKKSGCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIES 337
             +ID  C+ R  ++A +L K+M+  G  PN+VTY++LI       R SDA  +LS +  
Sbjct: 193 NTIIDSLCKYRHVDDALNLFKEMETKGIRPNVVTYSSLISCLCSYGRWSDASQLLSDMIE 252

Query: 338 EGPSPDLILYNVLIDSLAKNERYNDALSIFLSLHKRNILPDCYTFSSLLNTICLSKRLFL 397
           +  +P+L+ +N LID+  K  ++ +A  +   + KR+I PD +T++SL+N  C+  RL  
Sbjct: 253 KKINPNLVTFNALIDAFVKEGKFVEAEKLHDDMIKRSIDPDIFTYNSLINGFCMHDRLDK 312

Query: 398 LPKLVDGFLVE----VDLVACNSLLSYLGKAGFAALALELYNNMVNGGLMPDKYSVLGVL 457
             ++ + F+V      DL   N+L+    K+       EL+  M + GL+ D  +   ++
Sbjct: 313 AKQMFE-FMVSKDCFPDLDTYNTLIKGFCKSKRVEDGTELFREMSHRGLVGDTVTYTTLI 372

Query: 458 TGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVIIDGLIKAGKFHSAIRIFRRNLLEENS 517
            GL        A +++  ++ +    D   +++++DGL   GK   A+ +F      E  
Sbjct: 373 QGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSILLDGLCNNGKLEKALEVFDYMQKSEIK 432

Query: 518 LDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGINPNGHVCNVMLSTFCK----EKKFAL 570
           LD+  Y+  I G+   G+  +  +L+  +   G+ PN    N M+S  C     ++ +AL
Sbjct: 433 LDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKGVKPNVVTYNTMISGLCSKRLLQEAYAL 492

BLAST of CsGy1G024257 vs. TAIR 10
Match: AT1G62590.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 176.8 bits (447), Expect = 5.8e-44
Identity = 123/487 (25.26%), Postives = 239/487 (49.08%), Query Frame = 0

Query: 98  FDYMVGVVSRLMKQYETVNGILGGLESVGNVTKAQTFLLLLRIYWRGGMYDLVFEAFDHM 157
           F+ ++  +++ MK+++ V  +   ++ +  V    T+ +L+  + R     L       M
Sbjct: 88  FNKLLSAIAK-MKKFDVVISLGEKMQRLEIVHGLYTYNILINCFCRRSQISLALALLGKM 147

Query: 158 DRYGFTPNTFARNVIMDVLFKVGRADVALKVFKETL----LPNFLTFNIVLCNL---SKT 217
            + G+ P+    + +++      R   A+ +  + +     P+ +TF  ++  L   +K 
Sbjct: 148 MKLGYEPSIVTLSSLLNGYCHGKRISDAVALVDQMVEMGYRPDTITFTTLIHGLFLHNKA 207

Query: 218 KDLIGIGDTFRCMLRMGYCPNPGTFEVVLNGLCKLGRLAEAYQVWGIMTTFGISMSVNIW 277
            + + + D    M++ G  PN  T+ VV+NGLCK G    A  +   M    I   V I+
Sbjct: 208 SEAVALVDR---MVQRGCQPNLVTYGVVVNGLCKRGDTDLALNLLNKMEAAKIEADVVIF 267

Query: 278 TIMIDGFCRLRRTEEASSLVKKMKKSGCSPNIVTYTTLIKGYIYAQRISDAFDVLSIIES 337
             +ID  C+ R  ++A +L K+M+  G  PN+VTY++LI       R SDA  +LS +  
Sbjct: 268 NTIIDSLCKYRHVDDALNLFKEMETKGIRPNVVTYSSLISCLCSYGRWSDASQLLSDMIE 327

Query: 338 EGPSPDLILYNVLIDSLAKNERYNDALSIFLSLHKRNILPDCYTFSSLLNTICLSKRLFL 397
           +  +P+L+ +N LID+  K  ++ +A  ++  + KR+I PD +T++SL+N  C+  RL  
Sbjct: 328 KKINPNLVTFNALIDAFVKEGKFVEAEKLYDDMIKRSIDPDIFTYNSLVNGFCMHDRLDK 387

Query: 398 LPKLVDGFLVE----VDLVACNSLLSYLGKAGFAALALELYNNMVNGGLMPDKYSVLGVL 457
             ++ + F+V      D+V  N+L+    K+       EL+  M + GL+ D  +   ++
Sbjct: 388 AKQMFE-FMVSKDCFPDVVTYNTLIKGFCKSKRVEDGTELFREMSHRGLVGDTVTYTTLI 447

Query: 458 TGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVIIDGLIKAGKFHSAIRIFRRNLLEENS 517
            GL        A +++  ++ +    D   +++++DGL   GK   A+ +F      E  
Sbjct: 448 QGLFHDGDCDNAQKVFKQMVSDGVPPDIMTYSILLDGLCNNGKLEKALEVFDYMQKSEIK 507

Query: 518 LDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGINPNGHVCNVMLSTFCK----EKKFAL 570
           LD+  Y+  I G+   G+  +  +L+  +   G+ PN    N M+S  C     ++ +AL
Sbjct: 508 LDIYIYTTMIEGMCKAGKVDDGWDLFCSLSLKGVKPNVVTYNTMISGLCSKRLLQEAYAL 567

BLAST of CsGy1G024257 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 174.1 bits (440), Expect = 3.8e-43
Identity = 136/550 (24.73%), Postives = 253/550 (46.00%), Query Frame = 0

Query: 75  DLIALSFFMWCAKQPNFFHNPAAFDYMVGVVSRLMKQYETVNGILGGLESVGNVTKAQTF 134
           D  AL  F   +K+PNF   PA ++ ++  + R    ++ +  IL  ++S        TF
Sbjct: 63  DSAALRLFNLASKKPNFSPEPALYEEILLRLGR-SGSFDDMKKILEDMKSSRCEMGTSTF 122

Query: 135 LLLLRIYWRGGMYDLVFEAFDHM-DRYGFTPNTFARNVIMDVLFKVGRADVA----LKVF 194
           L+L+  Y +  + D +    D M D +G  P+T   N ++++L       +      K+ 
Sbjct: 123 LILIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRMLNLLVDGNSLKLVEISHAKMS 182

Query: 195 KETLLPNFLTFNIVLCNLSKTKDLIGIGDTFRCMLRMGYCPNPGTFEVVLNGLCKLGRLA 254
              + P+  TFN+++  L +   L         M   G  P+  TF  V+ G  + G L 
Sbjct: 183 VWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLD 242

Query: 255 EAYQVWGIMTTFGISMSVNIWTIMIDGFCRLRRTEEASSLVKKM-KKSGCSPNIVTYTTL 314
            A ++   M  FG S S     +++ GFC+  R E+A + +++M  + G  P+  T+ TL
Sbjct: 243 GALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTL 302

Query: 315 IKGYIYAQRISDAFDVLSIIESEGPSPDLILYNVLIDSLAKNERYNDALSIFLSLHKRNI 374
           + G   A  +  A +++ ++  EG  PD+  YN +I  L K     +A+ +   +  R+ 
Sbjct: 303 VNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDC 362

Query: 375 LPDCYTFSSLLNTICLSKRL---FLLPKLVDGFLVEVDLVACNSLLSYLGKAGFAALALE 434
            P+  T+++L++T+C   ++     L +++    +  D+   NSL+  L       +A+E
Sbjct: 363 SPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAME 422

Query: 435 LYNNMVNGGLMPDKYSVLGVLTGLCESRRIGEAVRLYNGILLNYTGVDAHIHTVIIDGLI 494
           L+  M + G  PD+++   ++  LC   ++ EA+ +   + L+        +  +IDG  
Sbjct: 423 LFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFC 482

Query: 495 KAGKFHSAIRIFRRNLLEENSLDVVSYSVAIRGLLLVGRNTEASNLYNYMKEAGINPNGH 554
           KA K   A  IF    +   S + V+Y+  I GL    R  +A+ L + M   G  P+ +
Sbjct: 483 KANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKY 542

Query: 555 VCNVMLSTFCKEKKFALVKQMLQEMIDLGIKMSRNNFFRLYNAICRSSNGSHLVIYLL-- 614
             N +L+ FC+         ++Q M   G +     +  L + +C++     +   LL  
Sbjct: 543 TYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGR-VEVASKLLRS 602

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q3EDA98.9e-11440.00Putative pentatricopeptide repeat-containing protein At1g16830 OS=Arabidopsis th... [more]
Q9LSL91.3e-4325.46Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX... [more]
Q9C8T73.7e-4325.26Pentatricopeptide repeat-containing protein At1g63330 OS=Arabidopsis thaliana OX... [more]
Q9SXD88.2e-4325.26Pentatricopeptide repeat-containing protein At1g62590 OS=Arabidopsis thaliana OX... [more]
Q9LFF15.3e-4224.73Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_011659004.10.0100.00putative pentatricopeptide repeat-containing protein At1g16830 isoform X1 [Cucum... [more]
XP_038894703.10.085.71LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g16... [more]
XP_011659015.10.0100.00putative pentatricopeptide repeat-containing protein At1g16830 isoform X2 [Cucum... [more]
XP_023001025.10.081.50putative pentatricopeptide repeat-containing protein At1g16830 [Cucurbita maxima... [more]
XP_023519013.10.081.19putative pentatricopeptide repeat-containing protein At1g16830 [Cucurbita pepo s... [more]
Match NameE-valueIdentityDescription
A0A0A0M1370.0100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G537540 PE=4 SV=1[more]
A0A6J1KFA90.081.50putative pentatricopeptide repeat-containing protein At1g16830 OS=Cucurbita maxi... [more]
A0A6J1CES70.077.29putative pentatricopeptide repeat-containing protein At1g16830 isoform X1 OS=Mom... [more]
A0A6J1EGE10.081.88putative pentatricopeptide repeat-containing protein At1g16830 OS=Cucurbita mosc... [more]
A0A1S3BD220.093.02putative pentatricopeptide repeat-containing protein At1g16830 OS=Cucumis melo O... [more]
Match NameE-valueIdentityDescription
AT1G16830.16.3e-11540.00Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G65560.19.0e-4525.46Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G63330.12.6e-4425.26Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G62590.15.8e-4425.26pentatricopeptide (PPR) repeat-containing protein [more]
AT3G53700.13.8e-4324.73Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 38..192
e-value: 7.2E-14
score: 53.7
coord: 388..467
e-value: 2.8E-9
score: 38.8
coord: 326..387
e-value: 4.8E-9
score: 38.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 193..325
e-value: 2.2E-29
score: 104.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 470..624
e-value: 5.6E-19
score: 70.7
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 335..383
e-value: 2.4E-10
score: 40.4
coord: 266..312
e-value: 1.8E-15
score: 56.9
coord: 403..450
e-value: 6.5E-8
score: 32.7
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 269..302
e-value: 1.4E-10
score: 38.7
coord: 405..438
e-value: 4.8E-4
score: 18.1
coord: 546..578
e-value: 2.0E-5
score: 22.4
coord: 234..266
e-value: 2.9E-4
score: 18.8
coord: 303..336
e-value: 7.9E-4
score: 17.4
coord: 339..372
e-value: 6.0E-7
score: 27.2
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 507..556
e-value: 0.0014
score: 18.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 478..499
e-value: 0.36
score: 11.2
coord: 133..162
e-value: 0.66
score: 10.4
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 226..258
e-value: 3.2E-6
score: 26.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 543..577
score: 9.602157
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 508..542
score: 10.303679
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 130..164
score: 8.95544
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 301..335
score: 9.832344
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 266..300
score: 12.616514
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 403..437
score: 10.150222
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 336..370
score: 10.599635
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 231..265
score: 10.117337
NoneNo IPR availablePANTHERPTHR47938:SF26OS10G0578500 PROTEINcoord: 16..663
NoneNo IPR availablePANTHERPTHR47938RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATEDcoord: 16..663

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy1G024257.1CsGy1G024257.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding