CsGy1G023930 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy1G023930
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Pentatricopeptide repeat-containing protein
Location: Gy14Chr1: 22652645 .. 22654988 (-)
RNA-Seq Expression: CsGy1G023930
Synteny: CsGy1G023930
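
The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, but not stated on this page); the function names parse_location and feature_length are hypothetical and introduced only for illustration.

```python
import re

def parse_location(text):
    """Split a location string such as 'Gy14Chr1: 22652645 .. 22654988 (-)'
    into (sequence id, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def feature_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1

seqid, start, end, strand = parse_location("Gy14Chr1: 22652645 .. 22654988 (-)")
print(seqid, strand, feature_length(start, end))
```

Under that assumption the gene spans 2344 bp on the minus strand of Gy14Chr1.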
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
GTTTGAAAAGCCAAACTCCGAGTCCAAAAATGTGCCGCTCATTTCGTCCGACATGCTTCACCTTCATCGATCAAAGCCCATTATTCATAGTCCCATTTTCCTCAACTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCCTCCTTTTCAGTCGATGCAACTCCATTCAACACCTCCAGCAAATTCATGCCAGGTTCATCCTCCACGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTTATCGATTGCTATGCCAATCTTGGACTCCTCAATCACTCTCTCCAAGTTTTCTGCTCTGTAATCGACCCCAATTTGACTCTTTTCAACGCCATACTGAGAAATTTGACAAGATATGGAGAATCCGAGCGGACCCTGTTGGTGTATCAACAAATGGTCGCCAAATCTATGCACCCAGATGAAGAAACTTACCCTTTTGTTTTGCGATCATGTTCTTCTTTTTCAAATGTTGGATTTGGGAGGACGATTCATGGGTATTTGGTTAAGCTGGGTTTTGATTTGTTTGATGTTGTAGCGACTGCTCTGGCTGAGATGTATGAGGAATGCATTGAATTTGAGAATGCTCATCAACTGTTTGATAAAAGATCTGTGAAGGATTTGGGATGGCCGAGTTCCTTGACTACGGAGGGTCCTCAAAATGATAACGGGGAGGGAATTTTTCGGGTTTTTGGAAGAATGATAGCAGAACAATTAGTACCAGACTCATTCACATTCTTCAATCTCTTGAGGTTCATTGCAGGTTTGAACTCAATTCAACTTGCAAAGATTGTTCATTGTATTGCAATTGTGAGCAAATTGAGTGGAGATTTGTTAGTAAATACTGCTGTGTTGTCTCTTTACTCTAAGTTGCGTAGCTTAGTAGATGCTAGAAAGTTATTTGACAAAATGCCAGAGAAAGATCGTGTTGTATGGAATATAATGATAGCAGCTTATGCTCGGGAAGGTAAACCGACGGAATGCCTTGAGCTCTTCAAGTCCATGGCACGATCAGGGATTAGATCTGATCTATTTACTGCACTGCCTGTTATCTCTTCGATTGCACAGTTGAAATGTGTTGATTGGGGGAAACAAACCCATGCCCATATATTGAGGAATGGTTCCGATAGTCAAGTTTCAGTTCATAATTCTCTCATTGACATGTACTGCGAATGTAAAATTTTAGATTCAGCTTGTAAGATCTTCAACTGGATGACAGACAAGTCTGTAATTTCATGGAGTGCTATGATCAAGGGGTATGTAAAAAATGGTCAGTCCCTCACTGCATTGTCTCTCTTCTCCAAGATGAAATCTGATGGGATTCAAGCTGATTTTGTTATAATGATCAATATCTTGCCTGCATTTGTTCACATTGGAGCACTTGAAAATGTCAAATATTTACATGGGTACTCAATGAAGCTAGGCCTGACTTCCCTTCCATCACTTAACACAGCCCTTCTGATAACCTATGCAAAATGTGGGTCCATAGAGATGGCTCAAAGACTATTTGAGGAAGAGAAAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAATCATGGAGACTGGTCCCAGTGTTTTAAGCTATACAATCGAATGAAGTGCTCAAATTCAAAGCCAGACCAAGTAACATTCTTGGGACTACTAACAGCTTGTGTCAATTCTGGTCTTGTCGAAAAGGGGAAAGAATTTTTCAAGGAGATGACTGAAAGTTATGGGTGCCAACCAAGCCAAGAGCATTATGCTTGTATGGTTAACCTCTTGGGGAGAGCTGGGCTTATCAGTGAAGCTGGAGAACTTGTAAAAAACATGCCTATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGATGCATCCTGGGTCCAAGCTTGCAGAGTTTGCGGCCGAGAAGCTCATTAACATGGAGCCCAGAAATGCAGGGAATTACATACTGCTTTCGAACATATATGCTGCTGCAGGTAAATGGG
ATGGAGTGGCAAAAATGAGAAGTTTCTTAAGGAATAAAGGGCTGAAGAAAATCCCTGGTTGTAGTTGGCTGGAGATAAATGGCCATGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGGAGATATATATACCATCCTAGGAAACCTTGAACTTGAAATCAAAGAGGTTAGAGAAAAAAGTCCAGATACATTGGTAAATCCTCTTCTATAACACTTGCATCCATTTTCTTAATGATATATCTCTTAACGTGACAAGTTTTACTCAATTCTTTTATTTACATATTCTTCGCACATTACATTGTTTAGTTACATGCCTTATTCTAATCTAACGAAG

mRNA sequence

GTTTGAAAAGCCAAACTCCGAGTCCAAAAATGTGCCGCTCATTTCGTCCGACATGCTTCACCTTCATCGATCAAAGCCCATTATTCATAGTCCCATTTTCCTCAACTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCCTCCTTTTCAGTCGATGCAACTCCATTCAACACCTCCAGCAAATTCATGCCAGGTTCATCCTCCACGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTTATCGATTGCTATGCCAATCTTGGACTCCTCAATCACTCTCTCCAAGTTTTCTGCTCTGTAATCGACCCCAATTTGACTCTTTTCAACGCCATACTGAGAAATTTGACAAGATATGGAGAATCCGAGCGGACCCTGTTGGTGTATCAACAAATGGTCGCCAAATCTATGCACCCAGATGAAGAAACTTACCCTTTTGTTTTGCGATCATGTTCTTCTTTTTCAAATGTTGGATTTGGGAGGACGATTCATGGGTATTTGGTTAAGCTGGGTTTTGATTTGTTTGATGTTGTAGCGACTGCTCTGGCTGAGATGTATGAGGAATGCATTGAATTTGAGAATGCTCATCAACTGTTTGATAAAAGATCTGTGAAGGATTTGGGATGGCCGAGTTCCTTGACTACGGAGGGTCCTCAAAATGATAACGGGGAGGGAATTTTTCGGGTTTTTGGAAGAATGATAGCAGAACAATTAGTACCAGACTCATTCACATTCTTCAATCTCTTGAGGTTCATTGCAGGTTTGAACTCAATTCAACTTGCAAAGATTGTTCATTGTATTGCAATTGTGAGCAAATTGAGTGGAGATTTGTTAGTAAATACTGCTGTGTTGTCTCTTTACTCTAAGTTGCGTAGCTTAGTAGATGCTAGAAAGTTATTTGACAAAATGCCAGAGAAAGATCGTGTTGTATGGAATATAATGATAGCAGCTTATGCTCGGGAAGGTAAACCGACGGAATGCCTTGAGCTCTTCAAGTCCATGGCACGATCAGGGATTAGATCTGATCTATTTACTGCACTGCCTGTTATCTCTTCGATTGCACAGTTGAAATGTGTTGATTGGGGGAAACAAACCCATGCCCATATATTGAGGAATGGTTCCGATAGTCAAGTTTCAGTTCATAATTCTCTCATTGACATGTACTGCGAATGTAAAATTTTAGATTCAGCTTGTAAGATCTTCAACTGGATGACAGACAAGTCTGTAATTTCATGGAGTGCTATGATCAAGGGGTATGTAAAAAATGGTCAGTCCCTCACTGCATTGTCTCTCTTCTCCAAGATGAAATCTGATGGGATTCAAGCTGATTTTGTTATAATGATCAATATCTTGCCTGCATTTGTTCACATTGGAGCACTTGAAAATGTCAAATATTTACATGGGTACTCAATGAAGCTAGGCCTGACTTCCCTTCCATCACTTAACACAGCCCTTCTGATAACCTATGCAAAATGTGGGTCCATAGAGATGGCTCAAAGACTATTTGAGGAAGAGAAAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAATCATGGAGACTGGTCCCAGTGTTTTAAGCTATACAATCGAATGAAGTGCTCAAATTCAAAGCCAGACCAAGTAACATTCTTGGGACTACTAACAGCTTGTGTCAATTCTGGTCTTGTCGAAAAGGGGAAAGAATTTTTCAAGGAGATGACTGAAAGTTATGGGTGCCAACCAAGCCAAGAGCATTATGCTTGTATGGTTAACCTCTTGGGGAGAGCTGGGCTTATCAGTGAAGCTGGAGAACTTGTAAAAAACATGCCTATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGATGCATCCTGGGTCCAAGCTTGCAGAGTTTGCGGCCGAGAAGCTCATTAACATGGAGCCCAGAAATGCAGGGAATTACATACTGCTTTCGAACATATATGCTGCTGCAGGTAAATGGG
ATGGAGTGGCAAAAATGAGAAGTTTCTTAAGGAATAAAGGGCTGAAGAAAATCCCTGGTTGTAGTTGGCTGGAGATAAATGGCCATGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGGAGATATATATACCATCCTAGGAAACCTTGAACTTGAAATCAAAGAGGTTAGAGAAAAAAGTCCAGATACATTGGTAAATCCTCTTCTATAACACTTGCATCCATTTTCTTAATGATATATCTCTTAACGTGACAAGTTTTACTCAATTCTTTTATTTACATATTCTTCGCACATTACATTGTTTAGTTACATGCCTTATTCTAATCTAACGAAG

Coding sequence (CDS)

ATGCTTCACCTTCATCGATCAAAGCCCATTATTCATAGTCCCATTTTCCTCAACTTTCCCGCCACCCAATCAAGACTGCTCAACACGCTTTCCCTCCTTTTCAGTCGATGCAACTCCATTCAACACCTCCAGCAAATTCATGCCAGGTTCATCCTCCACGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTTATCGATTGCTATGCCAATCTTGGACTCCTCAATCACTCTCTCCAAGTTTTCTGCTCTGTAATCGACCCCAATTTGACTCTTTTCAACGCCATACTGAGAAATTTGACAAGATATGGAGAATCCGAGCGGACCCTGTTGGTGTATCAACAAATGGTCGCCAAATCTATGCACCCAGATGAAGAAACTTACCCTTTTGTTTTGCGATCATGTTCTTCTTTTTCAAATGTTGGATTTGGGAGGACGATTCATGGGTATTTGGTTAAGCTGGGTTTTGATTTGTTTGATGTTGTAGCGACTGCTCTGGCTGAGATGTATGAGGAATGCATTGAATTTGAGAATGCTCATCAACTGTTTGATAAAAGATCTGTGAAGGATTTGGGATGGCCGAGTTCCTTGACTACGGAGGGTCCTCAAAATGATAACGGGGAGGGAATTTTTCGGGTTTTTGGAAGAATGATAGCAGAACAATTAGTACCAGACTCATTCACATTCTTCAATCTCTTGAGGTTCATTGCAGGTTTGAACTCAATTCAACTTGCAAAGATTGTTCATTGTATTGCAATTGTGAGCAAATTGAGTGGAGATTTGTTAGTAAATACTGCTGTGTTGTCTCTTTACTCTAAGTTGCGTAGCTTAGTAGATGCTAGAAAGTTATTTGACAAAATGCCAGAGAAAGATCGTGTTGTATGGAATATAATGATAGCAGCTTATGCTCGGGAAGGTAAACCGACGGAATGCCTTGAGCTCTTCAAGTCCATGGCACGATCAGGGATTAGATCTGATCTATTTACTGCACTGCCTGTTATCTCTTCGATTGCACAGTTGAAATGTGTTGATTGGGGGAAACAAACCCATGCCCATATATTGAGGAATGGTTCCGATAGTCAAGTTTCAGTTCATAATTCTCTCATTGACATGTACTGCGAATGTAAAATTTTAGATTCAGCTTGTAAGATCTTCAACTGGATGACAGACAAGTCTGTAATTTCATGGAGTGCTATGATCAAGGGGTATGTAAAAAATGGTCAGTCCCTCACTGCATTGTCTCTCTTCTCCAAGATGAAATCTGATGGGATTCAAGCTGATTTTGTTATAATGATCAATATCTTGCCTGCATTTGTTCACATTGGAGCACTTGAAAATGTCAAATATTTACATGGGTACTCAATGAAGCTAGGCCTGACTTCCCTTCCATCACTTAACACAGCCCTTCTGATAACCTATGCAAAATGTGGGTCCATAGAGATGGCTCAAAGACTATTTGAGGAAGAGAAAATTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAATCATGGAGACTGGTCCCAGTGTTTTAAGCTATACAATCGAATGAAGTGCTCAAATTCAAAGCCAGACCAAGTAACATTCTTGGGACTACTAACAGCTTGTGTCAATTCTGGTCTTGTCGAAAAGGGGAAAGAATTTTTCAAGGAGATGACTGAAAGTTATGGGTGCCAACCAAGCCAAGAGCATTATGCTTGTATGGTTAACCTCTTGGGGAGAGCTGGGCTTATCAGTGAAGCTGGAGAACTTGTAAAAAACATGCCTATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGATGCATCCTGGGTCCAAGCTTGCAGAGTTTGCGGCCGAGAAGCTCATTAACATGGAGCCCAGAAATGCAGGGAATTACATACTGCTTTCGAACATATATGCTGCTGCAGGTAAATGGGATGGAGTGGCAAAAATGAGAAGTTTCTTAAGGAATAAAGGGCTGAAGAAAAT
CCCTGGTTGTAGTTGGCTGGAGATAAATGGCCATGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGGAGATATATATACCATCCTAGGAAACCTTGAACTTGAAATCAAAGAGGTTAGAGAAAAAAGTCCAGATACATTGGTAAATCCTCTTCTATAA

Protein sequence

MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTLVNPLL*
Homology
BLAST of CsGy1G023930 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 436.4 bits (1121), Expect = 6.2e-121
Identity = 250/685 (36.50%), Positives = 379/685 (55.33%), Query Frame = 0

Query: 29  TLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLID-CYANLGL--LNHSLQVFCSV 88
           +LSLL + C ++Q L+ IHA+ I  G H      SKLI+ C  +     L +++ VF ++
Sbjct: 36  SLSLLHN-CKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTI 95

Query: 89  IDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGR 148
            +PNL ++N + R      +    L +Y  M++  + P+  T+PFVL+SC+       G+
Sbjct: 96  QEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQ 155

Query: 149 TIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQND 208
            IHG+++KLG DL   V T+L  MY +    E+AH++FDK   +                
Sbjct: 156 QIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHR---------------- 215

Query: 209 NGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNT 268
                                                                 D++  T
Sbjct: 216 ------------------------------------------------------DVVSYT 275

Query: 269 AVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRS 328
           A++  Y+    + +A+KLFD++P KD V WN MI+ YA  G   E LELFK M ++ +R 
Sbjct: 276 ALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRP 335

Query: 329 DLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIF 388
           D  T + V+S+ AQ   ++ G+Q H  I  +G  S + + N+LID+Y +C  L++AC +F
Sbjct: 336 DESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLF 395

Query: 389 NWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALE 448
             +  K VISW+ +I GY        AL LF +M   G   + V M++ILPA  H+GA++
Sbjct: 396 ERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAID 455

Query: 449 NVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMI 508
             +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    I  K L  WN+MI
Sbjct: 456 IGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVF--NSILHKSLSSWNAMI 515

Query: 509 SAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGC 568
              A HG     F L++RM+    +PD +TF+GLL+AC +SG+++ G+  F+ MT+ Y  
Sbjct: 516 FGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKM 575

Query: 569 QPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAE 628
            P  EHY CM++LLG +GL  EA E++  M ++PD  +W  LL ACKMH   +L E  AE
Sbjct: 576 TPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAE 635

Query: 629 KLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFR 688
            LI +EP N G+Y+LLSNIYA+AG+W+ VAK R+ L +KG+KK+PGCS +EI+  V EF 
Sbjct: 636 NLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFI 647

Query: 689 VADQTHPRAGDIYTILGNLELEIKE 709
           + D+ HPR  +IY +L  +E+ +++
Sbjct: 696 IGDKFHPRNREIYGMLEEMEVLLEK 647
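
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. The following is a minimal sketch that recomputes them from a header line; the function name hsp_percentages and the regular expression are hypothetical helpers written for this page's layout, not part of BLAST itself.

```python
import re

# Matches the statistics line of an HSP header as printed on this page.
# 'Pos\w+' tolerates spelling variants of the 'Positives' label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts,
    rounded to two decimals."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))

print(hsp_percentages(
    "Identity = 250/685 (36.50%), Positives = 379/685 (55.33%), Query Frame = 0"
))
```

For the Q9LN01 hit above, 250 identical and 379 positive positions over 685 aligned columns give 36.50% and 55.33%, which agrees with the reported values.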

BLAST of CsGy1G023930 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 419.9 bits (1078), Expect = 6.0e-116
Identity = 221/678 (32.60%), Positives = 372/678 (54.87%), Query Frame = 0

Query: 31  SLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNL 90
           +LL  RC+S++ L+QI      +G +Q     +KL+  +   G ++ + +VF  +     
Sbjct: 41  ALLLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLN 100

Query: 91  TLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGY 150
            L++ +L+   +  + ++ L  + +M    + P    + ++L+ C   + +  G+ IHG 
Sbjct: 101 VLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGL 160

Query: 151 LVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGI 210
           LVK GF L     T L  MY +C +   A ++FD+   +DL   +++     QN      
Sbjct: 161 LVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMA 220

Query: 211 FRVFGRMIAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSL 270
             +   M  E L P   T  ++L  ++ L  I + K +H  A+ S     + ++TA++ +
Sbjct: 221 LEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDM 280

Query: 271 YSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRSDLFTA 330
           Y+K  SL  AR+LFD M E++ V WN MI AY +   P E + +F+ M   G++    + 
Sbjct: 281 YAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSV 340

Query: 331 LPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTD 390
           +  + + A L  ++ G+  H   +  G D  VSV NSLI MYC+CK +D+A  +F  +  
Sbjct: 341 MGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQS 400

Query: 391 KSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYL 450
           ++++SW+AMI G+ +NG+ + AL+ FS+M+S  ++ D    ++++ A   +    + K++
Sbjct: 401 RTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWI 460

Query: 451 HGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHG 510
           HG  M+  L     + TAL+  YAKCG+I +A+ +F  + + ++ +  WN+MI  +  HG
Sbjct: 461 HGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIF--DMMSERHVTTWNAMIDGYGTHG 520

Query: 511 DWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHY 570
                 +L+  M+    KP+ VTFL +++AC +SGLVE G + F  M E+Y  + S +HY
Sbjct: 521 FGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHY 580

Query: 571 ACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEP 630
             MV+LLGRAG ++EA + +  MP+KP   V+G +L AC++H     AE AAE+L  + P
Sbjct: 581 GAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNP 640

Query: 631 RNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHP 690
            + G ++LL+NIY AA  W+ V ++R  +  +GL+K PGCS +EI   V  F      HP
Sbjct: 641 DDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHP 700

Query: 691 RAGDIYTILGNLELEIKE 709
            +  IY  L  L   IKE
Sbjct: 701 DSKKIYAFLEKLICHIKE 716

BLAST of CsGy1G023930 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 403.7 bits (1036), Expect = 4.5e-111
Identity = 235/730 (32.19%), Positives = 403/730 (55.21%), Query Frame = 0

Query: 23  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGL---LNHSL 82
           QS+           C +I  L+  H      G   + +  +KL+     LG    L+ + 
Sbjct: 28  QSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAK 87

Query: 83  QVF-CSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSF 142
           +VF  S       ++N+++R     G     +L++ +M+   + PD+ T+PF L +C+  
Sbjct: 88  EVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKS 147

Query: 143 SNVGFGRTIHGYLVKLGF--DLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSS 202
              G G  IHG +VK+G+  DLF  V  +L   Y EC E ++A ++FD+ S +++   +S
Sbjct: 148 RAKGNGIQIHGLIVKMGYAKDLF--VQNSLVHFYAECGELDSARKVFDEMSERNVVSWTS 207

Query: 203 LTTEGPQNDNGEGIFRVFGRMIA-EQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVS 262
           +     + D  +    +F RM+  E++ P+S T   ++   A L  ++  + V+     S
Sbjct: 208 MICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNS 267

Query: 263 KLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF 322
            +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R+G   E L +F
Sbjct: 268 GIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVF 327

Query: 323 KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCEC 382
             M  SG+R D  + L  ISS +QL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C
Sbjct: 328 NLMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKC 387

Query: 383 KILDSACKIFNWMTDKSVISWSAMIKGYVKNGQ--------------------------- 442
              D+A +IF+ M++K+V++W++++ GYV+NG+                           
Sbjct: 388 HRQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLV 447

Query: 443 --SL--TALSLFSKMKS-DGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLP 502
             SL   A+ +F  M+S +G+ AD V M++I  A  H+GAL+  K+++ Y  K G+    
Sbjct: 448 QGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDV 507

Query: 503 SLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMK 562
            L T L+  +++CG  E A  +F    + ++D+  W + I A A  G+  +  +L++ M 
Sbjct: 508 RLGTTLVDMFSRCGDPESAMSIF--NSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMI 567

Query: 563 CSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLI 622
               KPD V F+G LTAC + GLV++GKE F  M + +G  P   HY CMV+LLGRAGL+
Sbjct: 568 EQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLL 627

Query: 623 SEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIY 682
            EA +L+++MP++P+  +W  LL+AC++    ++A +AAEK+  + P   G+Y+LLSN+Y
Sbjct: 628 EEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVY 687

Query: 683 AAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLE 714
           A+AG+W+ +AK+R  ++ KGL+K PG S ++I G   EF   D++HP        + N+E
Sbjct: 688 ASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPE-------MPNIE 746

BLAST of CsGy1G023930 vs. ExPASy Swiss-Prot
Match: Q9STE1 (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 399.1 bits (1024), Expect = 1.1e-109
Identity = 211/655 (32.21%), Positives = 368/655 (56.18%), Query Frame = 0

Query: 54  GFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVY 113
           G   N  ++S LI  Y   G ++   ++F  V+  +  ++N +L    + G  +  +  +
Sbjct: 168 GMDCNEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAKCGALDSVIKGF 227

Query: 114 QQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEEC 173
             M    + P+  T+  VL  C+S   +  G  +HG +V  G D    +  +L  MY +C
Sbjct: 228 SVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKC 287

Query: 174 IEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLL 233
             F++A +LF   S  D    + + +   Q+   E     F  MI+  ++PD+ TF +LL
Sbjct: 288 GRFDDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLL 347

Query: 234 RFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRV 293
             ++   +++  K +HC  +   +S D+ + +A++  Y K R +  A+ +F +    D V
Sbjct: 348 PSVSKFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVV 407

Query: 294 VWNIMIAAYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHI 353
           V+  MI+ Y   G   + LE+F+ + +  I  +  T + ++  I  L  +  G++ H  I
Sbjct: 408 VFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFI 467

Query: 354 LRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTAL 413
           ++ G D++ ++  ++IDMY +C  ++ A +IF  ++ + ++SW++MI    ++     A+
Sbjct: 468 IKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAI 527

Query: 414 SLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITY 473
            +F +M   GI  D V +   L A  ++ +    K +HG+ +K  L S     + L+  Y
Sbjct: 528 DIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDVYSESTLIDMY 587

Query: 474 AKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRM-KCSNSKPDQV 533
           AKCG+++ A  +F+  K  +K+++ WNS+I+A  NHG       L++ M + S  +PDQ+
Sbjct: 588 AKCGNLKAAMNVFKTMK--EKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSGIRPDQI 647

Query: 534 TFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKN 593
           TFL ++++C + G V++G  FF+ MTE YG QP QEHYAC+V+L GRAG ++EA E VK+
Sbjct: 648 TFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEAYETVKS 707

Query: 594 MPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGV 653
           MP  PDA VWG LL AC++H   +LAE A+ KL++++P N+G Y+L+SN +A A +W+ V
Sbjct: 708 MPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHANAREWESV 767

Query: 654 AKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIK 708
            K+RS ++ + ++KIPG SW+EIN     F   D  HP +  IY++L +L  E++
Sbjct: 768 TKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYSLLNSLLGELR 820

BLAST of CsGy1G023930 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 7.1e-109
Identity = 222/670 (33.13%), Positives = 366/670 (54.63%), Query Frame = 0

Query: 53  HGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLV 112
           +GF  +  L SKL   Y N G L  + +VF  V       +N ++  L + G+   ++ +
Sbjct: 123 NGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGL 182

Query: 113 YQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEE 172
           +++M++  +  D  T+  V +S SS  +V  G  +HG+++K GF   + V  +L   Y +
Sbjct: 183 FKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLK 242

Query: 173 CIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNL 232
               ++A ++FD+ + +D+   +S+      N   E    VF +M+   +  D  T  ++
Sbjct: 243 NQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSV 302

Query: 233 LRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDR 292
               A    I L + VH I + +  S +      +L +YSK   L  A+ +F +M ++  
Sbjct: 303 FAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSV 362

Query: 293 VVWNIMIAAYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAH 352
           V +  MIA YAREG   E ++LF+ M   GI  D++T   V++  A+ + +D GK+ H  
Sbjct: 363 VSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEW 422

Query: 353 ILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTA 412
           I  N     + V N+L+DMY +C  +  A  +F+ M  K +ISW+ +I GY KN  +  A
Sbjct: 423 IKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEA 482

Query: 413 LSLFS-KMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLI 472
           LSLF+  ++      D   +  +LPA   + A +  + +HGY M+ G  S   +  +L+ 
Sbjct: 483 LSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVD 542

Query: 473 TYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQ 532
            YAKCG++ +A  LF++  I  KDL+ W  MI+ +  HG   +   L+N+M+ +  + D+
Sbjct: 543 MYAKCGALLLAHMLFDD--IASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADE 602

Query: 533 VTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVK 592
           ++F+ LL AC +SGLV++G  FF  M      +P+ EHYAC+V++L R G + +A   ++
Sbjct: 603 ISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIE 662

Query: 593 NMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDG 652
           NMPI PDA +WG LL  C++H   KLAE  AEK+  +EP N G Y+L++NIYA A KW+ 
Sbjct: 663 NMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQ 722

Query: 653 VAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVRE 712
           V ++R  +  +GL+K PGCSW+EI G V  F   D ++P          N+E  +++VR 
Sbjct: 723 VKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPET-------ENIEAFLRKVRA 782

Query: 713 KSPDTLVNPL 722
           +  +   +PL
Sbjct: 783 RMIEEGYSPL 783

BLAST of CsGy1G023930 vs. NCBI nr
Match: XP_008444579.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucumis melo] >KAA0054005.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK20690.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1357 bits (3511), Expect = 0.0
Identity = 680/722 (94.18%), Positives = 694/722 (96.12%), Query Frame = 0

Query: 1   MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPT 60
           MLHL RSKPIIH+PI LNFPATQSRLLNTLSLLF+RCNSIQHLQQIHARFILHGFHQNPT
Sbjct: 1   MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPT 60

Query: 61  LSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKS 120
           LSSKLIDCYANLGLL HSLQVFCS+IDPNLTLFNAILRNLTRYGESER LLVYQQMVAKS
Sbjct: 61  LSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKS 120

Query: 121 MHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAH 180
           MHPDEETYPF+ RSCSSFSNVGFGRTIHGYLVKLGFD FDVVATALAEMYE+ I FENAH
Sbjct: 121 MHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAH 180

Query: 181 QLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLN 240
           QLFDKRSVKDLGW SSLTTEG QN NGEGIFRVF RM AEQLVPDS TF NLLRFIAGLN
Sbjct: 181 QLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLN 240

Query: 241 SIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA 300
           SIQLAKIVHCIAIVSKLSGDLLV TAVLSLYSKLRSLVDAR+LFDKMPEKDRVVWNIMIA
Sbjct: 241 SIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA 300

Query: 301 AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360
           AYAREGKP ECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS
Sbjct: 301 AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360

Query: 361 QVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMK 420
           QVSVHNSLIDMYCECK+LDSAC IFNWMTDKSVISWSAMIKGYVKNGQSLTA SLFSKMK
Sbjct: 361 QVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMK 420

Query: 421 SDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIE 480
           SDGIQADFV MINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IE
Sbjct: 421 SDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIE 480

Query: 481 MAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540
           MAQRLFEEE+IDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA
Sbjct: 481 MAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540

Query: 541 CVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR 600
           CVNSGL+EKGKEFFKEMTESYGC PSQEH+ACMVNLLGRAGLISEAGELV+NMPIKPDAR
Sbjct: 541 CVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR 600

Query: 601 VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACKMHPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKW+ VAKMRSFLR
Sbjct: 601 VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLR 660

Query: 661 NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTLVNP 720
           NKGLKK PGCS LEING VTEFRVADQTHPRA DIYTILGNLELEIKEVREKS DTLVNP
Sbjct: 661 NKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILGNLELEIKEVREKSLDTLVNP 720

Query: 721 LL 722
           LL
Sbjct: 721 LL 722

BLAST of CsGy1G023930 vs. NCBI nr
Match: XP_038894029.1 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Benincasa hispida])

HSP 1 Score: 1274 bits (3297), Expect = 0.0
Identity = 635/721 (88.07%), Positives = 669/721 (92.79%), Query Frame = 0

Query: 1   MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPT 60
           MLHL RSKP+IHS IF NFPATQSRLLNTLS LF RC+S QHL+QIHARF+LHGFHQNPT
Sbjct: 34  MLHLQRSKPVIHSLIFPNFPATQSRLLNTLSFLFDRCSSRQHLKQIHARFVLHGFHQNPT 93

Query: 61  LSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKS 120
           LSSKLIDCYANLGLLN SLQVF S+ +PN T++NAILRNLTRYGE ERTLLVY+QMVAKS
Sbjct: 94  LSSKLIDCYANLGLLNLSLQVFYSITEPNSTIYNAILRNLTRYGECERTLLVYRQMVAKS 153

Query: 121 MHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAH 180
           MHPDEETYP VLRSC SFSNVG GR IHGYLVKLGFD FD+VATAL EMYEECI+FE+AH
Sbjct: 154 MHPDEETYPSVLRSCCSFSNVGSGRKIHGYLVKLGFDSFDMVATALGEMYEECIDFESAH 213

Query: 181 QLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLN 240
           QLFDKRSVKDL   SS TTE PQN NGEGIF VFGRM  EQLV DS TF NLLRFIAG N
Sbjct: 214 QLFDKRSVKDLECWSSFTTEAPQNGNGEGIFGVFGRMRVEQLVTDSLTFINLLRFIAGFN 273

Query: 241 SIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA 300
           SIQLAKIVHCIAIVSKL GDLLVNTAVLSLYSKL SLVDARKLFDKMPE DRVVWNIMIA
Sbjct: 274 SIQLAKIVHCIAIVSKLCGDLLVNTAVLSLYSKLGSLVDARKLFDKMPENDRVVWNIMIA 333

Query: 301 AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360
           AYAREGKPTECL LFKSMARSGIRSD+FTALPVISSI+QLK  DWGKQTHA+ILRNGSDS
Sbjct: 334 AYAREGKPTECLALFKSMARSGIRSDMFTALPVISSISQLKYFDWGKQTHAYILRNGSDS 393

Query: 361 QVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMK 420
           QVSV+NSLIDMYCEC ILDSACKIFNWM DK+VISWSAMIKGYVK+G SL ALSLFS MK
Sbjct: 394 QVSVYNSLIDMYCECNILDSACKIFNWMKDKTVISWSAMIKGYVKHGHSLIALSLFSSMK 453

Query: 421 SDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIE 480
           SDGIQ+DF+ +INILPAFVHIG LENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IE
Sbjct: 454 SDGIQSDFITVINILPAFVHIGVLENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIE 513

Query: 481 MAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540
           MAQR+FEEE+IDDKDLIMWNSMISAHANHGDWSQCFKLYN+MKCSN+KPDQVTFLGLLTA
Sbjct: 514 MAQRIFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNTKPDQVTFLGLLTA 573

Query: 541 CVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR 600
           CVNSGLVEKGKEF KEMTE+YGCQPSQEHYACMVNLLGRAGLI+EAGELV+NMPIKPDAR
Sbjct: 574 CVNSGLVEKGKEFLKEMTENYGCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR 633

Query: 601 VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACK+HPGSKLAEFAAEKL++MEP+NAGNYILLSNIYAAAGKWD VAKMRSFLR
Sbjct: 634 VWGPLLSACKLHPGSKLAEFAAEKLVDMEPKNAGNYILLSNIYAAAGKWDEVAKMRSFLR 693

Query: 661 NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTLVNP 720
           +KGLKK PGCSWLEINGHVTEFRVADQTHPRA DIYTILGNLELEIKE REKS + L NP
Sbjct: 694 DKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLELEIKEAREKSLEKLGNP 753

BLAST of CsGy1G023930 vs. NCBI nr
Match: XP_004145299.2 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sativus])

HSP 1 Score: 1220 bits (3156), Expect = 0.0
Identity = 607/607 (100.00%), Positives = 607/607 (100.00%), Query Frame = 0

Query: 116 MVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIE 175
           MVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIE
Sbjct: 1   MVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIE 60

Query: 176 FENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRF 235
           FENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRF
Sbjct: 61  FENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRF 120

Query: 236 IAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVW 295
           IAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVW
Sbjct: 121 IAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVW 180

Query: 296 NIMIAAYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILR 355
           NIMIAAYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILR
Sbjct: 181 NIMIAAYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILR 240

Query: 356 NGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSL 415
           NGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSL
Sbjct: 241 NGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSL 300

Query: 416 FSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAK 475
           FSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAK
Sbjct: 301 FSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAK 360

Query: 476 CGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFL 535
           CGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFL
Sbjct: 361 CGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFL 420

Query: 536 GLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPI 595
           GLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPI
Sbjct: 421 GLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPI 480

Query: 596 KPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKM 655
           KPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKM
Sbjct: 481 KPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKM 540

Query: 656 RSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPD 715
           RSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPD
Sbjct: 541 RSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPD 600

Query: 716 TLVNPLL 722
           TLVNPLL
Sbjct: 601 TLVNPLL 607

BLAST of CsGy1G023930 vs. NCBI nr
Match: XP_022139869.1 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Momordica charantia])

HSP 1 Score: 1185 bits (3065), Expect = 0.0
Identity = 593/717 (82.71%), Positives = 640/717 (89.26%), Query Frame = 0

Query: 1   MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPT 60
           MLHL RSKPI     F NFPATQSR LNTLS LFSRC+S Q L+QIHARFILHG HQNP 
Sbjct: 1   MLHLQRSKPIFRFE-FSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPA 60

Query: 61  LSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKS 120
           LS +LID YANLGLL  S QVF S+IDP  TL++AILRNL+ +GE ERTLLVY++M AKS
Sbjct: 61  LSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKS 120

Query: 121 MHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAH 180
           MHPDEETYP VLRSC   SNV +GR IHG+LVKLG DL+D  ATALAEMY +CI FEN H
Sbjct: 121 MHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGH 180

Query: 181 QLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLN 240
            LFDK  +KD    +SL +E  QN NG+ IF++FGRM  EQLV DS TF NLLR I GLN
Sbjct: 181 DLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLN 240

Query: 241 SIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA 300
           SIQLAKIVHC+AI S L GDLLVNTAVLSLYSKL  LV+ARKLFDKMPEKDRVVWNIMIA
Sbjct: 241 SIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIA 300

Query: 301 AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360
           AY REG P ECLELFKSMARSGIR+DLFTALPVISSI+QLKCVDWGKQTHAH LRNGSD+
Sbjct: 301 AYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDN 360

Query: 361 QVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMK 420
           QVSVHNSLIDMYCE  ILDSACKIF+WMT+K+VISWSAMIKG VK+GQSL ALSLFS+MK
Sbjct: 361 QVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMK 420

Query: 421 SDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIE 480
           SDGIQADF+ +INILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IE
Sbjct: 421 SDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIE 480

Query: 481 MAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540
           MAQRLFEEE++DDKDLIMWNSMISAHANHGDWSQCFK+YN+MKCSNS+PDQVTFLGLLTA
Sbjct: 481 MAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTA 540

Query: 541 CVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR 600
           CVNSGLVEKGKE FKEM E+YGCQPSQEHYACMVNLLGRAGLI++AG LV+NMPIKPDAR
Sbjct: 541 CVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDAR 600

Query: 601 VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACK+HPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKWDGVAKMRSFLR
Sbjct: 601 VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660

Query: 661 NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTL 717
           +KGLKK PGCSWLEINGHVTEFRVAD+THPRA DIYTILGNLELEIKE REKSP+ L
Sbjct: 661 DKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKEAREKSPEKL 716
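
The summary line of each hit reports identities and positives as counts over the number of aligned columns, with the percentage in parentheses. As a minimal sketch (not part of the database output; the function name `parse_summary`, the dictionary keys, and the regular expression are assumptions made for illustration), the following Python recomputes those percentages from the raw counts so they can be checked against the reported values:

```python
import re

# Matches summary lines such as
# "Identity = 593/717 (82.71%), Positives = 640/717 (89.26%), Query Frame = 0".
# The second label is matched loosely ("Posi?tives") to tolerate spelling variants.
SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_summary(line):
    """Parse one HSP summary line and recompute its percentages (hypothetical helper)."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    ident, length, pos = int(ident), int(length), int(pos)
    return {
        "identities": ident,
        "positives": pos,
        "aligned_columns": length,
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * ident / length, 2),
        "positive_pct": round(100 * pos / length, 2),
        # Percentages as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }
```

Calling `parse_summary` on the summary line of the Momordica charantia hit above should return 593 identities and 640 positives over 717 aligned columns. Percentages are taken over aligned columns (including gaps), not over the full 722-residue query.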

BLAST of CsGy1G023930 vs. NCBI nr
Match: KAG6573373.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1178 bits (3048), Expect = 0.0
Identity = 590/717 (82.29%), Positives = 639/717 (89.12%), Query Frame = 0

Query: 1   MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPT 60
           M HL RSKPI     F NFPATQSRLLNTLS LFSRC S Q LQQIHARF+LHGFHQNPT
Sbjct: 1   MFHLQRSKPIFRFK-FPNFPATQSRLLNTLSSLFSRCKSRQQLQQIHARFVLHGFHQNPT 60

Query: 61  LSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKS 120
           LS KLIDCYAN GLLN S  VF S+IDPN  L+NAILRNLTR+GE ERTLLVY++MVAKS
Sbjct: 61  LSCKLIDCYANFGLLNLSHHVFNSIIDPNSALYNAILRNLTRFGEYERTLLVYREMVAKS 120

Query: 121 MHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAH 180
           MHPDE+TYPFVLRSC   SNV FG+ IHG L+KLG D +D V T L EMYE+CI+FENAH
Sbjct: 121 MHPDEQTYPFVLRSCCCLSNVQFGKNIHGCLIKLGVDSYDTVVTVLVEMYEKCIDFENAH 180

Query: 181 QLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLN 240
           QLFDK SVKDL   SSL TE PQN NG+ I R+FGRM +E LV DS TF NLLR ++GL+
Sbjct: 181 QLFDKMSVKDLDCWSSLITEAPQNGNGDDISRLFGRMKSEPLVTDSLTFINLLRSVSGLS 240

Query: 241 SIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA 300
           SIQLAKIVHCIAIVS L GDLLV+TAVLSLYSKL SLVDARKLF+K+PEKDRVVWNIMIA
Sbjct: 241 SIQLAKIVHCIAIVSNLCGDLLVDTAVLSLYSKLGSLVDARKLFEKIPEKDRVVWNIMIA 300

Query: 301 AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360
           AYAREG+P ECLELF+SMARSGIR+DLFTALPVISSI+QLK  DWGKQTHA+ILRNGSDS
Sbjct: 301 AYAREGRPMECLELFESMARSGIRADLFTALPVISSISQLKRADWGKQTHANILRNGSDS 360

Query: 361 QVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMK 420
           QVSVHNSLIDMYCEC  LDSACKIFN +T+K+VISWSAMIKG VK+G  L ALSLF +MK
Sbjct: 361 QVSVHNSLIDMYCECNSLDSACKIFNSVTNKTVISWSAMIKGNVKHGYPLIALSLFFRMK 420

Query: 421 SDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIE 480
           SDGIQADF+ +INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCG I+
Sbjct: 421 SDGIQADFITVINIMPAFVDIGALENVKYLHGYSLKLALTSLPSLNTALLITYAKCGCID 480

Query: 481 MAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540
           MAQRLFEEE++DDKDLIMWNSMISAHANHGDWSQCF LYN+MKCSNS PDQVTFLGLLTA
Sbjct: 481 MAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFNLYNQMKCSNSNPDQVTFLGLLTA 540

Query: 541 CVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR 600
           CVNSGLVEKGKEFFKEM ESY CQPSQEHYACMVNLLGRAGLI+EAGELV+NMPIKPDAR
Sbjct: 541 CVNSGLVEKGKEFFKEMIESYSCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIKPDAR 600

Query: 601 VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACK+HPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKWDGVAKMRSFLR
Sbjct: 601 VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660

Query: 661 NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTL 717
           +KGLKK PGCSWLEING V EFRVAD+THPRA DIY ILGNLEL+IKE +E SP+ L
Sbjct: 661 DKGLKKTPGCSWLEINGRVAEFRVADRTHPRAEDIYAILGNLELDIKEAKEMSPEKL 716

BLAST of CsGy1G023930 vs. ExPASy TrEMBL
Match: A0A0A0M0Z6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G534720 PE=4 SV=1)

HSP 1 Score: 1449 bits (3752), Expect = 0.0
Identity = 722/722 (100.00%), Positives = 722/722 (100.00%), Query Frame = 0

Query: 1   MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPT 60
           MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPT
Sbjct: 1   MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPT 60

Query: 61  LSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKS 120
           LSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKS
Sbjct: 61  LSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKS 120

Query: 121 MHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAH 180
           MHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAH
Sbjct: 121 MHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAH 180

Query: 181 QLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLN 240
           QLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLN
Sbjct: 181 QLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLN 240

Query: 241 SIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA 300
           SIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA
Sbjct: 241 SIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA 300

Query: 301 AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360
           AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS
Sbjct: 301 AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360

Query: 361 QVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMK 420
           QVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMK
Sbjct: 361 QVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMK 420

Query: 421 SDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIE 480
           SDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIE
Sbjct: 421 SDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIE 480

Query: 481 MAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540
           MAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA
Sbjct: 481 MAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540

Query: 541 CVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR 600
           CVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR
Sbjct: 541 CVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR 600

Query: 601 VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR
Sbjct: 601 VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660

Query: 661 NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTLVNP 720
           NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTLVNP
Sbjct: 661 NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTLVNP 720

Query: 721 LL 722
           LL
Sbjct: 721 LL 722

BLAST of CsGy1G023930 vs. ExPASy TrEMBL
Match: A0A5D3DB69 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold480G00810 PE=4 SV=1)

HSP 1 Score: 1357 bits (3511), Expect = 0.0
Identity = 680/722 (94.18%), Positives = 694/722 (96.12%), Query Frame = 0

Query: 1   MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPT 60
           MLHL RSKPIIH+PI LNFPATQSRLLNTLSLLF+RCNSIQHLQQIHARFILHGFHQNPT
Sbjct: 1   MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPT 60

Query: 61  LSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKS 120
           LSSKLIDCYANLGLL HSLQVFCS+IDPNLTLFNAILRNLTRYGESER LLVYQQMVAKS
Sbjct: 61  LSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKS 120

Query: 121 MHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAH 180
           MHPDEETYPF+ RSCSSFSNVGFGRTIHGYLVKLGFD FDVVATALAEMYE+ I FENAH
Sbjct: 121 MHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAH 180

Query: 181 QLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLN 240
           QLFDKRSVKDLGW SSLTTEG QN NGEGIFRVF RM AEQLVPDS TF NLLRFIAGLN
Sbjct: 181 QLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLN 240

Query: 241 SIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA 300
           SIQLAKIVHCIAIVSKLSGDLLV TAVLSLYSKLRSLVDAR+LFDKMPEKDRVVWNIMIA
Sbjct: 241 SIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA 300

Query: 301 AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360
           AYAREGKP ECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS
Sbjct: 301 AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360

Query: 361 QVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMK 420
           QVSVHNSLIDMYCECK+LDSAC IFNWMTDKSVISWSAMIKGYVKNGQSLTA SLFSKMK
Sbjct: 361 QVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMK 420

Query: 421 SDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIE 480
           SDGIQADFV MINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IE
Sbjct: 421 SDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIE 480

Query: 481 MAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540
           MAQRLFEEE+IDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA
Sbjct: 481 MAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540

Query: 541 CVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR 600
           CVNSGL+EKGKEFFKEMTESYGC PSQEH+ACMVNLLGRAGLISEAGELV+NMPIKPDAR
Sbjct: 541 CVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR 600

Query: 601 VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACKMHPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKW+ VAKMRSFLR
Sbjct: 601 VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLR 660

Query: 661 NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTLVNP 720
           NKGLKK PGCS LEING VTEFRVADQTHPRA DIYTILGNLELEIKEVREKS DTLVNP
Sbjct: 661 NKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILGNLELEIKEVREKSLDTLVNP 720

Query: 721 LL 722
           LL
Sbjct: 721 LL 722

BLAST of CsGy1G023930 vs. ExPASy TrEMBL
Match: A0A1S3BBG7 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103487849 PE=4 SV=1)

HSP 1 Score: 1357 bits (3511), Expect = 0.0
Identity = 680/722 (94.18%), Positives = 694/722 (96.12%), Query Frame = 0

Query: 1   MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPT 60
           MLHL RSKPIIH+PI LNFPATQSRLLNTLSLLF+RCNSIQHLQQIHARFILHGFHQNPT
Sbjct: 1   MLHLQRSKPIIHTPILLNFPATQSRLLNTLSLLFNRCNSIQHLQQIHARFILHGFHQNPT 60

Query: 61  LSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKS 120
           LSSKLIDCYANLGLL HSLQVFCS+IDPNLTLFNAILRNLTRYGESER LLVYQQMVAKS
Sbjct: 61  LSSKLIDCYANLGLLKHSLQVFCSIIDPNLTLFNAILRNLTRYGESERALLVYQQMVAKS 120

Query: 121 MHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAH 180
           MHPDEETYPF+ RSCSSFSNVGFGRTIHGYLVKLGFD FDVVATALAEMYE+ I FENAH
Sbjct: 121 MHPDEETYPFIFRSCSSFSNVGFGRTIHGYLVKLGFDSFDVVATALAEMYEKWIAFENAH 180

Query: 181 QLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLN 240
           QLFDKRSVKDLGW SSLTTEG QN NGEGIFRVF RM AEQLVPDS TF NLLRFIAGLN
Sbjct: 181 QLFDKRSVKDLGWSSSLTTEGSQNGNGEGIFRVFVRMRAEQLVPDSLTFVNLLRFIAGLN 240

Query: 241 SIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA 300
           SIQLAKIVHCIAIVSKLSGDLLV TAVLSLYSKLRSLVDAR+LFDKMPEKDRVVWNIMIA
Sbjct: 241 SIQLAKIVHCIAIVSKLSGDLLVYTAVLSLYSKLRSLVDARRLFDKMPEKDRVVWNIMIA 300

Query: 301 AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360
           AYAREGKP ECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS
Sbjct: 301 AYAREGKPRECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360

Query: 361 QVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMK 420
           QVSVHNSLIDMYCECK+LDSAC IFNWMTDKSVISWSAMIKGYVKNGQSLTA SLFSKMK
Sbjct: 361 QVSVHNSLIDMYCECKMLDSACNIFNWMTDKSVISWSAMIKGYVKNGQSLTASSLFSKMK 420

Query: 421 SDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIE 480
           SDGIQADFV MINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IE
Sbjct: 421 SDGIQADFVTMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGYIE 480

Query: 481 MAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540
           MAQRLFEEE+IDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA
Sbjct: 481 MAQRLFEEERIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540

Query: 541 CVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR 600
           CVNSGL+EKGKEFFKEMTESYGC PSQEH+ACMVNLLGRAGLISEAGELV+NMPIKPDAR
Sbjct: 541 CVNSGLIEKGKEFFKEMTESYGCLPSQEHFACMVNLLGRAGLISEAGELVRNMPIKPDAR 600

Query: 601 VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACKMHPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKW+ VAKMRSFLR
Sbjct: 601 VWGPLLSACKMHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWEEVAKMRSFLR 660

Query: 661 NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTLVNP 720
           NKGLKK PGCS LEING VTEFRVADQTHPRA DIYTILGNLELEIKEVREKS DTLVNP
Sbjct: 661 NKGLKKTPGCSSLEINGRVTEFRVADQTHPRAEDIYTILGNLELEIKEVREKSLDTLVNP 720

Query: 721 LL 722
           LL
Sbjct: 721 LL 722

BLAST of CsGy1G023930 vs. ExPASy TrEMBL
Match: A0A6J1CE61 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111010677 PE=4 SV=1)

HSP 1 Score: 1185 bits (3065), Expect = 0.0
Identity = 593/717 (82.71%), Positives = 640/717 (89.26%), Query Frame = 0

Query: 1   MLHLHRSKPIIHSPIFLNFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPT 60
           MLHL RSKPI     F NFPATQSR LNTLS LFSRC+S Q L+QIHARFILHG HQNP 
Sbjct: 1   MLHLQRSKPIFRFE-FSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPA 60

Query: 61  LSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKS 120
           LS +LID YANLGLL  S QVF S+IDP  TL++AILRNL+ +GE ERTLLVY++M AKS
Sbjct: 61  LSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKS 120

Query: 121 MHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEFENAH 180
           MHPDEETYP VLRSC   SNV +GR IHG+LVKLG DL+D  ATALAEMY +CI FEN H
Sbjct: 121 MHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGH 180

Query: 181 QLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLN 240
            LFDK  +KD    +SL +E  QN NG+ IF++FGRM  EQLV DS TF NLLR I GLN
Sbjct: 181 DLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLN 240

Query: 241 SIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIA 300
           SIQLAKIVHC+AI S L GDLLVNTAVLSLYSKL  LV+ARKLFDKMPEKDRVVWNIMIA
Sbjct: 241 SIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIA 300

Query: 301 AYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDS 360
           AY REG P ECLELFKSMARSGIR+DLFTALPVISSI+QLKCVDWGKQTHAH LRNGSD+
Sbjct: 301 AYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDN 360

Query: 361 QVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMK 420
           QVSVHNSLIDMYCE  ILDSACKIF+WMT+K+VISWSAMIKG VK+GQSL ALSLFS+MK
Sbjct: 361 QVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMK 420

Query: 421 SDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGSIE 480
           SDGIQADF+ +INILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IE
Sbjct: 421 SDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIE 480

Query: 481 MAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTA 540
           MAQRLFEEE++DDKDLIMWNSMISAHANHGDWSQCFK+YN+MKCSNS+PDQVTFLGLLTA
Sbjct: 481 MAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTA 540

Query: 541 CVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDAR 600
           CVNSGLVEKGKE FKEM E+YGCQPSQEHYACMVNLLGRAGLI++AG LV+NMPIKPDAR
Sbjct: 541 CVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDAR 600

Query: 601 VWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660
           VWGPLLSACK+HPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKWDGVAKMRSFLR
Sbjct: 601 VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLR 660

Query: 661 NKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDTL 717
           +KGLKK PGCSWLEINGHVTEFRVAD+THPRA DIYTILGNLELEIKE REKSP+ L
Sbjct: 661 DKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKEAREKSPEKL 716

BLAST of CsGy1G023930 vs. ExPASy TrEMBL
Match: A0A6J1K3Q8 (pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111490383 PE=4 SV=1)

HSP 1 Score: 1177 bits (3045), Expect = 0.0
Identity = 591/721 (81.97%), Positives = 641/721 (88.90%), Query Frame = 0

Query: 1   MLHLHRSKPIIHSPIFL----NFPATQSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFH 60
           M HL RSK I  SPIF     NFPATQSRLLNTLS LFSRC S Q L+QIHARF+LHGFH
Sbjct: 1   MFHLQRSKSITQSPIFRFKFPNFPATQSRLLNTLSSLFSRCKSRQQLEQIHARFVLHGFH 60

Query: 61  QNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVYQQM 120
           QNPTLS KLIDCYAN GLLN S  VF S+IDPN TL+NAILRNLTR+GE ERTLLVY++M
Sbjct: 61  QNPTLSCKLIDCYANFGLLNVSHHVFNSIIDPNSTLYNAILRNLTRFGEYERTLLVYREM 120

Query: 121 VAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEECIEF 180
           VAKSMHPDE+TYPFVL+SC   SNV FG+ IHG L+KLG D +D V T LAEMY +CI+F
Sbjct: 121 VAKSMHPDEQTYPFVLQSCCCLSNVEFGKNIHGCLIKLGVDSYDTVVTVLAEMYGKCIDF 180

Query: 181 ENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLLRFI 240
           ENAHQLFDK SVKDL   SSL +E PQN NG+ I  + GRM +E LV DS TF NLLR I
Sbjct: 181 ENAHQLFDKMSVKDLDCWSSLISEAPQNGNGDEISLLLGRMKSEPLVTDSLTFINLLRSI 240

Query: 241 AGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWN 300
           +GL+SIQLAKIVHCIAIVS L GDLLV+TAVLSLYSKL SLVDARKLF+KMPEKDRVVWN
Sbjct: 241 SGLSSIQLAKIVHCIAIVSNLCGDLLVDTAVLSLYSKLGSLVDARKLFEKMPEKDRVVWN 300

Query: 301 IMIAAYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRN 360
           IMIAAYAREG+P ECLELF+SMARSGIR+DLFTALPVISSI+QLKC DWGKQTHA+ILRN
Sbjct: 301 IMIAAYAREGRPMECLELFESMARSGIRADLFTALPVISSISQLKCADWGKQTHANILRN 360

Query: 361 GSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTALSLF 420
           GSDSQVSVHNSLIDMYCEC  L+SACKIFN +T+K+VISWSAMIKG VK+G  L ALSLF
Sbjct: 361 GSDSQVSVHNSLIDMYCECNSLESACKIFNSVTNKTVISWSAMIKGNVKHGYPLIALSLF 420

Query: 421 SKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKC 480
             MKSDGIQADF+ +INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKC
Sbjct: 421 FMMKSDGIQADFITVINIMPAFVDIGALENVKYLHGYSLKLALTSLPSLNTALLITYAKC 480

Query: 481 GSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLG 540
           G IEMAQRLFEEE+++DKDLIMWNSMISAHANHGDWSQCFKLYN+MKCSNS PDQVTFLG
Sbjct: 481 GCIEMAQRLFEEERVNDKDLIMWNSMISAHANHGDWSQCFKLYNQMKCSNSNPDQVTFLG 540

Query: 541 LLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKNMPIK 600
           LLTACVNSGLVEKGKEFFKEM ESY CQPSQEHYACMVNLLGRAGLI+EAGELV+NMPIK
Sbjct: 541 LLTACVNSGLVEKGKEFFKEMIESYSCQPSQEHYACMVNLLGRAGLINEAGELVRNMPIK 600

Query: 601 PDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGVAKMR 660
           PDARVWGPLLSACK+HPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKWDGVAKMR
Sbjct: 601 PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMR 660

Query: 661 SFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIKEVREKSPDT 717
           SFLR+KGLKK PGCSWLEING V EFRVAD+THPRA DIY ILGNLEL+IKE +E SP+ 
Sbjct: 661 SFLRDKGLKKTPGCSWLEINGRVAEFRVADRTHPRAEDIYAILGNLELDIKEAKEMSPEK 720

BLAST of CsGy1G023930 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 436.4 bits (1121), Expect = 4.4e-122
Identity = 250/685 (36.50%), Positives = 379/685 (55.33%), Query Frame = 0

Query: 29  TLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLID-CYANLGL--LNHSLQVFCSV 88
           +LSLL + C ++Q L+ IHA+ I  G H      SKLI+ C  +     L +++ VF ++
Sbjct: 36  SLSLLHN-CKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTI 95

Query: 89  IDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGR 148
            +PNL ++N + R      +    L +Y  M++  + P+  T+PFVL+SC+       G+
Sbjct: 96  QEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQ 155

Query: 149 TIHGYLVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQND 208
            IHG+++KLG DL   V T+L  MY +    E+AH++FDK   +                
Sbjct: 156 QIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHR---------------- 215

Query: 209 NGEGIFRVFGRMIAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNT 268
                                                                 D++  T
Sbjct: 216 ------------------------------------------------------DVVSYT 275

Query: 269 AVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRS 328
           A++  Y+    + +A+KLFD++P KD V WN MI+ YA  G   E LELFK M ++ +R 
Sbjct: 276 ALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRP 335

Query: 329 DLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIF 388
           D  T + V+S+ AQ   ++ G+Q H  I  +G  S + + N+LID+Y +C  L++AC +F
Sbjct: 336 DESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLF 395

Query: 389 NWMTDKSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALE 448
             +  K VISW+ +I GY        AL LF +M   G   + V M++ILPA  H+GA++
Sbjct: 396 ERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAID 455

Query: 449 NVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMI 508
             +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    I  K L  WN+MI
Sbjct: 456 IGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVF--NSILHKSLSSWNAMI 515

Query: 509 SAHANHGDWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGC 568
              A HG     F L++RM+    +PD +TF+GLL+AC +SG+++ G+  F+ MT+ Y  
Sbjct: 516 FGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKM 575

Query: 569 QPSQEHYACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAE 628
            P  EHY CM++LLG +GL  EA E++  M ++PD  +W  LL ACKMH   +L E  AE
Sbjct: 576 TPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAE 635

Query: 629 KLINMEPRNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFR 688
            LI +EP N G+Y+LLSNIYA+AG+W+ VAK R+ L +KG+KK+PGCS +EI+  V EF 
Sbjct: 636 NLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFI 647

Query: 689 VADQTHPRAGDIYTILGNLELEIKE 709
           + D+ HPR  +IY +L  +E+ +++
Sbjct: 696 IGDKFHPRNREIYGMLEEMEVLLEK 647

BLAST of CsGy1G023930 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 419.9 bits (1078), Expect = 4.3e-117
Identity = 221/678 (32.60%), Positives = 372/678 (54.87%), Query Frame = 0

Query: 31  SLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNL 90
           +LL  RC+S++ L+QI      +G +Q     +KL+  +   G ++ + +VF  +     
Sbjct: 41  ALLLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLN 100

Query: 91  TLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGY 150
            L++ +L+   +  + ++ L  + +M    + P    + ++L+ C   + +  G+ IHG 
Sbjct: 101 VLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGL 160

Query: 151 LVKLGFDLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGI 210
           LVK GF L     T L  MY +C +   A ++FD+   +DL   +++     QN      
Sbjct: 161 LVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMA 220

Query: 211 FRVFGRMIAEQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSL 270
             +   M  E L P   T  ++L  ++ L  I + K +H  A+ S     + ++TA++ +
Sbjct: 221 LEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDM 280

Query: 271 YSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELFKSMARSGIRSDLFTA 330
           Y+K  SL  AR+LFD M E++ V WN MI AY +   P E + +F+ M   G++    + 
Sbjct: 281 YAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSV 340

Query: 331 LPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTD 390
           +  + + A L  ++ G+  H   +  G D  VSV NSLI MYC+CK +D+A  +F  +  
Sbjct: 341 MGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQS 400

Query: 391 KSVISWSAMIKGYVKNGQSLTALSLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYL 450
           ++++SW+AMI G+ +NG+ + AL+ FS+M+S  ++ D    ++++ A   +    + K++
Sbjct: 401 RTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWI 460

Query: 451 HGYSMKLGLTSLPSLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHG 510
           HG  M+  L     + TAL+  YAKCG+I +A+ +F  + + ++ +  WN+MI  +  HG
Sbjct: 461 HGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIF--DMMSERHVTTWNAMIDGYGTHG 520

Query: 511 DWSQCFKLYNRMKCSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHY 570
                 +L+  M+    KP+ VTFL +++AC +SGLVE G + F  M E+Y  + S +HY
Sbjct: 521 FGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHY 580

Query: 571 ACMVNLLGRAGLISEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEP 630
             MV+LLGRAG ++EA + +  MP+KP   V+G +L AC++H     AE AAE+L  + P
Sbjct: 581 GAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNP 640

Query: 631 RNAGNYILLSNIYAAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHP 690
            + G ++LL+NIY AA  W+ V ++R  +  +GL+K PGCS +EI   V  F      HP
Sbjct: 641 DDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHP 700

Query: 691 RAGDIYTILGNLELEIKE 709
            +  IY  L  L   IKE
Sbjct: 701 DSKKIYAFLEKLICHIKE 716

BLAST of CsGy1G023930 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 403.7 bits (1036), Expect = 3.2e-112
Identity = 235/730 (32.19%), Positives = 403/730 (55.21%), Query Frame = 0

Query: 23  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGL---LNHSL 82
           QS+           C +I  L+  H      G   + +  +KL+     LG    L+ + 
Sbjct: 28  QSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAK 87

Query: 83  QVF-CSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSF 142
           +VF  S       ++N+++R     G     +L++ +M+   + PD+ T+PF L +C+  
Sbjct: 88  EVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKS 147

Query: 143 SNVGFGRTIHGYLVKLGF--DLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSS 202
              G G  IHG +VK+G+  DLF  V  +L   Y EC E ++A ++FD+ S +++   +S
Sbjct: 148 RAKGNGIQIHGLIVKMGYAKDLF--VQNSLVHFYAECGELDSARKVFDEMSERNVVSWTS 207

Query: 203 LTTEGPQNDNGEGIFRVFGRMIA-EQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVS 262
           +     + D  +    +F RM+  E++ P+S T   ++   A L  ++  + V+     S
Sbjct: 208 MICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNS 267

Query: 263 KLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF 322
            +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R+G   E L +F
Sbjct: 268 GIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVF 327

Query: 323 KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCEC 382
             M  SG+R D  + L  ISS +QL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C
Sbjct: 328 NLMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKC 387

Query: 383 KILDSACKIFNWMTDKSVISWSAMIKGYVKNGQ--------------------------- 442
              D+A +IF+ M++K+V++W++++ GYV+NG+                           
Sbjct: 388 HRQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLV 447

Query: 443 --SL--TALSLFSKMKS-DGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLP 502
             SL   A+ +F  M+S +G+ AD V M++I  A  H+GAL+  K+++ Y  K G+    
Sbjct: 448 QGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDV 507

Query: 503 SLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMK 562
            L T L+  +++CG  E A  +F    + ++D+  W + I A A  G+  +  +L++ M 
Sbjct: 508 RLGTTLVDMFSRCGDPESAMSIF--NSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMI 567

Query: 563 CSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLI 622
               KPD V F+G LTAC + GLV++GKE F  M + +G  P   HY CMV+LLGRAGL+
Sbjct: 568 EQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLL 627

Query: 623 SEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIY 682
            EA +L+++MP++P+  +W  LL+AC++    ++A +AAEK+  + P   G+Y+LLSN+Y
Sbjct: 628 EEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVY 687

Query: 683 AAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLE 714
           A+AG+W+ +AK+R  ++ KGL+K PG S ++I G   EF   D++HP        + N+E
Sbjct: 688 ASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPE-------MPNIE 746

BLAST of CsGy1G023930 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )

HSP 1 Score: 403.7 bits (1036), Expect = 3.2e-112
Identity = 235/730 (32.19%), Positives = 403/730 (55.21%), Query Frame = 0

Query: 23  QSRLLNTLSLLFSRCNSIQHLQQIHARFILHGFHQNPTLSSKLIDCYANLGL---LNHSL 82
           QS+           C +I  L+  H      G   + +  +KL+     LG    L+ + 
Sbjct: 28  QSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAK 87

Query: 83  QVF-CSVIDPNLTLFNAILRNLTRYGESERTLLVYQQMVAKSMHPDEETYPFVLRSCSSF 142
           +VF  S       ++N+++R     G     +L++ +M+   + PD+ T+PF L +C+  
Sbjct: 88  EVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKS 147

Query: 143 SNVGFGRTIHGYLVKLGF--DLFDVVATALAEMYEECIEFENAHQLFDKRSVKDLGWPSS 202
              G G  IHG +VK+G+  DLF  V  +L   Y EC E ++A ++FD+ S +++   +S
Sbjct: 148 RAKGNGIQIHGLIVKMGYAKDLF--VQNSLVHFYAECGELDSARKVFDEMSERNVVSWTS 207

Query: 203 LTTEGPQNDNGEGIFRVFGRMIA-EQLVPDSFTFFNLLRFIAGLNSIQLAKIVHCIAIVS 262
           +     + D  +    +F RM+  E++ P+S T   ++   A L  ++  + V+     S
Sbjct: 208 MICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNS 267

Query: 263 KLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRVVWNIMIAAYAREGKPTECLELF 322
            +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R+G   E L +F
Sbjct: 268 GIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVF 327

Query: 323 KSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHILRNGSDSQVSVHNSLIDMYCEC 382
             M  SG+R D  + L  ISS +QL+ + WGK  H ++LRNG +S  ++ N+LIDMY +C
Sbjct: 328 NLMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKC 387

Query: 383 KILDSACKIFNWMTDKSVISWSAMIKGYVKNGQ--------------------------- 442
              D+A +IF+ M++K+V++W++++ GYV+NG+                           
Sbjct: 388 HRQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLV 447

Query: 443 --SL--TALSLFSKMKS-DGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLP 502
             SL   A+ +F  M+S +G+ AD V M++I  A  H+GAL+  K+++ Y  K G+    
Sbjct: 448 QGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDV 507

Query: 503 SLNTALLITYAKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRMK 562
            L T L+  +++CG  E A  +F    + ++D+  W + I A A  G+  +  +L++ M 
Sbjct: 508 RLGTTLVDMFSRCGDPESAMSIF--NSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMI 567

Query: 563 CSNSKPDQVTFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLI 622
               KPD V F+G LTAC + GLV++GKE F  M + +G  P   HY CMV+LLGRAGL+
Sbjct: 568 EQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLL 627

Query: 623 SEAGELVKNMPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIY 682
            EA +L+++MP++P+  +W  LL+AC++    ++A +AAEK+  + P   G+Y+LLSN+Y
Sbjct: 628 EEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVY 687

Query: 683 AAAGKWDGVAKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLE 714
           A+AG+W+ +AK+R  ++ KGL+K PG S ++I G   EF   D++HP        + N+E
Sbjct: 688 ASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPE-------MPNIE 746

BLAST of CsGy1G023930 vs. TAIR 10
Match: AT4G21300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 399.1 bits (1024), Expect = 7.8e-111
Identity = 211/655 (32.21%), Positives = 368/655 (56.18%), Query Frame = 0

Query: 54  GFHQNPTLSSKLIDCYANLGLLNHSLQVFCSVIDPNLTLFNAILRNLTRYGESERTLLVY 113
           G   N  ++S LI  Y   G ++   ++F  V+  +  ++N +L    + G  +  +  +
Sbjct: 168 GMDCNEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAKCGALDSVIKGF 227

Query: 114 QQMVAKSMHPDEETYPFVLRSCSSFSNVGFGRTIHGYLVKLGFDLFDVVATALAEMYEEC 173
             M    + P+  T+  VL  C+S   +  G  +HG +V  G D    +  +L  MY +C
Sbjct: 228 SVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKC 287

Query: 174 IEFENAHQLFDKRSVKDLGWPSSLTTEGPQNDNGEGIFRVFGRMIAEQLVPDSFTFFNLL 233
             F++A +LF   S  D    + + +   Q+   E     F  MI+  ++PD+ TF +LL
Sbjct: 288 GRFDDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLL 347

Query: 234 RFIAGLNSIQLAKIVHCIAIVSKLSGDLLVNTAVLSLYSKLRSLVDARKLFDKMPEKDRV 293
             ++   +++  K +HC  +   +S D+ + +A++  Y K R +  A+ +F +    D V
Sbjct: 348 PSVSKFENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVV 407

Query: 294 VWNIMIAAYAREGKPTECLELFKSMARSGIRSDLFTALPVISSIAQLKCVDWGKQTHAHI 353
           V+  MI+ Y   G   + LE+F+ + +  I  +  T + ++  I  L  +  G++ H  I
Sbjct: 408 VFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFI 467

Query: 354 LRNGSDSQVSVHNSLIDMYCECKILDSACKIFNWMTDKSVISWSAMIKGYVKNGQSLTAL 413
           ++ G D++ ++  ++IDMY +C  ++ A +IF  ++ + ++SW++MI    ++     A+
Sbjct: 468 IKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAI 527

Query: 414 SLFSKMKSDGIQADFVIMINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITY 473
            +F +M   GI  D V +   L A  ++ +    K +HG+ +K  L S     + L+  Y
Sbjct: 528 DIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDVYSESTLIDMY 587

Query: 474 AKCGSIEMAQRLFEEEKIDDKDLIMWNSMISAHANHGDWSQCFKLYNRM-KCSNSKPDQV 533
           AKCG+++ A  +F+  K  +K+++ WNS+I+A  NHG       L++ M + S  +PDQ+
Sbjct: 588 AKCGNLKAAMNVFKTMK--EKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSGIRPDQI 647

Query: 534 TFLGLLTACVNSGLVEKGKEFFKEMTESYGCQPSQEHYACMVNLLGRAGLISEAGELVKN 593
           TFL ++++C + G V++G  FF+ MTE YG QP QEHYAC+V+L GRAG ++EA E VK+
Sbjct: 648 TFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEAYETVKS 707

Query: 594 MPIKPDARVWGPLLSACKMHPGSKLAEFAAEKLINMEPRNAGNYILLSNIYAAAGKWDGV 653
           MP  PDA VWG LL AC++H   +LAE A+ KL++++P N+G Y+L+SN +A A +W+ V
Sbjct: 708 MPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYVLISNAHANAREWESV 767

Query: 654 AKMRSFLRNKGLKKIPGCSWLEINGHVTEFRVADQTHPRAGDIYTILGNLELEIK 708
            K+RS ++ + ++KIPG SW+EIN     F   D  HP +  IY++L +L  E++
Sbjct: 768 TKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYSLLNSLLGELR 820
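
The percentages in the HSP header for AT4G21300.1 are the residue counts divided by the alignment length (211/655 and 368/655). A minimal sketch of that arithmetic follows, assuming the header text is available as a plain string; the function name `hsp_fractions` is hypothetical and not part of any BLAST tool.

```python
import re

# Header text copied from the AT4G21300.1 HSP above.
header = (
    "HSP 1 Score: 399.1 bits (1024), Expect = 7.8e-111\n"
    "Identity = 211/655 (32.21%), Positives = 368/655 (56.18%), Query Frame = 0"
)

def hsp_fractions(text):
    """Recompute identity/positives percentages from the matches/length counts.

    Hypothetical helper for illustration. The pattern tolerates a missing
    'i' in the 'Positives' label.
    """
    out = {}
    for label, num, den in re.findall(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)", text):
        key = "Positives" if label.startswith("Pos") else label
        out[key] = (int(num), int(den), round(100 * int(num) / int(den), 2))
    return out

stats = hsp_fractions(header)
for name, (matches, length, pct) in stats.items():
    print(f"{name}: {matches}/{length} = {pct}%")
```

The recomputed values agree with the percentages reported in the header, which is a quick consistency check when copying figures out of an alignment report.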

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q9LN01          6.2e-121  36.50     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Q3E6Q1          6.0e-116  32.60     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q9LUJ2          4.5e-111  32.19     Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX...
Q9STE1          1.1e-109  32.21     Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX...
Q9SN39          7.1e-109  33.13     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...

Match Name      E-value   Identity  Description
XP_008444579.1  0.0       94.18     PREDICTED: pentatricopeptide repeat-containing protein At1g11290, chloroplastic-...
XP_038894029.1  0.0       88.07     pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Benin...
XP_004145299.2  0.0       100.00    pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sa...
XP_022139869.1  0.0       82.71     pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Momor...
KAG6573373.1    0.0       82.29     Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...

Match Name      E-value   Identity  Description
A0A0A0M0Z6      0.0       100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G534720 PE=4 SV=1
A0A5D3DB69      0.0       94.18     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3BBG7      0.0       94.18     pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cuc...
A0A6J1CE61      0.0       82.71     pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Mom...
A0A6J1K3Q8      0.0       81.97     pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like OS=Cuc...

Match Name      E-value   Identity  Description
AT1G08070.1     4.4e-122  36.50     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1     4.3e-117  32.60     Pentatricopeptide repeat (PPR) superfamily protein
AT3G22690.1     3.2e-112  32.19     CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012...
AT3G22690.2     3.2e-112  32.19     INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro...
AT4G21300.1     7.8e-111  32.21     Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 352..454  e-value: 2.9E-10  score: 41.9
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 464..704  e-value: 1.5E-35  score: 125.1
    coord: 203..349  e-value: 1.0E-20  score: 76.3
    coord: 22..199  e-value: 5.2E-14  score: 54.3
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 293..326  e-value: 6.5E-8  score: 30.3
    coord: 93..125  e-value: 0.0014  score: 16.6
    coord: 394..427  e-value: 2.0E-7  score: 28.8
    coord: 532..566  e-value: 4.5E-5  score: 21.3
    coord: 497..530  e-value: 1.4E-7  score: 29.3
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 365..386  e-value: 0.89  score: 10.0
    coord: 394..424  e-value: 3.3E-8  score: 33.2
    coord: 570..593  e-value: 1.0  score: 9.8
    coord: 293..323  e-value: 9.4E-9  score: 35.0
    coord: 265..290  e-value: 0.062  score: 13.6
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 495..542  e-value: 1.8E-11  score: 44.0
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 392..426  score: 11.246351
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 89..123  score: 8.801982
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 495..529  score: 11.257313
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 530..565  score: 8.834866
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 291..325  score: 12.375365
None  No IPR available  PANTHER  PTHR47928  REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED
    coord: 1..718
None  No IPR available  PANTHER  PTHR47928:SF25  PPR CONTAINING PLANT-LIKE PROTEIN
    coord: 1..718
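
The coord ranges in the InterPro table can be ordered along the protein to see how the repeats are laid out. The sketch below does this for the five PROSITE PS51375 (PPR) hits listed above, assuming the coordinates are 1-based and inclusive (the table does not state this); the variable names are illustrative only.

```python
# PROSITE PS51375 (PPR) hits copied from the InterPro table, as (start, end).
# Assumption: coordinates are 1-based and inclusive.
ppr_hits = [(392, 426), (89, 123), (495, 529), (530, 565), (291, 325)]

# Order the hits from N-terminus to C-terminus.
ordered = sorted(ppr_hits)

# Length of each repeat in residues (inclusive coordinates, hence the +1).
lengths = [end - start + 1 for start, end in ordered]

# Number of residues separating each hit from the next one (0 means adjacent).
gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]

for (start, end), length in zip(ordered, lengths):
    print(f"{start}..{end}  length {length}")
print("gaps between consecutive hits:", gaps)
```

Under that assumption each hit spans 35 or 36 residues, and the last two hits (495..529 and 530..565) are directly adjacent.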

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy1G023930.1  CsGy1G023930.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding