CsGy1G001040 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy1G001040
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPentatricopeptide repeat-containing protein
LocationGy14Chr1: 589837 .. 592166 (-)
RNA-Seq ExpressionCsGy1G001040
SyntenyCsGy1G001040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCAAACCATTTCAAGATAAAGAGTACTATTTTTTACTCCTCCTTCGTGAAAGAATTCTTTGTCGATTGAAATGTTGGCAGCTCTGTTCCACTGAATTGAACTATCTATAGATGATATATTGCCAAATTTAGAATCTTTCGAGCTTCACTTTCATCTCAAATGGCGTCGATAGTCGGTTGCCTTCCCAATATATCTCTGACTTCCATAACCCAGTTCCCTGAAAACCCAAAATCTTTGATTCTTCAGCAATGCAAAACTCCAAAAGACCTCCAGCAAGTTCACGCTCACCTTCTCAAAACTCGCCGTCTCCTCGACCCCATCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCGACACCATAGATTATGCCCTTTCCATTTTCAACCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCTTTCAAGCGATCGCCTGATAATGCCCTTCTCTTGTTCAAGAAAATGCATGAAAAGTCAGTTCAGCATGACAAATTCACTTTCTCCTCTGTCTTAAAGGCTTGCTCTAGAATGAAAGCGCTGAGGGAAGGCGAACAGGTCCACGCGTTGATTCTGAAATCTGGGTTCAAATCAAATGAGTTTGTCGAGAATACTTTGATTCAGATGTATGCGAATTGTGGACAAATTGGGGTTGCACGTCATGTGTTTGATGGAATGCCGGAAAGAAGCATAGTTGCGTGGAATTCGATGTTGTCTGGTTATACGAAAAATGGGCTTTGGGATGAGGTCGTGAAGCTTTTTCGAAAAATTTTGGAACTGCGTATTGAATTTGATGATGTTACAATGATTAGTGTATTGATGGCTTGTGGAAGATTAGCGAATCTGGAAATAGGGGAGTTGATTGGTGAGTATATTGTGTCAAAAGGGCTAAGACGAAACAATACTCTAACGACTTCGCTGATTGATATGTATGCCAAATGTGGTCAAGTTGATACCGCTAGAAAGTTGTTCGATGAAATGGATAAAAGAGATGTTGTTGCTTGGAGTGCAATGATCTCGGGGTATGCTCAAGCTGATCGATGTAAAGAAGCTCTTAATCTGTTCCATGAGATGCAGAAGGGAAATGTATATCCAAACGAGGTAACAATGGTCAGTGTTCTCTATTCGTGCGCTATGCTTGGAGCATACGAAACAGGTAAGTGGGTTCATTTCTACATCAAAAAGAAGAAGATGAAGCTCACGGTTACTCTTGGAACTCAGCTGATAGATTTTTATGCTAAATGTGGGTATATAGATAGATCAGTTGAAGTTTTCAAGGAAATGTCTTTCAAGAATGTGTTCACATGGACAGCATTAATTCAAGGTCTTGCCAATAATGGAGAAGGGAAAATGGCTCTGGAATTCTTTTCCTCGATGCTAGAGAATGATGTAAAGCCAAATGATGTAACTTTCATTGGCGTTCTGTCTGCTTGTAGCCACGCTTGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATACTTGGACGTGCTGGGTTTCTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCTTCCCTCCCAATGCTGTTGTTTGGAGAACACTATTGGCTTCATGTAGAGCTCATAAAAACATTGAAATGGCAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCACAGTGGAGATTACATTCTTCTGTCAAATACTTATGCATTGGTTGGTAGGGTTGAGGATGCAATCAGGGTAAGATCTTTGATAAAAGAGAAGGAGATTAAGAAGATTCCAGGTTGTAGTTTGATTGAGCTCGATGGTGTTGTACATGAGTTTTTTTCAGAAGATGGAGAACATAAGCACTCCAAGGAAATACATGACGCGTTAGATAAAATGATGAAGCAGATCAAGAGGCTCGGATATGTGCCCAACACAGACGATGCTAGACTGGAGGCTGAGGAAGAGAGCAAAGAAACTTCAGTGTCGCATCATAGTGAGAAGCTTGCTATTGCTTATGGTCTGATCCGAACGTCTCCTCGAACCACTATTAGAATTTCAAAAAACCTTAGGATGTGTAGGGACTGCCATAATGCAACGAAGTTTATATCACAAGTCTTTGAAAGAATGATTATTGTTAGGGATCGGAACCGTTTTCATCATTTTAAAGATGGCCTTTGCTCCTGTAATGACTATTGGTGAGTCTTTTATAGTTGATGGAACATCCATTGTTAGAGAGAAGTAGATGGAACATCGTGTTGATTGGTTACAATGCCTAAATATAG

mRNA sequence

CCAAACCATTTCAAGATAAAGAGTACTATTTTTTACTCCTCCTTCGTGAAAGAATTCTTTGTCGATTGAAATGTTGGCAGCTCTGTTCCACTGAATTGAACTATCTATAGATGATATATTGCCAAATTTAGAATCTTTCGAGCTTCACTTTCATCTCAAATGGCGTCGATAGTCGGTTGCCTTCCCAATATATCTCTGACTTCCATAACCCAGTTCCCTGAAAACCCAAAATCTTTGATTCTTCAGCAATGCAAAACTCCAAAAGACCTCCAGCAAGTTCACGCTCACCTTCTCAAAACTCGCCGTCTCCTCGACCCCATCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCGACACCATAGATTATGCCCTTTCCATTTTCAACCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCTTTCAAGCGATCGCCTGATAATGCCCTTCTCTTGTTCAAGAAAATGCATGAAAAGTCAGTTCAGCATGACAAATTCACTTTCTCCTCTGTCTTAAAGGCTTGCTCTAGAATGAAAGCGCTGAGGGAAGGCGAACAGGTCCACGCGTTGATTCTGAAATCTGGGTTCAAATCAAATGAGTTTGTCGAGAATACTTTGATTCAGATGTATGCGAATTGTGGACAAATTGGGGTTGCACGTCATGTGTTTGATGGAATGCCGGAAAGAAGCATAGTTGCGTGGAATTCGATGTTGTCTGGTTATACGAAAAATGGGCTTTGGGATGAGGTCGTGAAGCTTTTTCGAAAAATTTTGGAACTGCGTATTGAATTTGATGATGTTACAATGATTAGTGTATTGATGGCTTGTGGAAGATTAGCGAATCTGGAAATAGGGGAGTTGATTGGTGAGTATATTGTGTCAAAAGGGCTAAGACGAAACAATACTCTAACGACTTCGCTGATTGATATGTATGCCAAATGTGGTCAAGTTGATACCGCTAGAAAGTTGTTCGATGAAATGGATAAAAGAGATGTTGTTGCTTGGAGTGCAATGATCTCGGGGTATGCTCAAGCTGATCGATGTAAAGAAGCTCTTAATCTGTTCCATGAGATGCAGAAGGGAAATGTATATCCAAACGAGGTAACAATGGTCAGTGTTCTCTATTCGTGCGCTATGCTTGGAGCATACGAAACAGGTAAGTGGGTTCATTTCTACATCAAAAAGAAGAAGATGAAGCTCACGGTTACTCTTGGAACTCAGCTGATAGATTTTTATGCTAAATGTGGGTATATAGATAGATCAGTTGAAGTTTTCAAGGAAATGTCTTTCAAGAATGTGTTCACATGGACAGCATTAATTCAAGGTCTTGCCAATAATGGAGAAGGGAAAATGGCTCTGGAATTCTTTTCCTCGATGCTAGAGAATGATGTAAAGCCAAATGATGTAACTTTCATTGGCGTTCTGTCTGCTTGTAGCCACGCTTGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATACTTGGACGTGCTGGGTTTCTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCTTCCCTCCCAATGCTGTTGTTTGGAGAACACTATTGGCTTCATGTAGAGCTCATAAAAACATTGAAATGGCAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCACAGTGGAGATTACATTCTTCTGTCAAATACTTATGCATTGGTTGGTAGGGTTGAGGATGCAATCAGGGTAAGATCTTTGATAAAAGAGAAGGAGATTAAGAAGATTCCAGGTTGTAGTTTGATTGAGCTCGATGGTGTTGTACATGAGTTTTTTTCAGAAGATGGAGAACATAAGCACTCCAAGGAAATACATGACGCGTTAGATAAAATGATGAAGCAGATCAAGAGGCTCGGATATGTGCCCAACACAGACGATGCTAGACTGGAGGCTGAGGAAGAGAGCAAAGAAACTTCAGTGTCGCATCATAGTGAGAAGCTTGCTATTGCTTATGGTCTGATCCGAACGTCTCCTCGAACCACTATTAGAATTTCAAAAAACCTTAGGATGTGTAGGGACTGCCATAATGCAACGAAGTTTATATCACAAGTCTTTGAAAGAATGATTATTGTTAGGGATCGGAACCGTTTTCATCATTTTAAAGATGGCCTTTGCTCCTGTAATGACTATTGGTGAGTCTTTTATAGTTGATGGAACATCCATTGTTAGAGAGAAGTAGATGGAACATCGTGTTGATTGGTTACAATGCCTAAATATAG

Coding sequence (CDS)

ATGGCGTCGATAGTCGGTTGCCTTCCCAATATATCTCTGACTTCCATAACCCAGTTCCCTGAAAACCCAAAATCTTTGATTCTTCAGCAATGCAAAACTCCAAAAGACCTCCAGCAAGTTCACGCTCACCTTCTCAAAACTCGCCGTCTCCTCGACCCCATCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCGACACCATAGATTATGCCCTTTCCATTTTCAACCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCTTTCAAGCGATCGCCTGATAATGCCCTTCTCTTGTTCAAGAAAATGCATGAAAAGTCAGTTCAGCATGACAAATTCACTTTCTCCTCTGTCTTAAAGGCTTGCTCTAGAATGAAAGCGCTGAGGGAAGGCGAACAGGTCCACGCGTTGATTCTGAAATCTGGGTTCAAATCAAATGAGTTTGTCGAGAATACTTTGATTCAGATGTATGCGAATTGTGGACAAATTGGGGTTGCACGTCATGTGTTTGATGGAATGCCGGAAAGAAGCATAGTTGCGTGGAATTCGATGTTGTCTGGTTATACGAAAAATGGGCTTTGGGATGAGGTCGTGAAGCTTTTTCGAAAAATTTTGGAACTGCGTATTGAATTTGATGATGTTACAATGATTAGTGTATTGATGGCTTGTGGAAGATTAGCGAATCTGGAAATAGGGGAGTTGATTGGTGAGTATATTGTGTCAAAAGGGCTAAGACGAAACAATACTCTAACGACTTCGCTGATTGATATGTATGCCAAATGTGGTCAAGTTGATACCGCTAGAAAGTTGTTCGATGAAATGGATAAAAGAGATGTTGTTGCTTGGAGTGCAATGATCTCGGGGTATGCTCAAGCTGATCGATGTAAAGAAGCTCTTAATCTGTTCCATGAGATGCAGAAGGGAAATGTATATCCAAACGAGGTAACAATGGTCAGTGTTCTCTATTCGTGCGCTATGCTTGGAGCATACGAAACAGGTAAGTGGGTTCATTTCTACATCAAAAAGAAGAAGATGAAGCTCACGGTTACTCTTGGAACTCAGCTGATAGATTTTTATGCTAAATGTGGGTATATAGATAGATCAGTTGAAGTTTTCAAGGAAATGTCTTTCAAGAATGTGTTCACATGGACAGCATTAATTCAAGGTCTTGCCAATAATGGAGAAGGGAAAATGGCTCTGGAATTCTTTTCCTCGATGCTAGAGAATGATGTAAAGCCAAATGATGTAACTTTCATTGGCGTTCTGTCTGCTTGTAGCCACGCTTGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATACTTGGACGTGCTGGGTTTCTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCTTCCCTCCCAATGCTGTTGTTTGGAGAACACTATTGGCTTCATGTAGAGCTCATAAAAACATTGAAATGGCAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCACAGTGGAGATTACATTCTTCTGTCAAATACTTATGCATTGGTTGGTAGGGTTGAGGATGCAATCAGGGTAAGATCTTTGATAAAAGAGAAGGAGATTAAGAAGATTCCAGGTTGTAGTTTGATTGAGCTCGATGGTGTTGTACATGAGTTTTTTTCAGAAGATGGAGAACATAAGCACTCCAAGGAAATACATGACGCGTTAGATAAAATGATGAAGCAGATCAAGAGGCTCGGATATGTGCCCAACACAGACGATGCTAGACTGGAGGCTGAGGAAGAGAGCAAAGAAACTTCAGTGTCGCATCATAGTGAGAAGCTTGCTATTGCTTATGGTCTGATCCGAACGTCTCCTCGAACCACTATTAGAATTTCAAAAAACCTTAGGATGTGTAGGGACTGCCATAATGCAACGAAGTTTATATCACAAGTCTTTGAAAGAATGATTATTGTTAGGGATCGGAACCGTTTTCATCATTTTAAAGATGGCCTTTGCTCCTGTAATGACTATTGGTGA

Protein sequence

MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW*
Homology
BLAST of CsGy1G001040 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 588.2 bits (1515), Expect = 1.2e-166
Identity = 288/726 (39.67%), Postives = 449/726 (61.85%), Query Frame = 0

Query: 8   LPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLK-----TRRLLDPIITEAVLES 67
           LP+ S         +P   +L  CKT + L+ +HA ++K     T   L  +I   +L  
Sbjct: 20  LPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSP 79

Query: 68  AALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKF 127
               LP    YA+S+F  I +P    +N M RG A    P +AL L+  M    +  + +
Sbjct: 80  HFEGLP----YAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSY 139

Query: 128 TFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQM----------------- 187
           TF  VLK+C++ KA +EG+Q+H  +LK G   + +V  +LI M                 
Sbjct: 140 TFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKS 199

Query: 188 --------------YANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFR 247
                         YA+ G I  A+ +FD +P + +V+WN+M+SGY + G + E ++LF+
Sbjct: 200 PHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFK 259

Query: 248 KILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCG 307
            +++  +  D+ TM++V+ AC +  ++E+G  +  +I   G   N  +  +LID+Y+KCG
Sbjct: 260 DMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCG 319

Query: 308 QVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLY 367
           +++TA  LF+ +  +DV++W+ +I GY   +  KEAL LF EM +    PN+VTM+S+L 
Sbjct: 320 ELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILP 379

Query: 368 SCAMLGAYETGKWVHFYIKKKKMKLT--VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNV 427
           +CA LGA + G+W+H YI K+   +T   +L T LID YAKCG I+ + +VF  +  K++
Sbjct: 380 ACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSL 439

Query: 428 FTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNS 487
            +W A+I G A +G    + + FS M +  ++P+D+TF+G+LSACSH+ ++D GRH+F +
Sbjct: 440 SSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRT 499

Query: 488 MRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIE 547
           M +D+ + P++EHYGCM+D+LG +G  +EA + I+ M   P+ V+W +LL +C+ H N+E
Sbjct: 500 MTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVE 559

Query: 548 MAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELD 607
           + E   E++ ++EP + G Y+LLSN YA  GR  +  + R+L+ +K +KK+PGCS IE+D
Sbjct: 560 LGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEID 619

Query: 608 GVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSE 667
            VVHEF   D  H  ++EI+  L++M   +++ G+VP+T +   E EEE KE ++ HHSE
Sbjct: 620 SVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSE 679

Query: 668 KLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLC 696
           KLAIA+GLI T P T + I KNLR+CR+CH ATK IS++++R II RDR RFHHF+DG+C
Sbjct: 680 KLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVC 739

BLAST of CsGy1G001040 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 567.8 bits (1462), Expect = 1.7e-160
Identity = 280/703 (39.83%), Postives = 432/703 (61.45%), Query Frame = 0

Query: 27  ILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPES 86
           ++++C + + L+Q H H+++T    DP     +   AAL    +++YA  +F+ I KP S
Sbjct: 36  LIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNS 95

Query: 87  SAYNVMIRGLAFKRSPDNALLLFKKM-HEKSVQHDKFTFSSVLKACSRMKALREGEQVHA 146
            A+N +IR  A    P  ++  F  M  E     +K+TF  ++KA + + +L  G+ +H 
Sbjct: 96  FAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHG 155

Query: 147 LILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDE 206
           + +KS   S+ FV N+LI  Y +CG +  A  VF  + E+ +V+WNSM++G+ + G  D+
Sbjct: 156 MAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDK 215

Query: 207 VVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLID 266
            ++LF+K+    ++   VTM+ VL AC ++ NLE G  +  YI    +  N TL  +++D
Sbjct: 216 ALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLD 275

Query: 267 MYAKCGQVDTARKLFDEMD-------------------------------KRDVVAWSAM 326
           MY KCG ++ A++LFD M+                               ++D+VAW+A+
Sbjct: 276 MYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNAL 335

Query: 327 ISGYAQADRCKEALNLFHEMQ-KGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKK 386
           IS Y Q  +  EAL +FHE+Q + N+  N++T+VS L +CA +GA E G+W+H YIKK  
Sbjct: 336 ISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHG 395

Query: 387 MKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS 446
           +++   + + LI  Y+KCG +++S EVF  +  ++VF W+A+I GLA +G G  A++ F 
Sbjct: 396 IRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFY 455

Query: 447 SMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRA 506
            M E +VKPN VTF  V  ACSH  LVD+   LF+ M  ++ I P  +HY C+VD+LGR+
Sbjct: 456 KMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRS 515

Query: 507 GFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLS 566
           G+LE+A +FI+ MP PP+  VW  LL +C+ H N+ +AE +   +  LEP + G ++LLS
Sbjct: 516 GYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLS 575

Query: 567 NTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD 626
           N YA +G+ E+   +R  ++   +KK PGCS IE+DG++HEF S D  H  S++++  L 
Sbjct: 576 NIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLH 635

Query: 627 KMMKQIKRLGYVPNTDDA-RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNL 686
           ++M+++K  GY P      ++  EEE KE S++ HSEKLAI YGLI T     IR+ KNL
Sbjct: 636 EVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNL 695

Query: 687 RMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           R+C DCH+  K ISQ+++R IIVRDR RFHHF++G CSCND+W
Sbjct: 696 RVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of CsGy1G001040 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 541.2 bits (1393), Expect = 1.7e-152
Identity = 270/670 (40.30%), Postives = 398/670 (59.40%), Query Frame = 0

Query: 28  LQQCKTPKDLQQVHAHLLKTRRLLDP-IITEAVLESAALLLPDTIDYALSIFNHIDKPES 87
           LQ+C   ++L+Q+HA +LKT  + D   IT+ +    +    D + YA  +F+  D+P++
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 88  SAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHAL 147
             +N+MIRG +    P+ +LLL+++M   S  H+ +TF S+LKACS + A  E  Q+HA 
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQ 140

Query: 148 ILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEV 207
           I K G++++ +  N+LI  YA  G   +A  +FD +PE   V+WNS++ GY K G  D  
Sbjct: 141 ITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIA 200

Query: 208 VKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDM 267
           + LFRK+ E                                                   
Sbjct: 201 LTLFRKMAE--------------------------------------------------- 260

Query: 268 YAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTM 327
                              ++ ++W+ MISGY QAD  KEAL LFHEMQ  +V P+ V++
Sbjct: 261 -------------------KNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSL 320

Query: 328 VSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSF 387
            + L +CA LGA E GKW+H Y+ K ++++   LG  LID YAKCG ++ ++EVFK +  
Sbjct: 321 ANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKK 380

Query: 388 KNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLVDQGRHL 447
           K+V  WTALI G A +G G+ A+  F  M +  +KPN +TF  VL+ACS+  LV++G+ +
Sbjct: 381 KSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLI 440

Query: 448 FNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHK 507
           F SM RD++++P IEHYGC+VD+LGRAG L+EA +FI  MP  PNAV+W  LL +CR HK
Sbjct: 441 FYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHK 500

Query: 508 NIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLI 567
           NIE+ E+  E +  ++P H G Y+  +N +A+  + + A   R L+KE+ + K+PGCS I
Sbjct: 501 NIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI 560

Query: 568 ELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLE-AEEESKETSVS 627
            L+G  HEF + D  H   ++I      M ++++  GYVP  ++  L+  +++ +E  V 
Sbjct: 561 SLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVH 620

Query: 628 HHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFK 687
            HSEKLAI YGLI+T P T IRI KNLR+C+DCH  TK IS++++R I++RDR RFHHF+
Sbjct: 621 QHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFR 620

Query: 688 DGLCSCNDYW 696
           DG CSC DYW
Sbjct: 681 DGKCSCGDYW 620

BLAST of CsGy1G001040 vs. ExPASy Swiss-Prot
Match: O23337 (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 533.1 bits (1372), Expect = 4.7e-150
Identity = 268/708 (37.85%), Postives = 423/708 (59.75%), Query Frame = 0

Query: 28  LQQCKTPKDLQQVHAHLLKT--RRLLDPIITEAVLESAALLLPDTIDYALSIFNHI-DKP 87
           L  CK+   ++Q+HAH+L+T     L+  +    + S+++     + YAL++F+ I   P
Sbjct: 19  LSFCKSLNHIKQLHAHILRTVINHKLNSFLFNLSVSSSSI----NLSYALNVFSSIPSPP 78

Query: 88  ESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVH 147
           ES  +N  +R L+    P   +L ++++     + D+F+F  +LKA S++ AL EG ++H
Sbjct: 79  ESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGMELH 138

Query: 148 ALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWD 207
            +  K     + FVE   + MYA+CG+I  AR+VFD M  R +V WN+M+  Y + GL D
Sbjct: 139 GVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFGLVD 198

Query: 208 EVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLI 267
           E  KLF ++ +  +  D++ + +++ ACGR  N+     I E+++   +R +  L T+L+
Sbjct: 199 EAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLTALV 258

Query: 268 DMYA-------------------------------KCGQVDTARKLFDEMDKRDVVAWSA 327
            MYA                               KCG++D A+ +FD+ +K+D+V W+ 
Sbjct: 259 TMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVCWTT 318

Query: 328 MISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKK 387
           MIS Y ++D  +EAL +F EM    + P+ V+M SV+ +CA LG  +  KWVH  I    
Sbjct: 319 MISAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWVHSCIHVNG 378

Query: 388 MKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS 447
           ++  +++   LI+ YAKCG +D + +VF++M  +NV +W+++I  L+ +GE   AL  F+
Sbjct: 379 LESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDALSLFA 438

Query: 448 SMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRA 507
            M + +V+PN+VTF+GVL  CSH+ LV++G+ +F SM  +++I P++EHYGCMVD+ GRA
Sbjct: 439 RMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDLFGRA 498

Query: 508 GFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLS 567
             L EA + I++MP   N V+W +L+++CR H  +E+ + + + I  LEP H G  +L+S
Sbjct: 499 NLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGALVLMS 558

Query: 568 NTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD 627
           N YA   R ED   +R +++EK + K  G S I+ +G  HEF   D  HK S EI+  LD
Sbjct: 559 NIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIYAKLD 618

Query: 628 KMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRT------TIR 687
           +++ ++K  GYVP+     ++ EEE K+  V  HSEKLA+ +GL+             IR
Sbjct: 619 EVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEEEKDSCGVIR 678

Query: 688 ISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           I KNLR+C DCH   K +S+V+ER IIVRDR RFH +K+GLCSC DYW
Sbjct: 679 IVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of CsGy1G001040 vs. ExPASy Swiss-Prot
Match: Q9SR82 (Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H84 PE=3 SV=1)

HSP 1 Score: 526.6 bits (1355), Expect = 4.4e-148
Identity = 264/685 (38.54%), Postives = 409/685 (59.71%), Query Frame = 0

Query: 11  ISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDT 70
           +++ S T   +  K+LI   C T   L+Q+H  L+      D  +   +L+         
Sbjct: 4   VTVPSATSKVQQIKTLISVAC-TVNHLKQIHVSLINHHLHHDTFLVNLLLKRTLFFRQTK 63

Query: 71  IDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKA 130
             Y L  F+H   P    YN +I G          L LF  + +  +    FTF  VLKA
Sbjct: 64  YSYLL--FSHTQFPNIFLYNSLINGFVNNHLFHETLDLFLSIRKHGLYLHGFTFPLVLKA 123

Query: 131 CSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAW 190
           C+R  + + G  +H+L++K GF  +     +L+ +Y+  G++  A  +FD +P+RS+V W
Sbjct: 124 CTRASSRKLGIDLHSLVVKCGFNHDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSVVTW 183

Query: 191 NSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVS 250
            ++ SGYT +G   E + LF+K++E+ ++ D   ++ VL AC  + +L+ GE I +Y+  
Sbjct: 184 TALFSGYTTSGRHREAIDLFKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKYMEE 243

Query: 251 KGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNL 310
             +++N+ + T+L+++YAKCG+++ AR +FD M ++D+V WS MI GYA     KE + L
Sbjct: 244 MEMQKNSFVRTTLVNLYAKCGKMEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEGIEL 303

Query: 311 FHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAK 370
           F +M + N+ P++ ++V  L SCA LGA + G+W    I + +    + +   LID YAK
Sbjct: 304 FLQMLQENLKPDQFSIVGFLSSCASLGALDLGEWGISLIDRHEFLTNLFMANALIDMYAK 363

Query: 371 CGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGV 430
           CG + R  EVFKEM  K++    A I GLA NG  K++   F    +  + P+  TF+G+
Sbjct: 364 CGAMARGFEVFKEMKEKDIVIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTFLGL 423

Query: 431 LSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPP 490
           L  C HA L+  G   FN++   + ++  +EHYGCMVD+ GRAG L++AY+ I +MP  P
Sbjct: 424 LCGCVHAGLIQDGLRFFNAISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMPMRP 483

Query: 491 NAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRS 550
           NA+VW  LL+ CR  K+ ++AE  L+ +  LEP ++G+Y+ LSN Y++ GR ++A  VR 
Sbjct: 484 NAIVWGALLSGCRLVKDTQLAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAEVRD 543

Query: 551 LIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDD 610
           ++ +K +KKIPG S IEL+G VHEF ++D  H  S +I+  L+ +  +++ +G+VP T+ 
Sbjct: 544 MMNKKGMKKIPGYSWIELEGKVHEFLADDKSHPLSDKIYAKLEDLGNEMRLMGFVPTTEF 603

Query: 611 ARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFE 670
              + EEE KE  + +HSEKLA+A GLI T     IR+ KNLR+C DCH   K IS++  
Sbjct: 604 VFFDVEEEEKERVLGYHSEKLAVALGLISTDHGQVIRVVKNLRVCGDCHEVMKLISKITR 663

Query: 671 RMIIVRDRNRFHHFKDGLCSCNDYW 696
           R I+VRD NRFH F +G CSCNDYW
Sbjct: 664 REIVVRDNNRFHCFTNGSCSCNDYW 685

BLAST of CsGy1G001040 vs. NCBI nr
Match: XP_004138266.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sativus] >KGN63534.1 hypothetical protein Csa_013924 [Cucumis sativus])

HSP 1 Score: 1394 bits (3608), Expect = 0.0
Identity = 695/695 (100.00%), Postives = 695/695 (100.00%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL 60
           MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL
Sbjct: 1   MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL 60

Query: 61  ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD 120
           ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD
Sbjct: 61  ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD 120

Query: 121 KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD 180
           KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD
Sbjct: 121 KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD 180

Query: 181 GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI 240
           GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI
Sbjct: 181 GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI 240

Query: 241 GELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ 300
           GELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ
Sbjct: 241 GELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ 300

Query: 301 ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTL 360
           ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTL
Sbjct: 301 ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTL 360

Query: 361 GTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV 420
           GTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV
Sbjct: 361 GTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV 420

Query: 421 KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY 480
           KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY
Sbjct: 421 KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY 480

Query: 481 QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG 540
           QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG
Sbjct: 481 QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG 540

Query: 541 RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK 600
           RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK
Sbjct: 541 RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK 600

Query: 601 RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHN 660
           RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHN
Sbjct: 601 RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHN 660

Query: 661 ATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695
           ATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
Sbjct: 661 ATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695

BLAST of CsGy1G001040 vs. NCBI nr
Match: XP_016903201.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucumis melo] >TYJ98502.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1328 bits (3436), Expect = 0.0
Identity = 663/698 (94.99%), Postives = 677/698 (96.99%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSITQ---FPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIVGCLP  SLTSITQ   FPENPKSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITE
Sbjct: 1   MASIVGCLPITSLTSITQISQFPENPKSLILQQCKTPKDLRQVHAHLLKTRRLLDPIITE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHE SV
Sbjct: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH
Sbjct: 121 QHDKFTFSSVLKACSRMRGLKEGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGMPER IVAWNSMLSGYTKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLAN
Sbjct: 181 VFDGMPERGIVAWNSMLSGYTKNGLWDEVVKLFQKILELNIGFDDVTMISVLMACGRLAN 240

Query: 241 LEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300
           LE+GELIGEYIVSKGLRRNNTL TSLIDMYAKCG++DTARKLF+EMDKRDVVAWSAMISG
Sbjct: 241 LEMGELIGEYIVSKGLRRNNTLITSLIDMYAKCGRIDTARKLFNEMDKRDVVAWSAMISG 300

Query: 301 YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT 360
           YAQADRCKEALNLFHEMQKGNV PNEVTMVSVLYSCAMLGAY+TGKWVHFYIKKKKMKLT
Sbjct: 301 YAQADRCKEALNLFHEMQKGNVDPNEVTMVSVLYSCAMLGAYQTGKWVHFYIKKKKMKLT 360

Query: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
           VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS MLE
Sbjct: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSLMLE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           NDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE
Sbjct: 421 NDVKPNDVTFIGVLSACSHACLVDQGRNLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFID+MPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYA
Sbjct: 481 EAYQFIDSMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPTHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600
           LVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Sbjct: 541 LVGRVEDAIRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600

Query: 601 QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRD 660
           QIK LGYVPN + ARLEAEEE+KETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMC D
Sbjct: 601 QIKTLGYVPNIEGARLEAEEENKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCGD 660

Query: 661 CHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695
           CHNATK+ISQ FERMIIVRDRNRFHHFKDGLCSC DYW
Sbjct: 661 CHNATKYISQAFERMIIVRDRNRFHHFKDGLCSCKDYW 698

BLAST of CsGy1G001040 vs. NCBI nr
Match: KAA0057818.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1326 bits (3431), Expect = 0.0
Identity = 662/698 (94.84%), Postives = 676/698 (96.85%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSITQ---FPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIVGCLP  SLTSITQ   FPENPKSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITE
Sbjct: 1   MASIVGCLPITSLTSITQISQFPENPKSLILQQCKTPKDLRQVHAHLLKTRRLLDPIITE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHE SV
Sbjct: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH
Sbjct: 121 QHDKFTFSSVLKACSRMRGLKEGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGMPER IVAWNSMLSGYTKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLAN
Sbjct: 181 VFDGMPERGIVAWNSMLSGYTKNGLWDEVVKLFQKILELNIGFDDVTMISVLMACGRLAN 240

Query: 241 LEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300
           LE+GELIGEYIVSKGLRRNNTL TSLIDMYAKCG++DTARKLF+EMDKRDVVAWSAMISG
Sbjct: 241 LEMGELIGEYIVSKGLRRNNTLMTSLIDMYAKCGRIDTARKLFNEMDKRDVVAWSAMISG 300

Query: 301 YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT 360
           YAQADRCKEALNLFHEMQKGNV PNEVTMVSVLYSCAMLGAY+TGKWVHFYIKKKKMKLT
Sbjct: 301 YAQADRCKEALNLFHEMQKGNVDPNEVTMVSVLYSCAMLGAYQTGKWVHFYIKKKKMKLT 360

Query: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
           VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFF  MLE
Sbjct: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFCLMLE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           NDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE
Sbjct: 421 NDVKPNDVTFIGVLSACSHACLVDQGRNLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFID+MPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYA
Sbjct: 481 EAYQFIDSMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPTHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600
           LVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Sbjct: 541 LVGRVEDAIRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600

Query: 601 QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRD 660
           QIK LGYVPN + ARLEAEEE+KETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMC D
Sbjct: 601 QIKTLGYVPNIEGARLEAEEENKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCGD 660

Query: 661 CHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695
           CHNATK+ISQ FERMIIVRDRNRFHHFKDGLCSC DYW
Sbjct: 661 CHNATKYISQAFERMIIVRDRNRFHHFKDGLCSCKDYW 698

BLAST of CsGy1G001040 vs. NCBI nr
Match: KAG7023094.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1247 bits (3227), Expect = 0.0
Identity = 621/699 (88.84%), Postives = 659/699 (94.28%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIV CLPNIS+TSIT   QFPENPKSLILQ+CKTPKDL+QVHAHLLKTRRL DP I E
Sbjct: 1   MASIVACLPNISVTSITHLSQFPENPKSLILQRCKTPKDLRQVHAHLLKTRRLQDPTIAE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLP++IDYALSIFNH+DKPESSAYNVMIRGLAFK+SP NA+LLFKKMHE SV
Sbjct: 61  AVLESAALLLPNSIDYALSIFNHMDKPESSAYNVMIRGLAFKQSPHNAVLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ALREGEQVHALILKSGFKSNEFVENTLI MYANCGQ+GVAR 
Sbjct: 121 QHDKFTFSSVLKACSRMRALREGEQVHALILKSGFKSNEFVENTLIHMYANCGQVGVARQ 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGM +R+ VAWNSMLSGYTKNGLWDEVVKLFRK+LEL IEFDDVTMISVLMACGRLA+
Sbjct: 181 VFDGMSQRATVAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLAD 240

Query: 241 LEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300
           LE+GELIGEYI+SKG+RRN+TLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
Sbjct: 241 LELGELIGEYILSKGIRRNSTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300

Query: 301 YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT 360
           YAQADRCKEAL+LFHEMQK  V  NEVTMVSVLYSCA+LGAYETGKWVH YIK+KKMKLT
Sbjct: 301 YAQADRCKEALDLFHEMQKAKVDANEVTMVSVLYSCAVLGAYETGKWVHSYIKRKKMKLT 360

Query: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
           V+LGTQLIDFYAKCGYIDRSVEVF+ M F NVFTWTALIQGLANNGEGKMAL+FF+ M E
Sbjct: 361 VSLGTQLIDFYAKCGYIDRSVEVFRAMPFANVFTWTALIQGLANNGEGKMALDFFALMRE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           N+VKPNDVTFI VLSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAG LE
Sbjct: 421 NNVKPNDVTFIAVLSACSHACLVDQGRHLFNSMRRGFDIEPRIEHYGCMVDILGRAGLLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFI NMP PPNAVVWRTLLASC+AHKN+EMAEKS +HIT LEPAHSGDYILLSNTYA
Sbjct: 481 EAYQFIANMPIPPNAVVWRTLLASCKAHKNVEMAEKSFDHITLLEPAHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600
           LVGRVEDA+RVRSLIK+KEIKK PGCSLIELDGVVHEFFSEDG+H HSKEIHDALD+MMK
Sbjct: 541 LVGRVEDALRVRSLIKDKEIKKTPGCSLIELDGVVHEFFSEDGDHTHSKEIHDALDEMMK 600

Query: 601 QIKRLGYVPNTDDARLEAEEE-SKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCR 660
           +IK LGYVPN +DARLEAEEE SKETSVSHHSEKLAIAYGLIRT  +TTIRISKNLRMCR
Sbjct: 601 RIKSLGYVPNMEDARLEAEEEESKETSVSHHSEKLAIAYGLIRTPLQTTIRISKNLRMCR 660

Query: 661 DCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695
           DCHNATK ISQV++R IIVRDRNRFHHFKDGLCSCNDYW
Sbjct: 661 DCHNATKVISQVYKRTIIVRDRNRFHHFKDGLCSCNDYW 699

BLAST of CsGy1G001040 vs. NCBI nr
Match: XP_022921781.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 1245 bits (3222), Expect = 0.0
Identity = 620/699 (88.70%), Postives = 658/699 (94.13%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIV CLPNIS+TSIT   QFPENPKSLILQ+CKTPKDL+QVHAHLLKTRRL DP I E
Sbjct: 1   MASIVACLPNISVTSITHLSQFPENPKSLILQRCKTPKDLRQVHAHLLKTRRLQDPTIAE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLP++IDYALSIFNH+DKPESSAYNVMIRGLAFK+SP NA+LLFKKMHE SV
Sbjct: 61  AVLESAALLLPNSIDYALSIFNHMDKPESSAYNVMIRGLAFKQSPHNAVLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ALREGEQVHALILKSGFK NEFVENTLI MYANCGQ+GVAR 
Sbjct: 121 QHDKFTFSSVLKACSRMRALREGEQVHALILKSGFKPNEFVENTLIHMYANCGQVGVARQ 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGM +R+ VAWNSMLSGYTKNGLWDEVVKLFRK+LEL IEFDDVTMISVLMACGRLA+
Sbjct: 181 VFDGMSQRATVAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLAD 240

Query: 241 LEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300
           LE+GELIGEYI+SKG+RRN+TLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
Sbjct: 241 LELGELIGEYILSKGIRRNSTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300

Query: 301 YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT 360
           YAQADRCKEAL+LFHEMQK  V  NEVTMVSVLYSCA+LGAYETGKWVH YIK+KKMKLT
Sbjct: 301 YAQADRCKEALDLFHEMQKAKVDANEVTMVSVLYSCAVLGAYETGKWVHSYIKRKKMKLT 360

Query: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
           V+LGTQLIDFYAKCGYIDRSVEVF+ M F NVFTWTALIQGLANNGEGKMAL+FF+ M E
Sbjct: 361 VSLGTQLIDFYAKCGYIDRSVEVFRAMPFANVFTWTALIQGLANNGEGKMALDFFALMRE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           N+VKPNDVTFI VLSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAG LE
Sbjct: 421 NNVKPNDVTFIAVLSACSHACLVDQGRHLFNSMRRGFDIEPRIEHYGCMVDILGRAGLLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFI NMP PPNAVVWRTLLASC+AHKN+EMAEKS +HIT LEPAHSGDYILLSNTYA
Sbjct: 481 EAYQFIANMPIPPNAVVWRTLLASCKAHKNVEMAEKSFDHITLLEPAHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600
           LVGRVEDA+RVRSLIK+KEIKK PGCSLIELDGVVHEFFSEDG+H HSKEIHDALD+MMK
Sbjct: 541 LVGRVEDALRVRSLIKDKEIKKTPGCSLIELDGVVHEFFSEDGDHTHSKEIHDALDEMMK 600

Query: 601 QIKRLGYVPNTDDARLEAEEE-SKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCR 660
           +IK LGYVPN +DARLEAEEE SKETSVSHHSEKLAIAYGLIRT  +TTIRISKNLRMCR
Sbjct: 601 RIKSLGYVPNMEDARLEAEEEESKETSVSHHSEKLAIAYGLIRTPLQTTIRISKNLRMCR 660

Query: 661 DCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695
           DCHNATK ISQV++R IIVRDRNRFHHFKDGLCSCNDYW
Sbjct: 661 DCHNATKVISQVYKRTIIVRDRNRFHHFKDGLCSCNDYW 699

BLAST of CsGy1G001040 vs. ExPASy TrEMBL
Match: A0A0A0LRD6 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G003520 PE=3 SV=1)

HSP 1 Score: 1394 bits (3608), Expect = 0.0
Identity = 695/695 (100.00%), Postives = 695/695 (100.00%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL 60
           MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL
Sbjct: 1   MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL 60

Query: 61  ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD 120
           ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD
Sbjct: 61  ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD 120

Query: 121 KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD 180
           KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD
Sbjct: 121 KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD 180

Query: 181 GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI 240
           GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI
Sbjct: 181 GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI 240

Query: 241 GELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ 300
           GELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ
Sbjct: 241 GELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ 300

Query: 301 ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTL 360
           ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTL
Sbjct: 301 ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTL 360

Query: 361 GTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV 420
           GTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV
Sbjct: 361 GTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV 420

Query: 421 KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY 480
           KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY
Sbjct: 421 KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY 480

Query: 481 QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG 540
           QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG
Sbjct: 481 QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG 540

Query: 541 RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK 600
           RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK
Sbjct: 541 RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK 600

Query: 601 RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHN 660
           RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHN
Sbjct: 601 RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHN 660

Query: 661 ATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695
           ATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
Sbjct: 661 ATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695

BLAST of CsGy1G001040 vs. ExPASy TrEMBL
Match: A0A5D3BFH8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold350G001050 PE=3 SV=1)

HSP 1 Score: 1328 bits (3436), Expect = 0.0
Identity = 663/698 (94.99%), Postives = 677/698 (96.99%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSITQ---FPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIVGCLP  SLTSITQ   FPENPKSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITE
Sbjct: 1   MASIVGCLPITSLTSITQISQFPENPKSLILQQCKTPKDLRQVHAHLLKTRRLLDPIITE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHE SV
Sbjct: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH
Sbjct: 121 QHDKFTFSSVLKACSRMRGLKEGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGMPER IVAWNSMLSGYTKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLAN
Sbjct: 181 VFDGMPERGIVAWNSMLSGYTKNGLWDEVVKLFQKILELNIGFDDVTMISVLMACGRLAN 240

Query: 241 LEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300
           LE+GELIGEYIVSKGLRRNNTL TSLIDMYAKCG++DTARKLF+EMDKRDVVAWSAMISG
Sbjct: 241 LEMGELIGEYIVSKGLRRNNTLITSLIDMYAKCGRIDTARKLFNEMDKRDVVAWSAMISG 300

Query: 301 YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT 360
           YAQADRCKEALNLFHEMQKGNV PNEVTMVSVLYSCAMLGAY+TGKWVHFYIKKKKMKLT
Sbjct: 301 YAQADRCKEALNLFHEMQKGNVDPNEVTMVSVLYSCAMLGAYQTGKWVHFYIKKKKMKLT 360

Query: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
           VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS MLE
Sbjct: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSLMLE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           NDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE
Sbjct: 421 NDVKPNDVTFIGVLSACSHACLVDQGRNLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFID+MPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYA
Sbjct: 481 EAYQFIDSMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPTHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600
           LVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Sbjct: 541 LVGRVEDAIRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600

Query: 601 QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRD 660
           QIK LGYVPN + ARLEAEEE+KETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMC D
Sbjct: 601 QIKTLGYVPNIEGARLEAEEENKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCGD 660

Query: 661 CHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695
           CHNATK+ISQ FERMIIVRDRNRFHHFKDGLCSC DYW
Sbjct: 661 CHNATKYISQAFERMIIVRDRNRFHHFKDGLCSCKDYW 698

BLAST of CsGy1G001040 vs. ExPASy TrEMBL
Match: A0A1S4E4P2 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103502371 PE=3 SV=1)

HSP 1 Score: 1328 bits (3436), Expect = 0.0
Identity = 663/698 (94.99%), Postives = 677/698 (96.99%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSITQ---FPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIVGCLP  SLTSITQ   FPENPKSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITE
Sbjct: 1   MASIVGCLPITSLTSITQISQFPENPKSLILQQCKTPKDLRQVHAHLLKTRRLLDPIITE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHE SV
Sbjct: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH
Sbjct: 121 QHDKFTFSSVLKACSRMRGLKEGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGMPER IVAWNSMLSGYTKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLAN
Sbjct: 181 VFDGMPERGIVAWNSMLSGYTKNGLWDEVVKLFQKILELNIGFDDVTMISVLMACGRLAN 240

Query: 241 LEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300
           LE+GELIGEYIVSKGLRRNNTL TSLIDMYAKCG++DTARKLF+EMDKRDVVAWSAMISG
Sbjct: 241 LEMGELIGEYIVSKGLRRNNTLITSLIDMYAKCGRIDTARKLFNEMDKRDVVAWSAMISG 300

Query: 301 YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT 360
           YAQADRCKEALNLFHEMQKGNV PNEVTMVSVLYSCAMLGAY+TGKWVHFYIKKKKMKLT
Sbjct: 301 YAQADRCKEALNLFHEMQKGNVDPNEVTMVSVLYSCAMLGAYQTGKWVHFYIKKKKMKLT 360

Query: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
           VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS MLE
Sbjct: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSLMLE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           NDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE
Sbjct: 421 NDVKPNDVTFIGVLSACSHACLVDQGRNLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFID+MPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYA
Sbjct: 481 EAYQFIDSMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPTHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600
           LVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Sbjct: 541 LVGRVEDAIRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600

Query: 601 QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRD 660
           QIK LGYVPN + ARLEAEEE+KETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMC D
Sbjct: 601 QIKTLGYVPNIEGARLEAEEENKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCGD 660

Query: 661 CHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695
           CHNATK+ISQ FERMIIVRDRNRFHHFKDGLCSC DYW
Sbjct: 661 CHNATKYISQAFERMIIVRDRNRFHHFKDGLCSCKDYW 698

BLAST of CsGy1G001040 vs. ExPASy TrEMBL
Match: A0A5A7URN2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G001020 PE=3 SV=1)

HSP 1 Score: 1326 bits (3431), Expect = 0.0
Identity = 662/698 (94.84%), Postives = 676/698 (96.85%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSITQ---FPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIVGCLP  SLTSITQ   FPENPKSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITE
Sbjct: 1   MASIVGCLPITSLTSITQISQFPENPKSLILQQCKTPKDLRQVHAHLLKTRRLLDPIITE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHE SV
Sbjct: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH
Sbjct: 121 QHDKFTFSSVLKACSRMRGLKEGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGMPER IVAWNSMLSGYTKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLAN
Sbjct: 181 VFDGMPERGIVAWNSMLSGYTKNGLWDEVVKLFQKILELNIGFDDVTMISVLMACGRLAN 240

Query: 241 LEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300
           LE+GELIGEYIVSKGLRRNNTL TSLIDMYAKCG++DTARKLF+EMDKRDVVAWSAMISG
Sbjct: 241 LEMGELIGEYIVSKGLRRNNTLMTSLIDMYAKCGRIDTARKLFNEMDKRDVVAWSAMISG 300

Query: 301 YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT 360
           YAQADRCKEALNLFHEMQKGNV PNEVTMVSVLYSCAMLGAY+TGKWVHFYIKKKKMKLT
Sbjct: 301 YAQADRCKEALNLFHEMQKGNVDPNEVTMVSVLYSCAMLGAYQTGKWVHFYIKKKKMKLT 360

Query: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
           VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFF  MLE
Sbjct: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFCLMLE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           NDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE
Sbjct: 421 NDVKPNDVTFIGVLSACSHACLVDQGRNLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFID+MPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYA
Sbjct: 481 EAYQFIDSMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPTHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600
           LVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Sbjct: 541 LVGRVEDAIRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600

Query: 601 QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRD 660
           QIK LGYVPN + ARLEAEEE+KETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMC D
Sbjct: 601 QIKTLGYVPNIEGARLEAEEENKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCGD 660

Query: 661 CHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695
           CHNATK+ISQ FERMIIVRDRNRFHHFKDGLCSC DYW
Sbjct: 661 CHNATKYISQAFERMIIVRDRNRFHHFKDGLCSCKDYW 698

BLAST of CsGy1G001040 vs. ExPASy TrEMBL
Match: A0A6J1E2C2 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111429933 PE=3 SV=1)

HSP 1 Score: 1245 bits (3222), Expect = 0.0
Identity = 620/699 (88.70%), Postives = 658/699 (94.13%), Query Frame = 0

Query: 1   MASIVGCLPNISLTSIT---QFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIV CLPNIS+TSIT   QFPENPKSLILQ+CKTPKDL+QVHAHLLKTRRL DP I E
Sbjct: 1   MASIVACLPNISVTSITHLSQFPENPKSLILQRCKTPKDLRQVHAHLLKTRRLQDPTIAE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLP++IDYALSIFNH+DKPESSAYNVMIRGLAFK+SP NA+LLFKKMHE SV
Sbjct: 61  AVLESAALLLPNSIDYALSIFNHMDKPESSAYNVMIRGLAFKQSPHNAVLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ALREGEQVHALILKSGFK NEFVENTLI MYANCGQ+GVAR 
Sbjct: 121 QHDKFTFSSVLKACSRMRALREGEQVHALILKSGFKPNEFVENTLIHMYANCGQVGVARQ 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGM +R+ VAWNSMLSGYTKNGLWDEVVKLFRK+LEL IEFDDVTMISVLMACGRLA+
Sbjct: 181 VFDGMSQRATVAWNSMLSGYTKNGLWDEVVKLFRKMLELHIEFDDVTMISVLMACGRLAD 240

Query: 241 LEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300
           LE+GELIGEYI+SKG+RRN+TLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG
Sbjct: 241 LELGELIGEYILSKGIRRNSTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300

Query: 301 YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT 360
           YAQADRCKEAL+LFHEMQK  V  NEVTMVSVLYSCA+LGAYETGKWVH YIK+KKMKLT
Sbjct: 301 YAQADRCKEALDLFHEMQKAKVDANEVTMVSVLYSCAVLGAYETGKWVHSYIKRKKMKLT 360

Query: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
           V+LGTQLIDFYAKCGYIDRSVEVF+ M F NVFTWTALIQGLANNGEGKMAL+FF+ M E
Sbjct: 361 VSLGTQLIDFYAKCGYIDRSVEVFRAMPFANVFTWTALIQGLANNGEGKMALDFFALMRE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           N+VKPNDVTFI VLSACSHACLVDQGRHLFNSMRR FDIEPRIEHYGCMVDILGRAG LE
Sbjct: 421 NNVKPNDVTFIAVLSACSHACLVDQGRHLFNSMRRGFDIEPRIEHYGCMVDILGRAGLLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFI NMP PPNAVVWRTLLASC+AHKN+EMAEKS +HIT LEPAHSGDYILLSNTYA
Sbjct: 481 EAYQFIANMPIPPNAVVWRTLLASCKAHKNVEMAEKSFDHITLLEPAHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600
           LVGRVEDA+RVRSLIK+KEIKK PGCSLIELDGVVHEFFSEDG+H HSKEIHDALD+MMK
Sbjct: 541 LVGRVEDALRVRSLIKDKEIKKTPGCSLIELDGVVHEFFSEDGDHTHSKEIHDALDEMMK 600

Query: 601 QIKRLGYVPNTDDARLEAEEE-SKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCR 660
           +IK LGYVPN +DARLEAEEE SKETSVSHHSEKLAIAYGLIRT  +TTIRISKNLRMCR
Sbjct: 601 RIKSLGYVPNMEDARLEAEEEESKETSVSHHSEKLAIAYGLIRTPLQTTIRISKNLRMCR 660

Query: 661 DCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695
           DCHNATK ISQV++R IIVRDRNRFHHFKDGLCSCNDYW
Sbjct: 661 DCHNATKVISQVYKRTIIVRDRNRFHHFKDGLCSCNDYW 699

BLAST of CsGy1G001040 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 588.2 bits (1515), Expect = 8.7e-168
Identity = 288/726 (39.67%), Postives = 449/726 (61.85%), Query Frame = 0

Query: 8   LPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLK-----TRRLLDPIITEAVLES 67
           LP+ S         +P   +L  CKT + L+ +HA ++K     T   L  +I   +L  
Sbjct: 20  LPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSP 79

Query: 68  AALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKF 127
               LP    YA+S+F  I +P    +N M RG A    P +AL L+  M    +  + +
Sbjct: 80  HFEGLP----YAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSY 139

Query: 128 TFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQM----------------- 187
           TF  VLK+C++ KA +EG+Q+H  +LK G   + +V  +LI M                 
Sbjct: 140 TFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKS 199

Query: 188 --------------YANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFR 247
                         YA+ G I  A+ +FD +P + +V+WN+M+SGY + G + E ++LF+
Sbjct: 200 PHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFK 259

Query: 248 KILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCG 307
            +++  +  D+ TM++V+ AC +  ++E+G  +  +I   G   N  +  +LID+Y+KCG
Sbjct: 260 DMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCG 319

Query: 308 QVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLY 367
           +++TA  LF+ +  +DV++W+ +I GY   +  KEAL LF EM +    PN+VTM+S+L 
Sbjct: 320 ELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILP 379

Query: 368 SCAMLGAYETGKWVHFYIKKKKMKLT--VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNV 427
           +CA LGA + G+W+H YI K+   +T   +L T LID YAKCG I+ + +VF  +  K++
Sbjct: 380 ACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSL 439

Query: 428 FTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNS 487
            +W A+I G A +G    + + FS M +  ++P+D+TF+G+LSACSH+ ++D GRH+F +
Sbjct: 440 SSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRT 499

Query: 488 MRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIE 547
           M +D+ + P++EHYGCM+D+LG +G  +EA + I+ M   P+ V+W +LL +C+ H N+E
Sbjct: 500 MTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVE 559

Query: 548 MAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELD 607
           + E   E++ ++EP + G Y+LLSN YA  GR  +  + R+L+ +K +KK+PGCS IE+D
Sbjct: 560 LGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEID 619

Query: 608 GVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSE 667
            VVHEF   D  H  ++EI+  L++M   +++ G+VP+T +   E EEE KE ++ HHSE
Sbjct: 620 SVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSE 679

Query: 668 KLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLC 696
           KLAIA+GLI T P T + I KNLR+CR+CH ATK IS++++R II RDR RFHHF+DG+C
Sbjct: 680 KLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVC 739

BLAST of CsGy1G001040 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 567.8 bits (1462), Expect = 1.2e-161
Identity = 280/703 (39.83%), Postives = 432/703 (61.45%), Query Frame = 0

Query: 27  ILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPES 86
           ++++C + + L+Q H H+++T    DP     +   AAL    +++YA  +F+ I KP S
Sbjct: 36  LIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNS 95

Query: 87  SAYNVMIRGLAFKRSPDNALLLFKKM-HEKSVQHDKFTFSSVLKACSRMKALREGEQVHA 146
            A+N +IR  A    P  ++  F  M  E     +K+TF  ++KA + + +L  G+ +H 
Sbjct: 96  FAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHG 155

Query: 147 LILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDE 206
           + +KS   S+ FV N+LI  Y +CG +  A  VF  + E+ +V+WNSM++G+ + G  D+
Sbjct: 156 MAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDK 215

Query: 207 VVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLID 266
            ++LF+K+    ++   VTM+ VL AC ++ NLE G  +  YI    +  N TL  +++D
Sbjct: 216 ALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLD 275

Query: 267 MYAKCGQVDTARKLFDEMD-------------------------------KRDVVAWSAM 326
           MY KCG ++ A++LFD M+                               ++D+VAW+A+
Sbjct: 276 MYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNAL 335

Query: 327 ISGYAQADRCKEALNLFHEMQ-KGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKK 386
           IS Y Q  +  EAL +FHE+Q + N+  N++T+VS L +CA +GA E G+W+H YIKK  
Sbjct: 336 ISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHG 395

Query: 387 MKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS 446
           +++   + + LI  Y+KCG +++S EVF  +  ++VF W+A+I GLA +G G  A++ F 
Sbjct: 396 IRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFY 455

Query: 447 SMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRA 506
            M E +VKPN VTF  V  ACSH  LVD+   LF+ M  ++ I P  +HY C+VD+LGR+
Sbjct: 456 KMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRS 515

Query: 507 GFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLS 566
           G+LE+A +FI+ MP PP+  VW  LL +C+ H N+ +AE +   +  LEP + G ++LLS
Sbjct: 516 GYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLS 575

Query: 567 NTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD 626
           N YA +G+ E+   +R  ++   +KK PGCS IE+DG++HEF S D  H  S++++  L 
Sbjct: 576 NIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLH 635

Query: 627 KMMKQIKRLGYVPNTDDA-RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNL 686
           ++M+++K  GY P      ++  EEE KE S++ HSEKLAI YGLI T     IR+ KNL
Sbjct: 636 EVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNL 695

Query: 687 RMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           R+C DCH+  K ISQ+++R IIVRDR RFHHF++G CSCND+W
Sbjct: 696 RVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of CsGy1G001040 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 541.2 bits (1393), Expect = 1.2e-153
Identity = 270/670 (40.30%), Postives = 398/670 (59.40%), Query Frame = 0

Query: 28  LQQCKTPKDLQQVHAHLLKTRRLLDP-IITEAVLESAALLLPDTIDYALSIFNHIDKPES 87
           LQ+C   ++L+Q+HA +LKT  + D   IT+ +    +    D + YA  +F+  D+P++
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 88  SAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHAL 147
             +N+MIRG +    P+ +LLL+++M   S  H+ +TF S+LKACS + A  E  Q+HA 
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQ 140

Query: 148 ILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEV 207
           I K G++++ +  N+LI  YA  G   +A  +FD +PE   V+WNS++ GY K G  D  
Sbjct: 141 ITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIA 200

Query: 208 VKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDM 267
           + LFRK+ E                                                   
Sbjct: 201 LTLFRKMAE--------------------------------------------------- 260

Query: 268 YAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTM 327
                              ++ ++W+ MISGY QAD  KEAL LFHEMQ  +V P+ V++
Sbjct: 261 -------------------KNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSL 320

Query: 328 VSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSF 387
            + L +CA LGA E GKW+H Y+ K ++++   LG  LID YAKCG ++ ++EVFK +  
Sbjct: 321 ANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKK 380

Query: 388 KNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLVDQGRHL 447
           K+V  WTALI G A +G G+ A+  F  M +  +KPN +TF  VL+ACS+  LV++G+ +
Sbjct: 381 KSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLI 440

Query: 448 FNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHK 507
           F SM RD++++P IEHYGC+VD+LGRAG L+EA +FI  MP  PNAV+W  LL +CR HK
Sbjct: 441 FYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHK 500

Query: 508 NIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLI 567
           NIE+ E+  E +  ++P H G Y+  +N +A+  + + A   R L+KE+ + K+PGCS I
Sbjct: 501 NIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI 560

Query: 568 ELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLE-AEEESKETSVS 627
            L+G  HEF + D  H   ++I      M ++++  GYVP  ++  L+  +++ +E  V 
Sbjct: 561 SLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVH 620

Query: 628 HHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFK 687
            HSEKLAI YGLI+T P T IRI KNLR+C+DCH  TK IS++++R I++RDR RFHHF+
Sbjct: 621 QHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFR 620

Query: 688 DGLCSCNDYW 696
           DG CSC DYW
Sbjct: 681 DGKCSCGDYW 620

BLAST of CsGy1G001040 vs. TAIR 10
Match: AT4G14820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 533.1 bits (1372), Expect = 3.3e-151
Identity = 268/708 (37.85%), Postives = 423/708 (59.75%), Query Frame = 0

Query: 28  LQQCKTPKDLQQVHAHLLKT--RRLLDPIITEAVLESAALLLPDTIDYALSIFNHI-DKP 87
           L  CK+   ++Q+HAH+L+T     L+  +    + S+++     + YAL++F+ I   P
Sbjct: 19  LSFCKSLNHIKQLHAHILRTVINHKLNSFLFNLSVSSSSI----NLSYALNVFSSIPSPP 78

Query: 88  ESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVH 147
           ES  +N  +R L+    P   +L ++++     + D+F+F  +LKA S++ AL EG ++H
Sbjct: 79  ESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGMELH 138

Query: 148 ALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWD 207
            +  K     + FVE   + MYA+CG+I  AR+VFD M  R +V WN+M+  Y + GL D
Sbjct: 139 GVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFGLVD 198

Query: 208 EVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLI 267
           E  KLF ++ +  +  D++ + +++ ACGR  N+     I E+++   +R +  L T+L+
Sbjct: 199 EAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLTALV 258

Query: 268 DMYA-------------------------------KCGQVDTARKLFDEMDKRDVVAWSA 327
            MYA                               KCG++D A+ +FD+ +K+D+V W+ 
Sbjct: 259 TMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVCWTT 318

Query: 328 MISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKK 387
           MIS Y ++D  +EAL +F EM    + P+ V+M SV+ +CA LG  +  KWVH  I    
Sbjct: 319 MISAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWVHSCIHVNG 378

Query: 388 MKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS 447
           ++  +++   LI+ YAKCG +D + +VF++M  +NV +W+++I  L+ +GE   AL  F+
Sbjct: 379 LESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDALSLFA 438

Query: 448 SMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRA 507
            M + +V+PN+VTF+GVL  CSH+ LV++G+ +F SM  +++I P++EHYGCMVD+ GRA
Sbjct: 439 RMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDLFGRA 498

Query: 508 GFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLS 567
             L EA + I++MP   N V+W +L+++CR H  +E+ + + + I  LEP H G  +L+S
Sbjct: 499 NLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGALVLMS 558

Query: 568 NTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD 627
           N YA   R ED   +R +++EK + K  G S I+ +G  HEF   D  HK S EI+  LD
Sbjct: 559 NIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIYAKLD 618

Query: 628 KMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRT------TIR 687
           +++ ++K  GYVP+     ++ EEE K+  V  HSEKLA+ +GL+             IR
Sbjct: 619 EVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEEEKDSCGVIR 678

Query: 688 ISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           I KNLR+C DCH   K +S+V+ER IIVRDR RFH +K+GLCSC DYW
Sbjct: 679 IVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of CsGy1G001040 vs. TAIR 10
Match: AT3G08820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 526.6 bits (1355), Expect = 3.1e-149
Identity = 264/685 (38.54%), Postives = 409/685 (59.71%), Query Frame = 0

Query: 11  ISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDT 70
           +++ S T   +  K+LI   C T   L+Q+H  L+      D  +   +L+         
Sbjct: 4   VTVPSATSKVQQIKTLISVAC-TVNHLKQIHVSLINHHLHHDTFLVNLLLKRTLFFRQTK 63

Query: 71  IDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKA 130
             Y L  F+H   P    YN +I G          L LF  + +  +    FTF  VLKA
Sbjct: 64  YSYLL--FSHTQFPNIFLYNSLINGFVNNHLFHETLDLFLSIRKHGLYLHGFTFPLVLKA 123

Query: 131 CSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAW 190
           C+R  + + G  +H+L++K GF  +     +L+ +Y+  G++  A  +FD +P+RS+V W
Sbjct: 124 CTRASSRKLGIDLHSLVVKCGFNHDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSVVTW 183

Query: 191 NSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVS 250
            ++ SGYT +G   E + LF+K++E+ ++ D   ++ VL AC  + +L+ GE I +Y+  
Sbjct: 184 TALFSGYTTSGRHREAIDLFKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKYMEE 243

Query: 251 KGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNL 310
             +++N+ + T+L+++YAKCG+++ AR +FD M ++D+V WS MI GYA     KE + L
Sbjct: 244 MEMQKNSFVRTTLVNLYAKCGKMEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEGIEL 303

Query: 311 FHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAK 370
           F +M + N+ P++ ++V  L SCA LGA + G+W    I + +    + +   LID YAK
Sbjct: 304 FLQMLQENLKPDQFSIVGFLSSCASLGALDLGEWGISLIDRHEFLTNLFMANALIDMYAK 363

Query: 371 CGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGV 430
           CG + R  EVFKEM  K++    A I GLA NG  K++   F    +  + P+  TF+G+
Sbjct: 364 CGAMARGFEVFKEMKEKDIVIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTFLGL 423

Query: 431 LSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPP 490
           L  C HA L+  G   FN++   + ++  +EHYGCMVD+ GRAG L++AY+ I +MP  P
Sbjct: 424 LCGCVHAGLIQDGLRFFNAISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMPMRP 483

Query: 491 NAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRS 550
           NA+VW  LL+ CR  K+ ++AE  L+ +  LEP ++G+Y+ LSN Y++ GR ++A  VR 
Sbjct: 484 NAIVWGALLSGCRLVKDTQLAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAEVRD 543

Query: 551 LIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDD 610
           ++ +K +KKIPG S IEL+G VHEF ++D  H  S +I+  L+ +  +++ +G+VP T+ 
Sbjct: 544 MMNKKGMKKIPGYSWIELEGKVHEFLADDKSHPLSDKIYAKLEDLGNEMRLMGFVPTTEF 603

Query: 611 ARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFE 670
              + EEE KE  + +HSEKLA+A GLI T     IR+ KNLR+C DCH   K IS++  
Sbjct: 604 VFFDVEEEEKERVLGYHSEKLAVALGLISTDHGQVIRVVKNLRVCGDCHEVMKLISKITR 663

Query: 671 RMIIVRDRNRFHHFKDGLCSCNDYW 696
           R I+VRD NRFH F +G CSCNDYW
Sbjct: 664 REIVVRDNNRFHCFTNGSCSCNDYW 685

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LN011.2e-16639.67Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
O823801.7e-16039.83Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q9FJY71.7e-15240.30Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
O233374.7e-15037.85Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX... [more]
Q9SR824.4e-14838.54Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
XP_004138266.10.0100.00pentatricopeptide repeat-containing protein At1g08070, chloroplastic [Cucumis sa... [more]
XP_016903201.10.094.99PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-... [more]
KAA0057818.10.094.84pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
KAG7023094.10.088.84Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
XP_022921781.10.088.70pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucur... [more]
Match NameE-valueIdentityDescription
A0A0A0LRD60.0100.00DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G0035... [more]
A0A5D3BFH80.094.99Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S4E4P20.094.99pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cuc... [more]
A0A5A7URN20.094.84Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1E2C20.088.70pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like OS=Cuc... [more]
Match NameE-valueIdentityDescription
AT1G08070.18.7e-16839.67Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G29760.11.2e-16139.83Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G66520.11.2e-15340.30Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G14820.13.3e-15137.85Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G08820.13.1e-14938.54Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 85..132
e-value: 1.6E-9
score: 37.8
coord: 387..435
e-value: 1.9E-11
score: 44.0
coord: 286..333
e-value: 1.1E-11
score: 44.8
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 261..289
e-value: 6.5E-6
score: 24.0
coord: 390..423
e-value: 3.1E-7
score: 28.1
coord: 188..221
e-value: 2.1E-6
score: 25.6
coord: 88..118
e-value: 0.002
score: 16.1
coord: 289..323
e-value: 1.1E-7
score: 29.6
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 462..486
e-value: 0.096
score: 13.0
coord: 160..186
e-value: 0.2
score: 12.0
coord: 188..217
e-value: 9.3E-8
score: 31.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 388..422
score: 11.695765
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 120..154
score: 9.152743
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 287..321
score: 12.199985
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 186..220
score: 11.027125
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 256..286
score: 9.350046
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 85..119
score: 9.086975
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 243..341
e-value: 8.3E-26
score: 92.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 99..242
e-value: 5.6E-34
score: 119.9
coord: 363..527
e-value: 4.3E-38
score: 133.4
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 267..547
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 561..685
e-value: 2.8E-39
score: 133.9
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 24..667
NoneNo IPR availablePANTHERPTHR47928:SF21PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN CHLOROPLASTICcoord: 24..667

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy1G001040.2CsGy1G001040.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding