CsGy1G023300 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy1G023300
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Pentatricopeptide repeat-containing protein
Location: Gy14Chr1: 22152485 .. 22155186 (+)
RNA-Seq Expression: CsGy1G023300
Synteny: CsGy1G023300
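
The Location field packs chromosome, start, end and strand into one string. A minimal sketch of how it could be unpacked, assuming the coordinates are 1-based and inclusive (the page does not say so explicitly); `parse_location` is a hypothetical helper name, not part of any database API:

```python
import re

# Location string copied from the Overview above.
location = "Gy14Chr1: 22152485 .. 22155186 (+)"


def parse_location(text):
    """Split 'chrom: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


chrom, start, end, strand = parse_location(location)

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
length = end - start + 1
print(chrom, start, end, strand, length)
```

Under that assumption the feature spans 2,702 bp on the forward strand.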
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGAGATTATTATTATTATTATTACGAATTCAAAGAAATCACAAATCACTCACCTACTCTTCGATAGAATTTCAAGAACCTTCGATCGTTCATCTTCCATTCGCCAGATGGAGACATCACGGGAGTTGATTATTAGAAATTGTGAAGTGAAACCACTTTAAATAGAATTAGAAGAAAAGACAAGTTTTGTTGGAATAGCGAGGAAGAGAAATCGAGGAAGAGAGTTCCTCAAAAGGCGAGTCAATGACGGAGAAACGAAACTTATTAGGAGGCCATTTGTAAGGGGCTATGGAAGCTCTAAGTGTTCCATCGATTTCTCTCCAGAATTTCTCAACCCTAAACAACAATCTTCTTTTCAGAAACCATCAAATTCTCTCTACAATAGATAAATGTTCAAGTTCAAAGCAATTGAAGGAAGTTCACGCTCGCATGCTCCGTACCGGTCTCTTCTTCGACCCCTTTTCGGCTAGCAAACTCTTCACAGCCTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCAACTTGTTCGACCAAATTCCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCTTACGCTTCCAGCTCCGACCCTTTTCAGAGTTTCGTAATATTTCTGGATTTGCTTGATAAATGTGAGGATTTGCCTAATAAGTTCACTTTCCCGTTTGTTATTAAGGCCGCTTCGGAGCTTAAAGCTTCACGGGTTGGGACAGCTGTTCATGGAATGGCGATTAAGTTGTCGTTTGGTATGGATCTTTATATCCTTAATTCTCTTGTGCGATTCTATGGGGCATGTGGCGATTTGAGTATGGCTGAGCGATTGTTTAAGGGTATTTCTTGCAAAGATGTAGTGTCTTGGAATTCCATGATTTCGGCTTTTGCTCAGGGGAACTGTCCAGAAGATGCATTGGAGTTGTTTTTGAAAATGGAGAGGGAAAATGTGATGCCTAACTCTGTAACAATGGTGGGTGTTTTATCTGCTTGTGCAAAGAAGTTGGATTTGGAATTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAAGGGATCAAAGTGGATTTAACTTTATGTAACGCCATGCTTGACATGTATACAAAGTGTGGAAGTGTTGATGATGCACAGAAGCTGTTTGACGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCATCATGCTTGATGGGTATGCGAAAATGGGCGACTACGATGCTGCTCGGCTAGTGTTCAATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAATGTTCTCATATCTGCTTATGAACAAAATGGTAAACCTAAGGAAGCTTTGGCCATTTTTAATGAGTTGCAGCTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGAGCAATTGATTTGGGTGGATGGATTCATGTGTACATAAAAAGGGAAGGGATAGTTCTAAATTGCCATTTAATTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTTCTTTAGAGAAAGCTCTCGAGGTGTTCTATTCAGTGGAGGAGAGAGATGTGTATGTTTGGAGTGCCATGATTGCTGGTTTGGGAATGCACGGCCGTGGGAAGGCGGCAATTGATCTATTCTTCGAAATGCAGGAAGCTAAGGTGAAGCCAAATAGTGTGACGTTTACAAATGTATTATGTGCCTGTAGCCATGCTGGCTTAGTTGATGAGGGACGGGTGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTTCCTGAGATGAAGCACTATGCTTGTATGGTTGATATTCTCGGTCGTGCAGGGTTTCTTGAAGAAGCTATGGAGTTGATCAATGAAATGTCTACAACTCCAAGCGCATCCGTTTGGGGTGCTTTGCTTGGTGCTTGCAGCCTTCATATGAATGTTGAGCTTGGAGAATTAGCGAGTGACCAATTGCTAAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCAAAC
ATATATGCTAAAACAGGAAGATGGGAAAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACACTGAATTGAAAAAGGAACCTGGTTGTAGCTCCATTGAAGCCAACGGCAACGTCCACGAGTTTCTAGTGGGAGATAATACGCACCCGTTATCCAGTAACATCTATTCAAAGTTGGAGGAAATTGCAACAAAACTAAAATCAGTCGGTTACGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGAGGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGAGAAGTTAGCCATCGCTTTTGGGCTTGTTACTTTGGCTCCATCTCAACCAATTCGAGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGCATTTGCTAAGCTTGTATCTAGAGTTTACGACAGAGATATATTACTTCGAGATCGATATCGATTCCATCATTTCCGAGATGGGCATTGTTCGTGTATGGATTACTGGTAAAGCTGCAAAATAATGGATTCTACGTTCACTTGCTCTGTTTGGTGCATGAGTTGAACTTCACAACAATAGCCACAACCTTTATGGACTTTTGTAACTATATCAATTTGAAATTTGGGAAAGCCTCGTGTAAATTTTGATTTAAATTTTCAGTTAGAATTTCCTTTAGAAAATTGTCTATGCATCTACTCTTATAGTCCATTTTGTAATACCGAC

mRNA sequence

TGAGATTATTATTATTATTATTACGAATTCAAAGAAATCACAAATCACTCACCTACTCTTCGATAGAATTTCAAGAACCTTCGATCGTTCATCTTCCATTCGCCAGATGGAGACATCACGGGAGTTGATTATTAGAAATTGTGAAGTGAAACCACTTTAAATAGAATTAGAAGAAAAGACAAGTTTTGTTGGAATAGCGAGGAAGAGAAATCGAGGAAGAGAGTTCCTCAAAAGGCGAGTCAATGACGGAGAAACGAAACTTATTAGGAGGCCATTTGTAAGGGGCTATGGAAGCTCTAAGTGTTCCATCGATTTCTCTCCAGAATTTCTCAACCCTAAACAACAATCTTCTTTTCAGAAACCATCAAATTCTCTCTACAATAGATAAATGTTCAAGTTCAAAGCAATTGAAGGAAGTTCACGCTCGCATGCTCCGTACCGGTCTCTTCTTCGACCCCTTTTCGGCTAGCAAACTCTTCACAGCCTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCAACTTGTTCGACCAAATTCCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCTTACGCTTCCAGCTCCGACCCTTTTCAGAGTTTCGTAATATTTCTGGATTTGCTTGATAAATGTGAGGATTTGCCTAATAAGTTCACTTTCCCGTTTGTTATTAAGGCCGCTTCGGAGCTTAAAGCTTCACGGGTTGGGACAGCTGTTCATGGAATGGCGATTAAGTTGTCGTTTGGTATGGATCTTTATATCCTTAATTCTCTTGTGCGATTCTATGGGGCATGTGGCGATTTGAGTATGGCTGAGCGATTGTTTAAGGGTATTTCTTGCAAAGATGTAGTGTCTTGGAATTCCATGATTTCGGCTTTTGCTCAGGGGAACTGTCCAGAAGATGCATTGGAGTTGTTTTTGAAAATGGAGAGGGAAAATGTGATGCCTAACTCTGTAACAATGGTGGGTGTTTTATCTGCTTGTGCAAAGAAGTTGGATTTGGAATTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAAGGGATCAAAGTGGATTTAACTTTATGTAACGCCATGCTTGACATGTATACAAAGTGTGGAAGTGTTGATGATGCACAGAAGCTGTTTGACGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCATCATGCTTGATGGGTATGCGAAAATGGGCGACTACGATGCTGCTCGGCTAGTGTTCAATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAATGTTCTCATATCTGCTTATGAACAAAATGGTAAACCTAAGGAAGCTTTGGCCATTTTTAATGAGTTGCAGCTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGAGCAATTGATTTGGGTGGATGGATTCATGTGTACATAAAAAGGGAAGGGATAGTTCTAAATTGCCATTTAATTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTTCTTTAGAGAAAGCTCTCGAGGTGTTCTATTCAGTGGAGGAGAGAGATGTGTATGTTTGGAGTGCCATGATTGCTGGTTTGGGAATGCACGGCCGTGGGAAGGCGGCAATTGATCTATTCTTCGAAATGCAGGAAGCTAAGGTGAAGCCAAATAGTGTGACGTTTACAAATGTATTATGTGCCTGTAGCCATGCTGGCTTAGTTGATGAGGGACGGGTGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTTCCTGAGATGAAGCACTATGCTTGTATGGTTGATATTCTCGGTCGTGCAGGGTTTCTTGAAGAAGCTATGGAGTTGATCAATGAAATGTCTACAACTCCAAGCGCATCCGTTTGGGGTGCTTTGCTTGGTGCTTGCAGCCTTCATATGAATGTTGAGCTTGGAGAATTAGCGAGTGACCAATTGCTAAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCAAAC
ATATATGCTAAAACAGGAAGATGGGAAAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACACTGAATTGAAAAAGGAACCTGGTTGTAGCTCCATTGAAGCCAACGGCAACGTCCACGAGTTTCTAGTGGGAGATAATACGCACCCGTTATCCAGTAACATCTATTCAAAGTTGGAGGAAATTGCAACAAAACTAAAATCAGTCGGTTACGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGAGGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGAGAAGTTAGCCATCGCTTTTGGGCTTGTTACTTTGGCTCCATCTCAACCAATTCGAGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGCATTTGCTAAGCTTGTATCTAGAGTTTACGACAGAGATATATTACTTCGAGATCGATATCGATTCCATCATTTCCGAGATGGGCATTGTTCGTGTATGGATTACTGGTAAAGCTGCAAAATAATGGATTCTACGTTCACTTGCTCTGTTTGGTGCATGAGTTGAACTTCACAACAATAGCCACAACCTTTATGGACTTTTGTAACTATATCAATTTGAAATTTGGGAAAGCCTCGTGTAAATTTTGATTTAAATTTTCAGTTAGAATTTCCTTTAGAAAATTGTCTATGCATCTACTCTTATAGTCCATTTTGTAATACCGAC

Coding sequence (CDS)

ATGGAAGCTCTAAGTGTTCCATCGATTTCTCTCCAGAATTTCTCAACCCTAAACAACAATCTTCTTTTCAGAAACCATCAAATTCTCTCTACAATAGATAAATGTTCAAGTTCAAAGCAATTGAAGGAAGTTCACGCTCGCATGCTCCGTACCGGTCTCTTCTTCGACCCCTTTTCGGCTAGCAAACTCTTCACAGCCTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCAACTTGTTCGACCAAATTCCCCAACCAAATCTCTACACTTGGAACACCCTCATTCGAGCTTACGCTTCCAGCTCCGACCCTTTTCAGAGTTTCGTAATATTTCTGGATTTGCTTGATAAATGTGAGGATTTGCCTAATAAGTTCACTTTCCCGTTTGTTATTAAGGCCGCTTCGGAGCTTAAAGCTTCACGGGTTGGGACAGCTGTTCATGGAATGGCGATTAAGTTGTCGTTTGGTATGGATCTTTATATCCTTAATTCTCTTGTGCGATTCTATGGGGCATGTGGCGATTTGAGTATGGCTGAGCGATTGTTTAAGGGTATTTCTTGCAAAGATGTAGTGTCTTGGAATTCCATGATTTCGGCTTTTGCTCAGGGGAACTGTCCAGAAGATGCATTGGAGTTGTTTTTGAAAATGGAGAGGGAAAATGTGATGCCTAACTCTGTAACAATGGTGGGTGTTTTATCTGCTTGTGCAAAGAAGTTGGATTTGGAATTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAAGGGATCAAAGTGGATTTAACTTTATGTAACGCCATGCTTGACATGTATACAAAGTGTGGAAGTGTTGATGATGCACAGAAGCTGTTTGACGAAATGCCTGAAAGAGATGTCTTCTCTTGGACCATCATGCTTGATGGGTATGCGAAAATGGGCGACTACGATGCTGCTCGGCTAGTGTTCAATGCAATGCCTGTGAAAGAAATTGCTGCTTGGAATGTTCTCATATCTGCTTATGAACAAAATGGTAAACCTAAGGAAGCTTTGGCCATTTTTAATGAGTTGCAGCTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGAGCAATTGATTTGGGTGGATGGATTCATGTGTACATAAAAAGGGAAGGGATAGTTCTAAATTGCCATTTAATTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTTCTTTAGAGAAAGCTCTCGAGGTGTTCTATTCAGTGGAGGAGAGAGATGTGTATGTTTGGAGTGCCATGATTGCTGGTTTGGGAATGCACGGCCGTGGGAAGGCGGCAATTGATCTATTCTTCGAAATGCAGGAAGCTAAGGTGAAGCCAAATAGTGTGACGTTTACAAATGTATTATGTGCCTGTAGCCATGCTGGCTTAGTTGATGAGGGACGGGTGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTTCCTGAGATGAAGCACTATGCTTGTATGGTTGATATTCTCGGTCGTGCAGGGTTTCTTGAAGAAGCTATGGAGTTGATCAATGAAATGTCTACAACTCCAAGCGCATCCGTTTGGGGTGCTTTGCTTGGTGCTTGCAGCCTTCATATGAATGTTGAGCTTGGAGAATTAGCGAGTGACCAATTGCTAAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCAAACATATATGCTAAAACAGGAAGATGGGAAAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACACTGAATTGAAAAAGGAACCTGGTTGTAGCTCCATTGAAGCCAACGGCAACGTCCACGAGTTTCTAGTGGGAGATAATACGCACCCGTTATCCAGTAACATCTATTCAAAGTTGGAGGAAATTGCAACAAAACTAAAATCAGTCGGTTACGAACCAAACAAATCCCATCTTCTCCAGCTCATCGAAGAGGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGA
GAAGTTAGCCATCGCTTTTGGGCTTGTTACTTTGGCTCCATCTCAACCAATTCGAGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGCATTTGCTAAGCTTGTATCTAGAGTTTACGACAGAGATATATTACTTCGAGATCGATATCGATTCCATCATTTCCGAGATGGGCATTGTTCGTGTATGGATTACTGGTAA

Protein sequence

MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGHCSCMDYW*
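
The protein sequence above should be the translation of the CDS. A small sketch that checks this on the first and last few codons only, assuming the standard genetic code (NCBI translation table 1); `translate` and the variable names are illustrative, and the two short fragments are copied from the start and end of the CDS listed above:

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 27 and last 12 nucleotides of the CDS shown above.
cds_start = "ATGGAAGCTCTAAGTGTTCCATCGATT"
cds_end = "GATTACTGGTAA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

The same function can be applied to the full CDS to compare against the complete protein sequence, including the terminal `*`.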
Homology
BLAST of CsGy1G023300 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 939.1 bits (2426), Expect = 3.0e-272
Identity = 447/727 (61.49%), Positives = 564/727 (77.58%), Query Frame = 0

Query: 7   PSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTA 66
           P+ S  N  T NN       + +S I++C S +QLK+ H  M+RTG F DP+SASKLF  
Sbjct: 16  PNFSNPNQPTTNN----ERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 75

Query: 67  SALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNK 126
           +ALSSF++L+YAR +FD+IP+PN + WNTLIRAYAS  DP  S   FLD++ + +  PNK
Sbjct: 76  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 135

Query: 127 FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKG 186
           +TFPF+IKAA+E+ +  +G ++HGMA+K + G D+++ NSL+  Y +CGDL  A ++F  
Sbjct: 136 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 195

Query: 187 ISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 246
           I  KDVVSWNSMI+ F Q   P+ ALELF KME E+V  + VTMVGVLSACAK  +LEFG
Sbjct: 196 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 255

Query: 247 RWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 306
           R VCSYIE   + V+LTL NAMLDMYTKCGS++DA++LFD M E+D  +WT MLDGYA  
Sbjct: 256 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 315

Query: 307 GDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVST 366
            DY+AAR V N+MP K+I AWN LISAYEQNGKP EAL +F+ELQL K  K +++TLVST
Sbjct: 316 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 375

Query: 367 LSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDV 426
           LSACAQ+GA++LG WIH YIK+ GI +N H+ S+L+ MY+KCG LEK+ EVF SVE+RDV
Sbjct: 376 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 435

Query: 427 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHE 486
           +VWSAMI GL MHG G  A+D+F++MQEA VKPN VTFTNV CACSH GLVDE    FH+
Sbjct: 436 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 495

Query: 487 MEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVE 546
           ME  YG+VPE KHYAC+VD+LGR+G+LE+A++ I  M   PS SVWGALLGAC +H N+ 
Sbjct: 496 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 555

Query: 547 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEAN 606
           L E+A  +LL+LEPRN GA VLLSNIYAK G+WE VSELRK MR T LKKEPGCSSIE +
Sbjct: 556 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 615

Query: 607 GNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 666
           G +HEFL GDN HP+S  +Y KL E+  KLKS GYEP  S +LQ+IEE+++KEQ+L+LHS
Sbjct: 616 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 675

Query: 667 EKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGH 726
           EKLAI +GL++    + IRV+KNLR+CGDCH+ AKL+S++YDR+I++RDRYRFHHFR+G 
Sbjct: 676 EKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQ 735

Query: 727 CSCMDYW 734
           CSC D+W
Sbjct: 736 CSCNDFW 738
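
Each HSP summary line reports identities and positives as a count over the alignment length, followed by a percentage. A hedged sketch of how such a line could be parsed and the percentages recomputed; the regular expression, `parse_hsp_stats`, and the dictionary keys are assumptions made for illustration, and the pattern deliberately accepts any label starting with "Pos" so that spelling variants of "Positives" still match:

```python
import re

# Summary line for the first Swiss-Prot hit (O82380) above.
hsp_line = "Identity = 447/727 (61.49%), Positives = 564/727 (77.58%), Query Frame = 0"

# Matches e.g. "Identity = 447/727 (61.49%)"; the label is captured loosely.
STAT_PATTERN = re.compile(r"(Identity|Pos\w*) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {'identity': (matches, length, pct), 'positives': (...)}."""
    stats = {}
    for label, num, den, pct in STAT_PATTERN.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(num), int(den), float(pct))
    return stats


stats = parse_hsp_stats(hsp_line)
for key, (matches, aligned, reported) in stats.items():
    # Recompute the percentage from the raw counts as a consistency check.
    recomputed = round(100 * matches / aligned, 2)
    print(key, matches, aligned, reported, recomputed)
```

The denominator is the alignment length including gaps, which is why it can differ from the 733-residue query length.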

BLAST of CsGy1G023300 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 6.0e-172
Identity = 314/744 (42.20%), Positives = 449/744 (60.35%), Query Frame = 0

Query: 24  RNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASALS-SFSTLDYARNLF 83
           RNH  LS +  C + + L+ +HA+M++ GL    ++ SKL     LS  F  L YA ++F
Sbjct: 32  RNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVF 91

Query: 84  DQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNKFTFPFVIKAASELKAS 143
             I +PNL  WNT+ R +A SSDP  +  +++ ++     LPN +TFPFV+K+ ++ KA 
Sbjct: 92  KTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMI-SLGLLPNSYTFPFVLKSCAKSKAF 151

Query: 144 RVGTAVHGMAIKLSFGMDLYILNS-------------------------------LVRFY 203
           + G  +HG  +KL   +DLY+  S                               L++ Y
Sbjct: 152 KEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGY 211

Query: 204 GACGDLSMAERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV 263
            + G +  A++LF  I  KDVVSWN+MIS +A+    ++ALELF  M + NV P+  TMV
Sbjct: 212 ASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMV 271

Query: 264 GVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPER 323
            V+SACA+   +E GR V  +I+  G   +L + NA++D+Y+KCG ++ A  LF+ +P +
Sbjct: 272 TVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYK 331

Query: 324 DVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQ 383
           DV SW  ++ GY  M  Y                               KEAL +F E+ 
Sbjct: 332 DVISWNTLIGGYTHMNLY-------------------------------KEALLLFQEM- 391

Query: 384 LSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKR--EGIVLNCHLISSLVDMYAKCG 443
           L     P++VT++S L ACA LGAID+G WIHVYI +  +G+     L +SL+DMYAKCG
Sbjct: 392 LRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCG 451

Query: 444 SLEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLC 503
            +E A +VF S+  + +  W+AMI G  MHGR  A+ DLF  M++  ++P+ +TF  +L 
Sbjct: 452 DIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLS 511

Query: 504 ACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSA 563
           ACSH+G++D GR  F  M   Y + P+++HY CM+D+LG +G  +EA E+IN M   P  
Sbjct: 512 ACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDG 571

Query: 564 SVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLM 623
            +W +LL AC +H NVELGE  ++ L+K+EP N G+ VLLSNIYA  GRW +V++ R L+
Sbjct: 572 VIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALL 631

Query: 624 RDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLL 683
            D  +KK PGCSSIE +  VHEF++GD  HP +  IY  LEE+   L+  G+ P+ S +L
Sbjct: 632 NDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVL 691

Query: 684 QLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDR 734
           Q +EE + KE AL  HSEKLAIAFGL++  P   + +VKNLR+C +CH   KL+S++Y R
Sbjct: 692 QEMEE-EWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKR 741

BLAST of CsGy1G023300 vs. ExPASy Swiss-Prot
Match: O23337 (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 558.9 bits (1439), Expect = 8.4e-158
Identity = 279/713 (39.13%), Positives = 437/713 (61.29%), Query Frame = 0

Query: 28  ILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASALSSFSTLDYARNLFDQIPQ 87
           IL  +  C S   +K++HA +LRT    +    S LF  S  SS   L YA N+F  IP 
Sbjct: 15  ILEKLSFCKSLNHIKQLHAHILRT--VINHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74

Query: 88  -PNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNKFTFPFVIKAASELKASRVGT 147
            P    +N  +R  + SS+P ++ ++F   +       ++F+F  ++KA S++ A   G 
Sbjct: 75  PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134

Query: 148 AVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKGISCKDVVSWNSMISAFAQGN 207
            +HG+A K++   D ++    +  Y +CG ++ A  +F  +S +DVV+WN+MI  + +  
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194

Query: 208 CPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCN 267
             ++A +LF +M+  NVMP+ + +  ++SAC +  ++ + R +  ++    +++D  L  
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254

Query: 268 AMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAA 327
           A++ MY   G +D A++ F +M  R++F  T M+ GY+K G  D A+++F+    K++  
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314

Query: 328 WNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 387
           W  +ISAY ++  P+EAL +F E+  S I KPD V++ S +SACA LG +D   W+H  I
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374

Query: 388 KREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAI 447
              G+     + ++L++MYAKCG L+   +VF  +  R+V  WS+MI  L MHG    A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434

Query: 448 DLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDI 507
            LF  M++  V+PN VTF  VL  CSH+GLV+EG+  F  M   Y + P+++HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494

Query: 508 LGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAI 567
            GRA  L EA+E+I  M    +  +WG+L+ AC +H  +ELG+ A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554

Query: 568 VLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIY 627
           VL+SNIYA+  RWE V  +R++M +  + KE G S I+ NG  HEFL+GD  H  S+ IY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614

Query: 628 SKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQP--- 687
           +KL+E+ +KLK  GY P+   +L  +EE++ K+  L  HSEKLA+ FGL+     +    
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674

Query: 688 ---IRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGHCSCMDYW 734
              IR+VKNLR+C DCH F KLVS+VY+R+I++RDR RFH +++G CSC DYW
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of CsGy1G023300 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 541.6 bits (1394), Expect = 1.4e-152
Identity = 284/727 (39.06%), Positives = 433/727 (59.56%), Query Frame = 0

Query: 15  STLNNNLLFRNHQI------LSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASA 74
           S L + LL+ N  I       S ID  +   QLK++HAR+L  GL F  F  +KL  AS 
Sbjct: 5   SCLASPLLYTNSGIHSDSFYASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHAS- 64

Query: 75  LSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNKFT 134
            SSF  + +AR +FD +P+P ++ WN +IR Y S ++ FQ  ++    +      P+ FT
Sbjct: 65  -SSFGDITFARQVFDDLPRPQIFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFT 124

Query: 135 FPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKGIS 194
           FP ++KA S L   ++G  VH    +L F  D+++ N L+  Y  C  L  A  +F+G+ 
Sbjct: 125 FPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLP 184

Query: 195 C--KDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 254
              + +VSW +++SA+AQ   P +ALE+F +M + +V P+ V +V VL+A     DL+ G
Sbjct: 185 LPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQG 244

Query: 255 RWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 314
           R + + + + G++++  L  ++  MY KCG V  A+ LFD+M   ++  W  M+ GYAK 
Sbjct: 245 RSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK- 304

Query: 315 GDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVST 374
                                         NG  +EA+ +F+E+ ++K  +PD +++ S 
Sbjct: 305 ------------------------------NGYAREAIDMFHEM-INKDVRPDTISITSA 364

Query: 375 LSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDV 434
           +SACAQ+G+++    ++ Y+ R     +  + S+L+DM+AKCGS+E A  VF    +RDV
Sbjct: 365 ISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDV 424

Query: 435 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHE 494
            VWSAMI G G+HGR + AI L+  M+   V PN VTF  +L AC+H+G+V EG  FF+ 
Sbjct: 425 VVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNR 484

Query: 495 MEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVE 554
           M   + + P+ +HYAC++D+LGRAG L++A E+I  M   P  +VWGALL AC  H +VE
Sbjct: 485 MAD-HKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVE 544

Query: 555 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEAN 614
           LGE A+ QL  ++P N G  V LSN+YA    W++V+E+R  M++  L K+ GCS +E  
Sbjct: 545 LGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVR 604

Query: 615 GNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 674
           G +  F VGD +HP    I  ++E I ++LK  G+  NK   L  + +++  E+ L  HS
Sbjct: 605 GRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHS 664

Query: 675 EKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGH 734
           E++AIA+GL++     P+R+ KNLR C +CHA  KL+S++ DR+I++RD  RFHHF+DG 
Sbjct: 665 ERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGV 694

BLAST of CsGy1G023300 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 523.1 bits (1346), Expect = 5.1e-147
Identity = 273/707 (38.61%), Positives = 414/707 (58.56%), Query Frame = 0

Query: 32  IDKCSSSKQLKEVHARMLRTGLFFDPFSASKL--FTASALSSFSTLDYARNLFDQIPQPN 91
           I+ C + + L ++HA  +++G   D  +A+++  F A++      LDYA  +F+Q+PQ N
Sbjct: 30  INNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRN 89

Query: 92  LYTWNTLIRAYASSSD--PFQSFVIFLDLLDKCEDLPNKFTFPFVIKAASELKASRVGTA 151
            ++WNT+IR ++ S +     +  +F +++      PN+FTFP V+KA ++    + G  
Sbjct: 90  CFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQ 149

Query: 152 VHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLF-KGISCKDVVSWNSMISAFAQGN 211
           +HG+A+K  FG D +++++LVR Y  CG +  A  LF K I  KD+V             
Sbjct: 150 IHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMV------------- 209

Query: 212 CPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCN 271
                                                                       
Sbjct: 210 ------------------------------------------------------------ 269

Query: 272 AMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAA 331
            M D   + G               ++  W +M+DGY ++GD  AAR++F+ M  + + +
Sbjct: 270 VMTDRRKRDG---------------EIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVS 329

Query: 332 WNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 391
           WN +IS Y  NG  K+A+ +F E++   I +P+ VTLVS L A ++LG+++LG W+H+Y 
Sbjct: 330 WNTMISGYSLNGFFKDAVEVFREMKKGDI-RPNYVTLVSVLPAISRLGSLELGEWLHLYA 389

Query: 392 KREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAI 451
           +  GI ++  L S+L+DMY+KCG +EKA+ VF  +   +V  WSAMI G  +HG+   AI
Sbjct: 390 EDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAI 449

Query: 452 DLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDI 511
           D F +M++A V+P+ V + N+L ACSH GLV+EGR +F +M  V G+ P ++HY CMVD+
Sbjct: 450 DCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDL 509

Query: 512 LGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAI 571
           LGR+G L+EA E I  M   P   +W ALLGAC +  NVE+G+  ++ L+ + P + GA 
Sbjct: 510 LGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAY 569

Query: 572 VLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIY 631
           V LSN+YA  G W +VSE+R  M++ +++K+PGCS I+ +G +HEF+V D++HP +  I 
Sbjct: 570 VALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEIN 629

Query: 632 SKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQPIRV 691
           S L EI+ KL+  GY P  + +L  +EE+D KE  L  HSEK+A AFGL++ +P +PIR+
Sbjct: 630 SMLVEISDKLRLAGYRPITTQVLLNLEEED-KENVLHYHSEKIATAFGLISTSPGKPIRI 646

Query: 692 VKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGHCSCMDYW 734
           VKNLRIC DCH+  KL+S+VY R I +RDR RFHHF+DG CSCMDYW
Sbjct: 690 VKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of CsGy1G023300 vs. NCBI nr
Match: XP_004145320.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis sativus] >KGN65801.1 hypothetical protein Csa_023315 [Cucumis sativus])

HSP 1 Score: 1477 bits (3824), Expect = 0.0
Identity = 733/733 (100.00%), Positives = 733/733 (100.00%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 733
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CsGy1G023300 vs. NCBI nr
Match: XP_008457379.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis melo] >TYJ97320.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1431 bits (3704), Expect = 0.0
Identity = 711/733 (97.00%), Positives = 717/733 (97.82%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNFSTLNNNL FRNHQILS IDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKLFTASALSSFSTLDYARN+FDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPN FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIK+DLTL NAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAAR VFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPE KHYACMVDILGRAGFLEEAMELINEMS TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE NGNVHEFLVGDN HPLSSNIYSKL++IATKLK VGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLV+LAPSQPIRVVKNLRICGDCH FAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 733
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CsGy1G023300 vs. NCBI nr
Match: KAA0031814.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1429 bits (3699), Expect = 0.0
Identity = 710/733 (96.86%), Positives = 716/733 (97.68%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNFSTLNNNL FRNHQILS IDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKLFTASALSSFSTLDYARN+FDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPN FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIK+DLTL NAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAAR VFNAMPVKEIAAWNVLISAYEQNGKPKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPE KHYACMVDILGRAGFLEEAMELINEMS TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE NGNVHEFLVGDN HPLSSNIYSKL++IATKLK VGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLV+LAPSQPIRVVKNLRICGDCH FAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 733
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733
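
The Identity and Positives figures in an HSP header can be reproduced from the middle line of the alignment. By the usual BLAST convention, a letter on that line marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal Python sketch under that assumption; the function names `midline_stats` and `percent` are hypothetical and are not part of any BLAST tool.

```python
def midline_stats(midline: str) -> tuple[int, int]:
    """Return (identities, positives) counted from one BLAST midline row.

    Assumes the usual convention: letters are identical residues,
    '+' is a conservative substitution, spaces are mismatches or gaps.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent(count: int, align_len: int) -> float:
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100 * count / align_len, 2)


# Midline for query positions 661-720 of the KAA0031814.1 hit (60 columns).
row = "ALSLHSEKLAIAFGLV+LAPSQPIRVVKNLRICGDCH FAKLVSRVYDRDILLRDRYRFH"
identities, positives = midline_stats(row)

# Header totals for the same hit: 710 identities and 716 positives over 733 columns.
identity_pct = percent(710, 733)
positive_pct = percent(716, 733)
```

Summing the per-row counts over every row of an HSP gives the numerators in its header; the denominator is the alignment length, which includes gap columns.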

BLAST of CsGy1G023300 vs. NCBI nr
Match: XP_038893523.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Benincasa hispida])

HSP 1 Score: 1372 bits (3551), Expect = 0.0
Identity = 676/733 (92.22%), Positives = 699/733 (95.36%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNF T N+NL FRNHQILSTID+CSS KQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFPTPNDNLPFRNHQILSTIDQCSSPKQLKQVHAHMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKLFTASALSSFSTLDYA N+FDQI  PNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYALNVFDQISHPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           +DLPN FTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYG CGDL+MA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGTCGDLNMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLF+GISCKDVVSWNSMISAFAQGNCPEDAL+LFLKMERENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALDLFLKMERENVMPNSVTMVGVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERK IKVDLTLCNAMLDMYTKCGS+DDAQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKEIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGD+DAAR VF+AMPVKEIAAWNVLISAYEQNG PKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDFDAARRVFDAMPVKEIAAWNVLISAYEQNGNPKEALATFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSAC+QLGAIDLGGWIHVYIKREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACSQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VE RDVYVWSAMIAGLGMHGRGKAAI+LFFEMQEAKVKPNSVTF NVLCACSHAGLVDEG
Sbjct: 421 VEVRDVYVWSAMIAGLGMHGRGKAAINLFFEMQEAKVKPNSVTFMNVLCACSHAGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           R F HEMEP+YGVVP  KHYACMVDILGRAGFLEEAMELINEM  TPSAS+WGALLGACS
Sbjct: 481 RAFLHEMEPIYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASIWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRD++LKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDSKLKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE +GNVHEFLVGDN+HPLSS IY KL+EIATKLKSVGYEPNKSHLLQ IEEDDLKEQ
Sbjct: 601 SSIEVDGNVHEFLVGDNSHPLSSKIYLKLDEIATKLKSVGYEPNKSHLLQFIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGL++LAPSQPIRVVKNLRICGDCH  AKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 733
           HFRDGHCSC DYW
Sbjct: 721 HFRDGHCSCRDYW 733

BLAST of CsGy1G023300 vs. NCBI nr
Match: XP_022964665.1 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1321 bits (3420), Expect = 0.0
Identity = 645/733 (87.99%), Positives = 688/733 (93.86%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           ME LS P +SL N S  +NNL FRNHQILSTID+CSS KQLK+VHA+MLRTGLFFDPFSA
Sbjct: 1   METLSAPLVSLPNRSIADNNLHFRNHQILSTIDQCSSGKQLKQVHAQMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKL  ASAL S STL+YAR++FDQIP PNLYTWNTLIRAYASS+DPFQSFVIFL LLD+C
Sbjct: 61  SKLIAASALKSSSTLEYARDVFDQIPHPNLYTWNTLIRAYASSADPFQSFVIFLALLDEC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           +DLPN FTFPFVIKAASELKASRVG AVHGMAIKLS GMD YILNSLVRFYGACGDL+MA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSLGMDQYILNSLVRFYGACGDLNMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLF+GISCKDVVSWNSMISAFAQGNCPEDALELFLKME  NVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALELFLKMEGANVMPNSVTMVGVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERK I VDLTLCNAMLDMYTKCGS+ DA+KLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKEISVDLTLCNAMLDMYTKCGSIGDAEKLFDEMPERDVFSWTTML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGD++AAR VF+ MPVKEIAAWN LISAYE+NGKPKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDFNAARKVFDEMPVKEIAAWNALISAYERNGKPKEALATFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVS+LSACAQLGAIDLGGWIHVYIKREGI LN HLI+SL+DMYAKCG+LEKALEVFY+
Sbjct: 361 VTLVSSLSACAQLGAIDLGGWIHVYIKREGINLNGHLITSLIDMYAKCGALEKALEVFYA 420

Query: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEE+DVYVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPN VTFTN+LCACSHAGLVDEG
Sbjct: 421 VEEKDVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNLLCACSHAGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           R  FHEMEPVYGVVP  KHYACMVDILGRAGFLEEAMELINEM TTPSASVWGALLGACS
Sbjct: 481 RALFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPTTPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVEL ELASDQLLKLEPRNHGAI+LLSN+YAKTGRW+KVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIILLSNVYAKTGRWDKVSELRKLMRDSELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SS+E NG VHEFLVGDN+HPLS +IYSKL+EIA KLKSVGYEPNKSHLLQLIEEDD+KE 
Sbjct: 601 SSVEVNGIVHEFLVGDNSHPLSRDIYSKLDEIAAKLKSVGYEPNKSHLLQLIEEDDVKEH 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGL++LAPSQPIRVVKNLRICGDCH  AKL+SRVY+RDIL++DRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLISRVYNRDILVQDRYRFH 720

Query: 721 HFRDGHCSCMDYW 733
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CsGy1G023300 vs. ExPASy TrEMBL
Match: A0A0A0M0R9 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G530130 PE=3 SV=1)

HSP 1 Score: 1477 bits (3824), Expect = 0.0
Identity = 733/733 (100.00%), Positives = 733/733 (100.00%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 733
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CsGy1G023300 vs. ExPASy TrEMBL
Match: A0A5D3BBW6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G001260 PE=3 SV=1)

HSP 1 Score: 1431 bits (3704), Expect = 0.0
Identity = 711/733 (97.00%), Positives = 717/733 (97.82%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNFSTLNNNL FRNHQILS IDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKLFTASALSSFSTLDYARN+FDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPN FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIK+DLTL NAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAAR VFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPE KHYACMVDILGRAGFLEEAMELINEMS TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE NGNVHEFLVGDN HPLSSNIYSKL++IATKLK VGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLV+LAPSQPIRVVKNLRICGDCH FAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 733
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CsGy1G023300 vs. ExPASy TrEMBL
Match: A0A1S3C623 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103497087 PE=3 SV=1)

HSP 1 Score: 1431 bits (3704), Expect = 0.0
Identity = 711/733 (97.00%), Positives = 717/733 (97.82%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNFSTLNNNL FRNHQILS IDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKLFTASALSSFSTLDYARN+FDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPN FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIK+DLTL NAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAAR VFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPE KHYACMVDILGRAGFLEEAMELINEMS TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE NGNVHEFLVGDN HPLSSNIYSKL++IATKLK VGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLV+LAPSQPIRVVKNLRICGDCH FAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 733
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CsGy1G023300 vs. ExPASy TrEMBL
Match: A0A5A7SKX2 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold848G00570 PE=3 SV=1)

HSP 1 Score: 1429 bits (3699), Expect = 0.0
Identity = 710/733 (96.86%), Positives = 716/733 (97.68%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           MEALSVP ISLQNFSTLNNNL FRNHQILS IDKCSSSKQLKEVHARMLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKLFTASALSSFSTLDYARN+FDQIPQPNLYTWN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           EDLPN FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV VLSACAKK
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERKGIK+DLTL NAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGDYDAAR VFNAMPVKEIAAWNVLISAYEQNGKPKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVYIKREGI LNCHLISSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           RVFFHEMEPVYGVVPE KHYACMVDILGRAGFLEEAMELINEMS TPSASVWGALLGACS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SSIE NGNVHEFLVGDN HPLSSNIYSKL++IATKLK VGYEPNKSHLLQLIEEDDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGLV+LAPSQPIRVVKNLRICGDCH FAKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRDGHCSCMDYW 733
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CsGy1G023300 vs. ExPASy TrEMBL
Match: A0A6J1HLG4 (pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111464676 PE=3 SV=1)

HSP 1 Score: 1321 bits (3420), Expect = 0.0
Identity = 645/733 (87.99%), Positives = 688/733 (93.86%), Query Frame = 0

Query: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60
           ME LS P +SL N S  +NNL FRNHQILSTID+CSS KQLK+VHA+MLRTGLFFDPFSA
Sbjct: 1   METLSAPLVSLPNRSIADNNLHFRNHQILSTIDQCSSGKQLKQVHAQMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKL  ASAL S STL+YAR++FDQIP PNLYTWNTLIRAYASS+DPFQSFVIFL LLD+C
Sbjct: 61  SKLIAASALKSSSTLEYARDVFDQIPHPNLYTWNTLIRAYASSADPFQSFVIFLALLDEC 120

Query: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180
           +DLPN FTFPFVIKAASELKASRVG AVHGMAIKLS GMD YILNSLVRFYGACGDL+MA
Sbjct: 121 DDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSLGMDQYILNSLVRFYGACGDLNMA 180

Query: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240
           ERLF+GISCKDVVSWNSMISAFAQGNCPEDALELFLKME  NVMPNSVTMVGVLSACAKK
Sbjct: 181 ERLFEGISCKDVVSWNSMISAFAQGNCPEDALELFLKMEGANVMPNSVTMVGVLSACAKK 240

Query: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300
           LDLEFGRWVCSYIERK I VDLTLCNAMLDMYTKCGS+ DA+KLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKEISVDLTLCNAMLDMYTKCGSIGDAEKLFDEMPERDVFSWTTML 300

Query: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360
           DGYAKMGD++AAR VF+ MPVKEIAAWN LISAYE+NGKPKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDFNAARKVFDEMPVKEIAAWNALISAYERNGKPKEALATFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420
           VTLVS+LSACAQLGAIDLGGWIHVYIKREGI LN HLI+SL+DMYAKCG+LEKALEVFY+
Sbjct: 361 VTLVSSLSACAQLGAIDLGGWIHVYIKREGINLNGHLITSLIDMYAKCGALEKALEVFYA 420

Query: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEE+DVYVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPN VTFTN+LCACSHAGLVDEG
Sbjct: 421 VEEKDVYVWSAMIAGLGMHGRGKAAIELFFKMQEAKVKPNDVTFTNLLCACSHAGLVDEG 480

Query: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540
           R  FHEMEPVYGVVP  KHYACMVDILGRAGFLEEAMELINEM TTPSASVWGALLGACS
Sbjct: 481 RALFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPTTPSASVWGALLGACS 540

Query: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600
           LHMNVEL ELASDQLLKLEPRNHGAI+LLSN+YAKTGRW+KVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELAELASDQLLKLEPRNHGAIILLSNVYAKTGRWDKVSELRKLMRDSELKKEPGC 600

Query: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660
           SS+E NG VHEFLVGDN+HPLS +IYSKL+EIA KLKSVGYEPNKSHLLQLIEEDD+KE 
Sbjct: 601 SSVEVNGIVHEFLVGDNSHPLSRDIYSKLDEIAAKLKSVGYEPNKSHLLQLIEEDDVKEH 660

Query: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGL++LAPSQPIRVVKNLRICGDCH  AKL+SRVY+RDIL++DRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLISLAPSQPIRVVKNLRICGDCHEVAKLISRVYNRDILVQDRYRFH 720

Query: 721 HFRDGHCSCMDYW 733
           HFRDGHCSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of CsGy1G023300 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 939.1 bits (2426), Expect = 2.1e-273
Identity = 447/727 (61.49%), Positives = 564/727 (77.58%), Query Frame = 0

Query: 7   PSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTA 66
           P+ S  N  T NN       + +S I++C S +QLK+ H  M+RTG F DP+SASKLF  
Sbjct: 16  PNFSNPNQPTTNN----ERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 75

Query: 67  SALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNK 126
           +ALSSF++L+YAR +FD+IP+PN + WNTLIRAYAS  DP  S   FLD++ + +  PNK
Sbjct: 76  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 135

Query: 127 FTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKG 186
           +TFPF+IKAA+E+ +  +G ++HGMA+K + G D+++ NSL+  Y +CGDL  A ++F  
Sbjct: 136 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 195

Query: 187 ISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 246
           I  KDVVSWNSMI+ F Q   P+ ALELF KME E+V  + VTMVGVLSACAK  +LEFG
Sbjct: 196 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 255

Query: 247 RWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 306
           R VCSYIE   + V+LTL NAMLDMYTKCGS++DA++LFD M E+D  +WT MLDGYA  
Sbjct: 256 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 315

Query: 307 GDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVST 366
            DY+AAR V N+MP K+I AWN LISAYEQNGKP EAL +F+ELQL K  K +++TLVST
Sbjct: 316 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 375

Query: 367 LSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDV 426
           LSACAQ+GA++LG WIH YIK+ GI +N H+ S+L+ MY+KCG LEK+ EVF SVE+RDV
Sbjct: 376 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 435

Query: 427 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHE 486
           +VWSAMI GL MHG G  A+D+F++MQEA VKPN VTFTNV CACSH GLVDE    FH+
Sbjct: 436 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 495

Query: 487 MEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVE 546
           ME  YG+VPE KHYAC+VD+LGR+G+LE+A++ I  M   PS SVWGALLGAC +H N+ 
Sbjct: 496 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 555

Query: 547 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEAN 606
           L E+A  +LL+LEPRN GA VLLSNIYAK G+WE VSELRK MR T LKKEPGCSSIE +
Sbjct: 556 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 615

Query: 607 GNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 666
           G +HEFL GDN HP+S  +Y KL E+  KLKS GYEP  S +LQ+IEE+++KEQ+L+LHS
Sbjct: 616 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 675

Query: 667 EKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGH 726
           EKLAI +GL++    + IRV+KNLR+CGDCH+ AKL+S++YDR+I++RDRYRFHHFR+G 
Sbjct: 676 EKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQ 735

Query: 727 CSCMDYW 734
           CSC D+W
Sbjct: 736 CSCNDFW 738

BLAST of CsGy1G023300 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 605.9 bits (1561), Expect = 4.3e-173
Identity = 314/744 (42.20%), Positives = 449/744 (60.35%), Query Frame = 0

Query: 24  RNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASALS-SFSTLDYARNLF 83
           RNH  LS +  C + + L+ +HA+M++ GL    ++ SKL     LS  F  L YA ++F
Sbjct: 32  RNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVF 91

Query: 84  DQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNKFTFPFVIKAASELKAS 143
             I +PNL  WNT+ R +A SSDP  +  +++ ++     LPN +TFPFV+K+ ++ KA 
Sbjct: 92  KTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMI-SLGLLPNSYTFPFVLKSCAKSKAF 151

Query: 144 RVGTAVHGMAIKLSFGMDLYILNS-------------------------------LVRFY 203
           + G  +HG  +KL   +DLY+  S                               L++ Y
Sbjct: 152 KEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGY 211

Query: 204 GACGDLSMAERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMV 263
            + G +  A++LF  I  KDVVSWN+MIS +A+    ++ALELF  M + NV P+  TMV
Sbjct: 212 ASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMV 271

Query: 264 GVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPER 323
            V+SACA+   +E GR V  +I+  G   +L + NA++D+Y+KCG ++ A  LF+ +P +
Sbjct: 272 TVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYK 331

Query: 324 DVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQ 383
           DV SW  ++ GY  M  Y                               KEAL +F E+ 
Sbjct: 332 DVISWNTLIGGYTHMNLY-------------------------------KEALLLFQEM- 391

Query: 384 LSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKR--EGIVLNCHLISSLVDMYAKCG 443
           L     P++VT++S L ACA LGAID+G WIHVYI +  +G+     L +SL+DMYAKCG
Sbjct: 392 LRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCG 451

Query: 444 SLEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLC 503
            +E A +VF S+  + +  W+AMI G  MHGR  A+ DLF  M++  ++P+ +TF  +L 
Sbjct: 452 DIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLS 511

Query: 504 ACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSA 563
           ACSH+G++D GR  F  M   Y + P+++HY CM+D+LG +G  +EA E+IN M   P  
Sbjct: 512 ACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDG 571

Query: 564 SVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLM 623
            +W +LL AC +H NVELGE  ++ L+K+EP N G+ VLLSNIYA  GRW +V++ R L+
Sbjct: 572 VIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALL 631

Query: 624 RDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLL 683
            D  +KK PGCSSIE +  VHEF++GD  HP +  IY  LEE+   L+  G+ P+ S +L
Sbjct: 632 NDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVL 691

Query: 684 QLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDR 734
           Q +EE + KE AL  HSEKLAIAFGL++  P   + +VKNLR+C +CH   KL+S++Y R
Sbjct: 692 QEMEE-EWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKR 741

BLAST of CsGy1G023300 vs. TAIR 10
Match: AT4G14820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 558.9 bits (1439), Expect = 6.0e-159
Identity = 279/713 (39.13%), Positives = 437/713 (61.29%), Query Frame = 0

Query: 28  ILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASALSSFSTLDYARNLFDQIPQ 87
           IL  +  C S   +K++HA +LRT    +    S LF  S  SS   L YA N+F  IP 
Sbjct: 15  ILEKLSFCKSLNHIKQLHAHILRT--VINHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74

Query: 88  -PNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNKFTFPFVIKAASELKASRVGT 147
            P    +N  +R  + SS+P ++ ++F   +       ++F+F  ++KA S++ A   G 
Sbjct: 75  PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134

Query: 148 AVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKGISCKDVVSWNSMISAFAQGN 207
            +HG+A K++   D ++    +  Y +CG ++ A  +F  +S +DVV+WN+MI  + +  
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194

Query: 208 CPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCN 267
             ++A +LF +M+  NVMP+ + +  ++SAC +  ++ + R +  ++    +++D  L  
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254

Query: 268 AMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAA 327
           A++ MY   G +D A++ F +M  R++F  T M+ GY+K G  D A+++F+    K++  
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314

Query: 328 WNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 387
           W  +ISAY ++  P+EAL +F E+  S I KPD V++ S +SACA LG +D   W+H  I
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374

Query: 388 KREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAI 447
              G+     + ++L++MYAKCG L+   +VF  +  R+V  WS+MI  L MHG    A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434

Query: 448 DLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDI 507
            LF  M++  V+PN VTF  VL  CSH+GLV+EG+  F  M   Y + P+++HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494

Query: 508 LGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAI 567
            GRA  L EA+E+I  M    +  +WG+L+ AC +H  +ELG+ A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554

Query: 568 VLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIY 627
           VL+SNIYA+  RWE V  +R++M +  + KE G S I+ NG  HEFL+GD  H  S+ IY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614

Query: 628 SKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQP--- 687
           +KL+E+ +KLK  GY P+   +L  +EE++ K+  L  HSEKLA+ FGL+     +    
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674

Query: 688 ---IRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGHCSCMDYW 734
              IR+VKNLR+C DCH F KLVS+VY+R+I++RDR RFH +++G CSC DYW
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of CsGy1G023300 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 541.6 bits (1394), Expect = 9.9e-154
Identity = 284/727 (39.06%), Positives = 433/727 (59.56%), Query Frame = 0

Query: 15  STLNNNLLFRNHQI------LSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASA 74
           S L + LL+ N  I       S ID  +   QLK++HAR+L  GL F  F  +KL  AS 
Sbjct: 5   SCLASPLLYTNSGIHSDSFYASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHAS- 64

Query: 75  LSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNKFT 134
            SSF  + +AR +FD +P+P ++ WN +IR Y S ++ FQ  ++    +      P+ FT
Sbjct: 65  -SSFGDITFARQVFDDLPRPQIFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFT 124

Query: 135 FPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLFKGIS 194
           FP ++KA S L   ++G  VH    +L F  D+++ N L+  Y  C  L  A  +F+G+ 
Sbjct: 125 FPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLP 184

Query: 195 C--KDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFG 254
              + +VSW +++SA+AQ   P +ALE+F +M + +V P+ V +V VL+A     DL+ G
Sbjct: 185 LPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQG 244

Query: 255 RWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKM 314
           R + + + + G++++  L  ++  MY KCG V  A+ LFD+M   ++  W  M+ GYAK 
Sbjct: 245 RSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK- 304

Query: 315 GDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVST 374
                                         NG  +EA+ +F+E+ ++K  +PD +++ S 
Sbjct: 305 ------------------------------NGYAREAIDMFHEM-INKDVRPDTISITSA 364

Query: 375 LSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDV 434
           +SACAQ+G+++    ++ Y+ R     +  + S+L+DM+AKCGS+E A  VF    +RDV
Sbjct: 365 ISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDV 424

Query: 435 YVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHE 494
            VWSAMI G G+HGR + AI L+  M+   V PN VTF  +L AC+H+G+V EG  FF+ 
Sbjct: 425 VVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNR 484

Query: 495 MEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVE 554
           M   + + P+ +HYAC++D+LGRAG L++A E+I  M   P  +VWGALL AC  H +VE
Sbjct: 485 MAD-HKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVE 544

Query: 555 LGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEAN 614
           LGE A+ QL  ++P N G  V LSN+YA    W++V+E+R  M++  L K+ GCS +E  
Sbjct: 545 LGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVR 604

Query: 615 GNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHS 674
           G +  F VGD +HP    I  ++E I ++LK  G+  NK   L  + +++  E+ L  HS
Sbjct: 605 GRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHS 664

Query: 675 EKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGH 734
           E++AIA+GL++     P+R+ KNLR C +CHA  KL+S++ DR+I++RD  RFHHF+DG 
Sbjct: 665 ERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGV 694

BLAST of CsGy1G023300 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 523.1 bits (1346), Expect = 3.6e-148
Identity = 273/707 (38.61%), Positives = 414/707 (58.56%), Query Frame = 0

Query: 32  IDKCSSSKQLKEVHARMLRTGLFFDPFSASKL--FTASALSSFSTLDYARNLFDQIPQPN 91
           I+ C + + L ++HA  +++G   D  +A+++  F A++      LDYA  +F+Q+PQ N
Sbjct: 30  INNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRN 89

Query: 92  LYTWNTLIRAYASSSD--PFQSFVIFLDLLDKCEDLPNKFTFPFVIKAASELKASRVGTA 151
            ++WNT+IR ++ S +     +  +F +++      PN+FTFP V+KA ++    + G  
Sbjct: 90  CFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQ 149

Query: 152 VHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLF-KGISCKDVVSWNSMISAFAQGN 211
           +HG+A+K  FG D +++++LVR Y  CG +  A  LF K I  KD+V             
Sbjct: 150 IHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMV------------- 209

Query: 212 CPEDALELFLKMERENVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKGIKVDLTLCN 271
                                                                       
Sbjct: 210 ------------------------------------------------------------ 269

Query: 272 AMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIMLDGYAKMGDYDAARLVFNAMPVKEIAA 331
            M D   + G               ++  W +M+DGY ++GD  AAR++F+ M  + + +
Sbjct: 270 VMTDRRKRDG---------------EIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVS 329

Query: 332 WNVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 391
           WN +IS Y  NG  K+A+ +F E++   I +P+ VTLVS L A ++LG+++LG W+H+Y 
Sbjct: 330 WNTMISGYSLNGFFKDAVEVFREMKKGDI-RPNYVTLVSVLPAISRLGSLELGEWLHLYA 389

Query: 392 KREGIVLNCHLISSLVDMYAKCGSLEKALEVFYSVEERDVYVWSAMIAGLGMHGRGKAAI 451
           +  GI ++  L S+L+DMY+KCG +EKA+ VF  +   +V  WSAMI G  +HG+   AI
Sbjct: 390 EDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAI 449

Query: 452 DLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEGRVFFHEMEPVYGVVPEMKHYACMVDI 511
           D F +M++A V+P+ V + N+L ACSH GLV+EGR +F +M  V G+ P ++HY CMVD+
Sbjct: 450 DCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDL 509

Query: 512 LGRAGFLEEAMELINEMSTTPSASVWGALLGACSLHMNVELGELASDQLLKLEPRNHGAI 571
           LGR+G L+EA E I  M   P   +W ALLGAC +  NVE+G+  ++ L+ + P + GA 
Sbjct: 510 LGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAY 569

Query: 572 VLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGCSSIEANGNVHEFLVGDNTHPLSSNIY 631
           V LSN+YA  G W +VSE+R  M++ +++K+PGCS I+ +G +HEF+V D++HP +  I 
Sbjct: 570 VALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEIN 629

Query: 632 SKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQALSLHSEKLAIAFGLVTLAPSQPIRV 691
           S L EI+ KL+  GY P  + +L  +EE+D KE  L  HSEK+A AFGL++ +P +PIR+
Sbjct: 630 SMLVEISDKLRLAGYRPITTQVLLNLEEED-KENVLHYHSEKIATAFGLISTSPGKPIRI 646

Query: 692 VKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFHHFRDGHCSCMDYW 734
           VKNLRIC DCH+  KL+S+VY R I +RDR RFHHF+DG CSCMDYW
Sbjct: 690 VKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646
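
The identity and positives percentages in the HSP headers above are the ratio of matching (or conservatively substituted) columns to the alignment length. A minimal Python sketch that recomputes them from a header line, assuming the header layout shown on this page; the function name `hsp_percentages` is hypothetical and not part of any BLAST tool:

```python
import re

# Matches the counts in an HSP header line such as
# "Identity = 273/707 (38.61%), Positives = 414/707 (58.56%), Query Frame = 0".
# "Posi?tives" also tolerates the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(header):
    """Return (identity %, positives %) rounded to two decimals."""
    match = HSP_STATS.search(header)
    if match is None:
        raise ValueError("no HSP statistics found in header line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))


line = "Identity = 273/707 (38.61%), Positives = 414/707 (58.56%), Query Frame = 0"
print(hsp_percentages(line))
```

For the AT5G48910.1 header this gives 38.61 and 58.56, matching the values printed in the header itself.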

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
O82380      3.0e-272  61.49     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
Q9LN01      6.0e-172  42.20     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
O23337      8.4e-158  39.13     Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana OX...
Q9LTV8      1.4e-152  39.06     Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...
Q9FI80      5.1e-147  38.61     Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX...

Match Name      E-value  Identity  Description
XP_004145320.1  0.0      100.00    pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis sa...
XP_008457379.1  0.0      97.00     PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic ...
KAA0031814.1    0.0      96.86     pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]
XP_038893523.1  0.0      92.22     pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Benincasa ...
XP_022964665.1  0.0      87.99     pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucurbita ...

Match Name  E-value  Identity  Description
A0A0A0M0R9  0.0      100.00    DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G5301...
A0A5D3BBW6  0.0      97.00     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3C623  0.0      97.00     pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucumis ...
A0A5A7SKX2  0.0      96.86     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1HLG4  0.0      87.99     pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Cucurbit...

Match Name   E-value   Identity  Description
AT2G29760.1  2.1e-273  61.49     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G08070.1  4.3e-173  42.20     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G14820.1  6.0e-159  39.13     Pentatricopeptide repeat (PPR) superfamily protein
AT3G12770.1  9.9e-154  39.06     mitochondrial editing factor 22
AT5G48910.1  3.6e-148  38.61     Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 427..460  e-value: 3.8E-7  score: 27.9
    coord: 399..427  e-value: 0.0032  score: 15.5
    coord: 500..524  e-value: 0.0014  score: 16.7
    coord: 193..226  e-value: 1.9E-7  score: 28.8
    coord: 462..495  e-value: 0.0015  score: 16.6
    coord: 265..293  e-value: 1.0E-4  score: 20.2
    coord: 294..321  e-value: 1.5E-5  score: 22.8
    coord: 326..359  e-value: 9.3E-5  score: 20.4
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 425..472  e-value: 1.8E-10  score: 40.9
    coord: 190..239  e-value: 2.9E-11  score: 43.4
    coord: 324..371  e-value: 4.0E-7  score: 30.1
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 500..523  e-value: 0.0019  score: 18.3
    coord: 294..322  e-value: 3.2E-5  score: 23.9
    coord: 265..292  e-value: 6.8E-6  score: 26.0
    coord: 164..186  e-value: 0.5  score: 10.7
    coord: 91..117  e-value: 0.065  score: 13.5
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 191..225  score: 12.353442
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 425..459  score: 10.884628
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 323..357  score: 9.689847
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 261..295  score: 11.070971
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 146..244  e-value: 8.3E-21  score: 76.1
    coord: 245..325  e-value: 6.0E-20  score: 73.3
    coord: 9..145  e-value: 4.1E-9  score: 38.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 327..461  e-value: 1.8E-27  score: 98.5
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 465..605  e-value: 4.0E-16  score: 60.9
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 303..582
IPR032867  DYW domain  PFAM  PF14432  DYW_deaminase
    coord: 598..723  e-value: 9.0E-36  score: 122.6
None  No IPR available  PANTHER  PTHR47926:SF99  BNAANNG32650D PROTEIN
    coord: 171..238
    coord: 304..719
    coord: 190..305
None  No IPR available  PANTHER  PTHR47926:SF99  BNAANNG32650D PROTEIN
    coord: 14..184
None  No IPR available  PANTHER  PTHR47926  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 171..238
    coord: 304..719
    coord: 190..305
None  No IPR available  PANTHER  PTHR47926  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 14..184
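
Each `coord: start..end` entry above gives the span of a domain hit on the protein. A minimal Python sketch for turning such an entry into numbers, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention); the helper name `parse_coord` is hypothetical:

```python
import re

# Pattern for entries such as "coord: 598..723".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end, length) for the first coord entry in text.

    Length assumes 1-based inclusive coordinates (an assumption).
    """
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coord entry found")
    start, end = int(match.group(1)), int(match.group(2))
    return start, end, end - start + 1


# DYW_deaminase hit (PF14432) from the table above.
print(parse_coord("coord: 598..723"))
```

Under that assumption the DYW_deaminase hit at 598..723 spans 126 residues.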

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy1G023300.2  CsGy1G023300.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding