CsGy5G012620 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy5G012620
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPentatricopeptide repeat-containing protein
LocationGy14Chr5: 14855097 .. 14856995 (+)
RNA-Seq ExpressionCsGy5G012620
SyntenyCsGy5G012620
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATCAAATGAGCAAAAAGACCATCCTCTCAAAAATCATTGGTAAGAAGCACCATTTATTCTCTTACCCATTTTTCCATTTACCCTCCAGACTTCTAGTTTTTCAAATACACTATGATGTAGCCACGTGCAATCTTTCTCTCCAATCATATGCCAACCACAAGAATCTCACCAAAGGGAGACAGCTACATTCCTTAATGGTCACTTCTGGATTTATTCATTTGCCTTCATCCATTACTAGCTTGATCAACATGTACTCTAGATGCAATCAAATGGAAGAGGCTGTTTTAGTTTTCCGTGATCCATATCATGAGCGTAATGTGTTTGCATACAATGCAATAATTGCTGGATTTGTTGCAAACGGGCTTGCAGCAGATGGATTTCAGTTTTATAAGAGAATGAGGTCAGTGGGTGTAATGCCTGATAAGTTCACTTTTCCATGTGTAGTTAGAGCTTGTTGTGAGTTCATGGAGGTAAGGAAGATTCATGGTTGTTTATTTAAAATGGGATTGGAGTTGAATGTGTTTGTTGGTAGTGCTCTTGTCAATACTTACTTGAAGGTTGATGGAACGGAGGATGCTGAGAAAGTTTTTGAAGAGTTACCCGAGAGAGATGTTGTGCTTTGGAATGCAATGATCAATGGGTACACCAAAATTGGTCACCTCAACAAAGCAGTAGTGGTTTTTAAGAGAATGGGTGAAGAAGGGATTTCACTTAGTAGATTTACAACGACTAGCATTTTGTCTATTTTAACTTCGATGGGAGATATTAACAATGGGAGAGCTATTCATGGAATTGTAACAAAAATGGGTTATAGTTCATGTGTTGCAGTTTCAAATGCACTAATTGATATGTATGGGAAGTGCAAGCATACTGAAGATGCTTTAATGATTTTTGAGATGATAAATGAGAAGGATTTATTTTCATGGAATTCCATTATATCTGCTCATGAGCAATGTGATGATCATGATGGTACCTTAAGACTTTTTGGCAAGATGTTAGGTTCTAGGGTTCTACCTGATGTGATTACTATCACCGCTGTACTTCCAGCTTGCTCTCACTTGGCTGCCCTCATGCATGGTAGAGAAATTCATGGATATATGATTGTTAATGGATTGGGAAAAAATGAAAATGGTGATGATGTGTTATTAAACAACGCCATTATGGACATGTATGCAAAGTGCGGATGCATGAAAAATGCTGACATAATATTTGATCTAATGAGGAATAAGGATGTGGCGTCTTGGAACATCATGATTATGGGTTATGCAATGCATGGATATGGTACAGAGGCGTTGGATATGTTTCATCGAATGTGTGAGGCCCAAATTAAACCAGATGTTGTCACATTTGTTGGAGTTTTATCTGCTTGTAGCCATGCAGGCTTTGTACATCAAGGGCGCTCATTCTTAACTCGAATGGAACTGGAATTTGGCGTGATTCCAACTATTGAGCATTATACATGTATAATCGACATGCTTGGTCGAGCTGGACATTTAGGGGAAGCTTATGACCTGGCTCAAAGAATACCTCTTGAAGACAACCTCATTTTATGGATGGCATTATTGGGAGCATGTCGACTTCATGGTAATGCAGAGTTGGGAAATGTTGTTGGAGAAAAGATAACGCAACTTGAACCTAAGCATTGTGGTAGTGGTAGTTATATATTGATGTCTAGTTTGTACGGAGTCGTAGGTCGATATGAAGAAGCATTGGAAGTTAGACGAACAATGAAGGAACAAAATGTTAAGAAGACACCAGGTTGTAGCTGGATTGAACTCAAGGATGGGCTGTATGTTTTTAGCATGGGAGACAGGACACATCATGAACTAAATGCATTGATTAACTGCCTTTGTGGCTTTGGATACTTTCATGATGAAGTGATGCATTCGTTTTAA

mRNA sequence

ATGAATCAAATGAGCAAAAAGACCATCCTCTCAAAAATCATTGGTAAGAAGCACCATTTATTCTCTTACCCATTTTTCCATTTACCCTCCAGACTTCTAGTTTTTCAAATACACTATGATGTAGCCACGTGCAATCTTTCTCTCCAATCATATGCCAACCACAAGAATCTCACCAAAGGGAGACAGCTACATTCCTTAATGGTCACTTCTGGATTTATTCATTTGCCTTCATCCATTACTAGCTTGATCAACATGTACTCTAGATGCAATCAAATGGAAGAGGCTGTTTTAGTTTTCCGTGATCCATATCATGAGCGTAATGTGTTTGCATACAATGCAATAATTGCTGGATTTGTTGCAAACGGGCTTGCAGCAGATGGATTTCAGTTTTATAAGAGAATGAGGTCAGTGGGTGTAATGCCTGATAAGTTCACTTTTCCATGTGTAGTTAGAGCTTGTTGTGAGTTCATGGAGGTAAGGAAGATTCATGGTTGTTTATTTAAAATGGGATTGGAGTTGAATGTGTTTGTTGGTAGTGCTCTTGTCAATACTTACTTGAAGGTTGATGGAACGGAGGATGCTGAGAAAGTTTTTGAAGAGTTACCCGAGAGAGATGTTGTGCTTTGGAATGCAATGATCAATGGGTACACCAAAATTGGTCACCTCAACAAAGCAGTAGTGGTTTTTAAGAGAATGGGTGAAGAAGGGATTTCACTTAGTAGATTTACAACGACTAGCATTTTGTCTATTTTAACTTCGATGGGAGATATTAACAATGGGAGAGCTATTCATGGAATTGTAACAAAAATGGGTTATAGTTCATGTGTTGCAGTTTCAAATGCACTAATTGATATGTATGGGAAGTGCAAGCATACTGAAGATGCTTTAATGATTTTTGAGATGATAAATGAGAAGGATTTATTTTCATGGAATTCCATTATATCTGCTCATGAGCAATGTGATGATCATGATGGTACCTTAAGACTTTTTGGCAAGATGTTAGGTTCTAGGGTTCTACCTGATGTGATTACTATCACCGCTGTACTTCCAGCTTGCTCTCACTTGGCTGCCCTCATGCATGGTAGAGAAATTCATGGATATATGATTGTTAATGGATTGGGAAAAAATGAAAATGGTGATGATGTGTTATTAAACAACGCCATTATGGACATGTATGCAAAGTGCGGATGCATGAAAAATGCTGACATAATATTTGATCTAATGAGGAATAAGGATGTGGCGTCTTGGAACATCATGATTATGGGTTATGCAATGCATGGATATGGTACAGAGGCGTTGGATATGTTTCATCGAATGTGTGAGGCCCAAATTAAACCAGATGTTGTCACATTTGTTGGAGTTTTATCTGCTTGTAGCCATGCAGGCTTTGTACATCAAGGGCGCTCATTCTTAACTCGAATGGAACTGGAATTTGGCGTGATTCCAACTATTGAGCATTATACATGTATAATCGACATGCTTGGTCGAGCTGGACATTTAGGGGAAGCTTATGACCTGGCTCAAAGAATACCTCTTGAAGACAACCTCATTTTATGGATGGCATTATTGGGAGCATGTCGACTTCATGGTAATGCAGAGTTGGGAAATGTTGTTGGAGAAAAGATAACGCAACTTGAACCTAAGCATTGTGGTAGTGGTAGTTATATATTGATGTCTAGTTTGTACGGAGTCGTAGGTCGATATGAAGAAGCATTGGAAGTTAGACGAACAATGAAGGAACAAAATGTTAAGAAGACACCAGGTTGTAGCTGGATTGAACTCAAGGATGGGCTGTATGTTTTTAGCATGGGAGACAGGACACATCATGAACTAAATGCATTGATTAACTGCCTTTGTGGCTTTGGATACTTTCATGATGAAGTGATGCATTCGTTTTAA

Coding sequence (CDS)

ATGAATCAAATGAGCAAAAAGACCATCCTCTCAAAAATCATTGGTAAGAAGCACCATTTATTCTCTTACCCATTTTTCCATTTACCCTCCAGACTTCTAGTTTTTCAAATACACTATGATGTAGCCACGTGCAATCTTTCTCTCCAATCATATGCCAACCACAAGAATCTCACCAAAGGGAGACAGCTACATTCCTTAATGGTCACTTCTGGATTTATTCATTTGCCTTCATCCATTACTAGCTTGATCAACATGTACTCTAGATGCAATCAAATGGAAGAGGCTGTTTTAGTTTTCCGTGATCCATATCATGAGCGTAATGTGTTTGCATACAATGCAATAATTGCTGGATTTGTTGCAAACGGGCTTGCAGCAGATGGATTTCAGTTTTATAAGAGAATGAGGTCAGTGGGTGTAATGCCTGATAAGTTCACTTTTCCATGTGTAGTTAGAGCTTGTTGTGAGTTCATGGAGGTAAGGAAGATTCATGGTTGTTTATTTAAAATGGGATTGGAGTTGAATGTGTTTGTTGGTAGTGCTCTTGTCAATACTTACTTGAAGGTTGATGGAACGGAGGATGCTGAGAAAGTTTTTGAAGAGTTACCCGAGAGAGATGTTGTGCTTTGGAATGCAATGATCAATGGGTACACCAAAATTGGTCACCTCAACAAAGCAGTAGTGGTTTTTAAGAGAATGGGTGAAGAAGGGATTTCACTTAGTAGATTTACAACGACTAGCATTTTGTCTATTTTAACTTCGATGGGAGATATTAACAATGGGAGAGCTATTCATGGAATTGTAACAAAAATGGGTTATAGTTCATGTGTTGCAGTTTCAAATGCACTAATTGATATGTATGGGAAGTGCAAGCATACTGAAGATGCTTTAATGATTTTTGAGATGATAAATGAGAAGGATTTATTTTCATGGAATTCCATTATATCTGCTCATGAGCAATGTGATGATCATGATGGTACCTTAAGACTTTTTGGCAAGATGTTAGGTTCTAGGGTTCTACCTGATGTGATTACTATCACCGCTGTACTTCCAGCTTGCTCTCACTTGGCTGCCCTCATGCATGGTAGAGAAATTCATGGATATATGATTGTTAATGGATTGGGAAAAAATGAAAATGGTGATGATGTGTTATTAAACAACGCCATTATGGACATGTATGCAAAGTGCGGATGCATGAAAAATGCTGACATAATATTTGATCTAATGAGGAATAAGGATGTGGCGTCTTGGAACATCATGATTATGGGTTATGCAATGCATGGATATGGTACAGAGGCGTTGGATATGTTTCATCGAATGTGTGAGGCCCAAATTAAACCAGATGTTGTCACATTTGTTGGAGTTTTATCTGCTTGTAGCCATGCAGGCTTTGTACATCAAGGGCGCTCATTCTTAACTCGAATGGAACTGGAATTTGGCGTGATTCCAACTATTGAGCATTATACATGTATAATCGACATGCTTGGTCGAGCTGGACATTTAGGGGAAGCTTATGACCTGGCTCAAAGAATACCTCTTGAAGACAACCTCATTTTATGGATGGCATTATTGGGAGCATGTCGACTTCATGGTAATGCAGAGTTGGGAAATGTTGTTGGAGAAAAGATAACGCAACTTGAACCTAAGCATTGTGGTAGTGGTAGTTATATATTGATGTCTAGTTTGTACGGAGTCGTAGGTCGATATGAAGAAGCATTGGAAGTTAGACGAACAATGAAGGAACAAAATGTTAAGAAGACACCAGGTTGTAGCTGGATTGAACTCAAGGATGGGCTGTATGTTTTTAGCATGGGAGACAGGACACATCATGAACTAAATGCATTGATTAACTGCCTTTGTGGCTTTGGATACTTTCATGATGAAGTGATGCATTCGTTTTAA

Protein sequence

MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKGRQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSALVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLSRFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFEMINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMHGREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMIMGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGEKITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDRTHHELNALINCLCGFGYFHDEVMHSF*
Homology
BLAST of CsGy5G012620 vs. ExPASy Swiss-Prot
Match: Q9LUC2 (Pentatricopeptide repeat-containing protein At3g14730 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E31 PE=2 SV=1)

HSP 1 Score: 639.0 bits (1647), Expect = 5.5e-182
Identity = 301/581 (51.81%), Postives = 416/581 (71.60%), Query Frame = 0

Query: 38  HYDVATCNLSLQSYANHKNLTKGRQLHSLMVTSGFI-HLPSSITSLINMYSRCNQMEEAV 97
           H++VATC  +LQ  A  K+   G+Q+H  MV  GF+   P + TSL+NMY++C  M  AV
Sbjct: 57  HHNVATCIATLQRCAQRKDYVSGQQIHGFMVRKGFLDDSPRAGTSLVNMYAKCGLMRRAV 116

Query: 98  LVFRDPYHERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVGVMPDKFTFPCVVRA--CC 157
           LVF     ER+VF YNA+I+GFV NG   D  + Y+ MR+ G++PDK+TFP +++     
Sbjct: 117 LVFGG--SERDVFGYNALISGFVVNGSPLDAMETYREMRANGILPDKYTFPSLLKGSDAM 176

Query: 158 EFMEVRKIHGCLFKMGLELNVFVGSALVNTYLKVDGTEDAEKVFEELPER-DVVLWNAMI 217
           E  +V+K+HG  FK+G + + +VGS LV +Y K    EDA+KVF+ELP+R D VLWNA++
Sbjct: 177 ELSDVKKVHGLAFKLGFDSDCYVGSGLVTSYSKFMSVEDAQKVFDELPDRDDSVLWNALV 236

Query: 218 NGYTKIGHLNKAVVVFKRMGEEGISLSRFTTTSILSILTSMGDINNGRAIHGIVTKMGYS 277
           NGY++I     A++VF +M EEG+ +SR T TS+LS  T  GDI+NGR+IHG+  K G  
Sbjct: 237 NGYSQIFRFEDALLVFSKMREEGVGVSRHTITSVLSAFTVSGDIDNGRSIHGLAVKTGSG 296

Query: 278 SCVAVSNALIDMYGKCKHTEDALMIFEMINEKDLFSWNSIISAHEQCDDHDGTLRLFGKM 337
           S + VSNALIDMYGK K  E+A  IFE ++E+DLF+WNS++  H+ C DHDGTL LF +M
Sbjct: 297 SDIVVSNALIDMYGKSKWLEEANSIFEAMDERDLFTWNSVLCVHDYCGDHDGTLALFERM 356

Query: 338 LGSRVLPDVITITAVLPACSHLAALMHGREIHGYMIVNGLGKNENGDDVLLNNAIMDMYA 397
           L S + PD++T+T VLP C  LA+L  GREIHGYMIV+GL  N    +  ++N++MDMY 
Sbjct: 357 LCSGIRPDIVTLTTVLPTCGRLASLRQGREIHGYMIVSGL-LNRKSSNEFIHNSLMDMYV 416

Query: 398 KCGCMKNADIIFDLMRNKDVASWNIMIMGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVG 457
           KCG +++A ++FD MR KD ASWNIMI GY +   G  ALDMF  MC A +KPD +TFVG
Sbjct: 417 KCGDLRDARMVFDSMRVKDSASWNIMINGYGVQSCGELALDMFSCMCRAGVKPDEITFVG 476

Query: 458 VLSACSHAGFVHQGRSFLTRMELEFGVIPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLE 517
           +L ACSH+GF+++GR+FL +ME  + ++PT +HY C+IDMLGRA  L EAY+LA   P+ 
Sbjct: 477 LLQACSHSGFLNEGRNFLAQMETVYNILPTSDHYACVIDMLGRADKLEEAYELAISKPIC 536

Query: 518 DNLILWMALLGACRLHGNAELGNVVGEKITQLEPKHCGSGSYILMSSLYGVVGRYEEALE 577
           DN ++W ++L +CRLHGN +L  V G+++ +LEP+HC  G Y+LMS++Y   G+YEE L+
Sbjct: 537 DNPVVWRSILSSCRLHGNKDLALVAGKRLHELEPEHC--GGYVLMSNVYVEAGKYEEVLD 596

Query: 578 VRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDRTHHELNAL 615
           VR  M++QNVKKTPGCSWI LK+G++ F  G++TH E  ++
Sbjct: 597 VRDAMRQQNVKKTPGCSWIVLKNGVHTFFTGNQTHPEFKSI 632

BLAST of CsGy5G012620 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 389.4 bits (999), Expect = 7.6e-107
Identity = 212/614 (34.53%), Postives = 348/614 (56.68%), Query Frame = 0

Query: 1   MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG 60
           MN+++K    S  IG            L  +++   +  D  T +   +S+++ +++  G
Sbjct: 167 MNELAKSGDFSGSIG------------LFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGG 226

Query: 61  RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA 120
            QLH  ++ SGF    S   SL+  Y +  +++ A  VF D   ER+V ++N+II G+V+
Sbjct: 227 EQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVF-DEMTERDVISWNSIINGYVS 286

Query: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEV---RKIHGCLFKMGLELNVFV 180
           NGLA  G   + +M   G+  D  T   V   C +   +   R +H    K         
Sbjct: 287 NGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRF 346

Query: 181 GSALVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGI 240
            + L++ Y K    + A+ VF E+ +R VV + +MI GY + G   +AV +F+ M EEGI
Sbjct: 347 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 406

Query: 241 SLSRFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALM 300
           S   +T T++L+       ++ G+ +H  + +      + VSNAL+DMY KC   ++A +
Sbjct: 407 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 466

Query: 301 IFEMINEKDLFSWNSIISAHEQCDDHDGTLRLFGKML-GSRVLPDVITITAVLPACSHLA 360
           +F  +  KD+ SWN+II  + +    +  L LF  +L   R  PD  T+  VLPAC+ L+
Sbjct: 467 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 526

Query: 361 ALMHGREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASW 420
           A   GREIHGY++ NG   + +     + N+++DMYAKCG +  A ++FD + +KD+ SW
Sbjct: 527 AFDKGREIHGYIMRNGYFSDRH-----VANSLVDMYAKCGALLLAHMLFDDIASKDLVSW 586

Query: 421 NIMIMGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMEL 480
            +MI GY MHG+G EA+ +F++M +A I+ D ++FV +L ACSH+G V +G  F   M  
Sbjct: 587 TVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRH 646

Query: 481 EFGVIPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGN 540
           E  + PT+EHY CI+DML R G L +AY   + +P+  +  +W ALL  CR+H + +L  
Sbjct: 647 ECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAE 706

Query: 541 VVGEKITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKD 600
            V EK+ +LEP++  +G Y+LM+++Y    ++E+   +R+ + ++ ++K PGCSWIE+K 
Sbjct: 707 KVAEKVFELEPEN--TGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKG 760

Query: 601 GLYVFSMGDRTHHE 611
            + +F  GD ++ E
Sbjct: 767 RVNIFVAGDSSNPE 760

BLAST of CsGy5G012620 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 387.5 bits (994), Expect = 2.9e-106
Identity = 202/614 (32.90%), Postives = 353/614 (57.49%), Query Frame = 0

Query: 45  NLSLQSYANHKNLTKGRQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYH 104
           N  + +Y+   +L  GRQ+   M         S +T L    ++   ++EA  +FR    
Sbjct: 59  NRLIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGL----TKLGFLDEADSLFRS-MP 118

Query: 105 ERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRK--- 164
           ER+   +N++++GF  +    +   ++  M   G + ++++F  V+ AC    ++ K   
Sbjct: 119 ERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQ 178

Query: 165 IHGCLFKMGLELNVFVGSALVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGH 224
           +H  + K     +V++GSALV+ Y K     DA++VF+E+ +R+VV WN++I  + + G 
Sbjct: 179 VHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGP 238

Query: 225 LNKAVVVFKRMGEEGISLSRFTTTSILSILTSMGDINNGRAIHGIVTKMG-YSSCVAVSN 284
             +A+ VF+ M E  +     T  S++S   S+  I  G+ +HG V K     + + +SN
Sbjct: 239 AVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSN 298

Query: 285 ALIDMYGKCKHTEDALMIFE------------MIN-------------------EKDLFS 344
           A +DMY KC   ++A  IF+            MI+                   E+++ S
Sbjct: 299 AFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVS 358

Query: 345 WNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMHGREIHGYMI 404
           WN++I+ + Q  +++  L LF  +    V P   +   +L AC+ LA L  G + H +++
Sbjct: 359 WNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVL 418

Query: 405 VNGLGKNENG--DDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMIMGYAMHG 464
            +G  K ++G  DD+ + N+++DMY KCGC++   ++F  M  +D  SWN MI+G+A +G
Sbjct: 419 KHGF-KFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNG 478

Query: 465 YGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEHY 524
           YG EAL++F  M E+  KPD +T +GVLSAC HAGFV +GR + + M  +FGV P  +HY
Sbjct: 479 YGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHY 538

Query: 525 TCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGEKITQLEP 584
           TC++D+LGRAG L EA  + + +P++ + ++W +LL AC++H N  LG  V EK+ ++EP
Sbjct: 539 TCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEP 598

Query: 585 KHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDRT 619
            +  SG Y+L+S++Y  +G++E+ + VR++M+++ V K PGCSWI+++   +VF + D++
Sbjct: 599 SN--SGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKS 658

BLAST of CsGy5G012620 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 386.7 bits (992), Expect = 4.9e-106
Identity = 210/624 (33.65%), Postives = 344/624 (55.13%), Query Frame = 0

Query: 22  SYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKGRQLHSLMVTSGFIHLPSSITS 81
           SYPF  LPS        YD    + SL    N K L   R +H+ M+  G  +   +++ 
Sbjct: 14  SYPFHFLPSS---SDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSK 73

Query: 82  LIN---MYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVG 141
           LI    +      +  A+ VF+    E N+  +N +  G   +       + Y  M S+G
Sbjct: 74  LIEFCILSPHFEGLPYAISVFK-TIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLG 133

Query: 142 VMPDKFTFPCVVRACCE---FMEVRKIHGCLFKMGLELNVFVGSALVNTYLKVDGTEDAE 201
           ++P+ +TFP V+++C +   F E ++IHG + K+G +L+++V ++L++ Y++    EDA 
Sbjct: 134 LLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAH 193

Query: 202 KV-------------------------------FEELPERDVVLWNAMINGYTKIGHLNK 261
           KV                               F+E+P +DVV WNAMI+GY + G+  +
Sbjct: 194 KVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKE 253

Query: 262 AVVVFKRMGEEGISLSRFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALID 321
           A+ +FK M +  +     T  +++S     G I  GR +H  +   G+ S + + NALID
Sbjct: 254 ALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALID 313

Query: 322 MYGKCKHTEDALMIFEMINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVIT 381
           +Y KC   E A  +FE +  KD+ SWN++I  +   + +   L LF +ML S   P+ +T
Sbjct: 314 LYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVT 373

Query: 382 ITAVLPACSHLAALMHGREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADII 441
           + ++LPAC+HL A+  GR IH Y+     G         L  +++DMYAKCG ++ A  +
Sbjct: 374 MLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASS---LRTSLIDMYAKCGDIEAAHQV 433

Query: 442 FDLMRNKDVASWNIMIMGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFV 501
           F+ + +K ++SWN MI G+AMHG    + D+F RM +  I+PD +TFVG+LSACSH+G +
Sbjct: 434 FNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGML 493

Query: 502 HQGRSFLTRMELEFGVIPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLG 561
             GR     M  ++ + P +EHY C+ID+LG +G   EA ++   + +E + ++W +LL 
Sbjct: 494 DLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLK 553

Query: 562 ACRLHGNAELGNVVGEKITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVK 609
           AC++HGN ELG    E + ++EP++   GSY+L+S++Y   GR+ E  + R  + ++ +K
Sbjct: 554 ACKMHGNVELGESFAENLIKIEPEN--PGSYVLLSNIYASAGRWNEVAKTRALLNDKGMK 613

BLAST of CsGy5G012620 vs. ExPASy Swiss-Prot
Match: Q9STE1 (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 384.4 bits (986), Expect = 2.5e-105
Identity = 207/592 (34.97%), Postives = 335/592 (56.59%), Query Frame = 0

Query: 36  QIHYDVATCNLSLQSYANHKNLTKGRQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEA 95
           QI  +  T +  L   A+   +  G QLH L+V SG     S   SL++MYS+C + ++A
Sbjct: 234 QISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKCGRFDDA 293

Query: 96  VLVFRDPYHERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCE 155
             +FR      +   +N +I+G+V +GL  +   F+  M S GV+PD  TF  ++ +  +
Sbjct: 294 SKLFR-MMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLLPSVSK 353

Query: 156 FMEV---RKIHGCLFKMGLELNVFVGSALVNTYLKVDGTEDAEKVFEELPERDVVLWNAM 215
           F  +   ++IH  + +  + L++F+ SAL++ Y K  G   A+ +F +    DVV++ AM
Sbjct: 354 FENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVVVFTAM 413

Query: 216 INGYTKIGHLNKAVVVFKRMGEEGISLSRFTTTSILSILTSMGDINNGRAIHGIVTKMGY 275
           I+GY   G    ++ +F+ + +  IS +  T  SIL ++  +  +  GR +HG + K G+
Sbjct: 414 ISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFIIKKGF 473

Query: 276 SSCVAVSNALIDMYGKCKHTEDALMIFEMINEKDLFSWNSIISAHEQCDDHDGTLRLFGK 335
            +   +  A+IDMY KC     A  IFE ++++D+ SWNS+I+   Q D+    + +F +
Sbjct: 474 DNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAIDIFRQ 533

Query: 336 MLGSRVLPDVITITAVLPACSHLAALMHGREIHGYMIVNGLGKNENGDDVLLNNAIMDMY 395
           M  S +  D ++I+A L AC++L +   G+ IHG+MI     K+    DV   + ++DMY
Sbjct: 534 MGVSGICYDCVSISAALSACANLPSESFGKAIHGFMI-----KHSLASDVYSESTLIDMY 593

Query: 396 AKCGCMKNADIIFDLMRNKDVASWNIMIMGYAMHGYGTEALDMFHRMCE-AQIKPDVVTF 455
           AKCG +K A  +F  M+ K++ SWN +I     HG   ++L +FH M E + I+PD +TF
Sbjct: 594 AKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSGIRPDQITF 653

Query: 456 VGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEHYTCIIDMLGRAGHLGEAYDLAQRIP 515
           + ++S+C H G V +G  F   M  ++G+ P  EHY C++D+ GRAG L EAY+  + +P
Sbjct: 654 LEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEAYETVKSMP 713

Query: 516 LEDNLILWMALLGACRLHGNAELGNVVGEKITQLEPKHCGSGSYILMSSLYGVVGRYEEA 575
              +  +W  LLGACRLH N EL  V   K+  L+P +  SG Y+L+S+ +     +E  
Sbjct: 714 FPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSN--SGYYVLISNAHANAREWESV 773

Query: 576 LEVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDRTHHE---LNALINCLCG 621
            +VR  MKE+ V+K PG SWIE+    ++F  GD  H E   + +L+N L G
Sbjct: 774 TKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYSLLNSLLG 817

BLAST of CsGy5G012620 vs. NCBI nr
Match: XP_004149501.2 (pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >XP_031740827.1 pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >KAE8648313.1 hypothetical protein Csa_004684 [Cucumis sativus])

HSP 1 Score: 1303 bits (3373), Expect = 0.0
Identity = 632/632 (100.00%), Postives = 632/632 (100.00%), Query Frame = 0

Query: 1   MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG 60
           MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG
Sbjct: 1   MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG 60

Query: 61  RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA 120
           RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA
Sbjct: 61  RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA 120

Query: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSA 180
           NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSA
Sbjct: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSA 180

Query: 181 LVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLS 240
           LVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLS
Sbjct: 181 LVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLS 240

Query: 241 RFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFE 300
           RFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFE
Sbjct: 241 RFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFE 300

Query: 301 MINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMH 360
           MINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMH
Sbjct: 301 MINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMH 360

Query: 361 GREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMI 420
           GREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMI
Sbjct: 361 GREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMI 420

Query: 421 MGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480
           MGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV
Sbjct: 421 MGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480

Query: 481 IPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGE 540
           IPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGE
Sbjct: 481 IPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGE 540

Query: 541 KITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYV 600
           KITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYV
Sbjct: 541 KITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYV 600

Query: 601 FSMGDRTHHELNALINCLCGFGYFHDEVMHSF 632
           FSMGDRTHHELNALINCLCGFGYFHDEVMHSF
Sbjct: 601 FSMGDRTHHELNALINCLCGFGYFHDEVMHSF 632

BLAST of CsGy5G012620 vs. NCBI nr
Match: XP_008466127.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g14730-like [Cucumis melo])

HSP 1 Score: 1251 bits (3236), Expect = 0.0
Identity = 603/632 (95.41%), Postives = 616/632 (97.47%), Query Frame = 0

Query: 1   MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG 60
           MNQMSKKTI+SKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNL LQSYANHKNLTKG
Sbjct: 1   MNQMSKKTIISKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLFLQSYANHKNLTKG 60

Query: 61  RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA 120
           +QLHSLMVTSGFIHLPSSITSLINMYS+CNQMEEAVLVF DPY ERNVFAYNAIIAGFVA
Sbjct: 61  KQLHSLMVTSGFIHLPSSITSLINMYSKCNQMEEAVLVFHDPYRERNVFAYNAIIAGFVA 120

Query: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSA 180
           NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFME+RKIHGCLFKMGLELNVFVGSA
Sbjct: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEIRKIHGCLFKMGLELNVFVGSA 180

Query: 181 LVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLS 240
           LVNTYLKVDG EDAEKVFEELPERDVVLWNAMINGY KIGHLNKAV VFK+MGEEGISLS
Sbjct: 181 LVNTYLKVDGMEDAEKVFEELPERDVVLWNAMINGYIKIGHLNKAVAVFKKMGEEGISLS 240

Query: 241 RFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFE 300
           RFTTTSILS+ TSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKH +DAL+IFE
Sbjct: 241 RFTTTSILSVFTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHNKDALLIFE 300

Query: 301 MINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMH 360
           MINEKDLFSWNSIISAHEQC DHDGTLRLFGKML SRVLPDVITIT VLPACSHLAALMH
Sbjct: 301 MINEKDLFSWNSIISAHEQCGDHDGTLRLFGKMLASRVLPDVITITVVLPACSHLAALMH 360

Query: 361 GREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMI 420
           GREIHGYMIVNGLGKNEN DDVLLNNA+MDMYAKCGCMKNA IIFDLMRNKDVASWNIMI
Sbjct: 361 GREIHGYMIVNGLGKNENSDDVLLNNAVMDMYAKCGCMKNAGIIFDLMRNKDVASWNIMI 420

Query: 421 MGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480
           MGYAMHGYGTEALDMFHRMCEAQIKP+VVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV
Sbjct: 421 MGYAMHGYGTEALDMFHRMCEAQIKPNVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480

Query: 481 IPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGE 540
           IPTIEHYTCIIDMLGRAG LGEAYDLAQRIPL+DNLILWMALLGACRLHGNAELGNVVGE
Sbjct: 481 IPTIEHYTCIIDMLGRAGQLGEAYDLAQRIPLQDNLILWMALLGACRLHGNAELGNVVGE 540

Query: 541 KITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYV 600
           KI QLEPK+CGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELK+GLYV
Sbjct: 541 KIRQLEPKNCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKNGLYV 600

Query: 601 FSMGDRTHHELNALINCLCGFGYFHDEVMHSF 632
           FSMGDRTHHELNALINCLCGF YFHDEVMHSF
Sbjct: 601 FSMGDRTHHELNALINCLCGFQYFHDEVMHSF 632

BLAST of CsGy5G012620 vs. NCBI nr
Match: XP_038877905.1 (pentatricopeptide repeat-containing protein At3g14730-like [Benincasa hispida] >XP_038877906.1 pentatricopeptide repeat-containing protein At3g14730-like [Benincasa hispida])

HSP 1 Score: 1213 bits (3139), Expect = 0.0
Identity = 584/632 (92.41%), Postives = 604/632 (95.57%), Query Frame = 0

Query: 1   MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG 60
           MNQMSKKTILSKIIGKKHHLFS PFF LPSRLLVFQIHYDVATCN  LQSYANHKNLTKG
Sbjct: 1   MNQMSKKTILSKIIGKKHHLFSSPFFCLPSRLLVFQIHYDVATCNFVLQSYANHKNLTKG 60

Query: 61  RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA 120
           +QLHSLM+TSGFIHLPSSITSLINMYS+CNQME+AVLVF DPY ERNVFAYNAIIAGFVA
Sbjct: 61  KQLHSLMITSGFIHLPSSITSLINMYSKCNQMEQAVLVFHDPYRERNVFAYNAIIAGFVA 120

Query: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSA 180
           NGLAA GFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLEL+VFVGSA
Sbjct: 121 NGLAAHGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELDVFVGSA 180

Query: 181 LVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLS 240
           LVNTYLKVD  EDAEKVFEELPERDVVLWNAMINGYT+IG LNKAVVVFK MG+EGIS  
Sbjct: 181 LVNTYLKVDMMEDAEKVFEELPERDVVLWNAMINGYTQIGRLNKAVVVFKNMGKEGISPC 240

Query: 241 RFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFE 300
           RFT T ILSILT M DINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKH EDALMIFE
Sbjct: 241 RFTMTGILSILTLMRDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHIEDALMIFE 300

Query: 301 MINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMH 360
           MINEKDLFSWNS+ISAHEQC DHD TLRLFGKMLGSRVLPDV+TITAVLPACSHLAALMH
Sbjct: 301 MINEKDLFSWNSVISAHEQCGDHDSTLRLFGKMLGSRVLPDVVTITAVLPACSHLAALMH 360

Query: 361 GREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMI 420
           GREIHGYMIVNGLGKNENGDDVLLNNA+MDMYAKCGCMKNADI+FDL RNKDVASWNIMI
Sbjct: 361 GREIHGYMIVNGLGKNENGDDVLLNNAVMDMYAKCGCMKNADIVFDLTRNKDVASWNIMI 420

Query: 421 MGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480
           MGYAMHGYG EALDMFH MCEAQIKPD VTFVGVLSACSHAGFVHQGRSFLTRMELEFGV
Sbjct: 421 MGYAMHGYGKEALDMFHLMCEAQIKPDAVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480

Query: 481 IPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGE 540
           +PTIEHYTCIIDMLGRAGH+ EAY+LAQRIPL++NL+LWMALLGACRLHGNAELG VVGE
Sbjct: 481 VPTIEHYTCIIDMLGRAGHVEEAYELAQRIPLQENLVLWMALLGACRLHGNAELGKVVGE 540

Query: 541 KITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYV 600
           KIT+LEP+HCGSGSYILMSS+YGVVGRYEEALEVRRTM EQNVKKTPGCSWIELKDGLYV
Sbjct: 541 KITRLEPEHCGSGSYILMSSMYGVVGRYEEALEVRRTMNEQNVKKTPGCSWIELKDGLYV 600

Query: 601 FSMGDRTHHELNALINCLCGFGYFHDEVMHSF 632
           FSMGDRTHHELNALINCLCG GYFHDEVMHSF
Sbjct: 601 FSMGDRTHHELNALINCLCGIGYFHDEVMHSF 632

BLAST of CsGy5G012620 vs. NCBI nr
Match: XP_022939865.1 (pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita moschata] >XP_022939866.1 pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 1154 bits (2985), Expect = 0.0
Identity = 551/631 (87.32%), Postives = 586/631 (92.87%), Query Frame = 0

Query: 1   MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG 60
           MNQMSKKT+LSKIIGKKHHLFS PF  LPSRLLVFQ+HYDVATCN  LQSYANHKNL +G
Sbjct: 1   MNQMSKKTVLSKIIGKKHHLFSSPFLSLPSRLLVFQMHYDVATCNFFLQSYANHKNLPEG 60

Query: 61  RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA 120
           +QLHS+M+TSGF+HLPSSITSLINMYS+CNQME+AVLVF DPYHERNVFAYNAIIAGFVA
Sbjct: 61  KQLHSVMITSGFMHLPSSITSLINMYSKCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVA 120

Query: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSA 180
           N L A GFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFK+GLEL++FV SA
Sbjct: 121 NRLPAHGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKLGLELDMFVSSA 180

Query: 181 LVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLS 240
           LVNTYLK D  EDA+KVF+ELPERDVVLWNAMING  KIGHLNKAVVVFK+MGEEGIS  
Sbjct: 181 LVNTYLKFDLMEDAKKVFKELPERDVVLWNAMINGCAKIGHLNKAVVVFKKMGEEGISPC 240

Query: 241 RFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFE 300
           RFT T ILSI + MGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKH EDAL+IFE
Sbjct: 241 RFTITGILSIFSLMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHIEDALVIFE 300

Query: 301 MINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMH 360
           MINEKDLFSWNSIISAHEQC DHDGTLR F KML SRVLPDVITITAVLPACS+ AALMH
Sbjct: 301 MINEKDLFSWNSIISAHEQCVDHDGTLRFFDKMLASRVLPDVITITAVLPACSYFAALMH 360

Query: 361 GREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMI 420
           GREIHGYM VNGLGKNE+GDDVLLNNA+MDMYAKCGC+KNA  +FD   NKDVASWNIMI
Sbjct: 361 GREIHGYMTVNGLGKNEDGDDVLLNNAVMDMYAKCGCLKNAHRVFDRTSNKDVASWNIMI 420

Query: 421 MGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480
           MGYA HGYG EALDMFH MCEAQIKPD +TFVGVLSACSHAGF+ QGRSFL RMELEFGV
Sbjct: 421 MGYATHGYGQEALDMFHHMCEAQIKPDAITFVGVLSACSHAGFLRQGRSFLARMELEFGV 480

Query: 481 IPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGE 540
           +PTIEHYTCIIDMLGRAGHLGEAY+LA+RIPL+DNL+LWMALLGACRLHGNA+LG VVGE
Sbjct: 481 VPTIEHYTCIIDMLGRAGHLGEAYELAERIPLQDNLVLWMALLGACRLHGNADLGKVVGE 540

Query: 541 KITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYV 600
           KI +LEPKHCGSGSY+LMSS+YGVVGRYEEAL+VRRTMKEQNVKKTPGCSWIELKDGLYV
Sbjct: 541 KIMRLEPKHCGSGSYVLMSSMYGVVGRYEEALQVRRTMKEQNVKKTPGCSWIELKDGLYV 600

Query: 601 FSMGDRTHHELNALINCLCGFGYFHDEVMHS 631
           FSMGDRTH ELNALI+CLCG GY HDEVMHS
Sbjct: 601 FSMGDRTHPELNALIHCLCGIGYLHDEVMHS 631

BLAST of CsGy5G012620 vs. NCBI nr
Match: XP_023551696.1 (pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023551697.1 pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1153 bits (2983), Expect = 0.0
Identity = 549/631 (87.00%), Postives = 587/631 (93.03%), Query Frame = 0

Query: 1   MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG 60
           MNQ+SKKT+LSKIIGKKHHLFS PF  LPSRLLVFQ+HYDVATCN  LQSYANHKNL++G
Sbjct: 1   MNQISKKTVLSKIIGKKHHLFSSPFLSLPSRLLVFQMHYDVATCNFFLQSYANHKNLSEG 60

Query: 61  RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA 120
           ++LHS+M+TSGF+HLPSSITSLINMYS+CN ME+AVLVF DPYHE NVFAYNAIIAGFVA
Sbjct: 61  KKLHSVMITSGFMHLPSSITSLINMYSKCNHMEQAVLVFHDPYHESNVFAYNAIIAGFVA 120

Query: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSA 180
           N L A GFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFK+GLEL++FV SA
Sbjct: 121 NRLPAHGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKLGLELDMFVSSA 180

Query: 181 LVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLS 240
           LVNTYLK D  EDA+KVF+ELPERDVVLWNAMINGY KIGHLNKAVVVFK+MGEEGIS  
Sbjct: 181 LVNTYLKFDLMEDAKKVFKELPERDVVLWNAMINGYAKIGHLNKAVVVFKKMGEEGISPC 240

Query: 241 RFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFE 300
           RFT T ILSI + MGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKH EDAL+IFE
Sbjct: 241 RFTITGILSIFSLMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHIEDALVIFE 300

Query: 301 MINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMH 360
           MINEKDLFSWNSIISAHEQC DHDGTLR F KML SRVLPDVITITAVLPACS+ AALMH
Sbjct: 301 MINEKDLFSWNSIISAHEQCVDHDGTLRFFDKMLASRVLPDVITITAVLPACSYFAALMH 360

Query: 361 GREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMI 420
           GREIHGYM VNGLGKNE+GDDVLLNNA+MDMYAKCGC+KNA  +FD   NKDVASWNIMI
Sbjct: 361 GREIHGYMTVNGLGKNEDGDDVLLNNAVMDMYAKCGCLKNAHRVFDRTSNKDVASWNIMI 420

Query: 421 MGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480
           MGYAMHGYG EALDMFH MCEAQIKPD +TFVGVLSACSHAGF+ QGRSFL RMELEFGV
Sbjct: 421 MGYAMHGYGQEALDMFHHMCEAQIKPDAITFVGVLSACSHAGFLRQGRSFLARMELEFGV 480

Query: 481 IPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGE 540
           +PTIEHYTCIIDMLGRAGHLGEAY+LA+RIPL+DNL+LWMALLGACRLHGNA+LG VVGE
Sbjct: 481 VPTIEHYTCIIDMLGRAGHLGEAYELAERIPLQDNLVLWMALLGACRLHGNADLGKVVGE 540

Query: 541 KITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYV 600
           KI +LEPKHCGSGSY+LMSS+YGVVGRYEEAL+VRRTMKEQNVKKTPGCSWIELKDGLYV
Sbjct: 541 KIMRLEPKHCGSGSYVLMSSMYGVVGRYEEALQVRRTMKEQNVKKTPGCSWIELKDGLYV 600

Query: 601 FSMGDRTHHELNALINCLCGFGYFHDEVMHS 631
           FSMGDRTH ELNALI+CLCG GY HDEVMHS
Sbjct: 601 FSMGDRTHPELNALIHCLCGIGYLHDEVMHS 631

BLAST of CsGy5G012620 vs. ExPASy TrEMBL
Match: A0A1S3CQH1 (pentatricopeptide repeat-containing protein At3g14730-like OS=Cucumis melo OX=3656 GN=LOC103503636 PE=4 SV=1)

HSP 1 Score: 1251 bits (3236), Expect = 0.0
Identity = 603/632 (95.41%), Postives = 616/632 (97.47%), Query Frame = 0

Query: 1   MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG 60
           MNQMSKKTI+SKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNL LQSYANHKNLTKG
Sbjct: 1   MNQMSKKTIISKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLFLQSYANHKNLTKG 60

Query: 61  RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA 120
           +QLHSLMVTSGFIHLPSSITSLINMYS+CNQMEEAVLVF DPY ERNVFAYNAIIAGFVA
Sbjct: 61  KQLHSLMVTSGFIHLPSSITSLINMYSKCNQMEEAVLVFHDPYRERNVFAYNAIIAGFVA 120

Query: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSA 180
           NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFME+RKIHGCLFKMGLELNVFVGSA
Sbjct: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEIRKIHGCLFKMGLELNVFVGSA 180

Query: 181 LVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLS 240
           LVNTYLKVDG EDAEKVFEELPERDVVLWNAMINGY KIGHLNKAV VFK+MGEEGISLS
Sbjct: 181 LVNTYLKVDGMEDAEKVFEELPERDVVLWNAMINGYIKIGHLNKAVAVFKKMGEEGISLS 240

Query: 241 RFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFE 300
           RFTTTSILS+ TSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKH +DAL+IFE
Sbjct: 241 RFTTTSILSVFTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHNKDALLIFE 300

Query: 301 MINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMH 360
           MINEKDLFSWNSIISAHEQC DHDGTLRLFGKML SRVLPDVITIT VLPACSHLAALMH
Sbjct: 301 MINEKDLFSWNSIISAHEQCGDHDGTLRLFGKMLASRVLPDVITITVVLPACSHLAALMH 360

Query: 361 GREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMI 420
           GREIHGYMIVNGLGKNEN DDVLLNNA+MDMYAKCGCMKNA IIFDLMRNKDVASWNIMI
Sbjct: 361 GREIHGYMIVNGLGKNENSDDVLLNNAVMDMYAKCGCMKNAGIIFDLMRNKDVASWNIMI 420

Query: 421 MGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480
           MGYAMHGYGTEALDMFHRMCEAQIKP+VVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV
Sbjct: 421 MGYAMHGYGTEALDMFHRMCEAQIKPNVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480

Query: 481 IPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGE 540
           IPTIEHYTCIIDMLGRAG LGEAYDLAQRIPL+DNLILWMALLGACRLHGNAELGNVVGE
Sbjct: 481 IPTIEHYTCIIDMLGRAGQLGEAYDLAQRIPLQDNLILWMALLGACRLHGNAELGNVVGE 540

Query: 541 KITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYV 600
           KI QLEPK+CGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELK+GLYV
Sbjct: 541 KIRQLEPKNCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKNGLYV 600

Query: 601 FSMGDRTHHELNALINCLCGFGYFHDEVMHSF 632
           FSMGDRTHHELNALINCLCGF YFHDEVMHSF
Sbjct: 601 FSMGDRTHHELNALINCLCGFQYFHDEVMHSF 632

BLAST of CsGy5G012620 vs. ExPASy TrEMBL
Match: A0A0A0KN02 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G129340 PE=4 SV=1)

HSP 1 Score: 1172 bits (3031), Expect = 0.0
Identity = 566/566 (100.00%), Postives = 566/566 (100.00%), Query Frame = 0

Query: 67  MVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVANGLAAD 126
           MVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVANGLAAD
Sbjct: 1   MVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVANGLAAD 60

Query: 127 GFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSALVNTYL 186
           GFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSALVNTYL
Sbjct: 61  GFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSALVNTYL 120

Query: 187 KVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLSRFTTTS 246
           KVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLSRFTTTS
Sbjct: 121 KVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLSRFTTTS 180

Query: 247 ILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFEMINEKD 306
           ILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFEMINEKD
Sbjct: 181 ILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFEMINEKD 240

Query: 307 LFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMHGREIHG 366
           LFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMHGREIHG
Sbjct: 241 LFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMHGREIHG 300

Query: 367 YMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMIMGYAMH 426
           YMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMIMGYAMH
Sbjct: 301 YMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMIMGYAMH 360

Query: 427 GYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEH 486
           GYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEH
Sbjct: 361 GYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEH 420

Query: 487 YTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGEKITQLE 546
           YTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGEKITQLE
Sbjct: 421 YTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGEKITQLE 480

Query: 547 PKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDR 606
           PKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDR
Sbjct: 481 PKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDR 540

Query: 607 THHELNALINCLCGFGYFHDEVMHSF 632
           THHELNALINCLCGFGYFHDEVMHSF
Sbjct: 541 THHELNALINCLCGFGYFHDEVMHSF 566

BLAST of CsGy5G012620 vs. ExPASy TrEMBL
Match: A0A6J1FNY8 (pentatricopeptide repeat-containing protein At3g14730-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445604 PE=4 SV=1)

HSP 1 Score: 1154 bits (2985), Expect = 0.0
Identity = 551/631 (87.32%), Postives = 586/631 (92.87%), Query Frame = 0

Query: 1   MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG 60
           MNQMSKKT+LSKIIGKKHHLFS PF  LPSRLLVFQ+HYDVATCN  LQSYANHKNL +G
Sbjct: 1   MNQMSKKTVLSKIIGKKHHLFSSPFLSLPSRLLVFQMHYDVATCNFFLQSYANHKNLPEG 60

Query: 61  RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA 120
           +QLHS+M+TSGF+HLPSSITSLINMYS+CNQME+AVLVF DPYHERNVFAYNAIIAGFVA
Sbjct: 61  KQLHSVMITSGFMHLPSSITSLINMYSKCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVA 120

Query: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSA 180
           N L A GFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFK+GLEL++FV SA
Sbjct: 121 NRLPAHGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKLGLELDMFVSSA 180

Query: 181 LVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLS 240
           LVNTYLK D  EDA+KVF+ELPERDVVLWNAMING  KIGHLNKAVVVFK+MGEEGIS  
Sbjct: 181 LVNTYLKFDLMEDAKKVFKELPERDVVLWNAMINGCAKIGHLNKAVVVFKKMGEEGISPC 240

Query: 241 RFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFE 300
           RFT T ILSI + MGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKH EDAL+IFE
Sbjct: 241 RFTITGILSIFSLMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHIEDALVIFE 300

Query: 301 MINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMH 360
           MINEKDLFSWNSIISAHEQC DHDGTLR F KML SRVLPDVITITAVLPACS+ AALMH
Sbjct: 301 MINEKDLFSWNSIISAHEQCVDHDGTLRFFDKMLASRVLPDVITITAVLPACSYFAALMH 360

Query: 361 GREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMI 420
           GREIHGYM VNGLGKNE+GDDVLLNNA+MDMYAKCGC+KNA  +FD   NKDVASWNIMI
Sbjct: 361 GREIHGYMTVNGLGKNEDGDDVLLNNAVMDMYAKCGCLKNAHRVFDRTSNKDVASWNIMI 420

Query: 421 MGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480
           MGYA HGYG EALDMFH MCEAQIKPD +TFVGVLSACSHAGF+ QGRSFL RMELEFGV
Sbjct: 421 MGYATHGYGQEALDMFHHMCEAQIKPDAITFVGVLSACSHAGFLRQGRSFLARMELEFGV 480

Query: 481 IPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGE 540
           +PTIEHYTCIIDMLGRAGHLGEAY+LA+RIPL+DNL+LWMALLGACRLHGNA+LG VVGE
Sbjct: 481 VPTIEHYTCIIDMLGRAGHLGEAYELAERIPLQDNLVLWMALLGACRLHGNADLGKVVGE 540

Query: 541 KITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYV 600
           KI +LEPKHCGSGSY+LMSS+YGVVGRYEEAL+VRRTMKEQNVKKTPGCSWIELKDGLYV
Sbjct: 541 KIMRLEPKHCGSGSYVLMSSMYGVVGRYEEALQVRRTMKEQNVKKTPGCSWIELKDGLYV 600

Query: 601 FSMGDRTHHELNALINCLCGFGYFHDEVMHS 631
           FSMGDRTH ELNALI+CLCG GY HDEVMHS
Sbjct: 601 FSMGDRTHPELNALIHCLCGIGYLHDEVMHS 631

BLAST of CsGy5G012620 vs. ExPASy TrEMBL
Match: A0A6J1JQW1 (pentatricopeptide repeat-containing protein At3g14730-like OS=Cucurbita maxima OX=3661 GN=LOC111489017 PE=4 SV=1)

HSP 1 Score: 1145 bits (2961), Expect = 0.0
Identity = 546/631 (86.53%), Postives = 586/631 (92.87%), Query Frame = 0

Query: 1   MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG 60
           MNQMSKKT+LSKIIGKKHHLFS PF  LPSRLLVFQ+HYDVATCN  LQSYANHKNL +G
Sbjct: 1   MNQMSKKTVLSKIIGKKHHLFSSPFLSLPSRLLVFQMHYDVATCNFLLQSYANHKNLPEG 60

Query: 61  RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA 120
           +QLHS+M+TSGF+HLPSSITSLINMYS+CNQME+AVLVF DPYHERNVFAYNAIIAGFVA
Sbjct: 61  KQLHSVMITSGFMHLPSSITSLINMYSKCNQMEQAVLVFHDPYHERNVFAYNAIIAGFVA 120

Query: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSA 180
           N L A+GFQFYKRMR+VGVMPDKFTFPCVVRACCEFMEVRKIHGCLFK+GLEL++FV SA
Sbjct: 121 NRLPANGFQFYKRMRAVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKLGLELDMFVSSA 180

Query: 181 LVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLS 240
           LVNTYLK D  E+A+KVFEELP RDVVLWNAMINGY +IGHLNKAVVVFK+MGEEGIS  
Sbjct: 181 LVNTYLKFDLMENAKKVFEELPVRDVVLWNAMINGYAQIGHLNKAVVVFKKMGEEGISPC 240

Query: 241 RFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFE 300
           RFT T ILSI + MGD+NNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKH EDAL+IFE
Sbjct: 241 RFTITGILSIFSLMGDVNNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHIEDALVIFE 300

Query: 301 MINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMH 360
           MINEKDLFSWNSIISAHEQC DHDGTLR F KML SRVLPDVITITAVLPACS+ AALMH
Sbjct: 301 MINEKDLFSWNSIISAHEQCVDHDGTLRFFDKMLASRVLPDVITITAVLPACSYFAALMH 360

Query: 361 GREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMI 420
           GREIHGYM VNGLGKNE+GDDVLLNNA+MDMYAKCGC+KNA  +FD   NKDVASWNIMI
Sbjct: 361 GREIHGYMTVNGLGKNEDGDDVLLNNAVMDMYAKCGCLKNAHRVFDRTSNKDVASWNIMI 420

Query: 421 MGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGV 480
           MGYAMHGYG EALDMFH M EAQIKPD +TFVGVLSACSHAGF+ QGRSFL RMELEFGV
Sbjct: 421 MGYAMHGYGQEALDMFHHMREAQIKPDAITFVGVLSACSHAGFLRQGRSFLARMELEFGV 480

Query: 481 IPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGE 540
           +PTIEHYTCIIDMLGRAGHLGEAY+LA+RIPL+DNL+LWMALLGACRLHGNA+LG VVGE
Sbjct: 481 VPTIEHYTCIIDMLGRAGHLGEAYELAERIPLQDNLVLWMALLGACRLHGNADLGKVVGE 540

Query: 541 KITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYV 600
           KI +LEPKHCGSGSY+LMSS+YGVVGRYEEAL+VRR MKEQNVKKTPGCSWIELKDGLYV
Sbjct: 541 KIMRLEPKHCGSGSYVLMSSMYGVVGRYEEALQVRRMMKEQNVKKTPGCSWIELKDGLYV 600

Query: 601 FSMGDRTHHELNALINCLCGFGYFHDEVMHS 631
           FSMGDRTH ELNALI+CLCG GY HDEVM+S
Sbjct: 601 FSMGDRTHPELNALIHCLCGIGYLHDEVMNS 631

BLAST of CsGy5G012620 vs. ExPASy TrEMBL
Match: A0A5A7U9S8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold180G00380 PE=4 SV=1)

HSP 1 Score: 1120 bits (2897), Expect = 0.0
Identity = 538/566 (95.05%), Postives = 550/566 (97.17%), Query Frame = 0

Query: 67  MVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVANGLAAD 126
           MVTSGFIHLPSSITSLINMYS+CNQMEEAVLVF DPY ERNVFAYNAIIAGFVANGLAAD
Sbjct: 1   MVTSGFIHLPSSITSLINMYSKCNQMEEAVLVFHDPYRERNVFAYNAIIAGFVANGLAAD 60

Query: 127 GFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRKIHGCLFKMGLELNVFVGSALVNTYL 186
           GFQFYKRMRSVGVMPDKFTFPCVVRACCEFME+RKIHGCLFKMGLELNVFVGSALVNTYL
Sbjct: 61  GFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEIRKIHGCLFKMGLELNVFVGSALVNTYL 120

Query: 187 KVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGISLSRFTTTS 246
           KVDG EDAEKVFEELPERDVVLWNAMINGY KIGHLNKAV VFK+MGEEGISLSRFTTTS
Sbjct: 121 KVDGMEDAEKVFEELPERDVVLWNAMINGYIKIGHLNKAVAVFKKMGEEGISLSRFTTTS 180

Query: 247 ILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALMIFEMINEKD 306
           ILS+ TSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKH +DAL+IFEMINEKD
Sbjct: 181 ILSVFTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHNKDALLIFEMINEKD 240

Query: 307 LFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMHGREIHG 366
           LFSWNSIISAHEQC DHDGTLRLFGKML SRVLPDVITIT VLPACSHLAALMHGREIHG
Sbjct: 241 LFSWNSIISAHEQCGDHDGTLRLFGKMLASRVLPDVITITVVLPACSHLAALMHGREIHG 300

Query: 367 YMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMIMGYAMH 426
           YMIVNGLGKNEN DDVLLNNA+MDMYAKCGCMKNA IIFDLMRNKDVASWNIMIMGYAMH
Sbjct: 301 YMIVNGLGKNENSDDVLLNNAVMDMYAKCGCMKNAGIIFDLMRNKDVASWNIMIMGYAMH 360

Query: 427 GYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEH 486
           GYGTEAL+MFHRMCEAQIKP+VVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEH
Sbjct: 361 GYGTEALEMFHRMCEAQIKPNVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEH 420

Query: 487 YTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGEKITQLE 546
           YTCIIDMLGRAG LGEAYDLAQRIPL+DNLILWMALLGACRLHGNAE GNVVGEKI QLE
Sbjct: 421 YTCIIDMLGRAGQLGEAYDLAQRIPLQDNLILWMALLGACRLHGNAEFGNVVGEKIRQLE 480

Query: 547 PKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDR 606
           PK+CGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELK+GLYVFSMGDR
Sbjct: 481 PKNCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKNGLYVFSMGDR 540

Query: 607 THHELNALINCLCGFGYFHDEVMHSF 632
           THHELNALINCLCGF YFHDEVMHSF
Sbjct: 541 THHELNALINCLCGFQYFHDEVMHSF 566

BLAST of CsGy5G012620 vs. TAIR 10
Match: AT3G14730.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 639.0 bits (1647), Expect = 3.9e-183
Identity = 301/581 (51.81%), Postives = 416/581 (71.60%), Query Frame = 0

Query: 38  HYDVATCNLSLQSYANHKNLTKGRQLHSLMVTSGFI-HLPSSITSLINMYSRCNQMEEAV 97
           H++VATC  +LQ  A  K+   G+Q+H  MV  GF+   P + TSL+NMY++C  M  AV
Sbjct: 57  HHNVATCIATLQRCAQRKDYVSGQQIHGFMVRKGFLDDSPRAGTSLVNMYAKCGLMRRAV 116

Query: 98  LVFRDPYHERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVGVMPDKFTFPCVVRA--CC 157
           LVF     ER+VF YNA+I+GFV NG   D  + Y+ MR+ G++PDK+TFP +++     
Sbjct: 117 LVFGG--SERDVFGYNALISGFVVNGSPLDAMETYREMRANGILPDKYTFPSLLKGSDAM 176

Query: 158 EFMEVRKIHGCLFKMGLELNVFVGSALVNTYLKVDGTEDAEKVFEELPER-DVVLWNAMI 217
           E  +V+K+HG  FK+G + + +VGS LV +Y K    EDA+KVF+ELP+R D VLWNA++
Sbjct: 177 ELSDVKKVHGLAFKLGFDSDCYVGSGLVTSYSKFMSVEDAQKVFDELPDRDDSVLWNALV 236

Query: 218 NGYTKIGHLNKAVVVFKRMGEEGISLSRFTTTSILSILTSMGDINNGRAIHGIVTKMGYS 277
           NGY++I     A++VF +M EEG+ +SR T TS+LS  T  GDI+NGR+IHG+  K G  
Sbjct: 237 NGYSQIFRFEDALLVFSKMREEGVGVSRHTITSVLSAFTVSGDIDNGRSIHGLAVKTGSG 296

Query: 278 SCVAVSNALIDMYGKCKHTEDALMIFEMINEKDLFSWNSIISAHEQCDDHDGTLRLFGKM 337
           S + VSNALIDMYGK K  E+A  IFE ++E+DLF+WNS++  H+ C DHDGTL LF +M
Sbjct: 297 SDIVVSNALIDMYGKSKWLEEANSIFEAMDERDLFTWNSVLCVHDYCGDHDGTLALFERM 356

Query: 338 LGSRVLPDVITITAVLPACSHLAALMHGREIHGYMIVNGLGKNENGDDVLLNNAIMDMYA 397
           L S + PD++T+T VLP C  LA+L  GREIHGYMIV+GL  N    +  ++N++MDMY 
Sbjct: 357 LCSGIRPDIVTLTTVLPTCGRLASLRQGREIHGYMIVSGL-LNRKSSNEFIHNSLMDMYV 416

Query: 398 KCGCMKNADIIFDLMRNKDVASWNIMIMGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVG 457
           KCG +++A ++FD MR KD ASWNIMI GY +   G  ALDMF  MC A +KPD +TFVG
Sbjct: 417 KCGDLRDARMVFDSMRVKDSASWNIMINGYGVQSCGELALDMFSCMCRAGVKPDEITFVG 476

Query: 458 VLSACSHAGFVHQGRSFLTRMELEFGVIPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLE 517
           +L ACSH+GF+++GR+FL +ME  + ++PT +HY C+IDMLGRA  L EAY+LA   P+ 
Sbjct: 477 LLQACSHSGFLNEGRNFLAQMETVYNILPTSDHYACVIDMLGRADKLEEAYELAISKPIC 536

Query: 518 DNLILWMALLGACRLHGNAELGNVVGEKITQLEPKHCGSGSYILMSSLYGVVGRYEEALE 577
           DN ++W ++L +CRLHGN +L  V G+++ +LEP+HC  G Y+LMS++Y   G+YEE L+
Sbjct: 537 DNPVVWRSILSSCRLHGNKDLALVAGKRLHELEPEHC--GGYVLMSNVYVEAGKYEEVLD 596

Query: 578 VRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDRTHHELNAL 615
           VR  M++QNVKKTPGCSWI LK+G++ F  G++TH E  ++
Sbjct: 597 VRDAMRQQNVKKTPGCSWIVLKNGVHTFFTGNQTHPEFKSI 632

BLAST of CsGy5G012620 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 389.4 bits (999), Expect = 5.4e-108
Identity = 212/614 (34.53%), Postives = 348/614 (56.68%), Query Frame = 0

Query: 1   MNQMSKKTILSKIIGKKHHLFSYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKG 60
           MN+++K    S  IG            L  +++   +  D  T +   +S+++ +++  G
Sbjct: 167 MNELAKSGDFSGSIG------------LFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGG 226

Query: 61  RQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVA 120
            QLH  ++ SGF    S   SL+  Y +  +++ A  VF D   ER+V ++N+II G+V+
Sbjct: 227 EQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVF-DEMTERDVISWNSIINGYVS 286

Query: 121 NGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEV---RKIHGCLFKMGLELNVFV 180
           NGLA  G   + +M   G+  D  T   V   C +   +   R +H    K         
Sbjct: 287 NGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRF 346

Query: 181 GSALVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGHLNKAVVVFKRMGEEGI 240
            + L++ Y K    + A+ VF E+ +R VV + +MI GY + G   +AV +F+ M EEGI
Sbjct: 347 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 406

Query: 241 SLSRFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALIDMYGKCKHTEDALM 300
           S   +T T++L+       ++ G+ +H  + +      + VSNAL+DMY KC   ++A +
Sbjct: 407 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 466

Query: 301 IFEMINEKDLFSWNSIISAHEQCDDHDGTLRLFGKML-GSRVLPDVITITAVLPACSHLA 360
           +F  +  KD+ SWN+II  + +    +  L LF  +L   R  PD  T+  VLPAC+ L+
Sbjct: 467 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 526

Query: 361 ALMHGREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASW 420
           A   GREIHGY++ NG   + +     + N+++DMYAKCG +  A ++FD + +KD+ SW
Sbjct: 527 AFDKGREIHGYIMRNGYFSDRH-----VANSLVDMYAKCGALLLAHMLFDDIASKDLVSW 586

Query: 421 NIMIMGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMEL 480
            +MI GY MHG+G EA+ +F++M +A I+ D ++FV +L ACSH+G V +G  F   M  
Sbjct: 587 TVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRH 646

Query: 481 EFGVIPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGN 540
           E  + PT+EHY CI+DML R G L +AY   + +P+  +  +W ALL  CR+H + +L  
Sbjct: 647 ECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAE 706

Query: 541 VVGEKITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKD 600
            V EK+ +LEP++  +G Y+LM+++Y    ++E+   +R+ + ++ ++K PGCSWIE+K 
Sbjct: 707 KVAEKVFELEPEN--TGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWIEIKG 760

Query: 601 GLYVFSMGDRTHHE 611
            + +F  GD ++ E
Sbjct: 767 RVNIFVAGDSSNPE 760

BLAST of CsGy5G012620 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 387.5 bits (994), Expect = 2.1e-107
Identity = 202/614 (32.90%), Postives = 353/614 (57.49%), Query Frame = 0

Query: 45  NLSLQSYANHKNLTKGRQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEAVLVFRDPYH 104
           N  + +Y+   +L  GRQ+   M         S +T L    ++   ++EA  +FR    
Sbjct: 59  NRLIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGL----TKLGFLDEADSLFRS-MP 118

Query: 105 ERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCEFMEVRK--- 164
           ER+   +N++++GF  +    +   ++  M   G + ++++F  V+ AC    ++ K   
Sbjct: 119 ERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQ 178

Query: 165 IHGCLFKMGLELNVFVGSALVNTYLKVDGTEDAEKVFEELPERDVVLWNAMINGYTKIGH 224
           +H  + K     +V++GSALV+ Y K     DA++VF+E+ +R+VV WN++I  + + G 
Sbjct: 179 VHSLIAKSPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGP 238

Query: 225 LNKAVVVFKRMGEEGISLSRFTTTSILSILTSMGDINNGRAIHGIVTKMG-YSSCVAVSN 284
             +A+ VF+ M E  +     T  S++S   S+  I  G+ +HG V K     + + +SN
Sbjct: 239 AVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSN 298

Query: 285 ALIDMYGKCKHTEDALMIFE------------MIN-------------------EKDLFS 344
           A +DMY KC   ++A  IF+            MI+                   E+++ S
Sbjct: 299 AFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVS 358

Query: 345 WNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVITITAVLPACSHLAALMHGREIHGYMI 404
           WN++I+ + Q  +++  L LF  +    V P   +   +L AC+ LA L  G + H +++
Sbjct: 359 WNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVL 418

Query: 405 VNGLGKNENG--DDVLLNNAIMDMYAKCGCMKNADIIFDLMRNKDVASWNIMIMGYAMHG 464
            +G  K ++G  DD+ + N+++DMY KCGC++   ++F  M  +D  SWN MI+G+A +G
Sbjct: 419 KHGF-KFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNG 478

Query: 465 YGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEHY 524
           YG EAL++F  M E+  KPD +T +GVLSAC HAGFV +GR + + M  +FGV P  +HY
Sbjct: 479 YGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHY 538

Query: 525 TCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLGACRLHGNAELGNVVGEKITQLEP 584
           TC++D+LGRAG L EA  + + +P++ + ++W +LL AC++H N  LG  V EK+ ++EP
Sbjct: 539 TCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEP 598

Query: 585 KHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDRT 619
            +  SG Y+L+S++Y  +G++E+ + VR++M+++ V K PGCSWI+++   +VF + D++
Sbjct: 599 SN--SGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKS 658

BLAST of CsGy5G012620 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 386.7 bits (992), Expect = 3.5e-107
Identity = 210/624 (33.65%), Postives = 344/624 (55.13%), Query Frame = 0

Query: 22  SYPFFHLPSRLLVFQIHYDVATCNLSLQSYANHKNLTKGRQLHSLMVTSGFIHLPSSITS 81
           SYPF  LPS        YD    + SL    N K L   R +H+ M+  G  +   +++ 
Sbjct: 14  SYPFHFLPSS---SDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSK 73

Query: 82  LIN---MYSRCNQMEEAVLVFRDPYHERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVG 141
           LI    +      +  A+ VF+    E N+  +N +  G   +       + Y  M S+G
Sbjct: 74  LIEFCILSPHFEGLPYAISVFK-TIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLG 133

Query: 142 VMPDKFTFPCVVRACCE---FMEVRKIHGCLFKMGLELNVFVGSALVNTYLKVDGTEDAE 201
           ++P+ +TFP V+++C +   F E ++IHG + K+G +L+++V ++L++ Y++    EDA 
Sbjct: 134 LLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAH 193

Query: 202 KV-------------------------------FEELPERDVVLWNAMINGYTKIGHLNK 261
           KV                               F+E+P +DVV WNAMI+GY + G+  +
Sbjct: 194 KVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKE 253

Query: 262 AVVVFKRMGEEGISLSRFTTTSILSILTSMGDINNGRAIHGIVTKMGYSSCVAVSNALID 321
           A+ +FK M +  +     T  +++S     G I  GR +H  +   G+ S + + NALID
Sbjct: 254 ALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALID 313

Query: 322 MYGKCKHTEDALMIFEMINEKDLFSWNSIISAHEQCDDHDGTLRLFGKMLGSRVLPDVIT 381
           +Y KC   E A  +FE +  KD+ SWN++I  +   + +   L LF +ML S   P+ +T
Sbjct: 314 LYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVT 373

Query: 382 ITAVLPACSHLAALMHGREIHGYMIVNGLGKNENGDDVLLNNAIMDMYAKCGCMKNADII 441
           + ++LPAC+HL A+  GR IH Y+     G         L  +++DMYAKCG ++ A  +
Sbjct: 374 MLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASS---LRTSLIDMYAKCGDIEAAHQV 433

Query: 442 FDLMRNKDVASWNIMIMGYAMHGYGTEALDMFHRMCEAQIKPDVVTFVGVLSACSHAGFV 501
           F+ + +K ++SWN MI G+AMHG    + D+F RM +  I+PD +TFVG+LSACSH+G +
Sbjct: 434 FNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGML 493

Query: 502 HQGRSFLTRMELEFGVIPTIEHYTCIIDMLGRAGHLGEAYDLAQRIPLEDNLILWMALLG 561
             GR     M  ++ + P +EHY C+ID+LG +G   EA ++   + +E + ++W +LL 
Sbjct: 494 DLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLK 553

Query: 562 ACRLHGNAELGNVVGEKITQLEPKHCGSGSYILMSSLYGVVGRYEEALEVRRTMKEQNVK 609
           AC++HGN ELG    E + ++EP++   GSY+L+S++Y   GR+ E  + R  + ++ +K
Sbjct: 554 ACKMHGNVELGESFAENLIKIEPEN--PGSYVLLSNIYASAGRWNEVAKTRALLNDKGMK 613

BLAST of CsGy5G012620 vs. TAIR 10
Match: AT4G21300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 384.4 bits (986), Expect = 1.7e-106
Identity = 207/592 (34.97%), Postives = 335/592 (56.59%), Query Frame = 0

Query: 36  QIHYDVATCNLSLQSYANHKNLTKGRQLHSLMVTSGFIHLPSSITSLINMYSRCNQMEEA 95
           QI  +  T +  L   A+   +  G QLH L+V SG     S   SL++MYS+C + ++A
Sbjct: 234 QISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKCGRFDDA 293

Query: 96  VLVFRDPYHERNVFAYNAIIAGFVANGLAADGFQFYKRMRSVGVMPDKFTFPCVVRACCE 155
             +FR      +   +N +I+G+V +GL  +   F+  M S GV+PD  TF  ++ +  +
Sbjct: 294 SKLFR-MMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGVLPDAITFSSLLPSVSK 353

Query: 156 FMEV---RKIHGCLFKMGLELNVFVGSALVNTYLKVDGTEDAEKVFEELPERDVVLWNAM 215
           F  +   ++IH  + +  + L++F+ SAL++ Y K  G   A+ +F +    DVV++ AM
Sbjct: 354 FENLEYCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVSMAQNIFSQCNSVDVVVFTAM 413

Query: 216 INGYTKIGHLNKAVVVFKRMGEEGISLSRFTTTSILSILTSMGDINNGRAIHGIVTKMGY 275
           I+GY   G    ++ +F+ + +  IS +  T  SIL ++  +  +  GR +HG + K G+
Sbjct: 414 ISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFIIKKGF 473

Query: 276 SSCVAVSNALIDMYGKCKHTEDALMIFEMINEKDLFSWNSIISAHEQCDDHDGTLRLFGK 335
            +   +  A+IDMY KC     A  IFE ++++D+ SWNS+I+   Q D+    + +F +
Sbjct: 474 DNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAIDIFRQ 533

Query: 336 MLGSRVLPDVITITAVLPACSHLAALMHGREIHGYMIVNGLGKNENGDDVLLNNAIMDMY 395
           M  S +  D ++I+A L AC++L +   G+ IHG+MI     K+    DV   + ++DMY
Sbjct: 534 MGVSGICYDCVSISAALSACANLPSESFGKAIHGFMI-----KHSLASDVYSESTLIDMY 593

Query: 396 AKCGCMKNADIIFDLMRNKDVASWNIMIMGYAMHGYGTEALDMFHRMCE-AQIKPDVVTF 455
           AKCG +K A  +F  M+ K++ SWN +I     HG   ++L +FH M E + I+PD +TF
Sbjct: 594 AKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEMVEKSGIRPDQITF 653

Query: 456 VGVLSACSHAGFVHQGRSFLTRMELEFGVIPTIEHYTCIIDMLGRAGHLGEAYDLAQRIP 515
           + ++S+C H G V +G  F   M  ++G+ P  EHY C++D+ GRAG L EAY+  + +P
Sbjct: 654 LEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLFGRAGRLTEAYETVKSMP 713

Query: 516 LEDNLILWMALLGACRLHGNAELGNVVGEKITQLEPKHCGSGSYILMSSLYGVVGRYEEA 575
              +  +W  LLGACRLH N EL  V   K+  L+P +  SG Y+L+S+ +     +E  
Sbjct: 714 FPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSN--SGYYVLISNAHANAREWESV 773

Query: 576 LEVRRTMKEQNVKKTPGCSWIELKDGLYVFSMGDRTHHE---LNALINCLCG 621
            +VR  MKE+ V+K PG SWIE+    ++F  GD  H E   + +L+N L G
Sbjct: 774 TKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYSLLNSLLG 817

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LUC25.5e-18251.81Pentatricopeptide repeat-containing protein At3g14730 OS=Arabidopsis thaliana OX... [more]
Q9SN397.6e-10734.53Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Q9SIT72.9e-10632.90Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX... [more]
Q9LN014.9e-10633.65Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9STE12.5e-10534.97Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_004149501.20.0100.00pentatricopeptide repeat-containing protein At3g14730 [Cucumis sativus] >XP_0317... [more]
XP_008466127.10.095.41PREDICTED: pentatricopeptide repeat-containing protein At3g14730-like [Cucumis m... [more]
XP_038877905.10.092.41pentatricopeptide repeat-containing protein At3g14730-like [Benincasa hispida] >... [more]
XP_022939865.10.087.32pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita... [more]
XP_023551696.10.087.00pentatricopeptide repeat-containing protein At3g14730-like isoform X1 [Cucurbita... [more]
Match NameE-valueIdentityDescription
A0A1S3CQH10.095.41pentatricopeptide repeat-containing protein At3g14730-like OS=Cucumis melo OX=36... [more]
A0A0A0KN020.0100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G129340 PE=4 SV=1[more]
A0A6J1FNY80.087.32pentatricopeptide repeat-containing protein At3g14730-like isoform X1 OS=Cucurbi... [more]
A0A6J1JQW10.086.53pentatricopeptide repeat-containing protein At3g14730-like OS=Cucurbita maxima O... [more]
A0A5A7U9S80.095.05Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT3G14730.13.9e-18351.81Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G18750.15.4e-10834.53Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G13600.12.1e-10732.90Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G08070.13.5e-10733.65Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21300.11.7e-10634.97Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 280..305
e-value: 0.013
score: 15.7
coord: 486..506
e-value: 0.73
score: 10.2
coord: 308..336
e-value: 0.01
score: 16.0
coord: 554..582
e-value: 0.074
score: 13.3
coord: 80..100
e-value: 0.016
score: 15.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 110..142
e-value: 1.1E-4
score: 20.2
coord: 207..239
e-value: 3.0E-7
score: 28.2
coord: 415..448
e-value: 1.6E-7
score: 29.0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 204..249
e-value: 5.4E-10
score: 39.3
coord: 411..459
e-value: 1.5E-10
score: 41.1
coord: 106..154
e-value: 1.7E-10
score: 41.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 412..446
score: 11.092894
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 306..340
score: 9.602157
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 107..141
score: 10.939435
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 205..239
score: 11.980759
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 160..258
e-value: 6.4E-19
score: 70.0
coord: 38..159
e-value: 1.2E-20
score: 75.6
coord: 259..359
e-value: 1.4E-18
score: 68.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 360..552
e-value: 4.0E-33
score: 117.1
NoneNo IPR availablePANTHERPTHR47928:SF61OS01G0818200 PROTEINcoord: 37..609
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 37..609

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy5G012620.2CsGy5G012620.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding