CsGy5G012250 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy5G012250
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPentatricopeptide repeat-containing protein
LocationGy14Chr5: 14542176 .. 14544334 (-)
RNA-Seq ExpressionCsGy5G012250
SyntenyCsGy5G012250
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAAATGCTTACATGGAAAATCCCTTCACCCTTCTGTCCAAAAGCTCCCCACTTGGCACCAATTCAGTGGCTTACAAGATCTCGAACCCAAACTTCCTTACCGATAAGCTATATATTATGAGTGGCACTCCGATCTTCTTCCGACCTTATCTCCCATGGCCCTCTCTCCCTCTCCCGACTGCTCATTCCCTCCATCAAACTCTTTCAGAAAATCCCATTTCATCTCCACCTCTAACTTCTCTCTCCTTTTCTCTCTTCCCACTTCAAATCTTCCATCCCTTCATCTAAATTCCTCCGGTTGCCCTTCCCCAATCTTAGAACAACCCTCCATCGCCTTACCCGACATCCATTCAAACTCCAATCTTCACGATTTTCAACTTCCCTCCTTGCCCAACGTTCAAGATTTGAACGATTTCTTATGTGGGTTGTCGCAAAACCCCGGAACCGAGGATTTGATCTATGACTATTATGTGAAAGCGAAGGAGACGGCTGGGTTTCGACCTCAGAAATCGACATTGCGGCATCTGATCAGGTACTTAGTTCGATTGAAGAAATGGGATTTGATTCTTTTAGTTTCTAGGGACTTTGTGGATTTTGGTGTTTGCCCTGATAGAGATACTTGTTCTAAATTGGTTAGTAGTTGTGTTAGAGGTAGAAAATTTAAAGTTGTCAAGTCTTTGCTTGAGGTTTTTGAAAGGAATAGTGGTGTTGCTATGACTGCTTTTGAAGCTGCCATGAGAGGCTACAATAAGCTTCACATGCACAAAAGCACTATCATGGTTTTCCAGCGCTTGAAATCTGCTAGAATCGAAGCAGATTCTGGATGTTATTGTAGGGTCATGGAAGCCTATCTTAAACTTGGGGATTCTGAGAGAGTTATGGAACTGTTTAATGAAGTGGAGAGTAGGATTTCGGATTCTACGCCATTTTCGACCAAGATTTACGGGATACTTTGCGAGTCCTTAGCAAAGTCGGGGAGGGTTTTCGAGTCGCTTGAGTTTTTTAGAGATATGAGGAAGAAAGGGATCGCAGAAGACTACACTATTTACTCTGCTTTGATATGTACTTTTGCTAGCATCCAGGAAGTTAAATTAGCTGAAGATCTTTACAATGAAGCAAAAGCCAAGAAGTTGTTGAGGGACCCTGCGATGTTTCTAAAGCTCATATTGATGTACGTTCAACAAGGATCATTAGAAAAGGCACTTGAGATTGTTGAAGTAATGAAAGACTTTAAAATTGGAGTCTCTGACTGTATTTTCTGTGCAATTGTCAATGGTTATGCCACGAGAAGGGGCTATGAAGCTGCAGTTAAAGTTTACGAGAAGTTGATCGAAGACGGATGTGAGCCAGGACAAGTGACGTACGCCTCAGCAATCAATGCCTACTGCCGTGTCGGACTCTACTCGAAAGCAGAGGACATTTTTGGAGAAATGGAGGAGAAAGGGTTTGATAAATGTGTGGTAGCTTACTCTAGCTTGATATCAATGTATGGAAAAACAGGGAGATTAAAGGATGCAATGAGGCTATTAGCAAAGATGAAAGAAAAAGGGTGTCAGCCAAATGTTTGGATATACAACATATTGATGGAGATGCATGGGAAGGCTAAGAATTTGAAGCAAGTTGAGAAGTTATGGAAGGAAATGAAGCGCAAAAAGATAGCACCTGATAAGGTTAGCTATACAAGTATCATAAGTGCCTATGTCAAGGCATCAGAATTCGAAAAGTGCGAGCAATATTACCGGGAGTTTCGGATGAACGGGGGCACCATTGACAAGGCATTTGGGGGGATCATGGTTGGGGTGTTCTCAAAGACGAGTCGGGTTGATGAGCTGGTGAAGCTTCTCAGAGACATGAAGTTAGAAGGAACAAGGCTGGATGAGAGGCTTTATAGGACAGCATTGAATGCTCTGATGGATGCTGGGTTGCAAGTGCAAGCAAAATGGTTGCAAGATCATTATGCTGGAAAATCAGGCTTTGTTTAAACTGTTTTTCATTGGTTTGGGATAAAGCCTTGGAGTAAAAAAAATTTCTTGATGCCCAAGAGATTTACTTATTTAATTTGTAAGTAGTTTTTCTTTTTCTCTATAAGTTTTCGATTTGTTCTCCTAGTTTTGAGTTTAATTTTACTTTGATTTTCATCGTTGTATATTTGGACCTCAC

mRNA sequence

CAAATGCTTACATGGAAAATCCCTTCACCCTTCTGTCCAAAAGCTCCCCACTTGGCACCAATTCAGTGGCTTACAAGATCTCGAACCCAAACTTCCTTACCGATAAGCTATATATTATGAGTGGCACTCCGATCTTCTTCCGACCTTATCTCCCATGGCCCTCTCTCCCTCTCCCGACTGCTCATTCCCTCCATCAAACTCTTTCAGAAAATCCCATTTCATCTCCACCTCTAACTTCTCTCTCCTTTTCTCTCTTCCCACTTCAAATCTTCCATCCCTTCATCTAAATTCCTCCGGTTGCCCTTCCCCAATCTTAGAACAACCCTCCATCGCCTTACCCGACATCCATTCAAACTCCAATCTTCACGATTTTCAACTTCCCTCCTTGCCCAACGTTCAAGATTTGAACGATTTCTTATGTGGGTTGTCGCAAAACCCCGGAACCGAGGATTTGATCTATGACTATTATGTGAAAGCGAAGGAGACGGCTGGGTTTCGACCTCAGAAATCGACATTGCGGCATCTGATCAGGTACTTAGTTCGATTGAAGAAATGGGATTTGATTCTTTTAGTTTCTAGGGACTTTGTGGATTTTGGTGTTTGCCCTGATAGAGATACTTGTTCTAAATTGGTTAGTAGTTGTGTTAGAGGTAGAAAATTTAAAGTTGTCAAGTCTTTGCTTGAGGTTTTTGAAAGGAATAGTGGTGTTGCTATGACTGCTTTTGAAGCTGCCATGAGAGGCTACAATAAGCTTCACATGCACAAAAGCACTATCATGGTTTTCCAGCGCTTGAAATCTGCTAGAATCGAAGCAGATTCTGGATGTTATTGTAGGGTCATGGAAGCCTATCTTAAACTTGGGGATTCTGAGAGAGTTATGGAACTGTTTAATGAAGTGGAGAGTAGGATTTCGGATTCTACGCCATTTTCGACCAAGATTTACGGGATACTTTGCGAGTCCTTAGCAAAGTCGGGGAGGGTTTTCGAGTCGCTTGAGTTTTTTAGAGATATGAGGAAGAAAGGGATCGCAGAAGACTACACTATTTACTCTGCTTTGATATGTACTTTTGCTAGCATCCAGGAAGTTAAATTAGCTGAAGATCTTTACAATGAAGCAAAAGCCAAGAAGTTGTTGAGGGACCCTGCGATGTTTCTAAAGCTCATATTGATGTACGTTCAACAAGGATCATTAGAAAAGGCACTTGAGATTGTTGAAGTAATGAAAGACTTTAAAATTGGAGTCTCTGACTGTATTTTCTGTGCAATTGTCAATGGTTATGCCACGAGAAGGGGCTATGAAGCTGCAGTTAAAGTTTACGAGAAGTTGATCGAAGACGGATGTGAGCCAGGACAAGTGACGTACGCCTCAGCAATCAATGCCTACTGCCGTGTCGGACTCTACTCGAAAGCAGAGGACATTTTTGGAGAAATGGAGGAGAAAGGGTTTGATAAATGTGTGGTAGCTTACTCTAGCTTGATATCAATGTATGGAAAAACAGGGAGATTAAAGGATGCAATGAGGCTATTAGCAAAGATGAAAGAAAAAGGGTGTCAGCCAAATGTTTGGATATACAACATATTGATGGAGATGCATGGGAAGGCTAAGAATTTGAAGCAAGTTGAGAAGTTATGGAAGGAAATGAAGCGCAAAAAGATAGCACCTGATAAGGTTAGCTATACAAGTATCATAAGTGCCTATGTCAAGGCATCAGAATTCGAAAAGTGCGAGCAATATTACCGGGAGTTTCGGATGAACGGGGGCACCATTGACAAGGCATTTGGGGGGATCATGGTTGGGGTGTTCTCAAAGACGAGTCGGGTTGATGAGCTGGTGAAGCTTCTCAGAGACATGAAGTTAGAAGGAACAAGGCTGGATGAGAGGCTTTATAGGACAGCATTGAATGCTCTGATGGATGCTGGGTTGCAAGTGCAAGCAAAATGGTTGCAAGATCATTATGCTGGAAAATCAGGCTTTGTTTAAACTGTTTTTCATTGGTTTGGGATAAAGCCTTGGAGTAAAAAAAATTTCTTGATGCCCAAGAGATTTACTTATTTAATTTGTAAGTAGTTTTTCTTTTTCTCTATAAGTTTTCGATTTGTTCTCCTAGTTTTGAGTTTAATTTTACTTTGATTTTCATCGTTGTATATTTGGACCTCAC

Coding sequence (CDS)

ATGGCCCTCTCTCCCTCTCCCGACTGCTCATTCCCTCCATCAAACTCTTTCAGAAAATCCCATTTCATCTCCACCTCTAACTTCTCTCTCCTTTTCTCTCTTCCCACTTCAAATCTTCCATCCCTTCATCTAAATTCCTCCGGTTGCCCTTCCCCAATCTTAGAACAACCCTCCATCGCCTTACCCGACATCCATTCAAACTCCAATCTTCACGATTTTCAACTTCCCTCCTTGCCCAACGTTCAAGATTTGAACGATTTCTTATGTGGGTTGTCGCAAAACCCCGGAACCGAGGATTTGATCTATGACTATTATGTGAAAGCGAAGGAGACGGCTGGGTTTCGACCTCAGAAATCGACATTGCGGCATCTGATCAGGTACTTAGTTCGATTGAAGAAATGGGATTTGATTCTTTTAGTTTCTAGGGACTTTGTGGATTTTGGTGTTTGCCCTGATAGAGATACTTGTTCTAAATTGGTTAGTAGTTGTGTTAGAGGTAGAAAATTTAAAGTTGTCAAGTCTTTGCTTGAGGTTTTTGAAAGGAATAGTGGTGTTGCTATGACTGCTTTTGAAGCTGCCATGAGAGGCTACAATAAGCTTCACATGCACAAAAGCACTATCATGGTTTTCCAGCGCTTGAAATCTGCTAGAATCGAAGCAGATTCTGGATGTTATTGTAGGGTCATGGAAGCCTATCTTAAACTTGGGGATTCTGAGAGAGTTATGGAACTGTTTAATGAAGTGGAGAGTAGGATTTCGGATTCTACGCCATTTTCGACCAAGATTTACGGGATACTTTGCGAGTCCTTAGCAAAGTCGGGGAGGGTTTTCGAGTCGCTTGAGTTTTTTAGAGATATGAGGAAGAAAGGGATCGCAGAAGACTACACTATTTACTCTGCTTTGATATGTACTTTTGCTAGCATCCAGGAAGTTAAATTAGCTGAAGATCTTTACAATGAAGCAAAAGCCAAGAAGTTGTTGAGGGACCCTGCGATGTTTCTAAAGCTCATATTGATGTACGTTCAACAAGGATCATTAGAAAAGGCACTTGAGATTGTTGAAGTAATGAAAGACTTTAAAATTGGAGTCTCTGACTGTATTTTCTGTGCAATTGTCAATGGTTATGCCACGAGAAGGGGCTATGAAGCTGCAGTTAAAGTTTACGAGAAGTTGATCGAAGACGGATGTGAGCCAGGACAAGTGACGTACGCCTCAGCAATCAATGCCTACTGCCGTGTCGGACTCTACTCGAAAGCAGAGGACATTTTTGGAGAAATGGAGGAGAAAGGGTTTGATAAATGTGTGGTAGCTTACTCTAGCTTGATATCAATGTATGGAAAAACAGGGAGATTAAAGGATGCAATGAGGCTATTAGCAAAGATGAAAGAAAAAGGGTGTCAGCCAAATGTTTGGATATACAACATATTGATGGAGATGCATGGGAAGGCTAAGAATTTGAAGCAAGTTGAGAAGTTATGGAAGGAAATGAAGCGCAAAAAGATAGCACCTGATAAGGTTAGCTATACAAGTATCATAAGTGCCTATGTCAAGGCATCAGAATTCGAAAAGTGCGAGCAATATTACCGGGAGTTTCGGATGAACGGGGGCACCATTGACAAGGCATTTGGGGGGATCATGGTTGGGGTGTTCTCAAAGACGAGTCGGGTTGATGAGCTGGTGAAGCTTCTCAGAGACATGAAGTTAGAAGGAACAAGGCTGGATGAGAGGCTTTATAGGACAGCATTGAATGCTCTGATGGATGCTGGGTTGCAAGTGCAAGCAAAATGGTTGCAAGATCATTATGCTGGAAAATCAGGCTTTGTTTAA

Protein sequence

MALSPSPDCSFPPSNSFRKSHFISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQPSIALPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKSTLRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFERNSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDHYAGKSGFV*
Homology
BLAST of CsGy5G012250 vs. ExPASy Swiss-Prot
Match: Q66GP4 (Pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At5g13770 PE=2 SV=1)

HSP 1 Score: 585.1 bits (1507), Expect = 9.1e-166
Identity = 289/525 (55.05%), Postives = 384/525 (73.14%), Query Frame = 0

Query: 79  PNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKSTLRHLIRYLVRLKKWDLIL 138
           P   DLN  L    ++P T  L  ++Y KAKE +  R    T +HLI YLV  K WDL++
Sbjct: 69  PGPNDLNRVLSRFLRDPETRKLSSEFYEKAKENSELR----TTKHLISYLVSSKSWDLLV 128

Query: 139 LVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFERNSGVAMTAFEAAMRGYN 198
            V  D  +    PD  TCS L+ SC+R RKF++   LL VF  +  +A++A +AAM+G+N
Sbjct: 129 SVCEDLREHKALPDGQTCSNLIRSCIRDRKFRITHCLLSVFRSDKSLAVSASDAAMKGFN 188

Query: 199 KLHMHKSTIMVFQRLK-SARIEADSGCYCRVMEAYLKLGDSERVMELFNEVES-RISDST 258
           KL M+ STI VF RLK S  +E   GCYCR+MEA+ K+G++ +V+ELF E +S R+S   
Sbjct: 189 KLQMYSSTIQVFDRLKQSVGVEPSPGCYCRIMEAHEKIGENHKVVELFQEFKSQRLSFLA 248

Query: 259 PFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAED 318
             S  IY I+C SLAKSGR FE+LE   +M+ KGI E   +YS LI  FA  +EV + E 
Sbjct: 249 KESGSIYTIVCSSLAKSGRAFEALEVLEEMKDKGIPESSELYSMLIRAFAEAREVVITEK 308

Query: 319 LYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYA 378
           L+ EA  KKLL+DP M LK++LMYV++G++E  LE+V  M+  ++ V+DCI CAIVNG++
Sbjct: 309 LFKEAGGKKLLKDPEMCLKVVLMYVREGNMETTLEVVAAMRKAELKVTDCILCAIVNGFS 368

Query: 379 TRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVV 438
            +RG+  AVKVYE  +++ CE GQVTYA AINAYCR+  Y+KAE +F EM +KGFDKCVV
Sbjct: 369 KQRGFAEAVKVYEWAMKEECEAGQVTYAIAINAYCRLEKYNKAEMLFDEMVKKGFDKCVV 428

Query: 439 AYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEM 498
           AYS+++ MYGKT RL DA+RL+AKMK++GC+PN+WIYN L++MHG+A +L++ EK+WKEM
Sbjct: 429 AYSNIMDMYGKTRRLSDAVRLMAKMKQRGCKPNIWIYNSLIDMHGRAMDLRRAEKIWKEM 488

Query: 499 KRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAFGGIMVGVFSKTSRV 558
           KR K+ PDKVSYTS+ISAY ++ E E+C + Y+EFRMN G ID+A  GIMVGVFSKTSR+
Sbjct: 489 KRAKVLPDKVSYTSMISAYNRSKELERCVELYQEFRMNRGKIDRAMAGIMVGVFSKTSRI 548

Query: 559 DELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDHY 602
           DEL++LL+DMK+EGTRLD RLY +ALNAL DAGL  Q +WLQ+ +
Sbjct: 549 DELMRLLQDMKVEGTRLDARLYSSALNALRDAGLNSQIRWLQESF 589

BLAST of CsGy5G012250 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 1.2e-35
Identity = 122/508 (24.02%), Postives = 212/508 (41.73%), Query Frame = 0

Query: 78  LPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKSTLRHLIRYLVRLKKWDLI 137
           LP   D N  LC         DL+   + K  E  G      T+  +I    R KK    
Sbjct: 67  LPTPIDFNR-LCSAVARTKQYDLVLG-FCKGMELNGIEHDMYTMTIMINCYCRKKKLLFA 126

Query: 138 LLVSRDFVDFGVCPDRDTCSKLVSS-CVRGRKFKVVKSLLEVFERNSGVAMTAFEAAMRG 197
             V       G  PD  T S LV+  C+ GR  + V  +  + E      +      + G
Sbjct: 127 FSVLGRAWKLGYEPDTITFSTLVNGFCLEGRVSEAVALVDRMVEMKQRPDLVTVSTLING 186

Query: 198 YNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDST 257
                     +++  R+     + D   Y  V+    K G+S   ++LF ++E R   + 
Sbjct: 187 LCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDLFRKMEER---NI 246

Query: 258 PFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAED 317
             S   Y I+ +SL K G   ++L  F +M  KGI  D   YS+LI    +  +      
Sbjct: 247 KASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGLCNDGKWDDGAK 306

Query: 318 LYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYA 377
           +  E   + ++ D   F  LI ++V++G L +A E+   M    I      + ++++G+ 
Sbjct: 307 MLREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDTITYNSLIDGFC 366

Query: 378 TRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVV 437
                  A ++++ ++  GCEP  VTY+  IN+YC+         +F E+  KG     +
Sbjct: 367 KENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFREISSKGLIPNTI 426

Query: 438 AYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEM 497
            Y++L+  + ++G+L  A  L  +M  +G  P+V  Y IL++       L +  +++++M
Sbjct: 427 TYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGELNKALEIFEKM 486

Query: 498 KRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAFGGIMVGVFSKTSRV 557
           ++ ++      Y  II     AS+ +     +      G   D     +M+G   K   +
Sbjct: 487 QKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSL 546

Query: 558 DELVKLLRDMKLEGTRLDERLYRTALNA 585
            E   L R MK +G   D+  Y   + A
Sbjct: 547 SEADMLFRKMKEDGCTPDDFTYNILIRA 569

BLAST of CsGy5G012250 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 1.5e-35
Identity = 112/524 (21.37%), Postives = 236/524 (45.04%), Query Frame = 0

Query: 74  QLPSLPNVQDLNDFLCGL-SQNPGTEDLIYDYYVKAKETAGFRPQKSTLRHLIRYLVRLK 133
           +L  +PNV   N  L GL  +N   E L   + +      G  P   +   +I    +  
Sbjct: 151 ELGCIPNVFSYNILLKGLCDENRSQEALELLHMMADDRGGGSPPDVVSYTTVINGFFKEG 210

Query: 134 KWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFERNSGVA-MTAFE 193
             D       + +D G+ PD  T + ++++  + +       +L    +N  +     + 
Sbjct: 211 DSDKAYSTYHEMLDRGILPDVVTYNSIIAALCKAQAMDKAMEVLNTMVKNGVMPDCMTYN 270

Query: 194 AAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESR 253
           + + GY      K  I   ++++S  +E D   Y  +M+   K G      ++F+ +  R
Sbjct: 271 SILHGYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCKNGRCMEARKIFDSMTKR 330

Query: 254 ISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEV 313
                      YG L +  A  G + E       M + GI  D+ ++S LIC +A   +V
Sbjct: 331 ---GLKPEITTYGTLLQGYATKGALVEMHGLLDLMVRNGIHPDHYVFSILICAYAKQGKV 390

Query: 314 KLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFKIGVSDCIFCAI 373
             A  ++++ + + L  +   +  +I +  + G +E A+   E M D  +   + ++ ++
Sbjct: 391 DQAMLVFSKMRQQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQMIDEGLSPGNIVYNSL 450

Query: 374 VNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGF 433
           ++G  T   +E A ++  ++++ G     + + S I+++C+ G   ++E +F  M   G 
Sbjct: 451 IHGLCTCNKWERAEELILEMLDRGICLNTIFFNSIIDSHCKEGRVIESEKLFELMVRIGV 510

Query: 434 DKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEK 493
              V+ Y++LI+ Y   G++ +AM+LL+ M   G +PN   Y+ L+  + K   ++    
Sbjct: 511 KPNVITYNTLINGYCLAGKMDEAMKLLSGMVSVGLKPNTVTYSTLINGYCKISRMEDALV 570

Query: 494 LWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAFGGIMVGVFS 553
           L+KEM+   ++PD ++Y  I+    +       ++ Y     +G  I+ +   I++    
Sbjct: 571 LFKEMESSGVSPDIITYNIILQGLFQTRRTAAAKELYVRITESGTQIELSTYNIILHGLC 630

Query: 554 KTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAK 596
           K    D+ +++ +++ L   +L+ R +   ++AL+  G   +AK
Sbjct: 631 KNKLTDDALQMFQNLCLMDLKLEARTFNIMIDALLKVGRNDEAK 671

BLAST of CsGy5G012250 vs. ExPASy Swiss-Prot
Match: Q9LW84 (Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX=3702 GN=At3g16010 PE=2 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 3.2e-33
Identity = 92/397 (23.17%), Postives = 186/397 (46.85%), Query Frame = 0

Query: 190 FEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVE 249
           + A +  Y KL  + S I +F  +K   ++     Y  ++  Y K+G  E+ ++LF E++
Sbjct: 236 YSALISSYEKLGRNDSAIRLFDEMKDNCMQPTEKIYTTLLGIYFKVGKVEKALDLFEEMK 295

Query: 250 SRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQ 309
                 T ++   Y  L + L K+GRV E+  F++DM + G+  D    + L+     + 
Sbjct: 296 RAGCSPTVYT---YTELIKGLGKAGRVDEAYGFYKDMLRDGLTPDVVFLNNLMNILGKVG 355

Query: 310 EVKLAEDLYNEAKAKKLLRDPAMFLKLI-LMYVQQGSLEKALEIVEVMKDFKIGVSDCIF 369
            V+   ++++E    +       +  +I  ++  +  + +     + MK   +  S+  +
Sbjct: 356 RVEELTNVFSEMGMWRCTPTVVSYNTVIKALFESKAHVSEVSSWFDKMKADSVSPSEFTY 415

Query: 370 CAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEE 429
             +++GY      E A+ + E++ E G  P    Y S INA  +   Y  A ++F E++E
Sbjct: 416 SILIDGYCKTNRVEKALLLLEEMDEKGFPPCPAAYCSLINALGKAKRYEAANELFKELKE 475

Query: 430 KGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQ 489
              +     Y+ +I  +GK G+L +A+ L  +MK +G  P+V+ YN LM    KA  + +
Sbjct: 476 NFGNVSSRVYAVMIKHFGKCGKLSEAVDLFNEMKNQGSGPDVYAYNALMSGMVKAGMINE 535

Query: 490 VEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAFGGIMVG 549
              L ++M+      D  S+  I++ + +     +  + +   + +G   D      ++G
Sbjct: 536 ANSLLRKMEENGCRADINSHNIILNGFARTGVPRRAIEMFETIKHSGIKPDGVTYNTLLG 595

Query: 550 VFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNAL 586
            F+     +E  +++R+MK +G   D   Y + L+A+
Sbjct: 596 CFAHAGMFEEAARMMREMKDKGFEYDAITYSSILDAV 629

BLAST of CsGy5G012250 vs. ExPASy Swiss-Prot
Match: Q8L844 (Pentatricopeptide repeat-containing protein At5g42310, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRP1 PE=1 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 4.1e-33
Identity = 104/447 (23.27%), Postives = 204/447 (45.64%), Query Frame = 0

Query: 143 DFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFERNSGVAMTAFEAAMRGYNKLHM 202
           DFV++ +     T S  + S +  R +K      E+      + +      + G+ K   
Sbjct: 231 DFVNYSLVIQSLTRSNKIDSVMLLRLYK------EIERDKLELDVQLVNDIIMGFAKSGD 290

Query: 203 HKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVME---LFNEVESRISDSTPFS 262
               + +    ++  + A +     ++ A   L DS R +E   LF E+  R S   P  
Sbjct: 291 PSKALQLLGMAQATGLSAKTATLVSIISA---LADSGRTLEAEALFEEL--RQSGIKP-R 350

Query: 263 TKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYN 322
           T+ Y  L +   K+G + ++     +M K+G++ D   YS LI  + +    + A  +  
Sbjct: 351 TRAYNALLKGYVKTGPLKDAESMVSEMEKRGVSPDEHTYSLLIDAYVNAGRWESARIVLK 410

Query: 323 EAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRR 382
           E +A  +  +  +F +L+  +  +G  +K  ++++ MK   +      +  +++ +    
Sbjct: 411 EMEAGDVQPNSFVFSRLLAGFRDRGEWQKTFQVLKEMKSIGVKPDRQFYNVVIDTFGKFN 470

Query: 383 GYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYS 442
             + A+  +++++ +G EP +VT+ + I+ +C+ G +  AE++F  ME +G   C   Y+
Sbjct: 471 CLDHAMTTFDRMLSEGIEPDRVTWNTLIDCHCKHGRHIVAEEMFEAMERRGCLPCATTYN 530

Query: 443 SLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRK 502
            +I+ YG   R  D  RLL KMK +G  PNV  +  L++++GK+       +  +EMK  
Sbjct: 531 IMINSYGDQERWDDMKRLLGKMKSQGILPNVVTHTTLVDVYGKSGRFNDAIECLEEMKSV 590

Query: 503 KIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAFGGIMVGVFSKTSRVDEL 562
            + P    Y ++I+AY +    E+    +R    +G          ++  F +  R  E 
Sbjct: 591 GLKPSSTMYNALINAYAQRGLSEQAVNAFRVMTSDGLKPSLLALNSLINAFGEDRRDAEA 650

Query: 563 VKLLRDMKLEGTRLDERLYRTALNALM 587
             +L+ MK  G + D   Y T + AL+
Sbjct: 651 FAVLQYMKENGVKPDVVTYTTLMKALI 665

BLAST of CsGy5G012250 vs. NCBI nr
Match: XP_004151188.1 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic [Cucumis sativus] >KGN49761.1 hypothetical protein Csa_017804 [Cucumis sativus])

HSP 1 Score: 1201 bits (3106), Expect = 0.0
Identity = 606/608 (99.67%), Postives = 607/608 (99.84%), Query Frame = 0

Query: 1   MALSPSPDCSFPPSNSFRKSHFISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQPSIA 60
           MALSPSPDCSFPPSNSFRKSHFISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQPSIA
Sbjct: 1   MALSPSPDCSFPPSNSFRKSHFISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQPSIA 60

Query: 61  LPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKST 120
           LPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKST
Sbjct: 61  LPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKST 120

Query: 121 LRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFE 180
           LRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFE
Sbjct: 121 LRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFE 180

Query: 181 RNSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSER 240
           R+SGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSER
Sbjct: 181 RDSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSER 240

Query: 241 VMELFNEVESRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300
           VMELFNEVESRIS STPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA
Sbjct: 241 VMELFNEVESRISVSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300

Query: 301 LICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFK 360
           LICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFK
Sbjct: 301 LICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFK 360

Query: 361 IGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAE 420
           IGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAE
Sbjct: 361 IGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAE 420

Query: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480
           DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH
Sbjct: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480

Query: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDK 540
           GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDK
Sbjct: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDK 540

Query: 541 AFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDH 600
           AFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDH
Sbjct: 541 AFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDH 600

Query: 601 YAGKSGFV 608
           YAGKSGFV
Sbjct: 601 YAGKSGFV 608

BLAST of CsGy5G012250 vs. NCBI nr
Match: XP_008465506.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic [Cucumis melo])

HSP 1 Score: 1158 bits (2996), Expect = 0.0
Identity = 584/608 (96.05%), Postives = 596/608 (98.03%), Query Frame = 0

Query: 1   MALSPSPDCSFPPSNSFRKSHFISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQPSIA 60
           MA+SPSPDCSFPPSNSFRKSHFI TSNF LLFSLPTSNLPSLHLNSSG PSPILEQPSIA
Sbjct: 1   MAVSPSPDCSFPPSNSFRKSHFIPTSNFPLLFSLPTSNLPSLHLNSSGFPSPILEQPSIA 60

Query: 61  LPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKST 120
           LPDIHSNSNLHDFQLP L NV+DLNDFLCGLSQNPGTEDLIYDYYVKAKE AGFRP+KST
Sbjct: 61  LPDIHSNSNLHDFQLPPLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPEKST 120

Query: 121 LRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFE 180
           LRHLIRYLVRLKKWDLI LVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVK+LLEVFE
Sbjct: 121 LRHLIRYLVRLKKWDLIFLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKALLEVFE 180

Query: 181 RNSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSER 240
           R+S VAMTAFEAAMRGYNKLHM+KSTIMVFQRLKSARIEADSGCY RVMEAYLKLGDSER
Sbjct: 181 RDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYFRVMEAYLKLGDSER 240

Query: 241 VMELFNEVESRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300
           VMELFNEVESRIS+ TPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA
Sbjct: 241 VMELFNEVESRISNLTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300

Query: 301 LICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFK 360
           LICTFASI+EVKLAEDLYNEAKAKKLLRDPAMFLKLILMY+QQGSLEKALEIVEVMKDFK
Sbjct: 301 LICTFASIREVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFK 360

Query: 361 IGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAE 420
           IGVSDCIFCAIVNGYATRRGY+AAVKVYEKLI DGCEPGQVTYASAINAYCRVGLYSKAE
Sbjct: 361 IGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIGDGCEPGQVTYASAINAYCRVGLYSKAE 420

Query: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480
           DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH
Sbjct: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480

Query: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDK 540
           GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKASEFEKCEQYYREFRMNGGTIDK
Sbjct: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKASEFEKCEQYYREFRMNGGTIDK 540

Query: 541 AFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDH 600
           A GGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYR+ALNALMDAGLQVQAKWLQDH
Sbjct: 541 AIGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRSALNALMDAGLQVQAKWLQDH 600

Query: 601 YAGKSGFV 608
           YAGKSGFV
Sbjct: 601 YAGKSGFV 608

BLAST of CsGy5G012250 vs. NCBI nr
Match: KAA0067569.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYJ97187.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1152 bits (2981), Expect = 0.0
Identity = 581/608 (95.56%), Postives = 594/608 (97.70%), Query Frame = 0

Query: 1   MALSPSPDCSFPPSNSFRKSHFISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQPSIA 60
           MA+SPSPDCSFPPSNSFRKSHFI TSNF LLFSL TSNLPSLHLNSSG PSPILEQPSIA
Sbjct: 1   MAVSPSPDCSFPPSNSFRKSHFIPTSNFPLLFSLSTSNLPSLHLNSSGFPSPILEQPSIA 60

Query: 61  LPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKST 120
           LPDIHSNSNLHDFQLP L NV+DLNDFLCGLSQNPGTEDLIYDYYVKAKE AGFRP+KST
Sbjct: 61  LPDIHSNSNLHDFQLPPLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPEKST 120

Query: 121 LRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFE 180
           LRHLIRYLVRLKKWDLI LVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVK+LLEVFE
Sbjct: 121 LRHLIRYLVRLKKWDLIFLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKALLEVFE 180

Query: 181 RNSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSER 240
           R+S VA+T FEAAMRGYNKLHM+KSTIMVFQRLKSARIEADSGCY RVMEAYLKLGDSER
Sbjct: 181 RDSDVALTTFEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYFRVMEAYLKLGDSER 240

Query: 241 VMELFNEVESRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300
           VMELFNEVESRIS+ TPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA
Sbjct: 241 VMELFNEVESRISNLTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300

Query: 301 LICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFK 360
           LICTFASI+EVKLAEDLYNEAKAKKLLRDPAMFLKLILMY+QQGSLEKALEIVEVMKDFK
Sbjct: 301 LICTFASIREVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFK 360

Query: 361 IGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAE 420
           IGVSDCIFCAIVNGYATRRGY+AAVKVYEKLI DGCEPGQVTYASAINAYCRVGLYSKAE
Sbjct: 361 IGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIGDGCEPGQVTYASAINAYCRVGLYSKAE 420

Query: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480
           DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH
Sbjct: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480

Query: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDK 540
           GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKASEFEKCEQYYREFRMNGGTIDK
Sbjct: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKASEFEKCEQYYREFRMNGGTIDK 540

Query: 541 AFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDH 600
           A GGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYR+ALNALMDAGLQVQAKWLQDH
Sbjct: 541 AIGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRSALNALMDAGLQVQAKWLQDH 600

Query: 601 YAGKSGFV 608
           YAGKSGFV
Sbjct: 601 YAGKSGFV 608

BLAST of CsGy5G012250 vs. NCBI nr
Match: XP_038874313.1 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 [Benincasa hispida] >XP_038874314.1 pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 1097 bits (2836), Expect = 0.0
Identity = 557/611 (91.16%), Postives = 577/611 (94.44%), Query Frame = 0

Query: 1   MALSPSPDCSFPPSNSFRKSH---FISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQP 60
           MA++ SPD S PPS SFRKSH   FI TSN S LFSLPTSNL SLHL SSGCPSPILEQ 
Sbjct: 1   MAVTGSPDWSLPPSTSFRKSHLINFIPTSNLSFLFSLPTSNLRSLHLKSSGCPSPILEQS 60

Query: 61  SIALPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQ 120
           SIALPDIH +SNL D QLPSLP V+DLNDFLCGLSQNPG+EDLIY+YYVKAKE AGFRP+
Sbjct: 61  SIALPDIHLDSNLQDIQLPSLPTVEDLNDFLCGLSQNPGSEDLIYEYYVKAKEKAGFRPE 120

Query: 121 KSTLRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLE 180
           KSTLRHLIRYLVRLKKWDLILLVSRDFVD+ VCPDRDTCS+LVSSCVRGRKFKVVK+LLE
Sbjct: 121 KSTLRHLIRYLVRLKKWDLILLVSRDFVDYSVCPDRDTCSRLVSSCVRGRKFKVVKALLE 180

Query: 181 VFERNSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFE++S VA  AFEAAMRGYNKLHM+KSTI+VFQRLKSARIEADSGC CRVMEAYLKLGD
Sbjct: 181 VFEKDSDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSARIEADSGCCCRVMEAYLKLGD 240

Query: 241 SERVMELFNEVESRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTI 300
           SERVMELFNEVESRISD TPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGI EDYTI
Sbjct: 241 SERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIVEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMK 360
           YSALI TFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMY+QQGSLEKALE+V+VMK
Sbjct: 301 YSALISTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALELVQVMK 360

Query: 361 DFKIGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFKIGVSDCIFCAIVNGYATRRGY AAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS
Sbjct: 361 DFKIGVSDCIFCAIVNGYATRRGYNAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420

Query: 421 KAEDIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILM 480
           KAED+FGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILM
Sbjct: 421 KAEDMFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGT 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAY KASEFE CEQYY EFRMNGGT
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYAKASEFETCEQYYLEFRMNGGT 540

Query: 541 IDKAFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWL 600
           IDKA  GIMVGVFSKTSRVDELVKLLRDMKLEGTRLD RLYR+ALNALMDAGLQVQAKWL
Sbjct: 541 IDKAMAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGFV 608
           Q HYAGKSGFV
Sbjct: 601 QGHYAGKSGFV 611

BLAST of CsGy5G012250 vs. NCBI nr
Match: XP_022993436.1 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 [Cucurbita maxima])

HSP 1 Score: 1048 bits (2709), Expect = 0.0
Identity = 531/611 (86.91%), Postives = 561/611 (91.82%), Query Frame = 0

Query: 1   MALSPSPDCSFPPSNSFRKSH---FISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQP 60
           MA++ SPD S P S  FRKS    FI  SN +LLFSLP  NL SLHLNSSGCPSPILE  
Sbjct: 1   MAVTGSPDWSLPSSTCFRKSRLITFIPASNLALLFSLP--NLRSLHLNSSGCPSPILESS 60

Query: 61  SIALPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQ 120
             +LP+I S+SNL DFQLPS  +V+DLNDFLCGL QNPG EDLIY+YYVKAKET GFRP+
Sbjct: 61  PTSLPEIDSDSNLQDFQLPSSSSVEDLNDFLCGLPQNPGREDLIYEYYVKAKETPGFRPE 120

Query: 121 KSTLRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLE 180
           KSTLRHLIRYLVRLKKW LILLVSRDFVD+ VCPDRDTCS+LVSSCVRGRKFKVV++LLE
Sbjct: 121 KSTLRHLIRYLVRLKKWSLILLVSRDFVDYDVCPDRDTCSRLVSSCVRGRKFKVVRALLE 180

Query: 181 VFERNSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFER+  VA  AFEAAMRGYNKLHM++STI+VFQRLKSA+IEADSGCYCRVMEAYLKLGD
Sbjct: 181 VFERDRDVATAAFEAAMRGYNKLHMYRSTILVFQRLKSAKIEADSGCYCRVMEAYLKLGD 240

Query: 241 SERVMELFNEVESRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTI 300
           SER+MELFNE+ESRISD TPFSTKIYGILC+SLAKSGRVFESLEFFRDMRKKGI EDYTI
Sbjct: 241 SERIMELFNEIESRISDFTPFSTKIYGILCKSLAKSGRVFESLEFFRDMRKKGIVEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMK 360
           YSALICTFASIQEVKLAEDLYNEAK KKLLRDPA F KLILMY+QQGSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIQEVKLAEDLYNEAKTKKLLRDPATFQKLILMYIQQGSLEKALEIVEVMK 360

Query: 361 DFKIGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFKIG SDCIFCAIVNGYATRRGY AAV +YEKLI D CEPGQVTYA AINAYCRVGLYS
Sbjct: 361 DFKIGASDCIFCAIVNGYATRRGYNAAVNIYEKLIRDECEPGQVTYALAINAYCRVGLYS 420

Query: 421 KAEDIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILM 480
           KAED+F EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILM
Sbjct: 421 KAEDVFVEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGT 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKA+EFE CE+YYREFRMNGGT
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKAAEFETCERYYREFRMNGGT 540

Query: 541 IDKAFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWL 600
           IDKA  GIMVGVFSKTSRVDELVKLLRDM LEG RLDERLYR+ALNALMDAGLQVQAKWL
Sbjct: 541 IDKAIAGIMVGVFSKTSRVDELVKLLRDMNLEGIRLDERLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGFV 608
           QDHYAGKSGFV
Sbjct: 601 QDHYAGKSGFV 609

BLAST of CsGy5G012250 vs. ExPASy TrEMBL
Match: A0A0A0KPV8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G114580 PE=4 SV=1)

HSP 1 Score: 1201 bits (3106), Expect = 0.0
Identity = 606/608 (99.67%), Postives = 607/608 (99.84%), Query Frame = 0

Query: 1   MALSPSPDCSFPPSNSFRKSHFISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQPSIA 60
           MALSPSPDCSFPPSNSFRKSHFISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQPSIA
Sbjct: 1   MALSPSPDCSFPPSNSFRKSHFISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQPSIA 60

Query: 61  LPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKST 120
           LPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKST
Sbjct: 61  LPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKST 120

Query: 121 LRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFE 180
           LRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFE
Sbjct: 121 LRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFE 180

Query: 181 RNSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSER 240
           R+SGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSER
Sbjct: 181 RDSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSER 240

Query: 241 VMELFNEVESRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300
           VMELFNEVESRIS STPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA
Sbjct: 241 VMELFNEVESRISVSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300

Query: 301 LICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFK 360
           LICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFK
Sbjct: 301 LICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFK 360

Query: 361 IGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAE 420
           IGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAE
Sbjct: 361 IGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAE 420

Query: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480
           DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH
Sbjct: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480

Query: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDK 540
           GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDK
Sbjct: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDK 540

Query: 541 AFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDH 600
           AFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDH
Sbjct: 541 AFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDH 600

Query: 601 YAGKSGFV 608
           YAGKSGFV
Sbjct: 601 YAGKSGFV 608

BLAST of CsGy5G012250 vs. ExPASy TrEMBL
Match: A0A1S3CPF0 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103503127 PE=4 SV=1)

HSP 1 Score: 1158 bits (2996), Expect = 0.0
Identity = 584/608 (96.05%), Postives = 596/608 (98.03%), Query Frame = 0

Query: 1   MALSPSPDCSFPPSNSFRKSHFISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQPSIA 60
           MA+SPSPDCSFPPSNSFRKSHFI TSNF LLFSLPTSNLPSLHLNSSG PSPILEQPSIA
Sbjct: 1   MAVSPSPDCSFPPSNSFRKSHFIPTSNFPLLFSLPTSNLPSLHLNSSGFPSPILEQPSIA 60

Query: 61  LPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKST 120
           LPDIHSNSNLHDFQLP L NV+DLNDFLCGLSQNPGTEDLIYDYYVKAKE AGFRP+KST
Sbjct: 61  LPDIHSNSNLHDFQLPPLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPEKST 120

Query: 121 LRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFE 180
           LRHLIRYLVRLKKWDLI LVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVK+LLEVFE
Sbjct: 121 LRHLIRYLVRLKKWDLIFLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKALLEVFE 180

Query: 181 RNSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSER 240
           R+S VAMTAFEAAMRGYNKLHM+KSTIMVFQRLKSARIEADSGCY RVMEAYLKLGDSER
Sbjct: 181 RDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYFRVMEAYLKLGDSER 240

Query: 241 VMELFNEVESRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300
           VMELFNEVESRIS+ TPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA
Sbjct: 241 VMELFNEVESRISNLTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300

Query: 301 LICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFK 360
           LICTFASI+EVKLAEDLYNEAKAKKLLRDPAMFLKLILMY+QQGSLEKALEIVEVMKDFK
Sbjct: 301 LICTFASIREVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFK 360

Query: 361 IGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAE 420
           IGVSDCIFCAIVNGYATRRGY+AAVKVYEKLI DGCEPGQVTYASAINAYCRVGLYSKAE
Sbjct: 361 IGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIGDGCEPGQVTYASAINAYCRVGLYSKAE 420

Query: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480
           DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH
Sbjct: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480

Query: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDK 540
           GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKASEFEKCEQYYREFRMNGGTIDK
Sbjct: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKASEFEKCEQYYREFRMNGGTIDK 540

Query: 541 AFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDH 600
           A GGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYR+ALNALMDAGLQVQAKWLQDH
Sbjct: 541 AIGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRSALNALMDAGLQVQAKWLQDH 600

Query: 601 YAGKSGFV 608
           YAGKSGFV
Sbjct: 601 YAGKSGFV 608

BLAST of CsGy5G012250 vs. ExPASy TrEMBL
Match: A0A5A7VR01 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold174G00670 PE=4 SV=1)

HSP 1 Score: 1152 bits (2981), Expect = 0.0
Identity = 581/608 (95.56%), Postives = 594/608 (97.70%), Query Frame = 0

Query: 1   MALSPSPDCSFPPSNSFRKSHFISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQPSIA 60
           MA+SPSPDCSFPPSNSFRKSHFI TSNF LLFSL TSNLPSLHLNSSG PSPILEQPSIA
Sbjct: 1   MAVSPSPDCSFPPSNSFRKSHFIPTSNFPLLFSLSTSNLPSLHLNSSGFPSPILEQPSIA 60

Query: 61  LPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKST 120
           LPDIHSNSNLHDFQLP L NV+DLNDFLCGLSQNPGTEDLIYDYYVKAKE AGFRP+KST
Sbjct: 61  LPDIHSNSNLHDFQLPPLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPEKST 120

Query: 121 LRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFE 180
           LRHLIRYLVRLKKWDLI LVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVK+LLEVFE
Sbjct: 121 LRHLIRYLVRLKKWDLIFLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKALLEVFE 180

Query: 181 RNSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSER 240
           R+S VA+T FEAAMRGYNKLHM+KSTIMVFQRLKSARIEADSGCY RVMEAYLKLGDSER
Sbjct: 181 RDSDVALTTFEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYFRVMEAYLKLGDSER 240

Query: 241 VMELFNEVESRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300
           VMELFNEVESRIS+ TPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA
Sbjct: 241 VMELFNEVESRISNLTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSA 300

Query: 301 LICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFK 360
           LICTFASI+EVKLAEDLYNEAKAKKLLRDPAMFLKLILMY+QQGSLEKALEIVEVMKDFK
Sbjct: 301 LICTFASIREVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFK 360

Query: 361 IGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAE 420
           IGVSDCIFCAIVNGYATRRGY+AAVKVYEKLI DGCEPGQVTYASAINAYCRVGLYSKAE
Sbjct: 361 IGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIGDGCEPGQVTYASAINAYCRVGLYSKAE 420

Query: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480
           DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH
Sbjct: 421 DIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMH 480

Query: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDK 540
           GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKASEFEKCEQYYREFRMNGGTIDK
Sbjct: 481 GKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKASEFEKCEQYYREFRMNGGTIDK 540

Query: 541 AFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDH 600
           A GGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYR+ALNALMDAGLQVQAKWLQDH
Sbjct: 541 AIGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRSALNALMDAGLQVQAKWLQDH 600

Query: 601 YAGKSGFV 608
           YAGKSGFV
Sbjct: 601 YAGKSGFV 608

BLAST of CsGy5G012250 vs. ExPASy TrEMBL
Match: A0A6J1JWB4 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489451 PE=4 SV=1)

HSP 1 Score: 1048 bits (2709), Expect = 0.0
Identity = 531/611 (86.91%), Postives = 561/611 (91.82%), Query Frame = 0

Query: 1   MALSPSPDCSFPPSNSFRKSH---FISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQP 60
           MA++ SPD S P S  FRKS    FI  SN +LLFSLP  NL SLHLNSSGCPSPILE  
Sbjct: 1   MAVTGSPDWSLPSSTCFRKSRLITFIPASNLALLFSLP--NLRSLHLNSSGCPSPILESS 60

Query: 61  SIALPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQ 120
             +LP+I S+SNL DFQLPS  +V+DLNDFLCGL QNPG EDLIY+YYVKAKET GFRP+
Sbjct: 61  PTSLPEIDSDSNLQDFQLPSSSSVEDLNDFLCGLPQNPGREDLIYEYYVKAKETPGFRPE 120

Query: 121 KSTLRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLE 180
           KSTLRHLIRYLVRLKKW LILLVSRDFVD+ VCPDRDTCS+LVSSCVRGRKFKVV++LLE
Sbjct: 121 KSTLRHLIRYLVRLKKWSLILLVSRDFVDYDVCPDRDTCSRLVSSCVRGRKFKVVRALLE 180

Query: 181 VFERNSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFER+  VA  AFEAAMRGYNKLHM++STI+VFQRLKSA+IEADSGCYCRVMEAYLKLGD
Sbjct: 181 VFERDRDVATAAFEAAMRGYNKLHMYRSTILVFQRLKSAKIEADSGCYCRVMEAYLKLGD 240

Query: 241 SERVMELFNEVESRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTI 300
           SER+MELFNE+ESRISD TPFSTKIYGILC+SLAKSGRVFESLEFFRDMRKKGI EDYTI
Sbjct: 241 SERIMELFNEIESRISDFTPFSTKIYGILCKSLAKSGRVFESLEFFRDMRKKGIVEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMK 360
           YSALICTFASIQEVKLAEDLYNEAK KKLLRDPA F KLILMY+QQGSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIQEVKLAEDLYNEAKTKKLLRDPATFQKLILMYIQQGSLEKALEIVEVMK 360

Query: 361 DFKIGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFKIG SDCIFCAIVNGYATRRGY AAV +YEKLI D CEPGQVTYA AINAYCRVGLYS
Sbjct: 361 DFKIGASDCIFCAIVNGYATRRGYNAAVNIYEKLIRDECEPGQVTYALAINAYCRVGLYS 420

Query: 421 KAEDIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILM 480
           KAED+F EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILM
Sbjct: 421 KAEDVFVEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGT 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKA+EFE CE+YYREFRMNGGT
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKAAEFETCERYYREFRMNGGT 540

Query: 541 IDKAFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWL 600
           IDKA  GIMVGVFSKTSRVDELVKLLRDM LEG RLDERLYR+ALNALMDAGLQVQAKWL
Sbjct: 541 IDKAIAGIMVGVFSKTSRVDELVKLLRDMNLEGIRLDERLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGFV 608
           QDHYAGKSGFV
Sbjct: 601 QDHYAGKSGFV 609

BLAST of CsGy5G012250 vs. ExPASy TrEMBL
Match: A0A6J1FDV4 (pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111444845 PE=4 SV=1)

HSP 1 Score: 1047 bits (2708), Expect = 0.0
Identity = 532/611 (87.07%), Postives = 561/611 (91.82%), Query Frame = 0

Query: 1   MALSPSPDCSFPPSNSFRKSH---FISTSNFSLLFSLPTSNLPSLHLNSSGCPSPILEQP 60
           MA++ SPD S P S  FRKS    FI  SN +LLFSLP  NL SLHLNSSGCPSPILE  
Sbjct: 1   MAVTGSPDWSLPSSTCFRKSRLLTFIPASNLALLFSLP--NLRSLHLNSSGCPSPILESS 60

Query: 61  SIALPDIHSNSNLHDFQLPSLPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQ 120
             +LP+I S+SNL DFQLPS  +V+DLNDFLCGL QNPG EDLIY+YYVKAKET GFRP+
Sbjct: 61  PTSLPEIDSDSNLQDFQLPSSSSVEDLNDFLCGLPQNPGREDLIYEYYVKAKETPGFRPE 120

Query: 121 KSTLRHLIRYLVRLKKWDLILLVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLE 180
           KSTLRHLIRYLVR K W+LILLVSRDFVD+ VCPDRDTCS+LVSSCVRGRKFKVV++LLE
Sbjct: 121 KSTLRHLIRYLVRSKNWNLILLVSRDFVDYDVCPDRDTCSRLVSSCVRGRKFKVVRALLE 180

Query: 181 VFERNSGVAMTAFEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGD 240
           VFER+  VA  AFEAAMRGYNKLHM+KSTI+VFQRLKSA+IEADSGCYCRVMEAYLKLGD
Sbjct: 181 VFERDRDVATAAFEAAMRGYNKLHMYKSTILVFQRLKSAKIEADSGCYCRVMEAYLKLGD 240

Query: 241 SERVMELFNEVESRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTI 300
           SER+MELFNE+ESRISD TPFSTKIYGILC+SLAKSGRVFESLEFFRDMRKKGI EDYTI
Sbjct: 241 SERIMELFNEIESRISDFTPFSTKIYGILCKSLAKSGRVFESLEFFRDMRKKGIVEDYTI 300

Query: 301 YSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMK 360
           YSALICTFASIQEVKLAEDLYNEAK KKLLRDPAMF KLILMY+QQGSLEKALEIVEVMK
Sbjct: 301 YSALICTFASIQEVKLAEDLYNEAKTKKLLRDPAMFQKLILMYIQQGSLEKALEIVEVMK 360

Query: 361 DFKIGVSDCIFCAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYS 420
           DFKIGVSDCIFCAIVNGYATRRGY AAV VYEKLI D CEPGQVTYA AINAYCRVGLYS
Sbjct: 361 DFKIGVSDCIFCAIVNGYATRRGYNAAVNVYEKLIRDECEPGQVTYALAINAYCRVGLYS 420

Query: 421 KAEDIFGEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILM 480
           KAED+F EMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILM
Sbjct: 421 KAEDVFVEMEEKGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKERGCQPNVWIYNILM 480

Query: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGT 540
           EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKA+EFE CE+YYREFRMNGG 
Sbjct: 481 EMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIINAYVKAAEFETCERYYREFRMNGGA 540

Query: 541 IDKAFGGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWL 600
           IDKA  GIMVGVFSKTSRVDELVKLLRDM LEG RLDERLYR+ALNALMDAGLQVQAKWL
Sbjct: 541 IDKAIAGIMVGVFSKTSRVDELVKLLRDMNLEGIRLDERLYRSALNALMDAGLQVQAKWL 600

Query: 601 QDHYAGKSGFV 608
           QDHYAGKSGFV
Sbjct: 601 QDHYAGKSGFV 609

BLAST of CsGy5G012250 vs. TAIR 10
Match: AT5G13770.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 585.1 bits (1507), Expect = 6.5e-167
Identity = 289/525 (55.05%), Postives = 384/525 (73.14%), Query Frame = 0

Query: 79  PNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKSTLRHLIRYLVRLKKWDLIL 138
           P   DLN  L    ++P T  L  ++Y KAKE +  R    T +HLI YLV  K WDL++
Sbjct: 69  PGPNDLNRVLSRFLRDPETRKLSSEFYEKAKENSELR----TTKHLISYLVSSKSWDLLV 128

Query: 139 LVSRDFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFERNSGVAMTAFEAAMRGYN 198
            V  D  +    PD  TCS L+ SC+R RKF++   LL VF  +  +A++A +AAM+G+N
Sbjct: 129 SVCEDLREHKALPDGQTCSNLIRSCIRDRKFRITHCLLSVFRSDKSLAVSASDAAMKGFN 188

Query: 199 KLHMHKSTIMVFQRLK-SARIEADSGCYCRVMEAYLKLGDSERVMELFNEVES-RISDST 258
           KL M+ STI VF RLK S  +E   GCYCR+MEA+ K+G++ +V+ELF E +S R+S   
Sbjct: 189 KLQMYSSTIQVFDRLKQSVGVEPSPGCYCRIMEAHEKIGENHKVVELFQEFKSQRLSFLA 248

Query: 259 PFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAED 318
             S  IY I+C SLAKSGR FE+LE   +M+ KGI E   +YS LI  FA  +EV + E 
Sbjct: 249 KESGSIYTIVCSSLAKSGRAFEALEVLEEMKDKGIPESSELYSMLIRAFAEAREVVITEK 308

Query: 319 LYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYA 378
           L+ EA  KKLL+DP M LK++LMYV++G++E  LE+V  M+  ++ V+DCI CAIVNG++
Sbjct: 309 LFKEAGGKKLLKDPEMCLKVVLMYVREGNMETTLEVVAAMRKAELKVTDCILCAIVNGFS 368

Query: 379 TRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVV 438
            +RG+  AVKVYE  +++ CE GQVTYA AINAYCR+  Y+KAE +F EM +KGFDKCVV
Sbjct: 369 KQRGFAEAVKVYEWAMKEECEAGQVTYAIAINAYCRLEKYNKAEMLFDEMVKKGFDKCVV 428

Query: 439 AYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEM 498
           AYS+++ MYGKT RL DA+RL+AKMK++GC+PN+WIYN L++MHG+A +L++ EK+WKEM
Sbjct: 429 AYSNIMDMYGKTRRLSDAVRLMAKMKQRGCKPNIWIYNSLIDMHGRAMDLRRAEKIWKEM 488

Query: 499 KRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAFGGIMVGVFSKTSRV 558
           KR K+ PDKVSYTS+ISAY ++ E E+C + Y+EFRMN G ID+A  GIMVGVFSKTSR+
Sbjct: 489 KRAKVLPDKVSYTSMISAYNRSKELERCVELYQEFRMNRGKIDRAMAGIMVGVFSKTSRI 548

Query: 559 DELVKLLRDMKLEGTRLDERLYRTALNALMDAGLQVQAKWLQDHY 602
           DEL++LL+DMK+EGTRLD RLY +ALNAL DAGL  Q +WLQ+ +
Sbjct: 549 DELMRLLQDMKVEGTRLDARLYSSALNALRDAGLNSQIRWLQESF 589

BLAST of CsGy5G012250 vs. TAIR 10
Match: AT3G22470.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 152.9 bits (385), Expect = 8.2e-37
Identity = 122/508 (24.02%), Postives = 212/508 (41.73%), Query Frame = 0

Query: 78  LPNVQDLNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKSTLRHLIRYLVRLKKWDLI 137
           LP   D N  LC         DL+   + K  E  G      T+  +I    R KK    
Sbjct: 67  LPTPIDFNR-LCSAVARTKQYDLVLG-FCKGMELNGIEHDMYTMTIMINCYCRKKKLLFA 126

Query: 138 LLVSRDFVDFGVCPDRDTCSKLVSS-CVRGRKFKVVKSLLEVFERNSGVAMTAFEAAMRG 197
             V       G  PD  T S LV+  C+ GR  + V  +  + E      +      + G
Sbjct: 127 FSVLGRAWKLGYEPDTITFSTLVNGFCLEGRVSEAVALVDRMVEMKQRPDLVTVSTLING 186

Query: 198 YNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDST 257
                     +++  R+     + D   Y  V+    K G+S   ++LF ++E R   + 
Sbjct: 187 LCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDLFRKMEER---NI 246

Query: 258 PFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAED 317
             S   Y I+ +SL K G   ++L  F +M  KGI  D   YS+LI    +  +      
Sbjct: 247 KASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGLCNDGKWDDGAK 306

Query: 318 LYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYA 377
           +  E   + ++ D   F  LI ++V++G L +A E+   M    I      + ++++G+ 
Sbjct: 307 MLREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDTITYNSLIDGFC 366

Query: 378 TRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVV 437
                  A ++++ ++  GCEP  VTY+  IN+YC+         +F E+  KG     +
Sbjct: 367 KENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFREISSKGLIPNTI 426

Query: 438 AYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEM 497
            Y++L+  + ++G+L  A  L  +M  +G  P+V  Y IL++       L +  +++++M
Sbjct: 427 TYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGELNKALEIFEKM 486

Query: 498 KRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAFGGIMVGVFSKTSRV 557
           ++ ++      Y  II     AS+ +     +      G   D     +M+G   K   +
Sbjct: 487 QKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSL 546

Query: 558 DELVKLLRDMKLEGTRLDERLYRTALNA 585
            E   L R MK +G   D+  Y   + A
Sbjct: 547 SEADMLFRKMKEDGCTPDDFTYNILIRA 569

BLAST of CsGy5G012250 vs. TAIR 10
Match: AT3G16010.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 144.8 bits (364), Expect = 2.2e-34
Identity = 92/397 (23.17%), Postives = 186/397 (46.85%), Query Frame = 0

Query: 190 FEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVE 249
           + A +  Y KL  + S I +F  +K   ++     Y  ++  Y K+G  E+ ++LF E++
Sbjct: 236 YSALISSYEKLGRNDSAIRLFDEMKDNCMQPTEKIYTTLLGIYFKVGKVEKALDLFEEMK 295

Query: 250 SRISDSTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQ 309
                 T ++   Y  L + L K+GRV E+  F++DM + G+  D    + L+     + 
Sbjct: 296 RAGCSPTVYT---YTELIKGLGKAGRVDEAYGFYKDMLRDGLTPDVVFLNNLMNILGKVG 355

Query: 310 EVKLAEDLYNEAKAKKLLRDPAMFLKLI-LMYVQQGSLEKALEIVEVMKDFKIGVSDCIF 369
            V+   ++++E    +       +  +I  ++  +  + +     + MK   +  S+  +
Sbjct: 356 RVEELTNVFSEMGMWRCTPTVVSYNTVIKALFESKAHVSEVSSWFDKMKADSVSPSEFTY 415

Query: 370 CAIVNGYATRRGYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEE 429
             +++GY      E A+ + E++ E G  P    Y S INA  +   Y  A ++F E++E
Sbjct: 416 SILIDGYCKTNRVEKALLLLEEMDEKGFPPCPAAYCSLINALGKAKRYEAANELFKELKE 475

Query: 430 KGFDKCVVAYSSLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQ 489
              +     Y+ +I  +GK G+L +A+ L  +MK +G  P+V+ YN LM    KA  + +
Sbjct: 476 NFGNVSSRVYAVMIKHFGKCGKLSEAVDLFNEMKNQGSGPDVYAYNALMSGMVKAGMINE 535

Query: 490 VEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAFGGIMVG 549
              L ++M+      D  S+  I++ + +     +  + +   + +G   D      ++G
Sbjct: 536 ANSLLRKMEENGCRADINSHNIILNGFARTGVPRRAIEMFETIKHSGIKPDGVTYNTLLG 595

Query: 550 VFSKTSRVDELVKLLRDMKLEGTRLDERLYRTALNAL 586
            F+     +E  +++R+MK +G   D   Y + L+A+
Sbjct: 596 CFAHAGMFEEAARMMREMKDKGFEYDAITYSSILDAV 629

BLAST of CsGy5G012250 vs. TAIR 10
Match: AT5G42310.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 144.4 bits (363), Expect = 2.9e-34
Identity = 104/447 (23.27%), Postives = 204/447 (45.64%), Query Frame = 0

Query: 143 DFVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVFERNSGVAMTAFEAAMRGYNKLHM 202
           DFV++ +     T S  + S +  R +K      E+      + +      + G+ K   
Sbjct: 231 DFVNYSLVIQSLTRSNKIDSVMLLRLYK------EIERDKLELDVQLVNDIIMGFAKSGD 290

Query: 203 HKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVME---LFNEVESRISDSTPFS 262
               + +    ++  + A +     ++ A   L DS R +E   LF E+  R S   P  
Sbjct: 291 PSKALQLLGMAQATGLSAKTATLVSIISA---LADSGRTLEAEALFEEL--RQSGIKP-R 350

Query: 263 TKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYN 322
           T+ Y  L +   K+G + ++     +M K+G++ D   YS LI  + +    + A  +  
Sbjct: 351 TRAYNALLKGYVKTGPLKDAESMVSEMEKRGVSPDEHTYSLLIDAYVNAGRWESARIVLK 410

Query: 323 EAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRR 382
           E +A  +  +  +F +L+  +  +G  +K  ++++ MK   +      +  +++ +    
Sbjct: 411 EMEAGDVQPNSFVFSRLLAGFRDRGEWQKTFQVLKEMKSIGVKPDRQFYNVVIDTFGKFN 470

Query: 383 GYEAAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYS 442
             + A+  +++++ +G EP +VT+ + I+ +C+ G +  AE++F  ME +G   C   Y+
Sbjct: 471 CLDHAMTTFDRMLSEGIEPDRVTWNTLIDCHCKHGRHIVAEEMFEAMERRGCLPCATTYN 530

Query: 443 SLISMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRK 502
            +I+ YG   R  D  RLL KMK +G  PNV  +  L++++GK+       +  +EMK  
Sbjct: 531 IMINSYGDQERWDDMKRLLGKMKSQGILPNVVTHTTLVDVYGKSGRFNDAIECLEEMKSV 590

Query: 503 KIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAFGGIMVGVFSKTSRVDEL 562
            + P    Y ++I+AY +    E+    +R    +G          ++  F +  R  E 
Sbjct: 591 GLKPSSTMYNALINAYAQRGLSEQAVNAFRVMTSDGLKPSLLALNSLINAFGEDRRDAEA 650

Query: 563 VKLLRDMKLEGTRLDERLYRTALNALM 587
             +L+ MK  G + D   Y T + AL+
Sbjct: 651 FAVLQYMKENGVKPDVVTYTTLMKALI 665

BLAST of CsGy5G012250 vs. TAIR 10
Match: AT2G35130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 144.1 bits (362), Expect = 3.8e-34
Identity = 96/414 (23.19%), Postives = 190/414 (45.89%), Query Frame = 0

Query: 190 FEAAMRGYNKLHMHKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVE 249
           F   +  Y +   +K    ++ +L  +R       Y  +++AY   G  ER   +  E++
Sbjct: 158 FNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMAGLIERAEVVLVEMQ 217

Query: 250 SRISDSTPFSTKIYGILCESLAK-SGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASI 309
           +           +Y    E L K  G   E+++ F+ M++         Y+ +I  +   
Sbjct: 218 NHHVSPKTIGVTVYNAYIEGLMKRKGNTEEAIDVFQRMKRDRCKPTTETYNLMINLYGKA 277

Query: 310 QEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYVQQGSLEKALEIVEVMKDFKIGVSDCIF 369
            +  ++  LY E ++ +   +   +  L+  + ++G  EKA EI E +++  +     ++
Sbjct: 278 SKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKAEEIFEQLQEDGLEPDVYVY 337

Query: 370 CAIVNGYATRRGYE-AAVKVYEKLIEDGCEPGQVTYASAINAYCRVGLYSKAEDIFGEME 429
            A++  Y +R GY   A +++  +   GCEP + +Y   ++AY R GL+S AE +F EM+
Sbjct: 338 NALMESY-SRAGYPYGAAEIFSLMQHMGCEPDRASYNIMVDAYGRAGLHSDAEAVFEEMK 397

Query: 430 -----------------------------------EKGFDKCVVAYSSLISMYGKTGRLK 489
                                              E G +      +S++++YG+ G+  
Sbjct: 398 RLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVEPDTFVLNSMLNLYGRLGQFT 457

Query: 490 DAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII 549
              ++LA+M+   C  ++  YNIL+ ++GKA  L+++E+L+ E+K K   PD V++TS I
Sbjct: 458 KMEKILAEMENGPCTADISTYNILINIYGKAGFLERIEELFVELKEKNFRPDVVTWTSRI 517

Query: 550 SAYVKASEFEKCEQYYREFRMNGGTIDKAFGGIMVGVFSKTSRVDELVKLLRDM 567
            AY +   + KC + + E   +G   D     +++   S   +V+++  +LR M
Sbjct: 518 GAYSRKKLYVKCLEVFEEMIDSGCAPDGGTAKVLLSACSSEEQVEQVTSVLRTM 570

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q66GP49.1e-16655.05Pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Arabidop... [more]
Q6NQ831.2e-3524.02Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop... [more]
Q76C991.5e-3521.37Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV... [more]
Q9LW843.2e-3323.17Pentatricopeptide repeat-containing protein At3g16010 OS=Arabidopsis thaliana OX... [more]
Q8L8444.1e-3323.27Pentatricopeptide repeat-containing protein At5g42310, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_004151188.10.099.67pentatricopeptide repeat-containing protein At5g13770, chloroplastic [Cucumis sa... [more]
XP_008465506.10.096.05PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic ... [more]
KAA0067569.10.095.56pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYJ97187... [more]
XP_038874313.10.091.16pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 ... [more]
XP_022993436.10.086.91pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 ... [more]
Match NameE-valueIdentityDescription
A0A0A0KPV80.099.67Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G114580 PE=4 SV=1[more]
A0A1S3CPF00.096.05pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Cucumis ... [more]
A0A5A7VR010.095.56Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A6J1JWB40.086.91pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 ... [more]
A0A6J1FDV40.087.07pentatricopeptide repeat-containing protein At5g13770, chloroplastic OS=Cucurbit... [more]
Match NameE-valueIdentityDescription
AT5G13770.16.5e-16755.05Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT3G22470.18.2e-3724.02Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G16010.12.2e-3423.17Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT5G42310.12.9e-3423.27Pentatricopeptide repeat (PPR-like) superfamily protein [more]
AT2G35130.13.8e-3423.19Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 186..333
e-value: 5.0E-17
score: 64.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 499..600
e-value: 1.6E-14
score: 55.7
coord: 74..185
e-value: 9.6E-6
score: 27.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 334..497
e-value: 2.3E-41
score: 144.2
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 368..410
e-value: 0.0023
score: 18.0
coord: 422..482
e-value: 3.2E-14
score: 52.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 545..570
e-value: 1.4
score: 9.3
coord: 506..535
e-value: 1.1E-4
score: 22.2
coord: 225..250
e-value: 0.1
score: 12.9
coord: 336..360
e-value: 0.27
score: 11.6
coord: 268..291
e-value: 0.0078
score: 16.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 506..535
e-value: 6.4E-6
score: 24.0
coord: 401..431
e-value: 7.9E-8
score: 30.0
coord: 472..504
e-value: 2.1E-7
score: 28.7
coord: 436..470
e-value: 2.2E-9
score: 34.9
coord: 367..398
e-value: 0.0022
score: 16.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 364..398
score: 9.152743
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 469..503
score: 10.961357
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 504..538
score: 9.13082
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 259..293
score: 9.711769
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 399..433
score: 11.421732
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 434..468
score: 12.857662
NoneNo IPR availablePANTHERPTHR47934:SF14OSJNBA0088A01.11 PROTEINcoord: 1..602
NoneNo IPR availablePANTHERPTHR47934PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN PET309, MITOCHONDRIALcoord: 1..602
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 270..475

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy5G012250.1CsGy5G012250.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009658 chloroplast organization
molecular_function GO:0003729 mRNA binding
molecular_function GO:0005515 protein binding