Chy4G080660 (gene) Cucumber (hystrix) v1

Overview
NameChy4G080660
Typegene
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionPentatricopeptide repeat-containing protein DOT4
LocationchrH04: 20109282 .. 20111273 (-)
RNA-Seq ExpressionChy4G080660
SyntenyChy4G080660
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATCTTCTTCTCTCCACCCACACCCACTGTCTCCCCATTACCCAGAAACCCAATCACGCATACCATCGCCACCCACCATTTAATAATCTCCCTCATGCTATGACGGTAGAGAATTATGCTAATTTATGCATAGCCCACCAAGTGTTCGACGATATTCCTATCTGGGATACTTTTGCATGGAACAATCTGATTCAAACCCATCTCACCAATGGAGATTTGGGGCATGTAATTTCAACCTATCGACAGATGTTATTTCGTGGAGTTCGTCCTGACAAACACACATTTCCTCGAATTATATGTGCTACACGCCAGTATGGTGATCTACAGGTTGGCAAACAGCTCCACGCTCAAGCCTTCAAACTTGGGTTCTCCTCTAACCTTTATGTACTTACTTCCTTAATTGAGTTGTATGGGATTCTTGATAGTGCTGACACTGCAAAGTGGCTTCATGACAAGTCCACTTGTAGAAACTCTGTTTCTTGGACCATTTTAGCTAAGCTGTACTTGAGGGAAGATAAACCCAGTTTTGCCCTAGACTTGTTTTACCAAATGGTGGAGTTGGCGGATGATATTGATGCAGTGGCTTTGGCCACGGCCATTGGTGCCTGTGGTGCTCGCAAAATGCTGTATCATGGAAGAAACATCCACCATCTTGCAAGAATTCATGGCTTGGAATTTAATATATTGGTCAGTAATTCTCTATTGAAAATGTACATCGACTGTGATAGTATCAAAGATGCTCGGGGGTTCTTCGACCAAATGCCGTCCAAAGATATCATTTCCTGGACAGAACTTATCCATATGTACGTTAAGAAAGGTGGAATCAATGAGGCCTTTAAGCTGTTTCGACAAATGAATATGGATGGAGAATTGAAGCCTGATCCTCGTACAATCAGCAGCATTCTCCCAGCTTGTGGAAGAATGGCTGCACATAAGCATGGAAAAGAGGTTCATGGATATGTGGTTAAAAATGCTTTTGACGAGAATCTCATCGTCCAAAATGCTTTGGTTGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCAAAAACTTTCTCGATGATGAAGGAGAAAGATATGGTTTCGTGGAGCATCATGACTTTGGGCTACAGCTTACATGGTCAAGGAAAACTGGGAGTCAGTTTGTTCCGTGAGATGGAGAAGAACTTTAAGATGCTTAGAGATGAGATCACTTACACTGCAGTTTTGCATGCTTGTACTACTGCAAACATGGTAGATGAAGGGGATTCTTACTTCAGTTGCATTACCAAACCAACCGTGGCACACATTGCTCTAAAGGTGGCTCTTTTAGCTCGAGCGGGGAGACTGGATGAAGCTAGGACATTTATAGAAAAAAAGAAACTTGATAAACATCCTGAGATTTTGAGAGCATTGCTCGATGGATGCCGGAACCACCGTCAACAAAAACTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTGGAGCCTCTAAATGCTGAGAATTACATTCTACTTTCGAACTGGTATGCATGCAACGACAAATGGGGCATGGTTGAAAAGTTGAGAGAAACAATAAGAGACATGGGATTAAGACCCAAGAAGGCTTATAGTTGGATAGAGTTCTGCAACAAAATTCACGTGTTCGGGACAGGGGATGTATCCCACCCGAGATCACAAAACATATATTGGAATTTACAGTGCTTAATGAAAAAAATGGAAGAAGATGGTTCCAAGCCGAATCCGGATTTCAGCTTGCACGACGTCGACGAGGAGCGAGAGTGTGTTCCAATAGGACACAGCGAACTGTTGGCGATTTCATTCGGGCTGATTAGTACAGAAGCAGGAAGGACAATTCGTATTACAAAGAACCTTCGTGTATGTCATAGTTGTCATGAGTCTGCAAAGTTTATATCCAAGATGGTTGGCCGAGAAATCATAGTAAAAGATCCTCGTGTCTTCCATCATTTTAAAGATGGTTGCTGCTCTTGTGAAAACTTTTGTTAG

mRNA sequence

ATGAATCTTCTTCTCTCCACCCACACCCACTGTCTCCCCATTACCCAGAAACCCAATCACGCATACCATCGCCACCCACCATTTAATAATCTCCCTCATGCTATGACGGTAGAGAATTATGCTAATTTATGCATAGCCCACCAAGTGTTCGACGATATTCCTATCTGGGATACTTTTGCATGGAACAATCTGATTCAAACCCATCTCACCAATGGAGATTTGGGGCATGTAATTTCAACCTATCGACAGATGTTATTTCGTGGAGTTCGTCCTGACAAACACACATTTCCTCGAATTATATGTGCTACACGCCAGTATGGTGATCTACAGGTTGGCAAACAGCTCCACGCTCAAGCCTTCAAACTTGGGTTCTCCTCTAACCTTTATGTACTTACTTCCTTAATTGAGTTGTATGGGATTCTTGATAGTGCTGACACTGCAAAGTGGCTTCATGACAAGTCCACTTGTAGAAACTCTGTTTCTTGGACCATTTTAGCTAAGCTGTACTTGAGGGAAGATAAACCCAGTTTTGCCCTAGACTTGTTTTACCAAATGGTGGAGTTGGCGGATGATATTGATGCAGTGGCTTTGGCCACGGCCATTGGTGCCTGTGGTGCTCGCAAAATGCTGTATCATGGAAGAAACATCCACCATCTTGCAAGAATTCATGGCTTGGAATTTAATATATTGGTCAGTAATTCTCTATTGAAAATGTACATCGACTGTGATAGTATCAAAGATGCTCGGGGGTTCTTCGACCAAATGCCGTCCAAAGATATCATTTCCTGGACAGAACTTATCCATATGTACGTTAAGAAAGGTGGAATCAATGAGGCCTTTAAGCTGTTTCGACAAATGAATATGGATGGAGAATTGAAGCCTGATCCTCGTACAATCAGCAGCATTCTCCCAGCTTGTGGAAGAATGGCTGCACATAAGCATGGAAAAGAGGTTCATGGATATGTGGTTAAAAATGCTTTTGACGAGAATCTCATCGTCCAAAATGCTTTGGTTGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCAAAAACTTTCTCGATGATGAAGGAGAAAGATATGGTTTCGTGGAGCATCATGACTTTGGGCTACAGCTTACATGGTCAAGGAAAACTGGGAGTCAGTTTGTTCCGTGAGATGGAGAAGAACTTTAAGATGCTTAGAGATGAGATCACTTACACTGCAGTTTTGCATGCTTGTACTACTGCAAACATGGTAGATGAAGGGGATTCTTACTTCAGTTGCATTACCAAACCAACCGTGGCACACATTGCTCTAAAGGTGGCTCTTTTAGCTCGAGCGGGGAGACTGGATGAAGCTAGGACATTTATAGAAAAAAAGAAACTTGATAAACATCCTGAGATTTTGAGAGCATTGCTCGATGGATGCCGGAACCACCGTCAACAAAAACTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTGGAGCCTCTAAATGCTGAGAATTACATTCTACTTTCGAACTGGTATGCATGCAACGACAAATGGGGCATGGTTGAAAAGTTGAGAGAAACAATAAGAGACATGGGATTAAGACCCAAGAAGGCTTATAGTTGGATAGAGTTCTGCAACAAAATTCACGTGTTCGGGACAGGGGATGTATCCCACCCGAGATCACAAAACATATATTGGAATTTACAGTGCTTAATGAAAAAAATGGAAGAAGATGGTTCCAAGCCGAATCCGGATTTCAGCTTGCACGACGTCGACGAGGAGCGAGAGTGTGTTCCAATAGGACACAGCGAACTGTTGGCGATTTCATTCGGGCTGATTAGTACAGAAGCAGGAAGGACAATTCGTATTACAAAGAACCTTCGTGTATGTCATAGTTGTCATGAGTCTGCAAAGTTTATATCCAAGATGGTTGGCCGAGAAATCATAGTAAAAGATCCTCGTGTCTTCCATCATTTTAAAGATGGTTGCTGCTCTTGTGAAAACTTTTGTTAG

Coding sequence (CDS)

ATGAATCTTCTTCTCTCCACCCACACCCACTGTCTCCCCATTACCCAGAAACCCAATCACGCATACCATCGCCACCCACCATTTAATAATCTCCCTCATGCTATGACGGTAGAGAATTATGCTAATTTATGCATAGCCCACCAAGTGTTCGACGATATTCCTATCTGGGATACTTTTGCATGGAACAATCTGATTCAAACCCATCTCACCAATGGAGATTTGGGGCATGTAATTTCAACCTATCGACAGATGTTATTTCGTGGAGTTCGTCCTGACAAACACACATTTCCTCGAATTATATGTGCTACACGCCAGTATGGTGATCTACAGGTTGGCAAACAGCTCCACGCTCAAGCCTTCAAACTTGGGTTCTCCTCTAACCTTTATGTACTTACTTCCTTAATTGAGTTGTATGGGATTCTTGATAGTGCTGACACTGCAAAGTGGCTTCATGACAAGTCCACTTGTAGAAACTCTGTTTCTTGGACCATTTTAGCTAAGCTGTACTTGAGGGAAGATAAACCCAGTTTTGCCCTAGACTTGTTTTACCAAATGGTGGAGTTGGCGGATGATATTGATGCAGTGGCTTTGGCCACGGCCATTGGTGCCTGTGGTGCTCGCAAAATGCTGTATCATGGAAGAAACATCCACCATCTTGCAAGAATTCATGGCTTGGAATTTAATATATTGGTCAGTAATTCTCTATTGAAAATGTACATCGACTGTGATAGTATCAAAGATGCTCGGGGGTTCTTCGACCAAATGCCGTCCAAAGATATCATTTCCTGGACAGAACTTATCCATATGTACGTTAAGAAAGGTGGAATCAATGAGGCCTTTAAGCTGTTTCGACAAATGAATATGGATGGAGAATTGAAGCCTGATCCTCGTACAATCAGCAGCATTCTCCCAGCTTGTGGAAGAATGGCTGCACATAAGCATGGAAAAGAGGTTCATGGATATGTGGTTAAAAATGCTTTTGACGAGAATCTCATCGTCCAAAATGCTTTGGTTGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCAAAAACTTTCTCGATGATGAAGGAGAAAGATATGGTTTCGTGGAGCATCATGACTTTGGGCTACAGCTTACATGGTCAAGGAAAACTGGGAGTCAGTTTGTTCCGTGAGATGGAGAAGAACTTTAAGATGCTTAGAGATGAGATCACTTACACTGCAGTTTTGCATGCTTGTACTACTGCAAACATGGTAGATGAAGGGGATTCTTACTTCAGTTGCATTACCAAACCAACCGTGGCACACATTGCTCTAAAGGTGGCTCTTTTAGCTCGAGCGGGGAGACTGGATGAAGCTAGGACATTTATAGAAAAAAAGAAACTTGATAAACATCCTGAGATTTTGAGAGCATTGCTCGATGGATGCCGGAACCACCGTCAACAAAAACTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTGGAGCCTCTAAATGCTGAGAATTACATTCTACTTTCGAACTGGTATGCATGCAACGACAAATGGGGCATGGTTGAAAAGTTGAGAGAAACAATAAGAGACATGGGATTAAGACCCAAGAAGGCTTATAGTTGGATAGAGTTCTGCAACAAAATTCACGTGTTCGGGACAGGGGATGTATCCCACCCGAGATCACAAAACATATATTGGAATTTACAGTGCTTAATGAAAAAAATGGAAGAAGATGGTTCCAAGCCGAATCCGGATTTCAGCTTGCACGACGTCGACGAGGAGCGAGAGTGTGTTCCAATAGGACACAGCGAACTGTTGGCGATTTCATTCGGGCTGATTAGTACAGAAGCAGGAAGGACAATTCGTATTACAAAGAACCTTCGTGTATGTCATAGTTGTCATGAGTCTGCAAAGTTTATATCCAAGATGGTTGGCCGAGAAATCATAGTAAAAGATCCTCGTGTCTTCCATCATTTTAAAGATGGTTGCTGCTCTTGTGAAAACTTTTGTTAG

Protein sequence

MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHAMTVENYANLCIAHQVFDDIPIWDTFAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFALDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGLEFNILVSNSLLKMYIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRTISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMVDEGDSYFSCITKPTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPKKAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPRVFHHFKDGCCSCENFC*
Homology
BLAST of Chy4G080660 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 440.7 bits (1132), Expect = 3.0e-122
Identity = 231/623 (37.08%), Postives = 357/623 (57.30%), Query Frame = 0

Query: 46  AHQVFDDIPIWDTFAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQ 105
           A +VFD++   D  +WN++I  +++NG     +S + QML  G+  D  T   +      
Sbjct: 249 ARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCAD 308

Query: 106 YGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTIL 165
              + +G+ +H+   K  FS       +L+++Y      D+AK +  + + R+ VS+T +
Sbjct: 309 SRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSM 368

Query: 166 AKLYLREDKPSFALDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGL 225
              Y RE     A+ LF +M E     D   +   +  C   ++L  G+ +H   + + L
Sbjct: 369 IAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDL 428

Query: 226 EFNILVSNSLLKMYIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQ 285
            F+I VSN+L+ MY  C S+++A   F +M  KDIISW  +I  Y K    NEA  LF  
Sbjct: 429 GFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNL 488

Query: 286 MNMDGELKPDPRTISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSG 345
           +  +    PD RT++ +LPAC  ++A   G+E+HGY+++N +  +  V N+LVDMY K G
Sbjct: 489 LLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCG 548

Query: 346 CIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVL 405
            +  A   F  +  KD+VSW++M  GY +HG GK  ++LF +M +   +  DEI++ ++L
Sbjct: 549 ALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQM-RQAGIEADEISFVSLL 608

Query: 406 HACTTANMVDEGDSYFS-----CITKPTVAHIALKVALLARAGRLDEARTFIEKKKLDKH 465
           +AC+ + +VDEG  +F+     C  +PTV H A  V +LAR G L +A  FIE   +   
Sbjct: 609 YACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPD 668

Query: 466 PEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRET 525
             I  ALL GCR H   KL +++ E++ +LEP N   Y+L++N YA  +KW  V++LR+ 
Sbjct: 669 ATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKR 728

Query: 526 IRDMGLRPKKAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFS 585
           I   GLR     SWIE   ++++F  GD S+P ++NI   L+ +  +M E+G  P   ++
Sbjct: 729 IGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYA 788

Query: 586 LHDVDE-ERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGR 645
           L D +E E+E    GHSE LA++ G+IS+  G+ IR+TKNLRVC  CHE AKF+SK+  R
Sbjct: 789 LIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRR 848

Query: 646 EIIVKDPRVFHHFKDGCCSCENF 663
           EI+++D   FH FKDG CSC  F
Sbjct: 849 EIVLRDSNRFHQFKDGHCSCRGF 870

BLAST of Chy4G080660 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 416.0 bits (1068), Expect = 8.0e-115
Identity = 221/631 (35.02%), Postives = 350/631 (55.47%), Query Frame = 0

Query: 39  NYANLCIAHQVFDDIPIWDTFAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPR 98
           ++ ++  A QVFDD+P    F WN +I+ +  N      +  Y  M    V PD  TFP 
Sbjct: 65  SFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPH 124

Query: 99  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTC-- 158
           ++ A      LQ+G+ +HAQ F+LGF ++++V   LI LY       +A+ + +      
Sbjct: 125 LLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPE 184

Query: 159 RNSVSWTILAKLYLREDKPSFALDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNI 218
           R  VSWT +   Y +  +P  AL++F QM ++    D VAL + + A    + L  GR+I
Sbjct: 185 RTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSI 244

Query: 219 HHLARIHGLEFNILVSNSLLKMYIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGI 278
           H      GLE    +  SL  MY  C  +  A+  FD+M S ++I W  +I  Y K G  
Sbjct: 245 HASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYA 304

Query: 279 NEAFKLFRQMNMDGELKPDPRTISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNA 338
            EA  +F +M ++ +++PD  +I+S + AC ++ + +  + ++ YV ++ + +++ + +A
Sbjct: 305 REAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSA 364

Query: 339 LVDMYVKSGCIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLR 398
           L+DM+ K G ++ A   F    ++D+V WS M +GY LHG+ +  +SL+R ME+   +  
Sbjct: 365 LIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERG-GVHP 424

Query: 399 DEITYTAVLHACTTANMVDEGDSYFSCIT----KPTVAHIALKVALLARAGRLDEARTFI 458
           +++T+  +L AC  + MV EG  +F+ +      P   H A  + LL RAG LD+A   I
Sbjct: 425 NDVTFLGLLMACNHSGMVREGWWFFNRMADHKINPQQQHYACVIDLLGRAGHLDQAYEVI 484

Query: 459 EKKKLDKHPEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWG 518
           +   +     +  ALL  C+ HR  +LG+   +QL  ++P N  +Y+ LSN YA    W 
Sbjct: 485 KCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWD 544

Query: 519 MVEKLRETIRDMGLRPKKAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDG 578
            V ++R  +++ GL      SW+E   ++  F  GD SHPR + I   ++ +  +++E G
Sbjct: 545 RVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGG 604

Query: 579 SKPNPDFSLHDV-DEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAK 638
              N D SLHD+ DEE E     HSE +AI++GLIST  G  +RITKNLR C +CH + K
Sbjct: 605 FVANKDASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATK 664

Query: 639 FISKMVGREIIVKDPRVFHHFKDGCCSCENF 663
            ISK+V REI+V+D   FHHFKDG CSC ++
Sbjct: 665 LISKLVDREIVVRDTNRFHHFKDGVCSCGDY 693

BLAST of Chy4G080660 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 389.0 bits (998), Expect = 1.0e-106
Identity = 222/675 (32.89%), Postives = 353/675 (52.30%), Query Frame = 0

Query: 34  AMTVENYANLCIAHQ---VFDDIPIWDTFAWNNLIQTHLTNGDLGHVISTYRQMLFRGVR 93
           ++ +  Y NL + H+   +F  +      AW ++I+           ++++ +M   G  
Sbjct: 43  SIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQSLFSKALASFVEMRASGRC 102

Query: 94  PDKHTFPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGIL--------- 153
           PD + FP ++ +     DL+ G+ +H    +LG   +LY   +L+ +Y  L         
Sbjct: 103 PDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISV 162

Query: 154 ---------------------------DSADTAKWLHDKSTCRNSVSWTILAKLYLREDK 213
                                         D+ + + +    ++ VS+  +   Y +   
Sbjct: 163 GNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGM 222

Query: 214 PSFALDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGLEFNILVSNS 273
              AL +  +M       D+  L++ +        +  G+ IH      G++ ++ + +S
Sbjct: 223 YEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSS 282

Query: 274 LLKMYIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKP 333
           L+ MY     I+D+   F ++  +D ISW  L+  YV+ G  NEA +LFRQM +  ++KP
Sbjct: 283 LVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQM-VTAKVKP 342

Query: 334 DPRTISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTF 393
                SS++PAC  +A    GK++HGYV++  F  N+ + +ALVDMY K G I++A K F
Sbjct: 343 GAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIF 402

Query: 394 SMMKEKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMV 453
             M   D VSW+ + +G++LHG G   VSLF EM++   +  +++ + AVL AC+   +V
Sbjct: 403 DRMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQ-GVKPNQVAFVAVLTACSHVGLV 462

Query: 454 DEGDSYFSCITK-----PTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLD 513
           DE   YF+ +TK       + H A    LL RAG+L+EA  FI K  ++    +   LL 
Sbjct: 463 DEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLS 522

Query: 514 GCRNHRQQKLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPK 573
            C  H+  +L +++ E++  ++  N   Y+L+ N YA N +W  + KLR  +R  GLR K
Sbjct: 523 SCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKK 582

Query: 574 KAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEE-- 633
            A SWIE  NK H F +GD SHP    I   L+ +M++ME++G   +    LHDVDEE  
Sbjct: 583 PACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHK 642

Query: 634 RECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPR 663
           RE +  GHSE LA++FG+I+TE G TIR+TKN+R+C  CH + KFISK+  REIIV+D  
Sbjct: 643 RELL-FGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNS 702

BLAST of Chy4G080660 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 387.5 bits (994), Expect = 3.0e-106
Identity = 231/731 (31.60%), Postives = 364/731 (49.79%), Query Frame = 0

Query: 14  ITQKPNHAYHRHPPFNNLPHAMTVENYANLCIAH----------QVFDDIPIWDTFAWNN 73
           +  K  +A H    F+ +P   T  ++  +  A+          + FD +P  D+ +W  
Sbjct: 58  VYSKTGYALHARKLFDEMP-LRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTT 117

Query: 74  LIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQAFKLG 133
           +I  +   G     I     M+  G+ P + T   ++ +      ++ GK++H+   KLG
Sbjct: 118 MIVGYKNIGQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLG 177

Query: 134 FSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFALDLFY 193
              N+ V  SL+ +Y        AK++ D+   R+  SW  +  L+++  +   A+  F 
Sbjct: 178 LRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFE 237

Query: 194 QMVE-----------------------------LADDI---DAVALATAIGACGARKMLY 253
           QM E                             L D +   D   LA+ + AC   + L 
Sbjct: 238 QMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLC 297

Query: 254 HGRNIHHLARIHGLEFNILVSNSLLKMYIDCDSIKDARGFFDQ----------------- 313
            G+ IH      G + + +V N+L+ MY  C  ++ AR   +Q                 
Sbjct: 298 IGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDG 357

Query: 314 ----------------MPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 373
                           +  +D+++WT +I  Y + G   EA  LFR M + G  +P+  T
Sbjct: 358 YIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSM-VGGGQRPNSYT 417

Query: 374 ISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 433
           ++++L     +A+  HGK++HG  VK+    ++ V NAL+ MY K+G I SAS+ F +++
Sbjct: 418 LAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIR 477

Query: 434 -EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMVDEG 493
            E+D VSW+ M +  + HG  +  + LF  M     +  D ITY  V  ACT A +V++G
Sbjct: 478 CERDTVSWTSMIIALAQHGHAEEALELFETMLME-GLRPDHITYVGVFSACTHAGLVNQG 537

Query: 494 DSYFSCITK-----PTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLDGCR 553
             YF  +       PT++H A  V L  RAG L EA+ FIEK  ++       +LL  CR
Sbjct: 538 RQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACR 597

Query: 554 NHRQQKLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPKKAY 613
            H+   LGK   E+L  LEP N+  Y  L+N Y+   KW    K+R++++D  ++ ++ +
Sbjct: 598 VHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGF 657

Query: 614 SWIEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEE-RECV 663
           SWIE  +K+HVFG  D +HP    IY  ++ +  ++++ G  P+    LHD++EE +E +
Sbjct: 658 SWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQI 717

BLAST of Chy4G080660 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 1.2e-105
Identity = 209/622 (33.60%), Postives = 337/622 (54.18%), Query Frame = 0

Query: 46  AHQVFDDIPIWDTFAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQ 105
           A +VFD +P  D  +WN ++  +  NG     +   + M    ++P   T   ++ A   
Sbjct: 189 ARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSA 248

Query: 106 YGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTIL 165
              + VGK++H  A + GF S + + T+L+++Y    S +TA+ L D    RN VSW  +
Sbjct: 249 LRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSM 308

Query: 166 AKLYLREDKPSFALDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGL 225
              Y++ + P  A+ +F +M++       V++  A+ AC     L  GR IH L+   GL
Sbjct: 309 IDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGL 368

Query: 226 EFNILVSNSLLKMYIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQ 285
           + N+ V NSL+ MY  C  +  A   F ++ S+ ++SW  +I  + + G   +A   F Q
Sbjct: 369 DRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQ 428

Query: 286 MNMDGELKPDPRTISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSG 345
           M     +KPD  T  S++ A   ++   H K +HG V+++  D+N+ V  ALVDMY K G
Sbjct: 429 MR-SRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCG 488

Query: 346 CIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVL 405
            I  A   F MM E+ + +W+ M  GY  HG GK  + LF EM+K   +  + +T+ +V+
Sbjct: 489 AIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKG-TIKPNGVTFLSVI 548

Query: 406 HACTTANMVDEGDSYFSCITKP-----TVAHIALKVALLARAGRLDEARTFIEKKKLDKH 465
            AC+ + +V+ G   F  + +      ++ H    V LL RAGRL+EA  FI +  +   
Sbjct: 549 SACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPA 608

Query: 466 PEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRET 525
             +  A+L  C+ H+     ++  E+L +L P +   ++LL+N Y     W  V ++R +
Sbjct: 609 VNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVS 668

Query: 526 IRDMGLRPKKAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFS 585
           +   GLR     S +E  N++H F +G  +HP S+ IY  L+ L+  ++E G  P+ +  
Sbjct: 669 MLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLV 728

Query: 586 LHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGRE 645
           L   ++ +E +   HSE LAISFGL++T AG TI + KNLRVC  CH + K+IS + GRE
Sbjct: 729 LGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGRE 788

Query: 646 IIVKDPRVFHHFKDGCCSCENF 663
           I+V+D + FHHFK+G CSC ++
Sbjct: 789 IVVRDMQRFHHFKNGACSCGDY 808

BLAST of Chy4G080660 vs. ExPASy TrEMBL
Match: A0A1S3CPR5 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103502829 PE=3 SV=1)

HSP 1 Score: 1312.7 bits (3396), Expect = 0.0e+00
Identity = 628/665 (94.44%), Postives = 643/665 (96.69%), Query Frame = 0

Query: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPH--AMTVENYANLCIAHQVFDDIPIWDT 60
           MNLLLSTHTHCLPITQKP HAYHRHPPFNNLPH    TVENYA+LC+AHQVFD+IPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPYHAYHRHPPFNNLPHVRTTTVENYADLCVAHQVFDEIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGD GHVIS YRQMLFRGVRPDKHT PRIICATRQYGDL VGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDWGHVISIYRQMLFRGVRPDKHTLPRIICATRQYGDLPVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180
           AFKLGFSS+LYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA
Sbjct: 121 AFKLGFSSDLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180

Query: 181 LDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGLEFNILVSNSLLKM 240
           +DLFYQMVELADDID+VALATAIGACGA KML+HGRNIHHLARIHGLEFNILVSNSLLKM
Sbjct: 181 IDLFYQMVELADDIDSVALATAIGACGALKMLHHGRNIHHLARIHGLEFNILVSNSLLKM 240

Query: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300
           Y+DCDSIKDARGFFDQMPSKD+ISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDP T
Sbjct: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHGKE+HGYV+KN FDENLIVQNALVDMYVKSGCIQSASKTFSMMK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVLKNGFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMVDEGD 420
           EKDMVSWSIMTLGYSLHGQGKLGV LFREMEKN KM RDEITYTAVLHACTTANMVDEGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVGLFREMEKNLKMHRDEITYTAVLHACTTANMVDEGD 420

Query: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLDGCRNHRQQK 480
            YFS ITKPTVAHIALKVALLARAGRLDEARTF+EKKKL+KHPEILRALLDGCRNHRQQK
Sbjct: 421 FYFSRITKPTVAHIALKVALLARAGRLDEARTFVEKKKLNKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPKKAYSWIEFC 540
           LGKRIIEQLCDLEPLN ENYILLSNWYACN KW MVE+LRETIRDMGLRPKKAYSWIEFC
Sbjct: 481 LGKRIIEQLCDLEPLNTENYILLSNWYACNKKWDMVEELRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSK NP+FSLHDVDEERECVPIGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKTNPEFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPRVFHHFKDGCCS 660
           LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDP VFHHFKDGCCS
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CENFC 664
           CENFC
Sbjct: 661 CENFC 665

BLAST of Chy4G080660 vs. ExPASy TrEMBL
Match: A0A0A0L9N4 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G722890 PE=3 SV=1)

HSP 1 Score: 1263.1 bits (3267), Expect = 0.0e+00
Identity = 607/624 (97.28%), Postives = 616/624 (98.72%), Query Frame = 0

Query: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPH--AMTVENYANLCIAHQVFDDIPIWDT 60
           MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPH   MTVENYANLC+AHQVFDDIPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHVRTMTVENYANLCVAHQVFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHT PRIICATRQYGDLQVGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180
           AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWT+LAKLYLREDKPS A
Sbjct: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTVLAKLYLREDKPSLA 180

Query: 181 LDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGLEFNILVSNSLLKM 240
           LDLFYQMVELADDIDAVALATAIGACGA KML+HGRNIHHLAR+HGLEFNILVSNSLLKM
Sbjct: 181 LDLFYQMVELADDIDAVALATAIGACGALKMLHHGRNIHHLARVHGLEFNILVSNSLLKM 240

Query: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300
           YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT
Sbjct: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300

Query: 301 ISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHGKE+HGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMVDEGD 420
           EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKM RDEITYTAVLHACTTANMVDEGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMRRDEITYTAVLHACTTANMVDEGD 420

Query: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLDGCRNHRQQK 480
           SYFSCITKPTVAHIALKVALLARAGRLDEARTF+EKKKLDKHPEILRALLDGCRNHRQQK
Sbjct: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFVEKKKLDKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPKKAYSWIEFC 540
           LGKRIIEQLCDLEPLNAENYILLSNWYACN+KW MVEKLRETIRDMGLRPKKAYSWIEFC
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNLQCLMK+MEEDGSKPNPDFSLHDVDEERECVPIGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKEMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRV 623
           LAISFGLISTEAGRTIRITKNLR+
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRM 624

BLAST of Chy4G080660 vs. ExPASy TrEMBL
Match: A0A6J1I9E1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111470851 PE=3 SV=1)

HSP 1 Score: 1154.4 bits (2985), Expect = 0.0e+00
Identity = 546/665 (82.11%), Postives = 598/665 (89.92%), Query Frame = 0

Query: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPH--AMTVENYANLCIAHQVFDDIPIWDT 60
           M+LLLST  H LP+TQKPNH YHRH  FNN PH    T E  A+LC+AHQ+FDDIPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLT+GD+GHVISTY+QML RGVRPD HT PR+ICA+R YGDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180
           AFKLG  SNLYV TSLIELYGILDSADTA+WLHDKS CRN+VSWT+LAKLYL EDKPSF+
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 LDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGLEFNILVSNSLLKM 240
           LDLFYQMVELA DIDAVALATAIGACGARK+L HGRNIHH+ARIHGLEF++LVSN LLKM
Sbjct: 181 LDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300
           Y+DC SIKDARG F++MP +DIISWT+LIH YVK GGINEA KLFRQMNMDGELKPDP T
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGR+AAHKHG+E+HGYV+KN FD+NLIVQNALVDMYVKSGCIQSA K FS MK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNDFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMVDEGD 420
           EKDMVSW++M  GYSLHGQGKLGV LFREM++NF++ RDEITYTAVL +C+TA+MV+EGD
Sbjct: 361 EKDMVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQSCSTASMVEEGD 420

Query: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLDGCRNHRQQK 480
            YF+CIT+PT+AH  LKVALL RAGR DEARTF++K KLDK+ EILRALLDGCR H Q K
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQHK 480

Query: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPKKAYSWIEFC 540
           LGKRIIEQLCDLEPLNAENY+LLSNWYA N++W MVEKLR+TIRDMGLRPKKAYSW+EF 
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDG K N DF  HDVDEEREC PIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECAPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPRVFHHFKDGCCS 660
           LAISFGLISTEAGRTIRI+KNLRVCHSCHESAKFIS  VGREIIVKDP VFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRISKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CENFC 664
           CE+FC
Sbjct: 661 CEDFC 665

BLAST of Chy4G080660 vs. ExPASy TrEMBL
Match: A0A5A7T6A7 (Pentatricopeptide repeat-containing protein DOT4 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold1593G00200 PE=3 SV=1)

HSP 1 Score: 1151.0 bits (2976), Expect = 0.0e+00
Identity = 553/580 (95.34%), Postives = 565/580 (97.41%), Query Frame = 0

Query: 84  MLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDS 143
           MLFRGVRPDKHT PRIICATRQYGDL VGKQLHAQAFKLGFSS+LYVLTSLIELYGILDS
Sbjct: 1   MLFRGVRPDKHTLPRIICATRQYGDLPVGKQLHAQAFKLGFSSDLYVLTSLIELYGILDS 60

Query: 144 ADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFALDLFYQMVELADDIDAVALATAIGA 203
           ADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA+DLFYQMVELADDID+VALATAIGA
Sbjct: 61  ADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFAIDLFYQMVELADDIDSVALATAIGA 120

Query: 204 CGARKMLYHGRNIHHLARIHGLEFNILVSNSLLKMYIDCDSIKDARGFFDQMPSKDIISW 263
           CGA KML+HGRNIHHLARIHGLEFNILVSNSLLKMY+DCDSIKDARGFFDQMPSKD+ISW
Sbjct: 121 CGALKMLHHGRNIHHLARIHGLEFNILVSNSLLKMYLDCDSIKDARGFFDQMPSKDVISW 180

Query: 264 TELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRTISSILPACGRMAAHKHGKEVHGYVV 323
           TELIHMYVKKGGINEAFKLFRQMNMDGELKPDP TISSILPACGRMAAHKHGKE+HGYV+
Sbjct: 181 TELIHMYVKKGGINEAFKLFRQMNMDGELKPDPLTISSILPACGRMAAHKHGKEIHGYVL 240

Query: 324 KNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGVS 383
           KN FDENLIVQNALVDMYVKSGCIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGV 
Sbjct: 241 KNGFDENLIVQNALVDMYVKSGCIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGVG 300

Query: 384 LFREMEKNFKMLRDEITYTAVLHACTTANMVDEGDSYFSCITKPTVAHIALKVALLARAG 443
           LFREMEKN KM RDEITYTAVLHACTTANMVDEGD YFS ITKPTVAHIALKVALLARAG
Sbjct: 301 LFREMEKNLKMHRDEITYTAVLHACTTANMVDEGDFYFSRITKPTVAHIALKVALLARAG 360

Query: 444 RLDEARTFIEKKKLDKHPEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLNAENYILLSN 503
           RLDEARTF+EKKKL+KHPEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLN ENYILLSN
Sbjct: 361 RLDEARTFVEKKKLNKHPEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLNTENYILLSN 420

Query: 504 WYACNDKWGMVEKLRETIRDMGLRPKKAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQC 563
           WYACN KW MVE+LRETIRDMGLRPKKAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQC
Sbjct: 421 WYACNKKWDMVEELRETIRDMGLRPKKAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQC 480

Query: 564 LMKKMEEDGSKPNPDFSLHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVC 623
           LMKKMEEDGSK NP+FSLHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVC
Sbjct: 481 LMKKMEEDGSKTNPEFSLHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVC 540

Query: 624 HSCHESAKFISKMVGREIIVKDPRVFHHFKDGCCSCENFC 664
           HSCHESAKFISKMVGREIIVKDP VFHHFKDGCCSCENFC
Sbjct: 541 HSCHESAKFISKMVGREIIVKDPYVFHHFKDGCCSCENFC 580

BLAST of Chy4G080660 vs. ExPASy TrEMBL
Match: A0A5D3CFC2 (Pentatricopeptide repeat-containing protein DOT4 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00420 PE=3 SV=1)

HSP 1 Score: 1147.5 bits (2967), Expect = 0.0e+00
Identity = 552/580 (95.17%), Postives = 564/580 (97.24%), Query Frame = 0

Query: 84  MLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDS 143
           MLFRGVRPDKHT PRIICATRQYGDL VGKQLHAQAFKLGFSS+LYVLTSLIELYGILDS
Sbjct: 1   MLFRGVRPDKHTLPRIICATRQYGDLPVGKQLHAQAFKLGFSSDLYVLTSLIELYGILDS 60

Query: 144 ADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFALDLFYQMVELADDIDAVALATAIGA 203
           ADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA+DLFYQMVELADDID+VALATAIGA
Sbjct: 61  ADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFAIDLFYQMVELADDIDSVALATAIGA 120

Query: 204 CGARKMLYHGRNIHHLARIHGLEFNILVSNSLLKMYIDCDSIKDARGFFDQMPSKDIISW 263
           CGA KML+HGRNIHHLARIHGLEFNILVSNSLLKMY+DCDSIKDARGFFDQMPSKD+ISW
Sbjct: 121 CGALKMLHHGRNIHHLARIHGLEFNILVSNSLLKMYLDCDSIKDARGFFDQMPSKDVISW 180

Query: 264 TELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRTISSILPACGRMAAHKHGKEVHGYVV 323
           TELIHMYVKKGGINEAFKLFRQMNMDGELKPDP TISSILPACGRMAAHKHGKE+HGYV+
Sbjct: 181 TELIHMYVKKGGINEAFKLFRQMNMDGELKPDPLTISSILPACGRMAAHKHGKEIHGYVL 240

Query: 324 KNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGVS 383
           KN FDENLIVQNALVDMYVKSGCIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGV 
Sbjct: 241 KNGFDENLIVQNALVDMYVKSGCIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGVG 300

Query: 384 LFREMEKNFKMLRDEITYTAVLHACTTANMVDEGDSYFSCITKPTVAHIALKVALLARAG 443
           LFREMEKN KM RDEITYTAVLHACTTANMVDEGD YFS ITKPTVAHIALKVALLARAG
Sbjct: 301 LFREMEKNLKMHRDEITYTAVLHACTTANMVDEGDFYFSRITKPTVAHIALKVALLARAG 360

Query: 444 RLDEARTFIEKKKLDKHPEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLNAENYILLSN 503
           RLDEARTF+EKKKL+KHPEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLN ENYILLSN
Sbjct: 361 RLDEARTFVEKKKLNKHPEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLNTENYILLSN 420

Query: 504 WYACNDKWGMVEKLRETIRDMGLRPKKAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQC 563
           WYACN KW MVE+LRETIRDMGLRPKKAYSWIEFCNKIHVF TGDVSHPRSQNIYWNLQC
Sbjct: 421 WYACNKKWDMVEELRETIRDMGLRPKKAYSWIEFCNKIHVFVTGDVSHPRSQNIYWNLQC 480

Query: 564 LMKKMEEDGSKPNPDFSLHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVC 623
           LMKKMEEDGSK NP+FSLHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVC
Sbjct: 481 LMKKMEEDGSKTNPEFSLHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVC 540

Query: 624 HSCHESAKFISKMVGREIIVKDPRVFHHFKDGCCSCENFC 664
           HSCHESAKFISKMVGREIIVKDP VFHHFKDGCCSCENFC
Sbjct: 541 HSCHESAKFISKMVGREIIVKDPYVFHHFKDGCCSCENFC 580

BLAST of Chy4G080660 vs. NCBI nr
Match: XP_004137884.2 (pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis sativus])

HSP 1 Score: 1348 bits (3489), Expect = 0.0
Identity = 648/665 (97.44%), Postives = 656/665 (98.65%), Query Frame = 0

Query: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHA--MTVENYANLCIAHQVFDDIPIWDT 60
           MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPH   MTVENYANLC+AHQVFDDIPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHVRTMTVENYANLCVAHQVFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHT PRIICATRQYGDLQVGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180
           AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWT+LAKLYLREDKPS A
Sbjct: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTVLAKLYLREDKPSLA 180

Query: 181 LDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGLEFNILVSNSLLKM 240
           LDLFYQMVELADDIDAVALATAIGACGA KML+HGRNIHHLAR+HGLEFNILVSNSLLKM
Sbjct: 181 LDLFYQMVELADDIDAVALATAIGACGALKMLHHGRNIHHLARVHGLEFNILVSNSLLKM 240

Query: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300
           YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT
Sbjct: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300

Query: 301 ISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHGKE+HGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMVDEGD 420
           EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKM RDEITYTAVLHACTTANMVDEGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMRRDEITYTAVLHACTTANMVDEGD 420

Query: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLDGCRNHRQQK 480
           SYFSCITKPTVAHIALKVALLARAGRLDEARTF+EKKKLDKHPEILRALLDGCRNHRQQK
Sbjct: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFVEKKKLDKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPKKAYSWIEFC 540
           LGKRIIEQLCDLEPLNAENYILLSNWYACN+KW MVEKLRETIRDMGLRPKKAYSWIEFC
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNLQCLMK+MEEDGSKPNPDFSLHDVDEERECVPIGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKEMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPRVFHHFKDGCCS 660
           LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDP VFHHFKDGCCS
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CENFC 663
           CENFC
Sbjct: 661 CENFC 665

BLAST of Chy4G080660 vs. NCBI nr
Match: XP_008465161.1 (PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucumis melo])

HSP 1 Score: 1305 bits (3376), Expect = 0.0
Identity = 628/665 (94.44%), Postives = 643/665 (96.69%), Query Frame = 0

Query: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHAMT--VENYANLCIAHQVFDDIPIWDT 60
           MNLLLSTHTHCLPITQKP HAYHRHPPFNNLPH  T  VENYA+LC+AHQVFD+IPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPYHAYHRHPPFNNLPHVRTTTVENYADLCVAHQVFDEIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGD GHVIS YRQMLFRGVRPDKHT PRIICATRQYGDL VGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDWGHVISIYRQMLFRGVRPDKHTLPRIICATRQYGDLPVGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180
           AFKLGFSS+LYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA
Sbjct: 121 AFKLGFSSDLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180

Query: 181 LDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGLEFNILVSNSLLKM 240
           +DLFYQMVELADDID+VALATAIGACGA KML+HGRNIHHLARIHGLEFNILVSNSLLKM
Sbjct: 181 IDLFYQMVELADDIDSVALATAIGACGALKMLHHGRNIHHLARIHGLEFNILVSNSLLKM 240

Query: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300
           Y+DCDSIKDARGFFDQMPSKD+ISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDP T
Sbjct: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHGKE+HGYV+KN FDENLIVQNALVDMYVKSGCIQSASKTFSMMK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVLKNGFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMVDEGD 420
           EKDMVSWSIMTLGYSLHGQGKLGV LFREMEKN KM RDEITYTAVLHACTTANMVDEGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVGLFREMEKNLKMHRDEITYTAVLHACTTANMVDEGD 420

Query: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLDGCRNHRQQK 480
            YFS ITKPTVAHIALKVALLARAGRLDEARTF+EKKKL+KHPEILRALLDGCRNHRQQK
Sbjct: 421 FYFSRITKPTVAHIALKVALLARAGRLDEARTFVEKKKLNKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPKKAYSWIEFC 540
           LGKRIIEQLCDLEPLN ENYILLSNWYACN KW MVE+LRETIRDMGLRPKKAYSWIEFC
Sbjct: 481 LGKRIIEQLCDLEPLNTENYILLSNWYACNKKWDMVEELRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600
           NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSK NP+FSLHDVDEERECVPIGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKTNPEFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPRVFHHFKDGCCS 660
           LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDP VFHHFKDGCCS
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CENFC 663
           CENFC
Sbjct: 661 CENFC 665

BLAST of Chy4G080660 vs. NCBI nr
Match: XP_038905218.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1221 bits (3160), Expect = 0.0
Identity = 591/666 (88.74%), Postives = 623/666 (93.54%), Query Frame = 0

Query: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHAMTV--ENYANLCIAHQVFDDIPIWDT 60
           MNLLLSTH HCLPITQ+ +H      PFNN PH  T   +N ANL +AHQ+FD+IPIWDT
Sbjct: 1   MNLLLSTHVHCLPITQETSHRQ----PFNNPPHVRTTTAKNSANLSVAHQLFDEIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLTNGDLGHVISTY+QMLFRGVRPDKHT PRIICATRQYG+LQ GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDLGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGNLQFGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180
           AFKLGFSSNLYVLTSLIE YGILDSADTAKWLHDKS CRNSVSWT+LAKLYL EDKPS A
Sbjct: 121 AFKLGFSSNLYVLTSLIEFYGILDSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSCA 180

Query: 181 LDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGLEFNILVSNSLLKM 240
           +DLFYQMVELADDIDAVALATAIGACGA KML HGRNIH LARIHGLEFN+LVSNSLLKM
Sbjct: 181 IDLFYQMVELADDIDAVALATAIGACGALKMLQHGRNIHLLARIHGLEFNVLVSNSLLKM 240

Query: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300
           Y+DCDSIKDARGFFD+MPSKD+ISWTELIHMYVKKGGINEAFKLFRQMN DG LKPDP T
Sbjct: 241 YLDCDSIKDARGFFDRMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNKDGGLKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGRMAAHKHGKE+HGYV+KNAFDENLIVQNALVDMYVKSGCIQSAS+TFSMMK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVLKNAFDENLIVQNALVDMYVKSGCIQSASETFSMMK 360

Query: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKML-RDEITYTAVLHACTTANMVDEG 420
           EKDMVSW+IMTLGYSLHGQGKLGVSLFRE+E+N +M  RD+ITYTAVLHACTTANMVDEG
Sbjct: 361 EKDMVSWTIMTLGYSLHGQGKLGVSLFREIERNLRMHNRDQITYTAVLHACTTANMVDEG 420

Query: 421 DSYFSCITKPTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLDGCRNHRQQ 480
           D YFSCIT+PTVAHIALKVALLARAGRLDEA TF+EK KLDKH  ILRALLDGCR H Q+
Sbjct: 421 DFYFSCITEPTVAHIALKVALLARAGRLDEATTFVEKNKLDKHAVILRALLDGCRKHHQR 480

Query: 481 KLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPKKAYSWIEF 540
           KLGK+IIE+LCDLEPLNAENYILLSNWYACN KW MVEKLRET+RDMGLRPKKAYSW+EF
Sbjct: 481 KLGKQIIEKLCDLEPLNAENYILLSNWYACNKKWDMVEKLRETMRDMGLRPKKAYSWMEF 540

Query: 541 CNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEERECVPIGHSE 600
           CNKIHVFGTGDVSHPRS+NIYWNLQCLMKKMEEDGSKPNPDFS HDVDEERECVPIGHSE
Sbjct: 541 CNKIHVFGTGDVSHPRSRNIYWNLQCLMKKMEEDGSKPNPDFSFHDVDEERECVPIGHSE 600

Query: 601 LLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPRVFHHFKDGCC 660
           LLAISFGLIST+AGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDP VFHHFKDGCC
Sbjct: 601 LLAISFGLISTKAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCC 660

Query: 661 SCENFC 663
           SCE+ C
Sbjct: 661 SCEDVC 662

BLAST of Chy4G080660 vs. NCBI nr
Match: XP_022972268.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 1146 bits (2965), Expect = 0.0
Identity = 546/665 (82.11%), Postives = 598/665 (89.92%), Query Frame = 0

Query: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHAMTV--ENYANLCIAHQVFDDIPIWDT 60
           M+LLLST  H LP+TQKPNH YHRH  FNN PH  T   E  A+LC+AHQ+FDDIPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLT+GD+GHVISTY+QML RGVRPD HT PR+ICA+R YGDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180
           AFKLG  SNLYV TSLIELYGILDSADTA+WLHDKS CRN+VSWT+LAKLYL EDKPSF+
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 LDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGLEFNILVSNSLLKM 240
           LDLFYQMVELA DIDAVALATAIGACGARK+L HGRNIHH+ARIHGLEF++LVSN LLKM
Sbjct: 181 LDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300
           Y+DC SIKDARG F++MP +DIISWT+LIH YVK GGINEA KLFRQMNMDGELKPDP T
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGR+AAHKHG+E+HGYV+KN FD+NLIVQNALVDMYVKSGCIQSA K FS MK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNDFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMVDEGD 420
           EKDMVSW++M  GYSLHGQGKLGV LFREM++NF++ RDEITYTAVL +C+TA+MV+EGD
Sbjct: 361 EKDMVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQSCSTASMVEEGD 420

Query: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLDGCRNHRQQK 480
            YF+CIT+PT+AH  LKVALL RAGR DEARTF++K KLDK+ EILRALLDGCR H Q K
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQHK 480

Query: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPKKAYSWIEFC 540
           LGKRIIEQLCDLEPLNAENY+LLSNWYA N++W MVEKLR+TIRDMGLRPKKAYSW+EF 
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDG K N DF  HDVDEEREC PIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECAPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPRVFHHFKDGCCS 660
           LAISFGLISTEAGRTIRI+KNLRVCHSCHESAKFIS  VGREIIVKDP VFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRISKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CENFC 663
           CE+FC
Sbjct: 661 CEDFC 665

BLAST of Chy4G080660 vs. NCBI nr
Match: XP_023539701.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1146 bits (2964), Expect = 0.0
Identity = 546/665 (82.11%), Postives = 598/665 (89.92%), Query Frame = 0

Query: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHAMTV--ENYANLCIAHQVFDDIPIWDT 60
           MNLLLST  H LP+TQKPNH YHRH  FNN PH  T   E  A+LC+AHQ+FDDIPIWDT
Sbjct: 1   MNLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQ 120
           FAWNNLIQTHLT+GD+GHVISTY+QML RGVRPD HT PR+ICA+R YGDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180
           AFKLG  SNLYV TSLIELYGILDSADTA+WLHDKS CRN+VSWT+LAKLYL EDKPSF+
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 LDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGLEFNILVSNSLLKM 240
           +DLFYQMVELA DIDAVALATA+GACGARK+L HGRNIHH+ARIHGLEF++LVSN LLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATALGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300
           Y+DC SIKDARG F++MP +DIISWT+LIH YVK GGINEA KLFRQMNMDGELKPDP T
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360
           ISSILPACGR+AAHKHG+E+HGYV+KN FD+NLIVQNALVDMYVKSGCIQSA K FS MK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMVDEGD 420
           EKDMVSW++M  GYSLHGQGKLGV LFREM++NF++ RDEITYTAVL AC+TA+MV+EGD
Sbjct: 361 EKDMVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQACSTASMVEEGD 420

Query: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLDGCRNHRQQK 480
            YF+CIT+PT+AH  LKVALL RAGR DEARTF++K KLDK+ EILRALLDGCR H QQK
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPKKAYSWIEFC 540
           LGKRIIEQLCDLEPLNAENY+LLSNWYA N++W MVEKLR+TIRDMGLRPKKAYSW+EF 
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDG K N DF  HDVDEEREC  IGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPRVFHHFKDGCCS 660
           LAISFGLISTEAGRTIRI+KNLRVCHSCHESAKFIS  VGREIIVKDP VFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRISKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CENFC 663
           CE+FC
Sbjct: 661 CEDFC 665

BLAST of Chy4G080660 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 440.7 bits (1132), Expect = 2.2e-123
Identity = 231/623 (37.08%), Postives = 357/623 (57.30%), Query Frame = 0

Query: 46  AHQVFDDIPIWDTFAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQ 105
           A +VFD++   D  +WN++I  +++NG     +S + QML  G+  D  T   +      
Sbjct: 249 ARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCAD 308

Query: 106 YGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTIL 165
              + +G+ +H+   K  FS       +L+++Y      D+AK +  + + R+ VS+T +
Sbjct: 309 SRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSM 368

Query: 166 AKLYLREDKPSFALDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGL 225
              Y RE     A+ LF +M E     D   +   +  C   ++L  G+ +H   + + L
Sbjct: 369 IAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDL 428

Query: 226 EFNILVSNSLLKMYIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQ 285
            F+I VSN+L+ MY  C S+++A   F +M  KDIISW  +I  Y K    NEA  LF  
Sbjct: 429 GFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNL 488

Query: 286 MNMDGELKPDPRTISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSG 345
           +  +    PD RT++ +LPAC  ++A   G+E+HGY+++N +  +  V N+LVDMY K G
Sbjct: 489 LLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCG 548

Query: 346 CIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVL 405
            +  A   F  +  KD+VSW++M  GY +HG GK  ++LF +M +   +  DEI++ ++L
Sbjct: 549 ALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQM-RQAGIEADEISFVSLL 608

Query: 406 HACTTANMVDEGDSYFS-----CITKPTVAHIALKVALLARAGRLDEARTFIEKKKLDKH 465
           +AC+ + +VDEG  +F+     C  +PTV H A  V +LAR G L +A  FIE   +   
Sbjct: 609 YACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPD 668

Query: 466 PEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRET 525
             I  ALL GCR H   KL +++ E++ +LEP N   Y+L++N YA  +KW  V++LR+ 
Sbjct: 669 ATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKR 728

Query: 526 IRDMGLRPKKAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFS 585
           I   GLR     SWIE   ++++F  GD S+P ++NI   L+ +  +M E+G  P   ++
Sbjct: 729 IGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYA 788

Query: 586 LHDVDE-ERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGR 645
           L D +E E+E    GHSE LA++ G+IS+  G+ IR+TKNLRVC  CHE AKF+SK+  R
Sbjct: 789 LIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRR 848

Query: 646 EIIVKDPRVFHHFKDGCCSCENF 663
           EI+++D   FH FKDG CSC  F
Sbjct: 849 EIVLRDSNRFHQFKDGHCSCRGF 870

BLAST of Chy4G080660 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 416.0 bits (1068), Expect = 5.7e-116
Identity = 221/631 (35.02%), Postives = 350/631 (55.47%), Query Frame = 0

Query: 39  NYANLCIAHQVFDDIPIWDTFAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPR 98
           ++ ++  A QVFDD+P    F WN +I+ +  N      +  Y  M    V PD  TFP 
Sbjct: 65  SFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPH 124

Query: 99  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTC-- 158
           ++ A      LQ+G+ +HAQ F+LGF ++++V   LI LY       +A+ + +      
Sbjct: 125 LLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPE 184

Query: 159 RNSVSWTILAKLYLREDKPSFALDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNI 218
           R  VSWT +   Y +  +P  AL++F QM ++    D VAL + + A    + L  GR+I
Sbjct: 185 RTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSI 244

Query: 219 HHLARIHGLEFNILVSNSLLKMYIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGI 278
           H      GLE    +  SL  MY  C  +  A+  FD+M S ++I W  +I  Y K G  
Sbjct: 245 HASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYA 304

Query: 279 NEAFKLFRQMNMDGELKPDPRTISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNA 338
            EA  +F +M ++ +++PD  +I+S + AC ++ + +  + ++ YV ++ + +++ + +A
Sbjct: 305 REAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSA 364

Query: 339 LVDMYVKSGCIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLR 398
           L+DM+ K G ++ A   F    ++D+V WS M +GY LHG+ +  +SL+R ME+   +  
Sbjct: 365 LIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERG-GVHP 424

Query: 399 DEITYTAVLHACTTANMVDEGDSYFSCIT----KPTVAHIALKVALLARAGRLDEARTFI 458
           +++T+  +L AC  + MV EG  +F+ +      P   H A  + LL RAG LD+A   I
Sbjct: 425 NDVTFLGLLMACNHSGMVREGWWFFNRMADHKINPQQQHYACVIDLLGRAGHLDQAYEVI 484

Query: 459 EKKKLDKHPEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWG 518
           +   +     +  ALL  C+ HR  +LG+   +QL  ++P N  +Y+ LSN YA    W 
Sbjct: 485 KCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWD 544

Query: 519 MVEKLRETIRDMGLRPKKAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDG 578
            V ++R  +++ GL      SW+E   ++  F  GD SHPR + I   ++ +  +++E G
Sbjct: 545 RVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGG 604

Query: 579 SKPNPDFSLHDV-DEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAK 638
              N D SLHD+ DEE E     HSE +AI++GLIST  G  +RITKNLR C +CH + K
Sbjct: 605 FVANKDASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATK 664

Query: 639 FISKMVGREIIVKDPRVFHHFKDGCCSCENF 663
            ISK+V REI+V+D   FHHFKDG CSC ++
Sbjct: 665 LISKLVDREIVVRDTNRFHHFKDGVCSCGDY 693

BLAST of Chy4G080660 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 389.0 bits (998), Expect = 7.4e-108
Identity = 222/675 (32.89%), Postives = 353/675 (52.30%), Query Frame = 0

Query: 34  AMTVENYANLCIAHQ---VFDDIPIWDTFAWNNLIQTHLTNGDLGHVISTYRQMLFRGVR 93
           ++ +  Y NL + H+   +F  +      AW ++I+           ++++ +M   G  
Sbjct: 43  SIVISIYTNLKLLHEALLLFKTLKSPPVLAWKSVIRCFTDQSLFSKALASFVEMRASGRC 102

Query: 94  PDKHTFPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGIL--------- 153
           PD + FP ++ +     DL+ G+ +H    +LG   +LY   +L+ +Y  L         
Sbjct: 103 PDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISV 162

Query: 154 ---------------------------DSADTAKWLHDKSTCRNSVSWTILAKLYLREDK 213
                                         D+ + + +    ++ VS+  +   Y +   
Sbjct: 163 GNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGM 222

Query: 214 PSFALDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGLEFNILVSNS 273
              AL +  +M       D+  L++ +        +  G+ IH      G++ ++ + +S
Sbjct: 223 YEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSS 282

Query: 274 LLKMYIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKP 333
           L+ MY     I+D+   F ++  +D ISW  L+  YV+ G  NEA +LFRQM +  ++KP
Sbjct: 283 LVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQM-VTAKVKP 342

Query: 334 DPRTISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTF 393
                SS++PAC  +A    GK++HGYV++  F  N+ + +ALVDMY K G I++A K F
Sbjct: 343 GAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIF 402

Query: 394 SMMKEKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMV 453
             M   D VSW+ + +G++LHG G   VSLF EM++   +  +++ + AVL AC+   +V
Sbjct: 403 DRMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMKRQ-GVKPNQVAFVAVLTACSHVGLV 462

Query: 454 DEGDSYFSCITK-----PTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLD 513
           DE   YF+ +TK       + H A    LL RAG+L+EA  FI K  ++    +   LL 
Sbjct: 463 DEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLS 522

Query: 514 GCRNHRQQKLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPK 573
            C  H+  +L +++ E++  ++  N   Y+L+ N YA N +W  + KLR  +R  GLR K
Sbjct: 523 SCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKK 582

Query: 574 KAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEE-- 633
            A SWIE  NK H F +GD SHP    I   L+ +M++ME++G   +    LHDVDEE  
Sbjct: 583 PACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHK 642

Query: 634 RECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPR 663
           RE +  GHSE LA++FG+I+TE G TIR+TKN+R+C  CH + KFISK+  REIIV+D  
Sbjct: 643 RELL-FGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNS 702

BLAST of Chy4G080660 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 387.5 bits (994), Expect = 2.2e-107
Identity = 231/731 (31.60%), Postives = 364/731 (49.79%), Query Frame = 0

Query: 14  ITQKPNHAYHRHPPFNNLPHAMTVENYANLCIAH----------QVFDDIPIWDTFAWNN 73
           +  K  +A H    F+ +P   T  ++  +  A+          + FD +P  D+ +W  
Sbjct: 58  VYSKTGYALHARKLFDEMP-LRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTT 117

Query: 74  LIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQYGDLQVGKQLHAQAFKLG 133
           +I  +   G     I     M+  G+ P + T   ++ +      ++ GK++H+   KLG
Sbjct: 118 MIVGYKNIGQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLG 177

Query: 134 FSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFALDLFY 193
              N+ V  SL+ +Y        AK++ D+   R+  SW  +  L+++  +   A+  F 
Sbjct: 178 LRGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFE 237

Query: 194 QMVE-----------------------------LADDI---DAVALATAIGACGARKMLY 253
           QM E                             L D +   D   LA+ + AC   + L 
Sbjct: 238 QMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLC 297

Query: 254 HGRNIHHLARIHGLEFNILVSNSLLKMYIDCDSIKDARGFFDQ----------------- 313
            G+ IH      G + + +V N+L+ MY  C  ++ AR   +Q                 
Sbjct: 298 IGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDG 357

Query: 314 ----------------MPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 373
                           +  +D+++WT +I  Y + G   EA  LFR M + G  +P+  T
Sbjct: 358 YIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSM-VGGGQRPNSYT 417

Query: 374 ISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 433
           ++++L     +A+  HGK++HG  VK+    ++ V NAL+ MY K+G I SAS+ F +++
Sbjct: 418 LAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIR 477

Query: 434 -EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVLHACTTANMVDEG 493
            E+D VSW+ M +  + HG  +  + LF  M     +  D ITY  V  ACT A +V++G
Sbjct: 478 CERDTVSWTSMIIALAQHGHAEEALELFETMLME-GLRPDHITYVGVFSACTHAGLVNQG 537

Query: 494 DSYFSCITK-----PTVAHIALKVALLARAGRLDEARTFIEKKKLDKHPEILRALLDGCR 553
             YF  +       PT++H A  V L  RAG L EA+ FIEK  ++       +LL  CR
Sbjct: 538 RQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACR 597

Query: 554 NHRQQKLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRETIRDMGLRPKKAY 613
            H+   LGK   E+L  LEP N+  Y  L+N Y+   KW    K+R++++D  ++ ++ +
Sbjct: 598 VHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGF 657

Query: 614 SWIEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFSLHDVDEE-RECV 663
           SWIE  +K+HVFG  D +HP    IY  ++ +  ++++ G  P+    LHD++EE +E +
Sbjct: 658 SWIEVKHKVHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQI 717

BLAST of Chy4G080660 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 385.6 bits (989), Expect = 8.2e-107
Identity = 209/622 (33.60%), Postives = 337/622 (54.18%), Query Frame = 0

Query: 46  AHQVFDDIPIWDTFAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTFPRIICATRQ 105
           A +VFD +P  D  +WN ++  +  NG     +   + M    ++P   T   ++ A   
Sbjct: 189 ARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSA 248

Query: 106 YGDLQVGKQLHAQAFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTIL 165
              + VGK++H  A + GF S + + T+L+++Y    S +TA+ L D    RN VSW  +
Sbjct: 249 LRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSM 308

Query: 166 AKLYLREDKPSFALDLFYQMVELADDIDAVALATAIGACGARKMLYHGRNIHHLARIHGL 225
              Y++ + P  A+ +F +M++       V++  A+ AC     L  GR IH L+   GL
Sbjct: 309 IDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGL 368

Query: 226 EFNILVSNSLLKMYIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQ 285
           + N+ V NSL+ MY  C  +  A   F ++ S+ ++SW  +I  + + G   +A   F Q
Sbjct: 369 DRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQ 428

Query: 286 MNMDGELKPDPRTISSILPACGRMAAHKHGKEVHGYVVKNAFDENLIVQNALVDMYVKSG 345
           M     +KPD  T  S++ A   ++   H K +HG V+++  D+N+ V  ALVDMY K G
Sbjct: 429 MR-SRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCG 488

Query: 346 CIQSASKTFSMMKEKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMLRDEITYTAVL 405
            I  A   F MM E+ + +W+ M  GY  HG GK  + LF EM+K   +  + +T+ +V+
Sbjct: 489 AIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKG-TIKPNGVTFLSVI 548

Query: 406 HACTTANMVDEGDSYFSCITKP-----TVAHIALKVALLARAGRLDEARTFIEKKKLDKH 465
            AC+ + +V+ G   F  + +      ++ H    V LL RAGRL+EA  FI +  +   
Sbjct: 549 SACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPA 608

Query: 466 PEILRALLDGCRNHRQQKLGKRIIEQLCDLEPLNAENYILLSNWYACNDKWGMVEKLRET 525
             +  A+L  C+ H+     ++  E+L +L P +   ++LL+N Y     W  V ++R +
Sbjct: 609 VNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVS 668

Query: 526 IRDMGLRPKKAYSWIEFCNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKPNPDFS 585
           +   GLR     S +E  N++H F +G  +HP S+ IY  L+ L+  ++E G  P+ +  
Sbjct: 669 MLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLV 728

Query: 586 LHDVDEERECVPIGHSELLAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGRE 645
           L   ++ +E +   HSE LAISFGL++T AG TI + KNLRVC  CH + K+IS + GRE
Sbjct: 729 LGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGRE 788

Query: 646 IIVKDPRVFHHFKDGCCSCENF 663
           I+V+D + FHHFK+G CSC ++
Sbjct: 789 IVVRDMQRFHHFKNGACSCGDY 808

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SN393.0e-12237.08Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Q9LTV88.0e-11535.02Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
Q9LW631.0e-10632.89Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Q9SHZ83.0e-10631.60Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX... [more]
Q3E6Q11.2e-10533.60Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A1S3CPR50.0e+0094.44pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucumis ... [more]
A0A0A0L9N40.0e+0097.28DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G7228... [more]
A0A6J1I9E10.0e+0082.11pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbit... [more]
A0A5A7T6A70.0e+0095.34Pentatricopeptide repeat-containing protein DOT4 OS=Cucumis melo var. makuwa OX=... [more]
A0A5D3CFC20.0e+0095.17Pentatricopeptide repeat-containing protein DOT4 OS=Cucumis melo var. makuwa OX=... [more]
Match NameE-valueIdentityDescription
XP_004137884.20.097.44pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis sativus... [more]
XP_008465161.10.094.44PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like ... [more]
XP_038905218.10.088.74pentatricopeptide repeat-containing protein DOT4, chloroplastic-like isoform X1 ... [more]
XP_022972268.10.082.11pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita ... [more]
XP_023539701.10.082.11pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita ... [more]
Match NameE-valueIdentityDescription
AT4G18750.12.2e-12337.08Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G12770.15.7e-11635.02mitochondrial editing factor 22 [more]
AT3G23330.17.4e-10832.89Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G22070.12.2e-10731.60pentatricopeptide (PPR) repeat-containing protein [more]
AT1G11290.18.2e-10733.60Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (hystrix) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 57..96
e-value: 3.8E-7
score: 30.2
coord: 258..306
e-value: 9.4E-9
score: 35.3
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 363..391
e-value: 0.011
score: 15.9
coord: 335..361
e-value: 0.011
score: 15.9
coord: 399..423
e-value: 1.3
score: 9.5
coord: 160..187
e-value: 0.0088
score: 16.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 261..295
e-value: 0.0015
score: 16.6
coord: 333..363
e-value: 0.0029
score: 15.7
coord: 60..92
e-value: 0.002
score: 16.1
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 57..91
score: 9.930995
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 330..364
score: 8.53891
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 259..293
score: 9.876189
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 211..363
e-value: 6.0E-23
score: 83.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 46..210
e-value: 3.1E-17
score: 64.9
coord: 381..543
e-value: 3.8E-14
score: 54.8
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 532..653
e-value: 1.6E-32
score: 112.0
NoneNo IPR availablePANTHERPTHR24015OS07G0578800 PROTEIN-RELATEDcoord: 1..656
NoneNo IPR availablePANTHERPTHR24015:SF1853REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 1..656

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Chy4G080660.1Chy4G080660.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding