CmoCh06G017390 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh06G017390
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like
Location: Cmo_Chr06: 11926032 .. 11928029 (+)
RNA-Seq Expression: CmoCh06G017390
Synteny: CmoCh06G017390
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATCTCCTCCTCTCCACCCCCATTCATCGTCTTCCTCTTACTCAAAAACCTAATCACACATACGATCGCCACCGACTCTTTAATAATCCCCCTCATGTTCGCACCACGACTGCTGAGAAGAATGCTCATTTATGCGTAGCCCACCAACTGTTCGACGATATTCCTATATGGGATACTTTTGCTTGGAACAATCTGATTCAAACCCATCTCACCAGCGGAGATGTGGGGCATGTTATTTCTACTTATCAACAGATGTTGTCTCGAGGGGTTCGCCCCGACAACCACACCCTTCCTCGAGTTATCTGCGCCTCCCGTCACTATGGTGATCTGCAGCTTGGCAAGCAGCTCCATGCTCAAGCCTTCAAACTTGGGCTCTTCTCTAACCTCTATGTATTTACTTCCTTGATTGAGCTGTATGGGATTCTTGACAGTGCGGACACTGCAAGGTGGCTCCATGACAAGTCGGCTTGCAGAAACGCTGTTTCTTGGACCATGTTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTTCCATAGACTTGTTTTACCAAATGGTGGAGTTGGCGGCTGATATTGATGCAGTGGCATTGGCCACGGCTATTGGTGCCTGTGGTGCACGTAAACTGCTGCAACACGGAAGAAATATCCACCATGTCGCCAGAATTCATGGCTTGGAATTTGATATCTTGGTCAGTAATTGCCTGTTGAAAATGTACCTTGACTGTGGCAGTATCAAAGATGCTAGGGGGTTGTTCAATCGAATGCCGTTCAGAGATATCATTTCGTGGACAGACCTCATCCATTTTTATGTTAAGAATGGTGGAATCAATGAGGCCTTAAAGCTCTTTCGACAGATGAATATGGATGGAGAATTGAAGCCTGATCCTCTCACAATCAGCAGCATTCTCCCAGCCTGTGGAAGAATCACTGCTCATAAGCATGGAAGAGAGATTCACGGATACGTGCTTAAAAATTATTTTGATGACAATCTCATCGTCCAAAATGCTTTAGTTGACATGTATGTCAAATCTGGATGTATCCAATCTGCATTGAAAATTTTCTCGAGGATGAAGGAGAAAGACATGGTTTCTTGGACTGTCGTGATCTCGGGCTACAGCTTACATGGGCAAGGAAAACTTGGAGTGGGTTTGTTCCGTGAGATGGACAGGAACTTTAGTGTGCATAGAGATGAGATCACTTATACTGCAGTTTTGCAGGCTTGTAGTACTGCAAGCATGGTAGAGGAAGGGGATTTTTACTTCAATTGCATTACCGAACCAACCATGGCACACTTTGTTTTAAAGGTGGCTCTTTTAGGCCGGGCAGGACGATTCAATGAAGCAAGAACATTTGTCGATAAACATAAACTCGACAAAAATTTAGAGATTTTGAGAGCATTGCTCGATGGATGCAGGAAGCACCATCAACAGAAACTAGGGAAGCGAATCATTGAGCAGCTGTGTGATTTGGAACCTCTAAATGCCGAGAATTACGTTCTACTTTCAAACTGGTATGCCAGCAACGAAGAATGGGAGATGGTCGAAAAGCTGAGAAAAACTATTAGAGACATGGGATTAAGACCAAAGAAGGCTTACAGTTGGATGGAGTTCCGCAACAAAATCCATGCATTTGGCACAGGGGATGTATCCCACCCAAGATCGCAGGCCATATATTGGAATTTACAGTGCCTGATGAAGAAAATGGAAGAAGATGGTTTCAAGAGGAATACAGATTTCAGATTCCACGACGTGGATGAGGAGCGAGAGTGTGCTCTGATAGGACACAGCGAGCTCTTGGCAATTTCCTTCGGGCTGATTAGTACAGAAGCCGGAAGGACAATTCGTATTGCAAAGAACCTTCGTGTATGCCATAGTTGTCATGAATCCGCGAAGTTCATATCCAACAAGGTTGGACGAGAAATCATAGTAAAAGATCCTTATGTTTTCCATCATTTCAAGGATGGCCGTTGTTCTTGTGAAGATTTTTGTTAG

mRNA sequence

ATGGATCTCCTCCTCTCCACCCCCATTCATCGTCTTCCTCTTACTCAAAAACCTAATCACACATACGATCGCCACCGACTCTTTAATAATCCCCCTCATGTTCGCACCACGACTGCTGAGAAGAATGCTCATTTATGCGTAGCCCACCAACTGTTCGACGATATTCCTATATGGGATACTTTTGCTTGGAACAATCTGATTCAAACCCATCTCACCAGCGGAGATGTGGGGCATGTTATTTCTACTTATCAACAGATGTTGTCTCGAGGGGTTCGCCCCGACAACCACACCCTTCCTCGAGTTATCTGCGCCTCCCGTCACTATGGTGATCTGCAGCTTGGCAAGCAGCTCCATGCTCAAGCCTTCAAACTTGGGCTCTTCTCTAACCTCTATGTATTTACTTCCTTGATTGAGCTGTATGGGATTCTTGACAGTGCGGACACTGCAAGGTGGCTCCATGACAAGTCGGCTTGCAGAAACGCTGTTTCTTGGACCATGTTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTTCCATAGACTTGTTTTACCAAATGGTGGAGTTGGCGGCTGATATTGATGCAGTGGCATTGGCCACGGCTATTGGTGCCTGTGGTGCACGTAAACTGCTGCAACACGGAAGAAATATCCACCATGTCGCCAGAATTCATGGCTTGGAATTTGATATCTTGGTCAGTAATTGCCTGTTGAAAATGTACCTTGACTGTGGCAGTATCAAAGATGCTAGGGGGTTGTTCAATCGAATGCCGTTCAGAGATATCATTTCGTGGACAGACCTCATCCATTTTTATGTTAAGAATGGTGGAATCAATGAGGCCTTAAAGCTCTTTCGACAGATGAATATGGATGGAGAATTGAAGCCTGATCCTCTCACAATCAGCAGCATTCTCCCAGCCTGTGGAAGAATCACTGCTCATAAGCATGGAAGAGAGATTCACGGATACGTGCTTAAAAATTATTTTGATGACAATCTCATCGTCCAAAATGCTTTAGTTGACATGTATGTCAAATCTGGATGTATCCAATCTGCATTGAAAATTTTCTCGAGGATGAAGGAGAAAGACATGGTTTCTTGGACTGTCGTGATCTCGGGCTACAGCTTACATGGGCAAGGAAAACTTGGAGTGGGTTTGTTCCGTGAGATGGACAGGAACTTTAGTGTGCATAGAGATGAGATCACTTATACTGCAGTTTTGCAGGCTTGTAGTACTGCAAGCATGGTAGAGGAAGGGGATTTTTACTTCAATTGCATTACCGAACCAACCATGGCACACTTTGTTTTAAAGGTGGCTCTTTTAGGCCGGGCAGGACGATTCAATGAAGCAAGAACATTTGTCGATAAACATAAACTCGACAAAAATTTAGAGATTTTGAGAGCATTGCTCGATGGATGCAGGAAGCACCATCAACAGAAACTAGGGAAGCGAATCATTGAGCAGCTGTGTGATTTGGAACCTCTAAATGCCGAGAATTACGTTCTACTTTCAAACTGGTATGCCAGCAACGAAGAATGGGAGATGGTCGAAAAGCTGAGAAAAACTATTAGAGACATGGGATTAAGACCAAAGAAGGCTTACAGTTGGATGGAGTTCCGCAACAAAATCCATGCATTTGGCACAGGGGATGTATCCCACCCAAGATCGCAGGCCATATATTGGAATTTACAGTGCCTGATGAAGAAAATGGAAGAAGATGGTTTCAAGAGGAATACAGATTTCAGATTCCACGACGTGGATGAGGAGCGAGAGTGTGCTCTGATAGGACACAGCGAGCTCTTGGCAATTTCCTTCGGGCTGATTAGTACAGAAGCCGGAAGGACAATTCGTATTGCAAAGAACCTTCGTGTATGCCATAGTTGTCATGAATCCGCGAAGTTCATATCCAACAAGGTTGGACGAGAAATCATAGTAAAAGATCCTTATGTTTTCCATCATTTCAAGGATGGCCGTTGTTCTTGTGAAGATTTTTGTTAG

Coding sequence (CDS)

ATGGATCTCCTCCTCTCCACCCCCATTCATCGTCTTCCTCTTACTCAAAAACCTAATCACACATACGATCGCCACCGACTCTTTAATAATCCCCCTCATGTTCGCACCACGACTGCTGAGAAGAATGCTCATTTATGCGTAGCCCACCAACTGTTCGACGATATTCCTATATGGGATACTTTTGCTTGGAACAATCTGATTCAAACCCATCTCACCAGCGGAGATGTGGGGCATGTTATTTCTACTTATCAACAGATGTTGTCTCGAGGGGTTCGCCCCGACAACCACACCCTTCCTCGAGTTATCTGCGCCTCCCGTCACTATGGTGATCTGCAGCTTGGCAAGCAGCTCCATGCTCAAGCCTTCAAACTTGGGCTCTTCTCTAACCTCTATGTATTTACTTCCTTGATTGAGCTGTATGGGATTCTTGACAGTGCGGACACTGCAAGGTGGCTCCATGACAAGTCGGCTTGCAGAAACGCTGTTTCTTGGACCATGTTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTTCCATAGACTTGTTTTACCAAATGGTGGAGTTGGCGGCTGATATTGATGCAGTGGCATTGGCCACGGCTATTGGTGCCTGTGGTGCACGTAAACTGCTGCAACACGGAAGAAATATCCACCATGTCGCCAGAATTCATGGCTTGGAATTTGATATCTTGGTCAGTAATTGCCTGTTGAAAATGTACCTTGACTGTGGCAGTATCAAAGATGCTAGGGGGTTGTTCAATCGAATGCCGTTCAGAGATATCATTTCGTGGACAGACCTCATCCATTTTTATGTTAAGAATGGTGGAATCAATGAGGCCTTAAAGCTCTTTCGACAGATGAATATGGATGGAGAATTGAAGCCTGATCCTCTCACAATCAGCAGCATTCTCCCAGCCTGTGGAAGAATCACTGCTCATAAGCATGGAAGAGAGATTCACGGATACGTGCTTAAAAATTATTTTGATGACAATCTCATCGTCCAAAATGCTTTAGTTGACATGTATGTCAAATCTGGATGTATCCAATCTGCATTGAAAATTTTCTCGAGGATGAAGGAGAAAGACATGGTTTCTTGGACTGTCGTGATCTCGGGCTACAGCTTACATGGGCAAGGAAAACTTGGAGTGGGTTTGTTCCGTGAGATGGACAGGAACTTTAGTGTGCATAGAGATGAGATCACTTATACTGCAGTTTTGCAGGCTTGTAGTACTGCAAGCATGGTAGAGGAAGGGGATTTTTACTTCAATTGCATTACCGAACCAACCATGGCACACTTTGTTTTAAAGGTGGCTCTTTTAGGCCGGGCAGGACGATTCAATGAAGCAAGAACATTTGTCGATAAACATAAACTCGACAAAAATTTAGAGATTTTGAGAGCATTGCTCGATGGATGCAGGAAGCACCATCAACAGAAACTAGGGAAGCGAATCATTGAGCAGCTGTGTGATTTGGAACCTCTAAATGCCGAGAATTACGTTCTACTTTCAAACTGGTATGCCAGCAACGAAGAATGGGAGATGGTCGAAAAGCTGAGAAAAACTATTAGAGACATGGGATTAAGACCAAAGAAGGCTTACAGTTGGATGGAGTTCCGCAACAAAATCCATGCATTTGGCACAGGGGATGTATCCCACCCAAGATCGCAGGCCATATATTGGAATTTACAGTGCCTGATGAAGAAAATGGAAGAAGATGGTTTCAAGAGGAATACAGATTTCAGATTCCACGACGTGGATGAGGAGCGAGAGTGTGCTCTGATAGGACACAGCGAGCTCTTGGCAATTTCCTTCGGGCTGATTAGTACAGAAGCCGGAAGGACAATTCGTATTGCAAAGAACCTTCGTGTATGCCATAGTTGTCATGAATCCGCGAAGTTCATATCCAACAAGGTTGGACGAGAAATCATAGTAAAAGATCCTTATGTTTTCCATCATTTCAAGGATGGCCGTTGTTCTTGTGAAGATTTTTGTTAG

Protein sequence

MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDTFAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQAFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFSIDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKMYLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLTISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMKEKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGDFYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFRNKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSELLAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCSCEDFC
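
The relationship between the location, the coding sequence and the protein sequence above can be checked with a short script. The following is a minimal sketch, assuming the standard nuclear genetic code; the names `translate` and `feature_length` are illustrative and not part of any database API. To keep it short, only the first 30 and last 18 nucleotides of the CDS are translated.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


# First 30 nt and last 18 nt of the CDS listed above.
cds_start = "ATGGATCTCCTCCTCTCCACCCCCATTCAT"
cds_end = "TGTGAAGATTTTTGTTAG"

print(translate(cds_start))                 # MDLLLSTPIH
print(translate(cds_end))                   # CEDFC
print(feature_length(11926032, 11928029))   # 1998
```

The feature spans 1998 bp, which equals 3 × (665 + 1): 665 codons for the protein plus one stop codon. This is consistent with the gene, mRNA and CDS sequences being identical, as expected for a gene annotated without introns or untranslated regions.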
Homology
BLAST of CmoCh06G017390 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 448.7 bits (1153), Expect = 1.1e-124
Identity = 233/630 (36.98%), Positives = 359/630 (56.98%), Query Frame = 0

Query: 41  KNAHLCVAHQLFDDIPIWDTFAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPR 100
           KN  +  A ++FD++   D  +WN++I  ++++G     +S + QML  G+  D  T+  
Sbjct: 242 KNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVS 301

Query: 101 VICASRHYGDLQLGKQLHAQAFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRN 160
           V         + LG+ +H+   K           +L+++Y      D+A+ +  + + R+
Sbjct: 302 VFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRS 361

Query: 161 AVSWTMLAKLYLMEDKPSFSIDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHH 220
            VS+T +   Y  E     ++ LF +M E     D   +   +  C   +LL  G+ +H 
Sbjct: 362 VVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHE 421

Query: 221 VARIHGLEFDILVSNCLLKMYLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINE 280
             + + L FDI VSN L+ MY  CGS+++A  +F+ M  +DIISW  +I  Y KN   NE
Sbjct: 422 WIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANE 481

Query: 281 ALKLFRQMNMDGELKPDPLTISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALV 340
           AL LF  +  +    PD  T++ +LPAC  ++A   GREIHGY+++N +  +  V N+LV
Sbjct: 482 ALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLV 541

Query: 341 DMYVKSGCIQSALKIFSRMKEKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDE 400
           DMY K G +  A  +F  +  KD+VSWTV+I+GY +HG GK  + LF +M R   +  DE
Sbjct: 542 DMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQM-RQAGIEADE 601

Query: 401 ITYTAVLQACSTASMVEEGDFYFN-----CITEPTMAHFVLKVALLGRAGRFNEARTFVD 460
           I++ ++L ACS + +V+EG  +FN     C  EPT+ H+   V +L R G   +A  F++
Sbjct: 602 ISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIE 661

Query: 461 KHKLDKNLEILRALLDGCRKHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEM 520
              +  +  I  ALL GCR HH  KL +++ E++ +LEP N   YVL++N YA  E+WE 
Sbjct: 662 NMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQ 721

Query: 521 VEKLRKTIRDMGLRPKKAYSWMEFRNKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGF 580
           V++LRK I   GLR     SW+E + +++ F  GD S+P ++ I   L+ +  +M E+G+
Sbjct: 722 VKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGY 781

Query: 581 KRNTDFRFHDVDE-ERECALIGHSELLAISFGLISTEAGRTIRIAKNLRVCHSCHESAKF 640
              T +   D +E E+E AL GHSE LA++ G+IS+  G+ IR+ KNLRVC  CHE AKF
Sbjct: 782 SPLTKYALIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKF 841

Query: 641 ISNKVGREIIVKDPYVFHHFKDGRCSCEDF 665
           +S    REI+++D   FH FKDG CSC  F
Sbjct: 842 MSKLTRREIVLRDSNRFHQFKDGHCSCRGF 870

BLAST of CmoCh06G017390 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 427.6 bits (1098), Expect = 2.7e-118
Identity = 222/626 (35.46%), Positives = 358/626 (57.19%), Query Frame = 0

Query: 48  AHQLFDDIPIWDTFAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRH 107
           A Q+FDD+P    F WN +I+ +  +      +  Y  M    V PD+ T P ++ A   
Sbjct: 72  ARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSG 131

Query: 108 YGDLQLGKQLHAQAFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSAC--RNAVSWT 167
              LQ+G+ +HAQ F+LG  ++++V   LI LY       +AR + +      R  VSWT
Sbjct: 132 LSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWT 191

Query: 168 MLAKLYLMEDKPSFSIDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIH 227
            +   Y    +P  ++++F QM ++    D VAL + + A    + L+ GR+IH      
Sbjct: 192 AIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKM 251

Query: 228 GLEF--DILVSNCLLKMYLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALK 287
           GLE   D+L+S  L  MY  CG +  A+ LF++M   ++I W  +I  Y KNG   EA+ 
Sbjct: 252 GLEIEPDLLIS--LNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAID 311

Query: 288 LFRQMNMDGELKPDPLTISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMY 347
           +F +M ++ +++PD ++I+S + AC ++ + +  R ++ YV ++ + D++ + +AL+DM+
Sbjct: 312 MFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMF 371

Query: 348 VKSGCIQSALKIFSRMKEKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITY 407
            K G ++ A  +F R  ++D+V W+ +I GY LHG+ +  + L+R M+R   VH +++T+
Sbjct: 372 AKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERG-GVHPNDVTF 431

Query: 408 TAVLQACSTASMVEEGDFYFNCITE----PTMAHFVLKVALLGRAGRFNEARTFVDKHKL 467
             +L AC+ + MV EG ++FN + +    P   H+   + LLGRAG  ++A   +    +
Sbjct: 432 LGLLMACNHSGMVREGWWFFNRMADHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPV 491

Query: 468 DKNLEILRALLDGCRKHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKL 527
              + +  ALL  C+KH   +LG+   +QL  ++P N  +YV LSN YA+   W+ V ++
Sbjct: 492 QPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEV 551

Query: 528 RKTIRDMGLRPKKAYSWMEFRNKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNT 587
           R  +++ GL      SW+E R ++ AF  GD SHPR + I   ++ +  +++E GF  N 
Sbjct: 552 RVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANK 611

Query: 588 DFRFHDV-DEERECALIGHSELLAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNK 647
           D   HD+ DEE E  L  HSE +AI++GLIST  G  +RI KNLR C +CH + K IS  
Sbjct: 612 DASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKL 671

Query: 648 VGREIIVKDPYVFHHFKDGRCSCEDF 665
           V REI+V+D   FHHFKDG CSC D+
Sbjct: 672 VDREIVVRDTNRFHHFKDGVCSCGDY 693

BLAST of CmoCh06G017390 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 404.1 bits (1037), Expect = 3.1e-111
Identity = 223/656 (33.99%), Positives = 346/656 (52.74%), Query Frame = 0

Query: 51  LFDDIPIWDTFAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGD 110
           LF  +      AW ++I+           ++++ +M + G  PD++  P V+ +     D
Sbjct: 61  LFKTLKSPPVLAWKSVIRCFTDQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMD 120

Query: 111 LQLGKQLHAQAFKLGLFSNLYVFTSLIELYGIL--------------------------- 170
           L+ G+ +H    +LG+  +LY   +L+ +Y  L                           
Sbjct: 121 LRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDED 180

Query: 171 ---------DSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFSIDLFYQMVELAADI 230
                       D+ R + +    ++ VS+  +   Y        ++ +  +M       
Sbjct: 181 VKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKP 240

Query: 231 DAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKMYLDCGSIKDARGLF 290
           D+  L++ +        +  G+ IH      G++ D+ + + L+ MY     I+D+  +F
Sbjct: 241 DSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVF 300

Query: 291 NRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLTISSILPACGRITAH 350
           +R+  RD ISW  L+  YV+NG  NEAL+LFRQM +  ++KP  +  SS++PAC  +   
Sbjct: 301 SRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQM-VTAKVKPGAVAFSSVIPACAHLATL 360

Query: 351 KHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMKEKDMVSWTVVISGY 410
             G+++HGYVL+  F  N+ + +ALVDMY K G I++A KIF RM   D VSWT +I G+
Sbjct: 361 HLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGH 420

Query: 411 SLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGDFYFNCITE-----P 470
           +LHG G   V LF EM R   V  +++ + AVL ACS   +V+E   YFN +T+      
Sbjct: 421 ALHGHGHEAVSLFEEMKRQ-GVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQ 480

Query: 471 TMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQKLGKRIIEQL 530
            + H+     LLGRAG+  EA  F+ K  ++    +   LL  C  H   +L +++ E++
Sbjct: 481 ELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKI 540

Query: 531 CDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFRNKIHAFGTG 590
             ++  N   YVL+ N YASN  W+ + KLR  +R  GLR K A SW+E +NK H F +G
Sbjct: 541 FTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSG 600

Query: 591 DVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALI-GHSELLAISFGLI 650
           D SHP    I   L+ +M++ME++G+  +T    HDVDEE +  L+ GHSE LA++FG+I
Sbjct: 601 DRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGII 660

Query: 651 STEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCSCEDF 665
           +TE G TIR+ KN+R+C  CH + KFIS    REIIV+D   FHHF  G CSC D+
Sbjct: 661 NTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDY 714

BLAST of CmoCh06G017390 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 394.4 bits (1012), Expect = 2.5e-108
Identity = 213/622 (34.24%), Positives = 341/622 (54.82%), Query Frame = 0

Query: 48  AHQLFDDIPIWDTFAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRH 107
           A ++FD +P  D  +WN ++  +  +G     +   + M    ++P   T+  V+ A   
Sbjct: 189 ARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSA 248

Query: 108 YGDLQLGKQLHAQAFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTML 167
              + +GK++H  A + G  S + + T+L+++Y    S +TAR L D    RN VSW  +
Sbjct: 249 LRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSM 308

Query: 168 AKLYLMEDKPSFSIDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGL 227
              Y+  + P  ++ +F +M++       V++  A+ AC     L+ GR IH ++   GL
Sbjct: 309 IDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGL 368

Query: 228 EFDILVSNCLLKMYLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQ 287
           + ++ V N L+ MY  C  +  A  +F ++  R ++SW  +I  + +NG   +AL  F Q
Sbjct: 369 DRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQ 428

Query: 288 MNMDGELKPDPLTISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSG 347
           M     +KPD  T  S++ A   ++   H + IHG V+++  D N+ V  ALVDMY K G
Sbjct: 429 MR-SRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCG 488

Query: 348 CIQSALKIFSRMKEKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVL 407
            I  A  IF  M E+ + +W  +I GY  HG GK  + LF EM +  ++  + +T+ +V+
Sbjct: 489 AIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKG-TIKPNGVTFLSVI 548

Query: 408 QACSTASMVEEG--DFYF---NCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKN 467
            ACS + +VE G   FY    N   E +M H+   V LLGRAGR NEA  F+ +  +   
Sbjct: 549 SACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPA 608

Query: 468 LEILRALLDGCRKHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKT 527
           + +  A+L  C+ H      ++  E+L +L P +   +VLL+N Y +   WE V ++R +
Sbjct: 609 VNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVS 668

Query: 528 IRDMGLRPKKAYSWMEFRNKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFR 587
           +   GLR     S +E +N++H+F +G  +HP S+ IY  L+ L+  ++E G+  +T+  
Sbjct: 669 MLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLV 728

Query: 588 FHDVDEERECALIGHSELLAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGRE 647
               ++ +E  L  HSE LAISFGL++T AG TI + KNLRVC  CH + K+IS   GRE
Sbjct: 729 LGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGRE 788

Query: 648 IIVKDPYVFHHFKDGRCSCEDF 665
           I+V+D   FHHFK+G CSC D+
Sbjct: 789 IVVRDMQRFHHFKNGACSCGDY 808

BLAST of CmoCh06G017390 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 386.3 bits (991), Expect = 6.8e-106
Identity = 225/656 (34.30%), Positives = 343/656 (52.29%), Query Frame = 0

Query: 48  AHQLFDDIPIWDTFAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRH 107
           A  +F  I   +   WN + + H  S D    +  Y  M+S G+ P+++T P V+ +   
Sbjct: 87  AISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAK 146

Query: 108 YGDLQLGKQLHAQAFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTML 167
               + G+Q+H    KLG   +LYV TSLI +Y      + A  + DKS  R+ VS+T L
Sbjct: 147 SKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTAL 206

Query: 168 AKLY-----------LMEDKP--------------------SFSIDLFYQMVELAADIDA 227
            K Y           L ++ P                      +++LF  M++     D 
Sbjct: 207 IKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDE 266

Query: 228 VALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKMYLDCGSIKDARGLFNR 287
             + T + AC     ++ GR +H     HG   ++ + N L+ +Y  CG ++ A GLF R
Sbjct: 267 STMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFER 326

Query: 288 MPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLTISSILPACGRITAHKH 347
           +P++D+ISW  LI  Y       EAL LF++M   GE  P+ +T+ SILPAC  + A   
Sbjct: 327 LPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGE-TPNDVTMLSILPACAHLGAIDI 386

Query: 348 GREIHGYVLKNY--FDDNLIVQNALVDMYVKSGCIQSALKIFSRMKEKDMVSWTVVISGY 407
           GR IH Y+ K      +   ++ +L+DMY K G I++A ++F+ +  K + SW  +I G+
Sbjct: 387 GRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGF 446

Query: 408 SLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGDFYFNCITE-----P 467
           ++HG+      LF  M R   +  D+IT+  +L ACS + M++ G   F  +T+     P
Sbjct: 447 AMHGRADASFDLFSRM-RKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTP 506

Query: 468 TMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQKLGKRIIEQL 527
            + H+   + LLG +G F EA   ++  +++ +  I  +LL  C+ H   +LG+   E L
Sbjct: 507 KLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENL 566

Query: 528 CDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFRNKIHAFGTG 587
             +EP N  +YVLLSN YAS   W  V K R  + D G++     S +E  + +H F  G
Sbjct: 567 IKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIG 626

Query: 588 DVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEE-RECALIGHSELLAISFGLI 647
           D  HPR++ IY  L+ +   +E+ GF  +T     +++EE +E AL  HSE LAI+FGLI
Sbjct: 627 DKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLI 686

Query: 648 STEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCSCEDF 665
           ST+ G  + I KNLRVC +CHE+ K IS    REII +D   FHHF+DG CSC D+
Sbjct: 687 STKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDY 740

BLAST of CmoCh06G017390 vs. ExPASy TrEMBL
Match: A0A6J1EXC6 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111439085 PE=3 SV=1)

HSP 1 Score: 1385.2 bits (3584), Expect = 0.0e+00
Identity = 665/665 (100.00%), Positives = 665/665 (100.00%), Query Frame = 0

Query: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60
           MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120
           FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180
           AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240
           IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240

Query: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300
           YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT
Sbjct: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360
           ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK
Sbjct: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420
           EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD
Sbjct: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420

Query: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480
           FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600
           NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660
           LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS
Sbjct: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CmoCh06G017390 vs. ExPASy TrEMBL
Match: A0A6J1I9E1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111470851 PE=3 SV=1)

HSP 1 Score: 1359.0 bits (3516), Expect = 0.0e+00
Identity = 651/665 (97.89%), Positives = 657/665 (98.80%), Query Frame = 0

Query: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60
           MDLLLSTPIHRLPLTQKPNHTY RHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120
           FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180
           AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240
           +DLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFD+LVSNCLLKM
Sbjct: 181 LDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300
           YLDC SIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360
           ISSILPACGRI AHKHGREIHGYVLKN FDDNLIVQNALVDMYVKSGCIQSALKIFSRMK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNDFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420
           EKDMVSWTV+ISGYSLHGQGKLGVGLFREMDRNF VHRDEITYTAVLQ+CSTASMVEEGD
Sbjct: 361 EKDMVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQSCSTASMVEEGD 420

Query: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480
           FYFNCITEPTMAHFVLKVALLGRAGRF+EARTFVDKHKLDKN EILRALLDGCRKHHQ K
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQHK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600
           NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECA IGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECAPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660
           LAISFGLISTEAGRTIRI+KNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS
Sbjct: 601 LAISFGLISTEAGRTIRISKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CmoCh06G017390 vs. ExPASy TrEMBL
Match: A0A1S3CPR5 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103502829 PE=3 SV=1)

HSP 1 Score: 1148.3 bits (2969), Expect = 0.0e+00
Identity = 545/665 (81.95%), Positives = 596/665 (89.62%), Query Frame = 0

Query: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60
           M+LLLST  H LP+TQKP H Y RH  FNN PHVRTTT E  A LCVAHQ+FD+IPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPYHAYHRHPPFNNLPHVRTTTVENYADLCVAHQVFDEIPIWDT 60

Query: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120
           FAWNNLIQTHLT+GD GHVIS Y+QML RGVRPD HTLPR+ICA+R YGDL +GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDWGHVISIYRQMLFRGVRPDKHTLPRIICATRQYGDLPVGKQLHAQ 120

Query: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180
           AFKLG  S+LYV TSLIELYGILDSADTA+WLHDKS CRN+VSWT+LAKLYL EDKPSF+
Sbjct: 121 AFKLGFSSDLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTILAKLYLREDKPSFA 180

Query: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240
           IDLFYQMVELA DID+VALATAIGACGA K+L HGRNIHH+ARIHGLEF+ILVSN LLKM
Sbjct: 181 IDLFYQMVELADDIDSVALATAIGACGALKMLHHGRNIHHLARIHGLEFNILVSNSLLKM 240

Query: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300
           YLDC SIKDARG F++MP +D+ISWT+LIH YVK GGINEA KLFRQMNMDGELKPDPLT
Sbjct: 241 YLDCDSIKDARGFFDQMPSKDVISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360
           ISSILPACGR+ AHKHG+EIHGYVLKN FD+NLIVQNALVDMYVKSGCIQSA K FS MK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVLKNGFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420
           EKDMVSW+++  GYSLHGQGKLGVGLFREM++N  +HRDEITYTAVL AC+TA+MV+EGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVGLFREMEKNLKMHRDEITYTAVLHACTTANMVDEGD 420

Query: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480
           FYF+ IT+PT+AH  LKVALL RAGR +EARTFV+K KL+K+ EILRALLDGCR H QQK
Sbjct: 421 FYFSRITKPTVAHIALKVALLARAGRLDEARTFVEKKKLNKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCDLEPLN ENY+LLSNWYA N++W+MVE+LR+TIRDMGLRPKKAYSW+EF 
Sbjct: 481 LGKRIIEQLCDLEPLNTENYILLSNWYACNKKWDMVEELRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDG K N +F  HDVDEEREC  IGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGSKTNPEFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660
           LAISFGLISTEAGRTIRI KNLRVCHSCHESAKFIS  VGREIIVKDPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRVCHSCHESAKFISKMVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CEDFC 666
           CE+FC
Sbjct: 661 CENFC 665

BLAST of CmoCh06G017390 vs. ExPASy TrEMBL
Match: A0A6J1E0A4 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111025202 PE=3 SV=1)

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 545/665 (81.95%), Positives = 586/665 (88.12%), Query Frame = 0

Query: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60
           MDLLLST   RLP+T K + TY R R FNNPPHVRT   E  A+LC AH  FD+IP WDT
Sbjct: 1   MDLLLSTHFRRLPITPKTDLTYRRRRPFNNPPHVRTAITENYANLCEAHHPFDEIPTWDT 60

Query: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120
           FAWNNLIQTHLT+GDVG VISTY+QML RGVRPDNHTLPR+I ASR  GDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDVGLVISTYEQMLLRGVRPDNHTLPRIIGASRQCGDLQVGKQLHAQ 120

Query: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180
            FKLG  SNLYV TSLIELYGILD ADTA+WLHDKSACRN+VSWTMLAKLY+MEDKPSF+
Sbjct: 121 VFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTMLAKLYVMEDKPSFA 180

Query: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240
           IDLFYQMVELAADIDAVALATAIGACG+ KLLQHGRNIH +AR HGLEFD+LVSN LLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGSLKLLQHGRNIHLLARTHGLEFDVLVSNSLLKM 240

Query: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300
           YLDCGSI+DARG FNRMP +D+ISWT+LI  YVK GGINE  KLFRQMNMDG LKPDP+T
Sbjct: 241 YLDCGSIRDARGFFNRMPSKDVISWTELIQAYVKKGGINEGFKLFRQMNMDGGLKPDPIT 300

Query: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360
           ISSILPACGR+ AHKHGREIHGYVLK+  D NLIVQNALVDMYVKSGCIQSALKIFSRMK
Sbjct: 301 ISSILPACGRMAAHKHGREIHGYVLKSAIDVNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420
           EKD +SWTV+I GYSLHGQGKLGV LFR M+RN  +HRDEITYT+VL ACSTAS+VEEGD
Sbjct: 361 EKDAISWTVMILGYSLHGQGKLGVSLFRLMERNLRMHRDEITYTSVLHACSTASLVEEGD 420

Query: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480
           FYFNCI EPT +HF LKVALL RAGR +EAR FV++HKLDK+ EILRALLDGCR H  +K
Sbjct: 421 FYFNCIMEPTFSHFALKVALLARAGRLDEARAFVEQHKLDKHPEILRALLDGCRTHRDKK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCDLEPLNAENY+LLSNWYA N + +MVEK R+ +RDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNGKLDMVEKSREIVRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNL+CLMKKME+DG K   DF FHDVDEEREC LIGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLECLMKKMEDDGLKPKPDFSFHDVDEERECVLIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660
           LAISFGLISTEAGRTI I KNLRVCHSCHESAKFIS  VGREIIVKDPYVFHHFKDG CS
Sbjct: 601 LAISFGLISTEAGRTICITKNLRVCHSCHESAKFISKIVGREIIVKDPYVFHHFKDGCCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CmoCh06G017390 vs. ExPASy TrEMBL
Match: A0A0A0L9N4 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G722890 PE=3 SV=1)

HSP 1 Score: 1073.2 bits (2774), Expect = 4.9e-310
Identity = 510/624 (81.73%), Positives = 560/624 (89.74%), Query Frame = 0

Query: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60
           M+LLLST  H LP+TQKPNH Y RH  FNN PHVRT T E  A+LCVAHQ+FDDIPIWDT
Sbjct: 1   MNLLLSTHTHCLPITQKPNHAYHRHPPFNNLPHVRTMTVENYANLCVAHQVFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120
           FAWNNLIQTHLT+GD+GHVISTY+QML RGVRPD HTLPR+ICA+R YGDLQ+GKQLHAQ
Sbjct: 61  FAWNNLIQTHLTNGDLGHVISTYRQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQ 120

Query: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180
           AFKLG  SNLYV TSLIELYGILDSADTA+WLHDKS CRN+VSWT+LAKLYL EDKPS +
Sbjct: 121 AFKLGFSSNLYVLTSLIELYGILDSADTAKWLHDKSTCRNSVSWTVLAKLYLREDKPSLA 180

Query: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240
           +DLFYQMVELA DIDAVALATAIGACGA K+L HGRNIHH+AR+HGLEF+ILVSN LLKM
Sbjct: 181 LDLFYQMVELADDIDAVALATAIGACGALKMLHHGRNIHHLARVHGLEFNILVSNSLLKM 240

Query: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300
           Y+DC SIKDARG F++MP +DIISWT+LIH YVK GGINEA KLFRQMNMDGELKPDP T
Sbjct: 241 YIDCDSIKDARGFFDQMPSKDIISWTELIHMYVKKGGINEAFKLFRQMNMDGELKPDPRT 300

Query: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360
           ISSILPACGR+ AHKHG+EIHGYV+KN FD+NLIVQNALVDMYVKSGCIQSA K FS MK
Sbjct: 301 ISSILPACGRMAAHKHGKEIHGYVVKNAFDENLIVQNALVDMYVKSGCIQSASKTFSMMK 360

Query: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420
           EKDMVSW+++  GYSLHGQGKLGV LFREM++NF + RDEITYTAVL AC+TA+MV+EGD
Sbjct: 361 EKDMVSWSIMTLGYSLHGQGKLGVSLFREMEKNFKMRRDEITYTAVLHACTTANMVDEGD 420

Query: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480
            YF+CIT+PT+AH  LKVALL RAGR +EARTFV+K KLDK+ EILRALLDGCR H QQK
Sbjct: 421 SYFSCITKPTVAHIALKVALLARAGRLDEARTFVEKKKLDKHPEILRALLDGCRNHRQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCDLEPLNAENY+LLSNWYA NE+W+MVEKLR+TIRDMGLRPKKAYSW+EF 
Sbjct: 481 LGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVEKLRETIRDMGLRPKKAYSWIEFC 540

Query: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600
           NKIH FGTGDVSHPRSQ IYWNLQCLMK+MEEDG K N DF  HDVDEEREC  IGHSEL
Sbjct: 541 NKIHVFGTGDVSHPRSQNIYWNLQCLMKEMEEDGSKPNPDFSLHDVDEERECVPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRIAKNLRV 625
           LAISFGLISTEAGRTIRI KNLR+
Sbjct: 601 LAISFGLISTEAGRTIRITKNLRM 624
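
In each alignment block the middle row annotates the columns: a letter marks an identical residue, "+" marks a substitution scored as positive, and a space marks a mismatch. The Identity and Positives counts in the HSP header can be tallied from that row. A minimal Python sketch, assuming the three rows have already been extracted with their labels and coordinates stripped and their spacing preserved (the function name count_midline is hypothetical, not part of any BLAST tool):

```python
from typing import Tuple


def count_midline(query: str, midline: str, sbjct: str) -> Tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    Assumes the BLAST protein midline convention: a letter for an identical
    residue, '+' for a positively scored substitution, a space otherwise.
    """
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("all three alignment rows must have the same length")
    identities = sum(1 for m in midline if m.isalpha())
    # Identical columns also count as positives.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the A0A0A0L9N4 alignment (query positions 601 to 624).
query = "LAISFGLISTEAGRTIRIAKNLRV"
midline = "LAISFGLISTEAGRTIRI KNLR+"
sbjct = "LAISFGLISTEAGRTIRITKNLRM"

print(*count_midline(query, midline, sbjct))  # prints: 22 23 24
```

Summing these per-block counts over all blocks of an HSP should reproduce the header totals, provided no trailing spaces were lost when the midline was copied.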

BLAST of CmoCh06G017390 vs. NCBI nr
Match: XP_022932569.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 1385.2 bits (3584), Expect = 0.0e+00
Identity = 665/665 (100.00%), Positives = 665/665 (100.00%), Query Frame = 0

Query: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60
           MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120
           FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180
           AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240
           IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240

Query: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300
           YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT
Sbjct: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360
           ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK
Sbjct: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420
           EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD
Sbjct: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420

Query: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480
           FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600
           NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660
           LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS
Sbjct: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CmoCh06G017390 vs. NCBI nr
Match: KAG7029175.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1370.5 bits (3546), Expect = 0.0e+00
Identity = 657/665 (98.80%), Positives = 660/665 (99.25%), Query Frame = 0

Query: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60
           MDLLLSTPIHRLPLTQKPNHTY RHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120
           FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180
           AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240
           IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFD+LVSNCLLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300
           YLDC SIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360
           ISSILPACGRI AHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420
           EKDMVSWTV+ISGYSLHGQGKLGVGLFREMDRNF VHRDEITYTAVLQACSTASMVEEGD
Sbjct: 361 EKDMVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQACSTASMVEEGD 420

Query: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480
           FYFNCITEPTMAHFVLKVALLGRAGRF+EARTFVDKHKLDKN EILRALLDGCRKHHQQK
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600
           NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660
           LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS
Sbjct: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CmoCh06G017390 vs. NCBI nr
Match: KAG6597728.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1366.7 bits (3536), Expect = 0.0e+00
Identity = 655/665 (98.50%), Positives = 659/665 (99.10%), Query Frame = 0

Query: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60
           MDLLLSTPIHRLPLTQKPNHTY RHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120
           FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180
           AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240
           IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFD+LVSNCLLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300
           YLDC SIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360
           ISSILPACGRI AHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420
           EKD+VSWTV+ISGYSLHGQGKLGVGLFREMDRNF VHRDEITYTAVLQACSTASMVEEGD
Sbjct: 361 EKDVVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQACSTASMVEEGD 420

Query: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480
           FYFNCITEPTMAHFVLKVALLGRAGRF+EARTFVDKHKLDKN EILRALLDGCRKHHQQK
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600
           NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660
           LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFIS KVGREIIVKDPYVFHHFKDGRCS
Sbjct: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISKKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CmoCh06G017390 vs. NCBI nr
Match: XP_023539701.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1366.7 bits (3536), Expect = 0.0e+00
Identity = 654/665 (98.35%), Positives = 660/665 (99.25%), Query Frame = 0

Query: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60
           M+LLLSTPIHRLPLTQKPNHTY RHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT
Sbjct: 1   MNLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120
           FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180
           AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240
           IDLFYQMVELAADIDAVALATA+GACGARKLLQHGRNIHHVARIHGLEFD+LVSNCLLKM
Sbjct: 181 IDLFYQMVELAADIDAVALATALGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300
           YLDC SIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360
           ISSILPACGRI AHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420
           EKDMVSWTV+ISGYSLHGQGKLGVGLFREMDRNF VHRDEITYTAVLQACSTASMVEEGD
Sbjct: 361 EKDMVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQACSTASMVEEGD 420

Query: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480
           FYFNCITEPTMAHFVLKVALLGRAGRF+EARTFVDKHKLDKN EILRALLDGCRKHHQQK
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQQK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600
           NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660
           LAISFGLISTEAGRTIRI+KNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS
Sbjct: 601 LAISFGLISTEAGRTIRISKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CmoCh06G017390 vs. NCBI nr
Match: XP_022972268.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 1359.0 bits (3516), Expect = 0.0e+00
Identity = 651/665 (97.89%), Positives = 657/665 (98.80%), Query Frame = 0

Query: 1   MDLLLSTPIHRLPLTQKPNHTYDRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60
           MDLLLSTPIHRLPLTQKPNHTY RHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT
Sbjct: 1   MDLLLSTPIHRLPLTQKPNHTYHRHRLFNNPPHVRTTTAEKNAHLCVAHQLFDDIPIWDT 60

Query: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120
           FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ
Sbjct: 61  FAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGDLQLGKQLHAQ 120

Query: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180
           AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS
Sbjct: 121 AFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFS 180

Query: 181 IDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKM 240
           +DLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFD+LVSNCLLKM
Sbjct: 181 LDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDVLVSNCLLKM 240

Query: 241 YLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300
           YLDC SIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT
Sbjct: 241 YLDCSSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLT 300

Query: 301 ISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360
           ISSILPACGRI AHKHGREIHGYVLKN FDDNLIVQNALVDMYVKSGCIQSALKIFSRMK
Sbjct: 301 ISSILPACGRIAAHKHGREIHGYVLKNDFDDNLIVQNALVDMYVKSGCIQSALKIFSRMK 360

Query: 361 EKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGD 420
           EKDMVSWTV+ISGYSLHGQGKLGVGLFREMDRNF VHRDEITYTAVLQ+CSTASMVEEGD
Sbjct: 361 EKDMVSWTVMISGYSLHGQGKLGVGLFREMDRNFRVHRDEITYTAVLQSCSTASMVEEGD 420

Query: 421 FYFNCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQK 480
           FYFNCITEPTMAHFVLKVALLGRAGRF+EARTFVDKHKLDKN EILRALLDGCRKHHQ K
Sbjct: 421 FYFNCITEPTMAHFVLKVALLGRAGRFDEARTFVDKHKLDKNSEILRALLDGCRKHHQHK 480

Query: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540
           LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR
Sbjct: 481 LGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFR 540

Query: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALIGHSEL 600
           NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECA IGHSEL
Sbjct: 541 NKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECAPIGHSEL 600

Query: 601 LAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660
           LAISFGLISTEAGRTIRI+KNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS
Sbjct: 601 LAISFGLISTEAGRTIRISKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCS 660

Query: 661 CEDFC 666
           CEDFC
Sbjct: 661 CEDFC 665

BLAST of CmoCh06G017390 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 448.7 bits (1153), Expect = 7.9e-126
Identity = 233/630 (36.98%), Positives = 359/630 (56.98%), Query Frame = 0

Query: 41  KNAHLCVAHQLFDDIPIWDTFAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPR 100
           KN  +  A ++FD++   D  +WN++I  ++++G     +S + QML  G+  D  T+  
Sbjct: 242 KNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVS 301

Query: 101 VICASRHYGDLQLGKQLHAQAFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRN 160
           V         + LG+ +H+   K           +L+++Y      D+A+ +  + + R+
Sbjct: 302 VFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRS 361

Query: 161 AVSWTMLAKLYLMEDKPSFSIDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHH 220
            VS+T +   Y  E     ++ LF +M E     D   +   +  C   +LL  G+ +H 
Sbjct: 362 VVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHE 421

Query: 221 VARIHGLEFDILVSNCLLKMYLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINE 280
             + + L FDI VSN L+ MY  CGS+++A  +F+ M  +DIISW  +I  Y KN   NE
Sbjct: 422 WIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANE 481

Query: 281 ALKLFRQMNMDGELKPDPLTISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALV 340
           AL LF  +  +    PD  T++ +LPAC  ++A   GREIHGY+++N +  +  V N+LV
Sbjct: 482 ALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLV 541

Query: 341 DMYVKSGCIQSALKIFSRMKEKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDE 400
           DMY K G +  A  +F  +  KD+VSWTV+I+GY +HG GK  + LF +M R   +  DE
Sbjct: 542 DMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQM-RQAGIEADE 601

Query: 401 ITYTAVLQACSTASMVEEGDFYFN-----CITEPTMAHFVLKVALLGRAGRFNEARTFVD 460
           I++ ++L ACS + +V+EG  +FN     C  EPT+ H+   V +L R G   +A  F++
Sbjct: 602 ISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIE 661

Query: 461 KHKLDKNLEILRALLDGCRKHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEM 520
              +  +  I  ALL GCR HH  KL +++ E++ +LEP N   YVL++N YA  E+WE 
Sbjct: 662 NMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQ 721

Query: 521 VEKLRKTIRDMGLRPKKAYSWMEFRNKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGF 580
           V++LRK I   GLR     SW+E + +++ F  GD S+P ++ I   L+ +  +M E+G+
Sbjct: 722 VKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGY 781

Query: 581 KRNTDFRFHDVDE-ERECALIGHSELLAISFGLISTEAGRTIRIAKNLRVCHSCHESAKF 640
              T +   D +E E+E AL GHSE LA++ G+IS+  G+ IR+ KNLRVC  CHE AKF
Sbjct: 782 SPLTKYALIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKF 841

Query: 641 ISNKVGREIIVKDPYVFHHFKDGRCSCEDF 665
           +S    REI+++D   FH FKDG CSC  F
Sbjct: 842 MSKLTRREIVLRDSNRFHQFKDGHCSCRGF 870

BLAST of CmoCh06G017390 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22)

HSP 1 Score: 427.6 bits (1098), Expect = 1.9e-119
Identity = 222/626 (35.46%), Positives = 358/626 (57.19%), Query Frame = 0

Query: 48  AHQLFDDIPIWDTFAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRH 107
           A Q+FDD+P    F WN +I+ +  +      +  Y  M    V PD+ T P ++ A   
Sbjct: 72  ARQVFDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSG 131

Query: 108 YGDLQLGKQLHAQAFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSAC--RNAVSWT 167
              LQ+G+ +HAQ F+LG  ++++V   LI LY       +AR + +      R  VSWT
Sbjct: 132 LSHLQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWT 191

Query: 168 MLAKLYLMEDKPSFSIDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIH 227
            +   Y    +P  ++++F QM ++    D VAL + + A    + L+ GR+IH      
Sbjct: 192 AIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKM 251

Query: 228 GLEF--DILVSNCLLKMYLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALK 287
           GLE   D+L+S  L  MY  CG +  A+ LF++M   ++I W  +I  Y KNG   EA+ 
Sbjct: 252 GLEIEPDLLIS--LNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAID 311

Query: 288 LFRQMNMDGELKPDPLTISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMY 347
           +F +M ++ +++PD ++I+S + AC ++ + +  R ++ YV ++ + D++ + +AL+DM+
Sbjct: 312 MFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMF 371

Query: 348 VKSGCIQSALKIFSRMKEKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITY 407
            K G ++ A  +F R  ++D+V W+ +I GY LHG+ +  + L+R M+R   VH +++T+
Sbjct: 372 AKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERG-GVHPNDVTF 431

Query: 408 TAVLQACSTASMVEEGDFYFNCITE----PTMAHFVLKVALLGRAGRFNEARTFVDKHKL 467
             +L AC+ + MV EG ++FN + +    P   H+   + LLGRAG  ++A   +    +
Sbjct: 432 LGLLMACNHSGMVREGWWFFNRMADHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPV 491

Query: 468 DKNLEILRALLDGCRKHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKL 527
              + +  ALL  C+KH   +LG+   +QL  ++P N  +YV LSN YA+   W+ V ++
Sbjct: 492 QPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEV 551

Query: 528 RKTIRDMGLRPKKAYSWMEFRNKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNT 587
           R  +++ GL      SW+E R ++ AF  GD SHPR + I   ++ +  +++E GF  N 
Sbjct: 552 RVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANK 611

Query: 588 DFRFHDV-DEERECALIGHSELLAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNK 647
           D   HD+ DEE E  L  HSE +AI++GLIST  G  +RI KNLR C +CH + K IS  
Sbjct: 612 DASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKL 671

Query: 648 VGREIIVKDPYVFHHFKDGRCSCEDF 665
           V REI+V+D   FHHFKDG CSC D+
Sbjct: 672 VDREIVVRDTNRFHHFKDGVCSCGDY 693

BLAST of CmoCh06G017390 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 404.1 bits (1037), Expect = 2.2e-112
Identity = 223/656 (33.99%), Positives = 346/656 (52.74%), Query Frame = 0

Query: 51  LFDDIPIWDTFAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRHYGD 110
           LF  +      AW ++I+           ++++ +M + G  PD++  P V+ +     D
Sbjct: 61  LFKTLKSPPVLAWKSVIRCFTDQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMD 120

Query: 111 LQLGKQLHAQAFKLGLFSNLYVFTSLIELYGIL--------------------------- 170
           L+ G+ +H    +LG+  +LY   +L+ +Y  L                           
Sbjct: 121 LRFGESVHGFIVRLGMDCDLYTGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDED 180

Query: 171 ---------DSADTARWLHDKSACRNAVSWTMLAKLYLMEDKPSFSIDLFYQMVELAADI 230
                       D+ R + +    ++ VS+  +   Y        ++ +  +M       
Sbjct: 181 VKAETCIMPFGIDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKP 240

Query: 231 DAVALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKMYLDCGSIKDARGLF 290
           D+  L++ +        +  G+ IH      G++ D+ + + L+ MY     I+D+  +F
Sbjct: 241 DSFTLSSVLPIFSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVF 300

Query: 291 NRMPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLTISSILPACGRITAH 350
           +R+  RD ISW  L+  YV+NG  NEAL+LFRQM +  ++KP  +  SS++PAC  +   
Sbjct: 301 SRLYCRDGISWNSLVAGYVQNGRYNEALRLFRQM-VTAKVKPGAVAFSSVIPACAHLATL 360

Query: 351 KHGREIHGYVLKNYFDDNLIVQNALVDMYVKSGCIQSALKIFSRMKEKDMVSWTVVISGY 410
             G+++HGYVL+  F  N+ + +ALVDMY K G I++A KIF RM   D VSWT +I G+
Sbjct: 361 HLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGH 420

Query: 411 SLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGDFYFNCITE-----P 470
           +LHG G   V LF EM R   V  +++ + AVL ACS   +V+E   YFN +T+      
Sbjct: 421 ALHGHGHEAVSLFEEMKRQ-GVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQ 480

Query: 471 TMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQKLGKRIIEQL 530
            + H+     LLGRAG+  EA  F+ K  ++    +   LL  C  H   +L +++ E++
Sbjct: 481 ELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKI 540

Query: 531 CDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFRNKIHAFGTG 590
             ++  N   YVL+ N YASN  W+ + KLR  +R  GLR K A SW+E +NK H F +G
Sbjct: 541 FTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSG 600

Query: 591 DVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEERECALI-GHSELLAISFGLI 650
           D SHP    I   L+ +M++ME++G+  +T    HDVDEE +  L+ GHSE LA++FG+I
Sbjct: 601 DRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGII 660

Query: 651 STEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCSCEDF 665
           +TE G TIR+ KN+R+C  CH + KFIS    REIIV+D   FHHF  G CSC D+
Sbjct: 661 NTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDY 714

BLAST of CmoCh06G017390 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 394.4 bits (1012), Expect = 1.8e-109
Identity = 213/622 (34.24%), Positives = 341/622 (54.82%), Query Frame = 0

Query: 48  AHQLFDDIPIWDTFAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRH 107
           A ++FD +P  D  +WN ++  +  +G     +   + M    ++P   T+  V+ A   
Sbjct: 189 ARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSA 248

Query: 108 YGDLQLGKQLHAQAFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTML 167
              + +GK++H  A + G  S + + T+L+++Y    S +TAR L D    RN VSW  +
Sbjct: 249 LRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSM 308

Query: 168 AKLYLMEDKPSFSIDLFYQMVELAADIDAVALATAIGACGARKLLQHGRNIHHVARIHGL 227
              Y+  + P  ++ +F +M++       V++  A+ AC     L+ GR IH ++   GL
Sbjct: 309 IDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGL 368

Query: 228 EFDILVSNCLLKMYLDCGSIKDARGLFNRMPFRDIISWTDLIHFYVKNGGINEALKLFRQ 287
           + ++ V N L+ MY  C  +  A  +F ++  R ++SW  +I  + +NG   +AL  F Q
Sbjct: 369 DRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQ 428

Query: 288 MNMDGELKPDPLTISSILPACGRITAHKHGREIHGYVLKNYFDDNLIVQNALVDMYVKSG 347
           M     +KPD  T  S++ A   ++   H + IHG V+++  D N+ V  ALVDMY K G
Sbjct: 429 MR-SRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCG 488

Query: 348 CIQSALKIFSRMKEKDMVSWTVVISGYSLHGQGKLGVGLFREMDRNFSVHRDEITYTAVL 407
            I  A  IF  M E+ + +W  +I GY  HG GK  + LF EM +  ++  + +T+ +V+
Sbjct: 489 AIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKG-TIKPNGVTFLSVI 548

Query: 408 QACSTASMVEEG--DFYF---NCITEPTMAHFVLKVALLGRAGRFNEARTFVDKHKLDKN 467
            ACS + +VE G   FY    N   E +M H+   V LLGRAGR NEA  F+ +  +   
Sbjct: 549 SACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPA 608

Query: 468 LEILRALLDGCRKHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNEEWEMVEKLRKT 527
           + +  A+L  C+ H      ++  E+L +L P +   +VLL+N Y +   WE V ++R +
Sbjct: 609 VNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVS 668

Query: 528 IRDMGLRPKKAYSWMEFRNKIHAFGTGDVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFR 587
           +   GLR     S +E +N++H+F +G  +HP S+ IY  L+ L+  ++E G+  +T+  
Sbjct: 669 MLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLV 728

Query: 588 FHDVDEERECALIGHSELLAISFGLISTEAGRTIRIAKNLRVCHSCHESAKFISNKVGRE 647
               ++ +E  L  HSE LAISFGL++T AG TI + KNLRVC  CH + K+IS   GRE
Sbjct: 729 LGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGRE 788

Query: 648 IIVKDPYVFHHFKDGRCSCEDF 665
           I+V+D   FHHFK+G CSC D+
Sbjct: 789 IVVRDMQRFHHFKNGACSCGDY 808

BLAST of CmoCh06G017390 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 386.3 bits (991), Expect = 4.8e-107
Identity = 225/656 (34.30%), Positives = 343/656 (52.29%), Query Frame = 0

Query: 48  AHQLFDDIPIWDTFAWNNLIQTHLTSGDVGHVISTYQQMLSRGVRPDNHTLPRVICASRH 107
           A  +F  I   +   WN + + H  S D    +  Y  M+S G+ P+++T P V+ +   
Sbjct: 87  AISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAK 146

Query: 108 YGDLQLGKQLHAQAFKLGLFSNLYVFTSLIELYGILDSADTARWLHDKSACRNAVSWTML 167
               + G+Q+H    KLG   +LYV TSLI +Y      + A  + DKS  R+ VS+T L
Sbjct: 147 SKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTAL 206

Query: 168 AKLY-----------LMEDKP--------------------SFSIDLFYQMVELAADIDA 227
            K Y           L ++ P                      +++LF  M++     D 
Sbjct: 207 IKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDE 266

Query: 228 VALATAIGACGARKLLQHGRNIHHVARIHGLEFDILVSNCLLKMYLDCGSIKDARGLFNR 287
             + T + AC     ++ GR +H     HG   ++ + N L+ +Y  CG ++ A GLF R
Sbjct: 267 STMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFER 326

Query: 288 MPFRDIISWTDLIHFYVKNGGINEALKLFRQMNMDGELKPDPLTISSILPACGRITAHKH 347
           +P++D+ISW  LI  Y       EAL LF++M   GE  P+ +T+ SILPAC  + A   
Sbjct: 327 LPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGE-TPNDVTMLSILPACAHLGAIDI 386

Query: 348 GREIHGYVLKNY--FDDNLIVQNALVDMYVKSGCIQSALKIFSRMKEKDMVSWTVVISGY 407
           GR IH Y+ K      +   ++ +L+DMY K G I++A ++F+ +  K + SW  +I G+
Sbjct: 387 GRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGF 446

Query: 408 SLHGQGKLGVGLFREMDRNFSVHRDEITYTAVLQACSTASMVEEGDFYFNCITE-----P 467
           ++HG+      LF  M R   +  D+IT+  +L ACS + M++ G   F  +T+     P
Sbjct: 447 AMHGRADASFDLFSRM-RKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTP 506

Query: 468 TMAHFVLKVALLGRAGRFNEARTFVDKHKLDKNLEILRALLDGCRKHHQQKLGKRIIEQL 527
            + H+   + LLG +G F EA   ++  +++ +  I  +LL  C+ H   +LG+   E L
Sbjct: 507 KLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENL 566

Query: 528 CDLEPLNAENYVLLSNWYASNEEWEMVEKLRKTIRDMGLRPKKAYSWMEFRNKIHAFGTG 587
             +EP N  +YVLLSN YAS   W  V K R  + D G++     S +E  + +H F  G
Sbjct: 567 IKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIG 626

Query: 588 DVSHPRSQAIYWNLQCLMKKMEEDGFKRNTDFRFHDVDEE-RECALIGHSELLAISFGLI 647
           D  HPR++ IY  L+ +   +E+ GF  +T     +++EE +E AL  HSE LAI+FGLI
Sbjct: 627 DKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLI 686

Query: 648 STEAGRTIRIAKNLRVCHSCHESAKFISNKVGREIIVKDPYVFHHFKDGRCSCEDF 665
           ST+ G  + I KNLRVC +CHE+ K IS    REII +D   FHHF+DG CSC D+
Sbjct: 687 STKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDY 740

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
Q9SN39          1.1e-124  36.98         Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q9LTV8          2.7e-118  35.46         Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX...
Q9LW63          3.1e-111  33.99         Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...
Q3E6Q1          2.5e-108  34.24         Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q9LN01          6.8e-106  34.30         Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
Match Name      E-value   Identity (%)  Description
A0A6J1EXC6      0.0e+00   100.00        pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbit...
A0A6J1I9E1      0.0e+00   97.89         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucurbit...
A0A1S3CPR5      0.0e+00   81.95         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Cucumis ...
A0A6J1E0A4      0.0e+00   81.95         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like OS=Momordic...
A0A0A0L9N4      4.9e-310  81.73         DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G7228...
Match Name      E-value   Identity (%)  Description
XP_022932569.1  0.0e+00   100.00        pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita ...
KAG7029175.1    0.0e+00   98.80         Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb...
KAG6597728.1    0.0e+00   98.50         Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb...
XP_023539701.1  0.0e+00   98.35         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita ...
XP_022972268.1  0.0e+00   97.89         pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita ...
Match Name | E-value | Identity | Description
AT4G18750.1 | 7.9e-126 | 36.98 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G12770.1 | 1.9e-119 | 35.46 | mitochondrial editing factor 22
AT3G23330.1 | 2.2e-112 | 33.99 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1 | 1.8e-109 | 34.24 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1 | 4.8e-107 | 34.30 | Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 159..316; e-value: 1.0E-22; score: 82.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 36..141; e-value: 1.6E-10; score: 42.6
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 317..420; e-value: 3.0E-19; score: 71.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 425..598; e-value: 1.2E-5; score: 26.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 261..308; e-value: 4.3E-7; score: 30.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 337..363; e-value: 1.5E-4; score: 21.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 62..91; e-value: 0.49; score: 10.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 235..258; e-value: 0.4; score: 11.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 365..392; e-value: 1.9E-4; score: 21.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 263..297; e-value: 6.8E-4; score: 17.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 62..94; e-value: 5.1E-4; score: 18.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 335..365; e-value: 8.4E-5; score: 20.5
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 261..295; score: 9.722731
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 59..93; score: 10.314641
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 332..366; score: 9.656963
IPR032867 | DYW domain | PFAM | PF14432 | DYW_deaminase | coord: 534..655; e-value: 3.2E-32; score: 111.1
None | No IPR available | PANTHER | PTHR24015:SF1853 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 1..658
None | No IPR available | PANTHER | PTHR24015 | OS07G0578800 PROTEIN-RELATED | coord: 1..658

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh06G017390.1 | CmoCh06G017390.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding