CmaCh18G003650 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh18G003650
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionPentatricopeptide repeat-containing protein, mitochondrial
LocationCma_Chr18: 1990198 .. 1993065 (-)
RNA-Seq ExpressionCmaCh18G003650
SyntenyCmaCh18G003650
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTTCTCTGTCTCACTTGAAACAGTTCTCGTCTTCTTCTTCTTCTTCTTCGTTTTCTTCTTTCGAAGGCGAGATCTCCGCGGATCTCATCAATGGATCAGTTCTTTTCCAAAGCTTTAACACGCTATGCCCTAGCTGACCGATTTTACCATACGAATCGATTGAAAAAGGCGACTCTGTACGCGAAAATTAGTCCTCTGGGCGATCCTAACGTCAGTGTGGAGCCGGAGCTGGACGGCTGGGTTAAGGAGGGAAAGAAGGTCCGAATCGCTGAGCTTCAGAGAATCATTCATGACCTTCGCAAGCGTAAACGCTTCACACAAGCTCTTGAGGTTTGAGTTTTTCTTTCTTGAAATGTGGTATATTTTGGTTTGAGCTGTAGGATGTGTAGGAAGAAGTGAACTGTGAAAAAAGATGCATACAGTGAGAACTAAGTGTCTTTTGGATGAGCCTCCTTTTCAAAATGTGAATAGCTCAATCGCTAACTTCAAATCCTCAGGATTATGGAGAATGAAGTTATATTGACAGATATAGCTTGGGGAAATGGAAGGTACGCGGGTTTTCTATGCTAAAAAGTCAGAAAATACTTCTTTAGATTCTTTGATTCAATGAATTTGTTGAGTAACTTCATTAGTTCACATAATATGGACCTTCAGCCATGGAGTTATGTTGTTTCTCCCTTATTTAATCTGGAGTAACTCACTTATTTAAATCTGTGGATAATGATAGAATGACTATAGCATCATATGAGCTAGAAATGTGTGGATTTCAACAATTTATTGGGGGTGTTCTTTATAATTGATGATGGATGTAAATGCAGGTGTCCGAATGGATGAAGAAAACCGGTGTCTGCATATTTTCGCCAAGCGAACATGCGGTGCAATTGGACCTGATTGGCCGAGTTCGTGGATATCTTTCTGCCGAAAGCTATTTCAATCAGTTGAAGGAGCAAGACCAGACTGATAAGACATATGGTGCTCTCCTCAATTGCTACGTTCGGCAGCGGCAAGTTGAAAAATCCCTTTCCCATTTGCAAAAAATGAAAGAGATGGGCTTTGCAACTTCAGAGCTCACTTACAATGACATGATGTGCTTGTATACAAATGTTGGTCAGCATGACAAGGTCCCTGAGGTACTAGCAGAGATGAAAGAGAAAAATGTTTCTCCAGACAACTTCAGTTACAGAATCTGCATCAATTCGTATGGTGCGAGACGGGATCTTGAGGGGATGGAGAATGTATTAAAGGAGATGGAATCTCAACCTCATATAGTCATGGATTGGAACACTTATGCAGTAGTAGCTAACTTCTTTATAAAAGCGGATCTTGCTGATAAGGCGGTTGATGCCTTGAAAAAAGCAGAAGAGAGACTGAAGAGTAAAGATAGAATTGGCCATAACCATCTAATCTCGCTATACACAACGTTAGGAAACAAGGAGAAGGTGTTGAGATTGTGGAATCTGGATAAAACTGATACTACGAGATTCATCAATAGAGACTACATCACAATGCTTGAATCTCTGGTGAGACTGGGTGAACTTGAGGAAGCTGAAAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACACTGTCATTGTTGGGTATATTGACAAGGGAATGTGTGAGAGAGCCGAAGCCCTGCTCGAAGACTTGATGGAGAAGGGAAAGACTACCACACCAAACAGTTGGGGTGCTGTGGCTGTTCAATATATGGACAGGGGTGAGACTGAAAAATCTGTAGAGTGCATGAAGGCCGCCCTTACTCTAAATATGGATAAAGGATGGAAACCTAATCTACGTGTGATTACAGGCATATTGAATTGGCTTGGTGAAAACGCCAGCATTGAAGAAGTAGAAGCTTTTGTAGGCTCATTGAGGTCTGCGATTCCAGTGAACAGAGAGATGTATCATGCCTTGATGAAGGTTCATATAAGAGGTGGTAAAGAAGTACATGAGCTGTTAAATCAAATGAAGTCTGATAAAATAGATGAAGATGAAGAAACAAAGAAAATTCTTGGCACTGGGCAAGAAACAACTGAAGGAAGGTAAGAACATTTGCTGATTGTTTTCTTATTGAATCTTATTGTTGTTGTCAATCTGAACACCTGTTGCTTGGGTTTCAGTAATGCAAAATCTTATTTATCATCATGTTCTTCACATGAAAGGAGTAAAAGTAGGTGATGTTCACTTCCAATTTACTGATTGTATTTGGTGGTTTAATGAAGTGTTGGTGATTTGAGAGAAATGTGTTACATGCATAGATGATTTGGTATTTCGTGAACAGGGAAGAAAGAAGTTAAAATAGTGAATTCAATTAAGATTCTTTTTCTGGTATGTAGCTCCACAAGCTAGGAGCGCATCACCGGGGCGAAAACAAAGATTGGACCATCATCATGACCCTTTTCTTTCTTTTGTTTGTGTCCCGACTTCAGTTAAAAAGAGGCGTGTGTGTTTTTTTAAGGCTTATCCAGGGACTGCAGCTCTAGGGCATCCAATTCCTCTTCTTTCTTTTTCTTTTCTGCAGATCTCATATCGAGAGCAGCTTCTTCTTGTTCAGGGAAGTAAGATAAGTGAATGGAAATAAGAATTACTCTCGAGTATCAAGATTAAGTTGATCGTGTCTGATTCAATGAATTGTACAAATTTGGTGTGGAGGTGCGCACGAAAACTGAAACTTCTACGAAAAGCTTGAATTTGTTTGTTACAATCTAGATCAAAGAACCTAGAACTTGAAGCAGCTACGTGTTTCCTGTATTGTTTTAGTTCAGCTTGTCGTCAATATTTACATATATGAGATGTGATTCATTTGAGTTCAAACTAGATGGTGTACTCAGCTTGACATAG

mRNA sequence

ATGGGTTCTCTGTCTCACTTGAAACAGTTCTCGTCTTCTTCTTCTTCTTCTTCGTTTTCTTCTTTCGAAGGCGAGATCTCCGCGGATCTCATCAATGGATCAGTTCTTTTCCAAAGCTTTAACACGCTATGCCCTAGCTGACCGATTTTACCATACGAATCGATTGAAAAAGGCGACTCTGTACGCGAAAATTAGTCCTCTGGGCGATCCTAACGTCAGTGTGGAGCCGGAGCTGGACGGCTGGGTTAAGGAGGGAAAGAAGGTCCGAATCGCTGAGCTTCAGAGAATCATTCATGACCTTCGCAAGCGTAAACGCTTCACACAAGCTCTTGAGGTGTCCGAATGGATGAAGAAAACCGGTGTCTGCATATTTTCGCCAAGCGAACATGCGGTGCAATTGGACCTGATTGGCCGAGTTCGTGGATATCTTTCTGCCGAAAGCTATTTCAATCAGTTGAAGGAGCAAGACCAGACTGATAAGACATATGGTGCTCTCCTCAATTGCTACGTTCGGCAGCGGCAAGTTGAAAAATCCCTTTCCCATTTGCAAAAAATGAAAGAGATGGGCTTTGCAACTTCAGAGCTCACTTACAATGACATGATGTGCTTGTATACAAATGTTGGTCAGCATGACAAGGTCCCTGAGGTACTAGCAGAGATGAAAGAGAAAAATGTTTCTCCAGACAACTTCAGTTACAGAATCTGCATCAATTCGTATGGTGCGAGACGGGATCTTGAGGGGATGGAGAATGTATTAAAGGAGATGGAATCTCAACCTCATATAGTCATGGATTGGAACACTTATGCAGTAGTAGCTAACTTCTTTATAAAAGCGGATCTTGCTGATAAGGCGGTTGATGCCTTGAAAAAAGCAGAAGAGAGACTGAAGAGTAAAGATAGAATTGGCCATAACCATCTAATCTCGCTATACACAACGTTAGGAAACAAGGAGAAGGTGTTGAGATTGTGGAATCTGGATAAAACTGATACTACGAGATTCATCAATAGAGACTACATCACAATGCTTGAATCTCTGGTGAGACTGGGTGAACTTGAGGAAGCTGAAAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACACTGTCATTGTTGGGTATATTGACAAGGGAATGTGTGAGAGAGCCGAAGCCCTGCTCGAAGACTTGATGGAGAAGGGAAAGACTACCACACCAAACAGTTGGGGTGCTGTGGCTGTTCAATATATGGACAGGGGTGAGACTGAAAAATCTGTAGAGTGCATGAAGGCCGCCCTTACTCTAAATATGGATAAAGGATGGAAACCTAATCTACGTGTGATTACAGGCATATTGAATTGGCTTGGTGAAAACGCCAGCATTGAAGAAGTAGAAGCTTTTGTAGGCTCATTGAGGTCTGCGATTCCAGTGAACAGAGAGATGTATCATGCCTTGATGAAGGTTCATATAAGAGGTGGTAAAGAAGTACATGAGCTGTTAAATCAAATGAAGTCTGATAAAATAGATGAAGATGAAGAAACAAAGAAAATTCTTGGCACTGGGCAAGAAACAACTGAAGGAAGATCTCATATCGAGAGCAGCTTCTTCTTGTTCAGGGAATTCAGCTTGTCGTCAATATTTACATATATGAGATGTGATTCATTTGAGTTCAAACTAGATGGTGTACTCAGCTTGACATAG

Coding sequence (CDS)

ATGGATCAGTTCTTTTCCAAAGCTTTAACACGCTATGCCCTAGCTGACCGATTTTACCATACGAATCGATTGAAAAAGGCGACTCTGTACGCGAAAATTAGTCCTCTGGGCGATCCTAACGTCAGTGTGGAGCCGGAGCTGGACGGCTGGGTTAAGGAGGGAAAGAAGGTCCGAATCGCTGAGCTTCAGAGAATCATTCATGACCTTCGCAAGCGTAAACGCTTCACACAAGCTCTTGAGGTGTCCGAATGGATGAAGAAAACCGGTGTCTGCATATTTTCGCCAAGCGAACATGCGGTGCAATTGGACCTGATTGGCCGAGTTCGTGGATATCTTTCTGCCGAAAGCTATTTCAATCAGTTGAAGGAGCAAGACCAGACTGATAAGACATATGGTGCTCTCCTCAATTGCTACGTTCGGCAGCGGCAAGTTGAAAAATCCCTTTCCCATTTGCAAAAAATGAAAGAGATGGGCTTTGCAACTTCAGAGCTCACTTACAATGACATGATGTGCTTGTATACAAATGTTGGTCAGCATGACAAGGTCCCTGAGGTACTAGCAGAGATGAAAGAGAAAAATGTTTCTCCAGACAACTTCAGTTACAGAATCTGCATCAATTCGTATGGTGCGAGACGGGATCTTGAGGGGATGGAGAATGTATTAAAGGAGATGGAATCTCAACCTCATATAGTCATGGATTGGAACACTTATGCAGTAGTAGCTAACTTCTTTATAAAAGCGGATCTTGCTGATAAGGCGGTTGATGCCTTGAAAAAAGCAGAAGAGAGACTGAAGAGTAAAGATAGAATTGGCCATAACCATCTAATCTCGCTATACACAACGTTAGGAAACAAGGAGAAGGTGTTGAGATTGTGGAATCTGGATAAAACTGATACTACGAGATTCATCAATAGAGACTACATCACAATGCTTGAATCTCTGGTGAGACTGGGTGAACTTGAGGAAGCTGAAAAAGTGCTGAAAGAGTGGGAATCATCTGGGAATTGCTATGATTTTCGAGTTCCTAACACTGTCATTGTTGGGTATATTGACAAGGGAATGTGTGAGAGAGCCGAAGCCCTGCTCGAAGACTTGATGGAGAAGGGAAAGACTACCACACCAAACAGTTGGGGTGCTGTGGCTGTTCAATATATGGACAGGGGTGAGACTGAAAAATCTGTAGAGTGCATGAAGGCCGCCCTTACTCTAAATATGGATAAAGGATGGAAACCTAATCTACGTGTGATTACAGGCATATTGAATTGGCTTGGTGAAAACGCCAGCATTGAAGAAGTAGAAGCTTTTGTAGGCTCATTGAGGTCTGCGATTCCAGTGAACAGAGAGATGTATCATGCCTTGATGAAGGTTCATATAAGAGGTGGTAAAGAAGTACATGAGCTGTTAAATCAAATGAAGTCTGATAAAATAGATGAAGATGAAGAAACAAAGAAAATTCTTGGCACTGGGCAAGAAACAACTGAAGGAAGATCTCATATCGAGAGCAGCTTCTTCTTGTTCAGGGAATTCAGCTTGTCGTCAATATTTACATATATGAGATGTGATTCATTTGAGTTCAAACTAGATGGTGTACTCAGCTTGACATAG

Protein sequence

MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIAELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVVANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTTRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGILNWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDEETKKILGTGQETTEGRSHIESSFFLFREFSLSSIFTYMRCDSFEFKLDGVLSLT
Homology
BLAST of CmaCh18G003650 vs. ExPASy Swiss-Prot
Match: Q84JR3 (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 606.7 bits (1563), Expect = 2.6e-172
Identity = 293/477 (61.43%), Postives = 372/477 (77.99%), Query Frame = 0

Query: 14  LADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIAELQRIIHDLRKRK 73
           +A R+Y+TNR+KK TLY+KISPLGDP  SV PEL  WV+ GKKV +AEL RI+HDLR+RK
Sbjct: 12  IASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRK 71

Query: 74  RFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGA 133
           RF  ALEVS+WM +TGVC+FSP+EHAV LDLIGRV G+++AE YF  LKEQ + DKTYGA
Sbjct: 72  RFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGA 131

Query: 134 LLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHDKVPEVLAEMKEKN 193
           LLNCYVRQ+ VEKSL H +KMKEMGF TS LTYN++MCLYTN+GQH+KVP+VL EMKE+N
Sbjct: 132 LLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEEN 191

Query: 194 VSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVVANFFIKADLADKA 253
           V+PDN+SYRICIN++GA  DLE +   L++ME +  I MDWNTYAV A F+I     D+A
Sbjct: 192 VAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRA 251

Query: 254 VDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTTRFINRDYITMLES 313
           V+ LK +E RL+ KD  G+NHLI+LY  LG K +VLRLW+L+K    R IN+DY+T+L+S
Sbjct: 252 VELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQS 311

Query: 314 LVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKTTT 373
           LV++  L EAE+VL EW+SSGNCYDFRVPNTVI GYI K M E+AEA+LEDL  +GK TT
Sbjct: 312 LVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATT 371

Query: 374 PNSWGAVAVQYMDRGETEKSVECMKAALTLNM-DKGWKPNLRVITGILNWLGENASIEEV 433
           P SW  VA  Y ++G  E + +CMK AL + +  + W+P L ++T +L+W+G+  S++EV
Sbjct: 372 PESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKEV 431

Query: 434 EAFVGSLRSAIPVNREMYHALMKVHIR-GGKEVHELLNQMKSDKIDEDEETKKILGT 489
           E+FV SLR+ I VN++MYHAL+K  IR GG+ +  LL +MK DKI+ DEET  IL T
Sbjct: 432 ESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILST 488

BLAST of CmaCh18G003650 vs. ExPASy Swiss-Prot
Match: Q8LPS6 (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX=3702 GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 292.0 bits (746), Expect = 1.4e-77
Identity = 154/429 (35.90%), Postives = 254/429 (59.21%), Query Frame = 0

Query: 29  LYAKISPLGDPNVSVEPELDGWVKEGKKVRIAELQRIIHDLRKRKRFTQALEVSEWMKKT 88
           +Y KIS +  P +     L+ W K G+K+   EL R++ +LRK KR  QALEV +WM   
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 89  GVCI-FSPSEHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKS 148
           G     S S+ A+QLDLIG+VRG   AE +F QL E  +  + YG+LLN YVR +  EK+
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 149 LSHLQKMKEMGFATSELTYNDMMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 208
            + L  M++ G+A   L +N MM LY N+ ++DKV  ++ EMK+K++  D +SY I ++S
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 209 YGARRDLEGMENVLKEMESQPHIVMDWNTYAVVANFFIKADLADKAVDALKKAEERLKSK 268
            G+   +E ME V ++M+S   I  +W T++ +A  +IK    +KA DAL+K E R+  +
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 269 DRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTTRFINRDYITMLESLVRLGELEEAEKVL 328
           +RI +++L+SLY +LGNK+++ R+W++ K+      N  Y  ++ SLVR+G++E AEKV 
Sbjct: 309 NRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVY 368

Query: 329 KEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKTTTPNSWGAVAVQYMDR 388
           +EW    + YD R+PN ++  Y+     E AE L + ++E G   + ++W  +AV +  +
Sbjct: 369 EEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRK 428

Query: 389 GETEKSVECMKAALTLNMDKGWKPNLRVITGILNWLGENASIEEVEAFVGSLRSAIPVNR 448
               +++ C++ A +      W+P + +++G      E + +   EA +  LR +  +  
Sbjct: 429 RCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLED 488

Query: 449 EMYHALMKV 457
           + Y AL+ V
Sbjct: 489 KSYLALIDV 497

BLAST of CmaCh18G003650 vs. ExPASy Swiss-Prot
Match: Q9SKU6 (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 287.0 bits (733), Expect = 4.5e-76
Identity = 147/404 (36.39%), Postives = 246/404 (60.89%), Query Frame = 0

Query: 28  TLYAKISPLGDPNVSVEPELDGWVKEGKKVRIAELQRIIHDLRKRKRFTQALEVSEWMKK 87
           TL  +++  GDP+ S+   LDGW+ +G  V+ +EL  II  LRK  RF+ AL++S+WM +
Sbjct: 39  TLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSE 98

Query: 88  TGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKS 147
             V   S  + A++LDLI +V G   AE +F  +  + +    YGALLNCY  ++ + K+
Sbjct: 99  HRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKA 158

Query: 148 LSHLQKMKEMGFATSELTYNDMMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 207
               Q+MKE+GF    L YN M+ LY   G++  V ++L EM+++ V PD F+    +++
Sbjct: 159 EQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHA 218

Query: 208 YGARRDLEGMENVLKEMESQPHIVMDWNTYAVVANFFIKADLADKAVDALKKAEERLKS- 267
           Y    D+EGME  L   E+   + +DW TYA  AN +IKA L +KA++ L+K+E+ + + 
Sbjct: 219 YSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQ 278

Query: 268 KDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTTRFINRDYITMLESLVRLGELEEAEKV 327
           K +  +  L+S Y   G KE+V RLW+L K +   F N  YI+++ +L+++ ++EE EK+
Sbjct: 279 KRKHAYEVLMSFYGAAGKKEEVYRLWSLYK-ELDGFYNTGYISVISALLKMDDIEEVEKI 338

Query: 328 LKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKTTTPNSWGAVAVQYMD 387
           ++EWE+  + +D R+P+ +I GY  KGM E+AE ++  L++K +    ++W  +A+ Y  
Sbjct: 339 MEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKM 398

Query: 388 RGETEKSVECMKAALTLNMDKGWKPNLRVITGILNWLGENASIE 431
            G+ EK+VE  K A+ ++   GW+P+  V+   +++L     +E
Sbjct: 399 AGKMEKAVEKWKRAIEVS-KPGWRPHQVVLMSCVDYLEGQRDME 440

BLAST of CmaCh18G003650 vs. ExPASy Swiss-Prot
Match: Q93WC5 (Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g01990 PE=2 SV=1)

HSP 1 Score: 276.9 bits (707), Expect = 4.7e-73
Identity = 153/467 (32.76%), Postives = 266/467 (56.96%), Query Frame = 0

Query: 21  TNRLKKATLYAKISPLGD-PNVSVEPELDGWVKEGKKVRIAELQRIIHDLRKRKRFTQAL 80
           T   K  ++Y K+S LG      +E  L+ +V EG  V+  +L R   DLRK ++  +AL
Sbjct: 33  TKAKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDLIRYAKDLRKFRQPQRAL 92

Query: 81  EVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYV 140
           E+ EWM++  +  F+ S+HA++L+LI + +G  +AE+YFN L +  +   TYG+LLNCY 
Sbjct: 93  EIFEWMERKEIA-FTGSDHAIRLNLIAKSKGLEAAETYFNSLDDSIKNQSTYGSLLNCYC 152

Query: 141 RQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHDKVPEVLAEMKEKNVSPDNF 200
            +++  K+ +H + M ++   ++ L +N++M +Y  +GQ +KVP ++  MKEK+++P + 
Sbjct: 153 VEKEEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVPALVVAMKEKSITPCDI 212

Query: 201 SYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVVANFFIKADLADKAVDALKK 260
           +Y + I S G+ +DL+G+E VL EM+++   +  WNT+A +A  +IK  L  KA +ALK 
Sbjct: 213 TYSMWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAIYIKVGLYGKAEEALKS 272

Query: 261 AEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTTRFINRDYITMLESLVRLGE 320
            E  +    R  ++ LI+LYT + N  +V R+W+L K       N  Y+TML +L +L +
Sbjct: 273 LENNMNPDVRDCYHFLINLYTGIANASEVYRVWDLLKKRYPNVNNSSYLTMLRALSKLDD 332

Query: 321 LEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKTTTPNSWGA 380
           ++  +KV  EWES+   YD R+ N  I  Y+ + M E AEA+    M+K K     +   
Sbjct: 333 IDGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEAVFNGAMKKCKGQFSKARQL 392

Query: 381 VAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGILNWLGENASIEEVEAFVGSL 440
           + +  +   + + +++  +AA+ L+ DK W  +  +I+       E   ++  E F  +L
Sbjct: 393 LMMHLLKNDQADLALKHFEAAV-LDQDKNWTWSSELISSFFLHFEEAKDVDGAEEFCKTL 452

Query: 441 RSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDEETKKIL 487
               P++ E Y  LMK ++  GK   ++  +++   I  DEE + +L
Sbjct: 453 TKWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGILVDEEQECLL 497

BLAST of CmaCh18G003650 vs. ExPASy Swiss-Prot
Match: O22714 (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX=3702 GN=At1g60770 PE=1 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 1.4e-69
Identity = 140/469 (29.85%), Postives = 256/469 (54.58%), Query Frame = 0

Query: 21  TNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIAELQRIIHDLRKRKRFTQALE 80
           T +  +  LY ++   G   V V  +L+ ++K  K V   E+   I  LR R  +  AL+
Sbjct: 17  TKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALK 76

Query: 81  VSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVR 140
           +SE M++ G+   + S+ A+ LDL+ + R   + E+YF  L E  +T+ TYG+LLNCY +
Sbjct: 77  LSEVMEERGM-NKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCK 136

Query: 141 QRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFS 200
           +   EK+   L KMKE+    S ++YN +M LYT  G+ +KVP ++ E+K +NV PD+++
Sbjct: 137 ELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYT 196

Query: 201 YRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVVANFFIKADLADKAVDALKKA 260
           Y + + +  A  D+ G+E V++EM     +  DW TY+ +A+ ++ A L+ KA  AL++ 
Sbjct: 197 YNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQEL 256

Query: 261 EERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTTRFINRDYITMLESLVRLGEL 320
           E +   +D   +  LI+LY  LG   +V R+W   +    +  N  Y+ M++ LV+L +L
Sbjct: 257 EMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDL 316

Query: 321 EEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKTTTPNSWGAV 380
             AE + KEW+++ + YD R+ N +I  Y  +G+ ++A  L E    +G      +W   
Sbjct: 317 PGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIF 376

Query: 381 AVQYMDRGETEKSVECMKAALTLNMDKG--WKPNLRVITGILNWLGENASIEEVEAFVGS 440
              Y+  G+  +++ECM  A+++    G  W P+   +  ++++  +   +   E  +  
Sbjct: 377 MDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLEI 436

Query: 441 LRSAIP-VNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDEETKKIL 487
           L++    +  E++  L++ +   GK    +  ++K + ++ +E TKK+L
Sbjct: 437 LKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLL 484

BLAST of CmaCh18G003650 vs. ExPASy TrEMBL
Match: A0A6J1K124 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490120 PE=4 SV=1)

HSP 1 Score: 1020.8 bits (2638), Expect = 2.1e-294
Identity = 508/508 (100.00%), Postives = 508/508 (100.00%), Query Frame = 0

Query: 1   MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIA 60
           MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIA
Sbjct: 1   MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIA 60

Query: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120
           ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ
Sbjct: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120

Query: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180
           LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD
Sbjct: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180

Query: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240
           KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV
Sbjct: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240

Query: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTT 300
           ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTT
Sbjct: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTT 300

Query: 301 RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360
           RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA
Sbjct: 301 RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360

Query: 361 LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420
           LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL
Sbjct: 361 LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420

Query: 421 NWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDE 480
           NWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDE
Sbjct: 421 NWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDE 480

Query: 481 ETKKILGTGQETTEGRSHIESSFFLFRE 509
           ETKKILGTGQETTEGRSHIESSFFLFRE
Sbjct: 481 ETKKILGTGQETTEGRSHIESSFFLFRE 508

BLAST of CmaCh18G003650 vs. ExPASy TrEMBL
Match: A0A6J1GTN5 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111457015 PE=4 SV=1)

HSP 1 Score: 977.2 bits (2525), Expect = 2.7e-281
Identity = 486/495 (98.18%), Postives = 487/495 (98.38%), Query Frame = 0

Query: 1   MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIA 60
           MDQFFSKALTRYALA RFYHTNRLKKATLYAKISPLGDP+VSVEP LDGWVKEGKKVRIA
Sbjct: 1   MDQFFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRIA 60

Query: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120
           ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ
Sbjct: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120

Query: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180
           LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD
Sbjct: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180

Query: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240
           KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV
Sbjct: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240

Query: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTT 300
           ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLY TLGNKEKVLRLWNLDKTDTT
Sbjct: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDTT 300

Query: 301 RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360
           R INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA
Sbjct: 301 RLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360

Query: 361 LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420
           LLEDLMEKGKTTTPN WGAVAVQYMDR ETEKSVECMKAALTLNMDKGWKPNLRVITGIL
Sbjct: 361 LLEDLMEKGKTTTPNCWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420

Query: 421 NWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDE 480
           NWLGENASIEEVEAFVGSLRS IPVNREMYHALMK HIRGGKEVHELLNQMKSDKIDEDE
Sbjct: 421 NWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKIDEDE 480

Query: 481 ETKKILGTGQETTEG 496
           ETKKILGTGQETTEG
Sbjct: 481 ETKKILGTGQETTEG 495

BLAST of CmaCh18G003650 vs. ExPASy TrEMBL
Match: A0A6J1CGU2 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Momordica charantia OX=3673 GN=LOC111010721 PE=4 SV=1)

HSP 1 Score: 901.4 bits (2328), Expect = 1.9e-258
Identity = 441/498 (88.55%), Postives = 474/498 (95.18%), Query Frame = 0

Query: 1   MDQ-FFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRI 60
           MDQ  FSKALTRYA+A R YHTNR+KKATLYAKISPLGDP++SV PELDGWV+EGKK+R+
Sbjct: 10  MDQNLFSKALTRYAMAGRSYHTNRMKKATLYAKISPLGDPSISVGPELDGWVQEGKKIRV 69

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120
           AELQRIIHDLRKRKRFTQALEVSEWMK++GVCIFSPSEHAVQLDLIGRVRGYLSAESYF+
Sbjct: 70  AELQRIIHDLRKRKRFTQALEVSEWMKQSGVCIFSPSEHAVQLDLIGRVRGYLSAESYFD 129

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180
           QLK+QD+T KTYGALLNCYVRQRQV+KSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH
Sbjct: 130 QLKDQDKTGKTYGALLNCYVRQRQVDKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 189

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240
           DKVP+VLAEMKE  VSPDNFSYRICINSYG R DLEGME+VLKEMESQPHIVMDWNTYAV
Sbjct: 190 DKVPQVLAEMKENKVSPDNFSYRICINSYGTRCDLEGMESVLKEMESQPHIVMDWNTYAV 249

Query: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDT 300
           VANFFIK  L DKAVDAL+K+EERL SKDRIGHNHLISLY TLGNKE+VLRLW LDK+D+
Sbjct: 250 VANFFIKGGLTDKAVDALRKSEERLNSKDRIGHNHLISLYATLGNKEEVLRLWKLDKSDS 309

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 310 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 369

Query: 361 ALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGI 420
           ALLEDLM++GK TTPNSWGAVAVQY+DRGETEK+VECMK AL+L++DKGWKPNLRVITGI
Sbjct: 370 ALLEDLMKEGKATTPNSWGAVAVQYLDRGETEKAVECMKTALSLHIDKGWKPNLRVITGI 429

Query: 421 LNWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDED 480
           LNW+G+N+S EEVEAFVGSLRS IPVNREMYHALMK HIRGGKEVH LL+QMKSD+IDED
Sbjct: 430 LNWIGDNSSTEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHGLLSQMKSDQIDED 489

Query: 481 EETKKILGTGQETTEGRS 498
           EETKKILGT QE TEG+S
Sbjct: 490 EETKKILGTWQEATEGKS 507

BLAST of CmaCh18G003650 vs. ExPASy TrEMBL
Match: A0A5A7UM45 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold861G00740 PE=4 SV=1)

HSP 1 Score: 861.7 bits (2225), Expect = 1.6e-246
Identity = 422/498 (84.74%), Postives = 461/498 (92.57%), Query Frame = 0

Query: 1   MDQ-FFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRI 60
           MDQ  FSKALT YALA R YHT RLKKATLYAKISPLGDP++SVE ELDGWV+EGKKVR+
Sbjct: 1   MDQKLFSKALTHYALASRSYHTTRLKKATLYAKISPLGDPSISVESELDGWVQEGKKVRV 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120
           AELQRII D RKR RF+QAL+VSEWMKK+G CIFSP+EHAVQLDLIGRVRGYLSAE YFN
Sbjct: 61  AELQRIIRDFRKRSRFSQALQVSEWMKKSGACIFSPTEHAVQLDLIGRVRGYLSAEKYFN 120

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180
           QLKEQDQ  KTYGALLNCYVRQ+QV+KSLSHLQKMKE+GFATSELTYND+MCLYT VGQH
Sbjct: 121 QLKEQDQNIKTYGALLNCYVRQQQVDKSLSHLQKMKELGFATSELTYNDIMCLYTRVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240
           +KVPEVLAEMK  NVSPDNFSYRICINSYGAR+DLEGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 EKVPEVLAEMKGNNVSPDNFSYRICINSYGARKDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDT 300
           VANFFIKA L DKAVDAL+K+EE+LKSKDRIGHNHLISLY TLGNKEKVLR+WNLDKT T
Sbjct: 241 VANFFIKAGLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRVWNLDKTAT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 301 TRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 ALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGI 420
            LLE+L +  K TTPNSWGAVAV+Y+DRGETEK++ECMKAAL++N DKGWKPN RVITG+
Sbjct: 361 TLLENLNQNEKATTPNSWGAVAVKYLDRGETEKALECMKAALSVNTDKGWKPNPRVITGV 420

Query: 421 LNWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDED 480
           LNWLG+   +EEVEAFV +LRS IPVNREMYHAL+KV+IR  KEV+E+LN+MK+DKI+ED
Sbjct: 421 LNWLGDKGIVEEVEAFVSALRSVIPVNREMYHALLKVYIRADKEVNEVLNKMKADKINED 480

Query: 481 EETKKILGTGQETTEGRS 498
           EETKKILGT +ETTEG+S
Sbjct: 481 EETKKILGTWEETTEGKS 498

BLAST of CmaCh18G003650 vs. ExPASy TrEMBL
Match: A0A1S3B5N2 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Cucumis melo OX=3656 GN=LOC103486296 PE=4 SV=1)

HSP 1 Score: 861.7 bits (2225), Expect = 1.6e-246
Identity = 422/498 (84.74%), Postives = 461/498 (92.57%), Query Frame = 0

Query: 1   MDQ-FFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRI 60
           MDQ  FSKALT YALA R YHT RLKKATLYAKISPLGDP++SVE ELDGWV+EGKKVR+
Sbjct: 1   MDQKLFSKALTHYALASRSYHTTRLKKATLYAKISPLGDPSISVESELDGWVQEGKKVRV 60

Query: 61  AELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFN 120
           AELQRII D RKR RF+QAL+VSEWMKK+G CIFSP+EHAVQLDLIGRVRGYLSAE YFN
Sbjct: 61  AELQRIIRDFRKRSRFSQALQVSEWMKKSGACIFSPTEHAVQLDLIGRVRGYLSAEKYFN 120

Query: 121 QLKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQH 180
           QLKEQDQ  KTYGALLNCYVRQ+QV+KSLSHLQKMKE+GFATSELTYND+MCLYT VGQH
Sbjct: 121 QLKEQDQNIKTYGALLNCYVRQQQVDKSLSHLQKMKELGFATSELTYNDIMCLYTRVGQH 180

Query: 181 DKVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAV 240
           +KVPEVLAEMK  NVSPDNFSYRICINSYGAR+DLEGMENVLKEMESQPHIVMDWNTYAV
Sbjct: 181 EKVPEVLAEMKGNNVSPDNFSYRICINSYGARKDLEGMENVLKEMESQPHIVMDWNTYAV 240

Query: 241 VANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDT 300
           VANFFIKA L DKAVDAL+K+EE+LKSKDRIGHNHLISLY TLGNKEKVLR+WNLDKT T
Sbjct: 241 VANFFIKAGLTDKAVDALRKSEEKLKSKDRIGHNHLISLYATLGNKEKVLRVWNLDKTAT 300

Query: 301 TRFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360
           TR INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE
Sbjct: 301 TRIINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAE 360

Query: 361 ALLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGI 420
            LLE+L +  K TTPNSWGAVAV+Y+DRGETEK++ECMKAAL++N DKGWKPN RVITG+
Sbjct: 361 TLLENLNQNEKATTPNSWGAVAVKYLDRGETEKALECMKAALSVNTDKGWKPNPRVITGV 420

Query: 421 LNWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDED 480
           LNWLG+   +EEVEAFV +LRS IPVNREMYHAL+KV+IR  KEV+E+LN+MK+DKI+ED
Sbjct: 421 LNWLGDKGIVEEVEAFVSALRSVIPVNREMYHALLKVYIRADKEVNEVLNKMKADKINED 480

Query: 481 EETKKILGTGQETTEGRS 498
           EETKKILGT +ETTEG+S
Sbjct: 481 EETKKILGTWEETTEGKS 498

BLAST of CmaCh18G003650 vs. NCBI nr
Match: XP_022994385.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Cucurbita maxima])

HSP 1 Score: 1020.8 bits (2638), Expect = 4.3e-294
Identity = 508/508 (100.00%), Postives = 508/508 (100.00%), Query Frame = 0

Query: 1   MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIA 60
           MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIA
Sbjct: 1   MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIA 60

Query: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120
           ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ
Sbjct: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120

Query: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180
           LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD
Sbjct: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180

Query: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240
           KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV
Sbjct: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240

Query: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTT 300
           ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTT
Sbjct: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTT 300

Query: 301 RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360
           RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA
Sbjct: 301 RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360

Query: 361 LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420
           LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL
Sbjct: 361 LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420

Query: 421 NWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDE 480
           NWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDE
Sbjct: 421 NWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDE 480

Query: 481 ETKKILGTGQETTEGRSHIESSFFLFRE 509
           ETKKILGTGQETTEGRSHIESSFFLFRE
Sbjct: 481 ETKKILGTGQETTEGRSHIESSFFLFRE 508

BLAST of CmaCh18G003650 vs. NCBI nr
Match: XP_022954890.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Cucurbita moschata])

HSP 1 Score: 977.2 bits (2525), Expect = 5.5e-281
Identity = 486/495 (98.18%), Postives = 487/495 (98.38%), Query Frame = 0

Query: 1   MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIA 60
           MDQFFSKALTRYALA RFYHTNRLKKATLYAKISPLGDP+VSVEP LDGWVKEGKKVRIA
Sbjct: 1   MDQFFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRIA 60

Query: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120
           ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ
Sbjct: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120

Query: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180
           LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD
Sbjct: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180

Query: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240
           KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV
Sbjct: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240

Query: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTT 300
           ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLY TLGNKEKVLRLWNLDKTDTT
Sbjct: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDTT 300

Query: 301 RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360
           R INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA
Sbjct: 301 RLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360

Query: 361 LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420
           LLEDLMEKGKTTTPN WGAVAVQYMDR ETEKSVECMKAALTLNMDKGWKPNLRVITGIL
Sbjct: 361 LLEDLMEKGKTTTPNCWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420

Query: 421 NWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDE 480
           NWLGENASIEEVEAFVGSLRS IPVNREMYHALMK HIRGGKEVHELLNQMKSDKIDEDE
Sbjct: 421 NWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKIDEDE 480

Query: 481 ETKKILGTGQETTEG 496
           ETKKILGTGQETTEG
Sbjct: 481 ETKKILGTGQETTEG 495

BLAST of CmaCh18G003650 vs. NCBI nr
Match: KAG6573262.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 976.5 bits (2523), Expect = 9.4e-281
Identity = 486/496 (97.98%), Postives = 487/496 (98.19%), Query Frame = 0

Query: 1   MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIA 60
           MDQFFSKALTRYALA RFYHTNRLKKATLYAKISPLGDP+VSVEP LDGWVKEGKKVRIA
Sbjct: 1   MDQFFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRIA 60

Query: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120
           ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ
Sbjct: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120

Query: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180
           LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD
Sbjct: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180

Query: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240
           KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV
Sbjct: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240

Query: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTT 300
           ANFFIKADLADKAVDALKKAE RLKSKDRIGHNHLISLY TLGNKEKVLRLWNLDKTDTT
Sbjct: 241 ANFFIKADLADKAVDALKKAEVRLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDTT 300

Query: 301 RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360
           R INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA
Sbjct: 301 RLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360

Query: 361 LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420
           LLEDLMEKGKTTTPN WGAVAVQYMDR ETEKSVECMKAALTLNMDKGWKPNLRVITGIL
Sbjct: 361 LLEDLMEKGKTTTPNCWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420

Query: 421 NWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDE 480
           NWLGENASIEEVEAFVGSLRS IPVNREMYHALMK HIRGGKEVHELLNQMKSDKIDEDE
Sbjct: 421 NWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKIDEDE 480

Query: 481 ETKKILGTGQETTEGR 497
           ETKKILGTGQETTEGR
Sbjct: 481 ETKKILGTGQETTEGR 496

BLAST of CmaCh18G003650 vs. NCBI nr
Match: XP_023542644.1 (pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 975.3 bits (2520), Expect = 2.1e-280
Identity = 482/494 (97.57%), Postives = 488/494 (98.79%), Query Frame = 0

Query: 1   MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIA 60
           MDQFFSKALTRYALA RFYHTNRLKKATLYAKISPLGDP+VSVEPELDGWVKEGKKVRIA
Sbjct: 1   MDQFFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPELDGWVKEGKKVRIA 60

Query: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120
           ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ
Sbjct: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120

Query: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180
           LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELT+NDMMCLYTNVGQHD
Sbjct: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTFNDMMCLYTNVGQHD 180

Query: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240
           KVPEVLAEMKEKN+SPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV
Sbjct: 181 KVPEVLAEMKEKNISPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240

Query: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTT 300
           ANFFIKADL +KAVDAL+KAEERLKSKDRIGHNHLISLY TLGNKEKVLRLWNLDKTD T
Sbjct: 241 ANFFIKADLTEKAVDALRKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDAT 300

Query: 301 RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360
           RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA
Sbjct: 301 RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360

Query: 361 LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420
           LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL
Sbjct: 361 LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420

Query: 421 NWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDE 480
           NWLGENASIEEVEAFVGSLRS IPVNREMYHALMK HIRGGKEVHELLNQMKSDK+DEDE
Sbjct: 421 NWLGENASIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKLDEDE 480

Query: 481 ETKKILGTGQETTE 495
           ETKKILGTGQETTE
Sbjct: 481 ETKKILGTGQETTE 494

BLAST of CmaCh18G003650 vs. NCBI nr
Match: KAG7012430.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 974.5 bits (2518), Expect = 3.6e-280
Identity = 484/494 (97.98%), Postives = 486/494 (98.38%), Query Frame = 0

Query: 1   MDQFFSKALTRYALADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIA 60
           MDQFFSKALTRYALA RFYHTNRLKKATLYAKISPLGDP+VSVEP LDGWVKEGKKVRIA
Sbjct: 1   MDQFFSKALTRYALAGRFYHTNRLKKATLYAKISPLGDPSVSVEPVLDGWVKEGKKVRIA 60

Query: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120
           ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ
Sbjct: 61  ELQRIIHDLRKRKRFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQ 120

Query: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180
           LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD
Sbjct: 121 LKEQDQTDKTYGALLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHD 180

Query: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240
           KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV
Sbjct: 181 KVPEVLAEMKEKNVSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVV 240

Query: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTT 300
           ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLY TLGNKEKVLRLWNLDKTDTT
Sbjct: 241 ANFFIKADLADKAVDALKKAEERLKSKDRIGHNHLISLYATLGNKEKVLRLWNLDKTDTT 300

Query: 301 RFINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360
           R INRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA
Sbjct: 301 RLINRDYITMLESLVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEA 360

Query: 361 LLEDLMEKGKTTTPNSWGAVAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420
           LLEDLMEKGKTTTPNSWGAVAVQYMDR ETEKSVECMKAALTLNMDKGWKPNLRVITGIL
Sbjct: 361 LLEDLMEKGKTTTPNSWGAVAVQYMDRSETEKSVECMKAALTLNMDKGWKPNLRVITGIL 420

Query: 421 NWLGENASIEEVEAFVGSLRSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDE 480
           NWLGEN SIEEVEAFVGSLRS IPVNREMYHALMK HIRGGKEVHELLNQMKSDK+DEDE
Sbjct: 421 NWLGENGSIEEVEAFVGSLRSVIPVNREMYHALMKAHIRGGKEVHELLNQMKSDKLDEDE 480

Query: 481 ETKKILGTGQETTE 495
           ETKKILGTGQETTE
Sbjct: 481 ETKKILGTGQETTE 494

BLAST of CmaCh18G003650 vs. TAIR 10
Match: AT4G21705.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 606.7 bits (1563), Expect = 1.8e-173
Identity = 293/477 (61.43%), Postives = 372/477 (77.99%), Query Frame = 0

Query: 14  LADRFYHTNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIAELQRIIHDLRKRK 73
           +A R+Y+TNR+KK TLY+KISPLGDP  SV PEL  WV+ GKKV +AEL RI+HDLR+RK
Sbjct: 12  IASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRK 71

Query: 74  RFTQALEVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGA 133
           RF  ALEVS+WM +TGVC+FSP+EHAV LDLIGRV G+++AE YF  LKEQ + DKTYGA
Sbjct: 72  RFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGA 131

Query: 134 LLNCYVRQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHDKVPEVLAEMKEKN 193
           LLNCYVRQ+ VEKSL H +KMKEMGF TS LTYN++MCLYTN+GQH+KVP+VL EMKE+N
Sbjct: 132 LLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEEN 191

Query: 194 VSPDNFSYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVVANFFIKADLADKA 253
           V+PDN+SYRICIN++GA  DLE +   L++ME +  I MDWNTYAV A F+I     D+A
Sbjct: 192 VAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRA 251

Query: 254 VDALKKAEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTTRFINRDYITMLES 313
           V+ LK +E RL+ KD  G+NHLI+LY  LG K +VLRLW+L+K    R IN+DY+T+L+S
Sbjct: 252 VELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQS 311

Query: 314 LVRLGELEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKTTT 373
           LV++  L EAE+VL EW+SSGNCYDFRVPNTVI GYI K M E+AEA+LEDL  +GK TT
Sbjct: 312 LVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATT 371

Query: 374 PNSWGAVAVQYMDRGETEKSVECMKAALTLNM-DKGWKPNLRVITGILNWLGENASIEEV 433
           P SW  VA  Y ++G  E + +CMK AL + +  + W+P L ++T +L+W+G+  S++EV
Sbjct: 372 PESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKEV 431

Query: 434 EAFVGSLRSAIPVNREMYHALMKVHIR-GGKEVHELLNQMKSDKIDEDEETKKILGT 489
           E+FV SLR+ I VN++MYHAL+K  IR GG+ +  LL +MK DKI+ DEET  IL T
Sbjct: 432 ESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILST 488

BLAST of CmaCh18G003650 vs. TAIR 10
Match: AT1G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 292.0 bits (746), Expect = 9.9e-79
Identity = 154/429 (35.90%), Postives = 254/429 (59.21%), Query Frame = 0

Query: 29  LYAKISPLGDPNVSVEPELDGWVKEGKKVRIAELQRIIHDLRKRKRFTQALEVSEWMKKT 88
           +Y KIS +  P +     L+ W K G+K+   EL R++ +LRK KR  QALEV +WM   
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 89  GVCI-FSPSEHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKS 148
           G     S S+ A+QLDLIG+VRG   AE +F QL E  +  + YG+LLN YVR +  EK+
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 149 LSHLQKMKEMGFATSELTYNDMMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 208
            + L  M++ G+A   L +N MM LY N+ ++DKV  ++ EMK+K++  D +SY I ++S
Sbjct: 189 EALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSS 248

Query: 209 YGARRDLEGMENVLKEMESQPHIVMDWNTYAVVANFFIKADLADKAVDALKKAEERLKSK 268
            G+   +E ME V ++M+S   I  +W T++ +A  +IK    +KA DAL+K E R+  +
Sbjct: 249 CGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGR 308

Query: 269 DRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTTRFINRDYITMLESLVRLGELEEAEKVL 328
           +RI +++L+SLY +LGNK+++ R+W++ K+      N  Y  ++ SLVR+G++E AEKV 
Sbjct: 309 NRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVY 368

Query: 329 KEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKTTTPNSWGAVAVQYMDR 388
           +EW    + YD R+PN ++  Y+     E AE L + ++E G   + ++W  +AV +  +
Sbjct: 369 EEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRK 428

Query: 389 GETEKSVECMKAALTLNMDKGWKPNLRVITGILNWLGENASIEEVEAFVGSLRSAIPVNR 448
               +++ C++ A +      W+P + +++G      E + +   EA +  LR +  +  
Sbjct: 429 RCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGDLED 488

Query: 449 EMYHALMKV 457
           + Y AL+ V
Sbjct: 489 KSYLALIDV 497

BLAST of CmaCh18G003650 vs. TAIR 10
Match: AT2G20710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 287.0 bits (733), Expect = 3.2e-77
Identity = 147/404 (36.39%), Postives = 246/404 (60.89%), Query Frame = 0

Query: 28  TLYAKISPLGDPNVSVEPELDGWVKEGKKVRIAELQRIIHDLRKRKRFTQALEVSEWMKK 87
           TL  +++  GDP+ S+   LDGW+ +G  V+ +EL  II  LRK  RF+ AL++S+WM +
Sbjct: 39  TLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSE 98

Query: 88  TGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVRQRQVEKS 147
             V   S  + A++LDLI +V G   AE +F  +  + +    YGALLNCY  ++ + K+
Sbjct: 99  HRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKA 158

Query: 148 LSHLQKMKEMGFATSELTYNDMMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFSYRICINS 207
               Q+MKE+GF    L YN M+ LY   G++  V ++L EM+++ V PD F+    +++
Sbjct: 159 EQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHA 218

Query: 208 YGARRDLEGMENVLKEMESQPHIVMDWNTYAVVANFFIKADLADKAVDALKKAEERLKS- 267
           Y    D+EGME  L   E+   + +DW TYA  AN +IKA L +KA++ L+K+E+ + + 
Sbjct: 219 YSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQ 278

Query: 268 KDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTTRFINRDYITMLESLVRLGELEEAEKV 327
           K +  +  L+S Y   G KE+V RLW+L K +   F N  YI+++ +L+++ ++EE EK+
Sbjct: 279 KRKHAYEVLMSFYGAAGKKEEVYRLWSLYK-ELDGFYNTGYISVISALLKMDDIEEVEKI 338

Query: 328 LKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKTTTPNSWGAVAVQYMD 387
           ++EWE+  + +D R+P+ +I GY  KGM E+AE ++  L++K +    ++W  +A+ Y  
Sbjct: 339 MEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKM 398

Query: 388 RGETEKSVECMKAALTLNMDKGWKPNLRVITGILNWLGENASIE 431
            G+ EK+VE  K A+ ++   GW+P+  V+   +++L     +E
Sbjct: 399 AGKMEKAVEKWKRAIEVS-KPGWRPHQVVLMSCVDYLEGQRDME 440

BLAST of CmaCh18G003650 vs. TAIR 10
Match: AT4G01990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 276.9 bits (707), Expect = 3.3e-74
Identity = 153/467 (32.76%), Postives = 266/467 (56.96%), Query Frame = 0

Query: 21  TNRLKKATLYAKISPLGD-PNVSVEPELDGWVKEGKKVRIAELQRIIHDLRKRKRFTQAL 80
           T   K  ++Y K+S LG      +E  L+ +V EG  V+  +L R   DLRK ++  +AL
Sbjct: 33  TKAKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDLIRYAKDLRKFRQPQRAL 92

Query: 81  EVSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYV 140
           E+ EWM++  +  F+ S+HA++L+LI + +G  +AE+YFN L +  +   TYG+LLNCY 
Sbjct: 93  EIFEWMERKEIA-FTGSDHAIRLNLIAKSKGLEAAETYFNSLDDSIKNQSTYGSLLNCYC 152

Query: 141 RQRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHDKVPEVLAEMKEKNVSPDNF 200
            +++  K+ +H + M ++   ++ L +N++M +Y  +GQ +KVP ++  MKEK+++P + 
Sbjct: 153 VEKEEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVPALVVAMKEKSITPCDI 212

Query: 201 SYRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVVANFFIKADLADKAVDALKK 260
           +Y + I S G+ +DL+G+E VL EM+++   +  WNT+A +A  +IK  L  KA +ALK 
Sbjct: 213 TYSMWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAIYIKVGLYGKAEEALKS 272

Query: 261 AEERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTTRFINRDYITMLESLVRLGE 320
            E  +    R  ++ LI+LYT + N  +V R+W+L K       N  Y+TML +L +L +
Sbjct: 273 LENNMNPDVRDCYHFLINLYTGIANASEVYRVWDLLKKRYPNVNNSSYLTMLRALSKLDD 332

Query: 321 LEEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKTTTPNSWGA 380
           ++  +KV  EWES+   YD R+ N  I  Y+ + M E AEA+    M+K K     +   
Sbjct: 333 IDGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEAVFNGAMKKCKGQFSKARQL 392

Query: 381 VAVQYMDRGETEKSVECMKAALTLNMDKGWKPNLRVITGILNWLGENASIEEVEAFVGSL 440
           + +  +   + + +++  +AA+ L+ DK W  +  +I+       E   ++  E F  +L
Sbjct: 393 LMMHLLKNDQADLALKHFEAAV-LDQDKNWTWSSELISSFFLHFEEAKDVDGAEEFCKTL 452

Query: 441 RSAIPVNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDEETKKIL 487
               P++ E Y  LMK ++  GK   ++  +++   I  DEE + +L
Sbjct: 453 TKWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGILVDEEQECLL 497

BLAST of CmaCh18G003650 vs. TAIR 10
Match: AT1G60770.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 265.4 bits (677), Expect = 1.0e-70
Identity = 140/469 (29.85%), Postives = 256/469 (54.58%), Query Frame = 0

Query: 21  TNRLKKATLYAKISPLGDPNVSVEPELDGWVKEGKKVRIAELQRIIHDLRKRKRFTQALE 80
           T +  +  LY ++   G   V V  +L+ ++K  K V   E+   I  LR R  +  AL+
Sbjct: 17  TKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALK 76

Query: 81  VSEWMKKTGVCIFSPSEHAVQLDLIGRVRGYLSAESYFNQLKEQDQTDKTYGALLNCYVR 140
           +SE M++ G+   + S+ A+ LDL+ + R   + E+YF  L E  +T+ TYG+LLNCY +
Sbjct: 77  LSEVMEERGM-NKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCK 136

Query: 141 QRQVEKSLSHLQKMKEMGFATSELTYNDMMCLYTNVGQHDKVPEVLAEMKEKNVSPDNFS 200
           +   EK+   L KMKE+    S ++YN +M LYT  G+ +KVP ++ E+K +NV PD+++
Sbjct: 137 ELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYT 196

Query: 201 YRICINSYGARRDLEGMENVLKEMESQPHIVMDWNTYAVVANFFIKADLADKAVDALKKA 260
           Y + + +  A  D+ G+E V++EM     +  DW TY+ +A+ ++ A L+ KA  AL++ 
Sbjct: 197 YNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQEL 256

Query: 261 EERLKSKDRIGHNHLISLYTTLGNKEKVLRLWNLDKTDTTRFINRDYITMLESLVRLGEL 320
           E +   +D   +  LI+LY  LG   +V R+W   +    +  N  Y+ M++ LV+L +L
Sbjct: 257 EMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDL 316

Query: 321 EEAEKVLKEWESSGNCYDFRVPNTVIVGYIDKGMCERAEALLEDLMEKGKTTTPNSWGAV 380
             AE + KEW+++ + YD R+ N +I  Y  +G+ ++A  L E    +G      +W   
Sbjct: 317 PGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIF 376

Query: 381 AVQYMDRGETEKSVECMKAALTLNMDKG--WKPNLRVITGILNWLGENASIEEVEAFVGS 440
              Y+  G+  +++ECM  A+++    G  W P+   +  ++++  +   +   E  +  
Sbjct: 377 MDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLEI 436

Query: 441 LRSAIP-VNREMYHALMKVHIRGGKEVHELLNQMKSDKIDEDEETKKIL 487
           L++    +  E++  L++ +   GK    +  ++K + ++ +E TKK+L
Sbjct: 437 LKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLL 484

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q84JR32.6e-17261.43Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop... [more]
Q8LPS61.4e-7735.90Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX... [more]
Q9SKU64.5e-7636.39Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop... [more]
Q93WC54.7e-7332.76Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidop... [more]
O227141.4e-6929.85Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1K1242.1e-294100.00pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
A0A6J1GTN52.7e-28198.18pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
A0A6J1CGU21.9e-25888.55pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Momordic... [more]
A0A5A7UM451.6e-24684.74Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3B5N21.6e-24684.74pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Cucumis ... [more]
Match NameE-valueIdentityDescription
XP_022994385.14.3e-294100.00pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
XP_022954890.15.5e-28198.18pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
KAG6573262.19.4e-28197.98Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_023542644.12.1e-28097.57pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 ... [more]
KAG7012430.13.6e-28097.98Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
Match NameE-valueIdentityDescription
AT4G21705.11.8e-17361.43Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G02150.19.9e-7935.90Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G20710.13.2e-7736.39Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G01990.13.3e-7432.76Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G60770.11.0e-7029.85Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 52..72
NoneNo IPR availableCOILSCoilCoilcoord: 250..270
NoneNo IPR availablePANTHERPTHR45717OS12G0527900 PROTEINcoord: 14..488
NoneNo IPR availablePANTHERPTHR45717:SF20OS07G0598500 PROTEINcoord: 14..488
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 403..504
e-value: 4.2E-5
score: 25.1
coord: 246..402
e-value: 4.4E-13
score: 51.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 58..244
e-value: 3.3E-24
score: 87.9
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 111..404
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 310..404
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 165..197
e-value: 5.7E-6
score: 24.2
coord: 307..337
e-value: 4.1E-4
score: 18.3
coord: 199..233
e-value: 0.0013
score: 16.8
coord: 130..159
e-value: 5.6E-5
score: 21.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 165..208
e-value: 6.5E-9
score: 35.9
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 307..334
e-value: 0.0026
score: 17.9
coord: 130..159
e-value: 1.4E-4
score: 21.8
coord: 343..369
e-value: 0.065
score: 13.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 162..196
score: 10.39137
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 127..161
score: 9.613118

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh18G003650.1CmaCh18G003650.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005739 mitochondrion
molecular_function GO:0003700 DNA-binding transcription factor activity
molecular_function GO:0008168 methyltransferase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0043565 sequence-specific DNA binding