CmaCh14G008690 (gene) Cucurbita maxima (Rimu)

Name: CmaCh14G008690
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Pentatricopeptide repeat-containing family protein
Location: Cma_Chr14 : 4481662 .. 4483446 (+)
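
The location field gives a sequence ID, a start, an end and a strand, so the feature length can be derived from it. A minimal sketch, assuming 1-based inclusive coordinates (GFF-style); `parse_location` is an illustrative helper, not part of any database API:

```python
import re

def parse_location(text):
    """Parse 'SeqID : start .. end (strand)' into its parts.

    Assumes 1-based, inclusive coordinates; the record itself does not
    state the convention, so treat this as an assumption.
    """
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive interval
    }

loc = parse_location("Cma_Chr14 : 4481662 .. 4483446 (+)")
print(loc["length"])  # 1785
```

Under that assumption the feature spans 1785 nt, which is a multiple of three.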
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGCCAATTCCACATTCAAACTCTCCAATTTCTCCAGTTCCCTCCCCTCCAAACCTTCCTTCCGCTACTCCACATGGCACTCCCCGCCGCCTCCGGCGGCGGCAGCCGATCCTGTACTCGCCGCCGTCTCCACAGCCATCAACAACGTCGAAACAAAGCCTCTCGCCTCTTCTCTTCGGCGGCTCCTCCCTTCCTTCAAACCTCACCATTTCATTGACCTCATTAACCATAACCCCTTCTCTCTCTCCCCTGTCTCTCTCTTCTCCTTCTTCAATTGGCTCTCTTCTGTCCCCACCTTCCGCCACACCCTCCAATCCTACTGCGCTATGGCTAATTTCCTCTGCACCCATCAAATGTTCGAAGAATCACAATCGATCATCCGATTTCTCGTCTCCCGCAAAGGTAAGGACTCGGCGGCTTCGATCTTCGCCGCGATTCTTGAAATTACAGATACGCGTTGTTCGAATTTTGTATTTGATGCTTTGATGATTGCGTATTCGGATTCTGGGTTCATCTCCGATGCGATTCAGTGCTTTAGGTTGGTCAGGAAGAGAAATTTTCAAATCCCGTTTCGTGGATGTGAGTACTTACTTGATAAAATGATGAATTCAAACTCCCCTGTTACGATTTGGACGTTTTATCTGGAAATTTTGGATTCTGGATTCCCGCCTAAAGTAAAGTATTTCAACATTTTGATTAATAAGTTCTGTAAACAGGGTAGCATTAGAGATGCCAGGTTGATCTTCGATGAAATTGGGAAGAGGGGTTTTCGTCCCACAACTGTTAGTTTCAATACCTTGATTAATGGTCTCTGTAAATCCCGAAATTTAGATGAGAGTTTTAGGTTGAAGAAAGCCATGGAAGAGAATAGAATATATCCTGATGTTTACACTTACAGTGTTCTGATTCATGGGTTATGCAAGGAAGGTAGGGTAGATGATGCAGAACAACTGTTCGATGAAATGCGTCAGAGAGGATTAAGGGCAAACGACGTTACATTCACTGCTTTGATTGATGGGCAATGCAGGAGCGGACGAATTGACTCAGCCATGAACACTTATCAGCAAATGTTAGCCATGGGAGTGAAACCAGATTTAGTTATGTATAACACACTCTTGAATGGCCTCTGCAAAGTTGGGGATGTTAGTAAAGCTAGGAAGCTGGTCGATGAAATGAAAATGGTGGGGATGAAACCAGATAAAATCACTTACACAACTCTCATAGATGGTTACTGCAAAGAGGGAGATTTAGAATCAGCCATGGAGATTAGGAAAGGGATGAATGTAGAAGGGGTTGTTCTTGATAATGTAGCATTCACAGCCATTATTTCAGGTTTGTGTAGAGATGGAAGGGTGATGGATGCAGAGGGGACCTTGAGGGAGATGAAGGAAGCTGGGATGAAACCCGACGATGCGACGTATACTATGGTGATCGACGGGTATTGCAAGAACGGCGATGTTAAGCCGGGGTTTAAGTTGCTGAAAGAGATGCAGAGAAATGGCCATAATCCTGGTGTGATAACTTACAATGTGCTTATGAATGGGCTTTGCAAGCAAGGACAGATGAAGAATGCCAATATGCTGTTGGAAGCAATGCTTAACTTAGGAGTAACTCCTGATGACATTACATACAATATTCTGTTGGAAGGGCACTGTAAAAGTGGAAGAGCAGAAGATTTCCTTCACCTCAGAAATGAGAAAGGGCTCGTAGTAGACTACGCGTATTATACTTCTTTAGTCGGTGAATACGATAAATCATTGAAGGATCGTCGAAAGAGGTGA

mRNA sequence

ATGGCCGCCAATTCCACATTCAAACTCTCCAATTTCTCCAGTTCCCTCCCCTCCAAACCTTCCTTCCGCTACTCCACATGGCACTCCCCGCCGCCTCCGGCGGCGGCAGCCGATCCTGTACTCGCCGCCGTCTCCACAGCCATCAACAACGTCGAAACAAAGCCTCTCGCCTCTTCTCTTCGGCGGCTCCTCCCTTCCTTCAAACCTCACCATTTCATTGACCTCATTAACCATAACCCCTTCTCTCTCTCCCCTGTCTCTCTCTTCTCCTTCTTCAATTGGCTCTCTTCTGTCCCCACCTTCCGCCACACCCTCCAATCCTACTGCGCTATGGCTAATTTCCTCTGCACCCATCAAATGTTCGAAGAATCACAATCGATCATCCGATTTCTCGTCTCCCGCAAAGGTAAGGACTCGGCGGCTTCGATCTTCGCCGCGATTCTTGAAATTACAGATACGCGTTGTTCGAATTTTGTATTTGATGCTTTGATGATTGCGTATTCGGATTCTGGGTTCATCTCCGATGCGATTCAGTGCTTTAGGTTGGTCAGGAAGAGAAATTTTCAAATCCCGTTTCGTGGATGTGAGTACTTACTTGATAAAATGATGAATTCAAACTCCCCTGTTACGATTTGGACGTTTTATCTGGAAATTTTGGATTCTGGATTCCCGCCTAAAGTAAAGTATTTCAACATTTTGATTAATAAGTTCTGTAAACAGGGTAGCATTAGAGATGCCAGGTTGATCTTCGATGAAATTGGGAAGAGGGGTTTTCGTCCCACAACTGTTAGTTTCAATACCTTGATTAATGGTCTCTGTAAATCCCGAAATTTAGATGAGAGTTTTAGGTTGAAGAAAGCCATGGAAGAGAATAGAATATATCCTGATGTTTACACTTACAGTGTTCTGATTCATGGGTTATGCAAGGAAGGTAGGGTAGATGATGCAGAACAACTGTTCGATGAAATGCGTCAGAGAGGATTAAGGGCAAACGACGTTACATTCACTGCTTTGATTGATGGGCAATGCAGGAGCGGACGAATTGACTCAGCCATGAACACTTATCAGCAAATGTTAGCCATGGGAGTGAAACCAGATTTAGTTATGTATAACACACTCTTGAATGGCCTCTGCAAAGTTGGGGATGTTAGTAAAGCTAGGAAGCTGGTCGATGAAATGAAAATGGTGGGGATGAAACCAGATAAAATCACTTACACAACTCTCATAGATGGTTACTGCAAAGAGGGAGATTTAGAATCAGCCATGGAGATTAGGAAAGGGATGAATGTAGAAGGGGTTGTTCTTGATAATGTAGCATTCACAGCCATTATTTCAGGTTTGTGTAGAGATGGAAGGGTGATGGATGCAGAGGGGACCTTGAGGGAGATGAAGGAAGCTGGGATGAAACCCGACGATGCGACGTATACTATGGTGATCGACGGGTATTGCAAGAACGGCGATGTTAAGCCGGGGTTTAAGTTGCTGAAAGAGATGCAGAGAAATGGCCATAATCCTGGTGTGATAACTTACAATGTGCTTATGAATGGGCTTTGCAAGCAAGGACAGATGAAGAATGCCAATATGCTGTTGGAAGCAATGCTTAACTTAGGAGTAACTCCTGATGACATTACATACAATATTCTGTTGGAAGGGCACTGTAAAAGTGGAAGAGCAGAAGATTTCCTTCACCTCAGAAATGAGAAAGGGCTCGTAGTAGACTACGCGTATTATACTTCTTTAGTCGGTGAATACGATAAATCATTGAAGGATCGTCGAAAGAGGTGA

Coding sequence (CDS)

ATGGCCGCCAATTCCACATTCAAACTCTCCAATTTCTCCAGTTCCCTCCCCTCCAAACCTTCCTTCCGCTACTCCACATGGCACTCCCCGCCGCCTCCGGCGGCGGCAGCCGATCCTGTACTCGCCGCCGTCTCCACAGCCATCAACAACGTCGAAACAAAGCCTCTCGCCTCTTCTCTTCGGCGGCTCCTCCCTTCCTTCAAACCTCACCATTTCATTGACCTCATTAACCATAACCCCTTCTCTCTCTCCCCTGTCTCTCTCTTCTCCTTCTTCAATTGGCTCTCTTCTGTCCCCACCTTCCGCCACACCCTCCAATCCTACTGCGCTATGGCTAATTTCCTCTGCACCCATCAAATGTTCGAAGAATCACAATCGATCATCCGATTTCTCGTCTCCCGCAAAGGTAAGGACTCGGCGGCTTCGATCTTCGCCGCGATTCTTGAAATTACAGATACGCGTTGTTCGAATTTTGTATTTGATGCTTTGATGATTGCGTATTCGGATTCTGGGTTCATCTCCGATGCGATTCAGTGCTTTAGGTTGGTCAGGAAGAGAAATTTTCAAATCCCGTTTCGTGGATGTGAGTACTTACTTGATAAAATGATGAATTCAAACTCCCCTGTTACGATTTGGACGTTTTATCTGGAAATTTTGGATTCTGGATTCCCGCCTAAAGTAAAGTATTTCAACATTTTGATTAATAAGTTCTGTAAACAGGGTAGCATTAGAGATGCCAGGTTGATCTTCGATGAAATTGGGAAGAGGGGTTTTCGTCCCACAACTGTTAGTTTCAATACCTTGATTAATGGTCTCTGTAAATCCCGAAATTTAGATGAGAGTTTTAGGTTGAAGAAAGCCATGGAAGAGAATAGAATATATCCTGATGTTTACACTTACAGTGTTCTGATTCATGGGTTATGCAAGGAAGGTAGGGTAGATGATGCAGAACAACTGTTCGATGAAATGCGTCAGAGAGGATTAAGGGCAAACGACGTTACATTCACTGCTTTGATTGATGGGCAATGCAGGAGCGGACGAATTGACTCAGCCATGAACACTTATCAGCAAATGTTAGCCATGGGAGTGAAACCAGATTTAGTTATGTATAACACACTCTTGAATGGCCTCTGCAAAGTTGGGGATGTTAGTAAAGCTAGGAAGCTGGTCGATGAAATGAAAATGGTGGGGATGAAACCAGATAAAATCACTTACACAACTCTCATAGATGGTTACTGCAAAGAGGGAGATTTAGAATCAGCCATGGAGATTAGGAAAGGGATGAATGTAGAAGGGGTTGTTCTTGATAATGTAGCATTCACAGCCATTATTTCAGGTTTGTGTAGAGATGGAAGGGTGATGGATGCAGAGGGGACCTTGAGGGAGATGAAGGAAGCTGGGATGAAACCCGACGATGCGACGTATACTATGGTGATCGACGGGTATTGCAAGAACGGCGATGTTAAGCCGGGGTTTAAGTTGCTGAAAGAGATGCAGAGAAATGGCCATAATCCTGGTGTGATAACTTACAATGTGCTTATGAATGGGCTTTGCAAGCAAGGACAGATGAAGAATGCCAATATGCTGTTGGAAGCAATGCTTAACTTAGGAGTAACTCCTGATGACATTACATACAATATTCTGTTGGAAGGGCACTGTAAAAGTGGAAGAGCAGAAGATTTCCTTCACCTCAGAAATGAGAAAGGGCTCGTAGTAGACTACGCGTATTATACTTCTTTAGTCGGTGAATACGATAAATCATTGAAGGATCGTCGAAAGAGGTGA

Protein sequence

MAANSTFKLSNFSSSLPSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNVETKPLASSLRRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQSIIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNGDVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR
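
The protein sequence is the conceptual translation of the CDS. A minimal sketch of that step, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` is an illustrative helper, and the two inputs are the first and last 15 nucleotides of the CDS above:

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO)
)

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence codon by codon.

    Raises ValueError if the length is not a multiple of three and
    KeyError on ambiguous bases such as N.
    """
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCCGCCAATTCC"))  # MAANS
print(translate("CGTCGAAAGAGGTGA"))  # RRKR
```

The outputs match the first five and last four residues of the protein sequence listed above.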

BLAST of CmaCh14G008690 vs. Swiss-Prot
Match: PPR26_ARATH (Putative pentatricopeptide repeat-containing protein At1g09680 OS=Arabidopsis thaliana GN=At1g09680 PE=3 SV=1)

HSP 1 Score: 666.0 bits (1717), Expect = 3.8e-190
Identity = 332/588 (56.46%), Positives = 426/588 (72.45%), Query Frame = 1

Query: 18  SKPSFRYSTWHSPPPPAAAA---DPVLAAVSTAI-NNVETKPL-------ASSLRRLLPS 77
           S+ SF  STW+S    +AA    DPVL  +S AI ++ +  PL         S+R++LPS
Sbjct: 20  SRASFLLSTWYSQESVSAADNDDDPVLVKLSVAIRDSYKDPPLEFSSFTDCPSIRKVLPS 79

Query: 78  FKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQS 137
              HH +DLINHNP SL   S+F+FF ++SS P FR T+++Y  +A FL  H+MF E+QS
Sbjct: 80  LSVHHVVDLINHNPLSLPQRSIFAFFKFISSQPGFRFTVETYFVLARFLAVHEMFTEAQS 139

Query: 138 IIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKR 197
           +I  +VSRKGK+SA+S+F +++E+  T    F+ DALMI Y+D GFI DAIQCFRL RK 
Sbjct: 140 LIELVVSRKGKNSASSVFISLVEMRVTPMCGFLVDALMITYTDLGFIPDAIQCFRLSRKH 199

Query: 198 NFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDA 257
            F +P RGC  LLD+MM  N   TIW FY+EILD+GFP  V  FNIL+NKFCK+G+I DA
Sbjct: 200 RFDVPIRGCGNLLDRMMKLNPTGTIWGFYMEILDAGFPLNVYVFNILMNKFCKEGNISDA 259

Query: 258 RLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHG 317
           + +FDEI KR  +PT VSFNTLING CK  NLDE FRLK  ME++R  PDV+TYS LI+ 
Sbjct: 260 QKVFDEITKRSLQPTVVSFNTLINGYCKVGNLDEGFRLKHQMEKSRTRPDVFTYSALINA 319

Query: 318 LCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPD 377
           LCKE ++D A  LFDEM +RGL  NDV FT LI G  R+G ID    +YQ+ML+ G++PD
Sbjct: 320 LCKENKMDGAHGLFDEMCKRGLIPNDVIFTTLIHGHSRNGEIDLMKESYQKMLSKGLQPD 379

Query: 378 LVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRK 437
           +V+YNTL+NG CK GD+  AR +VD M   G++PDKITYTTLIDG+C+ GD+E+A+EIRK
Sbjct: 380 IVLYNTLVNGFCKNGDLVAARNIVDGMIRRGLRPDKITYTTLIDGFCRGGDVETALEIRK 439

Query: 438 GMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNG 497
            M+  G+ LD V F+A++ G+C++GRV+DAE  LREM  AG+KPDD TYTM++D +CK G
Sbjct: 440 EMDQNGIELDRVGFSALVCGMCKEGRVIDAERALREMLRAGIKPDDVTYTMMMDAFCKKG 499

Query: 498 DVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYN 557
           D + GFKLLKEMQ +GH P V+TYNVL+NGLCK GQMKNA+MLL+AMLN+GV PDDITYN
Sbjct: 500 DAQTGFKLLKEMQSDGHVPSVVTYNVLLNGLCKLGQMKNADMLLDAMLNIGVVPDDITYN 559

Query: 558 ILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 595
            LLEGH +   +      + E G+V D A Y S+V E D++ KD R R
Sbjct: 560 TLLEGHHRHANSSKRYIQKPEIGIVADLASYKSIVNELDRASKDHRNR 607
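
The percentages in an HSP summary line are ratios over the alignment length (gaps included): identities count exactly matching residues, and positives additionally count substitutions that score above zero in the scoring matrix. A minimal sketch that recomputes both from the raw counts; `hsp_percentages` is an illustrative helper, not a BLAST API:

```python
import re

# Accept either 'Positives' or a misspelled 'Postives' label.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives) rounded to 2 places."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )

line = "Identity = 332/588 (56.46%), Positives = 426/588 (72.45%), Query Frame = 1"
print(hsp_percentages(line))  # (56.46, 72.45)
```

Recomputing the ratios is a cheap consistency check when such lines are parsed from text exports.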

BLAST of CmaCh14G008690 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 308.9 bits (790), Expect = 1.2e-82
Identity = 179/578 (30.97%), Positives = 307/578 (53.11%), Query Frame = 1

Query: 25  STWHSPPPPAAAADPVLAAVSTAINNVETKPLASSLRRLLPSFKPHHFIDLI--NHNPFS 84
           ST+ S P  +  AD  L  +         K     L  L  +F P    +L+  + N  +
Sbjct: 13  STFASSPSDSLLADKALTFL---------KRHPYQLHHLSANFTPEAASNLLLKSQNDQA 72

Query: 85  LSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQSIIRFLVSRKGKDSAAS 144
           L    +  F NW +    F  TL+  C   + L   ++++ +Q +   + ++   D  AS
Sbjct: 73  L----ILKFLNWANPHQFF--TLRCKCITLHILTKFKLYKTAQILAEDVAAKTLDDEYAS 132

Query: 145 IFAAILEITDTRC--SNFVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLD 204
           +    L+ T   C  ++ VFD ++ +YS    I  A+    L +   F         +LD
Sbjct: 133 LVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLD 192

Query: 205 KMMNSNSPVTIW-TFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFR 264
             + S   ++     + E+L+S   P V  +NILI  FC  G+I  A  +FD++  +G  
Sbjct: 193 ATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCL 252

Query: 265 PTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHGLCKEGRVDDAEQL 324
           P  V++NTLI+G CK R +D+ F+L ++M    + P++ +Y+V+I+GLC+EGR+ +   +
Sbjct: 253 PNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFV 312

Query: 325 FDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCK 384
             EM +RG   ++VT+  LI G C+ G    A+  + +ML  G+ P ++ Y +L++ +CK
Sbjct: 313 LTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCK 372

Query: 385 VGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNVEGVVLDNVA 444
            G++++A + +D+M++ G+ P++ TYTTL+DG+ ++G +  A  + + MN  G     V 
Sbjct: 373 AGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVT 432

Query: 445 FTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNGDVKPGFKLLKEMQ 504
           + A+I+G C  G++ DA   L +MKE G+ PD  +Y+ V+ G+C++ DV    ++ +EM 
Sbjct: 433 YNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMV 492

Query: 505 RNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGRAE 564
             G  P  ITY+ L+ G C+Q + K A  L E ML +G+ PD+ TY  L+  +C  G  E
Sbjct: 493 EKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLE 552

Query: 565 DFLHLRN---EKGLVVDYAYYTSLVGEYDKSLKDRRKR 595
             L L N   EKG++ D   Y+ L+   +K  + R  +
Sbjct: 553 KALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAK 575
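
The Swiss-Prot and TrEMBL match lines carry UniProt-style header tags: OS (organism name), GN (gene name), PE (protein existence level) and SV (sequence version). A minimal sketch that splits one match line into those fields; `parse_match` is an illustrative helper, and the layout it expects is inferred from the lines in this record:

```python
import re

def parse_match(line):
    """Split 'Match: ENTRY (description OS=... GN=... PE=... SV=...)'."""
    m = re.fullmatch(r"Match:\s*(\S+)\s*\((.*)\)\s*", line)
    if m is None:
        raise ValueError(f"unrecognised match line: {line!r}")
    entry, desc = m.groups()
    # Splitting on a captured tag keeps the tag names in the result:
    # [description, 'OS', value, 'GN', value, ...]
    parts = re.split(r"\s+(OS|GN|PE|SV)=", desc)
    record = {"entry": entry, "description": parts[0]}
    record.update(zip(parts[1::2], parts[2::2]))
    return record

hit = parse_match(
    "Match: PP407_ARATH (Pentatricopeptide repeat-containing protein "
    "At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)"
)
print(hit["entry"], hit["OS"], hit["GN"])
```

Lines without these tags, such as the TAIR10 matches, come back with only the entry and description fields.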

BLAST of CmaCh14G008690 vs. Swiss-Prot
Match: PP432_ARATH (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 280.8 bits (717), Expect = 3.5e-74
Identity = 154/503 (30.62%), Positives = 261/503 (51.89%), Query Frame = 1

Query: 91  FFNWLSSVPTFR--HTLQSYCAMANFLCTHQMFEESQSIIRFLVSRKGKDSAASIFAAIL 150
           F  W+   P     H +Q  C   + L   +M++ ++ I++ L    GK S   +F A++
Sbjct: 56  FLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSF--VFGALM 115

Query: 151 EITDTRCSN-FVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNS 210
                  SN  V+D L+  Y   G I D+++ FRL+    F      C  +L  ++ S  
Sbjct: 116 TTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGE 175

Query: 211 PVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFRPTTVSFNT 270
            V++W+F  E+L     P V  FNILIN  C +GS   +  +  ++ K G+ PT V++NT
Sbjct: 176 DVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNT 235

Query: 271 LINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHGLCKEGRVDDAEQLFDEMRQRG 330
           +++  CK      +  L   M+   +  DV TY++LIH LC+  R+     L  +MR+R 
Sbjct: 236 VLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRM 295

Query: 331 LRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKAR 390
           +  N+VT+  LI+G    G++  A     +ML+ G+ P+ V +N L++G    G+  +A 
Sbjct: 296 IHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEAL 355

Query: 391 KLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNVEGVVLDNVAFTAIISGL 450
           K+   M+  G+ P +++Y  L+DG CK  + + A      M   GV +  + +T +I GL
Sbjct: 356 KMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGL 415

Query: 451 CRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNGDVKPGFKLLKEMQRNGHNPGV 510
           C++G + +A   L EM + G+ PD  TY+ +I+G+CK G  K   +++  + R G +P  
Sbjct: 416 CKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNG 475

Query: 511 ITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGR---AEDFLHL 570
           I Y+ L+   C+ G +K A  + EAM+  G T D  T+N+L+   CK+G+   AE+F+  
Sbjct: 476 IIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRC 535

Query: 571 RNEKGLVVDYAYYTSLVGEYDKS 588
               G++ +   +  L+  Y  S
Sbjct: 536 MTSDGILPNTVSFDCLINGYGNS 556

BLAST of CmaCh14G008690 vs. Swiss-Prot
Match: PP404_ARATH (Pentatricopeptide repeat-containing protein At5g38730 OS=Arabidopsis thaliana GN=At5g38730 PE=2 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 2.3e-70
Identity = 152/504 (30.16%), Positives = 264/504 (52.38%), Query Frame = 1

Query: 85  PVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQSIIRFLVSRKGKDSAASIF 144
           P   +SFF W  S+P+ +H+LQS   M   L  H+ F+ +  ++  L  R+   S   + 
Sbjct: 60  PSLSWSFFIWTDSLPSSKHSLQSSWKMILILTKHKHFKTAHQLLDKLAQRELLSSPLVLR 119

Query: 145 AAILEIT-DTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMM 204
           + +  ++ D    + VF  LMI Y+ +G I+D+I  F  +R    +   + C  LL+ ++
Sbjct: 120 SLVGGVSEDPEDVSHVFSWLMIYYAKAGMINDSIVVFEQIRSCGLKPHLQACTVLLNSLV 179

Query: 205 NSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFRPTTV 264
                 T+W  + +++  G    +  +N+L++   K G    A  +  E+ ++G  P   
Sbjct: 180 KQRLTDTVWKIFKKMVKLGVVANIHVYNVLVHACSKSGDPEKAEKLLSEMEEKGVFPDIF 239

Query: 265 SFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHGLCKEGRVDDAEQLFDEM 324
           ++NTLI+  CK     E+  ++  ME + + P++ TY+  IHG  +EGR+ +A +LF E+
Sbjct: 240 TYNTLISVYCKKSMHFEALSVQDRMERSGVAPNIVTYNSFIHGFSREGRMREATRLFREI 299

Query: 325 RQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDV 384
           +   + AN VT+T LIDG CR   ID A+   + M + G  P +V YN++L  LC+ G +
Sbjct: 300 KD-DVTANHVTYTTLIDGYCRMNDIDEALRLREVMESRGFSPGVVTYNSILRKLCEDGRI 359

Query: 385 SKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNVEGVVLDNVAFTAI 444
            +A +L+ EM    ++PD IT  TLI+ YCK  D+ SA++++K M   G+ LD  ++ A+
Sbjct: 360 REANRLLTEMSGKKIEPDNITCNTLINAYCKIEDMVSAVKVKKKMIESGLKLDMYSYKAL 419

Query: 445 ISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNGDVKPGFKLLKEMQRNGH 504
           I G C+   + +A+  L  M E G  P  ATY+ ++DG+          KLL+E ++ G 
Sbjct: 420 IHGFCKVLELENAKEELFSMIEKGFSPGYATYSWLVDGFYNQNKQDEITKLLEEFEKRGL 479

Query: 505 NPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGR---AED 564
              V  Y  L+  +CK  Q+  A +L E+M   G+  D + +  +   + ++G+   A  
Sbjct: 480 CADVALYRGLIRRICKLEQVDYAKVLFESMEKKGLVGDSVIFTTMAYAYWRTGKVTEASA 539

Query: 565 FLHLRNEKGLVVDYAYYTSLVGEY 585
              +   + L+V+   Y S+   Y
Sbjct: 540 LFDVMYNRRLMVNLKLYKSISASY 562

BLAST of CmaCh14G008690 vs. Swiss-Prot
Match: PPR12_ARATH (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 2.3e-70
Identity = 168/583 (28.82%), Positives = 288/583 (49.40%), Query Frame = 1

Query: 11  NFSSSLPSKPSFRYSTWHSPPPPAAAADPVLA-AVSTAINNVETKPLASSLRRLLPSFKP 70
           +FS+   ++P   YS     P  A+  D      ++  I     +PL  SL+     FK 
Sbjct: 33  SFSTLTDTRPFPDYS-----PKKASVRDTEFVHQITNVIKLRRAEPLRRSLKPYECKFKT 92

Query: 71  HHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQSIIR 130
            H I ++         V    FF+W  S       L+S C + +     +  + +QS+I 
Sbjct: 93  DHLIWVLMKIKCDYRLV--LDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLIS 152

Query: 131 FLVSRKG---KDSAASIFAAIL-EITDTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRK 190
               R      DS    F  ++    D      VFD       D G + +A + F  +  
Sbjct: 153 SFWERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLN 212

Query: 191 RNFQIPFRGCEYLLDKMMNS-NSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIR 250
               +    C   L ++        T    + E  + G    V  +NI+I+  C+ G I+
Sbjct: 213 YGLVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIK 272

Query: 251 DARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLI 310
           +A  +   +  +G+ P  +S++T++NG C+   LD+ ++L + M+   + P+ Y Y  +I
Sbjct: 273 EAHHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSII 332

Query: 311 HGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVK 370
             LC+  ++ +AE+ F EM ++G+  + V +T LIDG C+ G I +A   + +M +  + 
Sbjct: 333 GLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDIT 392

Query: 371 PDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEI 430
           PD++ Y  +++G C++GD+ +A KL  EM   G++PD +T+T LI+GYCK G ++ A  +
Sbjct: 393 PDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRV 452

Query: 431 RKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCK 490
              M   G   + V +T +I GLC++G +  A   L EM + G++P+  TY  +++G CK
Sbjct: 453 HNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCK 512

Query: 491 NGDVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDIT 550
           +G+++   KL+ E +  G N   +TY  LM+  CK G+M  A  +L+ ML  G+ P  +T
Sbjct: 513 SGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVT 572

Query: 551 YNILLEGHCKSGRAEDFLHLRN---EKGLVVDYAYYTSLVGEY 585
           +N+L+ G C  G  ED   L N    KG+  +   + SLV +Y
Sbjct: 573 FNVLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQY 606

BLAST of CmaCh14G008690 vs. TrEMBL
Match: M5XHR3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa024153mg PE=4 SV=1)

HSP 1 Score: 807.4 bits (2084), Expect = 1.2e-230
Identity = 391/572 (68.36%), Positives = 469/572 (81.99%), Query Frame = 1

Query: 25  STWHSPPPPAAAADPVLAAVSTAINNV--ETKPLASSLRRLLPSFKPHHFIDLINHNPFS 84
           + W++ PP     DP L+A+S AI      ++PL SSLR+LLPS      I+LIN NP S
Sbjct: 34  TAWYNQPPTPHNEDPKLSAISDAIKTTTQNSQPLDSSLRKLLPSLTARDVINLINLNPHS 93

Query: 85  LSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQSIIRFLVSRKGKDSAAS 144
           LSP+SL SFFNWLSS PTFRH +QSYC MA+FLC HQM+ ++QS++R +VSRKGK++A+S
Sbjct: 94  LSPLSLLSFFNWLSSHPTFRHNIQSYCTMAHFLCAHQMYPQAQSLLRIVVSRKGKETASS 153

Query: 145 IFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLDKM 204
           +FA+ILE   T  SN+VFDALM AY D GF+SDA QCFRL+RK NF+IPF  C  LLDKM
Sbjct: 154 VFASILETRGTHQSNYVFDALMNAYVDCGFVSDACQCFRLLRKHNFRIPFHACGCLLDKM 213

Query: 205 MNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFRPTT 264
           +  NSPV  W FYLEILDSGFPPKV  FN+L++K CK+G IR+A+L+FDEIGKRG  PT 
Sbjct: 214 LKLNSPVVAWGFYLEILDSGFPPKVYNFNVLMHKLCKEGEIREAQLVFDEIGKRGLLPTV 273

Query: 265 VSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHGLCKEGRVDDAEQLFDE 324
           VSFNTLING CKSRNL+E FRLK+ MEE+R  PDV+TYSVLI+GLCKE R+DDA  LFDE
Sbjct: 274 VSFNTLINGYCKSRNLEECFRLKRDMEESRTRPDVFTYSVLINGLCKELRLDDANLLFDE 333

Query: 325 MRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGD 384
           M +RGL  N+VT+T LIDGQC++GRID AM  YQ+ML +G+KPD++ YNTL+NGLCKVGD
Sbjct: 334 MCERGLVPNNVTYTTLIDGQCKNGRIDLAMEVYQKMLGIGIKPDVITYNTLINGLCKVGD 393

Query: 385 VSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNVEGVVLDNVAFTA 444
           + +ARKLV+EM + G+KPD ITYTTLIDG CKEG+L+SA+EIRKGM  +G+ LDNVAFTA
Sbjct: 394 LKEARKLVEEMNIAGLKPDTITYTTLIDGCCKEGNLQSALEIRKGMIKQGIELDNVAFTA 453

Query: 445 IISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNGDVKPGFKLLKEMQRNG 504
           +ISGLCR+G+ +DAE TLREM  +GMKPDDATYTM+IDG+CK GDVK GFKLLKEMQ +G
Sbjct: 454 LISGLCREGKTLDAERTLREMLNSGMKPDDATYTMIIDGFCKKGDVKMGFKLLKEMQGDG 513

Query: 505 HNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGRAEDFL 564
           + P V+TYN LMNGLCK GQMKNANMLL+AM+NLGV PDDITYNILLEGHCK G  EDF 
Sbjct: 514 YVPSVVTYNALMNGLCKLGQMKNANMLLDAMINLGVAPDDITYNILLEGHCKHGNPEDFD 573

Query: 565 HLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 595
            LR+ KGLV+DYA YTSLV E++KS KDRRKR
Sbjct: 574 KLRSGKGLVLDYASYTSLVSEFNKSSKDRRKR 605

BLAST of CmaCh14G008690 vs. TrEMBL
Match: A0A0A0L668_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G188330 PE=4 SV=1)

HSP 1 Score: 798.1 bits (2060), Expect = 7.2e-228
Identity = 402/497 (80.89%), Positives = 434/497 (87.32%), Query Frame = 1

Query: 1   MAANSTFKLSNFSSSLPSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNVETKPLASSL 60
           M  NS+FK    S SL SKPSF YSTWHSPPP AA ADPVLAAVSTAINN +TKPLASSL
Sbjct: 15  MPNNSSFK---HSISL-SKPSFLYSTWHSPPPLAALADPVLAAVSTAINNAQTKPLASSL 74

Query: 61  RRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQM 120
           RRLLPSFKPHHFIDLIN NPFSLSP SLFSFFNWLSS+PTFRHT QSYCAMANFL  HQM
Sbjct: 75  RRLLPSFKPHHFIDLINQNPFSLSPSSLFSFFNWLSSIPTFRHTSQSYCAMANFLSAHQM 134

Query: 121 FEESQSIIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCF 180
           F+E QSIIRFLVSRKGKDSAAS+FAAIL+   TRCSNFVFDALMIAY DSGF+SDAIQCF
Sbjct: 135 FQECQSIIRFLVSRKGKDSAASVFAAILDTAGTRCSNFVFDALMIAYWDSGFVSDAIQCF 194

Query: 181 RLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQ 240
           RLVR  NFQIPF GC YLLDKM+NSNSPVTIWTFY EIL+ GFPPKV+Y+NILINKFCK+
Sbjct: 195 RLVRNSNFQIPFHGCGYLLDKMINSNSPVTIWTFYSEILEYGFPPKVQYYNILINKFCKE 254

Query: 241 GSIRDARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTY 300
           GSIRDA+LIF+EI KRG RPTTVSFNTLINGLCKSRNLDE FRLKK MEENRIYPDV+TY
Sbjct: 255 GSIRDAKLIFNEIRKRGLRPTTVSFNTLINGLCKSRNLDEGFRLKKTMEENRIYPDVFTY 314

Query: 301 SVLIHGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLA 360
           SVLIHGLCKEGR+D AEQLFDEM+QRGLR N +TFTALIDGQCRS RIDSAMNTY QML 
Sbjct: 315 SVLIHGLCKEGRLDVAEQLFDEMQQRGLRPNGITFTALIDGQCRSRRIDSAMNTYHQMLT 374

Query: 361 MGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420
           MGVKPDLVMYNTLLNGLCKVGDV+KARKLVDEM+MVGMKPDKITYTTLIDGYCKEGDLES
Sbjct: 375 MGVKPDLVMYNTLLNGLCKVGDVNKARKLVDEMRMVGMKPDKITYTTLIDGYCKEGDLES 434

Query: 421 AMEIRKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVID 480
           AMEIRKGMN EGVVLDNVAFTA+ISG  ++  ++     L  M   G+ PDD TY ++++
Sbjct: 435 AMEIRKGMNEEGVVLDNVAFTALISGQMKNANML-----LEAMLNLGVTPDDITYNILLE 494

Query: 481 GYCKNGDVKPGFKLLKE 498
           G+CKNG  +   KL  E
Sbjct: 495 GHCKNGKAEDLLKLRNE 502

BLAST of CmaCh14G008690 vs. TrEMBL
Match: D7TSI7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0006g00170 PE=4 SV=1)

HSP 1 Score: 766.5 bits (1978), Expect = 2.3e-218
Identity = 382/594 (64.31%), Positives = 456/594 (76.77%), Query Frame = 1

Query: 1   MAANSTFKLSNFSSSLPSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNVETKPLASSL 60
           MA        NF SS P      +STW +PP  +   DP+L  +S AI    TKPL SSL
Sbjct: 1   MATPKILTTRNFLSS-PRGGFLCFSTWMTPPT-SHCHDPILTTISEAIKVSPTKPLHSSL 60

Query: 61  RRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQM 120
           +R+LPS  P+H IDLIN NP SLSP SL SFF WLS+   FR ++ SYC M +FLCTH+M
Sbjct: 61  KRILPSLTPNHLIDLINLNPHSLSPPSLLSFFKWLSTQHHFRLSIHSYCTMTHFLCTHKM 120

Query: 121 FEESQSIIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCF 180
             E+QS+++F+VSRKGK+SA+S+F ++LE   T  SN VF  LM AY+DSG+ SDAIQCF
Sbjct: 121 LSEAQSLLQFVVSRKGKNSASSVFTSVLEARGTHQSNLVFSVLMNAYTDSGYFSDAIQCF 180

Query: 181 RLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQ 240
           RLVRK N QIPF  C YL D++M  N     W FY EILD G+PP V  FN+L+++ CK+
Sbjct: 181 RLVRKHNLQIPFHSCGYLFDRLMKLNLTSPAWAFYEEILDCGYPPDVCKFNVLMHRLCKE 240

Query: 241 GSIRDARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTY 300
             I +A+L+F EIGKRG RPT VSFNTLING CKS NLD+ FRLK+ M ENR++PDV+TY
Sbjct: 241 HKINEAQLLFGEIGKRGLRPTVVSFNTLINGYCKSGNLDQGFRLKRFMMENRVFPDVFTY 300

Query: 301 SVLIHGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLA 360
           SVLI+GLCKEG++DDA +LF EM  RGL  NDVTFT LI+G C +GR D  M  YQQML 
Sbjct: 301 SVLINGLCKEGQLDDANKLFLEMCDRGLVPNDVTFTTLINGHCVTGRADLGMEIYQQMLR 360

Query: 361 MGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420
            GVKPD++ YNTL+NGLCKVGD+ +A+KLV EM   G+KPDK TYT LIDG CKEGDLES
Sbjct: 361 KGVKPDVITYNTLINGLCKVGDLREAKKLVIEMTQRGLKPDKFTYTMLIDGCCKEGDLES 420

Query: 421 AMEIRKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVID 480
           A+EIRK M  EG+ LDNVAFTA+ISG CR+G+V++AE TLREM EAG+KPDDATYTMVI 
Sbjct: 421 ALEIRKEMVKEGIELDNVAFTALISGFCREGQVIEAERTLREMLEAGIKPDDATYTMVIH 480

Query: 481 GYCKNGDVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540
           G+CK GDVK GFKLLKEMQ +GH PGV+TYNVL+NGLCKQGQMKNANMLL+AMLNLGV P
Sbjct: 481 GFCKKGDVKTGFKLLKEMQCDGHVPGVVTYNVLLNGLCKQGQMKNANMLLDAMLNLGVVP 540

Query: 541 DDITYNILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 595
           DDITYNILLEGHCK G  EDF  L++EKGLV DY  YTSL+G+  K+ K+R+KR
Sbjct: 541 DDITYNILLEGHCKHGNREDFDKLQSEKGLVQDYGSYTSLIGDLRKTCKERQKR 592

BLAST of CmaCh14G008690 vs. TrEMBL
Match: U5FHA0_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0017s02740g PE=4 SV=1)

HSP 1 Score: 761.1 bits (1964), Expect = 9.8e-217
Identity = 374/599 (62.44%), Positives = 461/599 (76.96%), Query Frame = 1

Query: 5   STFKLSNFSSSLPSKPSFRYSTWHSPPP---------PAAAADPVLAAVSTAINNVETKP 64
           +TFKL     S P K    +STW+SPPP         P+    P+L  +S AI N+ETKP
Sbjct: 2   ATFKLPKTHLS-PPKTLCAFSTWYSPPPQPPPRQSPPPSRHESPILTTISEAIKNIETKP 61

Query: 65  LASSLRRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFL 124
           L  SL+ +LPSFK HHFI L+N NP+ L P SL SFF++LSS PTF HT+QSYC+M +FL
Sbjct: 62  LHISLKNILPSFKAHHFISLVNQNPYFLPPKSLLSFFDFLSSYPTFSHTVQSYCSMVHFL 121

Query: 125 CTHQMFEESQSIIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISD 184
             H+M ++++S++ F+VSRKGK SA+S+FA+ILE   T  S+FVFDALM  Y++ G++SD
Sbjct: 122 IAHRMNQQAESLLHFVVSRKGKGSASSVFASILETKGTLSSSFVFDALMSVYTEFGYVSD 181

Query: 185 AIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILIN 244
           AIQCFRL +K N +IPF GC+ LL++M+  +SP+    FYLEILDSG+PP V  FN+L+N
Sbjct: 182 AIQCFRLTKKHNLKIPFNGCKCLLERMIKMSSPMVALEFYLEILDSGYPPNVYTFNVLMN 241

Query: 245 KFCKQGSIRDARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYP 304
           + CK+G ++DA+LIFDEI K G +PT VSFNTLING CKS NL+E FRLK  MEE R++P
Sbjct: 242 RLCKEGKVKDAQLIFDEIRKTGLQPTAVSFNTLINGYCKSGNLEEGFRLKMVMEEFRVFP 301

Query: 305 DVYTYSVLIHGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTY 364
           DV+TYS LI GLCKE ++DDA  LF EM  RGL  NDVTFT LI+GQC++GR+D A+  Y
Sbjct: 302 DVFTYSALIDGLCKECQLDDANHLFKEMCDRGLVPNDVTFTTLINGQCKNGRVDLALEIY 361

Query: 365 QQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKE 424
           QQM   G+K DLV+YNTL++GLCK G   +ARK V EM   G+ PDK TYTTL+DG CKE
Sbjct: 362 QQMFTKGLKADLVLYNTLIDGLCKGGYFREARKFVGEMTKRGLIPDKFTYTTLLDGSCKE 421

Query: 425 GDLESAMEIRKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATY 484
           GDLE A+E+RK M  EG+ LDNVAFTAIISGLCRDG+++DAE TLREM  AG+KPDD TY
Sbjct: 422 GDLELALEMRKEMVKEGIQLDNVAFTAIISGLCRDGKIVDAERTLREMLRAGLKPDDGTY 481

Query: 485 TMVIDGYCKNGDVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLN 544
           TMV+DG+CK GDVK GFKLLKEMQ +GH PGVITYNVLMNGLCKQGQ+KNA+MLL AMLN
Sbjct: 482 TMVMDGFCKKGDVKMGFKLLKEMQSDGHIPGVITYNVLMNGLCKQGQVKNADMLLNAMLN 541

Query: 545 LGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 595
           LGV PDDITYNILL+GHCK G+  DF +++ E GLV DYA Y SL+ E  K+ KDR+KR
Sbjct: 542 LGVVPDDITYNILLQGHCKHGKLGDFQNVKTEMGLVSDYASYRSLLHELTKASKDRQKR 599

BLAST of CmaCh14G008690 vs. TrEMBL
Match: A0A061EF62_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_017452 PE=4 SV=1)

HSP 1 Score: 750.7 bits (1937), Expect = 1.3e-213
Identity = 371/572 (64.86%), Positives = 450/572 (78.67%), Query Frame = 1

Query: 12  FSSSLPSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNVETKPLASSLRRLLPSFKPHH 71
           F SS  S P      W+SPP P    DP+L  +S AI + +TKPL  SL++LLPS  P H
Sbjct: 25  FFSSSSSTP------WYSPPLPHQE-DPILTTLSQAIRSSQTKPLHISLKKLLPSLTPSH 84

Query: 72  FIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQSIIRFL 131
            I+LI  NP SLSP+SL SFFN+LSS P FRHTL+SY  MA+FL  H+MF ++QS++ FL
Sbjct: 85  VINLITLNPHSLSPLSLLSFFNFLSSHPPFRHTLRSYSTMAHFLIAHKMFHQAQSLLHFL 144

Query: 132 VSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIP 191
           VSRKGK SA+ +F +I+E   T    FVFD+LMIAY D GF+ DAIQCFRLVRK  F++P
Sbjct: 145 VSRKGKGSASLVFTSIIETKGTHQCGFVFDSLMIAYKDLGFVPDAIQCFRLVRKHKFKLP 204

Query: 192 FRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFD 251
           F+GC+YLLD+MM  +SP+    FYLEILD GF P V  FNIL++K C+   I+DA+++F+
Sbjct: 205 FQGCKYLLDRMMKISSPMVSLGFYLEILDYGFSPSVYNFNILMHKLCRVSQIKDAQMVFN 264

Query: 252 EIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHGLCKEG 311
           EIGKRG R T VSFNTLING CKS NL E FRLK+AME++ I PDV+TYSVLI+GLCKE 
Sbjct: 265 EIGKRGLRATVVSFNTLINGYCKSGNLGEGFRLKRAMEDSGIRPDVFTYSVLINGLCKES 324

Query: 312 RVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYN 371
           R+DDA  LF+EM  RGL  NDVTFT LIDGQC++GRID AM TYQ++L+ G+KPDLVM+N
Sbjct: 325 RLDDANGLFEEMCNRGLVPNDVTFTTLIDGQCKNGRIDLAMTTYQRILSKGLKPDLVMFN 384

Query: 372 TLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNVE 431
           TL+NGLCK GD+ +A+ L+ EM + G+KPDK TYT LIDG+CKEG++E A+EIR  M   
Sbjct: 385 TLINGLCKAGDLKEAKNLIAEMSLRGLKPDKFTYTILIDGFCKEGNMELAIEIRDEMVKH 444

Query: 432 GVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNGDVKPG 491
           G+ LDNVAFTA+ISGLCR+GR++DAE TLREM  AGMKPDDATYTMVIDG+CKNG+VK G
Sbjct: 445 GIELDNVAFTALISGLCREGRLIDAERTLREMLSAGMKPDDATYTMVIDGFCKNGNVKMG 504

Query: 492 FKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEG 551
           FKLLKEMQ +GH PGVITYNVLMNGLCK GQMKNANMLL+AM+ LGV PDDITYNILL+G
Sbjct: 505 FKLLKEMQSDGHVPGVITYNVLMNGLCKLGQMKNANMLLDAMIGLGVVPDDITYNILLDG 564

Query: 552 HCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGE 584
           HCK G  +DF  L++E GLV DYA Y SL+ +
Sbjct: 565 HCKKGNPKDFNRLKSEMGLVADYASYKSLISQ 589

BLAST of CmaCh14G008690 vs. TAIR10
Match: AT1G09680.1 (AT1G09680.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 666.0 bits (1717), Expect = 2.2e-191
Identity = 332/588 (56.46%), Positives = 426/588 (72.45%), Query Frame = 1

Query: 18  SKPSFRYSTWHSPPPPAAAA---DPVLAAVSTAI-NNVETKPL-------ASSLRRLLPS 77
           S+ SF  STW+S    +AA    DPVL  +S AI ++ +  PL         S+R++LPS
Sbjct: 20  SRASFLLSTWYSQESVSAADNDDDPVLVKLSVAIRDSYKDPPLEFSSFTDCPSIRKVLPS 79

Query: 78  FKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQS 137
              HH +DLINHNP SL   S+F+FF ++SS P FR T+++Y  +A FL  H+MF E+QS
Sbjct: 80  LSVHHVVDLINHNPLSLPQRSIFAFFKFISSQPGFRFTVETYFVLARFLAVHEMFTEAQS 139

Query: 138 IIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKR 197
           +I  +VSRKGK+SA+S+F +++E+  T    F+ DALMI Y+D GFI DAIQCFRL RK 
Sbjct: 140 LIELVVSRKGKNSASSVFISLVEMRVTPMCGFLVDALMITYTDLGFIPDAIQCFRLSRKH 199

Query: 198 NFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDA 257
            F +P RGC  LLD+MM  N   TIW FY+EILD+GFP  V  FNIL+NKFCK+G+I DA
Sbjct: 200 RFDVPIRGCGNLLDRMMKLNPTGTIWGFYMEILDAGFPLNVYVFNILMNKFCKEGNISDA 259

Query: 258 RLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHG 317
           + +FDEI KR  +PT VSFNTLING CK  NLDE FRLK  ME++R  PDV+TYS LI+ 
Sbjct: 260 QKVFDEITKRSLQPTVVSFNTLINGYCKVGNLDEGFRLKHQMEKSRTRPDVFTYSALINA 319

Query: 318 LCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPD 377
           LCKE ++D A  LFDEM +RGL  NDV FT LI G  R+G ID    +YQ+ML+ G++PD
Sbjct: 320 LCKENKMDGAHGLFDEMCKRGLIPNDVIFTTLIHGHSRNGEIDLMKESYQKMLSKGLQPD 379

Query: 378 LVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRK 437
           +V+YNTL+NG CK GD+  AR +VD M   G++PDKITYTTLIDG+C+ GD+E+A+EIRK
Sbjct: 380 IVLYNTLVNGFCKNGDLVAARNIVDGMIRRGLRPDKITYTTLIDGFCRGGDVETALEIRK 439

Query: 438 GMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNG 497
            M+  G+ LD V F+A++ G+C++GRV+DAE  LREM  AG+KPDD TYTM++D +CK G
Sbjct: 440 EMDQNGIELDRVGFSALVCGMCKEGRVIDAERALREMLRAGIKPDDVTYTMMMDAFCKKG 499

Query: 498 DVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYN 557
           D + GFKLLKEMQ +GH P V+TYNVL+NGLCK GQMKNA+MLL+AMLN+GV PDDITYN
Sbjct: 500 DAQTGFKLLKEMQSDGHVPSVVTYNVLLNGLCKLGQMKNADMLLDAMLNIGVVPDDITYN 559

Query: 558 ILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 595
            LLEGH +   +      + E G+V D A Y S+V E D++ KD R R
Sbjct: 560 TLLEGHHRHANSSKRYIQKPEIGIVADLASYKSIVNELDRASKDHRNR 607
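
The percentages in an HSP summary line follow directly from the match counts: 332 identical columns out of 588 aligned columns gives 56.46%. A minimal sketch of that arithmetic is shown below, assuming the summary line keeps the layout seen in this report. The function name `check_hsp_line` and the loose label pattern are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Loose pattern for a BLAST HSP summary line: two "label = matches/length (pct%)"
# pairs. The second label is matched by prefix only, so minor spelling variants pass.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def check_hsp_line(line: str) -> dict:
    """Recompute percent identity and percent positives from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    identity = 100 * int(ident) / int(ident_len)
    positives = 100 * int(pos) / int(pos_len)
    return {
        "identity_pct": round(identity, 2),
        "positives_pct": round(positives, 2),
        # Reported values are rounded to two decimals, so allow a small tolerance.
        "consistent": abs(identity - float(ident_pct)) < 0.01
        and abs(positives - float(pos_pct)) < 0.01,
    }


result = check_hsp_line(
    "Identity = 332/588 (56.46%), Positives = 426/588 (72.45%), Query Frame = 1"
)
print(result)
```

The same check can be run on any other HSP summary line in this report to confirm that the reported percentages match the counts.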

BLAST of CmaCh14G008690 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 308.9 bits (790), Expect = 6.7e-84
Identity = 179/578 (30.97%), Positives = 307/578 (53.11%), Query Frame = 1

Query: 25  STWHSPPPPAAAADPVLAAVSTAINNVETKPLASSLRRLLPSFKPHHFIDLI--NHNPFS 84
           ST+ S P  +  AD  L  +         K     L  L  +F P    +L+  + N  +
Sbjct: 13  STFASSPSDSLLADKALTFL---------KRHPYQLHHLSANFTPEAASNLLLKSQNDQA 72

Query: 85  LSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQSIIRFLVSRKGKDSAAS 144
           L    +  F NW +    F  TL+  C   + L   ++++ +Q +   + ++   D  AS
Sbjct: 73  L----ILKFLNWANPHQFF--TLRCKCITLHILTKFKLYKTAQILAEDVAAKTLDDEYAS 132

Query: 145 IFAAILEITDTRC--SNFVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLD 204
           +    L+ T   C  ++ VFD ++ +YS    I  A+    L +   F         +LD
Sbjct: 133 LVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLD 192

Query: 205 KMMNSNSPVTIW-TFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFR 264
             + S   ++     + E+L+S   P V  +NILI  FC  G+I  A  +FD++  +G  
Sbjct: 193 ATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCL 252

Query: 265 PTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHGLCKEGRVDDAEQL 324
           P  V++NTLI+G CK R +D+ F+L ++M    + P++ +Y+V+I+GLC+EGR+ +   +
Sbjct: 253 PNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFV 312

Query: 325 FDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCK 384
             EM +RG   ++VT+  LI G C+ G    A+  + +ML  G+ P ++ Y +L++ +CK
Sbjct: 313 LTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCK 372

Query: 385 VGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNVEGVVLDNVA 444
            G++++A + +D+M++ G+ P++ TYTTL+DG+ ++G +  A  + + MN  G     V 
Sbjct: 373 AGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVT 432

Query: 445 FTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNGDVKPGFKLLKEMQ 504
           + A+I+G C  G++ DA   L +MKE G+ PD  +Y+ V+ G+C++ DV    ++ +EM 
Sbjct: 433 YNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMV 492

Query: 505 RNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGRAE 564
             G  P  ITY+ L+ G C+Q + K A  L E ML +G+ PD+ TY  L+  +C  G  E
Sbjct: 493 EKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLE 552

Query: 565 DFLHLRN---EKGLVVDYAYYTSLVGEYDKSLKDRRKR 595
             L L N   EKG++ D   Y+ L+   +K  + R  +
Sbjct: 553 KALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAK 575

BLAST of CmaCh14G008690 vs. TAIR10
Match: AT5G55840.1 (AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 280.8 bits (717), Expect = 2.0e-75
Identity = 154/503 (30.62%), Positives = 261/503 (51.89%), Query Frame = 1

Query: 91  FFNWLSSVPTFR--HTLQSYCAMANFLCTHQMFEESQSIIRFLVSRKGKDSAASIFAAIL 150
           F  W+   P     H +Q  C   + L   +M++ ++ I++ L    GK S   +F A++
Sbjct: 96  FLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSF--VFGALM 155

Query: 151 EITDTRCSN-FVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNS 210
                  SN  V+D L+  Y   G I D+++ FRL+    F      C  +L  ++ S  
Sbjct: 156 TTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGE 215

Query: 211 PVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFRPTTVSFNT 270
            V++W+F  E+L     P V  FNILIN  C +GS   +  +  ++ K G+ PT V++NT
Sbjct: 216 DVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNT 275

Query: 271 LINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHGLCKEGRVDDAEQLFDEMRQRG 330
           +++  CK      +  L   M+   +  DV TY++LIH LC+  R+     L  +MR+R 
Sbjct: 276 VLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRM 335

Query: 331 LRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKAR 390
           +  N+VT+  LI+G    G++  A     +ML+ G+ P+ V +N L++G    G+  +A 
Sbjct: 336 IHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEAL 395

Query: 391 KLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNVEGVVLDNVAFTAIISGL 450
           K+   M+  G+ P +++Y  L+DG CK  + + A      M   GV +  + +T +I GL
Sbjct: 396 KMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGL 455

Query: 451 CRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNGDVKPGFKLLKEMQRNGHNPGV 510
           C++G + +A   L EM + G+ PD  TY+ +I+G+CK G  K   +++  + R G +P  
Sbjct: 456 CKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNG 515

Query: 511 ITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGR---AEDFLHL 570
           I Y+ L+   C+ G +K A  + EAM+  G T D  T+N+L+   CK+G+   AE+F+  
Sbjct: 516 IIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRC 575

Query: 571 RNEKGLVVDYAYYTSLVGEYDKS 588
               G++ +   +  L+  Y  S
Sbjct: 576 MTSDGILPNTVSFDCLINGYGNS 596

BLAST of CmaCh14G008690 vs. TAIR10
Match: AT5G38730.1 (AT5G38730.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 268.1 bits (684), Expect = 1.3e-71
Identity = 152/504 (30.16%), Positives = 264/504 (52.38%), Query Frame = 1

Query: 85  PVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQSIIRFLVSRKGKDSAASIF 144
           P   +SFF W  S+P+ +H+LQS   M   L  H+ F+ +  ++  L  R+   S   + 
Sbjct: 60  PSLSWSFFIWTDSLPSSKHSLQSSWKMILILTKHKHFKTAHQLLDKLAQRELLSSPLVLR 119

Query: 145 AAILEIT-DTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMM 204
           + +  ++ D    + VF  LMI Y+ +G I+D+I  F  +R    +   + C  LL+ ++
Sbjct: 120 SLVGGVSEDPEDVSHVFSWLMIYYAKAGMINDSIVVFEQIRSCGLKPHLQACTVLLNSLV 179

Query: 205 NSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFRPTTV 264
                 T+W  + +++  G    +  +N+L++   K G    A  +  E+ ++G  P   
Sbjct: 180 KQRLTDTVWKIFKKMVKLGVVANIHVYNVLVHACSKSGDPEKAEKLLSEMEEKGVFPDIF 239

Query: 265 SFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHGLCKEGRVDDAEQLFDEM 324
           ++NTLI+  CK     E+  ++  ME + + P++ TY+  IHG  +EGR+ +A +LF E+
Sbjct: 240 TYNTLISVYCKKSMHFEALSVQDRMERSGVAPNIVTYNSFIHGFSREGRMREATRLFREI 299

Query: 325 RQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDV 384
           +   + AN VT+T LIDG CR   ID A+   + M + G  P +V YN++L  LC+ G +
Sbjct: 300 KD-DVTANHVTYTTLIDGYCRMNDIDEALRLREVMESRGFSPGVVTYNSILRKLCEDGRI 359

Query: 385 SKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNVEGVVLDNVAFTAI 444
            +A +L+ EM    ++PD IT  TLI+ YCK  D+ SA++++K M   G+ LD  ++ A+
Sbjct: 360 REANRLLTEMSGKKIEPDNITCNTLINAYCKIEDMVSAVKVKKKMIESGLKLDMYSYKAL 419

Query: 445 ISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNGDVKPGFKLLKEMQRNGH 504
           I G C+   + +A+  L  M E G  P  ATY+ ++DG+          KLL+E ++ G 
Sbjct: 420 IHGFCKVLELENAKEELFSMIEKGFSPGYATYSWLVDGFYNQNKQDEITKLLEEFEKRGL 479

Query: 505 NPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGR---AED 564
              V  Y  L+  +CK  Q+  A +L E+M   G+  D + +  +   + ++G+   A  
Sbjct: 480 CADVALYRGLIRRICKLEQVDYAKVLFESMEKKGLVGDSVIFTTMAYAYWRTGKVTEASA 539

Query: 565 FLHLRNEKGLVVDYAYYTSLVGEY 585
              +   + L+V+   Y S+   Y
Sbjct: 540 LFDVMYNRRLMVNLKLYKSISASY 562

BLAST of CmaCh14G008690 vs. TAIR10
Match: AT1G05670.1 (AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 268.1 bits (684), Expect = 1.3e-71
Identity = 168/583 (28.82%), Positives = 288/583 (49.40%), Query Frame = 1

Query: 11  NFSSSLPSKPSFRYSTWHSPPPPAAAADPVLA-AVSTAINNVETKPLASSLRRLLPSFKP 70
           +FS+   ++P   YS     P  A+  D      ++  I     +PL  SL+     FK 
Sbjct: 33  SFSTLTDTRPFPDYS-----PKKASVRDTEFVHQITNVIKLRRAEPLRRSLKPYECKFKT 92

Query: 71  HHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQSIIR 130
            H I ++         V    FF+W  S       L+S C + +     +  + +QS+I 
Sbjct: 93  DHLIWVLMKIKCDYRLV--LDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLIS 152

Query: 131 FLVSRKG---KDSAASIFAAIL-EITDTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRK 190
               R      DS    F  ++    D      VFD       D G + +A + F  +  
Sbjct: 153 SFWERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLN 212

Query: 191 RNFQIPFRGCEYLLDKMMNS-NSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIR 250
               +    C   L ++        T    + E  + G    V  +NI+I+  C+ G I+
Sbjct: 213 YGLVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIK 272

Query: 251 DARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLI 310
           +A  +   +  +G+ P  +S++T++NG C+   LD+ ++L + M+   + P+ Y Y  +I
Sbjct: 273 EAHHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSII 332

Query: 311 HGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVK 370
             LC+  ++ +AE+ F EM ++G+  + V +T LIDG C+ G I +A   + +M +  + 
Sbjct: 333 GLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDIT 392

Query: 371 PDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEI 430
           PD++ Y  +++G C++GD+ +A KL  EM   G++PD +T+T LI+GYCK G ++ A  +
Sbjct: 393 PDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRV 452

Query: 431 RKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCK 490
              M   G   + V +T +I GLC++G +  A   L EM + G++P+  TY  +++G CK
Sbjct: 453 HNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCK 512

Query: 491 NGDVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDIT 550
           +G+++   KL+ E +  G N   +TY  LM+  CK G+M  A  +L+ ML  G+ P  +T
Sbjct: 513 SGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVT 572

Query: 551 YNILLEGHCKSGRAEDFLHLRN---EKGLVVDYAYYTSLVGEY 585
           +N+L+ G C  G  ED   L N    KG+  +   + SLV +Y
Sbjct: 573 FNVLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQY 606

BLAST of CmaCh14G008690 vs. NCBI nr
Match: gi|659114573|ref|XP_008457121.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Cucumis melo])

HSP 1 Score: 1048.1 bits (2709), Expect = 5.7e-303
Identity = 519/594 (87.37%), Positives = 549/594 (92.42%), Query Frame = 1

Query: 1   MAANSTFKLSNFSSSLPSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNVETKPLASSL 60
           MA NS+FKLS    SL SKPSFRYSTWHSPPPPAA ADP+LAAVSTAINN +TKPLASSL
Sbjct: 1   MANNSSFKLS---ISL-SKPSFRYSTWHSPPPPAAVADPLLAAVSTAINNAQTKPLASSL 60

Query: 61  RRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQM 120
           RRLLPSFKPHHFIDLINHNPFSLSP+SLFSFFNWLSS+PTFRHT QSYCAMANFL  HQM
Sbjct: 61  RRLLPSFKPHHFIDLINHNPFSLSPLSLFSFFNWLSSMPTFRHTSQSYCAMANFLSAHQM 120

Query: 121 FEESQSIIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCF 180
           FEE QSIIRFLVSRKGKDSAAS+FAAIL+I  TRCSNFVFDALMIAY DSGF+SDAIQCF
Sbjct: 121 FEECQSIIRFLVSRKGKDSAASVFAAILDIAGTRCSNFVFDALMIAYWDSGFVSDAIQCF 180

Query: 181 RLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQ 240
           RLVRK NFQIPF GC YLLDKMMNSNSPVTIWTFY EILDSGFPP VKYFNILINKFCK+
Sbjct: 181 RLVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYFNILINKFCKE 240

Query: 241 GSIRDARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTY 300
           GS RDA+LIFDEI K G RPTTVSFNTLINGLCKSRNLDE FRLKK MEENRIYPDV+TY
Sbjct: 241 GSTRDAKLIFDEIRKWGLRPTTVSFNTLINGLCKSRNLDEGFRLKKIMEENRIYPDVFTY 300

Query: 301 SVLIHGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLA 360
           SVLIH LCKEGR+D+AEQLFDEM++RGLR N VTFTALIDGQC+  +IDSAMNTY QML 
Sbjct: 301 SVLIHRLCKEGRLDNAEQLFDEMQKRGLRPNGVTFTALIDGQCKRRQIDSAMNTYHQMLT 360

Query: 361 MGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420
           MGVKPDLVMYNTLL GLCKVGDV+KARKL+DEM+MVGMKPDKI+YTTLIDGYCKEGDLES
Sbjct: 361 MGVKPDLVMYNTLLYGLCKVGDVNKARKLIDEMRMVGMKPDKISYTTLIDGYCKEGDLES 420

Query: 421 AMEIRKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVID 480
           A+EIRKGMN EGVVLDNVAFTA+ISG CRDGRV DAE TLREM EAGMKPDDATYTMVID
Sbjct: 421 ALEIRKGMNEEGVVLDNVAFTALISGFCRDGRVRDAERTLREMMEAGMKPDDATYTMVID 480

Query: 481 GYCKNGDVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540
           GYCK GDVK GFK+LKEMQ NGH PGVITYNVLMNGLCKQGQMKNA+MLLEAMLNLGVTP
Sbjct: 481 GYCKKGDVKTGFKMLKEMQINGHKPGVITYNVLMNGLCKQGQMKNAHMLLEAMLNLGVTP 540

Query: 541 DDITYNILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 595
           DDITYNILLEGHCK+G+AED L LRNEKGL++DYAYYTSLVGEYDKSLKDR+KR
Sbjct: 541 DDITYNILLEGHCKNGKAEDLLKLRNEKGLIIDYAYYTSLVGEYDKSLKDRQKR 590

BLAST of CmaCh14G008690 vs. NCBI nr
Match: gi|778679906|ref|XP_004140820.2| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Cucumis sativus])

HSP 1 Score: 1036.9 bits (2680), Expect = 1.3e-299
Identity = 517/594 (87.04%), Positives = 544/594 (91.58%), Query Frame = 1

Query: 1   MAANSTFKLSNFSSSLPSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNVETKPLASSL 60
           M  NS+FK    S SL SKPSF YSTWHSPPP AA ADPVLAAVSTAINN +TKPLASSL
Sbjct: 15  MPNNSSFK---HSISL-SKPSFLYSTWHSPPPLAALADPVLAAVSTAINNAQTKPLASSL 74

Query: 61  RRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQM 120
           RRLLPSFKPHHFIDLIN NPFSLSP SLFSFFNWLSS+PTFRHT QSYCAMANFL  HQM
Sbjct: 75  RRLLPSFKPHHFIDLINQNPFSLSPSSLFSFFNWLSSIPTFRHTSQSYCAMANFLSAHQM 134

Query: 121 FEESQSIIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCF 180
           F+E QSIIRFLVSRKGKDSAAS+FAAIL+   TRCSNFVFDALMIAY DSGF+SDAIQCF
Sbjct: 135 FQECQSIIRFLVSRKGKDSAASVFAAILDTAGTRCSNFVFDALMIAYWDSGFVSDAIQCF 194

Query: 181 RLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQ 240
           RLVR  NFQIPF GC YLLDKM+NSNSPVTIWTFY EIL+ GFPPKV+Y+NILINKFCK+
Sbjct: 195 RLVRNSNFQIPFHGCGYLLDKMINSNSPVTIWTFYSEILEYGFPPKVQYYNILINKFCKE 254

Query: 241 GSIRDARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTY 300
           GSIRDA+LIF+EI KRG RPTTVSFNTLINGLCKSRNLDE FRLKK MEENRIYPDV+TY
Sbjct: 255 GSIRDAKLIFNEIRKRGLRPTTVSFNTLINGLCKSRNLDEGFRLKKTMEENRIYPDVFTY 314

Query: 301 SVLIHGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLA 360
           SVLIHGLCKEGR+D AEQLFDEM+QRGLR N +TFTALIDGQCRS RIDSAMNTY QML 
Sbjct: 315 SVLIHGLCKEGRLDVAEQLFDEMQQRGLRPNGITFTALIDGQCRSRRIDSAMNTYHQMLT 374

Query: 361 MGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420
           MGVKPDLVMYNTLLNGLCKVGDV+KARKLVDEM+MVGMKPDKITYTTLIDGYCKEGDLES
Sbjct: 375 MGVKPDLVMYNTLLNGLCKVGDVNKARKLVDEMRMVGMKPDKITYTTLIDGYCKEGDLES 434

Query: 421 AMEIRKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVID 480
           AMEIRKGMN EGVVLDNVAFTA+ISG CRDGRV DAE TLREM EAGMKPDDATYTMVID
Sbjct: 435 AMEIRKGMNEEGVVLDNVAFTALISGFCRDGRVRDAERTLREMVEAGMKPDDATYTMVID 494

Query: 481 GYCKNGDVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 540
           GYCK G+VK GFKLLKEMQ NGH PGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP
Sbjct: 495 GYCKKGNVKMGFKLLKEMQINGHKPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTP 554

Query: 541 DDITYNILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 595
           DDITYNILLEGHCK+G+AED L LRNEKGL+VDYAYYTSLV EY+KSLKDR+KR
Sbjct: 555 DDITYNILLEGHCKNGKAEDLLKLRNEKGLIVDYAYYTSLVSEYNKSLKDRQKR 604

BLAST of CmaCh14G008690 vs. NCBI nr
Match: gi|1000940553|ref|XP_015582938.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Ricinus communis])

HSP 1 Score: 811.2 bits (2094), Expect = 1.2e-231
Identity = 396/597 (66.33%), Positives = 477/597 (79.90%), Query Frame = 1

Query: 1   MAANSTFKLSNFSSSLPSKPSFRYSTWHSPPPPAAAA----DPVLAAVSTAINNVETKPL 60
           MA     K   F SS   KP   +STW+SPPPP+ +     DP+L+ +S AI N +TKPL
Sbjct: 1   MATLKLHKTRTFLSSPNKKPVLLFSTWYSPPPPSPSPQYQEDPILSTISNAIKNSDTKPL 60

Query: 61  ASSLRRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLC 120
             SL +++PS KP+H I+LIN NP SLSP SLFSFFN+LSS PTFRHT+ SYC M +FL 
Sbjct: 61  HISLNKIIPSLKPNHIINLINQNPHSLSPHSLFSFFNFLSSCPTFRHTIHSYCTMVHFLS 120

Query: 121 THQMFEESQSIIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDA 180
            HQM  ++QS++ F++SRKGKDS+ S+FA+ILE      S+FVFD LM AY+D  F+SDA
Sbjct: 121 AHQMHSQAQSLLHFIISRKGKDSSFSVFASILETKGNHFSDFVFDGLMNAYTDLEFLSDA 180

Query: 181 IQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINK 240
           IQCFRLVRK NF+IPF GC+YLLD+M+ ++SP+    FYLEILDSG+PP VK FN+LI++
Sbjct: 181 IQCFRLVRKHNFKIPFHGCKYLLDRMIRNSSPILARRFYLEILDSGYPPNVKSFNMLISR 240

Query: 241 FCKQGSIRDARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPD 300
           FCK+G I DA +IFDEIGK G +PT VSFNTLING CK  NLDE FRLKK MEE+RI+PD
Sbjct: 241 FCKEGKINDAHMIFDEIGKWGLQPTVVSFNTLINGYCKLGNLDEGFRLKKVMEESRIFPD 300

Query: 301 VYTYSVLIHGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQ 360
           V+TYSVLI+GLCK+ R++DA QLFDEM ++GL  NDVTFT LI+GQC++GRID A+  YQ
Sbjct: 301 VFTYSVLINGLCKDRRLNDANQLFDEMCEKGLVPNDVTFTTLINGQCKNGRIDLAVEMYQ 360

Query: 361 QMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEG 420
           QML  G+KPDL++YNTL+NGLCKVGD+ +ARKL DEM     KPDKITYTTLIDGYCKEG
Sbjct: 361 QMLRKGLKPDLILYNTLINGLCKVGDMREARKLADEMNERCQKPDKITYTTLIDGYCKEG 420

Query: 421 DLESAMEIRKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYT 480
           DLESA+EIR  M  EGV LD VAFTAIISGLC++G+V+DAE  LREM +AG KPDDATYT
Sbjct: 421 DLESALEIRNIMIKEGVELDIVAFTAIISGLCKEGKVIDAERALREMLKAGFKPDDATYT 480

Query: 481 MVIDGYCKNGDVKPGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNL 540
           MV+DG+CK GD+K GFKLLKEMQ +GH PGV+TYNVLMNG CKQ QMKNANMLL+AM+NL
Sbjct: 481 MVMDGFCKKGDMKTGFKLLKEMQSDGHVPGVVTYNVLMNGYCKQSQMKNANMLLDAMMNL 540

Query: 541 GVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGLVVDYAYYTSLVGEYDKSLKDRRK 594
           GV PDDITYNILLEGHCK G  +DF  L++EKG+V DYA Y SL+ E  +S KDR+K
Sbjct: 541 GVVPDDITYNILLEGHCKHGNLQDFHKLQSEKGVVADYASYKSLLNELSRSSKDRQK 597

BLAST of CmaCh14G008690 vs. NCBI nr
Match: gi|595968678|ref|XP_007217406.1| (hypothetical protein PRUPE_ppa024153mg [Prunus persica])

HSP 1 Score: 807.4 bits (2084), Expect = 1.7e-230
Identity = 391/572 (68.36%), Positives = 469/572 (81.99%), Query Frame = 1

Query: 25  STWHSPPPPAAAADPVLAAVSTAINNV--ETKPLASSLRRLLPSFKPHHFIDLINHNPFS 84
           + W++ PP     DP L+A+S AI      ++PL SSLR+LLPS      I+LIN NP S
Sbjct: 34  TAWYNQPPTPHNEDPKLSAISDAIKTTTQNSQPLDSSLRKLLPSLTARDVINLINLNPHS 93

Query: 85  LSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQMFEESQSIIRFLVSRKGKDSAAS 144
           LSP+SL SFFNWLSS PTFRH +QSYC MA+FLC HQM+ ++QS++R +VSRKGK++A+S
Sbjct: 94  LSPLSLLSFFNWLSSHPTFRHNIQSYCTMAHFLCAHQMYPQAQSLLRIVVSRKGKETASS 153

Query: 145 IFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLDKM 204
           +FA+ILE   T  SN+VFDALM AY D GF+SDA QCFRL+RK NF+IPF  C  LLDKM
Sbjct: 154 VFASILETRGTHQSNYVFDALMNAYVDCGFVSDACQCFRLLRKHNFRIPFHACGCLLDKM 213

Query: 205 MNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFRPTT 264
           +  NSPV  W FYLEILDSGFPPKV  FN+L++K CK+G IR+A+L+FDEIGKRG  PT 
Sbjct: 214 LKLNSPVVAWGFYLEILDSGFPPKVYNFNVLMHKLCKEGEIREAQLVFDEIGKRGLLPTV 273

Query: 265 VSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTYSVLIHGLCKEGRVDDAEQLFDE 324
           VSFNTLING CKSRNL+E FRLK+ MEE+R  PDV+TYSVLI+GLCKE R+DDA  LFDE
Sbjct: 274 VSFNTLINGYCKSRNLEECFRLKRDMEESRTRPDVFTYSVLINGLCKELRLDDANLLFDE 333

Query: 325 MRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGD 384
           M +RGL  N+VT+T LIDGQC++GRID AM  YQ+ML +G+KPD++ YNTL+NGLCKVGD
Sbjct: 334 MCERGLVPNNVTYTTLIDGQCKNGRIDLAMEVYQKMLGIGIKPDVITYNTLINGLCKVGD 393

Query: 385 VSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNVEGVVLDNVAFTA 444
           + +ARKLV+EM + G+KPD ITYTTLIDG CKEG+L+SA+EIRKGM  +G+ LDNVAFTA
Sbjct: 394 LKEARKLVEEMNIAGLKPDTITYTTLIDGCCKEGNLQSALEIRKGMIKQGIELDNVAFTA 453

Query: 445 IISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVIDGYCKNGDVKPGFKLLKEMQRNG 504
           +ISGLCR+G+ +DAE TLREM  +GMKPDDATYTM+IDG+CK GDVK GFKLLKEMQ +G
Sbjct: 454 LISGLCREGKTLDAERTLREMLNSGMKPDDATYTMIIDGFCKKGDVKMGFKLLKEMQGDG 513

Query: 505 HNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGRAEDFL 564
           + P V+TYN LMNGLCK GQMKNANMLL+AM+NLGV PDDITYNILLEGHCK G  EDF 
Sbjct: 514 YVPSVVTYNALMNGLCKLGQMKNANMLLDAMINLGVAPDDITYNILLEGHCKHGNPEDFD 573

Query: 565 HLRNEKGLVVDYAYYTSLVGEYDKSLKDRRKR 595
            LR+ KGLV+DYA YTSLV E++KS KDRRKR
Sbjct: 574 KLRSGKGLVLDYASYTSLVSEFNKSSKDRRKR 605

BLAST of CmaCh14G008690 vs. NCBI nr
Match: gi|700202327|gb|KGN57460.1| (hypothetical protein Csa_3G188330 [Cucumis sativus])

HSP 1 Score: 798.1 bits (2060), Expect = 1.0e-227
Identity = 402/497 (80.89%), Positives = 434/497 (87.32%), Query Frame = 1

Query: 1   MAANSTFKLSNFSSSLPSKPSFRYSTWHSPPPPAAAADPVLAAVSTAINNVETKPLASSL 60
           M  NS+FK    S SL SKPSF YSTWHSPPP AA ADPVLAAVSTAINN +TKPLASSL
Sbjct: 15  MPNNSSFK---HSISL-SKPSFLYSTWHSPPPLAALADPVLAAVSTAINNAQTKPLASSL 74

Query: 61  RRLLPSFKPHHFIDLINHNPFSLSPVSLFSFFNWLSSVPTFRHTLQSYCAMANFLCTHQM 120
           RRLLPSFKPHHFIDLIN NPFSLSP SLFSFFNWLSS+PTFRHT QSYCAMANFL  HQM
Sbjct: 75  RRLLPSFKPHHFIDLINQNPFSLSPSSLFSFFNWLSSIPTFRHTSQSYCAMANFLSAHQM 134

Query: 121 FEESQSIIRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCF 180
           F+E QSIIRFLVSRKGKDSAAS+FAAIL+   TRCSNFVFDALMIAY DSGF+SDAIQCF
Sbjct: 135 FQECQSIIRFLVSRKGKDSAASVFAAILDTAGTRCSNFVFDALMIAYWDSGFVSDAIQCF 194

Query: 181 RLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQ 240
           RLVR  NFQIPF GC YLLDKM+NSNSPVTIWTFY EIL+ GFPPKV+Y+NILINKFCK+
Sbjct: 195 RLVRNSNFQIPFHGCGYLLDKMINSNSPVTIWTFYSEILEYGFPPKVQYYNILINKFCKE 254

Query: 241 GSIRDARLIFDEIGKRGFRPTTVSFNTLINGLCKSRNLDESFRLKKAMEENRIYPDVYTY 300
           GSIRDA+LIF+EI KRG RPTTVSFNTLINGLCKSRNLDE FRLKK MEENRIYPDV+TY
Sbjct: 255 GSIRDAKLIFNEIRKRGLRPTTVSFNTLINGLCKSRNLDEGFRLKKTMEENRIYPDVFTY 314

Query: 301 SVLIHGLCKEGRVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLA 360
           SVLIHGLCKEGR+D AEQLFDEM+QRGLR N +TFTALIDGQCRS RIDSAMNTY QML 
Sbjct: 315 SVLIHGLCKEGRLDVAEQLFDEMQQRGLRPNGITFTALIDGQCRSRRIDSAMNTYHQMLT 374

Query: 361 MGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLES 420
           MGVKPDLVMYNTLLNGLCKVGDV+KARKLVDEM+MVGMKPDKITYTTLIDGYCKEGDLES
Sbjct: 375 MGVKPDLVMYNTLLNGLCKVGDVNKARKLVDEMRMVGMKPDKITYTTLIDGYCKEGDLES 434

Query: 421 AMEIRKGMNVEGVVLDNVAFTAIISGLCRDGRVMDAEGTLREMKEAGMKPDDATYTMVID 480
           AMEIRKGMN EGVVLDNVAFTA+ISG  ++  ++     L  M   G+ PDD TY ++++
Sbjct: 435 AMEIRKGMNEEGVVLDNVAFTALISGQMKNANML-----LEAMLNLGVTPDDITYNILLE 494

Query: 481 GYCKNGDVKPGFKLLKE 498
           G+CKNG  +   KL  E
Sbjct: 495 GHCKNGKAEDLLKLRNE 502

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
PPR26_ARATH   3.8e-190   56.46       Putative pentatricopeptide repeat-containing protein At1g09680 OS=Arabidopsis th...
PP407_ARATH   1.2e-82    30.97       Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN...
PP432_ARATH   3.5e-74    30.62       Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN...
PP404_ARATH   2.3e-70    30.16       Pentatricopeptide repeat-containing protein At5g38730 OS=Arabidopsis thaliana GN...
PPR12_ARATH   2.3e-70    28.82       Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
Match Name         E-value    Identity    Description
M5XHR3_PRUPE       1.2e-230   68.36       Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa024153mg PE=4 SV=1
A0A0A0L668_CUCSA   7.2e-228   80.89       Uncharacterized protein OS=Cucumis sativus GN=Csa_3G188330 PE=4 SV=1
D7TSI7_VITVI       2.3e-218   64.31       Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0006g00170 PE=4 SV=...
U5FHA0_POPTR       9.8e-217   62.44       Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP...
A0A061EF62_THECC   1.3e-213   64.86       Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_017452 PE...
Match Name    E-value    Identity    Description
AT1G09680.1   2.2e-191   56.46       Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1   6.7e-84    30.97       Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G55840.1   2.0e-75    30.62       Pentatricopeptide repeat (PPR) superfamily protein
AT5G38730.1   1.3e-71    30.16       Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G05670.1   1.3e-71    28.82       Pentatricopeptide repeat (PPR-like) superfamily protein
Match Name                          E-value    Identity    Description
gi|659114573|ref|XP_008457121.1|    5.7e-303   87.37       PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Cucum...
gi|778679906|ref|XP_004140820.2|    1.3e-299   87.04       PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Cucum...
gi|1000940553|ref|XP_015582938.1|   1.2e-231   66.33       PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Ricin...
gi|595968678|ref|XP_007217406.1|    1.7e-230   68.36       hypothetical protein PRUPE_ppa024153mg [Prunus persica]
gi|700202327|gb|KGN57460.1|         1.0e-227   80.89       hypothetical protein Csa_3G188330 [Cucumis sativus]
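
Hit summaries like the NCBI nr rows above are often ranked by E-value and then filtered on percent identity. The sketch below does this for the five NCBI nr hits, with values copied from this record. The 80% identity cutoff and the variable names are illustrative assumptions, not thresholds used by the database.

```python
import math

# (accession, E-value, percent identity) for the NCBI nr hits listed above.
hits = [
    ("XP_008457121.1", 5.7e-303, 87.37),
    ("XP_004140820.2", 1.3e-299, 87.04),
    ("XP_015582938.1", 1.2e-231, 66.33),
    ("XP_007217406.1", 1.7e-230, 68.36),
    ("KGN57460.1", 1.0e-227, 80.89),
]

# Smaller E-values indicate more significant matches, so sort ascending.
ranked = sorted(hits, key=lambda h: h[1])

# Hypothetical cutoff: keep hits with at least 80% identity.
MIN_IDENTITY = 80.0
high_identity = [acc for acc, _, ident in ranked if ident >= MIN_IDENTITY]

for acc, evalue, ident in ranked:
    # -log10(E) is a convenient way to compare very small E-values.
    print(f"{acc:16s} -log10(E) = {-math.log10(evalue):6.1f}  identity = {ident:.2f}%")
print("high-identity hits:", high_identity)
```

Percent identity alone can mislead when alignment lengths differ: the KGN57460.1 alignment reports 80.89% identity over 497 columns, whereas the two top hits are aligned over 594 columns.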
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR002885    Pentatricopeptide_repeat
IPR011990    TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term         Definition
GO:0005515   protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005739 mitochondrion
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmaCh14G008690.1   CmaCh14G008690.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR002885   Pentatricopeptide repeat   PFAM   PF01535   PPR
    coord: 230..258   score: 9.3E-4
    coord: 159..188   score:
IPR002885   Pentatricopeptide repeat   PFAM   PF12854   PPR_1
    coord: 326..358   score: 1.
IPR002885   Pentatricopeptide repeat   PFAM   PF13041   PPR_2
    coord: 436..484   score: 3.0E-15
    coord: 505..554   score: 7.4E-17
    coord: 260..309   score: 1.7E-17
    coord: 365..414   score: 5.8
IPR002885   Pentatricopeptide repeat   TIGRFAMs   TIGR00756   TIGR00756
    coord: 403..436   score: 7.4E-8
    coord: 368..401   score: 2.8E-10
    coord: 230..261   score: 3.1E-6
    coord: 333..366   score: 4.8E-8
    coord: 263..297   score: 3.5E-8
    coord: 298..331   score: 1.1E-11
    coord: 543..565   score: 0.0031
    coord: 438..471   score: 2.6E-8
    coord: 508..541   score: 1.8E-7
    coord: 474..506   score: 1.
IPR002885   Pentatricopeptide repeat   PROFILE   PS51375   PPR
    coord: 506..540   score: 11.904
    coord: 541..571   score: 6.95
    coord: 471..505   score: 12.869
    coord: 156..190   score: 7.892
    coord: 296..330   score: 14.929
    coord: 366..400   score: 13.329
    coord: 261..295   score: 11.356
    coord: 226..260   score: 10.731
    coord: 104..138   score: 5.656
    coord: 331..365   score: 12.485
    coord: 401..435   score: 11.86
    coord: 436..470   score: 1
IPR011990   Tetratricopeptide-like helical domain   GENE3D   G3DSA:1.25.40.10
    coord: 157..187   score: 2.0E-5
    coord: 296..423   score: 2.0E-5
    coord: 224..260   score: 2.
IPR011990   Tetratricopeptide-like helical domain   unknown   SSF48452   TPR-like
    coord: 133..187   score: 9.86E-5
    coord: 224..423   score: 9.8
None   No IPR available   PANTHER   PTHR24015   FAMILY NOT NAMED
    coord: 152..579   score: 3.8E-214
    coord: 18..134    score: 3.8E
None   No IPR available   PANTHER   PTHR24015:SF824   SUBFAMILY NOT NAMED
    coord: 152..579   score: 3.8E-214
    coord: 18..134    score: 3.8E
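
The PROSITE PS51375 coordinates above are listed out of order, which hides how the PPR motifs are laid out along the protein. A minimal sketch that sorts them and groups back-to-back repeats into tandem runs is given below. The coordinates are copied from this record; the helper name `tandem_runs` and the strict adjacency rule (next start equals previous end plus one) are illustrative assumptions.

```python
# PROSITE PS51375 (PPR) hit coordinates as listed in the InterPro table above.
ppr_hits = [
    (506, 540), (541, 571), (471, 505), (156, 190), (296, 330), (366, 400),
    (261, 295), (226, 260), (104, 138), (331, 365), (401, 435), (436, 470),
]


def tandem_runs(hits):
    """Group hits into runs where each repeat starts right after the previous one ends."""
    runs = []
    for start, end in sorted(hits):
        if runs and start == runs[-1][-1][1] + 1:
            runs[-1].append((start, end))
        else:
            runs.append([(start, end)])
    return runs


runs = tandem_runs(ppr_hits)
longest = max(runs, key=len)

for run in runs:
    lengths = [end - start + 1 for start, end in run]
    print(f"{run[0][0]}..{run[-1][1]}: {len(run)} repeat(s), lengths {lengths}")
```

Under this adjacency rule the twelve hits fall into three groups: isolated motifs at 104..138 and 156..190, and one uninterrupted run of ten repeats spanning residues 226..571. Every repeat is 35 residues long except the last one (541..571), which is 31.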