CmaCh16G006330 (gene) Cucurbita maxima (Rimu)

Name: CmaCh16G006330
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: Pentatricopeptide repeat-containing protein
Location: Cma_Chr16 : 3265872 .. 3267824 (-)
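The location field above packs the chromosome, coordinates, and strand into one string. The following is a minimal sketch of how it could be split apart; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split 'Chrom : start .. end (strand)' into its four parts."""
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cma_Chr16 : 3265872 .. 3267824 (-)")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 1953 bp on the minus strand.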
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATTGGTCTCTCCCGCAACTTCTCCACAGTCTGCAAGCTTAGTTATTTACAGCCCCGCCAGACACAGGCGACGCCAAATTTCATACCCTTTTCTCAGCTTCAACAGCAGCGGAAGCTCTTAGAATGGCGACTCATAAGTATTCTCCACGACTGCACGGACTTTTCTCAAATCAAGCAAGTCCATGGCCAAATCATTTGCAATGGCTTGAGTCAATGCTCTTACGTCCTTACCAAACTCATTCGCATGCTTTCGAAGGTAGATGTCCCAATGGATTGCTACCCACGTCTGGTTTTTGGACAGGTTAATTACCCCAATCCCTTTCTTTGGACTGCCATGATTCGTGGGTATGCCCTTCAAGGACCCTTCACTGAGTCTATTAACCTGTATACTAACATGAGGGGGAATGGCGTCAGTCCTGTTTCGTTTACGTTTTCTGCGCTTTTTAAGGCTTGTGGGGCTTCTCTTAATTTGGATTTAGGTAGGCAGGTGCATGCTCAGACGATTTTGATTGGGGGATTCGCTTCTGATTTATATGTTGGTAATACGATGATTGATATGTATGTGAAATGTGGGGTTTTGGGTTGTGGGCGGAAGGTGTTCGATGAAATGTCTGAGAGAGATGTGGTTTCATGGACTGAGCTGATTGTTGCGTATGCGAAGTTTGGGGACATGGAATCTGCTAGGGGGCTGTTTGATGAATTGCCTTTGAAGGATATGGTGGCATGGACTGCAATGGTCACTGGTTATGCCCAAAATGCTAGACCAAAGGAGGCATTGGAGTATTTTCAGAAGATGCAGGATGTCGGTATCGAGACCGATGAAGTTACACTTGTTGGTGTCATCTCAGCCTGTGCACAATTGGGTGCAGTTAAATATGCTAACTGGATTAGAGACATTGCTGAAAGATCAGGCTTTGGACCTTATGAACATGTAATGGTGGGATCTGCTCTTATCGATATGTATTCTAAATGTGGTAGTCCTGATGAGGCATACAAGATTTTTGAAGGAATGAAGGAAAGAAATGTGTTCTCGTATAGTTCAATGATTGTGGGATACGCTATGCATGGTCGCGCTCATTCCGCTTTGCAGTTGTTCCATGAGATGTTAAAGACTGAGATCAGGCCAAATAAGGTTACTTTCATTGGGGTGCTTTCAGCATGTAGCCATGCCGGTATGGTCGAACAAGGTCGGCAGCTATTTGCTAAGATGGAAAAGTATTTTAACGTAACGCCTTCACCCGATCATTATGCGTGTATGGTTGATCTCCTTGGTCGAGGTGGATGTTTGGAAGAAGCTCTTGAACTCATTGAAACCATGCCAATGGAACCCCATGGAGGCGTATGGGGAGCACTGCTTGGAGCTTGCCGCATCCATGGGAATCCCAACATTGCTCAGGTAGCTGCCGATCAATTATTCAAGCTAGAACCAGATGGTATAGGTAACTACATTCTGCTATCAAACATATATGCATCAGCAGGAAGATGGGAAGAGGTGTCGAAATTACGGAAAGTGATTCGAGCCAAAGGCTTAAAGAAGAATCCTGGCTGCAGCTGGTTTGAAGGAAAGAAAGGGGACATTCATGAATTCTTTGCGGATGATGCAACCCATCAACGATCAAGTGAGATTAGACAAGCCTTGAGGCAACTCCTTGTGAGATTAAGAGCCCATGGATACAAGCCAAACTTGAGCTCTGTGCCTTATGACTTGACCGATGATGAAAAGGAACGAATACTGATGAGTCATAGTGAGAAGCTGGCGTTGGCATACGGGATGTTATGTACTGAGGCAGGGGAAACCATTACGATCATGAAAAACCTTAGGATATGCGAGGATTGCCACAATGTCATGTGTGCTGCATCTGAAATCACTGGAAGGGAGATCATTATAAGGGATAACATGAGATTTCATCACTTCCACAATGGGACTTGTTCTTGTGGTAACTTTTGGTGA

mRNA sequence

ATGATTGGTCTCTCCCGCAACTTCTCCACAGTCTGCAAGCTTAGTTATTTACAGCCCCGCCAGACACAGGCGACGCCAAATTTCATACCCTTTTCTCAGCTTCAACAGCAGCGGAAGCTCTTAGAATGGCGACTCATAAGTATTCTCCACGACTGCACGGACTTTTCTCAAATCAAGCAAGTCCATGGCCAAATCATTTGCAATGGCTTGAGTCAATGCTCTTACGTCCTTACCAAACTCATTCGCATGCTTTCGAAGGTAGATGTCCCAATGGATTGCTACCCACGTCTGGTTTTTGGACAGGTTAATTACCCCAATCCCTTTCTTTGGACTGCCATGATTCGTGGGTATGCCCTTCAAGGACCCTTCACTGAGTCTATTAACCTGTATACTAACATGAGGGGGAATGGCGTCAGTCCTGTTTCGTTTACGTTTTCTGCGCTTTTTAAGGCTTGTGGGGCTTCTCTTAATTTGGATTTAGGTAGGCAGGTGCATGCTCAGACGATTTTGATTGGGGGATTCGCTTCTGATTTATATGTTGGTAATACGATGATTGATATGTATGTGAAATGTGGGGTTTTGGGTTGTGGGCGGAAGGTGTTCGATGAAATGTCTGAGAGAGATGTGGTTTCATGGACTGAGCTGATTGTTGCGTATGCGAAGTTTGGGGACATGGAATCTGCTAGGGGGCTGTTTGATGAATTGCCTTTGAAGGATATGGTGGCATGGACTGCAATGGTCACTGGTTATGCCCAAAATGCTAGACCAAAGGAGGCATTGGAGTATTTTCAGAAGATGCAGGATGTCGGTATCGAGACCGATGAAGTTACACTTGTTGGTGTCATCTCAGCCTGTGCACAATTGGGTGCAGTTAAATATGCTAACTGGATTAGAGACATTGCTGAAAGATCAGGCTTTGGACCTTATGAACATGTAATGGTGGGATCTGCTCTTATCGATATGTATTCTAAATGTGGTAGTCCTGATGAGGCATACAAGATTTTTGAAGGAATGAAGGAAAGAAATGTGTTCTCGTATAGTTCAATGATTGTGGGATACGCTATGCATGGTCGCGCTCATTCCGCTTTGCAGTTGTTCCATGAGATGTTAAAGACTGAGATCAGGCCAAATAAGGTTACTTTCATTGGGGTGCTTTCAGCATGTAGCCATGCCGGTATGGTCGAACAAGGTCGGCAGCTATTTGCTAAGATGGAAAAGTATTTTAACGTAACGCCTTCACCCGATCATTATGCGTGTATGGTTGATCTCCTTGGTCGAGGTGGATGTTTGGAAGAAGCTCTTGAACTCATTGAAACCATGCCAATGGAACCCCATGGAGGCGTATGGGGAGCACTGCTTGGAGCTTGCCGCATCCATGGGAATCCCAACATTGCTCAGGTAGCTGCCGATCAATTATTCAAGCTAGAACCAGATGGTATAGGTAACTACATTCTGCTATCAAACATATATGCATCAGCAGGAAGATGGGAAGAGGTGTCGAAATTACGGAAAGTGATTCGAGCCAAAGGCTTAAAGAAGAATCCTGGCTGCAGCTGGTTTGAAGGAAAGAAAGGGGACATTCATGAATTCTTTGCGGATGATGCAACCCATCAACGATCAAGTGAGATTAGACAAGCCTTGAGGCAACTCCTTGTGAGATTAAGAGCCCATGGATACAAGCCAAACTTGAGCTCTGTGCCTTATGACTTGACCGATGATGAAAAGGAACGAATACTGATGAGTCATAGTGAGAAGCTGGCGTTGGCATACGGGATGTTATGTACTGAGGCAGGGGAAACCATTACGATCATGAAAAACCTTAGGATATGCGAGGATTGCCACAATGTCATGTGTGCTGCATCTGAAATCACTGGAAGGGAGATCATTATAAGGGATAACATGAGATTTCATCACTTCCACAATGGGACTTGTTCTTGTGGTAACTTTTGGTGA

Coding sequence (CDS)

ATGATTGGTCTCTCCCGCAACTTCTCCACAGTCTGCAAGCTTAGTTATTTACAGCCCCGCCAGACACAGGCGACGCCAAATTTCATACCCTTTTCTCAGCTTCAACAGCAGCGGAAGCTCTTAGAATGGCGACTCATAAGTATTCTCCACGACTGCACGGACTTTTCTCAAATCAAGCAAGTCCATGGCCAAATCATTTGCAATGGCTTGAGTCAATGCTCTTACGTCCTTACCAAACTCATTCGCATGCTTTCGAAGGTAGATGTCCCAATGGATTGCTACCCACGTCTGGTTTTTGGACAGGTTAATTACCCCAATCCCTTTCTTTGGACTGCCATGATTCGTGGGTATGCCCTTCAAGGACCCTTCACTGAGTCTATTAACCTGTATACTAACATGAGGGGGAATGGCGTCAGTCCTGTTTCGTTTACGTTTTCTGCGCTTTTTAAGGCTTGTGGGGCTTCTCTTAATTTGGATTTAGGTAGGCAGGTGCATGCTCAGACGATTTTGATTGGGGGATTCGCTTCTGATTTATATGTTGGTAATACGATGATTGATATGTATGTGAAATGTGGGGTTTTGGGTTGTGGGCGGAAGGTGTTCGATGAAATGTCTGAGAGAGATGTGGTTTCATGGACTGAGCTGATTGTTGCGTATGCGAAGTTTGGGGACATGGAATCTGCTAGGGGGCTGTTTGATGAATTGCCTTTGAAGGATATGGTGGCATGGACTGCAATGGTCACTGGTTATGCCCAAAATGCTAGACCAAAGGAGGCATTGGAGTATTTTCAGAAGATGCAGGATGTCGGTATCGAGACCGATGAAGTTACACTTGTTGGTGTCATCTCAGCCTGTGCACAATTGGGTGCAGTTAAATATGCTAACTGGATTAGAGACATTGCTGAAAGATCAGGCTTTGGACCTTATGAACATGTAATGGTGGGATCTGCTCTTATCGATATGTATTCTAAATGTGGTAGTCCTGATGAGGCATACAAGATTTTTGAAGGAATGAAGGAAAGAAATGTGTTCTCGTATAGTTCAATGATTGTGGGATACGCTATGCATGGTCGCGCTCATTCCGCTTTGCAGTTGTTCCATGAGATGTTAAAGACTGAGATCAGGCCAAATAAGGTTACTTTCATTGGGGTGCTTTCAGCATGTAGCCATGCCGGTATGGTCGAACAAGGTCGGCAGCTATTTGCTAAGATGGAAAAGTATTTTAACGTAACGCCTTCACCCGATCATTATGCGTGTATGGTTGATCTCCTTGGTCGAGGTGGATGTTTGGAAGAAGCTCTTGAACTCATTGAAACCATGCCAATGGAACCCCATGGAGGCGTATGGGGAGCACTGCTTGGAGCTTGCCGCATCCATGGGAATCCCAACATTGCTCAGGTAGCTGCCGATCAATTATTCAAGCTAGAACCAGATGGTATAGGTAACTACATTCTGCTATCAAACATATATGCATCAGCAGGAAGATGGGAAGAGGTGTCGAAATTACGGAAAGTGATTCGAGCCAAAGGCTTAAAGAAGAATCCTGGCTGCAGCTGGTTTGAAGGAAAGAAAGGGGACATTCATGAATTCTTTGCGGATGATGCAACCCATCAACGATCAAGTGAGATTAGACAAGCCTTGAGGCAACTCCTTGTGAGATTAAGAGCCCATGGATACAAGCCAAACTTGAGCTCTGTGCCTTATGACTTGACCGATGATGAAAAGGAACGAATACTGATGAGTCATAGTGAGAAGCTGGCGTTGGCATACGGGATGTTATGTACTGAGGCAGGGGAAACCATTACGATCATGAAAAACCTTAGGATATGCGAGGATTGCCACAATGTCATGTGTGCTGCATCTGAAATCACTGGAAGGGAGATCATTATAAGGGATAACATGAGATTTCATCACTTCCACAATGGGACTTGTTCTTGTGGTAACTTTTGGTGA

Protein sequence

MIGLSRNFSTVCKLSYLQPRQTQATPNFIPFSQLQQQRKLLEWRLISILHDCTDFSQIKQVHGQIICNGLSQCSYVLTKLIRMLSKVDVPMDCYPRLVFGQVNYPNPFLWTAMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLGRQVHAQTILIGGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAKFGDMESARGLFDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVGVISACAQLGAVKYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKERNVFSYSSMIVGYAMHGRAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLFAKMEKYFNVTPSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGNPNIAQVAADQLFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFEGKKGDIHEFFADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEKERILMSHSEKLALAYGMLCTEAGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFHNGTCSCGNFW
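The protein sequence above corresponds to a translation of the coding sequence. As a rough illustration, the sketch below translates a CDS with the standard genetic code; the helper name `translate` is hypothetical, and the use of the standard nuclear code table is an assumption, since the page does not say which table the annotation pipeline used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGATTGGTCTCTCCCGCAAC"))
```

Passing the full CDS string in place of the short prefix gives a quick check that the listed CDS and protein sequences agree.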
BLAST of CmaCh16G006330 vs. Swiss-Prot
Match: PP417_ARATH (Pentatricopeptide repeat-containing protein At5g44230 OS=Arabidopsis thaliana GN=PCMP-H17 PE=2 SV=1)

HSP 1 Score: 802.0 bits (2070), Expect = 4.9e-231
Identity = 382/621 (61.51%), Positives = 475/621 (76.49%), Query Frame = 1

Query: 31  FSQLQQQRKLLEWRLISILHDCTDFSQIKQVHGQIICNGLSQCSYVLTKLIRMLSKVDVP 90
           FS++  Q++LL   LIS L DC + +QIKQ+HG ++  GL Q  Y+LTKLIR L+K+ VP
Sbjct: 38  FSEISNQKELLVSSLISKLDDCINLNQIKQIHGHVLRKGLDQSCYILTKLIRTLTKLGVP 97

Query: 91  MDCYPRLVFGQVNYPNPFLWTAMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFK 150
           MD Y R V   V + NPFLWTA+IRGYA++G F E+I +Y  MR   ++PVSFTFSAL K
Sbjct: 98  MDPYARRVIEPVQFRNPFLWTAVIRGYAIEGKFDEAIAMYGCMRKEEITPVSFTFSALLK 157

Query: 151 ACGASLNLDLGRQVHAQTILIGGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVV 210
           ACG   +L+LGRQ HAQT  + GF   +YVGNTMIDMYVKC  + C RKVFDEM ERDV+
Sbjct: 158 ACGTMKDLNLGRQFHAQTFRLRGFCF-VYVGNTMIDMYVKCESIDCARKVFDEMPERDVI 217

Query: 211 SWTELIVAYAKFGDMESARGLFDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVG 270
           SWTELI AYA+ G+ME A  LF+ LP KDMVAWTAMVTG+AQNA+P+EALEYF +M+  G
Sbjct: 218 SWTELIAAYARVGNMECAAELFESLPTKDMVAWTAMVTGFAQNAKPQEALEYFDRMEKSG 277

Query: 271 IETDEVTLVGVISACAQLGAVKYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDE 330
           I  DEVT+ G ISACAQLGA KYA+    IA++SG+ P +HV++GSALIDMYSKCG+ +E
Sbjct: 278 IRADEVTVAGYISACAQLGASKYADRAVQIAQKSGYSPSDHVVIGSALIDMYSKCGNVEE 337

Query: 331 AYKIFEGMKERNVFSYSSMIVGYAMHGRAHSALQLFHEML-KTEIRPNKVTFIGVLSACS 390
           A  +F  M  +NVF+YSSMI+G A HGRA  AL LFH M+ +TEI+PN VTF+G L ACS
Sbjct: 338 AVNVFMSMNNKNVFTYSSMILGLATHGRAQEALHLFHYMVTQTEIKPNTVTFVGALMACS 397

Query: 391 HAGMVEQGRQLFAKMEKYFNVTPSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVW 450
           H+G+V+QGRQ+F  M + F V P+ DHY CMVDLLGR G L+EALELI+TM +EPHGGVW
Sbjct: 398 HSGLVDQGRQVFDSMYQTFGVQPTRDHYTCMVDLLGRTGRLQEALELIKTMSVEPHGGVW 457

Query: 451 GALLGACRIHGNPNIAQVAADQLFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAK 510
           GALLGACRIH NP IA++AA+ LF+LEPD IGNYILLSN+YASAG W  V ++RK+I+ K
Sbjct: 458 GALLGACRIHNNPEIAEIAAEHLFELEPDIIGNYILLSNVYASAGDWGGVLRVRKLIKEK 517

Query: 511 GLKKNPGCSWFEGKKGDIHEFFADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYD 570
           GLKK P  SW   K G +H+FF  +  H  S++I+  L +L+ RL   GY+P+LSSVPYD
Sbjct: 518 GLKKTPAVSWVVDKNGQMHKFFPGNLNHPMSNKIQDKLEELVERLTVLGYQPDLSSVPYD 577

Query: 571 LTDDEKERILMSHSEKLALAYGMLCTEAGETITIMKNLRICEDCHNVMCAASEITGREII 630
           ++D+ K  IL+ H+EKLALA+ +L T    TITIMKNLR+C DCH  M  ASE+TG+ II
Sbjct: 578 VSDNAKRLILIQHTEKLALAFSLLTTNRDSTITIMKNLRMCLDCHKFMRLASEVTGKVII 637

Query: 631 IRDNMRFHHFHNGTCSCGNFW 651
           +RDNMRFHHF +G CSCG+FW
Sbjct: 638 MRDNMRFHHFRSGDCSCGDFW 657
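Each hit on this page starts with two summary lines giving the bit score, raw score, E-value, and the identity and positive fractions. The sketch below shows one way those lines might be parsed and the percentages recomputed; the function names and regular expressions are illustrative assumptions about this page's text layout, not part of any BLAST tool.

```python
import re

# Patterns written against the summary lines as they appear on this page.
SCORE_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
FRACTION_RE = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)")

def parse_score(line):
    """Return (bit score, raw score, E-value) from an 'HSP ... Score:' line."""
    m = SCORE_RE.search(line)
    if m is None:
        raise ValueError(f"no score found in: {line!r}")
    return float(m.group(1)), int(m.group(2)), float(m.group(3))

def parse_fractions(line):
    """Return {'identity': (matched, aligned, pct), 'positives': (...)}."""
    result = {}
    for label, matched, aligned in FRACTION_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        matched, aligned = int(matched), int(aligned)
        result[key] = (matched, aligned, round(100 * matched / aligned, 2))
    return result

print(parse_score("HSP 1 Score: 802.0 bits (2070), Expect = 4.9e-231"))
print(parse_fractions("Identity = 382/621 (61.51%), Positives = 475/621 (76.49%)"))
```

The percentage is simply matched positions divided by alignment length, so 382/621 rounds to 61.51% and 475/621 to 76.49%, matching the values reported for the PP417_ARATH hit.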

BLAST of CmaCh16G006330 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 502.7 bits (1293), Expect = 6.1e-141
Identity = 249/609 (40.89%), Positives = 372/609 (61.08%), Query Frame = 1

Query: 45  LISILHDCTDFSQIKQVHGQIICNGLSQCSYVLTKLIRM-LSKVDVPMDCYPRLVFGQVN 104
           LI    + +  S  + +HG  + + +    +V   LI    S  D+   C    VF  + 
Sbjct: 137 LIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACK---VFTTIK 196

Query: 105 YPNPFLWTAMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLGRQ 164
             +   W +MI G+  +G   +++ L+  M    V     T   +  AC    NL+ GRQ
Sbjct: 197 EKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQ 256

Query: 165 VHAQTILIGGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAKFG 224
           V    I       +L + N M+DMY KCG +   +++FD M E+D V+WT ++  YA   
Sbjct: 257 V-CSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISE 316

Query: 225 DMESARGLFDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQ-DVGIETDEVTLVGVI 284
           D E+AR + + +P KD+VAW A+++ Y QN +P EAL  F ++Q    ++ +++TLV  +
Sbjct: 317 DYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTL 376

Query: 285 SACAQLGAVKYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKERN 344
           SACAQ+GA++   WI    ++ G     HV   SALI MYSKCG  +++ ++F  +++R+
Sbjct: 377 SACAQVGALELGRWIHSYIKKHGIRMNFHVT--SALIHMYSKCGDLEKSREVFNSVEKRD 436

Query: 345 VFSYSSMIVGYAMHGRAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLFA 404
           VF +S+MI G AMHG  + A+ +F++M +  ++PN VTF  V  ACSH G+V++   LF 
Sbjct: 437 VFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFH 496

Query: 405 KMEKYFNVTPSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGNP 464
           +ME  + + P   HYAC+VD+LGR G LE+A++ IE MP+ P   VWGALLGAC+IH N 
Sbjct: 497 QMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANL 556

Query: 465 NIAQVAADQLFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFEG 524
           N+A++A  +L +LEP   G ++LLSNIYA  G+WE VS+LRK +R  GLKK PGCS  E 
Sbjct: 557 NLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIE- 616

Query: 525 KKGDIHEFFADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDE-KERILMS 584
             G IHEF + D  H  S ++   L +++ +L+++GY+P +S V   + ++E KE+ L  
Sbjct: 617 IDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNL 676

Query: 585 HSEKLALAYGMLCTEAGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFHN 644
           HSEKLA+ YG++ TEA + I ++KNLR+C DCH+V    S++  REII+RD  RFHHF N
Sbjct: 677 HSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRN 736

Query: 645 GTCSCGNFW 651
           G CSC +FW
Sbjct: 737 GQCSCNDFW 738

BLAST of CmaCh16G006330 vs. Swiss-Prot
Match: PP367_ARATH (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 490.3 bits (1261), Expect = 3.2e-137
Identity = 237/611 (38.79%), Positives = 378/611 (61.87%), Query Frame = 1

Query: 46  ISILHDCTDFSQIKQVHGQIICNGLSQCSYVLTKLIRML---SKVDVPMDC--YPRLVFG 105
           +++L  C+ FS +K +HG ++   L    +V ++L+ +    S  + P +   Y   +F 
Sbjct: 16  LALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFS 75

Query: 106 QVNYPNPFLWTAMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDL 165
           Q+  PN F++  +IR ++     +++   YT M  + + P + TF  L KA      + +
Sbjct: 76  QIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLV 135

Query: 166 GRQVHAQTILIGGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYA 225
           G Q H+Q +  G F +D+YV N+++ MY  CG +    ++F +M  RDVVSWT ++  Y 
Sbjct: 136 GEQTHSQIVRFG-FQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYC 195

Query: 226 KFGDMESARGLFDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVG 285
           K G +E+AR +FDE+P +++  W+ M+ GYA+N   ++A++ F+ M+  G+  +E  +V 
Sbjct: 196 KCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVS 255

Query: 286 VISACAQLGAVKYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKE 345
           VIS+CA LGA+++     +   +S      ++++G+AL+DM+ +CG  ++A  +FEG+ E
Sbjct: 256 VISSCAHLGALEFGERAYEYVVKSHM--TVNLILGTALVDMFWRCGDIEKAIHVFEGLPE 315

Query: 346 RNVFSYSSMIVGYAMHGRAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQL 405
            +  S+SS+I G A+HG AH A+  F +M+     P  VTF  VLSACSH G+VE+G ++
Sbjct: 316 TDSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEI 375

Query: 406 FAKMEKYFNVTPSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHG 465
           +  M+K   + P  +HY C+VD+LGR G L EA   I  M ++P+  + GALLGAC+I+ 
Sbjct: 376 YENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYK 435

Query: 466 NPNIAQVAADQLFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWF 525
           N  +A+   + L K++P+  G Y+LLSNIYA AG+W+++  LR +++ K +KK PG S  
Sbjct: 436 NTEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLI 495

Query: 526 EGKKGDIHEF-FADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEKERIL 585
           E   G I++F   DD  H    +IR+   ++L ++R  GYK N     +D+ ++EKE  +
Sbjct: 496 E-IDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKESSI 555

Query: 586 MSHSEKLALAYGMLCTEAGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHF 645
             HSEKLA+AYGM+ T+ G TI I+KNLR+CEDCH V    SE+ GRE+I+RD  RFHHF
Sbjct: 556 HMHSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHF 615

Query: 646 HNGTCSCGNFW 651
            NG CSC ++W
Sbjct: 616 RNGVCSCRDYW 622

BLAST of CmaCh16G006330 vs. Swiss-Prot
Match: PP449_ARATH (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 488.4 bits (1256), Expect = 1.2e-136
Identity = 236/610 (38.69%), Positives = 368/610 (60.33%), Query Frame = 1

Query: 43  WRLISILHDCTDFSQIKQVHGQIICNGLSQCSYVLTKLIRM-LSKVDVPMDCYPRLVFGQ 102
           +  +S L  C+   ++KQ+H +++  GL Q SY +TK +   +S        Y ++VF  
Sbjct: 15  YETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDG 74

Query: 103 VNYPNPFLWTAMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLG 162
            + P+ FLW  MIRG++       S+ LY  M  +     ++TF +L KAC      +  
Sbjct: 75  FDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEET 134

Query: 163 RQVHAQTILIGGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAK 222
            Q+HAQ   +G + +D+Y  N++I+ Y   G       +FD + E D VSW  +I  Y K
Sbjct: 135 TQIHAQITKLG-YENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVK 194

Query: 223 FGDMESARGLFDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVGV 282
            G M+ A  LF ++  K+ ++WT M++GY Q    KEAL+ F +MQ+  +E D V+L   
Sbjct: 195 AGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANA 254

Query: 283 ISACAQLGAVKYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKER 342
           +SACAQLGA++   WI     ++        ++G  LIDMY+KCG  +EA ++F+ +K++
Sbjct: 255 LSACAQLGALEQGKWIHSYLNKTRIRMDS--VLGCVLIDMYAKCGEMEEALEVFKNIKKK 314

Query: 343 NVFSYSSMIVGYAMHGRAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLF 402
           +V +++++I GYA HG    A+  F EM K  I+PN +TF  VL+ACS+ G+VE+G+ +F
Sbjct: 315 SVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIF 374

Query: 403 AKMEKYFNVTPSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGN 462
             ME+ +N+ P+ +HY C+VDLLGR G L+EA   I+ MP++P+  +WGALL ACRIH N
Sbjct: 375 YSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKN 434

Query: 463 PNIAQVAADQLFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFE 522
             + +   + L  ++P   G Y+  +NI+A   +W++ ++ R++++ +G+ K PGCS   
Sbjct: 435 IELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI- 494

Query: 523 GKKGDIHEFFADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLT-DDEKERILM 582
             +G  HEF A D +H    +I+   R +  +L  +GY P L  +  DL  DDE+E I+ 
Sbjct: 495 SLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVH 554

Query: 583 SHSEKLALAYGMLCTEAGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFH 642
            HSEKLA+ YG++ T+ G  I IMKNLR+C+DCH V    S+I  R+I++RD  RFHHF 
Sbjct: 555 QHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFR 614

Query: 643 NGTCSCGNFW 651
           +G CSCG++W
Sbjct: 615 DGKCSCGDYW 620

BLAST of CmaCh16G006330 vs. Swiss-Prot
Match: PP354_ARATH (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana GN=ELI1 PE=3 SV=1)

HSP 1 Score: 473.8 bits (1218), Expect = 3.1e-132
Identity = 235/555 (42.34%), Positives = 352/555 (63.42%), Query Frame = 1

Query: 98  VFGQVNYPNPFLWTAMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLN 157
           +F Q   P+ FL+TA I   ++ G   ++  LY  +  + ++P  FTFS+L K+C     
Sbjct: 86  LFHQTIDPDLFLFTAAINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSCSTKS- 145

Query: 158 LDLGRQVHAQTILIGGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIV 217
              G+ +H   +L  G   D YV   ++D+Y K G +   +KVFD M ER +VS T +I 
Sbjct: 146 ---GKLIHTH-VLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMIT 205

Query: 218 AYAKFGDMESARGLFDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGI-ETDEV 277
            YAK G++E+AR LFD +  +D+V+W  M+ GYAQ+  P +AL  FQK+   G  + DE+
Sbjct: 206 CYAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEI 265

Query: 278 TLVGVISACAQLGAVKYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFE 337
           T+V  +SAC+Q+GA++   WI    + S      +V V + LIDMYSKCGS +EA  +F 
Sbjct: 266 TVVAALSACSQIGALETGRWIHVFVKSSRIRL--NVKVCTGLIDMYSKCGSLEEAVLVFN 325

Query: 338 GMKERNVFSYSSMIVGYAMHGRAHSALQLFHEMLK-TEIRPNKVTFIGVLSACSHAGMVE 397
               +++ ++++MI GYAMHG +  AL+LF+EM   T ++P  +TFIG L AC+HAG+V 
Sbjct: 326 DTPRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVN 385

Query: 398 QGRQLFAKMEKYFNVTPSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGA 457
           +G ++F  M + + + P  +HY C+V LLGR G L+ A E I+ M M+    +W ++LG+
Sbjct: 386 EGIRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGS 445

Query: 458 CRIHGNPNIAQVAADQLFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNP 517
           C++HG+  + +  A+ L  L     G Y+LLSNIYAS G +E V+K+R +++ KG+ K P
Sbjct: 446 CKLHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEP 505

Query: 518 GCSWFEGKKGDIHEFFADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEK 577
           G S  E  +  +HEF A D  H +S EI   LR++  R+++HGY PN ++V  DL + EK
Sbjct: 506 GISTIE-IENKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYVPNTNTVLQDLEETEK 565

Query: 578 ERILMSHSEKLALAYGMLCTEAGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMR 637
           E+ L  HSE+LA+AYG++ T+ G  + I KNLR+C DCH V    S+ITGR+I++RD  R
Sbjct: 566 EQSLQVHSERLAIAYGLISTKPGSPLKIFKNLRVCSDCHTVTKLISKITGRKIVMRDRNR 625

Query: 638 FHHFHNGTCSCGNFW 651
           FHHF +G+CSCG+FW
Sbjct: 626 FHHFTDGSCSCGDFW 632

BLAST of CmaCh16G006330 vs. TrEMBL
Match: A0A0A0KUQ8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G045030 PE=4 SV=1)

HSP 1 Score: 1191.8 bits (3082), Expect = 0.0e+00
Identity = 565/650 (86.92%), Positives = 607/650 (93.38%), Query Frame = 1

Query: 1   MIGLSRNFSTVCKLSYLQPRQTQATPNFIPFSQLQQQRKLLEWRLISILHDCTDFSQIKQ 60
           MIG SRN STV KLS+LQ  QT+ +PNFIPF QLQ QRKLLEWRL+SILHDCT FSQIKQ
Sbjct: 1   MIGFSRNLSTVSKLSHLQNLQTRGSPNFIPFPQLQHQRKLLEWRLMSILHDCTLFSQIKQ 60

Query: 61  VHGQIICNGLSQCSYVLTKLIRMLSKVDVPMDCYPRLVFGQVNYPNPFLWTAMIRGYALQ 120
           VH  II NGLSQCSYVLTKLIRML+KVDVPM  YP LVFGQVNYPNPFLWTAMIRGYALQ
Sbjct: 61  VHAHIIRNGLSQCSYVLTKLIRMLTKVDVPMGSYPLLVFGQVNYPNPFLWTAMIRGYALQ 120

Query: 121 GPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLGRQVHAQTILIGGFASDLYV 180
           G  +ES N YT MR +GV PVSFTFSALFKACGA+LN+DLG+QVHAQTILIGGFASDLYV
Sbjct: 121 GLLSESTNFYTRMRRDGVGPVSFTFSALFKACGAALNMDLGKQVHAQTILIGGFASDLYV 180

Query: 181 GNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAKFGDMESARGLFDELPLKDM 240
           GN+MID+YVKCG LGC RKVFDEMSERDVVSWTELIVAYAK+GDMESA GLFD+LPLKDM
Sbjct: 181 GNSMIDLYVKCGFLGCARKVFDEMSERDVVSWTELIVAYAKYGDMESASGLFDDLPLKDM 240

Query: 241 VAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVGVISACAQLGAVKYANWIRDI 300
           VAWTAMVTGYAQN RPKEALEYFQKMQDVG+ETDEVTL GVISACAQLGAVK+ANWIRDI
Sbjct: 241 VAWTAMVTGYAQNGRPKEALEYFQKMQDVGMETDEVTLAGVISACAQLGAVKHANWIRDI 300

Query: 301 AERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKERNVFSYSSMIVGYAMHGRAH 360
           AERSGFGP  +V+VGSALIDMYSKCGSPDEAYK+FE MKERNVFSYSSMI+GYAMHGRAH
Sbjct: 301 AERSGFGPSGNVVVGSALIDMYSKCGSPDEAYKVFEVMKERNVFSYSSMILGYAMHGRAH 360

Query: 361 SALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLFAKMEKYFNVTPSPDHYACM 420
           SALQLFH+MLKTEIRPNKVTFIG+LSACSHAG+VEQGRQLFAKMEK+F V PSPDHYACM
Sbjct: 361 SALQLFHDMLKTEIRPNKVTFIGILSACSHAGLVEQGRQLFAKMEKFFGVAPSPDHYACM 420

Query: 421 VDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGNPNIAQVAADQLFKLEPDGI 480
           VDLLGR GCLEEAL+L++TMPMEP+GGVWGALLGACRIHGNP+IAQ+AA++LFKLEP+GI
Sbjct: 421 VDLLGRAGCLEEALDLVKTMPMEPNGGVWGALLGACRIHGNPDIAQIAANELFKLEPNGI 480

Query: 481 GNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFEGKKGDIHEFFADDATHQRS 540
           GNYILLSNIYASAGRWEEVSKLRKVIR KG KKNPGCSWFEGK G+IH+FFA D TH RS
Sbjct: 481 GNYILLSNIYASAGRWEEVSKLRKVIREKGFKKNPGCSWFEGKNGEIHDFFAGDTTHPRS 540

Query: 541 SEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEKERILMSHSEKLALAYGMLCTEAGET 600
           SEIRQAL+QL+ RLR+HGYKPNL S PYDLTDDEKERILMSHSEKLALAYG+LCTEAG+T
Sbjct: 541 SEIRQALKQLIERLRSHGYKPNLGSAPYDLTDDEKERILMSHSEKLALAYGLLCTEAGDT 600

Query: 601 ITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFHNGTCSCGNFW 651
           I IMKN+RICEDCHNVMCAASEITGREII+RDNMRFHHFHNGTCSCGNFW
Sbjct: 601 IKIMKNIRICEDCHNVMCAASEITGREIIVRDNMRFHHFHNGTCSCGNFW 650

BLAST of CmaCh16G006330 vs. TrEMBL
Match: F6GWS8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0023g02490 PE=4 SV=1)

HSP 1 Score: 993.4 bits (2567), Expect = 1.3e-286
Identity = 466/638 (73.04%), Positives = 552/638 (86.52%), Query Frame = 1

Query: 13  KLSYLQPRQTQATPNFIPFSQLQQQRKLLEWRLISILHDCTDFSQIKQVHGQIICNGLSQ 72
           K SY Q +    T +FIPFS ++Q++K+LE RL+S+LH CT  +Q+KQVH  I   GL Q
Sbjct: 15  KTSYCQLQ----TQSFIPFS-VRQEQKILESRLVSVLHGCTHINQVKQVHAHIFRKGLEQ 74

Query: 73  CSYVLTKLIRMLSKVDVPMDCYPRLVFGQVNYPNPFLWTAMIRGYALQGPFTESINLYTN 132
           C +VL KL+R L+K+DVPMD YPRLVF QV YPNPFLWTA+IRGYALQGPF ES+ LY +
Sbjct: 75  CCFVLAKLLRTLTKLDVPMDPYPRLVFQQVEYPNPFLWTALIRGYALQGPFMESVLLYNS 134

Query: 133 MRGNGVSPVSFTFSALFKACGASLNLDLGRQVHAQTILIGGFASDLYVGNTMIDMYVKCG 192
           MR  G+ PVSFTF+AL KAC A+L+++LGRQVH QTILIGGF SDLYVGNT+IDMYVKCG
Sbjct: 135 MRRQGIGPVSFTFTALLKACSAALDVNLGRQVHTQTILIGGFGSDLYVGNTLIDMYVKCG 194

Query: 193 VLGCGRKVFDEMSERDVVSWTELIVAYAKFGDMESARGLFDELPLKDMVAWTAMVTGYAQ 252
            LGCG +VFDEM +RDV+SWT LIVAYAK G+ME+A  LFD LP+KDMVAWTAMVTGYAQ
Sbjct: 195 CLGCGHRVFDEMLDRDVISWTSLIVAYAKVGNMEAASELFDGLPMKDMVAWTAMVTGYAQ 254

Query: 253 NARPKEALEYFQKMQDVGIETDEVTLVGVISACAQLGAVKYANWIRDIAERSGFGPYEHV 312
           NARP+EALE F++MQ  G++TDEVTLVGVISACAQLGA KYANW+RD+AE+SGFGP  +V
Sbjct: 255 NARPREALEVFERMQAAGVKTDEVTLVGVISACAQLGAAKYANWVRDVAEQSGFGPTSNV 314

Query: 313 MVGSALIDMYSKCGSPDEAYKIFEGMKERNVFSYSSMIVGYAMHGRAHSALQLFHEMLKT 372
           +VGSALIDMY+KCGS ++AYK+FE M+ERNV+SYSSMIVG+AMHG A +A++LF EMLKT
Sbjct: 315 VVGSALIDMYAKCGSVEDAYKVFERMEERNVYSYSSMIVGFAMHGLAGAAMELFDEMLKT 374

Query: 373 EIRPNKVTFIGVLSACSHAGMVEQGRQLFAKMEKYFNVTPSPDHYACMVDLLGRGGCLEE 432
           EI+PN+VTFIGVL+ACSHAGMVEQG+QLFA ME+   V PS DHYACMVDLLGR G LEE
Sbjct: 375 EIKPNRVTFIGVLTACSHAGMVEQGQQLFAMMEECHGVAPSEDHYACMVDLLGRAGRLEE 434

Query: 433 ALELIETMPMEPHGGVWGALLGACRIHGNPNIAQVAADQLFKLEPDGIGNYILLSNIYAS 492
           AL L++ MPM PHGGVWGALLGACRIHGNP++AQ+AA  LF+LEP+GIGNYILLSNIYAS
Sbjct: 435 ALNLVKMMPMNPHGGVWGALLGACRIHGNPDMAQIAASHLFELEPNGIGNYILLSNIYAS 494

Query: 493 AGRWEEVSKLRKVIRAKGLKKNPGCSWFEGKKGDIHEFFADDATHQRSSEIRQALRQLLV 552
           AGRW++VSK+RK++RAKGLKKNPGCSW EGKKG IHEFFA D +H +S EI+QAL  LL 
Sbjct: 495 AGRWDDVSKVRKLMRAKGLKKNPGCSWVEGKKGIIHEFFAGDMSHPKSREIKQALEDLLD 554

Query: 553 RLRAHGYKPNLSSVPYDLTDDEKERILMSHSEKLALAYGMLCTEAGETITIMKNLRICED 612
           RL+  GY+PNLSSV YD++D+EK+R+LMSHSEKLALA+G+L T AG TI I+KNLRICED
Sbjct: 555 RLKYLGYQPNLSSVAYDISDEEKKRLLMSHSEKLALAFGLLTTNAGCTIRIVKNLRICED 614

Query: 613 CHNVMCAASEITGREIIIRDNMRFHHFHNGTCSCGNFW 651
           CH+VMC AS+ITGREI++RDNMRFHHF +G CSCGNFW
Sbjct: 615 CHSVMCGASQITGREIVVRDNMRFHHFRDGRCSCGNFW 647

BLAST of CmaCh16G006330 vs. TrEMBL
Match: M5VV81_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002597mg PE=4 SV=1)

HSP 1 Score: 992.6 bits (2565), Expect = 2.2e-286
Identity = 469/654 (71.71%), Positives = 553/654 (84.56%), Query Frame = 1

Query: 1   MIGLSRNFSTVC--KL--SYLQPRQTQATPNFIPFSQLQQQRKLLEWRLISILHDCTDFS 60
           M  LSR FSTV   KL    L   Q Q T  FIPFS+ QQ+RKLLE +LIS L  C + S
Sbjct: 1   MQNLSRRFSTVPIHKLLPQQLHQHQPQPTRFFIPFSEFQQKRKLLEHKLISDLDGCDNLS 60

Query: 61  QIKQVHGQIICNGLSQCSYVLTKLIRMLSKVDVPMDCYPRLVFGQVNYPNPFLWTAMIRG 120
           Q+K+VH  ++ +GLSQC YVLTKL+R L+K+ VP+D YPRLVF QV YPNPFLWTAMIRG
Sbjct: 61  QVKEVHAHLLRHGLSQCCYVLTKLVRTLTKLGVPVDAYPRLVFVQVKYPNPFLWTAMIRG 120

Query: 121 YALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLGRQVHAQTILIGGFAS 180
           Y +QGP +E++N YT MR  G  PVSFTFSALFKACG  L+++LGRQ+HAQTIL+GGFA+
Sbjct: 121 YTVQGPISEALNFYTCMRSAGTGPVSFTFSALFKACGDVLDVNLGRQIHAQTILVGGFAA 180

Query: 181 DLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAKFGDMESARGLFDELP 240
           DLYVGNTMIDMYVKCG L CGRKVFDEM +RDVVSWTELIVAY K GDM SAR LF+ LP
Sbjct: 181 DLYVGNTMIDMYVKCGFLDCGRKVFDEMPDRDVVSWTELIVAYTKIGDMGSARELFEGLP 240

Query: 241 LKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVGVISACAQLGAVKYANW 300
           +KDMVAWTAMVTGYAQNARP++AL+ F++MQ  G+ TDE+TLVG+ISACAQLGA KYANW
Sbjct: 241 VKDMVAWTAMVTGYAQNARPRDALDCFERMQGAGVGTDEITLVGLISACAQLGASKYANW 300

Query: 301 IRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKERNVFSYSSMIVGYAMH 360
           +RDIAE+SGFGP E+V+VGSALIDMYSKCGS DEAYK+F+GMKERNVFSYSSMI+G+AMH
Sbjct: 301 VRDIAEKSGFGPTENVLVGSALIDMYSKCGSLDEAYKVFQGMKERNVFSYSSMILGFAMH 360

Query: 361 GRAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLFAKMEKYFNVTPSPDH 420
           GRA++A++LFHEML TEIRPN+VTFIGVL+ACSHAGMV+QGRQLFA MEKY+NV PS DH
Sbjct: 361 GRANAAIELFHEMLTTEIRPNRVTFIGVLTACSHAGMVDQGRQLFATMEKYYNVVPSADH 420

Query: 421 YACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGNPNIAQVAADQLFKLE 480
           Y CMVDLLGR G LEEALEL+ETMP+  HGGVWGALLGAC IHGNP+IAQ+AA+ LF+LE
Sbjct: 421 YTCMVDLLGRAGRLEEALELVETMPIAAHGGVWGALLGACHIHGNPDIAQIAANHLFELE 480

Query: 481 PDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFEGKKGDIHEFFADDAT 540
           PD IGN+++LSNIYASAGRW +VS++RK+++ KGLKKNP  SW E KKG IHEF A +  
Sbjct: 481 PDSIGNHVMLSNIYASAGRWADVSRVRKMMKEKGLKKNPAYSWVETKKGVIHEFCAGETN 540

Query: 541 HQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEKERILMSHSEKLALAYGMLCTE 600
           H   +EI++AL  LL RL+AHGY+PNL+S  YDL  +E++RILMSHSEKLALAY ++ T+
Sbjct: 541 HPEYAEIKKALDDLLNRLQAHGYQPNLNSAAYDLGIEERKRILMSHSEKLALAYALVSTD 600

Query: 601 AGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFHNGTCSCGNFW 651
           +G TI IMKN+RICEDCH  MC AS++ GREI++RDNMRFHHF NG CSCGNFW
Sbjct: 601 SGSTIKIMKNIRICEDCHVFMCGASQVAGREIVVRDNMRFHHFSNGKCSCGNFW 654

BLAST of CmaCh16G006330 vs. TrEMBL
Match: V4SE67_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025108mg PE=4 SV=1)

HSP 1 Score: 951.8 bits (2459), Expect = 4.3e-274
Identity = 450/653 (68.91%), Positives = 538/653 (82.39%), Query Frame = 1

Query: 1   MIGLSRNFSTVCKLSYLQPRQTQA---TPNFIPFSQLQQQRKLLEWRLISILHDCTDFSQ 60
           M+ L+R FST   L Y+  + TQ    +  F   S+LQ+Q++LLE +LIS L  C D  Q
Sbjct: 1   MVSLTRTFSTASSLGYIPKQFTQQLKPSETFELLSELQRQKRLLETQLISTLDGCNDLLQ 60

Query: 61  IKQVHGQIICNGLSQCSYVLTKLIRMLSKVDVPMDCYPRLVFGQVNYPNPFLWTAMIRGY 120
           IKQVH  I+  GL Q  YVLTK+IR L+K++VPMD YPRLVF QV Y NPFLWTA+IRGY
Sbjct: 61  IKQVHAHILRRGLDQSCYVLTKVIRTLTKINVPMDSYPRLVFEQVKYRNPFLWTALIRGY 120

Query: 121 ALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLGRQVHAQTILIGGFASD 180
            LQG   +SI+LY +MR  G+ PVSFT SALFKAC   L++ LG+Q+HAQTIL+GGF SD
Sbjct: 121 ILQGHLKDSISLYCSMRREGIGPVSFTLSALFKACTEVLDVSLGQQIHAQTILLGGFTSD 180

Query: 181 LYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAKFGDMESARGLFDELPL 240
           LYV NTMI MYVKCG LGC RKVFDEM ERDVVSWTELIVAYA  GDMESA GLF+ELPL
Sbjct: 181 LYVANTMIGMYVKCGFLGCSRKVFDEMPERDVVSWTELIVAYANNGDMESAGGLFNELPL 240

Query: 241 KDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVGVISACAQLGAVKYANWI 300
           KD VAWTAMVTGY QNA+P+EA+EYF++MQ  G+ETD VTLVGVISACAQLG VKYANW+
Sbjct: 241 KDKVAWTAMVTGYVQNAKPREAIEYFERMQYAGVETDYVTLVGVISACAQLGVVKYANWV 300

Query: 301 RDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKERNVFSYSSMIVGYAMHG 360
            +IAE SGFGP  +V+VGSALIDMYSKCGS D+AY++F  MK+RNVFSYSSMI+G+AMHG
Sbjct: 301 CEIAEGSGFGPINNVVVGSALIDMYSKCGSIDDAYRVFVDMKQRNVFSYSSMILGFAMHG 360

Query: 361 RAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLFAKMEKYFNVTPSPDHY 420
           RAH+A+QLF EM+KTE +PN VTFIGVL+ACSH G+VEQGR+LFA MEK + V+PS DHY
Sbjct: 361 RAHAAIQLFGEMVKTETKPNGVTFIGVLTACSHVGLVEQGRKLFASMEKCYGVSPSTDHY 420

Query: 421 ACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGNPNIAQVAADQLFKLEP 480
           ACMVDLLGR GCLEEAL+++E MP+EP+GGVWGALLGAC+IH NP IAQ+AA+ LF+LEP
Sbjct: 421 ACMVDLLGRAGCLEEALKMVEKMPVEPNGGVWGALLGACQIHRNPEIAQIAANHLFQLEP 480

Query: 481 DGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFEGKKGDIHEFFADDATH 540
           D IGNYI+LSNIYASAG W++VS++R++++  GLKKNPG SW EG +G IHEF A D TH
Sbjct: 481 DKIGNYIILSNIYASAGMWDDVSRVRRLLKMTGLKKNPGYSWLEGDRGVIHEFRAGDLTH 540

Query: 541 QRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEKERILMSHSEKLALAYGMLCTEA 600
             S+EI+QAL  LL RL+A GY+PNL SV YD++D+EK+RILM+HSEKLALA+G+L T  
Sbjct: 541 PNSTEIQQALGDLLDRLQADGYQPNLRSVLYDVSDEEKKRILMTHSEKLALAFGLLTTSP 600

Query: 601 GETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFHNGTCSCGNFW 651
           G T+ IMKNLRICEDCH  MC AS++ GREI++RDNMRFHHF +G CSCGN+W
Sbjct: 601 GATVRIMKNLRICEDCHLFMCGASQVIGREIVVRDNMRFHHFQDGKCSCGNYW 653

BLAST of CmaCh16G006330 vs. TrEMBL
Match: A0A061DFG9_THECC (Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_000299 PE=4 SV=1)

HSP 1 Score: 947.2 bits (2447), Expect = 1.0e-272
Identity = 449/659 (68.13%), Positives = 537/659 (81.49%), Query Frame = 1

Query: 1   MIGLSRNFST---------VCKLSYLQPRQTQATPNFIPFSQLQQQRKLLEWRLISILHD 60
           M+ LSR FST          C+  +LQ  Q+Q T  FIPFSQLQ QR LLE +LIS L+ 
Sbjct: 1   MVALSRKFSTSSLKFIPKQFCQY-HLQQIQSQTTQPFIPFSQLQNQRNLLESQLISTLNG 60

Query: 61  CTDFSQIKQVHGQIICNGLSQCSYVLTKLIRMLSKVDVPMDCYPRLVFGQVNYPNPFLWT 120
           CT  +Q KQ H  II  GL QC Y+L KL+R L+K+ +PMD Y +LVF QV YPNPFLWT
Sbjct: 61  CTSLTQFKQTHAYIIRKGLDQCCYILAKLVRNLTKMGIPMDNYAKLVFDQVEYPNPFLWT 120

Query: 121 AMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLGRQVHAQTILI 180
           A+IRGYALQG   ES+++Y+ MR  G  PVSFTFSALFKAC   L+++LGRQ+HAQTILI
Sbjct: 121 ALIRGYALQGHVKESVSVYSCMREEGSLPVSFTFSALFKACCTVLDVNLGRQIHAQTILI 180

Query: 181 GGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAKFGDMESARGL 240
           GGF SDLYV N++I+MYVK G LGC RKVFDE+ ERD++SWTELIVAYAK GDMESA  L
Sbjct: 181 GGFGSDLYVNNSLIEMYVKLGFLGCARKVFDELPERDLISWTELIVAYAKLGDMESAGEL 240

Query: 241 FDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVGVISACAQLGAV 300
           FDELP+KDMVAWT MVTGYAQNA+P+EALE+F++MQ+ G+ETDEVTLVGVISACAQLG  
Sbjct: 241 FDELPIKDMVAWTTMVTGYAQNAKPREALEFFERMQNEGVETDEVTLVGVISACAQLGTA 300

Query: 301 KYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKERNVFSYSSMIV 360
           KYANW+R IAE SGF P   V+VGSALIDMYSKCGS ++AYK+FE M+ERNVFSYSSMI 
Sbjct: 301 KYANWVRGIAENSGFDPTRCVVVGSALIDMYSKCGSVEDAYKVFEAMEERNVFSYSSMIA 360

Query: 361 GYAMHGRAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLFAKMEKYFNVT 420
           G+AMHG A++AL+LF EM+KT I+PN+VTFIGVL+ACSH+GMVEQGRQ+FA ME+ F V+
Sbjct: 361 GFAMHGCAYAALELFREMVKTGIKPNRVTFIGVLTACSHSGMVEQGRQIFASMEEEFGVS 420

Query: 421 PSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGNPNIAQVAADQ 480
           P+ DHYAC+VDLLGR GCLEEAL L ETMP+EP+GGVWGALLGACR +GNP++AQ+ A+ 
Sbjct: 421 PAVDHYACIVDLLGRAGCLEEALNLAETMPVEPNGGVWGALLGACRTYGNPDMAQIGANH 480

Query: 481 LFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFEGKKGDIHEFF 540
           LF+LEP+ IGNYILLSNIYASAGRW +VS +RK++R KGL+KNP CSW E KKG IHEFF
Sbjct: 481 LFELEPNAIGNYILLSNIYASAGRWNDVSMVRKLMREKGLRKNPACSWLEAKKGVIHEFF 540

Query: 541 ADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEKERILMSHSEKLALAYG 600
           A D T+ RS +++Q L  LL RL+  GY+PN+SSV YD+ D++K R+LM+HSEKLALA+G
Sbjct: 541 AGDITNPRSGQMKQVLEDLLNRLKGLGYQPNMSSVAYDVNDEDKRRLLMAHSEKLALAFG 600

Query: 601 MLCTEAGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFHNGTCSCGNFW 651
           +L   A   I IMKNLRICEDCH+ MC  S+IT R II+RDN+RFHHFH G CSCGNFW
Sbjct: 601 LLTISADCPIRIMKNLRICEDCHSFMCGVSQITERVIIVRDNLRFHHFHAGKCSCGNFW 658
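
The percentages in each HSP header are the identity and positive counts divided by the alignment length. The following is a minimal sketch, not part of the original record, that parses one statistics line and recomputes both values; the function name `parse_hsp_stats` and the dictionary keys are hypothetical, and the regular expression assumes the line layout shown on this page.

```python
import re

# Matches the statistics line of an HSP as printed on this page.
# "Posi?tives" accepts both "Postives" and "Positives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed and reported percentages for one HSP line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        # Recomputed from the raw counts, rounded as on the page.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the header, for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


# Statistics line of the Theobroma cacao (TrEMBL) hit above.
stats = parse_hsp_stats(
    "Identity = 449/659 (68.13%), Postives = 537/659 (81.49%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the TrEMBL hit above, the recomputed values agree with the reported 68.13% and 81.49%.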

BLAST of CmaCh16G006330 vs. TAIR10
Match: AT5G44230.1 (AT5G44230.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 802.0 bits (2070), Expect = 2.8e-232
Identity = 382/621 (61.51%), Positives = 475/621 (76.49%), Query Frame = 1

Query: 31  FSQLQQQRKLLEWRLISILHDCTDFSQIKQVHGQIICNGLSQCSYVLTKLIRMLSKVDVP 90
           FS++  Q++LL   LIS L DC + +QIKQ+HG ++  GL Q  Y+LTKLIR L+K+ VP
Sbjct: 38  FSEISNQKELLVSSLISKLDDCINLNQIKQIHGHVLRKGLDQSCYILTKLIRTLTKLGVP 97

Query: 91  MDCYPRLVFGQVNYPNPFLWTAMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFK 150
           MD Y R V   V + NPFLWTA+IRGYA++G F E+I +Y  MR   ++PVSFTFSAL K
Sbjct: 98  MDPYARRVIEPVQFRNPFLWTAVIRGYAIEGKFDEAIAMYGCMRKEEITPVSFTFSALLK 157

Query: 151 ACGASLNLDLGRQVHAQTILIGGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVV 210
           ACG   +L+LGRQ HAQT  + GF   +YVGNTMIDMYVKC  + C RKVFDEM ERDV+
Sbjct: 158 ACGTMKDLNLGRQFHAQTFRLRGFCF-VYVGNTMIDMYVKCESIDCARKVFDEMPERDVI 217

Query: 211 SWTELIVAYAKFGDMESARGLFDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVG 270
           SWTELI AYA+ G+ME A  LF+ LP KDMVAWTAMVTG+AQNA+P+EALEYF +M+  G
Sbjct: 218 SWTELIAAYARVGNMECAAELFESLPTKDMVAWTAMVTGFAQNAKPQEALEYFDRMEKSG 277

Query: 271 IETDEVTLVGVISACAQLGAVKYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDE 330
           I  DEVT+ G ISACAQLGA KYA+    IA++SG+ P +HV++GSALIDMYSKCG+ +E
Sbjct: 278 IRADEVTVAGYISACAQLGASKYADRAVQIAQKSGYSPSDHVVIGSALIDMYSKCGNVEE 337

Query: 331 AYKIFEGMKERNVFSYSSMIVGYAMHGRAHSALQLFHEML-KTEIRPNKVTFIGVLSACS 390
           A  +F  M  +NVF+YSSMI+G A HGRA  AL LFH M+ +TEI+PN VTF+G L ACS
Sbjct: 338 AVNVFMSMNNKNVFTYSSMILGLATHGRAQEALHLFHYMVTQTEIKPNTVTFVGALMACS 397

Query: 391 HAGMVEQGRQLFAKMEKYFNVTPSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVW 450
           H+G+V+QGRQ+F  M + F V P+ DHY CMVDLLGR G L+EALELI+TM +EPHGGVW
Sbjct: 398 HSGLVDQGRQVFDSMYQTFGVQPTRDHYTCMVDLLGRTGRLQEALELIKTMSVEPHGGVW 457

Query: 451 GALLGACRIHGNPNIAQVAADQLFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAK 510
           GALLGACRIH NP IA++AA+ LF+LEPD IGNYILLSN+YASAG W  V ++RK+I+ K
Sbjct: 458 GALLGACRIHNNPEIAEIAAEHLFELEPDIIGNYILLSNVYASAGDWGGVLRVRKLIKEK 517

Query: 511 GLKKNPGCSWFEGKKGDIHEFFADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYD 570
           GLKK P  SW   K G +H+FF  +  H  S++I+  L +L+ RL   GY+P+LSSVPYD
Sbjct: 518 GLKKTPAVSWVVDKNGQMHKFFPGNLNHPMSNKIQDKLEELVERLTVLGYQPDLSSVPYD 577

Query: 571 LTDDEKERILMSHSEKLALAYGMLCTEAGETITIMKNLRICEDCHNVMCAASEITGREII 630
           ++D+ K  IL+ H+EKLALA+ +L T    TITIMKNLR+C DCH  M  ASE+TG+ II
Sbjct: 578 VSDNAKRLILIQHTEKLALAFSLLTTNRDSTITIMKNLRMCLDCHKFMRLASEVTGKVII 637

Query: 631 IRDNMRFHHFHNGTCSCGNFW 651
           +RDNMRFHHF +G CSCG+FW
Sbjct: 638 MRDNMRFHHFRSGDCSCGDFW 657

BLAST of CmaCh16G006330 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 502.7 bits (1293), Expect = 3.5e-142
Identity = 249/609 (40.89%), Positives = 372/609 (61.08%), Query Frame = 1

Query: 45  LISILHDCTDFSQIKQVHGQIICNGLSQCSYVLTKLIRM-LSKVDVPMDCYPRLVFGQVN 104
           LI    + +  S  + +HG  + + +    +V   LI    S  D+   C    VF  + 
Sbjct: 137 LIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACK---VFTTIK 196

Query: 105 YPNPFLWTAMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLGRQ 164
             +   W +MI G+  +G   +++ L+  M    V     T   +  AC    NL+ GRQ
Sbjct: 197 EKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQ 256

Query: 165 VHAQTILIGGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAKFG 224
           V    I       +L + N M+DMY KCG +   +++FD M E+D V+WT ++  YA   
Sbjct: 257 V-CSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISE 316

Query: 225 DMESARGLFDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQ-DVGIETDEVTLVGVI 284
           D E+AR + + +P KD+VAW A+++ Y QN +P EAL  F ++Q    ++ +++TLV  +
Sbjct: 317 DYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTL 376

Query: 285 SACAQLGAVKYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKERN 344
           SACAQ+GA++   WI    ++ G     HV   SALI MYSKCG  +++ ++F  +++R+
Sbjct: 377 SACAQVGALELGRWIHSYIKKHGIRMNFHVT--SALIHMYSKCGDLEKSREVFNSVEKRD 436

Query: 345 VFSYSSMIVGYAMHGRAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLFA 404
           VF +S+MI G AMHG  + A+ +F++M +  ++PN VTF  V  ACSH G+V++   LF 
Sbjct: 437 VFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFH 496

Query: 405 KMEKYFNVTPSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGNP 464
           +ME  + + P   HYAC+VD+LGR G LE+A++ IE MP+ P   VWGALLGAC+IH N 
Sbjct: 497 QMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANL 556

Query: 465 NIAQVAADQLFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFEG 524
           N+A++A  +L +LEP   G ++LLSNIYA  G+WE VS+LRK +R  GLKK PGCS  E 
Sbjct: 557 NLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIE- 616

Query: 525 KKGDIHEFFADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDE-KERILMS 584
             G IHEF + D  H  S ++   L +++ +L+++GY+P +S V   + ++E KE+ L  
Sbjct: 617 IDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNL 676

Query: 585 HSEKLALAYGMLCTEAGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFHN 644
           HSEKLA+ YG++ TEA + I ++KNLR+C DCH+V    S++  REII+RD  RFHHF N
Sbjct: 677 HSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRN 736

Query: 645 GTCSCGNFW 651
           G CSC +FW
Sbjct: 737 GQCSCNDFW 738

BLAST of CmaCh16G006330 vs. TAIR10
Match: AT5G06540.1 (AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 490.3 bits (1261), Expect = 1.8e-138
Identity = 237/611 (38.79%), Positives = 378/611 (61.87%), Query Frame = 1

Query: 46  ISILHDCTDFSQIKQVHGQIICNGLSQCSYVLTKLIRML---SKVDVPMDC--YPRLVFG 105
           +++L  C+ FS +K +HG ++   L    +V ++L+ +    S  + P +   Y   +F 
Sbjct: 16  LALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFS 75

Query: 106 QVNYPNPFLWTAMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDL 165
           Q+  PN F++  +IR ++     +++   YT M  + + P + TF  L KA      + +
Sbjct: 76  QIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLV 135

Query: 166 GRQVHAQTILIGGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYA 225
           G Q H+Q +  G F +D+YV N+++ MY  CG +    ++F +M  RDVVSWT ++  Y 
Sbjct: 136 GEQTHSQIVRFG-FQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYC 195

Query: 226 KFGDMESARGLFDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVG 285
           K G +E+AR +FDE+P +++  W+ M+ GYA+N   ++A++ F+ M+  G+  +E  +V 
Sbjct: 196 KCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVS 255

Query: 286 VISACAQLGAVKYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKE 345
           VIS+CA LGA+++     +   +S      ++++G+AL+DM+ +CG  ++A  +FEG+ E
Sbjct: 256 VISSCAHLGALEFGERAYEYVVKSHM--TVNLILGTALVDMFWRCGDIEKAIHVFEGLPE 315

Query: 346 RNVFSYSSMIVGYAMHGRAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQL 405
            +  S+SS+I G A+HG AH A+  F +M+     P  VTF  VLSACSH G+VE+G ++
Sbjct: 316 TDSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEI 375

Query: 406 FAKMEKYFNVTPSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHG 465
           +  M+K   + P  +HY C+VD+LGR G L EA   I  M ++P+  + GALLGAC+I+ 
Sbjct: 376 YENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYK 435

Query: 466 NPNIAQVAADQLFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWF 525
           N  +A+   + L K++P+  G Y+LLSNIYA AG+W+++  LR +++ K +KK PG S  
Sbjct: 436 NTEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLI 495

Query: 526 EGKKGDIHEF-FADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEKERIL 585
           E   G I++F   DD  H    +IR+   ++L ++R  GYK N     +D+ ++EKE  +
Sbjct: 496 E-IDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKESSI 555

Query: 586 MSHSEKLALAYGMLCTEAGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHF 645
             HSEKLA+AYGM+ T+ G TI I+KNLR+CEDCH V    SE+ GRE+I+RD  RFHHF
Sbjct: 556 HMHSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHF 615

Query: 646 HNGTCSCGNFW 651
            NG CSC ++W
Sbjct: 616 RNGVCSCRDYW 622

BLAST of CmaCh16G006330 vs. TAIR10
Match: AT5G66520.1 (AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 488.4 bits (1256), Expect = 6.8e-138
Identity = 236/610 (38.69%), Positives = 368/610 (60.33%), Query Frame = 1

Query: 43  WRLISILHDCTDFSQIKQVHGQIICNGLSQCSYVLTKLIRM-LSKVDVPMDCYPRLVFGQ 102
           +  +S L  C+   ++KQ+H +++  GL Q SY +TK +   +S        Y ++VF  
Sbjct: 15  YETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDG 74

Query: 103 VNYPNPFLWTAMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLG 162
            + P+ FLW  MIRG++       S+ LY  M  +     ++TF +L KAC      +  
Sbjct: 75  FDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEET 134

Query: 163 RQVHAQTILIGGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAK 222
            Q+HAQ   +G + +D+Y  N++I+ Y   G       +FD + E D VSW  +I  Y K
Sbjct: 135 TQIHAQITKLG-YENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVK 194

Query: 223 FGDMESARGLFDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVGV 282
            G M+ A  LF ++  K+ ++WT M++GY Q    KEAL+ F +MQ+  +E D V+L   
Sbjct: 195 AGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANA 254

Query: 283 ISACAQLGAVKYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKER 342
           +SACAQLGA++   WI     ++        ++G  LIDMY+KCG  +EA ++F+ +K++
Sbjct: 255 LSACAQLGALEQGKWIHSYLNKTRIRMDS--VLGCVLIDMYAKCGEMEEALEVFKNIKKK 314

Query: 343 NVFSYSSMIVGYAMHGRAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLF 402
           +V +++++I GYA HG    A+  F EM K  I+PN +TF  VL+ACS+ G+VE+G+ +F
Sbjct: 315 SVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIF 374

Query: 403 AKMEKYFNVTPSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGN 462
             ME+ +N+ P+ +HY C+VDLLGR G L+EA   I+ MP++P+  +WGALL ACRIH N
Sbjct: 375 YSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKN 434

Query: 463 PNIAQVAADQLFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFE 522
             + +   + L  ++P   G Y+  +NI+A   +W++ ++ R++++ +G+ K PGCS   
Sbjct: 435 IELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI- 494

Query: 523 GKKGDIHEFFADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLT-DDEKERILM 582
             +G  HEF A D +H    +I+   R +  +L  +GY P L  +  DL  DDE+E I+ 
Sbjct: 495 SLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVH 554

Query: 583 SHSEKLALAYGMLCTEAGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFH 642
            HSEKLA+ YG++ T+ G  I IMKNLR+C+DCH V    S+I  R+I++RD  RFHHF 
Sbjct: 555 QHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFR 614

Query: 643 NGTCSCGNFW 651
           +G CSCG++W
Sbjct: 615 DGKCSCGDYW 620

BLAST of CmaCh16G006330 vs. TAIR10
Match: AT4G37380.1 (AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 473.8 bits (1218), Expect = 1.7e-133
Identity = 235/555 (42.34%), Positives = 352/555 (63.42%), Query Frame = 1

Query: 98  VFGQVNYPNPFLWTAMIRGYALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLN 157
           +F Q   P+ FL+TA I   ++ G   ++  LY  +  + ++P  FTFS+L K+C     
Sbjct: 86  LFHQTIDPDLFLFTAAINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSCSTKS- 145

Query: 158 LDLGRQVHAQTILIGGFASDLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIV 217
              G+ +H   +L  G   D YV   ++D+Y K G +   +KVFD M ER +VS T +I 
Sbjct: 146 ---GKLIHTH-VLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMIT 205

Query: 218 AYAKFGDMESARGLFDELPLKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGI-ETDEV 277
            YAK G++E+AR LFD +  +D+V+W  M+ GYAQ+  P +AL  FQK+   G  + DE+
Sbjct: 206 CYAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEI 265

Query: 278 TLVGVISACAQLGAVKYANWIRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFE 337
           T+V  +SAC+Q+GA++   WI    + S      +V V + LIDMYSKCGS +EA  +F 
Sbjct: 266 TVVAALSACSQIGALETGRWIHVFVKSSRIRL--NVKVCTGLIDMYSKCGSLEEAVLVFN 325

Query: 338 GMKERNVFSYSSMIVGYAMHGRAHSALQLFHEMLK-TEIRPNKVTFIGVLSACSHAGMVE 397
               +++ ++++MI GYAMHG +  AL+LF+EM   T ++P  +TFIG L AC+HAG+V 
Sbjct: 326 DTPRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVN 385

Query: 398 QGRQLFAKMEKYFNVTPSPDHYACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGA 457
           +G ++F  M + + + P  +HY C+V LLGR G L+ A E I+ M M+    +W ++LG+
Sbjct: 386 EGIRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGS 445

Query: 458 CRIHGNPNIAQVAADQLFKLEPDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNP 517
           C++HG+  + +  A+ L  L     G Y+LLSNIYAS G +E V+K+R +++ KG+ K P
Sbjct: 446 CKLHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEP 505

Query: 518 GCSWFEGKKGDIHEFFADDATHQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEK 577
           G S  E  +  +HEF A D  H +S EI   LR++  R+++HGY PN ++V  DL + EK
Sbjct: 506 GISTIE-IENKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYVPNTNTVLQDLEETEK 565

Query: 578 ERILMSHSEKLALAYGMLCTEAGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMR 637
           E+ L  HSE+LA+AYG++ T+ G  + I KNLR+C DCH V    S+ITGR+I++RD  R
Sbjct: 566 EQSLQVHSERLAIAYGLISTKPGSPLKIFKNLRVCSDCHTVTKLISKITGRKIVMRDRNR 625

Query: 638 FHHFHNGTCSCGNFW 651
           FHHF +G+CSCG+FW
Sbjct: 626 FHHFTDGSCSCGDFW 632

BLAST of CmaCh16G006330 vs. NCBI nr
Match: gi|449457516|ref|XP_004146494.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g44230 [Cucumis sativus])

HSP 1 Score: 1191.8 bits (3082), Expect = 0.0e+00
Identity = 565/650 (86.92%), Positives = 607/650 (93.38%), Query Frame = 1

Query: 1   MIGLSRNFSTVCKLSYLQPRQTQATPNFIPFSQLQQQRKLLEWRLISILHDCTDFSQIKQ 60
           MIG SRN STV KLS+LQ  QT+ +PNFIPF QLQ QRKLLEWRL+SILHDCT FSQIKQ
Sbjct: 1   MIGFSRNLSTVSKLSHLQNLQTRGSPNFIPFPQLQHQRKLLEWRLMSILHDCTLFSQIKQ 60

Query: 61  VHGQIICNGLSQCSYVLTKLIRMLSKVDVPMDCYPRLVFGQVNYPNPFLWTAMIRGYALQ 120
           VH  II NGLSQCSYVLTKLIRML+KVDVPM  YP LVFGQVNYPNPFLWTAMIRGYALQ
Sbjct: 61  VHAHIIRNGLSQCSYVLTKLIRMLTKVDVPMGSYPLLVFGQVNYPNPFLWTAMIRGYALQ 120

Query: 121 GPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLGRQVHAQTILIGGFASDLYV 180
           G  +ES N YT MR +GV PVSFTFSALFKACGA+LN+DLG+QVHAQTILIGGFASDLYV
Sbjct: 121 GLLSESTNFYTRMRRDGVGPVSFTFSALFKACGAALNMDLGKQVHAQTILIGGFASDLYV 180

Query: 181 GNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAKFGDMESARGLFDELPLKDM 240
           GN+MID+YVKCG LGC RKVFDEMSERDVVSWTELIVAYAK+GDMESA GLFD+LPLKDM
Sbjct: 181 GNSMIDLYVKCGFLGCARKVFDEMSERDVVSWTELIVAYAKYGDMESASGLFDDLPLKDM 240

Query: 241 VAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVGVISACAQLGAVKYANWIRDI 300
           VAWTAMVTGYAQN RPKEALEYFQKMQDVG+ETDEVTL GVISACAQLGAVK+ANWIRDI
Sbjct: 241 VAWTAMVTGYAQNGRPKEALEYFQKMQDVGMETDEVTLAGVISACAQLGAVKHANWIRDI 300

Query: 301 AERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKERNVFSYSSMIVGYAMHGRAH 360
           AERSGFGP  +V+VGSALIDMYSKCGSPDEAYK+FE MKERNVFSYSSMI+GYAMHGRAH
Sbjct: 301 AERSGFGPSGNVVVGSALIDMYSKCGSPDEAYKVFEVMKERNVFSYSSMILGYAMHGRAH 360

Query: 361 SALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLFAKMEKYFNVTPSPDHYACM 420
           SALQLFH+MLKTEIRPNKVTFIG+LSACSHAG+VEQGRQLFAKMEK+F V PSPDHYACM
Sbjct: 361 SALQLFHDMLKTEIRPNKVTFIGILSACSHAGLVEQGRQLFAKMEKFFGVAPSPDHYACM 420

Query: 421 VDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGNPNIAQVAADQLFKLEPDGI 480
           VDLLGR GCLEEAL+L++TMPMEP+GGVWGALLGACRIHGNP+IAQ+AA++LFKLEP+GI
Sbjct: 421 VDLLGRAGCLEEALDLVKTMPMEPNGGVWGALLGACRIHGNPDIAQIAANELFKLEPNGI 480

Query: 481 GNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFEGKKGDIHEFFADDATHQRS 540
           GNYILLSNIYASAGRWEEVSKLRKVIR KG KKNPGCSWFEGK G+IH+FFA D TH RS
Sbjct: 481 GNYILLSNIYASAGRWEEVSKLRKVIREKGFKKNPGCSWFEGKNGEIHDFFAGDTTHPRS 540

Query: 541 SEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEKERILMSHSEKLALAYGMLCTEAGET 600
           SEIRQAL+QL+ RLR+HGYKPNL S PYDLTDDEKERILMSHSEKLALAYG+LCTEAG+T
Sbjct: 541 SEIRQALKQLIERLRSHGYKPNLGSAPYDLTDDEKERILMSHSEKLALAYGLLCTEAGDT 600

Query: 601 ITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFHNGTCSCGNFW 651
           I IMKN+RICEDCHNVMCAASEITGREII+RDNMRFHHFHNGTCSCGNFW
Sbjct: 601 IKIMKNIRICEDCHNVMCAASEITGREIIVRDNMRFHHFHNGTCSCGNFW 650

BLAST of CmaCh16G006330 vs. NCBI nr
Match: gi|659102402|ref|XP_008452110.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g44230 [Cucumis melo])

HSP 1 Score: 1188.3 bits (3073), Expect = 0.0e+00
Identity = 567/650 (87.23%), Positives = 605/650 (93.08%), Query Frame = 1

Query: 1   MIGLSRNFSTVCKLSYLQPRQTQATPNFIPFSQLQQQRKLLEWRLISILHDCTDFSQIKQ 60
           MIG SRN STV KLS+LQ  QT A+PN IPF QLQ QRKLLEWRL+SILHDCT FSQIKQ
Sbjct: 14  MIGFSRNLSTVSKLSHLQNLQTPASPNIIPFPQLQHQRKLLEWRLMSILHDCTLFSQIKQ 73

Query: 61  VHGQIICNGLSQCSYVLTKLIRMLSKVDVPMDCYPRLVFGQVNYPNPFLWTAMIRGYALQ 120
           VHG II NGLSQCSYVLTKLIRML+KVDVPM  YP LVFGQVNYPNPFLWTAMIRGYALQ
Sbjct: 74  VHGHIIRNGLSQCSYVLTKLIRMLTKVDVPMGSYPLLVFGQVNYPNPFLWTAMIRGYALQ 133

Query: 121 GPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLGRQVHAQTILIGGFASDLYV 180
           G  +ES N YT MR +GV PVSFTFSALFKACGASLN+DLG+QVHAQTILIGGFASDLYV
Sbjct: 134 GLVSESTNFYTRMRRDGVGPVSFTFSALFKACGASLNMDLGKQVHAQTILIGGFASDLYV 193

Query: 181 GNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAKFGDMESARGLFDELPLKDM 240
           GN+MID+YVKCG LGC RKVFDEMSERDVVSWTELIVAYAK+GDMESA GLFD+LPLKDM
Sbjct: 194 GNSMIDLYVKCGFLGCARKVFDEMSERDVVSWTELIVAYAKYGDMESASGLFDDLPLKDM 253

Query: 241 VAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVGVISACAQLGAVKYANWIRDI 300
           VAWTAMVTGYAQNARPKEALEYFQKMQDVG+ETDEVTL GVISACAQLGAVK+ANWIRDI
Sbjct: 254 VAWTAMVTGYAQNARPKEALEYFQKMQDVGMETDEVTLAGVISACAQLGAVKHANWIRDI 313

Query: 301 AERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKERNVFSYSSMIVGYAMHGRAH 360
           AERSGFGP  +V+VGSALIDMYSKCGSPDEAYK+FE MKERNVFSYSSMI+GYAMHGRA 
Sbjct: 314 AERSGFGPSGNVVVGSALIDMYSKCGSPDEAYKVFEVMKERNVFSYSSMILGYAMHGRAR 373

Query: 361 SALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLFAKMEKYFNVTPSPDHYACM 420
           SALQLFHEMLKTEIRPNKVTF+GVLSACSHAG+VEQGRQ+FAKMEK+F V PSPDHYACM
Sbjct: 374 SALQLFHEMLKTEIRPNKVTFLGVLSACSHAGLVEQGRQVFAKMEKFFGVAPSPDHYACM 433

Query: 421 VDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGNPNIAQVAADQLFKLEPDGI 480
           VDLLGR GCLEEAL+L++TMPMEP+GGVWGALLGACRIHGNP+IAQ+AA+QLF LEP+ I
Sbjct: 434 VDLLGRAGCLEEALDLVKTMPMEPNGGVWGALLGACRIHGNPDIAQIAANQLFNLEPNSI 493

Query: 481 GNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFEGKKGDIHEFFADDATHQRS 540
           GNYILLSNIYASAGRWEEVSKLRKVIR KGLKKNPGCSWFEGK G+IH+FFA D TH RS
Sbjct: 494 GNYILLSNIYASAGRWEEVSKLRKVIREKGLKKNPGCSWFEGKNGEIHDFFAGDTTHSRS 553

Query: 541 SEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEKERILMSHSEKLALAYGMLCTEAGET 600
           SEIRQAL+QL+ RLRAHGYKPNLSS PYD+TDDEKERILMSHSEKLALAYG+LCT AG+T
Sbjct: 554 SEIRQALKQLIERLRAHGYKPNLSSAPYDMTDDEKERILMSHSEKLALAYGLLCTSAGDT 613

Query: 601 ITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFHNGTCSCGNFW 651
           I IMKN+RICEDCHNVMCAASEITGREIIIRDNMRFHHFH GTCSCGNFW
Sbjct: 614 IKIMKNIRICEDCHNVMCAASEITGREIIIRDNMRFHHFHKGTCSCGNFW 663

BLAST of CmaCh16G006330 vs. NCBI nr
Match: gi|225431281|ref|XP_002268784.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g44230 [Vitis vinifera])

HSP 1 Score: 993.4 bits (2567), Expect = 1.8e-286
Identity = 466/638 (73.04%), Positives = 552/638 (86.52%), Query Frame = 1

Query: 13  KLSYLQPRQTQATPNFIPFSQLQQQRKLLEWRLISILHDCTDFSQIKQVHGQIICNGLSQ 72
           K SY Q +    T +FIPFS ++Q++K+LE RL+S+LH CT  +Q+KQVH  I   GL Q
Sbjct: 15  KTSYCQLQ----TQSFIPFS-VRQEQKILESRLVSVLHGCTHINQVKQVHAHIFRKGLEQ 74

Query: 73  CSYVLTKLIRMLSKVDVPMDCYPRLVFGQVNYPNPFLWTAMIRGYALQGPFTESINLYTN 132
           C +VL KL+R L+K+DVPMD YPRLVF QV YPNPFLWTA+IRGYALQGPF ES+ LY +
Sbjct: 75  CCFVLAKLLRTLTKLDVPMDPYPRLVFQQVEYPNPFLWTALIRGYALQGPFMESVLLYNS 134

Query: 133 MRGNGVSPVSFTFSALFKACGASLNLDLGRQVHAQTILIGGFASDLYVGNTMIDMYVKCG 192
           MR  G+ PVSFTF+AL KAC A+L+++LGRQVH QTILIGGF SDLYVGNT+IDMYVKCG
Sbjct: 135 MRRQGIGPVSFTFTALLKACSAALDVNLGRQVHTQTILIGGFGSDLYVGNTLIDMYVKCG 194

Query: 193 VLGCGRKVFDEMSERDVVSWTELIVAYAKFGDMESARGLFDELPLKDMVAWTAMVTGYAQ 252
            LGCG +VFDEM +RDV+SWT LIVAYAK G+ME+A  LFD LP+KDMVAWTAMVTGYAQ
Sbjct: 195 CLGCGHRVFDEMLDRDVISWTSLIVAYAKVGNMEAASELFDGLPMKDMVAWTAMVTGYAQ 254

Query: 253 NARPKEALEYFQKMQDVGIETDEVTLVGVISACAQLGAVKYANWIRDIAERSGFGPYEHV 312
           NARP+EALE F++MQ  G++TDEVTLVGVISACAQLGA KYANW+RD+AE+SGFGP  +V
Sbjct: 255 NARPREALEVFERMQAAGVKTDEVTLVGVISACAQLGAAKYANWVRDVAEQSGFGPTSNV 314

Query: 313 MVGSALIDMYSKCGSPDEAYKIFEGMKERNVFSYSSMIVGYAMHGRAHSALQLFHEMLKT 372
           +VGSALIDMY+KCGS ++AYK+FE M+ERNV+SYSSMIVG+AMHG A +A++LF EMLKT
Sbjct: 315 VVGSALIDMYAKCGSVEDAYKVFERMEERNVYSYSSMIVGFAMHGLAGAAMELFDEMLKT 374

Query: 373 EIRPNKVTFIGVLSACSHAGMVEQGRQLFAKMEKYFNVTPSPDHYACMVDLLGRGGCLEE 432
           EI+PN+VTFIGVL+ACSHAGMVEQG+QLFA ME+   V PS DHYACMVDLLGR G LEE
Sbjct: 375 EIKPNRVTFIGVLTACSHAGMVEQGQQLFAMMEECHGVAPSEDHYACMVDLLGRAGRLEE 434

Query: 433 ALELIETMPMEPHGGVWGALLGACRIHGNPNIAQVAADQLFKLEPDGIGNYILLSNIYAS 492
           AL L++ MPM PHGGVWGALLGACRIHGNP++AQ+AA  LF+LEP+GIGNYILLSNIYAS
Sbjct: 435 ALNLVKMMPMNPHGGVWGALLGACRIHGNPDMAQIAASHLFELEPNGIGNYILLSNIYAS 494

Query: 493 AGRWEEVSKLRKVIRAKGLKKNPGCSWFEGKKGDIHEFFADDATHQRSSEIRQALRQLLV 552
           AGRW++VSK+RK++RAKGLKKNPGCSW EGKKG IHEFFA D +H +S EI+QAL  LL 
Sbjct: 495 AGRWDDVSKVRKLMRAKGLKKNPGCSWVEGKKGIIHEFFAGDMSHPKSREIKQALEDLLD 554

Query: 553 RLRAHGYKPNLSSVPYDLTDDEKERILMSHSEKLALAYGMLCTEAGETITIMKNLRICED 612
           RL+  GY+PNLSSV YD++D+EK+R+LMSHSEKLALA+G+L T AG TI I+KNLRICED
Sbjct: 555 RLKYLGYQPNLSSVAYDISDEEKKRLLMSHSEKLALAFGLLTTNAGCTIRIVKNLRICED 614

Query: 613 CHNVMCAASEITGREIIIRDNMRFHHFHNGTCSCGNFW 651
           CH+VMC AS+ITGREI++RDNMRFHHF +G CSCGNFW
Sbjct: 615 CHSVMCGASQITGREIVVRDNMRFHHFRDGRCSCGNFW 647

BLAST of CmaCh16G006330 vs. NCBI nr
Match: gi|595814629|ref|XP_007203772.1| (hypothetical protein PRUPE_ppa002597mg [Prunus persica])

HSP 1 Score: 992.6 bits (2565), Expect = 3.1e-286
Identity = 469/654 (71.71%), Positives = 553/654 (84.56%), Query Frame = 1

Query: 1   MIGLSRNFSTVC--KL--SYLQPRQTQATPNFIPFSQLQQQRKLLEWRLISILHDCTDFS 60
           M  LSR FSTV   KL    L   Q Q T  FIPFS+ QQ+RKLLE +LIS L  C + S
Sbjct: 1   MQNLSRRFSTVPIHKLLPQQLHQHQPQPTRFFIPFSEFQQKRKLLEHKLISDLDGCDNLS 60

Query: 61  QIKQVHGQIICNGLSQCSYVLTKLIRMLSKVDVPMDCYPRLVFGQVNYPNPFLWTAMIRG 120
           Q+K+VH  ++ +GLSQC YVLTKL+R L+K+ VP+D YPRLVF QV YPNPFLWTAMIRG
Sbjct: 61  QVKEVHAHLLRHGLSQCCYVLTKLVRTLTKLGVPVDAYPRLVFVQVKYPNPFLWTAMIRG 120

Query: 121 YALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLGRQVHAQTILIGGFAS 180
           Y +QGP +E++N YT MR  G  PVSFTFSALFKACG  L+++LGRQ+HAQTIL+GGFA+
Sbjct: 121 YTVQGPISEALNFYTCMRSAGTGPVSFTFSALFKACGDVLDVNLGRQIHAQTILVGGFAA 180

Query: 181 DLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAKFGDMESARGLFDELP 240
           DLYVGNTMIDMYVKCG L CGRKVFDEM +RDVVSWTELIVAY K GDM SAR LF+ LP
Sbjct: 181 DLYVGNTMIDMYVKCGFLDCGRKVFDEMPDRDVVSWTELIVAYTKIGDMGSARELFEGLP 240

Query: 241 LKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVGVISACAQLGAVKYANW 300
           +KDMVAWTAMVTGYAQNARP++AL+ F++MQ  G+ TDE+TLVG+ISACAQLGA KYANW
Sbjct: 241 VKDMVAWTAMVTGYAQNARPRDALDCFERMQGAGVGTDEITLVGLISACAQLGASKYANW 300

Query: 301 IRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKERNVFSYSSMIVGYAMH 360
           +RDIAE+SGFGP E+V+VGSALIDMYSKCGS DEAYK+F+GMKERNVFSYSSMI+G+AMH
Sbjct: 301 VRDIAEKSGFGPTENVLVGSALIDMYSKCGSLDEAYKVFQGMKERNVFSYSSMILGFAMH 360

Query: 361 GRAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLFAKMEKYFNVTPSPDH 420
           GRA++A++LFHEML TEIRPN+VTFIGVL+ACSHAGMV+QGRQLFA MEKY+NV PS DH
Sbjct: 361 GRANAAIELFHEMLTTEIRPNRVTFIGVLTACSHAGMVDQGRQLFATMEKYYNVVPSADH 420

Query: 421 YACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGNPNIAQVAADQLFKLE 480
           Y CMVDLLGR G LEEALEL+ETMP+  HGGVWGALLGAC IHGNP+IAQ+AA+ LF+LE
Sbjct: 421 YTCMVDLLGRAGRLEEALELVETMPIAAHGGVWGALLGACHIHGNPDIAQIAANHLFELE 480

Query: 481 PDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFEGKKGDIHEFFADDAT 540
           PD IGN+++LSNIYASAGRW +VS++RK+++ KGLKKNP  SW E KKG IHEF A +  
Sbjct: 481 PDSIGNHVMLSNIYASAGRWADVSRVRKMMKEKGLKKNPAYSWVETKKGVIHEFCAGETN 540

Query: 541 HQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEKERILMSHSEKLALAYGMLCTE 600
           H   +EI++AL  LL RL+AHGY+PNL+S  YDL  +E++RILMSHSEKLALAY ++ T+
Sbjct: 541 HPEYAEIKKALDDLLNRLQAHGYQPNLNSAAYDLGIEERKRILMSHSEKLALAYALVSTD 600

Query: 601 AGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFHNGTCSCGNFW 651
           +G TI IMKN+RICEDCH  MC AS++ GREI++RDNMRFHHF NG CSCGNFW
Sbjct: 601 SGSTIKIMKNIRICEDCHVFMCGASQVAGREIVVRDNMRFHHFSNGKCSCGNFW 654

BLAST of CmaCh16G006330 vs. NCBI nr
Match: gi|645273140|ref|XP_008241735.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g44230 [Prunus mume])

HSP 1 Score: 985.3 bits (2546), Expect = 5.0e-284
Identity = 466/654 (71.25%), Positives = 551/654 (84.25%), Query Frame = 1

Query: 1   MIGLSRNFSTVC--KL--SYLQPRQTQATPNFIPFSQLQQQRKLLEWRLISILHDCTDFS 60
           M  LSR FSTV   KL    L   Q Q T  FIPFS+ QQ+RKLLE +LIS L  C + S
Sbjct: 1   MQNLSRRFSTVPIHKLLPQQLHQHQPQPTRFFIPFSEFQQKRKLLEHKLISDLDGCANLS 60

Query: 61  QIKQVHGQIICNGLSQCSYVLTKLIRMLSKVDVPMDCYPRLVFGQVNYPNPFLWTAMIRG 120
           Q+K+VH  ++ +GLSQC YVLTKL+R L+K+ VP+D YPRLVF QV YPNPFLWTAMIRG
Sbjct: 61  QVKEVHAHLLRHGLSQCCYVLTKLVRTLTKLGVPVDAYPRLVFLQVKYPNPFLWTAMIRG 120

Query: 121 YALQGPFTESINLYTNMRGNGVSPVSFTFSALFKACGASLNLDLGRQVHAQTILIGGFAS 180
           Y +QGP +E++N YT MR  G  PVSFTFSALFKACG  L+++LGRQ+HAQTIL+GGFA+
Sbjct: 121 YTVQGPISEALNFYTCMRRAGTGPVSFTFSALFKACGDVLDVNLGRQIHAQTILVGGFAA 180

Query: 181 DLYVGNTMIDMYVKCGVLGCGRKVFDEMSERDVVSWTELIVAYAKFGDMESARGLFDELP 240
           DLYVGNTMIDMYVKCGVL CGRKVFDEM +RDVVSWTELIVAY K GDM SAR LF+ LP
Sbjct: 181 DLYVGNTMIDMYVKCGVLDCGRKVFDEMPDRDVVSWTELIVAYTKIGDMGSARELFEGLP 240

Query: 241 LKDMVAWTAMVTGYAQNARPKEALEYFQKMQDVGIETDEVTLVGVISACAQLGAVKYANW 300
           +KDMVAWTAMVTGYAQNARP++AL+ F++MQ  G+ TDE+TLVG+ISACAQLGA KYA+W
Sbjct: 241 VKDMVAWTAMVTGYAQNARPRDALDCFERMQGAGVGTDEITLVGLISACAQLGASKYASW 300

Query: 301 IRDIAERSGFGPYEHVMVGSALIDMYSKCGSPDEAYKIFEGMKERNVFSYSSMIVGYAMH 360
           +RDIAE+ GFGP E+V+VGSALIDMYSKCGS DEAYK+F+GMKERNVFSYSSMI+G+AMH
Sbjct: 301 VRDIAEKYGFGPTENVLVGSALIDMYSKCGSLDEAYKVFQGMKERNVFSYSSMILGFAMH 360

Query: 361 GRAHSALQLFHEMLKTEIRPNKVTFIGVLSACSHAGMVEQGRQLFAKMEKYFNVTPSPDH 420
           GRA++A++LF EML TEIRPN+VTFIGVL+ACSHAGMV+QGRQLFA MEKY+NV PS DH
Sbjct: 361 GRANAAIELFQEMLTTEIRPNRVTFIGVLTACSHAGMVDQGRQLFATMEKYYNVVPSADH 420

Query: 421 YACMVDLLGRGGCLEEALELIETMPMEPHGGVWGALLGACRIHGNPNIAQVAADQLFKLE 480
           Y CMVDLLGR G LEEALEL++TMP+  HGGVWGALLGACRIHGNP+IAQ+AA+ LF+LE
Sbjct: 421 YTCMVDLLGRAGRLEEALELVKTMPIAAHGGVWGALLGACRIHGNPDIAQIAANHLFELE 480

Query: 481 PDGIGNYILLSNIYASAGRWEEVSKLRKVIRAKGLKKNPGCSWFEGKKGDIHEFFADDAT 540
           PD IGNY++LSNIYA   RW +VS++RK+++ KGLKKNP  SW E KKG IHEF+A +  
Sbjct: 481 PDSIGNYVMLSNIYAXXXRWADVSRVRKMMKEKGLKKNPAYSWVETKKGVIHEFYAGETN 540

Query: 541 HQRSSEIRQALRQLLVRLRAHGYKPNLSSVPYDLTDDEKERILMSHSEKLALAYGMLCTE 600
           H   +EI++AL  LL RL+AHGY+PNL+S  YDL  +E++RILMSHSEKLALAY +L T+
Sbjct: 541 HPEYAEIKKALDDLLNRLQAHGYQPNLNSAAYDLGIEERKRILMSHSEKLALAYALLSTD 600

Query: 601 AGETITIMKNLRICEDCHNVMCAASEITGREIIIRDNMRFHHFHNGTCSCGNFW 651
           +G TI IMKN+RICEDCH  MC AS++ GREI++RDNMRFHHF NG CSCGNFW
Sbjct: 601 SGSTIKIMKNIRICEDCHVFMCGASQVAGREIVVRDNMRFHHFSNGKCSCGNFW 654

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
PP417_ARATH  4.9e-231  61.51         Pentatricopeptide repeat-containing protein At5g44230 OS=Arabidopsis thaliana GN...
PP175_ARATH  6.1e-141  40.89         Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP367_ARATH  3.2e-137  38.79         Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN...
PP449_ARATH  1.2e-136  38.69         Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN...
PP354_ARATH  3.1e-132  42.34         Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t...

Match Name        E-value   Identity (%)  Description
A0A0A0KUQ8_CUCSA  0.0e+00   86.92         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G045030 PE=4 SV=1
F6GWS8_VITVI      1.3e-286  73.04         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0023g02490 PE=4 SV=...
M5VV81_PRUPE      2.2e-286  71.71         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002597mg PE=4 SV=1
V4SE67_9ROSI      4.3e-274  68.91         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025108mg PE=4 SV=1
A0A061DFG9_THECC  1.0e-272  68.13         Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_000...

Match Name   E-value   Identity (%)  Description
AT5G44230.1  2.8e-232  61.51         Pentatricopeptide repeat (PPR) superfamily protein
AT2G29760.1  3.5e-142  40.89         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G06540.1  1.8e-138  38.79         Pentatricopeptide repeat (PPR) superfamily protein
AT5G66520.1  6.8e-138  38.69         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G37380.1  1.7e-133  42.34         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                        E-value   Identity (%)  Description
gi|449457516|ref|XP_004146494.1|  0.0e+00   86.92         PREDICTED: pentatricopeptide repeat-containing protein At5g44230 [Cucumis sativu...
gi|659102402|ref|XP_008452110.1|  0.0e+00   87.23         PREDICTED: pentatricopeptide repeat-containing protein At5g44230 [Cucumis melo]
gi|225431281|ref|XP_002268784.1|  1.8e-286  73.04         PREDICTED: pentatricopeptide repeat-containing protein At5g44230 [Vitis vinifera...
gi|595814629|ref|XP_007203772.1|  3.1e-286  71.71         hypothetical protein PRUPE_ppa002597mg [Prunus persica]
gi|645273140|ref|XP_008241735.1|  5.0e-284  71.25         PREDICTED: pentatricopeptide repeat-containing protein At5g44230 [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0090305      nucleic acid phosphodiester bond hydrolysis
biological_process  GO:0009451      RNA modification
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0004519      endonuclease activity
molecular_function  GO:0003723      RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh16G006330.1  CmaCh16G006330.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 210..236  score: 8.6E-4
    coord: 109..138  score: 0.0031
    coord: 182..208  score: 0.011
    coord: 417..441  score: 0

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 342..389  score: 3.2E-10
    coord: 238..286  score: 4.

IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 109..140  score: 0.0011
    coord: 182..210  score: 4.2E-4
    coord: 241..274  score: 3.9E-6
    coord: 317..343  score: 1.3E-4
    coord: 379..410  score: 0.0029
    coord: 344..377  score: 4.

IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 106..140  score: 10.698
    coord: 311..341  score: 8.594
    coord: 212..238  score: 5.152
    coord: 177..211  score: 8.944
    coord: 342..376  score: 12.014
    coord: 141..176  score: 5.02
    coord: 479..513  score: 6.939
    coord: 377..407  score: 8.166
    coord: 413..443  score: 7.487
    coord: 274..308  score: 7.399
    coord: 239..273  score: 11

IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 443..507  score: 1.2E-9
    coord: 205..373  score: 1.

None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 18..520  score: 1.1E

None  No IPR available  PANTHER  PTHR24015:SF264  SUBFAMILY NOT NAMED
    coord: 18..520  score: 1.1E
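
The `coord` entries in the InterPro table above give residue ranges on the protein, so neighbouring PPR profile hits can be merged to see which stretches of the sequence they span. The following is a rough sketch, not part of the original analysis: it copies the PS51375 (PROSITE profile) coordinates from the table, and the helper names `parse_coords` and `merge_intervals` are hypothetical. Treating hits that directly abut (end + 1 = next start) as one block is an assumption made for illustration.

```python
import re

# PS51375 (PPR profile) coordinates copied from the InterPro table above.
PS51375_TEXT = """
coord: 106..140 coord: 311..341 coord: 212..238 coord: 177..211
coord: 342..376 coord: 141..176 coord: 479..513 coord: 377..407
coord: 413..443 coord: 274..308 coord: 239..273
"""

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Extract (start, end) residue ranges from 'coord: a..b' entries."""
    return [(int(a), int(b)) for a, b in COORD.findall(text)]


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current block when ranges overlap or abut.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


hits = parse_coords(PS51375_TEXT)
blocks = merge_intervals(hits)
covered = sum(end - start + 1 for start, end in blocks)

print(len(hits), "profile hits in", len(blocks), "blocks")
print(blocks)
print(covered, "residues covered")
```

Under these assumptions the eleven profile hits collapse into four contiguous blocks. Only coordinates are used here because several scores in the table are cut off.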

The following gene(s) are paralogous to this gene:

None