CmaCh16G000480 (gene) Cucurbita maxima (Rimu)

Name: CmaCh16G000480
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: Pentatricopeptide repeat-containing protein
Location: Cma_Chr16 : 209774 .. 211636 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCATTATCGAAGCCGACTTTCTTCACACACCTCAGAACCCTAACCAGGTCCCAACATTTACTGCACCAGGCTCCGGCACCTCCCATCGTCACACTCCGATTTCTCTCCTTTGCATCACCGGAGGAAGCTGCTGCCGAACGACGCCGTCGAAAGCGCCGCCTCCGCATTGAGCCCCCGCTCTCTTCCTCCTCTGCCGCTCGCCCACAATCGCAGCCTTCTAAGCCTCAATCCCCACAAAACCCTAATGCCCCCAAACTCCCTGAGCATATCTCTGCTCTCTCTGGTAATCGTCTTAACCTCCACAACCGCATTCTTACTCTCATTCGTGAAAATGATCTTGAAGAGGCTGCGCTTTTCACCCGCCATTCCATTTACTCTAATTGTCGTCCCACGATCTTCACCGTCAATGCCGTTCTCAATGCACTGCTTCGTCAATCGAAATATTCCGATTTGCTTTCACTTCACCGGTTTATTACACAAGCTGGTGTTGCCCCCAATATAATCACTCACAATTTGATTTTTCAGACGTATTTGGATTGTCGTAAGCCGGATACGGCAATGGAACATTACAAGCAGTTGATCAATGATGCGCCTTTCAACCCGTCGCCGACGACTTACAGGATCTTGATTAAAGGGTTGGTAGATAATAACAAGTTGGAGAGGGCAATGGAGCTGAAAGAGGAAATGACTATGAAGGGTTTTGTTCCCGACCCTCTTATTTATCTTTATCTGATGGTGGGTTGTGTGAGAAGTTCGGATCCTGATGGAGTTTTTAAGCTTTTTGAAGAGTTGAAAGAGAAGTTGGGAGGGACTGTGGAAGATGGAGTTGTTTATGGGAGCCTGATGAAAGGGTACTTTATGAAAGAAATGGAGGAGGAAGCAATGAGGTGTTATGAGGAGACTGTGGGTGTTAATTCAGTGGTGAAGATGAGCGCCATTGCATACAATTCCGTGCTTGATGCATTATGCAAGAATGGGAAGTTTGGTGAGGCCTTGATGTTGTTTGATAGGATGACAAAGGAACACAGTCCCCCCAGGCGTCTGGCAGTGGACTTGGGAAGCTTTAATGTGATGGTTGATGGATACTGTATTGAAGGGAGGTTCAAAGATGCCATTGAAGTATTCGAGAAGATGGGTGATTATAGGTGTAGCCCAGATACATTATCATTCAATAATTTGATCGAACAATTATGTAATAATGGAATGTTGGCTGATGCTGAGGAGCTTTATGGAACAATGGGCGATAAGGGAGTTAACCCTGACGAGTTTACTTACGGTTTGTTGATGGATTCTTGCTTTAAAGCGAACAGGCCAGATGATGCAGCTGGATACTTTAGAAAAATGGTTGAGTCCGGACTTAGACCGAATATAGCAGTTTACAATAGATTAGTGGATGAGCTGGTTAAATTGGGGAAGATTAACGAGGCAAAGGCTTTCTTTGACTTGATGGTAAAGAAGTTAAAAATGGATGCCTCGAGCTATCAGTTTATAATGAAGGCGTTAAGTGAATCGGGGCAGCTGGATGAAATGCTAAATGTGGTTGATACTCTTCTGGATGATGATGGGATTGAATTTTCTGAAGAGTTGCAGGAGTTTGTCAGAGGTGAGCTGAGGAAGGAAGACAGGGAAGGAGATTTAGGTAAACTAATGGAAGAGAAAGAAAGAGTGAAAGCTGAAGCGAAGGCAAAGGAGGCTGAGGCAGCAGAGGCACAGAAAAGAAGTGCAAAAGCTGCGGTCTCTTCTTTACTGTCATCCAAGTTGTTTGGGAACAAGGAAGGTGAGAAAGAATCTGCAGTGAACGAAATGCAATCTGGTCAAGAAGACAGTGGTAAAACTGAACTCGCGGAATCGAATCCTTGA

mRNA sequence

ATGGCATTATCGAAGCCGACTTTCTTCACACACCTCAGAACCCTAACCAGGTCCCAACATTTACTGCACCAGGCTCCGGCACCTCCCATCGTCACACTCCGATTTCTCTCCTTTGCATCACCGGAGGAAGCTGCTGCCGAACGACGCCGTCGAAAGCGCCGCCTCCGCATTGAGCCCCCGCTCTCTTCCTCCTCTGCCGCTCGCCCACAATCGCAGCCTTCTAAGCCTCAATCCCCACAAAACCCTAATGCCCCCAAACTCCCTGAGCATATCTCTGCTCTCTCTGGTAATCGTCTTAACCTCCACAACCGCATTCTTACTCTCATTCGTGAAAATGATCTTGAAGAGGCTGCGCTTTTCACCCGCCATTCCATTTACTCTAATTGTCGTCCCACGATCTTCACCGTCAATGCCGTTCTCAATGCACTGCTTCGTCAATCGAAATATTCCGATTTGCTTTCACTTCACCGGTTTATTACACAAGCTGGTGTTGCCCCCAATATAATCACTCACAATTTGATTTTTCAGACGTATTTGGATTGTCGTAAGCCGGATACGGCAATGGAACATTACAAGCAGTTGATCAATGATGCGCCTTTCAACCCGTCGCCGACGACTTACAGGATCTTGATTAAAGGGTTGGTAGATAATAACAAGTTGGAGAGGGCAATGGAGCTGAAAGAGGAAATGACTATGAAGGGTTTTGTTCCCGACCCTCTTATTTATCTTTATCTGATGGTGGGTTGTGTGAGAAGTTCGGATCCTGATGGAGTTTTTAAGCTTTTTGAAGAGTTGAAAGAGAAGTTGGGAGGGACTGTGGAAGATGGAGTTGTTTATGGGAGCCTGATGAAAGGGTACTTTATGAAAGAAATGGAGGAGGAAGCAATGAGGTGTTATGAGGAGACTGTGGGTGTTAATTCAGTGGTGAAGATGAGCGCCATTGCATACAATTCCGTGCTTGATGCATTATGCAAGAATGGGAAGTTTGGTGAGGCCTTGATGTTGTTTGATAGGATGACAAAGGAACACAGTCCCCCCAGGCGTCTGGCAGTGGACTTGGGAAGCTTTAATGTGATGGTTGATGGATACTGTATTGAAGGGAGGTTCAAAGATGCCATTGAAGTATTCGAGAAGATGGGTGATTATAGGTGTAGCCCAGATACATTATCATTCAATAATTTGATCGAACAATTATGTAATAATGGAATGTTGGCTGATGCTGAGGAGCTTTATGGAACAATGGGCGATAAGGGAGTTAACCCTGACGAGTTTACTTACGGTTTGTTGATGGATTCTTGCTTTAAAGCGAACAGGCCAGATGATGCAGCTGGATACTTTAGAAAAATGGTTGAGTCCGGACTTAGACCGAATATAGCAGTTTACAATAGATTAGTGGATGAGCTGGTTAAATTGGGGAAGATTAACGAGGCAAAGGCTTTCTTTGACTTGATGGTAAAGAAGTTAAAAATGGATGCCTCGAGCTATCAGTTTATAATGAAGGCGTTAAGTGAATCGGGGCAGCTGGATGAAATGCTAAATGTGGTTGATACTCTTCTGGATGATGATGGGATTGAATTTTCTGAAGAGTTGCAGGAGTTTGTCAGAGGTGAGCTGAGGAAGGAAGACAGGGAAGGAGATTTAGGTAAACTAATGGAAGAGAAAGAAAGAGTGAAAGCTGAAGCGAAGGCAAAGGAGGCTGAGGCAGCAGAGGCACAGAAAAGAAGTGCAAAAGCTGCGGTCTCTTCTTTACTGTCATCCAAGTTGTTTGGGAACAAGGAAGGTGAGAAAGAATCTGCAGTGAACGAAATGCAATCTGGTCAAGAAGACAGTGGTAAAACTGAACTCGCGGAATCGAATCCTTGA

Coding sequence (CDS)

ATGGCATTATCGAAGCCGACTTTCTTCACACACCTCAGAACCCTAACCAGGTCCCAACATTTACTGCACCAGGCTCCGGCACCTCCCATCGTCACACTCCGATTTCTCTCCTTTGCATCACCGGAGGAAGCTGCTGCCGAACGACGCCGTCGAAAGCGCCGCCTCCGCATTGAGCCCCCGCTCTCTTCCTCCTCTGCCGCTCGCCCACAATCGCAGCCTTCTAAGCCTCAATCCCCACAAAACCCTAATGCCCCCAAACTCCCTGAGCATATCTCTGCTCTCTCTGGTAATCGTCTTAACCTCCACAACCGCATTCTTACTCTCATTCGTGAAAATGATCTTGAAGAGGCTGCGCTTTTCACCCGCCATTCCATTTACTCTAATTGTCGTCCCACGATCTTCACCGTCAATGCCGTTCTCAATGCACTGCTTCGTCAATCGAAATATTCCGATTTGCTTTCACTTCACCGGTTTATTACACAAGCTGGTGTTGCCCCCAATATAATCACTCACAATTTGATTTTTCAGACGTATTTGGATTGTCGTAAGCCGGATACGGCAATGGAACATTACAAGCAGTTGATCAATGATGCGCCTTTCAACCCGTCGCCGACGACTTACAGGATCTTGATTAAAGGGTTGGTAGATAATAACAAGTTGGAGAGGGCAATGGAGCTGAAAGAGGAAATGACTATGAAGGGTTTTGTTCCCGACCCTCTTATTTATCTTTATCTGATGGTGGGTTGTGTGAGAAGTTCGGATCCTGATGGAGTTTTTAAGCTTTTTGAAGAGTTGAAAGAGAAGTTGGGAGGGACTGTGGAAGATGGAGTTGTTTATGGGAGCCTGATGAAAGGGTACTTTATGAAAGAAATGGAGGAGGAAGCAATGAGGTGTTATGAGGAGACTGTGGGTGTTAATTCAGTGGTGAAGATGAGCGCCATTGCATACAATTCCGTGCTTGATGCATTATGCAAGAATGGGAAGTTTGGTGAGGCCTTGATGTTGTTTGATAGGATGACAAAGGAACACAGTCCCCCCAGGCGTCTGGCAGTGGACTTGGGAAGCTTTAATGTGATGGTTGATGGATACTGTATTGAAGGGAGGTTCAAAGATGCCATTGAAGTATTCGAGAAGATGGGTGATTATAGGTGTAGCCCAGATACATTATCATTCAATAATTTGATCGAACAATTATGTAATAATGGAATGTTGGCTGATGCTGAGGAGCTTTATGGAACAATGGGCGATAAGGGAGTTAACCCTGACGAGTTTACTTACGGTTTGTTGATGGATTCTTGCTTTAAAGCGAACAGGCCAGATGATGCAGCTGGATACTTTAGAAAAATGGTTGAGTCCGGACTTAGACCGAATATAGCAGTTTACAATAGATTAGTGGATGAGCTGGTTAAATTGGGGAAGATTAACGAGGCAAAGGCTTTCTTTGACTTGATGGTAAAGAAGTTAAAAATGGATGCCTCGAGCTATCAGTTTATAATGAAGGCGTTAAGTGAATCGGGGCAGCTGGATGAAATGCTAAATGTGGTTGATACTCTTCTGGATGATGATGGGATTGAATTTTCTGAAGAGTTGCAGGAGTTTGTCAGAGGTGAGCTGAGGAAGGAAGACAGGGAAGGAGATTTAGGTAAACTAATGGAAGAGAAAGAAAGAGTGAAAGCTGAAGCGAAGGCAAAGGAGGCTGAGGCAGCAGAGGCACAGAAAAGAAGTGCAAAAGCTGCGGTCTCTTCTTTACTGTCATCCAAGTTGTTTGGGAACAAGGAAGGTGAGAAAGAATCTGCAGTGAACGAAATGCAATCTGGTCAAGAAGACAGTGGTAAAACTGAACTCGCGGAATCGAATCCTTGA

Protein sequence

MALSKPTFFTHLRTLTRSQHLLHQAPAPPIVTLRFLSFASPEEAAAERRRRKRRLRIEPPLSSSSAARPQSQPSKPQSPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPDPLIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCYEETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKGVNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAKAFFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGELRKEDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKESAVNEMQSGQEDSGKTELAESNP
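The protein sequence corresponds to a translation of the coding sequence with the standard genetic code. The sketch below is only an illustration, assuming the standard code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last few codons of the CDS are used as input.

```python
# Standard genetic code (NCBI table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last five codons of the CDS shown above.
print(translate("ATGGCATTATCGAAGCCGACT"))
print(translate("GAATCGAATCCTTGA"))
```

The two fragments should reproduce the start (`MALSKPT`) and end (`ESNP`) of the protein sequence listed above.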
BLAST of CmaCh16G000480 vs. Swiss-Prot
Match: PP273_ARATH (Pentatricopeptide repeat-containing protein At3g49240 OS=Arabidopsis thaliana GN=EMB1796 PE=2 SV=1)

HSP 1 Score: 763.8 bits (1971), Expect = 1.4e-219
Identity = 389/617 (63.05%), Positives = 492/617 (79.74%), Query Frame = 1

Query: 1   MALSKPTFFTHLRTLTRSQHLLHQAPAPPIVTLRFLSFASPEEAAAERRRRKRRLRIEPP 60
           M++SK  F  HL+TL+RS    H+    P + +R++SFA+ EEAAAERRRRKRRLR+EPP
Sbjct: 1   MSISKAAFLNHLQTLSRSYR--HRVLPQPFLAVRYMSFATQEEAAAERRRRKRRLRMEPP 60

Query: 61  LSSSSAARPQSQPSKPQSPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAALF 120
           ++S + ++ Q Q   P+  QNPN PKLPE +SAL G RL+LHN IL LIRENDLEEAAL+
Sbjct: 61  VNSFNRSQ-QQQSQIPRPIQNPNIPKLPESVSALVGKRLDLHNHILKLIRENDLEEAALY 120

Query: 121 TRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYLD 180
           TRHS+YSNCRPTIFTVN VL A LRQ+KY  LL LH FI QAG+APNIIT+NLIFQ YLD
Sbjct: 121 TRHSVYSNCRPTIFTVNTVLAAQLRQAKYGALLQLHGFINQAGIAPNIITYNLIFQAYLD 180

Query: 181 CRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPDPL 240
            RKP+ A+EHYK  I++AP NPS  T+RIL+KGLV N+ LE+AME+KE+M +KGFV DP+
Sbjct: 181 VRKPEIALEHYKLFIDNAPLNPSIATFRILVKGLVSNDNLEKAMEIKEDMAVKGFVVDPV 240

Query: 241 IYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCYE 300
           +Y YLM+GCV++SD DGV KL++ELKEKLGG V+DGVVYG LMKGYFMKEME+EAM CYE
Sbjct: 241 VYSYLMMGCVKNSDADGVLKLYQELKEKLGGFVDDGVVYGQLMKGYFMKEMEKEAMECYE 300

Query: 301 ETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNVMV 360
           E VG NS V+MSA+AYN VL+AL +NGKF EAL LFD + KEH+PPR LAV+LG+FNVMV
Sbjct: 301 EAVGENSKVRMSAMAYNYVLEALSENGKFDEALKLFDAVKKEHNPPRHLAVNLGTFNVMV 360

Query: 361 DGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKGVN 420
           +GYC  G+F++A+EVF +MGD++CSPDTLSFNNL+ QLC+N +LA+AE+LYG M +K V 
Sbjct: 361 NGYCAGGKFEEAMEVFRQMGDFKCSPDTLSFNNLMNQLCDNELLAEAEKLYGEMEEKNVK 420

Query: 421 PDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAKAF 480
           PDE+TYGLLMD+CFK  + D+ A Y++ MVES LRPN+AVYNRL D+L+K GK+++AK+F
Sbjct: 421 PDEYTYGLLMDTCFKEGKIDEGAAYYKTMVESNLRPNLAVYNRLQDQLIKAGKLDDAKSF 480

Query: 481 FDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGELRK 540
           FD+MV KLKMD  +Y+FIM+ALSE+G+LDEML +VD +LDDD +  SEELQEFV+ ELRK
Sbjct: 481 FDMMVSKLKMDDEAYKFIMRALSEAGRLDEMLKIVDEMLDDDTVRVSEELQEFVKEELRK 540

Query: 541 EDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKESAV 600
             REGDL KLMEEKER+KAEAKAKE   AE +K++    +++L+  K    K+   +   
Sbjct: 541 GGREGDLEKLMEEKERLKAEAKAKELADAEEKKKAQSINIAALIPPKAVEEKKETAKLLW 600

Query: 601 NEMQSGQEDSGKTELAE 618
                G E++   E+A+
Sbjct: 601 ENEAGGVEEADVVEMAK 614
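Each hit is summarised by two header lines: bit score with raw score and E-value, then identity and positive counts over the alignment length. The following sketch shows one possible way to pull those numbers out and recompute the percentages. The function `parse_hsp` and its dictionary keys are assumptions made for illustration, and the regular expression accepts both the "Positives" and "Postives" spellings of the label.

```python
import re

SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract scores and recompute percentages from the two HSP header lines."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("unrecognised HSP header format")
    identical, length, positives, _ = (int(x) for x in ident.groups())
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": float(score.group(3)),
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 763.8 bits (1971), Expect = 1.4e-219",
    "Identity = 389/617 (63.05%), Postives = 492/617 (79.74%), Query Frame = 1",
)
print(hsp)
```

Recomputing the percentages from the raw counts is a cheap consistency check on a scraped report.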

BLAST of CmaCh16G000480 vs. Swiss-Prot
Match: PPR29_ARATH (Pentatricopeptide repeat-containing protein At1g10270 OS=Arabidopsis thaliana GN=GRP23 PE=1 SV=1)

HSP 1 Score: 305.4 bits (781), Expect = 1.4e-81
Identity = 193/541 (35.67%), Positives = 298/541 (55.08%), Query Frame = 1

Query: 23  HQAPAP-PIVTLRFLSFASPEEAAAERRRRKRRLRIEPPLSSSSAARPQSQPSKPQSPQN 82
           H  P P P +  R ++F+S EEAAAERRRRKRRLRIEPPL +      +  PS P   ++
Sbjct: 74  HTPPIPYPPIPHRTMAFSSAEEAAAERRRRKRRLRIEPPLHAL-----RRDPSAPPPKRD 133

Query: 83  PNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLN 142
           PNAP+LP+  SAL G RLNLHNR+ +LIR +DL+ A+   R S++SN RPT+FT NA++ 
Sbjct: 134 PNAPRLPDSTSALVGQRLNLHNRVQSLIRASDLDAASKLARQSVFSNTRPTVFTCNAIIA 193

Query: 143 ALLRQSKYSDLLSLHR-FITQAGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPF 202
           A+ R  +YS+ +SL + F  Q+ + PN++++N I   + D    D A+E Y+ ++ +APF
Sbjct: 194 AMYRAKRYSESISLFQYFFKQSNIVPNVVSYNQIINAHCDEGNVDEALEVYRHILANAPF 253

Query: 203 NPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPDPLIYLYLMVGCVRSSDPDGVFK 262
            PS  TYR L KGLV   ++  A  L  EM  KG   D  +Y  L+ G +   D D   +
Sbjct: 254 APSSVTYRHLTKGLVQAGRIGDAASLLREMLSKGQAADSTVYNNLIRGYLDLGDFDKAVE 313

Query: 263 LFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCYEETVGVNSVVKMSAIAYNSVL 322
            F+ELK K   TV DG+V  + M+ +F K  ++EAM  Y     ++   +M     N +L
Sbjct: 314 FFDELKSKC--TVYDGIVNATFMEYWFEKGNDKEAMESYRSL--LDKKFRMHPPTGNVLL 373

Query: 323 DALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNVMVDGYCIEGRFKDAIEVFEKMG 382
           +   K GK  EA  LF+ M   H+PP  L+V+  +  +MV+     G F +AI  F+K+G
Sbjct: 374 EVFLKFGKKDEAWALFNEMLDNHAPPNILSVNSDTVGIMVNECFKMGEFSEAINTFKKVG 433

Query: 383 DYRCSP----DTLSFNNLIEQLCNNGMLADAEELYGTMGDKGVNPDEFTYGLLMDSCFKA 442
               S     D L + N++ + C  GML +AE  +     + +  D  ++  ++D+  KA
Sbjct: 434 SKVTSKPFVMDYLGYCNIVTRFCEQGMLTEAERFFAEGVSRSLPADAPSHRAMIDAYLKA 493

Query: 443 NRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINE-AKAFFDLMVKKLKMDASSY 502
            R DDA     +MV+  LR       R+  EL+K GK+ E A+    +  ++ K D S Y
Sbjct: 494 ERIDDAVKMLDRMVDVNLRVVADFGARVFGELIKNGKLTESAEVLTKMGEREPKPDPSIY 553

Query: 503 QFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGELRKEDREGDLGKLMEEKE 557
             +++ L +   LD+  ++V  ++  + +  +  L+EF+     K  R  ++ K++    
Sbjct: 554 DVVVRGLCDGDALDQAKDIVGEMIRHN-VGVTTVLREFIIEVFEKAGRREEIEKILNSVA 604

BLAST of CmaCh16G000480 vs. Swiss-Prot
Match: PPR28_ARATH (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 177.6 bits (449), Expect = 4.3e-43
Identity = 106/413 (25.67%), Positives = 199/413 (48.18%), Query Frame = 1

Query: 102 HNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQ 161
           +N +  ++R  +LEE   F  + +Y    P I     ++    R  K      +   +  
Sbjct: 106 NNHLRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEG 165

Query: 162 AGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLE 221
           +G  P++IT+N++   Y    + + A+     +++    +P   TY  +++ L D+ KL+
Sbjct: 166 SGAVPDVITYNVMISGYCKAGEINNALS----VLDRMSVSPDVVTYNTILRSLCDSGKLK 225

Query: 222 RAMELKEEMTMKGFVPDPLIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGS 281
           +AME+ + M  +   PD + Y  L+    R S      KL +E++++  G   D V Y  
Sbjct: 226 QAMEVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDR--GCTPDVVTYNV 285

Query: 282 LMKGYFMKEMEEEAMRCYEETVGVNSVVKMSAIAYNSVLDALCKNGKFGEA-LMLFDRMT 341
           L+ G   +   +EA++   +    +S  + + I +N +L ++C  G++ +A  +L D + 
Sbjct: 286 LVNGICKEGRLDEAIKFLNDMP--SSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLR 345

Query: 342 KEHSPPRRLAVDLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCN 401
           K  SP       + +FN++++  C +G    AI++ EKM  + C P++LS+N L+   C 
Sbjct: 346 KGFSP------SVVTFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCK 405

Query: 402 NGMLADAEELYGTMGDKGVNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAV 461
              +  A E    M  +G  PD  TY  ++ +  K  + +DA     ++   G  P +  
Sbjct: 406 EKKMDRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLIT 465

Query: 462 YNRLVDELVKLGKINEA-KAFFDLMVKKLKMDASSYQFIMKALSESGQLDEML 513
           YN ++D L K GK  +A K   ++  K LK D  +Y  ++  LS  G++DE +
Sbjct: 466 YNTVIDGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAI 504

BLAST of CmaCh16G000480 vs. Swiss-Prot
Match: PPR37_ARATH (Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana GN=At1g12620 PE=2 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 1.2e-40
Identity = 111/418 (26.56%), Positives = 198/418 (47.37%), Query Frame = 1

Query: 108 LIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPN 167
           L  E  + EA       +    +PT+ T+NA++N L    K SD + L   + + G  PN
Sbjct: 152 LCLEGRVSEALELVDRMVEMGHKPTLITLNALVNGLCLNGKVSDAVLLIDRMVETGFQPN 211

Query: 168 IITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELK 227
            +T+  + +      +   AME  +++  +         Y I+I GL  +  L+ A  L 
Sbjct: 212 EVTYGPVLKVMCKSGQTALAMELLRKM-EERKIKLDAVKYSIIIDGLCKDGSLDNAFNLF 271

Query: 228 EEMTMKGFVPDPLIYLYLMVG-CVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGY 287
            EM +KGF  D +IY  L+ G C      DG   L + +K K+     D V + +L+  +
Sbjct: 272 NEMEIKGFKADIIIYTTLIRGFCYAGRWDDGAKLLRDMIKRKI---TPDVVAFSALIDCF 331

Query: 288 FMKEMEEEAMRCYEETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPP 347
             +    EA   ++E +     +    + Y S++D  CK  +  +A  + D M  +   P
Sbjct: 332 VKEGKLREAEELHKEMI--QRGISPDTVTYTSLIDGFCKENQLDKANHMLDLMVSKGCGP 391

Query: 348 RRLAVDLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAD 407
                ++ +FN++++GYC      D +E+F KM       DT+++N LI+  C  G L  
Sbjct: 392 -----NIRTFNILINGYCKANLIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLEV 451

Query: 408 AEELYGTMGDKGVNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVD 467
           A+EL+  M  + V PD  +Y +L+D       P+ A   F K+ +S +  +I +YN ++ 
Sbjct: 452 AKELFQEMVSRRVRPDIVSYKILLDGLCDNGEPEKALEIFEKIEKSKMELDIGIYNIIIH 511

Query: 468 ELVKLGKINEA-KAFFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDG 524
            +    K+++A   F  L +K +K D  +Y  ++  L + G L E  +++   +++DG
Sbjct: 512 GMCNASKVDDAWDLFCSLPLKGVKPDVKTYNIMIGGLCKKGSLSE-ADLLFRKMEEDG 557

BLAST of CmaCh16G000480 vs. Swiss-Prot
Match: PPR97_ARATH (Pentatricopeptide repeat-containing protein At1g63070, mitochondrial OS=Arabidopsis thaliana GN=At1g63070 PE=2 SV=1)

HSP 1 Score: 162.2 bits (409), Expect = 1.9e-38
Identity = 105/417 (25.18%), Positives = 212/417 (50.84%), Query Frame = 1

Query: 133 IFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYLDCRKPDTAMEHYK 192
           ++T +  +N   R+S+ S  L++   + + G  P+I+T N +   +    +   A+    
Sbjct: 110 LYTYSIFINYFCRRSQLSLALAILGKMMKLGYGPSIVTLNSLLNGFCHGNRISEAVALVD 169

Query: 193 QLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPDPLIYLYLMVGCVRS 252
           Q++ +  + P   T+  L+ GL  +NK   A+ L E M +KG  PD + Y  ++ G  + 
Sbjct: 170 QMV-EMGYQPDTVTFTTLVHGLFQHNKASEAVALVERMVVKGCQPDLVTYGAVINGLCKR 229

Query: 253 SDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCYE--ETVGVNSVVK 312
            +PD    L  ++++  G    D V+Y +++ G    +  ++A   +   ET G+    K
Sbjct: 230 GEPDLALNLLNKMEK--GKIEADVVIYNTIIDGLCKYKHMDDAFDLFNKMETKGI----K 289

Query: 313 MSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNVMVDGYCIEGRFK 372
                YN ++  LC  G++ +A  L   M +++  P     DL  FN ++D +  EG+  
Sbjct: 290 PDVFTYNPLISCLCNYGRWSDASRLLSDMLEKNINP-----DLVFFNALIDAFVKEGKLV 349

Query: 373 DAIEVFEKMGDYR-CSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKGVNPDEFTYGLL 432
           +A +++++M   + C PD +++N LI+  C    + +  E++  M  +G+  +  TY  L
Sbjct: 350 EAEKLYDEMVKSKHCFPDVVAYNTLIKGFCKYKRVEEGMEVFREMSQRGLVGNTVTYTTL 409

Query: 433 MDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAKAFFDLMVKK-L 492
           +   F+A   D+A   F++MV  G+ P+I  YN L+D L   G +  A   F+ M K+ +
Sbjct: 410 IHGFFQARDCDNAQMVFKQMVSDGVHPDIMTYNILLDGLCNNGNVETALVVFEYMQKRDM 469

Query: 493 KMDASSYQFIMKALSESGQLDEMLNVVDTL----LDDDGIEFSEELQEFVRGELRKE 542
           K+D  +Y  +++AL ++G++++  ++  +L    +  + + ++  +  F R  L++E
Sbjct: 470 KLDIVTYTTMIEALCKAGKVEDGWDLFCSLSLKGVKPNVVTYTTMMSGFCRKGLKEE 514

BLAST of CmaCh16G000480 vs. TrEMBL
Match: E5GB98_CUCME (Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 1067.4 bits (2759), Expect = 6.7e-309
Identity = 557/624 (89.26%), Positives = 585/624 (93.75%), Query Frame = 1

Query: 1   MALSKPTFFTHLRTLTRSQHLLH-QAPAP-PIVTLRFLSFASPEEAAAERRRRKRRLRIE 60
           MALSKP FFTHL+TLT S HLL  QAPAP PIVT RFLSFAS EEA AERRRRKRRLRIE
Sbjct: 1   MALSKPAFFTHLKTLTGSHHLLQRQAPAPLPIVTFRFLSFASAEEADAERRRRKRRLRIE 60

Query: 61  PPLSSSSAARPQSQPSKPQSPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           PPLSSSSAARPQSQPS+ Q+PQNPN PK+PEHISALSGNRLNLHNRILTLIRENDLEEAA
Sbjct: 61  PPLSSSSAARPQSQPSRSQTPQNPNTPKVPEHISALSGNRLNLHNRILTLIRENDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTY 180
           LFTRHSIYSNCRPTIFTVNAVLNA LRQSKY+DLLSLHRFITQAGV PNIITHNLIFQTY
Sbjct: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVVPNIITHNLIFQTY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPD 240
           LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDN KLERAMELKEEM +KGF PD
Sbjct: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNKKLERAMELKEEMIVKGFAPD 240

Query: 241 PLIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           PLIY YLM GCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYG+LMKGYFMKEMEEEAM+C
Sbjct: 241 PLIYHYLMAGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGNLMKGYFMKEMEEEAMKC 300

Query: 301 YEETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNV 360
           YEETVG N VVKMSAIAYNSVLDALCK+GKF EAL LFDRMTKEH PPR LAV+LG+FNV
Sbjct: 301 YEETVGDNPVVKMSAIAYNSVLDALCKHGKFSEALTLFDRMTKEHRPPRHLAVNLGTFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKG 420
           MVDGYCI+GRFK+AI VFE+MGDYRCSPDTLSFNNLIEQLCNNGMLA+AE LYGTMG+KG
Sbjct: 361 MVDGYCIKGRFKEAIGVFEEMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEMLYGTMGEKG 420

Query: 421 VNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAK 480
           VNPDEFTYGLLM SCF+ NR DDAA YFRKMV+SGLRPNIAVYN LV ELVKLGK++EAK
Sbjct: 421 VNPDEFTYGLLMHSCFQKNRADDAAAYFRKMVDSGLRPNIAVYNILVGELVKLGKVDEAK 480

Query: 481 AFFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGEL 540
           +FFDLMVKKLKMDAS+YQFIMKALSESG++DE+LNVVDTLLDDDGIEFSEELQEFVRGEL
Sbjct: 481 SFFDLMVKKLKMDASNYQFIMKALSESGKMDEVLNVVDTLLDDDGIEFSEELQEFVRGEL 540

Query: 541 RKEDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           RKEDRE DL KL+EEKER+KAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLF NKEGEKES
Sbjct: 541 RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFANKEGEKES 600

Query: 601 AVNEMQSGQ--EDSGKTELAESNP 621
            VNEMQSGQ  +D GKTELAESNP
Sbjct: 601 VVNEMQSGQQEDDGGKTELAESNP 624

BLAST of CmaCh16G000480 vs. TrEMBL
Match: A0A0A0L7B2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G239860 PE=4 SV=1)

HSP 1 Score: 1065.1 bits (2753), Expect = 3.3e-308
Identity = 559/624 (89.58%), Positives = 585/624 (93.75%), Query Frame = 1

Query: 1   MALSKPTFFTHLRTLTRSQHLLH-QAPAP-PIVTLRFLSFASPEEAAAERRRRKRRLRIE 60
           MALSKP FFTHL+TLT S HLL  QA AP PIVTLRFLSFAS EEA AERRRRKRRLRIE
Sbjct: 1   MALSKPAFFTHLKTLTGSHHLLQRQALAPFPIVTLRFLSFASAEEADAERRRRKRRLRIE 60

Query: 61  PPLSSSSAARPQSQPSKPQSPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           PPLSSSSAARP +QP + Q+PQNPNAPK+PEHISALSGNRLNLHNRILTLIRENDLEEAA
Sbjct: 61  PPLSSSSAARPLTQPPRSQTPQNPNAPKIPEHISALSGNRLNLHNRILTLIRENDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTY 180
           LFTRHSIYSNCRPTIFTVNAVLNA LRQSKY+DLLSLHRFITQAGV PNIITHNLIFQTY
Sbjct: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVVPNIITHNLIFQTY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPD 240
           LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELK+EM  KGF PD
Sbjct: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKDEMIEKGFAPD 240

Query: 241 PLIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           PLIY YLM GCVRS DPDGVFKLFEELKEKLG TVEDGVVYG+LMKGYFMKEMEEEAM+C
Sbjct: 241 PLIYHYLMGGCVRSLDPDGVFKLFEELKEKLGATVEDGVVYGNLMKGYFMKEMEEEAMKC 300

Query: 301 YEETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNV 360
           YEETVG NSVVKMSAIAYNSVLDALC+NGKFGEAL LFDRMTKEH PPR LAV+LGSFNV
Sbjct: 301 YEETVGDNSVVKMSAIAYNSVLDALCRNGKFGEALTLFDRMTKEHRPPRHLAVNLGSFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKG 420
           MVDGYCIEGRFK+AIEVFEKMGDYRC PDTLSFNNLIEQLCNNGMLA+AE LYGTM DKG
Sbjct: 361 MVDGYCIEGRFKEAIEVFEKMGDYRCCPDTLSFNNLIEQLCNNGMLAEAEMLYGTMDDKG 420

Query: 421 VNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAK 480
           VNPDEFTYGLLMDSCFK NR DDAA YFRKMV+SGLRPNIAVYN LVDELVKLGKI++AK
Sbjct: 421 VNPDEFTYGLLMDSCFKKNRADDAAAYFRKMVDSGLRPNIAVYNILVDELVKLGKIDDAK 480

Query: 481 AFFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGEL 540
           +FFDLMVKKLKMDASSYQFIMKALSESG++DE+LNVVDTLLDDDGIEFSEELQEFVRGEL
Sbjct: 481 SFFDLMVKKLKMDASSYQFIMKALSESGKMDEILNVVDTLLDDDGIEFSEELQEFVRGEL 540

Query: 541 RKEDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           RKE+RE DL KL+EEKER+KAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLF NKEGEKES
Sbjct: 541 RKENREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFANKEGEKES 600

Query: 601 AVNEMQS--GQEDSGKTELAESNP 621
            VNEMQS   ++DSGKTELAES+P
Sbjct: 601 VVNEMQSVEQEDDSGKTELAESSP 624

BLAST of CmaCh16G000480 vs. TrEMBL
Match: M5X3R7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002582mg PE=4 SV=1)

HSP 1 Score: 934.9 bits (2415), Expect = 5.1e-269
Identity = 481/619 (77.71%), Positives = 542/619 (87.56%), Query Frame = 1

Query: 1   MALSKPTFFTHLRTLTRSQHLLHQAPAPPIVTLRFLSFASPEEAAAERRRRKRRLRIEPP 60
           MALSKPTF THLRTL +  +  H    P  ++LRFLSFA+PEEAAAERRRRKRRLRIEPP
Sbjct: 1   MALSKPTFLTHLRTLAKPPNCHHPTTPPSFISLRFLSFATPEEAAAERRRRKRRLRIEPP 60

Query: 61  LSSSSAARPQSQPSK-PQSPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAAL 120
           LSS    + Q Q  + P+  QNPNAPKLPE +SALSGNRLNLHNRILTL+R+NDLEEAAL
Sbjct: 61  LSSLHRNQQQQQQQQSPKPQQNPNAPKLPEPVSALSGNRLNLHNRILTLVRQNDLEEAAL 120

Query: 121 FTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYL 180
           +TRHSIYSNCRPTIFTVN+VL A LRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYL
Sbjct: 121 YTRHSIYSNCRPTIFTVNSVLTAQLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYL 180

Query: 181 DCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPDP 240
           DCRKPDTAME+YKQLINDAPFNPSPTTYRILIKGLVDNNKL+RAMELKEE+  KGF PDP
Sbjct: 181 DCRKPDTAMENYKQLINDAPFNPSPTTYRILIKGLVDNNKLDRAMELKEEIDAKGFAPDP 240

Query: 241 LIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCY 300
           ++Y YLMVGCV++SD DGVF+L+EELKEKLGG VEDG+VYG+LMKGYFM+ ME+EAM CY
Sbjct: 241 VVYHYLMVGCVKNSDSDGVFRLYEELKEKLGGVVEDGIVYGNLMKGYFMRGMEKEAMECY 300

Query: 301 EETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNVM 360
           EE+ G +S VK SA+AYNSVLDAL KNGKF EAL LFDRM  EH+PPRRLAV+LGSFNVM
Sbjct: 301 EESFGESSKVKTSAVAYNSVLDALSKNGKFDEALRLFDRMVAEHNPPRRLAVNLGSFNVM 360

Query: 361 VDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKGV 420
            DGYC++GRFK+AIEVF KMGDYRCSPDTLSFNNLIEQLC NGML++AEELYG M DKGV
Sbjct: 361 ADGYCVQGRFKEAIEVFRKMGDYRCSPDTLSFNNLIEQLCKNGMLSEAEELYGEMSDKGV 420

Query: 421 NPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAKA 480
            PDEFTY LLMD+CF+ NR DDAA YFRKMV++ LRPN+AVYNRLVD L+K+GK++EAK+
Sbjct: 421 YPDEFTYVLLMDTCFEENRADDAAEYFRKMVDAKLRPNLAVYNRLVDGLIKVGKVDEAKS 480

Query: 481 FFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGELR 540
           FFDLMVKKLKMD  SYQFIMK LSE+G+LDE+LNVVDT+LDDDG+EF+EELQEFV+GELR
Sbjct: 481 FFDLMVKKLKMDIPSYQFIMKTLSEAGKLDEVLNVVDTMLDDDGVEFNEELQEFVKGELR 540

Query: 541 KEDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKESA 600
           KE RE ++GKLMEEKER KAEAKAKEAEAAEA KRSA+AAVSSLL SKLFGNKE E  S 
Sbjct: 541 KEGREDEVGKLMEEKERQKAEAKAKEAEAAEAAKRSARAAVSSLLPSKLFGNKESETGST 600

Query: 601 VNEMQSGQEDSGKTELAES 619
                +G  ++  T+ AE+
Sbjct: 601 QATENAG--EAASTQPAEA 617

BLAST of CmaCh16G000480 vs. TrEMBL
Match: W9RP26_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006813 PE=4 SV=1)

HSP 1 Score: 911.8 bits (2355), Expect = 4.7e-262
Identity = 478/644 (74.22%), Positives = 539/644 (83.70%), Query Frame = 1

Query: 1   MALSKPT-FFTHLRTLTRSQHLLHQAPAPP---IVTLRFLSFASPEEAAAERRRRKRRLR 60
           MALSKP  F THL+TL +  H    +P PP    V+LRFLSFA+PE+AAAERRRRKRRLR
Sbjct: 1   MALSKPNAFLTHLKTLAKPPHRRFLSPPPPPPSFVSLRFLSFATPEDAAAERRRRKRRLR 60

Query: 61  IEPPLSSSSAARPQSQPSKPQSPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEE 120
           IEPPLSS    + Q Q S P   +NPNAPKLP+H+SAL+GNRLNLHN+ILTLIRENDLEE
Sbjct: 61  IEPPLSSLHRNQQQQQQSPPPPQRNPNAPKLPDHVSALTGNRLNLHNKILTLIRENDLEE 120

Query: 121 AALFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQ 180
           AAL+TRHSIYSNCRPTIFTVN+VLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNL+FQ
Sbjct: 121 AALYTRHSIYSNCRPTIFTVNSVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLVFQ 180

Query: 181 TYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFV 240
           TYLDCRKPDTAMEHYKQLINDAPF+PSPTTYRIL+KGLVDNN+LERA+ELKEEM+ KG  
Sbjct: 181 TYLDCRKPDTAMEHYKQLINDAPFSPSPTTYRILVKGLVDNNRLERALELKEEMSEKGLA 240

Query: 241 PDPLIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAM 300
           PDP +Y YLM GCVR+SD D VF L+EELK KLGG VEDGVVYGSLMK YF+K ME+EAM
Sbjct: 241 PDPTVYHYLMAGCVRNSDVDKVFDLYEELKGKLGGFVEDGVVYGSLMKAYFLKGMEKEAM 300

Query: 301 RCYEETVGV---------------------NSVVKMSAIAYNSVLDALCKNGKFGEALML 360
             +EE VG                      NS VKMSA+AYNSVLDAL KNGKF EAL L
Sbjct: 301 EIFEEAVGAGYFLKGIKKESMETFEEALAENSSVKMSAVAYNSVLDALSKNGKFDEALKL 360

Query: 361 FDRMTKEHSPPRRLAVDLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLI 420
           FDRM KEH+PPRRLAV+LG+FNV+ +GYC +GRF+DAIEVF  MGDYRCSPDTLSFN LI
Sbjct: 361 FDRMKKEHNPPRRLAVNLGTFNVIAEGYCAQGRFRDAIEVFRTMGDYRCSPDTLSFNVLI 420

Query: 421 EQLCNNGMLADAEELYGTMGDKGVNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLR 480
           EQLCNNGML +AE LYG MG+KGVNPDEFT+GLLMD+CFK NRPDDAAGYFRKMV+S LR
Sbjct: 421 EQLCNNGMLGEAEALYGEMGEKGVNPDEFTFGLLMDTCFKENRPDDAAGYFRKMVDSKLR 480

Query: 481 PNIAVYNRLVDELVKLGKINEAKAFFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVV 540
           PN+AVYNRLVD LVK+GK++EAK+FFDLMVKKLKMD  SY+FIMKALSESG+LDE+LNVV
Sbjct: 481 PNLAVYNRLVDGLVKVGKVDEAKSFFDLMVKKLKMDVPSYKFIMKALSESGKLDEVLNVV 540

Query: 541 DTLLDDDGIEFSEELQEFVRGELRKEDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRS 600
           DT+LDDDG+EF+EE+QEFV+GELRKE RE +L KL+EEKER KAEAKAKEAEAAEA KRS
Sbjct: 541 DTMLDDDGVEFNEEVQEFVKGELRKEGREDELAKLIEEKERQKAEAKAKEAEAAEAAKRS 600

Query: 601 AKAAVSSLLSSKLFGNKEGEKESAVNEMQSGQEDSGKTELAESN 620
           A+AAVSSLL SKLFG+KE  +  +     +G    G+    ES+
Sbjct: 601 ARAAVSSLLPSKLFGSKESTESGSAE--ANGSPTVGEASSTESS 642

BLAST of CmaCh16G000480 vs. TrEMBL
Match: A0A061G122_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_015413 PE=4 SV=1)

HSP 1 Score: 867.8 bits (2241), Expect = 7.7e-249
Identity = 440/621 (70.85%), Positives = 520/621 (83.74%), Query Frame = 1

Query: 1   MALSKPTFFTHLRTLTRSQHLLHQAPAPPIVTLRFLSFASPEEAAAERRRRKRRLRIEPP 60
           MALSKPTF THL+ L +  H   ++P P  +T R LSF +PEEAAAERRRRKRRLR+EPP
Sbjct: 1   MALSKPTFLTHLQNLAKRHH---RSP-PSFITFRHLSFNTPEEAAAERRRRKRRLRVEPP 60

Query: 61  LSSSSAARPQSQPSKPQSP-QNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAAL 120
           LSS+  ++ Q+Q   P  P QNPNAPK+PE ++ L+GNRLNLHN+IL LIRENDLEEAAL
Sbjct: 61  LSSAHRSKQQAQQVAPSKPIQNPNAPKIPEPVTVLTGNRLNLHNKILKLIRENDLEEAAL 120

Query: 121 FTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYL 180
           +TRHS+YSNCRPT++TVNAVLNA LRQSKY+DLLSLHRFIT AG+APN+ITHNLIFQTYL
Sbjct: 121 YTRHSVYSNCRPTVYTVNAVLNAQLRQSKYADLLSLHRFITLAGIAPNVITHNLIFQTYL 180

Query: 181 DCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPDP 240
           DC+KPDTA+EHYKQ  N++P NPSPTTYRIL+KGLVDN KLE+A+E+KEEM  KG  PDP
Sbjct: 181 DCKKPDTALEHYKQFSNESPVNPSPTTYRILVKGLVDNGKLEKALEMKEEMVEKGLAPDP 240

Query: 241 LIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCY 300
           ++Y YL++GC +S D DG+FKLFEELKEK  G +EDGV+YG LMKGYFM+ ME+EAM CY
Sbjct: 241 VVYSYLILGCAKSGDSDGIFKLFEELKEKKDGVLEDGVIYGGLMKGYFMRGMEKEAMECY 300

Query: 301 EETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNVM 360
           EE  G NS VKMSA+AYN VLDAL KNGKF EAL LFDRM  EHSPPRRLAV+LGSFNV+
Sbjct: 301 EEACGENSKVKMSAVAYNYVLDALSKNGKFDEALRLFDRMKNEHSPPRRLAVNLGSFNVI 360

Query: 361 VDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKGV 420
            DGYC EG+FK+A+E F  MGDYRCSPDTLSFNNLI+QLC NG+L +AE+LYG MGDKGV
Sbjct: 361 ADGYCAEGKFKEAMEAFRLMGDYRCSPDTLSFNNLIDQLCQNGLLGEAEDLYGEMGDKGV 420

Query: 421 NPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAKA 480
           NPDE+TY LLMD+CFK +R DD A YFRKMVESGLRPN+AVYNRLVDELVK+GK++EAK+
Sbjct: 421 NPDEYTYVLLMDACFKVDRIDDGASYFRKMVESGLRPNLAVYNRLVDELVKVGKVDEAKS 480

Query: 481 FFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGELR 540
           F+D MVKKLKMD +SY+F++KALS+ G+LD +L +VD +LDD+ ++F+EELQEFV+ ELR
Sbjct: 481 FYDTMVKKLKMDDASYKFMIKALSDVGKLDVVLKMVDEMLDDESVDFNEELQEFVKEELR 540

Query: 541 KEDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES- 600
            E RE DL KLMEEKER+KAEAKA+E EAAEA KRSAKAAVSSLL SKLFG KE E +S 
Sbjct: 541 NEGREEDLTKLMEEKERLKAEAKAREIEAAEAAKRSAKAAVSSLLPSKLFGKKEDESQST 600

Query: 601 AVNEMQSGQEDSGKTELAESN 620
           A NE        G+ +  + N
Sbjct: 601 AANESTIEAASEGEVQAQDVN 617

BLAST of CmaCh16G000480 vs. TAIR10
Match: AT3G49240.1 (AT3G49240.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 763.8 bits (1971), Expect = 7.9e-221
Identity = 389/617 (63.05%), Positives = 492/617 (79.74%), Query Frame = 1

Query: 1   MALSKPTFFTHLRTLTRSQHLLHQAPAPPIVTLRFLSFASPEEAAAERRRRKRRLRIEPP 60
           M++SK  F  HL+TL+RS    H+    P + +R++SFA+ EEAAAERRRRKRRLR+EPP
Sbjct: 1   MSISKAAFLNHLQTLSRSYR--HRVLPQPFLAVRYMSFATQEEAAAERRRRKRRLRMEPP 60

Query: 61  LSSSSAARPQSQPSKPQSPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAALF 120
           ++S + ++ Q Q   P+  QNPN PKLPE +SAL G RL+LHN IL LIRENDLEEAAL+
Sbjct: 61  VNSFNRSQ-QQQSQIPRPIQNPNIPKLPESVSALVGKRLDLHNHILKLIRENDLEEAALY 120

Query: 121 TRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYLD 180
           TRHS+YSNCRPTIFTVN VL A LRQ+KY  LL LH FI QAG+APNIIT+NLIFQ YLD
Sbjct: 121 TRHSVYSNCRPTIFTVNTVLAAQLRQAKYGALLQLHGFINQAGIAPNIITYNLIFQAYLD 180

Query: 181 CRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPDPL 240
            RKP+ A+EHYK  I++AP NPS  T+RIL+KGLV N+ LE+AME+KE+M +KGFV DP+
Sbjct: 181 VRKPEIALEHYKLFIDNAPLNPSIATFRILVKGLVSNDNLEKAMEIKEDMAVKGFVVDPV 240

Query: 241 IYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCYE 300
           +Y YLM+GCV++SD DGV KL++ELKEKLGG V+DGVVYG LMKGYFMKEME+EAM CYE
Sbjct: 241 VYSYLMMGCVKNSDADGVLKLYQELKEKLGGFVDDGVVYGQLMKGYFMKEMEKEAMECYE 300

Query: 301 ETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNVMV 360
           E VG NS V+MSA+AYN VL+AL +NGKF EAL LFD + KEH+PPR LAV+LG+FNVMV
Sbjct: 301 EAVGENSKVRMSAMAYNYVLEALSENGKFDEALKLFDAVKKEHNPPRHLAVNLGTFNVMV 360

Query: 361 DGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKGVN 420
           +GYC  G+F++A+EVF +MGD++CSPDTLSFNNL+ QLC+N +LA+AE+LYG M +K V 
Sbjct: 361 NGYCAGGKFEEAMEVFRQMGDFKCSPDTLSFNNLMNQLCDNELLAEAEKLYGEMEEKNVK 420

Query: 421 PDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAKAF 480
           PDE+TYGLLMD+CFK  + D+ A Y++ MVES LRPN+AVYNRL D+L+K GK+++AK+F
Sbjct: 421 PDEYTYGLLMDTCFKEGKIDEGAAYYKTMVESNLRPNLAVYNRLQDQLIKAGKLDDAKSF 480

Query: 481 FDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGELRK 540
           FD+MV KLKMD  +Y+FIM+ALSE+G+LDEML +VD +LDDD +  SEELQEFV+ ELRK
Sbjct: 481 FDMMVSKLKMDDEAYKFIMRALSEAGRLDEMLKIVDEMLDDDTVRVSEELQEFVKEELRK 540

Query: 541 EDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKESAV 600
             REGDL KLMEEKER+KAEAKAKE   AE +K++    +++L+  K    K+   +   
Sbjct: 541 GGREGDLEKLMEEKERLKAEAKAKELADAEEKKKAQSINIAALIPPKAVEEKKETAKLLW 600

Query: 601 NEMQSGQEDSGKTELAE 618
                G E++   E+A+
Sbjct: 601 ENEAGGVEEADVVEMAK 614
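
The percentages in each HSP header are the matched counts divided by the alignment length. The following is a minimal Python sketch that recomputes them from a header line, assuming the line layout used in this report; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and not part of any BLAST tool.

```python
import re

# Matches the "Identity = a/n (x%), Positives = b/n (y%)" part of an HSP header.
# "Pos\w*ives" also tolerates the "Postives" spelling seen in some report renderings.
HSP_STATS = re.compile(
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Pos\w*ives = (?P<pos>\d+)/\d+ \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident = int(m.group("ident"))
    pos = int(m.group("pos"))
    length = int(m.group("length"))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": length,
        "identity_pct": round(100.0 * ident / length, 2),
        "positive_pct": round(100.0 * pos / length, 2),
    }


if __name__ == "__main__":
    header = "Identity = 389/617 (63.05%), Positives = 492/617 (79.74%), Query Frame = 1"
    print(parse_hsp_stats(header))
```

Applied to the AT3G49240.1 header above, the recomputed values agree with the reported 63.05% identity and 79.74% positives.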

BLAST of CmaCh16G000480 vs. TAIR10
Match: AT1G10270.1 (AT1G10270.1 glutamine-rich protein 23)

HSP 1 Score: 305.4 bits (781), Expect = 7.7e-83
Identity = 193/541 (35.67%), Positives = 298/541 (55.08%), Query Frame = 1

Query: 23  HQAPAP-PIVTLRFLSFASPEEAAAERRRRKRRLRIEPPLSSSSAARPQSQPSKPQSPQN 82
           H  P P P +  R ++F+S EEAAAERRRRKRRLRIEPPL +      +  PS P   ++
Sbjct: 74  HTPPIPYPPIPHRTMAFSSAEEAAAERRRRKRRLRIEPPLHAL-----RRDPSAPPPKRD 133

Query: 83  PNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLN 142
           PNAP+LP+  SAL G RLNLHNR+ +LIR +DL+ A+   R S++SN RPT+FT NA++ 
Sbjct: 134 PNAPRLPDSTSALVGQRLNLHNRVQSLIRASDLDAASKLARQSVFSNTRPTVFTCNAIIA 193

Query: 143 ALLRQSKYSDLLSLHR-FITQAGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPF 202
           A+ R  +YS+ +SL + F  Q+ + PN++++N I   + D    D A+E Y+ ++ +APF
Sbjct: 194 AMYRAKRYSESISLFQYFFKQSNIVPNVVSYNQIINAHCDEGNVDEALEVYRHILANAPF 253

Query: 203 NPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPDPLIYLYLMVGCVRSSDPDGVFK 262
            PS  TYR L KGLV   ++  A  L  EM  KG   D  +Y  L+ G +   D D   +
Sbjct: 254 APSSVTYRHLTKGLVQAGRIGDAASLLREMLSKGQAADSTVYNNLIRGYLDLGDFDKAVE 313

Query: 263 LFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCYEETVGVNSVVKMSAIAYNSVL 322
            F+ELK K   TV DG+V  + M+ +F K  ++EAM  Y     ++   +M     N +L
Sbjct: 314 FFDELKSKC--TVYDGIVNATFMEYWFEKGNDKEAMESYRSL--LDKKFRMHPPTGNVLL 373

Query: 323 DALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNVMVDGYCIEGRFKDAIEVFEKMG 382
           +   K GK  EA  LF+ M   H+PP  L+V+  +  +MV+     G F +AI  F+K+G
Sbjct: 374 EVFLKFGKKDEAWALFNEMLDNHAPPNILSVNSDTVGIMVNECFKMGEFSEAINTFKKVG 433

Query: 383 DYRCSP----DTLSFNNLIEQLCNNGMLADAEELYGTMGDKGVNPDEFTYGLLMDSCFKA 442
               S     D L + N++ + C  GML +AE  +     + +  D  ++  ++D+  KA
Sbjct: 434 SKVTSKPFVMDYLGYCNIVTRFCEQGMLTEAERFFAEGVSRSLPADAPSHRAMIDAYLKA 493

Query: 443 NRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINE-AKAFFDLMVKKLKMDASSY 502
            R DDA     +MV+  LR       R+  EL+K GK+ E A+    +  ++ K D S Y
Sbjct: 494 ERIDDAVKMLDRMVDVNLRVVADFGARVFGELIKNGKLTESAEVLTKMGEREPKPDPSIY 553

Query: 503 QFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGELRKEDREGDLGKLMEEKE 557
             +++ L +   LD+  ++V  ++  + +  +  L+EF+     K  R  ++ K++    
Sbjct: 554 DVVVRGLCDGDALDQAKDIVGEMIRHN-VGVTTVLREFIIEVFEKAGRREEIEKILNSVA 604

BLAST of CmaCh16G000480 vs. TAIR10
Match: AT1G09900.1 (AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 177.6 bits (449), Expect = 2.4e-44
Identity = 106/413 (25.67%), Positives = 199/413 (48.18%), Query Frame = 1

Query: 102 HNRILTLIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQ 161
           +N +  ++R  +LEE   F  + +Y    P I     ++    R  K      +   +  
Sbjct: 106 NNHLRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEG 165

Query: 162 AGVAPNIITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLE 221
           +G  P++IT+N++   Y    + + A+     +++    +P   TY  +++ L D+ KL+
Sbjct: 166 SGAVPDVITYNVMISGYCKAGEINNALS----VLDRMSVSPDVVTYNTILRSLCDSGKLK 225

Query: 222 RAMELKEEMTMKGFVPDPLIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGS 281
           +AME+ + M  +   PD + Y  L+    R S      KL +E++++  G   D V Y  
Sbjct: 226 QAMEVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDR--GCTPDVVTYNV 285

Query: 282 LMKGYFMKEMEEEAMRCYEETVGVNSVVKMSAIAYNSVLDALCKNGKFGEA-LMLFDRMT 341
           L+ G   +   +EA++   +    +S  + + I +N +L ++C  G++ +A  +L D + 
Sbjct: 286 LVNGICKEGRLDEAIKFLNDMP--SSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLR 345

Query: 342 KEHSPPRRLAVDLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCN 401
           K  SP       + +FN++++  C +G    AI++ EKM  + C P++LS+N L+   C 
Sbjct: 346 KGFSP------SVVTFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCK 405

Query: 402 NGMLADAEELYGTMGDKGVNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAV 461
              +  A E    M  +G  PD  TY  ++ +  K  + +DA     ++   G  P +  
Sbjct: 406 EKKMDRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLIT 465

Query: 462 YNRLVDELVKLGKINEA-KAFFDLMVKKLKMDASSYQFIMKALSESGQLDEML 513
           YN ++D L K GK  +A K   ++  K LK D  +Y  ++  LS  G++DE +
Sbjct: 466 YNTVIDGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAI 504

BLAST of CmaCh16G000480 vs. TAIR10
Match: AT1G12620.1 (AT1G12620.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 169.5 bits (428), Expect = 6.6e-42
Identity = 111/418 (26.56%), Positives = 198/418 (47.37%), Query Frame = 1

Query: 108 LIRENDLEEAALFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPN 167
           L  E  + EA       +    +PT+ T+NA++N L    K SD + L   + + G  PN
Sbjct: 152 LCLEGRVSEALELVDRMVEMGHKPTLITLNALVNGLCLNGKVSDAVLLIDRMVETGFQPN 211

Query: 168 IITHNLIFQTYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELK 227
            +T+  + +      +   AME  +++  +         Y I+I GL  +  L+ A  L 
Sbjct: 212 EVTYGPVLKVMCKSGQTALAMELLRKM-EERKIKLDAVKYSIIIDGLCKDGSLDNAFNLF 271

Query: 228 EEMTMKGFVPDPLIYLYLMVG-CVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGY 287
            EM +KGF  D +IY  L+ G C      DG   L + +K K+     D V + +L+  +
Sbjct: 272 NEMEIKGFKADIIIYTTLIRGFCYAGRWDDGAKLLRDMIKRKI---TPDVVAFSALIDCF 331

Query: 288 FMKEMEEEAMRCYEETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPP 347
             +    EA   ++E +     +    + Y S++D  CK  +  +A  + D M  +   P
Sbjct: 332 VKEGKLREAEELHKEMI--QRGISPDTVTYTSLIDGFCKENQLDKANHMLDLMVSKGCGP 391

Query: 348 RRLAVDLGSFNVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLAD 407
                ++ +FN++++GYC      D +E+F KM       DT+++N LI+  C  G L  
Sbjct: 392 -----NIRTFNILINGYCKANLIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLEV 451

Query: 408 AEELYGTMGDKGVNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVD 467
           A+EL+  M  + V PD  +Y +L+D       P+ A   F K+ +S +  +I +YN ++ 
Sbjct: 452 AKELFQEMVSRRVRPDIVSYKILLDGLCDNGEPEKALEIFEKIEKSKMELDIGIYNIIIH 511

Query: 468 ELVKLGKINEA-KAFFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDG 524
            +    K+++A   F  L +K +K D  +Y  ++  L + G L E  +++   +++DG
Sbjct: 512 GMCNASKVDDAWDLFCSLPLKGVKPDVKTYNIMIGGLCKKGSLSE-ADLLFRKMEEDG 557

BLAST of CmaCh16G000480 vs. TAIR10
Match: AT1G63070.1 (AT1G63070.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 162.2 bits (409), Expect = 1.1e-39
Identity = 105/417 (25.18%), Positives = 212/417 (50.84%), Query Frame = 1

Query: 133 IFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYLDCRKPDTAMEHYK 192
           ++T +  +N   R+S+ S  L++   + + G  P+I+T N +   +    +   A+    
Sbjct: 110 LYTYSIFINYFCRRSQLSLALAILGKMMKLGYGPSIVTLNSLLNGFCHGNRISEAVALVD 169

Query: 193 QLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPDPLIYLYLMVGCVRS 252
           Q++ +  + P   T+  L+ GL  +NK   A+ L E M +KG  PD + Y  ++ G  + 
Sbjct: 170 QMV-EMGYQPDTVTFTTLVHGLFQHNKASEAVALVERMVVKGCQPDLVTYGAVINGLCKR 229

Query: 253 SDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCYE--ETVGVNSVVK 312
            +PD    L  ++++  G    D V+Y +++ G    +  ++A   +   ET G+    K
Sbjct: 230 GEPDLALNLLNKMEK--GKIEADVVIYNTIIDGLCKYKHMDDAFDLFNKMETKGI----K 289

Query: 313 MSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNVMVDGYCIEGRFK 372
                YN ++  LC  G++ +A  L   M +++  P     DL  FN ++D +  EG+  
Sbjct: 290 PDVFTYNPLISCLCNYGRWSDASRLLSDMLEKNINP-----DLVFFNALIDAFVKEGKLV 349

Query: 373 DAIEVFEKMGDYR-CSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKGVNPDEFTYGLL 432
           +A +++++M   + C PD +++N LI+  C    + +  E++  M  +G+  +  TY  L
Sbjct: 350 EAEKLYDEMVKSKHCFPDVVAYNTLIKGFCKYKRVEEGMEVFREMSQRGLVGNTVTYTTL 409

Query: 433 MDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAKAFFDLMVKK-L 492
           +   F+A   D+A   F++MV  G+ P+I  YN L+D L   G +  A   F+ M K+ +
Sbjct: 410 IHGFFQARDCDNAQMVFKQMVSDGVHPDIMTYNILLDGLCNNGNVETALVVFEYMQKRDM 469

Query: 493 KMDASSYQFIMKALSESGQLDEMLNVVDTL----LDDDGIEFSEELQEFVRGELRKE 542
           K+D  +Y  +++AL ++G++++  ++  +L    +  + + ++  +  F R  L++E
Sbjct: 470 KLDIVTYTTMIEALCKAGKVEDGWDLFCSLSLKGVKPNVVTYTTMMSGFCRKGLKEE 514

BLAST of CmaCh16G000480 vs. NCBI nr
Match: gi|659133624|ref|XP_008466825.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Cucumis melo])

HSP 1 Score: 1067.4 bits (2759), Expect = 9.5e-309
Identity = 557/624 (89.26%), Positives = 585/624 (93.75%), Query Frame = 1

Query: 1   MALSKPTFFTHLRTLTRSQHLLH-QAPAP-PIVTLRFLSFASPEEAAAERRRRKRRLRIE 60
           MALSKP FFTHL+TLT S HLL  QAPAP PIVT RFLSFAS EEA AERRRRKRRLRIE
Sbjct: 1   MALSKPAFFTHLKTLTGSHHLLQRQAPAPLPIVTFRFLSFASAEEADAERRRRKRRLRIE 60

Query: 61  PPLSSSSAARPQSQPSKPQSPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           PPLSSSSAARPQSQPS+ Q+PQNPN PK+PEHISALSGNRLNLHNRILTLIRENDLEEAA
Sbjct: 61  PPLSSSSAARPQSQPSRSQTPQNPNTPKVPEHISALSGNRLNLHNRILTLIRENDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTY 180
           LFTRHSIYSNCRPTIFTVNAVLNA LRQSKY+DLLSLHRFITQAGV PNIITHNLIFQTY
Sbjct: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVVPNIITHNLIFQTY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPD 240
           LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDN KLERAMELKEEM +KGF PD
Sbjct: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNKKLERAMELKEEMIVKGFAPD 240

Query: 241 PLIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           PLIY YLM GCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYG+LMKGYFMKEMEEEAM+C
Sbjct: 241 PLIYHYLMAGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGNLMKGYFMKEMEEEAMKC 300

Query: 301 YEETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNV 360
           YEETVG N VVKMSAIAYNSVLDALCK+GKF EAL LFDRMTKEH PPR LAV+LG+FNV
Sbjct: 301 YEETVGDNPVVKMSAIAYNSVLDALCKHGKFSEALTLFDRMTKEHRPPRHLAVNLGTFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKG 420
           MVDGYCI+GRFK+AI VFE+MGDYRCSPDTLSFNNLIEQLCNNGMLA+AE LYGTMG+KG
Sbjct: 361 MVDGYCIKGRFKEAIGVFEEMGDYRCSPDTLSFNNLIEQLCNNGMLAEAEMLYGTMGEKG 420

Query: 421 VNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAK 480
           VNPDEFTYGLLM SCF+ NR DDAA YFRKMV+SGLRPNIAVYN LV ELVKLGK++EAK
Sbjct: 421 VNPDEFTYGLLMHSCFQKNRADDAAAYFRKMVDSGLRPNIAVYNILVGELVKLGKVDEAK 480

Query: 481 AFFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGEL 540
           +FFDLMVKKLKMDAS+YQFIMKALSESG++DE+LNVVDTLLDDDGIEFSEELQEFVRGEL
Sbjct: 481 SFFDLMVKKLKMDASNYQFIMKALSESGKMDEVLNVVDTLLDDDGIEFSEELQEFVRGEL 540

Query: 541 RKEDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           RKEDRE DL KL+EEKER+KAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLF NKEGEKES
Sbjct: 541 RKEDREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFANKEGEKES 600

Query: 601 AVNEMQSGQ--EDSGKTELAESNP 621
            VNEMQSGQ  +D GKTELAESNP
Sbjct: 601 VVNEMQSGQQEDDGGKTELAESNP 624

BLAST of CmaCh16G000480 vs. NCBI nr
Match: gi|449456969|ref|XP_004146221.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Cucumis sativus])

HSP 1 Score: 1065.1 bits (2753), Expect = 4.7e-308
Identity = 559/624 (89.58%), Positives = 585/624 (93.75%), Query Frame = 1

Query: 1   MALSKPTFFTHLRTLTRSQHLLH-QAPAP-PIVTLRFLSFASPEEAAAERRRRKRRLRIE 60
           MALSKP FFTHL+TLT S HLL  QA AP PIVTLRFLSFAS EEA AERRRRKRRLRIE
Sbjct: 1   MALSKPAFFTHLKTLTGSHHLLQRQALAPFPIVTLRFLSFASAEEADAERRRRKRRLRIE 60

Query: 61  PPLSSSSAARPQSQPSKPQSPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           PPLSSSSAARP +QP + Q+PQNPNAPK+PEHISALSGNRLNLHNRILTLIRENDLEEAA
Sbjct: 61  PPLSSSSAARPLTQPPRSQTPQNPNAPKIPEHISALSGNRLNLHNRILTLIRENDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTY 180
           LFTRHSIYSNCRPTIFTVNAVLNA LRQSKY+DLLSLHRFITQAGV PNIITHNLIFQTY
Sbjct: 121 LFTRHSIYSNCRPTIFTVNAVLNAQLRQSKYADLLSLHRFITQAGVVPNIITHNLIFQTY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPD 240
           LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELK+EM  KGF PD
Sbjct: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKDEMIEKGFAPD 240

Query: 241 PLIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           PLIY YLM GCVRS DPDGVFKLFEELKEKLG TVEDGVVYG+LMKGYFMKEMEEEAM+C
Sbjct: 241 PLIYHYLMGGCVRSLDPDGVFKLFEELKEKLGATVEDGVVYGNLMKGYFMKEMEEEAMKC 300

Query: 301 YEETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNV 360
           YEETVG NSVVKMSAIAYNSVLDALC+NGKFGEAL LFDRMTKEH PPR LAV+LGSFNV
Sbjct: 301 YEETVGDNSVVKMSAIAYNSVLDALCRNGKFGEALTLFDRMTKEHRPPRHLAVNLGSFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKG 420
           MVDGYCIEGRFK+AIEVFEKMGDYRC PDTLSFNNLIEQLCNNGMLA+AE LYGTM DKG
Sbjct: 361 MVDGYCIEGRFKEAIEVFEKMGDYRCCPDTLSFNNLIEQLCNNGMLAEAEMLYGTMDDKG 420

Query: 421 VNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAK 480
           VNPDEFTYGLLMDSCFK NR DDAA YFRKMV+SGLRPNIAVYN LVDELVKLGKI++AK
Sbjct: 421 VNPDEFTYGLLMDSCFKKNRADDAAAYFRKMVDSGLRPNIAVYNILVDELVKLGKIDDAK 480

Query: 481 AFFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGEL 540
           +FFDLMVKKLKMDASSYQFIMKALSESG++DE+LNVVDTLLDDDGIEFSEELQEFVRGEL
Sbjct: 481 SFFDLMVKKLKMDASSYQFIMKALSESGKMDEILNVVDTLLDDDGIEFSEELQEFVRGEL 540

Query: 541 RKEDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           RKE+RE DL KL+EEKER+KAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLF NKEGEKES
Sbjct: 541 RKENREEDLAKLVEEKERLKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFANKEGEKES 600

Query: 601 AVNEMQS--GQEDSGKTELAESNP 621
            VNEMQS   ++DSGKTELAES+P
Sbjct: 601 VVNEMQSVEQEDDSGKTELAESSP 624

BLAST of CmaCh16G000480 vs. NCBI nr
Match: gi|645258146|ref|XP_008234749.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Prunus mume])

HSP 1 Score: 937.6 bits (2422), Expect = 1.1e-269
Identity = 481/620 (77.58%), Positives = 545/620 (87.90%), Query Frame = 1

Query: 1   MALSKPTFFTHLRTLTRSQHLLHQAPAPPIVTLRFLSFASPEEAAAERRRRKRRLRIEPP 60
           MALSKPTF THLRTL +  +  H  P P  ++LRFLSFA+PEEAAAERRRRKRRLRIEPP
Sbjct: 1   MALSKPTFLTHLRTLAKPPNCHHPTPPPSFISLRFLSFATPEEAAAERRRRKRRLRIEPP 60

Query: 61  LSS--SSAARPQSQPSKPQSPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAA 120
           LSS   +  + Q Q   P+  QNPNAPKLPE +SALSGNRLNLHNRILTL+R+NDLEEAA
Sbjct: 61  LSSLHRNQQQQQQQQQSPKPQQNPNAPKLPEPVSALSGNRLNLHNRILTLVRQNDLEEAA 120

Query: 121 LFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTY 180
           L+TRHSIYSNCRPTIFTVN+VL A LRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTY
Sbjct: 121 LYTRHSIYSNCRPTIFTVNSVLTAQLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTY 180

Query: 181 LDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPD 240
           LDCRKPDTAME+YKQLINDAPFNPSPTTYRILIKGLVDNNKL+RAMELKEE+ +KGF PD
Sbjct: 181 LDCRKPDTAMENYKQLINDAPFNPSPTTYRILIKGLVDNNKLDRAMELKEEIDVKGFAPD 240

Query: 241 PLIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRC 300
           P++Y YLMVGCV++SD DGVFKL+EELKEKLGG VEDG+VYG+LMKGYFM+ ME+EAM C
Sbjct: 241 PVVYHYLMVGCVKNSDSDGVFKLYEELKEKLGGVVEDGIVYGNLMKGYFMRGMEKEAMEC 300

Query: 301 YEETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNV 360
           YEE++  +S VKMSA+AYNSVLDAL KNGKF EAL LFDRM  EH+PPRRLAV+LGSFNV
Sbjct: 301 YEESLRESSKVKMSAVAYNSVLDALSKNGKFDEALRLFDRMVAEHNPPRRLAVNLGSFNV 360

Query: 361 MVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKG 420
           M DGYC EGRFK+AIEVF KMGDYRCSPDTLSFNNLIEQLC NGML++AEELYG M DKG
Sbjct: 361 MADGYCAEGRFKEAIEVFRKMGDYRCSPDTLSFNNLIEQLCKNGMLSEAEELYGEMSDKG 420

Query: 421 VNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAK 480
           VN DE+TY LLMD+CF+ NR DDAA YFRKMV++ LRPN+AVYNRLVD L+K+GK++EAK
Sbjct: 421 VNADEYTYVLLMDTCFEENRADDAAEYFRKMVDAKLRPNLAVYNRLVDGLIKVGKVDEAK 480

Query: 481 AFFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGEL 540
           +FFDLMVKKLKMD  SYQFIMK LSE+G+LDE+LNVV+T+LDDDG+EF+EELQEFV+GE+
Sbjct: 481 SFFDLMVKKLKMDIPSYQFIMKTLSEAGKLDEVLNVVNTMLDDDGVEFNEELQEFVKGEM 540

Query: 541 RKEDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKES 600
           RKE RE ++GKLMEEKER KAEAKAKEAEAAEA KRSA+AAVSSLL SKLFGNKE E  S
Sbjct: 541 RKEGREDEVGKLMEEKERQKAEAKAKEAEAAEAAKRSARAAVSSLLPSKLFGNKESETGS 600

Query: 601 AVNEMQSGQEDSGKTELAES 619
             +   +G  ++  T+ AE+
Sbjct: 601 TQSTENAG--EAASTQPAEA 618

BLAST of CmaCh16G000480 vs. NCBI nr
Match: gi|596020758|ref|XP_007218947.1| (hypothetical protein PRUPE_ppa002582mg [Prunus persica])

HSP 1 Score: 934.9 bits (2415), Expect = 7.4e-269
Identity = 481/619 (77.71%), Positives = 542/619 (87.56%), Query Frame = 1

Query: 1   MALSKPTFFTHLRTLTRSQHLLHQAPAPPIVTLRFLSFASPEEAAAERRRRKRRLRIEPP 60
           MALSKPTF THLRTL +  +  H    P  ++LRFLSFA+PEEAAAERRRRKRRLRIEPP
Sbjct: 1   MALSKPTFLTHLRTLAKPPNCHHPTTPPSFISLRFLSFATPEEAAAERRRRKRRLRIEPP 60

Query: 61  LSSSSAARPQSQPSK-PQSPQNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEEAAL 120
           LSS    + Q Q  + P+  QNPNAPKLPE +SALSGNRLNLHNRILTL+R+NDLEEAAL
Sbjct: 61  LSSLHRNQQQQQQQQSPKPQQNPNAPKLPEPVSALSGNRLNLHNRILTLVRQNDLEEAAL 120

Query: 121 FTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYL 180
           +TRHSIYSNCRPTIFTVN+VL A LRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYL
Sbjct: 121 YTRHSIYSNCRPTIFTVNSVLTAQLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQTYL 180

Query: 181 DCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFVPDP 240
           DCRKPDTAME+YKQLINDAPFNPSPTTYRILIKGLVDNNKL+RAMELKEE+  KGF PDP
Sbjct: 181 DCRKPDTAMENYKQLINDAPFNPSPTTYRILIKGLVDNNKLDRAMELKEEIDAKGFAPDP 240

Query: 241 LIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAMRCY 300
           ++Y YLMVGCV++SD DGVF+L+EELKEKLGG VEDG+VYG+LMKGYFM+ ME+EAM CY
Sbjct: 241 VVYHYLMVGCVKNSDSDGVFRLYEELKEKLGGVVEDGIVYGNLMKGYFMRGMEKEAMECY 300

Query: 301 EETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSFNVM 360
           EE+ G +S VK SA+AYNSVLDAL KNGKF EAL LFDRM  EH+PPRRLAV+LGSFNVM
Sbjct: 301 EESFGESSKVKTSAVAYNSVLDALSKNGKFDEALRLFDRMVAEHNPPRRLAVNLGSFNVM 360

Query: 361 VDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGDKGV 420
            DGYC++GRFK+AIEVF KMGDYRCSPDTLSFNNLIEQLC NGML++AEELYG M DKGV
Sbjct: 361 ADGYCVQGRFKEAIEVFRKMGDYRCSPDTLSFNNLIEQLCKNGMLSEAEELYGEMSDKGV 420

Query: 421 NPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINEAKA 480
            PDEFTY LLMD+CF+ NR DDAA YFRKMV++ LRPN+AVYNRLVD L+K+GK++EAK+
Sbjct: 421 YPDEFTYVLLMDTCFEENRADDAAEYFRKMVDAKLRPNLAVYNRLVDGLIKVGKVDEAKS 480

Query: 481 FFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRGELR 540
           FFDLMVKKLKMD  SYQFIMK LSE+G+LDE+LNVVDT+LDDDG+EF+EELQEFV+GELR
Sbjct: 481 FFDLMVKKLKMDIPSYQFIMKTLSEAGKLDEVLNVVDTMLDDDGVEFNEELQEFVKGELR 540

Query: 541 KEDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEKESA 600
           KE RE ++GKLMEEKER KAEAKAKEAEAAEA KRSA+AAVSSLL SKLFGNKE E  S 
Sbjct: 541 KEGREDEVGKLMEEKERQKAEAKAKEAEAAEAAKRSARAAVSSLLPSKLFGNKESETGST 600

Query: 601 VNEMQSGQEDSGKTELAES 619
                +G  ++  T+ AE+
Sbjct: 601 QATENAG--EAASTQPAEA 617

BLAST of CmaCh16G000480 vs. NCBI nr
Match: gi|1009123239|ref|XP_015878436.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Ziziphus jujuba])

HSP 1 Score: 933.7 bits (2412), Expect = 1.6e-268
Identity = 485/621 (78.10%), Positives = 544/621 (87.60%), Query Frame = 1

Query: 1   MALSKPTFFTHLRTLTRSQHLLHQAPAPP--IVTLRFLSFASPEEAAAERRRRKRRLRIE 60
           MALSKPTF  HL++L      L + P PP   ++LRFLSFA+PEEAAAERRRRKRRLRIE
Sbjct: 1   MALSKPTFLIHLKSLNAPHRHLRRLPPPPSSFISLRFLSFATPEEAAAERRRRKRRLRIE 60

Query: 61  PPLSSSSAARPQSQPSKPQSP--QNPNAPKLPEHISALSGNRLNLHNRILTLIRENDLEE 120
           PPLSS    + Q Q ++ QSP  QNPNAPKLPE ++ALSGNRLNLHNRIL LIR+NDLEE
Sbjct: 61  PPLSSLHRTQQQQQQAQTQSPKPQNPNAPKLPEPVTALSGNRLNLHNRILELIRKNDLEE 120

Query: 121 AALFTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQ 180
           AAL+TRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQ
Sbjct: 121 AALYTRHSIYSNCRPTIFTVNAVLNALLRQSKYSDLLSLHRFITQAGVAPNIITHNLIFQ 180

Query: 181 TYLDCRKPDTAMEHYKQLINDAPFNPSPTTYRILIKGLVDNNKLERAMELKEEMTMKGFV 240
           TYLDCRKPD AMEHYKQLINDAPFNPSPTTY+ILI GLVDNNKLERA+ELKEEM +KG  
Sbjct: 181 TYLDCRKPDIAMEHYKQLINDAPFNPSPTTYQILIAGLVDNNKLERALELKEEMDVKGIP 240

Query: 241 PDPLIYLYLMVGCVRSSDPDGVFKLFEELKEKLGGTVEDGVVYGSLMKGYFMKEMEEEAM 300
            +P++Y +LM+GCV++SD DGVF+L+EELKEKLGG+VEDGVVYGSLMKGYF++ ME+EAM
Sbjct: 241 ANPVVYHHLMLGCVKNSDADGVFRLYEELKEKLGGSVEDGVVYGSLMKGYFLRGMEKEAM 300

Query: 301 RCYEETVGVNSVVKMSAIAYNSVLDALCKNGKFGEALMLFDRMTKEHSPPRRLAVDLGSF 360
            CYEE VG NS VKMSA+AYNSVLDAL KNGKF EAL LFDRMTKEH+PP+RLAV+LGSF
Sbjct: 301 ECYEEAVGENSKVKMSAVAYNSVLDALSKNGKFDEALGLFDRMTKEHNPPKRLAVNLGSF 360

Query: 361 NVMVDGYCIEGRFKDAIEVFEKMGDYRCSPDTLSFNNLIEQLCNNGMLADAEELYGTMGD 420
           NVM DGYC +G FKDAIEVF KMGDYRCSPD LSFNNLIEQLCNNG+L +AEELYG M  
Sbjct: 361 NVMADGYCAQGSFKDAIEVFRKMGDYRCSPDALSFNNLIEQLCNNGLLTEAEELYGEMDG 420

Query: 421 KGVNPDEFTYGLLMDSCFKANRPDDAAGYFRKMVESGLRPNIAVYNRLVDELVKLGKINE 480
           KGVNPDE+T+ LLMD+CFK NRPDDAA YFRKM++S LRPN+AVYN+LVD LVK+GKI+E
Sbjct: 421 KGVNPDEYTFVLLMDACFKENRPDDAAEYFRKMIDSKLRPNLAVYNKLVDGLVKVGKIDE 480

Query: 481 AKAFFDLMVKKLKMDASSYQFIMKALSESGQLDEMLNVVDTLLDDDGIEFSEELQEFVRG 540
           AK+FFDLMVKKLKMD  SY+FIMKALSESG+ DE+LNVVDT+LDDDG+EF+EE+QEFV+G
Sbjct: 481 AKSFFDLMVKKLKMDVPSYEFIMKALSESGKFDEVLNVVDTMLDDDGVEFNEEVQEFVKG 540

Query: 541 ELRKEDREGDLGKLMEEKERVKAEAKAKEAEAAEAQKRSAKAAVSSLLSSKLFGNKEGEK 600
           ELRKE RE DL KLMEEKER KAEAKAKEAEAAEA KRSA+AAVSSLL SKLFGNKE + 
Sbjct: 541 ELRKEGREDDLVKLMEEKERQKAEAKAKEAEAAEAAKRSARAAVSSLLPSKLFGNKESDT 600

Query: 601 ESAVNEMQSGQEDSGKTELAE 618
            SA  E      ++GKT +AE
Sbjct: 601 GSA--EANGNAIEAGKTGIAE 619

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PP273_ARATH	1.4e-219	63.05	Pentatricopeptide repeat-containing protein At3g49240 OS=Arabidopsis thaliana GN...
PPR29_ARATH	1.4e-81	35.67	Pentatricopeptide repeat-containing protein At1g10270 OS=Arabidopsis thaliana GN...
PPR28_ARATH	4.3e-43	25.67	Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN...
PPR37_ARATH	1.2e-40	26.56	Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana GN...
PPR97_ARATH	1.9e-38	25.18	Pentatricopeptide repeat-containing protein At1g63070, mitochondrial OS=Arabidop...

Match Name	E-value	Identity	Description
E5GB98_CUCME	6.7e-309	89.26	Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo PE=4 SV=...
A0A0A0L7B2_CUCSA	3.3e-308	89.58	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G239860 PE=4 SV=1
M5X3R7_PRUPE	5.1e-269	77.71	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002582mg PE=4 SV=1
W9RP26_9ROSA	4.7e-262	74.22	Uncharacterized protein OS=Morus notabilis GN=L484_006813 PE=4 SV=1
A0A061G122_THECC	7.7e-249	70.85	Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_...

Match Name	E-value	Identity	Description
AT3G49240.1	7.9e-221	63.05	Pentatricopeptide repeat (PPR) superfamily protein
AT1G10270.1	7.7e-83	35.67	glutamine-rich protein 23
AT1G09900.1	2.4e-44	25.67	Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G12620.1	6.6e-42	26.56	Pentatricopeptide repeat (PPR) superfamily protein
AT1G63070.1	1.1e-39	25.18	pentatricopeptide (PPR) repeat-containing protein

Match Name	E-value	Identity	Description
gi|659133624|ref|XP_008466825.1|	9.5e-309	89.26	PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Cucumis melo]
gi|449456969|ref|XP_004146221.1|	4.7e-308	89.58	PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Cucumis sativu...
gi|645258146|ref|XP_008234749.1|	1.1e-269	77.58	PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Prunus mume]
gi|596020758|ref|XP_007218947.1|	7.4e-269	77.71	hypothetical protein PRUPE_ppa002582mg [Prunus persica]
gi|1009123239|ref|XP_015878436.1|	1.6e-268	78.10	PREDICTED: pentatricopeptide repeat-containing protein At3g49240 [Ziziphus jujub...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR002885	Pentatricopeptide_repeat
IPR011990	TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0009793	embryo development ending in seed dormancy
biological_process	GO:0009960	endosperm development
biological_process	GO:0090305	nucleic acid phosphodiester bond hydrolysis
biological_process	GO:0006349	regulation of gene expression by genetic imprinting
biological_process	GO:0009451	RNA modification
biological_process	GO:0008150	biological_process
cellular_component	GO:0009507	chloroplast
cellular_component	GO:0005739	mitochondrion
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0005524	ATP binding
molecular_function	GO:0005515	protein binding
molecular_function	GO:0004519	endonuclease activity
molecular_function	GO:0003723	RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmaCh16G000480.1	CmaCh16G000480.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 242..268, score: 0.13; coord: 460..487, score: 0.013; coord: 315..342, score: 7.4E-8; coord: 206..235, score: 5.5E-4; coord: 277..301, score:
IPR002885	Pentatricopeptide repeat	PFAM	PF12854	PPR_1	coord: 352..379, score: 1.
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 386..434, score: 2.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13812	PPR_3	coord: 128..178, score: 0.
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756	coord: 355..388, score: 9.9E-9; coord: 206..238, score: 2.5E-6; coord: 460..487, score: 0.0022; coord: 424..458, score: 5.3E-8; coord: 315..343, score: 4.5E-8; coord: 390..423, score: 3.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR	coord: 132..166, score: 8.188; coord: 457..487, score: 8.934; coord: 387..421, score: 11.411; coord: 167..197, score: 6.61; coord: 275..309, score: 7.059; coord: 238..268, score: 7.772; coord: 312..346, score: 12.277; coord: 491..526, score: 7.695; coord: 203..237, score: 11.159; coord: 352..386, score: 12.233; coord: 422..456, score: 12
IPR011990	Tetratricopeptide-like helical domain	GENE3D	G3DSA:1.25.40.10		coord: 274..516, score: 9.7
None	No IPR available	unknown	Coil	Coil	coord: 540..579, score:
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 59..532, score: 6.7E-265; coord: 27..41, score: 6.7E
None	No IPR available	PANTHER	PTHR24015:SF237	SUBFAMILY NOT NAMED	coord: 27..41, score: 6.7E-265; coord: 59..532, score: 6.7E
None	No IPR available	unknown	SSF81901	HCP-like	coord: 177..338, score: 2.46E-5; coord: 322..499, score: 1.
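
The PROSITE profile (PS51375) matches above are listed out of order, which hides the tandem arrangement of the PPR motifs. The following is a minimal Python sketch that sorts them by start position and reports the spacing between consecutive repeats. It assumes the `coord: start..end` notation used in this table; the helper names `sorted_repeats` and `gaps_between` are hypothetical, and the scores are ignored.

```python
import re

# PS51375 alignment text for IPR002885, copied from the table above.
# Only the "coord: start..end" pairs are used; scores are not parsed.
RAW_PS51375 = """
coord: 132..166 score: 8.188 coord: 457..487 score: 8.934
coord: 387..421 score: 11.411 coord: 167..197 score: 6.61
coord: 275..309 score: 7.059 coord: 238..268 score: 7.772
coord: 312..346 score: 12.277 coord: 491..526 score: 7.695
coord: 203..237 score: 11.159 coord: 352..386 score: 12.233
coord: 422..456 score: 12
"""

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def sorted_repeats(text):
    """Return (start, end) residue ranges sorted by start position."""
    return sorted((int(start), int(end)) for start, end in COORD.findall(text))


def gaps_between(repeats):
    """Return the number of residues separating each consecutive pair of ranges."""
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(repeats, repeats[1:])]


repeats = sorted_repeats(RAW_PS51375)
gaps = gaps_between(repeats)

if __name__ == "__main__":
    for start, end in repeats:
        print("PPR profile match: %d..%d (%d residues)" % (start, end, end - start + 1))
    print("gaps between consecutive matches:", gaps)
```

Sorted this way, the table yields 11 profile matches spanning residues 132 to 526, with no more than 6 residues between neighbours, which is consistent with a contiguous array of PPR motifs.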

The following gene(s) are paralogous to this gene:

None