CmaCh10G007920 (gene) Cucurbita maxima (Rimu)

Name: CmaCh10G007920
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Pentatricopeptide repeat-containing protein
Location: Cma_Chr10 : 3645345 .. 3647207 (-)
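
The location field can be read programmatically. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the record does not say so explicitly), and the names `Location`, `parse_location` and `reverse_complement` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid : start .. end (strand)' location string."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


_LOC_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cma_Chr10 : 3645345 .. 3647207 (-)'."""
    m = _LOC_RE.match(text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA string; needed for features on the (-) strand."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


loc = parse_location("Cma_Chr10 : 3645345 .. 3647207 (-)")
print(loc.seqid, loc.strand, loc.length)  # length is 1863 under the inclusive assumption
```

Because the feature lies on the minus strand, a slice taken from the forward chromosome sequence would need `reverse_complement` applied before it could be compared with the sequences listed in this record.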
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCATCACTTCTTGCCCTTCAGCTCAGTGCATCCCCGATCAACTCCCCGAAAGTTCACACATCACCTATCCATGGCCTTAAGTCATGCTCATCCATGTCTGAGCTCAAGCAATATCACTCCCAGATTATCCGTCTTGGTCTTTCTACTGACAATGATGCCATGGGTCGTCTTGTCAAATTCTGTGCTGTATGCAAGAATGGAGACCTTGATTATGCACTTCTTTTGTTCAAGACAATCCCCAACCCAGATGCATACATCTACAATACCTTAATAAGAGGCTACTTACAACTCCAATTCCCTAGAGCTTGCTTACTGTTGTATTTGGAAATGCTGCATAAGTGTGTCCTTCCCAATAAATTTACATTTCCTTCTCTAATTCGTGCTTGCTGCATCGATAATGCTATTGAGGAAGGGAAGCAAATTCATGCACATGTTCTTAAATTTGGCTTCAGAGCAGATAGATTTTCTCAGAACAATTTAATTCATATGTATGCTAATTTTCAATCCTTGGAAGAAGCCAGAAGGGTCTTCGATGGTATTGAGCTACCCGATGTTGTGTCATGGACCACTTTGCTTACTGGGTATGCTCAGTGTGGATTTCTAGATGAAGCCTTACAAGTTTTCGAGTCAATGCCCGAGCACAGCTCTGCTTCCTGGAACGCCATGATTTCTTCTTTTGTCCAAAACAATCGATTTCATGAAGCATTTGCTTTGTTTAATCGGATGAGGTCGGAGAAGATTGTGTTGGACAAATACATGGCTGCTAGCATGTTATCAGCTTGCACAGGATTGGGAGCGCTTGAACAAGGAATATGGATACACAGATACATCAAGAAAAGTGAGATTAAATTAGACTCGAAACTAGCAACAACGCTCATCGACATGTATTGTAAATGCGGTTGCCTGGATCGTGCTTTTGAAGTGTTCACTCAGTTGCCTGAAAAGGGCATTTCTTCATGGAATTGCATGATTGGAGGGATGGCTATGCACGGGAAAGGAGAGGCAGCCATTGAGCTTTTCAAAGACATGGAGACCAAAATGGTGACACCAGACAACATAACTTTCCTTAATGTTCTCAATGCTTGTGCTCACTCCGGGTTAGTTGAAAAGGGACGCTACTATTTCAATCATTTCTCTCAAGTTTATGATATTAAACCCAGAACCGAGCATTATGGATGCATGGTTGATTTATATGGACGATCCGGGATGCTAGACGAAGCGATGAAGCTCATACGTGAGATGCCAATGAGTCCGGACGCCGGAGTGTTAGGTGCCTTCGTTGGAGCATGTAAAATCCATGGGAACATAGATATGGGGGAGGAAATAGGCAAGAGAGTAATAGAACTAGATCCAAGCAATAGCGGGCGCTACGTTCTACTCGGAAATCTGTACGCCGAGGCCGGTAGATGGGACAGTGTAGCAGAAGTAAGAAAACTAATGAATGATAGAGAAGTGAAGAAGGCTGCTGGGTTTTCGATGATTGAATTGGAGGGTGTGGTGCATGAATTCATTGCAGGAGGAAGGGGTCACCCAGAAGCCAAGGAAATATATGGCAAAGTGAATGAGATGATAGAATGTATAAGATGTGTAGGATATGTAACCGAGGAGGAGGAGGAGGAGGAGGTGGAGAAGGATAACCCTGTTTACTACCATAGTGAGAAACTGGCAGTTGCTTTTGGGTTGCTCAAGACTCAAGCAGGGGAAACGCTTCGAATCACTAAGAATCTGCGTGTCTGTAAGGATTGTCACCAAGCTCTTAAGCTTGTTTCTAAGGTGTTTGAACGAAAATTCATTGTTAGAGATAGAAATCGTTTCCATCATTTTGCTGATGGAGAGTGTTCTTGTAATGATTATTGGTAA

mRNA sequence

ATGTCATCACTTCTTGCCCTTCAGCTCAGTGCATCCCCGATCAACTCCCCGAAAGTTCACACATCACCTATCCATGGCCTTAAGTCATGCTCATCCATGTCTGAGCTCAAGCAATATCACTCCCAGATTATCCGTCTTGGTCTTTCTACTGACAATGATGCCATGGGTCGTCTTGTCAAATTCTGTGCTGTATGCAAGAATGGAGACCTTGATTATGCACTTCTTTTGTTCAAGACAATCCCCAACCCAGATGCATACATCTACAATACCTTAATAAGAGGCTACTTACAACTCCAATTCCCTAGAGCTTGCTTACTGTTGTATTTGGAAATGCTGCATAAGTGTGTCCTTCCCAATAAATTTACATTTCCTTCTCTAATTCGTGCTTGCTGCATCGATAATGCTATTGAGGAAGGGAAGCAAATTCATGCACATGTTCTTAAATTTGGCTTCAGAGCAGATAGATTTTCTCAGAACAATTTAATTCATATGTATGCTAATTTTCAATCCTTGGAAGAAGCCAGAAGGGTCTTCGATGGTATTGAGCTACCCGATGTTGTGTCATGGACCACTTTGCTTACTGGGTATGCTCAGTGTGGATTTCTAGATGAAGCCTTACAAGTTTTCGAGTCAATGCCCGAGCACAGCTCTGCTTCCTGGAACGCCATGATTTCTTCTTTTGTCCAAAACAATCGATTTCATGAAGCATTTGCTTTGTTTAATCGGATGAGGTCGGAGAAGATTGTGTTGGACAAATACATGGCTGCTAGCATGTTATCAGCTTGCACAGGATTGGGAGCGCTTGAACAAGGAATATGGATACACAGATACATCAAGAAAAGTGAGATTAAATTAGACTCGAAACTAGCAACAACGCTCATCGACATGTATTGTAAATGCGGTTGCCTGGATCGTGCTTTTGAAGTGTTCACTCAGTTGCCTGAAAAGGGCATTTCTTCATGGAATTGCATGATTGGAGGGATGGCTATGCACGGGAAAGGAGAGGCAGCCATTGAGCTTTTCAAAGACATGGAGACCAAAATGGTGACACCAGACAACATAACTTTCCTTAATGTTCTCAATGCTTGTGCTCACTCCGGGTTAGTTGAAAAGGGACGCTACTATTTCAATCATTTCTCTCAAGTTTATGATATTAAACCCAGAACCGAGCATTATGGATGCATGGTTGATTTATATGGACGATCCGGGATGCTAGACGAAGCGATGAAGCTCATACGTGAGATGCCAATGAGTCCGGACGCCGGAGTGTTAGGTGCCTTCGTTGGAGCATGTAAAATCCATGGGAACATAGATATGGGGGAGGAAATAGGCAAGAGAGTAATAGAACTAGATCCAAGCAATAGCGGGCGCTACGTTCTACTCGGAAATCTGTACGCCGAGGCCGGTAGATGGGACAGTGTAGCAGAAGTAAGAAAACTAATGAATGATAGAGAAGTGAAGAAGGCTGCTGGGTTTTCGATGATTGAATTGGAGGGTGTGGTGCATGAATTCATTGCAGGAGGAAGGGGTCACCCAGAAGCCAAGGAAATATATGGCAAAGTGAATGAGATGATAGAATGTATAAGATGTGTAGGATATGTAACCGAGGAGGAGGAGGAGGAGGAGGTGGAGAAGGATAACCCTGTTTACTACCATAGTGAGAAACTGGCAGTTGCTTTTGGGTTGCTCAAGACTCAAGCAGGGGAAACGCTTCGAATCACTAAGAATCTGCGTGTCTGTAAGGATTGTCACCAAGCTCTTAAGCTTGTTTCTAAGGTGTTTGAACGAAAATTCATTGTTAGAGATAGAAATCGTTTCCATCATTTTGCTGATGGAGAGTGTTCTTGTAATGATTATTGGTAA

Coding sequence (CDS)

ATGTCATCACTTCTTGCCCTTCAGCTCAGTGCATCCCCGATCAACTCCCCGAAAGTTCACACATCACCTATCCATGGCCTTAAGTCATGCTCATCCATGTCTGAGCTCAAGCAATATCACTCCCAGATTATCCGTCTTGGTCTTTCTACTGACAATGATGCCATGGGTCGTCTTGTCAAATTCTGTGCTGTATGCAAGAATGGAGACCTTGATTATGCACTTCTTTTGTTCAAGACAATCCCCAACCCAGATGCATACATCTACAATACCTTAATAAGAGGCTACTTACAACTCCAATTCCCTAGAGCTTGCTTACTGTTGTATTTGGAAATGCTGCATAAGTGTGTCCTTCCCAATAAATTTACATTTCCTTCTCTAATTCGTGCTTGCTGCATCGATAATGCTATTGAGGAAGGGAAGCAAATTCATGCACATGTTCTTAAATTTGGCTTCAGAGCAGATAGATTTTCTCAGAACAATTTAATTCATATGTATGCTAATTTTCAATCCTTGGAAGAAGCCAGAAGGGTCTTCGATGGTATTGAGCTACCCGATGTTGTGTCATGGACCACTTTGCTTACTGGGTATGCTCAGTGTGGATTTCTAGATGAAGCCTTACAAGTTTTCGAGTCAATGCCCGAGCACAGCTCTGCTTCCTGGAACGCCATGATTTCTTCTTTTGTCCAAAACAATCGATTTCATGAAGCATTTGCTTTGTTTAATCGGATGAGGTCGGAGAAGATTGTGTTGGACAAATACATGGCTGCTAGCATGTTATCAGCTTGCACAGGATTGGGAGCGCTTGAACAAGGAATATGGATACACAGATACATCAAGAAAAGTGAGATTAAATTAGACTCGAAACTAGCAACAACGCTCATCGACATGTATTGTAAATGCGGTTGCCTGGATCGTGCTTTTGAAGTGTTCACTCAGTTGCCTGAAAAGGGCATTTCTTCATGGAATTGCATGATTGGAGGGATGGCTATGCACGGGAAAGGAGAGGCAGCCATTGAGCTTTTCAAAGACATGGAGACCAAAATGGTGACACCAGACAACATAACTTTCCTTAATGTTCTCAATGCTTGTGCTCACTCCGGGTTAGTTGAAAAGGGACGCTACTATTTCAATCATTTCTCTCAAGTTTATGATATTAAACCCAGAACCGAGCATTATGGATGCATGGTTGATTTATATGGACGATCCGGGATGCTAGACGAAGCGATGAAGCTCATACGTGAGATGCCAATGAGTCCGGACGCCGGAGTGTTAGGTGCCTTCGTTGGAGCATGTAAAATCCATGGGAACATAGATATGGGGGAGGAAATAGGCAAGAGAGTAATAGAACTAGATCCAAGCAATAGCGGGCGCTACGTTCTACTCGGAAATCTGTACGCCGAGGCCGGTAGATGGGACAGTGTAGCAGAAGTAAGAAAACTAATGAATGATAGAGAAGTGAAGAAGGCTGCTGGGTTTTCGATGATTGAATTGGAGGGTGTGGTGCATGAATTCATTGCAGGAGGAAGGGGTCACCCAGAAGCCAAGGAAATATATGGCAAAGTGAATGAGATGATAGAATGTATAAGATGTGTAGGATATGTAACCGAGGAGGAGGAGGAGGAGGAGGTGGAGAAGGATAACCCTGTTTACTACCATAGTGAGAAACTGGCAGTTGCTTTTGGGTTGCTCAAGACTCAAGCAGGGGAAACGCTTCGAATCACTAAGAATCTGCGTGTCTGTAAGGATTGTCACCAAGCTCTTAAGCTTGTTTCTAAGGTGTTTGAACGAAAATTCATTGTTAGAGATAGAAATCGTTTCCATCATTTTGCTGATGGAGAGTGTTCTTGTAATGATTATTGGTAA

Protein sequence

MSSLLALQLSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALFNRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPDAGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKLMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEEEEEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFERKFIVRDRNRFHHFADGECSCNDYW
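
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a minimal sketch assuming the standard nuclear genetic code; `CODON_TABLE` and `translate` are hypothetical names introduced here for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1.

    Unknown codons (e.g. containing N) become 'X'. With to_stop=True,
    translation ends at the first stop codon, which is not emitted.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS and the last five (including the stop codon).
print(translate("ATGTCATCACTTCTTGCC"))  # MSSLLA
print(translate("AATGATTATTGGTAA"))     # NDYW
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.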
BLAST of CmaCh10G007920 vs. Swiss-Prot
Match: PP449_ARATH (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 2.2e-156
Identity = 270/600 (45.00%), Positives = 383/600 (63.83%), Query Frame = 1
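
The percentages on an HSP summary line are simply the ratios of the reported counts to the alignment length. The sketch below recomputes them from the text; `hsp_percentages` is a hypothetical helper, and the pattern deliberately accepts both spellings of the positives label that occur in exported reports.

```python
import re

# Matches "Identity = a/b (x%), Positives = c/d (y%)"; "Posi?tives" also
# tolerates the misspelt label "Postives" found in some exports.
_HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) recomputed from the counts."""
    m = _HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    pct_ident = round(100 * int(ident) / int(ident_len), 2)
    pct_pos = round(100 * int(pos) / int(pos_len), 2)
    return pct_ident, pct_pos


line = "Identity = 270/600 (45.00%), Positives = 383/600 (63.83%), Query Frame = 1"
print(hsp_percentages(line))
```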

Query: 27  LKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVCKNGD-LDYALLLFKTIPNPDA 86
           L+ CS   ELKQ H+++++ GL  D+ A+ + + FC    + D L YA ++F     PD 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 87  YIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQIHAH 146
           +++N +IRG+     P   LLLY  ML      N +TFPSL++AC   +A EE  QIHA 
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQ 140

Query: 147 VLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGFLDEA 206
           + K G+  D ++ N+LI+ YA   + + A  +FD I  PD VSW +++ GY + G +D A
Sbjct: 141 ITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIA 200

Query: 207 LQVFESMPEHSSASWNAMISSFVQNNRFHEAFALFNRMRSEKIVLDKYMAASMLSACTGL 266
           L +F  M E ++ SW  MIS +VQ +   EA  LF+ M++  +  D    A+ LSAC  L
Sbjct: 201 LTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQL 260

Query: 267 GALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMI 326
           GALEQG WIH Y+ K+ I++DS L   LIDMY KCG ++ A EVF  + +K + +W  +I
Sbjct: 261 GALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 320

Query: 327 GGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFSQVYDI 386
            G A HG G  AI  F +M+   + P+ ITF  VL AC+++GLVE+G+  F    + Y++
Sbjct: 321 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 380

Query: 387 KPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPDAGVLGAFVGACKIHGNIDMGEEIGK 446
           KP  EHYGC+VDL GR+G+LDEA + I+EMP+ P+A + GA + AC+IH NI++GEEIG+
Sbjct: 381 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGE 440

Query: 447 RVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKLMNDREVKKAAGFSMIELEGVVHEFI 506
            +I +DP + GRYV   N++A   +WD  AE R+LM ++ V K  G S I LEG  HEF+
Sbjct: 441 ILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFL 500

Query: 507 AGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEEE-----EEEVEKDNPVYYHSEKLAVAF 566
           AG R HPE ++I  K   M   +   GYV E EE      ++ E++  V+ HSEKLA+ +
Sbjct: 501 AGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLAITY 560

Query: 567 GLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFERKFIVRDRNRFHHFADGECSCNDYW 621
           GL+KT+ G  +RI KNLRVCKDCH+  KL+SK+++R  ++RDR RFHHF DG+CSC DYW
Sbjct: 561 GLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCSCGDYW 620
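
The identity count reported for an HSP can in principle be rebuilt from the aligned Query and Sbjct rows. The following is a rough sketch with a hypothetical helper, `alignment_stats`; it assumes both rows have already been stripped of their labels and coordinates and are of equal length.

```python
def alignment_stats(query: str, subject: str) -> tuple[int, int, int]:
    """Return (identities, gap columns, alignment length) for two aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    pairs = list(zip(query, subject))
    # A column counts as an identity when both residues match and are not gaps.
    identities = sum(q == s and q != "-" for q, s in pairs)
    # A column counts as a gap column when either row has '-'.
    gaps = sum(q == "-" or s == "-" for q, s in pairs)
    return identities, gaps, len(pairs)


# First ten columns of the first alignment block above.
print(alignment_stats("LKSCSSMSEL", "LQRCSKQEEL"))  # (5, 0, 10)
```

Summing these per-block tuples over all blocks of an HSP should reproduce the identity count and alignment length on its summary line, provided the rows are copied without truncation.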

BLAST of CmaCh10G007920 vs. Swiss-Prot
Match: PP425_ARATH (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 523.9 bits (1348), Expect = 2.5e-147
Identity = 258/633 (40.76%), Positives = 389/633 (61.45%), Query Frame = 1

Query: 12  SPINSPKVHTSPIHG-LKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVCK--NG 71
           SP +SP  H S +   + +C ++ +L Q H+  I+ G   D  A   +++FCA     + 
Sbjct: 14  SPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHR 73

Query: 72  DLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACL---LLYLEMLHKCVLPNKFTFPS 131
           DLDYA  +F  +P  + + +NT+IRG+ +    +A +   L Y  M  + V PN+FTFPS
Sbjct: 74  DLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPS 133

Query: 132 LIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVF------- 191
           +++AC     I+EGKQIH   LK+GF  D F  +NL+ MY     +++AR +F       
Sbjct: 134 VLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEK 193

Query: 192 DGIELPD-------VVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNN 251
           D + + D       +V W  ++ GY + G    A  +F+ M + S  SWN MIS +  N 
Sbjct: 194 DMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNG 253

Query: 252 RFHEAFALFNRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLAT 311
            F +A  +F  M+   I  +     S+L A + LG+LE G W+H Y + S I++D  L +
Sbjct: 254 FFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGS 313

Query: 312 TLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTP 371
            LIDMY KCG +++A  VF +LP + + +W+ MI G A+HG+   AI+ F  M    V P
Sbjct: 314 ALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRP 373

Query: 372 DNITFLNVLNACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKL 431
            ++ ++N+L AC+H GLVE+GR YF+    V  ++PR EHYGCMVDL GRSG+LDEA + 
Sbjct: 374 SDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEF 433

Query: 432 IREMPMSPDAGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRW 491
           I  MP+ PD  +  A +GAC++ GN++MG+ +   ++++ P +SG YV L N+YA  G W
Sbjct: 434 ILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNW 493

Query: 492 DSVAEVRKLMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCV 551
             V+E+R  M +++++K  G S+I+++GV+HEF+     HP+AKEI   + E+ + +R  
Sbjct: 494 SEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLA 553

Query: 552 GY--VTEEE--EEEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQAL 611
           GY  +T +     EE +K+N ++YHSEK+A AFGL+ T  G+ +RI KNLR+C+DCH ++
Sbjct: 554 GYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSI 613

Query: 612 KLVSKVFERKFIVRDRNRFHHFADGECSCNDYW 621
           KL+SKV++RK  VRDR RFHHF DG CSC DYW
Sbjct: 614 KLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of CmaCh10G007920 vs. Swiss-Prot
Match: PP367_ARATH (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 2.6e-141
Identity = 246/604 (40.73%), Positives = 380/604 (62.91%), Query Frame = 1

Query: 27  LKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVCKNGD-----LDYALLLFKTIP 86
           L+SCSS S+LK  H  ++R  L +D     RL+  C      +     L YA  +F  I 
Sbjct: 19  LQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFSQIQ 78

Query: 87  NPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQ 146
           NP+ +++N LIR +     P      Y +ML   + P+  TFP LI+A      +  G+Q
Sbjct: 79  NPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLVGEQ 138

Query: 147 IHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGF 206
            H+ +++FGF+ D + +N+L+HMYAN   +  A R+F  +   DVVSWT+++ GY +CG 
Sbjct: 139 THSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCKCGM 198

Query: 207 LDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALFNRMRSEKIVLDKYMAASMLSA 266
           ++ A ++F+ MP  +  +W+ MI+ + +NN F +A  LF  M+ E +V ++ +  S++S+
Sbjct: 199 VENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSVISS 258

Query: 267 CTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSW 326
           C  LGALE G   + Y+ KS + ++  L T L+DM+ +CG +++A  VF  LPE    SW
Sbjct: 259 CAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPETDSLSW 318

Query: 327 NCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFSQ 386
           + +I G+A+HG    A+  F  M +    P ++TF  VL+AC+H GLVEKG   + +  +
Sbjct: 319 SSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKK 378

Query: 387 VYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPDAGVLGAFVGACKIHGNIDMGE 446
            + I+PR EHYGC+VD+ GR+G L EA   I +M + P+A +LGA +GACKI+ N ++ E
Sbjct: 379 DHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKNTEVAE 438

Query: 447 EIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKLMNDREVKKAAGFSMIELEGVV 506
            +G  +I++ P +SG YVLL N+YA AG+WD +  +R +M ++ VKK  G+S+IE++G +
Sbjct: 439 RVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIEIDGKI 498

Query: 507 HEFIAG-GRGHPEAKEIYGKVNEMIECIRCVGYVTEEEEE----EEVEKDNPVYYHSEKL 566
           ++F  G  + HPE  +I  K  E++  IR +GY     +     +E EK++ ++ HSEKL
Sbjct: 499 NKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKESSIHMHSEKL 558

Query: 567 AVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFERKFIVRDRNRFHHFADGECSC 621
           A+A+G++KT+ G T+RI KNLRVC+DCH   KL+S+V+ R+ IVRDRNRFHHF +G CSC
Sbjct: 559 AIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHFRNGVCSC 618

BLAST of CmaCh10G007920 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 489.6 bits (1259), Expect = 5.1e-137
Identity = 241/612 (39.38%), Positives = 379/612 (61.93%), Query Frame = 1

Query: 17  PKVHTSP--IHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVCKNGDLDYAL 76
           P  +T P  I      SS+S  +  H   ++  + +D      L+     C  GDLD A 
Sbjct: 129 PNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSC--GDLDSAC 188

Query: 77  LLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDN 136
            +F TI   D   +N++I G++Q   P   L L+ +M  + V  +  T   ++ AC    
Sbjct: 189 KVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIR 248

Query: 137 AIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLT 196
            +E G+Q+ +++ +     +    N ++ MY    S+E+A+R+FD +E  D V+WTT+L 
Sbjct: 249 NLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLD 308

Query: 197 GYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALFNRMRSEK-IVLDKY 256
           GYA     + A +V  SMP+    +WNA+IS++ QN + +EA  +F+ ++ +K + L++ 
Sbjct: 309 GYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQI 368

Query: 257 MAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQL 316
              S LSAC  +GALE G WIH YIKK  I+++  + + LI MY KCG L+++ EVF  +
Sbjct: 369 TLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSV 428

Query: 317 PEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGR 376
            ++ +  W+ MIGG+AMHG G  A+++F  M+   V P+ +TF NV  AC+H+GLV++  
Sbjct: 429 EKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAE 488

Query: 377 YYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPDAGVLGAFVGACKI 436
             F+     Y I P  +HY C+VD+ GRSG L++A+K I  MP+ P   V GA +GACKI
Sbjct: 489 SLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKI 548

Query: 437 HGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKLMNDREVKKAAGFS 496
           H N+++ E    R++EL+P N G +VLL N+YA+ G+W++V+E+RK M    +KK  G S
Sbjct: 549 HANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCS 608

Query: 497 MIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEEE-----EEEVEKDNP 556
            IE++G++HEF++G   HP ++++YGK++E++E ++  GY  E  +     EEE  K+  
Sbjct: 609 SIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQS 668

Query: 557 VYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFERKFIVRDRNRFHH 616
           +  HSEKLA+ +GL+ T+A + +R+ KNLRVC DCH   KL+S++++R+ IVRDR RFHH
Sbjct: 669 LNLHSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHH 728

Query: 617 FADGECSCNDYW 621
           F +G+CSCND+W
Sbjct: 729 FRNGQCSCNDFW 738

BLAST of CmaCh10G007920 vs. Swiss-Prot
Match: PP354_ARATH (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana GN=ELI1 PE=3 SV=1)

HSP 1 Score: 476.1 bits (1224), Expect = 5.9e-133
Identity = 249/596 (41.78%), Positives = 360/596 (60.40%), Query Frame = 1

Query: 32  SMSELKQYHSQIIRLGLSTD-NDAMGRLVKFCAVCKNGDLDYALLLFKTIPNPDAYIYNT 91
           S+ E+ Q H+ I+R  L       +  L    A   +G + ++L LF    +PD +++  
Sbjct: 41  SVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSLALFHQTIDPDLFLFTA 100

Query: 92  LIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQIHAHVLKFG 151
            I             LLY+++L   + PN+FTF SL+++C    + + GK IH HVLKFG
Sbjct: 101 AINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSC----STKSGKLIHTHVLKFG 160

Query: 152 FRADRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGFLDEALQVFE 211
              D +    L+ +YA    +  A++VFD +    +VS T ++T YA+ G ++ A  +F+
Sbjct: 161 LGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITCYAKQGNVEAARALFD 220

Query: 212 SMPEHSSASWNAMISSFVQNNRFHEAFALFNRMRSE-KIVLDKYMAASMLSACTGLGALE 271
           SM E    SWN MI  + Q+   ++A  LF ++ +E K   D+    + LSAC+ +GALE
Sbjct: 221 SMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEITVVAALSACSQIGALE 280

Query: 272 QGIWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMIGGMA 331
            G WIH ++K S I+L+ K+ T LIDMY KCG L+ A  VF   P K I +WN MI G A
Sbjct: 281 TGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDTPRKDIVAWNAMIAGYA 340

Query: 332 MHGKGEAAIELFKDME-TKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFSQVYDIKPR 391
           MHG  + A+ LF +M+    + P +ITF+  L ACAH+GLV +G   F    Q Y IKP+
Sbjct: 341 MHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIRIFESMGQEYGIKPK 400

Query: 392 TEHYGCMVDLYGRSGMLDEAMKLIREMPMSPDAGVLGAFVGACKIHGNIDMGEEIGKRVI 451
            EHYGC+V L GR+G L  A + I+ M M  D+ +  + +G+CK+HG+  +G+EI + +I
Sbjct: 401 IEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKLHGDFVLGKEIAEYLI 460

Query: 452 ELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKLMNDREVKKAAGFSMIELEGVVHEFIAGG 511
            L+  NSG YVLL N+YA  G ++ VA+VR LM ++ + K  G S IE+E  VHEF AG 
Sbjct: 461 GLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGISTIEIENKVHEFRAGD 520

Query: 512 RGHPEAKEIYGKVNEMIECIRCVGYV----TEEEEEEEVEKDNPVYYHSEKLAVAFGLLK 571
           R H ++KEIY  + ++ E I+  GYV    T  ++ EE EK+  +  HSE+LA+A+GL+ 
Sbjct: 521 REHSKSKEIYTMLRKISERIKSHGYVPNTNTVLQDLEETEKEQSLQVHSERLAIAYGLIS 580

Query: 572 TQAGETLRITKNLRVCKDCHQALKLVSKVFERKFIVRDRNRFHHFADGECSCNDYW 621
           T+ G  L+I KNLRVC DCH   KL+SK+  RK ++RDRNRFHHF DG CSC D+W
Sbjct: 581 TKPGSPLKIFKNLRVCSDCHTVTKLISKITGRKIVMRDRNRFHHFTDGSCSCGDFW 632

BLAST of CmaCh10G007920 vs. TrEMBL
Match: A0A0A0LWF1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G569490 PE=4 SV=1)

HSP 1 Score: 957.6 bits (2474), Expect = 7.4e-276
Identity = 468/579 (80.83%), Positives = 518/579 (89.46%), Query Frame = 1

Query: 1   MSSLLALQLSASPINSPKVHTSPI-HGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLV 60
           M+SLL L    S  NS K + SPI H L SCSSMSELKQ+HSQIIRLGLSTDN+A+GRL+
Sbjct: 1   MASLLPLHPIPSLPNSTKFNPSPIFHSLSSCSSMSELKQFHSQIIRLGLSTDNNAIGRLI 60

Query: 61  KFCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPN 120
           KFCAV K GDL YALLLF +IP PDA+IYNTLIR YL    P++ LLLYL+MLH  V PN
Sbjct: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLHFNSPKSSLLLYLQMLHNSVFPN 120

Query: 121 KFTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFD 180
           KFTFPS+IRACCIDN++EEGKQIH HV+KFGF  DRF QNNLIHMYANFQSLE+ARRVFD
Sbjct: 121 KFTFPSVIRACCIDNSVEEGKQIHTHVVKFGFSKDRFCQNNLIHMYANFQSLEDARRVFD 180

Query: 181 GIELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFAL 240
            IELPDVV+WTTLLTGYAQ G++DE+L+VFESMPE +SASWNAMIS FVQNNRFHEAF L
Sbjct: 181 CIELPDVVAWTTLLTGYAQLGYVDESLRVFESMPERNSASWNAMISCFVQNNRFHEAFGL 240

Query: 241 FNRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCK 300
           FNRMR EK+VL+KY+AASMLSACTGLGALEQG WIHRYI+++ I+ DSKLATTLIDMYCK
Sbjct: 241 FNRMRIEKVVLEKYVAASMLSACTGLGALEQGKWIHRYIERNGIEFDSKLATTLIDMYCK 300

Query: 301 CGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNV 360
           CGCLD A+EVF  LPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMV PDNITFLNV
Sbjct: 301 CGCLDCAYEVFVHLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVKPDNITFLNV 360

Query: 361 LNACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSP 420
           L+ACAHSGLVEKG++YF  F+QVY I+PRTEHYGCMVDLYGR+G+L+EAMK+I EMPMSP
Sbjct: 361 LSACAHSGLVEKGQHYFYRFTQVYGIEPRTEHYGCMVDLYGRAGLLEEAMKVIDEMPMSP 420

Query: 421 DAGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRK 480
           D GVLGAFVGACKIHGNI++GEE+GKRVIEL+P+NSGRYVLLGNLYAEAGRW+ VAEVRK
Sbjct: 421 DVGVLGAFVGACKIHGNIELGEEVGKRVIELEPTNSGRYVLLGNLYAEAGRWEGVAEVRK 480

Query: 481 LMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEEE 540
           LMNDREVKKAAG SMIELEGVV+EFIAGGR HPEAKEIY K+NEM+ECIR  GYV E E 
Sbjct: 481 LMNDREVKKAAGVSMIELEGVVYEFIAGGRNHPEAKEIYDKLNEMLECIRSEGYVAENEI 540

Query: 541 EEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLR 579
           EE  EKDNPVYYHSEKLA+AFGLLKT+AGE LRITKNLR
Sbjct: 541 EE--EKDNPVYYHSEKLAIAFGLLKTKAGEILRITKNLR 577

BLAST of CmaCh10G007920 vs. TrEMBL
Match: F6H9I8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0069g00930 PE=4 SV=1)

HSP 1 Score: 938.3 bits (2424), Expect = 4.7e-270
Identity = 442/624 (70.83%), Positives = 532/624 (85.26%), Query Frame = 1

Query: 1   MSSLLALQLSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           MSSL  LQ S   ++S K H  P++GL SCS+M+ELKQYHSQIIRLGLS DNDAMGR++K
Sbjct: 1   MSSLQLLQASPPSLSSAKAHKLPLYGLDSCSTMAELKQYHSQIIRLGLSADNDAMGRVIK 60

Query: 61  FCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCA+ K+GDL+YAL +F  IP+PDAYIYNT+ RGYL+ Q  R C+ +Y  MLHK V PNK
Sbjct: 61  FCAISKSGDLNYALEVFDKIPHPDAYIYNTIFRGYLRWQLARNCIFMYSRMLHKSVSPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDG 180
           FT+P LIRACCID AIEEGKQIHAHVLKFGF AD FS NNLIHMY NFQSLE+ARRVFD 
Sbjct: 121 FTYPPLIRACCIDYAIEEGKQIHAHVLKFGFGADGFSLNNLIHMYVNFQSLEQARRVFDN 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALF 240
           +   DVVSWT+L+TGY+Q GF+D+A +VFE MPE +S SWNAMI+++VQ+NR HEAFALF
Sbjct: 181 MPQRDVVSWTSLITGYSQWGFVDKAREVFELMPERNSVSWNAMIAAYVQSNRLHEAFALF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
           +RMR E +VLDK++AASMLSACTGLGALEQG WIH YI+KS I+LDSKLATT+IDMYCKC
Sbjct: 241 DRMRLENVVLDKFVAASMLSACTGLGALEQGKWIHGYIEKSGIELDSKLATTVIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCL++A EVF +LP+KGISSWNCMIGG+AMHGKGEAAIELFK+ME +MV PD ITF+NVL
Sbjct: 301 GCLEKASEVFNELPQKGISSWNCMIGGLAMHGKGEAAIELFKEMEREMVAPDGITFVNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPD 420
           +ACAHSGLVE+G++YF + ++V  +KP  EH+GCMVDL GR+G+L+EA KLI EMP++PD
Sbjct: 361 SACAHSGLVEEGKHYFQYMTEVLGLKPGMEHFGCMVDLLGRAGLLEEARKLINEMPVNPD 420

Query: 421 AGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKL 480
           AGVLGA VGAC+IHGN ++GE+IGK+VIEL+P NSGRYVLL NLYA AGRW+ VA+VRKL
Sbjct: 421 AGVLGALVGACRIHGNTELGEQIGKKVIELEPHNSGRYVLLANLYASAGRWEDVAKVRKL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEE-- 540
           MNDR VKKA GFSMIE E  V EFIAGGR HP+AKEIY K++E++E IR +GYV + +  
Sbjct: 481 MNDRGVKKAPGFSMIESESGVDEFIAGGRAHPQAKEIYAKLDEILETIRSIGYVPDTDGV 540

Query: 541 --EEEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFER 600
             + +E EK+NP+YYHSEKLA+AFGLLKT+ GETLRI+KNLR+C+DCHQA KL+SKV++R
Sbjct: 541 LHDIDEEEKENPLYYHSEKLAIAFGLLKTKPGETLRISKNLRICRDCHQASKLISKVYDR 600

Query: 601 KFIVRDRNRFHHFADGECSCNDYW 621
           + I+RDRNRFHHF  G CSC DYW
Sbjct: 601 EIIIRDRNRFHHFRMGGCSCKDYW 624

BLAST of CmaCh10G007920 vs. TrEMBL
Match: M5Y189_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa018015mg PE=4 SV=1)

HSP 1 Score: 932.2 bits (2408), Expect = 3.3e-268
Identity = 439/624 (70.35%), Positives = 526/624 (84.29%), Query Frame = 1

Query: 1   MSSLLALQLSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           M+SL  LQ +   ++SPK   SP+ G++SCS+M+EL+Q HS++IRLGL+ DNDAMGR++K
Sbjct: 1   MTSLQVLQATPPHLSSPKTQISPLRGIESCSTMAELRQLHSKVIRLGLAADNDAMGRVIK 60

Query: 61  FCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCA+ KNGDL YAL +F T+ +PDA+IYNT++RGYLQ   PR C++LY +ML   V PNK
Sbjct: 61  FCALSKNGDLGYALQVFDTMLHPDAFIYNTVMRGYLQCHLPRNCIVLYSQMLQDSVTPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDG 180
           +TFPS+IRACC D+AI EGKQ+HAHV+K G+ AD F QNNLIHMY  FQSLEEARRVFD 
Sbjct: 121 YTFPSVIRACCNDDAIGEGKQVHAHVVKLGYGADGFCQNNLIHMYVKFQSLEEARRVFDK 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALF 240
           +   D VSWTTL+TGY+QCGF+DEA ++FE MPE +S SWNAMISS+VQ++RFHEAFALF
Sbjct: 181 MLRMDAVSWTTLITGYSQCGFVDEAFELFELMPEKNSVSWNAMISSYVQSDRFHEAFALF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
            +MR EK+ LDK+MAASMLSACTGLGALEQG WIH YI+KS I+LDSKLATT+IDMYCKC
Sbjct: 241 QKMRVEKVELDKFMAASMLSACTGLGALEQGKWIHGYIEKSGIELDSKLATTIIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCL++AFEVF  LP KGISSWNCMIGG+AMHGKGEAAIELF+ M+  MV PDNITF+NVL
Sbjct: 301 GCLEKAFEVFNGLPHKGISSWNCMIGGLAMHGKGEAAIELFEKMQRDMVAPDNITFVNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPD 420
           +ACAHSGLVE+G+ YF    +V+ I+PR EH+GCMVDL GR+GML+EA KLI EMPMSPD
Sbjct: 361 SACAHSGLVEEGQRYFQSMVEVHGIEPRKEHFGCMVDLLGRAGMLEEARKLISEMPMSPD 420

Query: 421 AGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKL 480
            GVLGA +GACKIHGN+++GE IG+ VIEL+P NSGRYVLL NLYA AGRW+ VA VR+L
Sbjct: 421 VGVLGALLGACKIHGNVELGEHIGRIVIELEPENSGRYVLLANLYANAGRWEDVANVRRL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEE-- 540
           MNDR VKK  GFSMIELEGVV+EFIAGG  HP+ KEIY KV+EM++CIR  GYV + E  
Sbjct: 481 MNDRGVKKVPGFSMIELEGVVNEFIAGGGAHPQTKEIYAKVDEMLKCIRSAGYVPDTEGV 540

Query: 541 --EEEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFER 600
             + +E EK+NP+YYHSEKLA+AFGLLKT+ GETLRI+KNLRVCKDCHQA KL+SKVF+R
Sbjct: 541 LHDLDEEEKENPLYYHSEKLAIAFGLLKTKPGETLRISKNLRVCKDCHQASKLISKVFDR 600

Query: 601 KFIVRDRNRFHHFADGECSCNDYW 621
           + IVRDRNRFHHF  G+CSC DYW
Sbjct: 601 EIIVRDRNRFHHFKRGDCSCKDYW 624

BLAST of CmaCh10G007920 vs. TrEMBL
Match: A0A061EPX5_THECC (Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_021493 PE=4 SV=1)

HSP 1 Score: 910.2 bits (2351), Expect = 1.4e-261
Identity = 433/631 (68.62%), Positives = 523/631 (82.88%), Query Frame = 1

Query: 1   MSSLLALQLSASPINS-------PKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDND 60
           MSS   LQ  A+P++S       P++H SP+  L++CSSM+ LKQ+HS +I+LGLS DND
Sbjct: 1   MSSFQLLQ--ATPLSSAGHAPSKPRLH-SPLEALQTCSSMAHLKQHHSHLIKLGLSADND 60

Query: 61  AMGRLVKFCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLH 120
           AMGR++KFCA+ +NG LDY L LF T+P+PDA+IYNTLIRGYLQ Q P  C+L YL+ML 
Sbjct: 61  AMGRIIKFCAISENGHLDYGLHLFDTLPHPDAFIYNTLIRGYLQRQQPTHCILFYLQMLQ 120

Query: 121 KCVLPNKFTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEE 180
             V PNKFTFP LIRAC + NAIE+G QIHAHV KFGF AD F  NNLIHMY NFQ+LE+
Sbjct: 121 HSVFPNKFTFPCLIRACSLANAIEQGSQIHAHVFKFGFAADTFCLNNLIHMYVNFQALEK 180

Query: 181 ARRVFDGIELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRF 240
           AR+VF+ +   DVVSWTTL++GYAQ G +DEA ++FE M E +S SWNAMI+++VQ+NRF
Sbjct: 181 ARKVFEMMPTRDVVSWTTLISGYAQLGLVDEAFEIFELMQERNSVSWNAMIAAYVQSNRF 240

Query: 241 HEAFALFNRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTL 300
           HEAFALFNRMR+EK+VLDK++AASMLSACTGLGALEQG WIH YI+ S I+LD+KLATT+
Sbjct: 241 HEAFALFNRMRAEKVVLDKFVAASMLSACTGLGALEQGKWIHGYIQDSRIELDAKLATTI 300

Query: 301 IDMYCKCGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDN 360
           IDMYCKCGCL++A+E F  L  +GISSWNCMIGG AMHGK EAAI LFK+ME + V PDN
Sbjct: 301 IDMYCKCGCLEKAYETFKGLTCRGISSWNCMIGGFAMHGKWEAAIALFKEMEKEGVAPDN 360

Query: 361 ITFLNVLNACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIR 420
           ITF+N+L+ACAHSGLVE+GRYYF++ ++V+ I+ R EHYGCMVDL GR+G+LD+A KLI 
Sbjct: 361 ITFVNILSACAHSGLVEEGRYYFHYMTEVHAIERRMEHYGCMVDLLGRAGLLDDAKKLID 420

Query: 421 EMPMSPDAGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDS 480
           +MPMSPD GVLGA  GAC+IHGNI++GE+IGKRVIEL+P NSGRYVLL NLYA  GRW+ 
Sbjct: 421 QMPMSPDVGVLGALFGACRIHGNIELGEQIGKRVIELEPENSGRYVLLANLYANTGRWED 480

Query: 481 VAEVRKLMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGY 540
           VA VR++MNDR VKK  GFS+IELEGVV+EFIAGGR H E KEIY KV+EM+ECIR VGY
Sbjct: 481 VANVRRMMNDRGVKKVPGFSVIELEGVVNEFIAGGRAHSETKEIYSKVDEMLECIRSVGY 540

Query: 541 VTEEE----EEEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKL 600
           V + E    + +E E++NP+YYHSEKLA+A GLLKT+ GET RITKNLRVC+DCH A KL
Sbjct: 541 VPDTEGVVHDLDEEERENPLYYHSEKLAIALGLLKTKTGETFRITKNLRVCRDCHHASKL 600

Query: 601 VSKVFERKFIVRDRNRFHHFADGECSCNDYW 621
           +SKVF+R+ IVRDRNRFHHF DGECSC DYW
Sbjct: 601 ISKVFDREIIVRDRNRFHHFKDGECSCKDYW 628

BLAST of CmaCh10G007920 vs. TrEMBL
Match: A0A0J8BUR1_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_7g174460 PE=4 SV=1)

HSP 1 Score: 887.9 bits (2293), Expect = 7.2e-255
Identity = 413/624 (66.19%), Positives = 523/624 (83.81%), Query Frame = 1

Query: 4   LLALQLSASPINSPK--VHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKF 63
           ++ + +S+  +++P   V+ +P+  L  CSSM+ELKQ HSQIIRLGLS+DND MGR +KF
Sbjct: 1   MMHIPISSLVVSAPLNVVYPNPVQSLDKCSSMAELKQLHSQIIRLGLSSDNDIMGRAIKF 60

Query: 64  CAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKF 123
           CA  K  DL YA  +F  +P+PDA+IYNTLIRGYLQ Q  R C +LYL+ML   V+PN F
Sbjct: 61  CAFSKCSDLVYAHQMFDKMPHPDAFIYNTLIRGYLQCQLVRECFILYLQMLQDSVMPNNF 120

Query: 124 TFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDGI 183
           TFPSLIRACCIDN +E+G+QIHAHV+K GF  D+FSQNNL++MY +F  LEEARRVFD +
Sbjct: 121 TFPSLIRACCIDNWVEKGRQIHAHVVKMGFLDDKFSQNNLLYMYVSFGCLEEARRVFDKM 180

Query: 184 ELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALFN 243
              DVVSWTTL++GY+Q G +++A +VF+SMP  +SA+WNAMI+++VQN+RFHEAFALFN
Sbjct: 181 PQRDVVSWTTLISGYSQLGLVNDAYEVFKSMPNRNSAAWNAMIAAYVQNDRFHEAFALFN 240

Query: 244 RMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKCG 303
            MR   + LD+Y+ ASMLSACT LGAL+QG WIH YI+K+ I++D+KLATT+IDMYCKCG
Sbjct: 241 EMRMGNVELDRYVVASMLSACTKLGALDQGEWIHGYIRKNGIEMDTKLATTVIDMYCKCG 300

Query: 304 CLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLN 363
           CL++AFEVF  LP KGIS+WNCMIGG+A+HG+G+AAIELFKDME + V PDNITFLNVL+
Sbjct: 301 CLEKAFEVFNGLPSKGISTWNCMIGGLAIHGRGKAAIELFKDMERETVAPDNITFLNVLS 360

Query: 364 ACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLI-REMPMSPD 423
           ACAH+GLVE GR+YF H ++V+ ++PR EHYGCMVDL GR+G+L+EA KL+  EMPM  D
Sbjct: 361 ACAHAGLVETGRHYFKHMTEVHRLEPRMEHYGCMVDLLGRAGLLEEARKLVEEEMPMEAD 420

Query: 424 AGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKL 483
           AGVLGA VGACKIHG+I++GE  GKR+IELDP+NSGRYVLL NLYA AG+W+ VA +RKL
Sbjct: 421 AGVLGALVGACKIHGDINLGENFGKRLIELDPNNSGRYVLLANLYATAGKWNDVANIRKL 480

Query: 484 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEE--- 543
           MNDR+VKKA GFS+IE+EGVV EFIAGGR HPE+++IY K++EM+E I+C+GYV ++   
Sbjct: 481 MNDRKVKKAPGFSVIEMEGVVSEFIAGGRAHPESRKIYAKLDEMLESIKCLGYVPDKDGL 540

Query: 544 -EEEEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFER 603
            ++ +E EK+NP  YHSEKLA+AFGLLKT+AG+T+RI+KNLRVC+DCHQA KL+SK F+R
Sbjct: 541 IQDIDEEEKENPSNYHSEKLAIAFGLLKTKAGQTIRISKNLRVCRDCHQASKLISKAFDR 600

Query: 604 KFIVRDRNRFHHFADGECSCNDYW 621
           + IVRDRNRFHHF +GECSCND+W
Sbjct: 601 EIIVRDRNRFHHFRNGECSCNDFW 624
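
The percentages in an HSP header follow directly from its counts: identity is identical positions divided by alignment length, and positives is conservative matches divided by the same length. A minimal sketch of reading those two header lines and recomputing the percentages is given below. The names `HspStats` and `parse_hsp` are hypothetical and not part of any BLAST tool, and the regular expression also accepts the "Postives" spelling found in some report exports.

```python
import re
from dataclasses import dataclass

# Patterns for the two HSP header lines as they appear in this report.
SCORE_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
COUNTS_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/\d+")


@dataclass
class HspStats:
    """Hypothetical container for one HSP header (illustrative only)."""
    bit_score: float
    raw_score: int
    evalue: float
    identical: int
    positives: int
    align_len: int

    def pct_identity(self) -> float:
        return 100.0 * self.identical / self.align_len

    def pct_positives(self) -> float:
        return 100.0 * self.positives / self.align_len


def parse_hsp(score_line: str, counts_line: str) -> HspStats:
    """Parse the 'Score/Expect' line and the 'Identity/Positives' line."""
    score = SCORE_RE.search(score_line)
    counts = COUNTS_RE.search(counts_line)
    if score is None or counts is None:
        raise ValueError("lines do not look like an HSP header")
    return HspStats(
        bit_score=float(score[1]),
        raw_score=int(score[2]),
        evalue=float(score[3]),
        identical=int(counts[1]),
        positives=int(counts[3]),
        align_len=int(counts[2]),
    )


stats = parse_hsp(
    "HSP 1 Score: 887.9 bits (2293), Expect = 7.2e-255",
    "Identity = 413/624 (66.19%), Positives = 523/624 (83.81%), Query Frame = 1",
)
print(f"{stats.pct_identity():.2f} {stats.pct_positives():.2f}")  # 66.19 83.81
```

The same function can be applied to any other HSP header in this record to check that the reported percentages agree with the counts.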

BLAST of CmaCh10G007920 vs. TAIR10
Match: AT5G66520.1 (AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 553.9 bits (1426), Expect = 1.2e-157
Identity = 270/600 (45.00%), Positives = 383/600 (63.83%), Query Frame = 1

Query: 27  LKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVCKNGD-LDYALLLFKTIPNPDA 86
           L+ CS   ELKQ H+++++ GL  D+ A+ + + FC    + D L YA ++F     PD 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 87  YIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQIHAH 146
           +++N +IRG+     P   LLLY  ML      N +TFPSL++AC   +A EE  QIHA 
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQ 140

Query: 147 VLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGFLDEA 206
           + K G+  D ++ N+LI+ YA   + + A  +FD I  PD VSW +++ GY + G +D A
Sbjct: 141 ITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIA 200

Query: 207 LQVFESMPEHSSASWNAMISSFVQNNRFHEAFALFNRMRSEKIVLDKYMAASMLSACTGL 266
           L +F  M E ++ SW  MIS +VQ +   EA  LF+ M++  +  D    A+ LSAC  L
Sbjct: 201 LTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQL 260

Query: 267 GALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMI 326
           GALEQG WIH Y+ K+ I++DS L   LIDMY KCG ++ A EVF  + +K + +W  +I
Sbjct: 261 GALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 320

Query: 327 GGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFSQVYDI 386
            G A HG G  AI  F +M+   + P+ ITF  VL AC+++GLVE+G+  F    + Y++
Sbjct: 321 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 380

Query: 387 KPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPDAGVLGAFVGACKIHGNIDMGEEIGK 446
           KP  EHYGC+VDL GR+G+LDEA + I+EMP+ P+A + GA + AC+IH NI++GEEIG+
Sbjct: 381 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGE 440

Query: 447 RVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKLMNDREVKKAAGFSMIELEGVVHEFI 506
            +I +DP + GRYV   N++A   +WD  AE R+LM ++ V K  G S I LEG  HEF+
Sbjct: 441 ILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFL 500

Query: 507 AGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEEE-----EEEVEKDNPVYYHSEKLAVAF 566
           AG R HPE ++I  K   M   +   GYV E EE      ++ E++  V+ HSEKLA+ +
Sbjct: 501 AGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLAITY 560

Query: 567 GLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFERKFIVRDRNRFHHFADGECSCNDYW 621
           GL+KT+ G  +RI KNLRVCKDCH+  KL+SK+++R  ++RDR RFHHF DG+CSC DYW
Sbjct: 561 GLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCSCGDYW 620

BLAST of CmaCh10G007920 vs. TAIR10
Match: AT5G48910.1 (AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 523.9 bits (1348), Expect = 1.4e-148
Identity = 258/633 (40.76%), Positives = 389/633 (61.45%), Query Frame = 1

Query: 12  SPINSPKVHTSPIHG-LKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVCK--NG 71
           SP +SP  H S +   + +C ++ +L Q H+  I+ G   D  A   +++FCA     + 
Sbjct: 14  SPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHR 73

Query: 72  DLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACL---LLYLEMLHKCVLPNKFTFPS 131
           DLDYA  +F  +P  + + +NT+IRG+ +    +A +   L Y  M  + V PN+FTFPS
Sbjct: 74  DLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPS 133

Query: 132 LIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVF------- 191
           +++AC     I+EGKQIH   LK+GF  D F  +NL+ MY     +++AR +F       
Sbjct: 134 VLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEK 193

Query: 192 DGIELPD-------VVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNN 251
           D + + D       +V W  ++ GY + G    A  +F+ M + S  SWN MIS +  N 
Sbjct: 194 DMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNG 253

Query: 252 RFHEAFALFNRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLAT 311
            F +A  +F  M+   I  +     S+L A + LG+LE G W+H Y + S I++D  L +
Sbjct: 254 FFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGS 313

Query: 312 TLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTP 371
            LIDMY KCG +++A  VF +LP + + +W+ MI G A+HG+   AI+ F  M    V P
Sbjct: 314 ALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRP 373

Query: 372 DNITFLNVLNACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKL 431
            ++ ++N+L AC+H GLVE+GR YF+    V  ++PR EHYGCMVDL GRSG+LDEA + 
Sbjct: 374 SDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEF 433

Query: 432 IREMPMSPDAGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRW 491
           I  MP+ PD  +  A +GAC++ GN++MG+ +   ++++ P +SG YV L N+YA  G W
Sbjct: 434 ILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNW 493

Query: 492 DSVAEVRKLMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCV 551
             V+E+R  M +++++K  G S+I+++GV+HEF+     HP+AKEI   + E+ + +R  
Sbjct: 494 SEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLA 553

Query: 552 GY--VTEEE--EEEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQAL 611
           GY  +T +     EE +K+N ++YHSEK+A AFGL+ T  G+ +RI KNLR+C+DCH ++
Sbjct: 554 GYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSI 613

Query: 612 KLVSKVFERKFIVRDRNRFHHFADGECSCNDYW 621
           KL+SKV++RK  VRDR RFHHF DG CSC DYW
Sbjct: 614 KLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of CmaCh10G007920 vs. TAIR10
Match: AT5G06540.1 (AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 503.8 bits (1296), Expect = 1.5e-142
Identity = 246/604 (40.73%), Positives = 380/604 (62.91%), Query Frame = 1

Query: 27  LKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVCKNGD-----LDYALLLFKTIP 86
           L+SCSS S+LK  H  ++R  L +D     RL+  C      +     L YA  +F  I 
Sbjct: 19  LQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFSQIQ 78

Query: 87  NPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQ 146
           NP+ +++N LIR +     P      Y +ML   + P+  TFP LI+A      +  G+Q
Sbjct: 79  NPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLVGEQ 138

Query: 147 IHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGF 206
            H+ +++FGF+ D + +N+L+HMYAN   +  A R+F  +   DVVSWT+++ GY +CG 
Sbjct: 139 THSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCKCGM 198

Query: 207 LDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALFNRMRSEKIVLDKYMAASMLSA 266
           ++ A ++F+ MP  +  +W+ MI+ + +NN F +A  LF  M+ E +V ++ +  S++S+
Sbjct: 199 VENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSVISS 258

Query: 267 CTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSW 326
           C  LGALE G   + Y+ KS + ++  L T L+DM+ +CG +++A  VF  LPE    SW
Sbjct: 259 CAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPETDSLSW 318

Query: 327 NCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFSQ 386
           + +I G+A+HG    A+  F  M +    P ++TF  VL+AC+H GLVEKG   + +  +
Sbjct: 319 SSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKK 378

Query: 387 VYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPDAGVLGAFVGACKIHGNIDMGE 446
            + I+PR EHYGC+VD+ GR+G L EA   I +M + P+A +LGA +GACKI+ N ++ E
Sbjct: 379 DHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKNTEVAE 438

Query: 447 EIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKLMNDREVKKAAGFSMIELEGVV 506
            +G  +I++ P +SG YVLL N+YA AG+WD +  +R +M ++ VKK  G+S+IE++G +
Sbjct: 439 RVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIEIDGKI 498

Query: 507 HEFIAG-GRGHPEAKEIYGKVNEMIECIRCVGYVTEEEEE----EEVEKDNPVYYHSEKL 566
           ++F  G  + HPE  +I  K  E++  IR +GY     +     +E EK++ ++ HSEKL
Sbjct: 499 NKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKESSIHMHSEKL 558

Query: 567 AVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFERKFIVRDRNRFHHFADGECSC 621
           A+A+G++KT+ G T+RI KNLRVC+DCH   KL+S+V+ R+ IVRDRNRFHHF +G CSC
Sbjct: 559 AIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHFRNGVCSC 618

BLAST of CmaCh10G007920 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 489.6 bits (1259), Expect = 2.9e-138
Identity = 241/612 (39.38%), Positives = 379/612 (61.93%), Query Frame = 1

Query: 17  PKVHTSP--IHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVCKNGDLDYAL 76
           P  +T P  I      SS+S  +  H   ++  + +D      L+     C  GDLD A 
Sbjct: 129 PNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSC--GDLDSAC 188

Query: 77  LLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDN 136
            +F TI   D   +N++I G++Q   P   L L+ +M  + V  +  T   ++ AC    
Sbjct: 189 KVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIR 248

Query: 137 AIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLT 196
            +E G+Q+ +++ +     +    N ++ MY    S+E+A+R+FD +E  D V+WTT+L 
Sbjct: 249 NLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLD 308

Query: 197 GYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALFNRMRSEK-IVLDKY 256
           GYA     + A +V  SMP+    +WNA+IS++ QN + +EA  +F+ ++ +K + L++ 
Sbjct: 309 GYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQI 368

Query: 257 MAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQL 316
              S LSAC  +GALE G WIH YIKK  I+++  + + LI MY KCG L+++ EVF  +
Sbjct: 369 TLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSV 428

Query: 317 PEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGR 376
            ++ +  W+ MIGG+AMHG G  A+++F  M+   V P+ +TF NV  AC+H+GLV++  
Sbjct: 429 EKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAE 488

Query: 377 YYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPDAGVLGAFVGACKI 436
             F+     Y I P  +HY C+VD+ GRSG L++A+K I  MP+ P   V GA +GACKI
Sbjct: 489 SLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKI 548

Query: 437 HGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKLMNDREVKKAAGFS 496
           H N+++ E    R++EL+P N G +VLL N+YA+ G+W++V+E+RK M    +KK  G S
Sbjct: 549 HANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCS 608

Query: 497 MIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEEE-----EEEVEKDNP 556
            IE++G++HEF++G   HP ++++YGK++E++E ++  GY  E  +     EEE  K+  
Sbjct: 609 SIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQS 668

Query: 557 VYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFERKFIVRDRNRFHH 616
           +  HSEKLA+ +GL+ T+A + +R+ KNLRVC DCH   KL+S++++R+ IVRDR RFHH
Sbjct: 669 LNLHSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHH 728

Query: 617 FADGECSCNDYW 621
           F +G+CSCND+W
Sbjct: 729 FRNGQCSCNDFW 738

BLAST of CmaCh10G007920 vs. TAIR10
Match: AT4G37380.1 (AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 476.1 bits (1224), Expect = 3.3e-134
Identity = 249/596 (41.78%), Positives = 360/596 (60.40%), Query Frame = 1

Query: 32  SMSELKQYHSQIIRLGLSTD-NDAMGRLVKFCAVCKNGDLDYALLLFKTIPNPDAYIYNT 91
           S+ E+ Q H+ I+R  L       +  L    A   +G + ++L LF    +PD +++  
Sbjct: 41  SVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSLALFHQTIDPDLFLFTA 100

Query: 92  LIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQIHAHVLKFG 151
            I             LLY+++L   + PN+FTF SL+++C    + + GK IH HVLKFG
Sbjct: 101 AINTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSC----STKSGKLIHTHVLKFG 160

Query: 152 FRADRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGFLDEALQVFE 211
              D +    L+ +YA    +  A++VFD +    +VS T ++T YA+ G ++ A  +F+
Sbjct: 161 LGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITCYAKQGNVEAARALFD 220

Query: 212 SMPEHSSASWNAMISSFVQNNRFHEAFALFNRMRSE-KIVLDKYMAASMLSACTGLGALE 271
           SM E    SWN MI  + Q+   ++A  LF ++ +E K   D+    + LSAC+ +GALE
Sbjct: 221 SMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEITVVAALSACSQIGALE 280

Query: 272 QGIWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMIGGMA 331
            G WIH ++K S I+L+ K+ T LIDMY KCG L+ A  VF   P K I +WN MI G A
Sbjct: 281 TGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDTPRKDIVAWNAMIAGYA 340

Query: 332 MHGKGEAAIELFKDME-TKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFSQVYDIKPR 391
           MHG  + A+ LF +M+    + P +ITF+  L ACAH+GLV +G   F    Q Y IKP+
Sbjct: 341 MHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIRIFESMGQEYGIKPK 400

Query: 392 TEHYGCMVDLYGRSGMLDEAMKLIREMPMSPDAGVLGAFVGACKIHGNIDMGEEIGKRVI 451
            EHYGC+V L GR+G L  A + I+ M M  D+ +  + +G+CK+HG+  +G+EI + +I
Sbjct: 401 IEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKLHGDFVLGKEIAEYLI 460

Query: 452 ELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKLMNDREVKKAAGFSMIELEGVVHEFIAGG 511
            L+  NSG YVLL N+YA  G ++ VA+VR LM ++ + K  G S IE+E  VHEF AG 
Sbjct: 461 GLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGISTIEIENKVHEFRAGD 520

Query: 512 RGHPEAKEIYGKVNEMIECIRCVGYV----TEEEEEEEVEKDNPVYYHSEKLAVAFGLLK 571
           R H ++KEIY  + ++ E I+  GYV    T  ++ EE EK+  +  HSE+LA+A+GL+ 
Sbjct: 521 REHSKSKEIYTMLRKISERIKSHGYVPNTNTVLQDLEETEKEQSLQVHSERLAIAYGLIS 580

Query: 572 TQAGETLRITKNLRVCKDCHQALKLVSKVFERKFIVRDRNRFHHFADGECSCNDYW 621
           T+ G  L+I KNLRVC DCH   KL+SK+  RK ++RDRNRFHHF DG CSC D+W
Sbjct: 581 TKPGSPLKIFKNLRVCSDCHTVTKLISKITGRKIVMRDRNRFHHFTDGSCSCGDFW 632

BLAST of CmaCh10G007920 vs. NCBI nr
Match: gi|778662474|ref|XP_011659892.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis sativus])

HSP 1 Score: 1044.6 bits (2700), Expect = 6.6e-302
Identity = 506/621 (81.48%), Positives = 558/621 (89.86%), Query Frame = 1

Query: 1   MSSLLALQLSASPINSPKVHTSPI-HGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLV 60
           M+SLL L    S  NS K + SPI H L SCSSMSELKQ+HSQIIRLGLSTDN+A+GRL+
Sbjct: 1   MASLLPLHPIPSLPNSTKFNPSPIFHSLSSCSSMSELKQFHSQIIRLGLSTDNNAIGRLI 60

Query: 61  KFCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPN 120
           KFCAV K GDL YALLLF +IP PDA+IYNTLIR YL    P++ LLLYL+MLH  V PN
Sbjct: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLHFNSPKSSLLLYLQMLHNSVFPN 120

Query: 121 KFTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFD 180
           KFTFPS+IRACCIDN++EEGKQIH HV+KFGF  DRF QNNLIHMYANFQSLE+ARRVFD
Sbjct: 121 KFTFPSVIRACCIDNSVEEGKQIHTHVVKFGFSKDRFCQNNLIHMYANFQSLEDARRVFD 180

Query: 181 GIELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFAL 240
            IELPDVV+WTTLLTGYAQ G++DE+L+VFESMPE +SASWNAMIS FVQNNRFHEAF L
Sbjct: 181 CIELPDVVAWTTLLTGYAQLGYVDESLRVFESMPERNSASWNAMISCFVQNNRFHEAFGL 240

Query: 241 FNRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCK 300
           FNRMR EK+VL+KY+AASMLSACTGLGALEQG WIHRYI+++ I+ DSKLATTLIDMYCK
Sbjct: 241 FNRMRIEKVVLEKYVAASMLSACTGLGALEQGKWIHRYIERNGIEFDSKLATTLIDMYCK 300

Query: 301 CGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNV 360
           CGCLD A+EVF  LPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMV PDNITFLNV
Sbjct: 301 CGCLDCAYEVFVHLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVKPDNITFLNV 360

Query: 361 LNACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSP 420
           L+ACAHSGLVEKG++YF  F+QVY I+PRTEHYGCMVDLYGR+G+L+EAMK+I EMPMSP
Sbjct: 361 LSACAHSGLVEKGQHYFYRFTQVYGIEPRTEHYGCMVDLYGRAGLLEEAMKVIDEMPMSP 420

Query: 421 DAGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRK 480
           D GVLGAFVGACKIHGNI++GEE+GKRVIEL+P+NSGRYVLLGNLYAEAGRW+ VAEVRK
Sbjct: 421 DVGVLGAFVGACKIHGNIELGEEVGKRVIELEPTNSGRYVLLGNLYAEAGRWEGVAEVRK 480

Query: 481 LMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEEE 540
           LMNDREVKKAAG SMIELEGVV+EFIAGGR HPEAKEIY K+NEM+ECIR  GYV E E 
Sbjct: 481 LMNDREVKKAAGVSMIELEGVVYEFIAGGRNHPEAKEIYDKLNEMLECIRSEGYVAENEI 540

Query: 541 EEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFERKFI 600
           EE  EKDNPVYYHSEKLA+AFGLLKT+AGE LRITKNLRVCKDCHQALKLVSKVF+RK I
Sbjct: 541 EE--EKDNPVYYHSEKLAIAFGLLKTKAGEILRITKNLRVCKDCHQALKLVSKVFQRKII 600

Query: 601 VRDRNRFHHFADGECSCNDYW 621
           VRDRNRFHHF +GECSCNDYW
Sbjct: 601 VRDRNRFHHFGNGECSCNDYW 619

BLAST of CmaCh10G007920 vs. NCBI nr
Match: gi|700210977|gb|KGN66073.1| (hypothetical protein Csa_1G569490 [Cucumis sativus])

HSP 1 Score: 957.6 bits (2474), Expect = 1.1e-275
Identity = 468/579 (80.83%), Positives = 518/579 (89.46%), Query Frame = 1

Query: 1   MSSLLALQLSASPINSPKVHTSPI-HGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLV 60
           M+SLL L    S  NS K + SPI H L SCSSMSELKQ+HSQIIRLGLSTDN+A+GRL+
Sbjct: 1   MASLLPLHPIPSLPNSTKFNPSPIFHSLSSCSSMSELKQFHSQIIRLGLSTDNNAIGRLI 60

Query: 61  KFCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPN 120
           KFCAV K GDL YALLLF +IP PDA+IYNTLIR YL    P++ LLLYL+MLH  V PN
Sbjct: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLHFNSPKSSLLLYLQMLHNSVFPN 120

Query: 121 KFTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFD 180
           KFTFPS+IRACCIDN++EEGKQIH HV+KFGF  DRF QNNLIHMYANFQSLE+ARRVFD
Sbjct: 121 KFTFPSVIRACCIDNSVEEGKQIHTHVVKFGFSKDRFCQNNLIHMYANFQSLEDARRVFD 180

Query: 181 GIELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFAL 240
            IELPDVV+WTTLLTGYAQ G++DE+L+VFESMPE +SASWNAMIS FVQNNRFHEAF L
Sbjct: 181 CIELPDVVAWTTLLTGYAQLGYVDESLRVFESMPERNSASWNAMISCFVQNNRFHEAFGL 240

Query: 241 FNRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCK 300
           FNRMR EK+VL+KY+AASMLSACTGLGALEQG WIHRYI+++ I+ DSKLATTLIDMYCK
Sbjct: 241 FNRMRIEKVVLEKYVAASMLSACTGLGALEQGKWIHRYIERNGIEFDSKLATTLIDMYCK 300

Query: 301 CGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNV 360
           CGCLD A+EVF  LPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMV PDNITFLNV
Sbjct: 301 CGCLDCAYEVFVHLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVKPDNITFLNV 360

Query: 361 LNACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSP 420
           L+ACAHSGLVEKG++YF  F+QVY I+PRTEHYGCMVDLYGR+G+L+EAMK+I EMPMSP
Sbjct: 361 LSACAHSGLVEKGQHYFYRFTQVYGIEPRTEHYGCMVDLYGRAGLLEEAMKVIDEMPMSP 420

Query: 421 DAGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRK 480
           D GVLGAFVGACKIHGNI++GEE+GKRVIEL+P+NSGRYVLLGNLYAEAGRW+ VAEVRK
Sbjct: 421 DVGVLGAFVGACKIHGNIELGEEVGKRVIELEPTNSGRYVLLGNLYAEAGRWEGVAEVRK 480

Query: 481 LMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEEE 540
           LMNDREVKKAAG SMIELEGVV+EFIAGGR HPEAKEIY K+NEM+ECIR  GYV E E 
Sbjct: 481 LMNDREVKKAAGVSMIELEGVVYEFIAGGRNHPEAKEIYDKLNEMLECIRSEGYVAENEI 540

Query: 541 EEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLR 579
           EE  EKDNPVYYHSEKLA+AFGLLKT+AGE LRITKNLR
Sbjct: 541 EE--EKDNPVYYHSEKLAIAFGLLKTKAGEILRITKNLR 577

BLAST of CmaCh10G007920 vs. NCBI nr
Match: gi|225466163|ref|XP_002263755.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Vitis vinifera])

HSP 1 Score: 938.3 bits (2424), Expect = 6.7e-270
Identity = 442/624 (70.83%), Positives = 532/624 (85.26%), Query Frame = 1

Query: 1   MSSLLALQLSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           MSSL  LQ S   ++S K H  P++GL SCS+M+ELKQYHSQIIRLGLS DNDAMGR++K
Sbjct: 1   MSSLQLLQASPPSLSSAKAHKLPLYGLDSCSTMAELKQYHSQIIRLGLSADNDAMGRVIK 60

Query: 61  FCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCA+ K+GDL+YAL +F  IP+PDAYIYNT+ RGYL+ Q  R C+ +Y  MLHK V PNK
Sbjct: 61  FCAISKSGDLNYALEVFDKIPHPDAYIYNTIFRGYLRWQLARNCIFMYSRMLHKSVSPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDG 180
           FT+P LIRACCID AIEEGKQIHAHVLKFGF AD FS NNLIHMY NFQSLE+ARRVFD 
Sbjct: 121 FTYPPLIRACCIDYAIEEGKQIHAHVLKFGFGADGFSLNNLIHMYVNFQSLEQARRVFDN 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALF 240
           +   DVVSWT+L+TGY+Q GF+D+A +VFE MPE +S SWNAMI+++VQ+NR HEAFALF
Sbjct: 181 MPQRDVVSWTSLITGYSQWGFVDKAREVFELMPERNSVSWNAMIAAYVQSNRLHEAFALF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
           +RMR E +VLDK++AASMLSACTGLGALEQG WIH YI+KS I+LDSKLATT+IDMYCKC
Sbjct: 241 DRMRLENVVLDKFVAASMLSACTGLGALEQGKWIHGYIEKSGIELDSKLATTVIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCL++A EVF +LP+KGISSWNCMIGG+AMHGKGEAAIELFK+ME +MV PD ITF+NVL
Sbjct: 301 GCLEKASEVFNELPQKGISSWNCMIGGLAMHGKGEAAIELFKEMEREMVAPDGITFVNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPD 420
           +ACAHSGLVE+G++YF + ++V  +KP  EH+GCMVDL GR+G+L+EA KLI EMP++PD
Sbjct: 361 SACAHSGLVEEGKHYFQYMTEVLGLKPGMEHFGCMVDLLGRAGLLEEARKLINEMPVNPD 420

Query: 421 AGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKL 480
           AGVLGA VGAC+IHGN ++GE+IGK+VIEL+P NSGRYVLL NLYA AGRW+ VA+VRKL
Sbjct: 421 AGVLGALVGACRIHGNTELGEQIGKKVIELEPHNSGRYVLLANLYASAGRWEDVAKVRKL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEE-- 540
           MNDR VKKA GFSMIE E  V EFIAGGR HP+AKEIY K++E++E IR +GYV + +  
Sbjct: 481 MNDRGVKKAPGFSMIESESGVDEFIAGGRAHPQAKEIYAKLDEILETIRSIGYVPDTDGV 540

Query: 541 --EEEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFER 600
             + +E EK+NP+YYHSEKLA+AFGLLKT+ GETLRI+KNLR+C+DCHQA KL+SKV++R
Sbjct: 541 LHDIDEEEKENPLYYHSEKLAIAFGLLKTKPGETLRISKNLRICRDCHQASKLISKVYDR 600

Query: 601 KFIVRDRNRFHHFADGECSCNDYW 621
           + I+RDRNRFHHF  G CSC DYW
Sbjct: 601 EIIIRDRNRFHHFRMGGCSCKDYW 624

BLAST of CmaCh10G007920 vs. NCBI nr
Match: gi|694387951|ref|XP_009369706.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Pyrus x bretschneideri])

HSP 1 Score: 934.5 bits (2414), Expect = 9.6e-269
Identity = 442/627 (70.49%), Positives = 524/627 (83.57%), Query Frame = 1

Query: 1   MSSLLALQLSASP---INSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGR 60
           M+SL  LQ +      ++SP+ H SPI GL+SCS+M+ELKQ HSQ+IRLGL+TDNDAMGR
Sbjct: 1   MNSLQVLQPTGPTPPSLSSPRTHISPIRGLESCSTMAELKQLHSQVIRLGLATDNDAMGR 60

Query: 61  LVKFCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVL 120
           ++KFCA+ KNGDL YAL +F  +P PDA+IYNT++RGYLQ Q PR C++LY +ML   V 
Sbjct: 61  VIKFCALSKNGDLGYALQVFDAMPQPDAFIYNTVMRGYLQCQLPRDCIVLYSQMLQDFVT 120

Query: 121 PNKFTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRV 180
           PN+FTFPS++RACC D A+ EGKQ+HAHV+K GF  D F QNNLIHMY  FQSLEEARRV
Sbjct: 121 PNRFTFPSVVRACCADGAVVEGKQVHAHVIKLGFGDDGFCQNNLIHMYVKFQSLEEARRV 180

Query: 181 FDGIELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAF 240
           FD +   DVVSWTTL+TGY+Q GF+DEA ++FE MPE +S SWNAMISS+VQ+ RFHEAF
Sbjct: 181 FDKMPRVDVVSWTTLITGYSQRGFVDEAFEMFELMPEKNSVSWNAMISSYVQSGRFHEAF 240

Query: 241 ALFNRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMY 300
           ALF RMR E + LDK+MAASMLSACTGLGALEQG WIH YI+KS I+LDSKLATT+IDMY
Sbjct: 241 ALFQRMRVENVELDKFMAASMLSACTGLGALEQGKWIHGYIEKSGIELDSKLATTIIDMY 300

Query: 301 CKCGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFL 360
           CKCGCL++AFEVF  LP KGISSWNCMIGG+AMHGKGEAA+ELF+ M+  MV PDNITF+
Sbjct: 301 CKCGCLEKAFEVFNGLPRKGISSWNCMIGGLAMHGKGEAAVELFEQMQRDMVAPDNITFV 360

Query: 361 NVLNACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPM 420
           NVL+ACAHSGLVEKG+ YF    +V+ I+PRTEH+GCMVDL GR+G L+EA KLI EMPM
Sbjct: 361 NVLSACAHSGLVEKGQQYFRSMVEVHGIEPRTEHFGCMVDLLGRAGRLEEARKLINEMPM 420

Query: 421 SPDAGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEV 480
           SP+ GVLGA +GACKIHGN+++G+EIG+RVIEL+P NSGRYVLL NLYA AGRWD+VA V
Sbjct: 421 SPNVGVLGALLGACKIHGNVELGDEIGRRVIELEPENSGRYVLLANLYANAGRWDNVANV 480

Query: 481 RKLMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEE 540
           R+LMNDR VKK  GFSMIELEG V EFIAGG  HP+ KEIY KV+EM++CIR  GYV + 
Sbjct: 481 RRLMNDRGVKKVPGFSMIELEGTVSEFIAGGGSHPQTKEIYSKVDEMLKCIRSAGYVPDT 540

Query: 541 E----EEEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKV 600
           E    + +E E++NP+YYHSEKLA+AFGLLKT+  ETLRI+KNLRVCKDCHQA KL+SKV
Sbjct: 541 EGVLHDLDEEERENPLYYHSEKLAIAFGLLKTKPRETLRISKNLRVCKDCHQASKLISKV 600

Query: 601 FERKFIVRDRNRFHHFADGECSCNDYW 621
           F+R+ IVRDRNRFHHF  GECSC DYW
Sbjct: 601 FDREIIVRDRNRFHHFKGGECSCKDYW 627

BLAST of CmaCh10G007920 vs. NCBI nr
Match: gi|645233810|ref|XP_008223521.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Prunus mume])

HSP 1 Score: 933.3 bits (2411), Expect = 2.1e-268
Identity = 441/624 (70.67%), Positives = 524/624 (83.97%), Query Frame = 1

Query: 1   MSSLLALQLSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           M+SL  LQ +   ++SPK   SP+ G++SCS+M+ELKQ HSQ+IRLGL+ DNDAMGR++K
Sbjct: 1   MTSLQVLQATPPHLSSPKTQISPLRGIESCSTMAELKQLHSQVIRLGLAADNDAMGRVIK 60

Query: 61  FCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCA+ KNGDL YAL +F T+ +PDA+IYNT++RGYLQ Q PR C++LY +ML   V PNK
Sbjct: 61  FCALSKNGDLGYALQVFDTMLHPDAFIYNTVMRGYLQCQLPRNCIVLYSQMLQDSVTPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDG 180
           +TFPS+IRACC D+AI E KQ+HAHV+K G+ AD F QNNLIHMY  FQSLEEARRVFD 
Sbjct: 121 YTFPSVIRACCNDDAIGEAKQVHAHVVKLGYGADGFCQNNLIHMYVKFQSLEEARRVFDK 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALF 240
           +   D VSWTTL+TGY+QCGF+DEA ++FE MPE +S SWNAMISS+VQ++RFHEAFALF
Sbjct: 181 MPRMDAVSWTTLITGYSQCGFVDEAFEIFELMPEKNSVSWNAMISSYVQSDRFHEAFALF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
            RMR EK+ LDK+MAASMLSACTGLGALEQG WIH YI+KS I+LDSKLATT+IDMYCKC
Sbjct: 241 QRMRVEKVELDKFMAASMLSACTGLGALEQGKWIHGYIEKSGIELDSKLATTIIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           G L++AFEVF  L  KGISSWNCMIGG+AMHGKGEAAIELF+ M+  MV PDNITF+NVL
Sbjct: 301 GFLEKAFEVFNGLSHKGISSWNCMIGGLAMHGKGEAAIELFEKMQRDMVAPDNITFVNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPD 420
           +ACAHSGLVE+G+ YF    +V+ I+PR EH+GCMVDL GR+GML+EA KLI EMPMSPD
Sbjct: 361 SACAHSGLVEEGQRYFQSMIEVHGIEPRKEHFGCMVDLLGRAGMLEEARKLISEMPMSPD 420

Query: 421 AGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKL 480
            G+LGA +GACKIHGN+++GE IGKRVIEL+P NSGRYVLL NLYA  GRW+ VA VR+L
Sbjct: 421 VGILGALLGACKIHGNVELGEHIGKRVIELEPENSGRYVLLANLYANVGRWEDVANVRRL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEE-- 540
           MNDR VKK  GFSMIELEGVV+EFIAGG  HP+ KEIY KV+EM++CIR  GYV + E  
Sbjct: 481 MNDRGVKKVPGFSMIELEGVVNEFIAGGGAHPQTKEIYAKVDEMLKCIRSAGYVPDTEGV 540

Query: 541 --EEEEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFER 600
             + +E EK+NP+YYHSEKLA+AFGLLKT+ GETLRI+KNLRVCKDCHQA KL+SKVF+R
Sbjct: 541 LHDLDEEEKENPLYYHSEKLAIAFGLLKTKPGETLRISKNLRVCKDCHQASKLISKVFDR 600

Query: 601 KFIVRDRNRFHHFADGECSCNDYW 621
           + IVRDRNRFHHF  GECSC DYW
Sbjct: 601 EIIVRDRNRFHHFKRGECSCKDYW 624

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PP449_ARATH  2.2e-156  45.00     Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN...
PP425_ARATH  2.5e-147  40.76     Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana GN...
PP367_ARATH  2.6e-141  40.73     Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN...
PP175_ARATH  5.1e-137  39.38     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP354_ARATH  5.9e-133  41.78     Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t...
Match Name        E-value   Identity  Description
A0A0A0LWF1_CUCSA  7.4e-276  80.83     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G569490 PE=4 SV=1
F6H9I8_VITVI      4.7e-270  70.83     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_04s0069g00930 PE=4 SV=...
M5Y189_PRUPE      3.3e-268  70.35     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa018015mg PE=4 SV=1
A0A061EPX5_THECC  1.4e-261  68.62     Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_0214...
A0A0J8BUR1_BETVU  7.2e-255  66.19     Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_7g174460 PE=4 S...
Match Name   E-value   Identity  Description
AT5G66520.1  1.2e-157  45.00     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G48910.1  1.4e-148  40.76     Pentatricopeptide repeat (PPR) superfamily protein
AT5G06540.1  1.5e-142  40.73     Pentatricopeptide repeat (PPR) superfamily protein
AT2G29760.1  2.9e-138  39.38     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G37380.1  3.3e-134  41.78     Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                        E-value   Identity  Description
gi|778662474|ref|XP_011659892.1|  6.6e-302  81.48     PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis s...
gi|700210977|gb|KGN66073.1|       1.1e-275  80.83     hypothetical protein Csa_1G569490 [Cucumis sativus]
gi|225466163|ref|XP_002263755.1|  6.7e-270  70.83     PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Vitis vin...
gi|694387951|ref|XP_009369706.1|  9.6e-269  70.49     PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Pyrus x b...
gi|645233810|ref|XP_008223521.1|  2.1e-268  70.67     PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Prunus mu...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh10G007920.1  CmaCh10G007920.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                        Source    Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR                  coord: 391..416, score: 2.
IPR002885  Pentatricopeptide repeat               PFAM      PF12854           PPR_1                coord: 184..212, score: 8.
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2                coord: 83..131, score: 2.1E-8
                                                                                                   coord: 318..364, score: 2.6E-10
                                                                                                   coord: 217..262, score: 2.
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756            coord: 219..251, score: 2.2E-9
                                                                                                   coord: 291..319, score: 7.3E-5
                                                                                                   coord: 187..214, score: 1.4E-5
                                                                                                   coord: 392..415, score: 2.9E-4
                                                                                                   coord: 86..119, score: 3.8E-4
                                                                                                   coord: 320..350, score: 2.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 388..418, score: 8.758
                                                                                                   coord: 321..351, score: 7.465
                                                                                                   coord: 119..153, score: 8.331
                                                                                                   coord: 84..118, score: 9.021
                                                                                                   coord: 185..219, score: 11.4
                                                                                                   coord: 16..50, score: 5.207
                                                                                                   coord: 286..320, score: 10.578
                                                                                                   coord: 154..184, score: 6.226
                                                                                                   coord: 251..285, score: 6.829
                                                                                                   coord: 220..250, score: 8.254
                                                                                                   coord: 352..387, score: 6.5
                                                                                                   coord: 454..488, score: 6
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10  -                    coord: 448..473, score: 3.9E-8
                                                                                                   coord: 186..273, score: 3.9E-8
                                                                                                   coord: 391..415, score: 3.
IPR011990  Tetratricopeptide-like helical domain  unknown   SSF48452          TPR-like             coord: 196..240, score: 8.97E-6
                                                                                                   coord: 435..471, score: 8.9
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED     coord: 1..495, score: 3.6E
None       No IPR available                       PANTHER   PTHR24015:SF615   SUBFAMILY NOT NAMED  coord: 1..495, score: 3.6E
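
Each alignment entry pairs a residue range on the protein with a score, so the length of a hit is its end coordinate minus its start coordinate plus one. A minimal sketch of extracting the pairs from text like the table above is given below. The helper names `parse_hits` and `hit_length` are hypothetical. The pattern tolerates a newline, a comma, or no separator at all between a score and the next `coord:`, and scores are kept as strings because some values in this record are cut off (for example `3.6E`).

```python
import re

# One hit = "coord: START..END" followed by "score: VALUE".
# The score is captured as text so truncated values are preserved verbatim.
HIT_RE = re.compile(
    r"coord:\s*(\d+)\.\.(\d+)\s*,?\s*score:\s*([0-9.]+(?:[Ee][+-]?\d*)?)"
)


def parse_hits(text: str) -> list[tuple[int, int, str]]:
    """Return (start, end, score_text) for every coord/score pair found."""
    return [(int(start), int(end), score) for start, end, score in HIT_RE.findall(text)]


def hit_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based residue range."""
    return end - start + 1


sample = "coord: 83..131\nscore: 2.1E-8coord: 318..364\nscore: 2.6E-10"
for start, end, score in parse_hits(sample):
    print(start, end, hit_length(start, end), score)
```

For the sample string this yields two hits of 49 and 47 residues, matching the first two PF13041 ranges listed above.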

The following gene(s) are orthologous to this gene:
Gene            Orthologue       Organism                   Block
CmaCh10G007920  CmoCh10G008200   Cucurbita moschata (Rifu)  cmacmoB067
CmaCh10G007920  Cp4.1LG18g02860  Cucurbita pepo (Zucchini)  cmacpeB080
CmaCh10G007920  Carg12933        Silver-seed gourd          carcmaB0119
The following gene(s) are paralogous to this gene:

None