CmaCh16G012950 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh16G012950
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cma_Chr16: 9797237 .. 9799687 (-)
RNA-Seq Expression: CmaCh16G012950
Synteny: CmaCh16G012950
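
The Location field gives a sequence identifier, a start and end coordinate, and a strand. The sketch below parses that string and derives the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state explicitly; the helper name `parse_location` and its regular expression are illustrative and not part of the database.

```python
import re

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cma_Chr16: 9797237 .. 9799687 (-)")

# Assumption: 1-based inclusive coordinates, so the span covers end - start + 1 bases.
span_length = end - start + 1
print(seqid, strand, span_length)
```

A "-" strand means the gene is read from the reverse complement of the reference sequence, so the sequences listed for this feature are expected to be given in the gene's own 5' to 3' orientation.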
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGGGAATGGATTTGTTAAACGTCTCACGGGTGACCTTTTATTTCTTGATTCATCGCCTCTTTCCAAGCTCTTGAACCAATGTGCTCGCTCGAAGTCAGCTAGAGACACGAGTCGTGTTCATGCTTGCATAATTAAATCGCCCTTTGCGTCCGAAGTTTTTATCCAAAATAGGCTCATTGATGTATATGGGAAATGTGGATGTGTGGGTGTTGCTCGCAAGGTGTTTGATAGAATGCTTGAGAGAAATATTTTCTCTTGGAACTCCATTATTTGTGCATTCACTAAGTCCGGATTTCTTGATGATGCTGTCCACATCTTTGAGAAGATGCCTCAAGTTGACCAATGCTCGTGGAATTCTATGATTTCGGGTTTTGAACAACATGATTGCTTTGATGAAGCTTTAAAATATTTTGTTCAAATGCACGGTCATGGATTTTTTATGAATGAATATTCATTCGGTAGTGCTCTCAGTGCTTGTGCAGGTTTACAAGATTTGAAAATGGGTTCCCAAATCCACAGTTTAATATATAGGTCAAATTATTTATCAGATATGTATATGGGCTCTGCTCTAGTAGATATGTACTCTAAATGTGGAAGAGTTGATTGTGCTCGAAGTGTTTTTGATGGAATGACTGTGAGAAGTAGAGTTTCCTGGAATAGCTTGATTACGTGTTATGAACAGAACGGCCCAGTTGATGAGGCTCTTAGTATTTTTGTCGAGATGATCGAATGTGGGGTCGAACCTGATGAAGTAACTCTTGCTAGTGTTGTTAGTGCATGTGCAACTGTCTCGGCAATCAAAGAAGGTCAGCAGATTCATGCTCGAGTTGTGAAATGTGATGAATTTAGAAATGATCTTATTTTAGGCAATGCGTTGCTTGATATGTATGCTAAATGTAATAGGATTAACGAGGCTAGAATAGTTTTCGATCGGATGCCAATTAGGAGTGTGGTGTCTGAAACCTCAATGGTAAGTGGATATGCAAAAGCATCAAGTGTTAAAGCTGCAAGATCTATGTTTTCAAATATGATGGTGAAAGATGTAATTACTTGGAATGCTCTTATTGCCGGATGTACACAAAATGGAGAGAATGAAGAGGCACTTACACTCTTCCGTCTTTTGAAAAGGGAGTCTGTTTGGCCTACACACTACACATTTGGCAATCTCCTCAATGCTTGTGCAAACCTTGCTGATTTGCAGCTTGGCCGACAGGCTCACTCTCATGTTTTAAAACACGGATTTCGATTCCGATATGGAGATGAGTCAGATATCTTTGTTGGGAATTCTCTAATAGATATGTATATGAAATGTGGATCAGTTGAGAGTGGTTGTAGGGTGTTTGAACGTATGTTGGAAAGGGATTGTGTGTCATGGAATGCTATGATAGTTGGATATGCACAAAATGGTTTTGGCAACAAGGCCCTTGGAATTTTCAGTGAAATGTTAGAATCAGGAGAGAAACCAGATCATGTAACAATGATTGGTGTTCTGTCTGCCTGTAGTCATGCCGGGCTGCTCGATGAAGGCCGCCATTACTTTCGATCGATGCGTGCACGACATGGTTTGGTACCGTTAAAGGATCATTATACATGTATGGTTGATTTACTTGGGCGAGCTGGCTGCCTTGAAGAAGCAAAAAATATAATAGAGGAGATGCCAATGCAGCCTGATGCTATCGTTTGGGGATCATTGCTTGCGGCTTGTAAAGTCCATAGGAACATAAAATTGGGGGAATATGTAGTGGAGAAGCTTTTAGAGGTAGATCCTGAGAATTCTGGGCCATATGTTCTTCTCTCTAATATGTATGCTGAACGTGGAGATTGGGGGAATGTTATGAGAATAAGGAAGCTGATGAGACAGAGAGGAGTGGTTAAACAACCAGGTTGCAGTTGGATTGAAATTCAGGGTGAGTTGAATGTTTTTATGGTTAAAGATAAAAGACATGCAAGGAAGCAAGAAATCTACATGCTTTTGAGGACACTTTTACAGCAGAT
GAAACGGGCTGGATATGTCCCATATGTTGGCAATGACGAGATTGATGAAGAACAATAGAAGATACACGACAAACCCTCATTGGTTAATTTATGAAATCCCACGTTGGTTGGAGAGGAGAACGAAACACTCTTTATAAGGGTGTGGAAATCTCTTCTTAGTGGACACGTCTTAAAAACCTTGAGTGAAAGCCGAAAGAAAAAACTCAAAGATGACAATATCTGCTAACAGTGGGCTTGGACCGTTACAAATTTAAATAAGAAAATGTCCTTGTTTTGATGACGTTTTTTAACTAACTATGCATGTTATGTGATGTTTTCACTTATAAAAATGTATAGATCATTTGTTATCCTCATTAAGGATGTATCTCATTCAATCACATACGAATATTCAAATTTATATTTAAATATCTAAACAAATATATAAATTTGAAAGTTTCATTAGATATATTAA

mRNA sequence

ATGGCGGGGAATGGATTTGTTAAACGTCTCACGGGTGACCTTTTATTTCTTGATTCATCGCCTCTTTCCAAGCTCTTGAACCAATGTGCTCGCTCGAAGTCAGCTAGAGACACGAGTCGTGTTCATGCTTGCATAATTAAATCGCCCTTTGCGTCCGAAGTTTTTATCCAAAATAGGCTCATTGATGTATATGGGAAATGTGGATGTGTGGGTGTTGCTCGCAAGGTGTTTGATAGAATGCTTGAGAGAAATATTTTCTCTTGGAACTCCATTATTTGTGCATTCACTAAGTCCGGATTTCTTGATGATGCTGTCCACATCTTTGAGAAGATGCCTCAAGTTGACCAATGCTCGTGGAATTCTATGATTTCGGGTTTTGAACAACATGATTGCTTTGATGAAGCTTTAAAATATTTTGTTCAAATGCACGGTCATGGATTTTTTATGAATGAATATTCATTCGGTAGTGCTCTCAGTGCTTGTGCAGGTTTACAAGATTTGAAAATGGGTTCCCAAATCCACAGTTTAATATATAGGTCAAATTATTTATCAGATATGTATATGGGCTCTGCTCTAGTAGATATGTACTCTAAATGTGGAAGAGTTGATTGTGCTCGAAGTGTTTTTGATGGAATGACTGTGAGAAGTAGAGTTTCCTGGAATAGCTTGATTACGTGTTATGAACAGAACGGCCCAGTTGATGAGGCTCTTAGTATTTTTGTCGAGATGATCGAATGTGGGGTCGAACCTGATGAAGTAACTCTTGCTAGTGTTGTTAGTGCATGTGCAACTGTCTCGGCAATCAAAGAAGGTCAGCAGATTCATGCTCGAGTTGTGAAATGTGATGAATTTAGAAATGATCTTATTTTAGGCAATGCGTTGCTTGATATGTATGCTAAATGTAATAGGATTAACGAGGCTAGAATAGTTTTCGATCGGATGCCAATTAGGAGTGTGGTGTCTGAAACCTCAATGGTAAGTGGATATGCAAAAGCATCAAGTGTTAAAGCTGCAAGATCTATGTTTTCAAATATGATGGTGAAAGATGTAATTACTTGGAATGCTCTTATTGCCGGATGTACACAAAATGGAGAGAATGAAGAGGCACTTACACTCTTCCGTCTTTTGAAAAGGGAGTCTGTTTGGCCTACACACTACACATTTGGCAATCTCCTCAATGCTTGTGCAAACCTTGCTGATTTGCAGCTTGGCCGACAGGCTCACTCTCATGTTTTAAAACACGGATTTCGATTCCGATATGGAGATGAGTCAGATATCTTTGTTGGGAATTCTCTAATAGATATGTATATGAAATGTGGATCAGTTGAGAGTGGTTGTAGGGTGTTTGAACGTATGTTGGAAAGGGATTGTGTGTCATGGAATGCTATGATAGTTGGATATGCACAAAATGGTTTTGGCAACAAGGCCCTTGGAATTTTCAGTGAAATGTTAGAATCAGGAGAGAAACCAGATCATGTAACAATGATTGGTGTTCTGTCTGCCTGTAGTCATGCCGGGCTGCTCGATGAAGGCCGCCATTACTTTCGATCGATGCGTGCACGACATGGTTTGGTACCGTTAAAGGATCATTATACATGTATGGTTGATTTACTTGGGCGAGCTGGCTGCCTTGAAGAAGCAAAAAATATAATAGAGGAGATGCCAATGCAGCCTGATGCTATCGTTTGGGGATCATTGCTTGCGGCTTGTAAAGTCCATAGGAACATAAAATTGGGGGAATATGTAGTGGAGAAGCTTTTAGAGGTAGATCCTGAGAATTCTGGGCCATATGTTCTTCTCTCTAATATGTATGCTGAACGTGGAGATTGGGGGAATGTTATGAGAATAAGGAAGCTGATGAGACAGAGAGGAGTGGTTAAACAACCAGGTTGCAGTTGGATTGAAATTCAGGGTGAGTTGAATGTTTTTATGGTTAAAGATAAAAGACATGCAAGGAAGCAAGAAATCTACATGCTTTTGAGGACACTTTTACAGCAGAT
GAAACGGGCTGGATATATATATTAA

Coding sequence (CDS)

ATGGCGGGGAATGGATTTGTTAAACGTCTCACGGGTGACCTTTTATTTCTTGATTCATCGCCTCTTTCCAAGCTCTTGAACCAATGTGCTCGCTCGAAGTCAGCTAGAGACACGAGTCGTGTTCATGCTTGCATAATTAAATCGCCCTTTGCGTCCGAAGTTTTTATCCAAAATAGGCTCATTGATGTATATGGGAAATGTGGATGTGTGGGTGTTGCTCGCAAGGTGTTTGATAGAATGCTTGAGAGAAATATTTTCTCTTGGAACTCCATTATTTGTGCATTCACTAAGTCCGGATTTCTTGATGATGCTGTCCACATCTTTGAGAAGATGCCTCAAGTTGACCAATGCTCGTGGAATTCTATGATTTCGGGTTTTGAACAACATGATTGCTTTGATGAAGCTTTAAAATATTTTGTTCAAATGCACGGTCATGGATTTTTTATGAATGAATATTCATTCGGTAGTGCTCTCAGTGCTTGTGCAGGTTTACAAGATTTGAAAATGGGTTCCCAAATCCACAGTTTAATATATAGGTCAAATTATTTATCAGATATGTATATGGGCTCTGCTCTAGTAGATATGTACTCTAAATGTGGAAGAGTTGATTGTGCTCGAAGTGTTTTTGATGGAATGACTGTGAGAAGTAGAGTTTCCTGGAATAGCTTGATTACGTGTTATGAACAGAACGGCCCAGTTGATGAGGCTCTTAGTATTTTTGTCGAGATGATCGAATGTGGGGTCGAACCTGATGAAGTAACTCTTGCTAGTGTTGTTAGTGCATGTGCAACTGTCTCGGCAATCAAAGAAGGTCAGCAGATTCATGCTCGAGTTGTGAAATGTGATGAATTTAGAAATGATCTTATTTTAGGCAATGCGTTGCTTGATATGTATGCTAAATGTAATAGGATTAACGAGGCTAGAATAGTTTTCGATCGGATGCCAATTAGGAGTGTGGTGTCTGAAACCTCAATGGTAAGTGGATATGCAAAAGCATCAAGTGTTAAAGCTGCAAGATCTATGTTTTCAAATATGATGGTGAAAGATGTAATTACTTGGAATGCTCTTATTGCCGGATGTACACAAAATGGAGAGAATGAAGAGGCACTTACACTCTTCCGTCTTTTGAAAAGGGAGTCTGTTTGGCCTACACACTACACATTTGGCAATCTCCTCAATGCTTGTGCAAACCTTGCTGATTTGCAGCTTGGCCGACAGGCTCACTCTCATGTTTTAAAACACGGATTTCGATTCCGATATGGAGATGAGTCAGATATCTTTGTTGGGAATTCTCTAATAGATATGTATATGAAATGTGGATCAGTTGAGAGTGGTTGTAGGGTGTTTGAACGTATGTTGGAAAGGGATTGTGTGTCATGGAATGCTATGATAGTTGGATATGCACAAAATGGTTTTGGCAACAAGGCCCTTGGAATTTTCAGTGAAATGTTAGAATCAGGAGAGAAACCAGATCATGTAACAATGATTGGTGTTCTGTCTGCCTGTAGTCATGCCGGGCTGCTCGATGAAGGCCGCCATTACTTTCGATCGATGCGTGCACGACATGGTTTGGTACCGTTAAAGGATCATTATACATGTATGGTTGATTTACTTGGGCGAGCTGGCTGCCTTGAAGAAGCAAAAAATATAATAGAGGAGATGCCAATGCAGCCTGATGCTATCGTTTGGGGATCATTGCTTGCGGCTTGTAAAGTCCATAGGAACATAAAATTGGGGGAATATGTAGTGGAGAAGCTTTTAGAGGTAGATCCTGAGAATTCTGGGCCATATGTTCTTCTCTCTAATATGTATGCTGAACGTGGAGATTGGGGGAATGTTATGAGAATAAGGAAGCTGATGAGACAGAGAGGAGTGGTTAAACAACCAGGTTGCAGTTGGATTGAAATTCAGGGTGAGTTGAATGTTTTTATGGTTAAAGATAAAAGACATGCAAGGAAGCAAGAAATCTACATGCTTTTGAGGACACTTTTACAGCAGAT
GAAACGGGCTGGATATATATATTAA

Protein sequence

MAGNGFVKRLTGDLLFLDSSPLSKLLNQCARSKSARDTSRVHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWNSMISGFEQHDCFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIFVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAKCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNGFGNKALGIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYMLLRTLLQQMKRAGYIY
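
The protein sequence can be checked against the coding sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` function is written here for illustration and is not a tool provided by the database. Only the first 39 bases and the last 33 bases of the CDS shown above are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCGGGGAATGGATTTGTTAAACGTCTCACGGGTGAC"
cds_end = "CAGCAGATGAAACGGGCTGGATATATATATTAA"

n_terminus = translate(cds_start)
c_terminus = translate(cds_end)
print(n_terminus, c_terminus)
```

The two fragments translate to the first 13 and last 10 residues of the protein sequence listed above.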
Homology
BLAST of CmaCh16G012950 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 932.9 bits (2410), Expect = 2.0e-270
Identity = 446/669 (66.67%), Positives = 541/669 (80.87%), Query Frame = 0

Query: 1   MAGNGFVKRLTGDLLFLDSSPLSKLLNQCARSK-SARDTSRVHACIIKSPFASEVFIQNR 60
           MA   F+K       F DSSP +KLL+ C +SK SA     VHA +IKS F++E+FIQNR
Sbjct: 1   MATKSFLKLAADLSSFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNR 60

Query: 61  LIDVYGKCGCVGVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSW 120
           LID Y KCG +   R+VFD+M +RNI++WNS++   TK GFLD+A  +F  MP+ DQC+W
Sbjct: 61  LIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTW 120

Query: 121 NSMISGFEQHDCFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYR 180
           NSM+SGF QHD  +EAL YF  MH  GF +NEYSF S LSAC+GL D+  G Q+HSLI +
Sbjct: 121 NSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAK 180

Query: 181 SNYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSI 240
           S +LSD+Y+GSALVDMYSKCG V+ A+ VFD M  R+ VSWNSLITC+EQNGP  EAL +
Sbjct: 181 SPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDV 240

Query: 241 FVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYA 300
           F  M+E  VEPDEVTLASV+SACA++SAIK GQ++H RVVK D+ RND+IL NA +DMYA
Sbjct: 241 FQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYA 300

Query: 301 KCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAG 360
           KC+RI EAR +FD MPIR+V++ETSM+SGYA A+S KAAR MF+ M  ++V++WNALIAG
Sbjct: 301 KCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAG 360

Query: 361 CTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFR 420
            TQNGENEEAL+LF LLKRESV PTHY+F N+L ACA+LA+L LG QAH HVLKHGF+F+
Sbjct: 361 YTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQ 420

Query: 421 YGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNGFGNKALGI 480
            G+E DIFVGNSLIDMY+KCG VE G  VF +M+ERDCVSWNAMI+G+AQNG+GN+AL +
Sbjct: 421 SGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALEL 480

Query: 481 FSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLG 540
           F EMLESGEKPDH+TMIGVLSAC HAG ++EGRHYF SM    G+ PL+DHYTCMVDLLG
Sbjct: 481 FREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLG 540

Query: 541 RAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVL 600
           RAG LEEAK++IEEMPMQPD+++WGSLLAACKVHRNI LG+YV EKLLEV+P NSGPYVL
Sbjct: 541 RAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVL 600

Query: 601 LSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYML 660
           LSNMYAE G W +VM +RK MR+ GV KQPGCSWI+IQG  +VFMVKDK H RK++I+ L
Sbjct: 601 LSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSL 660

Query: 661 LRTLLQQMK 669
           L  L+ +M+
Sbjct: 661 LDILIAEMR 669
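
The percentages in the HSP summary are the match counts divided by the alignment length. As a small sketch (the helper name `percent` is illustrative), the figures reported for this hit can be reproduced as follows:

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the summary line of the Q9SIT7 hit above.
identity = percent(446, 669)    # identical residues
positives = percent(541, 669)   # identical or conservatively substituted residues
print(identity, positives)
```

In the alignment itself, the middle line shows a letter where query and subject are identical and a `+` where the substitution scores positively, so the second count includes both kinds of position.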

BLAST of CmaCh16G012950 vs. ExPASy Swiss-Prot
Match: Q9SHZ8 (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 479.9 bits (1234), Expect = 4.6e-134
Identity = 248/687 (36.10%), Positives = 397/687 (57.79%), Query Frame = 0

Query: 23  SKLLNQCARSKSARDTSR-VHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRML 82
           + LL +     + R T++ VH  +IKS     V++ N L++VY K G    ARK+FD M 
Sbjct: 17  TNLLQKSVNKSNGRFTAQLVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEMP 76

Query: 83  ERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWNSMISGFEQHDCFDEALKYFVQ 142
            R  FSWN+++ A++K G +D     F+++PQ D  SW +MI G++    + +A++    
Sbjct: 77  LRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGD 136

Query: 143 MHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGR 202
           M   G    +++  + L++ A  + ++ G ++HS I +     ++ + ++L++MY+KCG 
Sbjct: 137 MVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGD 196

Query: 203 VDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIFVEMIECGV------------- 262
              A+ VFD M VR   SWN++I  + Q G +D A++ F +M E  +             
Sbjct: 197 PMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQR 256

Query: 263 -------------------EPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLI 322
                               PD  TLASV+SACA +  +  G+QIH+ +V      + ++
Sbjct: 257 GYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIV 316

Query: 323 LGNALLDMYAKCNRINEARIVFDRMPIRSVVSE--TSMVSGYAKASSVKAARSMFSNMMV 382
           L NAL+ MY++C  +  AR + ++   + +  E  T+++ GY K   +  A+++F ++  
Sbjct: 317 L-NALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKD 376

Query: 383 KDVITWNALIAGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQA 442
           +DV+ W A+I G  Q+G   EA+ LFR +      P  YT   +L+  ++LA L  G+Q 
Sbjct: 377 RDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQI 436

Query: 443 HSHVLKHGFRFRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERM-LERDCVSWNAMIVG 502
           H   +K       G+   + V N+LI MY K G++ S  R F+ +  ERD VSW +MI+ 
Sbjct: 437 HGSAVKS------GEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIA 496

Query: 503 YAQNGFGNKALGIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVP 562
            AQ+G   +AL +F  ML  G +PDH+T +GV SAC+HAGL+++GR YF  M+    ++P
Sbjct: 497 LAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIP 556

Query: 563 LKDHYTCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKL 622
              HY CMVDL GRAG L+EA+  IE+MP++PD + WGSLL+AC+VH+NI LG+   E+L
Sbjct: 557 TLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERL 616

Query: 623 LEVDPENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVK 674
           L ++PENSG Y  L+N+Y+  G W    +IRK M+   V K+ G SWIE++ +++VF V+
Sbjct: 617 LLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVE 676

BLAST of CmaCh16G012950 vs. ExPASy Swiss-Prot
Match: Q9SY02 (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 472.6 bits (1215), Expect = 7.3e-132
Identity = 250/616 (40.58%), Positives = 377/616 (61.20%), Query Frame = 0

Query: 58  NRLIDVYGKCGCVGVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQC 117
           N +I  Y + G   +ARK+FD M ER++ SWN +I  + ++  L  A  +FE MP+ D C
Sbjct: 99  NGMISGYLRNGEFELARKLFDEMPERDLVSWNVMIKGYVRNRNLGKARELFEIMPERDVC 158

Query: 118 SWNSMISGFEQHDCFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLI 177
           SWN+M+SG+ Q+ C D+A   F +M       N+ S+ + LSA   +Q+ KM  +   ++
Sbjct: 159 SWNTMLSGYAQNGCVDDARSVFDRMPE----KNDVSWNALLSAY--VQNSKM--EEACML 218

Query: 178 YRSNYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEAL 237
           ++S     +   + L+  + K  ++  AR  FD M VR  VSWN++IT Y Q+G +DEA 
Sbjct: 219 FKSRENWALVSWNCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEAR 278

Query: 238 SIFVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDM 297
            +F E        D  T  ++VS       ++E +++  ++ + +E     +  NA+L  
Sbjct: 279 QLFDE----SPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPERNE-----VSWNAMLAG 338

Query: 298 YAKCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALI 357
           Y +  R+  A+ +FD MP R+V +  +M++GYA+   +  A+++F  M  +D ++W A+I
Sbjct: 339 YVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMI 398

Query: 358 AGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFR 417
           AG +Q+G + EAL LF  ++RE       +F + L+ CA++  L+LG+Q H  ++K G+ 
Sbjct: 399 AGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGY- 458

Query: 418 FRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNGFGNKAL 477
                E+  FVGN+L+ MY KCGS+E    +F+ M  +D VSWN MI GY+++GFG  AL
Sbjct: 459 -----ETGCFVGNALLLMYCKCGSIEEANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVAL 518

Query: 478 GIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDL 537
             F  M   G KPD  TM+ VLSACSH GL+D+GR YF +M   +G++P   HY CMVDL
Sbjct: 519 RFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDL 578

Query: 538 LGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPY 597
           LGRAG LE+A N+++ MP +PDA +WG+LL A +VH N +L E   +K+  ++PENSG Y
Sbjct: 579 LGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMY 638

Query: 598 VLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIY 657
           VLLSN+YA  G WG+V ++R  MR +GV K PG SWIEIQ + + F V D+ H  K EI+
Sbjct: 639 VLLSNLYASSGRWGDVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIF 691

Query: 658 MLLRTLLQQMKRAGYI 674
             L  L  +MK+AGY+
Sbjct: 699 AFLEELDLRMKKAGYV 691

BLAST of CmaCh16G012950 vs. ExPASy Swiss-Prot
Match: Q9FRI5 (Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H74 PE=2 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 3.2e-127
Identity = 248/682 (36.36%), Positives = 383/682 (56.16%), Query Frame = 0

Query: 31  RSKSARDTSRVHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRMLERNIFSWNS 90
           R  S +    VH  II   F     I NRLIDVY K   +  AR++FD + E +  +  +
Sbjct: 26  RRTSLQLARAVHGNIITFGFQPRAHILNRLIDVYCKSSELNYARQLFDEISEPDKIARTT 85

Query: 91  IICAFTKSGFLDDAVHIFEKMP--QVDQCSWNSMISGFEQHDCFDEALKYFVQMHGHGFF 150
           ++  +  SG +  A  +FEK P    D   +N+MI+GF  ++    A+  F +M   GF 
Sbjct: 86  MVSGYCASGDITLARGVFEKAPVCMRDTVMYNAMITGFSHNNDGYSAINLFCKMKHEGFK 145

Query: 151 MNEYSFGSALSACAGL-QDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGR----VD 210
            + ++F S L+  A +  D K   Q H+   +S       + +ALV +YSKC      + 
Sbjct: 146 PDNFTFASVLAGLALVADDEKQCVQFHAAALKSGAGYITSVSNALVSVYSKCASSPSLLH 205

Query: 211 CARSVFDGMTVRSRVSWNSLITCYEQNGPVD----------------------------- 270
            AR VFD +  +   SW +++T Y +NG  D                             
Sbjct: 206 SARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLEGMDDNMKLVAYNAMISGYVNRG 265

Query: 271 ---EALSIFVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILG 330
              EAL +   M+  G+E DE T  SV+ ACAT   ++ G+Q+HA V++ ++F       
Sbjct: 266 FYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQLGKQVHAYVLRREDF--SFHFD 325

Query: 331 NALLDMYAKCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVI 390
           N+L+ +Y KC + +EAR +F++MP + +VS  +++SGY  +  +  A+ +F  M  K+++
Sbjct: 326 NSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYVSSGHIGEAKLIFKEMKEKNIL 385

Query: 391 TWNALIAGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHV 450
           +W  +I+G  +NG  EE L LF  +KRE   P  Y F   + +CA L     G+Q H+ +
Sbjct: 386 SWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSGAIKSCAVLGAYCNGQQYHAQL 445

Query: 451 LKHGFRFRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNG 510
           LK GF      +S +  GN+LI MY KCG VE   +VF  M   D VSWNA+I    Q+G
Sbjct: 446 LKIGF------DSSLSAGNALITMYAKCGVVEEARQVFRTMPCLDSVSWNALIAALGQHG 505

Query: 511 FGNKALGIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHY 570
            G +A+ ++ EML+ G +PD +T++ VL+ACSHAGL+D+GR YF SM   + + P  DHY
Sbjct: 506 HGAEAVDVYEEMLKKGIRPDRITLLTVLTACSHAGLVDQGRKYFDSMETVYRIPPGADHY 565

Query: 571 TCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDP 630
             ++DLL R+G   +A+++IE +P +P A +W +LL+ C+VH N++LG    +KL  + P
Sbjct: 566 ARLIDLLCRSGKFSDAESVIESLPFKPTAEIWEALLSGCRVHGNMELGIIAADKLFGLIP 625

Query: 631 ENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHA 674
           E+ G Y+LLSNM+A  G W  V R+RKLMR RGV K+  CSWIE++ +++ F+V D  H 
Sbjct: 626 EHDGTYMLLSNMHAATGQWEEVARVRKLMRDRGVKKEVACSWIEMETQVHTFLVDDTSHP 685

BLAST of CmaCh16G012950 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 447.6 bits (1150), Expect = 2.5e-124
Identity = 232/649 (35.75%), Positives = 373/649 (57.47%), Query Frame = 0

Query: 26  LNQCARSKSARDTSRVHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRMLERNI 85
           L+ CA+S++  +  ++H  I+K  +A ++F+QN L+  Y +CG +  ARKVFD M ERN+
Sbjct: 141 LSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNV 200

Query: 86  FSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWNSMISGFEQHDCFDEALKYFVQMHGH 145
            SW S+IC + +  F  DAV +F +M + ++ + NS+                       
Sbjct: 201 VSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSV----------------------- 260

Query: 146 GFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGRVDCA 205
                  +    +SACA L+DL+ G ++++ I  S    +  M SALVDMY KC  +D A
Sbjct: 261 -------TMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDLMVSALVDMYMKCNAIDVA 320

Query: 206 RSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIFVEMIECGVEPDEVTLASVVSACATV 265
           + +FD     +    N++ + Y + G   EAL +F  M++ GV PD +++ S +S+C+ +
Sbjct: 321 KRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQL 380

Query: 266 SAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAKCNRINEARIVFDRMPIRSVVSETSM 325
             I  G+  H  V++ + F +   + NAL+DMY KC+R + A  +FDRM  ++VV+  S+
Sbjct: 381 RNILWGKSCHGYVLR-NGFESWDNICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSI 440

Query: 326 VSGYAKASSVKAARSMFSNMMVKDVITWNALIAGCTQNGENEEALTLF-RLLKRESVWPT 385
           V+GY +   V AA   F  M  K++++WN +I+G  Q    EEA+ +F  +  +E V   
Sbjct: 441 VAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNAD 500

Query: 386 HYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRYGDESDIFVGNSLIDMYMKCGSVES 445
             T  ++ +AC +L  L L +  + ++ K+G +       D+ +G +L+DM+ +CG  ES
Sbjct: 501 GVTMMSIASACGHLGALDLAKWIYYYIEKNGIQL------DVRLGTTLVDMFSRCGDPES 560

Query: 446 GCRVFERMLERDCVSWNAMIVGYAQNGFGNKALGIFSEMLESGEKPDHVTMIGVLSACSH 505
              +F  +  RD  +W A I   A  G   +A+ +F +M+E G KPD V  +G L+ACSH
Sbjct: 561 AMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFVGALTACSH 620

Query: 506 AGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWG 565
            GL+ +G+  F SM   HG+ P   HY CMVDLLGRAG LEEA  +IE+MPM+P+ ++W 
Sbjct: 621 GGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPMEPNDVIWN 680

Query: 566 SLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRG 625
           SLLAAC+V  N+++  Y  EK+  + PE +G YVLLSN+YA  G W ++ ++R  M+++G
Sbjct: 681 SLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKVRLSMKEKG 740

Query: 626 VVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYMLLRTLLQQMKRAGYI 674
           + K PG S I+I+G+ + F   D+ H     I  +L  + Q+    G++
Sbjct: 741 LRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHV 752

BLAST of CmaCh16G012950 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 932.9 bits (2410), Expect = 1.4e-271
Identity = 446/669 (66.67%), Positives = 541/669 (80.87%), Query Frame = 0

Query: 1   MAGNGFVKRLTGDLLFLDSSPLSKLLNQCARSK-SARDTSRVHACIIKSPFASEVFIQNR 60
           MA   F+K       F DSSP +KLL+ C +SK SA     VHA +IKS F++E+FIQNR
Sbjct: 1   MATKSFLKLAADLSSFTDSSPFAKLLDSCIKSKLSAIYVRYVHASVIKSGFSNEIFIQNR 60

Query: 61  LIDVYGKCGCVGVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSW 120
           LID Y KCG +   R+VFD+M +RNI++WNS++   TK GFLD+A  +F  MP+ DQC+W
Sbjct: 61  LIDAYSKCGSLEDGRQVFDKMPQRNIYTWNSVVTGLTKLGFLDEADSLFRSMPERDQCTW 120

Query: 121 NSMISGFEQHDCFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYR 180
           NSM+SGF QHD  +EAL YF  MH  GF +NEYSF S LSAC+GL D+  G Q+HSLI +
Sbjct: 121 NSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAK 180

Query: 181 SNYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSI 240
           S +LSD+Y+GSALVDMYSKCG V+ A+ VFD M  R+ VSWNSLITC+EQNGP  EAL +
Sbjct: 181 SPFLSDVYIGSALVDMYSKCGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDV 240

Query: 241 FVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDMYA 300
           F  M+E  VEPDEVTLASV+SACA++SAIK GQ++H RVVK D+ RND+IL NA +DMYA
Sbjct: 241 FQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYA 300

Query: 301 KCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALIAG 360
           KC+RI EAR +FD MPIR+V++ETSM+SGYA A+S KAAR MF+ M  ++V++WNALIAG
Sbjct: 301 KCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAG 360

Query: 361 CTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFR 420
            TQNGENEEAL+LF LLKRESV PTHY+F N+L ACA+LA+L LG QAH HVLKHGF+F+
Sbjct: 361 YTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQ 420

Query: 421 YGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNGFGNKALGI 480
            G+E DIFVGNSLIDMY+KCG VE G  VF +M+ERDCVSWNAMI+G+AQNG+GN+AL +
Sbjct: 421 SGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEALEL 480

Query: 481 FSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLG 540
           F EMLESGEKPDH+TMIGVLSAC HAG ++EGRHYF SM    G+ PL+DHYTCMVDLLG
Sbjct: 481 FREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDLLG 540

Query: 541 RAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVL 600
           RAG LEEAK++IEEMPMQPD+++WGSLLAACKVHRNI LG+YV EKLLEV+P NSGPYVL
Sbjct: 541 RAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPYVL 600

Query: 601 LSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYML 660
           LSNMYAE G W +VM +RK MR+ GV KQPGCSWI+IQG  +VFMVKDK H RK++I+ L
Sbjct: 601 LSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIHSL 660

Query: 661 LRTLLQQMK 669
           L  L+ +M+
Sbjct: 661 LDILIAEMR 669
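
The AT2G13600 hit has the same bit score (932.9) in the Swiss-Prot and TAIR 10 searches but a different Expect value. A likely explanation, assuming the standard BLAST relation E = m·n·2^(−S′), where m·n is the effective search space and S′ the bit score, is that the two databases differ in size. The sketch below estimates the ratio of the two search spaces from the reported values. It works in log space to avoid very small intermediate numbers; the function name is illustrative, and the relation itself is an assumption about how these E-values were computed.

```python
import math

def log10_search_space(bit_score, e_value):
    """Solve E = (m*n) * 2**(-S') for log10(m*n) (assumed relation)."""
    return math.log10(e_value) + bit_score * math.log10(2)

# Bit score and Expect values copied from the two reports for the same protein.
swissprot = log10_search_space(932.9, 2.0e-270)
tair = log10_search_space(932.9, 1.4e-271)

# Ratio of the Swiss-Prot search space to the TAIR 10 search space.
ratio = 10 ** (swissprot - tair)
print(round(ratio, 1))
```

Under this assumption the Swiss-Prot search space is roughly 14 times larger than the TAIR 10 one, which is why identical alignments receive slightly less significant E-values there.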

BLAST of CmaCh16G012950 vs. TAIR 10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 479.9 bits (1234), Expect = 3.2e-135
Identity = 248/687 (36.10%), Positives = 397/687 (57.79%), Query Frame = 0

Query: 23  SKLLNQCARSKSARDTSR-VHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRML 82
           + LL +     + R T++ VH  +IKS     V++ N L++VY K G    ARK+FD M 
Sbjct: 17  TNLLQKSVNKSNGRFTAQLVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEMP 76

Query: 83  ERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWNSMISGFEQHDCFDEALKYFVQ 142
            R  FSWN+++ A++K G +D     F+++PQ D  SW +MI G++    + +A++    
Sbjct: 77  LRTAFSWNTVLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGD 136

Query: 143 MHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGR 202
           M   G    +++  + L++ A  + ++ G ++HS I +     ++ + ++L++MY+KCG 
Sbjct: 137 MVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGD 196

Query: 203 VDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIFVEMIECGV------------- 262
              A+ VFD M VR   SWN++I  + Q G +D A++ F +M E  +             
Sbjct: 197 PMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQMAERDIVTWNSMISGFNQR 256

Query: 263 -------------------EPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLI 322
                               PD  TLASV+SACA +  +  G+QIH+ +V      + ++
Sbjct: 257 GYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIHSHIVTTGFDISGIV 316

Query: 323 LGNALLDMYAKCNRINEARIVFDRMPIRSVVSE--TSMVSGYAKASSVKAARSMFSNMMV 382
           L NAL+ MY++C  +  AR + ++   + +  E  T+++ GY K   +  A+++F ++  
Sbjct: 317 L-NALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKD 376

Query: 383 KDVITWNALIAGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQA 442
           +DV+ W A+I G  Q+G   EA+ LFR +      P  YT   +L+  ++LA L  G+Q 
Sbjct: 377 RDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSVASSLASLSHGKQI 436

Query: 443 HSHVLKHGFRFRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERM-LERDCVSWNAMIVG 502
           H   +K       G+   + V N+LI MY K G++ S  R F+ +  ERD VSW +MI+ 
Sbjct: 437 HGSAVKS------GEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVSWTSMIIA 496

Query: 503 YAQNGFGNKALGIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVP 562
            AQ+G   +AL +F  ML  G +PDH+T +GV SAC+HAGL+++GR YF  M+    ++P
Sbjct: 497 LAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMKDVDKIIP 556

Query: 563 LKDHYTCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKL 622
              HY CMVDL GRAG L+EA+  IE+MP++PD + WGSLL+AC+VH+NI LG+   E+L
Sbjct: 557 TLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLGKVAAERL 616

Query: 623 LEVDPENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVK 674
           L ++PENSG Y  L+N+Y+  G W    +IRK M+   V K+ G SWIE++ +++VF V+
Sbjct: 617 LLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHKVHVFGVE 676

BLAST of CmaCh16G012950 vs. TAIR 10
Match: AT4G02750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 472.6 bits (1215), Expect = 5.2e-133
Identity = 250/616 (40.58%), Positives = 377/616 (61.20%), Query Frame = 0

Query: 58  NRLIDVYGKCGCVGVARKVFDRMLERNIFSWNSIICAFTKSGFLDDAVHIFEKMPQVDQC 117
           N +I  Y + G   +ARK+FD M ER++ SWN +I  + ++  L  A  +FE MP+ D C
Sbjct: 99  NGMISGYLRNGEFELARKLFDEMPERDLVSWNVMIKGYVRNRNLGKARELFEIMPERDVC 158

Query: 118 SWNSMISGFEQHDCFDEALKYFVQMHGHGFFMNEYSFGSALSACAGLQDLKMGSQIHSLI 177
           SWN+M+SG+ Q+ C D+A   F +M       N+ S+ + LSA   +Q+ KM  +   ++
Sbjct: 159 SWNTMLSGYAQNGCVDDARSVFDRMPE----KNDVSWNALLSAY--VQNSKM--EEACML 218

Query: 178 YRSNYLSDMYMGSALVDMYSKCGRVDCARSVFDGMTVRSRVSWNSLITCYEQNGPVDEAL 237
           ++S     +   + L+  + K  ++  AR  FD M VR  VSWN++IT Y Q+G +DEA 
Sbjct: 219 FKSRENWALVSWNCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEAR 278

Query: 238 SIFVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILGNALLDM 297
            +F E        D  T  ++VS       ++E +++  ++ + +E     +  NA+L  
Sbjct: 279 QLFDE----SPVQDVFTWTAMVSGYIQNRMVEEARELFDKMPERNE-----VSWNAMLAG 338

Query: 298 YAKCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVITWNALI 357
           Y +  R+  A+ +FD MP R+V +  +M++GYA+   +  A+++F  M  +D ++W A+I
Sbjct: 339 YVQGERMEMAKELFDVMPCRNVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMI 398

Query: 358 AGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHVLKHGFR 417
           AG +Q+G + EAL LF  ++RE       +F + L+ CA++  L+LG+Q H  ++K G+ 
Sbjct: 399 AGYSQSGHSFEALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGY- 458

Query: 418 FRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNGFGNKAL 477
                E+  FVGN+L+ MY KCGS+E    +F+ M  +D VSWN MI GY+++GFG  AL
Sbjct: 459 -----ETGCFVGNALLLMYCKCGSIEEANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVAL 518

Query: 478 GIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDL 537
             F  M   G KPD  TM+ VLSACSH GL+D+GR YF +M   +G++P   HY CMVDL
Sbjct: 519 RFFESMKREGLKPDDATMVAVLSACSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDL 578

Query: 538 LGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDPENSGPY 597
           LGRAG LE+A N+++ MP +PDA +WG+LL A +VH N +L E   +K+  ++PENSG Y
Sbjct: 579 LGRAGLLEDAHNLMKNMPFEPDAAIWGTLLGASRVHGNTELAETAADKIFAMEPENSGMY 638

Query: 598 VLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHARKQEIY 657
           VLLSN+YA  G WG+V ++R  MR +GV K PG SWIEIQ + + F V D+ H  K EI+
Sbjct: 639 VLLSNLYASSGRWGDVGKLRVRMRDKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIF 691

Query: 658 MLLRTLLQQMKRAGYI 674
             L  L  +MK+AGY+
Sbjct: 699 AFLEELDLRMKKAGYV 691

BLAST of CmaCh16G012950 vs. TAIR 10
Match: AT1G25360.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 457.2 bits (1175), Expect = 2.3e-128
Identity = 248/682 (36.36%), Positives = 383/682 (56.16%), Query Frame = 0

Query: 31  RSKSARDTSRVHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRMLERNIFSWNS 90
           R  S +    VH  II   F     I NRLIDVY K   +  AR++FD + E +  +  +
Sbjct: 26  RRTSLQLARAVHGNIITFGFQPRAHILNRLIDVYCKSSELNYARQLFDEISEPDKIARTT 85

Query: 91  IICAFTKSGFLDDAVHIFEKMP--QVDQCSWNSMISGFEQHDCFDEALKYFVQMHGHGFF 150
           ++  +  SG +  A  +FEK P    D   +N+MI+GF  ++    A+  F +M   GF 
Sbjct: 86  MVSGYCASGDITLARGVFEKAPVCMRDTVMYNAMITGFSHNNDGYSAINLFCKMKHEGFK 145

Query: 151 MNEYSFGSALSACAGL-QDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGR----VD 210
            + ++F S L+  A +  D K   Q H+   +S       + +ALV +YSKC      + 
Sbjct: 146 PDNFTFASVLAGLALVADDEKQCVQFHAAALKSGAGYITSVSNALVSVYSKCASSPSLLH 205

Query: 211 CARSVFDGMTVRSRVSWNSLITCYEQNGPVD----------------------------- 270
            AR VFD +  +   SW +++T Y +NG  D                             
Sbjct: 206 SARKVFDEILEKDERSWTTMMTGYVKNGYFDLGEELLEGMDDNMKLVAYNAMISGYVNRG 265

Query: 271 ---EALSIFVEMIECGVEPDEVTLASVVSACATVSAIKEGQQIHARVVKCDEFRNDLILG 330
              EAL +   M+  G+E DE T  SV+ ACAT   ++ G+Q+HA V++ ++F       
Sbjct: 266 FYQEALEMVRRMVSSGIELDEFTYPSVIRACATAGLLQLGKQVHAYVLRREDF--SFHFD 325

Query: 331 NALLDMYAKCNRINEARIVFDRMPIRSVVSETSMVSGYAKASSVKAARSMFSNMMVKDVI 390
           N+L+ +Y KC + +EAR +F++MP + +VS  +++SGY  +  +  A+ +F  M  K+++
Sbjct: 326 NSLVSLYYKCGKFDEARAIFEKMPAKDLVSWNALLSGYVSSGHIGEAKLIFKEMKEKNIL 385

Query: 391 TWNALIAGCTQNGENEEALTLFRLLKRESVWPTHYTFGNLLNACANLADLQLGRQAHSHV 450
           +W  +I+G  +NG  EE L LF  +KRE   P  Y F   + +CA L     G+Q H+ +
Sbjct: 386 SWMIMISGLAENGFGEEGLKLFSCMKREGFEPCDYAFSGAIKSCAVLGAYCNGQQYHAQL 445

Query: 451 LKHGFRFRYGDESDIFVGNSLIDMYMKCGSVESGCRVFERMLERDCVSWNAMIVGYAQNG 510
           LK GF      +S +  GN+LI MY KCG VE   +VF  M   D VSWNA+I    Q+G
Sbjct: 446 LKIGF------DSSLSAGNALITMYAKCGVVEEARQVFRTMPCLDSVSWNALIAALGQHG 505

Query: 511 FGNKALGIFSEMLESGEKPDHVTMIGVLSACSHAGLLDEGRHYFRSMRARHGLVPLKDHY 570
            G +A+ ++ EML+ G +PD +T++ VL+ACSHAGL+D+GR YF SM   + + P  DHY
Sbjct: 506 HGAEAVDVYEEMLKKGIRPDRITLLTVLTACSHAGLVDQGRKYFDSMETVYRIPPGADHY 565

Query: 571 TCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWGSLLAACKVHRNIKLGEYVVEKLLEVDP 630
             ++DLL R+G   +A+++IE +P +P A +W +LL+ C+VH N++LG    +KL  + P
Sbjct: 566 ARLIDLLCRSGKFSDAESVIESLPFKPTAEIWEALLSGCRVHGNMELGIIAADKLFGLIP 625

Query: 631 ENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRGVVKQPGCSWIEIQGELNVFMVKDKRHA 674
           E+ G Y+LLSNM+A  G W  V R+RKLMR RGV K+  CSWIE++ +++ F+V D  H 
Sbjct: 626 EHDGTYMLLSNMHAATGQWEEVARVRKLMRDRGVKKEVACSWIEMETQVHTFLVDDTSHP 685

BLAST of CmaCh16G012950 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 447.6 bits (1150), Expect = 1.8e-125
Identity = 232/649 (35.75%), Positives = 373/649 (57.47%), Query Frame = 0

Query: 26  LNQCARSKSARDTSRVHACIIKSPFASEVFIQNRLIDVYGKCGCVGVARKVFDRMLERNI 85
           L+ CA+S++  +  ++H  I+K  +A ++F+QN L+  Y +CG +  ARKVFD M ERN+
Sbjct: 141 LSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNV 200

Query: 86  FSWNSIICAFTKSGFLDDAVHIFEKMPQVDQCSWNSMISGFEQHDCFDEALKYFVQMHGH 145
            SW S+IC + +  F  DAV +F +M + ++ + NS+                       
Sbjct: 201 VSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSV----------------------- 260

Query: 146 GFFMNEYSFGSALSACAGLQDLKMGSQIHSLIYRSNYLSDMYMGSALVDMYSKCGRVDCA 205
                  +    +SACA L+DL+ G ++++ I  S    +  M SALVDMY KC  +D A
Sbjct: 261 -------TMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDLMVSALVDMYMKCNAIDVA 320

Query: 206 RSVFDGMTVRSRVSWNSLITCYEQNGPVDEALSIFVEMIECGVEPDEVTLASVVSACATV 265
           + +FD     +    N++ + Y + G   EAL +F  M++ GV PD +++ S +S+C+ +
Sbjct: 321 KRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQL 380

Query: 266 SAIKEGQQIHARVVKCDEFRNDLILGNALLDMYAKCNRINEARIVFDRMPIRSVVSETSM 325
             I  G+  H  V++ + F +   + NAL+DMY KC+R + A  +FDRM  ++VV+  S+
Sbjct: 381 RNILWGKSCHGYVLR-NGFESWDNICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSI 440

Query: 326 VSGYAKASSVKAARSMFSNMMVKDVITWNALIAGCTQNGENEEALTLF-RLLKRESVWPT 385
           V+GY +   V AA   F  M  K++++WN +I+G  Q    EEA+ +F  +  +E V   
Sbjct: 441 VAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNAD 500

Query: 386 HYTFGNLLNACANLADLQLGRQAHSHVLKHGFRFRYGDESDIFVGNSLIDMYMKCGSVES 445
             T  ++ +AC +L  L L +  + ++ K+G +       D+ +G +L+DM+ +CG  ES
Sbjct: 501 GVTMMSIASACGHLGALDLAKWIYYYIEKNGIQL------DVRLGTTLVDMFSRCGDPES 560

Query: 446 GCRVFERMLERDCVSWNAMIVGYAQNGFGNKALGIFSEMLESGEKPDHVTMIGVLSACSH 505
              +F  +  RD  +W A I   A  G   +A+ +F +M+E G KPD V  +G L+ACSH
Sbjct: 561 AMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGVAFVGALTACSH 620

Query: 506 AGLLDEGRHYFRSMRARHGLVPLKDHYTCMVDLLGRAGCLEEAKNIIEEMPMQPDAIVWG 565
            GL+ +G+  F SM   HG+ P   HY CMVDLLGRAG LEEA  +IE+MPM+P+ ++W 
Sbjct: 621 GGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPMEPNDVIWN 680

Query: 566 SLLAACKVHRNIKLGEYVVEKLLEVDPENSGPYVLLSNMYAERGDWGNVMRIRKLMRQRG 625
           SLLAAC+V  N+++  Y  EK+  + PE +G YVLLSN+YA  G W ++ ++R  M+++G
Sbjct: 681 SLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDMAKVRLSMKEKG 740

Query: 626 VVKQPGCSWIEIQGELNVFMVKDKRHARKQEIYMLLRTLLQQMKRAGYI 674
           + K PG S I+I+G+ + F   D+ H     I  +L  + Q+    G++
Sbjct: 741 LRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHV 752
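
The percentages in the HSP header for this alignment follow directly from the match counts over the aligned length. The sketch below is only an illustration, not part of the database output: it parses the summary line (with the label spelled "Positives") and recomputes both percentages. The function name `parse_hsp_counts` is hypothetical.

```python
import re

# HSP summary line for the AT3G22690.1 alignment (label spelled "Positives").
hsp_line = "Identity = 232/649 (35.75%), Positives = 373/649 (57.47%), Query Frame = 0"


def parse_hsp_counts(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    # Matches fields of the form "Label = N/TOTAL (PCT%)"; "Query Frame = 0"
    # has no N/TOTAL part and is therefore skipped.
    pattern = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")
    return {
        label: (int(n), int(total), float(pct))
        for label, n, total, pct in pattern.findall(line)
    }


counts = parse_hsp_counts(hsp_line)
for label, (n, total, reported) in counts.items():
    # Percent = matching positions / aligned length, rounded to 2 decimals.
    computed = round(100 * n / total, 2)
    print(f"{label}: {n}/{total} = {computed}% (reported {reported}%)")
```

Both recomputed values agree with the reported ones, which confirms that the percentages are taken over the 649 aligned positions rather than over the full protein length.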

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
Q9SIT7      2.0e-270  66.67     Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9SHZ8      4.6e-134  36.10     Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
Q9SY02      7.3e-132  40.58     Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX...
Q9FRI5      3.2e-127  36.36     Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana OX...
Q9LUJ2      2.5e-124  35.75     Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX...

Match Name   E-value   Identity  Description
AT2G13600.1  1.4e-271  66.67     Pentatricopeptide repeat (PPR) superfamily protein
AT2G22070.1  3.2e-135  36.10     pentatricopeptide (PPR) repeat-containing protein
AT4G02750.1  5.2e-133  40.58     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G25360.1  2.3e-128  36.36     Pentatricopeptide repeat (PPR) superfamily protein
AT3G22690.1  1.8e-125  35.75     CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment

IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 351..380  e-value: 2.6E-4  score: 19.0
    coord: 118..150  e-value: 2.9E-6  score: 25.1
    coord: 323..349  e-value: 0.0029  score: 15.6
    coord: 86..111   e-value: 1.5E-4  score: 19.8
    coord: 58..85    e-value: 1.5E-4  score: 19.7
    coord: 430..458  e-value: 3.0E-5  score: 21.9
    coord: 458..491  e-value: 7.6E-8  score: 30.1
    coord: 531..554  e-value: 0.0013  score: 16.7
    coord: 218..252  e-value: 8.3E-9  score: 33.1
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 218..263  e-value: 1.4E-9   score: 38.0
    coord: 456..502  e-value: 1.7E-8   score: 34.5
    coord: 348..397  e-value: 9.8E-11  score: 41.7
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 118..147  e-value: 1.2E-6  score: 28.3
    coord: 58..85    e-value: 5.2E-4  score: 20.1
    coord: 86..112   e-value: 2.5E-5  score: 24.2
    coord: 292..315  e-value: 0.023   score: 14.9
    coord: 190..213  e-value: 0.058   score: 13.7
    coord: 530..555  e-value: 6.4E-4  score: 19.8
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 456..490  score: 11.980759
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 84..114   score: 9.755614
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 425..455  score: 8.977363
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 287..321  score: 8.670445
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 216..250  score: 12.441133
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 349..383  score: 11.531345
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 115..149  score: 9.941957
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 417..522  e-value: 1.3E-25  score: 92.4
    coord: 523..673  e-value: 8.4E-13  score: 50.4
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 170..267  e-value: 6.4E-23  score: 83.0
    coord: 81..169   e-value: 1.5E-19  score: 72.0
    coord: 268..411  e-value: 1.7E-26  score: 94.7
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 185..323
None  No IPR available  PANTHER  PTHR47926  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 145..673
    coord: 51..113
None  No IPR available  PANTHER  PTHR47926:SF125  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 51..113
None  No IPR available  PANTHER  PTHR47926:SF125  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 145..673
None  No IPR available  PANTHER  PTHR47926  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 96..144
None  No IPR available  PANTHER  PTHR47926:SF125  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 96..144
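
The coordinate ranges in the InterPro results can be combined to see how the individual repeat hits tile the protein. The sketch below is a minimal illustration, not part of the InterProScan output. It takes the seven PROSITE PS51375 (PPR) coordinates listed above, merges hits that overlap or sit directly next to each other, and counts the residues covered. The helper name `merge_intervals` is hypothetical, and coordinates are assumed to be 1-based and inclusive.

```python
# PROSITE PS51375 (PPR) hit coordinates, copied from the InterPro results.
ppr_hits = [(456, 490), (84, 114), (425, 455), (287, 321),
            (216, 250), (349, 383), (115, 149)]


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        # A hit starting at or before (previous end + 1) extends the block.
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


blocks = merge_intervals(ppr_hits)
covered = sum(end - start + 1 for start, end in blocks)
print(blocks)   # [(84, 149), (216, 250), (287, 321), (349, 383), (425, 490)]
print(covered)  # 237
```

Under these assumptions the seven PROSITE hits collapse into five blocks, two of which (84..149 and 425..490) are pairs of back-to-back repeats.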

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh16G012950.1  CmaCh16G012950.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:1900865      chloroplast RNA modification
biological_process  GO:0008380      RNA splicing
cellular_component  GO:0009507      chloroplast
molecular_function  GO:0003729      mRNA binding
molecular_function  GO:0005515      protein binding