CmaCh16G011330 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh16G011330
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
LocationCma_Chr16: 8712164 .. 8713960 (-)
RNA-Seq ExpressionCmaCh16G011330
SyntenyCmaCh16G011330
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAAAAACCCTTCTATACCAAATTACAATTATACCTACAAACACAAAAGCCACTGCTTCAAAAATCTGCCGCCACAATCTCAATTCAAAATGCCTGCAATTTTCAGCCATGTTTCTCGACCAATTACCAAGATCCATGGACCAGCTTCCGAATGGTTCTTGCCGGTGGCGGCCGCACCGCCGAGTTGGCTCTAGCCGAGGCCTTTCAGGCGCTCAAATGCAACCCAAATGGCTACGCTCTGGTTCAACTGGTTCGTGCCTGCACCACCCACGCCTGGGATTCCTGTGGTCACCAGCTTCACAGCTACATTTTGCGATCTGGGTTTGCCTCTAATGTCTTCGTGGGCACAGCCATGGTAAATTTCTATGTCAAAAGTGAGTCCTTTGATAGTGCCCATAAGGTGTTTGATGAAATGCCTCAACCAAATTTGGTTTCTTGGAACTCTTTGATTTCTGGTTATGTGCACTGTGGACAGTTTCGCAATGCTTTGTCTTCGTTTATTGCGCTTGATGGGTCTGAAAATCTCGTGGATTCATATTCTTTGTCTGCAGCTTTAAATGCCTCTGGCCAGCTGGGTTGGCTGATATTGGGTCAGTCGATTCATTCAAAAGTTGTAAAGCTTGGACTAGAAAACAATACCATTGTTGCTAATTGTTTGATTGACACGTATGGAAAATGCGAGTCTTTTGAAGAGGCAGTTAAAGTGTTTAATGACATGATTGATAAGGATACAATTTCATGGAATTCAGTTATTGCAGCCAGTGCTAGAAATGGAAGGCTAGAACAGGCATCGAGGTATTTACGACAGATGCCTCGACCCGATACTATATCATATAACGAACTGATAAATGGTTGTGCTCAATTTGGAGATATTGAAGAAGCTGTAGAAATTCTATCAACGATGGATAGTCCAAACTCTTCATCTTGGAATGCAGTGTTAACAGGATATGTTGATAGGGATGAAGCATGGAAGGCTTTGAGTTTCTTCACTAAGATGCACTCGTGCAACATCGGCATGGATCAATTCACATTTTCAAGCATTTTGAGCGGCGTTGCAGGGCTCTCGGCACTGACATGGGGTTCGTCAATCCATTGTTGCATAACTAAACGTGGTTTAGATACATCGACTGTCGTTGGTAGCGCCTTAATCGATATGTACTCCAAATGTGGGCATGTGAACTATGCTGAAATGATATTTCAGTCATTGCCCAAGAAGAACTTGGTGTCTTGGAATGCAATGATCTCGGGTTTGGCTCGAAATGGCAAAACGATGGAAGTGATCCATCTTTTTGAGAAGTTAAAGACGATGAAAGATGTAACACCTGATAGCATCACATTTCTTAGCATTCTACTATCATGTTCAGATAACCAGGTTCCATTGGAGACCACAATGCAGTACTTCAAATCTATGATTGAGGATTTTGGTATCGAACCAACTGTTGAGCATTGTTGTACTATGGTTAGGCTAATGGGACAAAGAGGGGATGTTTATAGAAGTAAGAAGCTGATACATGAATTGGGGTTCGATTCGAATGGGTCGGTTTGGCGAGCTCTTTTGGGTGCTTGTGCTCTTTGGAAGGATTTAAAACTAGCAAAAGTAGCAGCTGCAAAAGTAATAGAATTAGGGGCTGCTGATGATTATGTACATGTTATGATGTCTAACATATTGGCATCTCATGGAAAATGGAGAGATGTGAGAGAAGTTAGGGAGCTGATGAGAAAGAAAGGGGTTAGAAAGGAAACTGGATATAGTTGGTTGATCGAGATGGAAAACGAAAATTCTTCATCAAACTAG

mRNA sequence

ATGCAAAAACCCTTCTATACCAAATTACAATTATACCTACAAACACAAAAGCCACTGCTTCAAAAATCTGCCGCCACAATCTCAATTCAAAATGCCTGCAATTTTCAGCCATGTTTCTCGACCAATTACCAAGATCCATGGACCAGCTTCCGAATGGTTCTTGCCGGTGGCGGCCGCACCGCCGAGTTGGCTCTAGCCGAGGCCTTTCAGGCGCTCAAATGCAACCCAAATGGCTACGCTCTGGTTCAACTGGTTCGTGCCTGCACCACCCACGCCTGGGATTCCTGTGGTCACCAGCTTCACAGCTACATTTTGCGATCTGGGTTTGCCTCTAATGTCTTCGTGGGCACAGCCATGGTAAATTTCTATGTCAAAAGTGAGTCCTTTGATAGTGCCCATAAGGTGTTTGATGAAATGCCTCAACCAAATTTGGTTTCTTGGAACTCTTTGATTTCTGGTTATGTGCACTGTGGACAGTTTCGCAATGCTTTGTCTTCGTTTATTGCGCTTGATGGGTCTGAAAATCTCGTGGATTCATATTCTTTGTCTGCAGCTTTAAATGCCTCTGGCCAGCTGGGTTGGCTGATATTGGGTCAGTCGATTCATTCAAAAGTTGTAAAGCTTGGACTAGAAAACAATACCATTGTTGCTAATTGTTTGATTGACACGTATGGAAAATGCGAGTCTTTTGAAGAGGCAGTTAAAGTGTTTAATGACATGATTGATAAGGATACAATTTCATGGAATTCAGTTATTGCAGCCAGTGCTAGAAATGGAAGGCTAGAACAGGCATCGAGGTATTTACGACAGATGCCTCGACCCGATACTATATCATATAACGAACTGATAAATGGTTGTGCTCAATTTGGAGATATTGAAGAAGCTGTAGAAATTCTATCAACGATGGATAGTCCAAACTCTTCATCTTGGAATGCAGTGTTAACAGGATATGTTGATAGGGATGAAGCATGGAAGGCTTTGAGTTTCTTCACTAAGATGCACTCGTGCAACATCGGCATGGATCAATTCACATTTTCAAGCATTTTGAGCGGCGTTGCAGGGCTCTCGGCACTGACATGGGGTTCGTCAATCCATTGTTGCATAACTAAACGTGGTTTAGATACATCGACTGTCGTTGGTAGCGCCTTAATCGATATGTACTCCAAATGTGGGCATGTGAACTATGCTGAAATGATATTTCAGTCATTGCCCAAGAAGAACTTGGTGTCTTGGAATGCAATGATCTCGGGTTTGGCTCGAAATGGCAAAACGATGGAAGTGATCCATCTTTTTGAGAAGTTAAAGACGATGAAAGATGTAACACCTGATAGCATCACATTTCTTAGCATTCTACTATCATGTTCAGATAACCAGGTTCCATTGGAGACCACAATGCAGTACTTCAAATCTATGATTGAGGATTTTGGTATCGAACCAACTGTTGAGCATTGTTGTACTATGGTTAGGCTAATGGGACAAAGAGGGGATGTTTATAGAAGTAAGAAGCTGATACATGAATTGGGGTTCGATTCGAATGGGTCGGTTTGGCGAGCTCTTTTGGGTGCTTGTGCTCTTTGGAAGGATTTAAAACTAGCAAAAGTAGCAGCTGCAAAAGTAATAGAATTAGGGGCTGCTGATGATTATGTACATGTTATGATGTCTAACATATTGGCATCTCATGGAAAATGGAGAGATGTGAGAGAAGTTAGGGAGCTGATGAGAAAGAAAGGGGTTAGAAAGGAAACTGGATATAGTTGGTTGATCGAGATGGAAAACGAAAATTCTTCATCAAACTAG

Coding sequence (CDS)

ATGCAAAAACCCTTCTATACCAAATTACAATTATACCTACAAACACAAAAGCCACTGCTTCAAAAATCTGCCGCCACAATCTCAATTCAAAATGCCTGCAATTTTCAGCCATGTTTCTCGACCAATTACCAAGATCCATGGACCAGCTTCCGAATGGTTCTTGCCGGTGGCGGCCGCACCGCCGAGTTGGCTCTAGCCGAGGCCTTTCAGGCGCTCAAATGCAACCCAAATGGCTACGCTCTGGTTCAACTGGTTCGTGCCTGCACCACCCACGCCTGGGATTCCTGTGGTCACCAGCTTCACAGCTACATTTTGCGATCTGGGTTTGCCTCTAATGTCTTCGTGGGCACAGCCATGGTAAATTTCTATGTCAAAAGTGAGTCCTTTGATAGTGCCCATAAGGTGTTTGATGAAATGCCTCAACCAAATTTGGTTTCTTGGAACTCTTTGATTTCTGGTTATGTGCACTGTGGACAGTTTCGCAATGCTTTGTCTTCGTTTATTGCGCTTGATGGGTCTGAAAATCTCGTGGATTCATATTCTTTGTCTGCAGCTTTAAATGCCTCTGGCCAGCTGGGTTGGCTGATATTGGGTCAGTCGATTCATTCAAAAGTTGTAAAGCTTGGACTAGAAAACAATACCATTGTTGCTAATTGTTTGATTGACACGTATGGAAAATGCGAGTCTTTTGAAGAGGCAGTTAAAGTGTTTAATGACATGATTGATAAGGATACAATTTCATGGAATTCAGTTATTGCAGCCAGTGCTAGAAATGGAAGGCTAGAACAGGCATCGAGGTATTTACGACAGATGCCTCGACCCGATACTATATCATATAACGAACTGATAAATGGTTGTGCTCAATTTGGAGATATTGAAGAAGCTGTAGAAATTCTATCAACGATGGATAGTCCAAACTCTTCATCTTGGAATGCAGTGTTAACAGGATATGTTGATAGGGATGAAGCATGGAAGGCTTTGAGTTTCTTCACTAAGATGCACTCGTGCAACATCGGCATGGATCAATTCACATTTTCAAGCATTTTGAGCGGCGTTGCAGGGCTCTCGGCACTGACATGGGGTTCGTCAATCCATTGTTGCATAACTAAACGTGGTTTAGATACATCGACTGTCGTTGGTAGCGCCTTAATCGATATGTACTCCAAATGTGGGCATGTGAACTATGCTGAAATGATATTTCAGTCATTGCCCAAGAAGAACTTGGTGTCTTGGAATGCAATGATCTCGGGTTTGGCTCGAAATGGCAAAACGATGGAAGTGATCCATCTTTTTGAGAAGTTAAAGACGATGAAAGATGTAACACCTGATAGCATCACATTTCTTAGCATTCTACTATCATGTTCAGATAACCAGGTTCCATTGGAGACCACAATGCAGTACTTCAAATCTATGATTGAGGATTTTGGTATCGAACCAACTGTTGAGCATTGTTGTACTATGGTTAGGCTAATGGGACAAAGAGGGGATGTTTATAGAAGTAAGAAGCTGATACATGAATTGGGGTTCGATTCGAATGGGTCGGTTTGGCGAGCTCTTTTGGGTGCTTGTGCTCTTTGGAAGGATTTAAAACTAGCAAAAGTAGCAGCTGCAAAAGTAATAGAATTAGGGGCTGCTGATGATTATGTACATGTTATGATGTCTAACATATTGGCATCTCATGGAAAATGGAGAGATGTGAGAGAAGTTAGGGAGCTGATGAGAAAGAAAGGGGTTAGAAAGGAAACTGGATATAGTTGGTTGATCGAGATGGAAAACGAAAATTCTTCATCAAACTAG

Protein sequence

MQKPFYTKLQLYLQTQKPLLQKSAATISIQNACNFQPCFSTNYQDPWTSFRMVLAGGGRTAELALAEAFQALKCNPNGYALVQLVRACTTHAWDSCGHQLHSYILRSGFASNVFVGTAMVNFYVKSESFDSAHKVFDEMPQPNLVSWNSLISGYVHCGQFRNALSSFIALDGSENLVDSYSLSAALNASGQLGWLILGQSIHSKVVKLGLENNTIVANCLIDTYGKCESFEEAVKVFNDMIDKDTISWNSVIAASARNGRLEQASRYLRQMPRPDTISYNELINGCAQFGDIEEAVEILSTMDSPNSSSWNAVLTGYVDRDEAWKALSFFTKMHSCNIGMDQFTFSSILSGVAGLSALTWGSSIHCCITKRGLDTSTVVGSALIDMYSKCGHVNYAEMIFQSLPKKNLVSWNAMISGLARNGKTMEVIHLFEKLKTMKDVTPDSITFLSILLSCSDNQVPLETTMQYFKSMIEDFGIEPTVEHCCTMVRLMGQRGDVYRSKKLIHELGFDSNGSVWRALLGACALWKDLKLAKVAAAKVIELGAADDYVHVMMSNILASHGKWRDVREVRELMRKKGVRKETGYSWLIEMENENSSSN
Homology
BLAST of CmaCh16G011330 vs. ExPASy Swiss-Prot
Match: Q9FGL1 (Putative pentatricopeptide repeat-containing protein At5g47460 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E103 PE=3 SV=1)

HSP 1 Score: 541.2 bits (1393), Expect = 1.5e-152
Identity = 262/551 (47.55%), Postives = 374/551 (67.88%), Query Frame = 0

Query: 40  STNYQDPWTSFRMVLAGGGRTAELALAEAFQALKCNPNGYALVQLVRACTTHAWDSCGHQ 99
           ST   + W++    LA  G    L  A         P+   LV L+R    + + S   Q
Sbjct: 17  STASSNSWSTIVPALARFGSIGVLRAAVELINDGEKPDASPLVHLLRVSGNYGYVSLCRQ 76

Query: 100 LHSYILRSGFASNVFVGTAMVNFYVKSESFDSAHKVFDEMPQPNLVSWNSLISGYVHCGQ 159
           LH Y+ + GF SN  +  +++ FY  S+S + AHKVFDEMP P+++SWNSL+SGYV  G+
Sbjct: 77  LHGYVTKHGFVSNTRLSNSLMRFYKTSDSLEDAHKVFDEMPDPDVISWNSLVSGYVQSGR 136

Query: 160 FRNALSSFIALDGSENLVDSYSLSAALNASGQLGWLILGQSIHSKVVKLGLE-NNTIVAN 219
           F+  +  F+ L  S+   + +S +AAL A  +L    LG  IHSK+VKLGLE  N +V N
Sbjct: 137 FQEGICLFLELHRSDVFPNEFSFTAALAACARLHLSPLGACIHSKLVKLGLEKGNVVVGN 196

Query: 220 CLIDTYGKCESFEEAVKVFNDMIDKDTISWNSVIAASARNGRLEQASRYLRQMPRPDTIS 279
           CLID YGKC   ++AV VF  M +KDT+SWN+++A+ +RNG+LE    +  QMP PDT++
Sbjct: 197 CLIDMYGKCGFMDDAVLVFQHMEEKDTVSWNAIVASCSRNGKLELGLWFFHQMPNPDTVT 256

Query: 280 YNELINGCAQFGDIEEAVEILSTMDSPNSSSWNAVLTGYVDRDEAWKALSFFTKMHSCNI 339
           YNELI+   + GD   A ++LS M +PNSSSWN +LTGYV+ +++ +A  FFTKMHS  +
Sbjct: 257 YNELIDAFVKSGDFNNAFQVLSDMPNPNSSSWNTILTGYVNSEKSGEATEFFTKMHSSGV 316

Query: 340 GMDQFTFSSILSGVAGLSALTWGSSIHCCITKRGLDTSTVVGSALIDMYSKCGHVNYAEM 399
             D+++ S +L+ VA L+ + WGS IH C  K GLD+  VV SALIDMYSKCG + +AE+
Sbjct: 317 RFDEYSLSIVLAAVAALAVVPWGSLIHACAHKLGLDSRVVVASALIDMYSKCGMLKHAEL 376

Query: 400 IFQSLPKKNLVSWNAMISGLARNGKTMEVIHLFEKLKTMKDVTPDSITFLSILLSCSDNQ 459
           +F ++P+KNL+ WN MISG ARNG ++E I LF +LK  + + PD  TFL++L  CS  +
Sbjct: 377 MFWTMPRKNLIVWNEMISGYARNGDSIEAIKLFNQLKQERFLKPDRFTFLNLLAVCSHCE 436

Query: 460 VPLETTMQYFKSMIEDFGIEPTVEHCCTMVRLMGQRGDVYRSKKLIHELGFDSNGSVWRA 519
           VP+E  + YF+ MI ++ I+P+VEHCC+++R MGQRG+V+++K++I E GF  +G  WRA
Sbjct: 437 VPMEVMLGYFEMMINEYRIKPSVEHCCSLIRAMGQRGEVWQAKQVIQEFGFGYDGVAWRA 496

Query: 520 LLGACALWKDLKLAKVAAAKVIELGAA--DDYVHVMMSNILASHGKWRDVREVRELMRKK 579
           LLGAC+  KDLK AK  AAK+IELG A  D+Y++++MSN+ A H +WR+V ++R++MR+ 
Sbjct: 497 LLGACSARKDLKAAKTVAAKMIELGDADKDEYLYIVMSNLYAYHERWREVGQIRKIMRES 556

Query: 580 GVRKETGYSWL 588
           GV KE G SW+
Sbjct: 557 GVLKEVGSSWI 567

BLAST of CmaCh16G011330 vs. ExPASy Swiss-Prot
Match: Q9FWA6 (Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E90 PE=2 SV=2)

HSP 1 Score: 292.0 bits (746), Expect = 1.6e-77
Identity = 183/608 (30.10%), Postives = 314/608 (51.64%), Query Frame = 0

Query: 49  SFRMVLAGGGRTAELALAEAF----QALKCNPNGYALVQLVRACTTHAWDSCGHQLHSYI 108
           S+  ++AG  +   L+LA  F    Q +    +      ++R+C   +    G QLH++ 
Sbjct: 248 SWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQSIYASVLRSCAALSELRLGGQLHAHA 307

Query: 109 LRSGFASNVFVGTAMVNFYVKSESFDSAHKVFDEMPQPNLVSWNSLISGYVHCGQFRNAL 168
           L+S FA++  V TA ++ Y K ++   A  +FD     N  S+N++I+GY        AL
Sbjct: 308 LKSDFAADGIVRTATLDMYAKCDNMQDAQILFDNSENLNRQSYNAMITGYSQEEHGFKAL 367

Query: 169 SSFIALDGSENLVDSYSLSAALNASGQLGWLILGQSIHSKVVKLGLENNTIVANCLIDTY 228
             F  L  S    D  SLS    A   +  L  G  I+   +K  L  +  VAN  ID Y
Sbjct: 368 LLFHRLMSSGLGFDEISLSGVFRACALVKGLSEGLQIYGLAIKSSLSLDVCVANAAIDMY 427

Query: 229 GKCESFEEAVKVFNDMIDKDTISWNSVIAASARNGR----LEQASRYLRQMPRPDTISYN 288
           GKC++  EA +VF++M  +D +SWN++IAA  +NG+    L      LR    PD  ++ 
Sbjct: 428 GKCQALAEAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETLFLFVSMLRSRIEPDEFTFG 487

Query: 289 ELINGC----------------------------------AQFGDIEEAVEILS------ 348
            ++  C                                  ++ G IEEA +I S      
Sbjct: 488 SILKACTGGSLGYGMEIHSSIVKSGMASNSSVGCSLIDMYSKCGMIEEAEKIHSRFFQRA 547

Query: 349 ----TMDSPNSS----------SWNAVLTGYVDRDEAWKALSFFTKMHSCNIGMDQFTFS 408
               TM+               SWN++++GYV ++++  A   FT+M    I  D+FT++
Sbjct: 548 NVSGTMEELEKMHNKRLQEMCVSWNSIISGYVMKEQSEDAQMLFTRMMEMGITPDKFTYA 607

Query: 409 SILSGVAGLSALTWGSSIHCCITKRGLDTSTVVGSALIDMYSKCGHVNYAEMIFQSLPKK 468
           ++L   A L++   G  IH  + K+ L +   + S L+DMYSKCG ++ + ++F+   ++
Sbjct: 608 TVLDTCANLASAGLGKQIHAQVIKKELQSDVYICSTLVDMYSKCGDLHDSRLMFEKSLRR 667

Query: 469 NLVSWNAMISGLARNGKTMEVIHLFEKLKTMKDVTPDSITFLSILLSCSDNQVPLETTMQ 528
           + V+WNAMI G A +GK  E I LFE++  ++++ P+ +TF+SIL +C+   + ++  ++
Sbjct: 668 DFVTWNAMICGYAHHGKGEEAIQLFERM-ILENIKPNHVTFISILRACAHMGL-IDKGLE 727

Query: 529 YFKSMIEDFGIEPTVEHCCTMVRLMGQRGDVYRSKKLIHELGFDSNGSVWRALLGACALW 588
           YF  M  D+G++P + H   MV ++G+ G V R+ +LI E+ F+++  +WR LLG C + 
Sbjct: 728 YFYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIREMPFEADDVIWRTLLGVCTIH 787

Query: 589 K-DLKLAKVAAAKVIELGAADDYVHVMMSNILASHGKWRDVREVRELMRKKGVRKETGYS 594
           + ++++A+ A A ++ L   D   + ++SN+ A  G W  V ++R  MR   ++KE G S
Sbjct: 788 RNNVEVAEEATAALLRLDPQDSSAYTLLSNVYADAGMWEKVSDLRRNMRGFKLKKEPGCS 847

BLAST of CmaCh16G011330 vs. ExPASy Swiss-Prot
Match: Q9FIB2 (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 291.2 bits (744), Expect = 2.7e-77
Identity = 168/535 (31.40%), Postives = 290/535 (54.21%), Query Frame = 0

Query: 97  GHQLHSYILRSGFAS-NVFVGTAMVNFYVKSESFDSAHKVFDEMPQPNLVSWNSLISGYV 156
           G ++H +++ +G     V +G  +VN Y K  S   A +VF  M   + VSWNS+I+G  
Sbjct: 332 GREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLD 391

Query: 157 HCGQFRNALSSFIALDGSENLVDSYSLSAALNASGQLGWLILGQSIHSKVVKLGLENNTI 216
             G F  A+  + ++   + L  S++L ++L++   L W  LGQ IH + +KLG++ N  
Sbjct: 392 QNGCFIEAVERYKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVS 451

Query: 217 VANCLIDTYGKCESFEEAVKVFNDMIDKDTISWNSVIAASARNGR-----------LEQA 276
           V+N L+  Y +     E  K+F+ M + D +SWNS+I A AR+ R            ++A
Sbjct: 452 VSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQRA 511

Query: 277 SRYLRQ------MPRPDTISYNEL---INGCA--------------------QFGDIEEA 336
            + L +      +    ++S+ EL   I+G A                    + G+++  
Sbjct: 512 GQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGEMDGC 571

Query: 337 VEILSTM-DSPNSSSWNAVLTGYVDRDEAWKALSFFTKMHSCNIGMDQFTFSSILSGVAG 396
            +I S M +  ++ +WN++++GY+  +   KAL     M      +D F ++++LS  A 
Sbjct: 572 EKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFAS 631

Query: 397 LSALTWGSSIHCCITKRGLDTSTVVGSALIDMYSKCGHVNYAEMIFQSLPKKNLVSWNAM 456
           ++ L  G  +H C  +  L++  VVGSAL+DMYSKCG ++YA   F ++P +N  SWN+M
Sbjct: 632 VATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSM 691

Query: 457 ISGLARNGKTMEVIHLFEKLKTMKDVTPDSITFLSILLSCSDNQVPLETTMQYFKSMIED 516
           ISG AR+G+  E + LFE +K      PD +TF+ +L +CS   + LE   ++F+SM + 
Sbjct: 692 ISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGL-LEEGFKHFESMSDS 751

Query: 517 FGIEPTVEHCCTMVRLMGQRGDVYRSKKLIHELGFDSNGSVWRALLGAC--ALWKDLKLA 576
           +G+ P +EH   M  ++G+ G++ + +  I ++    N  +WR +LGAC  A  +  +L 
Sbjct: 752 YGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELG 811

Query: 577 KVAAAKVIELGAADDYVHVMMSNILASHGKWRDVREVRELMRKKGVRKETGYSWL 588
           K AA  + +L   +   +V++ N+ A+ G+W D+ + R+ M+   V+KE GYSW+
Sbjct: 812 KKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWV 865

BLAST of CmaCh16G011330 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 285.4 bits (729), Expect = 1.5e-75
Identity = 169/531 (31.83%), Postives = 272/531 (51.22%), Query Frame = 0

Query: 97  GHQLHSYILRSGFASNVFVGTAMVNFYVKSESFDSAHKVFDEMPQPNLVSWNSLISGYVH 156
           G QLH +IL+SGF     VG ++V FY+K++  DSA KVFDEM + +++SWNS+I+GYV 
Sbjct: 214 GEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVS 273

Query: 157 CGQFRNALSSFIALDGSENLVDSYSLSAALNASGQLGWLILGQSIHSKVVKLGLENNTIV 216
            G     LS F+ +  S   +D  ++ +          + LG+++HS  VK         
Sbjct: 274 NGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRF 333

Query: 217 ANCLIDTYGKCESFEEAVKVFNDMIDKDTISWNSVIAASARNGRLEQASRYLRQMPR--- 276
            N L+D Y KC   + A  VF +M D+  +S+ S+IA  AR G   +A +   +M     
Sbjct: 334 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 393

Query: 277 -PDTISYNELINGCAQF-----------------------------------GDIEEAVE 336
            PD  +   ++N CA++                                   G ++EA  
Sbjct: 394 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 453

Query: 337 ILSTMDSPNSSSWNAVLTGYVDRDEAWKALSFFT-KMHSCNIGMDQFTFSSILSGVAGLS 396
           + S M   +  SWN ++ GY     A +ALS F   +       D+ T + +L   A LS
Sbjct: 454 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 513

Query: 397 ALTWGSSIHCCITKRGLDTSTVVGSALIDMYSKCGHVNYAEMIFQSLPKKNLVSWNAMIS 456
           A   G  IH  I + G  +   V ++L+DMY+KCG +  A M+F  +  K+LVSW  MI+
Sbjct: 514 AFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIA 573

Query: 457 GLARNGKTMEVIHLFEKLKTMKDVTPDSITFLSILLSCSDNQVPLETTMQYFKSMIEDFG 516
           G   +G   E I LF +++    +  D I+F+S+L +CS + + ++   ++F  M  +  
Sbjct: 574 GYGMHGFGKEAIALFNQMR-QAGIEADEISFVSLLYACSHSGL-VDEGWRFFNIMRHECK 633

Query: 517 IEPTVEHCCTMVRLMGQRGDVYRSKKLIHELGFDSNGSVWRALLGACALWKDLKLAKVAA 576
           IEPTVEH   +V ++ + GD+ ++ + I  +    + ++W ALL  C +  D+KLA+  A
Sbjct: 634 IEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVA 693

Query: 577 AKVIELGAADDYVHVMMSNILASHGKWRDVREVRELMRKKGVRKETGYSWL 588
            KV EL   +   +V+M+NI A   KW  V+ +R+ + ++G+RK  G SW+
Sbjct: 694 EKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWI 742

BLAST of CmaCh16G011330 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 283.1 bits (723), Expect = 7.3e-75
Identity = 165/558 (29.57%), Postives = 295/558 (52.87%), Query Frame = 0

Query: 76  PNGYALVQLVRACTTHAWDSCGHQLHSYILRSGFASNVFVGTAMVNFYVKSESFDSAHKV 135
           P  Y    L++ C   A    G ++H  +++SGF+ ++F  T + N Y K    + A KV
Sbjct: 133 PVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKV 192

Query: 136 FDEMPQPNLVSWNSLISGYVHCGQFRNALSSFIALDGSENLVDSY-SLSAALNASGQLGW 195
           FD MP+ +LVSWN++++GY   G  R AL    ++   ENL  S+ ++ + L A   L  
Sbjct: 193 FDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSM-CEENLKPSFITIVSVLPAVSALRL 252

Query: 196 LILGQSIHSKVVKLGLENNTIVANCLIDTYGKCESFEEAVKVFNDMIDKDTISWNSVIAA 255
           + +G+ IH   ++ G ++   ++  L+D Y KC S E A ++F+ M++++ +SWNS+I A
Sbjct: 253 ISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDA 312

Query: 256 SARNGRLEQASRYLRQM----PRPDTISYNELINGCAQFGDIEE---------------- 315
             +N   ++A    ++M     +P  +S    ++ CA  GD+E                 
Sbjct: 313 YVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRN 372

Query: 316 -------------------AVEILSTMDSPNSSSWNAVLTGYVDRDEAWKALSFFTKMHS 375
                              A  +   + S    SWNA++ G+        AL++F++M S
Sbjct: 373 VSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRS 432

Query: 376 CNIGMDQFTFSSILSGVAGLSALTWGSSIHCCITKRGLDTSTVVGSALIDMYSKCGHVNY 435
             +  D FT+ S+++ +A LS       IH  + +  LD +  V +AL+DMY+KCG +  
Sbjct: 433 RTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMI 492

Query: 436 AEMIFQSLPKKNLVSWNAMISGLARNGKTMEVIHLFEKLKTMKDVTPDSITFLSILLSCS 495
           A +IF  + ++++ +WNAMI G   +G     + LFE+++    + P+ +TFLS++ +CS
Sbjct: 493 ARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQ-KGTIKPNGVTFLSVISACS 552

Query: 496 DNQVPLETTMQYFKSMIEDFGIEPTVEHCCTMVRLMGQRGDVYRSKKLIHELGFDSNGSV 555
            + + +E  ++ F  M E++ IE +++H   MV L+G+ G +  +   I ++      +V
Sbjct: 553 HSGL-VEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNV 612

Query: 556 WRALLGACALWKDLKLAKVAAAKVIELGAADDYVHVMMSNILASHGKWRDVREVRELMRK 594
           + A+LGAC + K++  A+ AA ++ EL   D   HV+++NI  +   W  V +VR  M +
Sbjct: 613 YGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLR 672

BLAST of CmaCh16G011330 vs. TAIR 10
Match: AT5G47460.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 541.2 bits (1393), Expect = 1.1e-153
Identity = 262/551 (47.55%), Postives = 374/551 (67.88%), Query Frame = 0

Query: 40  STNYQDPWTSFRMVLAGGGRTAELALAEAFQALKCNPNGYALVQLVRACTTHAWDSCGHQ 99
           ST   + W++    LA  G    L  A         P+   LV L+R    + + S   Q
Sbjct: 17  STASSNSWSTIVPALARFGSIGVLRAAVELINDGEKPDASPLVHLLRVSGNYGYVSLCRQ 76

Query: 100 LHSYILRSGFASNVFVGTAMVNFYVKSESFDSAHKVFDEMPQPNLVSWNSLISGYVHCGQ 159
           LH Y+ + GF SN  +  +++ FY  S+S + AHKVFDEMP P+++SWNSL+SGYV  G+
Sbjct: 77  LHGYVTKHGFVSNTRLSNSLMRFYKTSDSLEDAHKVFDEMPDPDVISWNSLVSGYVQSGR 136

Query: 160 FRNALSSFIALDGSENLVDSYSLSAALNASGQLGWLILGQSIHSKVVKLGLE-NNTIVAN 219
           F+  +  F+ L  S+   + +S +AAL A  +L    LG  IHSK+VKLGLE  N +V N
Sbjct: 137 FQEGICLFLELHRSDVFPNEFSFTAALAACARLHLSPLGACIHSKLVKLGLEKGNVVVGN 196

Query: 220 CLIDTYGKCESFEEAVKVFNDMIDKDTISWNSVIAASARNGRLEQASRYLRQMPRPDTIS 279
           CLID YGKC   ++AV VF  M +KDT+SWN+++A+ +RNG+LE    +  QMP PDT++
Sbjct: 197 CLIDMYGKCGFMDDAVLVFQHMEEKDTVSWNAIVASCSRNGKLELGLWFFHQMPNPDTVT 256

Query: 280 YNELINGCAQFGDIEEAVEILSTMDSPNSSSWNAVLTGYVDRDEAWKALSFFTKMHSCNI 339
           YNELI+   + GD   A ++LS M +PNSSSWN +LTGYV+ +++ +A  FFTKMHS  +
Sbjct: 257 YNELIDAFVKSGDFNNAFQVLSDMPNPNSSSWNTILTGYVNSEKSGEATEFFTKMHSSGV 316

Query: 340 GMDQFTFSSILSGVAGLSALTWGSSIHCCITKRGLDTSTVVGSALIDMYSKCGHVNYAEM 399
             D+++ S +L+ VA L+ + WGS IH C  K GLD+  VV SALIDMYSKCG + +AE+
Sbjct: 317 RFDEYSLSIVLAAVAALAVVPWGSLIHACAHKLGLDSRVVVASALIDMYSKCGMLKHAEL 376

Query: 400 IFQSLPKKNLVSWNAMISGLARNGKTMEVIHLFEKLKTMKDVTPDSITFLSILLSCSDNQ 459
           +F ++P+KNL+ WN MISG ARNG ++E I LF +LK  + + PD  TFL++L  CS  +
Sbjct: 377 MFWTMPRKNLIVWNEMISGYARNGDSIEAIKLFNQLKQERFLKPDRFTFLNLLAVCSHCE 436

Query: 460 VPLETTMQYFKSMIEDFGIEPTVEHCCTMVRLMGQRGDVYRSKKLIHELGFDSNGSVWRA 519
           VP+E  + YF+ MI ++ I+P+VEHCC+++R MGQRG+V+++K++I E GF  +G  WRA
Sbjct: 437 VPMEVMLGYFEMMINEYRIKPSVEHCCSLIRAMGQRGEVWQAKQVIQEFGFGYDGVAWRA 496

Query: 520 LLGACALWKDLKLAKVAAAKVIELGAA--DDYVHVMMSNILASHGKWRDVREVRELMRKK 579
           LLGAC+  KDLK AK  AAK+IELG A  D+Y++++MSN+ A H +WR+V ++R++MR+ 
Sbjct: 497 LLGACSARKDLKAAKTVAAKMIELGDADKDEYLYIVMSNLYAYHERWREVGQIRKIMRES 556

Query: 580 GVRKETGYSWL 588
           GV KE G SW+
Sbjct: 557 GVLKEVGSSWI 567

BLAST of CmaCh16G011330 vs. TAIR 10
Match: AT3G02330.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 292.0 bits (746), Expect = 1.1e-78
Identity = 183/608 (30.10%), Postives = 314/608 (51.64%), Query Frame = 0

Query: 49  SFRMVLAGGGRTAELALAEAF----QALKCNPNGYALVQLVRACTTHAWDSCGHQLHSYI 108
           S+  ++AG  +   L+LA  F    Q +    +      ++R+C   +    G QLH++ 
Sbjct: 248 SWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQSIYASVLRSCAALSELRLGGQLHAHA 307

Query: 109 LRSGFASNVFVGTAMVNFYVKSESFDSAHKVFDEMPQPNLVSWNSLISGYVHCGQFRNAL 168
           L+S FA++  V TA ++ Y K ++   A  +FD     N  S+N++I+GY        AL
Sbjct: 308 LKSDFAADGIVRTATLDMYAKCDNMQDAQILFDNSENLNRQSYNAMITGYSQEEHGFKAL 367

Query: 169 SSFIALDGSENLVDSYSLSAALNASGQLGWLILGQSIHSKVVKLGLENNTIVANCLIDTY 228
             F  L  S    D  SLS    A   +  L  G  I+   +K  L  +  VAN  ID Y
Sbjct: 368 LLFHRLMSSGLGFDEISLSGVFRACALVKGLSEGLQIYGLAIKSSLSLDVCVANAAIDMY 427

Query: 229 GKCESFEEAVKVFNDMIDKDTISWNSVIAASARNGR----LEQASRYLRQMPRPDTISYN 288
           GKC++  EA +VF++M  +D +SWN++IAA  +NG+    L      LR    PD  ++ 
Sbjct: 428 GKCQALAEAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETLFLFVSMLRSRIEPDEFTFG 487

Query: 289 ELINGC----------------------------------AQFGDIEEAVEILS------ 348
            ++  C                                  ++ G IEEA +I S      
Sbjct: 488 SILKACTGGSLGYGMEIHSSIVKSGMASNSSVGCSLIDMYSKCGMIEEAEKIHSRFFQRA 547

Query: 349 ----TMDSPNSS----------SWNAVLTGYVDRDEAWKALSFFTKMHSCNIGMDQFTFS 408
               TM+               SWN++++GYV ++++  A   FT+M    I  D+FT++
Sbjct: 548 NVSGTMEELEKMHNKRLQEMCVSWNSIISGYVMKEQSEDAQMLFTRMMEMGITPDKFTYA 607

Query: 409 SILSGVAGLSALTWGSSIHCCITKRGLDTSTVVGSALIDMYSKCGHVNYAEMIFQSLPKK 468
           ++L   A L++   G  IH  + K+ L +   + S L+DMYSKCG ++ + ++F+   ++
Sbjct: 608 TVLDTCANLASAGLGKQIHAQVIKKELQSDVYICSTLVDMYSKCGDLHDSRLMFEKSLRR 667

Query: 469 NLVSWNAMISGLARNGKTMEVIHLFEKLKTMKDVTPDSITFLSILLSCSDNQVPLETTMQ 528
           + V+WNAMI G A +GK  E I LFE++  ++++ P+ +TF+SIL +C+   + ++  ++
Sbjct: 668 DFVTWNAMICGYAHHGKGEEAIQLFERM-ILENIKPNHVTFISILRACAHMGL-IDKGLE 727

Query: 529 YFKSMIEDFGIEPTVEHCCTMVRLMGQRGDVYRSKKLIHELGFDSNGSVWRALLGACALW 588
           YF  M  D+G++P + H   MV ++G+ G V R+ +LI E+ F+++  +WR LLG C + 
Sbjct: 728 YFYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIREMPFEADDVIWRTLLGVCTIH 787

Query: 589 K-DLKLAKVAAAKVIELGAADDYVHVMMSNILASHGKWRDVREVRELMRKKGVRKETGYS 594
           + ++++A+ A A ++ L   D   + ++SN+ A  G W  V ++R  MR   ++KE G S
Sbjct: 788 RNNVEVAEEATAALLRLDPQDSSAYTLLSNVYADAGMWEKVSDLRRNMRGFKLKKEPGCS 847

BLAST of CmaCh16G011330 vs. TAIR 10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 291.2 bits (744), Expect = 1.9e-78
Identity = 168/535 (31.40%), Postives = 290/535 (54.21%), Query Frame = 0

Query: 97  GHQLHSYILRSGFAS-NVFVGTAMVNFYVKSESFDSAHKVFDEMPQPNLVSWNSLISGYV 156
           G ++H +++ +G     V +G  +VN Y K  S   A +VF  M   + VSWNS+I+G  
Sbjct: 332 GREVHGHVITTGLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLD 391

Query: 157 HCGQFRNALSSFIALDGSENLVDSYSLSAALNASGQLGWLILGQSIHSKVVKLGLENNTI 216
             G F  A+  + ++   + L  S++L ++L++   L W  LGQ IH + +KLG++ N  
Sbjct: 392 QNGCFIEAVERYKSMRRHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVS 451

Query: 217 VANCLIDTYGKCESFEEAVKVFNDMIDKDTISWNSVIAASARNGR-----------LEQA 276
           V+N L+  Y +     E  K+F+ M + D +SWNS+I A AR+ R            ++A
Sbjct: 452 VSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQRA 511

Query: 277 SRYLRQ------MPRPDTISYNEL---INGCA--------------------QFGDIEEA 336
            + L +      +    ++S+ EL   I+G A                    + G+++  
Sbjct: 512 GQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGEMDGC 571

Query: 337 VEILSTM-DSPNSSSWNAVLTGYVDRDEAWKALSFFTKMHSCNIGMDQFTFSSILSGVAG 396
            +I S M +  ++ +WN++++GY+  +   KAL     M      +D F ++++LS  A 
Sbjct: 572 EKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFAS 631

Query: 397 LSALTWGSSIHCCITKRGLDTSTVVGSALIDMYSKCGHVNYAEMIFQSLPKKNLVSWNAM 456
           ++ L  G  +H C  +  L++  VVGSAL+DMYSKCG ++YA   F ++P +N  SWN+M
Sbjct: 632 VATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSM 691

Query: 457 ISGLARNGKTMEVIHLFEKLKTMKDVTPDSITFLSILLSCSDNQVPLETTMQYFKSMIED 516
           ISG AR+G+  E + LFE +K      PD +TF+ +L +CS   + LE   ++F+SM + 
Sbjct: 692 ISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGL-LEEGFKHFESMSDS 751

Query: 517 FGIEPTVEHCCTMVRLMGQRGDVYRSKKLIHELGFDSNGSVWRALLGAC--ALWKDLKLA 576
           +G+ P +EH   M  ++G+ G++ + +  I ++    N  +WR +LGAC  A  +  +L 
Sbjct: 752 YGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELG 811

Query: 577 KVAAAKVIELGAADDYVHVMMSNILASHGKWRDVREVRELMRKKGVRKETGYSWL 588
           K AA  + +L   +   +V++ N+ A+ G+W D+ + R+ M+   V+KE GYSW+
Sbjct: 812 KKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWV 865

BLAST of CmaCh16G011330 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 285.4 bits (729), Expect = 1.0e-76
Identity = 169/531 (31.83%), Postives = 272/531 (51.22%), Query Frame = 0

Query: 97  GHQLHSYILRSGFASNVFVGTAMVNFYVKSESFDSAHKVFDEMPQPNLVSWNSLISGYVH 156
           G QLH +IL+SGF     VG ++V FY+K++  DSA KVFDEM + +++SWNS+I+GYV 
Sbjct: 214 GEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVS 273

Query: 157 CGQFRNALSSFIALDGSENLVDSYSLSAALNASGQLGWLILGQSIHSKVVKLGLENNTIV 216
            G     LS F+ +  S   +D  ++ +          + LG+++HS  VK         
Sbjct: 274 NGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRF 333

Query: 217 ANCLIDTYGKCESFEEAVKVFNDMIDKDTISWNSVIAASARNGRLEQASRYLRQMPR--- 276
            N L+D Y KC   + A  VF +M D+  +S+ S+IA  AR G   +A +   +M     
Sbjct: 334 CNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGI 393

Query: 277 -PDTISYNELINGCAQF-----------------------------------GDIEEAVE 336
            PD  +   ++N CA++                                   G ++EA  
Sbjct: 394 SPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAEL 453

Query: 337 ILSTMDSPNSSSWNAVLTGYVDRDEAWKALSFFT-KMHSCNIGMDQFTFSSILSGVAGLS 396
           + S M   +  SWN ++ GY     A +ALS F   +       D+ T + +L   A LS
Sbjct: 454 VFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLS 513

Query: 397 ALTWGSSIHCCITKRGLDTSTVVGSALIDMYSKCGHVNYAEMIFQSLPKKNLVSWNAMIS 456
           A   G  IH  I + G  +   V ++L+DMY+KCG +  A M+F  +  K+LVSW  MI+
Sbjct: 514 AFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIA 573

Query: 457 GLARNGKTMEVIHLFEKLKTMKDVTPDSITFLSILLSCSDNQVPLETTMQYFKSMIEDFG 516
           G   +G   E I LF +++    +  D I+F+S+L +CS + + ++   ++F  M  +  
Sbjct: 574 GYGMHGFGKEAIALFNQMR-QAGIEADEISFVSLLYACSHSGL-VDEGWRFFNIMRHECK 633

Query: 517 IEPTVEHCCTMVRLMGQRGDVYRSKKLIHELGFDSNGSVWRALLGACALWKDLKLAKVAA 576
           IEPTVEH   +V ++ + GD+ ++ + I  +    + ++W ALL  C +  D+KLA+  A
Sbjct: 634 IEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVA 693

Query: 577 AKVIELGAADDYVHVMMSNILASHGKWRDVREVRELMRKKGVRKETGYSWL 588
            KV EL   +   +V+M+NI A   KW  V+ +R+ + ++G+RK  G SW+
Sbjct: 694 EKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSWI 742

BLAST of CmaCh16G011330 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 283.1 bits (723), Expect = 5.2e-76
Identity = 165/558 (29.57%), Postives = 295/558 (52.87%), Query Frame = 0

Query: 76  PNGYALVQLVRACTTHAWDSCGHQLHSYILRSGFASNVFVGTAMVNFYVKSESFDSAHKV 135
           P  Y    L++ C   A    G ++H  +++SGF+ ++F  T + N Y K    + A KV
Sbjct: 133 PVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKV 192

Query: 136 FDEMPQPNLVSWNSLISGYVHCGQFRNALSSFIALDGSENLVDSY-SLSAALNASGQLGW 195
           FD MP+ +LVSWN++++GY   G  R AL    ++   ENL  S+ ++ + L A   L  
Sbjct: 193 FDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSM-CEENLKPSFITIVSVLPAVSALRL 252

Query: 196 LILGQSIHSKVVKLGLENNTIVANCLIDTYGKCESFEEAVKVFNDMIDKDTISWNSVIAA 255
           + +G+ IH   ++ G ++   ++  L+D Y KC S E A ++F+ M++++ +SWNS+I A
Sbjct: 253 ISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDA 312

Query: 256 SARNGRLEQASRYLRQM----PRPDTISYNELINGCAQFGDIEE---------------- 315
             +N   ++A    ++M     +P  +S    ++ CA  GD+E                 
Sbjct: 313 YVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRN 372

Query: 316 -------------------AVEILSTMDSPNSSSWNAVLTGYVDRDEAWKALSFFTKMHS 375
                              A  +   + S    SWNA++ G+        AL++F++M S
Sbjct: 373 VSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRS 432

Query: 376 CNIGMDQFTFSSILSGVAGLSALTWGSSIHCCITKRGLDTSTVVGSALIDMYSKCGHVNY 435
             +  D FT+ S+++ +A LS       IH  + +  LD +  V +AL+DMY+KCG +  
Sbjct: 433 RTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMI 492

Query: 436 AEMIFQSLPKKNLVSWNAMISGLARNGKTMEVIHLFEKLKTMKDVTPDSITFLSILLSCS 495
           A +IF  + ++++ +WNAMI G   +G     + LFE+++    + P+ +TFLS++ +CS
Sbjct: 493 ARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQ-KGTIKPNGVTFLSVISACS 552

Query: 496 DNQVPLETTMQYFKSMIEDFGIEPTVEHCCTMVRLMGQRGDVYRSKKLIHELGFDSNGSV 555
            + + +E  ++ F  M E++ IE +++H   MV L+G+ G +  +   I ++      +V
Sbjct: 553 HSGL-VEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNV 612

Query: 556 WRALLGACALWKDLKLAKVAAAKVIELGAADDYVHVMMSNILASHGKWRDVREVRELMRK 594
           + A+LGAC + K++  A+ AA ++ EL   D   HV+++NI  +   W  V +VR  M +
Sbjct: 613 YGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLR 672

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9FGL11.5e-15247.55Putative pentatricopeptide repeat-containing protein At5g47460 OS=Arabidopsis th... [more]
Q9FWA61.6e-7730.10Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidop... [more]
Q9FIB22.7e-7731.40Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th... [more]
Q9SN391.5e-7531.83Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
Q3E6Q17.3e-7529.57Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
AT5G47460.11.1e-15347.55Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G02330.11.1e-7830.10Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G09950.11.9e-7831.40Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G18750.11.0e-7631.83Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G11290.15.2e-7629.57Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 459..597
e-value: 1.5E-8
score: 36.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 196..268
e-value: 6.1E-9
score: 37.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 269..359
e-value: 2.5E-19
score: 71.3
coord: 362..458
e-value: 1.0E-18
score: 69.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 57..195
e-value: 2.3E-19
score: 71.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 305..353
e-value: 4.1E-7
score: 30.1
coord: 406..452
e-value: 2.2E-8
score: 34.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 409..443
e-value: 1.8E-5
score: 22.6
coord: 218..244
e-value: 1.9E-5
score: 22.5
coord: 277..302
e-value: 6.6E-4
score: 17.7
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 274..302
e-value: 2.3E-5
score: 24.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 553..578
e-value: 0.77
score: 10.1
coord: 117..140
e-value: 0.08
score: 13.2
coord: 218..243
e-value: 2.0E-4
score: 21.4
coord: 145..165
e-value: 5.2E-4
score: 20.1
coord: 246..272
e-value: 0.0017
score: 18.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 213..247
score: 9.273317
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 275..309
score: 10.194067
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 407..437
score: 9.646002
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 64..588
NoneNo IPR availablePANTHERPTHR47928:SF110PPR CONTAINING PLANT-LIKE PROTEINcoord: 64..588

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh16G011330.1CmaCh16G011330.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1900865 chloroplast RNA modification
cellular_component GO:0009507 chloroplast
molecular_function GO:0005515 protein binding