CmaCh02G006950 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh02G006950
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cma_Chr02: 4240532 .. 4242979 (+)
RNA-Seq Expression: CmaCh02G006950
Synteny: CmaCh02G006950
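
The Location field can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the GFF3 convention); the page itself does not state this, and `parse_location` is a hypothetical helper name introduced only for illustration.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cma_Chr02: 4240532 .. 4242979 (+)")

# Assumption: 1-based inclusive coordinates, so both endpoints count.
length = end - start + 1
print(chrom, strand, length)  # Cma_Chr02 + 2448
```

Under that assumption the feature spans 2448 bp on the forward strand.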
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGCGCCGATCAACATTTCTACGACCCGTCGTCACCTATTTAGTTCCAAAACCTCCATGGTTCCACTTATTTCATACGCCCACTGACCCAATCGCTACTTCCAATGAGGTCTCCACCATAATCGAAACTGTCGATCCCATTGAAGATGCATTGGAAACCATAGCCCCTCATATATCATCTGATGTAATTACCTCAGTCATTCAAGAACAGCCGAATGCTCGACTTGGATTTCGACTTTTTATCTGGTCGTTGAGGAGAAGGCACCTGTGCTGCAGCGCCTCGCAGGATTTGATCATTGACAGGTTAGTAAAGGACAATGCCTTTGAATTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCTTCTACTGAAATTTCATCGGACGCCTTCTCTGTATTGATTGAGGCATACTCTAAAGCCGGCATGGAAGAGAAGGCCGTCCAATCGTTTGGCATGATGAAGGATTTTGAATGTAAGCCCAATATTTTTGCTTACAATTTGATTTTGCATGTTTTGGTGCGAAGAGAAGCGTTTTTGTTAGCATTAGCGGTGTATAATCAGATGCTCAAATGTAATTTGAATCCTAATGTGGTTACTTACAGCATTTTGATTCATGGATTCTGTAAAACTAGTAAAACTCAAGAAGCCCTTGTACTCTTTGATGAAATGACTGATAGAGACGTATTGCCCAACGAGATAACCTATTCGATTATCCTTTCTGGGTTGTGTCAAGCTAAGAAAATTGATGATGCACAGAGATTGTTCATTAAGATGAGAGCTAGTGGTTGTAGTCCAGATGTAATCACTTACAATGTTTTGCTTAATGGGTTTTGTAAGTTAGGTTATTTTGATGAAGCTTTTGCATTGTTGAGATCATTTGAGAAGGATGGCCATATTCTTGGAGTCAAAGGGTACAGTTGTTTGATTGATGGCTTGTTTAGGGCTAGGAGATATGATGAAGCACATATGTGGTACCAAAAATTTTCGAGGAAAAATGTAGAGCCTGATGTTATCTTGTATACTATAATGATCCAAGGCTTATGCCAAGAAGGTCGGGTTAACGAGGCATTGGCGTTGTTGGATGAGATGACGGAAAGAGGGTTTAGTCCAGATACTACTTGTTACAATGCTGTAATTAGAGGATTTTGTGATATGGGTCTTTTGGATAAGGCCCAGTCTCTTCGACTCGAGATTTCAAACCACGACTGTTTCCCCGACAACCACACGTATTCCATTCTCATTTGTGGTATGTGTAAGAATGGGTTAATTGATGAGGCACAACATGTATTCAATGAAATGGAGAAGCTTGGATGCCTTCCTTCTGTTGTGACCTTCAATTCTCTCATTGATGGATTTTGCAAGGCTGGTAAGCTTAAGGAAGCTCATCTTTTGTTTTACAAAATGGAGATAGGGAGAAAACCTTCTTTGTTCCTTCGACTTTTGCAAGGTGCCAATAAGGTTCTTGGTACTGTCGATCTCCAAGTTATGTTGGAACAATTATGCGAGTCGGGGTTGATTCATAAGGCCTACAAGCTTCTTATGCAGCTTGTTGAGAGTGGGGTTTTTCCAGACATTAGAACTTACAACATCCTAATCAATGGATTTTGCAAGACCAACAACATCGATGGTGCTTTCAAGCTCTTCAAGGACATGCAACTTAAAGGGCGCTTACCAGATTCAATTACGTATGGAACTCTAATAGATGGGCTCCACAGAGTCGGTAGGGACGAGGATGCTCTAGGGATTTTCGAACAAATGGTAAAGAATGGGTGCAAGCCTGAGTCTTCTGTTTACAAGTCTATCATGACTTGGTCGTGTCGAAGAAAAAAGGTTTCACTCGCGTTTAGTGTTTGGATGAAGTATCTGAGGAATTTTCGTGGCTGGAAAGATGAAAAGGTCAAAGTAGTAGAGGAAAGTTTCGACAAAGGAGACCTTGAAAAGGCGATCTCGAGAATAATCGAAATGGACTTGAACTCAAAAGACTTCGACTTGGCTCC
ATACACCATTTTTCTCATTGGATTGTGTCAAGCAGGGAGGGTTTCTGAAGCCTTCGCAATATTTTCTGTTCTTAAGGACTTCAAAAGGATTATAAGTTCAGCAAGCTGCGTGATGTTGATTGGTGGGCTTTGCGTTGAAGGAAAACTTGACCTGGCTGTGGAAGTTTTCCTTTATACACTAGAAACAGGCACTATGTTGATGCCTAGAATTTGTAACCAACTGCTAAGGCATCTTCATTTAGAGGACAGGAAGGATCATGCTTTTGTTCTTATACGTAGAATGGAGGCTTTTGGATATGATATGAATGCTTATCTCCACCACAGTACTAAGTCACTTCTTCATGATCATTGGAAGTCATTGAAAGCTAAAGCTAGACACGAGCAGTGGTTGACGAATTCACAGCAGCAACTCCTAAATGCCACATTTCCTATGGTTGAAAGTAATTAG

mRNA sequence

ATGAAGCGCCGATCAACATTTCTACGACCCGTCGTCACCTATTTAGTTCCAAAACCTCCATGGTTCCACTTATTTCATACGCCCACTGACCCAATCGCTACTTCCAATGAGGTCTCCACCATAATCGAAACTGTCGATCCCATTGAAGATGCATTGGAAACCATAGCCCCTCATATATCATCTGATGTAATTACCTCAGTCATTCAAGAACAGCCGAATGCTCGACTTGGATTTCGACTTTTTATCTGGTCGTTGAGGAGAAGGCACCTGTGCTGCAGCGCCTCGCAGGATTTGATCATTGACAGGTTAGTAAAGGACAATGCCTTTGAATTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCTTCTACTGAAATTTCATCGGACGCCTTCTCTGTATTGATTGAGGCATACTCTAAAGCCGGCATGGAAGAGAAGGCCGTCCAATCGTTTGGCATGATGAAGGATTTTGAATGTAAGCCCAATATTTTTGCTTACAATTTGATTTTGCATGTTTTGGTGCGAAGAGAAGCGTTTTTGTTAGCATTAGCGGTGTATAATCAGATGCTCAAATGTAATTTGAATCCTAATGTGGTTACTTACAGCATTTTGATTCATGGATTCTGTAAAACTAGTAAAACTCAAGAAGCCCTTGTACTCTTTGATGAAATGACTGATAGAGACGTATTGCCCAACGAGATAACCTATTCGATTATCCTTTCTGGGTTGTGTCAAGCTAAGAAAATTGATGATGCACAGAGATTGTTCATTAAGATGAGAGCTAGTGGTTGTAGTCCAGATGTAATCACTTACAATGTTTTGCTTAATGGGTTTTGTAAGTTAGGTTATTTTGATGAAGCTTTTGCATTGTTGAGATCATTTGAGAAGGATGGCCATATTCTTGGAGTCAAAGGGTACAGTTGTTTGATTGATGGCTTGTTTAGGGCTAGGAGATATGATGAAGCACATATGTGGTACCAAAAATTTTCGAGGAAAAATGTAGAGCCTGATGTTATCTTGTATACTATAATGATCCAAGGCTTATGCCAAGAAGGTCGGGTTAACGAGGCATTGGCGTTGTTGGATGAGATGACGGAAAGAGGGTTTAGTCCAGATACTACTTGTTACAATGCTGTAATTAGAGGATTTTGTGATATGGGTCTTTTGGATAAGGCCCAGTCTCTTCGACTCGAGATTTCAAACCACGACTGTTTCCCCGACAACCACACGTATTCCATTCTCATTTGTGGTATGTGTAAGAATGGGTTAATTGATGAGGCACAACATGTATTCAATGAAATGGAGAAGCTTGGATGCCTTCCTTCTGTTGTGACCTTCAATTCTCTCATTGATGGATTTTGCAAGGCTGGTAAGCTTAAGGAAGCTCATCTTTTGTTTTACAAAATGGAGATAGGGAGAAAACCTTCTTTGTTCCTTCGACTTTTGCAAGGTGCCAATAAGGTTCTTGGTACTGTCGATCTCCAAGTTATGTTGGAACAATTATGCGAGTCGGGGTTGATTCATAAGGCCTACAAGCTTCTTATGCAGCTTGTTGAGAGTGGGGTTTTTCCAGACATTAGAACTTACAACATCCTAATCAATGGATTTTGCAAGACCAACAACATCGATGGTGCTTTCAAGCTCTTCAAGGACATGCAACTTAAAGGGCGCTTACCAGATTCAATTACGTATGGAACTCTAATAGATGGGCTCCACAGAGTCGGTAGGGACGAGGATGCTCTAGGGATTTTCGAACAAATGGTAAAGAATGGGTGCAAGCCTGAGTCTTCTGTTTACAAGTCTATCATGACTTGGTCGTGTCGAAGAAAAAAGGTTTCACTCGCGTTTAGTGTTTGGATGAAGTATCTGAGGAATTTTCGTGGCTGGAAAGATGAAAAGGTCAAAGTAGTAGAGGAAAGTTTCGACAAAGGAGACCTTGAAAAGGCGATCTCGAGAATAATCGAAATGGACTTGAACTCAAAAGACTTCGACTTGGCTCC
ATACACCATTTTTCTCATTGGATTGTGTCAAGCAGGGAGGGTTTCTGAAGCCTTCGCAATATTTTCTGTTCTTAAGGACTTCAAAAGGATTATAAGTTCAGCAAGCTGCGTGATGTTGATTGGTGGGCTTTGCGTTGAAGGAAAACTTGACCTGGCTGTGGAAGTTTTCCTTTATACACTAGAAACAGGCACTATGTTGATGCCTAGAATTTGTAACCAACTGCTAAGGCATCTTCATTTAGAGGACAGGAAGGATCATGCTTTTGTTCTTATACGTAGAATGGAGGCTTTTGGATATGATATGAATGCTTATCTCCACCACAGTACTAAGTCACTTCTTCATGATCATTGGAAGTCATTGAAAGCTAAAGCTAGACACGAGCAGTGGTTGACGAATTCACAGCAGCAACTCCTAAATGCCACATTTCCTATGGTTGAAAGTAATTAG

Coding sequence (CDS)

ATGAAGCGCCGATCAACATTTCTACGACCCGTCGTCACCTATTTAGTTCCAAAACCTCCATGGTTCCACTTATTTCATACGCCCACTGACCCAATCGCTACTTCCAATGAGGTCTCCACCATAATCGAAACTGTCGATCCCATTGAAGATGCATTGGAAACCATAGCCCCTCATATATCATCTGATGTAATTACCTCAGTCATTCAAGAACAGCCGAATGCTCGACTTGGATTTCGACTTTTTATCTGGTCGTTGAGGAGAAGGCACCTGTGCTGCAGCGCCTCGCAGGATTTGATCATTGACAGGTTAGTAAAGGACAATGCCTTTGAATTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCTTCTACTGAAATTTCATCGGACGCCTTCTCTGTATTGATTGAGGCATACTCTAAAGCCGGCATGGAAGAGAAGGCCGTCCAATCGTTTGGCATGATGAAGGATTTTGAATGTAAGCCCAATATTTTTGCTTACAATTTGATTTTGCATGTTTTGGTGCGAAGAGAAGCGTTTTTGTTAGCATTAGCGGTGTATAATCAGATGCTCAAATGTAATTTGAATCCTAATGTGGTTACTTACAGCATTTTGATTCATGGATTCTGTAAAACTAGTAAAACTCAAGAAGCCCTTGTACTCTTTGATGAAATGACTGATAGAGACGTATTGCCCAACGAGATAACCTATTCGATTATCCTTTCTGGGTTGTGTCAAGCTAAGAAAATTGATGATGCACAGAGATTGTTCATTAAGATGAGAGCTAGTGGTTGTAGTCCAGATGTAATCACTTACAATGTTTTGCTTAATGGGTTTTGTAAGTTAGGTTATTTTGATGAAGCTTTTGCATTGTTGAGATCATTTGAGAAGGATGGCCATATTCTTGGAGTCAAAGGGTACAGTTGTTTGATTGATGGCTTGTTTAGGGCTAGGAGATATGATGAAGCACATATGTGGTACCAAAAATTTTCGAGGAAAAATGTAGAGCCTGATGTTATCTTGTATACTATAATGATCCAAGGCTTATGCCAAGAAGGTCGGGTTAACGAGGCATTGGCGTTGTTGGATGAGATGACGGAAAGAGGGTTTAGTCCAGATACTACTTGTTACAATGCTGTAATTAGAGGATTTTGTGATATGGGTCTTTTGGATAAGGCCCAGTCTCTTCGACTCGAGATTTCAAACCACGACTGTTTCCCCGACAACCACACGTATTCCATTCTCATTTGTGGTATGTGTAAGAATGGGTTAATTGATGAGGCACAACATGTATTCAATGAAATGGAGAAGCTTGGATGCCTTCCTTCTGTTGTGACCTTCAATTCTCTCATTGATGGATTTTGCAAGGCTGGTAAGCTTAAGGAAGCTCATCTTTTGTTTTACAAAATGGAGATAGGGAGAAAACCTTCTTTGTTCCTTCGACTTTTGCAAGGTGCCAATAAGGTTCTTGGTACTGTCGATCTCCAAGTTATGTTGGAACAATTATGCGAGTCGGGGTTGATTCATAAGGCCTACAAGCTTCTTATGCAGCTTGTTGAGAGTGGGGTTTTTCCAGACATTAGAACTTACAACATCCTAATCAATGGATTTTGCAAGACCAACAACATCGATGGTGCTTTCAAGCTCTTCAAGGACATGCAACTTAAAGGGCGCTTACCAGATTCAATTACGTATGGAACTCTAATAGATGGGCTCCACAGAGTCGGTAGGGACGAGGATGCTCTAGGGATTTTCGAACAAATGGTAAAGAATGGGTGCAAGCCTGAGTCTTCTGTTTACAAGTCTATCATGACTTGGTCGTGTCGAAGAAAAAAGGTTTCACTCGCGTTTAGTGTTTGGATGAAGTATCTGAGGAATTTTCGTGGCTGGAAAGATGAAAAGGTCAAAGTAGTAGAGGAAAGTTTCGACAAAGGAGACCTTGAAAAGGCGATCTCGAGAATAATCGAAATGGACTTGAACTCAAAAGACTTCGACTTGGCTCC
ATACACCATTTTTCTCATTGGATTGTGTCAAGCAGGGAGGGTTTCTGAAGCCTTCGCAATATTTTCTGTTCTTAAGGACTTCAAAAGGATTATAAGTTCAGCAAGCTGCGTGATGTTGATTGGTGGGCTTTGCGTTGAAGGAAAACTTGACCTGGCTGTGGAAGTTTTCCTTTATACACTAGAAACAGGCACTATGTTGATGCCTAGAATTTGTAACCAACTGCTAAGGCATCTTCATTTAGAGGACAGGAAGGATCATGCTTTTGTTCTTATACGTAGAATGGAGGCTTTTGGATATGATATGAATGCTTATCTCCACCACAGTACTAAGTCACTTCTTCATGATCATTGGAAGTCATTGAAAGCTAAAGCTAGACACGAGCAGTGGTTGACGAATTCACAGCAGCAACTCCTAAATGCCACATTTCCTATGGTTGAAAGTAATTAG

Protein sequence

MKRRSTFLRPVVTYLVPKPPWFHLFHTPTDPIATSNEVSTIIETVDPIEDALETIAPHISSDVITSVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMTWSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNSKDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAVEVFLYTLETGTMLMPRICNQLLRHLHLEDRKDHAFVLIRRMEAFGYDMNAYLHHSTKSLLHDHWKSLKAKARHEQWLTNSQQQLLNATFPMVESN
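
The relationship between the coding sequence and the protein sequence can be checked by translating codons. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first 36 bases of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGCGCCGATCAACATTTCTACGACCCGTCGTC"  # first 12 codons of the CDS
print(translate(cds_start))  # MKRRSTFLRPVV
```

The output matches the first twelve residues of the protein sequence listed above.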
Homology
BLAST of CmaCh02G006950 vs. ExPASy Swiss-Prot
Match: Q9SAJ5 (Pentatricopeptide repeat-containing protein At1g79540 OS=Arabidopsis thaliana OX=3702 GN=At1g79540 PE=2 SV=1)

HSP 1 Score: 777.3 bits (2006), Expect = 1.7e-223
Identity = 394/787 (50.06%), Positives = 533/787 (67.73%), Query Frame = 0

Query: 7   FLRPVVTYLVPKPPWF-HLFHTPTDPIATSNEVSTIIETVDPIEDALETIAPHISSDVIT 66
           F R V+ +   KP W    + +       S EV +I+    PIE ALE + P +S ++IT
Sbjct: 6   FFRSVIQF-YSKPSWMQRSYSSGNAEFNISGEVISILAKKKPIEPALEPLVPFLSKNIIT 65

Query: 67  SVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELKDSSTE 126
           SVI+++ N +LGFR FIW+ RR  L    S  L+ID L +DN  +LYW+TL+ELK     
Sbjct: 66  SVIKDEVNRQLGFRFFIWASRRERLRSRESFGLVIDMLSEDNGCDLYWQTLEELKSGGVS 125

Query: 127 ISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREA-FLLALA 186
           + S  F VLI AY+K GM EKAV+SFG MK+F+C+P++F YN+IL V++R E  F+LA A
Sbjct: 126 VDSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILRVMMREEVFFMLAFA 185

Query: 187 VYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLC 246
           VYN+MLKCN +PN+ T+ IL+ G  K  +T +A  +FD+MT R + PN +TY+I++SGLC
Sbjct: 186 VYNEMLKCNCSPNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISGLC 245

Query: 247 QAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVK 306
           Q    DDA++LF +M+ SG  PD + +N LL+GFCKLG   EAF LLR FEKDG +LG++
Sbjct: 246 QRGSADDARKLFYEMQTSGNYPDSVAHNALLDGFCKLGRMVEAFELLRLFEKDGFVLGLR 305

Query: 307 GYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEM 366
           GYS LIDGLFRARRY +A   Y    +KN++PD+ILYTI+IQGL + G++ +AL LL  M
Sbjct: 306 GYSSLIDGLFRARRYTQAFELYANMLKKNIKPDIILYTILIQGLSKAGKIEDALKLLSSM 365

Query: 367 TERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLI 426
             +G SPDT CYNAVI+  C  GLL++ +SL+LE+S  + FPD  T++ILIC MC+NGL+
Sbjct: 366 PSKGISPDTYCYNAVIKALCGRGLLEEGRSLQLEMSETESFPDACTHTILICSMCRNGLV 425

Query: 427 DEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQG 486
            EA+ +F E+EK GC PSV TFN+LIDG CK+G+LKEA LL +KME+GR  SLFLRL   
Sbjct: 426 REAEEIFTEIEKSGCSPSVATFNALIDGLCKSGELKEARLLLHKMEVGRPASLFLRLSHS 485

Query: 487 ANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDG 546
            N+   T         + ESG I KAY+ L    ++G  PDI +YN+LINGFC+  +IDG
Sbjct: 486 GNRSFDT---------MVESGSILKAYRDLAHFADTGSSPDIVSYNVLINGFCRAGDIDG 545

Query: 547 AFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMT 606
           A KL   +QLKG  PDS+TY TLI+GLHRVGR+E+A  +F    K+  +   +VY+S+MT
Sbjct: 546 ALKLLNVLQLKGLSPDSVTYNTLINGLHRVGREEEAFKLF--YAKDDFRHSPAVYRSLMT 605

Query: 607 WSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNSKDFD 666
           WSCR++KV +AF++WMKYL+      DE    +E+ F +G+ E+A+ R+IE+D    +  
Sbjct: 606 WSCRKRKVLVAFNLWMKYLKKISCLDDETANEIEQCFKEGETERALRRLIELDTRKDELT 665

Query: 667 LAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAVEVFL 726
           L PYTI+LIGLCQ+GR  EA  +FSVL++ K +++  SCV LI GLC   +LD A+EVFL
Sbjct: 666 LGPYTIWLIGLCQSGRFHEALMVFSVLREKKILVTPPSCVKLIHGLCKREQLDAAIEVFL 725

Query: 727 YTLETGTMLMPRICNQLLRHLHLEDRKDHAFV--LIRRMEAFGYDMNAYL-------HHS 783
           YTL+    LMPR+CN LL  L LE  +    V  L  RME  GY++++ L       H  
Sbjct: 726 YTLDNNFKLMPRVCNYLLSSL-LESTEKMEIVSQLTNRMERAGYNVDSMLRFEILKYHRH 779
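
The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the fractions in the first hit; `hsp_percentages` is a hypothetical helper, and the regular expression is an assumption about the line layout shown on this page rather than a general BLAST parser.

```python
import re

def hsp_percentages(stats_line):
    """Recompute percentages from 'Label = matched/total' fields."""
    out = {}
    for label, matched, total in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        out[label] = round(100 * int(matched) / int(total), 2)
    return out

line = "Identity = 394/787 (50.06%), Positives = 533/787 (67.73%), Query Frame = 0"
print(hsp_percentages(line))  # {'Identity': 50.06, 'Positives': 67.73}
```

Fields without a `matched/total` fraction, such as the query frame, are skipped.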

BLAST of CmaCh02G006950 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 296.2 bits (757), Expect = 1.1e-78
Identity = 183/669 (27.35%), Positives = 328/669 (49.03%), Query Frame = 0

Query: 102 RLVKDNAFELYWKTLQELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKP 161
           + + D    L +K+LQE  D     SS  F +++++YS+  + +KA+    + +     P
Sbjct: 109 KTLDDEYASLVFKSLQETYDLCYSTSS-VFDLVVKSYSRLSLIDKALSIVHLAQAHGFMP 168

Query: 162 NIFAYNLILHVLVR-REAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVL 221
            + +YN +L   +R +     A  V+ +ML+  ++PNV TY+ILI GFC       AL L
Sbjct: 169 GVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTL 228

Query: 222 FDEMTDRDVLPNEITYSIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCK 281
           FD+M  +  LPN +TY+ ++ G C+ +KIDD  +L   M   G  P++I+YNV++NG C+
Sbjct: 229 FDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCR 288

Query: 282 LGYFDEAFALLRSFEKDGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVIL 341
            G   E   +L    + G+ L    Y+ LI G  +   + +A + + +  R  + P VI 
Sbjct: 289 EGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVIT 348

Query: 342 YTIMIQGLCQEGRVNEALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEIS 401
           YT +I  +C+ G +N A+  LD+M  RG  P+   Y  ++ GF   G +++A  +  E++
Sbjct: 349 YTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMN 408

Query: 402 NHDCFPDNHTYSILICGMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLK 461
           ++   P   TY+ LI G C  G +++A  V  +M++ G  P VV++++++ GFC++  + 
Sbjct: 409 DNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVD 468

Query: 462 EAHLLFYKM-EIGRKPSLFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVE 521
           EA  +  +M E G KP               T+    +++  CE     +A  L  +++ 
Sbjct: 469 EALRVKREMVEKGIKPD--------------TITYSSLIQGFCEQRRTKEACDLYEEMLR 528

Query: 522 SGVFPDIRTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDED 581
            G+ PD  TY  LIN +C   +++ A +L  +M  KG LPD +TY  LI+GL++  R  +
Sbjct: 529 VGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTRE 588

Query: 582 ALGIFEQMVKNGCKPESSVYKSIMTWSCRRKKVSLAFSVWMKYLRNF--RGWKDEKVKVV 641
           A  +  ++      P    Y +++  +C     ++ F   +  ++ F  +G   E  +V 
Sbjct: 589 AKRLLLKLFYEESVPSDVTYHTLIE-NCS----NIEFKSVVSLIKGFCMKGMMTEADQVF 648

Query: 642 EESFDKGDLEKAISRIIEMDLNSKDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRI 701
           E    K               N K  D   Y I + G C+AG + +A+ ++  +     +
Sbjct: 649 ESMLGK---------------NHKP-DGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFL 708

Query: 702 ISSASCVMLIGGLCVEGKLDLAVEVFLYTLETGTMLMPRICNQLLRHLHLEDRKDHAFVL 761
           + + + + L+  L  EGK++    V ++ L +  +        L+   H E   D    +
Sbjct: 709 LHTVTVIALVKALHKEGKVNELNSVIVHVLRSCELSEAEQAKVLVEINHREGNMDVVLDV 741

Query: 762 IRRMEAFGY 767
           +  M   G+
Sbjct: 769 LAEMAKDGF 741

BLAST of CmaCh02G006950 vs. ExPASy Swiss-Prot
Match: Q9LQ14 (Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g62930 PE=2 SV=2)

HSP 1 Score: 293.9 bits (751), Expect = 5.6e-78
Identity = 167/515 (32.43%), Positives = 278/515 (53.98%), Query Frame = 0

Query: 95  SQDLIIDRLVKDNAFELYWKTLQELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMM 154
           S+++++D L  D+A +L+ + +Q    S    S   F+ L+ A +K    +  +     M
Sbjct: 52  SRNVLLD-LKLDDAVDLFGEMVQ----SRPLPSIVEFNKLLSAIAKMNKFDLVISLGERM 111

Query: 155 KDFECKPNIFAYNLILHVLVRREAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKT 214
           ++     ++++YN++++   RR    LALAV  +M+K    P++VT S L++G+C   + 
Sbjct: 112 QNLRISYDLYSYNILINCFCRRSQLPLALAVLGKMMKLGYEPDIVTLSSLLNGYCHGKRI 171

Query: 215 QEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVL 274
            EA+ L D+M   +  PN +T++ ++ GL    K  +A  L  +M A GC PD+ TY  +
Sbjct: 172 SEAVALVDQMFVMEYQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTV 231

Query: 275 LNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNV 334
           +NG CK G  D A +LL+  EK      V  Y+ +ID L   +  ++A   + +   K +
Sbjct: 232 VNGLCKRGDIDLALSLLKKMEKGKIEADVVIYTTIIDALCNYKNVNDALNLFTEMDNKGI 291

Query: 335 EPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQS 394
            P+V+ Y  +I+ LC  GR ++A  LL +M ER  +P+   ++A+I  F   G L +A+ 
Sbjct: 292 RPNVVTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEK 351

Query: 395 LRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFC 454
           L  E+      PD  TYS LI G C +  +DEA+H+F  M    C P+VVT+N+LI GFC
Sbjct: 352 LYDEMIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFC 411

Query: 455 KAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLL 514
           KA +++E   LF +M          R L G      TV    +++ L ++G    A K+ 
Sbjct: 412 KAKRVEEGMELFREMS--------QRGLVG-----NTVTYNTLIQGLFQAGDCDMAQKIF 471

Query: 515 MQLVESGVFPDIRTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRV 574
            ++V  GV PDI TY+IL++G CK   ++ A  +F+ +Q     PD  TY  +I+G+ + 
Sbjct: 472 KKMVSDGVPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKA 531

Query: 575 GRDEDALGIFEQMVKNGCKPESSVYKSIMTWSCRR 610
           G+ ED   +F  +   G KP   +Y ++++  CR+
Sbjct: 532 GKVEDGWDLFCSLSLKGVKPNVIIYTTMISGFCRK 548

BLAST of CmaCh02G006950 vs. ExPASy Swiss-Prot
Match: Q9SXD1 (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 292.7 bits (748), Expect = 1.3e-77
Identity = 165/497 (33.20%), Positives = 263/497 (52.92%), Query Frame = 0

Query: 131 FSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYNQML 190
           FS L+ A +K    +  +     M++     N + Y+++++   RR    LALAV  +M+
Sbjct: 84  FSKLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYSILINCFCRRSQLPLALAVLGKMM 143

Query: 191 KCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKID 250
           K    PN+VT S L++G+C + +  EA+ L D+M      PN +T++ ++ GL    K  
Sbjct: 144 KLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTGYQPNTVTFNTLIHGLFLHNKAS 203

Query: 251 DAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLI 310
           +A  L  +M A GC PD++TY V++NG CK G  D AF LL   E+     GV  Y+ +I
Sbjct: 204 EAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNTII 263

Query: 311 DGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFS 370
           DGL + +  D+A   +++   K + P+V+ Y+ +I  LC  GR ++A  LL +M ER  +
Sbjct: 264 DGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWSDASRLLSDMIERKIN 323

Query: 371 PDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHV 430
           PD   ++A+I  F   G L +A+ L  E+      P   TYS LI G C +  +DEA+ +
Sbjct: 324 PDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCMHDRLDEAKQM 383

Query: 431 FNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKME----IGRKPSLFLRLLQGAN 490
           F  M    C P VVT+N+LI GFCK  +++E   +F +M     +G   +  + L+QG  
Sbjct: 384 FEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFREMSQRGLVGNTVTYNI-LIQGLF 443

Query: 491 KVLGTVDL--------------------QVMLEQLCESGLIHKAYKLLMQLVESGVFPDI 550
           +  G  D+                      +L+ LC++G + KA  +   L  S + P I
Sbjct: 444 QA-GDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGKLEKAMVVFEYLQRSKMEPTI 503

Query: 551 RTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQ 604
            TYNI+I G CK   ++  + LF ++ LKG  PD + Y T+I G  R G  E+A  +F++
Sbjct: 504 YTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNTMISGFCRKGSKEEADALFKE 563

BLAST of CmaCh02G006950 vs. ExPASy Swiss-Prot
Match: Q9CAN5 (Pentatricopeptide repeat-containing protein At1g63080, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g63080 PE=2 SV=1)

HSP 1 Score: 290.0 bits (741), Expect = 8.1e-77
Identity = 163/496 (32.86%), Positives = 259/496 (52.22%), Query Frame = 0

Query: 131 FSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYNQML 190
           FS L+ A +K    +  +     M+      N++ YN++++ L RR     ALA+  +M+
Sbjct: 68  FSKLLSAIAKMKKFDLVISFGEKMEILGVSHNLYTYNIMINCLCRRSQLSFALAILGKMM 127

Query: 191 KCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKID 250
           K    P++VT + L++GFC  ++  EA+ L D+M +    P+ +T++ ++ GL Q  K  
Sbjct: 128 KLGYGPSIVTLNSLLNGFCHGNRISEAVALVDQMVEMGYQPDTVTFTTLVHGLFQHNKAS 187

Query: 251 DAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLI 310
           +A  L  +M   GC PD++TY  ++NG CK G  D A  LL   EK      V  YS +I
Sbjct: 188 EAVALVERMVVKGCQPDLVTYGAVINGLCKRGEPDLALNLLNKMEKGKIEADVVIYSTVI 247

Query: 311 DGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFS 370
           D L + R  D+A   + +   K + PDV  Y+ +I  LC  GR ++A  LL +M ER  +
Sbjct: 248 DSLCKYRHVDDALNLFTEMDNKGIRPDVFTYSSLISCLCNYGRWSDASRLLSDMLERKIN 307

Query: 371 PDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHV 430
           P+   +N++I  F   G L +A+ L  E+      P+  TY+ LI G C +  +DEAQ +
Sbjct: 308 PNVVTFNSLIDAFAKEGKLIEAEKLFDEMIQRSIDPNIVTYNSLINGFCMHDRLDEAQQI 367

Query: 431 FNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKME----IGRKPSLFLRLLQGAN 490
           F  M    CLP VVT+N+LI+GFCKA K+ +   LF  M     +G   + +  L+ G  
Sbjct: 368 FTLMVSKDCLPDVVTYNTLINGFCKAKKVVDGMELFRDMSRRGLVGNTVT-YTTLIHGFF 427

Query: 491 KVLGTVDLQVMLEQ-------------------LCESGLIHKAYKLLMQLVESGVFPDIR 550
           +     + Q++ +Q                   LC++G + KA  +   L +S + PDI 
Sbjct: 428 QASDCDNAQMVFKQMVSDGVHPNIMTYNTLLDGLCKNGKLEKAMVVFEYLQKSKMEPDIY 487

Query: 551 TYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQM 604
           TYNI+  G CK   ++  + LF  + LKG  PD I Y T+I G  + G  E+A  +F +M
Sbjct: 488 TYNIMSEGMCKAGKVEDGWDLFCSLSLKGVKPDVIAYNTMISGFCKKGLKEEAYTLFIKM 547

BLAST of CmaCh02G006950 vs. TAIR 10
Match: AT1G79540.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 777.3 bits (2006), Expect = 1.2e-224
Identity = 394/787 (50.06%), Positives = 533/787 (67.73%), Query Frame = 0

Query: 7   FLRPVVTYLVPKPPWF-HLFHTPTDPIATSNEVSTIIETVDPIEDALETIAPHISSDVIT 66
           F R V+ +   KP W    + +       S EV +I+    PIE ALE + P +S ++IT
Sbjct: 6   FFRSVIQF-YSKPSWMQRSYSSGNAEFNISGEVISILAKKKPIEPALEPLVPFLSKNIIT 65

Query: 67  SVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELKDSSTE 126
           SVI+++ N +LGFR FIW+ RR  L    S  L+ID L +DN  +LYW+TL+ELK     
Sbjct: 66  SVIKDEVNRQLGFRFFIWASRRERLRSRESFGLVIDMLSEDNGCDLYWQTLEELKSGGVS 125

Query: 127 ISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREA-FLLALA 186
           + S  F VLI AY+K GM EKAV+SFG MK+F+C+P++F YN+IL V++R E  F+LA A
Sbjct: 126 VDSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILRVMMREEVFFMLAFA 185

Query: 187 VYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLC 246
           VYN+MLKCN +PN+ T+ IL+ G  K  +T +A  +FD+MT R + PN +TY+I++SGLC
Sbjct: 186 VYNEMLKCNCSPNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISGLC 245

Query: 247 QAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVK 306
           Q    DDA++LF +M+ SG  PD + +N LL+GFCKLG   EAF LLR FEKDG +LG++
Sbjct: 246 QRGSADDARKLFYEMQTSGNYPDSVAHNALLDGFCKLGRMVEAFELLRLFEKDGFVLGLR 305

Query: 307 GYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEM 366
           GYS LIDGLFRARRY +A   Y    +KN++PD+ILYTI+IQGL + G++ +AL LL  M
Sbjct: 306 GYSSLIDGLFRARRYTQAFELYANMLKKNIKPDIILYTILIQGLSKAGKIEDALKLLSSM 365

Query: 367 TERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLI 426
             +G SPDT CYNAVI+  C  GLL++ +SL+LE+S  + FPD  T++ILIC MC+NGL+
Sbjct: 366 PSKGISPDTYCYNAVIKALCGRGLLEEGRSLQLEMSETESFPDACTHTILICSMCRNGLV 425

Query: 427 DEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQG 486
            EA+ +F E+EK GC PSV TFN+LIDG CK+G+LKEA LL +KME+GR  SLFLRL   
Sbjct: 426 REAEEIFTEIEKSGCSPSVATFNALIDGLCKSGELKEARLLLHKMEVGRPASLFLRLSHS 485

Query: 487 ANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDG 546
            N+   T         + ESG I KAY+ L    ++G  PDI +YN+LINGFC+  +IDG
Sbjct: 486 GNRSFDT---------MVESGSILKAYRDLAHFADTGSSPDIVSYNVLINGFCRAGDIDG 545

Query: 547 AFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMT 606
           A KL   +QLKG  PDS+TY TLI+GLHRVGR+E+A  +F    K+  +   +VY+S+MT
Sbjct: 546 ALKLLNVLQLKGLSPDSVTYNTLINGLHRVGREEEAFKLF--YAKDDFRHSPAVYRSLMT 605

Query: 607 WSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNSKDFD 666
           WSCR++KV +AF++WMKYL+      DE    +E+ F +G+ E+A+ R+IE+D    +  
Sbjct: 606 WSCRKRKVLVAFNLWMKYLKKISCLDDETANEIEQCFKEGETERALRRLIELDTRKDELT 665

Query: 667 LAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAVEVFL 726
           L PYTI+LIGLCQ+GR  EA  +FSVL++ K +++  SCV LI GLC   +LD A+EVFL
Sbjct: 666 LGPYTIWLIGLCQSGRFHEALMVFSVLREKKILVTPPSCVKLIHGLCKREQLDAAIEVFL 725

Query: 727 YTLETGTMLMPRICNQLLRHLHLEDRKDHAFV--LIRRMEAFGYDMNAYL-------HHS 783
           YTL+    LMPR+CN LL  L LE  +    V  L  RME  GY++++ L       H  
Sbjct: 726 YTLDNNFKLMPRVCNYLLSSL-LESTEKMEIVSQLTNRMERAGYNVDSMLRFEILKYHRH 779

BLAST of CmaCh02G006950 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 296.2 bits (757), Expect = 8.0e-80
Identity = 183/669 (27.35%), Positives = 328/669 (49.03%), Query Frame = 0

Query: 102 RLVKDNAFELYWKTLQELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKP 161
           + + D    L +K+LQE  D     SS  F +++++YS+  + +KA+    + +     P
Sbjct: 109 KTLDDEYASLVFKSLQETYDLCYSTSS-VFDLVVKSYSRLSLIDKALSIVHLAQAHGFMP 168

Query: 162 NIFAYNLILHVLVR-REAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVL 221
            + +YN +L   +R +     A  V+ +ML+  ++PNV TY+ILI GFC       AL L
Sbjct: 169 GVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTL 228

Query: 222 FDEMTDRDVLPNEITYSIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCK 281
           FD+M  +  LPN +TY+ ++ G C+ +KIDD  +L   M   G  P++I+YNV++NG C+
Sbjct: 229 FDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCR 288

Query: 282 LGYFDEAFALLRSFEKDGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVIL 341
            G   E   +L    + G+ L    Y+ LI G  +   + +A + + +  R  + P VI 
Sbjct: 289 EGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVIT 348

Query: 342 YTIMIQGLCQEGRVNEALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEIS 401
           YT +I  +C+ G +N A+  LD+M  RG  P+   Y  ++ GF   G +++A  +  E++
Sbjct: 349 YTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMN 408

Query: 402 NHDCFPDNHTYSILICGMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLK 461
           ++   P   TY+ LI G C  G +++A  V  +M++ G  P VV++++++ GFC++  + 
Sbjct: 409 DNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVD 468

Query: 462 EAHLLFYKM-EIGRKPSLFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVE 521
           EA  +  +M E G KP               T+    +++  CE     +A  L  +++ 
Sbjct: 469 EALRVKREMVEKGIKPD--------------TITYSSLIQGFCEQRRTKEACDLYEEMLR 528

Query: 522 SGVFPDIRTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDED 581
            G+ PD  TY  LIN +C   +++ A +L  +M  KG LPD +TY  LI+GL++  R  +
Sbjct: 529 VGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTRE 588

Query: 582 ALGIFEQMVKNGCKPESSVYKSIMTWSCRRKKVSLAFSVWMKYLRNF--RGWKDEKVKVV 641
           A  +  ++      P    Y +++  +C     ++ F   +  ++ F  +G   E  +V 
Sbjct: 589 AKRLLLKLFYEESVPSDVTYHTLIE-NCS----NIEFKSVVSLIKGFCMKGMMTEADQVF 648

Query: 642 EESFDKGDLEKAISRIIEMDLNSKDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRI 701
           E    K               N K  D   Y I + G C+AG + +A+ ++  +     +
Sbjct: 649 ESMLGK---------------NHKP-DGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFL 708

Query: 702 ISSASCVMLIGGLCVEGKLDLAVEVFLYTLETGTMLMPRICNQLLRHLHLEDRKDHAFVL 761
           + + + + L+  L  EGK++    V ++ L +  +        L+   H E   D    +
Sbjct: 709 LHTVTVIALVKALHKEGKVNELNSVIVHVLRSCELSEAEQAKVLVEINHREGNMDVVLDV 741

Query: 762 IRRMEAFGY 767
           +  M   G+
Sbjct: 769 LAEMAKDGF 741

BLAST of CmaCh02G006950 vs. TAIR 10
Match: AT1G62930.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 293.9 bits (751), Expect = 4.0e-79
Identity = 167/515 (32.43%), Positives = 278/515 (53.98%), Query Frame = 0

Query: 95  SQDLIIDRLVKDNAFELYWKTLQELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMM 154
           S+++++D L  D+A +L+ + +Q    S    S   F+ L+ A +K    +  +     M
Sbjct: 52  SRNVLLD-LKLDDAVDLFGEMVQ----SRPLPSIVEFNKLLSAIAKMNKFDLVISLGERM 111

Query: 155 KDFECKPNIFAYNLILHVLVRREAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKT 214
           ++     ++++YN++++   RR    LALAV  +M+K    P++VT S L++G+C   + 
Sbjct: 112 QNLRISYDLYSYNILINCFCRRSQLPLALAVLGKMMKLGYEPDIVTLSSLLNGYCHGKRI 171

Query: 215 QEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVL 274
            EA+ L D+M   +  PN +T++ ++ GL    K  +A  L  +M A GC PD+ TY  +
Sbjct: 172 SEAVALVDQMFVMEYQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTV 231

Query: 275 LNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNV 334
           +NG CK G  D A +LL+  EK      V  Y+ +ID L   +  ++A   + +   K +
Sbjct: 232 VNGLCKRGDIDLALSLLKKMEKGKIEADVVIYTTIIDALCNYKNVNDALNLFTEMDNKGI 291

Query: 335 EPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQS 394
            P+V+ Y  +I+ LC  GR ++A  LL +M ER  +P+   ++A+I  F   G L +A+ 
Sbjct: 292 RPNVVTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEK 351

Query: 395 LRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFC 454
           L  E+      PD  TYS LI G C +  +DEA+H+F  M    C P+VVT+N+LI GFC
Sbjct: 352 LYDEMIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFC 411

Query: 455 KAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLL 514
           KA +++E   LF +M          R L G      TV    +++ L ++G    A K+ 
Sbjct: 412 KAKRVEEGMELFREMS--------QRGLVG-----NTVTYNTLIQGLFQAGDCDMAQKIF 471

Query: 515 MQLVESGVFPDIRTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRV 574
            ++V  GV PDI TY+IL++G CK   ++ A  +F+ +Q     PD  TY  +I+G+ + 
Sbjct: 472 KKMVSDGVPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKA 531

Query: 575 GRDEDALGIFEQMVKNGCKPESSVYKSIMTWSCRR 610
           G+ ED   +F  +   G KP   +Y ++++  CR+
Sbjct: 532 GKVEDGWDLFCSLSLKGVKPNVIIYTTMISGFCRK 548

BLAST of CmaCh02G006950 vs. TAIR 10
Match: AT1G62670.1 (rna processing factor 2 )

HSP 1 Score: 292.7 bits (748), Expect = 8.9e-79
Identity = 165/497 (33.20%), Positives = 263/497 (52.92%), Query Frame = 0

Query: 131 FSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYNQML 190
           FS L+ A +K    +  +     M++     N + Y+++++   RR    LALAV  +M+
Sbjct: 84  FSKLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYSILINCFCRRSQLPLALAVLGKMM 143

Query: 191 KCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKID 250
           K    PN+VT S L++G+C + +  EA+ L D+M      PN +T++ ++ GL    K  
Sbjct: 144 KLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTGYQPNTVTFNTLIHGLFLHNKAS 203

Query: 251 DAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLI 310
           +A  L  +M A GC PD++TY V++NG CK G  D AF LL   E+     GV  Y+ +I
Sbjct: 204 EAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNTII 263

Query: 311 DGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFS 370
           DGL + +  D+A   +++   K + P+V+ Y+ +I  LC  GR ++A  LL +M ER  +
Sbjct: 264 DGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWSDASRLLSDMIERKIN 323

Query: 371 PDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHV 430
           PD   ++A+I  F   G L +A+ L  E+      P   TYS LI G C +  +DEA+ +
Sbjct: 324 PDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCMHDRLDEAKQM 383

Query: 431 FNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKME----IGRKPSLFLRLLQGAN 490
           F  M    C P VVT+N+LI GFCK  +++E   +F +M     +G   +  + L+QG  
Sbjct: 384 FEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEGMEVFREMSQRGLVGNTVTYNI-LIQGLF 443

Query: 491 KVLGTVDL--------------------QVMLEQLCESGLIHKAYKLLMQLVESGVFPDI 550
           +  G  D+                      +L+ LC++G + KA  +   L  S + P I
Sbjct: 444 QA-GDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGKLEKAMVVFEYLQRSKMEPTI 503

Query: 551 RTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQ 604
            TYNI+I G CK   ++  + LF ++ LKG  PD + Y T+I G  R G  E+A  +F++
Sbjct: 504 YTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNTMISGFCRKGSKEEADALFKE 563

BLAST of CmaCh02G006950 vs. TAIR 10
Match: AT1G63080.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 290.0 bits (741), Expect = 5.8e-78
Identity = 163/496 (32.86%), Positives = 259/496 (52.22%), Query Frame = 0

Query: 131 FSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYNQML 190
           FS L+ A +K    +  +     M+      N++ YN++++ L RR     ALA+  +M+
Sbjct: 68  FSKLLSAIAKMKKFDLVISFGEKMEILGVSHNLYTYNIMINCLCRRSQLSFALAILGKMM 127

Query: 191 KCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKID 250
           K    P++VT + L++GFC  ++  EA+ L D+M +    P+ +T++ ++ GL Q  K  
Sbjct: 128 KLGYGPSIVTLNSLLNGFCHGNRISEAVALVDQMVEMGYQPDTVTFTTLVHGLFQHNKAS 187

Query: 251 DAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLI 310
           +A  L  +M   GC PD++TY  ++NG CK G  D A  LL   EK      V  YS +I
Sbjct: 188 EAVALVERMVVKGCQPDLVTYGAVINGLCKRGEPDLALNLLNKMEKGKIEADVVIYSTVI 247

Query: 311 DGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFS 370
           D L + R  D+A   + +   K + PDV  Y+ +I  LC  GR ++A  LL +M ER  +
Sbjct: 248 DSLCKYRHVDDALNLFTEMDNKGIRPDVFTYSSLISCLCNYGRWSDASRLLSDMLERKIN 307

Query: 371 PDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHV 430
           P+   +N++I  F   G L +A+ L  E+      P+  TY+ LI G C +  +DEAQ +
Sbjct: 308 PNVVTFNSLIDAFAKEGKLIEAEKLFDEMIQRSIDPNIVTYNSLINGFCMHDRLDEAQQI 367

Query: 431 FNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKME----IGRKPSLFLRLLQGAN 490
           F  M    CLP VVT+N+LI+GFCKA K+ +   LF  M     +G   + +  L+ G  
Sbjct: 368 FTLMVSKDCLPDVVTYNTLINGFCKAKKVVDGMELFRDMSRRGLVGNTVT-YTTLIHGFF 427

Query: 491 KVLGTVDLQVMLEQ-------------------LCESGLIHKAYKLLMQLVESGVFPDIR 550
           +     + Q++ +Q                   LC++G + KA  +   L +S + PDI 
Sbjct: 428 QASDCDNAQMVFKQMVSDGVHPNIMTYNTLLDGLCKNGKLEKAMVVFEYLQKSKMEPDIY 487

Query: 551 TYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQM 604
           TYNI+  G CK   ++  + LF  + LKG  PD I Y T+I G  + G  E+A  +F +M
Sbjct: 488 TYNIMSEGMCKAGKVEDGWDLFCSLSLKGVKPDVIAYNTMISGFCKKGLKEEAYTLFIKM 547
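
The percentages in the HSP header for AT1G63080.1 can be recomputed from the raw match counts as a quick sanity check. The sketch below is illustrative only: the function name `hsp_percentages` and the regular expression are assumptions of this example, not part of the BLAST output or of the database.

```python
import re

# HSP summary line as reported for the AT1G63080.1 match.
hsp_line = "Identity = 163/496 (32.86%), Positives = 259/496 (52.22%), Query Frame = 0"

def hsp_percentages(line):
    """Recompute each percentage from its 'matches/length' pair."""
    result = {}
    # Only fields written as 'Label = N/M' are captured; 'Query Frame = 0' is skipped.
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

percentages = hsp_percentages(hsp_line)
print(percentages)
```

Both recomputed values agree with the figures printed in the header (32.86% and 52.22% over an alignment length of 496).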

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
Q9SAJ5       1.7e-223  50.06         Pentatricopeptide repeat-containing protein At1g79540 OS=Arabidopsis thaliana OX...
Q9FIX3       1.1e-78   27.35         Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9LQ14       5.6e-78   32.43         Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidop...
Q9SXD1       1.3e-77   33.20         Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop...
Q9CAN5       8.1e-77   32.86         Pentatricopeptide repeat-containing protein At1g63080, mitochondrial OS=Arabidop...

Match Name   E-value   Identity (%)  Description
AT1G79540.1  1.2e-224  50.06         Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1  8.0e-80   27.35         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G62930.1  4.0e-79   32.43         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G62670.1  8.9e-79   33.20         rna processing factor 2
AT1G63080.1  5.8e-78   32.86         Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                                    Source       Source Term  Source Description
    Alignment
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756    TIGR00756
    coord: 199..233  e-value: 5.0E-9   score: 33.8
    coord: 409..442  e-value: 2.8E-9   score: 34.6
    coord: 375..407  e-value: 0.0016   score: 16.5
    coord: 340..373  e-value: 6.2E-11  score: 39.8
    coord: 130..163  e-value: 4.9E-5   score: 21.2
    coord: 269..299  e-value: 6.6E-7   score: 27.1
    coord: 528..560  e-value: 9.5E-10  score: 36.1
    coord: 562..594  e-value: 7.1E-7   score: 27.0
    coord: 444..470  e-value: 1.1E-6   score: 26.5
    coord: 234..268  e-value: 4.3E-8   score: 30.8
    coord: 306..338  e-value: 1.9E-6   score: 25.7
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041      PPR_2
    coord: 406..455  e-value: 1.0E-16  score: 60.8
    coord: 336..385  e-value: 4.2E-16  score: 58.9
    coord: 161..210  e-value: 5.3E-15  score: 55.4
    coord: 524..572  e-value: 4.2E-17  score: 62.1
    coord: 238..280  e-value: 5.9E-13  score: 48.8
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535      PPR
    coord: 668..692  e-value: 0.035    score: 14.4
    coord: 704..723  e-value: 0.58     score: 10.5
    coord: 306..333  e-value: 0.019    score: 15.2
    coord: 130..158  e-value: 0.0015   score: 18.6
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 197..231  score: 12.649398
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 525..559  score: 12.846701
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 337..371  score: 13.898986
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 127..161  score: 10.205028
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 560..594  score: 12.934392
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 442..476  score: 10.500983
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 302..336  score: 9.426776
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 232..266  score: 12.978237
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 162..196  score: 9.119859
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 372..406  score: 10.270796
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 267..301  score: 11.794416
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375      PPR
    coord: 407..441  score: 13.471496
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10   Tetratricopeptide repeat domain
    coord: 64..174   e-value: 7.7E-11  score: 43.6
    coord: 282..395  e-value: 1.9E-29  score: 104.3
    coord: 175..281  e-value: 8.5E-36  score: 125.0
    coord: 473..617  e-value: 9.7E-32  score: 111.8
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10   Tetratricopeptide repeat domain
    coord: 396..472  e-value: 3.9E-24  score: 87.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10   Tetratricopeptide repeat domain
    coord: 633..813  e-value: 1.3E-12  score: 49.8
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452        TPR-like
    coord: 162..622
None       No IPR available                                   PANTHER      PTHR47934    PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN PET309, MITOCHONDRIAL
    coord: 641..769
    coord: 98..473
    coord: 305..624
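
The PROSITE PS51375 hits in the table above can be checked for a tandem arrangement, which is typical of PPR proteins. The following is a minimal sketch, not part of the InterPro analysis: the helper name `tandem_runs` is hypothetical, the coordinates are copied by hand from the table, and they are assumed to be inclusive residue positions.

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro table.
# Assumption: coordinates are inclusive (start, end) residue positions.
hits = [(197, 231), (525, 559), (337, 371), (127, 161), (560, 594), (442, 476),
        (302, 336), (232, 266), (162, 196), (372, 406), (267, 301), (407, 441)]

def tandem_runs(coords):
    """Group hits into runs where each hit starts right after the previous one ends."""
    runs = []
    for start, end in sorted(coords):
        if runs and start == runs[-1][-1][1] + 1:
            runs[-1].append((start, end))   # directly adjacent: extend the current run
        else:
            runs.append([(start, end)])     # gap before this hit: start a new run
    return runs

runs = tandem_runs(hits)
lengths = {end - start + 1 for start, end in hits}

for run in runs:
    print(f"{run[0][0]}..{run[-1][1]}: {len(run)} repeats")
print("hit lengths:", lengths)
```

Under these assumptions, every hit spans 35 residues and the hits fall into two uninterrupted blocks, 127..476 (10 repeats) and 525..594 (2 repeats).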

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh02G006950.1  CmaCh02G006950.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003729      mRNA binding
molecular_function  GO:0005515      protein binding