CmaCh02G006950 (gene) Cucurbita maxima (Rimu)

NameCmaCh02G006950
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCma_Chr02 : 4240532 .. 4242979 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGCGCCGATCAACATTTCTACGACCCGTCGTCACCTATTTAGTTCCAAAACCTCCATGGTTCCACTTATTTCATACGCCCACTGACCCAATCGCTACTTCCAATGAGGTCTCCACCATAATCGAAACTGTCGATCCCATTGAAGATGCATTGGAAACCATAGCCCCTCATATATCATCTGATGTAATTACCTCAGTCATTCAAGAACAGCCGAATGCTCGACTTGGATTTCGACTTTTTATCTGGTCGTTGAGGAGAAGGCACCTGTGCTGCAGCGCCTCGCAGGATTTGATCATTGACAGGTTAGTAAAGGACAATGCCTTTGAATTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCTTCTACTGAAATTTCATCGGACGCCTTCTCTGTATTGATTGAGGCATACTCTAAAGCCGGCATGGAAGAGAAGGCCGTCCAATCGTTTGGCATGATGAAGGATTTTGAATGTAAGCCCAATATTTTTGCTTACAATTTGATTTTGCATGTTTTGGTGCGAAGAGAAGCGTTTTTGTTAGCATTAGCGGTGTATAATCAGATGCTCAAATGTAATTTGAATCCTAATGTGGTTACTTACAGCATTTTGATTCATGGATTCTGTAAAACTAGTAAAACTCAAGAAGCCCTTGTACTCTTTGATGAAATGACTGATAGAGACGTATTGCCCAACGAGATAACCTATTCGATTATCCTTTCTGGGTTGTGTCAAGCTAAGAAAATTGATGATGCACAGAGATTGTTCATTAAGATGAGAGCTAGTGGTTGTAGTCCAGATGTAATCACTTACAATGTTTTGCTTAATGGGTTTTGTAAGTTAGGTTATTTTGATGAAGCTTTTGCATTGTTGAGATCATTTGAGAAGGATGGCCATATTCTTGGAGTCAAAGGGTACAGTTGTTTGATTGATGGCTTGTTTAGGGCTAGGAGATATGATGAAGCACATATGTGGTACCAAAAATTTTCGAGGAAAAATGTAGAGCCTGATGTTATCTTGTATACTATAATGATCCAAGGCTTATGCCAAGAAGGTCGGGTTAACGAGGCATTGGCGTTGTTGGATGAGATGACGGAAAGAGGGTTTAGTCCAGATACTACTTGTTACAATGCTGTAATTAGAGGATTTTGTGATATGGGTCTTTTGGATAAGGCCCAGTCTCTTCGACTCGAGATTTCAAACCACGACTGTTTCCCCGACAACCACACGTATTCCATTCTCATTTGTGGTATGTGTAAGAATGGGTTAATTGATGAGGCACAACATGTATTCAATGAAATGGAGAAGCTTGGATGCCTTCCTTCTGTTGTGACCTTCAATTCTCTCATTGATGGATTTTGCAAGGCTGGTAAGCTTAAGGAAGCTCATCTTTTGTTTTACAAAATGGAGATAGGGAGAAAACCTTCTTTGTTCCTTCGACTTTTGCAAGGTGCCAATAAGGTTCTTGGTACTGTCGATCTCCAAGTTATGTTGGAACAATTATGCGAGTCGGGGTTGATTCATAAGGCCTACAAGCTTCTTATGCAGCTTGTTGAGAGTGGGGTTTTTCCAGACATTAGAACTTACAACATCCTAATCAATGGATTTTGCAAGACCAACAACATCGATGGTGCTTTCAAGCTCTTCAAGGACATGCAACTTAAAGGGCGCTTACCAGATTCAATTACGTATGGAACTCTAATAGATGGGCTCCACAGAGTCGGTAGGGACGAGGATGCTCTAGGGATTTTCGAACAAATGGTAAAGAATGGGTGCAAGCCTGAGTCTTCTGTTTACAAGTCTATCATGACTTGGTCGTGTCGAAGAAAAAAGGTTTCACTCGCGTTTAGTGTTTGGATGAAGTATCTGAGGAATTTTCGTGGCTGGAAAGATGAAAAGGTCAAAGTAGTAGAGGAAAGTTTCGACAAAGGAGACCTTGAAAAGGCGATCTCGAGAATAATCGAAATGGACTTGAACTCAAAAGACTTCGACTTGGCTCCATACACCATTTTTCTCATTGGATTGTGTCAAGCAGGGAGGGTTTCTGAAGCCTTCGCAATATTTTCTGTTCTTAAGGACTTCAAAAGGATTATAAGTTCAGCAAGCTGCGTGATGTTGATTGGTGGGCTTTGCGTTGAAGGAAAACTTGACCTGGCTGTGGAAGTTTTCCTTTATACACTAGAAACAGGCACTATGTTGATGCCTAGAATTTGTAACCAACTGCTAAGGCATCTTCATTTAGAGGACAGGAAGGATCATGCTTTTGTTCTTATACGTAGAATGGAGGCTTTTGGATATGATATGAATGCTTATCTCCACCACAGTACTAAGTCACTTCTTCATGATCATTGGAAGTCATTGAAAGCTAAAGCTAGACACGAGCAGTGGTTGACGAATTCACAGCAGCAACTCCTAAATGCCACATTTCCTATGGTTGAAAGTAATTAG

mRNA sequence

ATGAAGCGCCGATCAACATTTCTACGACCCGTCGTCACCTATTTAGTTCCAAAACCTCCATGGTTCCACTTATTTCATACGCCCACTGACCCAATCGCTACTTCCAATGAGGTCTCCACCATAATCGAAACTGTCGATCCCATTGAAGATGCATTGGAAACCATAGCCCCTCATATATCATCTGATGTAATTACCTCAGTCATTCAAGAACAGCCGAATGCTCGACTTGGATTTCGACTTTTTATCTGGTCGTTGAGGAGAAGGCACCTGTGCTGCAGCGCCTCGCAGGATTTGATCATTGACAGGTTAGTAAAGGACAATGCCTTTGAATTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCTTCTACTGAAATTTCATCGGACGCCTTCTCTGTATTGATTGAGGCATACTCTAAAGCCGGCATGGAAGAGAAGGCCGTCCAATCGTTTGGCATGATGAAGGATTTTGAATGTAAGCCCAATATTTTTGCTTACAATTTGATTTTGCATGTTTTGGTGCGAAGAGAAGCGTTTTTGTTAGCATTAGCGGTGTATAATCAGATGCTCAAATGTAATTTGAATCCTAATGTGGTTACTTACAGCATTTTGATTCATGGATTCTGTAAAACTAGTAAAACTCAAGAAGCCCTTGTACTCTTTGATGAAATGACTGATAGAGACGTATTGCCCAACGAGATAACCTATTCGATTATCCTTTCTGGGTTGTGTCAAGCTAAGAAAATTGATGATGCACAGAGATTGTTCATTAAGATGAGAGCTAGTGGTTGTAGTCCAGATGTAATCACTTACAATGTTTTGCTTAATGGGTTTTGTAAGTTAGGTTATTTTGATGAAGCTTTTGCATTGTTGAGATCATTTGAGAAGGATGGCCATATTCTTGGAGTCAAAGGGTACAGTTGTTTGATTGATGGCTTGTTTAGGGCTAGGAGATATGATGAAGCACATATGTGGTACCAAAAATTTTCGAGGAAAAATGTAGAGCCTGATGTTATCTTGTATACTATAATGATCCAAGGCTTATGCCAAGAAGGTCGGGTTAACGAGGCATTGGCGTTGTTGGATGAGATGACGGAAAGAGGGTTTAGTCCAGATACTACTTGTTACAATGCTGTAATTAGAGGATTTTGTGATATGGGTCTTTTGGATAAGGCCCAGTCTCTTCGACTCGAGATTTCAAACCACGACTGTTTCCCCGACAACCACACGTATTCCATTCTCATTTGTGGTATGTGTAAGAATGGGTTAATTGATGAGGCACAACATGTATTCAATGAAATGGAGAAGCTTGGATGCCTTCCTTCTGTTGTGACCTTCAATTCTCTCATTGATGGATTTTGCAAGGCTGGTAAGCTTAAGGAAGCTCATCTTTTGTTTTACAAAATGGAGATAGGGAGAAAACCTTCTTTGTTCCTTCGACTTTTGCAAGGTGCCAATAAGGTTCTTGGTACTGTCGATCTCCAAGTTATGTTGGAACAATTATGCGAGTCGGGGTTGATTCATAAGGCCTACAAGCTTCTTATGCAGCTTGTTGAGAGTGGGGTTTTTCCAGACATTAGAACTTACAACATCCTAATCAATGGATTTTGCAAGACCAACAACATCGATGGTGCTTTCAAGCTCTTCAAGGACATGCAACTTAAAGGGCGCTTACCAGATTCAATTACGTATGGAACTCTAATAGATGGGCTCCACAGAGTCGGTAGGGACGAGGATGCTCTAGGGATTTTCGAACAAATGGTAAAGAATGGGTGCAAGCCTGAGTCTTCTGTTTACAAGTCTATCATGACTTGGTCGTGTCGAAGAAAAAAGGTTTCACTCGCGTTTAGTGTTTGGATGAAGTATCTGAGGAATTTTCGTGGCTGGAAAGATGAAAAGGTCAAAGTAGTAGAGGAAAGTTTCGACAAAGGAGACCTTGAAAAGGCGATCTCGAGAATAATCGAAATGGACTTGAACTCAAAAGACTTCGACTTGGCTCCATACACCATTTTTCTCATTGGATTGTGTCAAGCAGGGAGGGTTTCTGAAGCCTTCGCAATATTTTCTGTTCTTAAGGACTTCAAAAGGATTATAAGTTCAGCAAGCTGCGTGATGTTGATTGGTGGGCTTTGCGTTGAAGGAAAACTTGACCTGGCTGTGGAAGTTTTCCTTTATACACTAGAAACAGGCACTATGTTGATGCCTAGAATTTGTAACCAACTGCTAAGGCATCTTCATTTAGAGGACAGGAAGGATCATGCTTTTGTTCTTATACGTAGAATGGAGGCTTTTGGATATGATATGAATGCTTATCTCCACCACAGTACTAAGTCACTTCTTCATGATCATTGGAAGTCATTGAAAGCTAAAGCTAGACACGAGCAGTGGTTGACGAATTCACAGCAGCAACTCCTAAATGCCACATTTCCTATGGTTGAAAGTAATTAG

Coding sequence (CDS)

ATGAAGCGCCGATCAACATTTCTACGACCCGTCGTCACCTATTTAGTTCCAAAACCTCCATGGTTCCACTTATTTCATACGCCCACTGACCCAATCGCTACTTCCAATGAGGTCTCCACCATAATCGAAACTGTCGATCCCATTGAAGATGCATTGGAAACCATAGCCCCTCATATATCATCTGATGTAATTACCTCAGTCATTCAAGAACAGCCGAATGCTCGACTTGGATTTCGACTTTTTATCTGGTCGTTGAGGAGAAGGCACCTGTGCTGCAGCGCCTCGCAGGATTTGATCATTGACAGGTTAGTAAAGGACAATGCCTTTGAATTATATTGGAAAACTCTTCAAGAGCTTAAGGATTCTTCTACTGAAATTTCATCGGACGCCTTCTCTGTATTGATTGAGGCATACTCTAAAGCCGGCATGGAAGAGAAGGCCGTCCAATCGTTTGGCATGATGAAGGATTTTGAATGTAAGCCCAATATTTTTGCTTACAATTTGATTTTGCATGTTTTGGTGCGAAGAGAAGCGTTTTTGTTAGCATTAGCGGTGTATAATCAGATGCTCAAATGTAATTTGAATCCTAATGTGGTTACTTACAGCATTTTGATTCATGGATTCTGTAAAACTAGTAAAACTCAAGAAGCCCTTGTACTCTTTGATGAAATGACTGATAGAGACGTATTGCCCAACGAGATAACCTATTCGATTATCCTTTCTGGGTTGTGTCAAGCTAAGAAAATTGATGATGCACAGAGATTGTTCATTAAGATGAGAGCTAGTGGTTGTAGTCCAGATGTAATCACTTACAATGTTTTGCTTAATGGGTTTTGTAAGTTAGGTTATTTTGATGAAGCTTTTGCATTGTTGAGATCATTTGAGAAGGATGGCCATATTCTTGGAGTCAAAGGGTACAGTTGTTTGATTGATGGCTTGTTTAGGGCTAGGAGATATGATGAAGCACATATGTGGTACCAAAAATTTTCGAGGAAAAATGTAGAGCCTGATGTTATCTTGTATACTATAATGATCCAAGGCTTATGCCAAGAAGGTCGGGTTAACGAGGCATTGGCGTTGTTGGATGAGATGACGGAAAGAGGGTTTAGTCCAGATACTACTTGTTACAATGCTGTAATTAGAGGATTTTGTGATATGGGTCTTTTGGATAAGGCCCAGTCTCTTCGACTCGAGATTTCAAACCACGACTGTTTCCCCGACAACCACACGTATTCCATTCTCATTTGTGGTATGTGTAAGAATGGGTTAATTGATGAGGCACAACATGTATTCAATGAAATGGAGAAGCTTGGATGCCTTCCTTCTGTTGTGACCTTCAATTCTCTCATTGATGGATTTTGCAAGGCTGGTAAGCTTAAGGAAGCTCATCTTTTGTTTTACAAAATGGAGATAGGGAGAAAACCTTCTTTGTTCCTTCGACTTTTGCAAGGTGCCAATAAGGTTCTTGGTACTGTCGATCTCCAAGTTATGTTGGAACAATTATGCGAGTCGGGGTTGATTCATAAGGCCTACAAGCTTCTTATGCAGCTTGTTGAGAGTGGGGTTTTTCCAGACATTAGAACTTACAACATCCTAATCAATGGATTTTGCAAGACCAACAACATCGATGGTGCTTTCAAGCTCTTCAAGGACATGCAACTTAAAGGGCGCTTACCAGATTCAATTACGTATGGAACTCTAATAGATGGGCTCCACAGAGTCGGTAGGGACGAGGATGCTCTAGGGATTTTCGAACAAATGGTAAAGAATGGGTGCAAGCCTGAGTCTTCTGTTTACAAGTCTATCATGACTTGGTCGTGTCGAAGAAAAAAGGTTTCACTCGCGTTTAGTGTTTGGATGAAGTATCTGAGGAATTTTCGTGGCTGGAAAGATGAAAAGGTCAAAGTAGTAGAGGAAAGTTTCGACAAAGGAGACCTTGAAAAGGCGATCTCGAGAATAATCGAAATGGACTTGAACTCAAAAGACTTCGACTTGGCTCCATACACCATTTTTCTCATTGGATTGTGTCAAGCAGGGAGGGTTTCTGAAGCCTTCGCAATATTTTCTGTTCTTAAGGACTTCAAAAGGATTATAAGTTCAGCAAGCTGCGTGATGTTGATTGGTGGGCTTTGCGTTGAAGGAAAACTTGACCTGGCTGTGGAAGTTTTCCTTTATACACTAGAAACAGGCACTATGTTGATGCCTAGAATTTGTAACCAACTGCTAAGGCATCTTCATTTAGAGGACAGGAAGGATCATGCTTTTGTTCTTATACGTAGAATGGAGGCTTTTGGATATGATATGAATGCTTATCTCCACCACAGTACTAAGTCACTTCTTCATGATCATTGGAAGTCATTGAAAGCTAAAGCTAGACACGAGCAGTGGTTGACGAATTCACAGCAGCAACTCCTAAATGCCACATTTCCTATGGTTGAAAGTAATTAG

Protein sequence

MKRRSTFLRPVVTYLVPKPPWFHLFHTPTDPIATSNEVSTIIETVDPIEDALETIAPHISSDVITSVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMTWSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNSKDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAVEVFLYTLETGTMLMPRICNQLLRHLHLEDRKDHAFVLIRRMEAFGYDMNAYLHHSTKSLLHDHWKSLKAKARHEQWLTNSQQQLLNATFPMVESN
BLAST of CmaCh02G006950 vs. Swiss-Prot
Match: PP133_ARATH (Pentatricopeptide repeat-containing protein At1g79540 OS=Arabidopsis thaliana GN=At1g79540 PE=2 SV=1)

HSP 1 Score: 777.3 bits (2006), Expect = 1.6e-223
Identity = 394/787 (50.06%), Postives = 534/787 (67.85%), Query Frame = 1

Query: 7   FLRPVVTYLVPKPPWFHLFHTPTDP-IATSNEVSTIIETVDPIEDALETIAPHISSDVIT 66
           F R V+ +   KP W    ++  +     S EV +I+    PIE ALE + P +S ++IT
Sbjct: 6   FFRSVIQFY-SKPSWMQRSYSSGNAEFNISGEVISILAKKKPIEPALEPLVPFLSKNIIT 65

Query: 67  SVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELKDSSTE 126
           SVI+++ N +LGFR FIW+ RR  L    S  L+ID L +DN  +LYW+TL+ELK     
Sbjct: 66  SVIKDEVNRQLGFRFFIWASRRERLRSRESFGLVIDMLSEDNGCDLYWQTLEELKSGGVS 125

Query: 127 ISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAF-LLALA 186
           + S  F VLI AY+K GM EKAV+SFG MK+F+C+P++F YN+IL V++R E F +LA A
Sbjct: 126 VDSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILRVMMREEVFFMLAFA 185

Query: 187 VYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLC 246
           VYN+MLKCN +PN+ T+ IL+ G  K  +T +A  +FD+MT R + PN +TY+I++SGLC
Sbjct: 186 VYNEMLKCNCSPNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISGLC 245

Query: 247 QAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVK 306
           Q    DDA++LF +M+ SG  PD + +N LL+GFCKLG   EAF LLR FEKDG +LG++
Sbjct: 246 QRGSADDARKLFYEMQTSGNYPDSVAHNALLDGFCKLGRMVEAFELLRLFEKDGFVLGLR 305

Query: 307 GYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEM 366
           GYS LIDGLFRARRY +A   Y    +KN++PD+ILYTI+IQGL + G++ +AL LL  M
Sbjct: 306 GYSSLIDGLFRARRYTQAFELYANMLKKNIKPDIILYTILIQGLSKAGKIEDALKLLSSM 365

Query: 367 TERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLI 426
             +G SPDT CYNAVI+  C  GLL++ +SL+LE+S  + FPD  T++ILIC MC+NGL+
Sbjct: 366 PSKGISPDTYCYNAVIKALCGRGLLEEGRSLQLEMSETESFPDACTHTILICSMCRNGLV 425

Query: 427 DEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQG 486
            EA+ +F E+EK GC PSV TFN+LIDG CK+G+LKEA LL +KME+GR  SLFLRL   
Sbjct: 426 REAEEIFTEIEKSGCSPSVATFNALIDGLCKSGELKEARLLLHKMEVGRPASLFLRLSHS 485

Query: 487 ANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDG 546
            N+   T         + ESG I KAY+ L    ++G  PDI +YN+LINGFC+  +IDG
Sbjct: 486 GNRSFDT---------MVESGSILKAYRDLAHFADTGSSPDIVSYNVLINGFCRAGDIDG 545

Query: 547 AFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMT 606
           A KL   +QLKG  PDS+TY TLI+GLHRVGR+E+A  +F    K+  +   +VY+S+MT
Sbjct: 546 ALKLLNVLQLKGLSPDSVTYNTLINGLHRVGREEEAFKLF--YAKDDFRHSPAVYRSLMT 605

Query: 607 WSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNSKDFD 666
           WSCR++KV +AF++WMKYL+      DE    +E+ F +G+ E+A+ R+IE+D    +  
Sbjct: 606 WSCRKRKVLVAFNLWMKYLKKISCLDDETANEIEQCFKEGETERALRRLIELDTRKDELT 665

Query: 667 LAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAVEVFL 726
           L PYTI+LIGLCQ+GR  EA  +FSVL++ K +++  SCV LI GLC   +LD A+EVFL
Sbjct: 666 LGPYTIWLIGLCQSGRFHEALMVFSVLREKKILVTPPSCVKLIHGLCKREQLDAAIEVFL 725

Query: 727 YTLETGTMLMPRICNQLLRHLHLEDRKDHAFV--LIRRMEAFGYDMNAYL-------HHS 783
           YTL+    LMPR+CN LL  L LE  +    V  L  RME  GY++++ L       H  
Sbjct: 726 YTLDNNFKLMPRVCNYLLSSL-LESTEKMEIVSQLTNRMERAGYNVDSMLRFEILKYHRH 779

BLAST of CmaCh02G006950 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 296.2 bits (757), Expect = 1.1e-78
Identity = 183/669 (27.35%), Postives = 329/669 (49.18%), Query Frame = 1

Query: 102 RLVKDNAFELYWKTLQELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKP 161
           + + D    L +K+LQE  D     SS  F +++++YS+  + +KA+    + +     P
Sbjct: 109 KTLDDEYASLVFKSLQETYDLCYSTSS-VFDLVVKSYSRLSLIDKALSIVHLAQAHGFMP 168

Query: 162 NIFAYNLILHVLVRREAFL-LALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVL 221
            + +YN +L   +R +  +  A  V+ +ML+  ++PNV TY+ILI GFC       AL L
Sbjct: 169 GVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTL 228

Query: 222 FDEMTDRDVLPNEITYSIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCK 281
           FD+M  +  LPN +TY+ ++ G C+ +KIDD  +L   M   G  P++I+YNV++NG C+
Sbjct: 229 FDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCR 288

Query: 282 LGYFDEAFALLRSFEKDGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVIL 341
            G   E   +L    + G+ L    Y+ LI G  +   + +A + + +  R  + P VI 
Sbjct: 289 EGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVIT 348

Query: 342 YTIMIQGLCQEGRVNEALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEIS 401
           YT +I  +C+ G +N A+  LD+M  RG  P+   Y  ++ GF   G +++A  +  E++
Sbjct: 349 YTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMN 408

Query: 402 NHDCFPDNHTYSILICGMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLK 461
           ++   P   TY+ LI G C  G +++A  V  +M++ G  P VV++++++ GFC++  + 
Sbjct: 409 DNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVD 468

Query: 462 EAHLLFYKM-EIGRKPSLFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVE 521
           EA  +  +M E G KP               T+    +++  CE     +A  L  +++ 
Sbjct: 469 EALRVKREMVEKGIKPD--------------TITYSSLIQGFCEQRRTKEACDLYEEMLR 528

Query: 522 SGVFPDIRTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDED 581
            G+ PD  TY  LIN +C   +++ A +L  +M  KG LPD +TY  LI+GL++  R  +
Sbjct: 529 VGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTRE 588

Query: 582 ALGIFEQMVKNGCKPESSVYKSIMTWSCRRKKVSLAFSVWMKYLRNF--RGWKDEKVKVV 641
           A  +  ++      P    Y +++  +C     ++ F   +  ++ F  +G   E  +V 
Sbjct: 589 AKRLLLKLFYEESVPSDVTYHTLIE-NCS----NIEFKSVVSLIKGFCMKGMMTEADQVF 648

Query: 642 EESFDKGDLEKAISRIIEMDLNSKDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRI 701
           E    K               N K  D   Y I + G C+AG + +A+ ++  +     +
Sbjct: 649 ESMLGK---------------NHKP-DGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFL 708

Query: 702 ISSASCVMLIGGLCVEGKLDLAVEVFLYTLETGTMLMPRICNQLLRHLHLEDRKDHAFVL 761
           + + + + L+  L  EGK++    V ++ L +  +        L+   H E   D    +
Sbjct: 709 LHTVTVIALVKALHKEGKVNELNSVIVHVLRSCELSEAEQAKVLVEINHREGNMDVVLDV 741

Query: 762 IRRMEAFGY 767
           +  M   G+
Sbjct: 769 LAEMAKDGF 741

BLAST of CmaCh02G006950 vs. Swiss-Prot
Match: PPR96_ARATH (Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidopsis thaliana GN=At1g62930 PE=2 SV=2)

HSP 1 Score: 293.9 bits (751), Expect = 5.4e-78
Identity = 167/515 (32.43%), Postives = 278/515 (53.98%), Query Frame = 1

Query: 95  SQDLIIDRLVKDNAFELYWKTLQELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMM 154
           S+++++D L  D+A +L+ + +Q    S    S   F+ L+ A +K    +  +     M
Sbjct: 52  SRNVLLD-LKLDDAVDLFGEMVQ----SRPLPSIVEFNKLLSAIAKMNKFDLVISLGERM 111

Query: 155 KDFECKPNIFAYNLILHVLVRREAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKT 214
           ++     ++++YN++++   RR    LALAV  +M+K    P++VT S L++G+C   + 
Sbjct: 112 QNLRISYDLYSYNILINCFCRRSQLPLALAVLGKMMKLGYEPDIVTLSSLLNGYCHGKRI 171

Query: 215 QEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVL 274
            EA+ L D+M   +  PN +T++ ++ GL    K  +A  L  +M A GC PD+ TY  +
Sbjct: 172 SEAVALVDQMFVMEYQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTV 231

Query: 275 LNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNV 334
           +NG CK G  D A +LL+  EK      V  Y+ +ID L   +  ++A   + +   K +
Sbjct: 232 VNGLCKRGDIDLALSLLKKMEKGKIEADVVIYTTIIDALCNYKNVNDALNLFTEMDNKGI 291

Query: 335 EPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQS 394
            P+V+ Y  +I+ LC  GR ++A  LL +M ER  +P+   ++A+I  F   G L +A+ 
Sbjct: 292 RPNVVTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEK 351

Query: 395 LRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFC 454
           L  E+      PD  TYS LI G C +  +DEA+H+F  M    C P+VVT+N+LI GFC
Sbjct: 352 LYDEMIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFC 411

Query: 455 KAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLL 514
           KA +++E   LF +M          R L G      TV    +++ L ++G    A K+ 
Sbjct: 412 KAKRVEEGMELFREMS--------QRGLVG-----NTVTYNTLIQGLFQAGDCDMAQKIF 471

Query: 515 MQLVESGVFPDIRTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRV 574
            ++V  GV PDI TY+IL++G CK   ++ A  +F+ +Q     PD  TY  +I+G+ + 
Sbjct: 472 KKMVSDGVPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKA 531

Query: 575 GRDEDALGIFEQMVKNGCKPESSVYKSIMTWSCRR 610
           G+ ED   +F  +   G KP   +Y ++++  CR+
Sbjct: 532 GKVEDGWDLFCSLSLKGVKPNVIIYTTMISGFCRK 548

BLAST of CmaCh02G006950 vs. Swiss-Prot
Match: PPR91_ARATH (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 290.0 bits (741), Expect = 7.8e-77
Identity = 160/479 (33.40%), Postives = 259/479 (54.07%), Query Frame = 1

Query: 131 FSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYNQML 190
           FS L+ A +K    +  +     M++     N + Y+++++   RR    LALAV  +M+
Sbjct: 84  FSKLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYSILINCFCRRSQLPLALAVLGKMM 143

Query: 191 KCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKID 250
           K    PN+VT S L++G+C + +  EA+ L D+M      PN +T++ ++ GL    K  
Sbjct: 144 KLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTGYQPNTVTFNTLIHGLFLHNKAS 203

Query: 251 DAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLI 310
           +A  L  +M A GC PD++TY V++NG CK G  D AF LL   E+     GV  Y+ +I
Sbjct: 204 EAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNTII 263

Query: 311 DGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFS 370
           DGL + +  D+A   +++   K + P+V+ Y+ +I  LC  GR ++A  LL +M ER  +
Sbjct: 264 DGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWSDASRLLSDMIERKIN 323

Query: 371 PDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHV 430
           PD   ++A+I  F   G L +A+ L  E+      P   TYS LI G C +  +DEA+ +
Sbjct: 324 PDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCMHDRLDEAKQM 383

Query: 431 FNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANKVLG 490
           F  M    C P VVT+N+LI GFCK  +++E       ME+ R+ S   R L G      
Sbjct: 384 FEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEG------MEVFREMS--QRGLVG-----N 443

Query: 491 TVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDGAFKLFK 550
           TV   ++++ L ++G    A ++  ++V  GV P+I TYN L++G CK   ++ A  +F+
Sbjct: 444 TVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGKLEKAMVVFE 503

Query: 551 DMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMTWSCRR 610
            +Q     P   TY  +I+G+ + G+ ED   +F  +   G KP+   Y ++++  CR+
Sbjct: 504 YLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNTMISGFCRK 549

BLAST of CmaCh02G006950 vs. Swiss-Prot
Match: PPR98_ARATH (Pentatricopeptide repeat-containing protein At1g63080, mitochondrial OS=Arabidopsis thaliana GN=At1g63080 PE=2 SV=1)

HSP 1 Score: 286.2 bits (731), Expect = 1.1e-75
Identity = 156/491 (31.77%), Postives = 255/491 (51.93%), Query Frame = 1

Query: 131 FSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYNQML 190
           FS L+ A +K    +  +     M+      N++ YN++++ L RR     ALA+  +M+
Sbjct: 68  FSKLLSAIAKMKKFDLVISFGEKMEILGVSHNLYTYNIMINCLCRRSQLSFALAILGKMM 127

Query: 191 KCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKID 250
           K    P++VT + L++GFC  ++  EA+ L D+M +    P+ +T++ ++ GL Q  K  
Sbjct: 128 KLGYGPSIVTLNSLLNGFCHGNRISEAVALVDQMVEMGYQPDTVTFTTLVHGLFQHNKAS 187

Query: 251 DAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLI 310
           +A  L  +M   GC PD++TY  ++NG CK G  D A  LL   EK      V  YS +I
Sbjct: 188 EAVALVERMVVKGCQPDLVTYGAVINGLCKRGEPDLALNLLNKMEKGKIEADVVIYSTVI 247

Query: 311 DGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFS 370
           D L + R  D+A   + +   K + PDV  Y+ +I  LC  GR ++A  LL +M ER  +
Sbjct: 248 DSLCKYRHVDDALNLFTEMDNKGIRPDVFTYSSLISCLCNYGRWSDASRLLSDMLERKIN 307

Query: 371 PDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHV 430
           P+   +N++I  F   G L +A+ L  E+      P+  TY+ LI G C +  +DEAQ +
Sbjct: 308 PNVVTFNSLIDAFAKEGKLIEAEKLFDEMIQRSIDPNIVTYNSLINGFCMHDRLDEAQQI 367

Query: 431 FNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANKVLG 490
           F  M    CLP VVT+N+LI+GFCKA K+ +   LF             R +     V  
Sbjct: 368 FTLMVSKDCLPDVVTYNTLINGFCKAKKVVDGMELF-------------RDMSRRGLVGN 427

Query: 491 TVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDGAFKLFK 550
           TV    ++    ++     A  +  Q+V  GV P+I TYN L++G CK   ++ A  +F+
Sbjct: 428 TVTYTTLIHGFFQASDCDNAQMVFKQMVSDGVHPNIMTYNTLLDGLCKNGKLEKAMVVFE 487

Query: 551 DMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMTWSCRRK 610
            +Q     PD  TY  + +G+ + G+ ED   +F  +   G KP+   Y ++++  C++ 
Sbjct: 488 YLQKSKMEPDIYTYNIMSEGMCKAGKVEDGWDLFCSLSLKGVKPDVIAYNTMISGFCKKG 545

Query: 611 KVSLAFSVWMK 622
               A+++++K
Sbjct: 548 LKEEAYTLFIK 545

BLAST of CmaCh02G006950 vs. TrEMBL
Match: A0A0A0KD52_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G134370 PE=4 SV=1)

HSP 1 Score: 1179.5 bits (3050), Expect = 0.0e+00
Identity = 577/783 (73.69%), Postives = 667/783 (85.19%), Query Frame = 1

Query: 1   MKRRSTFLRPVVTYLVPKPPWFHLFHTPTDPIATSNEVSTIIETVDPIEDALETIAPHIS 60
           MK R    RP++ ++VPKP  FH +H+ T+PIATS EVSTIIET+DP+ED L+ I+  I 
Sbjct: 1   MKLRPILFRPIIIHVVPKPTLFHSYHSRTNPIATSIEVSTIIETLDPMEDGLKVISSRIR 60

Query: 61  SDVITSVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELK 120
           S  ITSV+QEQP+ RLGFRLFIWSL+  HL C   QDLII +L+K+NAFELYWK LQELK
Sbjct: 61  SYTITSVLQEQPDTRLGFRLFIWSLKSWHLRCRTVQDLIIGKLIKENAFELYWKVLQELK 120

Query: 121 DSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFL 180
           +S+ +ISS+AFSVLIEAYS+AGM+EKAV+SFG+M+DF+CKP++FA+NLILH LVR+EAFL
Sbjct: 121 NSAIKISSEAFSVLIEAYSEAGMDEKAVESFGLMRDFDCKPDLFAFNLILHFLVRKEAFL 180

Query: 181 LALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIIL 240
           LALAVYNQMLKCNLNP+VVTY ILIHG CKT KTQ+ALVLFDEMTDR +LPN+I YSI+L
Sbjct: 181 LALAVYNQMLKCNLNPDVVTYGILIHGLCKTCKTQDALVLFDEMTDRGILPNQIIYSIVL 240

Query: 241 SGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHI 300
           SGLCQAKKI DAQRLF KMRASGC+ D+ITYNVLLNGFCK GY D+AF LL+   KDGHI
Sbjct: 241 SGLCQAKKIFDAQRLFSKMRASGCNRDLITYNVLLNGFCKSGYLDDAFTLLQLLTKDGHI 300

Query: 301 LGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALAL 360
           LGV GY CLI+GLFRARRY+EAHMWYQK  R+N++PDV+LYTIMI+GL QEGRV EAL L
Sbjct: 301 LGVIGYGCLINGLFRARRYEEAHMWYQKMLRENIKPDVMLYTIMIRGLSQEGRVTEALTL 360

Query: 361 LDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCK 420
           L EMTERG  PDT CYNA+I+GFCDMG LD+A+SLRLEIS HDCFP+NHTYSILICGMCK
Sbjct: 361 LGEMTERGLRPDTICYNALIKGFCDMGYLDEAESLRLEISKHDCFPNNHTYSILICGMCK 420

Query: 421 NGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLR 480
           NGLI++AQH+F EMEKLGCLPSVVTFNSLI+G CKA +L+EA LLFY+MEI RKPSLFLR
Sbjct: 421 NGLINKAQHIFKEMEKLGCLPSVVTFNSLINGLCKANRLEEARLLFYQMEIVRKPSLFLR 480

Query: 481 LLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTN 540
           L QG +KV     LQVM+E+LCESG+I KAYKLLMQLV+SGV PDIRTYNILINGFCK  
Sbjct: 481 LSQGTDKVFDIASLQVMMERLCESGMILKAYKLLMQLVDSGVLPDIRTYNILINGFCKFG 540

Query: 541 NIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYK 600
           NI+GAFKLFK+MQLKG +PDS+TYGTLIDGL+R GR+EDAL IFEQMVK GC PESS YK
Sbjct: 541 NINGAFKLFKEMQLKGHMPDSVTYGTLIDGLYRAGRNEDALEIFEQMVKKGCVPESSTYK 600

Query: 601 SIMTWSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNS 660
           +IMTWSCR   +SLA SVWMKYLR+FRGW+DEKV+VV ESFD  +L+ AI R++EMD+ S
Sbjct: 601 TIMTWSCRENNISLALSVWMKYLRDFRGWEDEKVRVVAESFDNEELQTAIRRLLEMDIKS 660

Query: 661 KDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAV 720
           K+FDLAPYTIFLIGL QA R  EAFAIFSVLKDFK  ISSASCVMLIG LC+   LD+A+
Sbjct: 661 KNFDLAPYTIFLIGLVQAKRDCEAFAIFSVLKDFKMNISSASCVMLIGRLCMVENLDMAM 720

Query: 721 EVFLYTLETGTMLMPRICNQLLRHLHLEDRKDHAFVLIRRMEAFGYDMNAYLHHSTKSLL 780
           +VFL+TLE G  LMP ICNQLL +L   DRKD A  L  RMEA GYD+ A+LH+ TK  L
Sbjct: 721 DVFLFTLERGFRLMPPICNQLLCNLLHLDRKDDALFLANRMEASGYDLGAHLHYRTKLHL 780

Query: 781 HDH 784
           HDH
Sbjct: 781 HDH 783

BLAST of CmaCh02G006950 vs. TrEMBL
Match: F6HKV9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g08720 PE=4 SV=1)

HSP 1 Score: 992.3 bits (2564), Expect = 3.6e-286
Identity = 472/779 (60.59%), Postives = 604/779 (77.54%), Query Frame = 1

Query: 12  VTYLVPKPPWFHLFH----TPTDPIATSNEVSTIIETVDPIEDALETIAPHISSDVITSV 71
           V + +PK   F   H    T     A SNEV T++ETV+P+EDALE +AP +SS+++  V
Sbjct: 11  VLHFIPKQSRFRCLHANLFTTAQGAAISNEVLTVMETVNPMEDALEKLAPFLSSEIVNDV 70

Query: 72  IQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELKDSSTEIS 131
           ++EQ    LGFR FIW+ RRR      + +L+ID L KD+ F+ YWK L+ELK+S+ +I 
Sbjct: 71  MREQRRPELGFRFFIWTTRRRSFRSWVTHNLVIDMLAKDDGFDTYWKILEELKNSNIQIP 130

Query: 132 SDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYN 191
              FSVLI AY+K+GM EKAV+SFG MKDF CKP++F YN ILHV+V++E FLLALAVYN
Sbjct: 131 PPTFSVLIAAYAKSGMAEKAVESFGKMKDFGCKPDVFTYNSILHVMVQKEVFLLALAVYN 190

Query: 192 QMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAK 251
           QMLK N NPN  T+ IL++G CK  KT +AL +FDEMT + + PN + Y+IILSGLCQAK
Sbjct: 191 QMLKLNYNPNRATFVILLNGLCKNGKTDDALKMFDEMTQKGIPPNTMIYTIILSGLCQAK 250

Query: 252 KIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYS 311
           + DD  RL   M+ SGC PD IT N LL+GFCKLG  DEAFALL+ FEK+G++LG+KGYS
Sbjct: 251 RTDDVHRLLNTMKVSGCCPDSITCNALLDGFCKLGQIDEAFALLQLFEKEGYVLGIKGYS 310

Query: 312 CLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTER 371
            LIDGLFRA+RYDE   W +K  +  +EPDV+LYTI+I+G C+ G V+ AL +L++MT+R
Sbjct: 311 SLIDGLFRAKRYDEVQEWCRKMFKAGIEPDVVLYTILIRGFCEVGMVDYALNMLNDMTQR 370

Query: 372 GFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEA 431
           G SPDT CYNA+I+GFCD+GLLDKA+SL+LEIS +DCFP + TY+ILICGMC+NGL+DEA
Sbjct: 371 GLSPDTYCYNALIKGFCDVGLLDKARSLQLEISKNDCFPTSCTYTILICGMCRNGLLDEA 430

Query: 432 QHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANK 491
           + +FN+ME LGC PS++TFN+LIDG CKAG+L+EA  LFYKMEIG+ PSLFLRL QGA++
Sbjct: 431 RQIFNQMENLGCSPSIMTFNALIDGLCKAGELEEARHLFYKMEIGKNPSLFLRLSQGADR 490

Query: 492 VLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDGAFK 551
           V+ T  LQ M+E+LCESGLI KAYKLLMQL +SGV PDI TYN+LINGFCK  NI+GAFK
Sbjct: 491 VMDTASLQTMVERLCESGLILKAYKLLMQLADSGVVPDIMTYNVLINGFCKAKNINGAFK 550

Query: 552 LFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMTWSC 611
           LF+++QLKG  PDS+TYGTLIDG HRV R+EDA  + +QMVKNGC P S+VYK +MTWSC
Sbjct: 551 LFRELQLKGHSPDSVTYGTLIDGFHRVDREEDAFRVLDQMVKNGCTPSSAVYKCLMTWSC 610

Query: 612 RRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNSKDFDLAP 671
           R+ K+S+AFS+W+KYLR+    +DE +K+ EE F+KG+LEKA+  ++EM+    +F++AP
Sbjct: 611 RKGKLSVAFSLWLKYLRSLPSQEDETLKLAEEHFEKGELEKAVRCLLEMNFKLNNFEIAP 670

Query: 672 YTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAVEVFLYTL 731
           YTI+LIGLCQA R  EA  IF VLK+ +  ++  SCVMLI GLC +G L++AV++FLYTL
Sbjct: 671 YTIWLIGLCQARRSEEALKIFLVLKECQMDVNPPSCVMLINGLCKDGNLEMAVDIFLYTL 730

Query: 732 ETGTMLMPRICNQLLRHLHLEDRKDHAFVLIRRMEAFGYDMNAYLHHSTKSLLHDHWKS 787
           E G MLMPRICNQLLR L L+D+  HA  L+ RM + GYD++ YLHH  KS L   WK+
Sbjct: 731 EKGFMLMPRICNQLLRSLILQDKMKHALDLLNRMNSAGYDLDEYLHHRIKSYLLSVWKA 789

BLAST of CmaCh02G006950 vs. TrEMBL
Match: A0A061GXW2_THECC (Pentatricopeptide repeat (PPR) superfamily protein, putative OS=Theobroma cacao GN=TCM_039341 PE=4 SV=1)

HSP 1 Score: 966.5 bits (2497), Expect = 2.1e-278
Identity = 475/795 (59.75%), Postives = 611/795 (76.86%), Query Frame = 1

Query: 1   MKRRSTFLRPVVTYLVPKP-----PWFHLFHTPTDPIATSNEVSTIIETVDPIEDALETI 60
           MK  S F+RP+  +L  K      P F  F +  D  + SNE+ +I++ V+P+E ALE +
Sbjct: 1   MKLPSLFVRPIA-HLRSKTSKFLSPNFSSFSSLQD-FSVSNEIHSILDIVNPMEPALEPL 60

Query: 61  APHISSDVITSVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLV-KDNAFELYWK 120
            P +S D++TS+IQ+QPN +LGFR FIW+++R+ L  SAS  L++D L+ KDN F++YW+
Sbjct: 61  LPFLSPDIVTSIIQDQPNPQLGFRFFIWAMQRKRLRSSASDKLVVDMLLRKDNGFDMYWQ 120

Query: 121 TLQELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLV 180
           TL+E+K     I SDAF VLI  YSK G++EKAV+ FG MKDF+CKP++F YN IL+V+V
Sbjct: 121 TLEEIKKCGALIVSDAFKVLISGYSKLGLDEKAVECFGKMKDFDCKPDVFTYNTILYVMV 180

Query: 181 RREAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEI 240
           RR+  LLALAVYNQMLK N  PN  T+SILI G CK  KT++AL +FDEMT R + PN  
Sbjct: 181 RRKVLLLALAVYNQMLKNNYKPNRATFSILIDGLCKNGKTEDALNMFDEMTQRGIEPNRC 240

Query: 241 TYSIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSF 300
           +Y+II+SGLCQA + DDA RL  KM+ SGCSPD + YN LLNGFC+LG  DEAFALL+SF
Sbjct: 241 SYTIIVSGLCQADRADDACRLLNKMKESGCSPDFVAYNALLNGFCQLGRVDEAFALLQSF 300

Query: 301 EKDGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRV 360
           +KDG +LG++GYS  I+GLFRARR++EA+ WY K   +NV+PDV+LY IM++GL   G+V
Sbjct: 301 QKDGFVLGLRGYSSFINGLFRARRFEEAYAWYTKMFEENVKPDVVLYAIMLRGLSVAGKV 360

Query: 361 NEALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSIL 420
            +A+ LL EMTERG  PDT CYNAVI+GFCD GLLD+A+SL+LEIS++DCFP+  TY+IL
Sbjct: 361 EDAMKLLSEMTERGLVPDTYCYNAVIKGFCDTGLLDQARSLQLEISSYDCFPNACTYTIL 420

Query: 421 ICGMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRK 480
           I GMC+NGL+ EAQ +F+EMEKLGC PSVVTFN+LIDG  KAG+L++AHLLFYKMEIGR 
Sbjct: 421 ISGMCQNGLVGEAQQIFDEMEKLGCFPSVVTFNALIDGLSKAGQLEKAHLLFYKMEIGRN 480

Query: 481 PSLFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILIN 540
           PSLFLRL  G++ VL +  LQ M+EQL ESG I KAY++LMQL + G  PDI TYNILI+
Sbjct: 481 PSLFLRLSHGSSGVLDSSSLQTMVEQLYESGRILKAYRILMQLADGGNVPDIFTYNILIH 540

Query: 541 GFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKP 600
           GFCK  NI+GAFKLFK++QLKG  PDS+TYGTLI+G    GR+EDA  IF+QMVKNGCKP
Sbjct: 541 GFCKAGNINGAFKLFKELQLKGISPDSVTYGTLINGFQMAGREEDAFRIFDQMVKNGCKP 600

Query: 601 ESSVYKSIMTWSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRII 660
             +VY+S+MTWSCRR+KVSLAF++W+ YLR+  G +D  +K VE+ FD+G +EKA+  ++
Sbjct: 601 SVAVYRSLMTWSCRRRKVSLAFNLWLMYLRSLPGRQDTVIKEVEKYFDEGQVEKAVRGLL 660

Query: 661 EMDLNSKDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEG 720
            MD     F +APYTI+LIGLCQAGRV EA  IF +L++ K +++  SCV LI GLC EG
Sbjct: 661 RMDFKLNSFSVAPYTIWLIGLCQAGRVEEALKIFYILEECKVVVTPPSCVRLIVGLCKEG 720

Query: 721 KLDLAVEVFLYTLETGTMLMPRICNQLLRH-LHLEDRKDHAFVLIRRMEAFGYDMNAYLH 780
            LDLAV+VFLYTLE G  LMPRICN LL+  L  +D++ HAF L+ +M +  YD++AYLH
Sbjct: 721 NLDLAVDVFLYTLEQGFKLMPRICNYLLKSLLRSKDKRMHAFGLLSKMNSQRYDLDAYLH 780

Query: 781 HSTKSLLHDHWKSLK 789
            +TKSLL+ HW + K
Sbjct: 781 KTTKSLLYRHWHTWK 793

BLAST of CmaCh02G006950 vs. TrEMBL
Match: B9SU41_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1215850 PE=4 SV=1)

HSP 1 Score: 958.4 bits (2476), Expect = 5.7e-276
Identity = 470/801 (58.68%), Postives = 608/801 (75.91%), Query Frame = 1

Query: 1   MKRRSTFLRPVVTYLVPKPPWFHLFHTPTD-PIATSNEVSTIIETVDPIEDALETIAPHI 60
           MK+  + LR +      KPPW   FHT +    A SNEV TII++V+PIE ALE+  P +
Sbjct: 1   MKKLRSLLREISR---AKPPWKQHFHTYSAVDFAISNEVLTIIDSVNPIEPALESKVPFL 60

Query: 61  SSDVITSVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQEL 120
           S  ++T +I+  PN+ LGFR FIW+ + R L    S ++IID L+KDN FELYW+ L+E+
Sbjct: 61  SPSIVTYIIKNPPNSLLGFRFFIWASKFRRLRSWVSHNMIIDMLIKDNGFELYWQVLKEI 120

Query: 121 KDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAF 180
           K     IS+DAF+VLI+AY+K  M EKAV+SF MMKDF+CKP++F YN +LHV+VR+E  
Sbjct: 121 KRCGFSISADAFTVLIQAYAKMDMIEKAVESFEMMKDFDCKPDVFTYNTVLHVMVRKEVV 180

Query: 181 LLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSII 240
           LLAL +YN+MLK N  PN+ T+SILI G CK+ KTQ AL +FDEMT R +LPN+ITY+II
Sbjct: 181 LLALGIYNRMLKLNCLPNIATFSILIDGMCKSGKTQNALQMFDEMTQRRILPNKITYTII 240

Query: 241 LSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGH 300
           +SGLCQA+K D A RLFI M+  GC PD +TYN LL+GFCKLG  DEA  LL+ FEKD +
Sbjct: 241 ISGLCQAQKADVAYRLFIAMKDHGCIPDSVTYNALLHGFCKLGRVDEALGLLKYFEKDRY 300

Query: 301 ILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALA 360
           +L  +GYSCLIDGLFRARR+++A +WY+K +  N++PDVILYTIM++GL + G+  +AL 
Sbjct: 301 VLDKQGYSCLIDGLFRARRFEDAQVWYRKMTEHNIKPDVILYTIMMKGLSKAGKFKDALR 360

Query: 361 LLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMC 420
           LL+EMTERG  PDT CYNA+I+G+CD+GLLD+A+SL LEIS +DCF    TY+ILICGMC
Sbjct: 361 LLNEMTERGLVPDTHCYNALIKGYCDLGLLDEAKSLHLEISKNDCFSSACTYTILICGMC 420

Query: 421 KNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFL 480
           ++GL+ +AQ +FNEMEK GC PSVVTFN+LIDGFCKAG +++A LLFYKMEIGR PSLFL
Sbjct: 421 RSGLVGDAQQIFNEMEKHGCYPSVVTFNALIDGFCKAGNIEKAQLLFYKMEIGRNPSLFL 480

Query: 481 RLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKT 540
           RL QGAN+VL T  LQ M+EQLC+SGLI KAY +LMQL +SG  P+I TYNILI+GFCK 
Sbjct: 481 RLSQGANRVLDTASLQTMVEQLCDSGLILKAYNILMQLTDSGFAPNIITYNILIHGFCKA 540

Query: 541 NNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVY 600
            NI+GAFKLFK++QLKG  PDS+TYGTLI+GL    R+EDA  + +Q++KNGC P + VY
Sbjct: 541 GNINGAFKLFKELQLKGLSPDSVTYGTLINGLLSANREEDAFTVLDQILKNGCTPITEVY 600

Query: 601 KSIMTWSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLN 660
           KS MTWSCRR K++LAFS+W+KYLR+  G   E +K VEE+F+KG++E+A+  ++EMD  
Sbjct: 601 KSFMTWSCRRNKITLAFSLWLKYLRSIPGRDSEVLKSVEENFEKGEVEEAVRGLLEMDFK 660

Query: 661 SKDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLA 720
             DF LAPYTI+LIGLCQAGR+ EA  IF  L++   +++  SCV LI  L   G LDLA
Sbjct: 661 LNDFQLAPYTIWLIGLCQAGRLEEALKIFFTLEEHNVLVTPPSCVKLIYRLLKVGNLDLA 720

Query: 721 VEVFLYTLETGTMLMPRICNQLLRH-LHLEDRKDHAFVLIRRMEAFGYDMNAYLHHSTKS 780
            E+FLYT++ G MLMPRICN+LL+  L  ED+++ AF L+ RM++ GYD++++LH +TK 
Sbjct: 721 AEIFLYTIDKGYMLMPRICNRLLKSLLRSEDKRNRAFDLLSRMKSLGYDLDSHLHQTTKF 780

Query: 781 LLH---DHWKSLKAKARHEQW 797
           LL     H  SLK     E +
Sbjct: 781 LLQGDAGHQVSLKINFLSESY 798

BLAST of CmaCh02G006950 vs. TrEMBL
Match: V4RKA3_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004347mg PE=4 SV=1)

HSP 1 Score: 935.6 bits (2417), Expect = 4.0e-269
Identity = 453/792 (57.20%), Postives = 594/792 (75.00%), Query Frame = 1

Query: 1   MKRRSTFLRPVVT--YLVPK-PPWFHLFHTPTDP-IATSNEVSTIIETVDPIEDALETIA 60
           MK  S  LRP+ +     PK  P FH  H+P+    +T NEV TI++TV PIE ALE + 
Sbjct: 1   MKLPSLLLRPISSAHQFSPKLSPPFHYLHSPSSAESSTINEVLTILDTVTPIEPALEPLL 60

Query: 61  PHISSDVITSVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTL 120
           P +S   +TSVI +  N ++GFR FIW+ +R+ L   AS   +I  L+K N F+LYW+TL
Sbjct: 61  PFLSKTTVTSVIMKTKNPQVGFRFFIWAAKRKRLRSFASNSAVIRMLLKPNGFDLYWQTL 120

Query: 121 QELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRR 180
            ELK  +  + SD F VLI  Y K G  EKA++SFG MK+F+C+P+++ YN +L+++ R+
Sbjct: 121 DELKSGNVSVVSDVFFVLISGYYKVGDCEKALESFGKMKEFDCQPDVYMYNAVLNIVFRK 180

Query: 181 EAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITY 240
           + FLLALAVY +M+K N  PN+VT+S+LI G  K+ KT+ A+ +FDEMT R +LPN+ TY
Sbjct: 181 QLFLLALAVYYEMVKLNCLPNIVTFSLLIDGLSKSGKTEVAIKMFDEMTQRGILPNKFTY 240

Query: 241 SIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEK 300
           +I++SGLCQ  + D+A RLF+KM+ SGCSPD + YN LLNGFCKL   DEA ALLRSFEK
Sbjct: 241 TIVISGLCQINRADEAYRLFLKMKDSGCSPDFVAYNALLNGFCKLRGVDEALALLRSFEK 300

Query: 301 DGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNE 360
           DG + G+  YSCLIDGLFRA+RYDEA+ WY+K   + +EPDV+LY ++I+GL + G+V +
Sbjct: 301 DGFVPGLGSYSCLIDGLFRAKRYDEAYAWYRKMFEEKIEPDVVLYGVIIRGLSEAGKVKD 360

Query: 361 ALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILIC 420
           A+ LL +M++RG  PD  CYNA+I+GFCD+GLLD+A+SL++EI   D  P+ HT++ILIC
Sbjct: 361 AMKLLSDMSDRGIVPDIYCYNALIKGFCDLGLLDQARSLQVEIWKRDSLPNTHTFTILIC 420

Query: 421 GMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPS 480
           GMC+NG++D+AQ +FN+MEK GC PSV TFN+LIDG CKAG+L++A+LLFYKMEIG+ P+
Sbjct: 421 GMCRNGMVDDAQKLFNKMEKAGCFPSVGTFNALIDGLCKAGELEKANLLFYKMEIGKNPT 480

Query: 481 LFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGF 540
           LFLRL QG N+V     LQ M+EQ C SGLIHKAYK+LMQL ESG  PDI TYNILINGF
Sbjct: 481 LFLRLSQGGNRVHDKASLQTMVEQYCTSGLIHKAYKILMQLAESGNLPDIITYNILINGF 540

Query: 541 CKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPES 600
           CK  NI+GA KLFK++QLKG  PDS+TYGTLI+GL RV R+EDA  IFEQM +NGC P  
Sbjct: 541 CKVGNINGALKLFKELQLKGLSPDSVTYGTLINGLQRVDREEDAFRIFEQMPQNGCTPSP 600

Query: 601 SVYKSIMTWSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEM 660
           +VYKS+MTWSCRR+K+SLAFS+W++YLR+  G  DE +K +EE   KG +E AI  ++EM
Sbjct: 601 AVYKSLMTWSCRRRKISLAFSLWLQYLRDISGRDDESMKSIEEFLQKGKVENAIQGLLEM 660

Query: 661 DLNSKDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKL 720
           D    DF LAPYTI+LIGLCQ G+V EAF IFS+L + K I++  SCV LI GLC  G L
Sbjct: 661 DFKLNDFQLAPYTIWLIGLCQDGQVKEAFNIFSILVECKAIVTPPSCVKLIHGLCKRGYL 720

Query: 721 DLAVEVFLYTLETGTMLMPRICNQLLRHLHL--EDRKDHAFVLIRRMEAFGYDMNAYLHH 780
           DLA++VFLYTL+   +L PR+CN LLR L L  +++K HA+ L+RRM++ GYD++A L+ 
Sbjct: 721 DLAMDVFLYTLKNDFILRPRVCNYLLRSLLLSKDNKKVHAYHLLRRMKSVGYDLDACLYP 780

Query: 781 STKSLLHDHWKS 787
            TKSLL   W +
Sbjct: 781 KTKSLLPGPWNT 792

BLAST of CmaCh02G006950 vs. TAIR10
Match: AT1G79540.1 (AT1G79540.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 777.3 bits (2006), Expect = 9.1e-225
Identity = 394/787 (50.06%), Postives = 534/787 (67.85%), Query Frame = 1

Query: 7   FLRPVVTYLVPKPPWFHLFHTPTDP-IATSNEVSTIIETVDPIEDALETIAPHISSDVIT 66
           F R V+ +   KP W    ++  +     S EV +I+    PIE ALE + P +S ++IT
Sbjct: 6   FFRSVIQFY-SKPSWMQRSYSSGNAEFNISGEVISILAKKKPIEPALEPLVPFLSKNIIT 65

Query: 67  SVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELKDSSTE 126
           SVI+++ N +LGFR FIW+ RR  L    S  L+ID L +DN  +LYW+TL+ELK     
Sbjct: 66  SVIKDEVNRQLGFRFFIWASRRERLRSRESFGLVIDMLSEDNGCDLYWQTLEELKSGGVS 125

Query: 127 ISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAF-LLALA 186
           + S  F VLI AY+K GM EKAV+SFG MK+F+C+P++F YN+IL V++R E F +LA A
Sbjct: 126 VDSYCFCVLISAYAKMGMAEKAVESFGRMKEFDCRPDVFTYNVILRVMMREEVFFMLAFA 185

Query: 187 VYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLC 246
           VYN+MLKCN +PN+ T+ IL+ G  K  +T +A  +FD+MT R + PN +TY+I++SGLC
Sbjct: 186 VYNEMLKCNCSPNLYTFGILMDGLYKKGRTSDAQKMFDDMTGRGISPNRVTYTILISGLC 245

Query: 247 QAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVK 306
           Q    DDA++LF +M+ SG  PD + +N LL+GFCKLG   EAF LLR FEKDG +LG++
Sbjct: 246 QRGSADDARKLFYEMQTSGNYPDSVAHNALLDGFCKLGRMVEAFELLRLFEKDGFVLGLR 305

Query: 307 GYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEM 366
           GYS LIDGLFRARRY +A   Y    +KN++PD+ILYTI+IQGL + G++ +AL LL  M
Sbjct: 306 GYSSLIDGLFRARRYTQAFELYANMLKKNIKPDIILYTILIQGLSKAGKIEDALKLLSSM 365

Query: 367 TERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLI 426
             +G SPDT CYNAVI+  C  GLL++ +SL+LE+S  + FPD  T++ILIC MC+NGL+
Sbjct: 366 PSKGISPDTYCYNAVIKALCGRGLLEEGRSLQLEMSETESFPDACTHTILICSMCRNGLV 425

Query: 427 DEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQG 486
            EA+ +F E+EK GC PSV TFN+LIDG CK+G+LKEA LL +KME+GR  SLFLRL   
Sbjct: 426 REAEEIFTEIEKSGCSPSVATFNALIDGLCKSGELKEARLLLHKMEVGRPASLFLRLSHS 485

Query: 487 ANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDG 546
            N+   T         + ESG I KAY+ L    ++G  PDI +YN+LINGFC+  +IDG
Sbjct: 486 GNRSFDT---------MVESGSILKAYRDLAHFADTGSSPDIVSYNVLINGFCRAGDIDG 545

Query: 547 AFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMT 606
           A KL   +QLKG  PDS+TY TLI+GLHRVGR+E+A  +F    K+  +   +VY+S+MT
Sbjct: 546 ALKLLNVLQLKGLSPDSVTYNTLINGLHRVGREEEAFKLF--YAKDDFRHSPAVYRSLMT 605

Query: 607 WSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNSKDFD 666
           WSCR++KV +AF++WMKYL+      DE    +E+ F +G+ E+A+ R+IE+D    +  
Sbjct: 606 WSCRKRKVLVAFNLWMKYLKKISCLDDETANEIEQCFKEGETERALRRLIELDTRKDELT 665

Query: 667 LAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAVEVFL 726
           L PYTI+LIGLCQ+GR  EA  +FSVL++ K +++  SCV LI GLC   +LD A+EVFL
Sbjct: 666 LGPYTIWLIGLCQSGRFHEALMVFSVLREKKILVTPPSCVKLIHGLCKREQLDAAIEVFL 725

Query: 727 YTLETGTMLMPRICNQLLRHLHLEDRKDHAFV--LIRRMEAFGYDMNAYL-------HHS 783
           YTL+    LMPR+CN LL  L LE  +    V  L  RME  GY++++ L       H  
Sbjct: 726 YTLDNNFKLMPRVCNYLLSSL-LESTEKMEIVSQLTNRMERAGYNVDSMLRFEILKYHRH 779

BLAST of CmaCh02G006950 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 296.2 bits (757), Expect = 6.2e-80
Identity = 183/669 (27.35%), Postives = 329/669 (49.18%), Query Frame = 1

Query: 102 RLVKDNAFELYWKTLQELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKP 161
           + + D    L +K+LQE  D     SS  F +++++YS+  + +KA+    + +     P
Sbjct: 109 KTLDDEYASLVFKSLQETYDLCYSTSS-VFDLVVKSYSRLSLIDKALSIVHLAQAHGFMP 168

Query: 162 NIFAYNLILHVLVRREAFL-LALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVL 221
            + +YN +L   +R +  +  A  V+ +ML+  ++PNV TY+ILI GFC       AL L
Sbjct: 169 GVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTL 228

Query: 222 FDEMTDRDVLPNEITYSIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCK 281
           FD+M  +  LPN +TY+ ++ G C+ +KIDD  +L   M   G  P++I+YNV++NG C+
Sbjct: 229 FDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCR 288

Query: 282 LGYFDEAFALLRSFEKDGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVIL 341
            G   E   +L    + G+ L    Y+ LI G  +   + +A + + +  R  + P VI 
Sbjct: 289 EGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVIT 348

Query: 342 YTIMIQGLCQEGRVNEALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEIS 401
           YT +I  +C+ G +N A+  LD+M  RG  P+   Y  ++ GF   G +++A  +  E++
Sbjct: 349 YTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMN 408

Query: 402 NHDCFPDNHTYSILICGMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLK 461
           ++   P   TY+ LI G C  G +++A  V  +M++ G  P VV++++++ GFC++  + 
Sbjct: 409 DNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVD 468

Query: 462 EAHLLFYKM-EIGRKPSLFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVE 521
           EA  +  +M E G KP               T+    +++  CE     +A  L  +++ 
Sbjct: 469 EALRVKREMVEKGIKPD--------------TITYSSLIQGFCEQRRTKEACDLYEEMLR 528

Query: 522 SGVFPDIRTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDED 581
            G+ PD  TY  LIN +C   +++ A +L  +M  KG LPD +TY  LI+GL++  R  +
Sbjct: 529 VGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTRE 588

Query: 582 ALGIFEQMVKNGCKPESSVYKSIMTWSCRRKKVSLAFSVWMKYLRNF--RGWKDEKVKVV 641
           A  +  ++      P    Y +++  +C     ++ F   +  ++ F  +G   E  +V 
Sbjct: 589 AKRLLLKLFYEESVPSDVTYHTLIE-NCS----NIEFKSVVSLIKGFCMKGMMTEADQVF 648

Query: 642 EESFDKGDLEKAISRIIEMDLNSKDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRI 701
           E    K               N K  D   Y I + G C+AG + +A+ ++  +     +
Sbjct: 649 ESMLGK---------------NHKP-DGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFL 708

Query: 702 ISSASCVMLIGGLCVEGKLDLAVEVFLYTLETGTMLMPRICNQLLRHLHLEDRKDHAFVL 761
           + + + + L+  L  EGK++    V ++ L +  +        L+   H E   D    +
Sbjct: 709 LHTVTVIALVKALHKEGKVNELNSVIVHVLRSCELSEAEQAKVLVEINHREGNMDVVLDV 741

Query: 762 IRRMEAFGY 767
           +  M   G+
Sbjct: 769 LAEMAKDGF 741

BLAST of CmaCh02G006950 vs. TAIR10
Match: AT1G62930.1 (AT1G62930.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 293.9 bits (751), Expect = 3.1e-79
Identity = 167/515 (32.43%), Postives = 278/515 (53.98%), Query Frame = 1

Query: 95  SQDLIIDRLVKDNAFELYWKTLQELKDSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMM 154
           S+++++D L  D+A +L+ + +Q    S    S   F+ L+ A +K    +  +     M
Sbjct: 52  SRNVLLD-LKLDDAVDLFGEMVQ----SRPLPSIVEFNKLLSAIAKMNKFDLVISLGERM 111

Query: 155 KDFECKPNIFAYNLILHVLVRREAFLLALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKT 214
           ++     ++++YN++++   RR    LALAV  +M+K    P++VT S L++G+C   + 
Sbjct: 112 QNLRISYDLYSYNILINCFCRRSQLPLALAVLGKMMKLGYEPDIVTLSSLLNGYCHGKRI 171

Query: 215 QEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVL 274
            EA+ L D+M   +  PN +T++ ++ GL    K  +A  L  +M A GC PD+ TY  +
Sbjct: 172 SEAVALVDQMFVMEYQPNTVTFNTLIHGLFLHNKASEAVALIDRMVARGCQPDLFTYGTV 231

Query: 275 LNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNV 334
           +NG CK G  D A +LL+  EK      V  Y+ +ID L   +  ++A   + +   K +
Sbjct: 232 VNGLCKRGDIDLALSLLKKMEKGKIEADVVIYTTIIDALCNYKNVNDALNLFTEMDNKGI 291

Query: 335 EPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQS 394
            P+V+ Y  +I+ LC  GR ++A  LL +M ER  +P+   ++A+I  F   G L +A+ 
Sbjct: 292 RPNVVTYNSLIRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEK 351

Query: 395 LRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFC 454
           L  E+      PD  TYS LI G C +  +DEA+H+F  M    C P+VVT+N+LI GFC
Sbjct: 352 LYDEMIKRSIDPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFC 411

Query: 455 KAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLL 514
           KA +++E   LF +M          R L G      TV    +++ L ++G    A K+ 
Sbjct: 412 KAKRVEEGMELFREMS--------QRGLVG-----NTVTYNTLIQGLFQAGDCDMAQKIF 471

Query: 515 MQLVESGVFPDIRTYNILINGFCKTNNIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRV 574
            ++V  GV PDI TY+IL++G CK   ++ A  +F+ +Q     PD  TY  +I+G+ + 
Sbjct: 472 KKMVSDGVPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEPDIYTYNIMIEGMCKA 531

Query: 575 GRDEDALGIFEQMVKNGCKPESSVYKSIMTWSCRR 610
           G+ ED   +F  +   G KP   +Y ++++  CR+
Sbjct: 532 GKVEDGWDLFCSLSLKGVKPNVIIYTTMISGFCRK 548

BLAST of CmaCh02G006950 vs. TAIR10
Match: AT1G62670.1 (AT1G62670.1 rna processing factor 2)

HSP 1 Score: 290.0 bits (741), Expect = 4.4e-78
Identity = 160/479 (33.40%), Postives = 259/479 (54.07%), Query Frame = 1

Query: 131 FSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYNQML 190
           FS L+ A +K    +  +     M++     N + Y+++++   RR    LALAV  +M+
Sbjct: 84  FSKLLSAIAKMNKFDVVISLGEQMQNLGIPHNHYTYSILINCFCRRSQLPLALAVLGKMM 143

Query: 191 KCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKID 250
           K    PN+VT S L++G+C + +  EA+ L D+M      PN +T++ ++ GL    K  
Sbjct: 144 KLGYEPNIVTLSSLLNGYCHSKRISEAVALVDQMFVTGYQPNTVTFNTLIHGLFLHNKAS 203

Query: 251 DAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLI 310
           +A  L  +M A GC PD++TY V++NG CK G  D AF LL   E+     GV  Y+ +I
Sbjct: 204 EAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNTII 263

Query: 311 DGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFS 370
           DGL + +  D+A   +++   K + P+V+ Y+ +I  LC  GR ++A  LL +M ER  +
Sbjct: 264 DGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLISCLCNYGRWSDASRLLSDMIERKIN 323

Query: 371 PDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHV 430
           PD   ++A+I  F   G L +A+ L  E+      P   TYS LI G C +  +DEA+ +
Sbjct: 324 PDVFTFSALIDAFVKEGKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCMHDRLDEAKQM 383

Query: 431 FNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANKVLG 490
           F  M    C P VVT+N+LI GFCK  +++E       ME+ R+ S   R L G      
Sbjct: 384 FEFMVSKHCFPDVVTYNTLIKGFCKYKRVEEG------MEVFREMS--QRGLVG-----N 443

Query: 491 TVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDGAFKLFK 550
           TV   ++++ L ++G    A ++  ++V  GV P+I TYN L++G CK   ++ A  +F+
Sbjct: 444 TVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYNTLLDGLCKNGKLEKAMVVFE 503

Query: 551 DMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMTWSCRR 610
            +Q     P   TY  +I+G+ + G+ ED   +F  +   G KP+   Y ++++  CR+
Sbjct: 504 YLQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSLKGVKPDVVAYNTMISGFCRK 549

BLAST of CmaCh02G006950 vs. TAIR10
Match: AT1G63080.1 (AT1G63080.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 286.2 bits (731), Expect = 6.4e-77
Identity = 156/491 (31.77%), Postives = 255/491 (51.93%), Query Frame = 1

Query: 131 FSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYNQML 190
           FS L+ A +K    +  +     M+      N++ YN++++ L RR     ALA+  +M+
Sbjct: 68  FSKLLSAIAKMKKFDLVISFGEKMEILGVSHNLYTYNIMINCLCRRSQLSFALAILGKMM 127

Query: 191 KCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAKKID 250
           K    P++VT + L++GFC  ++  EA+ L D+M +    P+ +T++ ++ GL Q  K  
Sbjct: 128 KLGYGPSIVTLNSLLNGFCHGNRISEAVALVDQMVEMGYQPDTVTFTTLVHGLFQHNKAS 187

Query: 251 DAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYSCLI 310
           +A  L  +M   GC PD++TY  ++NG CK G  D A  LL   EK      V  YS +I
Sbjct: 188 EAVALVERMVVKGCQPDLVTYGAVINGLCKRGEPDLALNLLNKMEKGKIEADVVIYSTVI 247

Query: 311 DGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTERGFS 370
           D L + R  D+A   + +   K + PDV  Y+ +I  LC  GR ++A  LL +M ER  +
Sbjct: 248 DSLCKYRHVDDALNLFTEMDNKGIRPDVFTYSSLISCLCNYGRWSDASRLLSDMLERKIN 307

Query: 371 PDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEAQHV 430
           P+   +N++I  F   G L +A+ L  E+      P+  TY+ LI G C +  +DEAQ +
Sbjct: 308 PNVVTFNSLIDAFAKEGKLIEAEKLFDEMIQRSIDPNIVTYNSLINGFCMHDRLDEAQQI 367

Query: 431 FNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANKVLG 490
           F  M    CLP VVT+N+LI+GFCKA K+ +   LF             R +     V  
Sbjct: 368 FTLMVSKDCLPDVVTYNTLINGFCKAKKVVDGMELF-------------RDMSRRGLVGN 427

Query: 491 TVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDGAFKLFK 550
           TV    ++    ++     A  +  Q+V  GV P+I TYN L++G CK   ++ A  +F+
Sbjct: 428 TVTYTTLIHGFFQASDCDNAQMVFKQMVSDGVHPNIMTYNTLLDGLCKNGKLEKAMVVFE 487

Query: 551 DMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMTWSCRRK 610
            +Q     PD  TY  + +G+ + G+ ED   +F  +   G KP+   Y ++++  C++ 
Sbjct: 488 YLQKSKMEPDIYTYNIMSEGMCKAGKVEDGWDLFCSLSLKGVKPDVIAYNTMISGFCKKG 545

Query: 611 KVSLAFSVWMK 622
               A+++++K
Sbjct: 548 LKEEAYTLFIK 545

BLAST of CmaCh02G006950 vs. NCBI nr
Match: gi|449444522|ref|XP_004140023.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g79540 [Cucumis sativus])

HSP 1 Score: 1179.5 bits (3050), Expect = 0.0e+00
Identity = 577/783 (73.69%), Postives = 667/783 (85.19%), Query Frame = 1

Query: 1   MKRRSTFLRPVVTYLVPKPPWFHLFHTPTDPIATSNEVSTIIETVDPIEDALETIAPHIS 60
           MK R    RP++ ++VPKP  FH +H+ T+PIATS EVSTIIET+DP+ED L+ I+  I 
Sbjct: 1   MKLRPILFRPIIIHVVPKPTLFHSYHSRTNPIATSIEVSTIIETLDPMEDGLKVISSRIR 60

Query: 61  SDVITSVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELK 120
           S  ITSV+QEQP+ RLGFRLFIWSL+  HL C   QDLII +L+K+NAFELYWK LQELK
Sbjct: 61  SYTITSVLQEQPDTRLGFRLFIWSLKSWHLRCRTVQDLIIGKLIKENAFELYWKVLQELK 120

Query: 121 DSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFL 180
           +S+ +ISS+AFSVLIEAYS+AGM+EKAV+SFG+M+DF+CKP++FA+NLILH LVR+EAFL
Sbjct: 121 NSAIKISSEAFSVLIEAYSEAGMDEKAVESFGLMRDFDCKPDLFAFNLILHFLVRKEAFL 180

Query: 181 LALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIIL 240
           LALAVYNQMLKCNLNP+VVTY ILIHG CKT KTQ+ALVLFDEMTDR +LPN+I YSI+L
Sbjct: 181 LALAVYNQMLKCNLNPDVVTYGILIHGLCKTCKTQDALVLFDEMTDRGILPNQIIYSIVL 240

Query: 241 SGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHI 300
           SGLCQAKKI DAQRLF KMRASGC+ D+ITYNVLLNGFCK GY D+AF LL+   KDGHI
Sbjct: 241 SGLCQAKKIFDAQRLFSKMRASGCNRDLITYNVLLNGFCKSGYLDDAFTLLQLLTKDGHI 300

Query: 301 LGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALAL 360
           LGV GY CLI+GLFRARRY+EAHMWYQK  R+N++PDV+LYTIMI+GL QEGRV EAL L
Sbjct: 301 LGVIGYGCLINGLFRARRYEEAHMWYQKMLRENIKPDVMLYTIMIRGLSQEGRVTEALTL 360

Query: 361 LDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCK 420
           L EMTERG  PDT CYNA+I+GFCDMG LD+A+SLRLEIS HDCFP+NHTYSILICGMCK
Sbjct: 361 LGEMTERGLRPDTICYNALIKGFCDMGYLDEAESLRLEISKHDCFPNNHTYSILICGMCK 420

Query: 421 NGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLR 480
           NGLI++AQH+F EMEKLGCLPSVVTFNSLI+G CKA +L+EA LLFY+MEI RKPSLFLR
Sbjct: 421 NGLINKAQHIFKEMEKLGCLPSVVTFNSLINGLCKANRLEEARLLFYQMEIVRKPSLFLR 480

Query: 481 LLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTN 540
           L QG +KV     LQVM+E+LCESG+I KAYKLLMQLV+SGV PDIRTYNILINGFCK  
Sbjct: 481 LSQGTDKVFDIASLQVMMERLCESGMILKAYKLLMQLVDSGVLPDIRTYNILINGFCKFG 540

Query: 541 NIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYK 600
           NI+GAFKLFK+MQLKG +PDS+TYGTLIDGL+R GR+EDAL IFEQMVK GC PESS YK
Sbjct: 541 NINGAFKLFKEMQLKGHMPDSVTYGTLIDGLYRAGRNEDALEIFEQMVKKGCVPESSTYK 600

Query: 601 SIMTWSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNS 660
           +IMTWSCR   +SLA SVWMKYLR+FRGW+DEKV+VV ESFD  +L+ AI R++EMD+ S
Sbjct: 601 TIMTWSCRENNISLALSVWMKYLRDFRGWEDEKVRVVAESFDNEELQTAIRRLLEMDIKS 660

Query: 661 KDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAV 720
           K+FDLAPYTIFLIGL QA R  EAFAIFSVLKDFK  ISSASCVMLIG LC+   LD+A+
Sbjct: 661 KNFDLAPYTIFLIGLVQAKRDCEAFAIFSVLKDFKMNISSASCVMLIGRLCMVENLDMAM 720

Query: 721 EVFLYTLETGTMLMPRICNQLLRHLHLEDRKDHAFVLIRRMEAFGYDMNAYLHHSTKSLL 780
           +VFL+TLE G  LMP ICNQLL +L   DRKD A  L  RMEA GYD+ A+LH+ TK  L
Sbjct: 721 DVFLFTLERGFRLMPPICNQLLCNLLHLDRKDDALFLANRMEASGYDLGAHLHYRTKLHL 780

Query: 781 HDH 784
           HDH
Sbjct: 781 HDH 783

BLAST of CmaCh02G006950 vs. NCBI nr
Match: gi|659112542|ref|XP_008456271.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g79540 [Cucumis melo])

HSP 1 Score: 1139.4 bits (2946), Expect = 0.0e+00
Identity = 570/806 (70.72%), Postives = 665/806 (82.51%), Query Frame = 1

Query: 1   MKRRSTFLRPVVTYLVPKPPWFHLFHTPTDPIATSNEVSTIIETVDPIEDALETIAPHIS 60
           MK R    RP++ ++VPKPP F  +H+ T+PI TS EVSTIIETVDP+ED L+ I+  I+
Sbjct: 1   MKLRPNLFRPIIIHVVPKPPLFQSYHSRTNPIGTSIEVSTIIETVDPMEDGLKVISSRIT 60

Query: 61  SDVITSVIQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELK 120
           S +ITSV+++QPN  LGFRLFIWSL   H    A + LIID+L+KDNAFELYWK LQELK
Sbjct: 61  SYIITSVLRKQPNTLLGFRLFIWSLESSHFRWRALKHLIIDKLIKDNAFELYWKVLQELK 120

Query: 121 DSSTEISSDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFL 180
           +S+ EISSDAFSVLIEAYS+AGMEEKAV+SFG+M+DF+CKPN+FA+NLIL  LVR+EAFL
Sbjct: 121 ESAIEISSDAFSVLIEAYSEAGMEEKAVESFGLMRDFDCKPNLFAFNLILRFLVRKEAFL 180

Query: 181 LALAVYNQMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIIL 240
           LALAVYNQMLKCNLNP+V TY ILIHGFC+T KTQ+ALVLFDEMT R +LPN+I Y+I+L
Sbjct: 181 LALAVYNQMLKCNLNPDVDTYGILIHGFCQTCKTQDALVLFDEMTGRGILPNKIIYTIVL 240

Query: 241 SGLCQAKKIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHI 300
           SGLC+AKKI DAQRLF  M A     D+ TYNVLLNGFCKLGY DEAF LL+   KDGH 
Sbjct: 241 SGLCRAKKILDAQRLFSMMGAR--RRDLRTYNVLLNGFCKLGYLDEAFTLLQQLIKDGHN 300

Query: 301 LGVKGYSCLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALAL 360
           L V GY CLI+GLFRARRY+EAH WY+K  R+N++PDVILYTIMIQGL QEGRV  A+ L
Sbjct: 301 LEVDGYGCLINGLFRARRYEEAHKWYRKMLRENIKPDVILYTIMIQGLSQEGRVTNAVTL 360

Query: 361 LDEMTERGFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCK 420
           L EM ERG  PDT CYNA+I+GFCD+G LDKAQSLRLEISNH CFP NHTYSILICGMCK
Sbjct: 361 LGEMKERGLRPDTICYNALIKGFCDIGYLDKAQSLRLEISNHGCFPTNHTYSILICGMCK 420

Query: 421 NGLIDEAQHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLR 480
           +GLI EAQH+F EMEKLGCLPSVVTFNSLI+G CKA +L+EA LLFY+MEI RKPSLFLR
Sbjct: 421 SGLITEAQHIFKEMEKLGCLPSVVTFNSLINGLCKASRLEEARLLFYQMEIVRKPSLFLR 480

Query: 481 LLQGANKVLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTN 540
           L QG +KVL    LQVM+EQLCESGLI KAYKLLMQLV+SGV PDIRTYNILINGFCK  
Sbjct: 481 LSQGTDKVLDIASLQVMMEQLCESGLILKAYKLLMQLVDSGVLPDIRTYNILINGFCKFE 540

Query: 541 NIDGAFKLFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYK 600
           NI+GAFKLFK+MQ +G +PDS+TYGTLIDGL+RVGR+EDALGIF QM K GC P+SS Y+
Sbjct: 541 NINGAFKLFKEMQTRGHMPDSVTYGTLIDGLYRVGRNEDALGIFRQMEKKGCVPDSSTYR 600

Query: 601 SIMTWSCRRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNS 660
           +IMTW CR K + L  SVWMKYLRNFRGW+DEKV+VVEESFD  +L+ AI R++EMD+ S
Sbjct: 601 TIMTWLCREKNIPLTLSVWMKYLRNFRGWEDEKVRVVEESFDNEELQTAIRRLLEMDVKS 660

Query: 661 KDFDLAPYTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAV 720
           K+FD+APYTIFLIGLC+A RVSEAFAIFSV KDFK  ISSASCV LI GLC   KL+LAV
Sbjct: 661 KNFDVAPYTIFLIGLCKAKRVSEAFAIFSVFKDFKMNISSASCVKLICGLCAVEKLELAV 720

Query: 721 EVFLYTLETGTMLMPRICNQLLRHLHLEDRKDHAFVLIRRMEAFGYDMNAYLHHSTKSLL 780
           +VFL+TLE    +MP ICN+LL HL   DRKD A  L  R+EA GYD+ A+L++ TK LL
Sbjct: 721 DVFLFTLER-FFVMPPICNRLLCHLLDLDRKDDALFLANRLEASGYDLGAHLYYRTKLLL 780

Query: 781 HDHWKSLKAKARHEQWL----TNSQQ 803
           HDH +SL+AKA   +++    T+SQ+
Sbjct: 781 HDHLESLQAKAFMPKYMPLLSTHSQE 803

BLAST of CmaCh02G006950 vs. NCBI nr
Match: gi|645239064|ref|XP_008225970.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g79540 [Prunus mume])

HSP 1 Score: 1010.4 bits (2611), Expect = 1.8e-291
Identity = 492/777 (63.32%), Postives = 606/777 (77.99%), Query Frame = 1

Query: 8   LRPVVTYLVPKPPWFHLFHTPTDPIATSNEVSTIIETVDPIEDALETIAPHISSDVITSV 67
           LRP+ +Y  PKPPW   F+T ++   T+NEV TI+ETV+ +E ALE + P +SS++++ V
Sbjct: 8   LRPI-SYFTPKPPWRRCFNTCSEATVTANEVLTILETVNHMESALEPVVPKLSSEIVSYV 67

Query: 68  IQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELKDSSTEIS 127
           I+EQ N RL FR FIW+ +R  LC   SQ  +ID LV+D+AFELYW+TL++L+D    I 
Sbjct: 68  IREQANPRLVFRFFIWATKRMRLCSRMSQSSVIDMLVRDDAFELYWRTLEQLRDCGLPIG 127

Query: 128 SDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYN 187
           S AF+VLI  Y+K  M EKAV++FG MKDF+C+PN FAYN IL+V+VR+E FLL LAVYN
Sbjct: 128 SAAFAVLINGYAKLDMAEKAVETFGRMKDFDCEPNAFAYNAILYVMVRKELFLLVLAVYN 187

Query: 188 QMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAK 247
           QMLK N  P+  TY ILI+GFCKT KTQ+AL +FDEMT R + PN ITY+I++SGLCQAK
Sbjct: 188 QMLKSNHTPSRNTYGILINGFCKTRKTQDALQMFDEMTQRGIAPNTITYTIVVSGLCQAK 247

Query: 248 KIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYS 307
           +  +A  L   M+ASGC PD+ITYN LL+G+CK G   EA+ALLRSFE+D ++LG+ GY+
Sbjct: 248 RTHEAYTLVEMMKASGCPPDLITYNALLDGYCKSGSIGEAYALLRSFERDDYVLGLNGYT 307

Query: 308 CLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTER 367
           CLI GLF A R+DEAH WY K  +K ++PD++L TI+I+GL   GRV +AL  L+EM ER
Sbjct: 308 CLIHGLFIAGRFDEAHGWYSKMIKKGIKPDIVLCTIIIRGLSDAGRVKDALNFLNEMNER 367

Query: 368 GFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEA 427
           G  PD  CYNAVI+GFCD+GLLD+A+SL L+IS  DCFP+  TY+ILICGMCKNGL+ EA
Sbjct: 368 GLVPDAYCYNAVIKGFCDLGLLDEARSLHLDISKLDCFPNACTYTILICGMCKNGLVGEA 427

Query: 428 QHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANK 487
           Q +FNEMEKLGC+PSVVTFN+LIDG CKA KL+EAHLLFYKMEIGR PSLFLRL QG+N+
Sbjct: 428 QQIFNEMEKLGCVPSVVTFNALIDGLCKASKLEEAHLLFYKMEIGRNPSLFLRLSQGSNR 487

Query: 488 VLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDGAFK 547
           +  +  LQ+ +EQLCESGLI KAYKLL QL +SGV PDI TYNILINGFC+  NI+GAFK
Sbjct: 488 ITDSASLQMKVEQLCESGLILKAYKLLTQLADSGVTPDIITYNILINGFCRAGNINGAFK 547

Query: 548 LFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMTWSC 607
           LFKDMQLKG  PDSITYGTLIDGL RV R+EDA  +F+QMVK+GC P S+VYKS+MTWSC
Sbjct: 548 LFKDMQLKGLSPDSITYGTLIDGLQRVDREEDAFVVFDQMVKHGCMPSSAVYKSLMTWSC 607

Query: 608 RRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNSKDFDLAP 667
           RRKK+SLAFS+W+KYL N    ++EK+K +EE F +G  EKAI  ++EMD+N KDFDL P
Sbjct: 608 RRKKISLAFSLWLKYLSNLPLREEEKIKAIEEDFKEGKTEKAIRGVLEMDVNFKDFDLVP 667

Query: 668 YTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAVEVFLYTL 727
            TI LIGLCQ  RV EA  IFSVL ++K I++  SCV LI GLC EG LD A+ VFLYTL
Sbjct: 668 CTILLIGLCQVRRVHEALRIFSVLDEYKVIVTPPSCVHLINGLCKEGNLDQAIGVFLYTL 727

Query: 728 ETGTMLMPRICNQLLR-HLHLEDRKDHAFVLIRRMEAFGYDMNAYLHHSTKSLLHDH 784
           E G MLMP ICNQLL+  L  +D+KDHA  LI RM +FGYD++ YLH +TK LL  H
Sbjct: 728 EKGFMLMPEICNQLLKCLLRSQDKKDHALDLISRMRSFGYDLDFYLHQTTKFLLECH 783

BLAST of CmaCh02G006950 vs. NCBI nr
Match: gi|225441858|ref|XP_002278530.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g79540 [Vitis vinifera])

HSP 1 Score: 992.3 bits (2564), Expect = 5.1e-286
Identity = 472/779 (60.59%), Postives = 604/779 (77.54%), Query Frame = 1

Query: 12  VTYLVPKPPWFHLFH----TPTDPIATSNEVSTIIETVDPIEDALETIAPHISSDVITSV 71
           V + +PK   F   H    T     A SNEV T++ETV+P+EDALE +AP +SS+++  V
Sbjct: 11  VLHFIPKQSRFRCLHANLFTTAQGAAISNEVLTVMETVNPMEDALEKLAPFLSSEIVNDV 70

Query: 72  IQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELKDSSTEIS 131
           ++EQ    LGFR FIW+ RRR      + +L+ID L KD+ F+ YWK L+ELK+S+ +I 
Sbjct: 71  MREQRRPELGFRFFIWTTRRRSFRSWVTHNLVIDMLAKDDGFDTYWKILEELKNSNIQIP 130

Query: 132 SDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYN 191
              FSVLI AY+K+GM EKAV+SFG MKDF CKP++F YN ILHV+V++E FLLALAVYN
Sbjct: 131 PPTFSVLIAAYAKSGMAEKAVESFGKMKDFGCKPDVFTYNSILHVMVQKEVFLLALAVYN 190

Query: 192 QMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAK 251
           QMLK N NPN  T+ IL++G CK  KT +AL +FDEMT + + PN + Y+IILSGLCQAK
Sbjct: 191 QMLKLNYNPNRATFVILLNGLCKNGKTDDALKMFDEMTQKGIPPNTMIYTIILSGLCQAK 250

Query: 252 KIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYS 311
           + DD  RL   M+ SGC PD IT N LL+GFCKLG  DEAFALL+ FEK+G++LG+KGYS
Sbjct: 251 RTDDVHRLLNTMKVSGCCPDSITCNALLDGFCKLGQIDEAFALLQLFEKEGYVLGIKGYS 310

Query: 312 CLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTER 371
            LIDGLFRA+RYDE   W +K  +  +EPDV+LYTI+I+G C+ G V+ AL +L++MT+R
Sbjct: 311 SLIDGLFRAKRYDEVQEWCRKMFKAGIEPDVVLYTILIRGFCEVGMVDYALNMLNDMTQR 370

Query: 372 GFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEA 431
           G SPDT CYNA+I+GFCD+GLLDKA+SL+LEIS +DCFP + TY+ILICGMC+NGL+DEA
Sbjct: 371 GLSPDTYCYNALIKGFCDVGLLDKARSLQLEISKNDCFPTSCTYTILICGMCRNGLLDEA 430

Query: 432 QHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANK 491
           + +FN+ME LGC PS++TFN+LIDG CKAG+L+EA  LFYKMEIG+ PSLFLRL QGA++
Sbjct: 431 RQIFNQMENLGCSPSIMTFNALIDGLCKAGELEEARHLFYKMEIGKNPSLFLRLSQGADR 490

Query: 492 VLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDGAFK 551
           V+ T  LQ M+E+LCESGLI KAYKLLMQL +SGV PDI TYN+LINGFCK  NI+GAFK
Sbjct: 491 VMDTASLQTMVERLCESGLILKAYKLLMQLADSGVVPDIMTYNVLINGFCKAKNINGAFK 550

Query: 552 LFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMTWSC 611
           LF+++QLKG  PDS+TYGTLIDG HRV R+EDA  + +QMVKNGC P S+VYK +MTWSC
Sbjct: 551 LFRELQLKGHSPDSVTYGTLIDGFHRVDREEDAFRVLDQMVKNGCTPSSAVYKCLMTWSC 610

Query: 612 RRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNSKDFDLAP 671
           R+ K+S+AFS+W+KYLR+    +DE +K+ EE F+KG+LEKA+  ++EM+    +F++AP
Sbjct: 611 RKGKLSVAFSLWLKYLRSLPSQEDETLKLAEEHFEKGELEKAVRCLLEMNFKLNNFEIAP 670

Query: 672 YTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAVEVFLYTL 731
           YTI+LIGLCQA R  EA  IF VLK+ +  ++  SCVMLI GLC +G L++AV++FLYTL
Sbjct: 671 YTIWLIGLCQARRSEEALKIFLVLKECQMDVNPPSCVMLINGLCKDGNLEMAVDIFLYTL 730

Query: 732 ETGTMLMPRICNQLLRHLHLEDRKDHAFVLIRRMEAFGYDMNAYLHHSTKSLLHDHWKS 787
           E G MLMPRICNQLLR L L+D+  HA  L+ RM + GYD++ YLHH  KS L   WK+
Sbjct: 731 EKGFMLMPRICNQLLRSLILQDKMKHALDLLNRMNSAGYDLDEYLHHRIKSYLLSVWKA 789

BLAST of CmaCh02G006950 vs. NCBI nr
Match: gi|694371509|ref|XP_009363299.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g79540 [Pyrus x bretschneideri])

HSP 1 Score: 990.7 bits (2560), Expect = 1.5e-285
Identity = 478/777 (61.52%), Postives = 602/777 (77.48%), Query Frame = 1

Query: 8   LRPVVTYLVPKPPWFHLFHTPTDPIATSNEVSTIIETVDPIEDALETIAPHISSDVITSV 67
           LRP+  Y   KPPW   F T ++   T+NE+ TI+ETV+ +EDALE +AP +SSDV+ SV
Sbjct: 8   LRPIC-YFTLKPPWRRHFSTCSEASVTANELLTILETVNGMEDALEPLAPKLSSDVVRSV 67

Query: 68  IQEQPNARLGFRLFIWSLRRRHLCCSASQDLIIDRLVKDNAFELYWKTLQELKDSSTEIS 127
           I+E+ N +L FR FIW+  R  LC   SQ+ +ID LV+D+AFELYW+TL+++ +    I 
Sbjct: 68  IRERVNPQLAFRFFIWATNRMKLCSRMSQNSVIDMLVRDDAFELYWRTLEQISEYGFPIG 127

Query: 128 SDAFSVLIEAYSKAGMEEKAVQSFGMMKDFECKPNIFAYNLILHVLVRREAFLLALAVYN 187
           SDAF+VLI  Y K    EKAV++F  M+DF CKPN+  YN ILHVLVR+E FLLALAVYN
Sbjct: 128 SDAFAVLINGYDKLDRVEKAVETFARMRDFNCKPNVSTYNSILHVLVRKEVFLLALAVYN 187

Query: 188 QMLKCNLNPNVVTYSILIHGFCKTSKTQEALVLFDEMTDRDVLPNEITYSIILSGLCQAK 247
           QMLK N  P   TY ILI GFCKT +TQ+AL +FDEMT R + PN +TY+I++SGLCQAK
Sbjct: 188 QMLKSNNRPTRNTYGILIDGFCKTMQTQDALQMFDEMTQRGMAPNTVTYTIVVSGLCQAK 247

Query: 248 KIDDAQRLFIKMRASGCSPDVITYNVLLNGFCKLGYFDEAFALLRSFEKDGHILGVKGYS 307
           + D+A RL   M+ SGCSPD+ITY+ LL+G+CK G   +A+ALLRSFE+DG++LG+ GY+
Sbjct: 248 RTDEAHRLVNMMKGSGCSPDLITYHALLDGYCKTGRIGDAYALLRSFERDGYVLGLNGYT 307

Query: 308 CLIDGLFRARRYDEAHMWYQKFSRKNVEPDVILYTIMIQGLCQEGRVNEALALLDEMTER 367
           CLI GLF+ARR+DEAH WY+K  ++ +EPD +L TI+IQGL   GRV++AL+ L EM+E+
Sbjct: 308 CLIQGLFKARRFDEAHGWYRKMIKEGIEPDNVLCTIIIQGLSDAGRVHDALSFLSEMSEK 367

Query: 368 GFSPDTTCYNAVIRGFCDMGLLDKAQSLRLEISNHDCFPDNHTYSILICGMCKNGLIDEA 427
           G  PD  CYNAVI+GFCD+GLLD+A+SL LE+S  DCFP+  TY+ILICGMCKNGL+ EA
Sbjct: 368 GLVPDAYCYNAVIKGFCDLGLLDEARSLHLEVSKQDCFPNACTYTILICGMCKNGLVGEA 427

Query: 428 QHVFNEMEKLGCLPSVVTFNSLIDGFCKAGKLKEAHLLFYKMEIGRKPSLFLRLLQGANK 487
           Q +FNEMEKLGC+P+V TFN+LIDG CKA  L EAHLLFYKMEIGR PSLFLRL QG ++
Sbjct: 428 QQIFNEMEKLGCVPTVATFNALIDGLCKASLLDEAHLLFYKMEIGRNPSLFLRLSQGVDR 487

Query: 488 VLGTVDLQVMLEQLCESGLIHKAYKLLMQLVESGVFPDIRTYNILINGFCKTNNIDGAFK 547
           V  +  LQ  +EQLCESGLI +AYKLLM+L  SGV PDI TYNILINGFCK  NI+GAFK
Sbjct: 488 VTDSTSLQTKVEQLCESGLILQAYKLLMKLANSGVTPDIITYNILINGFCKDGNINGAFK 547

Query: 548 LFKDMQLKGRLPDSITYGTLIDGLHRVGRDEDALGIFEQMVKNGCKPESSVYKSIMTWSC 607
           LFKDMQLKG  PDS+TYGTLIDGL RV R+EDA  +F+QMVKNGC P S+VYK++MTWSC
Sbjct: 548 LFKDMQLKGLSPDSVTYGTLIDGLQRVDREEDAFVVFDQMVKNGCTPSSAVYKALMTWSC 607

Query: 608 RRKKVSLAFSVWMKYLRNFRGWKDEKVKVVEESFDKGDLEKAISRIIEMDLNSKDFDLAP 667
           R++KVSLAFS+W+KYLRN    ++E++K +EE+F +G +EKAI  ++EMD+  K+F+LAP
Sbjct: 608 RKQKVSLAFSLWLKYLRNLPSREEEEIKAIEENFKEGKIEKAIRGLLEMDIKFKEFNLAP 667

Query: 668 YTIFLIGLCQAGRVSEAFAIFSVLKDFKRIISSASCVMLIGGLCVEGKLDLAVEVFLYTL 727
            TI LIG+CQ  RV EA  IFSVL ++K  ++  SCV LI GLC EG LDLA+ VF+YTL
Sbjct: 668 CTILLIGMCQVRRVHEALRIFSVLDEYKVTVTPPSCVHLISGLCKEGNLDLAIGVFIYTL 727

Query: 728 ETGTMLMPRICNQLLR-HLHLEDRKDHAFVLIRRMEAFGYDMNAYLHHSTKSLLHDH 784
           E G MLMP ICN LL+  L  +D+KDHA  L+ RM + GYD+++YL  +TK LL  H
Sbjct: 728 EKGFMLMPEICNTLLKCLLRSQDKKDHALDLVSRMRSLGYDLDSYLQQTTKFLLQCH 783

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP133_ARATH1.6e-22350.06Pentatricopeptide repeat-containing protein At1g79540 OS=Arabidopsis thaliana GN... [more]
PP407_ARATH1.1e-7827.35Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN... [more]
PPR96_ARATH5.4e-7832.43Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidop... [more]
PPR91_ARATH7.8e-7733.40Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop... [more]
PPR98_ARATH1.1e-7531.77Pentatricopeptide repeat-containing protein At1g63080, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0KD52_CUCSA0.0e+0073.69Uncharacterized protein OS=Cucumis sativus GN=Csa_6G134370 PE=4 SV=1[more]
F6HKV9_VITVI3.6e-28660.59Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g08720 PE=4 SV=... [more]
A0A061GXW2_THECC2.1e-27859.75Pentatricopeptide repeat (PPR) superfamily protein, putative OS=Theobroma cacao ... [more]
B9SU41_RICCO5.7e-27658.68Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
V4RKA3_9ROSI4.0e-26957.20Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004347mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G79540.19.1e-22550.06 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G39710.16.2e-8027.35 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G62930.13.1e-7932.43 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G62670.14.4e-7833.40 rna processing factor 2[more]
AT1G63080.16.4e-7731.77 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449444522|ref|XP_004140023.1|0.0e+0073.69PREDICTED: pentatricopeptide repeat-containing protein At1g79540 [Cucumis sativu... [more]
gi|659112542|ref|XP_008456271.1|0.0e+0070.72PREDICTED: pentatricopeptide repeat-containing protein At1g79540 [Cucumis melo][more]
gi|645239064|ref|XP_008225970.1|1.8e-29163.32PREDICTED: pentatricopeptide repeat-containing protein At1g79540 [Prunus mume][more]
gi|225441858|ref|XP_002278530.1|5.1e-28660.59PREDICTED: pentatricopeptide repeat-containing protein At1g79540 [Vitis vinifera... [more]
gi|694371509|ref|XP_009363299.1|1.5e-28561.52PREDICTED: pentatricopeptide repeat-containing protein At1g79540 [Pyrus x bretsc... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh02G006950.1CmaCh02G006950.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 668..692
score: 0.032coord: 306..333
score: 0.017coord: 704..723
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 263..292
score: 1.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 413..455
score: 2.6E-14coord: 524..572
score: 3.2E-17coord: 196..245
score: 1.0E-18coord: 336..385
score: 1.1
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 116..175
score: 6.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 562..594
score: 7.1E-7coord: 234..268
score: 4.3E-8coord: 528..560
score: 9.5E-10coord: 444..470
score: 1.1E-6coord: 306..338
score: 1.9E-6coord: 130..163
score: 4.9E-5coord: 199..233
score: 5.0E-9coord: 409..442
score: 2.8E-9coord: 375..407
score: 0.0016coord: 340..373
score: 6.2E-11coord: 269..299
score: 6.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 267..301
score: 11.794coord: 197..231
score: 12.649coord: 664..698
score: 7.991coord: 407..441
score: 13.471coord: 734..768
score: 5.645coord: 699..733
score: 7.53coord: 372..406
score: 10.271coord: 127..161
score: 10.205coord: 162..196
score: 9.12coord: 560..594
score: 12.934coord: 490..524
score: 8.451coord: 302..336
score: 9.427coord: 595..625
score: 5.327coord: 525..559
score: 12.847coord: 337..371
score: 13.899coord: 232..266
score: 12.978coord: 442..476
score: 10
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 558..587
score: 1.1E-5coord: 165..379
score: 1.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 498..622
score: 1.78E-5coord: 162..343
score: 1.7
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 17..58
score: 2.0E-291coord: 674..740
score: 2.0E-291coord: 103..641
score: 2.0E
NoneNo IPR availablePANTHERPTHR24015:SF768SUBFAMILY NOT NAMEDcoord: 103..641
score: 2.0E-291coord: 674..740
score: 2.0E-291coord: 17..58
score: 2.0E