ClCG03G000810 (gene) Watermelon (Charleston Gray)

NameClCG03G000810
Typegene
OrganismCitrullus lanatus (Watermelon (Charleston Gray))
DescriptionPentatricopeptide repeat-containing family protein
LocationCG_Chr03 : 832177 .. 834378 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGCTCGAAGCTCTCCATTGATTTCTCTCCAGAATTTCCCAACCCCGAACAACAATTTTCCTTTCAGAAACCATCAAATTCTCTCTACAATCGATCAATGTTCATGTCCAAAGCAATTGAAGCAAGTTCACGCTCACATGCTCCGCACCGGCCTCTTCTTCGACCCCTTCTCCGCCAGCAAGCTCTTCACAGCCTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCGACGTGTTCGACCAAATTCCCCAACCAAATCTCTACATTTGGAATACCCTCATTCGAGCTTACGCTTCCAGCTCCGACCCTTTTCAGAGTTTCGTAATATTTCTGGATTTGCTTGATAAATGTGAGGATTTGCCCAATAATTTCACTTTCCCATTTGTCATTAAGGCGGCTTCGGAGCTAAAAGCGTCACGGGTCGGCAGAGCTGTTCATGGAATGGCGATTAAGTTGTCTTTTGGTATGGATCTTTATATTCTTAATTCGCTTGTGCGATTCTATGGGGCATGTGGGAATTTGAATTTGGCTGAGCGATTGTTTGAGGGTATTTCTTCCAAAGATGTGGTGTCTTGGAATTCGATGATTTCGGCTTTTGCTCAGGGAAACTGTCCAGAAGATGCATTGGACTTGTTTTTGAAAATGGAGGGGGAGAATGTGATGCCCAACTCTGTAACAATGGTGGGTGTTTTATCTGCTTGCGCGAAGATGTTTGATTTGGAGTTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAACAAATCAAAGTGGATTTAACTCTGTGTAATGCCATGCTTGACATGTATACGAAGTGTGGAAGCATTGATGATGCACAGAAGCTGTTTGATGAAATGCCTGAAAGAGATGTCTTCTCGTGGACCACCATGCTTGATGGGTATGCGAAAATGGGCGACTTCGATACTGCTCGGCGAGTGTTTGATGCAATGCCTGTGAAAGAAGTTGCTGCTTGGAATGTTCTCATATCTGCTTATGAACAAAATGGTAAGCCTAAGGAGGCTCTAGCCACATTTAATGAGTTGCAGCTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGGGCAATTGATTTGGGTGGGTGGATTCATGTGTACATAAAAAGGGTGGGGATGGATCTAAATTGCCATTTAAGTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTGCTTTAGAGAAAGCTCTTGAGGTGTTCTATTCAGTGGAGGAGAAAGATGTGTATGTTTGGAGTGCCATGATTGCTGGCTTGGGAATGCATGGCCGTGGGAAGGCGGCAATCAATTTATTCTTCAAAATGCAGGAAGCTAAGGTGAAACCAAATAGTGTAACGTTTACGAATGTATTATGCGCCTGCAGCCATGCTGGATTAGTTGATGAGGGACGGGCGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTCCCTGGGACAAAGCACTATGCGTGTATGGTCGATATTCTCGGTCGTGCGGGGTTTCTTGAAGAAGCTATGGAGTTGATCAATGAAATGCCTATAACTCCGAGTGCCTCTGTCTGGGGTGCTTTGCTTGGTGGCTGCAGCCTTCATACGAATGTTGAGCTTGCAGAACTGGCTAGTGACCAATTGCTCAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCTAACATATATGCCAAAACAGGAAAATGGGATAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACTCTGAACTGAAAAAGGAACCCGGTTGTAGTTCCATTGAAATCAATGGCAACGTCCACGAGTTTCTAGTAGGCGATAATTCCCACCCGTTATCCAGCGACATCTATTCAAAGTTGGACGAGATTGCAACGAAACTAAAATTAGTTGGATATGAACCAAATAAATCCCATCTTCTCCAGCTCGTCGAAGAAGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGAGAAGTTAGCCATTGCATTCGGGCTTATCAGTTCGCCTTCATCTCAACCAATTCGAGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGAAGTTGCCAAGCTTGTATCTAGAGTTTATGACAGAGATATATTACTCCGAGATCGATATCGATTCCATCATTTTCGAAACGGGCAATGCTCATGTATGGATTACTGGTAA

mRNA sequence

ATGGAAGCTCGAAGCTCTCCATTGATTTCTCTCCAGAATTTCCCAACCCCGAACAACAATTTTCCTTTCAGAAACCATCAAATTCTCTCTACAATCGATCAATGTTCATGTCCAAAGCAATTGAAGCAAGTTCACGCTCACATGCTCCGCACCGGCCTCTTCTTCGACCCCTTCTCCGCCAGCAAGCTCTTCACAGCCTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCGACGTGTTCGACCAAATTCCCCAACCAAATCTCTACATTTGGAATACCCTCATTCGAGCTTACGCTTCCAGCTCCGACCCTTTTCAGAGTTTCGTAATATTTCTGGATTTGCTTGATAAATGTGAGGATTTGCCCAATAATTTCACTTTCCCATTTGTCATTAAGGCGGCTTCGGAGCTAAAAGCGTCACGGGTCGGCAGAGCTGTTCATGGAATGGCGATTAAGTTGTCTTTTGGTATGGATCTTTATATTCTTAATTCGCTTGTGCGATTCTATGGGGCATGTGGGAATTTGAATTTGGCTGAGCGATTGTTTGAGGGTATTTCTTCCAAAGATGTGGTGTCTTGGAATTCGATGATTTCGGCTTTTGCTCAGGGAAACTGTCCAGAAGATGCATTGGACTTGTTTTTGAAAATGGAGGGGGAGAATGTGATGCCCAACTCTGTAACAATGGTGGGTGTTTTATCTGCTTGCGCGAAGATGTTTGATTTGGAGTTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAACAAATCAAAGTGGATTTAACTCTGTGTAATGCCATGCTTGACATGTATACGAAGTGTGGAAGCATTGATGATGCACAGAAGCTGTTTGATGAAATGCCTGAAAGAGATGTCTTCTCGTGGACCACCATGCTTGATGGGTATGCGAAAATGGGCGACTTCGATACTGCTCGGCGAGTGTTTGATGCAATGCCTGTGAAAGAAGTTGCTGCTTGGAATGTTCTCATATCTGCTTATGAACAAAATGGTAAGCCTAAGGAGGCTCTAGCCACATTTAATGAGTTGCAGCTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGGGCAATTGATTTGGGTGGGTGGATTCATGTGTACATAAAAAGGGTGGGGATGGATCTAAATTGCCATTTAAGTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTGCTTTAGAGAAAGCTCTTGAGGTGTTCTATTCAGTGGAGGAGAAAGATGTGTATGTTTGGAGTGCCATGATTGCTGGCTTGGGAATGCATGGCCGTGGGAAGGCGGCAATCAATTTATTCTTCAAAATGCAGGAAGCTAAGGTGAAACCAAATAGTGTAACGTTTACGAATGTATTATGCGCCTGCAGCCATGCTGGATTAGTTGATGAGGGACGGGCGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTCCCTGGGACAAAGCACTATGCGTGTATGGTCGATATTCTCGGTCGTGCGGGGTTTCTTGAAGAAGCTATGGAGTTGATCAATGAAATGCCTATAACTCCGAGTGCCTCTGTCTGGGGTGCTTTGCTTGGTGGCTGCAGCCTTCATACGAATGTTGAGCTTGCAGAACTGGCTAGTGACCAATTGCTCAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCTAACATATATGCCAAAACAGGAAAATGGGATAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACTCTGAACTGAAAAAGGAACCCGGTTGTAGTTCCATTGAAATCAATGGCAACGTCCACGAGTTTCTAGTAGGCGATAATTCCCACCCGTTATCCAGCGACATCTATTCAAAGTTGGACGAGATTGCAACGAAACTAAAATTAGTTGGATATGAACCAAATAAATCCCATCTTCTCCAGCTCGTCGAAGAAGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGAGAAGTTAGCCATTGCATTCGGGCTTATCAGTTCGCCTTCATCTCAACCAATTCGAGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGAAGTTGCCAAGCTTGTATCTAGAGTTTATGACAGAGATATATTACTCCGAGATCGATATCGATTCCATCATTTTCGAAACGGGCAATGCTCATGTATGGATTACTGGTAA

Coding sequence (CDS)

ATGGAAGCTCGAAGCTCTCCATTGATTTCTCTCCAGAATTTCCCAACCCCGAACAACAATTTTCCTTTCAGAAACCATCAAATTCTCTCTACAATCGATCAATGTTCATGTCCAAAGCAATTGAAGCAAGTTCACGCTCACATGCTCCGCACCGGCCTCTTCTTCGACCCCTTCTCCGCCAGCAAGCTCTTCACAGCCTCCGCTCTTTCGTCCTTCTCCACTCTCGACTATGCCCGCGACGTGTTCGACCAAATTCCCCAACCAAATCTCTACATTTGGAATACCCTCATTCGAGCTTACGCTTCCAGCTCCGACCCTTTTCAGAGTTTCGTAATATTTCTGGATTTGCTTGATAAATGTGAGGATTTGCCCAATAATTTCACTTTCCCATTTGTCATTAAGGCGGCTTCGGAGCTAAAAGCGTCACGGGTCGGCAGAGCTGTTCATGGAATGGCGATTAAGTTGTCTTTTGGTATGGATCTTTATATTCTTAATTCGCTTGTGCGATTCTATGGGGCATGTGGGAATTTGAATTTGGCTGAGCGATTGTTTGAGGGTATTTCTTCCAAAGATGTGGTGTCTTGGAATTCGATGATTTCGGCTTTTGCTCAGGGAAACTGTCCAGAAGATGCATTGGACTTGTTTTTGAAAATGGAGGGGGAGAATGTGATGCCCAACTCTGTAACAATGGTGGGTGTTTTATCTGCTTGCGCGAAGATGTTTGATTTGGAGTTTGGGAGGTGGGTTTGTTCGTACATTGAAAGGAAACAAATCAAAGTGGATTTAACTCTGTGTAATGCCATGCTTGACATGTATACGAAGTGTGGAAGCATTGATGATGCACAGAAGCTGTTTGATGAAATGCCTGAAAGAGATGTCTTCTCGTGGACCACCATGCTTGATGGGTATGCGAAAATGGGCGACTTCGATACTGCTCGGCGAGTGTTTGATGCAATGCCTGTGAAAGAAGTTGCTGCTTGGAATGTTCTCATATCTGCTTATGAACAAAATGGTAAGCCTAAGGAGGCTCTAGCCACATTTAATGAGTTGCAGCTCAGTAAGATTGCAAAGCCTGATGAAGTCACTTTAGTTAGTACTCTGTCAGCTTGTGCTCAATTGGGGGCAATTGATTTGGGTGGGTGGATTCATGTGTACATAAAAAGGGTGGGGATGGATCTAAATTGCCATTTAAGTTCTTCTCTTGTGGACATGTATGCTAAATGTGGTGCTTTAGAGAAAGCTCTTGAGGTGTTCTATTCAGTGGAGGAGAAAGATGTGTATGTTTGGAGTGCCATGATTGCTGGCTTGGGAATGCATGGCCGTGGGAAGGCGGCAATCAATTTATTCTTCAAAATGCAGGAAGCTAAGGTGAAACCAAATAGTGTAACGTTTACGAATGTATTATGCGCCTGCAGCCATGCTGGATTAGTTGATGAGGGACGGGCGTTTTTCCATGAAATGGAGCCAGTTTATGGGGTTGTCCCTGGGACAAAGCACTATGCGTGTATGGTCGATATTCTCGGTCGTGCGGGGTTTCTTGAAGAAGCTATGGAGTTGATCAATGAAATGCCTATAACTCCGAGTGCCTCTGTCTGGGGTGCTTTGCTTGGTGGCTGCAGCCTTCATACGAATGTTGAGCTTGCAGAACTGGCTAGTGACCAATTGCTCAAGTTGGAGCCTAGAAATCATGGTGCTATTGTACTTTTATCTAACATATATGCCAAAACAGGAAAATGGGATAAGGTTTCTGAGTTGAGGAAACTAATGAGAGACTCTGAACTGAAAAAGGAACCCGGTTGTAGTTCCATTGAAATCAATGGCAACGTCCACGAGTTTCTAGTAGGCGATAATTCCCACCCGTTATCCAGCGACATCTATTCAAAGTTGGACGAGATTGCAACGAAACTAAAATTAGTTGGATATGAACCAAATAAATCCCATCTTCTCCAGCTCGTCGAAGAAGACGACCTCAAGGAACAGGCCTTAAGCCTTCACAGCGAGAAGTTAGCCATTGCATTCGGGCTTATCAGTTCGCCTTCATCTCAACCAATTCGAGTTGTGAAGAATCTTCGGATTTGTGGAGACTGCCATGAAGTTGCCAAGCTTGTATCTAGAGTTTATGACAGAGATATATTACTCCGAGATCGATATCGATTCCATCATTTTCGAAACGGGCAATGCTCATGTATGGATTACTGGTAA

Protein sequence

MEARSSPLISLQNFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLAERLFEGISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFGRWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKMGDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEINGNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHSEKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQCSCMDYW
BLAST of ClCG03G000810 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 939.9 bits (2428), Expect = 1.7e-272
Identity = 449/727 (61.76%), Postives = 572/727 (78.68%), Query Frame = 1

Query: 7   PLISLQNFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSASKLFTA 66
           P  S  N PT NN    R+  I S I++C   +QLKQ H HM+RTG F DP+SASKLF  
Sbjct: 16  PNFSNPNQPTTNNE---RSRHI-SLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 75

Query: 67  SALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNN 126
           +ALSSF++L+YAR VFD+IP+PN + WNTLIRAYAS  DP  S   FLD++ + +  PN 
Sbjct: 76  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 135

Query: 127 FTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLAERLFEG 186
           +TFPF+IKAA+E+ +  +G+++HGMA+K + G D+++ NSL+  Y +CG+L+ A ++F  
Sbjct: 136 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 195

Query: 187 ISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFG 246
           I  KDVVSWNSMI+ F Q   P+ AL+LF KME E+V  + VTMVGVLSACAK+ +LEFG
Sbjct: 196 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 255

Query: 247 RWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKM 306
           R VCSYIE  ++ V+LTL NAMLDMYTKCGSI+DA++LFD M E+D  +WTTMLDGYA  
Sbjct: 256 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 315

Query: 307 GDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST 366
            D++ AR V ++MP K++ AWN LISAYEQNGKP EAL  F+ELQL K  K +++TLVST
Sbjct: 316 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 375

Query: 367 LSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDV 426
           LSACAQ+GA++LG WIH YIK+ G+ +N H++S+L+ MY+KCG LEK+ EVF SVE++DV
Sbjct: 376 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 435

Query: 427 YVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHE 486
           +VWSAMI GL MHG G  A+++F+KMQEA VKPN VTFTNV CACSH GLVDE  + FH+
Sbjct: 436 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 495

Query: 487 MEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVE 546
           ME  YG+VP  KHYAC+VD+LGR+G+LE+A++ I  MPI PS SVWGALLG C +H N+ 
Sbjct: 496 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 555

Query: 547 LAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEIN 606
           LAE+A  +LL+LEPRN GA VLLSNIYAK GKW+ VSELRK MR + LKKEPGCSSIEI+
Sbjct: 556 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 615

Query: 607 GNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHS 666
           G +HEFL GDN+HP+S  +Y KL E+  KLK  GYEP  S +LQ++EE+++KEQ+L+LHS
Sbjct: 616 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 675

Query: 667 EKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQ 726
           EKLAI +GLIS+ + + IRV+KNLR+CGDCH VAKL+S++YDR+I++RDRYRFHHFRNGQ
Sbjct: 676 EKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQ 735

Query: 727 CSCMDYW 734
           CSC D+W
Sbjct: 736 CSCNDFW 738

BLAST of ClCG03G000810 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 592.0 bits (1525), Expect = 8.7e-168
Identity = 317/774 (40.96%), Postives = 460/774 (59.43%), Query Frame = 1

Query: 8   LISLQNFPTPNNNFPF--------------RNHQILSTIDQCSCPKQLKQVHAHMLRTGL 67
           ++S      P++++PF              RNH  LS +  C   + L+ +HA M++ GL
Sbjct: 2   MLSCSPLTVPSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGL 61

Query: 68  FFDPFSASKLFTASALSS-FSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVI 127
               ++ SKL     LS  F  L YA  VF  I +PNL IWNT+ R +A SSDP  +  +
Sbjct: 62  HNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKL 121

Query: 128 FLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYI--------- 187
           ++ ++     LPN++TFPFV+K+ ++ KA + G+ +HG  +KL   +DLY+         
Sbjct: 122 YVCMIS-LGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYV 181

Query: 188 ----------------------LNSLVRFYGACGNLNLAERLFEGISSKDVVSWNSMISA 247
                                   +L++ Y + G +  A++LF+ I  KDVVSWN+MIS 
Sbjct: 182 QNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISG 241

Query: 248 FAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFGRWVCSYIERKQIKVD 307
           +A+    ++AL+LF  M   NV P+  TMV V+SACA+   +E GR V  +I+      +
Sbjct: 242 YAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSN 301

Query: 308 LTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKMGDFDTARRVFDAMPV 367
           L + NA++D+Y+KCG ++ A  LF+ +P +DV SW T++ GY  M  +            
Sbjct: 302 LKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY------------ 361

Query: 368 KEVAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGW 427
                              KEAL  F E+ L     P++VT++S L ACA LGAID+G W
Sbjct: 362 -------------------KEALLLFQEM-LRSGETPNDVTMLSILPACAHLGAIDIGRW 421

Query: 428 IHVYI-KRV-GMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMH 487
           IHVYI KR+ G+     L +SL+DMYAKCG +E A +VF S+  K +  W+AMI G  MH
Sbjct: 422 IHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMH 481

Query: 488 GRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKH 547
           GR  A+ +LF +M++  ++P+ +TF  +L ACSH+G++D GR  F  M   Y + P  +H
Sbjct: 482 GRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEH 541

Query: 548 YACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVELAELASDQLLKLE 607
           Y CM+D+LG +G  +EA E+IN M + P   +W +LL  C +H NVEL E  ++ L+K+E
Sbjct: 542 YGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIE 601

Query: 608 PRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEINGNVHEFLVGDNSH 667
           P N G+ VLLSNIYA  G+W++V++ R L+ D  +KK PGCSSIEI+  VHEF++GD  H
Sbjct: 602 PENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFH 661

Query: 668 PLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHSEKLAIAFGLISSP 727
           P + +IY  L+E+   L+  G+ P+ S +LQ +EE + KE AL  HSEKLAIAFGLIS+ 
Sbjct: 662 PRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEE-EWKEGALRHHSEKLAIAFGLISTK 721

Query: 728 SSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQCSCMDYW 734
               + +VKNLR+C +CHE  KL+S++Y R+I+ RDR RFHHFR+G CSC DYW
Sbjct: 722 PGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of ClCG03G000810 vs. Swiss-Prot
Match: PP311_ARATH (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 9.6e-159
Identity = 281/713 (39.41%), Postives = 438/713 (61.43%), Query Frame = 1

Query: 28  ILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQ 87
           IL  +  C     +KQ+HAH+LRT +  +    S LF  S  SS   L YA +VF  IP 
Sbjct: 15  ILEKLSFCKSLNHIKQLHAHILRTVI--NHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74

Query: 88  P-NLYIWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGR 147
           P    ++N  +R  + SS+P ++ ++F   +       + F+F  ++KA S++ A   G 
Sbjct: 75  PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134

Query: 148 AVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLAERLFEGISSKDVVSWNSMISAFAQGN 207
            +HG+A K++   D ++    +  Y +CG +N A  +F+ +S +DVV+WN+MI  + +  
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194

Query: 208 CPEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFGRWVCSYIERKQIKVDLTLCN 267
             ++A  LF +M+  NVMP+ + +  ++SAC +  ++ + R +  ++    +++D  L  
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254

Query: 268 AMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKMGDFDTARRVFDAMPVKEVAA 327
           A++ MY   G +D A++ F +M  R++F  T M+ GY+K G  D A+ +FD    K++  
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314

Query: 328 WNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 387
           W  +ISAY ++  P+EAL  F E+  S I KPD V++ S +SACA LG +D   W+H  I
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374

Query: 388 KRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAI 447
              G++    ++++L++MYAKCG L+   +VF  +  ++V  WS+MI  L MHG    A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434

Query: 448 NLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDI 507
           +LF +M++  V+PN VTF  VL  CSH+GLV+EG+  F  M   Y + P  +HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494

Query: 508 LGRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVELAELASDQLLKLEPRNHGAI 567
            GRA  L EA+E+I  MP+  +  +WG+L+  C +H  +EL + A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554

Query: 568 VLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEINGNVHEFLVGDNSHPLSSDIY 627
           VL+SNIYA+  +W+ V  +R++M +  + KE G S I+ NG  HEFL+GD  H  S++IY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614

Query: 628 SKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHSEKLAIAFGLISSPSSQP--- 687
           +KLDE+ +KLKL GY P+   +L  VEE++ K+  L  HSEKLA+ FGL++    +    
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674

Query: 688 ---IRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQCSCMDYW 734
              IR+VKNLR+C DCH   KLVS+VY+R+I++RDR RFH ++NG CSC DYW
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of ClCG03G000810 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 536.6 bits (1381), Expect = 4.3e-151
Identity = 278/706 (39.38%), Postives = 427/706 (60.48%), Query Frame = 1

Query: 30  STIDQCSCPKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQPN 89
           S ID  +   QLKQ+HA +L  GL F  F  +KL  AS  SSF  + +AR VFD +P+P 
Sbjct: 26  SLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHAS--SSFGDITFARQVFDDLPRPQ 85

Query: 90  LYIWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVH 149
           ++ WN +IR Y S ++ FQ  ++    +      P++FTFP ++KA S L   ++GR VH
Sbjct: 86  IFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVH 145

Query: 150 GMAIKLSFGMDLYILNSLVRFYGACGNLNLAERLFEGIS--SKDVVSWNSMISAFAQGNC 209
               +L F  D+++ N L+  Y  C  L  A  +FEG+    + +VSW +++SA+AQ   
Sbjct: 146 AQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGE 205

Query: 210 PEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFGRWVCSYIERKQIKVDLTLCNA 269
           P +AL++F +M   +V P+ V +V VL+A   + DL+ GR + + + +  ++++  L  +
Sbjct: 206 PMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLIS 265

Query: 270 MLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKMGDFDTARRVFDAMPVKEVAAW 329
           +  MY KCG +  A+ LFD+M   ++  W  M+ GYAK                      
Sbjct: 266 LNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK---------------------- 325

Query: 330 NVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 389
                    NG  +EA+  F+E+ ++K  +PD +++ S +SACAQ+G+++    ++ Y+ 
Sbjct: 326 ---------NGYAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVG 385

Query: 390 RVGMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAIN 449
           R     +  +SS+L+DM+AKCG++E A  VF    ++DV VWSAMI G G+HGR + AI+
Sbjct: 386 RSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAIS 445

Query: 450 LFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDIL 509
           L+  M+   V PN VTF  +L AC+H+G+V EG  FF+ M   + + P  +HYAC++D+L
Sbjct: 446 LYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMAD-HKINPQQQHYACVIDLL 505

Query: 510 GRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVELAELASDQLLKLEPRNHGAIV 569
           GRAG L++A E+I  MP+ P  +VWGALL  C  H +VEL E A+ QL  ++P N G  V
Sbjct: 506 GRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYV 565

Query: 570 LLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEINGNVHEFLVGDNSHPLSSDIYS 629
            LSN+YA    WD+V+E+R  M++  L K+ GCS +E+ G +  F VGD SHP   +I  
Sbjct: 566 QLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIER 625

Query: 630 KLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHSEKLAIAFGLISSPSSQPIRVV 689
           +++ I ++LK  G+  NK   L  + +++  E+ L  HSE++AIA+GLIS+P   P+R+ 
Sbjct: 626 QVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHSERIAIAYGLISTPQGTPLRIT 685

Query: 690 KNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQCSCMDYW 734
           KNLR C +CH   KL+S++ DR+I++RD  RFHHF++G CSC DYW
Sbjct: 686 KNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694

BLAST of ClCG03G000810 vs. Swiss-Prot
Match: PP219_ARATH (Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis thaliana GN=PCMP-H84 PE=3 SV=1)

HSP 1 Score: 524.6 bits (1350), Expect = 1.7e-147
Identity = 272/707 (38.47%), Postives = 412/707 (58.27%), Query Frame = 1

Query: 27  QILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIP 86
           QI + I        LKQ+H  ++   L  D F  + L   +    F    Y+  +F    
Sbjct: 15  QIKTLISVACTVNHLKQIHVSLINHHLHHDTFLVNLLLKRTLF--FRQTKYSYLLFSHTQ 74

Query: 87  QPNLYIWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGR 146
            PN++++N+LI  + ++    ++  +FL +      L + FTFP V+KA +   + ++G 
Sbjct: 75  FPNIFLYNSLINGFVNNHLFHETLDLFLSIRKHGLYL-HGFTFPLVLKACTRASSRKLGI 134

Query: 147 AVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLAERLFEGISSKDVVSWNSMISAFAQGN 206
            +H + +K  F  D+  + SL+  Y   G LN A +LF+ I  + VV+W ++ S +    
Sbjct: 135 DLHSLVVKCGFNHDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSVVTWTALFSGYTTSG 194

Query: 207 CPEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFGRWVCSYIERKQIKVDLTLCN 266
              +A+DLF KM    V P+S  +V VLSAC  + DL+ G W+  Y+E  +++ +  +  
Sbjct: 195 RHREAIDLFKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKYMEEMEMQKNSFVRT 254

Query: 267 AMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKMGDFDTARRVFDAMPVKEVAA 326
            ++++Y KCG ++ A+ +FD M E+D+ +W+TM+ GYA                      
Sbjct: 255 TLVNLYAKCGKMEKARSVFDSMVEKDIVTWSTMIQGYAS--------------------- 314

Query: 327 WNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 386
                     N  PKE +  F ++ L +  KPD+ ++V  LS+CA LGA+DLG W    I
Sbjct: 315 ----------NSFPKEGIELFLQM-LQENLKPDQFSIVGFLSSCASLGALDLGEWGISLI 374

Query: 387 KRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAI 446
            R     N  ++++L+DMYAKCGA+ +  EVF  ++EKD+ + +A I+GL  +G  K + 
Sbjct: 375 DRHEFLTNLFMANALIDMYAKCGAMARGFEVFKEMKEKDIVIMNAAISGLAKNGHVKLSF 434

Query: 447 NLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDI 506
            +F + ++  + P+  TF  +LC C HAGL+ +G  FF+ +  VY +    +HY CMVD+
Sbjct: 435 AVFGQTEKLGISPDGSTFLGLLCGCVHAGLIQDGLRFFNAISCVYALKRTVEHYGCMVDL 494

Query: 507 LGRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVELAELASDQLLKLEPRNHGAI 566
            GRAG L++A  LI +MP+ P+A VWGALL GC L  + +LAE    +L+ LEP N G  
Sbjct: 495 WGRAGMLDDAYRLICDMPMRPNAIVWGALLSGCRLVKDTQLAETVLKELIALEPWNAGNY 554

Query: 567 VLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEINGNVHEFLVGDNSHPLSSDIY 626
           V LSNIY+  G+WD+ +E+R +M    +KK PG S IE+ G VHEFL  D SHPLS  IY
Sbjct: 555 VQLSNIYSVGGRWDEAAEVRDMMNKKGMKKIPGYSWIELEGKVHEFLADDKSHPLSDKIY 614

Query: 627 SKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHSEKLAIAFGLISSPSSQPIRV 686
           +KL+++  +++L+G+ P    +   VEE++ KE+ L  HSEKLA+A GLIS+   Q IRV
Sbjct: 615 AKLEDLGNEMRLMGFVPTTEFVFFDVEEEE-KERVLGYHSEKLAVALGLISTDHGQVIRV 674

Query: 687 VKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQCSCMDYW 734
           VKNLR+CGDCHEV KL+S++  R+I++RD  RFH F NG CSC DYW
Sbjct: 675 VKNLRVCGDCHEVMKLISKITRREIVVRDNNRFHCFTNGSCSCNDYW 685

BLAST of ClCG03G000810 vs. TrEMBL
Match: A0A0A0M0R9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G530130 PE=4 SV=1)

HSP 1 Score: 1341.3 bits (3470), Expect = 0.0e+00
Identity = 664/733 (90.59%), Postives = 692/733 (94.41%), Query Frame = 1

Query: 1   MEARSSPLISLQNFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSA 60
           MEA S P ISLQNF T NNN  FRNHQILSTID+CS  KQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKLFTASALSSFSTLDYAR++FDQIPQPNLY WNTLIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLA 180
           EDLPN FTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACG+L++A
Sbjct: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFEGISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLSACAKM 240
           ERLF+GIS KDVVSWNSMISAFAQGNCPEDAL+LFLKME ENVMPNSVTMVGVLSACAK 
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240

Query: 241 FDLEFGRWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTML 300
            DLEFGRWVCSYIERK IKVDLTLCNAMLDMYTKCGS+DDAQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360
           DGYAKMGD+D AR VF+AMPVKE+AAWNVLISAYEQNGKPKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVYIKR G+ LNCHL SSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420

Query: 421 VEEKDVYVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEE+DVYVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPNSVTFTNVLCACSHAGLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480

Query: 481 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGGCS 540
           R FFHEMEPVYGVVP  KHYACMVDILGRAGFLEEAMELINEM  TPSASVWGALLG CS
Sbjct: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540

Query: 541 LHTNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGC 600
           LH NVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTG+W+KVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEINGNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQ 660
           SSIE NGNVHEFLVGDN+HPLSS+IYSKL+EIATKLK VGYEPNKSHLLQL+EEDDLKEQ
Sbjct: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGL++   SQPIRVVKNLRICGDCH  AKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRNGQCSCMDYW 734
           HFR+G CSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of ClCG03G000810 vs. TrEMBL
Match: M5WLC4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001946mg PE=4 SV=1)

HSP 1 Score: 1109.0 bits (2867), Expect = 0.0e+00
Identity = 537/738 (72.76%), Postives = 627/738 (84.96%), Query Frame = 1

Query: 1   MEARSSPLISL-----QNFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFF 60
           M + S+PLISL      + PT + +  F +H  LS IDQC+  KQLKQVHA MLRTG+ F
Sbjct: 1   MASLSTPLISLPRHPNSSSPTFSTDLRFSSHPALSLIDQCTSIKQLKQVHAQMLRTGVLF 60

Query: 61  DPFSASKLFTASALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLD 120
           DP+SASKL TASALSSFS+LDYAR VFDQIPQPN+Y WNTLIRAYASSSDP +S ++FLD
Sbjct: 61  DPYSASKLITASALSSFSSLDYARQVFDQIPQPNVYTWNTLIRAYASSSDPAESILVFLD 120

Query: 121 LLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACG 180
           +LD C + P+ +T+PF IKAASEL+A +VGR  HGMAIK S G D+YILNSLV FYG+CG
Sbjct: 121 MLDHCSECPDKYTYPFAIKAASELRALQVGRGFHGMAIKASLGSDIYILNSLVHFYGSCG 180

Query: 181 NLNLAERLFEGISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLS 240
           +L+LA R+F     KDVVSWNSMI+ FAQGNCP++AL+LF +ME ENV PN VTMV VLS
Sbjct: 181 DLDLARRVFMKTPKKDVVSWNSMITVFAQGNCPQEALELFKEMEAENVKPNDVTMVSVLS 240

Query: 241 ACAKMFDLEFGRWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFS 300
           ACAK  DLEFGRWVCS+I+R +IK +LTL NAMLDMY KCGS+DDA++LFD MPE+D+ S
Sbjct: 241 ACAKKVDLEFGRWVCSHIQRNEIKENLTLNNAMLDMYVKCGSVDDAKRLFDRMPEKDIVS 300

Query: 301 WTTMLDGYAKMGDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKI 360
           WTTMLDGYA++G+++ A RVF AMP +++AAWNVLIS+YEQ+GKPKEALA FNELQ SK 
Sbjct: 301 WTTMLDGYAQLGNYEEAWRVFAAMPSQDIAAWNVLISSYEQSGKPKEALAVFNELQKSKS 360

Query: 361 AKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKAL 420
            KPDEVTLVSTL+ACAQLGAIDLGGWIHVYIK+  M LNCHL++SL+DMYAKCG L+KAL
Sbjct: 361 PKPDEVTLVSTLAACAQLGAIDLGGWIHVYIKKQVMKLNCHLTTSLIDMYAKCGDLDKAL 420

Query: 421 EVFYSVEEKDVYVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAG 480
           EVF SVE +DV+VWSAMIAGL MHG+G+ A+  F KM EAKVKPN+VTFTNVLCACSH G
Sbjct: 421 EVFNSVERRDVFVWSAMIAGLAMHGQGRDALEFFSKMLEAKVKPNAVTFTNVLCACSHTG 480

Query: 481 LVDEGRAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGAL 540
           LVDEGR FF++MEPVYGVVPG KHYACMVDILGR+G L+EA+ELI +MPI P+ASVWGAL
Sbjct: 481 LVDEGRTFFYQMEPVYGVVPGIKHYACMVDILGRSGNLDEAVELIEKMPIPPTASVWGAL 540

Query: 541 LGGCSLHTNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELK 600
           LG C LH NV LAE A   LL+L+PRNHGA VLLSNIYA+TGKWD+VS LRK MRD+ +K
Sbjct: 541 LGACKLHGNVVLAEKACSHLLELDPRNHGAYVLLSNIYAETGKWDEVSGLRKHMRDAGIK 600

Query: 601 KEPGCSSIEINGNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEED 660
           KEPGCSSIE+NG+VHEFLVGDNSHPL  +IYSKLDE+A +LK  GY PNKSHLLQ VEE+
Sbjct: 601 KEPGCSSIEVNGSVHEFLVGDNSHPLCKEIYSKLDEMALRLKSNGYVPNKSHLLQFVEEE 660

Query: 661 DLKEQALSLHSEKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRD 720
           D+K+ AL LHSEKLAIAFGLIS   SQPI+VVKNLR+CGDCH VAKL+S++YDR+ILLRD
Sbjct: 661 DMKDHALILHSEKLAIAFGLISLSPSQPIQVVKNLRVCGDCHSVAKLISKLYDREILLRD 720

Query: 721 RYRFHHFRNGQCSCMDYW 734
           RYRFHHFR+G CSC DYW
Sbjct: 721 RYRFHHFRDGHCSCNDYW 738

BLAST of ClCG03G000810 vs. TrEMBL
Match: W9RP57_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022020 PE=4 SV=1)

HSP 1 Score: 1083.2 bits (2800), Expect = 0.0e+00
Identity = 522/739 (70.64%), Postives = 621/739 (84.03%), Query Frame = 1

Query: 1   MEARSSPLIS------LQNFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLF 60
           M A S P++S      L    T NN+  F N+ +LS I+QC+  K+LKQ+HA MLRTGLF
Sbjct: 1   MAALSVPVLSFPHQRKLPTSSTVNNDLRFPNYPLLSLIEQCTSLKELKQIHAQMLRTGLF 60

Query: 61  FDPFSASKLFTASALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFL 120
           FDPFSASKL T  A+SSFS+LDYA  VFDQIP+PNLY WNT+IRAYASSSDP QS V+FL
Sbjct: 61  FDPFSASKLITVCAMSSFSSLDYAHQVFDQIPKPNLYTWNTIIRAYASSSDPIQSIVVFL 120

Query: 121 DLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGAC 180
            +LD+C + PN +T+PFV+KAASELKASRVGR  HGM +K S   D++ILNSLV FYG+C
Sbjct: 121 RMLDQCCESPNKYTYPFVLKAASELKASRVGRGFHGMVMKSSLASDVFILNSLVHFYGSC 180

Query: 181 GNLNLAERLFEGISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVL 240
            +L+ A R+F  I SKDVVSWNSMI AF +G+CP++A  LF +ME EN+ PN +TMVGVL
Sbjct: 181 DDLDSAYRVFLNIPSKDVVSWNSMIKAFVEGDCPDEAFQLFREMEMENLKPNDITMVGVL 240

Query: 241 SACAKMFDLEFGRWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVF 300
            AC K  D+EFGRW+CSYI+R  I V+LTL NAMLDMY KCGS++DA++LFD+MPERDV 
Sbjct: 241 CACGKKADIEFGRWLCSYIQRNGIAVNLTLNNAMLDMYVKCGSVEDAKELFDKMPERDVV 300

Query: 301 SWTTMLDGYAKMGDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSK 360
           SWTTMLDGY +MG +D A RVF+AMP +++AAWNVLIS+YEQNG PKEAL+ F++LQ+SK
Sbjct: 301 SWTTMLDGYTRMGKYDEALRVFEAMPNQDIAAWNVLISSYEQNGMPKEALSVFHKLQVSK 360

Query: 361 IAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKA 420
            AKPDEVTLVS+LSAC+QLG+ID G WIH+YIKR G+ LNCHL++SL+DMYAKCG LEKA
Sbjct: 361 SAKPDEVTLVSSLSACSQLGSIDPGRWIHIYIKRQGIKLNCHLTTSLIDMYAKCGDLEKA 420

Query: 421 LEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHA 480
           LEVF SVE KDVYVWSAMIAGL MHG G+AAI+LF++M +AKVKPN+VTFTN+LCACSH 
Sbjct: 421 LEVFDSVERKDVYVWSAMIAGLAMHGCGRAAIDLFYEMLKAKVKPNAVTFTNILCACSHT 480

Query: 481 GLVDEGRAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGA 540
           GL++EG + F++MEPVY VVPG KHYACMVD+LGR+G L++++E I +MPI P+AS+WGA
Sbjct: 481 GLLEEGTSLFYQMEPVYKVVPGVKHYACMVDMLGRSGRLKDSLEFIEKMPIPPTASIWGA 540

Query: 541 LLGGCSLHTNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSEL 600
           LLG C LH NVELAE A  QLL+L+PRNHGA VLLSNIYA+T KWD+VS LRK MRDS +
Sbjct: 541 LLGACRLHGNVELAEHACGQLLELDPRNHGAYVLLSNIYARTDKWDRVSRLRKAMRDSGI 600

Query: 601 KKEPGCSSIEINGNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEE 660
           KKEPGCSSIEING VHEFLVGDNSHPL  DIY KLDEIA  LK +GY PNKSHLLQLVEE
Sbjct: 601 KKEPGCSSIEINGIVHEFLVGDNSHPLCKDIYEKLDEIAATLKAIGYVPNKSHLLQLVEE 660

Query: 661 DDLKEQALSLHSEKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLR 720
           +D+KEQAL+LHSEKLAIAFGLIS+  SQPIRVVKNLR+CGDCH VAKLVS+VY R+ILLR
Sbjct: 661 EDMKEQALNLHSEKLAIAFGLISTAPSQPIRVVKNLRVCGDCHAVAKLVSKVYKREILLR 720

Query: 721 DRYRFHHFRNGQCSCMDYW 734
           DRYRFHHF++G CSC +YW
Sbjct: 721 DRYRFHHFKDGHCSCGEYW 739

BLAST of ClCG03G000810 vs. TrEMBL
Match: F6GUS6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g06420 PE=4 SV=1)

HSP 1 Score: 1081.2 bits (2795), Expect = 0.0e+00
Identity = 524/721 (72.68%), Postives = 605/721 (83.91%), Query Frame = 1

Query: 13  NFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSF 72
           N  T NN+  F NH  LS IDQCS  KQLKQ+HA MLRTGLFFDPFSAS+L TA+ALS F
Sbjct: 23  NSITLNNDRYFANHPTLSLIDQCSETKQLKQIHAQMLRTGLFFDPFSASRLITAAALSPF 82

Query: 73  STLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFV 132
            +LDYA+ VFDQIP PNLY WNTLIRAYASSS+P QS +IFL +L +  D P+ FTFPF+
Sbjct: 83  PSLDYAQQVFDQIPHPNLYTWNTLIRAYASSSNPHQSLLIFLRMLHQSPDFPDKFTFPFL 142

Query: 133 IKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLAERLFEGISSKDV 192
           IKAASEL+    G+A HGM IK+  G D++ILNSL+ FY  CG L L  R+F  I  +DV
Sbjct: 143 IKAASELEELFTGKAFHGMVIKVLLGSDVFILNSLIHFYAKCGELGLGYRVFVNIPRRDV 202

Query: 193 VSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFGRWVCSY 252
           VSWNSMI+AF QG CPE+AL+LF +ME +NV PN +TMVGVLSACAK  D EFGRWV SY
Sbjct: 203 VSWNSMITAFVQGGCPEEALELFQEMETQNVKPNGITMVGVLSACAKKSDFEFGRWVHSY 262

Query: 253 IERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKMGDFDTA 312
           IER +I   LTL NAMLDMYTKCGS++DA++LFD+MPE+D+ SWTTML GYAK+G++D A
Sbjct: 263 IERNRIGESLTLSNAMLDMYTKCGSVEDAKRLFDKMPEKDIVSWTTMLVGYAKIGEYDAA 322

Query: 313 RRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQ 372
           + +FDAMP +++AAWN LISAYEQ GKPKEAL  F+ELQLSK AKPDEVTLVSTLSACAQ
Sbjct: 323 QGIFDAMPNQDIAAWNALISAYEQCGKPKEALELFHELQLSKTAKPDEVTLVSTLSACAQ 382

Query: 373 LGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAM 432
           LGA+DLGGWIHVYIK+ GM LNCHL++SL+DMY KCG L+KAL VF+SVE KDV+VWSAM
Sbjct: 383 LGAMDLGGWIHVYIKKQGMKLNCHLTTSLIDMYCKCGDLQKALMVFHSVERKDVFVWSAM 442

Query: 433 IAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHEMEPVYG 492
           IAGL MHG GK AI LF KMQE KVKPN+VTFTN+LCACSH GLV+EGR FF++ME VYG
Sbjct: 443 IAGLAMHGHGKDAIALFSKMQEDKVKPNAVTFTNILCACSHVGLVEEGRTFFNQMELVYG 502

Query: 493 VVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVELAELAS 552
           V+PG KHYACMVDILGRAG LEEA+ELI +MP+ P+ASVWGALLG C++H NV LAE A 
Sbjct: 503 VLPGVKHYACMVDILGRAGLLEEAVELIEKMPMAPAASVWGALLGACTIHENVVLAEQAC 562

Query: 553 DQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEINGNVHEF 612
            QL++LEP NHGA VLLSNIYAK GKWD+VS LRKLMRD  LKKEPGCSSIE++G VHEF
Sbjct: 563 SQLIELEPGNHGAYVLLSNIYAKAGKWDRVSGLRKLMRDVGLKKEPGCSSIEVDGIVHEF 622

Query: 613 LVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHSEKLAIA 672
           LVGDNSHP +  IY+KLDEI  +L+ +GY PNKSHLLQLVEE+D+KEQAL LHSEKLAIA
Sbjct: 623 LVGDNSHPSAKKIYAKLDEIVARLETIGYVPNKSHLLQLVEEEDVKEQALFLHSEKLAIA 682

Query: 673 FGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQCSCMDY 732
           FGLIS+  SQPIR+VKNLR+CGDCH VAKLVS++YDR+ILLRDRYRFHHFR G CSCMDY
Sbjct: 683 FGLISTGQSQPIRIVKNLRVCGDCHSVAKLVSKLYDREILLRDRYRFHHFREGHCSCMDY 742

Query: 733 W 734
           W
Sbjct: 743 W 743

BLAST of ClCG03G000810 vs. TrEMBL
Match: B9HP52_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0009s04930g PE=4 SV=1)

HSP 1 Score: 1028.5 bits (2658), Expect = 4.0e-297
Identity = 494/729 (67.76%), Postives = 594/729 (81.48%), Query Frame = 1

Query: 5   SSPLISLQNFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSASKLF 64
           S P+ S     T NN        +   ID+C+  K LKQ+HAHMLRTGLFFDP SA+KLF
Sbjct: 10  SVPISSNPTILTANNEQKSNPSTVPILIDKCANKKHLKQLHAHMLRTGLFFDPPSATKLF 69

Query: 65  TASALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLP 124
           TA ALSS S+LDYA  VFDQIP+PNLY WNTLIRA+ASS  P Q  ++F+ +L + +  P
Sbjct: 70  TACALSSPSSLDYACKVFDQIPRPNLYTWNTLIRAFASSPKPIQGLLVFIQMLHESQRFP 129

Query: 125 NNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLAERLF 184
           N++TFPFVIKAA+E+ +   G+A+HGM +K SFG DL+I NSL+ FY + G+L+ A  +F
Sbjct: 130 NSYTFPFVIKAATEVSSLLAGQAIHGMVMKASFGSDLFISNSLIHFYSSLGDLDSAYLVF 189

Query: 185 EGISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLE 244
             I  KD+VSWNSMIS F QG  PE+AL LF +M+ EN  PN VTMVGVLSACAK  DLE
Sbjct: 190 SKIVEKDIVSWNSMISGFVQGGSPEEALQLFKRMKMENARPNRVTMVGVLSACAKRIDLE 249

Query: 245 FGRWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYA 304
           FGRW C YIER  I ++L L NAMLDMY KCGS++DA++LFD+M E+D+ SWTTM+DGYA
Sbjct: 250 FGRWACDYIERNGIDINLILSNAMLDMYVKCGSLEDARRLFDKMEEKDIVSWTTMIDGYA 309

Query: 305 KMGDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLV 364
           K+GD+D ARRVFD MP +++ AWN LIS+Y+QNGKPKEALA F ELQL+K  KP+EVTL 
Sbjct: 310 KVGDYDAARRVFDVMPREDITAWNALISSYQQNGKPKEALAIFRELQLNKNTKPNEVTLA 369

Query: 365 STLSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEK 424
           STL+ACAQLGA+DLGGWIHVYIK+ G+ LN H+++SL+DMY+KCG LEKALEVFYSVE +
Sbjct: 370 STLAACAQLGAMDLGGWIHVYIKKQGIKLNFHITTSLIDMYSKCGHLEKALEVFYSVERR 429

Query: 425 DVYVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFF 484
           DV+VWSAMIAGL MHG G+AAI+LF KMQE KVKPN+VTFTN+LCACSH+GLVDEGR FF
Sbjct: 430 DVFVWSAMIAGLAMHGHGRAAIDLFSKMQETKVKPNAVTFTNLLCACSHSGLVDEGRLFF 489

Query: 485 HEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTN 544
           ++M PVYGVVPG+KHYACMVDILGRAG LEEA+ELI +MPI PSASVWGALLG C ++ N
Sbjct: 490 NQMRPVYGVVPGSKHYACMVDILGRAGCLEEAVELIEKMPIVPSASVWGALLGACRIYGN 549

Query: 545 VELAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIE 604
           VELAE+A  +LL+ +  NHGA VLLSNIYAK GKWD VS LR+ M+ S L+KEPGCSSIE
Sbjct: 550 VELAEMACSRLLETDSNNHGAYVLLSNIYAKAGKWDCVSRLRQHMKVSGLEKEPGCSSIE 609

Query: 605 INGNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSL 664
           +NG +HEFLVGDNSHPLS++IYSKLDEI  ++K  GY  ++SHLLQ VEE+ +KE AL+L
Sbjct: 610 VNGIIHEFLVGDNSHPLSTEIYSKLDEIVARIKSTGYVSDESHLLQFVEEEYMKEHALNL 669

Query: 665 HSEKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRN 724
           HSEKLAIA+GLI    SQPIR+VKNLR+CGDCH VAKL+S++Y+RDILLRDRYRFHHF  
Sbjct: 670 HSEKLAIAYGLIRMEPSQPIRIVKNLRVCGDCHSVAKLISKLYNRDILLRDRYRFHHFSG 729

Query: 725 GQCSCMDYW 734
           G CSCMDYW
Sbjct: 730 GNCSCMDYW 738

BLAST of ClCG03G000810 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 939.9 bits (2428), Expect = 9.6e-274
Identity = 449/727 (61.76%), Postives = 572/727 (78.68%), Query Frame = 1

Query: 7   PLISLQNFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSASKLFTA 66
           P  S  N PT NN    R+  I S I++C   +QLKQ H HM+RTG F DP+SASKLF  
Sbjct: 16  PNFSNPNQPTTNNE---RSRHI-SLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 75

Query: 67  SALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNN 126
           +ALSSF++L+YAR VFD+IP+PN + WNTLIRAYAS  DP  S   FLD++ + +  PN 
Sbjct: 76  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 135

Query: 127 FTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLAERLFEG 186
           +TFPF+IKAA+E+ +  +G+++HGMA+K + G D+++ NSL+  Y +CG+L+ A ++F  
Sbjct: 136 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 195

Query: 187 ISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFG 246
           I  KDVVSWNSMI+ F Q   P+ AL+LF KME E+V  + VTMVGVLSACAK+ +LEFG
Sbjct: 196 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 255

Query: 247 RWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKM 306
           R VCSYIE  ++ V+LTL NAMLDMYTKCGSI+DA++LFD M E+D  +WTTMLDGYA  
Sbjct: 256 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 315

Query: 307 GDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVST 366
            D++ AR V ++MP K++ AWN LISAYEQNGKP EAL  F+ELQL K  K +++TLVST
Sbjct: 316 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 375

Query: 367 LSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDV 426
           LSACAQ+GA++LG WIH YIK+ G+ +N H++S+L+ MY+KCG LEK+ EVF SVE++DV
Sbjct: 376 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 435

Query: 427 YVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHE 486
           +VWSAMI GL MHG G  A+++F+KMQEA VKPN VTFTNV CACSH GLVDE  + FH+
Sbjct: 436 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 495

Query: 487 MEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVE 546
           ME  YG+VP  KHYAC+VD+LGR+G+LE+A++ I  MPI PS SVWGALLG C +H N+ 
Sbjct: 496 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 555

Query: 547 LAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEIN 606
           LAE+A  +LL+LEPRN GA VLLSNIYAK GKW+ VSELRK MR + LKKEPGCSSIEI+
Sbjct: 556 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 615

Query: 607 GNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHS 666
           G +HEFL GDN+HP+S  +Y KL E+  KLK  GYEP  S +LQ++EE+++KEQ+L+LHS
Sbjct: 616 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 675

Query: 667 EKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQ 726
           EKLAI +GLIS+ + + IRV+KNLR+CGDCH VAKL+S++YDR+I++RDRYRFHHFRNGQ
Sbjct: 676 EKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQ 735

Query: 727 CSCMDYW 734
           CSC D+W
Sbjct: 736 CSCNDFW 738

BLAST of ClCG03G000810 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 592.0 bits (1525), Expect = 4.9e-169
Identity = 317/774 (40.96%), Postives = 460/774 (59.43%), Query Frame = 1

Query: 8   LISLQNFPTPNNNFPF--------------RNHQILSTIDQCSCPKQLKQVHAHMLRTGL 67
           ++S      P++++PF              RNH  LS +  C   + L+ +HA M++ GL
Sbjct: 2   MLSCSPLTVPSSSYPFHFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGL 61

Query: 68  FFDPFSASKLFTASALSS-FSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVI 127
               ++ SKL     LS  F  L YA  VF  I +PNL IWNT+ R +A SSDP  +  +
Sbjct: 62  HNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKL 121

Query: 128 FLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYI--------- 187
           ++ ++     LPN++TFPFV+K+ ++ KA + G+ +HG  +KL   +DLY+         
Sbjct: 122 YVCMIS-LGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYV 181

Query: 188 ----------------------LNSLVRFYGACGNLNLAERLFEGISSKDVVSWNSMISA 247
                                   +L++ Y + G +  A++LF+ I  KDVVSWN+MIS 
Sbjct: 182 QNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISG 241

Query: 248 FAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFGRWVCSYIERKQIKVD 307
           +A+    ++AL+LF  M   NV P+  TMV V+SACA+   +E GR V  +I+      +
Sbjct: 242 YAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSN 301

Query: 308 LTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKMGDFDTARRVFDAMPV 367
           L + NA++D+Y+KCG ++ A  LF+ +P +DV SW T++ GY  M  +            
Sbjct: 302 LKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY------------ 361

Query: 368 KEVAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGW 427
                              KEAL  F E+ L     P++VT++S L ACA LGAID+G W
Sbjct: 362 -------------------KEALLLFQEM-LRSGETPNDVTMLSILPACAHLGAIDIGRW 421

Query: 428 IHVYI-KRV-GMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMH 487
           IHVYI KR+ G+     L +SL+DMYAKCG +E A +VF S+  K +  W+AMI G  MH
Sbjct: 422 IHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMH 481

Query: 488 GRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKH 547
           GR  A+ +LF +M++  ++P+ +TF  +L ACSH+G++D GR  F  M   Y + P  +H
Sbjct: 482 GRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEH 541

Query: 548 YACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVELAELASDQLLKLE 607
           Y CM+D+LG +G  +EA E+IN M + P   +W +LL  C +H NVEL E  ++ L+K+E
Sbjct: 542 YGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIE 601

Query: 608 PRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEINGNVHEFLVGDNSH 667
           P N G+ VLLSNIYA  G+W++V++ R L+ D  +KK PGCSSIEI+  VHEF++GD  H
Sbjct: 602 PENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFH 661

Query: 668 PLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHSEKLAIAFGLISSP 727
           P + +IY  L+E+   L+  G+ P+ S +LQ +EE + KE AL  HSEKLAIAFGLIS+ 
Sbjct: 662 PRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEE-EWKEGALRHHSEKLAIAFGLISTK 721

Query: 728 SSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQCSCMDYW 734
               + +VKNLR+C +CHE  KL+S++Y R+I+ RDR RFHHFR+G CSC DYW
Sbjct: 722 PGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of ClCG03G000810 vs. TAIR10
Match: AT4G14820.1 (AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 562.0 bits (1447), Expect = 5.4e-160
Identity = 281/713 (39.41%), Postives = 438/713 (61.43%), Query Frame = 1

Query: 28  ILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQ 87
           IL  +  C     +KQ+HAH+LRT +  +    S LF  S  SS   L YA +VF  IP 
Sbjct: 15  ILEKLSFCKSLNHIKQLHAHILRTVI--NHKLNSFLFNLSVSSSSINLSYALNVFSSIPS 74

Query: 88  P-NLYIWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGR 147
           P    ++N  +R  + SS+P ++ ++F   +       + F+F  ++KA S++ A   G 
Sbjct: 75  PPESIVFNPFLRDLSRSSEP-RATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGM 134

Query: 148 AVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLAERLFEGISSKDVVSWNSMISAFAQGN 207
            +HG+A K++   D ++    +  Y +CG +N A  +F+ +S +DVV+WN+MI  + +  
Sbjct: 135 ELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFG 194

Query: 208 CPEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFGRWVCSYIERKQIKVDLTLCN 267
             ++A  LF +M+  NVMP+ + +  ++SAC +  ++ + R +  ++    +++D  L  
Sbjct: 195 LVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLT 254

Query: 268 AMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKMGDFDTARRVFDAMPVKEVAA 327
           A++ MY   G +D A++ F +M  R++F  T M+ GY+K G  D A+ +FD    K++  
Sbjct: 255 ALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVC 314

Query: 328 WNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 387
           W  +ISAY ++  P+EAL  F E+  S I KPD V++ S +SACA LG +D   W+H  I
Sbjct: 315 WTTMISAYVESDYPQEALRVFEEMCCSGI-KPDVVSMFSVISACANLGILDKAKWVHSCI 374

Query: 388 KRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAI 447
              G++    ++++L++MYAKCG L+   +VF  +  ++V  WS+MI  L MHG    A+
Sbjct: 375 HVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDAL 434

Query: 448 NLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDI 507
           +LF +M++  V+PN VTF  VL  CSH+GLV+EG+  F  M   Y + P  +HY CMVD+
Sbjct: 435 SLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDL 494

Query: 508 LGRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVELAELASDQLLKLEPRNHGAI 567
            GRA  L EA+E+I  MP+  +  +WG+L+  C +H  +EL + A+ ++L+LEP + GA+
Sbjct: 495 FGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGAL 554

Query: 568 VLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEINGNVHEFLVGDNSHPLSSDIY 627
           VL+SNIYA+  +W+ V  +R++M +  + KE G S I+ NG  HEFL+GD  H  S++IY
Sbjct: 555 VLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIY 614

Query: 628 SKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHSEKLAIAFGLISSPSSQP--- 687
           +KLDE+ +KLKL GY P+   +L  VEE++ K+  L  HSEKLA+ FGL++    +    
Sbjct: 615 AKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVL-WHSEKLALCFGLMNEEKEEEKDS 674

Query: 688 ---IRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQCSCMDYW 734
              IR+VKNLR+C DCH   KLVS+VY+R+I++RDR RFH ++NG CSC DYW
Sbjct: 675 CGVIRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of ClCG03G000810 vs. TAIR10
Match: AT3G12770.1 (AT3G12770.1 mitochondrial editing factor 22)

HSP 1 Score: 536.6 bits (1381), Expect = 2.4e-152
Identity = 278/706 (39.38%), Postives = 427/706 (60.48%), Query Frame = 1

Query: 30  STIDQCSCPKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIPQPN 89
           S ID  +   QLKQ+HA +L  GL F  F  +KL  AS  SSF  + +AR VFD +P+P 
Sbjct: 26  SLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHAS--SSFGDITFARQVFDDLPRPQ 85

Query: 90  LYIWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGRAVH 149
           ++ WN +IR Y S ++ FQ  ++    +      P++FTFP ++KA S L   ++GR VH
Sbjct: 86  IFPWNAIIRGY-SRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVH 145

Query: 150 GMAIKLSFGMDLYILNSLVRFYGACGNLNLAERLFEGIS--SKDVVSWNSMISAFAQGNC 209
               +L F  D+++ N L+  Y  C  L  A  +FEG+    + +VSW +++SA+AQ   
Sbjct: 146 AQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGE 205

Query: 210 PEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFGRWVCSYIERKQIKVDLTLCNA 269
           P +AL++F +M   +V P+ V +V VL+A   + DL+ GR + + + +  ++++  L  +
Sbjct: 206 PMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLIS 265

Query: 270 MLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKMGDFDTARRVFDAMPVKEVAAW 329
           +  MY KCG +  A+ LFD+M   ++  W  M+ GYAK                      
Sbjct: 266 LNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAK---------------------- 325

Query: 330 NVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 389
                    NG  +EA+  F+E+ ++K  +PD +++ S +SACAQ+G+++    ++ Y+ 
Sbjct: 326 ---------NGYAREAIDMFHEM-INKDVRPDTISITSAISACAQVGSLEQARSMYEYVG 385

Query: 390 RVGMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAIN 449
           R     +  +SS+L+DM+AKCG++E A  VF    ++DV VWSAMI G G+HGR + AI+
Sbjct: 386 RSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAIS 445

Query: 450 LFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDIL 509
           L+  M+   V PN VTF  +L AC+H+G+V EG  FF+ M   + + P  +HYAC++D+L
Sbjct: 446 LYRAMERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRMAD-HKINPQQQHYACVIDLL 505

Query: 510 GRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVELAELASDQLLKLEPRNHGAIV 569
           GRAG L++A E+I  MP+ P  +VWGALL  C  H +VEL E A+ QL  ++P N G  V
Sbjct: 506 GRAGHLDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYV 565

Query: 570 LLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEINGNVHEFLVGDNSHPLSSDIYS 629
            LSN+YA    WD+V+E+R  M++  L K+ GCS +E+ G +  F VGD SHP   +I  
Sbjct: 566 QLSNLYAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIER 625

Query: 630 KLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHSEKLAIAFGLISSPSSQPIRVV 689
           +++ I ++LK  G+  NK   L  + +++  E+ L  HSE++AIA+GLIS+P   P+R+ 
Sbjct: 626 QVEWIESRLKEGGFVANKDASLHDLNDEE-AEETLCSHSERIAIAYGLISTPQGTPLRIT 685

Query: 690 KNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQCSCMDYW 734
           KNLR C +CH   KL+S++ DR+I++RD  RFHHF++G CSC DYW
Sbjct: 686 KNLRACVNCHAATKLISKLVDREIVVRDTNRFHHFKDGVCSCGDYW 694

BLAST of ClCG03G000810 vs. TAIR10
Match: AT3G08820.1 (AT3G08820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 524.6 bits (1350), Expect = 9.6e-149
Identity = 272/707 (38.47%), Postives = 412/707 (58.27%), Query Frame = 1

Query: 27  QILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSASKLFTASALSSFSTLDYARDVFDQIP 86
           QI + I        LKQ+H  ++   L  D F  + L   +    F    Y+  +F    
Sbjct: 15  QIKTLISVACTVNHLKQIHVSLINHHLHHDTFLVNLLLKRTLF--FRQTKYSYLLFSHTQ 74

Query: 87  QPNLYIWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNNFTFPFVIKAASELKASRVGR 146
            PN++++N+LI  + ++    ++  +FL +      L + FTFP V+KA +   + ++G 
Sbjct: 75  FPNIFLYNSLINGFVNNHLFHETLDLFLSIRKHGLYL-HGFTFPLVLKACTRASSRKLGI 134

Query: 147 AVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLAERLFEGISSKDVVSWNSMISAFAQGN 206
            +H + +K  F  D+  + SL+  Y   G LN A +LF+ I  + VV+W ++ S +    
Sbjct: 135 DLHSLVVKCGFNHDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSVVTWTALFSGYTTSG 194

Query: 207 CPEDALDLFLKMEGENVMPNSVTMVGVLSACAKMFDLEFGRWVCSYIERKQIKVDLTLCN 266
              +A+DLF KM    V P+S  +V VLSAC  + DL+ G W+  Y+E  +++ +  +  
Sbjct: 195 RHREAIDLFKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKYMEEMEMQKNSFVRT 254

Query: 267 AMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTMLDGYAKMGDFDTARRVFDAMPVKEVAA 326
            ++++Y KCG ++ A+ +FD M E+D+ +W+TM+ GYA                      
Sbjct: 255 TLVNLYAKCGKMEKARSVFDSMVEKDIVTWSTMIQGYAS--------------------- 314

Query: 327 WNVLISAYEQNGKPKEALATFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYI 386
                     N  PKE +  F ++ L +  KPD+ ++V  LS+CA LGA+DLG W    I
Sbjct: 315 ----------NSFPKEGIELFLQM-LQENLKPDQFSIVGFLSSCASLGALDLGEWGISLI 374

Query: 387 KRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYSVEEKDVYVWSAMIAGLGMHGRGKAAI 446
            R     N  ++++L+DMYAKCGA+ +  EVF  ++EKD+ + +A I+GL  +G  K + 
Sbjct: 375 DRHEFLTNLFMANALIDMYAKCGAMARGFEVFKEMKEKDIVIMNAAISGLAKNGHVKLSF 434

Query: 447 NLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEGRAFFHEMEPVYGVVPGTKHYACMVDI 506
            +F + ++  + P+  TF  +LC C HAGL+ +G  FF+ +  VY +    +HY CMVD+
Sbjct: 435 AVFGQTEKLGISPDGSTFLGLLCGCVHAGLIQDGLRFFNAISCVYALKRTVEHYGCMVDL 494

Query: 507 LGRAGFLEEAMELINEMPITPSASVWGALLGGCSLHTNVELAELASDQLLKLEPRNHGAI 566
            GRAG L++A  LI +MP+ P+A VWGALL GC L  + +LAE    +L+ LEP N G  
Sbjct: 495 WGRAGMLDDAYRLICDMPMRPNAIVWGALLSGCRLVKDTQLAETVLKELIALEPWNAGNY 554

Query: 567 VLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGCSSIEINGNVHEFLVGDNSHPLSSDIY 626
           V LSNIY+  G+WD+ +E+R +M    +KK PG S IE+ G VHEFL  D SHPLS  IY
Sbjct: 555 VQLSNIYSVGGRWDEAAEVRDMMNKKGMKKIPGYSWIELEGKVHEFLADDKSHPLSDKIY 614

Query: 627 SKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQALSLHSEKLAIAFGLISSPSSQPIRV 686
           +KL+++  +++L+G+ P    +   VEE++ KE+ L  HSEKLA+A GLIS+   Q IRV
Sbjct: 615 AKLEDLGNEMRLMGFVPTTEFVFFDVEEEE-KERVLGYHSEKLAVALGLISTDHGQVIRV 674

Query: 687 VKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFHHFRNGQCSCMDYW 734
           VKNLR+CGDCHEV KL+S++  R+I++RD  RFH F NG CSC DYW
Sbjct: 675 VKNLRVCGDCHEVMKLISKITRREIVVRDNNRFHCFTNGSCSCNDYW 685

BLAST of ClCG03G000810 vs. NCBI nr
Match: gi|659115085|ref|XP_008457379.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis melo])

HSP 1 Score: 1348.6 bits (3489), Expect = 0.0e+00
Identity = 668/733 (91.13%), Postives = 694/733 (94.68%), Query Frame = 1

Query: 1   MEARSSPLISLQNFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSA 60
           MEA S PLISLQNF T NNN PFRNHQILS ID+CS  KQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPLISLQNFSTLNNNLPFRNHQILSAIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKLFTASALSSFSTLDYAR+VFDQIPQPNLY WN LIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNVFDQIPQPNLYTWNILIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLA 180
           EDLPNNFTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACG+L++A
Sbjct: 121 EDLPNNFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFEGISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLSACAKM 240
           ERLF+GIS KDVVSWNSMISAFAQGNCPEDAL+LFLKME ENVMPNSVTMV VLSACAK 
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVCVLSACAKK 240

Query: 241 FDLEFGRWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTML 300
            DLEFGRWVCSYIERK IK+DLTL NAMLDMYTKCGS+DDAQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKMDLTLRNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360
           DGYAKMGD+D AR VF+AMPVKE+AAWNVLISAYEQNGKPKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARGVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVYIKR G+DLNCHL SSLVDMYAKCGALEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIDLNCHLISSLVDMYAKCGALEKALEVFYS 420

Query: 421 VEEKDVYVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEE+DVYVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPNSVTFTNVLCACSH GLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHTGLVDEG 480

Query: 481 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGGCS 540
           R FFHEMEPVYGVVP TKHYACMVDILGRAGFLEEAMELINEM ITPSASVWGALLG CS
Sbjct: 481 RVFFHEMEPVYGVVPETKHYACMVDILGRAGFLEEAMELINEMSITPSASVWGALLGACS 540

Query: 541 LHTNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGC 600
           LH NVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTG+W+KVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEINGNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQ 660
           SSIE+NGNVHEFLVGDN HPLSS+IYSKLD+IATKLK VGYEPNKSHLLQL+EEDDLKEQ
Sbjct: 601 SSIEVNGNVHEFLVGDNLHPLSSNIYSKLDDIATKLKPVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGL+S   SQPIRVVKNLRICGDCHE AKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVSLAPSQPIRVVKNLRICGDCHEFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRNGQCSCMDYW 734
           HFR+G CSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of ClCG03G000810 vs. NCBI nr
Match: gi|449455158|ref|XP_004145320.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Cucumis sativus])

HSP 1 Score: 1341.3 bits (3470), Expect = 0.0e+00
Identity = 664/733 (90.59%), Postives = 692/733 (94.41%), Query Frame = 1

Query: 1   MEARSSPLISLQNFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSA 60
           MEA S P ISLQNF T NNN  FRNHQILSTID+CS  KQLK+VHA MLRTGLFFDPFSA
Sbjct: 1   MEALSVPSISLQNFSTLNNNLLFRNHQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSA 60

Query: 61  SKLFTASALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKLFTASALSSFSTLDYAR++FDQIPQPNLY WNTLIRAYASSSDPFQSFVIFLDLLDKC
Sbjct: 61  SKLFTASALSSFSTLDYARNLFDQIPQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKC 120

Query: 121 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLA 180
           EDLPN FTFPFVIKAASELKASRVG AVHGMAIKLSFGMDLYILNSLVRFYGACG+L++A
Sbjct: 121 EDLPNKFTFPFVIKAASELKASRVGTAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMA 180

Query: 181 ERLFEGISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLSACAKM 240
           ERLF+GIS KDVVSWNSMISAFAQGNCPEDAL+LFLKME ENVMPNSVTMVGVLSACAK 
Sbjct: 181 ERLFKGISCKDVVSWNSMISAFAQGNCPEDALELFLKMERENVMPNSVTMVGVLSACAKK 240

Query: 241 FDLEFGRWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTML 300
            DLEFGRWVCSYIERK IKVDLTLCNAMLDMYTKCGS+DDAQKLFDEMPERDVFSWT ML
Sbjct: 241 LDLEFGRWVCSYIERKGIKVDLTLCNAMLDMYTKCGSVDDAQKLFDEMPERDVFSWTIML 300

Query: 301 DGYAKMGDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360
           DGYAKMGD+D AR VF+AMPVKE+AAWNVLISAYEQNGKPKEALA FNELQLSKIAKPDE
Sbjct: 301 DGYAKMGDYDAARLVFNAMPVKEIAAWNVLISAYEQNGKPKEALAIFNELQLSKIAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYS 420
           VTLVSTLSACAQLGAIDLGGWIHVYIKR G+ LNCHL SSLVDMYAKCG+LEKALEVFYS
Sbjct: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKREGIVLNCHLISSLVDMYAKCGSLEKALEVFYS 420

Query: 421 VEEKDVYVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VEE+DVYVWSAMIAGLGMHGRGKAAI+LFF+MQEAKVKPNSVTFTNVLCACSHAGLVDEG
Sbjct: 421 VEERDVYVWSAMIAGLGMHGRGKAAIDLFFEMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480

Query: 481 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGGCS 540
           R FFHEMEPVYGVVP  KHYACMVDILGRAGFLEEAMELINEM  TPSASVWGALLG CS
Sbjct: 481 RVFFHEMEPVYGVVPEMKHYACMVDILGRAGFLEEAMELINEMSTTPSASVWGALLGACS 540

Query: 541 LHTNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGC 600
           LH NVEL ELASDQLLKLEPRNHGAIVLLSNIYAKTG+W+KVSELRKLMRD+ELKKEPGC
Sbjct: 541 LHMNVELGELASDQLLKLEPRNHGAIVLLSNIYAKTGRWEKVSELRKLMRDTELKKEPGC 600

Query: 601 SSIEINGNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQ 660
           SSIE NGNVHEFLVGDN+HPLSS+IYSKL+EIATKLK VGYEPNKSHLLQL+EEDDLKEQ
Sbjct: 601 SSIEANGNVHEFLVGDNTHPLSSNIYSKLEEIATKLKSVGYEPNKSHLLQLIEEDDLKEQ 660

Query: 661 ALSLHSEKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFH 720
           ALSLHSEKLAIAFGL++   SQPIRVVKNLRICGDCH  AKLVSRVYDRDILLRDRYRFH
Sbjct: 661 ALSLHSEKLAIAFGLVTLAPSQPIRVVKNLRICGDCHAFAKLVSRVYDRDILLRDRYRFH 720

Query: 721 HFRNGQCSCMDYW 734
           HFR+G CSCMDYW
Sbjct: 721 HFRDGHCSCMDYW 733

BLAST of ClCG03G000810 vs. NCBI nr
Match: gi|595832196|ref|XP_007206426.1| (hypothetical protein PRUPE_ppa001946mg [Prunus persica])

HSP 1 Score: 1109.0 bits (2867), Expect = 0.0e+00
Identity = 537/738 (72.76%), Postives = 627/738 (84.96%), Query Frame = 1

Query: 1   MEARSSPLISL-----QNFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFF 60
           M + S+PLISL      + PT + +  F +H  LS IDQC+  KQLKQVHA MLRTG+ F
Sbjct: 1   MASLSTPLISLPRHPNSSSPTFSTDLRFSSHPALSLIDQCTSIKQLKQVHAQMLRTGVLF 60

Query: 61  DPFSASKLFTASALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLD 120
           DP+SASKL TASALSSFS+LDYAR VFDQIPQPN+Y WNTLIRAYASSSDP +S ++FLD
Sbjct: 61  DPYSASKLITASALSSFSSLDYARQVFDQIPQPNVYTWNTLIRAYASSSDPAESILVFLD 120

Query: 121 LLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACG 180
           +LD C + P+ +T+PF IKAASEL+A +VGR  HGMAIK S G D+YILNSLV FYG+CG
Sbjct: 121 MLDHCSECPDKYTYPFAIKAASELRALQVGRGFHGMAIKASLGSDIYILNSLVHFYGSCG 180

Query: 181 NLNLAERLFEGISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLS 240
           +L+LA R+F     KDVVSWNSMI+ FAQGNCP++AL+LF +ME ENV PN VTMV VLS
Sbjct: 181 DLDLARRVFMKTPKKDVVSWNSMITVFAQGNCPQEALELFKEMEAENVKPNDVTMVSVLS 240

Query: 241 ACAKMFDLEFGRWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFS 300
           ACAK  DLEFGRWVCS+I+R +IK +LTL NAMLDMY KCGS+DDA++LFD MPE+D+ S
Sbjct: 241 ACAKKVDLEFGRWVCSHIQRNEIKENLTLNNAMLDMYVKCGSVDDAKRLFDRMPEKDIVS 300

Query: 301 WTTMLDGYAKMGDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKI 360
           WTTMLDGYA++G+++ A RVF AMP +++AAWNVLIS+YEQ+GKPKEALA FNELQ SK 
Sbjct: 301 WTTMLDGYAQLGNYEEAWRVFAAMPSQDIAAWNVLISSYEQSGKPKEALAVFNELQKSKS 360

Query: 361 AKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKAL 420
            KPDEVTLVSTL+ACAQLGAIDLGGWIHVYIK+  M LNCHL++SL+DMYAKCG L+KAL
Sbjct: 361 PKPDEVTLVSTLAACAQLGAIDLGGWIHVYIKKQVMKLNCHLTTSLIDMYAKCGDLDKAL 420

Query: 421 EVFYSVEEKDVYVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAG 480
           EVF SVE +DV+VWSAMIAGL MHG+G+ A+  F KM EAKVKPN+VTFTNVLCACSH G
Sbjct: 421 EVFNSVERRDVFVWSAMIAGLAMHGQGRDALEFFSKMLEAKVKPNAVTFTNVLCACSHTG 480

Query: 481 LVDEGRAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGAL 540
           LVDEGR FF++MEPVYGVVPG KHYACMVDILGR+G L+EA+ELI +MPI P+ASVWGAL
Sbjct: 481 LVDEGRTFFYQMEPVYGVVPGIKHYACMVDILGRSGNLDEAVELIEKMPIPPTASVWGAL 540

Query: 541 LGGCSLHTNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELK 600
           LG C LH NV LAE A   LL+L+PRNHGA VLLSNIYA+TGKWD+VS LRK MRD+ +K
Sbjct: 541 LGACKLHGNVVLAEKACSHLLELDPRNHGAYVLLSNIYAETGKWDEVSGLRKHMRDAGIK 600

Query: 601 KEPGCSSIEINGNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEED 660
           KEPGCSSIE+NG+VHEFLVGDNSHPL  +IYSKLDE+A +LK  GY PNKSHLLQ VEE+
Sbjct: 601 KEPGCSSIEVNGSVHEFLVGDNSHPLCKEIYSKLDEMALRLKSNGYVPNKSHLLQFVEEE 660

Query: 661 DLKEQALSLHSEKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRD 720
           D+K+ AL LHSEKLAIAFGLIS   SQPI+VVKNLR+CGDCH VAKL+S++YDR+ILLRD
Sbjct: 661 DMKDHALILHSEKLAIAFGLISLSPSQPIQVVKNLRVCGDCHSVAKLISKLYDREILLRD 720

Query: 721 RYRFHHFRNGQCSCMDYW 734
           RYRFHHFR+G CSC DYW
Sbjct: 721 RYRFHHFRDGHCSCNDYW 738

BLAST of ClCG03G000810 vs. NCBI nr
Match: gi|645216337|ref|XP_008220652.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Prunus mume])

HSP 1 Score: 1100.1 bits (2844), Expect = 0.0e+00
Identity = 531/738 (71.95%), Postives = 624/738 (84.55%), Query Frame = 1

Query: 1   MEARSSPLISLQNFPTPNN-----NFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFF 60
           M + S+PLISL   P  ++     +    +H  LS IDQC+  KQLKQVHA MLRTG  F
Sbjct: 1   MASLSTPLISLPRHPNSSSPSFSTDLRLSSHPALSLIDQCTSIKQLKQVHAQMLRTGALF 60

Query: 61  DPFSASKLFTASALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLD 120
           DP+SASKL TASALSSFS+LDYAR VFDQIPQPN+Y WNTLIRAYASSSDP +S +IFL+
Sbjct: 61  DPYSASKLITASALSSFSSLDYARQVFDQIPQPNVYTWNTLIRAYASSSDPAESILIFLE 120

Query: 121 LLDKCEDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACG 180
           +LD C + P+ +T+PF IKAASEL+A +VGR  HGMAIK S G D+YILNSLV FYG+CG
Sbjct: 121 MLDHCSECPDKYTYPFAIKAASELRALQVGRGFHGMAIKASLGSDIYILNSLVHFYGSCG 180

Query: 181 NLNLAERLFEGISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLS 240
           +L+LA R+F     KDVVSWNSMI+ FAQGNCP++AL+LF +ME ENV PN VTMV VLS
Sbjct: 181 DLDLARRVFVKTPKKDVVSWNSMITVFAQGNCPQEALELFKEMEAENVKPNDVTMVSVLS 240

Query: 241 ACAKMFDLEFGRWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFS 300
           ACAK  DLEFGRWVCS+I+R +IK +LTL NAMLDMY KCGS++DA++LFD MPE+D+ S
Sbjct: 241 ACAKKVDLEFGRWVCSHIQRNEIKENLTLNNAMLDMYVKCGSVEDAKRLFDRMPEKDIVS 300

Query: 301 WTTMLDGYAKMGDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKI 360
           WTTMLDGYA++G+++ A RVF AMP +++AAWNVLIS+YEQ+GKPKEALA FNELQ SK 
Sbjct: 301 WTTMLDGYAQLGNYEEAWRVFAAMPSQDIAAWNVLISSYEQSGKPKEALAVFNELQKSKS 360

Query: 361 AKPDEVTLVSTLSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKAL 420
            KPDEVTLVSTL+ACAQLGAIDLGGWIHVYIK+  M LNCHL++SL+DMYAKCG L+KAL
Sbjct: 361 PKPDEVTLVSTLAACAQLGAIDLGGWIHVYIKKQVMKLNCHLTTSLIDMYAKCGDLDKAL 420

Query: 421 EVFYSVEEKDVYVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAG 480
           EVF SVE +DV+VWSAMIAGL MHG+G+ A+  F KM EAKVKPN+VTFTNVLCACSH G
Sbjct: 421 EVFNSVERRDVFVWSAMIAGLAMHGQGRDALEFFSKMLEAKVKPNAVTFTNVLCACSHTG 480

Query: 481 LVDEGRAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGAL 540
           LVDEGR FF++MEPVYG++PG KHYACMVDILGR+G L+EA+ELI +MPI P+ASVWGAL
Sbjct: 481 LVDEGRTFFYQMEPVYGILPGIKHYACMVDILGRSGNLDEAVELIEKMPIPPTASVWGAL 540

Query: 541 LGGCSLHTNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELK 600
           LG C LH NV LAE A   LL+L+PRNHGA VLLSNIYA+TGKWD+VS LRK MRD+ +K
Sbjct: 541 LGACKLHGNVVLAEKACSHLLELDPRNHGAYVLLSNIYAETGKWDEVSGLRKHMRDAGIK 600

Query: 601 KEPGCSSIEINGNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEED 660
           KEPGCSSIE+NG+VHEFLVGDNSHPL  +IYSKLDE+A +LK  GY PNKSHLLQ VEE+
Sbjct: 601 KEPGCSSIEVNGSVHEFLVGDNSHPLCKEIYSKLDEMALRLKSNGYVPNKSHLLQFVEEE 660

Query: 661 DLKEQALSLHSEKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRD 720
           D+K+ AL LHSEKLAIAFGLIS   SQPI+VVKNLR+CGDCH VAK++S++YDR+ILLRD
Sbjct: 661 DMKDHALILHSEKLAIAFGLISLSPSQPIQVVKNLRVCGDCHSVAKIISKLYDREILLRD 720

Query: 721 RYRFHHFRNGQCSCMDYW 734
           RYRFHHFR+G CSC DYW
Sbjct: 721 RYRFHHFRDGHCSCNDYW 738

BLAST of ClCG03G000810 vs. NCBI nr
Match: gi|1009114528|ref|XP_015873737.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 1087.0 bits (2810), Expect = 0.0e+00
Identity = 525/733 (71.62%), Postives = 616/733 (84.04%), Query Frame = 1

Query: 1   MEARSSPLISLQNFPTPNNNFPFRNHQILSTIDQCSCPKQLKQVHAHMLRTGLFFDPFSA 60
           M   S+P +SL   P  N      N+ +LS IDQC+  KQL+QVHA MLRTGLFFDP+S+
Sbjct: 1   MATLSTPPVSLPRHP--NTTTTTVNNDLLSLIDQCTNTKQLQQVHARMLRTGLFFDPYSS 60

Query: 61  SKLFTASALSSFSTLDYARDVFDQIPQPNLYIWNTLIRAYASSSDPFQSFVIFLDLLDKC 120
           SKL TASALSSFS+LDYAR VFDQIPQPNLY WNTLIR YASSS+P QS V+FL++L + 
Sbjct: 61  SKLITASALSSFSSLDYARRVFDQIPQPNLYTWNTLIRGYASSSEPSQSIVVFLEMLYRG 120

Query: 121 EDLPNNFTFPFVIKAASELKASRVGRAVHGMAIKLSFGMDLYILNSLVRFYGACGNLNLA 180
            +LPN FT+PFVIKAASELK   +G   HGM IK S G D++ILNSL+ FYG+CG+L+LA
Sbjct: 121 CELPNKFTYPFVIKAASELKDLLIGTGFHGMVIKSSLGSDVFILNSLIHFYGSCGDLDLA 180

Query: 181 ERLFEGISSKDVVSWNSMISAFAQGNCPEDALDLFLKMEGENVMPNSVTMVGVLSACAKM 240
            R+F  I  KD+VSWNSMI+AF QGN P    ++F +ME ENV PN +TM+GVLSACAK 
Sbjct: 181 YRVFLNIPKKDLVSWNSMITAFVQGNYPNKVFEMFREMELENVKPNDITMLGVLSACAKK 240

Query: 241 FDLEFGRWVCSYIERKQIKVDLTLCNAMLDMYTKCGSIDDAQKLFDEMPERDVFSWTTML 300
            D+EFGRWVCSYIER +I+V+LTL NAMLDMY KCGSI+DA++LFD M E+DV SWTTML
Sbjct: 241 VDIEFGRWVCSYIERNEIRVNLTLNNAMLDMYVKCGSIEDAKRLFDNMQEKDVVSWTTML 300

Query: 301 DGYAKMGDFDTARRVFDAMPVKEVAAWNVLISAYEQNGKPKEALATFNELQLSKIAKPDE 360
           DGYA++  +D A RVF+AMP +++AAWNVLIS+YEQNGKPKEALATF +LQ SK AKPDE
Sbjct: 301 DGYAQLEKYDEAHRVFEAMPCQDIAAWNVLISSYEQNGKPKEALATFRKLQTSKTAKPDE 360

Query: 361 VTLVSTLSACAQLGAIDLGGWIHVYIKRVGMDLNCHLSSSLVDMYAKCGALEKALEVFYS 420
           VTLVSTLS+CAQLGAIDLG WIHVYIK+ G+ +NCHL++SL+DMYAKCG LE+ALEVF S
Sbjct: 361 VTLVSTLSSCAQLGAIDLGRWIHVYIKKQGIKMNCHLTTSLIDMYAKCGELEEALEVFNS 420

Query: 421 VEEKDVYVWSAMIAGLGMHGRGKAAINLFFKMQEAKVKPNSVTFTNVLCACSHAGLVDEG 480
           VE KDV+ WSAMIAGL MHGRG+AA++LF +M EAKVKPN+VTFTN+LCACSH GLVDEG
Sbjct: 421 VERKDVFNWSAMIAGLAMHGRGRAALDLFSRMLEAKVKPNAVTFTNILCACSHTGLVDEG 480

Query: 481 RAFFHEMEPVYGVVPGTKHYACMVDILGRAGFLEEAMELINEMPITPSASVWGALLGGCS 540
           R  FH+MEP+YGVVPG KHY CMVDILGR+G L+EAM+ I EMPI P+ASVWGALL  C 
Sbjct: 481 RNLFHQMEPIYGVVPGLKHYTCMVDILGRSGHLKEAMQFIEEMPIAPNASVWGALLASCR 540

Query: 541 LHTNVELAELASDQLLKLEPRNHGAIVLLSNIYAKTGKWDKVSELRKLMRDSELKKEPGC 600
           LH NVELAE A  QLL+L+P+NHGA VLLSNIYAK+GKWD+VS LRK MRDS LKKEPGC
Sbjct: 541 LHGNVELAEKACSQLLELDPKNHGAYVLLSNIYAKSGKWDQVSGLRKFMRDSGLKKEPGC 600

Query: 601 SSIEINGNVHEFLVGDNSHPLSSDIYSKLDEIATKLKLVGYEPNKSHLLQLVEEDDLKEQ 660
           S IE+NG +HEFLVGDNSHPLS +IYSKLDE+AT+LK VGY PN+SHLLQLVEE+ +KE 
Sbjct: 601 SLIEVNGIIHEFLVGDNSHPLSKEIYSKLDEVATRLKSVGYAPNESHLLQLVEEEGMKEH 660

Query: 661 ALSLHSEKLAIAFGLISSPSSQPIRVVKNLRICGDCHEVAKLVSRVYDRDILLRDRYRFH 720
           AL+LHSEKLAIA+GLI    SQPIRVVKNLR+CGDCH  AKL+SR+Y+R+ILLRDRYRFH
Sbjct: 661 ALNLHSEKLAIAYGLIGMGPSQPIRVVKNLRVCGDCHSFAKLISRLYEREILLRDRYRFH 720

Query: 721 HFRNGQCSCMDYW 734
           HF+ G CSCMD+W
Sbjct: 721 HFKEGHCSCMDFW 731

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP175_ARATH1.7e-27261.76Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
PPR21_ARATH8.7e-16840.96Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
PP311_ARATH9.6e-15939.41Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana GN... [more]
PP224_ARATH4.3e-15139.38Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN... [more]
PP219_ARATH1.7e-14738.47Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
A0A0A0M0R9_CUCSA0.0e+0090.59Uncharacterized protein OS=Cucumis sativus GN=Csa_1G530130 PE=4 SV=1[more]
M5WLC4_PRUPE0.0e+0072.76Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001946mg PE=4 SV=1[more]
W9RP57_9ROSA0.0e+0070.64Uncharacterized protein OS=Morus notabilis GN=L484_022020 PE=4 SV=1[more]
F6GUS6_VITVI0.0e+0072.68Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g06420 PE=4 SV=... [more]
B9HP52_POPTR4.0e-29767.76Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP... [more]
Match NameE-valueIdentityDescription
AT2G29760.19.6e-27461.76 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G08070.14.9e-16940.96 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G14820.15.4e-16039.41 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G12770.12.4e-15239.38 mitochondrial editing factor 22[more]
AT3G08820.19.6e-14938.47 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659115085|ref|XP_008457379.1|0.0e+0091.13PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic ... [more]
gi|449455158|ref|XP_004145320.1|0.0e+0090.59PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic ... [more]
gi|595832196|ref|XP_007206426.1|0.0e+0072.76hypothetical protein PRUPE_ppa001946mg [Prunus persica][more]
gi|645216337|ref|XP_008220652.1|0.0e+0071.95PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic ... [more]
gi|1009114528|ref|XP_015873737.1|0.0e+0071.62PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic ... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0008270zinc ion binding
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0031425 chloroplast RNA processing
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
biological_process GO:0006457 protein folding
cellular_component GO:0009507 chloroplast
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0005524 ATP binding
molecular_function GO:0051082 unfolded protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG03G000810.1ClCG03G000810.1mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 500..523
score: 0.0019coord: 326..352
score: 1.6E-5coord: 571..592
score: 0.81coord: 92..117
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 291..319
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 190..239
score: 6.1E-10coord: 424..471
score: 7.7
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 500..523
score: 0.0027coord: 265..293
score: 1.5E-4coord: 326..359
score: 3.6E-4coord: 462..495
score: 3.8E-4coord: 427..460
score: 4.8E-6coord: 294..320
score: 1.7E-6coord: 193..226
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 261..291
score: 10.348coord: 292..326
score: 11.345coord: 226..260
score: 5.546coord: 89..119
score: 7.081coord: 496..526
score: 7.476coord: 160..190
score: 7.421coord: 562..596
score: 6.829coord: 394..424
score: 7.541coord: 359..393
score: 6.04coord: 191..225
score: 11.674coord: 460..495
score: 8.188coord: 327..357
score: 6.84coord: 425..459
score: 10
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 533..584
score: 1.4E-8coord: 264..336
score: 2.3E-6coord: 187..219
score: 2.3E-6coord: 337..468
score: 1.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 511..581
score: 9.97E-7coord: 303..347
score: 9.9
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 3..603
score:
NoneNo IPR availablePANTHERPTHR24015:SF558SUBFAMILY NOT NAMEDcoord: 3..603
score:

The following gene(s) are paralogous to this gene:

None