ClCG05G005100 (gene) Watermelon (Charleston Gray)

Name: ClCG05G005100
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Pentatricopeptide repeat-containing protein
Location: CG_Chr05 : 4936028 .. 4938507 (-)
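
The location field can be split into sequence ID, coordinates and strand for downstream use. The sketch below is only an illustration: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual convention for GFF-derived records), which this page does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into a dict.

    Assumption: START and END are 1-based, inclusive coordinates.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
    m = re.match(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Inclusive span, so add 1 to the coordinate difference.
        "length": end - start + 1,
    }

loc = parse_location("CG_Chr05 : 4936028 .. 4938507 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 2480 bp on the minus strand of CG_Chr05.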
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAATGCTCAGCCACTCAACCTCCGTCCTCCCTCTTCAACTTCACACATACCCCACCAGACCCACCGCTCTCTCCGCCGCTCTCTCCTCCGCCTCCAGCCTCTTCCACCTCAAACAAGTCCACGCTCAAATCCTTCGCTCCAAACTCGAACGCTGTGATTCCAATTCCCTTCTTTTTGAACTTATTCTTTCCTCTTGTGCTCTCTCGCCTAGCCTCGACTATGCCCTCTCTGTGTTTGATCAAATTCCCCAGCCCAAGACCCGTCTCTGCAACAAGCTTCTGCGCCAATTATCACGAGGTTCTGAGCCGGAGTTTACGCTTTTTGTATACGAGAAGATGAGGGCGGAGGGTCTGAGTTTGGATAGGTACTGCTTCCCTCCGCTGTTGAAAGCTGCTTCGAGGAATCTTTCCTTGAGAACGGGGATGGAGATTCATGGGCTCGCGTCGAAGTTGGGATTTGGGTCGGACCCATTTGTGGAGACGGGTTTGGTTAGAATGTACGCAGCCTGTGGACGGATAATGGAAGCTCGGTTGGTGTTTGATAAAATGTCTCACAGGGATGTCGTTGCTTGGAGCATCATGATTGATGGGTATGAAGCTAATGTTGTTGCTCTGCATTTACTAGTGAAGGTTAATGTGGTAAAATTTAGTTGATTTTAGATTTTCATGGAAATTTTGTTTCAGACTTTCAATTTACTATGTTATAACCATGGATATTGTGGCCAATTCTAGTAGTCTTCCAAATGAGAAAGAGTTATAAATTTACTTTGGACGTTGTATTGTTATGGTAGTATCTTTCTATCTTATCATTTTCCAGTTTAAATGCATGGCTGTTTAGGAGACAAATTACTGCGGAACTTATATTTGCCCCTTATCCTCAAATATACAGGTATTGCTTAAGTGGCTTTTATGATCTTGCCTTTCAACTCTTTGAAGAAATGAAGAGAACAGAGTTGGAACCAGATGAGATGATTCTTTCTACAGTTCTTTCTGCATGCGCTCGTGCTGGAAATTTGGATTTTGGAACAAAAATACACGAGTTCATTACTAAGAAGAATATTGTCATGGATCCTCATTTACAAAGTGCTCTCATCACAATGTATGCGAGCTGTGGCTCCATGGACTTGGCTTGGGATTTCTATGAAAAGATTTCCCCCAAGAACATGGTTGTTTCGACTGCCATGGTTTCTGGGCTTGCAAAAGGTGGACAGATTGGAGAAGCTCGCTACGTGTTTGATCAGATGGTAGAGAAGGACTTGATATGTTGGAGCGCAATGATTTCTGGCTATACAGAGAGTGACTGCCCTCAAGAGGCTCTTGTATTATTCAAGAAAATGCAACAGCAGGGAATGAAACCTGATGTAGTCACCATATTGAGTGTTATTTCAGCTTGTGCTCATCTTGGCGCATTAGATCAAGGCAAATGGATACAAACTTATGTTGATAAAAATGGGTTTGGCAAGGCATTATCTATCAATAATGCACTCATTGATATGTATGCCAAATGTGGGAGTCTAGAAGGAGCAAGAAAAGTCTTTGGAAAGATGCCAAAGAAAAATGTAATATCTTGGACAAGTATGATCCATGCTCTTGCAATGCATGGAGATGCTCCTAATGCTTTAAGCTTATTTCATCAAATGAAAGTTGAAAATGTTGAGCCTAATTGGATCACATTTGTAGGGGTGCTTTATGCTTGTAGCCACGGAGGTCTAGTTGAGGAGGGCCGAAGAATATTTCATTCAATGACCGATGAGTATGGCATAAGTCCCAAGCATGAACACTTTGGTTGCATGGTTGACCTCTTTGGCCGTGCAAATCTTCTGAGAGAAGCTCTTGAGGTGATTGAGGCAATGCCATTTGCTCCTAATGCTATTATTTGGGGATCCCTTATGGCTGCTTGTCAGATCCACGGTGAGACTGAGTTAGGAGAATTTGCTGCTAAACAAGTTCTCAAGCTCGAGCCTGATCATGATGGGGCCCTTGTCGTCTTATCAAAC
ATATACGCTAAAGAAAGAAGATGGGAAGACGTTGGGGAAGTTAGAAAACTAATGACCAAGATGGGCGTTTCCAAAGAGAGAGGATGCAGTAGAATTGAATTGAACAATGAGGTCCATGAATTTCAAATGGCAGATAGAAATCACAAGCAAGCAGATCAAATACATCAGAAATTAGATGAGGTAGTTCAAAAGTTGAATCTGGCTGGTTATACGCCACAGACAAATTATGTGCTCGTTGATTTAGACGAAGAGGAAAAGAAGGAATTAGTCCTCTGGCACAGCGAGAAATTGGCACTTTGCTATGCCCTCATGAATGAAGGGCCACGCATTTGCATTATAAAGAACCTTCGAATTTGTGAGGATTGTCATGCTTTTATGAAATTAGCCTCAAAAGTATATGCCAGAGAGATCATCATTAGGGACAGAAGTAGATTTCACCATTACAGAGACGGTTTGTGTTCTTGTAAGGACTACTGGTGA

mRNA sequence

ATGGAAATGCTCAGCCACTCAACCTCCGTCCTCCCTCTTCAACTTCACACATACCCCACCAGACCCACCGCTCTCTCCGCCGCTCTCTCCTCCGCCTCCAGCCTCTTCCACCTCAAACAAGTCCACGCTCAAATCCTTCGCTCCAAACTCGAACGCTGTGATTCCAATTCCCTTCTTTTTGAACTTATTCTTTCCTCTTGTGCTCTCTCGCCTAGCCTCGACTATGCCCTCTCTGTGTTTGATCAAATTCCCCAGCCCAAGACCCGTCTCTGCAACAAGCTTCTGCGCCAATTATCACGAGGTTCTGAGCCGGAGTTTACGCTTTTTGTATACGAGAAGATGAGGGCGGAGGGTCTGAGTTTGGATAGGTACTGCTTCCCTCCGCTGTTGAAAGCTGCTTCGAGGAATCTTTCCTTGAGAACGGGGATGGAGATTCATGGGCTCGCGTCGAAGTTGGGATTTGGGTCGGACCCATTTGTGGAGACGGGTTTGGTTAGAATGTACGCAGCCTGTGGACGGATAATGGAAGCTCGGTTGGTGTTTGATAAAATGTCTCACAGGGATGTCGTTGCTTGGAGCATCATGATTGATGGGTATGAAGCTAATGTTGTTGCTCTGCATTTACTAGTGAAGGTTAATGTGTATCTTTCTATCTTATCATTTTCCAGTTTAAATGCATGGCTGTTTAGGAGACAAATTACTGCGGAACTTATATTTGCCCCTTATCCTCAAATATACAGGTATTGCTTAAGTGGCTTTTATGATCTTGCCTTTCAACTCTTTGAAGAAATGAAGAGAACAGAGTTGGAACCAGATGAGATGATTCTTTCTACAGTTCTTTCTGCATGCGCTCGTGCTGGAAATTTGGATTTTGGAACAAAAATACACGAGTTCATTACTAAGAAGAATATTGTCATGGATCCTCATTTACAAAGTGCTCTCATCACAATGTATGCGAGCTGTGGCTCCATGGACTTGGCTTGGGATTTCTATGAAAAGATTTCCCCCAAGAACATGGTTGTTTCGACTGCCATGGTTTCTGGGCTTGCAAAAGGTGGACAGATTGGAGAAGCTCGCTACGTGTTTGATCAGATGGTAGAGAAGGACTTGATATGTTGGAGCGCAATGATTTCTGGCTATACAGAGAGTGACTGCCCTCAAGAGGCTCTTGTATTATTCAAGAAAATGCAACAGCAGGGAATGAAACCTGATGTAGTCACCATATTGAGTGTTATTTCAGCTTGTGCTCATCTTGGCGCATTAGATCAAGGCAAATGGATACAAACTTATGTTGATAAAAATGGGTTTGGCAAGGCATTATCTATCAATAATGCACTCATTGATATGTATGCCAAATGTGGGAGTCTAGAAGGAGCAAGAAAAGTCTTTGGAAAGATGCCAAAGAAAAATGTAATATCTTGGACAAGTATGATCCATGCTCTTGCAATGCATGGAGATGCTCCTAATGCTTTAAGCTTATTTCATCAAATGAAAGTTGAAAATGTTGAGCCTAATTGGATCACATTTGTAGGGGTGCTTTATGCTTGTAGCCACGGAGGTCTAGTTGAGGAGGGCCGAAGAATATTTCATTCAATGACCGATGAGTATGGCATAAGTCCCAAGCATGAACACTTTGGTTGCATGGTTGACCTCTTTGGCCGTGCAAATCTTCTGAGAGAAGCTCTTGAGGTGATTGAGGCAATGCCATTTGCTCCTAATGCTATTATTTGGGGATCCCTTATGGCTGCTTGTCAGATCCACGGTGAGACTGAGTTAGGAGAATTTGCTGCTAAACAAGTTCTCAAGCTCGAGCCTGATCATGATGGGGCCCTTGTCGTCTTATCAAACATATACGCTAAAGAAAGAAGATGGGAAGACGTTGGGGAAGTTAGAAAACTAATGACCAAGATGGGCGTTTCCAAAGAGAGAGGATGCAGTAGAATTGAATTGAACAATGAGGTCCATGAATTTCAAATGGCAGATAGAAATCACAAGCAAGC
AGATCAAATACATCAGAAATTAGATGAGGTAGTTCAAAAGTTGAATCTGGCTGGTTATACGCCACAGACAAATTATGTGCTCGTTGATTTAGACGAAGAGGAAAAGAAGGAATTAGTCCTCTGGCACAGCGAGAAATTGGCACTTTGCTATGCCCTCATGAATGAAGGGCCACGCATTTGCATTATAAAGAACCTTCGAATTTGTGAGGATTGTCATGCTTTTATGAAATTAGCCTCAAAAGTATATGCCAGAGAGATCATCATTAGGGACAGAAGTAGATTTCACCATTACAGAGACGGTTTGTGTTCTTGTAAGGACTACTGGTGA

Coding sequence (CDS)

ATGGAAATGCTCAGCCACTCAACCTCCGTCCTCCCTCTTCAACTTCACACATACCCCACCAGACCCACCGCTCTCTCCGCCGCTCTCTCCTCCGCCTCCAGCCTCTTCCACCTCAAACAAGTCCACGCTCAAATCCTTCGCTCCAAACTCGAACGCTGTGATTCCAATTCCCTTCTTTTTGAACTTATTCTTTCCTCTTGTGCTCTCTCGCCTAGCCTCGACTATGCCCTCTCTGTGTTTGATCAAATTCCCCAGCCCAAGACCCGTCTCTGCAACAAGCTTCTGCGCCAATTATCACGAGGTTCTGAGCCGGAGTTTACGCTTTTTGTATACGAGAAGATGAGGGCGGAGGGTCTGAGTTTGGATAGGTACTGCTTCCCTCCGCTGTTGAAAGCTGCTTCGAGGAATCTTTCCTTGAGAACGGGGATGGAGATTCATGGGCTCGCGTCGAAGTTGGGATTTGGGTCGGACCCATTTGTGGAGACGGGTTTGGTTAGAATGTACGCAGCCTGTGGACGGATAATGGAAGCTCGGTTGGTGTTTGATAAAATGTCTCACAGGGATGTCGTTGCTTGGAGCATCATGATTGATGGGTATGAAGCTAATGTTGTTGCTCTGCATTTACTAGTGAAGGTTAATGTGTATCTTTCTATCTTATCATTTTCCAGTTTAAATGCATGGCTGTTTAGGAGACAAATTACTGCGGAACTTATATTTGCCCCTTATCCTCAAATATACAGGTATTGCTTAAGTGGCTTTTATGATCTTGCCTTTCAACTCTTTGAAGAAATGAAGAGAACAGAGTTGGAACCAGATGAGATGATTCTTTCTACAGTTCTTTCTGCATGCGCTCGTGCTGGAAATTTGGATTTTGGAACAAAAATACACGAGTTCATTACTAAGAAGAATATTGTCATGGATCCTCATTTACAAAGTGCTCTCATCACAATGTATGCGAGCTGTGGCTCCATGGACTTGGCTTGGGATTTCTATGAAAAGATTTCCCCCAAGAACATGGTTGTTTCGACTGCCATGGTTTCTGGGCTTGCAAAAGGTGGACAGATTGGAGAAGCTCGCTACGTGTTTGATCAGATGGTAGAGAAGGACTTGATATGTTGGAGCGCAATGATTTCTGGCTATACAGAGAGTGACTGCCCTCAAGAGGCTCTTGTATTATTCAAGAAAATGCAACAGCAGGGAATGAAACCTGATGTAGTCACCATATTGAGTGTTATTTCAGCTTGTGCTCATCTTGGCGCATTAGATCAAGGCAAATGGATACAAACTTATGTTGATAAAAATGGGTTTGGCAAGGCATTATCTATCAATAATGCACTCATTGATATGTATGCCAAATGTGGGAGTCTAGAAGGAGCAAGAAAAGTCTTTGGAAAGATGCCAAAGAAAAATGTAATATCTTGGACAAGTATGATCCATGCTCTTGCAATGCATGGAGATGCTCCTAATGCTTTAAGCTTATTTCATCAAATGAAAGTTGAAAATGTTGAGCCTAATTGGATCACATTTGTAGGGGTGCTTTATGCTTGTAGCCACGGAGGTCTAGTTGAGGAGGGCCGAAGAATATTTCATTCAATGACCGATGAGTATGGCATAAGTCCCAAGCATGAACACTTTGGTTGCATGGTTGACCTCTTTGGCCGTGCAAATCTTCTGAGAGAAGCTCTTGAGGTGATTGAGGCAATGCCATTTGCTCCTAATGCTATTATTTGGGGATCCCTTATGGCTGCTTGTCAGATCCACGGTGAGACTGAGTTAGGAGAATTTGCTGCTAAACAAGTTCTCAAGCTCGAGCCTGATCATGATGGGGCCCTTGTCGTCTTATCAAACATATACGCTAAAGAAAGAAGATGGGAAGACGTTGGGGAAGTTAGAAAACTAATGACCAAGATGGGCGTTTCCAAAGAGAGAGGATGCAGTAGAATTGAATTGAACAATGAGGTCCATGAATTTCAAATGGCAGATAGAAATCACAAGCAAGC
AGATCAAATACATCAGAAATTAGATGAGGTAGTTCAAAAGTTGAATCTGGCTGGTTATACGCCACAGACAAATTATGTGCTCGTTGATTTAGACGAAGAGGAAAAGAAGGAATTAGTCCTCTGGCACAGCGAGAAATTGGCACTTTGCTATGCCCTCATGAATGAAGGGCCACGCATTTGCATTATAAAGAACCTTCGAATTTGTGAGGATTGTCATGCTTTTATGAAATTAGCCTCAAAAGTATATGCCAGAGAGATCATCATTAGGGACAGAAGTAGATTTCACCATTACAGAGACGGTTTGTGTTCTTGTAAGGACTACTGGTGA

Protein sequence

MEMLSHSTSVLPLQLHTYPTRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW

BLAST of ClCG05G005100 vs. Swiss-Prot
Match: PP311_ARATH (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 731.1 bits (1886), Expect = 1.3e-209
Identity = 387/766 (50.52%), Positives = 509/766 (66.45%), Query Frame = 1

Query: 20  TRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSPSLDYALSV 79
           T    +   LS   SL H+KQ+HA ILR+ +     NS LF L +SS +++  L YAL+V
Sbjct: 10  TAANTILEKLSFCKSLNHIKQLHAHILRTVINH-KLNSFLFNLSVSSSSIN--LSYALNV 69

Query: 80  FDQIPQPKTRLC-NKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLS 139
           F  IP P   +  N  LR LSR SEP  T+  Y+++R  G  LD++ F P+LKA S+  +
Sbjct: 70  FSSIPSPPESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSA 129

Query: 140 LRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDG 199
           L  GME+HG+A K+    DPFVETG + MYA+CGRI  AR VFD+MSHRDVV W+ MI+ 
Sbjct: 130 LFEGMELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIER 189

Query: 200 YEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYC-LSGFYDLA 259
           Y                L   +F           +  E+I      I   C  +G     
Sbjct: 190 Y------------CRFGLVDEAFKLFEEMKDSNVMPDEMILC---NIVSACGRTGNMRYN 249

Query: 260 FQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITM 319
             ++E +   ++  D  +L+ +++  A AG +D   +    ++ +N+     + +A+++ 
Sbjct: 250 RAIYEFLIENDVRMDTHLLTALVTMYAGAGCMDMAREFFRKMSVRNL----FVSTAMVSG 309

Query: 320 YASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMI 379
           Y+ CG +D                               +A+ +FDQ  +KDL+CW+ MI
Sbjct: 310 YSKCGRLD-------------------------------DAQVIFDQTEKKDLVCWTTMI 369

Query: 380 SGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFG 439
           S Y ESD PQEAL +F++M   G+KPDVV++ SVISACA+LG LD+ KW+ + +  NG  
Sbjct: 370 SAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWVHSCIHVNGLE 429

Query: 440 KALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQM 499
             LSINNALI+MYAKCG L+  R VF KMP++NV+SW+SMI+AL+MHG+A +ALSLF +M
Sbjct: 430 SELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDALSLFARM 489

Query: 500 KVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANL 559
           K ENVEPN +TFVGVLY CSH GLVEEG++IF SMTDEY I+PK EH+GCMVDLFGRANL
Sbjct: 490 KQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDLFGRANL 549

Query: 560 LREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNI 619
           LREALEVIE+MP A N +IWGSLM+AC+IHGE ELG+FAAK++L+LEPDHDGALV++SNI
Sbjct: 550 LREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGALVLMSNI 609

Query: 620 YAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEV 679
           YA+E+RWEDV  +R++M +  V KE+G SRI+ N + HEF + D+ HKQ+++I+ KLDEV
Sbjct: 610 YAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIYAKLDEV 669

Query: 680 VQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPR--------ICII 739
           V KL LAGY P    VLVD++EEEKK+LVLWHSEKLALC+ LMNE           I I+
Sbjct: 670 VSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEEEKDSCGVIRIV 722

Query: 740 KNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           KNLR+CEDCH F KL SKVY REII+RDR+RFH Y++GLCSC+DYW
Sbjct: 730 KNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722
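
The percentages in an HSP summary line are the match counts divided by the alignment length. As a minimal sketch (the helper name `hsp_percentages` is hypothetical and not part of any BLAST tool), the reported values can be recomputed from the fractions in the line:

```python
import re

def hsp_percentages(line):
    """Recompute identity and positive percentages from an HSP summary line.

    Assumption: the first 'a/b' fraction on the line is the identity count
    and the second is the positives count, as in the summary lines here.
    """
    fractions = re.findall(r"(\d+)/(\d+)", line)
    if len(fractions) < 2:
        raise ValueError("expected identity and positives fractions")
    (ident, ident_len), (pos, pos_len) = fractions[0], fractions[1]
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    return identity, positives

line = "Identity = 387/766 (50.52%), Positives = 509/766 (66.45%), Query Frame = 1"
print(hsp_percentages(line))
```

For the first Swiss-Prot hit this reproduces the reported 50.52% identity and 66.45% positives over an alignment length of 766 columns.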

BLAST of ClCG05G005100 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 5.2e-155
Identity = 294/754 (38.99%), Positives = 449/754 (59.55%), Query Frame = 1

Query: 29  LSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSP---SLDYALSVFDQIPQ 88
           L +  +L  L+ +HAQ+++  L   ++N  L +LI   C LSP    L YA+SVF  I +
Sbjct: 40  LHNCKTLQSLRIIHAQMIKIGLH--NTNYALSKLI-EFCILSPHFEGLPYAISVFKTIQE 99

Query: 89  PKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGMEI 148
           P   + N + R  +  S+P   L +Y  M + GL  + Y FP +LK+ +++ + + G +I
Sbjct: 100 PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 159

Query: 149 HGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVVA 208
           HG   KLG   D +V T L+ MY   GR+ +A  VFDK  HRDVV+++ +I GY +    
Sbjct: 160 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRG-- 219

Query: 209 LHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEMK 268
                    Y+        NA     +I  + + +    I  Y  +G Y  A +LF++M 
Sbjct: 220 ---------YIE-------NAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMM 279

Query: 269 RTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSMD 328
           +T + PDE  + TV+SACA++G+++ G ++H +I       +  + +ALI +Y+ CG ++
Sbjct: 280 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 339

Query: 329 LAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMISGYTESDC 388
            A   +E++                                 KD+I W+ +I GYT  + 
Sbjct: 340 TACGLFERLP-------------------------------YKDVISWNTLIGGYTHMNL 399

Query: 389 PQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKN--GFGKALSIN 448
            +EAL+LF++M + G  P+ VT+LS++ ACAHLGA+D G+WI  Y+DK   G   A S+ 
Sbjct: 400 YKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLR 459

Query: 449 NALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVE 508
            +LIDMYAKCG +E A +VF  +  K++ SW +MI   AMHG A  +  LF +M+   ++
Sbjct: 460 TSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQ 519

Query: 509 PNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREALE 568
           P+ ITFVG+L ACSH G+++ GR IF +MT +Y ++PK EH+GCM+DL G + L +EA E
Sbjct: 520 PDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEE 579

Query: 569 VIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKERR 628
           +I  M   P+ +IW SL+ AC++HG  ELGE  A+ ++K+EP++ G+ V+LSNIYA   R
Sbjct: 580 MINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGR 639

Query: 629 WEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKLNL 688
           W +V + R L+   G+ K  GCS IE+++ VHEF + D+ H +  +I+  L+E+   L  
Sbjct: 640 WNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEK 699

Query: 689 AGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMN--EGPRICIIKNLRICEDCHAF 748
           AG+ P T+ VL +++EE K+  +  HSEKLA+ + L++   G ++ I+KNLR+C +CH  
Sbjct: 700 AGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEA 741

Query: 749 MKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
            KL SK+Y REII RDR+RFHH+RDG+CSC DYW
Sbjct: 760 TKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of ClCG05G005100 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 482.6 bits (1241), Expect = 7.8e-135
Identity = 232/528 (43.94%), Positives = 352/528 (66.67%), Query Frame = 1

Query: 252 GFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQ 311
           G  D A +LF++M+  +++   + +  VLSACA+  NL+FG ++  +I +  + ++  L 
Sbjct: 211 GSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLA 270

Query: 312 SALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLI 371
           +A++ MY  CGS++ A   ++ +  K+ V  T M+ G A       AR V + M +KD++
Sbjct: 271 NAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIV 330

Query: 372 CWSAMISGYTESDCPQEALVLFKKMQ-QQGMKPDVVTILSVISACAHLGALDQGKWIQTY 431
            W+A+IS Y ++  P EAL++F ++Q Q+ MK + +T++S +SACA +GAL+ G+WI +Y
Sbjct: 331 AWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSY 390

Query: 432 VDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNA 491
           + K+G      + +ALI MY+KCG LE +R+VF  + K++V  W++MI  LAMHG    A
Sbjct: 391 IKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEA 450

Query: 492 LSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVD 551
           + +F++M+  NV+PN +TF  V  ACSH GLV+E   +FH M   YGI P+ +H+ C+VD
Sbjct: 451 VDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVD 510

Query: 552 LFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGA 611
           + GR+  L +A++ IEAMP  P+  +WG+L+ AC+IH    L E A  ++L+LEP +DGA
Sbjct: 511 VLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGA 570

Query: 612 LVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQI 671
            V+LSNIYAK  +WE+V E+RK M   G+ KE GCS IE++  +HEF   D  H  ++++
Sbjct: 571 HVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKV 630

Query: 672 HQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVL-WHSEKLALCYALMN-EGPRIC- 731
           + KL EV++KL   GY P+ + VL  ++EEE KE  L  HSEKLA+CY L++ E P++  
Sbjct: 631 YGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIR 690

Query: 732 IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           +IKNLR+C DCH+  KL S++Y REII+RDR RFHH+R+G CSC D+W
Sbjct: 691 VIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738


HSP 2 Score: 348.2 bits (892), Expect = 2.3e-94
Identity = 249/779 (31.96%), Positives = 389/779 (49.94%), Query Frame = 1

Query: 34  SLFHLKQVHAQILRSKL--ERCDSNSLLFELILSSCALSPSLDYALSVFDQIPQPKTRLC 93
           SL  LKQ H  ++R+    +   ++ L     LSS A   SL+YA  VFD+IP+P +   
Sbjct: 42  SLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFA---SLEYARKVFDEIPKPNSFAW 101

Query: 94  NKLLRQLSRGSEPEFTLFVYEKMRAEGLSL-DRYCFPPLLKAASRNLSLRTGMEIHGLAS 153
           N L+R  + G +P  +++ +  M +E     ++Y FP L+KAA+   SL  G  +HG+A 
Sbjct: 102 NTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAV 161

Query: 154 KLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGY----------- 213
           K   GSD FV   L+  Y +CG +  A  VF  +  +DVV+W+ MI+G+           
Sbjct: 162 KSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALE 221

Query: 214 -----EANVVALHLLVKVNVYLSILSFSSLN------AWLFRRQITAELIFAPYPQIYRY 273
                E+  V    +  V V  +     +L       +++   ++   L  A    +  Y
Sbjct: 222 LFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLA-NAMLDMY 281

Query: 274 CLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDP 333
              G  + A +LF+ M+    E D +  +T+L   A + + +   ++   + +K+IV   
Sbjct: 282 TKCGSIEDAKRLFDAME----EKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIV--- 341

Query: 334 HLQSALITMYASCGSMDLAW-DFYEKISPKNMVVS-TAMVSGLAKGGQIGEAR------- 393
              +ALI+ Y   G  + A   F+E    KNM ++   +VS L+   Q+G          
Sbjct: 342 -AWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHS 401

Query: 394 YVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLG 453
           Y+    +  +    SA+I  Y++    +++  +F  ++    K DV    ++I   A  G
Sbjct: 402 YIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVE----KRDVFVWSAMIGGLAMHG 461

Query: 454 ALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIH 513
                                   N  +DM+ K               K N +++T++  
Sbjct: 462 C----------------------GNEAVDMFYKMQEAN---------VKPNGVTFTNVFC 521

Query: 514 ALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGIS 573
           A +  G    A SLFHQM+                  S+ G+V E               
Sbjct: 522 ACSHTGLVDEAESLFHQME------------------SNYGIVPE--------------- 581

Query: 574 PKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQ 633
              +H+ C+VD+ GR+  L +A++ IEAMP  P+  +WG+L+ AC+IH    L E A  +
Sbjct: 582 --EKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTR 641

Query: 634 VLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQM 693
           +L+LEP +DGA V+LSNIYAK  +WE+V E+RK M   G+ KE GCS IE++  +HEF  
Sbjct: 642 LLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLS 701

Query: 694 ADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVL-WHSEKLALCYA 753
            D  H  +++++ KL EV++KL   GY P+ + VL  ++EEE KE  L  HSEKLA+CY 
Sbjct: 702 GDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYG 738

Query: 754 LMN-EGPRIC-IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           L++ E P++  +IKNLR+C DCH+  KL S++Y REII+RDR RFHH+R+G CSC D+W
Sbjct: 762 LISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of ClCG05G005100 vs. Swiss-Prot
Match: PP168_ARATH (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 471.9 bits (1213), Expect = 1.4e-131
Identity = 265/724 (36.60%), Positives = 411/724 (56.77%), Query Frame = 1

Query: 63  ILSSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLD 122
           +LS+ +    +D     FDQ+PQ  +     ++       +    + V   M  EG+   
Sbjct: 86  VLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPT 145

Query: 123 RYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFD 182
           ++    +L + +    + TG ++H    KLG   +  V   L+ MYA CG  M A+ VFD
Sbjct: 146 QFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFD 205

Query: 183 KMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPY 242
           +M  RD+ +W+ MI        ALH+ V   + L++  F  +      R I         
Sbjct: 206 RMVVRDISSWNAMI--------ALHMQVG-QMDLAMAQFEQMA----ERDIVT------- 265

Query: 243 PQIYRYCLSGF----YDL-AFQLFEEMKRTEL-EPDEMILSTVLSACARAGNLDFGTKIH 302
              +   +SGF    YDL A  +F +M R  L  PD   L++VLSACA    L  G +IH
Sbjct: 266 ---WNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIH 325

Query: 303 EFITKKNIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVS--TAMVSGLAKGGQ 362
             I      +   + +ALI+MY+ CG ++ A    E+   K++ +   TA++ G  K G 
Sbjct: 326 SHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGD 385

Query: 363 IGEARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISA 422
           + +A+ +F  + ++D++ W+AMI GY +     EA+ LF+ M   G +P+  T+ +++S 
Sbjct: 386 MNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSV 445

Query: 423 CAHLGALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMP-KKNVIS 482
            + L +L  GK I     K+G   ++S++NALI MYAK G++  A + F  +  +++ +S
Sbjct: 446 ASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVS 505

Query: 483 WTSMIHALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMT 542
           WTSMI ALA HG A  AL LF  M +E + P+ IT+VGV  AC+H GLV +GR+ F  M 
Sbjct: 506 WTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMK 565

Query: 543 DEYGISPKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELG 602
           D   I P   H+ CMVDLFGRA LL+EA E IE MP  P+ + WGSL++AC++H   +LG
Sbjct: 566 DVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLG 625

Query: 603 EFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNE 662
           + AA+++L LEP++ GA   L+N+Y+   +WE+  ++RK M    V KE+G S IE+ ++
Sbjct: 626 KVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHK 685

Query: 663 VHEFQMADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKL 722
           VH F + D  H + ++I+  + ++  ++   GY P T  VL DL+EE K++++  HSEKL
Sbjct: 686 VHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKL 745

Query: 723 ALCYALMN--EGPRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSC 776
           A+ + L++  +   + I+KNLR+C DCH  +K  SK+  REII+RD +RFHH++DG CSC
Sbjct: 746 AIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFCSC 786


HSP 2 Score: 156.4 bits (394), Expect = 1.3e-36
Identity = 126/522 (24.14%), Positives = 227/522 (43.49%), Query Frame = 1

Query: 125 CFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKM 184
           C   L K+ +++    T   +H    K G     ++   L+ +Y+  G  + AR +FD+M
Sbjct: 16  CTNLLQKSVNKSNGRFTAQLVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEM 75

Query: 185 SHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQ 244
             R   +W+ ++  Y           + ++  +   F  L     R  ++   +   Y  
Sbjct: 76  PLRTAFSWNTVLSAYSK---------RGDMDSTCEFFDQLPQ---RDSVSWTTMIVGYKN 135

Query: 245 IYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNI 304
           I      G Y  A ++  +M +  +EP +  L+ VL++ A    ++ G K+H FI K  +
Sbjct: 136 I------GQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGL 195

Query: 305 VMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQ 364
             +  + ++L+ MYA CG   +A   ++++  +++    AM++   + GQ+  A   F+Q
Sbjct: 196 RGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQ 255

Query: 365 MVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGM-KPDVVTILSVISACAHL----- 424
           M E+D++ W++MISG+ +      AL +F KM +  +  PD  T+ SV+SACA+L     
Sbjct: 256 MAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCI 315

Query: 425 GALDQGKWIQTYVDKNGF--------------------------GKALSINN--ALIDMY 484
           G       + T  D +G                            K L I    AL+D Y
Sbjct: 316 GKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGY 375

Query: 485 AKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVEPNWITFV 544
            K G +  A+ +F  +  ++V++WT+MI     HG    A++LF  M      PN  T  
Sbjct: 376 IKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLA 435

Query: 545 GVLYACSHGGLVEEGRRIFHSMT---DEYGISPKHEHFGCMVDLFGRANLLREALEVIEA 604
            +L   S    +  G++I  S     + Y +S  +     ++ ++ +A  +  A    + 
Sbjct: 436 AMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSN----ALITMYAKAGNITSASRAFDL 495

Query: 605 MPFAPNAIIWGSLMAACQIHGETE--LGEFAAKQVLKLEPDH 608
           +    + + W S++ A   HG  E  L  F    +  L PDH
Sbjct: 496 IRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDH 515

BLAST of ClCG05G005100 vs. Swiss-Prot
Match: PPR53_ARATH (Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN=PCMP-H21 PE=2 SV=2)

HSP 1 Score: 471.1 bits (1211), Expect = 2.4e-131
Identity = 253/756 (33.47%), Positives = 440/756 (58.20%), Query Frame = 1

Query: 26  SAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSPSLDYALSVFDQIPQ 85
           S++   +SSL    Q HA+IL+S  +   ++  +   +++S +     + A  V   IP 
Sbjct: 22  SSSYHWSSSLSKTTQAHARILKSGAQ---NDGYISAKLIASYSNYNCFNDADLVLQSIPD 81

Query: 86  PKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGMEI 145
           P     + L+  L++      ++ V+ +M + GL  D +  P L K  +   + + G +I
Sbjct: 82  PTIYSFSSLIYALTKAKLFTQSIGVFSRMFSHGLIPDSHVLPNLFKVCAELSAFKVGKQI 141

Query: 146 HGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVVA 205
           H ++   G   D FV+  +  MY  CGR+ +AR VFD+MS +DVV  S ++  Y A    
Sbjct: 142 HCVSCVSGLDMDAFVQGSMFHMYMRCGRMGDARKVFDRMSDKDVVTCSALLCAY-ARKGC 201

Query: 206 LHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEMK 265
           L  +V++   LS +  S + A +    ++   I + + +      SG++  A  +F+++ 
Sbjct: 202 LEEVVRI---LSEMESSGIEANI----VSWNGILSGFNR------SGYHKEAVVMFQKIH 261

Query: 266 RTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSMD 325
                PD++ +S+VL +   +  L+ G  IH ++ K+ ++ D  + SA+I MY   G + 
Sbjct: 262 HLGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKSGHVY 321

Query: 326 LAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFD----QMVEKDLICWSAMISGYT 385
                + +       V  A ++GL++ G + +A  +F+    Q +E +++ W+++I+G  
Sbjct: 322 GIISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSIIAGCA 381

Query: 386 ESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFGKALS 445
           ++    EAL LF++MQ  G+KP+ VTI S++ AC ++ AL  G+    +  +      + 
Sbjct: 382 QNGKDIEALELFREMQVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLLDNVH 441

Query: 446 INNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVEN 505
           + +ALIDMYAKCG +  ++ VF  MP KN++ W S+++  +MHG A   +S+F  +    
Sbjct: 442 VGSALIDMYAKCGRINLSQIVFNMMPTKNLVCWNSLMNGFSMHGKAKEVMSIFESLMRTR 501

Query: 506 VEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREA 565
           ++P++I+F  +L AC   GL +EG + F  M++EYGI P+ EH+ CMV+L GRA  L+EA
Sbjct: 502 LKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEA 561

Query: 566 LEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKE 625
            ++I+ MPF P++ +WG+L+ +C++    +L E AA+++  LEP++ G  V+LSNIYA +
Sbjct: 562 YDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAK 621

Query: 626 RRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKL 685
             W +V  +R  M  +G+ K  GCS I++ N V+     D++H Q DQI +K+DE+ +++
Sbjct: 622 GMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISKEM 681

Query: 686 NLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMN--EGPRICIIKNLRICEDCH 745
             +G+ P  ++ L D++E+E+++++  HSEKLA+ + L+N  +G  + +IKNLRIC DCH
Sbjct: 682 RKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCH 741

Query: 746 AFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           A +K  S    REI IRD +RFHH++DG+CSC D+W
Sbjct: 742 AVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDFW 760

BLAST of ClCG05G005100 vs. TrEMBL
Match: D7T700_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g03630 PE=4 SV=1)

HSP 1 Score: 910.6 bits (2352), Expect = 1.3e-261
Identity = 472/772 (61.14%), Positives = 568/772 (73.58%), Query Frame = 1

Query: 12  PLQLHTYPTRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSP 71
           P  LH++ T    L +ALSSA+SL HLKQVHAQILRSKL+R  S SLL +L++SSCALS 
Sbjct: 17  PTTLHSHHT----LFSALSSATSLTHLKQVHAQILRSKLDR--STSLLVKLVISSCALSS 76

Query: 72  SLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLK 131
           SLDYALSVF+ IP+P+T LCN+ LR+LSR  EPE TL VYE+MR +GL++DR+ FPPLLK
Sbjct: 77  SLDYALSVFNLIPKPETHLCNRFLRELSRSEEPEKTLLVYERMRTQGLAVDRFSFPPLLK 136

Query: 132 AASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVA 191
           A SR  SL  G+EIHGLA+KLGF SDPFV+TGLVRMYAACGRI EARL+FDKM HRDVV 
Sbjct: 137 ALSRVKSLVEGLEIHGLAAKLGFDSDPFVQTGLVRMYAACGRIAEARLMFDKMFHRDVVT 196

Query: 192 WSIMIDGY------EANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQI 251
           WSIMIDGY         ++    +   NV    +  S++ +   R               
Sbjct: 197 WSIMIDGYCQSGLFNDALLLFEEMKNYNVEPDEMMLSTVLSACGR--------------- 256

Query: 252 YRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIV 311
                +G       + + +    +  D  + S +++  A  G++D    + E +T KN+V
Sbjct: 257 -----AGNLSYGKMIHDFIMENNIVVDPHLQSALVTMYASCGSMDLALNLFEKMTPKNLV 316

Query: 312 MDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQM 371
                 +A++T Y                               +K GQI  AR VF+QM
Sbjct: 317 ----ASTAMVTGY-------------------------------SKLGQIENARSVFNQM 376

Query: 372 VEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGK 431
           V+KDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VT+LSVI+ACAHLGALDQ K
Sbjct: 377 VKKDLVCWSAMISGYAESDSPQEALNLFNEMQSLGIKPDQVTMLSVITACAHLGALDQAK 436

Query: 432 WIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHG 491
           WI  +VDKNGFG AL INNALI+MYAKCGSLE AR++F KMP+KNVISWT MI A AMHG
Sbjct: 437 WIHLFVDKNGFGGALPINNALIEMYAKCGSLERARRIFDKMPRKNVISWTCMISAFAMHG 496

Query: 492 DAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHF 551
           DA +AL  FHQM+ EN+EPN ITFVGVLYACSH GLVEEGR+IF+SM +E+ I+PKH H+
Sbjct: 497 DAGSALRFFHQMEDENIEPNGITFVGVLYACSHAGLVEEGRKIFYSMINEHNITPKHVHY 556

Query: 552 GCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEP 611
           GCMVDLFGRANLLREALE++EAMP APN IIWGSLMAAC++HGE ELGEFAAK++L+L+P
Sbjct: 557 GCMVDLFGRANLLREALELVEAMPLAPNVIIWGSLMAACRVHGEIELGEFAAKRLLELDP 616

Query: 612 DHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHK 671
           DHDGA V LSNIYAK RRWEDVG+VRKLM   G+SKERGCSR ELNNE+HEF +ADR+HK
Sbjct: 617 DHDGAHVFLSNIYAKARRWEDVGQVRKLMKHKGISKERGCSRFELNNEIHEFLVADRSHK 676

Query: 672 QADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPR 731
            AD+I++KL EVV KL L GY+P T  +LVDL+EEEKKE+VLWHSEKLALCY LM +G  
Sbjct: 677 HADEIYEKLYEVVSKLKLVGYSPNTCSILVDLEEEEKKEVVLWHSEKLALCYGLMRDGTG 727

Query: 732 IC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
            C  IIKNLR+CEDCH F+KLASKVY REI++RDR+RFHHY+DG+CSCKDYW
Sbjct: 737 SCIRIIKNLRVCEDCHTFIKLASKVYEREIVVRDRTRFHHYKDGVCSCKDYW 727

BLAST of ClCG05G005100 vs. TrEMBL
Match: B9HUV1_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s09620g PE=4 SV=2)

HSP 1 Score: 880.2 bits (2273), Expect = 1.9e-252
Identity = 462/773 (59.77%), Positives = 555/773 (71.80%), Query Frame = 1

Query: 17  TYPTRPT-----ALSAALSSAS-----SLFHLKQVHAQILRSKLERCDSNSLLFELILSS 76
           T PT P      AL AALSS S     SL HLKQ+HAQ+LRS L      SLL EL+LSS
Sbjct: 6   TLPTIPVPLTSIALHAALSSTSTSTPTSLPHLKQIHAQVLRSNLPP----SLLLELLLSS 65

Query: 77  CALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRA-EGL-SLDRY 136
                SLDYALSVF  +P+  T L NKL R LSR ++PE  L  YEK+R  EGL  +DR+
Sbjct: 66  S----SLDYALSVFTHLPKCHTPLSNKLFRSLSRSAKPETALLAYEKIRLKEGLLGIDRF 125

Query: 137 CFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKM 196
            FPPLLKAASR   L  G EIHG+A+KLGF                              
Sbjct: 126 SFPPLLKAASRASGLNEGKEIHGVATKLGF------------------------------ 185

Query: 197 SHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQ 256
             +D    + ++  Y     +   + +  +    +S+  + AW                 
Sbjct: 186 -DKDPFVQTGLVGMY----ASCDRISEARLVFDKMSYRDVVAWSI--------------M 245

Query: 257 IYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNI 316
           I  Y  SG YD   QLFEEM+ + L+PDEM+L+T++SAC RA NL +G  IH+FI + N 
Sbjct: 246 IDGYHQSGLYDDVLQLFEEMRSSNLKPDEMVLTTIISACGRARNLSYGEAIHDFIIENNF 305

Query: 317 VMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQ 376
           V+D +LQSAL+TMYASCG M++A   + KIS +N+VV TAM+SG ++ G++ +AR +FDQ
Sbjct: 306 VLDTYLQSALLTMYASCGCMEMAQKLFTKISSRNLVVLTAMISGYSRVGRVEDARLIFDQ 365

Query: 377 MVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQG 436
           M EKDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VTILSVISACA LG LD+ 
Sbjct: 366 MEEKDLVCWSAMISGYAESDKPQEALNLFSEMQVFGIKPDQVTILSVISACARLGVLDRA 425

Query: 437 KWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMH 496
           KWI  YVDKNG G AL +NNALIDMYAKCG+L  AR VF KM  +NVISWTSMI+A A+H
Sbjct: 426 KWIHMYVDKNGLGGALPVNNALIDMYAKCGNLGAARGVFEKMQSRNVISWTSMINAFAIH 485

Query: 497 GDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEH 556
           GDA NAL  F+QMK EN++PN +TFVGVLYACSH GLVEEGRR F SMT+E+ I+PKHEH
Sbjct: 486 GDASNALKFFYQMKDENIKPNGVTFVGVLYACSHAGLVEEGRRTFASMTNEHNITPKHEH 545

Query: 557 FGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLE 616
           +GCMVDLFGRANLLR+ALE++E MP APN +IWGSLMAACQIHGE ELGEFAAKQVL+LE
Sbjct: 546 YGCMVDLFGRANLLRDALELVETMPLAPNVVIWGSLMAACQIHGENELGEFAAKQVLELE 605

Query: 617 PDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNH 676
           PDHDGALV LSNIYAK+RRW+DVGE+R LM + G+SKERGCSRIELNN+V+EF MAD+ H
Sbjct: 606 PDHDGALVQLSNIYAKDRRWQDVGELRNLMKQRGISKERGCSRIELNNQVYEFVMADKKH 665

Query: 677 KQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGP 736
           KQAD+I++KLDEVV++L L GYTP T  VLVD++EE KKE+VLWHSEKLALCY LM EG 
Sbjct: 666 KQADKIYEKLDEVVKELKLVGYTPNTRSVLVDVEEEGKKEVVLWHSEKLALCYGLMGEGK 721

Query: 737 RIC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
             C  I+KNLR+CEDCH F+KL SKVY  EII+RDR+RFHHY+ G+CSC DYW
Sbjct: 726 GSCIRIVKNLRVCEDCHTFIKLVSKVYGMEIIVRDRTRFHHYKAGVCSCNDYW 721

BLAST of ClCG05G005100 vs. TrEMBL
Match: A0A067L6S3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02188 PE=4 SV=1)

HSP 1 Score: 852.0 bits (2200), Expect = 5.5e-244
Identity = 437/776 (56.31%), Positives = 555/776 (71.52%), Query Frame = 1

Query: 4   LSHSTS-VLPLQLHTYPTRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFEL 63
           +S STS  LPL L +     T L    SS++SL+HLKQVHAQILRS L    S S+L +L
Sbjct: 1   MSASTSPALPLPLSSATIHTTLLPFLSSSSTSLYHLKQVHAQILRSSL----SPSILLKL 60

Query: 64  ILSSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGL-SL 123
           ILSS +   SL+YALSVF  +P P+  L NK LR LSR S+PE  L VYEK+R +GL  +
Sbjct: 61  ILSSSSSISSLEYALSVFTHLPTPRPALSNKFLRALSRSSKPETVLLVYEKIREDGLFGV 120

Query: 124 DRYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVF 183
           DR+  P LLKAA++  +L  GMEIHG+A+KLG                           F
Sbjct: 121 DRFSLPLLLKAAAKVSALNEGMEIHGVATKLG---------------------------F 180

Query: 184 DKMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAP 243
           DK         S+ +        A   +++  +    +S+  +  W              
Sbjct: 181 DKDPFVQTGLMSLYL--------ACGKILEARLVFDKMSYRDVVTWSI------------ 240

Query: 244 YPQIYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITK 303
              I  Y  +G +D A + FEEMK + ++PD+++LST++SAC+RAGNL +G  +H+FI +
Sbjct: 241 --MINGYYQNGHFDEALKFFEEMKSSNVQPDKVVLSTIISACSRAGNLSYGKAVHDFIIE 300

Query: 304 KNIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYV 363
            NI +DPHL+S LI MYA+CG MD+A + + K+S +N+VVSTAMVSG ++ G + +AR +
Sbjct: 301 NNIEVDPHLESTLIFMYANCGCMDMAKELFFKMSSRNLVVSTAMVSGYSRVGNVKDARLI 360

Query: 364 FDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGAL 423
           FD+M +KDL+CWSAMISGY ESD PQEAL LF +MQ  G++PD VT+LSVISACAHLG L
Sbjct: 361 FDEMDKKDLVCWSAMISGYAESDQPQEALNLFNEMQALGIEPDEVTMLSVISACAHLGVL 420

Query: 424 DQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHAL 483
           DQ K I  +V+++GFG  L +NNALIDMYAKCG LE AR VF KM ++NVISWTSMI+A 
Sbjct: 421 DQAKRIHMFVNESGFGGVLPVNNALIDMYAKCGCLEAARAVFEKMQRRNVISWTSMINAF 480

Query: 484 AMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPK 543
           A+HGDA +AL+ FH+MK EN+EPN +TFVGVLYACSH GLVEEG++IF SM ++Y ISPK
Sbjct: 481 AIHGDANSALNFFHRMKDENIEPNAVTFVGVLYACSHAGLVEEGQKIFASMINDYNISPK 540

Query: 544 HEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVL 603
           HEH+GCMVDLFGRA  LREAL ++E M   PN +IWGSLMAAC++HGETELGEFAA+++L
Sbjct: 541 HEHYGCMVDLFGRAKFLREALNLVETMSLPPNVVIWGSLMAACRVHGETELGEFAAQRLL 600

Query: 604 KLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMAD 663
           +LEP HDGALV+LSNIYAKE+RW+DVG++R LM + G+ KERGCSRIEL+N VHEF  AD
Sbjct: 601 ELEPGHDGALVLLSNIYAKEKRWQDVGQIRNLMKQRGIFKERGCSRIELSNGVHEFSTAD 660

Query: 664 RNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMN 723
           R HKQAD I++KLDEVV  L   GY+P T+ VLVD++EE K E+VLWHSEKLALCY L++
Sbjct: 661 RKHKQADLIYEKLDEVVGNLKFVGYSPDTSVVLVDIEEEAKNEVVLWHSEKLALCYGLIS 720

Query: 724 EGPRIC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           +G   C  I+KNLRICEDCH FMKL SK Y  EII+RDR+RFH Y+DG+CSC DYW
Sbjct: 721 QGKGSCIRIVKNLRICEDCHNFMKLVSKAYELEIIVRDRTRFHRYKDGVCSCNDYW 723

BLAST of ClCG05G005100 vs. TrEMBL
Match: A0A061EMY0_THECC (Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_020645 PE=4 SV=1)

HSP 1 Score: 849.0 bits (2192), Expect = 4.6e-243
Identity = 441/770 (57.27%), Positives = 546/770 (70.91%), Query Frame = 1

Query: 9   SVLPLQLHTYPTRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCA 68
           S++   L + PT P  L   LSS+ SL HLKQ+HAQILRS      S++L+ +L+L    
Sbjct: 10  SLVSPNLKSLPT-PKTLLKTLSSSPSLTHLKQIHAQILRSN--HSHSHTLILKLLL---- 69

Query: 69  LSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPP 128
            SPSL Y+LS+F  +P P   L  + +R LSR S PEF LFVY+++R EG+ +DR+ FPP
Sbjct: 70  FSPSLPYSLSIFSHLPHPLPSLSTRFVRHLSRSSRPEFALFVYQRLRNEGIKIDRFTFPP 129

Query: 129 LLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRD 188
           LLKA +R   L  G EIHG   KLG  SDPFV+TGLV MY ACGR++EAR VFDKMS+RD
Sbjct: 130 LLKAVARVEGLAEGKEIHGFGFKLGLDSDPFVQTGLVGMYLACGRVLEARSVFDKMSYRD 189

Query: 189 VVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRY 248
           +VAWSIMIDGY                LS L   +L  +   ++   E+       I   
Sbjct: 190 IVAWSIMIDGY---------------CLSGLFDDALELFEEMKRANIEVDKFILSSILSA 249

Query: 249 C-LSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMD 308
           C   G  +    + + +    L  D  + S +++  A  G ++   K+   +  KN+V  
Sbjct: 250 CGRVGNLNHGKAIHDYIIEKILVVDSHLQSALMTMYASCGCMEMAQKLFNQMAPKNLV-- 309

Query: 309 PHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVE 368
             + +A+++ Y+                                  +I +AR +FDQMVE
Sbjct: 310 --VSTAMVSGYSR-------------------------------HRRIEDARLIFDQMVE 369

Query: 369 KDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWI 428
           KDL+CWSAMISGY ESD PQEAL LF ++Q  GM+PD VT+LSVISACAHLG L++ KWI
Sbjct: 370 KDLVCWSAMISGYAESDQPQEALRLFNELQSLGMRPDQVTMLSVISACAHLGVLEKAKWI 429

Query: 429 QTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDA 488
             Y DKNGFG AL INNALIDM+AKCGSLE AR VF KM ++NVISWTSMI+A A+HGDA
Sbjct: 430 HVYADKNGFGGALPINNALIDMHAKCGSLERARGVFEKMTRRNVISWTSMINAFAIHGDA 489

Query: 489 PNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGC 548
            NALS FH+MK  +VEPN +TFVGVLYACSH GLV+EG+RIF SM +E+ I+PKHEH+GC
Sbjct: 490 NNALSFFHKMKEAHVEPNGVTFVGVLYACSHAGLVDEGQRIFASMINEHKIAPKHEHYGC 549

Query: 549 MVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDH 608
           MVDLFGRANLLREALE++E MP APN +IWGSLMAACQIHGETELGEFAAK++L+LEPDH
Sbjct: 550 MVDLFGRANLLREALEIVETMPLAPNVVIWGSLMAACQIHGETELGEFAAKRLLELEPDH 609

Query: 609 DGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQA 668
           DGALV+LSNIYAKE++W+DVGE+R LM + G+SKE+GCSRIELNNEVHEF MADRNHKQA
Sbjct: 610 DGALVLLSNIYAKEKKWQDVGELRHLMKERGISKEKGCSRIELNNEVHEFLMADRNHKQA 669

Query: 669 DQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPRIC 728
           D+I++KLDEV+ +L L GY P T  VLVDL+EEEK+E+VLWHSEKLALCY L+N     C
Sbjct: 670 DKIYEKLDEVISQLKLVGYFPNTRSVLVDLEEEEKREVVLWHSEKLALCYGLINGEKDSC 722

Query: 729 --IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
             I+KNLR+CEDCH FMKL SK+Y REI++RDR+RFHHY+DGLCSCKDYW
Sbjct: 730 IRIVKNLRVCEDCHTFMKLVSKLYGREIVVRDRTRFHHYKDGLCSCKDYW 722

BLAST of ClCG05G005100 vs. TrEMBL
Match: I1LFU4_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_11G006200 PE=4 SV=2)

HSP 1 Score: 839.0 bits (2166), Expect = 4.8e-240
Identity = 431/753 (57.24%), Positives = 538/753 (71.45%), Query Frame = 1

Query: 29  LSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCAL-SPS---LDYALSVFDQIP 88
           L+S  +L H+KQ+HAQILRSK++  +SN LL +L+L  C L SPS   LDYALS+F  IP
Sbjct: 19  LASCKTLRHVKQIHAQILRSKMD--NSNLLLLKLVLCCCTLPSPSPSALDYALSLFSHIP 78

Query: 89  QPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGME 148
            P TR  N+LLRQ SRG  PE TL +Y  +R  G  LDR+ FPPLLKA S+  +L  G+E
Sbjct: 79  NPPTRFSNQLLRQFSRGPTPENTLSLYLHLRRNGFPLDRFSFPPLLKAVSKLSALNLGLE 138

Query: 149 IHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVV 208
           IHGLASK GF                               H D    S +I  Y     
Sbjct: 139 IHGLASKFGF------------------------------FHADPFIQSALIAMY----A 198

Query: 209 ALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEM 268
           A   ++        +S   +  W                 I  Y  +  YD   +L+EEM
Sbjct: 199 ACGRIMDARFLFDKMSHRDVVTWNI--------------MIDGYSQNAHYDHVLKLYEEM 258

Query: 269 KRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSM 328
           K +  EPD +IL TVLSACA AGNL +G  IH+FI      +  H+Q++L+ MYA+CG+M
Sbjct: 259 KTSGTEPDAIILCTVLSACAHAGNLSYGKAIHQFIKDNGFRVGSHIQTSLVNMYANCGAM 318

Query: 329 DLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMISGYTESD 388
            LA + Y+++  K+MVVSTAM+SG AK G + +AR++FD+MVEKDL+CWSAMISGY ES 
Sbjct: 319 HLAREVYDQLPSKHMVVSTAMLSGYAKLGMVQDARFIFDRMVEKDLVCWSAMISGYAESY 378

Query: 389 CPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFGKALSINN 448
            P EAL LF +MQ++ + PD +T+LSVISACA++GAL Q KWI TY DKNGFG+ L INN
Sbjct: 379 QPLEALQLFNEMQRRRIVPDQITMLSVISACANVGALVQAKWIHTYADKNGFGRTLPINN 438

Query: 449 ALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVEP 508
           ALIDMYAKCG+L  AR+VF  MP+KNVISW+SMI+A AMHGDA +A++LFH+MK +N+EP
Sbjct: 439 ALIDMYAKCGNLVKAREVFENMPRKNVISWSSMINAFAMHGDADSAIALFHRMKEQNIEP 498

Query: 509 NWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREALEV 568
           N +TF+GVLYACSH GLVEEG++ F SM +E+ ISP+ EH+GCMVDL+ RAN LR+A+E+
Sbjct: 499 NGVTFIGVLYACSHAGLVEEGQKFFSSMINEHRISPQREHYGCMVDLYCRANHLRKAMEL 558

Query: 569 IEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKERRW 628
           IE MPF PN IIWGSLM+ACQ HGE ELGEFAA ++L+LEPDHDGALVVLSNIYAKE+RW
Sbjct: 559 IETMPFPPNVIIWGSLMSACQNHGEIELGEFAATRLLELEPDHDGALVVLSNIYAKEKRW 618

Query: 629 EDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKLNLA 688
           +DVG VRKLM   GVSKE+ CSRIE+NNEVH F MADR HKQ+D+I++KLD VV +L L 
Sbjct: 619 DDVGLVRKLMKHKGVSKEKACSRIEVNNEVHVFMMADRYHKQSDEIYKKLDAVVSQLKLV 678

Query: 689 GYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPRIC--IIKNLRICEDCHAFM 748
           GYTP T+ +LVDL+EEEKKE+VLWHSEKLALCY L+ E    C  I+KNLRICEDCH+FM
Sbjct: 679 GYTPSTSGILVDLEEEEKKEVVLWHSEKLALCYGLIGERKESCIRIVKNLRICEDCHSFM 721

Query: 749 KLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           KL SKV+  EI++RDR+RFHH+  G+CSC+DYW
Sbjct: 739 KLVSKVHRIEIVMRDRTRFHHFNGGICSCRDYW 721

BLAST of ClCG05G005100 vs. TAIR10
Match: AT4G14820.1 (AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 731.1 bits (1886), Expect = 7.1e-211
Identity = 387/766 (50.52%), Positives = 509/766 (66.45%), Query Frame = 1

Query: 20  TRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSPSLDYALSV 79
           T    +   LS   SL H+KQ+HA ILR+ +     NS LF L +SS +++  L YAL+V
Sbjct: 10  TAANTILEKLSFCKSLNHIKQLHAHILRTVINH-KLNSFLFNLSVSSSSIN--LSYALNV 69

Query: 80  FDQIPQPKTRLC-NKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLS 139
           F  IP P   +  N  LR LSR SEP  T+  Y+++R  G  LD++ F P+LKA S+  +
Sbjct: 70  FSSIPSPPESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSA 129

Query: 140 LRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDG 199
           L  GME+HG+A K+    DPFVETG + MYA+CGRI  AR VFD+MSHRDVV W+ MI+ 
Sbjct: 130 LFEGMELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIER 189

Query: 200 YEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYC-LSGFYDLA 259
           Y                L   +F           +  E+I      I   C  +G     
Sbjct: 190 Y------------CRFGLVDEAFKLFEEMKDSNVMPDEMILC---NIVSACGRTGNMRYN 249

Query: 260 FQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITM 319
             ++E +   ++  D  +L+ +++  A AG +D   +    ++ +N+     + +A+++ 
Sbjct: 250 RAIYEFLIENDVRMDTHLLTALVTMYAGAGCMDMAREFFRKMSVRNL----FVSTAMVSG 309

Query: 320 YASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMI 379
           Y+ CG +D                               +A+ +FDQ  +KDL+CW+ MI
Sbjct: 310 YSKCGRLD-------------------------------DAQVIFDQTEKKDLVCWTTMI 369

Query: 380 SGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFG 439
           S Y ESD PQEAL +F++M   G+KPDVV++ SVISACA+LG LD+ KW+ + +  NG  
Sbjct: 370 SAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWVHSCIHVNGLE 429

Query: 440 KALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQM 499
             LSINNALI+MYAKCG L+  R VF KMP++NV+SW+SMI+AL+MHG+A +ALSLF +M
Sbjct: 430 SELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDALSLFARM 489

Query: 500 KVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANL 559
           K ENVEPN +TFVGVLY CSH GLVEEG++IF SMTDEY I+PK EH+GCMVDLFGRANL
Sbjct: 490 KQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDLFGRANL 549

Query: 560 LREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNI 619
           LREALEVIE+MP A N +IWGSLM+AC+IHGE ELG+FAAK++L+LEPDHDGALV++SNI
Sbjct: 550 LREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGALVLMSNI 609

Query: 620 YAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEV 679
           YA+E+RWEDV  +R++M +  V KE+G SRI+ N + HEF + D+ HKQ+++I+ KLDEV
Sbjct: 610 YAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIYAKLDEV 669

Query: 680 VQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPR--------ICII 739
           V KL LAGY P    VLVD++EEEKK+LVLWHSEKLALC+ LMNE           I I+
Sbjct: 670 VSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEEEKDSCGVIRIV 722

Query: 740 KNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           KNLR+CEDCH F KL SKVY REII+RDR+RFH Y++GLCSC+DYW
Sbjct: 730 KNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of ClCG05G005100 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 549.7 bits (1415), Expect = 2.9e-156
Identity = 294/754 (38.99%), Positives = 449/754 (59.55%), Query Frame = 1

Query: 29  LSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSP---SLDYALSVFDQIPQ 88
           L +  +L  L+ +HAQ+++  L   ++N  L +LI   C LSP    L YA+SVF  I +
Sbjct: 40  LHNCKTLQSLRIIHAQMIKIGLH--NTNYALSKLI-EFCILSPHFEGLPYAISVFKTIQE 99

Query: 89  PKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGMEI 148
           P   + N + R  +  S+P   L +Y  M + GL  + Y FP +LK+ +++ + + G +I
Sbjct: 100 PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 159

Query: 149 HGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVVA 208
           HG   KLG   D +V T L+ MY   GR+ +A  VFDK  HRDVV+++ +I GY +    
Sbjct: 160 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRG-- 219

Query: 209 LHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEMK 268
                    Y+        NA     +I  + + +    I  Y  +G Y  A +LF++M 
Sbjct: 220 ---------YIE-------NAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMM 279

Query: 269 RTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSMD 328
           +T + PDE  + TV+SACA++G+++ G ++H +I       +  + +ALI +Y+ CG ++
Sbjct: 280 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 339

Query: 329 LAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMISGYTESDC 388
            A   +E++                                 KD+I W+ +I GYT  + 
Sbjct: 340 TACGLFERLP-------------------------------YKDVISWNTLIGGYTHMNL 399

Query: 389 PQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKN--GFGKALSIN 448
            +EAL+LF++M + G  P+ VT+LS++ ACAHLGA+D G+WI  Y+DK   G   A S+ 
Sbjct: 400 YKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLR 459

Query: 449 NALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVE 508
            +LIDMYAKCG +E A +VF  +  K++ SW +MI   AMHG A  +  LF +M+   ++
Sbjct: 460 TSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQ 519

Query: 509 PNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREALE 568
           P+ ITFVG+L ACSH G+++ GR IF +MT +Y ++PK EH+GCM+DL G + L +EA E
Sbjct: 520 PDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEE 579

Query: 569 VIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKERR 628
           +I  M   P+ +IW SL+ AC++HG  ELGE  A+ ++K+EP++ G+ V+LSNIYA   R
Sbjct: 580 MINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGR 639

Query: 629 WEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKLNL 688
           W +V + R L+   G+ K  GCS IE+++ VHEF + D+ H +  +I+  L+E+   L  
Sbjct: 640 WNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEK 699

Query: 689 AGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMN--EGPRICIIKNLRICEDCHAF 748
           AG+ P T+ VL +++EE K+  +  HSEKLA+ + L++   G ++ I+KNLR+C +CH  
Sbjct: 700 AGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEA 741

Query: 749 MKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
            KL SK+Y REII RDR+RFHH+RDG+CSC DYW
Sbjct: 760 TKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of ClCG05G005100 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 482.6 bits (1241), Expect = 4.4e-136
Identity = 232/528 (43.94%), Positives = 352/528 (66.67%), Query Frame = 1

Query: 252 GFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQ 311
           G  D A +LF++M+  +++   + +  VLSACA+  NL+FG ++  +I +  + ++  L 
Sbjct: 211 GSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLA 270

Query: 312 SALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLI 371
           +A++ MY  CGS++ A   ++ +  K+ V  T M+ G A       AR V + M +KD++
Sbjct: 271 NAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIV 330

Query: 372 CWSAMISGYTESDCPQEALVLFKKMQ-QQGMKPDVVTILSVISACAHLGALDQGKWIQTY 431
            W+A+IS Y ++  P EAL++F ++Q Q+ MK + +T++S +SACA +GAL+ G+WI +Y
Sbjct: 331 AWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSY 390

Query: 432 VDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNA 491
           + K+G      + +ALI MY+KCG LE +R+VF  + K++V  W++MI  LAMHG    A
Sbjct: 391 IKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEA 450

Query: 492 LSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVD 551
           + +F++M+  NV+PN +TF  V  ACSH GLV+E   +FH M   YGI P+ +H+ C+VD
Sbjct: 451 VDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVD 510

Query: 552 LFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGA 611
           + GR+  L +A++ IEAMP  P+  +WG+L+ AC+IH    L E A  ++L+LEP +DGA
Sbjct: 511 VLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGA 570

Query: 612 LVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQI 671
            V+LSNIYAK  +WE+V E+RK M   G+ KE GCS IE++  +HEF   D  H  ++++
Sbjct: 571 HVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKV 630

Query: 672 HQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVL-WHSEKLALCYALMN-EGPRIC- 731
           + KL EV++KL   GY P+ + VL  ++EEE KE  L  HSEKLA+CY L++ E P++  
Sbjct: 631 YGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIR 690

Query: 732 IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           +IKNLR+C DCH+  KL S++Y REII+RDR RFHH+R+G CSC D+W
Sbjct: 691 VIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738


HSP 2 Score: 348.2 bits (892), Expect = 1.3e-95
Identity = 249/779 (31.96%), Positives = 389/779 (49.94%), Query Frame = 1

Query: 34  SLFHLKQVHAQILRSKL--ERCDSNSLLFELILSSCALSPSLDYALSVFDQIPQPKTRLC 93
           SL  LKQ H  ++R+    +   ++ L     LSS A   SL+YA  VFD+IP+P +   
Sbjct: 42  SLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFA---SLEYARKVFDEIPKPNSFAW 101

Query: 94  NKLLRQLSRGSEPEFTLFVYEKMRAEGLSL-DRYCFPPLLKAASRNLSLRTGMEIHGLAS 153
           N L+R  + G +P  +++ +  M +E     ++Y FP L+KAA+   SL  G  +HG+A 
Sbjct: 102 NTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAV 161

Query: 154 KLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGY----------- 213
           K   GSD FV   L+  Y +CG +  A  VF  +  +DVV+W+ MI+G+           
Sbjct: 162 KSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALE 221

Query: 214 -----EANVVALHLLVKVNVYLSILSFSSLN------AWLFRRQITAELIFAPYPQIYRY 273
                E+  V    +  V V  +     +L       +++   ++   L  A    +  Y
Sbjct: 222 LFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLA-NAMLDMY 281

Query: 274 CLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDP 333
              G  + A +LF+ M+    E D +  +T+L   A + + +   ++   + +K+IV   
Sbjct: 282 TKCGSIEDAKRLFDAME----EKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIV--- 341

Query: 334 HLQSALITMYASCGSMDLAW-DFYEKISPKNMVVS-TAMVSGLAKGGQIGEAR------- 393
              +ALI+ Y   G  + A   F+E    KNM ++   +VS L+   Q+G          
Sbjct: 342 -AWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHS 401

Query: 394 YVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLG 453
           Y+    +  +    SA+I  Y++    +++  +F  ++    K DV    ++I   A  G
Sbjct: 402 YIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVE----KRDVFVWSAMIGGLAMHG 461

Query: 454 ALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIH 513
                                   N  +DM+ K               K N +++T++  
Sbjct: 462 C----------------------GNEAVDMFYKMQEAN---------VKPNGVTFTNVFC 521

Query: 514 ALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGIS 573
           A +  G    A SLFHQM+                  S+ G+V E               
Sbjct: 522 ACSHTGLVDEAESLFHQME------------------SNYGIVPE--------------- 581

Query: 574 PKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQ 633
              +H+ C+VD+ GR+  L +A++ IEAMP  P+  +WG+L+ AC+IH    L E A  +
Sbjct: 582 --EKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTR 641

Query: 634 VLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQM 693
           +L+LEP +DGA V+LSNIYAK  +WE+V E+RK M   G+ KE GCS IE++  +HEF  
Sbjct: 642 LLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLS 701

Query: 694 ADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVL-WHSEKLALCYA 753
            D  H  +++++ KL EV++KL   GY P+ + VL  ++EEE KE  L  HSEKLA+CY 
Sbjct: 702 GDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYG 738

Query: 754 LMN-EGPRIC-IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           L++ E P++  +IKNLR+C DCH+  KL S++Y REII+RDR RFHH+R+G CSC D+W
Sbjct: 762 LISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of ClCG05G005100 vs. TAIR10
Match: AT2G22070.1 (AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 471.9 bits (1213), Expect = 7.8e-133
Identity = 265/724 (36.60%), Positives = 411/724 (56.77%), Query Frame = 1

Query: 63  ILSSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLD 122
           +LS+ +    +D     FDQ+PQ  +     ++       +    + V   M  EG+   
Sbjct: 86  VLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPT 145

Query: 123 RYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFD 182
           ++    +L + +    + TG ++H    KLG   +  V   L+ MYA CG  M A+ VFD
Sbjct: 146 QFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFD 205

Query: 183 KMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPY 242
           +M  RD+ +W+ MI        ALH+ V   + L++  F  +      R I         
Sbjct: 206 RMVVRDISSWNAMI--------ALHMQVG-QMDLAMAQFEQMA----ERDIVT------- 265

Query: 243 PQIYRYCLSGF----YDL-AFQLFEEMKRTEL-EPDEMILSTVLSACARAGNLDFGTKIH 302
              +   +SGF    YDL A  +F +M R  L  PD   L++VLSACA    L  G +IH
Sbjct: 266 ---WNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIH 325

Query: 303 EFITKKNIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVS--TAMVSGLAKGGQ 362
             I      +   + +ALI+MY+ CG ++ A    E+   K++ +   TA++ G  K G 
Sbjct: 326 SHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGD 385

Query: 363 IGEARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISA 422
           + +A+ +F  + ++D++ W+AMI GY +     EA+ LF+ M   G +P+  T+ +++S 
Sbjct: 386 MNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSV 445

Query: 423 CAHLGALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMP-KKNVIS 482
            + L +L  GK I     K+G   ++S++NALI MYAK G++  A + F  +  +++ +S
Sbjct: 446 ASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVS 505

Query: 483 WTSMIHALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMT 542
           WTSMI ALA HG A  AL LF  M +E + P+ IT+VGV  AC+H GLV +GR+ F  M 
Sbjct: 506 WTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMK 565

Query: 543 DEYGISPKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELG 602
           D   I P   H+ CMVDLFGRA LL+EA E IE MP  P+ + WGSL++AC++H   +LG
Sbjct: 566 DVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLG 625

Query: 603 EFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNE 662
           + AA+++L LEP++ GA   L+N+Y+   +WE+  ++RK M    V KE+G S IE+ ++
Sbjct: 626 KVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHK 685

Query: 663 VHEFQMADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKL 722
           VH F + D  H + ++I+  + ++  ++   GY P T  VL DL+EE K++++  HSEKL
Sbjct: 686 VHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKL 745

Query: 723 ALCYALMN--EGPRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSC 776
           A+ + L++  +   + I+KNLR+C DCH  +K  SK+  REII+RD +RFHH++DG CSC
Sbjct: 746 AIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFCSC 786


HSP 2 Score: 156.4 bits (394), Expect = 7.3e-38
Identity = 126/522 (24.14%), Positives = 227/522 (43.49%), Query Frame = 1

Query: 125 CFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKM 184
           C   L K+ +++    T   +H    K G     ++   L+ +Y+  G  + AR +FD+M
Sbjct: 16  CTNLLQKSVNKSNGRFTAQLVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEM 75

Query: 185 SHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQ 244
             R   +W+ ++  Y           + ++  +   F  L     R  ++   +   Y  
Sbjct: 76  PLRTAFSWNTVLSAYSK---------RGDMDSTCEFFDQLPQ---RDSVSWTTMIVGYKN 135

Query: 245 IYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNI 304
           I      G Y  A ++  +M +  +EP +  L+ VL++ A    ++ G K+H FI K  +
Sbjct: 136 I------GQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGL 195

Query: 305 VMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQ 364
             +  + ++L+ MYA CG   +A   ++++  +++    AM++   + GQ+  A   F+Q
Sbjct: 196 RGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQ 255

Query: 365 MVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGM-KPDVVTILSVISACAHL----- 424
           M E+D++ W++MISG+ +      AL +F KM +  +  PD  T+ SV+SACA+L     
Sbjct: 256 MAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCI 315

Query: 425 GALDQGKWIQTYVDKNGF--------------------------GKALSINN--ALIDMY 484
           G       + T  D +G                            K L I    AL+D Y
Sbjct: 316 GKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGY 375

Query: 485 AKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVEPNWITFV 544
            K G +  A+ +F  +  ++V++WT+MI     HG    A++LF  M      PN  T  
Sbjct: 376 IKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLA 435

Query: 545 GVLYACSHGGLVEEGRRIFHSMT---DEYGISPKHEHFGCMVDLFGRANLLREALEVIEA 604
            +L   S    +  G++I  S     + Y +S  +     ++ ++ +A  +  A    + 
Sbjct: 436 AMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSN----ALITMYAKAGNITSASRAFDL 495

Query: 605 MPFAPNAIIWGSLMAACQIHGETE--LGEFAAKQVLKLEPDH 608
           +    + + W S++ A   HG  E  L  F    +  L PDH
Sbjct: 496 IRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDH 515

BLAST of ClCG05G005100 vs. TAIR10
Match: AT1G20230.1 (AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 471.1 bits (1211), Expect = 1.3e-132
Identity = 253/756 (33.47%), Positives = 440/756 (58.20%), Query Frame = 1

Query: 26  SAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSPSLDYALSVFDQIPQ 85
           S++   +SSL    Q HA+IL+S  +   ++  +   +++S +     + A  V   IP 
Sbjct: 22  SSSYHWSSSLSKTTQAHARILKSGAQ---NDGYISAKLIASYSNYNCFNDADLVLQSIPD 81

Query: 86  PKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGMEI 145
           P     + L+  L++      ++ V+ +M + GL  D +  P L K  +   + + G +I
Sbjct: 82  PTIYSFSSLIYALTKAKLFTQSIGVFSRMFSHGLIPDSHVLPNLFKVCAELSAFKVGKQI 141

Query: 146 HGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVVA 205
           H ++   G   D FV+  +  MY  CGR+ +AR VFD+MS +DVV  S ++  Y A    
Sbjct: 142 HCVSCVSGLDMDAFVQGSMFHMYMRCGRMGDARKVFDRMSDKDVVTCSALLCAY-ARKGC 201

Query: 206 LHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEMK 265
           L  +V++   LS +  S + A +    ++   I + + +      SG++  A  +F+++ 
Sbjct: 202 LEEVVRI---LSEMESSGIEANI----VSWNGILSGFNR------SGYHKEAVVMFQKIH 261

Query: 266 RTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSMD 325
                PD++ +S+VL +   +  L+ G  IH ++ K+ ++ D  + SA+I MY   G + 
Sbjct: 262 HLGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKSGHVY 321

Query: 326 LAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFD----QMVEKDLICWSAMISGYT 385
                + +       V  A ++GL++ G + +A  +F+    Q +E +++ W+++I+G  
Sbjct: 322 GIISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSIIAGCA 381

Query: 386 ESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFGKALS 445
           ++    EAL LF++MQ  G+KP+ VTI S++ AC ++ AL  G+    +  +      + 
Sbjct: 382 QNGKDIEALELFREMQVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLLDNVH 441

Query: 446 INNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVEN 505
           + +ALIDMYAKCG +  ++ VF  MP KN++ W S+++  +MHG A   +S+F  +    
Sbjct: 442 VGSALIDMYAKCGRINLSQIVFNMMPTKNLVCWNSLMNGFSMHGKAKEVMSIFESLMRTR 501

Query: 506 VEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREA 565
           ++P++I+F  +L AC   GL +EG + F  M++EYGI P+ EH+ CMV+L GRA  L+EA
Sbjct: 502 LKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEA 561

Query: 566 LEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKE 625
            ++I+ MPF P++ +WG+L+ +C++    +L E AA+++  LEP++ G  V+LSNIYA +
Sbjct: 562 YDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAK 621

Query: 626 RRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKL 685
             W +V  +R  M  +G+ K  GCS I++ N V+     D++H Q DQI +K+DE+ +++
Sbjct: 622 GMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISKEM 681

Query: 686 NLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMN--EGPRICIIKNLRICEDCH 745
             +G+ P  ++ L D++E+E+++++  HSEKLA+ + L+N  +G  + +IKNLRIC DCH
Sbjct: 682 RKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCH 741

Query: 746 AFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           A +K  S    REI IRD +RFHH++DG+CSC D+W
Sbjct: 742 AVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDFW 760

BLAST of ClCG05G005100 vs. NCBI nr
Match: gi|1009141445|ref|XP_015888199.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14820 [Ziziphus jujuba])

HSP 1 Score: 935.6 bits (2417), Expect = 5.4e-269
Identity = 485/780 (62.18%), Positives = 572/780 (73.33%), Query Frame = 1

Query: 1   MEMLSHSTSVLPLQLHTYPTRP--TALSAALSSASSLFHLKQVHAQILRSKLERCDSNSL 60
           M  L+ +T  LP      P     + L  ALS+++++  LKQVHAQILRSKL+R  SN L
Sbjct: 1   MSALAQTTLALPPNPSFTPNSAAYSTLFTALSTSTTITQLKQVHAQILRSKLDR--SNPL 60

Query: 61  LFELILSSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEG 120
           L +L+LSSC LSPSLDYALSVF+QI  P T+ CNK LR+LSR +EP   L VY KMR+EG
Sbjct: 61  LIKLVLSSCVLSPSLDYALSVFNQISNPPTQFCNKFLRELSRRAEPSKALLVYGKMRSEG 120

Query: 121 LS-LDRYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEA 180
           L  +DR+ FPP+LKA SR  +L  GMEIHG+ASKLGF                       
Sbjct: 121 LGGVDRFSFPPILKAVSRAEALTEGMEIHGVASKLGF----------------------- 180

Query: 181 RLVFDKMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAEL 240
                    +D    + ++  Y     A   +++  +    +S   +  W          
Sbjct: 181 --------DKDPFVQTGLVRMY----AACGRIMEARLMFDKMSHRDVVTWSI-------- 240

Query: 241 IFAPYPQIYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHE 300
                  I  YC SG +D  F LFEEMK + +EPD MILSTVLSAC RAGNL +G  IH+
Sbjct: 241 ------MIDGYCQSGLFDYVFHLFEEMKSSSVEPDGMILSTVLSACGRAGNLGYGRAIHD 300

Query: 301 FITKKNIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGE 360
           FIT+ N+V+D HL SAL+ MYASCGSMDLA  FY K+SPK++V STAMVSG +K GQI +
Sbjct: 301 FITENNVVLDSHLNSALVAMYASCGSMDLARQFYNKMSPKSLVASTAMVSGYSKLGQIED 360

Query: 361 ARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAH 420
           AR +F+Q+VEKDLICWSAMISGY ESD PQEAL LF +MQ  G++PD VTILSVISACAH
Sbjct: 361 ARLIFNQLVEKDLICWSAMISGYAESDLPQEALRLFNEMQVLGIRPDQVTILSVISACAH 420

Query: 421 LGALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSM 480
           LGALDQ  WI  YVDKNGF  AL +NNALIDMYAKCGSLE A+ VF +MP+KNVISWTSM
Sbjct: 421 LGALDQANWIHIYVDKNGFWGALPVNNALIDMYAKCGSLERAKGVFERMPRKNVISWTSM 480

Query: 481 IHALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYG 540
           I A AMHGDA NALS F++MK EN+EPN +TFVGVLYACSH GLVEEGR  F SM  EY 
Sbjct: 481 ISAFAMHGDANNALSFFNRMKDENIEPNGVTFVGVLYACSHAGLVEEGRNFFASMIREYN 540

Query: 541 ISPKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAA 600
           ++PKHEH+GCMVDLFGRANLLREALEV+EAMP APN +IWGSLMAAC+IHGE ELGEFAA
Sbjct: 541 LTPKHEHYGCMVDLFGRANLLREALEVVEAMPMAPNVVIWGSLMAACRIHGENELGEFAA 600

Query: 601 KQVLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEF 660
           KQ+L+L+PDHDGALVVLSNIYAK++RW+DV +VR LM   G+ KERG SRIELNNEV+EF
Sbjct: 601 KQLLELDPDHDGALVVLSNIYAKQKRWDDVRKVRNLMKNSGIFKERGYSRIELNNEVYEF 660

Query: 661 QMADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCY 720
            M DR HKQADQI++KLD+VV +L L GY P T  VLVDL+EEEKKE+VLWHSEKLALCY
Sbjct: 661 LMGDRKHKQADQIYEKLDKVVSELKLVGYAPNTCSVLVDLEEEEKKEVVLWHSEKLALCY 720

Query: 721 ALM--NEGPRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
            L+       I I+KNLRICEDCH FMKL SKVY +EI+IRDR+RFHHY+DG+CSCKDYW
Sbjct: 721 GLICDKNASSIRIVKNLRICEDCHTFMKLVSKVYGKEIVIRDRTRFHHYKDGVCSCKDYW 729
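
The identity and positives percentages in each HSP summary line are ratios of matching (or conservatively substituted) alignment columns to the total alignment length, gaps included. The following is a minimal Python sketch that recomputes them from the raw counts. It assumes the summary line keeps the `label = n/m (p%)` layout seen in these records; `parse_hsp_ratios` is a hypothetical helper name and not part of any BLAST tool.

```python
import re

# Matches fragments such as "Identity = 485/780 (62.18%)".
# "Query Frame = 1" has no n/m ratio, so it is skipped.
RATIO = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def parse_hsp_ratios(line):
    """Return {label: (matches, alignment_length, percent)}.

    The percentage is recomputed from the counts rather than
    copied from the text, so it can be used as a consistency check.
    """
    out = {}
    for label, num, den, _reported in RATIO.findall(line):
        n, d = int(num), int(den)
        out[label] = (n, d, round(100.0 * n / d, 2))
    return out

line = "Identity = 485/780 (62.18%), Positives = 572/780 (73.33%), Query Frame = 1"
ratios = parse_hsp_ratios(line)
print(ratios["Identity"])   # (485, 780, 62.18)
print(ratios["Positives"])  # (572, 780, 73.33)
```

Note that the denominator (780) is the alignment length, which can exceed the query length (776) because gap columns are counted.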

BLAST of ClCG05G005100 vs. NCBI nr
Match: gi|225432698|ref|XP_002278762.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14820 [Vitis vinifera])

HSP 1 Score: 910.6 bits (2352), Expect = 1.9e-261
Identity = 472/772 (61.14%), Positives = 568/772 (73.58%), Query Frame = 1

Query: 12  PLQLHTYPTRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSP 71
           P  LH++ T    L +ALSSA+SL HLKQVHAQILRSKL+R  S SLL +L++SSCALS 
Sbjct: 17  PTTLHSHHT----LFSALSSATSLTHLKQVHAQILRSKLDR--STSLLVKLVISSCALSS 76

Query: 72  SLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLK 131
           SLDYALSVF+ IP+P+T LCN+ LR+LSR  EPE TL VYE+MR +GL++DR+ FPPLLK
Sbjct: 77  SLDYALSVFNLIPKPETHLCNRFLRELSRSEEPEKTLLVYERMRTQGLAVDRFSFPPLLK 136

Query: 132 AASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVA 191
           A SR  SL  G+EIHGLA+KLGF SDPFV+TGLVRMYAACGRI EARL+FDKM HRDVV 
Sbjct: 137 ALSRVKSLVEGLEIHGLAAKLGFDSDPFVQTGLVRMYAACGRIAEARLMFDKMFHRDVVT 196

Query: 192 WSIMIDGY------EANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQI 251
           WSIMIDGY         ++    +   NV    +  S++ +   R               
Sbjct: 197 WSIMIDGYCQSGLFNDALLLFEEMKNYNVEPDEMMLSTVLSACGR--------------- 256

Query: 252 YRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIV 311
                +G       + + +    +  D  + S +++  A  G++D    + E +T KN+V
Sbjct: 257 -----AGNLSYGKMIHDFIMENNIVVDPHLQSALVTMYASCGSMDLALNLFEKMTPKNLV 316

Query: 312 MDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQM 371
                 +A++T Y                               +K GQI  AR VF+QM
Sbjct: 317 ----ASTAMVTGY-------------------------------SKLGQIENARSVFNQM 376

Query: 372 VEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGK 431
           V+KDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VT+LSVI+ACAHLGALDQ K
Sbjct: 377 VKKDLVCWSAMISGYAESDSPQEALNLFNEMQSLGIKPDQVTMLSVITACAHLGALDQAK 436

Query: 432 WIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHG 491
           WI  +VDKNGFG AL INNALI+MYAKCGSLE AR++F KMP+KNVISWT MI A AMHG
Sbjct: 437 WIHLFVDKNGFGGALPINNALIEMYAKCGSLERARRIFDKMPRKNVISWTCMISAFAMHG 496

Query: 492 DAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHF 551
           DA +AL  FHQM+ EN+EPN ITFVGVLYACSH GLVEEGR+IF+SM +E+ I+PKH H+
Sbjct: 497 DAGSALRFFHQMEDENIEPNGITFVGVLYACSHAGLVEEGRKIFYSMINEHNITPKHVHY 556

Query: 552 GCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEP 611
           GCMVDLFGRANLLREALE++EAMP APN IIWGSLMAAC++HGE ELGEFAAK++L+L+P
Sbjct: 557 GCMVDLFGRANLLREALELVEAMPLAPNVIIWGSLMAACRVHGEIELGEFAAKRLLELDP 616

Query: 612 DHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHK 671
           DHDGA V LSNIYAK RRWEDVG+VRKLM   G+SKERGCSR ELNNE+HEF +ADR+HK
Sbjct: 617 DHDGAHVFLSNIYAKARRWEDVGQVRKLMKHKGISKERGCSRFELNNEIHEFLVADRSHK 676

Query: 672 QADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPR 731
            AD+I++KL EVV KL L GY+P T  +LVDL+EEEKKE+VLWHSEKLALCY LM +G  
Sbjct: 677 HADEIYEKLYEVVSKLKLVGYSPNTCSILVDLEEEEKKEVVLWHSEKLALCYGLMRDGTG 727

Query: 732 IC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
            C  IIKNLR+CEDCH F+KLASKVY REI++RDR+RFHHY+DG+CSCKDYW
Sbjct: 737 SCIRIIKNLRVCEDCHTFIKLASKVYEREIVVRDRTRFHHYKDGVCSCKDYW 727

BLAST of ClCG05G005100 vs. NCBI nr
Match: gi|566189984|ref|XP_002315764.2| (hypothetical protein POPTR_0010s09620g [Populus trichocarpa])

HSP 1 Score: 880.2 bits (2273), Expect = 2.7e-252
Identity = 462/773 (59.77%), Positives = 555/773 (71.80%), Query Frame = 1

Query: 17  TYPTRPT-----ALSAALSSAS-----SLFHLKQVHAQILRSKLERCDSNSLLFELILSS 76
           T PT P      AL AALSS S     SL HLKQ+HAQ+LRS L      SLL EL+LSS
Sbjct: 6   TLPTIPVPLTSIALHAALSSTSTSTPTSLPHLKQIHAQVLRSNLPP----SLLLELLLSS 65

Query: 77  CALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRA-EGL-SLDRY 136
                SLDYALSVF  +P+  T L NKL R LSR ++PE  L  YEK+R  EGL  +DR+
Sbjct: 66  S----SLDYALSVFTHLPKCHTPLSNKLFRSLSRSAKPETALLAYEKIRLKEGLLGIDRF 125

Query: 137 CFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKM 196
            FPPLLKAASR   L  G EIHG+A+KLGF                              
Sbjct: 126 SFPPLLKAASRASGLNEGKEIHGVATKLGF------------------------------ 185

Query: 197 SHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQ 256
             +D    + ++  Y     +   + +  +    +S+  + AW                 
Sbjct: 186 -DKDPFVQTGLVGMY----ASCDRISEARLVFDKMSYRDVVAWSI--------------M 245

Query: 257 IYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNI 316
           I  Y  SG YD   QLFEEM+ + L+PDEM+L+T++SAC RA NL +G  IH+FI + N 
Sbjct: 246 IDGYHQSGLYDDVLQLFEEMRSSNLKPDEMVLTTIISACGRARNLSYGEAIHDFIIENNF 305

Query: 317 VMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQ 376
           V+D +LQSAL+TMYASCG M++A   + KIS +N+VV TAM+SG ++ G++ +AR +FDQ
Sbjct: 306 VLDTYLQSALLTMYASCGCMEMAQKLFTKISSRNLVVLTAMISGYSRVGRVEDARLIFDQ 365

Query: 377 MVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQG 436
           M EKDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VTILSVISACA LG LD+ 
Sbjct: 366 MEEKDLVCWSAMISGYAESDKPQEALNLFSEMQVFGIKPDQVTILSVISACARLGVLDRA 425

Query: 437 KWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMH 496
           KWI  YVDKNG G AL +NNALIDMYAKCG+L  AR VF KM  +NVISWTSMI+A A+H
Sbjct: 426 KWIHMYVDKNGLGGALPVNNALIDMYAKCGNLGAARGVFEKMQSRNVISWTSMINAFAIH 485

Query: 497 GDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEH 556
           GDA NAL  F+QMK EN++PN +TFVGVLYACSH GLVEEGRR F SMT+E+ I+PKHEH
Sbjct: 486 GDASNALKFFYQMKDENIKPNGVTFVGVLYACSHAGLVEEGRRTFASMTNEHNITPKHEH 545

Query: 557 FGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLE 616
           +GCMVDLFGRANLLR+ALE++E MP APN +IWGSLMAACQIHGE ELGEFAAKQVL+LE
Sbjct: 546 YGCMVDLFGRANLLRDALELVETMPLAPNVVIWGSLMAACQIHGENELGEFAAKQVLELE 605

Query: 617 PDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNH 676
           PDHDGALV LSNIYAK+RRW+DVGE+R LM + G+SKERGCSRIELNN+V+EF MAD+ H
Sbjct: 606 PDHDGALVQLSNIYAKDRRWQDVGELRNLMKQRGISKERGCSRIELNNQVYEFVMADKKH 665

Query: 677 KQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGP 736
           KQAD+I++KLDEVV++L L GYTP T  VLVD++EE KKE+VLWHSEKLALCY LM EG 
Sbjct: 666 KQADKIYEKLDEVVKELKLVGYTPNTRSVLVDVEEEGKKEVVLWHSEKLALCYGLMGEGK 721

Query: 737 RIC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
             C  I+KNLR+CEDCH F+KL SKVY  EII+RDR+RFHHY+ G+CSC DYW
Sbjct: 726 GSCIRIVKNLRVCEDCHTFIKLVSKVYGMEIIVRDRTRFHHYKAGVCSCNDYW 721

BLAST of ClCG05G005100 vs. NCBI nr
Match: gi|743859751|ref|XP_011030676.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14820-like [Populus euphratica])

HSP 1 Score: 877.5 bits (2266), Expect = 1.7e-251
Identity = 460/775 (59.35%), Positives = 553/775 (71.35%), Query Frame = 1

Query: 8   TSVLPLQLHTYPTRPTALSAALSSA---SSLFHLKQVHAQILRSKLERCDSNSLLFELIL 67
           ++V  L     P   TAL AALSS    +SL HLKQ+HAQ+LRS L      SLL +L+L
Sbjct: 17  STVTTLPTIPVPLTSTALQAALSSTPTPTSLPHLKQIHAQVLRSNLPP----SLLLKLLL 76

Query: 68  SSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRA-EGL-SLD 127
           SS     SLDYALSVF  +P+  T L NKL R LSR  +PE  L  YEK+R  EGL  +D
Sbjct: 77  SSS----SLDYALSVFTHLPKCHTPLSNKLFRSLSRSDKPETALLAYEKIRLKEGLLGID 136

Query: 128 RYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFD 187
           R+ FPPLLKAASR   L  G EIHG+A+KLGF                            
Sbjct: 137 RFSFPPLLKAASRASGLNEGKEIHGVATKLGF---------------------------- 196

Query: 188 KMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPY 247
               +D    + ++  Y     A   + +  +    +S+  + AW               
Sbjct: 197 ---DKDPFVQTGLVGMY----AACDRISEARLVFDRMSYRDVVAWSI------------- 256

Query: 248 PQIYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKK 307
             I  Y  SG YD   QLFEEM+ + L+PDEM+L+T++SAC RA NL +G  IH+FI + 
Sbjct: 257 -MIDGYHQSGLYDDVLQLFEEMRSSNLKPDEMVLTTIISACGRARNLSYGEAIHDFIIEN 316

Query: 308 NIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVF 367
           N V+D +LQSAL+TMYASCG M++A   +  IS +N+VV TAM+SG ++ G++ +AR +F
Sbjct: 317 NFVLDTYLQSALLTMYASCGCMEMAQKLFTNISSRNLVVLTAMISGFSRVGRVEDARLIF 376

Query: 368 DQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALD 427
           DQM EKDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VTILSVISACA LG LD
Sbjct: 377 DQMEEKDLVCWSAMISGYAESDKPQEALNLFSEMQVFGIKPDQVTILSVISACARLGVLD 436

Query: 428 QGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALA 487
           + KWI  YVDKNG G AL +NNALIDMYAKCG+L  AR VF KM  +NVISWTSMI+A A
Sbjct: 437 RAKWIHMYVDKNGLGGALPVNNALIDMYAKCGNLGAARGVFEKMQSRNVISWTSMINAFA 496

Query: 488 MHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKH 547
           +HGDA NAL  F QMK EN++PN +TFVGVLYACSH GLVEEGRR F SMT+E+ I+PKH
Sbjct: 497 IHGDASNALKFFCQMKDENIKPNGVTFVGVLYACSHAGLVEEGRRAFASMTNEHNITPKH 556

Query: 548 EHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLK 607
           EH+GCMVDLFGRANLLR+ALE++E MP APN +IWGSLMAACQIHGE ELGEFAAKQVL+
Sbjct: 557 EHYGCMVDLFGRANLLRDALELVETMPLAPNVVIWGSLMAACQIHGENELGEFAAKQVLE 616

Query: 608 LEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADR 667
           LEPDHDGALV LSNIYAK+RRW+DVGE+R LM + G+SKERGCS IELNN+VHEF MAD+
Sbjct: 617 LEPDHDGALVQLSNIYAKDRRWQDVGELRNLMKQRGISKERGCSWIELNNQVHEFVMADK 676

Query: 668 NHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNE 727
            HKQAD+I++KLDEVV++L L GYTP TN VLVD++EE KKE+VLWHSEKLALCY LM E
Sbjct: 677 KHKQADKIYEKLDEVVKELKLVGYTPNTNSVLVDVEEEGKKEVVLWHSEKLALCYGLMGE 734

Query: 728 GPRIC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
               C  I+KNLR+CEDCH F+KL SKVY  EII+RDR+RFHHY+ G+CSC DYW
Sbjct: 737 AKGSCIRIVKNLRVCEDCHTFIKLVSKVYGMEIIVRDRTRFHHYKAGVCSCNDYW 734

BLAST of ClCG05G005100 vs. NCBI nr
Match: gi|743821741|ref|XP_011021496.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14820-like [Populus euphratica])

HSP 1 Score: 875.5 bits (2261), Expect = 6.6e-251
Identity = 459/775 (59.23%), Positives = 552/775 (71.23%), Query Frame = 1

Query: 8   TSVLPLQLHTYPTRPTALSAALSSA---SSLFHLKQVHAQILRSKLERCDSNSLLFELIL 67
           ++V  L     P   TAL AALSS    +SL HLKQ+HA +LRS L      SLL +L+L
Sbjct: 17  STVTTLPTIPVPLTSTALQAALSSTPTPTSLPHLKQIHAHVLRSNLPP----SLLLKLLL 76

Query: 68  SSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRA-EGL-SLD 127
           SS     SLDYALSVF  +P+  T L NKL R LSR  +PE  L  YEK+R  EGL  +D
Sbjct: 77  SSS----SLDYALSVFTHLPKCHTPLSNKLFRSLSRSDKPETALLAYEKIRLKEGLLGID 136

Query: 128 RYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFD 187
           R+ FPPLLKAASR   L  G EIHG+A+KLGF                            
Sbjct: 137 RFSFPPLLKAASRASGLNEGKEIHGVATKLGF---------------------------- 196

Query: 188 KMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPY 247
               +D    + ++  Y     A   + +  +    +S+  + AW               
Sbjct: 197 ---DKDPFVQTGLVGMY----AACDRISEARLVFDRMSYRDVVAWSI------------- 256

Query: 248 PQIYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKK 307
             I  Y  SG YD   QLFEEM+ + L+PDEM+L+T++SAC RA NL +G  IH+FI + 
Sbjct: 257 -MIDGYHQSGLYDDVLQLFEEMRSSNLKPDEMVLTTIISACGRARNLSYGEAIHDFIIEN 316

Query: 308 NIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVF 367
           N V+D +LQSAL+TMYASCG M++A   +  IS +N+VV TAM+SG ++ G++ +AR +F
Sbjct: 317 NFVLDTYLQSALLTMYASCGCMEMAQKLFTNISSRNLVVLTAMISGFSRVGRVEDARLIF 376

Query: 368 DQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALD 427
           DQM EKDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VTILSVISACA LG LD
Sbjct: 377 DQMEEKDLVCWSAMISGYAESDKPQEALNLFSEMQVFGIKPDQVTILSVISACARLGVLD 436

Query: 428 QGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALA 487
           + KWI  YVDKNG G AL +NNALIDMYAKCG+L  AR VF KM  +NVISWTSMI+A A
Sbjct: 437 RAKWIHMYVDKNGLGGALPVNNALIDMYAKCGNLGAARGVFEKMQSRNVISWTSMINAFA 496

Query: 488 MHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKH 547
           +HGDA NAL  F QMK EN++PN +TFVGVLYACSH GLVEEGRR F SMT+E+ I+PKH
Sbjct: 497 IHGDASNALKFFCQMKDENIKPNGVTFVGVLYACSHAGLVEEGRRAFASMTNEHNITPKH 556

Query: 548 EHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLK 607
           EH+GCMVDLFGRANLLR+ALE++E MP APN +IWGSLMAACQIHGE ELGEFAAKQVL+
Sbjct: 557 EHYGCMVDLFGRANLLRDALELVETMPLAPNVVIWGSLMAACQIHGENELGEFAAKQVLE 616

Query: 608 LEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADR 667
           LEPDHDGALV LSNIYAK+RRW+DVGE+R LM + G+SKERGCS IELNN+VHEF MAD+
Sbjct: 617 LEPDHDGALVQLSNIYAKDRRWQDVGELRNLMKQRGISKERGCSWIELNNQVHEFVMADK 676

Query: 668 NHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNE 727
            HKQAD+I++KLDEVV++L L GYTP TN VLVD++EE KKE+VLWHSEKLALCY LM E
Sbjct: 677 KHKQADKIYEKLDEVVKELKLVGYTPNTNSVLVDVEEEGKKEVVLWHSEKLALCYGLMGE 734

Query: 728 GPRIC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
               C  I+KNLR+CEDCH F+KL SKVY  EII+RDR+RFHHY+ G+CSC DYW
Sbjct: 737 AKGSCIRIVKNLRVCEDCHTFIKLVSKVYGMEIIVRDRTRFHHYKAGVCSCNDYW 734

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
PP311_ARATH   1.3e-209  50.52     Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana GN...
PPR21_ARATH   5.2e-155  38.99     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
PP175_ARATH   7.8e-135  43.94     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP168_ARATH   1.4e-131  36.60     Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana GN...
PPR53_ARATH   2.4e-131  33.47     Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN...

Match Name         E-value   Identity  Description
D7T700_VITVI       1.3e-261  61.14     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g03630 PE=4 SV=...
B9HUV1_POPTR       1.9e-252  59.77     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s09620g PE=4 SV=2
A0A067L6S3_JATCU   5.5e-244  56.31     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02188 PE=4 SV=1
A0A061EMY0_THECC   4.6e-243  57.27     Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_020...
I1LFU4_SOYBN       4.8e-240  57.24     Uncharacterized protein OS=Glycine max GN=GLYMA_11G006200 PE=4 SV=2

Match Name    E-value   Identity  Description
AT4G14820.1   7.1e-211  50.52     Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1   2.9e-156  38.99     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G29760.1   4.4e-136  43.94     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G22070.1   7.8e-133  36.60     pentatricopeptide (PPR) repeat-containing protein
AT1G20230.1   1.3e-132  33.47     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                          E-value   Identity  Description
gi|1009141445|ref|XP_015888199.1|   5.4e-269  62.18     PREDICTED: pentatricopeptide repeat-containing protein At4g14820 [Ziziphus jujub...
gi|225432698|ref|XP_002278762.1|    1.9e-261  61.14     PREDICTED: pentatricopeptide repeat-containing protein At4g14820 [Vitis vinifera...
gi|566189984|ref|XP_002315764.2|    2.7e-252  59.77     hypothetical protein POPTR_0010s09620g [Populus trichocarpa]
gi|743859751|ref|XP_011030676.1|    1.7e-251  59.35     PREDICTED: pentatricopeptide repeat-containing protein At4g14820-like [Populus e...
gi|743821741|ref|XP_011021496.1|    6.6e-251  59.23     PREDICTED: pentatricopeptide repeat-containing protein At4g14820-like [Populus e...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG05G005100.1  ClCG05G005100.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 545..568  score: 0.31
    coord: 247..266  score: 0.03
    coord: 164..187  score:
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 469..516  score: 9.6E-10
    coord: 368..416  score: 1.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 343..370  score: 1.8E-5
    coord: 371..405  score: 2.0E-7
    coord: 245..273  score: 7.0E-4
    coord: 444..470  score: 3.9E-4
    coord: 472..506  score: 2.8E-6
    coord: 508..540  score: 0.
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 439..469  score: 8.089
    coord: 87..121   score: 7.235
    coord: 369..403  score: 11.663
    coord: 307..337  score: 6.774
    coord: 338..368  score: 8.78
    coord: 272..306  score: 7.815
    coord: 157..191  score: 8.331
    coord: 607..641  score: 6.127
    coord: 541..571  score: 5.864
    coord: 404..438  score: 6.412
    coord: 470..504  score: 11.246
    coord: 237..271  score: 7.103
    coord: 505..540  score: 7.476
    coord: 56..86    score: 5
IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 555..626  score: 2.1E-8
    coord: 317..490  score: 2.
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 11..197   score: 0.0
    coord: 247..648  score:
None  No IPR available  PANTHER  PTHR24015:SF581  SUBFAMILY NOT NAMED
    coord: 247..648  score: 0.0
    coord: 11..197   score:
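
Each `coord: start..end` entry above marks where a predicted domain sits on the protein. The following is a minimal Python sketch for pulling those spans out of the annotation text and computing their lengths. It assumes the coordinates are 1-based and inclusive, which this page does not state; `domain_spans` is a hypothetical helper name.

```python
import re

# Matches "coord: 545..568" wherever it appears in the text.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")

def domain_spans(text):
    """Return a list of (start, end, length) for every coord entry.

    Assumption: coordinates are 1-based and inclusive, so the
    length of a span is end - start + 1.
    """
    spans = []
    for a, b in COORD.findall(text):
        start, end = int(a), int(b)
        spans.append((start, end, end - start + 1))
    return spans

# Sample taken from the PFAM PF01535 row.
sample = "coord: 545..568 score: 0.31 coord: 247..266 score: 0.03 coord: 164..187"
for start, end, length in domain_spans(sample):
    print(start, end, length)
```

Sorting the returned tuples by `start` gives the domains in order along the protein, which makes overlaps between PFAM, TIGRFAMs and PROFILE hits easier to spot.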

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene           Organism                      Block
ClCG05G005100  Bottle gourd (USVL1VR-Ls)     lsiwcgB016
ClCG05G005100  Bottle gourd (USVL1VR-Ls)     lsiwcgB195
ClCG05G005100  Cucumber (Gy14) v2            cgybwcgB203
ClCG05G005100  Cucumber (Gy14) v2            cgybwcgB519
ClCG05G005100  Melon (DHL92) v3.6.1          medwcgB206
ClCG05G005100  Melon (DHL92) v3.6.1          medwcgB416
ClCG05G005100  Silver-seed gourd             carwcgB0111
ClCG05G005100  Silver-seed gourd             carwcgB0317
ClCG05G005100  Silver-seed gourd             carwcgB0821
ClCG05G005100  Cucumber (Chinese Long) v3    cucwcgB225
ClCG05G005100  Cucumber (Chinese Long) v3    cucwcgB587
ClCG05G005100  Watermelon (97103) v2         wcgwmbB237
ClCG05G005100  Watermelon (97103) v2         wcgwmbB253
ClCG05G005100  Wax gourd                     wcgwgoB412
ClCG05G005100  Wax gourd                     wcgwgoB478
ClCG05G005100  Watermelon (Charleston Gray)  wcgwcgB024
ClCG05G005100  Watermelon (Charleston Gray)  wcgwcgB160
ClCG05G005100  Cucumber (Gy14) v1            cgywcgB451
ClCG05G005100  Cucurbita maxima (Rimu)       cmawcgB233
ClCG05G005100  Cucurbita maxima (Rimu)       cmawcgB795
ClCG05G005100  Cucurbita moschata (Rifu)     cmowcgB219
ClCG05G005100  Cucurbita moschata (Rifu)     cmowcgB317
ClCG05G005100  Cucurbita moschata (Rifu)     cmowcgB792
ClCG05G005100  Wild cucumber (PI 183967)     cpiwcgB226
ClCG05G005100  Wild cucumber (PI 183967)     cpiwcgB588
ClCG05G005100  Cucumber (Chinese Long) v2    cuwcgB559
ClCG05G005100  Melon (DHL92) v3.5.1          mewcgB212
ClCG05G005100  Melon (DHL92) v3.5.1          mewcgB427
ClCG05G005100  Watermelon (97103) v1         wcgwmB271
ClCG05G005100  Watermelon (97103) v1         wcgwmB311
ClCG05G005100  Cucurbita pepo (Zucchini)     cpewcgB293
ClCG05G005100  Cucurbita pepo (Zucchini)     cpewcgB559