ClCG05G005100.1 (mRNA) Watermelon (Charleston Gray)

Name: ClCG05G005100.1
Type: mRNA
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Pentatricopeptide repeat-containing protein
Location: CG_Chr05 : 4936028 .. 4938507 (-)
Sequence length: 2328
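
The Location and Sequence length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page) and that the location string always has the `SEQID : START .. END (STRAND)` shape; `parse_location` is a hypothetical helper written for this illustration, not part of any database API.

```python
import re


def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its parts.

    Assumes 1-based, inclusive coordinates (an assumption, see above).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


seqid, start, end, strand = parse_location("CG_Chr05 : 4936028 .. 4938507 (-)")

genomic_span = end - start + 1          # bases covered on the chromosome
mrna_length = 2328                      # "Sequence length" from the record
non_exonic = genomic_span - mrna_length # bases in the span but not in the mRNA
codons = mrna_length // 3               # whole codons, including the stop

print(seqid, strand, genomic_span, non_exonic, codons)
```

Under these assumptions the span is 2480 bp, 152 bp longer than the mRNA, which matches a single spliced-out region in the gene sequence listed below; the mRNA length is also a whole number of codons (776).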
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAATGCTCAGCCACTCAACCTCCGTCCTCCCTCTTCAACTTCACACATACCCCACCAGACCCACCGCTCTCTCCGCCGCTCTCTCCTCCGCCTCCAGCCTCTTCCACCTCAAACAAGTCCACGCTCAAATCCTTCGCTCCAAACTCGAACGCTGTGATTCCAATTCCCTTCTTTTTGAACTTATTCTTTCCTCTTGTGCTCTCTCGCCTAGCCTCGACTATGCCCTCTCTGTGTTTGATCAAATTCCCCAGCCCAAGACCCGTCTCTGCAACAAGCTTCTGCGCCAATTATCACGAGGTTCTGAGCCGGAGTTTACGCTTTTTGTATACGAGAAGATGAGGGCGGAGGGTCTGAGTTTGGATAGGTACTGCTTCCCTCCGCTGTTGAAAGCTGCTTCGAGGAATCTTTCCTTGAGAACGGGGATGGAGATTCATGGGCTCGCGTCGAAGTTGGGATTTGGGTCGGACCCATTTGTGGAGACGGGTTTGGTTAGAATGTACGCAGCCTGTGGACGGATAATGGAAGCTCGGTTGGTGTTTGATAAAATGTCTCACAGGGATGTCGTTGCTTGGAGCATCATGATTGATGGGTATGAAGCTAATGTTGTTGCTCTGCATTTACTAGTGAAGGTTAATGTGGTAAAATTTAGTTGATTTTAGATTTTCATGGAAATTTTGTTTCAGACTTTCAATTTACTATGTTATAACCATGGATATTGTGGCCAATTCTAGTAGTCTTCCAAATGAGAAAGAGTTATAAATTTACTTTGGACGTTGTATTGTTATGGTAGTATCTTTCTATCTTATCATTTTCCAGTTTAAATGCATGGCTGTTTAGGAGACAAATTACTGCGGAACTTATATTTGCCCCTTATCCTCAAATATACAGGTATTGCTTAAGTGGCTTTTATGATCTTGCCTTTCAACTCTTTGAAGAAATGAAGAGAACAGAGTTGGAACCAGATGAGATGATTCTTTCTACAGTTCTTTCTGCATGCGCTCGTGCTGGAAATTTGGATTTTGGAACAAAAATACACGAGTTCATTACTAAGAAGAATATTGTCATGGATCCTCATTTACAAAGTGCTCTCATCACAATGTATGCGAGCTGTGGCTCCATGGACTTGGCTTGGGATTTCTATGAAAAGATTTCCCCCAAGAACATGGTTGTTTCGACTGCCATGGTTTCTGGGCTTGCAAAAGGTGGACAGATTGGAGAAGCTCGCTACGTGTTTGATCAGATGGTAGAGAAGGACTTGATATGTTGGAGCGCAATGATTTCTGGCTATACAGAGAGTGACTGCCCTCAAGAGGCTCTTGTATTATTCAAGAAAATGCAACAGCAGGGAATGAAACCTGATGTAGTCACCATATTGAGTGTTATTTCAGCTTGTGCTCATCTTGGCGCATTAGATCAAGGCAAATGGATACAAACTTATGTTGATAAAAATGGGTTTGGCAAGGCATTATCTATCAATAATGCACTCATTGATATGTATGCCAAATGTGGGAGTCTAGAAGGAGCAAGAAAAGTCTTTGGAAAGATGCCAAAGAAAAATGTAATATCTTGGACAAGTATGATCCATGCTCTTGCAATGCATGGAGATGCTCCTAATGCTTTAAGCTTATTTCATCAAATGAAAGTTGAAAATGTTGAGCCTAATTGGATCACATTTGTAGGGGTGCTTTATGCTTGTAGCCACGGAGGTCTAGTTGAGGAGGGCCGAAGAATATTTCATTCAATGACCGATGAGTATGGCATAAGTCCCAAGCATGAACACTTTGGTTGCATGGTTGACCTCTTTGGCCGTGCAAATCTTCTGAGAGAAGCTCTTGAGGTGATTGAGGCAATGCCATTTGCTCCTAATGCTATTATTTGGGGATCCCTTATGGCTGCTTGTCAGATCCACGGTGAGACTGAGTTAGGAGAATTTGCTGCTAAACAAGTTCTCAAGCTCGAGCCTGATCATGATGGGGCCCTTGTCGTCTTATCAAAC
ATATACGCTAAAGAAAGAAGATGGGAAGACGTTGGGGAAGTTAGAAAACTAATGACCAAGATGGGCGTTTCCAAAGAGAGAGGATGCAGTAGAATTGAATTGAACAATGAGGTCCATGAATTTCAAATGGCAGATAGAAATCACAAGCAAGCAGATCAAATACATCAGAAATTAGATGAGGTAGTTCAAAAGTTGAATCTGGCTGGTTATACGCCACAGACAAATTATGTGCTCGTTGATTTAGACGAAGAGGAAAAGAAGGAATTAGTCCTCTGGCACAGCGAGAAATTGGCACTTTGCTATGCCCTCATGAATGAAGGGCCACGCATTTGCATTATAAAGAACCTTCGAATTTGTGAGGATTGTCATGCTTTTATGAAATTAGCCTCAAAAGTATATGCCAGAGAGATCATCATTAGGGACAGAAGTAGATTTCACCATTACAGAGACGGTTTGTGTTCTTGTAAGGACTACTGGTGA

mRNA sequence

ATGGAAATGCTCAGCCACTCAACCTCCGTCCTCCCTCTTCAACTTCACACATACCCCACCAGACCCACCGCTCTCTCCGCCGCTCTCTCCTCCGCCTCCAGCCTCTTCCACCTCAAACAAGTCCACGCTCAAATCCTTCGCTCCAAACTCGAACGCTGTGATTCCAATTCCCTTCTTTTTGAACTTATTCTTTCCTCTTGTGCTCTCTCGCCTAGCCTCGACTATGCCCTCTCTGTGTTTGATCAAATTCCCCAGCCCAAGACCCGTCTCTGCAACAAGCTTCTGCGCCAATTATCACGAGGTTCTGAGCCGGAGTTTACGCTTTTTGTATACGAGAAGATGAGGGCGGAGGGTCTGAGTTTGGATAGGTACTGCTTCCCTCCGCTGTTGAAAGCTGCTTCGAGGAATCTTTCCTTGAGAACGGGGATGGAGATTCATGGGCTCGCGTCGAAGTTGGGATTTGGGTCGGACCCATTTGTGGAGACGGGTTTGGTTAGAATGTACGCAGCCTGTGGACGGATAATGGAAGCTCGGTTGGTGTTTGATAAAATGTCTCACAGGGATGTCGTTGCTTGGAGCATCATGATTGATGGGTATGAAGCTAATGTTGTTGCTCTGCATTTACTAGTGAAGGTTAATGTGTATCTTTCTATCTTATCATTTTCCAGTTTAAATGCATGGCTGTTTAGGAGACAAATTACTGCGGAACTTATATTTGCCCCTTATCCTCAAATATACAGGTATTGCTTAAGTGGCTTTTATGATCTTGCCTTTCAACTCTTTGAAGAAATGAAGAGAACAGAGTTGGAACCAGATGAGATGATTCTTTCTACAGTTCTTTCTGCATGCGCTCGTGCTGGAAATTTGGATTTTGGAACAAAAATACACGAGTTCATTACTAAGAAGAATATTGTCATGGATCCTCATTTACAAAGTGCTCTCATCACAATGTATGCGAGCTGTGGCTCCATGGACTTGGCTTGGGATTTCTATGAAAAGATTTCCCCCAAGAACATGGTTGTTTCGACTGCCATGGTTTCTGGGCTTGCAAAAGGTGGACAGATTGGAGAAGCTCGCTACGTGTTTGATCAGATGGTAGAGAAGGACTTGATATGTTGGAGCGCAATGATTTCTGGCTATACAGAGAGTGACTGCCCTCAAGAGGCTCTTGTATTATTCAAGAAAATGCAACAGCAGGGAATGAAACCTGATGTAGTCACCATATTGAGTGTTATTTCAGCTTGTGCTCATCTTGGCGCATTAGATCAAGGCAAATGGATACAAACTTATGTTGATAAAAATGGGTTTGGCAAGGCATTATCTATCAATAATGCACTCATTGATATGTATGCCAAATGTGGGAGTCTAGAAGGAGCAAGAAAAGTCTTTGGAAAGATGCCAAAGAAAAATGTAATATCTTGGACAAGTATGATCCATGCTCTTGCAATGCATGGAGATGCTCCTAATGCTTTAAGCTTATTTCATCAAATGAAAGTTGAAAATGTTGAGCCTAATTGGATCACATTTGTAGGGGTGCTTTATGCTTGTAGCCACGGAGGTCTAGTTGAGGAGGGCCGAAGAATATTTCATTCAATGACCGATGAGTATGGCATAAGTCCCAAGCATGAACACTTTGGTTGCATGGTTGACCTCTTTGGCCGTGCAAATCTTCTGAGAGAAGCTCTTGAGGTGATTGAGGCAATGCCATTTGCTCCTAATGCTATTATTTGGGGATCCCTTATGGCTGCTTGTCAGATCCACGGTGAGACTGAGTTAGGAGAATTTGCTGCTAAACAAGTTCTCAAGCTCGAGCCTGATCATGATGGGGCCCTTGTCGTCTTATCAAACATATACGCTAAAGAAAGAAGATGGGAAGACGTTGGGGAAGTTAGAAAACTAATGACCAAGATGGGCGTTTCCAAAGAGAGAGGATGCAGTAGAATTGAATTGAACAATGAGGTCCATGAATTTCAAATGGCAGATAGAAATCACAAGCAAGC
AGATCAAATACATCAGAAATTAGATGAGGTAGTTCAAAAGTTGAATCTGGCTGGTTATACGCCACAGACAAATTATGTGCTCGTTGATTTAGACGAAGAGGAAAAGAAGGAATTAGTCCTCTGGCACAGCGAGAAATTGGCACTTTGCTATGCCCTCATGAATGAAGGGCCACGCATTTGCATTATAAAGAACCTTCGAATTTGTGAGGATTGTCATGCTTTTATGAAATTAGCCTCAAAAGTATATGCCAGAGAGATCATCATTAGGGACAGAAGTAGATTTCACCATTACAGAGACGGTTTGTGTTCTTGTAAGGACTACTGGTGA

Coding sequence (CDS)

ATGGAAATGCTCAGCCACTCAACCTCCGTCCTCCCTCTTCAACTTCACACATACCCCACCAGACCCACCGCTCTCTCCGCCGCTCTCTCCTCCGCCTCCAGCCTCTTCCACCTCAAACAAGTCCACGCTCAAATCCTTCGCTCCAAACTCGAACGCTGTGATTCCAATTCCCTTCTTTTTGAACTTATTCTTTCCTCTTGTGCTCTCTCGCCTAGCCTCGACTATGCCCTCTCTGTGTTTGATCAAATTCCCCAGCCCAAGACCCGTCTCTGCAACAAGCTTCTGCGCCAATTATCACGAGGTTCTGAGCCGGAGTTTACGCTTTTTGTATACGAGAAGATGAGGGCGGAGGGTCTGAGTTTGGATAGGTACTGCTTCCCTCCGCTGTTGAAAGCTGCTTCGAGGAATCTTTCCTTGAGAACGGGGATGGAGATTCATGGGCTCGCGTCGAAGTTGGGATTTGGGTCGGACCCATTTGTGGAGACGGGTTTGGTTAGAATGTACGCAGCCTGTGGACGGATAATGGAAGCTCGGTTGGTGTTTGATAAAATGTCTCACAGGGATGTCGTTGCTTGGAGCATCATGATTGATGGGTATGAAGCTAATGTTGTTGCTCTGCATTTACTAGTGAAGGTTAATGTGTATCTTTCTATCTTATCATTTTCCAGTTTAAATGCATGGCTGTTTAGGAGACAAATTACTGCGGAACTTATATTTGCCCCTTATCCTCAAATATACAGGTATTGCTTAAGTGGCTTTTATGATCTTGCCTTTCAACTCTTTGAAGAAATGAAGAGAACAGAGTTGGAACCAGATGAGATGATTCTTTCTACAGTTCTTTCTGCATGCGCTCGTGCTGGAAATTTGGATTTTGGAACAAAAATACACGAGTTCATTACTAAGAAGAATATTGTCATGGATCCTCATTTACAAAGTGCTCTCATCACAATGTATGCGAGCTGTGGCTCCATGGACTTGGCTTGGGATTTCTATGAAAAGATTTCCCCCAAGAACATGGTTGTTTCGACTGCCATGGTTTCTGGGCTTGCAAAAGGTGGACAGATTGGAGAAGCTCGCTACGTGTTTGATCAGATGGTAGAGAAGGACTTGATATGTTGGAGCGCAATGATTTCTGGCTATACAGAGAGTGACTGCCCTCAAGAGGCTCTTGTATTATTCAAGAAAATGCAACAGCAGGGAATGAAACCTGATGTAGTCACCATATTGAGTGTTATTTCAGCTTGTGCTCATCTTGGCGCATTAGATCAAGGCAAATGGATACAAACTTATGTTGATAAAAATGGGTTTGGCAAGGCATTATCTATCAATAATGCACTCATTGATATGTATGCCAAATGTGGGAGTCTAGAAGGAGCAAGAAAAGTCTTTGGAAAGATGCCAAAGAAAAATGTAATATCTTGGACAAGTATGATCCATGCTCTTGCAATGCATGGAGATGCTCCTAATGCTTTAAGCTTATTTCATCAAATGAAAGTTGAAAATGTTGAGCCTAATTGGATCACATTTGTAGGGGTGCTTTATGCTTGTAGCCACGGAGGTCTAGTTGAGGAGGGCCGAAGAATATTTCATTCAATGACCGATGAGTATGGCATAAGTCCCAAGCATGAACACTTTGGTTGCATGGTTGACCTCTTTGGCCGTGCAAATCTTCTGAGAGAAGCTCTTGAGGTGATTGAGGCAATGCCATTTGCTCCTAATGCTATTATTTGGGGATCCCTTATGGCTGCTTGTCAGATCCACGGTGAGACTGAGTTAGGAGAATTTGCTGCTAAACAAGTTCTCAAGCTCGAGCCTGATCATGATGGGGCCCTTGTCGTCTTATCAAACATATACGCTAAAGAAAGAAGATGGGAAGACGTTGGGGAAGTTAGAAAACTAATGACCAAGATGGGCGTTTCCAAAGAGAGAGGATGCAGTAGAATTGAATTGAACAATGAGGTCCATGAATTTCAAATGGCAGATAGAAATCACAAGCAAGC
AGATCAAATACATCAGAAATTAGATGAGGTAGTTCAAAAGTTGAATCTGGCTGGTTATACGCCACAGACAAATTATGTGCTCGTTGATTTAGACGAAGAGGAAAAGAAGGAATTAGTCCTCTGGCACAGCGAGAAATTGGCACTTTGCTATGCCCTCATGAATGAAGGGCCACGCATTTGCATTATAAAGAACCTTCGAATTTGTGAGGATTGTCATGCTTTTATGAAATTAGCCTCAAAAGTATATGCCAGAGAGATCATCATTAGGGACAGAAGTAGATTTCACCATTACAGAGACGGTTTGTGTTCTTGTAAGGACTACTGGTGA

Protein sequence

MEMLSHSTSVLPLQLHTYPTRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW
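
As a quick consistency check between the coding sequence and the protein sequence, the first and last codons of the CDS can be translated by hand. The sketch below is a minimal illustration that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, and only two short fragments copied from the CDS above are used rather than the full sequence.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGGAAATGCTCAGCCACTCAACCTCCGTC"  # first 30 nt of the CDS
cds_end = "GACTACTGGTGA"                      # last 12 nt of the CDS

print(translate(cds_start))  # expected to match the protein's first residues
print(translate(cds_end))    # expected to end with a stop codon
```

The two fragments translate to MEMLSHSTSV and DYW followed by a stop, in agreement with the start and end of the protein sequence shown above.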
BLAST of ClCG05G005100.1 vs. Swiss-Prot
Match: PP311_ARATH (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 731.1 bits (1886), Expect = 1.3e-209
Identity = 387/766 (50.52%), Positives = 509/766 (66.45%), Query Frame = 1

Query: 20  TRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSPSLDYALSV 79
           T    +   LS   SL H+KQ+HA ILR+ +     NS LF L +SS +++  L YAL+V
Sbjct: 10  TAANTILEKLSFCKSLNHIKQLHAHILRTVINH-KLNSFLFNLSVSSSSIN--LSYALNV 69

Query: 80  FDQIPQPKTRLC-NKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLS 139
           F  IP P   +  N  LR LSR SEP  T+  Y+++R  G  LD++ F P+LKA S+  +
Sbjct: 70  FSSIPSPPESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSA 129

Query: 140 LRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDG 199
           L  GME+HG+A K+    DPFVETG + MYA+CGRI  AR VFD+MSHRDVV W+ MI+ 
Sbjct: 130 LFEGMELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIER 189

Query: 200 YEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYC-LSGFYDLA 259
           Y                L   +F           +  E+I      I   C  +G     
Sbjct: 190 Y------------CRFGLVDEAFKLFEEMKDSNVMPDEMILC---NIVSACGRTGNMRYN 249

Query: 260 FQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITM 319
             ++E +   ++  D  +L+ +++  A AG +D   +    ++ +N+     + +A+++ 
Sbjct: 250 RAIYEFLIENDVRMDTHLLTALVTMYAGAGCMDMAREFFRKMSVRNL----FVSTAMVSG 309

Query: 320 YASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMI 379
           Y+ CG +D                               +A+ +FDQ  +KDL+CW+ MI
Sbjct: 310 YSKCGRLD-------------------------------DAQVIFDQTEKKDLVCWTTMI 369

Query: 380 SGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFG 439
           S Y ESD PQEAL +F++M   G+KPDVV++ SVISACA+LG LD+ KW+ + +  NG  
Sbjct: 370 SAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWVHSCIHVNGLE 429

Query: 440 KALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQM 499
             LSINNALI+MYAKCG L+  R VF KMP++NV+SW+SMI+AL+MHG+A +ALSLF +M
Sbjct: 430 SELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDALSLFARM 489

Query: 500 KVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANL 559
           K ENVEPN +TFVGVLY CSH GLVEEG++IF SMTDEY I+PK EH+GCMVDLFGRANL
Sbjct: 490 KQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDLFGRANL 549

Query: 560 LREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNI 619
           LREALEVIE+MP A N +IWGSLM+AC+IHGE ELG+FAAK++L+LEPDHDGALV++SNI
Sbjct: 550 LREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGALVLMSNI 609

Query: 620 YAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEV 679
           YA+E+RWEDV  +R++M +  V KE+G SRI+ N + HEF + D+ HKQ+++I+ KLDEV
Sbjct: 610 YAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIYAKLDEV 669

Query: 680 VQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPR--------ICII 739
           V KL LAGY P    VLVD++EEEKK+LVLWHSEKLALC+ LMNE           I I+
Sbjct: 670 VSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEEEKDSCGVIRIV 722

Query: 740 KNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           KNLR+CEDCH F KL SKVY REII+RDR+RFH Y++GLCSC+DYW
Sbjct: 730 KNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722
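
The percentages in an HSP summary line follow directly from the reported counts: matches divided by alignment length. The sketch below recomputes them for the PP311_ARATH hit above; `percent` is a hypothetical helper, and rounding to two decimals is an assumption chosen to match how the figures are displayed.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP 1 summary of the PP311_ARATH match.
identity_pct = percent(387, 766)   # identical residues
positives_pct = percent(509, 766)  # identical or conservatively substituted

print(identity_pct, positives_pct)
```

The same calculation applies to every other HSP summary on this page, for example 294/754 for the PPR21_ARATH hit.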

BLAST of ClCG05G005100.1 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 5.2e-155
Identity = 294/754 (38.99%), Positives = 449/754 (59.55%), Query Frame = 1

Query: 29  LSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSP---SLDYALSVFDQIPQ 88
           L +  +L  L+ +HAQ+++  L   ++N  L +LI   C LSP    L YA+SVF  I +
Sbjct: 40  LHNCKTLQSLRIIHAQMIKIGLH--NTNYALSKLI-EFCILSPHFEGLPYAISVFKTIQE 99

Query: 89  PKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGMEI 148
           P   + N + R  +  S+P   L +Y  M + GL  + Y FP +LK+ +++ + + G +I
Sbjct: 100 PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 159

Query: 149 HGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVVA 208
           HG   KLG   D +V T L+ MY   GR+ +A  VFDK  HRDVV+++ +I GY +    
Sbjct: 160 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRG-- 219

Query: 209 LHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEMK 268
                    Y+        NA     +I  + + +    I  Y  +G Y  A +LF++M 
Sbjct: 220 ---------YIE-------NAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMM 279

Query: 269 RTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSMD 328
           +T + PDE  + TV+SACA++G+++ G ++H +I       +  + +ALI +Y+ CG ++
Sbjct: 280 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 339

Query: 329 LAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMISGYTESDC 388
            A   +E++                                 KD+I W+ +I GYT  + 
Sbjct: 340 TACGLFERLP-------------------------------YKDVISWNTLIGGYTHMNL 399

Query: 389 PQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKN--GFGKALSIN 448
            +EAL+LF++M + G  P+ VT+LS++ ACAHLGA+D G+WI  Y+DK   G   A S+ 
Sbjct: 400 YKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLR 459

Query: 449 NALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVE 508
            +LIDMYAKCG +E A +VF  +  K++ SW +MI   AMHG A  +  LF +M+   ++
Sbjct: 460 TSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQ 519

Query: 509 PNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREALE 568
           P+ ITFVG+L ACSH G+++ GR IF +MT +Y ++PK EH+GCM+DL G + L +EA E
Sbjct: 520 PDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEE 579

Query: 569 VIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKERR 628
           +I  M   P+ +IW SL+ AC++HG  ELGE  A+ ++K+EP++ G+ V+LSNIYA   R
Sbjct: 580 MINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGR 639

Query: 629 WEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKLNL 688
           W +V + R L+   G+ K  GCS IE+++ VHEF + D+ H +  +I+  L+E+   L  
Sbjct: 640 WNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEK 699

Query: 689 AGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMN--EGPRICIIKNLRICEDCHAF 748
           AG+ P T+ VL +++EE K+  +  HSEKLA+ + L++   G ++ I+KNLR+C +CH  
Sbjct: 700 AGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEA 741

Query: 749 MKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
            KL SK+Y REII RDR+RFHH+RDG+CSC DYW
Sbjct: 760 TKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of ClCG05G005100.1 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 482.6 bits (1241), Expect = 7.8e-135
Identity = 232/528 (43.94%), Positives = 352/528 (66.67%), Query Frame = 1

Query: 252 GFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQ 311
           G  D A +LF++M+  +++   + +  VLSACA+  NL+FG ++  +I +  + ++  L 
Sbjct: 211 GSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLA 270

Query: 312 SALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLI 371
           +A++ MY  CGS++ A   ++ +  K+ V  T M+ G A       AR V + M +KD++
Sbjct: 271 NAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIV 330

Query: 372 CWSAMISGYTESDCPQEALVLFKKMQ-QQGMKPDVVTILSVISACAHLGALDQGKWIQTY 431
            W+A+IS Y ++  P EAL++F ++Q Q+ MK + +T++S +SACA +GAL+ G+WI +Y
Sbjct: 331 AWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSY 390

Query: 432 VDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNA 491
           + K+G      + +ALI MY+KCG LE +R+VF  + K++V  W++MI  LAMHG    A
Sbjct: 391 IKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEA 450

Query: 492 LSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVD 551
           + +F++M+  NV+PN +TF  V  ACSH GLV+E   +FH M   YGI P+ +H+ C+VD
Sbjct: 451 VDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVD 510

Query: 552 LFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGA 611
           + GR+  L +A++ IEAMP  P+  +WG+L+ AC+IH    L E A  ++L+LEP +DGA
Sbjct: 511 VLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGA 570

Query: 612 LVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQI 671
            V+LSNIYAK  +WE+V E+RK M   G+ KE GCS IE++  +HEF   D  H  ++++
Sbjct: 571 HVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKV 630

Query: 672 HQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVL-WHSEKLALCYALMN-EGPRIC- 731
           + KL EV++KL   GY P+ + VL  ++EEE KE  L  HSEKLA+CY L++ E P++  
Sbjct: 631 YGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIR 690

Query: 732 IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           +IKNLR+C DCH+  KL S++Y REII+RDR RFHH+R+G CSC D+W
Sbjct: 691 VIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738


HSP 2 Score: 348.2 bits (892), Expect = 2.3e-94
Identity = 249/779 (31.96%), Positives = 389/779 (49.94%), Query Frame = 1

Query: 34  SLFHLKQVHAQILRSKL--ERCDSNSLLFELILSSCALSPSLDYALSVFDQIPQPKTRLC 93
           SL  LKQ H  ++R+    +   ++ L     LSS A   SL+YA  VFD+IP+P +   
Sbjct: 42  SLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFA---SLEYARKVFDEIPKPNSFAW 101

Query: 94  NKLLRQLSRGSEPEFTLFVYEKMRAEGLSL-DRYCFPPLLKAASRNLSLRTGMEIHGLAS 153
           N L+R  + G +P  +++ +  M +E     ++Y FP L+KAA+   SL  G  +HG+A 
Sbjct: 102 NTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAV 161

Query: 154 KLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGY----------- 213
           K   GSD FV   L+  Y +CG +  A  VF  +  +DVV+W+ MI+G+           
Sbjct: 162 KSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALE 221

Query: 214 -----EANVVALHLLVKVNVYLSILSFSSLN------AWLFRRQITAELIFAPYPQIYRY 273
                E+  V    +  V V  +     +L       +++   ++   L  A    +  Y
Sbjct: 222 LFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLA-NAMLDMY 281

Query: 274 CLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDP 333
              G  + A +LF+ M+    E D +  +T+L   A + + +   ++   + +K+IV   
Sbjct: 282 TKCGSIEDAKRLFDAME----EKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIV--- 341

Query: 334 HLQSALITMYASCGSMDLAW-DFYEKISPKNMVVS-TAMVSGLAKGGQIGEAR------- 393
              +ALI+ Y   G  + A   F+E    KNM ++   +VS L+   Q+G          
Sbjct: 342 -AWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHS 401

Query: 394 YVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLG 453
           Y+    +  +    SA+I  Y++    +++  +F  ++    K DV    ++I   A  G
Sbjct: 402 YIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVE----KRDVFVWSAMIGGLAMHG 461

Query: 454 ALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIH 513
                                   N  +DM+ K               K N +++T++  
Sbjct: 462 C----------------------GNEAVDMFYKMQEAN---------VKPNGVTFTNVFC 521

Query: 514 ALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGIS 573
           A +  G    A SLFHQM+                  S+ G+V E               
Sbjct: 522 ACSHTGLVDEAESLFHQME------------------SNYGIVPE--------------- 581

Query: 574 PKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQ 633
              +H+ C+VD+ GR+  L +A++ IEAMP  P+  +WG+L+ AC+IH    L E A  +
Sbjct: 582 --EKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTR 641

Query: 634 VLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQM 693
           +L+LEP +DGA V+LSNIYAK  +WE+V E+RK M   G+ KE GCS IE++  +HEF  
Sbjct: 642 LLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLS 701

Query: 694 ADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVL-WHSEKLALCYA 753
            D  H  +++++ KL EV++KL   GY P+ + VL  ++EEE KE  L  HSEKLA+CY 
Sbjct: 702 GDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYG 738

Query: 754 LMN-EGPRIC-IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           L++ E P++  +IKNLR+C DCH+  KL S++Y REII+RDR RFHH+R+G CSC D+W
Sbjct: 762 LISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of ClCG05G005100.1 vs. Swiss-Prot
Match: PP168_ARATH (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 471.9 bits (1213), Expect = 1.4e-131
Identity = 265/724 (36.60%), Positives = 411/724 (56.77%), Query Frame = 1

Query: 63  ILSSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLD 122
           +LS+ +    +D     FDQ+PQ  +     ++       +    + V   M  EG+   
Sbjct: 86  VLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPT 145

Query: 123 RYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFD 182
           ++    +L + +    + TG ++H    KLG   +  V   L+ MYA CG  M A+ VFD
Sbjct: 146 QFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFD 205

Query: 183 KMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPY 242
           +M  RD+ +W+ MI        ALH+ V   + L++  F  +      R I         
Sbjct: 206 RMVVRDISSWNAMI--------ALHMQVG-QMDLAMAQFEQMA----ERDIVT------- 265

Query: 243 PQIYRYCLSGF----YDL-AFQLFEEMKRTEL-EPDEMILSTVLSACARAGNLDFGTKIH 302
              +   +SGF    YDL A  +F +M R  L  PD   L++VLSACA    L  G +IH
Sbjct: 266 ---WNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIH 325

Query: 303 EFITKKNIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVS--TAMVSGLAKGGQ 362
             I      +   + +ALI+MY+ CG ++ A    E+   K++ +   TA++ G  K G 
Sbjct: 326 SHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGD 385

Query: 363 IGEARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISA 422
           + +A+ +F  + ++D++ W+AMI GY +     EA+ LF+ M   G +P+  T+ +++S 
Sbjct: 386 MNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSV 445

Query: 423 CAHLGALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMP-KKNVIS 482
            + L +L  GK I     K+G   ++S++NALI MYAK G++  A + F  +  +++ +S
Sbjct: 446 ASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVS 505

Query: 483 WTSMIHALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMT 542
           WTSMI ALA HG A  AL LF  M +E + P+ IT+VGV  AC+H GLV +GR+ F  M 
Sbjct: 506 WTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMK 565

Query: 543 DEYGISPKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELG 602
           D   I P   H+ CMVDLFGRA LL+EA E IE MP  P+ + WGSL++AC++H   +LG
Sbjct: 566 DVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLG 625

Query: 603 EFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNE 662
           + AA+++L LEP++ GA   L+N+Y+   +WE+  ++RK M    V KE+G S IE+ ++
Sbjct: 626 KVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHK 685

Query: 663 VHEFQMADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKL 722
           VH F + D  H + ++I+  + ++  ++   GY P T  VL DL+EE K++++  HSEKL
Sbjct: 686 VHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKL 745

Query: 723 ALCYALMN--EGPRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSC 776
           A+ + L++  +   + I+KNLR+C DCH  +K  SK+  REII+RD +RFHH++DG CSC
Sbjct: 746 AIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFCSC 786


HSP 2 Score: 156.4 bits (394), Expect = 1.3e-36
Identity = 126/522 (24.14%), Positives = 227/522 (43.49%), Query Frame = 1

Query: 125 CFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKM 184
           C   L K+ +++    T   +H    K G     ++   L+ +Y+  G  + AR +FD+M
Sbjct: 16  CTNLLQKSVNKSNGRFTAQLVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEM 75

Query: 185 SHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQ 244
             R   +W+ ++  Y           + ++  +   F  L     R  ++   +   Y  
Sbjct: 76  PLRTAFSWNTVLSAYSK---------RGDMDSTCEFFDQLPQ---RDSVSWTTMIVGYKN 135

Query: 245 IYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNI 304
           I      G Y  A ++  +M +  +EP +  L+ VL++ A    ++ G K+H FI K  +
Sbjct: 136 I------GQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGL 195

Query: 305 VMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQ 364
             +  + ++L+ MYA CG   +A   ++++  +++    AM++   + GQ+  A   F+Q
Sbjct: 196 RGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQ 255

Query: 365 MVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGM-KPDVVTILSVISACAHL----- 424
           M E+D++ W++MISG+ +      AL +F KM +  +  PD  T+ SV+SACA+L     
Sbjct: 256 MAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCI 315

Query: 425 GALDQGKWIQTYVDKNGF--------------------------GKALSINN--ALIDMY 484
           G       + T  D +G                            K L I    AL+D Y
Sbjct: 316 GKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGY 375

Query: 485 AKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVEPNWITFV 544
            K G +  A+ +F  +  ++V++WT+MI     HG    A++LF  M      PN  T  
Sbjct: 376 IKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLA 435

Query: 545 GVLYACSHGGLVEEGRRIFHSMT---DEYGISPKHEHFGCMVDLFGRANLLREALEVIEA 604
            +L   S    +  G++I  S     + Y +S  +     ++ ++ +A  +  A    + 
Sbjct: 436 AMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSN----ALITMYAKAGNITSASRAFDL 495

Query: 605 MPFAPNAIIWGSLMAACQIHGETE--LGEFAAKQVLKLEPDH 608
           +    + + W S++ A   HG  E  L  F    +  L PDH
Sbjct: 496 IRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDH 515

BLAST of ClCG05G005100.1 vs. Swiss-Prot
Match: PPR53_ARATH (Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN=PCMP-H21 PE=2 SV=2)

HSP 1 Score: 471.1 bits (1211), Expect = 2.4e-131
Identity = 253/756 (33.47%), Positives = 440/756 (58.20%), Query Frame = 1

Query: 26  SAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSPSLDYALSVFDQIPQ 85
           S++   +SSL    Q HA+IL+S  +   ++  +   +++S +     + A  V   IP 
Sbjct: 22  SSSYHWSSSLSKTTQAHARILKSGAQ---NDGYISAKLIASYSNYNCFNDADLVLQSIPD 81

Query: 86  PKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGMEI 145
           P     + L+  L++      ++ V+ +M + GL  D +  P L K  +   + + G +I
Sbjct: 82  PTIYSFSSLIYALTKAKLFTQSIGVFSRMFSHGLIPDSHVLPNLFKVCAELSAFKVGKQI 141

Query: 146 HGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVVA 205
           H ++   G   D FV+  +  MY  CGR+ +AR VFD+MS +DVV  S ++  Y A    
Sbjct: 142 HCVSCVSGLDMDAFVQGSMFHMYMRCGRMGDARKVFDRMSDKDVVTCSALLCAY-ARKGC 201

Query: 206 LHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEMK 265
           L  +V++   LS +  S + A +    ++   I + + +      SG++  A  +F+++ 
Sbjct: 202 LEEVVRI---LSEMESSGIEANI----VSWNGILSGFNR------SGYHKEAVVMFQKIH 261

Query: 266 RTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSMD 325
                PD++ +S+VL +   +  L+ G  IH ++ K+ ++ D  + SA+I MY   G + 
Sbjct: 262 HLGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKSGHVY 321

Query: 326 LAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFD----QMVEKDLICWSAMISGYT 385
                + +       V  A ++GL++ G + +A  +F+    Q +E +++ W+++I+G  
Sbjct: 322 GIISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSIIAGCA 381

Query: 386 ESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFGKALS 445
           ++    EAL LF++MQ  G+KP+ VTI S++ AC ++ AL  G+    +  +      + 
Sbjct: 382 QNGKDIEALELFREMQVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLLDNVH 441

Query: 446 INNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVEN 505
           + +ALIDMYAKCG +  ++ VF  MP KN++ W S+++  +MHG A   +S+F  +    
Sbjct: 442 VGSALIDMYAKCGRINLSQIVFNMMPTKNLVCWNSLMNGFSMHGKAKEVMSIFESLMRTR 501

Query: 506 VEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREA 565
           ++P++I+F  +L AC   GL +EG + F  M++EYGI P+ EH+ CMV+L GRA  L+EA
Sbjct: 502 LKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEA 561

Query: 566 LEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKE 625
            ++I+ MPF P++ +WG+L+ +C++    +L E AA+++  LEP++ G  V+LSNIYA +
Sbjct: 562 YDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAK 621

Query: 626 RRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKL 685
             W +V  +R  M  +G+ K  GCS I++ N V+     D++H Q DQI +K+DE+ +++
Sbjct: 622 GMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISKEM 681

Query: 686 NLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMN--EGPRICIIKNLRICEDCH 745
             +G+ P  ++ L D++E+E+++++  HSEKLA+ + L+N  +G  + +IKNLRIC DCH
Sbjct: 682 RKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCH 741

Query: 746 AFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           A +K  S    REI IRD +RFHH++DG+CSC D+W
Sbjct: 742 AVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDFW 760

BLAST of ClCG05G005100.1 vs. TrEMBL
Match: D7T700_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g03630 PE=4 SV=1)

HSP 1 Score: 910.6 bits (2352), Expect = 1.3e-261
Identity = 472/772 (61.14%), Positives = 568/772 (73.58%), Query Frame = 1

Query: 12  PLQLHTYPTRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSP 71
           P  LH++ T    L +ALSSA+SL HLKQVHAQILRSKL+R  S SLL +L++SSCALS 
Sbjct: 17  PTTLHSHHT----LFSALSSATSLTHLKQVHAQILRSKLDR--STSLLVKLVISSCALSS 76

Query: 72  SLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLK 131
           SLDYALSVF+ IP+P+T LCN+ LR+LSR  EPE TL VYE+MR +GL++DR+ FPPLLK
Sbjct: 77  SLDYALSVFNLIPKPETHLCNRFLRELSRSEEPEKTLLVYERMRTQGLAVDRFSFPPLLK 136

Query: 132 AASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVA 191
           A SR  SL  G+EIHGLA+KLGF SDPFV+TGLVRMYAACGRI EARL+FDKM HRDVV 
Sbjct: 137 ALSRVKSLVEGLEIHGLAAKLGFDSDPFVQTGLVRMYAACGRIAEARLMFDKMFHRDVVT 196

Query: 192 WSIMIDGY------EANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQI 251
           WSIMIDGY         ++    +   NV    +  S++ +   R               
Sbjct: 197 WSIMIDGYCQSGLFNDALLLFEEMKNYNVEPDEMMLSTVLSACGR--------------- 256

Query: 252 YRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIV 311
                +G       + + +    +  D  + S +++  A  G++D    + E +T KN+V
Sbjct: 257 -----AGNLSYGKMIHDFIMENNIVVDPHLQSALVTMYASCGSMDLALNLFEKMTPKNLV 316

Query: 312 MDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQM 371
                 +A++T Y                               +K GQI  AR VF+QM
Sbjct: 317 ----ASTAMVTGY-------------------------------SKLGQIENARSVFNQM 376

Query: 372 VEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGK 431
           V+KDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VT+LSVI+ACAHLGALDQ K
Sbjct: 377 VKKDLVCWSAMISGYAESDSPQEALNLFNEMQSLGIKPDQVTMLSVITACAHLGALDQAK 436

Query: 432 WIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHG 491
           WI  +VDKNGFG AL INNALI+MYAKCGSLE AR++F KMP+KNVISWT MI A AMHG
Sbjct: 437 WIHLFVDKNGFGGALPINNALIEMYAKCGSLERARRIFDKMPRKNVISWTCMISAFAMHG 496

Query: 492 DAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHF 551
           DA +AL  FHQM+ EN+EPN ITFVGVLYACSH GLVEEGR+IF+SM +E+ I+PKH H+
Sbjct: 497 DAGSALRFFHQMEDENIEPNGITFVGVLYACSHAGLVEEGRKIFYSMINEHNITPKHVHY 556

Query: 552 GCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEP 611
           GCMVDLFGRANLLREALE++EAMP APN IIWGSLMAAC++HGE ELGEFAAK++L+L+P
Sbjct: 557 GCMVDLFGRANLLREALELVEAMPLAPNVIIWGSLMAACRVHGEIELGEFAAKRLLELDP 616

Query: 612 DHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHK 671
           DHDGA V LSNIYAK RRWEDVG+VRKLM   G+SKERGCSR ELNNE+HEF +ADR+HK
Sbjct: 617 DHDGAHVFLSNIYAKARRWEDVGQVRKLMKHKGISKERGCSRFELNNEIHEFLVADRSHK 676

Query: 672 QADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPR 731
            AD+I++KL EVV KL L GY+P T  +LVDL+EEEKKE+VLWHSEKLALCY LM +G  
Sbjct: 677 HADEIYEKLYEVVSKLKLVGYSPNTCSILVDLEEEEKKEVVLWHSEKLALCYGLMRDGTG 727

Query: 732 IC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
            C  IIKNLR+CEDCH F+KLASKVY REI++RDR+RFHHY+DG+CSCKDYW
Sbjct: 737 SCIRIIKNLRVCEDCHTFIKLASKVYEREIVVRDRTRFHHYKDGVCSCKDYW 727

BLAST of ClCG05G005100.1 vs. TrEMBL
Match: B9HUV1_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s09620g PE=4 SV=2)

HSP 1 Score: 880.2 bits (2273), Expect = 1.9e-252
Identity = 462/773 (59.77%), Positives = 555/773 (71.80%), Query Frame = 1

Query: 17  TYPTRPT-----ALSAALSSAS-----SLFHLKQVHAQILRSKLERCDSNSLLFELILSS 76
           T PT P      AL AALSS S     SL HLKQ+HAQ+LRS L      SLL EL+LSS
Sbjct: 6   TLPTIPVPLTSIALHAALSSTSTSTPTSLPHLKQIHAQVLRSNLPP----SLLLELLLSS 65

Query: 77  CALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRA-EGL-SLDRY 136
                SLDYALSVF  +P+  T L NKL R LSR ++PE  L  YEK+R  EGL  +DR+
Sbjct: 66  S----SLDYALSVFTHLPKCHTPLSNKLFRSLSRSAKPETALLAYEKIRLKEGLLGIDRF 125

Query: 137 CFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKM 196
            FPPLLKAASR   L  G EIHG+A+KLGF                              
Sbjct: 126 SFPPLLKAASRASGLNEGKEIHGVATKLGF------------------------------ 185

Query: 197 SHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQ 256
             +D    + ++  Y     +   + +  +    +S+  + AW                 
Sbjct: 186 -DKDPFVQTGLVGMY----ASCDRISEARLVFDKMSYRDVVAWSI--------------M 245

Query: 257 IYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNI 316
           I  Y  SG YD   QLFEEM+ + L+PDEM+L+T++SAC RA NL +G  IH+FI + N 
Sbjct: 246 IDGYHQSGLYDDVLQLFEEMRSSNLKPDEMVLTTIISACGRARNLSYGEAIHDFIIENNF 305

Query: 317 VMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQ 376
           V+D +LQSAL+TMYASCG M++A   + KIS +N+VV TAM+SG ++ G++ +AR +FDQ
Sbjct: 306 VLDTYLQSALLTMYASCGCMEMAQKLFTKISSRNLVVLTAMISGYSRVGRVEDARLIFDQ 365

Query: 377 MVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQG 436
           M EKDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VTILSVISACA LG LD+ 
Sbjct: 366 MEEKDLVCWSAMISGYAESDKPQEALNLFSEMQVFGIKPDQVTILSVISACARLGVLDRA 425

Query: 437 KWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMH 496
           KWI  YVDKNG G AL +NNALIDMYAKCG+L  AR VF KM  +NVISWTSMI+A A+H
Sbjct: 426 KWIHMYVDKNGLGGALPVNNALIDMYAKCGNLGAARGVFEKMQSRNVISWTSMINAFAIH 485

Query: 497 GDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEH 556
           GDA NAL  F+QMK EN++PN +TFVGVLYACSH GLVEEGRR F SMT+E+ I+PKHEH
Sbjct: 486 GDASNALKFFYQMKDENIKPNGVTFVGVLYACSHAGLVEEGRRTFASMTNEHNITPKHEH 545

Query: 557 FGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLE 616
           +GCMVDLFGRANLLR+ALE++E MP APN +IWGSLMAACQIHGE ELGEFAAKQVL+LE
Sbjct: 546 YGCMVDLFGRANLLRDALELVETMPLAPNVVIWGSLMAACQIHGENELGEFAAKQVLELE 605

Query: 617 PDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNH 676
           PDHDGALV LSNIYAK+RRW+DVGE+R LM + G+SKERGCSRIELNN+V+EF MAD+ H
Sbjct: 606 PDHDGALVQLSNIYAKDRRWQDVGELRNLMKQRGISKERGCSRIELNNQVYEFVMADKKH 665

Query: 677 KQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGP 736
           KQAD+I++KLDEVV++L L GYTP T  VLVD++EE KKE+VLWHSEKLALCY LM EG 
Sbjct: 666 KQADKIYEKLDEVVKELKLVGYTPNTRSVLVDVEEEGKKEVVLWHSEKLALCYGLMGEGK 721

Query: 737 RIC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
             C  I+KNLR+CEDCH F+KL SKVY  EII+RDR+RFHHY+ G+CSC DYW
Sbjct: 726 GSCIRIVKNLRVCEDCHTFIKLVSKVYGMEIIVRDRTRFHHYKAGVCSCNDYW 721
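
The percentages in each HSP statistics line follow directly from the reported counts (matching columns divided by alignment length). The sketch below is a minimal illustration in Python, not part of the database output: the function names `parse_hsp_stats` and `percent` are hypothetical, and the regular expression assumes the line layout shown in this report. It extracts the counts from one statistics line and recomputes the percentages.

```python
import re

# Pattern for the HSP statistics line as printed in this report.
# "Pos\w*tives" tolerates minor spelling variants of the "Positives" label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return identity/positive counts and reported percentages, or None."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(count, length):
    """Recompute a percentage from a count and the alignment length."""
    return round(100.0 * count / length, 2)


# Statistics line of the B9HUV1_POPTR hit.
line = "Identity = 462/773 (59.77%), Positives = 555/773 (71.80%), Query Frame = 1"
stats = parse_hsp_stats(line)
for name, (count, length, reported) in stats.items():
    print(name, count, length, reported, percent(count, length))
```

Comparing the recomputed value with the reported one is a quick sanity check when statistics lines are copied by hand or scraped from a page.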

BLAST of ClCG05G005100.1 vs. TrEMBL
Match: A0A067L6S3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02188 PE=4 SV=1)

HSP 1 Score: 852.0 bits (2200), Expect = 5.5e-244
Identity = 437/776 (56.31%), Positives = 555/776 (71.52%), Query Frame = 1

Query: 4   LSHSTS-VLPLQLHTYPTRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFEL 63
           +S STS  LPL L +     T L    SS++SL+HLKQVHAQILRS L    S S+L +L
Sbjct: 1   MSASTSPALPLPLSSATIHTTLLPFLSSSSTSLYHLKQVHAQILRSSL----SPSILLKL 60

Query: 64  ILSSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGL-SL 123
           ILSS +   SL+YALSVF  +P P+  L NK LR LSR S+PE  L VYEK+R +GL  +
Sbjct: 61  ILSSSSSISSLEYALSVFTHLPTPRPALSNKFLRALSRSSKPETVLLVYEKIREDGLFGV 120

Query: 124 DRYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVF 183
           DR+  P LLKAA++  +L  GMEIHG+A+KLG                           F
Sbjct: 121 DRFSLPLLLKAAAKVSALNEGMEIHGVATKLG---------------------------F 180

Query: 184 DKMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAP 243
           DK         S+ +        A   +++  +    +S+  +  W              
Sbjct: 181 DKDPFVQTGLMSLYL--------ACGKILEARLVFDKMSYRDVVTWSI------------ 240

Query: 244 YPQIYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITK 303
              I  Y  +G +D A + FEEMK + ++PD+++LST++SAC+RAGNL +G  +H+FI +
Sbjct: 241 --MINGYYQNGHFDEALKFFEEMKSSNVQPDKVVLSTIISACSRAGNLSYGKAVHDFIIE 300

Query: 304 KNIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYV 363
            NI +DPHL+S LI MYA+CG MD+A + + K+S +N+VVSTAMVSG ++ G + +AR +
Sbjct: 301 NNIEVDPHLESTLIFMYANCGCMDMAKELFFKMSSRNLVVSTAMVSGYSRVGNVKDARLI 360

Query: 364 FDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGAL 423
           FD+M +KDL+CWSAMISGY ESD PQEAL LF +MQ  G++PD VT+LSVISACAHLG L
Sbjct: 361 FDEMDKKDLVCWSAMISGYAESDQPQEALNLFNEMQALGIEPDEVTMLSVISACAHLGVL 420

Query: 424 DQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHAL 483
           DQ K I  +V+++GFG  L +NNALIDMYAKCG LE AR VF KM ++NVISWTSMI+A 
Sbjct: 421 DQAKRIHMFVNESGFGGVLPVNNALIDMYAKCGCLEAARAVFEKMQRRNVISWTSMINAF 480

Query: 484 AMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPK 543
           A+HGDA +AL+ FH+MK EN+EPN +TFVGVLYACSH GLVEEG++IF SM ++Y ISPK
Sbjct: 481 AIHGDANSALNFFHRMKDENIEPNAVTFVGVLYACSHAGLVEEGQKIFASMINDYNISPK 540

Query: 544 HEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVL 603
           HEH+GCMVDLFGRA  LREAL ++E M   PN +IWGSLMAAC++HGETELGEFAA+++L
Sbjct: 541 HEHYGCMVDLFGRAKFLREALNLVETMSLPPNVVIWGSLMAACRVHGETELGEFAAQRLL 600

Query: 604 KLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMAD 663
           +LEP HDGALV+LSNIYAKE+RW+DVG++R LM + G+ KERGCSRIEL+N VHEF  AD
Sbjct: 601 ELEPGHDGALVLLSNIYAKEKRWQDVGQIRNLMKQRGIFKERGCSRIELSNGVHEFSTAD 660

Query: 664 RNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMN 723
           R HKQAD I++KLDEVV  L   GY+P T+ VLVD++EE K E+VLWHSEKLALCY L++
Sbjct: 661 RKHKQADLIYEKLDEVVGNLKFVGYSPDTSVVLVDIEEEAKNEVVLWHSEKLALCYGLIS 720

Query: 724 EGPRIC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           +G   C  I+KNLRICEDCH FMKL SK Y  EII+RDR+RFH Y+DG+CSC DYW
Sbjct: 721 QGKGSCIRIVKNLRICEDCHNFMKLVSKAYELEIIVRDRTRFHRYKDGVCSCNDYW 723

BLAST of ClCG05G005100.1 vs. TrEMBL
Match: A0A061EMY0_THECC (Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_020645 PE=4 SV=1)

HSP 1 Score: 849.0 bits (2192), Expect = 4.6e-243
Identity = 441/770 (57.27%), Positives = 546/770 (70.91%), Query Frame = 1

Query: 9   SVLPLQLHTYPTRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCA 68
           S++   L + PT P  L   LSS+ SL HLKQ+HAQILRS      S++L+ +L+L    
Sbjct: 10  SLVSPNLKSLPT-PKTLLKTLSSSPSLTHLKQIHAQILRSN--HSHSHTLILKLLL---- 69

Query: 69  LSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPP 128
            SPSL Y+LS+F  +P P   L  + +R LSR S PEF LFVY+++R EG+ +DR+ FPP
Sbjct: 70  FSPSLPYSLSIFSHLPHPLPSLSTRFVRHLSRSSRPEFALFVYQRLRNEGIKIDRFTFPP 129

Query: 129 LLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRD 188
           LLKA +R   L  G EIHG   KLG  SDPFV+TGLV MY ACGR++EAR VFDKMS+RD
Sbjct: 130 LLKAVARVEGLAEGKEIHGFGFKLGLDSDPFVQTGLVGMYLACGRVLEARSVFDKMSYRD 189

Query: 189 VVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRY 248
           +VAWSIMIDGY                LS L   +L  +   ++   E+       I   
Sbjct: 190 IVAWSIMIDGY---------------CLSGLFDDALELFEEMKRANIEVDKFILSSILSA 249

Query: 249 C-LSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMD 308
           C   G  +    + + +    L  D  + S +++  A  G ++   K+   +  KN+V  
Sbjct: 250 CGRVGNLNHGKAIHDYIIEKILVVDSHLQSALMTMYASCGCMEMAQKLFNQMAPKNLV-- 309

Query: 309 PHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVE 368
             + +A+++ Y+                                  +I +AR +FDQMVE
Sbjct: 310 --VSTAMVSGYSR-------------------------------HRRIEDARLIFDQMVE 369

Query: 369 KDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWI 428
           KDL+CWSAMISGY ESD PQEAL LF ++Q  GM+PD VT+LSVISACAHLG L++ KWI
Sbjct: 370 KDLVCWSAMISGYAESDQPQEALRLFNELQSLGMRPDQVTMLSVISACAHLGVLEKAKWI 429

Query: 429 QTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDA 488
             Y DKNGFG AL INNALIDM+AKCGSLE AR VF KM ++NVISWTSMI+A A+HGDA
Sbjct: 430 HVYADKNGFGGALPINNALIDMHAKCGSLERARGVFEKMTRRNVISWTSMINAFAIHGDA 489

Query: 489 PNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGC 548
            NALS FH+MK  +VEPN +TFVGVLYACSH GLV+EG+RIF SM +E+ I+PKHEH+GC
Sbjct: 490 NNALSFFHKMKEAHVEPNGVTFVGVLYACSHAGLVDEGQRIFASMINEHKIAPKHEHYGC 549

Query: 549 MVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDH 608
           MVDLFGRANLLREALE++E MP APN +IWGSLMAACQIHGETELGEFAAK++L+LEPDH
Sbjct: 550 MVDLFGRANLLREALEIVETMPLAPNVVIWGSLMAACQIHGETELGEFAAKRLLELEPDH 609

Query: 609 DGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQA 668
           DGALV+LSNIYAKE++W+DVGE+R LM + G+SKE+GCSRIELNNEVHEF MADRNHKQA
Sbjct: 610 DGALVLLSNIYAKEKKWQDVGELRHLMKERGISKEKGCSRIELNNEVHEFLMADRNHKQA 669

Query: 669 DQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPRIC 728
           D+I++KLDEV+ +L L GY P T  VLVDL+EEEK+E+VLWHSEKLALCY L+N     C
Sbjct: 670 DKIYEKLDEVISQLKLVGYFPNTRSVLVDLEEEEKREVVLWHSEKLALCYGLINGEKDSC 722

Query: 729 --IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
             I+KNLR+CEDCH FMKL SK+Y REI++RDR+RFHHY+DGLCSCKDYW
Sbjct: 730 IRIVKNLRVCEDCHTFMKLVSKLYGREIVVRDRTRFHHYKDGLCSCKDYW 722

BLAST of ClCG05G005100.1 vs. TrEMBL
Match: I1LFU4_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_11G006200 PE=4 SV=2)

HSP 1 Score: 839.0 bits (2166), Expect = 4.8e-240
Identity = 431/753 (57.24%), Positives = 538/753 (71.45%), Query Frame = 1

Query: 29  LSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCAL-SPS---LDYALSVFDQIP 88
           L+S  +L H+KQ+HAQILRSK++  +SN LL +L+L  C L SPS   LDYALS+F  IP
Sbjct: 19  LASCKTLRHVKQIHAQILRSKMD--NSNLLLLKLVLCCCTLPSPSPSALDYALSLFSHIP 78

Query: 89  QPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGME 148
            P TR  N+LLRQ SRG  PE TL +Y  +R  G  LDR+ FPPLLKA S+  +L  G+E
Sbjct: 79  NPPTRFSNQLLRQFSRGPTPENTLSLYLHLRRNGFPLDRFSFPPLLKAVSKLSALNLGLE 138

Query: 149 IHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVV 208
           IHGLASK GF                               H D    S +I  Y     
Sbjct: 139 IHGLASKFGF------------------------------FHADPFIQSALIAMY----A 198

Query: 209 ALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEM 268
           A   ++        +S   +  W                 I  Y  +  YD   +L+EEM
Sbjct: 199 ACGRIMDARFLFDKMSHRDVVTWNI--------------MIDGYSQNAHYDHVLKLYEEM 258

Query: 269 KRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSM 328
           K +  EPD +IL TVLSACA AGNL +G  IH+FI      +  H+Q++L+ MYA+CG+M
Sbjct: 259 KTSGTEPDAIILCTVLSACAHAGNLSYGKAIHQFIKDNGFRVGSHIQTSLVNMYANCGAM 318

Query: 329 DLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMISGYTESD 388
            LA + Y+++  K+MVVSTAM+SG AK G + +AR++FD+MVEKDL+CWSAMISGY ES 
Sbjct: 319 HLAREVYDQLPSKHMVVSTAMLSGYAKLGMVQDARFIFDRMVEKDLVCWSAMISGYAESY 378

Query: 389 CPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFGKALSINN 448
            P EAL LF +MQ++ + PD +T+LSVISACA++GAL Q KWI TY DKNGFG+ L INN
Sbjct: 379 QPLEALQLFNEMQRRRIVPDQITMLSVISACANVGALVQAKWIHTYADKNGFGRTLPINN 438

Query: 449 ALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVEP 508
           ALIDMYAKCG+L  AR+VF  MP+KNVISW+SMI+A AMHGDA +A++LFH+MK +N+EP
Sbjct: 439 ALIDMYAKCGNLVKAREVFENMPRKNVISWSSMINAFAMHGDADSAIALFHRMKEQNIEP 498

Query: 509 NWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREALEV 568
           N +TF+GVLYACSH GLVEEG++ F SM +E+ ISP+ EH+GCMVDL+ RAN LR+A+E+
Sbjct: 499 NGVTFIGVLYACSHAGLVEEGQKFFSSMINEHRISPQREHYGCMVDLYCRANHLRKAMEL 558

Query: 569 IEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKERRW 628
           IE MPF PN IIWGSLM+ACQ HGE ELGEFAA ++L+LEPDHDGALVVLSNIYAKE+RW
Sbjct: 559 IETMPFPPNVIIWGSLMSACQNHGEIELGEFAATRLLELEPDHDGALVVLSNIYAKEKRW 618

Query: 629 EDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKLNLA 688
           +DVG VRKLM   GVSKE+ CSRIE+NNEVH F MADR HKQ+D+I++KLD VV +L L 
Sbjct: 619 DDVGLVRKLMKHKGVSKEKACSRIEVNNEVHVFMMADRYHKQSDEIYKKLDAVVSQLKLV 678

Query: 689 GYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPRIC--IIKNLRICEDCHAFM 748
           GYTP T+ +LVDL+EEEKKE+VLWHSEKLALCY L+ E    C  I+KNLRICEDCH+FM
Sbjct: 679 GYTPSTSGILVDLEEEEKKEVVLWHSEKLALCYGLIGERKESCIRIVKNLRICEDCHSFM 721

Query: 749 KLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           KL SKV+  EI++RDR+RFHH+  G+CSC+DYW
Sbjct: 739 KLVSKVHRIEIVMRDRTRFHHFNGGICSCRDYW 721

BLAST of ClCG05G005100.1 vs. TAIR10
Match: AT4G14820.1 (AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 731.1 bits (1886), Expect = 7.1e-211
Identity = 387/766 (50.52%), Positives = 509/766 (66.45%), Query Frame = 1

Query: 20  TRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSPSLDYALSV 79
           T    +   LS   SL H+KQ+HA ILR+ +     NS LF L +SS +++  L YAL+V
Sbjct: 10  TAANTILEKLSFCKSLNHIKQLHAHILRTVINH-KLNSFLFNLSVSSSSIN--LSYALNV 69

Query: 80  FDQIPQPKTRLC-NKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLS 139
           F  IP P   +  N  LR LSR SEP  T+  Y+++R  G  LD++ F P+LKA S+  +
Sbjct: 70  FSSIPSPPESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSA 129

Query: 140 LRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDG 199
           L  GME+HG+A K+    DPFVETG + MYA+CGRI  AR VFD+MSHRDVV W+ MI+ 
Sbjct: 130 LFEGMELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIER 189

Query: 200 YEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYC-LSGFYDLA 259
           Y                L   +F           +  E+I      I   C  +G     
Sbjct: 190 Y------------CRFGLVDEAFKLFEEMKDSNVMPDEMILC---NIVSACGRTGNMRYN 249

Query: 260 FQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITM 319
             ++E +   ++  D  +L+ +++  A AG +D   +    ++ +N+     + +A+++ 
Sbjct: 250 RAIYEFLIENDVRMDTHLLTALVTMYAGAGCMDMAREFFRKMSVRNL----FVSTAMVSG 309

Query: 320 YASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMI 379
           Y+ CG +D                               +A+ +FDQ  +KDL+CW+ MI
Sbjct: 310 YSKCGRLD-------------------------------DAQVIFDQTEKKDLVCWTTMI 369

Query: 380 SGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFG 439
           S Y ESD PQEAL +F++M   G+KPDVV++ SVISACA+LG LD+ KW+ + +  NG  
Sbjct: 370 SAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWVHSCIHVNGLE 429

Query: 440 KALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQM 499
             LSINNALI+MYAKCG L+  R VF KMP++NV+SW+SMI+AL+MHG+A +ALSLF +M
Sbjct: 430 SELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDALSLFARM 489

Query: 500 KVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANL 559
           K ENVEPN +TFVGVLY CSH GLVEEG++IF SMTDEY I+PK EH+GCMVDLFGRANL
Sbjct: 490 KQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDLFGRANL 549

Query: 560 LREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNI 619
           LREALEVIE+MP A N +IWGSLM+AC+IHGE ELG+FAAK++L+LEPDHDGALV++SNI
Sbjct: 550 LREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGALVLMSNI 609

Query: 620 YAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEV 679
           YA+E+RWEDV  +R++M +  V KE+G SRI+ N + HEF + D+ HKQ+++I+ KLDEV
Sbjct: 610 YAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIYAKLDEV 669

Query: 680 VQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPR--------ICII 739
           V KL LAGY P    VLVD++EEEKK+LVLWHSEKLALC+ LMNE           I I+
Sbjct: 670 VSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEEEKDSCGVIRIV 722

Query: 740 KNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           KNLR+CEDCH F KL SKVY REII+RDR+RFH Y++GLCSC+DYW
Sbjct: 730 KNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of ClCG05G005100.1 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 549.7 bits (1415), Expect = 2.9e-156
Identity = 294/754 (38.99%), Positives = 449/754 (59.55%), Query Frame = 1

Query: 29  LSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSP---SLDYALSVFDQIPQ 88
           L +  +L  L+ +HAQ+++  L   ++N  L +LI   C LSP    L YA+SVF  I +
Sbjct: 40  LHNCKTLQSLRIIHAQMIKIGLH--NTNYALSKLI-EFCILSPHFEGLPYAISVFKTIQE 99

Query: 89  PKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGMEI 148
           P   + N + R  +  S+P   L +Y  M + GL  + Y FP +LK+ +++ + + G +I
Sbjct: 100 PNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQI 159

Query: 149 HGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVVA 208
           HG   KLG   D +V T L+ MY   GR+ +A  VFDK  HRDVV+++ +I GY +    
Sbjct: 160 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRG-- 219

Query: 209 LHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEMK 268
                    Y+        NA     +I  + + +    I  Y  +G Y  A +LF++M 
Sbjct: 220 ---------YIE-------NAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMM 279

Query: 269 RTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSMD 328
           +T + PDE  + TV+SACA++G+++ G ++H +I       +  + +ALI +Y+ CG ++
Sbjct: 280 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 339

Query: 329 LAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLICWSAMISGYTESDC 388
            A   +E++                                 KD+I W+ +I GYT  + 
Sbjct: 340 TACGLFERLP-------------------------------YKDVISWNTLIGGYTHMNL 399

Query: 389 PQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKN--GFGKALSIN 448
            +EAL+LF++M + G  P+ VT+LS++ ACAHLGA+D G+WI  Y+DK   G   A S+ 
Sbjct: 400 YKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLR 459

Query: 449 NALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVE 508
            +LIDMYAKCG +E A +VF  +  K++ SW +MI   AMHG A  +  LF +M+   ++
Sbjct: 460 TSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQ 519

Query: 509 PNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREALE 568
           P+ ITFVG+L ACSH G+++ GR IF +MT +Y ++PK EH+GCM+DL G + L +EA E
Sbjct: 520 PDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEE 579

Query: 569 VIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKERR 628
           +I  M   P+ +IW SL+ AC++HG  ELGE  A+ ++K+EP++ G+ V+LSNIYA   R
Sbjct: 580 MINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGR 639

Query: 629 WEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKLNL 688
           W +V + R L+   G+ K  GCS IE+++ VHEF + D+ H +  +I+  L+E+   L  
Sbjct: 640 WNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEK 699

Query: 689 AGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMN--EGPRICIIKNLRICEDCHAF 748
           AG+ P T+ VL +++EE K+  +  HSEKLA+ + L++   G ++ I+KNLR+C +CH  
Sbjct: 700 AGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEA 741

Query: 749 MKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
            KL SK+Y REII RDR+RFHH+RDG+CSC DYW
Sbjct: 760 TKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of ClCG05G005100.1 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 482.6 bits (1241), Expect = 4.4e-136
Identity = 232/528 (43.94%), Positives = 352/528 (66.67%), Query Frame = 1

Query: 252 GFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQ 311
           G  D A +LF++M+  +++   + +  VLSACA+  NL+FG ++  +I +  + ++  L 
Sbjct: 211 GSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLA 270

Query: 312 SALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQMVEKDLI 371
           +A++ MY  CGS++ A   ++ +  K+ V  T M+ G A       AR V + M +KD++
Sbjct: 271 NAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIV 330

Query: 372 CWSAMISGYTESDCPQEALVLFKKMQ-QQGMKPDVVTILSVISACAHLGALDQGKWIQTY 431
            W+A+IS Y ++  P EAL++F ++Q Q+ MK + +T++S +SACA +GAL+ G+WI +Y
Sbjct: 331 AWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSY 390

Query: 432 VDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNA 491
           + K+G      + +ALI MY+KCG LE +R+VF  + K++V  W++MI  LAMHG    A
Sbjct: 391 IKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEA 450

Query: 492 LSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVD 551
           + +F++M+  NV+PN +TF  V  ACSH GLV+E   +FH M   YGI P+ +H+ C+VD
Sbjct: 451 VDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVD 510

Query: 552 LFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGA 611
           + GR+  L +A++ IEAMP  P+  +WG+L+ AC+IH    L E A  ++L+LEP +DGA
Sbjct: 511 VLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGA 570

Query: 612 LVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQI 671
            V+LSNIYAK  +WE+V E+RK M   G+ KE GCS IE++  +HEF   D  H  ++++
Sbjct: 571 HVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKV 630

Query: 672 HQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVL-WHSEKLALCYALMN-EGPRIC- 731
           + KL EV++KL   GY P+ + VL  ++EEE KE  L  HSEKLA+CY L++ E P++  
Sbjct: 631 YGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIR 690

Query: 732 IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           +IKNLR+C DCH+  KL S++Y REII+RDR RFHH+R+G CSC D+W
Sbjct: 691 VIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738


HSP 2 Score: 348.2 bits (892), Expect = 1.3e-95
Identity = 249/779 (31.96%), Positives = 389/779 (49.94%), Query Frame = 1

Query: 34  SLFHLKQVHAQILRSKL--ERCDSNSLLFELILSSCALSPSLDYALSVFDQIPQPKTRLC 93
           SL  LKQ H  ++R+    +   ++ L     LSS A   SL+YA  VFD+IP+P +   
Sbjct: 42  SLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFA---SLEYARKVFDEIPKPNSFAW 101

Query: 94  NKLLRQLSRGSEPEFTLFVYEKMRAEGLSL-DRYCFPPLLKAASRNLSLRTGMEIHGLAS 153
           N L+R  + G +P  +++ +  M +E     ++Y FP L+KAA+   SL  G  +HG+A 
Sbjct: 102 NTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAV 161

Query: 154 KLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGY----------- 213
           K   GSD FV   L+  Y +CG +  A  VF  +  +DVV+W+ MI+G+           
Sbjct: 162 KSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALE 221

Query: 214 -----EANVVALHLLVKVNVYLSILSFSSLN------AWLFRRQITAELIFAPYPQIYRY 273
                E+  V    +  V V  +     +L       +++   ++   L  A    +  Y
Sbjct: 222 LFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLA-NAMLDMY 281

Query: 274 CLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDP 333
              G  + A +LF+ M+    E D +  +T+L   A + + +   ++   + +K+IV   
Sbjct: 282 TKCGSIEDAKRLFDAME----EKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIV--- 341

Query: 334 HLQSALITMYASCGSMDLAW-DFYEKISPKNMVVS-TAMVSGLAKGGQIGEAR------- 393
              +ALI+ Y   G  + A   F+E    KNM ++   +VS L+   Q+G          
Sbjct: 342 -AWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHS 401

Query: 394 YVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLG 453
           Y+    +  +    SA+I  Y++    +++  +F  ++    K DV    ++I   A  G
Sbjct: 402 YIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVE----KRDVFVWSAMIGGLAMHG 461

Query: 454 ALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIH 513
                                   N  +DM+ K               K N +++T++  
Sbjct: 462 C----------------------GNEAVDMFYKMQEAN---------VKPNGVTFTNVFC 521

Query: 514 ALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGIS 573
           A +  G    A SLFHQM+                  S+ G+V E               
Sbjct: 522 ACSHTGLVDEAESLFHQME------------------SNYGIVPE--------------- 581

Query: 574 PKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQ 633
              +H+ C+VD+ GR+  L +A++ IEAMP  P+  +WG+L+ AC+IH    L E A  +
Sbjct: 582 --EKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTR 641

Query: 634 VLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQM 693
           +L+LEP +DGA V+LSNIYAK  +WE+V E+RK M   G+ KE GCS IE++  +HEF  
Sbjct: 642 LLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLS 701

Query: 694 ADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVL-WHSEKLALCYA 753
            D  H  +++++ KL EV++KL   GY P+ + VL  ++EEE KE  L  HSEKLA+CY 
Sbjct: 702 GDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYG 738

Query: 754 LMN-EGPRIC-IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           L++ E P++  +IKNLR+C DCH+  KL S++Y REII+RDR RFHH+R+G CSC D+W
Sbjct: 762 LISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of ClCG05G005100.1 vs. TAIR10
Match: AT2G22070.1 (AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 471.9 bits (1213), Expect = 7.8e-133
Identity = 265/724 (36.60%), Positives = 411/724 (56.77%), Query Frame = 1

Query: 63  ILSSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLD 122
           +LS+ +    +D     FDQ+PQ  +     ++       +    + V   M  EG+   
Sbjct: 86  VLSAYSKRGDMDSTCEFFDQLPQRDSVSWTTMIVGYKNIGQYHKAIRVMGDMVKEGIEPT 145

Query: 123 RYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFD 182
           ++    +L + +    + TG ++H    KLG   +  V   L+ MYA CG  M A+ VFD
Sbjct: 146 QFTLTNVLASVAATRCMETGKKVHSFIVKLGLRGNVSVSNSLLNMYAKCGDPMMAKFVFD 205

Query: 183 KMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPY 242
           +M  RD+ +W+ MI        ALH+ V   + L++  F  +      R I         
Sbjct: 206 RMVVRDISSWNAMI--------ALHMQVG-QMDLAMAQFEQMA----ERDIVT------- 265

Query: 243 PQIYRYCLSGF----YDL-AFQLFEEMKRTEL-EPDEMILSTVLSACARAGNLDFGTKIH 302
              +   +SGF    YDL A  +F +M R  L  PD   L++VLSACA    L  G +IH
Sbjct: 266 ---WNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCIGKQIH 325

Query: 303 EFITKKNIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVS--TAMVSGLAKGGQ 362
             I      +   + +ALI+MY+ CG ++ A    E+   K++ +   TA++ G  K G 
Sbjct: 326 SHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGYIKLGD 385

Query: 363 IGEARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISA 422
           + +A+ +F  + ++D++ W+AMI GY +     EA+ LF+ M   G +P+  T+ +++S 
Sbjct: 386 MNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLAAMLSV 445

Query: 423 CAHLGALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMP-KKNVIS 482
            + L +L  GK I     K+G   ++S++NALI MYAK G++  A + F  +  +++ +S
Sbjct: 446 ASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSASRAFDLIRCERDTVS 505

Query: 483 WTSMIHALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMT 542
           WTSMI ALA HG A  AL LF  M +E + P+ IT+VGV  AC+H GLV +GR+ F  M 
Sbjct: 506 WTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTHAGLVNQGRQYFDMMK 565

Query: 543 DEYGISPKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELG 602
           D   I P   H+ CMVDLFGRA LL+EA E IE MP  P+ + WGSL++AC++H   +LG
Sbjct: 566 DVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWGSLLSACRVHKNIDLG 625

Query: 603 EFAAKQVLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNE 662
           + AA+++L LEP++ GA   L+N+Y+   +WE+  ++RK M    V KE+G S IE+ ++
Sbjct: 626 KVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGRVKKEQGFSWIEVKHK 685

Query: 663 VHEFQMADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKL 722
           VH F + D  H + ++I+  + ++  ++   GY P T  VL DL+EE K++++  HSEKL
Sbjct: 686 VHVFGVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDLEEEVKEQILRHHSEKL 745

Query: 723 ALCYALMN--EGPRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSC 776
           A+ + L++  +   + I+KNLR+C DCH  +K  SK+  REII+RD +RFHH++DG CSC
Sbjct: 746 AIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREIIVRDTTRFHHFKDGFCSC 786


HSP 2 Score: 156.4 bits (394), Expect = 7.3e-38
Identity = 126/522 (24.14%), Positives = 227/522 (43.49%), Query Frame = 1

Query: 125 CFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKM 184
           C   L K+ +++    T   +H    K G     ++   L+ +Y+  G  + AR +FD+M
Sbjct: 16  CTNLLQKSVNKSNGRFTAQLVHCRVIKSGLMFSVYLMNNLMNVYSKTGYALHARKLFDEM 75

Query: 185 SHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQ 244
             R   +W+ ++  Y           + ++  +   F  L     R  ++   +   Y  
Sbjct: 76  PLRTAFSWNTVLSAYSK---------RGDMDSTCEFFDQLPQ---RDSVSWTTMIVGYKN 135

Query: 245 IYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNI 304
           I      G Y  A ++  +M +  +EP +  L+ VL++ A    ++ G K+H FI K  +
Sbjct: 136 I------GQYHKAIRVMGDMVKEGIEPTQFTLTNVLASVAATRCMETGKKVHSFIVKLGL 195

Query: 305 VMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQ 364
             +  + ++L+ MYA CG   +A   ++++  +++    AM++   + GQ+  A   F+Q
Sbjct: 196 RGNVSVSNSLLNMYAKCGDPMMAKFVFDRMVVRDISSWNAMIALHMQVGQMDLAMAQFEQ 255

Query: 365 MVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGM-KPDVVTILSVISACAHL----- 424
           M E+D++ W++MISG+ +      AL +F KM +  +  PD  T+ SV+SACA+L     
Sbjct: 256 MAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEKLCI 315

Query: 425 GALDQGKWIQTYVDKNGF--------------------------GKALSINN--ALIDMY 484
           G       + T  D +G                            K L I    AL+D Y
Sbjct: 316 GKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRGTKDLKIEGFTALLDGY 375

Query: 485 AKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVENVEPNWITFV 544
            K G +  A+ +F  +  ++V++WT+MI     HG    A++LF  M      PN  T  
Sbjct: 376 IKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFRSMVGGGQRPNSYTLA 435

Query: 545 GVLYACSHGGLVEEGRRIFHSMT---DEYGISPKHEHFGCMVDLFGRANLLREALEVIEA 604
            +L   S    +  G++I  S     + Y +S  +     ++ ++ +A  +  A    + 
Sbjct: 436 AMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSN----ALITMYAKAGNITSASRAFDL 495

Query: 605 MPFAPNAIIWGSLMAACQIHGETE--LGEFAAKQVLKLEPDH 608
           +    + + W S++ A   HG  E  L  F    +  L PDH
Sbjct: 496 IRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDH 515

BLAST of ClCG05G005100.1 vs. TAIR10
Match: AT1G20230.1 (AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 471.1 bits (1211), Expect = 1.3e-132
Identity = 253/756 (33.47%), Positives = 440/756 (58.20%), Query Frame = 1

Query: 26  SAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSPSLDYALSVFDQIPQ 85
           S++   +SSL    Q HA+IL+S  +   ++  +   +++S +     + A  V   IP 
Sbjct: 22  SSSYHWSSSLSKTTQAHARILKSGAQ---NDGYISAKLIASYSNYNCFNDADLVLQSIPD 81

Query: 86  PKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLKAASRNLSLRTGMEI 145
           P     + L+  L++      ++ V+ +M + GL  D +  P L K  +   + + G +I
Sbjct: 82  PTIYSFSSLIYALTKAKLFTQSIGVFSRMFSHGLIPDSHVLPNLFKVCAELSAFKVGKQI 141

Query: 146 HGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVAWSIMIDGYEANVVA 205
           H ++   G   D FV+  +  MY  CGR+ +AR VFD+MS +DVV  S ++  Y A    
Sbjct: 142 HCVSCVSGLDMDAFVQGSMFHMYMRCGRMGDARKVFDRMSDKDVVTCSALLCAY-ARKGC 201

Query: 206 LHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQIYRYCLSGFYDLAFQLFEEMK 265
           L  +V++   LS +  S + A +    ++   I + + +      SG++  A  +F+++ 
Sbjct: 202 LEEVVRI---LSEMESSGIEANI----VSWNGILSGFNR------SGYHKEAVVMFQKIH 261

Query: 266 RTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIVMDPHLQSALITMYASCGSMD 325
                PD++ +S+VL +   +  L+ G  IH ++ K+ ++ D  + SA+I MY   G + 
Sbjct: 262 HLGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKSGHVY 321

Query: 326 LAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFD----QMVEKDLICWSAMISGYT 385
                + +       V  A ++GL++ G + +A  +F+    Q +E +++ W+++I+G  
Sbjct: 322 GIISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSIIAGCA 381

Query: 386 ESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGKWIQTYVDKNGFGKALS 445
           ++    EAL LF++MQ  G+KP+ VTI S++ AC ++ AL  G+    +  +      + 
Sbjct: 382 QNGKDIEALELFREMQVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLLDNVH 441

Query: 446 INNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHGDAPNALSLFHQMKVEN 505
           + +ALIDMYAKCG +  ++ VF  MP KN++ W S+++  +MHG A   +S+F  +    
Sbjct: 442 VGSALIDMYAKCGRINLSQIVFNMMPTKNLVCWNSLMNGFSMHGKAKEVMSIFESLMRTR 501

Query: 506 VEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHFGCMVDLFGRANLLREA 565
           ++P++I+F  +L AC   GL +EG + F  M++EYGI P+ EH+ CMV+L GRA  L+EA
Sbjct: 502 LKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEA 561

Query: 566 LEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEPDHDGALVVLSNIYAKE 625
            ++I+ MPF P++ +WG+L+ +C++    +L E AA+++  LEP++ G  V+LSNIYA +
Sbjct: 562 YDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAK 621

Query: 626 RRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHKQADQIHQKLDEVVQKL 685
             W +V  +R  M  +G+ K  GCS I++ N V+     D++H Q DQI +K+DE+ +++
Sbjct: 622 GMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISKEM 681

Query: 686 NLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMN--EGPRICIIKNLRICEDCH 745
             +G+ P  ++ L D++E+E+++++  HSEKLA+ + L+N  +G  + +IKNLRIC DCH
Sbjct: 682 RKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCH 741

Query: 746 AFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
           A +K  S    REI IRD +RFHH++DG+CSC D+W
Sbjct: 742 AVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDFW 760

BLAST of ClCG05G005100.1 vs. NCBI nr
Match: gi|1009141445|ref|XP_015888199.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14820 [Ziziphus jujuba])

HSP 1 Score: 935.6 bits (2417), Expect = 5.4e-269
Identity = 485/780 (62.18%), Positives = 572/780 (73.33%), Query Frame = 1

Query: 1   MEMLSHSTSVLPLQLHTYPTRP--TALSAALSSASSLFHLKQVHAQILRSKLERCDSNSL 60
           M  L+ +T  LP      P     + L  ALS+++++  LKQVHAQILRSKL+R  SN L
Sbjct: 1   MSALAQTTLALPPNPSFTPNSAAYSTLFTALSTSTTITQLKQVHAQILRSKLDR--SNPL 60

Query: 61  LFELILSSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEG 120
           L +L+LSSC LSPSLDYALSVF+QI  P T+ CNK LR+LSR +EP   L VY KMR+EG
Sbjct: 61  LIKLVLSSCVLSPSLDYALSVFNQISNPPTQFCNKFLRELSRRAEPSKALLVYGKMRSEG 120

Query: 121 LS-LDRYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEA 180
           L  +DR+ FPP+LKA SR  +L  GMEIHG+ASKLGF                       
Sbjct: 121 LGGVDRFSFPPILKAVSRAEALTEGMEIHGVASKLGF----------------------- 180

Query: 181 RLVFDKMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAEL 240
                    +D    + ++  Y     A   +++  +    +S   +  W          
Sbjct: 181 --------DKDPFVQTGLVRMY----AACGRIMEARLMFDKMSHRDVVTWSI-------- 240

Query: 241 IFAPYPQIYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHE 300
                  I  YC SG +D  F LFEEMK + +EPD MILSTVLSAC RAGNL +G  IH+
Sbjct: 241 ------MIDGYCQSGLFDYVFHLFEEMKSSSVEPDGMILSTVLSACGRAGNLGYGRAIHD 300

Query: 301 FITKKNIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGE 360
           FIT+ N+V+D HL SAL+ MYASCGSMDLA  FY K+SPK++V STAMVSG +K GQI +
Sbjct: 301 FITENNVVLDSHLNSALVAMYASCGSMDLARQFYNKMSPKSLVASTAMVSGYSKLGQIED 360

Query: 361 ARYVFDQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAH 420
           AR +F+Q+VEKDLICWSAMISGY ESD PQEAL LF +MQ  G++PD VTILSVISACAH
Sbjct: 361 ARLIFNQLVEKDLICWSAMISGYAESDLPQEALRLFNEMQVLGIRPDQVTILSVISACAH 420

Query: 421 LGALDQGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSM 480
           LGALDQ  WI  YVDKNGF  AL +NNALIDMYAKCGSLE A+ VF +MP+KNVISWTSM
Sbjct: 421 LGALDQANWIHIYVDKNGFWGALPVNNALIDMYAKCGSLERAKGVFERMPRKNVISWTSM 480

Query: 481 IHALAMHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYG 540
           I A AMHGDA NALS F++MK EN+EPN +TFVGVLYACSH GLVEEGR  F SM  EY 
Sbjct: 481 ISAFAMHGDANNALSFFNRMKDENIEPNGVTFVGVLYACSHAGLVEEGRNFFASMIREYN 540

Query: 541 ISPKHEHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAA 600
           ++PKHEH+GCMVDLFGRANLLREALEV+EAMP APN +IWGSLMAAC+IHGE ELGEFAA
Sbjct: 541 LTPKHEHYGCMVDLFGRANLLREALEVVEAMPMAPNVVIWGSLMAACRIHGENELGEFAA 600

Query: 601 KQVLKLEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEF 660
           KQ+L+L+PDHDGALVVLSNIYAK++RW+DV +VR LM   G+ KERG SRIELNNEV+EF
Sbjct: 601 KQLLELDPDHDGALVVLSNIYAKQKRWDDVRKVRNLMKNSGIFKERGYSRIELNNEVYEF 660

Query: 661 QMADRNHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCY 720
            M DR HKQADQI++KLD+VV +L L GY P T  VLVDL+EEEKKE+VLWHSEKLALCY
Sbjct: 661 LMGDRKHKQADQIYEKLDKVVSELKLVGYAPNTCSVLVDLEEEEKKEVVLWHSEKLALCY 720

Query: 721 ALM--NEGPRICIIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
            L+       I I+KNLRICEDCH FMKL SKVY +EI+IRDR+RFHHY+DG+CSCKDYW
Sbjct: 721 GLICDKNASSIRIVKNLRICEDCHTFMKLVSKVYGKEIVIRDRTRFHHYKDGVCSCKDYW 729
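
Each hit opens with a two-line HSP summary giving the bit score, raw score, E-value, and the identity and positive counts over the alignment length. The sketch below parses that summary with a regular expression and recomputes the percentages from the counts. The function name `parse_hsp_header` and the dictionary keys are illustrative assumptions, and the pattern is written only for the layout used on this page.

```python
import re

# Pattern for the two-line HSP summary. The "Positives" label is matched
# loosely so that a missing "i" in the label does not break parsing.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\),\s*"
    r"Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<alen>\d+) \([\d.]+%\),\s*"
    r"Pos(?:i)?tives = (?P<pos>\d+)/\d+ \([\d.]+%\)"
)


def parse_hsp_header(text: str) -> dict:
    """Extract scores and counts from an HSP summary and derive percentages."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    ident, pos, alen = int(m["ident"]), int(m["pos"]), int(m["alen"])
    return {
        "bit_score": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "identities": ident,
        "positives": pos,
        "align_len": alen,
        # Percentages are simply count / alignment length.
        "identity_pct": round(100 * ident / alen, 2),
        "positive_pct": round(100 * pos / alen, 2),
    }


# Summary lines of the Ziziphus jujuba hit, copied from this page.
header = (
    "HSP 1 Score: 935.6 bits (2417), Expect = 5.4e-269\n"
    "Identity = 485/780 (62.18%), Positives = 572/780 (73.33%), Query Frame = 1"
)
hsp = parse_hsp_header(header)
print(hsp["identity_pct"], hsp["positive_pct"])
```

The recomputed values agree with the percentages printed in the summary, which confirms that both are taken over the full alignment length (780 columns, gaps included) rather than over the query length.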

BLAST of ClCG05G005100.1 vs. NCBI nr
Match: gi|225432698|ref|XP_002278762.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14820 [Vitis vinifera])

HSP 1 Score: 910.6 bits (2352), Expect = 1.9e-261
Identity = 472/772 (61.14%), Positives = 568/772 (73.58%), Query Frame = 1

Query: 12  PLQLHTYPTRPTALSAALSSASSLFHLKQVHAQILRSKLERCDSNSLLFELILSSCALSP 71
           P  LH++ T    L +ALSSA+SL HLKQVHAQILRSKL+R  S SLL +L++SSCALS 
Sbjct: 17  PTTLHSHHT----LFSALSSATSLTHLKQVHAQILRSKLDR--STSLLVKLVISSCALSS 76

Query: 72  SLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRAEGLSLDRYCFPPLLK 131
           SLDYALSVF+ IP+P+T LCN+ LR+LSR  EPE TL VYE+MR +GL++DR+ FPPLLK
Sbjct: 77  SLDYALSVFNLIPKPETHLCNRFLRELSRSEEPEKTLLVYERMRTQGLAVDRFSFPPLLK 136

Query: 132 AASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKMSHRDVVA 191
           A SR  SL  G+EIHGLA+KLGF SDPFV+TGLVRMYAACGRI EARL+FDKM HRDVV 
Sbjct: 137 ALSRVKSLVEGLEIHGLAAKLGFDSDPFVQTGLVRMYAACGRIAEARLMFDKMFHRDVVT 196

Query: 192 WSIMIDGY------EANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQI 251
           WSIMIDGY         ++    +   NV    +  S++ +   R               
Sbjct: 197 WSIMIDGYCQSGLFNDALLLFEEMKNYNVEPDEMMLSTVLSACGR--------------- 256

Query: 252 YRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNIV 311
                +G       + + +    +  D  + S +++  A  G++D    + E +T KN+V
Sbjct: 257 -----AGNLSYGKMIHDFIMENNIVVDPHLQSALVTMYASCGSMDLALNLFEKMTPKNLV 316

Query: 312 MDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQM 371
                 +A++T Y                               +K GQI  AR VF+QM
Sbjct: 317 ----ASTAMVTGY-------------------------------SKLGQIENARSVFNQM 376

Query: 372 VEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQGK 431
           V+KDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VT+LSVI+ACAHLGALDQ K
Sbjct: 377 VKKDLVCWSAMISGYAESDSPQEALNLFNEMQSLGIKPDQVTMLSVITACAHLGALDQAK 436

Query: 432 WIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMHG 491
           WI  +VDKNGFG AL INNALI+MYAKCGSLE AR++F KMP+KNVISWT MI A AMHG
Sbjct: 437 WIHLFVDKNGFGGALPINNALIEMYAKCGSLERARRIFDKMPRKNVISWTCMISAFAMHG 496

Query: 492 DAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEHF 551
           DA +AL  FHQM+ EN+EPN ITFVGVLYACSH GLVEEGR+IF+SM +E+ I+PKH H+
Sbjct: 497 DAGSALRFFHQMEDENIEPNGITFVGVLYACSHAGLVEEGRKIFYSMINEHNITPKHVHY 556

Query: 552 GCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLEP 611
           GCMVDLFGRANLLREALE++EAMP APN IIWGSLMAAC++HGE ELGEFAAK++L+L+P
Sbjct: 557 GCMVDLFGRANLLREALELVEAMPLAPNVIIWGSLMAACRVHGEIELGEFAAKRLLELDP 616

Query: 612 DHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNHK 671
           DHDGA V LSNIYAK RRWEDVG+VRKLM   G+SKERGCSR ELNNE+HEF +ADR+HK
Sbjct: 617 DHDGAHVFLSNIYAKARRWEDVGQVRKLMKHKGISKERGCSRFELNNEIHEFLVADRSHK 676

Query: 672 QADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGPR 731
            AD+I++KL EVV KL L GY+P T  +LVDL+EEEKKE+VLWHSEKLALCY LM +G  
Sbjct: 677 HADEIYEKLYEVVSKLKLVGYSPNTCSILVDLEEEEKKEVVLWHSEKLALCYGLMRDGTG 727

Query: 732 IC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
            C  IIKNLR+CEDCH F+KLASKVY REI++RDR+RFHHY+DG+CSCKDYW
Sbjct: 737 SCIRIIKNLRVCEDCHTFIKLASKVYEREIVVRDRTRFHHYKDGVCSCKDYW 727

BLAST of ClCG05G005100.1 vs. NCBI nr
Match: gi|566189984|ref|XP_002315764.2| (hypothetical protein POPTR_0010s09620g [Populus trichocarpa])

HSP 1 Score: 880.2 bits (2273), Expect = 2.7e-252
Identity = 462/773 (59.77%), Positives = 555/773 (71.80%), Query Frame = 1

Query: 17  TYPTRPT-----ALSAALSSAS-----SLFHLKQVHAQILRSKLERCDSNSLLFELILSS 76
           T PT P      AL AALSS S     SL HLKQ+HAQ+LRS L      SLL EL+LSS
Sbjct: 6   TLPTIPVPLTSIALHAALSSTSTSTPTSLPHLKQIHAQVLRSNLPP----SLLLELLLSS 65

Query: 77  CALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRA-EGL-SLDRY 136
                SLDYALSVF  +P+  T L NKL R LSR ++PE  L  YEK+R  EGL  +DR+
Sbjct: 66  S----SLDYALSVFTHLPKCHTPLSNKLFRSLSRSAKPETALLAYEKIRLKEGLLGIDRF 125

Query: 137 CFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFDKM 196
            FPPLLKAASR   L  G EIHG+A+KLGF                              
Sbjct: 126 SFPPLLKAASRASGLNEGKEIHGVATKLGF------------------------------ 185

Query: 197 SHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPYPQ 256
             +D    + ++  Y     +   + +  +    +S+  + AW                 
Sbjct: 186 -DKDPFVQTGLVGMY----ASCDRISEARLVFDKMSYRDVVAWSI--------------M 245

Query: 257 IYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKKNI 316
           I  Y  SG YD   QLFEEM+ + L+PDEM+L+T++SAC RA NL +G  IH+FI + N 
Sbjct: 246 IDGYHQSGLYDDVLQLFEEMRSSNLKPDEMVLTTIISACGRARNLSYGEAIHDFIIENNF 305

Query: 317 VMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVFDQ 376
           V+D +LQSAL+TMYASCG M++A   + KIS +N+VV TAM+SG ++ G++ +AR +FDQ
Sbjct: 306 VLDTYLQSALLTMYASCGCMEMAQKLFTKISSRNLVVLTAMISGYSRVGRVEDARLIFDQ 365

Query: 377 MVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALDQG 436
           M EKDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VTILSVISACA LG LD+ 
Sbjct: 366 MEEKDLVCWSAMISGYAESDKPQEALNLFSEMQVFGIKPDQVTILSVISACARLGVLDRA 425

Query: 437 KWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALAMH 496
           KWI  YVDKNG G AL +NNALIDMYAKCG+L  AR VF KM  +NVISWTSMI+A A+H
Sbjct: 426 KWIHMYVDKNGLGGALPVNNALIDMYAKCGNLGAARGVFEKMQSRNVISWTSMINAFAIH 485

Query: 497 GDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKHEH 556
           GDA NAL  F+QMK EN++PN +TFVGVLYACSH GLVEEGRR F SMT+E+ I+PKHEH
Sbjct: 486 GDASNALKFFYQMKDENIKPNGVTFVGVLYACSHAGLVEEGRRTFASMTNEHNITPKHEH 545

Query: 557 FGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLKLE 616
           +GCMVDLFGRANLLR+ALE++E MP APN +IWGSLMAACQIHGE ELGEFAAKQVL+LE
Sbjct: 546 YGCMVDLFGRANLLRDALELVETMPLAPNVVIWGSLMAACQIHGENELGEFAAKQVLELE 605

Query: 617 PDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADRNH 676
           PDHDGALV LSNIYAK+RRW+DVGE+R LM + G+SKERGCSRIELNN+V+EF MAD+ H
Sbjct: 606 PDHDGALVQLSNIYAKDRRWQDVGELRNLMKQRGISKERGCSRIELNNQVYEFVMADKKH 665

Query: 677 KQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNEGP 736
           KQAD+I++KLDEVV++L L GYTP T  VLVD++EE KKE+VLWHSEKLALCY LM EG 
Sbjct: 666 KQADKIYEKLDEVVKELKLVGYTPNTRSVLVDVEEEGKKEVVLWHSEKLALCYGLMGEGK 721

Query: 737 RIC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
             C  I+KNLR+CEDCH F+KL SKVY  EII+RDR+RFHHY+ G+CSC DYW
Sbjct: 726 GSCIRIVKNLRVCEDCHTFIKLVSKVYGMEIIVRDRTRFHHYKAGVCSCNDYW 721

BLAST of ClCG05G005100.1 vs. NCBI nr
Match: gi|743859751|ref|XP_011030676.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14820-like [Populus euphratica])

HSP 1 Score: 877.5 bits (2266), Expect = 1.7e-251
Identity = 460/775 (59.35%), Positives = 553/775 (71.35%), Query Frame = 1

Query: 8   TSVLPLQLHTYPTRPTALSAALSSA---SSLFHLKQVHAQILRSKLERCDSNSLLFELIL 67
           ++V  L     P   TAL AALSS    +SL HLKQ+HAQ+LRS L      SLL +L+L
Sbjct: 17  STVTTLPTIPVPLTSTALQAALSSTPTPTSLPHLKQIHAQVLRSNLPP----SLLLKLLL 76

Query: 68  SSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRA-EGL-SLD 127
           SS     SLDYALSVF  +P+  T L NKL R LSR  +PE  L  YEK+R  EGL  +D
Sbjct: 77  SSS----SLDYALSVFTHLPKCHTPLSNKLFRSLSRSDKPETALLAYEKIRLKEGLLGID 136

Query: 128 RYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFD 187
           R+ FPPLLKAASR   L  G EIHG+A+KLGF                            
Sbjct: 137 RFSFPPLLKAASRASGLNEGKEIHGVATKLGF---------------------------- 196

Query: 188 KMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPY 247
               +D    + ++  Y     A   + +  +    +S+  + AW               
Sbjct: 197 ---DKDPFVQTGLVGMY----AACDRISEARLVFDRMSYRDVVAWSI------------- 256

Query: 248 PQIYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKK 307
             I  Y  SG YD   QLFEEM+ + L+PDEM+L+T++SAC RA NL +G  IH+FI + 
Sbjct: 257 -MIDGYHQSGLYDDVLQLFEEMRSSNLKPDEMVLTTIISACGRARNLSYGEAIHDFIIEN 316

Query: 308 NIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVF 367
           N V+D +LQSAL+TMYASCG M++A   +  IS +N+VV TAM+SG ++ G++ +AR +F
Sbjct: 317 NFVLDTYLQSALLTMYASCGCMEMAQKLFTNISSRNLVVLTAMISGFSRVGRVEDARLIF 376

Query: 368 DQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALD 427
           DQM EKDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VTILSVISACA LG LD
Sbjct: 377 DQMEEKDLVCWSAMISGYAESDKPQEALNLFSEMQVFGIKPDQVTILSVISACARLGVLD 436

Query: 428 QGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALA 487
           + KWI  YVDKNG G AL +NNALIDMYAKCG+L  AR VF KM  +NVISWTSMI+A A
Sbjct: 437 RAKWIHMYVDKNGLGGALPVNNALIDMYAKCGNLGAARGVFEKMQSRNVISWTSMINAFA 496

Query: 488 MHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKH 547
           +HGDA NAL  F QMK EN++PN +TFVGVLYACSH GLVEEGRR F SMT+E+ I+PKH
Sbjct: 497 IHGDASNALKFFCQMKDENIKPNGVTFVGVLYACSHAGLVEEGRRAFASMTNEHNITPKH 556

Query: 548 EHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLK 607
           EH+GCMVDLFGRANLLR+ALE++E MP APN +IWGSLMAACQIHGE ELGEFAAKQVL+
Sbjct: 557 EHYGCMVDLFGRANLLRDALELVETMPLAPNVVIWGSLMAACQIHGENELGEFAAKQVLE 616

Query: 608 LEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADR 667
           LEPDHDGALV LSNIYAK+RRW+DVGE+R LM + G+SKERGCS IELNN+VHEF MAD+
Sbjct: 617 LEPDHDGALVQLSNIYAKDRRWQDVGELRNLMKQRGISKERGCSWIELNNQVHEFVMADK 676

Query: 668 NHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNE 727
            HKQAD+I++KLDEVV++L L GYTP TN VLVD++EE KKE+VLWHSEKLALCY LM E
Sbjct: 677 KHKQADKIYEKLDEVVKELKLVGYTPNTNSVLVDVEEEGKKEVVLWHSEKLALCYGLMGE 734

Query: 728 GPRIC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
               C  I+KNLR+CEDCH F+KL SKVY  EII+RDR+RFHHY+ G+CSC DYW
Sbjct: 737 AKGSCIRIVKNLRVCEDCHTFIKLVSKVYGMEIIVRDRTRFHHYKAGVCSCNDYW 734

BLAST of ClCG05G005100.1 vs. NCBI nr
Match: gi|743821741|ref|XP_011021496.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14820-like [Populus euphratica])

HSP 1 Score: 875.5 bits (2261), Expect = 6.6e-251
Identity = 459/775 (59.23%), Positives = 552/775 (71.23%), Query Frame = 1

Query: 8   TSVLPLQLHTYPTRPTALSAALSSA---SSLFHLKQVHAQILRSKLERCDSNSLLFELIL 67
           ++V  L     P   TAL AALSS    +SL HLKQ+HA +LRS L      SLL +L+L
Sbjct: 17  STVTTLPTIPVPLTSTALQAALSSTPTPTSLPHLKQIHAHVLRSNLPP----SLLLKLLL 76

Query: 68  SSCALSPSLDYALSVFDQIPQPKTRLCNKLLRQLSRGSEPEFTLFVYEKMRA-EGL-SLD 127
           SS     SLDYALSVF  +P+  T L NKL R LSR  +PE  L  YEK+R  EGL  +D
Sbjct: 77  SSS----SLDYALSVFTHLPKCHTPLSNKLFRSLSRSDKPETALLAYEKIRLKEGLLGID 136

Query: 128 RYCFPPLLKAASRNLSLRTGMEIHGLASKLGFGSDPFVETGLVRMYAACGRIMEARLVFD 187
           R+ FPPLLKAASR   L  G EIHG+A+KLGF                            
Sbjct: 137 RFSFPPLLKAASRASGLNEGKEIHGVATKLGF---------------------------- 196

Query: 188 KMSHRDVVAWSIMIDGYEANVVALHLLVKVNVYLSILSFSSLNAWLFRRQITAELIFAPY 247
               +D    + ++  Y     A   + +  +    +S+  + AW               
Sbjct: 197 ---DKDPFVQTGLVGMY----AACDRISEARLVFDRMSYRDVVAWSI------------- 256

Query: 248 PQIYRYCLSGFYDLAFQLFEEMKRTELEPDEMILSTVLSACARAGNLDFGTKIHEFITKK 307
             I  Y  SG YD   QLFEEM+ + L+PDEM+L+T++SAC RA NL +G  IH+FI + 
Sbjct: 257 -MIDGYHQSGLYDDVLQLFEEMRSSNLKPDEMVLTTIISACGRARNLSYGEAIHDFIIEN 316

Query: 308 NIVMDPHLQSALITMYASCGSMDLAWDFYEKISPKNMVVSTAMVSGLAKGGQIGEARYVF 367
           N V+D +LQSAL+TMYASCG M++A   +  IS +N+VV TAM+SG ++ G++ +AR +F
Sbjct: 317 NFVLDTYLQSALLTMYASCGCMEMAQKLFTNISSRNLVVLTAMISGFSRVGRVEDARLIF 376

Query: 368 DQMVEKDLICWSAMISGYTESDCPQEALVLFKKMQQQGMKPDVVTILSVISACAHLGALD 427
           DQM EKDL+CWSAMISGY ESD PQEAL LF +MQ  G+KPD VTILSVISACA LG LD
Sbjct: 377 DQMEEKDLVCWSAMISGYAESDKPQEALNLFSEMQVFGIKPDQVTILSVISACARLGVLD 436

Query: 428 QGKWIQTYVDKNGFGKALSINNALIDMYAKCGSLEGARKVFGKMPKKNVISWTSMIHALA 487
           + KWI  YVDKNG G AL +NNALIDMYAKCG+L  AR VF KM  +NVISWTSMI+A A
Sbjct: 437 RAKWIHMYVDKNGLGGALPVNNALIDMYAKCGNLGAARGVFEKMQSRNVISWTSMINAFA 496

Query: 488 MHGDAPNALSLFHQMKVENVEPNWITFVGVLYACSHGGLVEEGRRIFHSMTDEYGISPKH 547
           +HGDA NAL  F QMK EN++PN +TFVGVLYACSH GLVEEGRR F SMT+E+ I+PKH
Sbjct: 497 IHGDASNALKFFCQMKDENIKPNGVTFVGVLYACSHAGLVEEGRRAFASMTNEHNITPKH 556

Query: 548 EHFGCMVDLFGRANLLREALEVIEAMPFAPNAIIWGSLMAACQIHGETELGEFAAKQVLK 607
           EH+GCMVDLFGRANLLR+ALE++E MP APN +IWGSLMAACQIHGE ELGEFAAKQVL+
Sbjct: 557 EHYGCMVDLFGRANLLRDALELVETMPLAPNVVIWGSLMAACQIHGENELGEFAAKQVLE 616

Query: 608 LEPDHDGALVVLSNIYAKERRWEDVGEVRKLMTKMGVSKERGCSRIELNNEVHEFQMADR 667
           LEPDHDGALV LSNIYAK+RRW+DVGE+R LM + G+SKERGCS IELNN+VHEF MAD+
Sbjct: 617 LEPDHDGALVQLSNIYAKDRRWQDVGELRNLMKQRGISKERGCSWIELNNQVHEFVMADK 676

Query: 668 NHKQADQIHQKLDEVVQKLNLAGYTPQTNYVLVDLDEEEKKELVLWHSEKLALCYALMNE 727
            HKQAD+I++KLDEVV++L L GYTP TN VLVD++EE KKE+VLWHSEKLALCY LM E
Sbjct: 677 KHKQADKIYEKLDEVVKELKLVGYTPNTNSVLVDVEEEGKKEVVLWHSEKLALCYGLMGE 734

Query: 728 GPRIC--IIKNLRICEDCHAFMKLASKVYAREIIIRDRSRFHHYRDGLCSCKDYW 776
               C  I+KNLR+CEDCH F+KL SKVY  EII+RDR+RFHHY+ G+CSC DYW
Sbjct: 737 AKGSCIRIVKNLRVCEDCHTFIKLVSKVYGMEIIVRDRTRFHHYKAGVCSCNDYW 734

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PP311_ARATH  1.3e-209  50.52     Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana GN...
PPR21_ARATH  5.2e-155  38.99     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
PP175_ARATH  7.8e-135  43.94     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP168_ARATH  1.4e-131  36.60     Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana GN...
PPR53_ARATH  2.4e-131  33.47     Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity  Description
D7T700_VITVI      1.3e-261  61.14     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g03630 PE=4 SV=...
B9HUV1_POPTR      1.9e-252  59.77     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s09620g PE=4 SV=2
A0A067L6S3_JATCU  5.5e-244  56.31     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_02188 PE=4 SV=1
A0A061EMY0_THECC  4.6e-243  57.27     Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_020...
I1LFU4_SOYBN      4.8e-240  57.24     Uncharacterized protein OS=Glycine max GN=GLYMA_11G006200 PE=4 SV=2

Match Name   E-value   Identity  Description
AT4G14820.1  7.1e-211  50.52     Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1  2.9e-156  38.99     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G29760.1  4.4e-136  43.94     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G22070.1  7.8e-133  36.60     pentatricopeptide (PPR) repeat-containing protein
AT1G20230.1  1.3e-132  33.47     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                         E-value   Identity  Description
gi|1009141445|ref|XP_015888199.1|  5.4e-269  62.18     PREDICTED: pentatricopeptide repeat-containing protein At4g14820 [Ziziphus jujub...
gi|225432698|ref|XP_002278762.1|   1.9e-261  61.14     PREDICTED: pentatricopeptide repeat-containing protein At4g14820 [Vitis vinifera...
gi|566189984|ref|XP_002315764.2|   2.7e-252  59.77     hypothetical protein POPTR_0010s09620g [Populus trichocarpa]
gi|743859751|ref|XP_011030676.1|   1.7e-251  59.35     PREDICTED: pentatricopeptide repeat-containing protein At4g14820-like [Populus e...
gi|743821741|ref|XP_011021496.1|   6.6e-251  59.23     PREDICTED: pentatricopeptide repeat-containing protein At4g14820-like [Populus e...

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding

This mRNA is a part of the following gene feature(s):

Feature Name   Unique Name    Type
ClCG05G005100  ClCG05G005100  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name     Unique Name              Type
ClCG05G005100.1  ClCG05G005100.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
ClCG05G005100.1.cds2  ClCG05G005100.1.cds2  CDS
ClCG05G005100.1.cds1  ClCG05G005100.1.cds1  CDS


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 545..568  score: 0.31
    coord: 247..266  score: 0.03
    coord: 164..187  score:

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 469..516  score: 9.6E-10
    coord: 368..416  score: 1.

IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 343..370  score: 1.8E-5
    coord: 371..405  score: 2.0E-7
    coord: 245..273  score: 7.0E-4
    coord: 444..470  score: 3.9E-4
    coord: 472..506  score: 2.8E-6
    coord: 508..540  score: 0.

IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 439..469  score: 8.089
    coord: 87..121   score: 7.235
    coord: 369..403  score: 11.663
    coord: 307..337  score: 6.774
    coord: 338..368  score: 8.78
    coord: 272..306  score: 7.815
    coord: 157..191  score: 8.331
    coord: 607..641  score: 6.127
    coord: 541..571  score: 5.864
    coord: 404..438  score: 6.412
    coord: 470..504  score: 11.246
    coord: 237..271  score: 7.103
    coord: 505..540  score: 7.476
    coord: 56..86    score: 5

IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 555..626  score: 2.1E-8
    coord: 317..490  score: 2.

None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 11..197   score: 0.0
    coord: 247..648  score:

None  No IPR available  PANTHER  PTHR24015:SF581  SUBFAMILY NOT NAMED
    coord: 247..648  score: 0.0
    coord: 11..197   score: