Csa1G003520 (gene) Cucumber (Chinese Long) v2

Name: Csa1G003520
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Pentatricopeptide repeat-containing protein; contains IPR002885 (Pentatricopeptide repeat), IPR011990 (Tetratricopeptide-like helical)
Location: Chr1 : 592299 .. 594662 (-)
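The location field can be turned into a feature length with a small amount of arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state, and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(text):
    """Parse a string such as 'Chr1 : 592299 .. 594662 (-)'.

    Returns (seqid, start, end, strand). The layout is taken from this
    record; other pages may format the field differently.
    """
    match = re.search(r"(\w+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Chr1 : 592299 .. 594662 (-)")
print(seqid, strand, span_length(start, end))  # Chr1 - 2364
```

Under that assumption the gene spans 2364 bp on the minus strand of Chr1.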
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TCAAGATAAAGAGTACTATTTTTTACTCCTCCTTCGTGAAAGAATTCTTTGTCGATTGAAATGTTGGCAGCTCTGTTCCACTGAATTGAACTATCTATAGATGATATATTGCCAAATTTAGAATCTTTCGAGCTTCACTTTCATCTCAAATGGCGTCGATAGTCGGTTGCCTTCCCAATATATCTCTGACTTCCATAACCCAGTTCCCTGAAAACCCAAAATCTTTGATTCTTCAGCAATGCAAAACTCCAAAAGACCTCCAGCAAGTTCACGCTCACCTTCTCAAAACTCGCCGTCTCCTCGACCCCATCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCGACACCATAGATTATGCCCTTTCCATTTTCAACCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCTTTCAAGCGATCGCCTGATAATGCCCTTCTCTTGTTCAAGAAAATGCATGAAAAGTCAGTTCAGCATGACAAATTCACTTTCTCCTCTGTCTTAAAGGCTTGCTCTAGAATGAAAGCGCTGAGGGAAGGCGAACAGGTCCACGCGTTGATTCTGAAATCTGGGTTCAAATCAAATGAGTTTGTCGAGAATACTTTGATTCAGATGTATGCGAATTGTGGACAAATTGGGGTTGCACGTCATGTGTTTGATGGAATGCCGGAAAGAAGCATAGTTGCGTGGAATTCGATGTTGTCTGGTTATACGAAAAATGGGCTTTGGGATGAGGTCGTGAAGCTTTTTCGAAAAATTTTGGAACTGCGTATTGAATTTGATGATGTTACAATGATTAGTGTATTGATGGCTTGTGGAAGATTAGCGAATCTGGAAATAGGGGAGTTGATTGGTGAGTATATTGTGTCAAAAGGGCTAAGACGAAACAATACTCTAACGACTTCGCTGATTGATATGTATGCCAAATGTGGTCAAGTTGATACCGCTAGAAAGTTGTTCGATGAAATGGATAAAAGAGATGTTGTTGCTTGGAGTGCAATGATCTCGGGGTATGCTCAAGCTGATCGATGTAAAGAAGCTCTTAATCTGTTCCATGAGATGCAGAAGGGAAATGTATATCCAAACGAGGTAACAATGGTCAGTGTTCTCTATTCGTGCGCTATGCTTGGAGCATACGAAACAGGTAAGTGGGTTCATTTCTACATCAAAAAGAAGAAGATGAAGCTCACGGTTACTCTTGGAACTCAGCTGATAGATTTTTATGCTAAATGTGGGTATATAGATAGATCAGTTGAAGTTTTCAAGGAAATGTCTTTCAAGAATGTGTTCACATGGACAGCATTAATTCAAGGTCTTGCCAATAATGGAGAAGGGAAAATGGCTCTGGAATTCTTTTCCTCGATGCTAGAGAATGATGTAAAGCCAAATGATGTAACTTTCATTGGCGTTCTGTCTGCTTGTAGCCACGCTTGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATACTTGGACGTGCTGGGTTTCTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCTTCCCTCCCAATGCTGTTGTTTGGAGAACACTATTGGCTTCATGTAGAGCTCATAAAAACATTGAAATGGCAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCACAGTGGAGATTACATTCTTCTGTCAAATACTTATGCATTGGTTGGTAGGGTTGAGGATGCAATCAGGGTAAGATCTTTGATAAAAGAGAAGGAGATTAAGAAGATTCCAGGTTGTAGTTTGATTGAGCTCGATGGTGTTGTACATGAGTTTTTTTCAGAAGATGGAGAACATAAGCACTCCAAGGAAATACATGACGCGTTAGATAAAATGATGAAGCAGATCAAGAGGCTCGGATATGTGCCCAACACAGACGATGCTAGACTGGAGGCTGAGGAA
GAGAGCAAAGAAACTTCAGTGTCGCATCATAGTGAGAAGCTTGCTATTGCTTATGGTCTGATCCGAACGTCTCCTCGAACCACTATTAGAATTTCAAAAAACCTTAGGATGTGTAGGGACTGCCATAATGCAACGAAGTTTATATCACAAGTCTTTGAAAGAATGATTATTGTTAGGGATCGGAACCGTTTTCATCATTTTAAAGATGGCCTTTGCTCCTGTAATGACTATTGGTGAGTCTTTTATAGTTGATGGAACATCCATTGTTAGAGAGAAGTAGATGGAACATCGTGTTGATTGGTTACAATGCCTAAATATAGGCAACTCATTCAGTTATATATCTTCAATGGCAGTATCATTTATG

mRNA sequence

ATGGCGTCGATAGTCGGTTGCCTTCCCAATATATCTCTGACTTCCATAACCCAGTTCCCTGAAAACCCAAAATCTTTGATTCTTCAGCAATGCAAAACTCCAAAAGACCTCCAGCAAGTTCACGCTCACCTTCTCAAAACTCGCCGTCTCCTCGACCCCATCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCGACACCATAGATTATGCCCTTTCCATTTTCAACCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCTTTCAAGCGATCGCCTGATAATGCCCTTCTCTTGTTCAAGAAAATGCATGAAAAGTCAGTTCAGCATGACAAATTCACTTTCTCCTCTGTCTTAAAGGCTTGCTCTAGAATGAAAGCGCTGAGGGAAGGCGAACAGGTCCACGCGTTGATTCTGAAATCTGGGTTCAAATCAAATGAGTTTGTCGAGAATACTTTGATTCAGATGTATGCGAATTGTGGACAAATTGGGGTTGCACGTCATGTGTTTGATGGAATGCCGGAAAGAAGCATAGTTGCGTGGAATTCGATGTTGTCTGGTTATACGAAAAATGGGCTTTGGGATGAGGTCGTGAAGCTTTTTCGAAAAATTTTGGAACTGCGTATTGAATTTGATGATGTTACAATGATTAGTGTATTGATGGCTTGTGGAAGATTAGCGAATCTGGAAATAGGGGAGTTGATTGGTGAGTATATTGTGTCAAAAGGGCTAAGACGAAACAATACTCTAACGACTTCGCTGATTGATATGTATGCCAAATGTGGTCAAGTTGATACCGCTAGAAAGTTGTTCGATGAAATGGATAAAAGAGATGTTGTTGCTTGGAGTGCAATGATCTCGGGGTATGCTCAAGCTGATCGATGTAAAGAAGCTCTTAATCTGTTCCATGAGATGCAGAAGGGAAATGTATATCCAAACGAGGTAACAATGGTCAGTGTTCTCTATTCGTGCGCTATGCTTGGAGCATACGAAACAGGTAAGTGGGTTCATTTCTACATCAAAAAGAAGAAGATGAAGCTCACGGTTACTCTTGGAACTCAGCTGATAGATTTTTATGCTAAATGTGGGTATATAGATAGATCAGTTGAAGTTTTCAAGGAAATGTCTTTCAAGAATGTGTTCACATGGACAGCATTAATTCAAGGTCTTGCCAATAATGGAGAAGGGAAAATGGCTCTGGAATTCTTTTCCTCGATGCTAGAGAATGATGTAAAGCCAAATGATGTAACTTTCATTGGCGTTCTGTCTGCTTGTAGCCACGCTTGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATACTTGGACGTGCTGGGTTTCTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCTTCCCTCCCAATGCTGTTGTTTGGAGAACACTATTGGCTTCATGTAGAGCTCATAAAAACATTGAAATGGCAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCACAGTGGAGATTACATTCTTCTGTCAAATACTTATGCATTGGTTGGTAGGGTTGAGGATGCAATCAGGGTAAGATCTTTGATAAAAGAGAAGGAGATTAAGAAGATTCCAGGTTGTAGTTTGATTGAGCTCGATGGTGTTGTACATGAGTTTTTTTCAGAAGATGGAGAACATAAGCACTCCAAGGAAATACATGACGCGTTAGATAAAATGATGAAGCAGATCAAGAGGCTCGGATATGTGCCCAACACAGACGATGCTAGACTGGAGGCTGAGGAAGAGAGCAAAGAAACTTCAGTGTCGCATCATAGTGAGAAGCTTGCTATTGCTTATGGTCTGATCCGAACGTCTCCTCGAACCACTATTAGAATTTCAAAAAACCTTAGGATGTGTAGGGACTGCCATAATGCAACGAAGTTTATATCACA
AGTCTTTGAAAGAATGATTATTGTTAGGGATCGGAACCGTTTTCATCATTTTAAAGATGGCCTTTGCTCCTGTAATGACTATTGGTGA

Coding sequence (CDS)

ATGGCGTCGATAGTCGGTTGCCTTCCCAATATATCTCTGACTTCCATAACCCAGTTCCCTGAAAACCCAAAATCTTTGATTCTTCAGCAATGCAAAACTCCAAAAGACCTCCAGCAAGTTCACGCTCACCTTCTCAAAACTCGCCGTCTCCTCGACCCCATCATTACAGAAGCCGTTCTCGAGTCCGCAGCTTTACTCCTTCCCGACACCATAGATTATGCCCTTTCCATTTTCAACCATATCGACAAACCCGAATCGTCGGCTTACAATGTTATGATCAGGGGCCTTGCTTTCAAGCGATCGCCTGATAATGCCCTTCTCTTGTTCAAGAAAATGCATGAAAAGTCAGTTCAGCATGACAAATTCACTTTCTCCTCTGTCTTAAAGGCTTGCTCTAGAATGAAAGCGCTGAGGGAAGGCGAACAGGTCCACGCGTTGATTCTGAAATCTGGGTTCAAATCAAATGAGTTTGTCGAGAATACTTTGATTCAGATGTATGCGAATTGTGGACAAATTGGGGTTGCACGTCATGTGTTTGATGGAATGCCGGAAAGAAGCATAGTTGCGTGGAATTCGATGTTGTCTGGTTATACGAAAAATGGGCTTTGGGATGAGGTCGTGAAGCTTTTTCGAAAAATTTTGGAACTGCGTATTGAATTTGATGATGTTACAATGATTAGTGTATTGATGGCTTGTGGAAGATTAGCGAATCTGGAAATAGGGGAGTTGATTGGTGAGTATATTGTGTCAAAAGGGCTAAGACGAAACAATACTCTAACGACTTCGCTGATTGATATGTATGCCAAATGTGGTCAAGTTGATACCGCTAGAAAGTTGTTCGATGAAATGGATAAAAGAGATGTTGTTGCTTGGAGTGCAATGATCTCGGGGTATGCTCAAGCTGATCGATGTAAAGAAGCTCTTAATCTGTTCCATGAGATGCAGAAGGGAAATGTATATCCAAACGAGGTAACAATGGTCAGTGTTCTCTATTCGTGCGCTATGCTTGGAGCATACGAAACAGGTAAGTGGGTTCATTTCTACATCAAAAAGAAGAAGATGAAGCTCACGGTTACTCTTGGAACTCAGCTGATAGATTTTTATGCTAAATGTGGGTATATAGATAGATCAGTTGAAGTTTTCAAGGAAATGTCTTTCAAGAATGTGTTCACATGGACAGCATTAATTCAAGGTCTTGCCAATAATGGAGAAGGGAAAATGGCTCTGGAATTCTTTTCCTCGATGCTAGAGAATGATGTAAAGCCAAATGATGTAACTTTCATTGGCGTTCTGTCTGCTTGTAGCCACGCTTGTCTGGTTGATCAAGGTCGACATCTTTTCAATAGCATGAGAAGAGATTTTGATATTGAGCCAAGGATTGAGCATTATGGTTGCATGGTTGATATACTTGGACGTGCTGGGTTTCTTGAAGAAGCCTATCAGTTCATAGATAACATGCCCTTCCCTCCCAATGCTGTTGTTTGGAGAACACTATTGGCTTCATGTAGAGCTCATAAAAACATTGAAATGGCAGAAAAATCATTGGAACACATAACTCGATTGGAGCCTGCTCACAGTGGAGATTACATTCTTCTGTCAAATACTTATGCATTGGTTGGTAGGGTTGAGGATGCAATCAGGGTAAGATCTTTGATAAAAGAGAAGGAGATTAAGAAGATTCCAGGTTGTAGTTTGATTGAGCTCGATGGTGTTGTACATGAGTTTTTTTCAGAAGATGGAGAACATAAGCACTCCAAGGAAATACATGACGCGTTAGATAAAATGATGAAGCAGATCAAGAGGCTCGGATATGTGCCCAACACAGACGATGCTAGACTGGAGGCTGAGGAAGAGAGCAAAGAAACTTCAGTGTCGCATCATAGTGAGAAGCTTGCTATTGCTTATGGTCTGATCCGAACGTCTCCTCGAACCACTATTAGAATTTCAAAAAACCTTAGGATGTGTAGGGACTGCCATAATGCAACGAAGTTTATATCACA
AGTCTTTGAAAGAATGATTATTGTTAGGGATCGGAACCGTTTTCATCATTTTAAAGATGGCCTTTGCTCCTGTAATGACTATTGGTGA

Protein sequence

MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW*
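The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. As a rough sketch, the snippet below translates a DNA string with the standard genetic code (NCBI translation table 1); the assumption that this gene uses the standard nuclear code, and the names `CODON_TABLE` and `translate`, are illustrative and not taken from this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven and last four codons of the CDS listed in this record.
print(translate("ATGGCGTCGATAGTCGGTTGC"))  # MASIVGC
print(translate("GACTATTGGTGA"))           # DYW*
```

The two fragments reproduce the start (MASIVGC) and end (DYW*) of the listed protein, which is consistent with the trailing `*` representing the TGA stop codon.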
BLAST of Csa1G003520 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 529.3 bits (1362), Expect = 6.6e-149
Identity = 272/730 (37.26%), Positives = 427/730 (58.49%), Query Frame = 1
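The percentages on each HSP summary line are the matched count divided by the alignment length. The snippet below is a minimal sketch for re-deriving them from the text of such a line; the regular expression and the function name `hsp_percentages` are assumptions made for illustration, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Matches the summary line of an HSP as laid out on this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 272/730 (37.26%), Positives = 427/730 (58.49%), Query Frame = 1"
print(hsp_percentages(line))  # (37.26, 58.49)
```

Note that the denominator is the alignment length including gaps, so it can exceed the length of either sequence segment.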

Query: 8   LPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLK-----TRRLLDPIITEAVLES 67
           LP+ S         +P   +L  CKT + L+ +HA ++K     T   L  +I   +L  
Sbjct: 20  LPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSP 79

Query: 68  AALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKF 127
               LP    YA+S+F  I +P    +N M RG A    P +AL L+  M    +  + +
Sbjct: 80  HFEGLP----YAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSY 139

Query: 128 TFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGM 187
           TF  VLK+C++ KA +EG+Q+H  +LK G   + +V  +LI MY   G++  A  VFD  
Sbjct: 140 TFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKS 199

Query: 188 PERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGE 247
           P R +V++ +++ GY   G  +   KLF    E+ ++ D V+  +++       N +   
Sbjct: 200 PHRDVVSYTALIKGYASRGYIENAQKLFD---EIPVK-DVVSWNAMISGYAETGNYKEAL 259

Query: 248 LIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARK------------------------ 307
            + + ++   +R + +   +++   A+ G ++  R+                        
Sbjct: 260 ELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLY 319

Query: 308 -----------LFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTMV 367
                      LF+ +  +DV++W+ +I GY   +  KEAL LF EM +    PN+VTM+
Sbjct: 320 SKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTML 379

Query: 368 SVLYSCAMLGAYETGKWVHFYIKKKKMKLT--VTLGTQLIDFYAKCGYIDRSVEVFKEMS 427
           S+L +CA LGA + G+W+H YI K+   +T   +L T LID YAKCG I+ + +VF  + 
Sbjct: 380 SILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSIL 439

Query: 428 FKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLVDQGRH 487
            K++ +W A+I G A +G    + + FS M +  ++P+D+TF+G+LSACSH+ ++D GRH
Sbjct: 440 HKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRH 499

Query: 488 LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAH 547
           +F +M +D+ + P++EHYGCM+D+LG +G  +EA + I+ M   P+ V+W +LL +C+ H
Sbjct: 500 IFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMH 559

Query: 548 KNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSL 607
            N+E+ E   E++ ++EP + G Y+LLSN YA  GR  +  + R+L+ +K +KK+PGCS 
Sbjct: 560 GNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSS 619

Query: 608 IELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVS 667
           IE+D VVHEF   D  H  ++EI+  L++M   +++ G+VP+T +   E EEE KE ++ 
Sbjct: 620 IEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALR 679

Query: 668 HHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFK 696
           HHSEKLAIA+GLI T P T + I KNLR+CR+CH ATK IS++++R II RDR RFHHF+
Sbjct: 680 HHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFR 739

BLAST of Csa1G003520 vs. Swiss-Prot
Match: PP311_ARATH (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 527.3 bits (1357), Expect = 2.5e-148
Identity = 268/708 (37.85%), Positives = 421/708 (59.46%), Query Frame = 1

Query: 28  LQQCKTPKDLQQVHAHLLKT--RRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKP- 87
           L  CK+   ++Q+HAH+L+T     L+  +    + S+++     + YAL++F+ I  P 
Sbjct: 19  LSFCKSLNHIKQLHAHILRTVINHKLNSFLFNLSVSSSSI----NLSYALNVFSSIPSPP 78

Query: 88  ESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVH 147
           ES  +N  +R L+    P   +L ++++     + D+F+F  +LKA S++ AL EG ++H
Sbjct: 79  ESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGMELH 138

Query: 148 ALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWD 207
            +  K     + FVE   + MYA+CG+I  AR+VFD M  R +V WN+M+  Y + GL D
Sbjct: 139 GVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFGLVD 198

Query: 208 EVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLI 267
           E  KLF ++ +  +  D++ + +++ ACGR  N+     I E+++   +R +  L T+L+
Sbjct: 199 EAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLTALV 258

Query: 268 DMYA-------------------------------KCGQVDTARKLFDEMDKRDVVAWSA 327
            MYA                               KCG++D A+ +FD+ +K+D+V W+ 
Sbjct: 259 TMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVCWTT 318

Query: 328 MISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKK 387
           MIS Y ++D  +EAL +F EM    + P+ V+M SV+ +CA LG  +  KWVH  I    
Sbjct: 319 MISAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWVHSCIHVNG 378

Query: 388 MKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS 447
           ++  +++   LI+ YAKCG +D + +VF++M  +NV +W+++I  L+ +GE   AL  F+
Sbjct: 379 LESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDALSLFA 438

Query: 448 SMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRA 507
            M + +V+PN+VTF+GVL  CSH+ LV++G+ +F SM  +++I P++EHYGCMVD+ GRA
Sbjct: 439 RMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDLFGRA 498

Query: 508 GFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLS 567
             L EA + I++MP   N V+W +L+++CR H  +E+ + + + I  LEP H G  +L+S
Sbjct: 499 NLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGALVLMS 558

Query: 568 NTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD 627
           N YA   R ED   +R +++EK + K  G S I+ +G  HEF   D  HK S EI+  LD
Sbjct: 559 NIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIYAKLD 618

Query: 628 KMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRT------TIR 687
           +++ ++K  GYVP+     ++ EEE K+  V  HSEKLA+ +GL+             IR
Sbjct: 619 EVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEEEKDSCGVIR 678

Query: 688 ISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           I KNLR+C DCH   K +S+V+ER IIVRDR RFH +K+GLCSC DYW
Sbjct: 679 IVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of Csa1G003520 vs. Swiss-Prot
Match: PP219_ARATH (Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis thaliana GN=PCMP-H84 PE=3 SV=1)

HSP 1 Score: 520.8 bits (1340), Expect = 2.3e-146
Identity = 264/685 (38.54%), Positives = 407/685 (59.42%), Query Frame = 1

Query: 11  ISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDT 70
           +++ S T   +  K+LI   C T   L+Q+H  L+      D  +   +L+         
Sbjct: 4   VTVPSATSKVQQIKTLISVAC-TVNHLKQIHVSLINHHLHHDTFLVNLLLKRTLFFRQTK 63

Query: 71  IDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKA 130
             Y L  F+H   P    YN +I G          L LF  + +  +    FTF  VLKA
Sbjct: 64  YSYLL--FSHTQFPNIFLYNSLINGFVNNHLFHETLDLFLSIRKHGLYLHGFTFPLVLKA 123

Query: 131 CSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAW 190
           C+R  + + G  +H+L++K GF  +     +L+ +Y+  G++  A  +FD +P+RS+V W
Sbjct: 124 CTRASSRKLGIDLHSLVVKCGFNHDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSVVTW 183

Query: 191 NSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVS 250
            ++ SGYT +G   E + LF+K++E+ ++ D   ++ VL AC  + +L+ GE I +Y+  
Sbjct: 184 TALFSGYTTSGRHREAIDLFKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKYMEE 243

Query: 251 KGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNL 310
             +++N+ + T+L+++YAKCG+++ AR +FD M ++D+V WS MI GYA     KE + L
Sbjct: 244 MEMQKNSFVRTTLVNLYAKCGKMEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEGIEL 303

Query: 311 FHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAK 370
           F +M + N+ P++ ++V  L SCA LGA + G+W    I + +    + +   LID YAK
Sbjct: 304 FLQMLQENLKPDQFSIVGFLSSCASLGALDLGEWGISLIDRHEFLTNLFMANALIDMYAK 363

Query: 371 CGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGV 430
           CG + R  EVFKEM  K++    A I GLA NG  K++   F    +  + P+  TF+G+
Sbjct: 364 CGAMARGFEVFKEMKEKDIVIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTFLGL 423

Query: 431 LSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPP 490
           L  C HA L+  G   FN++   + ++  +EHYGCMVD+ GRAG L++AY+ I +MP  P
Sbjct: 424 LCGCVHAGLIQDGLRFFNAISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMPMRP 483

Query: 491 NAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRS 550
           NA+VW  LL+ CR  K+ ++AE  L+ +  LEP ++G+Y+ LSN Y++ GR ++A  VR 
Sbjct: 484 NAIVWGALLSGCRLVKDTQLAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAEVRD 543

Query: 551 LIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDD 610
           ++ +K +KKIPG S IEL+G VHEF ++D  H  S +I+  L+ +  +++ +G+VP T+ 
Sbjct: 544 MMNKKGMKKIPGYSWIELEGKVHEFLADDKSHPLSDKIYAKLEDLGNEMRLMGFVPTTEF 603

Query: 611 ARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFE 670
              + EEE KE  + +HSEKLA+A GLI T     IR+ KNLR+C DCH   K IS++  
Sbjct: 604 VFFDVEEEEKERVLGYHSEKLAVALGLISTDHGQVIRVVKNLRVCGDCHEVMKLISKITR 663

Query: 671 RMIIVRDRNRFHHFKDGLCSCNDYW 696
           R I+VRD NRFH F +G CSCNDYW
Sbjct: 664 REIVVRDNNRFHCFTNGSCSCNDYW 685

BLAST of Csa1G003520 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 510.4 bits (1313), Expect = 3.2e-143
Identity = 268/704 (38.07%), Positives = 417/704 (59.23%), Query Frame = 1

Query: 27  ILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPES 86
           ++++C + + L+Q H H+++T    DP     +   AAL    +++YA  +F+ I KP S
Sbjct: 36  LIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNS 95

Query: 87  SAYNVMIRGLAFKRSPDNALLLFKKM-HEKSVQHDKFTFSSVLKACSRMKALREGEQVHA 146
            A+N +IR  A    P  ++  F  M  E     +K+TF  ++KA + + +L  G+ +H 
Sbjct: 96  FAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHG 155

Query: 147 LILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDE 206
           + +KS   S+ FV N+LI  Y +CG +  A  VF  + E+ +V+WNSM++G+ + G  D+
Sbjct: 156 MAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDK 215

Query: 207 ----------------------VVKLFRKILELRI-----EFDDVTMISVLMACGRLANL 266
                                 V+    KI  L        + +   ++V +     A L
Sbjct: 216 ALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLAN-AML 275

Query: 267 EIGELIGEYIVSKGL-----RRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSA 326
           ++    G    +K L      ++N   T+++D YA     + AR++ + M ++D+VAW+A
Sbjct: 276 DMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNA 335

Query: 327 MISGYAQADRCKEALNLFHEMQ-KGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKK 386
           +IS Y Q  +  EAL +FHE+Q + N+  N++T+VS L +CA +GA E G+W+H YIKK 
Sbjct: 336 LISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKH 395

Query: 387 KMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFF 446
            +++   + + LI  Y+KCG +++S EVF  +  ++VF W+A+I GLA +G G  A++ F
Sbjct: 396 GIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMF 455

Query: 447 SSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGR 506
             M E +VKPN VTF  V  ACSH  LVD+   LF+ M  ++ I P  +HY C+VD+LGR
Sbjct: 456 YKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGR 515

Query: 507 AGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILL 566
           +G+LE+A +FI+ MP PP+  VW  LL +C+ H N+ +AE +   +  LEP + G ++LL
Sbjct: 516 SGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLL 575

Query: 567 SNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDAL 626
           SN YA +G+ E+   +R  ++   +KK PGCS IE+DG++HEF S D  H  S++++  L
Sbjct: 576 SNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKL 635

Query: 627 DKMMKQIKRLGYVPNTDDA-RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKN 686
            ++M+++K  GY P      ++  EEE KE S++ HSEKLAI YGLI T     IR+ KN
Sbjct: 636 HEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKN 695

Query: 687 LRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           LR+C DCH+  K ISQ+++R IIVRDR RFHHF++G CSCND+W
Sbjct: 696 LRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of Csa1G003520 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 504.2 bits (1297), Expect = 2.3e-141
Identity = 259/625 (41.44%), Positives = 381/625 (60.96%), Query Frame = 1

Query: 71  IDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKA 130
           ++ A  +F+ + + +  ++N ++ G +       AL + K M E++++    T  SVL A
Sbjct: 186 VNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPA 245

Query: 131 CSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAW 190
            S ++ +  G+++H   ++SGF S   +   L+ MYA CG +  AR +FDGM ER++V+W
Sbjct: 246 VSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSW 305

Query: 191 NSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVS 250
           NSM+  Y +N    E + +F+K+L+  ++  DV+++  L AC  L +LE G  I +  V 
Sbjct: 306 NSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVE 365

Query: 251 KGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNL 310
            GL RN ++  SLI MY KC +VDTA  +F ++  R +V+W+AMI G+AQ  R  +ALN 
Sbjct: 366 LGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNY 425

Query: 311 FHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAK 370
           F +M+   V P+  T VSV+ + A L      KW+H  + +  +   V + T L+D YAK
Sbjct: 426 FSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAK 485

Query: 371 CGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGV 430
           CG I  +  +F  MS ++V TW A+I G   +G GK ALE F  M +  +KPN VTF+ V
Sbjct: 486 CGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSV 545

Query: 431 LSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPP 490
           +SACSH+ LV+ G   F  M+ ++ IE  ++HYG MVD+LGRAG L EA+ FI  MP  P
Sbjct: 546 ISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKP 605

Query: 491 NAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRS 550
              V+  +L +C+ HKN+  AEK+ E +  L P   G ++LL+N Y      E   +VR 
Sbjct: 606 AVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRV 665

Query: 551 LIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDD 610
            +  + ++K PGCS++E+   VH FFS    H  SK+I+  L+K++  IK  GYVP+T +
Sbjct: 666 SMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDT-N 725

Query: 611 ARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFE 670
             L  E + KE  +S HSEKLAI++GL+ T+  TTI + KNLR+C DCHNATK+IS V  
Sbjct: 726 LVLGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTG 785

Query: 671 RMIIVRDRNRFHHFKDGLCSCNDYW 696
           R I+VRD  RFHHFK+G CSC DYW
Sbjct: 786 REIVVRDMQRFHHFKNGACSCGDYW 809


HSP 2 Score: 267.7 bits (683), Expect = 3.6e-70
Identity = 146/485 (30.10%), Positives = 261/485 (53.81%), Query Frame = 1

Query: 21  ENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNH 80
           E+P +L+L++C + K+L+Q+   + K     +      ++  +      ++D A  +F  
Sbjct: 37  EHPAALLLERCSSLKELRQILPLVFKNGLYQEHFFQTKLV--SLFCRYGSVDEAARVFEP 96

Query: 81  IDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREG 140
           ID   +  Y+ M++G A     D AL  F +M    V+   + F+ +LK C     LR G
Sbjct: 97  IDSKLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVG 156

Query: 141 EQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKN 200
           +++H L++KSGF  + F    L  MYA C Q+  AR VFD MPER +V+WN++++GY++N
Sbjct: 157 KEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQN 216

Query: 201 GLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLT 260
           G+    +++ + + E  ++   +T++SVL A   L  + +G+ I  Y +  G      ++
Sbjct: 217 GMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS 276

Query: 261 TSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVY 320
           T+L+DMYAKCG ++TAR+LFD M +R+VV+W++MI  Y Q +  KEA+ +F +M    V 
Sbjct: 277 TALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVK 336

Query: 321 PNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEV 380
           P +V+++  L++CA LG  E G+++H    +  +   V++   LI  Y KC  +D +  +
Sbjct: 337 PTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASM 396

Query: 381 FKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLV 440
           F ++  + + +W A+I G A NG    AL +FS M    VKP+  T++ V++A +   + 
Sbjct: 397 FGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSIT 456

Query: 441 DQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLA 500
              + +   + R   ++  +     +VD+  + G +  A + I +M    +   W  ++ 
Sbjct: 457 HHAKWIHGVVMRSC-LDKNVFVTTALVDMYAKCGAIMIA-RLIFDMMSERHVTTWNAMID 516

Query: 501 SCRAH 506
               H
Sbjct: 517 GYGTH 517

BLAST of Csa1G003520 vs. TrEMBL
Match: F6GTR8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g09300 PE=4 SV=1)

HSP 1 Score: 1007.7 bits (2604), Expect = 7.0e-291
Identity = 479/682 (70.23%), Positives = 579/682 (84.90%), Query Frame = 1

Query: 14  TSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDY 73
           TSI+ FPENPK+LIL+QCKT +DL ++HAHL+KTR LL P + E +LESAA+LLP ++DY
Sbjct: 17  TSISLFPENPKTLILEQCKTIRDLNEIHAHLIKTRLLLKPKVAENLLESAAILLPTSMDY 76

Query: 74  ALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSR 133
           A+SIF  ID+P+S AYN+MIRG   K+SP  A+LLFK+MHE SVQ D+FTF  +LK CSR
Sbjct: 77  AVSIFRQIDEPDSPAYNIMIRGFTLKQSPHEAILLFKEMHENSVQPDEFTFPCILKVCSR 136

Query: 134 MKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSM 193
           ++AL EGEQ+HALI+K GF S+ FV+NTLI MYANCG++ VAR VFD M ER++  WNSM
Sbjct: 137 LQALSEGEQIHALIMKCGFGSHGFVKNTLIHMYANCGEVEVARRVFDEMSERNVRTWNSM 196

Query: 194 LSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGL 253
            +GYTK+G W+EVVKLF ++LEL I FD+VT++SVL ACGRLA+LE+GE I  Y+  KGL
Sbjct: 197 FAGYTKSGNWEEVVKLFHEMLELDIRFDEVTLVSVLTACGRLADLELGEWINRYVEEKGL 256

Query: 254 RRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHE 313
           + N TL TSL+DMYAKCGQVDTAR+LFD+MD+RDVVAWSAMISGY+QA RC+EAL+LFHE
Sbjct: 257 KGNPTLITSLVDMYAKCGQVDTARRLFDQMDRRDVVAWSAMISGYSQASRCREALDLFHE 316

Query: 314 MQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGY 373
           MQK N+ PNE+TMVS+L SCA+LGA ETGKWVHF+IKKK+MKLTVTLGT L+DFYAKCG 
Sbjct: 317 MQKANIDPNEITMVSILSSCAVLGALETGKWVHFFIKKKRMKLTVTLGTALMDFYAKCGS 376

Query: 374 IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSA 433
           ++ S+EVF +M  KNV +WT LIQGLA+NG+GK ALE+F  MLE +V+PNDVTFIGVLSA
Sbjct: 377 VESSIEVFGKMPVKNVLSWTVLIQGLASNGQGKKALEYFYLMLEKNVEPNDVTFIGVLSA 436

Query: 434 CSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAV 493
           CSHA LVD+GR LF SM RDF IEPRIEHYGCMVDILGRAG +EEA+QFI NMP  PNAV
Sbjct: 437 CSHAGLVDEGRDLFVSMSRDFGIEPRIEHYGCMVDILGRAGLIEEAFQFIKNMPIQPNAV 496

Query: 494 VWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIK 553
           +WRTLLASC+ HKN+E+ E+SL+ +  LEP HSGDYILLSN YA VGR EDA++VR  +K
Sbjct: 497 IWRTLLASCKVHKNVEIGEESLKQLIILEPTHSGDYILLSNIYASVGRWEDALKVRGEMK 556

Query: 554 EKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARL 613
           EK IKK PGCSLIELDGV+HEFF+ED  H  S+EI++A++ MMKQIK  GYVPNT +ARL
Sbjct: 557 EKGIKKTPGCSLIELDGVIHEFFAEDNVHSQSEEIYNAIEDMMKQIKSAGYVPNTAEARL 616

Query: 614 EAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMI 673
           +AEE+ KE+SVSHHSEKLAIA+GLI++ P TTIRI+KNLR+C DCHNATK +S+VF R I
Sbjct: 617 DAEEDDKESSVSHHSEKLAIAFGLIKSPPGTTIRITKNLRVCTDCHNATKLVSKVFNREI 676

Query: 674 IVRDRNRFHHFKDGLCSCNDYW 696
           +VRDR RFHHFK+G CSCNDYW
Sbjct: 677 VVRDRTRFHHFKEGSCSCNDYW 698

BLAST of Csa1G003520 vs. TrEMBL
Match: M5W9L5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa024573mg PE=4 SV=1)

HSP 1 Score: 994.2 bits (2569), Expect = 8.0e-287
Identity = 483/687 (70.31%), Positives = 575/687 (83.70%), Query Frame = 1

Query: 9   PNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLP 68
           P  ++T+I QFP NPK+LILQQCKT +DL QVHAHL+KTR LL+P ITE +LESAA+LLP
Sbjct: 13  PLTAITTIPQFPHNPKTLILQQCKTTRDLNQVHAHLIKTRLLLNPTITENLLESAAILLP 72

Query: 69  DTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVL 128
           + +DYALSIF+++D+P++  YN+MIR L +K SP  A LLFKKM E S + D+FT SS+L
Sbjct: 73  NAMDYALSIFHNLDEPDTLVYNIMIRSLTYKLSPLEAFLLFKKMQESSAEPDEFTLSSIL 132

Query: 129 KACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIV 188
           KACS+++ALREGEQ+HA I+K GFKSN FVENTLI MYA CG++ VAR VFDG+PER+ +
Sbjct: 133 KACSKLRALREGEQIHAHIVKCGFKSNGFVENTLIHMYATCGELEVARRVFDGLPERARM 192

Query: 189 AWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYI 248
           AWNSML+GY KN  WDEVVKLF ++L+L + FD+VT+ SVL ACGRLANLE+GE IG+YI
Sbjct: 193 AWNSMLAGYMKNKCWDEVVKLFHEMLKLGVGFDEVTLTSVLTACGRLANLELGEWIGDYI 252

Query: 249 VSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEAL 308
            +  L+ N  L TSL+DMYAKCGQV+TAR+ FD MD+RDVVAWSAMISGY+QA+RC+EAL
Sbjct: 253 EANRLKGNIALVTSLVDMYAKCGQVETARRFFDRMDRRDVVAWSAMISGYSQANRCREAL 312

Query: 309 NLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFY 368
           +LFH+MQK NV PNEVTMVSVLYSCA+LGA +TGKWV FYIKK+K+KLTV LGT LIDFY
Sbjct: 313 DLFHDMQKANVDPNEVTMVSVLYSCAVLGALKTGKWVEFYIKKEKLKLTVNLGTALIDFY 372

Query: 369 AKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFI 428
           AKCG ID S+EVF  M   NVF+WTALIQGLA+NG+GK ALE+F  M E ++KPN+VTFI
Sbjct: 373 AKCGCIDSSIEVFNRMPSTNVFSWTALIQGLASNGQGKGALEYFQLMQEKNIKPNNVTFI 432

Query: 429 GVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPF 488
            VLSACSHA LV++GR+LF SM +DF IEPRIEHYG MVDILGRAG +EEAYQFI NMP 
Sbjct: 433 AVLSACSHAGLVNEGRNLFTSMIKDFGIEPRIEHYGSMVDILGRAGLIEEAYQFIKNMPI 492

Query: 489 PPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRV 548
            PNAVVWRTLLASCRAHKN+E+ E+SL+HI  LE  HSGDYILLSN YA V R EDAIRV
Sbjct: 493 QPNAVVWRTLLASCRAHKNVEIGEESLKHIISLETPHSGDYILLSNIYASVDRREDAIRV 552

Query: 549 RSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNT 608
           R  ++EK I+K PGCSLIELDGV++EFF+ED    H +E+++A   MMK+IK  GYVP T
Sbjct: 553 RDQMREKGIEKAPGCSLIELDGVIYEFFAEDKACPHLEEVYNATHDMMKRIKEAGYVPYT 612

Query: 609 DDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQV 668
            DARL+AEE+ KE SVSHHSEKLAIA+GLIRT P TT+RISKNLR+C DCHNATK IS+V
Sbjct: 613 TDARLDAEEDEKEASVSHHSEKLAIAFGLIRTLPGTTLRISKNLRVCTDCHNATKMISKV 672

Query: 669 FERMIIVRDRNRFHHFKDGLCSCNDYW 696
           F R I+VRD NRFHHFK+G CSCNDYW
Sbjct: 673 FNRQIVVRDWNRFHHFKEGSCSCNDYW 699

BLAST of Csa1G003520 vs. TrEMBL
Match: W9RUI0_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_002061 PE=4 SV=1)

HSP 1 Score: 979.2 bits (2530), Expect = 2.7e-282
Identity = 479/688 (69.62%), Positives = 576/688 (83.72%), Query Frame = 1

Query: 12  SLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTI 71
           ++T+I++FP+NPK+LILQQCKT KDL Q+HAHLLKT  L  P I E VLESAA+LLPD +
Sbjct: 18  AITTISEFPQNPKTLILQQCKTTKDLNQIHAHLLKTSLLHSPAIAENVLESAAILLPDAM 77

Query: 72  DYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKAC 131
           DYALSIF  ID+P+SSAYNVMIRGL +K+S   A+LLFK M E SVQ D+FTF SVLKAC
Sbjct: 78  DYALSIFRRIDRPDSSAYNVMIRGLIYKKSNHEAVLLFKNMLENSVQRDEFTFPSVLKAC 137

Query: 132 SRMKALREGEQVHALILK-SGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAW 191
           SR+ AL EGEQ+HA I+K SG KSN FV+NTLI MYA+CG+I +AR+VFD MP R ++ W
Sbjct: 138 SRLGALSEGEQIHAQIVKYSGLKSNAFVQNTLIHMYASCGEIEIARNVFDKMPRRHVMTW 197

Query: 192 NSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVS 251
           NS+L+GY KN  WDEVV+LFR++ E   EFD++T+ISVL ACGR  +LE+GE IGEY+ +
Sbjct: 198 NSILTGYVKNERWDEVVRLFREMRESSFEFDEITLISVLTACGRAGDLELGEWIGEYVEA 257

Query: 252 KGLRRNN-TLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALN 311
             L ++   L TSLIDMY KCGQVDTAR+LFD++D+RDVVAWSAMISGY+  DR +EAL+
Sbjct: 258 NELMKSKLALITSLIDMYGKCGQVDTARRLFDQIDRRDVVAWSAMISGYSHGDRGREALD 317

Query: 312 LFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYA 371
           LF EMQ+ NV PNEVTMVSVLYSCA+LGA+ETGKWV FYI+K KMKLTV LGT LIDFYA
Sbjct: 318 LFKEMQEANVEPNEVTMVSVLYSCAVLGAFETGKWVRFYIEKNKMKLTVILGTALIDFYA 377

Query: 372 KCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIG 431
           KCG I+ S+EVF +M ++NVF+WTALIQGLA+NG+GK AL++F  M E +V PNDVTFIG
Sbjct: 378 KCGSIEGSIEVFDKMPYRNVFSWTALIQGLASNGQGKKALKYFKQMQEKNVDPNDVTFIG 437

Query: 432 VLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFP 491
           VLSACSHA LV++GR LF SM  D+ IEPRIEHYGCMVDILGR+G ++EAY+FI NMP  
Sbjct: 438 VLSACSHAGLVEEGRKLFISMSNDYGIEPRIEHYGCMVDILGRSGLIQEAYEFIKNMPIR 497

Query: 492 PNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVR 551
           PNAVVWRTLLASC+AHKN+++ E+SL++I RLEPAHSGDYILLSN YA VGR +DA+RVR
Sbjct: 498 PNAVVWRTLLASCKAHKNVKIGEESLKNIIRLEPAHSGDYILLSNLYASVGRRDDAMRVR 557

Query: 552 SLIKEKEIKK-IPGCSLIELDGVVHEFFSEDGE-HKHSKEIHDALDKMMKQIKRLGYVPN 611
           + +KEK   K  PGCSLIELD V++EFF+ED   H HSKE+++A + MM+QIK  GYVPN
Sbjct: 558 NQMKEKRTNKTAPGCSLIELDAVIYEFFAEDNNGHPHSKEVYNATEDMMRQIKSAGYVPN 617

Query: 612 TDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQ 671
           T DARL+AEEE KE SVSHHSEKLAIA+GLIRTSP TTIR+SKNLR+C DCHNA K IS+
Sbjct: 618 TADARLDAEEEDKEASVSHHSEKLAIAFGLIRTSPVTTIRVSKNLRVCTDCHNAAKLISK 677

Query: 672 VFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           VF+R I++RDRNRFHHFK+G CSCNDYW
Sbjct: 678 VFKREIVLRDRNRFHHFKEGSCSCNDYW 705

BLAST of Csa1G003520 vs. TrEMBL
Match: A0A067FPY8_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g005476mg PE=4 SV=1)

HSP 1 Score: 933.7 bits (2412), Expect = 1.3e-268
Identity = 446/688 (64.83%), Positives = 558/688 (81.10%), Query Frame = 1

Query: 9   PNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLP 68
           P  ++T+ITQFPENPK+LI+QQCKT KDL QVHAHL+K+R  L+P I+E +LE+AA+L+P
Sbjct: 8   PAKTVTTITQFPENPKTLIVQQCKTTKDLNQVHAHLIKSRFHLNPTISENLLEAAAILIP 67

Query: 69  -DTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSV 128
             T+DYALSIF+ I++P+SSAYN+MIR    K+SP  A++L+K M + SV+ D+FTF+  
Sbjct: 68  ATTMDYALSIFHKINEPDSSAYNIMIRAFTLKQSPQEAVMLYKTMLQNSVEPDRFTFACT 127

Query: 129 LKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSI 188
           LKACSR++AL EGEQ+HA ILKSGF   + V NTLI +YANCG+I +AR +FD M  R +
Sbjct: 128 LKACSRIRALEEGEQIHAQILKSGFGCRQLVTNTLIHLYANCGRIDIARKMFDRMSNRDV 187

Query: 189 VAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEY 248
            +WNSM SGY K   W E+V LF ++ +L ++FD+VT+I+VLMACGRLA++E+G  I EY
Sbjct: 188 FSWNSMFSGYVKTECWREIVDLFNEMRDLGVKFDEVTLINVLMACGRLADIELGGWISEY 247

Query: 249 IVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEA 308
           +  K L  N  L T+++DMYAKCG VD AR+LF++M+ +DVVAWSAMISGY+QA RCKEA
Sbjct: 248 MEEKELNGNVKLMTAVVDMYAKCGHVDKARRLFEQMNIKDVVAWSAMISGYSQARRCKEA 307

Query: 309 LNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDF 368
           L +FH+MQ  NV PNEVTMVSVL  CA+LGA ETGKWVH Y+KKK+M+LT+TLGT L+DF
Sbjct: 308 LGVFHDMQMANVVPNEVTMVSVLSCCAVLGALETGKWVHLYVKKKRMELTITLGTALMDF 367

Query: 369 YAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTF 428
           YAKCG I+ +VEVFK+M  KNVF WT LIQ LA+NG+G+ ALE +  M E +++PNDV F
Sbjct: 368 YAKCGLIENAVEVFKKMPLKNVFFWTVLIQCLASNGQGERALETYYIMREKNIEPNDVAF 427

Query: 429 IGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMP 488
           I VLSACSH  +VD+GR LF SM RDFD+EPR+EHYGCMVDILGRAG +EEAYQFI NMP
Sbjct: 428 IAVLSACSHVGMVDEGRELFVSMSRDFDLEPRMEHYGCMVDILGRAGLVEEAYQFIKNMP 487

Query: 489 FPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIR 548
            PPN V+WRTLLA+CRAHKN+++ E+SL+++  LEP HSGDYILLS+ YA  GR EDA+R
Sbjct: 488 IPPNPVIWRTLLAACRAHKNVKVGEESLKNLVTLEPMHSGDYILLSDIYASAGRCEDALR 547

Query: 549 VRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPN 608
           V + ++E+ IKK PGCSLIELDG ++EF +ED    H KE++DA + MMK+IK  GYVPN
Sbjct: 548 VMNQMREQGIKKTPGCSLIELDGEIYEFLAEDNMCPHFKEVYDATENMMKRIKSAGYVPN 607

Query: 609 TDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQ 668
           T DARL+AEE+ KE SV+HHSEKLAIA+GLIR SP TTIRISKNLR+C DCHNATK IS+
Sbjct: 608 TADARLDAEEDDKEASVAHHSEKLAIAFGLIRASPGTTIRISKNLRVCTDCHNATKIISK 667

Query: 669 VFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           VF R I+VRDR RFHHFK+G CSCNDYW
Sbjct: 668 VFNREIVVRDRTRFHHFKEGSCSCNDYW 695

BLAST of Csa1G003520 vs. TrEMBL
Match: V4TIJ5_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10024603mg PE=4 SV=1)

HSP 1 Score: 932.2 bits (2408), Expect = 3.7e-268
Identity = 446/688 (64.83%), Positives = 557/688 (80.96%), Query Frame = 1

Query: 9   PNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLP 68
           P  ++T+ITQFPENPK+LI+QQCKT KDL QVHAHL+K+R  L+P I+E +LE+AA+L+P
Sbjct: 8   PAKTVTTITQFPENPKTLIVQQCKTTKDLNQVHAHLIKSRFHLNPTISENLLEAAAILIP 67

Query: 69  D-TIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSV 128
             T+DYALSIF+ I++P+SSAYN+MIR    K+SP  A++L+K M E S++ D+FTF+  
Sbjct: 68  AATMDYALSIFHKINEPDSSAYNIMIRAFTLKQSPQEAVMLYKTMLENSLEPDRFTFACT 127

Query: 129 LKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSI 188
           LKACSR++AL EGEQ+HA ILKSGF   + V NTLI +YA CG+I +AR +FD M  R +
Sbjct: 128 LKACSRIRALEEGEQIHAQILKSGFGCRQLVTNTLIHLYAICGRIDIARKMFDRMSNRDV 187

Query: 189 VAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEY 248
            +WNSM SGY K   W E+V LF ++ +L ++FD+VT+I+VLMACGRLA++E+G  I EY
Sbjct: 188 FSWNSMFSGYVKTECWREIVDLFNEMRDLGVKFDEVTLINVLMACGRLADIELGGWISEY 247

Query: 249 IVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEA 308
           +  K L  N  L T+++DMYAKCG VD AR+LF++M+ +DVVAWSAMISGY+QA RCKEA
Sbjct: 248 MEEKELNGNVKLMTAVVDMYAKCGHVDKARRLFEQMNIKDVVAWSAMISGYSQARRCKEA 307

Query: 309 LNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDF 368
           L +FH+MQ  NV PNEVTMVSVL  CA+LGA ETGKWVH Y+KKK+M+LT+TLGT L+DF
Sbjct: 308 LGVFHDMQMANVVPNEVTMVSVLSCCAVLGALETGKWVHLYVKKKRMELTITLGTALMDF 367

Query: 369 YAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTF 428
           YAKCG I+ +VEVFK+M  KNVF WT LIQ LA+NG+G+ ALE +  M E +++PNDVTF
Sbjct: 368 YAKCGLIENAVEVFKKMPLKNVFFWTVLIQCLASNGQGERALETYYIMREKNIEPNDVTF 427

Query: 429 IGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMP 488
           I VLSACSH  +VD+GR LF SM RDFD+EPR+EHYGCMVDILGRAG +EEAYQFI NMP
Sbjct: 428 IAVLSACSHVGMVDEGRELFVSMSRDFDLEPRMEHYGCMVDILGRAGLIEEAYQFIKNMP 487

Query: 489 FPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIR 548
            PPN V+WRTLLA+CRAHKN+E+ E+SL+++  LEP HSGDY LLS+ YA  GR EDA+R
Sbjct: 488 IPPNPVIWRTLLAACRAHKNVEVGEESLKNLVTLEPMHSGDYFLLSDIYASAGRCEDALR 547

Query: 549 VRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPN 608
           V + ++E+ IKK PGCSLIELDG ++EF +ED    H KE++DA + MMK+IK  GYVPN
Sbjct: 548 VMNQMREQGIKKTPGCSLIELDGEIYEFLAEDNMCPHFKEVYDATENMMKRIKSAGYVPN 607

Query: 609 TDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQ 668
           T DARL+AEE+ KE SV+HHSEKLAIA+GLIR SP TTIRISKNLR+C DCHNATK IS+
Sbjct: 608 TADARLDAEEDDKEASVAHHSEKLAIAFGLIRASPGTTIRISKNLRVCTDCHNATKIISK 667

Query: 669 VFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           VF R I+VRDR RFHHFK+G CSCNDYW
Sbjct: 668 VFNREIVVRDRTRFHHFKEGSCSCNDYW 695

BLAST of Csa1G003520 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 529.3 bits (1362), Expect = 3.7e-150
Identity = 272/730 (37.26%), Positives = 427/730 (58.49%), Query Frame = 1

Query: 8   LPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLK-----TRRLLDPIITEAVLES 67
           LP+ S         +P   +L  CKT + L+ +HA ++K     T   L  +I   +L  
Sbjct: 20  LPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSP 79

Query: 68  AALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKF 127
               LP    YA+S+F  I +P    +N M RG A    P +AL L+  M    +  + +
Sbjct: 80  HFEGLP----YAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSY 139

Query: 128 TFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGM 187
           TF  VLK+C++ KA +EG+Q+H  +LK G   + +V  +LI MY   G++  A  VFD  
Sbjct: 140 TFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKS 199

Query: 188 PERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGE 247
           P R +V++ +++ GY   G  +   KLF    E+ ++ D V+  +++       N +   
Sbjct: 200 PHRDVVSYTALIKGYASRGYIENAQKLFD---EIPVK-DVVSWNAMISGYAETGNYKEAL 259

Query: 248 LIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARK------------------------ 307
            + + ++   +R + +   +++   A+ G ++  R+                        
Sbjct: 260 ELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLY 319

Query: 308 -----------LFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVYPNEVTMV 367
                      LF+ +  +DV++W+ +I GY   +  KEAL LF EM +    PN+VTM+
Sbjct: 320 SKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTML 379

Query: 368 SVLYSCAMLGAYETGKWVHFYIKKKKMKLT--VTLGTQLIDFYAKCGYIDRSVEVFKEMS 427
           S+L +CA LGA + G+W+H YI K+   +T   +L T LID YAKCG I+ + +VF  + 
Sbjct: 380 SILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSIL 439

Query: 428 FKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLVDQGRH 487
            K++ +W A+I G A +G    + + FS M +  ++P+D+TF+G+LSACSH+ ++D GRH
Sbjct: 440 HKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRH 499

Query: 488 LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAH 547
           +F +M +D+ + P++EHYGCM+D+LG +G  +EA + I+ M   P+ V+W +LL +C+ H
Sbjct: 500 IFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMH 559

Query: 548 KNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSL 607
            N+E+ E   E++ ++EP + G Y+LLSN YA  GR  +  + R+L+ +K +KK+PGCS 
Sbjct: 560 GNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSS 619

Query: 608 IELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARLEAEEESKETSVS 667
           IE+D VVHEF   D  H  ++EI+  L++M   +++ G+VP+T +   E EEE KE ++ 
Sbjct: 620 IEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALR 679

Query: 668 HHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFK 696
           HHSEKLAIA+GLI T P T + I KNLR+CR+CH ATK IS++++R II RDR RFHHF+
Sbjct: 680 HHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFR 739

BLAST of Csa1G003520 vs. TAIR10
Match: AT4G14820.1 (AT4G14820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 527.3 bits (1357), Expect = 1.4e-149
Identity = 268/708 (37.85%), Positives = 421/708 (59.46%), Query Frame = 1

Query: 28  LQQCKTPKDLQQVHAHLLKT--RRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKP- 87
           L  CK+   ++Q+HAH+L+T     L+  +    + S+++     + YAL++F+ I  P 
Sbjct: 19  LSFCKSLNHIKQLHAHILRTVINHKLNSFLFNLSVSSSSI----NLSYALNVFSSIPSPP 78

Query: 88  ESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREGEQVH 147
           ES  +N  +R L+    P   +L ++++     + D+F+F  +LKA S++ AL EG ++H
Sbjct: 79  ESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSALFEGMELH 138

Query: 148 ALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWD 207
            +  K     + FVE   + MYA+CG+I  AR+VFD M  R +V WN+M+  Y + GL D
Sbjct: 139 GVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIERYCRFGLVD 198

Query: 208 EVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLTTSLI 267
           E  KLF ++ +  +  D++ + +++ ACGR  N+     I E+++   +R +  L T+L+
Sbjct: 199 EAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMDTHLLTALV 258

Query: 268 DMYA-------------------------------KCGQVDTARKLFDEMDKRDVVAWSA 327
            MYA                               KCG++D A+ +FD+ +K+D+V W+ 
Sbjct: 259 TMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEKKDLVCWTT 318

Query: 328 MISGYAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKK 387
           MIS Y ++D  +EAL +F EM    + P+ V+M SV+ +CA LG  +  KWVH  I    
Sbjct: 319 MISAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWVHSCIHVNG 378

Query: 388 MKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS 447
           ++  +++   LI+ YAKCG +D + +VF++M  +NV +W+++I  L+ +GE   AL  F+
Sbjct: 379 LESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEASDALSLFA 438

Query: 448 SMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRA 507
            M + +V+PN+VTF+GVL  CSH+ LV++G+ +F SM  +++I P++EHYGCMVD+ GRA
Sbjct: 439 RMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGCMVDLFGRA 498

Query: 508 GFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLS 567
             L EA + I++MP   N V+W +L+++CR H  +E+ + + + I  LEP H G  +L+S
Sbjct: 499 NLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDHDGALVLMS 558

Query: 568 NTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALD 627
           N YA   R ED   +R +++EK + K  G S I+ +G  HEF   D  HK S EI+  LD
Sbjct: 559 NIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQSNEIYAKLD 618

Query: 628 KMMKQIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRT------TIR 687
           +++ ++K  GYVP+     ++ EEE K+  V  HSEKLA+ +GL+             IR
Sbjct: 619 EVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEEEKDSCGVIR 678

Query: 688 ISKNLRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           I KNLR+C DCH   K +S+V+ER IIVRDR RFH +K+GLCSC DYW
Sbjct: 679 IVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of Csa1G003520 vs. TAIR10
Match: AT3G08820.1 (AT3G08820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 520.8 bits (1340), Expect = 1.3e-147
Identity = 264/685 (38.54%), Positives = 407/685 (59.42%), Query Frame = 1

Query: 11  ISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDT 70
           +++ S T   +  K+LI   C T   L+Q+H  L+      D  +   +L+         
Sbjct: 4   VTVPSATSKVQQIKTLISVAC-TVNHLKQIHVSLINHHLHHDTFLVNLLLKRTLFFRQTK 63

Query: 71  IDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKA 130
             Y L  F+H   P    YN +I G          L LF  + +  +    FTF  VLKA
Sbjct: 64  YSYLL--FSHTQFPNIFLYNSLINGFVNNHLFHETLDLFLSIRKHGLYLHGFTFPLVLKA 123

Query: 131 CSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAW 190
           C+R  + + G  +H+L++K GF  +     +L+ +Y+  G++  A  +FD +P+RS+V W
Sbjct: 124 CTRASSRKLGIDLHSLVVKCGFNHDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSVVTW 183

Query: 191 NSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVS 250
            ++ SGYT +G   E + LF+K++E+ ++ D   ++ VL AC  + +L+ GE I +Y+  
Sbjct: 184 TALFSGYTTSGRHREAIDLFKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKYMEE 243

Query: 251 KGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNL 310
             +++N+ + T+L+++YAKCG+++ AR +FD M ++D+V WS MI GYA     KE + L
Sbjct: 244 MEMQKNSFVRTTLVNLYAKCGKMEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEGIEL 303

Query: 311 FHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAK 370
           F +M + N+ P++ ++V  L SCA LGA + G+W    I + +    + +   LID YAK
Sbjct: 304 FLQMLQENLKPDQFSIVGFLSSCASLGALDLGEWGISLIDRHEFLTNLFMANALIDMYAK 363

Query: 371 CGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGV 430
           CG + R  EVFKEM  K++    A I GLA NG  K++   F    +  + P+  TF+G+
Sbjct: 364 CGAMARGFEVFKEMKEKDIVIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTFLGL 423

Query: 431 LSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPP 490
           L  C HA L+  G   FN++   + ++  +EHYGCMVD+ GRAG L++AY+ I +MP  P
Sbjct: 424 LCGCVHAGLIQDGLRFFNAISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMPMRP 483

Query: 491 NAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRS 550
           NA+VW  LL+ CR  K+ ++AE  L+ +  LEP ++G+Y+ LSN Y++ GR ++A  VR 
Sbjct: 484 NAIVWGALLSGCRLVKDTQLAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAEVRD 543

Query: 551 LIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDD 610
           ++ +K +KKIPG S IEL+G VHEF ++D  H  S +I+  L+ +  +++ +G+VP T+ 
Sbjct: 544 MMNKKGMKKIPGYSWIELEGKVHEFLADDKSHPLSDKIYAKLEDLGNEMRLMGFVPTTEF 603

Query: 611 ARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFE 670
              + EEE KE  + +HSEKLA+A GLI T     IR+ KNLR+C DCH   K IS++  
Sbjct: 604 VFFDVEEEEKERVLGYHSEKLAVALGLISTDHGQVIRVVKNLRVCGDCHEVMKLISKITR 663

Query: 671 RMIIVRDRNRFHHFKDGLCSCNDYW 696
           R I+VRD NRFH F +G CSCNDYW
Sbjct: 664 REIVVRDNNRFHCFTNGSCSCNDYW 685

BLAST of Csa1G003520 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 510.4 bits (1313), Expect = 1.8e-144
Identity = 268/704 (38.07%), Positives = 417/704 (59.23%), Query Frame = 1

Query: 27  ILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNHIDKPES 86
           ++++C + + L+Q H H+++T    DP     +   AAL    +++YA  +F+ I KP S
Sbjct: 36  LIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNS 95

Query: 87  SAYNVMIRGLAFKRSPDNALLLFKKM-HEKSVQHDKFTFSSVLKACSRMKALREGEQVHA 146
            A+N +IR  A    P  ++  F  M  E     +K+TF  ++KA + + +L  G+ +H 
Sbjct: 96  FAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHG 155

Query: 147 LILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKNGLWDE 206
           + +KS   S+ FV N+LI  Y +CG +  A  VF  + E+ +V+WNSM++G+ + G  D+
Sbjct: 156 MAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDK 215

Query: 207 ----------------------VVKLFRKILELRI-----EFDDVTMISVLMACGRLANL 266
                                 V+    KI  L        + +   ++V +     A L
Sbjct: 216 ALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLAN-AML 275

Query: 267 EIGELIGEYIVSKGL-----RRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSA 326
           ++    G    +K L      ++N   T+++D YA     + AR++ + M ++D+VAW+A
Sbjct: 276 DMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNA 335

Query: 327 MISGYAQADRCKEALNLFHEMQ-KGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKK 386
           +IS Y Q  +  EAL +FHE+Q + N+  N++T+VS L +CA +GA E G+W+H YIKK 
Sbjct: 336 LISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKH 395

Query: 387 KMKLTVTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFF 446
            +++   + + LI  Y+KCG +++S EVF  +  ++VF W+A+I GLA +G G  A++ F
Sbjct: 396 GIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMF 455

Query: 447 SSMLENDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGR 506
             M E +VKPN VTF  V  ACSH  LVD+   LF+ M  ++ I P  +HY C+VD+LGR
Sbjct: 456 YKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGR 515

Query: 507 AGFLEEAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILL 566
           +G+LE+A +FI+ MP PP+  VW  LL +C+ H N+ +AE +   +  LEP + G ++LL
Sbjct: 516 SGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLL 575

Query: 567 SNTYALVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDAL 626
           SN YA +G+ E+   +R  ++   +KK PGCS IE+DG++HEF S D  H  S++++  L
Sbjct: 576 SNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKL 635

Query: 627 DKMMKQIKRLGYVPNTDDA-RLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKN 686
            ++M+++K  GY P      ++  EEE KE S++ HSEKLAI YGLI T     IR+ KN
Sbjct: 636 HEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKN 695

Query: 687 LRMCRDCHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           LR+C DCH+  K ISQ+++R IIVRDR RFHHF++G CSCND+W
Sbjct: 696 LRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of Csa1G003520 vs. TAIR10
Match: AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 504.2 bits (1297), Expect = 1.3e-142
Identity = 259/625 (41.44%), Positives = 381/625 (60.96%), Query Frame = 1

Query: 71  IDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKA 130
           ++ A  +F+ + + +  ++N ++ G +       AL + K M E++++    T  SVL A
Sbjct: 186 VNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFITIVSVLPA 245

Query: 131 CSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAW 190
            S ++ +  G+++H   ++SGF S   +   L+ MYA CG +  AR +FDGM ER++V+W
Sbjct: 246 VSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSW 305

Query: 191 NSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVS 250
           NSM+  Y +N    E + +F+K+L+  ++  DV+++  L AC  L +LE G  I +  V 
Sbjct: 306 NSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVE 365

Query: 251 KGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNL 310
            GL RN ++  SLI MY KC +VDTA  +F ++  R +V+W+AMI G+AQ  R  +ALN 
Sbjct: 366 LGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNY 425

Query: 311 FHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAK 370
           F +M+   V P+  T VSV+ + A L      KW+H  + +  +   V + T L+D YAK
Sbjct: 426 FSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAK 485

Query: 371 CGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGV 430
           CG I  +  +F  MS ++V TW A+I G   +G GK ALE F  M +  +KPN VTF+ V
Sbjct: 486 CGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSV 545

Query: 431 LSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPP 490
           +SACSH+ LV+ G   F  M+ ++ IE  ++HYG MVD+LGRAG L EA+ FI  MP  P
Sbjct: 546 ISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKP 605

Query: 491 NAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRS 550
              V+  +L +C+ HKN+  AEK+ E +  L P   G ++LL+N Y      E   +VR 
Sbjct: 606 AVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRV 665

Query: 551 LIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDD 610
            +  + ++K PGCS++E+   VH FFS    H  SK+I+  L+K++  IK  GYVP+T +
Sbjct: 666 SMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDT-N 725

Query: 611 ARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFE 670
             L  E + KE  +S HSEKLAI++GL+ T+  TTI + KNLR+C DCHNATK+IS V  
Sbjct: 726 LVLGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTG 785

Query: 671 RMIIVRDRNRFHHFKDGLCSCNDYW 696
           R I+VRD  RFHHFK+G CSC DYW
Sbjct: 786 REIVVRDMQRFHHFKNGACSCGDYW 809


HSP 2 Score: 267.7 bits (683), Expect = 2.0e-71
Identity = 146/485 (30.10%), Positives = 261/485 (53.81%), Query Frame = 1

Query: 21  ENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDYALSIFNH 80
           E+P +L+L++C + K+L+Q+   + K     +      ++  +      ++D A  +F  
Sbjct: 37  EHPAALLLERCSSLKELRQILPLVFKNGLYQEHFFQTKLV--SLFCRYGSVDEAARVFEP 96

Query: 81  IDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSRMKALREG 140
           ID   +  Y+ M++G A     D AL  F +M    V+   + F+ +LK C     LR G
Sbjct: 97  IDSKLNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVG 156

Query: 141 EQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSMLSGYTKN 200
           +++H L++KSGF  + F    L  MYA C Q+  AR VFD MPER +V+WN++++GY++N
Sbjct: 157 KEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQN 216

Query: 201 GLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGLRRNNTLT 260
           G+    +++ + + E  ++   +T++SVL A   L  + +G+ I  Y +  G      ++
Sbjct: 217 GMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS 276

Query: 261 TSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHEMQKGNVY 320
           T+L+DMYAKCG ++TAR+LFD M +R+VV+W++MI  Y Q +  KEA+ +F +M    V 
Sbjct: 277 TALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVK 336

Query: 321 PNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGYIDRSVEV 380
           P +V+++  L++CA LG  E G+++H    +  +   V++   LI  Y KC  +D +  +
Sbjct: 337 PTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASM 396

Query: 381 FKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSACSHACLV 440
           F ++  + + +W A+I G A NG    AL +FS M    VKP+  T++ V++A +   + 
Sbjct: 397 FGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSIT 456

Query: 441 DQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAVVWRTLLA 500
              + +   + R   ++  +     +VD+  + G +  A + I +M    +   W  ++ 
Sbjct: 457 HHAKWIHGVVMRSC-LDKNVFVTTALVDMYAKCGAIMIA-RLIFDMMSERHVTTWNAMID 516

Query: 501 SCRAH 506
               H
Sbjct: 517 GYGTH 517

BLAST of Csa1G003520 vs. NCBI nr
Match: gi|449440989|ref|XP_004138266.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Cucumis sativus])

HSP 1 Score: 1399.0 bits (3620), Expect = 0.0e+00
Identity = 695/695 (100.00%), Positives = 695/695 (100.00%), Query Frame = 1

Query: 1   MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL 60
           MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL
Sbjct: 1   MASIVGCLPNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVL 60

Query: 61  ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD 120
           ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD
Sbjct: 61  ESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHD 120

Query: 121 KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD 180
           KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD
Sbjct: 121 KFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFD 180

Query: 181 GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI 240
           GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI
Sbjct: 181 GMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEI 240

Query: 241 GELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ 300
           GELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ
Sbjct: 241 GELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQ 300

Query: 301 ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTL 360
           ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTL
Sbjct: 301 ADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTL 360

Query: 361 GTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV 420
           GTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV
Sbjct: 361 GTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDV 420

Query: 421 KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY 480
           KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY
Sbjct: 421 KPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAY 480

Query: 481 QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG 540
           QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG
Sbjct: 481 QFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVG 540

Query: 541 RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK 600
           RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK
Sbjct: 541 RVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIK 600

Query: 601 RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHN 660
           RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHN
Sbjct: 601 RLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHN 660

Query: 661 ATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           ATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW
Sbjct: 661 ATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 695

BLAST of Csa1G003520 vs. NCBI nr
Match: gi|659129063|ref|XP_008464509.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Cucumis melo])

HSP 1 Score: 1332.8 bits (3448), Expect = 0.0e+00
Identity = 663/698 (94.99%), Positives = 677/698 (96.99%), Query Frame = 1

Query: 1   MASIVGCLPNISLTSITQ---FPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITE 60
           MASIVGCLP  SLTSITQ   FPENPKSLILQQCKTPKDL+QVHAHLLKTRRLLDPIITE
Sbjct: 1   MASIVGCLPITSLTSITQISQFPENPKSLILQQCKTPKDLRQVHAHLLKTRRLLDPIITE 60

Query: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSV 120
           AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHE SV
Sbjct: 61  AVLESAALLLPDTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHENSV 120

Query: 121 QHDKFTFSSVLKACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180
           QHDKFTFSSVLKACSRM+ L+EGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH
Sbjct: 121 QHDKFTFSSVLKACSRMRGLKEGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARH 180

Query: 181 VFDGMPERSIVAWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLAN 240
           VFDGMPER IVAWNSMLSGYTKNGLWDEVVKLF+KILEL I FDDVTMISVLMACGRLAN
Sbjct: 181 VFDGMPERGIVAWNSMLSGYTKNGLWDEVVKLFQKILELNIGFDDVTMISVLMACGRLAN 240

Query: 241 LEIGELIGEYIVSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISG 300
           LE+GELIGEYIVSKGLRRNNTL TSLIDMYAKCG++DTARKLF+EMDKRDVVAWSAMISG
Sbjct: 241 LEMGELIGEYIVSKGLRRNNTLITSLIDMYAKCGRIDTARKLFNEMDKRDVVAWSAMISG 300

Query: 301 YAQADRCKEALNLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLT 360
           YAQADRCKEALNLFHEMQKGNV PNEVTMVSVLYSCAMLGAY+TGKWVHFYIKKKKMKLT
Sbjct: 301 YAQADRCKEALNLFHEMQKGNVDPNEVTMVSVLYSCAMLGAYQTGKWVHFYIKKKKMKLT 360

Query: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLE 420
           VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFS MLE
Sbjct: 361 VTLGTQLIDFYAKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSLMLE 420

Query: 421 NDVKPNDVTFIGVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480
           NDVKPNDVTFIGVLSACSHACLVDQGR+LFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE
Sbjct: 421 NDVKPNDVTFIGVLSACSHACLVDQGRNLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLE 480

Query: 481 EAYQFIDNMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYA 540
           EAYQFID+MPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEP HSGDYILLSNTYA
Sbjct: 481 EAYQFIDSMPFPPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPTHSGDYILLSNTYA 540

Query: 541 LVGRVEDAIRVRSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600
           LVGRVEDAIRVRSLIKEKEIKK PGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK
Sbjct: 541 LVGRVEDAIRVRSLIKEKEIKKTPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMK 600

Query: 601 QIKRLGYVPNTDDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRD 660
           QIK LGYVPN + ARLEAEEE+KETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMC D
Sbjct: 601 QIKTLGYVPNIEGARLEAEEENKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCGD 660

Query: 661 CHNATKFISQVFERMIIVRDRNRFHHFKDGLCSCNDYW 696
           CHNATK+ISQ FERMIIVRDRNRFHHFKDGLCSC DYW
Sbjct: 661 CHNATKYISQAFERMIIVRDRNRFHHFKDGLCSCKDYW 698

BLAST of Csa1G003520 vs. NCBI nr
Match: gi|225456890|ref|XP_002277458.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g08070 [Vitis vinifera])

HSP 1 Score: 1007.7 bits (2604), Expect = 1.0e-290
Identity = 479/682 (70.23%), Positives = 579/682 (84.90%), Query Frame = 1

Query: 14  TSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLPDTIDY 73
           TSI+ FPENPK+LIL+QCKT +DL ++HAHL+KTR LL P + E +LESAA+LLP ++DY
Sbjct: 17  TSISLFPENPKTLILEQCKTIRDLNEIHAHLIKTRLLLKPKVAENLLESAAILLPTSMDY 76

Query: 74  ALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVLKACSR 133
           A+SIF  ID+P+S AYN+MIRG   K+SP  A+LLFK+MHE SVQ D+FTF  +LK CSR
Sbjct: 77  AVSIFRQIDEPDSPAYNIMIRGFTLKQSPHEAILLFKEMHENSVQPDEFTFPCILKVCSR 136

Query: 134 MKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIVAWNSM 193
           ++AL EGEQ+HALI+K GF S+ FV+NTLI MYANCG++ VAR VFD M ER++  WNSM
Sbjct: 137 LQALSEGEQIHALIMKCGFGSHGFVKNTLIHMYANCGEVEVARRVFDEMSERNVRTWNSM 196

Query: 194 LSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYIVSKGL 253
            +GYTK+G W+EVVKLF ++LEL I FD+VT++SVL ACGRLA+LE+GE I  Y+  KGL
Sbjct: 197 FAGYTKSGNWEEVVKLFHEMLELDIRFDEVTLVSVLTACGRLADLELGEWINRYVEEKGL 256

Query: 254 RRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEALNLFHE 313
           + N TL TSL+DMYAKCGQVDTAR+LFD+MD+RDVVAWSAMISGY+QA RC+EAL+LFHE
Sbjct: 257 KGNPTLITSLVDMYAKCGQVDTARRLFDQMDRRDVVAWSAMISGYSQASRCREALDLFHE 316

Query: 314 MQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFYAKCGY 373
           MQK N+ PNE+TMVS+L SCA+LGA ETGKWVHF+IKKK+MKLTVTLGT L+DFYAKCG 
Sbjct: 317 MQKANIDPNEITMVSILSSCAVLGALETGKWVHFFIKKKRMKLTVTLGTALMDFYAKCGS 376

Query: 374 IDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFIGVLSA 433
           ++ S+EVF +M  KNV +WT LIQGLA+NG+GK ALE+F  MLE +V+PNDVTFIGVLSA
Sbjct: 377 VESSIEVFGKMPVKNVLSWTVLIQGLASNGQGKKALEYFYLMLEKNVEPNDVTFIGVLSA 436

Query: 434 CSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPFPPNAV 493
           CSHA LVD+GR LF SM RDF IEPRIEHYGCMVDILGRAG +EEA+QFI NMP  PNAV
Sbjct: 437 CSHAGLVDEGRDLFVSMSRDFGIEPRIEHYGCMVDILGRAGLIEEAFQFIKNMPIQPNAV 496

Query: 494 VWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRVRSLIK 553
           +WRTLLASC+ HKN+E+ E+SL+ +  LEP HSGDYILLSN YA VGR EDA++VR  +K
Sbjct: 497 IWRTLLASCKVHKNVEIGEESLKQLIILEPTHSGDYILLSNIYASVGRWEDALKVRGEMK 556

Query: 554 EKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNTDDARL 613
           EK IKK PGCSLIELDGV+HEFF+ED  H  S+EI++A++ MMKQIK  GYVPNT +ARL
Sbjct: 557 EKGIKKTPGCSLIELDGVIHEFFAEDNVHSQSEEIYNAIEDMMKQIKSAGYVPNTAEARL 616

Query: 614 EAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQVFERMI 673
           +AEE+ KE+SVSHHSEKLAIA+GLI++ P TTIRI+KNLR+C DCHNATK +S+VF R I
Sbjct: 617 DAEEDDKESSVSHHSEKLAIAFGLIKSPPGTTIRITKNLRVCTDCHNATKLVSKVFNREI 676

Query: 674 IVRDRNRFHHFKDGLCSCNDYW 696
           +VRDR RFHHFK+G CSCNDYW
Sbjct: 677 VVRDRTRFHHFKEGSCSCNDYW 698

BLAST of Csa1G003520 vs. NCBI nr
Match: gi|595844774|ref|XP_007208802.1| (hypothetical protein PRUPE_ppa024573mg [Prunus persica])

HSP 1 Score: 994.2 bits (2569), Expect = 1.2e-286
Identity = 483/687 (70.31%), Positives = 575/687 (83.70%), Query Frame = 1

Query: 9   PNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLP 68
           P  ++T+I QFP NPK+LILQQCKT +DL QVHAHL+KTR LL+P ITE +LESAA+LLP
Sbjct: 13  PLTAITTIPQFPHNPKTLILQQCKTTRDLNQVHAHLIKTRLLLNPTITENLLESAAILLP 72

Query: 69  DTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVL 128
           + +DYALSIF+++D+P++  YN+MIR L +K SP  A LLFKKM E S + D+FT SS+L
Sbjct: 73  NAMDYALSIFHNLDEPDTLVYNIMIRSLTYKLSPLEAFLLFKKMQESSAEPDEFTLSSIL 132

Query: 129 KACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIV 188
           KACS+++ALREGEQ+HA I+K GFKSN FVENTLI MYA CG++ VAR VFDG+PER+ +
Sbjct: 133 KACSKLRALREGEQIHAHIVKCGFKSNGFVENTLIHMYATCGELEVARRVFDGLPERARM 192

Query: 189 AWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYI 248
           AWNSML+GY KN  WDEVVKLF ++L+L + FD+VT+ SVL ACGRLANLE+GE IG+YI
Sbjct: 193 AWNSMLAGYMKNKCWDEVVKLFHEMLKLGVGFDEVTLTSVLTACGRLANLELGEWIGDYI 252

Query: 249 VSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEAL 308
            +  L+ N  L TSL+DMYAKCGQV+TAR+ FD MD+RDVVAWSAMISGY+QA+RC+EAL
Sbjct: 253 EANRLKGNIALVTSLVDMYAKCGQVETARRFFDRMDRRDVVAWSAMISGYSQANRCREAL 312

Query: 309 NLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFY 368
           +LFH+MQK NV PNEVTMVSVLYSCA+LGA +TGKWV FYIKK+K+KLTV LGT LIDFY
Sbjct: 313 DLFHDMQKANVDPNEVTMVSVLYSCAVLGALKTGKWVEFYIKKEKLKLTVNLGTALIDFY 372

Query: 369 AKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFI 428
           AKCG ID S+EVF  M   NVF+WTALIQGLA+NG+GK ALE+F  M E ++KPN+VTFI
Sbjct: 373 AKCGCIDSSIEVFNRMPSTNVFSWTALIQGLASNGQGKGALEYFQLMQEKNIKPNNVTFI 432

Query: 429 GVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPF 488
            VLSACSHA LV++GR+LF SM +DF IEPRIEHYG MVDILGRAG +EEAYQFI NMP 
Sbjct: 433 AVLSACSHAGLVNEGRNLFTSMIKDFGIEPRIEHYGSMVDILGRAGLIEEAYQFIKNMPI 492

Query: 489 PPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRV 548
            PNAVVWRTLLASCRAHKN+E+ E+SL+HI  LE  HSGDYILLSN YA V R EDAIRV
Sbjct: 493 QPNAVVWRTLLASCRAHKNVEIGEESLKHIISLETPHSGDYILLSNIYASVDRREDAIRV 552

Query: 549 RSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNT 608
           R  ++EK I+K PGCSLIELDGV++EFF+ED    H +E+++A   MMK+IK  GYVP T
Sbjct: 553 RDQMREKGIEKAPGCSLIELDGVIYEFFAEDKACPHLEEVYNATHDMMKRIKEAGYVPYT 612

Query: 609 DDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQV 668
            DARL+AEE+ KE SVSHHSEKLAIA+GLIRT P TT+RISKNLR+C DCHNATK IS+V
Sbjct: 613 TDARLDAEEDEKEASVSHHSEKLAIAFGLIRTLPGTTLRISKNLRVCTDCHNATKMISKV 672

Query: 669 FERMIIVRDRNRFHHFKDGLCSCNDYW 696
           F R I+VRD NRFHHFK+G CSCNDYW
Sbjct: 673 FNRQIVVRDWNRFHHFKEGSCSCNDYW 699

BLAST of Csa1G003520 vs. NCBI nr
Match: gi|645268002|ref|XP_008239334.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Prunus mume])

HSP 1 Score: 989.9 bits (2558), Expect = 2.2e-285
Identity = 478/687 (69.58%), Positives = 577/687 (83.99%), Query Frame = 1

Query: 9   PNISLTSITQFPENPKSLILQQCKTPKDLQQVHAHLLKTRRLLDPIITEAVLESAALLLP 68
           P  ++T+I+QFP NPK+LILQQCKT +DL QVHAHL+KTR LL+P ITE  LESAA+LLP
Sbjct: 13  PLTAITTISQFPHNPKTLILQQCKTTRDLNQVHAHLIKTRLLLNPAITENFLESAAILLP 72

Query: 69  DTIDYALSIFNHIDKPESSAYNVMIRGLAFKRSPDNALLLFKKMHEKSVQHDKFTFSSVL 128
           + +DYA+S+F+++D+P++  YN+MIR L +K+SP  A LLFKKM E S + D+FT SS+L
Sbjct: 73  NAMDYAVSVFHNLDEPDTLVYNIMIRSLTYKQSPLEAFLLFKKMQESSAEPDEFTLSSIL 132

Query: 129 KACSRMKALREGEQVHALILKSGFKSNEFVENTLIQMYANCGQIGVARHVFDGMPERSIV 188
           KACS+++ALREGEQ+HA ++K GF SN FVENTLI MYA CG++ VAR VFDG+PER+ +
Sbjct: 133 KACSKLRALREGEQIHAHVVKCGFMSNGFVENTLIHMYATCGELEVARRVFDGLPERARM 192

Query: 189 AWNSMLSGYTKNGLWDEVVKLFRKILELRIEFDDVTMISVLMACGRLANLEIGELIGEYI 248
           AWNSML+GY KN  WDEVVKLF ++L+L + FD+VT+ISVL ACGRLANLE+GE IG+YI
Sbjct: 193 AWNSMLAGYMKNKCWDEVVKLFHEMLKLGVGFDEVTLISVLTACGRLANLELGEWIGDYI 252

Query: 249 VSKGLRRNNTLTTSLIDMYAKCGQVDTARKLFDEMDKRDVVAWSAMISGYAQADRCKEAL 308
            +  L+ N  L TSL+DMYAKCGQV+TAR+ FD+MD+RDVVAWSAMISGY+QA+RC+EAL
Sbjct: 253 EANRLKVNIALVTSLVDMYAKCGQVETARRFFDQMDRRDVVAWSAMISGYSQANRCREAL 312

Query: 309 NLFHEMQKGNVYPNEVTMVSVLYSCAMLGAYETGKWVHFYIKKKKMKLTVTLGTQLIDFY 368
           +LFH+MQK NV PNEVTMVSVLYSCA+LGA +TGKWV FYIKKKK+KLTV LGT LIDFY
Sbjct: 313 DLFHDMQKANVDPNEVTMVSVLYSCAVLGALKTGKWVEFYIKKKKLKLTVNLGTALIDFY 372

Query: 369 AKCGYIDRSVEVFKEMSFKNVFTWTALIQGLANNGEGKMALEFFSSMLENDVKPNDVTFI 428
           AKCG ID S+EVF  M   NVF+WTALIQGLA+NG+GK ALE+F  M E ++KPN+VTFI
Sbjct: 373 AKCGCIDSSIEVFNRMPSTNVFSWTALIQGLASNGQGKGALEYFQLMQEKNIKPNNVTFI 432

Query: 429 GVLSACSHACLVDQGRHLFNSMRRDFDIEPRIEHYGCMVDILGRAGFLEEAYQFIDNMPF 488
            VLSACSHA LV++GR+LF SM +DF IEPRIEHYG MVDILGRAG +EEAYQFI +MP 
Sbjct: 433 AVLSACSHAGLVNEGRNLFTSMIKDFGIEPRIEHYGSMVDILGRAGLIEEAYQFIKSMPI 492

Query: 489 PPNAVVWRTLLASCRAHKNIEMAEKSLEHITRLEPAHSGDYILLSNTYALVGRVEDAIRV 548
            PNAVVWRTL ASCRAHKN+E+ E+SL+HI  LE  HSGDYILLSN YA V R EDAI+V
Sbjct: 493 QPNAVVWRTLFASCRAHKNVEIGEESLKHIISLEAPHSGDYILLSNIYASVDRREDAIQV 552

Query: 549 RSLIKEKEIKKIPGCSLIELDGVVHEFFSEDGEHKHSKEIHDALDKMMKQIKRLGYVPNT 608
           R+ ++EK I+K PGCSLIELDGV++EFF+ED    H +E+++A   MMK+IK  GYVP T
Sbjct: 553 RNQMREKGIEKAPGCSLIELDGVIYEFFAEDKACPHLEEVYNATHDMMKRIKEAGYVPYT 612

Query: 609 DDARLEAEEESKETSVSHHSEKLAIAYGLIRTSPRTTIRISKNLRMCRDCHNATKFISQV 668
            DARL+AEE+ KE SVSHHSEKLAIA+GLIRT P TT+RISKNLR+C DCHNATK IS+V
Sbjct: 613 ADARLDAEEDDKEASVSHHSEKLAIAFGLIRTLPGTTLRISKNLRVCTDCHNATKMISKV 672

Query: 669 FERMIIVRDRNRFHHFKDGLCSCNDYW 696
           F R I+VRD NRFHHFK+G CSCNDYW
Sbjct: 673 FNRQIVVRDWNRFHHFKEGSCSCNDYW 699
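
The identity and positives percentages in the HSP summary lines above are match counts divided by alignment length. As a minimal sketch (the function name and regular expression are illustrative assumptions, not part of any BLAST tool), the reported values can be recomputed from the counts like this:

```python
import re

# Matches fields such as "Identity = 483/687 (70.31%)". The label pattern is
# deliberately loose because some reports spell "Positives" as "Postives".
FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def check_hsp_percentages(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    result = {}
    for label, num, den, pct in FIELD.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        matches, length = int(num), int(den)
        recomputed = round(100.0 * matches / length, 2)
        result[key] = (matches, length, float(pct), recomputed)
    return result


# Summary lines copied from the two Prunus hits above.
persica = check_hsp_percentages(
    "Identity = 483/687 (70.31%), Positives = 575/687 (83.70%), Query Frame = 1"
)
mume = check_hsp_percentages(
    "Identity = 478/687 (69.58%), Positives = 577/687 (83.99%), Query Frame = 1"
)

for name, stats in (("Prunus persica", persica), ("Prunus mume", mume)):
    for label, (matches, length, reported, recomputed) in stats.items():
        print(f"{name}: {label} {matches}/{length} reported={reported} recomputed={recomputed}")
```

For both hits the recomputed percentages agree with the reported ones to two decimal places.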

The following BLAST results are available for this feature:
Match Name    E-value    Identity  Description
PPR21_ARATH   6.6e-149   37.26     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
PP311_ARATH   2.5e-148   37.85     Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana GN...
PP219_ARATH   2.3e-146   38.54     Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis th...
PP175_ARATH   3.2e-143   38.07     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PPR32_ARATH   2.3e-141   41.44     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Match Name         E-value    Identity  Description
F6GTR8_VITVI       7.0e-291   70.23     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g09300 PE=4 SV=...
M5W9L5_PRUPE       8.0e-287   70.31     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa024573mg PE=4 SV=1
W9RUI0_9ROSA       2.7e-282   69.62     Uncharacterized protein OS=Morus notabilis GN=L484_002061 PE=4 SV=1
A0A067FPY8_CITSI   1.3e-268   64.83     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g005476mg PE=4 SV=1
V4TIJ5_9ROSI       3.7e-268   64.83     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10024603mg PE=4 SV=1
Match Name    E-value    Identity  Description
AT1G08070.1   3.7e-150   37.26     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G14820.1   1.4e-149   37.85     Pentatricopeptide repeat (PPR) superfamily protein
AT3G08820.1   1.3e-147   38.54     Pentatricopeptide repeat (PPR) superfamily protein
AT2G29760.1   1.8e-144   38.07     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1   1.3e-142   41.44     Pentatricopeptide repeat (PPR) superfamily protein
Match Name                         E-value    Identity  Description
gi|449440989|ref|XP_004138266.1|   0.0e+00    100.00    PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Cucumis s...
gi|659129063|ref|XP_008464509.1|   0.0e+00    94.99     PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Cucumis m...
gi|225456890|ref|XP_002277458.1|   1.0e-290   70.23     PREDICTED: pentatricopeptide repeat-containing protein At1g08070 [Vitis vinifera...
gi|595844774|ref|XP_007208802.1|   1.2e-286   70.31     hypothetical protein PRUPE_ppa024573mg [Prunus persica]
gi|645268002|ref|XP_008239334.1|   2.2e-285   69.58     PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Prunus mu...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term         Definition
GO:0008270   zinc ion binding
GO:0005515   protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
biological_process   GO:0009451       RNA modification
cellular_component   GO:0005575       cellular_component
cellular_component   GO:0043231       intracellular membrane-bounded organelle
molecular_function   GO:0005515       protein binding
molecular_function   GO:0008270       zinc ion binding
molecular_function   GO:0004519       endonuclease activity
molecular_function   GO:0003723       RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Csa1G003520.1   Csa1G003520.1   mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term    IPR Description                         Source     Source Term        Source Description
    Alignment (one coord / score pair per line)

IPR002885   Pentatricopeptide repeat                PFAM       PF01535            PPR
    coord: 462..486   score: 0.089
    coord: 188..217   score: 8.2E-8
    coord: 160..186   score:

IPR002885   Pentatricopeptide repeat                PFAM       PF13041            PPR_2
    coord: 387..435   score: 1.9E-11
    coord: 85..132    score: 4.9E-10
    coord: 287..333   score: 9.6

IPR002885   Pentatricopeptide repeat                TIGRFAMs   TIGR00756          TIGR00756
    coord: 188..221   score: 2.1E-6
    coord: 390..423   score: 3.1E-7
    coord: 289..323   score: 1.1E-7
    coord: 261..289   score: 6.5E-6
    coord: 88..118    score: 0

IPR002885   Pentatricopeptide repeat                PROFILE    PS51375            PPR
    coord: 459..489   score: 6.445
    coord: 357..387   score: 6.796
    coord: 186..220   score: 11.027
    coord: 423..453   score: 6.873
    coord: 256..286   score: 9.35
    coord: 491..521   score: 5.886
    coord: 322..356   score: 6.007
    coord: 221..255   score: 5.634
    coord: 85..119    score: 9.087
    coord: 155..185   score: 7.794
    coord: 287..321   score: 12.2
    coord: 120..154   score: 9.153
    coord: 388..422   score: 11.696
    coord: 525..559   score: 6

IPR011990   Tetratricopeptide-like helical domain   GENE3D     G3DSA:1.25.40.10
    coord: 373..418   score: 2.8E-5
    coord: 253..315   score: 9.3E-6
    coord: 488..547   score: 2.8E-5
    coord: 184..216   score: 9.

IPR011990   Tetratricopeptide-like helical domain   unknown    SSF48452           TPR-like
    coord: 267..311   score: 6.28E-6
    coord: 474..547   score: 6.2

None        No IPR available                        PANTHER    PTHR24015          FAMILY NOT NAMED
    coord: 14..566    score:

None        No IPR available                        PANTHER    PTHR24015:SF469    PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 14..566    score:
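
The PS51375 (PROFILE) hits in the InterPro annotations above are listed out of order, so the repeat layout is hard to see at a glance. The following is a minimal sketch, assuming the `start..end` coordinates are 1-based and inclusive; the helper names are hypothetical. It sorts the hits and reports each hit's length and the gap to the next hit:

```python
# Coordinates copied from the PS51375 (PROFILE) hits above, as "start..end".
PS51375_HITS = """459..489 357..387 186..220 423..453 256..286 491..521 322..356
221..255 85..119 155..185 287..321 120..154 388..422 525..559""".split()


def parse_range(text):
    """Turn 'start..end' into an (int, int) tuple."""
    start, end = text.split("..")
    return int(start), int(end)


def summarize(hits):
    """Sort hits by position; return ranges, lengths, and gaps between neighbours."""
    ranges = sorted(parse_range(h) for h in hits)
    # Assumption: coordinates are inclusive, so length = end - start + 1.
    lengths = [end - start + 1 for start, end in ranges]
    # Gap = number of residues between the end of one hit and the start of the next.
    gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(ranges, ranges[1:])]
    return ranges, lengths, gaps


ranges, lengths, gaps = summarize(PS51375_HITS)
for (start, end), length in zip(ranges, lengths):
    print(f"{start:>3}..{end:<3}  length={length}")
print("gaps between consecutive hits:", gaps)
```

Under that assumption, the 14 hits run from residue 85 to residue 559, and each spans 31 or 35 residues. Most hits directly follow the previous one; short gaps appear only between the last four.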

The following gene(s) are paralogous to this gene:
Gene          Paralogue     Organism                     Block
Csa1G003520   Csa3G598390   Cucumber (Chinese Long) v2   cucuB014