Cucsa.251540 (gene) Cucumber (Gy14) v1

NameCucsa.251540
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationscaffold02229 : 588733 .. 590366 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACGACCATTTCCCTCTTCTTCTCCAACGGTTTGTTTTTCCAATGCTGATTTCTTTGCACTTGTTAACTCCACCCTCATCTTCTCTTCTTCTCTTCTGTTCTTCCAAACCCAAAAAATCCAAGAAAGAGAGAAGGAAACTTCTCCACCAAAAACTTCTCCGCATTAGCAAAGCTAAACAGTCCACTGATCTCTCCTTCCCCAAATCCTCGCCAACCCCTCTCTTAATCCACCCCAAACCCTTCTTCCAGTCCAAAATTCAAGCCCTTGATGCTGTTCTCACCGACCTTGAAGCTTCCATCGACAATGGCCTCTTTATTGATCCTGAAATTTTCTCTTCCCTTTTGGAACTTTGTTACCAATTGCAAGCTATTCACCATGGTATTCGGATTCATCGCTTAATACCCACCAATCTTTTAAGGAGAAATGTGGGTATTTCTTCTAAGCTTCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGGATGCACACCAGGTGTTCGACGAAATGGGTAATCGTAATTTCTCTGCATTTGCTTGGAATTCTCTTATTTCTGGATACGCGGAACTTGGTCTTTATGAAGATGCTCTGGCGCTTTACTTTCAAATGGAGGAAGAAGGTGTTGAACCTGACAATTTCACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTCGATTCAAATCGGAGAGGCGGTGCATCGGCATGTAGTTCGTTCTGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTTTAGTTGACATGTATTCCAAATGTGGTTGCATTGTGAGGGCTAGGAAAGTGTTTGATCAGATTGAGTATAAGGATATAGTTTCCTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCACTTTGAGGCATTAGACATCTTTGATCAAATGATTCAAGAAGGTTACGAGCCCGATTCGGTTGCTTTGTCCACCCTACTTTCTAACATTTCGTCAATGAAATTCAAGTTACATATTCATGGATGGGTAATTCGACATGGAGTCGAATGGAATTTGTCCATTGCTAACTCCTTGATAGTCATGTATGCCAAATGTGGTAAGCTTAACAGAGCAAAATGGCTGTTCCAGCAAATGCCTCAAAAGGACATGGTCTCATGGAACTCCATAATCTCTGCTCATTTCAATAGCGCAGAAGCTTTGACATATTTCGAAGTGATGGAAAGCCTTGGTGTTTCGCCAGACGGTGTAACATTTGTGTCATTGTTATCAACTTGTGCTCATCTGGGGTTGGTGAAGGAaGGGGGAAaTTGTATTTTTtGATGAAGGGGAAGTACGGAATAAGACCAACAATTGAACATTATGCTTGTATGGTGAATCTTTACGGGAGAGCAGGGATGATTGAAGAAGCTTATAAAATCATAACGAAAGGGATGGAGATCGAGGCAGGTCCGACCATATGGGGGGCGTTGTTGTATGCGTGTTATCTCCATAGCGATGTAGATATCGCTGAGATTGCTGCTGAAAGACTCTTCGAATTGGAGCCAGATAATGAGCTCAATTTTGAGCTTTTGATGAAGATTTATGGCAATGCTGGGAGATCGGAGGACGAGAAGCGAGTGAAATTAATGATGGCAGAACGAGGACTGAATTCATAG

mRNA sequence

ATGAACGACCATTTCCCTCTTCTTCTCCAACGCAAAGCTAAACAGTCCACTGATCTCTCCTTCCCCAAATCCTCGCCAACCCCTCTCTTAATCCACCCCAAACCCTTCTTCCAGTCCAAAATTCAAGCCCTTGATGCTGTTCTCACCGACCTTGAAGCTTCCATCGACAATGGCCTCTTTATTGATCCTGAAATTTTCTCTTCCCTTTTGGAACTTTGTTACCAATTGCAAGCTATTCACCATGGTATTCGGATTCATCGCTTAATACCCACCAATCTTTTAAGGAGAAATGTGGGTATTTCTTCTAAGCTTCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGGATGCACACCAGGTGTTCGACGAAATGGGTAATCGTAATTTCTCTGCATTTGCTTGGAATTCTCTTATTTCTGGATACGCGGAACTTGGTCTTTATGAAGATGCTCTGGCGCTTTACTTTCAAATGGAGGAAGAAGGTGTTGAACCTGACAATTTCACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTCGATTCAAATCGGAGAGGCGGTGCATCGGCATGTAGTTCGTTCTGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTTTAGTTGACATGTATTCCAAATGTGGTTGCATTGTGAGGGCTAGGAAAGTGTTTGATCAGATTGAGTATAAGGATATAGTTTCCTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCACTTTGAGGCATTAGACATCTTTGATCAAATGATTCAAGAAGGTTACGAGCCCGATTCGGTTGCTTTGTCCACCCTACTTTCTAACATTTCGTCAATGAAATTCAAGTTACATATTCATGGATGGGTAATTCGACATGGAGTCGAATGGAATTTGTCCATTGCTAACTCCTTGATAGTCATGTATGCCAAATGTGGTAAGCTTAACAGAGCAAAATGGCTGTTCCAGCAAATGCCTCAAAAGGACATGGTCTCATGGAACTCCATAATCTCTGCTCATTTCAATAGCGCAGAAGCTTTGACATATTTCGAAGTGATGGAAAGCCTTGGTGTTTCGCCAGACGGTGTAACATTTGTGTCATTGTTATCAACTTGTGCTCATCTGGGGTTGGGGAAGTACGGAATAAGACCAACAATTGAACATTATGCTTGTATGGTGAATCTTTACGGGAGAGCAGGGATGATTGAAGAAGCTTATAAAATCATAACGAAAGGGATGGAGATCGAGGCAGGTCCGACCATATGGGGGGCGTTGTTGTATGCGTGTTATCTCCATAGCGATGTAGATATCGCTGAGATTGCTGCTGAAAGACTCTTCGAATTGGAGCCAGATAATGAGCTCAATTTTGAGCTTTTGATGAAGATTTATGGCAATGCTGGGAGATCGGAGGACGAGAAGCGAGTGAAATTAATGATGGCAGAACGAGGACTGAATTCATAG

Coding sequence (CDS)

ATGAACGACCATTTCCCTCTTCTTCTCCAACGCAAAGCTAAACAGTCCACTGATCTCTCCTTCCCCAAATCCTCGCCAACCCCTCTCTTAATCCACCCCAAACCCTTCTTCCAGTCCAAAATTCAAGCCCTTGATGCTGTTCTCACCGACCTTGAAGCTTCCATCGACAATGGCCTCTTTATTGATCCTGAAATTTTCTCTTCCCTTTTGGAACTTTGTTACCAATTGCAAGCTATTCACCATGGTATTCGGATTCATCGCTTAATACCCACCAATCTTTTAAGGAGAAATGTGGGTATTTCTTCTAAGCTTCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGGATGCACACCAGGTGTTCGACGAAATGGGTAATCGTAATTTCTCTGCATTTGCTTGGAATTCTCTTATTTCTGGATACGCGGAACTTGGTCTTTATGAAGATGCTCTGGCGCTTTACTTTCAAATGGAGGAAGAAGGTGTTGAACCTGACAATTTCACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTCGATTCAAATCGGAGAGGCGGTGCATCGGCATGTAGTTCGTTCTGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTTTAGTTGACATGTATTCCAAATGTGGTTGCATTGTGAGGGCTAGGAAAGTGTTTGATCAGATTGAGTATAAGGATATAGTTTCCTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCACTTTGAGGCATTAGACATCTTTGATCAAATGATTCAAGAAGGTTACGAGCCCGATTCGGTTGCTTTGTCCACCCTACTTTCTAACATTTCGTCAATGAAATTCAAGTTACATATTCATGGATGGGTAATTCGACATGGAGTCGAATGGAATTTGTCCATTGCTAACTCCTTGATAGTCATGTATGCCAAATGTGGTAAGCTTAACAGAGCAAAATGGCTGTTCCAGCAAATGCCTCAAAAGGACATGGTCTCATGGAACTCCATAATCTCTGCTCATTTCAATAGCGCAGAAGCTTTGACATATTTCGAAGTGATGGAAAGCCTTGGTGTTTCGCCAGACGGTGTAACATTTGTGTCATTGTTATCAACTTGTGCTCATCTGGGGTTGGGGAAGTACGGAATAAGACCAACAATTGAACATTATGCTTGTATGGTGAATCTTTACGGGAGAGCAGGGATGATTGAAGAAGCTTATAAAATCATAACGAAAGGGATGGAGATCGAGGCAGGTCCGACCATATGGGGGGCGTTGTTGTATGCGTGTTATCTCCATAGCGATGTAGATATCGCTGAGATTGCTGCTGAAAGACTCTTCGAATTGGAGCCAGATAATGAGCTCAATTTTGAGCTTTTGATGAAGATTTATGGCAATGCTGGGAGATCGGAGGACGAGAAGCGAGTGAAATTAATGATGGCAGAACGAGGACTGAATTCATAG

Protein sequence

MNDHFPLLLQRKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCAHLGLGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGLNS*
BLAST of Cucsa.251540 vs. Swiss-Prot
Match: PP337_ARATH (Pentatricopeptide repeat-containing protein At4g25270, chloroplastic OS=Arabidopsis thaliana GN=PCMP-E53 PE=3 SV=1)

HSP 1 Score: 615.1 bits (1585), Expect = 6.4e-175
Identity = 301/483 (62.32%), Postives = 373/483 (77.23%), Query Frame = 1

Query: 17  TDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFI-DPEIFSSLLELCYQ 76
           T LSF K SPTPLLI  +   +++++ALD+V+TDLE S   G+ + +PEIF+SLLE CY 
Sbjct: 45  TSLSFTKPSPTPLLIEKQSIHRTQLEALDSVITDLETSAQKGISLTEPEIFASLLETCYS 104

Query: 77  LQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWN 136
           L+AI HG+R+H LIP  LLR N+GISSKL+RLYAS GY E AH+VFD M  R+ S FAWN
Sbjct: 105 LRAIDHGVRVHHLIPPYLLRNNLGISSKLVRLYASCGYAEVAHEVFDRMSKRDSSPFAWN 164

Query: 137 SLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRS 196
           SLISGYAELG YEDA+ALYFQM E+GV+PD FTFPRVLKACGGIGS+QIGEA+HR +V+ 
Sbjct: 165 SLISGYAELGQYEDAMALYFQMAEDGVKPDRFTFPRVLKACGGIGSVQIGEAIHRDLVKE 224

Query: 197 GFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIF 256
           GF  DV+VLNALV MY+KCG IV+AR VFD I +KD VSWNSMLTGY  HGL  EALDIF
Sbjct: 225 GFGYDVYVLNALVVMYAKCGDIVKARNVFDMIPHKDYVSWNSMLTGYLHHGLLHEALDIF 284

Query: 257 DQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKL 316
             M+Q G EPD VA+S++L+ + S K    +HGWVIR G+EW LS+AN+LIV+Y+K G+L
Sbjct: 285 RLMVQNGIEPDKVAISSVLARVLSFKHGRQLHGWVIRRGMEWELSVANALIVLYSKRGQL 344

Query: 317 NRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCAHL 376
            +A ++F QM ++D VSWN+IISAH  ++  L YFE M      PDG+TFVS+LS CA+ 
Sbjct: 345 GQACFIFDQMLERDTVSWNAIISAHSKNSNGLKYFEQMHRANAKPDGITFVSVLSLCANT 404

Query: 377 GL------------GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWG 436
           G+             +YGI P +EHYACMVNLYGRAGM+EEAY +I + M +EAGPT+WG
Sbjct: 405 GMVEDGERLFSLMSKEYGIDPKMEHYACMVNLYGRAGMMEEAYSMIVQEMGLEAGPTVWG 464

Query: 437 ALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERG 487
           ALLYACYLH + DI E+AA+RLFELEPDNE NFELL++IY  A R+ED +RV+ MM +RG
Sbjct: 465 ALLYACYLHGNTDIGEVAAQRLFELEPDNEHNFELLIRIYSKAKRAEDVERVRQMMVDRG 524

BLAST of Cucsa.251540 vs. Swiss-Prot
Match: PP265_ARATH (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana GN=CRR2 PE=2 SV=1)

HSP 1 Score: 298.9 bits (764), Expect = 1.0e-79
Identity = 164/445 (36.85%), Postives = 269/445 (60.45%), Query Frame = 1

Query: 64  EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE 123
           + +  L+  C    ++   +R+HR I  N   ++  +++KL+ +Y+  G ++ A +VFD+
Sbjct: 78  QTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGSVDYARKVFDK 137

Query: 124 MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGG----I 183
              R    + WN+L       G  E+ L LY++M   GVE D FT+  VLKAC      +
Sbjct: 138 TRKRTI--YVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVASECTV 197

Query: 184 GSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSML 243
             +  G+ +H H+ R G++  V+++  LVDMY++ GC+  A  VF  +  +++VSW++M+
Sbjct: 198 NHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMI 257

Query: 244 TGYTRHGLHFEALDIFDQMIQEGYE--PDSVALSTLL---SNISSMKFKLHIHGWVIRHG 303
             Y ++G  FEAL  F +M++E  +  P+SV + ++L   +++++++    IHG+++R G
Sbjct: 258 ACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRG 317

Query: 304 VEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTYFE 363
           ++  L + ++L+ MY +CGKL   + +F +M  +D+VSWNS+IS+   H    +A+  FE
Sbjct: 318 LDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQIFE 377

Query: 364 VMESLGVSPDGVTFVSLLSTCAHLGL---GK---------YGIRPTIEHYACMVNLYGRA 423
            M + G SP  VTFVS+L  C+H GL   GK         +GI+P IEHYACMV+L GRA
Sbjct: 378 EMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRA 437

Query: 424 GMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELL 483
             ++EA K++ + M  E GP +WG+LL +C +H +V++AE A+ RLF LEP N  N+ LL
Sbjct: 438 NRLDEAAKMV-QDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLL 497

Query: 484 MKIYGNAGRSEDEKRVKLMMAERGL 485
             IY  A   ++ KRVK ++  RGL
Sbjct: 498 ADIYAEAQMWDEVKRVKKLLEHRGL 519

BLAST of Cucsa.251540 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 8.8e-76
Identity = 164/447 (36.69%), Postives = 252/447 (56.38%), Query Frame = 1

Query: 57  NGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMED 116
           +G+ ID     S+   C   + I  G  +H +       R     + LL +Y+  G ++ 
Sbjct: 290 SGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDS 349

Query: 117 AHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKAC 176
           A  VF EM +R  S  ++ S+I+GYA  GL  +A+ L+ +MEEEG+ PD +T   VL  C
Sbjct: 350 AKAVFREMSDR--SVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCC 409

Query: 177 GGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWN 236
                +  G+ VH  +  +    D+FV NAL+DMY+KCG +  A  VF ++  KDI+SWN
Sbjct: 410 ARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWN 469

Query: 237 SMLTGYTRHGLHFEALDIFDQMIQE-GYEPDSVALSTLL---SNISSMKFKLHIHGWVIR 296
           +++ GY+++    EAL +F+ +++E  + PD   ++ +L   +++S+      IHG+++R
Sbjct: 470 TIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMR 529

Query: 297 HGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTY 356
           +G   +  +ANSL+ MYAKCG L  A  LF  +  KD+VSW  +I+    H    EA+  
Sbjct: 530 NGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIAL 589

Query: 357 FEVMESLGVSPDGVTFVSLLSTCAHLGLGKYG------------IRPTIEHYACMVNLYG 416
           F  M   G+  D ++FVSLL  C+H GL   G            I PT+EHYAC+V++  
Sbjct: 590 FNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLA 649

Query: 417 RAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE 476
           R G + +AY+ I + M I    TIWGALL  C +H DV +AE  AE++FELEP+N   + 
Sbjct: 650 RTGDLIKAYRFI-ENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYV 709

Query: 477 LLMKIYGNAGRSEDEKRVKLMMAERGL 485
           L+  IY  A + E  KR++  + +RGL
Sbjct: 710 LMANIYAEAEKWEQVKRLRKRIGQRGL 733


HSP 2 Score: 187.2 bits (474), Expect = 4.3e-46
Identity = 106/328 (32.32%), Postives = 181/328 (55.18%), Query Frame = 1

Query: 55  IDNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYM 114
           + +G+ +D   FS + +    L+++H G ++H  I  +       + + L+  Y     +
Sbjct: 187 MSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRV 246

Query: 115 EDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLK 174
           + A +VFDEM  R+    +WNS+I+GY   GL E  L+++ QM   G+E D  T   V  
Sbjct: 247 DSARKVFDEMTERD--VISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFA 306

Query: 175 ACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVS 234
            C     I +G AVH   V++ F+ +    N L+DMYSKCG +  A+ VF ++  + +VS
Sbjct: 307 GCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVS 366

Query: 235 WNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKF---KLHIHGWVI 294
           + SM+ GY R GL  EA+ +F++M +EG  PD   ++ +L+  +  +       +H W+ 
Sbjct: 367 YTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIK 426

Query: 295 RHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSA---EALT 354
            + + +++ ++N+L+ MYAKCG +  A+ +F +M  KD++SWN+II  +  +    EAL+
Sbjct: 427 ENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALS 486

Query: 355 YFE-VMESLGVSPDGVTFVSLLSTCAHL 376
            F  ++E    SPD  T   +L  CA L
Sbjct: 487 LFNLLLEEKRFSPDERTVACVLPACASL 512


HSP 3 Score: 179.1 bits (453), Expect = 1.2e-43
Identity = 117/402 (29.10%), Postives = 207/402 (51.49%), Query Frame = 1

Query: 61  IDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQV 120
           IDP    S+L+LC   +++  G  +   I  N    +  + SKL  +Y + G +++A +V
Sbjct: 92  IDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRV 151

Query: 121 FDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIG 180
           FDE+  +   A  WN L++  A+ G +  ++ L+ +M   GVE D++TF  V K+   + 
Sbjct: 152 FDEV--KIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLR 211

Query: 181 SIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLT 240
           S+  GE +H  +++SGF     V N+LV  Y K   +  ARKVFD++  +D++SWNS++ 
Sbjct: 212 SVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIIN 271

Query: 241 GYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISS---MKFKLHIHGWVIRHGVEW 300
           GY  +GL  + L +F QM+  G E D   + ++ +  +    +     +H   ++     
Sbjct: 272 GYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSR 331

Query: 301 NLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFN---SAEALTYFEVME 360
                N+L+ MY+KCG L+ AK +F++M  + +VS+ S+I+ +     + EA+  FE ME
Sbjct: 332 EDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEME 391

Query: 361 SLGVSPDGVTFVSLLSTCAHLGLGKYGIRP-----------TIEHYACMVNLYGRAGMIE 420
             G+SPD  T  ++L+ CA   L   G R             I     ++++Y + G ++
Sbjct: 392 EEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQ 451

Query: 421 EAYKIIT--KGMEIEAGPTIWGALLYACYLHSDVDIAEIAAE 444
           EA  + +  +  +I +  TI G     CY +  + +  +  E
Sbjct: 452 EAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLE 491

BLAST of Cucsa.251540 vs. Swiss-Prot
Match: PP219_ARATH (Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis thaliana GN=PCMP-H84 PE=3 SV=1)

HSP 1 Score: 282.7 bits (722), Expect = 7.5e-75
Identity = 165/455 (36.26%), Postives = 256/455 (56.26%), Query Frame = 1

Query: 50  DLEASI-DNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLY 109
           DL  SI  +GL++    F  +L+ C +  +   GI +H L+       +V   + LL +Y
Sbjct: 97  DLFLSIRKHGLYLHGFTFPLVLKACTRASSRKLGIDLHSLVVKCGFNHDVAAMTSLLSIY 156

Query: 110 ASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFT 169
           +  G + DAH++FDE+ +R  S   W +L SGY   G + +A+ L+ +M E GV+PD++ 
Sbjct: 157 SGSGRLNDAHKLFDEIPDR--SVVTWTALFSGYTTSGRHREAIDLFKKMVEMGVKPDSYF 216

Query: 170 FPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIE 229
             +VL AC  +G +  GE + +++       + FV   LV++Y+KCG + +AR VFD + 
Sbjct: 217 IVQVLSACVHVGDLDSGEWIVKYMEEMEMQKNSFVRTTLVNLYAKCGKMEKARSVFDSMV 276

Query: 230 YKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHG 289
            KDIV+W++M+ GY  +    E +++F QM+QE  +PD  ++   LS+ +S+   L +  
Sbjct: 277 EKDIVTWSTMIQGYASNSFPKEGIELFLQMLQENLKPDQFSIVGFLSSCASLG-ALDLGE 336

Query: 290 WVI----RHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSA 349
           W I    RH    NL +AN+LI MYAKCG + R   +F++M +KD+V  N+ IS    + 
Sbjct: 337 WGISLIDRHEFLTNLFMANALIDMYAKCGAMARGFEVFKEMKEKDIVIMNAAISGLAKNG 396

Query: 350 EALTYFEVM---ESLGVSPDGVTFVSLLSTCAHLGLGK------------YGIRPTIEHY 409
                F V    E LG+SPDG TF+ LL  C H GL +            Y ++ T+EHY
Sbjct: 397 HVKLSFAVFGQTEKLGISPDGSTFLGLLCGCVHAGLIQDGLRFFNAISCVYALKRTVEHY 456

Query: 410 ACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELE 469
            CMV+L+GRAGM+++AY++I   M +     +WGALL  C L  D  +AE   + L  LE
Sbjct: 457 GCMVDLWGRAGMLDDAYRLIC-DMPMRPNAIVWGALLSGCRLVKDTQLAETVLKELIALE 516

Query: 470 PDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL 485
           P N  N+  L  IY   GR ++   V+ MM ++G+
Sbjct: 517 PWNAGNYVQLSNIYSVGGRWDEAAEVRDMMNKKGM 547


HSP 2 Score: 170.6 bits (431), Expect = 4.1e-41
Identity = 106/349 (30.37%), Postives = 191/349 (54.73%), Query Frame = 1

Query: 79  IHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLI 138
           ++H  +IH  +  + L  +  + + LL+    F   + ++ +F      N   F +NSLI
Sbjct: 26  VNHLKQIHVSLINHHLHHDTFLVNLLLKRTLFFRQTKYSYLLFSHTQFPNI--FLYNSLI 85

Query: 139 SGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFA 198
           +G+    L+ + L L+  + + G+    FTFP VLKAC    S ++G  +H  VV+ GF 
Sbjct: 86  NGFVNNHLFHETLDLFLSIRKHGLYLHGFTFPLVLKACTRASSRKLGIDLHSLVVKCGFN 145

Query: 199 GDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQM 258
            DV  + +L+ +YS  G +  A K+FD+I  + +V+W ++ +GYT  G H EA+D+F +M
Sbjct: 146 HDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSVVTWTALFSGYTTSGRHREAIDLFKKM 205

Query: 259 IQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEW----NLSIANSLIVMYAKCGK 318
           ++ G +PDS  +  +LS    +   L    W++++  E     N  +  +L+ +YAKCGK
Sbjct: 206 VEMGVKPDSYFIVQVLSACVHVG-DLDSGEWIVKYMEEMEMQKNSFVRTTLVNLYAKCGK 265

Query: 319 LNRAKWLFQQMPQKDMVSWNSIISAHFNSA---EALTYFEVMESLGVSPDGVTFVSLLST 378
           + +A+ +F  M +KD+V+W+++I  + +++   E +  F  M    + PD  + V  LS+
Sbjct: 266 MEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEGIELFLQMLQENLKPDQFSIVGFLSS 325

Query: 379 CAHLG---LGKYGIRPTIEHYACMVNLYGRAGMIEEAYK--IITKGMEI 416
           CA LG   LG++GI   I+ +  + NL+    +I+   K   + +G E+
Sbjct: 326 CASLGALDLGEWGI-SLIDRHEFLTNLFMANALIDMYAKCGAMARGFEV 370

BLAST of Cucsa.251540 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 279.6 bits (714), Expect = 6.3e-74
Identity = 155/447 (34.68%), Postives = 262/447 (58.61%), Query Frame = 1

Query: 58  GLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDA 117
           G  +D  + +SL+ +  Q   +    ++    P     R+V   + L++ YAS GY+E+A
Sbjct: 164 GCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSP----HRDVVSYTALIKGYASRGYIENA 223

Query: 118 HQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACG 177
            ++FDE+  ++    +WN++ISGYAE G Y++AL L+  M +  V PD  T   V+ AC 
Sbjct: 224 QKLFDEIPVKD--VVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACA 283

Query: 178 GIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNS 237
             GSI++G  VH  +   GF  ++ ++NAL+D+YSKCG +  A  +F+++ YKD++SWN+
Sbjct: 284 QSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNT 343

Query: 238 MLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLL---SNISSMKFKLHIHGWVIRH- 297
           ++ GYT   L+ EAL +F +M++ G  P+ V + ++L   +++ ++     IH ++ +  
Sbjct: 344 LIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRL 403

Query: 298 -GVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSII---SAHFNSAEALTY 357
            GV    S+  SLI MYAKCG +  A  +F  +  K + SWN++I   + H  +  +   
Sbjct: 404 KGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDL 463

Query: 358 FEVMESLGVSPDGVTFVSLLSTCAHLG---LGK---------YGIRPTIEHYACMVNLYG 417
           F  M  +G+ PD +TFV LLS C+H G   LG+         Y + P +EHY CM++L G
Sbjct: 464 FSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLG 523

Query: 418 RAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE 477
            +G+ +EA ++I   ME+E    IW +LL AC +H +V++ E  AE L ++EP+N  ++ 
Sbjct: 524 HSGLFKEAEEMINM-MEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYV 583

Query: 478 LLMKIYGNAGRSEDEKRVKLMMAERGL 485
           LL  IY +AGR  +  + + ++ ++G+
Sbjct: 584 LLSNIYASAGRWNEVAKTRALLNDKGM 603


HSP 2 Score: 130.2 bits (326), Expect = 6.2e-29
Identity = 118/460 (25.65%), Postives = 213/460 (46.30%), Query Frame = 1

Query: 68  SLLELCYQLQAIHHGIRIH-RLIPTNLLRRNVGISSKLLR---LYASFGYMEDAHQVFDE 127
           SLL  C  LQ++     IH ++I   L   N  +S KL+    L   F  +  A  VF  
Sbjct: 38  SLLHNCKTLQSLRI---IHAQMIKIGLHNTNYALS-KLIEFCILSPHFEGLPYAISVFKT 97

Query: 128 MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ 187
           +   N     WN++  G+A       AL LY  M   G+ P+++TFP VLK+C    + +
Sbjct: 98  IQEPNL--LIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFK 157

Query: 188 IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT 247
            G+ +H HV++ G   D++V  +L+ MY + G +  A KVFD+  ++D+VS+ +++ GY 
Sbjct: 158 EGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYA 217

Query: 248 RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNIS---SMKFKLHIHGWVIRHGVEWNLS 307
             G    A  +FD++  +    D V+ + ++S  +   + K  L +   +++  V  + S
Sbjct: 218 SRGYIENAQKLFDEIPVK----DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDES 277

Query: 308 IANSLIVMYAKC-----------------------------------GKLNRAKWLFQQM 367
              +++   A+                                    G+L  A  LF+++
Sbjct: 278 TMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERL 337

Query: 368 PQKDMVSWNSIIS--AHFN-SAEALTYFEVMESLGVSPDGVTFVSLLSTCAHLG---LGK 427
           P KD++SWN++I    H N   EAL  F+ M   G +P+ VT +S+L  CAHLG   +G+
Sbjct: 338 PYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGR 397

Query: 428 Y----------GIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYAC 468
           +          G+         ++++Y + G IE A+++      +    + W A+++  
Sbjct: 398 WIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNS--ILHKSLSSWNAMIFGF 457

BLAST of Cucsa.251540 vs. TrEMBL
Match: A0A0A0L5M0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G176270 PE=4 SV=1)

HSP 1 Score: 951.8 bits (2459), Expect = 3.2e-274
Identity = 478/495 (96.57%), Postives = 478/495 (96.57%), Query Frame = 1

Query: 4   HFPLLLQRKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDP 63
           H  LL   KAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDP
Sbjct: 50  HQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDP 109

Query: 64  EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE 123
           EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE
Sbjct: 110 EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE 169

Query: 124 MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ 183
           MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ
Sbjct: 170 MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ 229

Query: 184 IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT 243
           IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT
Sbjct: 230 IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT 289

Query: 244 RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIAN 303
           RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIAN
Sbjct: 290 RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIAN 349

Query: 304 SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGV 363
           SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGV
Sbjct: 350 SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGV 409

Query: 364 TFVSLLSTCAHLGL------------GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITK 423
           TFVSLLSTCAHLGL            GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITK
Sbjct: 410 TFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITK 469

Query: 424 GMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSED 483
           GMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSED
Sbjct: 470 GMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSED 529

Query: 484 EKRVKLMMAERGLNS 487
           EKRVKLMMAERGLNS
Sbjct: 530 EKRVKLMMAERGLNS 544

BLAST of Cucsa.251540 vs. TrEMBL
Match: M5XGA4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa019039mg PE=4 SV=1)

HSP 1 Score: 670.6 bits (1729), Expect = 1.4e-189
Identity = 332/485 (68.45%), Postives = 391/485 (80.62%), Query Frame = 1

Query: 14  KQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSLLELC 73
           + +  LSFPK+ PTPL+I  KP  Q+K+QALDAV+ DLEA+I  G+ +D E F+SLLE C
Sbjct: 35  QSNNSLSFPKTIPTPLIICHKPHSQTKLQALDAVVNDLEAAIGKGINVDTETFASLLETC 94

Query: 74  YQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFA 133
           YQ QA+ +G+R+HRLIP ++LRRNVGISSKLLRLYAS GY+E+AHQVFDEM  R+ SAFA
Sbjct: 95  YQFQAMDYGLRVHRLIPRSVLRRNVGISSKLLRLYASHGYIEEAHQVFDEMPKRDVSAFA 154

Query: 134 WNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVV 193
           WNSLISGYAELGLYEDA+ALYFQMEEEGVEPD FTFPRVLKACGGIG IQIGEAVHRH+V
Sbjct: 155 WNSLISGYAELGLYEDAMALYFQMEEEGVEPDRFTFPRVLKACGGIGFIQIGEAVHRHIV 214

Query: 194 RSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALD 253
           R G   D FVLNALVDMY+KCG IV+ARKVFD+I  +D VSWN+MLT Y RHGL  +ALD
Sbjct: 215 RLGLLNDRFVLNALVDMYAKCGDIVKARKVFDKITSRDHVSWNTMLTSYMRHGLLSQALD 274

Query: 254 IFDQMIQEGYEPDSVALSTLLSNI-SSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKC 313
           IF +M+ EG++ DSVA+ST+L    SS++  + IHGWVIR GVEWNLSIAN+LI  Y+  
Sbjct: 275 IFHEMLHEGHQADSVAISTILGAAESSLEIVIQIHGWVIRQGVEWNLSIANALIAAYSNH 334

Query: 314 GKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTC 373
            KLNRA+WLF  M ++D+++WN++ISAH  S EAL +FE MES G  PD +TFVS+LSTC
Sbjct: 335 RKLNRARWLFCHMSERDVITWNTMISAHSKSPEALLFFEQMESSGALPDSITFVSILSTC 394

Query: 374 AHLGL------------GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPT 433
           AHLGL             +Y I P +EHYACMVNLYGRAG I EAY II  GME EAGPT
Sbjct: 395 AHLGLVKDGERLYSVMKNRYRISPIMEHYACMVNLYGRAGRIREAYGIIVDGMEFEAGPT 454

Query: 434 IWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMA 486
           +WGALLYACYLH +VDI E+AAERLFELEPDNE NFELL+KIYGN GR ED +RV+LMM 
Sbjct: 455 VWGALLYACYLHGNVDIGEVAAERLFELEPDNEYNFELLIKIYGNVGRLEDVERVRLMMV 514

BLAST of Cucsa.251540 vs. TrEMBL
Match: D7T277_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_16s0022g00970 PE=4 SV=1)

HSP 1 Score: 666.0 bits (1717), Expect = 3.5e-188
Identity = 330/481 (68.61%), Postives = 393/481 (81.70%), Query Frame = 1

Query: 18  DLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSLLELCYQLQ 77
           +L FPKSSPTPLLI+ KP   +K+QAL+A+L DL+ASI +G+ +D +IFSSLLE C+QLQ
Sbjct: 34  NLVFPKSSPTPLLINHKPRNHTKLQALEALLRDLQASIQDGITVDAQIFSSLLETCFQLQ 93

Query: 78  AIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSL 137
           A  HGIRIHRLIPT+LLR++V +SSKLLRLYAS G +E+AH++FD+M  RN SAFAWNSL
Sbjct: 94  AFDHGIRIHRLIPTSLLRKSVALSSKLLRLYASIGRIEEAHRLFDQMSRRNRSAFAWNSL 153

Query: 138 ISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGF 197
           ISGYAELGLYEDA+ALYFQMEEEGV PD FTFPRVLKACGGIGSI +GE VHRHVVR GF
Sbjct: 154 ISGYAELGLYEDAMALYFQMEEEGVVPDRFTFPRVLKACGGIGSISVGEEVHRHVVRCGF 213

Query: 198 AGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQ 257
           A D FVLNALVDMY+KCG IV+ARKVFD+I  +D VSWNSMLTGY RHGL  +AL IF +
Sbjct: 214 ADDGFVLNALVDMYAKCGDIVKARKVFDKIVCRDSVSWNSMLTGYIRHGLPLQALSIFRR 273

Query: 258 MIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNR 317
           M+Q G+EPD+VA+ST+++ + S+K    IHGWV+R GV+WNLSIANSLIV+Y+  GKL++
Sbjct: 274 MLQYGFEPDAVAISTVVTGVPSLKLAGQIHGWVLRRGVQWNLSIANSLIVLYSNHGKLDQ 333

Query: 318 AKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCAHLGL 377
           A WLF  MP++D+VSWNSIISAH    +A+TYF  M+   V PD VTFVSLLS CAHLGL
Sbjct: 334 ACWLFDHMPERDVVSWNSIISAHRKDLKAITYFSRMQKADVLPDVVTFVSLLSACAHLGL 393

Query: 378 GK------------YGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGAL 437
            K            YG+ P++EHYACMVNLYGRAG+IEEAY+II K ME EAGPT+WGAL
Sbjct: 394 VKDGEGLFSMMREDYGMIPSMEHYACMVNLYGRAGLIEEAYEIIEKRMEFEAGPTVWGAL 453

Query: 438 LYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGLN 487
           LYACY H +VDI +IAAE LFELEPDNE NFELLM IY N GR ED ++V+ MMA+RG +
Sbjct: 454 LYACYFHHNVDIGKIAAECLFELEPDNEHNFELLMNIYRNVGRLEDVEKVRKMMADRGFD 513

BLAST of Cucsa.251540 vs. TrEMBL
Match: A0A067JQW8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26236 PE=4 SV=1)

HSP 1 Score: 656.4 bits (1692), Expect = 2.8e-185
Identity = 322/483 (66.67%), Postives = 388/483 (80.33%), Query Frame = 1

Query: 14  KQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSLLELC 73
           +  + LSFP SSPTPLLI  KP+  +K++AL+ V+ D+E+S++ G+ ID +IFSSLLE C
Sbjct: 42  RNGSTLSFPNSSPTPLLIKQKPYTLTKLEALENVVKDIESSVEKGIKIDTQIFSSLLETC 101

Query: 74  YQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFA 133
           YQL AI HGIRIH+LIPTNLLR+N GISSKLLRLYAS G M++AHQ+FD+M  R+ SAFA
Sbjct: 102 YQLDAIGHGIRIHQLIPTNLLRKNTGISSKLLRLYASCGQMDEAHQLFDQMCKRDESAFA 161

Query: 134 WNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVV 193
           WNSLI+GYAELGLYEDA+ALYFQMEEEGVEPD FTFPRVLK CGG+G IQ+GEAVHR +V
Sbjct: 162 WNSLIAGYAELGLYEDAIALYFQMEEEGVEPDQFTFPRVLKVCGGLGMIQVGEAVHRDIV 221

Query: 194 RSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALD 253
           R G A D FVLNALVDMY+KCG IV+AR++FD+I  K  VSWNSMLTGY RHGL  EAL 
Sbjct: 222 RLGLANDEFVLNALVDMYAKCGDIVKARRIFDKISCKVSVSWNSMLTGYLRHGLLVEALA 281

Query: 254 IFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCG 313
           IF  M+Q G E DSVA+S++L+ ++S+K  + IHGW++R G+EW+L IANSLIV+Y+  G
Sbjct: 282 IFRSMLQAGVELDSVAISSVLAKVTSLKLGVQIHGWILRRGMEWDLCIANSLIVVYSSNG 341

Query: 314 KLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCA 373
           KL++A+WLF  M ++D+VSWNSIISAH    E LTYFE ME  GV PD +TFVS+LS CA
Sbjct: 342 KLDQARWLFDNMLERDVVSWNSIISAHHKDPEVLTYFERMEKDGVLPDNITFVSILSACA 401

Query: 374 HLGL------------GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTI 433
           HLGL             KYGI+P +EHYACMVNLYGRAG+I++AY II   ME +AGPT 
Sbjct: 402 HLGLVTDGERLFSLMSEKYGIKPIMEHYACMVNLYGRAGLIKDAYAIIANKMEFDAGPTA 461

Query: 434 WGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAE 485
           WGALLYACYLH +VDI EIAA+ LFELEPDN+ NFELLMKIY +AGR ED K+VK MM +
Sbjct: 462 WGALLYACYLHGNVDIGEIAAQSLFELEPDNKHNFELLMKIYSDAGRLEDVKKVKTMMVD 521

BLAST of Cucsa.251540 vs. TrEMBL
Match: A0A061FZF0_THECC (Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_014953 PE=3 SV=1)

HSP 1 Score: 653.7 bits (1685), Expect = 1.8e-184
Identity = 323/482 (67.01%), Postives = 378/482 (78.42%), Query Frame = 1

Query: 15  QSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSLLELCY 74
           +ST L F KSSPTPLLI+ KPF Q+K+QALDAV+ DLEAS+ NG+ I  EIFSSLLE CY
Sbjct: 292 KSTALPFRKSSPTPLLINHKPFTQTKLQALDAVVKDLEASVKNGMNITSEIFSSLLETCY 351

Query: 75  QLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAW 134
           QL++I  GI+IH L+P  LLR+N GISSKLLRLYAS G++E AHQVFDEM  RN SAF W
Sbjct: 352 QLKSIDQGIKIHNLVPKTLLRKNTGISSKLLRLYASCGHIESAHQVFDEMSKRNESAFPW 411

Query: 135 NSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVR 194
           NSLISGYAELG YEDALA+YFQMEEEGVEPD +TFPR LKAC GIG IQIGEAVHR VVR
Sbjct: 412 NSLISGYAELGQYEDALAIYFQMEEEGVEPDRYTFPRALKACAGIGLIQIGEAVHRDVVR 471

Query: 195 SGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDI 254
            GF  D FVLNAL+DMY+KCG IV+AR+VFD I  KD VSWNSMLTGY RHGL  EAL++
Sbjct: 472 KGFGNDGFVLNALIDMYAKCGDIVKARRVFDNIACKDTVSWNSMLTGYIRHGLLVEALEV 531

Query: 255 FDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGK 314
           F  MI+EGYEPD VA+ST+LS + S+K  L IHGW++R G EWNLS+ N+LIV+Y+  GK
Sbjct: 532 FRGMIREGYEPDPVAMSTILSGVWSLKIALQIHGWILRRGNEWNLSVVNALIVVYSNHGK 591

Query: 315 LNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCAH 374
           L+RA WLF ++P+ D+VSWNSIIS H    EAL YFE M S G  PD +TFV++LS CAH
Sbjct: 592 LDRASWLFHRIPEPDVVSWNSIISGHSKRPEALVYFEQMVSGGTLPDSITFVAILSACAH 651

Query: 375 LGL------------GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIW 434
           LG              KY I P +EHYACMVNLYGRAG+I+EA+ +I + ME EAGPT+W
Sbjct: 652 LGFVRDGEQLFSLMRKKYAINPIMEHYACMVNLYGRAGLIDEAFTLIVERMEFEAGPTVW 711

Query: 435 GALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAER 485
           GALL+AC +H  +D+ EIAA+ LFELEPDNE NFELL KIY NAGR ED +RV  MM +R
Sbjct: 712 GALLHACSVHGHIDVGEIAAQNLFELEPDNEHNFELLKKIYSNAGRLEDVERVSKMMLDR 771

BLAST of Cucsa.251540 vs. TAIR10
Match: AT4G25270.1 (AT4G25270.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 615.1 bits (1585), Expect = 3.6e-176
Identity = 301/483 (62.32%), Postives = 373/483 (77.23%), Query Frame = 1

Query: 17  TDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFI-DPEIFSSLLELCYQ 76
           T LSF K SPTPLLI  +   +++++ALD+V+TDLE S   G+ + +PEIF+SLLE CY 
Sbjct: 45  TSLSFTKPSPTPLLIEKQSIHRTQLEALDSVITDLETSAQKGISLTEPEIFASLLETCYS 104

Query: 77  LQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWN 136
           L+AI HG+R+H LIP  LLR N+GISSKL+RLYAS GY E AH+VFD M  R+ S FAWN
Sbjct: 105 LRAIDHGVRVHHLIPPYLLRNNLGISSKLVRLYASCGYAEVAHEVFDRMSKRDSSPFAWN 164

Query: 137 SLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRS 196
           SLISGYAELG YEDA+ALYFQM E+GV+PD FTFPRVLKACGGIGS+QIGEA+HR +V+ 
Sbjct: 165 SLISGYAELGQYEDAMALYFQMAEDGVKPDRFTFPRVLKACGGIGSVQIGEAIHRDLVKE 224

Query: 197 GFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIF 256
           GF  DV+VLNALV MY+KCG IV+AR VFD I +KD VSWNSMLTGY  HGL  EALDIF
Sbjct: 225 GFGYDVYVLNALVVMYAKCGDIVKARNVFDMIPHKDYVSWNSMLTGYLHHGLLHEALDIF 284

Query: 257 DQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKL 316
             M+Q G EPD VA+S++L+ + S K    +HGWVIR G+EW LS+AN+LIV+Y+K G+L
Sbjct: 285 RLMVQNGIEPDKVAISSVLARVLSFKHGRQLHGWVIRRGMEWELSVANALIVLYSKRGQL 344

Query: 317 NRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCAHL 376
            +A ++F QM ++D VSWN+IISAH  ++  L YFE M      PDG+TFVS+LS CA+ 
Sbjct: 345 GQACFIFDQMLERDTVSWNAIISAHSKNSNGLKYFEQMHRANAKPDGITFVSVLSLCANT 404

Query: 377 GL------------GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWG 436
           G+             +YGI P +EHYACMVNLYGRAGM+EEAY +I + M +EAGPT+WG
Sbjct: 405 GMVEDGERLFSLMSKEYGIDPKMEHYACMVNLYGRAGMMEEAYSMIVQEMGLEAGPTVWG 464

Query: 437 ALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERG 487
           ALLYACYLH + DI E+AA+RLFELEPDNE NFELL++IY  A R+ED +RV+ MM +RG
Sbjct: 465 ALLYACYLHGNTDIGEVAAQRLFELEPDNEHNFELLIRIYSKAKRAEDVERVRQMMVDRG 524

BLAST of Cucsa.251540 vs. TAIR10
Match: AT3G46790.1 (AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 298.9 bits (764), Expect = 5.7e-81
Identity = 164/445 (36.85%), Postives = 269/445 (60.45%), Query Frame = 1

Query: 64  EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE 123
           + +  L+  C    ++   +R+HR I  N   ++  +++KL+ +Y+  G ++ A +VFD+
Sbjct: 78  QTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGSVDYARKVFDK 137

Query: 124 MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGG----I 183
              R    + WN+L       G  E+ L LY++M   GVE D FT+  VLKAC      +
Sbjct: 138 TRKRTI--YVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVASECTV 197

Query: 184 GSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSML 243
             +  G+ +H H+ R G++  V+++  LVDMY++ GC+  A  VF  +  +++VSW++M+
Sbjct: 198 NHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMI 257

Query: 244 TGYTRHGLHFEALDIFDQMIQEGYE--PDSVALSTLL---SNISSMKFKLHIHGWVIRHG 303
             Y ++G  FEAL  F +M++E  +  P+SV + ++L   +++++++    IHG+++R G
Sbjct: 258 ACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRG 317

Query: 304 VEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTYFE 363
           ++  L + ++L+ MY +CGKL   + +F +M  +D+VSWNS+IS+   H    +A+  FE
Sbjct: 318 LDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQIFE 377

Query: 364 VMESLGVSPDGVTFVSLLSTCAHLGL---GK---------YGIRPTIEHYACMVNLYGRA 423
            M + G SP  VTFVS+L  C+H GL   GK         +GI+P IEHYACMV+L GRA
Sbjct: 378 EMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRA 437

Query: 424 GMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELL 483
             ++EA K++ + M  E GP +WG+LL +C +H +V++AE A+ RLF LEP N  N+ LL
Sbjct: 438 NRLDEAAKMV-QDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLL 497

Query: 484 MKIYGNAGRSEDEKRVKLMMAERGL 485
             IY  A   ++ KRVK ++  RGL
Sbjct: 498 ADIYAEAQMWDEVKRVKKLLEHRGL 519

BLAST of Cucsa.251540 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 285.8 bits (730), Expect = 5.0e-77
Identity = 164/447 (36.69%), Postives = 252/447 (56.38%), Query Frame = 1

Query: 57  NGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMED 116
           +G+ ID     S+   C   + I  G  +H +       R     + LL +Y+  G ++ 
Sbjct: 290 SGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDS 349

Query: 117 AHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKAC 176
           A  VF EM +R  S  ++ S+I+GYA  GL  +A+ L+ +MEEEG+ PD +T   VL  C
Sbjct: 350 AKAVFREMSDR--SVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCC 409

Query: 177 GGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWN 236
                +  G+ VH  +  +    D+FV NAL+DMY+KCG +  A  VF ++  KDI+SWN
Sbjct: 410 ARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWN 469

Query: 237 SMLTGYTRHGLHFEALDIFDQMIQE-GYEPDSVALSTLL---SNISSMKFKLHIHGWVIR 296
           +++ GY+++    EAL +F+ +++E  + PD   ++ +L   +++S+      IHG+++R
Sbjct: 470 TIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMR 529

Query: 297 HGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTY 356
           +G   +  +ANSL+ MYAKCG L  A  LF  +  KD+VSW  +I+    H    EA+  
Sbjct: 530 NGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIAL 589

Query: 357 FEVMESLGVSPDGVTFVSLLSTCAHLGLGKYG------------IRPTIEHYACMVNLYG 416
           F  M   G+  D ++FVSLL  C+H GL   G            I PT+EHYAC+V++  
Sbjct: 590 FNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLA 649

Query: 417 RAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE 476
           R G + +AY+ I + M I    TIWGALL  C +H DV +AE  AE++FELEP+N   + 
Sbjct: 650 RTGDLIKAYRFI-ENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYV 709

Query: 477 LLMKIYGNAGRSEDEKRVKLMMAERGL 485
           L+  IY  A + E  KR++  + +RGL
Sbjct: 710 LMANIYAEAEKWEQVKRLRKRIGQRGL 733


HSP 2 Score: 187.2 bits (474), Expect = 2.4e-47
Identity = 106/328 (32.32%), Postives = 181/328 (55.18%), Query Frame = 1

Query: 55  IDNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYM 114
           + +G+ +D   FS + +    L+++H G ++H  I  +       + + L+  Y     +
Sbjct: 187 MSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRV 246

Query: 115 EDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLK 174
           + A +VFDEM  R+    +WNS+I+GY   GL E  L+++ QM   G+E D  T   V  
Sbjct: 247 DSARKVFDEMTERD--VISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFA 306

Query: 175 ACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVS 234
            C     I +G AVH   V++ F+ +    N L+DMYSKCG +  A+ VF ++  + +VS
Sbjct: 307 GCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVS 366

Query: 235 WNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKF---KLHIHGWVI 294
           + SM+ GY R GL  EA+ +F++M +EG  PD   ++ +L+  +  +       +H W+ 
Sbjct: 367 YTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIK 426

Query: 295 RHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSA---EALT 354
            + + +++ ++N+L+ MYAKCG +  A+ +F +M  KD++SWN+II  +  +    EAL+
Sbjct: 427 ENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALS 486

Query: 355 YFE-VMESLGVSPDGVTFVSLLSTCAHL 376
            F  ++E    SPD  T   +L  CA L
Sbjct: 487 LFNLLLEEKRFSPDERTVACVLPACASL 512


HSP 3 Score: 179.1 bits (453), Expect = 6.6e-45
Identity = 117/402 (29.10%), Postives = 207/402 (51.49%), Query Frame = 1

Query: 61  IDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQV 120
           IDP    S+L+LC   +++  G  +   I  N    +  + SKL  +Y + G +++A +V
Sbjct: 92  IDPRTLCSVLQLCADSKSLKDGKEVDNFIRGNGFVIDSNLGSKLSLMYTNCGDLKEASRV 151

Query: 121 FDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIG 180
           FDE+  +   A  WN L++  A+ G +  ++ L+ +M   GVE D++TF  V K+   + 
Sbjct: 152 FDEV--KIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLR 211

Query: 181 SIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLT 240
           S+  GE +H  +++SGF     V N+LV  Y K   +  ARKVFD++  +D++SWNS++ 
Sbjct: 212 SVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIIN 271

Query: 241 GYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISS---MKFKLHIHGWVIRHGVEW 300
           GY  +GL  + L +F QM+  G E D   + ++ +  +    +     +H   ++     
Sbjct: 272 GYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSR 331

Query: 301 NLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFN---SAEALTYFEVME 360
                N+L+ MY+KCG L+ AK +F++M  + +VS+ S+I+ +     + EA+  FE ME
Sbjct: 332 EDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEME 391

Query: 361 SLGVSPDGVTFVSLLSTCAHLGLGKYGIRP-----------TIEHYACMVNLYGRAGMIE 420
             G+SPD  T  ++L+ CA   L   G R             I     ++++Y + G ++
Sbjct: 392 EEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQ 451

Query: 421 EAYKIIT--KGMEIEAGPTIWGALLYACYLHSDVDIAEIAAE 444
           EA  + +  +  +I +  TI G     CY +  + +  +  E
Sbjct: 452 EAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLE 491

BLAST of Cucsa.251540 vs. TAIR10
Match: AT3G08820.1 (AT3G08820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 282.7 bits (722), Expect = 4.2e-76
Identity = 165/455 (36.26%), Postives = 256/455 (56.26%), Query Frame = 1

Query: 50  DLEASI-DNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLY 109
           DL  SI  +GL++    F  +L+ C +  +   GI +H L+       +V   + LL +Y
Sbjct: 97  DLFLSIRKHGLYLHGFTFPLVLKACTRASSRKLGIDLHSLVVKCGFNHDVAAMTSLLSIY 156

Query: 110 ASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFT 169
           +  G + DAH++FDE+ +R  S   W +L SGY   G + +A+ L+ +M E GV+PD++ 
Sbjct: 157 SGSGRLNDAHKLFDEIPDR--SVVTWTALFSGYTTSGRHREAIDLFKKMVEMGVKPDSYF 216

Query: 170 FPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIE 229
             +VL AC  +G +  GE + +++       + FV   LV++Y+KCG + +AR VFD + 
Sbjct: 217 IVQVLSACVHVGDLDSGEWIVKYMEEMEMQKNSFVRTTLVNLYAKCGKMEKARSVFDSMV 276

Query: 230 YKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHG 289
            KDIV+W++M+ GY  +    E +++F QM+QE  +PD  ++   LS+ +S+   L +  
Sbjct: 277 EKDIVTWSTMIQGYASNSFPKEGIELFLQMLQENLKPDQFSIVGFLSSCASLG-ALDLGE 336

Query: 290 WVI----RHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSA 349
           W I    RH    NL +AN+LI MYAKCG + R   +F++M +KD+V  N+ IS    + 
Sbjct: 337 WGISLIDRHEFLTNLFMANALIDMYAKCGAMARGFEVFKEMKEKDIVIMNAAISGLAKNG 396

Query: 350 EALTYFEVM---ESLGVSPDGVTFVSLLSTCAHLGLGK------------YGIRPTIEHY 409
                F V    E LG+SPDG TF+ LL  C H GL +            Y ++ T+EHY
Sbjct: 397 HVKLSFAVFGQTEKLGISPDGSTFLGLLCGCVHAGLIQDGLRFFNAISCVYALKRTVEHY 456

Query: 410 ACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELE 469
            CMV+L+GRAGM+++AY++I   M +     +WGALL  C L  D  +AE   + L  LE
Sbjct: 457 GCMVDLWGRAGMLDDAYRLIC-DMPMRPNAIVWGALLSGCRLVKDTQLAETVLKELIALE 516

Query: 470 PDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL 485
           P N  N+  L  IY   GR ++   V+ MM ++G+
Sbjct: 517 PWNAGNYVQLSNIYSVGGRWDEAAEVRDMMNKKGM 547


HSP 2 Score: 170.6 bits (431), Expect = 2.3e-42
Identity = 106/349 (30.37%), Postives = 191/349 (54.73%), Query Frame = 1

Query: 79  IHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLI 138
           ++H  +IH  +  + L  +  + + LL+    F   + ++ +F      N   F +NSLI
Sbjct: 26  VNHLKQIHVSLINHHLHHDTFLVNLLLKRTLFFRQTKYSYLLFSHTQFPNI--FLYNSLI 85

Query: 139 SGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFA 198
           +G+    L+ + L L+  + + G+    FTFP VLKAC    S ++G  +H  VV+ GF 
Sbjct: 86  NGFVNNHLFHETLDLFLSIRKHGLYLHGFTFPLVLKACTRASSRKLGIDLHSLVVKCGFN 145

Query: 199 GDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQM 258
            DV  + +L+ +YS  G +  A K+FD+I  + +V+W ++ +GYT  G H EA+D+F +M
Sbjct: 146 HDVAAMTSLLSIYSGSGRLNDAHKLFDEIPDRSVVTWTALFSGYTTSGRHREAIDLFKKM 205

Query: 259 IQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEW----NLSIANSLIVMYAKCGK 318
           ++ G +PDS  +  +LS    +   L    W++++  E     N  +  +L+ +YAKCGK
Sbjct: 206 VEMGVKPDSYFIVQVLSACVHVG-DLDSGEWIVKYMEEMEMQKNSFVRTTLVNLYAKCGK 265

Query: 319 LNRAKWLFQQMPQKDMVSWNSIISAHFNSA---EALTYFEVMESLGVSPDGVTFVSLLST 378
           + +A+ +F  M +KD+V+W+++I  + +++   E +  F  M    + PD  + V  LS+
Sbjct: 266 MEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEGIELFLQMLQENLKPDQFSIVGFLSS 325

Query: 379 CAHLG---LGKYGIRPTIEHYACMVNLYGRAGMIEEAYK--IITKGMEI 416
           CA LG   LG++GI   I+ +  + NL+    +I+   K   + +G E+
Sbjct: 326 CASLGALDLGEWGI-SLIDRHEFLTNLFMANALIDMYAKCGAMARGFEV 370

BLAST of Cucsa.251540 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 279.6 bits (714), Expect = 3.6e-75
Identity = 155/447 (34.68%), Postives = 262/447 (58.61%), Query Frame = 1

Query: 58  GLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDA 117
           G  +D  + +SL+ +  Q   +    ++    P     R+V   + L++ YAS GY+E+A
Sbjct: 164 GCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSP----HRDVVSYTALIKGYASRGYIENA 223

Query: 118 HQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACG 177
            ++FDE+  ++    +WN++ISGYAE G Y++AL L+  M +  V PD  T   V+ AC 
Sbjct: 224 QKLFDEIPVKD--VVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACA 283

Query: 178 GIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNS 237
             GSI++G  VH  +   GF  ++ ++NAL+D+YSKCG +  A  +F+++ YKD++SWN+
Sbjct: 284 QSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNT 343

Query: 238 MLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLL---SNISSMKFKLHIHGWVIRH- 297
           ++ GYT   L+ EAL +F +M++ G  P+ V + ++L   +++ ++     IH ++ +  
Sbjct: 344 LIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRL 403

Query: 298 -GVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSII---SAHFNSAEALTY 357
            GV    S+  SLI MYAKCG +  A  +F  +  K + SWN++I   + H  +  +   
Sbjct: 404 KGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDL 463

Query: 358 FEVMESLGVSPDGVTFVSLLSTCAHLG---LGK---------YGIRPTIEHYACMVNLYG 417
           F  M  +G+ PD +TFV LLS C+H G   LG+         Y + P +EHY CM++L G
Sbjct: 464 FSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLG 523

Query: 418 RAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE 477
            +G+ +EA ++I   ME+E    IW +LL AC +H +V++ E  AE L ++EP+N  ++ 
Sbjct: 524 HSGLFKEAEEMINM-MEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYV 583

Query: 478 LLMKIYGNAGRSEDEKRVKLMMAERGL 485
           LL  IY +AGR  +  + + ++ ++G+
Sbjct: 584 LLSNIYASAGRWNEVAKTRALLNDKGM 603


HSP 2 Score: 130.2 bits (326), Expect = 3.5e-30
Identity = 118/460 (25.65%), Postives = 213/460 (46.30%), Query Frame = 1

Query: 68  SLLELCYQLQAIHHGIRIH-RLIPTNLLRRNVGISSKLLR---LYASFGYMEDAHQVFDE 127
           SLL  C  LQ++     IH ++I   L   N  +S KL+    L   F  +  A  VF  
Sbjct: 38  SLLHNCKTLQSLRI---IHAQMIKIGLHNTNYALS-KLIEFCILSPHFEGLPYAISVFKT 97

Query: 128 MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ 187
           +   N     WN++  G+A       AL LY  M   G+ P+++TFP VLK+C    + +
Sbjct: 98  IQEPNL--LIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFK 157

Query: 188 IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT 247
            G+ +H HV++ G   D++V  +L+ MY + G +  A KVFD+  ++D+VS+ +++ GY 
Sbjct: 158 EGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYA 217

Query: 248 RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNIS---SMKFKLHIHGWVIRHGVEWNLS 307
             G    A  +FD++  +    D V+ + ++S  +   + K  L +   +++  V  + S
Sbjct: 218 SRGYIENAQKLFDEIPVK----DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDES 277

Query: 308 IANSLIVMYAKC-----------------------------------GKLNRAKWLFQQM 367
              +++   A+                                    G+L  A  LF+++
Sbjct: 278 TMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERL 337

Query: 368 PQKDMVSWNSIIS--AHFN-SAEALTYFEVMESLGVSPDGVTFVSLLSTCAHLG---LGK 427
           P KD++SWN++I    H N   EAL  F+ M   G +P+ VT +S+L  CAHLG   +G+
Sbjct: 338 PYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGR 397

Query: 428 Y----------GIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYAC 468
           +          G+         ++++Y + G IE A+++      +    + W A+++  
Sbjct: 398 WIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNS--ILHKSLSSWNAMIFGF 457

BLAST of Cucsa.251540 vs. NCBI nr
Match: gi|778688525|ref|XP_011652769.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucumis sativus])

HSP 1 Score: 951.8 bits (2459), Expect = 4.6e-274
Identity = 478/495 (96.57%), Postives = 478/495 (96.57%), Query Frame = 1

Query: 4   HFPLLLQRKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDP 63
           H  LL   KAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDP
Sbjct: 299 HQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDP 358

Query: 64  EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE 123
           EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE
Sbjct: 359 EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE 418

Query: 124 MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ 183
           MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ
Sbjct: 419 MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ 478

Query: 184 IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT 243
           IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT
Sbjct: 479 IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT 538

Query: 244 RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIAN 303
           RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIAN
Sbjct: 539 RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIAN 598

Query: 304 SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGV 363
           SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGV
Sbjct: 599 SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGV 658

Query: 364 TFVSLLSTCAHLGL------------GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITK 423
           TFVSLLSTCAHLGL            GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITK
Sbjct: 659 TFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITK 718

Query: 424 GMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSED 483
           GMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSED
Sbjct: 719 GMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSED 778

Query: 484 EKRVKLMMAERGLNS 487
           EKRVKLMMAERGLNS
Sbjct: 779 EKRVKLMMAERGLNS 793

BLAST of Cucsa.251540 vs. NCBI nr
Match: gi|700202147|gb|KGN57280.1| (hypothetical protein Csa_3G176270 [Cucumis sativus])

HSP 1 Score: 951.8 bits (2459), Expect = 4.6e-274
Identity = 478/495 (96.57%), Postives = 478/495 (96.57%), Query Frame = 1

Query: 4   HFPLLLQRKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDP 63
           H  LL   KAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDP
Sbjct: 50  HQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDP 109

Query: 64  EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE 123
           EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE
Sbjct: 110 EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE 169

Query: 124 MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ 183
           MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ
Sbjct: 170 MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ 229

Query: 184 IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT 243
           IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT
Sbjct: 230 IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT 289

Query: 244 RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIAN 303
           RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIAN
Sbjct: 290 RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIAN 349

Query: 304 SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGV 363
           SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGV
Sbjct: 350 SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGV 409

Query: 364 TFVSLLSTCAHLGL------------GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITK 423
           TFVSLLSTCAHLGL            GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITK
Sbjct: 410 TFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITK 469

Query: 424 GMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSED 483
           GMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSED
Sbjct: 470 GMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSED 529

Query: 484 EKRVKLMMAERGLNS 487
           EKRVKLMMAERGLNS
Sbjct: 530 EKRVKLMMAERGLNS 544

BLAST of Cucsa.251540 vs. NCBI nr
Match: gi|659077243|ref|XP_008439101.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucumis melo])

HSP 1 Score: 909.1 bits (2348), Expect = 3.4e-261
Identity = 458/495 (92.53%), Postives = 465/495 (93.94%), Query Frame = 1

Query: 4   HFPLLLQRKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDP 63
           H  LL   KAKQSTDLSFPKSS TPLLIH KPFFQSKIQALDAVLTDLE SIDNGL IDP
Sbjct: 287 HQKLLRISKAKQSTDLSFPKSSSTPLLIHSKPFFQSKIQALDAVLTDLETSIDNGLLIDP 346

Query: 64  EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDE 123
           EIFSSLLELCYQL+AIHHGIRIHRLIPTNLLRRNVGISSKLLRLYAS GYMEDAHQVFDE
Sbjct: 347 EIFSSLLELCYQLRAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASSGYMEDAHQVFDE 406

Query: 124 MGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQ 183
           MG RNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTFPRVLKACGGIGSIQ
Sbjct: 407 MGKRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIQ 466

Query: 184 IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYT 243
           IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQI YKDIVSWNSMLTGYT
Sbjct: 467 IGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIVYKDIVSWNSMLTGYT 526

Query: 244 RHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIAN 303
           RHGLHFEALDIFDQMIQEGY+PDSVALSTLLSNI S+KFKLHIHGWVIRHGVEWNLSIAN
Sbjct: 527 RHGLHFEALDIFDQMIQEGYKPDSVALSTLLSNILSLKFKLHIHGWVIRHGVEWNLSIAN 586

Query: 304 SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGV 363
           SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFN+ EALTYFEVMESLGV PD V
Sbjct: 587 SLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNTTEALTYFEVMESLGVLPDRV 646

Query: 364 TFVSLLSTCAHLGL------------GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITK 423
           TFVSLLSTCAHLGL            GKY IRPTIEHYACMVNLYGRAGMIEEAYKIITK
Sbjct: 647 TFVSLLSTCAHLGLVKEGGRLYSLMKGKYRIRPTIEHYACMVNLYGRAGMIEEAYKIITK 706

Query: 424 GMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSED 483
           GMEIEAGPTIWGALLYACYLH +VDIAEIAAERLFELEPDNELNFELLMKIYGNAGRS+D
Sbjct: 707 GMEIEAGPTIWGALLYACYLHRNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSDD 766

Query: 484 EKRVKLMMAERGLNS 487
           EKRVKLMMAERGLNS
Sbjct: 767 EKRVKLMMAERGLNS 781

BLAST of Cucsa.251540 vs. NCBI nr
Match: gi|1009123035|ref|XP_015878328.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 684.9 bits (1766), Expect = 1.0e-193
Identity = 332/490 (67.76%), Postives = 402/490 (82.04%), Query Frame = 1

Query: 8   LLQRKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFS 67
           L + K   ++ +SF K SPTPLL+  +   Q+K++ALD V+ D+EA +DNG+ +D EIFS
Sbjct: 291 LNKNKNNNASAVSFLKPSPTPLLLIKQKPSQTKLEALDVVVKDIEALVDNGIDVDVEIFS 350

Query: 68  SLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNR 127
           SLLE CY+LQAI +G+RIHRLIP NLLR+NVG+SSKL+RLYA+ GY++ AH+VFD+M  R
Sbjct: 351 SLLETCYRLQAIQYGVRIHRLIPANLLRKNVGLSSKLVRLYAACGYVDKAHEVFDQMSKR 410

Query: 128 NFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEA 187
           + SAFAWNSLISG+AELGLYEDALALYFQMEEEGVEPD +TFPRVLKAC G+G IQIG A
Sbjct: 411 DSSAFAWNSLISGHAELGLYEDALALYFQMEEEGVEPDRYTFPRVLKACAGVGFIQIGVA 470

Query: 188 VHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGL 247
           VHRH+VRSGF  D FVLNALVDMY+KCG IV+ARKVFD+I   D+VSWNSMLTGYTRHGL
Sbjct: 471 VHRHIVRSGFLDDGFVLNALVDMYAKCGDIVKARKVFDKIASPDLVSWNSMLTGYTRHGL 530

Query: 248 HFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIV 307
             EALD+F QM+Q+GY+PDS+A+S +LS ++S+K  + IHGW +RHGVEWNLSIANSLIV
Sbjct: 531 VLEALDLFCQMLQQGYQPDSIAISAILSGVTSLKLGVQIHGWAVRHGVEWNLSIANSLIV 590

Query: 308 MYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVS 367
           MY+  GKL +A+WLF+ MP++D+VSWNSIISAH    EAL +FE ME  GV PD +TFVS
Sbjct: 591 MYSCHGKLVQARWLFENMPERDIVSWNSIISAHSKEPEALIFFEQMEKAGVLPDNITFVS 650

Query: 368 LLSTCAHLGL------------GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEI 427
           LLS CAHLGL             +YG+   +EHY CMVNLYGRAG+IEEAY +I +GMEI
Sbjct: 651 LLSACAHLGLVKEGERLFSIMRNRYGMSSIMEHYGCMVNLYGRAGLIEEAYHLIVEGMEI 710

Query: 428 EAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRV 486
           EAGPT+WGALLYAC LH +V++ E+AAERLFELEPDNE N+ELLMKIYGNAGR ED K V
Sbjct: 711 EAGPTVWGALLYACCLHGNVEVGEVAAERLFELEPDNEHNYELLMKIYGNAGRVEDVKTV 770

BLAST of Cucsa.251540 vs. NCBI nr
Match: gi|657945057|ref|XP_008377840.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Malus domestica])

HSP 1 Score: 675.6 bits (1742), Expect = 6.3e-191
Identity = 330/485 (68.04%), Postives = 398/485 (82.06%), Query Frame = 1

Query: 14  KQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSLLELC 73
           + +  LSFPKS+PTPL+I+ KP  Q+K+QALDAV+ DLEAS+D G+ +D + F+SLLE+C
Sbjct: 273 QNTNSLSFPKSTPTPLIIYHKPSAQTKLQALDAVVKDLEASVDKGVNVDTQTFASLLEIC 332

Query: 74  YQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFA 133
           YQL+A+ +  R+H+LIP +LLRRNVG+SSKLLRLYA+ G ME+AH+VFDEM  R+ SAFA
Sbjct: 333 YQLEAMKYCHRVHKLIPRSLLRRNVGLSSKLLRLYAASGRMEEAHKVFDEMPKRDASAFA 392

Query: 134 WNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVV 193
           WN+LISGYAELGLYEDA+ALYFQMEEEGVEPD FTFPRVLKACGGIG IQIGEAVHRHVV
Sbjct: 393 WNALISGYAELGLYEDAMALYFQMEEEGVEPDRFTFPRVLKACGGIGFIQIGEAVHRHVV 452

Query: 194 RSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALD 253
           RSG   D FVLNALVDMY+KCG IV+ARKVFD+I  +D VSWN+MLT Y RHG+  +ALD
Sbjct: 453 RSGLLNDRFVLNALVDMYAKCGDIVKARKVFDKISSRDKVSWNTMLTSYMRHGVLLQALD 512

Query: 254 IFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCG 313
           IF QM  EGY+PDSV++ST+L+ + S++  + IHGWVIR GVEWNLSIAN+LI  Y+K  
Sbjct: 513 IFRQMFDEGYQPDSVSISTILAAVQSLQLVVQIHGWVIRMGVEWNLSIANALIAAYSKHH 572

Query: 314 KLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCA 373
           +LNRA+WLF  MP++D+V+WN+IISAH  S EAL YF+ ME  G  PD +TFVS+LS CA
Sbjct: 573 ELNRARWLFSHMPERDVVTWNTIISAHSKSREALLYFDQMEKDGALPDSITFVSILSACA 632

Query: 374 HLGL------------GKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTI 433
           +LGL             +Y I P +EHYACMVNLYGR+G I+EAY IIT GME EAGPT+
Sbjct: 633 NLGLVKDGQRLYSVMKNRYRISPIMEHYACMVNLYGRSGRIKEAYGIITDGMEFEAGPTV 692

Query: 434 WGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAE 487
           WGALLYACYLHS+VDI E+AAE+LF+LEPDNE NFELLM IYGN GR ED +RV+LMM E
Sbjct: 693 WGALLYACYLHSNVDIGEVAAEKLFDLEPDNEYNFELLMMIYGNVGRLEDVERVRLMMME 752

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP337_ARATH6.4e-17562.32Pentatricopeptide repeat-containing protein At4g25270, chloroplastic OS=Arabidop... [more]
PP265_ARATH1.0e-7936.85Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop... [more]
PP320_ARATH8.8e-7636.69Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t... [more]
PP219_ARATH7.5e-7536.26Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis th... [more]
PPR21_ARATH6.3e-7434.68Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0L5M0_CUCSA3.2e-27496.57Uncharacterized protein OS=Cucumis sativus GN=Csa_3G176270 PE=4 SV=1[more]
M5XGA4_PRUPE1.4e-18968.45Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa019039mg PE=4 SV=1[more]
D7T277_VITVI3.5e-18868.61Putative uncharacterized protein OS=Vitis vinifera GN=VIT_16s0022g00970 PE=4 SV=... [more]
A0A067JQW8_JATCU2.8e-18566.67Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26236 PE=4 SV=1[more]
A0A061FZF0_THECC1.8e-18467.01Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_0149... [more]
Match NameE-valueIdentityDescription
AT4G25270.13.6e-17662.32 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G46790.15.7e-8136.85 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G18750.15.0e-7736.69 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G08820.14.2e-7636.26 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G08070.13.6e-7534.68 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778688525|ref|XP_011652769.1|4.6e-27496.57PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic ... [more]
gi|700202147|gb|KGN57280.1|4.6e-27496.57hypothetical protein Csa_3G176270 [Cucumis sativus][more]
gi|659077243|ref|XP_008439101.1|3.4e-26192.53PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic ... [more]
gi|1009123035|ref|XP_015878328.1|1.0e-19367.76PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic ... [more]
gi|657945057|ref|XP_008377840.1|6.3e-19168.04PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic ... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009117 nucleotide metabolic process
biological_process GO:0016310 phosphorylation
biological_process GO:0072528 pyrimidine-containing compound biosynthetic process
biological_process GO:0008380 RNA splicing
biological_process GO:0044711 single-organism biosynthetic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0019201 nucleotide kinase activity
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.251540.1Cucsa.251540.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 303..328
score: 4.0E-4coord: 389..409
score: 0.05coord: 104..128
score: 0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 230..275
score: 2.3E-10coord: 132..176
score: 2.6
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 233..267
score: 6.0E-9coord: 133..165
score: 2.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 97..127
score: 7.333coord: 385..419
score: 7.717coord: 452..486
score: 7.147coord: 231..265
score: 13.197coord: 130..164
score: 12.912coord: 200..230
score: 7.947coord: 165..199
score: 6.533coord: 298..332
score: 9
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 197..246
score: 4.8E-4coord: 115..160
score: 4.8E-4coord: 247..325
score: 1.1E-6coord: 380..471
score: 1.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 108..154
score: 1.71E-5coord: 400..470
score: 1.7
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 10..485
score: 1.8E
NoneNo IPR availablePANTHERPTHR24015:SF497SUBFAMILY NOT NAMEDcoord: 10..485
score: 1.8E

The following gene(s) are paralogous to this gene:

None