Csa1G666460 (gene) Cucumber (Chinese Long) v2

Name: Csa1G666460
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Pentatricopeptide repeat-containing protein, putative; contains IPR002885 (Pentatricopeptide repeat), IPR011990 (Tetratricopeptide-like helical)
Location: Chr1 : 27079553 .. 27081602 (+)
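
The Location field can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotations, but not stated in this record); the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr1 : 27079553 .. 27081602 (+)'.

    Returns (seqid, start, end, strand). The layout is taken from the
    Location field of this record; other pages may format it differently.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("Chr1 : 27079553 .. 27081602 (+)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the feature spans 2,050 bp on the forward strand of Chr1.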
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TCGGTTCTTCTTAAGAGAGAAAAGGTATTGGATTATCTATGAGGTGTCCAAGTGCGGACATACCTCGTGGATTTCTTATCCAAAGGGCTAATGCTTTTACAATTGAATCCTTTCAATCTTTTGTTGCTCGTTTCTCTCAGCACTAGATAGACAAAAGGAGCGAAACTCGTTTGCAAATTCTGTTCAAAGTTTAAACAAAGGACGATGCTCCATCTCCAACACCACCTAATGGCATAGCAAGCGTCATCAACGAGCATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAATCCATTAGCGAAGCCAACAATTCCCCAATCACGTTCAGATTCCATCCCCGCTTGCAGATTTTCAAACAAAACCCATCTCAGAAATGTCACTTCTTCTGCTGAATTTAGACAACCCCATTTCCCCAATCTCGATAACAGAGATGCTCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAGTCCCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACTAAACTCATTAAAGGGTTTTTTAATTCGAGGAATTTAAAGAAAGCTATGAGAGTTATGGAGATTTTGGAAACCTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATCAGTGGGTTTAGTAAAGCCAACCAAATTGATTCTGCAAACCAGGTGTTTGATAGAATGCGCAGCAGGGGTTTTTCTCCTGATGTCGTTACTTACAATATAATGATTGGGAGTTTGTGTAGTAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAGTTGGTGTCGAGGGGCCTCCGTCCTGACTTGTATACATACAATGCCATCATTCGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCTTGGATTTTGTTCGACATTTATCAGCTAGAGGGTGTAATCCAGATGTGGTATCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCCGGTGGGAAGATGGGGAGAGGCTTATGAAAGACATGGTCCTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTTTGTCGCGAAGGGAGAGTAAGGGAAGCCGTGAATGTGTTGGAGGTGATGAAGGAGAAAGGGTTAACACCAGATTCATATAGCTATGATCCACTGATTTCCGCCTTCTGCAAAGAAGGGAGATTGGATTTAGCAATTGAGTATTTGGAAAAAATGGTTTCTGATGGTTGTTTGCCCGATATTGTAAACTACAATACAATTTTGGCTACACTTTGTAAATTTGGTTGTGCTGATCTTGCTTTAGACGTCTTTGAGAAGCTGGATGAAGTGGGTTGCCCTCCAACTGTGAGGGCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAACAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCCGATGAGATAACATACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTTGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGGTTCCAGCCGACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAATGAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCCGGTTGGCGAGCAGAGGCTATGGAGTTAGCCAACAGTCTGTACAGATTGGGAGTTATTTCTGGAGATTCTTCCAAGCGTTTGAACAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCTTGCAAAGCTG
ATGTTGACTATAGAATTTTCAACTTTTGGCTATTTTGTTTTGTTTTCTTT

mRNA sequence

ATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAATCCATTAGCGAAGCCAACAATTCCCCAATCACGTTCAGATTCCATCCCCGCTTGCAGATTTTCAAACAAAACCCATCTCAGAAATGTCACTTCTTCTGCTGAATTTAGACAACCCCATTTCCCCAATCTCGATAACAGAGATGCTCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAGTCCCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACTAAACTCATTAAAGGGTTTTTTAATTCGAGGAATTTAAAGAAAGCTATGAGAGTTATGGAGATTTTGGAAACCTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATCAGTGGGTTTAGTAAAGCCAACCAAATTGATTCTGCAAACCAGGTGTTTGATAGAATGCGCAGCAGGGGTTTTTCTCCTGATGTCGTTACTTACAATATAATGATTGGGAGTTTGTGTAGTAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAGTTGGTGTCGAGGGGCCTCCGTCCTGACTTGTATACATACAATGCCATCATTCGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCTTGGATTTTGTTCGACATTTATCAGCTAGAGGGTGTAATCCAGATGTGGTATCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCCGGTGGGAAGATGGGGAGAGGCTTATGAAAGACATGGTCCTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTTTGTCGCGAAGGGAGAGTAAGGGAAGCCGTGAATGTGTTGGAGGTGATGAAGGAGAAAGGGTTAACACCAGATTCATATAGCTATGATCCACTGATTTCCGCCTTCTGCAAAGAAGGGAGATTGGATTTAGCAATTGAGTATTTGGAAAAAATGGTTTCTGATGGTTGTTTGCCCGATATTGTAAACTACAATACAATTTTGGCTACACTTTGTAAATTTGGTTGTGCTGATCTTGCTTTAGACGTCTTTGAGAAGCTGGATGAAGTGGGTTGCCCTCCAACTGTGAGGGCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAACAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCCGATGAGATAACATACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTTGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGGTTCCAGCCGACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAATGAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCCGGTTGGCGAGCAGAGGCTATGGAGTTAGCCAACAGTCTGTACAGATTGGGAGTTATTTCTGGAGATTCTTCCAAGCGTTTGAACAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCTTGCAAAGCTGA

Coding sequence (CDS)

ATGTTTTCATCGGAATTTCTCCCTCAGAGCCTCCATTTCACCAATCCATTAGCGAAGCCAACAATTCCCCAATCACGTTCAGATTCCATCCCCGCTTGCAGATTTTCAAACAAAACCCATCTCAGAAATGTCACTTCTTCTGCTGAATTTAGACAACCCCATTTCCCCAATCTCGATAACAGAGATGCTCATTTGATGAAACTCCTCAACAGATCCTGCAGAGCTGGGAAGCACAACGAGTCCCTCTATTTTCTCGAAAGCGTGGTGAGTAAAGGCTTCAAACCTGATGTTGTGCTCTGTACTAAACTCATTAAAGGGTTTTTTAATTCGAGGAATTTAAAGAAAGCTATGAGAGTTATGGAGATTTTGGAAACCTATGGTGACCCTGATGTTTATTCTTACAATGCTATGATCAGTGGGTTTAGTAAAGCCAACCAAATTGATTCTGCAAACCAGGTGTTTGATAGAATGCGCAGCAGGGGTTTTTCTCCTGATGTCGTTACTTACAATATAATGATTGGGAGTTTGTGTAGTAGGGGGAAGCTTGAGCTTGCTTTTGAAGTTATGGATGAGCTTTTGAAGGATGGGTGTAAGCCATCTGTGATTACTTACACAATTCTTATAGAAGCAACCATTCTTGAAGGTAGAATCAATGAAGCTCTTGAGCTGTTTGATGAGTTGGTGTCGAGGGGCCTCCGTCCTGACTTGTATACATACAATGCCATCATTCGAGGTATTTGCAAGGAAGGAATGGAGGATCGAGCCTTGGATTTTGTTCGACATTTATCAGCTAGAGGGTGTAATCCAGATGTGGTATCATACAATATTCTGCTGCGTTCTTTTCTAAACAAAAGCCGGTGGGAAGATGGGGAGAGGCTTATGAAAGACATGGTCCTAAGTGGCTGTGAGCCGAATGTCGTTACTCACAGCATTTTAATTAGTTCGTTTTGTCGCGAAGGGAGAGTAAGGGAAGCCGTGAATGTGTTGGAGGTGATGAAGGAGAAAGGGTTAACACCAGATTCATATAGCTATGATCCACTGATTTCCGCCTTCTGCAAAGAAGGGAGATTGGATTTAGCAATTGAGTATTTGGAAAAAATGGTTTCTGATGGTTGTTTGCCCGATATTGTAAACTACAATACAATTTTGGCTACACTTTGTAAATTTGGTTGTGCTGATCTTGCTTTAGACGTCTTTGAGAAGCTGGATGAAGTGGGTTGCCCTCCAACTGTGAGGGCCTACAACACAATGTTCAGTGCACTTTGGAGCTGTGGGAACAAGATCAAGGCTCTGGAGATGATATCAGAAATGATAAGAAAAGGAATTGATCCCGATGAGATAACATACAATTCTCTGATCTCATGCTTGTGTCGGGACGGGTTGGTTGATGAGGCTATTGGATTGTTGGTAGACATGGAAGCTACCAGGTTCCAGCCGACAGTGATCAGCTTCAACATTGTGCTTCTGGGAATGTGTAAAGCACACAGGGTTTTTGAAGGCATTGAGTTGCTAATAACAATGGTTGAAAAAGGTTGCCTACCGAATGAAACTAGTTACGTCTTGTTAATCGAGGGGATCGCTTATGCCGGTTGGCGAGCAGAGGCTATGGAGTTAGCCAACAGTCTGTACAGATTGGGAGTTATTTCTGGAGATTCTTCCAAGCGTTTGAACAAGACATTTCCAATGCTTGACGTTTATAAAGGGCTAAGCTTATCAGAAAGCAAGAACCAACTCTTGCAAAGCTGA

Protein sequence

MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS*
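
The relationship between the coding sequence and the protein sequence can be checked by translation. The sketch below is illustrative only: it assumes the standard nuclear genetic code, uses a hypothetical `translate` helper, and works on short excerpts copied from the start and end of the CDS above rather than on the full sequence.

```python
from itertools import product

# Standard genetic code, laid out in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt and last 21 nt of the CDS listed in this record.
cds_start = "ATGTTTTCATCGGAATTTCTCCCTCAGAGC"
cds_end = "AACCAACTCTTGCAAAGCTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to the first ten residues (`MFSSEFLPQS`) and the last six residues plus stop (`NQLLQS*`) of the protein sequence above.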
BLAST of Csa1G666460 vs. Swiss-Prot
Match: PP213_ARATH (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 723.0 bits (1865), Expect = 2.6e-207
Identity = 348/555 (62.70%), Positives = 442/555 (79.64%), Query Frame = 1

Query: 11  LHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDNRDAHLMKLLN 70
           L F+N  + P     RS S    R    T   + T   E RQ H  +L  RD  ++K+ +
Sbjct: 40  LTFSN--SNPNNDNGRSFSSSGARNLQTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFH 99

Query: 71  RSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDPD 130
           RSCR+G + ESL+ LE++V KG+ PDV+LCTKLIKGFF  RN+ KA+RVMEILE +G PD
Sbjct: 100 RSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPD 159

Query: 131 VYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMD 190
           V++YNA+I+GF K N+ID A +V DRMRS+ FSPD VTYNIMIGSLCSRGKL+LA +V++
Sbjct: 160 VFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLN 219

Query: 191 ELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEG 250
           +LL D C+P+VITYTILIEAT+LEG ++EAL+L DE++SRGL+PD++TYN IIRG+CKEG
Sbjct: 220 QLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEG 279

Query: 251 MEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHS 310
           M DRA + VR+L  +GC PDV+SYNILLR+ LN+ +WE+GE+LM  M    C+PNVVT+S
Sbjct: 280 MVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYS 339

Query: 311 ILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSD 370
           ILI++ CR+G++ EA+N+L++MKEKGLTPD+YSYDPLI+AFC+EGRLD+AIE+LE M+SD
Sbjct: 340 ILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISD 399

Query: 371 GCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKA 430
           GCLPDIVNYNT+LATLCK G AD AL++F KL EVGC P   +YNTMFSALWS G+KI+A
Sbjct: 400 GCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRA 459

Query: 431 LEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIVLLG 490
           L MI EM+  GIDPDEITYNS+ISCLCR+G+VDEA  LLVDM +  F P+V+++NIVLLG
Sbjct: 460 LHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLG 519

Query: 491 MCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGVISG 550
            CKAHR+ + I +L +MV  GC PNET+Y +LIEGI +AG+RAEAMELAN L R+  IS 
Sbjct: 520 FCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISE 579

Query: 551 DSSKRLNKTFPMLDV 566
            S KRL++TFP+L+V
Sbjct: 580 YSFKRLHRTFPLLNV 592

BLAST of Csa1G666460 vs. Swiss-Prot
Match: PPR28_ARATH (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 452.2 bits (1162), Expect = 8.5e-126
Identity = 223/489 (45.60%), Positives = 318/489 (65.03%), Query Frame = 1

Query: 69  LNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGD 128
           L +  R G+  E   FLE++V  G  PD++ CT LI+GF      +KA +++EILE  G 
Sbjct: 109 LRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGA 168

Query: 129 -PDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFE 188
            PDV +YN MISG+ KA +I++A  V DRM     SPDVVTYN ++ SLC  GKL+ A E
Sbjct: 169 VPDVITYNVMISGYCKAGEINNALSVLDRM---SVSPDVVTYNTILRSLCDSGKLKQAME 228

Query: 189 VMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGIC 248
           V+D +L+  C P VITYTILIEAT  +  +  A++L DE+  RG  PD+ TYN ++ GIC
Sbjct: 229 VLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGIC 288

Query: 249 KEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVV 308
           KEG  D A+ F+  + + GC P+V+++NI+LRS  +  RW D E+L+ DM+  G  P+VV
Sbjct: 289 KEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVV 348

Query: 309 THSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKM 368
           T +ILI+  CR+G +  A+++LE M + G  P+S SY+PL+  FCKE ++D AIEYLE+M
Sbjct: 349 TFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERM 408

Query: 369 VSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNK 428
           VS GC PDIV YNT+L  LCK G  + A+++  +L   GC P +  YNT+   L   G  
Sbjct: 409 VSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKT 468

Query: 429 IKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIV 488
            KA++++ EM  K + PD ITY+SL+  L R+G VDEAI    + E    +P  ++FN +
Sbjct: 469 GKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSI 528

Query: 489 LLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGV 548
           +LG+CK+ +    I+ L+ M+ +GC PNETSY +LIEG+AY G   EA+EL N L   G+
Sbjct: 529 MLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGL 588

Query: 549 ISGDSSKRL 557
           +   S++++
Sbjct: 589 MKKSSAEQV 594

BLAST of Csa1G666460 vs. Swiss-Prot
Match: PP131_ARATH (Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidopsis thaliana GN=At1g79080 PE=2 SV=1)

HSP 1 Score: 292.4 bits (747), Expect = 1.1e-77
Identity = 170/487 (34.91%), Positives = 273/487 (56.06%), Query Frame = 1

Query: 79  NESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGD-PDVYSYNAM 138
           ++S   LES+V+ G KP+V   T+L+     +  LKKA+RV+E++ + G  PD  +Y  +
Sbjct: 88  SDSFSHLESLVTGGHKPNVAHSTQLLYDLCKANRLKKAIRVIELMVSSGIIPDASAYTYL 147

Query: 139 ISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGC 198
           ++   K   +  A Q+ ++M   G+  + VTYN ++  LC  G L  + + ++ L++ G 
Sbjct: 148 VNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTYNALVRGLCMLGSLNQSLQFVERLMQKGL 207

Query: 199 KPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALD 258
            P+  TY+ L+EA   E   +EA++L DE++ +G  P+L +YN ++ G CKEG  D A+ 
Sbjct: 208 APNAFTYSFLLEAAYKERGTDEAVKLLDEIIVKGGEPNLVSYNVLLTGFCKEGRTDDAMA 267

Query: 259 FVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFC 318
             R L A+G   +VVSYNILLR      RWE+   L+ +M      P+VVT++ILI+S  
Sbjct: 268 LFRELPAKGFKANVVSYNILLRCLCCDGRWEEANSLLAEMDGGDRAPSVVTYNILINSLA 327

Query: 319 REGRVREAVNVLEVMKEKG--LTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPD 378
             GR  +A+ VL+ M +        + SY+P+I+  CKEG++DL ++ L++M+   C P+
Sbjct: 328 FHGRTEQALQVLKEMSKGNHQFRVTATSYNPVIARLCKEGKVDLVVKCLDEMIYRRCKPN 387

Query: 379 IVNYNTILATLCKFGC-ADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMI 438
              YN I  +LC+       A  + + L       T   Y ++ ++L   GN   A +++
Sbjct: 388 EGTYNAI-GSLCEHNSKVQEAFYIIQSLSNKQKCCTHDFYKSVITSLCRKGNTFAAFQLL 447

Query: 439 SEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM-EATRFQPTVISFNIVLLGMCK 498
            EM R G DPD  TY++LI  LC +G+   A+ +L  M E+   +PTV +FN ++LG+CK
Sbjct: 448 YEMTRCGFDPDAHTYSALIRGLCLEGMFTGAMEVLSIMEESENCKPTVDNFNAMILGLCK 507

Query: 499 AHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGVISGDSS 558
             R    +E+   MVEK  +PNET+Y +L+EGIA+      A E+ + L    VI  ++ 
Sbjct: 508 IRRTDLAMEVFEMMVEKKRMPNETTYAILVEGIAHEDELELAKEVLDELRLRKVIGQNAV 567

Query: 559 KRLNKTF 561
            R+   F
Sbjct: 568 DRIVMQF 573

BLAST of Csa1G666460 vs. Swiss-Prot
Match: PPR36_ARATH (Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidopsis thaliana GN=At1g12300 PE=2 SV=1)

HSP 1 Score: 269.2 bits (687), Expect = 1.0e-70
Identity = 141/470 (30.00%), Positives = 252/470 (53.62%), Query Frame = 1

Query: 67  KLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM-EILET 126
           +L +   +  +++  L   + +  KG   ++   + +I  F   R L  A   M +I++ 
Sbjct: 93  RLFSAIAKTKQYDLVLALCKQMELKGIAHNLYTLSIMINCFCRCRKLCLAFSAMGKIIKL 152

Query: 127 YGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELA 186
             +P+  +++ +I+G     ++  A ++ DRM   G  PD++T N ++  LC  GK   A
Sbjct: 153 GYEPNTITFSTLINGLCLEGRVSEALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEA 212

Query: 187 FEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRG 246
             ++D++++ GC+P+ +TY  ++      G+   A+EL  ++  R ++ D   Y+ II G
Sbjct: 213 MLLIDKMVEYGCQPNAVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDG 272

Query: 247 ICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPN 306
           +CK G  D A +    +  +G   ++++YNIL+  F N  RW+DG +L++DM+     PN
Sbjct: 273 LCKHGSLDNAFNLFNEMEMKGITTNIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPN 332

Query: 307 VVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLE 366
           VVT S+LI SF +EG++REA  + + M  +G+ PD+ +Y  LI  FCKE  LD A + ++
Sbjct: 333 VVTFSVLIDSFVKEGKLREAEELHKEMIHRGIAPDTITYTSLIDGFCKENHLDKANQMVD 392

Query: 367 KMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCG 426
            MVS GC P+I  +N ++   CK    D  L++F K+   G       YNT+       G
Sbjct: 393 LMVSKGCDPNIRTFNILINGYCKANRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELG 452

Query: 427 NKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFN 486
               A E+  EM+ + + P+ +TY  L+  LC +G  ++A+ +   +E ++ +  +  +N
Sbjct: 453 KLNVAKELFQEMVSRKVPPNIVTYKILLDGLCDNGESEKALEIFEKIEKSKMELDIGIYN 512

Query: 487 IVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEA 536
           I++ GMC A +V +  +L  ++  KG  P   +Y ++I G+   G  +EA
Sbjct: 513 IIIHGMCNASKVDDAWDLFCSLPLKGVKPGVKTYNIMIGGLCKKGPLSEA 562


HSP 2 Score: 241.5 bits (615), Expect = 2.3e-62
Identity = 137/504 (27.18%), Positives = 256/504 (50.79%), Query Frame = 1

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG 127
           ++N  CR  K   +   +  ++  G++P+ +  + LI G      + +A+ +++ +   G
Sbjct: 129 MINCFCRCRKLCLAFSAMGKIIKLGYEPNTITFSTLINGLCLEGRVSEALELVDRMVEMG 188

Query: 128 D-PDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAF 187
             PD+ + N +++G   + +   A  + D+M   G  P+ VTY  ++  +C  G+  LA 
Sbjct: 189 HKPDLITINTLVNGLCLSGKEAEAMLLIDKMVEYGCQPNAVTYGPVLNVMCKSGQTALAM 248

Query: 188 EVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGI 247
           E++ ++ +   K   + Y+I+I+     G ++ A  LF+E+  +G+  ++ TYN +I G 
Sbjct: 249 ELLRKMEERNIKLDAVKYSIIIDGLCKHGSLDNAFNLFNEMEMKGITTNIITYNILIGGF 308

Query: 248 CKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNV 307
           C  G  D     +R +  R  NP+VV++++L+ SF+ + +  + E L K+M+  G  P+ 
Sbjct: 309 CNAGRWDDGAKLLRDMIKRKINPNVVTFSVLIDSFVKEGKLREAEELHKEMIHRGIAPDT 368

Query: 308 VTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEK 367
           +T++ LI  FC+E  + +A  ++++M  KG  P+  +++ LI+ +CK  R+D  +E   K
Sbjct: 369 ITYTSLIDGFCKENHLDKANQMVDLMVSKGCDPNIRTFNILINGYCKANRIDDGLELFRK 428

Query: 368 MVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGN 427
           M   G + D V YNT++   C+ G  ++A ++F+++     PP +  Y  +   L   G 
Sbjct: 429 MSLRGVVADTVTYNTLIQGFCELGKLNVAKELFQEMVSRKVPPNIVTYKILLDGLCDNGE 488

Query: 428 KIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNI 487
             KALE+  ++ +  ++ D   YN +I  +C    VD+A  L   +     +P V ++NI
Sbjct: 489 SEKALEIFEKIEKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPLKGVKPGVKTYNI 548

Query: 488 VLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLG 547
           ++ G+CK   + E   L   M E G  P+  +Y +LI      G   ++++L   L R G
Sbjct: 549 MIGGLCKKGPLSEAELLFRKMEEDGHAPDGWTYNILIRAHLGDGDATKSVKLIEELKRCG 608

Query: 548 ----------VISGDSSKRLNKTF 561
                     VI   S  RL K+F
Sbjct: 609 FSVDASTIKMVIDMLSDGRLKKSF 632


HSP 3 Score: 179.9 bits (455), Expect = 8.2e-44
Identity = 116/454 (25.55%), Positives = 209/454 (46.04%), Query Frame = 1

Query: 85  LESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDPDVYSYNAMISGFSKA 144
           L S VSK  +P ++    L     N  N + +         + D ++     + SG    
Sbjct: 9   LSSQVSKFVQPRLLETGTLRIALINCPN-ELSFCCERGFSAFSDRNLSYRERLRSGLVDI 68

Query: 145 NQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGCKPSVITY 204
              D+ +   D + SR   P V+ ++ +  ++    + +L   +  ++   G   ++ T 
Sbjct: 69  KADDAIDLFRDMIHSRPL-PTVIDFSRLFSAIAKTKQYDLVLALCKQMELKGIAHNLYTL 128

Query: 205 TILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALDFVRHLSA 264
           +I+I       ++  A     +++  G  P+  T++ +I G+C EG    AL+ V  +  
Sbjct: 129 SIMINCFCRCRKLCLAFSAMGKIIKLGYEPNTITFSTLINGLCLEGRVSEALELVDRMVE 188

Query: 265 RGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFCREGRVRE 324
            G  PD+++ N L+       +  +   L+  MV  GC+PN VT+  +++  C+ G+   
Sbjct: 189 MGHKPDLITINTLVNGLCLSGKEAEAMLLIDKMVEYGCQPNAVTYGPVLNVMCKSGQTAL 248

Query: 325 AVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPDIVNYNTILA 384
           A+ +L  M+E+ +  D+  Y  +I   CK G LD A     +M   G   +I+ YN ++ 
Sbjct: 249 AMELLRKMEERNIKLDAVKYSIIIDGLCKHGSLDNAFNLFNEMEMKGITTNIITYNILIG 308

Query: 385 TLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMISEMIRKGIDP 444
             C  G  D    +   + +    P V  ++ +  +    G   +A E+  EMI +GI P
Sbjct: 309 GFCNAGRWDDGAKLLRDMIKRKINPNVVTFSVLIDSFVKEGKLREAEELHKEMIHRGIAP 368

Query: 445 DEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIVLLGMCKAHRVFEGIELL 504
           D ITY SLI   C++  +D+A  ++  M +    P + +FNI++ G CKA+R+ +G+EL 
Sbjct: 369 DTITYTSLIDGFCKENHLDKANQMVDLMVSKGCDPNIRTFNILINGYCKANRIDDGLELF 428

Query: 505 ITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMEL 539
             M  +G + +  +Y  LI+G    G    A EL
Sbjct: 429 RKMSLRGVVADTVTYNTLIQGFCELGKLNVAKEL 460

BLAST of Csa1G666460 vs. Swiss-Prot
Match: PP281_ARATH (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana GN=MEE40 PE=2 SV=1)

HSP 1 Score: 269.2 bits (687), Expect = 1.0e-70
Identity = 142/469 (30.28%), Positives = 246/469 (52.45%), Query Frame = 1

Query: 76  GKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGD--PDVYS 135
           G  + +L   E +V  G     V    ++ GF     ++ A+  ++ +       PD Y+
Sbjct: 238 GDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYT 297

Query: 136 YNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELL 195
           +N +++G  KA  +  A ++ D M   G+ PDV TYN +I  LC  G+++ A EV+D+++
Sbjct: 298 FNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMI 357

Query: 196 KDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMED 255
              C P+ +TY  LI     E ++ EA EL   L S+G+ PD+ T+N++I+G+C      
Sbjct: 358 TRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHR 417

Query: 256 RALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILI 315
            A++    + ++GC PD  +YN+L+ S  +K + ++   ++K M LSGC  +V+T++ LI
Sbjct: 418 VAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLI 477

Query: 316 SSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCL 375
             FC+  + REA  + + M+  G++ +S +Y+ LI   CK  R++ A + +++M+ +G  
Sbjct: 478 DGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQK 537

Query: 376 PDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEM 435
           PD   YN++L   C+ G    A D+ + +   GC P +  Y T+ S L   G    A ++
Sbjct: 538 PDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKL 597

Query: 436 ISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM-EATRFQPTVISFNIVLLGMC 495
           +  +  KGI+     YN +I  L R     EAI L  +M E     P  +S+ IV  G+C
Sbjct: 598 LRSIQMKGINLTPHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLC 657

Query: 496 K-AHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 541
                + E ++ L+ ++EKG +P  +S  +L EG+         ++L N
Sbjct: 658 NGGGPIREAVDFLVELLEKGFVPEFSSLYMLAEGLLTLSMEETLVKLVN 706


HSP 2 Score: 244.6 bits (623), Expect = 2.7e-63
Identity = 131/449 (29.18%), Positives = 236/449 (52.56%), Query Frame = 1

Query: 92  GFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG-DPDVYSYNAMISGFSKANQIDSA 151
           G KPDV     LIK    +  L+ A+ ++E + +YG  PD  ++  ++ G+ +   +D A
Sbjct: 184 GIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGA 243

Query: 152 NQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELL-KDGCKPSVITYTILIE 211
            ++ ++M   G S   V+ N+++   C  G++E A   + E+  +DG  P   T+  L+ 
Sbjct: 244 LRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVN 303

Query: 212 ATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNP 271
                G +  A+E+ D ++  G  PD+YTYN++I G+CK G    A++ +  +  R C+P
Sbjct: 304 GLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSP 363

Query: 272 DVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVL 331
           + V+YN L+ +   +++ E+   L + +   G  P+V T + LI   C     R A+ + 
Sbjct: 364 NTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELF 423

Query: 332 EVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKF 391
           E M+ KG  PD ++Y+ LI + C +G+LD A+  L++M   GC   ++ YNT++   CK 
Sbjct: 424 EEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKA 483

Query: 392 GCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITY 451
                A ++F++++  G       YNT+   L        A +++ +MI +G  PD+ TY
Sbjct: 484 NKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTY 543

Query: 452 NSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVE 511
           NSL++  CR G + +A  ++  M +   +P ++++  ++ G+CKA RV    +LL ++  
Sbjct: 544 NSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQM 603

Query: 512 KGCLPNETSYVLLIEGIAYAGWRAEAMEL 539
           KG      +Y  +I+G+       EA+ L
Sbjct: 604 KGINLTPHAYNPVIQGLFRKRKTTEAINL 632


HSP 3 Score: 222.2 bits (565), Expect = 1.4e-56
Identity = 135/490 (27.55%), Positives = 248/490 (50.61%), Query Frame = 1

Query: 56  PNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKK 115
           PN     A   ++L R  R+G  ++    LE + S   +        LI+ +       +
Sbjct: 77  PNFSPEPALYEEILLRLGRSGSFDDMKKILEDMKSSRCEMGTSTFLILIESYAQFELQDE 136

Query: 116 AMRVME-ILETYG-DPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMI 175
            + V++ +++ +G  PD + YN M++     N +        +M   G  PDV T+N++I
Sbjct: 137 ILSVVDWMIDEFGLKPDTHFYNRMLNLLVDGNSLKLVEISHAKMSVWGIKPDVSTFNVLI 196

Query: 176 GSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLR 235
            +LC   +L  A  +++++   G  P   T+T +++  I EG ++ AL + +++V  G  
Sbjct: 197 KALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCS 256

Query: 236 PDLYTYNAIIRGICKEGMEDRALDFVRHLSAR-GCNPDVVSYNILLRSFLNKSRWEDGER 295
               + N I+ G CKEG  + AL+F++ +S + G  PD  ++N L+         +    
Sbjct: 257 WSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHAIE 316

Query: 296 LMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFC 355
           +M  M+  G +P+V T++ +IS  C+ G V+EAV VL+ M  +  +P++ +Y+ LIS  C
Sbjct: 317 IMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLC 376

Query: 356 KEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVR 415
           KE +++ A E    + S G LPD+  +N+++  LC      +A+++FE++   GC P   
Sbjct: 377 KENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEF 436

Query: 416 AYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM 475
            YN +  +L S G   +AL M+ +M   G     ITYN+LI   C+     EA  +  +M
Sbjct: 437 TYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDEM 496

Query: 476 EATRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWR 535
           E        +++N ++ G+CK+ RV +  +L+  M+ +G  P++ +Y  L+      G  
Sbjct: 497 EVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDI 556

Query: 536 AEAMELANSL 543
            +A ++  ++
Sbjct: 557 KKAADIVQAM 566

BLAST of Csa1G666460 vs. TrEMBL
Match: A0A0A0M3C6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G666460 PE=4 SV=1)

HSP 1 Score: 1161.7 bits (3004), Expect = 0.0e+00
Identity = 581/581 (100.00%), Positives = 581/581 (100.00%), Query Frame = 1

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA
Sbjct: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of Csa1G666460 vs. TrEMBL
Match: A0A061F8R7_THECC (Pentatricopeptide repeat (PPR-like) superfamily protein OS=Theobroma cacao GN=TCM_026185 PE=4 SV=1)

HSP 1 Score: 800.8 bits (2067), Expect = 1.1e-228
Identity = 392/582 (67.35%), Positives = 480/582 (82.47%), Query Frame = 1

Query: 1   MFSSEFLPQSLHFTNPLAKPTI-PQSRSDSIPAC-------RFSNKTHLRNVTSSAEFRQ 60
           +FS+E +  SL FT    KPT    S   S+ +C         S   + + V  SAE R 
Sbjct: 3   LFSTELVTHSLPFTTQQLKPTSNSHSHHTSLVSCLNHESQDSSSKSRNNQKVRVSAETRP 62

Query: 61  PHFPNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRN 120
            H  + D ++ HLMKLLNRSC+AGK+NE+ YFLE +V KG+KPDVVLCTK+IKGFFN RN
Sbjct: 63  THLLSFDFKETHLMKLLNRSCKAGKYNEAFYFLECMVGKGYKPDVVLCTKMIKGFFNGRN 122

Query: 121 LKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIM 180
           ++KA RV+EILE YG+PDV++YNA+ISGF K N++D AN+V DRMRSRGFSPDVVTYNIM
Sbjct: 123 VEKATRVIEILEKYGEPDVFAYNAIISGFCKMNRLDFANKVLDRMRSRGFSPDVVTYNIM 182

Query: 181 IGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGL 240
           IGS CSRGKL+ A++V+++LLKD CKPSVITYTILIEAT+L+G INEA++L DE++S+GL
Sbjct: 183 IGSFCSRGKLDSAYKVINQLLKDNCKPSVITYTILIEATMLQGEINEAMKLLDEMLSKGL 242

Query: 241 RPDLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGER 300
           RPD++TYNAIIRG+CK+GM +RA  FVR L ARGC PDV+SYNILLR  LN+ +W +GE+
Sbjct: 243 RPDMFTYNAIIRGMCKDGMVNRAFKFVRSLKARGCQPDVISYNILLRVLLNQGKWAEGEK 302

Query: 301 LMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFC 360
           L+ +MV  GCEPNVVT+SILISS CREG++ EAVNVL++MKE+GLTPD+YSYDPLISAFC
Sbjct: 303 LVTEMVSRGCEPNVVTYSILISSLCREGKLEEAVNVLKMMKERGLTPDAYSYDPLISAFC 362

Query: 361 KEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVR 420
           KEGRLDLAIE+L+ M+SDGCLPDIVNYNT+LATLCK G A+ AL++FEKL EVGCPP V 
Sbjct: 363 KEGRLDLAIEFLDCMISDGCLPDIVNYNTVLATLCKNGKAEQALEIFEKLREVGCPPNVS 422

Query: 421 AYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM 480
           +YNTMFSALWS G+K+KALEMISEM+ K I PDEITYNSLISCLCRDG+VDEAI LLVDM
Sbjct: 423 SYNTMFSALWSSGDKVKALEMISEMLSKRIGPDEITYNSLISCLCRDGMVDEAIELLVDM 482

Query: 481 EATRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWR 540
             +   PTVIS+NIVLLG+CK HR+ + IE+L  MV+K C PNET+Y+LLIEGI +AGWR
Sbjct: 483 GCSGIPPTVISYNIVLLGLCKVHRINDAIEVLAAMVDKRCQPNETTYILLIEGIGFAGWR 542

Query: 541 AEAMELANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSES 575
           +EAMELAN+L+R+  IS DS KRLN+TFP+LDVYK  + S+S
Sbjct: 543 SEAMELANALFRMEAISKDSFKRLNRTFPLLDVYKEFAGSDS 584

BLAST of Csa1G666460 vs. TrEMBL
Match: A0A0D2TML8_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G275900 PE=4 SV=1)

HSP 1 Score: 797.3 bits (2058), Expect = 1.2e-227
Identity = 388/583 (66.55%), Positives = 481/583 (82.50%), Query Frame = 1

Query: 1   MFSSEFLPQSLHFTNPLAKPTI-PQSRSDSIPACRFSNKT--------HLRNVTSSAEFR 60
           +FS+E +P  L F     KP     S   S+ +C     T        + + V  SAE R
Sbjct: 4   LFSTELIPHGLPFHPQQLKPVSNSSSHHTSLVSCLSHEGTKDSIRKSRNNQKVRVSAETR 63

Query: 61  QPHFPNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSR 120
             H  + D +++HLMKLLNRSC++GK++E+ YFLE +V KG+KPDVVLCTK+IKGFFN R
Sbjct: 64  PTHLSSFDFKESHLMKLLNRSCKSGKYHEAFYFLECMVGKGYKPDVVLCTKMIKGFFNGR 123

Query: 121 NLKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNI 180
           N++KA+RVME+LETYG+PDV++YNA+ISGF K N++D AN+V DRMRSRGFSPDVVTYNI
Sbjct: 124 NVEKAIRVMEMLETYGEPDVFAYNALISGFCKMNRLDFANKVLDRMRSRGFSPDVVTYNI 183

Query: 181 MIGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRG 240
           MIGSLCSRGKL+ A++V+++LLKD CKPSVITYTILIEATIL+G INEA++L DE+++ G
Sbjct: 184 MIGSLCSRGKLDSAYKVLNQLLKDNCKPSVITYTILIEATILQGGINEAMKLLDEMLANG 243

Query: 241 LRPDLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGE 300
           LRPD++TYNAIIRG+CK+GM  RA +FVR L+ARGC PDV+SYNILLR+ LN+ +W +GE
Sbjct: 244 LRPDMFTYNAIIRGMCKDGMVGRAFEFVRGLNARGCQPDVISYNILLRALLNQGKWIEGE 303

Query: 301 RLMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAF 360
           +L+ +MV  GCEPNVVT+SILIS  CREG+V EAVNVL++MKE+GLTPD+YSYDPLISAF
Sbjct: 304 KLVTEMVSRGCEPNVVTYSILISCLCREGKVEEAVNVLKMMKERGLTPDAYSYDPLISAF 363

Query: 361 CKEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTV 420
           CKE RLDLAI++L+ M+SDGCLPDIVNYNTIL+TLCK G AD AL++FEKL EVGCPP V
Sbjct: 364 CKERRLDLAIQFLDYMISDGCLPDIVNYNTILSTLCKNGKADQALEIFEKLSEVGCPPNV 423

Query: 421 RAYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVD 480
            +YNTMF+ALWS G+K KALEMI EM+ K I PDEITYNSLISCLCRDG+VDEAI LL+D
Sbjct: 424 SSYNTMFTALWSTGDKFKALEMILEMLNKRIGPDEITYNSLISCLCRDGMVDEAIELLID 483

Query: 481 MEATRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGW 540
           M  ++ QPTVIS+NIVLLG+CK HR+ + IE+L  MVEKGC PNET+YVLL EGI +AGW
Sbjct: 484 MGRSKIQPTVISYNIVLLGLCKVHRIDDAIEVLAAMVEKGCQPNETTYVLLTEGIGFAGW 543

Query: 541 RAEAMELANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSES 575
           R++AMELAN+L+R+  IS D+ KRL KTFP+LDVYK  +LS+S
Sbjct: 544 RSQAMELANALFRMEAISEDTFKRLTKTFPLLDVYKEFALSDS 586

BLAST of Csa1G666460 vs. TrEMBL
Match: W9RHZ3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_024505 PE=4 SV=1)

HSP 1 Score: 793.9 bits (2049), Expect = 1.3e-226
Identity = 389/586 (66.38%), Positives = 487/586 (83.11%), Query Frame = 1

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRS---------DSIPACRFSNKTHLRNVTSSAEFR 60
           + S+EFLPQ+L F+    + T  QS +          S    R  NK  LR V  S E +
Sbjct: 3   IISTEFLPQTLPFSPQPKQHTSRQSHTCLSCRNPSQSSTDIYRKKNKKPLR-VRVSVETK 62

Query: 61  QPHFP-NLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNS 120
            P+   N D  ++HL+K++NRSC++GK+NE+LYFLE +VSKGFKPDV+LCTK+++GFFNS
Sbjct: 63  SPNSQSNSDFSESHLLKVINRSCKSGKYNEALYFLELMVSKGFKPDVILCTKVMRGFFNS 122

Query: 121 RNLKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYN 180
           RN+ KA+RVMEILE +G+PD++SYNAMISGF KAN+++ AN+V DRMR +GFSPD +TYN
Sbjct: 123 RNIPKAIRVMEILEKHGEPDLFSYNAMISGFCKANRVELANKVLDRMRVQGFSPDTITYN 182

Query: 181 IMIGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSR 240
           IMIGSLCSRGK+++AF+V+DELL+D CKPSVITYTILIEATI EG +++A+E+ +E++SR
Sbjct: 183 IMIGSLCSRGKVDMAFKVLDELLRDNCKPSVITYTILIEATISEGGVDKAMEVLEEMLSR 242

Query: 241 GLRPDLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDG 300
           GL PD++TYNAI+RG+C+EGM DRA +FVR L A+GC+P+V+SYNILLR+ LN+ +W DG
Sbjct: 243 GLLPDMFTYNAIVRGMCREGMLDRAFEFVRSLEAKGCSPNVISYNILLRALLNRGKWSDG 302

Query: 301 ERLMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISA 360
           E+++ DMV  GCEPNVVT+SILIS+ CR+G+V +AVNVL+ MKEKG+TPD+YSYDPLISA
Sbjct: 303 EKILSDMVSRGCEPNVVTYSILISTLCRDGKVEDAVNVLKAMKEKGITPDAYSYDPLISA 362

Query: 361 FCKEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPT 420
           FCKEGRLDLAIE+++ M+SDG LPDIVNYNTILA LCK G AD AL++FEKL EVGCPPT
Sbjct: 363 FCKEGRLDLAIEFMDYMISDGSLPDIVNYNTILAALCKNGNADHALEIFEKLGEVGCPPT 422

Query: 421 VRAYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLV 480
           V +YNTMFSALW+CG +IKALEMISEM+ K I+PDEITYNSLISCLCR+G+V+EAIGLL+
Sbjct: 423 VSSYNTMFSALWNCGERIKALEMISEMVSKRINPDEITYNSLISCLCREGMVNEAIGLLI 482

Query: 481 DMEATRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAG 540
           DMEA  F+ +VIS+NIVLLG+CKA R+ + IELL  MVEKGC PNET+Y LLIEGI +AG
Sbjct: 483 DMEAGGFKLSVISYNIVLLGLCKARRIDDAIELLAAMVEKGCRPNETTYTLLIEGIGFAG 542

Query: 541 WRAEAMELANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKN 577
           WR EAM LAN L+ +  IS  S KRLNKTFPMLDVYK L+LSE KN
Sbjct: 543 WRVEAMGLANLLFDIEAISEHSFKRLNKTFPMLDVYKELTLSEIKN 587

BLAST of Csa1G666460 vs. TrEMBL
Match: A0A067JIP8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23550 PE=4 SV=1)

HSP 1 Score: 781.9 bits (2018), Expect = 5.2e-223
Identity = 377/581 (64.89%), Positives = 475/581 (81.76%), Query Frame = 1

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKT---HLRN---VTSSAEFRQPH 60
           +FS EF+P+S+ FT    +     S   ++  C   N     +LRN   ++ SAE R  H
Sbjct: 3   LFSIEFIPRSITFTTQQRQKPTSNSFHSTVVGCLHPNLNDTDNLRNPPNLSVSAETRPTH 62

Query: 61  FPNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLK 120
             +LD ++ HL+KLLNRSC+AGK+NESLYFLE +V KG+KPDV++CTKLIKGFFNSR + 
Sbjct: 63  VLSLDFKEIHLIKLLNRSCKAGKYNESLYFLECMVDKGYKPDVIMCTKLIKGFFNSRKID 122

Query: 121 KAMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIG 180
           KA+RVMEILE YG PDV++YNA+ISGF KANQI++AN+V DRM+SRGF PDVVTYNIMIG
Sbjct: 123 KAIRVMEILEIYGKPDVFAYNALISGFCKANQIENANRVLDRMKSRGFLPDVVTYNIMIG 182

Query: 181 SLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRP 240
           S CSRGKL+LA +V +ELLKD CKP+VITYTILIEATILEG I+EA++L DE++SRGL P
Sbjct: 183 SFCSRGKLDLALKVFEELLKDNCKPTVITYTILIEATILEGGIDEAMKLLDEMLSRGLEP 242

Query: 241 DLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLM 300
           D +TYNAIIRG+CKE M DRA D VR L++R C PDV++YNIL+R+ LN+ +W +GE+L+
Sbjct: 243 DTFTYNAIIRGMCKEDMVDRAFDLVRILNSRACRPDVITYNILVRALLNQGKWNEGEKLL 302

Query: 301 KDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKE 360
            +M+  GC+PN VTHSILI + C +G+V EAV++L+ MKEKGL P++Y YDPLI+AFC+E
Sbjct: 303 NEMIARGCKPNAVTHSILIGALCHDGKVEEAVSLLKSMKEKGLKPNAYCYDPLIAAFCRE 362

Query: 361 GRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAY 420
           G L LAIE+L+ M+SDGCLPDIVNYNTI+A LCK G AD AL+VF+KL+EVGCPPTV +Y
Sbjct: 363 GNLHLAIEFLDYMISDGCLPDIVNYNTIMAGLCKKGNADHALEVFQKLNEVGCPPTVSSY 422

Query: 421 NTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEA 480
           NTMFSALWSCG+K +AL MI EM+ +GIDPDEITYNSLISCLCRDG+V+EAIGLLVDME 
Sbjct: 423 NTMFSALWSCGDKYRALGMILEMLNQGIDPDEITYNSLISCLCRDGMVNEAIGLLVDMEK 482

Query: 481 TRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAE 540
           +RFQP ++S+NI+LLG+CKA+RV + IE+L  ++EKGC PNET+Y LLIEG+ + G RAE
Sbjct: 483 SRFQPNIVSYNIILLGLCKANRVNDAIEVLAAIIEKGCQPNETTYTLLIEGVGFTGLRAE 542

Query: 541 AMELANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESK 576
           AM LAN+L+ +  IS DS KRLN+TFP+L+VYK  S S SK
Sbjct: 543 AMRLANALHYMNAISEDSFKRLNRTFPLLNVYKDFSFSNSK 583

BLAST of Csa1G666460 vs. TAIR10
Match: AT3G04760.1 (AT3G04760.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 723.0 bits (1865), Expect = 1.5e-208
Identity = 348/555 (62.70%), Positives = 442/555 (79.64%), Query Frame = 1

Query: 11  LHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDNRDAHLMKLLN 70
           L F+N  + P     RS S    R    T   + T   E RQ H  +L  RD  ++K+ +
Sbjct: 40  LTFSN--SNPNNDNGRSFSSSGARNLQTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFH 99

Query: 71  RSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDPD 130
           RSCR+G + ESL+ LE++V KG+ PDV+LCTKLIKGFF  RN+ KA+RVMEILE +G PD
Sbjct: 100 RSCRSGNYIESLHLLETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPD 159

Query: 131 VYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMD 190
           V++YNA+I+GF K N+ID A +V DRMRS+ FSPD VTYNIMIGSLCSRGKL+LA +V++
Sbjct: 160 VFAYNALINGFCKMNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLN 219

Query: 191 ELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEG 250
           +LL D C+P+VITYTILIEAT+LEG ++EAL+L DE++SRGL+PD++TYN IIRG+CKEG
Sbjct: 220 QLLSDNCQPTVITYTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEG 279

Query: 251 MEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHS 310
           M DRA + VR+L  +GC PDV+SYNILLR+ LN+ +WE+GE+LM  M    C+PNVVT+S
Sbjct: 280 MVDRAFEMVRNLELKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYS 339

Query: 311 ILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSD 370
           ILI++ CR+G++ EA+N+L++MKEKGLTPD+YSYDPLI+AFC+EGRLD+AIE+LE M+SD
Sbjct: 340 ILITTLCRDGKIEEAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISD 399

Query: 371 GCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKA 430
           GCLPDIVNYNT+LATLCK G AD AL++F KL EVGC P   +YNTMFSALWS G+KI+A
Sbjct: 400 GCLPDIVNYNTVLATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRA 459

Query: 431 LEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIVLLG 490
           L MI EM+  GIDPDEITYNS+ISCLCR+G+VDEA  LLVDM +  F P+V+++NIVLLG
Sbjct: 460 LHMILEMMSNGIDPDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLG 519

Query: 491 MCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGVISG 550
            CKAHR+ + I +L +MV  GC PNET+Y +LIEGI +AG+RAEAMELAN L R+  IS 
Sbjct: 520 FCKAHRIEDAINVLESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISE 579

Query: 551 DSSKRLNKTFPMLDV 566
            S KRL++TFP+L+V
Sbjct: 580 YSFKRLHRTFPLLNV 592

BLAST of Csa1G666460 vs. TAIR10
Match: AT1G09900.1 (AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 452.2 bits (1162), Expect = 4.8e-127
Identity = 223/489 (45.60%), Positives = 318/489 (65.03%), Query Frame = 1

Query: 69  LNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGD 128
           L +  R G+  E   FLE++V  G  PD++ CT LI+GF      +KA +++EILE  G 
Sbjct: 109 LRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGA 168

Query: 129 -PDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFE 188
            PDV +YN MISG+ KA +I++A  V DRM     SPDVVTYN ++ SLC  GKL+ A E
Sbjct: 169 VPDVITYNVMISGYCKAGEINNALSVLDRM---SVSPDVVTYNTILRSLCDSGKLKQAME 228

Query: 189 VMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGIC 248
           V+D +L+  C P VITYTILIEAT  +  +  A++L DE+  RG  PD+ TYN ++ GIC
Sbjct: 229 VLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGIC 288

Query: 249 KEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVV 308
           KEG  D A+ F+  + + GC P+V+++NI+LRS  +  RW D E+L+ DM+  G  P+VV
Sbjct: 289 KEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVV 348

Query: 309 THSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKM 368
           T +ILI+  CR+G +  A+++LE M + G  P+S SY+PL+  FCKE ++D AIEYLE+M
Sbjct: 349 TFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERM 408

Query: 369 VSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNK 428
           VS GC PDIV YNT+L  LCK G  + A+++  +L   GC P +  YNT+   L   G  
Sbjct: 409 VSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKT 468

Query: 429 IKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIV 488
            KA++++ EM  K + PD ITY+SL+  L R+G VDEAI    + E    +P  ++FN +
Sbjct: 469 GKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSI 528

Query: 489 LLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGV 548
           +LG+CK+ +    I+ L+ M+ +GC PNETSY +LIEG+AY G   EA+EL N L   G+
Sbjct: 529 MLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGL 588

Query: 549 ISGDSSKRL 557
           +   S++++
Sbjct: 589 MKKSSAEQV 594

BLAST of Csa1G666460 vs. TAIR10
Match: AT1G79080.1 (AT1G79080.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 292.4 bits (747), Expect = 6.4e-79
Identity = 170/487 (34.91%), Positives = 273/487 (56.06%), Query Frame = 1

Query: 79  NESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGD-PDVYSYNAM 138
           ++S   LES+V+ G KP+V   T+L+     +  LKKA+RV+E++ + G  PD  +Y  +
Sbjct: 88  SDSFSHLESLVTGGHKPNVAHSTQLLYDLCKANRLKKAIRVIELMVSSGIIPDASAYTYL 147

Query: 139 ISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGC 198
           ++   K   +  A Q+ ++M   G+  + VTYN ++  LC  G L  + + ++ L++ G 
Sbjct: 148 VNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTYNALVRGLCMLGSLNQSLQFVERLMQKGL 207

Query: 199 KPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALD 258
            P+  TY+ L+EA   E   +EA++L DE++ +G  P+L +YN ++ G CKEG  D A+ 
Sbjct: 208 APNAFTYSFLLEAAYKERGTDEAVKLLDEIIVKGGEPNLVSYNVLLTGFCKEGRTDDAMA 267

Query: 259 FVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFC 318
             R L A+G   +VVSYNILLR      RWE+   L+ +M      P+VVT++ILI+S  
Sbjct: 268 LFRELPAKGFKANVVSYNILLRCLCCDGRWEEANSLLAEMDGGDRAPSVVTYNILINSLA 327

Query: 319 REGRVREAVNVLEVMKEKG--LTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPD 378
             GR  +A+ VL+ M +        + SY+P+I+  CKEG++DL ++ L++M+   C P+
Sbjct: 328 FHGRTEQALQVLKEMSKGNHQFRVTATSYNPVIARLCKEGKVDLVVKCLDEMIYRRCKPN 387

Query: 379 IVNYNTILATLCKFGC-ADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMI 438
              YN I  +LC+       A  + + L       T   Y ++ ++L   GN   A +++
Sbjct: 388 EGTYNAI-GSLCEHNSKVQEAFYIIQSLSNKQKCCTHDFYKSVITSLCRKGNTFAAFQLL 447

Query: 439 SEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM-EATRFQPTVISFNIVLLGMCK 498
            EM R G DPD  TY++LI  LC +G+   A+ +L  M E+   +PTV +FN ++LG+CK
Sbjct: 448 YEMTRCGFDPDAHTYSALIRGLCLEGMFTGAMEVLSIMEESENCKPTVDNFNAMILGLCK 507

Query: 499 AHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLGVISGDSS 558
             R    +E+   MVEK  +PNET+Y +L+EGIA+      A E+ + L    VI  ++ 
Sbjct: 508 IRRTDLAMEVFEMMVEKKRMPNETTYAILVEGIAHEDELELAKEVLDELRLRKVIGQNAV 567

Query: 559 KRLNKTF 561
            R+   F
Sbjct: 568 DRIVMQF 573

BLAST of Csa1G666460 vs. TAIR10
Match: AT1G12300.1 (AT1G12300.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 269.2 bits (687), Expect = 5.8e-72
Identity = 141/470 (30.00%), Positives = 252/470 (53.62%), Query Frame = 1

Query: 67  KLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM-EILET 126
           +L +   +  +++  L   + +  KG   ++   + +I  F   R L  A   M +I++ 
Sbjct: 93  RLFSAIAKTKQYDLVLALCKQMELKGIAHNLYTLSIMINCFCRCRKLCLAFSAMGKIIKL 152

Query: 127 YGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELA 186
             +P+  +++ +I+G     ++  A ++ DRM   G  PD++T N ++  LC  GK   A
Sbjct: 153 GYEPNTITFSTLINGLCLEGRVSEALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEA 212

Query: 187 FEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRG 246
             ++D++++ GC+P+ +TY  ++      G+   A+EL  ++  R ++ D   Y+ II G
Sbjct: 213 MLLIDKMVEYGCQPNAVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDG 272

Query: 247 ICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPN 306
           +CK G  D A +    +  +G   ++++YNIL+  F N  RW+DG +L++DM+     PN
Sbjct: 273 LCKHGSLDNAFNLFNEMEMKGITTNIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPN 332

Query: 307 VVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLE 366
           VVT S+LI SF +EG++REA  + + M  +G+ PD+ +Y  LI  FCKE  LD A + ++
Sbjct: 333 VVTFSVLIDSFVKEGKLREAEELHKEMIHRGIAPDTITYTSLIDGFCKENHLDKANQMVD 392

Query: 367 KMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCG 426
            MVS GC P+I  +N ++   CK    D  L++F K+   G       YNT+       G
Sbjct: 393 LMVSKGCDPNIRTFNILINGYCKANRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELG 452

Query: 427 NKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFN 486
               A E+  EM+ + + P+ +TY  L+  LC +G  ++A+ +   +E ++ +  +  +N
Sbjct: 453 KLNVAKELFQEMVSRKVPPNIVTYKILLDGLCDNGESEKALEIFEKIEKSKMELDIGIYN 512

Query: 487 IVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEA 536
           I++ GMC A +V +  +L  ++  KG  P   +Y ++I G+   G  +EA
Sbjct: 513 IIIHGMCNASKVDDAWDLFCSLPLKGVKPGVKTYNIMIGGLCKKGPLSEA 562


HSP 2 Score: 241.5 bits (615), Expect = 1.3e-63
Identity = 137/504 (27.18%), Positives = 256/504 (50.79%), Query Frame = 1

Query: 68  LLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG 127
           ++N  CR  K   +   +  ++  G++P+ +  + LI G      + +A+ +++ +   G
Sbjct: 129 MINCFCRCRKLCLAFSAMGKIIKLGYEPNTITFSTLINGLCLEGRVSEALELVDRMVEMG 188

Query: 128 D-PDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAF 187
             PD+ + N +++G   + +   A  + D+M   G  P+ VTY  ++  +C  G+  LA 
Sbjct: 189 HKPDLITINTLVNGLCLSGKEAEAMLLIDKMVEYGCQPNAVTYGPVLNVMCKSGQTALAM 248

Query: 188 EVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGI 247
           E++ ++ +   K   + Y+I+I+     G ++ A  LF+E+  +G+  ++ TYN +I G 
Sbjct: 249 ELLRKMEERNIKLDAVKYSIIIDGLCKHGSLDNAFNLFNEMEMKGITTNIITYNILIGGF 308

Query: 248 CKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNV 307
           C  G  D     +R +  R  NP+VV++++L+ SF+ + +  + E L K+M+  G  P+ 
Sbjct: 309 CNAGRWDDGAKLLRDMIKRKINPNVVTFSVLIDSFVKEGKLREAEELHKEMIHRGIAPDT 368

Query: 308 VTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEK 367
           +T++ LI  FC+E  + +A  ++++M  KG  P+  +++ LI+ +CK  R+D  +E   K
Sbjct: 369 ITYTSLIDGFCKENHLDKANQMVDLMVSKGCDPNIRTFNILINGYCKANRIDDGLELFRK 428

Query: 368 MVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGN 427
           M   G + D V YNT++   C+ G  ++A ++F+++     PP +  Y  +   L   G 
Sbjct: 429 MSLRGVVADTVTYNTLIQGFCELGKLNVAKELFQEMVSRKVPPNIVTYKILLDGLCDNGE 488

Query: 428 KIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNI 487
             KALE+  ++ +  ++ D   YN +I  +C    VD+A  L   +     +P V ++NI
Sbjct: 489 SEKALEIFEKIEKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPLKGVKPGVKTYNI 548

Query: 488 VLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELANSLYRLG 547
           ++ G+CK   + E   L   M E G  P+  +Y +LI      G   ++++L   L R G
Sbjct: 549 MIGGLCKKGPLSEAELLFRKMEEDGHAPDGWTYNILIRAHLGDGDATKSVKLIEELKRCG 608

Query: 548 ----------VISGDSSKRLNKTF 561
                     VI   S  RL K+F
Sbjct: 609 FSVDASTIKMVIDMLSDGRLKKSF 632


HSP 3 Score: 179.9 bits (455), Expect = 4.6e-45
Identity = 116/454 (25.55%), Positives = 209/454 (46.04%), Query Frame = 1

Query: 85  LESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGDPDVYSYNAMISGFSKA 144
           L S VSK  +P ++    L     N  N + +         + D ++     + SG    
Sbjct: 9   LSSQVSKFVQPRLLETGTLRIALINCPN-ELSFCCERGFSAFSDRNLSYRERLRSGLVDI 68

Query: 145 NQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELLKDGCKPSVITY 204
              D+ +   D + SR   P V+ ++ +  ++    + +L   +  ++   G   ++ T 
Sbjct: 69  KADDAIDLFRDMIHSRPL-PTVIDFSRLFSAIAKTKQYDLVLALCKQMELKGIAHNLYTL 128

Query: 205 TILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALDFVRHLSA 264
           +I+I       ++  A     +++  G  P+  T++ +I G+C EG    AL+ V  +  
Sbjct: 129 SIMINCFCRCRKLCLAFSAMGKIIKLGYEPNTITFSTLINGLCLEGRVSEALELVDRMVE 188

Query: 265 RGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFCREGRVRE 324
            G  PD+++ N L+       +  +   L+  MV  GC+PN VT+  +++  C+ G+   
Sbjct: 189 MGHKPDLITINTLVNGLCLSGKEAEAMLLIDKMVEYGCQPNAVTYGPVLNVMCKSGQTAL 248

Query: 325 AVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPDIVNYNTILA 384
           A+ +L  M+E+ +  D+  Y  +I   CK G LD A     +M   G   +I+ YN ++ 
Sbjct: 249 AMELLRKMEERNIKLDAVKYSIIIDGLCKHGSLDNAFNLFNEMEMKGITTNIITYNILIG 308

Query: 385 TLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMISEMIRKGIDP 444
             C  G  D    +   + +    P V  ++ +  +    G   +A E+  EMI +GI P
Sbjct: 309 GFCNAGRWDDGAKLLRDMIKRKINPNVVTFSVLIDSFVKEGKLREAEELHKEMIHRGIAP 368

Query: 445 DEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIVLLGMCKAHRVFEGIELL 504
           D ITY SLI   C++  +D+A  ++  M +    P + +FNI++ G CKA+R+ +G+EL 
Sbjct: 369 DTITYTSLIDGFCKENHLDKANQMVDLMVSKGCDPNIRTFNILINGYCKANRIDDGLELF 428

Query: 505 ITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMEL 539
             M  +G + +  +Y  LI+G    G    A EL
Sbjct: 429 RKMSLRGVVADTVTYNTLIQGFCELGKLNVAKEL 460

BLAST of Csa1G666460 vs. TAIR10
Match: AT3G53700.1 (AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 269.2 bits (687), Expect = 5.8e-72
Identity = 142/469 (30.28%), Positives = 246/469 (52.45%), Query Frame = 1

Query: 76  GKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYGD--PDVYS 135
           G  + +L   E +V  G     V    ++ GF     ++ A+  ++ +       PD Y+
Sbjct: 238 GDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYT 297

Query: 136 YNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELL 195
           +N +++G  KA  +  A ++ D M   G+ PDV TYN +I  LC  G+++ A EV+D+++
Sbjct: 298 FNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMI 357

Query: 196 KDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMED 255
              C P+ +TY  LI     E ++ EA EL   L S+G+ PD+ T+N++I+G+C      
Sbjct: 358 TRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHR 417

Query: 256 RALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILI 315
            A++    + ++GC PD  +YN+L+ S  +K + ++   ++K M LSGC  +V+T++ LI
Sbjct: 418 VAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLI 477

Query: 316 SSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCL 375
             FC+  + REA  + + M+  G++ +S +Y+ LI   CK  R++ A + +++M+ +G  
Sbjct: 478 DGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQK 537

Query: 376 PDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEM 435
           PD   YN++L   C+ G    A D+ + +   GC P +  Y T+ S L   G    A ++
Sbjct: 538 PDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKL 597

Query: 436 ISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM-EATRFQPTVISFNIVLLGMC 495
           +  +  KGI+     YN +I  L R     EAI L  +M E     P  +S+ IV  G+C
Sbjct: 598 LRSIQMKGINLTPHAYNPVIQGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLC 657

Query: 496 K-AHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 541
                + E ++ L+ ++EKG +P  +S  +L EG+         ++L N
Sbjct: 658 NGGGPIREAVDFLVELLEKGFVPEFSSLYMLAEGLLTLSMEETLVKLVN 706


HSP 2 Score: 244.6 bits (623), Expect = 1.5e-64
Identity = 131/449 (29.18%), Positives = 236/449 (52.56%), Query Frame = 1

Query: 92  GFKPDVVLCTKLIKGFFNSRNLKKAMRVMEILETYG-DPDVYSYNAMISGFSKANQIDSA 151
           G KPDV     LIK    +  L+ A+ ++E + +YG  PD  ++  ++ G+ +   +D A
Sbjct: 184 GIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGA 243

Query: 152 NQVFDRMRSRGFSPDVVTYNIMIGSLCSRGKLELAFEVMDELL-KDGCKPSVITYTILIE 211
            ++ ++M   G S   V+ N+++   C  G++E A   + E+  +DG  P   T+  L+ 
Sbjct: 244 LRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVN 303

Query: 212 ATILEGRINEALELFDELVSRGLRPDLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNP 271
                G +  A+E+ D ++  G  PD+YTYN++I G+CK G    A++ +  +  R C+P
Sbjct: 304 GLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSP 363

Query: 272 DVVSYNILLRSFLNKSRWEDGERLMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVL 331
           + V+YN L+ +   +++ E+   L + +   G  P+V T + LI   C     R A+ + 
Sbjct: 364 NTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELF 423

Query: 332 EVMKEKGLTPDSYSYDPLISAFCKEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKF 391
           E M+ KG  PD ++Y+ LI + C +G+LD A+  L++M   GC   ++ YNT++   CK 
Sbjct: 424 EEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKA 483

Query: 392 GCADLALDVFEKLDEVGCPPTVRAYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITY 451
                A ++F++++  G       YNT+   L        A +++ +MI +G  PD+ TY
Sbjct: 484 NKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTY 543

Query: 452 NSLISCLCRDGLVDEAIGLLVDMEATRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVE 511
           NSL++  CR G + +A  ++  M +   +P ++++  ++ G+CKA RV    +LL ++  
Sbjct: 544 NSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQM 603

Query: 512 KGCLPNETSYVLLIEGIAYAGWRAEAMEL 539
           KG      +Y  +I+G+       EA+ L
Sbjct: 604 KGINLTPHAYNPVIQGLFRKRKTTEAINL 632


HSP 3 Score: 222.2 bits (565), Expect = 8.1e-58
Identity = 135/490 (27.55%), Positives = 248/490 (50.61%), Query Frame = 1

Query: 56  PNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKK 115
           PN     A   ++L R  R+G  ++    LE + S   +        LI+ +       +
Sbjct: 77  PNFSPEPALYEEILLRLGRSGSFDDMKKILEDMKSSRCEMGTSTFLILIESYAQFELQDE 136

Query: 116 AMRVME-ILETYG-DPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMI 175
            + V++ +++ +G  PD + YN M++     N +        +M   G  PDV T+N++I
Sbjct: 137 ILSVVDWMIDEFGLKPDTHFYNRMLNLLVDGNSLKLVEISHAKMSVWGIKPDVSTFNVLI 196

Query: 176 GSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLR 235
            +LC   +L  A  +++++   G  P   T+T +++  I EG ++ AL + +++V  G  
Sbjct: 197 KALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCS 256

Query: 236 PDLYTYNAIIRGICKEGMEDRALDFVRHLSAR-GCNPDVVSYNILLRSFLNKSRWEDGER 295
               + N I+ G CKEG  + AL+F++ +S + G  PD  ++N L+         +    
Sbjct: 257 WSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHAIE 316

Query: 296 LMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFC 355
           +M  M+  G +P+V T++ +IS  C+ G V+EAV VL+ M  +  +P++ +Y+ LIS  C
Sbjct: 317 IMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLC 376

Query: 356 KEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVR 415
           KE +++ A E    + S G LPD+  +N+++  LC      +A+++FE++   GC P   
Sbjct: 377 KENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEF 436

Query: 416 AYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM 475
            YN +  +L S G   +AL M+ +M   G     ITYN+LI   C+     EA  +  +M
Sbjct: 437 TYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDEM 496

Query: 476 EATRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWR 535
           E        +++N ++ G+CK+ RV +  +L+  M+ +G  P++ +Y  L+      G  
Sbjct: 497 EVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDI 556

Query: 536 AEAMELANSL 543
            +A ++  ++
Sbjct: 557 KKAADIVQAM 566

BLAST of Csa1G666460 vs. NCBI nr
Match: gi|449449675|ref|XP_004142590.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis sativus])

HSP 1 Score: 1161.7 bits (3004), Expect = 0.0e+00
Identity = 581/581 (100.00%), Positives = 581/581 (100.00%), Query Frame = 1

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN
Sbjct: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA
Sbjct: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of Csa1G666460 vs. NCBI nr
Match: gi|659086091|ref|XP_008443759.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Cucumis melo])

HSP 1 Score: 1119.8 bits (2895), Expect = 0.0e+00
Identity = 559/581 (96.21%), Positives = 570/581 (98.11%), Query Frame = 1

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRNVTSSAEFRQPHFPNLDN 60
           MFSSEFLPQSLHFTNPL+KPTIPQS SDSIP  RFSNKT+LRNVTSSAE RQPHFPNLDN
Sbjct: 1   MFSSEFLPQSLHFTNPLSKPTIPQSHSDSIPTRRFSNKTYLRNVTSSAESRQPHFPNLDN 60

Query: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAMRVM 120
           RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKA+RVM
Sbjct: 61  RDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKKAVRVM 120

Query: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGSLCSRG 180
           EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPD+VTYNIMIGSLCSRG
Sbjct: 121 EILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDIVTYNIMIGSLCSRG 180

Query: 181 KLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPDLYTYN 240
           KL LAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDEL+SRGLRPDLYTYN
Sbjct: 181 KLALAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELLSRGLRPDLYTYN 240

Query: 241 AIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMKDMVLS 300
           AIIRGICKEGMEDRA+DFVR LSARGCNPDVVSYNILLRSFLNKSRWEDGE+LMKDMVLS
Sbjct: 241 AIIRGICKEGMEDRAVDFVRDLSARGCNPDVVSYNILLRSFLNKSRWEDGEKLMKDMVLS 300

Query: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEGRLDLA 360
           GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPD+YSYDPLISAFCKEGRLDLA
Sbjct: 301 GCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDAYSYDPLISAFCKEGRLDLA 360

Query: 361 IEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYNTMFSA 420
           IEYL+KMVSDGCLPDIVNYNTILATLCKFGCADLALD+FEKLDEVGCPP V +YNTMFSA
Sbjct: 361 IEYLDKMVSDGCLPDIVNYNTILATLCKFGCADLALDIFEKLDEVGCPPNVSSYNTMFSA 420

Query: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATRFQPT 480
           LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT FQPT
Sbjct: 421 LWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEATSFQPT 480

Query: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEAMELAN 540
           VISFNIVLLGMCKAHRVFEGIELLITMVEKGC PNETSYVLLIEGIAYAGWRAEAMELAN
Sbjct: 481 VISFNIVLLGMCKAHRVFEGIELLITMVEKGCPPNETSYVLLIEGIAYAGWRAEAMELAN 540

Query: 541 SLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 582
           SLYRLGVIS DSSKRLNKTFPMLDVYKGLSLSESKNQLLQS
Sbjct: 541 SLYRLGVISEDSSKRLNKTFPMLDVYKGLSLSESKNQLLQS 581

BLAST of Csa1G666460 vs. NCBI nr
Match: gi|645222808|ref|XP_008218329.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Prunus mume])

HSP 1 Score: 817.4 bits (2110), Expect = 1.6e-233
Identity = 392/581 (67.47%), Positives = 491/581 (84.51%), Query Frame = 1

Query: 3   SSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRN-------VTSSAEFRQPHF 62
           S++ LPQ+LH T    K T     +++  +CR    ++ RN       V  SAE +  H 
Sbjct: 12  STKLLPQNLHATTAQLKLTSHSPHTNAAVSCRVCISSNGRNSSRNLIKVMVSAETKPTHL 71

Query: 63  PNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLKK 122
           PN D ++ HL+K+L+RSC+AG+ NESLYFLE +V+KG+KPDV+LCTKLIKGFFNSRN++K
Sbjct: 72  PNYDIKETHLIKVLSRSCKAGQFNESLYFLELMVNKGYKPDVILCTKLIKGFFNSRNIEK 131

Query: 123 AMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIGS 182
           A+RVM+ILE YG+PD++SYNA+ISGF KAN+I+SAN+V DRMRS+GFSPDVVTYNIMIGS
Sbjct: 132 AIRVMQILEKYGEPDLFSYNALISGFCKANRIESANKVLDRMRSQGFSPDVVTYNIMIGS 191

Query: 183 LCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRPD 242
           LC+RGKL LA +VMD+L+KD C+P+VITYTILIEATI++G I+EA++L DE++SRGL+PD
Sbjct: 192 LCTRGKLGLALKVMDQLVKDNCRPTVITYTILIEATIVDGGIDEAMKLLDEMLSRGLKPD 251

Query: 243 LYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLMK 302
           +YTYNA+IRG+C++GM DRA  FVR L ++GC P+V+SYNILLR+ LN+ +WE+GE+L+ 
Sbjct: 252 MYTYNAVIRGMCRDGMLDRAFQFVRSLDSKGCPPNVISYNILLRALLNRGKWEEGEKLVT 311

Query: 303 DMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKEG 362
           +M   GCEPNVVT+SILIS+ CR+G+V +AVNVL++MK+KGLTPD+YSYDPL+SAFCKEG
Sbjct: 312 NMCSRGCEPNVVTYSILISTLCRDGKVEDAVNVLKIMKKKGLTPDAYSYDPLVSAFCKEG 371

Query: 363 RLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAYN 422
           RLDLAIE+L+ M+SDGCLPDIVNYNTILA LCK G AD AL +FE L EVGCPP V +YN
Sbjct: 372 RLDLAIEFLDYMISDGCLPDIVNYNTILAALCKSGKADQALQIFENLGEVGCPPNVSSYN 431

Query: 423 TMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEAT 482
           TMFSALW+CG++++AL M+SEMI KGI PDEITYNSLISCLCRDG+VDEAIGLLVDME  
Sbjct: 432 TMFSALWNCGDRVRALGMVSEMIGKGIKPDEITYNSLISCLCRDGMVDEAIGLLVDMETG 491

Query: 483 RFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAEA 542
            FQPTVIS+NI+LLG+CK  RV + I++L  MVEKGC PNET+Y+LLIEGI +AGWRAEA
Sbjct: 492 GFQPTVISYNILLLGLCKTRRVVDAIQVLTEMVEKGCRPNETTYILLIEGIGFAGWRAEA 551

Query: 543 MELANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKN 577
           MELANS++ L  IS DS KRLN+TFPMLDV+K L+LSE KN
Sbjct: 552 MELANSVFSLRAISEDSFKRLNRTFPMLDVFKELTLSEIKN 592

BLAST of Csa1G666460 vs. NCBI nr
Match: gi|1009117842|ref|XP_015875536.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 810.4 bits (2092), Expect = 2.0e-231
Identity = 397/582 (68.21%), Positives = 489/582 (84.02%), Query Frame = 1

Query: 1   MFSSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFS--NKT----HLRNVTSSAEFRQPH 60
           + S+EFLP SL F + L KP      + +  +CR    N+T    + R    SA+ R   
Sbjct: 3   IISTEFLPHSLPFASHL-KPISNLHPNTTAVSCRIHGLNETIKSRNQRKAKVSADTRSTQ 62

Query: 61  FPNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRNLK 120
           F   D ++ HLMK+LNRSC+AGK+NE+LYFLE +V+KG+KPDV+LCTK+I+GFF+SRN++
Sbjct: 63  FQTNDFKETHLMKVLNRSCKAGKYNEALYFLELLVNKGYKPDVILCTKIIRGFFSSRNIE 122

Query: 121 KAMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIMIG 180
           KA+RVMEILE YG+PD+++YNA+ISGFSKAN+I+ AN+V DRMRS+GFSPD++TYNIMIG
Sbjct: 123 KAIRVMEILEKYGEPDLFAYNALISGFSKANRIELANKVLDRMRSQGFSPDIITYNIMIG 182

Query: 181 SLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGLRP 240
           SLCSRGKLE A +VMD+LLKD CKP+VITYTILIEATI E  I+EA++L +E++SRGL P
Sbjct: 183 SLCSRGKLESALKVMDKLLKDNCKPTVITYTILIEATISERGIDEAMKLLEEMLSRGLLP 242

Query: 241 DLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGERLM 300
           D++TYNAI+RG+C+EGM DRA DFVR L A+GC PDV+SYNILLR+ LN+ +W++GE+L+
Sbjct: 243 DMFTYNAIMRGMCREGMVDRAFDFVRSLDAKGCGPDVISYNILLRALLNRGKWDEGEKLL 302

Query: 301 KDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFCKE 360
            DMV  GCEPNVVT+SILISS C EG+V +AVNVL+ MKEK LTPD+YSYDPLISAFCKE
Sbjct: 303 SDMVSRGCEPNVVTYSILISSLCNEGKVEDAVNVLKAMKEKRLTPDAYSYDPLISAFCKE 362

Query: 361 GRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVRAY 420
           GRLDLAIE+++ M+SDG LPDIVNYNTILA LCK G AD AL +F+ LD VGCPP V +Y
Sbjct: 363 GRLDLAIEFMDYMISDGSLPDIVNYNTILAALCKRGNADRALQIFQNLDVVGCPPNVSSY 422

Query: 421 NTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDMEA 480
           NTMFSALW CG++I+ALEMISEM+ KGIDPDEITYNSLISCLCRDG+V+EAIGLLVDME+
Sbjct: 423 NTMFSALWGCGDRIRALEMISEMVSKGIDPDEITYNSLISCLCRDGMVNEAIGLLVDMES 482

Query: 481 TRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWRAE 540
             F P+VIS+NI+LLG+CKA R+ + IE+L  MVEKGCLPNET+++LLIEGI +AGWRAE
Sbjct: 483 GGFPPSVISYNILLLGLCKARRIDDAIEVLAAMVEKGCLPNETTFILLIEGIGFAGWRAE 542

Query: 541 AMELANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKN 577
           AMELA+SL+ L  IS DS KRLNKTFPMLDVYK L+LSE KN
Sbjct: 543 AMELASSLFGLKAISEDSFKRLNKTFPMLDVYKELTLSEIKN 583

BLAST of Csa1G666460 vs. NCBI nr
Match: gi|470138050|ref|XP_004304772.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic [Fragaria vesca subsp. vesca])

HSP 1 Score: 801.2 bits (2068), Expect = 1.2e-228
Identity = 386/584 (66.10%), Positives = 485/584 (83.05%), Query Frame = 1

Query: 3   SSEFLPQSLHFTNPLAKPTIPQSRSDSIPACRFSNKTHLRN----------VTSSAEFRQ 62
           S+E LP S H T+ L KPT   S   +  +CR S+ + + N          V+ SAE + 
Sbjct: 5   STELLPHSFHTTSQL-KPT-SHSHHPTALSCRASSASSISNGRNSSRNPTRVSVSAEPKS 64

Query: 63  PHFPNLDNRDAHLMKLLNRSCRAGKHNESLYFLESVVSKGFKPDVVLCTKLIKGFFNSRN 122
               N D +D HLMK+LNRSC+AG++NE++YFLE +V+KG+KPDV+LCTKLIKGFFNSRN
Sbjct: 65  TQLQNYDFKDTHLMKVLNRSCKAGQYNEAIYFLELMVNKGYKPDVILCTKLIKGFFNSRN 124

Query: 123 LKKAMRVMEILETYGDPDVYSYNAMISGFSKANQIDSANQVFDRMRSRGFSPDVVTYNIM 182
           ++KA+RVM+ILE YG+PD+++YNA+ISGF KAN+I+SAN+V DRM+S+GF PDVVTYNIM
Sbjct: 125 IEKAIRVMQILEQYGEPDLFAYNALISGFCKANRIESANKVLDRMKSQGFKPDVVTYNIM 184

Query: 183 IGSLCSRGKLELAFEVMDELLKDGCKPSVITYTILIEATILEGRINEALELFDELVSRGL 242
           IGSLCSRGKL LA +VMD L++D CKP+VITYTILIEA IL+G INEA++L DE++SRGL
Sbjct: 185 IGSLCSRGKLGLALQVMDRLVRDNCKPTVITYTILIEAIILDGGINEAMKLLDEMLSRGL 244

Query: 243 RPDLYTYNAIIRGICKEGMEDRALDFVRHLSARGCNPDVVSYNILLRSFLNKSRWEDGER 302
           +PD+YTYNAI+RG+C+EGM DRA +FV+   A+GC P+V+SYNILLR+ LN+ +WE+GE 
Sbjct: 245 KPDMYTYNAIVRGMCREGMLDRAFEFVKCFDAKGCAPNVISYNILLRALLNRGKWEEGEN 304

Query: 303 LMKDMVLSGCEPNVVTHSILISSFCREGRVREAVNVLEVMKEKGLTPDSYSYDPLISAFC 362
           L+ +M   GCEPNVVT+SILIS+ CR+G+V + +NVL++MKEKGLTPD+YSYDPLIS FC
Sbjct: 305 LVANMCARGCEPNVVTYSILISTLCRDGKVEDGMNVLKIMKEKGLTPDAYSYDPLISCFC 364

Query: 363 KEGRLDLAIEYLEKMVSDGCLPDIVNYNTILATLCKFGCADLALDVFEKLDEVGCPPTVR 422
           KEGRLDLAIE L+ M+SDGCLPDIVNYNT+LA LCK G AD AL++FE L EVGCPP V 
Sbjct: 365 KEGRLDLAIELLDCMISDGCLPDIVNYNTVLAALCKNGSADQALEIFENLGEVGCPPNVS 424

Query: 423 AYNTMFSALWSCGNKIKALEMISEMIRKGIDPDEITYNSLISCLCRDGLVDEAIGLLVDM 482
           +YNTMFSALW+CG++++AL M+S+M+ KGI+PDEITYNSLISCLCRDG+V+EAIGLLVDM
Sbjct: 425 SYNTMFSALWNCGDRVRALGMVSDMVSKGIEPDEITYNSLISCLCRDGMVNEAIGLLVDM 484

Query: 483 EATRFQPTVISFNIVLLGMCKAHRVFEGIELLITMVEKGCLPNETSYVLLIEGIAYAGWR 542
           EA  FQPTVI++NIVLLG+ KA R+ + IE+   MVEKGC PNET+Y+LLIEGI +AGWR
Sbjct: 485 EAGGFQPTVITYNIVLLGLSKARRIVDAIEVFTAMVEKGCRPNETTYILLIEGIGFAGWR 544

Query: 543 AEAMELANSLYRLGVISGDSSKRLNKTFPMLDVYKGLSLSESKN 577
           AEAMELA S+Y L  I  DS KRL++TFPMLDVYK L+LSE +N
Sbjct: 545 AEAMELAKSVYSLSAICEDSFKRLSRTFPMLDVYKELTLSEIEN 586
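
The percentages in the HSP headers above follow from the match counts divided by the alignment length. The snippet below is a minimal sketch of that arithmetic, not part of the database's own tooling; the function names `parse_hsp_stats` and `percent` are hypothetical, and the regular expression assumes the `label = n/length (pct%)` layout seen in the headers on this page.

```python
import re

# Matches fragments such as "Identity = 397/582 (68.21%)".
# Assumption: the header always uses this "label = n/length (pct%)" layout.
HSP_STATS = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, reported_percent)}."""
    return {
        label: (int(n), int(total), float(pct))
        for label, n, total, pct in HSP_STATS.findall(line)
    }


def percent(matches, length):
    """Percentage of aligned positions that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


# Counts copied from the Ziziphus jujuba HSP header on this page.
header = "Identity = 397/582 (68.21%), Query Frame = 1"
for label, (n, total, reported) in parse_hsp_stats(header).items():
    print(label, percent(n, total), reported)
```

The same function applied to the Fragaria vesca counts (386/584 identical, 485/584 positive) reproduces the reported 66.10% and 83.05%.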

The following BLAST results are available for this feature:
Match Name     E-value     Identity (%)   Description
PP213_ARATH    2.6e-207    62.70          Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop...
PPR28_ARATH    8.5e-126    45.60          Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN...
PP131_ARATH    1.1e-77     34.91          Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidop...
PPR36_ARATH    1.0e-70     30.00          Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidop...
PP281_ARATH    1.0e-70     30.28          Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...

Match Name          E-value     Identity (%)   Description
A0A0A0M3C6_CUCSA    0.0e+00     100.00         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G666460 PE=4 SV=1
A0A061F8R7_THECC    1.1e-228    67.35          Pentatricopeptide repeat (PPR-like) superfamily protein OS=Theobroma cacao GN=TC...
A0A0D2TML8_GOSRA    1.2e-227    66.55          Uncharacterized protein OS=Gossypium raimondii GN=B456_007G275900 PE=4 SV=1
W9RHZ3_9ROSA        1.3e-226    66.38          Uncharacterized protein OS=Morus notabilis GN=L484_024505 PE=4 SV=1
A0A067JIP8_JATCU    5.2e-223    64.89          Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23550 PE=4 SV=1

Match Name     E-value     Identity (%)   Description
AT3G04760.1    1.5e-208    62.70          Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G09900.1    4.8e-127    45.60          Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G79080.1    6.4e-79     34.91          Pentatricopeptide repeat (PPR) superfamily protein
AT1G12300.1    5.8e-72     30.00          Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G53700.1    5.8e-72     30.28          Pentatricopeptide repeat (PPR) superfamily protein

Match Name                           E-value     Identity (%)   Description
gi|449449675|ref|XP_004142590.1|     0.0e+00     100.00         PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic ...
gi|659086091|ref|XP_008443759.1|     0.0e+00     96.21          PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic ...
gi|645222808|ref|XP_008218329.1|     1.6e-233    67.47          PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic ...
gi|1009117842|ref|XP_015875536.1|    2.0e-231    68.21          PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic ...
gi|470138050|ref|XP_004304772.1|     1.2e-228    66.10          PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic ...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR002885    Pentatricopeptide_repeat
IPR011990    TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term         Definition
GO:0005515   protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0009507 chloroplast
molecular_function GO:0005515 protein binding
This gene is associated with the following unigenes:
Unigene Name   Analysis Name                         Sequence type in Unigene
CU163667       cucumber EST collection version 3.0   transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Csa1G666460.1   Csa1G666460.1   mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name   Unique Name   Type
CU163667       CU163667      transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term    IPR Description                         Source     Source Term        Source Description    Alignment (coord, score)
IPR002885   Pentatricopeptide repeat                PFAM       PF12854            PPR_1
    coord: 440..473, score: 1.7E-13
    coord: 335..368, score: 1.
IPR002885   Pentatricopeptide repeat                PFAM       PF13041            PPR_2
    coord: 214..248, score: 1.3E-9
    coord: 479..525, score: 3.5E-8
    coord: 269..318, score: 4.1E-15
    coord: 129..178, score: 7.4E-19
    coord: 374..421, score: 1.
IPR002885   Pentatricopeptide repeat                PFAM       PF13812            PPR_3
    coord: 85..122, score: 4.9E-6
    coord: 179..210, score: 2.
IPR002885   Pentatricopeptide repeat                TIGRFAMs   TIGR00756          TIGR00756
    coord: 447..480, score: 2.2E-8
    coord: 342..376, score: 3.0E-9
    coord: 307..341, score: 7.0E-9
    coord: 482..516, score: 7.7E-5
    coord: 132..166, score: 5.3E-11
    coord: 202..235, score: 1.1E-6
    coord: 272..306, score: 4.1E-8
    coord: 413..446, score: 4.7E-4
    coord: 379..411, score: 3.1E-5
    coord: 167..201, score: 1.0E-9
    coord: 237..271, score: 1.
IPR002885   Pentatricopeptide repeat                PROFILE    PS51375            PPR
    coord: 130..164, score: 13.943
    coord: 480..514, score: 9.942
    coord: 375..409, score: 11.181
    coord: 96..126, score: 6.697
    coord: 165..199, score: 13.362
    coord: 515..549, score: 6.982
    coord: 445..479, score: 12.54
    coord: 305..339, score: 13.208
    coord: 410..444, score: 10.523
    coord: 340..374, score: 12.967
    coord: 61..95, score: 7.278
    coord: 270..304, score: 11.937
    coord: 235..269, score: 12.057
    coord: 200..234, score: 11
IPR011990   Tetratricopeptide-like helical domain   GENE3D     G3DSA:1.25.40.10
    coord: 176..370, score: 5.3E-10
    coord: 407..467, score: 5.3
None        No IPR available                        PANTHER    PTHR24015          FAMILY NOT NAMED
    coord: 65..556, score:
None        No IPR available                        PANTHER    PTHR24015:SF572    SUBFAMILY NOT NAMED
    coord: 65..556, score:
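
The InterPro hits are listed in the order the page returned them rather than by position along the protein. The snippet below is a rough sketch of how the `coord`/`score` pairs could be pulled out of the text and sorted by start coordinate; `parse_hits` is a hypothetical helper, the regular expression is an assumption about the layout seen here, and scores are kept as strings because some values on this page appear truncated. The sample text uses four of the PS51375 hits listed above.

```python
import re

# Assumed layout: "coord: START..END" followed by "score: VALUE", separated by
# whitespace, a newline, or a comma. Scores are kept as text, not parsed.
HIT = re.compile(r"coord:\s*(\d+)\.\.(\d+)[,\s]*score:\s*([0-9.Ee+-]*)")


def parse_hits(text):
    """Return (start, end, score_text) tuples sorted by start coordinate."""
    hits = [(int(start), int(end), score) for start, end, score in HIT.findall(text)]
    return sorted(hits)


# Four PS51375 (PPR profile) hits copied from the table above.
raw = (
    "coord: 130..164\nscore: 13.943coord: 480..514\nscore: 9.942"
    "coord: 96..126\nscore: 6.697coord: 61..95\nscore: 7.278"
)

for start, end, score in parse_hits(raw):
    # Length is inclusive of both end points.
    print(f"{start}..{end} length={end - start + 1} score={score}")
```

Sorted this way, the PS51375 hits read as a run of adjacent repeats of roughly 35 residues each, starting at 61..95 and 96..126.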

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None