Csa1G418260.1 (mRNA) Cucumber (Chinese Long) v2

Name: Csa1G418260.1
Type: mRNA
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Pentatricopeptide repeat-containing protein, putative; contains IPR002885 (Pentatricopeptide repeat), IPR011990 (Tetratricopeptide-like helical), IPR015943 (WD40/YVTN repeat-like-containing domain)
Location: Chr1 : 15140418 .. 15144363 (-)
Sequence length: 2436
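
The location field gives inclusive start and end coordinates on Chr1 plus a strand sign. A minimal sketch of reading that field and computing the genomic span follows; the function name `parse_location` and the returned dictionary keys are illustrative assumptions, not part of the database. The genomic span includes introns, so it is expected to be larger than the spliced sequence length of 2436.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr1 : 15140418 .. 15144363 (-)'.

    Hypothetical helper: the field layout is inferred from this record only.
    """
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Coordinates are treated as 1-based and inclusive, hence the +1.
        "span": end - start + 1,
    }

loc = parse_location("Chr1 : 15140418 .. 15144363 (-)")
print(loc["chrom"], loc["strand"], loc["span"])
```
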
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATAAGAAGAGAGTGGCTATTCCTCTCGTTTGCCATGGTCATTCTCGACCGATTGTCGATCTATTTTCTAGCCTTGTTACGCCCGATGGATTCTTCCTCGTCAGTGCCAGCAAGGGTATGATTCAAGATTTAAAACTTTTGCTTTGATTTAAGACTTTAATTTATGTATCGCCAAATTTTCTGGATATTAATTATGATTTTGGATCTTTTTTTTTTTGTCCAGTGTCAGTTCTTTTGACTTTGATTGATAGTTGTCCTAGATGTGGGTATTGCTATTTTAGCTTTCTTTGATGAGAGATTTGTTGAATGAATCCCATGTCGTTTGCTCAAAGCTTAATCCGGAGTTTACTTTTATGGGTTGCCTTATAAATATGTTATTCCCATTCAGATTCCAATCCCATGCTTAGAAATGGTGAAAATGGTGATTGGATTGGTACATTTGAAGGTCACAAAGGCGCAGTTTGGAGTTGTTGTCTGGATACCAATGCTTTACGTGCTGCTACTGGTTCTGCTGATTTTTCAGCGTATGTCCTTCTCATTGGTTTTCATGATACGTTTTAAGAACTGACATATCATTGACTTTTCAGTGTCATAACATTGTATGCGGTATATAATTAAATTAGATATCTTCTCTCAGAACTGAGGAGAGGAAATCTTAAATAATGCTTATAGCATTGATTGGGAGTAGCAATTTTTCATAGCTTTTTTAGATTAACCGATCCGTGTAGATCTGGAATTTTTACTTATAATAAATCATCGTATCAAATTCTAATTAATTTGATTCCCATCTTCTGAGTTCTGCTTTTTGGTCTGTGTATTCTTCTATATATACATTTAGCATTTTCAATTTCCTATGTGTTTCCTCGTTATGATTATCAAATATTGCTTGCTTTTATTTGAGTGATGGCACTCAGGCCAACTTTCTTTTTGACACAAGTTGATGAATTATGTTTTATTTCTAGAGGGATTGAAGGAAAGAAAAAAAAAGAATTGAAAAGGGTCCTAGATTTCTAATCCAGTGGAAAGTGCGTATTCTGTTTACTTTTTAAATTTCTTGATTCAGTGGGTTATACACAAAGATGAATTGGGATGTATGGGTACTCAATGATTTTATTTTGACTTTCTTTGAAATGAATCTATGATCATGAAAGATCGTTTTATAGGTCCTTTTTTTAATAGCTAAAAATAATATCAATGAATAAAATCAGTTATTAATTTTATTTTGACTTTCTTGCTATTGCATTTGCACCATGACTTTTTGGCTGAAGTACTCATTGTGGTCTTTTTCCATTCCCCCAAGCATTTGCTTCCTCTAGACTGCACCACCTTCTTACTCCTCACCCTTTTCTTGGGCATATGGTAGAGTTTGTCGTAATTATTTTGTGATTGATTATTCGTAAGCTTTATTTTGATGACAGGATATTGTTTGTTAAATGTTACTCCATTTCCTATAATAATAATAAGTAAATAACACCTTATTTGAAAGGAAAGTTAAATGTTTCTTTCTAGTGATTTTGTGGACAATGAAGATTGAAATCATTCCTGCATATTGTGATTAAACTTACAGGAAAGTATAGGATGCATTAACCTGGAGGTGTAGTGCACTCATTCGAGCAGAAGCATTTTGATTGTGCCTTTGCTTTTTCAGAGGTAATTAAGCATGCTTTTATTAGTGGCATATCTTTATCACTTTCTTGTTACACACACATCCCTGCATAAGATAATATCCTATGAACTAACGTGTTCTTTCTGTTTGTTTTATTAGATTTTCCAATTCTTCAAACACTCACAAAGCTCGCACATGCTTCCTTTATCAATCATTCGAAGAGTCCAGTTTATATCCCGTCATTTTTCTTCAAGCCCCCATTTAGTTCCAGTTCTGCTTCGAATCTCCAAACTAACAAAGAAATCATGCATCGAGTGTCTTCGGAACTGCAAGTCCATGGATCAACTCAAACAAATTCAGAGTCAGATCTTTCGAATTGGTCTTGAAGGAG
ACAGGGACACAATAAACAAATTGATGGCATTCTGTGCAGACTCATCTCTTGGCAACTTGCGCTATGCAGAGAAGATATTCAATTATGTTCAAGACCCATCTCTGTTTGTTTATAATGTGATGGTTAAAATGTATGCCAAAAGGGGTATTCTCAGAAAAGTCCTTTTGCTCTTTCAACAGTTGAGGGAAGATGGATTGTGGCCTGATGGTTTTACTTACCCATTTGTTCTGAAAGCTATTGGTTGCTTACGGGACGTGAGGCAAGGTGAAAAGGTTCGTGGCTTTATAGTGAAGACAGGAATGGATTTGGATAATTATGTCTATAATTCACTTATAGATATGTATTATGAATTAAGCAATGTTGAGAATGCTAAGAAGTTATTTGATGAAATGACGACTAGAGATTCGGTTTCTTGGAATGTTATGATTTCTGGGTATGTTAGGTGTCGTAGATTTGAGGATGCTATCAATACATTTAGGGAAATGCAGCAAGAGGGCAATGAGAAACCTGATGAAGCTACTGTAGTTAGCACTCTTTCTGCTTGTACAGCACTGAAAAATCTGGAGCTTGGAGATGAAATTCACAACTATGTTAGAAAGGAGCTTGGTTTTACCACTCGAATCGACAATGCCTTATTAGATATGTATGCAAAATGTGGTTGTCTAAATATTGCCCGCAACATATTTGATGAAATGTCTATGAAAAATGTAATTTGTTGGACTAGCATGATCTCTGGCTATATAAACTGTGGTGATTTAAGAGAGGCTAGAGACTTGTTTGACAAAAGTCCAGTTAGAGATGTTGTTTTGTGGACAGCTATGATAAATGGGTATGTGCAGTTCCACCATTTTGATGATGCTGTGGCTCTGTTTCGCGAAATGCAAATTCAAAGGGTAAAACCTGATAAGTTCACAGTGGTCACTCTCCTCACAGGTTGCGCGCAGTTGGGAGCTCTAGAACAAGGGAAATGGATTCATGGATACCTAGATGAAAACAGAATAACAATGGATGTGGTTGTTGGTACTGCACTCATTGAAATGTATTCCAAATGTGGTTGTGTAGATAAATCATTAGAAATTTTCTATGAGTTAGAAGACAAGGACACGGCATCTTGGACCTCAATTATATGCGGTCTTGCAATGAATGGTAAGACGAGTGAAGCACTCCGGCTGTTCTCAGAAATGGAACGTGTGGGGGCTAAACCTGATGATATCACCTTCATTGGAGTTTTAAGTGCCTGTAGTCATGGTGGGCTGGTTGAGGAAGGGCGTAGGTTTTTCAACTCGATGAAAAAGGTTCACCGAATTGAGCCGAAGGTAGAGCATTATGGGTGTGTAATTGACCTTCTTGGTAGAGCTGGGTTATTGGATGAAGCAGAGGAACTCATACAAGAGATTCCGATTGAAAATTGCGAAATTGTGGTTCCTCTGTATGGTGCTTTGCTCAGTGCTTGTAGAATCCACAATAATGTTGACATGGGTGAAAGGCTGGCCAAAAAACTGGAGAACATTGAATCATGTGATTCTAGCATTCACACACTTCTTGCGAATATATATGCTTCTGTCGATAGGTGGGAAGATGCAAAGAAAGTGAGAAGGAAAATGAAAGAACTTGGAGTGAAGAAGATGCCTGGGTGTAGCTTGATCGAAGTTGATGGCATTGTTCACGAGTTTCTTGTCGGGGATCCATCTCATCCAGAAATGATGGAGATATGTTCCATGTTGAATAGAGTGACTGGACAGTTACTAGGATTAAAGGAATCTCAGGTTGAAAGTGTGATGCCACTCTACAAGGACACTCAACACTGCAATTTTGTAGAATCTTAAGTGATTTGCTGTGTAGGAAGAAAGGCGTTTAACAATATCAGAAGTGATGGTCAAATTAGAACACAAACTGGAAATACAATTTCTTGATTTTTTTAGGAAAGTTATTTCAAATGACAAAGC

mRNA sequence

ATGGATAAGAAGAGAGTGGCTATTCCTCTCGTTTGCCATGGTCATTCTCGACCGATTGTCGATCTATTTTCTAGCCTTGTTACGCCCGATGGATTCTTCCTCGTCAGTGCCAGCAAGGATTCCAATCCCATGCTTAGAAATGGTGAAAATGGTGATTGGATTGGTACATTTGAAGGTCACAAAGGCGCAGTTTGGAGTTGTTGTCTGGATACCAATGCTTTACGTGCTGCTACTGGTTCTGCTGATTTTTCAGCTACTCATTGTGGTCTTTTTCCATTCCCCCAAGCATTTGCTTCCTCTAGACTGCACCACCTTCTTACTCCTCACCCTTTTCTTGGGCATATGAAGCATTTTGATTGTGCCTTTGCTTTTTCAGAGATTTTCCAATTCTTCAAACACTCACAAAGCTCGCACATGCTTCCTTTATCAATCATTCGAAGAGTCCAGTTTATATCCCGTCATTTTTCTTCAAGCCCCCATTTAGTTCCAGTTCTGCTTCGAATCTCCAAACTAACAAAGAAATCATGCATCGAGTGTCTTCGGAACTGCAAGTCCATGGATCAACTCAAACAAATTCAGAGTCAGATCTTTCGAATTGGTCTTGAAGGAGACAGGGACACAATAAACAAATTGATGGCATTCTGTGCAGACTCATCTCTTGGCAACTTGCGCTATGCAGAGAAGATATTCAATTATGTTCAAGACCCATCTCTGTTTGTTTATAATGTGATGGTTAAAATGTATGCCAAAAGGGGTATTCTCAGAAAAGTCCTTTTGCTCTTTCAACAGTTGAGGGAAGATGGATTGTGGCCTGATGGTTTTACTTACCCATTTGTTCTGAAAGCTATTGGTTGCTTACGGGACGTGAGGCAAGGTGAAAAGGTTCGTGGCTTTATAGTGAAGACAGGAATGGATTTGGATAATTATGTCTATAATTCACTTATAGATATGTATTATGAATTAAGCAATGTTGAGAATGCTAAGAAGTTATTTGATGAAATGACGACTAGAGATTCGGTTTCTTGGAATGTTATGATTTCTGGGTATGTTAGGTGTCGTAGATTTGAGGATGCTATCAATACATTTAGGGAAATGCAGCAAGAGGGCAATGAGAAACCTGATGAAGCTACTGTAGTTAGCACTCTTTCTGCTTGTACAGCACTGAAAAATCTGGAGCTTGGAGATGAAATTCACAACTATGTTAGAAAGGAGCTTGGTTTTACCACTCGAATCGACAATGCCTTATTAGATATGTATGCAAAATGTGGTTGTCTAAATATTGCCCGCAACATATTTGATGAAATGTCTATGAAAAATGTAATTTGTTGGACTAGCATGATCTCTGGCTATATAAACTGTGGTGATTTAAGAGAGGCTAGAGACTTGTTTGACAAAAGTCCAGTTAGAGATGTTGTTTTGTGGACAGCTATGATAAATGGGTATGTGCAGTTCCACCATTTTGATGATGCTGTGGCTCTGTTTCGCGAAATGCAAATTCAAAGGGTAAAACCTGATAAGTTCACAGTGGTCACTCTCCTCACAGGTTGCGCGCAGTTGGGAGCTCTAGAACAAGGGAAATGGATTCATGGATACCTAGATGAAAACAGAATAACAATGGATGTGGTTGTTGGTACTGCACTCATTGAAATGTATTCCAAATGTGGTTGTGTAGATAAATCATTAGAAATTTTCTATGAGTTAGAAGACAAGGACACGGCATCTTGGACCTCAATTATATGCGGTCTTGCAATGAATGGTAAGACGAGTGAAGCACTCCGGCTGTTCTCAGAAATGGAACGTGTGGGGGCTAAACCTGATGATATCACCTTCATTGGAGTTTTAAGTGCCTGTAGTCATGGTGGGCTGGTTGAGGAAGGGCGTAGGTTTTTCAACTCGATGAAAAAGGTTCACCGAATTGAGCCGAAGGTAGAGCATTATGGGTGTGTAATTGACCTTCTTGGTAGAGCTGGGTTATTGGATGAAGCAGAGGAACTCATACA
AGAGATTCCGATTGAAAATTGCGAAATTGTGGTTCCTCTGTATGGTGCTTTGCTCAGTGCTTGTAGAATCCACAATAATGTTGACATGGGTGAAAGGCTGGCCAAAAAACTGGAGAACATTGAATCATGTGATTCTAGCATTCACACACTTCTTGCGAATATATATGCTTCTGTCGATAGGTGGGAAGATGCAAAGAAAGTGAGAAGGAAAATGAAAGAACTTGGAGTGAAGAAGATGCCTGGGTGTAGCTTGATCGAAGTTGATGGCATTGTTCACGAGTTTCTTGTCGGGGATCCATCTCATCCAGAAATGATGGAGATATGTTCCATGTTGAATAGAGTGACTGGACAGTTACTAGGATTAAAGGAATCTCAGGTTGAAAGTGTGATGCCACTCTACAAGGACACTCAACACTGCAATTTTGTAGAATCTTAA

Coding sequence (CDS)

ATGGATAAGAAGAGAGTGGCTATTCCTCTCGTTTGCCATGGTCATTCTCGACCGATTGTCGATCTATTTTCTAGCCTTGTTACGCCCGATGGATTCTTCCTCGTCAGTGCCAGCAAGGATTCCAATCCCATGCTTAGAAATGGTGAAAATGGTGATTGGATTGGTACATTTGAAGGTCACAAAGGCGCAGTTTGGAGTTGTTGTCTGGATACCAATGCTTTACGTGCTGCTACTGGTTCTGCTGATTTTTCAGCTACTCATTGTGGTCTTTTTCCATTCCCCCAAGCATTTGCTTCCTCTAGACTGCACCACCTTCTTACTCCTCACCCTTTTCTTGGGCATATGAAGCATTTTGATTGTGCCTTTGCTTTTTCAGAGATTTTCCAATTCTTCAAACACTCACAAAGCTCGCACATGCTTCCTTTATCAATCATTCGAAGAGTCCAGTTTATATCCCGTCATTTTTCTTCAAGCCCCCATTTAGTTCCAGTTCTGCTTCGAATCTCCAAACTAACAAAGAAATCATGCATCGAGTGTCTTCGGAACTGCAAGTCCATGGATCAACTCAAACAAATTCAGAGTCAGATCTTTCGAATTGGTCTTGAAGGAGACAGGGACACAATAAACAAATTGATGGCATTCTGTGCAGACTCATCTCTTGGCAACTTGCGCTATGCAGAGAAGATATTCAATTATGTTCAAGACCCATCTCTGTTTGTTTATAATGTGATGGTTAAAATGTATGCCAAAAGGGGTATTCTCAGAAAAGTCCTTTTGCTCTTTCAACAGTTGAGGGAAGATGGATTGTGGCCTGATGGTTTTACTTACCCATTTGTTCTGAAAGCTATTGGTTGCTTACGGGACGTGAGGCAAGGTGAAAAGGTTCGTGGCTTTATAGTGAAGACAGGAATGGATTTGGATAATTATGTCTATAATTCACTTATAGATATGTATTATGAATTAAGCAATGTTGAGAATGCTAAGAAGTTATTTGATGAAATGACGACTAGAGATTCGGTTTCTTGGAATGTTATGATTTCTGGGTATGTTAGGTGTCGTAGATTTGAGGATGCTATCAATACATTTAGGGAAATGCAGCAAGAGGGCAATGAGAAACCTGATGAAGCTACTGTAGTTAGCACTCTTTCTGCTTGTACAGCACTGAAAAATCTGGAGCTTGGAGATGAAATTCACAACTATGTTAGAAAGGAGCTTGGTTTTACCACTCGAATCGACAATGCCTTATTAGATATGTATGCAAAATGTGGTTGTCTAAATATTGCCCGCAACATATTTGATGAAATGTCTATGAAAAATGTAATTTGTTGGACTAGCATGATCTCTGGCTATATAAACTGTGGTGATTTAAGAGAGGCTAGAGACTTGTTTGACAAAAGTCCAGTTAGAGATGTTGTTTTGTGGACAGCTATGATAAATGGGTATGTGCAGTTCCACCATTTTGATGATGCTGTGGCTCTGTTTCGCGAAATGCAAATTCAAAGGGTAAAACCTGATAAGTTCACAGTGGTCACTCTCCTCACAGGTTGCGCGCAGTTGGGAGCTCTAGAACAAGGGAAATGGATTCATGGATACCTAGATGAAAACAGAATAACAATGGATGTGGTTGTTGGTACTGCACTCATTGAAATGTATTCCAAATGTGGTTGTGTAGATAAATCATTAGAAATTTTCTATGAGTTAGAAGACAAGGACACGGCATCTTGGACCTCAATTATATGCGGTCTTGCAATGAATGGTAAGACGAGTGAAGCACTCCGGCTGTTCTCAGAAATGGAACGTGTGGGGGCTAAACCTGATGATATCACCTTCATTGGAGTTTTAAGTGCCTGTAGTCATGGTGGGCTGGTTGAGGAAGGGCGTAGGTTTTTCAACTCGATGAAAAAGGTTCACCGAATTGAGCCGAAGGTAGAGCATTATGGGTGTGTAATTGACCTTCTTGGTAGAGCTGGGTTATTGGATGAAGCAGAGGAACTCATACA
AGAGATTCCGATTGAAAATTGCGAAATTGTGGTTCCTCTGTATGGTGCTTTGCTCAGTGCTTGTAGAATCCACAATAATGTTGACATGGGTGAAAGGCTGGCCAAAAAACTGGAGAACATTGAATCATGTGATTCTAGCATTCACACACTTCTTGCGAATATATATGCTTCTGTCGATAGGTGGGAAGATGCAAAGAAAGTGAGAAGGAAAATGAAAGAACTTGGAGTGAAGAAGATGCCTGGGTGTAGCTTGATCGAAGTTGATGGCATTGTTCACGAGTTTCTTGTCGGGGATCCATCTCATCCAGAAATGATGGAGATATGTTCCATGTTGAATAGAGTGACTGGACAGTTACTAGGATTAAAGGAATCTCAGGTTGAAAGTGTGATGCCACTCTACAAGGACACTCAACACTGCAATTTTGTAGAATCTTAA

Protein sequence

MDKKRVAIPLVCHGHSRPIVDLFSSLVTPDGFFLVSASKDSNPMLRNGENGDWIGTFEGHKGAVWSCCLDTNALRAATGSADFSATHCGLFPFPQAFASSRLHHLLTPHPFLGHMKHFDCAFAFSEIFQFFKHSQSSHMLPLSIIRRVQFISRHFSSSPHLVPVLLRISKLTKKSCIECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELGFTTRIDNALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLLGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMMEICSMLNRVTGQLLGLKESQVESVMPLYKDTQHCNFVES*
BLAST of Csa1G418260.1 vs. Swiss-Prot
Match: PPR65_ARATH (Pentatricopeptide repeat-containing protein At1g31430 OS=Arabidopsis thaliana GN=PCMP-E55 PE=2 SV=1)

HSP 1 Score: 677.9 bits (1748), Expect = 1.3e-193
Identity = 331/565 (58.58%), Positives = 427/565 (75.58%), Query Frame = 1

Query: 233 VQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQG 292
           +Q PSL +YN M+K  A      KVL LF +LR  GL+PD FT P VLK+IG LR V +G
Sbjct: 6   LQTPSLLMYNKMLKSLADGKSFTKVLALFGELRGQGLYPDNFTLPVVLKSIGRLRKVIEG 65

Query: 293 EKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRC 352
           EKV G+ VK G++ D+YV NSL+ MY  L  +E   K+FDEM  RD VSWN +IS YV  
Sbjct: 66  EKVHGYAVKAGLEFDSYVSNSLMGMYASLGKIEITHKVFDEMPQRDVVSWNGLISSYVGN 125

Query: 353 RRFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELGFTTRID 412
            RFEDAI  F+ M QE N K DE T+VSTLSAC+ALKNLE+G+ I+ +V  E   + RI 
Sbjct: 126 GRFEDAIGVFKRMSQESNLKFDEGTIVSTLSACSALKNLEIGERIYRFVVTEFEMSVRIG 185

Query: 413 NALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVV 472
           NAL+DM+ KCGCL+ AR +FD M  KNV CWTSM+ GY++ G + EAR LF++SPV+DVV
Sbjct: 186 NALVDMFCKCGCLDKARAVFDSMRDKNVKCWTSMVFGYVSTGRIDEARVLFERSPVKDVV 245

Query: 473 LWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYL 532
           LWTAM+NGYVQF+ FD+A+ LFR MQ   ++PD F +V+LLTGCAQ GALEQGKWIHGY+
Sbjct: 246 LWTAMMNGYVQFNRFDEALELFRCMQTAGIRPDNFVLVSLLTGCAQTGALEQGKWIHGYI 305

Query: 533 DENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEAL 592
           +ENR+T+D VVGTAL++MY+KCGC++ +LE+FYE++++DTASWTS+I GLAMNG +  AL
Sbjct: 306 NENRVTVDKVVGTALVDMYAKCGCIETALEVFYEIKERDTASWTSLIYGLAMNGMSGRAL 365

Query: 593 RLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDL 652
            L+ EME VG + D ITF+ VL+AC+HGG V EGR+ F+SM + H ++PK EH  C+IDL
Sbjct: 366 DLYYEMENVGVRLDAITFVAVLTACNHGGFVAEGRKIFHSMTERHNVQPKSEHCSCLIDL 425

Query: 653 LGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDS 712
           L RAGLLDEAEELI ++  E+ E +VP+Y +LLSA R + NV + ER+A+KLE +E  DS
Sbjct: 426 LCRAGLLDEAEELIDKMRGESDETLVPVYCSLLSAARNYGNVKIAERVAEKLEKVEVSDS 485

Query: 713 SIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDP--SHPE 772
           S HTLLA++YAS +RWED   VRRKMK+LG++K PGCS IE+DG+ HEF+VGD   SHP+
Sbjct: 486 SAHTLLASVYASANRWEDVTNVRRKMKDLGIRKFPGCSSIEIDGVGHEFIVGDDLLSHPK 545

Query: 773 MMEICSMLNRVTGQLLGLKESQVES 796
           M EI SML++ T  +L L+  +++S
Sbjct: 546 MDEINSMLHQTTNLMLDLEHKEIDS 570
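
The identity and positives percentages in an HSP summary line are the number of matching (or conservatively substituted) positions divided by the alignment length. A minimal sketch that re-derives them from the summary line of the hit above follows; the helper name `check_hsp_line` and the regular expression are assumptions made for illustration and are not part of any BLAST tool.

```python
import re

# Matches e.g. "Identity = 331/565 (58.58%)". The optional "i" also accepts
# the "Postives" spelling, in case a report was exported with that label.
HSP_STATS = re.compile(
    r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def check_hsp_line(line):
    """Return {label: (computed_percent, reported_percent)} for one line."""
    results = {}
    for name, hits, length, reported in HSP_STATS.findall(line):
        label = "Identity" if name == "Identity" else "Positives"
        computed = round(100 * int(hits) / int(length), 2)
        results[label] = (computed, float(reported))
    return results

stats = check_hsp_line(
    "Identity = 331/565 (58.58%), Positives = 427/565 (75.58%), Query Frame = 1"
)
for label, (computed, reported) in stats.items():
    print(label, computed, reported)
```

Comparing the computed and reported values is a cheap sanity check when such reports are copied or reformatted by hand.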

BLAST of Csa1G418260.1 vs. Swiss-Prot
Match: PP169_ARATH (Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E28 PE=2 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 7.4e-144
Identity = 258/607 (42.50%), Positives = 397/607 (65.40%), Query Frame = 1

Query: 177 IECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDP 236
           +  L  CK +  LKQIQ+Q+   GL  D    ++L+AFCA S    L Y+ KI   +++P
Sbjct: 57  LSLLEKCKLLLHLKQIQAQMIINGLILDPFASSRLIAFCALSESRYLDYSVKILKGIENP 116

Query: 237 SLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLW---PDGFTYPFVLKAIGCLRDVRQGE 296
           ++F +NV ++ +++    ++  LL++Q+   G     PD FTYP + K    LR    G 
Sbjct: 117 NIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSLGH 176

Query: 297 KVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCR 356
            + G ++K  ++L ++V+N+ I M+    ++ENA+K+FDE   RD VSWN +I+GY +  
Sbjct: 177 MILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKKIG 236

Query: 357 RFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKE-LGFTTRID 416
             E AI  ++ M+ EG  KPD+ T++  +S+C+ L +L  G E + YV++  L  T  + 
Sbjct: 237 EAEKAIYVYKLMESEG-VKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKENGLRMTIPLV 296

Query: 417 NALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVV 476
           NAL+DM++KCG ++ AR IFD +  + ++ WT+MISGY  CG L  +R LFD    +DVV
Sbjct: 297 NALMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLDVSRKLFDDMEEKDVV 356

Query: 477 LWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYL 536
           LW AMI G VQ     DA+ALF+EMQ    KPD+ T++  L+ C+QLGAL+ G WIH Y+
Sbjct: 357 LWNAMIGGSVQAKRGQDALALFQEMQTSNTKPDEITMIHCLSACSQLGALDVGIWIHRYI 416

Query: 537 DENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEAL 596
           ++  ++++V +GT+L++MY+KCG + ++L +F+ ++ +++ ++T+II GLA++G  S A+
Sbjct: 417 EKYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTYTAIIGGLALHGDASTAI 476

Query: 597 RLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDL 656
             F+EM   G  PD+ITFIG+LSAC HGG+++ GR +F+ MK    + P+++HY  ++DL
Sbjct: 477 SYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKSRFNLNPQLKHYSIMVDL 536

Query: 657 LGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDS 716
           LGRAGLL+EA+ L++ +P+E    V   +GALL  CR+H NV++GE+ AKKL  ++  DS
Sbjct: 537 LGRAGLLEEADRLMESMPMEADAAV---WGALLFGCRMHGNVELGEKAAKKLLELDPSDS 596

Query: 717 SIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMM 776
            I+ LL  +Y   + WEDAK+ RR M E GV+K+PGCS IEV+GIV EF+V D S PE  
Sbjct: 597 GIYVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIEVNGIVCEFIVRDKSRPESE 656

Query: 777 EICSMLN 780
           +I   L+
Sbjct: 657 KIYDRLH 659

BLAST of Csa1G418260.1 vs. Swiss-Prot
Match: PP235_ARATH (Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis thaliana GN=PCMP-E51 PE=3 SV=2)

HSP 1 Score: 497.3 bits (1279), Expect = 3.2e-139
Identity = 252/609 (41.38%), Positives = 397/609 (65.19%), Query Frame = 1

Query: 177 IECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDP 236
           I  L  CK+ DQ KQ+ SQ    G+  +     KL  F      G++ YA K+F  + +P
Sbjct: 38  ISILGVCKTTDQFKQLHSQSITRGVAPNPTFQKKLFVFWCSRLGGHVSYAYKLFVKIPEP 97

Query: 237 SLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRD---VRQGE 296
            + V+N M+K ++K     + + L+  + ++G+ PD  T+PF+L   G  RD   +  G+
Sbjct: 98  DVVVWNNMIKGWSKVDCDGEGVRLYLNMLKEGVTPDSHTFPFLLN--GLKRDGGALACGK 157

Query: 297 KVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCR 356
           K+   +VK G+  + YV N+L+ MY     ++ A+ +FD     D  SWN+MISGY R +
Sbjct: 158 KLHCHVVKFGLGSNLYVQNALVKMYSLCGLMDMARGVFDRRCKEDVFSWNLMISGYNRMK 217

Query: 357 RFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRK-ELGFTTRID 416
            +E++I    EM++     P   T++  LSAC+ +K+ +L   +H YV + +   + R++
Sbjct: 218 EYEESIELLVEMERN-LVSPTSVTLLLVLSACSKVKDKDLCKRVHEYVSECKTEPSLRLE 277

Query: 417 NALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVV 476
           NAL++ YA CG ++IA  IF  M  ++VI WTS++ GY+  G+L+ AR  FD+ PVRD +
Sbjct: 278 NALVNAYAACGEMDIAVRIFRSMKARDVISWTSIVKGYVERGNLKLARTYFDQMPVRDRI 337

Query: 477 LWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYL 536
            WT MI+GY++   F++++ +FREMQ   + PD+FT+V++LT CA LG+LE G+WI  Y+
Sbjct: 338 SWTIMIDGYLRAGCFNESLEIFREMQSAGMIPDEFTMVSVLTACAHLGSLEIGEWIKTYI 397

Query: 537 DENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEAL 596
           D+N+I  DVVVG ALI+MY KCGC +K+ ++F++++ +D  +WT+++ GLA NG+  EA+
Sbjct: 398 DKNKIKNDVVVGNALIDMYFKCGCSEKAQKVFHDMDQRDKFTWTAMVVGLANNGQGQEAI 457

Query: 597 RLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDL 656
           ++F +M+ +  +PDDIT++GVLSAC+H G+V++ R+FF  M+  HRIEP + HYGC++D+
Sbjct: 458 KVFFQMQDMSIQPDDITYLGVLSACNHSGMVDQARKFFAKMRSDHRIEPSLVHYGCMVDM 517

Query: 657 LGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDS 716
           LGRAGL+ EA E+++++P+    IV   +GALL A R+HN+  M E  AKK+  +E  + 
Sbjct: 518 LGRAGLVKEAYEILRKMPMNPNSIV---WGALLGASRLHNDEPMAELAAKKILELEPDNG 577

Query: 717 SIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMM 776
           +++ LL NIYA   RW+D ++VRRK+ ++ +KK PG SLIEV+G  HEF+ GD SH +  
Sbjct: 578 AVYALLCNIYAGCKRWKDLREVRRKIVDVAIKKTPGFSLIEVNGFAHEFVAGDKSHLQSE 637

Query: 777 EICSMLNRV 782
           EI   L  +
Sbjct: 638 EIYMKLEEL 640

BLAST of Csa1G418260.1 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 456.4 bits (1173), Expect = 6.3e-127
Identity = 228/612 (37.25%), Positives = 374/612 (61.11%), Query Frame = 1

Query: 177 IECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDP 236
           I  +  C S+ QLKQ    + R G   D  + +KL A  A SS  +L YA K+F+ +  P
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKP 93

Query: 237 SLFVYNVMVKMYAKR-GILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKV 296
           + F +N +++ YA     +  +      + E   +P+ +T+PF++KA   +  +  G+ +
Sbjct: 94  NSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSL 153

Query: 297 RGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRF 356
            G  VK+ +  D +V NSLI  Y+   ++++A K+F  +  +D VSWN MI+G+V+    
Sbjct: 154 HGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSP 213

Query: 357 EDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKE-LGFTTRIDNA 416
           + A+  F++M+ E + K    T+V  LSAC  ++NLE G ++ +Y+ +  +     + NA
Sbjct: 214 DKALELFKKMESE-DVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANA 273

Query: 417 LLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLW 476
           +LDMY KCG +  A+ +FD M  K+ + WT+M+ GY    D   AR++ +  P +D+V W
Sbjct: 274 MLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAW 333

Query: 477 TAMINGYVQFHHFDDAVALFREMQIQR-VKPDKFTVVTLLTGCAQLGALEQGKWIHGYLD 536
            A+I+ Y Q    ++A+ +F E+Q+Q+ +K ++ T+V+ L+ CAQ+GALE G+WIH Y+ 
Sbjct: 334 NALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIK 393

Query: 537 ENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALR 596
           ++ I M+  V +ALI MYSKCG ++KS E+F  +E +D   W+++I GLAM+G  +EA+ 
Sbjct: 394 KHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVD 453

Query: 597 LFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLL 656
           +F +M+    KP+ +TF  V  ACSH GLV+E    F+ M+  + I P+ +HY C++D+L
Sbjct: 454 MFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVL 513

Query: 657 GRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDSS 716
           GR+G L++A + I+ +PI     V   +GALL AC+IH N+++ E    +L  +E  +  
Sbjct: 514 GRSGYLEKAVKFIEAMPIPPSTSV---WGALLGACKIHANLNLAEMACTRLLELEPRNDG 573

Query: 717 IHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMME 776
            H LL+NIYA + +WE+  ++R+ M+  G+KK PGCS IE+DG++HEFL GD +HP   +
Sbjct: 574 AHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEK 633

Query: 777 ICSMLNRVTGQL 786
           +   L+ V  +L
Sbjct: 634 VYGKLHEVMEKL 641


HSP 2 Score: 146.0 bits (367), Expect = 1.8e-33
Identity = 140/550 (25.45%), Positives = 244/550 (44.36%), Query Frame = 1

Query: 219 SLGNLRYAEKIFNYVQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPF 278
           S G+L  A K+F  +++  +  +N M+  + ++G   K L LF+++  + +     T   
Sbjct: 178 SCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVG 237

Query: 279 VLKAIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRD 338
           VL A   +R++  G +V  +I +  ++++  + N+++DMY +  ++E+AK+LFD M  +D
Sbjct: 238 VLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKD 297

Query: 339 SVSWNVMISGYVRCRRFEDAINTFREMQQ-------------EGNEKPDEA--------- 398
           +V+W  M+ GY     +E A      M Q             E N KP+EA         
Sbjct: 298 NVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQL 357

Query: 399 ---------TVVSTLSACTALKNLELGDEIHNYVRKE-LGFTTRIDNALLDMYAKCGCLN 458
                    T+VSTLSAC  +  LELG  IH+Y++K  +     + +AL+ MY+KCG L 
Sbjct: 358 QKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLE 417

Query: 459 IARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDV----VLWTAMINGYV 518
            +R +F+ +  ++V  W++MI G    G   EA D+F K    +V    V +T +     
Sbjct: 418 KSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACS 477

Query: 519 QFHHFDDAVALFREMQIQ-RVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDV 578
                D+A +LF +M+    + P++     ++    + G LE+      +++   I    
Sbjct: 478 HTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAV---KFIEAMPIPPST 537

Query: 579 VVGTALI---EMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEM 638
            V  AL+   ++++     + +     ELE ++  +   +    A  GK      L   M
Sbjct: 538 SVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHM 597

Query: 639 ERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGC---VIDLLGR 698
              G K +        S+    G++ E    F S    H +  KV  YG    V++ L  
Sbjct: 598 RVTGLKKEP-----GCSSIEIDGMIHE----FLSGDNAHPMSEKV--YGKLHEVMEKLKS 657

Query: 699 AGLLDEAEELIQEIPIENC-EIVVPLYGALLSAC--RIHNNVDMGERLAKKLENIESCDS 723
            G   E  +++Q I  E   E  + L+   L+ C   I        R+ K L     C  
Sbjct: 658 NGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRVCGDC-H 712

BLAST of Csa1G418260.1 vs. Swiss-Prot
Match: PP249_ARATH (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN=PCMP-H56 PE=2 SV=1)

HSP 1 Score: 436.4 bits (1121), Expect = 6.8e-121
Identity = 228/596 (38.26%), Positives = 365/596 (61.24%), Query Frame = 1

Query: 191 QIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDPSLFVYNVMVKMYAK 250
           QI   I ++G   D    N L+ F A+   G L  A K+F+ + + ++  +  M+  YA+
Sbjct: 155 QIHGLIVKMGYAKDLFVQNSLVHFYAEC--GELDSARKVFDEMSERNVVSWTSMICGYAR 214

Query: 251 RGILRKVL-LLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKVRGFIVKTGMDLDNY 310
           R   +  + L F+ +R++ + P+  T   V+ A   L D+  GEKV  FI  +G+++++ 
Sbjct: 215 RDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDL 274

Query: 311 VYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRFEDAINTFREMQQEG 370
           + ++L+DMY + + ++ AK+LFDE    +    N M S YVR     +A+  F  M   G
Sbjct: 275 MVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSG 334

Query: 371 NEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELGFTT--RIDNALLDMYAKCGCLNI 430
             +PD  +++S +S+C+ L+N+  G   H YV +  GF +   I NAL+DMY KC   + 
Sbjct: 335 -VRPDRISMLSAISSCSQLRNILWGKSCHGYVLRN-GFESWDNICNALIDMYMKCHRQDT 394

Query: 431 ARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMINGYVQFHHF 490
           A  IFD MS K V+ W S+++GY+  G++  A + F+  P +++V W  +I+G VQ   F
Sbjct: 395 AFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLF 454

Query: 491 DDAVALFREMQIQR-VKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTA 550
           ++A+ +F  MQ Q  V  D  T++++ + C  LGAL+  KWI+ Y+++N I +DV +GT 
Sbjct: 455 EEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTT 514

Query: 551 LIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMERVGAKPD 610
           L++M+S+CG  + ++ IF  L ++D ++WT+ I  +AM G    A+ LF +M   G KPD
Sbjct: 515 LVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPD 574

Query: 611 DITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLLGRAGLLDEAEELI 670
            + F+G L+ACSHGGLV++G+  F SM K+H + P+  HYGC++DLLGRAGLL+EA +LI
Sbjct: 575 GVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLI 634

Query: 671 QEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDSSIHTLLANIYASVD 730
           +++P+E  +++   + +LL+ACR+  NV+M    A+K++ +    +  + LL+N+YAS  
Sbjct: 635 EDMPMEPNDVI---WNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAG 694

Query: 731 RWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMMEICSMLNRVT 783
           RW D  KVR  MKE G++K PG S I++ G  HEF  GD SHPEM  I +ML+ V+
Sbjct: 695 RWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVS 743


HSP 2 Score: 248.8 bits (634), Expect = 2.0e-64
Identity = 157/521 (30.13%), Positives = 257/521 (49.33%), Query Frame = 1

Query: 163 PVLLRISKLTKKSCIECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMA-FCADSSLG 222
           P LL  SK TK +    L+NCK++D+LK     + + GL+ D  TI KL+A  C   +  
Sbjct: 23  PSLLNQSKCTKATP-SSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRE 82

Query: 223 NLRYAEKIF-NYVQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVL 282
           +L +A+++F N     + F+YN +++ YA  G+  + +LLF ++   G+ PD +T+PF L
Sbjct: 83  SLSFAKEVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGL 142

Query: 283 KAIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSV 342
            A    R    G ++ G IVK G   D +V NSL+  Y E   +++A+K+FDEM+ R+ V
Sbjct: 143 SACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVV 202

Query: 343 SWNVMISGYVRCRRFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNY 402
           SW  MI GY R    +DA++ F  M ++    P+  T+V  +SAC  L++LE G++++ +
Sbjct: 203 SWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAF 262

Query: 403 VRKE-LGFTTRIDNALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREA 462
           +R   +     + +AL+DMY KC  +++A+ +FDE    N+    +M S Y+  G  REA
Sbjct: 263 IRNSGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREA 322

Query: 463 RDLFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQL 522
                                          + +F  M    V+PD+ ++++ ++ C+QL
Sbjct: 323 -------------------------------LGVFNLMMDSGVRPDRISMLSAISSCSQL 382

Query: 523 GALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSII 582
             +  GK  HGY+  N       +  ALI+MY KC   D +  IF  + +K   +W SI+
Sbjct: 383 RNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSIV 442

Query: 583 CGLAMNGKTSEALRLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRI 642
            G   NG+   A   F  M     + + +++  ++S    G L EE    F SM+    +
Sbjct: 443 AGYVENGEVDAAWETFETM----PEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGV 502

Query: 643 EPKVEHYGCVIDLLGRAGLLDEAEELIQEIPIENCEIVVPL 681
                    +    G  G LD A+ +   I     ++ V L
Sbjct: 503 NADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRL 507

BLAST of Csa1G418260.1 vs. TrEMBL
Match: F6HH61_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0016g04660 PE=4 SV=1)

HSP 1 Score: 928.7 bits (2399), Expect = 4.8e-267
Identity = 452/662 (68.28%), Positives = 543/662 (82.02%), Query Frame = 1

Query: 139 MLPLSIIRRVQFISRHFSSSP-HLVPVLLRISKLTKKSCIECLRNCKSMDQLKQIQSQIF 198
           ML  +  +  +F S HF S P HL       S  TKKSCI  L+NCKSM  LKQIQ+QI 
Sbjct: 1   MLSQTKFQLFKFTSLHFLSKPLHLSTS----SHFTKKSCIFLLKNCKSMQHLKQIQTQIL 60

Query: 199 RIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDPSLFVYNVMVKMYAKRGILRKV 258
           R G     DT+NK M  C D S+GNL YAE+IFNY+  P LF+YN+++K + K G  RK 
Sbjct: 61  RTGFHQSGDTLNKFMVCCTDPSIGNLHYAERIFNYIDIPGLFIYNLVIKAFTKNGSFRKA 120

Query: 259 LLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLIDM 318
           +LLF+QLRE+GL PD FTYPFV KAIGCL +VR+GEKV GF+VK+G++ D YV NSL+DM
Sbjct: 121 VLLFRQLREEGLSPDNFTYPFVFKAIGCLGEVREGEKVYGFVVKSGLEFDTYVCNSLMDM 180

Query: 319 YYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRFEDAINTFREMQQEGNEKPDEAT 378
           Y E+  V+N +++F+EM  RD VSWNV+ISGYV+CRR+EDA++ FR MQQ+ + +P+EAT
Sbjct: 181 YAEVGRVQNLRQVFEEMPQRDVVSWNVLISGYVKCRRYEDAVDVFRRMQQQSSLRPNEAT 240

Query: 379 VVSTLSACTALKNLELGDEIHNYVRKELGFTTRIDNALLDMYAKCGCLNIARNIFDEMSM 438
           VVSTLSAC ALK LELG EIH YVR++LGFT +I NAL+DMY KCG L+IAR IF++M +
Sbjct: 241 VVSTLSACIALKMLELGKEIHRYVREQLGFTIKIGNALVDMYCKCGHLSIAREIFNDMPI 300

Query: 439 KNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFREM 498
           K VICWTSM+SGY+NCG L EAR+LF++SPVRDVVLWTAMINGYVQF+ FDDAVALFREM
Sbjct: 301 KTVICWTSMVSGYVNCGQLDEARELFERSPVRDVVLWTAMINGYVQFNRFDDAVALFREM 360

Query: 499 QIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCV 558
           QI+RV PD+FT+V LLTGCAQLG LEQGKWIHGY+DEN+I +D VVGTALIEMY+KCG +
Sbjct: 361 QIKRVSPDRFTLVALLTGCAQLGTLEQGKWIHGYIDENKIMIDAVVGTALIEMYAKCGFI 420

Query: 559 DKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMERVGAKPDDITFIGVLSAC 618
           +KSLEIF  L++KDTASWTSIICGLAMNGKTS+AL LF+EM + G KPDDITFIGVLSAC
Sbjct: 421 EKSLEIFNGLKEKDTASWTSIICGLAMNGKTSKALELFAEMVQTGVKPDDITFIGVLSAC 480

Query: 619 SHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLLGRAGLLDEAEELIQEIPIENCEIV 678
           SHGGLVEEGR+ F SM  V++IEPK+EHYGC+IDLLGRAG LDEAEELI++ P  N E++
Sbjct: 481 SHGGLVEEGRKHFRSMTAVYQIEPKLEHYGCLIDLLGRAGQLDEAEELIEKSPNVNNEVI 540

Query: 679 VPLYGALLSACRIHNNVDMGERLAKKLENIESCDSSIHTLLANIYASVDRWEDAKKVRRK 738
           VPLYGALLSACR H NV+MGER+AK+L  IES DSS+HTLLANIYAS DRWED  KVRRK
Sbjct: 541 VPLYGALLSACRTHGNVEMGERVAKRLVGIESGDSSVHTLLANIYASADRWEDVTKVRRK 600

Query: 739 MKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMMEICSMLNRVTGQLLGLKESQVESVM 798
           MK+LGVKK+PGCS +EV+GIVHEFLVGD SHPEM EI SML+ +   LLGL E+++E  +
Sbjct: 601 MKDLGVKKVPGCSSVEVNGIVHEFLVGDASHPEMREIYSMLDSIAKPLLGLDENEMEGEI 658

Query: 799 PL 800
           P+
Sbjct: 661 PV 658

BLAST of Csa1G418260.1 vs. TrEMBL
Match: V4UPA8_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10007671mg PE=4 SV=1)

HSP 1 Score: 913.7 bits (2360), Expect = 1.6e-262
Identity = 448/658 (68.09%), Positives = 536/658 (81.46%), Query Frame = 1

Query: 137 SHMLPLSIIRRVQFISRHFSSSPHLVPVLLRISKLTKKSCIECLRNCKSMDQLKQIQSQI 196
           S  LP S+ R++ +     S S H        S LTKKSCI  L+NCKS+ QLKQIQ+QI
Sbjct: 3   SQRLPNSVSRQIIWKPVPQSDSSHT-------STLTKKSCIYLLKNCKSITQLKQIQAQI 62

Query: 197 FRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDPSLFVYNVMVKMYAKRGILRK 256
           F+IGL+ + +T+NKLM FC   S GNL YAEKIF  +Q P L  YN+++K +AK+G  RK
Sbjct: 63  FQIGLQQNPETLNKLMVFCTQPSHGNLLYAEKIFGSIQSPCLLAYNLLIKAFAKKGSFRK 122

Query: 257 VLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLID 316
            LLLF +LRE G+ PD FTYPFV KA+GCL +V++GEKV G++VKTG++ D YV NS++D
Sbjct: 123 SLLLFSKLRERGVSPDNFTYPFVFKAVGCLGEVKKGEKVHGYVVKTGLEFDTYVCNSIMD 182

Query: 317 MYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRFEDAINTFREMQQEGNEKPDEA 376
           MY  L  + N KKLFDEM  +D VSWNV ISG+V+C RFEDA++ FR M+Q  N  PDE 
Sbjct: 183 MYAVLGKICNVKKLFDEMPDKDVVSWNVSISGHVKCMRFEDAVDVFRRMRQGCNLMPDEG 242

Query: 377 TVVSTLSACTALKNLELGDEIHNYVRKELGFTTRIDNALLDMYAKCGCLNIARNIFDEMS 436
           TVVSTLSACTALKNLELG EIH Y+ +EL FT  + NALLDMY KCGCL+ AR +FDEM 
Sbjct: 243 TVVSTLSACTALKNLELGKEIHRYINQELEFTPIMGNALLDMYCKCGCLSEARELFDEMP 302

Query: 437 MKNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFRE 496
            KNVICWTSM+SGY+NCG L +ARDLFD+SPVRD+VLWTAMINGYVQF+ FD+AVALFRE
Sbjct: 303 NKNVICWTSMVSGYVNCGQLEKARDLFDRSPVRDIVLWTAMINGYVQFNRFDEAVALFRE 362

Query: 497 MQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGC 556
           MQI R+KPDKF +V LLTGCAQLGALEQGKWIHGY++ENRIT+D VV TALIEMY+KCG 
Sbjct: 363 MQIIRLKPDKFILVALLTGCAQLGALEQGKWIHGYINENRITVDAVVATALIEMYAKCGL 422

Query: 557 VDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMERVGAKPDDITFIGVLSA 616
           ++K+LEIFYEL +KD ASWTSIICGLAMNGK ++AL LFS+M   GAKPDDITFIGVLSA
Sbjct: 423 IEKALEIFYELREKDAASWTSIICGLAMNGKINKALELFSQMISGGAKPDDITFIGVLSA 482

Query: 617 CSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLLGRAGLLDEAEELIQEIPIENCEI 676
           CSHGGLV+EGRRFFN+M +V++I+PK+EHYGC+IDLLGRAGLLDEAEE I++IP EN EI
Sbjct: 483 CSHGGLVDEGRRFFNTMTEVYQIQPKLEHYGCLIDLLGRAGLLDEAEEWIRKIPNENNEI 542

Query: 677 VVPLYGALLSACRIHNNVDMGERLAKKLENIESCDSSIHTLLANIYASVDRWEDAKKVRR 736
           +VPLYGALLSACRI+ NVDMGE+LA  LE IES DSS HTLLANIYAS +RWED   VR+
Sbjct: 543 IVPLYGALLSACRIYGNVDMGEKLAALLEKIESKDSSFHTLLANIYASANRWEDVTNVRQ 602

Query: 737 KMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMMEICSMLNRVTGQLLGLKESQVE 795
           KMKE+GV+K+PGCS IE++GI+HEFLVGDPSH EM EI SML+R+   LL  K++ +E
Sbjct: 603 KMKEMGVRKVPGCSSIEINGIIHEFLVGDPSHSEMKEIYSMLDRMAKTLLDSKQNAME 653

BLAST of Csa1G418260.1 vs. TrEMBL
Match: B9STP1_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_0826200 PE=4 SV=1)

HSP 1 Score: 890.2 bits (2299), Expect = 1.9e-255
Identity = 431/639 (67.45%), Positives = 523/639 (81.85%), Query Frame = 1

Query: 171 LTKKSCIECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIF 230
           L+++SCI  L++CKSM  LKQI +QIFR+GL  D  ++NKLMAFC D   GNL YAEK+F
Sbjct: 35  LSQQSCISYLKSCKSMTHLKQIHAQIFRVGLHQDIVSLNKLMAFCTDPFNGNLNYAEKMF 94

Query: 231 NYVQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVR 290
            Y++ P L +YN+++K +AK+G  ++ L+LF +LREDGLWPD FTYPFV KAIG L +V 
Sbjct: 95  KYIRYPCLLIYNLIIKAFAKKGNYKRTLVLFSKLREDGLWPDNFTYPFVFKAIGYLGEVS 154

Query: 291 QGEKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYV 350
           + EK+RG + KTG++ D YV NSLIDMY +L+  +  K LFDEM  RD +SWNVMISGYV
Sbjct: 155 KAEKLRGLVTKTGLEFDTYVRNSLIDMYAQLALTDVMKMLFDEMPDRDVISWNVMISGYV 214

Query: 351 RCRRFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELGFTTR 410
           +CRRFEDAIN F  MQ+E    PDEATVVSTLSACTALK LELG +IH+YVR  + FT  
Sbjct: 215 KCRRFEDAINVFCRMQEESGLMPDEATVVSTLSACTALKRLELGKKIHHYVRDNVKFTPI 274

Query: 411 IDNALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRD 470
           I NALLDMY KCGCL+IAR +F+EM  KNVICWT+M+SGY NCG+L EAR+LF+ SP+RD
Sbjct: 275 IGNALLDMYCKCGCLSIARAVFEEMPSKNVICWTTMVSGYANCGELEEARELFEGSPIRD 334

Query: 471 VVLWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHG 530
           VV+WTAMINGYVQF+ FD+AVALFREMQI++VKPDKF VV+LLTGCAQ GA+EQGKWIH 
Sbjct: 335 VVIWTAMINGYVQFNRFDEAVALFREMQIRKVKPDKFIVVSLLTGCAQTGAIEQGKWIHE 394

Query: 531 YLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSE 590
           ++DENRI +D VVGTALIEMY+KCG ++K+LEIFY L  KDTASWTSIICGLAMNGKTS+
Sbjct: 395 FIDENRIPIDAVVGTALIEMYAKCGFIEKALEIFYGLRVKDTASWTSIICGLAMNGKTSK 454

Query: 591 ALRLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVI 650
           AL LFS+M++ G +PDDITFIGVLSACSHGGLVEEGR+FFNSM+  ++I+PKVEHYGC++
Sbjct: 455 ALELFSKMKQAGVRPDDITFIGVLSACSHGGLVEEGRKFFNSMRMEYQIKPKVEHYGCLV 514

Query: 651 DLLGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESC 710
           DLLGRAGLL+EAEELI++IP EN  I VPLYG+LLSACRI+ NV+MGER+AK+L   ES 
Sbjct: 515 DLLGRAGLLNEAEELIKKIPDENKAITVPLYGSLLSACRIYGNVEMGERVAKQLVKFESS 574

Query: 711 DSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPE 770
           DSS+HTLLANIYA  DRWED  KVRRKMK+LGVKK PGCS IEVD I+HEF  G PSHPE
Sbjct: 575 DSSVHTLLANIYAFADRWEDVTKVRRKMKDLGVKKTPGCSSIEVDSIIHEFFSGHPSHPE 634

Query: 771 MMEICSMLNRVTGQLLGLKESQV--ESVMPLYKDTQHCN 808
           M EI  MLN +   LLG  ++++  E ++ +  D Q C+
Sbjct: 635 MREIYYMLNIMAKPLLGSAKNEMEGEDLVGMTFDEQGCS 673

BLAST of Csa1G418260.1 vs. TrEMBL
Match: A0A067GKX6_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g007077mg PE=4 SV=1)

HSP 1 Score: 884.4 bits (2284), Expect = 1.0e-253
Identity = 428/609 (70.28%), Positives = 510/609 (83.74%), Query Frame = 1

Query: 186 MDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDPSLFVYNVMV 245
           M QLKQIQ+QIF+IGL+ + +T+NKLM FC   S GNL YAEKIF  +Q P L  YN+++
Sbjct: 1   MTQLKQIQAQIFQIGLQQNPETLNKLMVFCTHPSHGNLLYAEKIFGSIQSPCLLAYNLLI 60

Query: 246 KMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKVRGFIVKTGMD 305
           K +AK+G  RK LLLF +LRE G+ PD FTYPFV KA+G L +V++GEKV G++VKTG++
Sbjct: 61  KAFAKKGSFRKSLLLFSKLRERGVSPDNFTYPFVFKAVGWLGEVKKGEKVHGYVVKTGLE 120

Query: 306 LDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRFEDAINTFREM 365
            D YV NS++DMY  L  + N KKLFDEM  +D VSWNV ISG+V+C RFEDA++ FR M
Sbjct: 121 FDTYVCNSIMDMYGVLGKICNVKKLFDEMPDKDVVSWNVSISGHVKCMRFEDAVDVFRRM 180

Query: 366 QQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELGFTTRIDNALLDMYAKCGCL 425
           +Q  N  PDE TVVSTLSACTALKNLELG EIH Y+ +EL FT  + NALLDMY KCGCL
Sbjct: 181 RQGCNLMPDEGTVVSTLSACTALKNLELGKEIHRYINQELEFTPIMGNALLDMYCKCGCL 240

Query: 426 NIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMINGYVQFH 485
           + AR +FDEM  KNVICWTSM+SGY+NCG L +ARDLFD+SPVRD+VLWTAMINGYVQF+
Sbjct: 241 SEARELFDEMPNKNVICWTSMVSGYVNCGQLEKARDLFDRSPVRDIVLWTAMINGYVQFN 300

Query: 486 HFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGT 545
            FD+AVALFREMQI R+KPDKF +V LLTGCAQLGALEQGKWIHGY++ENRIT+D VV T
Sbjct: 301 RFDEAVALFREMQIIRLKPDKFILVALLTGCAQLGALEQGKWIHGYINENRITVDAVVAT 360

Query: 546 ALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMERVGAKP 605
           ALIEMY+KCG ++K+LEIFYEL +KD ASWTSIICGLAMNGK ++AL LFS+M   GAKP
Sbjct: 361 ALIEMYAKCGLIEKALEIFYELREKDAASWTSIICGLAMNGKINKALELFSQMISGGAKP 420

Query: 606 DDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLLGRAGLLDEAEEL 665
           DDITFIGVLSACSHGGLV+EGRRFFN+M +V++I+PK+EHYGC+IDLLGRAGLLDEAEEL
Sbjct: 421 DDITFIGVLSACSHGGLVDEGRRFFNTMTEVYQIQPKLEHYGCLIDLLGRAGLLDEAEEL 480

Query: 666 IQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDSSIHTLLANIYASV 725
           I++IP EN EI+VPLYGALLSACRI+ NVDMGE+LA  LE IES DSS HTLLANIYAS 
Sbjct: 481 IRKIPNENNEIIVPLYGALLSACRIYGNVDMGEKLAALLEKIESKDSSFHTLLANIYASA 540

Query: 726 DRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMMEICSMLNRVTGQL 785
           +RWED   VR+KMKE+GV+K+PGCS IE++GI+HEFLVGDPSH EM EI SML+R+   L
Sbjct: 541 NRWEDVTNVRQKMKEMGVRKVPGCSSIEINGIIHEFLVGDPSHSEMKEIYSMLDRMAKTL 600

Query: 786 LGLKESQVE 795
           L  K++ +E
Sbjct: 601 LDSKQNAME 609

BLAST of Csa1G418260.1 vs. TrEMBL
Match: A0A118JW06_CYNCS (Pentatricopeptide repeat-containing protein OS=Cynara cardunculus var. scolymus GN=Ccrd_004112 PE=4 SV=1)

HSP 1 Score: 866.7 bits (2238), Expect = 2.3e-248
Identity = 413/630 (65.56%), Positives = 515/630 (81.75%), Query Frame = 1

Query: 151 ISRHFSS-SPHLVPVLLRISKLTKKSCIECLRNCKSMDQLKQIQSQIFRIGLEGDRDTIN 210
           +SRHFS   PH      +    TK++CI  L+NCKSM+QLKQIQ+QIF +GL  + D I 
Sbjct: 9   LSRHFSLLRPHRFSTTQQFLIPTKRTCIHLLKNCKSMNQLKQIQTQIFVLGLAQNVDAIK 68

Query: 211 KLMAFCADSSLGNLRYAEKIFNYVQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGL 270
           K+MAF AD S+GNL YA++IF+ ++ P LFVYNVM+K Y K G   K L LF Q+R DGL
Sbjct: 69  KIMAFSADPSVGNLSYAQRIFDRIETPYLFVYNVMIKAYTKSGDFGKALCLFDQMRVDGL 128

Query: 271 WPDGFTYPFVLKAIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKK 330
           WPD +TYPFV K+IGCLR+V  GEK+ GF+VK+G + D YV NS++DMY EL   E+ KK
Sbjct: 129 WPDNYTYPFVFKSIGCLREVLTGEKIHGFVVKSGAEFDCYVCNSVMDMYGELGRSEDMKK 188

Query: 331 LFDEMTTRDSVSWNVMISGYVRCRRFEDAINTFREMQQEGNEKPDEATVVSTLSACTALK 390
           +FDEM  RD VSWNV+ISGYVRC++FEDA+  + +M++E + +PDEATVVSTLSAC ALK
Sbjct: 189 VFDEMPERDLVSWNVLISGYVRCKKFEDAVGVYLQMREEESVRPDEATVVSTLSACIALK 248

Query: 391 NLELGDEIHNYVRKELGFTTRIDNALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISG 450
           NLELG EIH+YV  E+GFTT I NALLDMY+KCGCL++AR IFD +  KNVICWTSM+SG
Sbjct: 249 NLELGKEIHHYVTHEIGFTTIIGNALLDMYSKCGCLDVAREIFDGLPKKNVICWTSMVSG 308

Query: 451 YINCGDLREARDLFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTV 510
           Y++CG L +AR LFD+SPV+D+VLWTAMINGYVQF++ D+A+ LF++MQ  R+KPDKFTV
Sbjct: 309 YVSCGQLDDARLLFDRSPVKDIVLWTAMINGYVQFNNVDEAMVLFQQMQTYRIKPDKFTV 368

Query: 511 VTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELED 570
           V LLTGCAQ+GALEQG+WIH Y++E+RI +D V GTALI+MY+KCG ++KSLE+FY L++
Sbjct: 369 VALLTGCAQVGALEQGEWIHEYMNEHRIIIDAVCGTALIDMYAKCGRIEKSLEVFYGLQE 428

Query: 571 KDTASWTSIICGLAMNGKTSEALRLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRF 630
           KDTASWTSIIC L++NGK+ +AL+LFSEM+  G +PDDITFIGVL+ACSHGGLVEEGRR 
Sbjct: 429 KDTASWTSIICALSLNGKSGKALQLFSEMKEYGFRPDDITFIGVLNACSHGGLVEEGRRH 488

Query: 631 FNSMKKVHRIEPKVEHYGCVIDLLGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACR 690
           F SMK V+ IEPK+EHYGC+IDLLGRAGLL EAE+++ +IP E  EI+VP+YGALLSACR
Sbjct: 489 FESMKSVYEIEPKIEHYGCLIDLLGRAGLLKEAEKIVNKIPKEKDEILVPVYGALLSACR 548

Query: 691 IHNNVDMGERLAKKLENIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGC 750
           ++ +VDMGE LA +L  IE  DSSIHTL+ANIYAS  RWED KKVR KM+ +GV+K PGC
Sbjct: 549 LYGDVDMGEHLADRLSEIEDGDSSIHTLMANIYASAGRWEDVKKVRSKMRAIGVRKEPGC 608

Query: 751 SLIEVDGIVHEFLVGDPSHPEMMEICSMLN 780
           S IEV+G VHEFLVGD SHP+M+++ S LN
Sbjct: 609 SSIEVNGNVHEFLVGDASHPDMIDVYSSLN 638

BLAST of Csa1G418260.1 vs. TAIR10
Match: AT1G31430.1 (AT1G31430.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 677.9 bits (1748), Expect = 7.5e-195
Identity = 331/565 (58.58%), Positives = 427/565 (75.58%), Query Frame = 1

Query: 233 VQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQG 292
           +Q PSL +YN M+K  A      KVL LF +LR  GL+PD FT P VLK+IG LR V +G
Sbjct: 6   LQTPSLLMYNKMLKSLADGKSFTKVLALFGELRGQGLYPDNFTLPVVLKSIGRLRKVIEG 65

Query: 293 EKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRC 352
           EKV G+ VK G++ D+YV NSL+ MY  L  +E   K+FDEM  RD VSWN +IS YV  
Sbjct: 66  EKVHGYAVKAGLEFDSYVSNSLMGMYASLGKIEITHKVFDEMPQRDVVSWNGLISSYVGN 125

Query: 353 RRFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELGFTTRID 412
            RFEDAI  F+ M QE N K DE T+VSTLSAC+ALKNLE+G+ I+ +V  E   + RI 
Sbjct: 126 GRFEDAIGVFKRMSQESNLKFDEGTIVSTLSACSALKNLEIGERIYRFVVTEFEMSVRIG 185

Query: 413 NALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVV 472
           NAL+DM+ KCGCL+ AR +FD M  KNV CWTSM+ GY++ G + EAR LF++SPV+DVV
Sbjct: 186 NALVDMFCKCGCLDKARAVFDSMRDKNVKCWTSMVFGYVSTGRIDEARVLFERSPVKDVV 245

Query: 473 LWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYL 532
           LWTAM+NGYVQF+ FD+A+ LFR MQ   ++PD F +V+LLTGCAQ GALEQGKWIHGY+
Sbjct: 246 LWTAMMNGYVQFNRFDEALELFRCMQTAGIRPDNFVLVSLLTGCAQTGALEQGKWIHGYI 305

Query: 533 DENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEAL 592
           +ENR+T+D VVGTAL++MY+KCGC++ +LE+FYE++++DTASWTS+I GLAMNG +  AL
Sbjct: 306 NENRVTVDKVVGTALVDMYAKCGCIETALEVFYEIKERDTASWTSLIYGLAMNGMSGRAL 365

Query: 593 RLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDL 652
            L+ EME VG + D ITF+ VL+AC+HGG V EGR+ F+SM + H ++PK EH  C+IDL
Sbjct: 366 DLYYEMENVGVRLDAITFVAVLTACNHGGFVAEGRKIFHSMTERHNVQPKSEHCSCLIDL 425

Query: 653 LGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDS 712
           L RAGLLDEAEELI ++  E+ E +VP+Y +LLSA R + NV + ER+A+KLE +E  DS
Sbjct: 426 LCRAGLLDEAEELIDKMRGESDETLVPVYCSLLSAARNYGNVKIAERVAEKLEKVEVSDS 485

Query: 713 SIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDP--SHPE 772
           S HTLLA++YAS +RWED   VRRKMK+LG++K PGCS IE+DG+ HEF+VGD   SHP+
Sbjct: 486 SAHTLLASVYASANRWEDVTNVRRKMKDLGIRKFPGCSSIEIDGVGHEFIVGDDLLSHPK 545

Query: 773 MMEICSMLNRVTGQLLGLKESQVES 796
           M EI SML++ T  +L L+  +++S
Sbjct: 546 MDEINSMLHQTTNLMLDLEHKEIDS 570

BLAST of Csa1G418260.1 vs. TAIR10
Match: AT2G22410.1 (AT2G22410.1 SLOW GROWTH 1)

HSP 1 Score: 512.7 bits (1319), Expect = 4.2e-145
Identity = 258/607 (42.50%), Positives = 397/607 (65.40%), Query Frame = 1

Query: 177 IECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDP 236
           +  L  CK +  LKQIQ+Q+   GL  D    ++L+AFCA S    L Y+ KI   +++P
Sbjct: 57  LSLLEKCKLLLHLKQIQAQMIINGLILDPFASSRLIAFCALSESRYLDYSVKILKGIENP 116

Query: 237 SLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLW---PDGFTYPFVLKAIGCLRDVRQGE 296
           ++F +NV ++ +++    ++  LL++Q+   G     PD FTYP + K    LR    G 
Sbjct: 117 NIFSWNVTIRGFSESENPKESFLLYKQMLRHGCCESRPDHFTYPVLFKVCADLRLSSLGH 176

Query: 297 KVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCR 356
            + G ++K  ++L ++V+N+ I M+    ++ENA+K+FDE   RD VSWN +I+GY +  
Sbjct: 177 MILGHVLKLRLELVSHVHNASIHMFASCGDMENARKVFDESPVRDLVSWNCLINGYKKIG 236

Query: 357 RFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKE-LGFTTRID 416
             E AI  ++ M+ EG  KPD+ T++  +S+C+ L +L  G E + YV++  L  T  + 
Sbjct: 237 EAEKAIYVYKLMESEG-VKPDDVTMIGLVSSCSMLGDLNRGKEFYEYVKENGLRMTIPLV 296

Query: 417 NALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVV 476
           NAL+DM++KCG ++ AR IFD +  + ++ WT+MISGY  CG L  +R LFD    +DVV
Sbjct: 297 NALMDMFSKCGDIHEARRIFDNLEKRTIVSWTTMISGYARCGLLDVSRKLFDDMEEKDVV 356

Query: 477 LWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYL 536
           LW AMI G VQ     DA+ALF+EMQ    KPD+ T++  L+ C+QLGAL+ G WIH Y+
Sbjct: 357 LWNAMIGGSVQAKRGQDALALFQEMQTSNTKPDEITMIHCLSACSQLGALDVGIWIHRYI 416

Query: 537 DENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEAL 596
           ++  ++++V +GT+L++MY+KCG + ++L +F+ ++ +++ ++T+II GLA++G  S A+
Sbjct: 417 EKYSLSLNVALGTSLVDMYAKCGNISEALSVFHGIQTRNSLTYTAIIGGLALHGDASTAI 476

Query: 597 RLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDL 656
             F+EM   G  PD+ITFIG+LSAC HGG+++ GR +F+ MK    + P+++HY  ++DL
Sbjct: 477 SYFNEMIDAGIAPDEITFIGLLSACCHGGMIQTGRDYFSQMKSRFNLNPQLKHYSIMVDL 536

Query: 657 LGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDS 716
           LGRAGLL+EA+ L++ +P+E    V   +GALL  CR+H NV++GE+ AKKL  ++  DS
Sbjct: 537 LGRAGLLEEADRLMESMPMEADAAV---WGALLFGCRMHGNVELGEKAAKKLLELDPSDS 596

Query: 717 SIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMM 776
            I+ LL  +Y   + WEDAK+ RR M E GV+K+PGCS IEV+GIV EF+V D S PE  
Sbjct: 597 GIYVLLDGMYGEANMWEDAKRARRMMNERGVEKIPGCSSIEVNGIVCEFIVRDKSRPESE 656

Query: 777 EICSMLN 780
           +I   L+
Sbjct: 657 KIYDRLH 659

BLAST of Csa1G418260.1 vs. TAIR10
Match: AT3G15930.1 (AT3G15930.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 497.3 bits (1279), Expect = 1.8e-140
Identity = 252/609 (41.38%), Positives = 397/609 (65.19%), Query Frame = 1

Query: 177 IECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDP 236
           I  L  CK+ DQ KQ+ SQ    G+  +     KL  F      G++ YA K+F  + +P
Sbjct: 38  ISILGVCKTTDQFKQLHSQSITRGVAPNPTFQKKLFVFWCSRLGGHVSYAYKLFVKIPEP 97

Query: 237 SLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRD---VRQGE 296
            + V+N M+K ++K     + + L+  + ++G+ PD  T+PF+L   G  RD   +  G+
Sbjct: 98  DVVVWNNMIKGWSKVDCDGEGVRLYLNMLKEGVTPDSHTFPFLLN--GLKRDGGALACGK 157

Query: 297 KVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCR 356
           K+   +VK G+  + YV N+L+ MY     ++ A+ +FD     D  SWN+MISGY R +
Sbjct: 158 KLHCHVVKFGLGSNLYVQNALVKMYSLCGLMDMARGVFDRRCKEDVFSWNLMISGYNRMK 217

Query: 357 RFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRK-ELGFTTRID 416
            +E++I    EM++     P   T++  LSAC+ +K+ +L   +H YV + +   + R++
Sbjct: 218 EYEESIELLVEMERN-LVSPTSVTLLLVLSACSKVKDKDLCKRVHEYVSECKTEPSLRLE 277

Query: 417 NALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVV 476
           NAL++ YA CG ++IA  IF  M  ++VI WTS++ GY+  G+L+ AR  FD+ PVRD +
Sbjct: 278 NALVNAYAACGEMDIAVRIFRSMKARDVISWTSIVKGYVERGNLKLARTYFDQMPVRDRI 337

Query: 477 LWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYL 536
            WT MI+GY++   F++++ +FREMQ   + PD+FT+V++LT CA LG+LE G+WI  Y+
Sbjct: 338 SWTIMIDGYLRAGCFNESLEIFREMQSAGMIPDEFTMVSVLTACAHLGSLEIGEWIKTYI 397

Query: 537 DENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEAL 596
           D+N+I  DVVVG ALI+MY KCGC +K+ ++F++++ +D  +WT+++ GLA NG+  EA+
Sbjct: 398 DKNKIKNDVVVGNALIDMYFKCGCSEKAQKVFHDMDQRDKFTWTAMVVGLANNGQGQEAI 457

Query: 597 RLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDL 656
           ++F +M+ +  +PDDIT++GVLSAC+H G+V++ R+FF  M+  HRIEP + HYGC++D+
Sbjct: 458 KVFFQMQDMSIQPDDITYLGVLSACNHSGMVDQARKFFAKMRSDHRIEPSLVHYGCMVDM 517

Query: 657 LGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDS 716
           LGRAGL+ EA E+++++P+    IV   +GALL A R+HN+  M E  AKK+  +E  + 
Sbjct: 518 LGRAGLVKEAYEILRKMPMNPNSIV---WGALLGASRLHNDEPMAELAAKKILELEPDNG 577

Query: 717 SIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMM 776
           +++ LL NIYA   RW+D ++VRRK+ ++ +KK PG SLIEV+G  HEF+ GD SH +  
Sbjct: 578 AVYALLCNIYAGCKRWKDLREVRRKIVDVAIKKTPGFSLIEVNGFAHEFVAGDKSHLQSE 637

Query: 777 EICSMLNRV 782
           EI   L  +
Sbjct: 638 EIYMKLEEL 640

BLAST of Csa1G418260.1 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 456.4 bits (1173), Expect = 3.6e-128
Identity = 228/612 (37.25%), Positives = 374/612 (61.11%), Query Frame = 1

Query: 177 IECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDP 236
           I  +  C S+ QLKQ    + R G   D  + +KL A  A SS  +L YA K+F+ +  P
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKP 93

Query: 237 SLFVYNVMVKMYAKR-GILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKV 296
           + F +N +++ YA     +  +      + E   +P+ +T+PF++KA   +  +  G+ +
Sbjct: 94  NSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSL 153

Query: 297 RGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRF 356
            G  VK+ +  D +V NSLI  Y+   ++++A K+F  +  +D VSWN MI+G+V+    
Sbjct: 154 HGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSP 213

Query: 357 EDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKE-LGFTTRIDNA 416
           + A+  F++M+ E + K    T+V  LSAC  ++NLE G ++ +Y+ +  +     + NA
Sbjct: 214 DKALELFKKMESE-DVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANA 273

Query: 417 LLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLW 476
           +LDMY KCG +  A+ +FD M  K+ + WT+M+ GY    D   AR++ +  P +D+V W
Sbjct: 274 MLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAW 333

Query: 477 TAMINGYVQFHHFDDAVALFREMQIQR-VKPDKFTVVTLLTGCAQLGALEQGKWIHGYLD 536
            A+I+ Y Q    ++A+ +F E+Q+Q+ +K ++ T+V+ L+ CAQ+GALE G+WIH Y+ 
Sbjct: 334 NALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIK 393

Query: 537 ENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALR 596
           ++ I M+  V +ALI MYSKCG ++KS E+F  +E +D   W+++I GLAM+G  +EA+ 
Sbjct: 394 KHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVD 453

Query: 597 LFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLL 656
           +F +M+    KP+ +TF  V  ACSH GLV+E    F+ M+  + I P+ +HY C++D+L
Sbjct: 454 MFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVL 513

Query: 657 GRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDSS 716
           GR+G L++A + I+ +PI     V   +GALL AC+IH N+++ E    +L  +E  +  
Sbjct: 514 GRSGYLEKAVKFIEAMPIPPSTSV---WGALLGACKIHANLNLAEMACTRLLELEPRNDG 573

Query: 717 IHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMME 776
            H LL+NIYA + +WE+  ++R+ M+  G+KK PGCS IE+DG++HEFL GD +HP   +
Sbjct: 574 AHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEK 633

Query: 777 ICSMLNRVTGQL 786
           +   L+ V  +L
Sbjct: 634 VYGKLHEVMEKL 641


HSP 2 Score: 146.0 bits (367), Expect = 1.0e-34
Identity = 140/550 (25.45%), Positives = 244/550 (44.36%), Query Frame = 1

Query: 219 SLGNLRYAEKIFNYVQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPF 278
           S G+L  A K+F  +++  +  +N M+  + ++G   K L LF+++  + +     T   
Sbjct: 178 SCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVG 237

Query: 279 VLKAIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRD 338
           VL A   +R++  G +V  +I +  ++++  + N+++DMY +  ++E+AK+LFD M  +D
Sbjct: 238 VLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKD 297

Query: 339 SVSWNVMISGYVRCRRFEDAINTFREMQQ-------------EGNEKPDEA--------- 398
           +V+W  M+ GY     +E A      M Q             E N KP+EA         
Sbjct: 298 NVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQL 357

Query: 399 ---------TVVSTLSACTALKNLELGDEIHNYVRKE-LGFTTRIDNALLDMYAKCGCLN 458
                    T+VSTLSAC  +  LELG  IH+Y++K  +     + +AL+ MY+KCG L 
Sbjct: 358 QKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLE 417

Query: 459 IARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDV----VLWTAMINGYV 518
            +R +F+ +  ++V  W++MI G    G   EA D+F K    +V    V +T +     
Sbjct: 418 KSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACS 477

Query: 519 QFHHFDDAVALFREMQIQ-RVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDV 578
                D+A +LF +M+    + P++     ++    + G LE+      +++   I    
Sbjct: 478 HTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAV---KFIEAMPIPPST 537

Query: 579 VVGTALI---EMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEM 638
            V  AL+   ++++     + +     ELE ++  +   +    A  GK      L   M
Sbjct: 538 SVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHM 597

Query: 639 ERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGC---VIDLLGR 698
              G K +        S+    G++ E    F S    H +  KV  YG    V++ L  
Sbjct: 598 RVTGLKKEP-----GCSSIEIDGMIHE----FLSGDNAHPMSEKV--YGKLHEVMEKLKS 657

Query: 699 AGLLDEAEELIQEIPIENC-EIVVPLYGALLSAC--RIHNNVDMGERLAKKLENIESCDS 723
            G   E  +++Q I  E   E  + L+   L+ C   I        R+ K L     C  
Sbjct: 658 NGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRVCGDC-H 712

BLAST of Csa1G418260.1 vs. TAIR10
Match: AT3G22690.1 (AT3G22690.1 Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885))

HSP 1 Score: 436.4 bits (1121), Expect = 3.8e-122
Identity = 228/596 (38.26%), Positives = 365/596 (61.24%), Query Frame = 1

Query: 191 QIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDPSLFVYNVMVKMYAK 250
           QI   I ++G   D    N L+ F A+   G L  A K+F+ + + ++  +  M+  YA+
Sbjct: 155 QIHGLIVKMGYAKDLFVQNSLVHFYAEC--GELDSARKVFDEMSERNVVSWTSMICGYAR 214

Query: 251 RGILRKVL-LLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKVRGFIVKTGMDLDNY 310
           R   +  + L F+ +R++ + P+  T   V+ A   L D+  GEKV  FI  +G+++++ 
Sbjct: 215 RDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDL 274

Query: 311 VYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRFEDAINTFREMQQEG 370
           + ++L+DMY + + ++ AK+LFDE    +    N M S YVR     +A+  F  M   G
Sbjct: 275 MVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSG 334

Query: 371 NEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELGFTT--RIDNALLDMYAKCGCLNI 430
             +PD  +++S +S+C+ L+N+  G   H YV +  GF +   I NAL+DMY KC   + 
Sbjct: 335 -VRPDRISMLSAISSCSQLRNILWGKSCHGYVLRN-GFESWDNICNALIDMYMKCHRQDT 394

Query: 431 ARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMINGYVQFHHF 490
           A  IFD MS K V+ W S+++GY+  G++  A + F+  P +++V W  +I+G VQ   F
Sbjct: 395 AFRIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLF 454

Query: 491 DDAVALFREMQIQR-VKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTA 550
           ++A+ +F  MQ Q  V  D  T++++ + C  LGAL+  KWI+ Y+++N I +DV +GT 
Sbjct: 455 EEAIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTT 514

Query: 551 LIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMERVGAKPD 610
           L++M+S+CG  + ++ IF  L ++D ++WT+ I  +AM G    A+ LF +M   G KPD
Sbjct: 515 LVDMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPD 574

Query: 611 DITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLLGRAGLLDEAEELI 670
            + F+G L+ACSHGGLV++G+  F SM K+H + P+  HYGC++DLLGRAGLL+EA +LI
Sbjct: 575 GVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLI 634

Query: 671 QEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDSSIHTLLANIYASVD 730
           +++P+E  +++   + +LL+ACR+  NV+M    A+K++ +    +  + LL+N+YAS  
Sbjct: 635 EDMPMEPNDVI---WNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAG 694

Query: 731 RWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMMEICSMLNRVT 783
           RW D  KVR  MKE G++K PG S I++ G  HEF  GD SHPEM  I +ML+ V+
Sbjct: 695 RWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVS 743


HSP 2 Score: 248.8 bits (634), Expect = 1.1e-65
Identity = 157/521 (30.13%), Positives = 257/521 (49.33%), Query Frame = 1

Query: 163 PVLLRISKLTKKSCIECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMA-FCADSSLG 222
           P LL  SK TK +    L+NCK++D+LK     + + GL+ D  TI KL+A  C   +  
Sbjct: 23  PSLLNQSKCTKATP-SSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARSCELGTRE 82

Query: 223 NLRYAEKIF-NYVQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVL 282
           +L +A+++F N     + F+YN +++ YA  G+  + +LLF ++   G+ PD +T+PF L
Sbjct: 83  SLSFAKEVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGL 142

Query: 283 KAIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSV 342
            A    R    G ++ G IVK G   D +V NSL+  Y E   +++A+K+FDEM+ R+ V
Sbjct: 143 SACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVV 202

Query: 343 SWNVMISGYVRCRRFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNY 402
           SW  MI GY R    +DA++ F  M ++    P+  T+V  +SAC  L++LE G++++ +
Sbjct: 203 SWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAF 262

Query: 403 VRKE-LGFTTRIDNALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREA 462
           +R   +     + +AL+DMY KC  +++A+ +FDE    N+    +M S Y+  G  REA
Sbjct: 263 IRNSGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREA 322

Query: 463 RDLFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQL 522
                                          + +F  M    V+PD+ ++++ ++ C+QL
Sbjct: 323 -------------------------------LGVFNLMMDSGVRPDRISMLSAISSCSQL 382

Query: 523 GALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSII 582
             +  GK  HGY+  N       +  ALI+MY KC   D +  IF  + +K   +W SI+
Sbjct: 383 RNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSIV 442

Query: 583 CGLAMNGKTSEALRLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRI 642
            G   NG+   A   F  M     + + +++  ++S    G L EE    F SM+    +
Sbjct: 443 AGYVENGEVDAAWETFETM----PEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGV 502

Query: 643 EPKVEHYGCVIDLLGRAGLLDEAEELIQEIPIENCEIVVPL 681
                    +    G  G LD A+ +   I     ++ V L
Sbjct: 503 NADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRL 507

BLAST of Csa1G418260.1 vs. NCBI nr
Match: gi|700210313|gb|KGN65409.1| (hypothetical protein Csa_1G418260 [Cucumis sativus])

HSP 1 Score: 1650.6 bits (4273), Expect = 0.0e+00
Identity = 811/811 (100.00%), Positives = 811/811 (100.00%), Query Frame = 1

Query: 1   MDKKRVAIPLVCHGHSRPIVDLFSSLVTPDGFFLVSASKDSNPMLRNGENGDWIGTFEGH 60
           MDKKRVAIPLVCHGHSRPIVDLFSSLVTPDGFFLVSASKDSNPMLRNGENGDWIGTFEGH
Sbjct: 1   MDKKRVAIPLVCHGHSRPIVDLFSSLVTPDGFFLVSASKDSNPMLRNGENGDWIGTFEGH 60

Query: 61  KGAVWSCCLDTNALRAATGSADFSATHCGLFPFPQAFASSRLHHLLTPHPFLGHMKHFDC 120
           KGAVWSCCLDTNALRAATGSADFSATHCGLFPFPQAFASSRLHHLLTPHPFLGHMKHFDC
Sbjct: 61  KGAVWSCCLDTNALRAATGSADFSATHCGLFPFPQAFASSRLHHLLTPHPFLGHMKHFDC 120

Query: 121 AFAFSEIFQFFKHSQSSHMLPLSIIRRVQFISRHFSSSPHLVPVLLRISKLTKKSCIECL 180
           AFAFSEIFQFFKHSQSSHMLPLSIIRRVQFISRHFSSSPHLVPVLLRISKLTKKSCIECL
Sbjct: 121 AFAFSEIFQFFKHSQSSHMLPLSIIRRVQFISRHFSSSPHLVPVLLRISKLTKKSCIECL 180

Query: 181 RNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDPSLFV 240
           RNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDPSLFV
Sbjct: 181 RNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDPSLFV 240

Query: 241 YNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKVRGFIV 300
           YNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKVRGFIV
Sbjct: 241 YNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKVRGFIV 300

Query: 301 KTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRFEDAIN 360
           KTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRFEDAIN
Sbjct: 301 KTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRFEDAIN 360

Query: 361 TFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELGFTTRIDNALLDMYA 420
           TFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELGFTTRIDNALLDMYA
Sbjct: 361 TFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELGFTTRIDNALLDMYA 420

Query: 421 KCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMING 480
           KCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMING
Sbjct: 421 KCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMING 480

Query: 481 YVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMD 540
           YVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMD
Sbjct: 481 YVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMD 540

Query: 541 VVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMER 600
           VVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMER
Sbjct: 541 VVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMER 600

Query: 601 VGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLLGRAGLLD 660
           VGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLLGRAGLLD
Sbjct: 601 VGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLLGRAGLLD 660

Query: 661 EAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDSSIHTLLAN 720
           EAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDSSIHTLLAN
Sbjct: 661 EAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLENIESCDSSIHTLLAN 720

Query: 721 IYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMMEICSMLNR 780
           IYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMMEICSMLNR
Sbjct: 721 IYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMMEICSMLNR 780

Query: 781 VTGQLLGLKESQVESVMPLYKDTQHCNFVES 812
           VTGQLLGLKESQVESVMPLYKDTQHCNFVES
Sbjct: 781 VTGQLLGLKESQVESVMPLYKDTQHCNFVES 811

BLAST of Csa1G418260.1 vs. NCBI nr
Match: gi|659130070|ref|XP_008464984.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g31430 [Cucumis melo])

HSP 1 Score: 1243.4 bits (3216), Expect = 0.0e+00
Identity = 622/673 (92.42%), Positives = 639/673 (94.95%), Query Frame = 1

Query: 139 MLPLSIIRRVQFISRHFSSSPHLVPVLLRISKLTKKSCIECLRNCKSMDQLKQIQSQIFR 198
           MLPLSIIRRVQFISRHFSSSPHLV V +RISK TKKSCIE LRNCKSM+QLK+IQSQIFR
Sbjct: 1   MLPLSIIRRVQFISRHFSSSPHLVTVPIRISKPTKKSCIEYLRNCKSMEQLKRIQSQIFR 60

Query: 199 IGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDPSLFVYNVMVKMYAKRGILRKVL 258
           IGLEGDRD INKLMAFCAD SLGNLRYAEKIF+YVQDPSLFVYNVMVKMYAKRG+LRKVL
Sbjct: 61  IGLEGDRDIINKLMAFCADLSLGNLRYAEKIFDYVQDPSLFVYNVMVKMYAKRGLLRKVL 120

Query: 259 LLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLIDMY 318
           LLFQQLRED LWPD FTYPFVLKAIGCLRDV QGEK+ GF+VKTGM+LDNYV NSL+DMY
Sbjct: 121 LLFQQLREDQLWPDNFTYPFVLKAIGCLRDVGQGEKLHGFVVKTGMNLDNYVCNSLMDMY 180

Query: 319 YELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRFEDAINTFREMQQEGNEKPDEATV 378
            EL NVENAKKLFDEMTTRDSVSWNVMISGYV CRRFEDAINTFREMQQEGNEKPDEATV
Sbjct: 181 SELGNVENAKKLFDEMTTRDSVSWNVMISGYVGCRRFEDAINTFREMQQEGNEKPDEATV 240

Query: 379 VSTLSACTALKNLELGDEIHNYVRKELGFTTRIDNALLDMYAKCGCLNIARNIFDEMSMK 438
           VSTLSACTALKNLELGDEIHNYVRKELGFT RIDNALLDMYAKCGCLNI+RNIFDEM MK
Sbjct: 241 VSTLSACTALKNLELGDEIHNYVRKELGFTPRIDNALLDMYAKCGCLNISRNIFDEMPMK 300

Query: 439 NVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFREMQ 498
           NVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFREMQ
Sbjct: 301 NVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFREMQ 360

Query: 499 IQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVD 558
           IQ+VKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRIT+DVVVGTALIEMYSKCGCVD
Sbjct: 361 IQKVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITIDVVVGTALIEMYSKCGCVD 420

Query: 559 KSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMERVGAKPDDITFIGVLSACS 618
           KSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEME VGAKPDDITFIGVLSACS
Sbjct: 421 KSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMELVGAKPDDITFIGVLSACS 480

Query: 619 HGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLLGRAGLLDEAEELIQEIPIENCEIVV 678
           HGGLVEEGRRFFNSMKKV+RIEPKVEHYGCV+DLLGRAGLLDEAEELIQEI IENCEIVV
Sbjct: 481 HGGLVEEGRRFFNSMKKVYRIEPKVEHYGCVVDLLGRAGLLDEAEELIQEISIENCEIVV 540

Query: 679 PLYGALLSACRIHNNVDMGERLAKKLENIESCDSSIHTLLANIYASVDRWEDAKKVRRKM 738
            LYGALLSACRIHNNVDMGERLAKKL NIE CDSSIH LLANIYAS DRWEDAKKVRRKM
Sbjct: 541 SLYGALLSACRIHNNVDMGERLAKKLVNIEPCDSSIHALLANIYASADRWEDAKKVRRKM 600

Query: 739 KELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMMEICSMLNRVTGQLLGLKESQVESVMP 798
           KELGVKKMPGCS IEVDGIVHEFLVGDPSHPE +EI SMLNRV+ QLLGLKESQ   +M 
Sbjct: 601 KELGVKKMPGCSSIEVDGIVHEFLVGDPSHPETIEIRSMLNRVSRQLLGLKESQ---LMS 660

Query: 799 LYKDTQHCNFVES 812
              DTQHCNFVES
Sbjct: 661 FDNDTQHCNFVES 670

BLAST of Csa1G418260.1 vs. NCBI nr
Match: gi|359484390|ref|XP_002281719.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g31430 [Vitis vinifera])

HSP 1 Score: 928.7 bits (2399), Expect = 6.9e-267
Identity = 452/662 (68.28%), Positives = 543/662 (82.02%), Query Frame = 1

Query: 139 MLPLSIIRRVQFISRHFSSSP-HLVPVLLRISKLTKKSCIECLRNCKSMDQLKQIQSQIF 198
           ML  +  +  +F S HF S P HL       S  TKKSCI  L+NCKSM  LKQIQ+QI 
Sbjct: 1   MLSQTKFQLFKFTSLHFLSKPLHLSTS----SHFTKKSCIFLLKNCKSMQHLKQIQTQIL 60

Query: 199 RIGLEGDRDTINKLMAFCADSSLGNLRYAEKIFNYVQDPSLFVYNVMVKMYAKRGILRKV 258
           R G     DT+NK M  C D S+GNL YAE+IFNY+  P LF+YN+++K + K G  RK 
Sbjct: 61  RTGFHQSGDTLNKFMVCCTDPSIGNLHYAERIFNYIDIPGLFIYNLVIKAFTKNGSFRKA 120

Query: 259 LLLFQQLREDGLWPDGFTYPFVLKAIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLIDM 318
           +LLF+QLRE+GL PD FTYPFV KAIGCL +VR+GEKV GF+VK+G++ D YV NSL+DM
Sbjct: 121 VLLFRQLREEGLSPDNFTYPFVFKAIGCLGEVREGEKVYGFVVKSGLEFDTYVCNSLMDM 180

Query: 319 YYELSNVENAKKLFDEMTTRDSVSWNVMISGYVRCRRFEDAINTFREMQQEGNEKPDEAT 378
           Y E+  V+N +++F+EM  RD VSWNV+ISGYV+CRR+EDA++ FR MQQ+ + +P+EAT
Sbjct: 181 YAEVGRVQNLRQVFEEMPQRDVVSWNVLISGYVKCRRYEDAVDVFRRMQQQSSLRPNEAT 240

Query: 379 VVSTLSACTALKNLELGDEIHNYVRKELGFTTRIDNALLDMYAKCGCLNIARNIFDEMSM 438
           VVSTLSAC ALK LELG EIH YVR++LGFT +I NAL+DMY KCG L+IAR IF++M +
Sbjct: 241 VVSTLSACIALKMLELGKEIHRYVREQLGFTIKIGNALVDMYCKCGHLSIAREIFNDMPI 300

Query: 439 KNVICWTSMISGYINCGDLREARDLFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFREM 498
           K VICWTSM+SGY+NCG L EAR+LF++SPVRDVVLWTAMINGYVQF+ FDDAVALFREM
Sbjct: 301 KTVICWTSMVSGYVNCGQLDEARELFERSPVRDVVLWTAMINGYVQFNRFDDAVALFREM 360

Query: 499 QIQRVKPDKFTVVTLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCV 558
           QI+RV PD+FT+V LLTGCAQLG LEQGKWIHGY+DEN+I +D VVGTALIEMY+KCG +
Sbjct: 361 QIKRVSPDRFTLVALLTGCAQLGTLEQGKWIHGYIDENKIMIDAVVGTALIEMYAKCGFI 420

Query: 559 DKSLEIFYELEDKDTASWTSIICGLAMNGKTSEALRLFSEMERVGAKPDDITFIGVLSAC 618
           +KSLEIF  L++KDTASWTSIICGLAMNGKTS+AL LF+EM + G KPDDITFIGVLSAC
Sbjct: 421 EKSLEIFNGLKEKDTASWTSIICGLAMNGKTSKALELFAEMVQTGVKPDDITFIGVLSAC 480

Query: 619 SHGGLVEEGRRFFNSMKKVHRIEPKVEHYGCVIDLLGRAGLLDEAEELIQEIPIENCEIV 678
           SHGGLVEEGR+ F SM  V++IEPK+EHYGC+IDLLGRAG LDEAEELI++ P  N E++
Sbjct: 481 SHGGLVEEGRKHFRSMTAVYQIEPKLEHYGCLIDLLGRAGQLDEAEELIEKSPNVNNEVI 540

Query: 679 VPLYGALLSACRIHNNVDMGERLAKKLENIESCDSSIHTLLANIYASVDRWEDAKKVRRK 738
           VPLYGALLSACR H NV+MGER+AK+L  IES DSS+HTLLANIYAS DRWED  KVRRK
Sbjct: 541 VPLYGALLSACRTHGNVEMGERVAKRLVGIESGDSSVHTLLANIYASADRWEDVTKVRRK 600

Query: 739 MKELGVKKMPGCSLIEVDGIVHEFLVGDPSHPEMMEICSMLNRVTGQLLGLKESQVESVM 798
           MK+LGVKK+PGCS +EV+GIVHEFLVGD SHPEM EI SML+ +   LLGL E+++E  +
Sbjct: 601 MKDLGVKKVPGCSSVEVNGIVHEFLVGDASHPEMREIYSMLDSIAKPLLGLDENEMEGEI 658

Query: 799 PL 800
           P+
Sbjct: 661 PV 658
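
The percentages in an HSP summary line follow directly from the residue counts: identical (or positive-scoring) positions divided by the alignment length. The sketch below recomputes them from such a line. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern accepts either spelling of the Positives label.

```python
import re

# Hypothetical helper: pulls the identity and positives counts out of a
# BLAST HSP summary line. Tolerates "Positives" as well as "Postives".
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Pos(?:i)?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(summary_line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    m = HSP_RE.search(summary_line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 452/662 (68.28%), Positives = 543/662 (82.02%), Query Frame = 1"
print(hsp_percentages(line))  # (68.28, 82.02)
```

For the Vitis vinifera hit above, 452 identical residues over an alignment length of 662 gives 68.28%, and 543 positive-scoring positions over 662 gives 82.02%.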

BLAST of Csa1G418260.1 vs. NCBI nr
Match: gi|1009160176|ref|XP_015898211.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g31430 [Ziziphus jujuba])

HSP 1 Score: 922.5 bits (2383), Expect = 5.0e-265
Identity = 452/640 (70.62%), Positives = 529/640 (82.66%), Query Frame = 1

Query: 151 ISRHFSSSPHLVPVLLRISKLTKKSCIECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINK 210
           I R F S+P +       S LT +SCI+ L+ CK+M QLKQIQ+QIF  GL   R  +NK
Sbjct: 13  IFRGFFSTPLVFTKFSGKSILTGESCIQYLQRCKTMRQLKQIQTQIFMAGLHQSRGHLNK 72

Query: 211 LMAFCADSSLGNLRYAEKIFNYVQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLW 270
           LM FC   SLGN++YAEKIF Y+QDP LFVYNVMVK +AK G  RK + LF +LRE GLW
Sbjct: 73  LMIFCTHPSLGNMQYAEKIFGYIQDPCLFVYNVMVKAFAKLGSFRKTIFLFWRLREGGLW 132

Query: 271 PDGFTYPFVLKAIGCLRDVRQGEKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKL 330
           PD +TYPFVLKAIGCL ++R+GEK  GF++KTG++ D YV NSLIDMY +L  VE  +KL
Sbjct: 133 PDNYTYPFVLKAIGCLGEIREGEKAHGFVIKTGLEFDTYVCNSLIDMYSQLGKVEYFRKL 192

Query: 331 FDEMTTRDSVSWNVMISGYVRCRRFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKN 390
           F+EM  RDSVSWNV ISGYVRCRRF+DA+N FR M  EGNEKPDEAT+VSTL ACTALK 
Sbjct: 193 FEEMPERDSVSWNVTISGYVRCRRFDDALNVFRRMAAEGNEKPDEATIVSTLPACTALKK 252

Query: 391 LELGDEIHNYVRKELGFTTRIDNALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGY 450
           LELG EIH+YVR EL  TT I NALLDMYAKCGCL  AR IFDEM  KNVICWTSM+SGY
Sbjct: 253 LELGREIHDYVRSELELTTIISNALLDMYAKCGCLGEARKIFDEMPTKNVICWTSMVSGY 312

Query: 451 INCGDLREARDLFDKSPVRDVVLWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVV 510
           +NCG L EAR+LF++ PVRDVVLWTAMINGYVQ++  DDAVALF+EMQ +RV+ DKFTVV
Sbjct: 313 VNCGKLDEARELFERVPVRDVVLWTAMINGYVQYNSVDDAVALFQEMQTRRVRADKFTVV 372

Query: 511 TLLTGCAQLGALEQGKWIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDK 570
           +LLTGCAQLGALEQGKWIHGY++E  I MD VVGTALIEMY+KCG +DKSL+IF  L++K
Sbjct: 373 SLLTGCAQLGALEQGKWIHGYIEECGIKMDAVVGTALIEMYAKCGSIDKSLDIFNRLKEK 432

Query: 571 DTASWTSIICGLAMNGKTSEALRLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFF 630
           DTASWTSIICGLAMNGKTS+AL +FS+M+++G +PDDITF+GVLSACSHGGLVEEGR+FF
Sbjct: 433 DTASWTSIICGLAMNGKTSKALEMFSKMKQLGIQPDDITFVGVLSACSHGGLVEEGRQFF 492

Query: 631 NSMKKVHRIEPKVEHYGCVIDLLGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRI 690
            SM++++ IEPK+EHYGC+IDLL RAGLLDEAEELI+++P    EIVVPLYGALLSACRI
Sbjct: 493 CSMREMYDIEPKLEHYGCLIDLLSRAGLLDEAEELIEKVPDNKNEIVVPLYGALLSACRI 552

Query: 691 HNNVDMGERLAKKLENIESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCS 750
           + NV MGER+A+KLEN ES DSS+H LLANIYAS DRW D KKVRRKMK+LGV+K+PGCS
Sbjct: 553 YGNVTMGERVAEKLENFESSDSSVHMLLANIYASADRWGDVKKVRRKMKDLGVRKVPGCS 612

Query: 751 LIEVDGIVHEFLVGDPSHPEMMEICSMLNRVTGQLLGLKE 791
            IEV+G VHEFLVGD SHPEM EI S+L R+   L G ++
Sbjct: 613 SIEVNGTVHEFLVGDASHPEMTEITSLLCRMIKPLSGSED 652

BLAST of Csa1G418260.1 vs. NCBI nr
Match: gi|719988634|ref|XP_010252400.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g31430 [Nelumbo nucifera])

HSP 1 Score: 914.4 bits (2362), Expect = 1.4e-262
Identity = 430/628 (68.47%), Positives = 536/628 (85.35%), Query Frame = 1

Query: 167 RISKLTKKSCIECLRNCKSMDQLKQIQSQIFRIGLEGDRDTINKLMAFCADSSLGNLRYA 226
           +  +LTKK C+  L+NCKSM +LKQIQSQIFR+GL  +RD +NKLM FC D + GNLRYA
Sbjct: 25  KTGRLTKKECLLYLQNCKSMKELKQIQSQIFRVGLHQNRDALNKLMVFCTDPNSGNLRYA 84

Query: 227 EKIFNYVQDPSLFVYNVMVKMYAKRGILRKVLLLFQQLREDGLWPDGFTYPFVLKAIGCL 286
           E+IF+Y+Q+P LF++N+M+K +AK+G  RKVLLLF +LRED L PD FTYPFVLKAIGCL
Sbjct: 85  ERIFSYIQEPCLFIFNLMIKTFAKKGNFRKVLLLFNRLREDDLSPDNFTYPFVLKAIGCL 144

Query: 287 RDVRQGEKVRGFIVKTGMDLDNYVYNSLIDMYYELSNVENAKKLFDEMTTRDSVSWNVMI 346
           R V +G  + GFIVKTG + D+YV NSL+DMY E+ ++E  ++LF+EM+ RD++SWNV+I
Sbjct: 145 RAVSEGRNIHGFIVKTGFEFDSYVRNSLMDMYAEMGDMETLRRLFEEMSQRDAISWNVLI 204

Query: 347 SGYVRCRRFEDAINTFREMQQEGNEKPDEATVVSTLSACTALKNLELGDEIHNYVRKELG 406
           SGYV+  RF+DA++ F++M+Q+   +PDEATVVSTLSAC AL N+ELG EIH Y+ +EL 
Sbjct: 205 SGYVKSGRFDDALSVFQQMKQQSFVRPDEATVVSTLSACVALGNVELGKEIHLYIDRELE 264

Query: 407 FTTRIDNALLDMYAKCGCLNIARNIFDEMSMKNVICWTSMISGYINCGDLREARDLFDKS 466
           FTT I NALLDMY+KCG L++AR IFDEM  KNVI WTSM+SGY+NCG L EAR+LFD++
Sbjct: 265 FTTVIRNALLDMYSKCGYLSLARQIFDEMPDKNVISWTSMVSGYVNCGQLDEARELFDRT 324

Query: 467 PVRDVVLWTAMINGYVQFHHFDDAVALFREMQIQRVKPDKFTVVTLLTGCAQLGALEQGK 526
           PVRDV+LWTAMINGYVQ++ FD A+ LFREMQ++RVKPDKFT+V LLTGCAQLGALEQGK
Sbjct: 325 PVRDVILWTAMINGYVQYNQFDKALTLFREMQMKRVKPDKFTLVALLTGCAQLGALEQGK 384

Query: 527 WIHGYLDENRITMDVVVGTALIEMYSKCGCVDKSLEIFYELEDKDTASWTSIICGLAMNG 586
           WIHGY+DEN +T+D +VGTALI+MY+KCGC++KS+EIF  +E+KD ASWT+IICGLAMNG
Sbjct: 385 WIHGYIDENMVTIDAIVGTALIDMYAKCGCIEKSIEIFKRIEEKDRASWTAIICGLAMNG 444

Query: 587 KTSEALRLFSEMERVGAKPDDITFIGVLSACSHGGLVEEGRRFFNSMKKVHRIEPKVEHY 646
           +T++AL LFSEM+ VG KPDDITFIGVLSACSHGGLVEEGRR F+SM+K+++IEPK+EHY
Sbjct: 445 QTTKALELFSEMKLVGVKPDDITFIGVLSACSHGGLVEEGRRHFDSMRKLYQIEPKLEHY 504

Query: 647 GCVIDLLGRAGLLDEAEELIQEIPIENCEIVVPLYGALLSACRIHNNVDMGERLAKKLEN 706
           GC IDLLGRAGLL+EAEE I++IP +N  IVVPL+GALL ACRIH NV+MGER+A+ L+ 
Sbjct: 505 GCFIDLLGRAGLLNEAEEFIEKIPSDNIGIVVPLWGALLGACRIHGNVEMGERVARHLDG 564

Query: 707 IESCDSSIHTLLANIYASVDRWEDAKKVRRKMKELGVKKMPGCSLIEVDGIVHEFLVGDP 766
           IES +S +HTLLANIYA+ DRWED  KVR+KMKELG+KK+PGCSLIEV+GIVHEFLVGD 
Sbjct: 565 IESNNSGVHTLLANIYAAADRWEDVTKVRKKMKELGIKKVPGCSLIEVNGIVHEFLVGDT 624

Query: 767 SHPEMMEICSMLNRVTGQLLGLKESQVE 795
           SH EM EICS L+ +   L GL+E+ ++
Sbjct: 625 SHEEMKEICSTLHHMAKLLQGLEENMID 652

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PPR65_ARATH	1.3e-193	58.58	Pentatricopeptide repeat-containing protein At1g31430 OS=Arabidopsis thaliana GN...
PP169_ARATH	7.4e-144	42.50	Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidop...
PP235_ARATH	3.2e-139	41.38	Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis th...
PP175_ARATH	6.3e-127	37.25	Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP249_ARATH	6.8e-121	38.26	Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN...

Match Name	E-value	Identity	Description
F6HH61_VITVI	4.8e-267	68.28	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_11s0016g04660 PE=4 SV=...
V4UPA8_9ROSI	1.6e-262	68.09	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10007671mg PE=4 SV=1
B9STP1_RICCO	1.9e-255	67.45	Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
A0A067GKX6_CITSI	1.0e-253	70.28	Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g007077mg PE=4 SV=1
A0A118JW06_CYNCS	2.3e-248	65.56	Pentatricopeptide repeat-containing protein OS=Cynara cardunculus var. scolymus ...

Match Name	E-value	Identity	Description
AT1G31430.1	7.5e-195	58.58	Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G22410.1	4.2e-145	42.50	SLOW GROWTH 1
AT3G15930.1	1.8e-140	41.38	Pentatricopeptide repeat (PPR) superfamily protein
AT2G29760.1	3.6e-128	37.25	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22690.1	3.8e-122	38.26	Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatrico...

Match Name	E-value	Identity	Description
gi|700210313|gb|KGN65409.1|	0.0e+00	100.00	hypothetical protein Csa_1G418260 [Cucumis sativus]
gi|659130070|ref|XP_008464984.1|	0.0e+00	92.42	PREDICTED: pentatricopeptide repeat-containing protein At1g31430 [Cucumis melo]
gi|359484390|ref|XP_002281719.2|	6.9e-267	68.28	PREDICTED: pentatricopeptide repeat-containing protein At1g31430 [Vitis vinifera...
gi|1009160176|ref|XP_015898211.1|	5.0e-265	70.63	PREDICTED: pentatricopeptide repeat-containing protein At1g31430 [Ziziphus jujub...
gi|719988634|ref|XP_010252400.1|	1.4e-262	68.47	PREDICTED: pentatricopeptide repeat-containing protein At1g31430 [Nelumbo nucife...

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term	Definition
IPR002885	Pentatricopeptide_repeat
IPR011990	TPR-like_helical_dom_sf
IPR015943	WD40/YVTN_repeat-like_dom_sf
IPR017986	WD40_repeat_dom
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature Name	Unique Name	Type
Csa1G418260	Csa1G418260	gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name	Unique Name	Type
Csa1G418260.1	Csa1G418260.1-protein	polypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
Csa1G418260.1.utr3p1	Csa1G418260.1.utr3p1	three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
Csa1G418260.1.cds5	Csa1G418260.1.cds5	CDS
Csa1G418260.1.cds4	Csa1G418260.1.cds4	CDS
Csa1G418260.1.cds3	Csa1G418260.1.cds3	CDS
Csa1G418260.1.cds2	Csa1G418260.1.cds2	CDS
Csa1G418260.1.cds1	Csa1G418260.1.cds1	CDS


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 413..439, score: 0.0091; coord: 310..337, score: 9.7E-5; coord: 645..670, score: 0.0041; coord: 240..269, score: 0.003; coord: 716..743, score: 0.097; coord: 442..465, score: 1.
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 338..385, score: 9.2E-10; coord: 570..618, score: 9.1E-11; coord: 470..517, score: 4.6
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756	coord: 472..505, score: 7.2E-7; coord: 309..338, score: 2.6E-4; coord: 340..374, score: 1.4E-8; coord: 441..465, score: 2.4E-4; coord: 240..269, score: 0.001; coord: 574..606, score: 2.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR	coord: 505..539, score: 6.741; coord: 237..271, score: 10.468; coord: 606..636, score: 7.706; coord: 338..372, score: 12.485; coord: 540..570, score: 7.925; coord: 408..438, score: 7.147; coord: 439..469, score: 8.802; coord: 272..306, score: 5.448; coord: 642..676, score: 8.364; coord: 711..745, score: 8.977; coord: 571..605, score: 12.112; coord: 470..504, score: 11.575; coord: 307..337, score:
IPR011990	Tetratricopeptide-like helical domain	GENE3D	G3DSA:1.25.40.10		coord: 447..466, score: 2.0E-7; coord: 565..615, score: 2.0E-7; coord: 310..382, score: 2.0E-7; coord: 709..741, score: 2.
IPR015943	WD40/YVTN repeat-like-containing domain	GENE3D	G3DSA:2.130.10.10		coord: 12..84, score: 4.
IPR017986	WD40-repeat-containing domain	unknown	SSF50978	WD40 repeat-like	coord: 6..85, score: 6.41
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 157..752, score:
None	No IPR available	PANTHER	PTHR24015:SF311	SUBFAMILY NOT NAMED	coord: 157..752, score:
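
The alignment column lists domain hits as `coord: start..end` fragments in no particular order. A minimal sketch for pulling those intervals out of the text and sorting them by start position is given here. The name `parse_coords` and the sample string are illustrative assumptions, not part of InterProScan or of this database.

```python
import re

# Matches fragments such as "coord: 505..539" anywhere in a line.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coords(text):
    """Return (start, end) pairs from 'coord: A..B' fragments, sorted by start."""
    return sorted((int(a), int(b)) for a, b in COORD_RE.findall(text))

# Sample taken from the PS51375 row, with fragments run together as on the page.
sample = "coord: 505..539 score: 6.741coord: 237..271 score: 10.468coord: 606..636"
print(parse_coords(sample))  # [(237, 271), (505, 539), (606, 636)]
```

Sorting the intervals this way makes it easier to see that the PS51375 PPR hits tile most of the region from residue 237 to 745.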