Csa5G189930 (gene) Cucumber (Chinese Long) v2

Name: Csa5G189930
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Pentatricopeptide repeat-containing protein; contains IPR002885 (Pentatricopeptide repeat)
Location: Chr5 : 8551263 .. 8552693 (+)
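
As a quick consistency check, the location can be converted into a feature length. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The function name parse_location is hypothetical.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr5 : 8551263 .. 8552693 (+)'.

    Returns (chromosome, start, end, strand, length).
    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, strand, end - start + 1

chrom, start, end, strand, length = parse_location("Chr5 : 8551263 .. 8552693 (+)")
print(chrom, strand, length)  # Chr5 + 1431
```

A span of 1431 bp corresponds to 477 codons, which is consistent with a 476-residue protein plus a stop codon if the gene has no introns.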
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTGGCATGGCGGTTGAGCATTCAACAACTTGGAATGTTACAACGCTTCCCTCGAGCGTTCTTCCGTCTCGCGAGTCCTTTACTTCAAATGGGGCACCGGATGTCTTATTCCACATATGCTTCACATCCCCCACTCTCCGACCTCCATCAAACCATGCCCTCAGTACAGTTTATTTCGTTACCTTTGTGCTGTTATCTCATGGGACTTTTTAGGAACATTGATACTCTTATCAAGTTCCATGGGTTGCTTATAGTACACGGCCTTATCGGTAATCTTCTTTGTGATACCAAGTTGGTTGGTGTTTATGGCGCACTTGGGGATGTGAGGTCTGCTCGAATGGTGTTCGATCAAATGCCCAACCCCGATTTTTATGCGTGGAAGGTGATGATCAGGTGGTATTTTTTGAATGACTTGTTTGTGGATGTTATCCCATTCTATAATCGCATGAGAATGTCTTTTAGGGAATGCGATAACATAATTTTTTCTATTATTTTGAAAGCCTGTAGCGAATTGCGTGAAATTGTTGAAGGCAGAAAGGTCCATTGTCAGATTGTGAAGGTGGGGGGTCCGGATAGCTTTGTAATGACTGGTTTGATAGATATGTATGGTAAATGTGGGCAGGTTGAGTGCTCAAGTGCTGTGTTTGAAGAAATCATGGATAAAAATGTGGTTTCATGGACTTCAATGATTGCGGGATATGTACAAAATAATTGTGCGGAGGAGGGTCTGGTTTTATTCAATCGGATGAGAGATGCATTGGTTGAAAGCAACCCATTTACTTTAGGAAGCATAATAAATGCGTGTACAAAATTAAGAGCTTTACATCAAGGAAAATGGGTGCATGGCTATGCCATTAAGAACATTGCTGAACTTAGCTCTTTCTTGGCGACTACTTTTTTGGACATGTATGTTAAATGTGGGCAAACTAGAGATGCTCGCATGATATATGACGAACTACCTACTATTGATCTTGTTTCATGGACTGCAATGATAGTTGGATATACCCAAGCTAGGCAACCCAACGATGGATTGAGGCTTTTTGCTGATGAAATAAGGTCAGATCTCTTACCTAATTCTGTCACTGCTGCAAGTGTTCTTTCAGCATGTTCGGTTTCTGGTAATTTGAATTTAGGAATGTCAGTTCATGGACTTGGGATTAAAATTGGGCTGGAAGAATGTGTGGTGAAGAATGCTCTTATTGACATGTATGCTAAATGCCATAAGATTGGTGATGCTTATGCTATATTTCATGGGGTTTTGGAAAAAGATGTGATTACTTGGAACTCTATGATATCTGGGTATGCTCAAAATGGATCTGCTTATGATGCCCTCCGCCTCTTTAATCAAATGAGATTATACTTCCTTGCACCTGATGTAATAACGCTGGTGAGCACCCTTTCAGCATGTGCCACCCTTGGTGCTGTATAG

mRNA sequence

ATGTTGGCATGGCGGTTGAGCATTCAACAACTTGGAATGTTACAACGCTTCCCTCGAGCGTTCTTCCGTCTCGCGAGTCCTTTACTTCAAATGGGGCACCGGATGTCTTATTCCACATATGCTTCACATCCCCCACTCTCCGACCTCCATCAAACCATGCCCTCAGTACAGTTTATTTCGTTACCTTTGTGCTGTTATCTCATGGGACTTTTTAGGAACATTGATACTCTTATCAAGTTCCATGGGTTGCTTATAGTACACGGCCTTATCGGTAATCTTCTTTGTGATACCAAGTTGGTTGGTGTTTATGGCGCACTTGGGGATGTGAGGTCTGCTCGAATGGTGTTCGATCAAATGCCCAACCCCGATTTTTATGCGTGGAAGGTGATGATCAGGTGGTATTTTTTGAATGACTTGTTTGTGGATGTTATCCCATTCTATAATCGCATGAGAATGTCTTTTAGGGAATGCGATAACATAATTTTTTCTATTATTTTGAAAGCCTGTAGCGAATTGCGTGAAATTGTTGAAGGCAGAAAGGTCCATTGTCAGATTGTGAAGGTGGGGGGTCCGGATAGCTTTGTAATGACTGGTTTGATAGATATGTATGGTAAATGTGGGCAGGTTGAGTGCTCAAGTGCTGTGTTTGAAGAAATCATGGATAAAAATGTGGTTTCATGGACTTCAATGATTGCGGGATATGTACAAAATAATTGTGCGGAGGAGGGTCTGGTTTTATTCAATCGGATGAGAGATGCATTGGTTGAAAGCAACCCATTTACTTTAGGAAGCATAATAAATGCGTGTACAAAATTAAGAGCTTTACATCAAGGAAAATGGGTGCATGGCTATGCCATTAAGAACATTGCTGAACTTAGCTCTTTCTTGGCGACTACTTTTTTGGACATGTATGTTAAATGTGGGCAAACTAGAGATGCTCGCATGATATATGACGAACTACCTACTATTGATCTTGTTTCATGGACTGCAATGATAGTTGGATATACCCAAGCTAGGCAACCCAACGATGGATTGAGGCTTTTTGCTGATGAAATAAGGTCAGATCTCTTACCTAATTCTGTCACTGCTGCAAGTGTTCTTTCAGCATGTTCGGTTTCTGGTAATTTGAATTTAGGAATGTCAGTTCATGGACTTGGGATTAAAATTGGGCTGGAAGAATGTGTGGTGAAGAATGCTCTTATTGACATGTATGCTAAATGCCATAAGATTGGTGATGCTTATGCTATATTTCATGGGGTTTTGGAAAAAGATGTGATTACTTGGAACTCTATGATATCTGGGTATGCTCAAAATGGATCTGCTTATGATGCCCTCCGCCTCTTTAATCAAATGAGATTATACTTCCTTGCACCTGATGTAATAACGCTGGTGAGCACCCTTTCAGCATGTGCCACCCTTGGTGCTGTATAG

Coding sequence (CDS)

ATGTTGGCATGGCGGTTGAGCATTCAACAACTTGGAATGTTACAACGCTTCCCTCGAGCGTTCTTCCGTCTCGCGAGTCCTTTACTTCAAATGGGGCACCGGATGTCTTATTCCACATATGCTTCACATCCCCCACTCTCCGACCTCCATCAAACCATGCCCTCAGTACAGTTTATTTCGTTACCTTTGTGCTGTTATCTCATGGGACTTTTTAGGAACATTGATACTCTTATCAAGTTCCATGGGTTGCTTATAGTACACGGCCTTATCGGTAATCTTCTTTGTGATACCAAGTTGGTTGGTGTTTATGGCGCACTTGGGGATGTGAGGTCTGCTCGAATGGTGTTCGATCAAATGCCCAACCCCGATTTTTATGCGTGGAAGGTGATGATCAGGTGGTATTTTTTGAATGACTTGTTTGTGGATGTTATCCCATTCTATAATCGCATGAGAATGTCTTTTAGGGAATGCGATAACATAATTTTTTCTATTATTTTGAAAGCCTGTAGCGAATTGCGTGAAATTGTTGAAGGCAGAAAGGTCCATTGTCAGATTGTGAAGGTGGGGGGTCCGGATAGCTTTGTAATGACTGGTTTGATAGATATGTATGGTAAATGTGGGCAGGTTGAGTGCTCAAGTGCTGTGTTTGAAGAAATCATGGATAAAAATGTGGTTTCATGGACTTCAATGATTGCGGGATATGTACAAAATAATTGTGCGGAGGAGGGTCTGGTTTTATTCAATCGGATGAGAGATGCATTGGTTGAAAGCAACCCATTTACTTTAGGAAGCATAATAAATGCGTGTACAAAATTAAGAGCTTTACATCAAGGAAAATGGGTGCATGGCTATGCCATTAAGAACATTGCTGAACTTAGCTCTTTCTTGGCGACTACTTTTTTGGACATGTATGTTAAATGTGGGCAAACTAGAGATGCTCGCATGATATATGACGAACTACCTACTATTGATCTTGTTTCATGGACTGCAATGATAGTTGGATATACCCAAGCTAGGCAACCCAACGATGGATTGAGGCTTTTTGCTGATGAAATAAGGTCAGATCTCTTACCTAATTCTGTCACTGCTGCAAGTGTTCTTTCAGCATGTTCGGTTTCTGGTAATTTGAATTTAGGAATGTCAGTTCATGGACTTGGGATTAAAATTGGGCTGGAAGAATGTGTGGTGAAGAATGCTCTTATTGACATGTATGCTAAATGCCATAAGATTGGTGATGCTTATGCTATATTTCATGGGGTTTTGGAAAAAGATGTGATTACTTGGAACTCTATGATATCTGGGTATGCTCAAAATGGATCTGCTTATGATGCCCTCCGCCTCTTTAATCAAATGAGATTATACTTCCTTGCACCTGATGTAATAACGCTGGTGAGCACCCTTTCAGCATGTGCCACCCTTGGTGCTGTATAG

Protein sequence

MLAWRLSIQQLGMLQRFPRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV*
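
The protein sequence is the conceptual translation of the CDS. A minimal sketch of that step, assuming the standard genetic code (NCBI translation table 1; the page does not say which table was used). The names CODON_TABLE and translate are illustrative, and only the first and last codons of the CDS are translated here to keep the example short.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGTTGGCATGGCGGTTG"))  # first six codons -> MLAWRL
print(translate("GGTGCTGTATAG"))        # last four codons -> GAV*
```

Passing the full CDS string to translate should reproduce the whole protein, including the trailing "*" for the TAG stop codon.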
BLAST of Csa5G189930 vs. Swiss-Prot
Match: PP146_ARATH (Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E47 PE=3 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 1.2e-117
Identity = 212/422 (50.24%), Positives = 289/422 (68.48%), Query Frame = 1

Query: 55  SVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARM 114
           S+ + +   C  L+    NID+L + HG+L  +GL+G++   TKLV +YG  G  + AR+
Sbjct: 38  SLHYAASSPCFLLLSKCTNIDSLRQSHGVLTGNGLMGDISIATKLVSLYGFFGYTKDARL 97

Query: 115 VFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELRE 174
           VFDQ+P PDFY WKVM+R Y LN   V+V+  Y+ +       D+I+FS  LKAC+EL++
Sbjct: 98  VFDQIPEPDFYLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQD 157

Query: 175 IVEGRKVHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGY 234
           +  G+K+HCQ+VKV   D+ V+TGL+DMY KCG+++ +  VF +I  +NVV WTSMIAGY
Sbjct: 158 LDNGKKIHCQLVKVPSFDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAGY 217

Query: 235 VQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSS 294
           V+N+  EEGLVLFNRMR+  V  N +T G++I ACTKL ALHQGKW HG  +K+  ELSS
Sbjct: 218 VKNDLCEEGLVLFNRMRENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSGIELSS 277

Query: 295 FLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRS 354
            L T+ LDMYVKCG   +AR +++E   +DLV WTAMIVGYT     N+ L LF      
Sbjct: 278 CLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMKGV 337

Query: 355 DLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAY 414
           ++ PN VT ASVLS C +  NL LG SVHGL IK+G+ +  V NAL+ MYAKC++  DA 
Sbjct: 338 EIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNVANALVHMYAKCYQNRDAK 397

Query: 415 AIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLG 474
            +F    EKD++ WNS+ISG++QNGS ++AL LF++M    + P+ +T+ S  SACA+LG
Sbjct: 398 YVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACASLG 457

Query: 475 AV 477
           ++
Sbjct: 458 SL 459
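
The percentages in each HSP header are the match counts divided by the alignment length. A minimal sketch that recomputes them for HSP 1; the function name hsp_percentages is hypothetical, and it assumes the header keeps the "a/b" count layout used on this page.

```python
import re

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the a/b counts."""
    pairs = re.findall(r"(\d+)/(\d+)", line)
    if len(pairs) < 2:
        raise ValueError("expected identity and positives counts")
    return tuple(round(100 * int(a) / int(b), 2) for a, b in pairs[:2])

line = "Identity = 212/422 (50.24%), Positives = 289/422 (68.48%), Query Frame = 1"
print(hsp_percentages(line))  # (50.24, 68.48)
```

The same check can be applied to any other HSP header in the report.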


HSP 2 Score: 231.5 bits (589), Expect = 1.9e-59
Identity = 136/409 (33.25%), Positives = 214/409 (52.32%), Query Frame = 1

Query: 72  RNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMI 131
           +++D   K H  L+      N++  T L+ +Y   G+++SA  VF+ +   +   W  MI
Sbjct: 156 QDLDNGKKIHCQLVKVPSFDNVVL-TGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMI 215

Query: 132 RWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGP 191
             Y  NDL  + +  +NRMR +    +   +  ++ AC++L  + +G+  H  +VK G  
Sbjct: 216 AGYVKNDLCEEGLVLFNRMRENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSGIE 275

Query: 192 -DSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRM 251
             S ++T L+DMY KCG +  +  VF E    ++V WT+MI GY  N    E L LF +M
Sbjct: 276 LSSCLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKM 335

Query: 252 RDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQT 311
           +   ++ N  T+ S+++ C  +  L  G+ VHG +IK +    + +A   + MY KC Q 
Sbjct: 336 KGVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIK-VGIWDTNVANALVHMYAKCYQN 395

Query: 312 RDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSAC 371
           RDA+ +++     D+V+W ++I G++Q    ++ L LF       + PN VT AS+ SAC
Sbjct: 396 RDAKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSAC 455

Query: 372 SVSGNLNLGMSVHGLGIKIGL---EECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVIT 431
           +  G+L +G S+H   +K+G        V  AL+D YAKC     A  IF  + EK+ IT
Sbjct: 456 ASLGSLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDTIEEKNTIT 515

Query: 432 WNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           W++MI GY + G    +L LF +M      P+  T  S LSAC   G V
Sbjct: 516 WSAMIGGYGKQGDTIGSLELFEEMLKKQQKPNESTFTSILSACGHTGMV 562
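
Each HSP header reports a bit score followed by the raw score in parentheses. The two are related by the Karlin-Altschul normalization S' = (lambda * S - ln K) / ln 2. The sketch below assumes lambda = 0.267 and K = 0.041, the values commonly tabulated for gapped BLOSUM62 with gap costs 11/1; the page does not state which scoring system was used, so these constants are assumptions, and the function name bit_score is hypothetical.

```python
import math

# Assumed Karlin-Altschul parameters (gapped BLOSUM62, gap open 11, extend 1).
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)

print(round(bit_score(1091), 1))  # 424.9
print(round(bit_score(589), 1))   # 231.5
```

Under these assumed parameters the computed values agree with the bit scores reported for HSP 1 and HSP 2 of the first Swiss-Prot match.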


HSP 3 Score: 179.5 bits (454), Expect = 8.7e-44
Identity = 110/360 (30.56%), Positives = 178/360 (49.44%), Query Frame = 1

Query: 80  FHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDL 139
           FHG L+  G+  +    T L+ +Y   GD+ +AR VF++  + D   W  MI  Y  N  
Sbjct: 264 FHGCLVKSGIELSSCLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGS 323

Query: 140 FVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGPDSFVMTGL 199
             + +  + +M+    + + +  + +L  C  +  +  GR VH   +KVG  D+ V   L
Sbjct: 324 VNEALSLFQKMKGVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNVANAL 383

Query: 200 IDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNP 259
           + MY KC Q   +  VFE   +K++V+W S+I+G+ QN    E L LF+RM    V  N 
Sbjct: 384 VHMYAKCYQNRDAKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNG 443

Query: 260 FTLGSIINACTKLRALHQGKWVHGYAIK--NIAELSSFLATTFLDMYVKCGQTRDARMIY 319
            T+ S+ +AC  L +L  G  +H Y++K   +A  S  + T  LD Y KCG  + AR+I+
Sbjct: 444 VTVASLFSACASLGSLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIF 503

Query: 320 DELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLN 379
           D +   + ++W+AMI GY +       L LF + ++    PN  T  S+LSAC  +G +N
Sbjct: 504 DTIEEKNTITWSAMIGGYGKQGDTIGSLELFEEMLKKQQKPNESTFTSILSACGHTGMVN 563

Query: 380 LGMSVHGLGIKIGLEECVVKN--ALIDMYAKCHKIGDAYAIFHGV-LEKDVITWNSMISG 435
            G        K        K+   ++DM A+  ++  A  I   + ++ DV  + + + G
Sbjct: 564 EGKKYFSSMYKDYNFTPSTKHYTCMVDMLARAGELEQALDIIEKMPIQPDVRCFGAFLHG 623

BLAST of Csa5G189930 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 249.6 bits (636), Expect = 6.9e-65
Identity = 136/404 (33.66%), Positives = 225/404 (55.69%), Query Frame = 1

Query: 77  LIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFL 136
           L + H  L+V GL  +    TKL+    + GD+  AR VFD +P P  + W  +IR Y  
Sbjct: 37  LKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSR 96

Query: 137 NDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG-GPDSFV 196
           N+ F D +  Y+ M+++    D+  F  +LKACS L  +  GR VH Q+ ++G   D FV
Sbjct: 97  NNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFV 156

Query: 197 MTGLIDMYGKCGQVECSSAVFE--EIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDA 256
             GLI +Y KC ++  +  VFE   + ++ +VSWT++++ Y QN    E L +F++MR  
Sbjct: 157 QNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKM 216

Query: 257 LVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDA 316
            V+ +   L S++NA T L+ L QG+ +H   +K   E+   L  +   MY KCGQ   A
Sbjct: 217 DVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATA 276

Query: 317 RMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVS 376
           ++++D++ + +L+ W AMI GY +     + + +F + I  D+ P++++  S +SAC+  
Sbjct: 277 KILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQV 336

Query: 377 GNLNLGMSVHG-LGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMI 436
           G+L    S++  +G     ++  + +ALIDM+AKC  +  A  +F   L++DV+ W++MI
Sbjct: 337 GSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMI 396

Query: 437 SGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
            GY  +G A +A+ L+  M    + P+ +T +  L AC   G V
Sbjct: 397 VGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMV 440


HSP 2 Score: 159.5 bits (402), Expect = 9.4e-38
Identity = 97/365 (26.58%), Positives = 183/365 (50.14%), Query Frame = 1

Query: 81  HGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPD--FYAWKVMIRWYFLND 140
           H  +   G   ++     L+ +Y     + SAR VF+ +P P+    +W  ++  Y  N 
Sbjct: 142 HAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNG 201

Query: 141 LFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG---GPDSFV 200
             ++ +  +++MR    + D +    +L A + L+++ +GR +H  +VK+G    PD  +
Sbjct: 202 EPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPD--L 261

Query: 201 MTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALV 260
           +  L  MY KCGQV  +  +F+++   N++ W +MI+GY +N  A E + +F+ M +  V
Sbjct: 262 LISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDV 321

Query: 261 ESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARM 320
             +  ++ S I+AC ++ +L Q + ++ Y  ++      F+++  +DM+ KCG    AR+
Sbjct: 322 RPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARL 381

Query: 321 IYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGN 380
           ++D     D+V W+AMIVGY    +  + + L+    R  + PN VT   +L AC+ SG 
Sbjct: 382 VFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGM 441

Query: 381 LNLG------MSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGV-LEKDVITW 434
           +  G      M+ H +  +     CV     ID+  +   +  AY +   + ++  V  W
Sbjct: 442 VREGWWFFNRMADHKINPQQQHYACV-----IDLLGRAGHLDQAYEVIKCMPVQPGVTVW 499

BLAST of Csa5G189930 vs. Swiss-Prot
Match: PP319_ARATH (Pentatricopeptide repeat-containing protein At4g18520 OS=Arabidopsis thaliana GN=PCMP-A2 PE=2 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 1.0e-60
Identity = 137/395 (34.68%), Positives = 214/395 (54.18%), Query Frame = 1

Query: 81  HGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLF 140
           HG ++  G +GNL+ ++ LV  Y   G++ SA   FD M   D  +W  +I         
Sbjct: 207 HGNMVKVG-VGNLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHG 266

Query: 141 VDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVK-VGGPDSFVMTGL 200
           +  I  +  M   +   +      ILKACSE + +  GR+VH  +VK +   D FV T L
Sbjct: 267 IKAIGMFIGMLNHWFLPNEFTVCSILKACSEEKALRFGRQVHSLVVKRMIKTDVFVGTSL 326

Query: 201 IDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNP 260
           +DMY KCG++     VF+ + ++N V+WTS+IA + +    EE + LF  M+   + +N 
Sbjct: 327 MDMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANN 386

Query: 261 FTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDE 320
            T+ SI+ AC  + AL  GK +H   IKN  E + ++ +T + +Y KCG++RDA  +  +
Sbjct: 387 LTVVSILRACGSVGALLLGKELHAQIIKNSIEKNVYIGSTLVWLYCKCGESRDAFNVLQQ 446

Query: 321 LPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLG 380
           LP+ D+VSWTAMI G +     ++ L    + I+  + PN  T +S L AC+ S +L +G
Sbjct: 447 LPSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFTYSSALKACANSESLLIG 506

Query: 381 MSVHGLGIK-IGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQN 440
            S+H +  K   L    V +ALI MYAKC  + +A+ +F  + EK++++W +MI GYA+N
Sbjct: 507 RSIHSIAKKNHALSNVFVGSALIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYARN 566

Query: 441 GSAYDALRLFNQMRLYFLAPDVITLVSTLSACATL 474
           G   +AL+L  +M       D     + LS C  +
Sbjct: 567 GFCREALKLMYRMEAEGFEVDDYIFATILSTCGDI 600


HSP 2 Score: 201.1 bits (510), Expect = 2.8e-50
Identity = 115/368 (31.25%), Positives = 192/368 (52.17%), Query Frame = 1

Query: 106 LGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIP-FYNRMRMSFRECDNIIFSI 165
           LGD+  AR VFD MP  +   W  MI  Y    L  +    F + ++   R  +  +F  
Sbjct: 130 LGDLVYARKVFDSMPEKNTVTWTAMIDGYLKYGLEDEAFALFEDYVKHGIRFTNERMFVC 189

Query: 166 ILKACSELREIVEGRKVHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNV 225
           +L  CS   E   GR+VH  +VKVG  +  V + L+  Y +CG++  +   F+ + +K+V
Sbjct: 190 LLNLCSRRAEFELGRQVHGNMVKVGVGNLIVESSLVYFYAQCGELTSALRAFDMMEEKDV 249

Query: 226 VSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGY 285
           +SWT++I+   +     + + +F  M +     N FT+ SI+ AC++ +AL  G+ VH  
Sbjct: 250 ISWTAVISACSRKGHGIKAIGMFIGMLNHWFLPNEFTVCSILKACSEEKALRFGRQVHSL 309

Query: 286 AIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDG 345
            +K + +   F+ T+ +DMY KCG+  D R ++D +   + V+WT++I  + +     + 
Sbjct: 310 VVKRMIKTDVFVGTSLMDMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEA 369

Query: 346 LRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECV-VKNALIDM 405
           + LF    R  L+ N++T  S+L AC   G L LG  +H   IK  +E+ V + + L+ +
Sbjct: 370 ISLFRIMKRRHLIANNLTVVSILRACGSVGALLLGKELHAQIIKNSIEKNVYIGSTLVWL 429

Query: 406 YAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITL 465
           Y KC +  DA+ +   +  +DV++W +MISG +  G   +AL    +M    + P+  T 
Sbjct: 430 YCKCGESRDAFNVLQQLPSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFTY 489

Query: 466 VSTLSACA 472
            S L ACA
Sbjct: 490 SSALKACA 497


HSP 3 Score: 105.9 bits (263), Expect = 1.2e-21
Identity = 57/194 (29.38%), Positives = 98/194 (50.52%), Query Frame = 1

Query: 279 KWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQA 338
           K +H  A+K   +   +     +   V+ G    AR ++D +P  + V+WTAMI GY + 
Sbjct: 102 KRIHAMALKCFDDQVIYFGNNLISSCVRLGDLVYARKVFDSMPEKNTVTWTAMIDGYLKY 161

Query: 339 RQPNDGLRLFADEIRSDL-LPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVK 398
              ++   LF D ++  +   N      +L+ CS      LG  VHG  +K+G+   +V+
Sbjct: 162 GLEDEAFALFEDYVKHGIRFTNERMFVCLLNLCSRRAEFELGRQVHGNMVKVGVGNLIVE 221

Query: 399 NALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLA 458
           ++L+  YA+C ++  A   F  + EKDVI+W ++IS  ++ G    A+ +F  M  ++  
Sbjct: 222 SSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHGIKAIGMFIGMLNHWFL 281

Query: 459 PDVITLVSTLSACA 472
           P+  T+ S L AC+
Sbjct: 282 PNEFTVCSILKACS 295

BLAST of Csa5G189930 vs. Swiss-Prot
Match: PP181_ARATH (Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana GN=PCMP-E19 PE=3 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 1.0e-60
Identity = 140/426 (32.86%), Positives = 216/426 (50.70%), Query Frame = 1

Query: 61  LPLCCYLMGLFRNIDTLI------KFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARM 120
           LP    L G+F+   +L       + H L++     G++  DT LVG+Y   G V     
Sbjct: 115 LPNAYTLAGIFKAESSLQSSTVGRQAHALVVKMSSFGDIYVDTSLVGMYCKAGLVEDGLK 174

Query: 121 VFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDN--IIFSIILKACSEL 180
           VF  MP  + Y W  M+  Y       + I  +N       E  +   +F+ +L + +  
Sbjct: 175 VFAYMPERNTYTWSTMVSGYATRGRVEEAIKVFNLFLREKEEGSDSDYVFTAVLSSLAAT 234

Query: 181 REIVEGRKVHCQIVKVGGPDSFVMTG-LIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMI 240
             +  GR++HC  +K G      ++  L+ MY KC  +  +  +F+   D+N ++W++M+
Sbjct: 235 IYVGLGRQIHCITIKNGLLGFVALSNALVTMYSKCESLNEACKMFDSSGDRNSITWSAMV 294

Query: 241 AGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAE 300
            GY QN  + E + LF+RM  A ++ + +T+  ++NAC+ +  L +GK +H + +K   E
Sbjct: 295 TGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVGVLNACSDICYLEEGKQLHSFLLKLGFE 354

Query: 301 LSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADE 360
              F  T  +DMY K G   DAR  +D L   D+  WT++I GY Q     + L L+   
Sbjct: 355 RHLFATTALVDMYAKAGCLADARKGFDCLQERDVALWTSLISGYVQNSDNEEALILYRRM 414

Query: 361 IRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLE-ECVVKNALIDMYAKCHKI 420
             + ++PN  T ASVL ACS    L LG  VHG  IK G   E  + +AL  MY+KC  +
Sbjct: 415 KTAGIIPNDPTMASVLKACSSLATLELGKQVHGHTIKHGFGLEVPIGSALSTMYSKCGSL 474

Query: 421 GDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSAC 477
            D   +F     KDV++WN+MISG + NG   +AL LF +M    + PD +T V+ +SAC
Sbjct: 475 EDGNLVFRRTPNKDVVSWNAMISGLSHNGQGDEALELFEEMLAEGMEPDDVTFVNIISAC 534


HSP 2 Score: 180.6 bits (457), Expect = 3.9e-44
Identity = 105/365 (28.77%), Positives = 189/365 (51.78%), Query Frame = 1

Query: 79  KFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLND 138
           + H + I +GL+G +     LV +Y     +  A  +FD   + +   W  M+  Y  N 
Sbjct: 242 QIHCITIKNGLLGFVALSNALVTMYSKCESLNEACKMFDSSGDRNSITWSAMVTGYSQNG 301

Query: 139 LFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGPDS-FVMT 198
             ++ +  ++RM  +  +        +L ACS++  + EG+++H  ++K+G     F  T
Sbjct: 302 ESLEAVKLFSRMFSAGIKPSEYTIVGVLNACSDICYLEEGKQLHSFLLKLGFERHLFATT 361

Query: 199 GLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVES 258
            L+DMY K G +  +   F+ + +++V  WTS+I+GYVQN+  EE L+L+ RM+ A +  
Sbjct: 362 ALVDMYAKAGCLADARKGFDCLQERDVALWTSLISGYVQNSDNEEALILYRRMKTAGIIP 421

Query: 259 NPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIY 318
           N  T+ S++ AC+ L  L  GK VHG+ IK+   L   + +    MY KCG   D  +++
Sbjct: 422 NDPTMASVLKACSSLATLELGKQVHGHTIKHGFGLEVPIGSALSTMYSKCGSLEDGNLVF 481

Query: 319 DELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLN 378
              P  D+VSW AMI G +   Q ++ L LF + +   + P+ VT  +++SACS  G + 
Sbjct: 482 RRTPNKDVVSWNAMISGLSHNGQGDEALELFEEMLAEGMEPDDVTFVNIISACSHKGFVE 541

Query: 379 LG-MSVHGLGIKIGLEECVVKNA-LIDMYAKCHKIGDAYAIFHGV-LEKDVITWNSMISG 438
            G    + +  +IGL+  V   A ++D+ ++  ++ +A        ++  +  W  ++S 
Sbjct: 542 RGWFYFNMMSDQIGLDPKVDHYACMVDLLSRAGQLKEAKEFIESANIDHGLCLWRILLSA 601

Query: 439 YAQNG 440
              +G
Sbjct: 602 CKNHG 606


HSP 3 Score: 173.7 bits (439), Expect = 4.8e-42
Identity = 117/412 (28.40%), Positives = 193/412 (46.84%), Query Frame = 1

Query: 72  RNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMI 131
           RN+      HG +I  G    +     LV  Y   G +  A  +F+ +   D  +W  +I
Sbjct: 28  RNLVAGRAVHGQIIRTGASTCIQHANVLVNFYAKCGKLAKAHSIFNAIICKDVVSWNSLI 87

Query: 132 RWYFLN---DLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKV 191
             Y  N        V+  +  MR      +    + I KA S L+    GR+ H  +VK+
Sbjct: 88  TGYSQNGGISSSYTVMQLFREMRAQDILPNAYTLAGIFKAESSLQSSTVGRQAHALVVKM 147

Query: 192 GG-PDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLF 251
               D +V T L+ MY K G VE    VF  + ++N  +W++M++GY      EE + +F
Sbjct: 148 SSFGDIYVDTSLVGMYCKAGLVEDGLKVFAYMPERNTYTWSTMVSGYATRGRVEEAIKVF 207

Query: 252 NRMRDALVESNP--FTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYV 311
           N       E +   +   +++++      +  G+ +H   IKN       L+   + MY 
Sbjct: 208 NLFLREKEEGSDSDYVFTAVLSSLAATIYVGLGRQIHCITIKNGLLGFVALSNALVTMYS 267

Query: 312 KCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAAS 371
           KC    +A  ++D     + ++W+AM+ GY+Q  +  + ++LF+    + + P+  T   
Sbjct: 268 KCESLNEACKMFDSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVG 327

Query: 372 VLSACSVSGNLNLGMSVHGLGIKIGLEECV-VKNALIDMYAKCHKIGDAYAIFHGVLEKD 431
           VL+ACS    L  G  +H   +K+G E  +    AL+DMYAK   + DA   F  + E+D
Sbjct: 328 VLNACSDICYLEEGKQLHSFLLKLGFERHLFATTALVDMYAKAGCLADARKGFDCLQERD 387

Query: 432 VITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           V  W S+ISGY QN    +AL L+ +M+   + P+  T+ S L AC++L  +
Sbjct: 388 VALWTSLISGYVQNSDNEEALILYRRMKTAGIIPNDPTMASVLKACSSLATL 439


HSP 4 Score: 90.1 bits (222), Expect = 7.0e-17
Identity = 47/125 (37.60%), Positives = 75/125 (60.00%), Query Frame = 1

Query: 353 RSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVK-NALIDMYAKCHKIG 412
           +++L P++ T    L+  S   NL  G +VHG  I+ G   C+   N L++ YAKC K+ 
Sbjct: 7   QTELNPHTSTLLKKLTHHSQQRNLVAGRAVHGQIIRTGASTCIQHANVLVNFYAKCGKLA 66

Query: 413 DAYAIFHGVLEKDVITWNSMISGYAQNG---SAYDALRLFNQMRLYFLAPDVITLVSTLS 472
            A++IF+ ++ KDV++WNS+I+GY+QNG   S+Y  ++LF +MR   + P+  TL     
Sbjct: 67  KAHSIFNAIICKDVVSWNSLITGYSQNGGISSSYTVMQLFREMRAQDILPNAYTLAGIFK 126

Query: 473 ACATL 474
           A ++L
Sbjct: 127 AESSL 131

BLAST of Csa5G189930 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 235.0 bits (598), Expect = 1.8e-60
Identity = 140/404 (34.65%), Positives = 212/404 (52.48%), Query Frame = 1

Query: 79  KFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLND 138
           + HG ++  G          LV  Y     V SAR VFD+M   D  +W  +I  Y  N 
Sbjct: 216 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 275

Query: 139 LFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKV--GGPDSFVM 198
           L    +  + +M +S  E D      +   C++ R I  GR VH   VK      D F  
Sbjct: 276 LAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCN 335

Query: 199 TGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVE 258
           T L+DMY KCG ++ + AVF E+ D++VVS+TSMIAGY +   A E + LF  M +  + 
Sbjct: 336 T-LLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGIS 395

Query: 259 SNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMI 318
            + +T+ +++N C + R L +GK VH +  +N      F++   +DMY KCG  ++A ++
Sbjct: 396 PDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELV 455

Query: 319 YDELPTIDLVSWTAMIVGYTQARQPNDGLRLF---ADEIRSDLLPNSVTAASVLSACSVS 378
           + E+   D++SW  +I GY++    N+ L LF    +E R    P+  T A VL AC+  
Sbjct: 456 FSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKR--FSPDERTVACVLPACASL 515

Query: 379 GNLNLGMSVHGLGIKIG-LEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMI 438
              + G  +HG  ++ G   +  V N+L+DMYAKC  +  A+ +F  +  KD+++W  MI
Sbjct: 516 SAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMI 575

Query: 439 SGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           +GY  +G   +A+ LFNQMR   +  D I+ VS L AC+  G V
Sbjct: 576 AGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLV 616


HSP 2 Score: 205.7 bits (522), Expect = 1.1e-51
Identity = 132/412 (32.04%), Positives = 211/412 (51.21%), Query Frame = 1

Query: 72  RNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMI 131
           + +D  I+ +G +I   L       +KL  +Y   GD++ A  VFD++       W +++
Sbjct: 114 KEVDNFIRGNGFVIDSNL------GSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILM 173

Query: 132 RWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG-G 191
                +  F   I  + +M  S  E D+  FS + K+ S LR +  G ++H  I+K G G
Sbjct: 174 NELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFG 233

Query: 192 PDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRM 251
             + V   L+  Y K  +V+ +  VF+E+ +++V+SW S+I GYV N  AE+GL +F +M
Sbjct: 234 ERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQM 293

Query: 252 RDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQT 311
             + +E +  T+ S+   C   R +  G+ VH   +K           T LDMY KCG  
Sbjct: 294 LVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDL 353

Query: 312 RDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSAC 371
             A+ ++ E+    +VS+T+MI GY +     + ++LF +     + P+  T  +VL+ C
Sbjct: 354 DSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCC 413

Query: 372 SVSGNLNLGMSVH------GLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKD 431
           +    L+ G  VH       LG  I      V NAL+DMYAKC  + +A  +F  +  KD
Sbjct: 414 ARYRLLDEGKRVHEWIKENDLGFDI-----FVSNALMDMYAKCGSMQEAELVFSEMRVKD 473

Query: 432 VITWNSMISGYAQNGSAYDALRLFN-QMRLYFLAPDVITLVSTLSACATLGA 476
           +I+WN++I GY++N  A +AL LFN  +     +PD  T+   L ACA+L A
Sbjct: 474 IISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSA 514


HSP 3 Score: 163.7 bits (413), Expect = 5.0e-39
Identity = 103/349 (29.51%), Positives = 172/349 (49.28%), Query Frame = 1

Query: 95  CDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSF 154
           C+T L+ +Y   GD+ SA+ VF +M +    ++  MI  Y    L  + +  +  M    
Sbjct: 334 CNT-LLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEG 393

Query: 155 RECDNIIFSIILKACSELREIVEGRKVHCQIVKVG-GPDSFVMTGLIDMYGKCGQVECSS 214
              D    + +L  C+  R + EG++VH  I +   G D FV   L+DMY KCG ++ + 
Sbjct: 394 ISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAE 453

Query: 215 AVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRM-RDALVESNPFTLGSIINACTKL 274
            VF E+  K+++SW ++I GY +N  A E L LFN +  +     +  T+  ++ AC  L
Sbjct: 454 LVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASL 513

Query: 275 RALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMI 334
            A  +G+ +HGY ++N       +A + +DMY KCG    A M++D++ + DLVSWT MI
Sbjct: 514 SAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMI 573

Query: 335 VGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLE 394
            GY       + + LF    ++ +  + ++  S+L ACS SG ++ G         I   
Sbjct: 574 AGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRF----FNIMRH 633

Query: 395 ECVVK------NALIDMYAKCHKIGDAYAIFHGV-LEKDVITWNSMISG 435
           EC ++        ++DM A+   +  AY     + +  D   W +++ G
Sbjct: 634 ECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCG 677


HSP 4 Score: 112.8 bits (281), Expect = 1.0e-23
Identity = 79/290 (27.24%), Positives = 133/290 (45.86%), Query Frame = 1

Query: 54  PSVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSAR 113
           P V  ++  L C     +R +D   + H  +  + L  ++     L+ +Y   G ++ A 
Sbjct: 395 PDVYTVTAVLNC--CARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAE 454

Query: 114 MVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFR-ECDNIIFSIILKACSEL 173
           +VF +M   D  +W  +I  Y  N    + +  +N +    R   D    + +L AC+ L
Sbjct: 455 LVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASL 514

Query: 174 REIVEGRKVHCQIVKVGG-PDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMI 233
               +GR++H  I++ G   D  V   L+DMY KCG +  +  +F++I  K++VSWT MI
Sbjct: 515 SAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMI 574

Query: 234 AGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAE 293
           AGY  +   +E + LFN+MR A +E++  +  S++ AC+     H G    G+   NI  
Sbjct: 575 AGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACS-----HSGLVDEGWRFFNIMR 634

Query: 294 LSSFLATT------FLDMYVKCGQTRDARMIYDELP-TIDLVSWTAMIVG 335
               +  T       +DM  + G    A    + +P   D   W A++ G
Sbjct: 635 HECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCG 677

BLAST of Csa5G189930 vs. TrEMBL
Match: A0A0A0KLZ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G189930 PE=4 SV=1)

HSP 1 Score: 963.8 bits (2490), Expect = 8.0e-278
Identity = 476/476 (100.00%), Positives = 476/476 (100.00%), Query Frame = 1

Query: 1   MLAWRLSIQQLGMLQRFPRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFIS 60
           MLAWRLSIQQLGMLQRFPRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFIS
Sbjct: 1   MLAWRLSIQQLGMLQRFPRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFIS 60

Query: 61  LPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMP 120
           LPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMP
Sbjct: 61  LPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMP 120

Query: 121 NPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRK 180
           NPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRK
Sbjct: 121 NPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRK 180

Query: 181 VHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCA 240
           VHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCA
Sbjct: 181 VHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCA 240

Query: 241 EEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTF 300
           EEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTF
Sbjct: 241 EEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTF 300

Query: 301 LDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNS 360
           LDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNS
Sbjct: 301 LDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNS 360

Query: 361 VTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGV 420
           VTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGV
Sbjct: 361 VTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGV 420

Query: 421 LEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           LEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV
Sbjct: 421 LEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 476

BLAST of Csa5G189930 vs. TrEMBL
Match: A0A061EAZ8_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_011604 PE=4 SV=1)

HSP 1 Score: 578.2 bits (1489), Expect = 9.4e-162
Identity = 276/443 (62.30%), Positives = 351/443 (79.23%), Query Frame = 1

Query: 35  MSYSTYASHP-PLSDLHQTMPSVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNL 94
           +SY+T   HP     + +T+ S+  ISL  C  L+GL RNID+L K H L +++G+ G+L
Sbjct: 29  LSYTT--DHPLEYPSMDRTLASMHSISLNPCFALLGLCRNIDSLKKVHALFVINGIKGDL 88

Query: 95  LCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMS 154
           LCDTKLV +YG  G +  AR++FDQ+P+PDFY+WKVMIRWYFLNDL +++I FY RMRMS
Sbjct: 89  LCDTKLVSLYGLFGHIGCARLMFDQIPDPDFYSWKVMIRWYFLNDLCMEIIGFYARMRMS 148

Query: 155 FRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSS 214
            R CDN++FS++LKACSE+R+I EGRKVHCQIVK G PDSFV TGL+DMY KCG++ECS 
Sbjct: 149 VRMCDNVVFSVVLKACSEMRDIDEGRKVHCQIVKAGNPDSFVQTGLVDMYAKCGEIECSR 208

Query: 215 AVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLR 274
            VF EI+D+NVVSWTSMIAGYVQN+CAE+ LVLFNRMR+A+VE N FTLGS++ AC KL 
Sbjct: 209 KVFSEIIDRNVVSWTSMIAGYVQNDCAEDALVLFNRMREAMVEGNEFTLGSLVTACGKLG 268

Query: 275 ALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIV 334
           ALHQGKWVHGY IKN  EL+S+  TT LDMYVKCG  RDAR ++DEL ++DLVSWTAMIV
Sbjct: 269 ALHQGKWVHGYVIKNGIELNSYSVTTLLDMYVKCGSIRDARSVFDELSSVDLVSWTAMIV 328

Query: 335 GYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEE 394
           GY+Q+  P++ L+LF D+    +LPN+VT AS+LSAC+   NL+ G  VH LGI++GL++
Sbjct: 329 GYSQSGFPDEALKLFIDKKWFGILPNAVTIASLLSACAQLSNLSFGRLVHALGIQLGLKD 388

Query: 395 CVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRL 454
             V NAL+DMYAKC  IGDA  IF  V +K++I WNS+ISGY+QNGSAY+A  LF+QMR 
Sbjct: 389 STVINALVDMYAKCGMIGDARYIFETVSDKNIIAWNSIISGYSQNGSAYEAFELFHQMRS 448

Query: 455 YFLAPDVITLVSTLSACATLGAV 477
             ++PD +T+VS  SACA+LGA+
Sbjct: 449 KSVSPDAVTVVSIFSACASLGAL 469
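
The two statistics lines that open each HSP block (score and expect value, then identity and positives) can be read mechanically. The following is a minimal sketch, assuming the line layout shown in this report; the function name parse_hsp_stats and the dictionary keys are illustrative and not part of any BLAST tool. The pattern also accepts the "Postives" spelling that appears in some exports of this page.

```python
import re

# Matches e.g. "HSP 1 Score: 578.2 bits (1489), Expect = 9.4e-162"
SCORE_RE = re.compile(
    r"HSP\s+(\d+)\s+Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
# Matches e.g. "Identity = 276/443 (62.30%), Positives = 351/443 (79.23%)"
# "Posi?tives" also tolerates the misspelled label "Postives".
IDENT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(score_line, identity_line):
    """Return the HSP statistics from its two header lines as a dict."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    identical, length = int(i.group(1)), int(i.group(2))
    positive = int(i.group(4))
    return {
        "hsp": int(s.group(1)),
        "bits": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "expect": float(s.group(4)),
        "identical": identical,
        "positive": positive,
        "length": length,
        # Recomputed from the counts rather than copied from the text.
        "pct_identity": round(100.0 * identical / length, 2),
        "pct_positive": round(100.0 * positive / length, 2),
    }
```

Applied to the HSP above, the recomputed percentages are 62.3 (276/443) and 79.23 (351/443), in agreement with the values printed in the report.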

BLAST of Csa5G189930 vs. TrEMBL
Match: A0A061EAZ8_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_011604 PE=4 SV=1)

HSP 1 Score: 226.1 bits (575), Expect = 9.1e-56
Identity = 132/408 (32.35%), Positives = 218/408 (53.43%), Query Frame = 1

Query: 72  RNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMI 131
           R+ID   K H  ++  G   + +  T LV +Y   G++  +R VF ++ + +  +W  MI
Sbjct: 166 RDIDEGRKVHCQIVKAGNPDSFV-QTGLVDMYAKCGEIECSRKVFSEIIDRNVVSWTSMI 225

Query: 132 RWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGP 191
             Y  ND   D +  +NRMR +  E +      ++ AC +L  + +G+ VH  ++K G  
Sbjct: 226 AGYVQNDCAEDALVLFNRMREAMVEGNEFTLGSLVTACGKLGALHQGKWVHGYVIKNGIE 285

Query: 192 -DSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRM 251
            +S+ +T L+DMY KCG +  + +VF+E+   ++VSWT+MI GY Q+   +E L LF   
Sbjct: 286 LNSYSVTTLLDMYVKCGSIRDARSVFDELSSVDLVSWTAMIVGYSQSGFPDEALKLFIDK 345

Query: 252 RDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQT 311
           +   +  N  T+ S+++AC +L  L  G+ VH   I+ +    S +    +DMY KCG  
Sbjct: 346 KWFGILPNAVTIASLLSACAQLSNLSFGRLVHALGIQ-LGLKDSTVINALVDMYAKCGMI 405

Query: 312 RDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSAC 371
            DAR I++ +   ++++W ++I GY+Q     +   LF       + P++VT  S+ SAC
Sbjct: 406 GDARYIFETVSDKNIIAWNSIISGYSQNGSAYEAFELFHQMRSKSVSPDAVTVVSIFSAC 465

Query: 372 SVSGNLNLGMSVHGLGIKIGL--EECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITW 431
           +  G L +G S+H    K GL      V  A+++ YAK      A AIF  + EK+ +TW
Sbjct: 466 ASLGALQVGSSLHAYSTKGGLLSSSVYVGTAVLNFYAKSGDSKSARAIFDSMGEKNTVTW 525

Query: 432 NSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           ++MI GY   G +  +L LFN M    + P+ +   + LSAC   G++
Sbjct: 526 SAMIGGYGIQGDSSGSLALFNDMVKENVEPNEVIFTTILSACGHTGSL 571


HSP 2 Score: 92.0 bits (227), Expect = 2.0e-15
Identity = 66/248 (26.61%), Positives = 116/248 (46.77%), Query Frame = 1

Query: 81  HGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLF 140
           H L I  GL  + + +  LV +Y   G +  AR +F+ + + +  AW  +I  Y  N   
Sbjct: 376 HALGIQLGLKDSTVINA-LVDMYAKCGMIGDARYIFETVSDKNIIAWNSIISGYSQNGSA 435

Query: 141 VDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGPDS--FVMTG 200
            +    +++MR      D +    I  AC+ L  +  G  +H    K G   S  +V T 
Sbjct: 436 YEAFELFHQMRSKSVSPDAVTVVSIFSACASLGALQVGSSLHAYSTKGGLLSSSVYVGTA 495

Query: 201 LIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESN 260
           +++ Y K G  + + A+F+ + +KN V+W++MI GY     +   L LFN M    VE N
Sbjct: 496 VLNFYAKSGDSKSARAIFDSMGEKNTVTWSAMIGGYGIQGDSSGSLALFNDMVKENVEPN 555

Query: 261 PFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLAT-----TFLDMYVKCGQTRDA 320
                +I++AC    +L +G W +     ++ +  +F+ +       +DM  + G+  +A
Sbjct: 556 EVIFTTILSACGHTGSLGEG-WKY---FNSMCQDYNFVPSMKHYACMVDMLARAGRLEEA 615

Query: 321 RMIYDELP 322
               D+LP
Sbjct: 616 WDFIDKLP 618
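
The identity and positives counts reported for an HSP can be recovered from the three alignment rows themselves. In the usual BLAST display, the middle row repeats the residue where query and subject are identical, shows "+" for a non-identical pair with a positive substitution score, and is blank otherwise. The sketch below assumes that convention and takes the three sequence strings with the "Query:"/"Sbjct:" labels and coordinates already removed; count_alignment is a hypothetical helper name.

```python
def count_alignment(query, midline, sbjct):
    """Count identities, positives, aligned length and gaps for one block.

    All three strings must cover the same alignment columns. Trailing
    blanks of the middle row are often lost in text exports, so the
    middle row is padded back to the query length before checking.
    """
    midline = midline.ljust(len(query))
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows have different lengths")
    # An identity is a column where both residues agree and are not gaps.
    identities = sum(
        1 for q, s in zip(query, sbjct) if q == s and q != "-"
    )
    # Positives include identities plus the '+' columns of the middle row.
    positives = identities + midline.count("+")
    gaps = query.count("-") + sbjct.count("-")
    return identities, positives, len(query), gaps
```

For the last block of the HSP above (query RMIYDELP against subject WDFIDKLP), the function returns 3 identities and 4 positives over 8 columns with no gaps. Summing these tuples over all blocks of an HSP should reproduce its Identity and Positives line.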


HSP 3 Score: 48.1 bits (113), Expect = 3.4e-02
Identity = 41/160 (25.62%), Positives = 67/160 (41.88%), Query Frame = 1

Query: 55  SVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGN-LLCDTKLVGVYGALGDVRSAR 114
           +V  +S+   C  +G  +   +L   H      GL+ + +   T ++  Y   GD +SAR
Sbjct: 453 AVTVVSIFSACASLGALQVGSSL---HAYSTKGGLLSSSVYVGTAVLNFYAKSGDSKSAR 512

Query: 115 MVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELR 174
            +FD M   +   W  MI  Y +       +  +N M     E + +IF+ IL AC    
Sbjct: 513 AIFDSMGEKNTVTWSAMIGGYGIQGDSSGSLALFNDMVKENVEPNEVIFTTILSACGHTG 572

Query: 175 EIVEGRKVH---CQIVKVGGPDSFVMTGLIDMYGKCGQVE 211
            + EG K     CQ      P       ++DM  + G++E
Sbjct: 573 SLGEGWKYFNSMCQDYNF-VPSMKHYACMVDMLARAGRLE 608


HSP 4 Score: 560.1 bits (1442), Expect = 2.6e-156
Identity = 268/429 (62.47%), Positives = 335/429 (78.09%), Query Frame = 1

Query: 48  DLHQTMPSVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALG 107
           ++ +T+ S+Q IS   C  L+G+ + + +L K H LL+VHGL  +LLC+TKLV +YG+ G
Sbjct: 26  EIDRTIASIQSISSNPCFSLLGICKTVSSLRKIHALLVVHGLSEDLLCETKLVSLYGSFG 85

Query: 108 DVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYN-RMRMSFRECDNIIFSIIL 167
            V  AR++FD++ NPD Y+WKVMIRWYFLND + +++ FYN R+R    E DN++FSI+L
Sbjct: 86  HVECARLMFDRIRNPDLYSWKVMIRWYFLNDSYSEIVQFYNTRLRKCLNEYDNVVFSIVL 145

Query: 168 KACSELREIVEGRKVHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVS 227
           KACSELRE  EGRK+HCQIVKVG PDSFV+TGL+DMY KC +VE S  VF+EI+D+NVV 
Sbjct: 146 KACSELRETDEGRKLHCQIVKVGSPDSFVLTGLVDMYAKCREVEDSRRVFDEILDRNVVC 205

Query: 228 WTSMIAGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAI 287
           WTSMI GYVQN+C +EGLVLFNRMR+ LVE N +TLGS++ ACTKL ALHQGKWVHGY I
Sbjct: 206 WTSMIVGYVQNDCLKEGLVLFNRMREGLVEGNQYTLGSLVTACTKLGALHQGKWVHGYVI 265

Query: 288 KNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLR 347
           K+  +L+SFL T  LD+Y KCG  RDA  ++DEL TIDLVSWTAMIVGY Q   P + L+
Sbjct: 266 KSGFDLNSFLVTPLLDLYFKCGDIRDAFSVFDELSTIDLVSWTAMIVGYAQRGYPREALK 325

Query: 348 LFADEIRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAK 407
           LF DE   DLLPN+VT +SVLSAC+ +G+LN+G SVH LGIK+G E+   +NAL+DMYAK
Sbjct: 326 LFTDERWKDLLPNTVTTSSVLSACAQTGSLNMGRSVHCLGIKLGSEDATFENALVDMYAK 385

Query: 408 CHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVST 467
           CH IGDA  +F  V +KDVI WNS+ISGY QNG AY+AL LF+QMR   + PD ITLVS 
Sbjct: 386 CHMIGDARYVFETVFDKDVIAWNSIISGYTQNGYAYEALELFDQMRSDSVYPDAITLVSV 445

Query: 468 LSACATLGA 476
           LSACA++GA
Sbjct: 446 LSACASVGA 454

BLAST of Csa5G189930 vs. TrEMBL
Match: A5AY98_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00900 PE=4 SV=1)

HSP 1 Score: 232.3 bits (591), Expect = 1.3e-57
Identity = 137/382 (35.86%), Positives = 212/382 (55.50%), Query Frame = 1

Query: 97  TKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRE 156
           T LV +Y    +V  +R VFD++ + +   W  MI  Y  ND   + +  +NRMR    E
Sbjct: 176 TGLVDMYAKCREVEDSRRVFDEILDRNVVCWTSMIVGYVQNDCLKEGLVLFNRMREGLVE 235

Query: 157 CDNIIFSIILKACSELREIVEGRKVHCQIVKVGGP-DSFVMTGLIDMYGKCGQVECSSAV 216
            +      ++ AC++L  + +G+ VH  ++K G   +SF++T L+D+Y KCG +  + +V
Sbjct: 236 GNQYTLGSLVTACTKLGALHQGKWVHGYVIKSGFDLNSFLVTPLLDLYFKCGDIRDAFSV 295

Query: 217 FEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRAL 276
           F+E+   ++VSWT+MI GY Q     E L LF   R   +  N  T  S+++AC +  +L
Sbjct: 296 FDELSTIDLVSWTAMIVGYAQRGYPREALKLFTDERWKDLLPNTVTTSSVLSACAQTGSL 355

Query: 277 HQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGY 336
           + G+ VH   IK  +E ++F     +DMY KC    DAR +++ +   D+++W ++I GY
Sbjct: 356 NMGRSVHCLGIKLGSEDATF-ENALVDMYAKCHMIGDARYVFETVFDKDVIAWNSIISGY 415

Query: 337 TQARQPNDGLRLFADEIRSD-LLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGL--E 396
           TQ     + L LF D++RSD + P+++T  SVLSAC+  G   +G S+HG  IK GL   
Sbjct: 416 TQNGYAYEALELF-DQMRSDSVYPDAITLVSVLSACASVGAYRVGSSLHGYAIKAGLLSG 475

Query: 397 ECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMR 456
              V  AL++ YAKC     A  IF  + EK+ ITW++MI GY   G    +L LF  M 
Sbjct: 476 SVYVGTALLNFYAKCGDAESARVIFDEMGEKNTITWSAMIGGYGIQGDCSRSLELFGDML 535

Query: 457 LYFLAPDVITLVSTLSACATLG 475
              L P+ +   + LSAC+  G
Sbjct: 536 KEKLEPNEVIFTTILSACSHSG 555


HSP 2 Score: 94.4 bits (233), Expect = 4.1e-16
Identity = 61/235 (25.96%), Positives = 113/235 (48.09%), Query Frame = 1

Query: 96  DTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFR 155
           +  LV +Y     +  AR VF+ + + D  AW  +I  Y  N    + +  +++MR    
Sbjct: 376 ENALVDMYAKCHMIGDARYVFETVFDKDVIAWNSIISGYTQNGYAYEALELFDQMRSDSV 435

Query: 156 ECDNIIFSIILKACSELREIVEGRKVHCQIVKVG--GPDSFVMTGLIDMYGKCGQVECSS 215
             D I    +L AC+ +     G  +H   +K G      +V T L++ Y KCG  E + 
Sbjct: 436 YPDAITLVSVLSACASVGAYRVGSSLHGYAIKAGLLSGSVYVGTALLNFYAKCGDAESAR 495

Query: 216 AVFEEIMDKNVVSWTSMIAGY-VQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKL 275
            +F+E+ +KN ++W++MI GY +Q +C+   L LF  M    +E N     +I++AC+  
Sbjct: 496 VIFDEMGEKNTITWSAMIGGYGIQGDCS-RSLELFGDMLKEKLEPNEVIFTTILSACS-- 555

Query: 276 RALHQGKWVHGYAIKN-IAELSSFLAT-----TFLDMYVKCGQTRDARMIYDELP 322
              H G    G+   N + ++ +F+ +       +D+  + G+  +A    +++P
Sbjct: 556 ---HSGMLGEGWRYFNTMCQVYNFVPSMKHYACMVDLLARAGRLEEALDFIEKIP 604


HSP 3 Score: 59.7 bits (143), Expect = 1.1e-05
Identity = 45/169 (26.63%), Positives = 77/169 (45.56%), Query Frame = 1

Query: 55  SVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLI-GNLLCDTKLVGVYGALGDVRSAR 114
           ++  +S+   C  +G +R   +L   HG  I  GL+ G++   T L+  Y   GD  SAR
Sbjct: 439 AITLVSVLSACASVGAYRVGSSL---HGYAIKAGLLSGSVYVGTALLNFYAKCGDAESAR 498

Query: 115 MVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELR 174
           ++FD+M   +   W  MI  Y +       +  +  M     E + +IF+ IL ACS   
Sbjct: 499 VIFDEMGEKNTITWSAMIGGYGIQGDCSRSLELFGDMLKEKLEPNEVIFTTILSACSHSG 558

Query: 175 EIVEGRK---VHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEI 220
            + EG +     CQ+     P       ++D+  + G++E +    E+I
Sbjct: 559 MLGEGWRYFNTMCQVYNF-VPSMKHYACMVDLLARAGRLEEALDFIEKI 603


HSP 4 Score: 555.1 bits (1429), Expect = 8.5e-155
Identity = 274/472 (58.05%), Positives = 349/472 (73.94%), Query Frame = 1

Query: 5   RLSIQQLGMLQRFPRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLPLC 64
           RL +  L   Q + R F+        + H++        PP  D    + S+ +I    C
Sbjct: 8   RLRLSFLLHNQLYKRRFYFQLRNFSYLTHQLPLD-----PPQFD--HNIASIHYIFSHPC 67

Query: 65  CYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDF 124
             L+G  +NI +L K HGLLIV GL G+LLC+TKLV +YG+ GD+ +AR+VFD++PNPD 
Sbjct: 68  FNLLGFCKNIYSLKKVHGLLIVDGLDGDLLCNTKLVSLYGSFGDIDAARVVFDRIPNPDL 127

Query: 125 YAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQ 184
           Y+WKVM+RWYFL+DL+ ++   Y+RM++  +E DN++FSI+LKACSELR I EGRK+HCQ
Sbjct: 128 YSWKVMLRWYFLSDLYWEIFGLYSRMKICVKEYDNVMFSIVLKACSELRCIDEGRKIHCQ 187

Query: 185 IVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGL 244
           +VKVG PDSFV+TGL+DMY KCG++E S  VF+E +D+NVVSWTSMIAGYVQN+C  EGL
Sbjct: 188 VVKVGDPDSFVLTGLVDMYAKCGEIESSRHVFDENLDRNVVSWTSMIAGYVQNDCPAEGL 247

Query: 245 VLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMY 304
            LFNRMR+  V  N FTLGS++ ACTKL ALHQGKWVHG+AIK+  EL+S+L T  LDMY
Sbjct: 248 TLFNRMREGFVGGNQFTLGSLVTACTKLGALHQGKWVHGFAIKSGVELNSYLVTALLDMY 307

Query: 305 VKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAA 364
           VKCG  +DAR ++DEL ++DLVSWTAMIVGYTQ+   ++ L+LF DE + D LPN VT  
Sbjct: 308 VKCGVIKDARSVFDELSSVDLVSWTAMIVGYTQSGLFHEALKLFMDE-KFDALPNDVTIV 367

Query: 365 SVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKD 424
           +VLSAC+  GNLNLG SVHGLGIK+G  +  V NAL+DMYAKCH   DA  IF  +  KD
Sbjct: 368 TVLSACAQLGNLNLGRSVHGLGIKLGFRQSTVANALVDMYAKCHMNRDASFIFERISHKD 427

Query: 425 VITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           V+ WNS+ISGY QNGSAY+AL LF+QMR+  + PD +TLVS  SACA LGA+
Sbjct: 428 VVAWNSIISGYYQNGSAYEALELFHQMRMELVLPDAVTLVSVFSACALLGAL 471

BLAST of Csa5G189930 vs. TrEMBL
Match: A0A067L9V5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15672 PE=4 SV=1)

HSP 1 Score: 238.8 bits (608), Expect = 1.4e-59
Identity = 141/384 (36.72%), Positives = 213/384 (55.47%), Query Frame = 1

Query: 97  TKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRE 156
           T LV +Y   G++ S+R VFD+  + +  +W  MI  Y  ND   + +  +NRMR  F  
Sbjct: 193 TGLVDMYAKCGEIESSRHVFDENLDRNVVSWTSMIAGYVQNDCPAEGLTLFNRMREGFVG 252

Query: 157 CDNIIFSIILKACSELREIVEGRKVHCQIVKVGGP-DSFVMTGLIDMYGKCGQVECSSAV 216
            +      ++ AC++L  + +G+ VH   +K G   +S+++T L+DMY KCG ++ + +V
Sbjct: 253 GNQFTLGSLVTACTKLGALHQGKWVHGFAIKSGVELNSYLVTALLDMYVKCGVIKDARSV 312

Query: 217 FEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLF-NRMRDALVESNPFTLGSIINACTKLRA 276
           F+E+   ++VSWT+MI GY Q+    E L LF +   DAL   N  T+ ++++AC +L  
Sbjct: 313 FDELSSVDLVSWTAMIVGYTQSGLFHEALKLFMDEKFDAL--PNDVTIVTVLSACAQLGN 372

Query: 277 LHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVG 336
           L+ G+ VHG  IK +    S +A   +DMY KC   RDA  I++ +   D+V+W ++I G
Sbjct: 373 LNLGRSVHGLGIK-LGFRQSTVANALVDMYAKCHMNRDASFIFERISHKDVVAWNSIISG 432

Query: 337 YTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGL--E 396
           Y Q     + L LF       +LP++VT  SV SAC++ G L  G S+H   IK GL   
Sbjct: 433 YYQNGSAYEALELFHQMRMELVLPDAVTLVSVFSACALLGALRAGSSLHAYSIKEGLLSS 492

Query: 397 ECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMR 456
              V  AL+  YAKC     A  +F  + EK+ +TW++MI GY   G A  +L LFN M 
Sbjct: 493 NVYVGTALLTFYAKCGDANSARTVFDSMGEKNNVTWSAMIGGYGIQGDANGSLALFNDML 552

Query: 457 LYFLAPDVITLVSTLSACATLGAV 477
              L P+ I   + LSAC+  G V
Sbjct: 553 KKDLKPNEIIFTTILSACSHTGMV 573


HSP 2 Score: 94.4 bits (233), Expect = 4.1e-16
Identity = 70/280 (25.00%), Positives = 129/280 (46.07%), Query Frame = 1

Query: 56  VQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMV 115
           V  +++   C  +G   N++     HGL I  G   + + +  LV +Y      R A  +
Sbjct: 356 VTIVTVLSACAQLG---NLNLGRSVHGLGIKLGFRQSTVANA-LVDMYAKCHMNRDASFI 415

Query: 116 FDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREI 175
           F+++ + D  AW  +I  Y+ N    + +  +++MRM     D +    +  AC+ L  +
Sbjct: 416 FERISHKDVVAWNSIISGYYQNGSAYEALELFHQMRMELVLPDAVTLVSVFSACALLGAL 475

Query: 176 VEGRKVHCQIVKVG--GPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAG 235
             G  +H   +K G    + +V T L+  Y KCG    +  VF+ + +KN V+W++MI G
Sbjct: 476 RAGSSLHAYSIKEGLLSSNVYVGTALLTFYAKCGDANSARTVFDSMGEKNNVTWSAMIGG 535

Query: 236 YVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAI-KNIAEL 295
           Y     A   L LFN M    ++ N     +I++AC+     H G    G+ +  ++ + 
Sbjct: 536 YGIQGDANGSLALFNDMLKKDLKPNEIIFTTILSACS-----HTGMVGEGWNLFISMCKE 595

Query: 296 SSFLAT-----TFLDMYVKCGQTRDARMIYDELPTIDLVS 328
            +F+ +       +D+  + G+  +A    D++P    VS
Sbjct: 596 HNFVPSMKHYACMVDLLARSGRLEEAWEFIDKMPVQPNVS 626


HSP 3 Score: 59.3 bits (142), Expect = 1.5e-05
Identity = 51/202 (25.25%), Positives = 89/202 (44.06%), Query Frame = 1

Query: 55  SVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIG-NLLCDTKLVGVYGALGDVRSAR 114
           +V  +S+   C L+G  R   +L   H   I  GL+  N+   T L+  Y   GD  SAR
Sbjct: 455 AVTLVSVFSACALLGALRAGSSL---HAYSIKEGLLSSNVYVGTALLTFYAKCGDANSAR 514

Query: 115 MVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELR 174
            VFD M   +   W  MI  Y +       +  +N M     + + IIF+ IL ACS   
Sbjct: 515 TVFDSMGEKNNVTWSAMIGGYGIQGDANGSLALFNDMLKKDLKPNEIIFTTILSACSHTG 574

Query: 175 EIVEGRKVHCQIVKVGG--PDSFVMTGLIDMYGKCGQVECSSAVFEEI-MDKNVVSWTSM 234
            + EG  +   + K     P       ++D+  + G++E +    +++ +  NV  + + 
Sbjct: 575 MVGEGWNLFISMCKEHNFVPSMKHYACMVDLLARSGRLEEAWEFIDKMPVQPNVSLFGAF 634

Query: 235 IAGYVQNNCAEEGLVLFNRMRD 253
           + G   ++  + G +   RM++
Sbjct: 635 LHGCGLHSRFDLGEIAIRRMQE 653


HSP 4 Score: 553.9 bits (1426), Expect = 1.9e-154
Identity = 259/394 (65.74%), Positives = 322/394 (81.73%), Query Frame = 1

Query: 81  HGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLF 140
           H LL++HGL  +LLC TKL+ +YG+ G V+ AR++FDQMP+PDFY+WKVM+RWYF+++L+
Sbjct: 4   HSLLVLHGLSNDLLCRTKLISLYGSFGYVKCARLLFDQMPSPDFYSWKVMLRWYFMHNLY 63

Query: 141 VDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGPDSFVMTGLI 200
            +V+ FY  MR+  RE DN++FSI+LKACSELR+  EGRKVHCQIVKV  PDSFV+TGL+
Sbjct: 64  AEVMGFYTSMRICVREHDNVVFSIVLKACSELRDFNEGRKVHCQIVKVASPDSFVLTGLV 123

Query: 201 DMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNPF 260
           D+Y KCG +ECS AVF+ I+D NVV WTSMI GYVQN+C ++GLVLFNRMR+ L++ N F
Sbjct: 124 DVYAKCGWIECSRAVFDGIVDGNVVCWTSMIVGYVQNDCPQDGLVLFNRMREELIKGNQF 183

Query: 261 TLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDEL 320
           TLGS++ ACTKLRALHQGKW+HG+ IK   E+SSFL T+ LDMYVKCG  R AR I+DEL
Sbjct: 184 TLGSVLTACTKLRALHQGKWIHGHLIKTGIEVSSFLVTSLLDMYVKCGDIRYARSIFDEL 243

Query: 321 PTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLGM 380
           P IDLVSWTAMIVGYTQ+  P++ L+LF DE    LLPNS+T ASVLS+C+ S NLNLG 
Sbjct: 244 PAIDLVSWTAMIVGYTQSGCPDEALKLFTDEKWVGLLPNSITTASVLSSCAQSCNLNLGR 303

Query: 381 SVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGS 440
           S+HGLGIK+GLE+  V+NAL+DMYAKCH IGDA  IF  +L+K+VI WNS+ISGY+QNGS
Sbjct: 304 SIHGLGIKLGLEDSTVRNALVDMYAKCHMIGDARYIFETILDKNVIAWNSIISGYSQNGS 363

Query: 441 AYDALRLFNQMRLYFLAPDVITLVSTLSACATLG 475
           AY+AL+LF+QMR    + D  TL S LSAC TLG
Sbjct: 364 AYEALQLFHQMRSESFSHDAFTLASVLSACTTLG 397

BLAST of Csa5G189930 vs. TAIR10
Match: AT2G03380.1 (AT2G03380.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 424.9 bits (1091), Expect = 6.7e-119
Identity = 212/422 (50.24%), Positives = 289/422 (68.48%), Query Frame = 1

Query: 55  SVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARM 114
           S+ + +   C  L+    NID+L + HG+L  +GL+G++   TKLV +YG  G  + AR+
Sbjct: 38  SLHYAASSPCFLLLSKCTNIDSLRQSHGVLTGNGLMGDISIATKLVSLYGFFGYTKDARL 97

Query: 115 VFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELRE 174
           VFDQ+P PDFY WKVM+R Y LN   V+V+  Y+ +       D+I+FS  LKAC+EL++
Sbjct: 98  VFDQIPEPDFYLWKVMLRCYCLNKESVEVVKLYDLLMKHGFRYDDIVFSKALKACTELQD 157

Query: 175 IVEGRKVHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGY 234
           +  G+K+HCQ+VKV   D+ V+TGL+DMY KCG+++ +  VF +I  +NVV WTSMIAGY
Sbjct: 158 LDNGKKIHCQLVKVPSFDNVVLTGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMIAGY 217

Query: 235 VQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSS 294
           V+N+  EEGLVLFNRMR+  V  N +T G++I ACTKL ALHQGKW HG  +K+  ELSS
Sbjct: 218 VKNDLCEEGLVLFNRMRENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSGIELSS 277

Query: 295 FLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRS 354
            L T+ LDMYVKCG   +AR +++E   +DLV WTAMIVGYT     N+ L LF      
Sbjct: 278 CLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKMKGV 337

Query: 355 DLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAY 414
           ++ PN VT ASVLS C +  NL LG SVHGL IK+G+ +  V NAL+ MYAKC++  DA 
Sbjct: 338 EIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNVANALVHMYAKCYQNRDAK 397

Query: 415 AIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLG 474
            +F    EKD++ WNS+ISG++QNGS ++AL LF++M    + P+ +T+ S  SACA+LG
Sbjct: 398 YVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSACASLG 457

Query: 475 AV 477
           ++
Sbjct: 458 SL 459


HSP 2 Score: 231.5 bits (589), Expect = 1.1e-60
Identity = 136/409 (33.25%), Positives = 214/409 (52.32%), Query Frame = 1

Query: 72  RNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMI 131
           +++D   K H  L+      N++  T L+ +Y   G+++SA  VF+ +   +   W  MI
Sbjct: 156 QDLDNGKKIHCQLVKVPSFDNVVL-TGLLDMYAKCGEIKSAHKVFNDITLRNVVCWTSMI 215

Query: 132 RWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGP 191
             Y  NDL  + +  +NRMR +    +   +  ++ AC++L  + +G+  H  +VK G  
Sbjct: 216 AGYVKNDLCEEGLVLFNRMRENNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVKSGIE 275

Query: 192 -DSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRM 251
             S ++T L+DMY KCG +  +  VF E    ++V WT+MI GY  N    E L LF +M
Sbjct: 276 LSSCLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALSLFQKM 335

Query: 252 RDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQT 311
           +   ++ N  T+ S+++ C  +  L  G+ VHG +IK +    + +A   + MY KC Q 
Sbjct: 336 KGVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIK-VGIWDTNVANALVHMYAKCYQN 395

Query: 312 RDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSAC 371
           RDA+ +++     D+V+W ++I G++Q    ++ L LF       + PN VT AS+ SAC
Sbjct: 396 RDAKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNGVTVASLFSAC 455

Query: 372 SVSGNLNLGMSVHGLGIKIGL---EECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVIT 431
           +  G+L +G S+H   +K+G        V  AL+D YAKC     A  IF  + EK+ IT
Sbjct: 456 ASLGSLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDTIEEKNTIT 515

Query: 432 WNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           W++MI GY + G    +L LF +M      P+  T  S LSAC   G V
Sbjct: 516 WSAMIGGYGKQGDTIGSLELFEEMLKKQQKPNESTFTSILSACGHTGMV 562


HSP 3 Score: 179.5 bits (454), Expect = 4.9e-45
Identity = 110/360 (30.56%), Positives = 178/360 (49.44%), Query Frame = 1

Query: 80  FHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDL 139
           FHG L+  G+  +    T L+ +Y   GD+ +AR VF++  + D   W  MI  Y  N  
Sbjct: 264 FHGCLVKSGIELSSCLVTSLLDMYVKCGDISNARRVFNEHSHVDLVMWTAMIVGYTHNGS 323

Query: 140 FVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGPDSFVMTGL 199
             + +  + +M+    + + +  + +L  C  +  +  GR VH   +KVG  D+ V   L
Sbjct: 324 VNEALSLFQKMKGVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNVANAL 383

Query: 200 IDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNP 259
           + MY KC Q   +  VFE   +K++V+W S+I+G+ QN    E L LF+RM    V  N 
Sbjct: 384 VHMYAKCYQNRDAKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRMNSESVTPNG 443

Query: 260 FTLGSIINACTKLRALHQGKWVHGYAIK--NIAELSSFLATTFLDMYVKCGQTRDARMIY 319
            T+ S+ +AC  L +L  G  +H Y++K   +A  S  + T  LD Y KCG  + AR+I+
Sbjct: 444 VTVASLFSACASLGSLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIF 503

Query: 320 DELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLN 379
           D +   + ++W+AMI GY +       L LF + ++    PN  T  S+LSAC  +G +N
Sbjct: 504 DTIEEKNTITWSAMIGGYGKQGDTIGSLELFEEMLKKQQKPNESTFTSILSACGHTGMVN 563

Query: 380 LGMSVHGLGIKIGLEECVVKN--ALIDMYAKCHKIGDAYAIFHGV-LEKDVITWNSMISG 435
            G        K        K+   ++DM A+  ++  A  I   + ++ DV  + + + G
Sbjct: 564 EGKKYFSSMYKDYNFTPSTKHYTCMVDMLARAGELEQALDIIEKMPIQPDVRCFGAFLHG 623

BLAST of Csa5G189930 vs. TAIR10
Match: AT3G12770.1 (AT3G12770.1 mitochondrial editing factor 22)

HSP 1 Score: 249.6 bits (636), Expect = 3.9e-66
Identity = 136/404 (33.66%), Positives = 225/404 (55.69%), Query Frame = 1

Query: 77  LIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFL 136
           L + H  L+V GL  +    TKL+    + GD+  AR VFD +P P  + W  +IR Y  
Sbjct: 37  LKQIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSR 96

Query: 137 NDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG-GPDSFV 196
           N+ F D +  Y+ M+++    D+  F  +LKACS L  +  GR VH Q+ ++G   D FV
Sbjct: 97  NNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFV 156

Query: 197 MTGLIDMYGKCGQVECSSAVFE--EIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDA 256
             GLI +Y KC ++  +  VFE   + ++ +VSWT++++ Y QN    E L +F++MR  
Sbjct: 157 QNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKM 216

Query: 257 LVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDA 316
            V+ +   L S++NA T L+ L QG+ +H   +K   E+   L  +   MY KCGQ   A
Sbjct: 217 DVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATA 276

Query: 317 RMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVS 376
           ++++D++ + +L+ W AMI GY +     + + +F + I  D+ P++++  S +SAC+  
Sbjct: 277 KILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQV 336

Query: 377 GNLNLGMSVHG-LGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMI 436
           G+L    S++  +G     ++  + +ALIDM+AKC  +  A  +F   L++DV+ W++MI
Sbjct: 337 GSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMI 396

Query: 437 SGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
            GY  +G A +A+ L+  M    + P+ +T +  L AC   G V
Sbjct: 397 VGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGMV 440


HSP 2 Score: 159.5 bits (402), Expect = 5.3e-39
Identity = 97/365 (26.58%), Positives = 183/365 (50.14%), Query Frame = 1

Query: 81  HGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPD--FYAWKVMIRWYFLND 140
           H  +   G   ++     L+ +Y     + SAR VF+ +P P+    +W  ++  Y  N 
Sbjct: 142 HAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNG 201

Query: 141 LFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG---GPDSFV 200
             ++ +  +++MR    + D +    +L A + L+++ +GR +H  +VK+G    PD  +
Sbjct: 202 EPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPD--L 261

Query: 201 MTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALV 260
           +  L  MY KCGQV  +  +F+++   N++ W +MI+GY +N  A E + +F+ M +  V
Sbjct: 262 LISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINKDV 321

Query: 261 ESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARM 320
             +  ++ S I+AC ++ +L Q + ++ Y  ++      F+++  +DM+ KCG    AR+
Sbjct: 322 RPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSVEGARL 381

Query: 321 IYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGN 380
           ++D     D+V W+AMIVGY    +  + + L+    R  + PN VT   +L AC+ SG 
Sbjct: 382 VFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLMACNHSGM 441

Query: 381 LNLG------MSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGV-LEKDVITW 434
           +  G      M+ H +  +     CV     ID+  +   +  AY +   + ++  V  W
Sbjct: 442 VREGWWFFNRMADHKINPQQQHYACV-----IDLLGRAGHLDQAYEVIKCMPVQPGVTVW 499

BLAST of Csa5G189930 vs. TAIR10
Match: AT4G18520.1 (AT4G18520.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 235.7 bits (600), Expect = 5.8e-62
Identity = 137/395 (34.68%), Positives = 214/395 (54.18%), Query Frame = 1

Query: 81  HGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLF 140
           HG ++  G +GNL+ ++ LV  Y   G++ SA   FD M   D  +W  +I         
Sbjct: 207 HGNMVKVG-VGNLIVESSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHG 266

Query: 141 VDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVK-VGGPDSFVMTGL 200
           +  I  +  M   +   +      ILKACSE + +  GR+VH  +VK +   D FV T L
Sbjct: 267 IKAIGMFIGMLNHWFLPNEFTVCSILKACSEEKALRFGRQVHSLVVKRMIKTDVFVGTSL 326

Query: 201 IDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNP 260
           +DMY KCG++     VF+ + ++N V+WTS+IA + +    EE + LF  M+   + +N 
Sbjct: 327 MDMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANN 386

Query: 261 FTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDE 320
            T+ SI+ AC  + AL  GK +H   IKN  E + ++ +T + +Y KCG++RDA  +  +
Sbjct: 387 LTVVSILRACGSVGALLLGKELHAQIIKNSIEKNVYIGSTLVWLYCKCGESRDAFNVLQQ 446

Query: 321 LPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLG 380
           LP+ D+VSWTAMI G +     ++ L    + I+  + PN  T +S L AC+ S +L +G
Sbjct: 447 LPSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFTYSSALKACANSESLLIG 506

Query: 381 MSVHGLGIK-IGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQN 440
            S+H +  K   L    V +ALI MYAKC  + +A+ +F  + EK++++W +MI GYA+N
Sbjct: 507 RSIHSIAKKNHALSNVFVGSALIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYARN 566

Query: 441 GSAYDALRLFNQMRLYFLAPDVITLVSTLSACATL 474
           G   +AL+L  +M       D     + LS C  +
Sbjct: 567 GFCREALKLMYRMEAEGFEVDDYIFATILSTCGDI 600


HSP 2 Score: 201.1 bits (510), Expect = 1.6e-51
Identity = 115/368 (31.25%), Positives = 192/368 (52.17%), Query Frame = 1

Query: 106 LGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIP-FYNRMRMSFRECDNIIFSI 165
           LGD+  AR VFD MP  +   W  MI  Y    L  +    F + ++   R  +  +F  
Sbjct: 130 LGDLVYARKVFDSMPEKNTVTWTAMIDGYLKYGLEDEAFALFEDYVKHGIRFTNERMFVC 189

Query: 166 ILKACSELREIVEGRKVHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNV 225
           +L  CS   E   GR+VH  +VKVG  +  V + L+  Y +CG++  +   F+ + +K+V
Sbjct: 190 LLNLCSRRAEFELGRQVHGNMVKVGVGNLIVESSLVYFYAQCGELTSALRAFDMMEEKDV 249

Query: 226 VSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGY 285
           +SWT++I+   +     + + +F  M +     N FT+ SI+ AC++ +AL  G+ VH  
Sbjct: 250 ISWTAVISACSRKGHGIKAIGMFIGMLNHWFLPNEFTVCSILKACSEEKALRFGRQVHSL 309

Query: 286 AIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDG 345
            +K + +   F+ T+ +DMY KCG+  D R ++D +   + V+WT++I  + +     + 
Sbjct: 310 VVKRMIKTDVFVGTSLMDMYAKCGEISDCRKVFDGMSNRNTVTWTSIIAAHAREGFGEEA 369

Query: 346 LRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECV-VKNALIDM 405
           + LF    R  L+ N++T  S+L AC   G L LG  +H   IK  +E+ V + + L+ +
Sbjct: 370 ISLFRIMKRRHLIANNLTVVSILRACGSVGALLLGKELHAQIIKNSIEKNVYIGSTLVWL 429

Query: 406 YAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITL 465
           Y KC +  DA+ +   +  +DV++W +MISG +  G   +AL    +M    + P+  T 
Sbjct: 430 YCKCGESRDAFNVLQQLPSRDVVSWTAMISGCSSLGHESEALDFLKEMIQEGVEPNPFTY 489

Query: 466 VSTLSACA 472
            S L ACA
Sbjct: 490 SSALKACA 497


HSP 3 Score: 105.9 bits (263), Expect = 6.9e-23
Identity = 57/194 (29.38%), Positives = 98/194 (50.52%), Query Frame = 1

Query: 279 KWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQA 338
           K +H  A+K   +   +     +   V+ G    AR ++D +P  + V+WTAMI GY + 
Sbjct: 102 KRIHAMALKCFDDQVIYFGNNLISSCVRLGDLVYARKVFDSMPEKNTVTWTAMIDGYLKY 161

Query: 339 RQPNDGLRLFADEIRSDL-LPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVK 398
              ++   LF D ++  +   N      +L+ CS      LG  VHG  +K+G+   +V+
Sbjct: 162 GLEDEAFALFEDYVKHGIRFTNERMFVCLLNLCSRRAEFELGRQVHGNMVKVGVGNLIVE 221

Query: 399 NALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLA 458
           ++L+  YA+C ++  A   F  + EKDVI+W ++IS  ++ G    A+ +F  M  ++  
Sbjct: 222 SSLVYFYAQCGELTSALRAFDMMEEKDVISWTAVISACSRKGHGIKAIGMFIGMLNHWFL 281

Query: 459 PDVITLVSTLSACA 472
           P+  T+ S L AC+
Sbjct: 282 PNEFTVCSILKACS 295

BLAST of Csa5G189930 vs. TAIR10
Match: AT2G33680.1 (AT2G33680.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 235.7 bits (600), Expect = 5.8e-62
Identity = 140/426 (32.86%), Positives = 216/426 (50.70%), Query Frame = 1

Query: 61  LPLCCYLMGLFRNIDTLI------KFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARM 120
           LP    L G+F+   +L       + H L++     G++  DT LVG+Y   G V     
Sbjct: 115 LPNAYTLAGIFKAESSLQSSTVGRQAHALVVKMSSFGDIYVDTSLVGMYCKAGLVEDGLK 174

Query: 121 VFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDN--IIFSIILKACSEL 180
           VF  MP  + Y W  M+  Y       + I  +N       E  +   +F+ +L + +  
Sbjct: 175 VFAYMPERNTYTWSTMVSGYATRGRVEEAIKVFNLFLREKEEGSDSDYVFTAVLSSLAAT 234

Query: 181 REIVEGRKVHCQIVKVGGPDSFVMTG-LIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMI 240
             +  GR++HC  +K G      ++  L+ MY KC  +  +  +F+   D+N ++W++M+
Sbjct: 235 IYVGLGRQIHCITIKNGLLGFVALSNALVTMYSKCESLNEACKMFDSSGDRNSITWSAMV 294

Query: 241 AGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAE 300
            GY QN  + E + LF+RM  A ++ + +T+  ++NAC+ +  L +GK +H + +K   E
Sbjct: 295 TGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVGVLNACSDICYLEEGKQLHSFLLKLGFE 354

Query: 301 LSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADE 360
              F  T  +DMY K G   DAR  +D L   D+  WT++I GY Q     + L L+   
Sbjct: 355 RHLFATTALVDMYAKAGCLADARKGFDCLQERDVALWTSLISGYVQNSDNEEALILYRRM 414

Query: 361 IRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLE-ECVVKNALIDMYAKCHKI 420
             + ++PN  T ASVL ACS    L LG  VHG  IK G   E  + +AL  MY+KC  +
Sbjct: 415 KTAGIIPNDPTMASVLKACSSLATLELGKQVHGHTIKHGFGLEVPIGSALSTMYSKCGSL 474

Query: 421 GDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSAC 477
            D   +F     KDV++WN+MISG + NG   +AL LF +M    + PD +T V+ +SAC
Sbjct: 475 EDGNLVFRRTPNKDVVSWNAMISGLSHNGQGDEALELFEEMLAEGMEPDDVTFVNIISAC 534


HSP 2 Score: 180.6 bits (457), Expect = 2.2e-45
Identity = 105/365 (28.77%), Positives = 189/365 (51.78%), Query Frame = 1

Query: 79  KFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLND 138
           + H + I +GL+G +     LV +Y     +  A  +FD   + +   W  M+  Y  N 
Sbjct: 242 QIHCITIKNGLLGFVALSNALVTMYSKCESLNEACKMFDSSGDRNSITWSAMVTGYSQNG 301

Query: 139 LFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGPDS-FVMT 198
             ++ +  ++RM  +  +        +L ACS++  + EG+++H  ++K+G     F  T
Sbjct: 302 ESLEAVKLFSRMFSAGIKPSEYTIVGVLNACSDICYLEEGKQLHSFLLKLGFERHLFATT 361

Query: 199 GLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVES 258
            L+DMY K G +  +   F+ + +++V  WTS+I+GYVQN+  EE L+L+ RM+ A +  
Sbjct: 362 ALVDMYAKAGCLADARKGFDCLQERDVALWTSLISGYVQNSDNEEALILYRRMKTAGIIP 421

Query: 259 NPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIY 318
           N  T+ S++ AC+ L  L  GK VHG+ IK+   L   + +    MY KCG   D  +++
Sbjct: 422 NDPTMASVLKACSSLATLELGKQVHGHTIKHGFGLEVPIGSALSTMYSKCGSLEDGNLVF 481

Query: 319 DELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLN 378
              P  D+VSW AMI G +   Q ++ L LF + +   + P+ VT  +++SACS  G + 
Sbjct: 482 RRTPNKDVVSWNAMISGLSHNGQGDEALELFEEMLAEGMEPDDVTFVNIISACSHKGFVE 541

Query: 379 LG-MSVHGLGIKIGLEECVVKNA-LIDMYAKCHKIGDAYAIFHGV-LEKDVITWNSMISG 438
            G    + +  +IGL+  V   A ++D+ ++  ++ +A        ++  +  W  ++S 
Sbjct: 542 RGWFYFNMMSDQIGLDPKVDHYACMVDLLSRAGQLKEAKEFIESANIDHGLCLWRILLSA 601

Query: 439 YAQNG 440
              +G
Sbjct: 602 CKNHG 606


HSP 3 Score: 173.7 bits (439), Expect = 2.7e-43
Identity = 117/412 (28.40%), Positives = 193/412 (46.84%), Query Frame = 1

Query: 72  RNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMI 131
           RN+      HG +I  G    +     LV  Y   G +  A  +F+ +   D  +W  +I
Sbjct: 28  RNLVAGRAVHGQIIRTGASTCIQHANVLVNFYAKCGKLAKAHSIFNAIICKDVVSWNSLI 87

Query: 132 RWYFLN---DLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKV 191
             Y  N        V+  +  MR      +    + I KA S L+    GR+ H  +VK+
Sbjct: 88  TGYSQNGGISSSYTVMQLFREMRAQDILPNAYTLAGIFKAESSLQSSTVGRQAHALVVKM 147

Query: 192 GG-PDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLF 251
               D +V T L+ MY K G VE    VF  + ++N  +W++M++GY      EE + +F
Sbjct: 148 SSFGDIYVDTSLVGMYCKAGLVEDGLKVFAYMPERNTYTWSTMVSGYATRGRVEEAIKVF 207

Query: 252 NRMRDALVESNP--FTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYV 311
           N       E +   +   +++++      +  G+ +H   IKN       L+   + MY 
Sbjct: 208 NLFLREKEEGSDSDYVFTAVLSSLAATIYVGLGRQIHCITIKNGLLGFVALSNALVTMYS 267

Query: 312 KCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAAS 371
           KC    +A  ++D     + ++W+AM+ GY+Q  +  + ++LF+    + + P+  T   
Sbjct: 268 KCESLNEACKMFDSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRMFSAGIKPSEYTIVG 327

Query: 372 VLSACSVSGNLNLGMSVHGLGIKIGLEECV-VKNALIDMYAKCHKIGDAYAIFHGVLEKD 431
           VL+ACS    L  G  +H   +K+G E  +    AL+DMYAK   + DA   F  + E+D
Sbjct: 328 VLNACSDICYLEEGKQLHSFLLKLGFERHLFATTALVDMYAKAGCLADARKGFDCLQERD 387

Query: 432 VITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           V  W S+ISGY QN    +AL L+ +M+   + P+  T+ S L AC++L  +
Sbjct: 388 VALWTSLISGYVQNSDNEEALILYRRMKTAGIIPNDPTMASVLKACSSLATL 439


HSP 4 Score: 90.1 bits (222), Expect = 3.9e-18
Identity = 47/125 (37.60%), Positives = 75/125 (60.00%), Query Frame = 1

Query: 353 RSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVK-NALIDMYAKCHKIG 412
           +++L P++ T    L+  S   NL  G +VHG  I+ G   C+   N L++ YAKC K+ 
Sbjct: 7   QTELNPHTSTLLKKLTHHSQQRNLVAGRAVHGQIIRTGASTCIQHANVLVNFYAKCGKLA 66

Query: 413 DAYAIFHGVLEKDVITWNSMISGYAQNG---SAYDALRLFNQMRLYFLAPDVITLVSTLS 472
            A++IF+ ++ KDV++WNS+I+GY+QNG   S+Y  ++LF +MR   + P+  TL     
Sbjct: 67  KAHSIFNAIICKDVVSWNSLITGYSQNGGISSSYTVMQLFREMRAQDILPNAYTLAGIFK 126

Query: 473 ACATL 474
           A ++L
Sbjct: 127 AESSL 131

BLAST of Csa5G189930 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 235.0 bits (598), Expect = 9.9e-62
Identity = 140/404 (34.65%), Positives = 212/404 (52.48%), Query Frame = 1

Query: 79  KFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLND 138
           + HG ++  G          LV  Y     V SAR VFD+M   D  +W  +I  Y  N 
Sbjct: 216 QLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNG 275

Query: 139 LFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKV--GGPDSFVM 198
           L    +  + +M +S  E D      +   C++ R I  GR VH   VK      D F  
Sbjct: 276 LAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCN 335

Query: 199 TGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVE 258
           T L+DMY KCG ++ + AVF E+ D++VVS+TSMIAGY +   A E + LF  M +  + 
Sbjct: 336 T-LLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGIS 395

Query: 259 SNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMI 318
            + +T+ +++N C + R L +GK VH +  +N      F++   +DMY KCG  ++A ++
Sbjct: 396 PDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELV 455

Query: 319 YDELPTIDLVSWTAMIVGYTQARQPNDGLRLF---ADEIRSDLLPNSVTAASVLSACSVS 378
           + E+   D++SW  +I GY++    N+ L LF    +E R    P+  T A VL AC+  
Sbjct: 456 FSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKR--FSPDERTVACVLPACASL 515

Query: 379 GNLNLGMSVHGLGIKIG-LEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMI 438
              + G  +HG  ++ G   +  V N+L+DMYAKC  +  A+ +F  +  KD+++W  MI
Sbjct: 516 SAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMI 575

Query: 439 SGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           +GY  +G   +A+ LFNQMR   +  D I+ VS L AC+  G V
Sbjct: 576 AGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLV 616


HSP 2 Score: 205.7 bits (522), Expect = 6.4e-53
Identity = 132/412 (32.04%), Positives = 211/412 (51.21%), Query Frame = 1

Query: 72  RNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMI 131
           + +D  I+ +G +I   L       +KL  +Y   GD++ A  VFD++       W +++
Sbjct: 114 KEVDNFIRGNGFVIDSNL------GSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILM 173

Query: 132 RWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG-G 191
                +  F   I  + +M  S  E D+  FS + K+ S LR +  G ++H  I+K G G
Sbjct: 174 NELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFG 233

Query: 192 PDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRM 251
             + V   L+  Y K  +V+ +  VF+E+ +++V+SW S+I GYV N  AE+GL +F +M
Sbjct: 234 ERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQM 293

Query: 252 RDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQT 311
             + +E +  T+ S+   C   R +  G+ VH   +K           T LDMY KCG  
Sbjct: 294 LVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDL 353

Query: 312 RDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSAC 371
             A+ ++ E+    +VS+T+MI GY +     + ++LF +     + P+  T  +VL+ C
Sbjct: 354 DSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCC 413

Query: 372 SVSGNLNLGMSVH------GLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKD 431
           +    L+ G  VH       LG  I      V NAL+DMYAKC  + +A  +F  +  KD
Sbjct: 414 ARYRLLDEGKRVHEWIKENDLGFDI-----FVSNALMDMYAKCGSMQEAELVFSEMRVKD 473

Query: 432 VITWNSMISGYAQNGSAYDALRLFN-QMRLYFLAPDVITLVSTLSACATLGA 476
           +I+WN++I GY++N  A +AL LFN  +     +PD  T+   L ACA+L A
Sbjct: 474 IISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSA 514


HSP 3 Score: 163.7 bits (413), Expect = 2.8e-40
Identity = 103/349 (29.51%), Positives = 172/349 (49.28%), Query Frame = 1

Query: 95  CDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSF 154
           C+T L+ +Y   GD+ SA+ VF +M +    ++  MI  Y    L  + +  +  M    
Sbjct: 334 CNT-LLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEG 393

Query: 155 RECDNIIFSIILKACSELREIVEGRKVHCQIVKVG-GPDSFVMTGLIDMYGKCGQVECSS 214
              D    + +L  C+  R + EG++VH  I +   G D FV   L+DMY KCG ++ + 
Sbjct: 394 ISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAE 453

Query: 215 AVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRM-RDALVESNPFTLGSIINACTKL 274
            VF E+  K+++SW ++I GY +N  A E L LFN +  +     +  T+  ++ AC  L
Sbjct: 454 LVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASL 513

Query: 275 RALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMI 334
            A  +G+ +HGY ++N       +A + +DMY KCG    A M++D++ + DLVSWT MI
Sbjct: 514 SAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMI 573

Query: 335 VGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLE 394
            GY       + + LF    ++ +  + ++  S+L ACS SG ++ G         I   
Sbjct: 574 AGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRF----FNIMRH 633

Query: 395 ECVVK------NALIDMYAKCHKIGDAYAIFHGV-LEKDVITWNSMISG 435
           EC ++        ++DM A+   +  AY     + +  D   W +++ G
Sbjct: 634 ECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCG 677


HSP 4 Score: 112.8 bits (281), Expect = 5.7e-25
Identity = 79/290 (27.24%), Positives = 133/290 (45.86%), Query Frame = 1

Query: 54  PSVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSAR 113
           P V  ++  L C     +R +D   + H  +  + L  ++     L+ +Y   G ++ A 
Sbjct: 395 PDVYTVTAVLNC--CARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAE 454

Query: 114 MVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFR-ECDNIIFSIILKACSEL 173
           +VF +M   D  +W  +I  Y  N    + +  +N +    R   D    + +L AC+ L
Sbjct: 455 LVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASL 514

Query: 174 REIVEGRKVHCQIVKVGG-PDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMI 233
               +GR++H  I++ G   D  V   L+DMY KCG +  +  +F++I  K++VSWT MI
Sbjct: 515 SAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMI 574

Query: 234 AGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAE 293
           AGY  +   +E + LFN+MR A +E++  +  S++ AC+     H G    G+   NI  
Sbjct: 575 AGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACS-----HSGLVDEGWRFFNIMR 634

Query: 294 LSSFLATT------FLDMYVKCGQTRDARMIYDELP-TIDLVSWTAMIVG 335
               +  T       +DM  + G    A    + +P   D   W A++ G
Sbjct: 635 HECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCG 677

BLAST of Csa5G189930 vs. NCBI nr
Match: gi|700195421|gb|KGN50598.1| (hypothetical protein Csa_5G189930 [Cucumis sativus])

HSP 1 Score: 963.8 bits (2490), Expect = 1.1e-277
Identity = 476/476 (100.00%), Positives = 476/476 (100.00%), Query Frame = 1

Query: 1   MLAWRLSIQQLGMLQRFPRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFIS 60
           MLAWRLSIQQLGMLQRFPRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFIS
Sbjct: 1   MLAWRLSIQQLGMLQRFPRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFIS 60

Query: 61  LPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMP 120
           LPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMP
Sbjct: 61  LPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMP 120

Query: 121 NPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRK 180
           NPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRK
Sbjct: 121 NPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRK 180

Query: 181 VHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCA 240
           VHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCA
Sbjct: 181 VHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCA 240

Query: 241 EEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTF 300
           EEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTF
Sbjct: 241 EEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTF 300

Query: 301 LDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNS 360
           LDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNS
Sbjct: 301 LDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNS 360

Query: 361 VTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGV 420
           VTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGV
Sbjct: 361 VTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGV 420

Query: 421 LEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           LEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV
Sbjct: 421 LEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 476

BLAST of Csa5G189930 vs. NCBI nr
Match: gi|778708407|ref|XP_011656184.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 945.7 bits (2443), Expect = 3.2e-272
Identity = 467/474 (98.52%), Positives = 469/474 (98.95%), Query Frame = 1

Query: 1   MLAWRLSIQQLGMLQRFPRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFIS 60
           MLAWRLSIQQLGMLQRFPRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFIS
Sbjct: 1   MLAWRLSIQQLGMLQRFPRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFIS 60

Query: 61  LPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMP 120
           LPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMP
Sbjct: 61  LPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMP 120

Query: 121 NPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRK 180
           NPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRK
Sbjct: 121 NPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRK 180

Query: 181 VHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCA 240
           VHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCA
Sbjct: 181 VHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCA 240

Query: 241 EEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTF 300
           EEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTF
Sbjct: 241 EEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTF 300

Query: 301 LDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNS 360
           LDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNS
Sbjct: 301 LDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNS 360

Query: 361 VTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGV 420
           VTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGV
Sbjct: 361 VTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGV 420

Query: 421 LEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLG 475
           LEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVS  +  A +G
Sbjct: 421 LEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSWFTWSAMIG 474

BLAST of Csa5G189930 vs. NCBI nr
Match: gi|778708407|ref|XP_011656184.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 110.2 bits (274), Expect = 1.0e-20
Identity = 87/325 (26.77%), Positives = 147/325 (45.23%), Query Frame = 1

Query: 97  TKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRE 156
           T L+ +YG  G V  +  VF+++ + +  +W  MI  Y  N+   + +  +NRMR +  E
Sbjct: 197 TGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVE 256

Query: 157 CDNIIFSIILKACSELREIVEGRKVHCQIVK-VGGPDSFVMTGLIDMYGKCGQVECSSAV 216
            +      I+ AC++LR + +G+ VH   +K +    SF+ T  +DMY KCGQ   +  +
Sbjct: 257 SNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMI 316

Query: 217 FEEIMDKNVVSWTSMI----------------AGYVQNNCAEEGLVLFNRMRDALVESNP 276
           ++E+   ++VSWT+MI                A  ++++     +   + +    V  N 
Sbjct: 317 YDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGN- 376

Query: 277 FTLGSIINA---------CTKLRALHQ-----GKWVHGYAI-KNIAELSSFLATTFLDMY 336
             LG  ++          C    AL        K    YAI   + E       + +  Y
Sbjct: 377 LNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGY 436

Query: 337 VKCGQTRDARMIYDEL-------PTIDLVSW---TAMIVGYTQARQPNDGLRLFADEIRS 380
            + G   DA  +++++         I LVSW   +AMI GY      +  L +F++ ++ 
Sbjct: 437 AQNGSAYDALRLFNQMRLYFLAPDVITLVSWFTWSAMIGGYGVQGDGSGSLSIFSNMLKE 496


HSP 2 Score: 885.6 bits (2287), Expect = 4.0e-254
Identity = 442/467 (94.65%), Positives = 445/467 (95.29%), Query Frame = 1

Query: 13  MLQRF---PRAFFRLASPLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLPLCCYLMG 72
           MLQRF   PRAF  L   LLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISL  CCYLMG
Sbjct: 1   MLQRFFSLPRAFSHLTGSLLQMGHRMSYSTYASHPPLSDLHQTMPSVQFISLHSCCYLMG 60

Query: 73  LFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKV 132
           LFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMP+PDFYAWKV
Sbjct: 61  LFRNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPDPDFYAWKV 120

Query: 133 MIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVG 192
           MIRWYFLNDLFVDVIPFYN MRMSFRECDNIIFSIILKACSELREI EGRKVHCQIVKVG
Sbjct: 121 MIRWYFLNDLFVDVIPFYNCMRMSFRECDNIIFSIILKACSELREIDEGRKVHCQIVKVG 180

Query: 193 GPDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNR 252
           GPDSFVMTGLIDMYGKC QVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNR
Sbjct: 181 GPDSFVMTGLIDMYGKCRQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNR 240

Query: 253 MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQ 312
           MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNI E SSFLATTFLDMYVKCGQ
Sbjct: 241 MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIVEFSSFLATTFLDMYVKCGQ 300

Query: 313 TRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSA 372
           TRDARMI+DELPTIDLVSWTAMIVGYTQA QPNDGLRLFADEIRSDLLPNSVTAASVLSA
Sbjct: 301 TRDARMIFDELPTIDLVSWTAMIVGYTQASQPNDGLRLFADEIRSDLLPNSVTAASVLSA 360

Query: 373 CSVSGNLNLGMSVHGLGIKIGLEECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWN 432
           CSVSGNLNLGMSVHGLGIK+GLEEC VKNALIDMYAKCHKIGDAY IFHGVLEKDVITWN
Sbjct: 361 CSVSGNLNLGMSVHGLGIKLGLEECAVKNALIDMYAKCHKIGDAYVIFHGVLEKDVITWN 420

Query: 433 SMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           SMISGYAQNGSAYDALRLFNQMR Y LAPD ITLVSTLSA ATLGAV
Sbjct: 421 SMISGYAQNGSAYDALRLFNQMRSYSLAPDAITLVSTLSASATLGAV 467

BLAST of Csa5G189930 vs. NCBI nr
Match: gi|659114052|ref|XP_008456885.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial [Cucumis melo])

HSP 1 Score: 216.9 bits (551), Expect = 7.9e-53
Identity = 134/409 (32.76%), Positives = 217/409 (53.06%), Query Frame = 1

Query: 72  RNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMI 131
           R ID   K H  ++  G   + +  T L+ +YG    V  +  VF+++ + +  +W  MI
Sbjct: 164 REIDEGRKVHCQIVKVGGPDSFVM-TGLIDMYGKCRQVECSSAVFEEIMDKNVVSWTSMI 223

Query: 132 RWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVK-VGG 191
             Y  N+   + +  +NRMR +  E +      I+ AC++LR + +G+ VH   +K +  
Sbjct: 224 AGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIVE 283

Query: 192 PDSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLF-NR 251
             SF+ T  +DMY KCGQ   +  +F+E+   ++VSWT+MI GY Q +   +GL LF + 
Sbjct: 284 FSSFLATTFLDMYVKCGQTRDARMIFDELPTIDLVSWTAMIVGYTQASQPNDGLRLFADE 343

Query: 252 MRDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQ 311
           +R  L+  N  T  S+++AC+    L+ G  VHG  IK   E  + +    +DMY KC +
Sbjct: 344 IRSDLLP-NSVTAASVLSACSVSGNLNLGMSVHGLGIKLGLEECA-VKNALIDMYAKCHK 403

Query: 312 TRDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSA 371
             DA +I+  +   D+++W +MI GY Q     D LRLF       L P+++T  S LSA
Sbjct: 404 IGDAYVIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRSYSLAPDAITLVSTLSA 463

Query: 372 CSVSGNLNLGMSVHGLGIKIGL--EECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVIT 431
            +  G + +G S+H   +K GL      +  AL++ YAKC     A  +F  + +K++IT
Sbjct: 464 SATLGAVQVGSSLHAYSVKEGLFSSNLYIGTALLNFYAKCGDAKSARTVFDSMGDKNIIT 523

Query: 432 WNSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           W++MI GY   G    +L +F+ M    L P+ +   + LSAC++ G V
Sbjct: 524 WSAMIGGYGVQGDGSGSLSIFSDMLKEDLKPNEVIFTTILSACSSSGMV 569


HSP 2 Score: 578.2 bits (1489), Expect = 1.3e-161
Identity = 276/443 (62.30%), Positives = 351/443 (79.23%), Query Frame = 1

Query: 35  MSYSTYASHP-PLSDLHQTMPSVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNL 94
           +SY+T   HP     + +T+ S+  ISL  C  L+GL RNID+L K H L +++G+ G+L
Sbjct: 29  LSYTT--DHPLEYPSMDRTLASMHSISLNPCFALLGLCRNIDSLKKVHALFVINGIKGDL 88

Query: 95  LCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMS 154
           LCDTKLV +YG  G +  AR++FDQ+P+PDFY+WKVMIRWYFLNDL +++I FY RMRMS
Sbjct: 89  LCDTKLVSLYGLFGHIGCARLMFDQIPDPDFYSWKVMIRWYFLNDLCMEIIGFYARMRMS 148

Query: 155 FRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSS 214
            R CDN++FS++LKACSE+R+I EGRKVHCQIVK G PDSFV TGL+DMY KCG++ECS 
Sbjct: 149 VRMCDNVVFSVVLKACSEMRDIDEGRKVHCQIVKAGNPDSFVQTGLVDMYAKCGEIECSR 208

Query: 215 AVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLR 274
            VF EI+D+NVVSWTSMIAGYVQN+CAE+ LVLFNRMR+A+VE N FTLGS++ AC KL 
Sbjct: 209 KVFSEIIDRNVVSWTSMIAGYVQNDCAEDALVLFNRMREAMVEGNEFTLGSLVTACGKLG 268

Query: 275 ALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIV 334
           ALHQGKWVHGY IKN  EL+S+  TT LDMYVKCG  RDAR ++DEL ++DLVSWTAMIV
Sbjct: 269 ALHQGKWVHGYVIKNGIELNSYSVTTLLDMYVKCGSIRDARSVFDELSSVDLVSWTAMIV 328

Query: 335 GYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEE 394
           GY+Q+  P++ L+LF D+    +LPN+VT AS+LSAC+   NL+ G  VH LGI++GL++
Sbjct: 329 GYSQSGFPDEALKLFIDKKWFGILPNAVTIASLLSACAQLSNLSFGRLVHALGIQLGLKD 388

Query: 395 CVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRL 454
             V NAL+DMYAKC  IGDA  IF  V +K++I WNS+ISGY+QNGSAY+A  LF+QMR 
Sbjct: 389 STVINALVDMYAKCGMIGDARYIFETVSDKNIIAWNSIISGYSQNGSAYEAFELFHQMRS 448

Query: 455 YFLAPDVITLVSTLSACATLGAV 477
             ++PD +T+VS  SACA+LGA+
Sbjct: 449 KSVSPDAVTVVSIFSACASLGAL 469

BLAST of Csa5G189930 vs. NCBI nr
Match: gi|590699556|ref|XP_007045956.1| (Pentatricopeptide repeat superfamily protein [Theobroma cacao])

HSP 1 Score: 226.1 bits (575), Expect = 1.3e-55
Identity = 132/408 (32.35%), Positives = 218/408 (53.43%), Query Frame = 1

Query: 72  RNIDTLIKFHGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMI 131
           R+ID   K H  ++  G   + +  T LV +Y   G++  +R VF ++ + +  +W  MI
Sbjct: 166 RDIDEGRKVHCQIVKAGNPDSFV-QTGLVDMYAKCGEIECSRKVFSEIIDRNVVSWTSMI 225

Query: 132 RWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGP 191
             Y  ND   D +  +NRMR +  E +      ++ AC +L  + +G+ VH  ++K G  
Sbjct: 226 AGYVQNDCAEDALVLFNRMREAMVEGNEFTLGSLVTACGKLGALHQGKWVHGYVIKNGIE 285

Query: 192 -DSFVMTGLIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRM 251
            +S+ +T L+DMY KCG +  + +VF+E+   ++VSWT+MI GY Q+   +E L LF   
Sbjct: 286 LNSYSVTTLLDMYVKCGSIRDARSVFDELSSVDLVSWTAMIVGYSQSGFPDEALKLFIDK 345

Query: 252 RDALVESNPFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQT 311
           +   +  N  T+ S+++AC +L  L  G+ VH   I+ +    S +    +DMY KCG  
Sbjct: 346 KWFGILPNAVTIASLLSACAQLSNLSFGRLVHALGIQ-LGLKDSTVINALVDMYAKCGMI 405

Query: 312 RDARMIYDELPTIDLVSWTAMIVGYTQARQPNDGLRLFADEIRSDLLPNSVTAASVLSAC 371
            DAR I++ +   ++++W ++I GY+Q     +   LF       + P++VT  S+ SAC
Sbjct: 406 GDARYIFETVSDKNIIAWNSIISGYSQNGSAYEAFELFHQMRSKSVSPDAVTVVSIFSAC 465

Query: 372 SVSGNLNLGMSVHGLGIKIGL--EECVVKNALIDMYAKCHKIGDAYAIFHGVLEKDVITW 431
           +  G L +G S+H    K GL      V  A+++ YAK      A AIF  + EK+ +TW
Sbjct: 466 ASLGALQVGSSLHAYSTKGGLLSSSVYVGTAVLNFYAKSGDSKSARAIFDSMGEKNTVTW 525

Query: 432 NSMISGYAQNGSAYDALRLFNQMRLYFLAPDVITLVSTLSACATLGAV 477
           ++MI GY   G +  +L LFN M    + P+ +   + LSAC   G++
Sbjct: 526 SAMIGGYGIQGDSSGSLALFNDMVKENVEPNEVIFTTILSACGHTGSL 571


HSP 2 Score: 92.0 bits (227), Expect = 2.9e-15
Identity = 66/248 (26.61%), Positives = 116/248 (46.77%), Query Frame = 1

Query: 81  HGLLIVHGLIGNLLCDTKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLF 140
           H L I  GL  + + +  LV +Y   G +  AR +F+ + + +  AW  +I  Y  N   
Sbjct: 376 HALGIQLGLKDSTVINA-LVDMYAKCGMIGDARYIFETVSDKNIIAWNSIISGYSQNGSA 435

Query: 141 VDVIPFYNRMRMSFRECDNIIFSIILKACSELREIVEGRKVHCQIVKVGGPDS--FVMTG 200
            +    +++MR      D +    I  AC+ L  +  G  +H    K G   S  +V T 
Sbjct: 436 YEAFELFHQMRSKSVSPDAVTVVSIFSACASLGALQVGSSLHAYSTKGGLLSSSVYVGTA 495

Query: 201 LIDMYGKCGQVECSSAVFEEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESN 260
           +++ Y K G  + + A+F+ + +KN V+W++MI GY     +   L LFN M    VE N
Sbjct: 496 VLNFYAKSGDSKSARAIFDSMGEKNTVTWSAMIGGYGIQGDSSGSLALFNDMVKENVEPN 555

Query: 261 PFTLGSIINACTKLRALHQGKWVHGYAIKNIAELSSFLAT-----TFLDMYVKCGQTRDA 320
                +I++AC    +L +G W +     ++ +  +F+ +       +DM  + G+  +A
Sbjct: 556 EVIFTTILSACGHTGSLGEG-WKY---FNSMCQDYNFVPSMKHYACMVDMLARAGRLEEA 615

Query: 321 RMIYDELP 322
               D+LP
Sbjct: 616 WDFIDKLP 618


HSP 3 Score: 48.1 bits (113), Expect = 4.9e-02
Identity = 41/160 (25.62%), Positives = 67/160 (41.88%), Query Frame = 1

Query: 55  SVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGN-LLCDTKLVGVYGALGDVRSAR 114
           +V  +S+   C  +G  +   +L   H      GL+ + +   T ++  Y   GD +SAR
Sbjct: 453 AVTVVSIFSACASLGALQVGSSL---HAYSTKGGLLSSSVYVGTAVLNFYAKSGDSKSAR 512

Query: 115 MVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRECDNIIFSIILKACSELR 174
            +FD M   +   W  MI  Y +       +  +N M     E + +IF+ IL AC    
Sbjct: 513 AIFDSMGEKNTVTWSAMIGGYGIQGDSSGSLALFNDMVKENVEPNEVIFTTILSACGHTG 572

Query: 175 EIVEGRKVH---CQIVKVGGPDSFVMTGLIDMYGKCGQVE 211
            + EG K     CQ      P       ++DM  + G++E
Sbjct: 573 SLGEGWKYFNSMCQDYNF-VPSMKHYACMVDMLARAGRLE 608


HSP 4 Score: 574.7 bits (1480), Expect = 1.5e-160
Identity = 274/438 (62.56%), Positives = 346/438 (79.00%), Query Frame = 1

Query: 37  YSTYASHPPLSDLHQTMPSVQFISLPLCCYLMGLFRNIDTLIKFHGLLIVHGLIGNLLCD 96
           Y   +S PP  DL +T+ S + +    C  L+ L RNID+L K H LL++HGL  +LLC 
Sbjct: 31  YQLPSSEPP--DLSETLASTRSVFSNPCFNLLVLCRNIDSLKKVHSLLVLHGLSDDLLCR 90

Query: 97  TKLVGVYGALGDVRSARMVFDQMPNPDFYAWKVMIRWYFLNDLFVDVIPFYNRMRMSFRE 156
           TKL+ +YG+ G V+ AR++FDQMP+PDFY+WKVM+RWYF+++L+ +V+ FY  MR+  RE
Sbjct: 91  TKLISLYGSFGYVKCARLLFDQMPSPDFYSWKVMLRWYFMHNLYAEVMGFYTHMRICVRE 150

Query: 157 CDNIIFSIILKACSELREIVEGRKVHCQIVKVGGPDSFVMTGLIDMYGKCGQVECSSAVF 216
            DN++FSI+LKACSELR+  EGRKVHCQ+VKV  PDSFV+TGL+D+Y KCG +ECS AVF
Sbjct: 151 HDNVVFSIVLKACSELRDFNEGRKVHCQVVKVASPDSFVLTGLVDVYAKCGWIECSRAVF 210

Query: 217 EEIMDKNVVSWTSMIAGYVQNNCAEEGLVLFNRMRDALVESNPFTLGSIINACTKLRALH 276
           + I+D+NVV WTSMI GYVQN+C ++GLVLFNRMR+ L++ N FTLGS++ ACTKLRALH
Sbjct: 211 DGIVDRNVVCWTSMIVGYVQNDCPQDGLVLFNRMREELIKGNQFTLGSVLTACTKLRALH 270

Query: 277 QGKWVHGYAIKNIAELSSFLATTFLDMYVKCGQTRDARMIYDELPTIDLVSWTAMIVGYT 336
           QGKW+HG+ IK   E+SSFL T+ LDMYVKCG  R AR I+DELP IDLVSWTAMIVGYT
Sbjct: 271 QGKWIHGHLIKTGIEVSSFLVTSLLDMYVKCGDIRYARSIFDELPAIDLVSWTAMIVGYT 330

Query: 337 QARQPNDGLRLFADEIRSDLLPNSVTAASVLSACSVSGNLNLGMSVHGLGIKIGLEECVV 396
           Q+  P++ L+LF DE    LLPNS+T ASVLS+C+ S NLNLG S+HGLGIK+GLE+  V
Sbjct: 331 QSGCPDEALKLFTDEKWVGLLPNSITTASVLSSCAQSYNLNLGRSIHGLGIKLGLEDSTV 390

Query: 397 KNALIDMYAKCHKIGDAYAIFHGVLEKDVITWNSMISGYAQNGSAYDALRLFNQMRLYFL 456
           +NAL+DMYAKCH IGDA  IF  +L+K+VI WNS+ISGY+QNGSA +AL+LF+QMR    
Sbjct: 391 RNALVDMYAKCHMIGDARYIFETILDKNVIAWNSIISGYSQNGSACEALQLFHQMRSESF 450

Query: 457 APDVITLVSTLSACATLG 475
           + D  TL S LSAC TLG
Sbjct: 451 SHDAFTLASVLSACTTLG 466

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
PP146_ARATH  1.2e-117  50.24         Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidop...
PP224_ARATH  6.9e-65   33.66         Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN...
PP319_ARATH  1.0e-60   34.68         Pentatricopeptide repeat-containing protein At4g18520 OS=Arabidopsis thaliana GN...
PP181_ARATH  1.0e-60   32.86         Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana GN...
PP320_ARATH  1.8e-60   34.65         Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...

Match Name        E-value   Identity (%)  Description
A0A0A0KLZ1_CUCSA  8.0e-278  100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_5G189930 PE=4 SV=1
A0A061EAZ8_THECC  9.4e-162  62.30         Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_011604 PE...
A0A061EAZ8_THECC  9.1e-56   32.35         Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_011604 PE...
A5AY98_VITVI      1.3e-57   35.86         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00900 PE=4 SV=...
A0A067L9V5_JATCU  1.4e-59   36.72         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_15672 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT2G03380.1  6.7e-119  50.24         Pentatricopeptide repeat (PPR) superfamily protein
AT3G12770.1  3.9e-66   33.66         mitochondrial editing factor 22
AT4G18520.1  5.8e-62   34.68         Pentatricopeptide repeat (PPR) superfamily protein
AT2G33680.1  5.8e-62   32.86         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1  9.9e-62   34.65         Pentatricopeptide repeat (PPR) superfamily protein

Match Name                        E-value   Identity (%)  Description
gi|700195421|gb|KGN50598.1|       1.1e-277  100.00        hypothetical protein Csa_5G189930 [Cucumis sativus]
gi|778708407|ref|XP_011656184.1|  3.2e-272  98.52         PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial-...
gi|778708407|ref|XP_011656184.1|  1.0e-20   26.77         PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial-...
gi|659114052|ref|XP_008456885.1|  7.9e-53   32.76         PREDICTED: pentatricopeptide repeat-containing protein At2g03380, mitochondrial ...
gi|590699556|ref|XP_007045956.1|  1.3e-55   32.35         Pentatricopeptide repeat superfamily protein [Theobroma cacao]
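
The E-value and identity columns in these summary tables repeat the figures printed in the per-HSP header lines of the alignments. As a minimal sketch (not part of the database's own tooling), the Python below parses one such header pair and recomputes the identity percentage from the match counts. The function name `parse_hsp` and the returned field names are illustrative assumptions, and the sample input is the first HSP reported against AT2G33680.1.

```python
import re

# Patterns for the two header lines that precede each HSP alignment block.
HSP_SCORE = re.compile(
    r"HSP\s+(\d+)\s+Score:\s+([\d.]+)\s+bits\s+\((\d+)\),\s+Expect\s+=\s+(\S+)"
)
HSP_IDENT = re.compile(r"Identity\s+=\s+(\d+)/(\d+)\s+\(([\d.]+)%\)")


def parse_hsp(score_line, identity_line):
    """Return the numeric fields of one HSP header as a dict (hypothetical helper)."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    identical, aligned = int(i.group(1)), int(i.group(2))
    return {
        "hsp": int(s.group(1)),
        "bits": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "evalue": float(s.group(4)),
        "identical": identical,
        "aligned": aligned,
        # Recomputed from the counts, to compare with the printed percentage.
        "pct_identity": round(100.0 * identical / aligned, 2),
        "pct_reported": float(i.group(3)),
    }


hsp = parse_hsp(
    "HSP 1 Score: 235.7 bits (600), Expect = 5.8e-62",
    "Identity = 140/426 (32.86%), Positives = 216/426 (50.70%), Query Frame = 1",
)
print(hsp["evalue"], hsp["pct_identity"], hsp["pct_reported"])
```

For this HSP the recomputed value agrees with the printed 32.86%, which is also the identity listed for AT2G33680.1 in the summary table.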
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa5G189930.1  Csa5G189930.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description           Source    Source Term     Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535         PPR
    coord: 298..320, score: 0.63
    coord: 326..348, score: 0
IPR002885  Pentatricopeptide repeat  PFAM      PF13041         PPR_2
    coord: 423..471, score: 1.4E-9
    coord: 222..270, score: 4.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756       TIGR00756
    coord: 225..252, score: 4.6E-5
    coord: 426..459, score: 1.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375         PPR
    coord: 424..458, score: 11.674
    coord: 324..358, score: 9.482
    coord: 393..423, score: 6.314
    coord: 158..188, score: 5.853
    coord: 293..323, score: 6.171
    coord: 192..222, score: 7.53
    coord: 258..288, score: 5.36
    coord: 92..126, score: 6.095
    coord: 223..257, score: 9
None       No IPR available          PANTHER   PTHR24015       FAMILY NOT NAMED
    coord: 79..359, score: 4.1E-194
    coord: 394..475, score: 4.1E-194
    coord: 19..57, score: 4.1E
None       No IPR available          PANTHER   PTHR24015:SF51  SUBFAMILY NOT NAMED
    coord: 79..359, score: 4.1E-194
    coord: 394..475, score: 4.1E-194
    coord: 19..57, score: 4.1E
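
The alignment coordinates above are inclusive residue positions on the 476-residue protein. As a rough illustration only, the sketch below pulls the `coord:` ranges for the PS51375 (PPR profile) hits out of the raw text, sorts them by start position and reports each span's length. The `spans` helper and the `raw` string are assumptions made for this example, with `raw` copied from the PS51375 entries in this annotation.

```python
import re

# Matches "coord: START..END" wherever it occurs, even when fused to other text.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def spans(text):
    """Return (start, end) pairs found in text, sorted by start (hypothetical helper)."""
    return sorted((int(a), int(b)) for a, b in COORD.findall(text))


# PS51375 entries as they appear in the annotation, scores omitted.
raw = (
    "coord: 424..458 coord: 324..358 coord: 393..423 coord: 158..188 "
    "coord: 293..323 coord: 192..222 coord: 258..288 coord: 92..126 "
    "coord: 223..257"
)

ppr = spans(raw)
lengths = [end - start + 1 for start, end in ppr]  # inclusive coordinates
for (start, end), n in zip(ppr, lengths):
    print(f"{start}..{end}\t{n} aa")
```

Every span comes out at 31 or 35 residues, in line with the roughly 35-residue length of a pentatricopeptide repeat motif.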

The following gene(s) are orthologous to this gene:
Gene         Orthologue    Organism                   Block
Csa5G189930  CSPI05G10120  Wild cucumber (PI 183967)  cpicuB240
The following gene(s) are paralogous to this gene:

None