Csa2G139850 (gene) Cucumber (Chinese Long) v2

NameCsa2G139850
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionPentatricopeptide repeat-containing protein; contains IPR002885 (Pentatricopeptide repeat), IPR011990 (Tetratricopeptide-like helical)
LocationChr2 : 8670907 .. 8673275 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCGCTCTTATAAAACGGCAGAAACAGGCATCGAAAATTTTTGGAGTGCTCAAAATCCCATATCAATAGATTCATTACTTCAATGATTTTTAATCTATGCTTCTTCGAAGGAATGGATCTGGAAGTAACATAATGAAAGATCTCCACGTTCTTTTCAACCCAAGGATCGCTTTTTTCAGTTCAATGTTTTCTTCATCATCACCTCCGATTTCATTTCTGGAAACCCATTTCATCGATCTAATTCATGCGTCCAATTCAACCCACAAGCTCCGTCAGATCCATGGTCAACTCTATCGCTGCAACGTCTTCTCCAGCAGCCGGGTTGTGACCCAGTTCATCTCTTCCTGTTCTTCGCTAAATTCTGTCGACTATGCCATTTCCATCTTTCAGCGGTTCGAGTTGAAGAACAGTTACCTTTTTAATGCGTTAATTCGAGGACTCGCTGAAAATTCCAGGTTCGAGAGCTCAATTTCTTTCTTTGTTTTGATGCTAAAGTGGAAAATTAGCCCTGATAGGCTTACTTTTCCGTTTGTGCTGAAATCAGCGGCGGCTCTTTCCAATGGTGGCGTTGGGAGGGCTTTGCATTGTGGGATTTTGAAGTTTGGTCTTGAGTTTGATTCTTTTGTGAGGGTTTCGTTGGTTGATATGTACGTGAAAGTTGAGGAGTTGGGTTCTGCCTTGAAGGTGTTTGATGAAAGTCCTGAGAGTGTTAAGAATGGAAGTGTGTTGATTTGGAATGTTCTTATTCATGGGTATTGTAGAATGGGGGATTTAGTAAAAGCTACGGAGCTATTCGACTCAATGCCGAAGAAGGACACGGGATCTTGGAACAGTTTGATCAATGGTTTCATGAAAATGGGGGACATGGGTCGAGCAAAGGAACTGTTTGTTAAAATGCCTGAAAAAAATGTTGTTTCTTGGACCACAATGGTGAATGGATTTTCACAGAATGGAGACCCTGAAAAGGCACTGGAAACTTTCTTTTGTATGCTTGAAGAAGGCGCTCGGCCAAATGATTACACAATTGTCTCCGCACTTTCAGCTTGTGCAAAAATTGGTGCATTAGATGCTGGTTTAAGGATTCATAATTATCTTTCAGGCAATGGTTTCAAACTAAATCTAGTTATTGGAACTGCTCTGGTGGATATGTATGCAAAATGTGGAAATATTGAGCATGCAGAAAAAGTGTTTCATGAAACAAAAGAAAAGGGCCTTCTTATTTGGAGTGTTATGATCTGGGGCTGGGCAATCCATGGACATTTTAGGAAAGCCTTACAATACTTTGAATGGATGAAGTTTACAGGTTTGATTTCATATTGTGATTGTTATCTCTGAGTTTAATATTTTGCTGTTTATAAACTCAAATCTCAGCATGTTTGCAGGAACAAAGCCAGATAGTGTGGTCTTTCTTGCTGTTCTTAACGCATGCTCCCATTCTGGACAAGTAAACGAAGGACTTAAGTTTTTCGACAATATGAGGCGCGGCTACTTGATTGAACCTTCTATGAAGCATTACACATTGGTTGTAGACATGCTAGGCAGGGCTGGCAGACTAGATGAAGCTCTAAAGTTCATACGTGCGATGCCCATAACTCCTGATTTTGTGGTGTGGGGTGCTCTATTTTGTGCATGTAGGACTCATAAGAATGTTGAAATGGCAGAACTAGCATCCAAAAAGCTTCTTCAGCTTGAACCCAAGCATCCGGGGAGTTATGTATTTTTGTCAAACGCATATGCTTCAGTAGGGAGATGGGACGATGCGGAGAGAGTGAGAGTTTCTATGCGAGATCACGGTGCACACAAAGATCCAGGATGGAGCTTTATTGAAGTGGATCATAAATTACATAGATTTGTGGCTGGAGATAACACTCATAACCGTGCTGTAGAGATATACTCAAAATTAGATGAGATAAGTGCAAGTGCAAGGGAAAAGGGATACACGAAAGAAATTGAGTGCGTACTTCATAATATTGAAGAGGAAGAAAAGGAAGAAGCATTGGGATATCACAGTGAGAAGTTGGCACTTGCTTTTGGGATCGTCAGTACGAGGCCCGGAACGACTGTTAGGATAGTGAAAAACCTTAGAGTTTGTGTGGATTGCCATTCTTTCATGAAATATGCCAGTAAAATGAGTAAGAGGGAGATCATTTTGAGGGATATGAAGCGATTCCATCATTTCAATGATGGTGTTTGTTCATGTGGAGATTATTGGTAAAGAATTGTTGATCAGAGAGTGGAACTGCGGATGCTTTCAAGAACACATACAAACTAACAATTGCAGCCTCTTTGCATTAACGGTTTGATGCCCTAAAAACAATAATGGCTTCTGAGAAGTAAAAGATCCCTTAATCATCCAAGTTCTTGACTGCC

mRNA sequence

ATGCTTCTTCGAAGGAATGGATCTGGAAGTAACATAATGAAAGATCTCCACGTTCTTTTCAACCCAAGGATCGCTTTTTTCAGTTCAATGTTTTCTTCATCATCACCTCCGATTTCATTTCTGGAAACCCATTTCATCGATCTAATTCATGCGTCCAATTCAACCCACAAGCTCCGTCAGATCCATGGTCAACTCTATCGCTGCAACGTCTTCTCCAGCAGCCGGGTTGTGACCCAGTTCATCTCTTCCTGTTCTTCGCTAAATTCTGTCGACTATGCCATTTCCATCTTTCAGCGGTTCGAGTTGAAGAACAGTTACCTTTTTAATGCGTTAATTCGAGGACTCGCTGAAAATTCCAGGTTCGAGAGCTCAATTTCTTTCTTTGTTTTGATGCTAAAGTGGAAAATTAGCCCTGATAGGCTTACTTTTCCGTTTGTGCTGAAATCAGCGGCGGCTCTTTCCAATGGTGGCGTTGGGAGGGCTTTGCATTGTGGGATTTTGAAGTTTGGTCTTGAGTTTGATTCTTTTGTGAGGGTTTCGTTGGTTGATATGTACGTGAAAGTTGAGGAGTTGGGTTCTGCCTTGAAGGTGTTTGATGAAAGTCCTGAGAGTGTTAAGAATGGAAGTGTGTTGATTTGGAATGTTCTTATTCATGGGTATTGTAGAATGGGGGATTTAGTAAAAGCTACGGAGCTATTCGACTCAATGCCGAAGAAGGACACGGGATCTTGGAACAGTTTGATCAATGGTTTCATGAAAATGGGGGACATGGGTCGAGCAAAGGAACTGTTTGTTAAAATGCCTGAAAAAAATGTTGTTTCTTGGACCACAATGGTGAATGGATTTTCACAGAATGGAGACCCTGAAAAGGCACTGGAAACTTTCTTTTGTATGCTTGAAGAAGGCGCTCGGCCAAATGATTACACAATTGTCTCCGCACTTTCAGCTTGTGCAAAAATTGGTGCATTAGATGCTGGTTTAAGGATTCATAATTATCTTTCAGGCAATGGTTTCAAACTAAATCTAGTTATTGGAACTGCTCTGGTGGATATGTATGCAAAATGTGGAAATATTGAGCATGCAGAAAAAGTGTTTCATGAAACAAAAGAAAAGGGCCTTCTTATTTGGAGTGTTATGATCTGGGGCTGGGCAATCCATGGACATTTTAGGAAAGCCTTACAATACTTTGAATGGATGAAGTTTACAGGAACAAAGCCAGATAGTGTGGTCTTTCTTGCTGTTCTTAACGCATGCTCCCATTCTGGACAAGTAAACGAAGGACTTAAGTTTTTCGACAATATGAGGCGCGGCTACTTGATTGAACCTTCTATGAAGCATTACACATTGGTTGTAGACATGCTAGGCAGGGCTGGCAGACTAGATGAAGCTCTAAAGTTCATACGTGCGATGCCCATAACTCCTGATTTTGTGGTGTGGGGTGCTCTATTTTGTGCATGTAGGACTCATAAGAATGTTGAAATGGCAGAACTAGCATCCAAAAAGCTTCTTCAGCTTGAACCCAAGCATCCGGGGAGTTATGTATTTTTGTCAAACGCATATGCTTCAGTAGGGAGATGGGACGATGCGGAGAGAGTGAGAGTTTCTATGCGAGATCACGGTGCACACAAAGATCCAGGATGGAGCTTTATTGAAGTGGATCATAAATTACATAGATTTGTGGCTGGAGATAACACTCATAACCGTGCTGTAGAGATATACTCAAAATTAGATGAGATAAGTGCAAGTGCAAGGGAAAAGGGATACACGAAAGAAATTGAGTGCGTACTTCATAATATTGAAGAGGAAGAAAAGGAAGAAGCATTGGGATATCACAGTGAGAAGTTGGCACTTGCTTTTGGGATCGTCAGTACGAGGCCCGGAACGACTGTTAGGATAGTGAAAAACCTTAGAGTTTGTGTGGATTGCCATTCTTTCATGAAATATGCCAGTAAAATGAGTAAGAGGGAGATCATTTTGAGGGATATGAAGCGATTCCATCATTTCAATGATGGTGTTTGTTCATGTGGAGATTATTGGTAA

Coding sequence (CDS)

ATGCTTCTTCGAAGGAATGGATCTGGAAGTAACATAATGAAAGATCTCCACGTTCTTTTCAACCCAAGGATCGCTTTTTTCAGTTCAATGTTTTCTTCATCATCACCTCCGATTTCATTTCTGGAAACCCATTTCATCGATCTAATTCATGCGTCCAATTCAACCCACAAGCTCCGTCAGATCCATGGTCAACTCTATCGCTGCAACGTCTTCTCCAGCAGCCGGGTTGTGACCCAGTTCATCTCTTCCTGTTCTTCGCTAAATTCTGTCGACTATGCCATTTCCATCTTTCAGCGGTTCGAGTTGAAGAACAGTTACCTTTTTAATGCGTTAATTCGAGGACTCGCTGAAAATTCCAGGTTCGAGAGCTCAATTTCTTTCTTTGTTTTGATGCTAAAGTGGAAAATTAGCCCTGATAGGCTTACTTTTCCGTTTGTGCTGAAATCAGCGGCGGCTCTTTCCAATGGTGGCGTTGGGAGGGCTTTGCATTGTGGGATTTTGAAGTTTGGTCTTGAGTTTGATTCTTTTGTGAGGGTTTCGTTGGTTGATATGTACGTGAAAGTTGAGGAGTTGGGTTCTGCCTTGAAGGTGTTTGATGAAAGTCCTGAGAGTGTTAAGAATGGAAGTGTGTTGATTTGGAATGTTCTTATTCATGGGTATTGTAGAATGGGGGATTTAGTAAAAGCTACGGAGCTATTCGACTCAATGCCGAAGAAGGACACGGGATCTTGGAACAGTTTGATCAATGGTTTCATGAAAATGGGGGACATGGGTCGAGCAAAGGAACTGTTTGTTAAAATGCCTGAAAAAAATGTTGTTTCTTGGACCACAATGGTGAATGGATTTTCACAGAATGGAGACCCTGAAAAGGCACTGGAAACTTTCTTTTGTATGCTTGAAGAAGGCGCTCGGCCAAATGATTACACAATTGTCTCCGCACTTTCAGCTTGTGCAAAAATTGGTGCATTAGATGCTGGTTTAAGGATTCATAATTATCTTTCAGGCAATGGTTTCAAACTAAATCTAGTTATTGGAACTGCTCTGGTGGATATGTATGCAAAATGTGGAAATATTGAGCATGCAGAAAAAGTGTTTCATGAAACAAAAGAAAAGGGCCTTCTTATTTGGAGTGTTATGATCTGGGGCTGGGCAATCCATGGACATTTTAGGAAAGCCTTACAATACTTTGAATGGATGAAGTTTACAGGAACAAAGCCAGATAGTGTGGTCTTTCTTGCTGTTCTTAACGCATGCTCCCATTCTGGACAAGTAAACGAAGGACTTAAGTTTTTCGACAATATGAGGCGCGGCTACTTGATTGAACCTTCTATGAAGCATTACACATTGGTTGTAGACATGCTAGGCAGGGCTGGCAGACTAGATGAAGCTCTAAAGTTCATACGTGCGATGCCCATAACTCCTGATTTTGTGGTGTGGGGTGCTCTATTTTGTGCATGTAGGACTCATAAGAATGTTGAAATGGCAGAACTAGCATCCAAAAAGCTTCTTCAGCTTGAACCCAAGCATCCGGGGAGTTATGTATTTTTGTCAAACGCATATGCTTCAGTAGGGAGATGGGACGATGCGGAGAGAGTGAGAGTTTCTATGCGAGATCACGGTGCACACAAAGATCCAGGATGGAGCTTTATTGAAGTGGATCATAAATTACATAGATTTGTGGCTGGAGATAACACTCATAACCGTGCTGTAGAGATATACTCAAAATTAGATGAGATAAGTGCAAGTGCAAGGGAAAAGGGATACACGAAAGAAATTGAGTGCGTACTTCATAATATTGAAGAGGAAGAAAAGGAAGAAGCATTGGGATATCACAGTGAGAAGTTGGCACTTGCTTTTGGGATCGTCAGTACGAGGCCCGGAACGACTGTTAGGATAGTGAAAAACCTTAGAGTTTGTGTGGATTGCCATTCTTTCATGAAATATGCCAGTAAAATGAGTAAGAGGGAGATCATTTTGAGGGATATGAAGCGATTCCATCATTTCAATGATGGTGTTTGTTCATGTGGAGATTATTGGTAA

Protein sequence

MLLRRNGSGSNIMKDLHVLFNPRIAFFSSMFSSSSPPISFLETHFIDLIHASNSTHKLRQIHGQLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHKDPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMKRFHHFNDGVCSCGDYW*
BLAST of Csa2G139850 vs. Swiss-Prot
Match: PPR10_ARATH (Pentatricopeptide repeat-containing protein At1g04840 OS=Arabidopsis thaliana GN=PCMP-H64 PE=2 SV=1)

HSP 1 Score: 729.2 bits (1881), Expect = 4.2e-209
Identity = 361/670 (53.88%), Postives = 470/670 (70.15%), Query Frame = 1

Query: 13  MKDLHVLFNPRIAFFSSMFSS---SSPPISFLETHFIDLIHASNSTHKLRQIHGQLYRCN 72
           MK L V+F P+ +     F +   +SP     E+HFI LIHA   T  LR +H Q+ R  
Sbjct: 1   MKSLSVIFKPKSSPAKIYFPADRQASPD----ESHFISLIHACKDTASLRHVHAQILRRG 60

Query: 73  VFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFV 132
           V SS RV  Q +S  S L S DY++SIF+  E +N ++ NALIRGL EN+RFESS+  F+
Sbjct: 61  VLSS-RVAAQLVSCSSLLKSPDYSLSIFRNSEERNPFVLNALIRGLTENARFESSVRHFI 120

Query: 133 LMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVE 192
           LML+  + PDRLTFPFVLKS + L    +GRALH   LK  ++ DSFVR+SLVDMY K  
Sbjct: 121 LMLRLGVKPDRLTFPFVLKSNSKLGFRWLGRALHAATLKNFVDCDSFVRLSLVDMYAKTG 180

Query: 193 ELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWNSLIN 252
           +L  A +VF+ESP+ +K  S+LIWNVLI+GYCR  D+  AT LF SMP++++GSW++LI 
Sbjct: 181 QLKHAFQVFEESPDRIKKESILIWNVLINGYCRAKDMHMATTLFRSMPERNSGSWSTLIK 240

Query: 253 GFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYT 312
           G++  G++ RAK+LF  MPEKNVVSWTT++NGFSQ GD E A+ T+F MLE+G +PN+YT
Sbjct: 241 GYVDSGELNRAKQLFELMPEKNVVSWTTLINGFSQTGDYETAISTYFEMLEKGLKPNEYT 300

Query: 313 IVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETK 372
           I + LSAC+K GAL +G+RIH Y+  NG KL+  IGTALVDMYAKCG ++ A  VF    
Sbjct: 301 IAAVLSACSKSGALGSGIRIHGYILDNGIKLDRAIGTALVDMYAKCGELDCAATVFSNMN 360

Query: 373 EKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVNEGLK 432
            K +L W+ MI GWA+HG F +A+Q F  M ++G KPD VVFLAVL AC +S +V+ GL 
Sbjct: 361 HKDILSWTAMIQGWAVHGRFHQAIQCFRQMMYSGEKPDEVVFLAVLTACLNSSEVDLGLN 420

Query: 433 FFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALFCACRTH 492
           FFD+MR  Y IEP++KHY LVVD+LGRAG+L+EA + +  MPI PD   W AL+ AC+ H
Sbjct: 421 FFDSMRLDYAIEPTLKHYVLVVDLLGRAGKLNEAHELVENMPINPDLTTWAALYRACKAH 480

Query: 493 KNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHKDPGWSF 552
           K    AE  S+ LL+L+P+  GSY+FL   +AS G   D E+ R+S++     +  GWS+
Sbjct: 481 KGYRRAESVSQNLLELDPELCGSYIFLDKTHASKGNIQDVEKRRLSLQKRIKERSLGWSY 540

Query: 553 IEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEEKEEALG 612
           IE+D +L++F AGD +H    EI  KLDEI + A +KGY    +  +H+IEEEEKE   G
Sbjct: 541 IELDGQLNKFSAGDYSHKLTQEIGLKLDEIISLAIQKGYNPGADWSIHDIEEEEKENVTG 600

Query: 613 YHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMKRFHHFN 672
            HSEKLAL  G + T PGTT+RI+KNLR+C DCHS MKY SK+S+R+I+LRD ++FHHF 
Sbjct: 601 IHSEKLALTLGFLRTAPGTTIRIIKNLRICGDCHSLMKYVSKISQRDILLRDARQFHHFK 660

Query: 673 DGVCSCGDYW 680
           DG CSCGDYW
Sbjct: 661 DGRCSCGDYW 665

BLAST of Csa2G139850 vs. Swiss-Prot
Match: PP367_ARATH (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 474.2 bits (1219), Expect = 2.4e-132
Identity = 251/640 (39.22%), Postives = 373/640 (58.28%), Query Frame = 1

Query: 48  LIHASNSTHKLRQIHGQLYRCNVFSSSRVVTQFISSC-------SSLNSVDYAISIFQRF 107
           L+ + +S   L+ IHG L R ++ S   V ++ ++ C          N + YA  IF + 
Sbjct: 18  LLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFSQI 77

Query: 108 ELKNSYLFNALIRGLAENSRFESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGR 167
           +  N ++FN LIR  +  +    +  F+  MLK +I PD +TFPF++K+++ +    VG 
Sbjct: 78  QNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLVGE 137

Query: 168 ALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGY 227
             H  I++FG + D +V  SLV MY     + +A ++F +    +    V+ W  ++ GY
Sbjct: 138 QTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQ----MGFRDVVSWTSMVAGY 197

Query: 228 CRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVN 287
           C+ G +  A E+FD                               +MP +N+ +W+ M+N
Sbjct: 198 CKCGMVENAREMFD-------------------------------EMPHRNLFTWSIMIN 257

Query: 288 GFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKL 347
           G+++N   EKA++ F  M  EG   N+  +VS +S+CA +GAL+ G R + Y+  +   +
Sbjct: 258 GYAKNNCFEKAIDLFEFMKREGVVANETVMVSVISSCAHLGALEFGERAYEYVVKSHMTV 317

Query: 348 NLVIGTALVDMYAKCGNIEHAEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMK 407
           NL++GTALVDM+ +CG+IE A  VF    E   L WS +I G A+HGH  KA+ YF  M 
Sbjct: 318 NLILGTALVDMFWRCGDIEKAIHVFEGLPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMI 377

Query: 408 FTGTKPDSVVFLAVLNACSHSGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRL 467
             G  P  V F AVL+ACSH G V +GL+ ++NM++ + IEP ++HY  +VDMLGRAG+L
Sbjct: 378 SLGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKL 437

Query: 468 DEALKFIRAMPITPDFVVWGALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAY 527
            EA  FI  M + P+  + GAL  AC+ +KN E+AE     L++++P+H G YV LSN Y
Sbjct: 438 AEAENFILKMHVKPNAPILGALLGACKIYKNTEVAERVGNMLIKVKPEHSGYYVLLSNIY 497

Query: 528 ASVGRWDDAERVRVSMRDHGAHKDPGWSFIEVDHKLHRFVAGDN-THNRAVEIYSKLDEI 587
           A  G+WD  E +R  M++    K PGWS IE+D K+++F  GD+  H    +I  K +EI
Sbjct: 498 ACAGQWDKIESLRDMMKEKLVKKPPGWSLIEIDGKINKFTMGDDQKHPEMGKIRRKWEEI 557

Query: 588 SASAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVC 647
               R  GY         +++EEEKE ++  HSEKLA+A+G++ T+PGTT+RIVKNLRVC
Sbjct: 558 LGKIRLIGYKGNTGDAFFDVDEEEKESSIHMHSEKLAIAYGMMKTKPGTTIRIVKNLRVC 617

Query: 648 VDCHSFMKYASKMSKREIILRDMKRFHHFNDGVCSCGDYW 680
            DCH+  K  S++  RE+I+RD  RFHHF +GVCSC DYW
Sbjct: 618 EDCHTVTKLISEVYGRELIVRDRNRFHHFRNGVCSCRDYW 622

BLAST of Csa2G139850 vs. Swiss-Prot
Match: PP425_ARATH (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 466.1 bits (1198), Expect = 6.7e-130
Identity = 264/669 (39.46%), Postives = 378/669 (56.50%), Query Frame = 1

Query: 21  NPRIAFFSSMFSSSSPPISFLETHFIDLIHASNSTHKLRQIHGQLYRCNVFSSSRVVTQF 80
           NP    FS   +S +   +   +     I+   +   L QIH    +      +    + 
Sbjct: 2   NPTQTLFSPGGNSPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEI 61

Query: 81  ISSCSSLN----SVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSIS---FFVLMLK 140
           +  C++ +     +DYA  IF +   +N + +N +IRG +E+   ++ I+   F+ +M  
Sbjct: 62  LRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSD 121

Query: 141 WKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEELGS 200
             + P+R TFP VLK+ A       G+ +H   LK+G   D FV  +LV MYV    +  
Sbjct: 122 EFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFM-- 181

Query: 201 ALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGS---WNSLING 260
                       K+  VL +  +I       D+V  T+      +K  G    WN +I+G
Sbjct: 182 ------------KDARVLFYKNIIEK-----DMVVMTDR-----RKRDGEIVLWNVMIDG 241

Query: 261 FMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTI 320
           +M++GD   A+ LF KM +++VVSW TM++G+S NG  + A+E F  M +   RPN  T+
Sbjct: 242 YMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFREMKKGDIRPNYVTL 301

Query: 321 VSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETKE 380
           VS L A +++G+L+ G  +H Y   +G +++ V+G+AL+DMY+KCG IE A  VF     
Sbjct: 302 VSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPR 361

Query: 381 KGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVNEGLKF 440
           + ++ WS MI G+AIHG    A+  F  M+  G +P  V ++ +L ACSH G V EG ++
Sbjct: 362 ENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRY 421

Query: 441 FDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALFCACRTHK 500
           F  M     +EP ++HY  +VD+LGR+G LDEA +FI  MPI PD V+W AL  ACR   
Sbjct: 422 FSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQG 481

Query: 501 NVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHKDPGWSFI 560
           NVEM +  +  L+ + P   G+YV LSN YAS G W +   +R+ M++    KDPG S I
Sbjct: 482 NVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLI 541

Query: 561 EVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEEKEEALGY 620
           ++D  LH FV  D++H +A EI S L EIS   R  GY      VL N+EEE+KE  L Y
Sbjct: 542 DIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENVLHY 601

Query: 621 HSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMKRFHHFND 680
           HSEK+A AFG++ST PG  +RIVKNLR+C DCHS +K  SK+ KR+I +RD KRFHHF D
Sbjct: 602 HSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQD 646

BLAST of Csa2G139850 vs. Swiss-Prot
Match: PP301_ARATH (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 464.9 bits (1195), Expect = 1.5e-129
Identity = 246/622 (39.55%), Postives = 358/622 (57.56%), Query Frame = 1

Query: 81  ISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFVLMLKW------ 140
           +S  +    VD A S+F R   KN   +NAL+    +NS+ E +   F     W      
Sbjct: 164 LSGYAQNGCVDDARSVFDRMPEKNDVSWNALLSAYVQNSKMEEACMLFKSRENWALVSWN 223

Query: 141 ----------KISPDRLTFPFV-LKSAAALSNGGVGRALHCGILKFGLEFDS------FV 200
                     KI   R  F  + ++   + +    G A    I +    FD       F 
Sbjct: 224 CLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEARQLFDESPVQDVFT 283

Query: 201 RVSLVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMP 260
             ++V  Y++   +  A ++FD+ PE     + + WN ++ GY +   +  A ELFD MP
Sbjct: 284 WTAMVSGYIQNRMVEEARELFDKMPER----NEVSWNAMLAGYVQGERMEMAKELFDVMP 343

Query: 261 KKDTGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFC 320
            ++  +WN++I G+ + G +  AK LF KMP+++ VSW  M+ G+SQ+G   +AL  F  
Sbjct: 344 CRNVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMIAGYSQSGHSFEALRLFVQ 403

Query: 321 MLEEGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGN 380
           M  EG R N  +  SALS CA + AL+ G ++H  L   G++    +G AL+ MY KCG+
Sbjct: 404 MEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALLLMYCKCGS 463

Query: 381 IEHAEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNA 440
           IE A  +F E   K ++ W+ MI G++ HG    AL++FE MK  G KPD    +AVL+A
Sbjct: 464 IEEANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSA 523

Query: 441 CSHSGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFV 500
           CSH+G V++G ++F  M + Y + P+ +HY  +VD+LGRAG L++A   ++ MP  PD  
Sbjct: 524 CSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLMKNMPFEPDAA 583

Query: 501 VWGALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMR 560
           +WG L  A R H N E+AE A+ K+  +EP++ G YV LSN YAS GRW D  ++RV MR
Sbjct: 584 IWGTLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWGDVGKLRVRMR 643

Query: 561 DHGAHKDPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLH 620
           D G  K PG+S+IE+ +K H F  GD  H    EI++ L+E+    ++ GY  +   VLH
Sbjct: 644 DKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLRMKKAGYVSKTSVVLH 703

Query: 621 NIEEEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREI 680
           ++EEEEKE  + YHSE+LA+A+GI+    G  +R++KNLRVC DCH+ +KY ++++ R I
Sbjct: 704 DVEEEEKERMVRYHSERLAVAYGIMRVSSGRPIRVIKNLRVCEDCHNAIKYMARITGRLI 763


HSP 2 Score: 116.7 bits (291), Expect = 9.9e-25
Identity = 95/405 (23.46%), Postives = 182/405 (44.94%), Query Frame = 1

Query: 91  DYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFVLMLKWKISPDRLTFPFVLKSA 150
           + A  +F     ++   +N +I+G   N     +   F +M      P+R    +    +
Sbjct: 112 ELARKLFDEMPERDLVSWNVMIKGYVRNRNLGKARELFEIM------PERDVCSWNTMLS 171

Query: 151 AALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDESPESVKNGSV 210
               NG V  A    +     E +     +L+  YV+  ++  A  +F    +S +N ++
Sbjct: 172 GYAQNGCVDDAR--SVFDRMPEKNDVSWNALLSAYVQNSKMEEACMLF----KSRENWAL 231

Query: 211 LIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRAKELFVKMPEK 270
           + WN L+ G+ +   +V+A + FDSM  +D  SWN++I G+ + G +  A++LF + P +
Sbjct: 232 VSWNCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEARQLFDESPVQ 291

Query: 271 NVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKIGALDAGLRIH 330
           +V +WT MV+G+ QN   E+A E F  M E     N+ +  + L+   +   ++    + 
Sbjct: 292 DVFTWTAMVSGYIQNRMVEEARELFDKMPER----NEVSWNAMLAGYVQGERMEMAKELF 351

Query: 331 NYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETKEKGLLIWSVMIWGWAIHGHFR 390
           + +       N+     ++  YA+CG I  A+ +F +  ++  + W+ MI G++  GH  
Sbjct: 352 DVMPCR----NVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMIAGYSQSGHSF 411

Query: 391 KALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVNEGLKFFDNMRRGYLIEPSMKHYTLV 450
           +AL+ F  M+  G + +   F + L+ C+    +  G +    + +G           L+
Sbjct: 412 EALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALL 471

Query: 451 VDMLGRAGRLDEALKFIRAMPITPDFVVWGALFCACRTHKNVEMA 496
           + M  + G ++EA    + M    D V W  +      H   E+A
Sbjct: 472 L-MYCKCGSIEEANDLFKEM-AGKDIVSWNTMIAGYSRHGFGEVA 494


HSP 3 Score: 112.5 bits (280), Expect = 1.9e-23
Identity = 94/360 (26.11%), Postives = 160/360 (44.44%), Query Frame = 1

Query: 182 VDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDT 241
           +  Y++      AL+VF   P      S + +N +I GY R G+   A +LFD MP++D 
Sbjct: 71  ISSYMRTGRCNEALRVFKRMPR----WSSVSYNGMISGYLRNGEFELARKLFDEMPERDL 130

Query: 242 GSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEE 301
            SWN +I G+++  ++G+A+ELF  MPE++V SW TM++G++QNG  + A   F  M E+
Sbjct: 131 VSWNVMIKGYVRNRNLGKARELFEIMPERDVCSWNTMLSGYAQNGCVDDARSVFDRMPEK 190

Query: 302 GARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHA 361
               ND +  + LSA  +   ++    +  + S   +   LV    L+  + K   I  A
Sbjct: 191 ----NDVSWNALLSAYVQNSKMEEACML--FKSRENWA--LVSWNCLLGGFVKKKKIVEA 250

Query: 362 EKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHS 421
            + F     + ++ W+ +I G+A  G   +A Q F+         D   + A+++    +
Sbjct: 251 RQFFDSMNVRDVVSWNTIITGYAQSGKIDEARQLFD----ESPVQDVFTWTAMVSGYIQN 310

Query: 422 GQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGA 481
             V E  + FD M      E +   +  ++    +  R++ A +    MP   +   W  
Sbjct: 311 RMVEEARELFDKMP-----ERNEVSWNAMLAGYVQGERMEMAKELFDVMPCR-NVSTWNT 370

Query: 482 LFCACRTHKNVEMAELASKKLLQLEPKH-PGSYVFLSNAYASVGRWDDAERVRVSMRDHG 541
           +         +  A    K L    PK  P S+  +   Y+  G   +A R+ V M   G
Sbjct: 371 MITGYAQCGKISEA----KNLFDKMPKRDPVSWAAMIAGYSQSGHSFEALRLFVQMEREG 404


HSP 4 Score: 100.9 bits (250), Expect = 5.6e-20
Identity = 90/356 (25.28%), Postives = 158/356 (44.38%), Query Frame = 1

Query: 81  ISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFVLMLKWKISPDR 140
           ISS       + A+ +F+R    +S  +N +I G   N  FE +   F  M      P+R
Sbjct: 71  ISSYMRTGRCNEALRVFKRMPRWSSVSYNGMISGYLRNGEFELARKLFDEM------PER 130

Query: 141 LTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDE 200
               + +     + N  +G+A    + +   E D     +++  Y +   +  A  VFD 
Sbjct: 131 DLVSWNVMIKGYVRNRNLGKARE--LFEIMPERDVCSWNTMLSGYAQNGCVDDARSVFDR 190

Query: 201 SPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRA 260
            PE  KN   + WN L+  Y +   + +A  LF S       SWN L+ GF+K   +  A
Sbjct: 191 MPE--KND--VSWNALLSAYVQNSKMEEACMLFKSRENWALVSWNCLLGGFVKKKKIVEA 250

Query: 261 KELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKI 320
           ++ F  M  ++VVSW T++ G++Q+G  ++A + F    +E    + +T  + +S   + 
Sbjct: 251 RQFFDSMNVRDVVSWNTIITGYAQSGKIDEARQLF----DESPVQDVFTWTAMVSGYIQN 310

Query: 321 GALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETKEKGLLIWSVMI 380
             ++    + + +     + N V   A++  Y +   +E A+++F     + +  W+ MI
Sbjct: 311 RMVEEARELFDKMP----ERNEVSWNAMLAGYVQGERMEMAKELFDVMPCRNVSTWNTMI 370

Query: 381 WGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVNEGLKFFDNMRR 437
            G+A  G   +A   F+ M     K D V + A++   S SG   E L+ F  M R
Sbjct: 371 TGYAQCGKISEAKNLFDKM----PKRDPVSWAAMIAGYSQSGHSFEALRLFVQMER 402

BLAST of Csa2G139850 vs. Swiss-Prot
Match: PP122_ARATH (Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana GN=PCMP-H71 PE=2 SV=1)

HSP 1 Score: 464.5 bits (1194), Expect = 1.9e-129
Identity = 242/641 (37.75%), Postives = 371/641 (57.88%), Query Frame = 1

Query: 44  HFIDLIHASNSTHKLRQIHGQLYRCNVFSSSRVVTQFISSC--SSLNSVDYAISIFQRFE 103
           H + L+++  +   L QIHG   +  V + S    + I  C  S  +++ YA  +   F 
Sbjct: 7   HCLSLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFP 66

Query: 104 LKNSYLFNALIRGLAENSRFESSISFFV-LMLKWKISPDRLTFPFVLKSAAALSNGGVGR 163
             ++++FN L+RG +E+    +S++ FV +M K  + PD  +F FV+K+     +   G 
Sbjct: 67  EPDAFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGF 126

Query: 164 ALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGY 223
            +HC  LK GLE   FV  +L+ MY     +  A KVFDE  +     +++ WN +I   
Sbjct: 127 QMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQP----NLVAWNAVITAC 186

Query: 224 CRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVN 283
            R  D+  A E+FD M  ++  SWN ++ G++K G++  AK +F +MP ++ VSW+TM+ 
Sbjct: 187 FRGNDVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIV 246

Query: 284 GFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKL 343
           G + NG   ++   F  +   G  PN+ ++   LSAC++ G+ + G  +H ++   G+  
Sbjct: 247 GIAHNGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSW 306

Query: 344 NLVIGTALVDMYAKCGNIEHAEKVFHETKEKGLLI-WSVMIWGWAIHGHFRKALQYFEWM 403
            + +  AL+DMY++CGN+  A  VF   +EK  ++ W+ MI G A+HG   +A++ F  M
Sbjct: 307 IVSVNNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEM 366

Query: 404 KFTGTKPDSVVFLAVLNACSHSGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGR 463
              G  PD + F+++L+ACSH+G + EG  +F  M+R Y IEP ++HY  +VD+ GR+G+
Sbjct: 367 TAYGVTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGK 426

Query: 464 LDEALKFIRAMPITPDFVVWGALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNA 523
           L +A  FI  MPI P  +VW  L  AC +H N+E+AE   ++L +L+P + G  V LSNA
Sbjct: 427 LQKAYDFICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLLSNA 486

Query: 524 YASVGRWDDAERVRVSMRDHGAHKDPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEI 583
           YA+ G+W D   +R SM      K   WS +EV   +++F AG+      +E + KL EI
Sbjct: 487 YATAGKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFTAGEKKKGIDIEAHEKLKEI 546

Query: 584 SASAR-EKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRV 643
               + E GYT E+   L+++EEEEKE+ +  HSEKLALAF +     G  +RIVKNLR+
Sbjct: 547 ILRLKDEAGYTPEVASALYDVEEEEKEDQVSKHSEKLALAFALARLSKGANIRIVKNLRI 606

Query: 644 CVDCHSFMKYASKMSKREIILRDMKRFHHFNDGVCSCGDYW 680
           C DCH+ MK  SK+   EI++RD  RFH F DG CSC DYW
Sbjct: 607 CRDCHAVMKLTSKVYGVEILVRDRNRFHSFKDGSCSCRDYW 643

BLAST of Csa2G139850 vs. TrEMBL
Match: F6GWJ6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0029g01130 PE=4 SV=1)

HSP 1 Score: 934.1 bits (2413), Expect = 9.6e-269
Identity = 459/676 (67.90%), Postives = 546/676 (80.77%), Query Frame = 1

Query: 8   SGSNIMKDLHVLFNPRIAFFSSMFSSSSP----PISFLETHFIDLIHASNSTHKLRQIHG 67
           S S  +K L+ LF P      +   +++     P    ETHFI LIHASN+  +L QIH 
Sbjct: 2   SKSQGLKALNALFKPTSPPAKTTTVTTTTRAHGPSRSPETHFIPLIHASNTLPQLHQIHA 61

Query: 68  QLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFES 127
           Q++  N+FS+SRVVTQ ISS  SL S+DYA+SIF+ F+  N ++FNALIRGLAENSRFE 
Sbjct: 62  QIFLHNLFSNSRVVTQLISSSCSLKSLDYALSIFRCFDHPNLFVFNALIRGLAENSRFEG 121

Query: 128 SISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVD 187
           S+S FVLML+  I PDRLT PFVLKS AAL + G+GR LH G++K GLEFDSFVRVSLVD
Sbjct: 122 SVSHFVLMLRLSIRPDRLTLPFVLKSVAALVDVGLGRCLHGGVMKLGLEFDSFVRVSLVD 181

Query: 188 MYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGS 247
           MYVK+ ELG  L++FDESP+  K  S+L+WNVLI+G C++GDL KA  LF++MP+++ GS
Sbjct: 182 MYVKIGELGFGLQLFDESPQRNKAESILLWNVLINGCCKVGDLSKAASLFEAMPERNAGS 241

Query: 248 WNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGA 307
           WNSLINGF++ GD+ RA+ELFV+MPEKNVVSWTTM+NGFSQNGD EKAL  F+ MLEEG 
Sbjct: 242 WNSLINGFVRNGDLDRARELFVQMPEKNVVSWTTMINGFSQNGDHEKALSMFWRMLEEGV 301

Query: 308 RPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEK 367
           RPND T+VSAL AC KIGAL  G RIHNYLS NGF+LN  IGTALVDMYAKCGNI+ A +
Sbjct: 302 RPNDLTVVSALLACTKIGALQVGERIHNYLSSNGFQLNRGIGTALVDMYAKCGNIKSASR 361

Query: 368 VFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQ 427
           VF ETK K LL WSVMIWGWAIHG F +ALQ F  MK  G  PD V+FLA+L ACSHSG 
Sbjct: 362 VFVETKGKDLLTWSVMIWGWAIHGCFDQALQCFVKMKSAGINPDEVIFLAILTACSHSGN 421

Query: 428 VNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALF 487
           V++GL FF++MR  Y IEP+MKHYTL+VD+LGRAGRLDEAL FI++MPI PDFV+WGALF
Sbjct: 422 VDQGLNFFESMRLDYSIEPTMKHYTLIVDLLGRAGRLDEALSFIQSMPINPDFVIWGALF 481

Query: 488 CACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHK 547
           CACR HKN+EMAEL ++KLLQLEPKHPGSYVFLSN YA+VGRW+D ERVR  M++ G  K
Sbjct: 482 CACRAHKNIEMAELTAEKLLQLEPKHPGSYVFLSNVYAAVGRWEDVERVRTLMKNRGVEK 541

Query: 548 DPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEE 607
           DPGWS+IEV+ ++H FVAGD+ H RA EI  KL+EI+ASA+++GY  E   VLHNIEEEE
Sbjct: 542 DPGWSYIEVEGQVHSFVAGDHAHVRAEEISLKLEEITASAKQEGYMPETAWVLHNIEEEE 601

Query: 608 KEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMK 667
           KE+ALG HSEKLALAFG++ST PG+T+RIVKNLRVC DCHS MKYASK+S+REIILRD+K
Sbjct: 602 KEDALGSHSEKLALAFGLISTAPGSTIRIVKNLRVCGDCHSMMKYASKLSRREIILRDIK 661

Query: 668 RFHHFNDGVCSCGDYW 680
           RFHHF DG CSCGDYW
Sbjct: 662 RFHHFKDGTCSCGDYW 677

BLAST of Csa2G139850 vs. TrEMBL
Match: A0A061EK73_THECC (Tetratricopeptide repeat-like superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_020290 PE=4 SV=1)

HSP 1 Score: 893.6 bits (2308), Expect = 1.4e-256
Identity = 443/682 (64.96%), Postives = 532/682 (78.01%), Query Frame = 1

Query: 13  MKDLHVLFN-----PRIAFFSSMFSSSSPPISF----------LETHFIDLIHASNSTHK 72
           MK L +LF       +    SS F    PPIS           L+THF  LI +S +T +
Sbjct: 1   MKSLRLLFERNSSPAKTNSSSSSFKKPKPPISHGSSSSSSQDPLKTHFASLIQSSKTTLQ 60

Query: 73  LRQIHGQLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAE 132
           LRQIH Q++R N+ SSS + T  IS+ SSL S+ YAIS+F  F  K+ +LFNALIRGL +
Sbjct: 61  LRQIHAQIFRRNLSSSSNLTTLLISASSSLKSIPYAISLFNHFHHKSIFLFNALIRGLTD 120

Query: 133 NSRFESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFV 192
           NS  ESSIS F+LML   + PD+LT+PFVLKS A L    +G  LH  I+K G+EFDSFV
Sbjct: 121 NSLLESSISHFLLMLSLGVRPDKLTYPFVLKSIAGLGLRCLGLILHGRIIKSGVEFDSFV 180

Query: 193 RVSLVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMP 252
           RV+LV+MYVK++ELG AL+VFDESPE  K+GS+L+WNVLI+GYC+ G+L KA ELF++ P
Sbjct: 181 RVALVEMYVKLKELGFALQVFDESPERNKSGSILLWNVLINGYCKDGNLGKAMELFEATP 240

Query: 253 KKDTGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFC 312
           +++ GSWNSLINGFM+ GD+ +A ELF +M EK+VVSWTTMVNGFSQNGD EKAL  FF 
Sbjct: 241 ERNIGSWNSLINGFMRNGDLDKAVELFDEMKEKDVVSWTTMVNGFSQNGDHEKALSMFFK 300

Query: 313 MLEEGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGN 372
           MLE   RPND T+V ALSACAKIGAL+AG RIH+Y+  NGF+LN  IG ALVDMYAKCG+
Sbjct: 301 MLEAALRPNDLTLVPALSACAKIGALEAGARIHDYVLENGFRLNKAIGAALVDMYAKCGD 360

Query: 373 IEHAEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNA 432
           I+ A KVF ETKE+ +L WSVMIWGWAIHG++ +A+Q F+ M F+G KPD VVFLA+L A
Sbjct: 361 IQSASKVFDETKERDILTWSVMIWGWAIHGYYEQAIQCFKKMMFSGIKPDGVVFLALLTA 420

Query: 433 CSHSGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFV 492
           CSHSGQVN GL FFD+MR  Y IEP+MKHYTLVVD+LGRAG+LDE+LKFI+ MP++PDFV
Sbjct: 421 CSHSGQVNLGLNFFDSMRLDYSIEPTMKHYTLVVDLLGRAGQLDESLKFIQRMPMSPDFV 480

Query: 493 VWGALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMR 552
            WGALFCACR HKN++MAEL S+ LLQLEPKHPGSYVFLSN YA+VGRW+D ERVR+ M+
Sbjct: 481 TWGALFCACRAHKNIKMAELVSQNLLQLEPKHPGSYVFLSNVYAAVGRWEDVERVRMLMQ 540

Query: 553 DHGAHKDPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLH 612
           +    KDPGWS+IEV  ++H FVAGD+ H  A EIY KL+EI A  R+ GY  E   VLH
Sbjct: 541 NRAVDKDPGWSYIEVGGEMHSFVAGDHAHKHAREIYLKLEEIVAGTRQHGYMPETGWVLH 600

Query: 613 NIEEEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREI 672
           NIEEEEKE+ALG HSEKLALAF ++ T PGTT+RIVKNLRVC DCHS MKYASKMS+REI
Sbjct: 601 NIEEEEKEDALGSHSEKLALAFALIRTSPGTTIRIVKNLRVCGDCHSLMKYASKMSQREI 660

Query: 673 ILRDMKRFHHFNDGVCSCGDYW 680
           +LRD+KRFHHF DG CSCGDYW
Sbjct: 661 VLRDIKRFHHFKDGACSCGDYW 682

BLAST of Csa2G139850 vs. TrEMBL
Match: A0A067KWK1_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01110 PE=4 SV=1)

HSP 1 Score: 888.6 bits (2295), Expect = 4.6e-255
Identity = 432/674 (64.09%), Postives = 530/674 (78.64%), Query Frame = 1

Query: 13  MKDLHVLF---NPRIAFFSSMFSSSSPPISFL----ETHFIDLIHASNSTHKLRQIHGQL 72
           M+  H LF   N      SS   +SSP  +      ETH I LIHAS ++ +L QIH Q+
Sbjct: 1   MRSRHALFKAKNSPAKTTSSREPTSSPNKALSQNPSETHLISLIHASKTSRQLHQIHAQI 60

Query: 73  YRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSI 132
           +  N+ +SS++ TQ ISS SS   +DYAI++F  +  KNS+LFNALIRGL  NS FES+I
Sbjct: 61  FLHNLSTSSQIATQLISSSSSRKFIDYAITVFNHYYPKNSFLFNALIRGLTNNSLFESAI 120

Query: 133 SFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMY 192
           S F+LML+  + PD+LT+PFVLKS A L + G+GRALH  I K G EFD FVR+S+VD Y
Sbjct: 121 SHFILMLRSDVKPDQLTYPFVLKSIATLCSEGLGRALHGMIYKSGFEFDLFVRISMVDAY 180

Query: 193 VKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWN 252
           VKVEELGSALK+FDESP+     S L+WNVLI+G C++G + KA +LF++MP++ T SWN
Sbjct: 181 VKVEELGSALKLFDESPQRFYGESTLLWNVLINGCCKVGSMRKAVDLFETMPERTTASWN 240

Query: 253 SLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARP 312
           SLINGF++ GD+ RA ELF +MPEKNVVSWTTMVNG S NGD EKAL  F  ML+ G +P
Sbjct: 241 SLINGFLRSGDLERANELFGRMPEKNVVSWTTMVNGLSHNGDHEKALSLFSKMLQVGVKP 300

Query: 313 NDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVF 372
           ND+TIVSALSACAKIGAL+AG+RIH YL+ NGF+LN  IGTALVDMYAKCG+IE A +VF
Sbjct: 301 NDFTIVSALSACAKIGALEAGVRIHRYLTDNGFRLNAKIGTALVDMYAKCGSIESASQVF 360

Query: 373 HETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVN 432
            ETKEK +L W+VMIWGWAIHGH  +A+Q F  M + G +PD VVFLA+L AC+H+G+V+
Sbjct: 361 RETKEKDVLTWTVMIWGWAIHGHSEEAIQCFRQMMYAGIRPDEVVFLAILTACTHAGKVD 420

Query: 433 EGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALFCA 492
            GL FF +M   Y IEPSMKHY L+VD+LGRAGRL++ALKFI  MPITPDFV+WGALFC 
Sbjct: 421 LGLNFFKSMELDYSIEPSMKHYALIVDLLGRAGRLNQALKFIERMPITPDFVIWGALFCT 480

Query: 493 CRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHKDP 552
           CR HKN+++AELA++KLL+LEPKHPGSYVFLSN YA+VGRW+DAERVR  M++ G  KDP
Sbjct: 481 CRAHKNIKLAELAAQKLLELEPKHPGSYVFLSNVYAAVGRWEDAERVRSLMQNRGIEKDP 540

Query: 553 GWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEEKE 612
           GWS++EV+ ++H F AGD++H  A +IY KL++I A A+ +GY    E VLHNIEEEEKE
Sbjct: 541 GWSYVEVEGQVHSFAAGDSSHKDAKDIYLKLEQIVAGAKGQGYMPGTEWVLHNIEEEEKE 600

Query: 613 EALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMKRF 672
           +ALG HSEKLALAFG++ T PG T+RIVKNLRVC DCHS MKYASKMS+REIILRD+KRF
Sbjct: 601 DALGSHSEKLALAFGLIRTSPGMTLRIVKNLRVCGDCHSLMKYASKMSQREIILRDIKRF 660

Query: 673 HHFNDGVCSCGDYW 680
           HHF DG+CSCGDYW
Sbjct: 661 HHFKDGICSCGDYW 674

BLAST of Csa2G139850 vs. TrEMBL
Match: A0A0D2PM74_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G031300 PE=4 SV=1)

HSP 1 Score: 878.6 bits (2269), Expect = 4.8e-252
Identity = 434/652 (66.56%), Postives = 523/652 (80.21%), Query Frame = 1

Query: 28  SSMFSSSSPPISFLETHFIDLIHASNSTHKLRQIHGQLYRCNVFSSSRVVTQFISSCSSL 87
           SS  SSS  P   L+THF  LI ++ +T +LRQIH Q+ R ++ SS+ + T  IS  SSL
Sbjct: 54  SSQSSSSQDP---LKTHFSSLIKSTETTLQLRQIHAQILRRHLSSSANLTTLLISVSSSL 113

Query: 88  NSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFVLMLKWKISPDRLTFPFVL 147
            S+ YA+SIF     K+ +LFNALIRGL ENS F+SS+S F+LML+ ++ PD+LT+PFVL
Sbjct: 114 KSIPYALSIFNNSHHKSLFLFNALIRGLTENSHFQSSVSHFLLMLRHRVRPDKLTYPFVL 173

Query: 148 KSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDESPESVKN 207
           KS A L    +G  LH  I+K G+EFDSFVRVSLV+MYVK+EE+G AL+VFDESPE  K+
Sbjct: 174 KSVAGLGLRFLGLILHGRIIKSGVEFDSFVRVSLVEMYVKLEEMGFALQVFDESPERNKS 233

Query: 208 GSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRAKELFVKM 267
            S+L+WNVLI+G CR+GDL KATELF++MP+++ GSWNS ING MK GD+ +A +LF +M
Sbjct: 234 ESILLWNVLINGCCRVGDLEKATELFEAMPERNIGSWNSFINGLMKNGDLNKAMQLFDEM 293

Query: 268 PEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKIGALDAGL 327
            EK+VVSWTT+VNG SQNGD +KAL  FF MLE G RPND T+VSALSACAKIGAL+AG+
Sbjct: 294 KEKDVVSWTTIVNGLSQNGDHQKALSMFFKMLEVGLRPNDLTLVSALSACAKIGALEAGV 353

Query: 328 RIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETKEKGLLIWSVMIWGWAIHG 387
           RIHNY   NG +LN     ALVDMYAKCGNI  A KVF ETKEK +  WSVMIWGWA HG
Sbjct: 354 RIHNYFVENGLRLNKATAAALVDMYAKCGNILSASKVFEETKEKDIRTWSVMIWGWATHG 413

Query: 388 HFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVNEGLKFFDNMRRGYLIEPSMKHY 447
            + +A++ F+ M F+G KPD+VVFLA+L ACSHSGQV+ GL FFD+MR  Y IEP+MKHY
Sbjct: 414 FYGQAIRCFKKMMFSGIKPDAVVFLALLTACSHSGQVDLGLNFFDSMRFDYSIEPTMKHY 473

Query: 448 TLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALFCACRTHKNVEMAELASKKLLQLEP 507
           TLVVD+LGRAGRLDEA+KFI+ MPI+PDFV WGALFCACR HKN++MAEL S+KLLQLEP
Sbjct: 474 TLVVDLLGRAGRLDEAMKFIQRMPISPDFVAWGALFCACRAHKNIKMAELVSEKLLQLEP 533

Query: 508 KHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHKDPGWSFIEVDHKLHRFVAGDNTHN 567
           KHPGSYVFLSN YA+VGRW+D ERVR+ M++    KDPGWS+IEV+ ++H FVAGD+ H 
Sbjct: 534 KHPGSYVFLSNVYAAVGRWEDVERVRMLMQNQAVGKDPGWSYIEVNGQVHSFVAGDHDHK 593

Query: 568 RAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGIVSTRPG 627
           RA EIY KL+EI + ARE+GY  E   VLHNIEEEEKE+ALG HSEKLALAF +++T PG
Sbjct: 594 RAREIYLKLEEIVSGAREQGYMPETGWVLHNIEEEEKEDALGSHSEKLALAFALMNTSPG 653

Query: 628 TTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMKRFHHFNDGVCSCGDYW 680
           TT+RIVKNLRVC DCHS MK ASKMS+REIILRD+KRFHHF  GVCSCGDYW
Sbjct: 654 TTIRIVKNLRVCGDCHSLMKCASKMSQREIILRDIKRFHHFKYGVCSCGDYW 702

BLAST of Csa2G139850 vs. TrEMBL
Match: B9GFV9_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0001s32380g PE=4 SV=1)

HSP 1 Score: 825.1 bits (2130), Expect = 6.3e-236
Identity = 406/653 (62.17%), Postives = 499/653 (76.42%), Query Frame = 1

Query: 28  SSMFSSSSPPISFLETHFIDLIHASNSTHKLRQIHGQLYRCNVFSSSRVVTQFISSCSSL 87
           SS+ +   PP +  E HFI LIH S +  +L QIH Q+   N+ SSS + TQ ISS S  
Sbjct: 67  SSLSALFIPPTTPTEAHFISLIHGSKTILQLHQIHAQIIIHNLSSSSLITTQLISSSSLR 126

Query: 88  NSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFVLMLKWKISPDRLTFPFVL 147
            S+++++++F   + KN + FNALIRGL  NS F ++I  F LML+  I PDRLT+PFVL
Sbjct: 127 KSINHSLAVFNHHKPKNLFTFNALIRGLTTNSHFFNAIFHFRLMLRSGIKPDRLTYPFVL 186

Query: 148 KSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDESPESVKN 207
           KS A L +  +G A+HC IL+ G+E DSFVRVSLVDMYVKVE+LGSA KVFDESPE   +
Sbjct: 187 KSMAGLFSTELGMAIHCMILRCGIELDSFVRVSLVDMYVKVEKLGSAFKVFDESPERFDS 246

Query: 208 GS-VLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRAKELFVK 267
           GS  L+WNVLI G C+ G + KA +LF +MPKK+  SW++LI+GF K GDM RA ELF +
Sbjct: 247 GSSALLWNVLIKGCCKAGSMKKAVKLFKAMPKKENVSWSTLIDGFAKNGDMDRAMELFDQ 306

Query: 268 MPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKIGALDAG 327
           MPEKNVVSWTTMV+GFS+NGD EKAL  F  MLEEG RPN +TIVSALSACAKIG L+AG
Sbjct: 307 MPEKNVVSWTTMVDGFSRNGDSEKALSMFSKMLEEGVRPNAFTIVSALSACAKIGGLEAG 366

Query: 328 LRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETKEKGLLIWSVMIWGWAIH 387
           LRIH Y+  NG  L   +GTALVDMYAKCGNIE A +VF ET++K +  W+VMIWGWAIH
Sbjct: 367 LRIHKYIKDNGLHLTEALGTALVDMYAKCGNIESASEVFGETEQKSIRTWTVMIWGWAIH 426

Query: 388 GHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVNEGLKFFDNMRRGYLIEPSMKH 447
           GH  +A+  F+ M F G KPD VVFLA+L AC HSGQV+ GL FFD+MR  Y IEPSMKH
Sbjct: 427 GHSEQAIACFKQMMFAGIKPDEVVFLALLTACMHSGQVDIGLNFFDSMRLDYCIEPSMKH 486

Query: 448 YTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALFCACRTHKNVEMAELASKKLLQLE 507
           YTL+VDMLGR+G+L EAL+FI  MP+ PDFV+WGALFCACR HK  +MA+ A  KLL+LE
Sbjct: 487 YTLIVDMLGRSGQLKEALRFIERMPMNPDFVIWGALFCACRAHKKTKMAKFALNKLLKLE 546

Query: 508 PKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHKDPGWSFIEVDHKLHRFVAGDNTH 567
           P H G+Y+FLSNAYA++G+W+DAERVRV M++ G HK+ GWS IEV+ ++HRFV+GD+ H
Sbjct: 547 PTHTGNYIFLSNAYAALGQWEDAERVRVLMQNRGVHKNSGWSCIEVEGQVHRFVSGDHDH 606

Query: 568 NRAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGIVSTRP 627
             +  I  KL+EI A A ++GY    E VLHN+E+EEKE+ LG H EKLALAF ++ T P
Sbjct: 607 KDSKAICLKLEEIMAGAVKQGYIPGTEWVLHNMEQEEKEDVLGSHGEKLALAFALICTSP 666

Query: 628 GTTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMKRFHHFNDGVCSCGDYW 680
           G T+RIVKNL+VC DCHS MKYASK+S+REI+LRDMKRFHHF DG CSC D+W
Sbjct: 667 GMTIRIVKNLQVCGDCHSLMKYASKISQREIMLRDMKRFHHFKDGSCSCRDHW 719

BLAST of Csa2G139850 vs. TAIR10
Match: AT1G04840.1 (AT1G04840.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 729.2 bits (1881), Expect = 2.4e-210
Identity = 361/670 (53.88%), Postives = 470/670 (70.15%), Query Frame = 1

Query: 13  MKDLHVLFNPRIAFFSSMFSS---SSPPISFLETHFIDLIHASNSTHKLRQIHGQLYRCN 72
           MK L V+F P+ +     F +   +SP     E+HFI LIHA   T  LR +H Q+ R  
Sbjct: 1   MKSLSVIFKPKSSPAKIYFPADRQASPD----ESHFISLIHACKDTASLRHVHAQILRRG 60

Query: 73  VFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFV 132
           V SS RV  Q +S  S L S DY++SIF+  E +N ++ NALIRGL EN+RFESS+  F+
Sbjct: 61  VLSS-RVAAQLVSCSSLLKSPDYSLSIFRNSEERNPFVLNALIRGLTENARFESSVRHFI 120

Query: 133 LMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVE 192
           LML+  + PDRLTFPFVLKS + L    +GRALH   LK  ++ DSFVR+SLVDMY K  
Sbjct: 121 LMLRLGVKPDRLTFPFVLKSNSKLGFRWLGRALHAATLKNFVDCDSFVRLSLVDMYAKTG 180

Query: 193 ELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWNSLIN 252
           +L  A +VF+ESP+ +K  S+LIWNVLI+GYCR  D+  AT LF SMP++++GSW++LI 
Sbjct: 181 QLKHAFQVFEESPDRIKKESILIWNVLINGYCRAKDMHMATTLFRSMPERNSGSWSTLIK 240

Query: 253 GFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYT 312
           G++  G++ RAK+LF  MPEKNVVSWTT++NGFSQ GD E A+ T+F MLE+G +PN+YT
Sbjct: 241 GYVDSGELNRAKQLFELMPEKNVVSWTTLINGFSQTGDYETAISTYFEMLEKGLKPNEYT 300

Query: 313 IVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETK 372
           I + LSAC+K GAL +G+RIH Y+  NG KL+  IGTALVDMYAKCG ++ A  VF    
Sbjct: 301 IAAVLSACSKSGALGSGIRIHGYILDNGIKLDRAIGTALVDMYAKCGELDCAATVFSNMN 360

Query: 373 EKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVNEGLK 432
            K +L W+ MI GWA+HG F +A+Q F  M ++G KPD VVFLAVL AC +S +V+ GL 
Sbjct: 361 HKDILSWTAMIQGWAVHGRFHQAIQCFRQMMYSGEKPDEVVFLAVLTACLNSSEVDLGLN 420

Query: 433 FFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALFCACRTH 492
           FFD+MR  Y IEP++KHY LVVD+LGRAG+L+EA + +  MPI PD   W AL+ AC+ H
Sbjct: 421 FFDSMRLDYAIEPTLKHYVLVVDLLGRAGKLNEAHELVENMPINPDLTTWAALYRACKAH 480

Query: 493 KNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHKDPGWSF 552
           K    AE  S+ LL+L+P+  GSY+FL   +AS G   D E+ R+S++     +  GWS+
Sbjct: 481 KGYRRAESVSQNLLELDPELCGSYIFLDKTHASKGNIQDVEKRRLSLQKRIKERSLGWSY 540

Query: 553 IEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEEKEEALG 612
           IE+D +L++F AGD +H    EI  KLDEI + A +KGY    +  +H+IEEEEKE   G
Sbjct: 541 IELDGQLNKFSAGDYSHKLTQEIGLKLDEIISLAIQKGYNPGADWSIHDIEEEEKENVTG 600

Query: 613 YHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMKRFHHFN 672
            HSEKLAL  G + T PGTT+RI+KNLR+C DCHS MKY SK+S+R+I+LRD ++FHHF 
Sbjct: 601 IHSEKLALTLGFLRTAPGTTIRIIKNLRICGDCHSLMKYVSKISQRDILLRDARQFHHFK 660

Query: 673 DGVCSCGDYW 680
           DG CSCGDYW
Sbjct: 661 DGRCSCGDYW 665

BLAST of Csa2G139850 vs. TAIR10
Match: AT5G06540.1 (AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 474.2 bits (1219), Expect = 1.4e-133
Identity = 251/640 (39.22%), Postives = 373/640 (58.28%), Query Frame = 1

Query: 48  LIHASNSTHKLRQIHGQLYRCNVFSSSRVVTQFISSC-------SSLNSVDYAISIFQRF 107
           L+ + +S   L+ IHG L R ++ S   V ++ ++ C          N + YA  IF + 
Sbjct: 18  LLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFSQI 77

Query: 108 ELKNSYLFNALIRGLAENSRFESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGR 167
           +  N ++FN LIR  +  +    +  F+  MLK +I PD +TFPF++K+++ +    VG 
Sbjct: 78  QNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLVGE 137

Query: 168 ALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGY 227
             H  I++FG + D +V  SLV MY     + +A ++F +    +    V+ W  ++ GY
Sbjct: 138 QTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQ----MGFRDVVSWTSMVAGY 197

Query: 228 CRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVN 287
           C+ G +  A E+FD                               +MP +N+ +W+ M+N
Sbjct: 198 CKCGMVENAREMFD-------------------------------EMPHRNLFTWSIMIN 257

Query: 288 GFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKL 347
           G+++N   EKA++ F  M  EG   N+  +VS +S+CA +GAL+ G R + Y+  +   +
Sbjct: 258 GYAKNNCFEKAIDLFEFMKREGVVANETVMVSVISSCAHLGALEFGERAYEYVVKSHMTV 317

Query: 348 NLVIGTALVDMYAKCGNIEHAEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMK 407
           NL++GTALVDM+ +CG+IE A  VF    E   L WS +I G A+HGH  KA+ YF  M 
Sbjct: 318 NLILGTALVDMFWRCGDIEKAIHVFEGLPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMI 377

Query: 408 FTGTKPDSVVFLAVLNACSHSGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRL 467
             G  P  V F AVL+ACSH G V +GL+ ++NM++ + IEP ++HY  +VDMLGRAG+L
Sbjct: 378 SLGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKL 437

Query: 468 DEALKFIRAMPITPDFVVWGALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAY 527
            EA  FI  M + P+  + GAL  AC+ +KN E+AE     L++++P+H G YV LSN Y
Sbjct: 438 AEAENFILKMHVKPNAPILGALLGACKIYKNTEVAERVGNMLIKVKPEHSGYYVLLSNIY 497

Query: 528 ASVGRWDDAERVRVSMRDHGAHKDPGWSFIEVDHKLHRFVAGDN-THNRAVEIYSKLDEI 587
           A  G+WD  E +R  M++    K PGWS IE+D K+++F  GD+  H    +I  K +EI
Sbjct: 498 ACAGQWDKIESLRDMMKEKLVKKPPGWSLIEIDGKINKFTMGDDQKHPEMGKIRRKWEEI 557

Query: 588 SASAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVC 647
               R  GY         +++EEEKE ++  HSEKLA+A+G++ T+PGTT+RIVKNLRVC
Sbjct: 558 LGKIRLIGYKGNTGDAFFDVDEEEKESSIHMHSEKLAIAYGMMKTKPGTTIRIVKNLRVC 617

Query: 648 VDCHSFMKYASKMSKREIILRDMKRFHHFNDGVCSCGDYW 680
            DCH+  K  S++  RE+I+RD  RFHHF +GVCSC DYW
Sbjct: 618 EDCHTVTKLISEVYGRELIVRDRNRFHHFRNGVCSCRDYW 622

BLAST of Csa2G139850 vs. TAIR10
Match: AT5G48910.1 (AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 466.1 bits (1198), Expect = 3.8e-131
Identity = 264/669 (39.46%), Postives = 378/669 (56.50%), Query Frame = 1

Query: 21  NPRIAFFSSMFSSSSPPISFLETHFIDLIHASNSTHKLRQIHGQLYRCNVFSSSRVVTQF 80
           NP    FS   +S +   +   +     I+   +   L QIH    +      +    + 
Sbjct: 2   NPTQTLFSPGGNSPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEI 61

Query: 81  ISSCSSLN----SVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSIS---FFVLMLK 140
           +  C++ +     +DYA  IF +   +N + +N +IRG +E+   ++ I+   F+ +M  
Sbjct: 62  LRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSD 121

Query: 141 WKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEELGS 200
             + P+R TFP VLK+ A       G+ +H   LK+G   D FV  +LV MYV    +  
Sbjct: 122 EFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFM-- 181

Query: 201 ALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGS---WNSLING 260
                       K+  VL +  +I       D+V  T+      +K  G    WN +I+G
Sbjct: 182 ------------KDARVLFYKNIIEK-----DMVVMTDR-----RKRDGEIVLWNVMIDG 241

Query: 261 FMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTI 320
           +M++GD   A+ LF KM +++VVSW TM++G+S NG  + A+E F  M +   RPN  T+
Sbjct: 242 YMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFREMKKGDIRPNYVTL 301

Query: 321 VSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETKE 380
           VS L A +++G+L+ G  +H Y   +G +++ V+G+AL+DMY+KCG IE A  VF     
Sbjct: 302 VSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPR 361

Query: 381 KGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVNEGLKF 440
           + ++ WS MI G+AIHG    A+  F  M+  G +P  V ++ +L ACSH G V EG ++
Sbjct: 362 ENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRY 421

Query: 441 FDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALFCACRTHK 500
           F  M     +EP ++HY  +VD+LGR+G LDEA +FI  MPI PD V+W AL  ACR   
Sbjct: 422 FSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQG 481

Query: 501 NVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHKDPGWSFI 560
           NVEM +  +  L+ + P   G+YV LSN YAS G W +   +R+ M++    KDPG S I
Sbjct: 482 NVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLI 541

Query: 561 EVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEEKEEALGY 620
           ++D  LH FV  D++H +A EI S L EIS   R  GY      VL N+EEE+KE  L Y
Sbjct: 542 DIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENVLHY 601

Query: 621 HSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMKRFHHFND 680
           HSEK+A AFG++ST PG  +RIVKNLR+C DCHS +K  SK+ KR+I +RD KRFHHF D
Sbjct: 602 HSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQD 646

BLAST of Csa2G139850 vs. TAIR10
Match: AT4G02750.1 (AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 464.9 bits (1195), Expect = 8.4e-131
Identity = 246/622 (39.55%), Postives = 358/622 (57.56%), Query Frame = 1

Query: 81  ISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFVLMLKW------ 140
           +S  +    VD A S+F R   KN   +NAL+    +NS+ E +   F     W      
Sbjct: 164 LSGYAQNGCVDDARSVFDRMPEKNDVSWNALLSAYVQNSKMEEACMLFKSRENWALVSWN 223

Query: 141 ----------KISPDRLTFPFV-LKSAAALSNGGVGRALHCGILKFGLEFDS------FV 200
                     KI   R  F  + ++   + +    G A    I +    FD       F 
Sbjct: 224 CLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEARQLFDESPVQDVFT 283

Query: 201 RVSLVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMP 260
             ++V  Y++   +  A ++FD+ PE     + + WN ++ GY +   +  A ELFD MP
Sbjct: 284 WTAMVSGYIQNRMVEEARELFDKMPER----NEVSWNAMLAGYVQGERMEMAKELFDVMP 343

Query: 261 KKDTGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFC 320
            ++  +WN++I G+ + G +  AK LF KMP+++ VSW  M+ G+SQ+G   +AL  F  
Sbjct: 344 CRNVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMIAGYSQSGHSFEALRLFVQ 403

Query: 321 MLEEGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGN 380
           M  EG R N  +  SALS CA + AL+ G ++H  L   G++    +G AL+ MY KCG+
Sbjct: 404 MEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALLLMYCKCGS 463

Query: 381 IEHAEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNA 440
           IE A  +F E   K ++ W+ MI G++ HG    AL++FE MK  G KPD    +AVL+A
Sbjct: 464 IEEANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSA 523

Query: 441 CSHSGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFV 500
           CSH+G V++G ++F  M + Y + P+ +HY  +VD+LGRAG L++A   ++ MP  PD  
Sbjct: 524 CSHTGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLMKNMPFEPDAA 583

Query: 501 VWGALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMR 560
           +WG L  A R H N E+AE A+ K+  +EP++ G YV LSN YAS GRW D  ++RV MR
Sbjct: 584 IWGTLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWGDVGKLRVRMR 643

Query: 561 DHGAHKDPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLH 620
           D G  K PG+S+IE+ +K H F  GD  H    EI++ L+E+    ++ GY  +   VLH
Sbjct: 644 DKGVKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLRMKKAGYVSKTSVVLH 703

Query: 621 NIEEEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREI 680
           ++EEEEKE  + YHSE+LA+A+GI+    G  +R++KNLRVC DCH+ +KY ++++ R I
Sbjct: 704 DVEEEEKERMVRYHSERLAVAYGIMRVSSGRPIRVIKNLRVCEDCHNAIKYMARITGRLI 763


HSP 2 Score: 116.7 bits (291), Expect = 5.6e-26
Identity = 95/405 (23.46%), Postives = 182/405 (44.94%), Query Frame = 1

Query: 91  DYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFVLMLKWKISPDRLTFPFVLKSA 150
           + A  +F     ++   +N +I+G   N     +   F +M      P+R    +    +
Sbjct: 112 ELARKLFDEMPERDLVSWNVMIKGYVRNRNLGKARELFEIM------PERDVCSWNTMLS 171

Query: 151 AALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDESPESVKNGSV 210
               NG V  A    +     E +     +L+  YV+  ++  A  +F    +S +N ++
Sbjct: 172 GYAQNGCVDDAR--SVFDRMPEKNDVSWNALLSAYVQNSKMEEACMLF----KSRENWAL 231

Query: 211 LIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRAKELFVKMPEK 270
           + WN L+ G+ +   +V+A + FDSM  +D  SWN++I G+ + G +  A++LF + P +
Sbjct: 232 VSWNCLLGGFVKKKKIVEARQFFDSMNVRDVVSWNTIITGYAQSGKIDEARQLFDESPVQ 291

Query: 271 NVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKIGALDAGLRIH 330
           +V +WT MV+G+ QN   E+A E F  M E     N+ +  + L+   +   ++    + 
Sbjct: 292 DVFTWTAMVSGYIQNRMVEEARELFDKMPER----NEVSWNAMLAGYVQGERMEMAKELF 351

Query: 331 NYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETKEKGLLIWSVMIWGWAIHGHFR 390
           + +       N+     ++  YA+CG I  A+ +F +  ++  + W+ MI G++  GH  
Sbjct: 352 DVMPCR----NVSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMIAGYSQSGHSF 411

Query: 391 KALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVNEGLKFFDNMRRGYLIEPSMKHYTLV 450
           +AL+ F  M+  G + +   F + L+ C+    +  G +    + +G           L+
Sbjct: 412 EALRLFVQMEREGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALL 471

Query: 451 VDMLGRAGRLDEALKFIRAMPITPDFVVWGALFCACRTHKNVEMA 496
           + M  + G ++EA    + M    D V W  +      H   E+A
Sbjct: 472 L-MYCKCGSIEEANDLFKEM-AGKDIVSWNTMIAGYSRHGFGEVA 494


HSP 3 Score: 112.5 bits (280), Expect = 1.1e-24
Identity = 94/360 (26.11%), Postives = 160/360 (44.44%), Query Frame = 1

Query: 182 VDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDT 241
           +  Y++      AL+VF   P      S + +N +I GY R G+   A +LFD MP++D 
Sbjct: 71  ISSYMRTGRCNEALRVFKRMPR----WSSVSYNGMISGYLRNGEFELARKLFDEMPERDL 130

Query: 242 GSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEE 301
            SWN +I G+++  ++G+A+ELF  MPE++V SW TM++G++QNG  + A   F  M E+
Sbjct: 131 VSWNVMIKGYVRNRNLGKARELFEIMPERDVCSWNTMLSGYAQNGCVDDARSVFDRMPEK 190

Query: 302 GARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHA 361
               ND +  + LSA  +   ++    +  + S   +   LV    L+  + K   I  A
Sbjct: 191 ----NDVSWNALLSAYVQNSKMEEACML--FKSRENWA--LVSWNCLLGGFVKKKKIVEA 250

Query: 362 EKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHS 421
            + F     + ++ W+ +I G+A  G   +A Q F+         D   + A+++    +
Sbjct: 251 RQFFDSMNVRDVVSWNTIITGYAQSGKIDEARQLFD----ESPVQDVFTWTAMVSGYIQN 310

Query: 422 GQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGA 481
             V E  + FD M      E +   +  ++    +  R++ A +    MP   +   W  
Sbjct: 311 RMVEEARELFDKMP-----ERNEVSWNAMLAGYVQGERMEMAKELFDVMPCR-NVSTWNT 370

Query: 482 LFCACRTHKNVEMAELASKKLLQLEPKH-PGSYVFLSNAYASVGRWDDAERVRVSMRDHG 541
           +         +  A    K L    PK  P S+  +   Y+  G   +A R+ V M   G
Sbjct: 371 MITGYAQCGKISEA----KNLFDKMPKRDPVSWAAMIAGYSQSGHSFEALRLFVQMEREG 404


HSP 4 Score: 100.9 bits (250), Expect = 3.2e-21
Identity = 90/356 (25.28%), Postives = 158/356 (44.38%), Query Frame = 1

Query: 81  ISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSISFFVLMLKWKISPDR 140
           ISS       + A+ +F+R    +S  +N +I G   N  FE +   F  M      P+R
Sbjct: 71  ISSYMRTGRCNEALRVFKRMPRWSSVSYNGMISGYLRNGEFELARKLFDEM------PER 130

Query: 141 LTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDE 200
               + +     + N  +G+A    + +   E D     +++  Y +   +  A  VFD 
Sbjct: 131 DLVSWNVMIKGYVRNRNLGKARE--LFEIMPERDVCSWNTMLSGYAQNGCVDDARSVFDR 190

Query: 201 SPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRA 260
            PE  KN   + WN L+  Y +   + +A  LF S       SWN L+ GF+K   +  A
Sbjct: 191 MPE--KND--VSWNALLSAYVQNSKMEEACMLFKSRENWALVSWNCLLGGFVKKKKIVEA 250

Query: 261 KELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKI 320
           ++ F  M  ++VVSW T++ G++Q+G  ++A + F    +E    + +T  + +S   + 
Sbjct: 251 RQFFDSMNVRDVVSWNTIITGYAQSGKIDEARQLF----DESPVQDVFTWTAMVSGYIQN 310

Query: 321 GALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVFHETKEKGLLIWSVMI 380
             ++    + + +     + N V   A++  Y +   +E A+++F     + +  W+ MI
Sbjct: 311 RMVEEARELFDKMP----ERNEVSWNAMLAGYVQGERMEMAKELFDVMPCRNVSTWNTMI 370

Query: 381 WGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVNEGLKFFDNMRR 437
            G+A  G   +A   F+ M     K D V + A++   S SG   E L+ F  M R
Sbjct: 371 TGYAQCGKISEAKNLFDKM----PKRDPVSWAAMIAGYSQSGHSFEALRLFVQMER 402

BLAST of Csa2G139850 vs. TAIR10
Match: AT1G74630.1 (AT1G74630.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 464.5 bits (1194), Expect = 1.1e-130
Identity = 242/641 (37.75%), Postives = 371/641 (57.88%), Query Frame = 1

Query: 44  HFIDLIHASNSTHKLRQIHGQLYRCNVFSSSRVVTQFISSC--SSLNSVDYAISIFQRFE 103
           H + L+++  +   L QIHG   +  V + S    + I  C  S  +++ YA  +   F 
Sbjct: 7   HCLSLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFP 66

Query: 104 LKNSYLFNALIRGLAENSRFESSISFFV-LMLKWKISPDRLTFPFVLKSAAALSNGGVGR 163
             ++++FN L+RG +E+    +S++ FV +M K  + PD  +F FV+K+     +   G 
Sbjct: 67  EPDAFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGF 126

Query: 164 ALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGY 223
            +HC  LK GLE   FV  +L+ MY     +  A KVFDE  +     +++ WN +I   
Sbjct: 127 QMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQP----NLVAWNAVITAC 186

Query: 224 CRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVN 283
            R  D+  A E+FD M  ++  SWN ++ G++K G++  AK +F +MP ++ VSW+TM+ 
Sbjct: 187 FRGNDVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIV 246

Query: 284 GFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKL 343
           G + NG   ++   F  +   G  PN+ ++   LSAC++ G+ + G  +H ++   G+  
Sbjct: 247 GIAHNGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSW 306

Query: 344 NLVIGTALVDMYAKCGNIEHAEKVFHETKEKGLLI-WSVMIWGWAIHGHFRKALQYFEWM 403
            + +  AL+DMY++CGN+  A  VF   +EK  ++ W+ MI G A+HG   +A++ F  M
Sbjct: 307 IVSVNNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEM 366

Query: 404 KFTGTKPDSVVFLAVLNACSHSGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGR 463
              G  PD + F+++L+ACSH+G + EG  +F  M+R Y IEP ++HY  +VD+ GR+G+
Sbjct: 367 TAYGVTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGK 426

Query: 464 LDEALKFIRAMPITPDFVVWGALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNA 523
           L +A  FI  MPI P  +VW  L  AC +H N+E+AE   ++L +L+P + G  V LSNA
Sbjct: 427 LQKAYDFICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLLSNA 486

Query: 524 YASVGRWDDAERVRVSMRDHGAHKDPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEI 583
           YA+ G+W D   +R SM      K   WS +EV   +++F AG+      +E + KL EI
Sbjct: 487 YATAGKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFTAGEKKKGIDIEAHEKLKEI 546

Query: 584 SASAR-EKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRV 643
               + E GYT E+   L+++EEEEKE+ +  HSEKLALAF +     G  +RIVKNLR+
Sbjct: 547 ILRLKDEAGYTPEVASALYDVEEEEKEDQVSKHSEKLALAFALARLSKGANIRIVKNLRI 606

Query: 644 CVDCHSFMKYASKMSKREIILRDMKRFHHFNDGVCSCGDYW 680
           C DCH+ MK  SK+   EI++RD  RFH F DG CSC DYW
Sbjct: 607 CRDCHAVMKLTSKVYGVEILVRDRNRFHSFKDGSCSCRDYW 643

BLAST of Csa2G139850 vs. NCBI nr
Match: gi|449442481|ref|XP_004139010.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g04840 [Cucumis sativus])

HSP 1 Score: 1381.3 bits (3574), Expect = 0.0e+00
Identity = 679/679 (100.00%), Postives = 679/679 (100.00%), Query Frame = 1

Query: 1   MLLRRNGSGSNIMKDLHVLFNPRIAFFSSMFSSSSPPISFLETHFIDLIHASNSTHKLRQ 60
           MLLRRNGSGSNIMKDLHVLFNPRIAFFSSMFSSSSPPISFLETHFIDLIHASNSTHKLRQ
Sbjct: 1   MLLRRNGSGSNIMKDLHVLFNPRIAFFSSMFSSSSPPISFLETHFIDLIHASNSTHKLRQ 60

Query: 61  IHGQLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSR 120
           IHGQLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSR
Sbjct: 61  IHGQLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSR 120

Query: 121 FESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVS 180
           FESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVS
Sbjct: 121 FESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVS 180

Query: 181 LVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKD 240
           LVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKD
Sbjct: 181 LVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKD 240

Query: 241 TGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLE 300
           TGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLE
Sbjct: 241 TGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLE 300

Query: 301 EGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEH 360
           EGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEH
Sbjct: 301 EGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEH 360

Query: 361 AEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSH 420
           AEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSH
Sbjct: 361 AEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSH 420

Query: 421 SGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWG 480
           SGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWG
Sbjct: 421 SGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWG 480

Query: 481 ALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHG 540
           ALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHG
Sbjct: 481 ALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHG 540

Query: 541 AHKDPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIE 600
           AHKDPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIE
Sbjct: 541 AHKDPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIE 600

Query: 601 EEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILR 660
           EEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILR
Sbjct: 601 EEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILR 660

Query: 661 DMKRFHHFNDGVCSCGDYW 680
           DMKRFHHFNDGVCSCGDYW
Sbjct: 661 DMKRFHHFNDGVCSCGDYW 679

BLAST of Csa2G139850 vs. NCBI nr
Match: gi|659114785|ref|XP_008457226.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g04840 [Cucumis melo])

HSP 1 Score: 1322.0 bits (3420), Expect = 0.0e+00
Identity = 651/679 (95.88%), Postives = 663/679 (97.64%), Query Frame = 1

Query: 1   MLLRRNGSGSNIMKDLHVLFNPRIAFFSSMFSSSSPPISFLETHFIDLIHASNSTHKLRQ 60
           MLL RNG+GSNIMKDLHVLFNPRIAF SSMFSSSS  IS LETHFIDLIHASNSTHKLRQ
Sbjct: 1   MLLPRNGTGSNIMKDLHVLFNPRIAFLSSMFSSSSLRISSLETHFIDLIHASNSTHKLRQ 60

Query: 61  IHGQLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSR 120
           IHGQLYRCNVFSSSRVVTQFISSCS LN+VDYA+SIFQRFELKNSYLFNALIRGLAENSR
Sbjct: 61  IHGQLYRCNVFSSSRVVTQFISSCSLLNAVDYAVSIFQRFELKNSYLFNALIRGLAENSR 120

Query: 121 FESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVS 180
           FESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGL FDSFVRVS
Sbjct: 121 FESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLVFDSFVRVS 180

Query: 181 LVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKD 240
           LVDMYVKV ELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKD
Sbjct: 181 LVDMYVKVGELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKD 240

Query: 241 TGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLE 300
           TGSWNSLINGFMKMGDMGRAKELF KMPEKNVVSWTTMVNGFSQNGDP+KALETFFCMLE
Sbjct: 241 TGSWNSLINGFMKMGDMGRAKELFEKMPEKNVVSWTTMVNGFSQNGDPQKALETFFCMLE 300

Query: 301 EGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEH 360
           EGARPNDYTIVSALSACAKIGALDAGL IHNYLSGNGFKLNLVIGTALVDM+AKCGNIE+
Sbjct: 301 EGARPNDYTIVSALSACAKIGALDAGLSIHNYLSGNGFKLNLVIGTALVDMHAKCGNIEY 360

Query: 361 AEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSH 420
           AEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSH
Sbjct: 361 AEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSH 420

Query: 421 SGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWG 480
           SGQVNEGLKFFD+MRR YLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWG
Sbjct: 421 SGQVNEGLKFFDSMRRSYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWG 480

Query: 481 ALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHG 540
           ALFCACR HKNVEMAELAS+KLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRD G
Sbjct: 481 ALFCACRAHKNVEMAELASEKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDSG 540

Query: 541 AHKDPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIE 600
           AHKDPGWSFIEVDHKLHRFVAGDNTH+RAVEIYS LDEISASAREKGYTKEIECVLHNIE
Sbjct: 541 AHKDPGWSFIEVDHKLHRFVAGDNTHSRAVEIYSMLDEISASAREKGYTKEIECVLHNIE 600

Query: 601 EEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILR 660
           EEEKEEALGYHSEKLALAFGI+STRPGTTVRIVKNLRVCVDCHSFMKY SK++KREIILR
Sbjct: 601 EEEKEEALGYHSEKLALAFGILSTRPGTTVRIVKNLRVCVDCHSFMKYTSKLTKREIILR 660

Query: 661 DMKRFHHFNDGVCSCGDYW 680
           DMKRFHHF DGVCSCGDYW
Sbjct: 661 DMKRFHHFYDGVCSCGDYW 679

BLAST of Csa2G139850 vs. NCBI nr
Match: gi|359477907|ref|XP_002270439.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g04840 [Vitis vinifera])

HSP 1 Score: 934.1 bits (2413), Expect = 1.4e-268
Identity = 459/676 (67.90%), Postives = 546/676 (80.77%), Query Frame = 1

Query: 8   SGSNIMKDLHVLFNPRIAFFSSMFSSSSP----PISFLETHFIDLIHASNSTHKLRQIHG 67
           S S  +K L+ LF P      +   +++     P    ETHFI LIHASN+  +L QIH 
Sbjct: 2   SKSQGLKALNALFKPTSPPAKTTTVTTTTRAHGPSRSPETHFIPLIHASNTLPQLHQIHA 61

Query: 68  QLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFES 127
           Q++  N+FS+SRVVTQ ISS  SL S+DYA+SIF+ F+  N ++FNALIRGLAENSRFE 
Sbjct: 62  QIFLHNLFSNSRVVTQLISSSCSLKSLDYALSIFRCFDHPNLFVFNALIRGLAENSRFEG 121

Query: 128 SISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVD 187
           S+S FVLML+  I PDRLT PFVLKS AAL + G+GR LH G++K GLEFDSFVRVSLVD
Sbjct: 122 SVSHFVLMLRLSIRPDRLTLPFVLKSVAALVDVGLGRCLHGGVMKLGLEFDSFVRVSLVD 181

Query: 188 MYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGS 247
           MYVK+ ELG  L++FDESP+  K  S+L+WNVLI+G C++GDL KA  LF++MP+++ GS
Sbjct: 182 MYVKIGELGFGLQLFDESPQRNKAESILLWNVLINGCCKVGDLSKAASLFEAMPERNAGS 241

Query: 248 WNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGA 307
           WNSLINGF++ GD+ RA+ELFV+MPEKNVVSWTTM+NGFSQNGD EKAL  F+ MLEEG 
Sbjct: 242 WNSLINGFVRNGDLDRARELFVQMPEKNVVSWTTMINGFSQNGDHEKALSMFWRMLEEGV 301

Query: 308 RPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEK 367
           RPND T+VSAL AC KIGAL  G RIHNYLS NGF+LN  IGTALVDMYAKCGNI+ A +
Sbjct: 302 RPNDLTVVSALLACTKIGALQVGERIHNYLSSNGFQLNRGIGTALVDMYAKCGNIKSASR 361

Query: 368 VFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQ 427
           VF ETK K LL WSVMIWGWAIHG F +ALQ F  MK  G  PD V+FLA+L ACSHSG 
Sbjct: 362 VFVETKGKDLLTWSVMIWGWAIHGCFDQALQCFVKMKSAGINPDEVIFLAILTACSHSGN 421

Query: 428 VNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALF 487
           V++GL FF++MR  Y IEP+MKHYTL+VD+LGRAGRLDEAL FI++MPI PDFV+WGALF
Sbjct: 422 VDQGLNFFESMRLDYSIEPTMKHYTLIVDLLGRAGRLDEALSFIQSMPINPDFVIWGALF 481

Query: 488 CACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHK 547
           CACR HKN+EMAEL ++KLLQLEPKHPGSYVFLSN YA+VGRW+D ERVR  M++ G  K
Sbjct: 482 CACRAHKNIEMAELTAEKLLQLEPKHPGSYVFLSNVYAAVGRWEDVERVRTLMKNRGVEK 541

Query: 548 DPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEE 607
           DPGWS+IEV+ ++H FVAGD+ H RA EI  KL+EI+ASA+++GY  E   VLHNIEEEE
Sbjct: 542 DPGWSYIEVEGQVHSFVAGDHAHVRAEEISLKLEEITASAKQEGYMPETAWVLHNIEEEE 601

Query: 608 KEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMK 667
           KE+ALG HSEKLALAFG++ST PG+T+RIVKNLRVC DCHS MKYASK+S+REIILRD+K
Sbjct: 602 KEDALGSHSEKLALAFGLISTAPGSTIRIVKNLRVCGDCHSMMKYASKLSRREIILRDIK 661

Query: 668 RFHHFNDGVCSCGDYW 680
           RFHHF DG CSCGDYW
Sbjct: 662 RFHHFKDGTCSCGDYW 677

BLAST of Csa2G139850 vs. NCBI nr
Match: gi|590656604|ref|XP_007034318.1| (Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 893.6 bits (2308), Expect = 2.1e-256
Identity = 443/682 (64.96%), Postives = 532/682 (78.01%), Query Frame = 1

Query: 13  MKDLHVLFN-----PRIAFFSSMFSSSSPPISF----------LETHFIDLIHASNSTHK 72
           MK L +LF       +    SS F    PPIS           L+THF  LI +S +T +
Sbjct: 1   MKSLRLLFERNSSPAKTNSSSSSFKKPKPPISHGSSSSSSQDPLKTHFASLIQSSKTTLQ 60

Query: 73  LRQIHGQLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAE 132
           LRQIH Q++R N+ SSS + T  IS+ SSL S+ YAIS+F  F  K+ +LFNALIRGL +
Sbjct: 61  LRQIHAQIFRRNLSSSSNLTTLLISASSSLKSIPYAISLFNHFHHKSIFLFNALIRGLTD 120

Query: 133 NSRFESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFV 192
           NS  ESSIS F+LML   + PD+LT+PFVLKS A L    +G  LH  I+K G+EFDSFV
Sbjct: 121 NSLLESSISHFLLMLSLGVRPDKLTYPFVLKSIAGLGLRCLGLILHGRIIKSGVEFDSFV 180

Query: 193 RVSLVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMP 252
           RV+LV+MYVK++ELG AL+VFDESPE  K+GS+L+WNVLI+GYC+ G+L KA ELF++ P
Sbjct: 181 RVALVEMYVKLKELGFALQVFDESPERNKSGSILLWNVLINGYCKDGNLGKAMELFEATP 240

Query: 253 KKDTGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFC 312
           +++ GSWNSLINGFM+ GD+ +A ELF +M EK+VVSWTTMVNGFSQNGD EKAL  FF 
Sbjct: 241 ERNIGSWNSLINGFMRNGDLDKAVELFDEMKEKDVVSWTTMVNGFSQNGDHEKALSMFFK 300

Query: 313 MLEEGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGN 372
           MLE   RPND T+V ALSACAKIGAL+AG RIH+Y+  NGF+LN  IG ALVDMYAKCG+
Sbjct: 301 MLEAALRPNDLTLVPALSACAKIGALEAGARIHDYVLENGFRLNKAIGAALVDMYAKCGD 360

Query: 373 IEHAEKVFHETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNA 432
           I+ A KVF ETKE+ +L WSVMIWGWAIHG++ +A+Q F+ M F+G KPD VVFLA+L A
Sbjct: 361 IQSASKVFDETKERDILTWSVMIWGWAIHGYYEQAIQCFKKMMFSGIKPDGVVFLALLTA 420

Query: 433 CSHSGQVNEGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFV 492
           CSHSGQVN GL FFD+MR  Y IEP+MKHYTLVVD+LGRAG+LDE+LKFI+ MP++PDFV
Sbjct: 421 CSHSGQVNLGLNFFDSMRLDYSIEPTMKHYTLVVDLLGRAGQLDESLKFIQRMPMSPDFV 480

Query: 493 VWGALFCACRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMR 552
            WGALFCACR HKN++MAEL S+ LLQLEPKHPGSYVFLSN YA+VGRW+D ERVR+ M+
Sbjct: 481 TWGALFCACRAHKNIKMAELVSQNLLQLEPKHPGSYVFLSNVYAAVGRWEDVERVRMLMQ 540

Query: 553 DHGAHKDPGWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLH 612
           +    KDPGWS+IEV  ++H FVAGD+ H  A EIY KL+EI A  R+ GY  E   VLH
Sbjct: 541 NRAVDKDPGWSYIEVGGEMHSFVAGDHAHKHAREIYLKLEEIVAGTRQHGYMPETGWVLH 600

Query: 613 NIEEEEKEEALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREI 672
           NIEEEEKE+ALG HSEKLALAF ++ T PGTT+RIVKNLRVC DCHS MKYASKMS+REI
Sbjct: 601 NIEEEEKEDALGSHSEKLALAFALIRTSPGTTIRIVKNLRVCGDCHSLMKYASKMSQREI 660

Query: 673 ILRDMKRFHHFNDGVCSCGDYW 680
           +LRD+KRFHHF DG CSCGDYW
Sbjct: 661 VLRDIKRFHHFKDGACSCGDYW 682

BLAST of Csa2G139850 vs. NCBI nr
Match: gi|802588866|ref|XP_012071119.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g04840 [Jatropha curcas])

HSP 1 Score: 888.6 bits (2295), Expect = 6.7e-255
Identity = 432/674 (64.09%), Postives = 530/674 (78.64%), Query Frame = 1

Query: 13  MKDLHVLF---NPRIAFFSSMFSSSSPPISFL----ETHFIDLIHASNSTHKLRQIHGQL 72
           M+  H LF   N      SS   +SSP  +      ETH I LIHAS ++ +L QIH Q+
Sbjct: 1   MRSRHALFKAKNSPAKTTSSREPTSSPNKALSQNPSETHLISLIHASKTSRQLHQIHAQI 60

Query: 73  YRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRFELKNSYLFNALIRGLAENSRFESSI 132
           +  N+ +SS++ TQ ISS SS   +DYAI++F  +  KNS+LFNALIRGL  NS FES+I
Sbjct: 61  FLHNLSTSSQIATQLISSSSSRKFIDYAITVFNHYYPKNSFLFNALIRGLTNNSLFESAI 120

Query: 133 SFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMY 192
           S F+LML+  + PD+LT+PFVLKS A L + G+GRALH  I K G EFD FVR+S+VD Y
Sbjct: 121 SHFILMLRSDVKPDQLTYPFVLKSIATLCSEGLGRALHGMIYKSGFEFDLFVRISMVDAY 180

Query: 193 VKVEELGSALKVFDESPESVKNGSVLIWNVLIHGYCRMGDLVKATELFDSMPKKDTGSWN 252
           VKVEELGSALK+FDESP+     S L+WNVLI+G C++G + KA +LF++MP++ T SWN
Sbjct: 181 VKVEELGSALKLFDESPQRFYGESTLLWNVLINGCCKVGSMRKAVDLFETMPERTTASWN 240

Query: 253 SLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARP 312
           SLINGF++ GD+ RA ELF +MPEKNVVSWTTMVNG S NGD EKAL  F  ML+ G +P
Sbjct: 241 SLINGFLRSGDLERANELFGRMPEKNVVSWTTMVNGLSHNGDHEKALSLFSKMLQVGVKP 300

Query: 313 NDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKLNLVIGTALVDMYAKCGNIEHAEKVF 372
           ND+TIVSALSACAKIGAL+AG+RIH YL+ NGF+LN  IGTALVDMYAKCG+IE A +VF
Sbjct: 301 NDFTIVSALSACAKIGALEAGVRIHRYLTDNGFRLNAKIGTALVDMYAKCGSIESASQVF 360

Query: 373 HETKEKGLLIWSVMIWGWAIHGHFRKALQYFEWMKFTGTKPDSVVFLAVLNACSHSGQVN 432
            ETKEK +L W+VMIWGWAIHGH  +A+Q F  M + G +PD VVFLA+L AC+H+G+V+
Sbjct: 361 RETKEKDVLTWTVMIWGWAIHGHSEEAIQCFRQMMYAGIRPDEVVFLAILTACTHAGKVD 420

Query: 433 EGLKFFDNMRRGYLIEPSMKHYTLVVDMLGRAGRLDEALKFIRAMPITPDFVVWGALFCA 492
            GL FF +M   Y IEPSMKHY L+VD+LGRAGRL++ALKFI  MPITPDFV+WGALFC 
Sbjct: 421 LGLNFFKSMELDYSIEPSMKHYALIVDLLGRAGRLNQALKFIERMPITPDFVIWGALFCT 480

Query: 493 CRTHKNVEMAELASKKLLQLEPKHPGSYVFLSNAYASVGRWDDAERVRVSMRDHGAHKDP 552
           CR HKN+++AELA++KLL+LEPKHPGSYVFLSN YA+VGRW+DAERVR  M++ G  KDP
Sbjct: 481 CRAHKNIKLAELAAQKLLELEPKHPGSYVFLSNVYAAVGRWEDAERVRSLMQNRGIEKDP 540

Query: 553 GWSFIEVDHKLHRFVAGDNTHNRAVEIYSKLDEISASAREKGYTKEIECVLHNIEEEEKE 612
           GWS++EV+ ++H F AGD++H  A +IY KL++I A A+ +GY    E VLHNIEEEEKE
Sbjct: 541 GWSYVEVEGQVHSFAAGDSSHKDAKDIYLKLEQIVAGAKGQGYMPGTEWVLHNIEEEEKE 600

Query: 613 EALGYHSEKLALAFGIVSTRPGTTVRIVKNLRVCVDCHSFMKYASKMSKREIILRDMKRF 672
           +ALG HSEKLALAFG++ T PG T+RIVKNLRVC DCHS MKYASKMS+REIILRD+KRF
Sbjct: 601 DALGSHSEKLALAFGLIRTSPGMTLRIVKNLRVCGDCHSLMKYASKMSQREIILRDIKRF 660

Query: 673 HHFNDGVCSCGDYW 680
           HHF DG+CSCGDYW
Sbjct: 661 HHFKDGICSCGDYW 674

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR10_ARATH4.2e-20953.88Pentatricopeptide repeat-containing protein At1g04840 OS=Arabidopsis thaliana GN... [more]
PP367_ARATH2.4e-13239.22Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN... [more]
PP425_ARATH6.7e-13039.46Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana GN... [more]
PP301_ARATH1.5e-12939.55Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana GN... [more]
PP122_ARATH1.9e-12937.75Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
F6GWJ6_VITVI9.6e-26967.90Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0029g01130 PE=4 SV=... [more]
A0A061EK73_THECC1.4e-25664.96Tetratricopeptide repeat-like superfamily protein isoform 1 OS=Theobroma cacao G... [more]
A0A067KWK1_JATCU4.6e-25564.09Uncharacterized protein OS=Jatropha curcas GN=JCGZ_01110 PE=4 SV=1[more]
A0A0D2PM74_GOSRA4.8e-25266.56Uncharacterized protein OS=Gossypium raimondii GN=B456_005G031300 PE=4 SV=1[more]
B9GFV9_POPTR6.3e-23662.17Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP... [more]
Match NameE-valueIdentityDescription
AT1G04840.12.4e-21053.88 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G06540.11.4e-13339.22 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G48910.13.8e-13139.46 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G02750.18.4e-13139.55 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G74630.11.1e-13037.75 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449442481|ref|XP_004139010.1|0.0e+00100.00PREDICTED: pentatricopeptide repeat-containing protein At1g04840 [Cucumis sativu... [more]
gi|659114785|ref|XP_008457226.1|0.0e+0095.88PREDICTED: pentatricopeptide repeat-containing protein At1g04840 [Cucumis melo][more]
gi|359477907|ref|XP_002270439.2|1.4e-26867.90PREDICTED: pentatricopeptide repeat-containing protein At1g04840 [Vitis vinifera... [more]
gi|590656604|ref|XP_007034318.1|2.1e-25664.96Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao][more]
gi|802588866|ref|XP_012071119.1|6.7e-25564.09PREDICTED: pentatricopeptide repeat-containing protein At1g04840 [Jatropha curca... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0008270zinc ion binding
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa2G139850.1Csa2G139850.1mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 446..470
score: 0.027coord: 375..397
score: 0.53coord: 346..373
score: 2.2E-4coord: 513..540
score: 0.55coord: 409..436
score: 0.012coord: 107..134
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 210..236
score: 2.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 270..319
score: 1.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 346..374
score: 4.1E-4coord: 273..306
score: 3.5E-6coord: 212..239
score: 5.0E-7coord: 243..273
score: 8.7E-6coord: 107..139
score: 6.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 407..437
score: 8.725coord: 271..305
score: 12.101coord: 341..375
score: 9.504coord: 509..543
score: 8.133coord: 376..406
score: 6.873coord: 209..243
score: 10.863coord: 443..473
score: 7.607coord: 244..270
score: 7.048coord: 104..138
score: 10.041coord: 174..204
score: 6.193coord: 306..340
score: 7
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 184..289
score: 9.0E-6coord: 458..532
score: 9.4E-9coord: 290..393
score: 9.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 415..532
score: 9.32E-9coord: 348..379
score: 9.32E-9coord: 183..249
score: 9.3
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 22..550
score:
NoneNo IPR availablePANTHERPTHR24015:SF790SUBFAMILY NOT NAMEDcoord: 22..550
score: