Csa7G043000 (gene) Cucumber (Chinese Long) v2

NameCsa7G043000
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionPentatricopeptide repeat-containing protein, putative; contains IPR002885 (Pentatricopeptide repeat)
LocationChr7 : 2330709 .. 2332142 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTACGTCACACCCCTCCGAGTGGTTACTTCAACTTCTTCAACGTTTCCTCAAGAACTCAACTAAAATCCCACAAATCCACGCCCTTCTACTCACCCAAGGCCATCTCTTCAACAATTCCCAAACTTCCTCCAATCCGAAATGCCTCATCACCCTTCTCTACAACACTCTCATCAGAGCCTATTTAAACTTTAACCGCCCTCGTTTCGCCCTCCTTCTCTACACCCAAATGCTCTCCCAACAAACCAAGCCCAACTTCCACACTTTCCCTTCCATCATCAAATCCGCCACCATTTGTTCCTCTCTTCTCCCCAAATTGATCCACGCACATGCTTTCAAGATCGGCGTTCTCACCGACCCCGTTGTCCTGACATCCTTTGTTTCCTCTTACGCCGACCTTCGTGAACTGGCTAATGCACGTAAGGTGTTTGATGAAATTACCAATCCTTGTATTGTTGCGTTTAACTCTATGCTTGATGCTTTTGTCAAAAATGGAGACTTGGGTTCTGCTGTTTTCATGTTTAGGTCCATGCCAGAGCATGATGTGGTTTCTTGGACTAGTGTTATTAATGGGTTTTGGTGGAATGGGCGCTTTCTTGAGGCTCTCTGGTTTTTTCATGTGATGATGATGAGTGGTTCGGTTAAGCCAAATGAAGCTACTTATGTGAGTGTACTTTCTTCTTCTGCTAATTTAGATGCTGAAGGAGTTCTTTGTCGTGGGAAAGAAGTACATGCTTACATTATTAGAAATGAGGGTGAGTTTAGTGTTTTTATAGGGACGGGGTTGATAGATTTTTATGGGAAGATGGGATTATTAGGATGTGCGAGAACTGTTTTTAATCAAATGAAGAAGAGGGAGGTCTGCACTTGGAATGCAATGATCTCTTCATTTGCATCCAATGGTAGAGAAACAGAGGCATTGGACCTATTTGCCACTATGAAGGTTGAAGGAATACATCCTAATGAGGTTACATTTGTTGCAATTCTCACAGCGTGTGCTCGTGGCAAGCTCGTCAAATTGGGATTGCAGTTATTCCAATCAATGTTATACGATTTCTCAATTGTACCAATTACTGAGCATTATGTATGTGTGGTTGATCTCTTAGGCAAAGCAGGGCTTTTGCGAGAGGCAACTGAATTCATAGAGTCCATGCCATTTGATCCGGATGCTTCTGTTTTGGGAGCTCTTTTGAGTGCATGTAAAATTCATGGAGCCACTGAACTGGGGAATGAAGTGGGAAGAAGATTGCTTGAGATGCAGCCACGACATTGTGGTCGGTATGTGACTTTGGCGAGTATGAATGCTGGAGCAGAGAAATGGAATCGTGCTGCAGTTATAAGACGTGTAATGGCAGACGCCAGGATTCAGAAAACTCCAGCTTATAGTAGAGTAGATCCAATGCAAAATCTGGTGCTAGTATCACCCTCTTGA

mRNA sequence

ATGTTTACGTCACACCCCTCCGAGTGGTTACTTCAACTTCTTCAACGTTTCCTCAAGAACTCAACTAAAATCCCACAAATCCACGCCCTTCTACTCACCCAAGGCCATCTCTTCAACAATTCCCAAACTTCCTCCAATCCGAAATGCCTCATCACCCTTCTCTACAACACTCTCATCAGAGCCTATTTAAACTTTAACCGCCCTCGTTTCGCCCTCCTTCTCTACACCCAAATGCTCTCCCAACAAACCAAGCCCAACTTCCACACTTTCCCTTCCATCATCAAATCCGCCACCATTTGTTCCTCTCTTCTCCCCAAATTGATCCACGCACATGCTTTCAAGATCGGCGTTCTCACCGACCCCGTTGTCCTGACATCCTTTGTTTCCTCTTACGCCGACCTTCGTGAACTGGCTAATGCACGTAAGGTGTTTGATGAAATTACCAATCCTTGTATTGTTGCGTTTAACTCTATGCTTGATGCTTTTGTCAAAAATGGAGACTTGGGTTCTGCTGTTTTCATGTTTAGGTCCATGCCAGAGCATGATGTGGTTTCTTGGACTAGTGTTATTAATGGGTTTTGGTGGAATGGGCGCTTTCTTGAGGCTCTCTGGTTTTTTCATGTGATGATGATGAGTGGTTCGGTTAAGCCAAATGAAGCTACTTATGTGAGTGTACTTTCTTCTTCTGCTAATTTAGATGCTGAAGGAGTTCTTTGTCGTGGGAAAGAAGTACATGCTTACATTATTAGAAATGAGGGTGAGTTTAGTGTTTTTATAGGGACGGGGTTGATAGATTTTTATGGGAAGATGGGATTATTAGGATGTGCGAGAACTGTTTTTAATCAAATGAAGAAGAGGGAGGTCTGCACTTGGAATGCAATGATCTCTTCATTTGCATCCAATGGTAGAGAAACAGAGGCATTGGACCTATTTGCCACTATGAAGGTTGAAGGAATACATCCTAATGAGGTTACATTTGTTGCAATTCTCACAGCGTGTGCTCGTGGCAAGCTCGTCAAATTGGGATTGCAGTTATTCCAATCAATGTTATACGATTTCTCAATTGTACCAATTACTGAGCATTATGTATGTGTGGTTGATCTCTTAGGCAAAGCAGGGCTTTTGCGAGAGGCAACTGAATTCATAGAGTCCATGCCATTTGATCCGGATGCTTCTGTTTTGGGAGCTCTTTTGAGTGCATGTAAAATTCATGGAGCCACTGAACTGGGGAATGAAGTGGGAAGAAGATTGCTTGAGATGCAGCCACGACATTGTGGTCGGTATGTGACTTTGGCGAGTATGAATGCTGGAGCAGAGAAATGGAATCGTGCTGCAGTTATAAGACGTGTAATGGCAGACGCCAGGATTCAGAAAACTCCAGCTTATAGTAGAGTAGATCCAATGCAAAATCTGGTGCTAGTATCACCCTCTTGA

Coding sequence (CDS)

ATGTTTACGTCACACCCCTCCGAGTGGTTACTTCAACTTCTTCAACGTTTCCTCAAGAACTCAACTAAAATCCCACAAATCCACGCCCTTCTACTCACCCAAGGCCATCTCTTCAACAATTCCCAAACTTCCTCCAATCCGAAATGCCTCATCACCCTTCTCTACAACACTCTCATCAGAGCCTATTTAAACTTTAACCGCCCTCGTTTCGCCCTCCTTCTCTACACCCAAATGCTCTCCCAACAAACCAAGCCCAACTTCCACACTTTCCCTTCCATCATCAAATCCGCCACCATTTGTTCCTCTCTTCTCCCCAAATTGATCCACGCACATGCTTTCAAGATCGGCGTTCTCACCGACCCCGTTGTCCTGACATCCTTTGTTTCCTCTTACGCCGACCTTCGTGAACTGGCTAATGCACGTAAGGTGTTTGATGAAATTACCAATCCTTGTATTGTTGCGTTTAACTCTATGCTTGATGCTTTTGTCAAAAATGGAGACTTGGGTTCTGCTGTTTTCATGTTTAGGTCCATGCCAGAGCATGATGTGGTTTCTTGGACTAGTGTTATTAATGGGTTTTGGTGGAATGGGCGCTTTCTTGAGGCTCTCTGGTTTTTTCATGTGATGATGATGAGTGGTTCGGTTAAGCCAAATGAAGCTACTTATGTGAGTGTACTTTCTTCTTCTGCTAATTTAGATGCTGAAGGAGTTCTTTGTCGTGGGAAAGAAGTACATGCTTACATTATTAGAAATGAGGGTGAGTTTAGTGTTTTTATAGGGACGGGGTTGATAGATTTTTATGGGAAGATGGGATTATTAGGATGTGCGAGAACTGTTTTTAATCAAATGAAGAAGAGGGAGGTCTGCACTTGGAATGCAATGATCTCTTCATTTGCATCCAATGGTAGAGAAACAGAGGCATTGGACCTATTTGCCACTATGAAGGTTGAAGGAATACATCCTAATGAGGTTACATTTGTTGCAATTCTCACAGCGTGTGCTCGTGGCAAGCTCGTCAAATTGGGATTGCAGTTATTCCAATCAATGTTATACGATTTCTCAATTGTACCAATTACTGAGCATTATGTATGTGTGGTTGATCTCTTAGGCAAAGCAGGGCTTTTGCGAGAGGCAACTGAATTCATAGAGTCCATGCCATTTGATCCGGATGCTTCTGTTTTGGGAGCTCTTTTGAGTGCATGTAAAATTCATGGAGCCACTGAACTGGGGAATGAAGTGGGAAGAAGATTGCTTGAGATGCAGCCACGACATTGTGGTCGGTATGTGACTTTGGCGAGTATGAATGCTGGAGCAGAGAAATGGAATCGTGCTGCAGTTATAAGACGTGTAATGGCAGACGCCAGGATTCAGAAAACTCCAGCTTATAGTAGAGTAGATCCAATGCAAAATCTGGTGCTAGTATCACCCTCTTGA

Protein sequence

MFTSHPSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIRAYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLTDPVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLDAEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVDPMQNLVLVSPS*
BLAST of Csa7G043000 vs. Swiss-Prot
Match: PPR30_ARATH (Putative pentatricopeptide repeat-containing protein At1g10330 OS=Arabidopsis thaliana GN=PCMP-E71 PE=3 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 7.5e-120
Identity = 230/458 (50.22%), Postives = 309/458 (67.47%), Query Frame = 1

Query: 11  LQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIRAYLNFNRPRF 70
           L LLQRFL +S +I QIH +LLT   L  +   +   KC+    YNTLIR+YL     + 
Sbjct: 17  LHLLQRFLYSSNQIKQIHTVLLTSNALVASRWKT---KCV----YNTLIRSYLTTGEYKT 76

Query: 71  ALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKL---IHAHAFKIGVLTDPVVLTSF 130
           +L L+T ML+   +PN  TFPS+IK+A  CSS        +H  A K G L DP V TSF
Sbjct: 77  SLALFTHMLASHVQPNNLTFPSLIKAA--CSSFSVSYGVALHGQALKRGFLWDPFVQTSF 136

Query: 131 VSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPEHDVVSWT 190
           V  Y ++ +L ++RK+FD+I NPC+VA NS+LDA  +NG++  A   F+ MP  DVVSWT
Sbjct: 137 VRFYGEVGDLESSRKMFDDILNPCVVACNSLLDACGRNGEMDYAFEYFQRMPVTDVVSWT 196

Query: 191 SVINGFWWNGRFLEALWFFHVMMMS--GSVKPNEATYVSVLSSSANLDAEGVLCRGKEVH 250
           +VINGF   G   +AL  F  M+ +    + PNEAT+VSVLSS AN D  G+   GK++H
Sbjct: 197 TVINGFSKKGLHAKALMVFGEMIQNERAVITPNEATFVSVLSSCANFDQGGIRL-GKQIH 256

Query: 251 AYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFASNGRET 310
            Y++  E   +  +GT L+D YGK G L  A T+F+Q++ ++VC WNA+IS+ ASNGR  
Sbjct: 257 GYVMSKEIILTTTLGTALLDMYGKAGDLEMALTIFDQIRDKKVCAWNAIISALASNGRPK 316

Query: 311 EALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITEHYVCV 370
           +AL++F  MK   +HPN +T +AILTACAR KLV LG+QLF S+  ++ I+P +EHY CV
Sbjct: 317 QALEMFEMMKSSYVHPNGITLLAILTACARSKLVDLGIQLFSSICSEYKIIPTSEHYGCV 376

Query: 371 VDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEMQPRHC 430
           VDL+G+AGLL +A  FI+S+PF+PDASVLGALL ACKIH  TELGN VG++L+ +QP+HC
Sbjct: 377 VDLIGRAGLLVDAANFIQSLPFEPDASVLGALLGACKIHENTELGNTVGKQLIGLQPQHC 436

Query: 431 GRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYS 464
           G+YV L++ NA    W+ A  +R+ M +A I+K PAYS
Sbjct: 437 GQYVALSTFNALDSNWSEAEKMRKAMIEAGIRKIPAYS 464

BLAST of Csa7G043000 vs. Swiss-Prot
Match: PP449_ARATH (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 293.9 bits (751), Expect = 3.2e-78
Identity = 181/478 (37.87%), Postives = 263/478 (55.02%), Query Frame = 1

Query: 11  LQLLQRFLKNSTKIPQIHALLLTQGHL-----------FNNSQTSSN--PKCLI------ 70
           +  LQR  K   ++ QIHA +L  G +           F  S TSS+  P   I      
Sbjct: 18  MSCLQRCSKQE-ELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFD 77

Query: 71  ---TLLYNTLIRAYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKL- 130
              T L+N +IR +   + P  +LLLY +ML      N +TFPS++K+ +  S+      
Sbjct: 78  RPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQ 137

Query: 131 IHAHAFKIGVLTDPVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGD 190
           IHA   K+G   D   + S ++SYA       A  +FD I  P  V++NS++  +VK G 
Sbjct: 138 IHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGK 197

Query: 191 LGSAVFMFRSMPEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLS 250
           +  A+ +FR M E + +SWT++I+G+       EAL  FH M  S  V+P+  +  + LS
Sbjct: 198 MDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNS-DVEPDNVSLANALS 257

Query: 251 SSANLDAEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKRE 310
           + A L   G L +GK +H+Y+ +        +G  LID Y K G +  A  VF  +KK+ 
Sbjct: 258 ACAQL---GALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKS 317

Query: 311 VCTWNAMISSFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQ 370
           V  W A+IS +A +G   EA+  F  M+  GI PN +TF A+LTAC+   LV+ G  +F 
Sbjct: 318 VQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFY 377

Query: 371 SMLYDFSIVPITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGAT 430
           SM  D+++ P  EHY C+VDLLG+AGLL EA  FI+ MP  P+A + GALL AC+IH   
Sbjct: 378 SMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNI 437

Query: 431 ELGNEVGRRLLEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRV 466
           ELG E+G  L+ + P H GRYV  A+++A  +KW++AA  RR+M +  + K P  S +
Sbjct: 438 ELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI 490

BLAST of Csa7G043000 vs. Swiss-Prot
Match: PP371_ARATH (Pentatricopeptide repeat-containing protein At5g08510 OS=Arabidopsis thaliana GN=PCMP-E20 PE=2 SV=1)

HSP 1 Score: 289.7 bits (740), Expect = 6.0e-77
Identity = 162/429 (37.76%), Postives = 247/429 (57.58%), Query Frame = 1

Query: 37  LFNNSQTSSNPKCLITLLYNTLIRAYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKS 96
           LF++ Q S       T LYN LI+AY   ++P  +++LY  +     +P+ HTF  I  +
Sbjct: 38  LFDHHQNSC------TFLYNKLIQAYYVHHQPHESIVLYNLLSFDGLRPSHHTFNFIFAA 97

Query: 97  ATICSSLLP-KLIHAHAFKIGVLTDPVVLTSFVSSYADLRELANARKVFDEITNPCIVAF 156
           +   SS  P +L+H+  F+ G  +D    T+ +++YA L  L  AR+VFDE++   +  +
Sbjct: 98  SASFSSARPLRLLHSQFFRSGFESDSFCCTTLITAYAKLGALCCARRVFDEMSKRDVPVW 157

Query: 157 NSMLDAFVKNGDLGSAVFMFRSMPEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSV 216
           N+M+  + + GD+ +A+ +F SMP  +V SWT+VI+GF  NG + EAL  F  M    SV
Sbjct: 158 NAMITGYQRRGDMKAAMELFDSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCMEKDKSV 217

Query: 217 KPNEATYVSVLSSSANLDAEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGC 276
           KPN  T VSVL + ANL   G L  G+ +  Y   N    ++++    I+ Y K G++  
Sbjct: 218 KPNHITVVSVLPACANL---GELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGMIDV 277

Query: 277 ARTVFNQM-KKREVCTWNAMISSFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACA 336
           A+ +F ++  +R +C+WN+MI S A++G+  EAL LFA M  EG  P+ VTFV +L AC 
Sbjct: 278 AKRLFEELGNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACV 337

Query: 337 RGKLVKLGLQLFQSMLYDFSIVPITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVL 396
            G +V  G +LF+SM     I P  EHY C++DLLG+ G L+EA + I++MP  PDA V 
Sbjct: 338 HGGMVVKGQELFKSMEEVHKISPKLEHYGCMIDLLGRVGKLQEAYDLIKTMPMKPDAVVW 397

Query: 397 GALLSACKIHGATELGNEVGRRLLEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADA 456
           G LL AC  HG  E+       L +++P + G  V ++++ A  EKW+    +R++M   
Sbjct: 398 GTLLGACSFHGNVEIAEIASEALFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLMKKE 457

Query: 457 RIQKTPAYS 464
            + K   YS
Sbjct: 458 TMTKAAGYS 457


HSP 2 Score: 83.6 bits (205), Expect = 6.5e-15
Identity = 69/313 (22.04%), Postives = 135/313 (43.13%), Query Frame = 1

Query: 106 KLIHAHAFKIGVLTDPVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKN 165
           K +HAH  + GV     +L   +     +  L  ARK+FD   N C   +N ++ A+  +
Sbjct: 5   KQLHAHCLRTGVDETKDLLQRLLL----IPNLVYARKLFDHHQNSCTFLYNKLIQAYYVH 64

Query: 166 GDLGSAVFMFRSM------PEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNE 225
                ++ ++  +      P H   ++    +  + + R L  L   H        + + 
Sbjct: 65  HQPHESIVLYNLLSFDGLRPSHHTFNFIFAASASFSSARPLRLL---HSQFFRSGFESDS 124

Query: 226 ATYVSVLSSSANLDAEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTV 285
               +++++ A L   G LC  + V   + + +    V +   +I  Y + G +  A  +
Sbjct: 125 FCCTTLITAYAKL---GALCCARRVFDEMSKRD----VPVWNAMITGYQRRGDMKAAMEL 184

Query: 286 FNQMKKREVCTWNAMISSFASNGRETEALDLFATM-KVEGIHPNEVTFVAILTACARGKL 345
           F+ M ++ V +W  +IS F+ NG  +EAL +F  M K + + PN +T V++L ACA    
Sbjct: 185 FDSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCMEKDKSVKPNHITVVSVLPACANLGE 244

Query: 346 VKLGLQL----FQSMLYDFSIVPITEHYVC--VVDLLGKAGLLREATEFIESMPFDPDAS 405
           +++G +L     ++  +D         YVC   +++  K G++  A    E +    +  
Sbjct: 245 LEIGRRLEGYARENGFFD-------NIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLC 296

BLAST of Csa7G043000 vs. Swiss-Prot
Match: PP369_ARATH (Pentatricopeptide repeat-containing protein At5g08305 OS=Arabidopsis thaliana GN=PCMP-E105 PE=2 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 1.5e-75
Identity = 163/415 (39.28%), Postives = 246/415 (59.28%), Query Frame = 1

Query: 55  YNTLIRAYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSL-LPKLIHAHAF 114
           +N +IR + N   P  ++ +Y QML     P+  T+P ++KS++  S+  L   +H    
Sbjct: 76  WNFVIRGFSNSRNPEKSISVYIQMLRFGLLPDHMTYPFLMKSSSRLSNRKLGGSLHCSVV 135

Query: 115 KIGVLTDPVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVF 174
           K G+  D  +  + +  Y   R+ A+ARK+FDE+ +  +V +NS+LDA+ K+GD+ SA  
Sbjct: 136 KSGLEWDLFICNTLIHMYGSFRDQASARKLFDEMPHKNLVTWNSILDAYAKSGDVVSARL 195

Query: 175 MFRSMPEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLD 234
           +F  M E DVV+W+S+I+G+   G + +AL  F  MM  GS K NE T VSV+ + A+L 
Sbjct: 196 VFDEMSERDVVTWSSMIDGYVKRGEYNKALEIFDQMMRMGSSKANEVTMVSVICACAHL- 255

Query: 235 AEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQ--MKKREVCTW 294
             G L RGK VH YI+      +V + T LID Y K G +G A +VF +  +K+ +   W
Sbjct: 256 --GALNRGKTVHRYILDVHLPLTVILQTSLIDMYAKCGSIGDAWSVFYRASVKETDALMW 315

Query: 295 NAMISSFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLY 354
           NA+I   AS+G   E+L LF  M+   I P+E+TF+ +L AC+ G LVK     F+S L 
Sbjct: 316 NAIIGGLASHGFIRESLQLFHKMRESKIDPDEITFLCLLAACSHGGLVKEAWHFFKS-LK 375

Query: 355 DFSIVPITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGN 414
           +    P +EHY C+VD+L +AGL+++A +FI  MP  P  S+LGALL+ C  HG  EL  
Sbjct: 376 ESGAEPKSEHYACMVDVLSRAGLVKDAHDFISEMPIKPTGSMLGALLNGCINHGNLELAE 435

Query: 415 EVGRRLLEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVD 467
            VG++L+E+QP + GRYV LA++ A  +++  A  +R  M    ++K   +S +D
Sbjct: 436 TVGKKLIELQPHNDGRYVGLANVYAINKQFRAARSMREAMEKKGVKKIAGHSILD 486


HSP 2 Score: 64.3 bits (155), Expect = 4.1e-09
Identity = 45/190 (23.68%), Postives = 84/190 (44.21%), Query Frame = 1

Query: 161 AFVKNGDLGSAVFMFRSMPEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEA 220
           A   +GD+  A      + +     W  VI GF  N R  E     ++ M+   + P+  
Sbjct: 51  ALSSSGDVDYAYKFLSKLSDPPNYGWNFVIRGFS-NSRNPEKSISVYIQMLRFGLLPDHM 110

Query: 221 TYVSVLSSSANLDAEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVF 280
           TY  ++ SS+ L    +   G  +H  ++++  E+ +FI   LI  YG       AR +F
Sbjct: 111 TYPFLMKSSSRLSNRKL---GGSLHCSVVKSGLEWDLFICNTLIHMYGSFRDQASARKLF 170

Query: 281 NQMKKREVCTWNAMISSFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVK 340
           ++M  + + TWN+++ ++A +G    A  +F  M    +    VT+ +++    +     
Sbjct: 171 DEMPHKNLVTWNSILDAYAKSGDVVSARLVFDEMSERDV----VTWSSMIDGYVKRGEYN 230

Query: 341 LGLQLFQSML 351
             L++F  M+
Sbjct: 231 KALEIFDQMM 232

BLAST of Csa7G043000 vs. Swiss-Prot
Match: PP435_ARATH (Putative pentatricopeptide repeat-containing protein At5g59200, chloroplastic OS=Arabidopsis thaliana GN=PCMP-E41 PE=3 SV=1)

HSP 1 Score: 270.0 bits (689), Expect = 4.9e-71
Identity = 163/473 (34.46%), Postives = 248/473 (52.43%), Query Frame = 1

Query: 16  RFLKNSTKIPQIHALLLTQGH----------------------LFNNSQTSSNPKCLITL 75
           R  KN   +P IHA ++   H                       ++     SNP      
Sbjct: 37  RSCKNIAHVPSIHAKIIRTFHDQDAFVVFELIRVCSTLDSVDYAYDVFSYVSNPN---VY 96

Query: 76  LYNTLIRAYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAF 135
           LY  +I  +++  R    + LY +M+     P+ +   S++K+   C   + + IHA   
Sbjct: 97  LYTAMIDGFVSSGRSADGVSLYHRMIHNSVLPDNYVITSVLKA---CDLKVCREIHAQVL 156

Query: 136 KIGVLTDPVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVF 195
           K+G  +   V    +  Y    EL NA+K+FDE+ +   VA   M++ + + G +  A+ 
Sbjct: 157 KLGFGSSRSVGLKMMEIYGKSGELVNAKKMFDEMPDRDHVAATVMINCYSECGFIKEALE 216

Query: 196 MFRSMPEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLD 255
           +F+ +   D V WT++I+G   N    +AL  F  M M  +V  NE T V VLS+ ++L 
Sbjct: 217 LFQDVKIKDTVCWTAMIDGLVRNKEMNKALELFREMQME-NVSANEFTAVCVLSACSDL- 276

Query: 256 AEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNA 315
             G L  G+ VH+++     E S F+G  LI+ Y + G +  AR VF  M+ ++V ++N 
Sbjct: 277 --GALELGRWVHSFVENQRMELSNFVGNALINMYSRCGDINEARRVFRVMRDKDVISYNT 336

Query: 316 MISSFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDF 375
           MIS  A +G   EA++ F  M   G  PN+VT VA+L AC+ G L+ +GL++F SM   F
Sbjct: 337 MISGLAMHGASVEAINEFRDMVNRGFRPNQVTLVALLNACSHGGLLDIGLEVFNSMKRVF 396

Query: 376 SIVPITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEV 435
           ++ P  EHY C+VDLLG+ G L EA  FIE++P +PD  +LG LLSACKIHG  ELG ++
Sbjct: 397 NVEPQIEHYGCIVDLLGRVGRLEEAYRFIENIPIEPDHIMLGTLLSACKIHGNMELGEKI 456

Query: 436 GRRLLEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVD 467
            +RL E +    G YV L+++ A + KW  +  IR  M D+ I+K P  S ++
Sbjct: 457 AKRLFESENPDSGTYVLLSNLYASSGKWKESTEIRESMRDSGIEKEPGCSTIE 499

BLAST of Csa7G043000 vs. TrEMBL
Match: A0A0A0K5I5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G043000 PE=4 SV=1)

HSP 1 Score: 955.3 bits (2468), Expect = 2.8e-275
Identity = 477/477 (100.00%), Postives = 477/477 (100.00%), Query Frame = 1

Query: 1   MFTSHPSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIR 60
           MFTSHPSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIR
Sbjct: 1   MFTSHPSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIR 60

Query: 61  AYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLTD 120
           AYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLTD
Sbjct: 61  AYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLTD 120

Query: 121 PVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPE 180
           PVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPE
Sbjct: 121 PVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPE 180

Query: 181 HDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLDAEGVLCR 240
           HDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLDAEGVLCR
Sbjct: 181 HDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLDAEGVLCR 240

Query: 241 GKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFAS 300
           GKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFAS
Sbjct: 241 GKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFAS 300

Query: 301 NGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITE 360
           NGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITE
Sbjct: 301 NGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITE 360

Query: 361 HYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEM 420
           HYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEM
Sbjct: 361 HYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEM 420

Query: 421 QPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVDPMQNLVLVSPS 478
           QPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVDPMQNLVLVSPS
Sbjct: 421 QPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVDPMQNLVLVSPS 477

BLAST of Csa7G043000 vs. TrEMBL
Match: B9NA20_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0018s03960g PE=4 SV=1)

HSP 1 Score: 548.5 bits (1412), Expect = 8.0e-153
Identity = 282/469 (60.13%), Postives = 345/469 (73.56%), Query Frame = 1

Query: 5   HPSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTS--SNPKCLITLLYNTLIRAY 64
           HP E LLQLLQ F+K+  ++ QIH+LL T+G LF N  TS   N K   TLL+NTL RAY
Sbjct: 4   HPPELLLQLLQHFIKHQNQVKQIHSLLTTKGLLFYNPNTSPNDNSKWKTTLLFNTLNRAY 63

Query: 65  LNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLTDPV 124
           LNF + +  L L+  ML+ QT PN HTFPS+IK+AT     +   +H  A   GVL DP 
Sbjct: 64  LNFGQHQQTLHLFALMLAHQTPPNSHTFPSVIKAATHSCLFIGTSLHTQAINRGVLYDPF 123

Query: 125 VLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPEHD 184
           + TS +  Y+    L NA KVFDEI++PCIV +N+ LDA+ KNGD+GSA  +F+SMP+ D
Sbjct: 124 IQTSLLGMYSQFGYLLNACKVFDEISHPCIVEYNATLDAYAKNGDMGSACCLFKSMPKRD 183

Query: 185 VVSWTSVINGFWWNGRFLEALWFFHVMMMSGS-----VKPNEATYVSVLSSSANLDAEGV 244
           VVSWTSVINGF  NG F EA+  F  MM+        VKPNEATYVSVLSS ANLD  GV
Sbjct: 184 VVSWTSVINGFAKNGLFGEAIRLFREMMLHDDVKCCFVKPNEATYVSVLSSCANLDERGV 243

Query: 245 LCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISS 304
           LC GK++H YI+RNE   +VFIGT LIDFYGK+G L  A  V+NQM  ++VCTWNA+ISS
Sbjct: 244 LCIGKQIHGYIVRNEVFVTVFIGTTLIDFYGKVGCLSNAIRVYNQMMVKKVCTWNAIISS 303

Query: 305 FASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVP 364
            A+NGRE +ALD+F  MK EG+ PNEVTF+A+LTACAR KLV++GL+LFQSM  +F +VP
Sbjct: 304 LANNGREEQALDMFKKMKGEGLCPNEVTFIAVLTACARAKLVEIGLELFQSMAGEFGLVP 363

Query: 365 ITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRL 424
           I EHY CVVDLLG AGLLREA+EFI  MPF+PDAS LGALL ACKIHGA +LGNEVG RL
Sbjct: 364 IMEHYGCVVDLLGMAGLLREASEFIRRMPFEPDASALGALLGACKIHGAIDLGNEVGSRL 423

Query: 425 LEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVD 467
           LE+QP+HCG+YV L+S++ G  +W  AA IR+ M +ARI+  PA S +D
Sbjct: 424 LELQPQHCGQYVALSSIHVGVNRWGVAADIRKTMVEARIRIVPACSLID 472

BLAST of Csa7G043000 vs. TrEMBL
Match: M5WYI9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023408mg PE=4 SV=1)

HSP 1 Score: 541.6 bits (1394), Expect = 9.8e-151
Identity = 279/470 (59.36%), Postives = 344/470 (73.19%), Query Frame = 1

Query: 4   SHPSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIRAYL 63
           S   E LL+LLQRFLK   +I QIH+ L++ GHL ++   SS  K + TLLYNTLIRA+L
Sbjct: 4   SSSPECLLELLQRFLKRPNQINQIHSRLISHGHLLHSPNPSSASKWMTTLLYNTLIRAHL 63

Query: 64  NFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKL---IHAHAFKIGVLTD 123
            F++     LL+T ML+ Q  PN HTFP +IK+A+  SS  P L   +H    K GVL D
Sbjct: 64  GFSQAHKPFLLFTHMLAHQAPPNSHTFPPLIKAASASSS--PNLGAPLHTQVVKRGVLHD 123

Query: 124 PVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPE 183
             + TS VS YA    L +ARKVF+EI+ PC+VA+N+M+D F KNGD+GSAV +F+SMP+
Sbjct: 124 SFIQTSLVSFYAQFGILCDARKVFEEISEPCVVAYNAMIDGFGKNGDVGSAVSLFQSMPK 183

Query: 184 HDVVSWTSVINGFWWNGRFLEALWFFHVM----MMSGSVKPNEATYVSVLSSSANLDAEG 243
            DVVSWTSVINGF  NG F E + FF +M    +M   VKPNEATYV+VLSS ANLD  G
Sbjct: 184 KDVVSWTSVINGFGRNGSFSEGIQFFKMMVHEDLMGCFVKPNEATYVTVLSSCANLDGWG 243

Query: 244 VLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMIS 303
            L  GK++H Y+IR E EF+ F+GT LID YGKMG +  A  VF +M   EVCTWNAMIS
Sbjct: 244 SLYWGKQIHGYVIRKEIEFTAFLGTALIDLYGKMGYVRSAENVFKKMVVTEVCTWNAMIS 303

Query: 304 SFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIV 363
           + + NG+E EALDLF  MK E + PN VTFVA+LTACARGKLV  GL+LF+SM  DF + 
Sbjct: 304 ALSLNGKEREALDLFEKMKRERLQPNAVTFVAVLTACARGKLVNFGLELFRSMSNDFGVE 363

Query: 364 PITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRR 423
           PI EHY CVVDLLG+AGL  EATE I+SMPF+PDASVLGALL +CKIHG  ELGN+VG++
Sbjct: 364 PIMEHYGCVVDLLGRAGLFLEATELIKSMPFEPDASVLGALLGSCKIHGTAELGNKVGQK 423

Query: 424 LLEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVD 467
           LLE+QP+HCGRYV L+S+NAG E+W+RAA +R+ M  A I+K PAYS +D
Sbjct: 424 LLELQPQHCGRYVVLSSINAGTERWDRAAAVRQAMVHAGIRKIPAYSVID 471

BLAST of Csa7G043000 vs. TrEMBL
Match: W9QQL2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_011924 PE=4 SV=1)

HSP 1 Score: 533.5 bits (1373), Expect = 2.7e-148
Identity = 276/460 (60.00%), Postives = 347/460 (75.43%), Query Frame = 1

Query: 7   SEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIRAYLNFN 66
           SE LLQLLQRF K+  ++ QIH LL+T+GHL   ++   N     TLL+N LIR +L F 
Sbjct: 5   SELLLQLLQRFPKHPNQVQQIHCLLITEGHLLLRNRKWKN-----TLLFNALIRTHLGFG 64

Query: 67  RPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKL---IHAHAFKIGVLTDPVV 126
           + R +LLL+ +ML  +  PN HTFPS+IKSA   SS LP L   +HA A K GVL DP V
Sbjct: 65  QTRKSLLLFARMLLHRAPPNGHTFPSLIKSA---SSSLPSLGPPLHAQALKRGVLRDPFV 124

Query: 127 LTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPEHDV 186
            TS +S YA+   L +ARKVF+E++  C+VA N+M+DAF KNGD+GSA+ +F  M + DV
Sbjct: 125 QTSLLSFYAECGNLRSARKVFEEMSERCVVACNAMIDAFCKNGDMGSALCIFEGMIKRDV 184

Query: 187 VSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLDAEGVLCRGKE 246
           VSWTSV+NGF  N RF EA+  F   MMS SVKPNEATYVSVLSS A+LD  G +  GK+
Sbjct: 185 VSWTSVVNGFRMNRRFCEAVRVFG-SMMSCSVKPNEATYVSVLSSCASLDGWGGIYIGKQ 244

Query: 247 VHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFASNGR 306
           +H +I+RNE   SVF+GT  ID YGK GLL  A+ VF+ M  +E C WNAMIS+ +SNGR
Sbjct: 245 IHGHILRNEIHLSVFMGTAFIDLYGKSGLLNAAKNVFDGMVVKETCCWNAMISALSSNGR 304

Query: 307 ETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITEHYV 366
           E EALD F  +K+EG+ PNEVTFVA+LTACARGKLV+ GL+LF+SML DF +VP+ EHY 
Sbjct: 305 EKEALDFFERIKMEGLQPNEVTFVAVLTACARGKLVEFGLELFRSMLNDFGVVPVMEHYG 364

Query: 367 CVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEMQPR 426
           CVVDLLG+AGLLREA EF++SMPF+PDASVLGALL A KIHG TELGNEVG++L++MQP+
Sbjct: 365 CVVDLLGRAGLLREAAEFVQSMPFEPDASVLGALLGASKIHGTTELGNEVGQKLIDMQPQ 424

Query: 427 HCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYS 464
           H GRYV L+++NA  ++WNRAA +R++M +A +QKT AYS
Sbjct: 425 HSGRYVVLSNINAATQRWNRAAYLRKMMLNAGVQKTLAYS 455

BLAST of Csa7G043000 vs. TrEMBL
Match: A0A061GGY2_THECC (Tetratricopeptide repeat-like superfamily protein, putative OS=Theobroma cacao GN=TCM_030317 PE=4 SV=1)

HSP 1 Score: 532.7 bits (1371), Expect = 4.5e-148
Identity = 284/466 (60.94%), Postives = 345/466 (74.03%), Query Frame = 1

Query: 7   SEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIRAYLNFN 66
           SE LLQ LQRF++   +I QIH+LL+T G L +N  ++++ K   TLLYNTLIRAYLN  
Sbjct: 3   SESLLQHLQRFIQRPNQIKQIHSLLITGGLLLHNHYSTAS-KWKTTLLYNTLIRAYLNVK 62

Query: 67  RPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKL----IHAHAFKIGVLTDPV 126
               +LLL+T+ML  QT PN HTFPS+ K+A   S  L  L    +HA A K GVL+DP 
Sbjct: 63  PFHHSLLLFTRMLGHQTPPNSHTFPSLFKAAAAASLSLASLTCAPLHAQALKRGVLSDPF 122

Query: 127 VLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPEHD 186
           V TS +  YA L  L+NA KVF+EI NPCIVA N+MLDAF +NGD+GSA+ +F SM E D
Sbjct: 123 VQTSLLGVYAKLGRLSNASKVFEEIFNPCIVACNAMLDAFGRNGDMGSALLLFESMIEKD 182

Query: 187 VVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLDAEGVLCRGK 246
           VVSWTSVINGF  + +F EA+  F   MM   VKPNEATYV+VLS  AN +  G L +GK
Sbjct: 183 VVSWTSVINGFARSKQFKEAIRVFE-NMMEFWVKPNEATYVNVLSCCANSEGGGSLYQGK 242

Query: 247 EVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFASNG 306
           ++H Y++RNE   +V++GT LIDFYGK G    A  VFNQM  REV TWNAMISS A NG
Sbjct: 243 QIHGYMLRNEVVMTVYMGTALIDFYGKKGCSETAVRVFNQMLVREVFTWNAMISSLACNG 302

Query: 307 RETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITEHY 366
           RE +ALD+F  MKVEG+ PNEVT VA+LTACAR K V+LG +LFQSM   + IVP+ EHY
Sbjct: 303 REEKALDMFEKMKVEGVCPNEVTLVAVLTACARTKRVELGSELFQSMYCQYGIVPMMEHY 362

Query: 367 VCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEMQP 426
            C+VDLLG+AGLL EATEFI SMPF PDASVLGALL+ACKIHGA ELGNEVGR+LLE+QP
Sbjct: 363 GCMVDLLGRAGLLTEATEFIGSMPFQPDASVLGALLNACKIHGAIELGNEVGRKLLELQP 422

Query: 427 RHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVDPM 469
           RHCG YV L+S+NA  E+W+RAA +R+ + +ARI+K PAYS +  M
Sbjct: 423 RHCGLYVALSSINADLERWDRAADLRKALVEARIRKVPAYSLISSM 466

BLAST of Csa7G043000 vs. TAIR10
Match: AT1G10330.1 (AT1G10330.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 432.2 bits (1110), Expect = 4.2e-121
Identity = 230/458 (50.22%), Postives = 309/458 (67.47%), Query Frame = 1

Query: 11  LQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIRAYLNFNRPRF 70
           L LLQRFL +S +I QIH +LLT   L  +   +   KC+    YNTLIR+YL     + 
Sbjct: 17  LHLLQRFLYSSNQIKQIHTVLLTSNALVASRWKT---KCV----YNTLIRSYLTTGEYKT 76

Query: 71  ALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKL---IHAHAFKIGVLTDPVVLTSF 130
           +L L+T ML+   +PN  TFPS+IK+A  CSS        +H  A K G L DP V TSF
Sbjct: 77  SLALFTHMLASHVQPNNLTFPSLIKAA--CSSFSVSYGVALHGQALKRGFLWDPFVQTSF 136

Query: 131 VSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPEHDVVSWT 190
           V  Y ++ +L ++RK+FD+I NPC+VA NS+LDA  +NG++  A   F+ MP  DVVSWT
Sbjct: 137 VRFYGEVGDLESSRKMFDDILNPCVVACNSLLDACGRNGEMDYAFEYFQRMPVTDVVSWT 196

Query: 191 SVINGFWWNGRFLEALWFFHVMMMS--GSVKPNEATYVSVLSSSANLDAEGVLCRGKEVH 250
           +VINGF   G   +AL  F  M+ +    + PNEAT+VSVLSS AN D  G+   GK++H
Sbjct: 197 TVINGFSKKGLHAKALMVFGEMIQNERAVITPNEATFVSVLSSCANFDQGGIRL-GKQIH 256

Query: 251 AYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFASNGRET 310
            Y++  E   +  +GT L+D YGK G L  A T+F+Q++ ++VC WNA+IS+ ASNGR  
Sbjct: 257 GYVMSKEIILTTTLGTALLDMYGKAGDLEMALTIFDQIRDKKVCAWNAIISALASNGRPK 316

Query: 311 EALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITEHYVCV 370
           +AL++F  MK   +HPN +T +AILTACAR KLV LG+QLF S+  ++ I+P +EHY CV
Sbjct: 317 QALEMFEMMKSSYVHPNGITLLAILTACARSKLVDLGIQLFSSICSEYKIIPTSEHYGCV 376

Query: 371 VDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEMQPRHC 430
           VDL+G+AGLL +A  FI+S+PF+PDASVLGALL ACKIH  TELGN VG++L+ +QP+HC
Sbjct: 377 VDLIGRAGLLVDAANFIQSLPFEPDASVLGALLGACKIHENTELGNTVGKQLIGLQPQHC 436

Query: 431 GRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYS 464
           G+YV L++ NA    W+ A  +R+ M +A I+K PAYS
Sbjct: 437 GQYVALSTFNALDSNWSEAEKMRKAMIEAGIRKIPAYS 464

BLAST of Csa7G043000 vs. TAIR10
Match: AT5G66520.1 (AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 293.9 bits (751), Expect = 1.8e-79
Identity = 181/478 (37.87%), Postives = 263/478 (55.02%), Query Frame = 1

Query: 11  LQLLQRFLKNSTKIPQIHALLLTQGHL-----------FNNSQTSSN--PKCLI------ 70
           +  LQR  K   ++ QIHA +L  G +           F  S TSS+  P   I      
Sbjct: 18  MSCLQRCSKQE-ELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFD 77

Query: 71  ---TLLYNTLIRAYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKL- 130
              T L+N +IR +   + P  +LLLY +ML      N +TFPS++K+ +  S+      
Sbjct: 78  RPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQ 137

Query: 131 IHAHAFKIGVLTDPVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGD 190
           IHA   K+G   D   + S ++SYA       A  +FD I  P  V++NS++  +VK G 
Sbjct: 138 IHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGK 197

Query: 191 LGSAVFMFRSMPEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLS 250
           +  A+ +FR M E + +SWT++I+G+       EAL  FH M  S  V+P+  +  + LS
Sbjct: 198 MDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNS-DVEPDNVSLANALS 257

Query: 251 SSANLDAEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKRE 310
           + A L   G L +GK +H+Y+ +        +G  LID Y K G +  A  VF  +KK+ 
Sbjct: 258 ACAQL---GALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKS 317

Query: 311 VCTWNAMISSFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQ 370
           V  W A+IS +A +G   EA+  F  M+  GI PN +TF A+LTAC+   LV+ G  +F 
Sbjct: 318 VQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFY 377

Query: 371 SMLYDFSIVPITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGAT 430
           SM  D+++ P  EHY C+VDLLG+AGLL EA  FI+ MP  P+A + GALL AC+IH   
Sbjct: 378 SMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNI 437

Query: 431 ELGNEVGRRLLEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRV 466
           ELG E+G  L+ + P H GRYV  A+++A  +KW++AA  RR+M +  + K P  S +
Sbjct: 438 ELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI 490

BLAST of Csa7G043000 vs. TAIR10
Match: AT5G08510.1 (AT5G08510.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 289.7 bits (740), Expect = 3.4e-78
Identity = 162/429 (37.76%), Postives = 247/429 (57.58%), Query Frame = 1

Query: 37  LFNNSQTSSNPKCLITLLYNTLIRAYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKS 96
           LF++ Q S       T LYN LI+AY   ++P  +++LY  +     +P+ HTF  I  +
Sbjct: 38  LFDHHQNSC------TFLYNKLIQAYYVHHQPHESIVLYNLLSFDGLRPSHHTFNFIFAA 97

Query: 97  ATICSSLLP-KLIHAHAFKIGVLTDPVVLTSFVSSYADLRELANARKVFDEITNPCIVAF 156
           +   SS  P +L+H+  F+ G  +D    T+ +++YA L  L  AR+VFDE++   +  +
Sbjct: 98  SASFSSARPLRLLHSQFFRSGFESDSFCCTTLITAYAKLGALCCARRVFDEMSKRDVPVW 157

Query: 157 NSMLDAFVKNGDLGSAVFMFRSMPEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSV 216
           N+M+  + + GD+ +A+ +F SMP  +V SWT+VI+GF  NG + EAL  F  M    SV
Sbjct: 158 NAMITGYQRRGDMKAAMELFDSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCMEKDKSV 217

Query: 217 KPNEATYVSVLSSSANLDAEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGC 276
           KPN  T VSVL + ANL   G L  G+ +  Y   N    ++++    I+ Y K G++  
Sbjct: 218 KPNHITVVSVLPACANL---GELEIGRRLEGYARENGFFDNIYVCNATIEMYSKCGMIDV 277

Query: 277 ARTVFNQM-KKREVCTWNAMISSFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACA 336
           A+ +F ++  +R +C+WN+MI S A++G+  EAL LFA M  EG  P+ VTFV +L AC 
Sbjct: 278 AKRLFEELGNQRNLCSWNSMIGSLATHGKHDEALTLFAQMLREGEKPDAVTFVGLLLACV 337

Query: 337 RGKLVKLGLQLFQSMLYDFSIVPITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVL 396
            G +V  G +LF+SM     I P  EHY C++DLLG+ G L+EA + I++MP  PDA V 
Sbjct: 338 HGGMVVKGQELFKSMEEVHKISPKLEHYGCMIDLLGRVGKLQEAYDLIKTMPMKPDAVVW 397

Query: 397 GALLSACKIHGATELGNEVGRRLLEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADA 456
           G LL AC  HG  E+       L +++P + G  V ++++ A  EKW+    +R++M   
Sbjct: 398 GTLLGACSFHGNVEIAEIASEALFKLEPTNPGNCVIMSNIYAANEKWDGVLRMRKLMKKE 457

Query: 457 RIQKTPAYS 464
            + K   YS
Sbjct: 458 TMTKAAGYS 457


HSP 2 Score: 83.6 bits (205), Expect = 3.7e-16
Identity = 69/313 (22.04%), Postives = 135/313 (43.13%), Query Frame = 1

Query: 106 KLIHAHAFKIGVLTDPVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKN 165
           K +HAH  + GV     +L   +     +  L  ARK+FD   N C   +N ++ A+  +
Sbjct: 5   KQLHAHCLRTGVDETKDLLQRLLL----IPNLVYARKLFDHHQNSCTFLYNKLIQAYYVH 64

Query: 166 GDLGSAVFMFRSM------PEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNE 225
                ++ ++  +      P H   ++    +  + + R L  L   H        + + 
Sbjct: 65  HQPHESIVLYNLLSFDGLRPSHHTFNFIFAASASFSSARPLRLL---HSQFFRSGFESDS 124

Query: 226 ATYVSVLSSSANLDAEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTV 285
               +++++ A L   G LC  + V   + + +    V +   +I  Y + G +  A  +
Sbjct: 125 FCCTTLITAYAKL---GALCCARRVFDEMSKRD----VPVWNAMITGYQRRGDMKAAMEL 184

Query: 286 FNQMKKREVCTWNAMISSFASNGRETEALDLFATM-KVEGIHPNEVTFVAILTACARGKL 345
           F+ M ++ V +W  +IS F+ NG  +EAL +F  M K + + PN +T V++L ACA    
Sbjct: 185 FDSMPRKNVTSWTTVISGFSQNGNYSEALKMFLCMEKDKSVKPNHITVVSVLPACANLGE 244

Query: 346 VKLGLQL----FQSMLYDFSIVPITEHYVC--VVDLLGKAGLLREATEFIESMPFDPDAS 405
           +++G +L     ++  +D         YVC   +++  K G++  A    E +    +  
Sbjct: 245 LEIGRRLEGYARENGFFD-------NIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLC 296

BLAST of Csa7G043000 vs. TAIR10
Match: AT5G08305.1 (AT5G08305.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 285.0 bits (728), Expect = 8.3e-77
Identity = 163/415 (39.28%), Postives = 246/415 (59.28%), Query Frame = 1

Query: 55  YNTLIRAYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSL-LPKLIHAHAF 114
           +N +IR + N   P  ++ +Y QML     P+  T+P ++KS++  S+  L   +H    
Sbjct: 76  WNFVIRGFSNSRNPEKSISVYIQMLRFGLLPDHMTYPFLMKSSSRLSNRKLGGSLHCSVV 135

Query: 115 KIGVLTDPVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVF 174
           K G+  D  +  + +  Y   R+ A+ARK+FDE+ +  +V +NS+LDA+ K+GD+ SA  
Sbjct: 136 KSGLEWDLFICNTLIHMYGSFRDQASARKLFDEMPHKNLVTWNSILDAYAKSGDVVSARL 195

Query: 175 MFRSMPEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLD 234
           +F  M E DVV+W+S+I+G+   G + +AL  F  MM  GS K NE T VSV+ + A+L 
Sbjct: 196 VFDEMSERDVVTWSSMIDGYVKRGEYNKALEIFDQMMRMGSSKANEVTMVSVICACAHL- 255

Query: 235 AEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQ--MKKREVCTW 294
             G L RGK VH YI+      +V + T LID Y K G +G A +VF +  +K+ +   W
Sbjct: 256 --GALNRGKTVHRYILDVHLPLTVILQTSLIDMYAKCGSIGDAWSVFYRASVKETDALMW 315

Query: 295 NAMISSFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLY 354
           NA+I   AS+G   E+L LF  M+   I P+E+TF+ +L AC+ G LVK     F+S L 
Sbjct: 316 NAIIGGLASHGFIRESLQLFHKMRESKIDPDEITFLCLLAACSHGGLVKEAWHFFKS-LK 375

Query: 355 DFSIVPITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGN 414
           +    P +EHY C+VD+L +AGL+++A +FI  MP  P  S+LGALL+ C  HG  EL  
Sbjct: 376 ESGAEPKSEHYACMVDVLSRAGLVKDAHDFISEMPIKPTGSMLGALLNGCINHGNLELAE 435

Query: 415 EVGRRLLEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVD 467
            VG++L+E+QP + GRYV LA++ A  +++  A  +R  M    ++K   +S +D
Sbjct: 436 TVGKKLIELQPHNDGRYVGLANVYAINKQFRAARSMREAMEKKGVKKIAGHSILD 486


HSP 2 Score: 64.3 bits (155), Expect = 2.3e-10
Identity = 45/190 (23.68%), Postives = 84/190 (44.21%), Query Frame = 1

Query: 161 AFVKNGDLGSAVFMFRSMPEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEA 220
           A   +GD+  A      + +     W  VI GF  N R  E     ++ M+   + P+  
Sbjct: 51  ALSSSGDVDYAYKFLSKLSDPPNYGWNFVIRGFS-NSRNPEKSISVYIQMLRFGLLPDHM 110

Query: 221 TYVSVLSSSANLDAEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVF 280
           TY  ++ SS+ L    +   G  +H  ++++  E+ +FI   LI  YG       AR +F
Sbjct: 111 TYPFLMKSSSRLSNRKL---GGSLHCSVVKSGLEWDLFICNTLIHMYGSFRDQASARKLF 170

Query: 281 NQMKKREVCTWNAMISSFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVK 340
           ++M  + + TWN+++ ++A +G    A  +F  M    +    VT+ +++    +     
Sbjct: 171 DEMPHKNLVTWNSILDAYAKSGDVVSARLVFDEMSERDV----VTWSSMIDGYVKRGEYN 230

Query: 341 LGLQLFQSML 351
             L++F  M+
Sbjct: 231 KALEIFDQMM 232

BLAST of Csa7G043000 vs. TAIR10
Match: AT5G59200.1 (AT5G59200.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 270.0 bits (689), Expect = 2.8e-72
Identity = 163/473 (34.46%), Postives = 248/473 (52.43%), Query Frame = 1

Query: 16  RFLKNSTKIPQIHALLLTQGH----------------------LFNNSQTSSNPKCLITL 75
           R  KN   +P IHA ++   H                       ++     SNP      
Sbjct: 37  RSCKNIAHVPSIHAKIIRTFHDQDAFVVFELIRVCSTLDSVDYAYDVFSYVSNPN---VY 96

Query: 76  LYNTLIRAYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAF 135
           LY  +I  +++  R    + LY +M+     P+ +   S++K+   C   + + IHA   
Sbjct: 97  LYTAMIDGFVSSGRSADGVSLYHRMIHNSVLPDNYVITSVLKA---CDLKVCREIHAQVL 156

Query: 136 KIGVLTDPVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVF 195
           K+G  +   V    +  Y    EL NA+K+FDE+ +   VA   M++ + + G +  A+ 
Sbjct: 157 KLGFGSSRSVGLKMMEIYGKSGELVNAKKMFDEMPDRDHVAATVMINCYSECGFIKEALE 216

Query: 196 MFRSMPEHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLD 255
           +F+ +   D V WT++I+G   N    +AL  F  M M  +V  NE T V VLS+ ++L 
Sbjct: 217 LFQDVKIKDTVCWTAMIDGLVRNKEMNKALELFREMQME-NVSANEFTAVCVLSACSDL- 276

Query: 256 AEGVLCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNA 315
             G L  G+ VH+++     E S F+G  LI+ Y + G +  AR VF  M+ ++V ++N 
Sbjct: 277 --GALELGRWVHSFVENQRMELSNFVGNALINMYSRCGDINEARRVFRVMRDKDVISYNT 336

Query: 316 MISSFASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDF 375
           MIS  A +G   EA++ F  M   G  PN+VT VA+L AC+ G L+ +GL++F SM   F
Sbjct: 337 MISGLAMHGASVEAINEFRDMVNRGFRPNQVTLVALLNACSHGGLLDIGLEVFNSMKRVF 396

Query: 376 SIVPITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEV 435
           ++ P  EHY C+VDLLG+ G L EA  FIE++P +PD  +LG LLSACKIHG  ELG ++
Sbjct: 397 NVEPQIEHYGCIVDLLGRVGRLEEAYRFIENIPIEPDHIMLGTLLSACKIHGNMELGEKI 456

Query: 436 GRRLLEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVD 467
            +RL E +    G YV L+++ A + KW  +  IR  M D+ I+K P  S ++
Sbjct: 457 AKRLFESENPDSGTYVLLSNLYASSGKWKESTEIRESMRDSGIEKEPGCSTIE 499

BLAST of Csa7G043000 vs. NCBI nr
Match: gi|449453704|ref|XP_004144596.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g10330 [Cucumis sativus])

HSP 1 Score: 955.3 bits (2468), Expect = 4.1e-275
Identity = 477/477 (100.00%), Postives = 477/477 (100.00%), Query Frame = 1

Query: 1   MFTSHPSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIR 60
           MFTSHPSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIR
Sbjct: 1   MFTSHPSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIR 60

Query: 61  AYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLTD 120
           AYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLTD
Sbjct: 61  AYLNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLTD 120

Query: 121 PVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPE 180
           PVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPE
Sbjct: 121 PVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPE 180

Query: 181 HDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLDAEGVLCR 240
           HDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLDAEGVLCR
Sbjct: 181 HDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLDAEGVLCR 240

Query: 241 GKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFAS 300
           GKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFAS
Sbjct: 241 GKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFAS 300

Query: 301 NGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITE 360
           NGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITE
Sbjct: 301 NGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITE 360

Query: 361 HYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEM 420
           HYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEM
Sbjct: 361 HYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEM 420

Query: 421 QPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVDPMQNLVLVSPS 478
           QPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVDPMQNLVLVSPS
Sbjct: 421 QPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVDPMQNLVLVSPS 477

BLAST of Csa7G043000 vs. NCBI nr
Match: gi|659110867|ref|XP_008455452.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g10330 [Cucumis melo])

HSP 1 Score: 901.7 bits (2329), Expect = 5.3e-259
Identity = 452/474 (95.36%), Postives = 456/474 (96.20%), Query Frame = 1

Query: 4   SHPSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIRAYL 63
           SH SEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCL TLLYNTLIRAYL
Sbjct: 2   SHSSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTSSNPKCLTTLLYNTLIRAYL 61

Query: 64  NFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLTDPVV 123
             NRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATI S LLPKLIHAHAFKIGVLTDPVV
Sbjct: 62  TLNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATIYSPLLPKLIHAHAFKIGVLTDPVV 121

Query: 124 LTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPEHDV 183
           LTSFVSSY DLREL NARKVFDEIT+PCIVAFNSMLDAFVKNGDLGSAVFMFR MPE DV
Sbjct: 122 LTSFVSSYGDLRELGNARKVFDEITDPCIVAFNSMLDAFVKNGDLGSAVFMFRYMPERDV 181

Query: 184 VSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLDAEGVLCRGKE 243
           VSWTSV+NGFWWNGRFLEALWFF +MMMSGSVKPNEATYVSVLSS ANLDAEGVLCRGKE
Sbjct: 182 VSWTSVVNGFWWNGRFLEALWFFQMMMMSGSVKPNEATYVSVLSSCANLDAEGVLCRGKE 241

Query: 244 VHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFASNGR 303
           VHAYIIRNEGEFSVFIGT LIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFASN R
Sbjct: 242 VHAYIIRNEGEFSVFIGTALIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFASNSR 301

Query: 304 ETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITEHYV 363
           ETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITEHYV
Sbjct: 302 ETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPITEHYV 361

Query: 364 CVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLEMQPR 423
           CVVDLLGKAGLLREATE IESMPFDPDASVLGALLSACKIHGA ELGNEVGRRLL MQPR
Sbjct: 362 CVVDLLGKAGLLREATEIIESMPFDPDASVLGALLSACKIHGAIELGNEVGRRLLVMQPR 421

Query: 424 HCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVDPMQNLVLVSPS 478
           HCGRYVTLASMNAGAEKWNRAAVIR VMADARIQKTPAYSRVDPMQNLVL+SPS
Sbjct: 422 HCGRYVTLASMNAGAEKWNRAAVIRSVMADARIQKTPAYSRVDPMQNLVLISPS 475

BLAST of Csa7G043000 vs. NCBI nr
Match: gi|566214128|ref|XP_006371823.1| (pentatricopeptide repeat-containing family protein [Populus trichocarpa])

HSP 1 Score: 548.5 bits (1412), Expect = 1.1e-152
Identity = 282/469 (60.13%), Postives = 345/469 (73.56%), Query Frame = 1

Query: 5   HPSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFNNSQTS--SNPKCLITLLYNTLIRAY 64
           HP E LLQLLQ F+K+  ++ QIH+LL T+G LF N  TS   N K   TLL+NTL RAY
Sbjct: 4   HPPELLLQLLQHFIKHQNQVKQIHSLLTTKGLLFYNPNTSPNDNSKWKTTLLFNTLNRAY 63

Query: 65  LNFNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLTDPV 124
           LNF + +  L L+  ML+ QT PN HTFPS+IK+AT     +   +H  A   GVL DP 
Sbjct: 64  LNFGQHQQTLHLFALMLAHQTPPNSHTFPSVIKAATHSCLFIGTSLHTQAINRGVLYDPF 123

Query: 125 VLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPEHD 184
           + TS +  Y+    L NA KVFDEI++PCIV +N+ LDA+ KNGD+GSA  +F+SMP+ D
Sbjct: 124 IQTSLLGMYSQFGYLLNACKVFDEISHPCIVEYNATLDAYAKNGDMGSACCLFKSMPKRD 183

Query: 185 VVSWTSVINGFWWNGRFLEALWFFHVMMMSGS-----VKPNEATYVSVLSSSANLDAEGV 244
           VVSWTSVINGF  NG F EA+  F  MM+        VKPNEATYVSVLSS ANLD  GV
Sbjct: 184 VVSWTSVINGFAKNGLFGEAIRLFREMMLHDDVKCCFVKPNEATYVSVLSSCANLDERGV 243

Query: 245 LCRGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISS 304
           LC GK++H YI+RNE   +VFIGT LIDFYGK+G L  A  V+NQM  ++VCTWNA+ISS
Sbjct: 244 LCIGKQIHGYIVRNEVFVTVFIGTTLIDFYGKVGCLSNAIRVYNQMMVKKVCTWNAIISS 303

Query: 305 FASNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVP 364
            A+NGRE +ALD+F  MK EG+ PNEVTF+A+LTACAR KLV++GL+LFQSM  +F +VP
Sbjct: 304 LANNGREEQALDMFKKMKGEGLCPNEVTFIAVLTACARAKLVEIGLELFQSMAGEFGLVP 363

Query: 365 ITEHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRL 424
           I EHY CVVDLLG AGLLREA+EFI  MPF+PDAS LGALL ACKIHGA +LGNEVG RL
Sbjct: 364 IMEHYGCVVDLLGMAGLLREASEFIRRMPFEPDASALGALLGACKIHGAIDLGNEVGSRL 423

Query: 425 LEMQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVD 467
           LE+QP+HCG+YV L+S++ G  +W  AA IR+ M +ARI+  PA S +D
Sbjct: 424 LELQPQHCGQYVALSSIHVGVNRWGVAADIRKTMVEARIRIVPACSLID 472

BLAST of Csa7G043000 vs. NCBI nr
Match: gi|1009169010|ref|XP_015902967.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g10330 [Ziziphus jujuba])

HSP 1 Score: 543.1 bits (1398), Expect = 4.8e-151
Identity = 282/466 (60.52%), Postives = 348/466 (74.68%), Query Frame = 1

Query: 5   HPSEWLLQLLQRFLKN----STKIPQIHALLLTQGHLFNNSQTSSNPKCLITLLYNTLIR 64
           H  E+LLQLLQ FL+N      +I QIH+LL+T G+L       SN K + TLLYN LIR
Sbjct: 3   HLPEFLLQLLQGFLQNPKPNQNQIKQIHSLLITCGNLL-----CSNSKWMNTLLYNALIR 62

Query: 65  AYLNFNRPRF-ALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLT 124
           AYL+F    + +L+L+  ML+ +  PN HTFPS+IK+A+  S      +H+ A K GVL 
Sbjct: 63  AYLSFGHAHYPSLILFNHMLAHKAPPNSHTFPSLIKAASSSSPSFATSLHSQALKRGVLR 122

Query: 125 DPVVLTSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMP 184
           DP + TSF+  YA    L +ARK+F EIT PCIVA N+MLDA  KNGD+GSA+ +F+ MP
Sbjct: 123 DPFIQTSFLCFYAQYALLCDARKMFGEITQPCIVAHNAMLDALTKNGDMGSALLLFKRMP 182

Query: 185 EHDVVSWTSVINGFWWNGRFLEALWFFHVMMMSGSVKPNEATYVSVLSSSANLDAEGVLC 244
           E DVVSWTSVINGF  NG F EA+ FF    MS  V+PNEATYVSV+SS AN D  G L 
Sbjct: 183 ERDVVSWTSVINGFGRNGCFHEAIEFFK--KMSCLVEPNEATYVSVISSCANFDGCGALY 242

Query: 245 RGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFA 304
            GK++HAYIIRN+   + FIGT LID YGK G L  A  VFNQ+  +E+C+WNAMIS+ A
Sbjct: 243 LGKQIHAYIIRNQTLLTAFIGTALIDLYGKTGCLKSAANVFNQIVFKEICSWNAMISALA 302

Query: 305 SNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPIT 364
           SNG E EALDLF TMK EG+ PN+VTFVA+LTACARGKLV+LGL+LF+SM  DF IVPI 
Sbjct: 303 SNGMEKEALDLFETMKKEGLQPNKVTFVAVLTACARGKLVELGLELFRSMSLDFRIVPIM 362

Query: 365 EHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLE 424
           EHY CVVDLLG+AGLLREA EFI+SMPF+PDASVLGAL+ ACKI+G T+LGNEVG++LL+
Sbjct: 363 EHYGCVVDLLGRAGLLREAAEFIKSMPFEPDASVLGALIGACKIYGTTKLGNEVGKKLLQ 422

Query: 425 MQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRV 466
           +QP+ CGRYV L+++NAG EKW+ AA +R++M D  I+K PAYS +
Sbjct: 423 LQPQRCGRYVALSNINAGMEKWDSAATLRKMMVDVGIRKIPAYSMI 461

BLAST of Csa7G043000 vs. NCBI nr
Match: gi|743917926|ref|XP_011002958.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g10330 [Populus euphratica])

HSP 1 Score: 542.7 bits (1397), Expect = 6.3e-151
Identity = 278/467 (59.53%), Postives = 348/467 (74.52%), Query Frame = 1

Query: 6   PSEWLLQLLQRFLKNSTKIPQIHALLLTQGHLFN-NSQTSSNPKCLITLLYNTLIRAYLN 65
           P+E LLQLLQ F+K+  ++ QIH+L+  +G L+N N+    N K   TLLYNTLIRAYLN
Sbjct: 5   PAELLLQLLQHFIKHQNQVKQIHSLITAKGLLYNPNTCPKDNSKWKTTLLYNTLIRAYLN 64

Query: 66  FNRPRFALLLYTQMLSQQTKPNFHTFPSIIKSATICSSLLPKLIHAHAFKIGVLTDPVVL 125
           F + +  L L+  ML++QT PN HTFPSIIK+AT     +   +H  A   GVL DP + 
Sbjct: 65  FGQHQKTLHLFALMLARQTPPNSHTFPSIIKAATHSCLSIGTSLHTQAINRGVLYDPFIQ 124

Query: 126 TSFVSSYADLRELANARKVFDEITNPCIVAFNSMLDAFVKNGDLGSAVFMFRSMPEHDVV 185
           TS +  Y+   +L NA KVFDEI++PCIV +N+MLDA+ KNGD+GSA  +F+SMP+ DVV
Sbjct: 125 TSLLGMYSQFGDLLNACKVFDEISHPCIVEYNAMLDAYAKNGDMGSACCLFKSMPKRDVV 184

Query: 186 SWTSVINGFWWNGRFLEALWFFHVMMMSGS-----VKPNEATYVSVLSSSANLDAEGVLC 245
           SWTSVINGF  NG   EA+  F  MM+        VKPNEATYVSVLSS ANLD  GVL 
Sbjct: 185 SWTSVINGFAKNGLLGEAIRLFREMMLHDDVKCCFVKPNEATYVSVLSSCANLDERGVLS 244

Query: 246 RGKEVHAYIIRNEGEFSVFIGTGLIDFYGKMGLLGCARTVFNQMKKREVCTWNAMISSFA 305
            GK++H  I+RNE   +VFIGT L+DFYGK+G L  A  V+N+M  ++VCTWNA+ISS A
Sbjct: 245 IGKQIHGSIVRNEVFVTVFIGTSLVDFYGKVGCLSNAIRVYNKMVVKKVCTWNAIISSLA 304

Query: 306 SNGRETEALDLFATMKVEGIHPNEVTFVAILTACARGKLVKLGLQLFQSMLYDFSIVPIT 365
           +NGRE +ALD+F  MK EG+ PNEVTF+A+LTACAR KLV++GL+LFQSM  +F +VPI 
Sbjct: 305 NNGREEQALDMFKKMKGEGLCPNEVTFIAVLTACARAKLVEIGLELFQSMAGEFGLVPIM 364

Query: 366 EHYVCVVDLLGKAGLLREATEFIESMPFDPDASVLGALLSACKIHGATELGNEVGRRLLE 425
           EHY CVVDLLG+AGLLREA+EFI  MPF+PDASVLGALL ACKIHGA +LGNEVG RLLE
Sbjct: 365 EHYGCVVDLLGRAGLLREASEFIRRMPFEPDASVLGALLGACKIHGAIDLGNEVGSRLLE 424

Query: 426 MQPRHCGRYVTLASMNAGAEKWNRAAVIRRVMADARIQKTPAYSRVD 467
           +QP+HCG+YV L+S++AG  +W  AA IR+ M +AR +K PA S +D
Sbjct: 425 LQPQHCGQYVALSSIHAGVNRWGVAADIRKTMVEARGRKVPACSLID 471

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR30_ARATH7.5e-12050.22Putative pentatricopeptide repeat-containing protein At1g10330 OS=Arabidopsis th... [more]
PP449_ARATH3.2e-7837.87Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN... [more]
PP371_ARATH6.0e-7737.76Pentatricopeptide repeat-containing protein At5g08510 OS=Arabidopsis thaliana GN... [more]
PP369_ARATH1.5e-7539.28Pentatricopeptide repeat-containing protein At5g08305 OS=Arabidopsis thaliana GN... [more]
PP435_ARATH4.9e-7134.46Putative pentatricopeptide repeat-containing protein At5g59200, chloroplastic OS... [more]
Match NameE-valueIdentityDescription
A0A0A0K5I5_CUCSA2.8e-275100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_7G043000 PE=4 SV=1[more]
B9NA20_POPTR8.0e-15360.13Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP... [more]
M5WYI9_PRUPE9.8e-15159.36Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023408mg PE=4 SV=1[more]
W9QQL2_9ROSA2.7e-14860.00Uncharacterized protein OS=Morus notabilis GN=L484_011924 PE=4 SV=1[more]
A0A061GGY2_THECC4.5e-14860.94Tetratricopeptide repeat-like superfamily protein, putative OS=Theobroma cacao G... [more]
Match NameE-valueIdentityDescription
AT1G10330.14.2e-12150.22 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G66520.11.8e-7937.87 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G08510.13.4e-7837.76 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G08305.18.3e-7739.28 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G59200.12.8e-7234.46 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449453704|ref|XP_004144596.1|4.1e-275100.00PREDICTED: putative pentatricopeptide repeat-containing protein At1g10330 [Cucum... [more]
gi|659110867|ref|XP_008455452.1|5.3e-25995.36PREDICTED: putative pentatricopeptide repeat-containing protein At1g10330 [Cucum... [more]
gi|566214128|ref|XP_006371823.1|1.1e-15260.13pentatricopeptide repeat-containing family protein [Populus trichocarpa][more]
gi|1009169010|ref|XP_015902967.1|4.8e-15160.52PREDICTED: putative pentatricopeptide repeat-containing protein At1g10330 [Zizip... [more]
gi|743917926|ref|XP_011002958.1|6.3e-15159.53PREDICTED: putative pentatricopeptide repeat-containing protein At1g10330 [Popul... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa7G043000.1Csa7G043000.1mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 184..213
score: 3.7E-5coord: 55..80
score: 0.025coord: 153..182
score: 2.7E-5coord: 262..287
score: 0.047coord: 362..385
score:
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 288..335
score: 6.4
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 289..323
score: 1.6E-8coord: 153..184
score: 3.2E-5coord: 55..86
score: 7.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 182..217
score: 9.591coord: 151..181
score: 8.539coord: 256..286
score: 6.467coord: 287..321
score: 11.597coord: 120..150
score: 6.423coord: 358..388
score: 5.634coord: 322..357
score: 7.465coord: 51..85
score:
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 5..465
score: 2.4E
NoneNo IPR availablePANTHERPTHR24015:SF729SUBFAMILY NOT NAMEDcoord: 5..465
score: 2.4E

The following gene(s) are paralogous to this gene:

None