CSPI04G06820 (gene) Wild cucumber (PI 183967)

Name: CSPI04G06820
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing family protein
Location: Chr4 : 4755455 .. 4758134 (-)
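
The location entry combines a chromosome name, a coordinate range, and a strand. As a minimal sketch, the snippet below parses that string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the function names are illustrative only.

```python
import re


def parse_location(text):
    """Parse a string like 'Chr4 : 4755455 .. 4758134 (-)'.

    Returns (chromosome, start, end, strand). The layout is taken from
    the location entry on this page; other pages may differ.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr4 : 4755455 .. 4758134 (-)")
print(chrom, strand, span_length(start, end))  # Chr4 - 2680
```

Under that assumption the feature spans 2680 bases on the minus strand of Chr4.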
The following sequences are available for this feature:

Gene sequence (with intron)

CTCCTGGGCTCCACGTTCTTCTCCTCCCGCCCTCTACTCCCTCTCCGCCTCCGACCTCGCGGCGTTTTTCCTTTCACCCTAGCTCAGCCTCCATCCTTTACAATCCATCTCAATATTCTTCTTCTTCCTCCTCCTCTATTGAACCTCATCAGTTCAATCTTCCTTAAACCTATAATTTGCAATTTTAGTTCTTCAATGCATTTGACAAGATTTAAAATCAGTAAGACAACTCCTGTATTGTTTCCCTTCTCTCGTCGGCTGGTCTGTGTGTCTTCCACCCAACCGCATAAAGAACACCATCAGGATCCGCCCTGGCAGTCCCAGGATCAGTTGCATCTTTGGGTATCTTCTGTTCTTTCTCATTCATCTCTCGACTCTTCTAAATGTAGTGCTCTCTTACCCCATTTGTCTCCTTCTCAATTTGATCAGCTCTTCTTCTCCATTGGATTGAAAGCCAACCCCATGACTTGTCTTAATTTTTTTTACTTTGCGTCTAATTCTTTCAAATTTCGATTTACCATCCATTCTTGTTGTATATTGATTCTTTTGCTTATTCGTTCTAAGTTTATACCCCCCGCAAGACTGCTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTTGGATTCAGAAAAGTTTCACATTGAGATAGCTAATGCATTGTTTGGTTTAACTTCAGTTGTTGGGCGGTTTGAATGGACACAGGCATTTGATTTATTGATACATGTATACAGCACACAATTCAGAAATCTTGGCTTTAGTTGCGCTGTGGATGTGTTTTATTTGCTTGCTCGTAAGGGTACCTTTCCATCGTTAAAGACTTGTAATTTTTTATTGAGCTCGTTGGTAAAGGCTAATGAATTTGAGAAGTGTTGTGAAGTATTTCGAGTGATGTCCGAAGGAGCTTGTCCAGATGTTTTCTCATTTACGAACGTGATAAATGCTTTGTGCAAGGGAGGGAAGATGGAAAACGCCATTGAGTTATTCATGAAAATGGAGAAGTTGGGGATTTCTCCCAATGTTGTTACTTATAATTGTATTATTAATGGTTTATGCCAGAATGGGAGATTAGACAATGCCTTTGAGCTCAAAGAGAAGATGACAGTGAAAGGGGTACAGCCAAATCTTAAAACTTATGGTGCGCTTATTAATGGTTTGATAAAACTAAACTTTTTTGACAAAGTGAATCATGTTTTAGATGAAATGATTGGTGCGGGTTTTAATCCAAATGTAGTTGTCTTCAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCGAAGGAGCACTTAAGATCAAAGATGTGATGATATCCAAAAATATAACTCCTACTTCAGTTACTTTATATAGTCTCATGCAAGGATTTTGCAAAAGTGATCAAATTGAGCATGCAGAGAATGCCCTTGAGGAGATATTATCAAGTGGGCTATCTATACACCCGGATAATTGTTATTCGGTTGTCCACTGGCTATGTAAAAAGTTTAGGTACCATTCTGCATTCCGATTTACTAAGATGATGTTATCTAGGAACTTCAGGCCTAGTGATCTACTCTTAACCATGTTGGTATGTGGGTTGTGCAAGGATGGTAAACATTTAGAAGCAACTGAACTTTGGTTTAGGTTATTGGAGAAAGGGTCTCCAGCAAGTAAGGTGACCTCCAATGCTCTAATACATGGACTTTGTGGGGCTGGTAAATTGCCAGAGGCTTCTAGAATTGTCAAAGAGATGTTAGAGAGGGGTCTTCCAATGGATCGGATCACATACAATGCACTCATCTTAGGTTTTTGCAATGAGGGAAAAGTTGAGGGATGCTTTAGACTTAGAGAAGAGATGACCAAACGAGGAATTCAGCCAGATATCTATACTTACAATTTTCTATTGCGTGGACTGTGCAATGTAGGAAAATTGGATGATGCTATTAAACTTTGGGATGAATTCAAAGCTAGTGGGCTGATTTCTAACATTCACACTTACGGGATAATGATGGATGG
TTATTGTAAAGCTAACAGAATCGAAGATGTTGAAAATTTATTTAATGAATTGCTCTCTAAGAAAATGGAGCTGAATTCCATTGTCTACAATATAATTATCAAAGCACATTGCCAGAATGGAAATGTAGCTGCAGCTTTGCAACTTCTTGAAAATATGAAAAGCAAGGGAATTTTACCAAATTGTGCCACGTATTCTTCTCTAATACACGGCGTGTGCAACATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAAAGGAAGGATTTGTGCCGAATGTTGTTTGCTATACTGCATTAATTGGCGGTTATTGTAAGCTGGGGCAAATGGATACTGCTGAATCTACTTGGCTTGAGATGATCTCTTTTAACATACATCCTAACAAATTTACCTACACTGTCATGATCGACGGCTACTGTAAATTAGGGAATATGGAAAAAGCAAATAACCTTCTGATAAAAATGAAAGAAAGTGGAATCGTCCCAGATGTTGTTACTTACAATGTCTTGACTAATGGATTTTGTAAGGCAAATGACATGGACAATGCTTTTAAAGTATGTGATCAAATGGCCACTGAAGGATTACCTGTAGATGAAATTACTTACACTACACTCGTACATGGTTGGAATCCACCTACAATCACTGGCCAAGACTGATCGAATTTCTGCAGAG

mRNA sequence

ATGCATTTGACAAGATTTAAAATCAGTAAGACAACTCCTGTATTGTTTCCCTTCTCTCGTCGGCTGGTCTGTGTGTCTTCCACCCAACCGCATAAAGAACACCATCAGGATCCGCCCTGGCAGTCCCAGGATCAGTTGCATCTTTGGGTATCTTCTGTTCTTTCTCATTCATCTCTCGACTCTTCTAAATGTAGTGCTCTCTTACCCCATTTGTCTCCTTCTCAATTTGATCAGCTCTTCTTCTCCATTGGATTGAAAGCCAACCCCATGACTTGTCTTAATTTTTTTTACTTTGCGTCTAATTCTTTCAAATTTCGATTTACCATCCATTCTTGTTGTATATTGATTCTTTTGCTTATTCGTTCTAAGTTTATACCCCCCGCAAGACTGCTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTTGGATTCAGAAAAGTTTCACATTGAGATAGCTAATGCATTGTTTGGTTTAACTTCAGTTGTTGGGCGGTTTGAATGGACACAGGCATTTGATTTATTGATACATGTATACAGCACACAATTCAGAAATCTTGGCTTTAGTTGCGCTGTGGATGTGTTTTATTTGCTTGCTCGTAAGGGTACCTTTCCATCGTTAAAGACTTGTAATTTTTTATTGAGCTCGTTGGTAAAGGCTAATGAATTTGAGAAGTGTTGTGAAGTATTTCGAGTGATGTCCGAAGGAGCTTGTCCAGATGTTTTCTCATTTACGAACGTGATAAATGCTTTGTGCAAGGGAGGGAAGATGGAAAACGCCATTGAGTTATTCATGAAAATGGAGAAGTTGGGGATTTCTCCCAATGTTGTTACTTATAATTGTATTATTAATGGTTTATGCCAGAATGGGAGATTAGACAATGCCTTTGAGCTCAAAGAGAAGATGACAGTGAAAGGGGTACAGCCAAATCTTAAAACTTATGGTGCGCTTATTAATGGTTTGATAAAACTAAACTTTTTTGACAAAGTGAATCATGTTTTAGATGAAATGATTGGTGCGGGTTTTAATCCAAATGTAGTTGTCTTCAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCGAAGGAGCACTTAAGATCAAAGATGTGATGATATCCAAAAATATAACTCCTACTTCAGTTACTTTATATAGTCTCATGCAAGGATTTTGCAAAAGTGATCAAATTGAGCATGCAGAGAATGCCCTTGAGGAGATATTATCAAGTGGGCTATCTATACACCCGGATAATTGTTATTCGGTTGTCCACTGGCTATGTAAAAAGTTTAGGTACCATTCTGCATTCCGATTTACTAAGATGATGTTATCTAGGAACTTCAGGCCTAGTGATCTACTCTTAACCATGTTGGTATGTGGGTTGTGCAAGGATGGTAAACATTTAGAAGCAACTGAACTTTGGTTTAGGTTATTGGAGAAAGGGTCTCCAGCAAGTAAGGTGACCTCCAATGCTCTAATACATGGACTTTGTGGGGCTGGTAAATTGCCAGAGGCTTCTAGAATTGTCAAAGAGATGTTAGAGAGGGGTCTTCCAATGGATCGGATCACATACAATGCACTCATCTTAGGTTTTTGCAATGAGGGAAAAGTTGAGGGATGCTTTAGACTTAGAGAAGAGATGACCAAACGAGGAATTCAGCCAGATATCTATACTTACAATTTTCTATTGCGTGGACTGTGCAATGTAGGAAAATTGGATGATGCTATTAAACTTTGGGATGAATTCAAAGCTAGTGGGCTGATTTCTAACATTCACACTTACGGGATAATGATGGATGGTTATTGTAAAGCTAACAGAATCGAAGATGTTGAAAATTTATTTAATGAATTGCTCTCTAAGAAAATGGAGCTGAATTCCATTGTCTACAATATAATTATCAAAGCACATTGCCAGAATGGAAATGTAGCTGCAGCTTTGCAACTTCTTGAAAATATGAAAAGCAAGGGAATTTTACCAAATTGTGCCACGTATTC
TTCTCTAATACACGGCGTGTGCAACATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAAAGGAAGGATTTGTGCCGAATGTTGTTTGCTATACTGCATTAATTGGCGGTTATTGTAAGCTGGGGCAAATGGATACTGCTGAATCTACTTGGCTTGAGATGATCTCTTTTAACATACATCCTAACAAATTTACCTACACTGTCATGATCGACGGCTACTGTAAATTAGGGAATATGGAAAAAGCAAATAACCTTCTGATAAAAATGAAAGAAAGTGGAATCGTCCCAGATGTTGTTACTTACAATGTCTTGACTAATGGATTTTGTAAGGCAAATGACATGGACAATGCTTTTAAAGTATGTGATCAAATGGCCACTGAAGGATTACCTGTAGATGAAATTACTTACACTACACTCGTACATGGTTGGAATCCACCTACAATCACTGGCCAAGACTGA

Coding sequence (CDS)

ATGCATTTGACAAGATTTAAAATCAGTAAGACAACTCCTGTATTGTTTCCCTTCTCTCGTCGGCTGGTCTGTGTGTCTTCCACCCAACCGCATAAAGAACACCATCAGGATCCGCCCTGGCAGTCCCAGGATCAGTTGCATCTTTGGGTATCTTCTGTTCTTTCTCATTCATCTCTCGACTCTTCTAAATGTAGTGCTCTCTTACCCCATTTGTCTCCTTCTCAATTTGATCAGCTCTTCTTCTCCATTGGATTGAAAGCCAACCCCATGACTTGTCTTAATTTTTTTTACTTTGCGTCTAATTCTTTCAAATTTCGATTTACCATCCATTCTTGTTGTATATTGATTCTTTTGCTTATTCGTTCTAAGTTTATACCCCCCGCAAGACTGCTTCTGATTCGTTTGATAGACGGGAATCTCCCGGTGTTGAATTTGGATTCAGAAAAGTTTCACATTGAGATAGCTAATGCATTGTTTGGTTTAACTTCAGTTGTTGGGCGGTTTGAATGGACACAGGCATTTGATTTATTGATACATGTATACAGCACACAATTCAGAAATCTTGGCTTTAGTTGCGCTGTGGATGTGTTTTATTTGCTTGCTCGTAAGGGTACCTTTCCATCGTTAAAGACTTGTAATTTTTTATTGAGCTCGTTGGTAAAGGCTAATGAATTTGAGAAGTGTTGTGAAGTATTTCGAGTGATGTCCGAAGGAGCTTGTCCAGATGTTTTCTCATTTACGAACGTGATAAATGCTTTGTGCAAGGGAGGGAAGATGGAAAACGCCATTGAGTTATTCATGAAAATGGAGAAGTTGGGGATTTCTCCCAATGTTGTTACTTATAATTGTATTATTAATGGTTTATGCCAGAATGGGAGATTAGACAATGCCTTTGAGCTCAAAGAGAAGATGACAGTGAAAGGGGTACAGCCAAATCTTAAAACTTATGGTGCGCTTATTAATGGTTTGATAAAACTAAACTTTTTTGACAAAGTGAATCATGTTTTAGATGAAATGATTGGTGCGGGTTTTAATCCAAATGTAGTTGTCTTCAATAATTTAATTGATGGATACTGCAAAATGGGAAATATCGAAGGAGCACTTAAGATCAAAGATGTGATGATATCCAAAAATATAACTCCTACTTCAGTTACTTTATATAGTCTCATGCAAGGATTTTGCAAAAGTGATCAAATTGAGCATGCAGAGAATGCCCTTGAGGAGATATTATCAAGTGGGCTATCTATACACCCGGATAATTGTTATTCGGTTGTCCACTGGCTATGTAAAAAGTTTAGGTACCATTCTGCATTCCGATTTACTAAGATGATGTTATCTAGGAACTTCAGGCCTAGTGATCTACTCTTAACCATGTTGGTATGTGGGTTGTGCAAGGATGGTAAACATTTAGAAGCAACTGAACTTTGGTTTAGGTTATTGGAGAAAGGGTCTCCAGCAAGTAAGGTGACCTCCAATGCTCTAATACATGGACTTTGTGGGGCTGGTAAATTGCCAGAGGCTTCTAGAATTGTCAAAGAGATGTTAGAGAGGGGTCTTCCAATGGATCGGATCACATACAATGCACTCATCTTAGGTTTTTGCAATGAGGGAAAAGTTGAGGGATGCTTTAGACTTAGAGAAGAGATGACCAAACGAGGAATTCAGCCAGATATCTATACTTACAATTTTCTATTGCGTGGACTGTGCAATGTAGGAAAATTGGATGATGCTATTAAACTTTGGGATGAATTCAAAGCTAGTGGGCTGATTTCTAACATTCACACTTACGGGATAATGATGGATGGTTATTGTAAAGCTAACAGAATCGAAGATGTTGAAAATTTATTTAATGAATTGCTCTCTAAGAAAATGGAGCTGAATTCCATTGTCTACAATATAATTATCAAAGCACATTGCCAGAATGGAAATGTAGCTGCAGCTTTGCAACTTCTTGAAAATATGAAAAGCAAGGGAATTTTACCAAATTGTGCCACGTATTC
TTCTCTAATACACGGCGTGTGCAACATTGGTCTTGTTGAAGATGCAAAGCATCTTATTGATGAAATGAGAAAGGAAGGATTTGTGCCGAATGTTGTTTGCTATACTGCATTAATTGGCGGTTATTGTAAGCTGGGGCAAATGGATACTGCTGAATCTACTTGGCTTGAGATGATCTCTTTTAACATACATCCTAACAAATTTACCTACACTGTCATGATCGACGGCTACTGTAAATTAGGGAATATGGAAAAAGCAAATAACCTTCTGATAAAAATGAAAGAAAGTGGAATCGTCCCAGATGTTGTTACTTACAATGTCTTGACTAATGGATTTTGTAAGGCAAATGACATGGACAATGCTTTTAAAGTATGTGATCAAATGGCCACTGAAGGATTACCTGTAGATGAAATTACTTACACTACACTCGTACATGGTTGGAATCCACCTACAATCACTGGCCAAGACTGA
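
The coding sequence above can be checked against the protein used as the BLAST query by translating it codon by codon. The sketch below is a hedged illustration: it assumes the standard genetic code (translation table 1) and, to stay short, translates only the first 45 bases copied from the CDS. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Trailing bases that do not fill a whole codon are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 45 bases of the CDS listed above.
cds_prefix = "ATGCATTTGACAAGATTTAAAATCAGTAAGACAACTCCTGTATTG"
print(translate(cds_prefix))  # MHLTRFKISKTTPVL
```

The result matches the first 15 residues of the query in the alignment against A0A0A0L008_CUCSA (MHLTRFKISKTTPVL).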
BLAST of CSPI04G06820 vs. Swiss-Prot
Match: PP325_ARATH (Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidopsis thaliana GN=At4g19440 PE=2 SV=2)

HSP 1 Score: 729.6 bits (1882), Expect = 3.9e-209
Identity = 374/773 (48.38%), Positives = 509/773 (65.85%), Query Frame = 1

Query: 42  SQDQLHLWVSSVLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASN 101
           S   LH  +SSVLS  SLD  +C  L+  LSP +FD+LF     K NP T L+FF  AS+
Sbjct: 72  SDRHLHERLSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASD 131

Query: 102 SFKFRFTIHSCCILILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGL 161
           SF F F++ S C+LI LL+ +  +  AR++LIRLI+GN+PVL        + IA+A+  L
Sbjct: 132 SFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLRDSRVAIADAMASL 191

Query: 162 TSVVGRFEWTQAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVK 221
           +         +  DLLI VY TQF+  G   A+DVF +LA KG FPS  TCN LL+SLV+
Sbjct: 192 SLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLVR 251

Query: 222 ANEFEKCCEVFRVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTY 281
           ANEF+KCCE F V+ +G  PDV+ FT  INA CKGGK+E A++LF KME+ G++PNVVT+
Sbjct: 252 ANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFCKGGKVEEAVKLFSKMEEAGVAPNVVTF 311

Query: 282 NCIINGLCQNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIG 341
           N +I+GL   GR D AF  KEKM  +G++P L TY  L+ GL +         VL EM  
Sbjct: 312 NTVIDGLGMCGRYDEAFMFKEKMVERGMEPTLITYSILVKGLTRAKRIGDAYFVLKEMTK 371

Query: 342 AGFNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEH 401
            GF PNV+V+NNLID + + G++  A++IKD+M+SK ++ TS T  +L++G+CK+ Q ++
Sbjct: 372 KGFPPNVIVYNNLIDSFIEAGSLNKAIEIKDLMVSKGLSLTSSTYNTLIKGYCKNGQADN 431

Query: 402 AENALEEILSSGLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVC 461
           AE  L+E+LS G +++  +  SV+  LC    + SA RF   ML RN  P   LLT L+ 
Sbjct: 432 AERLLKEMLSIGFNVNQGSFTSVICLLCSHLMFDSALRFVGEMLLRNMSPGGGLLTTLIS 491

Query: 462 GLCKDGKHLEATELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPM 521
           GLCK GKH +A ELWF+ L KG      TSNAL+HGLC AGKL EA RI KE+L RG  M
Sbjct: 492 GLCKHGKHSKALELWFQFLNKGFVVDTRTSNALLHGLCEAGKLDEAFRIQKEILGRGCVM 551

Query: 522 DRITYNALILGFCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLW 581
           DR++YN LI G C + K++  F   +EM KRG++PD YTY+ L+ GL N+ K+++AI+ W
Sbjct: 552 DRVSYNTLISGCCGKKKLDEAFMFLDEMVKRGLKPDNYTYSILICGLFNMNKVEEAIQFW 611

Query: 582 DEFKASGLISNIHTYGIMMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQN 641
           D+ K +G++ +++TY +M+DG CKA R E+ +  F+E++SK ++ N++VYN +I+A+C++
Sbjct: 612 DDCKRNGMLPDVYTYSVMIDGCCKAERTEEGQEFFDEMMSKNVQPNTVVYNHLIRAYCRS 671

Query: 642 GNVAAALQLLENMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCY 701
           G ++ AL+L E+MK KGI PN ATY+SLI G+  I  VE+AK L +EMR EG  PNV  Y
Sbjct: 672 GRLSMALELREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLLFEEMRMEGLEPNVFHY 731

Query: 702 TALIGGYCKLGQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKE 761
           TALI GY KLGQM   E    EM S N+HPNK TYTVMI GY + GN+ +A+ LL +M+E
Sbjct: 732 TALIDGYGKLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYARDGNVTEASRLLNEMRE 791

Query: 762 SGIVPDVVTYNVLTNGFCKANDMDNAFKVCDQMATEGLPVDEITYTTLVHGWN 815
            GIVPD +TY     G+ K   +  AFK            DE  Y  ++ GWN
Sbjct: 792 KGIVPDSITYKEFIYGYLKQGGVLEAFK----------GSDEENYAAIIEGWN 834
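
Each hit's summary line reports identity and positive counts over the alignment length. The following sketch shows one way to parse such a line and recompute the percentages from the raw counts. The regular expression and function name are assumptions made for illustration, not part of any BLAST tool; the pattern accepts both "Positives" and the "Postives" spelling that can appear in this output.

```python
import re

HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": ident_len,
        "identity_pct": 100.0 * ident / ident_len,
        "positive_pct": 100.0 * pos / pos_len,
    }


# Summary line of the PP325_ARATH hit above.
line = "Identity = 374/773 (48.38%), Positives = 509/773 (65.85%), Query Frame = 1"
stats = parse_hsp_summary(line)
print(f"{stats['identity_pct']:.2f} {stats['positive_pct']:.2f}")  # 48.38 65.85
```

Recomputing from the counts gives the same rounded values as the reported percentages for this hit.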

BLAST of CSPI04G06820 vs. Swiss-Prot
Match: PP437_ARATH (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 313.9 bits (803), Expect = 5.1e-84
Identity = 185/634 (29.18%), Positives = 313/634 (49.37%), Query Frame = 1

Query: 198 YLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGACP--DVF----------- 257
           +L   +G   S  +   L+ +LVKAN F     + + +   A    DVF           
Sbjct: 93  FLGLHRGFDHSTASFCILIHALVKANLFWPASSLLQTLLLRALKPSDVFNVLFSCYEKCK 152

Query: 258 -----SFTNVINALCKGGKMENAIELF-MKMEKLGISPNVVTYNCIINGLCQNGRLDNAF 317
                SF  +I    +  ++ + + +F M + K+ + P V T + +++GL +      A 
Sbjct: 153 LSSSSSFDLLIQHYVRSRRVLDGVLVFKMMITKVSLLPEVRTLSALLHGLVKFRHFGLAM 212

Query: 318 ELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVFNNLIDGY 377
           EL   M   G++P++  Y  +I  L +L    +   ++  M   G + N+V +N LIDG 
Sbjct: 213 ELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMIAHMEATGCDVNIVPYNVLIDGL 272

Query: 378 CKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHP 437
           CK   +  A+ IK  +  K++ P  VT  +L+ G CK  + E     ++E+L    S   
Sbjct: 273 CKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFEIGLEMMDEMLCLRFSPSE 332

Query: 438 DNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFR 497
               S+V  L K+ +   A    K ++     P+  +   L+  LCK  K  EA  L+ R
Sbjct: 333 AAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALIDSLCKGRKFHEAELLFDR 392

Query: 498 LLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGK 557
           + + G   + VT + LI   C  GKL  A   + EM++ GL +    YN+LI G C  G 
Sbjct: 393 MGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLKLSVYPYNSLINGHCKFGD 452

Query: 558 VEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGI 617
           +        EM  + ++P + TY  L+ G C+ GK++ A++L+ E    G+  +I+T+  
Sbjct: 453 ISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRLYHEMTGKGIAPSIYTFTT 512

Query: 618 MMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKG 677
           ++ G  +A  I D   LFNE+    ++ N + YN++I+ +C+ G+++ A + L+ M  KG
Sbjct: 513 LLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCEEGDMSKAFEFLKEMTEKG 572

Query: 678 ILPNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAE 737
           I+P+  +Y  LIHG+C  G   +AK  +D + K     N +CYT L+ G+C+ G+++ A 
Sbjct: 573 IVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEAL 632

Query: 738 STWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYNVLTNGF 797
           S   EM+   +  +   Y V+IDG  K  + +    LL +M + G+ PD V Y  + +  
Sbjct: 633 SVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMIDAK 692

Query: 798 CKANDMDNAFKVCDQMATEGLPVDEITYTTLVHG 813
            K  D   AF + D M  EG   +E+TYT +++G
Sbjct: 693 SKTGDFKEAFGIWDLMINEGCVPNEVTYTAVING 726

BLAST of CSPI04G06820 vs. Swiss-Prot
Match: PP120_ARATH (Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis thaliana GN=At1g74580 PE=3 SV=1)

HSP 1 Score: 311.2 bits (796), Expect = 3.3e-83
Identity = 176/609 (28.90%), Positives = 307/609 (50.41%), Query Frame = 1

Query: 193 AVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVF-RVMSEGACPDVFSFTNVIN 252
           AV+VF  +      P++ + N ++S LV +  F++  +V+ R+   G  PDV+SFT  + 
Sbjct: 95  AVNVFERMDFYDCEPTVFSYNAIMSVLVDSGYFDQAHKVYMRMRDRGITPDVYSFTIRMK 154

Query: 253 ALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFELKEKMTVKGVQP 312
           + CK  +   A+ L   M   G   NVV Y  ++ G  +       +EL  KM   GV  
Sbjct: 155 SFCKTSRPHAALRLLNNMSSQGCEMNVVAYCTVVGGFYEENFKAEGYELFGKMLASGVSL 214

Query: 313 NLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVFNNLIDGYCKMGNIEGALKIK 372
            L T+  L+  L K     +   +LD++I  G  PN+  +N  I G C+ G ++GA+++ 
Sbjct: 215 CLSTFNKLLRVLCKKGDVKECEKLLDKVIKRGVLPNLFTYNLFIQGLCQRGELDGAVRMV 274

Query: 373 DVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDNCYSVVHWLCKK 432
             +I +   P  +T  +L+ G CK+ + + AE  L ++++ GL        +++   CK 
Sbjct: 275 GCLIEQGPKPDVITYNNLIYGLCKNSKFQEAEVYLGKMVNEGLEPDSYTYNTLIAGYCKG 334

Query: 433 FRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLLEKGSPASKVTS 492
                A R     +   F P       L+ GLC +G+   A  L+   L KG   + +  
Sbjct: 335 GMVQLAERIVGDAVFNGFVPDQFTYRSLIDGLCHEGETNRALALFNEALGKGIKPNVILY 394

Query: 493 NALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVEGCFRLREEMTK 552
           N LI GL   G + EA+++  EM E+GL  +  T+N L+ G C  G V     L + M  
Sbjct: 395 NTLIKGLSNQGMILEAAQLANEMSEKGLIPEVQTFNILVNGLCKMGCVSDADGLVKVMIS 454

Query: 553 RGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMMDGYCKANRIED 612
           +G  PDI+T+N L+ G     K+++A+++ D    +G+  +++TY  +++G CK ++ ED
Sbjct: 455 KGYFPDIFTFNILIHGYSTQLKMENALEILDVMLDNGVDPDVYTYNSLLNGLCKTSKFED 514

Query: 613 VENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGILPNCATYSSLIH 672
           V   +  ++ K    N   +NI++++ C+   +  AL LLE MK+K + P+  T+ +LI 
Sbjct: 515 VMETYKTMVEKGCAPNLFTFNILLESLCRYRKLDEALGLLEEMKNKSVNPDAVTFGTLID 574

Query: 673 GVCNIGLVEDAKHLIDEMRKEGFV-PNVVCYTALIGGYCKLGQMDTAESTWLEMISFNIH 732
           G C  G ++ A  L  +M +   V  +   Y  +I  + +   +  AE  + EM+   + 
Sbjct: 575 GFCKNGDLDGAYTLFRKMEEAYKVSSSTPTYNIIIHAFTEKLNVTMAEKLFQEMVDRCLG 634

Query: 733 PNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYNVLTNGFCKANDMDNAFKV 792
           P+ +TY +M+DG+CK GN+      L++M E+G +P + T   + N  C  + +  A  +
Sbjct: 635 PDGYTYRLMVDGFCKTGNVNLGYKFLLEMMENGFIPSLTTLGRVINCLCVEDRVYEAAGI 694

Query: 793 CDQMATEGL 800
             +M  +GL
Sbjct: 695 IHRMVQKGL 703

BLAST of CSPI04G06820 vs. Swiss-Prot
Match: PP445_ARATH (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 309.3 bits (791), Expect = 1.3e-82
Identity = 224/826 (27.12%), Positives = 386/826 (46.73%), Query Frame = 1

Query: 7   KISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLDSSKCSA 66
           K S    V  P +RR  C  S  P   +  +    S    H  +S +   +   S    +
Sbjct: 26  KFSTDVTVPSPVTRRQFC--SVSPLLRNLPEEESDSMSVPHRLLSILSKPNWHKSPSLKS 85

Query: 67  LLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSCCILILLLIRSKFIP 126
           ++  +SPS    LF    L  +P T LNF ++ S + +++ +++S   L+ LLI + ++ 
Sbjct: 86  MVSAISPSHVSSLF---SLDLDPKTALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVG 145

Query: 127 PA---RLLLIRLIDG---NLPVLNL-----DSEKFHIE----------IANAL--FGLTS 186
                RLL+I+  D     L VL+L       E+F ++          + N+L  FGL  
Sbjct: 146 VVFKIRLLMIKSCDSVGDALYVLDLCRKMNKDERFELKYKLIIGCYNTLLNSLARFGLVD 205

Query: 187 VVGRFEWTQAFDLLIHVYSTQFRNLGFSC-------AVDVFYLLARKGTFPSLKTCNFLL 246
            + +       D +     T  + +   C       A      +   G  P   T   L+
Sbjct: 206 EMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLI 265

Query: 247 SSLVKANEFEKCCEVFRVMSEGACP-DVFSFTNVINALCKGGKMENAIELFMKMEKLGIS 306
               +  + +   +VF  M    C  +  ++T++I+ LC   +++ A++LF+KM+     
Sbjct: 266 MGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECF 325

Query: 307 PNVVTYNCIINGLCQNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHV 366
           P V TY  +I  LC + R   A  L ++M   G++PN+ TY  LI+ L     F+K   +
Sbjct: 326 PTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEKAREL 385

Query: 367 LDEMIGAGFNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCK 426
           L +M+  G  PNV+ +N LI+GYCK G IE A+ + ++M S+ ++P + T   L++G+CK
Sbjct: 386 LGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCK 445

Query: 427 SDQIEHAENALEEILSSGLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLL 486
           S+ +  A   L ++L   +        S++   C+   + SA+R   +M  R   P    
Sbjct: 446 SN-VHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWT 505

Query: 487 LTMLVCGLCKDGKHLEATELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEML 546
            T ++  LCK  +  EA +L+  L +KG   + V   ALI G C AGK+ EA  ++++ML
Sbjct: 506 YTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLEKML 565

Query: 547 ERGLPMDRITYNALILGFCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLD 606
            +    + +T+NALI G C +GK++    L E+M K G+QP + T   L+  L   G  D
Sbjct: 566 SKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLLKDGDFD 625

Query: 607 DAIKLWDEFKASGLISNIHTYGIMMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIII 666
            A   + +  +SG   + HTY   +  YC+  R+ D E++  ++    +  +   Y+ +I
Sbjct: 626 HAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLFTYSSLI 685

Query: 667 KAHCQNGNVAAALQLLENMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLIDEM--RKEG 726
           K +   G    A  +L+ M+  G  P+  T+ SLI            KHL++    +++G
Sbjct: 686 KGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLI------------KHLLEMKYGKQKG 745

Query: 727 FVPNVVCYTALIGGYCKLGQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKAN 786
             P +   + ++       + DT      +M+  ++ PN  +Y  +I G C++GN+  A 
Sbjct: 746 SEPELCAMSNMM-------EFDTVVELLEKMVEHSVTPNAKSYEKLILGICEVGNLRVAE 805

Query: 787 NLLIKM-KESGIVPDVVTYNVLTNGFCKANDMDNAFKVCDQMATEG 799
            +   M +  GI P  + +N L +  CK    + A KV D M   G
Sbjct: 806 KVFDHMQRNEGISPSELVFNALLSCCCKLKKHNEAAKVVDDMICVG 826

BLAST of CSPI04G06820 vs. Swiss-Prot
Match: PPR39_ARATH (Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidopsis thaliana GN=At1g12775 PE=2 SV=1)

HSP 1 Score: 307.0 bits (785), Expect = 6.3e-82
Identity = 164/604 (27.15%), Positives = 314/604 (51.99%), Query Frame = 1

Query: 209 LKTCNFLLSSLVKANEFEKCCEV-FRVMSEGACPDVFSFTNVINALCKGGKMENAIELFM 268
           L+T    ++ +   NE   CCE  F   S+       S+ + +++   G K ++A++LF 
Sbjct: 22  LETGTLRIALINCPNELLFCCERGFSTFSDRN----LSYRDKLSSGLVGIKADDAVDLFR 81

Query: 269 KMEKLGISPNVVTYNCIINGLCQNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLN 328
            M +    P V+ +N + + + +  + +    L ++M  KG+  ++ T   +IN   +  
Sbjct: 82  DMIQSRPLPTVIDFNRLFSAIAKTKQYELVLALCKQMESKGIAHSIYTLSIMINCFCRCR 141

Query: 329 FFDKVNHVLDEMIGAGFNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLY 388
                   + +++  G+ P+ V+FN L++G C    +  AL++ D M+     PT +TL 
Sbjct: 142 KLSYAFSTMGKIMKLGYEPDTVIFNTLLNGLCLECRVSEALELVDRMVEMGHKPTLITLN 201

Query: 389 SLMQGFCKSDQIEHAENALEEILSSGLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSR 448
           +L+ G C + ++  A   ++ ++ +G   +      V++ +CK  +   A    + M  R
Sbjct: 202 TLVNGLCLNGKVSDAVVLIDRMVETGFQPNEVTYGPVLNVMCKSGQTALAMELLRKMEER 261

Query: 449 NFRPSDLLLTMLVCGLCKDGKHLEATELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEA 508
           N +   +  ++++ GLCKDG    A  L+  +  KG  A  +T N LI G C AG+  + 
Sbjct: 262 NIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDG 321

Query: 509 SRIVKEMLERGLPMDRITYNALILGFCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRG 568
           ++++++M++R +  + +T++ LI  F  EGK+    +L +EM +RGI P+  TYN L+ G
Sbjct: 322 AKLLRDMIKRKISPNVVTFSVLIDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDG 381

Query: 569 LCNVGKLDDAIKLWDEFKASGLISNIHTYGIMMDGYCKANRIEDVENLFNELLSKKMELN 628
            C   +L++AI++ D   + G   +I T+ I+++GYCKANRI+D   LF E+  + +  N
Sbjct: 382 FCKENRLEEAIQMVDLMISKGCDPDIMTFNILINGYCKANRIDDGLELFREMSLRGVIAN 441

Query: 629 SIVYNIIIKAHCQNGNVAAALQLLENMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLID 688
           ++ YN +++  CQ+G +  A +L + M S+ + P+  +Y  L+ G+C+ G +E A  +  
Sbjct: 442 TVTYNTLVQGFCQSGKLEVAKKLFQEMVSRRVRPDIVSYKILLDGLCDNGELEKALEIFG 501

Query: 689 EMRKEGFVPNVVCYTALIGGYCKLGQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLG 748
           ++ K     ++  Y  +I G C   ++D A   +  +    +  +   Y +MI   C+  
Sbjct: 502 KIEKSKMELDIGIYMIIIHGMCNASKVDDAWDLFCSLPLKGVKLDARAYNIMISELCRKD 561

Query: 749 NMEKANNLLIKMKESGIVPDVVTYNVLTNGFCKANDMDNAFKVCDQMATEGLPVDEITYT 808
           ++ KA+ L  KM E G  PD +TYN+L       +D   A ++ ++M + G P D  T  
Sbjct: 562 SLSKADILFRKMTEEGHAPDELTYNILIRAHLGDDDATTAAELIEEMKSSGFPADVSTVK 621

Query: 809 TLVH 812
            +++
Sbjct: 622 MVIN 621

BLAST of CSPI04G06820 vs. TrEMBL
Match: A0A0A0L008_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055990 PE=4 SV=1)

HSP 1 Score: 1442.6 bits (3733), Expect = 0.0e+00
Identity = 706/710 (99.44%), Positives = 708/710 (99.72%), Query Frame = 1

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60
           MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD
Sbjct: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60

Query: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSCCILILLLI 120
           SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHS C LILLLI
Sbjct: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120

Query: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240
           YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC
Sbjct: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240

Query: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFEL 300
           PDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFEL
Sbjct: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFEL 300

Query: 301 KEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVFNNLIDGYCK 360
           KEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIG+GFNPNVVVFNNLIDGYCK
Sbjct: 301 KEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGSGFNPNVVVFNNLIDGYCK 360

Query: 361 MGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDN 420
           MGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDN
Sbjct: 361 MGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDN 420

Query: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLL 480
           CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVE 540
           EKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVE
Sbjct: 481 EKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVE 540

Query: 541 GCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMM 600
           GCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMM
Sbjct: 541 GCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMM 600

Query: 601 DGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGIL 660
           +GYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGIL
Sbjct: 601 EGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGIL 660

Query: 661 PNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCK 711
           PNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCK
Sbjct: 661 PNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCK 710

BLAST of CSPI04G06820 vs. TrEMBL
Match: D7TFE9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0087g00360 PE=4 SV=1)

HSP 1 Score: 980.3 bits (2533), Expect = 1.4e-282
Identity = 486/819 (59.34%), Positives = 600/819 (73.26%), Query Frame = 1

Query: 8   ISKTTPVLFPFSRRLVCVSSTQPHKEH---HQDPPWQSQDQLHLWVSSVLSHSSLDSSKC 67
           + K TP+  P +R L CV+S  PH       Q+ P  S   L   V+S+LS+ SLDS++C
Sbjct: 8   LPKPTPIFCPIARPLTCVTSAAPHPPSPLPSQNQPPSSDHALLKSVTSILSNPSLDSTQC 67

Query: 68  SALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSCCILILLLIRSKF 127
             L+PHLSP QFD +FFS+    NP T LNFFYFAS+S  FRFT+ S C+L+  LI S F
Sbjct: 68  KQLIPHLSPHQFDSVFFSVRRNVNPKTALNFFYFASDSCGFRFTLRSYCVLMRSLIVSGF 127

Query: 128 IPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHVYSTQ 187
           + PARLLLIRLID  LPVL  D +  HIEIA+A+  L  V        A DLLIHVY TQ
Sbjct: 128 VSPARLLLIRLIDRKLPVLFGDPKNRHIEIASAMADLNEVGESGVAVAAVDLLIHVYCTQ 187

Query: 188 FRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGACPDVF 247
           FRN+GF  A+ VF  LA KG FP++KTC FLLSSLVKANE EK   VF  M +G  PDV+
Sbjct: 188 FRNVGFRNAIGVFRFLANKGVFPTVKTCTFLLSSLVKANELEKSYWVFETMRQGVSPDVY 247

Query: 248 SFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFELKEKM 307
            F+  INA CKGGK+E+AI+LF  MEKLG+SPNVVTYN +I+GLC++G LD AF  KEKM
Sbjct: 248 LFSTAINAFCKGGKVEDAIQLFFDMEKLGVSPNVVTYNNLIHGLCKHGNLDEAFRFKEKM 307

Query: 308 TVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVFNNLIDGYCKMGNI 367
              GV   L TY  LINGL+KL  F++ N VL E +  GF PN VV+N LIDGYCKMGN+
Sbjct: 308 VKDGVNATLITYSVLINGLMKLEKFNEANSVLKETLEKGFTPNEVVYNTLIDGYCKMGNL 367

Query: 368 EGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDNCYSV 427
             AL+I+  M+SK I P SVTL S++QGFCK  Q+E AE  LEE+LS G SI+P    ++
Sbjct: 368 GDALRIRGDMVSKGINPNSVTLNSIIQGFCKIGQMEQAECILEEMLSRGFSINPGAFTTI 427

Query: 428 VHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLLEKGS 487
           +HWLC   R+ SA RF + ML RN RP+D LLT LV GLCK+GKH +A ELWFRLLEKG 
Sbjct: 428 IHWLCMNSRFESALRFLREMLLRNMRPNDGLLTTLVGGLCKEGKHSDAVELWFRLLEKGF 487

Query: 488 PASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVEGCFR 547
            A+ VT+NALIHGLC  G + EA R++K+MLERG  +D+ITYN LI G C EGKVE  F+
Sbjct: 488 GANLVTTNALIHGLCKTGNMQEAVRLLKKMLERGFVLDKITYNTLISGCCKEGKVEEGFK 547

Query: 548 LREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMMDGYC 607
           LR EM K+GI+PD +TYN L+ G+C +GKLD+A+ LW+E K+  L+ N++TYG+M+DGYC
Sbjct: 548 LRGEMVKQGIEPDTFTYNLLIHGMCRIGKLDEAVNLWNECKSRDLVPNVYTYGVMIDGYC 607

Query: 608 KANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGILPNCA 667
           KA++IE+ E LF ELL++ +ELNS+VYN +I+A+C+NGN   A +L ++M+SKGI P  A
Sbjct: 608 KADKIEEGEKLFTELLTQNLELNSVVYNTLIRAYCRNGNTVEAFKLHDDMRSKGIPPTTA 667

Query: 668 TYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAESTWLEM 727
           TYSSLIHG+CNIG +EDAK LIDEMRKEG +PNVVCYTALIGGYCKLGQMD   +   EM
Sbjct: 668 TYSSLIHGMCNIGRMEDAKCLIDEMRKEGLLPNVVCYTALIGGYCKLGQMDKVVNVLQEM 727

Query: 728 ISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYNVLTNGFCKANDM 787
            S++IHPNK TYTVMIDGY K G+M+ A  LL +M   GIVPD VTYNVLTNGFCK   +
Sbjct: 728 SSYDIHPNKITYTVMIDGYSKSGDMKTAAKLLHEMVGKGIVPDTVTYNVLTNGFCKEGKI 787

Query: 788 DNAFKVCDQMATEGLPVDEITYTTLVHGWNPPT-ITGQD 823
           +  FK+CD M+ EGLP+DEITYTTLVHGW  P+ +T Q+
Sbjct: 788 EEGFKICDYMSQEGLPLDEITYTTLVHGWQQPSALTNQE 826

BLAST of CSPI04G06820 vs. TrEMBL
Match: A0A067E580_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003295mg PE=4 SV=1)

HSP 1 Score: 978.0 bits (2527), Expect = 7.0e-282
Identity = 500/831 (60.17%), Positives = 604/831 (72.68%), Query Frame = 1

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSST-QPHKEHH------QDPPWQSQDQLHL-WVSS 60
           M L R  I K   +    SR L  V+ST Q  +E H      Q PP QS +Q  L WVSS
Sbjct: 1   MDLRRLSIPKPCSLSIAVSRPLTHVTSTAQQQQELHNRNQQQQPPPPQSSNQSLLKWVSS 60

Query: 61  VLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSC 120
           VLS  SLD SKC   LP+LSP +FD LFFSI    NP T L FFYFAS S  FRFT+ S 
Sbjct: 61  VLSKQSLDPSKCKLFLPNLSPQEFDTLFFSIRSNVNPKTALKFFYFASQSCNFRFTVRSY 120

Query: 121 CILILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKF-HIEIANALFGLTSVVGRFEWT 180
           C+LI LL+ S  + PARLLLIRLIDG +PVL   +    HIEIA+ +  L          
Sbjct: 121 CLLIRLLLFSNLLSPARLLLIRLIDGKMPVLYASNPSIRHIEIASQMVDLNVTSEPALGV 180

Query: 181 QAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEV 240
           Q  DLL+HVY TQF+NLGF  A+DVF + + KG FPSLKTCNFLL+SLVKANE +K  EV
Sbjct: 181 QIADLLVHVYCTQFKNLGFGYAIDVFSIFSSKGIFPSLKTCNFLLNSLVKANEVQKGIEV 240

Query: 241 FRVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQN 300
           F  M  G  PDVF F+  INA CK G++E+AI LF KME+LGI+PNVVTYN II+GLC+N
Sbjct: 241 FETMCRGVSPDVFLFSTAINAFCKRGRIEDAIGLFTKMEELGIAPNVVTYNNIIHGLCRN 300

Query: 301 GRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVF 360
           GRL  AF LKEKM ++ V+P+L TY  LINGLIKL  FD  N VL EM   GF PN VV+
Sbjct: 301 GRLYEAFHLKEKMVLREVEPSLITYSILINGLIKLEKFDDANFVLKEMSVRGFVPNYVVY 360

Query: 361 NNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILS 420
           N LIDGYCK GNI  ALKI+D M+SK ++P SVT  SL+ GFCKS Q+++AENALEE+LS
Sbjct: 361 NTLIDGYCKKGNISEALKIRDDMVSKGMSPNSVTFNSLIHGFCKSGQMDNAENALEEMLS 420

Query: 421 SGLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLE 480
            GLSI+     SV+ WLC   R++SA  FTK ML RN RP D LLT+LV GLCK+GK  E
Sbjct: 421 RGLSINQGAYTSVIKWLCINSRFNSALHFTKEMLLRNLRPGDGLLTLLVSGLCKNGKQAE 480

Query: 481 ATELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALIL 540
           ATEL FRL EKG   + VTSNALIHG+C AG L EA +++ EML+RGL +D++TYN LIL
Sbjct: 481 ATELCFRLFEKGFTVNTVTSNALIHGMCEAGNLKEAGKLLMEMLQRGLILDKVTYNTLIL 540

Query: 541 GFCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLIS 600
           G C +GK E  F+L+E+M KRGIQPD YTYN LL GLC++GK+++AI+LW+E K +    
Sbjct: 541 GCCKDGKPEEGFKLKEDMIKRGIQPDNYTYNLLLHGLCSLGKMEEAIELWEECKRTVFGP 600

Query: 601 NIHTYGIMMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLL 660
           +I+TYG+M+DG+CKA++IE+ E LFNE++SKKMELN +VYN +I+A+C+ GN  AA +L 
Sbjct: 601 DIYTYGVMIDGFCKADKIEEGETLFNEMISKKMELNPVVYNTLIRAYCKIGNTTAAFRLS 660

Query: 661 ENMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKL 720
            +MKS+GILP   TYSSLIHG+CNIGL+EDAK L DEMRKEG +PNV CYTALIGGYCKL
Sbjct: 661 NDMKSRGILPTSVTYSSLIHGLCNIGLIEDAKCLFDEMRKEGLLPNVACYTALIGGYCKL 720

Query: 721 GQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTY 780
           GQMD AES   EM S NIHPNK TYT+MI GYCKLG+M++A  LL  M E GI PD +TY
Sbjct: 721 GQMDEAESVLQEMASINIHPNKITYTIMIGGYCKLGDMKEAAKLLNVMAEKGISPDSITY 780

Query: 781 NVLTNGFCKANDMDNAFKVCDQMATEGLPVDEITYTTLVHGWNPPTITGQD 823
           NV  +G CK  +++ AFKVCD+M +EGL +DEITYTTL+ GW   TIT QD
Sbjct: 781 NVFMDGHCKGGNVEEAFKVCDRMLSEGLSLDEITYTTLIDGWQSSTITNQD 831

BLAST of CSPI04G06820 vs. TrEMBL
Match: V4V2M8_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000274mg PE=4 SV=1)

HSP 1 Score: 977.6 bits (2526), Expect = 9.2e-282
Identity = 500/831 (60.17%), Positives = 603/831 (72.56%), Query Frame = 1

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSST-QPHKEHH------QDPPWQSQDQLHL-WVSS 60
           M L R  I K   +    SR L  V+ST Q  +E H      Q PP QS +Q  L WVSS
Sbjct: 1   MDLRRLSIPKPCSLSIAVSRPLTHVTSTAQQQQELHNRNQQQQPPPPQSSNQSLLKWVSS 60

Query: 61  VLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSC 120
           VLS  SLD SKC   LP+LSP +FD LFFSI    NP T L FFYFAS S  FRFT+ S 
Sbjct: 61  VLSKQSLDPSKCKLFLPNLSPQEFDTLFFSIRSNVNPKTALKFFYFASQSCNFRFTVRSY 120

Query: 121 CILILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKF-HIEIANALFGLTSVVGRFEWT 180
           C+LI LL+ S  + PARLLLIRLIDG +PVL   +    HIEIA+ +  L          
Sbjct: 121 CLLIRLLLFSNLLSPARLLLIRLIDGKMPVLYASNPSIRHIEIASQMVDLNVTSEPALGV 180

Query: 181 QAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEV 240
           Q  DLL+HVY TQF+NLGF  A+DVF + + KG FPSLKTCNFLL+SLVKANE +K  EV
Sbjct: 181 QIADLLVHVYCTQFKNLGFGYAIDVFSIFSNKGIFPSLKTCNFLLNSLVKANEVQKGIEV 240

Query: 241 FRVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQN 300
           F  M  G  PDVF F+  INA CK G++E+AI LF KME+LGI+PNVVTYN II+GLC+N
Sbjct: 241 FETMCRGVSPDVFLFSTAINAFCKRGRIEDAIGLFTKMEELGIAPNVVTYNNIIHGLCRN 300

Query: 301 GRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVF 360
           GRL  AF LKEKM ++ V+P+L TY  LINGLIKL  FD  N VL EM   GF PN VV+
Sbjct: 301 GRLYEAFHLKEKMVLREVEPSLITYSILINGLIKLEKFDDANFVLKEMSVRGFVPNYVVY 360

Query: 361 NNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILS 420
           N LIDGYCK GNI  ALKI+D M+SK ++P SVT  SL+ GFCKS Q+++AENALEE+LS
Sbjct: 361 NTLIDGYCKKGNISEALKIRDDMVSKGMSPNSVTFNSLIHGFCKSGQMDNAENALEEMLS 420

Query: 421 SGLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLE 480
            GLSI+     SV+ WLC   R+ SA  FTK ML RN RP D LLT+LV GLCK+GK  E
Sbjct: 421 RGLSINQGAYTSVIKWLCINSRFDSALHFTKEMLLRNLRPGDGLLTLLVSGLCKNGKQAE 480

Query: 481 ATELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALIL 540
           ATEL FRL EKG   + VTSNALIHG+C AG L EA +++ EML+RGL +D++TYN LIL
Sbjct: 481 ATELCFRLFEKGFTVNTVTSNALIHGMCEAGNLKEAGKLLMEMLQRGLILDKVTYNTLIL 540

Query: 541 GFCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLIS 600
           G C +GK E  F+L+E+M KRGIQPD YTYN LL GLC++GK+++AI+LW+E K +    
Sbjct: 541 GCCKDGKPEEGFKLKEDMIKRGIQPDNYTYNLLLHGLCSLGKMEEAIELWEECKRTVFGP 600

Query: 601 NIHTYGIMMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLL 660
           +I+TYG+M+DG+CKA++IE+ E LFNE++SKKMELN +VYN +I+A+C+ GN  AA +L 
Sbjct: 601 DIYTYGVMIDGFCKADKIEEGETLFNEMISKKMELNPVVYNTLIRAYCKIGNTTAAFRLS 660

Query: 661 ENMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKL 720
            +MKS+GILP   TYSSLIHG+CNIGL+EDAK L DEMRKEG +PNV CYTALIGGYCKL
Sbjct: 661 NDMKSRGILPTSVTYSSLIHGLCNIGLIEDAKCLFDEMRKEGLLPNVACYTALIGGYCKL 720

Query: 721 GQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTY 780
           GQMD AES   EM S NIHPNK TYT+MI GYCKLG+M++A  LL  M E GI PD +TY
Sbjct: 721 GQMDEAESVLQEMASINIHPNKITYTIMIGGYCKLGDMKEAAKLLNVMAEKGISPDSITY 780

Query: 781 NVLTNGFCKANDMDNAFKVCDQMATEGLPVDEITYTTLVHGWNPPTITGQD 823
           NV  +G CK  +++ AFKVCD+M +EGL +DEITYTTL+ GW   TIT QD
Sbjct: 781 NVFMDGHCKGGNVEEAFKVCDRMLSEGLSLDEITYTTLIDGWQSSTITNQD 831
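
The percentages on each HSP summary line are the match counts divided by the alignment length. As a minimal sketch (the helper name `check_hsp_line` and the regular expression are illustrative assumptions, not part of BLAST or of this database), the reported values for the Citrus clementina hit above can be recomputed like this:

```python
import re

# Hypothetical helper, not part of any BLAST toolkit: re-derive the
# percentages printed on an HSP summary line from the raw match counts.
# The pattern matches every "matches/length (pct%)" group on the line,
# so it does not depend on the label text in front of each ratio.
RATIO = re.compile(r"(\d+)/(\d+) \((\d+\.\d+)%\)")

def check_hsp_line(line):
    """Return (matches, length, reported_pct, recomputed_pct) per ratio."""
    results = []
    for matches, length, reported in RATIO.findall(line):
        recomputed = round(100 * int(matches) / int(length), 2)
        results.append((int(matches), int(length), float(reported), recomputed))
    return results

# Summary line copied from the alignment above.
line = "Identity = 500/831 (60.17%), Positives = 603/831 (72.56%), Query Frame = 1"
for matches, length, reported, recomputed in check_hsp_line(line):
    print(f"{matches}/{length}: reported {reported}%, recomputed {recomputed}%")
```

The same check applies to any other summary line in this report, since each one uses the same "count/length (percent)" layout.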

BLAST of CSPI04G06820 vs. TrEMBL
Match: M5WX26_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001463mg PE=4 SV=1)

HSP 1 Score: 964.5 bits (2492), Expect = 8.0e-278
Identity = 486/798 (60.90%), Positives = 590/798 (73.93%), Query Frame = 1

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVS-STQPHKEHHQDPPWQ-------SQDQLHLWVSS 60
           M L R  ISK T +LF  +R L CV+ + Q  KE  Q PP Q           LH WVSS
Sbjct: 1   MDLRRLSISKPT-LLFRINRPLTCVTCNLQRPKEPPQPPPLQVPKEPQPPNQSLHNWVSS 60

Query: 61  VLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSC 120
           +LS  SLDSSKC AL+P LS  +FD++F SI    NP T L+FFYFAS SFKF+FT+ S 
Sbjct: 61  ILSKPSLDSSKCKALIPLLSSHEFDRVFCSISSNVNPKTALHFFYFASESFKFQFTVRSF 120

Query: 121 CILILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQ 180
           C+L+ LLI S  + PARLLLIRLIDGN+PVL  +  + H+EIA A+  L +V  +    Q
Sbjct: 121 CVLVRLLILSNLVSPARLLLIRLIDGNVPVLYANHNQRHMEIAIAMLDLNTVSTQGLGVQ 180

Query: 181 AFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVF 240
           A DLLIHVY TQF+N+GF  A+D F + ++KG FPSLKTCNFLLSSLVKANE  K  +VF
Sbjct: 181 ALDLLIHVYCTQFKNMGFGYAIDAFVIFSKKGVFPSLKTCNFLLSSLVKANELHKSYDVF 240

Query: 241 RVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNG 300
            VM  G  PDV+ FT  INA CKGGK+++AI LF KME LGI PNVVTYN II+GLC++ 
Sbjct: 241 EVMCRGVSPDVYLFTTAINAFCKGGKVDDAIGLFSKMEGLGIVPNVVTYNNIIHGLCKSR 300

Query: 301 RLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVFN 360
           RL  AF+ K+KM    V P+L TY  LINGLIKL  F   N VL EM   GF PN VV+N
Sbjct: 301 RLVEAFQFKKKMIENNVSPSLITYSVLINGLIKLEKFHDANCVLKEMCNRGFVPNEVVYN 360

Query: 361 NLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSS 420
            LIDGYCK GNI  ALKI+D M+S  +TP SVTL SL+QGFC+SDQ +HAE  L++I+S 
Sbjct: 361 TLIDGYCKTGNISEALKIRDNMLSNGLTPNSVTLNSLLQGFCRSDQFDHAEQVLDKIISG 420

Query: 421 GLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEA 480
           GLSI+   C+SV+HWLC K R+ SA +FT  ML RNFRPSD LLT LV GLCKDGKH EA
Sbjct: 421 GLSINQAVCFSVIHWLCMKSRFDSALKFTTEMLLRNFRPSDSLLTTLVGGLCKDGKHSEA 480

Query: 481 TELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILG 540
             LWFRL EKG  A+  TSNALIHGLC +  + E   ++K MLERGL +DRI+YN LILG
Sbjct: 481 LGLWFRLWEKGVAANTATSNALIHGLCESRSMQEVVMLLKPMLERGLVLDRISYNTLILG 540

Query: 541 FCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISN 600
            C EGKVE  F+L+EEM K+GI+PD YTYN L+ GLCN+GK+DDA+KLWDE +  GL+ N
Sbjct: 541 CCKEGKVEEGFKLKEEMAKQGIEPDTYTYNLLMHGLCNMGKVDDAVKLWDECENRGLVPN 600

Query: 601 IHTYGIMMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLE 660
           ++TYG+M+DGYC+A R+++ ENLF++L++K++ELNS+VYN +I+A+C +GN+ AAL L  
Sbjct: 601 VYTYGVMIDGYCQAGRMKEGENLFSKLVNKEVELNSVVYNTLIRAYCTDGNMTAALGLRC 660

Query: 661 NMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLG 720
           +MK KGI P+C TYSSLIHG+CNIG VEDAK L+DEMRK+G +PNVVCYTALI GYCKLG
Sbjct: 661 DMKKKGIQPSCGTYSSLIHGLCNIGDVEDAKCLLDEMRKDGLLPNVVCYTALIHGYCKLG 720

Query: 721 QMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYN 780
           QMD   S +LEM S NI PNK TYTVMIDGY KLGNME+A  LL +M + GI PD VTYN
Sbjct: 721 QMDKVRSAFLEMSSDNIQPNKITYTVMIDGYSKLGNMEEATKLLCEMAKMGIAPDAVTYN 780

Query: 781 VLTNGFCKANDMDNAFKV 791
            LTNGFCK   ++ AF+V
Sbjct: 781 ALTNGFCKERMVEEAFEV 797

BLAST of CSPI04G06820 vs. TAIR10
Match: AT4G19440.1 (AT4G19440.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 729.6 bits (1882), Expect = 2.2e-210
Identity = 374/773 (48.38%), Positives = 509/773 (65.85%), Query Frame = 1

Query: 42  SQDQLHLWVSSVLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASN 101
           S   LH  +SSVLS  SLD  +C  L+  LSP +FD+LF     K NP T L+FF  AS+
Sbjct: 59  SDRHLHERLSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASD 118

Query: 102 SFKFRFTIHSCCILILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGL 161
           SF F F++ S C+LI LL+ +  +  AR++LIRLI+GN+PVL        + IA+A+  L
Sbjct: 119 SFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLRDSRVAIADAMASL 178

Query: 162 TSVVGRFEWTQAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVK 221
           +         +  DLLI VY TQF+  G   A+DVF +LA KG FPS  TCN LL+SLV+
Sbjct: 179 SLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLVR 238

Query: 222 ANEFEKCCEVFRVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTY 281
           ANEF+KCCE F V+ +G  PDV+ FT  INA CKGGK+E A++LF KME+ G++PNVVT+
Sbjct: 239 ANEFQKCCEAFDVVCKGVSPDVYLFTTAINAFCKGGKVEEAVKLFSKMEEAGVAPNVVTF 298

Query: 282 NCIINGLCQNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIG 341
           N +I+GL   GR D AF  KEKM  +G++P L TY  L+ GL +         VL EM  
Sbjct: 299 NTVIDGLGMCGRYDEAFMFKEKMVERGMEPTLITYSILVKGLTRAKRIGDAYFVLKEMTK 358

Query: 342 AGFNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEH 401
            GF PNV+V+NNLID + + G++  A++IKD+M+SK ++ TS T  +L++G+CK+ Q ++
Sbjct: 359 KGFPPNVIVYNNLIDSFIEAGSLNKAIEIKDLMVSKGLSLTSSTYNTLIKGYCKNGQADN 418

Query: 402 AENALEEILSSGLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVC 461
           AE  L+E+LS G +++  +  SV+  LC    + SA RF   ML RN  P   LLT L+ 
Sbjct: 419 AERLLKEMLSIGFNVNQGSFTSVICLLCSHLMFDSALRFVGEMLLRNMSPGGGLLTTLIS 478

Query: 462 GLCKDGKHLEATELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPM 521
           GLCK GKH +A ELWF+ L KG      TSNAL+HGLC AGKL EA RI KE+L RG  M
Sbjct: 479 GLCKHGKHSKALELWFQFLNKGFVVDTRTSNALLHGLCEAGKLDEAFRIQKEILGRGCVM 538

Query: 522 DRITYNALILGFCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLW 581
           DR++YN LI G C + K++  F   +EM KRG++PD YTY+ L+ GL N+ K+++AI+ W
Sbjct: 539 DRVSYNTLISGCCGKKKLDEAFMFLDEMVKRGLKPDNYTYSILICGLFNMNKVEEAIQFW 598

Query: 582 DEFKASGLISNIHTYGIMMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQN 641
           D+ K +G++ +++TY +M+DG CKA R E+ +  F+E++SK ++ N++VYN +I+A+C++
Sbjct: 599 DDCKRNGMLPDVYTYSVMIDGCCKAERTEEGQEFFDEMMSKNVQPNTVVYNHLIRAYCRS 658

Query: 642 GNVAAALQLLENMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCY 701
           G ++ AL+L E+MK KGI PN ATY+SLI G+  I  VE+AK L +EMR EG  PNV  Y
Sbjct: 659 GRLSMALELREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLLFEEMRMEGLEPNVFHY 718

Query: 702 TALIGGYCKLGQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKE 761
           TALI GY KLGQM   E    EM S N+HPNK TYTVMI GY + GN+ +A+ LL +M+E
Sbjct: 719 TALIDGYGKLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYARDGNVTEASRLLNEMRE 778

Query: 762 SGIVPDVVTYNVLTNGFCKANDMDNAFKVCDQMATEGLPVDEITYTTLVHGWN 815
            GIVPD +TY     G+ K   +  AFK            DE  Y  ++ GWN
Sbjct: 779 KGIVPDSITYKEFIYGYLKQGGVLEAFK----------GSDEENYAAIIEGWN 821

BLAST of CSPI04G06820 vs. TAIR10
Match: AT5G59900.1 (AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 313.9 bits (803), Expect = 2.9e-85
Identity = 185/634 (29.18%), Positives = 313/634 (49.37%), Query Frame = 1

Query: 198 YLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGACP--DVF----------- 257
           +L   +G   S  +   L+ +LVKAN F     + + +   A    DVF           
Sbjct: 93  FLGLHRGFDHSTASFCILIHALVKANLFWPASSLLQTLLLRALKPSDVFNVLFSCYEKCK 152

Query: 258 -----SFTNVINALCKGGKMENAIELF-MKMEKLGISPNVVTYNCIINGLCQNGRLDNAF 317
                SF  +I    +  ++ + + +F M + K+ + P V T + +++GL +      A 
Sbjct: 153 LSSSSSFDLLIQHYVRSRRVLDGVLVFKMMITKVSLLPEVRTLSALLHGLVKFRHFGLAM 212

Query: 318 ELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVFNNLIDGY 377
           EL   M   G++P++  Y  +I  L +L    +   ++  M   G + N+V +N LIDG 
Sbjct: 213 ELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMIAHMEATGCDVNIVPYNVLIDGL 272

Query: 378 CKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHP 437
           CK   +  A+ IK  +  K++ P  VT  +L+ G CK  + E     ++E+L    S   
Sbjct: 273 CKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFEIGLEMMDEMLCLRFSPSE 332

Query: 438 DNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFR 497
               S+V  L K+ +   A    K ++     P+  +   L+  LCK  K  EA  L+ R
Sbjct: 333 AAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALIDSLCKGRKFHEAELLFDR 392

Query: 498 LLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGK 557
           + + G   + VT + LI   C  GKL  A   + EM++ GL +    YN+LI G C  G 
Sbjct: 393 MGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLKLSVYPYNSLINGHCKFGD 452

Query: 558 VEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGI 617
           +        EM  + ++P + TY  L+ G C+ GK++ A++L+ E    G+  +I+T+  
Sbjct: 453 ISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRLYHEMTGKGIAPSIYTFTT 512

Query: 618 MMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKG 677
           ++ G  +A  I D   LFNE+    ++ N + YN++I+ +C+ G+++ A + L+ M  KG
Sbjct: 513 LLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCEEGDMSKAFEFLKEMTEKG 572

Query: 678 ILPNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAE 737
           I+P+  +Y  LIHG+C  G   +AK  +D + K     N +CYT L+ G+C+ G+++ A 
Sbjct: 573 IVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEAL 632

Query: 738 STWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYNVLTNGF 797
           S   EM+   +  +   Y V+IDG  K  + +    LL +M + G+ PD V Y  + +  
Sbjct: 633 SVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMIDAK 692

Query: 798 CKANDMDNAFKVCDQMATEGLPVDEITYTTLVHG 813
            K  D   AF + D M  EG   +E+TYT +++G
Sbjct: 693 SKTGDFKEAFGIWDLMINEGCVPNEVTYTAVING 726

BLAST of CSPI04G06820 vs. TAIR10
Match: AT1G74580.1 (AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 311.2 bits (796), Expect = 1.9e-84
Identity = 176/609 (28.90%), Positives = 307/609 (50.41%), Query Frame = 1

Query: 193 AVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVF-RVMSEGACPDVFSFTNVIN 252
           AV+VF  +      P++ + N ++S LV +  F++  +V+ R+   G  PDV+SFT  + 
Sbjct: 95  AVNVFERMDFYDCEPTVFSYNAIMSVLVDSGYFDQAHKVYMRMRDRGITPDVYSFTIRMK 154

Query: 253 ALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFELKEKMTVKGVQP 312
           + CK  +   A+ L   M   G   NVV Y  ++ G  +       +EL  KM   GV  
Sbjct: 155 SFCKTSRPHAALRLLNNMSSQGCEMNVVAYCTVVGGFYEENFKAEGYELFGKMLASGVSL 214

Query: 313 NLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVFNNLIDGYCKMGNIEGALKIK 372
            L T+  L+  L K     +   +LD++I  G  PN+  +N  I G C+ G ++GA+++ 
Sbjct: 215 CLSTFNKLLRVLCKKGDVKECEKLLDKVIKRGVLPNLFTYNLFIQGLCQRGELDGAVRMV 274

Query: 373 DVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDNCYSVVHWLCKK 432
             +I +   P  +T  +L+ G CK+ + + AE  L ++++ GL        +++   CK 
Sbjct: 275 GCLIEQGPKPDVITYNNLIYGLCKNSKFQEAEVYLGKMVNEGLEPDSYTYNTLIAGYCKG 334

Query: 433 FRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLLEKGSPASKVTS 492
                A R     +   F P       L+ GLC +G+   A  L+   L KG   + +  
Sbjct: 335 GMVQLAERIVGDAVFNGFVPDQFTYRSLIDGLCHEGETNRALALFNEALGKGIKPNVILY 394

Query: 493 NALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVEGCFRLREEMTK 552
           N LI GL   G + EA+++  EM E+GL  +  T+N L+ G C  G V     L + M  
Sbjct: 395 NTLIKGLSNQGMILEAAQLANEMSEKGLIPEVQTFNILVNGLCKMGCVSDADGLVKVMIS 454

Query: 553 RGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMMDGYCKANRIED 612
           +G  PDI+T+N L+ G     K+++A+++ D    +G+  +++TY  +++G CK ++ ED
Sbjct: 455 KGYFPDIFTFNILIHGYSTQLKMENALEILDVMLDNGVDPDVYTYNSLLNGLCKTSKFED 514

Query: 613 VENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGILPNCATYSSLIH 672
           V   +  ++ K    N   +NI++++ C+   +  AL LLE MK+K + P+  T+ +LI 
Sbjct: 515 VMETYKTMVEKGCAPNLFTFNILLESLCRYRKLDEALGLLEEMKNKSVNPDAVTFGTLID 574

Query: 673 GVCNIGLVEDAKHLIDEMRKEGFV-PNVVCYTALIGGYCKLGQMDTAESTWLEMISFNIH 732
           G C  G ++ A  L  +M +   V  +   Y  +I  + +   +  AE  + EM+   + 
Sbjct: 575 GFCKNGDLDGAYTLFRKMEEAYKVSSSTPTYNIIIHAFTEKLNVTMAEKLFQEMVDRCLG 634

Query: 733 PNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYNVLTNGFCKANDMDNAFKV 792
           P+ +TY +M+DG+CK GN+      L++M E+G +P + T   + N  C  + +  A  +
Sbjct: 635 PDGYTYRLMVDGFCKTGNVNLGYKFLLEMMENGFIPSLTTLGRVINCLCVEDRVYEAAGI 694

Query: 793 CDQMATEGL 800
             +M  +GL
Sbjct: 695 IHRMVQKGL 703

BLAST of CSPI04G06820 vs. TAIR10
Match: AT5G65560.1 (AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 309.3 bits (791), Expect = 7.1e-84
Identity = 224/826 (27.12%), Positives = 386/826 (46.73%), Query Frame = 1

Query: 7   KISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLDSSKCSA 66
           K S    V  P +RR  C  S  P   +  +    S    H  +S +   +   S    +
Sbjct: 26  KFSTDVTVPSPVTRRQFC--SVSPLLRNLPEEESDSMSVPHRLLSILSKPNWHKSPSLKS 85

Query: 67  LLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSCCILILLLIRSKFIP 126
           ++  +SPS    LF    L  +P T LNF ++ S + +++ +++S   L+ LLI + ++ 
Sbjct: 86  MVSAISPSHVSSLF---SLDLDPKTALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVG 145

Query: 127 PA---RLLLIRLIDG---NLPVLNL-----DSEKFHIE----------IANAL--FGLTS 186
                RLL+I+  D     L VL+L       E+F ++          + N+L  FGL  
Sbjct: 146 VVFKIRLLMIKSCDSVGDALYVLDLCRKMNKDERFELKYKLIIGCYNTLLNSLARFGLVD 205

Query: 187 VVGRFEWTQAFDLLIHVYSTQFRNLGFSC-------AVDVFYLLARKGTFPSLKTCNFLL 246
            + +       D +     T  + +   C       A      +   G  P   T   L+
Sbjct: 206 EMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLI 265

Query: 247 SSLVKANEFEKCCEVFRVMSEGACP-DVFSFTNVINALCKGGKMENAIELFMKMEKLGIS 306
               +  + +   +VF  M    C  +  ++T++I+ LC   +++ A++LF+KM+     
Sbjct: 266 MGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECF 325

Query: 307 PNVVTYNCIINGLCQNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHV 366
           P V TY  +I  LC + R   A  L ++M   G++PN+ TY  LI+ L     F+K   +
Sbjct: 326 PTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEKAREL 385

Query: 367 LDEMIGAGFNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCK 426
           L +M+  G  PNV+ +N LI+GYCK G IE A+ + ++M S+ ++P + T   L++G+CK
Sbjct: 386 LGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCK 445

Query: 427 SDQIEHAENALEEILSSGLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLL 486
           S+ +  A   L ++L   +        S++   C+   + SA+R   +M  R   P    
Sbjct: 446 SN-VHKAMGVLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWT 505

Query: 487 LTMLVCGLCKDGKHLEATELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEML 546
            T ++  LCK  +  EA +L+  L +KG   + V   ALI G C AGK+ EA  ++++ML
Sbjct: 506 YTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLEKML 565

Query: 547 ERGLPMDRITYNALILGFCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLD 606
            +    + +T+NALI G C +GK++    L E+M K G+QP + T   L+  L   G  D
Sbjct: 566 SKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLLKDGDFD 625

Query: 607 DAIKLWDEFKASGLISNIHTYGIMMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIII 666
            A   + +  +SG   + HTY   +  YC+  R+ D E++  ++    +  +   Y+ +I
Sbjct: 626 HAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLFTYSSLI 685

Query: 667 KAHCQNGNVAAALQLLENMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLIDEM--RKEG 726
           K +   G    A  +L+ M+  G  P+  T+ SLI            KHL++    +++G
Sbjct: 686 KGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLI------------KHLLEMKYGKQKG 745

Query: 727 FVPNVVCYTALIGGYCKLGQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKAN 786
             P +   + ++       + DT      +M+  ++ PN  +Y  +I G C++GN+  A 
Sbjct: 746 SEPELCAMSNMM-------EFDTVVELLEKMVEHSVTPNAKSYEKLILGICEVGNLRVAE 805

Query: 787 NLLIKM-KESGIVPDVVTYNVLTNGFCKANDMDNAFKVCDQMATEG 799
            +   M +  GI P  + +N L +  CK    + A KV D M   G
Sbjct: 806 KVFDHMQRNEGISPSELVFNALLSCCCKLKKHNEAAKVVDDMICVG 826

BLAST of CSPI04G06820 vs. TAIR10
Match: AT1G12775.1 (AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 307.0 bits (785), Expect = 3.5e-83
Identity = 164/604 (27.15%), Positives = 314/604 (51.99%), Query Frame = 1

Query: 209 LKTCNFLLSSLVKANEFEKCCEV-FRVMSEGACPDVFSFTNVINALCKGGKMENAIELFM 268
           L+T    ++ +   NE   CCE  F   S+       S+ + +++   G K ++A++LF 
Sbjct: 22  LETGTLRIALINCPNELLFCCERGFSTFSDRN----LSYRDKLSSGLVGIKADDAVDLFR 81

Query: 269 KMEKLGISPNVVTYNCIINGLCQNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLN 328
            M +    P V+ +N + + + +  + +    L ++M  KG+  ++ T   +IN   +  
Sbjct: 82  DMIQSRPLPTVIDFNRLFSAIAKTKQYELVLALCKQMESKGIAHSIYTLSIMINCFCRCR 141

Query: 329 FFDKVNHVLDEMIGAGFNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLY 388
                   + +++  G+ P+ V+FN L++G C    +  AL++ D M+     PT +TL 
Sbjct: 142 KLSYAFSTMGKIMKLGYEPDTVIFNTLLNGLCLECRVSEALELVDRMVEMGHKPTLITLN 201

Query: 389 SLMQGFCKSDQIEHAENALEEILSSGLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSR 448
           +L+ G C + ++  A   ++ ++ +G   +      V++ +CK  +   A    + M  R
Sbjct: 202 TLVNGLCLNGKVSDAVVLIDRMVETGFQPNEVTYGPVLNVMCKSGQTALAMELLRKMEER 261

Query: 449 NFRPSDLLLTMLVCGLCKDGKHLEATELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEA 508
           N +   +  ++++ GLCKDG    A  L+  +  KG  A  +T N LI G C AG+  + 
Sbjct: 262 NIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDG 321

Query: 509 SRIVKEMLERGLPMDRITYNALILGFCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRG 568
           ++++++M++R +  + +T++ LI  F  EGK+    +L +EM +RGI P+  TYN L+ G
Sbjct: 322 AKLLRDMIKRKISPNVVTFSVLIDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDG 381

Query: 569 LCNVGKLDDAIKLWDEFKASGLISNIHTYGIMMDGYCKANRIEDVENLFNELLSKKMELN 628
            C   +L++AI++ D   + G   +I T+ I+++GYCKANRI+D   LF E+  + +  N
Sbjct: 382 FCKENRLEEAIQMVDLMISKGCDPDIMTFNILINGYCKANRIDDGLELFREMSLRGVIAN 441

Query: 629 SIVYNIIIKAHCQNGNVAAALQLLENMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLID 688
           ++ YN +++  CQ+G +  A +L + M S+ + P+  +Y  L+ G+C+ G +E A  +  
Sbjct: 442 TVTYNTLVQGFCQSGKLEVAKKLFQEMVSRRVRPDIVSYKILLDGLCDNGELEKALEIFG 501

Query: 689 EMRKEGFVPNVVCYTALIGGYCKLGQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLG 748
           ++ K     ++  Y  +I G C   ++D A   +  +    +  +   Y +MI   C+  
Sbjct: 502 KIEKSKMELDIGIYMIIIHGMCNASKVDDAWDLFCSLPLKGVKLDARAYNIMISELCRKD 561

Query: 749 NMEKANNLLIKMKESGIVPDVVTYNVLTNGFCKANDMDNAFKVCDQMATEGLPVDEITYT 808
           ++ KA+ L  KM E G  PD +TYN+L       +D   A ++ ++M + G P D  T  
Sbjct: 562 SLSKADILFRKMTEEGHAPDELTYNILIRAHLGDDDATTAAELIEEMKSSGFPADVSTVK 621

Query: 809 TLVH 812
            +++
Sbjct: 622 MVIN 621

BLAST of CSPI04G06820 vs. NCBI nr
Match: gi|449462543|ref|XP_004149000.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Cucumis sativus])

HSP 1 Score: 1680.2 bits (4350), Expect = 0.0e+00
Identity = 818/822 (99.51%), Positives = 820/822 (99.76%), Query Frame = 1

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60
           MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD
Sbjct: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60

Query: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSCCILILLLI 120
           SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHS C LILLLI
Sbjct: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120

Query: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240
           YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC
Sbjct: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240

Query: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFEL 300
           PDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFEL
Sbjct: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFEL 300

Query: 301 KEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVFNNLIDGYCK 360
           KEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIG+GFNPNVVVFNNLIDGYCK
Sbjct: 301 KEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGSGFNPNVVVFNNLIDGYCK 360

Query: 361 MGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDN 420
           MGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDN
Sbjct: 361 MGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDN 420

Query: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLL 480
           CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVE 540
           EKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVE
Sbjct: 481 EKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVE 540

Query: 541 GCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMM 600
           GCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMM
Sbjct: 541 GCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMM 600

Query: 601 DGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGIL 660
           +GYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGIL
Sbjct: 601 EGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGIL 660

Query: 661 PNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST 720
           PNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST
Sbjct: 661 PNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST 720

Query: 721 WLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYNVLTNGFCK 780
           WLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYNVLTNGFCK
Sbjct: 721 WLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYNVLTNGFCK 780

Query: 781 ANDMDNAFKVCDQMATEGLPVDEITYTTLVHGWNPPTITGQD 823
           ANDMDNAFKVCDQMATEGLPVDEITYTTLVHGWNPPTITGQD
Sbjct: 781 ANDMDNAFKVCDQMATEGLPVDEITYTTLVHGWNPPTITGQD 822

BLAST of CSPI04G06820 vs. NCBI nr
Match: gi|659102008|ref|XP_008451904.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 1589.7 bits (4115), Expect = 0.0e+00
Identity = 768/822 (93.43%), Positives = 794/822 (96.59%), Query Frame = 1

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60
           MHLTRFKI+KT PVLFPFSRRL CVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLS+SSLD
Sbjct: 1   MHLTRFKINKTIPVLFPFSRRLACVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSNSSLD 60

Query: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSCCILILLLI 120
           SSKCSALLPHLSP QFDQLFFSIGLKANPMTCLNFFYFAS+SFKFRFTIHS CILILLL+
Sbjct: 61  SSKCSALLPHLSPFQFDQLFFSIGLKANPMTCLNFFYFASDSFKFRFTIHSYCILILLLV 120

Query: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
            SKF+PPARLLLIRLIDGNLPVLN D +KFHIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 HSKFLPPARLLLIRLIDGNLPVLNSDFKKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240
           YSTQFRNLGF CA+DVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVF+VMSEG C
Sbjct: 181 YSTQFRNLGFGCAIDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFQVMSEGVC 240

Query: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFEL 300
           PDVFSFTNVINALCKGGKME A ELFMKMEKLGISPNVVTYNCIINGLCQNGRLD+AFEL
Sbjct: 241 PDVFSFTNVINALCKGGKMEKATELFMKMEKLGISPNVVTYNCIINGLCQNGRLDHAFEL 300

Query: 301 KEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVFNNLIDGYCK 360
           KEKMT++GVQPNLKTYGAL+NGLIKL  FDKVNH+LDEMIGAGF PNVVVFNNLIDGYCK
Sbjct: 301 KEKMTIEGVQPNLKTYGALVNGLIKLKCFDKVNHILDEMIGAGFYPNVVVFNNLIDGYCK 360

Query: 361 MGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDN 420
           MGNI+ AL+IKDVMISKNITPTSVTLY+L+QGFCKSDQIE AENALEEILS+GLSIHPD 
Sbjct: 361 MGNIKEALRIKDVMISKNITPTSVTLYTLLQGFCKSDQIEQAENALEEILSNGLSIHPDK 420

Query: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLL 480
           CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSD LLT+LVCGLCKDGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDPLLTILVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVE 540
           EKGSPASKVTSNALIHGLC AG LPEASRIVKEMLERGLP+DRITYNALILGFC EGKVE
Sbjct: 481 EKGSPASKVTSNALIHGLCEAGNLPEASRIVKEMLERGLPLDRITYNALILGFCKEGKVE 540

Query: 541 GCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMM 600
           GCFRL+EEMTKRGIQPDIYTYNFLLRGLCN GKLDDAIKLWDEFKASG ISN+HTYG+MM
Sbjct: 541 GCFRLKEEMTKRGIQPDIYTYNFLLRGLCNAGKLDDAIKLWDEFKASGPISNVHTYGVMM 600

Query: 601 DGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGIL 660
           DGYCKANRIEDVENLFNELLSKKMELNSIVYNIII+AHCQNGNVAAALQL ENMKSKGIL
Sbjct: 601 DGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIRAHCQNGNVAAALQLRENMKSKGIL 660

Query: 661 PNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST 720
           PNCATYSSLIHG+C+IGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST
Sbjct: 661 PNCATYSSLIHGMCDIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLGQMDTAEST 720

Query: 721 WLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYNVLTNGFCK 780
           WLEMISFNIHPNKFTYTVMIDGYCKLGNMEKA NLL KMKESGIVPDVVTYNVLTNGFCK
Sbjct: 721 WLEMISFNIHPNKFTYTVMIDGYCKLGNMEKAYNLLTKMKESGIVPDVVTYNVLTNGFCK 780

Query: 781 ANDMDNAFKVCDQMATEGLPVDEITYTTLVHGWNPPTITGQD 823
           ANDMDNAFKVCDQMATEGL VDEITYTTLVHGWN PTITGQD
Sbjct: 781 ANDMDNAFKVCDQMATEGLSVDEITYTTLVHGWNRPTITGQD 822

BLAST of CSPI04G06820 vs. NCBI nr
Match: gi|700198298|gb|KGN53456.1| (hypothetical protein Csa_4G055990 [Cucumis sativus])

HSP 1 Score: 1442.6 bits (3733), Expect = 0.0e+00
Identity = 706/710 (99.44%), Positives = 708/710 (99.72%), Query Frame = 1

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60
           MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD
Sbjct: 1   MHLTRFKISKTTPVLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLD 60

Query: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSCCILILLLI 120
           SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHS C LILLLI
Sbjct: 61  SSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLI 120

Query: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180
           RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV
Sbjct: 121 RSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQAFDLLIHV 180

Query: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240
           YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC
Sbjct: 181 YSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVFRVMSEGAC 240

Query: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFEL 300
           PDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFEL
Sbjct: 241 PDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNGRLDNAFEL 300

Query: 301 KEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVFNNLIDGYCK 360
           KEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIG+GFNPNVVVFNNLIDGYCK
Sbjct: 301 KEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGSGFNPNVVVFNNLIDGYCK 360

Query: 361 MGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDN 420
           MGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDN
Sbjct: 361 MGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSSGLSIHPDN 420

Query: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLL 480
           CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLL
Sbjct: 421 CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEATELWFRLL 480

Query: 481 EKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVE 540
           EKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVE
Sbjct: 481 EKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILGFCNEGKVE 540

Query: 541 GCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMM 600
           GCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMM
Sbjct: 541 GCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISNIHTYGIMM 600

Query: 601 DGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGIL 660
           +GYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGIL
Sbjct: 601 EGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLENMKSKGIL 660

Query: 661 PNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCK 711
           PNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCK
Sbjct: 661 PNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCK 710

BLAST of CSPI04G06820 vs. NCBI nr
Match: gi|659102010|ref|XP_008451905.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 1291.9 bits (3342), Expect = 0.0e+00
Identity = 618/659 (93.78%), Positives = 638/659 (96.81%), Query Frame = 1

Query: 164 VVGRFEWTQAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKAN 223
           VVGRFEWTQAFDLLIHVYSTQFRNLGF CA+DVFYLLARKGTFPSLKTCNFLLSSLVKAN
Sbjct: 11  VVGRFEWTQAFDLLIHVYSTQFRNLGFGCAIDVFYLLARKGTFPSLKTCNFLLSSLVKAN 70

Query: 224 EFEKCCEVFRVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNC 283
           EFEKCCEVF+VMSEG CPDVFSFTNVINALCKGGKME A ELFMKMEKLGISPNVVTYNC
Sbjct: 71  EFEKCCEVFQVMSEGVCPDVFSFTNVINALCKGGKMEKATELFMKMEKLGISPNVVTYNC 130

Query: 284 IINGLCQNGRLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAG 343
           IINGLCQNGRLD+AFELKEKMT++GVQPNLKTYGAL+NGLIKL  FDKVNH+LDEMIGAG
Sbjct: 131 IINGLCQNGRLDHAFELKEKMTIEGVQPNLKTYGALVNGLIKLKCFDKVNHILDEMIGAG 190

Query: 344 FNPNVVVFNNLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAE 403
           F PNVVVFNNLIDGYCKMGNI+ AL+IKDVMISKNITPTSVTLY+L+QGFCKSDQIE AE
Sbjct: 191 FYPNVVVFNNLIDGYCKMGNIKEALRIKDVMISKNITPTSVTLYTLLQGFCKSDQIEQAE 250

Query: 404 NALEEILSSGLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGL 463
           NALEEILS+GLSIHPD CYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSD LLT+LVCGL
Sbjct: 251 NALEEILSNGLSIHPDKCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDPLLTILVCGL 310

Query: 464 CKDGKHLEATELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDR 523
           CKDGKHLEATELWFRLLEKGSPASKVTSNALIHGLC AG LPEASRIVKEMLERGLP+DR
Sbjct: 311 CKDGKHLEATELWFRLLEKGSPASKVTSNALIHGLCEAGNLPEASRIVKEMLERGLPLDR 370

Query: 524 ITYNALILGFCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDE 583
           ITYNALILGFC EGKVEGCFRL+EEMTKRGIQPDIYTYNFLLRGLCN GKLDDAIKLWDE
Sbjct: 371 ITYNALILGFCKEGKVEGCFRLKEEMTKRGIQPDIYTYNFLLRGLCNAGKLDDAIKLWDE 430

Query: 584 FKASGLISNIHTYGIMMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGN 643
           FKASG ISN+HTYG+MMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIII+AHCQNGN
Sbjct: 431 FKASGPISNVHTYGVMMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIRAHCQNGN 490

Query: 644 VAAALQLLENMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTA 703
           VAAALQL ENMKSKGILPNCATYSSLIHG+C+IGLVEDAKHLIDEMRKEGFVPNVVCYTA
Sbjct: 491 VAAALQLRENMKSKGILPNCATYSSLIHGMCDIGLVEDAKHLIDEMRKEGFVPNVVCYTA 550

Query: 704 LIGGYCKLGQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESG 763
           LIGGYCKLGQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKA NLL KMKESG
Sbjct: 551 LIGGYCKLGQMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKAYNLLTKMKESG 610

Query: 764 IVPDVVTYNVLTNGFCKANDMDNAFKVCDQMATEGLPVDEITYTTLVHGWNPPTITGQD 823
           IVPDVVTYNVLTNGFCKANDMDNAFKVCDQMATEGL VDEITYTTLVHGWN PTITGQD
Sbjct: 611 IVPDVVTYNVLTNGFCKANDMDNAFKVCDQMATEGLSVDEITYTTLVHGWNRPTITGQD 669

BLAST of CSPI04G06820 vs. NCBI nr
Match: gi|645243357|ref|XP_008227937.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic [Prunus mume])

HSP 1 Score: 1003.0 bits (2592), Expect = 2.9e-289
Identity = 505/825 (61.21%), Positives = 612/825 (74.18%), Query Frame = 1

Query: 1   MHLTRFKISKTTPVLFPFSRRLVCVS-STQPHKEHHQDPPWQ-------SQDQLHLWVSS 60
           M L R  ISK T +LF  +R L CV+ + Q  KE  Q PP Q           LH WVSS
Sbjct: 5   MDLRRLSISKPT-LLFRINRPLTCVTCNLQRPKEPPQPPPLQVPKEPQPPNQSLHNWVSS 64

Query: 61  VLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSC 120
           +LS  SLDSSKC AL+P LS  +FD++F SI    NP T L+FFYFAS SFKF+FT  S 
Sbjct: 65  ILSKPSLDSSKCKALIPLLSSQEFDRVFCSISSNVNPKTALHFFYFASESFKFQFTARSF 124

Query: 121 CILILLLIRSKFIPPARLLLIRLIDGNLPVLNLDSEKFHIEIANALFGLTSVVGRFEWTQ 180
           C+L+ LLI S  + PARLLLIRLIDGN+PVL  +  + H+EIA A+  L +V  +    Q
Sbjct: 125 CVLVRLLILSNLVSPARLLLIRLIDGNVPVLYANHNQRHMEIAIAMLDLNTVSTQGLGVQ 184

Query: 181 AFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEKCCEVF 240
           A DLLIHVY TQF+N+GF  A+D F + ++KG FPSLKTCNFLLSSLVKANE  K  +VF
Sbjct: 185 ALDLLIHVYCTQFKNMGFGYAIDAFVIFSKKGVFPSLKTCNFLLSSLVKANELHKSYDVF 244

Query: 241 RVMSEGACPDVFSFTNVINALCKGGKMENAIELFMKMEKLGISPNVVTYNCIINGLCQNG 300
            VM  G  PDV+ FT  INA CKGGK+++AI LF KME LGI PNVVTYN II+GLC++ 
Sbjct: 245 EVMCRGVSPDVYLFTTAINAFCKGGKVDDAIGLFSKMEGLGIVPNVVTYNNIIHGLCKSK 304

Query: 301 RLDNAFELKEKMTVKGVQPNLKTYGALINGLIKLNFFDKVNHVLDEMIGAGFNPNVVVFN 360
           RL  AF+ K+KM    V P+L TY  LINGLIKL  F   N VL EM   GF PN VV+N
Sbjct: 305 RLVEAFQFKKKMIENNVGPSLITYSVLINGLIKLEKFHDANCVLKEMCNRGFVPNEVVYN 364

Query: 361 NLIDGYCKMGNIEGALKIKDVMISKNITPTSVTLYSLMQGFCKSDQIEHAENALEEILSS 420
            LIDGYCK GNI  ALKI+D M+S  +TP SVTL SL+QGFC+SDQ +HAE  L++I S 
Sbjct: 365 TLIDGYCKTGNISEALKIRDNMLSNGLTPNSVTLNSLLQGFCRSDQFDHAEQVLDKIFSG 424

Query: 421 GLSIHPDNCYSVVHWLCKKFRYHSAFRFTKMMLSRNFRPSDLLLTMLVCGLCKDGKHLEA 480
           GLSI+   C+SV+HWLC K R+ SA +FT  ML RNFRPSD LLT LV GLCKDGKH EA
Sbjct: 425 GLSINQAVCFSVIHWLCMKSRFDSALKFTTEMLLRNFRPSDSLLTTLVGGLCKDGKHSEA 484

Query: 481 TELWFRLLEKGSPASKVTSNALIHGLCGAGKLPEASRIVKEMLERGLPMDRITYNALILG 540
             LWFRL EKG  A+  TSNALIHGLC +  + E   ++K MLERGL +DRI+YN LILG
Sbjct: 485 LGLWFRLWEKGVAANTATSNALIHGLCESRSMQEVVMLLKPMLERGLVLDRISYNTLILG 544

Query: 541 FCNEGKVEGCFRLREEMTKRGIQPDIYTYNFLLRGLCNVGKLDDAIKLWDEFKASGLISN 600
            C EGKVE  F+L+EEM K+GI+PD YTYN L+ GLCN+GK+DDAIKLWDE +  GL+ N
Sbjct: 545 CCKEGKVEEGFKLKEEMAKQGIEPDTYTYNLLMHGLCNMGKVDDAIKLWDECENRGLVPN 604

Query: 601 IHTYGIMMDGYCKANRIEDVENLFNELLSKKMELNSIVYNIIIKAHCQNGNVAAALQLLE 660
           ++TYG+M+DGYC+A R+++ ENLF++L++K++ELNS+VYNI+I+A+C +GN+ AAL L  
Sbjct: 605 VYTYGVMIDGYCQAGRMKEGENLFSKLVNKEVELNSVVYNILIRAYCTDGNMTAALGLRC 664

Query: 661 NMKSKGILPNCATYSSLIHGVCNIGLVEDAKHLIDEMRKEGFVPNVVCYTALIGGYCKLG 720
           +MK KGI P+C TYSSLIHG+CNIG VEDAK L+DEMRK+G +PNVVCYTALI GYCKLG
Sbjct: 665 DMKKKGIQPSCGTYSSLIHGLCNIGNVEDAKCLLDEMRKDGLLPNVVCYTALIHGYCKLG 724

Query: 721 QMDTAESTWLEMISFNIHPNKFTYTVMIDGYCKLGNMEKANNLLIKMKESGIVPDVVTYN 780
           QMD   S +LEM S NI PNK TYTVMIDGY KLGNME+A  LL +M + GI PD VTYN
Sbjct: 725 QMDKVRSAFLEMSSDNIQPNKITYTVMIDGYSKLGNMEEATKLLCEMAKMGIAPDAVTYN 784

Query: 781 VLTNGFCKANDMDNAFKVCDQMATEGLPVDEITYTTLVHGWNPPT 818
            LTNGFCK   ++ AF+VCD M+++G+ +DEITYTTLVHG + PT
Sbjct: 785 ALTNGFCKERMVEEAFEVCDHMSSKGVGLDEITYTTLVHGLHQPT 828
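
The percentages in the HSP header lines follow directly from the counts they report (identical or positive positions divided by alignment length). A minimal sketch of that arithmetic, using only the counts shown in the two NCBI nr headers above; the dictionary layout and the helper name `percent` are illustrative choices, not part of any BLAST tool:

```python
# Counts copied from the two HSP headers above (identical / positive / aligned positions).
hsps = {
    "XP_008451905.1": {"identical": 618, "positive": 638, "aligned": 659},
    "XP_008227937.1": {"identical": 505, "positive": 612, "aligned": 825},
}


def percent(count, aligned):
    """Return count as a percentage of the alignment length, to two decimals."""
    return round(100.0 * count / aligned, 2)


# Map each accession to (percent identity, percent positives).
report = {
    name: (percent(h["identical"], h["aligned"]), percent(h["positive"], h["aligned"]))
    for name, h in hsps.items()
}

for name, (identity, positives) in report.items():
    print(f"{name}: identity {identity}%, positives {positives}%")
```

The computed values can be compared against the percentages printed in the headers as a quick consistency check.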

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
PP325_ARATH       3.9e-209  48.38     Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidop...
PP437_ARATH       5.1e-84   29.18     Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th...
PP120_ARATH       3.3e-83   28.90     Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis th...
PP445_ARATH       1.3e-82   27.12     Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN...
PPR39_ARATH       6.3e-82   27.15     Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidop...

Match Name        E-value   Identity  Description
A0A0A0L008_CUCSA  0.0e+00   99.44     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055990 PE=4 SV=1
D7TFE9_VITVI      1.4e-282  59.34     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0087g00360 PE=4 SV=...
A0A067E580_CITSI  7.0e-282  60.17     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003295mg PE=4 SV=1
V4V2M8_9ROSI      9.2e-282  60.17     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10000274mg PE=4 SV=1
M5WX26_PRUPE      8.0e-278  60.90     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001463mg PE=4 SV=1

Match Name        E-value   Identity  Description
AT4G19440.1       2.2e-210  48.38     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G59900.1       2.9e-85   29.18     Pentatricopeptide repeat (PPR) superfamily protein
AT1G74580.1       1.9e-84   28.90     Pentatricopeptide repeat (PPR) superfamily protein
AT5G65560.1       7.1e-84   27.12     Pentatricopeptide repeat (PPR) superfamily protein
AT1G12775.1       3.5e-83   27.15     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                        E-value   Identity  Description
gi|449462543|ref|XP_004149000.1|  0.0e+00   99.51     PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic ...
gi|659102008|ref|XP_008451904.1|  0.0e+00   93.43     PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic ...
gi|700198298|gb|KGN53456.1|       0.0e+00   99.44     hypothetical protein Csa_4G055990 [Cucumis sativus]
gi|659102010|ref|XP_008451905.1|  0.0e+00   93.78     PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic ...
gi|645243357|ref|XP_008227937.1|  2.9e-289  61.21     PREDICTED: pentatricopeptide repeat-containing protein At4g19440, chloroplastic ...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI04G06820.1  CSPI04G06820.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 211..237  score: 0.05
    coord: 457..483  score: 0.021
    coord: 595..622  score: 6.2E-6
    coord: 489..519  score: 1.

IPR002885  Pentatricopeptide repeat  PFAM  PF12854  PPR_1
    coord: 238..270  score: 8.

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 766..812  score: 1.3E-13
    coord: 276..325  score: 1.0E-17
    coord: 522..570  score: 2.8E-18
    coord: 696..745  score: 6.7E-17
    coord: 627..675  score: 2.4E-16
    coord: 346..395  score: 2.2

IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 734..768  score: 2.3E-10
    coord: 245..278  score: 1.1E-7
    coord: 315..348  score: 1.0E-5
    coord: 594..627  score: 5.9E-8
    coord: 349..382  score: 3.9E-7
    coord: 524..558  score: 4.3E-8
    coord: 665..698  score: 2.2E-8
    coord: 492..522  score: 2.7E-5
    coord: 279..312  score: 5.8E-9
    coord: 211..237  score: 1.9E-4
    coord: 699..732  score: 2.7E-8
    coord: 769..802  score: 1.8E-6
    coord: 559..590  score: 8.8E-7
    coord: 629..663  score: 2.

IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 697..731  score: 11.751
    coord: 417..451  score: 7.114
    coord: 522..556  score: 12.803
    coord: 627..661  score: 12.781
    coord: 767..801  score: 11.674
    coord: 557..591  score: 12.167
    coord: 452..486  score: 7.947
    coord: 242..276  score: 12.891
    coord: 208..238  score: 7.53
    coord: 277..311  score: 13.482
    coord: 382..416  score: 8.988
    coord: 347..381  score: 11.959
    coord: 662..696  score: 12.551
    coord: 592..626  score: 11.279
    coord: 487..521  score: 11.126
    coord: 732..766  score: 13.625
    coord: 312..346  score: 10

IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 453..753  score: 4.0E-5
    coord: 219..399  score: 2.0E-4
    coord: 400..420  score: 4.

IPR011990  Tetratricopeptide-like helical domain  unknown  SSF48452  TPR-like
    coord: 556..633  score: 1.29E-5
    coord: 414..426  score: 1.29E-5
    coord: 459..520  score: 1.29E-5
    coord: 221..304  score: 1.2

None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 451..808  score: 4.6E-296
    coord: 222..415  score: 4.6E-296
    coord: 7..82  score: 4.6E

None  No IPR available  PANTHER  PTHR24015:SF473  SUBFAMILY NOT NAMED
    coord: 451..808  score: 4.6E-296
    coord: 222..415  score: 4.6E-296
    coord: 7..82  score: 4.6E
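
Each InterPro hit in the alignment column is given as a "coord: start..end" range followed by a "score:" value. The following is a rough sketch of how such pairs could be pulled out of the text and turned into domain lengths. The three sample pairs are copied from the PS51375 (PROFILE) rows above; the function name `parse_hits` and the regular expression are assumptions made for illustration, not part of InterProScan or of this database.

```python
import re

# Three coord/score pairs copied from the PS51375 (PROFILE) rows above.
raw = """
coord: 697..731 score: 11.751
coord: 417..451 score: 7.114
coord: 522..556 score: 12.803
"""

# Hypothetical pattern: tolerates missing whitespace between a score and the next "coord".
HIT_PATTERN = re.compile(r"coord:\s*(\d+)\.\.(\d+)\s*score:\s*([0-9.Ee+-]+)")


def parse_hits(text):
    """Return a list of (start, end, score) tuples found in the text."""
    return [(int(s), int(e), float(sc)) for s, e, sc in HIT_PATTERN.findall(text)]


# Sort hits by start coordinate along the protein.
hits = sorted(parse_hits(raw))

# Coordinates are inclusive, so the length is end - start + 1.
lengths = [end - start + 1 for start, end, _ in hits]

for (start, end, score), length in zip(hits, lengths):
    print(f"{start}..{end}  length {length}  score {score}")
```

Sorting by start coordinate makes the tandem arrangement of the repeats along the protein easier to see. Scores that are cut off in the listing (for example "4.6E") would not parse as numbers and are best left out of any such calculation.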