CSPI02G04670 (gene) Wild cucumber (PI 183967)

Name: CSPI02G04670
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein
Location: Chr2: 3347672..3350136 (+)
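The location entry can be turned into a feature length with a few lines of code. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated on this page.

```python
import re


def parse_location(text):
    """Split a string such as 'Chr2 : 3347672 .. 3350136 (+)' into its parts.

    Returns (chromosome, start, end, strand). Whitespace around ':' and '..'
    is optional, so both spaced and compact spellings are accepted.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


chrom, start, end, strand = parse_location("Chr2 : 3347672 .. 3350136 (+)")

# Assumption: 1-based inclusive coordinates, hence the +1.
length = end - start + 1
print(chrom, strand, length)
```

Under that inclusive-coordinate assumption the feature spans 2465 bp on the plus strand of Chr2.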
The following sequences are available for this feature:

Gene sequence (with intron)

TGTTTGTTGAAGGTACCGTTTGAAGTTAGCGAGCTAATTCTTGAATTTTCAAAGCTCAATTTCTCAGAAGCTTACGGGAGATGAAGGAGAATGTGTCAATCCATTTTGGTTAGGATGAACATGAGAGAAAGCAAGCCAGCCCGACGAACAAGGATGAACTTGTAATAAGCTTCAAACTTTACAAGGATGAACTTTACTTTACCTTAGCCCTAACCTTTTTCTCTTCGTTTTCTGGGAAATTCCAGTCGATGTCTTCCGAGTGAAAAACATGAACGTCATCATTCGCCACTCCTCGGCGTCGAGAATTTTCCCGGCGAAAATTGATCGGAACTGCATTTTCTCGACGACCCATCTTCCGTTCTGCACTTACAATCCAACCTGTACGGCTCCCATATCGAACGATTTCGACCCACTAATCATCTCAGACCTTATTTCCAGGCAACAATGGTCAATCCTCAAATCCCATGTCAAATTCAAAAGTCCCATCGATTTTCTTCACCAATTGATGGGTTCGGGAGACGTTGACCCATTGCTTGTTCTCAGGTACTTCAATTGGTCCCGGAGAGAGCTTAATGTTAATTACAGCATTGAACTCATCTGCAGGCTCTTAAATTTATTGGCCAATGCTAAACATTACCCCAAAATTCGATCAATTCTCGATTCTTTTGTAAAGGGTGAAACAAACTGCTCGATTTCTTTGATTTTTCACTCGCTTTCGGTGTGTAGTGGTCAATTTTGTGCTAATTCGATAATTGCTGATATGTTGGTGTTAGCCTATGTGGAAAATTCGAAAACAGTTTTGGGATTGGAGGCGTTTAAGAGAGCTGGTGATTATAGGTATAAGTTATCTGTGTTATCTTGCAACCCATTGTTGAGTGCCTTGGTGAAGGAGAATGAATTTGGGGGTGTGGAATTTGTGTATAAAGAGATGATTAGAAGGAAAATTAGTCCTAATTTAATTACATTTAATACTGTGATTAATGGGTTATGTAAGGTTGGGAAATTGAATAAAGCTGGAGATGTTGTTGATGATATGAAAGTATGGGGATTCTGGCCCAATGTGGTTACCTATAACACTCTCATTGATGGATACTGTAAAATGGGTAGGGTTGGGAAGATGTATAAAGCTGATGCTATTTTGAAGGAAATGGTGGAAAATAAAGTCAGCCCAAATAGTGTAACATTTAACGTATTGATTGATGGGTTTTGTAAAGATGAAAACTTATCTGCTGCTTTGAAGGTGTTTGAGGAAATGCAGAGTCAGGGTCTGAAGCCTACTGTAGTAACATACAATTCGTTAGTAAATGGTTTGTGTAATGAAGGAAAACTGAATGAGGCTAAAGTTCTTCTTGATGAAATGTTGAGTTCAAACTTGAAGCCTAATGTTATTACTTATAATGCTCTGATTAATGGGTATTGTAAGAAGAAGTTGTTGGAGGAAGCTAGGGAGTTGTTTGATAATATTGGAAAACAAGGGCTGACTCCCAATGTTATAACATTCAATACATTACTTCATGGATATTGCAAGTTTGGGAAGATGGAAGAGGCATTTTTGCTCCAGAAGGTAATGTTAGAGAAGGGGTTTCTCCCAAATGCCTCAACCTACAATTGCCTTATTGTTGGTTTTTGTCGGGAGGGAAAAATGGAGGAAGTCAAGAATCTTTTAAATGAAATGCAATGTAGAGGTGTGAAAGCTGATACAGTGACGTATAATATTTTGATAAGTGCATGGTGTGAGAAAAAGGAGCCAAAAAAGGCAGCACGACTTATTGATGAGATGCTCGACAAGGGATTAAAACCAAGTCATTTGACATATAACATTTTGTTGAATGGCTATTGCATGGAAGGAAACTTAAGGGCTGCTTTGAATCTGAGGAAGCAAATGGAGAAAGAAGGAAGATGGGCAAATGTGGTAACGTATAATGTGTTGATTCAAGGTTATTGTAGAAAGGGGAAATTGGAAGATGCAAATGGACTTCTTAATGAGATGCTGGAGAAGG
GGTTAATACCGAATCGAACTACTTATGAGATAATTAAAGAGGAAATGATGGAGAAAGGATTTCTTCCTGATATAGAAGGTCATCTTTATCACGCCTCCCAGTAGAAGTTAATGCAACTGTACCGGTGGAATAAAATTGTACATACACTTTAAGTCGTTGATCTGGCCAACATGATCAAAGCTAGAGGACATTTCAAGGTAACTAATATATCCAAGGACCTTACTTGACCTTACTTGAACATGACCGATTCTAAAAATTAGCTACCCAGGAGTGGAACAAACACCATCTTTAGCATTTGAGAGCTTAGCTTATGATGTGATGTGATGTTCCATGTCTTTAAATCATATAGATGGTATATAACTCACTCAATCAGGTTAATTTAGTAGTCAATTGTTATACTGATGCTGATATACCTTCTCCATTGTTTAATGTAGCATCATAACATGCAGTTGCTTTATTACTTAG

mRNA sequence

ATGAACGTCATCATTCGCCACTCCTCGGCGTCGAGAATTTTCCCGGCGAAAATTGATCGGAACTGCATTTTCTCGACGACCCATCTTCCGTTCTGCACTTACAATCCAACCTGTACGGCTCCCATATCGAACGATTTCGACCCACTAATCATCTCAGACCTTATTTCCAGGCAACAATGGTCAATCCTCAAATCCCATGTCAAATTCAAAAGTCCCATCGATTTTCTTCACCAATTGATGGGTTCGGGAGACGTTGACCCATTGCTTGTTCTCAGGTACTTCAATTGGTCCCGGAGAGAGCTTAATGTTAATTACAGCATTGAACTCATCTGCAGGCTCTTAAATTTATTGGCCAATGCTAAACATTACCCCAAAATTCGATCAATTCTCGATTCTTTTGTAAAGGGTGAAACAAACTGCTCGATTTCTTTGATTTTTCACTCGCTTTCGGTGTGTAGTGGTCAATTTTGTGCTAATTCGATAATTGCTGATATGTTGGTGTTAGCCTATGTGGAAAATTCGAAAACAGTTTTGGGATTGGAGGCGTTTAAGAGAGCTGGTGATTATAGGTATAAGTTATCTGTGTTATCTTGCAACCCATTGTTGAGTGCCTTGGTGAAGGAGAATGAATTTGGGGGTGTGGAATTTGTGTATAAAGAGATGATTAGAAGGAAAATTAGTCCTAATTTAATTACATTTAATACTGTGATTAATGGGTTATGTAAGGTTGGGAAATTGAATAAAGCTGGAGATGTTGTTGATGATATGAAAGTATGGGGATTCTGGCCCAATGTGGTTACCTATAACACTCTCATTGATGGATACTGTAAAATGGGTAGGGTTGGGAAGATGTATAAAGCTGATGCTATTTTGAAGGAAATGGTGGAAAATAAAGTCAGCCCAAATAGTGTAACATTTAACGTATTGATTGATGGGTTTTGTAAAGATGAAAACTTATCTGCTGCTTTGAAGGTGTTTGAGGAAATGCAGAGTCAGGGTCTGAAGCCTACTGTAGTAACATACAATTCGTTAGTAAATGGTTTGTGTAATGAAGGAAAACTGAATGAGGCTAAAGTTCTTCTTGATGAAATGTTGAGTTCAAACTTGAAGCCTAATGTTATTACTTATAATGCTCTGATTAATGGGTATTGTAAGAAGAAGTTGTTGGAGGAAGCTAGGGAGTTGTTTGATAATATTGGAAAACAAGGGCTGACTCCCAATGTTATAACATTCAATACATTACTTCATGGATATTGCAAGTTTGGGAAGATGGAAGAGGCATTTTTGCTCCAGAAGGTAATGTTAGAGAAGGGGTTTCTCCCAAATGCCTCAACCTACAATTGCCTTATTGTTGGTTTTTGTCGGGAGGGAAAAATGGAGGAAGTCAAGAATCTTTTAAATGAAATGCAATGTAGAGGTGTGAAAGCTGATACAGTGACGTATAATATTTTGATAAGTGCATGGTGTGAGAAAAAGGAGCCAAAAAAGGCAGCACGACTTATTGATGAGATGCTCGACAAGGGATTAAAACCAAGTCATTTGACATATAACATTTTGTTGAATGGCTATTGCATGGAAGGAAACTTAAGGGCTGCTTTGAATCTGAGGAAGCAAATGGAGAAAGAAGGAAGATGGGCAAATGTGGTAACGTATAATGTGTTGATTCAAGGTTATTGTAGAAAGGGGAAATTGGAAGATGCAAATGGACTTCTTAATGAGATGCTGGAGAAGGGGTTAATACCGAATCGAACTACTTATGAGATAATTAAAGAGGAAATGATGGAGAAAGGATTTCTTCCTGATATAGAAGGTCATCTTTATCACGCCTCCCAGTAG

Coding sequence (CDS)

ATGAACGTCATCATTCGCCACTCCTCGGCGTCGAGAATTTTCCCGGCGAAAATTGATCGGAACTGCATTTTCTCGACGACCCATCTTCCGTTCTGCACTTACAATCCAACCTGTACGGCTCCCATATCGAACGATTTCGACCCACTAATCATCTCAGACCTTATTTCCAGGCAACAATGGTCAATCCTCAAATCCCATGTCAAATTCAAAAGTCCCATCGATTTTCTTCACCAATTGATGGGTTCGGGAGACGTTGACCCATTGCTTGTTCTCAGGTACTTCAATTGGTCCCGGAGAGAGCTTAATGTTAATTACAGCATTGAACTCATCTGCAGGCTCTTAAATTTATTGGCCAATGCTAAACATTACCCCAAAATTCGATCAATTCTCGATTCTTTTGTAAAGGGTGAAACAAACTGCTCGATTTCTTTGATTTTTCACTCGCTTTCGGTGTGTAGTGGTCAATTTTGTGCTAATTCGATAATTGCTGATATGTTGGTGTTAGCCTATGTGGAAAATTCGAAAACAGTTTTGGGATTGGAGGCGTTTAAGAGAGCTGGTGATTATAGGTATAAGTTATCTGTGTTATCTTGCAACCCATTGTTGAGTGCCTTGGTGAAGGAGAATGAATTTGGGGGTGTGGAATTTGTGTATAAAGAGATGATTAGAAGGAAAATTAGTCCTAATTTAATTACATTTAATACTGTGATTAATGGGTTATGTAAGGTTGGGAAATTGAATAAAGCTGGAGATGTTGTTGATGATATGAAAGTATGGGGATTCTGGCCCAATGTGGTTACCTATAACACTCTCATTGATGGATACTGTAAAATGGGTAGGGTTGGGAAGATGTATAAAGCTGATGCTATTTTGAAGGAAATGGTGGAAAATAAAGTCAGCCCAAATAGTGTAACATTTAACGTATTGATTGATGGGTTTTGTAAAGATGAAAACTTATCTGCTGCTTTGAAGGTGTTTGAGGAAATGCAGAGTCAGGGTCTGAAGCCTACTGTAGTAACATACAATTCGTTAGTAAATGGTTTGTGTAATGAAGGAAAACTGAATGAGGCTAAAGTTCTTCTTGATGAAATGTTGAGTTCAAACTTGAAGCCTAATGTTATTACTTATAATGCTCTGATTAATGGGTATTGTAAGAAGAAGTTGTTGGAGGAAGCTAGGGAGTTGTTTGATAATATTGGAAAACAAGGGCTGACTCCCAATGTTATAACATTCAATACATTACTTCATGGATATTGCAAGTTTGGGAAGATGGAAGAGGCATTTTTGCTCCAGAAGGTAATGTTAGAGAAGGGGTTTCTCCCAAATGCCTCAACCTACAATTGCCTTATTGTTGGTTTTTGTCGGGAGGGAAAAATGGAGGAAGTCAAGAATCTTTTAAATGAAATGCAATGTAGAGGTGTGAAAGCTGATACAGTGACGTATAATATTTTGATAAGTGCATGGTGTGAGAAAAAGGAGCCAAAAAAGGCAGCACGACTTATTGATGAGATGCTCGACAAGGGATTAAAACCAAGTCATTTGACATATAACATTTTGTTGAATGGCTATTGCATGGAAGGAAACTTAAGGGCTGCTTTGAATCTGAGGAAGCAAATGGAGAAAGAAGGAAGATGGGCAAATGTGGTAACGTATAATGTGTTGATTCAAGGTTATTGTAGAAAGGGGAAATTGGAAGATGCAAATGGACTTCTTAATGAGATGCTGGAGAAGGGGTTAATACCGAATCGAACTACTTATGAGATAATTAAAGAGGAAATGATGGAGAAAGGATTTCTTCCTGATATAGAAGGTCATCTTTATCACGCCTCCCAGTAG
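
The coding sequence above can be translated into protein to cross-check it against the query residues shown in the BLAST alignments. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the `translate` helper is hypothetical and is not part of the database. Only the first 18 and last 24 nucleotides of the CDS are used as examples.

```python
# Standard genetic code, laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons of the CDS listed above.
n_terminus = translate("ATGAACGTCATCATTCGC")
# Last eight codons of the CDS, including the TAG stop codon.
c_terminus = translate("CATCTTTATCACGCCTCCCAGTAG")
print(n_terminus, c_terminus)
```

The C-terminal fragment translates to `HLYHASQ*`, which agrees with the `HLYHAS` residues at query positions 606 to 611 in the alignments.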
BLAST of CSPI02G04670 vs. Swiss-Prot
Match: PPR27_ARATH (Pentatricopeptide repeat-containing protein At1g09820 OS=Arabidopsis thaliana GN=At1g09820 PE=2 SV=1)

HSP 1 Score: 677.6 bits (1747), Expect = 1.3e-193
Identity = 329/581 (56.63%), Positives = 432/581 (74.35%), Query Frame = 1

Query: 32  CTYNPTCT-APISNDFDPLIISDLISRQQWSILKSHVKFKSPIDFLHQLMGSGDVDPLLV 91
           C+ + T T +P    +D  +I+DLI +Q WS L  HV   +P +   QL+ S ++DP L 
Sbjct: 26  CSSSSTITGSPCPPRYDVAVIADLIEKQHWSKLGVHVTDINPNELFRQLISS-ELDPDLC 85

Query: 92  LRYFNWSRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSFVKGETNCSISLIFHSLS 151
           LRY++W  +  +++ S+EL  +LL+ LANAK Y KIRS LD FV+  ++  +  IFH++S
Sbjct: 86  LRYYSWLVKNSDISVSLELTFKLLHSLANAKRYSKIRSFLDGFVRNGSDHQVHSIFHAIS 145

Query: 152 VCSGQFCANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSALVKENE 211
           +C    C NSIIADMLVLAY  NS+  LG EAFKR+G Y YKLS LSC PL+ AL+KEN 
Sbjct: 146 MCDN-VCVNSIIADMLVLAYANNSRFELGFEAFKRSGYYGYKLSALSCKPLMIALLKENR 205

Query: 212 FGGVEFVYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPNVVTYNT 271
              VE+VYKEMIRRKI PN+ TFN VIN LCK GK+NKA DV++DMKV+G  PNVV+YNT
Sbjct: 206 SADVEYVYKEMIRRKIQPNVFTFNVVINALCKTGKMNKARDVMEDMKVYGCSPNVVSYNT 265

Query: 272 LIDGYCKMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALKVFEEMQ 331
           LIDGYCK+G  GKMYKADA+LKEMVEN VSPN  TFN+LIDGF KD+NL  ++KVF+EM 
Sbjct: 266 LIDGYCKLGGNGKMYKADAVLKEMVENDVSPNLTTFNILIDGFWKDDNLPGSMKVFKEML 325

Query: 332 SQGLKPTVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLE 391
            Q +KP V++YNSL+NGLCN GK++EA  + D+M+S+ ++PN+ITYNALING+CK  +L+
Sbjct: 326 DQDVKPNVISYNSLINGLCNGGKISEAISMRDKMVSAGVQPNLITYNALINGFCKNDMLK 385

Query: 392 EARELFDNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNASTYNCLI 451
           EA ++F ++  QG  P    +N L+  YCK GK+++ F L++ M  +G +P+  TYNCLI
Sbjct: 386 EALDMFGSVKGQGAVPTTRMYNMLIDAYCKLGKIDDGFALKEEMEREGIVPDVGTYNCLI 445

Query: 452 VGFCREGKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEMLDKGLK 511
            G CR G +E  K L +++  +G+  D VT++IL+  +C K E +KAA L+ EM   GLK
Sbjct: 446 AGLCRNGNIEAAKKLFDQLTSKGL-PDLVTFHILMEGYCRKGESRKAAMLLKEMSKMGLK 505

Query: 512 PSHLTYNILLNGYCMEGNLRAALNLRKQMEKEGRW-ANVVTYNVLIQGYCRKGKLEDANG 571
           P HLTYNI++ GYC EGNL+AA N+R QMEKE R   NV +YNVL+QGY +KGKLEDAN 
Sbjct: 506 PRHLTYNIVMKGYCKEGNLKAATNMRTQMEKERRLRMNVASYNVLLQGYSQKGKLEDANM 565

Query: 572 LLNEMLEKGLIPNRTTYEIIKEEMMEKGFLPDIEGHLYHAS 611
           LLNEMLEKGL+PNR TYEI+KEEM+++GF+PDIEGHL++ S
Sbjct: 566 LLNEMLEKGLVPNRITYEIVKEEMVDQGFVPDIEGHLFNVS 603
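
The percentages on each HSP summary line are the identity and positive counts divided by the alignment length (gaps included). The sketch below parses such a line and recomputes both values; `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption based on the layout of the lines in this report rather than on any formal BLAST output specification.

```python
import re

# 'Posi?tives' tolerates a missing 'i' in the label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute the percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_pct": (float(ident_pct), float(pos_pct)),
    }


stats = parse_hsp_stats(
    "Identity = 329/581 (56.63%), Positives = 432/581 (74.35%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the PPR27_ARATH hit the recomputed values match the reported 56.63% identity and 74.35% positives.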

BLAST of CSPI02G04670 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 324.3 bits (830), Expect = 2.8e-87
Identity = 179/507 (35.31%), Positives = 285/507 (56.21%), Query Frame = 1

Query: 86  DPLLVLRYFNWSRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSFVKGET--NCSIS 145
           D  L+L++ NW+    +  +++   C  L++L   K Y K   IL   V  +T  +   S
Sbjct: 61  DQALILKFLNWANP--HQFFTLRCKCITLHILTKFKLY-KTAQILAEDVAAKTLDDEYAS 120

Query: 146 LIFHSLSVCSGQFCANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLS 205
           L+F SL        + S + D++V +Y   S     L     A  + +   VLS N +L 
Sbjct: 121 LVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLD 180

Query: 206 ALVK-ENEFGGVEFVYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFW 265
           A ++ +      E V+KEM+  ++SPN+ T+N +I G C  G ++ A  + D M+  G  
Sbjct: 181 ATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCL 240

Query: 266 PNVVTYNTLIDGYCKMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAA 325
           PNVVTYNTLIDGYCK+ ++   +K   +L+ M    + PN +++NV+I+G C++  +   
Sbjct: 241 PNVVTYNTLIDGYCKLRKIDDGFK---LLRSMALKGLEPNLISYNVVINGLCREGRMKEV 300

Query: 326 LKVFEEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALING 385
             V  EM  +G     VTYN+L+ G C EG  ++A V+  EML   L P+VITY +LI+ 
Sbjct: 301 SFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHS 360

Query: 386 YCKKKLLEEARELFDNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPN 445
            CK   +  A E  D +  +GL PN  T+ TL+ G+ + G M EA+ + + M + GF P+
Sbjct: 361 MCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPS 420

Query: 446 ASTYNCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLID 505
             TYN LI G C  GKME+   +L +M+ +G+  D V+Y+ ++S +C   +  +A R+  
Sbjct: 421 VVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKR 480

Query: 506 EMLDKGLKPSHLTYNILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKG 565
           EM++KG+KP  +TY+ L+ G+C +   + A +L ++M + G   +  TY  LI  YC +G
Sbjct: 481 EMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEG 540

Query: 566 KLEDANGLLNEMLEKGLIPNRTTYEII 590
            LE A  L NEM+EKG++P+  TY ++
Sbjct: 541 DLEKALQLHNEMVEKGVLPDVVTYSVL 561

BLAST of CSPI02G04670 vs. Swiss-Prot
Match: PPR12_ARATH (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 1.7e-79
Identity = 174/571 (30.47%), Positives = 309/571 (54.12%), Query Frame = 1

Query: 30  PFCTYNPTCTAPISNDFDPLIISDLISRQQWSILKS----HVKFKSPIDFLHQLMGSGDV 89
           PF  Y+P   +    +F   I + +  R+   + +S      KFK+  D L  ++     
Sbjct: 42  PFPDYSPKKASVRDTEFVHQITNVIKLRRAEPLRRSLKPYECKFKT--DHLIWVLMKIKC 101

Query: 90  DPLLVLRYFNWSRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSF-VKGETNCSISL 149
           D  LVL +F+W+R   + N  +E +C +++L   +K     +S++ SF  + + N + S 
Sbjct: 102 DYRLVLDFFDWARSRRDSN--LESLCIVIHLAVASKDLKVAQSLISSFWERPKLNVTDSF 161

Query: 150 I--FHSLSVCSGQFCANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLL 209
           +  F  L      + ++  + D+     V+          F++  +Y   LSV SCN  L
Sbjct: 162 VQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYL 221

Query: 210 SALVKE-NEFGGVEFVYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGF 269
           + L K+  +      V++E     +  N+ ++N VI+ +C++G++ +A  ++  M++ G+
Sbjct: 222 TRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGY 281

Query: 270 WPNVVTYNTLIDGYCKMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSA 329
            P+V++Y+T+++GYC+ G + K++K   +++ M    + PNS  +  +I   C+   L+ 
Sbjct: 282 TPDVISYSTVVNGYCRFGELDKVWK---LIEVMKRKGLKPNSYIYGSIIGLLCRICKLAE 341

Query: 330 ALKVFEEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALIN 389
           A + F EM  QG+ P  V Y +L++G C  G +  A     EM S ++ P+V+TY A+I+
Sbjct: 342 AEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIIS 401

Query: 390 GYCKKKLLEEARELFDNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLP 449
           G+C+   + EA +LF  +  +GL P+ +TF  L++GYCK G M++AF +   M++ G  P
Sbjct: 402 GFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSP 461

Query: 450 NASTYNCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLI 509
           N  TY  LI G C+EG ++    LL+EM   G++ +  TYN +++  C+    ++A +L+
Sbjct: 462 NVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLV 521

Query: 510 DEMLDKGLKPSHLTYNILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRK 569
            E    GL    +TY  L++ YC  G +  A  + K+M  +G    +VT+NVL+ G+C  
Sbjct: 522 GEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLH 581

Query: 570 GKLEDANGLLNEMLEKGLIPNRTTYEIIKEE 593
           G LED   LLN ML KG+ PN TT+  + ++
Sbjct: 582 GMLEDGEKLLNWMLAKGIAPNATTFNSLVKQ 605

BLAST of CSPI02G04670 vs. Swiss-Prot
Match: PPR99_ARATH (Pentatricopeptide repeat-containing protein At1g63130, mitochondrial OS=Arabidopsis thaliana GN=At1g63130 PE=2 SV=1)

HSP 1 Score: 297.0 bits (759), Expect = 4.8e-79
Identity = 150/453 (33.11%), Positives = 259/453 (57.17%), Query Frame = 1

Query: 165 MLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEFVYKEMIRR 224
           +L+  +   S+  L L    +     Y+  +++ N LL+     N       +  +M+  
Sbjct: 121 ILINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLNSLLNGFCHGNRISDAVSLVGQMVEM 180

Query: 225 KISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYCKMGRVGKM 284
              P+  TFNT+I+GL +  + ++A  +VD M V G  P++VTY  +++G CK G +   
Sbjct: 181 GYQPDSFTFNTLIHGLFRHNRASEAVALVDRMVVKGCQPDLVTYGIVVNGLCKRGDIDL- 240

Query: 285 YKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKPTVVTYNSL 344
             A ++LK+M + K+ P  V +N +ID  C  +N++ AL +F EM ++G++P VVTYNSL
Sbjct: 241 --ALSLLKKMEQGKIEPGVVIYNTIIDALCNYKNVNDALNLFTEMDNKGIRPNVVTYNSL 300

Query: 345 VNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDNIGKQGL 404
           +  LCN G+ ++A  LL +M+   + PNV+T++ALI+ + K+  L EA +L+D + K+ +
Sbjct: 301 IRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSI 360

Query: 405 TPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCREGKMEEVKN 464
            P++ T+++L++G+C   +++EA  + ++M+ K   PN  TYN LI GFC+  +++E   
Sbjct: 361 DPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKRVDEGME 420

Query: 465 LLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTYNILLNGYC 524
           L  EM  RG+  +TVTY  LI  + + +E   A  +  +M+  G+ P  +TY+ILL+G C
Sbjct: 421 LFREMSQRGLVGNTVTYTTLIHGFFQARECDNAQIVFKQMVSDGVLPDIMTYSILLDGLC 480

Query: 525 MEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRT 584
             G +  AL + + +++     ++ TYN++I+G C+ GK+ED   L   +  KG+ PN  
Sbjct: 481 NNGKVETALVVFEYLQRSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVV 540

Query: 585 TY----------------EIIKEEMMEKGFLPD 602
           TY                + +  EM E+G LPD
Sbjct: 541 TYTTMMSGFCRKGLKEEADALFREMKEEGPLPD 570

BLAST of CSPI02G04670 vs. Swiss-Prot
Match: PP432_ARATH (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 293.9 bits (751), Expect = 4.1e-78
Identity = 161/505 (31.88%), Positives = 270/505 (53.47%), Query Frame = 1

Query: 89  LVLRYFNWSRRE--LNVNYSIELICRLLNLLANAKHYPKIRSILD--SFVKGETNCSISL 148
           L L++  W  ++  L  ++ ++L+C   ++L  A+ Y   R IL   S + G+++     
Sbjct: 52  LALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSFVFGA 111

Query: 149 IFHSLSVCSGQFCANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSA 208
           +  +  +C+    +N  + D+L+  Y+        LE F+  G Y +  SV +CN +L +
Sbjct: 112 LMTTYRLCN----SNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGS 171

Query: 209 LVKENEFGGVEFVYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPN 268
           +VK  E   V    KEM++RKI P++ TFN +IN LC  G   K+  ++  M+  G+ P 
Sbjct: 172 VVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPT 231

Query: 269 VVTYNTLIDGYCKMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALK 328
           +VTYNT++  YCK GR      A  +L  M    V  +  T+N+LI   C+   ++    
Sbjct: 232 IVTYNTVLHWYCKKGR---FKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYL 291

Query: 329 VFEEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYC 388
           +  +M+ + + P  VTYN+L+NG  NEGK+  A  LL+EMLS  L PN +T+NALI+G+ 
Sbjct: 292 LLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHI 351

Query: 389 KKKLLEEARELFDNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNAS 448
            +   +EA ++F  +  +GLTP+ +++  LL G CK  + + A      M   G      
Sbjct: 352 SEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRI 411

Query: 449 TYNCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEM 508
           TY  +I G C+ G ++E   LLNEM   G+  D VTY+ LI+ +C+    K A  ++  +
Sbjct: 412 TYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRI 471

Query: 509 LDKGLKPSHLTYNILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKL 568
              GL P+ + Y+ L+   C  G L+ A+ + + M  EG   +  T+NVL+   C+ GK+
Sbjct: 472 YRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKV 531

Query: 569 EDANGLLNEMLEKGLIPNRTTYEII 590
            +A   +  M   G++PN  +++ +
Sbjct: 532 AEAEEFMRCMTSDGILPNTVSFDCL 549

BLAST of CSPI02G04670 vs. TrEMBL
Match: W9SHF9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_001609 PE=4 SV=1)

HSP 1 Score: 776.5 bits (2004), Expect = 2.3e-221
Identity = 370/569 (65.03%), Positives = 461/569 (81.02%), Query Frame = 1

Query: 38  CTAPISNDFDPLIISDLISRQQWSILK-SHVKFKSPIDFLHQLMGSGDVDPLLVLRYFNW 97
           CTA IS+ F+  ++S+LI++Q WS LK +H+   +P   L QL  S +VDP L+ RYFNW
Sbjct: 26  CTASISHTFNAPLVSELIAKQHWSELKRTHLTDSNPTKLLQQLFES-EVDPDLIFRYFNW 85

Query: 98  SRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSFVKGETNCSISLIFHSLSVCSGQF 157
           S +ELN+++++EL CRLL+ LA AK Y KIR+ LD FVK     S   IFHSLS+ S +F
Sbjct: 86  SHKELNISHTLELTCRLLHSLATAKKYSKIRAFLDGFVKRNVEHSNFTIFHSLSISSDRF 145

Query: 158 CANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEF 217
           C +SII DMLVLAY +N K+ L  EAFKRAGDY +KLS LS NPLL ALVKEN+ G VEF
Sbjct: 146 CTSSIIVDMLVLAYAKNLKSHLAFEAFKRAGDYGFKLSALSLNPLLCALVKENKIGQVEF 205

Query: 218 VYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYC 277
           VYKEMIRRKI+ +L TF+ V+NGLCK GKLNKAGD++ DMK +G  PNVVTYN LIDGYC
Sbjct: 206 VYKEMIRRKITGDLYTFSIVVNGLCKAGKLNKAGDIIQDMKAFGVLPNVVTYNILIDGYC 265

Query: 278 KMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKP 337
           KMG++GKMYKA+AIL+EMV NK+ PN +T+N+LI+GFCKDEN++A +KVFEEMQ QGLKP
Sbjct: 266 KMGKLGKMYKAEAILREMVANKICPNEITYNILINGFCKDENVAAGMKVFEEMQRQGLKP 325

Query: 338 TVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELF 397
            VVTYNSL++GLC EGK  EA  L DEML   LKPNV+T+NAL+ G+CKKK++ EARE+F
Sbjct: 326 NVVTYNSLIDGLCTEGKHEEACDLKDEMLGCGLKPNVVTFNALVKGFCKKKMIREAREVF 385

Query: 398 DNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCRE 457
           D+IG QGL PN+IT+NTL+  YCK G M+EAFL + +M EKG LP+ASTYNCLI GF R 
Sbjct: 386 DDIGVQGLAPNIITYNTLIDAYCKNGMMDEAFLSRSLMWEKGVLPDASTYNCLIAGFGRH 445

Query: 458 GKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTY 517
           G ME+ +++L+EMQ +G+KAD +TYNILI A+C+K E +KA R++ ++ DKGL PSHLTY
Sbjct: 446 GDMEKARDILDEMQNKGLKADLITYNILIDAFCKKGETRKATRILKDVFDKGLSPSHLTY 505

Query: 518 NILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLE 577
           N L++GYC +GNL+AALN+R QMEK+G+ ANV TYNVLI+G+C KGKLEDANGLLNEMLE
Sbjct: 506 NTLMDGYCKQGNLKAALNVRAQMEKDGKRANVATYNVLIKGFCEKGKLEDANGLLNEMLE 565

Query: 578 KGLIPNRTTYEIIKEEMMEKGFLPDIEGH 606
           KGL PN+ TYEI+KEEMM+KGF+PDIEGH
Sbjct: 566 KGLNPNQITYEIVKEEMMDKGFVPDIEGH 593

BLAST of CSPI02G04670 vs. TrEMBL
Match: W9RZ47_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_005698 PE=4 SV=1)

HSP 1 Score: 776.5 bits (2004), Expect = 2.3e-221
Identity = 370/569 (65.03%), Positives = 461/569 (81.02%), Query Frame = 1

Query: 38  CTAPISNDFDPLIISDLISRQQWSILK-SHVKFKSPIDFLHQLMGSGDVDPLLVLRYFNW 97
           CTA IS+ F+  ++S+LI++Q WS LK +H+   +P   L QL  S +VDP L+ RYFNW
Sbjct: 25  CTASISHTFNAPLVSELIAKQHWSELKRTHLTDSNPTKLLQQLFES-EVDPDLIFRYFNW 84

Query: 98  SRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSFVKGETNCSISLIFHSLSVCSGQF 157
           S +ELN+++++EL CRLL+ LA AK Y KIR+ LD FVK     S   IFHSLS+ S +F
Sbjct: 85  SHKELNISHTLELTCRLLHSLATAKKYSKIRAFLDGFVKRNVEHSNFTIFHSLSISSDRF 144

Query: 158 CANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEF 217
           C +SII DMLVLAY +N K+ L  EAFKRAGDY +KLS LS NPLL ALVKEN+ G VEF
Sbjct: 145 CTSSIIVDMLVLAYAKNLKSHLAFEAFKRAGDYGFKLSALSLNPLLCALVKENKIGQVEF 204

Query: 218 VYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYC 277
           VYKEMIRRKI+ +L TF+ V+NGLCK GKLNKAGD++ DMK +G  PNVVTYN LIDGYC
Sbjct: 205 VYKEMIRRKITGDLYTFSIVVNGLCKAGKLNKAGDIIQDMKAFGVLPNVVTYNILIDGYC 264

Query: 278 KMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKP 337
           KMG++GKMYKA+AIL+EMV NK+ PN +T+N+LI+GFCKDEN++A +KVFEEMQ QGLKP
Sbjct: 265 KMGKLGKMYKAEAILREMVANKICPNEITYNILINGFCKDENVAAGMKVFEEMQRQGLKP 324

Query: 338 TVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELF 397
            VVTYNSL++GLC EGK  EA  L DEML   LKPNV+T+NAL+ G+CKKK++ EARE+F
Sbjct: 325 NVVTYNSLIDGLCTEGKHEEACDLKDEMLGCGLKPNVVTFNALVKGFCKKKMIREAREVF 384

Query: 398 DNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCRE 457
           D+IG QGL PN+IT+NTL+  YCK G M+EAFL + +M EKG LP+ASTYNCLI GF R 
Sbjct: 385 DDIGVQGLAPNIITYNTLIDAYCKNGMMDEAFLSRSLMWEKGVLPDASTYNCLIAGFGRH 444

Query: 458 GKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTY 517
           G ME+ +++L+EMQ +G+KAD +TYNILI A+C+K E +KA R++ ++ DKGL PSHLTY
Sbjct: 445 GDMEKARDILDEMQNKGLKADLITYNILIDAFCKKGETRKATRILKDVFDKGLSPSHLTY 504

Query: 518 NILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLE 577
           N L++GYC +GNL+AALN+R QMEK+G+ ANV TYNVLI+G+C KGKLEDANGLLNEMLE
Sbjct: 505 NTLMDGYCKQGNLKAALNVRAQMEKDGKRANVATYNVLIKGFCEKGKLEDANGLLNEMLE 564

Query: 578 KGLIPNRTTYEIIKEEMMEKGFLPDIEGH 606
           KGL PN+ TYEI+KEEMM+KGF+PDIEGH
Sbjct: 565 KGLNPNQITYEIVKEEMMDKGFVPDIEGH 592

BLAST of CSPI02G04670 vs. TrEMBL
Match: A0A0D2SMN1_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_007G306300 PE=4 SV=1)

HSP 1 Score: 772.7 bits (1994), Expect = 3.3e-220
Identity = 365/561 (65.06%), Positives = 457/561 (81.46%), Query Frame = 1

Query: 50  IISDLISRQQWSILKSHVKFKSPIDFLHQLMGSGDVDPLLVLRYFNWSRRELNVNYSIEL 109
           I+++LIS+Q W  LK+H++  SPI  L QL+ S  VDP L LRYFNWS +E N+++S+E 
Sbjct: 28  IVTELISKQHWKSLKTHLQKASPITLLQQLLDSR-VDPCLTLRYFNWSEKEFNLSHSLEH 87

Query: 110 ICRLLNLLANAKHYPKIRSILDSFVKGETNCSISLIFHSLSVCSGQFCANSIIADMLVLA 169
            C L++ LANAK YPK+RS L SFVK E + S+S IFH++SV    FCA+SIIADMLVLA
Sbjct: 88  SCMLIHSLANAKRYPKMRSFLYSFVKNEKSISVSSIFHAISVSGDSFCASSIIADMLVLA 147

Query: 170 YVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEFVYKEMIRRKISPN 229
           YV N K+ L  EAFKRAGDY +KL+ +SCNPLLSALVKE++   VE++YKEMIRR+I  N
Sbjct: 148 YVNNVKSHLAFEAFKRAGDYGFKLTAVSCNPLLSALVKEDKIEDVEYMYKEMIRRRIEVN 207

Query: 230 LITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYCKMGRVGKMYKADA 289
            I+FNTVINGLCKVGKLNKA DV+ DMK WG  P+V+TYNTLI GYCK GR+GKMYKADA
Sbjct: 208 AISFNTVINGLCKVGKLNKASDVIQDMKAWGVLPDVITYNTLISGYCKKGRIGKMYKADA 267

Query: 290 ILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKPTVVTYNSLVNGLC 349
           ILKEM+ N+V P+ +T+N+LIDGFCKDENL AA+KVF EM++QGLKPTVVTYNSL+N L 
Sbjct: 268 ILKEMIANEVRPDEITYNILIDGFCKDENLMAAMKVFREMETQGLKPTVVTYNSLINKLG 327

Query: 350 NEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDNIGKQGLTPNVI 409
            EGKL+EA  +LDEM+ S LKPNV+TYN LINGYCKK  ++EA +LFD++ KQG+ P V+
Sbjct: 328 LEGKLDEASGMLDEMVGSGLKPNVVTYNVLINGYCKKGRMKEATDLFDDVVKQGIAPTVV 387

Query: 410 TFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCREGKMEEVKNLLNEM 469
           T+NTL++ YCK G+ME+AF L++ M++KG  P+ +TYNCLI G C EG +  V+ L+NEM
Sbjct: 388 TYNTLIYAYCKDGRMEDAFSLRESMVDKGTFPDVTTYNCLISGLCGEGNITAVRKLINEM 447

Query: 470 QCRGVKADTVTYNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTYNILLNGYCMEGNL 529
             +G+K + VTYNIL+ A C   E +KAARL+DEM+  GL+P+ +TYN L++GYC EGNL
Sbjct: 448 LNKGLKVNVVTYNILVDALCNDGESRKAARLLDEMVKMGLRPNQITYNTLMDGYCREGNL 507

Query: 530 RAALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEII 589
           RAALN+R +MEKEG  ANVVTYNVLI+G C+KGKLEDANGLLNEMLEKGLIPNRTTYEI+
Sbjct: 508 RAALNVRTRMEKEGMLANVVTYNVLIKGLCKKGKLEDANGLLNEMLEKGLIPNRTTYEIV 567

Query: 590 KEEMMEKGFLPDIEGHLYHAS 611
           K EM++KGF+PDIEGH+Y+ S
Sbjct: 568 KVEMVDKGFIPDIEGHMYNIS 587

BLAST of CSPI02G04670 vs. TrEMBL
Match: F6I4S5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0060g01220 PE=4 SV=1)

HSP 1 Score: 754.2 bits (1946), Expect = 1.2e-214
Identity = 368/586 (62.80%), Positives = 455/586 (77.65%), Query Frame = 1

Query: 25  STTHLPFCTYNPTCTAPISNDFDPLIISDLISRQQWSILKSHVKFKSPIDFLHQLMGSGD 84
           + T  PF + N T T P S+ FD   IS LI++Q WS LK+ VK  +P   L  L  S +
Sbjct: 26  NNTKTPFSSPNSTYTTPNSHTFDTPTISQLIAKQHWSKLKTIVKETNPSSLLQHLFNS-E 85

Query: 85  VDPLLVLRYFNWSRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSFVKGETNCSISL 144
             P L+L YF W+++E    +++E  CRLL+LLANAK+Y KIR++LDSF K   + S S 
Sbjct: 86  AQPDLILCYFKWTQKEFGAIHNVEQFCRLLHLLANAKNYNKIRALLDSFAKN-AHYSNST 145

Query: 145 IFHSLSVCSGQFCANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSA 204
           IFHSLSV     CANSII DMLV AYV+N +  L LE F RAGDY ++LS LSCNP+L +
Sbjct: 146 IFHSLSVLGSWGCANSIIVDMLVWAYVKNGEMDLALEGFDRAGDYGFRLSALSCNPMLVS 205

Query: 205 LVKENEFGGVEFVYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPN 264
           LVKE   G VE VYKEMIRR+I  N++TF+ VINGLCKVGK  KAGDVV+DMK WGF P+
Sbjct: 206 LVKEGRIGVVESVYKEMIRRRIGVNVVTFDVVINGLCKVGKFQKAGDVVEDMKAWGFSPS 265

Query: 265 VVTYNTLIDGYCKMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALK 324
           V+TYNT+IDGYCK    GKM+KADA+LKEMV  ++ PN +TFN+LIDGFC+DEN++AA K
Sbjct: 266 VITYNTIIDGYCK---AGKMFKADALLKEMVAKRIHPNEITFNILIDGFCRDENVTAAKK 325

Query: 325 VFEEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYC 384
           VFEEMQ QGL+P VVTYNSL+NGLC+ GKL+EA  L D+M    LKPNV+TYNALING+C
Sbjct: 326 VFEEMQRQGLQPNVVTYNSLINGLCSNGKLDEALGLQDKMSGMGLKPNVVTYNALINGFC 385

Query: 385 KKKLLEEARELFDNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNAS 444
           KKK+L+EARE+ D+IGK+GL PNVITFNTL+  Y K G+M++AFLL+ +ML+ G  PN S
Sbjct: 386 KKKMLKEAREMLDDIGKRGLAPNVITFNTLIDAYGKAGRMDDAFLLRSMMLDTGVCPNVS 445

Query: 445 TYNCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEM 504
           TYNCLIVGFCREG ++E + L  EM+  G+KAD VTYNIL+ A C+K E +KA RL+DEM
Sbjct: 446 TYNCLIVGFCREGNVKEARKLAKEMEGNGLKADLVTYNILVDALCKKGETRKAVRLLDEM 505

Query: 505 LDKGLKPSHLTYNILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKL 564
            + GL PSHLTYN L++GY  EGN  AALN+R  MEK+GR AN+VTYNVLI+G+C KGKL
Sbjct: 506 FEVGLNPSHLTYNALIDGYFREGNSTAALNVRTLMEKKGRRANIVTYNVLIKGFCNKGKL 565

Query: 565 EDANGLLNEMLEKGLIPNRTTYEIIKEEMMEKGFLPDIEGHLYHAS 611
           E+AN LLNEMLEKGLIPNRTTY+I+++EMMEKGF+PDI+GHLY+ S
Sbjct: 566 EEANRLLNEMLEKGLIPNRTTYDILRDEMMEKGFIPDIDGHLYNVS 606

BLAST of CSPI02G04670 vs. TrEMBL
Match: A0A067JV12_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23217 PE=4 SV=1)

HSP 1 Score: 739.6 bits (1908), Expect = 3.1e-210
Identity = 350/560 (62.50%), Positives = 447/560 (79.82%), Query Frame = 1

Query: 51  ISDLISRQQWSILKSHVKFKSPIDFLHQLMGSGDVDPLLVLRYFNWSRRELNVNYSIELI 110
           I+DL+++Q WS L++ +K  SPI  LHQL+ S +VDP L LRY  WS++EL +++S+EL 
Sbjct: 38  IADLVAKQHWSELRTRLKNTSPITLLHQLLSS-EVDPELTLRYLTWSKKELKLSHSLELT 97

Query: 111 CRLLNLLANAKHYPKIRSILDSFVKGETNCSISLIFHSLSVCSGQFCANSIIADMLVLAY 170
            R+L+ LA  K Y  IRS LD+FVK E N  +S IFH++SV    FCANSII DML+LAY
Sbjct: 98  FRILHSLAYTKKYSNIRSFLDNFVKNE-NYPVSSIFHAISVGGDSFCANSIIVDMLLLAY 157

Query: 171 VENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEFVYKEMIRRKISPNL 230
           V+N KT  G E FKRAGDY +KLSV+SCNPLLSALVK+ E G +EFVY EMI+R+I P+L
Sbjct: 158 VKNLKTHFGFETFKRAGDYGFKLSVISCNPLLSALVKDREIGDMEFVYNEMIKRRIHPSL 217

Query: 231 ITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYCKMGRVGKMYKADAI 290
           I+FN VINGLCK GKLNKA D+++DMKVWG   NVVTYNTLIDGYCKMG++GKMYK+DA+
Sbjct: 218 ISFNIVINGLCKAGKLNKATDIIEDMKVWGVSANVVTYNTLIDGYCKMGKIGKMYKSDAV 277

Query: 291 LKEMVENKVSPNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKPTVVTYNSLVNGLCN 350
           LKEMV N + PN VT+N+LI+GFCKD N+SAA+K+F EMQ QGLKPTVVTYNSL+NGLC 
Sbjct: 278 LKEMVSNGICPNEVTYNILINGFCKDGNVSAAMKIFAEMQRQGLKPTVVTYNSLINGLCT 337

Query: 351 EGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDNIGKQGLTPNVIT 410
           +G ++EA  L D M++S LKPN IT N+LING+CK K++++A E F+++ K G+ PNV+T
Sbjct: 338 DGNIDEAIALWDRMVASGLKPNAITQNSLINGFCKNKMVKKAVESFNDMPKLGIAPNVVT 397

Query: 411 FNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCREGKMEEVKNLLNEMQ 470
           +NT++   CK G+ME+AF L+ VML++G  PN  TYNCLI G C++  +E  +NL++EM 
Sbjct: 398 YNTIIDACCKAGRMEDAFALRDVMLDRGVPPNVCTYNCLIAGLCKKEDVEAARNLVDEMV 457

Query: 471 CRGVKADTVTYNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTYNILLNGYCMEGNLR 530
            + +KAD VTYNILI + C K E +KA RL+DE+  KGL PSH+TYN L++GYC EGNLR
Sbjct: 458 GKDLKADLVTYNILIDSLCNKGESRKAMRLLDEVSRKGLNPSHVTYNTLMDGYCKEGNLR 517

Query: 531 AALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIK 590
           AALNL+  MEK  R  N+VTYNVLI+G+C+KGK E AN LLNEMLEKGL+PNRTT+EI++
Sbjct: 518 AALNLKTIMEKGVRLPNIVTYNVLIKGFCKKGKFEYANDLLNEMLEKGLVPNRTTFEIVR 577

Query: 591 EEMMEKGFLPDIEGHLYHAS 611
           EEMMEKGF+PDIEGH+Y+ S
Sbjct: 578 EEMMEKGFVPDIEGHIYNLS 595

BLAST of CSPI02G04670 vs. TAIR10
Match: AT1G09820.1 (AT1G09820.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 677.6 bits (1747), Expect = 7.4e-195
Identity = 329/581 (56.63%), Positives = 432/581 (74.35%), Query Frame = 1

Query: 32  CTYNPTCT-APISNDFDPLIISDLISRQQWSILKSHVKFKSPIDFLHQLMGSGDVDPLLV 91
           C+ + T T +P    +D  +I+DLI +Q WS L  HV   +P +   QL+ S ++DP L 
Sbjct: 26  CSSSSTITGSPCPPRYDVAVIADLIEKQHWSKLGVHVTDINPNELFRQLISS-ELDPDLC 85

Query: 92  LRYFNWSRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSFVKGETNCSISLIFHSLS 151
           LRY++W  +  +++ S+EL  +LL+ LANAK Y KIRS LD FV+  ++  +  IFH++S
Sbjct: 86  LRYYSWLVKNSDISVSLELTFKLLHSLANAKRYSKIRSFLDGFVRNGSDHQVHSIFHAIS 145

Query: 152 VCSGQFCANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSALVKENE 211
           +C    C NSIIADMLVLAY  NS+  LG EAFKR+G Y YKLS LSC PL+ AL+KEN 
Sbjct: 146 MCDN-VCVNSIIADMLVLAYANNSRFELGFEAFKRSGYYGYKLSALSCKPLMIALLKENR 205

Query: 212 FGGVEFVYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPNVVTYNT 271
              VE+VYKEMIRRKI PN+ TFN VIN LCK GK+NKA DV++DMKV+G  PNVV+YNT
Sbjct: 206 SADVEYVYKEMIRRKIQPNVFTFNVVINALCKTGKMNKARDVMEDMKVYGCSPNVVSYNT 265

Query: 272 LIDGYCKMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALKVFEEMQ 331
           LIDGYCK+G  GKMYKADA+LKEMVEN VSPN  TFN+LIDGF KD+NL  ++KVF+EM 
Sbjct: 266 LIDGYCKLGGNGKMYKADAVLKEMVENDVSPNLTTFNILIDGFWKDDNLPGSMKVFKEML 325

Query: 332 SQGLKPTVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLE 391
            Q +KP V++YNSL+NGLCN GK++EA  + D+M+S+ ++PN+ITYNALING+CK  +L+
Sbjct: 326 DQDVKPNVISYNSLINGLCNGGKISEAISMRDKMVSAGVQPNLITYNALINGFCKNDMLK 385

Query: 392 EARELFDNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNASTYNCLI 451
           EA ++F ++  QG  P    +N L+  YCK GK+++ F L++ M  +G +P+  TYNCLI
Sbjct: 386 EALDMFGSVKGQGAVPTTRMYNMLIDAYCKLGKIDDGFALKEEMEREGIVPDVGTYNCLI 445

Query: 452 VGFCREGKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEMLDKGLK 511
            G CR G +E  K L +++  +G+  D VT++IL+  +C K E +KAA L+ EM   GLK
Sbjct: 446 AGLCRNGNIEAAKKLFDQLTSKGL-PDLVTFHILMEGYCRKGESRKAAMLLKEMSKMGLK 505

Query: 512 PSHLTYNILLNGYCMEGNLRAALNLRKQMEKEGRW-ANVVTYNVLIQGYCRKGKLEDANG 571
           P HLTYNI++ GYC EGNL+AA N+R QMEKE R   NV +YNVL+QGY +KGKLEDAN 
Sbjct: 506 PRHLTYNIVMKGYCKEGNLKAATNMRTQMEKERRLRMNVASYNVLLQGYSQKGKLEDANM 565

Query: 572 LLNEMLEKGLIPNRTTYEIIKEEMMEKGFLPDIEGHLYHAS 611
           LLNEMLEKGL+PNR TYEI+KEEM+++GF+PDIEGHL++ S
Sbjct: 566 LLNEMLEKGLVPNRITYEIVKEEMVDQGFVPDIEGHLFNVS 603
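
The percentages in an HSP statistics line follow directly from the residue counts (matches divided by alignment length). A minimal sketch of that arithmetic, assuming the "label = n/m (p%)" layout used in the HSP headers on this page; `parse_hsp_stats` is a hypothetical helper, not part of any BLAST tool:

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matches, aligned_length, percent)} for each 'label = n/m' field."""
    stats = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(num), int(den)
        # Percent is recomputed from the counts, rounded to two decimals.
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats

# Statistics line of the AT1G09820.1 HSP shown above.
header = "Identity = 329/581 (56.63%), Positives = 432/581 (74.35%), Query Frame = 1"
stats = parse_hsp_stats(header)
print(stats)
```

The "Query Frame" field is skipped because it carries no n/m fraction.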

BLAST of CSPI02G04670 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 324.3 bits (830), Expect = 1.6e-88
Identity = 179/507 (35.31%), Positives = 285/507 (56.21%), Query Frame = 1

Query: 86  DPLLVLRYFNWSRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSFVKGET--NCSIS 145
           D  L+L++ NW+    +  +++   C  L++L   K Y K   IL   V  +T  +   S
Sbjct: 61  DQALILKFLNWANP--HQFFTLRCKCITLHILTKFKLY-KTAQILAEDVAAKTLDDEYAS 120

Query: 146 LIFHSLSVCSGQFCANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLS 205
           L+F SL        + S + D++V +Y   S     L     A  + +   VLS N +L 
Sbjct: 121 LVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLD 180

Query: 206 ALVK-ENEFGGVEFVYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFW 265
           A ++ +      E V+KEM+  ++SPN+ T+N +I G C  G ++ A  + D M+  G  
Sbjct: 181 ATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCL 240

Query: 266 PNVVTYNTLIDGYCKMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAA 325
           PNVVTYNTLIDGYCK+ ++   +K   +L+ M    + PN +++NV+I+G C++  +   
Sbjct: 241 PNVVTYNTLIDGYCKLRKIDDGFK---LLRSMALKGLEPNLISYNVVINGLCREGRMKEV 300

Query: 326 LKVFEEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALING 385
             V  EM  +G     VTYN+L+ G C EG  ++A V+  EML   L P+VITY +LI+ 
Sbjct: 301 SFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHS 360

Query: 386 YCKKKLLEEARELFDNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPN 445
            CK   +  A E  D +  +GL PN  T+ TL+ G+ + G M EA+ + + M + GF P+
Sbjct: 361 MCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPS 420

Query: 446 ASTYNCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLID 505
             TYN LI G C  GKME+   +L +M+ +G+  D V+Y+ ++S +C   +  +A R+  
Sbjct: 421 VVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKR 480

Query: 506 EMLDKGLKPSHLTYNILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKG 565
           EM++KG+KP  +TY+ L+ G+C +   + A +L ++M + G   +  TY  LI  YC +G
Sbjct: 481 EMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEG 540

Query: 566 KLEDANGLLNEMLEKGLIPNRTTYEII 590
            LE A  L NEM+EKG++P+  TY ++
Sbjct: 541 DLEKALQLHNEMVEKGVLPDVVTYSVL 561

BLAST of CSPI02G04670 vs. TAIR10
Match: AT1G05670.1 (AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 298.5 bits (763), Expect = 9.3e-81
Identity = 174/571 (30.47%), Positives = 309/571 (54.12%), Query Frame = 1

Query: 30  PFCTYNPTCTAPISNDFDPLIISDLISRQQWSILKS----HVKFKSPIDFLHQLMGSGDV 89
           PF  Y+P   +    +F   I + +  R+   + +S      KFK+  D L  ++     
Sbjct: 42  PFPDYSPKKASVRDTEFVHQITNVIKLRRAEPLRRSLKPYECKFKT--DHLIWVLMKIKC 101

Query: 90  DPLLVLRYFNWSRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSF-VKGETNCSISL 149
           D  LVL +F+W+R   + N  +E +C +++L   +K     +S++ SF  + + N + S 
Sbjct: 102 DYRLVLDFFDWARSRRDSN--LESLCIVIHLAVASKDLKVAQSLISSFWERPKLNVTDSF 161

Query: 150 I--FHSLSVCSGQFCANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLL 209
           +  F  L      + ++  + D+     V+          F++  +Y   LSV SCN  L
Sbjct: 162 VQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYL 221

Query: 210 SALVKE-NEFGGVEFVYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGF 269
           + L K+  +      V++E     +  N+ ++N VI+ +C++G++ +A  ++  M++ G+
Sbjct: 222 TRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGY 281

Query: 270 WPNVVTYNTLIDGYCKMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSA 329
            P+V++Y+T+++GYC+ G + K++K   +++ M    + PNS  +  +I   C+   L+ 
Sbjct: 282 TPDVISYSTVVNGYCRFGELDKVWK---LIEVMKRKGLKPNSYIYGSIIGLLCRICKLAE 341

Query: 330 ALKVFEEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALIN 389
           A + F EM  QG+ P  V Y +L++G C  G +  A     EM S ++ P+V+TY A+I+
Sbjct: 342 AEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIIS 401

Query: 390 GYCKKKLLEEARELFDNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLP 449
           G+C+   + EA +LF  +  +GL P+ +TF  L++GYCK G M++AF +   M++ G  P
Sbjct: 402 GFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSP 461

Query: 450 NASTYNCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLI 509
           N  TY  LI G C+EG ++    LL+EM   G++ +  TYN +++  C+    ++A +L+
Sbjct: 462 NVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLV 521

Query: 510 DEMLDKGLKPSHLTYNILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRK 569
            E    GL    +TY  L++ YC  G +  A  + K+M  +G    +VT+NVL+ G+C  
Sbjct: 522 GEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLH 581

Query: 570 GKLEDANGLLNEMLEKGLIPNRTTYEIIKEE 593
           G LED   LLN ML KG+ PN TT+  + ++
Sbjct: 582 GMLEDGEKLLNWMLAKGIAPNATTFNSLVKQ 605

BLAST of CSPI02G04670 vs. TAIR10
Match: AT1G63130.1 (AT1G63130.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 297.0 bits (759), Expect = 2.7e-80
Identity = 150/453 (33.11%), Positives = 259/453 (57.17%), Query Frame = 1

Query: 165 MLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEFVYKEMIRR 224
           +L+  +   S+  L L    +     Y+  +++ N LL+     N       +  +M+  
Sbjct: 121 ILINCFCRRSQLSLALAVLAKMMKLGYEPDIVTLNSLLNGFCHGNRISDAVSLVGQMVEM 180

Query: 225 KISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYCKMGRVGKM 284
              P+  TFNT+I+GL +  + ++A  +VD M V G  P++VTY  +++G CK G +   
Sbjct: 181 GYQPDSFTFNTLIHGLFRHNRASEAVALVDRMVVKGCQPDLVTYGIVVNGLCKRGDIDL- 240

Query: 285 YKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKPTVVTYNSL 344
             A ++LK+M + K+ P  V +N +ID  C  +N++ AL +F EM ++G++P VVTYNSL
Sbjct: 241 --ALSLLKKMEQGKIEPGVVIYNTIIDALCNYKNVNDALNLFTEMDNKGIRPNVVTYNSL 300

Query: 345 VNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDNIGKQGL 404
           +  LCN G+ ++A  LL +M+   + PNV+T++ALI+ + K+  L EA +L+D + K+ +
Sbjct: 301 IRCLCNYGRWSDASRLLSDMIERKINPNVVTFSALIDAFVKEGKLVEAEKLYDEMIKRSI 360

Query: 405 TPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCREGKMEEVKN 464
            P++ T+++L++G+C   +++EA  + ++M+ K   PN  TYN LI GFC+  +++E   
Sbjct: 361 DPDIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLIKGFCKAKRVDEGME 420

Query: 465 LLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTYNILLNGYC 524
           L  EM  RG+  +TVTY  LI  + + +E   A  +  +M+  G+ P  +TY+ILL+G C
Sbjct: 421 LFREMSQRGLVGNTVTYTTLIHGFFQARECDNAQIVFKQMVSDGVLPDIMTYSILLDGLC 480

Query: 525 MEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRT 584
             G +  AL + + +++     ++ TYN++I+G C+ GK+ED   L   +  KG+ PN  
Sbjct: 481 NNGKVETALVVFEYLQRSKMEPDIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVV 540

Query: 585 TY----------------EIIKEEMMEKGFLPD 602
           TY                + +  EM E+G LPD
Sbjct: 541 TYTTMMSGFCRKGLKEEADALFREMKEEGPLPD 570

BLAST of CSPI02G04670 vs. TAIR10
Match: AT5G55840.1 (AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 293.9 bits (751), Expect = 2.3e-79
Identity = 161/505 (31.88%), Positives = 270/505 (53.47%), Query Frame = 1

Query: 89  LVLRYFNWSRRE--LNVNYSIELICRLLNLLANAKHYPKIRSILD--SFVKGETNCSISL 148
           L L++  W  ++  L  ++ ++L+C   ++L  A+ Y   R IL   S + G+++     
Sbjct: 92  LALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSFVFGA 151

Query: 149 IFHSLSVCSGQFCANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSA 208
           +  +  +C+    +N  + D+L+  Y+        LE F+  G Y +  SV +CN +L +
Sbjct: 152 LMTTYRLCN----SNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGS 211

Query: 209 LVKENEFGGVEFVYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPN 268
           +VK  E   V    KEM++RKI P++ TFN +IN LC  G   K+  ++  M+  G+ P 
Sbjct: 212 VVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPT 271

Query: 269 VVTYNTLIDGYCKMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALK 328
           +VTYNT++  YCK GR      A  +L  M    V  +  T+N+LI   C+   ++    
Sbjct: 272 IVTYNTVLHWYCKKGR---FKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYL 331

Query: 329 VFEEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYC 388
           +  +M+ + + P  VTYN+L+NG  NEGK+  A  LL+EMLS  L PN +T+NALI+G+ 
Sbjct: 332 LLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHI 391

Query: 389 KKKLLEEARELFDNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNAS 448
            +   +EA ++F  +  +GLTP+ +++  LL G CK  + + A      M   G      
Sbjct: 392 SEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRI 451

Query: 449 TYNCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEM 508
           TY  +I G C+ G ++E   LLNEM   G+  D VTY+ LI+ +C+    K A  ++  +
Sbjct: 452 TYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRI 511

Query: 509 LDKGLKPSHLTYNILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKL 568
              GL P+ + Y+ L+   C  G L+ A+ + + M  EG   +  T+NVL+   C+ GK+
Sbjct: 512 YRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKV 571

Query: 569 EDANGLLNEMLEKGLIPNRTTYEII 590
            +A   +  M   G++PN  +++ +
Sbjct: 572 AEAEEFMRCMTSDGILPNTVSFDCL 589

BLAST of CSPI02G04670 vs. NCBI nr
Match: gi|449454139|ref|XP_004144813.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g09820 [Cucumis sativus])

HSP 1 Score: 1248.0 bits (3228), Expect = 0.0e+00
Identity = 611/611 (100.00%), Positives = 611/611 (100.00%), Query Frame = 1

Query: 1   MNVIIRHSSASRIFPAKIDRNCIFSTTHLPFCTYNPTCTAPISNDFDPLIISDLISRQQW 60
           MNVIIRHSSASRIFPAKIDRNCIFSTTHLPFCTYNPTCTAPISNDFDPLIISDLISRQQW
Sbjct: 1   MNVIIRHSSASRIFPAKIDRNCIFSTTHLPFCTYNPTCTAPISNDFDPLIISDLISRQQW 60

Query: 61  SILKSHVKFKSPIDFLHQLMGSGDVDPLLVLRYFNWSRRELNVNYSIELICRLLNLLANA 120
           SILKSHVKFKSPIDFLHQLMGSGDVDPLLVLRYFNWSRRELNVNYSIELICRLLNLLANA
Sbjct: 61  SILKSHVKFKSPIDFLHQLMGSGDVDPLLVLRYFNWSRRELNVNYSIELICRLLNLLANA 120

Query: 121 KHYPKIRSILDSFVKGETNCSISLIFHSLSVCSGQFCANSIIADMLVLAYVENSKTVLGL 180
           KHYPKIRSILDSFVKGETNCSISLIFHSLSVCSGQFCANSIIADMLVLAYVENSKTVLGL
Sbjct: 121 KHYPKIRSILDSFVKGETNCSISLIFHSLSVCSGQFCANSIIADMLVLAYVENSKTVLGL 180

Query: 181 EAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEFVYKEMIRRKISPNLITFNTVINGL 240
           EAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEFVYKEMIRRKISPNLITFNTVINGL
Sbjct: 181 EAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEFVYKEMIRRKISPNLITFNTVINGL 240

Query: 241 CKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYCKMGRVGKMYKADAILKEMVENKVS 300
           CKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYCKMGRVGKMYKADAILKEMVENKVS
Sbjct: 241 CKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYCKMGRVGKMYKADAILKEMVENKVS 300

Query: 301 PNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVL 360
           PNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVL
Sbjct: 301 PNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVL 360

Query: 361 LDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDNIGKQGLTPNVITFNTLLHGYCK 420
           LDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDNIGKQGLTPNVITFNTLLHGYCK
Sbjct: 361 LDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDNIGKQGLTPNVITFNTLLHGYCK 420

Query: 421 FGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVT 480
           FGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVT
Sbjct: 421 FGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVT 480

Query: 481 YNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTYNILLNGYCMEGNLRAALNLRKQME 540
           YNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTYNILLNGYCMEGNLRAALNLRKQME
Sbjct: 481 YNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTYNILLNGYCMEGNLRAALNLRKQME 540

Query: 541 KEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIKEEMMEKGFLP 600
           KEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIKEEMMEKGFLP
Sbjct: 541 KEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIKEEMMEKGFLP 600

Query: 601 DIEGHLYHASQ 612
           DIEGHLYHASQ
Sbjct: 601 DIEGHLYHASQ 611

BLAST of CSPI02G04670 vs. NCBI nr
Match: gi|659070136|ref|XP_008453581.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g09820 [Cucumis melo])

HSP 1 Score: 1163.7 bits (3009), Expect = 0.0e+00
Identity = 572/610 (93.77%), Positives = 585/610 (95.90%), Query Frame = 1

Query: 1   MNVIIRHSSASRIFPAKIDRNCIFSTTHLPFCTYNPTCTAPISNDFDPLIISDLISRQQW 60
           MNVIIR SSASRIFP KIDRN IFSTTHL FCTYN TCTAP SNDFDPLIISDLISRQ+W
Sbjct: 1   MNVIIRLSSASRIFPVKIDRNYIFSTTHLSFCTYNSTCTAPTSNDFDPLIISDLISRQRW 60

Query: 61  SILKSHVKFKSPIDFLHQLMGSGDVDPLLVLRYFNWSRRELNVNYSIELICRLLNLLANA 120
           SILKSHVKFKSPIDFLHQLM SG VDPLLVLRYFNWSRREL VNYSIELICRLL+LLAN 
Sbjct: 61  SILKSHVKFKSPIDFLHQLMCSGAVDPLLVLRYFNWSRRELKVNYSIELICRLLHLLANV 120

Query: 121 KHYPKIRSILDSFVKGETNCSISLIFHSLSVCSGQFCANSIIADMLVLAYVENSKTVLGL 180
           K+YPKIRS+LDSFVKGETNCSISLIFHSLSVCS QFCANSIIADMLVLAYV+NSKTVLGL
Sbjct: 121 KYYPKIRSVLDSFVKGETNCSISLIFHSLSVCSDQFCANSIIADMLVLAYVQNSKTVLGL 180

Query: 181 EAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEFVYKEMIRRKISPNLITFNTVINGL 240
           EAFKRAGDYRYKLSVLSCNPLLSALVKE+EFG VEFVYKEMIRRKISPNLITFN VINGL
Sbjct: 181 EAFKRAGDYRYKLSVLSCNPLLSALVKESEFGDVEFVYKEMIRRKISPNLITFNIVINGL 240

Query: 241 CKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYCKMGRVGKMYKADAILKEMVENKVS 300
           CKVGKLNKAGDV+DDMKVWGFWPN VTYNTLIDGYCKMGRVGKMYKADAILKEMV NKVS
Sbjct: 241 CKVGKLNKAGDVIDDMKVWGFWPNAVTYNTLIDGYCKMGRVGKMYKADAILKEMVGNKVS 300

Query: 301 PNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVL 360
           PN VTFNVLIDGFCKDEN+S ALKVFEEMQSQGLKPTVVTYNSL+NG+CNEGKLNEAKVL
Sbjct: 301 PNIVTFNVLIDGFCKDENVSGALKVFEEMQSQGLKPTVVTYNSLINGMCNEGKLNEAKVL 360

Query: 361 LDEMLSSNLKPNVITYNALINGYCKKKLLEEARELFDNIGKQGLTPNVITFNTLLHGYCK 420
           LDEMLSSNLKPNVITYNALINGYCKKK LEEARELFDNIGKQGLTPNVITFNTLL GYCK
Sbjct: 361 LDEMLSSNLKPNVITYNALINGYCKKKKLEEARELFDNIGKQGLTPNVITFNTLLDGYCK 420

Query: 421 FGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVT 480
            GKMEEAFLLQKVMLEKGFLP+ STYNCLIVGFCREGKMEEVKNLLNEM+CRGVKADTVT
Sbjct: 421 CGKMEEAFLLQKVMLEKGFLPDVSTYNCLIVGFCREGKMEEVKNLLNEMECRGVKADTVT 480

Query: 481 YNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTYNILLNGYCMEGNLRAALNLRKQME 540
           YNILISAWCEKKEPKKAARLIDEMLD+GLKPSHLTYNILLNGYCMEGNLRAALNLRKQME
Sbjct: 481 YNILISAWCEKKEPKKAARLIDEMLDRGLKPSHLTYNILLNGYCMEGNLRAALNLRKQME 540

Query: 541 KEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIKEEMMEKGFLP 600
           KE  WANVVTYNVLI GYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIK EMMEKGFLP
Sbjct: 541 KERIWANVVTYNVLILGYCRKGKLEDANGLLNEMLEKGLIPNRTTYEIIKVEMMEKGFLP 600

Query: 601 DIEGHLYHAS 611
           DIEGHLYHAS
Sbjct: 601 DIEGHLYHAS 610

BLAST of CSPI02G04670 vs. NCBI nr
Match: gi|645273268|ref|XP_008241796.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g09820 [Prunus mume])

HSP 1 Score: 785.8 bits (2028), Expect = 5.5e-224
Identity = 381/584 (65.24%), Positives = 462/584 (79.11%), Query Frame = 1

Query: 27  THLPFCTYNPTCTAPISNDFDPLIISDLISRQQWSILKSHVKFKSPIDFLHQLMGSGDVD 86
           T +  C+     T+P S  F   IIS+LI++Q WS LK+H+K  + I  LHQL  +G  D
Sbjct: 24  TCIRLCSLISISTSPSSQPFSLPIISELIAKQHWSELKTHLKDSNFIAVLHQLFDAG-AD 83

Query: 87  PLLVLRYFNWSRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSFVKGETNCSISLIF 146
           P+L+LRYF+WS++  NV + +E  CRLL+ LANAK Y KIR+ LD FV+     S S IF
Sbjct: 84  PVLILRYFSWSQKNFNVTHPLEFTCRLLHSLANAKKYSKIRAFLDGFVRNNEKRSNSSIF 143

Query: 147 HSLSVCSGQFCANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSALV 206
           H LS+C  QFCANS+I DMLVLAYV+N KT LG EAF+RAGDY +KLSVLSCNPLLSALV
Sbjct: 144 HMLSMCGNQFCANSVIIDMLVLAYVKNMKTRLGFEAFQRAGDYGFKLSVLSCNPLLSALV 203

Query: 207 KENEFGGVEFVYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPNVV 266
           KENE G VE+VYKEM+RR+I  +L TF+ VINGLCKVGKLNKA DV +DMK WG  PNVV
Sbjct: 204 KENEIGYVEYVYKEMVRRRIEADLFTFSIVINGLCKVGKLNKARDVTNDMKAWGISPNVV 263

Query: 267 TYNTLIDGYCKMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALKVF 326
           TYN LIDGYCK G +GKM+KADAILKEMV N V PN +TFN+LIDGFCKDENL++A+KVF
Sbjct: 264 TYNILIDGYCKKGGLGKMHKADAILKEMVANNVHPNEITFNILIDGFCKDENLASAVKVF 323

Query: 327 EEMQSQGLKPTVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKK 386
           EEM+ QGLKP V+TYNSL+NGLC  GKL+EA  L DEML   LKPN++TYNALING+CKK
Sbjct: 324 EEMK-QGLKPNVITYNSLINGLCCNGKLDEACGLRDEMLGLGLKPNIVTYNALINGFCKK 383

Query: 387 KLLEEARELFDNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNASTY 446
           K+++EA+ELFD+I   GL PN IT+NTL+  YCK G MEEA+ L   MLE+   PN ST+
Sbjct: 384 KMMKEAKELFDDIMNGGLVPNAITYNTLIDAYCKHGMMEEAYALHNSMLERRVSPNTSTF 443

Query: 447 NCLIVGFCREGKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEMLD 506
           NC I  FCR+G ME  +  L+EM+ RG+KAD +TYN+LI A+C++ E +KA  L++EM  
Sbjct: 444 NCWIACFCRQGNMELARKFLHEMEVRGLKADPITYNLLIDAFCKEGESRKAEGLLNEMFK 503

Query: 507 KGLKPSHLTYNILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKLED 566
           KGL PSH+TYN L++GYC EGNL+AALN+R QMEKEG+ AN+VTYNVLI+G+C KGKL+ 
Sbjct: 504 KGLSPSHVTYNTLMDGYCKEGNLKAALNVRLQMEKEGKRANIVTYNVLIKGHCMKGKLKV 563

Query: 567 ANGLLNEMLEKGLIPNRTTYEIIKEEMMEKGFLPDIEGHLYHAS 611
           AN LLNEMLEKGL+PNRTTYEI+KEEMMEKGFLPDIEGHLY+ S
Sbjct: 564 ANELLNEMLEKGLVPNRTTYEIVKEEMMEKGFLPDIEGHLYNIS 605

BLAST of CSPI02G04670 vs. NCBI nr
Match: gi|703135857|ref|XP_010106002.1| (hypothetical protein L484_001609 [Morus notabilis])

HSP 1 Score: 776.5 bits (2004), Expect = 3.3e-221
Identity = 370/569 (65.03%), Positives = 461/569 (81.02%), Query Frame = 1

Query: 38  CTAPISNDFDPLIISDLISRQQWSILK-SHVKFKSPIDFLHQLMGSGDVDPLLVLRYFNW 97
           CTA IS+ F+  ++S+LI++Q WS LK +H+   +P   L QL  S +VDP L+ RYFNW
Sbjct: 26  CTASISHTFNAPLVSELIAKQHWSELKRTHLTDSNPTKLLQQLFES-EVDPDLIFRYFNW 85

Query: 98  SRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSFVKGETNCSISLIFHSLSVCSGQF 157
           S +ELN+++++EL CRLL+ LA AK Y KIR+ LD FVK     S   IFHSLS+ S +F
Sbjct: 86  SHKELNISHTLELTCRLLHSLATAKKYSKIRAFLDGFVKRNVEHSNFTIFHSLSISSDRF 145

Query: 158 CANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEF 217
           C +SII DMLVLAY +N K+ L  EAFKRAGDY +KLS LS NPLL ALVKEN+ G VEF
Sbjct: 146 CTSSIIVDMLVLAYAKNLKSHLAFEAFKRAGDYGFKLSALSLNPLLCALVKENKIGQVEF 205

Query: 218 VYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYC 277
           VYKEMIRRKI+ +L TF+ V+NGLCK GKLNKAGD++ DMK +G  PNVVTYN LIDGYC
Sbjct: 206 VYKEMIRRKITGDLYTFSIVVNGLCKAGKLNKAGDIIQDMKAFGVLPNVVTYNILIDGYC 265

Query: 278 KMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKP 337
           KMG++GKMYKA+AIL+EMV NK+ PN +T+N+LI+GFCKDEN++A +KVFEEMQ QGLKP
Sbjct: 266 KMGKLGKMYKAEAILREMVANKICPNEITYNILINGFCKDENVAAGMKVFEEMQRQGLKP 325

Query: 338 TVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELF 397
            VVTYNSL++GLC EGK  EA  L DEML   LKPNV+T+NAL+ G+CKKK++ EARE+F
Sbjct: 326 NVVTYNSLIDGLCTEGKHEEACDLKDEMLGCGLKPNVVTFNALVKGFCKKKMIREAREVF 385

Query: 398 DNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCRE 457
           D+IG QGL PN+IT+NTL+  YCK G M+EAFL + +M EKG LP+ASTYNCLI GF R 
Sbjct: 386 DDIGVQGLAPNIITYNTLIDAYCKNGMMDEAFLSRSLMWEKGVLPDASTYNCLIAGFGRH 445

Query: 458 GKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTY 517
           G ME+ +++L+EMQ +G+KAD +TYNILI A+C+K E +KA R++ ++ DKGL PSHLTY
Sbjct: 446 GDMEKARDILDEMQNKGLKADLITYNILIDAFCKKGETRKATRILKDVFDKGLSPSHLTY 505

Query: 518 NILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLE 577
           N L++GYC +GNL+AALN+R QMEK+G+ ANV TYNVLI+G+C KGKLEDANGLLNEMLE
Sbjct: 506 NTLMDGYCKQGNLKAALNVRAQMEKDGKRANVATYNVLIKGFCEKGKLEDANGLLNEMLE 565

Query: 578 KGLIPNRTTYEIIKEEMMEKGFLPDIEGH 606
           KGL PN+ TYEI+KEEMM+KGF+PDIEGH
Sbjct: 566 KGLNPNQITYEIVKEEMMDKGFVPDIEGH 593

BLAST of CSPI02G04670 vs. NCBI nr
Match: gi|703144178|ref|XP_010108224.1| (hypothetical protein L484_005698 [Morus notabilis])

HSP 1 Score: 776.5 bits (2004), Expect = 3.3e-221
Identity = 370/569 (65.03%), Positives = 461/569 (81.02%), Query Frame = 1

Query: 38  CTAPISNDFDPLIISDLISRQQWSILK-SHVKFKSPIDFLHQLMGSGDVDPLLVLRYFNW 97
           CTA IS+ F+  ++S+LI++Q WS LK +H+   +P   L QL  S +VDP L+ RYFNW
Sbjct: 25  CTASISHTFNAPLVSELIAKQHWSELKRTHLTDSNPTKLLQQLFES-EVDPDLIFRYFNW 84

Query: 98  SRRELNVNYSIELICRLLNLLANAKHYPKIRSILDSFVKGETNCSISLIFHSLSVCSGQF 157
           S +ELN+++++EL CRLL+ LA AK Y KIR+ LD FVK     S   IFHSLS+ S +F
Sbjct: 85  SHKELNISHTLELTCRLLHSLATAKKYSKIRAFLDGFVKRNVEHSNFTIFHSLSISSDRF 144

Query: 158 CANSIIADMLVLAYVENSKTVLGLEAFKRAGDYRYKLSVLSCNPLLSALVKENEFGGVEF 217
           C +SII DMLVLAY +N K+ L  EAFKRAGDY +KLS LS NPLL ALVKEN+ G VEF
Sbjct: 145 CTSSIIVDMLVLAYAKNLKSHLAFEAFKRAGDYGFKLSALSLNPLLCALVKENKIGQVEF 204

Query: 218 VYKEMIRRKISPNLITFNTVINGLCKVGKLNKAGDVVDDMKVWGFWPNVVTYNTLIDGYC 277
           VYKEMIRRKI+ +L TF+ V+NGLCK GKLNKAGD++ DMK +G  PNVVTYN LIDGYC
Sbjct: 205 VYKEMIRRKITGDLYTFSIVVNGLCKAGKLNKAGDIIQDMKAFGVLPNVVTYNILIDGYC 264

Query: 278 KMGRVGKMYKADAILKEMVENKVSPNSVTFNVLIDGFCKDENLSAALKVFEEMQSQGLKP 337
           KMG++GKMYKA+AIL+EMV NK+ PN +T+N+LI+GFCKDEN++A +KVFEEMQ QGLKP
Sbjct: 265 KMGKLGKMYKAEAILREMVANKICPNEITYNILINGFCKDENVAAGMKVFEEMQRQGLKP 324

Query: 338 TVVTYNSLVNGLCNEGKLNEAKVLLDEMLSSNLKPNVITYNALINGYCKKKLLEEARELF 397
            VVTYNSL++GLC EGK  EA  L DEML   LKPNV+T+NAL+ G+CKKK++ EARE+F
Sbjct: 325 NVVTYNSLIDGLCTEGKHEEACDLKDEMLGCGLKPNVVTFNALVKGFCKKKMIREAREVF 384

Query: 398 DNIGKQGLTPNVITFNTLLHGYCKFGKMEEAFLLQKVMLEKGFLPNASTYNCLIVGFCRE 457
           D+IG QGL PN+IT+NTL+  YCK G M+EAFL + +M EKG LP+ASTYNCLI GF R 
Sbjct: 385 DDIGVQGLAPNIITYNTLIDAYCKNGMMDEAFLSRSLMWEKGVLPDASTYNCLIAGFGRH 444

Query: 458 GKMEEVKNLLNEMQCRGVKADTVTYNILISAWCEKKEPKKAARLIDEMLDKGLKPSHLTY 517
           G ME+ +++L+EMQ +G+KAD +TYNILI A+C+K E +KA R++ ++ DKGL PSHLTY
Sbjct: 445 GDMEKARDILDEMQNKGLKADLITYNILIDAFCKKGETRKATRILKDVFDKGLSPSHLTY 504

Query: 518 NILLNGYCMEGNLRAALNLRKQMEKEGRWANVVTYNVLIQGYCRKGKLEDANGLLNEMLE 577
           N L++GYC +GNL+AALN+R QMEK+G+ ANV TYNVLI+G+C KGKLEDANGLLNEMLE
Sbjct: 505 NTLMDGYCKQGNLKAALNVRAQMEKDGKRANVATYNVLIKGFCEKGKLEDANGLLNEMLE 564

Query: 578 KGLIPNRTTYEIIKEEMMEKGFLPDIEGH 606
           KGL PN+ TYEI+KEEMM+KGF+PDIEGH
Sbjct: 565 KGLNPNQITYEIVKEEMMDKGFVPDIEGH 592

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PPR27_ARATH	1.3e-193	56.63	Pentatricopeptide repeat-containing protein At1g09820 OS=Arabidopsis thaliana GN...
PP407_ARATH	2.8e-87	35.31	Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN...
PPR12_ARATH	1.7e-79	30.47	Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
PPR99_ARATH	4.8e-79	33.11	Pentatricopeptide repeat-containing protein At1g63130, mitochondrial OS=Arabidop...
PP432_ARATH	4.1e-78	31.88	Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN...
Match Name	E-value	Identity	Description
W9SHF9_9ROSA	2.3e-221	65.03	Uncharacterized protein OS=Morus notabilis GN=L484_001609 PE=4 SV=1
W9RZ47_9ROSA	2.3e-221	65.03	Uncharacterized protein OS=Morus notabilis GN=L484_005698 PE=4 SV=1
A0A0D2SMN1_GOSRA	3.3e-220	65.06	Uncharacterized protein OS=Gossypium raimondii GN=B456_007G306300 PE=4 SV=1
F6I4S5_VITVI	1.2e-214	62.80	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0060g01220 PE=4 SV=...
A0A067JV12_JATCU	3.1e-210	62.50	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_23217 PE=4 SV=1
Match Name	E-value	Identity	Description
AT1G09820.1	7.4e-195	56.63	Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G39710.1	1.6e-88	35.31	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G05670.1	9.3e-81	30.47	Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G63130.1	2.7e-80	33.11	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G55840.1	2.3e-79	31.88	Pentatricopeptide repeat (PPR) superfamily protein
Match Name	E-value	Identity	Description
gi|449454139|ref|XP_004144813.1|	0.0e+00	100.00	PREDICTED: pentatricopeptide repeat-containing protein At1g09820 [Cucumis sativu...
gi|659070136|ref|XP_008453581.1|	0.0e+00	93.77	PREDICTED: pentatricopeptide repeat-containing protein At1g09820 [Cucumis melo]
gi|645273268|ref|XP_008241796.1|	5.5e-224	65.24	PREDICTED: pentatricopeptide repeat-containing protein At1g09820 [Prunus mume]
gi|703135857|ref|XP_010106002.1|	3.3e-221	65.03	hypothetical protein L484_001609 [Morus notabilis]
gi|703144178|ref|XP_010108224.1|	3.3e-221	65.03	hypothetical protein L484_005698 [Morus notabilis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR002885	Pentatricopeptide_repeat
IPR011990	TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0005515	protein binding
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI02G04670.1	CSPI02G04670.1	mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 197..226, score:
IPR002885	Pentatricopeptide repeat	PFAM	PF12854	PPR_1	coord: 437..470, score: 9.1
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 301..350, score: 1.3E-20; coord: 371..420, score: 1.0E-19; coord: 477..524, score: 3.6E-17; coord: 228..277, score: 1.1E-18; coord: 547..589, score: 3.6
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756	coord: 266..302, score: 1.3E-7; coord: 374..408, score: 4.7E-9; coord: 515..543, score: 1.2E-6; coord: 304..338, score: 2.4E-10; coord: 479..512, score: 2.6E-8; coord: 549..582, score: 3.5E-10; coord: 445..477, score: 2.7E-9; coord: 409..443, score: 3.0E-8; coord: 339..373, score: 7.7E-10; coord: 197..229, score: 8.1E-6; coord: 231..263, score: 2.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR	coord: 547..581, score: 14.601; coord: 264..301, score: 12.123; coord: 442..476, score: 12.573; coord: 229..263, score: 11.685; coord: 477..511, score: 13.274; coord: 194..228, score: 9.186; coord: 337..371, score: 13.23; coord: 302..336, score: 13.735; coord: 372..406, score: 13.581; coord: 512..546, score: 10.391; coord: 407..441, score: 13
IPR011990	Tetratricopeptide-like helical domain	GENE3D	G3DSA:1.25.40.10		coord: 480..576, score: 9.0E-8; coord: 310..424, score: 9.
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 121..607, score: 1.5E-236; coord: 13..97, score: 1.5E
None	No IPR available	PANTHER	PTHR24015:SF745	SUBFAMILY NOT NAMED	coord: 121..607, score: 1.5E-236; coord: 13..97, score: 1.5E
None	No IPR available	unknown	SSF81901	HCP-like	coord: 310..462, score: 5.49E-6; coord: 415..579, score: 2.8
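
The PS51375 (PPR profile) coordinates above are listed out of order. Sorting them suggests a run of back-to-back repeats, which fits the roughly 35-residue PPR motif. A minimal sketch of that check, with the coordinates copied from the PS51375 row; `summarize_repeats` is a hypothetical helper introduced only for illustration:

```python
# PS51375 hit coordinates (1-based, inclusive) as listed in the InterPro table.
ppr_hits = [
    (547, 581), (264, 301), (442, 476), (229, 263), (477, 511), (194, 228),
    (337, 371), (302, 336), (372, 406), (512, 546), (407, 441),
]

def summarize_repeats(hits):
    """Sort hits by start and report each hit's length and the gap to the next hit."""
    ordered = sorted(hits)
    lengths = [end - start + 1 for start, end in ordered]
    # A gap of 0 means the next hit starts on the residue right after this one ends.
    gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]
    return ordered, lengths, gaps

ordered, lengths, gaps = summarize_repeats(ppr_hits)
for (start, end), length in zip(ordered, lengths):
    print(f"{start}..{end}\t{length} aa")
print("gaps between consecutive hits:", gaps)
```

On these coordinates the hits tile residues 194 to 581 without gaps or overlaps.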

The following gene(s) are paralogous to this gene:

None