CSPI02G25470 (gene) Wild cucumber (PI 183967)

NameCSPI02G25470
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationChr2 : 21677543 .. 21680250 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCCCAATGAAATTCATCCTTAGCTTCACCATAATTCATTTTGCTTCATCAATTCGAGAATGAACTAAACTAAGGCTGTTCGTTCCCGCGCACATTTGAAATGAGACCCCATTTCCCAGAATTAGCTACTCGATTGAGCAGAGCCATACTTTCAATTTCAAATCAAACAACCCCTGCTGGATCATGGACCCCTTCACTGGAGCAGAATTTGCATCGACTCGGTTTTCGCCAAATGCTGAATCCATCTCTCGTCTCTCAAGTCATCGACCCTCATCTTCTTTCCCATCACTCCCTCGCTCTTGGTTTCTTCAATTGGGCTTCTCAGCAACCTGGCTTCACCCACAATTCCGATTCCTACAATTCCATTCTCAAGTCTCTCTCTCTTTCACGCCATTTTGGGCCCATTCATAGTCTCTTGAAACAGGTAAAAACTCAGAAAATTGGCCTCGATTTATCAGTTTATCGCGCTGTTATTGATTCCTTGATCATTGCCAAGAAGACCCATGATGCTTTTTTGGTTTTTAATGAGGTTACTTCAATTACTCATATTATTGGATCCGAGCTCTGTAATTCGCTTTTGGCTGCTCTCGCTTCTGATGGGTTTTTTGAGCATGCCCAGAAGGTTTTCGATGAAATGTCTCTTAAATCTATTCCTTTTAACACTCTTGGATTTGGTGTGTTTATATGGAGGATTTGTAGAAATACTGATGTAGTTAAAGTTTTGAACATGATAGATGGTGCCAGGACCAATAATTCGGATATCAATGGTTCTGTTATTGCCACATTGATCATTCACGGGCTCTGTGAGGCATCTAGACTTGAAGAAGCTTCAAATATTTTGGACGAGCTTAAGAATAGGGGTTGCAAGCCTGACTTTTTGACGTACTGGATTCTTGGAGAAGCATTTCAGTCAGCAAGGAATGTGGTTGACAGGGAGAAAATCCTGAAGAAGAAGAGAAAGTTGGGGGTAGCTCCTAGGCTTAATGATTATAAGGAGTACTTATTTGTTTTAATAGCTGGGAGACGGATACGTGAAGCTAAAGAGTTAGGTGAAGTTATTGTCAAAGGAAATTTTCCTATGGATGAAGAGGTTTCTAATGTGCTGATAGGGTCAGTGGCTTCCGTTGATCCTTACTCTGCTATTATGTTCTTCAAGTTCATGGTTGAAAAAGGGAGGTTTCCAACTCTCCTGACTTTAAGAAATCTGAGTAGAAATTTGTGCAAGCATGGAAAAACTGACGAACTGTTGGAAGTTTTCCAAGTTCTGTGTATAAATAACTACTTCAATGATTTGGATAGATATCATTTAAGAATTTCATTCTTATGCAAGGCTGGAAAGGTGAAAGAGGCCTACGGTGTTTTGCAGGAGATGAAGAAAAATGGATTTGACCCTGATGTATCTTTTTACAATTCTGTCCTGGAAGCATGTTGTAGAGAAGATTTGCTTCGGCCTGCTAGGAAACTGTGGGATGAGATGTTTGCTGGTGGCTGTTGTGGTAATTTGAAGACGTATAGTATCCTCATTCAAAAGTTTTCAAAATCCAATCAAATCGAGGAAGCTTTGGTGCTTTACAGTCATATGCTTGGAAAAAACGTCGAACCTGACATTGCAATCTACACGTCCCTGCTTCAAGGGCTTTGTCAGGATTCACAACTTGAAGCTGCTTTTGAAGTCTTTAGCAAGTCTGTTGAACAGGATGTAAATCTTGCGGCAACCTTGCTGAGCACCTTTATCCTGTGTCTGTGTAAAGTAGGCACGTTACTTTCCCTGGTTACATATGCCATGTAAATTCTTGTGCGTTTCACGTTTATACATATATGCTTACTGTCTGGAGTCATTTGTTTCTCAGTTAACTAGTCCCTTGGTCACTGCCATAGCTGACCCTTTACAAACATCTCTTTTATTGGCCAGATCACTCTTATTTAGGGTTGTCTCTTAGAGTACCCCCATTTTGGCTTTATTAGCTAGACACACTTCATAAACAGCCAAAGCCATTGTTGACTCTGTAAGATCCTCGCTTTATTTCGCCAAATCACTCTCGTTGGAATTAATCTCTTTTACTGCTCCATTTTCAGCTTTGTTACGCAAACACTTATCTTGAATTAGCATTGTCCACATCTAGCATCTTCTTGTTCTAAGGAGGCTTCTGTAGAACAAAAAATTGGTTTTATATTCAACATTTTAATTTCGTGATGGAGCTTTTATTTAACCTGGATTCCAAATGCAGGTCATTTCCTTGCTGCTTCAAAATTACTCCGTGGTCTAGCAAGCGACATCGCTCATCCAGACTCTCATGTAACTTTACTGAAAGGTTTTGCAGATGCTGGAGAGGTTTCACTCGCTAAGCAGCATGTAGAGTGGGTTCAAGAAACTTCTCCATCAATGTTGTCTGTTATATCCACCGAGTTATTAGCATTTCTTCCTTCCTCTCCAAAAGCAGATCCAATTTTACAGATTCTTCAAACAGTACAAGAACTATCACGTTTCAGCCATTGATGAAGATATGACAAAGATCTTTAGTAGATTCTATCCTATGTGATCAAGTCGGATACTCTTTCTTTACGAAAACAAATTTTATTGAGAAGAACTGAAAGAATACAAGAGCATAAAAAAATCAGTGCCCCAAAAACACCCCTACTAAAAGAGGGAGGACCAACTAAGTAAAATGCTACAAGGAGAAAACCGGATACTGTTTGTTCTATTGCGGC

mRNA sequence

ATGAGACCCCATTTCCCAGAATTAGCTACTCGATTGAGCAGAGCCATACTTTCAATTTCAAATCAAACAACCCCTGCTGGATCATGGACCCCTTCACTGGAGCAGAATTTGCATCGACTCGGTTTTCGCCAAATGCTGAATCCATCTCTCGTCTCTCAAGTCATCGACCCTCATCTTCTTTCCCATCACTCCCTCGCTCTTGGTTTCTTCAATTGGGCTTCTCAGCAACCTGGCTTCACCCACAATTCCGATTCCTACAATTCCATTCTCAAGTCTCTCTCTCTTTCACGCCATTTTGGGCCCATTCATAGTCTCTTGAAACAGGTAAAAACTCAGAAAATTGGCCTCGATTTATCAGTTTATCGCGCTGTTATTGATTCCTTGATCATTGCCAAGAAGACCCATGATGCTTTTTTGGTTTTTAATGAGGTTACTTCAATTACTCATATTATTGGATCCGAGCTCTGTAATTCGCTTTTGGCTGCTCTCGCTTCTGATGGGTTTTTTGAGCATGCCCAGAAGGTTTTCGATGAAATGTCTCTTAAATCTATTCCTTTTAACACTCTTGGATTTGGTGTGTTTATATGGAGGATTTGTAGAAATACTGATGTAGTTAAAGTTTTGAACATGATAGATGGTGCCAGGACCAATAATTCGGATATCAATGGTTCTGTTATTGCCACATTGATCATTCACGGGCTCTGTGAGGCATCTAGACTTGAAGAAGCTTCAAATATTTTGGACGAGCTTAAGAATAGGGGTTGCAAGCCTGACTTTTTGACGTACTGGATTCTTGGAGAAGCATTTCAGTCAGCAAGGAATGTGGTTGACAGGGAGAAAATCCTGAAGAAGAAGAGAAAGTTGGGGGTAGCTCCTAGGCTTAATGATTATAAGGAGTACTTATTTGTTTTAATAGCTGGGAGACGGATACGTGAAGCTAAAGAGTTAGGTGAAGTTATTGTCAAAGGAAATTTTCCTATGGATGAAGAGGTTTCTAATGTGCTGATAGGGTCAGTGGCTTCCGTTGATCCTTACTCTGCTATTATGTTCTTCAAGTTCATGGTTGAAAAAGGGAGGTTTCCAACTCTCCTGACTTTAAGAAATCTGAGTAGAAATTTGTGCAAGCATGGAAAAACTGACGAACTGTTGGAAGTTTTCCAAGTTCTGTGTATAAATAACTACTTCAATGATTTGGATAGATATCATTTAAGAATTTCATTCTTATGCAAGGCTGGAAAGGTGAAAGAGGCCTACGGTGTTTTGCAGGAGATGAAGAAAAATGGATTTGACCCTGATGTATCTTTTTACAATTCTGTCCTGGAAGCATGTTGTAGAGAAGATTTGCTTCGGCCTGCTAGGAAACTGTGGGATGAGATGTTTGCTGGTGGCTGTTGTGGTAATTTGAAGACGTATAGTATCCTCATTCAAAAGTTTTCAAAATCCAATCAAATCGAGGAAGCTTTGGTGCTTTACAGTCATATGCTTGGAAAAAACGTCGAACCTGACATTGCAATCTACACGTCCCTGCTTCAAGGGCTTTGTCAGGATTCACAACTTGAAGCTGCTTTTGAAGTCTTTAGCAAGTCTGTTGAACAGGATGTAAATCTTGCGGCAACCTTGCTGAGCACCTTTATCCTGTGTCTGTGTAAAGTAGGTCATTTCCTTGCTGCTTCAAAATTACTCCGTGGTCTAGCAAGCGACATCGCTCATCCAGACTCTCATGTAACTTTACTGAAAGGTTTTGCAGATGCTGGAGAGGTTTCACTCGCTAAGCAGCATGTAGAGTGGGTTCAAGAAACTTCTCCATCAATGTTGTCTGTTATATCCACCGAGTTATTAGCATTTCTTCCTTCCTCTCCAAAAGCAGATCCAATTTTACAGATTCTTCAAACAGTACAAGAACTATCACGTTTCAGCCATTGA

Coding sequence (CDS)

ATGAGACCCCATTTCCCAGAATTAGCTACTCGATTGAGCAGAGCCATACTTTCAATTTCAAATCAAACAACCCCTGCTGGATCATGGACCCCTTCACTGGAGCAGAATTTGCATCGACTCGGTTTTCGCCAAATGCTGAATCCATCTCTCGTCTCTCAAGTCATCGACCCTCATCTTCTTTCCCATCACTCCCTCGCTCTTGGTTTCTTCAATTGGGCTTCTCAGCAACCTGGCTTCACCCACAATTCCGATTCCTACAATTCCATTCTCAAGTCTCTCTCTCTTTCACGCCATTTTGGGCCCATTCATAGTCTCTTGAAACAGGTAAAAACTCAGAAAATTGGCCTCGATTTATCAGTTTATCGCGCTGTTATTGATTCCTTGATCATTGCCAAGAAGACCCATGATGCTTTTTTGGTTTTTAATGAGGTTACTTCAATTACTCATATTATTGGATCCGAGCTCTGTAATTCGCTTTTGGCTGCTCTCGCTTCTGATGGGTTTTTTGAGCATGCCCAGAAGGTTTTCGATGAAATGTCTCTTAAATCTATTCCTTTTAACACTCTTGGATTTGGTGTGTTTATATGGAGGATTTGTAGAAATACTGATGTAGTTAAAGTTTTGAACATGATAGATGGTGCCAGGACCAATAATTCGGATATCAATGGTTCTGTTATTGCCACATTGATCATTCACGGGCTCTGTGAGGCATCTAGACTTGAAGAAGCTTCAAATATTTTGGACGAGCTTAAGAATAGGGGTTGCAAGCCTGACTTTTTGACGTACTGGATTCTTGGAGAAGCATTTCAGTCAGCAAGGAATGTGGTTGACAGGGAGAAAATCCTGAAGAAGAAGAGAAAGTTGGGGGTAGCTCCTAGGCTTAATGATTATAAGGAGTACTTATTTGTTTTAATAGCTGGGAGACGGATACGTGAAGCTAAAGAGTTAGGTGAAGTTATTGTCAAAGGAAATTTTCCTATGGATGAAGAGGTTTCTAATGTGCTGATAGGGTCAGTGGCTTCCGTTGATCCTTACTCTGCTATTATGTTCTTCAAGTTCATGGTTGAAAAAGGGAGGTTTCCAACTCTCCTGACTTTAAGAAATCTGAGTAGAAATTTGTGCAAGCATGGAAAAACTGACGAACTGTTGGAAGTTTTCCAAGTTCTGTGTATAAATAACTACTTCAATGATTTGGATAGATATCATTTAAGAATTTCATTCTTATGCAAGGCTGGAAAGGTGAAAGAGGCCTACGGTGTTTTGCAGGAGATGAAGAAAAATGGATTTGACCCTGATGTATCTTTTTACAATTCTGTCCTGGAAGCATGTTGTAGAGAAGATTTGCTTCGGCCTGCTAGGAAACTGTGGGATGAGATGTTTGCTGGTGGCTGTTGTGGTAATTTGAAGACGTATAGTATCCTCATTCAAAAGTTTTCAAAATCCAATCAAATCGAGGAAGCTTTGGTGCTTTACAGTCATATGCTTGGAAAAAACGTCGAACCTGACATTGCAATCTACACGTCCCTGCTTCAAGGGCTTTGTCAGGATTCACAACTTGAAGCTGCTTTTGAAGTCTTTAGCAAGTCTGTTGAACAGGATGTAAATCTTGCGGCAACCTTGCTGAGCACCTTTATCCTGTGTCTGTGTAAAGTAGGTCATTTCCTTGCTGCTTCAAAATTACTCCGTGGTCTAGCAAGCGACATCGCTCATCCAGACTCTCATGTAACTTTACTGAAAGGTTTTGCAGATGCTGGAGAGGTTTCACTCGCTAAGCAGCATGTAGAGTGGGTTCAAGAAACTTCTCCATCAATGTTGTCTGTTATATCCACCGAGTTATTAGCATTTCTTCCTTCCTCTCCAAAAGCAGATCCAATTTTACAGATTCTTCAAACAGTACAAGAACTATCACGTTTCAGCCATTGA
BLAST of CSPI02G25470 vs. Swiss-Prot
Match: PP380_ARATH (Pentatricopeptide repeat-containing protein At5g14080 OS=Arabidopsis thaliana GN=At5g14080 PE=2 SV=2)

HSP 1 Score: 643.3 bits (1658), Expect = 2.9e-183
Identity = 317/634 (50.00%), Postives = 450/634 (70.98%), Query Frame = 1

Query: 1   MRPHFPELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60
           MRP   ELA R+ R +L +S  +  A  W+P +EQ+LH LGFR  ++PSLV++VIDP LL
Sbjct: 1   MRPA-TELAVRIGRELLKVSGSSRAARIWSPLIEQSLHGLGFRHSISPSLVARVIDPFLL 60

Query: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120
           +HHSLALGFFNWA+QQPG++H+S SY+SI KSLSLSR F  + +L KQVK+ KI LD SV
Sbjct: 61  NHHSLALGFFNWAAQQPGYSHDSISYHSIFKSLSLSRQFSAMDALFKQVKSNKILLDSSV 120

Query: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180
           YR++ID+L++ +K   AF V  E  S    I  ++CN LLA L SDG +++AQK+F +M 
Sbjct: 121 YRSLIDTLVLGRKAQSAFWVLEEAFSTGQEIHPDVCNRLLAGLTSDGCYDYAQKLFVKMR 180

Query: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240
            K +  NTLGFGV+I   CR+++  ++L ++D  +  N +INGS+IA LI+H LC+ SR 
Sbjct: 181 HKGVSLNTLGFGVYIGWFCRSSETNQLLRLVDEVKKANLNINGSIIALLILHSLCKCSRE 240

Query: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300
            +A  IL+EL+N  CKPDF+ Y ++ EAF    N+ +R+ +LKKKRKLGVAPR +DY+ +
Sbjct: 241 MDAFYILEELRNIDCKPDFMAYRVIAEAFVVTGNLYERQVVLKKKRKLGVAPRSSDYRAF 300

Query: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360
           +  LI+ +R+ EAKE+ EVIV G FPMD ++ + LIGSV++VDP SA+ F  +MV  G+ 
Sbjct: 301 ILDLISAKRLTEAKEVAEVIVSGKFPMDNDILDALIGSVSAVDPDSAVEFLVYMVSTGKL 360

Query: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420
           P + TL  LS+NLC+H K+D L++ +++L    YF++L  Y L ISFLCKAG+V+E+Y  
Sbjct: 361 PAIRTLSKLSKNLCRHDKSDHLIKAYELLSSKGYFSELQSYSLMISFLCKAGRVRESYTA 420

Query: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480
           LQEMKK G  PDVS YN+++EACC+ +++RPA+KLWDEMF  GC  NL TY++LI+K S+
Sbjct: 421 LQEMKKEGLAPDVSLYNALIEACCKAEMIRPAKKLWDEMFVEGCKMNLTTYNVLIRKLSE 480

Query: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQD-VNLAAT 540
             + EE+L L+  ML + +EPD  IY SL++GLC+++++EAA EVF K +E+D   +   
Sbjct: 481 EGEAEESLRLFDKMLERGIEPDETIYMSLIEGLCKETKIEAAMEVFRKCMERDHKTVTRR 540

Query: 541 LLSTFILCLCKVGHFLAASKLLRGLASDIAHPDSHVTLLKGFADAGEVSLAKQHVEWVQE 600
           +LS F+L LC  GH   AS+LLR     + H  +HV LLK  ADA EV +  +H++W++E
Sbjct: 541 VLSEFVLNLCSNGHSGEASQLLRE-REHLEHTGAHVVLLKCVADAKEVEIGIRHMQWIKE 600

Query: 601 TSPSMLSVISTELLAFLPSSPKADPILQILQTVQ 634
            SPS++  IS++LLA   SS   D IL  ++ ++
Sbjct: 601 VSPSLVHTISSDLLASFCSSSDPDSILPFIRAIE 632

BLAST of CSPI02G25470 vs. Swiss-Prot
Match: PPR18_ARATH (Pentatricopeptide repeat-containing protein At1g06710, mitochondrial OS=Arabidopsis thaliana GN=At1g06710 PE=3 SV=1)

HSP 1 Score: 164.1 bits (414), Expect = 5.1e-39
Identity = 140/580 (24.14%), Postives = 258/580 (44.48%), Query Frame = 1

Query: 42  FRQMLNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGP 101
           FR+ L+ SLV +V+   L++  S  + FF WA +Q G+ H +  YN+++  +        
Sbjct: 126 FREKLSESLVIEVL--RLIARPSAVISFFVWAGRQIGYKHTAPVYNALVDLIVRDDDEKV 185

Query: 102 IHSLLKQVKTQKIGLDLSVYRAVIDSLIIAKKTHDAFLVFNE----VTSITHIIGSELCN 161
               L+Q++      D  V+   ++ L+     + +F +  E    +            N
Sbjct: 186 PEEFLQQIRDD----DKEVFGEFLNVLVRKHCRNGSFSIALEELGRLKDFRFRPSRSTYN 245

Query: 162 SLLAALASDGFFEHAQKVFDEMSLKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTN 221
            L+ A       + A  +  EMSL ++  +      F + +C+     + L +++   T 
Sbjct: 246 CLIQAFLKADRLDSASLIHREMSLANLRMDGFTLRCFAYSLCKVGKWREALTLVE---TE 305

Query: 222 NSDINGSVIATLIIHGLCEASRLEEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVD 281
           N  +  +V  T +I GLCEAS  EEA + L+ ++   C P+ +TY  L     + + +  
Sbjct: 306 NF-VPDTVFYTKLISGLCEASLFEEAMDFLNRMRATSCLPNVVTYSTLLCGCLNKKQLGR 365

Query: 282 REKILKKKRKLGVAPRLNDYKEYLFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIG 341
            +++L      G  P    +   +           A +L + +VK        V N+LIG
Sbjct: 366 CKRVLNMMMMEGCYPSPKIFNSLVHAYCTSGDHSYAYKLLKKMVKCGHMPGYVVYNILIG 425

Query: 342 SVAS-VDPYSAIMF------FKFMVEKGRFPTLLTLRNLSRNLCKHGKTDELLEVFQVLC 401
           S+    D  +  +       +  M+  G     + + + +R LC  GK ++   V + + 
Sbjct: 426 SICGDKDSLNCDLLDLAEKAYSEMLAAGVVLNKINVSSFTRCLCSAGKYEKAFSVIREMI 485

Query: 402 INNYFNDLDRYHLRISFLCKAGKVKEAYGVLQEMKKNGFDPDVSFYNSVLEACCREDLLR 461
              +  D   Y   +++LC A K++ A+ + +EMK+ G   DV  Y  ++++ C+  L+ 
Sbjct: 486 GQGFIPDTSTYSKVLNYLCNASKMELAFLLFEEMKRGGLVADVYTYTIMVDSFCKAGLIE 545

Query: 462 PARKLWDEMFAGGCCGNLKTYSILIQKFSKSNQIEEALVLYSHMLGKNVEPDIAIYTSLL 521
            ARK ++EM   GC  N+ TY+ LI  + K+ ++  A  L+  ML +   P+I  Y++L+
Sbjct: 546 QARKWFNEMREVGCTPNVVTYTALIHAYLKAKKVSYANELFETMLSEGCLPNIVTYSALI 605

Query: 522 QGLCQDSQLEAAFEVF-----SKSV--------EQDVNLAATLLSTFILCL---CKVGHF 581
            G C+  Q+E A ++F     SK V        + D N     + T+   L   CK    
Sbjct: 606 DGHCKAGQVEKACQIFERMCGSKDVPDVDMYFKQYDDNSERPNVVTYGALLDGFCKSHRV 665

Query: 582 LAASKLLRGLASDIAHPDSHV--TLLKGFADAGEVSLAKQ 593
             A KLL  ++ +   P+  V   L+ G    G++  A++
Sbjct: 666 EEARKLLDAMSMEGCEPNQIVYDALIDGLCKVGKLDEAQE 695

BLAST of CSPI02G25470 vs. Swiss-Prot
Match: PP442_ARATH (Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidopsis thaliana GN=At5g61990 PE=2 SV=1)

HSP 1 Score: 160.2 bits (404), Expect = 7.4e-38
Identity = 108/445 (24.27%), Postives = 201/445 (45.17%), Query Frame = 1

Query: 85  SYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSVYRAVIDSLIIAKKTHDAFLVFNEV 144
           +Y+ ++  L   +      SLL ++ +  + LD   Y  +ID L+  +    A  + +E+
Sbjct: 279 TYDVLIDGLCKIKRLEDAKSLLVEMDSLGVSLDNHTYSLLIDGLLKGRNADAAKGLVHEM 338

Query: 145 TSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMSLKSIPFNTLGFGVFIWRICRNTDV 204
            S    I   + +  +  ++ +G  E A+ +FD M    +      +   I   CR  +V
Sbjct: 339 VSHGINIKPYMYDCCICVMSKEGVMEKAKALFDGMIASGLIPQAQAYASLIEGYCREKNV 398

Query: 205 VKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRLEEASNILDELKNRGCKPDFLTYWI 264
            +   ++   +  N  I+     T ++ G+C +  L+ A NI+ E+   GC+P+ + Y  
Sbjct: 399 RQGYELLVEMKKRNIVISPYTYGT-VVKGMCSSGDLDGAYNIVKEMIASGCRPNVVIYTT 458

Query: 265 LGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEYLFVLIAGRRIREAKE-LGEVIVKG 324
           L + F       D  ++LK+ ++ G+AP +  Y   +  L   +R+ EA+  L E++  G
Sbjct: 459 LIKTFLQNSRFGDAMRVLKEMKEQGIAPDIFCYNSLIIGLSKAKRMDEARSFLVEMVENG 518

Query: 325 NFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRFPTLLTLRNLSRNLCKHGKTDELL 384
             P        + G + + +  SA  + K M E G  P  +    L    CK GK  E  
Sbjct: 519 LKPNAFTYGAFISGYIEASEFASADKYVKEMRECGVLPNKVLCTGLINEYCKKGKVIEAC 578

Query: 385 EVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGVLQEMKKNGFDPDVSFYNSVLEAC 444
             ++ +       D   Y + ++ L K  KV +A  + +EM+  G  PDV  Y  ++   
Sbjct: 579 SAYRSMVDQGILGDAKTYTVLMNGLFKNDKVDDAEEIFREMRGKGIAPDVFSYGVLINGF 638

Query: 445 CREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSKSNQIEEALVLYSHMLGKNVEPDI 504
            +   ++ A  ++DEM   G   N+  Y++L+  F +S +IE+A  L   M  K + P+ 
Sbjct: 639 SKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEIEKAKELLDEMSVKGLHPNA 698

Query: 505 AIYTSLLQGLCQDSQLEAAFEVFSK 529
             Y +++ G C+   L  AF +F +
Sbjct: 699 VTYCTIIDGYCKSGDLAEAFRLFDE 722

BLAST of CSPI02G25470 vs. Swiss-Prot
Match: PP120_ARATH (Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis thaliana GN=At1g74580 PE=3 SV=1)

HSP 1 Score: 154.1 bits (388), Expect = 5.3e-36
Identity = 124/517 (23.98%), Postives = 220/517 (42.55%), Query Frame = 1

Query: 78  GFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSVYRAVIDSLIIAKKTHDA 137
           G T +  S+   +KS   +        LL  + +Q   +++  Y  V+          + 
Sbjct: 141 GITPDVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEMNVVAYCTVVGGFYEENFKAEG 200

Query: 138 FLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMSLKSIPFNTLGFGVFIWR 197
           + +F ++ +    +     N LL  L   G  +  +K+ D++  + +  N   + +FI  
Sbjct: 201 YELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLLDKVIKRGVLPNLFTYNLFIQG 260

Query: 198 ICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRLEEASNILDELKNRGCKP 257
           +C+  ++   + M+ G           +    +I+GLC+ S+ +EA   L ++ N G +P
Sbjct: 261 LCQRGELDGAVRMV-GCLIEQGPKPDVITYNNLIYGLCKNSKFQEAEVYLGKMVNEGLEP 320

Query: 258 DFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEYLFVLI-AGRRIREAKEL 317
           D  TY  L   +     V   E+I+      G  P    Y+  +  L   G   R     
Sbjct: 321 DSYTYNTLIAGYCKGGMVQLAERIVGDAVFNGFVPDQFTYRSLIDGLCHEGETNRALALF 380

Query: 318 GEVIVKGNFPMDEEVSNVLIGSVASVDPY-SAIMFFKFMVEKGRFPTLLTLRNLSRNLCK 377
            E + KG  P +  + N LI  +++      A      M EKG  P + T   L   LCK
Sbjct: 381 NEALGKGIKP-NVILYNTLIKGLSNQGMILEAAQLANEMSEKGLIPEVQTFNILVNGLCK 440

Query: 378 HGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGVLQEMKKNGFDPDVSF 437
            G   +   + +V+    YF D+  +++ I       K++ A  +L  M  NG DPDV  
Sbjct: 441 MGCVSDADGLVKVMISKGYFPDIFTFNILIHGYSTQLKMENALEILDVMLDNGVDPDVYT 500

Query: 438 YNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSKSNQIEEALVLYSHML 497
           YNS+L   C+        + +  M   GC  NL T++IL++   +  +++EAL L   M 
Sbjct: 501 YNSLLNGLCKTSKFEDVMETYKTMVEKGCAPNLFTFNILLESLCRYRKLDEALGLLEEMK 560

Query: 498 GKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVE-QDVNLAATLLSTFILCLCKVGHF 557
            K+V PD   + +L+ G C++  L+ A+ +F K  E   V+ +    +  I    +  + 
Sbjct: 561 NKSVNPDAVTFGTLIDGFCKNGDLDGAYTLFRKMEEAYKVSSSTPTYNIIIHAFTEKLNV 620

Query: 558 LAASKLLRGLASDIAHPDSHV--TLLKGFADAGEVSL 590
             A KL + +      PD +    ++ GF   G V+L
Sbjct: 621 TMAEKLFQEMVDRCLGPDGYTYRLMVDGFCKTGNVNL 655

BLAST of CSPI02G25470 vs. Swiss-Prot
Match: PP281_ARATH (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana GN=MEE40 PE=2 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 1.5e-35
Identity = 117/523 (22.37%), Postives = 220/523 (42.07%), Query Frame = 1

Query: 67  LGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSVYRAVID 126
           L   +W   + G   ++  YN +L  L        +     ++    I  D+S +  +I 
Sbjct: 138 LSVVDWMIDEFGLKPDTHFYNRMLNLLVDGNSLKLVEISHAKMSVWGIKPDVSTFNVLIK 197

Query: 127 SLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMSLKSIPF 186
           +L  A +   A L+  ++ S   +   +   +++     +G  + A ++ ++M      +
Sbjct: 198 ALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCSW 257

Query: 187 NTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRLEEASNI 246
           + +   V +   C+   V   LN I      +           +++GLC+A  ++ A  I
Sbjct: 258 SNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHAIEI 317

Query: 247 LDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEYLFVLIA 306
           +D +   G  PD  TY  +         V +  ++L +      +P    Y   +  L  
Sbjct: 318 MDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLCK 377

Query: 307 GRRIREAKELGEVIV-KGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRFPTLLT 366
             ++ EA EL  V+  KG  P     ++++ G   + +   A+  F+ M  KG  P   T
Sbjct: 378 ENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFT 437

Query: 367 LRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGVLQEMK 426
              L  +LC  GK DE L + + + ++     +  Y+  I   CKA K +EA  +  EM+
Sbjct: 438 YNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDEME 497

Query: 427 KNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSKSNQIE 486
            +G   +   YN++++  C+   +  A +L D+M   G   +  TY+ L+  F +   I+
Sbjct: 498 VHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDIK 557

Query: 487 EALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATLLSTFI 546
           +A  +   M     EPDI  Y +L+ GLC+  ++E A ++      + +NL     +  I
Sbjct: 558 KAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYNPVI 617

Query: 547 LCLCKVGHFLAASKLLRG-LASDIAHPD--SHVTLLKGFADAG 586
             L +      A  L R  L  + A PD  S+  + +G  + G
Sbjct: 618 QGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGG 660

BLAST of CSPI02G25470 vs. TrEMBL
Match: A0A0A0LMX0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G405040 PE=4 SV=1)

HSP 1 Score: 1269.6 bits (3284), Expect = 0.0e+00
Identity = 637/640 (99.53%), Postives = 640/640 (100.00%), Query Frame = 1

Query: 1   MRPHFPELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60
           MRPHFPELATRLSRAILSISNQT+PAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL
Sbjct: 1   MRPHFPELATRLSRAILSISNQTSPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60

Query: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120
           SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV
Sbjct: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120

Query: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180
           YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS
Sbjct: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180

Query: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240
           LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL
Sbjct: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240

Query: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300
           EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY
Sbjct: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300

Query: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360
           LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF
Sbjct: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360

Query: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420
           PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV
Sbjct: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420

Query: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480
           LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK
Sbjct: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480

Query: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL 540
           SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL
Sbjct: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL 540

Query: 541 LSTFILCLCKVGHFLAASKLLRGLASDIAHPDSHVTLLKGFADAGEVSLAKQHVEWVQET 600
           LSTFILCLCKVGHFLAASKLLRGLASD+AHPDSHVTLLKGFADAGEVSLAKQHVEWVQET
Sbjct: 541 LSTFILCLCKVGHFLAASKLLRGLASDVAHPDSHVTLLKGFADAGEVSLAKQHVEWVQET 600

Query: 601 SPSMLSVISTELLAFLPSSPKADPILQILQTVQELSRFSH 641
           SPSMLSVISTELLAFLPSSPKADPIL+ILQTVQELSRFSH
Sbjct: 601 SPSMLSVISTELLAFLPSSPKADPILEILQTVQELSRFSH 640

BLAST of CSPI02G25470 vs. TrEMBL
Match: M5WSE0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002717mg PE=4 SV=1)

HSP 1 Score: 842.0 bits (2174), Expect = 4.7e-241
Identity = 415/630 (65.87%), Postives = 511/630 (81.11%), Query Frame = 1

Query: 7   ELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLLSHHSLA 66
           ELA+R+SR ++S SN T P  SW PSLE  LH+LG R  L+PSLV++VIDP LL HHSLA
Sbjct: 8   ELASRISRVLISASNHTRPTRSWNPSLENILHQLGCRDSLSPSLVARVIDPFLLPHHSLA 67

Query: 67  LGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSVYRAVID 126
           LGFFNWASQQP F+H S +Y S+LKSLS SR F  I +LLKQVK QKIGLD SVYR+VI 
Sbjct: 68  LGFFNWASQQPSFSHTSITYKSVLKSLSFSRQFNAIDALLKQVKAQKIGLDASVYRSVIA 127

Query: 127 SLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMSLKSIPF 186
           SLII +KTH+AFLVF+EV+S+   IG E+CNSLLAALA DG+FE+AQKVFDEM+LK+IP 
Sbjct: 128 SLIIGRKTHNAFLVFSEVSSLIKDIGHEICNSLLAALACDGYFEYAQKVFDEMTLKAIPL 187

Query: 187 NTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRLEEASNI 246
           +TLGFGVFIWR+C + ++ K L+M+D  R   S+INGSV A LIIHG C+ASR+ EA  +
Sbjct: 188 STLGFGVFIWRLCGHAELGKTLSMLDEVRRGGSEINGSVTALLIIHGFCQASRVSEAFWV 247

Query: 247 LDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEYLFVLIA 306
           LDEL++R CKPDF+ Y I+ EAF+S  +VVD EK+LKKKRKLGVAPR NDY++++F LI+
Sbjct: 248 LDELRSRQCKPDFMAYRIVAEAFRSTGSVVDVEKVLKKKRKLGVAPRTNDYRQFIFDLIS 307

Query: 307 GRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRFPTLLTL 366
            R+I EAKELGEVI+ GNFP+D++V NVLIGSV+++DP SAI+FF+FM+EK RFPTLLTL
Sbjct: 308 ERQICEAKELGEVIISGNFPIDDDVLNVLIGSVSAIDPLSAIVFFRFMIEKQRFPTLLTL 367

Query: 367 RNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGVLQEMKK 426
            NLSRNLCKH  TDELL VFQVL   +YF DL+ Y++ +SFLCKAG VKEAYGVLQEMKK
Sbjct: 368 CNLSRNLCKHSNTDELLVVFQVLASGDYFKDLETYNVMVSFLCKAGMVKEAYGVLQEMKK 427

Query: 427 NGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSKSNQIEE 486
            G  PDVS YNS++E CCREDLLRPA++LWDEMFA GC GNLKTY+ILI+KFS+  Q++E
Sbjct: 428 KGLGPDVSTYNSLIETCCREDLLRPAKRLWDEMFASGCRGNLKTYNILIRKFSEVGQVDE 487

Query: 487 ALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATLLSTFIL 546
           A  L+ HMLGK V PD+  YTSLL+GLCQ+++L+AAF+VF KSVEQD  LA  +L TF  
Sbjct: 488 AQRLFYHMLGKGVAPDVMTYTSLLEGLCQETKLQAAFDVFRKSVEQDFMLAQNVLGTFTR 547

Query: 547 CLCKVGHFLAASKLLRGLASDIAHPDSHVTLLKGFADAGEVSLAKQHVEWVQETSPSMLS 606
            LCK G FL ASKLL GL++D+A  DSHV LLK  ADA E+ +A +HV+WVQ+TSPSML 
Sbjct: 548 SLCKAGFFLDASKLLCGLSNDVAQSDSHVILLKYLADAKEIPVAIEHVKWVQQTSPSMLQ 607

Query: 607 VISTELLAFLPSSPKADPILQILQTVQELS 637
           ++S ELLA L SS + +P  Q++QT+QE+S
Sbjct: 608 IVSAELLASLSSSSRLEPTRQLVQTIQEIS 637

BLAST of CSPI02G25470 vs. TrEMBL
Match: A0A067F918_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006281mg PE=4 SV=1)

HSP 1 Score: 790.8 bits (2041), Expect = 1.2e-225
Identity = 395/635 (62.20%), Postives = 494/635 (77.80%), Query Frame = 1

Query: 1   MRPHFPELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60
           MRP   +LATR+S+AI+S SN+T PA  WTP LEQ LH+LG R  L+PSLV++VI+P+LL
Sbjct: 3   MRPA-TDLATRISQAIISASNRTRPARKWTPLLEQTLHQLGLRDSLSPSLVARVINPYLL 62

Query: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120
           +HHSLALGFFNWASQQP FTH+  SY+SILKSLSLSR    I S+LKQVK  KI LD SV
Sbjct: 63  THHSLALGFFNWASQQPNFTHSPLSYHSILKSLSLSRQINAIDSVLKQVKVNKITLDSSV 122

Query: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180
           YR +I SLI  K T  AF VFNEV      IG E+CNSLLA LASDG+ ++A K+FDEMS
Sbjct: 123 YRFIIPSLIQGKNTQKAFSVFNEVKFNCEDIGPEICNSLLAVLASDGYIDNALKMFDEMS 182

Query: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTN-NSDINGSVIATLIIHGLCEASR 240
            + + F+T+GFGVFIW+ C N  + +VL+M+D  R   NS INGSVIA LIIHG C+  R
Sbjct: 183 HRGVEFSTIGFGVFIWKFCENAKLGQVLSMLDEVRKRENSMINGSVIAVLIIHGFCKGKR 242

Query: 241 LEEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKE 300
           +EEA  +LDEL+ R CKPDF+ Y I+ E F+   +V +RE +LKKKRKLGVAPR NDY+E
Sbjct: 243 VEEAFKVLDELRIRECKPDFIAYRIVAEEFKLMGSVFEREVVLKKKRKLGVAPRTNDYRE 302

Query: 301 YLFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGR 360
           ++  LI  RRI EAKELGEVIV G F +D++V N LIGSV+S+DP SAI+FF FM+EKGR
Sbjct: 303 FILGLIVERRICEAKELGEVIVSGKFTIDDDVLNALIGSVSSIDPRSAIVFFNFMIEKGR 362

Query: 361 FPTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYG 420
            PTL TL NLS+NLCK  K+DEL+EV++VL  N+YF D++ Y++ +SFLC +G+++EAYG
Sbjct: 363 VPTLSTLSNLSKNLCKRNKSDELVEVYKVLSANDYFTDMESYNVMVSFLCTSGRLREAYG 422

Query: 421 VLQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFS 480
           V+QEMK+ G DPDVSFYNS++EACCREDLLRPA+KLWD+MFA GC GNLKTY+ILI KFS
Sbjct: 423 VIQEMKRKGLDPDVSFYNSLMEACCREDLLRPAKKLWDQMFASGCSGNLKTYNILISKFS 482

Query: 481 KSNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAAT 540
           +  +IE AL L+ +ML K V PD   YTSLL+GLCQ++ L+AAFEVF+KSV  DV LA +
Sbjct: 483 EVGEIEGALRLFHNMLEKGVAPDATTYTSLLEGLCQETNLQAAFEVFNKSVNHDVMLARS 542

Query: 541 LLSTFILCLCKVGHFLAASKLLRGLASDIAHPDSHVTLLKGFADAGEVSLAKQHVEWVQE 600
           +LSTF++ LC+ GHFL A+KLLRGL+SD+ H DSHV LLK  ADA EV +A +H++W+QE
Sbjct: 543 ILSTFMISLCRRGHFLVATKLLRGLSSDLGHSDSHVILLKSLADAREVEMAIEHIKWIQE 602

Query: 601 TSPSMLSVISTELLAFLPSSPKADPILQILQTVQE 635
           +SP+ML  IS EL A L SS   +PIL +L  +QE
Sbjct: 603 SSPTMLQEISAELFASLSSSSYPEPILLLLHALQE 636

BLAST of CSPI02G25470 vs. TrEMBL
Match: V4U308_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014547mg PE=4 SV=1)

HSP 1 Score: 790.4 bits (2040), Expect = 1.6e-225
Identity = 395/635 (62.20%), Postives = 494/635 (77.80%), Query Frame = 1

Query: 1   MRPHFPELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60
           MRP   +LATR+S+AI+S SN+T PA  WTP LEQ LH+LG R  L+PSLV++VI+P+LL
Sbjct: 3   MRPA-TDLATRISQAIISASNRTRPARKWTPLLEQTLHQLGLRDSLSPSLVARVINPYLL 62

Query: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120
           +HHSLALGFFNWASQQP FTH+  SY+SILKSLSLSR    I S+LKQVK  KI LD SV
Sbjct: 63  THHSLALGFFNWASQQPNFTHSPLSYHSILKSLSLSRQINAIDSVLKQVKVNKITLDSSV 122

Query: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180
           YR +I SLI  K T  AF VFNEV      IG E+CNSLLA LASDG+ ++A K+FDEMS
Sbjct: 123 YRFIIPSLIQGKNTQKAFSVFNEVKFNCEDIGPEICNSLLAVLASDGYIDNALKMFDEMS 182

Query: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTN-NSDINGSVIATLIIHGLCEASR 240
            + + F+T+GFGVFIW+ C N  + +VL+M+D  R   NS INGSVIA LIIHG C+  R
Sbjct: 183 HRGVEFSTIGFGVFIWKFCENAKLGQVLSMLDEVRKRENSMINGSVIAVLIIHGFCKGKR 242

Query: 241 LEEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKE 300
           +EEA  +LDEL+ R CKPDF+ Y I+ E F+   +V +RE +LKKKRKLGVAPR NDY+E
Sbjct: 243 VEEAFKVLDELRIRECKPDFIAYRIVAEEFKLMGSVFEREVVLKKKRKLGVAPRTNDYRE 302

Query: 301 YLFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGR 360
           ++  LI  RRI EAKELGEVIV G F +D++V N LIGSV+S+DP SAI+FF FM+EKGR
Sbjct: 303 FILGLIVERRICEAKELGEVIVSGKFTIDDDVLNALIGSVSSIDPRSAIVFFNFMIEKGR 362

Query: 361 FPTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYG 420
            PTL TL NLS+NLCK  K+DEL+EV++VL  N+YF D++ Y++ +SFLC +G+++EAYG
Sbjct: 363 VPTLSTLSNLSKNLCKRNKSDELVEVYKVLSANDYFTDMESYNVMVSFLCTSGRLREAYG 422

Query: 421 VLQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFS 480
           V+QEMK+ G DPDVSFYNS++EACCREDLLRPA+KLWD+MFA GC GNLKTY+ILI KFS
Sbjct: 423 VIQEMKRKGLDPDVSFYNSLMEACCREDLLRPAKKLWDQMFASGCSGNLKTYNILISKFS 482

Query: 481 KSNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAAT 540
           +  +IE AL L+ +ML K V PD   YTSLL+GLCQ++ L+AAFEVF+KSV QDV LA +
Sbjct: 483 EVGEIEGALRLFHNMLEKGVAPDATTYTSLLEGLCQETNLQAAFEVFNKSVNQDVMLARS 542

Query: 541 LLSTFILCLCKVGHFLAASKLLRGLASDIAHPDSHVTLLKGFADAGEVSLAKQHVEWVQE 600
           +LSTF++ LC+ GHFL A+KLL GL+SD+ H DSHV LLK  ADA EV +A +H++W+QE
Sbjct: 543 ILSTFMISLCRRGHFLVATKLLHGLSSDLGHSDSHVILLKSLADAREVEMAIEHIKWIQE 602

Query: 601 TSPSMLSVISTELLAFLPSSPKADPILQILQTVQE 635
           +SP+ML  IS EL A L SS   +PIL +L  +QE
Sbjct: 603 SSPTMLQEISEELFASLSSSSYPEPILLLLHALQE 636

BLAST of CSPI02G25470 vs. TrEMBL
Match: A0A061ED42_THECC (Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_017261 PE=4 SV=1)

HSP 1 Score: 788.5 bits (2035), Expect = 6.2e-225
Identity = 385/629 (61.21%), Postives = 493/629 (78.38%), Query Frame = 1

Query: 7   ELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLLSHHSLA 66
           +LA R+ RA++S SN   P  +WT SLEQ LHRLG R  L+PSLV++VID  L +HH LA
Sbjct: 6   DLANRIGRALISASNHAIPTRTWTASLEQTLHRLGCRDSLSPSLVARVIDSFLSTHHCLA 65

Query: 67  LGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSVYRAVID 126
           LGFFNWASQQPG+ H+S SY SILKSLS SR F  + +LLKQVK QK+ LD SVYR +I 
Sbjct: 66  LGFFNWASQQPGYCHDSISYQSILKSLSFSRQFNAVETLLKQVKAQKLSLDSSVYRFIIS 125

Query: 127 SLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMSLKSIPF 186
           SLI  KKT +A  VFNEV S +  +G+ELCNSLLAAL SDG+F H+QKVFDEM  K + F
Sbjct: 126 SLIKGKKTQNAVWVFNEVNSPSAELGAELCNSLLAALVSDGYFAHSQKVFDEMFQKGVVF 185

Query: 187 NTLGFGVFIWRICRNTDVVKVLNMIDGARTNNS-DINGSVIATLIIHGLCEASRLEEASN 246
           NT+GFG+FIW  C+N ++ KVL+++D A+  +S ++NGS+IA L++HGLC +SR  EA  
Sbjct: 186 NTIGFGLFIWSFCKNGELNKVLSLLDEAKKGSSWEVNGSIIAVLVVHGLCFSSRESEALW 245

Query: 247 ILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEYLFVLI 306
           +LDEL++RGCKPDF+ Y I+ EAF+ + +VV+RE +LKKKRKLGVAPR NDY+E++  LI
Sbjct: 246 VLDELRSRGCKPDFIAYRIVAEAFRKSSSVVERELVLKKKRKLGVAPRSNDYREFILGLI 305

Query: 307 AGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRFPTLLT 366
           + RRI EA++LGEVIV GNFP++++V + LIGSV+S+DP SAIMF  FMV KG+ PTL+T
Sbjct: 306 SERRICEARDLGEVIVSGNFPVEDDVLDALIGSVSSIDPGSAIMFLNFMVGKGKLPTLIT 365

Query: 367 LRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGVLQEMK 426
           L NLSRNLCKHGK DELLEV+QVL  ++YF D++ Y++ +SFLC AG+V+EAY VLQEMK
Sbjct: 366 LSNLSRNLCKHGKVDELLEVYQVLSFHDYFLDMESYNVMVSFLCTAGRVREAYEVLQEMK 425

Query: 427 KNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSKSNQIE 486
           K G  P+V FYNS++EACCREDL+RPA++LWDEMFA GC GNL TY+ILI K S+  ++E
Sbjct: 426 KKGLGPNVFFYNSLMEACCREDLVRPAKRLWDEMFASGCAGNLNTYNILIGKLSQIGEVE 485

Query: 487 EALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATLLSTFI 546
           EAL L+ HM  K V PD   YT+LL+GLCQ+S+ E+AFE+F+KSVEQD+ LA ++L TF+
Sbjct: 486 EALCLFQHMAEKGVAPDGTTYTNLLEGLCQESKFESAFEIFNKSVEQDMMLAQSILRTFV 545

Query: 547 LCLCKVGHFLAASKLLRGLASDIAHPDSHVTLLKGFADAGEVSLAKQHVEWVQETSPSML 606
           + LC+ G FL ASKLL GL+SDI H DSHV +LK  ADA E+  A QH++W+QETSPSML
Sbjct: 546 IHLCRKGQFLVASKLLCGLSSDIIHSDSHVVMLKCLADAKEIQFAIQHIQWIQETSPSML 605

Query: 607 SVISTELLAFLPSSPKADPILQILQTVQE 635
             I T+L A L S+ + D I Q+LQ +QE
Sbjct: 606 QTIFTKLAASLSSTSRPDSIEQLLQAIQE 634

BLAST of CSPI02G25470 vs. TAIR10
Match: AT5G14080.1 (AT5G14080.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 643.3 bits (1658), Expect = 1.6e-184
Identity = 317/634 (50.00%), Postives = 450/634 (70.98%), Query Frame = 1

Query: 1   MRPHFPELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60
           MRP   ELA R+ R +L +S  +  A  W+P +EQ+LH LGFR  ++PSLV++VIDP LL
Sbjct: 1   MRPA-TELAVRIGRELLKVSGSSRAARIWSPLIEQSLHGLGFRHSISPSLVARVIDPFLL 60

Query: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120
           +HHSLALGFFNWA+QQPG++H+S SY+SI KSLSLSR F  + +L KQVK+ KI LD SV
Sbjct: 61  NHHSLALGFFNWAAQQPGYSHDSISYHSIFKSLSLSRQFSAMDALFKQVKSNKILLDSSV 120

Query: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180
           YR++ID+L++ +K   AF V  E  S    I  ++CN LLA L SDG +++AQK+F +M 
Sbjct: 121 YRSLIDTLVLGRKAQSAFWVLEEAFSTGQEIHPDVCNRLLAGLTSDGCYDYAQKLFVKMR 180

Query: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240
            K +  NTLGFGV+I   CR+++  ++L ++D  +  N +INGS+IA LI+H LC+ SR 
Sbjct: 181 HKGVSLNTLGFGVYIGWFCRSSETNQLLRLVDEVKKANLNINGSIIALLILHSLCKCSRE 240

Query: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300
            +A  IL+EL+N  CKPDF+ Y ++ EAF    N+ +R+ +LKKKRKLGVAPR +DY+ +
Sbjct: 241 MDAFYILEELRNIDCKPDFMAYRVIAEAFVVTGNLYERQVVLKKKRKLGVAPRSSDYRAF 300

Query: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360
           +  LI+ +R+ EAKE+ EVIV G FPMD ++ + LIGSV++VDP SA+ F  +MV  G+ 
Sbjct: 301 ILDLISAKRLTEAKEVAEVIVSGKFPMDNDILDALIGSVSAVDPDSAVEFLVYMVSTGKL 360

Query: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420
           P + TL  LS+NLC+H K+D L++ +++L    YF++L  Y L ISFLCKAG+V+E+Y  
Sbjct: 361 PAIRTLSKLSKNLCRHDKSDHLIKAYELLSSKGYFSELQSYSLMISFLCKAGRVRESYTA 420

Query: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480
           LQEMKK G  PDVS YN+++EACC+ +++RPA+KLWDEMF  GC  NL TY++LI+K S+
Sbjct: 421 LQEMKKEGLAPDVSLYNALIEACCKAEMIRPAKKLWDEMFVEGCKMNLTTYNVLIRKLSE 480

Query: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQD-VNLAAT 540
             + EE+L L+  ML + +EPD  IY SL++GLC+++++EAA EVF K +E+D   +   
Sbjct: 481 EGEAEESLRLFDKMLERGIEPDETIYMSLIEGLCKETKIEAAMEVFRKCMERDHKTVTRR 540

Query: 541 LLSTFILCLCKVGHFLAASKLLRGLASDIAHPDSHVTLLKGFADAGEVSLAKQHVEWVQE 600
           +LS F+L LC  GH   AS+LLR     + H  +HV LLK  ADA EV +  +H++W++E
Sbjct: 541 VLSEFVLNLCSNGHSGEASQLLRE-REHLEHTGAHVVLLKCVADAKEVEIGIRHMQWIKE 600

Query: 601 TSPSMLSVISTELLAFLPSSPKADPILQILQTVQ 634
            SPS++  IS++LLA   SS   D IL  ++ ++
Sbjct: 601 VSPSLVHTISSDLLASFCSSSDPDSILPFIRAIE 632

BLAST of CSPI02G25470 vs. TAIR10
Match: AT1G06710.1 (AT1G06710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 164.1 bits (414), Expect = 2.9e-40
Identity = 140/580 (24.14%), Postives = 258/580 (44.48%), Query Frame = 1

Query: 42  FRQMLNPSLVSQVIDPHLLSHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGP 101
           FR+ L+ SLV +V+   L++  S  + FF WA +Q G+ H +  YN+++  +        
Sbjct: 126 FREKLSESLVIEVL--RLIARPSAVISFFVWAGRQIGYKHTAPVYNALVDLIVRDDDEKV 185

Query: 102 IHSLLKQVKTQKIGLDLSVYRAVIDSLIIAKKTHDAFLVFNE----VTSITHIIGSELCN 161
               L+Q++      D  V+   ++ L+     + +F +  E    +            N
Sbjct: 186 PEEFLQQIRDD----DKEVFGEFLNVLVRKHCRNGSFSIALEELGRLKDFRFRPSRSTYN 245

Query: 162 SLLAALASDGFFEHAQKVFDEMSLKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTN 221
            L+ A       + A  +  EMSL ++  +      F + +C+     + L +++   T 
Sbjct: 246 CLIQAFLKADRLDSASLIHREMSLANLRMDGFTLRCFAYSLCKVGKWREALTLVE---TE 305

Query: 222 NSDINGSVIATLIIHGLCEASRLEEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVD 281
           N  +  +V  T +I GLCEAS  EEA + L+ ++   C P+ +TY  L     + + +  
Sbjct: 306 NF-VPDTVFYTKLISGLCEASLFEEAMDFLNRMRATSCLPNVVTYSTLLCGCLNKKQLGR 365

Query: 282 REKILKKKRKLGVAPRLNDYKEYLFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIG 341
            +++L      G  P    +   +           A +L + +VK        V N+LIG
Sbjct: 366 CKRVLNMMMMEGCYPSPKIFNSLVHAYCTSGDHSYAYKLLKKMVKCGHMPGYVVYNILIG 425

Query: 342 SVAS-VDPYSAIMF------FKFMVEKGRFPTLLTLRNLSRNLCKHGKTDELLEVFQVLC 401
           S+    D  +  +       +  M+  G     + + + +R LC  GK ++   V + + 
Sbjct: 426 SICGDKDSLNCDLLDLAEKAYSEMLAAGVVLNKINVSSFTRCLCSAGKYEKAFSVIREMI 485

Query: 402 INNYFNDLDRYHLRISFLCKAGKVKEAYGVLQEMKKNGFDPDVSFYNSVLEACCREDLLR 461
              +  D   Y   +++LC A K++ A+ + +EMK+ G   DV  Y  ++++ C+  L+ 
Sbjct: 486 GQGFIPDTSTYSKVLNYLCNASKMELAFLLFEEMKRGGLVADVYTYTIMVDSFCKAGLIE 545

Query: 462 PARKLWDEMFAGGCCGNLKTYSILIQKFSKSNQIEEALVLYSHMLGKNVEPDIAIYTSLL 521
            ARK ++EM   GC  N+ TY+ LI  + K+ ++  A  L+  ML +   P+I  Y++L+
Sbjct: 546 QARKWFNEMREVGCTPNVVTYTALIHAYLKAKKVSYANELFETMLSEGCLPNIVTYSALI 605

Query: 522 QGLCQDSQLEAAFEVF-----SKSV--------EQDVNLAATLLSTFILCL---CKVGHF 581
            G C+  Q+E A ++F     SK V        + D N     + T+   L   CK    
Sbjct: 606 DGHCKAGQVEKACQIFERMCGSKDVPDVDMYFKQYDDNSERPNVVTYGALLDGFCKSHRV 665

Query: 582 LAASKLLRGLASDIAHPDSHV--TLLKGFADAGEVSLAKQ 593
             A KLL  ++ +   P+  V   L+ G    G++  A++
Sbjct: 666 EEARKLLDAMSMEGCEPNQIVYDALIDGLCKVGKLDEAQE 695

BLAST of CSPI02G25470 vs. TAIR10
Match: AT5G61990.1 (AT5G61990.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 160.2 bits (404), Expect = 4.2e-39
Identity = 108/445 (24.27%), Postives = 201/445 (45.17%), Query Frame = 1

Query: 85  SYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSVYRAVIDSLIIAKKTHDAFLVFNEV 144
           +Y+ ++  L   +      SLL ++ +  + LD   Y  +ID L+  +    A  + +E+
Sbjct: 279 TYDVLIDGLCKIKRLEDAKSLLVEMDSLGVSLDNHTYSLLIDGLLKGRNADAAKGLVHEM 338

Query: 145 TSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMSLKSIPFNTLGFGVFIWRICRNTDV 204
            S    I   + +  +  ++ +G  E A+ +FD M    +      +   I   CR  +V
Sbjct: 339 VSHGINIKPYMYDCCICVMSKEGVMEKAKALFDGMIASGLIPQAQAYASLIEGYCREKNV 398

Query: 205 VKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRLEEASNILDELKNRGCKPDFLTYWI 264
            +   ++   +  N  I+     T ++ G+C +  L+ A NI+ E+   GC+P+ + Y  
Sbjct: 399 RQGYELLVEMKKRNIVISPYTYGT-VVKGMCSSGDLDGAYNIVKEMIASGCRPNVVIYTT 458

Query: 265 LGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEYLFVLIAGRRIREAKE-LGEVIVKG 324
           L + F       D  ++LK+ ++ G+AP +  Y   +  L   +R+ EA+  L E++  G
Sbjct: 459 LIKTFLQNSRFGDAMRVLKEMKEQGIAPDIFCYNSLIIGLSKAKRMDEARSFLVEMVENG 518

Query: 325 NFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRFPTLLTLRNLSRNLCKHGKTDELL 384
             P        + G + + +  SA  + K M E G  P  +    L    CK GK  E  
Sbjct: 519 LKPNAFTYGAFISGYIEASEFASADKYVKEMRECGVLPNKVLCTGLINEYCKKGKVIEAC 578

Query: 385 EVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGVLQEMKKNGFDPDVSFYNSVLEAC 444
             ++ +       D   Y + ++ L K  KV +A  + +EM+  G  PDV  Y  ++   
Sbjct: 579 SAYRSMVDQGILGDAKTYTVLMNGLFKNDKVDDAEEIFREMRGKGIAPDVFSYGVLINGF 638

Query: 445 CREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSKSNQIEEALVLYSHMLGKNVEPDI 504
            +   ++ A  ++DEM   G   N+  Y++L+  F +S +IE+A  L   M  K + P+ 
Sbjct: 639 SKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEIEKAKELLDEMSVKGLHPNA 698

Query: 505 AIYTSLLQGLCQDSQLEAAFEVFSK 529
             Y +++ G C+   L  AF +F +
Sbjct: 699 VTYCTIIDGYCKSGDLAEAFRLFDE 722

BLAST of CSPI02G25470 vs. TAIR10
Match: AT1G74580.1 (AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 154.1 bits (388), Expect = 3.0e-37
Identity = 124/517 (23.98%), Postives = 220/517 (42.55%), Query Frame = 1

Query: 78  GFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSVYRAVIDSLIIAKKTHDA 137
           G T +  S+   +KS   +        LL  + +Q   +++  Y  V+          + 
Sbjct: 141 GITPDVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEMNVVAYCTVVGGFYEENFKAEG 200

Query: 138 FLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMSLKSIPFNTLGFGVFIWR 197
           + +F ++ +    +     N LL  L   G  +  +K+ D++  + +  N   + +FI  
Sbjct: 201 YELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLLDKVIKRGVLPNLFTYNLFIQG 260

Query: 198 ICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRLEEASNILDELKNRGCKP 257
           +C+  ++   + M+ G           +    +I+GLC+ S+ +EA   L ++ N G +P
Sbjct: 261 LCQRGELDGAVRMV-GCLIEQGPKPDVITYNNLIYGLCKNSKFQEAEVYLGKMVNEGLEP 320

Query: 258 DFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEYLFVLI-AGRRIREAKEL 317
           D  TY  L   +     V   E+I+      G  P    Y+  +  L   G   R     
Sbjct: 321 DSYTYNTLIAGYCKGGMVQLAERIVGDAVFNGFVPDQFTYRSLIDGLCHEGETNRALALF 380

Query: 318 GEVIVKGNFPMDEEVSNVLIGSVASVDPY-SAIMFFKFMVEKGRFPTLLTLRNLSRNLCK 377
            E + KG  P +  + N LI  +++      A      M EKG  P + T   L   LCK
Sbjct: 381 NEALGKGIKP-NVILYNTLIKGLSNQGMILEAAQLANEMSEKGLIPEVQTFNILVNGLCK 440

Query: 378 HGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGVLQEMKKNGFDPDVSF 437
            G   +   + +V+    YF D+  +++ I       K++ A  +L  M  NG DPDV  
Sbjct: 441 MGCVSDADGLVKVMISKGYFPDIFTFNILIHGYSTQLKMENALEILDVMLDNGVDPDVYT 500

Query: 438 YNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSKSNQIEEALVLYSHML 497
           YNS+L   C+        + +  M   GC  NL T++IL++   +  +++EAL L   M 
Sbjct: 501 YNSLLNGLCKTSKFEDVMETYKTMVEKGCAPNLFTFNILLESLCRYRKLDEALGLLEEMK 560

Query: 498 GKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVE-QDVNLAATLLSTFILCLCKVGHF 557
            K+V PD   + +L+ G C++  L+ A+ +F K  E   V+ +    +  I    +  + 
Sbjct: 561 NKSVNPDAVTFGTLIDGFCKNGDLDGAYTLFRKMEEAYKVSSSTPTYNIIIHAFTEKLNV 620

Query: 558 LAASKLLRGLASDIAHPDSHV--TLLKGFADAGEVSL 590
             A KL + +      PD +    ++ GF   G V+L
Sbjct: 621 TMAEKLFQEMVDRCLGPDGYTYRLMVDGFCKTGNVNL 655

BLAST of CSPI02G25470 vs. TAIR10
Match: AT3G53700.1 (AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 152.5 bits (384), Expect = 8.7e-37
Identity = 117/523 (22.37%), Postives = 220/523 (42.07%), Query Frame = 1

Query: 67  LGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSVYRAVID 126
           L   +W   + G   ++  YN +L  L        +     ++    I  D+S +  +I 
Sbjct: 138 LSVVDWMIDEFGLKPDTHFYNRMLNLLVDGNSLKLVEISHAKMSVWGIKPDVSTFNVLIK 197

Query: 127 SLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMSLKSIPF 186
           +L  A +   A L+  ++ S   +   +   +++     +G  + A ++ ++M      +
Sbjct: 198 ALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCSW 257

Query: 187 NTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRLEEASNI 246
           + +   V +   C+   V   LN I      +           +++GLC+A  ++ A  I
Sbjct: 258 SNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHAIEI 317

Query: 247 LDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEYLFVLIA 306
           +D +   G  PD  TY  +         V +  ++L +      +P    Y   +  L  
Sbjct: 318 MDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLCK 377

Query: 307 GRRIREAKELGEVIV-KGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRFPTLLT 366
             ++ EA EL  V+  KG  P     ++++ G   + +   A+  F+ M  KG  P   T
Sbjct: 378 ENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFT 437

Query: 367 LRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGVLQEMK 426
              L  +LC  GK DE L + + + ++     +  Y+  I   CKA K +EA  +  EM+
Sbjct: 438 YNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDEME 497

Query: 427 KNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSKSNQIE 486
            +G   +   YN++++  C+   +  A +L D+M   G   +  TY+ L+  F +   I+
Sbjct: 498 VHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDIK 557

Query: 487 EALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATLLSTFI 546
           +A  +   M     EPDI  Y +L+ GLC+  ++E A ++      + +NL     +  I
Sbjct: 558 KAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYNPVI 617

Query: 547 LCLCKVGHFLAASKLLRG-LASDIAHPD--SHVTLLKGFADAG 586
             L +      A  L R  L  + A PD  S+  + +G  + G
Sbjct: 618 QGLFRKRKTTEAINLFREMLEQNEAPPDAVSYRIVFRGLCNGG 660

BLAST of CSPI02G25470 vs. NCBI nr
Match: gi|778673190|ref|XP_011649945.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g14080 isoform X1 [Cucumis sativus])

HSP 1 Score: 1269.6 bits (3284), Expect = 0.0e+00
Identity = 637/640 (99.53%), Postives = 640/640 (100.00%), Query Frame = 1

Query: 1   MRPHFPELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60
           MRPHFPELATRLSRAILSISNQT+PAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL
Sbjct: 1   MRPHFPELATRLSRAILSISNQTSPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60

Query: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120
           SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV
Sbjct: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120

Query: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180
           YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS
Sbjct: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180

Query: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240
           LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL
Sbjct: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240

Query: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300
           EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY
Sbjct: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300

Query: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360
           LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF
Sbjct: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360

Query: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420
           PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV
Sbjct: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420

Query: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480
           LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK
Sbjct: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480

Query: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL 540
           SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL
Sbjct: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL 540

Query: 541 LSTFILCLCKVGHFLAASKLLRGLASDIAHPDSHVTLLKGFADAGEVSLAKQHVEWVQET 600
           LSTFILCLCKVGHFLAASKLLRGLASD+AHPDSHVTLLKGFADAGEVSLAKQHVEWVQET
Sbjct: 541 LSTFILCLCKVGHFLAASKLLRGLASDVAHPDSHVTLLKGFADAGEVSLAKQHVEWVQET 600

Query: 601 SPSMLSVISTELLAFLPSSPKADPILQILQTVQELSRFSH 641
           SPSMLSVISTELLAFLPSSPKADPIL+ILQTVQELSRFSH
Sbjct: 601 SPSMLSVISTELLAFLPSSPKADPILEILQTVQELSRFSH 640

BLAST of CSPI02G25470 vs. NCBI nr
Match: gi|659081400|ref|XP_008441315.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g14080 isoform X1 [Cucumis melo])

HSP 1 Score: 1204.1 bits (3114), Expect = 0.0e+00
Identity = 605/640 (94.53%), Postives = 617/640 (96.41%), Query Frame = 1

Query: 1   MRPHFPELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60
           MRPH PELATRLSRAILSISNQT+PAGSWTPSLEQNLHRLGFRQ LNPSLVSQVIDPHLL
Sbjct: 1   MRPHLPELATRLSRAILSISNQTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL 60

Query: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120
           SH+SLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV
Sbjct: 61  SHYSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120

Query: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180
           YR+VIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAAL+SDGF+E A KVFDEMS
Sbjct: 121 YRSVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALSSDGFYEQATKVFDEMS 180

Query: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240
           LK IPFNTLG GVFIW++CRNTDVVKVLNMID  RTNNSD+NGS+IATLIIHGLC ASRL
Sbjct: 181 LKCIPFNTLGLGVFIWKVCRNTDVVKVLNMIDDVRTNNSDVNGSIIATLIIHGLCGASRL 240

Query: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300
            EASNILDELKNRGCKPDFLTYWILGEAFQSA NVVDREKILKKKRKLGVAPRLNDYKEY
Sbjct: 241 VEASNILDELKNRGCKPDFLTYWILGEAFQSAGNVVDREKILKKKRKLGVAPRLNDYKEY 300

Query: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360
           LF LIAG+RIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF
Sbjct: 301 LFALIAGKRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360

Query: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420
           PTLLTLRNLSRNLCKHGKTDELLEVFQVLCI NYFNDLDRYHLRISFLCKAGKVKEAYGV
Sbjct: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCIKNYFNDLDRYHLRISFLCKAGKVKEAYGV 420

Query: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480
           LQEMKKNGF PD SFYNSVLEACCREDLLRPARKLWDEMFA GC GNLKTYSILIQKFSK
Sbjct: 421 LQEMKKNGFAPDASFYNSVLEACCREDLLRPARKLWDEMFASGCSGNLKTYSILIQKFSK 480

Query: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL 540
           SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQ SQLE AFEVFSKSVEQDVNLAATL
Sbjct: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQGSQLETAFEVFSKSVEQDVNLAATL 540

Query: 541 LSTFILCLCKVGHFLAASKLLRGLASDIAHPDSHVTLLKGFADAGEVSLAKQHVEWVQET 600
           LSTFILCLCKVGHF AASKLLRGLAS IAHPDSHVTLLKGFADAGEV LAKQHVEWV ET
Sbjct: 541 LSTFILCLCKVGHFHAASKLLRGLASGIAHPDSHVTLLKGFADAGEVPLAKQHVEWVHET 600

Query: 601 SPSMLSVISTELLAFLPSSPKADPILQILQTVQELSRFSH 641
           SPSMLSVISTELLAFLPSSPKADPILQILQT+QELSRFS+
Sbjct: 601 SPSMLSVISTELLAFLPSSPKADPILQILQTIQELSRFSN 640

BLAST of CSPI02G25470 vs. NCBI nr
Match: gi|778673195|ref|XP_011649946.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g14080 isoform X2 [Cucumis sativus])

HSP 1 Score: 1100.9 bits (2846), Expect = 0.0e+00
Identity = 550/551 (99.82%), Postives = 551/551 (100.00%), Query Frame = 1

Query: 1   MRPHFPELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60
           MRPHFPELATRLSRAILSISNQT+PAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL
Sbjct: 1   MRPHFPELATRLSRAILSISNQTSPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60

Query: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120
           SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV
Sbjct: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120

Query: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180
           YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS
Sbjct: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180

Query: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240
           LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL
Sbjct: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240

Query: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300
           EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY
Sbjct: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300

Query: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360
           LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF
Sbjct: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360

Query: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420
           PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV
Sbjct: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420

Query: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480
           LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK
Sbjct: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480

Query: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL 540
           SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL
Sbjct: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL 540

Query: 541 LSTFILCLCKV 552
           LSTFILCLCKV
Sbjct: 541 LSTFILCLCKV 551

BLAST of CSPI02G25470 vs. NCBI nr
Match: gi|659081404|ref|XP_008441317.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g14080 isoform X2 [Cucumis melo])

HSP 1 Score: 1049.3 bits (2712), Expect = 2.8e-303
Identity = 524/556 (94.24%), Postives = 535/556 (96.22%), Query Frame = 1

Query: 1   MRPHFPELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60
           MRPH PELATRLSRAILSISNQT+PAGSWTPSLEQNLHRLGFRQ LNPSLVSQVIDPHLL
Sbjct: 1   MRPHLPELATRLSRAILSISNQTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL 60

Query: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120
           SH+SLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV
Sbjct: 61  SHYSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120

Query: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180
           YR+VIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAAL+SDGF+E A KVFDEMS
Sbjct: 121 YRSVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALSSDGFYEQATKVFDEMS 180

Query: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240
           LK IPFNTLG GVFIW++CRNTDVVKVLNMID  RTNNSD+NGS+IATLIIHGLC ASRL
Sbjct: 181 LKCIPFNTLGLGVFIWKVCRNTDVVKVLNMIDDVRTNNSDVNGSIIATLIIHGLCGASRL 240

Query: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300
            EASNILDELKNRGCKPDFLTYWILGEAFQSA NVVDREKILKKKRKLGVAPRLNDYKEY
Sbjct: 241 VEASNILDELKNRGCKPDFLTYWILGEAFQSAGNVVDREKILKKKRKLGVAPRLNDYKEY 300

Query: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360
           LF LIAG+RIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF
Sbjct: 301 LFALIAGKRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360

Query: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420
           PTLLTLRNLSRNLCKHGKTDELLEVFQVLCI NYFNDLDRYHLRISFLCKAGKVKEAYGV
Sbjct: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCIKNYFNDLDRYHLRISFLCKAGKVKEAYGV 420

Query: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480
           LQEMKKNGF PD SFYNSVLEACCREDLLRPARKLWDEMFA GC GNLKTYSILIQKFSK
Sbjct: 421 LQEMKKNGFAPDASFYNSVLEACCREDLLRPARKLWDEMFASGCSGNLKTYSILIQKFSK 480

Query: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL 540
           SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQ SQLE AFEVFSKSVEQDVNLAATL
Sbjct: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQGSQLETAFEVFSKSVEQDVNLAATL 540

Query: 541 LSTFILCLCKVGHFLA 557
           LSTFILCLCKVG  L+
Sbjct: 541 LSTFILCLCKVGTLLS 556

BLAST of CSPI02G25470 vs. NCBI nr
Match: gi|659081406|ref|XP_008441318.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g14080 isoform X3 [Cucumis melo])

HSP 1 Score: 1046.2 bits (2704), Expect = 2.4e-302
Identity = 522/551 (94.74%), Postives = 532/551 (96.55%), Query Frame = 1

Query: 1   MRPHFPELATRLSRAILSISNQTTPAGSWTPSLEQNLHRLGFRQMLNPSLVSQVIDPHLL 60
           MRPH PELATRLSRAILSISNQT+PAGSWTPSLEQNLHRLGFRQ LNPSLVSQVIDPHLL
Sbjct: 1   MRPHLPELATRLSRAILSISNQTSPAGSWTPSLEQNLHRLGFRQTLNPSLVSQVIDPHLL 60

Query: 61  SHHSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120
           SH+SLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV
Sbjct: 61  SHYSLALGFFNWASQQPGFTHNSDSYNSILKSLSLSRHFGPIHSLLKQVKTQKIGLDLSV 120

Query: 121 YRAVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALASDGFFEHAQKVFDEMS 180
           YR+VIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAAL+SDGF+E A KVFDEMS
Sbjct: 121 YRSVIDSLIIAKKTHDAFLVFNEVTSITHIIGSELCNSLLAALSSDGFYEQATKVFDEMS 180

Query: 181 LKSIPFNTLGFGVFIWRICRNTDVVKVLNMIDGARTNNSDINGSVIATLIIHGLCEASRL 240
           LK IPFNTLG GVFIW++CRNTDVVKVLNMID  RTNNSD+NGS+IATLIIHGLC ASRL
Sbjct: 181 LKCIPFNTLGLGVFIWKVCRNTDVVKVLNMIDDVRTNNSDVNGSIIATLIIHGLCGASRL 240

Query: 241 EEASNILDELKNRGCKPDFLTYWILGEAFQSARNVVDREKILKKKRKLGVAPRLNDYKEY 300
            EASNILDELKNRGCKPDFLTYWILGEAFQSA NVVDREKILKKKRKLGVAPRLNDYKEY
Sbjct: 241 VEASNILDELKNRGCKPDFLTYWILGEAFQSAGNVVDREKILKKKRKLGVAPRLNDYKEY 300

Query: 301 LFVLIAGRRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360
           LF LIAG+RIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF
Sbjct: 301 LFALIAGKRIREAKELGEVIVKGNFPMDEEVSNVLIGSVASVDPYSAIMFFKFMVEKGRF 360

Query: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCINNYFNDLDRYHLRISFLCKAGKVKEAYGV 420
           PTLLTLRNLSRNLCKHGKTDELLEVFQVLCI NYFNDLDRYHLRISFLCKAGKVKEAYGV
Sbjct: 361 PTLLTLRNLSRNLCKHGKTDELLEVFQVLCIKNYFNDLDRYHLRISFLCKAGKVKEAYGV 420

Query: 421 LQEMKKNGFDPDVSFYNSVLEACCREDLLRPARKLWDEMFAGGCCGNLKTYSILIQKFSK 480
           LQEMKKNGF PD SFYNSVLEACCREDLLRPARKLWDEMFA GC GNLKTYSILIQKFSK
Sbjct: 421 LQEMKKNGFAPDASFYNSVLEACCREDLLRPARKLWDEMFASGCSGNLKTYSILIQKFSK 480

Query: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQDSQLEAAFEVFSKSVEQDVNLAATL 540
           SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQ SQLE AFEVFSKSVEQDVNLAATL
Sbjct: 481 SNQIEEALVLYSHMLGKNVEPDIAIYTSLLQGLCQGSQLETAFEVFSKSVEQDVNLAATL 540

Query: 541 LSTFILCLCKV 552
           LSTFILCLCKV
Sbjct: 541 LSTFILCLCKV 551

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP380_ARATH2.9e-18350.00Pentatricopeptide repeat-containing protein At5g14080 OS=Arabidopsis thaliana GN... [more]
PPR18_ARATH5.1e-3924.14Pentatricopeptide repeat-containing protein At1g06710, mitochondrial OS=Arabidop... [more]
PP442_ARATH7.4e-3824.27Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidop... [more]
PP120_ARATH5.3e-3623.98Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis th... [more]
PP281_ARATH1.5e-3522.37Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0LMX0_CUCSA0.0e+0099.53Uncharacterized protein OS=Cucumis sativus GN=Csa_2G405040 PE=4 SV=1[more]
M5WSE0_PRUPE4.7e-24165.87Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002717mg PE=4 SV=1[more]
A0A067F918_CITSI1.2e-22562.20Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006281mg PE=4 SV=1[more]
V4U308_9ROSI1.6e-22562.20Uncharacterized protein OS=Citrus clementina GN=CICLE_v10014547mg PE=4 SV=1[more]
A0A061ED42_THECC6.2e-22561.21Tetratricopeptide repeat-like superfamily protein OS=Theobroma cacao GN=TCM_0172... [more]
Match NameE-valueIdentityDescription
AT5G14080.11.6e-18450.00 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G06710.12.9e-4024.14 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G61990.14.2e-3924.27 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G74580.13.0e-3723.98 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G53700.18.7e-3722.37 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778673190|ref|XP_011649945.1|0.0e+0099.53PREDICTED: pentatricopeptide repeat-containing protein At5g14080 isoform X1 [Cuc... [more]
gi|659081400|ref|XP_008441315.1|0.0e+0094.53PREDICTED: pentatricopeptide repeat-containing protein At5g14080 isoform X1 [Cuc... [more]
gi|778673195|ref|XP_011649946.1|0.0e+0099.82PREDICTED: pentatricopeptide repeat-containing protein At5g14080 isoform X2 [Cuc... [more]
gi|659081404|ref|XP_008441317.1|2.8e-30394.24PREDICTED: pentatricopeptide repeat-containing protein At5g14080 isoform X2 [Cuc... [more]
gi|659081406|ref|XP_008441318.1|2.4e-30294.74PREDICTED: pentatricopeptide repeat-containing protein At5g14080 isoform X3 [Cuc... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0009507 chloroplast
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI02G25470.1CSPI02G25470.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 405..429
score: 8.4E-5coord: 156..181
score: 0.
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 497..528
score: 4.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 224..265
score: 1.4E-8coord: 431..479
score: 1.7
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 470..503
score: 8.3E-6coord: 228..258
score: 2.8E-6coord: 404..433
score: 9.6E-7coord: 505..534
score: 0.0018coord: 436..467
score: 7.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 82..116
score: 6.5coord: 223..257
score: 10.435coord: 467..501
score: 11.104coord: 397..431
score: 11.082coord: 502..536
score: 9.361coord: 117..151
score: 5.985coord: 362..396
score: 6.051coord: 258..292
score: 6.423coord: 537..571
score: 5.47coord: 432..466
score: 9.986coord: 152..186
score: 8
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 366..602
score: 3.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 1..529
score: 1.4E-239coord: 563..640
score: 1.4E
NoneNo IPR availablePANTHERPTHR24015:SF351SUBFAMILY NOT NAMEDcoord: 1..529
score: 1.4E-239coord: 563..640
score: 1.4E