CSPI01G33460 (gene) Wild cucumber (PI 183967)

Name: CSPI01G33460
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein
Location: Chr1 : 28384031 .. 28390328 (-)
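The location field packs chromosome, start, end and strand into one string. A minimal sketch of reading it and computing the span follows. It assumes the coordinates are 1-based and inclusive, which this record does not state, and `parse_location` and `span_length` are hypothetical helper names, not part of any database tool.

```python
import re

# Assumed layout: "<chromosome> : <start> .. <end> (<strand>)", as in the record above.
LOCATION_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr1 : 28384031 .. 28390328 (-)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 6,298 bp on the minus strand of Chr1.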
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
GTTTTATTTATTTTCTAAATTGTTGTATTTGGTGGAATTTTACCGTAGTTAATGCTAAACCATATCTGGATATCTACAAAACACCATTGTTGCACGCCGAAGCTGATTCCGGCGTCCCGCTTGTTGTCACAACCTTCTCTGTCGCATTACTTTCACTCTAGCATGTCTTTATTTTTGGGTTCTGCTCCCTCTCGCCACTTTTCTCTTTGGTTCTCTTCGTACAGATTACTACCACTCTTTTTAAGTGTTCTCCTCACACCGGTGGGCGCTTGTGTGACTGGCGTGTAGATTAGAGCATGACCCTCAACTCCATTGACTCTAAATCGTATTACCCATCGCTTGTAGAGTTCAAATATGTAAGTTCCAACCTATTTGAACACGAACCATCTACAAATTCATTTTTCCTTTTATGATTTCGAGCTAATTTTACACAGATGACGAAACCCAAGGTGAAAAGCCCACTAGGTATTTTTTTGTGCAAGTAATATCATTTAAATTAATTTTGGTTAAATTACTAAATGATGAAAATTTGTTTCAGGTTGGCAAATGGCTTTTAAGTTGGGTCAAACTAGATAATAAGGTTAGTAATTTTACTATGGGTTCGCCTTAGTCTTTTTTGGGTTTGTTGTAACTGTGTTTGCTGAATTTTATGTTAAACTTGTGAGTGTGAAGTATGTGAAAAGTCATTAGTTAGTGTTTTGAGCTTGAAACTCTTACGAACTTTGTTGTTTGTTATGCATCAACTATTAAGTGAACACATGTTTTGTAGGATGAGCATTATGCTATGTTTATAAATGGAGTGTTTATGAAGATTGATGTTCTGGATGCTTTTCAGTGATTGTATGTGTTTGTTCTTGCATTTTAATTGGTTTGTTGATGTTACGTTTTAATGAGGGTTATAGAAGTGATGATGCTGAAAGGAAGATGGAATCATGGAGAACATGTTTAAGTAGAAGGTTTAAGTTGTGGGTGGATAGGAAAGAAAATAAATTTTAAGGGACAAAAGTTTTTCAAGAGACTAGAATTGAAGTCCCCCGACTCTTGGGTGTGTAAATATATATAGAAAGAATGAAATGAGATCGCACTTATGAAGTTTAAGTTTTGGGTGTGTATATATAAAAATGAGATGAGGCAATTAAGAAAAATACCACTGGAGATGGACTTTGGAACTTCAAAAGTATGAATTTTTGTAGTTGAGGATAAGTCTTGCTTGGAAAAAAAAAACACCTTTTTCTATATGGTTCTGTAGGAAAAAGTTATGAGATAGGTTTCTTATATGTGGTGCCATAATTTGGTTCTTTTGTATGCCAATGCTTGTATTCGTTCATCTTTTTCTCAGTACAAGTTGTTGCTTTTATAAAAAGAAAAAAAATGGAAAAAGTTATTGAAAATTTGAAAAGCAGACCTTTTACATTTGTAAGAGATGTCACACAACACGCCAAGTACAGAGACAATGGTAATTAAAGAGTGAGAGAGATAATAGTGACACACAAGAGTTGGTAATCCAGTTTGGTGATACAAGACTCCCTTTAAGCTTGTGTGACTCCCTCTCACTTGATTGAATCTTTCCTTTTTCTTTTCGTGAACTCCCTTCACATCAAGAATGAACCTTAGGGCCCCCACCTAAGGAAGAGAAATCTTTCTCAATTGATTCGGCTTTAGGCTCCCCCAAAAGCAATGTAGCGTGTCTCGAAATCAATACAGTCTTCAACAAACAACTCACAAGATTTAGTGCCACACTCACAAAAAGTCGTTTGCTTCAAACACAATGCACTGTCAAACTTTCAATGGTAGAACAAGCAAAATAACAGGTTCGACAACCCTATCAACGAATGCTTCTTGAGCTTAAATAACATCACGGAGTAGGGAAAATATCCATTGAATAAATCGGCAAAGCTCTAAAATGAAAACATCCCAGTGGATCAAATCTGCAGAATAATGTAAATATGGAAAGAAATATTATGCAGGATCCGCAACTTTCATAACAATAGCTGAGACAA
GATCATTTTGAAGCATATTCTAGGACAATTACAAAAACAACTCAAGCTAGAAGTTAGGTATCTTAAAACAAATATATTTAGTCAATTAGCAAGGCATTAACAATTATGAGATAGTAAAAGAAGCGAGAATTAAAAATTCAGCCATATCGAGAGATTAAGCAAATTAATGTATCAAGACTCCTCATAAATAAATTTAGCCTGACAAGTTATGGTTTGTGTTTGAAAGTATTTTGAATGGGCAGTTGGTCCATTGACTATGAAACTCAACTCTAGTGTGAGAATTCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCCCCCGTAAAAAGATTGAGTAGGGACAATCAAAATGTTGGAGGAATGCCTCATTAAGCTTGAGGATTTATTTTTCTAAAAACAAAAGCTGGATCTGCAAAAAAAAGTTGAAGGAAGATCTTTTAGCATGAGCTACTAACAATAATTGGCAAGAAAATATATGAACGAAACGCTGAGGTTCTATTAATAAAAAATTTAAACTCGATGGGCTCTTGAGCAAGGTTTGGGAGAATTTGACGTAGAAATCAAATTAAAACAAAGAAACTAATTAAGTCATTAGTTTGGTGGGAAAAAAGAAGTTAGGCAAGCCTGAAGTAAATGAAAAAGTTGTTTTAGTATTGAATAAACAAAGTTGAAGTAGCTCAAGGATTGAACTCCATTAAGGAAAAAGAAAACTCTTGATAGACCCAAGTTTATCATCATCATTGGATACATCAACTTTGTTTGTTAAGAGTAAAATGGACAAAATTTCTTAAAACTCGTTTGTCACAGAGACATGAGGTTACTCTGGGACTAAGGAGCAACAGTGAGGATACTTTAGGCATGTTTTGGTGGTCGATCACTTATGTAGTAAACTTTTCAAGACAATTGTTTTTCTGAAGTAAGTAGCTACTAGGGAAGGCTCGCAGTATTCCTTCCCAGATGGAAGTGGTTTATAGGTTAACCTTGACTAGGTGGTAGAGATTTGTAGTCACCTAAGCTTATAGGCCACGGTAGGAGGTTTGTTCTTAATACTCGTAGTTGAACCTTTGGCTCAGCTGATTCACAATGACACCATTTTTCCCAGCAAAAGATACGAGAAGACCAGAGAGAACATATGTTTGCGTTGCCTCAGTGTTAACTTTTTCTAATGGCACCAGAAACTTAAAGTCCTTCAGTAATGGAATGGACTTGATGGAGGTTAGTGCTCATGCGAGATATTTACTAAATAAAATAGGCTTATTTACCCCACTTGTCATTTGCTCAAAATTGTTTCTCAATTTGAAGTCCTGTAATTGGTTAAGTGCTTGACAAGCTGGAGGGCTTATTGGCTTTATTATTTGTTTATTACTACATAAACATAGGTTGAATTGTTATGAACCAATTGCCCTAACATGATCATTGAAATGCTTTACGGTGCAGAATTGTTATGAACCAATTGAAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCACCCTGTTTGACCAAATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCACCGATGCTGGTTGCTTTACTGTCAAATGTGTTCCCAAGGTTGCTCTCCGAATCAGTATTCATTCACCTTTCTCTTTCCCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAGTCAGGATTTGCTTCTGATATGTTTGCTATGACGGCATTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCACGCCAACTGTTTGATGAAATGCCTGTTCGAGATA
TACCCACCTGGAATTCGTTGATTGCGGGTTATGCAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAGCTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGTGTTCTTCCTGCCTGTTCTCAGCTTGGGGCATTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGCATATGTGAGCAATGCGGTACTGGAATTGCATGCTAGGTGTGGGAACATCGAGGAAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGCATTGATGCTCTTCAGCTTTATGATCAAATGTTGGTGAGTTTTCTATATCCTTCTTCATTTGTTTTTGTTCTGAATTTTAGTTTCTAAATCACTAATGAACTTCAGAAGTATAAGTGAGTACATAGTTAATCTTTTATTTATGTTCACATGTTTTGTATGTTTGGTAGCTGGTATAGGTTAGTGATAATAGAACACCTACTTTGGATGATTCTTATTCAATGGAAACAACAAAAGGAATTACAAGGAAATGGTAGAGCACCCAGTCTTGAACAAGAACAAAGGCAACTAACAAAGAACGGAAAGTAAACCCTACCCTTCCCCAGGCCTTTTCTATCTCTCAAGGAATAAACAAATCCCTCACACAATTTTCATACCCTCACTCTCAGACTCCCTCACCATTTAAACCTTCTCCTTTCCCTAACTAACTGTGGGACCCGGCCTTAGCAGCTATAGTGCCCATTACACATGCACTTCCTTTGTTCCCTCTCCTACTGTATTACCATATAATAGGTGCCCTAACATTACGCCTTCTCTAAAATCACATTCTCCTTAAGGTGAAATCTGGAAATTGCTGTTGAAAATCATAACAAATCTCCTAGGCAGCTTTATAAGGAGGTAGCACTTGCCAACAAATCAAGACCTCCCACACTCCCGTTGTAGGGTGCCTTTGATATCCATAAACTTCCTCTAGCTTGGCTACTCATTCATAATTTTTCGACAAGTGACTAGTTGCTGCACCTCAGAATGCTCACTAACACTCCCTCAACTGTGAGACATGAGACACCGGATGAATAGAGGCCGATGGAGGTAATTCCAGTTTTTATGCCACTGGTCCAATTCTTTCAAGCAAAAATTTTGGAGATAATTTTTCACTACTCCGTTTTCGCAATGACGAGTGCCCGTTTGAAGTTGGTCGCCTAGCTGTTTTAAGAGGAGCAAGTGCCTCTCACCTAAGGCGAGCCCTCGGTTGCACCTCGAAAACACTGGCTGCAGTATTATTTTCTCTTCAATTAAAACTGATTTCCATTTGTTGAGTCTTCTACATATTTCAAGCCTACAAAGGAGAGGGAAATGTTAAAACGATACAACATTAAATTTTCCTTCACCTTAAACTATAGCTAAATCTACAAGGTTGAAGTTGTTGTGTTATTGTATGTTTAAGCATTTATATTAATAGTCTTATTACATCTTTTGCATTTATACTGACGGTGGTGTTATATCTTTCTCACAACAGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGAGGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGGAGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGCAATGTTGAATTGGGTGAAGTAGCAGCTGAGTCCCTCTTCAAGCT
TGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGATTAAGGAAGATGATGAAAGGAGGACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAAGAGTGGTGAAATATATGCTTTACTTCATAAAATTTATGACATTATTAAACTTCATAAGCATGTACATCAGGATCAAAACGAAGATGAAGAACTACTCTATTCTTCGTAA

mRNA sequence

ATGAACCAATTGAAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCACCCTGTTTGACCAAATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCACCGATGCTGGTTGCTTTACTGTCAAATGTGTTCCCAAGGTTGCTCTCCGAATCAGTATTCATTCACCTTTCTCTTTCCCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAGTCAGGATTTGCTTCTGATATGTTTGCTATGACGGCATTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCACGCCAACTGTTTGATGAAATGCCTGTTCGAGATATACCCACCTGGAATTCGTTGATTGCGGGTTATGCAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAGCTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGTGTTCTTCCTGCCTGTTCTCAGCTTGGGGCATTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGCATATGTGAGCAATGCGGTACTGGAATTGCATGCTAGGTGTGGGAACATCGAGGAAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGCATTGATGCTCTTCAGCTTTATGATCAAATGTTGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGAGGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGGAGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGCAATGTTGAATTGGGTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGATTAAGGAAGATGATGAAAGGAGGACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAAGAGTGGTGAAATATATGCTTTACTTCATAAAATTTATGACATTATTAAACTTCATAAGCATGTACATCAGGATCAAAACGAAGATGAAGAACTACTCTATTCTTCGTAA

Coding sequence (CDS)

ATGAACCAATTGAAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCACCCTGTTTGACCAAATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCACCGATGCTGGTTGCTTTACTGTCAAATGTGTTCCCAAGGTTGCTCTCCGAATCAGTATTCATTCACCTTTCTCTTTCCCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAGTCAGGATTTGCTTCTGATATGTTTGCTATGACGGCATTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCACGCCAACTGTTTGATGAAATGCCTGTTCGAGATATACCCACCTGGAATTCGTTGATTGCGGGTTATGCAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAGCTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGTGTTCTTCCTGCCTGTTCTCAGCTTGGGGCATTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGCATATGTGAGCAATGCGGTACTGGAATTGCATGCTAGGTGTGGGAACATCGAGGAAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGCATTGATGCTCTTCAGCTTTATGATCAAATGTTGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGAGGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGGAGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGCAATGTTGAATTGGGTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGATTAAGGAAGATGATGAAAGGAGGACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAAGAGTGGTGAAATATATGCTTTACTTCATAAAATTTATGACATTATTAAACTTCATAAGCATGTACATCAGGATCAAAACGAAGATGAAGAACTACTCTATTCTTCGTAA
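
The link between the coding sequence and the protein query used in the BLAST reports can be checked by translating the first codons. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1). `translate` is a hypothetical helper written for illustration, not the pipeline that produced this record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence; stop at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGAACCAATTGAAGCAAATTCATGCTTAT"))  # MNQLKQIHAY
```

The result matches the first ten residues of the query sequence shown in the alignments (MNQLKQIHAY).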
BLAST of CSPI01G33460 vs. Swiss-Prot
Match: PP371_ARATH (Pentatricopeptide repeat-containing protein At5g08510 OS=Arabidopsis thaliana GN=PCMP-E20 PE=2 SV=1)

HSP 1 Score: 590.1 bits (1520), Expect = 2.3e-167
Identity = 277/509 (54.42%), Positives = 375/509 (73.67%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MN +KQ+HA+ LR G+D TK L+++LL +P+L YA  LFD       +LYNK IQ +   
Sbjct: 1   MNGIKQLHAHCLRTGVDETKDLLQRLLLIPNLVYARKLFDHHQNSCTFLYNKLIQAYYVH 60

Query: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120
             PH   +LY  +   G  P+ ++F F+F A AS  +  P ++LHS F +SGF SD F  
Sbjct: 61  HQPHESIVLYNLLSFDGLRPSHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCC 120

Query: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180
           T L+  YAKLG L  AR++FDEM  RD+P WN++I GY R G M+AA+ELF+ MP +NV 
Sbjct: 121 TTLITAYAKLGALCCARRVFDEMSKRDVPVWNAMITGYQRRGDMKAAMELFDSMPRKNVT 180

Query: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240
           SWT +ISG++QNG Y++AL+MF+ +E +K  KPN +++ SVLPAC+ LG L+IG+R+E Y
Sbjct: 181 SWTTVISGFSQNGNYSEALKMFLCMEKDKSVKPNHITVVSVLPACANLGELEIGRRLEGY 240

Query: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300
           AR NGFF N YV NA +E++++CG I+ A+++F+E+G++RNLCSWN+MI  LA HG+  +
Sbjct: 241 ARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDE 300

Query: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360
           AL L+ QML    +PD VTFVGLLLAC HGGMV +G++LF+SME   +++PKLEHYGC++
Sbjct: 301 ALTLFAQMLREGEKPDAVTFVGLLLACVHGGMVVKGQELFKSMEEVHKISPKLEHYGCMI 360

Query: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420
           DLLGR G+LQEAY+LI+ MPM PD+V+WGTLLGACSFHGNVE+ E+A+E+LFKLEP NPG
Sbjct: 361 DLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFHGNVEIAEIASEALFKLEPTNPG 420

Query: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSG 480
           N VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S 
Sbjct: 421 NCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSYFVEVGVDVHKFTVEDKSHPRSY 480

Query: 481 EIYALLHKIYDIIKLHKHVHQDQNEDEEL 509
           EIY +L +I+  +KL K       + E+L
Sbjct: 481 EIYQVLEEIFRRMKLEKSRFDSLLQPEQL 509
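
The percentages in each HSP statistics line are the match counts divided by the alignment length. A minimal sketch that recomputes them from such a line follows. `hsp_percentages` is a hypothetical helper, and the regular expression is an assumption fitted to the layout of the lines in this record, not a general BLAST parser.

```python
import re

# Fitted to lines such as "Identity = 277/509 (54.42%), Positives = 375/509 (73.67%)".
# "Posi?tives" also tolerates a spelling variant of the label that drops the "i".
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


print(hsp_percentages(
    "Identity = 277/509 (54.42%), Positives = 375/509 (73.67%), Query Frame = 1"
))
```

For the PP371_ARATH hit above, 277/509 and 375/509 recompute to the reported 54.42% and 73.67%.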

BLAST of CSPI01G33460 vs. Swiss-Prot
Match: PP165_ARATH (Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana GN=PCMP-E78 PE=2 SV=1)

HSP 1 Score: 407.5 bits (1046), Expect = 2.1e-112
Identity = 206/507 (40.63%), Positives = 321/507 (63.31%), Query Frame = 1

Query: 2   NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQIPKPSVYLYNKFIQTF 61
           N+ K+I+A  + +GL  + F++ K++    ++ D+ YA  LF+Q+  P+V+LYN  I+ +
Sbjct: 24  NEWKKINASIIIHGLSQSSFMVTKMVDFCDKIEDMDYATRLFNQVSNPNVFLYNSIIRAY 83

Query: 62  SSIGHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASD 121
           +          +Y Q+  +    P++++F F+F +CASL + Y G+ +H H CK G    
Sbjct: 84  THNSLYCDVIRIYKQLLRKSFELPDRFTFPFMFKSCASLGSCYLGKQVHGHLCKFGPRFH 143

Query: 122 MFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPV 181
           +    AL+DMY K   L  A ++FDEM  RD+ +WNSL++GYAR G M+ A  LF+ M  
Sbjct: 144 VVTENALIDMYMKFDDLVDAHKVFDEMYERDVISWNSLLSGYARLGQMKKAKGLFHLMLD 203

Query: 182 RNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKR 241
           + ++SWTA+ISGY   G Y +A++ F  ++   G +P+E+S+ SVLP+C+QLG+L++GK 
Sbjct: 204 KTIVSWTAMISGYTGIGCYVEAMDFFREMQLA-GIEPDEISLISVLPSCAQLGSLELGKW 263

Query: 242 IEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHG 301
           I  YA   GF K   V NA++E++++CG I +A Q+F ++  K ++ SW+TMI G A HG
Sbjct: 264 IHLYAERRGFLKQTGVCNALIEMYSKCGVISQAIQLFGQMEGK-DVISWSTMISGYAYHG 323

Query: 302 RCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHY 361
               A++ +++M   K++P+ +TF+GLL AC+H GM  EG + F+ M   +Q+ PK+EHY
Sbjct: 324 NAHGAIETFNEMQRAKVKPNGITFLGLLSACSHVGMWQEGLRYFDMMRQDYQIEPKIEHY 383

Query: 362 GCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEP 421
           GCL+D+L RAG+L+ A  + + MPM PDS IWG+LL +C   GN+++  VA + L +LEP
Sbjct: 384 GCLIDVLARAGKLERAVEITKTMPMKPDSKIWGSLLSSCRTPGNLDVALVAMDHLVELEP 443

Query: 422 WNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHL 481
            + GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S  
Sbjct: 444 EDMGNYVLLANIYADLGKWEDVSRLRKMIRNENMKKTPGGSLIEVNNIVQEFVSGDNSKP 503

Query: 482 KSGEIYALLHKIYDIIKLHKHVHQDQN 504
              EI  +L             HQDQ+
Sbjct: 504 FWTEISIVLQLFTS--------HQDQD 520

BLAST of CSPI01G33460 vs. Swiss-Prot
Match: PP354_ARATH (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana GN=ELI1 PE=3 SV=1)

HSP 1 Score: 395.6 bits (1015), Expect = 8.3e-109
Identity = 205/506 (40.51%), Positives = 318/506 (62.85%), Query Frame = 1

Query: 1   MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQIPKPSVYLYNKF 60
           ++++ QIHA  LR N L H ++ +  L           + ++  LF Q   P ++L+   
Sbjct: 42  VDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSLALFHQTIDPDLFLFTAA 101

Query: 61  IQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGF 120
           I T S  G   + +LLY Q+ S   +PN+++F+ L  +C++      G+++H+H  K G 
Sbjct: 102 INTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSCSTK----SGKLIHTHVLKFGL 161

Query: 121 ASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNK 180
             D +  T L+D+YAK G + SA+++FD MP R + +  ++I  YA+ G++EAA  LF+ 
Sbjct: 162 GIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITCYAKQGNVEAARALFDS 221

Query: 181 MPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDI 240
           M  R+++SW  +I GYAQ+G    AL +F  L  E   KP+E+++ + L ACSQ+GAL+ 
Sbjct: 222 MCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEITVVAALSACSQIGALET 281

Query: 241 GKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLA 300
           G+ I  + +++    N  V   +++++++CG++EEA  VF++   ++++ +WN MI G A
Sbjct: 282 GRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDT-PRKDIVAWNAMIAGYA 341

Query: 301 VHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPK 360
           +HG   DAL+L+++M  I  ++P D+TF+G L AC H G+V EG ++FESM  ++ + PK
Sbjct: 342 MHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIRIFESMGQEYGIKPK 401

Query: 361 LEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLF 420
           +EHYGCLV LLGRAG+L+ AY  I+NM M  DSV+W ++LG+C  HG+  LG+  AE L 
Sbjct: 402 IEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKLHGDFVLGKEIAEYLI 461

Query: 421 KLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVED 480
            L   N G YV+LSNIYA  GD+ GVA++R +MK   I K  G S IE+ + +HEF   D
Sbjct: 462 GLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGISTIEIENKVHEFRAGD 521

Query: 481 RSHLKSGEIYALLHKIYDIIKLHKHV 499
           R H KS EIY +L KI + IK H +V
Sbjct: 522 REHSKSKEIYTMLRKISERIKSHGYV 542

BLAST of CSPI01G33460 vs. Swiss-Prot
Match: PPR70_ARATH (Pentatricopeptide repeat-containing protein At1g33350 OS=Arabidopsis thaliana GN=PCMP-E57 PE=2 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 3.5e-107
Identity = 193/500 (38.60%), Positives = 310/500 (62.00%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQ-----LPDLPYACTLFDQIPKPSVYLYNKFIQ 60
           +N LKQ+ ++ + +GL H+ FL  KLL+     L +L YA  +FD+   P+ +LY   + 
Sbjct: 37  LNHLKQVQSFMIVSGLSHSHFLCFKLLRFCTLRLCNLSYARFIFDRFSFPNTHLYAAVLT 96

Query: 61  TFSSIG--HPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLFNVYPGQMLHSHFCKSG 120
            +SS    H    +  +  M ++    PN + +  +  +   L + +   ++H+H  KSG
Sbjct: 97  AYSSSLPLHASSAFSFFRLMVNRSVPRPNHFIYPLVLKSTPYLSSAFSTPLVHTHLFKSG 156

Query: 121 FASDMFAMTALLDMYAK-LGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELF 180
           F   +   TALL  YA  +  +  ARQLFDEM  R++ +W ++++GYARSG +  A+ LF
Sbjct: 157 FHLYVVVQTALLHSYASSVSHITLARQLFDEMSERNVVSWTAMLSGYARSGDISNAVALF 216

Query: 181 NKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGAL 240
             MP R+V SW A+++   QNG + +A+ +F  + NE   +PNEV++  VL AC+Q G L
Sbjct: 217 EDMPERDVPSWNAILAACTQNGLFLEAVSLFRRMINEPSIRPNEVTVVCVLSACAQTGTL 276

Query: 241 DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 300
            + K I A+A       + +VSN++++L+ +CGN+EEA  VF ++ SK++L +WN+MI  
Sbjct: 277 QLAKGIHAFAYRRDLSSDVFVSNSLVDLYGKCGNLEEASSVF-KMASKKSLTAWNSMINC 336

Query: 301 LAVHGRCIDALQLYDQML---IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQ 360
            A+HGR  +A+ ++++M+   I  ++PD +TF+GLL ACTHGG+V++GR  F+ M ++F 
Sbjct: 337 FALHGRSEEAIAVFEEMMKLNINDIKPDHITFIGLLNACTHGGLVSKGRGYFDLMTNRFG 396

Query: 361 VAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAA 420
           + P++EHYGCL+DLLGRAG   EA  ++  M M  D  IWG+LL AC  HG+++L EVA 
Sbjct: 397 IEPRIEHYGCLIDLLGRAGRFDEALEVMSTMKMKADEAIWGSLLNACKIHGHLDLAEVAV 456

Query: 421 ESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEF 480
           ++L  L P N G   +++N+Y   G+W    R RKM+K  +  K  G+S IE+ + +H+F
Sbjct: 457 KNLVALNPNNGGYVAMMANLYGEMGNWEEARRARKMIKHQNAYKPPGWSRIEIDNEVHQF 516

Query: 481 IVEDRSHLKSGEIYALLHKI 489
              D+SH ++ EIY +L  +
Sbjct: 517 YSLDKSHPETEEIYMILDSL 535

BLAST of CSPI01G33460 vs. Swiss-Prot
Match: PP433_ARATH (Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana GN=PCMP-E13 PE=2 SV=1)

HSP 1 Score: 382.5 bits (981), Expect = 7.3e-105
Identity = 192/497 (38.63%), Positives = 314/497 (63.18%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPD----LPYACTLFDQIPKPSVYLYNKFIQT 60
           +  LKQ H Y +  GL+     + K ++       L YA ++F   P P+ YL+N  I+ 
Sbjct: 28  LKTLKQSHCYMIITGLNRDNLNVAKFIEACSNAGHLRYAYSVFTHQPCPNTYLHNTMIRA 87

Query: 61  FSSIGHPHRCWL---LYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGF 120
            S +  P+   +   +Y ++ +    P+ ++F F+      + +V+ G+ +H      GF
Sbjct: 88  LSLLDEPNAHSIAITVYRKLWALCAKPDTFTFPFVLKIAVRVSDVWFGRQIHGQVVVFGF 147

Query: 121 ASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNK 180
            S +  +T L+ MY   G L  AR++FDEM V+D+  WN+L+AGY + G M+ A  L   
Sbjct: 148 DSSVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVWNALLAGYGKVGEMDEARSLLEM 207

Query: 181 MP--VRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGAL 240
           MP  VRN +SWT +ISGYA++G+ ++A+E+F  +  E   +P+EV++ +VL AC+ LG+L
Sbjct: 208 MPCWVRNEVSWTCVISGYAKSGRASEAIEVFQRMLMEN-VEPDEVTLLAVLSACADLGSL 267

Query: 241 DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 300
           ++G+RI +Y  + G  +   ++NAV++++A+ GNI +A  VF+ + ++RN+ +W T+I G
Sbjct: 268 ELGERICSYVDHRGMNRAVSLNNAVIDMYAKSGNITKALDVFECV-NERNVVTWTTIIAG 327

Query: 301 LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP 360
           LA HG   +AL ++++M+   +RP+DVTF+ +L AC+H G V  G++LF SM SK+ + P
Sbjct: 328 LATHGHGAEALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKRLFNSMRSKYGIHP 387

Query: 361 KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL 420
            +EHYGC++DLLGRAG+L+EA  +I++MP   ++ IWG+LL A + H ++ELGE A   L
Sbjct: 388 NIEHYGCMIDLLGRAGKLREADEVIKSMPFKANAAIWGSLLAASNVHHDLELGERALSEL 447

Query: 421 FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE 480
            KLEP N GNY++L+N+Y+  G W     +R MMKG  + K AG S IEV + +++FI  
Sbjct: 448 IKLEPNNSGNYMLLANLYSNLGRWDESRMMRNMMKGIGVKKMAGESSIEVENRVYKFISG 507

Query: 481 DRSHLKSGEIYALLHKI 489
           D +H +   I+ +L ++
Sbjct: 508 DLTHPQVERIHEILQEM 522

BLAST of CSPI01G33460 vs. TrEMBL
Match: A0A0A0LY28_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G690140 PE=4 SV=1)

HSP 1 Score: 1057.0 bits (2732), Expect = 7.4e-306
Identity = 510/512 (99.61%), Positives = 510/512 (99.61%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60

Query: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120
           GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM
Sbjct: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120

Query: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180
           TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI
Sbjct: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180

Query: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240
           SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY
Sbjct: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240

Query: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300
           ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
Sbjct: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300

Query: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360
           ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV
Sbjct: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360

Query: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420
           DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG
Sbjct: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420

Query: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 480
           NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE
Sbjct: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 480

Query: 481 IYALLHKIYDIIKLHKHVHQDQNEDEELLYSS 513
           IYALLHKIYDIIKLHKHVH D NEDEELLYSS
Sbjct: 481 IYALLHKIYDIIKLHKHVHHDPNEDEELLYSS 512

BLAST of CSPI01G33460 vs. TrEMBL
Match: F6I6G7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g00030 PE=4 SV=1)

HSP 1 Score: 728.4 bits (1879), Expect = 6.0e-207
Identity = 347/506 (68.58%), Positives = 417/506 (82.41%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MN+LKQI AY+LRNG++HTK LI  LLQ+P +PYA  LFD IPKP+V+LYNK IQ +SS 
Sbjct: 1   MNRLKQIQAYTLRNGIEHTKQLIVSLLQIPSIPYAHKLFDFIPKPTVFLYNKLIQAYSSH 60

Query: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120
           G  H+C+ LY QMC QGCSPN++SFTFLF ACASL +   G+MLH+HF KSGF  D+FA+
Sbjct: 61  GPHHQCFSLYTQMCLQGCSPNEHSFTFLFSACASLSSHQQGRMLHTHFVKSGFGCDVFAL 120

Query: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180
           TAL+DMYAKLG+L  AR+ FDEM VRD+PTWNS+IAGYAR G +E ALELF  MP RNV 
Sbjct: 121 TALVDMYAKLGLLSLARKQFDEMTVRDVPTWNSMIAGYARCGDLEGALELFRLMPARNVT 180

Query: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240
           SWTA+ISGYAQNG+YAKAL MF+ +E E   +PNEV++ASVLPAC+ LGAL++G+RIE Y
Sbjct: 181 SWTAMISGYAQNGQYAKALSMFLMMEEETEMRPNEVTLASVLPACANLGALEVGERIEVY 240

Query: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300
           AR NG+FKN YVSNA+LE++ARCG I++A  VF+EI  +RNLCSWN+MIMGLAVHGRC +
Sbjct: 241 ARGNGYFKNLYVSNALLEMYARCGRIDKAWGVFEEIDGRRNLCSWNSMIMGLAVHGRCDE 300

Query: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360
           A++L+ +ML     PDDVTFVG+LLACTHGGMV EG+  FESME  F +APKLEHYGC+V
Sbjct: 301 AIELFYKMLREGAAPDDVTFVGVLLACTHGGMVVEGQHFFESMERDFSIAPKLEHYGCMV 360

Query: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420
           DLLGRAGEL+EA++LI  MPM PDSV+WGTLLGACSFHG+VEL E AA +LF+LEP NPG
Sbjct: 361 DLLGRAGELREAHDLILRMPMEPDSVVWGTLLGACSFHGHVELAEKAAGALFELEPSNPG 420

Query: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 480
           NYVILSNIYA AG W GVARLRK+MKGG ITK AGYS+IE G  IH+FIVEDRSH +S E
Sbjct: 421 NYVILSNIYATAGRWDGVARLRKLMKGGKITKAAGYSFIEEGGHIHKFIVEDRSHSRSDE 480

Query: 481 IYALLHKIYDIIKLHKHVHQDQNEDE 507
           IYALL ++   +KLH +V+   +E E
Sbjct: 481 IYALLDEVSMKMKLHGNVNDSDSEIE 506

BLAST of CSPI01G33460 vs. TrEMBL
Match: W9RDF1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_023466 PE=4 SV=1)

HSP 1 Score: 705.7 bits (1820), Expect = 4.2e-200
Identity = 331/510 (64.90%), Positives = 413/510 (80.98%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHA++LRNG+DHT  LI KLL++P++ YA  LFD IP+P+V+LYN+ I+ +S  
Sbjct: 4   MNQLKQIHAHTLRNGVDHTSILILKLLEIPNILYARNLFDLIPEPTVFLYNRLIKAYSFH 63

Query: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120
           G  H+C  LY +MC QGC+PN++SFT LF  C+SL +   GQM+HSHF K G   D+FA+
Sbjct: 64  GQHHQCLFLYRRMCLQGCTPNEHSFTLLFSVCSSLSSRQLGQMMHSHFVKLGHVRDIFAL 123

Query: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180
           TAL+DMYAKLGML  AR+ FDE  VR  PTWNS+++GYARSG ME A ELF  MP RNV+
Sbjct: 124 TALVDMYAKLGMLDCARKQFDEKRVRGTPTWNSMLSGYARSGDMEGASELFRLMPQRNVV 183

Query: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240
           SWTA+ISGY++NG+YAKAL MF+ +E E+  +PN ++IASVLPAC+ LGAL++G+R+E Y
Sbjct: 184 SWTAMISGYSKNGQYAKALAMFLQMEKERDVRPNAITIASVLPACANLGALEVGERVEEY 243

Query: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300
           AR  GF K+ YVSNAVLE++A+CG I+ A++VFDEIG +RNLCSWN+MIMGLAVHGRC +
Sbjct: 244 ARKVGFLKDLYVSNAVLEMYAKCGRIDTARRVFDEIGRRRNLCSWNSMIMGLAVHGRCNE 303

Query: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360
           AL LY+QM   ++ PDDVTFVGL+LACTHGGM  +G+QLF+SME KF + PKLEHYGC+V
Sbjct: 304 ALDLYEQMTTVRIAPDDVTFVGLILACTHGGMAMKGQQLFKSMEPKFGITPKLEHYGCMV 363

Query: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420
           DLLGRAG+LQEAY+LIQ M M PD+VIWG LLGACSFHGNVEL E AAESLF+LE WNP 
Sbjct: 364 DLLGRAGKLQEAYDLIQGMSMKPDNVIWGALLGACSFHGNVELAEKAAESLFELESWNPA 423

Query: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 480
           NYVILSNIYA A  W GVA+LRK+MKGG ITK AGYS+IE G  +H+FIVED+SH +S E
Sbjct: 424 NYVILSNIYASARRWDGVAKLRKVMKGGKITKAAGYSFIEEGGQVHKFIVEDKSHPRSDE 483

Query: 481 IYALLHKIYDIIKLHKHVHQDQNEDEELLY 511
           IYALL+K Y  ++L+++      EDEE+ +
Sbjct: 484 IYALLNKFYAKVRLYRNDTDCLTEDEEMQF 513

BLAST of CSPI01G33460 vs. TrEMBL
Match: A0A067K069_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14219 PE=4 SV=1)

HSP 1 Score: 699.9 bits (1805), Expect = 2.3e-198
Identity = 334/509 (65.62%), Positives = 413/509 (81.14%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MN+LKQIHA+++RNG+DHTK LI + L++P++ YA  LF+ IP P+ +LYNK IQ +S  
Sbjct: 1   MNRLKQIHAFTIRNGIDHTKTLIVEALKIPNISYAHNLFNLIPSPTAFLYNKLIQAYSFQ 60

Query: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120
             P+RC  LY QM  + C PN+++FTFLF AC    +   GQ+LH+H  KSGF  D+FA+
Sbjct: 61  SQPYRCLSLYSQMRFKNCLPNEHTFTFLFAACICFSSDLHGQILHTHLLKSGFNFDIFAL 120

Query: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180
           TAL+DMYAKLG LRSAR +FDE+  +DIPTWN+LIAGY+R G+ME AL+LF +MP +NV+
Sbjct: 121 TALVDMYAKLGKLRSARHVFDEITFKDIPTWNALIAGYSRCGNMEEALDLFRRMPYKNVV 180

Query: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240
           SWTA+ISGY+QNG+YAKALEMF+ +E EKG  PNEV+IASVLPAC+ LGAL++G+RIEAY
Sbjct: 181 SWTAMISGYSQNGEYAKALEMFLNMEKEKGLAPNEVTIASVLPACANLGALEVGERIEAY 240

Query: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDE-IGSKRNLCSWNTMIMGLAVHGRCI 300
           AR NG   N YVSNA+L+++ARCG I+ A+QVFDE IG ++NLCSWN+MIMGLA+HGR  
Sbjct: 241 ARKNGLLSNMYVSNALLDMYARCGKIDVARQVFDEIIGKRKNLCSWNSMIMGLAIHGRSH 300

Query: 301 DALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCL 360
           DALQLY+QM+     PDDVTFVGLLLACTHGGMV +GRQLF+SME KF+++PKLEHYGC+
Sbjct: 301 DALQLYNQMMREGTAPDDVTFVGLLLACTHGGMVVKGRQLFQSMERKFRISPKLEHYGCM 360

Query: 361 VDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNP 420
           VDLLGRAGELQEAY +I++MPM PDSVIWG LLGACSFH NVEL E+AAESLF+LEPWNP
Sbjct: 361 VDLLGRAGELQEAYEIIKSMPMRPDSVIWGALLGACSFHKNVELAEIAAESLFELEPWNP 420

Query: 421 GNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSG 480
           GNYVILSNIYA AG W GVA LRK+MKGGHITK AGYS+IE    I +FIV D SH +  
Sbjct: 421 GNYVILSNIYATAGRWDGVAMLRKLMKGGHITKAAGYSFIEEEGEIQKFIVGDVSHPRCE 480

Query: 481 EIYALLHKIYDIIKLHKHVHQDQNEDEEL 509
           EIY LL++    +KL + V   ++E +EL
Sbjct: 481 EIYRLLNEFSATMKL-QSVANFESELQEL 508

BLAST of CSPI01G33460 vs. TrEMBL
Match: A0A0L9T568_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan181s000600 PE=4 SV=1)

HSP 1 Score: 697.2 bits (1798), Expect = 1.5e-197
Identity = 328/497 (66.00%), Positives = 404/497 (81.29%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           M Q+KQIH Y+LRNG+D TK LIEKLL++P+L YA T+    PKP+++LYNK IQ +SS 
Sbjct: 1   MRQVKQIHGYTLRNGIDQTKILIEKLLEIPNLHYAHTVLHHSPKPTLFLYNKLIQAYSSH 60

Query: 61  G-HPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFA 120
             H H+C+ LY QM   G +PNQ++F FLF AC SL +   GQMLH+HF KSGF  D+FA
Sbjct: 61  PQHQHQCFSLYYQMRLHGFAPNQHTFNFLFSACTSLSSHSLGQMLHTHFTKSGFEPDLFA 120

Query: 121 MTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNV 180
            T+LLDMY+K+GML  ARQLFDEMPVR +PTWN+++ GYA+ G ME ALELF  MP RN+
Sbjct: 121 ATSLLDMYSKVGMLGLARQLFDEMPVRGVPTWNAIMYGYAKFGDMEGALELFGLMPKRNL 180

Query: 181 ISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEA 240
           +SWT +ISGY++N ++ +AL +F+ +E EKG  PNEV++AS+LPACS LGAL+IG+R+EA
Sbjct: 181 VSWTTMISGYSRNKRFGEALGLFLKMEKEKGIVPNEVTLASILPACSNLGALEIGQRVEA 240

Query: 241 YARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCI 300
           YAR NGFFKN YVSNAVLE++A+CG I+ A +VF+EIG  RNLCSWN+MIMGLAVHG+C 
Sbjct: 241 YARKNGFFKNLYVSNAVLEMYAKCGKIDVAWRVFNEIGRFRNLCSWNSMIMGLAVHGQCC 300

Query: 301 DALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCL 360
            A +LYDQML     PDDVTFVGLLLACTHGGMV +GR +F+SM + F + PKLEHYGC+
Sbjct: 301 KAFELYDQMLGEGTSPDDVTFVGLLLACTHGGMVEKGRHIFKSMTTSFYIIPKLEHYGCM 360

Query: 361 VDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNP 420
           VDLLGRAG+L+EAY +IQ+MPM PDSV+WG LLGACSFHGNVEL E+AAESLF LEPWNP
Sbjct: 361 VDLLGRAGQLREAYEVIQSMPMKPDSVMWGALLGACSFHGNVELAEIAAESLFVLEPWNP 420

Query: 421 GNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSG 480
           GNYVILSNIYA AG W GVA+LRK+MKG  ITK AG+S+IE G  +H+FIVEDRSH KS 
Sbjct: 421 GNYVILSNIYASAGQWDGVAKLRKVMKGSEITKSAGHSFIEEGGQLHKFIVEDRSHPKSN 480

Query: 481 EIYALLHKIYDIIKLHK 497
           EI ALL  +Y++I L++
Sbjct: 481 EIVALLDGVYEMINLNR 497

BLAST of CSPI01G33460 vs. TAIR10
Match: AT5G08510.1 (AT5G08510.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 590.1 bits (1520), Expect = 1.3e-168
Identity = 277/509 (54.42%), Positives = 375/509 (73.67%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MN +KQ+HA+ LR G+D TK L+++LL +P+L YA  LFD       +LYNK IQ +   
Sbjct: 1   MNGIKQLHAHCLRTGVDETKDLLQRLLLIPNLVYARKLFDHHQNSCTFLYNKLIQAYYVH 60

Query: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120
             PH   +LY  +   G  P+ ++F F+F A AS  +  P ++LHS F +SGF SD F  
Sbjct: 61  HQPHESIVLYNLLSFDGLRPSHHTFNFIFAASASFSSARPLRLLHSQFFRSGFESDSFCC 120

Query: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180
           T L+  YAKLG L  AR++FDEM  RD+P WN++I GY R G M+AA+ELF+ MP +NV 
Sbjct: 121 TTLITAYAKLGALCCARRVFDEMSKRDVPVWNAMITGYQRRGDMKAAMELFDSMPRKNVT 180

Query: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240
           SWT +ISG++QNG Y++AL+MF+ +E +K  KPN +++ SVLPAC+ LG L+IG+R+E Y
Sbjct: 181 SWTTVISGFSQNGNYSEALKMFLCMEKDKSVKPNHITVVSVLPACANLGELEIGRRLEGY 240

Query: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300
           AR NGFF N YV NA +E++++CG I+ A+++F+E+G++RNLCSWN+MI  LA HG+  +
Sbjct: 241 ARENGFFDNIYVCNATIEMYSKCGMIDVAKRLFEELGNQRNLCSWNSMIGSLATHGKHDE 300

Query: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360
           AL L+ QML    +PD VTFVGLLLAC HGGMV +G++LF+SME   +++PKLEHYGC++
Sbjct: 301 ALTLFAQMLREGEKPDAVTFVGLLLACVHGGMVVKGQELFKSMEEVHKISPKLEHYGCMI 360

Query: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420
           DLLGR G+LQEAY+LI+ MPM PD+V+WGTLLGACSFHGNVE+ E+A+E+LFKLEP NPG
Sbjct: 361 DLLGRVGKLQEAYDLIKTMPMKPDAVVWGTLLGACSFHGNVEIAEIASEALFKLEPTNPG 420

Query: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSG 480
           N VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S 
Sbjct: 421 NCVIMSNIYAANEKWDGVLRMRKLMKKETMTKAAGYSYFVEVGVDVHKFTVEDKSHPRSY 480

Query: 481 EIYALLHKIYDIIKLHKHVHQDQNEDEEL 509
           EIY +L +I+  +KL K       + E+L
Sbjct: 481 EIYQVLEEIFRRMKLEKSRFDSLLQPEQL 509
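
The percentages in each HSP summary line are the matched-column counts divided by the alignment length (for the hit above, 277 identical and 375 positive columns out of 509). A minimal Python sketch for re-deriving them from a summary line follows. The function name `parse_hsp_stats` and the returned keys are hypothetical, chosen for illustration and not part of any BLAST tool; the pattern accepts both the "Postives" and "Positives" spellings.

```python
import re

# Matches the HSP summary line as printed in this record.
# "Posi?tives" tolerates the "Postives" spelling seen in the raw output.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    ident, length, pos, pos_length = map(int, (ident, length, pos, pos_length))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": length,
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * ident / length, 2),
        "positive_pct": round(100 * pos / pos_length, 2),
        # Percentages exactly as reported in the line, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 277/509 (54.42%), Postives = 375/509 (73.67%), Query Frame = 1"
    print(parse_hsp_stats(line))
```

Comparing the recomputed and reported values is a quick sanity check when summary lines are copied by hand between records.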

BLAST of CSPI01G33460 vs. TAIR10
Match: AT2G20540.1 (AT2G20540.1 mitochondrial editing factor 21)

HSP 1 Score: 407.5 bits (1046), Expect = 1.2e-113
Identity = 206/507 (40.63%), Positives = 321/507 (63.31%), Query Frame = 1

Query: 2   NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQIPKPSVYLYNKFIQTF 61
           N+ K+I+A  + +GL  + F++ K++    ++ D+ YA  LF+Q+  P+V+LYN  I+ +
Sbjct: 24  NEWKKINASIIIHGLSQSSFMVTKMVDFCDKIEDMDYATRLFNQVSNPNVFLYNSIIRAY 83

Query: 62  SSIGHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASD 121
           +          +Y Q+  +    P++++F F+F +CASL + Y G+ +H H CK G    
Sbjct: 84  THNSLYCDVIRIYKQLLRKSFELPDRFTFPFMFKSCASLGSCYLGKQVHGHLCKFGPRFH 143

Query: 122 MFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPV 181
           +    AL+DMY K   L  A ++FDEM  RD+ +WNSL++GYAR G M+ A  LF+ M  
Sbjct: 144 VVTENALIDMYMKFDDLVDAHKVFDEMYERDVISWNSLLSGYARLGQMKKAKGLFHLMLD 203

Query: 182 RNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKR 241
           + ++SWTA+ISGY   G Y +A++ F  ++   G +P+E+S+ SVLP+C+QLG+L++GK 
Sbjct: 204 KTIVSWTAMISGYTGIGCYVEAMDFFREMQLA-GIEPDEISLISVLPSCAQLGSLELGKW 263

Query: 242 IEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHG 301
           I  YA   GF K   V NA++E++++CG I +A Q+F ++  K ++ SW+TMI G A HG
Sbjct: 264 IHLYAERRGFLKQTGVCNALIEMYSKCGVISQAIQLFGQMEGK-DVISWSTMISGYAYHG 323

Query: 302 RCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHY 361
               A++ +++M   K++P+ +TF+GLL AC+H GM  EG + F+ M   +Q+ PK+EHY
Sbjct: 324 NAHGAIETFNEMQRAKVKPNGITFLGLLSACSHVGMWQEGLRYFDMMRQDYQIEPKIEHY 383

Query: 362 GCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEP 421
           GCL+D+L RAG+L+ A  + + MPM PDS IWG+LL +C   GN+++  VA + L +LEP
Sbjct: 384 GCLIDVLARAGKLERAVEITKTMPMKPDSKIWGSLLSSCRTPGNLDVALVAMDHLVELEP 443

Query: 422 WNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHL 481
            + GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S  
Sbjct: 444 EDMGNYVLLANIYADLGKWEDVSRLRKMIRNENMKKTPGGSLIEVNNIVQEFVSGDNSKP 503

Query: 482 KSGEIYALLHKIYDIIKLHKHVHQDQN 504
              EI  +L             HQDQ+
Sbjct: 504 FWTEISIVLQLFTS--------HQDQD 520

BLAST of CSPI01G33460 vs. TAIR10
Match: AT4G37380.1 (AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 395.6 bits (1015), Expect = 4.7e-110
Identity = 205/506 (40.51%), Positives = 318/506 (62.85%), Query Frame = 1

Query: 1   MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQIPKPSVYLYNKF 60
           ++++ QIHA  LR N L H ++ +  L           + ++  LF Q   P ++L+   
Sbjct: 42  VDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSLALFHQTIDPDLFLFTAA 101

Query: 61  IQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGF 120
           I T S  G   + +LLY Q+ S   +PN+++F+ L  +C++      G+++H+H  K G 
Sbjct: 102 INTASINGLKDQAFLLYVQLLSSEINPNEFTFSSLLKSCSTK----SGKLIHTHVLKFGL 161

Query: 121 ASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNK 180
             D +  T L+D+YAK G + SA+++FD MP R + +  ++I  YA+ G++EAA  LF+ 
Sbjct: 162 GIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITCYAKQGNVEAARALFDS 221

Query: 181 MPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDI 240
           M  R+++SW  +I GYAQ+G    AL +F  L  E   KP+E+++ + L ACSQ+GAL+ 
Sbjct: 222 MCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEITVVAALSACSQIGALET 281

Query: 241 GKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLA 300
           G+ I  + +++    N  V   +++++++CG++EEA  VF++   ++++ +WN MI G A
Sbjct: 282 GRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDT-PRKDIVAWNAMIAGYA 341

Query: 301 VHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPK 360
           +HG   DAL+L+++M  I  ++P D+TF+G L AC H G+V EG ++FESM  ++ + PK
Sbjct: 342 MHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIRIFESMGQEYGIKPK 401

Query: 361 LEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLF 420
           +EHYGCLV LLGRAG+L+ AY  I+NM M  DSV+W ++LG+C  HG+  LG+  AE L 
Sbjct: 402 IEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKLHGDFVLGKEIAEYLI 461

Query: 421 KLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVED 480
            L   N G YV+LSNIYA  GD+ GVA++R +MK   I K  G S IE+ + +HEF   D
Sbjct: 462 GLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGISTIEIENKVHEFRAGD 521

Query: 481 RSHLKSGEIYALLHKIYDIIKLHKHV 499
           R H KS EIY +L KI + IK H +V
Sbjct: 522 REHSKSKEIYTMLRKISERIKSHGYV 542

BLAST of CSPI01G33460 vs. TAIR10
Match: AT1G33350.1 (AT1G33350.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 390.2 bits (1001), Expect = 2.0e-108
Identity = 193/500 (38.60%), Positives = 310/500 (62.00%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQ-----LPDLPYACTLFDQIPKPSVYLYNKFIQ 60
           +N LKQ+ ++ + +GL H+ FL  KLL+     L +L YA  +FD+   P+ +LY   + 
Sbjct: 37  LNHLKQVQSFMIVSGLSHSHFLCFKLLRFCTLRLCNLSYARFIFDRFSFPNTHLYAAVLT 96

Query: 61  TFSSIG--HPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLFNVYPGQMLHSHFCKSG 120
            +SS    H    +  +  M ++    PN + +  +  +   L + +   ++H+H  KSG
Sbjct: 97  AYSSSLPLHASSAFSFFRLMVNRSVPRPNHFIYPLVLKSTPYLSSAFSTPLVHTHLFKSG 156

Query: 121 FASDMFAMTALLDMYAK-LGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELF 180
           F   +   TALL  YA  +  +  ARQLFDEM  R++ +W ++++GYARSG +  A+ LF
Sbjct: 157 FHLYVVVQTALLHSYASSVSHITLARQLFDEMSERNVVSWTAMLSGYARSGDISNAVALF 216

Query: 181 NKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGAL 240
             MP R+V SW A+++   QNG + +A+ +F  + NE   +PNEV++  VL AC+Q G L
Sbjct: 217 EDMPERDVPSWNAILAACTQNGLFLEAVSLFRRMINEPSIRPNEVTVVCVLSACAQTGTL 276

Query: 241 DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 300
            + K I A+A       + +VSN++++L+ +CGN+EEA  VF ++ SK++L +WN+MI  
Sbjct: 277 QLAKGIHAFAYRRDLSSDVFVSNSLVDLYGKCGNLEEASSVF-KMASKKSLTAWNSMINC 336

Query: 301 LAVHGRCIDALQLYDQML---IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQ 360
            A+HGR  +A+ ++++M+   I  ++PD +TF+GLL ACTHGG+V++GR  F+ M ++F 
Sbjct: 337 FALHGRSEEAIAVFEEMMKLNINDIKPDHITFIGLLNACTHGGLVSKGRGYFDLMTNRFG 396

Query: 361 VAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAA 420
           + P++EHYGCL+DLLGRAG   EA  ++  M M  D  IWG+LL AC  HG+++L EVA 
Sbjct: 397 IEPRIEHYGCLIDLLGRAGRFDEALEVMSTMKMKADEAIWGSLLNACKIHGHLDLAEVAV 456

Query: 421 ESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEF 480
           ++L  L P N G   +++N+Y   G+W    R RKM+K  +  K  G+S IE+ + +H+F
Sbjct: 457 KNLVALNPNNGGYVAMMANLYGEMGNWEEARRARKMIKHQNAYKPPGWSRIEIDNEVHQF 516

Query: 481 IVEDRSHLKSGEIYALLHKI 489
              D+SH ++ EIY +L  +
Sbjct: 517 YSLDKSHPETEEIYMILDSL 535

BLAST of CSPI01G33460 vs. TAIR10
Match: AT5G56310.1 (AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 382.5 bits (981), Expect = 4.1e-106
Identity = 192/497 (38.63%), Positives = 314/497 (63.18%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPD----LPYACTLFDQIPKPSVYLYNKFIQT 60
           +  LKQ H Y +  GL+     + K ++       L YA ++F   P P+ YL+N  I+ 
Sbjct: 28  LKTLKQSHCYMIITGLNRDNLNVAKFIEACSNAGHLRYAYSVFTHQPCPNTYLHNTMIRA 87

Query: 61  FSSIGHPHRCWL---LYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGF 120
            S +  P+   +   +Y ++ +    P+ ++F F+      + +V+ G+ +H      GF
Sbjct: 88  LSLLDEPNAHSIAITVYRKLWALCAKPDTFTFPFVLKIAVRVSDVWFGRQIHGQVVVFGF 147

Query: 121 ASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNK 180
            S +  +T L+ MY   G L  AR++FDEM V+D+  WN+L+AGY + G M+ A  L   
Sbjct: 148 DSSVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVWNALLAGYGKVGEMDEARSLLEM 207

Query: 181 MP--VRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGAL 240
           MP  VRN +SWT +ISGYA++G+ ++A+E+F  +  E   +P+EV++ +VL AC+ LG+L
Sbjct: 208 MPCWVRNEVSWTCVISGYAKSGRASEAIEVFQRMLMEN-VEPDEVTLLAVLSACADLGSL 267

Query: 241 DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG 300
           ++G+RI +Y  + G  +   ++NAV++++A+ GNI +A  VF+ + ++RN+ +W T+I G
Sbjct: 268 ELGERICSYVDHRGMNRAVSLNNAVIDMYAKSGNITKALDVFECV-NERNVVTWTTIIAG 327

Query: 301 LAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAP 360
           LA HG   +AL ++++M+   +RP+DVTF+ +L AC+H G V  G++LF SM SK+ + P
Sbjct: 328 LATHGHGAEALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKRLFNSMRSKYGIHP 387

Query: 361 KLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESL 420
            +EHYGC++DLLGRAG+L+EA  +I++MP   ++ IWG+LL A + H ++ELGE A   L
Sbjct: 388 NIEHYGCMIDLLGRAGKLREADEVIKSMPFKANAAIWGSLLAASNVHHDLELGERALSEL 447

Query: 421 FKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVE 480
            KLEP N GNY++L+N+Y+  G W     +R MMKG  + K AG S IEV + +++FI  
Sbjct: 448 IKLEPNNSGNYMLLANLYSNLGRWDESRMMRNMMKGIGVKKMAGESSIEVENRVYKFISG 507

Query: 481 DRSHLKSGEIYALLHKI 489
           D +H +   I+ +L ++
Sbjct: 508 DLTHPQVERIHEILQEM 522

BLAST of CSPI01G33460 vs. NCBI nr
Match: gi|778664334|ref|XP_011660274.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Cucumis sativus])

HSP 1 Score: 1057.0 bits (2732), Expect = 1.1e-305
Identity = 510/512 (99.61%), Positives = 510/512 (99.61%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI
Sbjct: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60

Query: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120
           GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM
Sbjct: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120

Query: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180
           TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI
Sbjct: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180

Query: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240
           SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY
Sbjct: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240

Query: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300
           ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
Sbjct: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300

Query: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360
           ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV
Sbjct: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360

Query: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420
           DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG
Sbjct: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420

Query: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 480
           NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE
Sbjct: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 480

Query: 481 IYALLHKIYDIIKLHKHVHQDQNEDEELLYSS 513
           IYALLHKIYDIIKLHKHVH D NEDEELLYSS
Sbjct: 481 IYALLHKIYDIIKLHKHVHHDPNEDEELLYSS 512

BLAST of CSPI01G33460 vs. NCBI nr
Match: gi|359489593|ref|XP_003633947.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Vitis vinifera])

HSP 1 Score: 728.4 bits (1879), Expect = 8.7e-207
Identity = 347/506 (68.58%), Positives = 417/506 (82.41%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MN+LKQI AY+LRNG++HTK LI  LLQ+P +PYA  LFD IPKP+V+LYNK IQ +SS 
Sbjct: 1   MNRLKQIQAYTLRNGIEHTKQLIVSLLQIPSIPYAHKLFDFIPKPTVFLYNKLIQAYSSH 60

Query: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120
           G  H+C+ LY QMC QGCSPN++SFTFLF ACASL +   G+MLH+HF KSGF  D+FA+
Sbjct: 61  GPHHQCFSLYTQMCLQGCSPNEHSFTFLFSACASLSSHQQGRMLHTHFVKSGFGCDVFAL 120

Query: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180
           TAL+DMYAKLG+L  AR+ FDEM VRD+PTWNS+IAGYAR G +E ALELF  MP RNV 
Sbjct: 121 TALVDMYAKLGLLSLARKQFDEMTVRDVPTWNSMIAGYARCGDLEGALELFRLMPARNVT 180

Query: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240
           SWTA+ISGYAQNG+YAKAL MF+ +E E   +PNEV++ASVLPAC+ LGAL++G+RIE Y
Sbjct: 181 SWTAMISGYAQNGQYAKALSMFLMMEEETEMRPNEVTLASVLPACANLGALEVGERIEVY 240

Query: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300
           AR NG+FKN YVSNA+LE++ARCG I++A  VF+EI  +RNLCSWN+MIMGLAVHGRC +
Sbjct: 241 ARGNGYFKNLYVSNALLEMYARCGRIDKAWGVFEEIDGRRNLCSWNSMIMGLAVHGRCDE 300

Query: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360
           A++L+ +ML     PDDVTFVG+LLACTHGGMV EG+  FESME  F +APKLEHYGC+V
Sbjct: 301 AIELFYKMLREGAAPDDVTFVGVLLACTHGGMVVEGQHFFESMERDFSIAPKLEHYGCMV 360

Query: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420
           DLLGRAGEL+EA++LI  MPM PDSV+WGTLLGACSFHG+VEL E AA +LF+LEP NPG
Sbjct: 361 DLLGRAGELREAHDLILRMPMEPDSVVWGTLLGACSFHGHVELAEKAAGALFELEPSNPG 420

Query: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 480
           NYVILSNIYA AG W GVARLRK+MKGG ITK AGYS+IE G  IH+FIVEDRSH +S E
Sbjct: 421 NYVILSNIYATAGRWDGVARLRKLMKGGKITKAAGYSFIEEGGHIHKFIVEDRSHSRSDE 480

Query: 481 IYALLHKIYDIIKLHKHVHQDQNEDE 507
           IYALL ++   +KLH +V+   +E E
Sbjct: 481 IYALLDEVSMKMKLHGNVNDSDSEIE 506

BLAST of CSPI01G33460 vs. NCBI nr
Match: gi|1000956866|ref|XP_015577366.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Ricinus communis])

HSP 1 Score: 708.4 bits (1827), Expect = 9.3e-201
Identity = 331/489 (67.69%), Positives = 407/489 (83.23%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHAY+LRNG+D+ K L E+L+Q+P++PYA  L D IP P+V+LYNK IQ +S  
Sbjct: 1   MNQLKQIHAYTLRNGIDYNKTLTERLIQIPNVPYAHKLIDLIPSPNVFLYNKLIQAYSFQ 60

Query: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120
              H+C+ +Y QM S+ C+ NQ++FTFLF ACAS F+    QMLH+HF KSGF SD+ A+
Sbjct: 61  NQLHQCFSIYSQMRSRNCTGNQHTFTFLFAACASFFSPLHAQMLHTHFKKSGFESDVIAL 120

Query: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180
           TAL+DMY KLGM+  A ++FDE+PVRDIPTWN+LIAGY+R G ME AL++F  MP RNV+
Sbjct: 121 TALVDMYCKLGMVAFAHRVFDEIPVRDIPTWNALIAGYSRCGDMEGALKIFKLMPDRNVV 180

Query: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240
           SWTA+ISGY+QNG+YAKALE+F+ +E E G +PNEV+IAS+LPAC+ LGAL++G RIE Y
Sbjct: 181 SWTAMISGYSQNGRYAKALELFLKMEKENGLRPNEVTIASILPACANLGALEVGDRIETY 240

Query: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDE-IGSKRNLCSWNTMIMGLAVHGRCI 300
           AR NG  +N YVSNA+LE++ARCG I+ A++VFD+ IG +RNLCSWN+MIMGLA+HGR  
Sbjct: 241 ARENGLLRNLYVSNALLEMYARCGKIDMARKVFDKIIGKRRNLCSWNSMIMGLAIHGRSH 300

Query: 301 DALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCL 360
           DAL LY++MLI  + PDDVTFVG+LLACTHGGM+ +GRQLF+SME KF++APKLEHYGC+
Sbjct: 301 DALHLYNRMLIEGIAPDDVTFVGILLACTHGGMLVKGRQLFQSMERKFRIAPKLEHYGCM 360

Query: 361 VDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNP 420
           VDLLGRAGELQEAYNLI++MPM PDSVIWG LLGACSFH NVE  E+AA SLF+LEPWNP
Sbjct: 361 VDLLGRAGELQEAYNLIKSMPMKPDSVIWGALLGACSFHKNVEYAEIAAGSLFELEPWNP 420

Query: 421 GNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSG 480
           GNYVILSNIYA  G W GVA+LRK+MKGG ITK AGYS+IE G  I +FIVED SH +S 
Sbjct: 421 GNYVILSNIYASVGRWDGVAKLRKLMKGGQITKTAGYSFIEGGGKIEKFIVEDLSHPRSD 480

Query: 481 EIYALLHKI 489
           EIY LL++I
Sbjct: 481 EIYTLLNEI 489

BLAST of CSPI01G33460 vs. NCBI nr
Match: gi|703114586|ref|XP_010100697.1| (hypothetical protein L484_023466 [Morus notabilis])

HSP 1 Score: 705.7 bits (1820), Expect = 6.0e-200
Identity = 331/510 (64.90%), Positives = 413/510 (80.98%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHA++LRNG+DHT  LI KLL++P++ YA  LFD IP+P+V+LYN+ I+ +S  
Sbjct: 4   MNQLKQIHAHTLRNGVDHTSILILKLLEIPNILYARNLFDLIPEPTVFLYNRLIKAYSFH 63

Query: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120
           G  H+C  LY +MC QGC+PN++SFT LF  C+SL +   GQM+HSHF K G   D+FA+
Sbjct: 64  GQHHQCLFLYRRMCLQGCTPNEHSFTLLFSVCSSLSSRQLGQMMHSHFVKLGHVRDIFAL 123

Query: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180
           TAL+DMYAKLGML  AR+ FDE  VR  PTWNS+++GYARSG ME A ELF  MP RNV+
Sbjct: 124 TALVDMYAKLGMLDCARKQFDEKRVRGTPTWNSMLSGYARSGDMEGASELFRLMPQRNVV 183

Query: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240
           SWTA+ISGY++NG+YAKAL MF+ +E E+  +PN ++IASVLPAC+ LGAL++G+R+E Y
Sbjct: 184 SWTAMISGYSKNGQYAKALAMFLQMEKERDVRPNAITIASVLPACANLGALEVGERVEEY 243

Query: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300
           AR  GF K+ YVSNAVLE++A+CG I+ A++VFDEIG +RNLCSWN+MIMGLAVHGRC +
Sbjct: 244 ARKVGFLKDLYVSNAVLEMYAKCGRIDTARRVFDEIGRRRNLCSWNSMIMGLAVHGRCNE 303

Query: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360
           AL LY+QM   ++ PDDVTFVGL+LACTHGGM  +G+QLF+SME KF + PKLEHYGC+V
Sbjct: 304 ALDLYEQMTTVRIAPDDVTFVGLILACTHGGMAMKGQQLFKSMEPKFGITPKLEHYGCMV 363

Query: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420
           DLLGRAG+LQEAY+LIQ M M PD+VIWG LLGACSFHGNVEL E AAESLF+LE WNP 
Sbjct: 364 DLLGRAGKLQEAYDLIQGMSMKPDNVIWGALLGACSFHGNVELAEKAAESLFELESWNPA 423

Query: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 480
           NYVILSNIYA A  W GVA+LRK+MKGG ITK AGYS+IE G  +H+FIVED+SH +S E
Sbjct: 424 NYVILSNIYASARRWDGVAKLRKVMKGGKITKAAGYSFIEEGGQVHKFIVEDKSHPRSDE 483

Query: 481 IYALLHKIYDIIKLHKHVHQDQNEDEELLY 511
           IYALL+K Y  ++L+++      EDEE+ +
Sbjct: 484 IYALLNKFYAKVRLYRNDTDCLTEDEEMQF 513

BLAST of CSPI01G33460 vs. NCBI nr
Match: gi|697182734|ref|XP_009600379.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Nicotiana tomentosiformis])

HSP 1 Score: 704.1 bits (1816), Expect = 1.8e-199
Identity = 330/493 (66.94%), Positives = 407/493 (82.56%), Query Frame = 1

Query: 1   MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI 60
           MNQLKQIHA++LRNG+D T+FLI KL+++P++PYA  +FD IP+P+V+LYNK IQ +SS 
Sbjct: 1   MNQLKQIHAHTLRNGIDFTQFLITKLIEIPNIPYAHKVFDNIPRPAVFLYNKLIQAYSSH 60

Query: 61  GHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCKSGFASDMFAM 120
           G P +C+ LY QM  QGCSPN +SFTFLF AC +  +   GQM H HF K GF  D++A+
Sbjct: 61  GLPSQCFSLYIQMRRQGCSPNPHSFTFLFAACTNRSSPIQGQMFHVHFVKWGFKFDIYAL 120

Query: 121 TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVI 180
           TAL+DMYAK+G+L +AR+LFDEM ++D+PTWNSLIAGYA++G++E A +LF+ MP RNVI
Sbjct: 121 TALVDMYAKMGLLPAARKLFDEMEMKDVPTWNSLIAGYAKNGNVEEAFKLFSAMPSRNVI 180

Query: 181 SWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAY 240
           SWTA+ISGY+QNGKYA AL ++  +E ++G KPNEV+IASVLPAC+ LGAL++G++IEAY
Sbjct: 181 SWTAMISGYSQNGKYANALAVYKEMERDRGVKPNEVTIASVLPACANLGALEVGEKIEAY 240

Query: 241 ARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID 300
           AR NG+FKN +V NAV+E++ +CG I+ A Q+F EIG +RNLCSWNTMIMGLAVHG+  +
Sbjct: 241 ARANGYFKNMFVCNAVVEMYMKCGRIDRAMQLFHEIGRRRNLCSWNTMIMGLAVHGKGDE 300

Query: 301 ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLV 360
           AL+L+DQML     PDDVTFVG +LACTHGGMVA+G +L   ME +F +APKLEHYGC+V
Sbjct: 301 ALKLFDQMLGEGNAPDDVTFVGAILACTHGGMVAKGWELLSLMEQRFSIAPKLEHYGCMV 360

Query: 361 DLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPG 420
           DLLGRAG+LQEAY+LIQ+MPM PDSVIWGTLLGACSFHGNVEL E AAE L  LEPWNPG
Sbjct: 361 DLLGRAGKLQEAYDLIQSMPMRPDSVIWGTLLGACSFHGNVELAEKAAEFLSVLEPWNPG 420

Query: 421 NYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGE 480
           NYVILSNIYA AG W GVARLRK+MK   ITK AGYS+IE G  IH+FIVED+SH KS E
Sbjct: 421 NYVILSNIYARAGRWDGVARLRKLMKSSQITKAAGYSFIEEGGDIHKFIVEDKSHHKSNE 480

Query: 481 IYALLHKIYDIIK 494
           IYALL  +  I+K
Sbjct: 481 IYALLDLVTTILK 493

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PP371_ARATH  2.3e-167  54.42     Pentatricopeptide repeat-containing protein At5g08510 OS=Arabidopsis thaliana GN...
PP165_ARATH  2.1e-112  40.63     Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana GN...
PP354_ARATH  8.3e-109  40.51     Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t...
PPR70_ARATH  3.5e-107  38.60     Pentatricopeptide repeat-containing protein At1g33350 OS=Arabidopsis thaliana GN...
PP433_ARATH  7.3e-105  38.63     Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity  Description
A0A0A0LY28_CUCSA  7.4e-306  99.61     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G690140 PE=4 SV=1
F6I6G7_VITVI      6.0e-207  68.58     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0046g00030 PE=4 SV=...
W9RDF1_9ROSA      4.2e-200  64.90     Uncharacterized protein OS=Morus notabilis GN=L484_023466 PE=4 SV=1
A0A067K069_JATCU  2.3e-198  65.62     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14219 PE=4 SV=1
A0A0L9T568_PHAAN  1.5e-197  66.00     Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan181s000600 PE=4 SV=1

Match Name   E-value   Identity  Description
AT5G08510.1  1.3e-168  54.42     Pentatricopeptide repeat (PPR) superfamily protein
AT2G20540.1  1.2e-113  40.63     mitochondrial editing factor 21
AT4G37380.1  4.7e-110  40.51     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G33350.1  2.0e-108  38.60     Pentatricopeptide repeat (PPR) superfamily protein
AT5G56310.1  4.1e-106  38.63     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                         E-value   Identity  Description
gi|778664334|ref|XP_011660274.1|   1.1e-305  99.61     PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Cucumis sativu...
gi|359489593|ref|XP_003633947.1|   8.7e-207  68.58     PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Vitis vinifera...
gi|1000956866|ref|XP_015577366.1|  9.3e-201  67.69     PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Ricinus commun...
gi|703114586|ref|XP_010100697.1|   6.0e-200  64.90     hypothetical protein L484_023466 [Morus notabilis]
gi|697182734|ref|XP_009600379.1|   1.8e-199  66.94     PREDICTED: pentatricopeptide repeat-containing protein At5g08510 [Nicotiana tome...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf

Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
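
For downstream use, the GO assignments above can be grouped by category. The sketch below is illustrative only: it assumes each row is whitespace-separated with the category first, the accession second, and the term name as the remainder, and the names `GO_ROWS` and `group_go_terms` are hypothetical helpers, not part of any database API.

```python
from collections import defaultdict

# GO rows copied from the table above: category, accession, term name.
GO_ROWS = """\
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
"""


def group_go_terms(rows):
    """Group (accession, name) pairs by GO category."""
    grouped = defaultdict(list)
    for row in rows.strip().splitlines():
        # Split on the first two runs of whitespace only, so that
        # multi-word term names stay intact.
        category, accession, name = row.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


if __name__ == "__main__":
    for category, terms in group_go_terms(GO_ROWS).items():
        print(category)
        for accession, name in terms:
            print("  %s  %s" % (accession, name))
```

Splitting with `maxsplit=2` works whether the columns are separated by single spaces or padded for alignment.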

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI01G33460.1  CSPI01G33460.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                        Source   Source Term        Source Description  Alignment
IPR002885  Pentatricopeptide repeat               PFAM     PF01535            PPR
    coord: 120..147  score: 1.2E-4
    coord: 180..203  score: 6.2E-6
    coord: 355..379  score: 0.25
    coord: 150..178  score: 1.0E-8
    coord: 254..278  score:
IPR002885  Pentatricopeptide repeat               PFAM     PF13041            PPR_2
    coord: 281..327  score: 7.8E-8
    coord: 45..93    score: 2.
IPR002885  Pentatricopeptide repeat               TIGRFAMs TIGR00756          TIGR00756
    coord: 180..214  score: 4.5E-5
    coord: 150..177  score: 2.8E-7
    coord: 121..148  score: 3.3E-4
    coord: 48..81    score: 2.2E-5
    coord: 284..316  score: 3.
IPR002885  Pentatricopeptide repeat               PROFILE  PS51375            PPR
    coord: 46..80    score: 9.175
    coord: 384..414  score: 5.119
    coord: 281..315  score: 10.424
    coord: 249..279  score: 8.122
    coord: 147..181  score: 11.992
    coord: 116..146  score: 8.934
    coord: 352..382  score: 6.73
    coord: 182..213  score: 7.191
    coord: 316..346  score: 7.815
    coord: 418..452  score: 5.821
    coord: 214..248  score: 5
IPR011990  Tetratricopeptide-like helical domain  GENE3D   G3DSA:1.25.40.10
    coord: 383..435  score: 2.0E-6
    coord: 124..314  score: 2.
None       No IPR available                       PANTHER  PTHR24015          FAMILY NOT NAMED
    coord: 22..459   score: 2.6E