CSPI03G39810 (gene) Wild cucumber (PI 183967)

Name: CSPI03G39810
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein, putative
Location: Chr3 : 34147753 .. 34151456 (+)
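
The location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotations, but not stated on this page); the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr3 : 34147753 .. 34151456 (+)'.

    Returns (chromosome, start, end, strand). Hypothetical helper,
    not part of any database API.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr3 : 34147753 .. 34151456 (+)")

# Assumption: 1-based inclusive coordinates, so both end points count.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 3704 bp on the plus strand of Chr3.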
The following sequences are available for this feature:

Gene sequence (with intron)

CTAAATTCATAATATTGAAAATCACCTCCACGTCTAGGGAATTTCACAGAGGTAATATAAACTTTTCTAGTTGATATTCAAATTGAAGGAATCATCATTTCTGCAGAAAAATTCGTGGCTTGGGAAACCCTATTTAATCCCATCGCTCTCAAATGCTGCCTTGTAAATTTTCTTTTTATTTTTCTCTCTACACAGATTCCCCACATGGAGAACAGTATTTACACAATCCTCACTATTGGTCGCTGGGAGTCACTGAATCACATGAACTATAAGTTCGCTTCACTAAGACCAATTCATGGAGTTTTAGCGCTGAAATTCCTCAAGTGGGTCATCAAACAGCCTGGTTTGGAACCCAACCACCTCACTCATATACTCGGTATTACTACTCATGTACTTGTTAGAGCTAGACTGTACGGTTATGCCAAATCAATTCTGAAGCATTTAGCTCAGAAAAATTCTGGGTCCAACTTTCTTTTTGGTGTTCTTATGGATACATACCCTCTTTGCAGCTCAAACCCTGCAGTTTTTGACCTTTTAATTAGAGTTTATTTGCGGCAAGGAATGGTTGGACACGCTGTAAATACTTTTTCTTCCATGCTCATTCGTGGGTTTAAGCCATCTGTTTATACTTGTAACATGATCATGGCTTCCATGGTTAAGAACTGTAGAGCTCACTTGGTTTGGTCTTTTTTTAAGCAAATGCTTACCAGTAGAGTTTGTCCAAACGTTTCCAGTTTTAATATACTCATAAGTGTTCTATGTGTGCAAGGGAAGCTTAAGAAAGCTGTTAACATCTTAACAATGATGGAGAGGAATGGCTATGTTCCTACTATAGTTAGTTATAATACGTTGCTTAGTTGGTGCTGTAAGAAGGGAAGATTTAAATTTGCACTTGTGCTGATTCATCATATGGAGTGCAAGGGAATTCAAGCAGACGTCTGTACATACAATATGTTTATTGATAGTTTGTGCAGAAACAGTAGAAGCGCACAGGGGTATTTAGTTTTGAAGAAAATGAGGAATAAGATGATAACTCCTAATGAAGTTTCTTACAACACCTTGATTAATGGCTTTGTAAAGGAGGGAAAGATAGGGGTTGCTACTCGGGTTTTCAATGAGATGATAGAGCTTAATCTTTCACCAAACCTCATTACTTACAATATTCTAATTAATGGATACTGCATTAATGGCAATTTTGAAGAAGCATTGAGAGTTTTGGATGTGATGGAAGCAAATGACGTGAGGCCTAATGAGGTTACTATTGGAACTCTTTTAAATGGTCTATACAAGAGTGCCAAATTTGACGTAGCTAGAAATATTCTGGAGAGATATTCTATCAATAGAACATCTCTTAATTGTATCTCACATACTGTGATGATTGATGGGCTATGCAGAAATGGGTTGCTTGATGAAGCCTTTCAATTACTAATTGAGATGTGCAAGGATGGTGTTCATCCTGATATCATAACTTTTTCAGTGCTTATAAATGGATTCTGCAAAGTTGGGAATATTAACAAGGCAAAGGAGGTTATGTCGAAAATATATAGAGAAGGATTTGTTCCAAACAATGTTATTTTCTCTACATTAATATATAACTCTTGTAAGGTTGGAAATGTTTATGAAGCGATGAAGTTCTATGCTGCTATGAATTTGAATGGGCAAAATGCAGACAATTTCACATGTAATTCGTTAGTCGCTTCTCTTTGTGAAAATGGAAAACTAGTAGAAGCAGAGGAATTCTTGCATCACATTAGTAGGATTGGTCTTGTTCCTAACTCTGTTACATTTGATTGTATCATAAACGGATATGCAAATGTAGGAGATGGGTCAGGGGCATTTTCAGTGTTCGATAAAATGATTAGTTGTGGTCATCACCCTAGTCCTTTCACCTATGGCAGTCTATTGAAAGTGTTATGCAAGGGACAGAATTTTTGGGAAGCAAGAAAACTATTGAAAAAGCTCCACTGCATTCCGTTGGCTGTTGATACTATATCGTACAA
CACATTGATTGTTGAGATAAGTAAGTCAGGAAATTTATTGGAAGCAGTTCGCCTATTTGAGGAGATGATTCAGAATAATATTCTACCGGATAGTTATACATACACTTGTATTCTGTCTGGATTAATTAGAGAAGGGAGATTGGTCTGTGCCTTCATATTCTTGGGAAGACTCATGCAAAAAGAAATTCTAACATTGAATTCAATTGTGTACACTTGTTTCATTGATGGCCTTTTCAAGGCTGGCCAGTCAAAGGCTGCATTATATCTTTTTAAGGAAATGGAGGAAAAAGGCCTCTCCGTAGATTTGATTGCTCTTAATTCAATTACAGATGGATATTCAAGGATGGGAAAAGTGTTTAGTGCCAGTTCTCTCATTTCAAAAACGAGAAACAAAAATGTAATACCTAACTTGACTACATTTAATATATTGCTACATGGTTACTCCAGAGGACAGGATATAATGAGTTGCTTTAAGTTGTATAACCTTATGAGGAGATCGGGCTTTTTTCCTAACAGATTAACATACCATTCTCTTATTCTTGGACTTTGCAACCATGGTATGTTGGAACTTGGAATTAAGATTTTGAAAATGTTTATTGCTGAAAGTTCTACTATTGATGACTTGACATTTAATATGCTCATTAGGAAGTGTTGTGAAATCAATGACCTGGATAAAGTCATTGATTTGACTCATAACATGGAAGTCTTTAGGGTTTCTCTCGATAAAGACACACAAAAAGCCGTTACTGATGTGCTTGTTAGAAGGATGGTTTCCCAAAATTATTTCGTTTTTATGCATGAAATGCTCAAAAAGGGTTTTATCCCTACATCTAAACAATATTGCACAATGATGAAACGAATGTGTCGAGTGGGGGACATACAGGGGGCATTTAAATTAAAAGATCAGATGGTGGCACTTGGCATAAGTTTGGACGATGCCGCAGAATGTGCTATGGTTCGAGGGCTTGCACTTTGTGGGAAAATTGAAGAGGCAATGTGGATTCTTCAAAGGATGCTTAGGATGAAGAAAATTCCTACTACCAGCACGTTTACAACTTTGATGCACGTCTTCTGTAAAAAAGACAATTTTAAAGAGGCACATAATTTGAAGATCCTTATGGAGCATTATCGTGTGAAGCTTGATATAGTCGCTTACAATGTTCTCATTTCTGCGTGTTGCGCTAATGGTGATGTTATAACTGCACTTGACTTTTATGAAGAGATAAAACAGAAAGGTCTCTTGCCAAACATGACGACCTACAGAGTTCTAGTTTCTGCTATTAGTACAAAGCATTATGTTTCTAGGGGTGAAATAGTTCTCAAGGACTTGAATGATAGAGGATTAGTGTCTGGGTATTTAGATGGGAAGTTGCAAAAATCTTGCAGGGATTTTGTAGTTGCCATTAAAAAACTGAACTCCTTAAAGCCCAATCAAGGAAATTAAGTCAACAACAAACTGAAATACCACTGAATTTGATTCTGAGCACAAGGAAAAATCTAGTTCTTGATGGGAATGACTAGTATATCCTTCCAAGTTGATACAACTAAGGTTTGGCTTGCCTAAGCTTGTTTCTTTTAAGAATATTTGCAACCTTTCTGTAATTTTTTGCTGAGAATTTTATTTTGTTTATCCATGTACTGATGTGCCCTTTCAATTGCTGCAGGTGTGTTACAATCATGTATCACATACTTGCTTGGCCAACA

mRNA sequence

ATGGAGAACAGTATTTACACAATCCTCACTATTGGTCGCTGGGAGTCACTGAATCACATGAACTATAAGTTCGCTTCACTAAGACCAATTCATGGAGTTTTAGCGCTGAAATTCCTCAAGTGGGTCATCAAACAGCCTGGTTTGGAACCCAACCACCTCACTCATATACTCGGTATTACTACTCATGTACTTGTTAGAGCTAGACTGTACGGTTATGCCAAATCAATTCTGAAGCATTTAGCTCAGAAAAATTCTGGGTCCAACTTTCTTTTTGGTGTTCTTATGGATACATACCCTCTTTGCAGCTCAAACCCTGCAGTTTTTGACCTTTTAATTAGAGTTTATTTGCGGCAAGGAATGGTTGGACACGCTGTAAATACTTTTTCTTCCATGCTCATTCGTGGGTTTAAGCCATCTGTTTATACTTGTAACATGATCATGGCTTCCATGGTTAAGAACTGTAGAGCTCACTTGGTTTGGTCTTTTTTTAAGCAAATGCTTACCAGTAGAGTTTGTCCAAACGTTTCCAGTTTTAATATACTCATAAGTGTTCTATGTGTGCAAGGGAAGCTTAAGAAAGCTGTTAACATCTTAACAATGATGGAGAGGAATGGCTATGTTCCTACTATAGTTAGTTATAATACGTTGCTTAGTTGGTGCTGTAAGAAGGGAAGATTTAAATTTGCACTTGTGCTGATTCATCATATGGAGTGCAAGGGAATTCAAGCAGACGTCTGTACATACAATATGTTTATTGATAGTTTGTGCAGAAACAGTAGAAGCGCACAGGGGTATTTAGTTTTGAAGAAAATGAGGAATAAGATGATAACTCCTAATGAAGTTTCTTACAACACCTTGATTAATGGCTTTGTAAAGGAGGGAAAGATAGGGGTTGCTACTCGGGTTTTCAATGAGATGATAGAGCTTAATCTTTCACCAAACCTCATTACTTACAATATTCTAATTAATGGATACTGCATTAATGGCAATTTTGAAGAAGCATTGAGAGTTTTGGATGTGATGGAAGCAAATGACGTGAGGCCTAATGAGGTTACTATTGGAACTCTTTTAAATGGTCTATACAAGAGTGCCAAATTTGACGTAGCTAGAAATATTCTGGAGAGATATTCTATCAATAGAACATCTCTTAATTGTATCTCACATACTGTGATGATTGATGGGCTATGCAGAAATGGGTTGCTTGATGAAGCCTTTCAATTACTAATTGAGATGTGCAAGGATGGTGTTCATCCTGATATCATAACTTTTTCAGTGCTTATAAATGGATTCTGCAAAGTTGGGAATATTAACAAGGCAAAGGAGGTTATGTCGAAAATATATAGAGAAGGATTTGTTCCAAACAATGTTATTTTCTCTACATTAATATATAACTCTTGTAAGGTTGGAAATGTTTATGAAGCGATGAAGTTCTATGCTGCTATGAATTTGAATGGGCAAAATGCAGACAATTTCACATGTAATTCGTTAGTCGCTTCTCTTTGTGAAAATGGAAAACTAGTAGAAGCAGAGGAATTCTTGCATCACATTAGTAGGATTGGTCTTGTTCCTAACTCTGTTACATTTGATTGTATCATAAACGGATATGCAAATGTAGGAGATGGGTCAGGGGCATTTTCAGTGTTCGATAAAATGATTAGTTGTGGTCATCACCCTAGTCCTTTCACCTATGGCAGTCTATTGAAAGTGTTATGCAAGGGACAGAATTTTTGGGAAGCAAGAAAACTATTGAAAAAGCTCCACTGCATTCCGTTGGCTGTTGATACTATATCGTACAACACATTGATTGTTGAGATAAGTAAGTCAGGAAATTTATTGGAAGCAGTTCGCCTATTTGAGGAGATGATTCAGAATAATATTCTACCGGATAGTTATACATACACTTGTATTCTGTCTGGATTAATTAGAGAAGGGAGATTGGTCTGTGCCTTCATATTCTTGGGAAGACTCATGCAAAAAGAAATTCTAACATTGAATTCAAT
TGTGTACACTTGTTTCATTGATGGCCTTTTCAAGGCTGGCCAGTCAAAGGCTGCATTATATCTTTTTAAGGAAATGGAGGAAAAAGGCCTCTCCGTAGATTTGATTGCTCTTAATTCAATTACAGATGGATATTCAAGGATGGGAAAAGTGTTTAGTGCCAGTTCTCTCATTTCAAAAACGAGAAACAAAAATGTAATACCTAACTTGACTACATTTAATATATTGCTACATGGTTACTCCAGAGGACAGGATATAATGAGTTGCTTTAAGTTGTATAACCTTATGAGGAGATCGGGCTTTTTTCCTAACAGATTAACATACCATTCTCTTATTCTTGGACTTTGCAACCATGGTATGTTGGAACTTGGAATTAAGATTTTGAAAATGTTTATTGCTGAAAGTTCTACTATTGATGACTTGACATTTAATATGCTCATTAGGAAGTGTTGTGAAATCAATGACCTGGATAAAGTCATTGATTTGACTCATAACATGGAAGTCTTTAGGGTTTCTCTCGATAAAGACACACAAAAAGCCGTTACTGATGTGCTTGTTAGAAGGATGGTTTCCCAAAATTATTTCGTTTTTATGCATGAAATGCTCAAAAAGGGTTTTATCCCTACATCTAAACAATATTGCACAATGATGAAACGAATGTGTCGAGTGGGGGACATACAGGGGGCATTTAAATTAAAAGATCAGATGGTGGCACTTGGCATAAGTTTGGACGATGCCGCAGAATGTGCTATGGTTCGAGGGCTTGCACTTTGTGGGAAAATTGAAGAGGCAATGTGGATTCTTCAAAGGATGCTTAGGATGAAGAAAATTCCTACTACCAGCACGTTTACAACTTTGATGCACGTCTTCTGTAAAAAAGACAATTTTAAAGAGGCACATAATTTGAAGATCCTTATGGAGCATTATCGTGTGAAGCTTGATATAGTCGCTTACAATGTTCTCATTTCTGCGTGTTGCGCTAATGGTGATGTTATAACTGCACTTGACTTTTATGAAGAGATAAAACAGAAAGGTCTCTTGCCAAACATGACGACCTACAGAGTTCTAGTTTCTGCTATTAGTACAAAGCATTATGTTTCTAGGGGTGAAATAGTTCTCAAGGACTTGAATGATAGAGGATTAGTGTCTGGGTATTTAGATGGGAAGTTGCAAAAATCTTGCAGGGATTTTGTAGTTGCCATTAAAAAACTGAACTCCTTAAAGCCCAATCAAGGAAATTAA

Coding sequence (CDS)

ATGGAGAACAGTATTTACACAATCCTCACTATTGGTCGCTGGGAGTCACTGAATCACATGAACTATAAGTTCGCTTCACTAAGACCAATTCATGGAGTTTTAGCGCTGAAATTCCTCAAGTGGGTCATCAAACAGCCTGGTTTGGAACCCAACCACCTCACTCATATACTCGGTATTACTACTCATGTACTTGTTAGAGCTAGACTGTACGGTTATGCCAAATCAATTCTGAAGCATTTAGCTCAGAAAAATTCTGGGTCCAACTTTCTTTTTGGTGTTCTTATGGATACATACCCTCTTTGCAGCTCAAACCCTGCAGTTTTTGACCTTTTAATTAGAGTTTATTTGCGGCAAGGAATGGTTGGACACGCTGTAAATACTTTTTCTTCCATGCTCATTCGTGGGTTTAAGCCATCTGTTTATACTTGTAACATGATCATGGCTTCCATGGTTAAGAACTGTAGAGCTCACTTGGTTTGGTCTTTTTTTAAGCAAATGCTTACCAGTAGAGTTTGTCCAAACGTTTCCAGTTTTAATATACTCATAAGTGTTCTATGTGTGCAAGGGAAGCTTAAGAAAGCTGTTAACATCTTAACAATGATGGAGAGGAATGGCTATGTTCCTACTATAGTTAGTTATAATACGTTGCTTAGTTGGTGCTGTAAGAAGGGAAGATTTAAATTTGCACTTGTGCTGATTCATCATATGGAGTGCAAGGGAATTCAAGCAGACGTCTGTACATACAATATGTTTATTGATAGTTTGTGCAGAAACAGTAGAAGCGCACAGGGGTATTTAGTTTTGAAGAAAATGAGGAATAAGATGATAACTCCTAATGAAGTTTCTTACAACACCTTGATTAATGGCTTTGTAAAGGAGGGAAAGATAGGGGTTGCTACTCGGGTTTTCAATGAGATGATAGAGCTTAATCTTTCACCAAACCTCATTACTTACAATATTCTAATTAATGGATACTGCATTAATGGCAATTTTGAAGAAGCATTGAGAGTTTTGGATGTGATGGAAGCAAATGACGTGAGGCCTAATGAGGTTACTATTGGAACTCTTTTAAATGGTCTATACAAGAGTGCCAAATTTGACGTAGCTAGAAATATTCTGGAGAGATATTCTATCAATAGAACATCTCTTAATTGTATCTCACATACTGTGATGATTGATGGGCTATGCAGAAATGGGTTGCTTGATGAAGCCTTTCAATTACTAATTGAGATGTGCAAGGATGGTGTTCATCCTGATATCATAACTTTTTCAGTGCTTATAAATGGATTCTGCAAAGTTGGGAATATTAACAAGGCAAAGGAGGTTATGTCGAAAATATATAGAGAAGGATTTGTTCCAAACAATGTTATTTTCTCTACATTAATATATAACTCTTGTAAGGTTGGAAATGTTTATGAAGCGATGAAGTTCTATGCTGCTATGAATTTGAATGGGCAAAATGCAGACAATTTCACATGTAATTCGTTAGTCGCTTCTCTTTGTGAAAATGGAAAACTAGTAGAAGCAGAGGAATTCTTGCATCACATTAGTAGGATTGGTCTTGTTCCTAACTCTGTTACATTTGATTGTATCATAAACGGATATGCAAATGTAGGAGATGGGTCAGGGGCATTTTCAGTGTTCGATAAAATGATTAGTTGTGGTCATCACCCTAGTCCTTTCACCTATGGCAGTCTATTGAAAGTGTTATGCAAGGGACAGAATTTTTGGGAAGCAAGAAAACTATTGAAAAAGCTCCACTGCATTCCGTTGGCTGTTGATACTATATCGTACAACACATTGATTGTTGAGATAAGTAAGTCAGGAAATTTATTGGAAGCAGTTCGCCTATTTGAGGAGATGATTCAGAATAATATTCTACCGGATAGTTATACATACACTTGTATTCTGTCTGGATTAATTAGAGAAGGGAGATTGGTCTGTGCCTTCATATTCTTGGGAAGACTCATGCAAAAAGAAATTCTAACATTGAATTCAAT
TGTGTACACTTGTTTCATTGATGGCCTTTTCAAGGCTGGCCAGTCAAAGGCTGCATTATATCTTTTTAAGGAAATGGAGGAAAAAGGCCTCTCCGTAGATTTGATTGCTCTTAATTCAATTACAGATGGATATTCAAGGATGGGAAAAGTGTTTAGTGCCAGTTCTCTCATTTCAAAAACGAGAAACAAAAATGTAATACCTAACTTGACTACATTTAATATATTGCTACATGGTTACTCCAGAGGACAGGATATAATGAGTTGCTTTAAGTTGTATAACCTTATGAGGAGATCGGGCTTTTTTCCTAACAGATTAACATACCATTCTCTTATTCTTGGACTTTGCAACCATGGTATGTTGGAACTTGGAATTAAGATTTTGAAAATGTTTATTGCTGAAAGTTCTACTATTGATGACTTGACATTTAATATGCTCATTAGGAAGTGTTGTGAAATCAATGACCTGGATAAAGTCATTGATTTGACTCATAACATGGAAGTCTTTAGGGTTTCTCTCGATAAAGACACACAAAAAGCCGTTACTGATGTGCTTGTTAGAAGGATGGTTTCCCAAAATTATTTCGTTTTTATGCATGAAATGCTCAAAAAGGGTTTTATCCCTACATCTAAACAATATTGCACAATGATGAAACGAATGTGTCGAGTGGGGGACATACAGGGGGCATTTAAATTAAAAGATCAGATGGTGGCACTTGGCATAAGTTTGGACGATGCCGCAGAATGTGCTATGGTTCGAGGGCTTGCACTTTGTGGGAAAATTGAAGAGGCAATGTGGATTCTTCAAAGGATGCTTAGGATGAAGAAAATTCCTACTACCAGCACGTTTACAACTTTGATGCACGTCTTCTGTAAAAAAGACAATTTTAAAGAGGCACATAATTTGAAGATCCTTATGGAGCATTATCGTGTGAAGCTTGATATAGTCGCTTACAATGTTCTCATTTCTGCGTGTTGCGCTAATGGTGATGTTATAACTGCACTTGACTTTTATGAAGAGATAAAACAGAAAGGTCTCTTGCCAAACATGACGACCTACAGAGTTCTAGTTTCTGCTATTAGTACAAAGCATTATGTTTCTAGGGGTGAAATAGTTCTCAAGGACTTGAATGATAGAGGATTAGTGTCTGGGTATTTAGATGGGAAGTTGCAAAAATCTTGCAGGGATTTTGTAGTTGCCATTAAAAAACTGAACTCCTTAAAGCCCAATCAAGGAAATTAA
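
As a quick sanity check, the two ends of the coding sequence can be translated and compared with the protein shown in the BLAST alignments. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the two short excerpts are chosen here for illustration, with the excerpts copied from the start and end of the CDS above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    cds = cds.upper().replace("\n", "")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

cds_head = "ATGGAGAACAGTATTTACACAATCCTCACT"  # first 30 nt of the CDS
cds_tail = "CCCAATCAAGGAAATTAA"              # last 18 nt of the CDS

head_peptide = translate(cds_head)
tail_peptide = translate(cds_tail)
print(head_peptide, tail_peptide)
```

The head translates to `MENSIYTILT`, which matches the first ten residues of the query in the alignments, and the tail ends in a `TAA` stop codon (`*`).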
BLAST of CSPI03G39810 vs. Swiss-Prot
Match: PP432_ARATH (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 1124.4 bits (2907), Expect = 0.0e+00
Identity = 556/1078 (51.58%), Positives = 762/1078 (70.69%), Query Frame = 1

Query: 1    MENSIYTILTIGRWESLNHMNYKFASLRPIHGVLALKFLKWVIKQPGLEPNHLTHILGIT 60
            ME SIY ILTI RW SLNHM+Y+ A LR +HG LALKFLKWV+KQPGLE +H+  ++ IT
Sbjct: 19   MEKSIYNILTIDRWGSLNHMDYRQARLRLVHGKLALKFLKWVVKQPGLETDHIVQLVCIT 78

Query: 61   THVLVRARLYGYAKSILKHLAQKNSGSNFLFGVLMDTYPLCSSNPAVFDLLIRVYLRQGM 120
            TH+LVRAR+Y  A+ ILK L+  +  S+F+FG LM TY LC+SNP+V+D+LIRVYLR+GM
Sbjct: 79   THILVRARMYDPARHILKELSLMSGKSSFVFGALMTTYRLCNSNPSVYDILIRVYLREGM 138

Query: 121  VGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFNI 180
            +  ++  F  M + GF PSVYTCN I+ S+VK+     VWSF K+ML  ++CP+V++FNI
Sbjct: 139  IQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFNI 198

Query: 181  LISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECKG 240
            LI+VLC +G  +K+  ++  ME++GY PTIV+YNT+L W CKKGRFK A+ L+ HM+ KG
Sbjct: 199  LINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKG 258

Query: 241  IQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVAT 300
            + ADVCTYNM I  LCR++R A+GYL+L+ MR +MI PNEV+YNTLINGF  EGK+ +A+
Sbjct: 259  VDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIAS 318

Query: 301  RVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNGL 360
            ++ NEM+   LSPN +T+N LI+G+   GNF+EAL++  +MEA  + P+EV+ G LL+GL
Sbjct: 319  QLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGL 378

Query: 361  YKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPDI 420
             K+A+FD+AR    R   N   +  I++T MIDGLC+NG LDEA  LL EM KDG+ PDI
Sbjct: 379  CKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDI 438

Query: 421  ITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAA 480
            +T+S LINGFCKVG    AKE++ +IYR G  PN +I+STLIYN C++G + EA++ Y A
Sbjct: 439  VTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEA 498

Query: 481  MNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVGD 540
            M L G   D+FT N LV SLC+ GK+ EAEEF+  ++  G++PN+V+FDC+INGY N G+
Sbjct: 499  MILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNSGE 558

Query: 541  GSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYNT 600
            G  AFSVFD+M   GHHP+ FTYGSLLK LCKG +  EA K LK LH +P AVDT+ YNT
Sbjct: 559  GLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEKFLKSLHAVPAAVDTVMYNT 618

Query: 601  LIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQKE 660
            L+  + KSGNL +AV LF EM+Q +ILPDSYTYT ++SGL R+G+ V A +F      + 
Sbjct: 619  LLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTSLISGLCRKGKTVIAILFAKEAEARG 678

Query: 661  ILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEEKGLSVDLIALNSITDGYSRMGKVFSA 720
             +  N ++YTCF+DG+FKAGQ KA +Y  ++M+  G + D++  N++ DGYSRMGK+   
Sbjct: 679  NVLPNKVMYTCFVDGMFKAGQWKAGIYFREQMDNLGHTPDIVTTNAMIDGYSRMGKIEKT 738

Query: 721  SSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLILG 780
            + L+ +  N+N  PNLTT+NILLHGYS+ +D+ + F LY  +  +G  P++LT HSL+LG
Sbjct: 739  NDLLPEMGNQNGGPNLTTYNILLHGYSKRKDVSTSFLLYRSIILNGILPDKLTCHSLVLG 798

Query: 781  LCNHGMLELGIKILKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSLD 840
            +C   MLE+G+KILK FI     +D  TFNMLI KCC   +++   DL   M    +SLD
Sbjct: 799  ICESNMLEIGLKILKAFICRGVEVDRYTFNMLISKCCANGEINWAFDLVKVMTSLGISLD 858

Query: 841  KDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLKD 900
            KDT  A+  VL R    Q   + +HEM K+G  P S++Y  ++  +CRVGDI+ AF +K+
Sbjct: 859  KDTCDAMVSVLNRNHRFQESRMVLHEMSKQGISPESRKYIGLINGLCRVGDIKTAFVVKE 918

Query: 901  QMVALGISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKKD 960
            +M+A  I   + AE AMVR LA CGK +EA  +L+ ML+MK +PT ++FTTLMH+ CK  
Sbjct: 919  EMIAHKICPPNVAESAMVRALAKCGKADEATLLLRFMLKMKLVPTIASFTTLMHLCCKNG 978

Query: 961  NFKEAHNLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTYR 1020
            N  EA  L+++M +  +KLD+V+YNVLI+  CA GD+  A + YEE+K  G L N TTY+
Sbjct: 979  NVIEALELRVVMSNCGLKLDLVSYNVLITGLCAKGDMALAFELYEEMKGDGFLANATTYK 1038

Query: 1021 VLVSAISTKHYVSRG-EIVLKDLNDRGLVSGYLDGKLQKSCRDFVVAIKKLNSLKPNQ 1078
             L+  +  +     G +I+LKDL  RG ++       Q S R+  +A++KL +L+ N+
Sbjct: 1039 ALIRGLLARETAFSGADIILKDLLARGFITSM--SLSQDSHRNLKMAMEKLKALQSNK 1094
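
The percentages on an HSP summary line are the match counts divided by the alignment length. A minimal sketch for recomputing them is shown here; the function name `hsp_percentages` is hypothetical, and the pattern deliberately accepts both "Positives" and the misspelling "Postives" that appears in some exports.

```python
import re

# Captures identities/length and positives/length from an HSP summary line.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )

line = "Identity = 556/1078 (51.58%), Positives = 762/1078 (70.69%), Query Frame = 1"
pct_identity, pct_positives = hsp_percentages(line)
print(pct_identity, pct_positives)
```

For the PP432_ARATH hit this reproduces the reported values: 556/1078 gives 51.58% and 762/1078 gives 70.69%.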

BLAST of CSPI03G39810 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 303.1 bits (775), Expect = 1.2e-80
Identity = 191/654 (29.20%), Positives = 326/654 (49.85%), Query Frame = 1

Query: 34  LALKFLKWVIKQPGLEPNHLTHILG--ITTHVLVRARLYGYAKSILKHLAQKNSGSNF-- 93
           L LKFL W        P+    +    IT H+L + +LY  A+ + + +A K     +  
Sbjct: 64  LILKFLNWA------NPHQFFTLRCKCITLHILTKFKLYKTAQILAEDVAAKTLDDEYAS 123

Query: 94  -LFGVLMDTYPLCSSNPAVFDLLIRVYLRQGMVGHAVNTFSSMLIRGFKPSVYTCNMIMA 153
            +F  L +TY LC S  +VFDL+++ Y R  ++  A++        GF P V + N ++ 
Sbjct: 124 LVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLD 183

Query: 154 SMVKNCR-AHLVWSFFKQMLTSRVCPNVSSFNILISVLCVQGKLKKAVNILTMMERNGYV 213
           + +++ R      + FK+ML S+V PNV ++NILI   C  G +  A+ +   ME  G +
Sbjct: 184 ATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCL 243

Query: 214 PTIVSYNTLLSWCCKKGRFKFALVLIHHMECKGIQADVCTYNMFIDSLCRNSRSAQGYLV 273
           P +V+YNTL+   CK  +      L+  M  KG++ ++ +YN+ I+ LCR  R  +   V
Sbjct: 244 PNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFV 303

Query: 274 LKKMRNKMITPNEVSYNTLINGFVKEGKIGVATRVFNEMIELNLSPNLITYNILINGYCI 333
           L +M  +  + +EV+YNTLI G+ KEG    A  +  EM+   L+P++ITY  LI+  C 
Sbjct: 304 LTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCK 363

Query: 334 NGNFEEALRVLDVMEANDVRPNEVTIGTLLNGLYKSAKFDVARNILERYSINRTSLNCIS 393
            GN   A+  LD M    + PNE T  TL++G  +    + A  +L   + N  S + ++
Sbjct: 364 AGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVT 423

Query: 394 HTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPDIITFSVLINGFCKVGNINKAKEVMSKIY 453
           +  +I+G C  G +++A  +L +M + G+ PD++++S +++GFC+  ++++A  V  ++ 
Sbjct: 424 YNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMV 483

Query: 454 REGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAAMNLNGQNADNFTCNSLVASLCENGKLV 513
            +G  P+ + +S+LI   C+     EA   Y  M   G   D FT  +L+ + C  G L 
Sbjct: 484 EKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLE 543

Query: 514 EAEEFLHHISRIGLVPNSVTFDCIINGYANVGDGSGAFSVFDKMISCGHHPSPFTYGSLL 573
           +A +  + +   G++P+ VT+  +ING         A  +  K+      PS  TY +L+
Sbjct: 544 KALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLI 603

Query: 574 KVLCKGQNFWEARKLLKKLHCIPLAVDTISYNTLIVEISKSGNLLEAVRLFEEMIQNNIL 633
           +  C    F     L+K   C+                   G + EA ++FE M+  N  
Sbjct: 604 E-NCSNIEFKSVVSLIKGF-CM------------------KGMMTEADQVFESMLGKNHK 663

Query: 634 PDSYTYTCILSGLIREGRLVCAFIFLGRLMQKEILTLNSIVYTCFIDGLFKAGQ 682
           PD   Y  ++ G  R G +  A+  L + M K    L+++     +  L K G+
Sbjct: 664 PDGTAYNIMIHGHCRAGDIRKAYT-LYKEMVKSGFLLHTVTVIALVKALHKEGK 690

BLAST of CSPI03G39810 vs. Swiss-Prot
Match: PP437_ARATH (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 294.7 bits (753), Expect = 4.2e-78
Identity = 216/805 (26.83%), Positives = 365/805 (45.34%), Query Frame = 1

Query: 34  LALKFLKWVIKQPGLEPNHLTHILGITTHVLVRARLYGYAKSILKHLAQKNSGSNFLFGV 93
           L L+F  ++    G +  H T    I  H LV+A L+  A S+L+ L  +    + +F V
Sbjct: 86  LGLRFFNFLGLHRGFD--HSTASFCILIHALVKANLFWPASSLLQTLLLRALKPSDVFNV 145

Query: 94  LMDTYPLCS-SNPAVFDLLIRVYLRQGMVGHAVNTFSSMLIR-GFKPSVYTCNMIMASMV 153
           L   Y  C  S+ + FDLLI+ Y+R   V   V  F  M+ +    P V T + ++  +V
Sbjct: 146 LFSCYEKCKLSSSSSFDLLIQHYVRSRRVLDGVLVFKMMITKVSLLPEVRTLSALLHGLV 205

Query: 154 KNCRAHLVWSFFKQMLTSRVCPNVSSFNILISVLCVQGKLKKAVNILTMMERNGYVPTIV 213
           K     L    F  M++  + P+V  +  +I  LC    L +A  ++  ME  G    IV
Sbjct: 206 KFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMIAHMEATGCDVNIV 265

Query: 214 SYNTLLSWCCKKGRFKFALVLIHHMECKGIQADVCTYNMFIDSLCRNSRSAQGYLVLKKM 273
            YN L+   CKK +   A+ +   +  K ++ DV TY   +  LC+      G  ++ +M
Sbjct: 266 PYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFEIGLEMMDEM 325

Query: 274 RNKMITPNEVSYNTLINGFVKEGKIGVATRVFNEMIELNLSPNLITYNILINGYCINGNF 333
                +P+E + ++L+ G  K GKI  A  +   +++  +SPNL  YN LI+  C    F
Sbjct: 326 LCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALIDSLCKGRKF 385

Query: 334 EEALRVLDVMEANDVRPNEVTIGTLLNGLYKSAKFDVARNILERYSINRTSLNCISHTVM 393
            EA  + D M    +RPN+VT   L++   +  K D A + L         L+   +  +
Sbjct: 386 HEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLKLSVYPYNSL 445

Query: 394 IDGLCRNGLLDEAFQLLIEMCKDGVHPDIITFSVLINGFCKVGNINKAKEVMSKIYREGF 453
           I+G C+ G +  A   + EM    + P ++T++ L+ G+C  G INKA  +  ++  +G 
Sbjct: 446 INGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRLYHEMTGKGI 505

Query: 454 VPNNVIFSTLIYNSCKVGNVYEAMKFYAAMNLNGQNADNFTCNSLVASLCENGKLVEAEE 513
            P+   F+TL+    + G + +A+K +  M       +  T N ++   CE G + +A E
Sbjct: 506 APSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCEEGDMSKAFE 565

Query: 514 FLHHISRIGLVPNSVTFDCIINGYANVGDGSGAFSVFDKMISCGHHPSPFTYGSLLKVLC 573
           FL  ++  G+VP++ ++  +I+G    G  S A    D +       +   Y  LL   C
Sbjct: 566 FLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFC 625

Query: 574 KGQNFWEARKLLKKLHCIPLAVDTISYNTLIVEISKSGNLLEAVRLFEEMIQNNILPDSY 633
           +     EA  + +++    + +D + Y  LI    K  +      L +EM    + PD  
Sbjct: 626 REGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDV 685

Query: 634 TYTCILSGLIREGRLVCAFIFLGRLMQKEILTLNSIVYTCFIDGLFKAGQSKAALYLFKE 693
            YT ++    + G    AF  +  LM  E    N + YT  I+GL KAG    A  L  +
Sbjct: 686 IYTSMIDAKSKTGDFKEAF-GIWDLMINEGCVPNEVTYTAVINGLCKAGFVNEAEVLCSK 745

Query: 694 MEEKGLSVDLIA----LNSITDGYSRMGKVFSASSLISKTRNKNVIPNLTTFNILLHGYS 753
           M+      + +     L+ +T G   M K     + I     K ++ N  T+N+L+ G+ 
Sbjct: 746 MQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAIL----KGLLANTATYNMLIRGFC 805

Query: 754 RGQDIMSCFKLYNLMRRSGFFPNRLTYHSLILGLCNHGMLELGIKILKMFIAESSTIDDL 813
           R   I    +L   M   G  P+ +TY ++I  LC    ++  I++      +    D +
Sbjct: 806 RQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIELWNSMTEKGIRPDRV 865

Query: 814 TFNMLIRKCCEINDLDKVIDLTHNM 833
            +N LI  CC   ++ K  +L + M
Sbjct: 866 AYNTLIHGCCVAGEMGKATELRNEM 883

BLAST of CSPI03G39810 vs. Swiss-Prot
Match: PPR12_ARATH (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 292.4 bits (747), Expect = 2.1e-77
Identity = 179/579 (30.92%), Positives = 304/579 (52.50%), Query Frame = 1

Query: 57  LGITTHVLVRARLYGYAKSILKHLAQK---NSGSNFL--FGVLMDTYPLCSSNPAVFDLL 116
           L I  H+ V ++    A+S++    ++   N   +F+  F +L+ TY    S+P VFD+ 
Sbjct: 122 LCIVIHLAVASKDLKVAQSLISSFWERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVF 181

Query: 117 IRVYLRQGMVGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNC-RAHLVWSFFKQMLTSR 176
            +V +  G++  A   F  ML  G   SV +CN+ +  + K+C +       F++     
Sbjct: 182 FQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVG 241

Query: 177 VCPNVSSFNILISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFAL 236
           VC NV+S+NI+I  +C  G++K+A ++L +ME  GY P ++SY+T+++  C+ G      
Sbjct: 242 VCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVW 301

Query: 237 VLIHHMECKGIQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGF 296
            LI  M+ KG++ +   Y   I  LCR  + A+      +M  + I P+ V Y TLI+GF
Sbjct: 302 KLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGF 361

Query: 297 VKEGKIGVATRVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNE 356
            K G I  A++ F EM   +++P+++TY  +I+G+C  G+  EA ++   M    + P+ 
Sbjct: 362 CKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDS 421

Query: 357 VTIGTLLNGLYKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIE 416
           VT   L+NG  K+     A  +         S N +++T +IDGLC+ G LD A +LL E
Sbjct: 422 VTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHE 481

Query: 417 MCKDGVHPDIITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGN 476
           M K G+ P+I T++ ++NG CK GNI +A +++ +    G   + V ++TL+   CK G 
Sbjct: 482 MWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGE 541

Query: 477 VYEAMKFYAAMNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDC 536
           + +A +    M   G      T N L+   C +G L + E+ L+ +   G+ PN+ TF+ 
Sbjct: 542 MDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNS 601

Query: 537 IINGYANVGDGSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIP 596
           ++  Y    +   A +++  M S G  P   TY +L+K  CK +N  EA  L +++    
Sbjct: 602 LVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKG 661

Query: 597 LAVDTISYNTLIVEISKSGNLLEAVRLFEEMIQNNILPD 630
            +V   +Y+ LI    K    LEA  +F++M +  +  D
Sbjct: 662 FSVSVSTYSVLIKGFLKRKKFLEAREVFDQMRREGLAAD 700

BLAST of CSPI03G39810 vs. Swiss-Prot
Match: PP360_ARATH (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 285.0 bits (728), Expect = 3.3e-75
Identity = 170/611 (27.82%), Positives = 313/611 (51.23%), Query Frame = 1

Query: 52  HLTHILGITTHVLVRARLYGYAKSILKHLAQKNSGSNF-LFGVLMDTYPLCSSNPAVFDL 111
           H +  L    H+LVR+     A+S L  + +++  S   +   L  T+  C SN +VFDL
Sbjct: 111 HTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSGVSRLEIVNSLDSTFSNCGSNDSVFDL 170

Query: 112 LIRVYLRQGMVGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSR 171
           LIR Y++   +  A   F+ +  +GF  S+  CN ++ S+V+     L W  ++++  S 
Sbjct: 171 LIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEISRSG 230

Query: 172 VCPNVSSFNILISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFAL 231
           V  NV + NI+++ LC  GK++K    L+ ++  G  P IV+YNTL+S    KG  + A 
Sbjct: 231 VGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLMEEAF 290

Query: 232 VLIHHMECKGIQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGF 291
            L++ M  KG    V TYN  I+ LC++ +  +   V  +M    ++P+  +Y +L+   
Sbjct: 291 ELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRSLLMEA 350

Query: 292 VKEGKIGVATRVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNE 351
            K+G +    +VF++M   ++ P+L+ ++ +++ +  +GN ++AL   + ++   + P+ 
Sbjct: 351 CKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAGLIPDN 410

Query: 352 VTIGTLLNGLYKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIE 411
           V    L+ G  +     VA N+         +++ +++  ++ GLC+  +L EA +L  E
Sbjct: 411 VIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEADKLFNE 470

Query: 412 MCKDGVHPDIITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGN 471
           M +  + PD  T ++LI+G CK+GN+  A E+  K+  +    + V ++TL+    KVG+
Sbjct: 471 MTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGFGKVGD 530

Query: 472 VYEAMKFYAAMNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDC 531
           +  A + +A M          + + LV +LC  G L EA      +    + P  +  + 
Sbjct: 531 IDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTVMICNS 590

Query: 532 IINGYANVGDGSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIP 591
           +I GY   G+ S   S  +KMIS G  P   +Y +L+    + +N  +A  L+KK+    
Sbjct: 591 MIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKKMEEEQ 650

Query: 592 --LAVDTISYNTLIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVC 651
             L  D  +YN+++    +   + EA  +  +MI+  + PD  TYTC+++G + +  L  
Sbjct: 651 GGLVPDVFTYNSILHGFCRQNQMKEAEVVLRKMIERGVNPDRSTYTCMINGFVSQDNLTE 710

Query: 652 AFIFLGRLMQK 660
           AF     ++Q+
Sbjct: 711 AFRIHDEMLQR 721

BLAST of CSPI03G39810 vs. TrEMBL
Match: A0A067EZ46_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g000951mg PE=4 SV=1)

HSP 1 Score: 1283.1 bits (3319), Expect = 0.0e+00
Identity = 635/1080 (58.80%), Positives = 807/1080 (74.72%), Query Frame = 1

Query: 1    MENSIYTILTIGRWESLNHMNYKFASLRPIHGVLALKFLKWVIKQPGLEPNHLTHILGIT 60
            ME SIYT+LTI RWESLNHM YK ASLRP+HG LALKFL WV+ QPGLE  HLTHIL +T
Sbjct: 1    MEKSIYTLLTIDRWESLNHMEYKLASLRPVHGRLALKFLNWVMNQPGLELKHLTHILCLT 60

Query: 61   THVLVRARLYGYAKSILKHLAQKNSGSNFLFGVLMDTYPLCSSNPAVFDLLIRVYLRQGM 120
            THVLV+ R+Y  AK IL+ LAQ   G N +FG LM+TYPLC+SNP+VFDLLIRVYLR+GM
Sbjct: 61   THVLVKTRMYEDAKLILRQLAQMGIGQNSVFGSLMNTYPLCNSNPSVFDLLIRVYLREGM 120

Query: 121  VGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFNI 180
            V +A+ TF  M  RGF PSVYTCNM+++ M+K+ R   VW  F  ML  ++CPNV++FNI
Sbjct: 121  VEYALETFQLMGFRGFNPSVYTCNMMLSFMLKDRRVDSVWLLFDDMLDRKICPNVATFNI 180

Query: 181  LISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECKG 240
            LI+V CV+GKLKKA  +L  ME +GYVP IV+YNTLL+W CKKGR+K A  LI  M  KG
Sbjct: 181  LINVSCVEGKLKKAGYLLRKMEESGYVPNIVTYNTLLNWYCKKGRYKAAFKLIDCMASKG 240

Query: 241  IQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVAT 300
            I+ADVCTYNMFID LCRN+RSA+GYL+LK MR +MITPNEV+YNTLINGFVKEGKI VA+
Sbjct: 241  IEADVCTYNMFIDDLCRNNRSAKGYLLLKNMRKRMITPNEVTYNTLINGFVKEGKIQVAS 300

Query: 301  RVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNGL 360
            RVF+EM  LN SPN ITYN LI+G+C  GNF+EA R+L +ME   +RPNEV+ G LLNG 
Sbjct: 301  RVFDEMSMLNFSPNSITYNELIDGHCCKGNFKEAFRLLAMMEEMGLRPNEVSYGALLNGF 360

Query: 361  YKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPDI 420
             K AKFD+AR++LER   N  S++CI++T +IDGLC+ GLLDEA QL  +M KDG++PD+
Sbjct: 361  CKHAKFDLARSLLERMRTNGISISCIAYTSVIDGLCKCGLLDEAMQLFNKMFKDGLNPDL 420

Query: 421  ITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAA 480
            ITFSVLINGFCKVG   KAK V+ K+YR+G VPN +I+STLIY  CK+G V EAMK YA 
Sbjct: 421  ITFSVLINGFCKVGMTRKAKAVLCKMYRDGLVPNKIIYSTLIYYFCKMGKVTEAMKVYAV 480

Query: 481  MNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVGD 540
            MN N Q +D+FTCN LVASLC+ GK+ EAE+++ H+ RIG+VPNS+TFDC+I+GY  +GD
Sbjct: 481  MNRNAQGSDHFTCNMLVASLCKGGKVCEAEDYVGHMKRIGVVPNSITFDCMIDGYGTLGD 540

Query: 541  GSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYNT 600
            G  AFS+FD+M+  GHHPS FTYGSLLK LCKG N  EA++ L  LH IP AVDT++YNT
Sbjct: 541  GLKAFSMFDEMVKLGHHPSIFTYGSLLKGLCKGGNLKEAKRFLNSLHHIPSAVDTVAYNT 600

Query: 601  LIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQKE 660
            ++ E  KSGNL EA+ L +EM+Q N+LPD YTYT +L+GL R+G++V A +F  +++ K 
Sbjct: 601  ILAETCKSGNLWEAIVLLDEMVQFNLLPDRYTYTILLAGLCRKGKVVSALLFFEKVVSKR 660

Query: 661  ILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEEKGLSVDLIALNSITDGYSRMGKVFSA 720
              + N++++TC +DGLFKAGQSKAA+++ K M+++G+  D IA N++ DG+SRMG +  A
Sbjct: 661  TFSPNNVMFTCLVDGLFKAGQSKAAMHISKIMDKEGVYPDTIAFNAVMDGFSRMGNMMMA 720

Query: 721  SSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLILG 780
            + L+S  R++ + P+L T+NILLHGYS+ +D++ C  L N M+  G  P++LT HSLILG
Sbjct: 721  NDLLSTMRSRKLCPSLATYNILLHGYSKKKDLLMCSMLLNTMKMEGLLPDKLTCHSLILG 780

Query: 781  LCNHGMLELGIKILKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSLD 840
             C  GMLE+G K LK  IAE + +D  TFN+L+RKCCE  ++ K  DL + M +  V  D
Sbjct: 781  FCETGMLEVGFKFLKKMIAEGTMVDCFTFNVLMRKCCEAGEMGKAFDLFNIMNMLGVVPD 840

Query: 841  KDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLKD 900
             +TQ A+   L R    Q     +  M +KG  P   QY T++  MCRVG+ QGAFKLKD
Sbjct: 841  TNTQDAIIMGLKRIAAFQESHFVLRGMAEKGLTPKCTQYITLINGMCRVGNFQGAFKLKD 900

Query: 901  QMVALGISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKKD 960
            +M ALGIS  D AE AMVRGLA CGK+EEAM +L RMLRM+ +PT +TFTTL+H FCK+ 
Sbjct: 901  EMEALGISSSDVAESAMVRGLAHCGKVEEAMLVLNRMLRMRLVPTIATFTTLIHKFCKEA 960

Query: 961  NFKEAHNLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTYR 1020
             F +A  LK  ME   VKLD+V+YNVLIS  CANGDV+ A + YEE+K KGL PN TTY 
Sbjct: 961  KFVDALKLKGTMELSGVKLDVVSYNVLISGLCANGDVMPAFELYEEMKHKGLCPNSTTYS 1020

Query: 1021 VLVSAISTK-HYVSRGEIVLKDLNDRGLVSGYLDGKLQKSCRDFVVAIKKLNSLKPNQGN 1080
            VL+ AIS K + + +GEI+LKD+ +RG +S   DG  Q      + A++KL S K N+ N
Sbjct: 1021 VLIDAISKKENNLVKGEILLKDIQERGFISWNWDGSTQHLHEGLINALRKLKSFKKNRRN 1080

BLAST of CSPI03G39810 vs. TrEMBL
Match: V4UNJ1_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10007298mg PE=4 SV=1)

HSP 1 Score: 1278.5 bits (3307), Expect = 0.0e+00
Identity = 632/1080 (58.52%), Positives = 805/1080 (74.54%), Query Frame = 1

Query: 1    MENSIYTILTIGRWESLNHMNYKFASLRPIHGVLALKFLKWVIKQPGLEPNHLTHILGIT 60
            ME SIYT+LTI RWESLNHM YK ASLRP+HG LALKFL WV+ QPGLE  HLTHIL +T
Sbjct: 1    MEKSIYTLLTIDRWESLNHMEYKLASLRPVHGRLALKFLNWVMNQPGLELKHLTHILCLT 60

Query: 61   THVLVRARLYGYAKSILKHLAQKNSGSNFLFGVLMDTYPLCSSNPAVFDLLIRVYLRQGM 120
            THVLV+ R+Y  AK IL+ LAQ   G N +FG LM+TYPLC+SNP+VFDLLIRVYLR+GM
Sbjct: 61   THVLVKTRMYEDAKLILRQLAQMGIGQNSVFGSLMNTYPLCNSNPSVFDLLIRVYLREGM 120

Query: 121  VGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFNI 180
            V +A+ TF  M  RGF PSVYTCNM+++ M+K+ R    W  F  ML  ++CPNV++FNI
Sbjct: 121  VEYALETFQLMGFRGFNPSVYTCNMMLSFMLKDRRVDSAWLLFDDMLGRKICPNVATFNI 180

Query: 181  LISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECKG 240
            LI+V CV+GKLKKA  +L  ME +GYVP IV+YNTLL+W CKKGR+K A  LI  M  KG
Sbjct: 181  LINVSCVEGKLKKAGYLLRKMEESGYVPNIVTYNTLLNWYCKKGRYKAAFKLIDCMASKG 240

Query: 241  IQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVAT 300
            I+ADVCTYNMFID LCRN+RSA+GYL+LK MR +MITPNEV+YN LINGFVKEGKI VA+
Sbjct: 241  IEADVCTYNMFIDDLCRNNRSAKGYLLLKNMRKRMITPNEVTYNNLINGFVKEGKIQVAS 300

Query: 301  RVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNGL 360
            RVF+EM  LN SPN ITYN LI+G+C  GNF+EA R+L +ME   +RPNEV+ G LLNG 
Sbjct: 301  RVFDEMSMLNFSPNSITYNELIDGHCCKGNFKEAFRLLAMMEEMGLRPNEVSYGALLNGF 360

Query: 361  YKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPDI 420
             K AKFD+AR++LER   N  S++CI++T +IDGLC+ GLLDEA Q+  +M KDG++PD+
Sbjct: 361  CKHAKFDLARSLLERMRTNGISISCIAYTSVIDGLCKCGLLDEAMQVFNKMFKDGLNPDL 420

Query: 421  ITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAA 480
            ITFSVLINGFCKVG   KAK V+ K+YR+G VPN +I+STLIY  CK+G V EAMK YA 
Sbjct: 421  ITFSVLINGFCKVGMTRKAKAVLCKMYRDGLVPNKIIYSTLIYYFCKMGKVMEAMKVYAV 480

Query: 481  MNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVGD 540
            MN N Q +D+FTCN LVASLC+ GK+ EAE+++ H+ RIG+VPNS+TFDC+I+GY  +GD
Sbjct: 481  MNRNAQGSDHFTCNMLVASLCKGGKVCEAEDYVGHMKRIGVVPNSITFDCMIDGYGTLGD 540

Query: 541  GSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYNT 600
            G  AFS+FD+M+  GHHPS FTYGSLLK LCKG N  EA++ L  LH IP AVDT++YNT
Sbjct: 541  GLKAFSMFDEMVKLGHHPSIFTYGSLLKGLCKGGNLKEAKRFLNSLHHIPSAVDTVAYNT 600

Query: 601  LIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQKE 660
            ++ E  KSGNL EA+ L +EM+Q N+LPD YTYT +L+GL R+G++V A +F  +++ K 
Sbjct: 601  ILAETCKSGNLWEAIVLLDEMVQFNLLPDRYTYTILLAGLCRKGKVVSALLFFEKVVSKR 660

Query: 661  ILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEEKGLSVDLIALNSITDGYSRMGKVFSA 720
              + N++++TC +DGLFKAGQSKAA+++ K M+++G+  D IA N++ DG+SRMG +  A
Sbjct: 661  TFSPNNVMFTCLVDGLFKAGQSKAAMHISKIMDKEGVYPDTIAFNAVMDGFSRMGNMMMA 720

Query: 721  SSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLILG 780
            + L+S  R++ + P+L T+NILLHGYS+ +D++ C  L N M+  G  P++LT HSLILG
Sbjct: 721  NDLLSTMRSRKLCPSLATYNILLHGYSKKKDLLMCSMLLNTMKMEGLLPDKLTCHSLILG 780

Query: 781  LCNHGMLELGIKILKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSLD 840
             C  GMLE+G K LK  IAE + +D  TFN+L+RKCCE  ++ K  DL + M +  V  D
Sbjct: 781  FCETGMLEVGFKFLKKMIAEGTMVDCFTFNVLMRKCCEAGEMGKAFDLFNIMNMLGVVPD 840

Query: 841  KDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLKD 900
             +TQ A+   L R    Q     +  M +KG  P   QY T++  MCRVG+ QGAFKLKD
Sbjct: 841  TNTQDAIIMGLKRIAAFQESHFVLRGMAEKGLTPKCTQYITLINGMCRVGNFQGAFKLKD 900

Query: 901  QMVALGISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKKD 960
            +M ALGIS  D AE AMVRGLA CGK+EEAM +L RMLRM+ +PT +TFTTL+H FCK+ 
Sbjct: 901  EMEALGISSSDVAESAMVRGLAHCGKVEEAMLVLNRMLRMRLVPTIATFTTLIHKFCKEA 960

Query: 961  NFKEAHNLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTYR 1020
             F +A  LK  ME   VKLD+V+YNVLIS  CANGDV+ A + YEE+K KGL PN TTY 
Sbjct: 961  KFVDALKLKGTMELSGVKLDVVSYNVLISGLCANGDVMPAFELYEEMKHKGLCPNSTTYS 1020

Query: 1021 VLVSAISTK-HYVSRGEIVLKDLNDRGLVSGYLDGKLQKSCRDFVVAIKKLNSLKPNQGN 1080
            VL+ AIS K + + +GEI+LKD+ +RG +S   DG  Q      + A++KL S K N+ N
Sbjct: 1021 VLIDAISKKENNLVKGEILLKDIQERGFISWNWDGSTQHLHEGLINALRKLKSFKKNRRN 1080
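
The percentages on the summary line of each hit follow directly from the counts next to them (matched positions divided by alignment length). The following is a minimal sketch, not part of the database output, that parses such a line and recomputes both values; the names `STATS_RE` and `parse_stats` are hypothetical and chosen only for illustration.

```python
import re

# Matches the summary line of a hit. The "Pos\w*" part tolerates spelling
# variants of the "Positives" label (an assumption about the input text).
STATS_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_stats(line):
    """Return identity/positive counts and percentages recomputed from them."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError("not an Identity/Positives summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len)),
        "positives": (int(pos), int(pos_len)),
        # Recomputed values, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_pct": (float(ident_pct), float(pos_pct)),
    }


# Summary line of the V4UNJ1_9ROSI hit.
stats = parse_stats(
    "Identity = 632/1080 (58.52%), Positives = 805/1080 (74.54%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing `identity_pct` with `reported_pct[0]` is a quick consistency check when these lines are copied by hand.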

BLAST of CSPI03G39810 vs. TrEMBL
Match: B9S9V6_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_0522600 PE=4 SV=1)

HSP 1 Score: 1265.4 bits (3273), Expect = 0.0e+00
Identity = 617/1072 (57.56%), Positives = 806/1072 (75.19%), Query Frame = 1

Query: 8    ILTIGRWESLNHMNYKFASLRPIHGVLALKFLKWVIKQPGLEPNHLTHILGITTHVLVRA 67
            +LTI RWESLNHM YK ASLRP+HG LALKFL WVI+QPGLE  HLTH+L ITTH+LVRA
Sbjct: 1    MLTIDRWESLNHMEYKLASLRPVHGRLALKFLNWVIQQPGLELRHLTHMLSITTHILVRA 60

Query: 68   RLYGYAKSILKHLAQKNSGSNFLFGVLMDTYPLCSSNPAVFDLLIRVYLRQGMVGHAVNT 127
            RLY  AKSILKHL+Q   GS  +FG LM+TYPLC SNP+VFDLLIRVYLR+GMVG A+ T
Sbjct: 61   RLYENAKSILKHLSQMGVGSKSVFGALMNTYPLCKSNPSVFDLLIRVYLREGMVGDALET 120

Query: 128  FSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFNILISVLCV 187
            F  M IRGF PSVYTCNM++  +VK  +   VW FFK+ML  RVCP+VS+FNILI+VLCV
Sbjct: 121  FRLMGIRGFNPSVYTCNMLLGKLVKERKVGAVWLFFKEMLARRVCPDVSTFNILINVLCV 180

Query: 188  QGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECKGIQADVCT 247
            +GKLKKA  +L  ME +GYVP++V+YNT+L+W CKKGR+K AL LI  M  KGI+AD CT
Sbjct: 181  EGKLKKAGYLLKKMEESGYVPSVVTYNTVLNWYCKKGRYKAALELIDQMGSKGIEADACT 240

Query: 248  YNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVATRVFNEMI 307
            YNM +D LC+N+RSA+GYL+LKKMR +MI+PNE++YN++INGFVKEGKIG ATR+F EM 
Sbjct: 241  YNMLVDDLCKNNRSAKGYLLLKKMRKRMISPNEITYNSIINGFVKEGKIGAATRIFQEMS 300

Query: 308  ELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNGLYKSAKFD 367
             LNL PN +TYN LI+G+C +GNFE+AL +L++MEA   +PNEV+   LLNGL + AKF+
Sbjct: 301  MLNLLPNCVTYNALIDGHCHDGNFEQALTILEMMEATGPKPNEVSYSALLNGLCRHAKFE 360

Query: 368  VARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPDIITFSVLI 427
            ++++ILER  +N   + CI++T MIDGLCRNGLL+E+ +LL +M KDGV PD++TFSVLI
Sbjct: 361  LSKSILERMRMNGMIVGCIAYTAMIDGLCRNGLLNESVKLLDKMLKDGVVPDVVTFSVLI 420

Query: 428  NGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAAMNLNGQN 487
            NGFC+VG I   KE++ K+Y+ G  PN++I++TLIYN CK G+V EA K Y AM+  G +
Sbjct: 421  NGFCRVGKIKNVKEIICKMYKAGLAPNSIIYTTLIYNYCKTGDVVEAFKVYVAMSRIGYD 480

Query: 488  ADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVGDGSGAFSV 547
            A+ F CN LV+SLC++GK+  AE F HH+S+IG VPNS+TFDCIINGY N G+G  AFS+
Sbjct: 481  ANCFICNVLVSSLCKDGKVGVAEYFFHHMSKIGNVPNSITFDCIINGYGNSGNGLKAFSM 540

Query: 548  FDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYNTLIVEISK 607
            FD+MI  GHHPS FTYG LLK LC+   F EA++LL KLH IP AVDT++YNT++VE  K
Sbjct: 541  FDEMIKAGHHPSHFTYGGLLKALCRAGKFKEAKRLLDKLHYIPSAVDTVTYNTILVETFK 600

Query: 608  SGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQKEILTLNSI 667
            SG L +AV LF+EM+Q N+LPDSYTY  I +GLIR G++V A  F G L+ K  ++   +
Sbjct: 601  SGMLTDAVALFDEMVQRNVLPDSYTYAIIFAGLIRRGKMVAALHFYGNLLGKGAVSPEKV 660

Query: 668  VYTCFIDGLFKAGQSKAALYLFKEMEEKGLSVDLIALNSITDGYSRMGKVFSASSLISKT 727
            +YT F+DGLF+AGQSKAALY  ++ME+ GL  DLIA N I +GYSRMGK+  A  + +  
Sbjct: 661  MYTTFVDGLFRAGQSKAALYFCEDMEKNGLCADLIATNVILNGYSRMGKMAKAGDIFTMM 720

Query: 728  -RNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLILGLCNHGM 787
                 + P+L T+NILLHGY++ +++  C  LYN+M R+G FP++LT HSLILG C   M
Sbjct: 721  WSGITISPSLATYNILLHGYAKKKNLSKCSNLYNIMMRTGIFPDKLTCHSLILGFCKSAM 780

Query: 788  LELGIKILKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSLDKDTQKA 847
            L++G+K+LK  + +   +D  TFNMLI K CE +++ K  DL + M +F +  D  T  +
Sbjct: 781  LDVGLKLLKKMLLDGVAVDQCTFNMLIMKYCETDEVGKAFDLVNIMNLFDIFPDMTTHDS 840

Query: 848  VTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLKDQMVALG 907
            +  VL R    Q   + +HEML++G IP  +QY  ++ RMCR+G I GAFKLKD+M ALG
Sbjct: 841  IISVLSRVSTVQESHLLLHEMLERGCIPDRRQYIALVNRMCRMGHIHGAFKLKDEMEALG 900

Query: 908  ISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKKDNFKEAH 967
            IS  D AE A+VRGLA CGK+EEA  +L  MLR   IPT +TFTTLMH+FC+ ++  EA 
Sbjct: 901  ISSGDVAESALVRGLAKCGKVEEAKLVLDFMLRKSLIPTIATFTTLMHMFCRNESLVEAL 960

Query: 968  NLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTYRVLVSAI 1027
             LK  M+   VKLD++AYNVLIS  CA+GDV +AL  Y+EIKQ+GL PNMTTY +L+ AI
Sbjct: 961  KLKDTMDFCDVKLDVIAYNVLISGLCADGDVASALKLYKEIKQRGLWPNMTTYCILIDAI 1020

Query: 1028 STKHY-VSRGEIVLKDLNDRGLVSGYLDGKLQKSCRDFVVAIKKLNSLKPNQ 1078
             T    +++GE++LKDL +RG++SG+  G +++     ++A+ +L S+K N+
Sbjct: 1021 FTNDISLAKGEVLLKDLQERGVISGHWCGGIRQG---LIIAMDRLKSMKANR 1069

BLAST of CSPI03G39810 vs. TrEMBL
Match: M5W514_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021196mg PE=4 SV=1)

HSP 1 Score: 1259.2 bits (3257), Expect = 0.0e+00
Identity = 619/1026 (60.33%), Positives = 772/1026 (75.24%), Query Frame = 1

Query: 1    MENSIYTILTIGRWESLNHMNYKFASLRPIHGVLALKFLKWVIKQPGLEPNHLTHILGIT 60
            ME SIY ILTI RWESLNHM+Y+ ASLRP+HG LALKFL WVIKQPGLE NHLTHIL +T
Sbjct: 1    MEKSIYAILTIDRWESLNHMDYRLASLRPVHGRLALKFLNWVIKQPGLELNHLTHILSVT 60

Query: 61   THVLVRARLYGYAKSILKHLAQKNSGSNFLFGVLMDTYPLCSSNPAVFDLLIRVYLRQGM 120
            TH+LVRAR+Y  AKSIL HL Q       +FG LMDTY LC+SNP+VFDLLIRVYLR+GM
Sbjct: 61   THILVRARMYDSAKSILGHLLQMGIAPKPVFGALMDTYSLCNSNPSVFDLLIRVYLREGM 120

Query: 121  VGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFNI 180
            V +AV T   M  RGF+PS  TCNMI+A + K+ +A  VWSFFK+ML +++CP+V++FNI
Sbjct: 121  VDYAVETSYLMGFRGFRPSTCTCNMILAWLAKDQKAGSVWSFFKEMLANKICPDVATFNI 180

Query: 181  LISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECKG 240
            LIS+LCV+GKLKKA  +L  ME++GYVP IVSYNTLL+W CKKGR+K A  LI HM  KG
Sbjct: 181  LISLLCVEGKLKKASYLLRKMEKSGYVPNIVSYNTLLNWYCKKGRYKTAFELIDHMGSKG 240

Query: 241  IQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVAT 300
            I+ADVCTYNM I  LCRN+RSA+GYL+LKKMR K ++PNEV+YN LINGFV EGK+GVAT
Sbjct: 241  IEADVCTYNMLIGDLCRNNRSAKGYLLLKKMRRKKLSPNEVTYNILINGFVMEGKLGVAT 300

Query: 301  RVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNGL 360
            RVF+EM   NLSPN +T+N LI G C NG  EEA R+LD+MEA  +RPNEV+ G LLNGL
Sbjct: 301  RVFDEMSTFNLSPNFVTFNALIGGLCQNGKLEEAFRLLDMMEAMGLRPNEVSYGALLNGL 360

Query: 361  YKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPDI 420
             K AKFD+AR++ ER  +N   ++C  +T ++DGLC+NGLLDEA QL   M +DGV PDI
Sbjct: 361  CKHAKFDLARSLFERMRMNGIVISCTIYTAIMDGLCKNGLLDEAMQLFNMMVQDGVDPDI 420

Query: 421  ITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAA 480
            I FSVL+NG C+ G +  A+E++ KIY+ G  PN +I STLIYNSCK+GN+ EA+K YA 
Sbjct: 421  IAFSVLVNGLCRAGKMKHAREILCKIYKAGLAPNRIICSTLIYNSCKMGNIVEALKIYAV 480

Query: 481  MNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVGD 540
            MN NG  AD FTCN LVASLCE GK+  AE+F+ H+  +GL P+SVT+DCIING+ N+G+
Sbjct: 481  MNHNGHGADRFTCNILVASLCEAGKVEVAEDFMRHMGSMGLDPDSVTYDCIINGHGNMGN 540

Query: 541  GSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYNT 600
            G  +FS+FD+MI  GHHP+PFTYGS+LK LCKG NF EARK LKKLH IP  VDT+ YNT
Sbjct: 541  GLKSFSMFDEMIKSGHHPTPFTYGSILKGLCKGGNFGEARKFLKKLHGIPSVVDTVIYNT 600

Query: 601  LIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQKE 660
            +I E  KSGNL EAV L +EM++NN+LPD YTY  +L+GL R+G++V A +  G+LM K 
Sbjct: 601  IIYETCKSGNLQEAVSLLDEMVENNVLPDDYTYGSLLAGLCRKGKMVAAILLFGKLMGKV 660

Query: 661  ILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEEKGLSVDLIALNSITDGYSRMGKVFSA 720
              + ++I+YTC +DGLFK GQSKAALYLF+EME KGL +D +A N + DGYSRMGK+  A
Sbjct: 661  TCSQSAIMYTCLVDGLFKTGQSKAALYLFEEMENKGLYLDTVACNVMIDGYSRMGKLMKA 720

Query: 721  SSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLILG 780
            + L S  R+  + PNL T+NILLHGYS+ +D++ C  LYN M R+  FP++LT HSLILG
Sbjct: 721  NELFSTMRSSRLCPNLATYNILLHGYSKNRDLVKCSMLYNNMIRARLFPDKLTCHSLILG 780

Query: 781  LCNHGMLELGIKILKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSLD 840
            LC  GML++G K+L   I E +  D LT NML+ K  E   + K  +L   + + RVS +
Sbjct: 781  LCESGMLDVGHKMLNKMIMEGAIADHLTVNMLVSKYSETGKMVKAFELVSVLNLLRVSAN 840

Query: 841  KDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLKD 900
             DT  A+ + L R    Q     ++EML+KGF P    Y T++  MCRVGDIQGAF+LKD
Sbjct: 841  IDTHVAILNGLFRSQDFQASRALLYEMLEKGFTPKDTHYFTLINGMCRVGDIQGAFELKD 900

Query: 901  QMVALGISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKKD 960
             + ALG++  D AE A+VRGLA CGKIEEAM +L RMLRMK IPTT+TFTTLMH+FCK+ 
Sbjct: 901  HIEALGVTTSDIAESALVRGLAKCGKIEEAMLVLDRMLRMKLIPTTATFTTLMHMFCKQA 960

Query: 961  NFKEAHNLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTYR 1020
            N   A  L+  ME   VKLD+  +NVLIS  CANGDV+ A + YEE+KQ+GL+PN TTY 
Sbjct: 961  NLAVALKLRGTMECCGVKLDVPVFNVLISGLCANGDVVVAFELYEEMKQRGLMPNTTTYT 1020

Query: 1021 VLVSAI 1027
            +L+ A+
Sbjct: 1021 LLIGAV 1026
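
In each alignment row, the unlabeled middle line encodes the comparison under the usual BLAST convention: a letter marks an identical residue pair, `+` marks a positive-scoring substitution, and a space marks a mismatch or gap. The sketch below counts identities and positives for a single row under that assumption; `count_midline` is a hypothetical helper, not something the database provides.

```python
def count_midline(query, midline, subject):
    """Count identities and positives in one alignment row.

    Assumes the usual BLAST midline convention: a letter marks an identical
    residue pair, '+' marks a positive-scoring substitution, and a space
    marks a mismatch or a gap.
    """
    # Trailing spaces of the midline are often stripped; pad to full width.
    midline = midline.ljust(len(query))
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("rows must have equal length")
    identities = sum(
        1
        for q, m, s in zip(query, midline, subject)
        if m == q == s and q != "-"
    )
    # Positives include the identical positions plus the '+' positions.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last row of the M5W514_PRUPE alignment (query positions 1021 to 1027).
row = count_midline("VLVSAI", "+L+ A+", "LLIGAV")
print(row)
```

Summing these counts over all rows of a hit should reproduce the Identity and Positives counts on its summary line, provided the rows are copied with their original spacing.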

BLAST of CSPI03G39810 vs. TrEMBL
Match: F6GYT0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0117g00250 PE=4 SV=1)

HSP 1 Score: 1245.7 bits (3222), Expect = 0.0e+00
Identity = 627/1079 (58.11%), Positives = 788/1079 (73.03%), Query Frame = 1

Query: 1    MENSIYTILTIGRWESLNHMNYKFASLRPIHGVLALKFLKWVIKQPGLEPNHLTHILGIT 60
            +E+SIYTILTI RWESLNHM Y    LRP+HG LALKFLKWVIKQPGLE  HLTH+  +T
Sbjct: 59   VESSIYTILTIDRWESLNHMAYGLKQLRPVHGRLALKFLKWVIKQPGLELKHLTHMYCLT 118

Query: 61   THVLVRARLYGYAKSILKHLAQKNSGSNFLFGVLMDTYPLCSSNPAVFDLLIRVYLRQGM 120
             H+LV+AR+Y  AKSIL+HL Q   GS  +FG LMDTYPLC+S P+VFDLLIRVYL++GM
Sbjct: 119  AHILVKARMYDSAKSILRHLCQMGIGSKSIFGALMDTYPLCNSIPSVFDLLIRVYLKEGM 178

Query: 121  VGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFNI 180
            + +AV TF  + + GFKPSVYTCNMI+ASMVK+ R  LVWS F++M    +CPNV +FNI
Sbjct: 179  IDYAVETFELVGLVGFKPSVYTCNMILASMVKDKRTELVWSLFREMSDKGICPNVGTFNI 238

Query: 181  LISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECKG 240
            LI+ LCV+G LKKA N+L  ME NG+VPTIV+YNTLL+W CKKGR+K A+ LI +M CKG
Sbjct: 239  LINGLCVEGNLKKAGNLLKQMEENGFVPTIVTYNTLLNWYCKKGRYKAAIELIDYMICKG 298

Query: 241  IQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVAT 300
            I+ADVCTYN+FID+LC N RSA+ YL+LKKMR +MI+PNEV+YNTLINGFVKEGKIGVA 
Sbjct: 299  IEADVCTYNVFIDNLCTNHRSAKAYLLLKKMRKEMISPNEVTYNTLINGFVKEGKIGVAA 358

Query: 301  RVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNGL 360
            +VFNEM + +LSPN +TYN LI G+C  G+FEEALR+LD MEA  +R NEVT GTLLNGL
Sbjct: 359  QVFNEMSKFDLSPNCVTYNALIGGHCHVGDFEEALRLLDHMEAAGLRLNEVTYGTLLNGL 418

Query: 361  YKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPDI 420
             K  KF++A+ +LER  +N   +  I++TV+IDGLC+NG+LDEA QL+  M KDGV+PD+
Sbjct: 419  CKHEKFELAKRLLERMRVNDMVVGHIAYTVLIDGLCKNGMLDEAVQLVGNMYKDGVNPDV 478

Query: 421  ITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAA 480
            IT+S LINGFC+VGNI  AKE++ ++YR G V N +I+STLIYN C+ GNV EAMK YA 
Sbjct: 479  ITYSSLINGFCRVGNIKSAKEIICRMYRSGLVLNKIIYSTLIYNFCQHGNVTEAMKVYAV 538

Query: 481  MNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVGD 540
            MN NG  AD+FTCN LV+SLC +GKL EAE+FL H+SRIGLVPNS+T+DCIINGY ++GD
Sbjct: 539  MNCNGHGADHFTCNVLVSSLCRDGKLGEAEKFLCHMSRIGLVPNSITYDCIINGYGSIGD 598

Query: 541  GSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYNT 600
               AFS FD MI CG HPS FTYGSLLK LCKG N  EA+K L +LH IP AVD++ YNT
Sbjct: 599  PLNAFSFFDDMIKCGQHPSFFTYGSLLKGLCKGGNLVEAKKFLNRLHYIPGAVDSVMYNT 658

Query: 601  LIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQKE 660
            L+ E  KSGNL EAV LF++M+QNN+LPDSYTY+ +L+GL R+G+ V A    G  M + 
Sbjct: 659  LLAETCKSGNLHEAVALFDKMVQNNVLPDSYTYSSLLTGLCRKGKAVTAVCLFGTAMGRG 718

Query: 661  ILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEEKGLSVDLIALNSITDGYSRMGKVFSA 720
             L  N ++YTC +DGL KAG  KAA Y F+EM +KG   D +A N+I D  SR G++  A
Sbjct: 719  TLFPNHVMYTCLVDGLSKAGHPKAAFYFFEEMMKKGTCPDTVAFNAIIDSCSRRGQMMKA 778

Query: 721  SSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLILG 780
            +   S  R   V PNL T+NILLHG+S+ Q ++    LY+ M R G FP++LT+HSLILG
Sbjct: 779  NDFFSTMRWWGVCPNLATYNILLHGFSKKQALLRYLSLYSTMMREGIFPDKLTFHSLILG 838

Query: 781  LCNHGMLELGIKILKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSLD 840
            L   G+ +LG+K+L   I E +  D  TFN+LI K  E   + K  DL + M    V  D
Sbjct: 839  LSKSGIPDLGVKLLGKMIMEGTLADQFTFNILINKYSESGKMRKAFDLVNFMNTLGVFPD 898

Query: 841  KDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLKD 900
            +DT   + + L ++   +   V +HEML+ G IP   QY T++  MCRVGDIQGAFKLKD
Sbjct: 899  RDTYNHIFNGLNKKSAFRESTVVLHEMLENGVIPKHAQYITLINGMCRVGDIQGAFKLKD 958

Query: 901  QMVALGISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKKD 960
            +M ALG    + AE AMVRGL  CGK E+AM +L  MLRM+ +PT +TFTTLMH FC+  
Sbjct: 959  EMEALGFGSHEVAESAMVRGLLHCGKTEDAMLVLDHMLRMRLLPTIATFTTLMHRFCRDA 1018

Query: 961  NFKEAHNLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTYR 1020
               EA  LK +ME   +KLD+VAYNVLI   CANGD   A + YEE++ + L PN+TTY 
Sbjct: 1019 KIAEALKLKGVMELCGLKLDVVAYNVLIMGMCANGDSAAAFELYEEMRHRDLCPNITTYA 1078

Query: 1021 VLVSAISTKHYVSRGEIVLKDLNDRGLVSGYLDGKLQKSCRDFVVAIKKLNSLKPNQGN 1080
            VLV AIS  + + +GE +L DL +RGL+S    G  Q   ++  VA+ KLN ++  + N
Sbjct: 1079 VLVDAISAANNLIQGEKLLTDLQERGLIS--WGGSTQHLDKELTVAMGKLNYIRFKRRN 1135

BLAST of CSPI03G39810 vs. TAIR10
Match: AT5G55840.1 (AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 1124.4 bits (2907), Expect = 0.0e+00
Identity = 556/1078 (51.58%), Positives = 762/1078 (70.69%), Query Frame = 1

Query: 1    MENSIYTILTIGRWESLNHMNYKFASLRPIHGVLALKFLKWVIKQPGLEPNHLTHILGIT 60
            ME SIY ILTI RW SLNHM+Y+ A LR +HG LALKFLKWV+KQPGLE +H+  ++ IT
Sbjct: 59   MEKSIYNILTIDRWGSLNHMDYRQARLRLVHGKLALKFLKWVVKQPGLETDHIVQLVCIT 118

Query: 61   THVLVRARLYGYAKSILKHLAQKNSGSNFLFGVLMDTYPLCSSNPAVFDLLIRVYLRQGM 120
            TH+LVRAR+Y  A+ ILK L+  +  S+F+FG LM TY LC+SNP+V+D+LIRVYLR+GM
Sbjct: 119  THILVRARMYDPARHILKELSLMSGKSSFVFGALMTTYRLCNSNPSVYDILIRVYLREGM 178

Query: 121  VGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFNI 180
            +  ++  F  M + GF PSVYTCN I+ S+VK+     VWSF K+ML  ++CP+V++FNI
Sbjct: 179  IQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFNI 238

Query: 181  LISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECKG 240
            LI+VLC +G  +K+  ++  ME++GY PTIV+YNT+L W CKKGRFK A+ L+ HM+ KG
Sbjct: 239  LINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKG 298

Query: 241  IQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVAT 300
            + ADVCTYNM I  LCR++R A+GYL+L+ MR +MI PNEV+YNTLINGF  EGK+ +A+
Sbjct: 299  VDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIAS 358

Query: 301  RVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNGL 360
            ++ NEM+   LSPN +T+N LI+G+   GNF+EAL++  +MEA  + P+EV+ G LL+GL
Sbjct: 359  QLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGL 418

Query: 361  YKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPDI 420
             K+A+FD+AR    R   N   +  I++T MIDGLC+NG LDEA  LL EM KDG+ PDI
Sbjct: 419  CKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDI 478

Query: 421  ITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAA 480
            +T+S LINGFCKVG    AKE++ +IYR G  PN +I+STLIYN C++G + EA++ Y A
Sbjct: 479  VTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEA 538

Query: 481  MNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVGD 540
            M L G   D+FT N LV SLC+ GK+ EAEEF+  ++  G++PN+V+FDC+INGY N G+
Sbjct: 539  MILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNSGE 598

Query: 541  GSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYNT 600
            G  AFSVFD+M   GHHP+ FTYGSLLK LCKG +  EA K LK LH +P AVDT+ YNT
Sbjct: 599  GLKAFSVFDEMTKVGHHPTFFTYGSLLKGLCKGGHLREAEKFLKSLHAVPAAVDTVMYNT 658

Query: 601  LIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQKE 660
            L+  + KSGNL +AV LF EM+Q +ILPDSYTYT ++SGL R+G+ V A +F      + 
Sbjct: 659  LLTAMCKSGNLAKAVSLFGEMVQRSILPDSYTYTSLISGLCRKGKTVIAILFAKEAEARG 718

Query: 661  ILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEEKGLSVDLIALNSITDGYSRMGKVFSA 720
             +  N ++YTCF+DG+FKAGQ KA +Y  ++M+  G + D++  N++ DGYSRMGK+   
Sbjct: 719  NVLPNKVMYTCFVDGMFKAGQWKAGIYFREQMDNLGHTPDIVTTNAMIDGYSRMGKIEKT 778

Query: 721  SSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLILG 780
            + L+ +  N+N  PNLTT+NILLHGYS+ +D+ + F LY  +  +G  P++LT HSL+LG
Sbjct: 779  NDLLPEMGNQNGGPNLTTYNILLHGYSKRKDVSTSFLLYRSIILNGILPDKLTCHSLVLG 838

Query: 781  LCNHGMLELGIKILKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSLD 840
            +C   MLE+G+KILK FI     +D  TFNMLI KCC   +++   DL   M    +SLD
Sbjct: 839  ICESNMLEIGLKILKAFICRGVEVDRYTFNMLISKCCANGEINWAFDLVKVMTSLGISLD 898

Query: 841  KDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLKD 900
            KDT  A+  VL R    Q   + +HEM K+G  P S++Y  ++  +CRVGDI+ AF +K+
Sbjct: 899  KDTCDAMVSVLNRNHRFQESRMVLHEMSKQGISPESRKYIGLINGLCRVGDIKTAFVVKE 958

Query: 901  QMVALGISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKKD 960
            +M+A  I   + AE AMVR LA CGK +EA  +L+ ML+MK +PT ++FTTLMH+ CK  
Sbjct: 959  EMIAHKICPPNVAESAMVRALAKCGKADEATLLLRFMLKMKLVPTIASFTTLMHLCCKNG 1018

Query: 961  NFKEAHNLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTYR 1020
            N  EA  L+++M +  +KLD+V+YNVLI+  CA GD+  A + YEE+K  G L N TTY+
Sbjct: 1019 NVIEALELRVVMSNCGLKLDLVSYNVLITGLCAKGDMALAFELYEEMKGDGFLANATTYK 1078

Query: 1021 VLVSAISTKHYVSRG-EIVLKDLNDRGLVSGYLDGKLQKSCRDFVVAIKKLNSLKPNQ 1078
             L+  +  +     G +I+LKDL  RG ++       Q S R+  +A++KL +L+ N+
Sbjct: 1079 ALIRGLLARETAFSGADIILKDLLARGFITSM--SLSQDSHRNLKMAMEKLKALQSNK 1134

BLAST of CSPI03G39810 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 303.1 bits (775), Expect = 6.7e-82
Identity = 191/654 (29.20%), Positives = 326/654 (49.85%), Query Frame = 1

Query: 34  LALKFLKWVIKQPGLEPNHLTHILG--ITTHVLVRARLYGYAKSILKHLAQKNSGSNF-- 93
           L LKFL W        P+    +    IT H+L + +LY  A+ + + +A K     +  
Sbjct: 64  LILKFLNWA------NPHQFFTLRCKCITLHILTKFKLYKTAQILAEDVAAKTLDDEYAS 123

Query: 94  -LFGVLMDTYPLCSSNPAVFDLLIRVYLRQGMVGHAVNTFSSMLIRGFKPSVYTCNMIMA 153
            +F  L +TY LC S  +VFDL+++ Y R  ++  A++        GF P V + N ++ 
Sbjct: 124 LVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLD 183

Query: 154 SMVKNCR-AHLVWSFFKQMLTSRVCPNVSSFNILISVLCVQGKLKKAVNILTMMERNGYV 213
           + +++ R      + FK+ML S+V PNV ++NILI   C  G +  A+ +   ME  G +
Sbjct: 184 ATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCL 243

Query: 214 PTIVSYNTLLSWCCKKGRFKFALVLIHHMECKGIQADVCTYNMFIDSLCRNSRSAQGYLV 273
           P +V+YNTL+   CK  +      L+  M  KG++ ++ +YN+ I+ LCR  R  +   V
Sbjct: 244 PNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFV 303

Query: 274 LKKMRNKMITPNEVSYNTLINGFVKEGKIGVATRVFNEMIELNLSPNLITYNILINGYCI 333
           L +M  +  + +EV+YNTLI G+ KEG    A  +  EM+   L+P++ITY  LI+  C 
Sbjct: 304 LTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCK 363

Query: 334 NGNFEEALRVLDVMEANDVRPNEVTIGTLLNGLYKSAKFDVARNILERYSINRTSLNCIS 393
            GN   A+  LD M    + PNE T  TL++G  +    + A  +L   + N  S + ++
Sbjct: 364 AGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVT 423

Query: 394 HTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPDIITFSVLINGFCKVGNINKAKEVMSKIY 453
           +  +I+G C  G +++A  +L +M + G+ PD++++S +++GFC+  ++++A  V  ++ 
Sbjct: 424 YNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMV 483

Query: 454 REGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAAMNLNGQNADNFTCNSLVASLCENGKLV 513
            +G  P+ + +S+LI   C+     EA   Y  M   G   D FT  +L+ + C  G L 
Sbjct: 484 EKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLE 543

Query: 514 EAEEFLHHISRIGLVPNSVTFDCIINGYANVGDGSGAFSVFDKMISCGHHPSPFTYGSLL 573
           +A +  + +   G++P+ VT+  +ING         A  +  K+      PS  TY +L+
Sbjct: 544 KALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLI 603

Query: 574 KVLCKGQNFWEARKLLKKLHCIPLAVDTISYNTLIVEISKSGNLLEAVRLFEEMIQNNIL 633
           +  C    F     L+K   C+                   G + EA ++FE M+  N  
Sbjct: 604 E-NCSNIEFKSVVSLIKGF-CM------------------KGMMTEADQVFESMLGKNHK 663

Query: 634 PDSYTYTCILSGLIREGRLVCAFIFLGRLMQKEILTLNSIVYTCFIDGLFKAGQ 682
           PD   Y  ++ G  R G +  A+  L + M K    L+++     +  L K G+
Sbjct: 664 PDGTAYNIMIHGHCRAGDIRKAYT-LYKEMVKSGFLLHTVTVIALVKALHKEGK 690

BLAST of CSPI03G39810 vs. TAIR10
Match: AT5G59900.1 (AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 294.7 bits (753), Expect = 2.4e-79
Identity = 216/805 (26.83%), Positives = 365/805 (45.34%), Query Frame = 1

Query: 34  LALKFLKWVIKQPGLEPNHLTHILGITTHVLVRARLYGYAKSILKHLAQKNSGSNFLFGV 93
           L L+F  ++    G +  H T    I  H LV+A L+  A S+L+ L  +    + +F V
Sbjct: 86  LGLRFFNFLGLHRGFD--HSTASFCILIHALVKANLFWPASSLLQTLLLRALKPSDVFNV 145

Query: 94  LMDTYPLCS-SNPAVFDLLIRVYLRQGMVGHAVNTFSSMLIR-GFKPSVYTCNMIMASMV 153
           L   Y  C  S+ + FDLLI+ Y+R   V   V  F  M+ +    P V T + ++  +V
Sbjct: 146 LFSCYEKCKLSSSSSFDLLIQHYVRSRRVLDGVLVFKMMITKVSLLPEVRTLSALLHGLV 205

Query: 154 KNCRAHLVWSFFKQMLTSRVCPNVSSFNILISVLCVQGKLKKAVNILTMMERNGYVPTIV 213
           K     L    F  M++  + P+V  +  +I  LC    L +A  ++  ME  G    IV
Sbjct: 206 KFRHFGLAMELFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMIAHMEATGCDVNIV 265

Query: 214 SYNTLLSWCCKKGRFKFALVLIHHMECKGIQADVCTYNMFIDSLCRNSRSAQGYLVLKKM 273
            YN L+   CKK +   A+ +   +  K ++ DV TY   +  LC+      G  ++ +M
Sbjct: 266 PYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFEIGLEMMDEM 325

Query: 274 RNKMITPNEVSYNTLINGFVKEGKIGVATRVFNEMIELNLSPNLITYNILINGYCINGNF 333
                +P+E + ++L+ G  K GKI  A  +   +++  +SPNL  YN LI+  C    F
Sbjct: 326 LCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALIDSLCKGRKF 385

Query: 334 EEALRVLDVMEANDVRPNEVTIGTLLNGLYKSAKFDVARNILERYSINRTSLNCISHTVM 393
            EA  + D M    +RPN+VT   L++   +  K D A + L         L+   +  +
Sbjct: 386 HEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLKLSVYPYNSL 445

Query: 394 IDGLCRNGLLDEAFQLLIEMCKDGVHPDIITFSVLINGFCKVGNINKAKEVMSKIYREGF 453
           I+G C+ G +  A   + EM    + P ++T++ L+ G+C  G INKA  +  ++  +G 
Sbjct: 446 INGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRLYHEMTGKGI 505

Query: 454 VPNNVIFSTLIYNSCKVGNVYEAMKFYAAMNLNGQNADNFTCNSLVASLCENGKLVEAEE 513
            P+   F+TL+    + G + +A+K +  M       +  T N ++   CE G + +A E
Sbjct: 506 APSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCEEGDMSKAFE 565

Query: 514 FLHHISRIGLVPNSVTFDCIINGYANVGDGSGAFSVFDKMISCGHHPSPFTYGSLLKVLC 573
           FL  ++  G+VP++ ++  +I+G    G  S A    D +       +   Y  LL   C
Sbjct: 566 FLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFC 625

Query: 574 KGQNFWEARKLLKKLHCIPLAVDTISYNTLIVEISKSGNLLEAVRLFEEMIQNNILPDSY 633
           +     EA  + +++    + +D + Y  LI    K  +      L +EM    + PD  
Sbjct: 626 REGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDV 685

Query: 634 TYTCILSGLIREGRLVCAFIFLGRLMQKEILTLNSIVYTCFIDGLFKAGQSKAALYLFKE 693
            YT ++    + G    AF  +  LM  E    N + YT  I+GL KAG    A  L  +
Sbjct: 686 IYTSMIDAKSKTGDFKEAF-GIWDLMINEGCVPNEVTYTAVINGLCKAGFVNEAEVLCSK 745

Query: 694 MEEKGLSVDLIA----LNSITDGYSRMGKVFSASSLISKTRNKNVIPNLTTFNILLHGYS 753
           M+      + +     L+ +T G   M K     + I     K ++ N  T+N+L+ G+ 
Sbjct: 746 MQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAIL----KGLLANTATYNMLIRGFC 805

Query: 754 RGQDIMSCFKLYNLMRRSGFFPNRLTYHSLILGLCNHGMLELGIKILKMFIAESSTIDDL 813
           R   I    +L   M   G  P+ +TY ++I  LC    ++  I++      +    D +
Sbjct: 806 RQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIELWNSMTEKGIRPDRV 865

Query: 814 TFNMLIRKCCEINDLDKVIDLTHNM 833
            +N LI  CC   ++ K  +L + M
Sbjct: 866 AYNTLIHGCCVAGEMGKATELRNEM 883

BLAST of CSPI03G39810 vs. TAIR10
Match: AT1G05670.1 (AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 292.4 bits (747), Expect = 1.2e-78
Identity = 179/579 (30.92%), Positives = 304/579 (52.50%), Query Frame = 1

Query: 57  LGITTHVLVRARLYGYAKSILKHLAQK---NSGSNFL--FGVLMDTYPLCSSNPAVFDLL 116
           L I  H+ V ++    A+S++    ++   N   +F+  F +L+ TY    S+P VFD+ 
Sbjct: 122 LCIVIHLAVASKDLKVAQSLISSFWERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVF 181

Query: 117 IRVYLRQGMVGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNC-RAHLVWSFFKQMLTSR 176
            +V +  G++  A   F  ML  G   SV +CN+ +  + K+C +       F++     
Sbjct: 182 FQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVG 241

Query: 177 VCPNVSSFNILISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFAL 236
           VC NV+S+NI+I  +C  G++K+A ++L +ME  GY P ++SY+T+++  C+ G      
Sbjct: 242 VCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVW 301

Query: 237 VLIHHMECKGIQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGF 296
            LI  M+ KG++ +   Y   I  LCR  + A+      +M  + I P+ V Y TLI+GF
Sbjct: 302 KLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGF 361

Query: 297 VKEGKIGVATRVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNE 356
            K G I  A++ F EM   +++P+++TY  +I+G+C  G+  EA ++   M    + P+ 
Sbjct: 362 CKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDS 421

Query: 357 VTIGTLLNGLYKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIE 416
           VT   L+NG  K+     A  +         S N +++T +IDGLC+ G LD A +LL E
Sbjct: 422 VTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHE 481

Query: 417 MCKDGVHPDIITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGN 476
           M K G+ P+I T++ ++NG CK GNI +A +++ +    G   + V ++TL+   CK G 
Sbjct: 482 MWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGE 541

Query: 477 VYEAMKFYAAMNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDC 536
           + +A +    M   G      T N L+   C +G L + E+ L+ +   G+ PN+ TF+ 
Sbjct: 542 MDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNS 601

Query: 537 IINGYANVGDGSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIP 596
           ++  Y    +   A +++  M S G  P   TY +L+K  CK +N  EA  L +++    
Sbjct: 602 LVKQYCIRNNLKAATAIYKDMCSRGVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKG 661

Query: 597 LAVDTISYNTLIVEISKSGNLLEAVRLFEEMIQNNILPD 630
            +V   +Y+ LI    K    LEA  +F++M +  +  D
Sbjct: 662 FSVSVSTYSVLIKGFLKRKKFLEAREVFDQMRREGLAAD 700

BLAST of CSPI03G39810 vs. TAIR10
Match: AT5G01110.1 (AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 285.0 bits (728), Expect = 1.9e-76
Identity = 170/611 (27.82%), Positives = 313/611 (51.23%), Query Frame = 1

Query: 52  HLTHILGITTHVLVRARLYGYAKSILKHLAQKNSGSNF-LFGVLMDTYPLCSSNPAVFDL 111
           H +  L    H+LVR+     A+S L  + +++  S   +   L  T+  C SN +VFDL
Sbjct: 111 HTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSGVSRLEIVNSLDSTFSNCGSNDSVFDL 170

Query: 112 LIRVYLRQGMVGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSR 171
           LIR Y++   +  A   F+ +  +GF  S+  CN ++ S+V+     L W  ++++  S 
Sbjct: 171 LIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEISRSG 230

Query: 172 VCPNVSSFNILISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFAL 231
           V  NV + NI+++ LC  GK++K    L+ ++  G  P IV+YNTL+S    KG  + A 
Sbjct: 231 VGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLMEEAF 290

Query: 232 VLIHHMECKGIQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGF 291
            L++ M  KG    V TYN  I+ LC++ +  +   V  +M    ++P+  +Y +L+   
Sbjct: 291 ELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRSLLMEA 350

Query: 292 VKEGKIGVATRVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNE 351
            K+G +    +VF++M   ++ P+L+ ++ +++ +  +GN ++AL   + ++   + P+ 
Sbjct: 351 CKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAGLIPDN 410

Query: 352 VTIGTLLNGLYKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIE 411
           V    L+ G  +     VA N+         +++ +++  ++ GLC+  +L EA +L  E
Sbjct: 411 VIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEADKLFNE 470

Query: 412 MCKDGVHPDIITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGN 471
           M +  + PD  T ++LI+G CK+GN+  A E+  K+  +    + V ++TL+    KVG+
Sbjct: 471 MTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGFGKVGD 530

Query: 472 VYEAMKFYAAMNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDC 531
           +  A + +A M          + + LV +LC  G L EA      +    + P  +  + 
Sbjct: 531 IDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTVMICNS 590

Query: 532 IINGYANVGDGSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIP 591
           +I GY   G+ S   S  +KMIS G  P   +Y +L+    + +N  +A  L+KK+    
Sbjct: 591 MIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKKMEEEQ 650

Query: 592 --LAVDTISYNTLIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVC 651
             L  D  +YN+++    +   + EA  +  +MI+  + PD  TYTC+++G + +  L  
Sbjct: 651 GGLVPDVFTYNSILHGFCRQNQMKEAEVVLRKMIERGVNPDRSTYTCMINGFVSQDNLTE 710

Query: 652 AFIFLGRLMQK 660
           AF     ++Q+
Sbjct: 711 AFRIHDEMLQR 721
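
Each hit in this report opens with the same three header lines (database, match, score). Assuming that layout holds throughout, the hits can be collected into records and ranked by bit score. This is a rough sketch; the regular expressions and the name `parse_hits` are hypothetical and written only against the text as it appears here.

```python
import re

HEADER_RE = re.compile(r"BLAST of (\S+) vs\. (.+)")
MATCH_RE = re.compile(r"Match: (\S+) \((.*)\)")
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")


def parse_hits(text):
    """Collect one record per 'Match:' line, with the score of its first HSP."""
    hits = []
    database = None
    for line in text.splitlines():
        line = line.strip()
        if (m := HEADER_RE.match(line)):
            database = m.group(2)
        elif (m := MATCH_RE.match(line)):
            hits.append(
                {"database": database, "id": m.group(1), "description": m.group(2)}
            )
        elif (m := SCORE_RE.search(line)) and hits and "bits" not in hits[-1]:
            # Only the first HSP of a hit is recorded.
            hits[-1].update(
                bits=float(m.group(1)),
                raw=int(m.group(2)),
                evalue=float(m.group(3)),
            )
    return hits


# Header lines of two TAIR10 hits, copied from the report.
sample = """\
BLAST of CSPI03G39810 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 303.1 bits (775), Expect = 6.7e-82

BLAST of CSPI03G39810 vs. TAIR10
Match: AT5G55840.1 (AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 1124.4 bits (2907), Expect = 0.0e+00
"""

hits = parse_hits(sample)
ranked = sorted(hits, key=lambda h: h["bits"], reverse=True)
print([h["id"] for h in ranked])
```

Ranking by bit score rather than by E-value avoids ties among the strongest hits, several of which report an E-value of 0.0e+00.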

BLAST of CSPI03G39810 vs. NCBI nr
Match: gi|778685780|ref|XP_011652273.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g55840 [Cucumis sativus])

HSP 1 Score: 1913.7 bits (4956), Expect = 0.0e+00
Identity = 955/960 (99.48%), Positives = 958/960 (99.79%), Query Frame = 1

Query: 120  MVGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFN 179
            MVGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFN
Sbjct: 1    MVGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFN 60

Query: 180  ILISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECK 239
            ILISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECK
Sbjct: 61   ILISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECK 120

Query: 240  GIQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVA 299
            GIQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVA
Sbjct: 121  GIQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVA 180

Query: 300  TRVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNG 359
            TRVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNG
Sbjct: 181  TRVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNG 240

Query: 360  LYKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPD 419
            LYKSAKFDVARNILERY INRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPD
Sbjct: 241  LYKSAKFDVARNILERYCINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPD 300

Query: 420  IITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYA 479
            IITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYA
Sbjct: 301  IITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYA 360

Query: 480  AMNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVG 539
            AMNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVG
Sbjct: 361  AMNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVG 420

Query: 540  DGSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYN 599
            DGSGAFSVFD+MISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYN
Sbjct: 421  DGSGAFSVFDRMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYN 480

Query: 600  TLIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQK 659
            TLIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQK
Sbjct: 481  TLIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQK 540

Query: 660  EILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEEKGLSVDLIALNSITDGYSRMGKVFS 719
            EILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEEKGLS+DLIALNSITDGYSRMGKVFS
Sbjct: 541  EILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEEKGLSLDLIALNSITDGYSRMGKVFS 600

Query: 720  ASSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLIL 779
            ASSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLIL
Sbjct: 601  ASSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLIL 660

Query: 780  GLCNHGMLELGIKILKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSL 839
            GLCNHGMLELGIK+LKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSL
Sbjct: 661  GLCNHGMLELGIKMLKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSL 720

Query: 840  DKDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLK 899
            DKDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLK
Sbjct: 721  DKDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLK 780

Query: 900  DQMVALGISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKK 959
            DQMVALGISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKK
Sbjct: 781  DQMVALGISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKK 840

Query: 960  DNFKEAHNLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTY 1019
            DNFKEAHNLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTY
Sbjct: 841  DNFKEAHNLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTY 900

Query: 1020 RVLVSAISTKHYVSRGEIVLKDLNDRGLVSGYLDGKLQKSCRDFVVAIKKLNSLKPNQGN 1079
            RVLVSAISTKHYVSRGEIVLKDLNDRGLVSGYLDGK QKSCRDFVVAIKKLNSLKPNQGN
Sbjct: 901  RVLVSAISTKHYVSRGEIVLKDLNDRGLVSGYLDGKSQKSCRDFVVAIKKLNSLKPNQGN 960
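
The percentages in an HSP header line are the match count divided by the alignment length. The following is a minimal Python sketch for re-deriving them from the header text; the function name `hsp_percentages` is hypothetical and is not part of BLAST or of this database.

```python
import re

def hsp_percentages(header: str) -> dict:
    """Recompute percentages from 'Name = matches/length' pairs in an HSP header line."""
    result = {}
    # Only fields written as a fraction (e.g. 'Identity = 955/960') are matched;
    # 'Query Frame = 1' has no slash and is skipped.
    for name, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        result[name] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 955/960 (99.48%), Positives = 958/960 (99.79%), Query Frame = 1"
print(hsp_percentages(line))
```

Applied to the header of the Cucumis sativus hit, this gives 99.48 for identity and 99.79 for positives, matching the reported values.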

BLAST of CSPI03G39810 vs. NCBI nr
Match: gi|659085584|ref|XP_008443499.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g55840 isoform X1 [Cucumis melo])

HSP 1 Score: 1882.8 bits (4876), Expect = 0.0e+00
Identity = 934/985 (94.82%), Positives = 963/985 (97.77%), Query Frame = 1

Query: 95   MDTYPLCSSNPAVFDLLIRVYLRQGMVGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNC 154
            MDTYPLCSSNPAVFDLLIRVYLRQGMVGHAVNTFSSMLIRGFKPSVYTCNMIMASMV+NC
Sbjct: 1    MDTYPLCSSNPAVFDLLIRVYLRQGMVGHAVNTFSSMLIRGFKPSVYTCNMIMASMVENC 60

Query: 155  RAHLVWSFFKQMLTSRVCPNVSSFNILISVLCVQGKLKKAVNILTMMERNGYVPTIVSYN 214
            RAHLVWSFFKQMLTSRVCPNVSSFNILISVLCVQGK KKAVNILTMMERNGY+PTIVSYN
Sbjct: 61   RAHLVWSFFKQMLTSRVCPNVSSFNILISVLCVQGKFKKAVNILTMMERNGYIPTIVSYN 120

Query: 215  TLLSWCCKKGRFKFALVLIHHMECKGIQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNK 274
            TLLSWCCKKGRFK AL+LIHHMECKGIQADVCTYNMFI+SLCRNSRSAQGYLVLKKMR K
Sbjct: 121  TLLSWCCKKGRFKSALMLIHHMECKGIQADVCTYNMFINSLCRNSRSAQGYLVLKKMRKK 180

Query: 275  MITPNEVSYNTLINGFVKEGKIGVATRVFNEMIELNLSPNLITYNILINGYCINGNFEEA 334
            MITPNEVSYNTLINGFVKEGKIGVATRVFNEM+ELNLSPNLITYNILING+CING+FEEA
Sbjct: 181  MITPNEVSYNTLINGFVKEGKIGVATRVFNEMLELNLSPNLITYNILINGHCINGDFEEA 240

Query: 335  LRVLDVMEANDVRPNEVTIGTLLNGLYKSAKFDVARNILERYSINRTSLNCISHTVMIDG 394
            LRVLDVMEANDVRPNEVTIGTLLNGLYKSAKFD+ARNILERY INRTSLN ISHTVMIDG
Sbjct: 241  LRVLDVMEANDVRPNEVTIGTLLNGLYKSAKFDIARNILERYRINRTSLNYISHTVMIDG 300

Query: 395  LCRNGLLDEAFQLLIEMCKDGVHPDIITFSVLINGFCKVGNINKAKEVMSKIYREGFVPN 454
            LCRNGLLDEAFQLLI+MC DGVHPDIITFSVLINGFCKVGNINKAKEVMSKIYREGFVPN
Sbjct: 301  LCRNGLLDEAFQLLIKMCNDGVHPDIITFSVLINGFCKVGNINKAKEVMSKIYREGFVPN 360

Query: 455  NVIFSTLIYNSCKVGNVYEAMKFYAAMNLNGQNADNFTCNSLVASLCENGKLVEAEEFLH 514
            NVIFSTLIYNSCKVGNVYEAMKFYAAMNLNGQNADNFTCNSLVASLCENGKLVEAEEFL 
Sbjct: 361  NVIFSTLIYNSCKVGNVYEAMKFYAAMNLNGQNADNFTCNSLVASLCENGKLVEAEEFLD 420

Query: 515  HISRIGLVPNSVTFDCIINGYANVGDGSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQ 574
            HI+RIGLVPNSVTFDCIINGYANVGDGSGAFSVFDKMIS GHHPSPFTYGSLLKVLC+GQ
Sbjct: 421  HITRIGLVPNSVTFDCIINGYANVGDGSGAFSVFDKMISSGHHPSPFTYGSLLKVLCRGQ 480

Query: 575  NFWEARKLLKKLHCIPLAVDTISYNTLIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYT 634
            NFWEARKLLKKLHCIPLAVDTISYNTLIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYT
Sbjct: 481  NFWEARKLLKKLHCIPLAVDTISYNTLIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYT 540

Query: 635  CILSGLIREGRLVCAFIFLGRLMQKEILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEE 694
            CILSGLIREGRLVCAFIFLGRLMQK ILT+NS+VYTC IDGLFKAGQ KAALYLFKEMEE
Sbjct: 541  CILSGLIREGRLVCAFIFLGRLMQKGILTMNSVVYTCLIDGLFKAGQPKAALYLFKEMEE 600

Query: 695  KGLSVDLIALNSITDGYSRMGKVFSASSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMS 754
            KGLS+D IALNSI DGYSRMGKVFSA SLISKTRNKNVIPNLTTFNILLHGYSRG+DIMS
Sbjct: 601  KGLSLDSIALNSIIDGYSRMGKVFSARSLISKTRNKNVIPNLTTFNILLHGYSRGKDIMS 660

Query: 755  CFKLYNLMRRSGFFPNRLTYHSLILGLCNHGMLELGIKILKMFIAESSTIDDLTFNMLIR 814
            CFKLY LMRRSGFFPNRLTYHSLILGLCNHGMLELG+K+LKM IAESSTIDDLTFNMLIR
Sbjct: 661  CFKLYYLMRRSGFFPNRLTYHSLILGLCNHGMLELGVKMLKMSIAESSTIDDLTFNMLIR 720

Query: 815  KCCEINDLDKVIDLTHNMEVFRVSLDKDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIP 874
            KCCEINDLDKVIDLTHNMEVF VSLDKDTQKAVTDVLV+RMVSQNYFVFMHEMLKKGFIP
Sbjct: 721  KCCEINDLDKVIDLTHNMEVFGVSLDKDTQKAVTDVLVKRMVSQNYFVFMHEMLKKGFIP 780

Query: 875  TSKQYCTMMKRMCRVGDIQGAFKLKDQMVALGISLDDAAECAMVRGLALCGKIEEAMWIL 934
            TS+QY TMMKR+CRVGDIQGAFKLKDQMVALG+SLDD AECAMVRGLALCGKIEEAMWIL
Sbjct: 781  TSRQYSTMMKRLCRVGDIQGAFKLKDQMVALGVSLDDVAECAMVRGLALCGKIEEAMWIL 840

Query: 935  QRMLRMKKIPTTSTFTTLMHVFCKKDNFKEAHNLKILMEHYRVKLDIVAYNVLISACCAN 994
            QRMLRMKKIPTTSTFTTLMHV CKKDNFKEAHNLKILMEHYRVKLDIVAYNVLISACCA+
Sbjct: 841  QRMLRMKKIPTTSTFTTLMHVLCKKDNFKEAHNLKILMEHYRVKLDIVAYNVLISACCAS 900

Query: 995  GDVITALDFYEEIKQKGLLPNMTTYRVLVSAISTKHYVSRGEIVLKDLNDRGLVSGYLDG 1054
            GDVITALDFYEEIKQKGLLPNMTTYRVLVSAISTKHYVSRGEI+LKDLNDRGLVSGY+DG
Sbjct: 901  GDVITALDFYEEIKQKGLLPNMTTYRVLVSAISTKHYVSRGEILLKDLNDRGLVSGYIDG 960

Query: 1055 KLQKSCRDFVVAIKKLNSLKPNQGN 1080
            K QKSC++F+VA+ KLNSL+PNQGN
Sbjct: 961  KSQKSCKNFLVAMNKLNSLRPNQGN 985

BLAST of CSPI03G39810 vs. NCBI nr
Match: gi|659085594|ref|XP_008443504.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g55840 isoform X2 [Cucumis melo])

HSP 1 Score: 1831.2 bits (4742), Expect = 0.0e+00
Identity = 909/960 (94.69%), Positives = 938/960 (97.71%), Query Frame = 1

Query: 120  MVGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFN 179
            MVGHAVNTFSSMLIRGFKPSVYTCNMIMASMV+NCRAHLVWSFFKQMLTSRVCPNVSSFN
Sbjct: 1    MVGHAVNTFSSMLIRGFKPSVYTCNMIMASMVENCRAHLVWSFFKQMLTSRVCPNVSSFN 60

Query: 180  ILISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECK 239
            ILISVLCVQGK KKAVNILTMMERNGY+PTIVSYNTLLSWCCKKGRFK AL+LIHHMECK
Sbjct: 61   ILISVLCVQGKFKKAVNILTMMERNGYIPTIVSYNTLLSWCCKKGRFKSALMLIHHMECK 120

Query: 240  GIQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVA 299
            GIQADVCTYNMFI+SLCRNSRSAQGYLVLKKMR KMITPNEVSYNTLINGFVKEGKIGVA
Sbjct: 121  GIQADVCTYNMFINSLCRNSRSAQGYLVLKKMRKKMITPNEVSYNTLINGFVKEGKIGVA 180

Query: 300  TRVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNG 359
            TRVFNEM+ELNLSPNLITYNILING+CING+FEEALRVLDVMEANDVRPNEVTIGTLLNG
Sbjct: 181  TRVFNEMLELNLSPNLITYNILINGHCINGDFEEALRVLDVMEANDVRPNEVTIGTLLNG 240

Query: 360  LYKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPD 419
            LYKSAKFD+ARNILERY INRTSLN ISHTVMIDGLCRNGLLDEAFQLLI+MC DGVHPD
Sbjct: 241  LYKSAKFDIARNILERYRINRTSLNYISHTVMIDGLCRNGLLDEAFQLLIKMCNDGVHPD 300

Query: 420  IITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYA 479
            IITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYA
Sbjct: 301  IITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYA 360

Query: 480  AMNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVG 539
            AMNLNGQNADNFTCNSLVASLCENGKLVEAEEFL HI+RIGLVPNSVTFDCIINGYANVG
Sbjct: 361  AMNLNGQNADNFTCNSLVASLCENGKLVEAEEFLDHITRIGLVPNSVTFDCIINGYANVG 420

Query: 540  DGSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYN 599
            DGSGAFSVFDKMIS GHHPSPFTYGSLLKVLC+GQNFWEARKLLKKLHCIPLAVDTISYN
Sbjct: 421  DGSGAFSVFDKMISSGHHPSPFTYGSLLKVLCRGQNFWEARKLLKKLHCIPLAVDTISYN 480

Query: 600  TLIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQK 659
            TLIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQK
Sbjct: 481  TLIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQK 540

Query: 660  EILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEEKGLSVDLIALNSITDGYSRMGKVFS 719
             ILT+NS+VYTC IDGLFKAGQ KAALYLFKEMEEKGLS+D IALNSI DGYSRMGKVFS
Sbjct: 541  GILTMNSVVYTCLIDGLFKAGQPKAALYLFKEMEEKGLSLDSIALNSIIDGYSRMGKVFS 600

Query: 720  ASSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLIL 779
            A SLISKTRNKNVIPNLTTFNILLHGYSRG+DIMSCFKLY LMRRSGFFPNRLTYHSLIL
Sbjct: 601  ARSLISKTRNKNVIPNLTTFNILLHGYSRGKDIMSCFKLYYLMRRSGFFPNRLTYHSLIL 660

Query: 780  GLCNHGMLELGIKILKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSL 839
            GLCNHGMLELG+K+LKM IAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVF VSL
Sbjct: 661  GLCNHGMLELGVKMLKMSIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFGVSL 720

Query: 840  DKDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLK 899
            DKDTQKAVTDVLV+RMVSQNYFVFMHEMLKKGFIPTS+QY TMMKR+CRVGDIQGAFKLK
Sbjct: 721  DKDTQKAVTDVLVKRMVSQNYFVFMHEMLKKGFIPTSRQYSTMMKRLCRVGDIQGAFKLK 780

Query: 900  DQMVALGISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKK 959
            DQMVALG+SLDD AECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHV CKK
Sbjct: 781  DQMVALGVSLDDVAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVLCKK 840

Query: 960  DNFKEAHNLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTY 1019
            DNFKEAHNLKILMEHYRVKLDIVAYNVLISACCA+GDVITALDFYEEIKQKGLLPNMTTY
Sbjct: 841  DNFKEAHNLKILMEHYRVKLDIVAYNVLISACCASGDVITALDFYEEIKQKGLLPNMTTY 900

Query: 1020 RVLVSAISTKHYVSRGEIVLKDLNDRGLVSGYLDGKLQKSCRDFVVAIKKLNSLKPNQGN 1079
            RVLVSAISTKHYVSRGEI+LKDLNDRGLVSGY+DGK QKSC++F+VA+ KLNSL+PNQGN
Sbjct: 901  RVLVSAISTKHYVSRGEILLKDLNDRGLVSGYIDGKSQKSCKNFLVAMNKLNSLRPNQGN 960

BLAST of CSPI03G39810 vs. NCBI nr
Match: gi|659085596|ref|XP_008443505.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g55840 isoform X3 [Cucumis melo])

HSP 1 Score: 1677.5 bits (4343), Expect = 0.0e+00
Identity = 831/880 (94.43%), Positives = 859/880 (97.61%), Query Frame = 1

Query: 200  MMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECKGIQADVCTYNMFIDSLCRNS 259
            MMERNGY+PTIVSYNTLLSWCCKKGRFK AL+LIHHMECKGIQADVCTYNMFI+SLCRNS
Sbjct: 1    MMERNGYIPTIVSYNTLLSWCCKKGRFKSALMLIHHMECKGIQADVCTYNMFINSLCRNS 60

Query: 260  RSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVATRVFNEMIELNLSPNLITYN 319
            RSAQGYLVLKKMR KMITPNEVSYNTLINGFVKEGKIGVATRVFNEM+ELNLSPNLITYN
Sbjct: 61   RSAQGYLVLKKMRKKMITPNEVSYNTLINGFVKEGKIGVATRVFNEMLELNLSPNLITYN 120

Query: 320  ILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNGLYKSAKFDVARNILERYSIN 379
            ILING+CING+FEEALRVLDVMEANDVRPNEVTIGTLLNGLYKSAKFD+ARNILERY IN
Sbjct: 121  ILINGHCINGDFEEALRVLDVMEANDVRPNEVTIGTLLNGLYKSAKFDIARNILERYRIN 180

Query: 380  RTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPDIITFSVLINGFCKVGNINKA 439
            RTSLN ISHTVMIDGLCRNGLLDEAFQLLI+MC DGVHPDIITFSVLINGFCKVGNINKA
Sbjct: 181  RTSLNYISHTVMIDGLCRNGLLDEAFQLLIKMCNDGVHPDIITFSVLINGFCKVGNINKA 240

Query: 440  KEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAAMNLNGQNADNFTCNSLVAS 499
            KEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAAMNLNGQNADNFTCNSLVAS
Sbjct: 241  KEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAAMNLNGQNADNFTCNSLVAS 300

Query: 500  LCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVGDGSGAFSVFDKMISCGHHPS 559
            LCENGKLVEAEEFL HI+RIGLVPNSVTFDCIINGYANVGDGSGAFSVFDKMIS GHHPS
Sbjct: 301  LCENGKLVEAEEFLDHITRIGLVPNSVTFDCIINGYANVGDGSGAFSVFDKMISSGHHPS 360

Query: 560  PFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYNTLIVEISKSGNLLEAVRLFE 619
            PFTYGSLLKVLC+GQNFWEARKLLKKLHCIPLAVDTISYNTLIVEISKSGNLLEAVRLFE
Sbjct: 361  PFTYGSLLKVLCRGQNFWEARKLLKKLHCIPLAVDTISYNTLIVEISKSGNLLEAVRLFE 420

Query: 620  EMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQKEILTLNSIVYTCFIDGLFKA 679
            EMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQK ILT+NS+VYTC IDGLFKA
Sbjct: 421  EMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQKGILTMNSVVYTCLIDGLFKA 480

Query: 680  GQSKAALYLFKEMEEKGLSVDLIALNSITDGYSRMGKVFSASSLISKTRNKNVIPNLTTF 739
            GQ KAALYLFKEMEEKGLS+D IALNSI DGYSRMGKVFSA SLISKTRNKNVIPNLTTF
Sbjct: 481  GQPKAALYLFKEMEEKGLSLDSIALNSIIDGYSRMGKVFSARSLISKTRNKNVIPNLTTF 540

Query: 740  NILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLILGLCNHGMLELGIKILKMFIA 799
            NILLHGYSRG+DIMSCFKLY LMRRSGFFPNRLTYHSLILGLCNHGMLELG+K+LKM IA
Sbjct: 541  NILLHGYSRGKDIMSCFKLYYLMRRSGFFPNRLTYHSLILGLCNHGMLELGVKMLKMSIA 600

Query: 800  ESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSLDKDTQKAVTDVLVRRMVSQN 859
            ESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVF VSLDKDTQKAVTDVLV+RMVSQN
Sbjct: 601  ESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFGVSLDKDTQKAVTDVLVKRMVSQN 660

Query: 860  YFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLKDQMVALGISLDDAAECAMVR 919
            YFVFMHEMLKKGFIPTS+QY TMMKR+CRVGDIQGAFKLKDQMVALG+SLDD AECAMVR
Sbjct: 661  YFVFMHEMLKKGFIPTSRQYSTMMKRLCRVGDIQGAFKLKDQMVALGVSLDDVAECAMVR 720

Query: 920  GLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKKDNFKEAHNLKILMEHYRVKL 979
            GLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHV CKKDNFKEAHNLKILMEHYRVKL
Sbjct: 721  GLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVLCKKDNFKEAHNLKILMEHYRVKL 780

Query: 980  DIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTYRVLVSAISTKHYVSRGEIVL 1039
            DIVAYNVLISACCA+GDVITALDFYEEIKQKGLLPNMTTYRVLVSAISTKHYVSRGEI+L
Sbjct: 781  DIVAYNVLISACCASGDVITALDFYEEIKQKGLLPNMTTYRVLVSAISTKHYVSRGEILL 840

Query: 1040 KDLNDRGLVSGYLDGKLQKSCRDFVVAIKKLNSLKPNQGN 1080
            KDLNDRGLVSGY+DGK QKSC++F+VA+ KLNSL+PNQGN
Sbjct: 841  KDLNDRGLVSGYIDGKSQKSCKNFLVAMNKLNSLRPNQGN 880

BLAST of CSPI03G39810 vs. NCBI nr
Match: gi|641840213|gb|KDO59135.1| (hypothetical protein CISIN_1g000951mg [Citrus sinensis])

HSP 1 Score: 1283.1 bits (3319), Expect = 0.0e+00
Identity = 635/1080 (58.80%), Positives = 807/1080 (74.72%), Query Frame = 1

Query: 1    MENSIYTILTIGRWESLNHMNYKFASLRPIHGVLALKFLKWVIKQPGLEPNHLTHILGIT 60
            ME SIYT+LTI RWESLNHM YK ASLRP+HG LALKFL WV+ QPGLE  HLTHIL +T
Sbjct: 1    MEKSIYTLLTIDRWESLNHMEYKLASLRPVHGRLALKFLNWVMNQPGLELKHLTHILCLT 60

Query: 61   THVLVRARLYGYAKSILKHLAQKNSGSNFLFGVLMDTYPLCSSNPAVFDLLIRVYLRQGM 120
            THVLV+ R+Y  AK IL+ LAQ   G N +FG LM+TYPLC+SNP+VFDLLIRVYLR+GM
Sbjct: 61   THVLVKTRMYEDAKLILRQLAQMGIGQNSVFGSLMNTYPLCNSNPSVFDLLIRVYLREGM 120

Query: 121  VGHAVNTFSSMLIRGFKPSVYTCNMIMASMVKNCRAHLVWSFFKQMLTSRVCPNVSSFNI 180
            V +A+ TF  M  RGF PSVYTCNM+++ M+K+ R   VW  F  ML  ++CPNV++FNI
Sbjct: 121  VEYALETFQLMGFRGFNPSVYTCNMMLSFMLKDRRVDSVWLLFDDMLDRKICPNVATFNI 180

Query: 181  LISVLCVQGKLKKAVNILTMMERNGYVPTIVSYNTLLSWCCKKGRFKFALVLIHHMECKG 240
            LI+V CV+GKLKKA  +L  ME +GYVP IV+YNTLL+W CKKGR+K A  LI  M  KG
Sbjct: 181  LINVSCVEGKLKKAGYLLRKMEESGYVPNIVTYNTLLNWYCKKGRYKAAFKLIDCMASKG 240

Query: 241  IQADVCTYNMFIDSLCRNSRSAQGYLVLKKMRNKMITPNEVSYNTLINGFVKEGKIGVAT 300
            I+ADVCTYNMFID LCRN+RSA+GYL+LK MR +MITPNEV+YNTLINGFVKEGKI VA+
Sbjct: 241  IEADVCTYNMFIDDLCRNNRSAKGYLLLKNMRKRMITPNEVTYNTLINGFVKEGKIQVAS 300

Query: 301  RVFNEMIELNLSPNLITYNILINGYCINGNFEEALRVLDVMEANDVRPNEVTIGTLLNGL 360
            RVF+EM  LN SPN ITYN LI+G+C  GNF+EA R+L +ME   +RPNEV+ G LLNG 
Sbjct: 301  RVFDEMSMLNFSPNSITYNELIDGHCCKGNFKEAFRLLAMMEEMGLRPNEVSYGALLNGF 360

Query: 361  YKSAKFDVARNILERYSINRTSLNCISHTVMIDGLCRNGLLDEAFQLLIEMCKDGVHPDI 420
             K AKFD+AR++LER   N  S++CI++T +IDGLC+ GLLDEA QL  +M KDG++PD+
Sbjct: 361  CKHAKFDLARSLLERMRTNGISISCIAYTSVIDGLCKCGLLDEAMQLFNKMFKDGLNPDL 420

Query: 421  ITFSVLINGFCKVGNINKAKEVMSKIYREGFVPNNVIFSTLIYNSCKVGNVYEAMKFYAA 480
            ITFSVLINGFCKVG   KAK V+ K+YR+G VPN +I+STLIY  CK+G V EAMK YA 
Sbjct: 421  ITFSVLINGFCKVGMTRKAKAVLCKMYRDGLVPNKIIYSTLIYYFCKMGKVTEAMKVYAV 480

Query: 481  MNLNGQNADNFTCNSLVASLCENGKLVEAEEFLHHISRIGLVPNSVTFDCIINGYANVGD 540
            MN N Q +D+FTCN LVASLC+ GK+ EAE+++ H+ RIG+VPNS+TFDC+I+GY  +GD
Sbjct: 481  MNRNAQGSDHFTCNMLVASLCKGGKVCEAEDYVGHMKRIGVVPNSITFDCMIDGYGTLGD 540

Query: 541  GSGAFSVFDKMISCGHHPSPFTYGSLLKVLCKGQNFWEARKLLKKLHCIPLAVDTISYNT 600
            G  AFS+FD+M+  GHHPS FTYGSLLK LCKG N  EA++ L  LH IP AVDT++YNT
Sbjct: 541  GLKAFSMFDEMVKLGHHPSIFTYGSLLKGLCKGGNLKEAKRFLNSLHHIPSAVDTVAYNT 600

Query: 601  LIVEISKSGNLLEAVRLFEEMIQNNILPDSYTYTCILSGLIREGRLVCAFIFLGRLMQKE 660
            ++ E  KSGNL EA+ L +EM+Q N+LPD YTYT +L+GL R+G++V A +F  +++ K 
Sbjct: 601  ILAETCKSGNLWEAIVLLDEMVQFNLLPDRYTYTILLAGLCRKGKVVSALLFFEKVVSKR 660

Query: 661  ILTLNSIVYTCFIDGLFKAGQSKAALYLFKEMEEKGLSVDLIALNSITDGYSRMGKVFSA 720
              + N++++TC +DGLFKAGQSKAA+++ K M+++G+  D IA N++ DG+SRMG +  A
Sbjct: 661  TFSPNNVMFTCLVDGLFKAGQSKAAMHISKIMDKEGVYPDTIAFNAVMDGFSRMGNMMMA 720

Query: 721  SSLISKTRNKNVIPNLTTFNILLHGYSRGQDIMSCFKLYNLMRRSGFFPNRLTYHSLILG 780
            + L+S  R++ + P+L T+NILLHGYS+ +D++ C  L N M+  G  P++LT HSLILG
Sbjct: 721  NDLLSTMRSRKLCPSLATYNILLHGYSKKKDLLMCSMLLNTMKMEGLLPDKLTCHSLILG 780

Query: 781  LCNHGMLELGIKILKMFIAESSTIDDLTFNMLIRKCCEINDLDKVIDLTHNMEVFRVSLD 840
             C  GMLE+G K LK  IAE + +D  TFN+L+RKCCE  ++ K  DL + M +  V  D
Sbjct: 781  FCETGMLEVGFKFLKKMIAEGTMVDCFTFNVLMRKCCEAGEMGKAFDLFNIMNMLGVVPD 840

Query: 841  KDTQKAVTDVLVRRMVSQNYFVFMHEMLKKGFIPTSKQYCTMMKRMCRVGDIQGAFKLKD 900
             +TQ A+   L R    Q     +  M +KG  P   QY T++  MCRVG+ QGAFKLKD
Sbjct: 841  TNTQDAIIMGLKRIAAFQESHFVLRGMAEKGLTPKCTQYITLINGMCRVGNFQGAFKLKD 900

Query: 901  QMVALGISLDDAAECAMVRGLALCGKIEEAMWILQRMLRMKKIPTTSTFTTLMHVFCKKD 960
            +M ALGIS  D AE AMVRGLA CGK+EEAM +L RMLRM+ +PT +TFTTL+H FCK+ 
Sbjct: 901  EMEALGISSSDVAESAMVRGLAHCGKVEEAMLVLNRMLRMRLVPTIATFTTLIHKFCKEA 960

Query: 961  NFKEAHNLKILMEHYRVKLDIVAYNVLISACCANGDVITALDFYEEIKQKGLLPNMTTYR 1020
             F +A  LK  ME   VKLD+V+YNVLIS  CANGDV+ A + YEE+K KGL PN TTY 
Sbjct: 961  KFVDALKLKGTMELSGVKLDVVSYNVLISGLCANGDVMPAFELYEEMKHKGLCPNSTTYS 1020

Query: 1021 VLVSAISTK-HYVSRGEIVLKDLNDRGLVSGYLDGKLQKSCRDFVVAIKKLNSLKPNQGN 1080
            VL+ AIS K + + +GEI+LKD+ +RG +S   DG  Q      + A++KL S K N+ N
Sbjct: 1021 VLIDAISKKENNLVKGEILLKDIQERGFISWNWDGSTQHLHEGLINALRKLKSFKKNRRN 1080

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
PP432_ARATH  0.0e+00  51.58         Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN...
PP407_ARATH  1.2e-80  29.20         Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN...
PP437_ARATH  4.2e-78  26.83         Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th...
PPR12_ARATH  2.1e-77  30.92         Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
PP360_ARATH  3.3e-75  27.82         Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN...
Match Name        E-value  Identity (%)  Description
A0A067EZ46_CITSI  0.0e+00  58.80         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g000951mg PE=4 SV=1
V4UNJ1_9ROSI      0.0e+00  58.52         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10007298mg PE=4 SV=1
B9S9V6_RICCO      0.0e+00  57.56         Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
M5W514_PRUPE      0.0e+00  60.33         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021196mg PE=4 SV=1
F6GYT0_VITVI      0.0e+00  58.11         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0117g00250 PE=4 SV=...
Match Name   E-value  Identity (%)  Description
AT5G55840.1  0.0e+00  51.58         Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1  6.7e-82  29.20         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G59900.1  2.4e-79  26.83         Pentatricopeptide repeat (PPR) superfamily protein
AT1G05670.1  1.2e-78  30.92         Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G01110.1  1.9e-76  27.82         Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                        E-value  Identity (%)  Description
gi|778685780|ref|XP_011652273.1|  0.0e+00  99.48         PREDICTED: pentatricopeptide repeat-containing protein At5g55840 [Cucumis sativu...
gi|659085584|ref|XP_008443499.1|  0.0e+00  94.82         PREDICTED: pentatricopeptide repeat-containing protein At5g55840 isoform X1 [Cuc...
gi|659085594|ref|XP_008443504.1|  0.0e+00  94.69         PREDICTED: pentatricopeptide repeat-containing protein At5g55840 isoform X2 [Cuc...
gi|659085596|ref|XP_008443505.1|  0.0e+00  94.43         PREDICTED: pentatricopeptide repeat-containing protein At5g55840 isoform X3 [Cuc...
gi|641840213|gb|KDO59135.1|       0.0e+00  58.80         hypothetical protein CISIN_1g000951mg [Citrus sinensis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G39810.1  CSPI03G39810.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                        Source    Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR
    coord: 879..907    score: 0.11
    coord: 916..940    score: 0.017
    coord: 947..973    score: 0.025
    coord: 456..483    score: 1.1
    coord: 108..136    score: 0.35
    coord: 808..834    score:
IPR002885  Pentatricopeptide repeat               PFAM      PF12854           PPR_1
    coord: 485..514    score: 1.
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2
    coord: 734..783    score: 2.6E-11
    coord: 208..257    score: 2.0E-14
    coord: 594..641    score: 3.1E-13
    coord: 386..432    score: 1.1E-15
    coord: 138..186    score: 7.8E-9
    coord: 665..712    score: 2.8E-8
    coord: 523..572    score: 6.3E-13
    coord: 278..326    score: 8.3E-18
    coord: 980..1025   score: 1.0
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756
    coord: 387..420    score: 1.5E-8
    coord: 667..700    score: 2.5E-8
    coord: 211..244    score: 3.5E-7
    coord: 177..209    score: 3.1E-6
    coord: 141..175    score: 1.2E-5
    coord: 108..140    score: 3.6E-4
    coord: 596..630    score: 1.2E-5
    coord: 491..524    score: 2.6E-4
    coord: 526..559    score: 7.7E-7
    coord: 246..279    score: 4.2E-4
    coord: 281..314    score: 7.8E-9
    coord: 316..350    score: 1.5E-10
    coord: 421..454    score: 1.4E-7
    coord: 982..1015   score: 2.7E-7
    coord: 879..910    score: 5.7E-5
    coord: 738..770    score: 6.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR
    coord: 629..663    score: 8.046
    coord: 1015..1049  score: 6.741
    coord: 770..804    score: 7.805
    coord: 524..558    score: 10.928
    coord: 349..383    score: 6.347
    coord: 559..593    score: 7.596
    coord: 735..769    score: 10.786
    coord: 174..208    score: 11.52
    coord: 419..453    score: 12.014
    coord: 104..138    score: 9.997
    coord: 454..488    score: 7.87
    coord: 945..979    score: 8.199
    coord: 980..1014   score: 12.573
    coord: 384..418    score: 12.233
    coord: 314..348    score: 13.395
    coord: 805..839    score: 8.353
    coord: 209..243    score: 10.633
    coord: 910..944    score: 8.769
    coord: 139..173    score: 8.484
    coord: 840..874    score: 5.492
    coord: 594..628    score: 11.86
    coord: 489..523    score: 11.115
    coord: 700..734    score: 8.418
    coord: 665..699    score: 10.928
    coord: 244..278    score: 10.008
    coord: 279..313    score: 12.43
    coord: 875..909    score:
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 283..336    score: 2.6E-5
    coord: 476..625    score: 2.6E-5
    coord: 910..945    score: 2.
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED
    coord: 976..1021   score: 2.0E-243
    coord: 881..938    score: 2.0E-243
    coord: 277..634    score: 2.0E-243
    coord: 103..237    score: 2.0E
None       No IPR available                       PANTHER   PTHR24015:SF651   SUBFAMILY NOT NAMED
    coord: 976..1021   score: 2.0E-243
    coord: 103..237    score: 2.0E-243
    coord: 881..938    score: 2.0E-243
    coord: 277..634    score: 2.0E
None       No IPR available                       unknown   SSF81901          HCP-like
    coord: 531..707    score: 4.7

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene          Organism                      Block
CSPI03G39810  Wax gourd                     cpiwgoB238
CSPI03G39810  Wax gourd                     cpiwgoB267
CSPI03G39810  Wax gourd                     cpiwgoB356
CSPI03G39810  Wild cucumber (PI 183967)     cpicpiB020
CSPI03G39810  Cucumber (Gy14) v1            cgycpiB314
CSPI03G39810  Cucurbita maxima (Rimu)       cmacpiB121
CSPI03G39810  Cucurbita maxima (Rimu)       cmacpiB241
CSPI03G39810  Cucurbita maxima (Rimu)       cmacpiB287
CSPI03G39810  Cucurbita maxima (Rimu)       cmacpiB711
CSPI03G39810  Cucurbita pepo (Zucchini)     cpecpiB395
CSPI03G39810  Cucurbita moschata (Rifu)     cmocpiB116
CSPI03G39810  Cucurbita moschata (Rifu)     cmocpiB229
CSPI03G39810  Cucurbita moschata (Rifu)     cmocpiB277
CSPI03G39810  Cucurbita moschata (Rifu)     cmocpiB705
CSPI03G39810  Cucumber (Chinese Long) v2    cpicuB106
CSPI03G39810  Melon (DHL92) v3.5.1          cpimeB162
CSPI03G39810  Melon (DHL92) v3.5.1          cpimeB171
CSPI03G39810  Melon (DHL92) v3.5.1          cpimeB189
CSPI03G39810  Watermelon (Charleston Gray)  cpiwcgB179
CSPI03G39810  Watermelon (Charleston Gray)  cpiwcgB195
CSPI03G39810  Watermelon (Charleston Gray)  cpiwcgB234
CSPI03G39810  Watermelon (Charleston Gray)  cpiwcgB251
CSPI03G39810  Watermelon (97103) v1         cpiwmB185
CSPI03G39810  Watermelon (97103) v1         cpiwmB198
CSPI03G39810  Watermelon (97103) v1         cpiwmB238
CSPI03G39810  Watermelon (97103) v1         cpiwmB260
CSPI03G39810  Watermelon (97103) v1         cpiwmB280
CSPI03G39810  Cucurbita pepo (Zucchini)     cpecpiB173
CSPI03G39810  Cucurbita pepo (Zucchini)     cpecpiB599
CSPI03G39810  Cucurbita pepo (Zucchini)     cpecpiB674
CSPI03G39810  Bottle gourd (USVL1VR-Ls)     cpilsiB174
CSPI03G39810  Bottle gourd (USVL1VR-Ls)     cpilsiB184
CSPI03G39810  Bottle gourd (USVL1VR-Ls)     cpilsiB193
CSPI03G39810  Bottle gourd (USVL1VR-Ls)     cpilsiB223
CSPI03G39810  Melon (DHL92) v3.6.1          cpimedB155
CSPI03G39810  Melon (DHL92) v3.6.1          cpimedB182
CSPI03G39810  Silver-seed gourd             carcpiB0291
CSPI03G39810  Silver-seed gourd             carcpiB0401
CSPI03G39810  Silver-seed gourd             carcpiB0472
CSPI03G39810  Silver-seed gourd             carcpiB0719
CSPI03G39810  Cucumber (Chinese Long) v3    cpicucB128
CSPI03G39810  Cucumber (Chinese Long) v3    cpicucB131
CSPI03G39810  Watermelon (97103) v2         cpiwmbB189
CSPI03G39810  Watermelon (97103) v2         cpiwmbB220
CSPI03G39810  Watermelon (97103) v2         cpiwmbB236
CSPI03G39810  Watermelon (97103) v2         cpiwmbB262