CSPI06G23760 (gene) Wild cucumber (PI 183967)

Name: CSPI06G23760
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Pentatricopeptide repeat-containing protein
Location: Chr6 : 21395333 .. 21397767 (+)
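
The location field can be read programmatically. The sketch below is a minimal, hypothetical helper (the name `parse_location` is not part of any database API) that splits the string into chromosome, start, end and strand. It assumes the coordinates are 1-based and inclusive, which is common for genome browsers but is not stated on this page.

```python
import re

# Pattern mirroring the "Chr6 : 21395333 .. 21397767 (+)" layout used on this page.
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, hence the +1.
        "length": end - start + 1,
    }


loc = parse_location("Chr6 : 21395333 .. 21397767 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the feature spans 2435 bases on the plus strand of Chr6.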
The following sequences are available for this feature:

Gene sequence (with intron)

ATTCGTTATTGGTTGAATCTTGTTATGAAAAAATTGTTAGAAGCAATTTTATTTTGATATTGATTTGGGAATTTAAAAAGTGATGTAAAAGCGAAGAATTCCTTTTCTTCCATTCCTCTCCTTTGCAAAGACCACCGTGGAAGAACCGTAGGACTGCCAATGCACAATGGCATTCGGTTATCAATCTCCATTCCAACCCCTAGTCACCTTCTCTTTCGAATCCTTCATTCTTACTCTGGTTCCGCTCACATCGACACTGTCCCTCCACCATCATCTCCACCATTCAAATGCTCAATCTCTCCCCTTACAATCTCTGCGACTCTTCAAAACCTTCTGCAGCCGCTCTCTGCGCCGGGCCCACCTCCGATTCTATCTTATGCCCCCGTTTTCCAGTTCCTTACTGGCCTAAACATGTTGAAATTGGGCCATCAAGTTCACGCCCATATGCTTCTCCGCGGCCTTCAGCCCACTGCGCTGGTTGGCTCCAAGATGGTTGCGTTTTATGCCAGTTCGGGTGATATTGATTCCTCTGTTTCGGTTTTCAATGGGATTGGTGAGCCTTCCTCTCTCTTGTTTAATTCCATGATTCGAGCTTATGCTCGATATGGGTTTGCAGAGAGAACTGTTGCCACTTATTTTTCTATGCATTCTTGGGGCTTTACTGGGGATTACTTTACTTTCCCTTTTGTTCTTAAGTCTTCTGTGGAGTTGTTGAGTGTTTGGATGGGGAAATGTGTTCATGGGCTGATTTTGAGAATTGGGTTGCAGTTTGATTTGTATGTGGCTACTTCTTTGATTATTCTGTATGGGAAATGTGGTGAAATAAATGATGCGGGTAAGGTGTTTGATAATATGACTATTAGAGATGTTTCATCTTGGAATGCTTTACTTGCTGGTTATATGAAGAGTGGGTGTATTGATGCTGCACTGGCGATTTTCGAGAGAATGCCATGGAGGAATATTGTCTCTTGGACGACTATGATTTCTGGATACTCACAGAGCGGCTTGGCACAGCAGGCATTGAGTTTGTTTGATGAAATGGTGAAAGAAGATTCACGAGTAAGACCGAACTGGGTGACTATAATGAGCGTCCTCCCAGCTTGTGCACAATTATCGACACTTGAACGCGGAAGGCAGATTCATGAGTTGGCTTGTCGGATGGGTTTGAATTCAAATGCTTCTGTGCTGATTGCGCTTACTGCAATGTATGCAAAATGTGGAAGCTTAGTCGATGCTCGCAACTGTTTCGACAAGCTTAATAGAAATGAAAAGAATTTGATTGCTTGGAATACCATGATAACTGCTTATGCGTCCTATGGACATGGGCTTCAAGCAGTGTCAACCTTTCGGGAGATGATCCAAGCAGGCATTCAACCGGACGATATTACATTCACAGGATTGTTATCCGGTTGCAGCCATTCAGGTCTTGTTGATGTTGGTTTAAAGTACTTCAATCACATGAGCACCACATATTCGATCAATCCCAGAGTTGAGCATTATGCCTGTGTTGCCGATCTCTTAGGTCGAGCAGGGAGATTAGCTGAAGCAAGTAAACTTGTAGGTGAAATGCCAATGCCTGCAGGACCAAGTATTTGGGGTTCACTATTAGCTGCCTGTCGAAAACACCGTAATCTGGAAATGGCAGAAACTGCAGCAAGAAAGCTTTTTGTCCTAGAACCAGAAAACACCGGCAACTACGTTCTACTTTCCAACATGTATGCTGAAGCCGGAAGGTGGCAGGAAGTTGACAAACTGAGAGCAATTGTGAAATCCCAAGGGACAAAGAAAAGTCCAGGTTGCAGTTGGATCGAGATAAATGGCAAAGCACATATGTTTCTTGGTGGCGATACATCTCACCCTCAAGGCAAAGAAATCTACATGTTCTTGGAGGCATTGCCAGAGAAGATGAAGGCAGCTGGCTATTTTCCTGATACAAGCTATGTACTGCACGACATCAGCGAGGAAGAGAAAGAATTCAACCTCATTGCACACAGTGAGAA
GCTCGCCGTTGCATTCGGGATTCTTAACACTCCTGCTGAAACTGTTCTCCGGGTGACAAAGAACTTGAGAATTTGTGGGGACTGCCACACTGCAATGGTGTTCATTTCAGAGATATATGGGCGGGAAGTCATTGTTAGAGATATCAATCGGTTTCATCACTTTAAAGGGGGTTGTTGCTCTTGTGGAGATTACTGGTGATTCATACCATTACATTTATATACATAAATGGAAATTGATGCGAGTGACATGGCCAAATTGAAATGGGAAAGCGTATAATCTTAGGGGATCATGTTATTAAGGTGAGTGGTTACTTTCAAACATTTGATATGAAAATTGTTTTAAACGTATCATTAAACTTAGTTTGTGTATAAGTGTTTGTATAACTTGCTTTTATAAAAGTGTACTATCGATAATGATCTCTAACGGAAGTTTTT

mRNA sequence

ATGCACAATGGCATTCGGTTATCAATCTCCATTCCAACCCCTAGTCACCTTCTCTTTCGAATCCTTCATTCTTACTCTGGTTCCGCTCACATCGACACTGTCCCTCCACCATCATCTCCACCATTCAAATGCTCAATCTCTCCCCTTACAATCTCTGCGACTCTTCAAAACCTTCTGCAGCCGCTCTCTGCGCCGGGCCCACCTCCGATTCTATCTTATGCCCCCGTTTTCCAGTTCCTTACTGGCCTAAACATGTTGAAATTGGGCCATCAAGTTCACGCCCATATGCTTCTCCGCGGCCTTCAGCCCACTGCGCTGGTTGGCTCCAAGATGGTTGCGTTTTATGCCAGTTCGGGTGATATTGATTCCTCTGTTTCGGTTTTCAATGGGATTGGTGAGCCTTCCTCTCTCTTGTTTAATTCCATGATTCGAGCTTATGCTCGATATGGGTTTGCAGAGAGAACTGTTGCCACTTATTTTTCTATGCATTCTTGGGGCTTTACTGGGGATTACTTTACTTTCCCTTTTGTTCTTAAGTCTTCTGTGGAGTTGTTGAGTGTTTGGATGGGGAAATGTGTTCATGGGCTGATTTTGAGAATTGGGTTGCAGTTTGATTTGTATGTGGCTACTTCTTTGATTATTCTGTATGGGAAATGTGGTGAAATAAATGATGCGGGTAAGGTGTTTGATAATATGACTATTAGAGATGTTTCATCTTGGAATGCTTTACTTGCTGGTTATATGAAGAGTGGGTGTATTGATGCTGCACTGGCGATTTTCGAGAGAATGCCATGGAGGAATATTGTCTCTTGGACGACTATGATTTCTGGATACTCACAGAGCGGCTTGGCACAGCAGGCATTGAGTTTGTTTGATGAAATGGTGAAAGAAGATTCACGAGTAAGACCGAACTGGGTGACTATAATGAGCGTCCTCCCAGCTTGTGCACAATTATCGACACTTGAACGCGGAAGGCAGATTCATGAGTTGGCTTGTCGGATGGGTTTGAATTCAAATGCTTCTGTGCTGATTGCGCTTACTGCAATGTATGCAAAATGTGGAAGCTTAGTCGATGCTCGCAACTGTTTCGACAAGCTTAATAGAAATGAAAAGAATTTGATTGCTTGGAATACCATGATAACTGCTTATGCGTCCTATGGACATGGGCTTCAAGCAGTGTCAACCTTTCGGGAGATGATCCAAGCAGGCATTCAACCGGACGATATTACATTCACAGGATTGTTATCCGGTTGCAGCCATTCAGGTCTTGTTGATGTTGGTTTAAAGTACTTCAATCACATGAGCACCACATATTCGATCAATCCCAGAGTTGAGCATTATGCCTGTGTTGCCGATCTCTTAGGTCGAGCAGGGAGATTAGCTGAAGCAAGTAAACTTGTAGGTGAAATGCCAATGCCTGCAGGACCAAGTATTTGGGGTTCACTATTAGCTGCCTGTCGAAAACACCGTAATCTGGAAATGGCAGAAACTGCAGCAAGAAAGCTTTTTGTCCTAGAACCAGAAAACACCGGCAACTACGTTCTACTTTCCAACATGTATGCTGAAGCCGGAAGGTGGCAGGAAGTTGACAAACTGAGAGCAATTGTGAAATCCCAAGGGACAAAGAAAAGTCCAGGTTGCAGTTGGATCGAGATAAATGGCAAAGCACATATGTTTCTTGGTGGCGATACATCTCACCCTCAAGGCAAAGAAATCTACATGTTCTTGGAGGCATTGCCAGAGAAGATGAAGGCAGCTGGCTATTTTCCTGATACAAGCTATGTACTGCACGACATCAGCGAGGAAGAGAAAGAATTCAACCTCATTGCACACAGTGAGAAGCTCGCCGTTGCATTCGGGATTCTTAACACTCCTGCTGAAACTGTTCTCCGGGTGACAAAGAACTTGAGAATTTGTGGGGACTGCCACACTGCAATGGTGTTCATTTCAGAGATATATGGGCGGGAAGTCATTGTTAGAGATATCAATCGGTTTCATCA
CTTTAAAGGGGGTTGTTGCTCTTGTGGAGATTACTGGTGA

Coding sequence (CDS)

ATGCACAATGGCATTCGGTTATCAATCTCCATTCCAACCCCTAGTCACCTTCTCTTTCGAATCCTTCATTCTTACTCTGGTTCCGCTCACATCGACACTGTCCCTCCACCATCATCTCCACCATTCAAATGCTCAATCTCTCCCCTTACAATCTCTGCGACTCTTCAAAACCTTCTGCAGCCGCTCTCTGCGCCGGGCCCACCTCCGATTCTATCTTATGCCCCCGTTTTCCAGTTCCTTACTGGCCTAAACATGTTGAAATTGGGCCATCAAGTTCACGCCCATATGCTTCTCCGCGGCCTTCAGCCCACTGCGCTGGTTGGCTCCAAGATGGTTGCGTTTTATGCCAGTTCGGGTGATATTGATTCCTCTGTTTCGGTTTTCAATGGGATTGGTGAGCCTTCCTCTCTCTTGTTTAATTCCATGATTCGAGCTTATGCTCGATATGGGTTTGCAGAGAGAACTGTTGCCACTTATTTTTCTATGCATTCTTGGGGCTTTACTGGGGATTACTTTACTTTCCCTTTTGTTCTTAAGTCTTCTGTGGAGTTGTTGAGTGTTTGGATGGGGAAATGTGTTCATGGGCTGATTTTGAGAATTGGGTTGCAGTTTGATTTGTATGTGGCTACTTCTTTGATTATTCTGTATGGGAAATGTGGTGAAATAAATGATGCGGGTAAGGTGTTTGATAATATGACTATTAGAGATGTTTCATCTTGGAATGCTTTACTTGCTGGTTATATGAAGAGTGGGTGTATTGATGCTGCACTGGCGATTTTCGAGAGAATGCCATGGAGGAATATTGTCTCTTGGACGACTATGATTTCTGGATACTCACAGAGCGGCTTGGCACAGCAGGCATTGAGTTTGTTTGATGAAATGGTGAAAGAAGATTCACGAGTAAGACCGAACTGGGTGACTATAATGAGCGTCCTCCCAGCTTGTGCACAATTATCGACACTTGAACGCGGAAGGCAGATTCATGAGTTGGCTTGTCGGATGGGTTTGAATTCAAATGCTTCTGTGCTGATTGCGCTTACTGCAATGTATGCAAAATGTGGAAGCTTAGTCGATGCTCGCAACTGTTTCGACAAGCTTAATAGAAATGAAAAGAATTTGATTGCTTGGAATACCATGATAACTGCTTATGCGTCCTATGGACATGGGCTTCAAGCAGTGTCAACCTTTCGGGAGATGATCCAAGCAGGCATTCAACCGGACGATATTACATTCACAGGATTGTTATCCGGTTGCAGCCATTCAGGTCTTGTTGATGTTGGTTTAAAGTACTTCAATCACATGAGCACCACATATTCGATCAATCCCAGAGTTGAGCATTATGCCTGTGTTGCCGATCTCTTAGGTCGAGCAGGGAGATTAGCTGAAGCAAGTAAACTTGTAGGTGAAATGCCAATGCCTGCAGGACCAAGTATTTGGGGTTCACTATTAGCTGCCTGTCGAAAACACCGTAATCTGGAAATGGCAGAAACTGCAGCAAGAAAGCTTTTTGTCCTAGAACCAGAAAACACCGGCAACTACGTTCTACTTTCCAACATGTATGCTGAAGCCGGAAGGTGGCAGGAAGTTGACAAACTGAGAGCAATTGTGAAATCCCAAGGGACAAAGAAAAGTCCAGGTTGCAGTTGGATCGAGATAAATGGCAAAGCACATATGTTTCTTGGTGGCGATACATCTCACCCTCAAGGCAAAGAAATCTACATGTTCTTGGAGGCATTGCCAGAGAAGATGAAGGCAGCTGGCTATTTTCCTGATACAAGCTATGTACTGCACGACATCAGCGAGGAAGAGAAAGAATTCAACCTCATTGCACACAGTGAGAAGCTCGCCGTTGCATTCGGGATTCTTAACACTCCTGCTGAAACTGTTCTCCGGGTGACAAAGAACTTGAGAATTTGTGGGGACTGCCACACTGCAATGGTGTTCATTTCAGAGATATATGGGCGGGAAGTCATTGTTAGAGATATCAATCGGTTTCATCA
CTTTAAAGGGGGTTGTTGCTCTTGTGGAGATTACTGGTGA
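
To relate the coding sequence above to the protein used as the BLAST query, the CDS can be translated codon by codon. The following is a minimal sketch that assumes the standard genetic code; the function name `translate` is illustrative, and only short fragments copied from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGCACAATGGCATTCGGTTATCA"))
# Last five codons of the CDS, ending in the TGA stop codon.
print(translate("GGAGATTACTGGTGA"))
```

The two fragments translate to `MHNGIRLS` and `GDYW`, which agree with the first and last residues of the query protein shown in the alignments further down the page.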
BLAST of CSPI06G23760 vs. Swiss-Prot
Match: PP271_ARATH (Putative pentatricopeptide repeat-containing protein At3g49142 OS=Arabidopsis thaliana GN=PCMP-H77 PE=3 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 6.2e-136
Identity = 260/630 (41.27%), Positives = 368/630 (58.41%), Query Frame = 1

Query: 92  VHAHMLLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNGIGEPSSLLFNSMIRAYARYGF 151
           VH+ ++L  L+  + +G K++  YAS  D+ S+  VF+ I E + ++ N MIR+Y   GF
Sbjct: 61  VHSRIILEDLRCNSSLGVKLMRAYASLKDVASARKVFDEIPERNVIIINVMIRSYVNNGF 120

Query: 152 AERTVATYFSMHSWGFTGDYFTFPFVLKSSVELLSVWMGKCVHGLILRIGLQFDLYVATS 211
               V  + +M       D++TFP VLK+     ++ +G+ +HG   ++GL   L+V   
Sbjct: 121 YGEGVKVFGTMCGCNVRPDHYTFPCVLKACSCSGTIVIGRKIHGSATKVGLSSTLFVGNG 180

Query: 212 LIILYGKCGEINDAGKVFDNMTIRDVSSWNALL--------------------------- 271
           L+ +YGKCG +++A  V D M+ RDV SWN+L+                           
Sbjct: 181 LVSMYGKCGFLSEARLVLDEMSRRDVVSWNSLVVGYAQNQRFDDALEVCREMESVKISHD 240

Query: 272 AGYMKSGC----------IDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM 331
           AG M S            +     +F +M  +++VSW  MI  Y ++ +  +A+ L+  M
Sbjct: 241 AGTMASLLPAVSNTTTENVMYVKDMFFKMGKKSLVSWNVMIGVYMKNAMPVEAVELYSRM 300

Query: 332 VKEDSRVRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCG 391
             E     P+ V+I SVLPAC   S L  G++IH    R  L  N  +  AL  MYAKCG
Sbjct: 301 --EADGFEPDAVSITSVLPACGDTSALSLGKKIHGYIERKKLIPNLLLENALIDMYAKCG 360

Query: 392 SLVDARNCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGL 451
            L  AR+ F+  N   +++++W  MI+AY   G G  AV+ F ++  +G+ PD I F   
Sbjct: 361 CLEKARDVFE--NMKSRDVVSWTAMISAYGFSGRGCDAVALFSKLQDSGLVPDSIAFVTT 420

Query: 452 LSGCSHSGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPA 511
           L+ CSH+GL++ G   F  M+  Y I PR+EH AC+ DLLGRAG++ EA + + +M M  
Sbjct: 421 LAACSHAGLLEEGRSCFKLMTDHYKITPRLEHLACMVDLLGRAGKVKEAYRFIQDMSMEP 480

Query: 512 GPSIWGSLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRA 571
              +WG+LL ACR H + ++   AA KLF L PE +G YVLLSN+YA+AGRW+EV  +R 
Sbjct: 481 NERVWGALLGACRVHSDTDIGLLAADKLFQLAPEQSGYYVLLSNIYAKAGRWEEVTNIRN 540

Query: 572 IVKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSY 631
           I+KS+G KK+PG S +E+N   H FL GD SHPQ  EIY  L+ L +KMK  GY PD+  
Sbjct: 541 IMKSKGLKKNPGASNVEVNRIIHTFLVGDRSHPQSDEIYRELDVLVKKMKELGYVPDSES 600

Query: 632 VLHDISEEEKEFNLIAHSEKLAVAFGILNTPAE-----TVLRVTKNLRICGDCHTAMVFI 680
            LHD+ EE+KE +L  HSEKLA+ F ++NT  E       +R+TKNLRICGDCH A   I
Sbjct: 601 ALHDVEEEDKETHLAVHSEKLAIVFALMNTKEEEEDSNNTIRITKNLRICGDCHVAAKLI 660
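
The percentages in each HSP summary line are the ratio of matching columns to the alignment length. The sketch below parses one such line and recomputes both values; `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption based on the line layout used in this report (it also tolerates the "Postives" spelling that some report renderings emit).

```python
import re

# Matches e.g. "Identity = 260/630 (41.27%), Positives = 368/630 (58.41%)".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity and positive counts from an HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, aligned, _, positives, aligned_pos, _ = match.groups()
    identical, aligned = int(identical), int(aligned)
    positives, aligned_pos = int(positives), int(aligned_pos)
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        # Recompute the percentages rather than trusting the printed ones.
        "identity_pct": round(100 * identical / aligned, 2),
        "positive_pct": round(100 * positives / aligned_pos, 2),
    }


stats = parse_hsp_stats(
    "Identity = 260/630 (41.27%), Positives = 368/630 (58.41%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the first Swiss-Prot hit, 260 identical and 368 positive columns over 630 aligned positions reproduce the reported 41.27% and 58.41%.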

BLAST of CSPI06G23760 vs. Swiss-Prot
Match: PPR53_ARATH (Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN=PCMP-H21 PE=2 SV=2)

HSP 1 Score: 479.2 bits (1232), Expect = 7.6e-134
Identity = 262/692 (37.86%), Positives = 388/692 (56.07%), Query Frame = 1

Query: 31  IDTVPPPSSPPFKCSISPLTISATLQNLLQPLS---APGPPPILSYAP-VFQFLTGLNML 90
           + ++P P+   F   I  LT +      +   S   + G  P     P +F+    L+  
Sbjct: 73  LQSIPDPTIYSFSSLIYALTKAKLFTQSIGVFSRMFSHGLIPDSHVLPNLFKVCAELSAF 132

Query: 91  KLGHQVHAHMLLRGLQPTALVGSKM-------------------------------VAFY 150
           K+G Q+H    + GL   A V   M                               +  Y
Sbjct: 133 KVGKQIHCVSCVSGLDMDAFVQGSMFHMYMRCGRMGDARKVFDRMSDKDVVTCSALLCAY 192

Query: 151 ASSGDIDSSVSVFNGIG----EPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDY 210
           A  G ++  V + + +     E + + +N ++  + R G+ +  V  +  +H  GF  D 
Sbjct: 193 ARKGCLEEVVRILSEMESSGIEANIVSWNGILSGFNRSGYHKEAVVMFQKIHHLGFCPDQ 252

Query: 211 FTFPFVLKSSVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDN 270
            T   VL S  +   + MG+ +HG +++ GL  D  V +++I +YGK G +     +F+ 
Sbjct: 253 VTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKSGHVYGIISLFNQ 312

Query: 271 MTIRDVSSWNALLAGYMKSGCIDAALAIFERMPWR----NIVSWTTMISGYSQSGLAQQA 330
             + +    NA + G  ++G +D AL +FE    +    N+VSWT++I+G +Q+G   +A
Sbjct: 313 FEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSIIAGCAQNGKDIEA 372

Query: 331 LSLFDEMVKEDSRVRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALT 390
           L LF EM  + + V+PN VTI S+LPAC  ++ L  GR  H  A R+ L  N  V  AL 
Sbjct: 373 LELFREM--QVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLLDNVHVGSALI 432

Query: 391 AMYAKCGSLVDARNCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPD 450
            MYAKCG +  ++  F+ +    KNL+ WN+++  ++ +G   + +S F  +++  ++PD
Sbjct: 433 DMYAKCGRINLSQIVFNMMPT--KNLVCWNSLMNGFSMHGKAKEVMSIFESLMRTRLKPD 492

Query: 451 DITFTGLLSGCSHSGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLV 510
            I+FT LLS C   GL D G KYF  MS  Y I PR+EHY+C+ +LLGRAG+L EA  L+
Sbjct: 493 FISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLI 552

Query: 511 GEMPMPAGPSIWGSLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQ 570
            EMP      +WG+LL +CR   N+++AE AA KLF LEPEN G YVLLSN+YA  G W 
Sbjct: 553 KEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAKGMWT 612

Query: 571 EVDKLRAIVKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAG 630
           EVD +R  ++S G KK+PGCSWI++  + +  L GD SHPQ  +I   ++ + ++M+ +G
Sbjct: 613 EVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISKEMRKSG 672

Query: 631 YFPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMV 680
           + P+  + LHD+ E+E+E  L  HSEKLAV FG+LNTP  T L+V KNLRICGDCH  + 
Sbjct: 673 HRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHAVIK 732

BLAST of CSPI06G23760 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 478.0 bits (1229), Expect = 1.7e-133
Identity = 250/643 (38.88%), Positives = 372/643 (57.85%), Query Frame = 1

Query: 57  NLLQPLSAPGPPPILSYAP---------------VFQFL----TGLNMLKLGHQVHAHML 116
           N L    A GP P+LS                   F FL      ++ L LG  +H   +
Sbjct: 99  NTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAV 158

Query: 117 LRGLQPTALVGSKMVAFYASSGDIDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVA 176
              +     V + ++  Y S GD+DS+  VF  I E   + +NSMI  + + G  ++ + 
Sbjct: 159 KSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALE 218

Query: 177 TYFSMHSWGFTGDYFTFPFVLKSSVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYG 236
            +  M S      + T   VL +  ++ ++  G+ V   I    +  +L +A +++ +Y 
Sbjct: 219 LFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYT 278

Query: 237 KCGEINDAGKVFDNMTIRDVSSWNALLAGYMKSGCIDAALAIFERMPWRNIVSWTTMISG 296
           KCG I DA ++FD M  +D  +W  +L GY  S   +AA  +   MP ++IV+W  +IS 
Sbjct: 279 KCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISA 338

Query: 297 YSQSGLAQQALSLFDEMVKEDSRVRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLN 356
           Y Q+G   +AL +F E+  + + ++ N +T++S L ACAQ+  LE GR IH    + G+ 
Sbjct: 339 YEQNGKPNEALIVFHELQLQKN-MKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIR 398

Query: 357 SNASVLIALTAMYAKCGSLVDARNCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFR 416
            N  V  AL  MY+KCG L  +R  F+ + +  +++  W+ MI   A +G G +AV  F 
Sbjct: 399 MNFHVTSALIHMYSKCGDLEKSREVFNSVEK--RDVFVWSAMIGGLAMHGCGNEAVDMFY 458

Query: 417 EMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRA 476
           +M +A ++P+ +TFT +   CSH+GLVD     F+ M + Y I P  +HYAC+ D+LGR+
Sbjct: 459 KMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRS 518

Query: 477 GRLAEASKLVGEMPMPAGPSIWGSLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLS 536
           G L +A K +  MP+P   S+WG+LL AC+ H NL +AE A  +L  LEP N G +VLLS
Sbjct: 519 GYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLS 578

Query: 537 NMYAEAGRWQEVDKLRAIVKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLE 596
           N+YA+ G+W+ V +LR  ++  G KK PGCS IEI+G  H FL GD +HP  +++Y  L 
Sbjct: 579 NIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLH 638

Query: 597 ALPEKMKAAGYFPDTSYVLHDISEEE-KEFNLIAHSEKLAVAFGILNTPAETVLRVTKNL 656
            + EK+K+ GY P+ S VL  I EEE KE +L  HSEKLA+ +G+++T A  V+RV KNL
Sbjct: 639 EVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNL 698

Query: 657 RICGDCHTAMVFISEIYGREVIVRDINRFHHFKGGCCSCGDYW 680
           R+CGDCH+    IS++Y RE+IVRD  RFHHF+ G CSC D+W
Sbjct: 699 RVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of CSPI06G23760 vs. Swiss-Prot
Match: PP251_ARATH (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 476.5 bits (1225), Expect = 4.9e-133
Identity = 260/673 (38.63%), Positives = 371/673 (55.13%), Query Frame = 1

Query: 43  KCSISPLTISATLQNLLQPLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQ 102
           +C       S  L + ++  ++   P    +  V +  T +  L+ G  VH  ++  G+ 
Sbjct: 78  RCFTDQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMD 137

Query: 103 PTALVGSKMVAFYA-----------------------SSGD-------------IDSSVS 162
                G+ ++  YA                       +SGD             IDS   
Sbjct: 138 CDLYTGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRR 197

Query: 163 VFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVELLS 222
           VF  +     + +N++I  YA+ G  E  +     M +     D FT   VL    E + 
Sbjct: 198 VFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVD 257

Query: 223 VWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSWNALLAG 282
           V  GK +HG ++R G+  D+Y+ +SL+ +Y K   I D+ +VF  +  RD  SWN+L+AG
Sbjct: 258 VIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAG 317

Query: 283 YMKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSRVRPNWV 342
           Y+                               Q+G   +AL LF +MV   ++V+P  V
Sbjct: 318 YV-------------------------------QNGRYNEALRLFRQMV--TAKVKPGAV 377

Query: 343 TIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDARNCFDKL 402
              SV+PACA L+TL  G+Q+H    R G  SN  +  AL  MY+KCG++  AR  FD++
Sbjct: 378 AFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRM 437

Query: 403 NRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSHSGLVDV 462
           N  ++  ++W  +I  +A +GHG +AVS F EM + G++P+ + F  +L+ CSH GLVD 
Sbjct: 438 NVLDE--VSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDE 497

Query: 463 GLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWGSLLAAC 522
              YFN M+  Y +N  +EHYA VADLLGRAG+L EA   + +M +    S+W +LL++C
Sbjct: 498 AWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSC 557

Query: 523 RKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQGTKKSPG 582
             H+NLE+AE  A K+F ++ EN G YVL+ NMYA  GRW+E+ KLR  ++ +G +K P 
Sbjct: 558 SVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPA 617

Query: 583 CSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDISEEEKEF 642
           CSWIE+  K H F+ GD SHP   +I  FL+A+ E+M+  GY  DTS VLHD+ EE K  
Sbjct: 618 CSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRE 677

Query: 643 NLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVRDINRFH 680
            L  HSE+LAVAFGI+NT   T +RVTKN+RIC DCH A+ FIS+I  RE+IVRD +RFH
Sbjct: 678 LLFGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFH 715

BLAST of CSPI06G23760 vs. Swiss-Prot
Match: PP285_ARATH (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 475.7 bits (1223), Expect = 8.4e-133
Identity = 248/617 (40.19%), Positives = 368/617 (59.64%), Query Frame = 1

Query: 68  PPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRG-LQPTALVGSKMVAFYASSGDIDSSVS 127
           P   + + V    + L ML+ G ++HA+ L  G L   + VGS +V  Y +   + S   
Sbjct: 300 PDEFTISSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRR 359

Query: 128 VFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMH-SWGFTGDYFTFPFVLKSSVELL 187
           VF+G+ +    L+N+MI  Y++    +  +  +  M  S G   +  T   V+ + V   
Sbjct: 360 VFDGMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSG 419

Query: 188 SVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSWNALLA 247
           +    + +HG +++ GL  D +V  +L+ +Y + G+I+ A ++F  M  RD+ +WN ++ 
Sbjct: 420 AFSRKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMIT 479

Query: 248 GYMKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSRV--RP 307
           GY+ S   + AL +  +M                         +L  ++ K  SRV  +P
Sbjct: 480 GYVFSEHHEDALLLLHKMQ------------------------NLERKVSKGASRVSLKP 539

Query: 308 NWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDARNCF 367
           N +T+M++LP+CA LS L +G++IH  A +  L ++ +V  AL  MYAKCG L  +R  F
Sbjct: 540 NSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVF 599

Query: 368 DKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSHSGL 427
           D++   +KN+I WN +I AY  +G+G +A+   R M+  G++P+++TF  + + CSHSG+
Sbjct: 600 DQIP--QKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVFAACSHSGM 659

Query: 428 VDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPS-IWGSL 487
           VD GL+ F  M   Y + P  +HYACV DLLGRAGR+ EA +L+  MP     +  W SL
Sbjct: 660 VDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSL 719

Query: 488 LAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQGTK 547
           L A R H NLE+ E AA+ L  LEP    +YVLL+N+Y+ AG W +  ++R  +K QG +
Sbjct: 720 LGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRRNMKEQGVR 779

Query: 548 KSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDISEE 607
           K PGCSWIE   + H F+ GD+SHPQ +++  +LE L E+M+  GY PDTS VLH++ E+
Sbjct: 780 KEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGYVPDTSCVLHNVEED 839

Query: 608 EKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVRDI 667
           EKE  L  HSEKLA+AFGILNT   T++RV KNLR+C DCH A  FIS+I  RE+I+RD+
Sbjct: 840 EKEILLCGHSEKLAIAFGILNTSPGTIIRVAKNLRVCNDCHLATKFISKIVDREIILRDV 890

Query: 668 NRFHHFKGGCCSCGDYW 680
            RFH FK G CSCGDYW
Sbjct: 900 RRFHRFKNGTCSCGDYW 890

BLAST of CSPI06G23760 vs. TrEMBL
Match: A0A0A0KEZ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G430650 PE=4 SV=1)

HSP 1 Score: 1379.0 bits (3568), Expect = 0.0e+00
Identity = 677/679 (99.71%), Positives = 677/679 (99.71%), Query Frame = 1

Query: 1   MHNGIRLSISIPTPSHLLFRILHSYSGSAHIDTVPPPSSPPFKCSISPLTISATLQNLLQ 60
           MHNGIRLSISIPTPSHLLFRILHSYSGSAHIDTVPPPSSPPFKCSISPLTISATLQNLLQ
Sbjct: 1   MHNGIRLSISIPTPSHLLFRILHSYSGSAHIDTVPPPSSPPFKCSISPLTISATLQNLLQ 60

Query: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120
           PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSW 240
           SVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSW
Sbjct: 181 SVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSW 240

Query: 241 NALLAGYMKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSR 300
           NALLAGY KSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDS 
Sbjct: 241 NALLAGYTKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360
           VRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR
Sbjct: 301 VRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360

Query: 361 NCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSH 420
           NCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSH
Sbjct: 361 NCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSH 420

Query: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWG 480
           SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWG
Sbjct: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWG 480

Query: 481 SLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540
           SLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG
Sbjct: 481 SLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540

Query: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDIS 600
           TKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDIS
Sbjct: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDIS 600

Query: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660
           EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660

Query: 661 DINRFHHFKGGCCSCGDYW 680
           DINRFHHFKGGCCSCGDYW
Sbjct: 661 DINRFHHFKGGCCSCGDYW 679

BLAST of CSPI06G23760 vs. TrEMBL
Match: M5X3I7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002838mg PE=4 SV=1)

HSP 1 Score: 902.9 bits (2332), Expect = 2.4e-259
Identity = 431/625 (68.96%), Positives = 509/625 (81.44%), Query Frame = 1

Query: 56  QNLLQPLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFY 115
           + LL+ L A  P  I  YAP+FQ LT  N+LKLG QVHA M LRGL+P A +G+KMVA Y
Sbjct: 4   RTLLKSLLAQDPTCISFYAPIFQSLTSQNLLKLGQQVHAQMALRGLEPNAFLGAKMVAMY 63

Query: 116 ASSGDIDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFP 175
           ASS ++DS+V++F+ +  PS+LL+NS+IRAY  YG++E+T+  Y  MH  G  GD FT+P
Sbjct: 64  ASSDNLDSAVNIFHRVNNPSTLLYNSIIRAYTLYGYSEKTMEIYGQMHRLGLKGDNFTYP 123

Query: 176 FVLKSSVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIR 235
           FVLK    L S+W+GKCVH L LRIGL  D+YV TSLI +Y KCGE++DA   FD MT+R
Sbjct: 124 FVLKCCANLSSIWLGKCVHSLSLRIGLASDMYVGTSLIDMYVKCGEMSDARSSFDKMTVR 183

Query: 236 DVSSWNALLAGYMKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMV 295
           DVSSWNAL+AGYMK G I  A  +F RMP +NIVSWT MISGY+Q+GLA+QAL LFDEM+
Sbjct: 184 DVSSWNALIAGYMKDGEICFAEDLFRRMPCKNIVSWTAMISGYTQNGLAEQALVLFDEML 243

Query: 296 KEDSRVRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGS 355
           ++DS V+PNWVTIMSVLPACA  + LERGRQIH  A R GL+SN S+  AL AMYAKCGS
Sbjct: 244 RKDSEVKPNWVTIMSVLPACAHSAALERGRQIHNFASRTGLDSNTSIQTALLAMYAKCGS 303

Query: 356 LVDARNCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLL 415
           L DAR CF+++++ E +L+AWNTMITAYAS+G G +AVSTF +MI AG+QPD+ITFTGLL
Sbjct: 304 LSDARQCFERVHQTENSLVAWNTMITAYASHGRGSEAVSTFEDMIGAGLQPDNITFTGLL 363

Query: 416 SGCSHSGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAG 475
           SGCSHSGLVD GLKYFN M T YSI PRVEHYACV DLLGRAGRL EA  LV +MPM AG
Sbjct: 364 SGCSHSGLVDGGLKYFNCMKTIYSIEPRVEHYACVVDLLGRAGRLVEAIDLVSKMPMQAG 423

Query: 476 PSIWGSLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAI 535
           PSIWG+LL+ACRKH NLE+AE AARKLF+LEP+N+GNYVLLSN+YA+AG W+EVD LRA+
Sbjct: 424 PSIWGALLSACRKHHNLEIAEIAARKLFILEPDNSGNYVLLSNIYADAGMWKEVDDLRAL 483

Query: 536 VKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIY-MFLEALPEKMKAAGYFPDTSY 595
           +KSQG KK+PGCSWIE+NGKAH+FLGGDT HPQ KEIY + LE LP K+KAAGY PDTS+
Sbjct: 484 LKSQGMKKNPGCSWIEVNGKAHLFLGGDTCHPQAKEIYEVLLEELPNKIKAAGYVPDTSF 543

Query: 596 VLHDISEEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYG 655
           VLHD+SEEEKE NL  HSEKLA+AFG+LN     VLRVTKNLRICGDCHTA   IS IY 
Sbjct: 544 VLHDVSEEEKEHNLTTHSEKLAIAFGLLNASPGVVLRVTKNLRICGDCHTATKLISRIYE 603

Query: 656 REVIVRDINRFHHFKGGCCSCGDYW 680
           RE+IVRD+NRFHHF+ GCCSCGDYW
Sbjct: 604 REIIVRDLNRFHHFRDGCCSCGDYW 628

BLAST of CSPI06G23760 vs. TrEMBL
Match: A0A0D2SZE2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G028200 PE=4 SV=1)

HSP 1 Score: 871.7 bits (2251), Expect = 5.9e-250
Identity = 426/653 (65.24%), Positives = 505/653 (77.34%), Query Frame = 1

Query: 29  AHIDTVPPPSSPP-FKCSI-SPLTISATLQNLLQPLSAPGPPPILSYAPVFQFLTGLNML 88
           A + T+ P   P   KC+   P   ++TL  LLQP+S   PPP LSYAP+FQFLTG N L
Sbjct: 21  AFLSTIHPHIDPSQTKCTTPKPFPYTSTLPTLLQPISDQNPPPHLSYAPLFQFLTGQNFL 80

Query: 89  KLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNGIGEPSSLLFNSMIRAY 148
           KLG Q+HAHM L GLQP A +G+KMVA YASSGD++S+V+VF  I +P+SLL+NS+IRAY
Sbjct: 81  KLGQQIHAHMTLHGLQPNAFLGAKMVAMYASSGDLESAVTVFRKIKDPTSLLYNSIIRAY 140

Query: 149 ARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVELLSVWMGKCVHGLILRIGLQFDL 208
              G+  +T+  Y  MHS    GD FTFPFVLKS   +L VWMG+CVHG  LR GL+ D 
Sbjct: 141 TNNGYPLKTIDIYREMHSLRLKGDNFTFPFVLKSCANVLDVWMGECVHGQSLRFGLELDA 200

Query: 209 YVATSLIILYGKCGEINDAGKVFDNMTIRDVSSWNALLAGYMKSGCIDAALAIFERMPWR 268
           YV TSLI  Y K GE+ DA KVFD MT+R VSSWNAL+AGYMK G I  A  +F  MP R
Sbjct: 201 YVGTSLIDFYVKVGELRDANKVFDLMTVRAVSSWNALIAGYMKEGEIRVAEDLFRGMPCR 260

Query: 269 NIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSRVRPNWVTIMSVLPACAQLSTLERGRQ 328
           NIVSWT+MISGY+Q+GLA++ALSLFDEM+KEDS V+PNWVTIMSVLPACA  ++ ERGR+
Sbjct: 261 NIVSWTSMISGYTQNGLAEEALSLFDEMLKEDSEVKPNWVTIMSVLPACAHSASFERGRR 320

Query: 329 IHELACRMGLNSNASVLIALTAMYAKCGSLVDARNCFDKLNRNEKNLIAWNTMITAYASY 388
           I+E   R+GL SN SV  AL AMYAKCGSLV AR CFD++  NEKNL AWNTMITAYAS+
Sbjct: 321 INEYVNRIGLESNPSVQTALIAMYAKCGSLVSARCCFDRILENEKNLCAWNTMITAYASH 380

Query: 389 GHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFNHMSTTYSINPRVEH 448
           G GL++VSTF  M++AG+ PD ITFTGLLSGCSHSG+V+ GL+YFN M T YS+ PR EH
Sbjct: 381 GQGLESVSTFENMVRAGVYPDAITFTGLLSGCSHSGIVEFGLRYFNSMQTKYSVEPRHEH 440

Query: 449 YACVADLLGRAGRLAEASKLVGEMPMPAGPSIWGSLLAACRKHRNLEMAETAARKLFVLE 508
           YACV DLL RAGRL EA + + ++PM  GPSIWG+LLAACRK RNLE+AE AA++LFVLE
Sbjct: 441 YACVVDLLARAGRLVEAKEFIKKIPMQPGPSIWGALLAACRKSRNLEIAEIAAKELFVLE 500

Query: 509 PENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQGTKKSPGCSWIEINGKAHMFLGGDTSH 568
           PEN+ NY+LLSNMYAEAG W+EVDKLRA +K +G KK+PGCSWIEI GKAH+FL GD SH
Sbjct: 501 PENSCNYILLSNMYAEAGMWKEVDKLRARLKCEGIKKNPGCSWIEIKGKAHLFLSGDLSH 560

Query: 569 PQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTPA 628
           PQ KEIY  LEALPEK+KAAGY P+T +VLHDISEEEKE NLI H               
Sbjct: 561 PQSKEIYNLLEALPEKIKAAGYIPNTGFVLHDISEEEKEQNLIIH--------------- 620

Query: 629 ETVLRVTKNLRICGDCHTAMVFISEIYGREVIVRDINRFHHFKGGCCSCGDYW 680
             ++R+TKNLRICGDCHT + FIS+IY RE++VRD+NRFHHF+ G CSCGDYW
Sbjct: 621 --IIRITKNLRICGDCHTVIKFISKIYEREIVVRDVNRFHHFRHGACSCGDYW 656

BLAST of CSPI06G23760 vs. TrEMBL
Match: K4B1Y4_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 866.7 bits (2238), Expect = 1.9e-248
Identity = 414/625 (66.24%), Positives = 493/625 (78.88%), Query Frame = 1

Query: 55  LQNLLQPLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAF 114
           L+ +LQPL     PP  +YA +FQFL G N +KLG QVHAHM +RG+ P  LV +KMVA 
Sbjct: 2   LKIILQPLYQNSFPPS-TYASIFQFLVGKNFVKLGQQVHAHMAVRGVSPNGLVAAKMVAM 61

Query: 115 YASSGDIDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTF 174
           YASSG+IDS+  +F+   EPSSLL+N+MIRA   YG  +RT+  +F MHS GF GD FTF
Sbjct: 62  YASSGEIDSASYIFDSATEPSSLLYNAMIRALTLYGITKRTIEIFFQMHSLGFRGDNFTF 121

Query: 175 PFVLKSSVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTI 234
           PFV KS  +L  VW GKCVH LILR G  FD+YV TSL+ +Y KCG++ DA K+FD M +
Sbjct: 122 PFVFKSCADLSDVWCGKCVHSLILRSGFVFDMYVGTSLVDMYVKCGDLIDARKLFDEMPV 181

Query: 235 RDVSSWNALLAGYMKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM 294
           RDVS+WN L+AGYMK G    A  +FE MP RNIVSWT MISGY+Q+GLA ++L LFD+M
Sbjct: 182 RDVSAWNVLIAGYMKDGLFKDAEELFEEMPIRNIVSWTAMISGYAQNGLADESLQLFDKM 241

Query: 295 VKEDSRVRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCG 354
           +  DS VRPNWVT+MSVLPACA  + L+RG++IH  A   GL  N SV  AL AMYAKCG
Sbjct: 242 LDPDSEVRPNWVTVMSVLPACAHSAALDRGKKIHSFAREAGLEKNPSVQTALIAMYAKCG 301

Query: 355 SLVDARNCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGL 414
           SLVDAR CFD++N  EK L+AWNTMITAYAS+G G +AVSTF +M++AGIQPD ITFTGL
Sbjct: 302 SLVDARLCFDQINPREKKLVAWNTMITAYASHGFGREAVSTFEDMLRAGIQPDKITFTGL 361

Query: 415 LSGCSHSGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPA 474
           LSGCSHSGLVDVGL+YF+ MS  Y +    +HYACV DLLGRAGRL EA  L+ +MPM A
Sbjct: 362 LSGCSHSGLVDVGLRYFDCMSLVYFVEKGHDHYACVVDLLGRAGRLVEAYNLISQMPMAA 421

Query: 475 GPSIWGSLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRA 534
           GPSIWGSLLAA R HRNLE+AE AA+KLF+LEP+N+GNY++LSNMYAEAG W+EV  LR 
Sbjct: 422 GPSIWGSLLAAGRSHRNLEIAELAAKKLFILEPDNSGNYIVLSNMYAEAGMWEEVTHLRI 481

Query: 535 IVKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSY 594
             KS+   KSPGCSWIE +GKAH+FLGGDTSHPQ ++IY+FLEALP K+KAAGY PDT++
Sbjct: 482 QQKSRRIMKSPGCSWIEFDGKAHLFLGGDTSHPQAEQIYLFLEALPAKIKAAGYMPDTTF 541

Query: 595 VLHDISEEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYG 654
            LHD+SEEEKE NL +HSE+LA+AFGILNT   TVLRVTKNLRICGDCHTA+  +S+IY 
Sbjct: 542 ALHDVSEEEKEQNLSSHSERLAIAFGILNTSPGTVLRVTKNLRICGDCHTAIKLVSKIYE 601

Query: 655 REVIVRDINRFHHFKGGCCSCGDYW 680
           RE+IVRD+NRFHHFK G CSC DYW
Sbjct: 602 REIIVRDVNRFHHFKDGSCSCRDYW 625

BLAST of CSPI06G23760 vs. TrEMBL
Match: W9QT12_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_002732 PE=4 SV=1)

HSP 1 Score: 850.1 bits (2195), Expect = 1.8e-243
Identity = 413/651 (63.44%), Positives = 515/651 (79.11%), Query Frame = 1

Query: 30  HID-TVPPPSSPPFKCSISPLTISATLQNLLQPLSAPGPPPILSYAPVFQFLTGLNMLKL 89
           H D ++P    PP+      L++ +TL++L Q      PP + SYA +FQ LTG N+L+L
Sbjct: 33  HFDVSLPKHQIPPW------LSLVSTLRSLAQD-----PPQVSSYAAIFQSLTGKNLLRL 92

Query: 90  GHQVHAHMLLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNGIGEPSSLLFNSMIRAYAR 149
           G QVH+HM LR L+P A +G+KM+A YAS+GD+ S+V+VF  I  PS+LL NS+IRAY+ 
Sbjct: 93  GRQVHSHMSLRALEPDAFLGAKMIAMYASAGDLRSAVAVFRRIKYPSALLCNSIIRAYSW 152

Query: 150 YGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVELLSVWMGKCVHGLILRIGLQFDLYV 209
           + F ++T+  YF M S G   D+FT+PFVLKS  +L  V MG+  HGL LR G + D YV
Sbjct: 153 HWFPKKTIGVYFRMRSLGLKADHFTYPFVLKSCADLSDVRMGRYAHGLSLRTGFEEDFYV 212

Query: 210 ATSLIILYGKCGEINDAGKVFDNMTIRDVSSWNALLAGYMKSGCIDAALAIFERMPWRNI 269
            TSLI +Y KCG I DA K+FD MT+RD+SSWNAL+AGYMK G I  A  +F RM  RNI
Sbjct: 213 GTSLINMYVKCGGIGDARKMFDVMTVRDISSWNALIAGYMKIGEIRLAEDLFGRMVRRNI 272

Query: 270 VSWTTMISGYSQSGLAQQALSLFDEMVKEDSRVRPNWVTIMSVLPACAQLSTLERGRQIH 329
           VSWT MISGY+Q+GLA QAL LFD+M+++DS ++P WVTIMSVLPACA  + LERGR+IH
Sbjct: 273 VSWTAMISGYAQNGLAGQALVLFDKMLEDDSGIKPTWVTIMSVLPACAHSAALERGREIH 332

Query: 330 ELACRMGLNSNASVLIALTAMYAKCGSLVDARNCFDKLNRNEKNLIAWNTMITAYASYGH 389
           +LA R+GL+S+ SV  AL AMYA+CGSL +A  CFD++++++K+L+ WNTMI+AYAS+G 
Sbjct: 333 KLASRIGLDSDVSVQSALIAMYARCGSLAEACQCFDRIHQHKKDLVVWNTMISAYASHGR 392

Query: 390 GLQAVSTFREMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFNHMSTTYSINPRVEHYA 449
           GL++VSTF +MI+A IQPD I+FTGLLSGCSHSGLVD+G+KYFN M T Y++ P V+H A
Sbjct: 393 GLESVSTFEDMIRARIQPDIISFTGLLSGCSHSGLVDLGIKYFNRMKTMYNVEPEVQHCA 452

Query: 450 CVADLLGRAGRLAEASKLVGEMPMPAGPSIWGSLLAACRKHRNLEMAETAARKLFVLEPE 509
           CV DLLGRAGRL EA +L+ +MPM AG S WG+LLAACRKHRNLE+AE AA+KLFVLEP 
Sbjct: 453 CVVDLLGRAGRLVEAKELIDKMPMQAGASAWGALLAACRKHRNLELAEVAAKKLFVLEPY 512

Query: 510 NTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQ 569
           ++ NYV LSNMYAEAG W+EV  LR ++K +G +K+PGCSWIE+NGKAHMFLGGDTSHPQ
Sbjct: 513 SSANYVHLSNMYAEAGMWKEVANLRDLLKYRGIRKTPGCSWIEVNGKAHMFLGGDTSHPQ 572

Query: 570 GKEIYMFLEALPEKMKAAGYFPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTPAET 629
            +EIYMFLE+LPEKMK AGY PDTS VLHD+SEEEKE NL +HSEKLA+AFG+LNT   T
Sbjct: 573 TREIYMFLESLPEKMKQAGYVPDTSPVLHDLSEEEKEHNLTSHSEKLAIAFGLLNTSPST 632

Query: 630 VLRVTKNLRICGDCHTAMVFISEIYGREVIVRDINRFHHFKGGCCSCGDYW 680
           ++RVTKNLRIC DCHTA  FIS+I+ RE+IVRD+NRFHHF  G CSCGDYW
Sbjct: 633 IIRVTKNLRICVDCHTATKFISKIFRREIIVRDLNRFHHFTDGSCSCGDYW 672

BLAST of CSPI06G23760 vs. TAIR10
Match: AT3G49142.1 (AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 486.1 bits (1250), Expect = 3.5e-137
Identity = 260/630 (41.27%), Positives = 368/630 (58.41%), Query Frame = 1

Query: 92  VHAHMLLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNGIGEPSSLLFNSMIRAYARYGF 151
           VH+ ++L  L+  + +G K++  YAS  D+ S+  VF+ I E + ++ N MIR+Y   GF
Sbjct: 61  VHSRIILEDLRCNSSLGVKLMRAYASLKDVASARKVFDEIPERNVIIINVMIRSYVNNGF 120

Query: 152 AERTVATYFSMHSWGFTGDYFTFPFVLKSSVELLSVWMGKCVHGLILRIGLQFDLYVATS 211
               V  + +M       D++TFP VLK+     ++ +G+ +HG   ++GL   L+V   
Sbjct: 121 YGEGVKVFGTMCGCNVRPDHYTFPCVLKACSCSGTIVIGRKIHGSATKVGLSSTLFVGNG 180

Query: 212 LIILYGKCGEINDAGKVFDNMTIRDVSSWNALL--------------------------- 271
           L+ +YGKCG +++A  V D M+ RDV SWN+L+                           
Sbjct: 181 LVSMYGKCGFLSEARLVLDEMSRRDVVSWNSLVVGYAQNQRFDDALEVCREMESVKISHD 240

Query: 272 AGYMKSGC----------IDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM 331
           AG M S            +     +F +M  +++VSW  MI  Y ++ +  +A+ L+  M
Sbjct: 241 AGTMASLLPAVSNTTTENVMYVKDMFFKMGKKSLVSWNVMIGVYMKNAMPVEAVELYSRM 300

Query: 332 VKEDSRVRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCG 391
             E     P+ V+I SVLPAC   S L  G++IH    R  L  N  +  AL  MYAKCG
Sbjct: 301 --EADGFEPDAVSITSVLPACGDTSALSLGKKIHGYIERKKLIPNLLLENALIDMYAKCG 360

Query: 392 SLVDARNCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGL 451
            L  AR+ F+  N   +++++W  MI+AY   G G  AV+ F ++  +G+ PD I F   
Sbjct: 361 CLEKARDVFE--NMKSRDVVSWTAMISAYGFSGRGCDAVALFSKLQDSGLVPDSIAFVTT 420

Query: 452 LSGCSHSGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPA 511
           L+ CSH+GL++ G   F  M+  Y I PR+EH AC+ DLLGRAG++ EA + + +M M  
Sbjct: 421 LAACSHAGLLEEGRSCFKLMTDHYKITPRLEHLACMVDLLGRAGKVKEAYRFIQDMSMEP 480

Query: 512 GPSIWGSLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRA 571
              +WG+LL ACR H + ++   AA KLF L PE +G YVLLSN+YA+AGRW+EV  +R 
Sbjct: 481 NERVWGALLGACRVHSDTDIGLLAADKLFQLAPEQSGYYVLLSNIYAKAGRWEEVTNIRN 540

Query: 572 IVKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSY 631
           I+KS+G KK+PG S +E+N   H FL GD SHPQ  EIY  L+ L +KMK  GY PD+  
Sbjct: 541 IMKSKGLKKNPGASNVEVNRIIHTFLVGDRSHPQSDEIYRELDVLVKKMKELGYVPDSES 600

Query: 632 VLHDISEEEKEFNLIAHSEKLAVAFGILNTPAE-----TVLRVTKNLRICGDCHTAMVFI 680
            LHD+ EE+KE +L  HSEKLA+ F ++NT  E       +R+TKNLRICGDCH A   I
Sbjct: 601 ALHDVEEEDKETHLAVHSEKLAIVFALMNTKEEEEDSNNTIRITKNLRICGDCHVAAKLI 660

BLAST of CSPI06G23760 vs. TAIR10
Match: AT1G20230.1 (AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 479.2 bits (1232), Expect = 4.3e-135
Identity = 262/692 (37.86%), Positives = 388/692 (56.07%), Query Frame = 1

Query: 31  IDTVPPPSSPPFKCSISPLTISATLQNLLQPLS---APGPPPILSYAP-VFQFLTGLNML 90
           + ++P P+   F   I  LT +      +   S   + G  P     P +F+    L+  
Sbjct: 73  LQSIPDPTIYSFSSLIYALTKAKLFTQSIGVFSRMFSHGLIPDSHVLPNLFKVCAELSAF 132

Query: 91  KLGHQVHAHMLLRGLQPTALVGSKM-------------------------------VAFY 150
           K+G Q+H    + GL   A V   M                               +  Y
Sbjct: 133 KVGKQIHCVSCVSGLDMDAFVQGSMFHMYMRCGRMGDARKVFDRMSDKDVVTCSALLCAY 192

Query: 151 ASSGDIDSSVSVFNGIG----EPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDY 210
           A  G ++  V + + +     E + + +N ++  + R G+ +  V  +  +H  GF  D 
Sbjct: 193 ARKGCLEEVVRILSEMESSGIEANIVSWNGILSGFNRSGYHKEAVVMFQKIHHLGFCPDQ 252

Query: 211 FTFPFVLKSSVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDN 270
            T   VL S  +   + MG+ +HG +++ GL  D  V +++I +YGK G +     +F+ 
Sbjct: 253 VTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKSGHVYGIISLFNQ 312

Query: 271 MTIRDVSSWNALLAGYMKSGCIDAALAIFERMPWR----NIVSWTTMISGYSQSGLAQQA 330
             + +    NA + G  ++G +D AL +FE    +    N+VSWT++I+G +Q+G   +A
Sbjct: 313 FEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSIIAGCAQNGKDIEA 372

Query: 331 LSLFDEMVKEDSRVRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALT 390
           L LF EM  + + V+PN VTI S+LPAC  ++ L  GR  H  A R+ L  N  V  AL 
Sbjct: 373 LELFREM--QVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLLDNVHVGSALI 432

Query: 391 AMYAKCGSLVDARNCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPD 450
            MYAKCG +  ++  F+ +    KNL+ WN+++  ++ +G   + +S F  +++  ++PD
Sbjct: 433 DMYAKCGRINLSQIVFNMMPT--KNLVCWNSLMNGFSMHGKAKEVMSIFESLMRTRLKPD 492

Query: 451 DITFTGLLSGCSHSGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLV 510
            I+FT LLS C   GL D G KYF  MS  Y I PR+EHY+C+ +LLGRAG+L EA  L+
Sbjct: 493 FISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLI 552

Query: 511 GEMPMPAGPSIWGSLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQ 570
            EMP      +WG+LL +CR   N+++AE AA KLF LEPEN G YVLLSN+YA  G W 
Sbjct: 553 KEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIYAAKGMWT 612

Query: 571 EVDKLRAIVKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAG 630
           EVD +R  ++S G KK+PGCSWI++  + +  L GD SHPQ  +I   ++ + ++M+ +G
Sbjct: 613 EVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEISKEMRKSG 672

Query: 631 YFPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMV 680
           + P+  + LHD+ E+E+E  L  HSEKLAV FG+LNTP  T L+V KNLRICGDCH  + 
Sbjct: 673 HRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICGDCHAVIK 732

BLAST of CSPI06G23760 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 478.0 bits (1229), Expect = 9.6e-135
Identity = 250/643 (38.88%), Positives = 372/643 (57.85%), Query Frame = 1

Query: 57  NLLQPLSAPGPPPILSYAP---------------VFQFL----TGLNMLKLGHQVHAHML 116
           N L    A GP P+LS                   F FL      ++ L LG  +H   +
Sbjct: 99  NTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAV 158

Query: 117 LRGLQPTALVGSKMVAFYASSGDIDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVA 176
              +     V + ++  Y S GD+DS+  VF  I E   + +NSMI  + + G  ++ + 
Sbjct: 159 KSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALE 218

Query: 177 TYFSMHSWGFTGDYFTFPFVLKSSVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYG 236
            +  M S      + T   VL +  ++ ++  G+ V   I    +  +L +A +++ +Y 
Sbjct: 219 LFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYT 278

Query: 237 KCGEINDAGKVFDNMTIRDVSSWNALLAGYMKSGCIDAALAIFERMPWRNIVSWTTMISG 296
           KCG I DA ++FD M  +D  +W  +L GY  S   +AA  +   MP ++IV+W  +IS 
Sbjct: 279 KCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISA 338

Query: 297 YSQSGLAQQALSLFDEMVKEDSRVRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLN 356
           Y Q+G   +AL +F E+  + + ++ N +T++S L ACAQ+  LE GR IH    + G+ 
Sbjct: 339 YEQNGKPNEALIVFHELQLQKN-MKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIR 398

Query: 357 SNASVLIALTAMYAKCGSLVDARNCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFR 416
            N  V  AL  MY+KCG L  +R  F+ + +  +++  W+ MI   A +G G +AV  F 
Sbjct: 399 MNFHVTSALIHMYSKCGDLEKSREVFNSVEK--RDVFVWSAMIGGLAMHGCGNEAVDMFY 458

Query: 417 EMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRA 476
           +M +A ++P+ +TFT +   CSH+GLVD     F+ M + Y I P  +HYAC+ D+LGR+
Sbjct: 459 KMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRS 518

Query: 477 GRLAEASKLVGEMPMPAGPSIWGSLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLS 536
           G L +A K +  MP+P   S+WG+LL AC+ H NL +AE A  +L  LEP N G +VLLS
Sbjct: 519 GYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLS 578

Query: 537 NMYAEAGRWQEVDKLRAIVKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLE 596
           N+YA+ G+W+ V +LR  ++  G KK PGCS IEI+G  H FL GD +HP  +++Y  L 
Sbjct: 579 NIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLH 638

Query: 597 ALPEKMKAAGYFPDTSYVLHDISEEE-KEFNLIAHSEKLAVAFGILNTPAETVLRVTKNL 656
            + EK+K+ GY P+ S VL  I EEE KE +L  HSEKLA+ +G+++T A  V+RV KNL
Sbjct: 639 EVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNL 698

Query: 657 RICGDCHTAMVFISEIYGREVIVRDINRFHHFKGGCCSCGDYW 680
           R+CGDCH+    IS++Y RE+IVRD  RFHHF+ G CSC D+W
Sbjct: 699 RVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of CSPI06G23760 vs. TAIR10
Match: AT3G23330.1 (AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 476.5 bits (1225), Expect = 2.8e-134
Identity = 260/673 (38.63%), Positives = 371/673 (55.13%), Query Frame = 1

Query: 43  KCSISPLTISATLQNLLQPLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQ 102
           +C       S  L + ++  ++   P    +  V +  T +  L+ G  VH  ++  G+ 
Sbjct: 78  RCFTDQSLFSKALASFVEMRASGRCPDHNVFPSVLKSCTMMMDLRFGESVHGFIVRLGMD 137

Query: 103 PTALVGSKMVAFYA-----------------------SSGD-------------IDSSVS 162
                G+ ++  YA                       +SGD             IDS   
Sbjct: 138 CDLYTGNALMNMYAKLLGMGSKISVGNVFDEMPQRTSNSGDEDVKAETCIMPFGIDSVRR 197

Query: 163 VFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVELLS 222
           VF  +     + +N++I  YA+ G  E  +     M +     D FT   VL    E + 
Sbjct: 198 VFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYVD 257

Query: 223 VWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSWNALLAG 282
           V  GK +HG ++R G+  D+Y+ +SL+ +Y K   I D+ +VF  +  RD  SWN+L+AG
Sbjct: 258 VIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAG 317

Query: 283 YMKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSRVRPNWV 342
           Y+                               Q+G   +AL LF +MV   ++V+P  V
Sbjct: 318 YV-------------------------------QNGRYNEALRLFRQMV--TAKVKPGAV 377

Query: 343 TIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDARNCFDKL 402
              SV+PACA L+TL  G+Q+H    R G  SN  +  AL  MY+KCG++  AR  FD++
Sbjct: 378 AFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAARKIFDRM 437

Query: 403 NRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSHSGLVDV 462
           N  ++  ++W  +I  +A +GHG +AVS F EM + G++P+ + F  +L+ CSH GLVD 
Sbjct: 438 NVLDE--VSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSHVGLVDE 497

Query: 463 GLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWGSLLAAC 522
              YFN M+  Y +N  +EHYA VADLLGRAG+L EA   + +M +    S+W +LL++C
Sbjct: 498 AWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWSTLLSSC 557

Query: 523 RKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQGTKKSPG 582
             H+NLE+AE  A K+F ++ EN G YVL+ NMYA  GRW+E+ KLR  ++ +G +K P 
Sbjct: 558 SVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKGLRKKPA 617

Query: 583 CSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDISEEEKEF 642
           CSWIE+  K H F+ GD SHP   +I  FL+A+ E+M+  GY  DTS VLHD+ EE K  
Sbjct: 618 CSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVDEEHKRE 677

Query: 643 NLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVRDINRFH 680
            L  HSE+LAVAFGI+NT   T +RVTKN+RIC DCH A+ FIS+I  RE+IVRD +RFH
Sbjct: 678 LLFGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVRDNSRFH 715

BLAST of CSPI06G23760 vs. TAIR10
Match: AT3G57430.1 (AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 475.7 bits (1223), Expect = 4.7e-134
Identity = 248/617 (40.19%), Positives = 368/617 (59.64%), Query Frame = 1

Query: 68  PPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRG-LQPTALVGSKMVAFYASSGDIDSSVS 127
           P   + + V    + L ML+ G ++HA+ L  G L   + VGS +V  Y +   + S   
Sbjct: 300 PDEFTISSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCNCKQVLSGRR 359

Query: 128 VFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMH-SWGFTGDYFTFPFVLKSSVELL 187
           VF+G+ +    L+N+MI  Y++    +  +  +  M  S G   +  T   V+ + V   
Sbjct: 360 VFDGMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSG 419

Query: 188 SVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSWNALLA 247
           +    + +HG +++ GL  D +V  +L+ +Y + G+I+ A ++F  M  RD+ +WN ++ 
Sbjct: 420 AFSRKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMIT 479

Query: 248 GYMKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSRV--RP 307
           GY+ S   + AL +  +M                         +L  ++ K  SRV  +P
Sbjct: 480 GYVFSEHHEDALLLLHKMQ------------------------NLERKVSKGASRVSLKP 539

Query: 308 NWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDARNCF 367
           N +T+M++LP+CA LS L +G++IH  A +  L ++ +V  AL  MYAKCG L  +R  F
Sbjct: 540 NSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVF 599

Query: 368 DKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSHSGL 427
           D++   +KN+I WN +I AY  +G+G +A+   R M+  G++P+++TF  + + CSHSG+
Sbjct: 600 DQIP--QKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVFAACSHSGM 659

Query: 428 VDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPS-IWGSL 487
           VD GL+ F  M   Y + P  +HYACV DLLGRAGR+ EA +L+  MP     +  W SL
Sbjct: 660 VDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSL 719

Query: 488 LAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQGTK 547
           L A R H NLE+ E AA+ L  LEP    +YVLL+N+Y+ AG W +  ++R  +K QG +
Sbjct: 720 LGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRRNMKEQGVR 779

Query: 548 KSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDISEE 607
           K PGCSWIE   + H F+ GD+SHPQ +++  +LE L E+M+  GY PDTS VLH++ E+
Sbjct: 780 KEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGYVPDTSCVLHNVEED 839

Query: 608 EKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVRDI 667
           EKE  L  HSEKLA+AFGILNT   T++RV KNLR+C DCH A  FIS+I  RE+I+RD+
Sbjct: 840 EKEILLCGHSEKLAIAFGILNTSPGTIIRVAKNLRVCNDCHLATKFISKIVDREIILRDV 890

Query: 668 NRFHHFKGGCCSCGDYW 680
            RFH FK G CSCGDYW
Sbjct: 900 RRFHRFKNGTCSCGDYW 890

BLAST of CSPI06G23760 vs. NCBI nr
Match: gi|449445033|ref|XP_004140278.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis sativus])

HSP 1 Score: 1379.0 bits (3568), Expect = 0.0e+00
Identity = 677/679 (99.71%), Positives = 677/679 (99.71%), Query Frame = 1

Query: 1   MHNGIRLSISIPTPSHLLFRILHSYSGSAHIDTVPPPSSPPFKCSISPLTISATLQNLLQ 60
           MHNGIRLSISIPTPSHLLFRILHSYSGSAHIDTVPPPSSPPFKCSISPLTISATLQNLLQ
Sbjct: 1   MHNGIRLSISIPTPSHLLFRILHSYSGSAHIDTVPPPSSPPFKCSISPLTISATLQNLLQ 60

Query: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120
           PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSW 240
           SVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSW
Sbjct: 181 SVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSW 240

Query: 241 NALLAGYMKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSR 300
           NALLAGY KSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDS 
Sbjct: 241 NALLAGYTKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360
           VRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR
Sbjct: 301 VRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360

Query: 361 NCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSH 420
           NCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSH
Sbjct: 361 NCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSH 420

Query: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWG 480
           SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWG
Sbjct: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWG 480

Query: 481 SLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540
           SLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG
Sbjct: 481 SLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540

Query: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDIS 600
           TKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDIS
Sbjct: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDIS 600

Query: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660
           EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660

Query: 661 DINRFHHFKGGCCSCGDYW 680
           DINRFHHFKGGCCSCGDYW
Sbjct: 661 DINRFHHFKGGCCSCGDYW 679

BLAST of CSPI06G23760 vs. NCBI nr
Match: gi|659112126|ref|XP_008456075.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis melo])

HSP 1 Score: 1332.0 bits (3446), Expect = 0.0e+00
Identity = 655/679 (96.47%), Positives = 665/679 (97.94%), Query Frame = 1

Query: 1   MHNGIRLSISIPTPSHLLFRILHSYSGSAHIDTVPPPSSPPFKCSISPLTISATLQNLLQ 60
           MHNGIRLSISIPTP+ LLFRILHSYSGSAHI+TVPPPSSP FKCSISPLTISATLQNLLQ
Sbjct: 1   MHNGIRLSISIPTPTLLLFRILHSYSGSAHIETVPPPSSPLFKCSISPLTISATLQNLLQ 60

Query: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120
           PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSW 240
           S +LLSVWMGKCVHGLILRIGL  DLYVATSLI LYGKCGEIN+AGKVFDNMTIRDVSSW
Sbjct: 181 SADLLSVWMGKCVHGLILRIGLHCDLYVATSLIDLYGKCGEINEAGKVFDNMTIRDVSSW 240

Query: 241 NALLAGYMKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSR 300
           NALLAGYMKSGC+DAA+AIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM+KEDS 
Sbjct: 241 NALLAGYMKSGCVDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMMKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360
           VRPNWVTIMSVLPACAQLSTLERG QIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR
Sbjct: 301 VRPNWVTIMSVLPACAQLSTLERGTQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360

Query: 361 NCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSH 420
           NCFDKLNR+EKNLIAWNTMITAYASYGHGL+AVSTFREMIQAGIQPDDITFTGLLSGCSH
Sbjct: 361 NCFDKLNRSEKNLIAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420

Query: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWG 480
           SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLV EMPMPAG SIWG
Sbjct: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVDEMPMPAGASIWG 480

Query: 481 SLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540
           SLLAACRKHRNLEMAE AARKLFVLEPEN+GNYVLLSNMYAEAGRWQEVDKLRAIVKSQG
Sbjct: 481 SLLAACRKHRNLEMAEIAARKLFVLEPENSGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540

Query: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDIS 600
           TKKSPGCSWIEINGKAHMFLGGDTSHPQ KEIYMFLEALPEKMKAAGY PDTSYVLHDIS
Sbjct: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600

Query: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660
           EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660

Query: 661 DINRFHHFKGGCCSCGDYW 680
           DINRFHHFKGG CSCGDYW
Sbjct: 661 DINRFHHFKGGSCSCGDYW 679

BLAST of CSPI06G23760 vs. NCBI nr
Match: gi|823203737|ref|XP_012436245.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Gossypium raimondii])

HSP 1 Score: 906.7 bits (2342), Expect = 2.4e-260
Identity = 439/653 (67.23%), Positives = 519/653 (79.48%), Query Frame = 1

Query: 29  AHIDTVPPPSSPP-FKCSI-SPLTISATLQNLLQPLSAPGPPPILSYAPVFQFLTGLNML 88
           A + T+ P   P   KC+   P   ++TL  LLQP+S   PPP LSYAP+FQFLTG N L
Sbjct: 21  AFLSTIHPHIDPSQTKCTTPKPFPYTSTLPTLLQPISDQNPPPHLSYAPLFQFLTGQNFL 80

Query: 89  KLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNGIGEPSSLLFNSMIRAY 148
           KLG Q+HAHM L GLQP A +G+KMVA YASSGD++S+V+VF  I +P+SLL+NS+IRAY
Sbjct: 81  KLGQQIHAHMTLHGLQPNAFLGAKMVAMYASSGDLESAVTVFRKIKDPTSLLYNSIIRAY 140

Query: 149 ARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVELLSVWMGKCVHGLILRIGLQFDL 208
              G+  +T+  Y  MHS    GD FTFPFVLKS   +L VWMG+CVHG  LR GL+ D 
Sbjct: 141 TNNGYPLKTIDIYREMHSLRLKGDNFTFPFVLKSCANVLDVWMGECVHGQSLRFGLELDA 200

Query: 209 YVATSLIILYGKCGEINDAGKVFDNMTIRDVSSWNALLAGYMKSGCIDAALAIFERMPWR 268
           YV TSLI  Y K GE+ DA KVFD MT+R VSSWNAL+AGYMK G I  A  +F  MP R
Sbjct: 201 YVGTSLIDFYVKVGELRDANKVFDLMTVRAVSSWNALIAGYMKEGEIRVAEDLFRGMPCR 260

Query: 269 NIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSRVRPNWVTIMSVLPACAQLSTLERGRQ 328
           NIVSWT+MISGY+Q+GLA++ALSLFDEM+KEDS V+PNWVTIMSVLPACA  ++ ERGR+
Sbjct: 261 NIVSWTSMISGYTQNGLAEEALSLFDEMLKEDSEVKPNWVTIMSVLPACAHSASFERGRR 320

Query: 329 IHELACRMGLNSNASVLIALTAMYAKCGSLVDARNCFDKLNRNEKNLIAWNTMITAYASY 388
           I+E   R+GL SN SV  AL AMYAKCGSLV AR CFD++  NEKNL AWNTMITAYAS+
Sbjct: 321 INEYVNRIGLESNPSVQTALIAMYAKCGSLVSARCCFDRILENEKNLCAWNTMITAYASH 380

Query: 389 GHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSHSGLVDVGLKYFNHMSTTYSINPRVEH 448
           G GL++VSTF  M++AG+ PD ITFTGLLSGCSHSG+V+ GL+YFN M T YS+ PR EH
Sbjct: 381 GQGLESVSTFENMVRAGVYPDAITFTGLLSGCSHSGIVEFGLRYFNSMQTKYSVEPRHEH 440

Query: 449 YACVADLLGRAGRLAEASKLVGEMPMPAGPSIWGSLLAACRKHRNLEMAETAARKLFVLE 508
           YACV DLL RAGRL EA + + ++PM  GPSIWG+LLAACRK RNLE+AE AA++LFVLE
Sbjct: 441 YACVVDLLARAGRLVEAKEFIKKIPMQPGPSIWGALLAACRKSRNLEIAEIAAKELFVLE 500

Query: 509 PENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQGTKKSPGCSWIEINGKAHMFLGGDTSH 568
           PEN+ NY+LLSNMYAEAG W+EVDKLRA +K +G KK+PGCSWIEI GKAH+FL GD SH
Sbjct: 501 PENSCNYILLSNMYAEAGMWKEVDKLRARLKCEGIKKNPGCSWIEIKGKAHLFLSGDLSH 560

Query: 569 PQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDISEEEKEFNLIAHSEKLAVAFGILNTPA 628
           PQ KEIY  LEALPEK+KAAGY P+T +VLHDISEEEKE NLI HSEKLA+AFG+LNT  
Sbjct: 561 PQSKEIYNLLEALPEKIKAAGYIPNTGFVLHDISEEEKEQNLIIHSEKLAIAFGLLNTNP 620

Query: 629 ETVLRVTKNLRICGDCHTAMVFISEIYGREVIVRDINRFHHFKGGCCSCGDYW 680
           E V+R+TKNLRICGDCHT + FIS+IY RE++VRD+NRFHHF+ G CSCGDYW
Sbjct: 621 EVVIRITKNLRICGDCHTVIKFISKIYEREIVVRDVNRFHHFRHGACSCGDYW 673

BLAST of CSPI06G23760 vs. NCBI nr
Match: gi|596016252|ref|XP_007218862.1| (hypothetical protein PRUPE_ppa002838mg [Prunus persica])

HSP 1 Score: 902.9 bits (2332), Expect = 3.4e-259
Identity = 431/625 (68.96%), Positives = 509/625 (81.44%), Query Frame = 1

Query: 56  QNLLQPLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFY 115
           + LL+ L A  P  I  YAP+FQ LT  N+LKLG QVHA M LRGL+P A +G+KMVA Y
Sbjct: 4   RTLLKSLLAQDPTCISFYAPIFQSLTSQNLLKLGQQVHAQMALRGLEPNAFLGAKMVAMY 63

Query: 116 ASSGDIDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFP 175
           ASS ++DS+V++F+ +  PS+LL+NS+IRAY  YG++E+T+  Y  MH  G  GD FT+P
Sbjct: 64  ASSDNLDSAVNIFHRVNNPSTLLYNSIIRAYTLYGYSEKTMEIYGQMHRLGLKGDNFTYP 123

Query: 176 FVLKSSVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIR 235
           FVLK    L S+W+GKCVH L LRIGL  D+YV TSLI +Y KCGE++DA   FD MT+R
Sbjct: 124 FVLKCCANLSSIWLGKCVHSLSLRIGLASDMYVGTSLIDMYVKCGEMSDARSSFDKMTVR 183

Query: 236 DVSSWNALLAGYMKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMV 295
           DVSSWNAL+AGYMK G I  A  +F RMP +NIVSWT MISGY+Q+GLA+QAL LFDEM+
Sbjct: 184 DVSSWNALIAGYMKDGEICFAEDLFRRMPCKNIVSWTAMISGYTQNGLAEQALVLFDEML 243

Query: 296 KEDSRVRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGS 355
           ++DS V+PNWVTIMSVLPACA  + LERGRQIH  A R GL+SN S+  AL AMYAKCGS
Sbjct: 244 RKDSEVKPNWVTIMSVLPACAHSAALERGRQIHNFASRTGLDSNTSIQTALLAMYAKCGS 303

Query: 356 LVDARNCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLL 415
           L DAR CF+++++ E +L+AWNTMITAYAS+G G +AVSTF +MI AG+QPD+ITFTGLL
Sbjct: 304 LSDARQCFERVHQTENSLVAWNTMITAYASHGRGSEAVSTFEDMIGAGLQPDNITFTGLL 363

Query: 416 SGCSHSGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAG 475
           SGCSHSGLVD GLKYFN M T YSI PRVEHYACV DLLGRAGRL EA  LV +MPM AG
Sbjct: 364 SGCSHSGLVDGGLKYFNCMKTIYSIEPRVEHYACVVDLLGRAGRLVEAIDLVSKMPMQAG 423

Query: 476 PSIWGSLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAI 535
           PSIWG+LL+ACRKH NLE+AE AARKLF+LEP+N+GNYVLLSN+YA+AG W+EVD LRA+
Sbjct: 424 PSIWGALLSACRKHHNLEIAEIAARKLFILEPDNSGNYVLLSNIYADAGMWKEVDDLRAL 483

Query: 536 VKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIY-MFLEALPEKMKAAGYFPDTSY 595
           +KSQG KK+PGCSWIE+NGKAH+FLGGDT HPQ KEIY + LE LP K+KAAGY PDTS+
Sbjct: 484 LKSQGMKKNPGCSWIEVNGKAHLFLGGDTCHPQAKEIYEVLLEELPNKIKAAGYVPDTSF 543

Query: 596 VLHDISEEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYG 655
           VLHD+SEEEKE NL  HSEKLA+AFG+LN     VLRVTKNLRICGDCHTA   IS IY 
Sbjct: 544 VLHDVSEEEKEHNLTTHSEKLAIAFGLLNASPGVVLRVTKNLRICGDCHTATKLISRIYE 603

Query: 656 REVIVRDINRFHHFKGGCCSCGDYW 680
           RE+IVRD+NRFHHF+ GCCSCGDYW
Sbjct: 604 REIIVRDLNRFHHFRDGCCSCGDYW 628

BLAST of CSPI06G23760 vs. NCBI nr
Match: gi|720077886|ref|XP_010241184.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g49142 [Nelumbo nucifera])

HSP 1 Score: 899.0 bits (2322), Expect = 4.9e-258
Identity = 426/626 (68.05%), Positives = 513/626 (81.95%), Query Frame = 1

Query: 54  TLQNLLQPLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVA 113
           +L+ LL+P+    PP I+SYAP+FQFLTG + LKLG QVHAHM LRGLQP A +G+KMVA
Sbjct: 17  SLRILLEPIKQ-NPPQIVSYAPIFQFLTGTHSLKLGKQVHAHMTLRGLQPNAFLGAKMVA 76

Query: 114 FYASSGDIDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFT 173
            YASSGDIDS+ +VF+ +  PSSLL+NS+IR Y R+G+ ERT+ TYF M+S G   DYFT
Sbjct: 77  MYASSGDIDSAETVFDQVSFPSSLLYNSIIRGYTRFGYYERTLKTYFIMNSQGLRPDYFT 136

Query: 174 FPFVLKSSVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMT 233
           FPFVLKSS EL  +  GKCVHG  LRIGL++DLYV TSLI +Y KCGE+++A K+FD M 
Sbjct: 137 FPFVLKSSAELSCLRTGKCVHGKSLRIGLEYDLYVGTSLIDMYVKCGELSNAHKLFDRMH 196

Query: 234 IRDVSSWNALLAGYMKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDE 293
           ++DVSSWNAL+AGYM++G I  A A+F+ MP RNI+SWT MISGY+QSGLA +ALSLF E
Sbjct: 197 VKDVSSWNALIAGYMRNGVIQIAEALFQSMPKRNIISWTAMISGYTQSGLADRALSLFGE 256

Query: 294 MVKEDSRVRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKC 353
           M++ DS V+PNWVTIMSVLPACA  + LE G++IH  A  +GL+ + SV  AL AMYAKC
Sbjct: 257 MLRVDSEVKPNWVTIMSVLPACAHSAALEYGKKIHSYASEIGLDKSFSVQTALIAMYAKC 316

Query: 354 GSLVDARNCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTG 413
           GSL+DA +CF+++   EK+LI WNTMI AYAS+G G +AVSTFR MI+ G+QPD ITF G
Sbjct: 317 GSLIDACHCFERIPEKEKSLITWNTMIAAYASHGCGKEAVSTFRNMIKCGVQPDAITFLG 376

Query: 414 LLSGCSHSGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMP 473
           LLS CSHSGLVDVGL+YFN M+  YS++PR EHYACV DLL RAGR+ EA +L+  MPM 
Sbjct: 377 LLSSCSHSGLVDVGLEYFNCMTRIYSVDPRAEHYACVVDLLARAGRIVEAKELIDRMPMQ 436

Query: 474 AGPSIWGSLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLR 533
           A PSIWG+LLAACR H NLE+ E AA++LF+LEPEN+GNY+LLSNMYAE GRW+EV+ LR
Sbjct: 437 ASPSIWGALLAACRNHGNLEIGEIAAKQLFILEPENSGNYILLSNMYAEVGRWEEVNNLR 496

Query: 534 AIVKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTS 593
           A++K+QG KKSPGCSW EINGK H+FLGGDTSHPQ KEIYM L  LP+K+KAAGY PDTS
Sbjct: 497 ALLKNQGVKKSPGCSWTEINGKCHLFLGGDTSHPQMKEIYMLLGDLPKKIKAAGYIPDTS 556

Query: 594 YVLHDISEEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIY 653
           +VLHD+SEEEKE NL  HSEKLA+AFG+LNT   TV+ VTKNLRICGDCHTA+ FIS IY
Sbjct: 557 FVLHDVSEEEKEHNLTMHSEKLAIAFGLLNTSPATVIXVTKNLRICGDCHTAIKFISRIY 616

Query: 654 GREVIVRDINRFHHFKGGCCSCGDYW 680
           GRE++VRD+NRFHHFK G CSCGDYW
Sbjct: 617 GREIVVRDVNRFHHFKDGSCSCGDYW 641

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PP271_ARATH	6.2e-136	41.27	Putative pentatricopeptide repeat-containing protein At3g49142 OS=Arabidopsis th...
PPR53_ARATH	7.6e-134	37.86	Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN...
PP175_ARATH	1.7e-133	38.88	Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP251_ARATH	4.9e-133	38.63	Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...
PP285_ARATH	8.4e-133	40.19	Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop...

Match Name	E-value	Identity	Description
A0A0A0KEZ1_CUCSA	0.0e+00	99.71	Uncharacterized protein OS=Cucumis sativus GN=Csa_6G430650 PE=4 SV=1
M5X3I7_PRUPE	2.4e-259	68.96	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002838mg PE=4 SV=1
A0A0D2SZE2_GOSRA	5.9e-250	65.24	Uncharacterized protein OS=Gossypium raimondii GN=B456_008G028200 PE=4 SV=1
K4B1Y4_SOLLC	1.9e-248	66.24	Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1
W9QT12_9ROSA	1.8e-243	63.44	Uncharacterized protein OS=Morus notabilis GN=L484_002732 PE=4 SV=1

Match Name	E-value	Identity	Description
AT3G49142.1	3.5e-137	41.27	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G20230.1	4.3e-135	37.86	Pentatricopeptide repeat (PPR) superfamily protein
AT2G29760.1	9.6e-135	38.88	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G23330.1	2.8e-134	38.63	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G57430.1	4.7e-134	40.19	Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name	E-value	Identity	Description
gi|449445033|ref|XP_004140278.1|	0.0e+00	99.71	PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis s...
gi|659112126|ref|XP_008456075.1|	0.0e+00	96.47	PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis m...
gi|823203737|ref|XP_012436245.1|	2.4e-260	67.23	PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Gossypium...
gi|596016252|ref|XP_007218862.1|	3.4e-259	68.96	hypothetical protein PRUPE_ppa002838mg [Prunus persica]
gi|720077886|ref|XP_010241184.1|	4.9e-258	68.05	PREDICTED: putative pentatricopeptide repeat-containing protein At3g49142 [Nelum...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR002885	Pentatricopeptide_repeat
IPR011990	TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term	Definition
GO:0008270	zinc ion binding
GO:0005515	protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI06G23760.1	CSPI06G23760.1	mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 210..233 score: 0.082; coord: 139..167 score:
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 267..316 score: 3.2E-10; coord: 371..419 score: 1.7
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756	coord: 375..407 score: 2.1E-6; coord: 239..263 score: 6.3E-5; coord: 269..298 score: 1.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR	coord: 205..235 score: 7.103; coord: 267..301 score: 11.312; coord: 236..266 score: 9.854; coord: 304..338 score: 5.601; coord: 509..543 score: 6.84; coord: 339..369 score: 5.897; coord: 372..406 score: 11.192; coord: 135..169 score: 8.046; coord: 443..477 score: 6.095; coord: 407..437 score: 7.509; coord: 69..103 score: 5
IPR011990	Tetratricopeptide-like helical domain	GENE3D	G3DSA:1.25.40.10		coord: 238..299 score: 4.4E-11; coord: 338..408 score: 4.4E-11; coord: 472..528 score: 4.4
IPR011990	Tetratricopeptide-like helical domain	unknown	SSF48452	TPR-like	coord: 267..307 score: 2.32E-7; coord: 346..529 score: 2.3
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 69..550 score:
None	No IPR available	PANTHER	PTHR24015:SF728	SUBFAMILY NOT NAMED	coord: 69..550 score:
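
The Alignment column above packs `coord: start..end` and `score:` pairs into a single cell, and several scores are cut off in this record. As a hedged illustration of how such a cell could be split back into pairs, here is a small Python sketch. The function name `parse_alignment_cell` is hypothetical, the score is kept as text rather than converted to a number because some values are truncated, and the span length assumes the coordinates are inclusive residue positions.

```python
import re

# One "coord: a..b  score: x" pair; the score may be missing (truncated in the record).
PAIR = re.compile(r"coord:\s*(\d+)\.\.(\d+)[\s,;]*score:\s*([0-9.eE+-]*)")


def parse_alignment_cell(cell: str):
    """Return a list of (start, end, score_text_or_None) tuples from one cell."""
    hits = []
    for start, end, score in PAIR.findall(cell):
        hits.append((int(start), int(end), score or None))
    return hits


# The PF01535 cell as it appears in the record, including the truncated score.
cell = "coord: 210..233\nscore: 0.082coord: 139..167\nscore:"
for start, end, score in parse_alignment_cell(cell):
    # Span length assumes inclusive coordinates (an assumption, not stated in the record).
    print(start, end, end - start + 1, score)
```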

The following gene(s) are orthologous to this gene:
Gene	Orthologue	Organism	Block
CSPI06G23760	Csa6G430650	Cucumber (Chinese Long) v2	cpicuB313
CSPI06G23760	CsaV3_6G038880	Cucumber (Chinese Long) v3	cpicucB363
CSPI06G23760	Cucsa.086910	Cucumber (Gy14) v1	cgycpiB092
CSPI06G23760	CsGy6G023110	Cucumber (Gy14) v2	cgybcpiB287
The following gene(s) are paralogous to this gene:

None