CSPI01G31950 (gene) Wild cucumber (PI 183967)

NameCSPI01G31950
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionPectin lyase-like superfamily protein
LocationChr1 : 26647898 .. 26654864 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCAAAATTTCCAACCACCCATCATTTCAACCAACATTCATCCTCCTCGCTCTCATCCTTCTGATCTCCCATACAATAAGAAGATCGCACTCGGCGAATTTGTTGAACATTTTGGATTTTGGTGCCATACGTGGTTATGATTCGAGCCAAGCCATTCGTCGTGCATGGGCAGTGGCCTGCAAGTCTGAAGAATCTACCATTGTCTACATTCCGAAGGGAAGGTTTCTTGTCCAACCAATGGAATTCCATGGTGGAGGGTGTCATAATGAAGATATTAGTTTCCATATTGATGGAGCTCTCATCGGGCCTCCTGACTACCGAATCCTTGGTAATGTTGAAAGTTGGCTAAGCTTTGTAGAGGTTAATGGCGTTTCTTTGACTGGTGGAGTTCTTGATGCCAATGGTGAAGCCTTGTGGTCTTGTAAATTCTCCACTACCCACTGCCCCGTTGGAGCCCGGGTAACTACTTCTGCAACTATCTTTCTTATTCCCTTTGGAGACTTTAAATTTTAGGACCCTCTTTTTTTATTAATAAAAAAACACTAAAATACAAAAATTCAATTTTTTATGATCACTTTTTTGGCCTAAAATAATTTATGTATTACAAATTTAGTATTAAAAGAAGTTGTACATAGTTTTTTTTTTTATATATAAATATAGCTAAAACAATTAAAAATAAAATAGATAAATTATGACAATTTGGGAATACACAAAGATTTATATCTTAAAAGGAATACACACCTTTTTTTTTTTATTTTAAAAAATGGTCATTTAATCGCACATTTATTTGTAAATTGAGAATTTGGGAACATTACCCAAATACTAATAATTTTGATTGTTATAAATACTATGTAAATATAATTCACTTTGTCGTACTTTTTTTTTCAAATAGACAAATTCAAAGTGTGTTTGGTTGAGATTCTAAAGTGTTTAATAAAAAAAACTATTTTAAAAAAATGGTAGTGTTTGAAAAATTAAAACAATGCATTTTTTAAAAATTCATTTATTAAACAATCTATTTTGGAGAAATCTTTAAAAACCAAATTCACATATATACACGTTTATAAAAAATGTATAACCAAATTCACATATATACACACGTATAAATTTTAAACTCATTTCCAAAAAGTGTTTAAAATGAAATGTATTATTGAAAAACATTTTGTTTAATTCAATCCAAACGAACCTTAATCTATATATAATAAATCTTAAATATATATCTTTAAAGGATCTTAGTACTAAAAATGCGGAAAAAAATATCGTAAACAATAGAATTGATAAAAGATATTTATAAAATATAGTAAAATTCTAGATTTTATCAATCATGAATAATATGTTGGCTATCAATGTCTATTAGTGCACTGATAAACATCGATAGCAAGTTTTGTTTATCATTGATAAAATATAAATTTTTGCTATATATTATAATTTGAAAATATAGTTAATCAAAACACATTTACCCTAAAAAAATGTTTTATTTTATGATGTAGACTCTAAGTTTTAGGGACTCAAAGAATATAAGGATTAGAGGGTTGGTGTCTAGAAACAGCCAATTATTCCATATAGTGATCAATGGATGTAAGAATGTACTAGTAGAAGAAGTGAACGTCATTGCGGCAAGTAATAGTCCAAACACTGATGGCATTCACGTGGAAACATCCACGCATGTGACCATCATTGACTCTACCATTCAAACGGGAGATGATTGTATTTCTATCGGTCCTGGATCCTACAATCTTTGGATTCAACGCATTCGATGTGGACCTGGTCATGGCATTAGGTACTTTTATATCTTCAATCTTTCTACTCTTTCAACTATGACTTTTTAGATCATACAAAACATACTCTTGATTTATGTCATTGATTTTATATTTTTTTTTAGTTATTAAAATTTTAGTATTTTTTAGTTCAACCCTGGTTAGATTAGGAGATCAAGTCATCACTCTTGAACAACAATGGGATGCTCATTATCATTATTATCATTCTTTAGGGTTAGCTTAATTTTCTTTCCTTTTTTTTTTTTACTATATAATGTGTAGCAATTTAAGAGATAACAGTCATTTTTTAAAATTGCAAATATAACGATTGCAATATTTGCAATAGTCTATCAGTCATAAACTTTTATCGTTGATAGACTCTTGAGATCTATCAGCAATAGACTAATATTTGCAATATGGGTTTATCCACTATAGGTTTTGTATCATTGATAGATTATAACAGATTTTGTTATATTTGATTTTTTTTTGAAAATGTTGCTACTATATGTTGTTATATTTGCAATTGTCTCCGACTTTTTTAACTAAGCTATTGTTCATAATTATTTTATGTATTGTTTCATTGTCATGGAAAGCTTAGTAACAAAATTGTTTAGATTTGGCAACAATCGTGTACGAAAAAAATCTTTAAAAACAAACCGTTTAAGACTTATTCCAAATATAAACGATTGTGTAGCCAAATCTAAACTACCCAAAATCTTGAAAAAAAAAATTATTCTAAATTTAAACGATCTTATAGCCAAATTTAAACGATAGTTAAATTTATAGATACCAAAATCATGTATTAAATATATTAGGCGCATTAATGACATGTTGATGGAGGCATTTTTGGTATTTTTTTTATTGTGGGTGTGTAGACTTTTTCTATTTTCAAAATTGTTCTATACATTATAAATATTTTGACAATTGATGGAAATCACTGAAAAATAGTTGTTCGTAGTTGGGGGTAATGGTCACCAAACAACGGTCGATGAAAGTATCTTGATAACGATAACCAAAAGATGGTGGACAAAAGTCAACAAACAATGGTCACATAAAGTTAACTGACAATAGTCACTGAACAACAGTTGACAAAATTTGGTCGACAACAATCACCAAAACATGATCATATGTAGTTGCCCAACAAATGCTATAGAGAAAATGATCAGTGAAAGTTGGCCAACAATAGGCACAAAAAATAGTCGACAAAAGTTTATTGACAACTTTCATTGAAAATCAGTTGACAAAAGTTAGATGACAATGCACCAAACAATGATCGAAAAAAGTTGACAAAAGTTTATTATATTCACTAGAAAACAATGAATGAAAGTTGACAAGAAACTATCATATAAAATAGTTAACAAGTTGCCAGACAATGGTCACTGAACAACATTTGACAACAATGACTAAGAAACGATTTTCCGTAGTTGACCAACAACTATTATAAAAAAAACAGCCGATGAAAGTTGGCCAACAACAATTACCAAAAATGATCGACCAAAGTTGACCAACAACGATGATCAAAATTTGGACGACAACGATCATCCAACAACAGTCCACAAAAGTTGGCTAACAATGATCACCAAATTACTATCAACTAAAGTGGGGTAAAAAATGGACACCGAGTAACAATCGTCGATAGTTGTCTAGCTTGAAAAACAGTCAAAGAAAGTTTACCGACAACAGTCTACACAAATGATTGATGAAAATTGATCGAAAAATAGTCGATAAAAGTTGATAGACCACGAACACTTAAATAAAAATCGATAAAGGTTAATCGATAACAGTTGCGAGAAAATAGTACATGAAAATAATCGTCAACAGTAAAAAGAAAATGGTTGCAAAAAAATGAAGTCTGAAGGTTCGTTGACTACGATCAACAGTGTAAGTTGGTGCACCGACGAAGATCACTAGATAACGATCATAAAAAATCTAGTCAAAGTTGCTTGGCAAAGATTGTCGAAAAATGATGATACCGCATTGCGATGAAGTTTGGTCACTACAAGTTGACAGGTGGTGGACATCAAAAATGTTAGTTGAAGATTGATCGACGATGGTTGTCGAAGGTTGGTGGACAATAGTTTCCAAAAAAACAATATAGATGAAAGTTGATCGTCAAAGATTGATGAAAGCATATAAATGGTTACCGAAGGTTCGTTGGCGATGATCAATGTTGTGGTAGGTACTTGTTCTATGAAGATCCCCAAAACATCGTCACCCGAAGTTTGGTTAGCGACAATTGTTGAAGATTAGTCAGAGGAGGTTGTGGGAAAGACGGTAGACAAAAACTGGTTGTCGATGATTTCATGAAAAACGATGATTGGAGAATCACAAAAGGTGTGACTAACGACAATTGGTATAGGTCATTGTGTCAAAGGTGACTAGCAAGAAGAGTCAGAAGTGTGATAACTCAAGTTTTTCATATAAAGTAAGAAGATTGACCATTAAACATTTAGATTTCATTTTTTAAAAAATTGATCTTTAATATTTATCCTAAACCACATTATTTTATAAACACATTTTTTCAAAATCGATTTTAGGTAATAATCAAACACAATCATTTTTTTTTTTCAAAACAACTCATCTTTTACTTTAATCACAAAAATACTATTCAAATACATTCTTAAATTATTTCTTATTTCTTCGGGTTAGGTCTAAACTTTTTCCATATATAGTCTTTAACCAATGTCTGACCTCATGGAAAGAATTAAATATCTTAATTGTTGAACTATTTAATTCGTCTCCATAAAAATTAATTTTGTTTTGATTTTCAACATTTTTTTTTGTTGGTCATCATAACACAAATTTTGGTTTCATTTTCAGAAACCCTATAACCTAACCATCATACATTAGGCTTCAAACATTTGTCCACATCAGCACCTTACACATCCTATACATAAAACAAAAGAGATCCCATGCATGGGTTTCCTTATGTTAAATACTTTATTGTAAAATATTTATCTTAGATACTTTAATGTGCCCACTTCTTATCCATTCTCTACCCATATTTTAAAAAGTCAAGAAAAAATTTATAAACTAAAACATAAATAGTTTTTGAAAATTTGTTTCCTTTAATCTTGTATTTAAGAAAGATAAGAAATGTGAGGATTAATTTTTTTTAAGAAATTAAAGGATTGTTAAATGGGTTTAAAATATTCCATACCCTCCATCAAAGTTTTAAAATAAAACAAAATATTTGAATTTTAAGACTAAAATAAAATAAGTCAAAAGTGTTAGCAACAATAGTGTTAGGCAATCATGGATCCCTAAAAATCGCTAAAATTTTGAGCTTACTAACTCTATTCATTAGAATTTTAAACTTTGATCAGTATATTAATTTCTACTATTCATATTATATTTTATAAGAATAGTATAAGATATATAATTTATCAATTAAACTCCTCAACCTTTATAAGTGAATCAATTAAAGCGTTCGGCTAGGATTTTCTTTAAAAATTATCTATGTAATTTGCTAACCCAACCATTAAAAAAGTCTATACATGTATAAAAATTGATACATAAGCGATTTTCAAATTAAAGTCCTCTAAAGGAAATATGTAGATTAAAAATCTAATTTAAAATTTTGGGACAATACTTGTGAAGAATATGAGATAAAGAAACATAGATAGAAAATGTAGTTGAGAGTTTATAGTTTATAGAGAAATGATCGTTTTCTCCCCATTATGTGTTCAGCATAGGAAGCTTGGCACACAACATGAACGAGCCTGGGGTAGGAAACATCACCGTTGCCAATGCAATATTTTACGGTACTCAAAATGGCCTGAGAATAAAGTCATGGGCCAGACCCAGCACTGGCTTTGTGTACGGAGTTCAATTTCTTGGAGCCACCATGCATAATGTCCAAAACCCAATTCTCATTGATCAACACTACTGCCCCAACAACTTTGACTGTCCTGATCAGGTATCAGTCAAATATCTATCAAAATTTATCACTAATAGATGATATAACAAATACTGATTTATCACTCATAAAACATAAGAGTCTATTAGTGATATACTTTACTATATTTGTAATTTTTTTAAAATATTGCTACACAGTTAGGGTGTGTTTGGTTGAGATTATAAAGTGTTTAGTAGTGTTTGGCAAATTAAAATAATGCATTTTCTAAAAATTAATTTATAAAATAATCTATAAAAACGAAACATTAAGTGTTTAAATTTTAAACTCATTTCTGAGAAAGTGTTTAAAAGAAAATGTATTATTGAAAAACATTTTGTTCTTGATTCAATCTAAACTTATCCTTAATAATTATTCTAAAAATTGCTACTTATTATGATTACCATTTTTTTTCTAAACAATGTCCAATGTGTTTAACACAACAATGGTAAAATTTAGGTCGGACTTCTCATCTATAAAGTAGAATGTCATATCAATTGCTACTAAACTAAATTCACTTTGACAACAAATATCTAACCAATAATTAATAATGTATCGCCTATCAAAAACAAATTATTCATATTTTGATTCAAATAAGAAAAAAAAGTTAACTAACATTTTAAAATATGGTTTTTTTAATTTTAATATGTAAATTTGTTTAAGGGATATTTTTAAAAATAGCAAAATAATTTAAAATATTTAGAAGTTATAACAAAATTTTGAATTACTTCAATGATAGTCTTCTATCACTGTAGTAAAACTACTAATGATAGAATTTGCTATAGATTCGTAAATATTTTATAAATTTTGCTATTCTCAACTTCATGTTTAATGTTTTTTAAAATATATCATTTGTTTATTTTGCAAAAGTGAAGTAACAATTGACATCAAGTGCTCTTGGAGGTTGGGCCAAATAAAACATCCAACTCATCTTATTTTTTTTAGTATCTAACAATTTTCCAATAGTTATTTTACTTTATTTACAAGTTTTAAGCTTTGAAGGCAGAATAATATAATTTGGATGAGCTTATTGTTGTTTGAGCAATATCCACACACTTTCTTAAAATATTTAACTATGATTTGATAACAGGAATCAGGAATCAAGATAAGTAATATAATCTATAAAGACATTGTAGGAACTTCAGCAACACCAATTGCCATAAAATTTGATTGTAGTTCAAAAAACCCTTGCAATGGAATAAGACTTGAAGATGTTCGCTTGACATACCAAAATGAAGAAGCTAAATCCTCTTGTGAATATGCAAAGGGGAAAACATTAGGTTTAGTTCAACCAGAAGGTTGTTTTGAATGA

mRNA sequence

ATGGCCAAAATTTCCAACCACCCATCATTTCAACCAACATTCATCCTCCTCGCTCTCATCCTTCTGATCTCCCATACAATAAGAAGATCGCACTCGGCGAATTTGTTGAACATTTTGGATTTTGGTGCCATACGTGGTTATGATTCGAGCCAAGCCATTCGTCGTGCATGGGCAGTGGCCTGCAAGTCTGAAGAATCTACCATTGTCTACATTCCGAAGGGAAGGTTTCTTGTCCAACCAATGGAATTCCATGGTGGAGGGTGTCATAATGAAGATATTAGTTTCCATATTGATGGAGCTCTCATCGGGCCTCCTGACTACCGAATCCTTGGTAATGTTGAAAGTTGGCTAAGCTTTGTAGAGGTTAATGGCGTTTCTTTGACTGGTGGAGTTCTTGATGCCAATGGTGAAGCCTTGTGGTCTTGTAAATTCTCCACTACCCACTGCCCCGTTGGAGCCCGGACTCTAAGTTTTAGGGACTCAAAGAATATAAGGATTAGAGGGTTGGTGTCTAGAAACAGCCAATTATTCCATATAGTGATCAATGGATGTAAGAATGTACTAGTAGAAGAAGTGAACGTCATTGCGGCAAGTAATAGTCCAAACACTGATGGCATTCACGTGGAAACATCCACGCATGTGACCATCATTGACTCTACCATTCAAACGGGAGATGATTGTATTTCTATCGGTCCTGGATCCTACAATCTTTGGATTCAACGCATTCGATGTGGACCTGGTCATGGCATTAGCATAGGAAGCTTGGCACACAACATGAACGAGCCTGGGGTAGGAAACATCACCGTTGCCAATGCAATATTTTACGGTACTCAAAATGGCCTGAGAATAAAGTCATGGGCCAGACCCAGCACTGGCTTTGTGTACGGAGTTCAATTTCTTGGAGCCACCATGCATAATGTCCAAAACCCAATTCTCATTGATCAACACTACTGCCCCAACAACTTTGACTGTCCTGATCAGGAATCAGGAATCAAGATAAGTAATATAATCTATAAAGACATTGTAGGAACTTCAGCAACACCAATTGCCATAAAATTTGATTGTAGTTCAAAAAACCCTTGCAATGGAATAAGACTTGAAGATGTTCGCTTGACATACCAAAATGAAGAAGCTAAATCCTCTTGTGAATATGCAAAGGGGAAAACATTAGGTTTAGTTCAACCAGAAGGTTGTTTTGAATGA

Coding sequence (CDS)

ATGGCCAAAATTTCCAACCACCCATCATTTCAACCAACATTCATCCTCCTCGCTCTCATCCTTCTGATCTCCCATACAATAAGAAGATCGCACTCGGCGAATTTGTTGAACATTTTGGATTTTGGTGCCATACGTGGTTATGATTCGAGCCAAGCCATTCGTCGTGCATGGGCAGTGGCCTGCAAGTCTGAAGAATCTACCATTGTCTACATTCCGAAGGGAAGGTTTCTTGTCCAACCAATGGAATTCCATGGTGGAGGGTGTCATAATGAAGATATTAGTTTCCATATTGATGGAGCTCTCATCGGGCCTCCTGACTACCGAATCCTTGGTAATGTTGAAAGTTGGCTAAGCTTTGTAGAGGTTAATGGCGTTTCTTTGACTGGTGGAGTTCTTGATGCCAATGGTGAAGCCTTGTGGTCTTGTAAATTCTCCACTACCCACTGCCCCGTTGGAGCCCGGACTCTAAGTTTTAGGGACTCAAAGAATATAAGGATTAGAGGGTTGGTGTCTAGAAACAGCCAATTATTCCATATAGTGATCAATGGATGTAAGAATGTACTAGTAGAAGAAGTGAACGTCATTGCGGCAAGTAATAGTCCAAACACTGATGGCATTCACGTGGAAACATCCACGCATGTGACCATCATTGACTCTACCATTCAAACGGGAGATGATTGTATTTCTATCGGTCCTGGATCCTACAATCTTTGGATTCAACGCATTCGATGTGGACCTGGTCATGGCATTAGCATAGGAAGCTTGGCACACAACATGAACGAGCCTGGGGTAGGAAACATCACCGTTGCCAATGCAATATTTTACGGTACTCAAAATGGCCTGAGAATAAAGTCATGGGCCAGACCCAGCACTGGCTTTGTGTACGGAGTTCAATTTCTTGGAGCCACCATGCATAATGTCCAAAACCCAATTCTCATTGATCAACACTACTGCCCCAACAACTTTGACTGTCCTGATCAGGAATCAGGAATCAAGATAAGTAATATAATCTATAAAGACATTGTAGGAACTTCAGCAACACCAATTGCCATAAAATTTGATTGTAGTTCAAAAAACCCTTGCAATGGAATAAGACTTGAAGATGTTCGCTTGACATACCAAAATGAAGAAGCTAAATCCTCTTGTGAATATGCAAAGGGGAAAACATTAGGTTTAGTTCAACCAGAAGGTTGTTTTGAATGA
BLAST of CSPI01G31950 vs. Swiss-Prot
Match: PGLR_PRUPE (Polygalacturonase OS=Prunus persica PE=2 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 1.0e-117
Identity = 199/352 (56.53%), Postives = 252/352 (71.59%), Query Frame = 1

Query: 48  DSSQAIRRAWAVACKSEESTIVYIPKGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDY 107
           DS++A   AWA AC S    ++Y+P G F ++ + F  G C N  I+F I G L+ P DY
Sbjct: 42  DSTKAFLSAWAKACASMNPGVIYVPAGTFFLRDVVF-SGPCKNNAITFRIAGTLVAPSDY 101

Query: 108 RILGNVESWLSFVEVNGVSLTGGVLDANGEALWSCKFSTTH-CPVGARTLSFRDSKNIRI 167
           R++GN  +W+ F  VNGV+++GG+LD  G ALW+CK      CP GA TL F DS NI +
Sbjct: 102 RVIGNAANWIFFHHVNGVTISGGILDGQGTALWACKACHGESCPSGATTLGFSDSNNIVV 161

Query: 168 RGLVSRNSQLFHIVINGCKNVLVEEVNVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDD 227
            GL S NSQ+FHIVIN  +NV ++ V V  + NSPNTDGIHV+ S+ VTI++S I TGDD
Sbjct: 162 SGLASLNSQMFHIVINDFQNVQMQGVRVSRSGNSPNTDGIHVQMSSGVTILNSKIATGDD 221

Query: 228 CISIGPGSYNLWIQRIRCGPGHGISIGSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSW 287
           C+SIGPG+ NLWI+ + CGPGHGISIGSL     E GV N+TV    F GTQNGLRIKSW
Sbjct: 222 CVSIGPGTSNLWIEGVACGPGHGISIGSLGKEQEEAGVQNVTVKTVTFSGTQNGLRIKSW 281

Query: 288 ARPSTGFVYGVQFLGATMHNVQNPILIDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSA 347
            RPSTGF   + F  ATM NV+NPI+IDQHYCP+N  CP Q SG++IS++ Y+DI GTSA
Sbjct: 282 GRPSTGFARNILFQHATMVNVENPIVIDQHYCPDNKGCPGQVSGVQISDVTYEDIHGTSA 341

Query: 348 TPIAIKFDCSSKNPCNGIRLEDVRLTYQNEEAKSSCEYAKGKTLGLVQPEGC 399
           T +A+KFDCS K+PC  I+LEDV+LTY+N+ A+SSC +A G T G+VQP  C
Sbjct: 342 TEVAVKFDCSPKHPCREIKLEDVKLTYKNQAAESSCSHADGTTEGVVQPTSC 392

BLAST of CSPI01G31950 vs. Swiss-Prot
Match: PGLR6_ARATH (Probable polygalacturonase At2g43860 OS=Arabidopsis thaliana GN=At2g43860 PE=2 SV=1)

HSP 1 Score: 369.4 bits (947), Expect = 5.0e-101
Identity = 189/385 (49.09%), Postives = 246/385 (63.90%), Query Frame = 1

Query: 19  LILLISHTIRRSHSANLLNILDFGAIR--GYDSSQAIRRAWAVACKSEESTIVYIPKGRF 78
           LI  ++H I      + LN+L +GA      DS++A   AW VAC S   T + +PKGRF
Sbjct: 21  LISSLAHPI-----PSTLNVLSYGAKPDGSKDSTKAFLAAWDVACASANPTTIIVPKGRF 80

Query: 79  LVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGGVLDANG 138
           LV  + FHG  C    IS  I G+++ P D+RI+ + + W+ F +V  VS+ GG+LDA G
Sbjct: 81  LVGNLVFHGNECKQAPISIRIAGSIVAPEDFRIIASSKHWIWFEDVTDVSIYGGILDAQG 140

Query: 139 EALWSCKFSTTH-CPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEEVNVI 198
            +LW CK +  H CP GA++L F  S NI+I GL S NSQ FHIVI+   NV ++ V V 
Sbjct: 141 TSLWKCKNNGGHNCPTGAKSLVFSGSNNIKISGLTSINSQKFHIVIDNSNNVNIDGVKVS 200

Query: 199 AASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGISIGSL 258
           A  NSPNTDGIHVE+S  V I +S I TGDDCISIGPGS N++IQ IRCGPGHGISIGSL
Sbjct: 201 ADENSPNTDGIHVESSHSVHITNSRIGTGDDCISIGPGSTNVFIQTIRCGPGHGISIGSL 260

Query: 259 AHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPILIDQ 318
                E GV N+TV+N  F GT NG+RIK+W + S  F   + F    M  V+NPI+IDQ
Sbjct: 261 GRAEEEQGVDNVTVSNVDFMGTNNGVRIKTWGKDSNSFARNIVFQHINMKMVKNPIIIDQ 320

Query: 319 HYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRLTYQN 378
           HYC +   CP QESG+K+SN+ Y+DI GTS T +A+  DCS + PC GI ++DV L   +
Sbjct: 321 HYCLHK-PCPKQESGVKVSNVRYEDIHGTSNTEVAVLLDCSKEKPCTGIVMDDVNLVSVH 380

Query: 379 EEAKSSCEYAKGKTLGLVQPEGCFE 401
             A++SC+ A G    +V    C +
Sbjct: 381 RPAQASCDNANGSANDVVPFTPCLK 399

BLAST of CSPI01G31950 vs. Swiss-Prot
Match: PGLR2_PLAAC (Exopolygalacturonase (Fragment) OS=Platanus acerifolia GN=plaa2 PE=1 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 3.5e-70
Identity = 143/361 (39.61%), Postives = 204/361 (56.51%), Query Frame = 1

Query: 32  SANLLNILDFGAIRGYDSSQAIRRAWAVACKSEESTIVYIPKGRFLVQPMEFHGGGCHNE 91
           S ++ N+ D+GA    D SQA+ +AW  AC S+  + V IPKG + +  +   G  C   
Sbjct: 6   SGSVFNVNDYGAKGAGDISQAVMKAWKAACASQGPSTVLIPKGNYNMGEVAMQGP-CKGS 65

Query: 92  DISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTG-GVLDANGEALWS---CKFSTT 151
            I F IDG +  P D     + + W+SF  ++G++++G G LD  G+  W+   C     
Sbjct: 66  KIGFQIDGVVKAPADPSKFKS-DGWVSFYRIDGLTVSGTGTLDGQGQTAWAKNNCD-KNP 125

Query: 152 HCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEEVNVIAASNSPNTDGIH 211
           +C   A  L F   K+  +R + S NS++FHI +  C+++  + V V A   S NTDGIH
Sbjct: 126 NCKHAAMNLRFDFLKHAMVRDITSLNSKMFHINVLECEDITFQHVTVTAPGTSINTDGIH 185

Query: 212 VETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGISIGSLAHNMNEPGVGNI 271
           V  S  VTI ++ I TGDDCISIGPGS N+ I ++ CGPGHGISIGSL    NE  V  I
Sbjct: 186 VGISKGVTITNTKIATGDDCISIGPGSQNVTITQVNCGPGHGISIGSLGRYNNEKEVRGI 245

Query: 272 TVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPILIDQHYCPNNFDCPDQ 331
           TV    F GT NG+R+K+W     G    + F   TM+NVQNP+++DQ YCP        
Sbjct: 246 TVKGCTFSGTMNGVRVKTWPNSPPGAATDLTFQDLTMNNVQNPVILDQEYCPYGQCSRQA 305

Query: 332 ESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRLTYQNE--EAKSSCEYA 387
            S IK+SNI + +I GTS   +A+   CS   PC+ +++ ++ L+Y+     A S+C   
Sbjct: 306 PSRIKLSNINFNNIRGTSTGKVAVVIACSHGMPCSNMKIGEINLSYRGAGGPATSTCSNV 363

BLAST of CSPI01G31950 vs. Swiss-Prot
Match: QRT2_ARATH (Polygalacturonase QRT2 OS=Arabidopsis thaliana GN=QRT2 PE=1 SV=2)

HSP 1 Score: 262.7 bits (670), Expect = 6.6e-69
Identity = 153/392 (39.03%), Postives = 211/392 (53.83%), Query Frame = 1

Query: 25  HTIRRSHSANL-----------LNILDFGA-IRGYDSSQAIRRAWAVACKSEESTIVYIP 84
           H  R SH  N             N+  FGA   G D S+A  +AW  AC S     +  P
Sbjct: 49  HNTRNSHLKNRHGYAPRSSPRSFNVNTFGAKANGNDDSKAFMKAWEAACSSTGIVYIVAP 108

Query: 85  KGRFLVQPMEFHGGGCHNEDISFHIDG---ALIGPPDYRILGNVESWLSFVEVNGVSLTG 144
           K R  +       G C +  I F I G   A   P DY+       W+ F  VN + + G
Sbjct: 109 KNRDYMLKAVTFSGPCKSSLIIFKIYGRIEAWENPSDYK---ERRHWIVFENVNNLRVEG 168

Query: 145 GV-LDANGEALW--SCKFSTTHCPVGART-LSFRDSKNIRIRGLVSRNSQLFHIVINGCK 204
           G  +D NG   W  SCK +     +GA T ++F +  N+R+  +   N+Q  H+    CK
Sbjct: 169 GGRIDGNGHIWWPKSCKINPQLPCLGAPTAVTFVECNNLRVSNIRLENAQQMHLTFQDCK 228

Query: 205 NVLVEEVNVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCG 264
           NV    + V + ++SPNTDGIHV  + ++ I DS ++TGDDCISI  GS N+    I CG
Sbjct: 229 NVKALNLMVTSPADSPNTDGIHVSGTQNILIQDSIVRTGDDCISIVSGSENVRATGITCG 288

Query: 265 PGHGISIGSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMH 324
           PGHGISIGSL  + +E  V N+ V  A   GT NG+RIK+W +   G    + F    M 
Sbjct: 289 PGHGISIGSLGEDNSEAYVSNVVVNKATLIGTTNGVRIKTW-QGGHGMAKNIIFQDIIMK 348

Query: 325 NVQNPILIDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIR 384
           NV NPI+I+Q YC     CP+Q+S +++SN++YK+I GTS+ PIA+KF CS   PC GI 
Sbjct: 349 NVTNPIIINQDYCDRVEACPEQKSAVQVSNVLYKNIQGTSSRPIAVKFVCSKNIPCRGIS 408

Query: 385 LEDVRLTYQNEE--AKSSCEYAKGKTLGLVQP 396
           +++V+L  Q ++  +K+SC   K  T G V P
Sbjct: 409 MQNVKLVDQTQQDVSKASCSNVKLDTRGNVSP 436

BLAST of CSPI01G31950 vs. Swiss-Prot
Match: PGLR_ACTDE (Polygalacturonase OS=Actinidia deliciosa PE=2 SV=1)

HSP 1 Score: 260.4 bits (664), Expect = 3.3e-68
Identity = 145/373 (38.87%), Postives = 210/373 (56.30%), Query Frame = 1

Query: 32  SANLLNILDFGAIR-GYDSSQAIRRAWAVACKSEESTIVYIPKGRFLVQPMEFHGGGCHN 91
           ++  +N+ DFGA   G D ++A  +AW  AC S  S ++ +PK  +LV+P+ F  G C  
Sbjct: 86  ASKTVNVDDFGAKGDGRDDTKAFEKAWKAACSSTSSAVLLVPKKNYLVRPISF-SGPC-K 145

Query: 92  EDISFHIDGALIGP---PDYRILGNVESWLSFVEVNGVSLT-GGVLDANGEALW--SCKF 151
             ++  I G +       DYR  G    WL F  V  + +  GG ++ NG+  W  SCK 
Sbjct: 146 SGLTMQIYGTIEASDDRSDYRKDG--RHWLVFDSVQNLRVEGGGTINGNGKIWWQNSCKT 205

Query: 152 S-TTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEEVNVIAASNSPNT 211
           +    C      L+F  SK++ ++ L   N+Q  H+  + C NV    + V A  NSPNT
Sbjct: 206 NKALPCKDAPTALTFYKSKHVIVKNLKIENAQQIHVSFDNCVNVQASNLMVTAPENSPNT 265

Query: 212 DGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGISIGSLAHNMNEPG 271
           DGIHV  + ++ I    I TGDDCISI  GS  + +  I CGPGHGISIGSL +  +E  
Sbjct: 266 DGIHVTGTQNIHISSCVIGTGDDCISIVNGSRKVRVNDITCGPGHGISIGSLGYGNSEAH 325

Query: 272 VGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPILIDQHYCPNNFD 331
           V ++ V  A   GT NG+RIK+W +  +G    ++F    MHNV+NPI+IDQ+YC  +  
Sbjct: 326 VSDVVVNGAKLCGTTNGVRIKTW-QGGSGSASNIKFQNVEMHNVENPIIIDQNYCDQDKP 385

Query: 332 CPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRLTYQ-NEEAKSSC 391
           C +Q S +++ N++Y++I GT A+ +AI FDCS + PC GI LEDV L  +    AK+ C
Sbjct: 386 CQEQSSAVQVKNVVYQNIKGTCASNVAITFDCSKRFPCQGIVLEDVDLEIEGGAAAKALC 445

Query: 392 EYAKGKTLGLVQP 396
              +    G+V P
Sbjct: 446 NNVELSETGVVSP 453

BLAST of CSPI01G31950 vs. TrEMBL
Match: U5FYY7_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s18510g PE=3 SV=1)

HSP 1 Score: 484.6 bits (1246), Expect = 1.2e-133
Identity = 238/400 (59.50%), Postives = 294/400 (73.50%), Query Frame = 1

Query: 1   MAKISNHPSFQPTFILLALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWA 60
           MAK+   PS      LL  I L+S  I  S +  + N+L +GA      DS+QA   AW 
Sbjct: 1   MAKLLCIPS------LLLFIFLVSLNINISSAKTVYNVLTYGARPNGKTDSTQAFLHAWT 60

Query: 61  VACKSEESTIVYIPKGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLS 120
            AC S  STI+YIPKGR+L+  + F GG C + DI+  IDG LI P DYRILG   +WLS
Sbjct: 61  AACGSTNSTIIYIPKGRYLLGSVAFTGGNCKSPDITIRIDGTLIAPEDYRILGLASNWLS 120

Query: 121 FVEVNGVSLTGGVLDANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFH 180
           F  V+GVS+ GG LDA G  LW CK   ++CP GA TLSF +S NI+I GL+S NSQ+FH
Sbjct: 121 FESVSGVSIVGGALDARGSPLWDCKSKGSNCPAGATTLSFVNSNNIKINGLLSLNSQMFH 180

Query: 181 IVINGCKNVLVEEVNVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLW 240
           IVINGC+NV V+ V VIAA +SPNTDGIHV+ ST V I++S+I+TGDDCISIGPG+ NLW
Sbjct: 181 IVINGCQNVQVQGVRVIAAGDSPNTDGIHVQLSTDVVIMNSSIKTGDDCISIGPGTKNLW 240

Query: 241 IQRIRCGPGHGISIGSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQ 300
           I+R+RCGPGHGISIGSLA  M+E GV N+TV + IF GT NG RIKSWAR STGF   ++
Sbjct: 241 IERVRCGPGHGISIGSLAKTMDEAGVQNVTVKSTIFTGTTNGFRIKSWARHSTGFAQAIR 300

Query: 301 FLGATMHNVQNPILIDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSK 360
           F+GATM NVQNPI+IDQ+YCP+N +CP++ SGI+IS++IY+ I GTSATP+AIKFDCS K
Sbjct: 301 FIGATMINVQNPIIIDQNYCPHNLNCPNEVSGIQISDVIYQGIRGTSATPVAIKFDCSFK 360

Query: 361 NPCNGIRLEDVRLTYQNEEAKSSCEYAKGKTLGLVQPEGC 399
            PC GI L++V LTY N+EA+S+C  A GK  G VQP+ C
Sbjct: 361 YPCKGITLQNVNLTYLNKEAQSTCTNAIGKISGQVQPDNC 394

BLAST of CSPI01G31950 vs. TrEMBL
Match: A0A067KU09_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10497 PE=3 SV=1)

HSP 1 Score: 483.4 bits (1243), Expect = 2.6e-133
Identity = 238/387 (61.50%), Postives = 285/387 (73.64%), Query Frame = 1

Query: 14  FILLALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWAVACKSEESTIVYI 73
           FI  AL+      I  S +    N+L +GA      DS+ A   AW  AC S +ST++YI
Sbjct: 12  FIFFALL-----NINPSFARTSFNVLSYGAKPNGVTDSTNAFLDAWTAACGSNDSTMIYI 71

Query: 74  PKGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGGV 133
           PKGR+LV  M F G  C + DI+  IDG L+ P DYRILG V+ WLSF   NGVS+ GG 
Sbjct: 72  PKGRYLVGAMVFKGS-CKSSDITIRIDGTLVAPGDYRILGQVDDWLSFKGANGVSIVGGA 131

Query: 134 LDANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEE 193
           LDANG  LW+CK   ++CP GA TL F +S NI+I GL+S NSQ+FHI INGC+NV VE 
Sbjct: 132 LDANGSPLWACKAKGSNCPDGATTLRFTNSNNIKISGLLSLNSQMFHIAINGCQNVSVEG 191

Query: 194 VNVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGIS 253
           V VIA+ +SPNTDGIHV+ ST+V II+S I+TGDDCISIGPG+ NL+I+RIRCGPGHGIS
Sbjct: 192 VKVIASGDSPNTDGIHVQHSTNVVIINSVIKTGDDCISIGPGAKNLYIERIRCGPGHGIS 251

Query: 254 IGSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPI 313
           IGSL  ++ E GV N+TV + IF  TQNG RIKSWARPS GFV GVQF+GA M NVQNPI
Sbjct: 252 IGSLGWDLEEEGVRNVTVNSTIFADTQNGFRIKSWARPSNGFVQGVQFVGAIMRNVQNPI 311

Query: 314 LIDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRL 373
           +IDQHYCP+N DCP Q SGIKIS+++Y+ I GTSA  +AIKFDCSSK PCNGIRL DV L
Sbjct: 312 VIDQHYCPHNIDCPTQVSGIKISDVLYQGIRGTSAKSVAIKFDCSSKFPCNGIRLHDVNL 371

Query: 374 TYQNEEAKSSCEYAKGKTLGLVQPEGC 399
           TY N+ A+S C    GKT+GLVQP+GC
Sbjct: 372 TYSNQVAQSFCANVIGKTVGLVQPDGC 392

BLAST of CSPI01G31950 vs. TrEMBL
Match: A0A067KUR4_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_03487 PE=3 SV=1)

HSP 1 Score: 483.0 bits (1242), Expect = 3.5e-133
Identity = 234/387 (60.47%), Postives = 288/387 (74.42%), Query Frame = 1

Query: 14  FILLALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWAVACKSEESTIVYI 73
           FI  AL+ + S   R S     +N+L +GA      DS++A   AW+ AC S +ST++Y+
Sbjct: 11  FIFFALLNINSSCARTS-----INVLSYGAKPNGVTDSTKAFLDAWSAACSSNDSTMIYV 70

Query: 74  PKGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGGV 133
           PKGR+LV  M F+G  C + DI+  IDG L+ P DYRIL     WLSF +VNGVS+ GG 
Sbjct: 71  PKGRYLVGAMAFNGD-CKSSDITIRIDGTLVAPGDYRILSQAHDWLSFNKVNGVSIVGGA 130

Query: 134 LDANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEE 193
           LD  G  LW+CK   ++CP GA TL F +S NI+I GL+S NSQ+FHI INGC+NV VE 
Sbjct: 131 LDGKGSPLWACKAKGSNCPDGATTLRFANSNNIKINGLLSLNSQMFHIAINGCQNVSVEG 190

Query: 194 VNVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGIS 253
           V VIA+ +SPNTDGIHV+ S +V II+S I+TGDDCISIGPG+ NL+I RIRCGPGHGIS
Sbjct: 191 VKVIASGDSPNTDGIHVQNSANVAIINSAIKTGDDCISIGPGAKNLYIDRIRCGPGHGIS 250

Query: 254 IGSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPI 313
           IGSL  +M E GV N+TV + IF  TQNG RIKSWARPS GFV GVQF+GA M NVQNPI
Sbjct: 251 IGSLGRDMEEEGVQNVTVKSTIFADTQNGFRIKSWARPSNGFVEGVQFIGAIMRNVQNPI 310

Query: 314 LIDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRL 373
           +IDQHYCP+N +CP+Q SGIK+S++IY+DI GTSATP+AIKFDCSSK PCNGI+L +V L
Sbjct: 311 VIDQHYCPHNLNCPNQVSGIKVSDVIYEDIRGTSATPVAIKFDCSSKFPCNGIKLHNVNL 370

Query: 374 TYQNEEAKSSCEYAKGKTLGLVQPEGC 399
           T+ N+ A+S C    GKT+GLVQP GC
Sbjct: 371 THSNQVAQSFCANVVGKTIGLVQPNGC 391

BLAST of CSPI01G31950 vs. TrEMBL
Match: A0A0D2V8V5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G004100 PE=3 SV=1)

HSP 1 Score: 481.9 bits (1239), Expect = 7.7e-133
Identity = 232/384 (60.42%), Postives = 289/384 (75.26%), Query Frame = 1

Query: 17  LALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWAVACKSEESTIVYIPKG 76
           + LIL     I  + +    N+L+FGA      DS++A   AW  AC S +STI+Y+PKG
Sbjct: 11  ILLILFFMLAINSTSALTKYNVLNFGAKPNGKTDSTKAFLMAWKAACASADSTIIYVPKG 70

Query: 77  RFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGGVLDA 136
           R+L+  M F GG C +  I F IDG L+ P DYR+LG    WLSF  VNGVS+ GG LDA
Sbjct: 71  RYLLGSMAFQGG-CKSPQIIFRIDGTLVAPQDYRVLGKSTDWLSFEGVNGVSILGGALDA 130

Query: 137 NGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEEVNV 196
            G +LW+CK S ++CP GA TLSF +SKNIRIR L+S NSQ+FHIVINGC+NV V+ V +
Sbjct: 131 KGPSLWACKASHSNCPSGATTLSFTNSKNIRIRSLLSLNSQMFHIVINGCENVNVQGVRI 190

Query: 197 IAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGISIGS 256
           IAA NSPNTDGIHV+ S +V II  +I+TGDDCISIGPG+ NLW++++ CGPGHGISIGS
Sbjct: 191 IAAGNSPNTDGIHVQLSKNVNIIKCSIKTGDDCISIGPGTKNLWVEQVTCGPGHGISIGS 250

Query: 257 LAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPILID 316
           LA ++ E GV NIT+   IF GTQNGLRIKSWARPSTGFV GV+F+ + M NVQNPI+ID
Sbjct: 251 LAKDLKEEGVQNITMKKTIFLGTQNGLRIKSWARPSTGFVQGVRFMDSLMVNVQNPIVID 310

Query: 317 QHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRLTYQ 376
           Q+YCP+N +CP+Q SGIKI +IIY+ I GTS+T +AIKFDCSSKNPC GIRL++V L+Y 
Sbjct: 311 QNYCPHNLNCPNQVSGIKIKDIIYEGIRGTSSTQVAIKFDCSSKNPCTGIRLQNVNLSYL 370

Query: 377 NEEAKSSCEYAKGKTLGLVQPEGC 399
           N+ A+SSC   +GK L LV+PE C
Sbjct: 371 NKPAQSSCSNVRGKALNLVRPESC 393

BLAST of CSPI01G31950 vs. TrEMBL
Match: B9HYF5_POPTR (Polygalacturonase family protein OS=Populus trichocarpa GN=POPTR_0010s18500g PE=3 SV=2)

HSP 1 Score: 476.1 bits (1224), Expect = 4.2e-131
Identity = 230/386 (59.59%), Postives = 288/386 (74.61%), Query Frame = 1

Query: 15  ILLALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWAVACKSEESTIVYIP 74
           IL  + L+  + I  S +  + N+  +GA      DS+QA   AWA AC S + TI+YIP
Sbjct: 11  ILFFIFLVSLNNINISSAETIYNVQTYGAKPNGKTDSTQAFLDAWAAACGSTDPTIIYIP 70

Query: 75  KGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGGVL 134
           +GR+L+  + F GG C + DI   IDG LI P DYRILG   +WLSF  V+GVS+ GG L
Sbjct: 71  EGRYLLGSVAFTGGNCKSPDIIVRIDGTLIAPEDYRILGLASNWLSFEGVSGVSIVGGAL 130

Query: 135 DANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEEV 194
           DA G  LW CK   ++CP GA TLSF +S NI+I GL+S NSQ+FHIVINGC+NV V+ V
Sbjct: 131 DAKGSPLWDCKSKGSNCPAGATTLSFVNSNNIKINGLLSLNSQMFHIVINGCQNVQVQGV 190

Query: 195 NVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGISI 254
            VIAA +SPNTDGIHV+ ST V I++S+I+TGDDCISIGPG+ NLWI+R+RCGPGHGISI
Sbjct: 191 RVIAAGDSPNTDGIHVQLSTDVVIMNSSIKTGDDCISIGPGTKNLWIERVRCGPGHGISI 250

Query: 255 GSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPIL 314
           GSLA +M+E GV N+TV + IF GT NG RIKSWAR STGF   ++F+GATM NVQNPI+
Sbjct: 251 GSLAKSMDEAGVQNVTVKSTIFTGTTNGFRIKSWARHSTGFAQAIRFIGATMINVQNPII 310

Query: 315 IDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRLT 374
           IDQ+YCP+N +CP + SGI+IS++IY+ I GTSATP+AIKFDCS K PC GI L++V LT
Sbjct: 311 IDQNYCPHNLNCPTEVSGIQISDVIYQGIRGTSATPVAIKFDCSFKYPCKGITLQNVNLT 370

Query: 375 YQNEEAKSSCEYAKGKTLGLVQPEGC 399
           Y N+EA+S+C  A GKT G VQP+ C
Sbjct: 371 YLNKEARSTCTNAIGKTYGQVQPDNC 396

BLAST of CSPI01G31950 vs. TAIR10
Match: AT1G65570.1 (AT1G65570.1 Pectin lyase-like superfamily protein)

HSP 1 Score: 428.7 bits (1101), Expect = 3.9e-120
Identity = 210/393 (53.44%), Postives = 267/393 (67.94%), Query Frame = 1

Query: 14  FILLALILLISHTIRRSHSANL-----LNILDFGAIRG--YDSSQAIRRAWAVACKSEES 73
           F+    +  I  TI  SH         LN+L FGA      +S++A   AW  AC  E+S
Sbjct: 4   FLSFVQVFSIVITIIMSHFGQFDARTSLNVLSFGANPNGIVESAKAFSDAWDAACGVEDS 63

Query: 74  TIVYIPKGRFLVQ-PMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGV 133
            ++Y+PKGR+LV   + F G  C + +I+  IDG LIGP DY +LG  E+W SF  V+ V
Sbjct: 64  VVIYVPKGRYLVSGEVRFEGESCKSREITLRIDGTLIGPQDYSLLGKKENWFSFSGVHNV 123

Query: 134 SLTGGVLDANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCK 193
           ++ GG  DA G  LWSCK +  +CP GA TL F DS N++I+G++S NSQLFHI IN C+
Sbjct: 124 TVLGGSFDAKGSTLWSCKANGYNCPEGATTLRFMDSNNVKIKGVLSLNSQLFHIAINRCR 183

Query: 194 NVLVEEVNVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCG 253
           N+ +E+V +IA   SPNTDGIH++ ST + + +++I+TGDDCISIGPG+ NL +  I CG
Sbjct: 184 NIKIEDVRIIAPDESPNTDGIHIQLSTDIEVRNASIKTGDDCISIGPGTKNLMVDGITCG 243

Query: 254 PGHGISIGSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMH 313
           PGHGISIGSLA ++ E GV N+TV NA+F  T NGLRIKSW R S GFV  V+FLGA M 
Sbjct: 244 PGHGISIGSLAKSIEEQGVENVTVKNAVFVRTDNGLRIKSWPRHSNGFVERVRFLGAIMV 303

Query: 314 NVQNPILIDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIR 373
           NV  PILIDQ+YCP +  CP QESGIKI+++IY  I+GTSAT IAIK DCS K PC GIR
Sbjct: 304 NVSYPILIDQNYCPGDSSCPSQESGIKINDVIYSGIMGTSATEIAIKMDCSEKVPCTGIR 363

Query: 374 LEDVRLTYQNEEAKSSCEYAKGKTLGLVQPEGC 399
           ++ + LT   E AK+SC    GK LGLV P GC
Sbjct: 364 MQAINLTSYGEAAKTSCTNVSGKQLGLVTPSGC 396

BLAST of CSPI01G31950 vs. TAIR10
Match: AT2G43870.1 (AT2G43870.1 Pectin lyase-like superfamily protein)

HSP 1 Score: 423.7 bits (1088), Expect = 1.3e-118
Identity = 197/382 (51.57%), Postives = 266/382 (69.63%), Query Frame = 1

Query: 16  LLALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWAVACKSEESTIVYIPK 75
           + +L++L       S SA   N+L FGA      D+++A    W  AC S     + +PK
Sbjct: 1   MASLLVLFVFFFISSCSAQSYNVLSFGAKPDGKTDATKAFMAVWQTACASSRPVTIVVPK 60

Query: 76  GRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGGVLD 135
           GRFL++ + F G  C  + ++F IDG L+ P DYR++GN + W+ F  ++G+++ GGVLD
Sbjct: 61  GRFLLRSVTFDGSKCKPKPVTFRIDGTLVAPADYRVIGNEDYWIFFQHLDGITVYGGVLD 120

Query: 136 ANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEEVN 195
           A G +LW CK S  +CP GA T+ F+ S N+ + GL S NSQ+FH+VINGC NV ++ V 
Sbjct: 121 ARGASLWDCKKSGKNCPSGATTIGFQSSSNVVVSGLTSLNSQMFHVVINGCNNVKLQGVK 180

Query: 196 VIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGISIG 255
           V+AA NSPNTDGIHV++S+ V+I ++ I TGDDC+SIGPG+  LWI+ + CGPGHGISIG
Sbjct: 181 VLAAGNSPNTDGIHVQSSSSVSIFNTKISTGDDCVSIGPGTNGLWIENVACGPGHGISIG 240

Query: 256 SLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPILI 315
           SL  +  E GV N+TV    F GT NG+RIKSWARPS+GF   ++F    M+NV+NPI+I
Sbjct: 241 SLGKDSVESGVQNVTVKTVTFTGTDNGVRIKSWARPSSGFAKNIRFQHCVMNNVENPIII 300

Query: 316 DQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRLTY 375
           DQ+YCP++ DCP Q SGIKIS++++ DI GTSAT + +K DCSSK PC GIRLEDV+LTY
Sbjct: 301 DQNYCPDH-DCPRQVSGIKISDVLFVDIHGTSATEVGVKLDCSSKKPCTGIRLEDVKLTY 360

Query: 376 QNEEAKSSCEYAKGKTLGLVQP 396
           QN+ A S+C +A G   G  QP
Sbjct: 361 QNKPAASACTHAGGIEAGFFQP 381

BLAST of CSPI01G31950 vs. TAIR10
Match: AT3G59850.1 (AT3G59850.1 Pectin lyase-like superfamily protein)

HSP 1 Score: 423.3 bits (1087), Expect = 1.6e-118
Identity = 199/386 (51.55%), Postives = 268/386 (69.43%), Query Frame = 1

Query: 15  ILLALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWAVACKSEESTIVYIP 74
           +L+ L+ L+S +   S SA   NIL +GA      DS++A    WA AC S +   + +P
Sbjct: 5   LLIVLLFLLSVS---SSSAQTYNILSYGAKPDGKTDSTKAFTVLWAKACASVKPVTILVP 64

Query: 75  KGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGGVL 134
           KGRFL++ + F G  C  + ++F I G L+ P DYR++G    W+ F  ++G+S+ GGVL
Sbjct: 65  KGRFLLRSIIFDGSKCKRKSVTFRIQGTLVAPSDYRVIGKENYWILFQHLDGISVYGGVL 124

Query: 135 DANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEEV 194
           DA G +LWSCK S  +CP GA ++ F+ S+N+ I GL S NSQ+FH+ INGC NV ++ V
Sbjct: 125 DAQGASLWSCKKSGKNCPSGATSIGFQSSRNVVISGLTSLNSQMFHVAINGCSNVKLDGV 184

Query: 195 NVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGISI 254
            V A  NSPNTDGIHV++S+ V+I++S I TGDDC+SIGPG+  LWI+ + CGPGHGISI
Sbjct: 185 KVSADGNSPNTDGIHVQSSSTVSILNSKISTGDDCVSIGPGTNGLWIENVACGPGHGISI 244

Query: 255 GSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPIL 314
           GSL     E GV NITV  A F GT+NG+RIKSWARPS GF   ++F    M+NVQNPI+
Sbjct: 245 GSLGKESVEVGVQNITVKTATFTGTENGVRIKSWARPSNGFAKNIRFQHCVMNNVQNPIV 304

Query: 315 IDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRLT 374
           IDQ+YCP N +CP+Q SGIKIS++++ DI GTSAT + +K DCSSK PC GIR++DV+LT
Sbjct: 305 IDQNYCPGNENCPNQVSGIKISDVMFFDIHGTSATEVGVKLDCSSKKPCTGIRIQDVKLT 364

Query: 375 YQNEEAKSSCEYAKGKTLGLVQPEGC 399
           Y+N+ A + C +A G   G  +P  C
Sbjct: 365 YRNKPATTDCSHAGGSEAGFQRPNSC 387

BLAST of CSPI01G31950 vs. TAIR10
Match: AT2G43890.1 (AT2G43890.1 Pectin lyase-like superfamily protein)

HSP 1 Score: 399.8 bits (1026), Expect = 1.9e-111
Identity = 190/388 (48.97%), Postives = 265/388 (68.30%), Query Frame = 1

Query: 16  LLALILLISHT---IRRSHSANLLNILDFGAIRG--YDSSQAIRRAWAVACKSEESTIVY 75
           L+A++L+   +   ++ S +A+  N++ FGA      DS++A   AW  AC+S  +  V 
Sbjct: 6   LVAVLLMFFSSFLLMKSSTAASNYNVVSFGAKPDGRTDSTKAFLGAWQAACRSAAAVTVT 65

Query: 76  IPKGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGG 135
           +P+G FL++P+EF G  C +  I+F I G ++ P DYR LGN   W+ FV+VN +S+ GG
Sbjct: 66  VPRGSFLLKPVEFRGP-CRSR-ITFQIYGTIVAPSDYRGLGNSGYWILFVKVNRISIIGG 125

Query: 136 VLDANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVE 195
            LDA G + W+C+ S   CPVGAR+++F  + ++ + GL S NSQ  H+VIN C NV+V 
Sbjct: 126 TLDARGASFWACRKSGKSCPVGARSMTFNWANDVVVSGLTSINSQTTHLVINSCNNVIVR 185

Query: 196 EVNVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGI 255
           +V ++A   SPNTDG+HV+ S  VT+ D T  TGDDCISIGPG+ NL++ ++ CGPGHGI
Sbjct: 186 KVKLVAPDQSPNTDGLHVQGSAGVTVTDGTFHTGDDCISIGPGTRNLYMSKLNCGPGHGI 245

Query: 256 SIGSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNP 315
           SIGSL  + NE GV NIT+ N++F G+ NG+RIK+WAR STGFV  V F    M NVQNP
Sbjct: 246 SIGSLGRDANEAGVENITLINSVFSGSDNGVRIKTWARQSTGFVRNVLFQNLIMKNVQNP 305

Query: 316 ILIDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVR 375
           I++DQ+YCP+N  CP Q SG+KIS ++Y++I GTS T  A+ FDCS  NPC  IRL D++
Sbjct: 306 IIVDQNYCPSNQGCPKQGSGVKISQVVYRNIQGTSRTQQALTFDCSRSNPCQAIRLHDIK 365

Query: 376 LTYQNEEAKSSCEYAKGKTLGLVQPEGC 399
           LT+    A S+C+  KG   G+V P+GC
Sbjct: 366 LTFNGRSATSTCKNIKGVKAGVVMPQGC 391

BLAST of CSPI01G31950 vs. TAIR10
Match: AT1G05660.1 (AT1G05660.1 Pectin lyase-like superfamily protein)

HSP 1 Score: 386.0 bits (990), Expect = 2.9e-107
Identity = 191/387 (49.35%), Postives = 258/387 (66.67%), Query Frame = 1

Query: 15  ILLALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWAVACKSEESTIVYIP 74
           +L  L+  I  +I  S   N+ N++ FGA      DS+ A  +AW  AC S  S  V +P
Sbjct: 10  LLFTLLTFIDVSISAS---NVFNVVSFGAKPDGVTDSTGAFLKAWQGACVSASSATVVVP 69

Query: 75  KGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGGVL 134
           KG FL++ + F GG C ++ I+F + G +I P DYR  GN   W+ F +VN  SL GG  
Sbjct: 70  KGTFLLKVITF-GGPCKSK-ITFQVAGTVIAPEDYRTFGNSGFWILFNKVNRFSLVGGTF 129

Query: 135 DANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEEV 194
           DA     WSC+ S  +CP G R++SF  +K++ I G+ S NSQ+ H+ +NGC NV+V  V
Sbjct: 130 DARANGFWSCRKSGQNCPPGVRSISFNSAKDVIISGVKSMNSQVTHMTLNGCTNVVVRNV 189

Query: 195 NVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGISI 254
            ++A  NSPNTDG HV+ ST VT   ST+QTGDDC++IGPG+ NL I ++ CGPGHG+SI
Sbjct: 190 KLVAPGNSPNTDGFHVQHSTGVTFTGSTVQTGDDCVAIGPGTRNLLITKLACGPGHGVSI 249

Query: 255 GSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPIL 314
           GSLA  + E GV N+TV++++F G+QNG+RIKSWARPS GFV  V F    M NV+NPI+
Sbjct: 250 GSLAKELKEDGVENVTVSSSVFTGSQNGVRIKSWARPSNGFVRTVFFQDLVMKNVENPII 309

Query: 315 IDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRLT 374
           IDQ+YCP +  CP++ SG+KIS + YK+I GTSAT  A+K  CS  +PC GI L+D++LT
Sbjct: 310 IDQNYCPTHEGCPNEYSGVKISQVTYKNIQGTSATQEAMKLVCSKSSPCTGITLQDIKLT 369

Query: 375 Y-QNEEAKSSCEYAKGKTLGLVQPEGC 399
           Y +   A S C  A GK+LG++QP  C
Sbjct: 370 YNKGTPATSFCFNAVGKSLGVIQPTSC 391

BLAST of CSPI01G31950 vs. NCBI nr
Match: gi|659085882|ref|XP_008443656.1| (PREDICTED: polygalacturonase [Cucumis melo])

HSP 1 Score: 793.1 bits (2047), Expect = 2.2e-226
Identity = 383/404 (94.80%), Postives = 392/404 (97.03%), Query Frame = 1

Query: 1   MAKISNHPSF----QPTFILLALILLISHTIRRSHSANLLNILDFGAIRGYDSSQAIRRA 60
           MAKISNH SF    QPT ILLAL++LISHTIRRSHSANLLNILDFGAIRGYDSSQAI RA
Sbjct: 1   MAKISNHSSFHSPFQPTIILLALVILISHTIRRSHSANLLNILDFGAIRGYDSSQAIHRA 60

Query: 61  WAVACKSEESTIVYIPKGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESW 120
           WAVACKSEESTIVYIPKGRFLVQP+EFHGGGCHNEDISFHIDGALIGPPDYRILGNVESW
Sbjct: 61  WAVACKSEESTIVYIPKGRFLVQPIEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESW 120

Query: 121 LSFVEVNGVSLTGGVLDANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQL 180
           LSFVEVNGVSLTGGVL ANGEALWSCKFSTTHCPVGARTLSFRDS NIRIRGL+SRNSQL
Sbjct: 121 LSFVEVNGVSLTGGVLYANGEALWSCKFSTTHCPVGARTLSFRDSNNIRIRGLMSRNSQL 180

Query: 181 FHIVINGCKNVLVEEVNVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYN 240
           FHIVINGCKNVLVEEVNV+AASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYN
Sbjct: 181 FHIVINGCKNVLVEEVNVMAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYN 240

Query: 241 LWIQRIRCGPGHGISIGSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYG 300
           +WIQRIRCGPGHGISIGSLAHNMNEPGVGN+TV+ AIFYGTQNGLRIKSWARPSTGFVYG
Sbjct: 241 IWIQRIRCGPGHGISIGSLAHNMNEPGVGNVTVSKAIFYGTQNGLRIKSWARPSTGFVYG 300

Query: 301 VQFLGATMHNVQNPILIDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCS 360
           VQFLGATMHNVQNPILIDQHYCPNN +CP QESGIKISNIIYKDIVGTSATPIAIKFDCS
Sbjct: 301 VQFLGATMHNVQNPILIDQHYCPNNINCPGQESGIKISNIIYKDIVGTSATPIAIKFDCS 360

Query: 361 SKNPCNGIRLEDVRLTYQNEEAKSSCEYAKGKTLGLVQPEGCFE 401
           SKNPCNGIRLEDVRLTYQNEEAKSSCEYAKGKTLGLVQPEGCFE
Sbjct: 361 SKNPCNGIRLEDVRLTYQNEEAKSSCEYAKGKTLGLVQPEGCFE 404

BLAST of CSPI01G31950 vs. NCBI nr
Match: gi|566191496|ref|XP_006378626.1| (hypothetical protein POPTR_0010s18510g [Populus trichocarpa])

HSP 1 Score: 484.6 bits (1246), Expect = 1.7e-133
Identity = 238/400 (59.50%), Postives = 294/400 (73.50%), Query Frame = 1

Query: 1   MAKISNHPSFQPTFILLALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWA 60
           MAK+   PS      LL  I L+S  I  S +  + N+L +GA      DS+QA   AW 
Sbjct: 1   MAKLLCIPS------LLLFIFLVSLNINISSAKTVYNVLTYGARPNGKTDSTQAFLHAWT 60

Query: 61  VACKSEESTIVYIPKGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLS 120
            AC S  STI+YIPKGR+L+  + F GG C + DI+  IDG LI P DYRILG   +WLS
Sbjct: 61  AACGSTNSTIIYIPKGRYLLGSVAFTGGNCKSPDITIRIDGTLIAPEDYRILGLASNWLS 120

Query: 121 FVEVNGVSLTGGVLDANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFH 180
           F  V+GVS+ GG LDA G  LW CK   ++CP GA TLSF +S NI+I GL+S NSQ+FH
Sbjct: 121 FESVSGVSIVGGALDARGSPLWDCKSKGSNCPAGATTLSFVNSNNIKINGLLSLNSQMFH 180

Query: 181 IVINGCKNVLVEEVNVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLW 240
           IVINGC+NV V+ V VIAA +SPNTDGIHV+ ST V I++S+I+TGDDCISIGPG+ NLW
Sbjct: 181 IVINGCQNVQVQGVRVIAAGDSPNTDGIHVQLSTDVVIMNSSIKTGDDCISIGPGTKNLW 240

Query: 241 IQRIRCGPGHGISIGSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQ 300
           I+R+RCGPGHGISIGSLA  M+E GV N+TV + IF GT NG RIKSWAR STGF   ++
Sbjct: 241 IERVRCGPGHGISIGSLAKTMDEAGVQNVTVKSTIFTGTTNGFRIKSWARHSTGFAQAIR 300

Query: 301 FLGATMHNVQNPILIDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSK 360
           F+GATM NVQNPI+IDQ+YCP+N +CP++ SGI+IS++IY+ I GTSATP+AIKFDCS K
Sbjct: 301 FIGATMINVQNPIIIDQNYCPHNLNCPNEVSGIQISDVIYQGIRGTSATPVAIKFDCSFK 360

Query: 361 NPCNGIRLEDVRLTYQNEEAKSSCEYAKGKTLGLVQPEGC 399
            PC GI L++V LTY N+EA+S+C  A GK  G VQP+ C
Sbjct: 361 YPCKGITLQNVNLTYLNKEAQSTCTNAIGKISGQVQPDNC 394

BLAST of CSPI01G31950 vs. NCBI nr
Match: gi|802614667|ref|XP_012074772.1| (PREDICTED: polygalacturonase-like [Jatropha curcas])

HSP 1 Score: 483.4 bits (1243), Expect = 3.8e-133
Identity = 238/387 (61.50%), Postives = 285/387 (73.64%), Query Frame = 1

Query: 14  FILLALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWAVACKSEESTIVYI 73
           FI  AL+      I  S +    N+L +GA      DS+ A   AW  AC S +ST++YI
Sbjct: 12  FIFFALL-----NINPSFARTSFNVLSYGAKPNGVTDSTNAFLDAWTAACGSNDSTMIYI 71

Query: 74  PKGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGGV 133
           PKGR+LV  M F G  C + DI+  IDG L+ P DYRILG V+ WLSF   NGVS+ GG 
Sbjct: 72  PKGRYLVGAMVFKGS-CKSSDITIRIDGTLVAPGDYRILGQVDDWLSFKGANGVSIVGGA 131

Query: 134 LDANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEE 193
           LDANG  LW+CK   ++CP GA TL F +S NI+I GL+S NSQ+FHI INGC+NV VE 
Sbjct: 132 LDANGSPLWACKAKGSNCPDGATTLRFTNSNNIKISGLLSLNSQMFHIAINGCQNVSVEG 191

Query: 194 VNVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGIS 253
           V VIA+ +SPNTDGIHV+ ST+V II+S I+TGDDCISIGPG+ NL+I+RIRCGPGHGIS
Sbjct: 192 VKVIASGDSPNTDGIHVQHSTNVVIINSVIKTGDDCISIGPGAKNLYIERIRCGPGHGIS 251

Query: 254 IGSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPI 313
           IGSL  ++ E GV N+TV + IF  TQNG RIKSWARPS GFV GVQF+GA M NVQNPI
Sbjct: 252 IGSLGWDLEEEGVRNVTVNSTIFADTQNGFRIKSWARPSNGFVQGVQFVGAIMRNVQNPI 311

Query: 314 LIDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRL 373
           +IDQHYCP+N DCP Q SGIKIS+++Y+ I GTSA  +AIKFDCSSK PCNGIRL DV L
Sbjct: 312 VIDQHYCPHNIDCPTQVSGIKISDVLYQGIRGTSAKSVAIKFDCSSKFPCNGIRLHDVNL 371

Query: 374 TYQNEEAKSSCEYAKGKTLGLVQPEGC 399
           TY N+ A+S C    GKT+GLVQP+GC
Sbjct: 372 TYSNQVAQSFCANVIGKTVGLVQPDGC 392

BLAST of CSPI01G31950 vs. NCBI nr
Match: gi|802584438|ref|XP_012070142.1| (PREDICTED: polygalacturonase-like [Jatropha curcas])

HSP 1 Score: 483.0 bits (1242), Expect = 5.0e-133
Identity = 234/387 (60.47%), Postives = 288/387 (74.42%), Query Frame = 1

Query: 14  FILLALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWAVACKSEESTIVYI 73
           FI  AL+ + S   R S     +N+L +GA      DS++A   AW+ AC S +ST++Y+
Sbjct: 11  FIFFALLNINSSCARTS-----INVLSYGAKPNGVTDSTKAFLDAWSAACSSNDSTMIYV 70

Query: 74  PKGRFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGGV 133
           PKGR+LV  M F+G  C + DI+  IDG L+ P DYRIL     WLSF +VNGVS+ GG 
Sbjct: 71  PKGRYLVGAMAFNGD-CKSSDITIRIDGTLVAPGDYRILSQAHDWLSFNKVNGVSIVGGA 130

Query: 134 LDANGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEE 193
           LD  G  LW+CK   ++CP GA TL F +S NI+I GL+S NSQ+FHI INGC+NV VE 
Sbjct: 131 LDGKGSPLWACKAKGSNCPDGATTLRFANSNNIKINGLLSLNSQMFHIAINGCQNVSVEG 190

Query: 194 VNVIAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGIS 253
           V VIA+ +SPNTDGIHV+ S +V II+S I+TGDDCISIGPG+ NL+I RIRCGPGHGIS
Sbjct: 191 VKVIASGDSPNTDGIHVQNSANVAIINSAIKTGDDCISIGPGAKNLYIDRIRCGPGHGIS 250

Query: 254 IGSLAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPI 313
           IGSL  +M E GV N+TV + IF  TQNG RIKSWARPS GFV GVQF+GA M NVQNPI
Sbjct: 251 IGSLGRDMEEEGVQNVTVKSTIFADTQNGFRIKSWARPSNGFVEGVQFIGAIMRNVQNPI 310

Query: 314 LIDQHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRL 373
           +IDQHYCP+N +CP+Q SGIK+S++IY+DI GTSATP+AIKFDCSSK PCNGI+L +V L
Sbjct: 311 VIDQHYCPHNLNCPNQVSGIKVSDVIYEDIRGTSATPVAIKFDCSSKFPCNGIKLHNVNL 370

Query: 374 TYQNEEAKSSCEYAKGKTLGLVQPEGC 399
           T+ N+ A+S C    GKT+GLVQP GC
Sbjct: 371 THSNQVAQSFCANVVGKTIGLVQPNGC 391

BLAST of CSPI01G31950 vs. NCBI nr
Match: gi|823258297|ref|XP_012461856.1| (PREDICTED: polygalacturonase-like [Gossypium raimondii])

HSP 1 Score: 482.6 bits (1241), Expect = 6.5e-133
Identity = 233/384 (60.68%), Postives = 289/384 (75.26%), Query Frame = 1

Query: 17  LALILLISHTIRRSHSANLLNILDFGAIRG--YDSSQAIRRAWAVACKSEESTIVYIPKG 76
           + LIL     I  + +    N+L+FGA      DS++A   AW  AC S +STI+Y+PKG
Sbjct: 11  ILLILFFMLAINSTSALTKYNVLNFGAKPNGKTDSTKAFLMAWKAACASADSTIIYVPKG 70

Query: 77  RFLVQPMEFHGGGCHNEDISFHIDGALIGPPDYRILGNVESWLSFVEVNGVSLTGGVLDA 136
           R+L+  M F GG C +  I F IDG L+ P DYR+LG    WLSF  VNGVS+ GG LDA
Sbjct: 71  RYLLGSMAFQGG-CKSPQIIFRIDGTLVAPQDYRVLGKSTDWLSFQGVNGVSILGGALDA 130

Query: 137 NGEALWSCKFSTTHCPVGARTLSFRDSKNIRIRGLVSRNSQLFHIVINGCKNVLVEEVNV 196
            G +LW+CK S ++CP GA TLSF +SKNIRIR L+S NSQ+FHIVINGC+NV V+ V +
Sbjct: 131 KGPSLWACKASHSNCPSGATTLSFTNSKNIRIRSLLSLNSQMFHIVINGCENVNVQGVRI 190

Query: 197 IAASNSPNTDGIHVETSTHVTIIDSTIQTGDDCISIGPGSYNLWIQRIRCGPGHGISIGS 256
           IAA NSPNTDGIHV+ S +V II  +I+TGDDCISIGPG+ NLWI++I CGPG+GISIGS
Sbjct: 191 IAAGNSPNTDGIHVQLSKNVNIIKCSIKTGDDCISIGPGTKNLWIEQITCGPGYGISIGS 250

Query: 257 LAHNMNEPGVGNITVANAIFYGTQNGLRIKSWARPSTGFVYGVQFLGATMHNVQNPILID 316
           LA ++ E GV N+TV N IF GTQNGLRIKSWARPSTGFV GV+F+ + M NVQNPI+ID
Sbjct: 251 LAKDLKEEGVQNVTVRNTIFLGTQNGLRIKSWARPSTGFVQGVRFMDSLMRNVQNPIVID 310

Query: 317 QHYCPNNFDCPDQESGIKISNIIYKDIVGTSATPIAIKFDCSSKNPCNGIRLEDVRLTYQ 376
           Q+YCP+N +CP+Q SGIKI +IIY+ I GTS+T +AIKFDCS KNPC GIRL++V L+Y 
Sbjct: 311 QNYCPHNLNCPNQVSGIKIKDIIYEGIRGTSSTEVAIKFDCSPKNPCTGIRLQNVNLSYL 370

Query: 377 NEEAKSSCEYAKGKTLGLVQPEGC 399
           N+ A+SSC   +GK L LV+PE C
Sbjct: 371 NKPAQSSCSNVRGKALNLVRPESC 393

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PGLR_PRUPE1.0e-11756.53Polygalacturonase OS=Prunus persica PE=2 SV=1[more]
PGLR6_ARATH5.0e-10149.09Probable polygalacturonase At2g43860 OS=Arabidopsis thaliana GN=At2g43860 PE=2 S... [more]
PGLR2_PLAAC3.5e-7039.61Exopolygalacturonase (Fragment) OS=Platanus acerifolia GN=plaa2 PE=1 SV=1[more]
QRT2_ARATH6.6e-6939.03Polygalacturonase QRT2 OS=Arabidopsis thaliana GN=QRT2 PE=1 SV=2[more]
PGLR_ACTDE3.3e-6838.87Polygalacturonase OS=Actinidia deliciosa PE=2 SV=1[more]
Match NameE-valueIdentityDescription
U5FYY7_POPTR1.2e-13359.50Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s18510g PE=3 SV=1[more]
A0A067KU09_JATCU2.6e-13361.50Uncharacterized protein OS=Jatropha curcas GN=JCGZ_10497 PE=3 SV=1[more]
A0A067KUR4_JATCU3.5e-13360.47Uncharacterized protein OS=Jatropha curcas GN=JCGZ_03487 PE=3 SV=1[more]
A0A0D2V8V5_GOSRA7.7e-13360.42Uncharacterized protein OS=Gossypium raimondii GN=B456_013G004100 PE=3 SV=1[more]
B9HYF5_POPTR4.2e-13159.59Polygalacturonase family protein OS=Populus trichocarpa GN=POPTR_0010s18500g PE=... [more]
Match NameE-valueIdentityDescription
AT1G65570.13.9e-12053.44 Pectin lyase-like superfamily protein[more]
AT2G43870.11.3e-11851.57 Pectin lyase-like superfamily protein[more]
AT3G59850.11.6e-11851.55 Pectin lyase-like superfamily protein[more]
AT2G43890.11.9e-11148.97 Pectin lyase-like superfamily protein[more]
AT1G05660.12.9e-10749.35 Pectin lyase-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659085882|ref|XP_008443656.1|2.2e-22694.80PREDICTED: polygalacturonase [Cucumis melo][more]
gi|566191496|ref|XP_006378626.1|1.7e-13359.50hypothetical protein POPTR_0010s18510g [Populus trichocarpa][more]
gi|802614667|ref|XP_012074772.1|3.8e-13361.50PREDICTED: polygalacturonase-like [Jatropha curcas][more]
gi|802584438|ref|XP_012070142.1|5.0e-13360.47PREDICTED: polygalacturonase-like [Jatropha curcas][more]
gi|823258297|ref|XP_012461856.1|6.5e-13360.68PREDICTED: polygalacturonase-like [Gossypium raimondii][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR000743Glyco_hydro_28
IPR006626PbH1
IPR011050Pectin_lyase_fold/virulence
IPR012334Pectin_lyas_fold
Vocabulary: Molecular Function
TermDefinition
GO:0004650polygalacturonase activity
Vocabulary: Biological Process
TermDefinition
GO:0005975carbohydrate metabolic process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005982 starch metabolic process
biological_process GO:0005985 sucrose metabolic process
biological_process GO:0005975 carbohydrate metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0004650 polygalacturonase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G31950.1CSPI01G31950.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000743Glycoside hydrolase, family 28PFAMPF00295Glyco_hydro_28coord: 61..386
score: 1.4
IPR000743Glycoside hydrolase, family 28PROSITEPS00502POLYGALACTURONASEcoord: 241..254
scor
IPR006626Parallel beta-helix repeatSMARTSM00710pbh1coord: 234..254
score: 5700.0coord: 264..285
score: 82.0coord: 184..210
score: 150.0coord: 294..315
score: 5400.0coord: 211..232
score:
IPR011050Pectin lyase fold/virulence factorunknownSSF51126Pectin lyase-likecoord: 25..399
score: 2.67E
IPR012334Pectin lyase foldGENE3DG3DSA:2.160.20.10coord: 18..398
score: 1.8E
NoneNo IPR availablePANTHERPTHR31375FAMILY NOT NAMEDcoord: 14..400
score: 6.1E
NoneNo IPR availablePANTHERPTHR31375:SF2POLYGALACTURONASE/PECTINASEcoord: 14..400
score: 6.1E