CSPI04G05120 (gene) Wild cucumber (PI 183967)

NameCSPI04G05120
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionHeparanase
LocationChr4 : 3457364 .. 3459946 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAATACCAAATATTCCTTTTGATTTTGCTTGTTGCTTTCATTCCAAGAACCATTCTTGGTCAAAATGTTACAATGGGTAAAATTGTGGTTGAGGGAATCACAAAAATAGCAGAAACTGATGAGAATTTCATTTGCTTCACTTTGGACATTTGGCCTCATGATGAGTGCAGTCAACCCAACCTTTGTGTTTGGGATAGCCATGCATCAGTCCTCAATGTGGTAAGTAGAGAGTTGTTGATCTTTGGAGACATTCAAAGAAAGTAATTCATAGTATATTTTATTAATCAAATTTTCTTTTCTTAGGATTTGTCTCTTCCTATCATCAATAAAGCCGTTCAAGGTAAGTTCGCTCTTCCAATACATAAACAAATTACTACTTAACCTTTACGATAAATTCTAATTAGCTTAATTTTATGTATAATATTAGCTTTCAAGACATTACGAATTAGAGTTGGAGGTACCTTGCAAGATAGGTTGATTTATAATATTGGAGAAGGCTTCAAGGGCAATTGTCATCCATTTGAAGCTGATGATAGTTTACTATTTGACTTTACCGAAGGATGTTTGTATATGGAAAGGTGGGATGATTTGAACAACTTTTTTAACAATACAGGGTATGTTTATCTTTCTTCTCACAACCAAACTATCACCTAAATATTTTTCTTTCTACATATTTTCTCAATATTTTATATATTTTCATCACCATTAAAATTAGTTTTTTTTTAAAAAAAAAAAAATACAACAGGGCCATTGTAACTTTTGGCTTAAATGCTCTATTGGGTAAGTACCATACACAAGGAATGCAATGGGAAGGCAATTGGAACTACACTAATGCTGAAGCTCTAATTAAATATACTGTGGACAAGAATTATCAAATAAACTCATGGGAGTTTGGTAAGTTAAACAATACTAAATTTCAATTCTCTATTAGGGAATATGCCATTTGCAATAGAGGAGAGAAGATTCTTTGTATTCTTTCTTGTGTAAATGGTGATTCCATATCTTGACGAAAGAAATGTACTCTCTCCATAAATATAACTATATATTTCATGCTAATTAAGGAAGCATATATGTGATTGCTGGAATATACAGGCAATGAATTGGCTGGACGAAACAGTATTGGTGCAAGCATTAGTGCTTCACAATATGCAAAAGACCTACTAAAGCTTCGAGAAATCGTAGATCGTTTATACAAAAATTCCCAACAAAAACCTTTGATTGTGGCACCTGGTGCATTCTTTGATGACAAATGGTACCATGAACTTGTTACAAAAACTGGACCAAAGGTTGTTAGTGTTCTCACTCACCATATCTATAACATGGGTGCAGGTAAAATTAGCTACTTATCTATATATTAATTATTATGAACTAATCATAATTAACTCTTTTAATTTTGGTCTATATATATGTATAGGTGATGATCCCAAATTGATTTATAGGTTTGTTAATCCAACATACCTAAGTCAAGTATCAAACACATTTAAACAACTTAAGAACATAGTTCAAAAGCATGCCCCTTGGTCTTCTGCTTGGGTTGGTGAAGCTGGTGGAGCCTATCAAGGTGGAGCCTATCGTATTTCTGATTCATTTATCAATAGTTTTTGGTAACAAATTATTCTTCTTTTACATAAATTTACTTTTTTTTTCTCTCATTAATTAATCTTTTTTTTTCACCCATTTCATTTAAATATGATTTTGAATAATAACGTAGGTACTTAGATCAACTTGGGATGGCCGCTTTCTACAATACCAAAGTATATTGTAGACAAACTTTGATCGGTGGATTTTATAGTGTCCTCAAAGCTAAAACTTTAGTCCCTACACCGGATTACTACGGGTACGACGAATATTAACATTTATAAAAGCTTAATATTGTGTATGAGTTTTTTTGACATGATCCAATTATCATCCGATCAAGTGGGTTGCCACATGAGATTGATCGATATGCACATAAATTGATTGGACACTCAAACCAACATCACAAAAATTAAAATAATTATCTTACTTAATTTGTTATTTATTTTTTGTTAGTGCACTTCTCTTCCACCGACTTATGGGTCCAGGTGTTCTCAAAGTTCATAACAAAGTCTCTACTTATCTCCGCACCTATGCTCATTGCTCAAGAGGAAGGGTACATCATATGATCTTTTATTTCAAATTTTAAAAAATTAATCCTATACAAACACGAGTTTCCACCTTCAAATTGATCATATACATCATTTTCTTTTGTAGTCTGGCATATCCATGCTTTTCATCAATTTGAGCAACACAACCGAATTTGCAATAAACGTCAAAGACCACATGACCCTAAGTTTGCACAAAAGAAGGAAGCCCAAGCATGGTTCATCTTCAATTAATAATTTGGGAACACCAAGGGAGGAATATCATTTGACACCACAAAATGGTCTTCTTAGAAGCTCTAATGTGCTTTTAAATGGAAAGGCATTGCAACTTACAAGCGAAGGAGAATTGCCAAATCTTACACCAATTTATAAAGACAGCAACTCTTCTATAAATATTGCTACCTGGTCGATTGCCTTCGTTGTCATCCCTGACTTTGTAGCTATTGGGTGCAACTAA

mRNA sequence

ATGGAATACCAAATATTCCTTTTGATTTTGCTTGTTGCTTTCATTCCAAGAACCATTCTTGGTCAAAATGTTACAATGGGTAAAATTGTGGTTGAGGGAATCACAAAAATAGCAGAAACTGATGAGAATTTCATTTGCTTCACTTTGGACATTTGGCCTCATGATGAGTGCAGTCAACCCAACCTTTGTGTTTGGGATAGCCATGCATCAGTCCTCAATGTGGATTTGTCTCTTCCTATCATCAATAAAGCCGTTCAAGCTTTCAAGACATTACGAATTAGAGTTGGAGGTACCTTGCAAGATAGGTTGATTTATAATATTGGAGAAGGCTTCAAGGGCAATTGTCATCCATTTGAAGCTGATGATAGTTTACTATTTGACTTTACCGAAGGATGTTTGTATATGGAAAGGTGGGATGATTTGAACAACTTTTTTAACAATACAGGGGCCATTGTAACTTTTGGCTTAAATGCTCTATTGGGTAAGTACCATACACAAGGAATGCAATGGGAAGGCAATTGGAACTACACTAATGCTGAAGCTCTAATTAAATATACTGTGGACAAGAATTATCAAATAAACTCATGGGAGTTTGGCAATGAATTGGCTGGACGAAACAGTATTGGTGCAAGCATTAGTGCTTCACAATATGCAAAAGACCTACTAAAGCTTCGAGAAATCGTAGATCGTTTATACAAAAATTCCCAACAAAAACCTTTGATTGTGGCACCTGGTGCATTCTTTGATGACAAATGGTACCATGAACTTGTTACAAAAACTGGACCAAAGGTTGTTAGTGTTCTCACTCACCATATCTATAACATGGGTGCAGGTGATGATCCCAAATTGATTTATAGGTTTGTTAATCCAACATACCTAAGTCAAGTATCAAACACATTTAAACAACTTAAGAACATAGTTCAAAAGCATGCCCCTTGGTCTTCTGCTTGGGTTGGTGAAGCTGGTGGAGCCTATCAAGGTGGAGCCTATCGTATTTCTGATTCATTTATCAATAGTTTTTGGTACTTAGATCAACTTGGGATGGCCGCTTTCTACAATACCAAAGTATATTGTAGACAAACTTTGATCGGTGGATTTTATAGTGTCCTCAAAGCTAAAACTTTAGTCCCTACACCGGATTACTACGGTGCACTTCTCTTCCACCGACTTATGGGTCCAGGTGTTCTCAAAGTTCATAACAAAGTCTCTACTTATCTCCGCACCTATGCTCATTGCTCAAGAGGAAGGTCTGGCATATCCATGCTTTTCATCAATTTGAGCAACACAACCGAATTTGCAATAAACGTCAAAGACCACATGACCCTAAGTTTGCACAAAAGAAGGAAGCCCAAGCATGGTTCATCTTCAATTAATAATTTGGGAACACCAAGGGAGGAATATCATTTGACACCACAAAATGGTCTTCTTAGAAGCTCTAATGTGCTTTTAAATGGAAAGGCATTGCAACTTACAAGCGAAGGAGAATTGCCAAATCTTACACCAATTTATAAAGACAGCAACTCTTCTATAAATATTGCTACCTGGTCGATTGCCTTCGTTGTCATCCCTGACTTTGTAGCTATTGGGTGCAACTAA

Coding sequence (CDS)

ATGGAATACCAAATATTCCTTTTGATTTTGCTTGTTGCTTTCATTCCAAGAACCATTCTTGGTCAAAATGTTACAATGGGTAAAATTGTGGTTGAGGGAATCACAAAAATAGCAGAAACTGATGAGAATTTCATTTGCTTCACTTTGGACATTTGGCCTCATGATGAGTGCAGTCAACCCAACCTTTGTGTTTGGGATAGCCATGCATCAGTCCTCAATGTGGATTTGTCTCTTCCTATCATCAATAAAGCCGTTCAAGCTTTCAAGACATTACGAATTAGAGTTGGAGGTACCTTGCAAGATAGGTTGATTTATAATATTGGAGAAGGCTTCAAGGGCAATTGTCATCCATTTGAAGCTGATGATAGTTTACTATTTGACTTTACCGAAGGATGTTTGTATATGGAAAGGTGGGATGATTTGAACAACTTTTTTAACAATACAGGGGCCATTGTAACTTTTGGCTTAAATGCTCTATTGGGTAAGTACCATACACAAGGAATGCAATGGGAAGGCAATTGGAACTACACTAATGCTGAAGCTCTAATTAAATATACTGTGGACAAGAATTATCAAATAAACTCATGGGAGTTTGGCAATGAATTGGCTGGACGAAACAGTATTGGTGCAAGCATTAGTGCTTCACAATATGCAAAAGACCTACTAAAGCTTCGAGAAATCGTAGATCGTTTATACAAAAATTCCCAACAAAAACCTTTGATTGTGGCACCTGGTGCATTCTTTGATGACAAATGGTACCATGAACTTGTTACAAAAACTGGACCAAAGGTTGTTAGTGTTCTCACTCACCATATCTATAACATGGGTGCAGGTGATGATCCCAAATTGATTTATAGGTTTGTTAATCCAACATACCTAAGTCAAGTATCAAACACATTTAAACAACTTAAGAACATAGTTCAAAAGCATGCCCCTTGGTCTTCTGCTTGGGTTGGTGAAGCTGGTGGAGCCTATCAAGGTGGAGCCTATCGTATTTCTGATTCATTTATCAATAGTTTTTGGTACTTAGATCAACTTGGGATGGCCGCTTTCTACAATACCAAAGTATATTGTAGACAAACTTTGATCGGTGGATTTTATAGTGTCCTCAAAGCTAAAACTTTAGTCCCTACACCGGATTACTACGGTGCACTTCTCTTCCACCGACTTATGGGTCCAGGTGTTCTCAAAGTTCATAACAAAGTCTCTACTTATCTCCGCACCTATGCTCATTGCTCAAGAGGAAGGTCTGGCATATCCATGCTTTTCATCAATTTGAGCAACACAACCGAATTTGCAATAAACGTCAAAGACCACATGACCCTAAGTTTGCACAAAAGAAGGAAGCCCAAGCATGGTTCATCTTCAATTAATAATTTGGGAACACCAAGGGAGGAATATCATTTGACACCACAAAATGGTCTTCTTAGAAGCTCTAATGTGCTTTTAAATGGAAAGGCATTGCAACTTACAAGCGAAGGAGAATTGCCAAATCTTACACCAATTTATAAAGACAGCAACTCTTCTATAAATATTGCTACCTGGTCGATTGCCTTCGTTGTCATCCCTGACTTTGTAGCTATTGGGTGCAACTAA
BLAST of CSPI04G05120 vs. Swiss-Prot
Match: HPSE1_ARATH (Heparanase-like protein 1 OS=Arabidopsis thaliana GN=At5g07830 PE=2 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 2.1e-163
Identity = 268/542 (49.45%), Postives = 391/542 (72.14%), Query Frame = 1

Query: 5   IFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQPNLCV 64
           +FL  LL+  +P   + Q +    IV++G  ++ ETDENF+C TLD WPHD+C+    C 
Sbjct: 10  VFLGCLLL--VPEKTMAQEMKRASIVIQGARRVCETDENFVCATLDWWPHDKCNYDQ-CP 69

Query: 65  WDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEADDSL 124
           W  ++SV+N+DL+ P++ KA++AFK LRIR+GG+LQD++IY++G   K  C PF+  +S 
Sbjct: 70  W-GYSSVINMDLTRPLLTKAIKAFKPLRIRIGGSLQDQVIYDVGN-LKTPCRPFQKMNSG 129

Query: 125 LFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAEALIK 184
           LF F++GCL+M+RWD+LN+F   TGA+VTFGLNAL G++  +G  W G W++ N +  + 
Sbjct: 130 LFGFSKGCLHMKRWDELNSFLTATGAVVTFGLNALRGRHKLRGKAWGGAWDHINTQDFLN 189

Query: 185 YTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQ-QKPLIVA 244
           YTV K Y I+SWEFGNEL+G + +GAS+SA  Y KDL+ L+++++++YKNS   KP++VA
Sbjct: 190 YTVSKGYVIDSWEFGNELSG-SGVGASVSAELYGKDLIVLKDVINKVYKNSWLHKPILVA 249

Query: 245 PGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTFKQL 304
           PG F++ +WY +L+  +GP VV V+THHIYN+G+G+DP L+ + ++P+YLSQVS TFK +
Sbjct: 250 PGGFYEQQWYTKLLEISGPSVVDVVTHHIYNLGSGNDPALVKKIMDPSYLSQVSKTFKDV 309

Query: 305 KNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQTLI 364
              +Q+H PW+S WVGE+GGAY  G   +SD+FI+SFWYLDQLGM+A +NTKVYCRQTL+
Sbjct: 310 NQTIQEHGPWASPWVGESGGAYNSGGRHVSDTFIDSFWYLDQLGMSARHNTKVYCRQTLV 369

Query: 365 GGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGISMLF 424
           GGFY +L+  T VP PDYY ALL+HRLMG GVL V       LR YAHCS+GR+G+++L 
Sbjct: 370 GGFYGLLEKGTFVPNPDYYSALLWHRLMGKGVLAVQTDGPPQLRVYAHCSKGRAGVTLLL 429

Query: 425 INLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTP--------------REEYHL 484
           INLSN ++F ++V + + + L+   + K   S ++ L  P              REEYHL
Sbjct: 430 INLSNQSDFTVSVSNGINVVLNAESRKK--KSLLDTLKRPFSWIGSKASDGYLNREEYHL 489

Query: 485 TPQNGLLRSSNVLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIG 532
           TP+NG+LRS  ++LNGK+L+ T+ G++P+L P+ +  NS +N+   S++F+V+P+F A  
Sbjct: 490 TPENGVLRSKTMVLNGKSLKPTATGDIPSLEPVLRSVNSPLNVLPLSMSFIVLPNFDASA 543

BLAST of CSPI04G05120 vs. Swiss-Prot
Match: HPSE2_ARATH (Heparanase-like protein 2 OS=Arabidopsis thaliana GN=At5g61250 PE=2 SV=1)

HSP 1 Score: 554.3 bits (1427), Expect = 1.5e-156
Identity = 266/543 (48.99%), Postives = 377/543 (69.43%), Query Frame = 1

Query: 1   MEYQIFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQP 60
           M + + + +  +  +P    G N+    +V++G  +IAETDENFIC TLD WP ++C+  
Sbjct: 1   MGFNVVVFLSCLLLLPPVTFGSNMERTTLVIDGSRRIAETDENFICATLDWWPPEKCNYD 60

Query: 61  NLCVWDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEA 120
             C W  +AS++N++L+ P++ KA+QAF+TLRIR+GG+LQD++IY++G+  K  C  F+ 
Sbjct: 61  Q-CPW-GYASLINLNLASPLLAKAIQAFRTLRIRIGGSLQDQVIYDVGD-LKTPCTQFKK 120

Query: 121 DDSLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAE 180
            D  LF F+EGCLYM+RWD++N+FFN TGAIVTFGLNAL G+    G  W G+W++TN +
Sbjct: 121 TDDGLFGFSEGCLYMKRWDEVNHFFNATGAIVTFGLNALHGRNKLNGTAWGGDWDHTNTQ 180

Query: 181 ALIKYTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPL 240
             + YTV K Y I+SWEFGNEL+G + I AS+S   Y KDL+ L+ ++  +YKNS+ KPL
Sbjct: 181 DFMNYTVSKGYAIDSWEFGNELSG-SGIWASVSVELYGKDLIVLKNVIKNVYKNSRTKPL 240

Query: 241 IVAPGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 300
           +VAPG FF+++WY EL+  +GP V+ VLTHHIYN+G G+DPKL+ + ++P YLS +S  F
Sbjct: 241 VVAPGGFFEEQWYSELLRLSGPGVLDVLTHHIYNLGPGNDPKLVNKILDPNYLSGISELF 300

Query: 301 KQLKNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQ 360
             +   +Q+H PW++AWVGEAGGA+  G  ++S++FINSFWYLDQLG+++ +NTKVYCRQ
Sbjct: 301 ANVNQTIQEHGPWAAAWVGEAGGAFNSGGRQVSETFINSFWYLDQLGISSKHNTKVYCRQ 360

Query: 361 TLIGGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGIS 420
            L+GGFY +L+ +T VP PDYY ALL+HRLMG G+L V    S YLR Y HCS+ R+GI+
Sbjct: 361 ALVGGFYGLLEKETFVPNPDYYSALLWHRLMGKGILGVQTTASEYLRAYVHCSKRRAGIT 420

Query: 421 MLFINLSNTTEFAINVKDHMTLSLH---KRRKP-----KHGSSSINNLGTP----REEYH 480
           +L INLS  T F + V + + + L     +RK      K   S + N  +     REEYH
Sbjct: 421 ILLINLSKHTTFTVAVSNGVKVVLQAESMKRKSFLETIKSKVSWVGNKASDGYLNREEYH 480

Query: 481 LTPQNGLLRSSNVLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAI 532
           L+P++G LRS  +LLNGK L  T+ G++P L P+     S + I   SI+F+V+P F A 
Sbjct: 481 LSPKDGDLRSKIMLLNGKPLVPTATGDIPKLEPVRHGVKSPVYINPLSISFIVLPTFDAP 539

BLAST of CSPI04G05120 vs. Swiss-Prot
Match: HPSE3_ARATH (Heparanase-like protein 3 OS=Arabidopsis thaliana GN=At5g34940 PE=2 SV=2)

HSP 1 Score: 454.1 bits (1167), Expect = 2.1e-126
Identity = 235/532 (44.17%), Postives = 329/532 (61.84%), Query Frame = 1

Query: 5   IFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQPNLCV 64
           I L + +  F+  T+       G + V G   +   DE+FIC TLD WP ++C   + C 
Sbjct: 9   IVLFLCVFQFLDCTVSSAVEENGTVFVYGRAAVGTIDEDFICATLDWWPPEKCDYGS-CS 68

Query: 65  WDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEADDSL 124
           WD HAS+LN+DL+  I+  A++AF  L+IR+GGTLQD +IY   +  K  C PF  + S+
Sbjct: 69  WD-HASILNLDLNNVILQNAIKAFAPLKIRIGGTLQDIVIYETPDS-KQPCLPFTKNSSI 128

Query: 125 LFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAEALIK 184
           LF +T+GCL M RWD+LN FF  TG  V FGLNAL G+      +  G WNYTNAE+ I+
Sbjct: 129 LFGYTQGCLPMRRWDELNAFFRKTGTKVIFGLNALSGRSIKSNGEAIGAWNYTNAESFIR 188

Query: 185 YTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPLIVAP 244
           +T + NY I+ WE GNEL G + +GA + A+QYA D + LR IV+R+YKN    PL++ P
Sbjct: 189 FTAENNYTIDGWELGNELCG-SGVGARVGANQYAIDTINLRNIVNRVYKNVSPMPLVIGP 248

Query: 245 GAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTFKQLK 304
           G FF+  W+ E + K     ++  T HIY++G G D  LI + +NP+YL Q + +F+ LK
Sbjct: 249 GGFFEVDWFTEYLNKA-ENSLNATTRHIYDLGPGVDEHLIEKILNPSYLDQEAKSFRSLK 308

Query: 305 NIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQTLIG 364
           NI++  +  + AWVGE+GGAY  G   +S++F+ SFWYLDQLGMA+ Y+TK YCRQ+LIG
Sbjct: 309 NIIKNSSTKAVAWVGESGGAYNSGRNLVSNAFVYSFWYLDQLGMASLYDTKTYCRQSLIG 368

Query: 365 GFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGISMLFI 424
           G Y +L      P PDYY AL++ +LMG   L      +  +R+Y HC+R   GI++L +
Sbjct: 369 GNYGLLNTTNFTPNPDYYSALIWRQLMGRKALFTTFSGTKKIRSYTHCARQSKGITVLLM 428

Query: 425 NLSNTTEFAINVKDHMTLSL-HKRRKPKHGSSSINNLGTP-----REEYHLTPQNGLLRS 484
           NL NTT     V+ + + SL H +    +  +S    G P     REEYHLT ++G L S
Sbjct: 429 NLDNTTTVVAKVELNNSFSLRHTKHMKSYKRASSQLFGGPNGVIQREEYHLTAKDGNLHS 488

Query: 485 SNVLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGC 531
             +LLNG ALQ+ S G+LP + PI+ +S   I IA +SI FV + + V   C
Sbjct: 489 QTMLLNGNALQVNSMGDLPPIEPIHINSTEPITIAPYSIVFVHMRNVVVPAC 535

BLAST of CSPI04G05120 vs. Swiss-Prot
Match: BAGLU_SCUBA (Baicalin-beta-D-glucuronidase OS=Scutellaria baicalensis GN=SGUS PE=1 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 7.8e-102
Identity = 201/519 (38.73%), Postives = 299/519 (57.61%), Query Frame = 1

Query: 19  ILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQPNLCVWDSHASVLNVDLSL 78
           ++G+  T+ KI       +A+TDEN++C TLD+WP  +C+  N C W   +S LN+DL+ 
Sbjct: 23  VIGEETTIVKIEEN---PVAQTDENYVCATLDLWPPTKCNYGN-CPWGK-SSFLNLDLNN 82

Query: 79  PIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEADDSLLFDFTEGCLYMERW 138
            II  AV+ F  L++R GGTLQDRL+Y        +   F  + +L+ DF+  CL ++RW
Sbjct: 83  NIIRNAVKEFAPLKLRFGGTLQDRLVYQTSRDEPCDS-TFYNNTNLILDFSHACLSLDRW 142

Query: 139 DDLNNFFNNTGAIVTFGLNALLGK------------YHTQGMQWEGNWNYTNAEALIKYT 198
           D++N F   TG+   FGLNAL GK            Y  +     G W+Y+N++ LI+Y+
Sbjct: 143 DEINQFILETGSEAVFGLNALRGKTVEIKGIIKDGQYLGETTTAVGEWDYSNSKFLIEYS 202

Query: 199 VDKNYQ-INSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPLIVAPG 258
           + K Y+ I  W  GNEL G +++   +S   YA D  KL E+V  +Y++    PLI+APG
Sbjct: 203 LKKGYKHIRGWTLGNELGG-HTLFIGVSPEDYANDAKKLHELVKEIYQDQGTMPLIIAPG 262

Query: 259 AFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNT-FKQLK 318
           A FD +WY E + +T    + V THH+YN+G+G D  L    +  ++  + + + ++ L+
Sbjct: 263 AIFDLEWYTEFIDRTPE--LHVATHHMYNLGSGGDDALKDVLLTASFFDEATKSMYEGLQ 322

Query: 319 NIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQTLIG 378
            IV +    + AW+GEAGGA+  G   IS++FIN FWYL+ LG +A  +TK +CRQTL G
Sbjct: 323 KIVNRPGTKAVAWIGEAGGAFNSGQDGISNTFINGFWYLNMLGYSALLDTKTFCRQTLTG 382

Query: 379 GFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGISMLFI 438
           G Y +L+  T +P PDYY ALL+HRLMG  VLK     +  +  YAHC++  +GI+ML +
Sbjct: 383 GNYGLLQTGTYIPNPDYYSALLWHRLMGSKVLKTEIVGTKNVYIYAHCAKKSNGITMLVL 442

Query: 439 NLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSNVLLN 498
           N    +   I++                     +  G+ REEYHLTP N  L+S  V LN
Sbjct: 443 NHDGESSVKISLDP-------------------SKYGSKREEYHLTPVNNNLQSRLVKLN 502

Query: 499 GKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIP 524
           G+ L L   G +P L P+ KD++  + +A +S  FV +P
Sbjct: 503 GELLHLDPSGVIPALNPVEKDNSKQLEVAPYSFMFVHLP 513

BLAST of CSPI04G05120 vs. Swiss-Prot
Match: HPSE_CHICK (Heparanase OS=Gallus gallus GN=HPSE PE=1 SV=1)

HSP 1 Score: 155.2 bits (391), Expect = 2.0e-36
Identity = 120/404 (29.70%), Postives = 187/404 (46.29%), Query Frame = 1

Query: 139 DDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAEALIKYTVDKNYQINSWEF 198
           D L+ F +++G  + FGLNALL +    G+QW+ +    NA+ L+ Y   ++Y I SWE 
Sbjct: 150 DILHTFASSSGFRLVFGLNALLRR---AGLQWDSS----NAKQLLGYCAQRSYNI-SWEL 209

Query: 199 GNELAG-RNSIGASISASQYAKDLLKLREIVDR--LYKNSQQKPLIVAPGAFFDDKWYHE 258
           GNE    R   G  I   Q  +D + LR+++ +  LY++++   L V             
Sbjct: 210 GNEPNSFRKKSGICIDGFQLGRDFVHLRQLLSQHPLYRHAELYGLDVGQPRKHTQHLLRS 269

Query: 259 LVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTFKQLKNIVQKHAPWSS 318
            +   G  + SV  HH Y  G     +    F++P  L   +     +  IV+   P   
Sbjct: 270 FMKSGGKAIDSVTWHHYYVNGRSATRE---DFLSPEVLDSFATAIHDVLGIVEATVPGKK 329

Query: 319 AWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQTLIG-GFYSVLKAKT 378
            W+GE G AY GGA ++S++++  F +LD+LG+AA     V  RQ   G G Y ++ A  
Sbjct: 330 VWLGETGSAYGGGAPQLSNTYVAGFMWLDKLGLAARRGIDVVMRQVSFGAGSYHLVDA-G 389

Query: 379 LVPTPDYYGALLFHRLMGPGVLK--VHNKVSTYLRTYAHCSRGR------SGISMLFINL 438
             P PDY+ +LL+ RL+G  VL+  V    +   R Y HC+  R        +++  +NL
Sbjct: 390 FKPLPDYWLSLLYKRLVGTRVLQASVEQADARRPRVYLHCTNPRHPKYREGDVTLFALNL 449

Query: 439 SNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSNVLLNGK 498
           SN T+     K   + S+ +     HG  SI                    S  V LNG+
Sbjct: 450 SNVTQSLQLPKQLWSKSVDQYLLLPHGKDSI-------------------LSREVQLNGR 509

Query: 499 ALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGC 531
            LQ+  +  LP L  +     S++ +  +S  F VI +  AI C
Sbjct: 510 LLQMVDDETLPALHEMALAPGSTLGLPAFSYGFYVIRNAKAIAC 522

BLAST of CSPI04G05120 vs. TrEMBL
Match: A0A0A0KUF1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G043860 PE=4 SV=1)

HSP 1 Score: 1091.3 bits (2821), Expect = 0.0e+00
Identity = 530/531 (99.81%), Postives = 530/531 (99.81%), Query Frame = 1

Query: 1   MEYQIFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQP 60
           MEYQIFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQP
Sbjct: 1   MEYQIFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQP 60

Query: 61  NLCVWDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEA 120
           NLCVWDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEA
Sbjct: 61  NLCVWDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEA 120

Query: 121 DDSLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAE 180
           DDSLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAE
Sbjct: 121 DDSLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAE 180

Query: 181 ALIKYTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPL 240
           ALIKYTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPL
Sbjct: 181 ALIKYTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPL 240

Query: 241 IVAPGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 300
           IVAPGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF
Sbjct: 241 IVAPGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 300

Query: 301 KQLKNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQ 360
           KQLKNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQ
Sbjct: 301 KQLKNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQ 360

Query: 361 TLIGGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGIS 420
           TLIGGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSR RSGIS
Sbjct: 361 TLIGGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRERSGIS 420

Query: 421 MLFINLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSN 480
           MLFINLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSN
Sbjct: 421 MLFINLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSN 480

Query: 481 VLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGCN 532
           VLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGCN
Sbjct: 481 VLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGCN 531

BLAST of CSPI04G05120 vs. TrEMBL
Match: A0A0A0KTJ9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G390000 PE=4 SV=1)

HSP 1 Score: 953.7 bits (2464), Expect = 9.2e-275
Identity = 453/531 (85.31%), Postives = 492/531 (92.66%), Query Frame = 1

Query: 1   MEYQIFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQP 60
           MEYQIFLLIL+ AFIPRTILG NVT GKIVV+G TKIAETDENFICFTLDIWPHDECSQP
Sbjct: 12  MEYQIFLLILVFAFIPRTILGLNVTTGKIVVDGTTKIAETDENFICFTLDIWPHDECSQP 71

Query: 61  NLCVWDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEA 120
           NLCVWD HAS+LN+DLSLPI+NKAVQAFKTLRIRVGGTLQDRLIYNIG+GFKGNC+PFEA
Sbjct: 72  NLCVWDGHASMLNMDLSLPILNKAVQAFKTLRIRVGGTLQDRLIYNIGDGFKGNCNPFEA 131

Query: 121 DDSLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAE 180
              LLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKY+T+G+QWEGNWNY+NAE
Sbjct: 132 HKGLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYSNAE 191

Query: 181 ALIKYTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPL 240
           ALIKYTV+K Y INSWEFGNELAG NSIGAS+SASQYAKDLLKLR+I+DRLYKNSQQKPL
Sbjct: 192 ALIKYTVEKKYNINSWEFGNELAGPNSIGASVSASQYAKDLLKLRQIIDRLYKNSQQKPL 251

Query: 241 IVAPGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 300
           IVAPGAFFDDKWY ELVTKTG  VVS LTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF
Sbjct: 252 IVAPGAFFDDKWYDELVTKTGSNVVSALTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 311

Query: 301 KQLKNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQ 360
           +QLKNI++KHAPW+SAWVGEAGGAY GG   ISD+FINSFWYLDQLGMAA YNTKVYCRQ
Sbjct: 312 RQLKNIIEKHAPWASAWVGEAGGAYHGGGLHISDTFINSFWYLDQLGMAASYNTKVYCRQ 371

Query: 361 TLIGGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGIS 420
           TL+GG+Y VL+ KT +PTPDYYGALLFHRLMG  VLKV N VS+YLRTYAHCSRGRSG++
Sbjct: 372 TLVGGYYGVLRTKTFIPTPDYYGALLFHRLMGSSVLKVDNNVSSYLRTYAHCSRGRSGVT 431

Query: 421 MLFINLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSN 480
           MLFINLSNTTEF IN+++HM LSLHK  KPKH SS   N+GT REEYHLTPQNGLLRSS 
Sbjct: 432 MLFINLSNTTEFTINIENHMNLSLHKS-KPKHSSSK--NVGTQREEYHLTPQNGLLRSST 491

Query: 481 VLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGCN 532
           VLLNGKAL+LT+EGE+P+LTP+Y+DSNSSI+I  WSIAF+VIPDFVAIGCN
Sbjct: 492 VLLNGKALELTNEGEVPDLTPVYRDSNSSISIPNWSIAFIVIPDFVAIGCN 539

BLAST of CSPI04G05120 vs. TrEMBL
Match: A0A0A0L5V7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G165650 PE=4 SV=1)

HSP 1 Score: 719.2 bits (1855), Expect = 3.8e-204
Identity = 343/525 (65.33%), Postives = 417/525 (79.43%), Query Frame = 1

Query: 6   FLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQPNLCVW 65
           F+LI LVAFIP  I G+NVTMGKIVV+G  + A+TDEN+IC T+D WP +ECS    C+W
Sbjct: 6   FVLIFLVAFIP-IIYGKNVTMGKIVVDGTIRKAQTDENYICMTIDYWPFNECSTLP-CLW 65

Query: 66  DSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEADDSLL 125
           D +AS L ++LSLP + KAVQAFKTLRIRVGG+LQD+LIY++G  FKGNC  F  + S L
Sbjct: 66  DGNASALILNLSLPTLTKAVQAFKTLRIRVGGSLQDKLIYDVGS-FKGNCPQFARNSSAL 125

Query: 126 FDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAEALIKY 185
           F  ++GCL MERWDDLN FFN TGAIVTFGLNALLG++HT G+QWEG+WNYTNAEA I+Y
Sbjct: 126 FQISDGCLSMERWDDLNQFFNKTGAIVTFGLNALLGRHHTTGLQWEGDWNYTNAEAFIQY 185

Query: 186 TVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPLIVAPG 245
           T++KNY+INSWEFGNE+ G NSIGA+++++QY KDL+KLREI+DRLY NSQQK  I AP 
Sbjct: 186 TIEKNYRINSWEFGNEMVGHNSIGANVTSAQYEKDLIKLREIIDRLYNNSQQKASIAAPS 245

Query: 246 AFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTFKQLKN 305
           AFF   WY + V  TGP +V +LTHHIYNMGAGDDPK+I  FV+P YLS+ S  F+QLKN
Sbjct: 246 AFFYAPWYKDFVNGTGPGIVDILTHHIYNMGAGDDPKVINNFVDPNYLSKESKDFQQLKN 305

Query: 306 IVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQTLIGG 365
           IV+  APWS AWVGEAGG + GG+  IS++F++ FWY+DQL MAA YNTKVYCRQTL+GG
Sbjct: 306 IVENDAPWSVAWVGEAGGTFHGGSPYISNTFVDGFWYIDQLAMAALYNTKVYCRQTLVGG 365

Query: 366 FYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGISMLFIN 425
           FY +L   TL P+PDYYGALLFHRLMG GVLKV N VS+YLRTYAHCS+ RSG++MLFIN
Sbjct: 366 FYGILLPHTLAPSPDYYGALLFHRLMGSGVLKVDNNVSSYLRTYAHCSKERSGVTMLFIN 425

Query: 426 LSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSNVLLNG 485
           LSN TEF ++++++M             S+S+ +  + REEYHL P NGL+RSS VLLNG
Sbjct: 426 LSNETEFTVDIENNMM------------STSLADKASQREEYHLIPNNGLVRSSTVLLNG 485

Query: 486 KALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGC 531
             L+ T +G+LP+LTPIY+DSNSSI IATWSI FVVIP F A  C
Sbjct: 486 NLLETTEDGDLPDLTPIYRDSNSSITIATWSIVFVVIPHFEASAC 515

BLAST of CSPI04G05120 vs. TrEMBL
Match: V4LLM9_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10013186mg PE=4 SV=1)

HSP 1 Score: 591.7 bits (1524), Expect = 9.2e-166
Identity = 282/542 (52.03%), Postives = 393/542 (72.51%), Query Frame = 1

Query: 5   IFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQPNLCV 64
           +FL  LL+  +P T + Q+V    IV+EG ++I+ETDENF+C TLD WPHD+C+  N C 
Sbjct: 9   VFLSCLLL--VPETTMAQDVKHASIVIEGASRISETDENFVCATLDWWPHDKCNYDN-CP 68

Query: 65  WDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEADDSL 124
           W  ++SV+N+DLS P++ KAVQAFK LRIR+GG+LQD++IY++G   K  C PF   +S 
Sbjct: 69  W-GYSSVINMDLSRPLLAKAVQAFKPLRIRIGGSLQDQVIYDVGN-LKTPCRPFRKMNSG 128

Query: 125 LFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAEALIK 184
           LF F++GCL+M+RWD+LN+F   TGAIVTFGLNAL G++  +G  W G WN+ N +  I 
Sbjct: 129 LFGFSKGCLHMKRWDELNSFLTETGAIVTFGLNALHGRHKLRGNAWGGAWNHINTQDFIN 188

Query: 185 YTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQ-QKPLIVA 244
           YTV K Y I+SWEFGNEL+G N +GAS+SA  Y KDL+ LRE+++++YK+SQ  KP +VA
Sbjct: 189 YTVSKGYAIDSWEFGNELSG-NGVGASVSAELYGKDLIVLREVINKVYKDSQLTKPSLVA 248

Query: 245 PGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTFKQL 304
           PG F++ +WY +L+  +GP VV V+THHIYN+G+G+DP+L+ + +NP+YLS++S TFK +
Sbjct: 249 PGGFYEQQWYSKLLQISGPGVVDVVTHHIYNLGSGNDPELVKKILNPSYLSKISETFKNV 308

Query: 305 KNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQTLI 364
              +Q+H PW+S WVGE+GGAY  G   +SD+FI+SFWYLDQLGM++ +NTKVYCRQTL+
Sbjct: 309 NQTIQEHGPWASPWVGESGGAYNSGGRHVSDTFIDSFWYLDQLGMSSKHNTKVYCRQTLV 368

Query: 365 GGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGISMLF 424
           GGFY +L+  T VP PDYY ALL+HRLMG GVL V +    +LR YAHCS+GR+G++ML 
Sbjct: 369 GGFYGLLEKGTFVPNPDYYSALLWHRLMGKGVLAVRSDGPPHLRVYAHCSKGRAGVTMLL 428

Query: 425 INLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTP--------------REEYHL 484
           INLSN T F ++V + + + L+   K +   S ++ L  P              REEYHL
Sbjct: 429 INLSNQTCFTVSVSNGVNIVLNAESKQRK-KSLLDTLKKPFSWIGSKASDGYLNREEYHL 488

Query: 485 TPQNGLLRSSNVLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIG 532
           TP+NG  RS  + LNGK L+ T+ G++PNL P+ +  NS +N+   S++F+V+P+F A  
Sbjct: 489 TPENGDFRSKTMNLNGKPLKPTATGDIPNLEPVLRGVNSPVNVLPLSMSFIVLPNFDASA 543

BLAST of CSPI04G05120 vs. TrEMBL
Match: A0A078EDE0_BRANA (BnaC09g48030D protein OS=Brassica napus GN=BnaC09g48030D PE=4 SV=1)

HSP 1 Score: 583.9 bits (1504), Expect = 1.9e-163
Identity = 273/541 (50.46%), Postives = 391/541 (72.27%), Query Frame = 1

Query: 5   IFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQPNLCV 64
           IFLL+  +  +P   + +++    IV++G ++I ETDENF+C TLD WPHD+C+  N C 
Sbjct: 7   IFLLLGCLLQVPERTMARDMKRASIVIQGASRITETDENFVCATLDWWPHDKCNYDN-CP 66

Query: 65  WDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEADDSL 124
           W  ++SV+N+DLS P++ KA+QAFK LRIR+GG+LQD++IY++G   +  CHPF    S 
Sbjct: 67  W-GYSSVINMDLSRPLLTKAIQAFKPLRIRIGGSLQDQVIYDVGN-LQTPCHPFRKMSSG 126

Query: 125 LFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAEALIK 184
           LF F++GCL+M+RWD+L++F   +GAIVTFGLNAL G++  +G  W G WN+ N +  I 
Sbjct: 127 LFGFSKGCLHMKRWDELHSFLTKSGAIVTFGLNALHGRHKLRGNAWGGAWNHVNTQDFIN 186

Query: 185 YTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQ-QKPLIVA 244
           YTV   Y I+SWEFGNEL+G   +GAS+SA  Y KD++ LR+I+D++YK+S+  KP +VA
Sbjct: 187 YTVSNGYAIDSWEFGNELSG-TGVGASVSAELYGKDVIVLRDIIDKMYKDSKLTKPSLVA 246

Query: 245 PGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTFKQL 304
           PG F++ +WY +L+  +GP VV V+THHIYN+G+G+DP+L+ + ++P+YLS+V+ TFK +
Sbjct: 247 PGGFYEQQWYTKLLEISGPDVVDVVTHHIYNLGSGNDPQLVKKILDPSYLSRVAETFKNV 306

Query: 305 KNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQTLI 364
              +Q+H PW+S WVGE+GGAY  G  R+SD+FI+SFWYLDQLGM++ +NTKVYCRQTL+
Sbjct: 307 NKTIQEHGPWASPWVGESGGAYNSGGRRVSDTFIDSFWYLDQLGMSSKHNTKVYCRQTLV 366

Query: 365 GGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGISMLF 424
           GGFY +L+  T VP PDYY ALL+HRLMG GVL V       LR YAHCS+GR G+++L 
Sbjct: 367 GGFYGLLEKGTFVPNPDYYSALLWHRLMGKGVLAVQTDGPPQLRVYAHCSKGREGVTLLL 426

Query: 425 INLSNTTEFAINVKDHMTLSLHKRRKPKHGS---------SSINNLGTP----REEYHLT 484
           INLSN ++F ++V + + ++L+   KPK  S         S I N  +     REEYHLT
Sbjct: 427 INLSNQSDFTVSVSNGVNMALNVESKPKKKSLLDTLKKPFSWIGNKASDGYLNREEYHLT 486

Query: 485 PQNGLLRSSNVLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGC 532
           P+NG LRS  ++LNGK L+ T  G++PNL P+ +  NS + +++ S++F+V+P F A  C
Sbjct: 487 PENGELRSKTMVLNGKPLKPTETGDIPNLEPVIRGVNSPVCVSSLSMSFIVLPSFDASAC 543

BLAST of CSPI04G05120 vs. TAIR10
Match: AT5G07830.1 (AT5G07830.1 glucuronidase 2)

HSP 1 Score: 577.0 bits (1486), Expect = 1.2e-164
Identity = 268/542 (49.45%), Postives = 391/542 (72.14%), Query Frame = 1

Query: 5   IFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQPNLCV 64
           +FL  LL+  +P   + Q +    IV++G  ++ ETDENF+C TLD WPHD+C+    C 
Sbjct: 10  VFLGCLLL--VPEKTMAQEMKRASIVIQGARRVCETDENFVCATLDWWPHDKCNYDQ-CP 69

Query: 65  WDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEADDSL 124
           W  ++SV+N+DL+ P++ KA++AFK LRIR+GG+LQD++IY++G   K  C PF+  +S 
Sbjct: 70  W-GYSSVINMDLTRPLLTKAIKAFKPLRIRIGGSLQDQVIYDVGN-LKTPCRPFQKMNSG 129

Query: 125 LFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAEALIK 184
           LF F++GCL+M+RWD+LN+F   TGA+VTFGLNAL G++  +G  W G W++ N +  + 
Sbjct: 130 LFGFSKGCLHMKRWDELNSFLTATGAVVTFGLNALRGRHKLRGKAWGGAWDHINTQDFLN 189

Query: 185 YTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQ-QKPLIVA 244
           YTV K Y I+SWEFGNEL+G + +GAS+SA  Y KDL+ L+++++++YKNS   KP++VA
Sbjct: 190 YTVSKGYVIDSWEFGNELSG-SGVGASVSAELYGKDLIVLKDVINKVYKNSWLHKPILVA 249

Query: 245 PGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTFKQL 304
           PG F++ +WY +L+  +GP VV V+THHIYN+G+G+DP L+ + ++P+YLSQVS TFK +
Sbjct: 250 PGGFYEQQWYTKLLEISGPSVVDVVTHHIYNLGSGNDPALVKKIMDPSYLSQVSKTFKDV 309

Query: 305 KNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQTLI 364
              +Q+H PW+S WVGE+GGAY  G   +SD+FI+SFWYLDQLGM+A +NTKVYCRQTL+
Sbjct: 310 NQTIQEHGPWASPWVGESGGAYNSGGRHVSDTFIDSFWYLDQLGMSARHNTKVYCRQTLV 369

Query: 365 GGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGISMLF 424
           GGFY +L+  T VP PDYY ALL+HRLMG GVL V       LR YAHCS+GR+G+++L 
Sbjct: 370 GGFYGLLEKGTFVPNPDYYSALLWHRLMGKGVLAVQTDGPPQLRVYAHCSKGRAGVTLLL 429

Query: 425 INLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTP--------------REEYHL 484
           INLSN ++F ++V + + + L+   + K   S ++ L  P              REEYHL
Sbjct: 430 INLSNQSDFTVSVSNGINVVLNAESRKK--KSLLDTLKRPFSWIGSKASDGYLNREEYHL 489

Query: 485 TPQNGLLRSSNVLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIG 532
           TP+NG+LRS  ++LNGK+L+ T+ G++P+L P+ +  NS +N+   S++F+V+P+F A  
Sbjct: 490 TPENGVLRSKTMVLNGKSLKPTATGDIPSLEPVLRSVNSPLNVLPLSMSFIVLPNFDASA 543

BLAST of CSPI04G05120 vs. TAIR10
Match: AT5G61250.2 (AT5G61250.2 glucuronidase 1)

HSP 1 Score: 554.3 bits (1427), Expect = 8.2e-158
Identity = 266/543 (48.99%), Postives = 377/543 (69.43%), Query Frame = 1

Query: 1   MEYQIFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQP 60
           M + + + +  +  +P    G N+    +V++G  +IAETDENFIC TLD WP ++C+  
Sbjct: 1   MGFNVVVFLSCLLLLPPVTFGSNMERTTLVIDGSRRIAETDENFICATLDWWPPEKCNYD 60

Query: 61  NLCVWDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEA 120
             C W  +AS++N++L+ P++ KA+QAF+TLRIR+GG+LQD++IY++G+  K  C  F+ 
Sbjct: 61  Q-CPW-GYASLINLNLASPLLAKAIQAFRTLRIRIGGSLQDQVIYDVGD-LKTPCTQFKK 120

Query: 121 DDSLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAE 180
            D  LF F+EGCLYM+RWD++N+FFN TGAIVTFGLNAL G+    G  W G+W++TN +
Sbjct: 121 TDDGLFGFSEGCLYMKRWDEVNHFFNATGAIVTFGLNALHGRNKLNGTAWGGDWDHTNTQ 180

Query: 181 ALIKYTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPL 240
             + YTV K Y I+SWEFGNEL+G + I AS+S   Y KDL+ L+ ++  +YKNS+ KPL
Sbjct: 181 DFMNYTVSKGYAIDSWEFGNELSG-SGIWASVSVELYGKDLIVLKNVIKNVYKNSRTKPL 240

Query: 241 IVAPGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 300
           +VAPG FF+++WY EL+  +GP V+ VLTHHIYN+G G+DPKL+ + ++P YLS +S  F
Sbjct: 241 VVAPGGFFEEQWYSELLRLSGPGVLDVLTHHIYNLGPGNDPKLVNKILDPNYLSGISELF 300

Query: 301 KQLKNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQ 360
             +   +Q+H PW++AWVGEAGGA+  G  ++S++FINSFWYLDQLG+++ +NTKVYCRQ
Sbjct: 301 ANVNQTIQEHGPWAAAWVGEAGGAFNSGGRQVSETFINSFWYLDQLGISSKHNTKVYCRQ 360

Query: 361 TLIGGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGIS 420
            L+GGFY +L+ +T VP PDYY ALL+HRLMG G+L V    S YLR Y HCS+ R+GI+
Sbjct: 361 ALVGGFYGLLEKETFVPNPDYYSALLWHRLMGKGILGVQTTASEYLRAYVHCSKRRAGIT 420

Query: 421 MLFINLSNTTEFAINVKDHMTLSLH---KRRKP-----KHGSSSINNLGTP----REEYH 480
           +L INLS  T F + V + + + L     +RK      K   S + N  +     REEYH
Sbjct: 421 ILLINLSKHTTFTVAVSNGVKVVLQAESMKRKSFLETIKSKVSWVGNKASDGYLNREEYH 480

Query: 481 LTPQNGLLRSSNVLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAI 532
           L+P++G LRS  +LLNGK L  T+ G++P L P+     S + I   SI+F+V+P F A 
Sbjct: 481 LSPKDGDLRSKIMLLNGKPLVPTATGDIPKLEPVRHGVKSPVYINPLSISFIVLPTFDAP 539

BLAST of CSPI04G05120 vs. TAIR10
Match: AT5G34940.2 (AT5G34940.2 glucuronidase 3)

HSP 1 Score: 454.1 bits (1167), Expect = 1.2e-127
Identity = 235/532 (44.17%), Postives = 329/532 (61.84%), Query Frame = 1

Query: 5   IFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQPNLCV 64
           I L + +  F+  T+       G + V G   +   DE+FIC TLD WP ++C   + C 
Sbjct: 9   IVLFLCVFQFLDCTVSSAVEENGTVFVYGRAAVGTIDEDFICATLDWWPPEKCDYGS-CS 68

Query: 65  WDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEADDSL 124
           WD HAS+LN+DL+  I+  A++AF  L+IR+GGTLQD +IY   +  K  C PF  + S+
Sbjct: 69  WD-HASILNLDLNNVILQNAIKAFAPLKIRIGGTLQDIVIYETPDS-KQPCLPFTKNSSI 128

Query: 125 LFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAEALIK 184
           LF +T+GCL M RWD+LN FF  TG  V FGLNAL G+      +  G WNYTNAE+ I+
Sbjct: 129 LFGYTQGCLPMRRWDELNAFFRKTGTKVIFGLNALSGRSIKSNGEAIGAWNYTNAESFIR 188

Query: 185 YTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPLIVAP 244
           +T + NY I+ WE GNEL G + +GA + A+QYA D + LR IV+R+YKN    PL++ P
Sbjct: 189 FTAENNYTIDGWELGNELCG-SGVGARVGANQYAIDTINLRNIVNRVYKNVSPMPLVIGP 248

Query: 245 GAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTFKQLK 304
           G FF+  W+ E + K     ++  T HIY++G G D  LI + +NP+YL Q + +F+ LK
Sbjct: 249 GGFFEVDWFTEYLNKA-ENSLNATTRHIYDLGPGVDEHLIEKILNPSYLDQEAKSFRSLK 308

Query: 305 NIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQTLIG 364
           NI++  +  + AWVGE+GGAY  G   +S++F+ SFWYLDQLGMA+ Y+TK YCRQ+LIG
Sbjct: 309 NIIKNSSTKAVAWVGESGGAYNSGRNLVSNAFVYSFWYLDQLGMASLYDTKTYCRQSLIG 368

Query: 365 GFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGISMLFI 424
           G Y +L      P PDYY AL++ +LMG   L      +  +R+Y HC+R   GI++L +
Sbjct: 369 GNYGLLNTTNFTPNPDYYSALIWRQLMGRKALFTTFSGTKKIRSYTHCARQSKGITVLLM 428

Query: 425 NLSNTTEFAINVKDHMTLSL-HKRRKPKHGSSSINNLGTP-----REEYHLTPQNGLLRS 484
           NL NTT     V+ + + SL H +    +  +S    G P     REEYHLT ++G L S
Sbjct: 429 NLDNTTTVVAKVELNNSFSLRHTKHMKSYKRASSQLFGGPNGVIQREEYHLTAKDGNLHS 488

Query: 485 SNVLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGC 531
             +LLNG ALQ+ S G+LP + PI+ +S   I IA +SI FV + + V   C
Sbjct: 489 QTMLLNGNALQVNSMGDLPPIEPIHINSTEPITIAPYSIVFVHMRNVVVPAC 535

BLAST of CSPI04G05120 vs. NCBI nr
Match: gi|700198112|gb|KGN53270.1| (hypothetical protein Csa_4G043860 [Cucumis sativus])

HSP 1 Score: 1091.3 bits (2821), Expect = 0.0e+00
Identity = 530/531 (99.81%), Postives = 530/531 (99.81%), Query Frame = 1

Query: 1   MEYQIFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQP 60
           MEYQIFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQP
Sbjct: 1   MEYQIFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQP 60

Query: 61  NLCVWDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEA 120
           NLCVWDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEA
Sbjct: 61  NLCVWDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEA 120

Query: 121 DDSLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAE 180
           DDSLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAE
Sbjct: 121 DDSLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAE 180

Query: 181 ALIKYTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPL 240
           ALIKYTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPL
Sbjct: 181 ALIKYTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPL 240

Query: 241 IVAPGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 300
           IVAPGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF
Sbjct: 241 IVAPGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 300

Query: 301 KQLKNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQ 360
           KQLKNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQ
Sbjct: 301 KQLKNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQ 360

Query: 361 TLIGGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGIS 420
           TLIGGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSR RSGIS
Sbjct: 361 TLIGGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRERSGIS 420

Query: 421 MLFINLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSN 480
           MLFINLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSN
Sbjct: 421 MLFINLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSN 480

Query: 481 VLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGCN 532
           VLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGCN
Sbjct: 481 VLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGCN 531

BLAST of CSPI04G05120 vs. NCBI nr
Match: gi|449445228|ref|XP_004140375.1| (PREDICTED: heparanase-like protein 1 [Cucumis sativus])

HSP 1 Score: 953.7 bits (2464), Expect = 1.3e-274
Identity = 453/531 (85.31%), Postives = 492/531 (92.66%), Query Frame = 1

Query: 1   MEYQIFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQP 60
           MEYQIFLLIL+ AFIPRTILG NVT GKIVV+G TKIAETDENFICFTLDIWPHDECSQP
Sbjct: 1   MEYQIFLLILVFAFIPRTILGLNVTTGKIVVDGTTKIAETDENFICFTLDIWPHDECSQP 60

Query: 61  NLCVWDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEA 120
           NLCVWD HAS+LN+DLSLPI+NKAVQAFKTLRIRVGGTLQDRLIYNIG+GFKGNC+PFEA
Sbjct: 61  NLCVWDGHASMLNMDLSLPILNKAVQAFKTLRIRVGGTLQDRLIYNIGDGFKGNCNPFEA 120

Query: 121 DDSLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAE 180
              LLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKY+T+G+QWEGNWNY+NAE
Sbjct: 121 HKGLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYSNAE 180

Query: 181 ALIKYTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPL 240
           ALIKYTV+K Y INSWEFGNELAG NSIGAS+SASQYAKDLLKLR+I+DRLYKNSQQKPL
Sbjct: 181 ALIKYTVEKKYNINSWEFGNELAGPNSIGASVSASQYAKDLLKLRQIIDRLYKNSQQKPL 240

Query: 241 IVAPGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 300
           IVAPGAFFDDKWY ELVTKTG  VVS LTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF
Sbjct: 241 IVAPGAFFDDKWYDELVTKTGSNVVSALTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 300

Query: 301 KQLKNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQ 360
           +QLKNI++KHAPW+SAWVGEAGGAY GG   ISD+FINSFWYLDQLGMAA YNTKVYCRQ
Sbjct: 301 RQLKNIIEKHAPWASAWVGEAGGAYHGGGLHISDTFINSFWYLDQLGMAASYNTKVYCRQ 360

Query: 361 TLIGGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGIS 420
           TL+GG+Y VL+ KT +PTPDYYGALLFHRLMG  VLKV N VS+YLRTYAHCSRGRSG++
Sbjct: 361 TLVGGYYGVLRTKTFIPTPDYYGALLFHRLMGSSVLKVDNNVSSYLRTYAHCSRGRSGVT 420

Query: 421 MLFINLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSN 480
           MLFINLSNTTEF IN+++HM LSLHK  KPKH SS   N+GT REEYHLTPQNGLLRSS 
Sbjct: 421 MLFINLSNTTEFTINIENHMNLSLHKS-KPKHSSSK--NVGTQREEYHLTPQNGLLRSST 480

Query: 481 VLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGCN 532
           VLLNGKAL+LT+EGE+P+LTP+Y+DSNSSI+I  WSIAF+VIPDFVAIGCN
Sbjct: 481 VLLNGKALELTNEGEVPDLTPVYRDSNSSISIPNWSIAFIVIPDFVAIGCN 528

BLAST of CSPI04G05120 vs. NCBI nr
Match: gi|700195824|gb|KGN51001.1| (hypothetical protein Csa_5G390000 [Cucumis sativus])

HSP 1 Score: 953.7 bits (2464), Expect = 1.3e-274
Identity = 453/531 (85.31%), Postives = 492/531 (92.66%), Query Frame = 1

Query: 1   MEYQIFLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQP 60
           MEYQIFLLIL+ AFIPRTILG NVT GKIVV+G TKIAETDENFICFTLDIWPHDECSQP
Sbjct: 12  MEYQIFLLILVFAFIPRTILGLNVTTGKIVVDGTTKIAETDENFICFTLDIWPHDECSQP 71

Query: 61  NLCVWDSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEA 120
           NLCVWD HAS+LN+DLSLPI+NKAVQAFKTLRIRVGGTLQDRLIYNIG+GFKGNC+PFEA
Sbjct: 72  NLCVWDGHASMLNMDLSLPILNKAVQAFKTLRIRVGGTLQDRLIYNIGDGFKGNCNPFEA 131

Query: 121 DDSLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAE 180
              LLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKY+T+G+QWEGNWNY+NAE
Sbjct: 132 HKGLLFDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYNTKGIQWEGNWNYSNAE 191

Query: 181 ALIKYTVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPL 240
           ALIKYTV+K Y INSWEFGNELAG NSIGAS+SASQYAKDLLKLR+I+DRLYKNSQQKPL
Sbjct: 192 ALIKYTVEKKYNINSWEFGNELAGPNSIGASVSASQYAKDLLKLRQIIDRLYKNSQQKPL 251

Query: 241 IVAPGAFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 300
           IVAPGAFFDDKWY ELVTKTG  VVS LTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF
Sbjct: 252 IVAPGAFFDDKWYDELVTKTGSNVVSALTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTF 311

Query: 301 KQLKNIVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQ 360
           +QLKNI++KHAPW+SAWVGEAGGAY GG   ISD+FINSFWYLDQLGMAA YNTKVYCRQ
Sbjct: 312 RQLKNIIEKHAPWASAWVGEAGGAYHGGGLHISDTFINSFWYLDQLGMAASYNTKVYCRQ 371

Query: 361 TLIGGFYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGIS 420
           TL+GG+Y VL+ KT +PTPDYYGALLFHRLMG  VLKV N VS+YLRTYAHCSRGRSG++
Sbjct: 372 TLVGGYYGVLRTKTFIPTPDYYGALLFHRLMGSSVLKVDNNVSSYLRTYAHCSRGRSGVT 431

Query: 421 MLFINLSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSN 480
           MLFINLSNTTEF IN+++HM LSLHK  KPKH SS   N+GT REEYHLTPQNGLLRSS 
Sbjct: 432 MLFINLSNTTEFTINIENHMNLSLHKS-KPKHSSSK--NVGTQREEYHLTPQNGLLRSST 491

Query: 481 VLLNGKALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGCN 532
           VLLNGKAL+LT+EGE+P+LTP+Y+DSNSSI+I  WSIAF+VIPDFVAIGCN
Sbjct: 492 VLLNGKALELTNEGEVPDLTPVYRDSNSSISIPNWSIAFIVIPDFVAIGCN 539

BLAST of CSPI04G05120 vs. NCBI nr
Match: gi|659076900|ref|XP_008438923.1| (PREDICTED: heparanase-like protein 2 [Cucumis melo])

HSP 1 Score: 724.9 bits (1870), Expect = 1.0e-205
Identity = 346/525 (65.90%), Postives = 420/525 (80.00%), Query Frame = 1

Query: 6   FLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQPNLCVW 65
           F+LI LVAFIP  I G+NVTMG IVV+G T+I ETDEN+IC T+D WP +ECS    C+W
Sbjct: 6   FVLIFLVAFIPM-IYGKNVTMGNIVVDGTTRITETDENYICMTIDYWPFNECSTIP-CLW 65

Query: 66  DSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEADDSLL 125
           D +AS LN++LSLP + KAVQAFKTLRIRVGG+LQD+LIY++G  FKGNC  F  + + +
Sbjct: 66  DGNASALNLNLSLPTLTKAVQAFKTLRIRVGGSLQDKLIYDVGS-FKGNCPQFVRNSTAM 125

Query: 126 FDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAEALIKY 185
           F  +EGCL MERWDDLN FFN TGAIVTFGLNALLG+ HT G++WEG WNYTNAEA I+Y
Sbjct: 126 FHISEGCLSMERWDDLNQFFNKTGAIVTFGLNALLGRQHTSGLRWEGEWNYTNAEAFIQY 185

Query: 186 TVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPLIVAPG 245
           T++KNY+INSWEFGNE+ G NSIG +I+++QYAKDL+KLREI+DRLY NSQQKPLI AP 
Sbjct: 186 TIEKNYRINSWEFGNEMVGHNSIGVNITSAQYAKDLIKLREIIDRLYNNSQQKPLIAAPS 245

Query: 246 AFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTFKQLKN 305
           AFFD  WY + V  TGP +V +LTHHIYNMGAG DPK+I  F++P YLS+ S  F+QLKN
Sbjct: 246 AFFDASWYKDFVYGTGPGIVDILTHHIYNMGAGYDPKVIDNFLDPNYLSKESRDFQQLKN 305

Query: 306 IVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQTLIGG 365
           IV+ +APWS AWVGEAGG + GG+  IS++F++ FWY+DQL MAA YNTKVYCRQTL+GG
Sbjct: 306 IVENNAPWSVAWVGEAGGTFHGGSPYISNTFVDGFWYIDQLAMAALYNTKVYCRQTLVGG 365

Query: 366 FYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGISMLFIN 425
           FY +L    L P+PDYYGALLFHRLMG GVLKV N VS+YLRTYAHC++ RSG++MLFIN
Sbjct: 366 FYGILLPYILAPSPDYYGALLFHRLMGSGVLKVDNNVSSYLRTYAHCTKERSGVTMLFIN 425

Query: 426 LSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSNVLLNG 485
           LSN TEF I++K++ T+++    KP           + REEYHLTP NGL+RSS VLLNG
Sbjct: 426 LSNQTEFTIDIKNN-TMNMGLPNKP-----------SQREEYHLTPNNGLVRSSTVLLNG 485

Query: 486 KALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGC 531
             L+ T +G+LP+LTPIY+DSNSSI +ATWSI FVVIPDF A  C
Sbjct: 486 NLLKTTEDGDLPDLTPIYRDSNSSITVATWSIVFVVIPDFEAPAC 515

BLAST of CSPI04G05120 vs. NCBI nr
Match: gi|778679004|ref|XP_011651069.1| (PREDICTED: heparanase-like protein 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 719.2 bits (1855), Expect = 5.5e-204
Identity = 343/525 (65.33%), Postives = 417/525 (79.43%), Query Frame = 1

Query: 6   FLLILLVAFIPRTILGQNVTMGKIVVEGITKIAETDENFICFTLDIWPHDECSQPNLCVW 65
           F+LI LVAFIP  I G+NVTMGKIVV+G  + A+TDEN+IC T+D WP +ECS    C+W
Sbjct: 44  FVLIFLVAFIP-IIYGKNVTMGKIVVDGTIRKAQTDENYICMTIDYWPFNECSTLP-CLW 103

Query: 66  DSHASVLNVDLSLPIINKAVQAFKTLRIRVGGTLQDRLIYNIGEGFKGNCHPFEADDSLL 125
           D +AS L ++LSLP + KAVQAFKTLRIRVGG+LQD+LIY++G  FKGNC  F  + S L
Sbjct: 104 DGNASALILNLSLPTLTKAVQAFKTLRIRVGGSLQDKLIYDVGS-FKGNCPQFARNSSAL 163

Query: 126 FDFTEGCLYMERWDDLNNFFNNTGAIVTFGLNALLGKYHTQGMQWEGNWNYTNAEALIKY 185
           F  ++GCL MERWDDLN FFN TGAIVTFGLNALLG++HT G+QWEG+WNYTNAEA I+Y
Sbjct: 164 FQISDGCLSMERWDDLNQFFNKTGAIVTFGLNALLGRHHTTGLQWEGDWNYTNAEAFIQY 223

Query: 186 TVDKNYQINSWEFGNELAGRNSIGASISASQYAKDLLKLREIVDRLYKNSQQKPLIVAPG 245
           T++KNY+INSWEFGNE+ G NSIGA+++++QY KDL+KLREI+DRLY NSQQK  I AP 
Sbjct: 224 TIEKNYRINSWEFGNEMVGHNSIGANVTSAQYEKDLIKLREIIDRLYNNSQQKASIAAPS 283

Query: 246 AFFDDKWYHELVTKTGPKVVSVLTHHIYNMGAGDDPKLIYRFVNPTYLSQVSNTFKQLKN 305
           AFF   WY + V  TGP +V +LTHHIYNMGAGDDPK+I  FV+P YLS+ S  F+QLKN
Sbjct: 284 AFFYAPWYKDFVNGTGPGIVDILTHHIYNMGAGDDPKVINNFVDPNYLSKESKDFQQLKN 343

Query: 306 IVQKHAPWSSAWVGEAGGAYQGGAYRISDSFINSFWYLDQLGMAAFYNTKVYCRQTLIGG 365
           IV+  APWS AWVGEAGG + GG+  IS++F++ FWY+DQL MAA YNTKVYCRQTL+GG
Sbjct: 344 IVENDAPWSVAWVGEAGGTFHGGSPYISNTFVDGFWYIDQLAMAALYNTKVYCRQTLVGG 403

Query: 366 FYSVLKAKTLVPTPDYYGALLFHRLMGPGVLKVHNKVSTYLRTYAHCSRGRSGISMLFIN 425
           FY +L   TL P+PDYYGALLFHRLMG GVLKV N VS+YLRTYAHCS+ RSG++MLFIN
Sbjct: 404 FYGILLPHTLAPSPDYYGALLFHRLMGSGVLKVDNNVSSYLRTYAHCSKERSGVTMLFIN 463

Query: 426 LSNTTEFAINVKDHMTLSLHKRRKPKHGSSSINNLGTPREEYHLTPQNGLLRSSNVLLNG 485
           LSN TEF ++++++M             S+S+ +  + REEYHL P NGL+RSS VLLNG
Sbjct: 464 LSNETEFTVDIENNMM------------STSLADKASQREEYHLIPNNGLVRSSTVLLNG 523

Query: 486 KALQLTSEGELPNLTPIYKDSNSSINIATWSIAFVVIPDFVAIGC 531
             L+ T +G+LP+LTPIY+DSNSSI IATWSI FVVIP F A  C
Sbjct: 524 NLLETTEDGDLPDLTPIYRDSNSSITIATWSIVFVVIPHFEASAC 553

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
HPSE1_ARATH2.1e-16349.45Heparanase-like protein 1 OS=Arabidopsis thaliana GN=At5g07830 PE=2 SV=1[more]
HPSE2_ARATH1.5e-15648.99Heparanase-like protein 2 OS=Arabidopsis thaliana GN=At5g61250 PE=2 SV=1[more]
HPSE3_ARATH2.1e-12644.17Heparanase-like protein 3 OS=Arabidopsis thaliana GN=At5g34940 PE=2 SV=2[more]
BAGLU_SCUBA7.8e-10238.73Baicalin-beta-D-glucuronidase OS=Scutellaria baicalensis GN=SGUS PE=1 SV=1[more]
HPSE_CHICK2.0e-3629.70Heparanase OS=Gallus gallus GN=HPSE PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KUF1_CUCSA0.0e+0099.81Uncharacterized protein OS=Cucumis sativus GN=Csa_4G043860 PE=4 SV=1[more]
A0A0A0KTJ9_CUCSA9.2e-27585.31Uncharacterized protein OS=Cucumis sativus GN=Csa_5G390000 PE=4 SV=1[more]
A0A0A0L5V7_CUCSA3.8e-20465.33Uncharacterized protein OS=Cucumis sativus GN=Csa_3G165650 PE=4 SV=1[more]
V4LLM9_EUTSA9.2e-16652.03Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10013186mg PE=4 SV=1[more]
A0A078EDE0_BRANA1.9e-16350.46BnaC09g48030D protein OS=Brassica napus GN=BnaC09g48030D PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G07830.11.2e-16449.45 glucuronidase 2[more]
AT5G61250.28.2e-15848.99 glucuronidase 1[more]
AT5G34940.21.2e-12744.17 glucuronidase 3[more]
Match NameE-valueIdentityDescription
gi|700198112|gb|KGN53270.1|0.0e+0099.81hypothetical protein Csa_4G043860 [Cucumis sativus][more]
gi|449445228|ref|XP_004140375.1|1.3e-27485.31PREDICTED: heparanase-like protein 1 [Cucumis sativus][more]
gi|700195824|gb|KGN51001.1|1.3e-27485.31hypothetical protein Csa_5G390000 [Cucumis sativus][more]
gi|659076900|ref|XP_008438923.1|1.0e-20565.90PREDICTED: heparanase-like protein 2 [Cucumis melo][more]
gi|778679004|ref|XP_011651069.1|5.5e-20465.33PREDICTED: heparanase-like protein 2 isoform X1 [Cucumis sativus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR005199Glyco_hydro_79
IPR017853Glycoside_hydrolase_SF
Vocabulary: Molecular Function
TermDefinition
GO:0016798hydrolase activity, acting on glycosyl bonds
Vocabulary: Cellular Component
TermDefinition
GO:0016020membrane
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0016020 membrane
molecular_function GO:0016798 hydrolase activity, acting on glycosyl bonds

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G05120.1CSPI04G05120.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005199Glycoside hydrolase, family 79PANTHERPTHR14363HEPARANASE-RELATEDcoord: 5..531
score: 4.7E
IPR005199Glycoside hydrolase, family 79PFAMPF03662Glyco_hydro_79ncoord: 28..347
score: 8.7E
IPR017853Glycoside hydrolase superfamilyunknownSSF51445(Trans)glycosidasescoord: 41..391
score: 1.46
NoneNo IPR availablePANTHERPTHR14363:SF17HEPARANASE-LIKE PROTEIN 1-RELATEDcoord: 5..531
score: 4.7E