Cucsa.106100 (gene) Cucumber (Gy14) v1

Name: Cucsa.106100
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Aspartic proteinase nepenthesin-1, putative
Location: scaffold00929 : 513414 .. 516242 (-)
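
The location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the original record: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into its four parts.

    Hypothetical helper; the pattern is modelled only on the single
    location string shown in this record.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("scaffold00929 : 513414 .. 516242 (-)")
print(seqid, strand, span_length(start, end))
```

The `(-)` suffix indicates that the feature lies on the reverse strand of the scaffold.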
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGATTTCGAAATCTCTCCATTTCCCCCTTTCTCTTCTTCTTCTTCTTCTTCTTCCTCTTCTCTCCATTGGCGTCGATGCTCGTTCGAGCTCGTTCAACCTCGGTAATGGTGATAACCATGAGAAGGGTCTTCTTCAGCTTTTTCAGAATTTTCCATGGAAGGAGCATGGAGAAGCTGTTGTTAATTGCATCTTTCACAAGCCAAGTGAGTGTTTCTCTTTTTTTACACTAATCTTCATTGTTGTTGTTGTTAATTATCTTTGGGATCTGTTTTTTCTTTTAATGAAAAATATGACCTTTTTGCTTTTGCTTTTTTGTAAGGATATAGGATTTGCCATTTTTTTTCTTGTTTTAAAATGTGAACTCTCATCATTAATTGCTTTTTTCTTTTCTTTATAAAGTTAATTAAATTTGGGTTTGTGTGATTTTCTAGTAAAAGCTAATCTTTTTTTGGGCTCTTGGGTTATGTTTAAATCTCTTTTTAAAGGGTTTTTTTTTTCTTTTAAAAGAAAAAACATATCAAACTATTAACTCACATTATTACATTTTGAATTTCTTAAACAGAAATTTGTTTTTCCTCTTTTAGGTTTTTTAGCAAAAAATACTTTGTATTTTTTTTCTTTCTGTTGGGCTATTTCTCTCTTCTGTTTTCTTTTTTTCCTTTTTTTTTGGGAAATTACATTCACCAATTAATAATGGATTTTTCTTTCCCTTATTGTGCAAATTTCAATCAAGTTATTAATGTACCTTTTTTTTTCTTAAACTATTTTATTAGAATCATAAGACTTGAAATGATAAGTAAAAAAACATTATTATAATTTTTCACCAAAGTATTCATTCTAGATTCTATTTCTAAAGAATACAAATGAAATATATACACACACCCACATTATCATAGTAGTTTGCTAAATGAGGGTTTGAAGATTTAACTTCTTCTTTTTTTGTGTTAAATTAAGTAATACTTCAAAATTTAATAAATCAAATACCGACATGCTCCATATTGAAACTATATGAAAACTATCTCTAATGGTAAGATTTCCAAGTTACCCTTGATTAAAGATTTGGGTGAGTAGGTAACATCGTTTTACATTAAAGGTTATATTGGTAATTAAGAAATTTGAGAGGGATGTCAAGATATATATGTATGTATGTATATGTATTATGATTTTGTTTAAAAAATTTAAATATTTTAATTATTGTCAGATATGTAATTTGTTGTTTTGTTAAGGAGATAACTATAAATTCTATTATATTTTTTATACTGTTTATCTTGTGAAGCCCACGGGAAAAAAGAAAAAAAAATGGTGTTTTACAATACAAGTAGTATTTTCACCAATATTCACGTGGAGGGAAATCCACAATTGAATTAGCTTCAATATAATTAATTTTGTGTCTCAATTAATTATGGAATTTGAATTTCAGAGATAACGAAGGGAATAACGACACTGGAAATGAAACAGAGAGATTACTGTTCAGGCAAAATAACAGACTGGGAAAAGATTTTTCAAAACCGTATCATCCTCGACGCTATTAACGTCAATTCCCTTTTTTCACATTTCAAATCAGCCATTTTTCCCGGCCAGACTCACCAACTCTCCGATTCCCAAATCCCCATTTCCTCCGGCGCCAGGCTCCAAACACTCAATTACATCGTCACCGTCGGCATCGGCGGCCAGAATTCCACTCTCATCGTTGACACCGGCAGCGATCTCACTTGGGTTCAATGCCTCCCTTGCCGCCTCTGTTACAACCAACAAGAACCCCTCTTCAATCCCTCTAATTCCTCTTCCTTCCTCTCCCTTCCTTGTAATTCCCCTACCTGTGTGGCTCTTCAACCCACCGCCGGAAGCTCCGGCCTCTGTTCTAACAAAAACTCAACTTCCTGCGACTACCAGATCGACTACGGCGATGGATCTTACTCCCGTGGGGAACTCGGATTCGAGAAGCTGACTTTAGGTAAAACTGAGATTGATAATTTCATATTCGGATGTGGCCGG
AATAACAAAGGTTTATTTGGAGGAGCTTCGGGATTAATGGGTTTAGCTAGAAGTGAATTATCTCTGGTTTCTCAAACGTCCTCTCTTTTTGGTAGTGTTTTTTCTTACTGTTTACCAACAACTGGAGTTGGATCTTCAGGTTCCTTAACATTGGGGGGAGCCGATTTCTCAAATTTCAAGAACATTTCACCAATTTCCTATACAAGAATGATTCAAAACCCACAAATGTCGAATTTCTACTTTCTGAATCTGACTGGAATTTCAATCGGTGGGGTTAATTTGAATGTGCCTCGTTTATCTTCAAACGAAGGGGTTTTGAGTTTACTCGATTCGGGAACAGTGATAACAAGGCTATCTCCATCGATTTACAAAGCTTTCAAAGCAGAATTTGAGAAACAATTTTCTGGGTATCGAACTACACCAGGATTTTCGATTCTTAATACTTGTTTTAATCTAACAGGGTACGAAGAAGTGAATATTCCTACTGTGAAATTTATCTTTGAAGGCAATGCAGAGATGATTGTTGATGTTGAAGGGGTTTTTTATTTTGTGAAATCTGATGCTTCACAGATTTGTTTAGCGTTTGCGAGTTTGGGTTATGAAGATCAGACGATGATAATTGGGAATTATCAGCAGAAGAATCAAAGGGTTATTTATAATTCTAAAGAATCTAAAGTGGGTTTTGCAGGGGAGCCTTGCAGTTTTTAAGAATTTTTCCCGGAAAATGGGTGATTTTCCCGGAAAATTTGATTGGATCTTGAGAAATATTGAGGCTGATTATTTTAATGTAATGGGTTGGGTTGGATTTGACATTGGGTTAAAACGAAAT

mRNA sequence

ATGGAGATTTCGAAATCTCTCCATTTCCCCctttctcttcttcttcttcttcttcttcctcttctCTCCATTGGCGTCGATGCTCGTTCGAGCTCGTTCAACCTCGGTAATGGTGATAACCATGAGAAGGGTCTTCTTCAGCTTTTTCAGAATTTTCCATGGAAGGAGCATGGAGAAGCTGTTGTTAATTGCATCTTTCACAAGCCAAAGATAACGAAGGGAATAACGACACTGGAAATGAAACAGAGAGATTACTGTTCAGGCAAAATAACAGACTGGGAAAAGATTTTTCAAAACCGTATCATCCTCGACGCTATTAACGTCAATTCCCTTTTTTCACATTTCAAATCAGCCATTTTTCCCGGCCAGACTCACCAACTCTCCGATTCCCAAATCCCCATTTCCTCCGGCGCCAGGCTCCAAACACTCAATTACATCGTCACCGTCGGCATCGGCGGCCAGAATTCCACTCTCATCGTTGACACCGGCAGCGATCTCACTTGGGTTCAATGCCTCCCTTGCCGCCTCTGTTACAACCAACAAGAACCCCTCTTCAATCCCTCTAATTCCTCTTCCTTCCTCTCCCTTCCTTGTAATTCCCCTACCTGTGTGGCTCTTCAACCCACCGCCGGAAGCTCCGGCCTCTGTTCTAACAAAAACTCAACTTCCTGCGACTACCAGATCGACTACGGCGATGGATCTTACTCCCGTGGGGAACTCGGATTCGAGAAGCTGACTTTAGGTAAAACTGAGATTGATAATTTCATATTCGGATGTGGCCGGAATAACAAAGGTTTATTTGGAGGAGCTTCGGGATTAATGGGTTTAGCTAGAAGTGAATTATCTCTGGTTTCTCAAACGTCCTCTCTTTTTGGTAGTGTTTTTTCTTACTGTTTACCAACAACTGGAGTTGGATCTTCAGGTTCCTTAACATTGGGGGGAGCCGATTTCTCAAATTTCAAGAACATTTCACCAATTTCCTATACAAGAATGATTCAAAACCCACAAATGTCGAATTTCTACTTTCTGAATCTGACTGGAATTTCAATCGGTGGGGTTAATTTGAATGTGCCTCGTTTATCTTCAAACGAAGGGGTTTTGAGTTTACTCGATTCGGGAACAGTGATAACAAGGCTATCTCCATCGATTTACAAAGCTTTCAAAGCAGAATTTGAGAAACAATTTTCTGGGTATCGAACTACACCAGGATTTTCGATTCTTAATACTTGTTTTAATCTAACAGGGTACGAAGAAGTGAATATTCCTACTGTGAAATTTATCTTTGAAGGCAATGCAGAGATGATTGTTGATGTTGAAGGGGTTTTTTATTTTGTGAAATCTGATGCTTCACAGATTTGTTTAGCGTTTGCGAGTTTGGGTTATGAAGATCAGACGATGATAATTGGGAATTATCAGCAGAAGAATCAAAGGGTTATTTATAATTCTAAAGAATCTAAAGTGGGTTTTGCAGGGGAGCCTTGCAGTTTTTAAGAATTTTTCCCGGAAAATGGGTGATTTTCCCGGAAAATTTGATTGGATCTTGAGAAATATTGAGGCTGATTATTTTAATGTAATGGGTTGGGTTGGATTTGACATTGGGTTAAAACGAAAT

Coding sequence (CDS)

ATGGAGATTTCGAAATCTCTCCATTTCCCCCTTTCTCTTCTTCTTCTTCTTCTTCTTCCTCTTCTCTCCATTGGCGTCGATGCTCGTTCGAGCTCGTTCAACCTCGGTAATGGTGATAACCATGAGAAGGGTCTTCTTCAGCTTTTTCAGAATTTTCCATGGAAGGAGCATGGAGAAGCTGTTGTTAATTGCATCTTTCACAAGCCAAAGATAACGAAGGGAATAACGACACTGGAAATGAAACAGAGAGATTACTGTTCAGGCAAAATAACAGACTGGGAAAAGATTTTTCAAAACCGTATCATCCTCGACGCTATTAACGTCAATTCCCTTTTTTCACATTTCAAATCAGCCATTTTTCCCGGCCAGACTCACCAACTCTCCGATTCCCAAATCCCCATTTCCTCCGGCGCCAGGCTCCAAACACTCAATTACATCGTCACCGTCGGCATCGGCGGCCAGAATTCCACTCTCATCGTTGACACCGGCAGCGATCTCACTTGGGTTCAATGCCTCCCTTGCCGCCTCTGTTACAACCAACAAGAACCCCTCTTCAATCCCTCTAATTCCTCTTCCTTCCTCTCCCTTCCTTGTAATTCCCCTACCTGTGTGGCTCTTCAACCCACCGCCGGAAGCTCCGGCCTCTGTTCTAACAAAAACTCAACTTCCTGCGACTACCAGATCGACTACGGCGATGGATCTTACTCCCGTGGGGAACTCGGATTCGAGAAGCTGACTTTAGGTAAAACTGAGATTGATAATTTCATATTCGGATGTGGCCGGAATAACAAAGGTTTATTTGGAGGAGCTTCGGGATTAATGGGTTTAGCTAGAAGTGAATTATCTCTGGTTTCTCAAACGTCCTCTCTTTTTGGTAGTGTTTTTTCTTACTGTTTACCAACAACTGGAGTTGGATCTTCAGGTTCCTTAACATTGGGGGGAGCCGATTTCTCAAATTTCAAGAACATTTCACCAATTTCCTATACAAGAATGATTCAAAACCCACAAATGTCGAATTTCTACTTTCTGAATCTGACTGGAATTTCAATCGGTGGGGTTAATTTGAATGTGCCTCGTTTATCTTCAAACGAAGGGGTTTTGAGTTTACTCGATTCGGGAACAGTGATAACAAGGCTATCTCCATCGATTTACAAAGCTTTCAAAGCAGAATTTGAGAAACAATTTTCTGGGTATCGAACTACACCAGGATTTTCGATTCTTAATACTTGTTTTAATCTAACAGGGTACGAAGAAGTGAATATTCCTACTGTGAAATTTATCTTTGAAGGCAATGCAGAGATGATTGTTGATGTTGAAGGGGTTTTTTATTTTGTGAAATCTGATGCTTCACAGATTTGTTTAGCGTTTGCGAGTTTGGGTTATGAAGATCAGACGATGATAATTGGGAATTATCAGCAGAAGAATCAAAGGGTTATTTATAATTCTAAAGAATCTAAAGTGGGTTTTGCAGGGGAGCCTTGCAGTTTTTAA

Protein sequence

MEISKSLHFPLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGEAVVNCIFHKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCSF*
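
The protein sequence above corresponds to a codon-by-codon translation of the coding sequence, with the trailing `*` marking the stop codon. A minimal sketch of that translation follows, assuming the standard genetic code; the function name `translate` is hypothetical and the database's own translation pipeline is not documented on this page.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}

def translate(cds):
    """Translate a DNA coding sequence into one-letter amino acids.

    Stop codons are written as '*'. Trailing bases that do not form
    a complete codon are ignored. Codons with non-ACGT letters give 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        protein.append(CODON_TABLE.get(cds[i:i + 3], "X"))
    return "".join(protein)

# First six codons and last four codons of the CDS shown above.
print(translate("ATGGAGATTTCGAAATCT"))
print(translate("TGCAGTTTTTAA"))
```

Applied to the first six and last four codons of the CDS, this reproduces the start (`MEISKS`) and end (`CSF*`) of the protein sequence listed above.
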
BLAST of Cucsa.106100 vs. Swiss-Prot
Match: ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 309.3 bits (791), Expect = 7.6e-83
Identity = 172/398 (43.22%), Positives = 237/398 (59.55%), Query Frame = 1

Query: 103 LDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNS--TLIV 162
           LD   VNS+ S     +      +   + +P   G+ L + NYIVTVG+G   +  +LI 
Sbjct: 90  LDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIF 149

Query: 163 DTGSDLTWVQCLPC-RLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNK 222
           DTGSDLTW QC PC R CY+Q+EP+FNPS S+S+ ++ C+S  C +L    G++G CS  
Sbjct: 150 DTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSAS 209

Query: 223 NSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEI-DNFIFGCGRNNKGLFGGASGLMGLAR 282
           N   C Y I YGD S+S G L  EK TL  +++ D   FGCG NN+GLF G +GL+GL R
Sbjct: 210 N---CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGR 269

Query: 283 SELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMS 342
            +LS  SQT++ +  +FSYCLP++    +G LT G A  S     +PIS          +
Sbjct: 270 DKLSFPSQTATAYNKIFSYCLPSSA-SYTGHLTFGSAGISRSVKFTPISTIT-----DGT 329

Query: 343 NFYFLNLTGISIGGVNLNVPR-LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSG 402
           +FY LN+  I++GG  L +P  + S  G  +L+DSGTVITRL P  Y A ++ F+ + S 
Sbjct: 330 SFYGLNIVAITVGGQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSK 389

Query: 403 YRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFA 462
           Y TT G SIL+TCF+L+G++ V IP V F F G A + +  +G+FY  K   SQ+CLAFA
Sbjct: 390 YPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFA 449

Query: 463 SLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 496
               +    I GN QQ+   V+Y+    +VGFA   CS
Sbjct: 450 GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474
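
The percentages in each HSP summary line are the match counts divided by the alignment length (for the hit above, 172 identical and 237 similar positions out of 398 aligned columns). A minimal sketch of recomputing them from the summary line follows; the helper name `hsp_fractions` is hypothetical, and the regular expression is modelled only on the lines shown in this report.

```python
import re

def hsp_fractions(summary_line):
    """Return (matches, aligned_length, percent) for each 'label = n/m' field.

    Fields are returned in the order they appear, so the first entry is
    the identity count and the second is the similar-residue count.
    """
    results = []
    for matches, length in re.findall(r"=\s*(\d+)/(\d+)", summary_line):
        matches, length = int(matches), int(length)
        results.append((matches, length, round(100.0 * matches / length, 2)))
    return results

line = "Identity = 172/398 (43.22%), Positives = 237/398 (59.55%), Query Frame = 1"
for matches, length, percent in hsp_fractions(line):
    print(f"{matches}/{length} = {percent:.2f}%")
```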

BLAST of Cucsa.106100 vs. Swiss-Prot
Match: AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1)

HSP 1 Score: 268.9 bits (686), Expect = 1.1e-70
Identity = 160/414 (38.65%), Positives = 236/414 (57.00%), Query Frame = 1

Query: 86  CSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNY 145
           CS   +D        I  D   V S++S   S     +  +   +++P  SG  L + NY
Sbjct: 74  CSHLSSDARVDHDEIIRRDQARVESIYSKL-SKNSANEVSEAKSTELPAKSGITLGSGNY 133

Query: 146 IVTVGIGG--QNSTLIVDTGSDLTWVQCLPCR-LCYNQQEPLFNPSNSSSFLSLPCNSPT 205
           IVT+GIG    + +L+ DTGSDLTW QC PC   CY+Q+EP FNPS+SS++ ++ C+SP 
Sbjct: 134 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPM 193

Query: 206 CVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEI-DNFIFGCGR 265
           C   +        CS  N   C Y I YGD S+++G L  EK TL  +++ ++  FGCG 
Sbjct: 194 CEDAES-------CSASN---CVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGE 253

Query: 266 NNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFK 325
           NN+GLF G +GL+GL   +LSL +QT++ + ++FSYCLP+    S+G LT G A  S   
Sbjct: 254 NNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESV 313

Query: 326 NISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNV-PRLSSNEGVLSLLDSGTVITRLS 385
             +PIS       P   N Y +++ GIS+G   L + P   S EG  +++DSGTV TRL 
Sbjct: 314 KFTPIS-----SFPSAFN-YGIDIIGISVGDKELAITPNSFSTEG--AIIDSGTVFTRLP 373

Query: 386 PSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEG 445
             +Y   ++ F+++ S Y++T G+ + +TC++ TG + V  PT+ F F G+  + +D  G
Sbjct: 374 TKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSG 433

Query: 446 VFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 495
           +   +K   SQ+CLAFA  G +D   I GN QQ    V+Y+    +VGFA   C
Sbjct: 434 ISLPIK--ISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Cucsa.106100 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 241.9 bits (616), Expect = 1.5e-62
Identity = 134/410 (32.68%), Positives = 212/410 (51.71%), Query Frame = 1

Query: 100 RIILDAINVNSLFSHFKSAIFPGQT--HQLSDSQIPISSGARLQTLNYIVTVGIGG--QN 159
           R+  D   V+++       + P     ++++D    I SG    +  Y V +G+G   ++
Sbjct: 84  RMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRD 143

Query: 160 STLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGL 219
             +++D+GSD+ WVQC PC+LCY Q +P+F+P+ S S+  + C S  C  ++ +   SG 
Sbjct: 144 QYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSG- 203

Query: 220 CSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMG 279
                   C Y++ YGDGSY++G L  E LT  KT + N   GCG  N+G+F GA+GL+G
Sbjct: 204 -------GCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLG 263

Query: 280 LARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI--SYTRMIQ 339
           +    +S V Q S   G  F YCL + G  S+GSL  G       +   P+  S+  +++
Sbjct: 264 IGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFG-------REALPVGASWVPLVR 323

Query: 340 NPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSL---------LDSGTVITRLSPSIY 399
           NP+  +FY++ L G+ +GGV + +P     +GV  L         +D+GT +TRL  + Y
Sbjct: 324 NPRAPSFYYVGLKGLGVGGVRIPLP-----DGVFDLTETGDGGVVMDTGTAVTRLPTAAY 383

Query: 400 KAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYF 459
            AF+  F+ Q +      G SI +TC++L+G+  V +PTV F F     + +     F  
Sbjct: 384 VAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARN-FLM 443

Query: 460 VKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 495
              D+   C AFA+        IIGN QQ+  +V ++     VGF    C
Sbjct: 444 PVDDSGTYCFAFAA--SPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Cucsa.106100 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 235.0 bits (598), Expect = 1.8e-60
Identity = 152/436 (34.86%), Positives = 227/436 (52.06%), Query Frame = 1

Query: 77  TLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQ--THQLSDSQIPI 136
           TL +   D  S   T  +++F +R+  D+  V S+ +   +A  PG+  TH         
Sbjct: 73  TLNLDHIDALSSNKTP-DELFSSRLQRDSRRVKSIAT--LAAQIPGRNVTHAPRPGGFSS 132

Query: 137 S--SGARLQTLNYIVTVGIG--GQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNS 196
           S  SG    +  Y   +G+G   +   +++DTGSD+ W+QC PCR CY+Q +P+F+P  S
Sbjct: 133 SVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 192

Query: 197 SSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT 256
            ++ ++PC+SP C  L  +AG    C+ +  T C YQ+ YGDGS++ G+   E LT  + 
Sbjct: 193 KTYATIPCSSPHCRRLD-SAG----CNTRRKT-CLYQVSYGDGSFTVGDFSTETLTFRRN 252

Query: 257 EIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSS-GS 316
            +     GCG +N+GLF GA+GL+GL + +LS   QT   F   FSYCL      S   S
Sbjct: 253 RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS 312

Query: 317 LTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSS------- 376
           +  G A  S     +P     ++ NP++  FY++ L GIS+GG    VP +++       
Sbjct: 313 VVFGNAAVSRIARFTP-----LLSNPKLDTFYYVGLLGISVGGT--RVPGVTASLFKLDQ 372

Query: 377 --NEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 436
             N GV  ++DSGT +TRL    Y A +  F       +  P FS+ +TCF+L+   EV 
Sbjct: 373 IGNGGV--IIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVK 432

Query: 437 IPTVKFIFEGNAEMIVDVEGVFYFVKSDAS-QICLAFASLGYEDQTMIIGNYQQKNQRVI 496
           +PTV   F G     V +    Y +  D + + C AFA  G      IIGN QQ+  RV+
Sbjct: 433 VPTVVLHFRG---ADVSLPATNYLIPVDTNGKFCFAFA--GTMGGLSIIGNIQQQGFRVV 485

BLAST of Cucsa.106100 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 222.2 bits (565), Expect = 1.2e-56
Identity = 138/442 (31.22%), Positives = 225/442 (50.90%), Query Frame = 1

Query: 77  TLEMKQRD-YCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIF------------PGQ 136
           +LE+  RD + + +  D++ +  +R+  D+  V  + +  + A+                
Sbjct: 81  SLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDT 140

Query: 137 THQLSDSQIPISSGARLQTLNYIVTVGIG--GQNSTLIVDTGSDLTWVQCLPCRLCYNQQ 196
            +Q  D   P+ SGA   +  Y   +G+G   +   L++DTGSD+ W+QC PC  CY Q 
Sbjct: 141 RYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQS 200

Query: 197 EPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELG 256
           +P+FNP++SS++ SL C++P C  L+ +A     C    S  C YQ+ YGDGS++ GEL 
Sbjct: 201 DPVFNPTSSSTYKSLTCSAPQCSLLETSA-----C---RSNKCLYQVSYGDGSFTVGELA 260

Query: 257 FEKLTLGKT-EIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLP 316
            + +T G + +I+N   GCG +N+GLF GA+GL+GL    LS+ +Q  +   + FSYCL 
Sbjct: 261 TDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKA---TSFSYCLV 320

Query: 317 TTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP-- 376
               G S SL             +P     +++N ++  FY++ L+G S+GG  + +P  
Sbjct: 321 DRDSGKSSSLDFNSVQLGGGDATAP-----LLRNKKIDTFYYVGLSGFSVGGEKVVLPDA 380

Query: 377 ----RLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRT-TPGFSILNTCFNL 436
                 S + GV  +LD GT +TRL    Y + +  F K     +  +   S+ +TC++ 
Sbjct: 381 IFDVDASGSGGV--ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDF 440

Query: 437 TGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVK-SDASQICLAFASLGYEDQTMIIGNYQ 495
           +    V +PTV F F G   +  D+    Y +   D+   C AFA         IIGN Q
Sbjct: 441 SSLSTVKVPTVAFHFTGGKSL--DLPAKNYLIPVDDSGTFCFAFAPT--SSSLSIIGNVQ 500

BLAST of Cucsa.106100 vs. TrEMBL
Match: A0A0A0K8J2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G431320 PE=3 SV=1)

HSP 1 Score: 997.7 bits (2578), Expect = 5.2e-288
Identity = 495/496 (99.80%), Positives = 495/496 (99.80%), Query Frame = 1

Query: 1   MEISKSLHFPLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGEA 60
           MEISKSLHFPLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGEA
Sbjct: 1   MEISKSLHFPLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGEA 60

Query: 61  VVNCIFHKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIF 120
           VVNCIF KPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIF
Sbjct: 61  VVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIF 120

Query: 121 PGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQ 180
           PGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQ
Sbjct: 121 PGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQ 180

Query: 181 QEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGEL 240
           QEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGEL
Sbjct: 181 QEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGEL 240

Query: 241 GFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLP 300
           GFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLP
Sbjct: 241 GFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLP 300

Query: 301 TTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRL 360
           TTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRL
Sbjct: 301 TTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRL 360

Query: 361 SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 420
           SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN
Sbjct: 361 SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 420

Query: 421 IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIY 480
           IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIY
Sbjct: 421 IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIY 480

Query: 481 NSKESKVGFAGEPCSF 497
           NSKESKVGFAGEPCSF
Sbjct: 481 NSKESKVGFAGEPCSF 496

BLAST of Cucsa.106100 vs. TrEMBL
Match: M5WTF1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005040mg PE=3 SV=1)

HSP 1 Score: 583.2 bits (1502), Expect = 3.0e-163
Identity = 299/457 (65.43%), Positives = 362/457 (79.21%), Query Frame = 1

Query: 42  EKGLLQLFQNFPWKEHGEAVVN-CIFHKPKITKGITTLEMKQRDYCSGKITDWEKIFQNR 101
           EK +L+L Q F W++HG      C+  K +  KG T LE+K RDYCSGKI DW+K  Q R
Sbjct: 30  EKKVLKL-QEFRWRQHGGTRSTVCLSQKSRKEKGATILEIKHRDYCSGKIVDWDKKQQKR 89

Query: 102 IILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIV 161
           +I D ++V SL S FK+ +  G+   LS++QIP++SG RLQTLNYIVTV +GG+N T+IV
Sbjct: 90  LIFDDLHVRSLQSQFKNRV-SGRIKDLSEAQIPLTSGIRLQTLNYIVTVELGGRNMTVIV 149

Query: 162 DTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKN 221
           DTGSDLTWVQC PC+LCYNQQEPLFN S S S+ S+ CNS TC ALQ   G+SG C + N
Sbjct: 150 DTGSDLTWVQCQPCKLCYNQQEPLFNSSASPSYKSVLCNSSTCQALQFDTGNSGACGS-N 209

Query: 222 STSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSE 281
            TSC+Y ++YGDGSY+RGELG + L+LG T ++NF+FGCGRNNKGLFGGASGLMGL RSE
Sbjct: 210 PTSCNYVVNYGDGSYTRGELGSDHLSLGATPVNNFVFGCGRNNKGLFGGASGLMGLGRSE 269

Query: 282 -LSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSN 341
            +SLVSQTS+LFG VFSYCLPTT   +SGSL +GG D S +KN +PISYTRM+ NP++S 
Sbjct: 270 SVSLVSQTSALFGGVFSYCLPTTEATASGSLIMGG-DASIYKNSTPISYTRMVPNPELST 329

Query: 342 FYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYR 401
           FYFLNLTGISIGGV L     +S  G+L  +DSGTVI+RL+PS+YKA KAEF KQFSGY 
Sbjct: 330 FYFLNLTGISIGGVALQNQSFASG-GIL--IDSGTVISRLAPSVYKAVKAEFLKQFSGYP 389

Query: 402 TTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASL 461
             PGF+IL+TCFNL+ Y+EV+IPT+KF FEGNAE+ VDV G+FY VK+DASQICLA ASL
Sbjct: 390 PAPGFAILDTCFNLSAYQEVSIPTLKFHFEGNAELNVDVTGIFYLVKTDASQICLALASL 449

Query: 462 GYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCSF 497
            YED+  IIGNYQQKNQRVIYN+K+SK+GFA E CSF
Sbjct: 450 SYEDEIGIIGNYQQKNQRVIYNTKDSKLGFAEESCSF 479

BLAST of Cucsa.106100 vs. TrEMBL
Match: A0A061GAK7_THECC (Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_027560 PE=3 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 8.0e-156
Identity = 291/489 (59.51%), Positives = 365/489 (74.64%), Query Frame = 1

Query: 12  SLLLLLLLPLLSIGVDARSSSFNLGNGDNH---EKGLLQLFQNFPWKEHGEAVVNCIFHK 71
           ++ LLL   +LS+ V     S  L NG  H   EK LL+L Q+F WK+  +A   C+  K
Sbjct: 4   AMALLLFPSMLSLLV----FSLILHNGVVHCFGEKKLLKL-QHFQWKQKWDAST-CLSQK 63

Query: 72  PKITKGITTLEMKQRDYC-SGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQL 131
            +  KG T LEMK RDYC  G + DW K+ Q R+ILD + V SL +  K+  F G+T  +
Sbjct: 64  SRKEKGATILEMKHRDYCYGGGVKDWNKLLQKRLILDDLRVQSLQARIKNKAF-GKTEGV 123

Query: 132 SDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNP 191
           SD++IP++SG  L TLNYIVTV +GG+   +IVDTGSDLTWVQC PC+ CYNQ+EPLFNP
Sbjct: 124 SDTRIPLTSGVELGTLNYIVTVELGGRKMRVIVDTGSDLTWVQCQPCKSCYNQKEPLFNP 183

Query: 192 SNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL 251
           S S S+ ++ CNS TC +L    G++G+C N N  +C+Y + YGDGSY+RGEL  + L+L
Sbjct: 184 SASPSYQTVSCNSSTCQSLAFATGNTGICGN-NPPTCNYIVSYGDGSYTRGELAHDHLSL 243

Query: 252 GKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSS 311
           GKT +DNF+FGCGRNNKGLFGGASGLMGL RS +SLVSQT+ +FG  FSYCLP+T  G+S
Sbjct: 244 GKTPVDNFVFGCGRNNKGLFGGASGLMGLGRSSISLVSQTTDIFGGFFSYCLPSTQAGAS 303

Query: 312 GSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVL 371
           GSL LGG + S +K  S ISYTRMI NPQ+S FYFLNLTG+S+GGV L  P  +  +G +
Sbjct: 304 GSLVLGG-NSSVYKTSSAISYTRMIPNPQLSTFYFLNLTGVSVGGVTL--PDSTFGKGAM 363

Query: 372 SLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFI 431
            L+DSGTVITRL PSIYKA KAEF KQFSG+ + P FSIL+TCFNL+ Y+EV++PT+K  
Sbjct: 364 -LIDSGTVITRLPPSIYKALKAEFMKQFSGFPSAPAFSILDTCFNLSAYQEVDVPTIKMQ 423

Query: 432 FEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKV 491
           FEGNAEM VDV GVFYFVK+DASQ+CLA ASL +ED+  II NYQQ+NQRVIY++K++K+
Sbjct: 424 FEGNAEMNVDVTGVFYFVKTDASQVCLALASLSFEDEIGIIANYQQRNQRVIYDTKKAKL 480

Query: 492 GFAGEPCSF 497
           GFA E CSF
Sbjct: 484 GFAHESCSF 480

BLAST of Cucsa.106100 vs. TrEMBL
Match: A0A067EW14_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011482mg PE=3 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 2.0e-154
Identity = 287/489 (58.69%), Positives = 361/489 (73.82%), Query Frame = 1

Query: 10  PLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQL-FQNFPWKEHGEAVVNCIFH- 69
           PL++L LLL  ++S+        F L  G +  +G  +L      W++   +  +C+ H 
Sbjct: 7   PLTILSLLLPLMVSL--------FLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQ 66

Query: 70  KPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQL 129
           K +I  G  TLE+K ++YCSGKI DW +  QNR+ILD ++V  L S  K+ I  G    +
Sbjct: 67  KSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI-SGNIKDV 126

Query: 130 SDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNP 189
           S+++IP++SG RLQTLNYI T+ +GG+N T+IVDTGSDLTWVQC PC+ CYNQQ+P+F+P
Sbjct: 127 SNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 186

Query: 190 SNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL 249
           S S S+  + CNS TC AL+   G+SG+CS+ +   C+Y + YGDGSY+RGELG E L L
Sbjct: 187 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 246

Query: 250 GKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTT-GVGS 309
           GK  +++FIFGCGRNNKGLFGG SGLMGL RS+LSLVSQTS +FG +FSYCLP+T   G+
Sbjct: 247 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 306

Query: 310 SGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGV 369
           SGSL LGG + S FKN +PI+YT MI NPQ++ FY LNLTGISIGG  L     +   G+
Sbjct: 307 SGSLILGG-NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GI 366

Query: 370 LSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKF 429
             L+DSGTVITRL PSIY A KAEF KQFSG+ + PGFSIL+TCFNL+ Y+EVNIP VK 
Sbjct: 367 --LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 426

Query: 430 IFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESK 489
            FEGNAEM VDV G+ YFVKSDASQ+CLA ASL YED+T IIGNYQQKNQRVIY++K S+
Sbjct: 427 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 482

Query: 490 VGFAGEPCS 496
           +GFAGE CS
Sbjct: 487 LGFAGEDCS 482

BLAST of Cucsa.106100 vs. TrEMBL
Match: B9RGP5_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1442500 PE=3 SV=1)

HSP 1 Score: 550.8 bits (1418), Expect = 1.7e-153
Identity = 276/419 (65.87%), Positives = 329/419 (78.52%), Query Frame = 1

Query: 80  MKQRDYC--SGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSG 139
           MK RD+C  SGK TDW K  Q  +ILD   V SL S  KS IF G      DSQIP+SSG
Sbjct: 1   MKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKS-IFSGNNIDALDSQIPLSSG 60

Query: 140 ARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLP 199
            RLQTLNYIVTV IGG+N T+IVDTGSDLTWVQC PCRLCYNQQ+PLFNPS S S+ ++ 
Sbjct: 61  VRLQTLNYIVTVEIGGRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTIL 120

Query: 200 CNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIF 259
           CNS TC +LQ   G+ G+C + N+ +C+Y ++YGDGSY+RG+LG E+L LG T + NFIF
Sbjct: 121 CNSSTCQSLQYATGNLGVCGS-NTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIF 180

Query: 260 GCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADF 319
           GCGRNNKGLFGGASGLMGL +S+LSLVSQTS++F  VFSYCLPTT   +SGSL LGG + 
Sbjct: 181 GCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASGSLILGG-NS 240

Query: 320 SNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVIT 379
           S +KN +PISYTRMI NPQ+  FYFLNLTGISIGGV L  P    + G+L  +DSGTVIT
Sbjct: 241 SVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQS-GIL--IDSGTVIT 300

Query: 380 RLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVD 439
           RL P +Y+  KAEF KQFSG+ + P FSIL+TCFNL GY+EV+IPT++  FEGNAE+ VD
Sbjct: 301 RLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFEGNAELTVD 360

Query: 440 VEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCSF 497
           V G+FYFVK+DASQ+CLA ASL ++D+  IIGNYQQ+NQRVIYN+KESK+GFA E CSF
Sbjct: 361 VTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEACSF 413

BLAST of Cucsa.106100 vs. TAIR10
Match: AT1G79720.1 (AT1G79720.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 500.7 bits (1288), Expect = 1.0e-141
Identity = 265/490 (54.08%), Positives = 334/490 (68.16%), Query Frame = 1

Query: 10  PLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPW--KEHGEAVVNCIFH 69
           PL L+ L LL  +  GVD              EK +L +  N  W  K+  EA  +C   
Sbjct: 13  PLLLVFLFLLSCVVHGVD--------------EKKILSVHNNI-WSPKKSYEASTSCFSR 72

Query: 70  KPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQL 129
                +  TTLEMK R+ CSGK  D  K  +  ++LD I V SL    K+         +
Sbjct: 73  SLGKGRESTTLEMKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSV 132

Query: 130 SDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNP 189
           S++QIP++SG +L++LNYIVTV +GG+N +LIVDTGSDLTWVQC PCR CYNQQ PL++P
Sbjct: 133 SETQIPLTSGIKLESLNYIVTVELGGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDP 192

Query: 190 SNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNS---TSCDYQIDYGDGSYSRGELGFEK 249
           S SSS+ ++ CNS TC  L     +SG C   N    T C+Y + YGDGSY+RG+L  E 
Sbjct: 193 SVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASES 252

Query: 250 LTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGV 309
           + LG T+++NF+FGCGRNNKGLFGG+SGLMGL RS +SLVSQT   F  VFSYCLP+   
Sbjct: 253 ILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLED 312

Query: 310 GSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNE 369
           G+SGSL+ G  D S + N + +SYT ++QNPQ+ +FY LNLTG SIGGV L     SS+ 
Sbjct: 313 GASGSLSFGN-DSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK----SSSF 372

Query: 370 GVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTV 429
           G   L+DSGTVITRL PSIYKA K EF KQFSG+ T PG+SIL+TCFNLT YE+++IP +
Sbjct: 373 GRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPII 432

Query: 430 KFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKE 489
           K IF+GNAE+ VDV GVFYFVK DAS +CLA ASL YE++  IIGNYQQKNQRVIY++ +
Sbjct: 433 KMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQ 482

Query: 490 SKVGFAGEPC 495
            ++G  GE C
Sbjct: 493 ERLGIVGENC 482

BLAST of Cucsa.106100 vs. TAIR10
Match: AT5G10770.1 (AT5G10770.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 309.3 bits (791), Expect = 4.3e-84
Identity = 172/398 (43.22%), Positives = 237/398 (59.55%), Query Frame = 1

Query: 103 LDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNS--TLIV 162
           LD   VNS+ S     +      +   + +P   G+ L + NYIVTVG+G   +  +LI 
Sbjct: 90  LDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIF 149

Query: 163 DTGSDLTWVQCLPC-RLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNK 222
           DTGSDLTW QC PC R CY+Q+EP+FNPS S+S+ ++ C+S  C +L    G++G CS  
Sbjct: 150 DTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSAS 209

Query: 223 NSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEI-DNFIFGCGRNNKGLFGGASGLMGLAR 282
           N   C Y I YGD S+S G L  EK TL  +++ D   FGCG NN+GLF G +GL+GL R
Sbjct: 210 N---CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGR 269

Query: 283 SELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMS 342
            +LS  SQT++ +  +FSYCLP++    +G LT G A  S     +PIS          +
Sbjct: 270 DKLSFPSQTATAYNKIFSYCLPSSA-SYTGHLTFGSAGISRSVKFTPISTIT-----DGT 329

Query: 343 NFYFLNLTGISIGGVNLNVPR-LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSG 402
           +FY LN+  I++GG  L +P  + S  G  +L+DSGTVITRL P  Y A ++ F+ + S 
Sbjct: 330 SFYGLNIVAITVGGQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSK 389

Query: 403 YRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFA 462
           Y TT G SIL+TCF+L+G++ V IP V F F G A + +  +G+FY  K   SQ+CLAFA
Sbjct: 390 YPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFA 449

Query: 463 SLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 496
               +    I GN QQ+   V+Y+    +VGFA   CS
Sbjct: 450 GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474

BLAST of Cucsa.106100 vs. TAIR10
Match: AT5G10760.1 (AT5G10760.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 268.9 bits (686), Expect = 6.4e-72
Identity = 160/414 (38.65%), Positives = 236/414 (57.00%), Query Frame = 1

Query: 86  CSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNY 145
           CS   +D        I  D   V S++S   S     +  +   +++P  SG  L + NY
Sbjct: 74  CSHLSSDARVDHDEIIRRDQARVESIYSKL-SKNSANEVSEAKSTELPAKSGITLGSGNY 133

Query: 146 IVTVGIGG--QNSTLIVDTGSDLTWVQCLPCR-LCYNQQEPLFNPSNSSSFLSLPCNSPT 205
           IVT+GIG    + +L+ DTGSDLTW QC PC   CY+Q+EP FNPS+SS++ ++ C+SP 
Sbjct: 134 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPM 193

Query: 206 CVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEI-DNFIFGCGR 265
           C   +        CS  N   C Y I YGD S+++G L  EK TL  +++ ++  FGCG 
Sbjct: 194 CEDAES-------CSASN---CVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGE 253

Query: 266 NNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFK 325
           NN+GLF G +GL+GL   +LSL +QT++ + ++FSYCLP+    S+G LT G A  S   
Sbjct: 254 NNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESV 313

Query: 326 NISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNV-PRLSSNEGVLSLLDSGTVITRLS 385
             +PIS       P   N Y +++ GIS+G   L + P   S EG  +++DSGTV TRL 
Sbjct: 314 KFTPIS-----SFPSAFN-YGIDIIGISVGDKELAITPNSFSTEG--AIIDSGTVFTRLP 373

Query: 386 PSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEG 445
             +Y   ++ F+++ S Y++T G+ + +TC++ TG + V  PT+ F F G+  + +D  G
Sbjct: 374 TKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSG 433

Query: 446 VFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 495
           +   +K   SQ+CLAFA  G +D   I GN QQ    V+Y+    +VGFA   C
Sbjct: 434 ISLPIK--ISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Cucsa.106100 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 241.9 bits (616), Expect = 8.4e-64
Identity = 134/410 (32.68%), Positives = 212/410 (51.71%), Query Frame = 1

Query: 100 RIILDAINVNSLFSHFKSAIFPGQT--HQLSDSQIPISSGARLQTLNYIVTVGIGG--QN 159
           R+  D   V+++       + P     ++++D    I SG    +  Y V +G+G   ++
Sbjct: 84  RMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRD 143

Query: 160 STLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGL 219
             +++D+GSD+ WVQC PC+LCY Q +P+F+P+ S S+  + C S  C  ++ +   SG 
Sbjct: 144 QYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSG- 203

Query: 220 CSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMG 279
                   C Y++ YGDGSY++G L  E LT  KT + N   GCG  N+G+F GA+GL+G
Sbjct: 204 -------GCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLG 263

Query: 280 LARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI--SYTRMIQ 339
           +    +S V Q S   G  F YCL + G  S+GSL  G       +   P+  S+  +++
Sbjct: 264 IGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFG-------REALPVGASWVPLVR 323

Query: 340 NPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSL---------LDSGTVITRLSPSIY 399
           NP+  +FY++ L G+ +GGV + +P     +GV  L         +D+GT +TRL  + Y
Sbjct: 324 NPRAPSFYYVGLKGLGVGGVRIPLP-----DGVFDLTETGDGGVVMDTGTAVTRLPTAAY 383

Query: 400 KAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYF 459
            AF+  F+ Q +      G SI +TC++L+G+  V +PTV F F     + +     F  
Sbjct: 384 VAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARN-FLM 443

Query: 460 VKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 495
              D+   C AFA+        IIGN QQ+  +V ++     VGF    C
Sbjct: 444 PVDDSGTYCFAFAA--SPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Cucsa.106100 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 235.7 bits (600), Expect = 6.0e-62
Identity = 134/370 (36.22%), Positives = 196/370 (52.97%), Query Frame = 1

Query: 134 ISSGARLQTLNYIVTVGIG--GQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSS 193
           + SG    +  Y + +G+G    N  +++DTGSD+ W+QC PC+ CYNQ + +F+P  S 
Sbjct: 124 VISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSK 183

Query: 194 SFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTE 253
           +F ++PC S  C  L      S  C  + S +C YQ+ YGDGS++ G+   E LT     
Sbjct: 184 TFATVPCGSRLCRRLD----DSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR 243

Query: 254 IDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLT 313
           +D+   GCG +N+GLF GA+GL+GL R  LS  SQT + +   FSYCL       S S  
Sbjct: 244 VDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKP 303

Query: 314 LGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLS--- 373
                F N        +T ++ NP++  FY+L L GIS+GG    VP +S ++  L    
Sbjct: 304 PSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGG--SRVPGVSESQFKLDATG 363

Query: 374 ----LLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTV 433
               ++DSGT +TRL+   Y A +  F    +  +  P +S+ +TCF+L+G   V +PTV
Sbjct: 364 NGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTV 423

Query: 434 KFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKE 493
            F F G  E+ +        V ++  + C AFA  G      IIGN QQ+  RV Y+   
Sbjct: 424 VFHF-GGGEVSLPASNYLIPVNTE-GRFCFAFA--GTMGSLSIIGNIQQQGFRVAYDLVG 483

Query: 494 SKVGFAGEPC 495
           S+VGF    C
Sbjct: 484 SRVGFLSRAC 483

BLAST of Cucsa.106100 vs. NCBI nr
Match: gi|778728858|ref|XP_004135889.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus])

HSP 1 Score: 997.7 bits (2578), Expect = 7.4e-288
Identity = 495/496 (99.80%), Positives = 495/496 (99.80%), Query Frame = 1

Query: 1   MEISKSLHFPLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGEA 60
           MEISKSLHFPLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGEA
Sbjct: 1   MEISKSLHFPLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPWKEHGEA 60

Query: 61  VVNCIFHKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIF 120
           VVNCIF KPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIF
Sbjct: 61  VVNCIFQKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIF 120

Query: 121 PGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQ 180
           PGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQ
Sbjct: 121 PGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQ 180

Query: 181 QEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGEL 240
           QEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGEL
Sbjct: 181 QEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGEL 240

Query: 241 GFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLP 300
           GFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLP
Sbjct: 241 GFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLP 300

Query: 301 TTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRL 360
           TTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRL
Sbjct: 301 TTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRL 360

Query: 361 SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 420
           SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN
Sbjct: 361 SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVN 420

Query: 421 IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIY 480
           IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIY
Sbjct: 421 IPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIY 480

Query: 481 NSKESKVGFAGEPCSF 497
           NSKESKVGFAGEPCSF
Sbjct: 481 NSKESKVGFAGEPCSF 496

BLAST of Cucsa.106100 vs. NCBI nr
Match: gi|659122560|ref|XP_008461208.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo])

HSP 1 Score: 949.9 bits (2454), Expect = 1.8e-273
Identity = 475/497 (95.57%), Positives = 484/497 (97.38%), Query Frame = 1

Query: 1   MEISKSLHFPLSLLLLLLLPLLSIGVDARSSSFNLGNGDN-HEKGLLQLFQNFPWKEHGE 60
           ME+SKSLHFPLSLL LLLLPLL I VDARSS   +GNG N HEKGLLQLFQNFPWKEHGE
Sbjct: 3   MEVSKSLHFPLSLLFLLLLPLLFIIVDARSS---VGNGGNYHEKGLLQLFQNFPWKEHGE 62

Query: 61  AVVNCIFHKPKITKGITTLEMKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAI 120
           AVVNCIF KPKITKGITTLEMKQRDYCSGKITD EKIFQNRIILDAINVNSL SH KSAI
Sbjct: 63  AVVNCIFQKPKITKGITTLEMKQRDYCSGKITDLEKIFQNRIILDAINVNSLLSHVKSAI 122

Query: 121 FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN 180
           FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN
Sbjct: 123 FPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYN 182

Query: 181 QQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE 240
           QQEPLFNPSNSSSFLSLPC+SPTC+ALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE
Sbjct: 183 QQEPLFNPSNSSSFLSLPCSSPTCLALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGE 242

Query: 241 LGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCL 300
           LG+EKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSS+FGS+FSYCL
Sbjct: 243 LGYEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSVFGSIFSYCL 302

Query: 301 PTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR 360
           PTTGVGSSGSLTLGG DFS+FKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR
Sbjct: 303 PTTGVGSSGSLTLGGTDFSSFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR 362

Query: 361 LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV 420
           LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV
Sbjct: 363 LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEV 422

Query: 421 NIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVI 480
           NIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRV+
Sbjct: 423 NIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVV 482

Query: 481 YNSKESKVGFAGEPCSF 497
           YNSKESKVGFAGEPCSF
Sbjct: 483 YNSKESKVGFAGEPCSF 496

BLAST of Cucsa.106100 vs. NCBI nr
Match: gi|595931085|ref|XP_007215297.1| (hypothetical protein PRUPE_ppa005040mg [Prunus persica])

HSP 1 Score: 583.2 bits (1502), Expect = 4.4e-163
Identity = 299/457 (65.43%), Positives = 362/457 (79.21%), Query Frame = 1

Query: 42  EKGLLQLFQNFPWKEHGEAVVN-CIFHKPKITKGITTLEMKQRDYCSGKITDWEKIFQNR 101
           EK +L+L Q F W++HG      C+  K +  KG T LE+K RDYCSGKI DW+K  Q R
Sbjct: 30  EKKVLKL-QEFRWRQHGGTRSTVCLSQKSRKEKGATILEIKHRDYCSGKIVDWDKKQQKR 89

Query: 102 IILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIV 161
           +I D ++V SL S FK+ +  G+   LS++QIP++SG RLQTLNYIVTV +GG+N T+IV
Sbjct: 90  LIFDDLHVRSLQSQFKNRV-SGRIKDLSEAQIPLTSGIRLQTLNYIVTVELGGRNMTVIV 149

Query: 162 DTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKN 221
           DTGSDLTWVQC PC+LCYNQQEPLFN S S S+ S+ CNS TC ALQ   G+SG C + N
Sbjct: 150 DTGSDLTWVQCQPCKLCYNQQEPLFNSSASPSYKSVLCNSSTCQALQFDTGNSGACGS-N 209

Query: 222 STSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSE 281
            TSC+Y ++YGDGSY+RGELG + L+LG T ++NF+FGCGRNNKGLFGGASGLMGL RSE
Sbjct: 210 PTSCNYVVNYGDGSYTRGELGSDHLSLGATPVNNFVFGCGRNNKGLFGGASGLMGLGRSE 269

Query: 282 -LSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSN 341
            +SLVSQTS+LFG VFSYCLPTT   +SGSL +GG D S +KN +PISYTRM+ NP++S 
Sbjct: 270 SVSLVSQTSALFGGVFSYCLPTTEATASGSLIMGG-DASIYKNSTPISYTRMVPNPELST 329

Query: 342 FYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYR 401
           FYFLNLTGISIGGV L     +S  G+L  +DSGTVI+RL+PS+YKA KAEF KQFSGY 
Sbjct: 330 FYFLNLTGISIGGVALQNQSFASG-GIL--IDSGTVISRLAPSVYKAVKAEFLKQFSGYP 389

Query: 402 TTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASL 461
             PGF+IL+TCFNL+ Y+EV+IPT+KF FEGNAE+ VDV G+FY VK+DASQICLA ASL
Sbjct: 390 PAPGFAILDTCFNLSAYQEVSIPTLKFHFEGNAELNVDVTGIFYLVKTDASQICLALASL 449

Query: 462 GYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCSF 497
            YED+  IIGNYQQKNQRVIYN+K+SK+GFA E CSF
Sbjct: 450 SYEDEIGIIGNYQQKNQRVIYNTKDSKLGFAEESCSF 479

BLAST of Cucsa.106100 vs. NCBI nr
Match: gi|645245370|ref|XP_008228848.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Prunus mume])

HSP 1 Score: 575.5 bits (1482), Expect = 9.1e-161
Identity = 295/457 (64.55%), Positives = 360/457 (78.77%), Query Frame = 1

Query: 42  EKGLLQLFQNFPWKEHGEAVVN-CIFHKPKITKGITTLEMKQRDYCSGKITDWEKIFQNR 101
           EK +L+L Q F W++ G      C+  K +  KG T LE+K RDYCSGKI DW K  Q R
Sbjct: 30  EKKVLKL-QEFRWRQRGGTRSTVCLSQKSRKEKGATILEIKHRDYCSGKIVDWNKKQQKR 89

Query: 102 IILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIV 161
           +I D ++V SL S FK+ ++ G+    S++QIP++SG RLQTLNYIVT+ +GG+N T+IV
Sbjct: 90  LIFDDLHVRSLQSQFKNRVY-GRIKDASEAQIPLTSGIRLQTLNYIVTLELGGRNMTVIV 149

Query: 162 DTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKN 221
           DTGSDLTWVQC PC+LCYNQQEPLFN S S S+ S+ CNS TC ALQ   G+SG C + N
Sbjct: 150 DTGSDLTWVQCQPCKLCYNQQEPLFNSSASPSYKSVLCNSSTCQALQFDTGNSGACGS-N 209

Query: 222 STSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSE 281
            TSC+Y ++YGDGSY+RGELG + L+LG T ++NF+FGCGRNNKGLFGGASGLMGL RSE
Sbjct: 210 PTSCNYVVNYGDGSYTRGELGSDHLSLGATPVNNFVFGCGRNNKGLFGGASGLMGLGRSE 269

Query: 282 -LSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSN 341
            +SLVSQTS+LFG VFSYCLPTT   +SGSL +GG D S +KN +PISYTRM+ NP++S+
Sbjct: 270 SVSLVSQTSALFGGVFSYCLPTTEATASGSLIMGG-DASIYKNSTPISYTRMVPNPELSS 329

Query: 342 FYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYR 401
           FYFLNLTGISIGGV L     +S  G+L  +DSGTVI+RL+P +Y+A KAEF KQFSGY 
Sbjct: 330 FYFLNLTGISIGGVALQAQSFASG-GIL--IDSGTVISRLAPLVYEAVKAEFLKQFSGYP 389

Query: 402 TTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASL 461
             PGFSIL+TCFNL+ Y+EV+IPT+KF FEGNAE+ VDV GVFY VK+DASQICLA ASL
Sbjct: 390 PAPGFSILDTCFNLSAYQEVSIPTLKFHFEGNAELNVDVTGVFYLVKTDASQICLALASL 449

Query: 462 GYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCSF 497
            YED+  IIGNYQQKN+RVIYN+K+SK+GFA E CSF
Sbjct: 450 SYEDEIGIIGNYQQKNRRVIYNTKDSKLGFAEESCSF 479

BLAST of Cucsa.106100 vs. NCBI nr
Match: gi|1000979824|ref|XP_015570839.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Ricinus communis])

HSP 1 Score: 572.4 bits (1474), Expect = 7.7e-160
Identity = 299/493 (60.65%), Positives = 363/493 (73.63%), Query Frame = 1

Query: 8   HFPLSLLLLLLLPLLSIGVDARSSSFNLGNGDNHEKGLLQLFQNFPW--KEHGEAVVNCI 67
           H+  +LL LL+  L  +   A+S           EK +L L Q + W  K + +   +C+
Sbjct: 14  HYYYTLLSLLVFLLTVVNGGAQSLQ---------EKKVLSL-QEYQWQLKSNTDTNSSCL 73

Query: 68  FHKPKITKGITTLEMKQRDYC--SGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQ 127
             K K  KG T LEMK RD+C  SGK TDW K  Q  +ILD   V SL S  KS IF G 
Sbjct: 74  SQKSKREKGATILEMKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKS-IFSGN 133

Query: 128 THQLSDSQIPISSGARLQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEP 187
                DSQIP+SSG RLQTLNYIVTV IGG+N T+IVDTGSDLTWVQC PCRLCYNQQ+P
Sbjct: 134 NIDALDSQIPLSSGVRLQTLNYIVTVEIGGRNMTVIVDTGSDLTWVQCQPCRLCYNQQDP 193

Query: 188 LFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFE 247
           LFNPS S S+ ++ CNS TC +LQ   G+ G+C + N+ +C+Y ++YGDGSY+RG+LG E
Sbjct: 194 LFNPSGSPSYQTILCNSSTCQSLQYATGNLGVCGS-NTPTCNYVVNYGDGSYTRGDLGME 253

Query: 248 KLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTG 307
           +L LG T + NFIFGCGRNNKGLFGGASGLMGL +S+LSLVSQTS++F  VFSYCLPTT 
Sbjct: 254 QLNLGTTHVSNFIFGCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTA 313

Query: 308 VGSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSN 367
             +SGSL LGG + S +KN +PISYTRMI NPQ+  FYFLNLTGISIGGV L  P    +
Sbjct: 314 ADASGSLILGG-NSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQS 373

Query: 368 EGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPT 427
            G+L  +DSGTVITRL P +Y+  KAEF KQFSG+ + P FSIL+TCFNL GY+EV+IPT
Sbjct: 374 -GIL--IDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPT 433

Query: 428 VKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSK 487
           ++  FEGNAE+ VDV G+FYFVK+DASQ+CLA ASL ++D+  IIGNYQQ+NQRVIYN+K
Sbjct: 434 IRMQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTK 490

Query: 488 ESKVGFAGEPCSF 497
           ESK+GFA E CSF
Sbjct: 494 ESKLGFAAEACSF 490

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
ASPA_ARATH   7.6e-83  43.22         Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana GN=At5g10770 ...
AED1_ARATH   1.1e-70  38.65         Aspartyl protease AED1 OS=Arabidopsis thaliana GN=AED1 PE=2 SV=1
ASPG2_ARATH  1.5e-62  32.68         Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
APF2_ARATH   1.8e-60  34.86         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
ASPG1_ARATH  1.2e-56  31.22         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...

Match Name        E-value   Identity (%)  Description
A0A0A0K8J2_CUCSA  5.2e-288  99.80         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G431320 PE=3 SV=1
M5WTF1_PRUPE      3.0e-163  65.43         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005040mg PE=3 SV=1
A0A061GAK7_THECC  8.0e-156  59.51         Eukaryotic aspartyl protease family protein OS=Theobroma cacao GN=TCM_027560 PE=...
A0A067EW14_CITSI  2.0e-154  58.69         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011482mg PE=3 SV=1
B9RGP5_RICCO      1.7e-153  65.87         Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_1442500 ...

Match Name   E-value   Identity (%)  Description
AT1G79720.1  1.0e-141  54.08         Eukaryotic aspartyl protease family protein
AT5G10770.1  4.3e-84   43.22         Eukaryotic aspartyl protease family protein
AT5G10760.1  6.4e-72   38.65         Eukaryotic aspartyl protease family protein
AT3G20015.1  8.4e-64   32.68         Eukaryotic aspartyl protease family protein
AT3G61820.1  6.0e-62   36.22         Eukaryotic aspartyl protease family protein

Match Name                          E-value   Identity (%)  Description
gi|778728858|ref|XP_004135889.2|    7.4e-288  99.80         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus]
gi|659122560|ref|XP_008461208.1|    1.8e-273  95.57         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo]
gi|595931085|ref|XP_007215297.1|    4.4e-163  65.43         hypothetical protein PRUPE_ppa005040mg [Prunus persica]
gi|645245370|ref|XP_008228848.1|    9.1e-161  64.55         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Prunus mume]
gi|1000979824|ref|XP_015570839.1|   7.7e-160  60.65         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Ricinus communis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001461  Aspartic_peptidase_A1
IPR001969  Aspartic_peptidase_AS
IPR021109  Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.106100.1  Cucsa.106100.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description                  Source   Source Term       Source Description   Alignment
IPR001461  Aspartic peptidase A1 family     PRINTS   PR00792           PEPSIN               coord: 368..379, score: 3.0E-6
                                                                                            coord: 308..321, score: 3.0E-6
                                                                                            coord: 149..169, score: 3.
IPR001461  Aspartic peptidase A1 family     PANTHER  PTHR13683         ASPARTYL PROTEASES   coord: 4..28, score: 1.7E-223
                                                                                            coord: 66..495, score: 1.7E
IPR001969  Aspartic peptidase, active site  PROSITE  PS00141           ASP_PROTEASE         coord: 158..169, scor
IPR021109  Aspartic peptidase domain        GENE3D   G3DSA:2.40.70.10  -                    coord: 325..495, score: 8.1E-38
                                                                                            coord: 132..314, score: 7.7
IPR021109  Aspartic peptidase domain        unknown  SSF50630          Acid proteases       coord: 140..494, score: 8.17
None       No IPR available                 PANTHER  PTHR13683:SF263   SUBFAMILY NOT NAMED  coord: 4..28, score: 1.7E-223
                                                                                            coord: 66..495, score: 1.7E
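
Each alignment entry above gives a `coord: start..end` range on the protein. As a rough sketch, the span of each match can be computed as follows, assuming the coordinates are 1-based and inclusive (the page does not state this explicitly). The helper name `coord_spans` is hypothetical.

```python
import re

# Captures the start and end of every "coord: start..end" entry in a text block.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")

def coord_spans(text):
    """Return (start, end, length) tuples, assuming 1-based inclusive coordinates."""
    spans = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans

# PROSITE active-site match and the two GENE3D domain matches listed above.
for start, end, length in coord_spans(
    "coord: 158..169 coord: 325..495 coord: 132..314"
):
    print(f"{start}..{end}: {length} residues")
```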

The following gene(s) are paralogous to this gene:
Gene          Paralogue     Organism            Block
Cucsa.106100  Cucsa.271320  Cucumber (Gy14) v1  cgycgyB054

The following block(s) are covering this gene:
Gene          Organism                   Block
Cucsa.106100  Cucurbita pepo (Zucchini)  cgycpeB0230
Cucsa.106100  Cucurbita maxima (Rimu)    cgycmaB0238
Cucsa.106100  Cucurbita moschata (Rifu)  cgycmoB0238
Cucsa.106100  Cucurbita moschata (Rifu)  cgycmoB0240