Csa1G022480 (gene) Cucumber (Chinese Long) v2

Name: Csa1G022480
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Aspartic proteinase nepenthesin-1, putative; contains IPR001461 (Peptidase A1), IPR021109 (Aspartic peptidase)
Location: Chr1 : 2298826 .. 2300295 (+)
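
The Location coordinates can be checked against the sequences listed on this page. The sketch below assumes the coordinates are 1-based and inclusive (the page does not state the convention); under that assumption the span is a whole number of codons and matches a 489-residue protein plus a stop codon. The names `START`, `END`, and `span_length` are illustrative only.

```python
# Coordinates copied from the Location line. The 1-based, inclusive
# convention is an assumption; it is not stated on the page.
START, END = 2298826, 2300295

def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1

length_nt = span_length(START, END)
codons = length_nt // 3

print(length_nt)      # 1470 nucleotides
print(length_nt % 3)  # 0, so the span is a whole number of codons
print(codons - 1)     # 489 residues once the stop codon is excluded
```
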
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACACTTCACTTTCCTATGCTTTTCTCTTCCTAACATTCTTCGCTTCCCTTCACTTCCCATCAATTCTCTCTCGCAAACTAACAACACAATCTCCTTATTCCACTAATACCTTTGATGTCTCGGCCTCCATTAACCAAGCCTTAAATGCCCTTTCCATCAAACCCAAACCCTTTCAAACCACTCACTCTAATTACCATTCAAGTTCACCTCTATCTCTCTCACTCCACCCTAGATTGACCGTTCATAACCCTTCTTACGAGGACTATGGTAGTCTTGTTAGGGCCCGACTCGCTCGTGGTGCCGCCCGAGCTCAATCCCTTAATAGAAAGCTTGAGCTCTCTTTAAAAGGGGGTAAACAATTTGGTAGAAGAATTAATGGATCTGATTCTACAAATTCACTTACTGCTCCCGTTACTTCAGGGGCGAGTCAAGGAGCTGGCGAATATTTTGCTCGGATTGGAGTCGGTCAACCTGTGCAATCGTATTTTTTTGTGCCTGATACTGGAAGTGATGTTTCGTGGCTACAATGTCAACCATGTGATGGTGAGAATGGTTGTTATAAACAAATTGGCCCGATATTTGACCCGAAATCGTCGTCTTCTTATAGTCCGCTGTCTTGCGATTCGGAACAATGTCATTTGTTGGATGAAGCTGCTTGTGATGCCAATTCATGTATCTACGAAGTTGAATATGGTGATGGATCATTCACAGTCGGTGAACTTGCCACCGAAACGTTCTCATTTCGACATTCTAATTCCATTCCTAATCTCCCGATTGGTTGTGGTCATGACAACGAAGGCCTCTTTGTTGGGGCTGCTGGTTTAATTGGCCTCGGTGGTGGGGCAATTTCCCTTTCTTCTCAATTAGAAGCGACGTCATTTTCATATTGTCTGGTCGACCTAGACTCAGAATCATCCTCCACTCTTGACTTCAACGCAGACCAACCAAGCGACTCACTCACCTCCCCACTTGTGAAAAATGATCGATTCCCCACGTTTCGGTACGTGAAAGTAATTGGGATGAGCGTCGGTGGGAAGCCTTTACCGATTTCCTCGTCAAGCTTTGAAATCGATGAATCAGGATCAGGGGGAATAATCGTGGATTCTGGGACGACTATAACTGAAATACCAAGTGATGTGTATGACGTGTTAAGAGACGCGTTCGTGGGGCTAACAAAGAACCTCCCACCAGCACCAGGGGTATCACCGTTTGATACTTGTTATGACTTATCAAGTCAATCGAATGTGGAGGTGCCAACCATAGCATTTATTTTACCAGGGGAAAACTCGCTACAACTACCAGCAAAAAATTGTTTGTTCCAAGTGGACTCAGCTGGAACTTTCTGCTTGGCGTTTCTTCCATCAACATTTCCACTTTCTATTATTGGGAACGTCCAACAACAAGGAATACGTGTCAGCTATGACTTGGCCAACTCTCTCGTCGGATTCTCGACTGATAAATGTTAA

mRNA sequence

ATGAACACTTCACTTTCCTATGCTTTTCTCTTCCTAACATTCTTCGCTTCCCTTCACTTCCCATCAATTCTCTCTCGCAAACTAACAACACAATCTCCTTATTCCACTAATACCTTTGATGTCTCGGCCTCCATTAACCAAGCCTTAAATGCCCTTTCCATCAAACCCAAACCCTTTCAAACCACTCACTCTAATTACCATTCAAGTTCACCTCTATCTCTCTCACTCCACCCTAGATTGACCGTTCATAACCCTTCTTACGAGGACTATGGTAGTCTTGTTAGGGCCCGACTCGCTCGTGGTGCCGCCCGAGCTCAATCCCTTAATAGAAAGCTTGAGCTCTCTTTAAAAGGGGGTAAACAATTTGGTAGAAGAATTAATGGATCTGATTCTACAAATTCACTTACTGCTCCCGTTACTTCAGGGGCGAGTCAAGGAGCTGGCGAATATTTTGCTCGGATTGGAGTCGGTCAACCTGTGCAATCGTATTTTTTTGTGCCTGATACTGGAAGTGATGTTTCGTGGCTACAATGTCAACCATGTGATGGTGAGAATGGTTGTTATAAACAAATTGGCCCGATATTTGACCCGAAATCGTCGTCTTCTTATAGTCCGCTGTCTTGCGATTCGGAACAATGTCATTTGTTGGATGAAGCTGCTTGTGATGCCAATTCATGTATCTACGAAGTTGAATATGGTGATGGATCATTCACAGTCGGTGAACTTGCCACCGAAACGTTCTCATTTCGACATTCTAATTCCATTCCTAATCTCCCGATTGGTTGTGGTCATGACAACGAAGGCCTCTTTGTTGGGGCTGCTGGTTTAATTGGCCTCGGTGGTGGGGCAATTTCCCTTTCTTCTCAATTAGAAGCGACGTCATTTTCATATTGTCTGGTCGACCTAGACTCAGAATCATCCTCCACTCTTGACTTCAACGCAGACCAACCAAGCGACTCACTCACCTCCCCACTTGTGAAAAATGATCGATTCCCCACGTTTCGGTACGTGAAAGTAATTGGGATGAGCGTCGGTGGGAAGCCTTTACCGATTTCCTCGTCAAGCTTTGAAATCGATGAATCAGGATCAGGGGGAATAATCGTGGATTCTGGGACGACTATAACTGAAATACCAAGTGATGTGTATGACGTGTTAAGAGACGCGTTCGTGGGGCTAACAAAGAACCTCCCACCAGCACCAGGGGTATCACCGTTTGATACTTGTTATGACTTATCAAGTCAATCGAATGTGGAGGTGCCAACCATAGCATTTATTTTACCAGGGGAAAACTCGCTACAACTACCAGCAAAAAATTGTTTGTTCCAAGTGGACTCAGCTGGAACTTTCTGCTTGGCGTTTCTTCCATCAACATTTCCACTTTCTATTATTGGGAACGTCCAACAACAAGGAATACGTGTCAGCTATGACTTGGCCAACTCTCTCGTCGGATTCTCGACTGATAAATGTTAA

Coding sequence (CDS)

ATGAACACTTCACTTTCCTATGCTTTTCTCTTCCTAACATTCTTCGCTTCCCTTCACTTCCCATCAATTCTCTCTCGCAAACTAACAACACAATCTCCTTATTCCACTAATACCTTTGATGTCTCGGCCTCCATTAACCAAGCCTTAAATGCCCTTTCCATCAAACCCAAACCCTTTCAAACCACTCACTCTAATTACCATTCAAGTTCACCTCTATCTCTCTCACTCCACCCTAGATTGACCGTTCATAACCCTTCTTACGAGGACTATGGTAGTCTTGTTAGGGCCCGACTCGCTCGTGGTGCCGCCCGAGCTCAATCCCTTAATAGAAAGCTTGAGCTCTCTTTAAAAGGGGGTAAACAATTTGGTAGAAGAATTAATGGATCTGATTCTACAAATTCACTTACTGCTCCCGTTACTTCAGGGGCGAGTCAAGGAGCTGGCGAATATTTTGCTCGGATTGGAGTCGGTCAACCTGTGCAATCGTATTTTTTTGTGCCTGATACTGGAAGTGATGTTTCGTGGCTACAATGTCAACCATGTGATGGTGAGAATGGTTGTTATAAACAAATTGGCCCGATATTTGACCCGAAATCGTCGTCTTCTTATAGTCCGCTGTCTTGCGATTCGGAACAATGTCATTTGTTGGATGAAGCTGCTTGTGATGCCAATTCATGTATCTACGAAGTTGAATATGGTGATGGATCATTCACAGTCGGTGAACTTGCCACCGAAACGTTCTCATTTCGACATTCTAATTCCATTCCTAATCTCCCGATTGGTTGTGGTCATGACAACGAAGGCCTCTTTGTTGGGGCTGCTGGTTTAATTGGCCTCGGTGGTGGGGCAATTTCCCTTTCTTCTCAATTAGAAGCGACGTCATTTTCATATTGTCTGGTCGACCTAGACTCAGAATCATCCTCCACTCTTGACTTCAACGCAGACCAACCAAGCGACTCACTCACCTCCCCACTTGTGAAAAATGATCGATTCCCCACGTTTCGGTACGTGAAAGTAATTGGGATGAGCGTCGGTGGGAAGCCTTTACCGATTTCCTCGTCAAGCTTTGAAATCGATGAATCAGGATCAGGGGGAATAATCGTGGATTCTGGGACGACTATAACTGAAATACCAAGTGATGTGTATGACGTGTTAAGAGACGCGTTCGTGGGGCTAACAAAGAACCTCCCACCAGCACCAGGGGTATCACCGTTTGATACTTGTTATGACTTATCAAGTCAATCGAATGTGGAGGTGCCAACCATAGCATTTATTTTACCAGGGGAAAACTCGCTACAACTACCAGCAAAAAATTGTTTGTTCCAAGTGGACTCAGCTGGAACTTTCTGCTTGGCGTTTCTTCCATCAACATTTCCACTTTCTATTATTGGGAACGTCCAACAACAAGGAATACGTGTCAGCTATGACTTGGCCAACTCTCTCGTCGGATTCTCGACTGATAAATGTTAA

Protein sequence

MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC*
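
The protein sequence above corresponds to a direct translation of the coding sequence. As a minimal sketch, assuming the standard genetic code applies, the following translates the first and last few codons of the CDS and reproduces the start (`MNTSLS`) and end (`DKC*`) of the protein. The helper `translate` and the constants are hypothetical names introduced for illustration; the database's own pipeline is not described on this page.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 18 and last 12 nucleotides of the CDS listed above.
print(translate("ATGAACACTTCACTTTCC"))  # MNTSLS
print(translate("GATAAATGTTAA"))        # DKC*
```

Passing the full CDS string to `translate` in the same way should yield the complete protein, ending in `*`.
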
BLAST of Csa1G022480 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 460.7 bits (1184), Expect = 2.0e-128
Identity = 239/480 (49.79%), Positives = 323/480 (67.29%), Query Frame = 1

Query: 25  SRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSN---------YHSSSPLSLS 84
           SR L+T  P  TN  DV +S+ Q    LS+ P     T +          ++SSSPLSL 
Sbjct: 26  SRSLST--PPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSDPVFFNSSSPLSLE 85

Query: 85  LHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRR-INGSDS--- 144
           LH R T     ++DY SL  +RL R ++R   +  K+  +++G  +   + +   D+   
Sbjct: 86  LHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQ 145

Query: 145 TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQI 204
           T  LT PV SGASQG+GEYF+RIGVG P +  + V DTGSDV+W+QC+PC     CY+Q 
Sbjct: 146 TEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCAD---CYQQS 205

Query: 205 GPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRH 264
            P+F+P SSS+Y  L+C + QC LL+ +AC +N C+Y+V YGDGSFTVGELAT+T +F +
Sbjct: 206 DPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGN 265

Query: 265 SNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLD 324
           S  I N+ +GCGHDNEGLF GAAGL+GLGGG +S+++Q++ATSFSYCLVD DS  SS+LD
Sbjct: 266 SGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLD 325

Query: 325 FNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDS 384
           FN+ Q      T+PL++N +  TF YV + G SVGG+ + +  + F++D SGSGG+I+D 
Sbjct: 326 FNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDC 385

Query: 385 GTTITEIPSDVYDVLRDAFVGLTKNLPP-APGVSPFDTCYDLSSQSNVEVPTIAFILPGE 444
           GT +T + +  Y+ LRDAF+ LT NL   +  +S FDTCYD SS S V+VPT+AF   G 
Sbjct: 386 GTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGG 445

Query: 445 NSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 490
            SL LPAKN L  VD +GTFC AF P++  LSIIGNVQQQG R++YDL+ +++G S +KC
Sbjct: 446 KSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
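
The percentages in each hit header are the match counts divided by the alignment length. Identity counts exact residue matches; the second figure also counts conservative substitutions, shown as `+` in the alignment midline. The sketch below recomputes the figures for the ASPG1_ARATH hit. The function name `percent` is illustrative, and the two-decimal formatting is assumed from the values shown on this page.

```python
def percent(matches: int, aligned: int) -> str:
    """Share of aligned positions, formatted to two decimals."""
    return f"{100 * matches / aligned:.2f}"

# Counts copied from the ASPG1_ARATH hit header above.
print(percent(239, 480))  # 49.79 (identical residues)
print(percent(323, 480))  # 67.29 (identical plus conservative substitutions)
```
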

BLAST of Csa1G022480 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 361.3 bits (926), Expect = 1.7e-98
Identity = 200/483 (41.41%), Positives = 277/483 (57.35%), Query Frame = 1

Query: 14  FFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSNYHSSSPLS 73
           FF  LH    L   L++ S  S   F +   +   L   +  P  F  TH +  SSS  +
Sbjct: 6   FFFFLH----LHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPD-FNNTHFSDESSSKYT 65

Query: 74  LSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRRINGSDS-- 133
           L L  R    + +Y ++   + AR+ R   R  ++ R++          G+ I  SDS  
Sbjct: 66  LRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRIS---------GKVIPSSDSRY 125

Query: 134 -TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 193
             N   + + SG  QG+GEYF RIGVG P +  + V D+GSD+ W+QCQPC     CYKQ
Sbjct: 126 EVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC---KLCYKQ 185

Query: 194 IGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFR 253
             P+FDP  S SY+ +SC S  C  ++ + C +  C YEV YGDGS+T G LA ET +F 
Sbjct: 186 SDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFA 245

Query: 254 HSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATS---FSYCLVDLDSESS 313
            +  + N+ +GCGH N G+F+GAAGL+G+GGG++S   QL   +   F YCLV   ++S+
Sbjct: 246 KT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST 305

Query: 314 STLDFNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGI 373
            +L F  +  P  +   PLV+N R P+F YV + G+ VGG  +P+    F++ E+G GG+
Sbjct: 306 GSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGV 365

Query: 374 IVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFIL 433
           ++D+GT +T +P+  Y   RD F   T NLP A GVS FDTCYDLS   +V VPT++F  
Sbjct: 366 VMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYF 425

Query: 434 PGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFST 490
                L LPA+N L  VD +GT+C AF  S   LSIIGN+QQ+GI+VS+D AN  VGF  
Sbjct: 426 TEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGP 470

BLAST of Csa1G022480 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 343.6 bits (880), Expect = 3.6e-93
Identity = 209/499 (41.88%), Positives = 287/499 (57.52%), Query Frame = 1

Query: 4   SLSYAFLFLTFFASLH-----FPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKP 63
           SL + FL L  F+SL      FP+  S  L   SP S      S S+ ++          
Sbjct: 11  SLCFFFLSLPSFSSLPSFQTLFPN--SHSLPCASPVSFQPDSDSESLLES---------E 70

Query: 64  FQTTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKG 123
           F++  S+  SSS ++L+L     + +    D   L  +RL R + R +S+      +   
Sbjct: 71  FESG-SDSESSSSITLNLDHIDALSSNKTPD--ELFSSRLQRDSRRVKSI------ATLA 130

Query: 124 GKQFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 183
            +  GR +  +      ++ V SG SQG+GEYF R+GVG P +  + V DTGSD+ WLQC
Sbjct: 131 AQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC 190

Query: 184 QPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDAN--SCIYEVEYGDGS 243
            PC     CY Q  PIFDP+ S +Y+ + C S  C  LD A C+    +C+Y+V YGDGS
Sbjct: 191 APC---RRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGS 250

Query: 244 FTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEAT--- 303
           FTVG+ +TET +FR  N +  + +GCGHDNEGLFVGAAGL+GLG G +S   Q       
Sbjct: 251 FTVGDFSTETLTFRR-NRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQ 310

Query: 304 SFSYCLVDLDSES--SSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLP- 363
            FSYCLVD  + S  SS +  NA     +  +PL+ N +  TF YV ++G+SVGG  +P 
Sbjct: 311 KFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPG 370

Query: 364 ISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYD 423
           +++S F++D+ G+GG+I+DSGT++T +    Y  +RDAF    K L  AP  S FDTC+D
Sbjct: 371 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFD 430

Query: 424 LSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQG 483
           LS+ + V+VPT+     G + + LPA N L  VD+ G FC AF  +   LSIIGN+QQQG
Sbjct: 431 LSNMNEVKVPTVVLHFRGAD-VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQG 484

Query: 484 IRVSYDLANSLVGFSTDKC 490
            RV YDLA+S VGF+   C
Sbjct: 491 FRVVYDLASSRVGFAPGGC 484

BLAST of Csa1G022480 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 265.0 bits (676), Expect = 1.6e-69
Identity = 154/386 (39.90%), Positives = 219/386 (56.74%), Query Frame = 1

Query: 111 KLELSLKGGKQFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTG 170
           K +L  +  ++  RR+   ++  +  + V +    G GEY   + +G P Q +  + DTG
Sbjct: 56  KFQLLERAIERGSRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTG 115

Query: 171 SDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEV 230
           SD+ W QCQPC     C+ Q  PIF+P+ SSS+S L C S+ C  L    C  N C Y  
Sbjct: 116 SDLIWTQCQPC---TQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTY 175

Query: 231 EYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVG-AAGLIGLGGGAISLSSQ 290
            YGDGS T G + TET +F  S SIPN+  GCG +N+G   G  AGL+G+G G +SL SQ
Sbjct: 176 GYGDGSETQGSMGTETLTF-GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQ 235

Query: 291 LEATSFSYCLVDLDSESSSTLDFNADQPSDSLTSP---LVKNDRFPTFRYVKVIGMSVGG 350
           L+ T FSYC+  + S + S L   +   S +  SP   L+++ + PTF Y+ + G+SVG 
Sbjct: 236 LDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 295

Query: 351 KPLPISSSSFEID-ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSP- 410
             LPI  S+F ++  +G+GGII+DSGTT+T   ++ Y  +R  F+    NLP   G S  
Sbjct: 296 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI-NLPVVNGSSSG 355

Query: 411 FDTCYDL-SSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSII 470
           FD C+   S  SN+++PT      G   L+LP++N  F   S G  CLA   S+  +SI 
Sbjct: 356 FDLCFQTPSDPSNLQIPTFVMHFDG-GDLELPSEN-YFISPSNGLICLAMGSSSQGMSIF 415

Query: 471 GNVQQQGIRVSYDLANSLVGFSTDKC 490
           GN+QQQ + V YD  NS+V F++ +C
Sbjct: 416 GNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Csa1G022480 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 1.5e-67
Identity = 155/386 (40.16%), Positives = 216/386 (55.96%), Query Frame = 1

Query: 112 LELSLKGGKQFGRRINGS-DSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTG 171
           ++ ++K G++  R IN    S++ +  PV +G     GEY   + +G P  S+  + DTG
Sbjct: 61  IKRAIKRGERRMRSINAMLQSSSGIETPVYAGD----GEYLMNVAIGTPDSSFSAIMDTG 120

Query: 172 SDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEV 231
           SD+ W QC+PC     C+ Q  PIF+P+ SSS+S L C+S+ C  L    C+ N C Y  
Sbjct: 121 SDLIWTQCEPC---TQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYTY 180

Query: 232 EYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVG-AAGLIGLGGGAISLSSQ 291
            YGDGS T G +ATETF+F  ++S+PN+  GCG DN+G   G  AGLIG+G G +SL SQ
Sbjct: 181 GYGDGSTTQGYMATETFTF-ETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQ 240

Query: 292 LEATSFSYCLVDLDSESSSTLDFN---ADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGG 351
           L    FSYC+    S S STL      +  P  S ++ L+ +   PT+ Y+ + G++VGG
Sbjct: 241 LGVGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGG 300

Query: 352 KPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPA-PGVSPF 411
             L I SS+F++ + G+GG+I+DSGTT+T +P D Y+ +  AF     NLP      S  
Sbjct: 301 DNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI-NLPTVDESSSGL 360

Query: 412 DTCYDLSSQ-SNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAF-LPSTFPLSII 471
            TC+   S  S V+VP I+    G   L L  +N L    + G  CLA    S   +SI 
Sbjct: 361 STCFQQPSDGSTVQVPEISMQFDG-GVLNLGEQNILIS-PAEGVICLAMGSSSQLGISIF 420

Query: 472 GNVQQQGIRVSYDLANSLVGFSTDKC 490
           GN+QQQ  +V YDL N  V F   +C
Sbjct: 421 GNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Csa1G022480 vs. TrEMBL
Match: A0A0A0LS14_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G022480 PE=3 SV=1)

HSP 1 Score: 982.6 bits (2539), Expect = 1.7e-283
Identity = 489/489 (100.00%), Positives = 489/489 (100.00%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180
           QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180

Query: 181 CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240
           CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG
Sbjct: 181 CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240

Query: 241 ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV 300
           ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV
Sbjct: 241 ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV 300

Query: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360
           DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE
Sbjct: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360

Query: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420
           SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP
Sbjct: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420

Query: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480
           TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS
Sbjct: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480

Query: 481 LVGFSTDKC 490
           LVGFSTDKC
Sbjct: 481 LVGFSTDKC 489

BLAST of Csa1G022480 vs. TrEMBL
Match: A0A0A0LPJ3_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_1G022490 PE=3 SV=1)

HSP 1 Score: 694.5 bits (1791), Expect = 9.2e-197
Identity = 353/490 (72.04%), Positives = 408/490 (83.27%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLS  FLFLT F SL FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  N-HSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180

Query: 181 PCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240
           PC  EN CYKQ  PIFDPKSSSSYSPLSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240

Query: 241 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300

Query: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360
           V+LDS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360

Query: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420

Query: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480

Query: 481 SLVGFSTDKC 490
           SLVGFST+KC
Sbjct: 481 SLVGFSTNKC 487

BLAST of Csa1G022480 vs. TrEMBL
Match: A0A0A0LPR7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G021980 PE=3 SV=1)

HSP 1 Score: 583.9 bits (1504), Expect = 1.8e-163
Identity = 282/334 (84.43%), Positives = 307/334 (91.92%), Query Frame = 1

Query: 156 VGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHL 215
           VGQP Q  FFV DTGSDV+WLQC PC G+NGCY+QI PIFDP+ SSSY+P+SCDSEQC L
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 216 LDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAG 275
           LDEA C+ NSCIY+VEYGDGSFT+GELATET +F HSNSIPN+ IGCGHDNEGLFVGA G
Sbjct: 63  LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122

Query: 276 LIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFR 335
           LIGLGGGAIS+SSQL+A+SFSYCLVD+DS S STLDFN D PSDSL SPLVKNDRFP+FR
Sbjct: 123 LIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFR 182

Query: 336 YVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKN 395
           YVKVIGMSVGGKPLPISSS FEIDESG GGIIVDSGTTIT++PSDVY+VLR+AF+GLT N
Sbjct: 183 YVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTN 242

Query: 396 LPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLP 455
           LPPAP +SPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAF+ 
Sbjct: 243 LPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFVS 302

Query: 456 STFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 490
           +TFPLSIIGN QQQGIRVSYDL NSLVGFST+KC
Sbjct: 303 ATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336

BLAST of Csa1G022480 vs. TrEMBL
Match: M5WPB5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004726mg PE=3 SV=1)

HSP 1 Score: 518.5 bits (1334), Expect = 9.1e-144
Identity = 270/497 (54.33%), Positives = 351/497 (70.62%), Query Frame = 1

Query: 3   TSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKP---KPF 62
           T+  Y  +   F  +  FPS  SR L+ ++   T   DVSAS+ QA + LS  P   KP 
Sbjct: 4   TAFLYLAILSAFTLTSLFPSTHSRSLSEET---TTLLDVSASLTQAHDVLSFNPQTLKPL 63

Query: 63  --QTTHSNYHSSSPL----SLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLE 122
             Q T +  H+ +PL    SL L PR  +HN  ++DY SLV++RL R +AR  SL+ KL+
Sbjct: 64  DRQETQAQAHTLTPLNSSFSLQLLPRDALHNSQHKDYESLVQSRLGRDSARVNSLHTKLQ 123

Query: 123 LSLKGGKQFGRR-INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSD 182
           L ++  K+     ++       L+ PV SG SQG+GEYF RIGVG P +S + V DTGSD
Sbjct: 124 LVVQNIKKSDLEPMHTEIRPEDLSTPVVSGVSQGSGEYFTRIGVGTPAKSLYMVLDTGSD 183

Query: 183 VSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEY 242
           ++WLQC+PC   + CY+Q  P+F+P  SS+Y P++CDS QCH L  +AC A+ C+Y+V Y
Sbjct: 184 INWLQCEPC---SDCYQQTDPVFNPTGSSTYRPVTCDSAQCHSLHVSACRADKCLYQVSY 243

Query: 243 GDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEA 302
           GDGS+TVG+  TET SF +S +I N+ +GCGHDNEGLFVGAAGL+GLGGGA+SL SQ +A
Sbjct: 244 GDGSYTVGDFVTETVSFGNSGAIHNVGLGCGHDNEGLFVGAAGLLGLGGGALSLPSQFKA 303

Query: 303 TSFSYCLVDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPIS 362
           TSFSYCLV+ DS +SSTL+FN+  PSDS+T+PL+K+ R  TF YV + G SVGG+P+ + 
Sbjct: 304 TSFSYCLVNRDSSTSSTLEFNSAPPSDSVTAPLLKDSRVETFYYVGLKGFSVGGQPVSVP 363

Query: 363 SSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLS 422
            S FE+DESG+GGIIVDSGT IT + ++ Y+ LRDAF  LT++LP A G + FDTCYDLS
Sbjct: 364 PSVFEVDESGNGGIIVDSGTAITRLQTEAYNSLRDAFKRLTRDLPSASGFALFDTCYDLS 423

Query: 423 SQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIR 482
           S+S V+VPT++F+     SL LPAKN L  VDSAGTFC AF P++   SIIGNVQQQG R
Sbjct: 424 SRSRVQVPTVSFLFADGKSLSLPAKNYLIPVDSAGTFCFAFAPTSSSPSIIGNVQQQGTR 483

Query: 483 VSYDLANSLVGFSTDKC 490
           VSYDLAN+ VGFS +KC
Sbjct: 484 VSYDLANNRVGFSPNKC 494

BLAST of Csa1G022480 vs. TrEMBL
Match: A0A0A0LQD2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042220 PE=3 SV=1)

HSP 1 Score: 509.2 bits (1310), Expect = 5.5e-141
Identity = 264/487 (54.21%), Positives = 339/487 (69.61%), Query Frame = 1

Query: 9   FLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSNYH- 68
           FLFL FF SL FP  LSR  +  SP+S+ + DVSAS+ QA   L   P    +     H 
Sbjct: 10  FLFL-FFLSL-FPFTLSRSSSHLSPHSSASLDVSASLQQANQVLKFDPTASISFQQQVHL 69

Query: 69  ----SSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFG 128
               SS   SL LHPR ++HN  ++DY SLV +RL+R ++R +S+  +LE +L   K+  
Sbjct: 70  VPSNSSFSFSLQLHPRDSLHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSD 129

Query: 129 RR-INGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCD 188
              +        L+ P+ SG SQG+GEYF+R+GVGQP + ++ V DTGSD++WLQCQPC 
Sbjct: 130 LEPLKTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC- 189

Query: 189 GENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGEL 248
               CY+Q  PIFDP+SSSS++ L C+S+QC  L+ + C A+ C+Y+V YGDGSFTVGE 
Sbjct: 190 --TDCYQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEF 249

Query: 249 ATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLVDL 308
             ET +F +S  I N+ +GCGHDNEGLFVG+AGL+GLGGG++SL+SQ++A+SFSYCLVD 
Sbjct: 250 VIETLTFGNSGMINNVAVGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCLVDR 309

Query: 309 DSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESG 368
           DS SSS L+FN+  PSDS+ +PL+K+ +  TF YV + GMSVGG+ L I  + F++D+SG
Sbjct: 310 DSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSG 369

Query: 369 SGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTI 428
            GGIIVDSGT IT + +  Y+ LRDAFV  T  L    G + FDTCYDLSSQS V +PT+
Sbjct: 370 YGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTV 429

Query: 429 AFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLV 488
           +F   G  SLQLP KN L  VDS GTFC AF P+T  LSIIGNVQQQG RV YDLANS+V
Sbjct: 430 SFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVV 489

Query: 489 GFSTDKC 490
           GFS  KC
Sbjct: 490 GFSPHKC 491

BLAST of Csa1G022480 vs. TAIR10
Match: AT1G25510.1 (AT1G25510.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 478.0 bits (1229), Expect = 6.9e-135
Identity = 248/489 (50.72%), Positives = 333/489 (68.10%), Query Frame = 1

Query: 4   SLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTH 63
           S +Y+F F  FF + H  S+ SR L   S  +T+  +V+ SI++     S +    Q   
Sbjct: 2   SPNYSFFFFIFFLTSH-SSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLN--QQEE 61

Query: 64  SNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKG-GKQF 123
             + +SS  SL LH R++V    + DY SL  ARL R  AR +SL  +L+L++    K  
Sbjct: 62  QTHSASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKAD 121

Query: 124 GRRINGSDSTNS--LTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 183
            + I+   +T    + AP+ SG +QG+GEYF R+G+G+P +  + V DTGSDV+WLQC P
Sbjct: 122 LKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP 181

Query: 184 CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 243
           C     CY Q  PIF+P SSSSY PLSCD+ QC+ L+ + C   +C+YEV YGDGS+TVG
Sbjct: 182 CAD---CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVG 241

Query: 244 ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV 303
           + ATET +   S  + N+ +GCGH NEGLFVGAAGL+GLGGG ++L SQL  TSFSYCLV
Sbjct: 242 DFATETLTIG-STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLV 301

Query: 304 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 363
           D DS+S+ST+DF      D++ +PL++N +  TF Y+ + G+SVGG+ L I  SSFE+DE
Sbjct: 302 DRDSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDE 361

Query: 364 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 423
           SGSGGII+DSGT +T + +++Y+ LRD+FV  T +L  A GV+ FDTCY+LS+++ VEVP
Sbjct: 362 SGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVP 421

Query: 424 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 483
           T+AF  PG   L LPAKN +  VDS GTFCLAF P+   L+IIGNVQQQG RV++DLANS
Sbjct: 422 TVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANS 481

Query: 484 LVGFSTDKC 490
           L+GFS++KC
Sbjct: 482 LIGFSSNKC 483

BLAST of Csa1G022480 vs. TAIR10
Match: AT3G18490.1 (AT3G18490.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 460.7 bits (1184), Expect = 1.1e-129
Identity = 239/480 (49.79%), Positives = 323/480 (67.29%), Query Frame = 1

Query: 25  SRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSN---------YHSSSPLSLS 84
           SR L+T  P  TN  DV +S+ Q    LS+ P     T +          ++SSSPLSL 
Sbjct: 26  SRSLST--PPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSDPVFFNSSSPLSLE 85

Query: 85  LHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRR-INGSDS--- 144
           LH R T     ++DY SL  +RL R ++R   +  K+  +++G  +   + +   D+   
Sbjct: 86  LHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQ 145

Query: 145 TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQI 204
           T  LT PV SGASQG+GEYF+RIGVG P +  + V DTGSDV+W+QC+PC     CY+Q 
Sbjct: 146 TEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCAD---CYQQS 205

Query: 205 GPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRH 264
            P+F+P SSS+Y  L+C + QC LL+ +AC +N C+Y+V YGDGSFTVGELAT+T +F +
Sbjct: 206 DPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGN 265

Query: 265 SNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLD 324
           S  I N+ +GCGHDNEGLF GAAGL+GLGGG +S+++Q++ATSFSYCLVD DS  SS+LD
Sbjct: 266 SGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLD 325

Query: 325 FNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDS 384
           FN+ Q      T+PL++N +  TF YV + G SVGG+ + +  + F++D SGSGG+I+D 
Sbjct: 326 FNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDC 385

Query: 385 GTTITEIPSDVYDVLRDAFVGLTKNLPP-APGVSPFDTCYDLSSQSNVEVPTIAFILPGE 444
           GT +T + +  Y+ LRDAF+ LT NL   +  +S FDTCYD SS S V+VPT+AF   G 
Sbjct: 386 GTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGG 445

Query: 445 NSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 490
            SL LPAKN L  VD +GTFC AF P++  LSIIGNVQQQG R++YDL+ +++G S +KC
Sbjct: 446 KSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Csa1G022480 vs. TAIR10
Match: AT3G20015.1 (AT3G20015.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 361.3 bits (926), Expect = 9.4e-100
Identity = 200/483 (41.41%), Positives = 277/483 (57.35%), Query Frame = 1

Query: 14  FFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSNYHSSSPLS 73
           FF  LH    L   L++ S  S   F +   +   L   +  P  F  TH +  SSS  +
Sbjct: 6   FFFFLH----LHLHLSSSSSISFPDFQIIDVLQPPLTVTATLPD-FNNTHFSDESSSKYT 65

Query: 74  LSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRRINGSDS-- 133
           L L  R    + +Y ++   + AR+ R   R  ++ R++          G+ I  SDS  
Sbjct: 66  LRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRIS---------GKVIPSSDSRY 125

Query: 134 -TNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQ 193
             N   + + SG  QG+GEYF RIGVG P +  + V D+GSD+ W+QCQPC     CYKQ
Sbjct: 126 EVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC---KLCYKQ 185

Query: 194 IGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFR 253
             P+FDP  S SY+ +SC S  C  ++ + C +  C YEV YGDGS+T G LA ET +F 
Sbjct: 186 SDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFA 245

Query: 254 HSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATS---FSYCLVDLDSESS 313
            +  + N+ +GCGH N G+F+GAAGL+G+GGG++S   QL   +   F YCLV   ++S+
Sbjct: 246 KT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDST 305

Query: 314 STLDFNADQ-PSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGI 373
            +L F  +  P  +   PLV+N R P+F YV + G+ VGG  +P+    F++ E+G GG+
Sbjct: 306 GSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGV 365

Query: 374 IVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFIL 433
           ++D+GT +T +P+  Y   RD F   T NLP A GVS FDTCYDLS   +V VPT++F  
Sbjct: 366 VMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFYF 425

Query: 434 PGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFST 490
                L LPA+N L  VD +GT+C AF  S   LSIIGN+QQ+GI+VS+D AN  VGF  
Sbjct: 426 TEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGP 470

BLAST of Csa1G022480 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 343.6 bits (880), Expect = 2.0e-94
Identity = 209/499 (41.88%), Positives = 287/499 (57.52%), Query Frame = 1

Query: 4   SLSYAFLFLTFFASLH-----FPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKP 63
           SL + FL L  F+SL      FP+  S  L   SP S      S S+ ++          
Sbjct: 11  SLCFFFLSLPSFSSLPSFQTLFPN--SHSLPCASPVSFQPDSDSESLLES---------E 70

Query: 64  FQTTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKG 123
           F++  S+  SSS ++L+L     + +    D   L  +RL R + R +S+      +   
Sbjct: 71  FESG-SDSESSSSITLNLDHIDALSSNKTPD--ELFSSRLQRDSRRVKSI------ATLA 130

Query: 124 GKQFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 183
            +  GR +  +      ++ V SG SQG+GEYF R+GVG P +  + V DTGSD+ WLQC
Sbjct: 131 AQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC 190

Query: 184 QPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDAN--SCIYEVEYGDGS 243
            PC     CY Q  PIFDP+ S +Y+ + C S  C  LD A C+    +C+Y+V YGDGS
Sbjct: 191 APC---RRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGS 250

Query: 244 FTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEAT--- 303
           FTVG+ +TET +FR  N +  + +GCGHDNEGLFVGAAGL+GLG G +S   Q       
Sbjct: 251 FTVGDFSTETLTFRR-NRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQ 310

Query: 304 SFSYCLVDLDSES--SSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLP- 363
            FSYCLVD  + S  SS +  NA     +  +PL+ N +  TF YV ++G+SVGG  +P 
Sbjct: 311 KFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPG 370

Query: 364 ISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYD 423
           +++S F++D+ G+GG+I+DSGT++T +    Y  +RDAF    K L  AP  S FDTC+D
Sbjct: 371 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFD 430

Query: 424 LSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQG 483
           LS+ + V+VPT+     G + + LPA N L  VD+ G FC AF  +   LSIIGN+QQQG
Sbjct: 431 LSNMNEVKVPTVVLHFRGAD-VSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQG 484

Query: 484 IRVSYDLANSLVGFSTDKC 490
            RV YDLA+S VGF+   C
Sbjct: 491 FRVVYDLASSRVGFAPGGC 484
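
The percentages in an HSP header are the identical (or positive-scoring) column counts divided by the alignment length. The following is a minimal sketch, not part of the original page, that parses such a header line and recomputes both percentages. The names `parse_hsp_stats` and `percent` are hypothetical helpers introduced only for illustration; the regular expression accepts both the "Postives" spelling emitted on this page and the corrected "Positives".

```python
import re

# Matches the HSP statistics line shown in the BLAST reports on this page.
# "Posi?tives" tolerates the "Postives" spelling used in the raw output.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract identity/positive counts and reported percentages from a line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "positives_length": int(pos_length),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = ("Identity = 209/499 (41.88%), Postives = 287/499 (57.52%), "
              "Query Frame = 1")
    stats = parse_hsp_stats(header)
    # Recompute the percentages from the raw counts and compare.
    print(percent(stats["identities"], stats["alignment_length"]))
    print(percent(stats["positives"], stats["alignment_length"]))
```

Applied to the AT1G01300.1 header above, the recomputed values agree with the reported 41.88% identity and 57.52% positives.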

BLAST of Csa1G022480 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 342.8 bits (878), Expect = 3.5e-94
Identity = 212/497 (42.66%), Positives = 285/497 (57.34%), Query Frame = 1

Query: 10  LFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQTTHSNYHSS 69
           L  + FA L F S  S +  T      NT   SA+++         P+    T  +  S 
Sbjct: 9   LAFSVFAVLFFTSSASSQYQT---LVVNTLPSSATLSW--------PESESLTDESL-SE 68

Query: 70  SPLSLSLHPRLTVHNPSYEDYG--SLVRARLARGAARAQSLNRKLELSLKGGKQFGRRIN 129
           S  SLS+H        S+ D     L   RL R + R +S+     +S   G+   +R  
Sbjct: 69  STTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVST--GRNATKRT- 128

Query: 130 GSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGC 189
              +    +  V SG SQG+GEYF R+GVG P  + + V DTGSDV WLQC PC     C
Sbjct: 129 -PRTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC---KAC 188

Query: 190 YKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAA-C---DANSCIYEVEYGDGSFTVGELA 249
           Y Q   IFDPK S +++ + C S  C  LD+++ C    + +C+Y+V YGDGSFT G+ +
Sbjct: 189 YNQTDAIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFS 248

Query: 250 TETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEAT---SFSYCLV 309
           TET +F H   + ++P+GCGHDNEGLFVGAAGL+GLG G +S  SQ +      FSYCLV
Sbjct: 249 TETLTF-HGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLV 308

Query: 310 DLDSESSS-----TLDF-NADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLP-ISS 369
           D  S  SS     T+ F NA  P  S+ +PL+ N +  TF Y++++G+SVGG  +P +S 
Sbjct: 309 DRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSE 368

Query: 370 SSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAF-VGLTKNLPPAPGVSPFDTCYDLS 429
           S F++D +G+GG+I+DSGT++T +    Y  LRDAF +G TK L  AP  S FDTC+DLS
Sbjct: 369 SQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATK-LKRAPSYSLFDTCFDLS 428

Query: 430 SQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIR 489
             + V+VPT+ F   G   + LPA N L  V++ G FC AF  +   LSIIGN+QQQG R
Sbjct: 429 GMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFR 483

BLAST of Csa1G022480 vs. NCBI nr
Match: gi|778664722|ref|XP_004138237.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 982.6 bits (2539), Expect = 2.4e-283
Identity = 489/489 (100.00%), Positives = 489/489 (100.00%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ
Sbjct: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK
Sbjct: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180
           QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP
Sbjct: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180

Query: 181 CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240
           CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG
Sbjct: 181 CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240

Query: 241 ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV 300
           ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV
Sbjct: 241 ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV 300

Query: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360
           DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE
Sbjct: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360

Query: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420
           SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP
Sbjct: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420

Query: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480
           TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS
Sbjct: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480

Query: 481 LVGFSTDKC 490
           LVGFSTDKC
Sbjct: 481 LVGFSTDKC 489

BLAST of Csa1G022480 vs. NCBI nr
Match: gi|659106557|ref|XP_008453383.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo])

HSP 1 Score: 844.3 bits (2180), Expect = 1.0e-241
Identity = 424/489 (86.71%), Positives = 449/489 (91.82%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLSYA LFLT F  L FPSILSRKLT QSPYST TFDVSASINQALNALSIKPKPFQ
Sbjct: 2   MNTSLSYALLFLTIFTFLQFPSILSRKLTAQSPYSTTTFDVSASINQALNALSIKPKPFQ 61

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           T HS YHS+SPLSLSLHPRLTVHNPSY+DYG+LVRARLAR A R QSLNRKLELSL G K
Sbjct: 62  T-HS-YHSNSPLSLSLHPRLTVHNPSYKDYGTLVRARLARHATRVQSLNRKLELSLNGAK 121

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQP 180
           QFG+RINGS STNSLTAPVTSGAS G GEYFARIGVGQPVQS+F VPDTGSDV+WLQC+P
Sbjct: 122 QFGKRINGSASTNSLTAPVTSGASHGDGEYFARIGVGQPVQSFFLVPDTGSDVTWLQCKP 181

Query: 181 CDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVG 240
           C  EN C+KQ+ PIFDPKSSSSYS LSC+SEQC LLDEA C +NSCIYEVEYGDGSFT+G
Sbjct: 182 CANENACFKQLDPIFDPKSSSSYSSLSCNSEQCQLLDEAGCSSNSCIYEVEYGDGSFTIG 241

Query: 241 ELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLV 300
           ELATET SF +SNSIPNLPIGCGHDNEGLF  AAGLIGLGGGAISLSSQL+A+SFSYCLV
Sbjct: 242 ELATETLSFGNSNSIPNLPIGCGHDNEGLFDAAAGLIGLGGGAISLSSQLQASSFSYCLV 301

Query: 301 DLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 360
           DLDS+SSSTLDFNADQPSDSLTSPLVKN+RFP+FRYVKVIGMSVGGK LPISSS FEIDE
Sbjct: 302 DLDSDSSSTLDFNADQPSDSLTSPLVKNNRFPSFRYVKVIGMSVGGKRLPISSSRFEIDE 361

Query: 361 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 420
           SGSGGIIVDSGTTIT++PSDVYDVLRDAFVGLT NLP APGVSPFDTCYDLSSQS+VEVP
Sbjct: 362 SGSGGIIVDSGTTITQLPSDVYDVLRDAFVGLTTNLPTAPGVSPFDTCYDLSSQSSVEVP 421

Query: 421 TIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANS 480
            IAFILPG  SL+LPAKNCL QVDSAGTFCLAFLP TFPLSIIGNVQQQGIRVSYDL NS
Sbjct: 422 IIAFILPGGKSLKLPAKNCLIQVDSAGTFCLAFLPGTFPLSIIGNVQQQGIRVSYDLDNS 481

Query: 481 LVGFSTDKC 490
           +VGF+T+KC
Sbjct: 482 IVGFATNKC 488

BLAST of Csa1G022480 vs. NCBI nr
Match: gi|659106559|ref|XP_008453384.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo])

HSP 1 Score: 703.4 bits (1814), Expect = 2.9e-199
Identity = 360/490 (73.47%), Positives = 414/490 (84.49%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           M TSLS  FLFLT F SL F SILSRKLT QSPYST+ FDV AS NQALNALSIKPK  Q
Sbjct: 1   MKTSLSSVFLFLTIFTSLQFSSILSRKLT-QSPYSTSIFDVLASTNQALNALSIKPKHLQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
           T HS+  +SS LSL L+PRL++HNPSY+DY SLVRARLAR AAR Q LNR LE SL GGK
Sbjct: 61  T-HSHLPNSS-LSLPLYPRLSLHNPSYKDYDSLVRARLARDAARVQFLNRNLEHSLNGGK 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180
            FG   NGS   +S+TAPV SG S+G+G EY A++GVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 DFGEVTNGSLIGDSITAPVVSGQSKGSGAEYLAQVGVGQPVKLFYLVPDTGSDVTWLQCQ 180

Query: 181 PCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240
           PC  EN CYKQI PIFDPKSSSSY+PLSC+S+QC LLD   C++ +CIY+V YGDGSFT 
Sbjct: 181 PCATENACYKQIDPIFDPKSSSSYTPLSCNSQQCGLLDRPNCNSGTCIYQVHYGDGSFTT 240

Query: 241 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300

Query: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360
           V++DS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKVIG+SVGGK LPISS+ FEID
Sbjct: 301 VNMDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVIGISVGGKTLPISSTRFEID 360

Query: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+LSSQSNVEV
Sbjct: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNLSSQSNVEV 420

Query: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480
           PTIAF+L G  SL+LPA+N L +VD+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFVLSGGTSLRLPARNYLIRVDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480

Query: 481 SLVGFSTDKC 490
           SLVGFST+KC
Sbjct: 481 SLVGFSTNKC 487

BLAST of Csa1G022480 vs. NCBI nr
Match: gi|449440933|ref|XP_004138238.1| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 694.5 bits (1791), Expect = 1.3e-196
Identity = 353/490 (72.04%), Positives = 408/490 (83.27%), Query Frame = 1

Query: 1   MNTSLSYAFLFLTFFASLHFPSILSRKLTTQSPYSTNTFDVSASINQALNALSIKPKPFQ 60
           MNTSLS  FLFLT F SL FPSILSRKLT  S YST+ FDVSAS NQAL+ALSIKPKP Q
Sbjct: 1   MNTSLSSVFLFLTIFTSLQFPSILSRKLTPSS-YSTSIFDVSASTNQALDALSIKPKPLQ 60

Query: 61  TTHSNYHSSSPLSLSLHPRLTVHNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGK 120
             HS+   +SP SL L+PRL +HNPSY+DY +LVRARL R AAR Q LNR LE SL GG 
Sbjct: 61  N-HSHL-PNSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGT 120

Query: 121 QFGRRINGSDSTNSLTAPVTSGASQGAG-EYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ 180
            FG  IN S   +S+TAPV SG S+G+G EY A+IGVGQPV+ ++ VPDTGSDV+WLQCQ
Sbjct: 121 HFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQ 180

Query: 181 PCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTV 240
           PC  EN CYKQ  PIFDPKSSSSYSPLSC+S+QC LLD+A C++++CIY+V YGDGSFT 
Sbjct: 181 PCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTT 240

Query: 241 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 300
           GELATET SF +SNSIPNLPIGCGHDNEGLF G AGLIGLGGGAISLSSQL+A+SFSYCL
Sbjct: 241 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 300

Query: 301 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 360
           V+LDS+SSSTL+FN++ PSDSLTSPLVKNDRF ++RYVKV+G+SVGGK LPIS + FEID
Sbjct: 301 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 360

Query: 361 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 420
           ESG GGIIVDSGT I+ +PSDVY+ LR+AFV LT +L PAPG+S FDTCY+ S QSNVEV
Sbjct: 361 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 420

Query: 421 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 480
           PTIAF+L    SL+LPA+N L  +D+AGT+CLAF+ +   LSIIG+ QQQGIRVSYDL N
Sbjct: 421 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 480

Query: 481 SLVGFSTDKC 490
           SLVGFST+KC
Sbjct: 481 SLVGFSTNKC 487

BLAST of Csa1G022480 vs. NCBI nr
Match: gi|778664719|ref|XP_004138341.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus])

HSP 1 Score: 583.9 bits (1504), Expect = 2.5e-163
Identity = 282/334 (84.43%), Positives = 307/334 (91.92%), Query Frame = 1

Query: 156 VGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHL 215
           VGQP Q  FFV DTGSDV+WLQC PC G+NGCY+QI PIFDP+ SSSY+P+SCDSEQC L
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 216 LDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAG 275
           LDEA C+ NSCIY+VEYGDGSFT+GELATET +F HSNSIPN+ IGCGHDNEGLFVGA G
Sbjct: 63  LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122

Query: 276 LIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFR 335
           LIGLGGGAIS+SSQL+A+SFSYCLVD+DS S STLDFN D PSDSL SPLVKNDRFP+FR
Sbjct: 123 LIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFR 182

Query: 336 YVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKN 395
           YVKVIGMSVGGKPLPISSS FEIDESG GGIIVDSGTTIT++PSDVY+VLR+AF+GLT N
Sbjct: 183 YVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTN 242

Query: 396 LPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLP 455
           LPPAP +SPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAF+ 
Sbjct: 243 LPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFVS 302

Query: 456 STFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 490
           +TFPLSIIGN QQQGIRVSYDL NSLVGFST+KC
Sbjct: 303 ATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
ASPG1_ARATH  2.0e-128  49.79     Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
ASPG2_ARATH  1.7e-98   41.41     Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ...
APF2_ARATH   3.6e-93   41.88     Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
NEP1_NEPGR   1.6e-69   39.90     Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
NEP2_NEPGR   1.5e-67   40.16     Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1

Match Name        E-value   Identity  Description
A0A0A0LS14_CUCSA  1.7e-283  100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_1G022480 PE=3 SV=1
A0A0A0LPJ3_CUCSA  9.2e-197  72.04     Aspartic proteinase nepenthesin-1 OS=Cucumis sativus GN=Csa_1G022490 PE=3 SV=1
A0A0A0LPR7_CUCSA  1.8e-163  84.43     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G021980 PE=3 SV=1
M5WPB5_PRUPE      9.1e-144  54.33     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004726mg PE=3 SV=1
A0A0A0LQD2_CUCSA  5.5e-141  54.21     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G042220 PE=3 SV=1

Match Name   E-value   Identity  Description
AT1G25510.1  6.9e-135  50.72     Eukaryotic aspartyl protease family protein
AT3G18490.1  1.1e-129  49.79     Eukaryotic aspartyl protease family protein
AT3G20015.1  9.4e-100  41.41     Eukaryotic aspartyl protease family protein
AT1G01300.1  2.0e-94   41.88     Eukaryotic aspartyl protease family protein
AT3G61820.1  3.5e-94   42.66     Eukaryotic aspartyl protease family protein

Match Name                        E-value   Identity  Description
gi|778664722|ref|XP_004138237.2|  2.4e-283  100.00    PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus]
gi|659106557|ref|XP_008453383.1|  1.0e-241  86.71     PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1 [Cucumis melo]
gi|659106559|ref|XP_008453384.1|  2.9e-199  73.47     PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis melo]
gi|449440933|ref|XP_004138238.1|  1.3e-196  72.04     PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus]
gi|778664719|ref|XP_004138341.2|  2.5e-163  84.43     PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001461  Aspartic_peptidase_A1
IPR021109  Peptidase_aspartic_dom_sf

Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity

Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0050896 response to stimulus
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa1G022480.1  Csa1G022480.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description               Source   Source Term       Source Description                Alignment
IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683         ASPARTYL PROTEASES                coord: 2..24, score: 1.7E-200; coord: 62..489, score: 1.7E
IPR021109  Aspartic peptidase domain     GENE3D   G3DSA:2.40.70.10  -                                 coord: 320..489, score: 3.2E-44; coord: 134..308, score: 2.3
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases                    coord: 145..489, score: 7.37
None       No IPR available              PANTHER  PTHR13683:SF274   ASPARTYL PROTEASE FAMILY PROTEIN  coord: 62..489, score: 1.7E-200; coord: 2..24, score: 1.7E

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene         Organism                      Block
Csa1G022480  Silver-seed gourd             carcuB0416
Csa1G022480  Silver-seed gourd             carcuB0491
Csa1G022480  Silver-seed gourd             carcuB0543
Csa1G022480  Silver-seed gourd             carcuB0622
Csa1G022480  Silver-seed gourd             carcuB0856
Csa1G022480  Silver-seed gourd             carcuB1056
Csa1G022480  Watermelon (97103) v2         cuwmbB055
Csa1G022480  Watermelon (97103) v2         cuwmbB061
Csa1G022480  Watermelon (97103) v2         cuwmbB079
Csa1G022480  Wax gourd                     cuwgoB012
Csa1G022480  Wax gourd                     cuwgoB040
Csa1G022480  Wax gourd                     cuwgoB071
Csa1G022480  Cucumber (Chinese Long) v2    cucuB000
Csa1G022480  Cucumber (Chinese Long) v2    cucuB030
Csa1G022480  Cucumber (Gy14) v1            cgycuB299
Csa1G022480  Cucumber (Gy14) v1            cgycuB405
Csa1G022480  Cucurbita maxima (Rimu)       cmacuB061
Csa1G022480  Cucurbita maxima (Rimu)       cmacuB087
Csa1G022480  Cucurbita maxima (Rimu)       cmacuB221
Csa1G022480  Cucurbita maxima (Rimu)       cmacuB393
Csa1G022480  Cucurbita maxima (Rimu)       cmacuB404
Csa1G022480  Cucurbita maxima (Rimu)       cmacuB686
Csa1G022480  Cucurbita moschata (Rifu)     cmocuB076
Csa1G022480  Cucurbita moschata (Rifu)     cmocuB050
Csa1G022480  Cucurbita moschata (Rifu)     cmocuB207
Csa1G022480  Cucurbita moschata (Rifu)     cmocuB389
Csa1G022480  Cucurbita moschata (Rifu)     cmocuB679
Csa1G022480  Melon (DHL92) v3.5.1          cumeB010
Csa1G022480  Melon (DHL92) v3.5.1          cumeB033
Csa1G022480  Melon (DHL92) v3.5.1          cumeB055
Csa1G022480  Watermelon (Charleston Gray)  cuwcgB058
Csa1G022480  Watermelon (Charleston Gray)  cuwcgB063
Csa1G022480  Watermelon (Charleston Gray)  cuwcgB082
Csa1G022480  Watermelon (97103) v1         cuwmB039
Csa1G022480  Watermelon (97103) v1         cuwmB063
Csa1G022480  Watermelon (97103) v1         cuwmB095
Csa1G022480  Cucurbita pepo (Zucchini)     cpecuB024
Csa1G022480  Cucurbita pepo (Zucchini)     cpecuB352
Csa1G022480  Cucurbita pepo (Zucchini)     cpecuB381
Csa1G022480  Cucurbita pepo (Zucchini)     cpecuB509
Csa1G022480  Cucurbita pepo (Zucchini)     cpecuB654
Csa1G022480  Bottle gourd (USVL1VR-Ls)     culsiB025
Csa1G022480  Bottle gourd (USVL1VR-Ls)     culsiB054
Csa1G022480  Bottle gourd (USVL1VR-Ls)     culsiB071
Csa1G022480  Cucumber (Gy14) v2            cgybcuB001
Csa1G022480  Cucumber (Gy14) v2            cgybcuB005
Csa1G022480  Cucumber (Gy14) v2            cgybcuB196
Csa1G022480  Melon (DHL92) v3.6.1          cumedB005
Csa1G022480  Melon (DHL92) v3.6.1          cumedB028
Csa1G022480  Melon (DHL92) v3.6.1          cumedB047