CsGy3G030020 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy3G030020
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Descriptionaspartic proteinase PCS1
LocationGy14Chr3: 30208010 .. 30209954 (+)
RNA-Seq ExpressionCsGy3G030020
SyntenyCsGy3G030020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGAATAGCGAACACTCTCTTGAGATCTCAGCGTCGATTTCTTTATTCTTTTTTGGAATTTATTGGCTTTTTTTGCTTTTAATTACTGGGATTTATATTTGTTGGATCTTGGATTTGGCTGTGGAAAGGAAAAGTTTCTCAGAGGATTGTAAAGATTTCATTTCGTAACACTCTGTTCTCGTAAACCAAAACAGAGCCAACCAACTTCTTCGGCTTTCTTCCTTCTATATCTTCCAACTGCTCCAACTGTCTATCAATTCCAAATTTTCAGGAAAGGAAAAAGAAAATGAGGGATTATTGCTTTGCTTTTAATTTCTCAAGCGTTAAATTTCTCAAATCTTGTTTTCTTTTCTTCTTCTGTACTTTGTTTTCTGTCTTTCACAGTATTCATCTTTGTTCCTCACTAAACCCAGCTCTTGTTCTTCCTCTAAAAACCCAAGTGATTCCTCCGGAATCTGTCCGGAGATCTCCCGATAAGCTTCCTTTCCGGCATAATATCAGTCTCACCGTCTCACTGACAGTCGGAACCCCACCGCAAAATGTCACGATGGTCATCGACACTGGCAGTGAACTCTCATGGCTCCATTGCAACACATCACAAAACTCTTCTTCTTCATCTTCGACGTTCAACCCGGTCTGGTCATCGTCTTACAGTCCGATCCCTTGCTCTTCCTCCACCTGCACCGACCAAACTCGAGATTTTCCTATTCGGCCTTCTTGTGATTCCAATCAGTTCTGTCACGCCACTCTGTCTTACGCTGATGCCTCTTCTTCAGAAGGAAATCTTGCCACCGATACTTTTTACATCGGTAGTTCCGGAATTCCAAATGTAGTCTTCGGTTGTATGGATTCAATTTTCAGCTCCAACAGTGAAGAGGATTCCAAAAACACTGGTTTAATGGGTATGAATCGTGGATCTCTCTCTTTTGTTTCTCAAATGGGTTTCCCTAAATTTTCCTACTGCATATCGGAATACGATTTTTCCGGTTTGTTATTACTCGGTGATGCAAATTTTTCATGGCTGGCTCCATTGAATTACACTCCACTCATCGAAATGTCCACCCCATTACCATATTTCGATCGGGTAGCTTACACGGTTCAGCTCGAAGGAATCAAAGTCGCCCACAAGTTACTTCCGATACCGGAATCCGTTTTCGAACCGGACCACACCGGAGCAGGTCAGACAATGGTCGACTCAGGCACCCAGTTCACTTTCCTTCTCGGACCGGCCTACACCGCACTACGTGACCACTTCCTAAACAAAACCGCCGGTTCACTACGGGTTTACGAGGATTCAAATTTTGTTTTCCAAGGGGCCATGGATCTTTGCTACCGGGTTCCAACAAACCAAACCCGGCTCCCGCCTCTACCATCAGTAACGCTGGTCTTTCGAGGCGCGGAAATGACGGTAACCGGTGATCGGATTCTGTACCGAGTGCCTGGGGAAAGAAGAGGAAACGATTCGATTCATTGTTTTACATTCGGAAATTCGGATCTGTTGGGCGTGGAAGCGTTTGTGATAGGTCATCTTCATCAACAGAACGTGTGGATGGAATTCGATCTGAAAAAATCCCGAATCGGGTTGGCGGAGATTCGGTGCGATTTAGCGGGTCAGAAACTGGGGATGGGCCTGTAAATGAACCGGGCCCCTTTTGATTGAAGCGTTTTAACTATCTGGGTAACTTATCACGTGATATTGATTAAAATCTTACGGGTTTTCGTGTGTAGTCGTAGGTCCATGCTATTATGATGGTCCTTCATCTTTTAATACTCTCCGTATAATTGGGTCCCACCTACCATCTCAGATATTATAAACTAAAAGATAAAAAAAGAAGAAGCAAAAGTGAGTTTGATCCTACGGATGAAGTCAGTTTCAGTGAGCTGCGATAGCGCCACGTGTGACAGTTTTCTTTCTTTTTATTTTTTTTCTCCCTCCCTAGCTGTAC

mRNA sequence

CGAATAGCGAACACTCTCTTGAGATCTCAGCGTCGATTTCTTTATTCTTTTTTGGAATTTATTGGCTTTTTTTGCTTTTAATTACTGGGATTTATATTTGTTGGATCTTGGATTTGGCTGTGGAAAGGAAAAGTTTCTCAGAGGATTGTAAAGATTTCATTTCGTAACACTCTGTTCTCGTAAACCAAAACAGAGCCAACCAACTTCTTCGGCTTTCTTCCTTCTATATCTTCCAACTGCTCCAACTGTCTATCAATTCCAAATTTTCAGGAAAGGAAAAAGAAAATGAGGGATTATTGCTTTGCTTTTAATTTCTCAAGCGTTAAATTTCTCAAATCTTGTTTTCTTTTCTTCTTCTGTACTTTGTTTTCTGTCTTTCACAGTATTCATCTTTGTTCCTCACTAAACCCAGCTCTTGTTCTTCCTCTAAAAACCCAAGTGATTCCTCCGGAATCTGTCCGGAGATCTCCCGATAAGCTTCCTTTCCGGCATAATATCAGTCTCACCGTCTCACTGACAGTCGGAACCCCACCGCAAAATGTCACGATGGTCATCGACACTGGCAGTGAACTCTCATGGCTCCATTGCAACACATCACAAAACTCTTCTTCTTCATCTTCGACGTTCAACCCGGTCTGGTCATCGTCTTACAGTCCGATCCCTTGCTCTTCCTCCACCTGCACCGACCAAACTCGAGATTTTCCTATTCGGCCTTCTTGTGATTCCAATCAGTTCTGTCACGCCACTCTGTCTTACGCTGATGCCTCTTCTTCAGAAGGAAATCTTGCCACCGATACTTTTTACATCGGTAGTTCCGGAATTCCAAATGTAGTCTTCGGTTGTATGGATTCAATTTTCAGCTCCAACAGTGAAGAGGATTCCAAAAACACTGGTTTAATGGGTATGAATCGTGGATCTCTCTCTTTTGTTTCTCAAATGGGTTTCCCTAAATTTTCCTACTGCATATCGGAATACGATTTTTCCGGTTTGTTATTACTCGGTGATGCAAATTTTTCATGGCTGGCTCCATTGAATTACACTCCACTCATCGAAATGTCCACCCCATTACCATATTTCGATCGGGTAGCTTACACGGTTCAGCTCGAAGGAATCAAAGTCGCCCACAAGTTACTTCCGATACCGGAATCCGTTTTCGAACCGGACCACACCGGAGCAGGTCAGACAATGGTCGACTCAGGCACCCAGTTCACTTTCCTTCTCGGACCGGCCTACACCGCACTACGTGACCACTTCCTAAACAAAACCGCCGGTTCACTACGGGTTTACGAGGATTCAAATTTTGTTTTCCAAGGGGCCATGGATCTTTGCTACCGGGTTCCAACAAACCAAACCCGGCTCCCGCCTCTACCATCAGTAACGCTGGTCTTTCGAGGCGCGGAAATGACGGTAACCGGTGATCGGATTCTGTACCGAGTGCCTGGGGAAAGAAGAGGAAACGATTCGATTCATTGTTTTACATTCGGAAATTCGGATCTGTTGGGCGTGGAAGCGTTTGTGATAGGTCATCTTCATCAACAGAACGTGTGGATGGAATTCGATCTGAAAAAATCCCGAATCGGGTTGGCGGAGATTCGGTGCGATTTAGCGGGTCAGAAACTGGGGATGGGCCTGTAAATGAACCGGGCCCCTTTTGATTGAAGCGTTTTAACTATCTGGGTAACTTATCACGTGATATTGATTAAAATCTTACGGGTTTTCGTGTGTAGTCGTAGGTCCATGCTATTATGATGGTCCTTCATCTTTTAATACTCTCCGTATAATTGGGTCCCACCTACCATCTCAGATATTATAAACTAAAAGATAAAAAAAGAAGAAGCAAAAGTGAGTTTGATCCTACGGATGAAGTCAGTTTCAGTGAGCTGCGATAGCGCCACGTGTGACAGTTTTCTTTCTTTTTATTTTTTTTCTCCCTCCCTAGCTGTAC

Coding sequence (CDS)

ATGAGGGATTATTGCTTTGCTTTTAATTTCTCAAGCGTTAAATTTCTCAAATCTTGTTTTCTTTTCTTCTTCTGTACTTTGTTTTCTGTCTTTCACAGTATTCATCTTTGTTCCTCACTAAACCCAGCTCTTGTTCTTCCTCTAAAAACCCAAGTGATTCCTCCGGAATCTGTCCGGAGATCTCCCGATAAGCTTCCTTTCCGGCATAATATCAGTCTCACCGTCTCACTGACAGTCGGAACCCCACCGCAAAATGTCACGATGGTCATCGACACTGGCAGTGAACTCTCATGGCTCCATTGCAACACATCACAAAACTCTTCTTCTTCATCTTCGACGTTCAACCCGGTCTGGTCATCGTCTTACAGTCCGATCCCTTGCTCTTCCTCCACCTGCACCGACCAAACTCGAGATTTTCCTATTCGGCCTTCTTGTGATTCCAATCAGTTCTGTCACGCCACTCTGTCTTACGCTGATGCCTCTTCTTCAGAAGGAAATCTTGCCACCGATACTTTTTACATCGGTAGTTCCGGAATTCCAAATGTAGTCTTCGGTTGTATGGATTCAATTTTCAGCTCCAACAGTGAAGAGGATTCCAAAAACACTGGTTTAATGGGTATGAATCGTGGATCTCTCTCTTTTGTTTCTCAAATGGGTTTCCCTAAATTTTCCTACTGCATATCGGAATACGATTTTTCCGGTTTGTTATTACTCGGTGATGCAAATTTTTCATGGCTGGCTCCATTGAATTACACTCCACTCATCGAAATGTCCACCCCATTACCATATTTCGATCGGGTAGCTTACACGGTTCAGCTCGAAGGAATCAAAGTCGCCCACAAGTTACTTCCGATACCGGAATCCGTTTTCGAACCGGACCACACCGGAGCAGGTCAGACAATGGTCGACTCAGGCACCCAGTTCACTTTCCTTCTCGGACCGGCCTACACCGCACTACGTGACCACTTCCTAAACAAAACCGCCGGTTCACTACGGGTTTACGAGGATTCAAATTTTGTTTTCCAAGGGGCCATGGATCTTTGCTACCGGGTTCCAACAAACCAAACCCGGCTCCCGCCTCTACCATCAGTAACGCTGGTCTTTCGAGGCGCGGAAATGACGGTAACCGGTGATCGGATTCTGTACCGAGTGCCTGGGGAAAGAAGAGGAAACGATTCGATTCATTGTTTTACATTCGGAAATTCGGATCTGTTGGGCGTGGAAGCGTTTGTGATAGGTCATCTTCATCAACAGAACGTGTGGATGGAATTCGATCTGAAAAAATCCCGAATCGGGTTGGCGGAGATTCGGTGCGATTTAGCGGGTCAGAAACTGGGGATGGGCCTGTAA

Protein sequence

MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRRSPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL*
Homology
BLAST of CsGy3G030020 vs. ExPASy Swiss-Prot
Match: Q9LZL3 (Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1)

HSP 1 Score: 567.8 bits (1462), Expect = 1.1e-160
Identity = 288/426 (67.61%), Postives = 341/426 (80.05%), Query Frame = 0

Query: 28  FSVFHSIHLCSSLNPALVLPLKTQVIPPESVRRSPDKLPFRHNISLTVSLTVGTPPQNVT 87
           FS F S    SS +  LVLPLKT++ P +   R  DKL F HN++LTV+LTVGTPPQN++
Sbjct: 34  FSSFSS----SSSSQTLVLPLKTRITPTD--HRPTDKLHFHHNVTLTVTLTVGTPPQNIS 93

Query: 88  MVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCSSSTCTDQTRDFPIRPSCDS 147
           MVIDTGSELSWL CN S N +  ++ F+P  SSSYSPIPCSS TC  +TRDF I  SCDS
Sbjct: 94  MVIDTGSELSWLRCNRSSNPNPVNN-FDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDS 153

Query: 148 NQFCHATLSYADASSSEGNLATDTFYIG-SSGIPNVVFGCMDSIFSSNSEEDSKNTGLMG 207
           ++ CHATLSYADASSSEGNLA + F+ G S+   N++FGCM S+  S+ EED+K TGL+G
Sbjct: 154 DKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLG 213

Query: 208 MNRGSLSFVSQMGFPKFSYCIS-EYDFSGLLLLGDANFSWLAPLNYTPLIEMSTPLPYFD 267
           MNRGSLSF+SQMGFPKFSYCIS   DF G LLLGD+NF+WL PLNYTPLI +STPLPYFD
Sbjct: 214 MNRGSLSFISQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFD 273

Query: 268 RVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLN 327
           RVAYTVQL GIKV  KLLPIP+SV  PDHTGAGQTMVDSGTQFTFLLGP YTALR HFLN
Sbjct: 274 RVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLN 333

Query: 328 KTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTR---LPPLPSVTLVFRGAEMTVTGDRILY 387
           +T G L VYED +FVFQG MDLCYR+   + R   L  LP+V+LVF GAE+ V+G  +LY
Sbjct: 334 RTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLY 393

Query: 388 RVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNVWMEFDLKKSRIGLAEIRCDLAG 447
           RVP    GNDS++CFTFGNSDL+G+EA+VIGH HQQN+W+EFDL++SRIGLA + CD++G
Sbjct: 394 RVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECDVSG 452

Query: 448 QKLGMG 449
           Q+LG+G
Sbjct: 454 QRLGIG 452

BLAST of CsGy3G030020 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 6.8e-41
Identity = 120/368 (32.61%), Postives = 189/368 (51.36%), Query Frame = 0

Query: 75  VSLTVGTPPQNVTMVIDTGSELSWLHCN-TSQNSSSSSSTFNPVWSSSYSPIPCSSSTCT 134
           ++L++GTP Q  + ++DTGS+L W  C   +Q  + S+  FNP  SSS+S +PCSS  C 
Sbjct: 97  MNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQ 156

Query: 135 DQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMDSIFSS 194
             +      P+C SN FC  T  Y D S ++G++ T+T   GS  IPN+ FGC +   ++
Sbjct: 157 ALS-----SPTC-SNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGE---NN 216

Query: 195 NSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFS--GLLLLGDANFSWLAPLNY 254
                    GL+GM RG LS  SQ+   KFSYC++    S    LLLG    S  A    
Sbjct: 217 QGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPN 276

Query: 255 TPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFE-PDHTGAGQTMVDSGTQFTF 314
           T LI+ S+ +P F    Y + L G+ V    LPI  S F    + G G  ++DSGT  T+
Sbjct: 277 TTLIQ-SSQIPTF----YYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTY 336

Query: 315 LLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPSVTLVFRG 374
            +  AY ++R  F+++   +L V   S+  F    DLC++ P++ + L  +P+  + F G
Sbjct: 337 FVNNAYQSVRQEFISQI--NLPVVNGSSSGF----DLCFQTPSDPSNL-QIPTFVMHFDG 396

Query: 375 AEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNVWMEFDLKKSR 434
            ++ +  +            ++ + C   G+S   G+  F  G++ QQN+ + +D   S 
Sbjct: 397 GDLELPSENYFI------SPSNGLICLAMGSSS-QGMSIF--GNIQQQNMLVVYDTGNSV 434

Query: 435 IGLAEIRC 439
           +  A  +C
Sbjct: 457 VSFASAQC 434

BLAST of CsGy3G030020 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 161.0 bits (406), Expect = 3.1e-38
Identity = 111/367 (30.25%), Postives = 183/367 (49.86%), Query Frame = 0

Query: 75  VSLTVGTPPQNVTMVIDTGSELSWLHCN-TSQNSSSSSSTFNPVWSSSYSPIPCSSSTCT 134
           +++ +GTP  + + ++DTGS+L W  C   +Q  S  +  FNP  SSS+S +PC S  C 
Sbjct: 98  MNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQ 157

Query: 135 DQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMDSIFSS 194
           D   +     +C++N+ C  T  Y D S+++G +AT+TF   +S +PN+ FGC +    +
Sbjct: 158 DLPSE-----TCNNNE-CQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGE---DN 217

Query: 195 NSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFS--GLLLLGDANFSWLAPLNY 254
                    GL+GM  G LS  SQ+G  +FSYC++ Y  S    L LG A          
Sbjct: 218 QGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPS 277

Query: 255 TPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDSGTQFTFL 314
           T LI  S    Y     Y + L+GI V    L IP S F+    G G  ++DSGT  T+L
Sbjct: 278 TTLIHSSLNPTY-----YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYL 337

Query: 315 LGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPSVTLVFRGA 374
              AY A+   F ++   +L   ++S+      +  C++ P++ + +  +P +++ F G 
Sbjct: 338 PQDAYNAVAQAFTDQI--NLPTVDESS----SGLSTCFQQPSDGSTV-QVPEISMQFDGG 397

Query: 375 EMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNVWMEFDLKKSRI 434
            + + G++ +   P E      + C   G+S  LG+  F  G++ QQ   + +DL+   +
Sbjct: 398 VLNL-GEQNILISPAE-----GVICLAMGSSSQLGISIF--GNIQQQETQVLYDLQNLAV 435

Query: 435 GLAEIRC 439
                +C
Sbjct: 458 SFVPTQC 435

BLAST of CsGy3G030020 vs. ExPASy Swiss-Prot
Match: O04496 (Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1)

HSP 1 Score: 138.7 bits (348), Expect = 1.7e-31
Identity = 117/378 (30.95%), Postives = 167/378 (44.18%), Query Frame = 0

Query: 69  HNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCS 128
           H  +  V   +GTPPQ + MV+DT ++  WL C+     S++S++FN   SS+YS + CS
Sbjct: 100 HIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCS 159

Query: 129 SSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMD 188
           ++ CT Q R      S      C    SY   SS   +L  DT  +    IPN  FGC++
Sbjct: 160 TAQCT-QARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCIN 219

Query: 189 SIFSSNSEEDSKNTGLMGMNRGSLSFVSQ---MGFPKFSYCISEYD---FSGLLLLGDAN 248
           S  S NS       GLMG+ RG +S VSQ   +    FSYC+  +    FSG L LG   
Sbjct: 220 SA-SGNSLPPQ---GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG--L 279

Query: 249 FSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMV 308
                 + YTPL+  +   P      Y V L G+ V    +P+       D      T++
Sbjct: 280 LGQPKSIRYTPLLR-NPRRPSL----YYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTII 339

Query: 309 DSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLP 368
           DSGT  T    P Y A+RD F  +          S+F   GA D C+            P
Sbjct: 340 DSGTVITRFAQPVYEAIRDEFRKQV-------NVSSFSTLGAFDTCFSADNENV----AP 399

Query: 369 SVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTF-GNSDLLGVEAFVIGHLHQQNVW 428
            +TL     ++ +  +  L           ++ C +  G          VI +L QQN+ 
Sbjct: 400 KITLHMTSLDLKLPMENTLI-----HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLR 449

Query: 429 MEFDLKKSRIGLAEIRCD 440
           + FD+  SRIG+A   C+
Sbjct: 460 ILFDVPNSRIGIAPEPCN 449

BLAST of CsGy3G030020 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 136.7 bits (343), Expect = 6.3e-31
Identity = 119/370 (32.16%), Postives = 178/370 (48.11%), Query Frame = 0

Query: 77  LTVGTPPQNVTMVIDTGSELSWLHCNTSQNS-SSSSSTFNPVWSSSYSPIPCSSSTCTDQ 136
           L VGTP + V MV+DTGS++ WL C   +   S S   F+P  S +Y+ IPCSS  C   
Sbjct: 146 LGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHC--- 205

Query: 137 TRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMDSIFSSNS 196
            R           + C   +SY D S + G+ +T+T     + +  V  GC       N 
Sbjct: 206 -RRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGC----GHDNE 265

Query: 197 EEDSKNTGLMGMNRGSLSFVSQMGF---PKFSYCISEYDFS---GLLLLGDANFSWLAPL 256
                  GL+G+ +G LSF  Q G     KFSYC+ +   S     ++ G+A  S +A  
Sbjct: 266 GLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIA-- 325

Query: 257 NYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLP-IPESVFEPDHTGAGQTMVDSGTQF 316
            +TPL+      P  D   Y V L GI V    +P +  S+F+ D  G G  ++DSGT  
Sbjct: 326 RFTPLLSN----PKLD-TFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSV 385

Query: 317 TFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPSVTLVF 376
           T L+ PAY A+RD F     G+  +    +F      D C+ + +N   +  +P+V L F
Sbjct: 386 TRLIRPAYIAMRDAF---RVGAKTLKRAPDF---SLFDTCFDL-SNMNEV-KVPTVVLHF 445

Query: 377 RGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNVWMEFDLKK 436
           RGA++++      Y +P +  G     CF F  + + G+   +IG++ QQ   + +DL  
Sbjct: 446 RGADVSLPATN--YLIPVDTNGK---FCFAFAGT-MGGLS--IIGNIQQQGFRVVYDLAS 484

Query: 437 SRIGLAEIRC 439
           SR+G A   C
Sbjct: 506 SRVGFAPGGC 484

BLAST of CsGy3G030020 vs. NCBI nr
Match: XP_004137780.1 (aspartic proteinase PCS1 [Cucumis sativus] >KGN58858.1 hypothetical protein Csa_000967 [Cucumis sativus])

HSP 1 Score: 902 bits (2330), Expect = 0.0
Identity = 449/449 (100.00%), Postives = 449/449 (100.00%), Query Frame = 0

Query: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60
           MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR
Sbjct: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60

Query: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSS 120
           SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSS
Sbjct: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSS 120

Query: 121 SYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIP 180
           SYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIP
Sbjct: 121 SYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIP 180

Query: 181 NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD 240
           NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD
Sbjct: 181 NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD 240

Query: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQT 300
           ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQT
Sbjct: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQT 300

Query: 301 MVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPP 360
           MVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPP
Sbjct: 301 MVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPP 360

Query: 361 LPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV 420
           LPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV
Sbjct: 361 LPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV 420

Query: 421 WMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449
           WMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 421 WMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449

BLAST of CsGy3G030020 vs. NCBI nr
Match: XP_008442528.1 (PREDICTED: aspartic proteinase PCS1 [Cucumis melo] >KAA0044101.1 aspartic proteinase PCS1 [Cucumis melo var. makuwa] >TYK25036.1 aspartic proteinase PCS1 [Cucumis melo var. makuwa])

HSP 1 Score: 861 bits (2225), Expect = 1.19e-314
Identity = 431/451 (95.57%), Postives = 442/451 (98.00%), Query Frame = 0

Query: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60
           MR YC AF+ +++KFLKSCFLFFFC LFSVF++I LCSS+NPALVLPLKTQVIPPESVRR
Sbjct: 1   MRGYCLAFDSANIKFLKSCFLFFFCILFSVFYNIDLCSSVNPALVLPLKTQVIPPESVRR 60

Query: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS--TFNPVW 120
           SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS  TFNPV 
Sbjct: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSSSTFNPVR 120

Query: 121 SSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG 180
           SSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIG+SG
Sbjct: 121 SSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGNSG 180

Query: 181 IPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLL 240
           IPNVVFGCMDSIFSSN+EEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLL
Sbjct: 181 IPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLL 240

Query: 241 GDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG 300
           GDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG
Sbjct: 241 GDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG 300

Query: 301 QTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRL 360
           QTMVDSGTQFTFLLGPAYTALRDHFLN+TAGSLR+YED NFVFQGAMDLCYRVPTNQTRL
Sbjct: 301 QTMVDSGTQFTFLLGPAYTALRDHFLNQTAGSLRLYEDPNFVFQGAMDLCYRVPTNQTRL 360

Query: 361 PPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQ 420
           PPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQ
Sbjct: 361 PPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQ 420

Query: 421 NVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449
           NVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 421 NVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 451

BLAST of CsGy3G030020 vs. NCBI nr
Match: XP_038905417.1 (aspartic proteinase PCS1 [Benincasa hispida])

HSP 1 Score: 853 bits (2203), Expect = 2.57e-311
Identity = 426/449 (94.88%), Postives = 435/449 (96.88%), Query Frame = 0

Query: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60
           MRDYCFAFN SS KFLK CF+FFFCTLF VF +  LCSSLNPALVLPLKTQVIPPESVRR
Sbjct: 1   MRDYCFAFNSSSNKFLKYCFIFFFCTLFCVFQNSKLCSSLNPALVLPLKTQVIPPESVRR 60

Query: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSS 120
           SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCN SQNSSSSSSTFNPV SS
Sbjct: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSSTFNPVRSS 120

Query: 121 SYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIP 180
           SY+PIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIG+SGIP
Sbjct: 121 SYTPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGNSGIP 180

Query: 181 NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD 240
           NVVFGCMDSIFSSN+EEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD
Sbjct: 181 NVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD 240

Query: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQT 300
           ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKV+HKLLPIPESVFEPDHTGAGQT
Sbjct: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVSHKLLPIPESVFEPDHTGAGQT 300

Query: 301 MVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPP 360
           MVDSGTQFTFLLGPAY ALRD F+N+TAGS+RV ED NFVFQGAMDLCYRVP NQTRLPP
Sbjct: 301 MVDSGTQFTFLLGPAYAALRDEFVNQTAGSIRVLEDPNFVFQGAMDLCYRVPINQTRLPP 360

Query: 361 LPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV 420
           LPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV
Sbjct: 361 LPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV 420

Query: 421 WMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449
           WMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 421 WMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449

BLAST of CsGy3G030020 vs. NCBI nr
Match: XP_022983486.1 (aspartic proteinase PCS1 [Cucurbita maxima])

HSP 1 Score: 829 bits (2141), Expect = 7.22e-302
Identity = 415/449 (92.43%), Postives = 428/449 (95.32%), Query Frame = 0

Query: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60
           MRDYC AFN S+ KFLKS F FF C LFSVF ++ LCSSLNPAL+LPLKTQVIPPES+RR
Sbjct: 1   MRDYCIAFNSSNHKFLKSFFPFFLCILFSVFQNLILCSSLNPALLLPLKTQVIPPESIRR 60

Query: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSS 120
           SPDKLPFRHN+SLTVSLTVGTPPQNVTMVIDTGSELSWLHCN SQNSSSSSSTFNP  SS
Sbjct: 61  SPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSSTFNPAGSS 120

Query: 121 SYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIP 180
           SY+PIPCSSSTCTDQTRDFPI  SCDSN  CHATLSYADASSSEG LATDTFYIG+SGI 
Sbjct: 121 SYTPIPCSSSTCTDQTRDFPIPASCDSNHLCHATLSYADASSSEGTLATDTFYIGNSGIS 180

Query: 181 NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD 240
           NVVFGCMDSIFSSN+EEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD
Sbjct: 181 NVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD 240

Query: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQT 300
           ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKV+HKLLPIPESVFEPDHTGAGQT
Sbjct: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVSHKLLPIPESVFEPDHTGAGQT 300

Query: 301 MVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPP 360
           MVDSGTQFTFLLGPAYTALRD FLN+TAGS+RV+EDSNFVFQGAMDLCYRVP NQTRLPP
Sbjct: 301 MVDSGTQFTFLLGPAYTALRDEFLNRTAGSIRVFEDSNFVFQGAMDLCYRVPMNQTRLPP 360

Query: 361 LPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV 420
           LPSVTLVFRGAEMTVTGDRILYRVPGE RGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV
Sbjct: 361 LPSVTLVFRGAEMTVTGDRILYRVPGEIRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV 420

Query: 421 WMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449
           WMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 421 WMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449

BLAST of CsGy3G030020 vs. NCBI nr
Match: XP_023528618.1 (aspartic proteinase PCS1 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 823 bits (2125), Expect = 2.05e-299
Identity = 414/450 (92.00%), Postives = 427/450 (94.89%), Query Frame = 0

Query: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60
           MRDYC AFN S+ KFLKS F FF CTLFSVF ++ LCSS NPAL+LPLKTQVIPPES+RR
Sbjct: 1   MRDYCIAFNSSNHKFLKSLFPFFLCTLFSVFQNLILCSSQNPALLLPLKTQVIPPESIRR 60

Query: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS-TFNPVWS 120
           SPDKLPFRHN+SLTVSLTVGTPPQNVTMVIDTGSELSWLHCN SQNSSSSSS TFNP  S
Sbjct: 61  SPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSSSTFNPAGS 120

Query: 121 SSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGI 180
           SSY+PIPCSSSTCTDQTRDFPI  SCDSN  CHATLSYADASSSEG LATDTFYIG+SGI
Sbjct: 121 SSYTPIPCSSSTCTDQTRDFPIPASCDSNHLCHATLSYADASSSEGTLATDTFYIGNSGI 180

Query: 181 PNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLG 240
            NVVFGCMDSIFSSN+EEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLG
Sbjct: 181 SNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLG 240

Query: 241 DANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQ 300
           DANFSWLAPLNYTPLIEM+TPLPYFDRVAYTVQLEGIKV+HKLLPIPESVFEPDHTGAGQ
Sbjct: 241 DANFSWLAPLNYTPLIEMTTPLPYFDRVAYTVQLEGIKVSHKLLPIPESVFEPDHTGAGQ 300

Query: 301 TMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLP 360
           TMVDSGTQFTFLLGPAYTALRD FLN+TAGS RV+EDSNFVFQGAMDLCYRVP NQTRLP
Sbjct: 301 TMVDSGTQFTFLLGPAYTALRDEFLNRTAGSFRVFEDSNFVFQGAMDLCYRVPMNQTRLP 360

Query: 361 PLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQN 420
           PLPSVTLVFRGAEMTVTGDRILYRVPGE RGNDSIHCFTFGNSDLLGVEAFVIGHLHQQN
Sbjct: 361 PLPSVTLVFRGAEMTVTGDRILYRVPGEIRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQN 420

Query: 421 VWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449
           VWMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 421 VWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 450

BLAST of CsGy3G030020 vs. ExPASy TrEMBL
Match: A0A0A0LAI7 (Aspartic proteinase nepenthesin-2 OS=Cucumis sativus OX=3659 GN=Csa_3G734110 PE=3 SV=1)

HSP 1 Score: 902 bits (2330), Expect = 0.0
Identity = 449/449 (100.00%), Postives = 449/449 (100.00%), Query Frame = 0

Query: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60
           MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR
Sbjct: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60

Query: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSS 120
           SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSS
Sbjct: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSS 120

Query: 121 SYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIP 180
           SYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIP
Sbjct: 121 SYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIP 180

Query: 181 NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD 240
           NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD
Sbjct: 181 NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD 240

Query: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQT 300
           ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQT
Sbjct: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQT 300

Query: 301 MVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPP 360
           MVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPP
Sbjct: 301 MVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPP 360

Query: 361 LPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV 420
           LPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV
Sbjct: 361 LPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV 420

Query: 421 WMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449
           WMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 421 WMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449

BLAST of CsGy3G030020 vs. ExPASy TrEMBL
Match: A0A5D3DN23 (Aspartic proteinase PCS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G001570 PE=3 SV=1)

HSP 1 Score: 861 bits (2225), Expect = 5.77e-315
Identity = 431/451 (95.57%), Postives = 442/451 (98.00%), Query Frame = 0

Query: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60
           MR YC AF+ +++KFLKSCFLFFFC LFSVF++I LCSS+NPALVLPLKTQVIPPESVRR
Sbjct: 1   MRGYCLAFDSANIKFLKSCFLFFFCILFSVFYNIDLCSSVNPALVLPLKTQVIPPESVRR 60

Query: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS--TFNPVW 120
           SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS  TFNPV 
Sbjct: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSSSTFNPVR 120

Query: 121 SSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG 180
           SSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIG+SG
Sbjct: 121 SSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGNSG 180

Query: 181 IPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLL 240
           IPNVVFGCMDSIFSSN+EEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLL
Sbjct: 181 IPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLL 240

Query: 241 GDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG 300
           GDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG
Sbjct: 241 GDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG 300

Query: 301 QTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRL 360
           QTMVDSGTQFTFLLGPAYTALRDHFLN+TAGSLR+YED NFVFQGAMDLCYRVPTNQTRL
Sbjct: 301 QTMVDSGTQFTFLLGPAYTALRDHFLNQTAGSLRLYEDPNFVFQGAMDLCYRVPTNQTRL 360

Query: 361 PPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQ 420
           PPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQ
Sbjct: 361 PPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQ 420

Query: 421 NVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449
           NVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 421 NVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 451

BLAST of CsGy3G030020 vs. ExPASy TrEMBL
Match: A0A1S3B5W4 (aspartic proteinase PCS1 OS=Cucumis melo OX=3656 GN=LOC103486376 PE=3 SV=1)

HSP 1 Score: 861 bits (2225), Expect = 5.77e-315
Identity = 431/451 (95.57%), Postives = 442/451 (98.00%), Query Frame = 0

Query: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60
           MR YC AF+ +++KFLKSCFLFFFC LFSVF++I LCSS+NPALVLPLKTQVIPPESVRR
Sbjct: 1   MRGYCLAFDSANIKFLKSCFLFFFCILFSVFYNIDLCSSVNPALVLPLKTQVIPPESVRR 60

Query: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS--TFNPVW 120
           SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS  TFNPV 
Sbjct: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSSSTFNPVR 120

Query: 121 SSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG 180
           SSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIG+SG
Sbjct: 121 SSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGNSG 180

Query: 181 IPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLL 240
           IPNVVFGCMDSIFSSN+EEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLL
Sbjct: 181 IPNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLL 240

Query: 241 GDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG 300
           GDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG
Sbjct: 241 GDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAG 300

Query: 301 QTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRL 360
           QTMVDSGTQFTFLLGPAYTALRDHFLN+TAGSLR+YED NFVFQGAMDLCYRVPTNQTRL
Sbjct: 301 QTMVDSGTQFTFLLGPAYTALRDHFLNQTAGSLRLYEDPNFVFQGAMDLCYRVPTNQTRL 360

Query: 361 PPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQ 420
           PPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQ
Sbjct: 361 PPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQ 420

Query: 421 NVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449
           NVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 421 NVWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 451

BLAST of CsGy3G030020 vs. ExPASy TrEMBL
Match: A0A6J1J2F5 (aspartic proteinase PCS1 OS=Cucurbita maxima OX=3661 GN=LOC111482077 PE=3 SV=1)

HSP 1 Score: 829 bits (2141), Expect = 3.49e-302
Identity = 415/449 (92.43%), Postives = 428/449 (95.32%), Query Frame = 0

Query: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60
           MRDYC AFN S+ KFLKS F FF C LFSVF ++ LCSSLNPAL+LPLKTQVIPPES+RR
Sbjct: 1   MRDYCIAFNSSNHKFLKSFFPFFLCILFSVFQNLILCSSLNPALLLPLKTQVIPPESIRR 60

Query: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSS 120
           SPDKLPFRHN+SLTVSLTVGTPPQNVTMVIDTGSELSWLHCN SQNSSSSSSTFNP  SS
Sbjct: 61  SPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSSTFNPAGSS 120

Query: 121 SYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIP 180
           SY+PIPCSSSTCTDQTRDFPI  SCDSN  CHATLSYADASSSEG LATDTFYIG+SGI 
Sbjct: 121 SYTPIPCSSSTCTDQTRDFPIPASCDSNHLCHATLSYADASSSEGTLATDTFYIGNSGIS 180

Query: 181 NVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD 240
           NVVFGCMDSIFSSN+EEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD
Sbjct: 181 NVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGD 240

Query: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQT 300
           ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKV+HKLLPIPESVFEPDHTGAGQT
Sbjct: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVSHKLLPIPESVFEPDHTGAGQT 300

Query: 301 MVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPP 360
           MVDSGTQFTFLLGPAYTALRD FLN+TAGS+RV+EDSNFVFQGAMDLCYRVP NQTRLPP
Sbjct: 301 MVDSGTQFTFLLGPAYTALRDEFLNRTAGSIRVFEDSNFVFQGAMDLCYRVPMNQTRLPP 360

Query: 361 LPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV 420
           LPSVTLVFRGAEMTVTGDRILYRVPGE RGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV
Sbjct: 361 LPSVTLVFRGAEMTVTGDRILYRVPGEIRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNV 420

Query: 421 WMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449
           WMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 421 WMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449

BLAST of CsGy3G030020 vs. ExPASy TrEMBL
Match: A0A6J1F9C1 (aspartic proteinase PCS1 OS=Cucurbita moschata OX=3662 GN=LOC111442007 PE=3 SV=1)

HSP 1 Score: 822 bits (2124), Expect = 1.41e-299
Identity = 414/450 (92.00%), Postives = 427/450 (94.89%), Query Frame = 0

Query: 1   MRDYCFAFNFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRR 60
           MRDYC AFN S+ KFLKS F FF CTLFSVF ++ LCSS NPAL+LPLKTQVIPPES+RR
Sbjct: 1   MRDYCIAFNSSNHKFLKSLFPFFLCTLFSVFQNLILCSSQNPALLLPLKTQVIPPESIRR 60

Query: 61  SPDKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS-TFNPVWS 120
           SPDKLPFRHN+SLTVSLTVGTPPQNVTMVIDTGSELSWLHCN SQNSSSSSS TFNP  S
Sbjct: 61  SPDKLPFRHNVSLTVSLTVGTPPQNVTMVIDTGSELSWLHCNRSQNSSSSSSSTFNPAGS 120

Query: 121 SSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGI 180
           SSY+PIPCSSSTCTD+TRDFPI  SCDSN  CHATLSYADASSSEG LATDTFYIG+SGI
Sbjct: 121 SSYTPIPCSSSTCTDRTRDFPIPASCDSNHLCHATLSYADASSSEGTLATDTFYIGNSGI 180

Query: 181 PNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLG 240
            NVVFGCMDSIFSSN+EEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLG
Sbjct: 181 SNVVFGCMDSIFSSNNEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLG 240

Query: 241 DANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQ 300
           DANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKV+HKLLPIPESVFEPDHTGAGQ
Sbjct: 241 DANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVSHKLLPIPESVFEPDHTGAGQ 300

Query: 301 TMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLP 360
           TMVDSGTQFTFLLGPAYTALRD FLN+TAGS RV+EDSNFVFQGAMDLCYRVP NQTRLP
Sbjct: 301 TMVDSGTQFTFLLGPAYTALRDEFLNRTAGSFRVFEDSNFVFQGAMDLCYRVPMNQTRLP 360

Query: 361 PLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQN 420
           PLPSVTLVFRGAEMTVTGDRILYRVPGE RGNDSIHCFTFGNSDLLGVEAFVIGHLHQQN
Sbjct: 361 PLPSVTLVFRGAEMTVTGDRILYRVPGEIRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQN 420

Query: 421 VWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 449
           VWMEFDLKKSRIGLAEIRCDLAGQKLGMGL
Sbjct: 421 VWMEFDLKKSRIGLAEIRCDLAGQKLGMGL 450

BLAST of CsGy3G030020 vs. TAIR 10
Match: AT5G02190.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 567.8 bits (1462), Expect = 7.9e-162
Identity = 288/426 (67.61%), Postives = 341/426 (80.05%), Query Frame = 0

Query: 28  FSVFHSIHLCSSLNPALVLPLKTQVIPPESVRRSPDKLPFRHNISLTVSLTVGTPPQNVT 87
           FS F S    SS +  LVLPLKT++ P +   R  DKL F HN++LTV+LTVGTPPQN++
Sbjct: 34  FSSFSS----SSSSQTLVLPLKTRITPTD--HRPTDKLHFHHNVTLTVTLTVGTPPQNIS 93

Query: 88  MVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCSSSTCTDQTRDFPIRPSCDS 147
           MVIDTGSELSWL CN S N +  ++ F+P  SSSYSPIPCSS TC  +TRDF I  SCDS
Sbjct: 94  MVIDTGSELSWLRCNRSSNPNPVNN-FDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDS 153

Query: 148 NQFCHATLSYADASSSEGNLATDTFYIG-SSGIPNVVFGCMDSIFSSNSEEDSKNTGLMG 207
           ++ CHATLSYADASSSEGNLA + F+ G S+   N++FGCM S+  S+ EED+K TGL+G
Sbjct: 154 DKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLG 213

Query: 208 MNRGSLSFVSQMGFPKFSYCIS-EYDFSGLLLLGDANFSWLAPLNYTPLIEMSTPLPYFD 267
           MNRGSLSF+SQMGFPKFSYCIS   DF G LLLGD+NF+WL PLNYTPLI +STPLPYFD
Sbjct: 214 MNRGSLSFISQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFD 273

Query: 268 RVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLN 327
           RVAYTVQL GIKV  KLLPIP+SV  PDHTGAGQTMVDSGTQFTFLLGP YTALR HFLN
Sbjct: 274 RVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLN 333

Query: 328 KTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTR---LPPLPSVTLVFRGAEMTVTGDRILY 387
           +T G L VYED +FVFQG MDLCYR+   + R   L  LP+V+LVF GAE+ V+G  +LY
Sbjct: 334 RTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLY 393

Query: 388 RVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNVWMEFDLKKSRIGLAEIRCDLAG 447
           RVP    GNDS++CFTFGNSDL+G+EA+VIGH HQQN+W+EFDL++SRIGLA + CD++G
Sbjct: 394 RVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECDVSG 452

Query: 448 QKLGMG 449
           Q+LG+G
Sbjct: 454 QRLGIG 452

BLAST of CsGy3G030020 vs. TAIR 10
Match: AT2G39710.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 533.1 bits (1372), Expect = 2.2e-151
Identity = 274/443 (61.85%), Postives = 327/443 (73.81%), Query Frame = 0

Query: 9   NFSSVKFLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESVRRSPDKLPFR 68
           NF  +  L   F   FC            SS N  L+  LKTQ +P    + S DKL FR
Sbjct: 15  NFLRISVLLLIFPLTFCK----------TSSTNQTLLFSLKTQKLP----QSSSDKLSFR 74

Query: 69  HNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCS 128
           HN++LTV+L VG PPQN++MV+DTGSELSWLHC  S N     S FNPV SS+YSP+PCS
Sbjct: 75  HNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSPN---LGSVFNPVSSSTYSPVPCS 134

Query: 129 SSTCTDQTRDFPIRPSCD-SNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCM 188
           S  C  +TRD PI  SCD     CH  +SYADA+S EGNLA +TF IGS   P  +FGCM
Sbjct: 135 SPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCM 194

Query: 189 DSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYDFSGLLLLGDANFSWLA 248
           DS  SSNSEED+K+TGLMGMNRGSLSFV+Q+GF KFSYCIS  D SG LLLGDA++SWL 
Sbjct: 195 DSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDSSGFLLLGDASYSWLG 254

Query: 249 PLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDSGTQ 308
           P+ YTPL+  STPLPYFDRVAYTVQLEGI+V  K+L +P+SVF PDHTGAGQTMVDSGTQ
Sbjct: 255 PIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQ 314

Query: 309 FTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRV-PTNQTRLPPLPSVTL 368
           FTFL+GP YTAL++ F+ +T   LR+ +D +FVFQG MDLCY+V  T +     LP V+L
Sbjct: 315 FTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSL 374

Query: 369 VFRGAEMTVTGDRILYRVPGE-RRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNVWMEFD 428
           +FRGAEM+V+G ++LYRV G    G + ++CFTFGNSDLLG+EAFVIGH HQQNVWMEFD
Sbjct: 375 MFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFD 434

Query: 429 LKKSRIGLA-EIRCDLAGQKLGM 448
           L KSR+G A  +RCDLA Q+LG+
Sbjct: 435 LAKSRVGFAGNVRCDLASQRLGL 440

BLAST of CsGy3G030020 vs. TAIR 10
Match: AT1G66180.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 244.2 bits (622), Expect = 2.0e-64
Identity = 160/448 (35.71%), Postives = 235/448 (52.46%), Query Frame = 0

Query: 17  KSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPESV-----RRSPD-------- 76
           K  F FFF    S+  S+ L     P   LP+ T             R++P         
Sbjct: 6   KPLFFFFFLNYVSLSTSLSLHL---PLTSLPISTTTNSHRFTTSLLSRKNPSPSSPPYNF 65

Query: 77  KLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYS 136
           +  F+++++L +SL +GTPPQ   MV+DTGS+LSW+ C+  +      ++F+P  SSS+S
Sbjct: 66  RSRFKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDPSLSSSFS 125

Query: 137 PIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGI-PNV 196
            +PCS   C  +  DF +  SCDSN+ CH +  YAD + +EGNL  +     ++ I P +
Sbjct: 126 TLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPL 185

Query: 197 VFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISE------YDFSGLL 256
           + GC        + E S + G++GMNRG LSFVSQ    KFSYCI        +  +G  
Sbjct: 186 ILGC--------ATESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSF 245

Query: 257 LLGD----ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEP 316
            LGD      F +++ L +      S  +P  D +AYTV + GI+   K L I  SVF P
Sbjct: 246 YLGDNPNSHGFKYVSLLTFPE----SQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRP 305

Query: 317 DHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVP 376
           D  G+GQTMVDSG++FT L+  AY  +R   + +    L+      +V+ G  D+C+   
Sbjct: 306 DAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLK----KGYVYGGTADMCF--D 365

Query: 377 TNQTRLPPL-PSVTLVF-RGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAF 436
            N   +P L   +  VF RG E+ V  +R+L  V G       IHC   G S +LG  + 
Sbjct: 366 GNVAMIPRLIGDLVFVFTRGVEILVPKERVLVNVGG------GIHCVGIGRSSMLGAASN 425

Query: 437 VIGHLHQQNVWMEFDLKKSRIGLAEIRC 439
           +IG++HQQN+W+EFD+   R+G A+  C
Sbjct: 426 IIGNVHQQNLWVEFDVTNRRVGFAKADC 426

BLAST of CsGy3G030020 vs. TAIR 10
Match: AT5G37540.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 234.2 bits (596), Expect = 2.1e-61
Identity = 156/459 (33.99%), Postives = 237/459 (51.63%), Query Frame = 0

Query: 15  FLKSCFLFFFCTLFSVFHSIHLCSSLNPALVLPLKTQVIPPES----------VRRSP-- 74
           FLK  ++FFF       +S+ L  S + +L  PL +  + P +           RR+P  
Sbjct: 9   FLKLLYIFFF-----FCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSP 68

Query: 75  --DKLPFRHNI----SLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSS---SSSST 134
                 FR NI    +L +SL +GTP Q+  +V+DTGS+LSW+ C+  +        +++
Sbjct: 69  PSSPYTFRSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTS 128

Query: 135 FNPVWSSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTF- 194
           F+P  SSS+S +PCS   C  +  DF +  SCDSN+ CH +  YAD + +EGNL  + F 
Sbjct: 129 FDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFT 188

Query: 195 YIGSSGIPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD- 254
           +  S   P ++ GC        ++E +   G++GMN G LSF+SQ    KFSYCI     
Sbjct: 189 FSNSQTTPPLILGC--------AKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSN 248

Query: 255 -----FSGLLLLGD----ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKL 314
                 +G   LGD      F +++ L +      S  +P  D +AYTV L+GI++  K 
Sbjct: 249 RPGLASTGSFYLGDNPNSRGFKYVSLLTF----PQSQRMPNLDPLAYTVPLQGIRIGQKR 308

Query: 315 LPIPESVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQ 374
           L IP SVF PD  G+GQTMVDSG++FT L+  AY  +++  +      L+      +V+ 
Sbjct: 309 LNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLK----KGYVYG 368

Query: 375 GAMDLCYRVPTNQTRLPPLPSVTLVF-RGAEMTVTGDRILYRVPGERRGNDSIHCFTFGN 434
              D+C+    +      +  +   F RG E+ V    +L  V G       IHC   G 
Sbjct: 369 STADMCFDGNHSMEIGRLIGDLVFEFGRGVEILVEKQSLLVNVGG------GIHCVGIGR 428

Query: 435 SDLLGVEAFVIGHLHQQNVWMEFDLKKSRIGLAEIRCDL 441
           S +LG  + +IG++HQQN+W+EFD+   R+G ++  C L
Sbjct: 429 SSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKAECRL 440

BLAST of CsGy3G030020 vs. TAIR 10
Match: AT1G09750.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 138.7 bits (348), Expect = 1.2e-32
Identity = 117/378 (30.95%), Postives = 167/378 (44.18%), Query Frame = 0

Query: 69  HNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCS 128
           H  +  V   +GTPPQ + MV+DT ++  WL C+     S++S++FN   SS+YS + CS
Sbjct: 100 HIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCS 159

Query: 129 SSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMD 188
           ++ CT Q R      S      C    SY   SS   +L  DT  +    IPN  FGC++
Sbjct: 160 TAQCT-QARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCIN 219

Query: 189 SIFSSNSEEDSKNTGLMGMNRGSLSFVSQ---MGFPKFSYCISEYD---FSGLLLLGDAN 248
           S  S NS       GLMG+ RG +S VSQ   +    FSYC+  +    FSG L LG   
Sbjct: 220 SA-SGNSLPPQ---GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG--L 279

Query: 249 FSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMV 308
                 + YTPL+  +   P      Y V L G+ V    +P+       D      T++
Sbjct: 280 LGQPKSIRYTPLLR-NPRRPSL----YYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTII 339

Query: 309 DSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLP 368
           DSGT  T    P Y A+RD F  +          S+F   GA D C+            P
Sbjct: 340 DSGTVITRFAQPVYEAIRDEFRKQV-------NVSSFSTLGAFDTCFSADNENV----AP 399

Query: 369 SVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTF-GNSDLLGVEAFVIGHLHQQNVW 428
            +TL     ++ +  +  L           ++ C +  G          VI +L QQN+ 
Sbjct: 400 KITLHMTSLDLKLPMENTLI-----HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLR 449

Query: 429 MEFDLKKSRIGLAEIRCD 440
           + FD+  SRIG+A   C+
Sbjct: 460 ILFDVPNSRIGIAPEPCN 449

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LZL31.1e-16067.61Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1[more]
Q766C36.8e-4132.61Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q766C23.1e-3830.25Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
O044961.7e-3130.95Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1[more]
Q9LNJ36.3e-3132.16Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Match NameE-valueIdentityDescription
XP_004137780.10.0100.00aspartic proteinase PCS1 [Cucumis sativus] >KGN58858.1 hypothetical protein Csa_... [more]
XP_008442528.11.19e-31495.57PREDICTED: aspartic proteinase PCS1 [Cucumis melo] >KAA0044101.1 aspartic protei... [more]
XP_038905417.12.57e-31194.88aspartic proteinase PCS1 [Benincasa hispida][more]
XP_022983486.17.22e-30292.43aspartic proteinase PCS1 [Cucurbita maxima][more]
XP_023528618.12.05e-29992.00aspartic proteinase PCS1 isoform X2 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
A0A0A0LAI70.0100.00Aspartic proteinase nepenthesin-2 OS=Cucumis sativus OX=3659 GN=Csa_3G734110 PE=... [more]
A0A5D3DN235.77e-31595.57Aspartic proteinase PCS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol... [more]
A0A1S3B5W45.77e-31595.57aspartic proteinase PCS1 OS=Cucumis melo OX=3656 GN=LOC103486376 PE=3 SV=1[more]
A0A6J1J2F53.49e-30292.43aspartic proteinase PCS1 OS=Cucurbita maxima OX=3661 GN=LOC111482077 PE=3 SV=1[more]
A0A6J1F9C11.41e-29992.00aspartic proteinase PCS1 OS=Cucurbita moschata OX=3662 GN=LOC111442007 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT5G02190.17.9e-16267.61Eukaryotic aspartyl protease family protein [more]
AT2G39710.12.2e-15161.85Eukaryotic aspartyl protease family protein [more]
AT1G66180.12.0e-6435.71Eukaryotic aspartyl protease family protein [more]
AT5G37540.12.1e-6133.99Eukaryotic aspartyl protease family protein [more]
AT1G09750.11.2e-3230.95Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 410..425
score: 18.31
coord: 300..311
score: 32.91
coord: 79..99
score: 46.19
IPR001461Aspartic peptidase A1 familyPANTHERPTHR47965ASPARTYL PROTEASE-RELATEDcoord: 20..447
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 244..441
e-value: 3.2E-43
score: 149.5
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 67..240
e-value: 3.1E-37
score: 130.4
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 74..438
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 75..240
e-value: 7.6E-39
score: 133.7
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 268..433
e-value: 4.6E-36
score: 124.0
NoneNo IPR availablePANTHERPTHR47965:SF64ASPARTIC PROTEINASE PCS1coord: 20..447
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 88..99
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 73..434
score: 35.691284
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 74..438
e-value: 1.54976E-72
score: 227.531

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy3G030020.2CsGy3G030020.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0046274 lignin catabolic process
biological_process GO:0012501 programmed cell death
biological_process GO:0006508 proteolysis
cellular_component GO:0048046 apoplast
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0005507 copper ion binding
molecular_function GO:0052716 hydroquinone:oxygen oxidoreductase activity