Cp4.1LG14g09420 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG14g09420
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Eukaryotic aspartyl protease family protein
Location: Cp4.1LG14: 7966429 .. 7967877 (+)
RNA-Seq Expression: Cp4.1LG14g09420
Synteny: Cp4.1LG14g09420
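
The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page); the regular expression and the `parse_location` helper are illustrative names, not part of any database API.

```python
import re

# Location string copied from the Overview above.
LOCATION = "Cp4.1LG14: 7966429 .. 7967877 (+)"

# Hypothetical pattern for the "seqid: start .. end (strand)" layout used here.
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into seqid, start, end, strand and span length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


parsed = parse_location(LOCATION)
print(parsed)
```

Under that assumption the gene spans 1449 bp on the plus strand of Cp4.1LG14.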
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCCCCTGTTTTTCTCTTCCTACTCTGTTTTCTCCTTTCCTCCCCTGTTTTCTCCTCACAGCTTCTTCTTTTACCTCTGTCTAATTCCTTATCATCCTCTTCTGATTTCAACAACACCCACAACCTCCTTAAATCCACCGCCGCCCGCTCTTCCGCCCGCTTCCACCACCGCCGTCGTACCCACCACCGCAGCCACCTCTCTCTGCCCCTTTCCCCCGGTGGCGATTATACTCTCTCCTTCAACCTTGGTTCCGAGTCTCAAAAGATTTCCCTCTATATGGACACTGGAAGCGACCTTGTCTGGTTCCCCTGTTCCCCATTTGAGTGTATTCTCTGTGAAGGCAAACCCAAAATTCAATCCCCTTTGCCCAAAATCGCAGATAAAAAATCAGTTTCCTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGGGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCCCGATGTCCACTTGAATCCATTGAAGTTTCTGAGTGTTCTTCTTTTTCTTGTCCGCCGTTTTATTATGCTTATGGCGATGGGAGTTTAATTGGTCGGCTTTATAGAGACAGTCTCAGTTTGCCCGCGCCGGCGCCGGCACCGTCACCGGCGATTAATGTTCGGAATTTTACTTTTGGGTGTGCCCACTCAGCGTTAGGCGAGCCAATCGGTGTCGCCGGATTCGGCCGAGGGTTGTTGTCGATGCCGAGTCAACTCGCTACTTTCTCGCCTCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCGGACCGAGTTCGCCGCCCGAGTCCGCTGATTCTCGGCCGGTACTACGGCAGCGAGACGGAGTTTATTTACACTTCCTTGCTTGAGAATCCAAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGGATTTCAGTTGGGTCGGTGAGGATTCCGGCGCCGGAGTTTTTGAAAAGGGTGGATGAGGGTGGCAGCGGCGGCGTTGTGGTGGATTCCGGTACTACTTTTACTATGTTGCCAGCAGGTTTGTATAACTCGGTGGTAGCCCAGTTTGAGAACCGGACCGGGCGAGTTGCAAGCCGGGCGAGTCGGATTGAAGAAAACACCGGATTGAGCCCTTGCTATTACTACGAGAACTCAGTGGAAGTGCCACGTGTCGTGTTACATTTCGTTGGGGAGAAATCCAGTGTGGTGCTTCCTAGAAAGAATTATTTCTACGAGTTTTTGGACGGTGGAGATGGGGTGGAGAGAAAGAGAAAAGTCGGGTGTTTGATGCTGATGAACGGTGGGGATGAGGCTGAACTGGCAGGTGGGCCCGGTGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTGGCATATGATTTGGAAAACAATCGGGTCGGGTTCGCCCGGCGACAGTGTTCAACCCTTTGGGACAGCTTGAACCGCAGTTAA

mRNA sequence

ATGGCTTCCCCTGTTTTTCTCTTCCTACTCTGTTTTCTCCTTTCCTCCCCTGTTTTCTCCTCACAGCTTCTTCTTTTACCTCTGTCTAATTCCTTATCATCCTCTTCTGATTTCAACAACACCCACAACCTCCTTAAATCCACCGCCGCCCGCTCTTCCGCCCGCTTCCACCACCGCCGTCGTACCCACCACCGCAGCCACCTCTCTCTGCCCCTTTCCCCCGGTGGCGATTATACTCTCTCCTTCAACCTTGGTTCCGAGTCTCAAAAGATTTCCCTCTATATGGACACTGGAAGCGACCTTGTCTGGTTCCCCTGTTCCCCATTTGAGTGTATTCTCTGTGAAGGCAAACCCAAAATTCAATCCCCTTTGCCCAAAATCGCAGATAAAAAATCAGTTTCCTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGGGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCCCGATGTCCACTTGAATCCATTGAAGTTTCTGAGTGTTCTTCTTTTTCTTGTCCGCCGTTTTATTATGCTTATGGCGATGGGAGTTTAATTGGTCGGCTTTATAGAGACAGTCTCAGTTTGCCCGCGCCGGCGCCGGCACCGTCACCGGCGATTAATGTTCGGAATTTTACTTTTGGGTGTGCCCACTCAGCGTTAGGCGAGCCAATCGGTGTCGCCGGATTCGGCCGAGGGTTGTTGTCGATGCCGAGTCAACTCGCTACTTTCTCGCCTCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCGGACCGAGTTCGCCGCCCGAGTCCGCTGATTCTCGGCCGGTACTACGGCAGCGAGACGGAGTTTATTTACACTTCCTTGCTTGAGAATCCAAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGGATTTCAGTTGGGTCGGTGAGGATTCCGGCGCCGGAGTTTTTGAAAAGGGTGGATGAGGGTGGCAGCGGCGGCGTTGTGGTGGATTCCGGTACTACTTTTACTATGTTGCCAGCAGGTTTGTATAACTCGGTGGTAGCCCAGTTTGAGAACCGGACCGGGCGAGTTGCAAGCCGGGCGAGTCGGATTGAAGAAAACACCGGATTGAGCCCTTGCTATTACTACGAGAACTCAGTGGAAGTGCCACGTGTCGTGTTACATTTCGTTGGGGAGAAATCCAGTGTGGTGCTTCCTAGAAAGAATTATTTCTACGAGTTTTTGGACGGTGGAGATGGGGTGGAGAGAAAGAGAAAAGTCGGGTGTTTGATGCTGATGAACGGTGGGGATGAGGCTGAACTGGCAGGTGGGCCCGGTGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTGGCATATGATTTGGAAAACAATCGGGTCGGGTTCGCCCGGCGACAGTGTTCAACCCTTTGGGACAGCTTGAACCGCAGTTAA

Coding sequence (CDS)

ATGGCTTCCCCTGTTTTTCTCTTCCTACTCTGTTTTCTCCTTTCCTCCCCTGTTTTCTCCTCACAGCTTCTTCTTTTACCTCTGTCTAATTCCTTATCATCCTCTTCTGATTTCAACAACACCCACAACCTCCTTAAATCCACCGCCGCCCGCTCTTCCGCCCGCTTCCACCACCGCCGTCGTACCCACCACCGCAGCCACCTCTCTCTGCCCCTTTCCCCCGGTGGCGATTATACTCTCTCCTTCAACCTTGGTTCCGAGTCTCAAAAGATTTCCCTCTATATGGACACTGGAAGCGACCTTGTCTGGTTCCCCTGTTCCCCATTTGAGTGTATTCTCTGTGAAGGCAAACCCAAAATTCAATCCCCTTTGCCCAAAATCGCAGATAAAAAATCAGTTTCCTGCAGCGCCGCCGCATGCTCCGCCGCCCACGGGGGCTCCCTCTCCGCCTCCCACCTCTGTGCAATTTCCCGATGTCCACTTGAATCCATTGAAGTTTCTGAGTGTTCTTCTTTTTCTTGTCCGCCGTTTTATTATGCTTATGGCGATGGGAGTTTAATTGGTCGGCTTTATAGAGACAGTCTCAGTTTGCCCGCGCCGGCGCCGGCACCGTCACCGGCGATTAATGTTCGGAATTTTACTTTTGGGTGTGCCCACTCAGCGTTAGGCGAGCCAATCGGTGTCGCCGGATTCGGCCGAGGGTTGTTGTCGATGCCGAGTCAACTCGCTACTTTCTCGCCTCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCGGACCGAGTTCGCCGCCCGAGTCCGCTGATTCTCGGCCGGTACTACGGCAGCGAGACGGAGTTTATTTACACTTCCTTGCTTGAGAATCCAAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGGATTTCAGTTGGGTCGGTGAGGATTCCGGCGCCGGAGTTTTTGAAAAGGGTGGATGAGGGTGGCAGCGGCGGCGTTGTGGTGGATTCCGGTACTACTTTTACTATGTTGCCAGCAGGTTTGTATAACTCGGTGGTAGCCCAGTTTGAGAACCGGACCGGGCGAGTTGCAAGCCGGGCGAGTCGGATTGAAGAAAACACCGGATTGAGCCCTTGCTATTACTACGAGAACTCAGTGGAAGTGCCACGTGTCGTGTTACATTTCGTTGGGGAGAAATCCAGTGTGGTGCTTCCTAGAAAGAATTATTTCTACGAGTTTTTGGACGGTGGAGATGGGGTGGAGAGAAAGAGAAAAGTCGGGTGTTTGATGCTGATGAACGGTGGGGATGAGGCTGAACTGGCAGGTGGGCCCGGTGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTGGCATATGATTTGGAAAACAATCGGGTCGGGTTCGCCCGGCGACAGTGTTCAACCCTTTGGGACAGCTTGAACCGCAGTTAA

Protein sequence

MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVERKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLNRS
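
The relationship between the coding sequence and the protein sequence above can be spot-checked with a short script. This is a sketch only: it assumes the standard nuclear genetic code, translates just the first 30 and last 12 nucleotides of the CDS as shown on this page, and takes the protein length of 482 residues from the self-hit in the BLAST results further down. The `translate` helper is an illustrative name.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 nt and last 12 nt of the CDS listed above.
cds_start = "ATGGCTTCCCCTGTTTTTCTCTTCCTACTC"
cds_end = "AACCGCAGTTAA"

print(translate(cds_start))
print(translate(cds_end))

# Length bookkeeping: 482 residues plus one stop codon, three bases each.
protein_length = 482
expected_cds_length = (protein_length + 1) * 3
print(expected_cds_length)
```

The expected CDS length of 1449 nt equals the span implied by the Location coordinates, which is consistent with the gene, mRNA and CDS sequences on this page being identical (a single exon with no annotated UTRs).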
Homology
BLAST of Cp4.1LG14g09420 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 589.0 bits (1517), Expect = 5.0e-167
Identity = 294/479 (61.38%), Positives = 355/479 (74.11%), Query Frame = 0

Query: 24  LLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHHRSHLSLPLSPGGDYTLSFN 83
           LLL LS+SLS+S   ++  +LLKS+++RSSARF        +  LSLP+S G DY +S +
Sbjct: 29  LLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 88

Query: 84  LGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIADK-KSVSCSAAACSA 143
           +GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   ++    +VSCS+ +CSA
Sbjct: 89  VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 148

Query: 144 AHGGSLSASHLCAISRCPLESIEVSEC--SSFSCPPFYYAYGDGSLIGRLYRDSLSLPAP 203
           AH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+ +LY DSLSL   
Sbjct: 149 AH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSL--- 208

Query: 204 APAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPSQLATFSPQLGNRFSYCLVSH 263
                P+++V NFTFGCAH+ L EPIGVAGFGRG LS+P+QLA  SP LGN FSYCLVSH
Sbjct: 209 -----PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSH 268

Query: 264 SFAADRVRRPSPLILGRYYG--------------------SETEFIYTSLLENPKHPYFY 323
           SF +DRVRRPSPLILGR+                       + EF++T +LENPKHPYFY
Sbjct: 269 SFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFY 328

Query: 324 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 383
           SV L GIS+G   IPAP  L+R+D+ G GGVVVDSGTTFTMLPA  YNSVV +F++R GR
Sbjct: 329 SVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGR 388

Query: 384 VASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVER 443
           V  RA R+E ++G+SPCYY   +V+VP +VLHF G +SSV LPR+NYFYEF+DGGDG E 
Sbjct: 389 VHERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEE 448

Query: 444 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 480
           KRK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEV YDL N RVGFA+R+C++LWDSL
Sbjct: 449 KRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498
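
The percentages in each HSP header are simple ratios over the alignment length. The sketch below reproduces the figures for the Q940R4 hit above; the `percent` helper is an illustrative name, and the counts are copied from that header rather than recomputed from the alignment.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage string with 2 decimals."""
    return f"{100 * count / alignment_length:.2f}"


# Counts taken from the HSP header of the Q940R4 hit.
identities = 294        # columns where query and subject residues are identical
positives = 355         # identities plus conservative substitutions ('+' in the midline)
alignment_length = 479  # aligned columns, including gap columns

identity_pct = percent(identities, alignment_length)
positive_pct = percent(positives, alignment_length)
print(identity_pct, positive_pct)
```

Because the denominator is the alignment length rather than the query length, hits that cover only part of the 482-residue protein (for example the nepenthesin matches) report percentages over a shorter span.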

BLAST of Cp4.1LG14g09420 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 4.5e-35
Identity = 122/407 (29.98%), Positives = 183/407 (44.96%), Query Frame = 0

Query: 76  GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIADKKSVSC 135
           G+Y ++ ++G+ +Q  S  MDTGSDL+W  C P  C  C          P    + S S 
Sbjct: 93  GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQP--CTQC-----FNQSTPIFNPQGSSSF 152

Query: 136 SAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGS-LIGRLYRDS 195
           S   CS         S LC       +++    CS+  C  + Y YGDGS   G +  ++
Sbjct: 153 STLPCS---------SQLC-------QALSSPTCSNNFC-QYTYGYGDGSETQGSMGTET 212

Query: 196 LSLPAPAPAPSPAINVRNFTFGCAHS----ALGEPIGVAGFGRGLLSMPSQLATFSPQLG 255
           L+          ++++ N TFGC  +      G   G+ G GRG LS+PSQL        
Sbjct: 213 LTF--------GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV------ 272

Query: 256 NRFSYCLVSHSFAADRVRRPSPLILGRYYGSETE-FIYTSLLENPKHPYFYSVGLAGISV 315
            +FSYC+     +      PS L+LG    S T     T+L+++ + P FY + L G+SV
Sbjct: 273 TKFSYCMTPIGSST-----PSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSV 332

Query: 316 GSVRIPA-PEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASRI 375
           GS R+P  P         G+GG+++DSGTT T      Y SV  +F ++        S  
Sbjct: 333 GSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGS-- 392

Query: 376 EENTGLSPCYYY---ENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVERKRKVG 435
             ++G   C+      +++++P  V+HF G    + LP +NYF    +G         + 
Sbjct: 393 --SSGFDLCFQTPSDPSNLQIPTFVMHFDG--GDLELPSENYFISPSNG---------LI 434

Query: 436 CLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQC 473
           CL + +      +        GN QQQ   V YD  N+ V FA  QC
Sbjct: 453 CLAMGSSSQGMSI-------FGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Cp4.1LG14g09420 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 148.3 bits (373), Expect = 2.3e-34
Identity = 133/412 (32.28%), Positives = 188/412 (45.63%), Query Frame = 0

Query: 72  LSPG-GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIADK 131
           LS G G+Y     +G+ ++ + + +DTGSD+VW  C+P  C  C  +       P    +
Sbjct: 135 LSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCYSQSD-----PIFDPR 194

Query: 132 KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSL-IGR 191
           KS + +   CS+ H   L ++  C   R          C       +  +YGDGS  +G 
Sbjct: 195 KSKTYATIPCSSPHCRRLDSAG-CNTRR--------KTCL------YQVSYGDGSFTVGD 254

Query: 192 LYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVA---GFGRGLLSMPSQLATFS 251
              ++L+             V+    GC H   G  +G A   G G+G LS P Q     
Sbjct: 255 FSTETLTFRRN--------RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQT---G 314

Query: 252 PQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFYSVGLAG 311
            +   +FSYCLV  S ++    +PS ++ G    S     +T LL NPK   FY VGL G
Sbjct: 315 HRFNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIA-RFTPLLSNPKLDTFYYVGLLG 374

Query: 312 ISVGSVRIP-APEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRA 371
           ISVG  R+P     L ++D+ G+GGV++DSGT+ T L    Y ++   F  R G  A   
Sbjct: 375 ISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVG--AKTL 434

Query: 372 SRIEENTGLSPCYYYE--NSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVERKRK 431
            R  + +    C+     N V+VP VVLHF G  + V LP  NY                
Sbjct: 435 KRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLIP------------- 485

Query: 432 VGCLMLMNGGDEAELAGGPG--ATLGNYQQQGFEVAYDLENNRVGFARRQCS 474
               +  NG      AG  G  + +GN QQQGF V YDL ++RVGFA   C+
Sbjct: 495 ----VDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of Cp4.1LG14g09420 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 147.1 bits (370), Expect = 5.0e-34
Identity = 121/403 (30.02%), Positives = 172/403 (42.68%), Query Frame = 0

Query: 76  GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIADKKSVSC 135
           G+Y     +G+ ++++ L +DTGSD+ W  C P  C  C  +          +  KS++C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP--CADCYQQSDPVFNPTSSSTYKSLTC 219

Query: 136 SAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSL-IGRLYRDS 195
           SA  CS                      +E S C S  C  +  +YGDGS  +G L  D+
Sbjct: 220 SAPQCSL---------------------LETSACRSNKC-LYQVSYGDGSFTVGELATDT 279

Query: 196 LSLPAPAPAPSPAINVRNFTFGCAHSALG---EPIGVAGFGRGLLSMPSQLATFSPQLGN 255
           ++          +  + N   GC H   G      G+ G G G+LS+ +Q+   S     
Sbjct: 280 VTF-------GNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATS----- 339

Query: 256 RFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFYSVGLAGISVGS 315
            FSYCLV            + + LG   G  T      LL N K   FY VGL+G SVG 
Sbjct: 340 -FSYCLVDRDSGKSSSLDFNSVQLGG--GDAT----APLLRNKKIDTFYYVGLSGFSVGG 399

Query: 316 VRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASRIEEN 375
            ++  P+ +  VD  GSGGV++D GT  T L    YNS+   F   T  +   +S I   
Sbjct: 400 EKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSI--- 459

Query: 376 TGLSPCYYYE--NSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVERKRKVGCLML 435
           +    CY +   ++V+VP V  HF G K S+ LP KNY     D G          C   
Sbjct: 460 SLFDTCYDFSSLSTVKVPTVAFHFTGGK-SLDLPAKNYLIPVDDSG--------TFCFAF 500

Query: 436 MNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQC 473
                   +       +GN QQQG  + YDL  N +G +  +C
Sbjct: 520 APTSSSLSI-------IGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Cp4.1LG14g09420 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 6.1e-32
Identity = 125/442 (28.28%), Positives = 190/442 (42.99%), Query Frame = 0

Query: 42  HNLLKSTAARSSARFHH-RRRTHHRSHLSLPLSPG-GDYTLSFNLGSESQKISLYMDTGS 101
           + L+K    R   R           S +  P+  G G+Y ++  +G+     S  MDTGS
Sbjct: 58  YELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGS 117

Query: 102 DLVWFPCSPFECILCEGKPKIQSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRC 161
           DL+W  C P  C  C        P P    + S S S   C + +   L           
Sbjct: 118 DLIWTQCEP--CTQC-----FSQPTPIFNPQDSSSFSTLPCESQYCQDL----------- 177

Query: 162 PLESIEVSECSSFSCPPFYYAYGDGSLI-GRLYRDSLSLPAPAPAPSPAINVRNFTFGCA 221
           P E+   +EC       + Y YGDGS   G +  ++ +            +V N  FGC 
Sbjct: 178 PSETCNNNECQ------YTYGYGDGSTTQGYMATETFTFETS--------SVPNIAFGCG 237

Query: 222 HS----ALGEPIGVAGFGRGLLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLI 281
                   G   G+ G G G LS+PSQL         +FSYC+ S+  ++     PS L 
Sbjct: 238 EDNQGFGQGNGAGLIGMGWGPLSLPSQLGV------GQFSYCMTSYGSSS-----PSTLA 297

Query: 282 LGRYYGSETE-FIYTSLLENPKHPYFYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVV 341
           LG       E    T+L+ +  +P +Y + L GI+VG   +  P    ++ + G+GG+++
Sbjct: 298 LGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMII 357

Query: 342 DSGTTFTMLPAGLYNSVVAQFENRTGRVASRASRIEENTGLSPCYYYE---NSVEVPRVV 401
           DSGTT T LP   YN+V   F ++     +  +  E ++GLS C+      ++V+VP + 
Sbjct: 358 DSGTTLTYLPQDAYNAVAQAFTDQ----INLPTVDESSSGLSTCFQQPSDGSTVQVPEIS 417

Query: 402 LHFVGEKSSVVLPRKNYFYEFLDGGDGVERKRKVGCLMLMNGGDEAELAGGPGATLGNYQ 461
           + F G    + L  +N      +G         V CL +   G  ++L     +  GN Q
Sbjct: 418 MQFDG--GVLNLGEQNILISPAEG---------VICLAM---GSSSQLG---ISIFGNIQ 435

Query: 462 QQGFEVAYDLENNRVGFARRQC 473
           QQ  +V YDL+N  V F   QC
Sbjct: 478 QQETQVLYDLQNLAVSFVPTQC 435

BLAST of Cp4.1LG14g09420 vs. NCBI nr
Match: XP_023553227.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 956 bits (2470), Expect = 0.0
Identity = 482/482 (100.00%), Positives = 482/482 (100.00%), Query Frame = 0

Query: 1   MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60
           MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR
Sbjct: 1   MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60

Query: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120
           RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI
Sbjct: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120

Query: 121 QSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180
           QSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA
Sbjct: 121 QSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180

Query: 181 YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS 240
           YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS
Sbjct: 181 YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS 240

Query: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFY 300
           QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFY
Sbjct: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFY 300

Query: 301 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360
           SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR
Sbjct: 301 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360

Query: 361 VASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVER 420
           VASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVER
Sbjct: 361 VASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVER 420

Query: 421 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480
           KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN
Sbjct: 421 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480

Query: 481 RS 482
           RS
Sbjct: 481 RS 482

BLAST of Cp4.1LG14g09420 vs. NCBI nr
Match: KAG6577689.1 (putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 939 bits (2426), Expect = 0.0
Identity = 473/482 (98.13%), Positives = 478/482 (99.17%), Query Frame = 0

Query: 1   MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60
           MASPVFLFLLCFL+SSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR
Sbjct: 1   MASPVFLFLLCFLISSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60

Query: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120
           RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI
Sbjct: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120

Query: 121 QSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180
           QSPLPKI+++KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA
Sbjct: 121 QSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180

Query: 181 YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS 240
           YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS
Sbjct: 181 YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS 240

Query: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFY 300
           QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTS+LENPKHPYFY
Sbjct: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFY 300

Query: 301 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360
           SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR
Sbjct: 301 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360

Query: 361 VASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVER 420
           VASRASRIEENTGLSPCY YE SVEVPRVVLHFVGEKSSV LPRKNYFYEFLDGGDGV R
Sbjct: 361 VASRASRIEENTGLSPCYSYEKSVEVPRVVLHFVGEKSSVELPRKNYFYEFLDGGDGVGR 420

Query: 421 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480
           KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN
Sbjct: 421 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480

Query: 481 RS 482
           RS
Sbjct: 481 RS 482

BLAST of Cp4.1LG14g09420 vs. NCBI nr
Match: XP_022923540.1 (probable aspartyl protease At4g16563 [Cucurbita moschata])

HSP 1 Score: 938 bits (2424), Expect = 0.0
Identity = 473/482 (98.13%), Positives = 477/482 (98.96%), Query Frame = 0

Query: 1   MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60
           MASPVFLFLLCFL SSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR
Sbjct: 1   MASPVFLFLLCFLFSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60

Query: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120
           RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI
Sbjct: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120

Query: 121 QSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180
           QSPLPKI+++KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA
Sbjct: 121 QSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180

Query: 181 YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS 240
           YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS
Sbjct: 181 YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS 240

Query: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFY 300
           QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTS+LENPKHPYFY
Sbjct: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFY 300

Query: 301 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360
           SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR
Sbjct: 301 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360

Query: 361 VASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVER 420
           VASRASRIEENTGLSPCY YE SVEVPRVVLHFVGEKSSV LPRKNYFYEFLDGGDGV R
Sbjct: 361 VASRASRIEENTGLSPCYSYEKSVEVPRVVLHFVGEKSSVELPRKNYFYEFLDGGDGVGR 420

Query: 421 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480
           KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN
Sbjct: 421 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480

Query: 481 RS 482
           RS
Sbjct: 481 RS 482

BLAST of Cp4.1LG14g09420 vs. NCBI nr
Match: XP_023007805.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 922 bits (2382), Expect = 0.0
Identity = 466/482 (96.68%), Positives = 474/482 (98.34%), Query Frame = 0

Query: 1   MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60
           MASPVFLFLLCFLL SPVFSSQ+LLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR
Sbjct: 1   MASPVFLFLLCFLLPSPVFSSQILLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60

Query: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120
           RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI
Sbjct: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120

Query: 121 QSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180
           QSPLPKI+++KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA
Sbjct: 121 QSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180

Query: 181 YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS 240
           YGDGSLIGRLYRDSLSLPAPAP  SPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMP 
Sbjct: 181 YGDGSLIGRLYRDSLSLPAPAP--SPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPI 240

Query: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFY 300
           QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTS+LENPKHPYFY
Sbjct: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFY 300

Query: 301 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360
           SVGLAGISVGSV IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR
Sbjct: 301 SVGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360

Query: 361 VASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVER 420
           VASRAS+IEENTGLSPCYYYE SVEVPRVVLHFVGEKSSV+LPRKNYFYEFLDGGDGV R
Sbjct: 361 VASRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGR 420

Query: 421 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480
           K KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN
Sbjct: 421 KIKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480

Query: 481 RS 482
           RS
Sbjct: 481 RS 480

BLAST of Cp4.1LG14g09420 vs. NCBI nr
Match: XP_038905814.1 (probable aspartyl protease At4g16563 [Benincasa hispida])

HSP 1 Score: 882 bits (2278), Expect = 0.0
Identity = 444/483 (91.93%), Positives = 463/483 (95.86%), Query Frame = 0

Query: 1   MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHR 60
           MAS VF+ LLCFLLSSPVFSSQLLLLPLS+SLSSS SDFNNTHNLLKSTAARSSARFHHR
Sbjct: 1   MASSVFVLLLCFLLSSPVFSSQLLLLPLSHSLSSSISDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RRT H +HLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTQHHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 IQSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180
           +QSPLPKI++ KSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIE+SECSSFSCPPFYY
Sbjct: 121 VQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYY 180

Query: 181 AYGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMP 240
           AYGDGSLI RLYRDSLSLPAPAP  SPAINVRNFTFGCAH+ALGEP+GVAGFGRG LSMP
Sbjct: 181 AYGDGSLIARLYRDSLSLPAPAP--SPAINVRNFTFGCAHTALGEPVGVAGFGRGTLSMP 240

Query: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYF 300
           SQLATFSPQLGNRFSYCLVSHSFAA+RVRRPSPLILGRYYG ETEFIYTSLLENPKHPYF
Sbjct: 241 SQLATFSPQLGNRFSYCLVSHSFAAERVRRPSPLILGRYYGGETEFIYTSLLENPKHPYF 300

Query: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTG 360
           YSVGL GISVG++ IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLY+SVVA FENRTG
Sbjct: 301 YSVGLTGISVGNMMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAAFENRTG 360

Query: 361 RVASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVE 420
           RVA+RA RIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSV+LP+KNYFYEFLDGGDGV 
Sbjct: 361 RVANRARRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVLLPKKNYFYEFLDGGDGVG 420

Query: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 480
           +KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDL  NRVGFARRQCSTLWDSL
Sbjct: 421 KKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLAKNRVGFARRQCSTLWDSL 480

Query: 481 NRS 482
           NRS
Sbjct: 481 NRS 481

BLAST of Cp4.1LG14g09420 vs. ExPASy TrEMBL
Match: A0A6J1EC44 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111431201 PE=3 SV=1)

HSP 1 Score: 938 bits (2424), Expect = 0.0
Identity = 473/482 (98.13%), Positives = 477/482 (98.96%), Query Frame = 0

Query: 1   MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60
           MASPVFLFLLCFL SSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR
Sbjct: 1   MASPVFLFLLCFLFSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60

Query: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120
           RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI
Sbjct: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120

Query: 121 QSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180
           QSPLPKI+++KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA
Sbjct: 121 QSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180

Query: 181 YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS 240
           YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS
Sbjct: 181 YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS 240

Query: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFY 300
           QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTS+LENPKHPYFY
Sbjct: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFY 300

Query: 301 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360
           SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR
Sbjct: 301 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360

Query: 361 VASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVER 420
           VASRASRIEENTGLSPCY YE SVEVPRVVLHFVGEKSSV LPRKNYFYEFLDGGDGV R
Sbjct: 361 VASRASRIEENTGLSPCYSYEKSVEVPRVVLHFVGEKSSVELPRKNYFYEFLDGGDGVGR 420

Query: 421 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480
           KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN
Sbjct: 421 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480

Query: 481 RS 482
           RS
Sbjct: 481 RS 482

BLAST of Cp4.1LG14g09420 vs. ExPASy TrEMBL
Match: A0A6J1L3Z9 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111500303 PE=3 SV=1)

HSP 1 Score: 922 bits (2382), Expect = 0.0
Identity = 466/482 (96.68%), Positives = 474/482 (98.34%), Query Frame = 0

Query: 1   MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60
           MASPVFLFLLCFLL SPVFSSQ+LLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR
Sbjct: 1   MASPVFLFLLCFLLPSPVFSSQILLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRR 60

Query: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120
           RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI
Sbjct: 61  RTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKI 120

Query: 121 QSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180
           QSPLPKI+++KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA
Sbjct: 121 QSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYA 180

Query: 181 YGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPS 240
           YGDGSLIGRLYRDSLSLPAPAP  SPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMP 
Sbjct: 181 YGDGSLIGRLYRDSLSLPAPAP--SPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPI 240

Query: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFY 300
           QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTS+LENPKHPYFY
Sbjct: 241 QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFY 300

Query: 301 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360
           SVGLAGISVGSV IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR
Sbjct: 301 SVGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 360

Query: 361 VASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVER 420
           VASRAS+IEENTGLSPCYYYE SVEVPRVVLHFVGEKSSV+LPRKNYFYEFLDGGDGV R
Sbjct: 361 VASRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGR 420

Query: 421 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480
           K KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN
Sbjct: 421 KIKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 480

Query: 481 RS 482
           RS
Sbjct: 481 RS 480

BLAST of Cp4.1LG14g09420 vs. ExPASy TrEMBL
Match: A0A5D3CP11 (Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1017G00280 PE=3 SV=1)

HSP 1 Score: 862 bits (2227), Expect = 3.62e-314
Identity = 434/484 (89.67%), Positives = 456/484 (94.21%), Query Frame = 0

Query: 3   SPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHRRR 62
           SPVF+FLLCFLLSSPVFSSQ+ LLPLS+SLSSS SDFN+THNLLKSTA RSSARFH    
Sbjct: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR--- 63

Query: 63  THHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 122
            H  +HLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPKIQ
Sbjct: 64  -HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123

Query: 123 SPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAY 182
           SPLPKI++ KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIE+SECSSFSCPPFYYAY
Sbjct: 124 SPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183

Query: 183 GDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPSQ 242
           GDGSL+ RLYRDSLSLP PAPAPSP INVRNFTFGCAH+ LGEP+GVAGFGRG+LSMPSQ
Sbjct: 184 GDGSLVARLYRDSLSLPTPAPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQ 243

Query: 243 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFYS 302
           LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+  ETEFIYTSLLENPKHPYFYS
Sbjct: 244 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYS 303

Query: 303 VGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRV 362
           VGLAGISVG+VRIPAPEFL++VDE GSGGVVVDSGTTFTMLP+GLY SVVA+FENRTG+V
Sbjct: 304 VGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKV 363

Query: 363 ASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVE-- 422
           A+RA RIEENTGLSPCYYY+NSV VPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGV   
Sbjct: 364 ANRARRIEENTGLSPCYYYQNSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEV 423

Query: 423 -RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDS 482
            RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWD+
Sbjct: 424 GRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDN 483

BLAST of Cp4.1LG14g09420 vs. ExPASy TrEMBL
Match: A0A0A0L5I7 (Pepsin A OS=Cucumis sativus OX=3659 GN=Csa_3G020060 PE=3 SV=1)

HSP 1 Score: 860 bits (2223), Expect = 1.30e-313
Identity = 436/482 (90.46%), Positives = 454/482 (94.19%), Query Frame = 0

Query: 3   SPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHRRR 62
           SPVF+FLLCFLLSSPVFSSQ+ LLPLS+SLSSS SDFNNTHNLLKSTA RSSARFH    
Sbjct: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR--- 63

Query: 63  THHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 122
            H  +HLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPKIQ
Sbjct: 64  -HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123

Query: 123 SPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAY 182
           SPLPKIA+ KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIE+SECSSFSCPPFYYAY
Sbjct: 124 SPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183

Query: 183 GDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPSQ 242
           GDGSL+ RLYRDSLSLP PAP  SP INVRNFTFGCAH+ LGEP+GVAGFGRG+LSMPSQ
Sbjct: 184 GDGSLVARLYRDSLSLPTPAP--SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQ 243

Query: 243 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFYS 302
           LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYY  ETEFIYTSLLENPKHPYFYS
Sbjct: 244 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYS 303

Query: 303 VGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRV 362
           VGLAGISVG++RIPAPEFL +VDEGGSGGVVVDSGTTFTMLPAGLY SVVA+FENRTG+V
Sbjct: 304 VGLAGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKV 363

Query: 363 ASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVE-R 422
           A+RA RIEENTGLSPCYYYENSV VPRVVLHFVGEKS+VVLPRKNYFYEFLDGGDGV  R
Sbjct: 364 ANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGR 423

Query: 423 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLN 482
           KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWD+LN
Sbjct: 424 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLN 479
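
The Identity and Positives counts in each HSP header can be recovered from the middle line of the alignment: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below illustrates that counting on a short fragment of the Query 123 row above. The function name `score_row` is hypothetical, and it assumes the three strings have already been cut to the same length (trailing spaces in the middle line are often stripped in copied text).

```python
def score_row(query: str, midline: str, sbjct: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment row."""
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    # A letter on the middle line means query and subject residues are identical.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives count identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Fragment of the Query 123 row of the A0A0A0L5I7 alignment.
query = "SPLPKIADKKSV"
midline = "SPLPKIA+ KSV"
sbjct = "SPLPKIANNKSV"

print(score_row(query, midline, sbjct))  # (10, 11, 12)
```

Summing these per-row counts over all rows of an HSP should give the numerators reported in the header, provided the rows are extracted without losing padding.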

BLAST of Cp4.1LG14g09420 vs. ExPASy TrEMBL
Match: A0A1S3BK28 (aspartic proteinase nepenthesin-1 OS=Cucumis melo OX=3656 GN=LOC103490888 PE=3 SV=1)

HSP 1 Score: 854 bits (2206), Expect = 5.26e-311
Identity = 433/484 (89.46%), Positives = 454/484 (93.80%), Query Frame = 0

Query: 3   SPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHRRR 62
           SPVF+FLLCFLLSSPVFSSQ+ LLPLS+SLSSS SDFN+THNLLKSTA RSSARFH    
Sbjct: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR--- 63

Query: 63  THHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 122
            H  +HLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPKIQ
Sbjct: 64  -HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123

Query: 123 SPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAY 182
           SPLPKI++ KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIE+SECSSFSCPPFYYAY
Sbjct: 124 SPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183

Query: 183 GDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPSQ 242
           GDGSL+ RLYRDSLSLP PAP  SP INVRNFTFGCAH+ LGEP+GVAGFGRG+LSMPSQ
Sbjct: 184 GDGSLVARLYRDSLSLPTPAP--SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQ 243

Query: 243 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFYS 302
           LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+  ETEFIYTSLLENPKHPYFYS
Sbjct: 244 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYS 303

Query: 303 VGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRV 362
           VGLAGISVG+VRIPAPEFL++VDE GSGGVVVDSGTTFTMLP+GLY SVVA+FENRTG+V
Sbjct: 304 VGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKV 363

Query: 363 ASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVE-- 422
           A+RA RIEENTGLSPCYYYENSV VPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGV   
Sbjct: 364 ANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEV 423

Query: 423 -RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDS 482
            RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWD+
Sbjct: 424 GRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDN 481
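
The percentages in an HSP header are the matched count divided by the alignment length, for example 433/484 = 89.46% for the Cucumis melo hit above. The following sketch re-derives them from a header line as a sanity check. The name `check_stats` and the regular expression are assumptions made for this illustration, not part of any BLAST tool; the pattern accepts both "Positives" and the "Postives" spelling that some result pages emit.

```python
import re

# Matches "Identity = a/b (x%), Positives = c/d (y%)"; "Posi?tives" also
# accepts the misspelling "Postives".
STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_stats(line: str) -> dict:
    """Recompute identity/positives percentages from an HSP header line."""
    m = STATS.search(line)
    if m is None:
        raise ValueError("no Identity/Positives statistics found")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        "reported": (float(ident_pct), float(pos_pct)),
    }


line = "Identity = 433/484 (89.46%), Positives = 454/484 (93.80%), Query Frame = 0"
print(check_stats(line))
```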

BLAST of Cp4.1LG14g09420 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 589.0 bits (1517), Expect = 3.6e-168
Identity = 294/479 (61.38%), Positives = 355/479 (74.11%), Query Frame = 0

Query: 24  LLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHHRSHLSLPLSPGGDYTLSFN 83
           LLL LS+SLS+S   ++  +LLKS+++RSSARF        +  LSLP+S G DY +S +
Sbjct: 29  LLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISLS 88

Query: 84  LGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIADK-KSVSCSAAACSA 143
           +GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   ++    +VSCS+ +CSA
Sbjct: 89  VGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCSA 148

Query: 144 AHGGSLSASHLCAISRCPLESIEVSEC--SSFSCPPFYYAYGDGSLIGRLYRDSLSLPAP 203
           AH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+ +LY DSLSL   
Sbjct: 149 AH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSL--- 208

Query: 204 APAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPSQLATFSPQLGNRFSYCLVSH 263
                P+++V NFTFGCAH+ L EPIGVAGFGRG LS+P+QLA  SP LGN FSYCLVSH
Sbjct: 209 -----PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSH 268

Query: 264 SFAADRVRRPSPLILGRYYG--------------------SETEFIYTSLLENPKHPYFY 323
           SF +DRVRRPSPLILGR+                       + EF++T +LENPKHPYFY
Sbjct: 269 SFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFY 328

Query: 324 SVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGR 383
           SV L GIS+G   IPAP  L+R+D+ G GGVVVDSGTTFTMLPA  YNSVV +F++R GR
Sbjct: 329 SVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGR 388

Query: 384 VASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVER 443
           V  RA R+E ++G+SPCYY   +V+VP +VLHF G +SSV LPR+NYFYEF+DGGDG E 
Sbjct: 389 VHERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEE 448

Query: 444 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 480
           KRK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEV YDL N RVGFA+R+C++LWDSL
Sbjct: 449 KRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of Cp4.1LG14g09420 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 199.5 bits (506), Expect = 6.1e-51
Identity = 167/500 (33.40%), Positives = 237/500 (47.40%), Query Frame = 0

Query: 5   VFLFLLCFLLSSPVFSSQLLLLPLSNSLSSSSDFNNTHNLLKSTAARSSARFHHRRRTHH 64
           +FLFLL  LL +    +Q       N  SSSS F     L KS+ +  + +   + R   
Sbjct: 8   LFLFLLITLLLNTTNKTQ--ARQHKNPSSSSSSF-LVLTLTKSSVSLPTPKSQTQERIKK 67

Query: 65  -RSHLSLPLSP----GGDYTLSFNLGSESQKISLYMDTGSDLVWFPCS--PFECILCEG- 124
             S + + + P       Y ++ N+G+  Q + +Y+DTGSDL W PC    F+CI C   
Sbjct: 68  PLSSVDVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDL 127

Query: 125 ------KPKIQSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECS 184
                  P + SPL      +  SC+++ C   H  S +    CA++ C +  +  S C 
Sbjct: 128 KNNDLKSPSVFSPLHSSTSFRD-SCASSFCVEIH-SSDNPFDPCAVAGCSVSMLLKSTCV 187

Query: 185 SFSCPPFYYAYGDGSLI-GRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVA 244
              CP F Y YG+G LI G L RD L         +   +V  F+FGC  S   EPIG+A
Sbjct: 188 R-PCPSFAYTYGEGGLISGILTRDILK--------ARTRDVPRFSFGCVTSTYREPIGIA 247

Query: 245 GFGRGLLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGS---ETEFI 304
           GFGRGLLS+PSQL      L   FS+C +   F  +     SPLILG    S        
Sbjct: 248 GFGRGLLSLPSQLGF----LEKGFSHCFLPFKF-VNNPNISSPLILGASALSINLTDSLQ 307

Query: 305 YTSLLENPKHPYFYSVGLAGISVGSVRIP--APEFLKRVDEGGSGGVVVDSGTTFTMLPA 364
           +T +L  P +P  Y +GL  I++G+   P   P  L++ D  G+GG++VDSGTT+T LP 
Sbjct: 308 FTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPE 367

Query: 365 GLYNSVVAQFENRTGRVASRASRIEENTGLSPCY----------YYENSVEV--PRVVLH 424
             Y+ ++   ++       RA+  E  TG   CY            EN V +  P +  H
Sbjct: 368 PFYSQLLTTLQSTI--TYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFH 427

Query: 425 FVGEKSSVVLPRKNYFYEFLDGGDGVERKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQ 473
           F+   ++++LP+ N FY      DG      V CL+  N  D      GP    G++QQQ
Sbjct: 428 FL-NNATLLLPQGNSFYAMSAPSDG----SVVQCLLFQNMEDGDY---GPAGVFGSFQQQ 478

BLAST of Cp4.1LG14g09420 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 180.6 bits (457), Expect = 2.9e-45
Identity = 158/504 (31.35%), Positives = 231/504 (45.83%), Query Frame = 0

Query: 1   MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSSSD-FNNTHNLLKSTAARSSARFH-- 60
           MAS +F F L FL  S V + +L L P S+S  S  D + +   L +S+ AR+    H  
Sbjct: 1   MASSIFFFFLIFL--SVVSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGT 60

Query: 61  ---------HRRRTHHRSHLSLPLSPG--GDYTLSFNLGSESQKISLYMDTGSDLVWFPC 120
                        T   + +  PLS    G Y++S + G+ SQ I    DTGS LVW PC
Sbjct: 61  SIKPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPC 120

Query: 121 -SPFECILCEGKPKIQSPLPKIADKKS-----VSCSAAACSAAHGGSLSASHLCAISRCP 180
            S + C  C+      + +P+   K S     + C +  C   +G ++         +C 
Sbjct: 121 TSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV---------QCR 180

Query: 181 LESIEVSECSSFSCPPFYYAYGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHS 240
                   C +  CPP+   YG GS  G L  + L          P + V +F  GC+  
Sbjct: 181 GCDPNTRNC-TVGCPPYILQYGLGSTAGVLITEKLDF--------PDLTVPDFVVGCSII 240

Query: 241 ALGEPIGVAGFGRGLLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYY- 300
           +  +P G+AGFGRG +S+PSQ+         RFS+CLVS  F    V     L  G  + 
Sbjct: 241 STRQPAGIAGFGRGPVSLPSQMNL------KRFSHCLVSRRFDDTNVTTDLDLDTGSGHN 300

Query: 301 -GSETE-FIYTSLLENPK-----HPYFYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVV 360
            GS+T    YT   +NP         +Y + L  I VG   +  P         G GG +
Sbjct: 301 SGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSI 360

Query: 361 VDSGTTFTMLPAGLYNSVVAQFENRTGRVASRASRIEENTGLSPCYYY--ENSVEVPRVV 420
           VDSG+TFT +   ++  V  +F ++     +R   +E+ TGL PC+    +  V VP ++
Sbjct: 361 VDSGSTFTFMERPVFELVAEEFASQMSNY-TREKDLEKETGLGPCFNISGKGDVTVPELI 420

Query: 421 LHFVGEKSSVVLPRKNYFYEFLDGGDGVERKRKVGCLMLMNGGDEAELAG-GPGATLGNY 474
             F G  + + LP  NYF  F+   D V       CL +++        G GP   LG++
Sbjct: 421 FEFKG-GAKLELPLSNYF-TFVGNTDTV-------CLTVVSDKTVNPSGGTGPAIILGSF 468

BLAST of Cp4.1LG14g09420 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 155.2 bits (391), Expect = 1.3e-37
Identity = 130/423 (30.73%), Positives = 188/423 (44.44%), Query Frame = 0

Query: 62  THHRSHLSLPLSPG-----GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEG 121
           T     +  PL  G     G+Y     +G  ++++ + +DTGSD+ W  C+P  C  C  
Sbjct: 127 TTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP--CADCYH 186

Query: 122 KPKIQSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPP 181
           + +        +  + +SC    C+A                     +EVSEC + +C  
Sbjct: 187 QTEPIFEPSSSSSYEPLSCDTPQCNA---------------------LEVSECRNATC-L 246

Query: 182 FYYAYGDGS-LIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVA---GFG 241
           +  +YGDGS  +G    ++L++ +          V+N   GC HS  G  +G A   G G
Sbjct: 247 YEVSYGDGSYTVGDFATETLTIGSTL--------VQNVAVGCGHSNEGLFVGAAGLLGLG 306

Query: 242 RGLLSMPSQLATFSPQLGNRFSYCLVSH-SFAADRVRRPSPLILGRYYGSETEFIYTSLL 301
            GLL++PSQL T S      FSYCLV   S +A  V   + L          + +   LL
Sbjct: 307 GGLLALPSQLNTTS------FSYCLVDRDSDSASTVDFGTSL--------SPDAVVAPLL 366

Query: 302 ENPKHPYFYSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVV 361
            N +   FY +GL GISVG   +  P+    +DE GSGG+++DSGT  T L   +YNS+ 
Sbjct: 367 RNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLR 426

Query: 362 AQFENRTGRVASRASRIEENTGLSPCYYY--ENSVEVPRVVLHFVGEKSSVVLPRKNYFY 421
             F   T  +   A     +T    CY    + +VEVP V  HF G K  + LP KNY  
Sbjct: 427 DSFVKGTLDLEKAAGVAMFDT----CYNLSAKTTVEVPTVAFHFPGGK-MLALPAKNYMI 483

Query: 422 EFLDGGDGVERKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFAR 473
                         VG   L      + L     A +GN QQQG  V +DL N+ +GF+ 
Sbjct: 487 PV----------DSVGTFCLAFAPTASSL-----AIIGNVQQQGTRVTFDLANSLIGFSS 483

BLAST of Cp4.1LG14g09420 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 148.3 bits (373), Expect = 1.6e-35
Identity = 133/412 (32.28%), Positives = 188/412 (45.63%), Query Frame = 0

Query: 72  LSPG-GDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIADK 131
           LS G G+Y     +G+ ++ + + +DTGSD+VW  C+P  C  C  +       P    +
Sbjct: 135 LSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCYSQSD-----PIFDPR 194

Query: 132 KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYYAYGDGSL-IGR 191
           KS + +   CS+ H   L ++  C   R          C       +  +YGDGS  +G 
Sbjct: 195 KSKTYATIPCSSPHCRRLDSAG-CNTRR--------KTCL------YQVSYGDGSFTVGD 254

Query: 192 LYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVA---GFGRGLLSMPSQLATFS 251
              ++L+             V+    GC H   G  +G A   G G+G LS P Q     
Sbjct: 255 FSTETLTFRRN--------RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQT---G 314

Query: 252 PQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYFYSVGLAG 311
            +   +FSYCLV  S ++    +PS ++ G    S     +T LL NPK   FY VGL G
Sbjct: 315 HRFNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIA-RFTPLLSNPKLDTFYYVGLLG 374

Query: 312 ISVGSVRIP-APEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRVASRA 371
           ISVG  R+P     L ++D+ G+GGV++DSGT+ T L    Y ++   F  R G  A   
Sbjct: 375 ISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAF--RVG--AKTL 434

Query: 372 SRIEENTGLSPCYYYE--NSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVERKRK 431
            R  + +    C+     N V+VP VVLHF G  + V LP  NY                
Sbjct: 435 KRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLIP------------- 485

Query: 432 VGCLMLMNGGDEAELAGGPG--ATLGNYQQQGFEVAYDLENNRVGFARRQCS 474
               +  NG      AG  G  + +GN QQQGF V YDL ++RVGFA   C+
Sbjct: 495 ----VDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

The following BLAST results are available for this feature:
Match Name      E-value    Identity (%)  Description
Q940R4          5.0e-167   61.38         Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656...
Q766C3          4.5e-35    29.98         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...
Q9LNJ3          2.3e-34    32.28         Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q9LS40          5.0e-34    30.02         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q766C2          6.1e-32    28.28         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Match Name      E-value    Identity (%)  Description
XP_023553227.1  0.0        100.00        probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]
KAG6577689.1    0.0        98.13         putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia]
XP_022923540.1  0.0        98.13         probable aspartyl protease At4g16563 [Cucurbita moschata]
XP_023007805.1  0.0        96.68         probable aspartyl protease At4g16563 [Cucurbita maxima]
XP_038905814.1  0.0        91.93         probable aspartyl protease At4g16563 [Benincasa hispida]
Match Name      E-value    Identity (%)  Description
A0A6J1EC44      0.0        98.13         probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114312...
A0A6J1L3Z9      0.0        96.68         probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111500303...
A0A5D3CP11      3.62e-314  89.67         Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A0A0L5I7      1.30e-313  90.46         Pepsin A OS=Cucumis sativus OX=3659 GN=Csa_3G020060 PE=3 SV=1
A0A1S3BK28      5.26e-311  89.46         aspartic proteinase nepenthesin-1 OS=Cucumis melo OX=3656 GN=LOC103490888 PE=3 S...
Match Name      E-value    Identity (%)  Description
AT4G16563.1     3.6e-168   61.38         Eukaryotic aspartyl protease family protein
AT5G45120.1     6.1e-51    33.40         Eukaryotic aspartyl protease family protein
AT3G52500.1     2.9e-45    31.35         Eukaryotic aspartyl protease family protein
AT1G25510.1     1.3e-37    30.73         Eukaryotic aspartyl protease family protein
AT1G01300.1     1.6e-35    32.28         Eukaryotic aspartyl protease family protein
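
Summary rows like these are commonly filtered by E-value and percent identity before candidate homologs are picked. A minimal sketch using the five TAIR 10 hits listed above follows. The `Hit` tuple, the `filter_hits` function, and both default thresholds are illustrative assumptions, not cut-offs used by this database.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity


# Values copied from the TAIR 10 summary rows above.
TAIR_HITS = [
    Hit("AT4G16563.1", 3.6e-168, 61.38),
    Hit("AT5G45120.1", 6.1e-51, 33.40),
    Hit("AT3G52500.1", 2.9e-45, 31.35),
    Hit("AT1G25510.1", 1.3e-37, 30.73),
    Hit("AT1G01300.1", 1.6e-35, 32.28),
]


def filter_hits(hits, max_evalue=1e-40, min_identity=30.0):
    """Keep hits passing both (hypothetical) thresholds, best E-value first."""
    kept = [h for h in hits if h.evalue <= max_evalue and h.identity >= min_identity]
    return sorted(kept, key=lambda h: h.evalue)


for hit in filter_hits(TAIR_HITS):
    print(f"{hit.name}\t{hit.evalue:.1e}\t{hit.identity:.2f}")
```

With these example thresholds only the first three TAIR 10 hits are retained; AT4G16563.1 is by far the strongest match.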
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                        Source       Source Term     Source Description            Alignment
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                coord: 68..275; e-value: 2.1E-34; score: 121.1
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10      Acid Proteases                coord: 276..478; e-value: 1.6E-47; score: 163.6
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  50630           Acid proteases                coord: 73..477
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541         TAXi_C                        coord: 299..468; e-value: 6.0E-27; score: 94.4
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543         TAXi_N                        coord: 78..263; e-value: 1.0E-28; score: 100.8
None       No IPR available                       PANTHER      PTHR47967:SF26  BNAA01G17170D PROTEIN         coord: 1..477
None       No IPR available                       PANTHER      PTHR47967       OS07G0603500 PROTEIN-RELATED  coord: 1..477
IPR001969  Aspartic peptidase, active site        PROSITE      PS00141         ASP_PROTEASE                  coord: 331..342
IPR033121  Peptidase family A1 domain             PROSITE      PS51767         PEPTIDASE_A1                  coord: 78..468; score: 31.290274
IPR034161  Pepsin-like domain, plant              CDD          cd05476         pepsin_A_like_plant           coord: 78..472; e-value: 7.81409E-68; score: 216.36
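
The `coord` ranges can be used to compute domain lengths and to see which domain a site falls in. The sketch below does this for four of the entries above, assuming the coordinates are 1-based and inclusive on the protein sequence (that convention is an assumption here, as are the helper names `span_length` and `contains`).

```python
# (start, end) residue ranges copied from the InterPro rows above.
DOMAINS = {
    "PF14543 TAXi_N": (78, 263),
    "PF14541 TAXi_C": (299, 468),
    "PS00141 ASP_PROTEASE": (331, 342),
    "cd05476 pepsin_A_like_plant": (78, 472),
}


def span_length(span):
    """Length of a 1-based inclusive (start, end) range."""
    start, end = span
    return end - start + 1


def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


active_site = DOMAINS["PS00141 ASP_PROTEASE"]
for name, span in DOMAINS.items():
    print(name, span_length(span), contains(span, active_site))
```

Under that assumption the PROSITE active-site match (331..342) lies inside the C-terminal TAXi_C domain and the CDD pepsin-like domain, but outside TAXi_N.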

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG14g09420.1  Cp4.1LG14g09420.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0005576      extracellular region
molecular_function  GO:0004190      aspartic-type endopeptidase activity
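
For downstream use, the three GO assignments can be grouped by category. The sketch below parses rows of the form "category accession name" into a dictionary. The function name `parse_go` is hypothetical, and it assumes the first two fields contain no whitespace, which holds for the rows above.

```python
GO_LINES = """\
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
molecular_function GO:0004190 aspartic-type endopeptidase activity
"""


def parse_go(text):
    """Group (accession, term name) pairs by GO category."""
    terms = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on any whitespace run; the term name may itself contain spaces.
        category, accession, name = line.split(None, 2)
        terms.setdefault(category, []).append((accession, name))
    return terms


for category, entries in parse_go(GO_LINES).items():
    print(category, entries)
```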