Cp4.1LG20g06980 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG20g06980
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Descriptionaspartic proteinase CDR1-like
LocationCp4.1LG20: 5521604 .. 5522969 (+)
RNA-Seq ExpressionCp4.1LG20g06980
SyntenyCp4.1LG20g06980
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
AAATCTAAAATCCATGCAATCATTTCTTCATCTCCCATGGCGCCTACCATATTTCTCCTTCTCGCACTGCTTTCCATTGCGGAGTCCACTATCGACAAAAACGGTGGTCTCAAGCTGGAACTCATCCGCCGCTGTGTCTCACCCGACAACGTTTCACCGATGGCAGCCAAATCACGAATTTGGCCGGAAACCAGTGAATTTATAGTGAAAATCGCTGTCGGAACGCCGTCGACGGAGGTGCATGCAATCCTCGACACTGGCAGTGATTTATTTTGGGCTCAGTGTCGTCCATGTGTGAAATGTTACCAGCAAACGAATCCGATTTACAACCCTTTGAAATCGTCAACCTTTCGAACCCTTTCTTGCAAGTCGTCGCAGTGCCATTTGAGGGGGTCCGGTGCGGCGTGCTCCAGCACCGATACGTGTAAGTATGACTATGGGTATGAAAGTGGATCTACGCAGGGAGAATTGGCGACTGAAAAAATGGTGGTAACTTCGAGGTCTGGAGCGACGACGCCGTTTCCGGAGGTGGTGTTTGGTTGTGGACATAATAATAGTGGAACGTTTAATGCGAATAAAATGGGATTGATCGGATTTGGAAGAGGAGCGATTTCCTCGTTTCTCAGGTATATATATTATCACTTTCTATTATTTTATTATTTTTAAAAACATTATTCGCATTACATATTGAAACGAACAGTTAATTATATTATTATTCAAATAACTATAAAGTTCAAAACCTTATAGTTAATGTAAATTCATATTCACAGATAGGTCCATCGGTCGGCGGCAGAAAGTTCTCCCTTTGTCTGATGCCATACAACACCGACCCAAGAATCTCAAGTAGCCTCTCAATAGGGTCGGGTTCTGAAGTTCAAGGACCCGGAGTCATCACAGCCCAACTCGTTCGAGTATCCCACCAGACATATTACTCTCTCACCCTCACGGGAATCTCTGTCGGAAAAACCCTTGTTCCATACAGTATGTCGAGACCTCCGGCCAAGGGGAACACGATTCTCGATACCAGCACGCCGCAGACTCTCCTCCCCAAAGAATTGTACGGACGATTGGCTGCCGAAGTTCAGCGGCATATCCCGTCAAAGCCCATTGATGATACACTTTGCTACAAAGATAATGTGGGGGATTTGGTGATGACCTTGCACTTCGACGGCGGCGTGGATCTGCGATTGAGTACGGTTCAGACTTTCAAAAGATGCCGGATGGATCCTTTTGCTTCACCGCGATGGGCGTTGACGACAACGATGCACTCATCGGGAACAGTATGATGGCAAATTTTTTAGTTGGGTATGATATTGACAATATGACGGTGTCGTTTAAGCCCACTGATTGCACCAAAATTGGTTGA

mRNA sequence

AAATCTAAAATCCATGCAATCATTTCTTCATCTCCCATGGCGCCTACCATATTTCTCCTTCTCGCACTGCTTTCCATTGCGGAGTCCACTATCGACAAAAACGGTGGTCTCAAGCTGGAACTCATCCGCCGCTGTGTCTCACCCGACAACGTTTCACCGATGGCAGCCAAATCACGAATTTGGCCGGAAACCAGTGAATTTATAGTGAAAATCGCTGTCGGAACGCCGTCGACGGAGGTGCATGCAATCCTCGACACTGGCAGTGATTTATTTTGGGCTCAGTGTCGTCCATGTGTGAAATGTTACCAGCAAACGAATCCGATTTACAACCCTTTGAAATCGTCAACCTTTCGAACCCTTTCTTGCAAGTCGTCGCAGTGCCATTTGAGGGGGTCCGGTGCGGCGTGCTCCAGCACCGATACGTGTAAGTATGACTATGGGTATGAAAGTGGATCTACGCAGGGAGAATTGGCGACTGAAAAAATGGTGGTAACTTCGAGGTCTGGAGCGACGACGCCGTTTCCGGAGGTGGTGTTTGGTTGTGGACATAATAATAGTGGAACGTTTAATGCGAATAAAATGGGATTGATCGGATTTGGAAGAGGAGCGATTTCCTCGTTTCTCAGTTCAAAACCTTATAGTCCATCGGTCGGCGGCAGAAAGTTCTCCCTTTGTCTGATGCCATACAACACCGACCCAAGAATCTCAAGTAGCCTCTCAATAGGGTCGGGTTCTGAAGTTCAAGGACCCGGAGTCATCACAGCCCAACTCGTTCGAGTATCCCACCAGACATATTACTCTCTCACCCTCACGGGAATCTCTGTCGGAAAAACCCTTGTTCCATACAGTATGTCGAGACCTCCGGCCAAGGGGAACACGATTCTCGATACCAGCACGCCGCAGACTCTCCTCCCCAAAGAATTGTACGGACGATTGGCTGCCGAAGTTCAGCGGCATATCCCGTCAAAGCCCATTGATGATACACTTTGCTACAAAGATAATGTGGGGGATTTGGTGATGACCTTGCACTTCGACGGCGGCGTGGATCTGCGATTGAGTACGATGCCGGATGGATCCTTTTGCTTCACCGCGATGGGCGTTGACGACAACGATGCACTCATCGGGAACAGTATGATGGCAAATTTTTTAGTTGGGTATGATATTGACAATATGACGGTGTCGTTTAAGCCCACTGATTGCACCAAAATTGGTTGA

Coding sequence (CDS)

AAATCTAAAATCCATGCAATCATTTCTTCATCTCCCATGGCGCCTACCATATTTCTCCTTCTCGCACTGCTTTCCATTGCGGAGTCCACTATCGACAAAAACGGTGGTCTCAAGCTGGAACTCATCCGCCGCTGTGTCTCACCCGACAACGTTTCACCGATGGCAGCCAAATCACGAATTTGGCCGGAAACCAGTGAATTTATAGTGAAAATCGCTGTCGGAACGCCGTCGACGGAGGTGCATGCAATCCTCGACACTGGCAGTGATTTATTTTGGGCTCAGTGTCGTCCATGTGTGAAATGTTACCAGCAAACGAATCCGATTTACAACCCTTTGAAATCGTCAACCTTTCGAACCCTTTCTTGCAAGTCGTCGCAGTGCCATTTGAGGGGGTCCGGTGCGGCGTGCTCCAGCACCGATACGTGTAAGTATGACTATGGGTATGAAAGTGGATCTACGCAGGGAGAATTGGCGACTGAAAAAATGGTGGTAACTTCGAGGTCTGGAGCGACGACGCCGTTTCCGGAGGTGGTGTTTGGTTGTGGACATAATAATAGTGGAACGTTTAATGCGAATAAAATGGGATTGATCGGATTTGGAAGAGGAGCGATTTCCTCGTTTCTCAGTTCAAAACCTTATAGTCCATCGGTCGGCGGCAGAAAGTTCTCCCTTTGTCTGATGCCATACAACACCGACCCAAGAATCTCAAGTAGCCTCTCAATAGGGTCGGGTTCTGAAGTTCAAGGACCCGGAGTCATCACAGCCCAACTCGTTCGAGTATCCCACCAGACATATTACTCTCTCACCCTCACGGGAATCTCTGTCGGAAAAACCCTTGTTCCATACAGTATGTCGAGACCTCCGGCCAAGGGGAACACGATTCTCGATACCAGCACGCCGCAGACTCTCCTCCCCAAAGAATTGTACGGACGATTGGCTGCCGAAGTTCAGCGGCATATCCCGTCAAAGCCCATTGATGATACACTTTGCTACAAAGATAATGTGGGGGATTTGGTGATGACCTTGCACTTCGACGGCGGCGTGGATCTGCGATTGAGTACGATGCCGGATGGATCCTTTTGCTTCACCGCGATGGGCGTTGACGACAACGATGCACTCATCGGGAACAGTATGATGGCAAATTTTTTAGTTGGGTATGATATTGACAATATGACGGTGTCGTTTAAGCCCACTGATTGCACCAAAATTGGTTGA

Protein sequence

KSKIHAIISSSPMAPTIFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWPETSEFIVKIAVGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGSGAACSSTDTCKYDYGYESGSTQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNANKMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGVITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRPPAKGNTILDTSTPQTLLPKELYGRLAAEVQRHIPSKPIDDTLCYKDNVGDLVMTLHFDGGVDLRLSTMPDGSFCFTAMGVDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG
Homology
BLAST of Cp4.1LG20g06980 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 238.0 bits (606), Expect = 1.8e-61
Identity = 139/358 (38.83%), Postives = 202/358 (56.42%), Query Frame = 0

Query: 64  TSEFIVKIAVGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCK 123
           + E+++ +++GTP   + AI DTGSDL W QC PC  CY Q +P+++P  SST++ +SC 
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCS 146

Query: 124 SSQCHLRGSGAACSSTD-TCKYDYGYESGS-TQGELATEKMVVTSRSGATTPFPEVVFGC 183
           SSQC    + A+CS+ D TC Y   Y   S T+G +A + + + S          ++ GC
Sbjct: 147 SSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 206

Query: 184 GHNNSGTFNANKMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSI 243
           GHNN+GTFN    G++G G G +S     K    S+ G KFS CL+P  +    +S ++ 
Sbjct: 207 GHNNAGTFNKKGSGIVGLGGGPVSLI---KQLGDSIDG-KFSYCLVPLTSKKDQTSKINF 266

Query: 244 GSGSEVQGPGVITAQLV-RVSHQTYYSLTLTGISVGKTLVPYSMS-RPPAKGNTILDTST 303
           G+ + V G GV++  L+ + S +T+Y LTL  ISVG   + YS S    ++GN I+D+ T
Sbjct: 267 GTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGT 326

Query: 304 PQTLLPKELYGRLAAEVQRHIPSKPIDD-----TLCYKDNVGDL---VMTLHFDGGVDLR 363
             TLLP E Y  L   V   I ++   D     +LCY    GDL   V+T+HFD G D++
Sbjct: 327 TLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY-SATGDLKVPVITMHFD-GADVK 386

Query: 364 LST------MPDGSFCFTAMGVDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKI 404
           L +      + +   CF   G   + ++ GN    NFLVGYD  + TVSFKPTDC K+
Sbjct: 387 LDSSNAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAKM 437

BLAST of Cp4.1LG20g06980 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 201.1 bits (510), Expect = 2.5e-50
Identity = 131/367 (35.69%), Postives = 197/367 (53.68%), Query Frame = 0

Query: 66  EFIVKIAVGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSS 125
           EF + I +GTP  +V AI DTGSDL W QC+PC +CY++  PI++  KSST+++  C S 
Sbjct: 84  EFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSR 143

Query: 126 QCH-LRGSGAAC-SSTDTCKYDYGY-ESGSTQGELATEKMVVTSRSGATTPFPEVVFGCG 185
            C  L  +   C  S + CKY Y Y +   ++G++ATE + + S SG+   FP  VFGCG
Sbjct: 144 NCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCG 203

Query: 186 HNNSGTFNANKMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIG 245
           +NN GTF+    G+IG G G + S +S    S S   +KFS CL   +     +S +++G
Sbjct: 204 YNNGGTFDETGSGIIGLGGGHL-SLISQLGSSIS---KKFSYCLSHKSATTNGTSVINLG 263

Query: 246 S----GSEVQGPGVITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSR---------PPA 305
           +     S  +  GV++  LV     TYY LTL  ISVGK  +PY+ S             
Sbjct: 264 TNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSET 323

Query: 306 KGNTILDTSTPQTLLPKELYGRLAAEVQRHIP-SKPIDD-----TLCYKD---NVGDLVM 365
            GN I+D+ T  TLL    + + ++ V+  +  +K + D     + C+K     +G   +
Sbjct: 324 SGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEIGLPEI 383

Query: 366 TLHFDGGVDLRLS------TMPDGSFCFTAMGVDDNDALIGNSMMANFLVGYDIDNMTVS 402
           T+HF  G D+RLS       + +   C + +   +  A+ GN    +FLVGYD++  TVS
Sbjct: 384 TVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTTE-VAIYGNFAQMDFLVGYDLETRTVS 443

BLAST of Cp4.1LG20g06980 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 182.6 bits (462), Expect = 9.0e-45
Identity = 121/371 (32.61%), Postives = 175/371 (47.17%), Query Frame = 0

Query: 53  PMAAKSRIWPETSEFIVKIAVGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPL 112
           P   ++ ++    E+++ +++GTP+    AI+DTGSDL W QC+PC +C+ Q+ PI+NP 
Sbjct: 81  PSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQ 140

Query: 113 KSSTFRTLSCKSSQCHLRGSGAACSSTDTCKYDYGYESGS-TQGELATEKMVVTSRSGAT 172
            SS+F TL C S  C    S     S + C+Y YGY  GS TQG + TE +   S S   
Sbjct: 141 GSSSFSTLPCSSQLCQALSSPTC--SNNFCQYTYGYGDGSETQGSMGTETLTFGSVS--- 200

Query: 173 TPFPEVVFGCGHNNSGTFNANKMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNT 232
              P + FGCG NN G    N  GL+G GRG +S           +   KFS C+ P  +
Sbjct: 201 --IPNITFGCGENNQGFGQGNGAGLVGMGRGPLS-------LPSQLDVTKFSYCMTPIGS 260

Query: 233 DPRISSSLSIGSGSEVQGPGVITAQLVRVSH-QTYYSLTLTGISVGKTLVP-----YSMS 292
                S+L +GS +     G     L++ S   T+Y +TL G+SVG T +P     ++++
Sbjct: 261 S--TPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALN 320

Query: 293 RPPAKGNTILDTSTPQTLLPKELYGRLAAEVQRHIPSKPIDDT-----LCYK-----DNV 352
                G  I+D+ T  T      Y  +  E    I    ++ +     LC++      N+
Sbjct: 321 SNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNL 380

Query: 353 GDLVMTLHFDGGVDLRLST-----MPDGSFCFTAMGVDDND-ALIGNSMMANFLVGYDID 401
                 +HFDGG DL L +      P       AMG      ++ GN    N LV YD  
Sbjct: 381 QIPTFVMHFDGG-DLELPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTG 434

BLAST of Cp4.1LG20g06980 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 3.8e-43
Identity = 127/395 (32.15%), Postives = 189/395 (47.85%), Query Frame = 0

Query: 38  KLELIRRCVSPDN----------VSPMAAKSRIWPETSEFIVKIAVGTPSTEVHAILDTG 97
           K ELI+R +               S    ++ ++    E+++ +A+GTP +   AI+DTG
Sbjct: 57  KYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTG 116

Query: 98  SDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGSGAACSSTDTCKYDYG 157
           SDL W QC PC +C+ Q  PI+NP  SS+F TL C+S  C    S   C++ + C+Y YG
Sbjct: 117 SDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPS-ETCNNNE-CQYTYG 176

Query: 158 YESGS-TQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNANKMGLIGFGRGAISS 217
           Y  GS TQG +ATE          T+  P + FGCG +N G    N  GLIG G G +S 
Sbjct: 177 YGDGSTTQGYMATETFTF-----ETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLS- 236

Query: 218 FLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGVITAQLVRVS-HQTY 277
                     +G  +FS C+  Y +     S+L++GS +     G  +  L+  S + TY
Sbjct: 237 ------LPSQLGVGQFSYCMTSYGSSS--PSTLALGSAASGVPEGSPSTTLIHSSLNPTY 296

Query: 278 YSLTLTGISVG--KTLVPYS--MSRPPAKGNTILDTSTPQTLLPKELYGRLAAEVQRHIP 337
           Y +TL GI+VG     +P S    +    G  I+D+ T  T LP++ Y  +A      I 
Sbjct: 297 YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN 356

Query: 338 SKPIDD-----TLCYKD-NVGDLV----MTLHFDGGV----DLRLSTMPDGSFCFTAMGV 397
              +D+     + C++  + G  V    +++ FDGGV    +  +   P       AMG 
Sbjct: 357 LPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISPAEGVICLAMGS 416

Query: 398 DD--NDALIGNSMMANFLVGYDIDNMTVSFKPTDC 401
                 ++ GN       V YD+ N+ VSF PT C
Sbjct: 417 SSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Cp4.1LG20g06980 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 157.1 bits (396), Expect = 4.1e-37
Identity = 113/361 (31.30%), Postives = 164/361 (45.43%), Query Frame = 0

Query: 64  TSEFIVKIAVGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCK 123
           + E+  ++ VGTP+  V+ +LDTGSD+ W QC PC +CY Q++PI++P KS T+ T+ C 
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCS 198

Query: 124 SSQCHLRGSGAACSSTDTCKYDYGYESGS-TQGELATEKMVVTSRSGATTPFPEVVFGCG 183
           S  C    S    +   TC Y   Y  GS T G+ +TE +              V  GCG
Sbjct: 199 SPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNR-----VKGVALGCG 258

Query: 184 HNNSGTFNANKMGLIGFGRGAISSFLSSKPYSPSVGGR---KFSLCLMPYNTDPRISSSL 243
           H+N G F     GL+G G+G +S       +    G R   KFS CL+  +   + SS  
Sbjct: 259 HDNEGLF-VGAAGLLGLGKGKLS-------FPGQTGHRFNQKFSYCLVDRSASSKPSS-- 318

Query: 244 SIGSGSEVQGPGVITAQLVRVSHQTYYSLTLTGISVGKTLVP---YSMSRPPAKGN--TI 303
            +   + V      T  L      T+Y + L GISVG T VP    S+ +    GN   I
Sbjct: 319 VVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVI 378

Query: 304 LDTSTPQTLLPKELY------GRLAAEVQRHIPSKPIDDTLCYKDNVGDL---VMTLHFD 363
           +D+ T  T L +  Y       R+ A+  +  P   + DT     N+ ++    + LHF 
Sbjct: 379 IDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR 438

Query: 364 G------GVDLRLSTMPDGSFCFTAMGVDDNDALIGNSMMANFLVGYDIDNMTVSFKPTD 401
           G        +  +    +G FCF   G     ++IGN     F V YD+ +  V F P  
Sbjct: 439 GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGG 484

BLAST of Cp4.1LG20g06980 vs. NCBI nr
Match: XP_022933094.1 (aspartic proteinase CDR1-like [Cucurbita moschata])

HSP 1 Score: 711 bits (1835), Expect = 6.32e-257
Identity = 362/398 (90.95%), Postives = 371/398 (93.22%), Query Frame = 0

Query: 13  MAPTIFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWPETSEFIVKIA 72
           MAPTIFLLLALLSIAEST DK  GLKLELIRR VSP N+SPM AKS+IWPETSEFIVKIA
Sbjct: 1   MAPTIFLLLALLSIAESTADKGSGLKLELIRRRVSPGNISPMVAKSQIWPETSEFIVKIA 60

Query: 73  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGS 132
           VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIY+P KSSTFRTLSCKS QCHLRGS
Sbjct: 61  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGS 120

Query: 133 GAACSSTDTCKYDYGYESGSTQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNAN 192
           GAACS TDTC+YDYGYES STQGELATEKM VTSRSGATTPFPEVVFGCGHNNSGTFNAN
Sbjct: 121 GAACSGTDTCQYDYGYESRSTQGELATEKMAVTSRSGATTPFPEVVFGCGHNNSGTFNAN 180

Query: 193 KMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 252
           +MGLIGFGRGAIS F+S     PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV
Sbjct: 181 EMGLIGFGRGAIS-FVSQ--IGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 240

Query: 253 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRPPAKGNTILDTSTPQTLLPKELYGRL 312
           ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMS P AKGNTILDT TPQTLLPKELYGRL
Sbjct: 241 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSGPLAKGNTILDTGTPQTLLPKELYGRL 300

Query: 313 AAEVQRHIPSKPIDDTLCYKDNVGDLVMTLHFDGGVDLRLST------MPDGSFCFTAMG 372
           AAEV+RHIP+KPIDDTLCYKDN+GDLVMTLHFDG VDLRLST      MPDGSFCFTAMG
Sbjct: 301 AAEVRRHIPTKPIDDTLCYKDNLGDLVMTLHFDGDVDLRLSTVQTFNKMPDGSFCFTAMG 360

Query: 373 VDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 404
           VDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG
Sbjct: 361 VDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 395

BLAST of Cp4.1LG20g06980 vs. NCBI nr
Match: XP_022932203.1 (aspartic proteinase CDR1-like [Cucurbita moschata])

HSP 1 Score: 711 bits (1835), Expect = 6.32e-257
Identity = 364/398 (91.46%), Postives = 370/398 (92.96%), Query Frame = 0

Query: 13  MAPTIFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWPETSEFIVKIA 72
           MAPTIFLLLALLSIAEST DK  GLKLELIRR VSP NVSPM AKS+IWPETSEFIVKIA
Sbjct: 1   MAPTIFLLLALLSIAESTADKGSGLKLELIRRRVSPGNVSPMVAKSQIWPETSEFIVKIA 60

Query: 73  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGS 132
           VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIY+P KSSTFRTLSCKS QCHLRGS
Sbjct: 61  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGS 120

Query: 133 GAACSSTDTCKYDYGYESGSTQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNAN 192
           GAACS TDTCKYDYGYESGSTQGELATEKM VTSRSGATT FPEVVFGCGHNNSGTFNAN
Sbjct: 121 GAACSGTDTCKYDYGYESGSTQGELATEKMAVTSRSGATTSFPEVVFGCGHNNSGTFNAN 180

Query: 193 KMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 252
           +MGLIGFGRGAIS F+S     PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV
Sbjct: 181 EMGLIGFGRGAIS-FVSQ--IGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 240

Query: 253 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRPPAKGNTILDTSTPQTLLPKELYGRL 312
           ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMS P AKGNTILDT TPQTLLPKELYGRL
Sbjct: 241 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSGPLAKGNTILDTGTPQTLLPKELYGRL 300

Query: 313 AAEVQRHIPSKPIDDTLCYKDNVGDLVMTLHFDGGVDLRLST------MPDGSFCFTAMG 372
           AAEV+RHIPSKPIDDTLCYKDN+GDLVMTLHFDG VDLRLST      MPDGSFCFTAMG
Sbjct: 301 AAEVRRHIPSKPIDDTLCYKDNLGDLVMTLHFDGDVDLRLSTVQTFNKMPDGSFCFTAMG 360

Query: 373 VDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 404
           VDDNDALIGNSMMANFLVGYDIDNMTVSFK TDCTKIG
Sbjct: 361 VDDNDALIGNSMMANFLVGYDIDNMTVSFKSTDCTKIG 395

BLAST of Cp4.1LG20g06980 vs. NCBI nr
Match: KAG6583648.1 (Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 707 bits (1824), Expect = 3.00e-255
Identity = 362/398 (90.95%), Postives = 368/398 (92.46%), Query Frame = 0

Query: 13  MAPTIFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWPETSEFIVKIA 72
           MAPTIFLLLALLSIAEST DK  GLKLELIRR VSP NVSPM AKS+IWPETSEFIVKIA
Sbjct: 1   MAPTIFLLLALLSIAESTADKGSGLKLELIRRRVSPGNVSPMVAKSQIWPETSEFIVKIA 60

Query: 73  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGS 132
           VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIY+P KSSTFRTLSCKS QCHLRGS
Sbjct: 61  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGS 120

Query: 133 GAACSSTDTCKYDYGYESGSTQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNAN 192
           GAAC  TDTCKYDYGYESGSTQGELATEKM VTSRSG TTPFPEVVFGCGH NSGTFNAN
Sbjct: 121 GAACFGTDTCKYDYGYESGSTQGELATEKMAVTSRSGVTTPFPEVVFGCGHYNSGTFNAN 180

Query: 193 KMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 252
           +MGLIGFGRGAIS F+S     PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV
Sbjct: 181 EMGLIGFGRGAIS-FVSQ--IGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 240

Query: 253 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRPPAKGNTILDTSTPQTLLPKELYGRL 312
           ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMS P AKGNTILDT TPQTLLPKELYGRL
Sbjct: 241 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSGPLAKGNTILDTGTPQTLLPKELYGRL 300

Query: 313 AAEVQRHIPSKPIDDTLCYKDNVGDLVMTLHFDGGVDLRLST------MPDGSFCFTAMG 372
           AAEV+RHIPSKPIDDTLCYKDN+GDLVMTLHF G VDLRLST      MPDGSFCFTAMG
Sbjct: 301 AAEVRRHIPSKPIDDTLCYKDNLGDLVMTLHFVGDVDLRLSTVQTFNKMPDGSFCFTAMG 360

Query: 373 VDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 404
           VDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG
Sbjct: 361 VDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 395

BLAST of Cp4.1LG20g06980 vs. NCBI nr
Match: XP_022987324.1 (aspartic proteinase CDR1-like [Cucurbita maxima])

HSP 1 Score: 680 bits (1755), Expect = 1.01e-244
Identity = 349/399 (87.47%), Postives = 360/399 (90.23%), Query Frame = 0

Query: 13  MAPTIFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWPETSEFIVKIA 72
           MAPTIFLLL LLSIAEST  K GGLKLELIRR +SP NVSPMAAKS+IWPETSEFIVKIA
Sbjct: 1   MAPTIFLLLTLLSIAESTAGKGGGLKLELIRRRLSPGNVSPMAAKSQIWPETSEFIVKIA 60

Query: 73  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGS 132
           +GTP TEVHAILDTGSDLFWAQCRPC KCYQQTNPIY+P KSSTFRTLSCKS QCHLRGS
Sbjct: 61  IGTPPTEVHAILDTGSDLFWAQCRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGS 120

Query: 133 GAACSSTDTCKYDYGYESGSTQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNAN 192
           GAACS TDTCKY YGY SGSTQGELA+EKM VTSRSGATTPFP VVFGCGHNNSGTFNAN
Sbjct: 121 GAACSGTDTCKYSYGYGSGSTQGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNAN 180

Query: 193 KMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 252
           +MGLIGFGRGAIS F+S     PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEV+GPGV
Sbjct: 181 EMGLIGFGRGAIS-FVSQ--IGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGV 240

Query: 253 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRPPAKGNTILDTSTPQTLLPKELYGRL 312
           ITAQLVR S QT YSLTLTGISV KTLVPYS S PPAKGN +LDT TP TLLPKELYGRL
Sbjct: 241 ITAQLVRTSDQTSYSLTLTGISVRKTLVPYSTSGPPAKGNAVLDTGTPPTLLPKELYGRL 300

Query: 313 AAEVQRHIPSKPIDD-TLCYKDNVGDLVMTLHFDGGVDLRLST------MPDGSFCFTAM 372
           AAEV+RHIPSKPIDD TLCYKDN+GDLVMTLHFDGGVDLRLST      MPDGSFCFTAM
Sbjct: 301 AAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTVQTFNKMPDGSFCFTAM 360

Query: 373 GVDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 404
           GVDD DALIGNSMMANFLVGYDIDNMTVSFKPTDCTK G
Sbjct: 361 GVDDKDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKAG 396

BLAST of Cp4.1LG20g06980 vs. NCBI nr
Match: KAG6601733.1 (Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 677 bits (1747), Expect = 1.67e-243
Identity = 346/399 (86.72%), Postives = 362/399 (90.73%), Query Frame = 0

Query: 13  MAPTIFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWPETSEFIVKIA 72
           MAPTIF++LALLSIAEST+ K GGLKLELI+R +SP NVSPMAAKS+IWPETSEFIVKIA
Sbjct: 1   MAPTIFIVLALLSIAESTVGKGGGLKLELIQRRLSPGNVSPMAAKSQIWPETSEFIVKIA 60

Query: 73  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGS 132
           VGTP TEVHAILDTGSDLFWAQCRPC KCY+QTNPIY+P KSSTFRTLSCKS QCHLRGS
Sbjct: 61  VGTPPTEVHAILDTGSDLFWAQCRPCAKCYRQTNPIYDPSKSSTFRTLSCKSPQCHLRGS 120

Query: 133 GAACSSTDTCKYDYGYESGSTQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNAN 192
           GAACS TDTCKY YGY SGSTQGELATEKM VTSRSGATTPF  VVFGCGHNNSGTFNAN
Sbjct: 121 GAACSGTDTCKYGYGYGSGSTQGELATEKMAVTSRSGATTPFSGVVFGCGHNNSGTFNAN 180

Query: 193 KMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 252
           +MGLIGFGRGAIS F+S     PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEV+GPGV
Sbjct: 181 EMGLIGFGRGAIS-FVSQ--IGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGV 240

Query: 253 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRPPAKGNTILDTSTPQTLLPKELYGRL 312
           ITAQLVR   QT YSLTLTGISVGKTLVPYSMS PPAKGN +LDT TP TLLPKELYGRL
Sbjct: 241 ITAQLVRTPDQTSYSLTLTGISVGKTLVPYSMSGPPAKGNAVLDTGTPPTLLPKELYGRL 300

Query: 313 AAEVQRHIPSKPIDD-TLCYKDNVGDLVMTLHFDGGVDLRLST------MPDGSFCFTAM 372
           AAEV+RHIPSKP+DD TLCYKDN+GDLVMTLHF+GGVDLRLST      M DGSFCFTAM
Sbjct: 301 AAEVRRHIPSKPVDDDTLCYKDNLGDLVMTLHFEGGVDLRLSTVQTFNKMSDGSFCFTAM 360

Query: 373 GVDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 404
           GVDD DALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG
Sbjct: 361 GVDDKDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 396

BLAST of Cp4.1LG20g06980 vs. ExPASy TrEMBL
Match: A0A6J1F107 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111438521 PE=3 SV=1)

HSP 1 Score: 711 bits (1835), Expect = 3.06e-257
Identity = 364/398 (91.46%), Postives = 370/398 (92.96%), Query Frame = 0

Query: 13  MAPTIFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWPETSEFIVKIA 72
           MAPTIFLLLALLSIAEST DK  GLKLELIRR VSP NVSPM AKS+IWPETSEFIVKIA
Sbjct: 1   MAPTIFLLLALLSIAESTADKGSGLKLELIRRRVSPGNVSPMVAKSQIWPETSEFIVKIA 60

Query: 73  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGS 132
           VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIY+P KSSTFRTLSCKS QCHLRGS
Sbjct: 61  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGS 120

Query: 133 GAACSSTDTCKYDYGYESGSTQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNAN 192
           GAACS TDTCKYDYGYESGSTQGELATEKM VTSRSGATT FPEVVFGCGHNNSGTFNAN
Sbjct: 121 GAACSGTDTCKYDYGYESGSTQGELATEKMAVTSRSGATTSFPEVVFGCGHNNSGTFNAN 180

Query: 193 KMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 252
           +MGLIGFGRGAIS F+S     PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV
Sbjct: 181 EMGLIGFGRGAIS-FVSQ--IGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 240

Query: 253 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRPPAKGNTILDTSTPQTLLPKELYGRL 312
           ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMS P AKGNTILDT TPQTLLPKELYGRL
Sbjct: 241 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSGPLAKGNTILDTGTPQTLLPKELYGRL 300

Query: 313 AAEVQRHIPSKPIDDTLCYKDNVGDLVMTLHFDGGVDLRLST------MPDGSFCFTAMG 372
           AAEV+RHIPSKPIDDTLCYKDN+GDLVMTLHFDG VDLRLST      MPDGSFCFTAMG
Sbjct: 301 AAEVRRHIPSKPIDDTLCYKDNLGDLVMTLHFDGDVDLRLSTVQTFNKMPDGSFCFTAMG 360

Query: 373 VDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 404
           VDDNDALIGNSMMANFLVGYDIDNMTVSFK TDCTKIG
Sbjct: 361 VDDNDALIGNSMMANFLVGYDIDNMTVSFKSTDCTKIG 395

BLAST of Cp4.1LG20g06980 vs. ExPASy TrEMBL
Match: A0A6J1EYS6 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111439864 PE=3 SV=1)

HSP 1 Score: 711 bits (1835), Expect = 3.06e-257
Identity = 362/398 (90.95%), Postives = 371/398 (93.22%), Query Frame = 0

Query: 13  MAPTIFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWPETSEFIVKIA 72
           MAPTIFLLLALLSIAEST DK  GLKLELIRR VSP N+SPM AKS+IWPETSEFIVKIA
Sbjct: 1   MAPTIFLLLALLSIAESTADKGSGLKLELIRRRVSPGNISPMVAKSQIWPETSEFIVKIA 60

Query: 73  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGS 132
           VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIY+P KSSTFRTLSCKS QCHLRGS
Sbjct: 61  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGS 120

Query: 133 GAACSSTDTCKYDYGYESGSTQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNAN 192
           GAACS TDTC+YDYGYES STQGELATEKM VTSRSGATTPFPEVVFGCGHNNSGTFNAN
Sbjct: 121 GAACSGTDTCQYDYGYESRSTQGELATEKMAVTSRSGATTPFPEVVFGCGHNNSGTFNAN 180

Query: 193 KMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 252
           +MGLIGFGRGAIS F+S     PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV
Sbjct: 181 EMGLIGFGRGAIS-FVSQ--IGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 240

Query: 253 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRPPAKGNTILDTSTPQTLLPKELYGRL 312
           ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMS P AKGNTILDT TPQTLLPKELYGRL
Sbjct: 241 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSGPLAKGNTILDTGTPQTLLPKELYGRL 300

Query: 313 AAEVQRHIPSKPIDDTLCYKDNVGDLVMTLHFDGGVDLRLST------MPDGSFCFTAMG 372
           AAEV+RHIP+KPIDDTLCYKDN+GDLVMTLHFDG VDLRLST      MPDGSFCFTAMG
Sbjct: 301 AAEVRRHIPTKPIDDTLCYKDNLGDLVMTLHFDGDVDLRLSTVQTFNKMPDGSFCFTAMG 360

Query: 373 VDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 404
           VDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG
Sbjct: 361 VDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 395

BLAST of Cp4.1LG20g06980 vs. ExPASy TrEMBL
Match: A0A6J1JIJ5 (aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111484909 PE=3 SV=1)

HSP 1 Score: 680 bits (1755), Expect = 4.88e-245
Identity = 349/399 (87.47%), Postives = 360/399 (90.23%), Query Frame = 0

Query: 13  MAPTIFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWPETSEFIVKIA 72
           MAPTIFLLL LLSIAEST  K GGLKLELIRR +SP NVSPMAAKS+IWPETSEFIVKIA
Sbjct: 1   MAPTIFLLLTLLSIAESTAGKGGGLKLELIRRRLSPGNVSPMAAKSQIWPETSEFIVKIA 60

Query: 73  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGS 132
           +GTP TEVHAILDTGSDLFWAQCRPC KCYQQTNPIY+P KSSTFRTLSCKS QCHLRGS
Sbjct: 61  IGTPPTEVHAILDTGSDLFWAQCRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGS 120

Query: 133 GAACSSTDTCKYDYGYESGSTQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNAN 192
           GAACS TDTCKY YGY SGSTQGELA+EKM VTSRSGATTPFP VVFGCGHNNSGTFNAN
Sbjct: 121 GAACSGTDTCKYSYGYGSGSTQGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNAN 180

Query: 193 KMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 252
           +MGLIGFGRGAIS F+S     PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEV+GPGV
Sbjct: 181 EMGLIGFGRGAIS-FVSQ--IGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGV 240

Query: 253 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRPPAKGNTILDTSTPQTLLPKELYGRL 312
           ITAQLVR S QT YSLTLTGISV KTLVPYS S PPAKGN +LDT TP TLLPKELYGRL
Sbjct: 241 ITAQLVRTSDQTSYSLTLTGISVRKTLVPYSTSGPPAKGNAVLDTGTPPTLLPKELYGRL 300

Query: 313 AAEVQRHIPSKPIDD-TLCYKDNVGDLVMTLHFDGGVDLRLST------MPDGSFCFTAM 372
           AAEV+RHIPSKPIDD TLCYKDN+GDLVMTLHFDGGVDLRLST      MPDGSFCFTAM
Sbjct: 301 AAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTVQTFNKMPDGSFCFTAM 360

Query: 373 GVDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 404
           GVDD DALIGNSMMANFLVGYDIDNMTVSFKPTDCTK G
Sbjct: 361 GVDDKDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKAG 396

BLAST of Cp4.1LG20g06980 vs. ExPASy TrEMBL
Match: A0A6J1IFH1 (aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111473895 PE=3 SV=1)

HSP 1 Score: 674 bits (1739), Expect = 1.34e-242
Identity = 347/399 (86.97%), Postives = 358/399 (89.72%), Query Frame = 0

Query: 13  MAPTIFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWPETSEFIVKIA 72
           MAPTIFLLL LLSIAEST  K GGLKLELIRR +SP NVSPMAAKS+IWPETSEFIVKIA
Sbjct: 1   MAPTIFLLLTLLSIAESTAGKGGGLKLELIRRRLSPGNVSPMAAKSQIWPETSEFIVKIA 60

Query: 73  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGS 132
           +GTP TEVHAILDTGSDLFWAQCRPC KCYQQTNPIY+P KSSTFR LSCKS QCHLRGS
Sbjct: 61  IGTPPTEVHAILDTGSDLFWAQCRPCAKCYQQTNPIYDPSKSSTFRILSCKSPQCHLRGS 120

Query: 133 GAACSSTDTCKYDYGYESGSTQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNAN 192
           GAACS TDTCKY YGY SGSTQGELA+EKMVVTSRSGATT FP VVFGCGHNNSGTFNAN
Sbjct: 121 GAACSGTDTCKYSYGYGSGSTQGELASEKMVVTSRSGATTSFPGVVFGCGHNNSGTFNAN 180

Query: 193 KMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 252
           +MGLIGFGRGAIS F+S     PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEV+GPGV
Sbjct: 181 EMGLIGFGRGAIS-FVSQ--IGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGV 240

Query: 253 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRPPAKGNTILDTSTPQTLLPKELYGRL 312
           ITAQLVR S QT YSLTLTGISV KTLVPYS S PPAKGN +LDT TP TLLPKELYGRL
Sbjct: 241 ITAQLVRTSDQTSYSLTLTGISVRKTLVPYSTSGPPAKGNAVLDTGTPPTLLPKELYGRL 300

Query: 313 AAEVQRHIPSKPIDD-TLCYKDNVGDLVMTLHFDGGVDLRLST------MPDGSFCFTAM 372
           AAEV+RHIPSKPIDD TLCYKDN+GDLVMTLHFDGGVDLRLST      MPDGSFCFTAM
Sbjct: 301 AAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTIQTFNKMPDGSFCFTAM 360

Query: 373 GVDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 404
            VDD DALIGNSMMANFLVGYDIDNMTVSFKPTDCTK G
Sbjct: 361 SVDDKDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKAG 396

BLAST of Cp4.1LG20g06980 vs. ExPASy TrEMBL
Match: A0A6J1ID07 (aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111473899 PE=3 SV=1)

HSP 1 Score: 672 bits (1735), Expect = 5.43e-242
Identity = 346/399 (86.72%), Postives = 357/399 (89.47%), Query Frame = 0

Query: 13  MAPTIFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWPETSEFIVKIA 72
           MAPTIFLLL LLSIAEST  K GGLKLELIRR +SP NVSPMAAKS+IWPETSEFIVKIA
Sbjct: 1   MAPTIFLLLTLLSIAESTAGKGGGLKLELIRRRLSPGNVSPMAAKSQIWPETSEFIVKIA 60

Query: 73  VGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSSQCHLRGS 132
           +GTP TEVHAILDTGSDLFWAQCRPC KCYQQTNPIY+P KSSTFRTLSCK  QCHLRGS
Sbjct: 61  IGTPPTEVHAILDTGSDLFWAQCRPCAKCYQQTNPIYDPSKSSTFRTLSCKLPQCHLRGS 120

Query: 133 GAACSSTDTCKYDYGYESGSTQGELATEKMVVTSRSGATTPFPEVVFGCGHNNSGTFNAN 192
           GAACS TDTCKY YGY SGSTQGELA+EKM VTSRSGATTPFP VVFGCGHNNSGTFNAN
Sbjct: 121 GAACSGTDTCKYSYGYGSGSTQGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNAN 180

Query: 193 KMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVQGPGV 252
           +MGLIGFGRGAIS F+S     PSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEV+GPGV
Sbjct: 181 EMGLIGFGRGAIS-FVSQ--IGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGV 240

Query: 253 ITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRPPAKGNTILDTSTPQTLLPKELYGRL 312
           ITAQLVR S QT YSLTLTGISV KTLVPYS S PPAKGN +LDT TP TLLPKELYGRL
Sbjct: 241 ITAQLVRTSDQTSYSLTLTGISVRKTLVPYSTSGPPAKGNAVLDTGTPPTLLPKELYGRL 300

Query: 313 AAEVQRHIPSKPIDD-TLCYKDNVGDLVMTLHFDGGVDLRLST------MPDGSFCFTAM 372
           AAEV+RHIPSKPIDD TLCYKDN+GDLVMTLHFD GVDLRLST      MPDGSFCFTAM
Sbjct: 301 AAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDDGVDLRLSTVQTFNKMPDGSFCFTAM 360

Query: 373 GVDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKIG 404
           GVD  DALIGNSMMANFLVGYDIDNMTVSFKPTDCTK G
Sbjct: 361 GVDHKDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKAG 396

BLAST of Cp4.1LG20g06980 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 238.0 bits (606), Expect = 1.3e-62
Identity = 139/358 (38.83%), Postives = 202/358 (56.42%), Query Frame = 0

Query: 64  TSEFIVKIAVGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCK 123
           + E+++ +++GTP   + AI DTGSDL W QC PC  CY Q +P+++P  SST++ +SC 
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCS 146

Query: 124 SSQCHLRGSGAACSSTD-TCKYDYGYESGS-TQGELATEKMVVTSRSGATTPFPEVVFGC 183
           SSQC    + A+CS+ D TC Y   Y   S T+G +A + + + S          ++ GC
Sbjct: 147 SSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 206

Query: 184 GHNNSGTFNANKMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSI 243
           GHNN+GTFN    G++G G G +S     K    S+ G KFS CL+P  +    +S ++ 
Sbjct: 207 GHNNAGTFNKKGSGIVGLGGGPVSLI---KQLGDSIDG-KFSYCLVPLTSKKDQTSKINF 266

Query: 244 GSGSEVQGPGVITAQLV-RVSHQTYYSLTLTGISVGKTLVPYSMS-RPPAKGNTILDTST 303
           G+ + V G GV++  L+ + S +T+Y LTL  ISVG   + YS S    ++GN I+D+ T
Sbjct: 267 GTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGT 326

Query: 304 PQTLLPKELYGRLAAEVQRHIPSKPIDD-----TLCYKDNVGDL---VMTLHFDGGVDLR 363
             TLLP E Y  L   V   I ++   D     +LCY    GDL   V+T+HFD G D++
Sbjct: 327 TLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY-SATGDLKVPVITMHFD-GADVK 386

Query: 364 LST------MPDGSFCFTAMGVDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKI 404
           L +      + +   CF   G   + ++ GN    NFLVGYD  + TVSFKPTDC K+
Sbjct: 387 LDSSNAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAKM 437

BLAST of Cp4.1LG20g06980 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 234.6 bits (597), Expect = 1.4e-61
Identity = 134/372 (36.02%), Postives = 206/372 (55.38%), Query Frame = 0

Query: 47  SPDNVSPMAAKSRIWPETSEFIVKIAVGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTN 106
           S D+ SP + +S I     E+++ I++GTP   + AI DTGSDL W QC PC  CYQQT+
Sbjct: 66  SNDDASPNSPQSFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTS 125

Query: 107 PIYNPLKSSTFRTLSCKSSQCHLRGSGAACSSTDTCKYDYGYESGS-TQGELATEKMVVT 166
           P+++P +SST+R +SC SSQC      +  +  +TC Y   Y   S T+G++A + + + 
Sbjct: 126 PLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMG 185

Query: 167 SRSGATTPFPEVVFGCGHNNSGTFNANKMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLC 226
           S          ++ GCGH N+GTF+    G+IG G G+ S     +    S+ G KFS C
Sbjct: 186 SSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLR---KSING-KFSYC 245

Query: 227 LMPYNTDPRISSSLSIGSGSEVQGPGVITAQLVRVSHQTYYSLTLTGISVGKTLVPY-SM 286
           L+P+ ++  ++S ++ G+   V G GV++  +V+    TYY L L  ISVG   + + S 
Sbjct: 246 LVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTST 305

Query: 287 SRPPAKGNTILDTSTPQTLLPKELYGRLAAEVQRHIPSKPIDD-----TLCYKDNVGDLV 346
                +GN ++D+ T  TLLP   Y  L + V   I ++ + D     +LCY+D+    V
Sbjct: 306 IFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSSFKV 365

Query: 347 --MTLHFDGGVDLRLSTM------PDGSFCFTAMGVDDNDALIGNSMMANFLVGYDIDNM 404
             +T+HF GG D++L  +       +   CF A   ++   + GN    NFLVGYD  + 
Sbjct: 366 PDITVHFKGG-DVKLGNLNTFVAVSEDVSCF-AFAANEQLTIFGNLAQMNFLVGYDTVSG 425

BLAST of Cp4.1LG20g06980 vs. TAIR 10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 210.7 bits (535), Expect = 2.2e-54
Identity = 134/365 (36.71%), Postives = 202/365 (55.34%), Query Frame = 0

Query: 66  EFIVKIAVGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSS 125
           E+ + I++GTP ++V AI DTGSDL W QC+PC +CY+Q +P+++  KSST++T SC S 
Sbjct: 84  EYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSK 143

Query: 126 QCH-LRGSGAAC-SSTDTCKYDYGYESGS-TQGELATEKMVVTSRSGATTPFPEVVFGCG 185
            C  L      C  S D CKY Y Y   S T+G++ATE + + S SG++  FP  VFGCG
Sbjct: 144 TCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGCG 203

Query: 186 HNNSGTFNANKMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIG 245
           +NN GTF     G+IG G G +S  L S+  S    G+KFS CL         +S +++G
Sbjct: 204 YNNGGTFEETGSGIIGLGGGPLS--LVSQLGSSI--GKKFSYCLSHTAATTNGTSVINLG 263

Query: 246 SGSEVQGP----GVITAQLVRVSHQTYYSLTLTGISVGKTLVPYS-----MSRPPAK--G 305
           + S    P      +T  L++   +TYY LTL  ++VGKT +PY+     ++   +K  G
Sbjct: 264 TNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTG 323

Query: 306 NTILDTSTPQTLLPKELYGRLAAEVQRHIP-SKPIDD-----TLCYKD---NVGDLVMTL 365
           N I+D+ T  TLL    Y      V+  +  +K + D     T C+K     +G   +T+
Sbjct: 324 NIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKSGDKEIGLPAITM 383

Query: 366 HFDGGVDLRLS------TMPDGSFCFTAMGVDDNDALIGNSMMANFLVGYDIDNMTVSFK 402
           HF    D++LS       + + + C + +   +  A+ GN +  +FLVGYD++  TVSF+
Sbjct: 384 HFT-NADVKLSPINAFVKLNEDTVCLSMIPTTE-VAIYGNMVQMDFLVGYDLETKTVSFQ 442

BLAST of Cp4.1LG20g06980 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 201.1 bits (510), Expect = 1.7e-51
Identity = 131/367 (35.69%), Postives = 197/367 (53.68%), Query Frame = 0

Query: 66  EFIVKIAVGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKSS 125
           EF + I +GTP  +V AI DTGSDL W QC+PC +CY++  PI++  KSST+++  C S 
Sbjct: 84  EFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSR 143

Query: 126 QCH-LRGSGAAC-SSTDTCKYDYGY-ESGSTQGELATEKMVVTSRSGATTPFPEVVFGCG 185
            C  L  +   C  S + CKY Y Y +   ++G++ATE + + S SG+   FP  VFGCG
Sbjct: 144 NCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCG 203

Query: 186 HNNSGTFNANKMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIG 245
           +NN GTF+    G+IG G G + S +S    S S   +KFS CL   +     +S +++G
Sbjct: 204 YNNGGTFDETGSGIIGLGGGHL-SLISQLGSSIS---KKFSYCLSHKSATTNGTSVINLG 263

Query: 246 S----GSEVQGPGVITAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSR---------PPA 305
           +     S  +  GV++  LV     TYY LTL  ISVGK  +PY+ S             
Sbjct: 264 TNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSET 323

Query: 306 KGNTILDTSTPQTLLPKELYGRLAAEVQRHIP-SKPIDD-----TLCYKD---NVGDLVM 365
            GN I+D+ T  TLL    + + ++ V+  +  +K + D     + C+K     +G   +
Sbjct: 324 SGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEIGLPEI 383

Query: 366 TLHFDGGVDLRLS------TMPDGSFCFTAMGVDDNDALIGNSMMANFLVGYDIDNMTVS 402
           T+HF  G D+RLS       + +   C + +   +  A+ GN    +FLVGYD++  TVS
Sbjct: 384 TVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTTE-VAIYGNFAQMDFLVGYDLETRTVS 443

BLAST of Cp4.1LG20g06980 vs. TAIR 10
Match: AT2G28040.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 189.1 bits (479), Expect = 6.9e-48
Identity = 133/415 (32.05%), Postives = 201/415 (48.43%), Query Frame = 0

Query: 17  IFLLLALLSIAESTIDKNGGLKLELIRRCVSPDNVSPMAAKSRIWP------------ET 76
           IFL +    +  +T     G  ++LI R          A+ SR++             +T
Sbjct: 10  IFLQIITYFLITTTASSPQGFTIDLIHR-------RSNASSSRVFNTQLGSPYADTVFDT 69

Query: 77  SEFIVKIAVGTPSTEVHAILDTGSDLFWAQCRPCVKCYQQTNPIYNPLKSSTFRTLSCKS 136
            E+++K+ +GTP  E+ A+LDTGS+  W QC PCV CY QT PI++P KSSTF+ + C +
Sbjct: 70  YEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT 129

Query: 137 SQCHLRGSGAACSSTDTCKYDYGYESGS-TQGELATEKMVVTSRSGATTPFPEVVFGCGH 196
                           +C Y+  Y   S T+G L TE + + S SG     PE + GCG 
Sbjct: 130 HD-------------HSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGR 189

Query: 197 NNSGTFNANKMGLIGFGRGAISSFLSSKPYSPSVGGRKFSLCLMPYNTDPRISSSLSIGS 256
           NNSG F     G++G  RG        K     +GG      LM Y    + +S ++ G+
Sbjct: 190 NNSG-FKPGFAGVVGLDRG-------PKSLITQMGGEYPG--LMSYCFAGKGTSKINFGA 249

Query: 257 GSEVQGPGVI-TAQLVRVSHQTYYSLTLTGISVGKTLVPYSMSRP--PAKGNTILDTSTP 316
            + V G GV+ T   V+ +   +Y L L  +SVG T +  ++  P    KGN ++D+ + 
Sbjct: 250 NAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIE-TVGTPFHALKGNIVIDSGST 309

Query: 317 QTLLPKELYGRLAAEVQRHIPSK--PIDDTLCYKDNVGDL--VMTLHFDGGVDLRL---- 376
            T  P+     +   V++ + +   P  D LCY     D+  V+T+HF GG DL L    
Sbjct: 310 LTYFPESYCNLVRKAVEQVVTAVRFPRSDILCYYSKTIDIFPVITMHFSGGADLVLDKYN 369

Query: 377 ---STMPDGSFCFTAM-GVDDNDALIGNSMMANFLVGYDIDNMTVSFKPTDCTKI 404
              ++   G FC   +      +A+ GN    NFLVGYD  ++ VSFKPT+C+ +
Sbjct: 370 MYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCSAL 393

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q6XBF81.8e-6138.83Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Q3EBM52.5e-5035.69Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561... [more]
Q766C39.0e-4532.61Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q766C23.8e-4332.15Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q9LNJ34.1e-3731.30Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Match NameE-valueIdentityDescription
XP_022933094.16.32e-25790.95aspartic proteinase CDR1-like [Cucurbita moschata][more]
XP_022932203.16.32e-25791.46aspartic proteinase CDR1-like [Cucurbita moschata][more]
KAG6583648.13.00e-25590.95Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_022987324.11.01e-24487.47aspartic proteinase CDR1-like [Cucurbita maxima][more]
KAG6601733.11.67e-24386.72Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia][more]
Match NameE-valueIdentityDescription
A0A6J1F1073.06e-25791.46aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111438521 PE=3... [more]
A0A6J1EYS63.06e-25790.95aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111439864 PE=3... [more]
A0A6J1JIJ54.88e-24587.47aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111484909 PE=3 S... [more]
A0A6J1IFH11.34e-24286.97aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111473895 PE=3 S... [more]
A0A6J1ID075.43e-24286.72aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111473899 PE=3 S... [more]
Match NameE-valueIdentityDescription
AT5G33340.11.3e-6238.83Eukaryotic aspartyl protease family protein [more]
AT1G64830.11.4e-6136.02Eukaryotic aspartyl protease family protein [more]
AT1G31450.12.2e-5436.71Eukaryotic aspartyl protease family protein [more]
AT2G35615.11.7e-5135.69Eukaryotic aspartyl protease family protein [more]
AT2G28040.16.9e-4832.05Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 67..243
e-value: 4.1E-46
score: 157.4
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 249..403
e-value: 4.6E-27
score: 96.8
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 46..242
e-value: 8.0E-43
score: 148.6
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 60..400
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 266..396
e-value: 3.5E-13
score: 49.6
NoneNo IPR availablePANTHERPTHR47967:SF39ASPARTYL PROTEASE FAMILY PROTEIN, PUTATIVE-RELATEDcoord: 23..402
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 23..402
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 82..93
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 67..396
score: 30.109512
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 66..400
e-value: 3.36086E-61
score: 196.715

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g06980.1Cp4.1LG20g06980.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity