CmaCh14G016410 (gene) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh14G016410
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Descriptionaspartic proteinase CDR1-like
LocationCma_Chr14: 12280023 .. 12281285 (+)
RNA-Seq ExpressionCmaCh14G016410
SyntenyCmaCh14G016410
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTACCGACTGAAGTTGGCTTCACTGCACGTTTGATTCACCGCGACTCACCATTATCACCATTTTACGATCATGTCATGTCGTACACTGCATGGATCGAGGCGACCATTCATCGTTCTAGGTCTCGGCTGAATTATCTGTATTACAACATGTTATCAAAAGATACATTAGACAATGATTTGTCACTCTCACCCACATTGGTTCATGAAGGTGGTGAATACCTTATGAGTTTCAACATTGGAAATCCTCCAAGTCAAGTGATGGGATTTGCAGACACGTCAAATGGTCTTATTTGGGTGCAATGCTCAGACTGCAATAGCCAATGTGAGCCAGAAAAAGGCCCCTTCACCAAGTTCCTCCCTTCCAAATCCTTCACCTATGAGATGGAGCCATGTGGCTCTAACGTTTGCAATTCCTTAACTGGCTTCCAGACCTGCAATTCATCAGACAGATCGTGCAAATATAGATTAGTGTATGAAGATAACTCTGAAACAAGTGGAAATCTTTCATCTGATAGTTTTAGTTTTGATACCACAGATGGTAAACATGTGGATGTTGGCTATTTGAACTTTGGCTGTTCAGAAGCTCCTTTAACAGGAGGCATGCAGAGTTACATGGGCAGTGTGGGATTGAACCAAACACCCCTGTCATTAATTTCTCAATTGGGTATCAAGAAATTCTCCTACTGCTTAGTTCCTTTCAATTTAGGCTCAACAAGTAAAATGTATTTCGGATCATTACCTGTGACTTCTGGAGGTCAAACTCCTCTGTTATATCCCAATTCAGATGCTTATTATGTGAAGGTTCTCGGAATCAGCGTCGGCGCTGATGATCCCAACTTGGAAGGAGTTTTTGATGTATATGATGTCAGAGATGGGTGGATCATAGATTCAGGAACAACATACTCAAGTCTTGAAACAGATGCATTTGATCGTTTGCTAGCTAAATTCATTACACTACCAGATCTACAGCAGAAAAAAGAGGACCCTAGGAACAGATTCGAGTTGTGTTTTGCAGCAAATGCAAATGATATGGAGACATTTCCAGGTGTTACAGTTCATTTTGATGGTGCAGAATTAATTCTTAATGTAGAAAGTACCTTTGTGAAGATAGAGGATGATGGAATTATCTGCCTTGCCCTTCTGCGCTCTGGATCTCCAGTCTCTATATTAGGGAACTTTCAGCTGCAAAACTGCCATGTTGGGTATGACCTTGAAGCTCAAGTTGTTTCCTTTGCTCCTGTTGACTGTGCTGATTCCTAA

mRNA sequence

ATGGTACCGACTGAAGTTGGCTTCACTGCACGTTTGATTCACCGCGACTCACCATTATCACCATTTTACGATCATGTCATGTCGTACACTGCATGGATCGAGGCGACCATTCATCGTTCTAGGTCTCGGCTGAATTATCTGTATTACAACATGTTATCAAAAGATACATTAGACAATGATTTGTCACTCTCACCCACATTGGTTCATGAAGGTGGTGAATACCTTATGAGTTTCAACATTGGAAATCCTCCAAGTCAAGTGATGGGATTTGCAGACACGTCAAATGGTCTTATTTGGGTGCAATGCTCAGACTGCAATAGCCAATGTGAGCCAGAAAAAGGCCCCTTCACCAAGTTCCTCCCTTCCAAATCCTTCACCTATGAGATGGAGCCATGTGGCTCTAACGTTTGCAATTCCTTAACTGGCTTCCAGACCTGCAATTCATCAGACAGATCGTGCAAATATAGATTAGTGTATGAAGATAACTCTGAAACAAGTGGAAATCTTTCATCTGATAGTTTTAGTTTTGATACCACAGATGGTAAACATGTGGATGTTGGCTATTTGAACTTTGGCTGTTCAGAAGCTCCTTTAACAGGAGGCATGCAGAGTTACATGGGCAGTGTGGGATTGAACCAAACACCCCTGTCATTAATTTCTCAATTGGGTATCAAGAAATTCTCCTACTGCTTAGTTCCTTTCAATTTAGGCTCAACAAGTAAAATGTATTTCGGATCATTACCTGTGACTTCTGGAGGTCAAACTCCTCTGTTATATCCCAATTCAGATGCTTATTATGTGAAGGTTCTCGGAATCAGCGTCGGCGCTGATGATCCCAACTTGGAAGGAGTTTTTGATGTATATGATGTCAGAGATGGGTGGATCATAGATTCAGGAACAACATACTCAAGTCTTGAAACAGATGCATTTGATCGTTTGCTAGCTAAATTCATTACACTACCAGATCTACAGCAGAAAAAAGAGGACCCTAGGAACAGATTCGAGTTGTGTTTTGCAGCAAATGCAAATGATATGGAGACATTTCCAGGTGTTACAGTTCATTTTGATGGTGCAGAATTAATTCTTAATGTAGAAAGTACCTTTGTGAAGATAGAGGATGATGGAATTATCTGCCTTGCCCTTCTGCGCTCTGGATCTCCAGTCTCTATATTAGGGAACTTTCAGCTGCAAAACTGCCATGTTGGGTATGACCTTGAAGCTCAAGTTGTTTCCTTTGCTCCTGTTGACTGTGCTGATTCCTAA

Coding sequence (CDS)

ATGGTACCGACTGAAGTTGGCTTCACTGCACGTTTGATTCACCGCGACTCACCATTATCACCATTTTACGATCATGTCATGTCGTACACTGCATGGATCGAGGCGACCATTCATCGTTCTAGGTCTCGGCTGAATTATCTGTATTACAACATGTTATCAAAAGATACATTAGACAATGATTTGTCACTCTCACCCACATTGGTTCATGAAGGTGGTGAATACCTTATGAGTTTCAACATTGGAAATCCTCCAAGTCAAGTGATGGGATTTGCAGACACGTCAAATGGTCTTATTTGGGTGCAATGCTCAGACTGCAATAGCCAATGTGAGCCAGAAAAAGGCCCCTTCACCAAGTTCCTCCCTTCCAAATCCTTCACCTATGAGATGGAGCCATGTGGCTCTAACGTTTGCAATTCCTTAACTGGCTTCCAGACCTGCAATTCATCAGACAGATCGTGCAAATATAGATTAGTGTATGAAGATAACTCTGAAACAAGTGGAAATCTTTCATCTGATAGTTTTAGTTTTGATACCACAGATGGTAAACATGTGGATGTTGGCTATTTGAACTTTGGCTGTTCAGAAGCTCCTTTAACAGGAGGCATGCAGAGTTACATGGGCAGTGTGGGATTGAACCAAACACCCCTGTCATTAATTTCTCAATTGGGTATCAAGAAATTCTCCTACTGCTTAGTTCCTTTCAATTTAGGCTCAACAAGTAAAATGTATTTCGGATCATTACCTGTGACTTCTGGAGGTCAAACTCCTCTGTTATATCCCAATTCAGATGCTTATTATGTGAAGGTTCTCGGAATCAGCGTCGGCGCTGATGATCCCAACTTGGAAGGAGTTTTTGATGTATATGATGTCAGAGATGGGTGGATCATAGATTCAGGAACAACATACTCAAGTCTTGAAACAGATGCATTTGATCGTTTGCTAGCTAAATTCATTACACTACCAGATCTACAGCAGAAAAAAGAGGACCCTAGGAACAGATTCGAGTTGTGTTTTGCAGCAAATGCAAATGATATGGAGACATTTCCAGGTGTTACAGTTCATTTTGATGGTGCAGAATTAATTCTTAATGTAGAAAGTACCTTTGTGAAGATAGAGGATGATGGAATTATCTGCCTTGCCCTTCTGCGCTCTGGATCTCCAGTCTCTATATTAGGGAACTTTCAGCTGCAAAACTGCCATGTTGGGTATGACCTTGAAGCTCAAGTTGTTTCCTTTGCTCCTGTTGACTGTGCTGATTCCTAA

Protein sequence

MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDNDLSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFLPSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTDGKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS
Homology
BLAST of CmaCh14G016410 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 241.9 bits (616), Expect = 1.3e-62
Identity = 156/424 (36.79%), Postives = 227/424 (53.54%), Query Frame = 0

Query: 5   EVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDNDLSLS 64
           ++GFTA LIHRDSP SPFY+ + + +  +   IHRS +R+    ++   K   DN     
Sbjct: 28  KLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRV----FHFTEK---DNTPQPQ 87

Query: 65  PTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFLPSKS 124
             L    GEYLM+ +IG PP  +M  ADT + L+W QC+ C+  C  +  P   F P  S
Sbjct: 88  IDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCD-DCYTQVDPL--FDPKTS 147

Query: 125 FTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTDGKHV 184
            TY+   C S+ C +L    +C+++D +C Y L Y DNS T GN++ D+ +  ++D + +
Sbjct: 148 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPM 207

Query: 185 DVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIK---KFSYCLVPF--NLGST 244
            +  +  GC         +   G VGL   P+SLI QLG     KFSYCLVP       T
Sbjct: 208 QLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQT 267

Query: 245 SKMYFGSLPVTSGG---QTPLLYPNSDA--YYVKVLGISVGADDPNLEGVFDVYDVRDGW 304
           SK+ FG+  + SG     TPL+   S    YY+ +  ISVG+      G  D        
Sbjct: 268 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSG-SDSESSEGNI 327

Query: 305 IIDSGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVH 364
           IIDSGTT + L T+ +  L     +  D  +KK+DP++   LC++A   D++  P +T+H
Sbjct: 328 IIDSGTTLTLLPTEFYSELEDAVASSID-AEKKQDPQSGLSLCYSA-TGDLKV-PVITMH 387

Query: 365 FDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAP 419
           FDGA++ L+  + FV++ +D ++C A  R     SI GN    N  VGYD  ++ VSF P
Sbjct: 388 FDGADVKLDSSNAFVQVSED-LVCFA-FRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKP 435

BLAST of CmaCh14G016410 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 215.7 bits (548), Expect = 1.0e-54
Identity = 159/438 (36.30%), Postives = 221/438 (50.46%), Query Frame = 0

Query: 8   FTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDNDLSLSPTL 67
           F+  LIHRDSPLSP Y+  ++ T  + A   RS SR     + +   D       L   L
Sbjct: 26  FSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRRFNHQLSQTD-------LQSGL 85

Query: 68  VHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFLPSKSFTY 127
           +   GE+ MS  IG PP +V   ADT + L WVQC  C  QC  E GP   F   KS TY
Sbjct: 86  IGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC-QQCYKENGPI--FDKKKSSTY 145

Query: 128 EMEPCGSNVCNSLTGFQT-CNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTDGKHVDV 187
           + EPC S  C +L+  +  C+ S+  CKYR  Y D S + G++++++ S D+  G  V  
Sbjct: 146 KSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSF 205

Query: 188 GYLNFGCSEAPLTGGMQSYMGS--VGLNQTPLSLISQLG---IKKFSYCL---------- 247
               FGC      GG     GS  +GL    LSLISQLG    KKFSYCL          
Sbjct: 206 PGTVFGCGYN--NGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGT 265

Query: 248 VPFNLGSTSKMYFGSLPVTSG-GQTPLLYPNSDAYYVKVL-GISVG---------ADDPN 307
              NLG+ S     SL   SG   TPL+      YY   L  ISVG         + +PN
Sbjct: 266 SVINLGTNS--IPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPN 325

Query: 308 LEGVFDVYDVRDGWIIDSGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAA 367
            +G+    +     IIDSGTT + LE   FD+  +         ++  DP+     CF +
Sbjct: 326 DDGILS--ETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKS 385

Query: 368 NANDMETFPGVTVHFDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCH 419
            + ++   P +TVHF GA++ L+  + FVK+ +D ++CL+++ + + V+I GNF   +  
Sbjct: 386 GSAEI-GLPEITVHFTGADVRLSPINAFVKLSED-MVCLSMVPT-TEVAIYGNFAQMDFL 444

BLAST of CmaCh14G016410 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 190.7 bits (483), Expect = 3.5e-47
Identity = 143/425 (33.65%), Postives = 207/425 (48.71%), Query Frame = 0

Query: 7   GFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDNDLSLSPT 66
           GF   L H DS  +      ++    +E  I R   RL  L       + + N  S   T
Sbjct: 40  GFQIMLEHVDSGKN------LTKFQLLERAIERGSRRLQRL-------EAMLNGPSGVET 99

Query: 67  LVHEG-GEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFLPSKSF 126
            V+ G GEYLM+ +IG P        DT + LIW QC  C +QC  +  P   F P  S 
Sbjct: 100 SVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC-TQCFNQSTPI--FNPQGSS 159

Query: 127 TYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTDGKHVD 186
           ++   PC S +C +L+   TC  S+  C+Y   Y D SET G++ +++ +F +     V 
Sbjct: 160 SFSTLPCSSQLCQALSS-PTC--SNNFCQYTYGYGDGSETQGSMGTETLTFGS-----VS 219

Query: 187 VGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTSKMYFG 246
           +  + FGC E     G  +  G VG+ + PLSL SQL + KFSYC+ P    + S +  G
Sbjct: 220 IPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLLLG 279

Query: 247 SL--PVTSGGQTPLLYPNSDA---YYVKVLGISVGAD----DPNLEGVFDVYDVRDGWII 306
           SL   VT+G     L  +S     YY+ + G+SVG+     DP+   + +  +   G II
Sbjct: 280 SLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFAL-NSNNGTGGIII 339

Query: 307 DSGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDME-TFPGVTVHF 366
           DSGTT +    +A+  +  +FI+  +L        + F+LCF   ++      P   +HF
Sbjct: 340 DSGTTLTYFVNNAYQSVRQEFISQINL-PVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHF 399

Query: 367 DGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPV 421
           DG +L L  E+ F+    +G+ICLA+  S   +SI GN Q QN  V YD    VVSFA  
Sbjct: 400 DGGDLELPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASA 437

BLAST of CmaCh14G016410 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 178.3 bits (451), Expect = 1.8e-43
Identity = 127/368 (34.51%), Postives = 185/368 (50.27%), Query Frame = 0

Query: 66  TLVHEG-GEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFLPSKS 125
           T V+ G GEYLM+  IG P S      DT + LIW QC  C +QC  +  P   F P  S
Sbjct: 87  TPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPC-TQCFSQPTPI--FNPQDS 146

Query: 126 FTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTDGKHV 185
            ++   PC S  C  L   +TCN+++  C+Y   Y D S T G +++++F+F+T+     
Sbjct: 147 SSFSTLPCESQYCQDLPS-ETCNNNE--CQYTYGYGDGSTTQGYMATETFTFETS----- 206

Query: 186 DVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTSKMYF 245
            V  + FGC E     G  +  G +G+   PLSL SQLG+ +FSYC+  +   S S +  
Sbjct: 207 SVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLAL 266

Query: 246 GSLP--VTSGG-QTPLLYP--NSDAYYVKVLGISVGADDPNL-EGVFDVY-DVRDGWIID 305
           GS    V  G   T L++   N   YY+ + GI+VG D+  +    F +  D   G IID
Sbjct: 267 GSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIID 326

Query: 306 SGTTYSSLETDAFDRLLAKF---ITLPDLQQKKEDPRNRFELCFAANANDMET-FPGVTV 365
           SGTT + L  DA++ +   F   I LP +    ++  +    CF   ++      P +++
Sbjct: 327 SGTTLTYLPQDAYNAVAQAFTDQINLPTV----DESSSGLSTCFQQPSDGSTVQVPEISM 386

Query: 366 HFDGAELILNVESTFVKIEDDGIICLALLRSGS-PVSILGNFQLQNCHVGYDLEAQVVSF 421
            FDG  L L  ++  +    +G+ICLA+  S    +SI GN Q Q   V YDL+   VSF
Sbjct: 387 QFDGGVLNLGEQNILIS-PAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSF 438

BLAST of CmaCh14G016410 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 1.4e-35
Identity = 108/355 (30.42%), Postives = 166/355 (46.76%), Query Frame = 0

Query: 72  GEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFLPSKSFTYEMEP 131
           GEY     +G P  ++    DT + + W+QC  C + C  +  P   F P+ S TY+   
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPC-ADCYQQSDPV--FNPTSSSTYKSLT 219

Query: 132 CGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTDGKHVDVGYLNF 191
           C +  C SL     C S+   C Y++ Y D S T G L++D+ +F    G    +  +  
Sbjct: 220 CSAPQC-SLLETSACRSN--KCLYQVSYGDGSFTVGELATDTVTF----GNSGKINNVAL 279

Query: 192 GC---SEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTSKMYFGSLP 251
           GC   +E   TG      G +GL    LS+ +Q+    FSYCLV  + G +S + F S+ 
Sbjct: 280 GCGHDNEGLFTGA----AGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQ 339

Query: 252 VTSGGQTPLLYPNSDA---YYVKVLGISVGADDPNL-EGVFDV-YDVRDGWIIDSGTTYS 311
           +  G  T  L  N      YYV + G SVG +   L + +FDV      G I+D GT  +
Sbjct: 340 LGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVT 399

Query: 312 SLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAE-LIL 371
            L+T A++ L   F+ L    +K     + F+ C+  ++      P V  HF G + L L
Sbjct: 400 RLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDL 459

Query: 372 NVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDC 418
             ++  + ++D G  C A   + S +SI+GN Q Q   + YDL   V+  +   C
Sbjct: 460 PAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmaCh14G016410 vs. ExPASy TrEMBL
Match: A0A6J1IXB1 (aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111479389 PE=3 SV=1)

HSP 1 Score: 867.8 bits (2241), Expect = 1.8e-248
Identity = 420/420 (100.00%), Postives = 420/420 (100.00%), Query Frame = 0

Query: 1   MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDND 60
           MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDND
Sbjct: 26  MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDND 85

Query: 61  LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL 120
           LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL
Sbjct: 86  LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL 145

Query: 121 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTD 180
           PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTD
Sbjct: 146 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTD 205

Query: 181 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 240
           GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS
Sbjct: 206 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 265

Query: 241 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGT 300
           KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGT
Sbjct: 266 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGT 325

Query: 301 TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAEL 360
           TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAEL
Sbjct: 326 TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAEL 385

Query: 361 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS 420
           ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS
Sbjct: 386 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS 445

BLAST of CmaCh14G016410 vs. ExPASy TrEMBL
Match: A0A6J1GWK9 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111457819 PE=3 SV=1)

HSP 1 Score: 819.3 bits (2115), Expect = 7.3e-234
Identity = 399/420 (95.00%), Postives = 407/420 (96.90%), Query Frame = 0

Query: 1   MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDND 60
           MVPT+VGFTARLIHRDSPLSPFY+HVM+ TA IEAT+HRSRSRLNYLYYNMLS+ TLDND
Sbjct: 26  MVPTDVGFTARLIHRDSPLSPFYNHVMTNTARIEATVHRSRSRLNYLYYNMLSRKTLDND 85

Query: 61  LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL 120
           LSLSPTLVHEGGEYLMSFNIGNP SQVMGFADTSNGLIWVQCSDCNS C+ EKGPFTK L
Sbjct: 86  LSLSPTLVHEGGEYLMSFNIGNPSSQVMGFADTSNGLIWVQCSDCNSHCDAEKGPFTKLL 145

Query: 121 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTD 180
           PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDR CKYRLVYEDNSETSG LSSDSFSFDTTD
Sbjct: 146 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRWCKYRLVYEDNSETSGTLSSDSFSFDTTD 205

Query: 181 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 240
           GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS
Sbjct: 206 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 265

Query: 241 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGT 300
           KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNL+GVFDVYDVRDGWIIDSGT
Sbjct: 266 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLDGVFDVYDVRDGWIIDSGT 325

Query: 301 TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAEL 360
           TYSSLETDAFDRLLAKF TLP+LQQKKEDPRNRFELCFAANANDMETFP VTVH DGAEL
Sbjct: 326 TYSSLETDAFDRLLAKFNTLPELQQKKEDPRNRFELCFAANANDMETFPDVTVHLDGAEL 385

Query: 361 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS 420
           ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQN HVGYDLEAQVVSFAPVDCADS
Sbjct: 386 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNYHVGYDLEAQVVSFAPVDCADS 445

BLAST of CmaCh14G016410 vs. ExPASy TrEMBL
Match: A0A5D3CXD4 (Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold209G00490 PE=3 SV=1)

HSP 1 Score: 722.2 bits (1863), Expect = 1.2e-204
Identity = 357/424 (84.20%), Postives = 384/424 (90.57%), Query Frame = 0

Query: 1   MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYY-NMLSKDTLDN 60
           MV  EVGFTARLIH DSPLSPFY+H M+ TA IEAT+HRSRSRL+YLYY N LS++TLDN
Sbjct: 3   MVSNEVGFTARLIHHDSPLSPFYNHAMTGTARIEATVHRSRSRLSYLYYINKLSENTLDN 62

Query: 61  DLSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEK-GPFTK 120
           D+SLSPTLV+EGGEYLMSFNIGNPPSQVMGF DTSNGLIWVQCS+CNSQCEPEK GP TK
Sbjct: 63  DVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGPTTK 122

Query: 121 FLPSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDT 180
           FL SKSFTYEMEPCGSN CNSLTGF+TCNSSD+ CKYRLVY DN  TSG LSSDSF FDT
Sbjct: 123 FLSSKSFTYEMEPCGSNFCNSLTGFKTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDT 182

Query: 181 TDGKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFN-LG 240
           +DGK VDVG+LNFGCSEAPLTG  QSY G VGLNQTPLSLISQLGIKKFSYCLVPFN LG
Sbjct: 183 SDGKLVDVGFLNFGCSEAPLTGDEQSYTGRVGLNQTPLSLISQLGIKKFSYCLVPFNSLG 242

Query: 241 STSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIID 300
           STSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+G D+P+ +GVFDVYDVRDGWIID
Sbjct: 243 STSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFDGVFDVYDVRDGWIID 302

Query: 301 SGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCF-AANANDMETFPGVTVHFD 360
           +G TYSSLETDAFD LLAKF+ L +  Q+K DP++RFELCF  ANAND+E+FP  TVHFD
Sbjct: 303 TGITYSSLETDAFDSLLAKFLALKNFPQRKNDPKDRFELCFELANANDLESFPDATVHFD 362

Query: 361 GAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVD 420
           GA+LILNVESTFVKIEDDGI CLALLRSGSPVSILGNFQLQN HVGYDLEAQV+SFAPVD
Sbjct: 363 GADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPVD 422

BLAST of CmaCh14G016410 vs. ExPASy TrEMBL
Match: A0A1S3AZA6 (aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC103484191 PE=3 SV=1)

HSP 1 Score: 722.2 bits (1863), Expect = 1.2e-204
Identity = 357/424 (84.20%), Postives = 384/424 (90.57%), Query Frame = 0

Query: 1   MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYY-NMLSKDTLDN 60
           MV  EVGFTARLIH DSPLSPFY+H M+ TA IEAT+HRSRSRL+YLYY N LS++TLDN
Sbjct: 17  MVSNEVGFTARLIHHDSPLSPFYNHAMTGTARIEATVHRSRSRLSYLYYINKLSENTLDN 76

Query: 61  DLSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEK-GPFTK 120
           D+SLSPTLV+EGGEYLMSFNIGNPPSQVMGF DTSNGLIWVQCS+CNSQCEPEK GP TK
Sbjct: 77  DVSLSPTLVNEGGEYLMSFNIGNPPSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGPTTK 136

Query: 121 FLPSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDT 180
           FL SKSFTYEMEPCGSN CNSLTGF+TCNSSD+ CKYRLVY DN  TSG LSSDSF FDT
Sbjct: 137 FLSSKSFTYEMEPCGSNFCNSLTGFKTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDT 196

Query: 181 TDGKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFN-LG 240
           +DGK VDVG+LNFGCSEAPLTG  QSY G VGLNQTPLSLISQLGIKKFSYCLVPFN LG
Sbjct: 197 SDGKLVDVGFLNFGCSEAPLTGDEQSYTGRVGLNQTPLSLISQLGIKKFSYCLVPFNSLG 256

Query: 241 STSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIID 300
           STSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+G D+P+ +GVFDVYDVRDGWIID
Sbjct: 257 STSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFDGVFDVYDVRDGWIID 316

Query: 301 SGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCF-AANANDMETFPGVTVHFD 360
           +G TYSSLETDAFD LLAKF+ L +  Q+K DP++RFELCF  ANAND+E+FP  TVHFD
Sbjct: 317 TGITYSSLETDAFDSLLAKFLALKNFPQRKNDPKDRFELCFELANANDLESFPDATVHFD 376

Query: 361 GAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVD 420
           GA+LILNVESTFVKIEDDGI CLALLRSGSPVSILGNFQLQN HVGYDLEAQV+SFAPVD
Sbjct: 377 GADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPVD 436

BLAST of CmaCh14G016410 vs. ExPASy TrEMBL
Match: A0A0A0L7U3 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G151520 PE=3 SV=1)

HSP 1 Score: 717.6 bits (1851), Expect = 3.0e-203
Identity = 356/424 (83.96%), Postives = 382/424 (90.09%), Query Frame = 0

Query: 1   MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYY-NMLSKDTLDN 60
           MV  EVGFTARLIH DSPLSPFY+H M+ TA IEAT+HRSRSRLNYLYY N LS++ LDN
Sbjct: 3   MVSNEVGFTARLIHHDSPLSPFYNHTMTDTARIEATVHRSRSRLNYLYYINKLSENALDN 62

Query: 61  DLSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEK-GPFTK 120
           D+SLSPTLV+EGGEYLMSFNIGNP SQVMGF DTSNGLIWVQCS+CNSQCEPEK G  TK
Sbjct: 63  DVSLSPTLVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTK 122

Query: 121 FLPSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDT 180
           FL SKSFTYEMEPCGSN CNSLTGFQTCNSSD+ CKYRLVY DN  TSG LSSDSF FDT
Sbjct: 123 FLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDT 182

Query: 181 TDGKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPF-NLG 240
           +DG  VDVG+LNFGCSEAPLTG  QSY G+VGLNQTPLSLISQLGIKKFSYCLVPF NLG
Sbjct: 183 SDGMLVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIKKFSYCLVPFNNLG 242

Query: 241 STSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIID 300
           STSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGIS+G D+P+ +GVFDVY+VRDGWIID
Sbjct: 243 STSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFDGVFDVYEVRDGWIID 302

Query: 301 SGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCF-AANANDMETFPGVTVHFD 360
           +G TYSSLETDAFD LLAKF+TL D  Q+K+DP+ RFELCF   NAND+E+FP VTVHFD
Sbjct: 303 TGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTVHFD 362

Query: 361 GAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVD 420
           GA+LILNVESTFVKIEDDGI CLALLRSGSPVSILGNFQLQN HVGYDLEAQV+SFAPVD
Sbjct: 363 GADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPVD 422

BLAST of CmaCh14G016410 vs. NCBI nr
Match: XP_022979793.1 (aspartic proteinase CDR1-like [Cucurbita maxima])

HSP 1 Score: 867.8 bits (2241), Expect = 3.7e-248
Identity = 420/420 (100.00%), Postives = 420/420 (100.00%), Query Frame = 0

Query: 1   MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDND 60
           MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDND
Sbjct: 26  MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDND 85

Query: 61  LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL 120
           LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL
Sbjct: 86  LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL 145

Query: 121 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTD 180
           PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTD
Sbjct: 146 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTD 205

Query: 181 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 240
           GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS
Sbjct: 206 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 265

Query: 241 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGT 300
           KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGT
Sbjct: 266 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGT 325

Query: 301 TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAEL 360
           TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAEL
Sbjct: 326 TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAEL 385

Query: 361 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS 420
           ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS
Sbjct: 386 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS 445

BLAST of CmaCh14G016410 vs. NCBI nr
Match: XP_023528351.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 839.7 bits (2168), Expect = 1.1e-239
Identity = 408/420 (97.14%), Postives = 414/420 (98.57%), Query Frame = 0

Query: 1   MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDND 60
           MVPTEVGFTARLIHRDSP+SPFYDHVM+ TA IEAT+HRSRSRLNYLYYNMLSK+TLDND
Sbjct: 26  MVPTEVGFTARLIHRDSPVSPFYDHVMTNTAQIEATVHRSRSRLNYLYYNMLSKNTLDND 85

Query: 61  LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL 120
           LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL
Sbjct: 86  LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL 145

Query: 121 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTD 180
           PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDR CKYRLVYEDNSETSGNLSSDSFSFDTTD
Sbjct: 146 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRWCKYRLVYEDNSETSGNLSSDSFSFDTTD 205

Query: 181 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 240
           GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS
Sbjct: 206 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 265

Query: 241 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGT 300
           KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNL+GVFDVYDVRDGWIIDSGT
Sbjct: 266 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLDGVFDVYDVRDGWIIDSGT 325

Query: 301 TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAEL 360
           TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFP VTVH DGAEL
Sbjct: 326 TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPDVTVHLDGAEL 385

Query: 361 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS 420
           ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQN HVGYDLEAQVVSFAPV+CADS
Sbjct: 386 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNYHVGYDLEAQVVSFAPVNCADS 445

BLAST of CmaCh14G016410 vs. NCBI nr
Match: KAG6582237.1 (Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 825.1 bits (2130), Expect = 2.8e-235
Identity = 402/420 (95.71%), Postives = 407/420 (96.90%), Query Frame = 0

Query: 1   MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDND 60
           MVPTEVGFTARLIHRDSPLSPFYDHVM+ TA IEAT+HRSRSRLNYLYYNMLS+ TLDND
Sbjct: 17  MVPTEVGFTARLIHRDSPLSPFYDHVMTNTARIEATVHRSRSRLNYLYYNMLSRKTLDND 76

Query: 61  LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL 120
           LSLSPTLVHEGGEYLMSFNIGNP SQVMGFADTSNGLIWVQCSDCNS CEPEKGPFTKFL
Sbjct: 77  LSLSPTLVHEGGEYLMSFNIGNPSSQVMGFADTSNGLIWVQCSDCNSHCEPEKGPFTKFL 136

Query: 121 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTD 180
           PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDR CKYRLVYEDNSETSG LSSDSFSFDTTD
Sbjct: 137 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRWCKYRLVYEDNSETSGTLSSDSFSFDTTD 196

Query: 181 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 240
           GKHVDVGYLNFGCSEAPL GGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS
Sbjct: 197 GKHVDVGYLNFGCSEAPLIGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 256

Query: 241 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGT 300
           KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVG DDPNL+GVFDVYDVRDGWIIDSGT
Sbjct: 257 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGTDDPNLDGVFDVYDVRDGWIIDSGT 316

Query: 301 TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAEL 360
           TYSSLETDAFDRLLAKF TLP+LQQKKEDPRNRFELCFAANANDMETFP VTVH DGAEL
Sbjct: 317 TYSSLETDAFDRLLAKFNTLPELQQKKEDPRNRFELCFAANANDMETFPDVTVHLDGAEL 376

Query: 361 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS 420
           ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQN HVGYDLEAQVVSFAPVDCADS
Sbjct: 377 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNYHVGYDLEAQVVSFAPVDCADS 436

BLAST of CmaCh14G016410 vs. NCBI nr
Match: KAG7018636.1 (Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 821.6 bits (2121), Expect = 3.0e-234
Identity = 401/420 (95.48%), Postives = 405/420 (96.43%), Query Frame = 0

Query: 1   MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDND 60
           MVPTEVGFTARLIHRDSPLSPFYDHVM  TA IEAT+HRSRSRLNYLYYNMLS+ TLDN 
Sbjct: 17  MVPTEVGFTARLIHRDSPLSPFYDHVMMNTARIEATVHRSRSRLNYLYYNMLSRKTLDNG 76

Query: 61  LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL 120
           LSL PTLVHEGGEYLMSFNIGNP SQVMGFADTSNGLIWVQCSDCNS CEPEKGPFTKFL
Sbjct: 77  LSLLPTLVHEGGEYLMSFNIGNPSSQVMGFADTSNGLIWVQCSDCNSHCEPEKGPFTKFL 136

Query: 121 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTD 180
           PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDR CKYRLVYEDNSETSG LSSDSFSFDTTD
Sbjct: 137 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRWCKYRLVYEDNSETSGTLSSDSFSFDTTD 196

Query: 181 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 240
           GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS
Sbjct: 197 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 256

Query: 241 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGT 300
           KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVG DDPNL+GVFDVYDVRDGWIIDSGT
Sbjct: 257 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGTDDPNLDGVFDVYDVRDGWIIDSGT 316

Query: 301 TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAEL 360
           TYSSLETDAFDRLLAKF TLP+LQQKKEDPRNRFELCFAANANDMETFP VTVH DGAEL
Sbjct: 317 TYSSLETDAFDRLLAKFNTLPELQQKKEDPRNRFELCFAANANDMETFPDVTVHLDGAEL 376

Query: 361 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS 420
           ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQN HVGYDLEAQVVSFAPVDCADS
Sbjct: 377 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNYHVGYDLEAQVVSFAPVDCADS 436

BLAST of CmaCh14G016410 vs. NCBI nr
Match: XP_022955985.1 (aspartic proteinase CDR1-like [Cucurbita moschata])

HSP 1 Score: 819.3 bits (2115), Expect = 1.5e-233
Identity = 399/420 (95.00%), Postives = 407/420 (96.90%), Query Frame = 0

Query: 1   MVPTEVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDND 60
           MVPT+VGFTARLIHRDSPLSPFY+HVM+ TA IEAT+HRSRSRLNYLYYNMLS+ TLDND
Sbjct: 26  MVPTDVGFTARLIHRDSPLSPFYNHVMTNTARIEATVHRSRSRLNYLYYNMLSRKTLDND 85

Query: 61  LSLSPTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFL 120
           LSLSPTLVHEGGEYLMSFNIGNP SQVMGFADTSNGLIWVQCSDCNS C+ EKGPFTK L
Sbjct: 86  LSLSPTLVHEGGEYLMSFNIGNPSSQVMGFADTSNGLIWVQCSDCNSHCDAEKGPFTKLL 145

Query: 121 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTD 180
           PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDR CKYRLVYEDNSETSG LSSDSFSFDTTD
Sbjct: 146 PSKSFTYEMEPCGSNVCNSLTGFQTCNSSDRWCKYRLVYEDNSETSGTLSSDSFSFDTTD 205

Query: 181 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 240
           GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS
Sbjct: 206 GKHVDVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPFNLGSTS 265

Query: 241 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLEGVFDVYDVRDGWIIDSGT 300
           KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNL+GVFDVYDVRDGWIIDSGT
Sbjct: 266 KMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISVGADDPNLDGVFDVYDVRDGWIIDSGT 325

Query: 301 TYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVHFDGAEL 360
           TYSSLETDAFDRLLAKF TLP+LQQKKEDPRNRFELCFAANANDMETFP VTVH DGAEL
Sbjct: 326 TYSSLETDAFDRLLAKFNTLPELQQKKEDPRNRFELCFAANANDMETFPDVTVHLDGAEL 385

Query: 361 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAPVDCADS 420
           ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQN HVGYDLEAQVVSFAPVDCADS
Sbjct: 386 ILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNYHVGYDLEAQVVSFAPVDCADS 445

BLAST of CmaCh14G016410 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 241.9 bits (616), Expect = 9.3e-64
Identity = 156/424 (36.79%), Postives = 227/424 (53.54%), Query Frame = 0

Query: 5   EVGFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDNDLSLS 64
           ++GFTA LIHRDSP SPFY+ + + +  +   IHRS +R+    ++   K   DN     
Sbjct: 28  KLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRV----FHFTEK---DNTPQPQ 87

Query: 65  PTLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFLPSKS 124
             L    GEYLM+ +IG PP  +M  ADT + L+W QC+ C+  C  +  P   F P  S
Sbjct: 88  IDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCD-DCYTQVDPL--FDPKTS 147

Query: 125 FTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTDGKHV 184
            TY+   C S+ C +L    +C+++D +C Y L Y DNS T GN++ D+ +  ++D + +
Sbjct: 148 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPM 207

Query: 185 DVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIK---KFSYCLVPF--NLGST 244
            +  +  GC         +   G VGL   P+SLI QLG     KFSYCLVP       T
Sbjct: 208 QLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQT 267

Query: 245 SKMYFGSLPVTSGG---QTPLLYPNSDA--YYVKVLGISVGADDPNLEGVFDVYDVRDGW 304
           SK+ FG+  + SG     TPL+   S    YY+ +  ISVG+      G  D        
Sbjct: 268 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSG-SDSESSEGNI 327

Query: 305 IIDSGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETFPGVTVH 364
           IIDSGTT + L T+ +  L     +  D  +KK+DP++   LC++A   D++  P +T+H
Sbjct: 328 IIDSGTTLTLLPTEFYSELEDAVASSID-AEKKQDPQSGLSLCYSA-TGDLKV-PVITMH 387

Query: 365 FDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFAP 419
           FDGA++ L+  + FV++ +D ++C A  R     SI GN    N  VGYD  ++ VSF P
Sbjct: 388 FDGADVKLDSSNAFVQVSED-LVCFA-FRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKP 435

BLAST of CmaCh14G016410 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 215.7 bits (548), Expect = 7.1e-56
Identity = 159/438 (36.30%), Postives = 221/438 (50.46%), Query Frame = 0

Query: 8   FTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKDTLDNDLSLSPTL 67
           F+  LIHRDSPLSP Y+  ++ T  + A   RS SR     + +   D       L   L
Sbjct: 26  FSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRRFNHQLSQTD-------LQSGL 85

Query: 68  VHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFLPSKSFTY 127
           +   GE+ MS  IG PP +V   ADT + L WVQC  C  QC  E GP   F   KS TY
Sbjct: 86  IGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC-QQCYKENGPI--FDKKKSSTY 145

Query: 128 EMEPCGSNVCNSLTGFQT-CNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTDGKHVDV 187
           + EPC S  C +L+  +  C+ S+  CKYR  Y D S + G++++++ S D+  G  V  
Sbjct: 146 KSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSF 205

Query: 188 GYLNFGCSEAPLTGGMQSYMGS--VGLNQTPLSLISQLG---IKKFSYCL---------- 247
               FGC      GG     GS  +GL    LSLISQLG    KKFSYCL          
Sbjct: 206 PGTVFGCGYN--NGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGT 265

Query: 248 VPFNLGSTSKMYFGSLPVTSG-GQTPLLYPNSDAYYVKVL-GISVG---------ADDPN 307
              NLG+ S     SL   SG   TPL+      YY   L  ISVG         + +PN
Sbjct: 266 SVINLGTNS--IPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPN 325

Query: 308 LEGVFDVYDVRDGWIIDSGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAA 367
            +G+    +     IIDSGTT + LE   FD+  +         ++  DP+     CF +
Sbjct: 326 DDGILS--ETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKS 385

Query: 368 NANDMETFPGVTVHFDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCH 419
            + ++   P +TVHF GA++ L+  + FVK+ +D ++CL+++ + + V+I GNF   +  
Sbjct: 386 GSAEI-GLPEITVHFTGADVRLSPINAFVKLSED-MVCLSMVPT-TEVAIYGNFAQMDFL 444

BLAST of CmaCh14G016410 vs. TAIR 10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 201.1 bits (510), Expect = 1.8e-51
Identity = 148/432 (34.26%), Postives = 211/432 (48.84%), Query Frame = 0

Query: 9   TARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYLYYNMLSKD-TLDNDLSLSPTL 68
           T  LIHRDSP SP Y+             H    RLN  +   +S+         L   L
Sbjct: 30  TVELIHRDSPHSPLYN-----------PHHTVSDRLNAAFLRSISRSRRFTTKTDLQSGL 89

Query: 69  VHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFLPSKSFTY 128
           +  GGEY MS +IG PPS+V   ADT + L WVQC  C  QC  +  P   F   KS TY
Sbjct: 90  ISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPC-QQCYKQNSPL--FDKKKSSTY 149

Query: 129 EMEPCGSNVCNSLTGFQT-CNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTDGKHVDV 188
           + E C S  C +L+  +  C+ S   CKYR  Y DNS T G++++++ S D++ G  V  
Sbjct: 150 KTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSF 209

Query: 189 GYLNFGCSEAPLTGGMQSYMGS--VGLNQTPLSLISQLGI---KKFSYCL--VPFNLGST 248
               FGC      GG     GS  +GL   PLSL+SQLG    KKFSYCL         T
Sbjct: 210 PGTVFGCGYN--NGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNGT 269

Query: 249 SKMYFGSLPVTSGGQ-------TPLLYPNSDAYYVKVL-GISVGADD-PNLEGVFDV--- 308
           S +  G+  + S          TPL+  + + YY   L  ++VG    P   G + +   
Sbjct: 270 SVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGK 329

Query: 309 YDVRDG-WIIDSGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDME 368
              R G  IIDSGTT + L++  +D             ++  DP+     CF +   ++ 
Sbjct: 330 SSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKSGDKEI- 389

Query: 369 TFPGVTVHFDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLE 419
             P +T+HF  A++ L+  + FVK+ +D  +CL+++ + + V+I GN    +  VGYDLE
Sbjct: 390 GLPAITMHFTNADVKLSPINAFVKLNED-TVCLSMIPT-TEVAIYGNMVQMDFLVGYDLE 442

BLAST of CmaCh14G016410 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 191.0 bits (484), Expect = 1.9e-48
Identity = 139/425 (32.71%), Postives = 211/425 (49.65%), Query Frame = 0

Query: 7   GFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRS-RSRLNYLYYNMLSKDTLDNDLSLSP 66
           GFT  LIHRDSP SPFY+   + +  +   I RS RS L +      S D    + S   
Sbjct: 25  GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQF------SNDDASPN-SPQS 84

Query: 67  TLVHEGGEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFLPSKSF 126
            +    GEYLM+ +IG PP  ++  ADT + LIW QC+ C   C  +  P   F P +S 
Sbjct: 85  FITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPC-EDCYQQTSPL--FDPKESS 144

Query: 127 TYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTDGKHVD 186
           TY    C S+ C +L    +C++ + +C Y + Y DNS T G+++ D+ +  ++  + V 
Sbjct: 145 TYRKVSCSSSQCRALED-ASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVS 204

Query: 187 VGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIK---KFSYCLVPF--NLGSTS 246
           +  +  GC          +  G +GL     SL+SQL      KFSYCLVPF    G TS
Sbjct: 205 LRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTS 264

Query: 247 KMYFGSLPVTSGG---QTPLLYPNSDAYY-VKVLGISVGADDPNLEGVFDVYDVRDG-WI 306
           K+ FG+  + SG     T ++  +   YY + +  ISVG+    ++    ++   +G  +
Sbjct: 265 KINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGS--KKIQFTSTIFGTGEGNIV 324

Query: 307 IDSGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCFAANANDMETF--PGVTV 366
           IDSGTT + L ++ F   L   +      ++ +DP     LC+     D  +F  P +TV
Sbjct: 325 IDSGTTLTLLPSN-FYYELESVVASTIKAERVQDPDGILSLCY----RDSSSFKVPDITV 384

Query: 367 HFDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLEAQVVSFA 419
           HF G ++ L   +TFV + +D + C A   +   ++I GN    N  VGYD  +  VSF 
Sbjct: 385 HFKGGDVKLGNLNTFVAVSED-VSCFA-FAANEQLTIFGNLAQMNFLVGYDTVSGTVSFK 429

BLAST of CmaCh14G016410 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 185.7 bits (470), Expect = 7.9e-47
Identity = 148/431 (34.34%), Postives = 212/431 (49.19%), Query Frame = 0

Query: 7   GFTARLIHRDSPLSPFYDHVMSYTAWIEATIHRSRSRLNYL-YYNMLSKDTLDNDLSLSP 66
           GF   L H DS  +      ++    I+  I+R   RLN L    +L+  +  +D +   
Sbjct: 44  GFRLSLRHVDSGKN------LTKIQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIK 103

Query: 67  TLVHEG-GEYLMSFNIGNPPSQVMGFADTSNGLIWVQCSDCNSQCEPEKGPFTKFLPSKS 126
              H G GE+LM  +IGNP  +     DT + LIW QC  C ++C  +  P   F P KS
Sbjct: 104 APTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPC-TECFDQPTPI--FDPEKS 163

Query: 127 FTYEMEPCGSNVCNSLTGFQTCNSSDRSCKYRLVYEDNSETSGNLSSDSFSFDTTDGKHV 186
            +Y    C S +CN+L     CN    +C+Y   Y D S T G L++++F+F+  +    
Sbjct: 164 SSYSKVGCSSGLCNALPR-SNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN---- 223

Query: 187 DVGYLNFGCSEAPLTGGMQSYMGSVGLNQTPLSLISQLGIKKFSYCLVPF-NLGSTSKMY 246
            +  + FGC       G     G VGL + PLSLISQL   KFSYCL    +  ++S ++
Sbjct: 224 SISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLF 283

Query: 247 FGSLP---VTSGG--------QTPLLYPNSDA---YYVKVLGISVGADDPNLE-GVFDV- 306
            GSL    V   G        +T  L  N D    YY+++ GI+VGA   ++E   F++ 
Sbjct: 284 IGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELA 343

Query: 307 YDVRDGWIIDSGTTYSSLETDAFDRLLAKFITLPDLQQKKEDPRNRFELCF-AANANDME 366
            D   G IIDSGTT + LE  AF  L  +F +   L    +      +LCF   +A    
Sbjct: 344 EDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSL-PVDDSGSTGLDLCFKLPDAAKNI 403

Query: 367 TFPGVTVHFDGAELILNVESTFVKIEDDGIICLALLRSGSPVSILGNFQLQNCHVGYDLE 418
             P +  HF GA+L L  E+  V     G++CLA + S + +SI GN Q QN +V +DLE
Sbjct: 404 AVPKMIFHFKGADLELPGENYMVADSSTGVLCLA-MGSSNGMSIFGNVQQQNFNVLHDLE 458

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q6XBF81.3e-6236.79Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Q3EBM51.0e-5436.30Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561... [more]
Q766C33.5e-4733.65Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q766C21.8e-4334.51Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q9LS401.4e-3530.42Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Match NameE-valueIdentityDescription
A0A6J1IXB11.8e-248100.00aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111479389 PE=3 S... [more]
A0A6J1GWK97.3e-23495.00aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111457819 PE=3... [more]
A0A5D3CXD41.2e-20484.20Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sc... [more]
A0A1S3AZA61.2e-20484.20aspartic proteinase CDR1-like OS=Cucumis melo OX=3656 GN=LOC103484191 PE=3 SV=1[more]
A0A0A0L7U33.0e-20383.96Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G15152... [more]
Match NameE-valueIdentityDescription
XP_022979793.13.7e-248100.00aspartic proteinase CDR1-like [Cucurbita maxima][more]
XP_023528351.11.1e-23997.14aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo][more]
KAG6582237.12.8e-23595.71Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG7018636.13.0e-23495.48Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022955985.11.5e-23395.00aspartic proteinase CDR1-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
AT5G33340.19.3e-6436.79Eukaryotic aspartyl protease family protein [more]
AT2G35615.17.1e-5636.30Eukaryotic aspartyl protease family protein [more]
AT1G31450.11.8e-5134.26Eukaryotic aspartyl protease family protein [more]
AT1G64830.11.9e-4832.71Eukaryotic aspartyl protease family protein [more]
AT2G03200.17.9e-4734.34Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 74..246
e-value: 4.0E-37
score: 128.1
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 264..413
e-value: 2.7E-25
score: 89.0
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 250..420
e-value: 1.5E-37
score: 130.8
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 50..245
e-value: 5.8E-35
score: 123.0
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 67..417
NoneNo IPR availablePANTHERPTHR47967:SF66ASPARTIC PROTEINASE CDR1-RELATEDcoord: 5..418
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 5..418
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 74..413
score: 28.892973
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 73..417
e-value: 1.89397E-71
score: 223.679

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh14G016410.1CmaCh14G016410.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005576 extracellular region