CmoCh08G008920 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh08G008920
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
Descriptionaspartic proteinase CDR1-like
LocationCmo_Chr08: 5779017 .. 5780336 (-)
RNA-Seq ExpressionCmoCh08G008920
SyntenyCmoCh08G008920
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCTCTCAATTCTATGTTCTTCTACCTTTTCCTCTTCTCCTCTCTAGCAACCACCAATGGTGGCGGTGGGGATGGTTTCACCACCTCTCTCTTCCACCGCGATACCCTTTTCCCTACTCTCCACCTTCCATCTCCCTCCCGCTACGAACGCCTCACTAATGCCTTCCGCCGCTCCTTCTCTCGTTACGCCACCCTCCTCGACCATTCCACGACCGTCTCCACCACAGGCATCCACACCCCACTCATTCCCAACAATGAAGATGGAGAGTATGTAATGTCCATCTCCATGGGAACCCCACCGATCAAATTCTTCGCCATTGCGGACACTGGAAGCGACCTAACGTGGACCCAATGCATGCCATGCATCAATTGCTTCAACCAATCAACTCCTCTTTTCAATCCACGTAATTCCTCTTCCTACCGCCCCGTACCATGCACGTCCACTACATGCCGCTCCATCAGCATTTCCCCTTGCATGCCCCACCATCCATCTTGCACTTACAACTACAACTACGCAGACCAATCCTTCACTAACGGTGACCTTGCATTTGAAAAGCTCACCATTGGGTCCTTCAAACTCCACAACACAATCATTGGATGCGGCCACCAGAACGGTGGCATCTTCAAAGGACGTACCTCAGGAATCATTGGACTCGGCACGGGTTCTATCTCTTTAGTCTCTCAAATGAGAAAGATTGCCAATCTCAAAAGACGCTTCTCATATTGCTTACCCACCTTCTTCAGTGATTCAAATGTTACCGGTAGAATAAACTTCGGCCGAAACGCGGTCGTTTCGGGGCGTAAAGGGCATAAAGTCGTTTCTACCCCTCTAGTGTTGAAATTTCCTTCTCCCTTCTATTTCTTGACACTTGAAGCAATCTCTATTGCAAACAAGCGATTGGAAGCTGCTAACGTGTCGAGTGCTTTAGAACGAGGAAATATTATTATTGATTCCGGTACAACATTGACGTTTTTGCCTCGAAATTTGTACAATGCCGTCATTTCAACTTTGGCTAGTGTTGTTCGAGCAAAACGAGTGGAAGATCCATTGGGGATTCTTGAATTGTGCTACGTTGCTGCCAAAGTTGATGATTTGGATATTCCAATCATTACGACACATTTTTCTGGTGGTGCTGCTGTGAAGTTGTTACCGTTAAACACGTTTGTGACCGTGGCTGATAATGTGACTTGTTTGAGTTTTAAGTCATCGTCTGATGATGATTTGAACATTTTTGGGAACTTGGCACAAGTCAACTTTTTGATTGGATATGATCTAGAGCGAAAGAGATTGTCGTTCAAACATAAAGTCTGTGCTTAG

mRNA sequence

ATGCCTCTCAATTCTATGTTCTTCTACCTTTTCCTCTTCTCCTCTCTAGCAACCACCAATGGTGGCGGTGGGGATGGTTTCACCACCTCTCTCTTCCACCGCGATACCCTTTTCCCTACTCTCCACCTTCCATCTCCCTCCCGCTACGAACGCCTCACTAATGCCTTCCGCCGCTCCTTCTCTCGTTACGCCACCCTCCTCGACCATTCCACGACCGTCTCCACCACAGGCATCCACACCCCACTCATTCCCAACAATGAAGATGGAGAGTATGTAATGTCCATCTCCATGGGAACCCCACCGATCAAATTCTTCGCCATTGCGGACACTGGAAGCGACCTAACGTGGACCCAATGCATGCCATGCATCAATTGCTTCAACCAATCAACTCCTCTTTTCAATCCACGTAATTCCTCTTCCTACCGCCCCGTACCATGCACGTCCACTACATGCCGCTCCATCAGCATTTCCCCTTGCATGCCCCACCATCCATCTTGCACTTACAACTACAACTACGCAGACCAATCCTTCACTAACGGTGACCTTGCATTTGAAAAGCTCACCATTGGGTCCTTCAAACTCCACAACACAATCATTGGATGCGGCCACCAGAACGGTGGCATCTTCAAAGGACGTACCTCAGGAATCATTGGACTCGGCACGGGTTCTATCTCTTTAGTCTCTCAAATGAGAAAGATTGCCAATCTCAAAAGACGCTTCTCATATTGCTTACCCACCTTCTTCAGTGATTCAAATGTTACCGGTAGAATAAACTTCGGCCGAAACGCGGTCGTTTCGGGGCGTAAAGGGCATAAAGTCGTTTCTACCCCTCTAGTGTTGAAATTTCCTTCTCCCTTCTATTTCTTGACACTTGAAGCAATCTCTATTGCAAACAAGCGATTGGAAGCTGCTAACGTGTCGAGTGCTTTAGAACGAGGAAATATTATTATTGATTCCGGTACAACATTGACGTTTTTGCCTCGAAATTTGTACAATGCCGTCATTTCAACTTTGGCTAGTGTTGTTCGAGCAAAACGAGTGGAAGATCCATTGGGGATTCTTGAATTGTGCTACGTTGCTGCCAAAGTTGATGATTTGGATATTCCAATCATTACGACACATTTTTCTGGTGGTGCTGCTGTGAAGTTGTTACCGTTAAACACGTTTGTGACCGTGGCTGATAATGTGACTTGTTTGAGTTTTAAGTCATCGTCTGATGATGATTTGAACATTTTTGGGAACTTGGCACAAGTCAACTTTTTGATTGGATATGATCTAGAGCGAAAGAGATTGTCGTTCAAACATAAAGTCTGTGCTTAG

Coding sequence (CDS)

ATGCCTCTCAATTCTATGTTCTTCTACCTTTTCCTCTTCTCCTCTCTAGCAACCACCAATGGTGGCGGTGGGGATGGTTTCACCACCTCTCTCTTCCACCGCGATACCCTTTTCCCTACTCTCCACCTTCCATCTCCCTCCCGCTACGAACGCCTCACTAATGCCTTCCGCCGCTCCTTCTCTCGTTACGCCACCCTCCTCGACCATTCCACGACCGTCTCCACCACAGGCATCCACACCCCACTCATTCCCAACAATGAAGATGGAGAGTATGTAATGTCCATCTCCATGGGAACCCCACCGATCAAATTCTTCGCCATTGCGGACACTGGAAGCGACCTAACGTGGACCCAATGCATGCCATGCATCAATTGCTTCAACCAATCAACTCCTCTTTTCAATCCACGTAATTCCTCTTCCTACCGCCCCGTACCATGCACGTCCACTACATGCCGCTCCATCAGCATTTCCCCTTGCATGCCCCACCATCCATCTTGCACTTACAACTACAACTACGCAGACCAATCCTTCACTAACGGTGACCTTGCATTTGAAAAGCTCACCATTGGGTCCTTCAAACTCCACAACACAATCATTGGATGCGGCCACCAGAACGGTGGCATCTTCAAAGGACGTACCTCAGGAATCATTGGACTCGGCACGGGTTCTATCTCTTTAGTCTCTCAAATGAGAAAGATTGCCAATCTCAAAAGACGCTTCTCATATTGCTTACCCACCTTCTTCAGTGATTCAAATGTTACCGGTAGAATAAACTTCGGCCGAAACGCGGTCGTTTCGGGGCGTAAAGGGCATAAAGTCGTTTCTACCCCTCTAGTGTTGAAATTTCCTTCTCCCTTCTATTTCTTGACACTTGAAGCAATCTCTATTGCAAACAAGCGATTGGAAGCTGCTAACGTGTCGAGTGCTTTAGAACGAGGAAATATTATTATTGATTCCGGTACAACATTGACGTTTTTGCCTCGAAATTTGTACAATGCCGTCATTTCAACTTTGGCTAGTGTTGTTCGAGCAAAACGAGTGGAAGATCCATTGGGGATTCTTGAATTGTGCTACGTTGCTGCCAAAGTTGATGATTTGGATATTCCAATCATTACGACACATTTTTCTGGTGGTGCTGCTGTGAAGTTGTTACCGTTAAACACGTTTGTGACCGTGGCTGATAATGTGACTTGTTTGAGTTTTAAGTCATCGTCTGATGATGATTTGAACATTTTTGGGAACTTGGCACAAGTCAACTTTTTGATTGGATATGATCTAGAGCGAAAGAGATTGTCGTTCAAACATAAAGTCTGTGCTTAG

Protein sequence

MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSFSRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCMPCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNGDLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRFSYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKRLEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVAAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNFLIGYDLERKRLSFKHKVCA
Homology
BLAST of CmoCh08G008920 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 1.9e-88
Identity = 189/446 (42.38%), Postives = 268/446 (60.09%), Query Frame = 0

Query: 4   NSMFFYLFLFSSLATTNGGGGD--GFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSFS 63
           +S+   L L SSL  +N       GFT  L HRD+     + P  +  +RL NA  RS +
Sbjct: 6   SSVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVN 65

Query: 64  RYATLLDHSTTVSTTGIHTPLIP-NNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 123
           R     +   T        P I   +  GEY+M++S+GTPP    AIADTGSDL WTQC 
Sbjct: 66  RVFHFTEKDNTPQ------PQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCA 125

Query: 124 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSI-SISPCMPHHPSCTYNYNYADQSFTN 183
           PC +C+ Q  PLF+P+ SS+Y+ V C+S+ C ++ + + C  +  +C+Y+ +Y D S+T 
Sbjct: 126 PCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTK 185

Query: 184 GDLAFEKLTIGS-----FKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIA 243
           G++A + LT+GS      +L N IIGCGH N G F  + SGI+GLG G +SL+ Q+    
Sbjct: 186 GNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD-- 245

Query: 244 NLKRRFSYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLK-FPSPFYFLTLEA 303
           ++  +FSYCL    S  + T +INFG NA+VS   G  VVSTPL+ K     FY+LTL++
Sbjct: 246 SIDGKFSYCLVPLTSKKDQTSKINFGTNAIVS---GSGVVSTPLIAKASQETFYYLTLKS 305

Query: 304 ISIANKRLEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGI 363
           IS+ +K+++ +   S    GNIIIDSGTTLT LP   Y+ +   +AS + A++ +DP   
Sbjct: 306 ISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSG 365

Query: 364 LELCYVAAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFG 423
           L LCY A    DL +P+IT HF  GA VKL   N FV V++++ C +F+ S     +I+G
Sbjct: 366 LSLCYSA--TGDLKVPVITMHFD-GADVKLDSSNAFVQVSEDLVCFAFRGS--PSFSIYG 425

Query: 424 NLAQVNFLIGYDLERKRLSFKHKVCA 440
           N+AQ+NFL+GYD   K +SFK   CA
Sbjct: 426 NVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CmoCh08G008920 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 317.0 bits (811), Expect = 3.3e-85
Identity = 187/446 (41.93%), Postives = 271/446 (60.76%), Query Frame = 0

Query: 11  FLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSFSRYATLLDHS 70
           FLF S+  ++ G    F+  L HRD+    ++ P  +  +RL  AF RS SR +   +H 
Sbjct: 10  FLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSR-SRRFNHQ 69

Query: 71  TTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCMPCINCFNQST 130
             +S T + + LI    DGE+ MSI++GTPPIK FAIADTGSDLTW QC PC  C+ ++ 
Sbjct: 70  --LSQTDLQSGLI--GADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENG 129

Query: 131 PLFNPRNSSSYRPVPCTSTTCRSISISP--CMPHHPSCTYNYNYADQSFTNGDLAFEKLT 190
           P+F+ + SS+Y+  PC S  C+++S +   C   +  C Y Y+Y DQSF+ GD+A E ++
Sbjct: 130 PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVS 189

Query: 191 IGS-----FKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRFSYC 250
           I S          T+ GCG+ NGG F    SGIIGLG G +SL+SQ+   +++ ++FSYC
Sbjct: 190 IDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG--SSISKKFSYC 249

Query: 251 LPTFFSDSNVTGRINFGRNAVVSG-RKGHKVVSTPLVLKFPSPFYFLTLEAISIANKRLE 310
           L    + +N T  IN G N++ S   K   VVSTPLV K P  +Y+LTLEAIS+  K++ 
Sbjct: 250 LSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIP 309

Query: 311 AANVS--------SALERGNIIIDSGTTLTFLPRNLYNAVISTL-ASVVRAKRVEDPLGI 370
               S         +   GNIIIDSGTTLT L    ++   S +  SV  AKRV DP G+
Sbjct: 310 YTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGL 369

Query: 371 LELCYVAAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFG 430
           L  C+ +    ++ +P IT HF+ GA V+L P+N FV +++++ CLS   ++  ++ I+G
Sbjct: 370 LSHCFKSGSA-EIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTT--EVAIYG 429

Query: 431 NLAQVNFLIGYDLERKRLSFKHKVCA 440
           N AQ++FL+GYDLE + +SF+H  C+
Sbjct: 430 NFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of CmoCh08G008920 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 229.2 bits (583), Expect = 9.2e-59
Identity = 139/397 (35.01%), Postives = 214/397 (53.90%), Query Frame = 0

Query: 47  SRYERLTNAFRRSFSRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFA 106
           ++YE +  A +R   R  ++  ++   S++GI TP+     DGEY+M++++GTP   F A
Sbjct: 56  TKYELIKRAIKRGERRMRSI--NAMLQSSSGIETPVYAG--DGEYLMNVAIGTPDSSFSA 115

Query: 107 IADTGSDLTWTQCMPCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSC 166
           I DTGSDL WTQC PC  CF+Q TP+FNP++SSS+  +PC S  C+ +    C  ++  C
Sbjct: 116 IMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETC--NNNEC 175

Query: 167 TYNYNYADQSFTNGDLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISL 226
            Y Y Y D S T G +A E  T  +  + N   GCG  N G  +G  +G+IG+G G +SL
Sbjct: 176 QYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSL 235

Query: 227 VSQMRKIANLKRRFSYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSP- 286
            SQ+        +FSYC+ ++ S S  T  +    + V  G       ST L+    +P 
Sbjct: 236 PSQLG-----VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSP-----STTLIHSSLNPT 295

Query: 287 FYFLTLEAISIANKRLEAANVSSALE---RGNIIIDSGTTLTFLPRNLYNAVISTLASVV 346
           +Y++TL+ I++    L   + +  L+    G +IIDSGTTLT+LP++ YNAV       +
Sbjct: 296 YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI 355

Query: 347 RAKRVEDPLGILELCY-VAAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSF 406
               V++    L  C+   +    + +P I+  F GG  + L   N  ++ A+ V CL+ 
Sbjct: 356 NLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGG-VLNLGEQNILISPAEGVICLAM 415

Query: 407 KSSSDDDLNIFGNLAQVNFLIGYDLERKRLSFKHKVC 439
            SSS   ++IFGN+ Q    + YDL+   +SF    C
Sbjct: 416 GSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CmoCh08G008920 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 228.8 bits (582), Expect = 1.2e-58
Identity = 140/397 (35.26%), Postives = 207/397 (52.14%), Query Frame = 0

Query: 47  SRYERLTNAFRRSFSRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFA 106
           ++++ L  A  R   R   L   +     +G+ T +     DGEY+M++S+GTP   F A
Sbjct: 55  TKFQLLERAIERGSRRLQRL--EAMLNGPSGVETSVYAG--DGEYLMNLSIGTPAQPFSA 114

Query: 107 IADTGSDLTWTQCMPCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSC 166
           I DTGSDL WTQC PC  CFNQSTP+FNP+ SSS+  +PC+S  C+++S   C  +   C
Sbjct: 115 IMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNF--C 174

Query: 167 TYNYNYADQSFTNGDLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISL 226
            Y Y Y D S T G +  E LT GS  + N   GCG  N G  +G  +G++G+G G +SL
Sbjct: 175 QYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSL 234

Query: 227 VSQMRKIANLKRRFSYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPF 286
            SQ+        +FSYC+    S +     +    N+V +G     ++ +  +      F
Sbjct: 235 PSQLD-----VTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQI----PTF 294

Query: 287 YFLTLEAISIANKRL----EAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVV 346
           Y++TL  +S+ + RL     A  ++S    G IIIDSGTTLT+   N Y +V     S +
Sbjct: 295 YYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQI 354

Query: 347 RAKRVEDPLGILELCY-VAAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSF 406
               V       +LC+   +   +L IP    HF GG  ++L   N F++ ++ + CL+ 
Sbjct: 355 NLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELPSENYFISPSNGLICLAM 414

Query: 407 KSSSDDDLNIFGNLAQVNFLIGYDLERKRLSFKHKVC 439
            SSS   ++IFGN+ Q N L+ YD     +SF    C
Sbjct: 415 GSSS-QGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmoCh08G008920 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 8.6e-49
Identity = 123/379 (32.45%), Postives = 197/379 (51.98%), Query Frame = 0

Query: 68  DHSTTVSTTGIHTPLI--PNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCMPCINC 127
           +  T   T  + TP++   +   GEY   I +GTP  + + + DTGSD+ W QC PC +C
Sbjct: 137 NEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADC 196

Query: 128 FNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNGDLAFE 187
           + QS P+FNP +SS+Y+ + C++  C  +  S C  +   C Y  +Y D SFT G+LA +
Sbjct: 197 YQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSN--KCLYQVSYGDGSFTVGELATD 256

Query: 188 KLTIG-SFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRFSYCL 247
            +T G S K++N  +GCGH N G+F G  +G++GLG G +S+ +QM+  +     FSYCL
Sbjct: 257 TVTFGNSGKINNVALGCGHDNEGLFTG-AAGLLGLGGGVLSITNQMKATS-----FSYCL 316

Query: 248 PTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKRL--- 307
                DS  +  ++F  N+V  G  G    +  L  K    FY++ L   S+  +++   
Sbjct: 317 VD--RDSGKSSSLDF--NSVQLG--GGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLP 376

Query: 308 EAANVSSALERGNIIIDSGTTLTFLPRNLYNAVIST-LASVVRAKRVEDPLGILELCYVA 367
           +A     A   G +I+D GT +T L    YN++    L   V  K+    + + + CY  
Sbjct: 377 DAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDF 436

Query: 368 AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVT-CLSFKSSSDDDLNIFGNLAQVN 427
           + +  + +P +  HF+GG ++ L   N  + V D+ T C +F  +S   L+I GN+ Q  
Sbjct: 437 SSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTS-SSLSIIGNVQQQG 496

Query: 428 FLIGYDLERKRLSFKHKVC 439
             I YDL +  +      C
Sbjct: 497 TRITYDLSKNVIGLSGNKC 500

BLAST of CmoCh08G008920 vs. ExPASy TrEMBL
Match: A0A6J1HHU3 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111464208 PE=3 SV=1)

HSP 1 Score: 884.0 bits (2283), Expect = 2.5e-253
Identity = 439/439 (100.00%), Postives = 439/439 (100.00%), Query Frame = 0

Query: 1   MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 60
           MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF
Sbjct: 1   MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 60

Query: 61  SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 120
           SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM
Sbjct: 61  SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 120

Query: 121 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 180
           PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG
Sbjct: 121 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 180

Query: 181 DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240
           DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF
Sbjct: 181 DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240

Query: 241 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR 300
           SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR
Sbjct: 241 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR 300

Query: 301 LEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVA 360
           LEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVA
Sbjct: 301 LEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVA 360

Query: 361 AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNF 420
           AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNF
Sbjct: 361 AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNF 420

Query: 421 LIGYDLERKRLSFKHKVCA 440
           LIGYDLERKRLSFKHKVCA
Sbjct: 421 LIGYDLERKRLSFKHKVCA 439

BLAST of CmoCh08G008920 vs. ExPASy TrEMBL
Match: A0A6J1KKI8 (aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111494873 PE=3 SV=1)

HSP 1 Score: 786.9 bits (2031), Expect = 4.2e-224
Identity = 401/440 (91.14%), Postives = 410/440 (93.18%), Query Frame = 0

Query: 1   MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 60
           MPL S+F +LFLFSSLATTNGG GDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF
Sbjct: 1   MPLISIFVHLFLFSSLATTNGGRGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 60

Query: 61  SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 120
           SRYATLLDHSTTVSTTGIHTPLIPNNE+GEYVMS            IADTGSDLTWTQCM
Sbjct: 61  SRYATLLDHSTTVSTTGIHTPLIPNNEEGEYVMS------------IADTGSDLTWTQCM 120

Query: 121 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 180
           PC+NCFNQSTPLFNPRNSSSYRPVPCTST CRSISI PCMPHHPSCTYNYNYADQSFTNG
Sbjct: 121 PCLNCFNQSTPLFNPRNSSSYRPVPCTSTACRSISIFPCMPHHPSCTYNYNYADQSFTNG 180

Query: 181 DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240
           DLAFEKLTIGSFKLHNTIIGCGHQNGG FKGRTSGIIGLGTGSISLVSQMRKIANLKRRF
Sbjct: 181 DLAFEKLTIGSFKLHNTIIGCGHQNGGAFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240

Query: 241 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR 300
           SYCLPTFFSDSNVTGRINFGR AVVSGR   KV+STPL+ KFPSPF+FLTLEAISIANKR
Sbjct: 241 SYCLPTFFSDSNVTGRINFGRKAVVSGR---KVISTPLMSKFPSPFHFLTLEAISIANKR 300

Query: 301 LEAA-NVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYV 360
           LEA  NVSSALERGNIIIDSGTTLTFLP+ LYNAVISTLASVVRAKRVEDP GILELCYV
Sbjct: 301 LEATNNVSSALERGNIIIDSGTTLTFLPQYLYNAVISTLASVVRAKRVEDPSGILELCYV 360

Query: 361 AAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVN 420
           AAKVDDLDIPIIT HFSGGAAVKLLPLNTFVTVADNVTCLSFKSS DD+LNIFGNLAQVN
Sbjct: 361 AAKVDDLDIPIITAHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSFDDNLNIFGNLAQVN 420

Query: 421 FLIGYDLERKRLSFKHKVCA 440
            LIGYDLERKRLSFKHKVCA
Sbjct: 421 ILIGYDLERKRLSFKHKVCA 425

BLAST of CmoCh08G008920 vs. ExPASy TrEMBL
Match: A0A6J1FP39 (aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111447216 PE=3 SV=1)

HSP 1 Score: 573.2 bits (1476), Expect = 9.5e-160
Identity = 297/440 (67.50%), Postives = 350/440 (79.55%), Query Frame = 0

Query: 5   SMFFYLFLFS-SLATTN---GGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 64
           S+FF LFL S S AT +   GGGG GFTTSLFHRD+    L+ PS S Y+RLTNAFRRSF
Sbjct: 5   SIFFCLFLISFSQATAHGGVGGGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAFRRSF 64

Query: 65  SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 124
           SR  TLL+ +  VS TGIH+ +IP  +DGE++MSIS+GTP +K  AIADTGSDLTWTQCM
Sbjct: 65  SRSDTLLNRAAAVSITGIHSRIIP--DDGEFLMSISIGTPRVKIMAIADTGSDLTWTQCM 124

Query: 125 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 184
           PC  CFNQS P+FNPR S SYR V CTS  CRS+    C P + +C+Y Y+Y DQSFT G
Sbjct: 125 PCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSFTYG 184

Query: 185 DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 244
           DLA EK+TIGSFKL+ T+IGCGH NGG F   TSGIIGLG G +SL+SQMRKIA +KRRF
Sbjct: 185 DLASEKITIGSFKLYKTLIGCGHVNGGTFSEDTSGIIGLGGGPLSLISQMRKIAAVKRRF 244

Query: 245 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR 304
           SYCLPTFFSD NVTG+I+FG+ A+VSGR   KVVSTPLVLK P+ FY+LTLEA+S+ANKR
Sbjct: 245 SYCLPTFFSDKNVTGKISFGKKAIVSGR---KVVSTPLVLKEPNTFYYLTLEAMSVANKR 304

Query: 305 LEAA-NVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYV 364
            +AA N+S+A+E+GNI+IDSGTTLT LP+NLY  V STLA VV+AKRV DP G+L+LC+ 
Sbjct: 305 FKAANNMSTAVEQGNILIDSGTTLTILPQNLYKGVASTLAHVVKAKRVNDPTGVLDLCFA 364

Query: 365 AAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVN 424
           A  VD L+IP+IT HF+G A VKLLPLNTF  VADNV CL+F  S+  +  IFGNLAQVN
Sbjct: 365 ACSVDHLNIPVITAHFAGNADVKLLPLNTFAMVADNVACLAFVPSA--NFAIFGNLAQVN 424

Query: 425 FLIGYDLERKRLSFKHKVCA 440
           FL+GYDLERKRLSFK+ VCA
Sbjct: 425 FLVGYDLERKRLSFKYNVCA 437

BLAST of CmoCh08G008920 vs. ExPASy TrEMBL
Match: A0A6J1J858 (aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111483445 PE=3 SV=1)

HSP 1 Score: 555.1 bits (1429), Expect = 2.7e-154
Identity = 283/436 (64.91%), Postives = 336/436 (77.06%), Query Frame = 0

Query: 5   SMFFYLFLFS-SLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSFSRY 64
           S+FFY FLFS S++ T  GGG+GFTTS+ HRD+L   LH PS S YERLT AF RSFSR 
Sbjct: 5   SIFFYFFLFSFSVSATGRGGGNGFTTSIIHRDSLLSPLHNPSVSHYERLTGAFNRSFSRS 64

Query: 65  ATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCMPCI 124
            TL + + TVST G+H+PLIP  + GE+++S+S+GTPP+ F AIADTGSDLTWTQC+PC+
Sbjct: 65  TTLTNRAATVSTGGVHSPLIP--DSGEFLISLSIGTPPVDFTAIADTGSDLTWTQCLPCV 124

Query: 125 NCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNGDLA 184
            CFNQS P+FNP  SSSY  V CTS TC SI    C P   +CTY Y+Y DQSFT+GDLA
Sbjct: 125 KCFNQSNPIFNPHRSSSYSNVSCTSDTCNSIVSHRCGPDLKTCTYGYSYGDQSFTHGDLA 184

Query: 185 FEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRFSYC 244
            EK+TIGSFKL+  +IGCGH+NGG F G TSGI+GLG G +SLVSQ+  IA +KRRFSYC
Sbjct: 185 SEKITIGSFKLNKVVIGCGHENGGTFLGETSGIVGLGGGPLSLVSQLNTIAAVKRRFSYC 244

Query: 245 LPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKRLEA 304
           LPTFFSD+N+TG+I+FG  A VSGR   KVVSTPLV K P  +YFLTLEA+SIANKR E 
Sbjct: 245 LPTFFSDANITGKISFGEEAAVSGR---KVVSTPLVQKHPDTYYFLTLEAVSIANKRFEV 304

Query: 305 A-NVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVAAK 364
           A N+SSA+  GNIIIDSGTTLT LP NLY+ V+STLA VV+AK+V DP GILELCY  + 
Sbjct: 305 ANNMSSAVVEGNIIIDSGTTLTLLPPNLYDGVVSTLAKVVKAKQVNDPTGILELCYGVSS 364

Query: 365 VDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNFLI 424
           VDDL+IPIIT HF GGAAV+L   NTF  V ++V CL+   +      IFGNLAQVNFL+
Sbjct: 365 VDDLNIPIITAHFVGGAAVELQSENTFALVNEDVACLTLAPAM--KFAIFGNLAQVNFLV 424

Query: 425 GYDLERKRLSFKHKVC 439
           GYDL+RK +SFK  VC
Sbjct: 425 GYDLDRKTVSFKRTVC 433

BLAST of CmoCh08G008920 vs. ExPASy TrEMBL
Match: A0A0A0KZZ3 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G055410 PE=3 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 6.0e-154
Identity = 285/438 (65.07%), Postives = 339/438 (77.40%), Query Frame = 0

Query: 3   LNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSFSR 62
           ++  F++L  FSS  T +GGG  GFTTSLF RD+    LH PS SRY+ L +AFRRSFSR
Sbjct: 4   ISIFFYFLLFFSSKVTAHGGGHHGFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSR 63

Query: 63  YATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCMPC 122
            ATLL H T+VST  I +P+IP  + GE++MSI +GTPP+   AIADTGSDLTWTQC+PC
Sbjct: 64  SATLLTHLTSVSTACIRSPIIP--DSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPC 123

Query: 123 INCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNGDL 182
             CFNQS P+FNPR SSSYR V C S TCRS+    C P   SC+Y Y+Y D+SFT GDL
Sbjct: 124 RECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDL 183

Query: 183 AFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRFSY 242
           A +++TIGSFKL  T+IGCGHQNGG F G TSGIIGLG GS+SLVSQMR IA +K RFSY
Sbjct: 184 ASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSY 243

Query: 243 CLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKRLE 302
           CLPTFFS++N+TG I+FGR AVVSGR   +VVSTPLV + P  FYFLTLEAIS+  KR +
Sbjct: 244 CLPTFFSNANITGTISFGRKAVVSGR---QVVSTPLVPRSPDTFYFLTLEAISVGKKRFK 303

Query: 303 AAN-VSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVAA 362
           AAN +S+    GNIIIDSGTTLT LPR+LY  V STLA V++AKRV+DP GILELCY A 
Sbjct: 304 AANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAG 363

Query: 363 KVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNFL 422
           +VDDL+IPIIT HF+GGA VKLLP+NTF  VADNVTCL+F  ++   + IFGNLAQ+NF 
Sbjct: 364 QVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPAT--QVAIFGNLAQINFE 423

Query: 423 IGYDLERKRLSFKHKVCA 440
           +GYDL  KRLSF+ K+CA
Sbjct: 424 VGYDLGNKRLSFEPKLCA 434

BLAST of CmoCh08G008920 vs. NCBI nr
Match: XP_022964071.1 (aspartic proteinase CDR1-like [Cucurbita moschata])

HSP 1 Score: 884.0 bits (2283), Expect = 5.2e-253
Identity = 439/439 (100.00%), Postives = 439/439 (100.00%), Query Frame = 0

Query: 1   MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 60
           MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF
Sbjct: 1   MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 60

Query: 61  SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 120
           SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM
Sbjct: 61  SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 120

Query: 121 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 180
           PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG
Sbjct: 121 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 180

Query: 181 DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240
           DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF
Sbjct: 181 DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240

Query: 241 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR 300
           SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR
Sbjct: 241 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR 300

Query: 301 LEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVA 360
           LEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVA
Sbjct: 301 LEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVA 360

Query: 361 AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNF 420
           AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNF
Sbjct: 361 AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNF 420

Query: 421 LIGYDLERKRLSFKHKVCA 440
           LIGYDLERKRLSFKHKVCA
Sbjct: 421 LIGYDLERKRLSFKHKVCA 439

BLAST of CmoCh08G008920 vs. NCBI nr
Match: KAG6593678.1 (Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 863.6 bits (2230), Expect = 7.3e-247
Identity = 430/439 (97.95%), Postives = 433/439 (98.63%), Query Frame = 0

Query: 1   MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 60
           MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRY+RLTNAFRRSF
Sbjct: 1   MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYQRLTNAFRRSF 60

Query: 61  SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 120
           SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM
Sbjct: 61  SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 120

Query: 121 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 180
           PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG
Sbjct: 121 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 180

Query: 181 DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240
           DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF
Sbjct: 181 DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240

Query: 241 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR 300
           SYCLPTFFSDSNVTGRINFGRNAVVSGR   KVVSTPLVLKFPSPFYFLTLEAISIANKR
Sbjct: 241 SYCLPTFFSDSNVTGRINFGRNAVVSGR---KVVSTPLVLKFPSPFYFLTLEAISIANKR 300

Query: 301 LEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVA 360
           LEA NVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRV+DPLGILELCYVA
Sbjct: 301 LEAVNVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVKDPLGILELCYVA 360

Query: 361 AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNF 420
           AKVDDLDIPIITTHFSGGA VKLLPLNTFVT+ DNVTCLSFKSSSDDDLNIFGNLAQVNF
Sbjct: 361 AKVDDLDIPIITTHFSGGATVKLLPLNTFVTITDNVTCLSFKSSSDDDLNIFGNLAQVNF 420

Query: 421 LIGYDLERKRLSFKHKVCA 440
           LIGYDLERKRLSFKHKVCA
Sbjct: 421 LIGYDLERKRLSFKHKVCA 436

BLAST of CmoCh08G008920 vs. NCBI nr
Match: XP_023514483.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 857.8 bits (2215), Expect = 4.0e-245
Identity = 427/439 (97.27%), Postives = 431/439 (98.18%), Query Frame = 0

Query: 1   MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 60
           MPL SMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRY+RLTNAFRRSF
Sbjct: 1   MPLISMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYDRLTNAFRRSF 60

Query: 61  SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 120
           SRYATLL H TTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM
Sbjct: 61  SRYATLLHHFTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 120

Query: 121 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 180
           PCINCFNQSTPLFNPRNS+SYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG
Sbjct: 121 PCINCFNQSTPLFNPRNSTSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 180

Query: 181 DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240
           DLAFEKLTIGSFKLHNTIIGCGH+NGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF
Sbjct: 181 DLAFEKLTIGSFKLHNTIIGCGHKNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240

Query: 241 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR 300
           SYCLPTFFSDSNVTGRINFGRNAVVSGRKG KVVSTPLVLKFPSPFYFLTLEAISIANKR
Sbjct: 241 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGRKVVSTPLVLKFPSPFYFLTLEAISIANKR 300

Query: 301 LEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVA 360
           LEA NVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDP GILELCYVA
Sbjct: 301 LEAVNVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPSGILELCYVA 360

Query: 361 AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNF 420
           AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDL+IFGNLAQVNF
Sbjct: 361 AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLSIFGNLAQVNF 420

Query: 421 LIGYDLERKRLSFKHKVCA 440
           LIGYDLE KRL FKHKVCA
Sbjct: 421 LIGYDLEGKRLFFKHKVCA 439

BLAST of CmoCh08G008920 vs. NCBI nr
Match: XP_023000629.1 (aspartic proteinase CDR1-like [Cucurbita maxima])

HSP 1 Score: 786.9 bits (2031), Expect = 8.7e-224
Identity = 401/440 (91.14%), Postives = 410/440 (93.18%), Query Frame = 0

Query: 1   MPLNSMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 60
           MPL S+F +LFLFSSLATTNGG GDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF
Sbjct: 1   MPLISIFVHLFLFSSLATTNGGRGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 60

Query: 61  SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 120
           SRYATLLDHSTTVSTTGIHTPLIPNNE+GEYVMS            IADTGSDLTWTQCM
Sbjct: 61  SRYATLLDHSTTVSTTGIHTPLIPNNEEGEYVMS------------IADTGSDLTWTQCM 120

Query: 121 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 180
           PC+NCFNQSTPLFNPRNSSSYRPVPCTST CRSISI PCMPHHPSCTYNYNYADQSFTNG
Sbjct: 121 PCLNCFNQSTPLFNPRNSSSYRPVPCTSTACRSISIFPCMPHHPSCTYNYNYADQSFTNG 180

Query: 181 DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240
           DLAFEKLTIGSFKLHNTIIGCGHQNGG FKGRTSGIIGLGTGSISLVSQMRKIANLKRRF
Sbjct: 181 DLAFEKLTIGSFKLHNTIIGCGHQNGGAFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 240

Query: 241 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR 300
           SYCLPTFFSDSNVTGRINFGR AVVSGR   KV+STPL+ KFPSPF+FLTLEAISIANKR
Sbjct: 241 SYCLPTFFSDSNVTGRINFGRKAVVSGR---KVISTPLMSKFPSPFHFLTLEAISIANKR 300

Query: 301 LEAA-NVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYV 360
           LEA  NVSSALERGNIIIDSGTTLTFLP+ LYNAVISTLASVVRAKRVEDP GILELCYV
Sbjct: 301 LEATNNVSSALERGNIIIDSGTTLTFLPQYLYNAVISTLASVVRAKRVEDPSGILELCYV 360

Query: 361 AAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVN 420
           AAKVDDLDIPIIT HFSGGAAVKLLPLNTFVTVADNVTCLSFKSS DD+LNIFGNLAQVN
Sbjct: 361 AAKVDDLDIPIITAHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSFDDNLNIFGNLAQVN 420

Query: 421 FLIGYDLERKRLSFKHKVCA 440
            LIGYDLERKRLSFKHKVCA
Sbjct: 421 ILIGYDLERKRLSFKHKVCA 425

BLAST of CmoCh08G008920 vs. NCBI nr
Match: XP_023543528.1 (aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 574.7 bits (1480), Expect = 6.8e-160
Identity = 295/440 (67.05%), Postives = 350/440 (79.55%), Query Frame = 0

Query: 5   SMFFYLFLFS-SLATTNGG---GGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSF 64
           S+FF  FL S S AT +GG   GG GFTTSLFHRD+    L+ PS S Y+RLTNAFRRSF
Sbjct: 5   SIFFCFFLISFSQATVHGGVGDGGHGFTTSLFHRDSFLSPLYNPSLSHYDRLTNAFRRSF 64

Query: 65  SRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 124
           SR  TLL+ +  VSTTGIH+ +IP  +DGE++MSIS+GTP +K  AIADTGSDLTWTQCM
Sbjct: 65  SRSDTLLNRAAAVSTTGIHSRIIP--DDGEFLMSISIGTPRVKIMAIADTGSDLTWTQCM 124

Query: 125 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNG 184
           PC  CFNQS P+FNPR S SYR V CTS  CRS+    C P + +C+Y Y+Y DQSFT G
Sbjct: 125 PCHKCFNQSFPIFNPRRSFSYRHVSCTSNACRSLDDYRCGPDNRTCSYGYSYGDQSFTYG 184

Query: 185 DLAFEKLTIGSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 244
           DLA EK+T+GSFKL+ T+IGCGH NGG F G TSGIIGLG G +SL+SQMRKIA +KRRF
Sbjct: 185 DLASEKITVGSFKLYKTVIGCGHVNGGTFSGDTSGIIGLGGGPLSLISQMRKIAAVKRRF 244

Query: 245 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR 304
           SYCLPTFFSD NVTG+I+FG+ A+VSGR   KV+STPLVLK P+ FY++TL+A+S+ANKR
Sbjct: 245 SYCLPTFFSDKNVTGKISFGKKAIVSGR---KVISTPLVLKEPNTFYYVTLKAMSVANKR 304

Query: 305 LEAA-NVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYV 364
            +AA N+S+A+ERGNI+IDSGTTLT LP NLY  V STLA VV+AKRV DP G+L+LC+ 
Sbjct: 305 FKAANNMSAAVERGNILIDSGTTLTILPPNLYKGVASTLAHVVKAKRVNDPTGVLDLCFA 364

Query: 365 AAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVN 424
              VD L+IP+IT HF+GGA VKLLPLNTF  VADNV CL+F  S+  +  IFGNLAQVN
Sbjct: 365 TRSVDHLNIPVITAHFAGGADVKLLPLNTFAMVADNVACLAFVPSA--NFAIFGNLAQVN 424

Query: 425 FLIGYDLERKRLSFKHKVCA 440
           FL+GYDLERKRLSFK+ VCA
Sbjct: 425 FLVGYDLERKRLSFKYNVCA 437

BLAST of CmoCh08G008920 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 358.6 bits (919), Expect = 7.1e-99
Identity = 202/439 (46.01%), Postives = 274/439 (62.41%), Query Frame = 0

Query: 6   MFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSFSRYAT 65
           +F  L     L+  N    DGFT  L HRD+     +  + +  +R+ NA RRS     +
Sbjct: 5   IFATLLSLLLLSNVNAYPKDGFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRS---ARS 64

Query: 66  LLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCMPCINC 125
            L  S   ++       I +N  GEY+M+IS+GTPP+   AIADTGSDL WTQC PC +C
Sbjct: 65  TLQFSNDDASPNSPQSFITSNR-GEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDC 124

Query: 126 FNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSFTNGDLAFE 185
           + Q++PLF+P+ SS+YR V C+S+ CR++  + C     +C+Y   Y D S+T GD+A +
Sbjct: 125 YQQTSPLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVD 184

Query: 186 KLTIGS-----FKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRF 245
            +T+GS       L N IIGCGH+N G F    SGIIGLG GS SLVSQ+RK  N K  F
Sbjct: 185 TVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGK--F 244

Query: 246 SYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSPFYFLTLEAISIANKR 305
           SYCL  F S++ +T +INFG N +VS   G  VVST +V K P+ +YFL LEAIS+ +K+
Sbjct: 245 SYCLVPFTSETGLTSKINFGTNGIVS---GDGVVSTSMVKKDPATYYFLNLEAISVGSKK 304

Query: 306 LEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGILELCYVA 365
           ++  +       GNI+IDSGTTLT LP N Y  + S +AS ++A+RV+DP GIL LCY  
Sbjct: 305 IQFTSTIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCY-- 364

Query: 366 AKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFGNLAQVNF 425
                  +P IT HF GG  VKL  LNTFV V+++V+C +F  ++++ L IFGNLAQ+NF
Sbjct: 365 RDSSSFKVPDITVHFKGG-DVKLGNLNTFVAVSEDVSCFAF--AANEQLTIFGNLAQMNF 424

Query: 426 LIGYDLERKRLSFKHKVCA 440
           L+GYD     +SFK   C+
Sbjct: 425 LVGYDTVSGTVSFKKTDCS 429

BLAST of CmoCh08G008920 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 327.8 bits (839), Expect = 1.3e-89
Identity = 189/446 (42.38%), Postives = 268/446 (60.09%), Query Frame = 0

Query: 4   NSMFFYLFLFSSLATTNGGGGD--GFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSFS 63
           +S+   L L SSL  +N       GFT  L HRD+     + P  +  +RL NA  RS +
Sbjct: 6   SSVLLSLCLLSSLFLSNANAKPKLGFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVN 65

Query: 64  RYATLLDHSTTVSTTGIHTPLIP-NNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCM 123
           R     +   T        P I   +  GEY+M++S+GTPP    AIADTGSDL WTQC 
Sbjct: 66  RVFHFTEKDNTPQ------PQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCA 125

Query: 124 PCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSI-SISPCMPHHPSCTYNYNYADQSFTN 183
           PC +C+ Q  PLF+P+ SS+Y+ V C+S+ C ++ + + C  +  +C+Y+ +Y D S+T 
Sbjct: 126 PCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTK 185

Query: 184 GDLAFEKLTIGS-----FKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIA 243
           G++A + LT+GS      +L N IIGCGH N G F  + SGI+GLG G +SL+ Q+    
Sbjct: 186 GNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGD-- 245

Query: 244 NLKRRFSYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLK-FPSPFYFLTLEA 303
           ++  +FSYCL    S  + T +INFG NA+VS   G  VVSTPL+ K     FY+LTL++
Sbjct: 246 SIDGKFSYCLVPLTSKKDQTSKINFGTNAIVS---GSGVVSTPLIAKASQETFYYLTLKS 305

Query: 304 ISIANKRLEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPLGI 363
           IS+ +K+++ +   S    GNIIIDSGTTLT LP   Y+ +   +AS + A++ +DP   
Sbjct: 306 ISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSG 365

Query: 364 LELCYVAAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFG 423
           L LCY A    DL +P+IT HF  GA VKL   N FV V++++ C +F+ S     +I+G
Sbjct: 366 LSLCYSA--TGDLKVPVITMHFD-GADVKLDSSNAFVQVSEDLVCFAFRGS--PSFSIYG 425

Query: 424 NLAQVNFLIGYDLERKRLSFKHKVCA 440
           N+AQ+NFL+GYD   K +SFK   CA
Sbjct: 426 NVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CmoCh08G008920 vs. TAIR 10
Match: AT2G35615.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 317.0 bits (811), Expect = 2.4e-86
Identity = 187/446 (41.93%), Postives = 271/446 (60.76%), Query Frame = 0

Query: 11  FLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSFSRYATLLDHS 70
           FLF S+  ++ G    F+  L HRD+    ++ P  +  +RL  AF RS SR +   +H 
Sbjct: 10  FLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSR-SRRFNHQ 69

Query: 71  TTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCMPCINCFNQST 130
             +S T + + LI    DGE+ MSI++GTPPIK FAIADTGSDLTW QC PC  C+ ++ 
Sbjct: 70  --LSQTDLQSGLI--GADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENG 129

Query: 131 PLFNPRNSSSYRPVPCTSTTCRSISISP--CMPHHPSCTYNYNYADQSFTNGDLAFEKLT 190
           P+F+ + SS+Y+  PC S  C+++S +   C   +  C Y Y+Y DQSF+ GD+A E ++
Sbjct: 130 PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVS 189

Query: 191 IGS-----FKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLKRRFSYC 250
           I S          T+ GCG+ NGG F    SGIIGLG G +SL+SQ+   +++ ++FSYC
Sbjct: 190 IDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLG--SSISKKFSYC 249

Query: 251 LPTFFSDSNVTGRINFGRNAVVSG-RKGHKVVSTPLVLKFPSPFYFLTLEAISIANKRLE 310
           L    + +N T  IN G N++ S   K   VVSTPLV K P  +Y+LTLEAIS+  K++ 
Sbjct: 250 LSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIP 309

Query: 311 AANVS--------SALERGNIIIDSGTTLTFLPRNLYNAVISTL-ASVVRAKRVEDPLGI 370
               S         +   GNIIIDSGTTLT L    ++   S +  SV  AKRV DP G+
Sbjct: 310 YTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGL 369

Query: 371 LELCYVAAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDLNIFG 430
           L  C+ +    ++ +P IT HF+ GA V+L P+N FV +++++ CLS   ++  ++ I+G
Sbjct: 370 LSHCFKSGSA-EIGLPEITVHFT-GADVRLSPINAFVKLSEDMVCLSMVPTT--EVAIYG 429

Query: 431 NLAQVNFLIGYDLERKRLSFKHKVCA 440
           N AQ++FL+GYDLE + +SF+H  C+
Sbjct: 430 NFAQMDFLVGYDLETRTVSFQHMDCS 444

BLAST of CmoCh08G008920 vs. TAIR 10
Match: AT1G31450.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 303.9 bits (777), Expect = 2.1e-82
Identity = 185/450 (41.11%), Postives = 261/450 (58.00%), Query Frame = 0

Query: 5   SMFFYLFLFSSLATTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFRRSFSRYA 64
           S+    F F+S ++ N    +  T  L HRD+    L+ P  +  +RL  AF RS SR  
Sbjct: 10  SLLAISFFFASNSSAN---RENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRSR 69

Query: 65  TLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWTQCMPCIN 124
                    + T + + LI N   GEY MSIS+GTPP K FAIADTGSDLTW QC PC  
Sbjct: 70  RF------TTKTDLQSGLISNG--GEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQ 129

Query: 125 CFNQSTPLFNPRNSSSYRPVPCTSTTCRSIS--ISPCMPHHPSCTYNYNYADQSFTNGDL 184
           C+ Q++PLF+ + SS+Y+   C S TC+++S     C      C Y Y+Y D SFT GD+
Sbjct: 130 CYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDV 189

Query: 185 AFEKLTI-----GSFKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRKIANLK 244
           A E ++I      S     T+ GCG+ NGG F+   SGIIGLG G +SLVSQ+   +++ 
Sbjct: 190 ATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLG--SSIG 249

Query: 245 RRFSYCLPTFFSDSNVTGRINFGRNAVVSG-RKGHKVVSTPLVLKFPSPFYFLTLEAISI 304
           ++FSYCL    + +N T  IN G N++ S   K    ++TPL+ K P  +YFLTLEA+++
Sbjct: 250 KKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTV 309

Query: 305 ANKRLE------AANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTL-ASVVRAKRVED 364
              +L         N  S+   GNIIIDSGTTLT L    Y+   + +  SV  AKRV D
Sbjct: 310 GKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSD 369

Query: 365 PLGILELCYVAAKVDDLDIPIITTHFSGGAAVKLLPLNTFVTVADNVTCLSFKSSSDDDL 424
           P G+L  C+ +    ++ +P IT HF+  A VKL P+N FV + ++  CLS   ++  ++
Sbjct: 370 PQGLLTHCFKSGD-KEIGLPAITMHFT-NADVKLSPINAFVKLNEDTVCLSMIPTT--EV 429

Query: 425 NIFGNLAQVNFLIGYDLERKRLSFKHKVCA 440
            I+GN+ Q++FL+GYDLE K +SF+   C+
Sbjct: 430 AIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442

BLAST of CmoCh08G008920 vs. TAIR 10
Match: AT2G28010.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 229.9 bits (585), Expect = 3.8e-60
Identity = 164/450 (36.44%), Postives = 223/450 (49.56%), Query Frame = 0

Query: 1   MPLNSMFFYLFLFSSLA---TTNGGGGDGFTTSLFHRDTLFPTLHLPSPSRYERLTNAFR 60
           M L +    LFL  SL    TT      GFT  L HR +          +   R++N   
Sbjct: 1   MSLATTIIVLFLQISLCFLFTTTASPPHGFTMDLIHRRS----------NASSRVSNTQS 60

Query: 61  RSFSRYATLLDHSTTVSTTGIHTPLIPNNEDGEYVMSISMGTPPIKFFAIADTGSDLTWT 120
            S     T+ D+S                    Y+M + +GTPP +  AI DTGS++TWT
Sbjct: 61  GSSPYANTVFDNSV-------------------YLMKLQVGTPPFEIQAIIDTGSEITWT 120

Query: 121 QCMPCINCFNQSTPLFNPRNSSSYRPVPCTSTTCRSISISPCMPHHPSCTYNYNYADQSF 180
           QC+PC++C+ Q+ P+F+P  SS+++   C                  SC Y  +Y D ++
Sbjct: 121 QCLPCVHCYEQNAPIFDPSKSSTFKEKRCDG---------------HSCPYEVDYFDHTY 180

Query: 181 TNGDLAFEKLTIGS-----FKLHNTIIGCGHQNGGIFKGRTSGIIGLGTGSISLVSQMRK 240
           T G LA E +T+ S     F +  TIIGCGH N   FK   SG++GL  G  SL++QM  
Sbjct: 181 TMGTLATETITLHSTSGEPFVMPETIIGCGHNNSW-FKPSFSGMVGLNWGPSSLITQMG- 240

Query: 241 IANLKRRFSYCLPTFFSDSNVTGRINFGRNAVVSGRKGHKVVSTPLVLKFPSP-FYFLTL 300
                   SYC    FS    T +INFG NA+V+   G  VVST + +    P FY+L L
Sbjct: 241 -GEYPGLMSYC----FSGQG-TSKINFGANAIVA---GDGVVSTTMFMTTAKPGFYYLNL 300

Query: 301 EAISIANKRLEAANVSSALERGNIIIDSGTTLTFLPRNLYNAVISTLASVVRAKRVEDPL 360
           +A+S+ N R+E    +     GNI+IDSGTTLT+ P +  N V   +  VV A R  DP 
Sbjct: 301 DAVSVGNTRIETMGTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPT 360

Query: 361 GILELCYVAAKVDDLDI-PIITTHFSGGAAVKLLPLNTFVTVAD-NVTCLSFKSSSDDDL 420
           G   LCY     D +DI P+IT HFSGG  + L   N ++   +  V CL+   +S    
Sbjct: 361 GNDMLCY---NSDTIDIFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQE 392

Query: 421 NIFGNLAQVNFLIGYDLERKRLSFKHKVCA 440
            IFGN AQ NFL+GYD     +SF    C+
Sbjct: 421 AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q6XBF81.9e-8842.38Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Q3EBM53.3e-8541.93Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561... [more]
Q766C29.2e-5935.01Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q766C31.2e-5835.26Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q9LS408.6e-4932.45Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Match NameE-valueIdentityDescription
A0A6J1HHU32.5e-253100.00aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111464208 PE=3... [more]
A0A6J1KKI84.2e-22491.14aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111494873 PE=3 S... [more]
A0A6J1FP399.5e-16067.50aspartic proteinase CDR1-like OS=Cucurbita moschata OX=3662 GN=LOC111447216 PE=3... [more]
A0A6J1J8582.7e-15464.91aspartic proteinase CDR1-like OS=Cucurbita maxima OX=3661 GN=LOC111483445 PE=3 S... [more]
A0A0A0KZZ36.0e-15465.07Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G05541... [more]
Match NameE-valueIdentityDescription
XP_022964071.15.2e-253100.00aspartic proteinase CDR1-like [Cucurbita moschata][more]
KAG6593678.17.3e-24797.95Aspartic proteinase CDR1, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_023514483.14.0e-24597.27aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo][more]
XP_023000629.18.7e-22491.14aspartic proteinase CDR1-like [Cucurbita maxima][more]
XP_023543528.16.8e-16067.05aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
AT1G64830.17.1e-9946.01Eukaryotic aspartyl protease family protein [more]
AT5G33340.11.3e-8942.38Eukaryotic aspartyl protease family protein [more]
AT2G35615.12.4e-8641.93Eukaryotic aspartyl protease family protein [more]
AT1G31450.12.1e-8241.11Eukaryotic aspartyl protease family protein [more]
AT2G28010.13.8e-6036.44Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 286..434
e-value: 4.6E-27
score: 94.8
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 91..260
e-value: 6.4E-52
score: 176.3
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 268..439
e-value: 4.3E-45
score: 155.7
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 67..260
e-value: 9.7E-53
score: 181.0
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 85..438
NoneNo IPR availablePANTHERPTHR47967:SF66ASPARTIC PROTEINASE CDR1-RELATEDcoord: 10..438
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 10..438
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 315..326
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 91..434
score: 43.09787
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 90..438
e-value: 8.04189E-86
score: 261.429

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh08G008920.1CmoCh08G008920.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
molecular_function GO:0004190 aspartic-type endopeptidase activity