CmoCh13G004760 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh13G004760
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionPeptidase A1 domain-containing protein
LocationCmo_Chr13: 5783925 .. 5785352 (-)
RNA-Seq ExpressionCmoCh13G004760
SyntenyCmoCh13G004760
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGACAAACCCTAGCAAACCCTAAAACCAAATTCCTTAAAGATTCTCTAGTTCTTGGTCTTGTTCATTCAAGAACTTCCCTCCTAACCCCAAAAAGAGGCTATAATTCCCTTTTAACGAAGAGAATTAAGCCAATGGAAATGGGTAATGATGATGTTATAGAGCCATTGAGGGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACTGGAAGTGACCTTACATGGGTACCTTGTGGGAACCTCTCATTTGATTGCCAAGATTGTGATGAGTATCAAAACAATGTTTTAGGTCCAAAATTGGCTGCTTTTTTGCCTACTCATTCTTCTACTTCCATTAGAGACACTTGTGGGAGCTCCTTTTGCATTGATATCCATAGCTCTGATAACCCTTTTGACCCTTGCACAATTGCTGGCTGTTCCCTTGCTACCCTTGTAAAGGGCACTTGCCCTAGACCATGCCCTTCATTCTCTTACACTTATGGGGCTAGTGGCCTTGTAATTGGAACCCTAACTAAAGATGTCATTTTTATCCATGGAAATTCCCCAAATTCCTCAAGAAAAATCCCTAAATTTTGCTTTGGATGTGTTGGTGCCACTTATAGAGAGCCAATTGGGATTGCTGGCTTTGGTAGAGGCCTTCTTTCTTTACCTTCTCAATTAGGGTTTTCTCATAAGGGCTTCTCTCATTGTTTCTTGCCCTTTAAATTCTCAAATAACCCTAATTTTTCAAGCCCTTTGATTCTTGGTAATCTAGCTATTTCTTCTAAAGAACATTTGAAATTCACCCCTTTTTTGAAAAGTCCATTTTACCCTAATTATTACTATATTGGGCTTGAGTCAATCACTATTGGAAATGGTGAAAATTACTCTAGATTTGGAGTTTCCTTGCAATTGAGAGAGATTGACACAAAGGGTAATGGTGGAATTTTGATTGATTCTGGTACTACTTATACTCATTTACCAGAACCATTATATTCACAGCTTATTTCAAATCTTGAGTCATTAATAAGCTATCCAAGAGCTAAAGAACATGAACTCAATACTGGGTTTGATCTTTGTTACAAAGTTCCTTATAAAAACAACACCTTTTTTAGTGATGAATTTGAGCTTCCTTCTATAACATTTCATTTTTTAAACAATGTTAGTGTTGTTTTGCCTCAAGGGAACAGTTTTTATGCCATGGCTGCTCCTAGTAACTCCACTGTTGTGAAATGCTTGCTGTTTCAAAGCATGGACGGCGACGGAGACGGGCCGGCGGGCATTTTCGGGAGCTTTCAACAGCAAAATTTGGAGGTTGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTGAAGCTATGGATTGTGCTTCTGTTGCTGTGTCTCAAGGACTTCATAAGAAGGAATGA

mRNA sequence

ATGGGACAAACCCTAGCAAACCCTAAAACCAAATTCCTTAAAGATTCTCTAGTTCTTGGTCTTGTTCATTCAAGAACTTCCCTCCTAACCCCAAAAAGAGGCTATAATTCCCTTTTAACGAAGAGAATTAAGCCAATGGAAATGGGTAATGATGATGTTATAGAGCCATTGAGGGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACTGGAAGTGACCTTACATGGGTACCTTGTGGGAACCTCTCATTTGATTGCCAAGATTGTGATGAGTATCAAAACAATGTTTTAGGTCCAAAATTGGCTGCTTTTTTGCCTACTCATTCTTCTACTTCCATTAGAGACACTTGTGGGAGCTCCTTTTGCATTGATATCCATAGCTCTGATAACCCTTTTGACCCTTGCACAATTGCTGGCTGTTCCCTTGCTACCCTTGTAAAGGGCACTTGCCCTAGACCATGCCCTTCATTCTCTTACACTTATGGGGCTAGTGGCCTTGTAATTGGAACCCTAACTAAAGATGTCATTTTTATCCATGGAAATTCCCCAAATTCCTCAAGAAAAATCCCTAAATTTTGCTTTGGATGTGTTGGTGCCACTTATAGAGAGCCAATTGGGATTGCTGGCTTTGGTAGAGGCCTTCTTTCTTTACCTTCTCAATTAGGGTTTTCTCATAAGGGCTTCTCTCATTGTTTCTTGCCCTTTAAATTCTCAAATAACCCTAATTTTTCAAGCCCTTTGATTCTTGGTAATCTAGCTATTTCTTCTAAAGAACATTTGAAATTCACCCCTTTTTTGAAAAGTCCATTTTACCCTAATTATTACTATATTGGGCTTGAGTCAATCACTATTGGAAATGGTGAAAATTACTCTAGATTTGGAGTTTCCTTGCAATTGAGAGAGATTGACACAAAGGGTAATGGTGGAATTTTGATTGATTCTGGTACTACTTATACTCATTTACCAGAACCATTATATTCACAGCTTATTTCAAATCTTGAGTCATTAATAAGCTATCCAAGAGCTAAAGAACATGAACTCAATACTGGGTTTGATCTTTGTTACAAAGTTCCTTATAAAAACAACACCTTTTTTAGTGATGAATTTGAGCTTCCTTCTATAACATTTCATTTTTTAAACAATGTTAGTGTTGTTTTGCCTCAAGGGAACAGTTTTTATGCCATGGCTGCTCCTAGTAACTCCACTGTTGTGAAATGCTTGCTGTTTCAAAGCATGGACGGCGACGGAGACGGGCCGGCGGGCATTTTCGGGAGCTTTCAACAGCAAAATTTGGAGGTTGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTGAAGCTATGGATTGTGCTTCTGTTGCTGTGTCTCAAGGACTTCATAAGAAGGAATGA

Coding sequence (CDS)

ATGGGACAAACCCTAGCAAACCCTAAAACCAAATTCCTTAAAGATTCTCTAGTTCTTGGTCTTGTTCATTCAAGAACTTCCCTCCTAACCCCAAAAAGAGGCTATAATTCCCTTTTAACGAAGAGAATTAAGCCAATGGAAATGGGTAATGATGATGTTATAGAGCCATTGAGGGAAATTAGGGATGGTTATTTGATGTCCTTAACATTAGGGACACCCCCACAAGTTATTCAAGTGTATATGGACACTGGAAGTGACCTTACATGGGTACCTTGTGGGAACCTCTCATTTGATTGCCAAGATTGTGATGAGTATCAAAACAATGTTTTAGGTCCAAAATTGGCTGCTTTTTTGCCTACTCATTCTTCTACTTCCATTAGAGACACTTGTGGGAGCTCCTTTTGCATTGATATCCATAGCTCTGATAACCCTTTTGACCCTTGCACAATTGCTGGCTGTTCCCTTGCTACCCTTGTAAAGGGCACTTGCCCTAGACCATGCCCTTCATTCTCTTACACTTATGGGGCTAGTGGCCTTGTAATTGGAACCCTAACTAAAGATGTCATTTTTATCCATGGAAATTCCCCAAATTCCTCAAGAAAAATCCCTAAATTTTGCTTTGGATGTGTTGGTGCCACTTATAGAGAGCCAATTGGGATTGCTGGCTTTGGTAGAGGCCTTCTTTCTTTACCTTCTCAATTAGGGTTTTCTCATAAGGGCTTCTCTCATTGTTTCTTGCCCTTTAAATTCTCAAATAACCCTAATTTTTCAAGCCCTTTGATTCTTGGTAATCTAGCTATTTCTTCTAAAGAACATTTGAAATTCACCCCTTTTTTGAAAAGTCCATTTTACCCTAATTATTACTATATTGGGCTTGAGTCAATCACTATTGGAAATGGTGAAAATTACTCTAGATTTGGAGTTTCCTTGCAATTGAGAGAGATTGACACAAAGGGTAATGGTGGAATTTTGATTGATTCTGGTACTACTTATACTCATTTACCAGAACCATTATATTCACAGCTTATTTCAAATCTTGAGTCATTAATAAGCTATCCAAGAGCTAAAGAACATGAACTCAATACTGGGTTTGATCTTTGTTACAAAGTTCCTTATAAAAACAACACCTTTTTTAGTGATGAATTTGAGCTTCCTTCTATAACATTTCATTTTTTAAACAATGTTAGTGTTGTTTTGCCTCAAGGGAACAGTTTTTATGCCATGGCTGCTCCTAGTAACTCCACTGTTGTGAAATGCTTGCTGTTTCAAAGCATGGACGGCGACGGAGACGGGCCGGCGGGCATTTTCGGGAGCTTTCAACAGCAAAATTTGGAGGTTGTTTATGATTTGGAGAAGGAAAGATTAGGGTTTGAAGCTATGGATTGTGCTTCTGTTGCTGTGTCTCAAGGACTTCATAAGAAGGAATGA

Protein sequence

MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE
Homology
BLAST of CmoCh13G004760 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 216.1 bits (549), Expect = 8.7e-55
Identity = 149/439 (33.94%), Postives = 207/439 (47.15%), Query Frame = 0

Query: 64  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSS 123
           YL+SL++G+    + +Y+DTGSDL W PC    F C  C   ++  L P   + L   SS
Sbjct: 83  YLISLSVGSSSSAVSLYLDTGSDLVWFPC--RPFTCILC---ESKPLPPSPPSSL---SS 142

Query: 124 TSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFSYTYGASGLV 183
           ++   +C S  C   HSS    D C I+ C L  +  G C     PCP F Y YG  G +
Sbjct: 143 SATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYG-DGSL 202

Query: 184 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SH 243
           +  L  D + +       S  +  F FGC   T  EPIG+AGFGRG LSLP+QL     H
Sbjct: 203 VAKLYSDSLSL------PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPH 262

Query: 244 KG--FSHCFLPFKF-SNNPNFSSPLILGNLA-------------------ISSKEHLKFT 303
            G  FS+C +   F S+     SPLILG                         K    FT
Sbjct: 263 LGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFT 322

Query: 304 PFLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPE 363
             L++P +P +Y + L+ I+IG             LR ID  G GG+++DSGTT+T LP 
Sbjct: 323 EMLENPKHPYFYSVSLQGISIGK----RNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPA 382

Query: 364 PLYSQLISNLESLIS--YPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFL-N 423
             Y+ ++   +S +   + RA   E ++G   CY   Y N T      ++P++  HF  N
Sbjct: 383 KFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCY---YLNQT-----VKVPALVLHFAGN 442

Query: 424 NVSVVLPQGNSFYAMA----APSNSTVVKCLLFQSMDGDGD---GPAGIFGSFQQQNLEV 466
             SV LP+ N FY              + CL+  +   + +   G   I G++QQQ  EV
Sbjct: 443 RSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEV 494

BLAST of CmoCh13G004760 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 147.1 bits (370), Expect = 5.0e-34
Identity = 122/408 (29.90%), Postives = 169/408 (41.42%), Query Frame = 0

Query: 64  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSS 123
           YLM+L++GTP Q     MDTGSDL W         CQ C +  N         F P  SS
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWT-------QCQPCTQCFNQ----STPIFNPQGSS 154

Query: 124 TSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGT 183
           +     C S  C                      L   TC      ++Y YG      G+
Sbjct: 155 SFSTLPCSSQLC--------------------QALSSPTCSNNFCQYTYGYGDGSETQGS 214

Query: 184 LTKDVIFIHGNSPNSSRKIPKFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSHK 243
           +  + +         S  IP   FGC     G       G+ G GRG LSLPSQL  +  
Sbjct: 215 MGTETLTF------GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTK- 274

Query: 244 GFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGN 303
            FS+C  P   S   N    L+LG+LA S       T  ++S   P +YYI L  +++G+
Sbjct: 275 -FSYCMTPIGSSTPSN----LLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 334

Query: 304 GE---NYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAK 363
                + S F ++         G GGI+IDSGTT T+     Y  +     S I+ P   
Sbjct: 335 TRLPIDPSAFALN------SNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVV- 394

Query: 364 EHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTV 423
            +  ++GFDLC++ P           ++P+   HF +   + LP  N F    +PSN  +
Sbjct: 395 -NGSSSGFDLCFQTPSD-----PSNLQIPTFVMHF-DGGDLELPSENYF---ISPSNGLI 436

Query: 424 VKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCAS 465
             CL      G       IFG+ QQQN+ VVYD     + F +  C +
Sbjct: 455 --CLAM----GSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQCGA 436

BLAST of CmoCh13G004760 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 1.1e-33
Identity = 122/437 (27.92%), Postives = 179/437 (40.96%), Query Frame = 0

Query: 32  KRGYNSLLTKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVP 91
           KRG   +  + I  M   +  +  P+      YLM++ +GTP       MDTGSDL W  
Sbjct: 66  KRGERRM--RSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQ 125

Query: 92  CGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIA 151
           C      C  C      +       F P  SS+     C S +C D+     P + C   
Sbjct: 126 CE----PCTQCFSQPTPI-------FNPQDSSSFSTLPCESQYCQDL-----PSETCNNN 185

Query: 152 GCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGC-- 211
            C                ++Y YG      G +  +      +S      +P   FGC  
Sbjct: 186 EC---------------QYTYGYGDGSTTQGYMATETFTFETSS------VPNIAFGCGE 245

Query: 212 --VGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAI 271
              G       G+ G G G LSLPSQLG     FS+C   +  S+     S L LG+ A 
Sbjct: 246 DNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQ--FSYCMTSYGSSS----PSTLALGSAAS 305

Query: 272 SSKEHLKFTPFLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDS 331
              E    T  + S   P YYYI L+ IT+G G+N    G+     ++   G GG++IDS
Sbjct: 306 GVPEGSPSTTLIHSSLNPTYYYITLQGITVG-GDN---LGIPSSTFQLQDDGTGGMIIDS 365

Query: 332 GTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSI 391
           GTT T+LP+  Y+ +       I+ P   E   ++G   C++ P   +T      ++P I
Sbjct: 366 GTTLTYLPQDAYNAVAQAFTDQINLPTVDES--SSGLSTCFQQPSDGST-----VQVPEI 425

Query: 392 TFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVV 451
           +  F   V  +  Q      + +P+   +  CL   S    G     IFG+ QQQ  +V+
Sbjct: 426 SMQFDGGVLNLGEQN----ILISPAEGVI--CLAMGSSSQLG---ISIFGNIQQQETQVL 437

Query: 452 YDLEKERLGFEAMDCAS 465
           YDL+   + F    C +
Sbjct: 486 YDLQNLAVSFVPTQCGA 437

BLAST of CmoCh13G004760 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 143.3 bits (360), Expect = 7.2e-33
Identity = 123/426 (28.87%), Postives = 182/426 (42.72%), Query Frame = 0

Query: 50  NDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNV 109
           +  V+  L +    Y   L +GTP + + + +DTGSD+ W+ C      C+ C    + +
Sbjct: 128 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA----PCRRCYSQSDPI 187

Query: 110 LGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPS 169
                  F P  S T     C S  C  + S          AGC+     + TC      
Sbjct: 188 -------FDPRKSKTYATIPCSSPHCRRLDS----------AGCNTR---RKTC-----L 247

Query: 170 FSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGC--------VGATYREPIGIA 229
           +  +YG     +G  + + +    N      ++     GC        VGA      G+ 
Sbjct: 248 YQVSYGDGSFTVGDFSTETLTFRRN------RVKGVALGCGHDNEGLFVGAA-----GLL 307

Query: 230 GFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFL 289
           G G+G LS P Q G  F+ K FS+C +    S+ P   S ++ GN A+S     +FTP L
Sbjct: 308 GLGKGKLSFPGQTGHRFNQK-FSYCLVDRSASSKP---SSVVFGNAAVS--RIARFTPLL 367

Query: 290 KSPFYPNYYYIGLESITIGNGENYSRF-GVSLQLREIDTKGNGGILIDSGTTYTHLPEPL 349
            +P    +YY+GL  I++G     +R  GV+  L ++D  GNGG++IDSGT+ T L  P 
Sbjct: 368 SNPKLDTFYYVGLLGISVGG----TRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPA 427

Query: 350 YSQLISNLE-SLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSV 409
           Y  +         +  RA +  L   FD C+ +   N      E ++P++  HF     V
Sbjct: 428 YIAMRDAFRVGAKTLKRAPDFSL---FDTCFDLSNMN------EVKVPTVVLHF-RGADV 485

Query: 410 VLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGF 464
            LP  N  Y +   +N     C  F    G       I G+ QQQ   VVYDL   R+GF
Sbjct: 488 SLPATN--YLIPVDTNGKF--CFAFAGTMGG----LSIIGNIQQQGFRVVYDLASSRVGF 485

BLAST of CmoCh13G004760 vs. ExPASy Swiss-Prot
Match: Q7XV21 (Aspartyl protease 37 OS=Oryza sativa subsp. japonica OX=39947 GN=AP37 PE=3 SV=2)

HSP 1 Score: 119.0 bits (297), Expect = 1.4e-25
Identity = 117/438 (26.71%), Postives = 181/438 (41.32%), Query Frame = 0

Query: 56  PLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLA 115
           P+      YL+ L +GTPP      +DT SDL W  C      C  C    + +  P++ 
Sbjct: 81  PIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ----PCTGCYHQVDPMFNPRV- 140

Query: 116 AFLPTHSSTSIRDTCGSSFC--IDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYT 175
                 SST     C S  C  +D+H   +  D               +C      ++YT
Sbjct: 141 ------SSTYAALPCSSDTCDELDVHRCGHDDDE--------------SC-----QYTYT 200

Query: 176 YGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGCV-----GATYREPIGIAGFGRGLL 235
           Y  +    GTL  D + I     ++ R +    FGC      GA   +  G+ G GRG L
Sbjct: 201 YSGNATTEGTLAVDKLVI---GEDAFRGV---AFGCSTSSTGGAPPPQASGVVGLGRGPL 260

Query: 236 SLPSQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLK--FTPFLKSPFYPN 295
           SL SQL  S + F++C LP   S  P     L+LG  A +++        P  + P YP+
Sbjct: 261 SLVSQL--SVRRFAYC-LPPPASRIP---GKLVLGADADAARNATNRIAVPMRRDPRYPS 320

Query: 296 YYYIGLESITIGNGENYSRFGVSLQLREIDT-------------------KGNGGILIDS 355
           YYY+ L+ + IG+         +       T                       G++ID 
Sbjct: 321 YYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDI 380

Query: 356 GTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSI 415
            +T T L   LY +L+++LE  I  PR     L  G DLC+ +P   +    D   +P++
Sbjct: 381 ASTITFLEASLYDELVNDLEVEIRLPRGTGSSL--GLDLCFILP---DGVAFDRVYVPAV 440

Query: 416 TFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVV 466
              F +   + L +   F    A    + + CL+    +    G   I G+FQQQN++V+
Sbjct: 441 ALAF-DGRWLRLDKARLF----AEDRESGMMCLMVGRAEA---GSVSILGNFQQQNMQVL 463

BLAST of CmoCh13G004760 vs. ExPASy TrEMBL
Match: A0A6J1EHM1 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111434252 PE=3 SV=1)

HSP 1 Score: 986.5 bits (2549), Expect = 3.9e-284
Identity = 475/475 (100.00%), Postives = 475/475 (100.00%), Query Frame = 0

Query: 1   MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREI 60
           MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREI
Sbjct: 24  MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREI 83

Query: 61  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 120
           RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT
Sbjct: 84  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 143

Query: 121 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 180
           HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV
Sbjct: 144 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 203

Query: 181 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 240
           IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG
Sbjct: 204 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 263

Query: 241 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNG 300
           FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNG
Sbjct: 264 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNG 323

Query: 301 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL 360
           ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL
Sbjct: 324 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL 383

Query: 361 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 420
           NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL
Sbjct: 384 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 443

Query: 421 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE 476
           LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE
Sbjct: 444 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE 498

BLAST of CmoCh13G004760 vs. ExPASy TrEMBL
Match: A0A6J1KLG7 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111495254 PE=3 SV=1)

HSP 1 Score: 966.8 bits (2498), Expect = 3.2e-278
Identity = 466/475 (98.11%), Postives = 468/475 (98.53%), Query Frame = 0

Query: 1   MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREI 60
           MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSL  KRIKPMEMG+DDVIEPLREI
Sbjct: 22  MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGDDDVIEPLREI 81

Query: 61  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 120
           RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT
Sbjct: 82  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 141

Query: 121 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 180
           HSSTSIR+TCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSFSYTYGASGLV
Sbjct: 142 HSSTSIRETCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGACPRPCPSFSYTYGASGLV 201

Query: 181 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 240
           IGTLTKD IFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG
Sbjct: 202 IGTLTKDAIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 261

Query: 241 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNG 300
           FSHCFLPFKFSNNP FSSPLILGNLAISSKEHLKFTP LKSPFYPNYYYIGLESITIGNG
Sbjct: 262 FSHCFLPFKFSNNPKFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNG 321

Query: 301 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL 360
           ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLIS LESLISYPRAKEHEL
Sbjct: 322 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISILESLISYPRAKEHEL 381

Query: 361 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 420
           NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL
Sbjct: 382 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 441

Query: 421 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE 476
           LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE
Sbjct: 442 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE 496

BLAST of CmoCh13G004760 vs. ExPASy TrEMBL
Match: A0A5A7TNC9 (Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold129G00970 PE=3 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 1.8e-236
Identity = 404/490 (82.45%), Postives = 444/490 (90.61%), Query Frame = 0

Query: 3   QTLA-NPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEM--GNDDVIEPLRE 62
           QTLA NPKT F KDSLVLGLVHSRTSLLTPK+GYN +  KR+K M+   G+D+VIEPLRE
Sbjct: 26  QTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNFISKKRMKAMDQMDGDDNVIEPLRE 85

Query: 63  IRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLP 122
           IRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCGNLSFDCQDC+EYQNN+ GPKLAAFLP
Sbjct: 86  IRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPKLAAFLP 145

Query: 123 THSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGL 182
           THSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTYGASG+
Sbjct: 146 THSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGV 205

Query: 183 VIGTLTKDVIFIHG-------NSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPS 242
           V G+LT+DV+F+HG       N+ N+++++P+FCFGCVGATYREPIGIAGFGRGLLSLP 
Sbjct: 206 VTGSLTRDVLFMHGNYHNNNNNNSNNNKQVPRFCFGCVGATYREPIGIAGFGRGLLSLPF 265

Query: 243 QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK-EHLKFTPFLKSPFYPNYYYIG 302
           QLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LAISSK E+L+FTP LKSP YPNYYYIG
Sbjct: 266 QLGFSQKGFSHCFLPFKFSNNPNFSSPLILGHLAISSKDENLQFTPLLKSPIYPNYYYIG 325

Query: 303 LESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLIS 362
           LESITIGNG N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLES+IS
Sbjct: 326 LESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIS 385

Query: 363 YPRAKEHELNTGFDLCYKVPYK-NNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAA 422
           YPRAK+ ELNTGFDLCYKVP K NN+ F D+ +LPSITFHFLNNVSVVLPQGN+FYAMAA
Sbjct: 386 YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDSQLPSITFHFLNNVSVVLPQGNNFYAMAA 445

Query: 423 PSNSTVVKCLLFQSMDGDGD-------GPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCA 474
           P NSTVVKCLL+QSMDG GD       GPAGIFGSFQQQNL+VVYDLEKERLGF+AMDC 
Sbjct: 446 PINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNLQVVYDLEKERLGFQAMDCV 505

BLAST of CmoCh13G004760 vs. ExPASy TrEMBL
Match: A0A1S3CAK9 (aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103498305 PE=3 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 1.8e-236
Identity = 404/490 (82.45%), Postives = 444/490 (90.61%), Query Frame = 0

Query: 3   QTLA-NPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEM--GNDDVIEPLRE 62
           QTLA NPKT F KDSLVLGLVHSRTSLLTPK+GYN +  KR+K M+   G+D+VIEPLRE
Sbjct: 26  QTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNFISKKRMKAMDQMDGDDNVIEPLRE 85

Query: 63  IRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLP 122
           IRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCGNLSFDCQDC+EYQNN+ GPKLAAFLP
Sbjct: 86  IRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPKLAAFLP 145

Query: 123 THSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGL 182
           THSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSF+YTYGASG+
Sbjct: 146 THSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFAYTYGASGV 205

Query: 183 VIGTLTKDVIFIHG-------NSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPS 242
           V G+LT+DV+F+HG       N+ N+++++P+FCFGCVGATYREPIGIAGFGRGLLSLP 
Sbjct: 206 VTGSLTRDVLFMHGNYHNNNNNNSNNNKQVPRFCFGCVGATYREPIGIAGFGRGLLSLPF 265

Query: 243 QLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK-EHLKFTPFLKSPFYPNYYYIG 302
           QLGFS KGFSHCFLPFKFSNNPNFSSPLILG+LAISSK E+L+FTP LKSP YPNYYYIG
Sbjct: 266 QLGFSQKGFSHCFLPFKFSNNPNFSSPLILGHLAISSKDENLQFTPLLKSPIYPNYYYIG 325

Query: 303 LESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLIS 362
           LESITIGNG N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLES+IS
Sbjct: 326 LESITIGNGNNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLESVIS 385

Query: 363 YPRAKEHELNTGFDLCYKVPYK-NNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAA 422
           YPRAK+ ELNTGFDLCYKVP K NN+ F D+ +LPSITFHFLNNVSVVLPQGN+FYAMAA
Sbjct: 386 YPRAKQVELNTGFDLCYKVPCKNNNSSFVDDSQLPSITFHFLNNVSVVLPQGNNFYAMAA 445

Query: 423 PSNSTVVKCLLFQSMDGDGD-------GPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCA 474
           P NSTVVKCLL+QSMDG GD       GPAGIFGSFQQQNL+VVYDLEKERLGF+AMDC 
Sbjct: 446 PINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNLQVVYDLEKERLGFQAMDCV 505

BLAST of CmoCh13G004760 vs. ExPASy TrEMBL
Match: A0A0A0LYP0 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G704590 PE=3 SV=1)

HSP 1 Score: 826.2 bits (2133), Expect = 6.8e-236
Identity = 402/486 (82.72%), Postives = 441/486 (90.74%), Query Frame = 0

Query: 3   QTLA-NPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEM--GNDDVIEPLRE 62
           QTLA NPKT F KDSLVLGLVHSRTSLLTPK+GYN +  KR+K M+   G+D+VIEPLRE
Sbjct: 26  QTLATNPKTNFPKDSLVLGLVHSRTSLLTPKKGYNFISKKRMKAMDQTDGDDNVIEPLRE 85

Query: 63  IRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLP 122
           IRDGYLMSL++GTPPQV+QVYMDTGSDLTWVPCGNLSFDCQDC+EYQNN+ GP+LAAFLP
Sbjct: 86  IRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLP 145

Query: 123 THSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGL 182
           THSSTSIRDTCGSSFC+DIHSSDNPFDPCTIAGCSLA+LVKGTCPRPCPSF+YTYGASG+
Sbjct: 146 THSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGV 205

Query: 183 VIGTLTKDVIFIHG---NSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF 242
           V G+LT+DV+F HG   N+ N++++IP+FCFGCVGATYREPIGIAGFGRGLLSLP QLGF
Sbjct: 206 VTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGF 265

Query: 243 SHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK-EHLKFTPFLKSPFYPNYYYIGLESI 302
           SHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSK E+L+FTP LKSP YPNYYYIGLESI
Sbjct: 266 SHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIGLESI 325

Query: 303 TIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRA 362
           TIGNG+N  RFGVS +LREIDTKGNGG+LIDSGTTYTHLPEPLYSQLISNLE +I YPRA
Sbjct: 326 TIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVIGYPRA 385

Query: 363 KEHELNTGFDLCYKVPYK-NNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNS 422
           K+ ELNTGFDLCYKVP K NN+ F D+ +LPSITFHFLNNVSVVLPQGN+FYAMAAP NS
Sbjct: 386 KQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAPINS 445

Query: 423 TVVKCLLFQSMDGDGD-------GPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAV 474
           TVVKCLL+QSMDG GD       GPAGIFGSFQQQN+EVVYDLEKERLGF+ MDC SVA 
Sbjct: 446 TVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDCVSVAA 505

BLAST of CmoCh13G004760 vs. NCBI nr
Match: XP_022927421.1 (probable aspartyl protease At4g16563 [Cucurbita moschata])

HSP 1 Score: 986.5 bits (2549), Expect = 8.1e-284
Identity = 475/475 (100.00%), Postives = 475/475 (100.00%), Query Frame = 0

Query: 1   MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREI 60
           MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREI
Sbjct: 24  MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREI 83

Query: 61  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 120
           RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT
Sbjct: 84  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 143

Query: 121 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 180
           HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV
Sbjct: 144 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 203

Query: 181 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 240
           IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG
Sbjct: 204 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 263

Query: 241 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNG 300
           FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNG
Sbjct: 264 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNG 323

Query: 301 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL 360
           ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL
Sbjct: 324 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL 383

Query: 361 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 420
           NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL
Sbjct: 384 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 443

Query: 421 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE 476
           LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE
Sbjct: 444 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE 498

BLAST of CmoCh13G004760 vs. NCBI nr
Match: KAG7019432.1 (putative aspartyl protease [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 978.0 bits (2527), Expect = 2.9e-281
Identity = 471/475 (99.16%), Postives = 471/475 (99.16%), Query Frame = 0

Query: 1   MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREI 60
           MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSL  KRIKPMEMGNDDVIEPLREI
Sbjct: 239 MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREI 298

Query: 61  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 120
           RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT
Sbjct: 299 RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 358

Query: 121 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 180
           HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV
Sbjct: 359 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 418

Query: 181 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 240
           IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG
Sbjct: 419 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 478

Query: 241 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNG 300
           FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTP LKSPFYPNYYYIGLESITIGNG
Sbjct: 479 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNG 538

Query: 301 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL 360
           ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL
Sbjct: 539 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL 598

Query: 361 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 420
           NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL
Sbjct: 599 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 658

Query: 421 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE 476
           LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFE MDCASVAVSQGLHKKE
Sbjct: 659 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHKKE 713

BLAST of CmoCh13G004760 vs. NCBI nr
Match: KAG6583807.1 (putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 977.2 bits (2525), Expect = 4.9e-281
Identity = 470/475 (98.95%), Postives = 471/475 (99.16%), Query Frame = 0

Query: 1   MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREI 60
           MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSL  KRIKPMEMGNDDVIEPLREI
Sbjct: 26  MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREI 85

Query: 61  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 120
           RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT
Sbjct: 86  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 145

Query: 121 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 180
           HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV
Sbjct: 146 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 205

Query: 181 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 240
           IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG
Sbjct: 206 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 265

Query: 241 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNG 300
           FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTP LKSPFYPNYYYIGLESITIGNG
Sbjct: 266 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNG 325

Query: 301 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL 360
           ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQ+ISNLESLISYPRAKEHEL
Sbjct: 326 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQIISNLESLISYPRAKEHEL 385

Query: 361 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 420
           NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL
Sbjct: 386 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 445

Query: 421 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE 476
           LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFE MDCASVAVSQGLHKKE
Sbjct: 446 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHKKE 500

BLAST of CmoCh13G004760 vs. NCBI nr
Match: XP_023520027.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 975.7 bits (2521), Expect = 1.4e-280
Identity = 470/475 (98.95%), Postives = 470/475 (98.95%), Query Frame = 0

Query: 1   MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREI 60
           MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSL  KRIKPMEMGNDDVIEPLREI
Sbjct: 24  MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGNDDVIEPLREI 83

Query: 61  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 120
           RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT
Sbjct: 84  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 143

Query: 121 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 180
           HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV
Sbjct: 144 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 203

Query: 181 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 240
           IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLP QLGFSHKG
Sbjct: 204 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPYQLGFSHKG 263

Query: 241 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNG 300
           FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTP LKSPFYPNYYYIGLESITIGNG
Sbjct: 264 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNG 323

Query: 301 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL 360
           ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL
Sbjct: 324 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL 383

Query: 361 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 420
           NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL
Sbjct: 384 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 443

Query: 421 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE 476
           LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFE MDCASVAVSQGLHKKE
Sbjct: 444 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEGMDCASVAVSQGLHKKE 498

BLAST of CmoCh13G004760 vs. NCBI nr
Match: XP_023000974.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 966.8 bits (2498), Expect = 6.6e-278
Identity = 466/475 (98.11%), Postives = 468/475 (98.53%), Query Frame = 0

Query: 1   MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREI 60
           MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSL  KRIKPMEMG+DDVIEPLREI
Sbjct: 22  MGQTLANPKTKFLKDSLVLGLVHSRTSLLTPKRGYNSLSRKRIKPMEMGDDDVIEPLREI 81

Query: 61  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 120
           RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT
Sbjct: 82  RDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPT 141

Query: 121 HSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLV 180
           HSSTSIR+TCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKG CPRPCPSFSYTYGASGLV
Sbjct: 142 HSSTSIRETCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGACPRPCPSFSYTYGASGLV 201

Query: 181 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 240
           IGTLTKD IFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG
Sbjct: 202 IGTLTKDAIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKG 261

Query: 241 FSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNG 300
           FSHCFLPFKFSNNP FSSPLILGNLAISSKEHLKFTP LKSPFYPNYYYIGLESITIGNG
Sbjct: 262 FSHCFLPFKFSNNPKFSSPLILGNLAISSKEHLKFTPLLKSPFYPNYYYIGLESITIGNG 321

Query: 301 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHEL 360
           ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLIS LESLISYPRAKEHEL
Sbjct: 322 ENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISILESLISYPRAKEHEL 381

Query: 361 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 420
           NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL
Sbjct: 382 NTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCL 441

Query: 421 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE 476
           LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE
Sbjct: 442 LFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHKKE 496

BLAST of CmoCh13G004760 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 595.1 bits (1533), Expect = 4.9e-170
Identity = 294/463 (63.50%), Postives = 355/463 (76.67%), Query Frame = 0

Query: 17  LVLGLVHSRTSLLTPKRGYNSLLTKRIKPMEMGNDDVIEPLREIRDGYLMSLTLGTPPQV 76
           LVL L  S  SL TPK    S   +RIK      D V+EPLRE+RDGYL++L +GTPPQ 
Sbjct: 40  LVLTLTKSSVSLPTPK----SQTQERIKKPLSSVDVVMEPLREVRDGYLITLNIGTPPQA 99

Query: 77  IQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSSTSIRDTCGSSFCI 136
           +QVY+DTGSDLTWVPCGNLSFDC +C + +NN L    + F P HSSTS RD+C SSFC+
Sbjct: 100 VQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDL-KSPSVFSPLHSSTSFRDSCASSFCV 159

Query: 137 DIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGTLTKDVIFIHGNSP 196
           +IHSSDNPFDPC +AGCS++ L+K TC RPCPSF+YTYG  GL+ G LT+D++       
Sbjct: 160 EIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDIL------K 219

Query: 197 NSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGFSHCFLPFKFSNNPNF 256
             +R +P+F FGCV +TYREPIGIAGFGRGLLSLPSQLGF  KGFSHCFLPFKF NNPN 
Sbjct: 220 ARTRDVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNI 279

Query: 257 SSPLILG--NLAISSKEHLKFTPFLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLRE 316
           SSPLILG   L+I+  + L+FTP L +P YPN YYIGLESITIG   N +   V L LR+
Sbjct: 280 SSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIGLESITIGT--NITPTQVPLTLRQ 339

Query: 317 IDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLISYPRAKEHELNTGFDLCYKVPYKN 376
            D++GNGG+L+DSGTTYTHLPEP YSQL++ L+S I+YPRA E E  TGFDLCYKVP  N
Sbjct: 340 FDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITYPRATETESRTGFDLCYKVPCPN 399

Query: 377 NTFFSDEFEL----PSITFHFLNNVSVVLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGD 436
           N   S E ++    PSITFHFLNN +++LPQGNSFYAM+APS+ +VV+CLLFQ+M+    
Sbjct: 400 NNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDY 459

Query: 437 GPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASVAVSQGLHK 474
           GPAG+FGSFQQQN++VVYDLEKER+GF+AMDC   A S GL++
Sbjct: 460 GPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCVLEAASHGLNQ 489

BLAST of CmoCh13G004760 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 216.1 bits (549), Expect = 6.2e-56
Identity = 149/439 (33.94%), Postives = 207/439 (47.15%), Query Frame = 0

Query: 64  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSS 123
           YL+SL++G+    + +Y+DTGSDL W PC    F C  C   ++  L P   + L   SS
Sbjct: 83  YLISLSVGSSSSAVSLYLDTGSDLVWFPC--RPFTCILC---ESKPLPPSPPSSL---SS 142

Query: 124 TSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTC---PRPCPSFSYTYGASGLV 183
           ++   +C S  C   HSS    D C I+ C L  +  G C     PCP F Y YG  G +
Sbjct: 143 SATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYG-DGSL 202

Query: 184 IGTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGF--SH 243
           +  L  D + +       S  +  F FGC   T  EPIG+AGFGRG LSLP+QL     H
Sbjct: 203 VAKLYSDSLSL------PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPH 262

Query: 244 KG--FSHCFLPFKF-SNNPNFSSPLILGNLA-------------------ISSKEHLKFT 303
            G  FS+C +   F S+     SPLILG                         K    FT
Sbjct: 263 LGNSFSYCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFT 322

Query: 304 PFLKSPFYPNYYYIGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPE 363
             L++P +P +Y + L+ I+IG             LR ID  G GG+++DSGTT+T LP 
Sbjct: 323 EMLENPKHPYFYSVSLQGISIGK----RNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPA 382

Query: 364 PLYSQLISNLESLIS--YPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFL-N 423
             Y+ ++   +S +   + RA   E ++G   CY   Y N T      ++P++  HF  N
Sbjct: 383 KFYNSVVEEFDSRVGRVHERADRVEPSSGMSPCY---YLNQT-----VKVPALVLHFAGN 442

Query: 424 NVSVVLPQGNSFYAMA----APSNSTVVKCLLFQSMDGDGD---GPAGIFGSFQQQNLEV 466
             SV LP+ N FY              + CL+  +   + +   G   I G++QQQ  EV
Sbjct: 443 RSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEV 494

BLAST of CmoCh13G004760 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 181.8 bits (460), Expect = 1.3e-45
Identity = 138/416 (33.17%), Postives = 193/416 (46.39%), Query Frame = 0

Query: 63  GYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKL-AAFLPTH 122
           GY +SL+ GTP Q I    DTGS L W+PC +  + C  CD    + L P L   F+P +
Sbjct: 89  GYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTS-RYLCSGCD---FSGLDPTLIPRFIPKN 148

Query: 123 SSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVI 182
           SS+S    C S  C  ++    P   C   GC   T     C   CP +   YG      
Sbjct: 149 SSSSKIIGCQSPKCQFLY---GPNVQC--RGCDPNT---RNCTVGCPPYILQYGLGSTAG 208

Query: 183 GTLTKDVIFIHGNSPNSSRKIPKFCFGCVGATYREPIGIAGFGRGLLSLPSQLGFSHKGF 242
             +T+ + F           +P F  GC   + R+P GIAGFGRG +SLPSQ+    K F
Sbjct: 209 VLITEKLDF-------PDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNL--KRF 268

Query: 243 SHCFLPFKFSNNPNFSSPLIL----GNLAISSKEHLKFTPFLKSPFYPN-----YYYIGL 302
           SHC +  +F ++ N ++ L L    G+ + S    L +TPF K+P   N     YYY+ L
Sbjct: 269 SHCLVSRRF-DDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNL 328

Query: 303 ESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESLIS- 362
             I +G         +  +     T G+GG ++DSG+T+T +  P++  +     S +S 
Sbjct: 329 RRIYVGR----KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSN 388

Query: 363 YPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMAAP 422
           Y R K+ E  TG   C+ +  K       +  +P + F F     + LP  N F  +   
Sbjct: 389 YTREKDLEKETGLGPCFNISGKG------DVTVPELIFEFKGGAKLELPLSNYFTFV--- 448

Query: 423 SNSTVVKCLLFQS----MDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCA 464
             +T   CL   S        G GPA I GSFQQQN  V YDLE +R GF    C+
Sbjct: 449 -GNTDTVCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468

BLAST of CmoCh13G004760 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 147.9 bits (372), Expect = 2.1e-35
Identity = 131/416 (31.49%), Postives = 181/416 (43.51%), Query Frame = 0

Query: 64  YLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNVLGPKLAAFLPTHSS 123
           +LM L++G P       +DTGSDL W  C      C +C +    +       F P  SS
Sbjct: 107 FLMELSIGNPAVKYSAIVDTGSDLIWTQCK----PCTECFDQPTPI-------FDPEKSS 166

Query: 124 TSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPSFSYTYGASGLVIGT 183
           +  +  C S  C  +  S+   D             K  C      + YTYG      G 
Sbjct: 167 SYSKVGCSSGLCNALPRSNCNED-------------KDAC-----EYLYTYGDYSSTRGL 226

Query: 184 L-TKDVIFIHGNSPNSSRKIPKFCFGC----VGATYREPIGIAGFGRGLLSLPSQLGFSH 243
           L T+   F   NS      I    FGC     G  + +  G+ G GRG LSL SQL    
Sbjct: 227 LATETFTFEDENS------ISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQL--KE 286

Query: 244 KGFSHCFLPFKFSNNPNFSSPLILGNLA--ISSK-------EHLKFTPFLKSPFYPNYYY 303
             FS+C    + S     SS L +G+LA  I +K       E  K    L++P  P++YY
Sbjct: 287 TKFSYCLTSIEDS---EASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYY 346

Query: 304 IGLESITIGNGENYSRFGVSLQLREIDTKGNGGILIDSGTTYTHLPEPLYSQLISNLESL 363
           + L+ IT+G      R  V     E+   G GG++IDSGTT T+L E  +  L     S 
Sbjct: 347 LELQGITVG----AKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSR 406

Query: 364 ISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSVVLPQGNSFYAMA 423
           +S P   +   +TG DLC+K+P       +    +P + FHF     + LP  N   A  
Sbjct: 407 MSLP--VDDSGSTGLDLCFKLPDA-----AKNIAVPKMIFHF-KGADLELPGENYMVA-- 461

Query: 424 APSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGFEAMDCASV 466
              +ST V CL   S +G       IFG+ QQQN  V++DLEKE + F   +C  +
Sbjct: 467 --DSSTGVLCLAMGSSNG-----MSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461

BLAST of CmoCh13G004760 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 143.3 bits (360), Expect = 5.1e-34
Identity = 123/426 (28.87%), Postives = 182/426 (42.72%), Query Frame = 0

Query: 50  NDDVIEPLREIRDGYLMSLTLGTPPQVIQVYMDTGSDLTWVPCGNLSFDCQDCDEYQNNV 109
           +  V+  L +    Y   L +GTP + + + +DTGSD+ W+ C      C+ C    + +
Sbjct: 128 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCA----PCRRCYSQSDPI 187

Query: 110 LGPKLAAFLPTHSSTSIRDTCGSSFCIDIHSSDNPFDPCTIAGCSLATLVKGTCPRPCPS 169
                  F P  S T     C S  C  + S          AGC+     + TC      
Sbjct: 188 -------FDPRKSKTYATIPCSSPHCRRLDS----------AGCNTR---RKTC-----L 247

Query: 170 FSYTYGASGLVIGTLTKDVIFIHGNSPNSSRKIPKFCFGC--------VGATYREPIGIA 229
           +  +YG     +G  + + +    N      ++     GC        VGA      G+ 
Sbjct: 248 YQVSYGDGSFTVGDFSTETLTFRRN------RVKGVALGCGHDNEGLFVGAA-----GLL 307

Query: 230 GFGRGLLSLPSQLG--FSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKEHLKFTPFL 289
           G G+G LS P Q G  F+ K FS+C +    S+ P   S ++ GN A+S     +FTP L
Sbjct: 308 GLGKGKLSFPGQTGHRFNQK-FSYCLVDRSASSKP---SSVVFGNAAVS--RIARFTPLL 367

Query: 290 KSPFYPNYYYIGLESITIGNGENYSRF-GVSLQLREIDTKGNGGILIDSGTTYTHLPEPL 349
            +P    +YY+GL  I++G     +R  GV+  L ++D  GNGG++IDSGT+ T L  P 
Sbjct: 368 SNPKLDTFYYVGLLGISVGG----TRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPA 427

Query: 350 YSQLISNLE-SLISYPRAKEHELNTGFDLCYKVPYKNNTFFSDEFELPSITFHFLNNVSV 409
           Y  +         +  RA +  L   FD C+ +   N      E ++P++  HF     V
Sbjct: 428 YIAMRDAFRVGAKTLKRAPDFSL---FDTCFDLSNMN------EVKVPTVVLHF-RGADV 485

Query: 410 VLPQGNSFYAMAAPSNSTVVKCLLFQSMDGDGDGPAGIFGSFQQQNLEVVYDLEKERLGF 464
            LP  N  Y +   +N     C  F    G       I G+ QQQ   VVYDL   R+GF
Sbjct: 488 SLPATN--YLIPVDTNGKF--CFAFAGTMGG----LSIIGNIQQQGFRVVYDLASSRVGF 485

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q940R48.7e-5533.94Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656... [more]
Q766C35.0e-3429.90Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q766C21.1e-3327.92Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q9LNJ37.2e-3328.87Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Q7XV211.4e-2526.71Aspartyl protease 37 OS=Oryza sativa subsp. japonica OX=39947 GN=AP37 PE=3 SV=2[more]
Match NameE-valueIdentityDescription
A0A6J1EHM13.9e-284100.00probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114342... [more]
A0A6J1KLG73.2e-27898.11probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111495254... [more]
A0A5A7TNC91.8e-23682.45Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A1S3CAK91.8e-23682.45aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103498305 PE=3 S... [more]
A0A0A0LYP06.8e-23682.72Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G70459... [more]
Match NameE-valueIdentityDescription
XP_022927421.18.1e-284100.00probable aspartyl protease At4g16563 [Cucurbita moschata][more]
KAG7019432.12.9e-28199.16putative aspartyl protease [Cucurbita argyrosperma subsp. argyrosperma][more]
KAG6583807.14.9e-28198.95putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia][more]
XP_023520027.11.4e-28098.95probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo][more]
XP_023000974.16.6e-27898.11probable aspartyl protease At4g16563 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT5G45120.14.9e-17063.50Eukaryotic aspartyl protease family protein [more]
AT4G16563.16.2e-5633.94Eukaryotic aspartyl protease family protein [more]
AT3G52500.11.3e-4533.17Eukaryotic aspartyl protease family protein [more]
AT2G03200.12.1e-3531.49Eukaryotic aspartyl protease family protein [more]
AT1G01300.15.1e-3428.87Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 58..261
e-value: 2.6E-30
score: 107.8
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 262..468
e-value: 1.7E-46
score: 160.2
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 63..463
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 64..264
e-value: 1.2E-28
score: 100.5
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 288..458
e-value: 2.5E-28
score: 98.9
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 26..468
NoneNo IPR availablePANTHERPTHR47967:SF47CHLOROPLAST NUCLEOID DNA-BINDING PROTEIN-LIKEcoord: 26..468
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 323..334
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 64..458
score: 30.538881
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 63..462
e-value: 4.55362E-74
score: 232.153

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh13G004760.1CmoCh13G004760.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
molecular_function GO:0004190 aspartic-type endopeptidase activity