CmUC10G185070 (gene) Watermelon (USVL531) v1

Overview
Name: CmUC10G185070
Type: gene
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Eukaryotic aspartyl protease family protein
Location: CmU531Chr10: 2990458 .. 2991903 (+)
RNA-Seq Expression: CmUC10G185070
Synteny: CmUC10G185070
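The location given in the overview can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of genome browser listing, but not stated on the page); `span_length` is a hypothetical helper name.

```python
# Hypothetical helper: length of a feature from its start and end coordinates.
# Assumption: coordinates are 1-based and inclusive.
def span_length(start: int, end: int) -> int:
    return end - start + 1


# Coordinates taken from the Location field (CmU531Chr10, + strand).
length = span_length(2990458, 2991903)

# If the whole span were coding, it would split into whole codons
# (the last one being the stop codon) with no remainder.
codons, remainder = divmod(length, 3)
print(length, codons, remainder)  # prints: 1446 482 0
```

Under that assumption the span is 1446 bp, which divides evenly into 482 codons, that is, 481 amino acids plus a stop codon.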
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCCCCTGTTTTTGTCTTCCTCCTCTGTTTTCTCCTCTCTTCCCCTGTTTTCTCCTCACAAATTCTCCTCCTACCTCTTACCCATTCCTTATCATCCTCAATATCAGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCCGCCCGCTCCTCCGCTCGATTCCACCACCGCCGCCGTGCTCACCACCGCAGCCACCTCTCTCTCCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTACATGGACACCGGCAGCGACCTCGTTTGGTTCCCCTGTTCCCCGTTTGAATGTATTCTTTGTGAAGGCAAACCAAAAATTCAATCCCCTTTGCCCAAAATCTCAAATAACAAATCAGTTTCTTGCAGCGCACCCGCCTGCTCCGCCGCCCATGGTGGCTCCCTCTCCGCCTCTCACCTCTGTGCAATTTCTCAATGTCCACTTGAATCCATTGAAATTTCTGAGTGTTCCTCTTTTTCCTGTCCCCCGTTTTATTATGCTTACGGCGATGGGAGTTTAATTGCTCGGCTCTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCGCCCTCACCGGCGATTAATGTTCGGAATTTTACTTTTGGGTGTGCCCACACGACGCTCGGCGAGCCGGTTGGGGTTGCCGGATTCGGCCGGGGGACGTTGTCGATGCCGAGTCAACTCGCCACTTTCTCACCCCAACTTGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCAACAGACCGAGTTCGCCGCCCGAGTCCGTTGATTCTCGGCCGGTACTACGGCCGCGAGACGGAGTTCATTTACACTTCATTGCTTGAGAATCCGAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGGATTTCAGTCGGGAACGTGAAGATTCCGGCGCCGGAATTTTTGAAAAAAGTGGATGAGGGTGGGAGCGGCGGCGTTGTGGTGGATTCCGGCACTACTTTCACTATGCTGCCGGCGGGATTGTATGACTCGGTGGTGGCGGAGTTTGAGAACCGGACCGGAAGAGTTGCGAACCGGGCAAGACGGATTGAAGAAAGTATCGGTTTGAGCCCTTGCTACTACTATGAGGGCTCAGTTGAAGTGCCACGTGTCGTGTTGCATTTCGTTGGGGAACAATCCAGTGTCGTGCTTCCTAGGAAGAATTATTTCTATGAGTTTTTGGACAGTGGAGATGGGGTGGGGAGGAAAAGAAAAGTTGGGTGTTTGATGCTGATGAACGGTGGAGATGAGGTTGAGCTGGCAGGTGGGCCCGGGGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTAGTTTATGATTTGGAAAAGAACCGGGTCGGGTTCGCCCGGCGGCAATGTTCCACTCTTTGGGACAGCTTGAACCGGAGTTAG

mRNA sequence

ATGGCTTCCCCTGTTTTTGTCTTCCTCCTCTGTTTTCTCCTCTCTTCCCCTGTTTTCTCCTCACAAATTCTCCTCCTACCTCTTACCCATTCCTTATCATCCTCAATATCAGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCCGCCCGCTCCTCCGCTCGATTCCACCACCGCCGCCGTGCTCACCACCGCAGCCACCTCTCTCTCCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTACATGGACACCGGCAGCGACCTCGTTTGGTTCCCCTGTTCCCCGTTTGAATGTATTCTTTGTGAAGGCAAACCAAAAATTCAATCCCCTTTGCCCAAAATCTCAAATAACAAATCAGTTTCTTGCAGCGCACCCGCCTGCTCCGCCGCCCATGGTGGCTCCCTCTCCGCCTCTCACCTCTGTGCAATTTCTCAATGTCCACTTGAATCCATTGAAATTTCTGAGTGTTCCTCTTTTTCCTGTCCCCCGTTTTATTATGCTTACGGCGATGGGAGTTTAATTGCTCGGCTCTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCGCCCTCACCGGCGATTAATGTTCGGAATTTTACTTTTGGGTGTGCCCACACGACGCTCGGCGAGCCGGTTGGGGTTGCCGGATTCGGCCGGGGGACGTTGTCGATGCCGAGTCAACTCGCCACTTTCTCACCCCAACTTGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCAACAGACCGAGTTCGCCGCCCGAGTCCGTTGATTCTCGGCCGGTACTACGGCCGCGAGACGGAGTTCATTTACACTTCATTGCTTGAGAATCCGAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGGATTTCAGTCGGGAACGTGAAGATTCCGGCGCCGGAATTTTTGAAAAAAGTGGATGAGGGTGGGAGCGGCGGCGTTGTGGTGGATTCCGGCACTACTTTCACTATGCTGCCGGCGGGATTGTATGACTCGGTGGTGGCGGAGTTTGAGAACCGGACCGGAAGAGTTGCGAACCGGGCAAGACGGATTGAAGAAAGTATCGGTTTGAGCCCTTGCTACTACTATGAGGGCTCAGTTGAAGTGCCACGTGTCGTGTTGCATTTCGTTGGGGAACAATCCAGTGTCGTGCTTCCTAGGAAGAATTATTTCTATGAGTTTTTGGACAGTGGAGATGGGGTGGGGAGGAAAAGAAAAGTTGGGTGTTTGATGCTGATGAACGGTGGAGATGAGGTTGAGCTGGCAGGTGGGCCCGGGGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTAGTTTATGATTTGGAAAAGAACCGGGTCGGGTTCGCCCGGCGGCAATGTTCCACTCTTTGGGACAGCTTGAACCGGAGTTAG

Coding sequence (CDS)

ATGGCTTCCCCTGTTTTTGTCTTCCTCCTCTGTTTTCTCCTCTCTTCCCCTGTTTTCTCCTCACAAATTCTCCTCCTACCTCTTACCCATTCCTTATCATCCTCAATATCAGATTTCAACAACACCCACAACCTCCTCAAATCCACCGCCGCCCGCTCCTCCGCTCGATTCCACCACCGCCGCCGTGCTCACCACCGCAGCCACCTCTCTCTCCCCCTCTCCCCTGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTACATGGACACCGGCAGCGACCTCGTTTGGTTCCCCTGTTCCCCGTTTGAATGTATTCTTTGTGAAGGCAAACCAAAAATTCAATCCCCTTTGCCCAAAATCTCAAATAACAAATCAGTTTCTTGCAGCGCACCCGCCTGCTCCGCCGCCCATGGTGGCTCCCTCTCCGCCTCTCACCTCTGTGCAATTTCTCAATGTCCACTTGAATCCATTGAAATTTCTGAGTGTTCCTCTTTTTCCTGTCCCCCGTTTTATTATGCTTACGGCGATGGGAGTTTAATTGCTCGGCTCTATAGAGATAGCCTCAGTTTGCCGGCGCCGGCGCCCTCACCGGCGATTAATGTTCGGAATTTTACTTTTGGGTGTGCCCACACGACGCTCGGCGAGCCGGTTGGGGTTGCCGGATTCGGCCGGGGGACGTTGTCGATGCCGAGTCAACTCGCCACTTTCTCACCCCAACTTGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCAACAGACCGAGTTCGCCGCCCGAGTCCGTTGATTCTCGGCCGGTACTACGGCCGCGAGACGGAGTTCATTTACACTTCATTGCTTGAGAATCCGAAGCATCCTTATTTTTACTCGGTTGGGTTGGCCGGGATTTCAGTCGGGAACGTGAAGATTCCGGCGCCGGAATTTTTGAAAAAAGTGGATGAGGGTGGGAGCGGCGGCGTTGTGGTGGATTCCGGCACTACTTTCACTATGCTGCCGGCGGGATTGTATGACTCGGTGGTGGCGGAGTTTGAGAACCGGACCGGAAGAGTTGCGAACCGGGCAAGACGGATTGAAGAAAGTATCGGTTTGAGCCCTTGCTACTACTATGAGGGCTCAGTTGAAGTGCCACGTGTCGTGTTGCATTTCGTTGGGGAACAATCCAGTGTCGTGCTTCCTAGGAAGAATTATTTCTATGAGTTTTTGGACAGTGGAGATGGGGTGGGGAGGAAAAGAAAAGTTGGGTGTTTGATGCTGATGAACGGTGGAGATGAGGTTGAGCTGGCAGGTGGGCCCGGGGCCACGCTTGGGAACTACCAACAACAGGGTTTTGAGGTAGTTTATGATTTGGAAAAGAACCGGGTCGGGTTCGCCCGGCGGCAATGTTCCACTCTTTGGGACAGCTTGAACCGGAGTTAG

Protein sequence

MASPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHRRRAHHRSHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYFYSVGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRVANRARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVGRKRKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
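The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch using the standard genetic code and plain Python; the function name `translate` and the choice to stop at the first stop codon are illustrative assumptions, not how the database produced its protein sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGCTTCCCCTGTTTTTGTC"))  # prints: MASPVFV
# Last three codons of the CDS listed above (the final one is a stop).
print(translate("CGGAGTTAG"))  # prints: RS
```

The two fragments reproduce the start (MASPVFV) and the end (RS) of the protein sequence shown above; passing the full CDS string to the same function is the natural next check.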
Homology
BLAST of CmUC10G185070 vs. NCBI nr
Match: XP_038905814.1 (probable aspartyl protease At4g16563 [Benincasa hispida])

HSP 1 Score: 923.3 bits (2385), Expect = 8.5e-265
Identity = 453/481 (94.18%), Positives = 464/481 (96.47%), Query Frame = 0

Query: 1   MASPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHR 60
           MAS VFV LLCFLLSSPVFSSQ+LLLPL+HSLSSSISDFNNTHNLLKSTAARSSARFHHR
Sbjct: 1   MASSVFVLLLCFLLSSPVFSSQLLLLPLSHSLSSSISDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRAHHRSHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RR  H +HLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTQHHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 IQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYY 180
           +QSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYY
Sbjct: 121 VQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYY 180

Query: 181 AYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQ 240
           AYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHT LGEPVGVAGFGRGTLSMPSQ
Sbjct: 181 AYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTALGEPVGVAGFGRGTLSMPSQ 240

Query: 241 LATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYFYS 300
           LATFSPQLGNRFSYCLVSHSFA +RVRRPSPLILGRYYG ETEFIYTSLLENPKHPYFYS
Sbjct: 241 LATFSPQLGNRFSYCLVSHSFAAERVRRPSPLILGRYYGGETEFIYTSLLENPKHPYFYS 300

Query: 301 VGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRV 360
           VGL GISVGN+ IPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVA FENRTGRV
Sbjct: 301 VGLTGISVGNMMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAAFENRTGRV 360

Query: 361 ANRARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVGRK 420
           ANRARRIEE+ GLSPCYYYE SVEVPRVVLHFVGE+SSV+LP+KNYFYEFLD GDGVG+K
Sbjct: 361 ANRARRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVLLPKKNYFYEFLDGGDGVGKK 420

Query: 421 RKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNR 480
           RKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDL KNRVGFARRQCSTLWDSLNR
Sbjct: 421 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLAKNRVGFARRQCSTLWDSLNR 480

Query: 481 S 482
           S
Sbjct: 481 S 481
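The percentages in each HSP header are simply the identical (or positive-scoring) column count divided by the alignment length. The sketch below recomputes them for the hit above; `percent` is a hypothetical helper, and the two-decimal formatting is chosen to match the report.

```python
# Hypothetical helper: express a count as a percentage of the alignment
# length, formatted to two decimals as in the BLAST report.
def percent(count: int, alignment_length: int) -> str:
    return f"{100 * count / alignment_length:.2f}%"


# Counts from the first NCBI nr hit (XP_038905814.1).
print(percent(453, 481))  # identity, prints: 94.18%
print(percent(464, 481))  # positives, prints: 96.47%
```

The same call applies to any other hit on this page, for example `percent(297, 478)` for the Swiss-Prot match Q940R4.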

BLAST of CmUC10G185070 vs. NCBI nr
Match: XP_023007805.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 899.8 bits (2324), Expect = 1.0e-257
Identity = 444/481 (92.31%), Positives = 460/481 (95.63%), Query Frame = 0

Query: 1   MASPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHR 60
           MASPVF+FLLCFLL SPVFSSQILLLPL++SLSSS SDFNNTHNLLKSTAARSSARFHHR
Sbjct: 1   MASPVFLFLLCFLLPSPVFSSQILLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRAHHRSHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RR HHRSHLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 IQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYY 180
           IQSPLPKISN KSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQ 240
           AYGDGSLI RLYRDSLSLPAPAPSPAINVRNFTFGCAH+ LGEP+GVAGFGRG LSMP Q
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPIQ 240

Query: 241 LATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYFYS 300
           LATFSPQLGNRFSYCLVSHSFA DRVRRPSPLILGRYYG ETEFIYTS+LENPKHPYFYS
Sbjct: 241 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYS 300

Query: 301 VGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRV 360
           VGLAGISVG+V IPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLY+SVVA+FENRTGRV
Sbjct: 301 VGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRV 360

Query: 361 ANRARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVGRK 420
           A+RA +IEE+ GLSPCYYYE SVEVPRVVLHFVGE+SSV+LPRKNYFYEFLD GDGVGRK
Sbjct: 361 ASRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRK 420

Query: 421 RKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNR 480
            KVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNR
Sbjct: 421 IKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLNR 480

Query: 481 S 482
           S
Sbjct: 481 S 480

BLAST of CmUC10G185070 vs. NCBI nr
Match: XP_023553227.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 899.4 bits (2323), Expect = 1.3e-257
Identity = 445/483 (92.13%), Positives = 463/483 (95.86%), Query Frame = 0

Query: 1   MASPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHR 60
           MASPVF+FLLCFLLSSPVFSSQ+LLLPL++SLSSS SDFNNTHNLLKSTAARSSARFHHR
Sbjct: 1   MASPVFLFLLCFLLSSPVFSSQLLLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRAHHRSHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RR HHRSHLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 IQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYY 180
           IQSPLPKI++ KSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKIADKKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIARLYRDSLSL--PAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMP 240
           AYGDGSLI RLYRDSLSL  PAPAPSPAINVRNFTFGCAH+ LGEP+GVAGFGRG LSMP
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMP 240

Query: 241 SQLATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYF 300
           SQLATFSPQLGNRFSYCLVSHSFA DRVRRPSPLILGRYYG ETEFIYTSLLENPKHPYF
Sbjct: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSLLENPKHPYF 300

Query: 301 YSVGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTG 360
           YSVGLAGISVG+V+IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLY+SVVA+FENRTG
Sbjct: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTG 360

Query: 361 RVANRARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVG 420
           RVA+RA RIEE+ GLSPCYYYE SVEVPRVVLHFVGE+SSVVLPRKNYFYEFLD GDGV 
Sbjct: 361 RVASRASRIEENTGLSPCYYYENSVEVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVE 420

Query: 421 RKRKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 480
           RKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSL
Sbjct: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 480

Query: 481 NRS 482
           NRS
Sbjct: 481 NRS 482

BLAST of CmUC10G185070 vs. NCBI nr
Match: KAG6577689.1 (putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 897.5 bits (2318), Expect = 5.0e-257
Identity = 444/483 (91.93%), Positives = 462/483 (95.65%), Query Frame = 0

Query: 1   MASPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHR 60
           MASPVF+FLLCFL+SSPVFSSQ+LLLPL++SLSSS SDFNNTHNLLKSTAARSSARFHHR
Sbjct: 1   MASPVFLFLLCFLISSPVFSSQLLLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRAHHRSHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RR HHRSHLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 IQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYY 180
           IQSPLPKISN KSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIARLYRDSLSL--PAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMP 240
           AYGDGSLI RLYRDSLSL  PAPAPSPAINVRNFTFGCAH+ LGEP+GVAGFGRG LSMP
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMP 240

Query: 241 SQLATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYF 300
           SQLATFSPQLGNRFSYCLVSHSFA DRVRRPSPLILGRYYG ETEFIYTS+LENPKHPYF
Sbjct: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYF 300

Query: 301 YSVGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTG 360
           YSVGLAGISVG+V+IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLY+SVVA+FENRTG
Sbjct: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTG 360

Query: 361 RVANRARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVG 420
           RVA+RA RIEE+ GLSPCY YE SVEVPRVVLHFVGE+SSV LPRKNYFYEFLD GDGVG
Sbjct: 361 RVASRASRIEENTGLSPCYSYEKSVEVPRVVLHFVGEKSSVELPRKNYFYEFLDGGDGVG 420

Query: 421 RKRKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 480
           RKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSL
Sbjct: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 480

Query: 481 NRS 482
           NRS
Sbjct: 481 NRS 482

BLAST of CmUC10G185070 vs. NCBI nr
Match: XP_022923540.1 (probable aspartyl protease At4g16563 [Cucurbita moschata])

HSP 1 Score: 896.7 bits (2316), Expect = 8.5e-257
Identity = 444/483 (91.93%), Positives = 461/483 (95.45%), Query Frame = 0

Query: 1   MASPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHR 60
           MASPVF+FLLCFL SSPVFSSQ+LLLPL++SLSSS SDFNNTHNLLKSTAARSSARFHHR
Sbjct: 1   MASPVFLFLLCFLFSSPVFSSQLLLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRAHHRSHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RR HHRSHLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 IQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYY 180
           IQSPLPKISN KSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIARLYRDSLSL--PAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMP 240
           AYGDGSLI RLYRDSLSL  PAPAPSPAINVRNFTFGCAH+ LGEP+GVAGFGRG LSMP
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMP 240

Query: 241 SQLATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYF 300
           SQLATFSPQLGNRFSYCLVSHSFA DRVRRPSPLILGRYYG ETEFIYTS+LENPKHPYF
Sbjct: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYF 300

Query: 301 YSVGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTG 360
           YSVGLAGISVG+V+IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLY+SVVA+FENRTG
Sbjct: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTG 360

Query: 361 RVANRARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVG 420
           RVA+RA RIEE+ GLSPCY YE SVEVPRVVLHFVGE+SSV LPRKNYFYEFLD GDGVG
Sbjct: 361 RVASRASRIEENTGLSPCYSYEKSVEVPRVVLHFVGEKSSVELPRKNYFYEFLDGGDGVG 420

Query: 421 RKRKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 480
           RKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSL
Sbjct: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 480

Query: 481 NRS 482
           NRS
Sbjct: 481 NRS 482

BLAST of CmUC10G185070 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 590.5 bits (1521), Expect = 1.7e-167
Identity = 297/478 (62.13%), Positives = 356/478 (74.48%), Query Frame = 0

Query: 24  LLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHRRRAHHRSHLSLPLSPGGDYTLSF 83
           LLL L+HSLS+S    +  H LLKS+++RSSARF        +  LSLP+S G DY +S 
Sbjct: 29  LLLHLSHSLSTSKHSSSPLH-LLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISL 88

Query: 84  NLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNN-KSVSCSAPACS 143
           ++GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   +S++  +VSCS+P+CS
Sbjct: 89  SVGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCS 148

Query: 144 AAHGGSLSASHLCAISQCPLESIEISEC--SSFSCPPFYYAYGDGSLIARLYRDSLSLPA 203
           AAH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+A+LY DSLSL  
Sbjct: 149 AAH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSL-- 208

Query: 204 PAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHS 263
               P+++V NFTFGCAHTTL EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHS
Sbjct: 209 ----PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHS 268

Query: 264 FATDRVRRPSPLILGRYYG--------------------RETEFIYTSLLENPKHPYFYS 323
           F +DRVRRPSPLILGR+                      ++ EF++T +LENPKHPYFYS
Sbjct: 269 FDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYS 328

Query: 324 VGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRV 383
           V L GIS+G   IPAP  L+++D+ G GGVVVDSGTTFTMLPA  Y+SVV EF++R GRV
Sbjct: 329 VSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRV 388

Query: 384 ANRARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVGRK 443
             RA R+E S G+SPCYY   +V+VP +VLHF G +SSV LPR+NYFYEF+D GDG   K
Sbjct: 389 HERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEK 448

Query: 444 RKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 479
           RK+GCLMLMNGGDE EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWDSL
Sbjct: 449 RKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of CmUC10G185070 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 4.5e-35
Identity = 124/402 (30.85%), Positives = 173/402 (43.03%), Query Frame = 0

Query: 77  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSC 136
           G+Y     +G+ + ++ L +DTGSD+ W  C P  C  C  +          S  KS++C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP--CADCYQQSDPVFNPTSSSTYKSLTC 219

Query: 137 SAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAYGDGSL-IARLYRDS 196
           SAP CS                      +E S C S  C  +  +YGDGS  +  L  D+
Sbjct: 220 SAPQCSL---------------------LETSACRSNKC-LYQVSYGDGSFTVGELATDT 279

Query: 197 LSLPAPAPSPAINVRNFTFGCAHTTLG---EPVGVAGFGRGTLSMPSQLATFSPQLGNRF 256
           ++        +  + N   GC H   G      G+ G G G LS+ +Q+   S      F
Sbjct: 280 VTF-----GNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATS------F 339

Query: 257 SYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYFYSVGLAGISVGNVK 316
           SYCLV            + + LG   G  T      LL N K   FY VGL+G SVG  K
Sbjct: 340 SYCLVDRDSGKSSSLDFNSVQLGG--GDAT----APLLRNKKIDTFYYVGLSGFSVGGEK 399

Query: 317 IPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRVANRARRIEESIG 376
           +  P+ +  VD  GSGGV++D GT  T L    Y+S+   F   T  +    ++   SI 
Sbjct: 400 VVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNL----KKGSSSIS 459

Query: 377 L-SPCYYYE--GSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVGRKRKVGCLMLM 436
           L   CY +    +V+VP V  HF G + S+ LP KNY     DSG          C    
Sbjct: 460 LFDTCYDFSSLSTVKVPTVAFHFTGGK-SLDLPAKNYLIPVDDSG--------TFCFAFA 500

Query: 437 NGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 472
                + +       +GN QQQG  + YDL KN +G +  +C
Sbjct: 520 PTSSSLSI-------IGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmUC10G185070 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 3.8e-34
Identity = 134/426 (31.46%), Positives = 185/426 (43.43%), Query Frame = 0

Query: 58  HHRRRAHHRSHLSLPLSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCE 117
           H  R     S +   LS G G+Y     +G+ +  + + +DTGSD+VW  C+P  C  C 
Sbjct: 120 HAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAP--CRRCY 179

Query: 118 GKPKIQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCP 177
            +        K     ++ CS+P C        +      + Q                 
Sbjct: 180 SQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQ----------------- 239

Query: 178 PFYYAYGDGSL-IARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVA---GFGR 237
               +YGDGS  +     ++L+           V+    GC H   G  VG A   G G+
Sbjct: 240 ---VSYGDGSFTVGDFSTETLTFRRN------RVKGVALGCGHDNEGLFVGAAGLLGLGK 299

Query: 238 GTLSMPSQLATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGR-YYGRETEFIYTSLLE 297
           G LS P Q      +   +FSYCLV  S ++    +PS ++ G     R   F  T LL 
Sbjct: 300 GKLSFPGQT---GHRFNQKFSYCLVDRSASS----KPSSVVFGNAAVSRIARF--TPLLS 359

Query: 298 NPKHPYFYSVGLAGISVGNVKIP-APEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVV 357
           NPK   FY VGL GISVG  ++P     L K+D+ G+GGV++DSGT+ T L    Y ++ 
Sbjct: 360 NPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMR 419

Query: 358 AEFENRTGRVANRARRIEESIGLSPCYYYE--GSVEVPRVVLHFVGEQSSVVLPRKNYFY 417
             F  R G  A   +R  +      C+       V+VP VVLHF G  + V LP  NY  
Sbjct: 420 DAF--RVG--AKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRG--ADVSLPATNYLI 479

Query: 418 EFLDSGDGVGRKRKVGCLMLMNGGDEVELAGGPG--ATLGNYQQQGFEVVYDLEKNRVGF 473
             +D+                NG      AG  G  + +GN QQQGF VVYDL  +RVGF
Sbjct: 480 P-VDT----------------NGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGF 485

BLAST of CmUC10G185070 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 139.0 bits (349), Expect = 1.4e-31
Identity = 156/531 (29.38%), Positives = 215/531 (40.49%), Query Frame = 0

Query: 1   MASPVFVFLLCFLLSSPVFSS---------QILLLPLTHSLSSSISDFNNTHNLLKSTAA 60
           M  P+F F L   L     SS          +L  PLT  +++++ DFNNTH     +++
Sbjct: 1   MLLPLFFFFLHLHLHLSSSSSISFPDFQIIDVLQPPLT--VTATLPDFNNTH-FSDESSS 60

Query: 61  RSSARFHHR--------RRAHHRSHLS-------------------LPLSP--------- 120
           + + R  HR        R  HHR H                     +P S          
Sbjct: 61  KYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFG 120

Query: 121 ----------GGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPL 180
                      G+Y +   +GS      + +D+GSD+VW  C P  C LC          
Sbjct: 121 SDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQP--CKLC---------- 180

Query: 181 PKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAYGDG 240
                 +S     PA S ++ G    S +C       + IE S C S  C  +   YGDG
Sbjct: 181 ----YKQSDPVFDPAKSGSYTGVSCGSSVC-------DRIENSGCHSGGC-RYEVMYGDG 240

Query: 241 SLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVA---GFGRGTLSMPSQLA 300
           S      + +L+L     +  + VRN   GC H   G  +G A   G G G++S   QL 
Sbjct: 241 SYT----KGTLALETLTFAKTV-VRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQL- 300

Query: 301 TFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYFYSVG 360
             S Q G  F YCLVS    TD       L+ GR         +  L+ NP+ P FY VG
Sbjct: 301 --SGQTGGAFGYCLVSR--GTDST---GSLVFGR-EALPVGASWVPLVRNPRAPSFYYVG 360

Query: 361 LAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRVAN 420
           L G+ VG V+IP P+ +  + E G GGVV+D+GT  T LP   Y +    F+++T   AN
Sbjct: 361 LKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQT---AN 420

Query: 421 RARRIEESIGLSPCYYYEG--SVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVGRK 472
             R    SI    CY   G  SV VP V  +F  E   + LP +N+     DSG      
Sbjct: 421 LPRASGVSI-FDTCYDLSGFVSVRVPTVSFYFT-EGPVLTLPARNFLMPVDDSG------ 470

BLAST of CmUC10G185070 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 138.3 bits (347), Expect = 2.3e-31
Identity = 125/445 (28.09%), Positives = 186/445 (41.80%), Query Frame = 0

Query: 43  HNLLKSTAARSSARFHH-RRRAHHRSHLSLPLSPG-GDYTLSFNLGSESHKISLYMDTGS 102
           + L+K    R   R           S +  P+  G G+Y ++  +G+     S  MDTGS
Sbjct: 58  YELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGS 117

Query: 103 DLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQC 162
           DL+W  C P  C  C        P P  +   S S S   C + +   L           
Sbjct: 118 DLIWTQCEP--CTQC-----FSQPTPIFNPQDSSSFSTLPCESQYCQDL----------- 177

Query: 163 PLESIEISECSSFSCPPFYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTT 222
           P E+   +EC       + Y YGDGS   + Y  + +      S    V N  FGC    
Sbjct: 178 PSETCNNNECQ------YTYGYGDGS-TTQGYMATETFTFETSS----VPNIAFGCGEDN 237

Query: 223 ----LGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGR 282
                G   G+ G G G LS+PSQL         +FSYC+ S+  ++     PS L LG 
Sbjct: 238 QGFGQGNGAGLIGMGWGPLSLPSQLGV------GQFSYCMTSYGSSS-----PSTLALGS 297

Query: 283 YYGRETE-FIYTSLLENPKHPYFYSVGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSG 342
                 E    T+L+ +  +P +Y + L GI+VG   +  P    ++ + G+GG+++DSG
Sbjct: 298 AASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSG 357

Query: 343 TTFTMLPAGLYDSVVAEFENRTGRVANRARRIEESIGLSPCYYY--EGS-VEVPRVVLHF 402
           TT T LP   Y++V   F ++     N     E S GLS C+    +GS V+VP + + F
Sbjct: 358 TTLTYLPQDAYNAVAQAFTDQ----INLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQF 417

Query: 403 ------VGEQSSVVLPRKNYFYEFLDSGDGVGRKRKVGCLMLMNGGDEVELAGGPGATLG 462
                 +GEQ+ ++ P +      + S   +G                        +  G
Sbjct: 418 DGGVLNLGEQNILISPAEGVICLAMGSSSQLGI-----------------------SIFG 435

Query: 463 NYQQQGFEVVYDLEKNRVGFARRQC 472
           N QQQ  +V+YDL+   V F   QC
Sbjct: 478 NIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CmUC10G185070 vs. ExPASy TrEMBL
Match: A0A6J1L3Z9 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111500303 PE=3 SV=1)

HSP 1 Score: 899.8 bits (2324), Expect = 4.9e-258
Identity = 444/481 (92.31%), Positives = 460/481 (95.63%), Query Frame = 0

Query: 1   MASPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHR 60
           MASPVF+FLLCFLL SPVFSSQILLLPL++SLSSS SDFNNTHNLLKSTAARSSARFHHR
Sbjct: 1   MASPVFLFLLCFLLPSPVFSSQILLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRAHHRSHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RR HHRSHLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 IQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYY 180
           IQSPLPKISN KSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQ 240
           AYGDGSLI RLYRDSLSLPAPAPSPAINVRNFTFGCAH+ LGEP+GVAGFGRG LSMP Q
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMPIQ 240

Query: 241 LATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYFYS 300
           LATFSPQLGNRFSYCLVSHSFA DRVRRPSPLILGRYYG ETEFIYTS+LENPKHPYFYS
Sbjct: 241 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYFYS 300

Query: 301 VGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRV 360
           VGLAGISVG+V IPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLY+SVVA+FENRTGRV
Sbjct: 301 VGLAGISVGSVMIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTGRV 360

Query: 361 ANRARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVGRK 420
           A+RA +IEE+ GLSPCYYYE SVEVPRVVLHFVGE+SSV+LPRKNYFYEFLD GDGVGRK
Sbjct: 361 ASRASQIEENTGLSPCYYYEKSVEVPRVVLHFVGEKSSVMLPRKNYFYEFLDGGDGVGRK 420

Query: 421 RKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNR 480
            KVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNR
Sbjct: 421 IKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSLNR 480

Query: 481 S 482
           S
Sbjct: 481 S 480

BLAST of CmUC10G185070 vs. ExPASy TrEMBL
Match: A0A0A0L5I7 (Pepsin A OS=Cucumis sativus OX=3659 GN=Csa_3G020060 PE=3 SV=1)

HSP 1 Score: 896.7 bits (2316), Expect = 4.1e-257
Identity = 443/480 (92.29%), Positives = 457/480 (95.21%), Query Frame = 0

Query: 3   SPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHRRR 62
           SPVF+FLLCFLLSSPVFSSQI LLPL+HSLSSSISDFNNTHNLLKSTA RSSARFH    
Sbjct: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNNTHNLLKSTATRSSARFHR--- 63

Query: 63  AHHRSHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 122
            H  +HLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ
Sbjct: 64  -HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123

Query: 123 SPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAY 182
           SPLPKI+NNKSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIEISECSSFSCPPFYYAY
Sbjct: 124 SPLPKIANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183

Query: 183 GDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQLA 242
           GDGSL+ARLYRDSLSLP PAPSP INVRNFTFGCAHTTLGEPVGVAGFGRG LSMPSQLA
Sbjct: 184 GDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLA 243

Query: 243 TFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYFYSVG 302
           TFSPQLGNRFSYCLVSHSFA DRVRRPSPLILGRYY  ETEFIYTSLLENPKHPYFYSVG
Sbjct: 244 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVG 303

Query: 303 LAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRVAN 362
           LAGISVGN++IPAPEFL KVDEGGSGGVVVDSGTTFTMLPAGLY+SVVAEFENRTG+VAN
Sbjct: 304 LAGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVAN 363

Query: 363 RARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDG-VGRKR 422
           RARRIEE+ GLSPCYYYE SV VPRVVLHFVGE+S+VVLPRKNYFYEFLD GDG VGRKR
Sbjct: 364 RARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKR 423

Query: 423 KVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS 482
           KVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+LNRS
Sbjct: 424 KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS 479

BLAST of CmUC10G185070 vs. ExPASy TrEMBL
Match: A0A6J1EC44 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111431201 PE=3 SV=1)

HSP 1 Score: 896.7 bits (2316), Expect = 4.1e-257
Identity = 444/483 (91.93%), Positives = 461/483 (95.45%), Query Frame = 0

Query: 1   MASPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHR 60
           MASPVF+FLLCFL SSPVFSSQ+LLLPL++SLSSS SDFNNTHNLLKSTAARSSARFHHR
Sbjct: 1   MASPVFLFLLCFLFSSPVFSSQLLLLPLSNSLSSS-SDFNNTHNLLKSTAARSSARFHHR 60

Query: 61  RRAHHRSHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPK 120
           RR HHRSHLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDLVWFPCSPFECILCEGKPK
Sbjct: 61  RRTHHRSHLSLPLSPGGDYTLSFNLGSESQKISLYMDTGSDLVWFPCSPFECILCEGKPK 120

Query: 121 IQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYY 180
           IQSPLPKISN KSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIE+SECSSFSCPPFYY
Sbjct: 121 IQSPLPKISNQKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEVSECSSFSCPPFYY 180

Query: 181 AYGDGSLIARLYRDSLSL--PAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMP 240
           AYGDGSLI RLYRDSLSL  PAPAPSPAINVRNFTFGCAH+ LGEP+GVAGFGRG LSMP
Sbjct: 181 AYGDGSLIGRLYRDSLSLPAPAPAPSPAINVRNFTFGCAHSALGEPIGVAGFGRGLLSMP 240

Query: 241 SQLATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYF 300
           SQLATFSPQLGNRFSYCLVSHSFA DRVRRPSPLILGRYYG ETEFIYTS+LENPKHPYF
Sbjct: 241 SQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYGSETEFIYTSMLENPKHPYF 300

Query: 301 YSVGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTG 360
           YSVGLAGISVG+V+IPAPEFLK+VDEGGSGGVVVDSGTTFTMLPAGLY+SVVA+FENRTG
Sbjct: 301 YSVGLAGISVGSVRIPAPEFLKRVDEGGSGGVVVDSGTTFTMLPAGLYNSVVAQFENRTG 360

Query: 361 RVANRARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVG 420
           RVA+RA RIEE+ GLSPCY YE SVEVPRVVLHFVGE+SSV LPRKNYFYEFLD GDGVG
Sbjct: 361 RVASRASRIEENTGLSPCYSYEKSVEVPRVVLHFVGEKSSVELPRKNYFYEFLDGGDGVG 420

Query: 421 RKRKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 480
           RKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSL
Sbjct: 421 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVAYDLENNRVGFARRQCSTLWDSL 480

Query: 481 NRS 482
           NRS
Sbjct: 481 NRS 482

BLAST of CmUC10G185070 vs. ExPASy TrEMBL
Match: A0A1S3BK28 (aspartic proteinase nepenthesin-1 OS=Cucumis melo OX=3656 GN=LOC103490888 PE=3 SV=1)

HSP 1 Score: 892.5 bits (2305), Expect = 7.8e-256
Identity = 442/482 (91.70%), Positives = 457/482 (94.81%), Query Frame = 0

Query: 3   SPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHRRR 62
           SPVF+FLLCFLLSSPVFSSQI LLPL+HSLSSSISDFN+THNLLKSTA RSSARFH    
Sbjct: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR--- 63

Query: 63  AHHRSHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 122
            H  +HLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ
Sbjct: 64  -HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123

Query: 123 SPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAY 182
           SPLPKISNNKSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIEISECSSFSCPPFYYAY
Sbjct: 124 SPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183

Query: 183 GDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQLA 242
           GDGSL+ARLYRDSLSLP PAPSP INVRNFTFGCAHTTLGEPVGVAGFGRG LSMPSQLA
Sbjct: 184 GDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLA 243

Query: 243 TFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYFYSVG 302
           TFSPQLGNRFSYCLVSHSFA DRVRRPSPLILGRY+  ETEFIYTSLLENPKHPYFYSVG
Sbjct: 244 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVG 303

Query: 303 LAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRVAN 362
           LAGISVGNV+IPAPEFL+KVDE GSGGVVVDSGTTFTMLP+GLY+SVVAEFENRTG+VAN
Sbjct: 304 LAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVAN 363

Query: 363 RARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDG---VGR 422
           RARRIEE+ GLSPCYYYE SV VPRVVLHFVGE+SSVVLPRKNYFYEFLD GDG   VGR
Sbjct: 364 RARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGR 423

Query: 423 KRKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLN 482
           KRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+LN
Sbjct: 424 KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLN 481

BLAST of CmUC10G185070 vs. ExPASy TrEMBL
Match: A0A5D3CP11 (Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1017G00280 PE=3 SV=1)

HSP 1 Score: 887.9 bits (2293), Expect = 1.9e-254
Identity = 442/484 (91.32%), Positives = 458/484 (94.63%), Query Frame = 0

Query: 3   SPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHRRR 62
           SPVF+FLLCFLLSSPVFSSQI LLPL+HSLSSSISDFN+THNLLKSTA RSSARFH    
Sbjct: 4   SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR--- 63

Query: 63  AHHRSHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 122
            H  +HLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ
Sbjct: 64  -HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQ 123

Query: 123 SPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAY 182
           SPLPKISNNKSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIEISECSSFSCPPFYYAY
Sbjct: 124 SPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAY 183

Query: 183 GDGSLIARLYRDSLSL--PAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQ 242
           GDGSL+ARLYRDSLSL  PAPAPSP INVRNFTFGCAHTTLGEPVGVAGFGRG LSMPSQ
Sbjct: 184 GDGSLVARLYRDSLSLPTPAPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQ 243

Query: 243 LATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYFYS 302
           LATFSPQLGNRFSYCLVSHSFA DRVRRPSPLILGRY+  ETEFIYTSLLENPKHPYFYS
Sbjct: 244 LATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYS 303

Query: 303 VGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRV 362
           VGLAGISVGNV+IPAPEFL+KVDE GSGGVVVDSGTTFTMLP+GLY+SVVAEFENRTG+V
Sbjct: 304 VGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKV 363

Query: 363 ANRARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDG---V 422
           ANRARRIEE+ GLSPCYYY+ SV VPRVVLHFVGE+SSVVLPRKNYFYEFLD GDG   V
Sbjct: 364 ANRARRIEENTGLSPCYYYQNSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEV 423

Query: 423 GRKRKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDS 482
           GRKRKVGCLMLMNGGDE ELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+
Sbjct: 424 GRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDN 483

BLAST of CmUC10G185070 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 590.5 bits (1521), Expect = 1.2e-168
Identity = 297/478 (62.13%), Positives = 356/478 (74.48%), Query Frame = 0

Query: 24  LLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHRRRAHHRSHLSLPLSPGGDYTLSF 83
           LLL L+HSLS+S    +  H LLKS+++RSSARF        +  LSLP+S G DY +S 
Sbjct: 29  LLLHLSHSLSTSKHSSSPLH-LLKSSSSRSSARFRRHHHKQQQQQLSLPISSGSDYLISL 88

Query: 84  NLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNN-KSVSCSAPACS 143
           ++GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP   +S++  +VSCS+P+CS
Sbjct: 89  SVGSSSSAVSLYLDTGSDLVWFPCRPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCS 148

Query: 144 AAHGGSLSASHLCAISQCPLESIEISEC--SSFSCPPFYYAYGDGSLIARLYRDSLSLPA 203
           AAH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSL+A+LY DSLSL  
Sbjct: 149 AAH-SSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSL-- 208

Query: 204 PAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHS 263
               P+++V NFTFGCAHTTL EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHS
Sbjct: 209 ----PSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHS 268

Query: 264 FATDRVRRPSPLILGRYYG--------------------RETEFIYTSLLENPKHPYFYS 323
           F +DRVRRPSPLILGR+                      ++ EF++T +LENPKHPYFYS
Sbjct: 269 FDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYS 328

Query: 324 VGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRV 383
           V L GIS+G   IPAP  L+++D+ G GGVVVDSGTTFTMLPA  Y+SVV EF++R GRV
Sbjct: 329 VSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRV 388

Query: 384 ANRARRIEESIGLSPCYYYEGSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVGRK 443
             RA R+E S G+SPCYY   +V+VP +VLHF G +SSV LPR+NYFYEF+D GDG   K
Sbjct: 389 HERADRVEPSSGMSPCYYLNQTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEK 448

Query: 444 RKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL 479
           RK+GCLMLMNGGDE EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWDSL
Sbjct: 449 RKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWDSL 498

BLAST of CmUC10G185070 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 193.4 bits (490), Expect = 4.3e-49
Identity = 158/495 (31.92%), Positives = 229/495 (46.26%), Query Frame = 0

Query: 5   VFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHRRRAH 64
           +F+FLL  LL +    +Q        S SSS      T + +     +S  +   ++   
Sbjct: 8   LFLFLLITLLLNTTNKTQARQHKNPSSSSSSFLVLTLTKSSVSLPTPKSQTQERIKKPLS 67

Query: 65  HRSHLSLPLSPGGD-YTLSFNLGSESHKISLYMDTGSDLVWFPCS--PFECILCEG---- 124
               +  PL    D Y ++ N+G+    + +Y+DTGSDL W PC    F+CI C      
Sbjct: 68  SVDVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNN 127

Query: 125 ---KPKIQSPLPKISNNKSVSCSAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFS 184
               P + SPL   ++ +  SC++  C   H  S +    CA++ C +  +  S C    
Sbjct: 128 DLKSPSVFSPLHSSTSFRD-SCASSFCVEIH-SSDNPFDPCAVAGCSVSMLLKSTCVR-P 187

Query: 185 CPPFYYAYGDGSLIAR-LYRDSLSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVAGFGRG 244
           CP F Y YG+G LI+  L RD L       +   +V  F+FGC  +T  EP+G+AGFGRG
Sbjct: 188 CPSFAYTYGEGGLISGILTRDILK------ARTRDVPRFSFGCVTSTYREPIGIAGFGRG 247

Query: 245 TLSMPSQLATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGR---YYGRETEFIYTSLL 304
            LS+PSQL      L   FS+C +   F  +     SPLILG             +T +L
Sbjct: 248 LLSLPSQLGF----LEKGFSHCFLPFKF-VNNPNISSPLILGASALSINLTDSLQFTPML 307

Query: 305 ENPKHPYFYSVGLAGISVGNVKIP--APEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDS 364
             P +P  Y +GL  I++G    P   P  L++ D  G+GG++VDSGTT+T LP   Y  
Sbjct: 308 NTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQ 367

Query: 365 VVAEFENRTGRVANRARRIEESIGLSPCY----------YYEGSVEV--PRVVLHFVGEQ 424
           ++   ++       RA   E   G   CY            E  V +  P +  HF+   
Sbjct: 368 LLTTLQSTI--TYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFL-NN 427

Query: 425 SSVVLPRKNYFYEFLDSGDGVGRKRKVGCLMLMNGGDEVELAGGPGATLGNYQQQGFEVV 472
           ++++LP+ N FY      DG      V CL+  N  D      GP    G++QQQ  +VV
Sbjct: 428 ATLLLPQGNSFYAMSAPSDG----SVVQCLLFQNMEDG---DYGPAGVFGSFQQQNVKVV 478

BLAST of CmUC10G185070 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 186.0 bits (471), Expect = 6.9e-47
Identity = 156/502 (31.08%), Positives = 230/502 (45.82%), Query Frame = 0

Query: 1   MASPVFVFLLCFLLSSPVFSSQILLLPLTHSLSSSISDFNNTHNLLKSTAARSSARFHHR 60
           MAS +F F L FL  S V + ++ L P +HS  S    + +   L +S+ AR+    H  
Sbjct: 1   MASSIFFFFLIFL--SVVSAVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGT 60

Query: 61  RRAHHRSHLS-----------LPLSPG--GDYTLSFNLGSESHKISLYMDTGSDLVWFPC 120
                   LS            PLS    G Y++S + G+ S  I    DTGS LVW PC
Sbjct: 61  SIKPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPC 120

Query: 121 -SPFECILCEGKPKIQSPLPKI-----SNNKSVSCSAPACSAAHGGSLSASHLCAISQCP 180
            S + C  C+      + +P+      S++K + C +P C   +G ++         QC 
Sbjct: 121 TSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV---------QCR 180

Query: 181 LESIEISECSSFSCPPFYYAYGDGSLIARLYRDSLSLPAPAPSPAINVRNFTFGCAHTTL 240
                   C +  CPP+   YG GS    L  + L        P + V +F  GC+  + 
Sbjct: 181 GCDPNTRNC-TVGCPPYILQYGLGSTAGVLITEKLDF------PDLTVPDFVVGCSIIST 240

Query: 241 GEPVGVAGFGRGTLSMPSQLATFSPQLGNRFSYCLVSHSFATDRVRRPSPLILGRYY--G 300
            +P G+AGFGRG +S+PSQ+         RFS+CLVS  F    V     L  G  +  G
Sbjct: 241 RQPAGIAGFGRGPVSLPSQMNL------KRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSG 300

Query: 301 RETE-FIYTSLLENPK-----HPYFYSVGLAGISVGNVKIPAPEFLKKVDEGGSGGVVVD 360
            +T    YT   +NP         +Y + L  I VG   +  P         G GG +VD
Sbjct: 301 SKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVD 360

Query: 361 SGTTFTMLPAGLYDSVVAEFENRTGRVANRARRIEESIGLSPCYYY--EGSVEVPRVVLH 420
           SG+TFT +   +++ V  EF ++      R + +E+  GL PC+    +G V VP ++  
Sbjct: 361 SGSTFTFMERPVFELVAEEFASQMSNY-TREKDLEKETGLGPCFNISGKGDVTVPELIFE 420

Query: 421 FVGEQSSVVLPRKNYFYEFLDSGDGVGRKRKVGCLMLMNGGDEVELAG-GPGATLGNYQQ 473
           F G  + + LP  NYF  F+ + D V       CL +++        G GP   LG++QQ
Sbjct: 421 FKG-GAKLELPLSNYF-TFVGNTDTV-------CLTVVSDKTVNPSGGTGPAIILGSFQQ 468

BLAST of CmUC10G185070 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 151.0 bits (380), Expect = 2.5e-36
Identity = 121/404 (29.95%), Positives = 183/404 (45.30%), Query Frame = 0

Query: 77  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSC 136
           G+Y     +G  + ++ + +DTGSD+ W  C+P  C  C  + +        S+ + +SC
Sbjct: 146 GEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTP--CADCYHQTEPIFEPSSSSSYEPLSC 205

Query: 137 SAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAYGDGS-LIARLYRDS 196
             P C+A                     +E+SEC + +C  +  +YGDGS  +     ++
Sbjct: 206 DTPQCNA---------------------LEVSECRNATC-LYEVSYGDGSYTVGDFATET 265

Query: 197 LSLPAPAPSPAINVRNFTFGCAHTTLGEPVGVA---GFGRGTLSMPSQLATFSPQLGNRF 256
           L++ +        V+N   GC H+  G  VG A   G G G L++PSQL T S      F
Sbjct: 266 LTIGSTL------VQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTS------F 325

Query: 257 SYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYFYSVGLAGISVGNVK 316
           SYCLV     +      S +  G       + +   LL N +   FY +GL GISVG   
Sbjct: 326 SYCLVDRDSDS-----ASTVDFGT--SLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGEL 385

Query: 317 IPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRVANRARRIEESIG 376
           +  P+   ++DE GSGG+++DSGT  T L   +Y+S+   F   T         +E++ G
Sbjct: 386 LQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGT-------LDLEKAAG 445

Query: 377 LS---PCYYYEG--SVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVGRKRKVGCLM 436
           ++    CY      +VEVP V  HF G +  + LP KNY        D VG      CL 
Sbjct: 446 VAMFDTCYNLSAKTTVEVPTVAFHFPGGK-MLALPAKNYMIPV----DSVG----TFCLA 483

Query: 437 LMNGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 472
                  +       A +GN QQQG  V +DL  + +GF+  +C
Sbjct: 506 FAPTASSL-------AIIGNVQQQGTRVTFDLANSLIGFSSNKC 483

BLAST of CmUC10G185070 vs. TAIR 10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 150.6 bits (379), Expect = 3.2e-36
Identity = 124/402 (30.85%), Positives = 173/402 (43.03%), Query Frame = 0

Query: 77  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSC 136
           G+Y     +G+ + ++ L +DTGSD+ W  C P  C  C  +          S  KS++C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEP--CADCYQQSDPVFNPTSSSTYKSLTC 219

Query: 137 SAPACSAAHGGSLSASHLCAISQCPLESIEISECSSFSCPPFYYAYGDGSL-IARLYRDS 196
           SAP CS                      +E S C S  C  +  +YGDGS  +  L  D+
Sbjct: 220 SAPQCSL---------------------LETSACRSNKC-LYQVSYGDGSFTVGELATDT 279

Query: 197 LSLPAPAPSPAINVRNFTFGCAHTTLG---EPVGVAGFGRGTLSMPSQLATFSPQLGNRF 256
           ++        +  + N   GC H   G      G+ G G G LS+ +Q+   S      F
Sbjct: 280 VTF-----GNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATS------F 339

Query: 257 SYCLVSHSFATDRVRRPSPLILGRYYGRETEFIYTSLLENPKHPYFYSVGLAGISVGNVK 316
           SYCLV            + + LG   G  T      LL N K   FY VGL+G SVG  K
Sbjct: 340 SYCLVDRDSGKSSSLDFNSVQLGG--GDAT----APLLRNKKIDTFYYVGLSGFSVGGEK 399

Query: 317 IPAPEFLKKVDEGGSGGVVVDSGTTFTMLPAGLYDSVVAEFENRTGRVANRARRIEESIG 376
           +  P+ +  VD  GSGGV++D GT  T L    Y+S+   F   T  +    ++   SI 
Sbjct: 400 VVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNL----KKGSSSIS 459

Query: 377 L-SPCYYYE--GSVEVPRVVLHFVGEQSSVVLPRKNYFYEFLDSGDGVGRKRKVGCLMLM 436
           L   CY +    +V+VP V  HF G + S+ LP KNY     DSG          C    
Sbjct: 460 LFDTCYDFSSLSTVKVPTVAFHFTGGK-SLDLPAKNYLIPVDDSG--------TFCFAFA 500

Query: 437 NGGDEVELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 472
                + +       +GN QQQG  + YDL KN +G +  +C
Sbjct: 520 PTSSSLSI-------IGNVQQQGTRITYDLSKNVIGLSGNKC 500
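
The percentages on each HSP statistics line above are simple ratios of matched positions to aligned length. A minimal Python sketch of that arithmetic follows, assuming the line layout used in this record; the helper name `parse_hsp_stats` and the regular expression are illustrative and not part of any BLAST tool.

```python
import re

# Pattern for the statistics line of an HSP as laid out in this record.
# The "i" is optional so a misspelled "Postives" label still matches.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "aligned": int(ident_len),
        # Recompute instead of trusting the printed value.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "reported_identity_pct": float(ident_pct),
        "positive": int(pos),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_positives_pct": float(pos_pct),
    }


# Statistics line of the AT4G16563.1 hit in this record.
stats = parse_hsp_stats(
    "Identity = 297/478 (62.13%), Positives = 356/478 (74.48%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the reported ones is a quick consistency check when alignment blocks are copied out of a page like this one.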

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038905814.1	8.5e-265	94.18	probable aspartyl protease At4g16563 [Benincasa hispida]
XP_023007805.1	1.0e-257	92.31	probable aspartyl protease At4g16563 [Cucurbita maxima]
XP_023553227.1	1.3e-257	92.13	probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]
KAG6577689.1	5.0e-257	91.93	putative aspartyl protease, partial [Cucurbita argyrosperma subsp. sororia]
XP_022923540.1	8.5e-257	91.93	probable aspartyl protease At4g16563 [Cucurbita moschata]
Match Name	E-value	Identity	Description
Q940R4	1.7e-167	62.13	Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656...
Q9LS40	4.5e-35	30.85	Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q9LNJ3	3.8e-34	31.46	Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q9LHE3	1.4e-31	29.38	Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q766C2	2.3e-31	28.09	Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Match Name	E-value	Identity	Description
A0A6J1L3Z9	4.9e-258	92.31	probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111500303...
A0A0A0L5I7	4.1e-257	92.29	Pepsin A OS=Cucumis sativus OX=3659 GN=Csa_3G020060 PE=3 SV=1
A0A6J1EC44	4.1e-257	91.93	probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114312...
A0A1S3BK28	7.8e-256	91.70	aspartic proteinase nepenthesin-1 OS=Cucumis melo OX=3656 GN=LOC103490888 PE=3 S...
A0A5D3CP11	1.9e-254	91.32	Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
Match Name	E-value	Identity	Description
AT4G16563.1	1.2e-168	62.13	Eukaryotic aspartyl protease family protein
AT5G45120.1	4.3e-49	31.92	Eukaryotic aspartyl protease family protein
AT3G52500.1	6.9e-47	31.08	Eukaryotic aspartyl protease family protein
AT1G25510.1	2.5e-36	29.95	Eukaryotic aspartyl protease family protein
AT3G18490.1	3.2e-36	30.85	Eukaryotic aspartyl protease family protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR032799	Xylanase inhibitor, C-terminal	PFAM	PF14541	TAXi_C	coord: 298..467; e-value: 1.1E-26; score: 93.5
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 276..477; e-value: 1.3E-47; score: 163.9
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 69..275; e-value: 4.7E-34; score: 120.0
IPR021109	Aspartic peptidase domain superfamily	SUPERFAMILY	50630	Acid proteases	coord: 74..476
IPR032861	Xylanase inhibitor, N-terminal	PFAM	PF14543	TAXi_N	coord: 79..273; e-value: 1.2E-29; score: 103.8
None	No IPR available	PANTHER	PTHR47967:SF26	BNAA01G17170D PROTEIN	coord: 1..476
None	No IPR available	PANTHER	PTHR47967	OS07G0603500 PROTEIN-RELATED	coord: 1..476
IPR001969	Aspartic peptidase, active site	PROSITE	PS00141	ASP_PROTEASE	coord: 330..341
IPR033121	Peptidase family A1 domain	PROSITE	PS51767	PEPTIDASE_A1	coord: 79..467; score: 30.503099
IPR034161	Pepsin-like domain, plant	CDD	cd05476	pepsin_A_like_plant	coord: 79..471; e-value: 5.43031E-69; score: 219.442

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmUC10G185070.1	CmUC10G185070.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006508	proteolysis
cellular_component	GO:0005576	extracellular region
molecular_function	GO:0004190	aspartic-type endopeptidase activity