Clc10G18000 (gene) Watermelon (cordophanus) v2

Overview
NameClc10G18000
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionEukaryotic aspartyl protease family protein
LocationClcChr10: 31876428 .. 31879460 (-)
RNA-Seq ExpressionClc10G18000
SyntenyClc10G18000
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTGGTAGTTTTCAATTCTTTTCTTTTTTAATTTGTTTCCTTGATTTCAATCTTCATGTTTCATATTTTCTCACTTAAAAGTAAAATGAAAAACAAAAATTACGTGATTGAAAAATATTTATAATTTCACGTTCTTTCTTTCTCATAAAAAAAAATTCATTTCAAACAGAACTCTTTTGTTCCGTGTTCTCTTAAAAGTTTAAATAAAAAAAATGTGTAAAAAAGAAGAAAAAAAAACTAGATATTTTAAATTTGGAAATATCACTTTTGGTCGTTAAGATCTGGTTAGATTGCAATTGAGTTTCTAAGTTTCAAACTCAACATTTTTATCCATAAAATTTACATTTGGTAACATTTTTAGTCTTTATCAATGTTTTTATTAATTAGTAATGAAAAGCTAACGTGACACATATATTTTTCAAATAAAAACATAGGTATTATGTTGTTTAATAATTAAAAAATTAGAATTCTTTTTTTTTTAAAAAAAAAAGAACATAGGTGTTATGTTGGTAACTAAAAAAAAAAAACCGAAGGGACCAAAATTGCCACAAAATGCAAACTTCAAGGATAAAGTTGTTGAACTTGAAAACTTAGCGACTCAAACACAACATAAATCAAGTTTCAAGTACCAAAAGTGTAATTTAAATCTTTGGGCTATATATTTTAAATAATTATTTAACATTTTATAAAAATAAATCACTTTTTCAAAATTAATTTGTGTTTTTTCAAAATTTAAAAATTTTTGCAATTCCATAATCTTGGTATTATGATTTCTTTTTTAATTTTGAAGAAACACATTATTTTTTTTTAGGGTTAAAATATTATTTTAGTCCATATACTTTGAAATTTGTTCAATTTTAGTCCTTATACTTTAAAATGTCGAATTTTAGTCCTTGTACTTTAAATGAATCTTAAATTTAGTTCTTACTACTTGTTTACTGTAGATTTTTCATTATTTTTGTTTTTTTTTCTTTTATTAACATTGTCACTATAGATTTTGAAAATATATTCACATATTATGTTTATTTGCATGAAAATTAGTACGATTATGTAATCAACTTCCTAAATTTAAGATTTATTTCAAACACAAAGACTAAAGTTGGACTTTTGAAAATATAGAAATTAAAATTAAACAAATTTTAAAGTACATGACTAAAATGGTATTTGAATATTTTTTAGTACAACAATTAGAAAATTAATTCCACAGATGTGGTGATTCATAATACTCAGTGCCAATTTCTTTTTTATTTTAATTTATAACAACTGTACTATGTGAAGTGTAGTAAGTAGAAAGACTAGAAGAGCGTCCGACTAAGCTGAGCGAAGAAGACCATACCACGTTCCTCACTCCTCAGTTCCATAGTTCCAACGGCCAAAAAAGAGCCCTTGTGACTTCTGACTTGTGACCGTGAAACTTCTTATGTAATGTGTTTTCCACGTGGCTTACTTTCATTTGTCACTACTTTCTCTTCTCTGAAACCTTAGCCCTGCTTCCAGCTCACCGCTTTGTCCCTTCTTTGCTCTTCCTCCCCATTTATACTTCAAACCCCCCTTTCTCAACCTTCTCATCAACATAACTCTCTTTCCCTTCATACATTTCCAATCCATAACAATATCAGGATTACATTACAAGCAGGAAGAAGCCAACCCAAAGCTTCTTCCATGGCGTCTCCTTCCCCTCTCTCTTTCTTCTACATTCTCCTCTTCTCCTCTGTTTCCTCCATTGCCAACACCAACCCAATCACCCTCCCTCTCCACGCCTTCCCCCACCTTCCTTCTTCAGATCCACTCCAAACTCTCACTTTCCTCGCCTCTGCTTCCCAAAACAGAGCTCATCAAATCAAAACCCCCAAATCCAACTCTGTTTCCAAGTCCCCTCTCTCCCCTCACAGCTATGGAGCTTACTCAACTCCACTCAGCTTTGGTACTCCACAACAGACTCTGCATTTGATCTTCGATACAGGTAGTAGCCTCGTTTGGTTCCCTTGTACTTCCAGATATCTCTGTTCCGAATGTTCCTTCCCCAAAATAGATCCCGCCGGAATCCCCAGATTTGTCCCCAAATTGTCTTCCTCTTCCAAGCTTGTCGGTTGCCAGAATCCCAAATGTGCTTGGATTTTTGGCCCCGATGTCAAATCTCAATGCCGGAGTTGTAACCCCAAAACAGAGAACTGTACCCAAACTTGCCCTGCTTACGTCGTTCAGTACGGTTCCGGCTCCACGGCTGGGCTTTTGCTATCGGAAACGCTTGATTTTCCCGATAAGAAAATCCCCAATTTTGTTGTTGGCTGTTCGTTTTTGTCGATCCATCAACCCTCTGGAATCGCCGGATTCGGCCGAGGATCTGAATCGCTCCCCTCGCAAATGGGTCTCAAGAAATTCGCGTACTGCCTTGCGTCTCGGAAATTCGACGACTCGCCGCATTCTGGTGAGCTGATTCTAGATTCCACCGGCGTGAAGACCGGCGGTCTCACCTACACGCCGTTCCGGCAGAACCCCTCTGTTTCTAACCACGCTTATAAAGAATACTATTACTTAAGCATACGCAAAATCCTCGTCGGAAACCAGGCCGTGAAGGTGCTGTACAAGTATCTGGTGCCGGGCCCCGACGGCAACGGTGGATCTATCATCGATTCCGGCTCCACCTTCACGTTTATGGACAAACCAGTTTTCGAGGCAGTGGCGCAAGAGTTCGAGAAGCAGTTGGCGAACCGGACGAGAGCCACCGATGTGGAATCTCTCACCGGATTACGGCCGTGTTTCGACATTTCGAAGGACAAATCGGTGGATTTTCCGGAGCTGATTTTCCAGTTTAAAGGCGGAGCGAAATGGGCTCTGCCGTTGAGTAACTATTTCGCTTTAGTCAGTAGCTCCGGCGTGGCGTGTTTGACGGTTGTGACGCATAAGACGGAGGCGGGCGGCGGCGGTGGGCCGTCTGTGATTTTGGGGGCTTTCCAGCAGCAGAATTTCTATGTGGAGTATGATTTGGTGAATGAAAGATTGGGATTTCGGCAACAGACTTGCACTTAG

mRNA sequence

ATGTTTGAAAGACTAGAAGAGCGTCCGACTAAGCTGAGCGAAGAAGACCATACCACGTTCCTCACTCCTCAGTTCCATAGTTCCAACGGCCAAAAAAGAGCCCTTCCCTGCTTCCAGCTCACCGCTTTGTCCCTTCTTTGCTCTTCCTCCCCATTTATACTTCAAACCCCCCTTTCTCAACCTTCTCATCAACATAACTCTCTTTCCCTTCATACATTTCCAATCCATAACAATATCAGGATTACATTACAAGCAGGAAGAAGCCAACCCAAAGCTTCTTCCATGGCGTCTCCTTCCCCTCTCTCTTTCTTCTACATTCTCCTCTTCTCCTCTGTTTCCTCCATTGCCAACACCAACCCAATCACCCTCCCTCTCCACGCCTTCCCCCACCTTCCTTCTTCAGATCCACTCCAAACTCTCACTTTCCTCGCCTCTGCTTCCCAAAACAGAGCTCATCAAATCAAAACCCCCAAATCCAACTCTGTTTCCAAGTCCCCTCTCTCCCCTCACAGCTATGGAGCTTACTCAACTCCACTCAGCTTTGGTACTCCACAACAGACTCTGCATTTGATCTTCGATACAGGTAGTAGCCTCGTTTGGTTCCCTTGTACTTCCAGATATCTCTGTTCCGAATGTTCCTTCCCCAAAATAGATCCCGCCGGAATCCCCAGATTTGTCCCCAAATTGTCTTCCTCTTCCAAGCTTGTCGGTTGCCAGAATCCCAAATGTGCTTGGATTTTTGGCCCCGATGTCAAATCTCAATGCCGGAGTTGTAACCCCAAAACAGAGAACTGTACCCAAACTTGCCCTGCTTACGTCGTTCAGTACGGTTCCGGCTCCACGGCTGGGCTTTTGCTATCGGAAACGCTTGATTTTCCCGATAAGAAAATCCCCAATTTTGTTGTTGGCTGTTCGTTTTTGTCGATCCATCAACCCTCTGGAATCGCCGGATTCGGCCGAGGATCTGAATCGCTCCCCTCGCAAATGGGTCTCAAGAAATTCGCGTACTGCCTTGCGTCTCGGAAATTCGACGACTCGCCGCATTCTGGTGAGCTGATTCTAGATTCCACCGGCGTGAAGACCGGCGGTCTCACCTACACGCCGTTCCGGCAGAACCCCTCTGTTTCTAACCACGCTTATAAAGAATACTATTACTTAAGCATACGCAAAATCCTCGTCGGAAACCAGGCCGTGAAGGTGCTGTACAAGTATCTGGTGCCGGGCCCCGACGGCAACGGTGGATCTATCATCGATTCCGGCTCCACCTTCACGTTTATGGACAAACCAGTTTTCGAGGCAGTGGCGCAAGAGTTCGAGAAGCAGTTGGCGAACCGGACGAGAGCCACCGATGTGGAATCTCTCACCGGATTACGGCCGTGTTTCGACATTTCGAAGGACAAATCGGTGGATTTTCCGGAGCTGATTTTCCAGTTTAAAGGCGGAGCGAAATGGGCTCTGCCGTTGAGTAACTATTTCGCTTTAGTCAGTAGCTCCGGCGTGGCGTGTTTGACGGTTGTGACGCATAAGACGGAGGCGGGCGGCGGCGGTGGGCCGTCTGTGATTTTGGGGGCTTTCCAGCAGCAGAATTTCTATGTGGAGTATGATTTGGTGAATGAAAGATTGGGATTTCGGCAACAGACTTGCACTTAG

Coding sequence (CDS)

ATGTTTGAAAGACTAGAAGAGCGTCCGACTAAGCTGAGCGAAGAAGACCATACCACGTTCCTCACTCCTCAGTTCCATAGTTCCAACGGCCAAAAAAGAGCCCTTCCCTGCTTCCAGCTCACCGCTTTGTCCCTTCTTTGCTCTTCCTCCCCATTTATACTTCAAACCCCCCTTTCTCAACCTTCTCATCAACATAACTCTCTTTCCCTTCATACATTTCCAATCCATAACAATATCAGGATTACATTACAAGCAGGAAGAAGCCAACCCAAAGCTTCTTCCATGGCGTCTCCTTCCCCTCTCTCTTTCTTCTACATTCTCCTCTTCTCCTCTGTTTCCTCCATTGCCAACACCAACCCAATCACCCTCCCTCTCCACGCCTTCCCCCACCTTCCTTCTTCAGATCCACTCCAAACTCTCACTTTCCTCGCCTCTGCTTCCCAAAACAGAGCTCATCAAATCAAAACCCCCAAATCCAACTCTGTTTCCAAGTCCCCTCTCTCCCCTCACAGCTATGGAGCTTACTCAACTCCACTCAGCTTTGGTACTCCACAACAGACTCTGCATTTGATCTTCGATACAGGTAGTAGCCTCGTTTGGTTCCCTTGTACTTCCAGATATCTCTGTTCCGAATGTTCCTTCCCCAAAATAGATCCCGCCGGAATCCCCAGATTTGTCCCCAAATTGTCTTCCTCTTCCAAGCTTGTCGGTTGCCAGAATCCCAAATGTGCTTGGATTTTTGGCCCCGATGTCAAATCTCAATGCCGGAGTTGTAACCCCAAAACAGAGAACTGTACCCAAACTTGCCCTGCTTACGTCGTTCAGTACGGTTCCGGCTCCACGGCTGGGCTTTTGCTATCGGAAACGCTTGATTTTCCCGATAAGAAAATCCCCAATTTTGTTGTTGGCTGTTCGTTTTTGTCGATCCATCAACCCTCTGGAATCGCCGGATTCGGCCGAGGATCTGAATCGCTCCCCTCGCAAATGGGTCTCAAGAAATTCGCGTACTGCCTTGCGTCTCGGAAATTCGACGACTCGCCGCATTCTGGTGAGCTGATTCTAGATTCCACCGGCGTGAAGACCGGCGGTCTCACCTACACGCCGTTCCGGCAGAACCCCTCTGTTTCTAACCACGCTTATAAAGAATACTATTACTTAAGCATACGCAAAATCCTCGTCGGAAACCAGGCCGTGAAGGTGCTGTACAAGTATCTGGTGCCGGGCCCCGACGGCAACGGTGGATCTATCATCGATTCCGGCTCCACCTTCACGTTTATGGACAAACCAGTTTTCGAGGCAGTGGCGCAAGAGTTCGAGAAGCAGTTGGCGAACCGGACGAGAGCCACCGATGTGGAATCTCTCACCGGATTACGGCCGTGTTTCGACATTTCGAAGGACAAATCGGTGGATTTTCCGGAGCTGATTTTCCAGTTTAAAGGCGGAGCGAAATGGGCTCTGCCGTTGAGTAACTATTTCGCTTTAGTCAGTAGCTCCGGCGTGGCGTGTTTGACGGTTGTGACGCATAAGACGGAGGCGGGCGGCGGCGGTGGGCCGTCTGTGATTTTGGGGGCTTTCCAGCAGCAGAATTTCTATGTGGAGTATGATTTGGTGAATGAAAGATTGGGATTTCGGCAACAGACTTGCACTTAG

Protein sequence

MFERLEERPTKLSEEDHTTFLTPQFHSSNGQKRALPCFQLTALSLLCSSSPFILQTPLSQPSHQHNSLSLHTFPIHNNIRITLQAGRSQPKASSMASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
Homology
BLAST of Clc10G18000 vs. NCBI nr
Match: XP_038905730.1 (probable aspartyl protease At4g16563 [Benincasa hispida])

HSP 1 Score: 899.0 bits (2322), Expect = 2.0e-257
Identity = 438/455 (96.26%), Postives = 446/455 (98.02%), Query Frame = 0

Query: 95  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQI 154
           MA PS LSFFYILLFSSVS+IANTNPITLPL+AFPHL SSDPLQTLTFLASASQNRAHQI
Sbjct: 1   MAPPSSLSFFYILLFSSVSAIANTNPITLPLNAFPHLSSSDPLQTLTFLASASQNRAHQI 60

Query: 155 KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 214
           KTPKSNSVSKSPL PHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF
Sbjct: 61  KTPKSNSVSKSPLFPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120

Query: 215 PKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 274
           PKIDP GIPRFVPKLSSSSKLVGCQNPKCAWIFGP+VKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPEVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 275 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 334
           QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 335 AYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG 394
           AYCLASRKFDDSPHSGELILDSTGVKT GL+YTPFRQNPSVSNHAYKEYYYL+IRKI VG
Sbjct: 241 AYCLASRKFDDSPHSGELILDSTGVKTSGLSYTPFRQNPSVSNHAYKEYYYLNIRKIFVG 300

Query: 395 NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESL 454
           NQAVKV YK+LVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESL
Sbjct: 301 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESL 360

Query: 455 TGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGG 514
           TGLRPCFDISKDKSV+FPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGG
Sbjct: 361 TGLRPCFDISKDKSVEFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGG 420

Query: 515 GGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT 550
           GGGPSVI GAFQQQNFYVEYDLVNE+LGFRQQTCT
Sbjct: 421 GGGPSVIFGAFQQQNFYVEYDLVNEKLGFRQQTCT 455

BLAST of Clc10G18000 vs. NCBI nr
Match: XP_008442902.1 (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo] >KAA0043829.1 aspartic proteinase nepenthesin-2 [Cucumis melo var. makuwa] >TYK25303.1 aspartic proteinase nepenthesin-2 [Cucumis melo var. makuwa])

HSP 1 Score: 874.4 bits (2258), Expect = 5.2e-250
Identity = 422/455 (92.75%), Postives = 443/455 (97.36%), Query Frame = 0

Query: 95  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQI 154
           MASPSPLSFFYILLFSS+S+I+N+NPITLPL++ PHL SSDPLQ LTFLASAS+NRAH+I
Sbjct: 1   MASPSPLSFFYILLFSSLSAISNSNPITLPLNSSPHLSSSDPLQALTFLASASKNRAHRI 60

Query: 155 KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 214
           KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLC+ECSF
Sbjct: 61  KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCTECSF 120

Query: 215 PKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 274
           PKIDP GIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 275 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 334
           QYGSGSTAGLLLSETLDFP+KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPNKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 335 AYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG 394
           AYCLASRKFDDS HSG+LILDS+GVKT GLTYT FRQNPSVSNHAYKEYYYL+IRKI+VG
Sbjct: 241 AYCLASRKFDDSAHSGQLILDSSGVKTSGLTYTSFRQNPSVSNHAYKEYYYLNIRKIIVG 300

Query: 395 NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESL 454
           NQAVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPV + VAQEFEKQLANRTRATDVE+L
Sbjct: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLDVVAQEFEKQLANRTRATDVETL 360

Query: 455 TGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGG 514
           TGLRPCFD+SK+KSV+FPELIFQFKGGAKWALPL+NYFALVSSSGVACLTVVTH TE GG
Sbjct: 361 TGLRPCFDVSKEKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHNTEDGG 420

Query: 515 GGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT 550
           GGGPSVILGAFQQQNFYVEYDLVNERLGFR+QTCT
Sbjct: 421 GGGPSVILGAFQQQNFYVEYDLVNERLGFRKQTCT 455

BLAST of Clc10G18000 vs. NCBI nr
Match: XP_004136706.1 (probable aspartyl protease At4g16563 [Cucumis sativus] >KGN59188.1 hypothetical protein Csa_002380 [Cucumis sativus])

HSP 1 Score: 863.6 bits (2230), Expect = 9.1e-247
Identity = 419/457 (91.68%), Postives = 441/457 (96.50%), Query Frame = 0

Query: 95  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQI 154
           MASPSPLSFFY+LLFSS+S+IA++NPITLPL++FPHL S DPLQ LTFLAS+SQ RAHQI
Sbjct: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60

Query: 155 KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 214
           KTPKSNSV KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF
Sbjct: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120

Query: 215 PKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 274
           PKIDP GIPRFVPKLSSSSKLVGCQNPKC+WIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 275 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 334
           QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 335 AYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG 394
           AYCLASRKFDDSPHSG+LILDSTGVK+ GLTYTPFRQNPSVSN+AYKEYYYL+IRKI+VG
Sbjct: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300

Query: 395 NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESL 454
           NQAVKV YK+LVPGPDGNGGSIIDSGSTFTFMDKPV E VA+EFEKQLAN TRATDVE+L
Sbjct: 301 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360

Query: 455 TGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTE--A 514
           TGLRPCFDISK+KSV FPELIFQFKGGAKWALPL+NYFALVSSSGVACLTVVTH+ E   
Sbjct: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420

Query: 515 GGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT 550
           GGGGGPSVILGAFQQQNFYVEYDLVN+RLGFRQQTC+
Sbjct: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457

BLAST of Clc10G18000 vs. NCBI nr
Match: XP_022982947.1 (probable aspartyl protease At4g16563 [Cucurbita maxima])

HSP 1 Score: 844.3 bits (2180), Expect = 5.7e-241
Identity = 412/457 (90.15%), Postives = 430/457 (94.09%), Query Frame = 0

Query: 95  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQI 154
           MA P PL FFYILL SSVS+IA+TNPIT+PL +FPH  SSDPLQTL FLASASQNRAHQI
Sbjct: 1   MAPPPPLCFFYILLVSSVSAIADTNPITIPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 155 KTPK--SNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 214
           K PK  SNSVSKSPLSPHSYGAYSTPLSFGTP QTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSESNSVSKSPLSPHSYGAYSTPLSFGTPPQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 215 SFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 274
           SFPKIDPAGIPRF+PKLSS+SKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY
Sbjct: 121 SFPKIDPAGIPRFIPKLSSTSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 275 VVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 334
           VVQYGSGSTAGLLLSETLDFPDKK  NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFPDKKFTNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 335 KFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKIL 394
           KFAYCLASRKFDDSPH+GELILDS+G KT GL+YTPFRQNPSVSNHAYKEYYYL+IRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLSYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 395 VGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVE 454
           VG +AVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQE EKQLANRTRATDVE
Sbjct: 301 VGKKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 455 SLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEA 514
           SLTGLRPCFDISKDKSV+FPEL FQ KGGAKW LPLSNYFALVSSSGVACLTVVTHKT A
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFQLKGGAKWGLPLSNYFALVSSSGVACLTVVTHKT-A 420

Query: 515 GGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT 550
             GGGPS+ILGAFQQQNFYVEYDLVN+++GFRQQTC+
Sbjct: 421 DSGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of Clc10G18000 vs. NCBI nr
Match: XP_023528159.1 (probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 843.2 bits (2177), Expect = 1.3e-240
Identity = 414/457 (90.59%), Postives = 430/457 (94.09%), Query Frame = 0

Query: 95  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQI 154
           MA P  L FFYILL SSVS+IA+TNPITLPL +FPH  SSDPLQTL FLASASQNRAHQI
Sbjct: 1   MAPPPLLCFFYILLLSSVSAIADTNPITLPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 155 KTP--KSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 214
           K P  KSNSVSKSPLSPHSYGAYSTPLSFGTP QTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSKSNSVSKSPLSPHSYGAYSTPLSFGTPSQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 215 SFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 274
           SFPKIDPAGIPRF+PKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY
Sbjct: 121 SFPKIDPAGIPRFIPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 275 VVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 334
           VVQYGSGSTAGLLLSETLDF +KKI NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFANKKITNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 335 KFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKIL 394
           KFAYCLASRKFDDSPH+GELILDS+G KT GLTYTPFRQNPSVSNHAYKEYYYL+IRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLTYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 395 VGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVE 454
           VGN+AVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQE EKQLANRTRATDVE
Sbjct: 301 VGNKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 455 SLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEA 514
           SLTGLRPCFDISKDKSV+FPEL FQ KGGAKWALPLSNYFALVSSSGVACLTVVTHK  A
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFQLKGGAKWALPLSNYFALVSSSGVACLTVVTHKA-A 420

Query: 515 GGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT 550
             GGGPS+ILGAFQQQNFYVEYDLVN+++GFRQQTC+
Sbjct: 421 DSGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of Clc10G18000 vs. ExPASy Swiss-Prot
Match: Q940R4 (Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g16563 PE=2 SV=1)

HSP 1 Score: 190.7 bits (483), Expect = 4.5e-47
Identity = 145/485 (29.90%), Postives = 214/485 (44.12%), Query Frame = 0

Query: 111 SVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPH 170
           SVSS++    + L         SS PL  L   +S S  R  +    +       P+S  
Sbjct: 21  SVSSLSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPIS-- 80

Query: 171 SYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLS 230
           S   Y   LS G+    + L  DTGS LVWFPC   + C  C    + P+        LS
Sbjct: 81  SGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESKPLPPSP----PSSLS 140

Query: 231 SSSKLVGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTENCTQT---CPAYVVQYGSGST 290
           SS+  V C +P C+        S  C   N      +T +C  +   CP +   YG GS 
Sbjct: 141 SSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSL 200

Query: 291 AGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFA 350
              L S++L  P   + NF  GC+  ++ +P G+AGFGRG  SLP+Q+ +        F+
Sbjct: 201 VAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFS 260

Query: 351 YCLASRKFDDS--PHSGELIL--------------------DSTGVKTGGLTYTPFRQNP 410
           YCL S  FD         LIL                    D    K     +T   +NP
Sbjct: 261 YCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENP 320

Query: 411 SVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEA 470
               H Y  +Y +S++ I +G + +           +G GG ++DSG+TFT +    + +
Sbjct: 321 ---KHPY--FYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNS 380

Query: 471 VAQEFEKQLAN-RTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGG-AKWALPLSNY 530
           V +EF+ ++     RA  VE  +G+ PC+ +  +++V  P L+  F G  +   LP  NY
Sbjct: 381 VVEEFDSRVGRVHERADRVEPSSGMSPCYYL--NQTVKVPALVLHFAGNRSSVTLPRRNY 440

Query: 531 FALVSSSG--------VACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGF 549
           F      G        + CL ++    E+   GG   ILG +QQQ F V YDL+N R+GF
Sbjct: 441 FYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGF 491

BLAST of Clc10G18000 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 7.2e-37
Identity = 143/491 (29.12%), Postives = 205/491 (41.75%), Query Frame = 0

Query: 90  PKASSMASPSPLSF--------FYILLFSSVSSIANTNPITLPLHAFPHLPSS-DPLQTL 149
           P + S+   SP+SF             F S S   +++ ITL L     L S+  P +  
Sbjct: 33  PNSHSLPCASPVSFQPDSDSESLLESEFESGSDSESSSSITLNLDHIDALSSNKTPDELF 92

Query: 150 TFLASASQNRAHQIKT-------------PKSNSVSKSPLSPHSYGA--YSTPLSFGTPQ 209
           +        R   I T             P+    S S +S  S G+  Y T L  GTP 
Sbjct: 93  SSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPA 152

Query: 210 QTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAW 269
           + ++++ DTGS +VW  C     C  C + + DP     F P+ S +   + C +P C  
Sbjct: 153 RYVYMVLDTGSDIVWLQCAP---CRRC-YSQSDPI----FDPRKSKTYATIPCSSPHCR- 212

Query: 270 IFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGC 329
                 +     CN + + C      Y V YG GS T G   +ETL F   ++    +GC
Sbjct: 213 ------RLDSAGCNTRRKTC-----LYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGC 272

Query: 330 SFLS---IHQPSGIAGFGRGSESLPSQMGLK---KFAYCLASRKFDDSPHSGELILDSTG 389
              +       +G+ G G+G  S P Q G +   KF+YCL  R     P S   ++    
Sbjct: 273 GHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS---VVFGNA 332

Query: 390 VKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVK-VLYKYLVPGPDGNGGSII 449
             +    +TP   NP +       +YY+ +  I VG   V  V          GNGG II
Sbjct: 333 AVSRIARFTPLLSNPKLDT-----FYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVII 392

Query: 450 DSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQ 509
           DSG++ T + +P + A+   F        RA D         CFD+S    V  P ++  
Sbjct: 393 DSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFD---TCFDLSNMNEVKVPTVVLH 452

Query: 510 FKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLV 549
           F+ GA  +LP +NY   V ++G  C         AG  GG S+I G  QQQ F V YDL 
Sbjct: 453 FR-GADVSLPATNYLIPVDTNGKFCFAF------AGTMGGLSII-GNIQQQGFRVVYDLA 484

BLAST of Clc10G18000 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 1.8e-35
Identity = 126/386 (32.64%), Postives = 173/386 (44.82%), Query Frame = 0

Query: 173 GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC-SFPKIDPAGIPRFVPKLSS 232
           G Y   ++ GTP  +   I DTGS L+W  C     C++C S P       P F P+ SS
Sbjct: 94  GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEP---CTQCFSQP------TPIFNPQDSS 153

Query: 233 SSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTA-GLLLSETL 292
           S   + C++  C      D+ S         E C      Y   YG GST  G + +ET 
Sbjct: 154 SFSTLPCESQYC-----QDLPS---------ETCNNNECQYTYGYGDGSTTQGYMATETF 213

Query: 293 DFPDKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDS 352
            F    +PN   GC            +G+ G G G  SLPSQ+G+ +F+YC+ S     S
Sbjct: 214 TFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYG-SSS 273

Query: 353 PHSGELILDSTGVKTGGLTYTPFRQ--NPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKY 412
           P +  L   ++GV  G  + T      NP+        YYY++++ I VG   + +    
Sbjct: 274 PSTLALGSAASGVPEGSPSTTLIHSSLNPT--------YYYITLQGITVGGDNLGIPSST 333

Query: 413 LVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDIS 472
                DG GG IIDSG+T T++ +  + AVAQ F  Q+      T  ES +GL  CF   
Sbjct: 334 FQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI---NLPTVDESSSGLSTCFQQP 393

Query: 473 KDKS-VDFPELIFQFKGGAKWALPLSNYFALVS-SSGVACLTVVTHKTEAGGGGGPSVIL 532
            D S V  PE+  QF GG    L L     L+S + GV CL + +  ++ G       I 
Sbjct: 394 SDGSTVQVPEISMQFDGG---VLNLGEQNILISPAEGVICLAMGS-SSQLG-----ISIF 435

Query: 533 GAFQQQNFYVEYDLVNERLGFRQQTC 549
           G  QQQ   V YDL N  + F    C
Sbjct: 454 GNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Clc10G18000 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 3.7e-33
Identity = 123/429 (28.67%), Postives = 174/429 (40.56%), Query Frame = 0

Query: 130 HLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSY---GAYSTPLSFGTPQQ 189
           H+ S   L     L  A +  + +++  ++     S +    Y   G Y   LS GTP Q
Sbjct: 47  HVDSGKNLTKFQLLERAIERGSRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQ 106

Query: 190 TLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWI 249
               I DTGS L+W  C     C   S P  +P G        SSS   + C +  C  +
Sbjct: 107 PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQG--------SSSFSTLPCSSQLCQAL 166

Query: 250 FGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGC- 309
             P               C+     Y   YG GS T G + +ETL F    IPN   GC 
Sbjct: 167 SSP--------------TCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCG 226

Query: 310 ---SFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKT 369
                      +G+ G GRG  SLPSQ+ + KF+YC+       S  S  L+       T
Sbjct: 227 ENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMT--PIGSSTPSNLLLGSLANSVT 286

Query: 370 GGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKV-LYKYLVPGPDGNGGSIIDSG 429
            G   T   Q+  +       +YY+++  + VG+  + +    + +   +G GG IIDSG
Sbjct: 287 AGSPNTTLIQSSQIPT-----FYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSG 346

Query: 430 STFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKS-VDFPELIFQFK 489
           +T T+     +++V QEF  Q+          S +G   CF    D S +  P  +  F 
Sbjct: 347 TTLTYFVNNAYQSVRQEFISQI---NLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFD 406

Query: 490 GGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNE 549
           GG    LP  NYF +  S+G+ CL +       G       I G  QQQN  V YD  N 
Sbjct: 407 GG-DLELPSENYF-ISPSNGLICLAM-------GSSSQGMSIFGNIQQQNMLVVYDTGNS 434

BLAST of Clc10G18000 vs. ExPASy Swiss-Prot
Match: O04496 (Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1)

HSP 1 Score: 135.2 bits (339), Expect = 2.3e-30
Identity = 122/426 (28.64%), Postives = 180/426 (42.25%), Query Frame = 0

Query: 130 HLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLH 189
           H+ SSD    LT+L+S    +      PK  SV  +  +    G Y      GTP Q + 
Sbjct: 66  HMASSDS-HRLTYLSSLVAGK------PKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMF 125

Query: 190 LIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGP 249
           ++ DT +  VW PC+    CS CS           F    SS+   V C   +C    G 
Sbjct: 126 MVLDTSNDAVWLPCSG---CSGCSNASTS------FNTNSSSTYSTVSCSTAQCTQARG- 185

Query: 250 DVKSQCRSCNPKTENCTQTCPAYVVQYGSGST-AGLLLSETLDFPDKKIPNFVVGC---S 309
                C S +P+   C     ++   YG  S+ +  L+ +TL      IPNF  GC   +
Sbjct: 186 ---LTCPSSSPQPSVC-----SFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSA 245

Query: 310 FLSIHQPSGIAGFGRGSESLPSQ---MGLKKFAYCLASRKFDDSPHSGELILDSTGVKTG 369
             +   P G+ G GRG  SL SQ   +    F+YCL S  F     SG L L   G +  
Sbjct: 246 SGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPS--FRSFYFSGSLKLGLLG-QPK 305

Query: 370 GLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGST 429
            + YTP  +NP          YY+++  + VG+  V V   YL    +   G+IIDSG+ 
Sbjct: 306 SIRYTPLLRNP-----RRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTV 365

Query: 430 FTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGA 489
            T   +PV+EA+  EF KQ+      +   +L     CF  S D     P++        
Sbjct: 366 ITRFAQPVYEAIRDEFRKQV----NVSSFSTLGAFDTCF--SADNENVAPKITLHMT-SL 425

Query: 490 KWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLG 549
              LP+ N     S+  + CL++   +  A        ++   QQQN  + +D+ N R+G
Sbjct: 426 DLKLPMENTLIHSSAGTLTCLSMAGIRQNA---NAVLNVIANLQQQNLRILFDVPNSRIG 448

BLAST of Clc10G18000 vs. ExPASy TrEMBL
Match: A0A5A7TRK2 (Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004410 PE=3 SV=1)

HSP 1 Score: 874.4 bits (2258), Expect = 2.5e-250
Identity = 422/455 (92.75%), Postives = 443/455 (97.36%), Query Frame = 0

Query: 95  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQI 154
           MASPSPLSFFYILLFSS+S+I+N+NPITLPL++ PHL SSDPLQ LTFLASAS+NRAH+I
Sbjct: 1   MASPSPLSFFYILLFSSLSAISNSNPITLPLNSSPHLSSSDPLQALTFLASASKNRAHRI 60

Query: 155 KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 214
           KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLC+ECSF
Sbjct: 61  KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCTECSF 120

Query: 215 PKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 274
           PKIDP GIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 275 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 334
           QYGSGSTAGLLLSETLDFP+KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPNKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 335 AYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG 394
           AYCLASRKFDDS HSG+LILDS+GVKT GLTYT FRQNPSVSNHAYKEYYYL+IRKI+VG
Sbjct: 241 AYCLASRKFDDSAHSGQLILDSSGVKTSGLTYTSFRQNPSVSNHAYKEYYYLNIRKIIVG 300

Query: 395 NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESL 454
           NQAVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPV + VAQEFEKQLANRTRATDVE+L
Sbjct: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLDVVAQEFEKQLANRTRATDVETL 360

Query: 455 TGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGG 514
           TGLRPCFD+SK+KSV+FPELIFQFKGGAKWALPL+NYFALVSSSGVACLTVVTH TE GG
Sbjct: 361 TGLRPCFDVSKEKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHNTEDGG 420

Query: 515 GGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT 550
           GGGPSVILGAFQQQNFYVEYDLVNERLGFR+QTCT
Sbjct: 421 GGGPSVILGAFQQQNFYVEYDLVNERLGFRKQTCT 455

BLAST of Clc10G18000 vs. ExPASy TrEMBL
Match: A0A1S3B6B5 (aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103486666 PE=3 SV=1)

HSP 1 Score: 874.4 bits (2258), Expect = 2.5e-250
Identity = 422/455 (92.75%), Postives = 443/455 (97.36%), Query Frame = 0

Query: 95  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQI 154
           MASPSPLSFFYILLFSS+S+I+N+NPITLPL++ PHL SSDPLQ LTFLASAS+NRAH+I
Sbjct: 1   MASPSPLSFFYILLFSSLSAISNSNPITLPLNSSPHLSSSDPLQALTFLASASKNRAHRI 60

Query: 155 KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 214
           KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLC+ECSF
Sbjct: 61  KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCTECSF 120

Query: 215 PKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 274
           PKIDP GIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 275 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 334
           QYGSGSTAGLLLSETLDFP+KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPNKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 335 AYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG 394
           AYCLASRKFDDS HSG+LILDS+GVKT GLTYT FRQNPSVSNHAYKEYYYL+IRKI+VG
Sbjct: 241 AYCLASRKFDDSAHSGQLILDSSGVKTSGLTYTSFRQNPSVSNHAYKEYYYLNIRKIIVG 300

Query: 395 NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESL 454
           NQAVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPV + VAQEFEKQLANRTRATDVE+L
Sbjct: 301 NQAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVLDVVAQEFEKQLANRTRATDVETL 360

Query: 455 TGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGG 514
           TGLRPCFD+SK+KSV+FPELIFQFKGGAKWALPL+NYFALVSSSGVACLTVVTH TE GG
Sbjct: 361 TGLRPCFDVSKEKSVEFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHNTEDGG 420

Query: 515 GGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT 550
           GGGPSVILGAFQQQNFYVEYDLVNERLGFR+QTCT
Sbjct: 421 GGGPSVILGAFQQQNFYVEYDLVNERLGFRKQTCT 455

BLAST of Clc10G18000 vs. ExPASy TrEMBL
Match: A0A0A0LBI9 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G778440 PE=3 SV=1)

HSP 1 Score: 863.6 bits (2230), Expect = 4.4e-247
Identity = 419/457 (91.68%), Postives = 441/457 (96.50%), Query Frame = 0

Query: 95  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQI 154
           MASPSPLSFFY+LLFSS+S+IA++NPITLPL++FPHL S DPLQ LTFLAS+SQ RAHQI
Sbjct: 1   MASPSPLSFFYLLLFSSLSAIAHSNPITLPLNSFPHLSSPDPLQALTFLASSSQTRAHQI 60

Query: 155 KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 214
           KTPKSNSV KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF
Sbjct: 61  KTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSF 120

Query: 215 PKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 274
           PKIDP GIPRFVPKLSSSSKLVGCQNPKC+WIFGPDVKSQCRSCNPKTENCTQTCPAYVV
Sbjct: 121 PKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVV 180

Query: 275 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 334
           QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF
Sbjct: 181 QYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF 240

Query: 335 AYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG 394
           AYCLASRKFDDSPHSG+LILDSTGVK+ GLTYTPFRQNPSVSN+AYKEYYYL+IRKI+VG
Sbjct: 241 AYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVG 300

Query: 395 NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESL 454
           NQAVKV YK+LVPGPDGNGGSIIDSGSTFTFMDKPV E VA+EFEKQLAN TRATDVE+L
Sbjct: 301 NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETL 360

Query: 455 TGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTE--A 514
           TGLRPCFDISK+KSV FPELIFQFKGGAKWALPL+NYFALVSSSGVACLTVVTH+ E   
Sbjct: 361 TGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGG 420

Query: 515 GGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT 550
           GGGGGPSVILGAFQQQNFYVEYDLVN+RLGFRQQTC+
Sbjct: 421 GGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457

BLAST of Clc10G18000 vs. ExPASy TrEMBL
Match: A0A6J1IXY3 (probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111481640 PE=3 SV=1)

HSP 1 Score: 844.3 bits (2180), Expect = 2.8e-241
Identity = 412/457 (90.15%), Postives = 430/457 (94.09%), Query Frame = 0

Query: 95  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQI 154
           MA P PL FFYILL SSVS+IA+TNPIT+PL +FPH  SSDPLQTL FLASASQNRAHQI
Sbjct: 1   MAPPPPLCFFYILLVSSVSAIADTNPITIPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 155 KTPK--SNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 214
           K PK  SNSVSKSPLSPHSYGAYSTPLSFGTP QTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSESNSVSKSPLSPHSYGAYSTPLSFGTPPQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 215 SFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 274
           SFPKIDPAGIPRF+PKLSS+SKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY
Sbjct: 121 SFPKIDPAGIPRFIPKLSSTSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 275 VVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 334
           VVQYGSGSTAGLLLSETLDFPDKK  NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFPDKKFTNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 335 KFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKIL 394
           KFAYCLASRKFDDSPH+GELILDS+G KT GL+YTPFRQNPSVSNHAYKEYYYL+IRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLSYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 395 VGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVE 454
           VG +AVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQE EKQLANRTRATDVE
Sbjct: 301 VGKKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 455 SLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEA 514
           SLTGLRPCFDISKDKSV+FPEL FQ KGGAKW LPLSNYFALVSSSGVACLTVVTHKT A
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFQLKGGAKWGLPLSNYFALVSSSGVACLTVVTHKT-A 420

Query: 515 GGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT 550
             GGGPS+ILGAFQQQNFYVEYDLVN+++GFRQQTC+
Sbjct: 421 DSGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of Clc10G18000 vs. ExPASy TrEMBL
Match: A0A6J1F3G5 (probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC111441834 PE=3 SV=1)

HSP 1 Score: 842.0 bits (2174), Expect = 1.4e-240
Identity = 413/457 (90.37%), Postives = 429/457 (93.87%), Query Frame = 0

Query: 95  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQI 154
           MA P PL FFYILL SSVS+IA+TNPITLPL +FPH  SSDPLQTL FLASASQNRAHQI
Sbjct: 1   MAPPPPLCFFYILLLSSVSAIADTNPITLPLSSFPHHSSSDPLQTLNFLASASQNRAHQI 60

Query: 155 KTP--KSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC 214
           K P  KSNSVSKSPLSPHSYGAYSTPLSFGTP QTLHLIFDTGSSLVW PCTS+YLCSEC
Sbjct: 61  KAPKSKSNSVSKSPLSPHSYGAYSTPLSFGTPSQTLHLIFDTGSSLVWLPCTSKYLCSEC 120

Query: 215 SFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 274
           SFPKIDPA IPRF+PKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY
Sbjct: 121 SFPKIDPARIPRFIPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY 180

Query: 275 VVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 334
           VVQYGSGSTAGLLLSETLDFP+KKI NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK
Sbjct: 181 VVQYGSGSTAGLLLSETLDFPNKKITNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLK 240

Query: 335 KFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKIL 394
           KFAYCLASRKFDDSPH+GELILDS+G KT GLTYTPFRQNPSVSNHAYKEYYYL+IRKI 
Sbjct: 241 KFAYCLASRKFDDSPHAGELILDSSGAKTSGLTYTPFRQNPSVSNHAYKEYYYLTIRKIF 300

Query: 395 VGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVE 454
           VGN+AVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQE EKQLANRTRATDVE
Sbjct: 301 VGNKAVKVPYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEIEKQLANRTRATDVE 360

Query: 455 SLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEA 514
           SLTGLRPCFDISKDKSV+FPEL F  KGGAKWA PLSNYFALVSSSGVACLTVVTHK  A
Sbjct: 361 SLTGLRPCFDISKDKSVEFPELTFHLKGGAKWAPPLSNYFALVSSSGVACLTVVTHKA-A 420

Query: 515 GGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT 550
             GGGPS+ILGAFQQQNFYVEYDLVN+++GFRQQTC+
Sbjct: 421 ESGGGPSIILGAFQQQNFYVEYDLVNQKIGFRQQTCS 456

BLAST of Clc10G18000 vs. TAIR 10
Match: AT3G52500.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 533.5 bits (1373), Expect = 2.0e-151
Identity = 269/470 (57.23%), Postives = 337/470 (71.70%), Query Frame = 0

Query: 103 FFYILLFSSVSSIANTNPITLPLHAFPHLPSS--DPLQTLTFLASASQNRAHQIK----- 162
           FF+ L+F SV S      + LPL  F H   S  DP  +L  LA +S  RAH++K     
Sbjct: 6   FFFFLIFLSVVS-----AVKLPLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSI 65

Query: 163 ----------TPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTS 222
                     T  S +V KSPLS  SYG YS  LSFGTP QT+  +FDTGSSLVW PCTS
Sbjct: 66  KPDEDALSSTTTASATVVKSPLSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTS 125

Query: 223 RYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENC 282
           RYLCS C F  +DP  IPRF+PK SSSSK++GCQ+PKC +++GP+V  QCR C+P T NC
Sbjct: 126 RYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNV--QCRGCDPNTRNC 185

Query: 283 TQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESL 342
           T  CP Y++QYG GSTAG+L++E LDFPD  +P+FVVGCS +S  QP+GIAGFGRG  SL
Sbjct: 186 TVGCPPYILQYGLGSTAGVLITEKLDFPDLTVPDFVVGCSIISTRQPAGIAGFGRGPVSL 245

Query: 343 PSQMGLKKFAYCLASRKFDDSPHSGELILD-----STGVKTGGLTYTPFRQNPSVSNHAY 402
           PSQM LK+F++CL SR+FDD+  + +L LD     ++G KT GLTYTPFR+NP+VSN A+
Sbjct: 246 PSQMNLKRFSHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAF 305

Query: 403 KEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEK 462
            EYYYL++R+I VG + VK+ YKYL PG +G+GGSI+DSGSTFTFM++PVFE VA+EF  
Sbjct: 306 LEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFAS 365

Query: 463 QLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGV 522
           Q++N TR  D+E  TGL PCF+IS    V  PELIF+FKGGAK  LPLSNYF  V ++  
Sbjct: 366 QMSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDT 425

Query: 523 ACLTVVTHKT-EAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT 550
            CLTVV+ KT    GG GP++ILG+FQQQN+ VEYDL N+R GF ++ C+
Sbjct: 426 VCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468

BLAST of Clc10G18000 vs. TAIR 10
Match: AT4G16563.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 190.7 bits (483), Expect = 3.2e-48
Identity = 145/485 (29.90%), Postives = 214/485 (44.12%), Query Frame = 0

Query: 111 SVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPH 170
           SVSS++    + L         SS PL  L   +S S  R  +    +       P+S  
Sbjct: 21  SVSSLSTPLLLHLSHSLSTSKHSSSPLHLLKSSSSRSSARFRRHHHKQQQQQLSLPIS-- 80

Query: 171 SYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLS 230
           S   Y   LS G+    + L  DTGS LVWFPC   + C  C    + P+        LS
Sbjct: 81  SGSDYLISLSVGSSSSAVSLYLDTGSDLVWFPCRP-FTCILCESKPLPPSP----PSSLS 140

Query: 231 SSSKLVGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTENCTQT---CPAYVVQYGSGST 290
           SS+  V C +P C+        S  C   N      +T +C  +   CP +   YG GS 
Sbjct: 141 SSATTVSCSSPSCSAAHSSLPSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSL 200

Query: 291 AGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFA 350
              L S++L  P   + NF  GC+  ++ +P G+AGFGRG  SLP+Q+ +        F+
Sbjct: 201 VAKLYSDSLSLPSVSVSNFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFS 260

Query: 351 YCLASRKFDDS--PHSGELIL--------------------DSTGVKTGGLTYTPFRQNP 410
           YCL S  FD         LIL                    D    K     +T   +NP
Sbjct: 261 YCLVSHSFDSDRVRRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENP 320

Query: 411 SVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEA 470
               H Y  +Y +S++ I +G + +           +G GG ++DSG+TFT +    + +
Sbjct: 321 ---KHPY--FYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNS 380

Query: 471 VAQEFEKQLAN-RTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGG-AKWALPLSNY 530
           V +EF+ ++     RA  VE  +G+ PC+ +  +++V  P L+  F G  +   LP  NY
Sbjct: 381 VVEEFDSRVGRVHERADRVEPSSGMSPCYYL--NQTVKVPALVLHFAGNRSSVTLPRRNY 440

Query: 531 FALVSSSG--------VACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGF 549
           F      G        + CL ++    E+   GG   ILG +QQQ F V YDL+N R+GF
Sbjct: 441 FYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGF 491

BLAST of Clc10G18000 vs. TAIR 10
Match: AT5G45120.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 188.0 bits (476), Expect = 2.1e-47
Identity = 145/483 (30.02%), Postives = 214/483 (44.31%), Query Frame = 0

Query: 106 ILLFSSVSSIAN-TNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVS- 165
           + LF  ++ + N TN      H  P   SS      +FL       +  + TPKS +   
Sbjct: 8   LFLFLLITLLLNTTNKTQARQHKNPSSSSS------SFLVLTLTKSSVSLPTPKSQTQER 67

Query: 166 -KSPLSP---------HSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTS-RYLCSEC 225
            K PLS               Y   L+ GTP Q + +  DTGS L W PC +  + C EC
Sbjct: 68  IKKPLSSVDVVMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIEC 127

Query: 226 SFPKIDPAGIPR-FVPKLSSSSKLVGCQNPKCAWI------FGPDVKSQCRSCNPKTENC 285
              K +    P  F P  SS+S    C +  C  I      F P   + C         C
Sbjct: 128 YDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTC 187

Query: 286 TQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSES 345
            + CP++   YG G   +G+L  + L    + +P F  GC   +  +P GIAGFGRG  S
Sbjct: 188 VRPCPSFAYTYGEGGLISGILTRDILKARTRDVPRFSFGCVTSTYREPIGIAGFGRGLLS 247

Query: 346 LPSQMGL--KKFAYCLASRKFDDSPH-SGELILDSTGVK---TGGLTYTPFRQNPSVSNH 405
           LPSQ+G   K F++C    KF ++P+ S  LIL ++ +    T  L +TP    P     
Sbjct: 248 LPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTP----- 307

Query: 406 AYKEYYYLSIRKILVGNQAVKVLYKYLVPGPD--GNGGSIIDSGSTFTFMDKPVFEAVAQ 465
            Y   YY+ +  I +G           +   D  GNGG ++DSG+T+T + +P +  +  
Sbjct: 308 MYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLT 367

Query: 466 EFEKQLANRTRATDVESLTGLRPCF----------DISKDKSVDFPELIFQFKGGAKWAL 525
             +  +    RAT+ ES TG   C+           +  D  + FP + F F   A   L
Sbjct: 368 TLQSTI-TYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLL 427

Query: 526 PLSN-YFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQ 549
           P  N ++A+ + S  + +  +  +    G  GP+ + G+FQQQN  V YDL  ER+GF+ 
Sbjct: 428 PQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQA 478

BLAST of Clc10G18000 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 164.9 bits (416), Expect = 1.9e-40
Identity = 159/492 (32.32%), Postives = 217/492 (44.11%), Query Frame = 0

Query: 93  SSMASPSPLSFFYILLFS---SVSS-----IANTNPITLPLHAF----PHLPSSDPLQTL 152
           +S +S S L  F+++LFS   SVSS     I  T P  LP   F     H+ S   L  +
Sbjct: 2   ASSSSSSLLFPFFLILFSCLISVSSSRRSLIDRTLPKNLPRSGFRLSLRHVDSGKNLTKI 61

Query: 153 TFLASASQNRAHQI------------KTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTL 212
             +        H++              P   +  K+P    S G +   LS G P    
Sbjct: 62  QKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIKAPTHGGS-GEFLMELSIGNPAVKY 121

Query: 213 HLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFG 272
             I DTGS L+W  C     C+EC          P F P+ SSS   VGC +  C  +  
Sbjct: 122 SAIVDTGSDLIWTQCKP---CTECF-----DQPTPIFDPEKSSSYSKVGCSSGLCNAL-- 181

Query: 273 PDVKSQCRSCNPKTENCTQTCPAYVVQYGS-GSTAGLLLSETLDFPDK-KIPNFVVGCSF 332
                   +CN   + C      Y+  YG   ST GLL +ET  F D+  I     GC  
Sbjct: 182 -----PRSNCNEDKDAC-----EYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGV 241

Query: 333 LS----IHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDS--TGV- 392
            +      Q SG+ G GRG  SL SQ+   KF+YCL S   +DS  S  L + S  +G+ 
Sbjct: 242 ENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTS--IEDSEASSSLFIGSLASGIV 301

Query: 393 -KTGGLTYTPFRQNPS-VSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSII 452
            KTG        +  S + N     +YYL ++ I VG + + V         DG GG II
Sbjct: 302 NKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMII 361

Query: 453 DSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDI-SKDKSVDFPELIF 512
           DSG+T T++++  F+ + +EF  ++   +   D    TGL  CF +    K++  P++IF
Sbjct: 362 DSGTTITYLEETAFKVLKEEFTSRM---SLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIF 421

Query: 513 QFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDL 549
            FK GA   LP  NY    SS+GV CL +       G   G S I G  QQQNF V +DL
Sbjct: 422 HFK-GADLELPGENYMVADSSTGVLCLAM-------GSSNGMS-IFGNVQQQNFNVLHDL 458

BLAST of Clc10G18000 vs. TAIR 10
Match: AT2G42980.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 162.5 bits (410), Expect = 9.4e-40
Identity = 125/397 (31.49%), Postives = 179/397 (45.09%), Query Frame = 0

Query: 173 GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSS 232
           G Y   +  GTP +   LI DTGS L W  C   Y C   +    D        PK S+S
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYD--------PKTSAS 217

Query: 233 SKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 292
            K + C +P+C+ I  PD   QC S N       Q+CP Y   YG  S T G    ET  
Sbjct: 218 FKNITCNDPRCSLISSPDPPVQCESDN-------QSCP-YFYWYGDRSNTTGDFAVETFT 277

Query: 293 F---------PDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFAY 352
                      + K+ N + GC   +       SG+ G GRG  S  SQ+       F+Y
Sbjct: 278 VNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSY 337

Query: 353 CLASRKFDDSPHSGELIL--DSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG 412
           CL  R   ++  S +LI   D   +    L +T F        ++ + +YY+ I+ ILVG
Sbjct: 338 CLVDRN-SNTNVSSKLIFGEDKDLLNHTNLNFTSFVNG---KENSVETFYYIQIKSILVG 397

Query: 413 NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEF-EKQLANRTRATDVES 472
            +A+ +  +      DG+GG+IIDSG+T ++  +P +E +  +F EK   N     D   
Sbjct: 398 GKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPV 457

Query: 473 LTGLRPCFDIS--KDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTE 532
           L    PCF++S  ++ ++  PEL   F  G  W  P  N F  +S   + CL ++     
Sbjct: 458 LD---PCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSED-LVCLAIL----- 517

Query: 533 AGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC 549
            G       I+G +QQQNF++ YD    RLGF    C
Sbjct: 518 -GTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038905730.12.0e-25796.26probable aspartyl protease At4g16563 [Benincasa hispida][more]
XP_008442902.15.2e-25092.75PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo] >KAA0043829.1 aspart... [more]
XP_004136706.19.1e-24791.68probable aspartyl protease At4g16563 [Cucumis sativus] >KGN59188.1 hypothetical ... [more]
XP_022982947.15.7e-24190.15probable aspartyl protease At4g16563 [Cucurbita maxima][more]
XP_023528159.11.3e-24090.59probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Q940R44.5e-4729.90Probable aspartyl protease At4g16563 OS=Arabidopsis thaliana OX=3702 GN=At4g1656... [more]
Q9LNJ37.2e-3729.12Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Q766C21.8e-3532.64Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q766C33.7e-3328.67Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
O044962.3e-3028.64Aspartyl protease AED3 OS=Arabidopsis thaliana OX=3702 GN=AED3 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A5A7TRK22.5e-25092.75Aspartic proteinase nepenthesin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A1S3B6B52.5e-25092.75aspartic proteinase nepenthesin-2 OS=Cucumis melo OX=3656 GN=LOC103486666 PE=3 S... [more]
A0A0A0LBI94.4e-24791.68Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G77844... [more]
A0A6J1IXY32.8e-24190.15probable aspartyl protease At4g16563 OS=Cucurbita maxima OX=3661 GN=LOC111481640... [more]
A0A6J1F3G51.4e-24090.37probable aspartyl protease At4g16563 OS=Cucurbita moschata OX=3662 GN=LOC1114418... [more]
Match NameE-valueIdentityDescription
AT3G52500.12.0e-15157.23Eukaryotic aspartyl protease family protein [more]
AT4G16563.13.2e-4829.90Eukaryotic aspartyl protease family protein [more]
AT5G45120.12.1e-4730.02Eukaryotic aspartyl protease family protein [more]
AT2G03200.11.9e-4032.32Eukaryotic aspartyl protease family protein [more]
AT2G42980.19.4e-4031.49Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 415..426
score: 29.09
coord: 520..535
score: 30.89
coord: 181..201
score: 50.56
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 384..544
e-value: 1.4E-33
score: 116.0
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 158..353
e-value: 4.3E-34
score: 120.1
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 354..549
e-value: 1.3E-49
score: 170.3
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 169..548
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 175..352
e-value: 2.2E-29
score: 102.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..15
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..25
NoneNo IPR availablePANTHERPTHR47967:SF36BNACNNG47670D PROTEINcoord: 101..548
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 101..548
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 190..201
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 175..544
score: 35.38715
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 174..548
e-value: 4.31935E-79
score: 247.561

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc10G18000.2Clc10G18000.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity