Cla97C05G080940 (gene) Watermelon (97103) v2

Name: Cla97C05G080940
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: aspartyl protease family protein 2
Location: Cla97Chr05 : 840620 .. 842032 (+)
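The location string can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the `parse_location` helper and the `Location` type are hypothetical names, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
        return self.end - self.start + 1


# Matches strings shaped like "Cla97Chr05 : 840620 .. 842032 (+)".
_LOCATION_RE = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid : start .. end (strand)' string into typed fields."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cla97Chr05 : 840620 .. 842032 (+)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.span)
```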
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGCAAACACCAATTCATTTCCCTTTATCTTCTTCCTCCTCACTCTTCTCTCTTTCTCCACCGCCTTCTCCGATTTCCAAACCCTAATTCCCACGTCTCTTCCTTCCTCACCTTCCTTCTTACCCTCGGATTCCGAATCCTTTATATCCTCCGACGCCACCGAATCGGAGCTTGGCTTAACATTGCATCTCCACCATTTGGACGCTCTCTCTCTCAACCGAACGCCGGAGGAGCTCTTCCACCTCCGCCTTCAAAGAGACGCTCTCCGAGTCAAGAAGCTGAGTTCACTCGGTGCTTCCTCTCGAAATGTGAGCCAGGCCAGTGGGACCGGTTTCAGTAGCTCCGTAATCTCGGGACTCGCTCAGGGCAGCGGCGAGTATTTCACGCGCATCGGCGTCGGCACGCCGCCCAAGTATGTCTACATGGTACTCGACACCGGCAGTGACATCGTTTGGCTACAGTGTGCTCCTTGTAAGAATTGCTATTCTCAGACCGACCCTGTTTTCAACCCGGTTAAGTCCGGATCCTTCGCCAAGGTCCTCTGCCGGACGCCGCTGTGCCGTCGGCTTGAATCTCCGGGGTGCAACCAGCGCCAGACGTGTCTCTACCAAGTTTCTTACGGCGACGGTTCATACACAACCGGCGAGTTTGTCACCGAAACCCTCACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGTGGCCACGATAATGAAGGCTTGTTCGTTGGTGCGGCTGGGCTTTTAGGTCTCGGTCGGGGAGGGTTGTCGTTTCCGTCGCAAACCGGCCGGACTTTCAACCAGAAATTCTCTTACTGCTTGGTGGACCGGTCCGCCTCTTCCAAACCGTCTTCCGTCGTCTTCGGCAACTCCGCCGTCTCTCGAACCGCCCGGTTCACTCCTCTCCTTACAAACCCTAGGTTGGATACGTTTTACTACGTTGAACTGCTTGGAATCAGCGTCGGAGGAACGCCCGTCTCCGGCATCTCCGCTTCACATTTCAAGCTCGATCCGACCGGTAACGGTGGAGTAATCATCGATTGTGGTACTTCTGTTACTCGATTGAACCGACCAGCATACATTGCTCTGCGCGATGCCTTCCGTGCCGGAGCTTCGAGTTTGAAATCGGCGCCGGAGTTTTCTCTCTTCGATACTTGCTACGATCTGTCTGGGAAGACAACGGTGAAGGTCCCGACGGTGGTGCTGCACTTCAGAGGCGCTGACGTATCGTTACCGGCGTCCAATTATCTGATCCCGGTCGATGGCAGCGGCCGATTCTGCTTTGCCTTCGCCGGAACGACCAGTGGACTGTCGATTATCGGCAACATTCAGCAGCAAGGATTTCGGGTGGTGTACGATTTGGCGAATTCTCGGGTCGGATTTTCTCCGCGTGGTTGCGCCTAA

mRNA sequence

ATGGAAGCAAACACCAATTCATTTCCCTTTATCTTCTTCCTCCTCACTCTTCTCTCTTTCTCCACCGCCTTCTCCGATTTCCAAACCCTAATTCCCACGTCTCTTCCTTCCTCACCTTCCTTCTTACCCTCGGATTCCGAATCCTTTATATCCTCCGACGCCACCGAATCGGAGCTTGGCTTAACATTGCATCTCCACCATTTGGACGCTCTCTCTCTCAACCGAACGCCGGAGGAGCTCTTCCACCTCCGCCTTCAAAGAGACGCTCTCCGAGTCAAGAAGCTGAGTTCACTCGGTGCTTCCTCTCGAAATGTGAGCCAGGCCAGTGGGACCGGTTTCAGTAGCTCCGTAATCTCGGGACTCGCTCAGGGCAGCGGCGAGTATTTCACGCGCATCGGCGTCGGCACGCCGCCCAAGTATGTCTACATGGTACTCGACACCGGCAGTGACATCGTTTGGCTACAGTGTGCTCCTTGTAAGAATTGCTATTCTCAGACCGACCCTGTTTTCAACCCGGTTAAGTCCGGATCCTTCGCCAAGGTCCTCTGCCGGACGCCGCTGTGCCGTCGGCTTGAATCTCCGGGGTGCAACCAGCGCCAGACGTGTCTCTACCAAGTTTCTTACGGCGACGGTTCATACACAACCGGCGAGTTTGTCACCGAAACCCTCACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGTGGCCACGATAATGAAGGCTTGTTCGTTGGTGCGGCTGGGCTTTTAGGTCTCGGTCGGGGAGGGTTGTCGTTTCCGTCGCAAACCGGCCGGACTTTCAACCAGAAATTCTCTTACTGCTTGGTGGACCGGTCCGCCTCTTCCAAACCGTCTTCCGTCGTCTTCGGCAACTCCGCCGTCTCTCGAACCGCCCGGTTCACTCCTCTCCTTACAAACCCTAGGTTGGATACGTTTTACTACGTTGAACTGCTTGGAATCAGCGTCGGAGGAACGCCCGTCTCCGGCATCTCCGCTTCACATTTCAAGCTCGATCCGACCGGTAACGGTGGAGTAATCATCGATTGTGGTACTTCTGTTACTCGATTGAACCGACCAGCATACATTGCTCTGCGCGATGCCTTCCGTGCCGGAGCTTCGAGTTTGAAATCGGCGCCGGAGTTTTCTCTCTTCGATACTTGCTACGATCTGTCTGGGAAGACAACGGTGAAGGTCCCGACGGTGGTGCTGCACTTCAGAGGCGCTGACGTATCGTTACCGGCGTCCAATTATCTGATCCCGGTCGATGGCAGCGGCCGATTCTGCTTTGCCTTCGCCGGAACGACCAGTGGACTGTCGATTATCGGCAACATTCAGCAGCAAGGATTTCGGGTGGTGTACGATTTGGCGAATTCTCGGGTCGGATTTTCTCCGCGTGGTTGCGCCTAA

Coding sequence (CDS)

ATGGAAGCAAACACCAATTCATTTCCCTTTATCTTCTTCCTCCTCACTCTTCTCTCTTTCTCCACCGCCTTCTCCGATTTCCAAACCCTAATTCCCACGTCTCTTCCTTCCTCACCTTCCTTCTTACCCTCGGATTCCGAATCCTTTATATCCTCCGACGCCACCGAATCGGAGCTTGGCTTAACATTGCATCTCCACCATTTGGACGCTCTCTCTCTCAACCGAACGCCGGAGGAGCTCTTCCACCTCCGCCTTCAAAGAGACGCTCTCCGAGTCAAGAAGCTGAGTTCACTCGGTGCTTCCTCTCGAAATGTGAGCCAGGCCAGTGGGACCGGTTTCAGTAGCTCCGTAATCTCGGGACTCGCTCAGGGCAGCGGCGAGTATTTCACGCGCATCGGCGTCGGCACGCCGCCCAAGTATGTCTACATGGTACTCGACACCGGCAGTGACATCGTTTGGCTACAGTGTGCTCCTTGTAAGAATTGCTATTCTCAGACCGACCCTGTTTTCAACCCGGTTAAGTCCGGATCCTTCGCCAAGGTCCTCTGCCGGACGCCGCTGTGCCGTCGGCTTGAATCTCCGGGGTGCAACCAGCGCCAGACGTGTCTCTACCAAGTTTCTTACGGCGACGGTTCATACACAACCGGCGAGTTTGTCACCGAAACCCTCACCTTCCGGCGGACTAAAGTGGAGCGCGTAGCCCTAGGTTGTGGCCACGATAATGAAGGCTTGTTCGTTGGTGCGGCTGGGCTTTTAGGTCTCGGTCGGGGAGGGTTGTCGTTTCCGTCGCAAACCGGCCGGACTTTCAACCAGAAATTCTCTTACTGCTTGGTGGACCGGTCCGCCTCTTCCAAACCGTCTTCCGTCGTCTTCGGCAACTCCGCCGTCTCTCGAACCGCCCGGTTCACTCCTCTCCTTACAAACCCTAGGTTGGATACGTTTTACTACGTTGAACTGCTTGGAATCAGCGTCGGAGGAACGCCCGTCTCCGGCATCTCCGCTTCACATTTCAAGCTCGATCCGACCGGTAACGGTGGAGTAATCATCGATTGTGGTACTTCTGTTACTCGATTGAACCGACCAGCATACATTGCTCTGCGCGATGCCTTCCGTGCCGGAGCTTCGAGTTTGAAATCGGCGCCGGAGTTTTCTCTCTTCGATACTTGCTACGATCTGTCTGGGAAGACAACGGTGAAGGTCCCGACGGTGGTGCTGCACTTCAGAGGCGCTGACGTATCGTTACCGGCGTCCAATTATCTGATCCCGGTCGATGGCAGCGGCCGATTCTGCTTTGCCTTCGCCGGAACGACCAGTGGACTGTCGATTATCGGCAACATTCAGCAGCAAGGATTTCGGGTGGTGTACGATTTGGCGAATTCTCGGGTCGGATTTTCTCCGCGTGGTTGCGCCTAA

Protein sequence

MEANTNSFPFIFFLLTLLSFSTAFSDFQTLIPTSLPSSPSFLPSDSESFISSDATESELGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGASSRNVSQASGTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA
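
The protein sequence above corresponds to a translation of the CDS. As an illustration only, the sketch below translates a short prefix of the CDS with the standard genetic code; the `translate` function is a hypothetical helper, and the use of the standard code (rather than an organelle-specific table) is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the CDS shown above.
print(translate("ATGGAAGCAAACACCAATTCA"))
```

Passing the full CDS string instead of the prefix should reproduce the protein sequence listed above, provided the assumption about the genetic code holds.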
BLAST of Cla97C05G080940 vs. NCBI nr
Match: XP_004133810.1 (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus])

HSP 1 Score: 882.1 bits (2278), Expect = 8.4e-253
Identity = 444/471 (94.27%), Positives = 457/471 (97.03%), Query Frame = 0

Query: 1   MEANTNSFPFIFFLLTLLSFSTAFSDFQTLIPTSLPSSPSFLPSDSESFISSDATESELG 60
           ME NT S PFIFFLLT+LS +TAFSDFQTL  TSLPSSPSFLPSDS SF+SS+AT+SELG
Sbjct: 1   MEPNTISLPFIFFLLTVLSLATAFSDFQTLPLTSLPSSPSFLPSDSNSFLSSEATQSELG 60

Query: 61  LTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGASSRNVSQASG-TGFSSSVIS 120
           L LHLHHLDALS NRTPEELFHLRLQRDA+RVKKLSSLGA+SRN+S+  G TGFSSSVIS
Sbjct: 61  LELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSKPGGTTGFSSSVIS 120

Query: 121 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 180
           GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA
Sbjct: 121 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 180

Query: 181 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGH 240
           KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE+VALGCGH
Sbjct: 181 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGH 240

Query: 241 DNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT 300
           DNEGLFVGAAGLLGLGRGGLSFPSQ GRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT
Sbjct: 241 DNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT 300

Query: 301 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLN 360
           ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI+ASHFKLD TGNGGVIIDCGTSVTRLN
Sbjct: 301 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLN 360

Query: 361 RPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 420
           +PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY
Sbjct: 361 KPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 420

Query: 421 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA 471
           LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA+SRVGFSPRGCA
Sbjct: 421 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471
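
The identity and positive percentages on each HSP summary line are the ratio of the count to the alignment length. The sketch below parses one such line and recomputes both percentages; `parse_hsp_stats` is a hypothetical helper, and the pattern is written to tolerate a spelling variant of the positives label.

```python
import re

# Matches the identity/positives part of an HSP summary line.
# "Posi?tives" tolerates a spelling variant of the label.
_HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute the percentages."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, align_len, _, positives, pos_len, _ = match.groups()
    return {
        "identities": int(identities),
        "positives": int(positives),
        "align_len": int(align_len),
        "identity_pct": round(100 * int(identities) / int(align_len), 2),
        "positive_pct": round(100 * int(positives) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 444/471 (94.27%), Positives = 457/471 (97.03%), Query Frame = 0"
)
print(stats)
```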

BLAST of Cla97C05G080940 vs. NCBI nr
Match: KGN56421.1 (Aspartic proteinase nepenthesin-1 [Cucumis sativus])

HSP 1 Score: 882.1 bits (2278), Expect = 8.4e-253
Identity = 444/471 (94.27%), Positives = 457/471 (97.03%), Query Frame = 0

Query: 1   MEANTNSFPFIFFLLTLLSFSTAFSDFQTLIPTSLPSSPSFLPSDSESFISSDATESELG 60
           ME NT S PFIFFLLT+LS +TAFSDFQTL  TSLPSSPSFLPSDS SF+SS+AT+SELG
Sbjct: 42  MEPNTISLPFIFFLLTVLSLATAFSDFQTLPLTSLPSSPSFLPSDSNSFLSSEATQSELG 101

Query: 61  LTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGASSRNVSQASG-TGFSSSVIS 120
           L LHLHHLDALS NRTPEELFHLRLQRDA+RVKKLSSLGA+SRN+S+  G TGFSSSVIS
Sbjct: 102 LELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSKPGGTTGFSSSVIS 161

Query: 121 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 180
           GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA
Sbjct: 162 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 221

Query: 181 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGH 240
           KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE+VALGCGH
Sbjct: 222 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGH 281

Query: 241 DNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT 300
           DNEGLFVGAAGLLGLGRGGLSFPSQ GRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT
Sbjct: 282 DNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT 341

Query: 301 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLN 360
           ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI+ASHFKLD TGNGGVIIDCGTSVTRLN
Sbjct: 342 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLN 401

Query: 361 RPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 420
           +PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY
Sbjct: 402 KPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 461

Query: 421 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA 471
           LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA+SRVGFSPRGCA
Sbjct: 462 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 512
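
The identity count for an alignment block can be checked directly from the paired Query and Sbjct rows. The following sketch counts matching columns for the last block of the alignment above; `count_identities` is a hypothetical helper, and gap columns (written as `-`) are assumed never to count as identities.

```python
def count_identities(query: str, subject: str) -> int:
    """Count alignment columns where both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same number of columns")
    # A column counts only if the residues match and it is not a gap.
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# Final block of the alignment above (Query and Sbjct rows).
query_row = "LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA"
sbjct_row = "LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA"

print(count_identities(query_row, sbjct_row), "of", len(query_row), "columns identical")
```

Summing this count over every block of an HSP gives the numerator of its Identity figure.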

BLAST of Cla97C05G080940 vs. NCBI nr
Match: XP_008437888.1 (PREDICTED: aspartyl protease family protein 2 [Cucumis melo])

HSP 1 Score: 880.9 bits (2275), Expect = 1.9e-252
Identity = 446/472 (94.49%), Positives = 458/472 (97.03%), Query Frame = 0

Query: 1   MEANTNSFPFIFF-LLTLLSFSTAFSDFQTLIPTSLPSSPSFLPSDSESFISSDATESEL 60
           MEANT S PFIFF LL +LS STAFSDFQTLI  SLPSSPSFLPSDS SF+SS+ATE+EL
Sbjct: 3   MEANTISLPFIFFLLLAILSLSTAFSDFQTLILRSLPSSPSFLPSDSNSFLSSEATETEL 62

Query: 61  GLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGASSRNVSQASG-TGFSSSVI 120
           GL LHLHHLDALS NRTPEELFHLRLQRDA+RVKKLSSLGA+SRN+S+ SG TGFSSSVI
Sbjct: 63  GLELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSRPSGTTGFSSSVI 122

Query: 121 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 180
           SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF
Sbjct: 123 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 182

Query: 181 AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCG 240
           AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE+VALGCG
Sbjct: 183 AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCG 242

Query: 241 HDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSR 300
           HDNEGLFVGAAGLLGLGRGGLSFPSQ GRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSR
Sbjct: 243 HDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSR 302

Query: 301 TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRL 360
           TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGIS+SHFKLD TGNGGVIIDCGTSVTRL
Sbjct: 303 TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISSSHFKLDRTGNGGVIIDCGTSVTRL 362

Query: 361 NRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN 420
           N+PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN
Sbjct: 363 NKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN 422

Query: 421 YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA 471
           YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA+SRVGFSPRGCA
Sbjct: 423 YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 474

BLAST of Cla97C05G080940 vs. NCBI nr
Match: XP_022980038.1 (aspartyl protease family protein 2-like [Cucurbita maxima])

HSP 1 Score: 849.7 bits (2194), Expect = 4.6e-243
Identity = 432/474 (91.14%), Positives = 446/474 (94.09%), Query Frame = 0

Query: 1   MEANTNSFPFIFFLLTLLSFSTAFSDFQTLIPTSLPSSPSFLPSD----SESFISSDATE 60
           M A T+ FPFIFFLLTLL  STAFSDFQTL+P  LP+SPSFL  +    S+SF SS+ATE
Sbjct: 1   MVAKTSPFPFIFFLLTLLPLSTAFSDFQTLVPRPLPTSPSFLAPESTEGSDSF-SSEATE 60

Query: 61  SELGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGASSRNVSQASGTGFSSS 120
           SE GL LHLHHLD+LSL+RTPEELFHLRLQRDALRV KLS L A+SRNVS+ASGTGFSSS
Sbjct: 61  SEPGLALHLHHLDSLSLSRTPEELFHLRLQRDALRVNKLSLLAAASRNVSRASGTGFSSS 120

Query: 121 VISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSG 180
           VISGLAQGSGEYFTRIGVGTPP+YVY+VLDTGSDIVWLQCAPCKNCYSQTDPVF+PVKSG
Sbjct: 121 VISGLAQGSGEYFTRIGVGTPPRYVYLVLDTGSDIVWLQCAPCKNCYSQTDPVFDPVKSG 180

Query: 181 SFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG 240
           SF+KVLCRTPLC RLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG
Sbjct: 181 SFSKVLCRTPLCGRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG 240

Query: 241 CGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAV 300
           CGHDNEGLFVGAAGLLGLGRGGLSFPSQTGR FNQKFSYCLVDRSASSKPSSVVFGNSAV
Sbjct: 241 CGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRAFNQKFSYCLVDRSASSKPSSVVFGNSAV 300

Query: 301 SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVT 360
           SRTARFTPLLTNPRLDTFYYVELLGISVGG PVSGIS  HFKLD TGNGGVIIDCGTSVT
Sbjct: 301 SRTARFTPLLTNPRLDTFYYVELLGISVGGRPVSGISPLHFKLDSTGNGGVIIDCGTSVT 360

Query: 361 RLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPA 420
           RLNRPAYIALRDAFRAGASSLKSA EFSLFDTCYDLSGKTTVKVPTVVLHFR ADVSLPA
Sbjct: 361 RLNRPAYIALRDAFRAGASSLKSAAEFSLFDTCYDLSGKTTVKVPTVVLHFRNADVSLPA 420

Query: 421 SNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA 471
           SNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA SRVGFSPRGCA
Sbjct: 421 SNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 473

BLAST of Cla97C05G080940 vs. NCBI nr
Match: XP_022924595.1 (aspartyl protease family protein 2-like [Cucurbita moschata])

HSP 1 Score: 848.6 bits (2191), Expect = 1.0e-242
Identity = 432/474 (91.14%), Positives = 443/474 (93.46%), Query Frame = 0

Query: 1   MEANTNSFPFIFFLLTLLSFSTAFSDFQTLIPTSLPSSPSFLP----SDSESFISSDATE 60
           M A T+ F FIF LLTLLS STAFSDFQTL+P  LP+SPS L      DS+SF SS+ATE
Sbjct: 1   MVAKTSPFTFIFVLLTLLSLSTAFSDFQTLVPRPLPTSPSSLAPESNEDSDSFFSSEATE 60

Query: 61  SELGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGASSRNVSQASGTGFSSS 120
           SE GL LHLHHLD+LSL+RTPEELFHLRLQRDALRV KLS L A S NVS+ASGTGFSSS
Sbjct: 61  SEPGLALHLHHLDSLSLSRTPEELFHLRLQRDALRVNKLSLLAAVSPNVSRASGTGFSSS 120

Query: 121 VISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSG 180
           VISGLAQGSGEYFTRIGVGTPP+YVYMVLDTGSDIVWLQCAPCKNCYSQTDPVF+PVKSG
Sbjct: 121 VISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFDPVKSG 180

Query: 181 SFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG 240
           SF+KVLCRTPLC RLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG
Sbjct: 181 SFSKVLCRTPLCGRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALG 240

Query: 241 CGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAV 300
           CGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAV
Sbjct: 241 CGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAV 300

Query: 301 SRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVT 360
           SRTARFTPLLTNPRLDTFYYVELLGISVGG PVSGIS  HFKLD TGNGGVIIDCGTSVT
Sbjct: 301 SRTARFTPLLTNPRLDTFYYVELLGISVGGRPVSGISPLHFKLDSTGNGGVIIDCGTSVT 360

Query: 361 RLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPA 420
           RLNRPAYIALRDAFRAGASSLKSA EFSLFDTCYDLSGKTTVKVPTVVLHFR ADVSLPA
Sbjct: 361 RLNRPAYIALRDAFRAGASSLKSAAEFSLFDTCYDLSGKTTVKVPTVVLHFRNADVSLPA 420

Query: 421 SNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA 471
           SNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA SRVGFSPRGCA
Sbjct: 421 SNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLAGSRVGFSPRGCA 474

BLAST of Cla97C05G080940 vs. TrEMBL
Match: tr|A0A0A0L8K0|A0A0A0L8K0_CUCSA (Aspartic proteinase nepenthesin-1 OS=Cucumis sativus OX=3659 GN=Csa_3G119540 PE=3 SV=1)

HSP 1 Score: 882.1 bits (2278), Expect = 5.5e-253
Identity = 444/471 (94.27%), Positives = 457/471 (97.03%), Query Frame = 0

Query: 1   MEANTNSFPFIFFLLTLLSFSTAFSDFQTLIPTSLPSSPSFLPSDSESFISSDATESELG 60
           ME NT S PFIFFLLT+LS +TAFSDFQTL  TSLPSSPSFLPSDS SF+SS+AT+SELG
Sbjct: 42  MEPNTISLPFIFFLLTVLSLATAFSDFQTLPLTSLPSSPSFLPSDSNSFLSSEATQSELG 101

Query: 61  LTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGASSRNVSQASG-TGFSSSVIS 120
           L LHLHHLDALS NRTPEELFHLRLQRDA+RVKKLSSLGA+SRN+S+  G TGFSSSVIS
Sbjct: 102 LELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSKPGGTTGFSSSVIS 161

Query: 121 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 180
           GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA
Sbjct: 162 GLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFA 221

Query: 181 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGH 240
           KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE+VALGCGH
Sbjct: 222 KVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGH 281

Query: 241 DNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT 300
           DNEGLFVGAAGLLGLGRGGLSFPSQ GRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT
Sbjct: 282 DNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRT 341

Query: 301 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLN 360
           ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI+ASHFKLD TGNGGVIIDCGTSVTRLN
Sbjct: 342 ARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLN 401

Query: 361 RPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 420
           +PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY
Sbjct: 402 KPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNY 461

Query: 421 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA 471
           LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA+SRVGFSPRGCA
Sbjct: 462 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 512
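
The TrEMBL and Swiss-Prot match lines carry several fields in one string: database, accession, entry name, description, organism (OS), taxonomy ID (OX), gene name (GN), protein existence level (PE) and sequence version (SV). The sketch below splits one such line into a dictionary; `parse_uniprot_match` is a hypothetical helper, and the pattern assumes the fields always appear in the order seen on this page.

```python
import re

# Fields are assumed to appear in the order shown on this page; GN is optional.
_MATCH_RE = re.compile(
    r"^(?:Match:\s*)?(sp|tr)\|([^|]+)\|(\S+) \("
    r"(.*?)\s+OS=(.*?)\s+OX=(\d+)"
    r"(?:\s+GN=(\S+))?"
    r"\s+PE=(\d)\s+SV=(\d+)\)$"
)


def parse_uniprot_match(line: str) -> dict:
    """Split a 'Match: db|accession|entry (description OS=... SV=n)' line."""
    match = _MATCH_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized match line: {line!r}")
    db, acc, entry, desc, organism, taxid, gene, pe, sv = match.groups()
    return {
        "database": db,
        "accession": acc,
        "entry_name": entry,
        "description": desc,
        "organism": organism,
        "taxonomy_id": int(taxid),
        "gene": gene,  # None when the line has no GN field
        "protein_existence": int(pe),
        "sequence_version": int(sv),
    }


hit = parse_uniprot_match(
    "Match: tr|A0A0A0L8K0|A0A0A0L8K0_CUCSA (Aspartic proteinase nepenthesin-1 "
    "OS=Cucumis sativus OX=3659 GN=Csa_3G119540 PE=3 SV=1)"
)
print(hit["accession"], hit["organism"], hit["gene"])
```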

BLAST of Cla97C05G080940 vs. TrEMBL
Match: tr|A0A1S3AV66|A0A1S3AV66_CUCME (aspartyl protease family protein 2 OS=Cucumis melo OX=3656 GN=LOC103483183 PE=3 SV=1)

HSP 1 Score: 880.9 bits (2275), Expect = 1.2e-252
Identity = 446/472 (94.49%), Positives = 458/472 (97.03%), Query Frame = 0

Query: 1   MEANTNSFPFIFF-LLTLLSFSTAFSDFQTLIPTSLPSSPSFLPSDSESFISSDATESEL 60
           MEANT S PFIFF LL +LS STAFSDFQTLI  SLPSSPSFLPSDS SF+SS+ATE+EL
Sbjct: 3   MEANTISLPFIFFLLLAILSLSTAFSDFQTLILRSLPSSPSFLPSDSNSFLSSEATETEL 62

Query: 61  GLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGASSRNVSQASG-TGFSSSVI 120
           GL LHLHHLDALS NRTPEELFHLRLQRDA+RVKKLSSLGA+SRN+S+ SG TGFSSSVI
Sbjct: 63  GLELHLHHLDALSFNRTPEELFHLRLQRDAIRVKKLSSLGATSRNLSRPSGTTGFSSSVI 122

Query: 121 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 180
           SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF
Sbjct: 123 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 182

Query: 181 AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCG 240
           AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE+VALGCG
Sbjct: 183 AKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCG 242

Query: 241 HDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSR 300
           HDNEGLFVGAAGLLGLGRGGLSFPSQ GRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSR
Sbjct: 243 HDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSR 302

Query: 301 TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRL 360
           TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGIS+SHFKLD TGNGGVIIDCGTSVTRL
Sbjct: 303 TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISSSHFKLDRTGNGGVIIDCGTSVTRL 362

Query: 361 NRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN 420
           N+PAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN
Sbjct: 363 NKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASN 422

Query: 421 YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA 471
           YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLA+SRVGFSPRGCA
Sbjct: 423 YLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 474

BLAST of Cla97C05G080940 vs. TrEMBL
Match: tr|A0A2P5AIW4|A0A2P5AIW4_PARAD (Aspartic peptidase OS=Parasponia andersonii OX=3476 GN=PanWU01x14_327890 PE=3 SV=1)

HSP 1 Score: 693.0 bits (1787), Expect = 4.8e-196
Identity = 355/476 (74.58%), Positives = 401/476 (84.24%), Query Frame = 0

Query: 2   EANTNSFPFIFFLLTLLSFSTAFSD---FQTLIPTSLPSSPSFLPSDSESFISSDA--TE 61
           +A    F F  F    ++ STA +D   +QTL+  +L + P+    +S+   S     +E
Sbjct: 4   KARNTHFFFFSFSAIFVTLSTALTDPIQYQTLVVNTLSTPPTLSWPESQLSGSDPGPDSE 63

Query: 62  SELGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGASSR--NVSQASGTGFS 121
           +E  L+L LHHLDALS +++PE+LF LRLQRDA+RVK L SL AS+    V   SG+GFS
Sbjct: 64  TESTLSLQLHHLDALSTDQSPEQLFDLRLQRDAMRVKSLYSLVASTNGSRVGYGSGSGFS 123

Query: 122 SSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVK 181
           SSVISGLAQGSGEYFTR+GVGTPP+YVYMVLDTGSD+VWLQCAPCK CY+Q DPVF+P K
Sbjct: 124 SSVISGLAQGSGEYFTRLGVGTPPRYVYMVLDTGSDVVWLQCAPCKKCYTQADPVFDPAK 183

Query: 182 SGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVA 241
           S SFA + C +PLCR+L+SPGCNQR+ CLYQVSYGDGS+TTGEF TETLTFRRT+V RVA
Sbjct: 184 SRSFAGIPCGSPLCRKLDSPGCNQRKQCLYQVSYGDGSFTTGEFSTETLTFRRTRVARVA 243

Query: 242 LGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNS 301
           LGCGHDNEGLFVGAAGLLGLGRG LSFPSQTG  FN+KFSYCLVDRSA+SKPSSVVFG+S
Sbjct: 244 LGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGYRFNRKFSYCLVDRSATSKPSSVVFGDS 303

Query: 302 AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTS 361
           AVSRTARFTPLL NP+LDTFYY+EL+GISVGG  V GISA+ FKLD  GNGGVIID GTS
Sbjct: 304 AVSRTARFTPLLANPKLDTFYYLELVGISVGGARVPGISAALFKLDNAGNGGVIIDSGTS 363

Query: 362 VTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSL 421
           VTRL RPAY+ALRD+FRAGAS+LK APEFSLFDTCYDLSGK+ VKVPTVVLHFRGADVSL
Sbjct: 364 VTRLTRPAYLALRDSFRAGASNLKRAPEFSLFDTCYDLSGKSEVKVPTVVLHFRGADVSL 423

Query: 422 PASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA 471
           PA+NYLIPVD SG FCFAFAGT SGLSIIGNIQQQGFRVVYDLA SRVGF+PRGCA
Sbjct: 424 PATNYLIPVDSSGTFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRVGFAPRGCA 479

BLAST of Cla97C05G080940 vs. TrEMBL
Match: tr|A0A251LUR5|A0A251LUR5_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_01G245800 PE=3 SV=1)

HSP 1 Score: 691.8 bits (1784), Expect = 1.1e-195
Identity = 362/482 (75.10%), Positives = 399/482 (82.78%), Query Frame = 0

Query: 1   MEANTNSFPFIFFLLTLLSFSTAFSD---FQTLIPTSLPSSPSF----LPSDSESFISSD 60
           ME         F     LS ST  S    + TL+   LPS P+       S+SE+  +S+
Sbjct: 1   MEGKARPVLLFFSFTIFLSLSTTSSSSLRYHTLVLNPLPSQPTLSWPASDSESETLTASN 60

Query: 61  ATE---SELGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGASSRNVSQASG 120
           ATE    E  L++ LHHLDALSLN+TP++LF LRL RDA RV  LSSL AS+        
Sbjct: 61  ATEIDPEESTLSVQLHHLDALSLNKTPQQLFCLRLHRDASRVVALSSLAASAAAAPGGRV 120

Query: 121 T-GFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPV 180
           T GFSSSVISGLAQGSGEYFTRIGVGTPP+YVYMVLDTGSDIVW+QCAPCK CYSQ+DPV
Sbjct: 121 TGGFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKKCYSQSDPV 180

Query: 181 FNPVKSGSFAKVLCRTPLCRRLESPGCN-QRQTCLYQVSYGDGSYTTGEFVTETLTFRRT 240
           F+P KS SFA + C +PLC RL+SPGCN Q+QTC+YQVSYGDGS+T G+F TETLTFRRT
Sbjct: 181 FDPRKSRSFAGIPCGSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFATETLTFRRT 240

Query: 241 KVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSS 300
           +V RVALGCGHDNEGLFVGAAGLLGLGRG LSFPSQTGR FN+KFSYCLVDRSASSKPSS
Sbjct: 241 RVGRVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNRKFSYCLVDRSASSKPSS 300

Query: 301 VVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVI 360
           VVFG+SA+SRTARFTPL++NP+LDTFYYVELLGISVGGT V GI+AS FKLD TGNGGVI
Sbjct: 301 VVFGDSAISRTARFTPLISNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVI 360

Query: 361 IDCGTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFR 420
           ID GTSVTRL RPAYIALRDAFR GA+SLK APEFSLFDTC+DLSG+T VKVPTVVLHFR
Sbjct: 361 IDSGTSVTRLTRPAYIALRDAFRVGATSLKKAPEFSLFDTCFDLSGQTEVKVPTVVLHFR 420

Query: 421 GADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRG 471
           GADVSLPASNYLIPVD +G FCFAFAGT SGLSIIGNIQQQGFRVVYDLA SRVGF+PRG
Sbjct: 421 GADVSLPASNYLIPVDSNGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLAGSRVGFAPRG 480

BLAST of Cla97C05G080940 vs. TrEMBL
Match: tr|A0A2I4EG53|A0A2I4EG53_9ROSI (aspartyl protease family protein 2 OS=Juglans regia OX=51240 GN=LOC108989274 PE=3 SV=1)

HSP 1 Score: 691.4 bits (1783), Expect = 1.4e-195
Identity = 354/463 (76.46%), Positives = 393/463 (84.88%), Query Frame = 0

Query: 11  IFFLLTLLSFSTAFSDFQTLIPTSLPSSP---SFLPSDSESFISSDATESELGLTLHLHH 70
           IFF    +S ST+   +QTL+   L ++P   S+  S+SES +S     +    TL LHH
Sbjct: 15  IFFXXXSISSSTSLR-YQTLVLNPLSTTPHSLSWPESESESVVSDSTVAT---TTLELHH 74

Query: 71  LDALSLNRTPEELFHLRLQRDALRVKKLSSLGASSRNVSQASGTGFSSSVISGLAQGSGE 130
           LD+LSLN+TPE+LFHLRLQRDA RVK L+SL A   N S+A G GFSSSVISGLAQGSGE
Sbjct: 75  LDSLSLNKTPEQLFHLRLQRDAFRVKALTSLAAVG-NRSRAHGAGFSSSVISGLAQGSGE 134

Query: 131 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPL 190
           YFTRIGVGTPPKYVYMVLDTGSD+VW+QCAPC+ CYSQ DPVF+P KS SFA + C +PL
Sbjct: 135 YFTRIGVGTPPKYVYMVLDTGSDVVWVQCAPCRKCYSQVDPVFDPRKSRSFAGISCGSPL 194

Query: 191 CRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVG 250
           C +L+SPGCN R+TCLYQVSYGDGS+TTG+F TETLTFR T+V RVALGCGH+N+GLFVG
Sbjct: 195 CLKLDSPGCNSRKTCLYQVSYGDGSFTTGDFSTETLTFRGTRVGRVALGCGHNNQGLFVG 254

Query: 251 AAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLT 310
           AAGLLGLGRG LSFPSQTGR FN+KFSYCLVDRSASS+PSS+VFG+ AVSRTARFTPL+ 
Sbjct: 255 AAGLLGLGRGRLSFPSQTGRQFNRKFSYCLVDRSASSRPSSIVFGDPAVSRTARFTPLIA 314

Query: 311 NPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALR 370
           NP+LDTFYYVEL+GISVGGTPV GISAS FKLD TGNGGVIID GTSVTRL RPAY ALR
Sbjct: 315 NPKLDTFYYVELVGISVGGTPVPGISASFFKLDRTGNGGVIIDSGTSVTRLTRPAYNALR 374

Query: 371 DAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSG 430
           DAFR G SSLK A +FSLFDTCYDLSGKT VKVPTVVLHFRGADV LPA+NYLIPVD  G
Sbjct: 375 DAFRIGTSSLKRASDFSLFDTCYDLSGKTEVKVPTVVLHFRGADVPLPATNYLIPVDSDG 434

Query: 431 RFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA 471
            FCFAFAGT SGLSI+GNIQQQGFRVVYDLA SRVGFSPRGCA
Sbjct: 435 TFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAGSRVGFSPRGCA 472

BLAST of Cla97C05G080940 vs. Swiss-Prot
Match: sp|Q9LNJ3|APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 630.6 bits (1625), Expect = 1.4e-179
Identity = 333/477 (69.81%), Positives = 379/477 (79.45%), Query Frame = 0

Query: 8   FPFIFFLLTLLSFSTAFSDFQTLIPT--SLP-SSPSFLPSDSE-------SFISSDATES 67
           F   FF L+L SFS      QTL P   SLP +SP     DS+                 
Sbjct: 10  FSLCFFFLSLPSFS-XXXXXQTLFPNSHSLPCASPVSFQPDSDXXXXXXXXXXXXXXXXX 69

Query: 68  ELGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGAS--SRNVSQASGT-GFS 127
                 +L H+DALS N+TP+ELF  RLQRD+ RVK +++L A    RNV+ A    GFS
Sbjct: 70  XXXXXXNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGFS 129

Query: 128 SSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVK 187
           SSV+SGL+QGSGEYFTR+GVGTP +YVYMVLDTGSDIVWLQCAPC+ CYSQ+DP+F+P K
Sbjct: 130 SSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRK 189

Query: 188 SGSFAKVLCRTPLCRRLESPGCN-QRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERV 247
           S ++A + C +P CRRL+S GCN +R+TCLYQVSYGDGS+T G+F TETLTFRR +V+ V
Sbjct: 190 SKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGV 249

Query: 248 ALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGN 307
           ALGCGHDNEGLFVGAAGLLGLG+G LSFP QTG  FNQKFSYCLVDRSASSKPSSVVFGN
Sbjct: 250 ALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGN 309

Query: 308 SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGT 367
           +AVSR ARFTPLL+NP+LDTFYYV LLGISVGGT V G++AS FKLD  GNGGVIID GT
Sbjct: 310 AAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 369

Query: 368 SVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVS 427
           SVTRL RPAYIA+RDAFR GA +LK AP+FSLFDTC+DLS    VKVPTVVLHFRGADVS
Sbjct: 370 SVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVS 429

Query: 428 LPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA 471
           LPA+NYLIPVD +G+FCFAFAGT  GLSIIGNIQQQGFRVVYDLA+SRVGF+P GCA
Sbjct: 430 LPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of Cla97C05G080940 vs. Swiss-Prot
Match: sp|Q9LS40|ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 394.8 bits (1013), Expect = 1.3e-108
Identity = 229/470 (48.72%), Positives = 284/470 (60.43%), Query Frame = 0

Query: 16  TLLSFSTAFSDFQTLIPTSLPSSPSFLPSDSESFISSDATESELGLTLHLHHLDALSLNR 75
           T+LS     S   T  P SL S P F  S            S L L LH       S ++
Sbjct: 49  TILSLDPTRSSLTTTKPESL-SDPVFFNS-----------SSPLSLELHSRDTFVASQHK 108

Query: 76  TPEELFHLRLQRDALRVKKL-------------SSLGASSRNVSQASGTGFSSSVISGLA 135
             + L   RL+RD+ RV  +             S L       ++      ++ V+SG +
Sbjct: 109 DYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGAS 168

Query: 136 QGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVL 195
           QGSGEYF+RIGVGTP K +Y+VLDTGSD+ W+QC PC +CY Q+DPVFNP  S ++  + 
Sbjct: 169 QGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLT 228

Query: 196 CRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRT-KVERVALGCGHDN 255
           C  P C  LE+  C   + CLYQVSYGDGS+T GE  T+T+TF  + K+  VALGCGHDN
Sbjct: 229 CSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDN 288

Query: 256 EGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTAR 315
           EGLF GAAGLLGLG G LS  +Q   T    FSYCLVDR  S K SS+ F +  +     
Sbjct: 289 EGLFTGAAGLLGLGGGVLSITNQMKAT---SFSYCLVDRD-SGKSSSLDFNSVQLGGGDA 348

Query: 316 FTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRP 375
             PLL N ++DTFYYV L G SVGG  V  +  + F +D +G+GGVI+DCGT+VTRL   
Sbjct: 349 TAPLLRNKKIDTFYYVGLSGFSVGGEKVV-LPDAIFDVDASGSGGVILDCGTAVTRLQTQ 408

Query: 376 AYIALRDAF-RAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA-DVSLPASNY 435
           AY +LRDAF +   +  K +   SLFDTCYD S  +TVKVPTV  HF G   + LPA NY
Sbjct: 409 AYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNY 468

Query: 436 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGC 470
           LIPVD SG FCFAFA T+S LSIIGN+QQQG R+ YDL+ + +G S   C
Sbjct: 469 LIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Cla97C05G080940 vs. Swiss-Prot
Match: sp|Q9LHE3|ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 390.6 bits (1002), Expect = 2.5e-107
Identity = 216/453 (47.68%), Positives = 282/453 (62.25%), Query Frame = 0

Query: 24  FSDFQTLIPTSLPSSPSFLPSDSESFISSDATESELGLTLHLHHLDALS--LNRTPEELF 83
           F DFQ +     P + +    D  +   SD  ES    TL L H D       R      
Sbjct: 24  FPDFQIIDVLQPPLTVTATLPDFNNTHFSD--ESSSKYTLRLLHRDRFPSVTYRNHHHRL 83

Query: 84  HLRLQRDALRV----KKLSSLGASSRNVSQASGTGFSSSVISGLAQGSGEYFTRIGVGTP 143
           H R++RD  RV    +++S     S + S+     F S ++SG+ QGSGEYF RIGVG+P
Sbjct: 84  HARMRRDTDRVSAILRRISGKVIPSSD-SRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSP 143

Query: 144 PKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCN 203
           P+  YMV+D+GSD+VW+QC PCK CY Q+DPVF+P KSGS+  V C + +C R+E+ GC+
Sbjct: 144 PRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCH 203

Query: 204 QRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRG 263
               C Y+V YGDGSYT G    ETLTF +T V  VA+GCGH N G+F+GAAGLLG+G G
Sbjct: 204 S-GGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGG 263

Query: 264 GLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYV 323
            +SF  Q        F YCLV R   S   S+VFG  A+   A + PL+ NPR  +FYYV
Sbjct: 264 SMSFVGQLSGQTGGAFGYCLVSRGTDS-TGSLVFGREALPVGASWVPLVRNPRAPSFYYV 323

Query: 324 ELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSL 383
            L G+ VGG  +  +    F L  TG+GGV++D GT+VTRL   AY+A RD F++  ++L
Sbjct: 324 GLKGLGVGGVRIP-LPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANL 383

Query: 384 KSAPEFSLFDTCYDLSGKTTVKVPTVVLHF-RGADVSLPASNYLIPVDGSGRFCFAFAGT 443
             A   S+FDTCYDLSG  +V+VPTV  +F  G  ++LPA N+L+PVD SG +CFAFA +
Sbjct: 384 PRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAAS 443

Query: 444 TSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGC 470
            +GLSIIGNIQQ+G +V +D AN  VGF P  C
Sbjct: 444 PTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Cla97C05G080940 vs. Swiss-Prot
Match: sp|Q9LEW3|AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1)

HSP 1 Score: 282.7 bits (722), Expect = 7.3e-75
Identity = 178/437 (40.73%), Positives = 244/437 (55.84%), Query Frame = 0

Query: 40  SFLPSDSESFISSDATESELGL-TLHLHHLDALSLNRTPEELFHLR-LQRDALRVKKL-S 99
           S  PS S    SS A+ ++  L  +H+H   A S   +   + H   ++RD  RV+ + S
Sbjct: 44  SLFPSSSSCVPSSKASNTKSSLRVVHMH--GACSHLSSDARVDHDEIIRRDQARVESIYS 103

Query: 100 SLGASSRN-VSQASGTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQ 159
            L  +S N VS+A  T   +   SG+  GSG Y   IG+GTP   + +V DTGSD+ W Q
Sbjct: 104 KLSKNSANEVSEAKSTELPAK--SGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQ 163

Query: 160 CAPC-KNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYT 219
           C PC  +CYSQ +P FNP  S ++  V C +P+C   ES  C+    C+Y + YGD S+T
Sbjct: 164 CEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAES--CS-ASNCVYSIVYGDKSFT 223

Query: 220 TGEFVTETLTFRRTKV-ERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKF 279
            G    E  T   + V E V  GCG +N+GLF G AGLLGLG G LS P+QT  T+N  F
Sbjct: 224 QGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIF 283

Query: 280 SYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGIS 339
           SYCL   +++S    + FG++ +S + +FTP+ + P     Y ++++GISVG   ++ I+
Sbjct: 284 SYCLPSFTSNS-TGHLTFGSAGISESVKFTPISSFPSAFN-YGIDIIGISVGDKELA-IT 343

Query: 340 ASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLS 399
            + F  +     G IID GT  TRL    Y  LR  F+   SS KS   + LFDTCYD +
Sbjct: 344 PNSFSTE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFT 403

Query: 400 GKTTVKVPTVVLHFRGAD-VSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFR 459
           G  TV  PT+   F G+  V L  S   +P+  S + C AFAG     +I GN+QQ    
Sbjct: 404 GLDTVTYPTIAFSFAGSTVVELDGSGISLPIKIS-QVCLAFAGNDDLPAIFGNVQQTTLD 463

Query: 460 VVYDLANSRVGFSPRGC 470
           VVYD+A  RVGF+P GC
Sbjct: 464 VVYDVAGGRVGFAPNGC 464

BLAST of Cla97C05G080940 vs. Swiss-Prot
Match: sp|Q8S9J6|ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 275.0 bits (702), Expect = 1.5e-72
Identity = 173/446 (38.79%), Positives = 239/446 (53.59%), Query Frame = 0

Query: 40  SFLPSDSESFISS---DATESELGLTLHLHHLDALSLNRTPEELFHLRLQR-DALRVKKL 99
           S LPS S S + S     T+S L +T H H   +   N       H+ + R D  RV  +
Sbjct: 40  SLLPSSSSSCVLSPRASTTKSSLHVT-HRHGTCSRLNNGKATSPDHVEILRLDQARVNSI 99

Query: 100 S---SLGASSRNVSQASGTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIV 159
               S   ++ +VS++  T   +    G   GSG Y   +G+GTP   + ++ DTGSD+ 
Sbjct: 100 HSKLSKKLATDHVSESKSTDLPAK--DGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLT 159

Query: 160 WLQCAPC-KNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLES----PGCNQRQTCLYQVS 219
           W QC PC + CY Q +P+FNP KS S+  V C +  C  L S     G      C+Y + 
Sbjct: 160 WTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQ 219

Query: 220 YGDGSYTTGEFVTETLTFRRTKV-ERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTG 279
           YGD S++ G    E  T   + V + V  GCG +N+GLF G AGLLGLGR  LSFPSQT 
Sbjct: 220 YGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTA 279

Query: 280 RTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGG 339
             +N+ FSYCL   S++S    + FG++ +SR+ +FTP+ T     +FY + ++ I+VGG
Sbjct: 280 TAYNKIFSYCL--PSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGG 339

Query: 340 TPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLF 399
             +  I ++ F        G +ID GT +TRL   AY ALR +F+A  S   +    S+ 
Sbjct: 340 QKLP-IPSTVF-----STPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSIL 399

Query: 400 DTCYDLSGKTTVKVPTVVLHFRGADVSLPASNYLIPVDGSGRFCFAFAGTT--SGLSIIG 459
           DTC+DLSG  TV +P V   F G  V    S  +  V    + C AFAG +  S  +I G
Sbjct: 400 DTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFG 459

Query: 460 NIQQQGFRVVYDLANSRVGFSPRGCA 471
           N+QQQ   VVYD A  RVGF+P GC+
Sbjct: 460 NVQQQTLEVVYDGAGGRVGFAPNGCS 474

BLAST of Cla97C05G080940 vs. TAIR10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 630.6 bits (1625), Expect = 8.0e-181
Identity = 333/477 (69.81%), Positives = 379/477 (79.45%), Query Frame = 0

Query: 8   FPFIFFLLTLLSFSTAFSDFQTLIPT--SLP-SSPSFLPSDSE-------SFISSDATES 67
           F   FF L+L SFS      QTL P   SLP +SP     DS+                 
Sbjct: 10  FSLCFFFLSLPSFS-XXXXXQTLFPNSHSLPCASPVSFQPDSDXXXXXXXXXXXXXXXXX 69

Query: 68  ELGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKLSSLGAS--SRNVSQASGT-GFS 127
                 +L H+DALS N+TP+ELF  RLQRD+ RVK +++L A    RNV+ A    GFS
Sbjct: 70  XXXXXXNLDHIDALSSNKTPDELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGFS 129

Query: 128 SSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVK 187
           SSV+SGL+QGSGEYFTR+GVGTP +YVYMVLDTGSDIVWLQCAPC+ CYSQ+DP+F+P K
Sbjct: 130 SSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRK 189

Query: 188 SGSFAKVLCRTPLCRRLESPGCN-QRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERV 247
           S ++A + C +P CRRL+S GCN +R+TCLYQVSYGDGS+T G+F TETLTFRR +V+ V
Sbjct: 190 SKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGV 249

Query: 248 ALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGN 307
           ALGCGHDNEGLFVGAAGLLGLG+G LSFP QTG  FNQKFSYCLVDRSASSKPSSVVFGN
Sbjct: 250 ALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGN 309

Query: 308 SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGT 367
           +AVSR ARFTPLL+NP+LDTFYYV LLGISVGGT V G++AS FKLD  GNGGVIID GT
Sbjct: 310 AAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 369

Query: 368 SVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVS 427
           SVTRL RPAYIA+RDAFR GA +LK AP+FSLFDTC+DLS    VKVPTVVLHFRGADVS
Sbjct: 370 SVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVS 429

Query: 428 LPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGCA 471
           LPA+NYLIPVD +G+FCFAFAGT  GLSIIGNIQQQGFRVVYDLA+SRVGF+P GCA
Sbjct: 430 LPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of Cla97C05G080940 vs. TAIR10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 588.2 bits (1515), Expect = 4.5e-168
Identity = 311/478 (65.06%), Positives = 368/478 (76.99%), Query Frame = 0

Query: 6   NSFPFIFFLLTLLSFSTAFSDFQTLIPTSLPSSPSFLPSDSESFISSDATESELGLTLHL 65
           N+  F  F + L   S+A S +QTL+  +LPSS +    +SES      +ES   L++HL
Sbjct: 7   NTLAFSVFAV-LFFTSSASSQYQTLVVNTLPSSATLSWPESESLTDESLSESTTSLSVHL 66

Query: 66  HHLDALS--LNRTPEELFHLRLQRDALRVKKLSSLGASS--RNVSQ---ASGTGFSSSVI 125
            H+DALS   + +P +LF+LRLQRD+LRVK ++SL A S  RN ++    +  GFS +VI
Sbjct: 67  SHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTPRTAGGFSGAVI 126

Query: 126 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 185
           SGL+QGSGEYF R+GVGTP   VYMVLDTGSD+VWLQC+PCK CY+QTD +F+P KS +F
Sbjct: 127 SGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTF 186

Query: 186 AKVLCRTPLCRRL-ESPGCNQR--QTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVAL 245
           A V C + LCRRL +S  C  R  +TCLYQVSYGDGS+T G+F TETLTF   +V+ V L
Sbjct: 187 ATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPL 246

Query: 246 GCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDR----SASSKPSSVVF 305
           GCGHDNEGLFVGAAGLLGLGRGGLSFPSQT   +N KFSYCLVDR    S+S  PS++VF
Sbjct: 247 GCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVF 306

Query: 306 GNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDC 365
           GN+AV +T+ FTPLLTNP+LDTFYY++LLGISVGG+ V G+S S FKLD TGNGGVIID 
Sbjct: 307 GNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDS 366

Query: 366 GTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGAD 425
           GTSVTRL +PAY+ALRDAFR GA+ LK AP +SLFDTC+DLSG TTVKVPTVV HF G +
Sbjct: 367 GTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGE 426

Query: 426 VSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGC 470
           VSLPASNYLIPV+  GRFCFAFAGT   LSIIGNIQQQGFRV YDL  SRVGF  R C
Sbjct: 427 VSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of Cla97C05G080940 vs. TAIR10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 404.4 bits (1038), Expect = 9.3e-113
Identity = 239/489 (48.88%), Positives = 298/489 (60.94%), Query Frame = 0

Query: 7   SFPFIFFLLTLLSFSTAFSDFQTLIPTSLPS-------------SPSFLPSDSESFISSD 66
           ++ F FF+  L S S+ FS       T+  S             + SF  +  E    + 
Sbjct: 4   NYSFFFFIFFLTSHSSVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEE--QTH 63

Query: 67  ATESELGLTLHLHHLDALSLNRTPEELFHLRLQRDALRVKKL-SSLGASSRNVSQASGTG 126
           +  S   L LH       + +   + L   RL RD  RVK L + L  +  N+S+A    
Sbjct: 64  SASSSFSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKP 123

Query: 127 FS-----------SSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKN 186
            S           + +ISG  QGSGEYFTR+G+G P + VYMVLDTGSD+ WLQC PC +
Sbjct: 124 ISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCAD 183

Query: 187 CYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTE 246
           CY QT+P+F P  S S+  + C TP C  LE   C +  TCLY+VSYGDGSYT G+F TE
Sbjct: 184 CYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSEC-RNATCLYEVSYGDGSYTVGDFATE 243

Query: 247 TLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRS 306
           TLT   T V+ VA+GCGH NEGLFVGAAGLLGLG G L+ PSQ   T    FSYCLVDR 
Sbjct: 244 TLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRD 303

Query: 307 ASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDP 366
           + S  S+V FG S +S  A   PLL N +LDTFYY+ L GISVGG  +  I  S F++D 
Sbjct: 304 SDS-ASTVDFGTS-LSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQ-IPQSSFEMDE 363

Query: 367 TGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVP 426
           +G+GG+IID GT+VTRL    Y +LRD+F  G   L+ A   ++FDTCY+LS KTTV+VP
Sbjct: 364 SGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVP 423

Query: 427 TVVLHFRGAD-VSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANS 470
           TV  HF G   ++LPA NY+IPVD  G FC AFA T S L+IIGN+QQQG RV +DLANS
Sbjct: 424 TVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANS 483

BLAST of Cla97C05G080940 vs. TAIR10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 394.8 bits (1013), Expect = 7.4e-110
Identity = 229/470 (48.72%), Positives = 284/470 (60.43%), Query Frame = 0

Query: 16  TLLSFSTAFSDFQTLIPTSLPSSPSFLPSDSESFISSDATESELGLTLHLHHLDALSLNR 75
           T+LS     S   T  P SL S P F  S            S L L LH       S ++
Sbjct: 49  TILSLDPTRSSLTTTKPESL-SDPVFFNS-----------SSPLSLELHSRDTFVASQHK 108

Query: 76  TPEELFHLRLQRDALRVKKL-------------SSLGASSRNVSQASGTGFSSSVISGLA 135
             + L   RL+RD+ RV  +             S L       ++      ++ V+SG +
Sbjct: 109 DYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGAS 168

Query: 136 QGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVL 195
           QGSGEYF+RIGVGTP K +Y+VLDTGSD+ W+QC PC +CY Q+DPVFNP  S ++  + 
Sbjct: 169 QGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLT 228

Query: 196 CRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRT-KVERVALGCGHDN 255
           C  P C  LE+  C   + CLYQVSYGDGS+T GE  T+T+TF  + K+  VALGCGHDN
Sbjct: 229 CSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDN 288

Query: 256 EGLFVGAAGLLGLGRGGLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTAR 315
           EGLF GAAGLLGLG G LS  +Q   T    FSYCLVDR  S K SS+ F +  +     
Sbjct: 289 EGLFTGAAGLLGLGGGVLSITNQMKAT---SFSYCLVDRD-SGKSSSLDFNSVQLGGGDA 348

Query: 316 FTPLLTNPRLDTFYYVELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRP 375
             PLL N ++DTFYYV L G SVGG  V  +  + F +D +G+GGVI+DCGT+VTRL   
Sbjct: 349 TAPLLRNKKIDTFYYVGLSGFSVGGEKVV-LPDAIFDVDASGSGGVILDCGTAVTRLQTQ 408

Query: 376 AYIALRDAF-RAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA-DVSLPASNY 435
           AY +LRDAF +   +  K +   SLFDTCYD S  +TVKVPTV  HF G   + LPA NY
Sbjct: 409 AYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNY 468

Query: 436 LIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGC 470
           LIPVD SG FCFAFA T+S LSIIGN+QQQG R+ YDL+ + +G S   C
Sbjct: 469 LIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Cla97C05G080940 vs. TAIR10
Match: AT3G20015.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 390.6 bits (1002), Expect = 1.4e-108
Identity = 216/453 (47.68%), Positives = 282/453 (62.25%), Query Frame = 0

Query: 24  FSDFQTLIPTSLPSSPSFLPSDSESFISSDATESELGLTLHLHHLDALS--LNRTPEELF 83
           F DFQ +     P + +    D  +   SD  ES    TL L H D       R      
Sbjct: 24  FPDFQIIDVLQPPLTVTATLPDFNNTHFSD--ESSSKYTLRLLHRDRFPSVTYRNHHHRL 83

Query: 84  HLRLQRDALRV----KKLSSLGASSRNVSQASGTGFSSSVISGLAQGSGEYFTRIGVGTP 143
           H R++RD  RV    +++S     S + S+     F S ++SG+ QGSGEYF RIGVG+P
Sbjct: 84  HARMRRDTDRVSAILRRISGKVIPSSD-SRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSP 143

Query: 144 PKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCN 203
           P+  YMV+D+GSD+VW+QC PCK CY Q+DPVF+P KSGS+  V C + +C R+E+ GC+
Sbjct: 144 PRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCH 203

Query: 204 QRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVERVALGCGHDNEGLFVGAAGLLGLGRG 263
               C Y+V YGDGSYT G    ETLTF +T V  VA+GCGH N G+F+GAAGLLG+G G
Sbjct: 204 S-GGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGG 263

Query: 264 GLSFPSQTGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYV 323
            +SF  Q        F YCLV R   S   S+VFG  A+   A + PL+ NPR  +FYYV
Sbjct: 264 SMSFVGQLSGQTGGAFGYCLVSRGTDS-TGSLVFGREALPVGASWVPLVRNPRAPSFYYV 323

Query: 324 ELLGISVGGTPVSGISASHFKLDPTGNGGVIIDCGTSVTRLNRPAYIALRDAFRAGASSL 383
            L G+ VGG  +  +    F L  TG+GGV++D GT+VTRL   AY+A RD F++  ++L
Sbjct: 324 GLKGLGVGGVRIP-LPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANL 383

Query: 384 KSAPEFSLFDTCYDLSGKTTVKVPTVVLHF-RGADVSLPASNYLIPVDGSGRFCFAFAGT 443
             A   S+FDTCYDLSG  +V+VPTV  +F  G  ++LPA N+L+PVD SG +CFAFA +
Sbjct: 384 PRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAAS 443

Query: 444 TSGLSIIGNIQQQGFRVVYDLANSRVGFSPRGC 470
            +GLSIIGNIQQ+G +V +D AN  VGF P  C
Sbjct: 444 PTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470
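
The percentages in each HSP header above are the identity and positive counts divided by the alignment length. The following is a minimal Python sketch, not part of the original report, that parses one such header line and recomputes the percentages. The names `parse_hsp_stats` and `percent` are hypothetical helpers introduced only for illustration, and the pattern assumes the header layout shown in this report (it also tolerates the "Postives" spelling found in some exports).

```python
import re

# Pattern for HSP statistics lines as they appear in this report, e.g.
# "Identity = 216/453 (47.68%), Positives = 282/453 (62.25%), Query Frame = 0".
# "Posi?tives" also accepts the misspelled label "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and reported percentages from one HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, ident_pct, pos, _pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, length):
    """Recompute a percentage from a count and the alignment length."""
    return round(100.0 * count / length, 2)


stats = parse_hsp_stats(
    "Identity = 216/453 (47.68%), Positives = 282/453 (62.25%), Query Frame = 0"
)
print(percent(stats["identities"], stats["length"]))
print(percent(stats["positives"], stats["length"]))
```

Comparing the recomputed values with the reported ones is a quick consistency check when these headers are copied between reports.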

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_004133810.1  8.4e-253  94.27     PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus]
KGN56421.1      8.4e-253  94.27     Aspartic proteinase nepenthesin-1 [Cucumis sativus]
XP_008437888.1  1.9e-252  94.49     PREDICTED: aspartyl protease family protein 2 [Cucumis melo]
XP_022980038.1  4.6e-243  91.14     aspartyl protease family protein 2-like [Cucurbita maxima]
XP_022924595.1  1.0e-242  91.14     aspartyl protease family protein 2-like [Cucurbita moschata]

Match Name                      E-value   Identity  Description
tr|A0A0A0L8K0|A0A0A0L8K0_CUCSA  5.5e-253  94.27     Aspartic proteinase nepenthesin-1 OS=Cucumis sativus OX=3659 GN=Csa_3G119540 PE=...
tr|A0A1S3AV66|A0A1S3AV66_CUCME  1.2e-252  94.49     aspartyl protease family protein 2 OS=Cucumis melo OX=3656 GN=LOC103483183 PE=3 ...
tr|A0A2P5AIW4|A0A2P5AIW4_PARAD  4.8e-196  74.58     Aspartic peptidase OS=Parasponia andersonii OX=3476 GN=PanWU01x14_327890 PE=3 SV...
tr|A0A251LUR5|A0A251LUR5_MANES  1.1e-195  75.10     Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_01G245800 PE=3 SV=...
tr|A0A2I4EG53|A0A2I4EG53_9ROSI  1.4e-195  76.46     aspartyl protease family protein 2 OS=Juglans regia OX=51240 GN=LOC108989274 PE=...

Match Name             E-value   Identity  Description
sp|Q9LNJ3|APF2_ARATH   1.4e-179  69.81     Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
sp|Q9LS40|ASPG1_ARATH  1.3e-108  48.72     Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP...
sp|Q9LHE3|ASPG2_ARATH  2.5e-107  47.68     Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP...
sp|Q9LEW3|AED1_ARATH   7.3e-75   40.73     Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1
sp|Q8S9J6|ASPA_ARATH   1.5e-72   38.79     Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At...

Match Name   E-value   Identity  Description
AT1G01300.1  8.0e-181  69.81     Eukaryotic aspartyl protease family protein
AT3G61820.1  4.5e-168  65.06     Eukaryotic aspartyl protease family protein
AT1G25510.1  9.3e-113  48.88     Eukaryotic aspartyl protease family protein
AT3G18490.1  7.4e-110  48.72     Eukaryotic aspartyl protease family protein
AT3G20015.1  1.4e-108  47.68     Eukaryotic aspartyl protease family protein

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
Vocabulary: INTERPRO
Term       Definition
IPR033873  CND41-like
IPR033121  PEPTIDASE_A1
IPR001969  Aspartic_peptidase_AS
IPR032861  TAXi_N
IPR021109  Peptidase_aspartic_dom_sf
IPR032799  TAXi_C
IPR001461  Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0042545      cell wall modification
biological_process  GO:0006508      proteolysis
biological_process  GO:0009664      plant-type cell wall organization
biological_process  GO:0030163      protein catabolic process
biological_process  GO:0080167      response to karrikin
cellular_component  GO:0016020      membrane
cellular_component  GO:0009505      plant-type cell wall
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0004190      aspartic-type endopeptidase activity
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0008233      peptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C05G080940.1  Cla97C05G080940.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                        Source       Source Term       Source Description         Alignment
IPR001461  Aspartic peptidase A1 family           PRINTS       PR00792           PEPSIN                     coord: 347..358; score: 39.97
IPR001461  Aspartic peptidase A1 family           PRINTS       PR00792           PEPSIN                     coord: 134..154; score: 40.7
IPR001461  Aspartic peptidase A1 family           PRINTS       PR00792           PEPSIN                     coord: 441..456; score: 26.83
IPR001461  Aspartic peptidase A1 family           PANTHER      PTHR13683         ASPARTYL PROTEASES         coord: 23..469
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541           TAXi_C                     coord: 315..465; e-value: 1.2E-36; score: 125.9
IPR021109  Aspartic peptidase domain superfamily  GENE3D       G3DSA:2.40.70.10  -                          coord: 104..289; e-value: 8.8E-53; score: 181.3
IPR021109  Aspartic peptidase domain superfamily  GENE3D       G3DSA:2.40.70.10  -                          coord: 290..470; e-value: 4.9E-55; score: 188.0
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  SSF50630          Acid proteases             coord: 121..469
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543           TAXi_N                     coord: 128..293; e-value: 3.0E-55; score: 187.1
None       No IPR available                       PANTHER      PTHR13683:SF308   ASPARTYL PROTEASE-RELATED  coord: 23..469
IPR001969  Aspartic peptidase, active site        PROSITE      PS00141           ASP_PROTEASE               coord: 143..154
IPR033121  Peptidase family A1 domain             PROSITE      PS51767           PEPTIDASE_A1               coord: 128..465; score: 47.248
IPR033873  CND41-like                             CDD          cd05472           cnd41_like                 coord: 127..469; e-value: 3.93237E-135; score: 393.946
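
The alignment coordinates in the InterPro annotations above are inclusive residue ranges on the protein, so they can be compared directly to see which features nest inside others. The following is a minimal Python sketch, not part of the original analysis; `parse_coord`, `contains` and `span_length` are hypothetical helper names, and the `coord: start..end` layout is assumed from the entries listed here.

```python
import re

# "coord: 143..154" style ranges as written in the InterPro annotations.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract the (start, end) residue range from a 'coord: a..b' string."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


def span_length(span):
    """Number of residues covered by an inclusive range."""
    return span[1] - span[0] + 1


# Values copied from the annotations above.
active_site = parse_coord("coord: 143..154")  # PROSITE PS00141
taxi_n = parse_coord("coord: 128..293")       # PFAM PF14543 (TAXi_N)
taxi_c = parse_coord("coord: 315..465")       # PFAM PF14541 (TAXi_C)

print(contains(taxi_n, active_site))
print(contains(taxi_c, active_site))
print(span_length(active_site))
```

With the ranges listed here, the PROSITE active-site pattern falls inside the TAXi_N match and outside the TAXi_C match.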

The following gene(s) are paralogous to this gene:

None