Cla97C01G008360 (gene) Watermelon (97103) v2

NameCla97C01G008360
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionAspartyl protease family protein
LocationCla97Chr01 : 8632533 .. 8634113 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTACCTTTCCTTCTCCTTCTTCCTCTTTTAGCCACTGCGGTCTCCCCGGTTGCCACTGGTCCAGCCGCTACTTACCCGGCCACCCAACTCCTAAATGTCAAAGACACAATCAAAGAAACAGAAACCAGACCCTCTATACTACCACAAGATCTTGACATCCATGAAAACTACCCTACTTTTGACAACAGCAGCAGTCAGAGTCAATGGAAGCTCAAGCTCTTTCATAGAGATAAGCTACCCCTCAACTTCGACCCCGACCATCGCCGTCGTTTCAAGGAGCGTATTGAAAGAGATTCCAAAAGGGTCTCCTCTCTGCTCCACAAACTCTCCAATGGCAGCGACGAGCAGGTGATCAGGGGTGAAGCTAAGAAGGAAGGAGGGAGGCAATTGCCTTCCTTTGAATTTATATGAACATACTCAATATTGCTTCAAATATTATATATAAAAAATGTTGGAATGGAAATTCTTTTGTTTCTTTCTGATTTGGTTGTGGCCCTGCAGGTGACGGATTTTGGGTCGGACGTGGTCTCCGGCACGGAGCAGGGGAGTGGAGAGTACTTCGTGAGGATCGGCGTCGGCAGCCCGCCGAGGAGCCAATACGTGGTGATTGATTCCGGCAGCGACATTGTTTGGGTGCAATGCCAGCCTTGCAGCGAATGCTACCAACAGTCCGACCCGGTGTTTGACCCGGCCGGTTCCGCCTCCTACGCTGGAATCTCCTGCGACTCCTCAGTGTGCGACCGCCTCGATAACGCAGGCTGTAACGATGGCCGGTGCCGGTACGAGGTGTCGTACGGCGACGGATCCTACACCCGCGGCACACTGGCGCTCGAAACCCTAACTTTCGGGCGGGTCGTAATCCGAAACATCGCGATCGGCTGCGGCCATATGAACCGAGGAATGTTCGTCGGAGCCGCAGGGTTGCTCGGCCTCGGCGGTGGCGCCATGTCGTTCGTCGGCCAACTCGGCGGCCAGACTGGCGGCGCGTTCAGCTACTGTTTGGTTAGTCGAGGCACCGAGTCCACTGGAACGCTGGAGTTCGGCCGTGGCGCTATGCCAGTCGGCGGCGCGTGGGTTCCCCTAATCCGAAACCCACGCGCTCCAAGTTTCTACTACGTCGGGCTTTCAGGACTCGGAGTCGGAGGGATCCGAGTTCCAATACCCGAACAGATCTTCGAACTCACCGATCTAGGGTACGGTGGCGTGGTGATGGACACCGGAACCGCCGTGACGAGGTTACCGGCGCCAGCGTACGAAGCATTCCGAGACACCTTCATCGGACAAACGGCAAACCTGCCTCGATCGGGGAGAGTATCGATCTTCGACACATGCTATAACCTAAACGGGTTCGTATCGGTAAGGGTACCGACGGTGTCCTTCTACTTCTCCGGTGGGCCGATACTGACGTTGCCGGCGAGGAACTTCCTGATTCCGGTGGACAGCGAAGGGACTTTTTGCTTCGCATTTGCAGCATCGGCGTCGGGATTGTCGATAATAGGAAACATTCAGCAAGAAGGGATTCAAATCTCCATTGATGGATCAAATGGGTTTGTGGGATTTGGGCCAAGTATTTGTTAA

mRNA sequence

ATGCTACCTTTCCTTCTCCTTCTTCCTCTTTTAGCCACTGCGGTCTCCCCGGTTGCCACTGGTCCAGCCGCTACTTACCCGGCCACCCAACTCCTAAATGTCAAAGACACAATCAAAGAAACAGAAACCAGACCCTCTATACTACCACAAGATCTTGACATCCATGAAAACTACCCTACTTTTGACAACAGCAGCAGTCAGAGTCAATGGAAGCTCAAGCTCTTTCATAGAGATAAGCTACCCCTCAACTTCGACCCCGACCATCGCCGTCGTTTCAAGGAGCGTATTGAAAGAGATTCCAAAAGGGTCTCCTCTCTGCTCCACAAACTCTCCAATGGCAGCGACGAGCAGGTGACGGATTTTGGGTCGGACGTGGTCTCCGGCACGGAGCAGGGGAGTGGAGAGTACTTCGTGAGGATCGGCGTCGGCAGCCCGCCGAGGAGCCAATACGTGGTGATTGATTCCGGCAGCGACATTGTTTGGGTGCAATGCCAGCCTTGCAGCGAATGCTACCAACAGTCCGACCCGGTGTTTGACCCGGCCGGTTCCGCCTCCTACGCTGGAATCTCCTGCGACTCCTCAGTGTGCGACCGCCTCGATAACGCAGGCTGTAACGATGGCCGGTGCCGGTACGAGGTGTCGTACGGCGACGGATCCTACACCCGCGGCACACTGGCGCTCGAAACCCTAACTTTCGGGCGGGTCGTAATCCGAAACATCGCGATCGGCTGCGGCCATATGAACCGAGGAATGTTCGTCGGAGCCGCAGGGTTGCTCGGCCTCGGCGGTGGCGCCATGTCGTTCGTCGGCCAACTCGGCGGCCAGACTGGCGGCGCGTTCAGCTACTGTTTGGTTAGTCGAGGCACCGAGTCCACTGGAACGCTGGAGTTCGGCCGTGGCGCTATGCCAGTCGGCGGCGCGTGGGTTCCCCTAATCCGAAACCCACGCGCTCCAAGTTTCTACTACGTCGGGCTTTCAGGACTCGGAGTCGGAGGGATCCGAGTTCCAATACCCGAACAGATCTTCGAACTCACCGATCTAGGGTACGGTGGCGTGGTGATGGACACCGGAACCGCCGTGACGAGGTTACCGGCGCCAGCGTACGAAGCATTCCGAGACACCTTCATCGGACAAACGGCAAACCTGCCTCGATCGGGGAGAGTATCGATCTTCGACACATGCTATAACCTAAACGGGTTCGTATCGGTAAGGGTACCGACGGTGTCCTTCTACTTCTCCGGTGGGCCGATACTGACGTTGCCGGCGAGGAACTTCCTGATTCCGGTGGACAGCGAAGGGACTTTTTGCTTCGCATTTGCAGCATCGGCGTCGGGATTGTCGATAATAGGAAACATTCAGCAAGAAGGGATTCAAATCTCCATTGATGGATCAAATGGGTTTGTGGGATTTGGGCCAAGTATTTGTTAA

Coding sequence (CDS)

ATGCTACCTTTCCTTCTCCTTCTTCCTCTTTTAGCCACTGCGGTCTCCCCGGTTGCCACTGGTCCAGCCGCTACTTACCCGGCCACCCAACTCCTAAATGTCAAAGACACAATCAAAGAAACAGAAACCAGACCCTCTATACTACCACAAGATCTTGACATCCATGAAAACTACCCTACTTTTGACAACAGCAGCAGTCAGAGTCAATGGAAGCTCAAGCTCTTTCATAGAGATAAGCTACCCCTCAACTTCGACCCCGACCATCGCCGTCGTTTCAAGGAGCGTATTGAAAGAGATTCCAAAAGGGTCTCCTCTCTGCTCCACAAACTCTCCAATGGCAGCGACGAGCAGGTGACGGATTTTGGGTCGGACGTGGTCTCCGGCACGGAGCAGGGGAGTGGAGAGTACTTCGTGAGGATCGGCGTCGGCAGCCCGCCGAGGAGCCAATACGTGGTGATTGATTCCGGCAGCGACATTGTTTGGGTGCAATGCCAGCCTTGCAGCGAATGCTACCAACAGTCCGACCCGGTGTTTGACCCGGCCGGTTCCGCCTCCTACGCTGGAATCTCCTGCGACTCCTCAGTGTGCGACCGCCTCGATAACGCAGGCTGTAACGATGGCCGGTGCCGGTACGAGGTGTCGTACGGCGACGGATCCTACACCCGCGGCACACTGGCGCTCGAAACCCTAACTTTCGGGCGGGTCGTAATCCGAAACATCGCGATCGGCTGCGGCCATATGAACCGAGGAATGTTCGTCGGAGCCGCAGGGTTGCTCGGCCTCGGCGGTGGCGCCATGTCGTTCGTCGGCCAACTCGGCGGCCAGACTGGCGGCGCGTTCAGCTACTGTTTGGTTAGTCGAGGCACCGAGTCCACTGGAACGCTGGAGTTCGGCCGTGGCGCTATGCCAGTCGGCGGCGCGTGGGTTCCCCTAATCCGAAACCCACGCGCTCCAAGTTTCTACTACGTCGGGCTTTCAGGACTCGGAGTCGGAGGGATCCGAGTTCCAATACCCGAACAGATCTTCGAACTCACCGATCTAGGGTACGGTGGCGTGGTGATGGACACCGGAACCGCCGTGACGAGGTTACCGGCGCCAGCGTACGAAGCATTCCGAGACACCTTCATCGGACAAACGGCAAACCTGCCTCGATCGGGGAGAGTATCGATCTTCGACACATGCTATAACCTAAACGGGTTCGTATCGGTAAGGGTACCGACGGTGTCCTTCTACTTCTCCGGTGGGCCGATACTGACGTTGCCGGCGAGGAACTTCCTGATTCCGGTGGACAGCGAAGGGACTTTTTGCTTCGCATTTGCAGCATCGGCGTCGGGATTGTCGATAATAGGAAACATTCAGCAAGAAGGGATTCAAATCTCCATTGATGGATCAAATGGGTTTGTGGGATTTGGGCCAAGTATTTGTTAA

Protein sequence

MLPFLLLLPLLATAVSPVATGPAATYPATQLLNVKDTIKETETRPSILPQDLDIHENYPTFDNSSSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKRVSSLLHKLSNGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVVIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC
BLAST of Cla97C01G008360 vs. NCBI nr
Match: XP_008465249.1 (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo])

HSP 1 Score: 902.1 bits (2330), Expect = 7.9e-259
Identity = 452/477 (94.76%), Postives = 461/477 (96.65%), Query Frame = 0

Query: 1   MLPFLLLLPLLATAVSPVATGPAATYPATQLLNVKDTIKETETRPSILPQDLDIHENYPT 60
           MLP LLLLPLLATAVS VATGPAATYPATQLLNVKDTIKETET PS LPQDL++HENYP 
Sbjct: 1   MLPLLLLLPLLATAVSSVATGPAATYPATQLLNVKDTIKETETTPSRLPQDLNLHENYPL 60

Query: 61  F--DNSSSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKRVSSLLHKLSNGSDEQV 120
           F  DN+SSQSQWKLKLFHRDKLPLNFD +H RRFKERI RDSKRVSSLL  LSN SDEQV
Sbjct: 61  FELDNNSSQSQWKLKLFHRDKLPLNFDTNHPRRFKERISRDSKRVSSLLRLLSNASDEQV 120

Query: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180
           TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF
Sbjct: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180

Query: 181 DPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVVIR 240
           DPAGSA+YAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGR++IR
Sbjct: 181 DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRILIR 240

Query: 241 NIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300
           NIAIGCGHMNRGMF+GAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG
Sbjct: 241 NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300

Query: 301 RGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360
           RGAMPVG AWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT
Sbjct: 301 RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360

Query: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420
           AVTRLPAPAYEAFRDTFIGQTANLPRS RVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL
Sbjct: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420

Query: 421 TLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           TLPARNFLIPVD EGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGP+IC
Sbjct: 421 TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477

BLAST of Cla97C01G008360 vs. NCBI nr
Match: XP_004150193.1 (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus])

HSP 1 Score: 898.7 bits (2321), Expect = 8.7e-258
Identity = 452/477 (94.76%), Postives = 461/477 (96.65%), Query Frame = 0

Query: 1   MLPFLLLLPLLATAVSPVATGPAATYPATQLLNVKDTIKETETRPSILPQDLDIHENYPT 60
           MLPF LLL LLATAV+ VATGPAATYPATQLLNVKDTIKE ET PS LPQDL++HENYP 
Sbjct: 1   MLPF-LLLSLLATAVASVATGPAATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPI 60

Query: 61  F--DNSSSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKRVSSLLHKLSNGSDEQV 120
           F  DN+SSQSQWKLKLFHRDKLPLNFDPDH RRFKERI RDSKRVSSLL  LS+GSDEQV
Sbjct: 61  FELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSGSDEQV 120

Query: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180
           TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF
Sbjct: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180

Query: 181 DPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVVIR 240
           DPAGSA+YAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRV+IR
Sbjct: 181 DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIR 240

Query: 241 NIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300
           NIAIGCGHMNRGMF+GAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG
Sbjct: 241 NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300

Query: 301 RGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360
           RGAMPVG AWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT
Sbjct: 301 RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360

Query: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420
           AVTRLPAPAYEAFRDTFIGQTANLPRS RVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL
Sbjct: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420

Query: 421 TLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           TLPARNFLIPVD EGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGP+IC
Sbjct: 421 TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476

BLAST of Cla97C01G008360 vs. NCBI nr
Match: XP_023514620.1 (protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 863.6 bits (2230), Expect = 3.1e-247
Identity = 436/478 (91.21%), Postives = 451/478 (94.35%), Query Frame = 0

Query: 1   MLP-FLLLLPLLAT--AVSPVATGPAATYPATQLLNVKDTIKETETRPSILPQDLDIHEN 60
           MLP FLLLLP L T  AV  VATG  A YPATQLL+VKDTIKETE +PS LPQDL+++EN
Sbjct: 1   MLPFFLLLLPPLLTRAAVDTVATGLVANYPATQLLHVKDTIKETEIKPSRLPQDLELYEN 60

Query: 61  YPTFDNSSSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKRVSSLLHKLSNGSDEQ 120
           YP  DN S+Q+QWKL+LFHRDKLPLNFDPDHRRRFKERI RD +RVSSLL +LSNGSDEQ
Sbjct: 61  YPPIDN-STQNQWKLELFHRDKLPLNFDPDHRRRFKERIGRDVQRVSSLLRRLSNGSDEQ 120

Query: 121 VTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPV 180
           VTDFGSDV+SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPV
Sbjct: 121 VTDFGSDVISGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPV 180

Query: 181 FDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVVI 240
           FDPA SASYAGISCDSSVC RLDNAGCNDGRCRYEVSYGDGSYTRG LALETLTFGRV+I
Sbjct: 181 FDPASSASYAGISCDSSVCGRLDNAGCNDGRCRYEVSYGDGSYTRGNLALETLTFGRVLI 240

Query: 241 RNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEF 300
           RNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEF
Sbjct: 241 RNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEF 300

Query: 301 GRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTG 360
           GR AMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTG
Sbjct: 301 GRSAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTG 360

Query: 361 TAVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPI 420
           TAVTRLP PAYEAFRDTFIGQTANLPRS  VSIFDTCY+LNGFVSVRVPTVSFYFSGGPI
Sbjct: 361 TAVTRLPVPAYEAFRDTFIGQTANLPRSREVSIFDTCYDLNGFVSVRVPTVSFYFSGGPI 420

Query: 421 LTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           LTLPARNFLIPVD EGTFCFAFAAS SGLSIIGNIQQEGIQIS+DG+NGFVGFGPSIC
Sbjct: 421 LTLPARNFLIPVDGEGTFCFAFAASPSGLSIIGNIQQEGIQISVDGANGFVGFGPSIC 477

BLAST of Cla97C01G008360 vs. NCBI nr
Match: XP_022990025.1 (protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucurbita maxima])

HSP 1 Score: 860.9 bits (2223), Expect = 2.0e-246
Identity = 434/479 (90.61%), Postives = 451/479 (94.15%), Query Frame = 0

Query: 1   MLPFLLLL---PLLA-TAVSPVATGPAATYPATQLLNVKDTIKETETRPSILPQDLDIHE 60
           MLPFLLLL   PLL   A+  VATG AA YPATQLL+VKDTIKETE +PS LPQDL+++E
Sbjct: 1   MLPFLLLLLLPPLLTRAAIDTVATGLAANYPATQLLHVKDTIKETEIKPSRLPQDLELYE 60

Query: 61  NYPTFDNSSSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKRVSSLLHKLSNGSDE 120
           NYP  DN S+Q+QWKL+LFHRDKLPLNFDPDH RRFKERI RD +RVSSLL +LSNGSDE
Sbjct: 61  NYPPIDN-STQNQWKLELFHRDKLPLNFDPDHCRRFKERIGRDVERVSSLLRRLSNGSDE 120

Query: 121 QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDP 180
           QVT+FGSDV+SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDP
Sbjct: 121 QVTEFGSDVISGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDP 180

Query: 181 VFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVV 240
           VFDPA SASYAGISCDSSVC RLDNAGCNDGRCRYEVSYGDGSYTRG LALETLTFGRV+
Sbjct: 181 VFDPASSASYAGISCDSSVCGRLDNAGCNDGRCRYEVSYGDGSYTRGNLALETLTFGRVL 240

Query: 241 IRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLE 300
           IRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLE
Sbjct: 241 IRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLE 300

Query: 301 FGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDT 360
           FGR AMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDT
Sbjct: 301 FGRSAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDT 360

Query: 361 GTAVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGGP 420
           GTAVTRLP PAYEAFRDTFIGQTANLPRS  VSIFDTCY+LNGFVSVRVPTVSFYFSGGP
Sbjct: 361 GTAVTRLPVPAYEAFRDTFIGQTANLPRSREVSIFDTCYDLNGFVSVRVPTVSFYFSGGP 420

Query: 421 ILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           ILTLPARNFLIPVD EGTFCFAFAAS SGLSIIGNIQQEGIQIS+DG+NGFVGFGPSIC
Sbjct: 421 ILTLPARNFLIPVDGEGTFCFAFAASPSGLSIIGNIQQEGIQISVDGANGFVGFGPSIC 478

BLAST of Cla97C01G008360 vs. NCBI nr
Match: XP_022921534.1 (protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucurbita moschata])

HSP 1 Score: 855.5 bits (2209), Expect = 8.5e-245
Identity = 430/480 (89.58%), Postives = 446/480 (92.92%), Query Frame = 0

Query: 1   MLPFL-----LLLPLLATAVSPVATGPAATYPATQLLNVKDTIKETETRPSILPQDLDIH 60
           MLPFL             AV  VA+  AA+YPATQLL+VKDTIKETE +PS LPQDL++H
Sbjct: 1   MLPFLXXXXXXXXXXXRAAVDTVASSLAASYPATQLLHVKDTIKETEIKPSRLPQDLELH 60

Query: 61  ENYPTFDNSSSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKRVSSLLHKLSNGSD 120
           ENYP  DN S+Q+QWKL+LFHRDKLPLNFDPDHRRRFKERI RD +RVSSLL +LSNGSD
Sbjct: 61  ENYPPIDN-STQNQWKLELFHRDKLPLNFDPDHRRRFKERIGRDVERVSSLLRRLSNGSD 120

Query: 121 EQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD 180
           EQVTDFGSDV+SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD
Sbjct: 121 EQVTDFGSDVISGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD 180

Query: 181 PVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRV 240
           PVFDPA SASYAGISCDSSVC RLDNAGCNDGRCRYEVSYGDGSYTRG LALETLTFGRV
Sbjct: 181 PVFDPASSASYAGISCDSSVCGRLDNAGCNDGRCRYEVSYGDGSYTRGNLALETLTFGRV 240

Query: 241 VIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTL 300
           +IRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTL
Sbjct: 241 LIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTL 300

Query: 301 EFGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMD 360
           EFGR AMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMD
Sbjct: 301 EFGRSAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMD 360

Query: 361 TGTAVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGG 420
           TGTAVTRLP PAYEAFRDTFIGQTANLPRS  VSIFDTCY+LNGFVSVRVPTVSFYFSGG
Sbjct: 361 TGTAVTRLPVPAYEAFRDTFIGQTANLPRSREVSIFDTCYDLNGFVSVRVPTVSFYFSGG 420

Query: 421 PILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           PILTLPARNFLIPVD EGTFCFAFAAS SGLSIIGNIQQEGIQIS+DG+NGFVGFGPSIC
Sbjct: 421 PILTLPARNFLIPVDGEGTFCFAFAASPSGLSIIGNIQQEGIQISVDGANGFVGFGPSIC 479

BLAST of Cla97C01G008360 vs. TrEMBL
Match: tr|A0A1S3CNF7|A0A1S3CNF7_CUCME (protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Cucumis melo OX=3656 GN=LOC103502903 PE=3 SV=1)

HSP 1 Score: 902.1 bits (2330), Expect = 5.2e-259
Identity = 452/477 (94.76%), Postives = 461/477 (96.65%), Query Frame = 0

Query: 1   MLPFLLLLPLLATAVSPVATGPAATYPATQLLNVKDTIKETETRPSILPQDLDIHENYPT 60
           MLP LLLLPLLATAVS VATGPAATYPATQLLNVKDTIKETET PS LPQDL++HENYP 
Sbjct: 1   MLPLLLLLPLLATAVSSVATGPAATYPATQLLNVKDTIKETETTPSRLPQDLNLHENYPL 60

Query: 61  F--DNSSSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKRVSSLLHKLSNGSDEQV 120
           F  DN+SSQSQWKLKLFHRDKLPLNFD +H RRFKERI RDSKRVSSLL  LSN SDEQV
Sbjct: 61  FELDNNSSQSQWKLKLFHRDKLPLNFDTNHPRRFKERISRDSKRVSSLLRLLSNASDEQV 120

Query: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180
           TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF
Sbjct: 121 TDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVF 180

Query: 181 DPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVVIR 240
           DPAGSA+YAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGR++IR
Sbjct: 181 DPAGSATYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRILIR 240

Query: 241 NIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300
           NIAIGCGHMNRGMF+GAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG
Sbjct: 241 NIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 300

Query: 301 RGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360
           RGAMPVG AWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT
Sbjct: 301 RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGT 360

Query: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420
           AVTRLPAPAYEAFRDTFIGQTANLPRS RVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL
Sbjct: 361 AVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPIL 420

Query: 421 TLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           TLPARNFLIPVD EGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGP+IC
Sbjct: 421 TLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 477

BLAST of Cla97C01G008360 vs. TrEMBL
Match: tr|A0A2P5R9B1|A0A2P5R9B1_GOSBA (Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_DD19674 PE=3 SV=1)

HSP 1 Score: 691.8 bits (1784), Expect = 1.1e-195
Identity = 345/480 (71.88%), Postives = 401/480 (83.54%), Query Frame = 0

Query: 5   LLLLPLLATAVSPVATGPAATYPATQLLNVKDTIKETETRPSILPQDLDIHENYPTFDNS 64
           ++L+ +L   +S VAT   A+YP  QLLNVK T+  TE     +P+ L   E++   D S
Sbjct: 9   IVLVAMLHLTLSSVAT---ASYPDFQLLNVKQTLIGTE-----IPRPLQTSEHHQVSDVS 68

Query: 65  SSQSQWKLKLFHRDKLPLNFDP---DHRRRFKERIERDSKRVSSLLHKLSNGSDE----- 124
            +Q +WKLKL HRDKL  N      DH RRF  R++RD KRV+SLL +LS G        
Sbjct: 69  ETQGKWKLKLVHRDKLSSNTSATFRDHSRRFHARMQRDVKRVASLLRRLSGGGGHDGGAA 128

Query: 125 -QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD 184
            +V DFGSDVVSG +QGSGEYFVR+GVGSPP+SQY+VIDSGSDIVWVQCQPC++CY+QSD
Sbjct: 129 YEVNDFGSDVVSGMDQGSGEYFVRLGVGSPPKSQYMVIDSGSDIVWVQCQPCNQCYRQSD 188

Query: 185 PVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRV 244
           PVFDPA SASYAGISC S+VCDR++N+GCN GRCRYEV YGDGSYT+GTLALETLTFGR 
Sbjct: 189 PVFDPADSASYAGISCSSAVCDRIENSGCNAGRCRYEVLYGDGSYTKGTLALETLTFGRT 248

Query: 245 VIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTL 304
           V++N+AIGCGH+NRGMF+GAAGLLGLGGG++S VGQLGGQTGGAFSYCLVSRG++++G+L
Sbjct: 249 VVKNVAIGCGHINRGMFIGAAGLLGLGGGSLSLVGQLGGQTGGAFSYCLVSRGSDASGSL 308

Query: 305 EFGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMD 364
           EFGRGAMPVG AWVPL+RNP+APSFYYVGLSGLGVGGIRVP+ E IF+LT+LGYGGVVMD
Sbjct: 309 EFGRGAMPVGAAWVPLLRNPQAPSFYYVGLSGLGVGGIRVPVSEDIFQLTELGYGGVVMD 368

Query: 365 TGTAVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGG 424
           TGTAV+R P  AY+A RD FI QTANLPR   VSIFDTCYNL+ FV++RVPTVSFYFSGG
Sbjct: 369 TGTAVSRFPTLAYKALRDAFIAQTANLPRISTVSIFDTCYNLSDFVTIRVPTVSFYFSGG 428

Query: 425 PILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           PILTLPA NFLIPVD  GTFC AFA+S SGLSIIGNIQQEGIQIS DG+NGFVGFGP++C
Sbjct: 429 PILTLPASNFLIPVDDVGTFCLAFASSTSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 480

BLAST of Cla97C01G008360 vs. TrEMBL
Match: tr|A0A1U8N9X5|A0A1U8N9X5_GOSHI (protein ASPARTIC PROTEASE IN GUARD CELL 2-like OS=Gossypium hirsutum OX=3635 GN=LOC107946097 PE=3 SV=1)

HSP 1 Score: 689.5 bits (1778), Expect = 5.3e-195
Identity = 344/480 (71.67%), Postives = 400/480 (83.33%), Query Frame = 0

Query: 5   LLLLPLLATAVSPVATGPAATYPATQLLNVKDTIKETETRPSILPQDLDIHENYPTFDNS 64
           ++L+ +L   +S VAT   A+YP  QLLNVK T+  TE     +P+ L   E++   D S
Sbjct: 9   IVLVAMLHLTLSSVAT---ASYPDFQLLNVKQTLIGTE-----IPRPLQTSEHHQVSDVS 68

Query: 65  SSQSQWKLKLFHRDKLPLNFDP---DHRRRFKERIERDSKRVSSLLHKLSNGSDE----- 124
            +Q +WKLKL HRDKL  N      DH RR   R++RD KRV+SLL +LS G        
Sbjct: 69  ETQGKWKLKLVHRDKLSSNTSATFRDHSRRLHARMQRDVKRVASLLRRLSGGGGHDGGAA 128

Query: 125 -QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD 184
            +V DFGSDVVSG +QGSGEYFVR+GVGSPP+SQY+VIDSGSDIVWVQCQPC++CY+QSD
Sbjct: 129 YEVNDFGSDVVSGMDQGSGEYFVRLGVGSPPKSQYMVIDSGSDIVWVQCQPCNQCYRQSD 188

Query: 185 PVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRV 244
           PVFDPA SASYAGISC S+VCDR++N+GCN GRCRYEV YGDGSYT+GTLALETLTFGR 
Sbjct: 189 PVFDPADSASYAGISCSSAVCDRIENSGCNAGRCRYEVLYGDGSYTKGTLALETLTFGRT 248

Query: 245 VIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTL 304
           V++N+AIGCGH+NRGMF+GAAGLLGLGGG++S VGQLGGQTGGAFSYCLVSRG++++G+L
Sbjct: 249 VVKNVAIGCGHINRGMFIGAAGLLGLGGGSLSLVGQLGGQTGGAFSYCLVSRGSDASGSL 308

Query: 305 EFGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMD 364
           EFGRGAMPVG AWVPL+RNP+APSFYYVGLSGLGVGGIRVP+ E IF+LT+LGYGGVVMD
Sbjct: 309 EFGRGAMPVGAAWVPLLRNPQAPSFYYVGLSGLGVGGIRVPVSEDIFQLTELGYGGVVMD 368

Query: 365 TGTAVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGG 424
           TGTAV+R P  AY+A RD FI QTANLPR   VSIFDTCYNL+ FV++RVPTVSFYFSGG
Sbjct: 369 TGTAVSRFPTLAYKALRDAFIAQTANLPRISTVSIFDTCYNLSDFVTIRVPTVSFYFSGG 428

Query: 425 PILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           PILTLPA NFLIPVD  GTFC AFA+S SGLSIIGNIQQEGIQIS DG+NGFVGFGP++C
Sbjct: 429 PILTLPASNFLIPVDDVGTFCLAFASSTSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 480

BLAST of Cla97C01G008360 vs. TrEMBL
Match: tr|A0A0D2T690|A0A0D2T690_GOSRA (Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_008G147500 PE=3 SV=1)

HSP 1 Score: 687.2 bits (1772), Expect = 2.6e-194
Identity = 343/480 (71.46%), Postives = 399/480 (83.12%), Query Frame = 0

Query: 5   LLLLPLLATAVSPVATGPAATYPATQLLNVKDTIKETETRPSILPQDLDIHENYPTFDNS 64
           ++L+ +L   +S  AT   A+YP  QLLNVK T+  T+     +P+ L   E++   D S
Sbjct: 9   IVLVAMLHLTLSSAAT---ASYPDFQLLNVKQTLIGTK-----IPRPLQTSEHHQVSDVS 68

Query: 65  SSQSQWKLKLFHRDKLPLNFDP---DHRRRFKERIERDSKRVSSLLHKLSNGSDE----- 124
            +Q +WKLKL HRDKL  N      DH RRF  R++RD KRV+SLL +LS G        
Sbjct: 69  ETQGKWKLKLVHRDKLSSNTSATFRDHSRRFHARMQRDVKRVASLLRRLSGGGGHDGGAA 128

Query: 125 -QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD 184
            +V DFGSDVVSG +QGSGEYFVRIGVGSPP+SQY+VIDSGSDIVWVQCQPC++CY+QSD
Sbjct: 129 YEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPKSQYMVIDSGSDIVWVQCQPCNQCYRQSD 188

Query: 185 PVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRV 244
           PVFDPA SASYAGISC S+VCDR++N+GCN GRCRYEV YGDGSYT+GTLALETLTFGR 
Sbjct: 189 PVFDPADSASYAGISCSSAVCDRIENSGCNAGRCRYEVLYGDGSYTKGTLALETLTFGRT 248

Query: 245 VIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTL 304
           V++N+AIGCGH+NRGMF+GAAGLLGLGGG++S VGQLGGQTGGAFSYCLVSRG++++G+L
Sbjct: 249 VVKNVAIGCGHINRGMFIGAAGLLGLGGGSLSLVGQLGGQTGGAFSYCLVSRGSDASGSL 308

Query: 305 EFGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMD 364
           EFGRGAMPVG AWVPL+RNP+APSFYYVGLSGLGVGGIRVP+ E IF+LT+LGYGGVVMD
Sbjct: 309 EFGRGAMPVGAAWVPLLRNPQAPSFYYVGLSGLGVGGIRVPVSEDIFQLTELGYGGVVMD 368

Query: 365 TGTAVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGG 424
           TGTAV+R P  AY+A RD FI QTANLPR   VSIFDTCY L+ FV++RVPTVSFYFSGG
Sbjct: 369 TGTAVSRFPTLAYKALRDAFIAQTANLPRISTVSIFDTCYKLSDFVTIRVPTVSFYFSGG 428

Query: 425 PILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           PILTLPA NFLIPVD  GTFC AFA+S SGLSIIGNIQQEGIQIS DG+NGFVGFGP++C
Sbjct: 429 PILTLPASNFLIPVDDVGTFCLAFASSTSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 480

BLAST of Cla97C01G008360 vs. TrEMBL
Match: tr|A0A061E1W9|A0A061E1W9_THECC (Aspartic proteinase nepenthesin-1, putative OS=Theobroma cacao OX=3641 GN=TCM_007665 PE=3 SV=1)

HSP 1 Score: 684.1 bits (1764), Expect = 2.2e-193
Identity = 346/480 (72.08%), Postives = 402/480 (83.75%), Query Frame = 0

Query: 5   LLLLPLLATAVSPVATGPAATYPATQLLNVKDTIKETETRPSILPQDLDIHENYPTFDNS 64
           ++L+ +L   VS +AT   A++P  QLLNVK T+  T+ +P+ L +  + HE   +  + 
Sbjct: 9   MILVAVLQLTVSGIAT---ASHPDFQLLNVKQTLIGTK-KPTPL-KTFEYHEQ--SNASE 68

Query: 65  SSQSQWKLKLFHRDKLPLNFDP---DHRRRFKERIERDSKRVSSLLHKLSNGSDE----- 124
           S Q +WKLKL HRDKL  N      DH  RF  R++RD KRV+SL+  LS G        
Sbjct: 69  SDQGKWKLKLVHRDKLFSNTTTAFHDHSHRFLARMQRDVKRVASLVRLLSGGGGHDGDAA 128

Query: 125 -QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD 184
            +V DFGSDVVSG +QGSGEYFVRIGVGSPPRSQY+VIDSGSDIVWVQCQPC++CY+QSD
Sbjct: 129 YEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCYRQSD 188

Query: 185 PVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRV 244
           PVFDPA SASY+G+SC SSVCDR++N+GC+ GRCRYEV YGDGSYT+GTLALETLTFGR 
Sbjct: 189 PVFDPANSASYSGVSCTSSVCDRIENSGCHAGRCRYEVMYGDGSYTKGTLALETLTFGRT 248

Query: 245 VIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTL 304
           V++N+AIGCGH+NRGMF+GAAGLLG+GGG+MS VGQLGGQTGGAFSYCLVSRG++++G+L
Sbjct: 249 VVKNVAIGCGHINRGMFIGAAGLLGVGGGSMSLVGQLGGQTGGAFSYCLVSRGSDASGSL 308

Query: 305 EFGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMD 364
            FGRGAMPVG AWVPL+RNPRAPSFYYVGLSGLGVGGIRVP+ E  F L++LGYGGVVMD
Sbjct: 309 VFGRGAMPVGAAWVPLLRNPRAPSFYYVGLSGLGVGGIRVPVSEDTFRLSELGYGGVVMD 368

Query: 365 TGTAVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGG 424
           TGTAVTR P  AY AFRD F+ QTANLPR+  VSIFDTCYNL+GFVSVRVPTVSFYFSGG
Sbjct: 369 TGTAVTRFPTLAYNAFRDAFVAQTANLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGG 428

Query: 425 PILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           PILTLPARNFLIPVD  GTFCFAFA+SASGLSIIGNIQQEGIQIS DG+NGFVGFGP++C
Sbjct: 429 PILTLPARNFLIPVDDVGTFCFAFASSASGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481

BLAST of Cla97C01G008360 vs. Swiss-Prot
Match: sp|Q9LHE3|ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 633.3 bits (1632), Expect = 2.2e-180
Identity = 303/439 (69.02%), Postives = 362/439 (82.46%), Query Frame = 0

Query: 47  ILPQDLDIHENYPTFDNS----SSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKR 106
           +L   L +    P F+N+     S S++ L+L HRD+ P     +H  R   R+ RD+ R
Sbjct: 32  VLQPPLTVTATLPDFNNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDR 91

Query: 107 VSSLLHKL------SNGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSG 166
           VS++L ++      S+ S  +V DFGSD+VSG +QGSGEYFVRIGVGSPPR QY+VIDSG
Sbjct: 92  VSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSG 151

Query: 167 SDIVWVQCQPCSECYQQSDPVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYG 226
           SD+VWVQCQPC  CY+QSDPVFDPA S SY G+SC SSVCDR++N+GC+ G CRYEV YG
Sbjct: 152 SDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYG 211

Query: 227 DGSYTRGTLALETLTFGRVVIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQT 286
           DGSYT+GTLALETLTF + V+RN+A+GCGH NRGMF+GAAGLLG+GGG+MSFVGQL GQT
Sbjct: 212 DGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQT 271

Query: 287 GGAFSYCLVSRGTESTGTLEFGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP 346
           GGAF YCLVSRGT+STG+L FGR A+PVG +WVPL+RNPRAPSFYYVGL GLGVGG+R+P
Sbjct: 272 GGAFGYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIP 331

Query: 347 IPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYN 406
           +P+ +F+LT+ G GGVVMDTGTAVTRLP  AY AFRD F  QTANLPR+  VSIFDTCY+
Sbjct: 332 LPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYD 391

Query: 407 LNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEG 466
           L+GFVSVRVPTVSFYF+ GP+LTLPARNFL+PVD  GT+CFAFAAS +GLSIIGNIQQEG
Sbjct: 392 LSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEG 451

Query: 467 IQISIDGSNGFVGFGPSIC 476
           IQ+S DG+NGFVGFGP++C
Sbjct: 452 IQVSFDGANGFVGFGPNVC 470

BLAST of Cla97C01G008360 vs. Swiss-Prot
Match: sp|Q9LS40|ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 403.7 bits (1036), Expect = 2.9e-111
Identity = 220/500 (44.00%), Postives = 312/500 (62.40%), Query Frame = 0

Query: 4   FLLLLPLLATAVSPVATGPA----ATYPATQLLNVKDTIKETETRPSILPQDLDIHE--- 63
           FL LL ++  ++    T  +    +T P T +L+V  ++++T+T  S+ P    +     
Sbjct: 6   FLSLLAVVTLSLFLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKP 65

Query: 64  ---NYPTFDNSSSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKRVSSLLHKLS-- 123
              + P F NSS  S   L+L  RD    +   D++     R+ERDS RV+ ++ K+   
Sbjct: 66  ESLSDPVFFNSS--SPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFA 125

Query: 124 -NGSDE-------------QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGS 183
             G D              Q  D  + VVSG  QGSGEYF RIGVG+P +  Y+V+D+GS
Sbjct: 126 VEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGS 185

Query: 184 DIVWVQCQPCSECYQQSDPVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGD 243
           D+ W+QC+PC++CYQQSDPVF+P  S++Y  ++C +  C  L+ + C   +C Y+VSYGD
Sbjct: 186 DVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGD 245

Query: 244 GSYTRGTLALETLTFGRV-VIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQT 303
           GS+T G LA +T+TFG    I N+A+GCGH N G+F GAAGLLGLGGG +S   Q+    
Sbjct: 246 GSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQM---K 305

Query: 304 GGAFSYCLVSRGTESTGTLEFGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP 363
             +FSYCLV R +  + +L+F    +  G A  PL+RN +  +FYYVGLSG  VGG +V 
Sbjct: 306 ATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVV 365

Query: 364 IPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPR-SGRVSIFDTCY 423
           +P+ IF++   G GGV++D GTAVTRL   AY + RD F+  T NL + S  +S+FDTCY
Sbjct: 366 LPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCY 425

Query: 424 NLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQE 476
           + +   +V+VPTV+F+F+GG  L LPA+N+LIPVD  GTFCFAFA ++S LSIIGN+QQ+
Sbjct: 426 DFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQ 485

BLAST of Cla97C01G008360 vs. Swiss-Prot
Match: sp|Q9LNJ3|APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 391.7 bits (1005), Expect = 1.1e-107
Identity = 210/412 (50.97%), Postives = 267/412 (64.81%), Query Frame = 0

Query: 74  LFHRDKLPLNFDPDHRRRFKERIERDSKRVSSLLHKLSNGSDEQVT------DFGSDVVS 133
           L H D L  N  PD    F  R++RDS+RV S+    +      VT       F S VVS
Sbjct: 76  LDHIDALSSNKTPD--ELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVS 135

Query: 134 GTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSASYA 193
           G  QGSGEYF R+GVG+P R  Y+V+D+GSDIVW+QC PC  CY QSDP+FDP  S +YA
Sbjct: 136 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYA 195

Query: 194 GISCDSSVCDRLDNAGCNDGR--CRYEVSYGDGSYTRGTLALETLTFGRVVIRNIAIGCG 253
            I C S  C RLD+AGCN  R  C Y+VSYGDGS+T G  + ETLTF R  ++ +A+GCG
Sbjct: 196 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCG 255

Query: 254 HMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES-TGTLEFGRGAMPV 313
           H N G+FVGAAGLLGLG G +SF GQ G +    FSYCLV R   S   ++ FG  A+  
Sbjct: 256 HDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSR 315

Query: 314 GGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP-IPEQIFELTDLGYGGVVMDTGTAVTRL 373
              + PL+ NP+  +FYYVGL G+ VGG RVP +   +F+L  +G GGV++D+GT+VTRL
Sbjct: 316 IARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRL 375

Query: 374 PAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPAR 433
             PAY A RD F      L R+   S+FDTC++L+    V+VPTV  +F G  + +LPA 
Sbjct: 376 IRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV-SLPAT 435

Query: 434 NFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           N+LIPVD+ G FCFAFA +  GLSIIGNIQQ+G ++  D ++  VGF P  C
Sbjct: 436 NYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Cla97C01G008360 vs. Swiss-Prot
Match: sp|Q766C3|NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 257.7 bits (657), Expect = 2.6e-67
Identity = 147/386 (38.08%), Postives = 219/386 (56.74%), Query Frame = 0

Query: 96  IERDSKRVSSLLHKLSNGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDS 155
           IER S+R+  L         E + +  S V +    G GEY + + +G+P +    ++D+
Sbjct: 64  IERGSRRLQRL---------EAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDT 123

Query: 156 GSDIVWVQCQPCSECYQQSDPVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSY 215
           GSD++W QCQPC++C+ QS P+F+P GS+S++ + C S +C  L +  C++  C+Y   Y
Sbjct: 124 GSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGY 183

Query: 216 GDGSYTRGTLALETLTFGRVVIRNIAIGCGHMNRGMFVG-AAGLLGLGGGAMSFVGQLGG 275
           GDGS T+G++  ETLTFG V I NI  GCG  N+G   G  AGL+G+G G +S   QL  
Sbjct: 184 GDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDV 243

Query: 276 QTGGAFSYCLVSRGTESTGTLEFG--RGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGG 335
                FSYC+   G+ +   L  G    ++  G     LI++ + P+FYY+ L+GL VG 
Sbjct: 244 T---KFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGS 303

Query: 336 IRVPIPEQIFEL-TDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLP-RSGRVSI 395
            R+PI    F L ++ G GG+++D+GT +T     AY++ R  FI Q  NLP  +G  S 
Sbjct: 304 TRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQ-INLPVVNGSSSG 363

Query: 396 FDTCYNLNGFVS-VRVPTVSFYFSGGPILTLPARNFLIPVDSEGTFCFAFAASASGLSII 455
           FD C+      S +++PT   +F GG  L LP+ N+ I   S G  C A  +S+ G+SI 
Sbjct: 364 FDLCFQTPSDPSNLQIPTFVMHFDGGD-LELPSENYFIS-PSNGLICLAMGSSSQGMSIF 423

Query: 456 GNIQQEGIQISIDGSNGFVGFGPSIC 476
           GNIQQ+ + +  D  N  V F  + C
Sbjct: 424 GNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Cla97C01G008360 vs. Swiss-Prot
Match: sp|Q766C2|NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 256.5 bits (654), Expect = 5.7e-67
Identity = 150/389 (38.56%), Postives = 216/389 (55.53%), Query Frame = 0

Query: 93  KERIERDSKRVSSLLHKLSNGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVV 152
           K  I+R  +R+ S+   L + S  +   +          G GEY + + +G+P  S   +
Sbjct: 62  KRAIKRGERRMRSINAMLQSSSGIETPVYA---------GDGEYLMNVAIGTPDSSFSAI 121

Query: 153 IDSGSDIVWVQCQPCSECYQQSDPVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYE 212
           +D+GSD++W QC+PC++C+ Q  P+F+P  S+S++ + C+S  C  L +  CN+  C+Y 
Sbjct: 122 MDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYT 181

Query: 213 VSYGDGSYTRGTLALETLTFGRVVIRNIAIGCGHMNRGMFVG-AAGLLGLGGGAMSFVGQ 272
             YGDGS T+G +A ET TF    + NIA GCG  N+G   G  AGL+G+G G +S   Q
Sbjct: 182 YGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQ 241

Query: 273 LGGQTGGAFSYCLVSRGTESTGTLEFGRGA--MPVGGAWVPLIRNPRAPSFYYVGLSGLG 332
           LG    G FSYC+ S G+ S  TL  G  A  +P G     LI +   P++YY+ L G+ 
Sbjct: 242 LG---VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGIT 301

Query: 333 VGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSGRVS 392
           VGG  + IP   F+L D G GG+++D+GT +T LP  AY A    F  Q  NLP     S
Sbjct: 302 VGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQ-INLPTVDESS 361

Query: 393 I-FDTCYNL-NGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDSEGTFCFAFAASAS-GL 452
               TC+   +   +V+VP +S  F GG +L L  +N LI   +EG  C A  +S+  G+
Sbjct: 362 SGLSTCFQQPSDGSTVQVPEISMQFDGG-VLNLGEQNILIS-PAEGVICLAMGSSSQLGI 421

Query: 453 SIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           SI GNIQQ+  Q+  D  N  V F P+ C
Sbjct: 422 SIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Cla97C01G008360 vs. TAIR10
Match: AT3G20015.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 633.3 bits (1632), Expect = 1.2e-181
Identity = 303/439 (69.02%), Postives = 362/439 (82.46%), Query Frame = 0

Query: 47  ILPQDLDIHENYPTFDNS----SSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKR 106
           +L   L +    P F+N+     S S++ L+L HRD+ P     +H  R   R+ RD+ R
Sbjct: 32  VLQPPLTVTATLPDFNNTHFSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDR 91

Query: 107 VSSLLHKL------SNGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSG 166
           VS++L ++      S+ S  +V DFGSD+VSG +QGSGEYFVRIGVGSPPR QY+VIDSG
Sbjct: 92  VSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSG 151

Query: 167 SDIVWVQCQPCSECYQQSDPVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYG 226
           SD+VWVQCQPC  CY+QSDPVFDPA S SY G+SC SSVCDR++N+GC+ G CRYEV YG
Sbjct: 152 SDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYG 211

Query: 227 DGSYTRGTLALETLTFGRVVIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQT 286
           DGSYT+GTLALETLTF + V+RN+A+GCGH NRGMF+GAAGLLG+GGG+MSFVGQL GQT
Sbjct: 212 DGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQT 271

Query: 287 GGAFSYCLVSRGTESTGTLEFGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP 346
           GGAF YCLVSRGT+STG+L FGR A+PVG +WVPL+RNPRAPSFYYVGL GLGVGG+R+P
Sbjct: 272 GGAFGYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIP 331

Query: 347 IPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYN 406
           +P+ +F+LT+ G GGVVMDTGTAVTRLP  AY AFRD F  QTANLPR+  VSIFDTCY+
Sbjct: 332 LPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYD 391

Query: 407 LNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEG 466
           L+GFVSVRVPTVSFYF+ GP+LTLPARNFL+PVD  GT+CFAFAAS +GLSIIGNIQQEG
Sbjct: 392 LSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEG 451

Query: 467 IQISIDGSNGFVGFGPSIC 476
           IQ+S DG+NGFVGFGP++C
Sbjct: 452 IQVSFDGANGFVGFGPNVC 470

BLAST of Cla97C01G008360 vs. TAIR10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 404.1 bits (1037), Expect = 1.2e-112
Identity = 204/469 (43.50%), Postives = 296/469 (63.11%), Query Frame = 0

Query: 22  PAATYPATQLLNVKDTIKETETRPSILPQDLDIHENYPTFDNSSSQSQWKLKLFHRDKLP 81
           P  +   T +LNV D+I  T+   S          N       S+ S + L+L  R  + 
Sbjct: 26  PETSTTTTSILNVADSIHRTKYTSS-------FRLNQQEEQTHSASSSFSLQLHSRVSVR 85

Query: 82  LNFDPDHRRRFKERIERDSKRVSSLLHKL---------------SNGSDEQVTDFGSDVV 141
                D++     R+ RD+ RV SL+ +L               S     +  D  + ++
Sbjct: 86  GTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPISTMYTTEEQDIEAPLI 145

Query: 142 SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSASY 201
           SGT QGSGEYF R+G+G P R  Y+V+D+GSD+ W+QC PC++CY Q++P+F+P+ S+SY
Sbjct: 146 SGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSY 205

Query: 202 AGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVVIRNIAIGCGH 261
             +SCD+  C+ L+ + C +  C YEVSYGDGSYT G  A ETLT G  +++N+A+GCGH
Sbjct: 206 EPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVGCGH 265

Query: 262 MNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVGG 321
            N G+FVGAAGLLGLGGG ++   QL      +FSYCLV R ++S  T++FG    P   
Sbjct: 266 SNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVDFGTSLSP-DA 325

Query: 322 AWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAP 381
              PL+RN +  +FYY+GL+G+ VGG  + IP+  FE+ + G GG+++D+GTAVTRL   
Sbjct: 326 VVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTE 385

Query: 382 AYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFL 441
            Y + RD+F+  T +L ++  V++FDTCYNL+   +V VPTV+F+F GG +L LPA+N++
Sbjct: 386 IYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYM 445

Query: 442 IPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           IPVDS GTFC AFA +AS L+IIGN+QQ+G +++ D +N  +GF  + C
Sbjct: 446 IPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483

BLAST of Cla97C01G008360 vs. TAIR10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 403.7 bits (1036), Expect = 1.6e-112
Identity = 220/500 (44.00%), Postives = 312/500 (62.40%), Query Frame = 0

Query: 4   FLLLLPLLATAVSPVATGPA----ATYPATQLLNVKDTIKETETRPSILPQDLDIHE--- 63
           FL LL ++  ++    T  +    +T P T +L+V  ++++T+T  S+ P    +     
Sbjct: 6   FLSLLAVVTLSLFLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKP 65

Query: 64  ---NYPTFDNSSSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKRVSSLLHKLS-- 123
              + P F NSS  S   L+L  RD    +   D++     R+ERDS RV+ ++ K+   
Sbjct: 66  ESLSDPVFFNSS--SPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFA 125

Query: 124 -NGSDE-------------QVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGS 183
             G D              Q  D  + VVSG  QGSGEYF RIGVG+P +  Y+V+D+GS
Sbjct: 126 VEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGS 185

Query: 184 DIVWVQCQPCSECYQQSDPVFDPAGSASYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGD 243
           D+ W+QC+PC++CYQQSDPVF+P  S++Y  ++C +  C  L+ + C   +C Y+VSYGD
Sbjct: 186 DVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGD 245

Query: 244 GSYTRGTLALETLTFGRV-VIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQT 303
           GS+T G LA +T+TFG    I N+A+GCGH N G+F GAAGLLGLGGG +S   Q+    
Sbjct: 246 GSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQM---K 305

Query: 304 GGAFSYCLVSRGTESTGTLEFGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP 363
             +FSYCLV R +  + +L+F    +  G A  PL+RN +  +FYYVGLSG  VGG +V 
Sbjct: 306 ATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVV 365

Query: 364 IPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPR-SGRVSIFDTCY 423
           +P+ IF++   G GGV++D GTAVTRL   AY + RD F+  T NL + S  +S+FDTCY
Sbjct: 366 LPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCY 425

Query: 424 NLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQE 476
           + +   +V+VPTV+F+F+GG  L LPA+N+LIPVD  GTFCFAFA ++S LSIIGN+QQ+
Sbjct: 426 DFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQ 485

BLAST of Cla97C01G008360 vs. TAIR10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 391.7 bits (1005), Expect = 6.3e-109
Identity = 210/412 (50.97%), Postives = 267/412 (64.81%), Query Frame = 0

Query: 74  LFHRDKLPLNFDPDHRRRFKERIERDSKRVSSLLHKLSNGSDEQVT------DFGSDVVS 133
           L H D L  N  PD    F  R++RDS+RV S+    +      VT       F S VVS
Sbjct: 76  LDHIDALSSNKTPD--ELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVS 135

Query: 134 GTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSASYA 193
           G  QGSGEYF R+GVG+P R  Y+V+D+GSDIVW+QC PC  CY QSDP+FDP  S +YA
Sbjct: 136 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYA 195

Query: 194 GISCDSSVCDRLDNAGCNDGR--CRYEVSYGDGSYTRGTLALETLTFGRVVIRNIAIGCG 253
            I C S  C RLD+AGCN  R  C Y+VSYGDGS+T G  + ETLTF R  ++ +A+GCG
Sbjct: 196 TIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCG 255

Query: 254 HMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTES-TGTLEFGRGAMPV 313
           H N G+FVGAAGLLGLG G +SF GQ G +    FSYCLV R   S   ++ FG  A+  
Sbjct: 256 HDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSR 315

Query: 314 GGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP-IPEQIFELTDLGYGGVVMDTGTAVTRL 373
              + PL+ NP+  +FYYVGL G+ VGG RVP +   +F+L  +G GGV++D+GT+VTRL
Sbjct: 316 IARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRL 375

Query: 374 PAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPAR 433
             PAY A RD F      L R+   S+FDTC++L+    V+VPTV  +F G  + +LPA 
Sbjct: 376 IRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV-SLPAT 435

Query: 434 NFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPSIC 476
           N+LIPVD+ G FCFAFA +  GLSIIGNIQQ+G ++  D ++  VGF P  C
Sbjct: 436 NYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of Cla97C01G008360 vs. TAIR10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 364.0 bits (933), Expect = 1.4e-100
Identity = 198/430 (46.05%), Postives = 267/430 (62.09%), Query Frame = 0

Query: 64  SSSQSQWKLKLFHRDKLPLNFDPDHRRRFKERIERDSKRVSSL--LHKLSNG------SD 123
           S S +   + L H D L    D      F  R++RDS RV S+  L  +S G      + 
Sbjct: 55  SESTTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTP 114

Query: 124 EQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD 183
                F   V+SG  QGSGEYF+R+GVG+P  + Y+V+D+GSD+VW+QC PC  CY Q+D
Sbjct: 115 RTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTD 174

Query: 184 PVFDPAGSASYAGISCDSSVCDRLDNAG-CNDGR---CRYEVSYGDGSYTRGTLALETLT 243
            +FDP  S ++A + C S +C RLD++  C   R   C Y+VSYGDGS+T G  + ETLT
Sbjct: 175 AIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT 234

Query: 244 FGRVVIRNIAIGCGHMNRGMFVGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSR---- 303
           F    + ++ +GCGH N G+FVGAAGLLGLG G +SF  Q   +  G FSYCLV R    
Sbjct: 235 FHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSG 294

Query: 304 -GTESTGTLEFGRGAMPVGGAWVPLIRNPRAPSFYYVGLSGLGVGGIRVP-IPEQIFELT 363
             ++   T+ FG  A+P    + PL+ NP+  +FYY+ L G+ VGG RVP + E  F+L 
Sbjct: 295 SSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLD 354

Query: 364 DLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSGRVSIFDTCYNLNGFVSVRV 423
             G GGV++D+GT+VTRL  PAY A RD F      L R+   S+FDTC++L+G  +V+V
Sbjct: 355 ATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKV 414

Query: 424 PTVSFYFSGGPILTLPARNFLIPVDSEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSN 476
           PTV F+F GG + +LPA N+LIPV++EG FCFAFA +   LSIIGNIQQ+G +++ D   
Sbjct: 415 PTVVFHFGGGEV-SLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVG 474

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008465249.17.9e-25994.76PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis melo][more]
XP_004150193.18.7e-25894.76PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus][more]
XP_023514620.13.1e-24791.21protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucurbita pepo subsp. pepo][more]
XP_022990025.12.0e-24690.61protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucurbita maxima][more]
XP_022921534.18.5e-24589.58protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
tr|A0A1S3CNF7|A0A1S3CNF7_CUCME5.2e-25994.76protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Cucumis melo OX=3656 GN=LOC10350290... [more]
tr|A0A2P5R9B1|A0A2P5R9B1_GOSBA1.1e-19571.88Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_DD19674 PE=3 SV... [more]
tr|A0A1U8N9X5|A0A1U8N9X5_GOSHI5.3e-19571.67protein ASPARTIC PROTEASE IN GUARD CELL 2-like OS=Gossypium hirsutum OX=3635 GN=... [more]
tr|A0A0D2T690|A0A0D2T690_GOSRA2.6e-19471.46Uncharacterized protein OS=Gossypium raimondii OX=29730 GN=B456_008G147500 PE=3 ... [more]
tr|A0A061E1W9|A0A061E1W9_THECC2.2e-19372.08Aspartic proteinase nepenthesin-1, putative OS=Theobroma cacao OX=3641 GN=TCM_00... [more]
Match NameE-valueIdentityDescription
sp|Q9LHE3|ASPG2_ARATH2.2e-18069.02Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
sp|Q9LS40|ASPG1_ARATH2.9e-11144.00Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
sp|Q9LNJ3|APF2_ARATH1.1e-10750.97Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
sp|Q766C3|NEP1_NEPGR2.6e-6738.08Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
sp|Q766C2|NEP2_NEPGR5.7e-6738.56Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Match NameE-valueIdentityDescription
AT3G20015.11.2e-18169.02Eukaryotic aspartyl protease family protein[more]
AT1G25510.11.2e-11243.50Eukaryotic aspartyl protease family protein[more]
AT3G18490.11.6e-11244.00Eukaryotic aspartyl protease family protein[more]
AT1G01300.16.3e-10950.97Eukaryotic aspartyl protease family protein[more]
AT3G61820.11.4e-10046.05Eukaryotic aspartyl protease family protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: INTERPRO
TermDefinition
IPR033121PEPTIDASE_A1
IPR001969Aspartic_peptidase_AS
IPR001461Aspartic_peptidase_A1
IPR032861TAXi_N
IPR021109Peptidase_aspartic_dom_sf
IPR032799TAXi_C
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0030163 protein catabolic process
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0008233 peptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G008360.1Cla97C01G008360.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 321..471
e-value: 2.3E-30
score: 105.4
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 307..475
e-value: 4.7E-43
score: 148.8
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 111..298
e-value: 2.9E-48
score: 166.5
IPR021109Aspartic peptidase domain superfamilySUPERFAMILYSSF50630Acid proteasescoord: 131..475
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 136..298
e-value: 1.3E-50
score: 172.1
NoneNo IPR availablePANTHERPTHR13683:SF265PROTEIN ASPARTIC PROTEASE IN GUARD CELL 2coord: 55..475
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 55..475
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 151..162
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 136..471
score: 44.887