Cla97C05G097040 (gene) Watermelon (97103) v2

NameCla97C05G097040
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionAspartyl protease family protein
LocationCla97Chr05 : 26382599 .. 26386791 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGTCTCTTTCTTCAATGCTTTTCTTTGCTTTTTGTTTCCTTTTCTTTCAATCTTTTGCCGGAAAAGTCTCGCCTGACTCCCACCGCCTCACCGTCGAACTCGCCGGACTTCTCCCCTCCGCCTCCTGTACCCACCGCTCCCCCCAAGGTCTCTCTTTTTCTCCTCTTTCCTCTCTTTACAATTTTCCAGTTTATTAATTAAATTTCATTGTTAATTTTTTGACACATTTATCCCCTTTTAAATTATTGGACTGGAGTTTTTATTTTAGTAGAATTAAAAGTTAATTTTCTCTGAACCAGGAAGATCTTTTTTTTTTTTTAATAATATATATATTATAGCATTTGGATTTGGATTTGGATGTTATGTTAATATATTTGGGCACGAATCACCATTAATTTCTCTGTCAAACATATGCTTACGTGTCCTGAACATAGCCATGACTATTTTTCCAACTTTTAATGTGTATTTTTATACTTCACTTTTTAATTATAGCTCAGGGAGAAAATAAAAGATTCCAAATTTGAGTAATTTGAAGAAAGAAATAATAATAATAACAATAATAATTAAAAGATCAATTGGGTCTGTCTGTCTGCCTCTCAGCTTTTATTATTATTTTTTTTTCTTTTTGTCAAGATTGAAGAAATTTTCATCGGTGGGGTTGTTGGTTTTGTGTCTAAAAAGTTAAAGGGAAAATCAATTTATTGAAAAGAAAAGAAAAAAAAAGTCAATAAATGGGCCCAAATTTCAAATTATTAAAATACGGCCACAAATTATTTGCTTAATTATGGATTTTACTTACCTTTTTTGATAATAGTTTCTTTATATGTAATGCACTTTGCAAAAAAGATTTTCAAGATTTCTACCAATTTTACGAGTTCATAGAATATATATATATTTGAGGAAGAAAATCTTTCTTGAAATCTTGCTAAATATTTTTATATGCTCTTAAAATATGGGAAAAAAGAATTAAATATTGAAATGAGTTATACTACTAGATGAGCTTAAATTTGAAGTAAGCATGTTTCCAAACAAGTCAAATGATAGGGATGGAATAATTATTTTTAATAAAAAAAAAATTTGTTTATATTTAGAATTGGAAAAATAACTCATACATAGAGAGTCAAAAGTACAAAAAGACAAAGAAGTACACCGTCAGAGGATGCAAATATTCCGCAATCTCATACGGCTGCATAATTAAAGCTCCAACCGAAGCCGCCTCACTAATTGCATTTTCGAAACTATCAACCGCTCTATGTGGGCCTGTTGAGGCCCATATTATGAATACTGCCCATTATTAGCCCAAAACCGTAATCATATTTTACAATTTATTTTATATTTTATTTTCTTTGAAACTATATATCTTCTTCTTCTTCTTTTATTATTATTTTTTTAAAAGATAATTATTCATATGTTATGATGTATTTGTGAATGTTCTGAATATTAAAATTATAATCATTTTACTTTAACCGATCTTAACATAAGTAAAATAGTCAAGAATTCACTCTTTTTATTTGCCATATAATTATTCATATGTTATAGAACTTTTTCTACCTTTATTTTAATTGAATGAAAAACTATATGTATAATTTTTTTTTTTTTTTTTTTGTATTATTTACTTCGAGATTAGATAGTTAAATTATAGATTGTCATGTGACAATTTTTTTTTTAGAAAAATCTTTTATATTTTTTTTCCTTCACAATTGAAATACTCATTTTTTGTTGGTGGCTAGTAAAGTAGGTATAGTTCATTGATAAGTGACAAGTAATATCTTGCACGAGATCGGAATTTCAAAACTTCATCCCATTTACTAATTAGCAAGTAGGATGAACAAGAAAAATTCATATATTTTGGTTGTTGTGGGCTTGTGGTAGCCCATATTTCAGGTGTAAGACAACAATCATCACTAGAGGTAATTCATCGGCATGGGCCATGTGGCAGCGAAGTTTCAAAAGCTCCGACAGCTATCGAAATGTTCGTGCAAGATCAGGCTCGAGTGGACTTCATCCACTCAAAATTCTCAGGGGAGCTCAAATCTATCGACCGTTTACGACCATCAAAGGCCACAAAAATTCCGGCAAAGTCCGGTGCCACAATTGGTTCTGGCAATTACATAGTGAACGTTGGTTTAGGGACGCCAAAGAAATACTTGTCGCTTATATTTGACACCGGTAGTGATCTGACCTGGACTCAGTGCGAGCCTTGTGCTAGATATTGCTACAATCAAAAGGACCCTGTTTTCGCCCCTTCTCAATCCACCTCCTATTCCAACATTTCTTGTTCTTCGTCTGAATGCTCTCAGCTTGAATCGGGCACCGGTAATTTCATTTCTCTTTACTTACTCTTTTATAAAGAATTTCTAAGACTTGATTCCGTAAAAAATTTTGTTTCTGATTTTTGTGTTTAAAAATTAAGCTTTTTTTTTTTTTTTAATTTCTTAATGATTTGTATTTTTTAAAAAATAAGACAATCTTTTAAAAACTATTGTCTCTCATTTTTAAATTTTAGCTTATTTTATTAAAATAATGATAAAAAAGATGAAGAAGAAAAAGCAAGAATTTATAAACTTAATTTTGAAAACTAAAAATAAAAAATATATATTACTAATCAAGACATACACTTTGTAGATTTCTATAGTTTCTCTAATGCCTCTTTAAATTTTCACACAAAATGAAAGCTAGCTATTTTAATCAAAGCTTAATTTATATTTTAGTGGCCAACTTAGTTTTTGAGTTAAATTATAATTGTAGTCTCTAAATTTTGTAATAGGTTTATTTTGGTTTTTAACATCCAAAACTATATAATAAATCCATAAATTCTTAATTTGTGTCTACTATGTCTCTAGATCTTTAATATTGTGTATAAATATTTCAAAATATTGAAAATTATGAAAAGTTAATTGATATGTTAGATGTAAATTTGAAATTTAGAGCTTTTAATTTTATATTTAGTAAATCAAAGACTCTTATGAAATATCAAAATTTAATATTAAATTTGTAATTTAAAAAATTCAAACATAAAAAAGACACATTCCTAAAAGTTTAGGATTATAATTATAATTTAACTTTTATTTTAACTTAAAGGTGTGGGATTTAAAATTTGGGAGGATAATTATAGAATTTAAATAGAGTAACAAATGTCTAAAATTTAACGAAAAATTATCTTTTTTGCACATAAATTTTTAAATATTATATTTTTAGTTGTTTTAAGTTTTGAATTTGATTTCAATTAACAAACAAATAAAATTAGACCCAAATTTCGAAACCATGATTGAGGTATTTTCCCCAAAATTTAATTAATCAAATTTAGCTTGACAATTAATGACTTGAAAGGTTAGTTAACAATCGATGTTTTGCAAATACAGGGAACCAGCCTGGTTGCTCCGCCGCGAAAGCTTGCATTTACGGAATACAATACGGCGATCAATCTTTCTCCGTCGGATATTTCGCCAGAGAAACTCTAACCTTAACCTCCACCGACGTTATCGACAATTTCCTCTTCGGTTGCGGCCAAAACAACCGTGGACTCTTCGGCAGCGCCGCCGGTCTCATCGGCCTCGGCCAGGACAAAATCTCCATCGTCAAACAAACCGCCCAAAAGTACGGCCAGATCTTCGCCTACTGCCTCCCTAAAACCTCGAGTTCCACCGGTTACCTAACCTTCGGCGACGGCGGCGGCGGCGGCGGCGCTCTGAAATACACGCCGATCACAAAAGCACACGGCGTCGCGAATTTCTACGGCGTCGACATCGTAGGAATGAAGGTCGGCGGAACTCAGATTCCGATCTCGCCATCAGTCTTCTCGACCTCCGGCGCAATCATCGATTCCGGCACGGTTATCACACGGCTGCCGCCGGATGCGTACAGCGCCTTGAAATCAGCGTTTCAGAAAGGAATGGCGAAGTATCCAAAGGCGCCGGAGCTTTCTATTTTGGACACGTGTTACGATCTGAGCAAATATAGCATCGTAGAGATCCCTAAAGTGTCGTTTCTTTTCAAAGGAGGAACGGAACTCGATCTGGACAGCACCGGAATCTTGTACGGAGCAGCGACGAGGCAGGTGTGTCTGGCGTTCGCCGGTAATCAAGATCCGAGCACCGTCACCATTATTGGGAACGTGCAGCAGAAGACACTGCAGGTGGTTTACGATGTCGGTGGAGGGAAGATTGGGTTTGGTTATAATGGTTGTTAG

mRNA sequence

ATGGCGTCTCTTTCTTCAATGCTTTTCTTTGCTTTTTGTTTCCTTTTCTTTCAATCTTTTGCCGGAAAAGTCTCGCCTGACTCCCACCGCCTCACCGTCGAACTCGCCGGACTTCTCCCCTCCGCCTCCTGTACCCACCGCTCCCCCCAAGCCCATATTTCAGGTGTAAGACAACAATCATCACTAGAGGTAATTCATCGGCATGGGCCATGTGGCAGCGAAGTTTCAAAAGCTCCGACAGCTATCGAAATGTTCGTGCAAGATCAGGCTCGAGTGGACTTCATCCACTCAAAATTCTCAGGGGAGCTCAAATCTATCGACCGTTTACGACCATCAAAGGCCACAAAAATTCCGGCAAAGTCCGGTGCCACAATTGGTTCTGGCAATTACATAGTGAACGTTGGTTTAGGGACGCCAAAGAAATACTTGTCGCTTATATTTGACACCGGTAGTGATCTGACCTGGACTCAGTGCGAGCCTTGTGCTAGATATTGCTACAATCAAAAGGACCCTGTTTTCGCCCCTTCTCAATCCACCTCCTATTCCAACATTTCTTGTTCTTCGTCTGAATGCTCTCAGCTTGAATCGGGCACCGGGAACCAGCCTGGTTGCTCCGCCGCGAAAGCTTGCATTTACGGAATACAATACGGCGATCAATCTTTCTCCGTCGGATATTTCGCCAGAGAAACTCTAACCTTAACCTCCACCGACGTTATCGACAATTTCCTCTTCGGTTGCGGCCAAAACAACCGTGGACTCTTCGGCAGCGCCGCCGGTCTCATCGGCCTCGGCCAGGACAAAATCTCCATCGTCAAACAAACCGCCCAAAAGTACGGCCAGATCTTCGCCTACTGCCTCCCTAAAACCTCGAGTTCCACCGGTTACCTAACCTTCGGCGACGGCGGCGGCGGCGGCGGCGCTCTGAAATACACGCCGATCACAAAAGCACACGGCGTCGCGAATTTCTACGGCGTCGACATCGTAGGAATGAAGGTCGGCGGAACTCAGATTCCGATCTCGCCATCAGTCTTCTCGACCTCCGGCGCAATCATCGATTCCGGCACGGTTATCACACGGCTGCCGCCGGATGCGTACAGCGCCTTGAAATCAGCGTTTCAGAAAGGAATGGCGAAGTATCCAAAGGCGCCGGAGCTTTCTATTTTGGACACGTGTTACGATCTGAGCAAATATAGCATCGTAGAGATCCCTAAAGTGTCGTTTCTTTTCAAAGGAGGAACGGAACTCGATCTGGACAGCACCGGAATCTTGTACGGAGCAGCGACGAGGCAGGTGTGTCTGGCGTTCGCCGGTAATCAAGATCCGAGCACCGTCACCATTATTGGGAACGTGCAGCAGAAGACACTGCAGGTGGTTTACGATGTCGGTGGAGGGAAGATTGGGTTTGGTTATAATGGTTGTTAG

Coding sequence (CDS)

ATGGCGTCTCTTTCTTCAATGCTTTTCTTTGCTTTTTGTTTCCTTTTCTTTCAATCTTTTGCCGGAAAAGTCTCGCCTGACTCCCACCGCCTCACCGTCGAACTCGCCGGACTTCTCCCCTCCGCCTCCTGTACCCACCGCTCCCCCCAAGCCCATATTTCAGGTGTAAGACAACAATCATCACTAGAGGTAATTCATCGGCATGGGCCATGTGGCAGCGAAGTTTCAAAAGCTCCGACAGCTATCGAAATGTTCGTGCAAGATCAGGCTCGAGTGGACTTCATCCACTCAAAATTCTCAGGGGAGCTCAAATCTATCGACCGTTTACGACCATCAAAGGCCACAAAAATTCCGGCAAAGTCCGGTGCCACAATTGGTTCTGGCAATTACATAGTGAACGTTGGTTTAGGGACGCCAAAGAAATACTTGTCGCTTATATTTGACACCGGTAGTGATCTGACCTGGACTCAGTGCGAGCCTTGTGCTAGATATTGCTACAATCAAAAGGACCCTGTTTTCGCCCCTTCTCAATCCACCTCCTATTCCAACATTTCTTGTTCTTCGTCTGAATGCTCTCAGCTTGAATCGGGCACCGGGAACCAGCCTGGTTGCTCCGCCGCGAAAGCTTGCATTTACGGAATACAATACGGCGATCAATCTTTCTCCGTCGGATATTTCGCCAGAGAAACTCTAACCTTAACCTCCACCGACGTTATCGACAATTTCCTCTTCGGTTGCGGCCAAAACAACCGTGGACTCTTCGGCAGCGCCGCCGGTCTCATCGGCCTCGGCCAGGACAAAATCTCCATCGTCAAACAAACCGCCCAAAAGTACGGCCAGATCTTCGCCTACTGCCTCCCTAAAACCTCGAGTTCCACCGGTTACCTAACCTTCGGCGACGGCGGCGGCGGCGGCGGCGCTCTGAAATACACGCCGATCACAAAAGCACACGGCGTCGCGAATTTCTACGGCGTCGACATCGTAGGAATGAAGGTCGGCGGAACTCAGATTCCGATCTCGCCATCAGTCTTCTCGACCTCCGGCGCAATCATCGATTCCGGCACGGTTATCACACGGCTGCCGCCGGATGCGTACAGCGCCTTGAAATCAGCGTTTCAGAAAGGAATGGCGAAGTATCCAAAGGCGCCGGAGCTTTCTATTTTGGACACGTGTTACGATCTGAGCAAATATAGCATCGTAGAGATCCCTAAAGTGTCGTTTCTTTTCAAAGGAGGAACGGAACTCGATCTGGACAGCACCGGAATCTTGTACGGAGCAGCGACGAGGCAGGTGTGTCTGGCGTTCGCCGGTAATCAAGATCCGAGCACCGTCACCATTATTGGGAACGTGCAGCAGAAGACACTGCAGGTGGTTTACGATGTCGGTGGAGGGAAGATTGGGTTTGGTTATAATGGTTGTTAG

Protein sequence

MASLSSMLFFAFCFLFFQSFAGKVSPDSHRLTVELAGLLPSASCTHRSPQAHISGVRQQSSLEVIHRHGPCGSEVSKAPTAIEMFVQDQARVDFIHSKFSGELKSIDRLRPSKATKIPAKSGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLESGTGNQPGCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLTFGDGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITRLPPDAYSALKSAFQKGMAKYPKAPELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGAATRQVCLAFAGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC
BLAST of Cla97C05G097040 vs. NCBI nr
Match: XP_022936652.1 (aspartyl protease family protein At5g10770-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 840.9 bits (2171), Expect = 2.2e-240
Identity = 425/473 (89.85%), Postives = 439/473 (92.81%), Query Frame = 0

Query: 1   MASLSSMLFFAFCFLFFQSFAGKVSPDSHRLTVELAGLLPSASCTHRSPQAHISGVRQQS 60
           MA LSSMLFFAFCFLFF S AGKVSPD+HRLTVELA LLPSA+C  RS QA  SG R QS
Sbjct: 1   MAPLSSMLFFAFCFLFFYSSAGKVSPDAHRLTVELADLLPSAACNRRSVQAQDSGRRTQS 60

Query: 61  SLEVIHRHGPCGSEVSKAPTAIEMFVQDQARVDFIHSKFSGELKSIDRLRPSKATKIPAK 120
           SLEVIHRHGPCG   SKAPTA E+FVQDQARVDFIHSKFSG+ KSIDRLRPSKATKIPAK
Sbjct: 61  SLEVIHRHGPCGGGESKAPTAAELFVQDQARVDFIHSKFSGDFKSIDRLRPSKATKIPAK 120

Query: 121 SGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTS 180
           SGATIGSGNYIVNV LGTPKKYLSLIFDTGSDLTWTQCEPCARYCY+QKDPVFAPSQST+
Sbjct: 121 SGATIGSGNYIVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYSQKDPVFAPSQSTT 180

Query: 181 YSNISCSSSECSQLESGTGNQPGCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVID 240
           YSNI+CSS  CSQLESGTGNQPGCSAAK+CIYGIQYGDQSFSVGYFA+ETLTLT TDVI 
Sbjct: 181 YSNITCSSPICSQLESGTGNQPGCSAAKSCIYGIQYGDQSFSVGYFAKETLTLTPTDVIS 240

Query: 241 NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLTFGD 300
           NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIF+YCLPKTSSSTGYL FG 
Sbjct: 241 NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPKTSSSTGYLAFGG 300

Query: 301 GXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITRL 360
            XXXXXALKYTPITKAHGVANFYGVDIVGMKVGG QIPIS SVFSTSGAIIDSGTVITRL
Sbjct: 301 SXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGIQIPISASVFSTSGAIIDSGTVITRL 360

Query: 361 PPDAYSALKSAFQKGMAKYPKAPELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDST 420
           PPDAYSALKSAFQKGM KYPKAPELSILDTCYDLSKY+ VEIPKV FLFKGGT L+LD T
Sbjct: 361 PPDAYSALKSAFQKGMTKYPKAPELSILDTCYDLSKYTSVEIPKVDFLFKGGTVLELDGT 420

Query: 421 GILYGAATRQVCLAFAGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
           GILYGA+T QVCLAFAGN DPS+V IIGNVQQKTLQVVYDVGGGKIGFGYNGC
Sbjct: 421 GILYGASTTQVCLAFAGNMDPSSVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473

BLAST of Cla97C05G097040 vs. NCBI nr
Match: XP_008452815.1 (PREDICTED: aspartyl protease family protein At5g10770-like [Cucumis melo])

HSP 1 Score: 838.2 bits (2164), Expect = 1.4e-239
Identity = 424/476 (89.08%), Postives = 447/476 (93.91%), Query Frame = 0

Query: 1   MASLSSML--FFAFCFLFFQSFAGK-VSPDSHRLTVELAGLLPSASCTHRSPQAHISGVR 60
           MASL S++  FFAF FLFFQSFAGK +SPDSH LTVELA L PSASCT RSPQAH S V 
Sbjct: 1   MASLPSIILFFFAFSFLFFQSFAGKLLSPDSHYLTVELADLFPSASCTRRSPQAHTSSVG 60

Query: 61  QQSSLEVIHRHGPCGSEVSKAPTAIEMFVQDQARVDFIHSKFSGELKSIDRLRPSKATKI 120
           +QSSLEVIHRHGPCG +VS APTA EMFVQDQARVDFIHSKF+GEL+S+DRLRPSKATKI
Sbjct: 61  EQSSLEVIHRHGPCGDDVSNAPTAAEMFVQDQARVDFIHSKFAGELESVDRLRPSKATKI 120

Query: 121 PAKSGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQ 180
           PAKSGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQC+PCARYCYNQKDPVFAPSQ
Sbjct: 121 PAKSGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFAPSQ 180

Query: 181 STSYSNISCSSSECSQLESGTGNQPGCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTD 240
           ST+YSNISCSSS+CSQLESGTGNQPGCSAA+ACIYGIQYGDQSFSVGYFA+ETLTLTS D
Sbjct: 181 STTYSNISCSSSDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSND 240

Query: 241 VIDNFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLT 300
           VI+NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIF+YCLPKTSSSTGYLT
Sbjct: 241 VIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPKTSSSTGYLT 300

Query: 301 FGDGXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVI 360
           F   XXXXX LKYTPITKAHGVANFYG+DIVG+KVGGTQIPIS SVFSTSGAIIDSGTVI
Sbjct: 301 F-XXXXXXXXLKYTPITKAHGVANFYGLDIVGIKVGGTQIPISSSVFSTSGAIIDSGTVI 360

Query: 361 TRLPPDAYSALKSAFQKGMAKYPKAPELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDL 420
           TRLPPDAYSALKSAFQKGMA+YPKAPELSILDTCYDLSKYS ++IPKV  +FKG  ELDL
Sbjct: 361 TRLPPDAYSALKSAFQKGMARYPKAPELSILDTCYDLSKYSTIQIPKVGIMFKGQEELDL 420

Query: 421 DSTGILYGAATRQVCLAFAGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
           D TGI+YGA+T QVCLAFAGNQDPSTV IIGNVQQKTLQVVYDVGGGKIGFGYNGC
Sbjct: 421 DGTGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 475

BLAST of Cla97C05G097040 vs. NCBI nr
Match: XP_023535774.1 (aspartyl protease family protein At5g10770-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 836.6 bits (2160), Expect = 4.1e-239
Identity = 422/473 (89.22%), Postives = 436/473 (92.18%), Query Frame = 0

Query: 1   MASLSSMLFFAFCFLFFQSFAGKVSPDSHRLTVELAGLLPSASCTHRSPQAHISGVRQQS 60
           MA LSSMLFFAFCFLFF S AGKVSP +HRLTVELA LLPSA+C  RS QA  SG R QS
Sbjct: 1   MAPLSSMLFFAFCFLFFHSSAGKVSPGAHRLTVELADLLPSATCNRRSVQAQDSGRRTQS 60

Query: 61  SLEVIHRHGPCGSEVSKAPTAIEMFVQDQARVDFIHSKFSGELKSIDRLRPSKATKIPAK 120
           SLEVIHRHGPCG   SKAPTA E+FVQDQ RVDFIHSKFSG+ KS+DRLRPSKATKIPAK
Sbjct: 61  SLEVIHRHGPCGGGESKAPTAAELFVQDQGRVDFIHSKFSGDFKSVDRLRPSKATKIPAK 120

Query: 121 SGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTS 180
           SGATIGSGNYIVNV LGTPKKYLSLIFDTGSDLTWTQCEPCARYCY+QKDPVFAPSQST+
Sbjct: 121 SGATIGSGNYIVNVALGTPKKYLSLIFDTGSDLTWTQCEPCARYCYSQKDPVFAPSQSTT 180

Query: 181 YSNISCSSSECSQLESGTGNQPGCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVID 240
           YSNI+CSS  CSQLESGTGNQPGCS AK+CIYGIQYGDQSFSVGYFA+ETLTLT TDVI 
Sbjct: 181 YSNITCSSPICSQLESGTGNQPGCSTAKSCIYGIQYGDQSFSVGYFAKETLTLTPTDVIS 240

Query: 241 NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLTFGD 300
           NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIF+YCLPKTSSSTGYL FG 
Sbjct: 241 NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPKTSSSTGYLAFGG 300

Query: 301 GXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITRL 360
            XXXXXALKYTPITKAHGVANFYGVDIVGMKVGG QIPIS SVFSTSGAIIDSGTVITRL
Sbjct: 301 SXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGIQIPISASVFSTSGAIIDSGTVITRL 360

Query: 361 PPDAYSALKSAFQKGMAKYPKAPELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDST 420
           PPDAYSALKSAFQKGM KYPKAPELSILDTCYDLSKY+ VEIPKV FLFKGGT L+LD T
Sbjct: 361 PPDAYSALKSAFQKGMTKYPKAPELSILDTCYDLSKYTSVEIPKVDFLFKGGTVLELDGT 420

Query: 421 GILYGAATRQVCLAFAGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
           GILYGA+T QVCLAFAGN DPSTV IIGNVQQKTLQVVYDVGGGKIGFGYNGC
Sbjct: 421 GILYGASTTQVCLAFAGNMDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473

BLAST of Cla97C05G097040 vs. NCBI nr
Match: XP_022977010.1 (aspartyl protease family protein At5g10770-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 835.9 bits (2158), Expect = 6.9e-239
Identity = 422/473 (89.22%), Postives = 437/473 (92.39%), Query Frame = 0

Query: 1   MASLSSMLFFAFCFLFFQSFAGKVSPDSHRLTVELAGLLPSASCTHRSPQAHISGVRQQS 60
           MA LSSML FAFCF FF S AGKVSPD+HRLTVELA LLPSA+C  RS QA ISG R QS
Sbjct: 1   MAPLSSMLLFAFCFFFFHSSAGKVSPDAHRLTVELADLLPSAACNRRSVQAQISGGRTQS 60

Query: 61  SLEVIHRHGPCGSEVSKAPTAIEMFVQDQARVDFIHSKFSGELKSIDRLRPSKATKIPAK 120
           SLEVIHRHGPCG   SKAPTA E+FVQDQARVDFIHSKFSG+ KSIDRLRPSKATKIPAK
Sbjct: 61  SLEVIHRHGPCGDGNSKAPTAAELFVQDQARVDFIHSKFSGDFKSIDRLRPSKATKIPAK 120

Query: 121 SGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTS 180
           SGATIGSGNYIVNV LGTPKKYLSLIFDTGSDLTWTQC PCARYCY+QKDPVFAPSQST+
Sbjct: 121 SGATIGSGNYIVNVALGTPKKYLSLIFDTGSDLTWTQCAPCARYCYSQKDPVFAPSQSTT 180

Query: 181 YSNISCSSSECSQLESGTGNQPGCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVID 240
           YSNI+CSS  CSQLESGTGNQPGCSAAK+CIYGIQYGDQSFSVGYFA+ETLTLT TDVI 
Sbjct: 181 YSNITCSSPICSQLESGTGNQPGCSAAKSCIYGIQYGDQSFSVGYFAKETLTLTPTDVIS 240

Query: 241 NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLTFGD 300
           NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIF+YCLPKTSSSTGYL FG 
Sbjct: 241 NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPKTSSSTGYLAFGG 300

Query: 301 GXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITRL 360
            XXXXXALKYTPITKAHGVANFYGVDIVGMKVGG QIPIS SVFSTSGAIIDSGTVITRL
Sbjct: 301 SXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGIQIPISASVFSTSGAIIDSGTVITRL 360

Query: 361 PPDAYSALKSAFQKGMAKYPKAPELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDST 420
           PPDAYSALKSAFQKGM KYPKAPELSILDTCYDLSKY+ VEIPKV FLFKGGT L+LD T
Sbjct: 361 PPDAYSALKSAFQKGMTKYPKAPELSILDTCYDLSKYTSVEIPKVDFLFKGGTVLELDGT 420

Query: 421 GILYGAATRQVCLAFAGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
           GILYGA+T QVCLAFAGN DPS+V IIGNVQQKT+QVVYDVGGGKIGFGYNGC
Sbjct: 421 GILYGASTTQVCLAFAGNLDPSSVAIIGNVQQKTMQVVYDVGGGKIGFGYNGC 473

BLAST of Cla97C05G097040 vs. NCBI nr
Match: XP_004150866.1 (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus])

HSP 1 Score: 833.6 bits (2152), Expect = 3.4e-238
Identity = 419/474 (88.40%), Postives = 444/474 (93.67%), Query Frame = 0

Query: 1   MASLSS-MLFFAFCFLFFQSFAGKVSPDSHRLTVELAGLLPSASCTHRSPQAHISGVRQQ 60
           MASLSS MLFFAF  LFFQ+FAGK+SPDSH LTV+LAGL PSASCT RSPQ H S + +Q
Sbjct: 1   MASLSSIMLFFAFSSLFFQAFAGKLSPDSHFLTVDLAGLFPSASCTRRSPQVHTSSLGEQ 60

Query: 61  SSLEVIHRHGPCGSEVSKAPTAIEMFVQDQARVDFIHSKFSGELKSIDRLRPSKATKIPA 120
           SSLEVIHRHGPCG EVS APTA EM V+DQ+RVDFIHSK +GEL+S+DRLR SKATKIPA
Sbjct: 61  SSLEVIHRHGPCGDEVSNAPTAAEMLVKDQSRVDFIHSKIAGELESVDRLRGSKATKIPA 120

Query: 121 KSGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQST 180
           KSGATIGSGNYIV+VGLGTPKKYLSLIFDTGSDLTWTQC+PCARYCYNQKDPVF PSQST
Sbjct: 121 KSGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQST 180

Query: 181 SYSNISCSSSECSQLESGTGNQPGCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVI 240
           +YSNISCSS +CSQLESGTGNQPGCSAA+ACIYGIQYGDQSFSVGYFA+ETLTLTSTDVI
Sbjct: 181 TYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSTDVI 240

Query: 241 DNFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLTFG 300
           +NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQ+F+YCLPKTSSSTGYLTF 
Sbjct: 241 ENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFSYCLPKTSSSTGYLTF- 300

Query: 301 DGXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITR 360
             XXXXX LKYTPITKAHGVANFYGVDIVGMKVGGTQIPIS SVFSTSGAIIDSGTVITR
Sbjct: 301 XXXXXXXXLKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSGAIIDSGTVITR 360

Query: 361 LPPDAYSALKSAFQKGMAKYPKAPELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDS 420
           LPPDAYSALKSAF+KGMAKYPKAPELSILDTCYDLSKYS ++IPKV F+FKGG ELDLD 
Sbjct: 361 LPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVFKGGEELDLDG 420

Query: 421 TGILYGAATRQVCLAFAGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
            GI+YGA+T QVCLAFAGNQDPSTV IIGNVQQKTLQVVYDVGGGKIGFGYNGC
Sbjct: 421 IGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473

BLAST of Cla97C05G097040 vs. TrEMBL
Match: tr|A0A1S3BVJ7|A0A1S3BVJ7_CUCME (aspartyl protease family protein At5g10770-like OS=Cucumis melo OX=3656 GN=LOC103493725 PE=3 SV=1)

HSP 1 Score: 838.2 bits (2164), Expect = 9.2e-240
Identity = 424/476 (89.08%), Postives = 447/476 (93.91%), Query Frame = 0

Query: 1   MASLSSML--FFAFCFLFFQSFAGK-VSPDSHRLTVELAGLLPSASCTHRSPQAHISGVR 60
           MASL S++  FFAF FLFFQSFAGK +SPDSH LTVELA L PSASCT RSPQAH S V 
Sbjct: 1   MASLPSIILFFFAFSFLFFQSFAGKLLSPDSHYLTVELADLFPSASCTRRSPQAHTSSVG 60

Query: 61  QQSSLEVIHRHGPCGSEVSKAPTAIEMFVQDQARVDFIHSKFSGELKSIDRLRPSKATKI 120
           +QSSLEVIHRHGPCG +VS APTA EMFVQDQARVDFIHSKF+GEL+S+DRLRPSKATKI
Sbjct: 61  EQSSLEVIHRHGPCGDDVSNAPTAAEMFVQDQARVDFIHSKFAGELESVDRLRPSKATKI 120

Query: 121 PAKSGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQ 180
           PAKSGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQC+PCARYCYNQKDPVFAPSQ
Sbjct: 121 PAKSGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFAPSQ 180

Query: 181 STSYSNISCSSSECSQLESGTGNQPGCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTD 240
           ST+YSNISCSSS+CSQLESGTGNQPGCSAA+ACIYGIQYGDQSFSVGYFA+ETLTLTS D
Sbjct: 181 STTYSNISCSSSDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSND 240

Query: 241 VIDNFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLT 300
           VI+NFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIF+YCLPKTSSSTGYLT
Sbjct: 241 VIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFSYCLPKTSSSTGYLT 300

Query: 301 FGDGXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVI 360
           F   XXXXX LKYTPITKAHGVANFYG+DIVG+KVGGTQIPIS SVFSTSGAIIDSGTVI
Sbjct: 301 F-XXXXXXXXLKYTPITKAHGVANFYGLDIVGIKVGGTQIPISSSVFSTSGAIIDSGTVI 360

Query: 361 TRLPPDAYSALKSAFQKGMAKYPKAPELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDL 420
           TRLPPDAYSALKSAFQKGMA+YPKAPELSILDTCYDLSKYS ++IPKV  +FKG  ELDL
Sbjct: 361 TRLPPDAYSALKSAFQKGMARYPKAPELSILDTCYDLSKYSTIQIPKVGIMFKGQEELDL 420

Query: 421 DSTGILYGAATRQVCLAFAGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
           D TGI+YGA+T QVCLAFAGNQDPSTV IIGNVQQKTLQVVYDVGGGKIGFGYNGC
Sbjct: 421 DGTGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 475

BLAST of Cla97C05G097040 vs. TrEMBL
Match: tr|A0A0A0L0Z8|A0A0A0L0Z8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G651750 PE=3 SV=1)

HSP 1 Score: 711.4 bits (1835), Expect = 1.3e-201
Identity = 353/390 (90.51%), Postives = 373/390 (95.64%), Query Frame = 0

Query: 84  MFVQDQARVDFIHSKFSGELKSIDRLRPSKATKIPAKSGATIGSGNYIVNVGLGTPKKYL 143
           M V+DQ+RVDFIHSK +GEL+S+DRLR SKATKIPAKSGATIGSGNYIV+VGLGTPKKYL
Sbjct: 1   MLVKDQSRVDFIHSKIAGELESVDRLRGSKATKIPAKSGATIGSGNYIVSVGLGTPKKYL 60

Query: 144 SLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLESGTGNQPG 203
           SLIFDTGSDLTWTQC+PCARYCYNQKDPVF PSQST+YSNISCSS +CSQLESGTGNQPG
Sbjct: 61  SLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSPDCSQLESGTGNQPG 120

Query: 204 CSAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFGSAAGLIGL 263
           CSAA+ACIYGIQYGDQSFSVGYFA+ETLTLTSTDVI+NFLFGCGQNNRGLFGSAAGLIGL
Sbjct: 121 CSAARACIYGIQYGDQSFSVGYFAKETLTLTSTDVIENFLFGCGQNNRGLFGSAAGLIGL 180

Query: 264 GQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLTFGDGXXXXXALKYTPITKAHGVANFY 323
           GQDKISIVKQTAQKYGQ+F+YCLPKTSSSTGYLTF   XXXXX LKYTPITKAHGVANFY
Sbjct: 181 GQDKISIVKQTAQKYGQVFSYCLPKTSSSTGYLTF-XXXXXXXXLKYTPITKAHGVANFY 240

Query: 324 GVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITRLPPDAYSALKSAFQKGMAKYPKAP 383
           GVDIVGMKVGGTQIPIS SVFSTSGAIIDSGTVITRLPPDAYSALKSAF+KGMAKYPKAP
Sbjct: 241 GVDIVGMKVGGTQIPISSSVFSTSGAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAP 300

Query: 384 ELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGAATRQVCLAFAGNQDPST 443
           ELSILDTCYDLSKYS ++IPKV F+FKGG ELDLD  GI+YGA+T QVCLAFAGNQDPST
Sbjct: 301 ELSILDTCYDLSKYSTIQIPKVGFVFKGGEELDLDGIGIMYGASTSQVCLAFAGNQDPST 360

Query: 444 VTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
           V IIGNVQQKTLQVVYDVGGGKIGFGYNGC
Sbjct: 361 VAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 389

BLAST of Cla97C05G097040 vs. TrEMBL
Match: tr|A0A2P4KEM6|A0A2P4KEM6_QUESU (Aspartyl protease family protein OS=Quercus suber OX=58331 GN=CFP56_14592 PE=3 SV=1)

HSP 1 Score: 568.9 bits (1465), Expect = 1.0e-158
Identity = 282/449 (62.81%), Postives = 347/449 (77.28%), Query Frame = 0

Query: 28  SHRLTVELAGLLPSASCTHRSPQAHISGVRQQSSLEVIHRHGPCGS---EVSKAPTAIEM 87
           +H  T++++ LLPS +C+     +   G  +++SL V+H+HGPC +   + +KAPT  E+
Sbjct: 44  THMHTLQVSSLLPSTTCS-----SSTKGSDKRASLTVVHKHGPCSTLKLDKAKAPTVDEI 103

Query: 88  FVQDQARVDFIHSKFSGELKSIDRLRPSKATKIPAKSGATIGSGNYIVNVGLGTPKKYLS 147
             QDQARV+ IHS+ S +L S D LR S+ + IPAKSG+TIGSGNYIV VGLGTPKK LS
Sbjct: 104 LEQDQARVNSIHSRLSKKL-SRDDLRASEDSTIPAKSGSTIGSGNYIVTVGLGTPKKDLS 163

Query: 148 LIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLESGTGNQPGC 207
           LIFDTGSDLTWTQC+PCAR CY QK+P F PSQS +YSNISC SS CSQL S TGN PGC
Sbjct: 164 LIFDTGSDLTWTQCKPCARSCYQQKEPTFDPSQSATYSNISCRSSVCSQLTSATGNTPGC 223

Query: 208 SAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFGSAAGLIGLG 267
           S +  CIYGIQYGDQSFSVG+F++E LTLTSTDV +NFLFGCGQNN+GLFG AAGL+GLG
Sbjct: 224 STS-TCIYGIQYGDQSFSVGFFSKEKLTLTSTDVFNNFLFGCGQNNQGLFGGAAGLLGLG 283

Query: 268 QDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLTFGDGXXXXXALKYTPITKAHGVANFYG 327
           +D +S+V+QTA KYG++F+YCLP +SSSTG+LTFG       A+K+TP++K+    +FYG
Sbjct: 284 RDTLSLVEQTASKYGRVFSYCLPSSSSSTGHLTFGKSAGTSSAIKFTPLSKSAQGTSFYG 343

Query: 328 VDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITRLPPDAYSALKSAFQKGMAKYPKAPE 387
           +DIVG+ +GG ++ I  S FS +G IIDSGTVI+RLPP AYSALK+AF++ M KYP    
Sbjct: 344 LDIVGISLGGRKLSIPTSAFSNAGTIIDSGTVISRLPPAAYSALKAAFRQAMKKYPSTGA 403

Query: 388 LSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGAATRQVCLAFAGNQDPSTV 447
           LSILDTCYDLSKY+   IPK+SF F GG  +DLD+ GILY   T QVCLAFAGN D S+V
Sbjct: 404 LSILDTCYDLSKYTTFSIPKISFSFSGGVNVDLDAAGILYAQKTSQVCLAFAGNSDDSSV 463

Query: 448 TIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
            I GNVQQK L VVYDV G ++GFG  GC
Sbjct: 464 GIFGNVQQKRLDVVYDVAGARVGFGPAGC 485

BLAST of Cla97C05G097040 vs. TrEMBL
Match: tr|M5XLM2|M5XLM2_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G397600 PE=3 SV=1)

HSP 1 Score: 559.3 bits (1440), Expect = 8.3e-156
Identity = 274/452 (60.62%), Postives = 344/452 (76.11%), Query Frame = 0

Query: 29  HRLTVELAGLLPSASC-THRSPQAHISGVRQQSSLEVIHRHGPCG---SEVSKAPTAIEM 88
           H  TVE+  LLP+ +C +  S + H+S     S L+V+H+HGPC       SK PT  ++
Sbjct: 40  HAHTVEVNSLLPATTCSSSSSTKGHMSKHASSSVLKVVHKHGPCSRLKKHKSKTPTHAQI 99

Query: 89  FVQDQARVDFIHSKFSG--ELKSIDRLRPSKATKIPAKSGATIGSGNYIVNVGLGTPKKY 148
             QDQARV+ IHS+ +   +LKS+D LR S AT IPA+SG+ +G+GNYIVNVGLG+PKK 
Sbjct: 100 LQQDQARVNSIHSRVNSKKQLKSVDDLRESAATTIPAQSGSVVGAGNYIVNVGLGSPKKQ 159

Query: 149 LSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLESGTGNQP 208
           LSLIFDTGSDLTWTQC PC + CY QK+P+F PS S SY+N+SC+S+ C+QL S TGN P
Sbjct: 160 LSLIFDTGSDLTWTQCRPCVKSCYKQKEPIFDPSLSASYANVSCTSATCTQLGSATGNTP 219

Query: 209 GCSAA-KACIYGIQYGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFGSAAGLI 268
           GC+A+   CIYGIQYGDQSFSVGYF +E L+LT+TDV D FLFGCGQNN+GLFG AAGL+
Sbjct: 220 GCTASTSTCIYGIQYGDQSFSVGYFGKEKLSLTNTDVFDGFLFGCGQNNQGLFGGAAGLL 279

Query: 269 GLGQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLTFGDGXXXXXALKYTPITKAHGVAN 328
           GLG+++IS+V+Q+A+KY + F+YCLP TSSSTGYL+FG G     A+K+T ++      +
Sbjct: 280 GLGRNQISLVEQSAKKYNRFFSYCLPSTSSSTGYLSFGKGGGSSNAVKFTALSTVSQGDS 339

Query: 329 FYGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITRLPPDAYSALKSAFQKGMAKYPK 388
           FYG+++VG+ VGGT++PIS SVFS+SG IIDSGTVITRLPP AYS+LK+AF++ M  YP 
Sbjct: 340 FYGLNVVGINVGGTKLPISASVFSSSGTIIDSGTVITRLPPTAYSSLKAAFRQRMKSYPL 399

Query: 389 APELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGAATRQVCLAFAGNQDP 448
             ELSILDTCYD S +  V  PK+SF+F GG   DLD+TGILY A+  QVCLAFAGN D 
Sbjct: 400 TQELSILDTCYDFSSFKTVSYPKISFVFDGGLTQDLDATGILYVASADQVCLAFAGNGDD 459

Query: 449 STVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
           S + I GNVQQK LQVVYD+ GGK+GF    C
Sbjct: 460 SDIGIFGNVQQKRLQVVYDIAGGKVGFAPAAC 491

BLAST of Cla97C05G097040 vs. TrEMBL
Match: tr|A0A0S3S6Z9|A0A0S3S6Z9_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis OX=157739 GN=Vigan.05G213800 PE=3 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 5.4e-155
Identity = 286/488 (58.61%), Postives = 363/488 (74.39%), Query Frame = 0

Query: 1   MASLSSMLFFA----FCFLFFQSF-------AGKVSPDSHRL-TVELAGLLPSASCTHRS 60
           MA   S LFF+    F F FF S        A +   +S+ L  + L  LLPS+SC+   
Sbjct: 1   MAIQISSLFFSSLTVFIFFFFSSLEKSFAFEATREDTESNNLHLLHLNSLLPSSSCS--- 60

Query: 61  PQAHISGVRQQSSLEVIHRHGPCG--SEVSKAPTAIEMFVQDQARVDFIHSKFSGELKSI 120
             + I G +++ SLEV+H++GPC   ++  +  T  ++   D+ RV +IHS+ S EL   
Sbjct: 61  --SSIKGSKRKGSLEVVHKYGPCSQQNDGERKVTDSDILNLDKERVKYIHSRISKELGGD 120

Query: 121 DRLRPSKATKIPAKSGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCEPCARYCY 180
           D ++   +  +PAKSG+ IGSGNY V VGLGTPK+ LSLIFDTGSDLTWTQC+PCAR CY
Sbjct: 121 DSVKELDSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCQPCARSCY 180

Query: 181 NQKDPVFAPSQSTSYSNISCSSSECSQLESGTGNQPGCSAA-KACIYGIQYGDQSFSVGY 240
            Q+DP+F PS+S +Y+NI+C+S+ C+QL S TGN PGCSA+ KACIYGIQYGDQSFSVGY
Sbjct: 181 EQQDPIFDPSKSRTYANITCTSTLCTQLSSATGNDPGCSASTKACIYGIQYGDQSFSVGY 240

Query: 241 FARETLTLTSTDVIDNFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFAYC 300
           F+RE LT+T+TDVIDNFLFGCGQNN+GLFG +AGLIGLG+  IS V+QTA KY +IF+YC
Sbjct: 241 FSRERLTVTTTDVIDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTASKYNKIFSYC 300

Query: 301 LPKTSSSTGYLTFGDGXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISPSVFS 360
           LP TSSSTG+LTFG G      L YTP +     ++FYG+DIV + V G ++ ++PS+FS
Sbjct: 301 LPSTSSSTGHLTFGGG--AYRKLLYTPFSTISRGSSFYGLDIVSITVSGAKLSLTPSLFS 360

Query: 361 TSGAIIDSGTVITRLPPDAYSALKSAFQKGMAKYPKAPELSILDTCYDLSKYSIVEIPKV 420
           + GAIIDSGTVITRLPP AY+AL+SAF++GM+KYP APELSILDTCYDLS Y ++ IPK+
Sbjct: 361 SGGAIIDSGTVITRLPPTAYAALRSAFRQGMSKYPSAPELSILDTCYDLSAYKVISIPKI 420

Query: 421 SFLFKGGTELDLDSTGILYGAATRQVCLAFAGNQDPSTVTIIGNVQQKTLQVVYDVGGGK 474
           +F+F G   ++L   GILY A+T+QVCLAFA N D S VTI GNVQQ+TL+VVYD+G GK
Sbjct: 421 NFVFGGSVTVELQPQGILYVASTKQVCLAFAANGDDSDVTIFGNVQQRTLEVVYDLGNGK 480

BLAST of Cla97C05G097040 vs. Swiss-Prot
Match: sp|Q8S9J6|ASPA_ARATH (Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At5g10770 PE=2 SV=1)

HSP 1 Score: 498.4 bits (1282), Expect = 8.6e-140
Identity = 255/451 (56.54%), Postives = 326/451 (72.28%), Query Frame = 0

Query: 27  DSHRLTVELAGLLPSASCT-HRSPQAHISGVRQQSSLEVIHRHGPC---GSEVSKAPTAI 86
           DSH  T++++ LLPS+S +   SP+A  +    +SSL V HRHG C    +  + +P  +
Sbjct: 32  DSH--TIQVSSLLPSSSSSCVLSPRASTT----KSSLHVTHRHGTCSRLNNGKATSPDHV 91

Query: 87  EMFVQDQARVDFIHSKFSGELKSIDRLRPSKATKIPAKSGATIGSGNYIVNVGLGTPKKY 146
           E+   DQARV+ IHSK S +L + D +  SK+T +PAK G+T+GSGNYIV VGLGTPK  
Sbjct: 92  EILRLDQARVNSIHSKLSKKL-ATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKND 151

Query: 147 LSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLESGTGNQP 206
           LSLIFDTGSDLTWTQC+PC R CY+QK+P+F PS+STSY N+SCSS+ C  L S TGN  
Sbjct: 152 LSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAG 211

Query: 207 GCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFGSAAGLIG 266
            CSA+  CIYGIQYGDQSFSVG+ A+E  TLT++DV D   FGCG+NN+GLF   AGL+G
Sbjct: 212 SCSASN-CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLG 271

Query: 267 LGQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLTFGDGXXXXXALKYTPITKAHGVANF 326
           LG+DK+S   QTA  Y +IF+YCLP ++S TG+LTFG       ++K+TPI+      +F
Sbjct: 272 LGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFGSA-GISRSVKFTPISTITDGTSF 331

Query: 327 YGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITRLPPDAYSALKSAFQKGMAKYPKA 386
           YG++IV + VGG ++PI  +VFST GA+IDSGTVITRLPP AY+AL+S+F+  M+KYP  
Sbjct: 332 YGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTT 391

Query: 387 PELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGAATRQVCLAFAGNQDPS 446
             +SILDTC+DLS +  V IPKV+F F GG  ++L S GI Y     QVCLAFAGN D S
Sbjct: 392 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDS 451

Query: 447 TVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
              I GNVQQ+TL+VVYD  GG++GF  NGC
Sbjct: 452 NAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 473

BLAST of Cla97C05G097040 vs. Swiss-Prot
Match: sp|Q9LEW3|AED1_ARATH (Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1)

HSP 1 Score: 416.8 bits (1070), Expect = 3.3e-115
Identity = 225/458 (49.13%), Postives = 299/458 (65.28%), Query Frame = 0

Query: 18  QSFAGKVSPDSHRLTVELAGLLPSASCTHRSPQAHISGVRQQSSLEVIHRHGPCGSEVSK 77
           +S +GKV  DS+  T++++ L PS+S    S +A       +SSL V+H HG C    S 
Sbjct: 28  KSDSGKVL-DSY--TIQVSSLFPSSSSCVPSSKAS----NTKSSLRVVHMHGACSHLSSD 87

Query: 78  APT-AIEMFVQDQARVDFIHSKFSGELKSIDRLRPSKATKIPAKSGATIGSGNYIVNVGL 137
           A     E+  +DQARV+ I+SK S    S + +  +K+T++PAKSG T+GSGNYIV +G+
Sbjct: 88  ARVDHDEIIRRDQARVESIYSKLS--KNSANEVSEAKSTELPAKSGITLGSGNYIVTIGI 147

Query: 138 GTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLES 197
           GTPK  LSL+FDTGSDLTWTQCEPC   CY+QK+P F PS S++Y N+SCSS  C   ES
Sbjct: 148 GTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAES 207

Query: 198 GTGNQPGCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFGS 257
                  CSA+  C+Y I YGD+SF+ G+ A+E  TLT++DV+++  FGCG+NN+GLF  
Sbjct: 208 -------CSASN-CVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDG 267

Query: 258 AAGLIGLGQDKISIVKQTAQKYGQIFAYCLPK-TSSSTGYLTFGDGXXXXXALKYTPITK 317
            AGL+GLG  K+S+  QT   Y  IF+YCLP  TS+STG+LTFG       ++K+TPI+ 
Sbjct: 268 VAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSA-GISESVKFTPISS 327

Query: 318 AHGVANFYGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITRLPPDAYSALKSAFQKG 377
                N YG+DI+G+ VG  ++ I+P+ FST GAIIDSGTV TRLP   Y+ L+S F++ 
Sbjct: 328 FPSAFN-YGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEK 387

Query: 378 MAKYPKAPELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGAATRQVCLAF 437
           M+ Y       + DTCYD +    V  P ++F F G T ++LD +GI       QVCLAF
Sbjct: 388 MSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKISQVCLAF 447

Query: 438 AGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
           AGN D     I GNVQQ TL VVYDV GG++GF  NGC
Sbjct: 448 AGNDD--LPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Cla97C05G097040 vs. Swiss-Prot
Match: sp|Q9LNJ3|APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 272.3 bits (695), Expect = 1.0e-71
Identity = 150/363 (41.32%), Postives = 217/363 (59.78%), Query Frame = 0

Query: 121 SGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTS 180
           SG + GSG Y   +G+GTP +Y+ ++ DTGSD+ W QC PC R CY+Q DP+F P +S +
Sbjct: 133 SGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR-CYSQSDPIFDPRKSKT 192

Query: 181 YSNISCSSSECSQLESGTGNQPGCSA-AKACIYGIQYGDQSFSVGYFARETLTLTSTDVI 240
           Y+ I CSS  C +L+S      GC+   K C+Y + YGD SF+VG F+ ETLT     V 
Sbjct: 193 YATIPCSSPHCRRLDSA-----GCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRV- 252

Query: 241 DNFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFAYCLPKTSSST--GYLT 300
                GCG +N GLF  AAGL+GLG+ K+S   QT  ++ Q F+YCL   S+S+    + 
Sbjct: 253 KGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVV 312

Query: 301 FGDGXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGTQIP-ISPSVF-----STSGAII 360
           FG+      A ++TP+     +  FY V ++G+ VGGT++P ++ S+F        G II
Sbjct: 313 FGNAAVSRIA-RFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVII 372

Query: 361 DSGTVITRLPPDAYSALKSAFQKGMAKYPKAPELSILDTCYDLSKYSIVEIPKVSFLFKG 420
           DSGT +TRL   AY A++ AF+ G     +AP+ S+ DTC+DLS  + V++P V   F+ 
Sbjct: 373 DSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR- 432

Query: 421 GTELDLDSTGILYGAATR-QVCLAFAGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGY 474
           G ++ L +T  L    T  + C AFAG      ++IIGN+QQ+  +VVYD+   ++GF  
Sbjct: 433 GADVSLPATNYLIPVDTNGKFCFAFAGTM--GGLSIIGNIQQQGFRVVYDLASSRVGFAP 484

BLAST of Cla97C05G097040 vs. Swiss-Prot
Match: sp|Q9LHE3|ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 264.2 bits (674), Expect = 2.7e-69
Identity = 155/395 (39.24%), Postives = 219/395 (55.44%), Query Frame = 0

Query: 87  QDQARVDFIHSKFSGE-LKSIDRLRPSKATKIPAKSGATIGSGNYIVNVGLGTPKKYLSL 146
           +D  RV  I  + SG+ + S D             SG   GSG Y V +G+G+P +   +
Sbjct: 87  RDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYM 146

Query: 147 IFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLESGTGNQPGCS 206
           + D+GSD+ W QC+PC + CY Q DPVF P++S SY+ +SC SS C ++E+      GC 
Sbjct: 147 VIDSGSDMVWVQCQPC-KLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENS-----GCH 206

Query: 207 AAKACIYGIQYGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFGSAAGLIGLGQ 266
           +   C Y + YGD S++ G  A ETLT   T V+ N   GCG  NRG+F  AAGL+G+G 
Sbjct: 207 SG-GCRYEVMYGDGSYTKGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGG 266

Query: 267 DKISIVKQTAQKYGQIFAYCL-PKTSSSTGYLTFGDGXXXXXALKYTPITKAHGVANFYG 326
             +S V Q + + G  F YCL  + + STG L FG       A  + P+ +     +FY 
Sbjct: 267 GSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREALPVGA-SWVPLVRNPRAPSFYY 326

Query: 327 VDIVGMKVGGTQIPISPSVFSTS-----GAIIDSGTVITRLPPDAYSALKSAFQKGMAKY 386
           V + G+ VGG +IP+   VF  +     G ++D+GT +TRLP  AY A +  F+   A  
Sbjct: 327 VGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANL 386

Query: 387 PKAPELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGA-ATRQVCLAFAGN 446
           P+A  +SI DTCYDLS +  V +P VSF F  G  L L +   L     +   C AFA +
Sbjct: 387 PRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAAS 446

Query: 447 QDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
             P+ ++IIGN+QQ+ +QV +D   G +GFG N C
Sbjct: 447 --PTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of Cla97C05G097040 vs. Swiss-Prot
Match: sp|Q9LS40|ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 251.9 bits (642), Expect = 1.4e-65
Identity = 151/405 (37.28%), Postives = 219/405 (54.07%), Query Frame = 0

Query: 87  QDQARVDFIHSKFSGELKSIDR--LRP---------SKATKIPAKSGATIGSGNYIVNVG 146
           +D +RV  I +K    ++ +DR  L+P         ++    P  SGA+ GSG Y   +G
Sbjct: 108 RDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIG 167

Query: 147 LGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLE 206
           +GTP K + L+ DTGSD+ W QCEPCA  CY Q DPVF P+ S++Y +++CS+ +CS LE
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQCEPCAD-CYQQSDPVFNPTSSSTYKSLTCSAPQCSLLE 227

Query: 207 SGTGNQPGCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFG 266
           +       C + K C+Y + YGD SF+VG  A +T+T  ++  I+N   GCG +N GLF 
Sbjct: 228 TS-----ACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFT 287

Query: 267 SAAGLIGLGQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLTFGDGXXXXXALKYTPITK 326
            AAGL+GLG   +SI   T Q     F+YCL    S        +           P+ +
Sbjct: 288 GAAGLLGLGGGVLSI---TNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLR 347

Query: 327 AHGVANFYGVDIVGMKVGGTQIPISPSVF-----STSGAIIDSGTVITRLPPDAYSALKS 386
              +  FY V + G  VGG ++ +  ++F      + G I+D GT +TRL   AY++L+ 
Sbjct: 348 NKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRD 407

Query: 387 AFQKGMAKYPK-APELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGA-AT 446
           AF K      K +  +S+ DTCYD S  S V++P V+F F GG  LDL +   L     +
Sbjct: 408 AFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDS 467

Query: 447 RQVCLAFAGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
              C AFA     S+++IIGNVQQ+  ++ YD+    IG   N C
Sbjct: 468 GTFCFAFAPTS--SSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of Cla97C05G097040 vs. TAIR10
Match: AT5G10770.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 498.4 bits (1282), Expect = 4.8e-141
Identity = 255/451 (56.54%), Postives = 326/451 (72.28%), Query Frame = 0

Query: 27  DSHRLTVELAGLLPSASCT-HRSPQAHISGVRQQSSLEVIHRHGPC---GSEVSKAPTAI 86
           DSH  T++++ LLPS+S +   SP+A  +    +SSL V HRHG C    +  + +P  +
Sbjct: 32  DSH--TIQVSSLLPSSSSSCVLSPRASTT----KSSLHVTHRHGTCSRLNNGKATSPDHV 91

Query: 87  EMFVQDQARVDFIHSKFSGELKSIDRLRPSKATKIPAKSGATIGSGNYIVNVGLGTPKKY 146
           E+   DQARV+ IHSK S +L + D +  SK+T +PAK G+T+GSGNYIV VGLGTPK  
Sbjct: 92  EILRLDQARVNSIHSKLSKKL-ATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKND 151

Query: 147 LSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLESGTGNQP 206
           LSLIFDTGSDLTWTQC+PC R CY+QK+P+F PS+STSY N+SCSS+ C  L S TGN  
Sbjct: 152 LSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAG 211

Query: 207 GCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFGSAAGLIG 266
            CSA+  CIYGIQYGDQSFSVG+ A+E  TLT++DV D   FGCG+NN+GLF   AGL+G
Sbjct: 212 SCSASN-CIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLG 271

Query: 267 LGQDKISIVKQTAQKYGQIFAYCLPKTSSSTGYLTFGDGXXXXXALKYTPITKAHGVANF 326
           LG+DK+S   QTA  Y +IF+YCLP ++S TG+LTFG       ++K+TPI+      +F
Sbjct: 272 LGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFGSA-GISRSVKFTPISTITDGTSF 331

Query: 327 YGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITRLPPDAYSALKSAFQKGMAKYPKA 386
           YG++IV + VGG ++PI  +VFST GA+IDSGTVITRLPP AY+AL+S+F+  M+KYP  
Sbjct: 332 YGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTT 391

Query: 387 PELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGAATRQVCLAFAGNQDPS 446
             +SILDTC+DLS +  V IPKV+F F GG  ++L S GI Y     QVCLAFAGN D S
Sbjct: 392 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDS 451

Query: 447 TVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
              I GNVQQ+TL+VVYD  GG++GF  NGC
Sbjct: 452 NAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 473

BLAST of Cla97C05G097040 vs. TAIR10
Match: AT5G10760.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 416.8 bits (1070), Expect = 1.8e-116
Identity = 225/458 (49.13%), Postives = 299/458 (65.28%), Query Frame = 0

Query: 18  QSFAGKVSPDSHRLTVELAGLLPSASCTHRSPQAHISGVRQQSSLEVIHRHGPCGSEVSK 77
           +S +GKV  DS+  T++++ L PS+S    S +A       +SSL V+H HG C    S 
Sbjct: 28  KSDSGKVL-DSY--TIQVSSLFPSSSSCVPSSKAS----NTKSSLRVVHMHGACSHLSSD 87

Query: 78  APT-AIEMFVQDQARVDFIHSKFSGELKSIDRLRPSKATKIPAKSGATIGSGNYIVNVGL 137
           A     E+  +DQARV+ I+SK S    S + +  +K+T++PAKSG T+GSGNYIV +G+
Sbjct: 88  ARVDHDEIIRRDQARVESIYSKLS--KNSANEVSEAKSTELPAKSGITLGSGNYIVTIGI 147

Query: 138 GTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLES 197
           GTPK  LSL+FDTGSDLTWTQCEPC   CY+QK+P F PS S++Y N+SCSS  C   ES
Sbjct: 148 GTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAES 207

Query: 198 GTGNQPGCSAAKACIYGIQYGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFGS 257
                  CSA+  C+Y I YGD+SF+ G+ A+E  TLT++DV+++  FGCG+NN+GLF  
Sbjct: 208 -------CSASN-CVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDG 267

Query: 258 AAGLIGLGQDKISIVKQTAQKYGQIFAYCLPK-TSSSTGYLTFGDGXXXXXALKYTPITK 317
            AGL+GLG  K+S+  QT   Y  IF+YCLP  TS+STG+LTFG       ++K+TPI+ 
Sbjct: 268 VAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSA-GISESVKFTPISS 327

Query: 318 AHGVANFYGVDIVGMKVGGTQIPISPSVFSTSGAIIDSGTVITRLPPDAYSALKSAFQKG 377
                N YG+DI+G+ VG  ++ I+P+ FST GAIIDSGTV TRLP   Y+ L+S F++ 
Sbjct: 328 FPSAFN-YGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEK 387

Query: 378 MAKYPKAPELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGAATRQVCLAF 437
           M+ Y       + DTCYD +    V  P ++F F G T ++LD +GI       QVCLAF
Sbjct: 388 MSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKISQVCLAF 447

Query: 438 AGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
           AGN D     I GNVQQ TL VVYDV GG++GF  NGC
Sbjct: 448 AGNDD--LPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Cla97C05G097040 vs. TAIR10
Match: AT1G79720.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 297.0 bits (759), Expect = 2.1e-80
Identity = 173/444 (38.96%), Postives = 247/444 (55.63%), Query Frame = 0

Query: 41  SASCTHRSPQAHISGVRQQSSLEVIHRHGPCGSEVSKAPTAIEMFVQDQARVDFIHSKFS 100
           S SC  RS    +   R+ ++LE+ HR    G  +          V D  RV  +  K  
Sbjct: 51  STSCFSRS----LGKGRESTTLEMKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIK 110

Query: 101 GELKSIDRLRPSKATKIPAKSGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCEP 160
               S      S+ T+IP  SG  + S NYIV V LG   K +SLI DTGSDLTW QC+P
Sbjct: 111 AMTSSTTEQSVSE-TQIPLTSGIKLESLNYIVTVELG--GKNMSLIVDTGSDLTWVQCQP 170

Query: 161 CARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLESGTGNQPGCSAAKA-----CIYGIQ 220
           C R CYNQ+ P++ PS S+SY  + C+SS C  L + T N   C          C Y + 
Sbjct: 171 C-RSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVS 230

Query: 221 YGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTA 280
           YGD S++ G  A E++ L  T  ++NF+FGCG+NN+GLFG ++GL+GLG+  +S+V QT 
Sbjct: 231 YGDGSYTRGDLASESILLGDTK-LENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTL 290

Query: 281 QKYGQIFAYCLPK-TSSSTGYLTFGDG---XXXXXALKYTPITKAHGVANFYGVDIVGMK 340
           + +  +F+YCLP     ++G L+FG+         ++ YTP+ +   + +FY +++ G  
Sbjct: 291 KTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGAS 350

Query: 341 VGGTQIPISPSVFSTSGAIIDSGTVITRLPPDAYSALKSAFQKGMAKYPKAPELSILDTC 400
           +GG ++  S       G +IDSGTVITRLPP  Y A+K  F K  + +P AP  SILDTC
Sbjct: 351 IGGVELKSSS---FGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTC 410

Query: 401 YDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGAA--TRQVCLAFAGNQDPSTVTIIGN 460
           ++L+ Y  + IP +  +F+G  EL++D TG+ Y        VCLA A     + V IIGN
Sbjct: 411 FNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGN 470

Query: 461 VQQKTLQVVYDVGGGKIGFGYNGC 474
            QQK  +V+YD    ++G     C
Sbjct: 471 YQQKNQRVIYDTTQERLGIVGENC 482

BLAST of Cla97C05G097040 vs. TAIR10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 272.3 bits (695), Expect = 5.5e-73
Identity = 150/363 (41.32%), Postives = 217/363 (59.78%), Query Frame = 0

Query: 121 SGATIGSGNYIVNVGLGTPKKYLSLIFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTS 180
           SG + GSG Y   +G+GTP +Y+ ++ DTGSD+ W QC PC R CY+Q DP+F P +S +
Sbjct: 133 SGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR-CYSQSDPIFDPRKSKT 192

Query: 181 YSNISCSSSECSQLESGTGNQPGCSA-AKACIYGIQYGDQSFSVGYFARETLTLTSTDVI 240
           Y+ I CSS  C +L+S      GC+   K C+Y + YGD SF+VG F+ ETLT     V 
Sbjct: 193 YATIPCSSPHCRRLDSA-----GCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRV- 252

Query: 241 DNFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQIFAYCLPKTSSST--GYLT 300
                GCG +N GLF  AAGL+GLG+ K+S   QT  ++ Q F+YCL   S+S+    + 
Sbjct: 253 KGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVV 312

Query: 301 FGDGXXXXXALKYTPITKAHGVANFYGVDIVGMKVGGTQIP-ISPSVF-----STSGAII 360
           FG+      A ++TP+     +  FY V ++G+ VGGT++P ++ S+F        G II
Sbjct: 313 FGNAAVSRIA-RFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVII 372

Query: 361 DSGTVITRLPPDAYSALKSAFQKGMAKYPKAPELSILDTCYDLSKYSIVEIPKVSFLFKG 420
           DSGT +TRL   AY A++ AF+ G     +AP+ S+ DTC+DLS  + V++P V   F+ 
Sbjct: 373 DSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR- 432

Query: 421 GTELDLDSTGILYGAATR-QVCLAFAGNQDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGY 474
           G ++ L +T  L    T  + C AFAG      ++IIGN+QQ+  +VVYD+   ++GF  
Sbjct: 433 GADVSLPATNYLIPVDTNGKFCFAFAGTM--GGLSIIGNIQQQGFRVVYDLASSRVGFAP 484

BLAST of Cla97C05G097040 vs. TAIR10
Match: AT3G20015.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 264.2 bits (674), Expect = 1.5e-70
Identity = 155/395 (39.24%), Postives = 219/395 (55.44%), Query Frame = 0

Query: 87  QDQARVDFIHSKFSGE-LKSIDRLRPSKATKIPAKSGATIGSGNYIVNVGLGTPKKYLSL 146
           +D  RV  I  + SG+ + S D             SG   GSG Y V +G+G+P +   +
Sbjct: 87  RDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYM 146

Query: 147 IFDTGSDLTWTQCEPCARYCYNQKDPVFAPSQSTSYSNISCSSSECSQLESGTGNQPGCS 206
           + D+GSD+ W QC+PC + CY Q DPVF P++S SY+ +SC SS C ++E+      GC 
Sbjct: 147 VIDSGSDMVWVQCQPC-KLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENS-----GCH 206

Query: 207 AAKACIYGIQYGDQSFSVGYFARETLTLTSTDVIDNFLFGCGQNNRGLFGSAAGLIGLGQ 266
           +   C Y + YGD S++ G  A ETLT   T V+ N   GCG  NRG+F  AAGL+G+G 
Sbjct: 207 SG-GCRYEVMYGDGSYTKGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGG 266

Query: 267 DKISIVKQTAQKYGQIFAYCL-PKTSSSTGYLTFGDGXXXXXALKYTPITKAHGVANFYG 326
             +S V Q + + G  F YCL  + + STG L FG       A  + P+ +     +FY 
Sbjct: 267 GSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREALPVGA-SWVPLVRNPRAPSFYY 326

Query: 327 VDIVGMKVGGTQIPISPSVFSTS-----GAIIDSGTVITRLPPDAYSALKSAFQKGMAKY 386
           V + G+ VGG +IP+   VF  +     G ++D+GT +TRLP  AY A +  F+   A  
Sbjct: 327 VGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANL 386

Query: 387 PKAPELSILDTCYDLSKYSIVEIPKVSFLFKGGTELDLDSTGILYGA-ATRQVCLAFAGN 446
           P+A  +SI DTCYDLS +  V +P VSF F  G  L L +   L     +   C AFA +
Sbjct: 387 PRASGVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAAS 446

Query: 447 QDPSTVTIIGNVQQKTLQVVYDVGGGKIGFGYNGC 474
             P+ ++IIGN+QQ+ +QV +D   G +GFG N C
Sbjct: 447 --PTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022936652.12.2e-24089.85aspartyl protease family protein At5g10770-like isoform X1 [Cucurbita moschata][more]
XP_008452815.11.4e-23989.08PREDICTED: aspartyl protease family protein At5g10770-like [Cucumis melo][more]
XP_023535774.14.1e-23989.22aspartyl protease family protein At5g10770-like [Cucurbita pepo subsp. pepo][more]
XP_022977010.16.9e-23989.22aspartyl protease family protein At5g10770-like isoform X1 [Cucurbita maxima][more]
XP_004150866.13.4e-23888.40PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus][more]
Match NameE-valueIdentityDescription
tr|A0A1S3BVJ7|A0A1S3BVJ7_CUCME9.2e-24089.08aspartyl protease family protein At5g10770-like OS=Cucumis melo OX=3656 GN=LOC10... [more]
tr|A0A0A0L0Z8|A0A0A0L0Z8_CUCSA1.3e-20190.51Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G651750 PE=3 SV=1[more]
tr|A0A2P4KEM6|A0A2P4KEM6_QUESU1.0e-15862.81Aspartyl protease family protein OS=Quercus suber OX=58331 GN=CFP56_14592 PE=3 S... [more]
tr|M5XLM2|M5XLM2_PRUPE8.3e-15660.62Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G397600 PE=3 SV=1[more]
tr|A0A0S3S6Z9|A0A0S3S6Z9_PHAAN5.4e-15558.61Uncharacterized protein OS=Vigna angularis var. angularis OX=157739 GN=Vigan.05G... [more]
Match NameE-valueIdentityDescription
sp|Q8S9J6|ASPA_ARATH8.6e-14056.54Aspartyl protease family protein At5g10770 OS=Arabidopsis thaliana OX=3702 GN=At... [more]
sp|Q9LEW3|AED1_ARATH3.3e-11549.13Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1[more]
sp|Q9LNJ3|APF2_ARATH1.0e-7141.32Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
sp|Q9LHE3|ASPG2_ARATH2.7e-6939.24Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
sp|Q9LS40|ASPG1_ARATH1.4e-6537.28Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Match NameE-valueIdentityDescription
AT5G10770.14.8e-14156.54Eukaryotic aspartyl protease family protein[more]
AT5G10760.11.8e-11649.13Eukaryotic aspartyl protease family protein[more]
AT1G79720.12.1e-8038.96Eukaryotic aspartyl protease family protein[more]
AT1G01300.15.5e-7341.32Eukaryotic aspartyl protease family protein[more]
AT3G20015.11.5e-7039.24Eukaryotic aspartyl protease family protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: INTERPRO
TermDefinition
IPR033873CND41-like
IPR033121PEPTIDASE_A1
IPR001969Aspartic_peptidase_AS
IPR021109Peptidase_aspartic_dom_sf
IPR032799TAXi_C
IPR032861TAXi_N
IPR001461Aspartic_peptidase_A1
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0030163 protein catabolic process
biological_process GO:0050896 response to stimulus
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0008233 peptidase activity
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C05G097040.1Cla97C05G097040.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 136..156
score: 42.92
coord: 445..460
score: 29.78
coord: 349..360
score: 40.45
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 42..473
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 130..300
e-value: 1.0E-52
score: 178.9
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 323..468
e-value: 3.4E-29
score: 101.6
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 305..473
e-value: 1.3E-43
score: 150.8
IPR021109Aspartic peptidase domain superfamilyGENE3DG3DSA:2.40.70.10coord: 106..300
e-value: 7.8E-51
score: 174.9
IPR021109Aspartic peptidase domain superfamilySUPERFAMILYSSF50630Acid proteasescoord: 124..473
NoneNo IPR availablePANTHERPTHR13683:SF393SUBFAMILY NOT NAMEDcoord: 42..473
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 145..156
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 130..469
score: 43.939
IPR033873CND41-likeCDDcd05472cnd41_likecoord: 129..473
e-value: 1.67768E-136
score: 397.412

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C05G097040Silver-seed gourdcarwmbB0808
Cla97C05G097040Cucurbita maxima (Rimu)cmawmbB027
Cla97C05G097040Cucurbita maxima (Rimu)cmawmbB480
Cla97C05G097040Cucurbita moschata (Rifu)cmowmbB462