Cla97C02G026270.1 (mRNA) Watermelon (97103) v2

Name: Cla97C02G026270.1
Type: mRNA
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Aspartic proteinase Asp1
Location: Cla97Chr02 : 83028 .. 86064 (+)
Sequence length: 1290
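The coordinates and length in this record can be cross-checked with a little arithmetic. The sketch below assumes the location uses 1-based inclusive coordinates and that the 1290-nt sequence length is the spliced coding sequence including its stop codon. Both are assumptions for illustration; the record does not state them. All variable names are hypothetical.

```python
# Assumption: 1-based, inclusive genomic coordinates on Cla97Chr02.
start, end = 83028, 86064
genomic_span = end - start + 1  # bases covered on the chromosome, introns included

# Assumption: the reported sequence length is the CDS, stop codon included.
cds_length = 1290
codons, remainder = divmod(cds_length, 3)

# One codon is the stop codon and is not translated into a residue.
protein_length = codons - 1

print(f"genomic span: {genomic_span} bp")
print(f"codons: {codons} (remainder {remainder})")
print(f"expected protein length: {protein_length} aa")
```

Under these assumptions, the difference between the genomic span and the 1290-nt sequence is the total intron length.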
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGAAAAGGGTATTGATGATACTGGTTCTGATGGTGGCCTCCATGAGCTGTTTGGCTCTCTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGGAGGCCAATTCTGTCGGTTCCGGCCGCATCTTCTTCGTTTGCTTCATCCTCCATCGTGTTGCCTCTTCAAGGGAACGTCTTCCCAAATGGGTAAGCCATAACTCTGCTCAAAGATACGACGAAGGTTTAGCTTTTGTTTTTCCCATTTCGATTTTACTCTACTGTTACAGAGTATGAAGTGGCTGAAGAGGAAGTTCAATGTGAATTTGATGACTCCTAATATACTTTTCACATTGATAGTATTGTAATAATCGCAGTAGGCCTTCCCCATTCTTAATCTTATTGTATCTTCTCTCACGTGTTAAATTGGAGAAAAGATGATTACGAAACTCTGTTCTCTCATGTTTGCTTCTCTCGCGTGTTCTATTCCTGTGCAATAGCTTTGGAGGAGTGTATTTATGACTGTTAGCAAAATAACACATCCCCAAATTAAGTACTTTTAAAATCTTCACACCCATTTTACTGAAGTTTTTAATTAGTTACAAACGGTTTGAAAATAAAATATATATTTCTTTGTTTTGTTCGAGGCTAACTTATTTACTGTCAGATCCTAAATACTGATTTCCATCAATATGTTTTGAGGTCATTTATTCCGTGTCTTTTTTCAGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCAGACACTGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAACAGTGCACTGAGGTAAATTTTTATTTGATGCTAATTTGGAGGTTCTTGTAAATATGATGACCCAGATGATAGAAGATACTTACTAAACTACAGCTTTGCCTTCAGTTAAGTTAGTTTTGATCATAAGTAATTCAATATTTCATTTTTGAATAGTCTATGTTCTATGTTCCTCCTCTATTTCTTGATGTGCTTAGTGCTTCTTTTTTCCCTCGGTCTCTCACTACATTAAATAAACGTTGCATTTCTTTTGGGGTTGAGAGTTAAGGTTGATTGCTTCAAATTGATGGCCTTGTTTCCTTTTCAGACACTTCATCCGCTCTATCAACCAAGCAACGATCTTGTGCCGTGTAAGGACCCTCTGTGTATGTCCTTGCACTCATCTATGGACCACAGATGTGAGAACCCAGATCAATGTGACTACGAGGTTGAGTATGCAGATGGTGGTTCGTCCCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCCCGTTTGGCCCTTGGGTAAACTGCTCGACTGCCTTCAGTAATTCATCTTAATTCAGTGTACTGCTGAGTTCCTAACTCTCTAAATCAACCTGGCTACTTCCGATTACAACCACACATGAAAGTTTTGAGAATGAAACATCATATCACGCTGTCATATCTTATGTTTTGCACTTTCCTGATATTGCAAAGTAATGATTAGCATGAATGTAAAGATGGAGTTTATGTTGAAAAACAAACGTGTTCATTCATGCTGACGGGTTACTTCAAGGAAACAATTAAGATAGCAACGGGCACTAGATTTTGTGGACATAATGATTCGGATTTGTATTTTTCATAGACATCAATAAAGTTTATCTTGTACAAGAATTTCATGCATTTCTAGTCCTTACAAACCGTTGATGTCAGCTCATGTGGTCAAATAGCATGCATGTAAAGTCAGCATGTAAGCTAGCTATTTGAATCTATAGTACTGAGTTTGGTTGGTTCTGTTTGATTACTCAGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATTCTTGGTCTTGGAAGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATGTCGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTT
CTTTGGGGATGGCATTTATGATCCCTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGTAAACAACCTGCCTTGATCATATGTATATATATATCTTGTTATGATTCATGTTAGCTTCTGTTTTAATGAAACCTGCCTCACGTGATTGTATCAGGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGATCTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGGTAAATCATCTTTAAAAATAACTGTTGACTATTCTGGGAAAAATAGATCTTTCTGGCTCTTGAAAACTGATGGATAATACACAATGATTATTCAATTCATGTTGGCACTGTTCTTGTAGTTGAATAGAGAACTAGCTGGAAAACCGCTAAGAGAAGCCATGGACGACGACACACTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAAGCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGGTAAAAGCTCCATGCCTTCAACTCGAACTGCTATTCACTAAACATTTTCTCTTTGGATTCCTAAGTTGAATAATTTCACTTATAAGCTATAATTTTGGAATGGCAGTCCATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAACTTCGAATATCATTGGTGGTACGTTATTGCATGCATTGCATGTTAATGTTTTCTCGCAATTGCAAATACCAATGCGTTTAAAACCATGCAAAATTATTCTTTTATTAGAGAGAAAAAAAGCCCCATTTCTGTATGATTTTGTTGACAGATATATCAATGCAAGATAAGATGGTAGTATACAACAACGAGAAGCAAGCAATTGGATGGGCTACGGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAA

mRNA sequence

ATGGGGAAAAGGGTATTGATGATACTGGTTCTGATGGTGGCCTCCATGAGCTGTTTGGCTCTCTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGGAGGCCAATTCTGTCGGTTCCGGCCGCATCTTCTTCGTTTGCTTCATCCTCCATCGTGTTGCCTCTTCAAGGGAACGTCTTCCCAAATGGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCAGACACTGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAACAGTGCACTGAGACACTTCATCCGCTCTATCAACCAAGCAACGATCTTGTGCCGTGTAAGGACCCTCTGTGTATGTCCTTGCACTCATCTATGGACCACAGATGTGAGAACCCAGATCAATGTGACTACGAGGTTGAGTATGCAGATGGTGGTTCGTCCCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCCCGTTTGGCCCTTGGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATTCTTGGTCTTGGAAGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATGTCGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGGCATTTATGATCCCTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGATCTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGTTGAATAGAGAACTAGCTGGAAAACCGCTAAGAGAAGCCATGGACGACGACACACTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAAGCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGTCCATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAACTTCGAATATCATTGGTGATATATCAATGCAAGATAAGATGGTAGTATACAACAACGAGAAGCAAGCAATTGGATGGGCTACGGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAA

Coding sequence (CDS)

ATGGGGAAAAGGGTATTGATGATACTGGTTCTGATGGTGGCCTCCATGAGCTGTTTGGCTCTCTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGGAGGCCAATTCTGTCGGTTCCGGCCGCATCTTCTTCGTTTGCTTCATCCTCCATCGTGTTGCCTCTTCAAGGGAACGTCTTCCCAAATGGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCAGACACTGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAACAGTGCACTGAGACACTTCATCCGCTCTATCAACCAAGCAACGATCTTGTGCCGTGTAAGGACCCTCTGTGTATGTCCTTGCACTCATCTATGGACCACAGATGTGAGAACCCAGATCAATGTGACTACGAGGTTGAGTATGCAGATGGTGGTTCGTCCCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCCCGTTTGGCCCTTGGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATTCTTGGTCTTGGAAGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATGTCGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGGCATTTATGATCCCTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGATCTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGTTGAATAGAGAACTAGCTGGAAAACCGCTAAGAGAAGCCATGGACGACGACACACTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAAGCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGTCCATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAACTTCGAATATCATTGGTGATATATCAATGCAAGATAAGATGGTAGTATACAACAACGAGAAGCAAGCAATTGGATGGGCTACGGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAA

Protein sequence

MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGSM
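The protein sequence follows from the CDS by codon-wise translation. As a minimal sketch, assuming the standard nuclear genetic code, the helper below (`translate` is a hypothetical name, not part of any database tooling) translates a DNA string and stops at the first stop codon. It is demonstrated on the first six and last four codons of the CDS listed above.

```python
# Standard genetic code, built in TCAG order (assumed applicable to this nuclear gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO_ACIDS
    )
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)

# First six codons of the CDS above.
print(translate("ATGGGGAAAAGGGTATTG"))
# Last four codons of the CDS above, ending in the TAA stop codon.
print(translate("GGTAGCATGTAA"))
```

The two fragments translate to the start (`MGKRVL`) and end (`GSM`) of the protein sequence shown above.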
BLAST of Cla97C02G026270.1 vs. NCBI nr
Match: XP_004147327.2 (PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis sativus])

HSP 1 Score: 869.4 bits (2245), Expect = 5.1e-249
Identity = 414/428 (96.73%), Positives = 422/428 (98.60%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGKRVL++LVLMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 1   MGKRVLVVLVLMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWRGRKP KSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P EGY+IISSMGNVCLGILNGTDVGLE SNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 421 VPKSQVSS 428
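The percentages in each HSP summary line are the matched columns divided by the alignment length. The sketch below recomputes them from the summary line of the first hit. The regular expression and the function name `hsp_percentages` are assumptions made for illustration, not part of BLAST, and the pattern accepts both the "Postives" and "Positives" spellings.

```python
import re

# Matches "Identity = a/b ... Positives = c/d"; "Posi?tives" also accepts "Postives".
HSP_RE = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from an HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )

summary = "Identity = 414/428 (96.73%), Positives = 422/428 (98.60%), Query Frame = 0"
print(hsp_percentages(summary))
```

For the first hit this reproduces the reported 96.73% identity and 98.60% positives.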

BLAST of Cla97C02G026270.1 vs. NCBI nr
Match: XP_011649449.1 (PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis sativus])

HSP 1 Score: 863.6 bits (2230), Expect = 2.8e-247
Identity = 414/432 (95.83%), Positives = 422/432 (97.69%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGKRVL++LVLMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 1   MGKRVLVVLVLMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 ----CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFF 240
               CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFF
Sbjct: 181 CQLICGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFF 240

Query: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300
           GDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAY
Sbjct: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300

Query: 301 QVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKA 360
           QVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKP KSLRDVRKYFKPLALSFSSGGRSKA
Sbjct: 301 QVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKA 360

Query: 361 VFEIPMEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATA 420
           VFEIP EGY+IISSMGNVCLGILNGTDVGLE SNIIGDISMQDKMVVYNNEKQAIGWATA
Sbjct: 361 VFEIPTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA 420

Query: 421 NCDRVPKSRVGS 429
           NCDRVPKS+V S
Sbjct: 421 NCDRVPKSQVSS 432

BLAST of Cla97C02G026270.1 vs. NCBI nr
Match: XP_008460823.1 (PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo])

HSP 1 Score: 861.3 bits (2224), Expect = 1.4e-246
Identity = 410/428 (95.79%), Positives = 420/428 (98.13%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P+EGY+IISSMGNVCLGILNGTDVGLE SNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 421 VPKSQVSS 428

BLAST of Cla97C02G026270.1 vs. NCBI nr
Match: XP_008460822.1 (PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis melo])

HSP 1 Score: 855.5 bits (2209), Expect = 7.7e-245
Identity = 410/432 (94.91%), Positives = 420/432 (97.22%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 ----CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFF 240
               CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFF
Sbjct: 181 CQLICGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFF 240

Query: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300
           GDGIYDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAY
Sbjct: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300

Query: 301 QVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKA 360
           QVLTSLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGRSKA
Sbjct: 301 QVLTSLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKA 360

Query: 361 VFEIPMEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATA 420
           VFEIP+EGY+IISSMGNVCLGILNGTDVGLE SNIIGDISMQDKMVVYNNEKQAIGWATA
Sbjct: 361 VFEIPIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA 420

Query: 421 NCDRVPKSRVGS 429
           NCDRVPKS+V S
Sbjct: 421 NCDRVPKSQVSS 432

BLAST of Cla97C02G026270.1 vs. NCBI nr
Match: XP_022157721.1 (aspartic proteinase Asp1 isoform X2 [Momordica charantia])

HSP 1 Score: 823.2 bits (2125), Expect = 4.2e-235
Identity = 389/428 (90.89%), Positives = 410/428 (95.79%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MG  +L ILVLMVASM+CLA  SASSFFKDKPWERR+PILSV A SSSFASSSIVLPLQG
Sbjct: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLY+PS+DLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSS+DHRCENPDQCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQ PGSS YHPMDGILGLG+GAVS+VSQLHNQGI+RNV+GHCFSS+GGGYLFFGD I
Sbjct: 181 CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YD +R+VWTPMSRDYPKHYSPG GELIFNGRSTGLRNLF VFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           PMEGYLI+SSMGNVCLGILNGT+VGL+ SNIIGDISM DK+V+YNNEKQAIGWATANCDR
Sbjct: 361 PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKSR  +
Sbjct: 421 VPKSRAAA 428

BLAST of Cla97C02G026270.1 vs. TrEMBL
Match: tr|A0A1S3CDB2|A0A1S3CDB2_CUCME (aspartic proteinase Asp1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=3 SV=1)

HSP 1 Score: 861.3 bits (2224), Expect = 9.2e-247
Identity = 410/428 (95.79%), Positives = 420/428 (98.13%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P+EGY+IISSMGNVCLGILNGTDVGLE SNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 421 VPKSQVSS 428

BLAST of Cla97C02G026270.1 vs. TrEMBL
Match: tr|A0A1S3CDB4|A0A1S3CDB4_CUCME (aspartic proteinase Asp1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=3 SV=1)

HSP 1 Score: 855.5 bits (2209), Expect = 5.1e-245
Identity = 410/432 (94.91%), Positives = 420/432 (97.22%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 ----CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFF 240
               CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFF
Sbjct: 181 CQLICGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFF 240

Query: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300
           GDGIYDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAY
Sbjct: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300

Query: 301 QVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKA 360
           QVLTSLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGRSKA
Sbjct: 301 QVLTSLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKA 360

Query: 361 VFEIPMEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATA 420
           VFEIP+EGY+IISSMGNVCLGILNGTDVGLE SNIIGDISMQDKMVVYNNEKQAIGWATA
Sbjct: 361 VFEIPIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA 420

Query: 421 NCDRVPKSRVGS 429
           NCDRVPKS+V S
Sbjct: 421 NCDRVPKSQVSS 432

BLAST of Cla97C02G026270.1 vs. TrEMBL
Match: tr|A0A0A0LKB0|A0A0A0LKB0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G338820 PE=3 SV=1)

HSP 1 Score: 733.8 bits (1893), Expect = 2.2e-208
Identity = 344/352 (97.73%), Positives = 348/352 (98.86%), Query Frame = 0

Query: 77  PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRC 136
           PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRC
Sbjct: 35  PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRC 94

Query: 137 ENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD 196
           ENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD
Sbjct: 95  ENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD 154

Query: 197 GILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPYRLVWTPMSRDYP 256
           GILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGIYDPYRLVWTPMSRDYP
Sbjct: 155 GILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVWTPMSRDYP 214

Query: 257 KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAM 316
           KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAM
Sbjct: 215 KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAM 274

Query: 317 DDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCL 376
           DDDTLPLCWRGRKP KSLRDVRKYFKPLALSFSSGGRSKAVFEIP EGY+IISSMGNVCL
Sbjct: 275 DDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPTEGYMIISSMGNVCL 334

Query: 377 GILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGS 429
           GILNGTDVGLE SNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKS+V S
Sbjct: 335 GILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQVSS 386

BLAST of Cla97C02G026270.1 vs. TrEMBL
Match: tr|A0A1S3CDR6|A0A1S3CDR6_CUCME (aspartic proteinase Asp1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=3 SV=1)

HSP 1 Score: 724.5 bits (1869), Expect = 1.3e-205
Identity = 342/356 (96.07%), Positives = 348/356 (97.75%), Query Frame = 0

Query: 77  PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRC 136
           PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRC
Sbjct: 35  PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRC 94

Query: 137 ENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG----CGYDQDPGSSSY 196
           ENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG    CGYDQDPGSSSY
Sbjct: 95  ENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCQLICGYDQDPGSSSY 154

Query: 197 HPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPYRLVWTPMS 256
           HPMDGILGLGRGAVS+VSQLHNQGIVRNVVGHCF+SKGGGYLFFGDGIYDPYRLVWTPMS
Sbjct: 155 HPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVWTPMS 214

Query: 257 RDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPL 316
           RDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPL
Sbjct: 215 RDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPL 274

Query: 317 REAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMG 376
           REAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGRSKAVFEIP+EGY+IISSMG
Sbjct: 275 REAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPIEGYMIISSMG 334

Query: 377 NVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGS 429
           NVCLGILNGTDVGLE SNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKS+V S
Sbjct: 335 NVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQVSS 390

BLAST of Cla97C02G026270.1 vs. TrEMBL
Match: tr|M5WHY2|M5WHY2_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_4G279700 PE=3 SV=1)

HSP 1 Score: 600.9 bits (1548), Expect = 2.2e-168
Identity = 285/427 (66.74%), Positives = 339/427 (79.39%), Query Frame = 0

Query: 2   GKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASS---SFASSSIVLPL 61
           GK   ++L++ +  M   A  S++SF       RR+ +L   A SS   + A+SSIVLP+
Sbjct: 5   GKSGWLLLLMSLLVMGLSATMSSASFGDQYHRGRRKTMLPDEATSSLGLNRAASSIVLPV 64

Query: 62  QGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLV 121
            GNV+P G YNVTL +GQPPKPYFLDPDTGSDLTWLQCDAPC +CTE  HP Y+P+NDLV
Sbjct: 65  HGNVYPIGSYNVTLNIGQPPKPYFLDPDTGSDLTWLQCDAPCVRCTEAPHPFYRPNNDLV 124

Query: 122 PCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLA 181
            CKDPLC +LH+   H+C+NP+QCDYEVEYADGGSSLGVLVRD F LN TNG+     LA
Sbjct: 125 VCKDPLCEALHAPGSHKCDNPEQCDYEVEYADGGSSLGVLVRDAFLLNFTNGNQRTTHLA 184

Query: 182 LGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGD 241
           LGCGYDQ PG SSYHP+DG+LGLG+G  S+VSQL NQG+VR+V+GHC S +GGG+ F GD
Sbjct: 185 LGCGYDQLPG-SSYHPIDGVLGLGKGKSSIVSQLSNQGLVRHVIGHCLSGRGGGFFFLGD 244

Query: 242 GIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQV 301
           G+YD  R+VWTPMS DY KHYSPG  ELI  G+STG RNL +VFDSGSSYTY N+QAYQ 
Sbjct: 245 GLYDSSRIVWTPMSPDYAKHYSPGLAELIVGGKSTGFRNLVMVFDSGSSYTYLNSQAYQF 304

Query: 302 LTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVF 361
           LTS L REL GKPL+EA+DD TLPLCW+GRKPF+++RDV+ YFKPLAL F+SG +    F
Sbjct: 305 LTSWLKRELTGKPLKEALDDRTLPLCWKGRKPFRNIRDVKTYFKPLALRFASGRKDTTQF 364

Query: 362 EIPMEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATANC 421
           E+P E YLIISS GNVCLGILNG++VGL+ SNIIGDISMQDKMV+Y+NEKQ IGW   NC
Sbjct: 365 ELPPEAYLIISSKGNVCLGILNGSEVGLQNSNIIGDISMQDKMVIYDNEKQMIGWGPGNC 424

Query: 422 DRVPKSR 426
           D++PKSR
Sbjct: 425 DKLPKSR 430

BLAST of Cla97C02G026270.1 vs. Swiss-Prot
Match: sp|Q0IU52|ASP1_ORYSJ (Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica OX=39947 GN=ASP1 PE=2 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 3.5e-92
Identity = 183/388 (47.16%), Positives = 254/388 (65.46%), Query Frame = 0

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++VL L GNV+P G + +T+ +G P K YFLD DTGS LTWLQCDAPC  C    H L
Sbjct: 21  SSAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVL 80

Query: 111 YQPS-NDLVPCKDPLCMSLHSSM--DHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNL 170
           Y+P+   LV C D LC  L++ +    RC +  QCDY ++Y D  SS+GVLV D F L+ 
Sbjct: 81  YKPTPKKLVTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSA 140

Query: 171 TNGDPIRPRLALGCGYDQDPGSSSYH-PMDGILGLGRGAVSMVSQLHNQGIV-RNVVGHC 230
           +NG      +A GCGYDQ   + +   P+D ILGL RG V+++SQL +QG++ ++V+GHC
Sbjct: 141 SNGTN-PTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 200

Query: 231 FSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIF--NGRSTGLRNLFVVFD 290
            SSKGGG+LFFGD       + WTPM+R++ K+YSPG G L F  N ++     + V+FD
Sbjct: 201 ISSKGGGFLFFGDAQVPTSGVTWTPMNREH-KYYSPGHGTLHFDSNSKAISAAPMAVIFD 260

Query: 291 SGSSYTYFNAQAYQ----VLTSLLNRELAGKPLREAMDDD-TLPLCWRGRKPFKSLRDVR 350
           SG++YTYF AQ YQ    V+ S LN E   K L E  + D  L +CW+G+    ++ +V+
Sbjct: 261 SGATYTYFAAQPYQATLSVVKSTLNSEC--KFLTEVTEKDRALTVCWKGKDKIVTIDEVK 320

Query: 351 KYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGT--DVGLETSNIIGDIS 410
           K F+ L+L F+ G + KA  EIP E YLIIS  G+VCLGIL+G+   + L  +N+IG I+
Sbjct: 321 KCFRSLSLEFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGIT 380

Query: 411 MQDKMVVYNNEKQAIGWATANCDRVPKS 425
           M D+MV+Y++E+  +GW    CDR+P+S
Sbjct: 381 MLDQMVIYDSERSLLGWVNYQCDRIPRS 402

BLAST of Cla97C02G026270.1 vs. Swiss-Prot
Match: sp|A2ZC67|ASP1_ORYSI (Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica OX=39946 GN=ASP1 PE=2 SV=2)

HSP 1 Score: 324.3 bits (830), Expect = 2.0e-87
Identity = 173/386 (44.82%), Positives = 248/386 (64.25%), Query Frame = 0

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++VL L GNV+P G + VT+ +G P KPYFLD DTGS LTWLQCD PC  C +  H L
Sbjct: 21  SSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGL 80

Query: 111 YQPS-NDLVPCKDPLCMSLHSSM--DHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNL 170
           Y+P     V C +  C  L++ +    +C   +QC Y ++Y  GGSS+GVL+ D F L  
Sbjct: 81  YKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLPA 140

Query: 171 TNGDPIRPRLALGCGYDQDPGSSSY-HPMDGILGLGRGAVSMVSQLHNQGIV-RNVVGHC 230
           +NG      +A GCGY+Q   + +   P++GILGLGRG V+++SQL +QG++ ++V+GHC
Sbjct: 141 SNGTN-PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHC 200

Query: 231 FSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGL--RNLFVVFD 290
            SSKG G+LFFGD       + W+PM+R++ KHYSP  G L FN  S  +    + V+FD
Sbjct: 201 ISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLQFNSNSKPISAAPMEVIFD 260

Query: 291 SGSSYTYFNAQAYQVLTSLLNRELAG--KPLREAMDDD-TLPLCWRGRKPFKSLRDVRKY 350
           SG++YTYF  Q Y    S++   L+   K L E  + D  L +CW+G+   +++ +V+K 
Sbjct: 261 SGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKKC 320

Query: 351 FKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGT--DVGLETSNIIGDISMQ 410
           F+ L+L F+ G + KA  EIP E YLIIS  G+VCLGIL+G+     L  +N+IG I+M 
Sbjct: 321 FRSLSLKFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGITML 380

Query: 411 DKMVVYNNEKQAIGWATANCDRVPKS 425
           D+MV+Y++E+  +GW    CDR+P+S
Sbjct: 381 DQMVIYDSERSLLGWVNYQCDRIPRS 402

BLAST of Cla97C02G026270.1 vs. Swiss-Prot
Match: sp|Q9M9A8|APCB1_ARATH (Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1)

HSP 1 Score: 300.1 bits (767), Expect = 4.1e-80
Identity = 170/396 (42.93%), Positives = 238/396 (60.10%), Query Frame = 0

Query: 45  ASSSFASSSIVLPLQGNVFPNGFYNVTLYVGQPP--KPYFLDPDTGSDLTWLQCDAPCQQ 104
           ++ S  SS+ + P+ GNV+P+G Y   + VG+P   + Y LD DTGS+LTW+QCDAPC  
Sbjct: 180 SAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTS 239

Query: 105 CTETLHPLYQPSND-LVPCKDPLCMSL-HSSMDHRCENPDQCDYEVEYADGGSSLGVLVR 164
           C +  + LY+P  D LV   +  C+ +  + +   CEN  QCDYE+EYAD   S+GVL +
Sbjct: 240 CAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTK 299

Query: 165 DVFPLNLTNGDPIRPRLALGCGYDQDP-GSSSYHPMDGILGLGRGAVSMVSQLHNQGIVR 224
           D F L L NG      +  GCGYDQ     ++    DGILGL R  +S+ SQL ++GI+ 
Sbjct: 300 DKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIIS 359

Query: 225 NVVGHCFSS--KGGGYLFFGDGIYDPYRLVWTPMSRD--------YPKHYSPGFGELIFN 284
           NVVGHC +S   G GY+F G  +   + + W PM  D             S G G L  +
Sbjct: 360 NVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLD 419

Query: 285 GRSTGLRNLFVVFDSGSSYTYFNAQAY-QVLTSLLNRELAGKPLREAMDDDTLPLCWRGR 344
           G +  +    V+FD+GSSYTYF  QAY Q++TSL  +E++G  L     D+TLP+CWR +
Sbjct: 420 GENGRVGK--VLFDTGSSYTYFPNQAYSQLVTSL--QEVSGLELTRDDSDETLPICWRAK 479

Query: 345 K--PFKSLRDVRKYFKPLALSFSSGGR--SKAVFEIPMEGYLIISSMGNVCLGILNGTDV 404
              PF SL DV+K+F+P+ L   S     S+ +  I  E YLIIS+ GNVCLGIL+G+ V
Sbjct: 480 TNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLL-IQPEDYLIISNKGNVCLGILDGSSV 539

Query: 405 GLETSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 421
              ++ I+GDISM+  ++VY+N K+ IGW  ++C R
Sbjct: 540 HDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVR 570

BLAST of Cla97C02G026270.1 vs. Swiss-Prot
Match: sp|Q9S9K4|ASPL2_ARATH (Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g65240 PE=3 SV=2)

HSP 1 Score: 149.4 bits (376), Expect = 8.9e-35
Identity = 118/401 (29.43%), Positives = 180/401 (44.89%), Query Frame = 0

Query: 52  SSIVLPLQGN--VFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHP 111
           +SI LPL G+  V   G Y   + +G PPK Y +  DTGSD+ W+ C  PC +C    + 
Sbjct: 56  ASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNL 115

Query: 112 LYQPS---------NDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVR 171
            ++ S         +  V C D  C  +  S    C+    C Y + YAD  +S G  +R
Sbjct: 116 NFRLSLFDMNASSTSKKVGCDDDFCSFI--SQSDSCQPALGCSYHIVYADESTSDGKFIR 175

Query: 172 DVFPLNLTNGD----PIRPRLALGCGYDQDPG-SSSYHPMDGILGLGRGAVSMVSQLHNQ 231
           D+  L    GD    P+   +  GCG DQ     +    +DG++G G+   S++SQL   
Sbjct: 176 DMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAAT 235

Query: 232 GIVRNVVGHCFSS-KGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTG 291
           G  + V  HC  + KGGG   F  G+ D  ++  TPM  +   HY+     +  +G S  
Sbjct: 236 GDAKRVFSHCLDNVKGGG--IFAVGVVDSPKVKTTPMVPN-QMHYNVMLMGMDVDGTSLD 295

Query: 292 L-----RNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRK 351
           L     RN   + DSG++  YF    Y    SL+   LA +P++  + ++T        +
Sbjct: 296 LPRSIVRNGGTIVDSGTTLAYFPKVLYD---SLIETILARQPVKLHIVEETF-------Q 355

Query: 352 PFKSLRDVRKYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLETS 411
            F    +V + F P++  F    +      +    YL        C G   G     E S
Sbjct: 356 CFSFSTNVDEAFPPVSFEFEDSVK----LTVYPHDYLFTLEEELYCFGWQAGGLTTDERS 415

Query: 412 NII--GDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGS 429
            +I  GD+ + +K+VVY+ + + IGWA  NC    K + GS
Sbjct: 416 EVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSSIKIKDGS 436

BLAST of Cla97C02G026270.1 vs. Swiss-Prot
Match: sp|Q8VYV9|APF1_ARATH (Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 SV=1)

HSP 1 Score: 126.7 bits (317), Expect = 6.1e-28
Identity = 117/377 (31.03%), Positives = 165/377 (43.77%), Query Frame = 0

Query: 67  FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHP---------LYQP---- 126
           + NVT  VG P   + +  DTGSDL WL CD  C  C   L           +Y P    
Sbjct: 105 YANVT--VGTPSDWFMVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNASS 164

Query: 127 SNDLVPCKDPLCMSLHSSMDHRCENPD-QCDYEVEY-ADGGSSLGVLVRDVFPL--NLTN 186
           ++  VPC   LC     +   RC +P+  C Y++ Y ++G SS GVLV DV  L  N  +
Sbjct: 165 TSTKVPCNSTLC-----TRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKS 224

Query: 187 GDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSK 246
              I  R+  GCG  Q          +G+ GLG   +S+ S L  +GI  N    CF + 
Sbjct: 225 SKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGND 284

Query: 247 GGGYLFFGD-GIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSY 306
           G G + FGD G  D      TP++   P          I  G +TG      VFDSG+S+
Sbjct: 285 GAGRISFGDKGSVDQRE---TPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSF 344

Query: 307 TYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPL--CWRGRKPFKSLRDVRKYFKPLAL 366
           TY    AY +++   N     K  R    D  LP   C+       +L   +  F+  A+
Sbjct: 345 TYLTDAAYTLISESFNSLALDK--RYQTTDSELPFEYCY-------ALSPNKDSFQYPAV 404

Query: 367 SFS-SGGRSKAVFE----IPMEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKM 419
           + +  GG S  V+     IPM+   +       CL I+      +E  +IIG   M    
Sbjct: 405 NLTMKGGSSYPVYHPLVVIPMKDTDV------YCLAIMK-----IEDISIIGQNFMTGYR 449

BLAST of Cla97C02G026270.1 vs. TAIR10
Match: AT4G33490.2 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 575.5 bits (1482), Expect = 2.8e-164
Identity = 272/420 (64.76%), Positives = 331/420 (78.81%), Query Frame = 0

Query: 5   VLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSF--ASSSIVLPLQGNV 64
           V  ++VLMV S+  L   SA  F     W +          S  F  A SS+V P+ GNV
Sbjct: 6   VRFMIVLMVMSL-VLGFSSAVDF----RWRK------TAGFSDRFTRAVSSVVFPVHGNV 65

Query: 65  FPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKD 124
           +P G+YNVT+ +GQPP+PY+LD DTGSDLTWLQCDAPC +C E  HPLYQPS+DL+PC D
Sbjct: 66  YPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCND 125

Query: 125 PLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCG 184
           PLC +LH + + RCE P+QCDYEVEYADGGSSLGVLVRDVF +N T G  + PRLALGCG
Sbjct: 126 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCG 185

Query: 185 YDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYD 244
           YDQ PG+SS+HP+DG+LGLGRG VS++SQLH+QG V+NV+GHC SS GGG LFFGD +YD
Sbjct: 186 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 245

Query: 245 PYRLVWTPMSRDYPKHYSPGF-GELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTS 304
             R+ WTPMSR+Y KHYSP   GEL+F GR+TGL+NL  VFDSGSSYTYFN++AYQ +T 
Sbjct: 246 SSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTY 305

Query: 305 LLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEIP 364
           LL REL+GKPL+EA DD TLPLCW+GR+PF S+ +V+KYFKPLALSF +G RSK +FEIP
Sbjct: 306 LLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIP 365

Query: 365 MEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNEKQAIGWATANCDRV 422
            E YLIIS  GNVCLGILNGT++GL+  N+IGDISMQD+M++Y+NEKQ+IGW   +CD +
Sbjct: 366 PEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 414

BLAST of Cla97C02G026270.1 vs. TAIR10
Match: AT1G44130.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 430.6 bits (1106), Expect = 1.1e-120
Identity = 199/396 (50.25%), Positives = 284/396 (71.72%), Query Frame = 0

Query: 39  ILSVPAASSSF-------ASSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDL 98
           ++ VP + SS        + SS+V PL GNVFP G+Y+V + +G PPK +  D DTGSDL
Sbjct: 13  LVIVPLSKSSIFKTFIKSSPSSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDL 72

Query: 99  TWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENP-DQCDYEVEYAD 158
           TW+QCDAPC  CT   +  Y+P  +++PC +P+C +LH      C NP +QCDYEV+YAD
Sbjct: 73  TWVQCDAPCSGCTLPPNLQYKPKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYAD 132

Query: 159 GGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD-GILGLGRGAVSMV 218
            GSS+G LV D FPL L NG  ++P +A GCGYDQ   S+   P   G+LGLGRG + ++
Sbjct: 133 QGSSMGALVTDQFPLKLVNGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLL 192

Query: 219 SQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPYRLVWTP-MSRDYPKHYSPGFGELIF 278
           +QL + G+ RNVVGHC SSKGGG+LFFGD +     + WTP +S+D   HY+ G  +L+F
Sbjct: 193 TQLVSAGLTRNVVGHCLSSKGGGFLFFGDNLVPSIGVAWTPLLSQD--NHYTTGPADLLF 252

Query: 279 NGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGR 338
           NG+ TGL+ L ++FD+GSSYTYFN++AYQ + +L+  +L   PL+ A +D TLP+CW+G 
Sbjct: 253 NGKPTGLKGLKLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGA 312

Query: 339 KPFKSLRDVRKYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLET 398
           KPFKS+ +V+ +FK + ++F++G R+  ++  P E YLI+S  GNVCLG+LNG++VGL+ 
Sbjct: 313 KPFKSVLEVKNFFKTITINFTNGRRNTQLYLAP-ELYLIVSKTGNVCLGLLNGSEVGLQN 372

Query: 399 SNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKS 425
           SN+IGDISMQ  M++Y+NEKQ +GW +++C+++PK+
Sbjct: 373 SNVIGDISMQGLMMIYDNEKQQLGWVSSDCNKLPKT 405

BLAST of Cla97C02G026270.1 vs. TAIR10
Match: AT1G77480.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 404.1 bits (1037), Expect = 1.1e-112
Identity = 191/375 (50.93%), Positives = 258/375 (68.80%), Query Frame = 0

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++V P+ GNV+P G+Y V L +G PPK + LD DTGSDLTW+QCDAPC  CT+     
Sbjct: 50  SSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ 109

Query: 111 YQPSNDLVPCKDPLCMSLHSSMDHRCENP-DQCDYEVEYADGGSSLGVLVRDVFPLNLTN 170
           Y+P+++ +PC   LC  L    D  C +P DQCDYE+ Y+D  SS+G LV D  PL L N
Sbjct: 110 YKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLAN 169

Query: 171 GDPIRPRLALGCGYD-QDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVVGHCFSS 230
           G  +  RL  GCGYD Q+PG        GILGLGRG V + +QL + GI +NV+ HC S 
Sbjct: 170 GSIMNLRLTFGCGYDQQNPGPHXXXXTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSH 229

Query: 231 KGGGYLFFGDGIYDPYRLVWTPMSRDYP-KHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 290
            G G+L  GD +     + WT ++ + P K+Y  G  EL+FN ++TG++ + VVFDSGSS
Sbjct: 230 TGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSS 289

Query: 291 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALS 350
           YTYFNA+AYQ +  L+ ++L GKPL +  DD +LP+CW+G+KP KSL +V+KYFK + L 
Sbjct: 290 YTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLR 349

Query: 351 FSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLETSNIIGDISMQDKMVVYNNE 410
           F +  ++  +F++P E YLII+  G VCLGILNGT++GLE  NIIGDIS Q  MV+Y+NE
Sbjct: 350 FGN-QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNE 409

Query: 411 KQAIGWATANCDRVP 423
           KQ IGW +++CD++P
Sbjct: 410 KQRIGWISSDCDKLP 423

BLAST of Cla97C02G026270.1 vs. TAIR10
Match: AT1G49050.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 300.1 bits (767), Expect = 2.2e-81
Identity = 170/396 (42.93%), Positives = 238/396 (60.10%), Query Frame = 0

Query: 45  ASSSFASSSIVLPLQGNVFPNGFYNVTLYVGQPP--KPYFLDPDTGSDLTWLQCDAPCQQ 104
           ++ S  SS+ + P+ GNV+P+G Y   + VG+P   + Y LD DTGS+LTW+QCDAPC  
Sbjct: 180 SAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTS 239

Query: 105 CTETLHPLYQPSND-LVPCKDPLCMSL-HSSMDHRCENPDQCDYEVEYADGGSSLGVLVR 164
           C +  + LY+P  D LV   +  C+ +  + +   CEN  QCDYE+EYAD   S+GVL +
Sbjct: 240 CAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTK 299

Query: 165 DVFPLNLTNGDPIRPRLALGCGYDQDP-GSSSYHPMDGILGLGRGAVSMVSQLHNQGIVR 224
           D F L L NG      +  GCGYDQ     ++    DGILGL R  +S+ SQL ++GI+ 
Sbjct: 300 DKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIIS 359

Query: 225 NVVGHCFSS--KGGGYLFFGDGIYDPYRLVWTPMSRD--------YPKHYSPGFGELIFN 284
           NVVGHC +S   G GY+F G  +   + + W PM  D             S G G L  +
Sbjct: 360 NVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLD 419

Query: 285 GRSTGLRNLFVVFDSGSSYTYFNAQAY-QVLTSLLNRELAGKPLREAMDDDTLPLCWRGR 344
           G +  +    V+FD+GSSYTYF  QAY Q++TSL  +E++G  L     D+TLP+CWR +
Sbjct: 420 GENGRVGK--VLFDTGSSYTYFPNQAYSQLVTSL--QEVSGLELTRDDSDETLPICWRAK 479

Query: 345 K--PFKSLRDVRKYFKPLALSFSSGGR--SKAVFEIPMEGYLIISSMGNVCLGILNGTDV 404
              PF SL DV+K+F+P+ L   S     S+ +  I  E YLIIS+ GNVCLGIL+G+ V
Sbjct: 480 TNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLL-IQPEDYLIISNKGNVCLGILDGSSV 539

Query: 405 GLETSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 421
              ++ I+GDISM+  ++VY+N K+ IGW  ++C R
Sbjct: 540 HDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVR 570

BLAST of Cla97C02G026270.1 vs. TAIR10
Match: AT1G65240.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 149.4 bits (376), Expect = 4.9e-36
Identity = 118/401 (29.43%), Positives = 180/401 (44.89%), Query Frame = 0

Query: 52  SSIVLPLQGN--VFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHP 111
           +SI LPL G+  V   G Y   + +G PPK Y +  DTGSD+ W+ C  PC +C    + 
Sbjct: 56  ASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNL 115

Query: 112 LYQPS---------NDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVR 171
            ++ S         +  V C D  C  +  S    C+    C Y + YAD  +S G  +R
Sbjct: 116 NFRLSLFDMNASSTSKKVGCDDDFCSFI--SQSDSCQPALGCSYHIVYADESTSDGKFIR 175

Query: 172 DVFPLNLTNGD----PIRPRLALGCGYDQDPG-SSSYHPMDGILGLGRGAVSMVSQLHNQ 231
           D+  L    GD    P+   +  GCG DQ     +    +DG++G G+   S++SQL   
Sbjct: 176 DMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAAT 235

Query: 232 GIVRNVVGHCFSS-KGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTG 291
           G  + V  HC  + KGGG   F  G+ D  ++  TPM  +   HY+     +  +G S  
Sbjct: 236 GDAKRVFSHCLDNVKGGG--IFAVGVVDSPKVKTTPMVPN-QMHYNVMLMGMDVDGTSLD 295

Query: 292 L-----RNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRK 351
           L     RN   + DSG++  YF    Y    SL+   LA +P++  + ++T        +
Sbjct: 296 LPRSIVRNGGTIVDSGTTLAYFPKVLYD---SLIETILARQPVKLHIVEETF-------Q 355

Query: 352 PFKSLRDVRKYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLETS 411
            F    +V + F P++  F    +      +    YL        C G   G     E S
Sbjct: 356 CFSFSTNVDEAFPPVSFEFEDSVK----LTVYPHDYLFTLEEELYCFGWQAGGLTTDERS 415

Query: 412 NII--GDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGS 429
            +I  GD+ + +K+VVY+ + + IGWA  NC    K + GS
Sbjct: 416 EVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSSIKIKDGS 436

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_004147327.2  5.1e-249  96.73         PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis sativus]
XP_011649449.1  2.8e-247  95.83         PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis sativus]
XP_008460823.1  1.4e-246  95.79         PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo]
XP_008460822.1  7.7e-245  94.91         PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis melo]
XP_022157721.1  4.2e-235  90.89         aspartic proteinase Asp1 isoform X2 [Momordica charantia]

Match Name                      E-value   Identity (%)  Description
tr|A0A1S3CDB2|A0A1S3CDB2_CUCME  9.2e-247  95.79         aspartic proteinase Asp1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=3...
tr|A0A1S3CDB4|A0A1S3CDB4_CUCME  5.1e-245  94.91         aspartic proteinase Asp1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=3...
tr|A0A0A0LKB0|A0A0A0LKB0_CUCSA  2.2e-208  97.73         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G338820 PE=3 SV=1
tr|A0A1S3CDR6|A0A1S3CDR6_CUCME  1.3e-205  96.07         aspartic proteinase Asp1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=3...
tr|M5WHY2|M5WHY2_PRUPE          2.2e-168  66.74         Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_4G279700 PE=3 SV=1

Match Name             E-value  Identity (%)  Description
sp|Q0IU52|ASP1_ORYSJ   3.5e-92  47.16         Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica OX=39947 GN=ASP1 PE=2 S...
sp|A2ZC67|ASP1_ORYSI   2.0e-87  44.82         Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica OX=39946 GN=ASP1 PE=2 SV=...
sp|Q9M9A8|APCB1_ARATH  4.1e-80  42.93         Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1
sp|Q9S9K4|ASPL2_ARATH  8.9e-35  29.43         Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=At1g65240 ...
sp|Q8VYV9|APF1_ARATH   6.1e-28  31.03         Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 ...

Match Name   E-value   Identity (%)  Description
AT4G33490.2  2.8e-164  64.76         Eukaryotic aspartyl protease family protein
AT1G44130.1  1.1e-120  50.25         Eukaryotic aspartyl protease family protein
AT1G77480.1  1.1e-112  50.93         Eukaryotic aspartyl protease family protein
AT1G49050.1  2.2e-81   42.93         Eukaryotic aspartyl protease family protein
AT1G65240.1  4.9e-36   29.43         Eukaryotic aspartyl protease family protein
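
The percentages in each HSP header can be checked against the match counts printed beside them (for example, 272 identical positions over an alignment length of 420 for AT4G33490.2). The following is a minimal sketch of that check in Python. The function name `check_fractions` and the 0.01 tolerance are illustrative assumptions, not part of the database or of BLAST.

```python
import re

# Matches fragments such as "Identity = 272/420 (64.76%)".
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_fractions(header: str, tol: float = 0.01) -> dict:
    """Recompute every 'label = hits/length (pct%)' fragment in an HSP header.

    Returns {label: (hits, length, recomputed_percent)}.
    Raises ValueError if a reported percentage differs from the
    recomputed value by more than `tol` (hypothetical tolerance).
    """
    result = {}
    for label, hits, length, reported in FRACTION.findall(header):
        pct = 100.0 * int(hits) / int(length)
        if abs(pct - float(reported)) > tol:
            raise ValueError(
                f"{label}: reported {reported}% but {hits}/{length} = {pct:.2f}%"
            )
        result[label] = (int(hits), int(length), round(pct, 2))
    return result


# Header fragment taken from the AT4G33490.2 hit in this record.
stats = check_fractions("Identity = 272/420 (64.76%), Query Frame = 0")
print(stats)
```

The denominator is the alignment length including gap columns, so it can exceed the number of query residues covered by the HSP.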
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
Vocabulary: INTERPRO
Term       Definition
IPR033121  PEPTIDASE_A1
IPR001461  Aspartic_peptidase_A1
IPR032799  TAXi_C
IPR032861  TAXi_N
IPR021109  Peptidase_aspartic_dom_sf
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
molecular_function  GO:0004190      aspartic-type endopeptidase activity
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0008233      peptidase activity

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
Cla97C02G026270  Cla97C02G026270  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name       Unique Name                Type
Cla97C02G026270.1  Cla97C02G026270.1-protein  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name              Unique Name               Type
Cla97C02G026270.1.exon.1  Cla97C02G026270.1.exon.1  exon
Cla97C02G026270.1.exon.2  Cla97C02G026270.1.exon.2  exon
Cla97C02G026270.1.exon.3  Cla97C02G026270.1.exon.3  exon
Cla97C02G026270.1.exon.4  Cla97C02G026270.1.exon.4  exon
Cla97C02G026270.1.exon.5  Cla97C02G026270.1.exon.5  exon
Cla97C02G026270.1.exon.6  Cla97C02G026270.1.exon.6  exon
Cla97C02G026270.1.exon.7  Cla97C02G026270.1.exon.7  exon
Cla97C02G026270.1.exon.8  Cla97C02G026270.1.exon.8  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
Cla97C02G026270.1.CDS.1  Cla97C02G026270.1.CDS.1  CDS
Cla97C02G026270.1.CDS.2  Cla97C02G026270.1.CDS.2  CDS
Cla97C02G026270.1.CDS.3  Cla97C02G026270.1.CDS.3  CDS
Cla97C02G026270.1.CDS.4  Cla97C02G026270.1.CDS.4  CDS
Cla97C02G026270.1.CDS.5  Cla97C02G026270.1.CDS.5  CDS
Cla97C02G026270.1.CDS.6  Cla97C02G026270.1.CDS.6  CDS
Cla97C02G026270.1.CDS.7  Cla97C02G026270.1.CDS.7  CDS
Cla97C02G026270.1.CDS.8  Cla97C02G026270.1.CDS.8  CDS


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                        Source       Source Term       Source Description                Alignment
IPR021109  Aspartic peptidase domain superfamily  GENE3D       G3DSA:2.40.70.10  -                                 coord: 243..421; e-value: 1.3E-27; score: 98.3
IPR021109  Aspartic peptidase domain superfamily  GENE3D       G3DSA:2.40.70.10  -                                 coord: 47..238; e-value: 2.0E-42; score: 147.5
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  SSF50630          Acid proteases                    coord: 60..421
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543           TAXi_N                            coord: 68..238; e-value: 4.0E-49; score: 167.2
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541           TAXi_C                            coord: 279..414; e-value: 2.9E-15; score: 56.4
IPR001461  Aspartic peptidase A1 family           PANTHER      PTHR13683         ASPARTYL PROTEASES                coord: 48..425
None       No IPR available                       PANTHER      PTHR13683:SF227   ASPARTYL PROTEASE FAMILY PROTEIN  coord: 48..425
None       No IPR available                       PROSITE      PS51257           PROKAR_LIPOPROTEIN                coord: 1..18; score: 6.0
IPR033121  Peptidase family A1 domain             PROSITE      PS51767           PEPTIDASE_A1                      coord: 68..414; score: 34.135
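
Each alignment entry above gives the matched region as `coord: start..end` on the protein. As a minimal sketch, the length of a region can be derived from such an entry as follows. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the helper name `span_length` is hypothetical.

```python
import re

# Matches the "coord: 68..238" part of an alignment entry.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_length(entry: str) -> int:
    """Return the number of residues covered by a 'coord: start..end' entry.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record). Raises ValueError if no coordinate pair is found.
    """
    match = COORD.search(entry)
    if match is None:
        raise ValueError(f"no 'coord: start..end' found in {entry!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


# Pfam TAXi_N and TAXi_C regions from this record.
taxi_n = span_length("coord: 68..238")
taxi_c = span_length("coord: 279..414")
print(taxi_n, taxi_c)
```

Under that assumption the TAXi_N match spans 171 residues and the TAXi_C match spans 136.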