Clc04G09950 (gene) Watermelon (cordophanus) v2

Overview
NameClc04G09950
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Descriptionaspartic proteinase-like protein 2
LocationClcChr04: 23525383 .. 23530449 (+)
RNA-Seq ExpressionClc04G09950
SyntenyClc04G09950
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
CAAAAAGTGAAGAAGAAAGCCCCCACTTTGATCTTCGTCGCTAAGTTTCCCTCTCCATTCCACACAACAACAACAACCCATTTTCCCATTGAATCCCGCCAAAAAAAACCCACAAAAAATGCGCCTGTTCTTTTGCCTCATATCCGCCTTAGCCTCCGTCGTCGTCGTCGTCGTCACGGGCACGGCGGCTACCTCCCCCAACCATTTCCCTCTCTACAGAGCCTTCCCCCACTTTCCCACTCCCCACTTCCACTCCCTCACAGCTCGCGACCGCCTCCGCCATTCCCGCGTCTTGCGACGACTCGCCGGTGGCATCGTCAATTTCTCCGTTAAAGGCTCCTCCGATCCTTTCATCGGGTGCGTCGTAGTTTTCTTTATTCGTCTTTGCTTATTTCGTGCAATTTCCATTTCTTTTCTTACTATATACATGGCCGATTGTTTTAGTCTTACACTTCTGATTTCATAATCGGATGATCGTCTGTCTTAAACAATACTTGATAATTAACCTGTTTTTGGTGGGTTCTACAGAAAACCCAATTTGTTTTTATCTTAATCTTTTTCTTTTCCTTCAAGTTACATGGAAGTGAATTGGTTTATTGCATAATTTGTACCGTCCGGTGTTATTTAATTGCGGCTTCGGGTCTTCGGGTCTTGGTTTTTGTAGGCTTTATTTCACCAAAGTGAAGTTGGGAAATCCTGAGAGGGAATTCAATGTGCAGATCGATACTGGGAGTGATATTTTGTGGGTCACTTGCAGTCCTTGCGACGGCTGTCCTCAATCAAGCGGACTTGGAGTAATATTCTGTTTTCTTATGTTTTTGTTACTTATTTTACTGATTATATTTTCGATTGAGCTTTAAGGTCATCATGCCTCTCACTTAATGAGGAGGTTTCAGAGTTTGCCAATAATGGATATCATAACTTCCTTGCAAAATGGAGAGTTTTTTGCTCGACTCCTTAGTGCTTATGGTGTTTTATCTGTACAGGTTGAACTCAACTTATTTGATGCTACAAAGTCATCCTCTGCCAGGGTTGTTCCCTGCTCAGATCCAATATGTGCTGCAATTCCAACCACCACAGATCAATGCTTATCCCAGGCTGACCATTGCGGTTACACCTTCCATTATCGGGATAGAAGTGGGACGTCGGGCTTTTATGTTACTGATTTGATGCATTTTGACATACTTCTGGGGGAGTCAACGATTGCAAATTCTTCAGCCCCTATTGTGTTTGGGTGAGTTGATAGATCTTATTTAGACATTCTATACCTGTACAGGATATTAGATTTGAACTGCTCAAAAGTACAAAGGGCAGAAGGGCACTCAAGTCTTAAAGAAATTTACCAGTTATTTGGTGGTTTATTCTGGTTTAAACTTGTTTTTCTTCAGCGAGCTATGATTTGTGTTACTTGTGAAGGTTGAAGTCACTGTGTACTTATAAGTTATAAGTTATAATATTGTGTACACAGCATGTGATTTCAGGGAAGAATGACATTGTTGGAGGGATTTTTTAAAAAAAATATTTATCGAAACATGTTTTCACTGGGCTTTCGTTTATGTATATATAAAGACGAAAAGAGATTTTAACCCATCCTCCCTAACTGCCTTTGTGCCTCGTGAAGACTTAGATATCATGTTGTGTATCACCAAGTCGGTAAACCCGTTGTAGATTTCAGGCCAAATATCATATTTGATATTCCAACCATCAGAAGTTATTGTATACGTCATGCCCCTGAGCTGAATTGATATTATGACCAGTAATACAATATTTTGGGGCTGTAGTATATTTTGTTCTTGATAACTACAGAAGTAAAATTGATAGTGAAAAGCACTAGGTGTATATTTATCGAATTGAATGAAAATTCTGCTCTACTTATCCACTCTACTTGATCTTTTGATTCCATGTGTGTATTTATTCATTTTTGGTTGGGTACTTGAAATCACTGCTTTGTTTGGTTGTCAATGCAGGTGTAGCGTATATCAGTATGGGGATTTGACTCGGGCAACCAAAGCACTTGATGGAATTTTTGGGTTTGGTCAAGGGGAGTTCTCAGTTATTTCACAATTGTCTTCTCGAGGAATCACACCTAAAGTATTCTCCCATTGTTTAAAAGGAGGGGAAAATGGAGGGGGTATCTTGGTTCTTGGTGAGATTCTGGAGCCCAGCATTGTTTATAGTCCACTTATTCCATCTCAGTATGTCCTCTATCTTGTGAAAGATTTGGTTATTGGTTCTACTTATTCTTTATGATGTCCATGAAATTTTGAATCCCTACTTTACCCCCAAATATTTCCTAAGAATTACAAACCTCAACGACTTTCATGCATTACCCCAATAGGATCCATCTTTGATCAGATTTTCCAGAGACATCACTTGTTTTTTAAAGGCTAGAGTTTTCAATGTTGAGGAAATAGAGATGTTATATATAATTCATATAAGTGCTTATGAAGAAAATGTCCTCTTGAAGTCGCTCTAAGTTTTAAACCTCTTTTAACCACTGCTCTTGGAAATTTTCTCACAATCAATATTTTTGAGATCAATAAGAAGTCTCTCGTTTGGAACCGCATGACATTAAGATGCTGATTCTGGTCTTGGAAAGATGATGCTACCCAACCATCATTTGAGGTTTTAAGACTGGCATCATACATCAAAAGAGGGATGGCCTCCATGAAAGAAAGAGTTATGCCAAGTTTTGTGCTTTGACTTAATTCTTTAATATCTGAAAGTTTTTCTTTGTTCTAACTAATGGTGCCATATTTTGTAATTGATCTCCACTGTTTATGTCTGTTCTTCCTTCATCAGGGGATGAATTCTACTTTAGGGAACTTATATAGAAATATTGCACTGTTCTTAACCCCTTTGCCGACTGTGTCACTCTATCAATGAGTTTTTGTTCGTCTCGGGCGACAATTTTTTTTTTCCTGTTTTCTTTTGCCTGTTTTATTTCTCTTGTTTTGTTTTGTTAGAGATATATAAGATATGCATTTCTATATGCTTATAATTTTATGTATAACTTATTCACATTTTAATTGAATTTCAGGCCGCACTATACCTTAAATTTACAGAGTATTGCACTCAGCGGGCAATTTTTTGCAAGCCCCACCGTGTTTCCAATATCAAATGCAGGAGAAACTATCATTGATTCTGGAACAACTTTGGCTTACATTGTGGAAGAAGTCTATGATTGGATTATCAGTGTGGTAAGTTTGTGTTCTTATATGTCTCTTACTGTTAGGTTTGATTTCTTTGGGGAGTGGTTGGGGTTCCATTACTTTCTTGCAATGAACTTCATGATTATAATATATAGTTATTGAAGGGATGATTCAACTAAACGGATATCAGTTGGTGTTTGATTCAACGTGATTACCCTTTGTGCAATAGTTTGTGAAAAAAATGATATTCTACTGATAATATGATTTTTTTCATTATAGCTTGTTGCAACTCTAGTTACAACTCTGTTTAGGCCGTTCTTTCATGTTAGTGCTGAGGAGTCCTTGTTCTTATATTATTTGTTGGAATGCCTTGAATCTGTCATAGACGCCTCTGTATCTGTAATATCTGATATCTATCTAATAATAAAACTACTCTCTTCCCCTACCCTTGGACATAGCTAACAAACTGTTAGTGAACCATGTAAATCTTTGTGGTCATCTTTACTTTTTACGTTTTCTCACTTAATTTGCTTGTTGATTTTATTGATGTTATAACATCATCAGCTATTTATCTCCCCTTCTGATCAACTGTTTGCAAGACACGTACTGGTGGAAATTTTGATAATGCTTTCATCTATATGCACTGGTTCTGACTTTACTGTTGCTTACTTGCAGATAACTTCTGCTGTTTCTCAATCAGCCACTCCCACAATTTCGAGGGGTAACCAATGTTATCGAGTCTCTACTAGGTGTGTGACTTAACCAAATAATATTGGAAACTATGATTAATTTTCAATGAGAAATGTGCCTTTGTTCTTTTCTTTTTTTAGGAACTTAATGATTATATGTTTTTATGTTTAGTATAGCAGAAATATTTCCTGTGGTCAGCTTTAATTTTGAGGGTACTGCATCCATGGTGGTGACACCTGAAGAATATCTTCAGTTTGACTCCATAGTAAGTTGCTCAATAATTTTGCTAGCCATCATTTCGGTTAATTTCATTCATCCATCCAGATTTCTGAGAGATTTAGATATTGAATTATGGCATTTGTATAATGCAAAACAATTCTTTTGCAGGAACCAGCTTTATGGTGCATAGGTTTTCAGAAAGCTGAGGATGGAGTAAACATTTTAGGAGGTTAAATTCTGAATTTAGCCTCAAATTTCCCATCCACAAAAAAAAAGAAAAGAAAAAAAATTCAAATTAGAATTCCACACAAAAACAGCCTCACATTCCTATTATTTTTGTCAAATTTCAAATTAAAAAGAAAAAAGAAAAAACAAAACGAACTTGAGTAGGTTGTGGTTTGTGTCATGAAACGATTAATGAATAGTAACCACAATCAACCATATGATCATTGTTAATGAATTTCTTCATTTTCAATGACAAACATCCCTTCATTTTGTAGATCTTGTTTTGAAAGATAAGATCATTATCTACGACTTGGCTCGACAACGGATTGGATGGGCGAATTATGACTGTAAGTTAGACCAAGTCTTGAATGCATAAAGTATTAACCTCCAAATCATTTCTGATTGAAGACAACTGCTGTTTGACAGGTTCGTCGTCTGTAAATGTTTCTGTAACATCCGGGAAGGATGTGTTCATCAATGAAGGACAGCTGAGTGTAAACAGCTCCTCAAGAAAGCATTTTTATCAGCTGCTCAACATTGTCATTGTATTACTAATACATTTAAAATTGTTCTGAAGTCCATTTTTGTTAATAACCCTGAGGGTTCTCTTGTCTTCATTATTTGTGCTAGCTGCATTCGCCTGGCCAATCTGTATTTTGAATGCTTGGTTGGGCTGCTTATCGCCAATCTGCGAAGAGCAAATGTTCCTGTGCCTGCATCTTACATCTGGCTTAATGCAGGCTCCTTTTCTGTGAGGTCGTCGTGCAGCTGCAACCCATAA

mRNA sequence

CAAAAAGTGAAGAAGAAAGCCCCCACTTTGATCTTCGTCGCTAAGTTTCCCTCTCCATTCCACACAACAACAACAACCCATTTTCCCATTGAATCCCGCCAAAAAAAACCCACAAAAAATGCGCCTGTTCTTTTGCCTCATATCCGCCTTAGCCTCCGTCGTCGTCGTCGTCGTCACGGGCACGGCGGCTACCTCCCCCAACCATTTCCCTCTCTACAGAGCCTTCCCCCACTTTCCCACTCCCCACTTCCACTCCCTCACAGCTCGCGACCGCCTCCGCCATTCCCGCGTCTTGCGACGACTCGCCGGTGGCATCGTCAATTTCTCCGTTAAAGGCTCCTCCGATCCTTTCATCGGGCTTTATTTCACCAAAGTGAAGTTGGGAAATCCTGAGAGGGAATTCAATGTGCAGATCGATACTGGGAGTGATATTTTGTGGGTCACTTGCAGTCCTTGCGACGGCTGTCCTCAATCAAGCGGACTTGGAGTTGAACTCAACTTATTTGATGCTACAAAGTCATCCTCTGCCAGGGTTGTTCCCTGCTCAGATCCAATATGTGCTGCAATTCCAACCACCACAGATCAATGCTTATCCCAGGCTGACCATTGCGGTTACACCTTCCATTATCGGGATAGAAGTGGGACGTCGGGCTTTTATGTTACTGATTTGATGCATTTTGACATACTTCTGGGGGAGTCAACGATTGCAAATTCTTCAGCCCCTATTGTGTTTGGGTGTAGCGTATATCAGTATGGGGATTTGACTCGGGCAACCAAAGCACTTGATGGAATTTTTGGGTTTGGTCAAGGGGAGTTCTCAGTTATTTCACAATTGTCTTCTCGAGGAATCACACCTAAAGTATTCTCCCATTGTTTAAAAGGAGGGGAAAATGGAGGGGGTATCTTGGTTCTTGGTGAGATTCTGGAGCCCAGCATTGTTTATAGTCCACTTATTCCATCTCAGCCGCACTATACCTTAAATTTACAGAGTATTGCACTCAGCGGGCAATTTTTTGCAAGCCCCACCGTGTTTCCAATATCAAATGCAGGAGAAACTATCATTGATTCTGGAACAACTTTGGCTTACATTGTGGAAGAAGTCTATGATTGGATTATCAGTGTGATAACTTCTGCTGTTTCTCAATCAGCCACTCCCACAATTTCGAGGGGTAACCAATGTTATCGAGTCTCTACTAGTATAGCAGAAATATTTCCTGTGGTCAGCTTTAATTTTGAGGGTACTGCATCCATGGTGGTGACACCTGAAGAATATCTTCAGTTTGACTCCATAGAACCAGCTTTATGGTGCATAGGTTTTCAGAAAGCTGAGGATGGAGTAAACATTTTAGGAGATCTTGTTTTGAAAGATAAGATCATTATCTACGACTTGGCTCGACAACGGATTGGATGGGCGAATTATGACTGTTCGTCGTCTGTAAATGTTTCTGTAACATCCGGGAAGGATGTGTTCATCAATGAAGGACAGCTGAGTTCCATTTTTGTTAATAACCCTGAGGGTTCTCTTGTCTTCATTATTTGTGCTAGCTGCATTCGCCTGGCCAATCTGTATTTTGAATGCTTGGTTGGGCTGCTTATCGCCAATCTGCGAAGAGCAAATGTTCCTGTGCCTGCATCTTACATCTGGCTTAATGCAGGCTCCTTTTCTGTGAGGTCGTCGTGCAGCTGCAACCCATAA

Coding sequence (CDS)

ATGCGCCTGTTCTTTTGCCTCATATCCGCCTTAGCCTCCGTCGTCGTCGTCGTCGTCACGGGCACGGCGGCTACCTCCCCCAACCATTTCCCTCTCTACAGAGCCTTCCCCCACTTTCCCACTCCCCACTTCCACTCCCTCACAGCTCGCGACCGCCTCCGCCATTCCCGCGTCTTGCGACGACTCGCCGGTGGCATCGTCAATTTCTCCGTTAAAGGCTCCTCCGATCCTTTCATCGGGCTTTATTTCACCAAAGTGAAGTTGGGAAATCCTGAGAGGGAATTCAATGTGCAGATCGATACTGGGAGTGATATTTTGTGGGTCACTTGCAGTCCTTGCGACGGCTGTCCTCAATCAAGCGGACTTGGAGTTGAACTCAACTTATTTGATGCTACAAAGTCATCCTCTGCCAGGGTTGTTCCCTGCTCAGATCCAATATGTGCTGCAATTCCAACCACCACAGATCAATGCTTATCCCAGGCTGACCATTGCGGTTACACCTTCCATTATCGGGATAGAAGTGGGACGTCGGGCTTTTATGTTACTGATTTGATGCATTTTGACATACTTCTGGGGGAGTCAACGATTGCAAATTCTTCAGCCCCTATTGTGTTTGGGTGTAGCGTATATCAGTATGGGGATTTGACTCGGGCAACCAAAGCACTTGATGGAATTTTTGGGTTTGGTCAAGGGGAGTTCTCAGTTATTTCACAATTGTCTTCTCGAGGAATCACACCTAAAGTATTCTCCCATTGTTTAAAAGGAGGGGAAAATGGAGGGGGTATCTTGGTTCTTGGTGAGATTCTGGAGCCCAGCATTGTTTATAGTCCACTTATTCCATCTCAGCCGCACTATACCTTAAATTTACAGAGTATTGCACTCAGCGGGCAATTTTTTGCAAGCCCCACCGTGTTTCCAATATCAAATGCAGGAGAAACTATCATTGATTCTGGAACAACTTTGGCTTACATTGTGGAAGAAGTCTATGATTGGATTATCAGTGTGATAACTTCTGCTGTTTCTCAATCAGCCACTCCCACAATTTCGAGGGGTAACCAATGTTATCGAGTCTCTACTAGTATAGCAGAAATATTTCCTGTGGTCAGCTTTAATTTTGAGGGTACTGCATCCATGGTGGTGACACCTGAAGAATATCTTCAGTTTGACTCCATAGAACCAGCTTTATGGTGCATAGGTTTTCAGAAAGCTGAGGATGGAGTAAACATTTTAGGAGATCTTGTTTTGAAAGATAAGATCATTATCTACGACTTGGCTCGACAACGGATTGGATGGGCGAATTATGACTGTTCGTCGTCTGTAAATGTTTCTGTAACATCCGGGAAGGATGTGTTCATCAATGAAGGACAGCTGAGTTCCATTTTTGTTAATAACCCTGAGGGTTCTCTTGTCTTCATTATTTGTGCTAGCTGCATTCGCCTGGCCAATCTGTATTTTGAATGCTTGGTTGGGCTGCTTATCGCCAATCTGCGAAGAGCAAATGTTCCTGTGCCTGCATCTTACATCTGGCTTAATGCAGGCTCCTTTTCTGTGAGGTCGTCGTGCAGCTGCAACCCATAA

Protein sequence

MRLFFCLISALASVVVVVVTGTAATSPNHFPLYRAFPHFPTPHFHSLTARDRLRHSRVLRRLAGGIVNFSVKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQSSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTSGFYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQFFASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVSTSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKDKIIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLSSIFVNNPEGSLVFIICASCIRLANLYFECLVGLLIANLRRANVPVPASYIWLNAGSFSVRSSCSCNP
Homology
BLAST of Clc04G09950 vs. NCBI nr
Match: XP_038880817.1 (aspartic proteinase 39-like [Benincasa hispida])

HSP 1 Score: 878.6 bits (2269), Expect = 2.6e-251
Identity = 431/458 (94.10%), Postives = 444/458 (96.94%), Query Frame = 0

Query: 1   MRLFFCLISALASVVVVVVTGTAATSPNHFPLYRAFPHFPTPHFHSLTARDRLRHSRVLR 60
           MRLFFCLISALAS  VVVV GTAATS NHFPL+R FPHFPTPHFHSLTARDRLRHSR+LR
Sbjct: 1   MRLFFCLISALAS-AVVVVAGTAATSLNHFPLHRTFPHFPTPHFHSLTARDRLRHSRLLR 60

Query: 61  RLAGGIVNFSVKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQSS 120
           RLAGGIVNFSVKGSSDPF+GLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQSS
Sbjct: 61  RLAGGIVNFSVKGSSDPFVGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQSS 120

Query: 121 GLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTSGFY 180
           GLG+ELNLFDATKSSSARVVPCSDPICAAIPTT DQCLSQ D CGYTFHYRDRSGTSGFY
Sbjct: 121 GLGIELNLFDATKSSSARVVPCSDPICAAIPTTRDQCLSQTDRCGYTFHYRDRSGTSGFY 180

Query: 181 VTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVISQLS 240
           +TD MHFDILLGESTIANSSAPIVFGCS+YQYGDLTRATKALDGIFGFGQGEFSVISQLS
Sbjct: 181 ITDSMHFDILLGESTIANSSAPIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLS 240

Query: 241 SRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQFFA 300
           SRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQ F+
Sbjct: 241 SRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQLFS 300

Query: 301 SPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVSTS 360
           +PTVFPISNAGETIIDSGTTLAY+VEEVYDWI+SVITSAVSQSATPTISRGNQCYRVSTS
Sbjct: 301 NPTVFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGNQCYRVSTS 360

Query: 361 IAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKDKII 420
           IAEIFP +SFNFEG A+MVVTPEEYLQFDSIEPALWCIGFQKAED  NILGDLVLKDKII
Sbjct: 361 IAEIFPEISFNFEGIAAMVVTPEEYLQFDSIEPALWCIGFQKAEDRTNILGDLVLKDKII 420

Query: 421 IYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 459
           +YDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS
Sbjct: 421 VYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 457

BLAST of Clc04G09950 vs. NCBI nr
Match: XP_008462514.1 (PREDICTED: aspartic proteinase-like protein 2 [Cucumis melo])

HSP 1 Score: 857.1 bits (2213), Expect = 8.2e-245
Identity = 416/458 (90.83%), Postives = 438/458 (95.63%), Query Frame = 0

Query: 1   MRLFFCLISALASVVVVVVTGTAATSPNHFPLYRAFPHFPTPHFHSLTARDRLRHSRVLR 60
           MRLFFC I AL SVV V V GTAA SPNHF L+RAFPHFP+P FHSL ARDRLRHSR+LR
Sbjct: 1   MRLFFCFIYALVSVVAVAVAGTAAISPNHFLLHRAFPHFPSPQFHSLKARDRLRHSRLLR 60

Query: 61  RLAGGIVNFSVKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQSS 120
           RLAGGIVNFSVKGSS+PF+GLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCP++S
Sbjct: 61  RLAGGIVNFSVKGSSNPFVGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPETS 120

Query: 121 GLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTSGFY 180
           GLG+ELNLFD TKSSSARV+PC+DPICAA+ TTTDQCLSQ DHC YTFHYRDRSGTSGFY
Sbjct: 121 GLGIELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLSQIDHCSYTFHYRDRSGTSGFY 180

Query: 181 VTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVISQLS 240
           VTD MHFDILLGESTIANSSA IVFGCS+YQYGDLTRATKALDGIFGFG+GEFSVISQLS
Sbjct: 181 VTDSMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGRGEFSVISQLS 240

Query: 241 SRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQFFA 300
           SRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQ F 
Sbjct: 241 SRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQLFP 300

Query: 301 SPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVSTS 360
           +PT FPISNAGETIIDSGTTLAY+VEEVYDWI+SVITSAVSQSATPTISRG+QC+RVSTS
Sbjct: 301 NPTTFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSTS 360

Query: 361 IAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKDKII 420
           +AEIFPV+SFNFEG ASMVVTPEEYLQFDSIEPALWCIGFQKAEDG+NILGDLVLKDKII
Sbjct: 361 VAEIFPVLSFNFEGVASMVVTPEEYLQFDSIEPALWCIGFQKAEDGLNILGDLVLKDKII 420

Query: 421 IYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 459
           +YDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS
Sbjct: 421 VYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 458

BLAST of Clc04G09950 vs. NCBI nr
Match: KAG6595193.1 (Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 851.7 bits (2199), Expect = 3.4e-243
Identity = 427/517 (82.59%), Postives = 464/517 (89.75%), Query Frame = 0

Query: 1   MRLFFCLISALASVVV---VVVTGTAATSPNHFPLYRAFPHFPTPHFHSLTARDRLRHSR 60
           MRLFFCLIS L S  +   V V+  AA S +HFPL+RAFPH PTPHFHSL ARDRLRHSR
Sbjct: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60

Query: 61  VLRRLAGGIVNFSVKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCP 120
           VLRRL GGIV+FSVKGSSD F+GLY+TKVKLGNP+REFNVQIDTGSDILWV CSPCDGCP
Sbjct: 61  VLRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120

Query: 121 QSSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTS 180
           QSSGLG+ELNLFD   SSSAR+V CSDPIC+A+PTTT+QCLSQ D+C YTF YRDRS TS
Sbjct: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTTNQCLSQNDNCNYTFQYRDRSATS 180

Query: 181 GFYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVIS 240
           GFYVTD M+FD++LGES IANSSA IVFGCS+YQYGDLTR T ALDGIFGFG+GEFSVIS
Sbjct: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240

Query: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQ 300
           QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIA+SGQ
Sbjct: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300

Query: 301 FFASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRV 360
            F +PTVF ISNAG TIIDSGTTLAY+VE+VY+WI+SVITSAVSQS TPTISRG+QCYRV
Sbjct: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360

Query: 361 STSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKD 420
           STS++E+FPV+SFNFEG ASMV+ PEEYLQFDSIEPAL CIGFQKAEDG+NILGDLVLKD
Sbjct: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420

Query: 421 KIIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLSSIFVNNPEGSLVFIICASC 480
           KI++YDLARQRIGWANYDCSSSVNVSVTSGKDVFI  GQLSS   NN E SLVFIICAS 
Sbjct: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFIG-GQLSS---NNSEDSLVFIICASH 480

Query: 481 IRLANLYFECLVGLLIANLRRANV-PVPASYIWLNAG 514
           + LANLYFECLVGL+I   RRANV   PAS+IWLNAG
Sbjct: 481 LHLANLYFECLVGLVITQARRANVCAPPASFIWLNAG 513

BLAST of Clc04G09950 vs. NCBI nr
Match: TYK07355.1 (aspartic proteinase-like protein 2 [Cucumis melo var. makuwa])

HSP 1 Score: 849.4 bits (2193), Expect = 1.7e-242
Identity = 415/460 (90.22%), Postives = 437/460 (95.00%), Query Frame = 0

Query: 1   MRLFFCLISALASV--VVVVVTGTAATSPNHFPLYRAFPHFPTPHFHSLTARDRLRHSRV 60
           MRLFFC I AL SV  V V V GTAA  PNHF L+RAFPHFP+P FHSL ARDRLRHSR+
Sbjct: 1   MRLFFCFIYALVSVVAVAVAVAGTAAIFPNHFLLHRAFPHFPSPQFHSLKARDRLRHSRL 60

Query: 61  LRRLAGGIVNFSVKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQ 120
           LRRLAGGIVNFSVKGSS+PF+GLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCP+
Sbjct: 61  LRRLAGGIVNFSVKGSSNPFVGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPE 120

Query: 121 SSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTSG 180
           +SGLG+ELNLFD TKSSSARV+PC+DPICAA+ TT DQCLSQ DHC YTFHYRDRSGTSG
Sbjct: 121 TSGLGIELNLFDTTKSSSARVLPCTDPICAAVSTTADQCLSQIDHCSYTFHYRDRSGTSG 180

Query: 181 FYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVISQ 240
           FYVTD MHFDILLGESTIANSSA IVFGCS+YQYGDLTRATKALDGIFGFGQGEFSVISQ
Sbjct: 181 FYVTDSMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQ 240

Query: 241 LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQF 300
           LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQ 
Sbjct: 241 LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQL 300

Query: 301 FASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVS 360
           F +PT+FPISNAGETIIDSGTTLAY+VEEVYDWI+SVITSAVSQSATPTISRG+QC+RVS
Sbjct: 301 FPNPTMFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVS 360

Query: 361 TSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKDK 420
           TS+AEIFPV+SFNFEG ASMVVTPEEYLQFDSIEPALWCIGFQKAEDG+NILGDLVLKDK
Sbjct: 361 TSVAEIFPVLSFNFEGIASMVVTPEEYLQFDSIEPALWCIGFQKAEDGLNILGDLVLKDK 420

Query: 421 IIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 459
           II+YDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS
Sbjct: 421 IIVYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 460

BLAST of Clc04G09950 vs. NCBI nr
Match: XP_011657680.1 (aspartic proteinase-like protein 2 [Cucumis sativus] >KGN48213.1 hypothetical protein Csa_003406 [Cucumis sativus])

HSP 1 Score: 842.4 bits (2175), Expect = 2.1e-240
Identity = 410/460 (89.13%), Postives = 435/460 (94.57%), Query Frame = 0

Query: 1   MRLFFCLISALASVVVVVVTGTAATS--PNHFPLYRAFPHFPTPHFHSLTARDRLRHSRV 60
           MRLFFC I ALASVV + + GTA  S  PNHF L+RAFPHFP+PHFHSL ARDRLRHSR+
Sbjct: 1   MRLFFCFIYALASVVALTLAGTAVISPGPNHFLLHRAFPHFPSPHFHSLKARDRLRHSRL 60

Query: 61  LRRLAGGIVNFSVKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQ 120
           LRRLAGGIVNFSVKGSS+PF+GLYFTKVKLGNP REFNVQIDTGSDILWVTCSPCDGCP 
Sbjct: 61  LRRLAGGIVNFSVKGSSNPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPD 120

Query: 121 SSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTSG 180
           SSGLG+ELNLFD TKSSSARV+PC+DPICAA+ TTTDQCL+Q DHC Y+FHYRDRSGTSG
Sbjct: 121 SSGLGIELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSG 180

Query: 181 FYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVISQ 240
           FYVTD MHFDILLGESTIANSSA IVFGCS+YQYGDLTRATKALDGIFGFGQGEFSVISQ
Sbjct: 181 FYVTDSMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQ 240

Query: 241 LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQF 300
           LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTL LQSIALSGQ 
Sbjct: 241 LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQL 300

Query: 301 FASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVS 360
           F +PT+FPISNAGETIIDSGTTLAY+VEEVYDWI+SVITSAVSQSATPTISRG+QC+RVS
Sbjct: 301 FPNPTMFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVS 360

Query: 361 TSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKDK 420
            S+A+IFPV+ FNFEG ASMVVTPEEYLQFDSIEPALWCIGFQKAEDG+NILGDLVLKDK
Sbjct: 361 MSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIEPALWCIGFQKAEDGLNILGDLVLKDK 420

Query: 421 IIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 459
           II+YDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS
Sbjct: 421 IIVYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 460

BLAST of Clc04G09950 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 288.1 bits (736), Expect = 2.0e-76
Identity = 163/435 (37.47%), Postives = 252/435 (57.93%), Query Frame = 0

Query: 47  LTARDRLRHSRVLRRLAGGIVNFSVKGSS-DPFIGLYFTKVKLGNPEREFNVQIDTGSDI 106
           L + D  RH+R+L       ++  + G S    IGLYFTK+KLG+P +E+ VQ+DTGSDI
Sbjct: 47  LKSHDSFRHARMLAN-----IDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDI 106

Query: 107 LWVTCSPCDGCPQSSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCG 166
           LWV C+PC  CP  + LG+ L+L+D+  SS+++ V C D  C+ I   ++ C ++   C 
Sbjct: 107 LWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFI-MQSETCGAKKP-CS 166

Query: 167 YTFHYRDRSGTSGFYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGI 226
           Y   Y D S + G ++ D +  + + G    A  +  +VFGC   Q G L +   A+DGI
Sbjct: 167 YHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGI 226

Query: 227 FGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHY 286
            GFGQ   S+ISQL++ G T ++FSHCL    NGGGI  +GE+  P +  +P++P+Q HY
Sbjct: 227 MGFGQSNTSIISQLAAGGSTKRIFSHCL-DNMNGGGIFAVGEVESPVVKTTPIVPNQVHY 286

Query: 287 TLNLQSIALSGQFFASPTVFPISNA-GETIIDSGTTLAYIVEEVYDWIISVITSAVSQSA 346
            + L+ + + G     P     +N  G TIIDSGTTLAY+ + +Y+ +I  IT A  Q  
Sbjct: 287 NVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKIT-AKQQVK 346

Query: 347 TPTISRGNQCYRVSTSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQK-- 406
              +     C+  +++  + FPVV+ +FE +  + V P +YL   S+   ++C G+Q   
Sbjct: 347 LHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYL--FSLREDMYCFGWQSGG 406

Query: 407 --AEDGVNI--LGDLVLKDKIIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 466
              +DG ++  LGDLVL +K+++YDL  + IGWA+++CSSS+ V   SG    +    L 
Sbjct: 407 MTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVKDGSGAAYQLGAENLI 466

Query: 467 SIFVNNPEGSLVFII 474
           S   +   G+LV ++
Sbjct: 467 SAASSVMNGTLVTLL 470

BLAST of Clc04G09950 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 286.2 bits (731), Expect = 7.6e-76
Identity = 162/424 (38.21%), Postives = 234/424 (55.19%), Query Frame = 0

Query: 43  HFHSLTARDRLRHSRVLRRLAGGIVNFSVKGSSD-PFIGLYFTKVKLGNPEREFNVQIDT 102
           HF S    D  RHSR+L       ++  + G S    +GLYFTK+KLG+P +E++VQ+DT
Sbjct: 42  HFKS---HDTRRHSRML-----ASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDT 101

Query: 103 GSDILWVTCSPCDGCPQSSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQA 162
           GSDILW+ C PC  CP  + L   L+LFD   SS+++ V C D  C+ I + +D C   A
Sbjct: 102 GSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFI-SQSDSC-QPA 161

Query: 163 DHCGYTFHYRDRSGTSGFYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKA 222
             C Y   Y D S + G ++ D++  + + G+         +VFGC   Q G L     A
Sbjct: 162 LGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSA 221

Query: 223 LDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPS 282
           +DG+ GFGQ   SV+SQL++ G   +VFSHCL     GGGI  +G +  P +  +P++P+
Sbjct: 222 VDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL-DNVKGGGIFAVGVVDSPKVKTTPMVPN 281

Query: 283 QPHYTLNLQSIALSGQFFASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVS 342
           Q HY + L  + + G     P    I   G TI+DSGTTLAY  + +YD +I  I  A  
Sbjct: 282 QMHYNVMLMGMDVDGTSLDLPR--SIVRNGGTIVDSGTTLAYFPKVLYDSLIETIL-ARQ 341

Query: 343 QSATPTISRGNQCYRVSTSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQ 402
                 +    QC+  ST++ E FP VSF FE +  + V P +YL   ++E  L+C G+Q
Sbjct: 342 PVKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYL--FTLEEELYCFGWQ 401

Query: 403 KA------EDGVNILGDLVLKDKIIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEG 460
                      V +LGDLVL +K+++YDL  + IGWA+++CSSS+ +   SG    +   
Sbjct: 402 AGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSSIKIKDGSGGVYSVGAD 449

BLAST of Clc04G09950 vs. ExPASy Swiss-Prot
Match: Q9M9A8 (Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1)

HSP 1 Score: 138.3 bits (347), Expect = 2.5e-31
Identity = 113/410 (27.56%), Postives = 181/410 (44.15%), Query Frame = 0

Query: 58  VLRRLAGGIVN----FSVKGSSDPFIGLYFTKVKLGNPE--REFNVQIDTGSDILWVTC- 117
           VL   AG I +    F V G+  P  GLY+T++ +G PE  + +++ IDTGS++ W+ C 
Sbjct: 176 VLSTSAGSIDSSTTIFPVGGNVYP-DGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCD 235

Query: 118 SPCDGCPQSSGLGVELNLFDATKSSSARVVPCSDPICAAIPTT--TDQCLSQADHCGYTF 177
           +PC  C + +       L+   K +   +V  S+  C  +     T+ C      C Y  
Sbjct: 236 APCTSCAKGAN-----QLYKPRKDN---LVRSSEAFCVEVQRNQLTEHC-ENCHQCDYEI 295

Query: 178 HYRDRSGTSGFYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGF 237
            Y D S + G    D  H  + L   ++A S   IVFGC   Q G L       DGI G 
Sbjct: 296 EYADHSYSMGVLTKDKFH--LKLHNGSLAESD--IVFGCGYDQQGLLLNTLLKTDGILGL 355

Query: 238 GQGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPS--IVYSPLIPSQPHYT 297
            + + S+ SQL+SRGI   V  HCL    NG G + +G  L PS  + + P++       
Sbjct: 356 SRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDA 415

Query: 298 LNLQSIALS-GQFFASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSAT 357
             +Q   +S GQ   S         G+ + D+G++  Y   + Y  +++ +        T
Sbjct: 416 YQMQVTKMSYGQGMLS-LDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELT 475

Query: 358 PTISRGNQ--CYRVSTS--------IAEIFPVVSFNFEG-----TASMVVTPEEYLQFDS 417
              S      C+R  T+        + + F  ++          +  +++ PE+YL   +
Sbjct: 476 RDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISN 535

Query: 418 IEPALWCIGF---QKAEDGVN-ILGDLVLKDKIIIYDLARQRIGWANYDC 437
                 C+G        DG   ILGD+ ++  +I+YD  ++RIGW   DC
Sbjct: 536 --KGNVCLGILDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDC 568

BLAST of Clc04G09950 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 132.1 bits (331), Expect = 1.8e-29
Identity = 115/436 (26.38%), Postives = 185/436 (42.43%), Query Frame = 0

Query: 26  SPNHFPLYRAFPHFPTPHFHSLTARDRLRHSRVLRRLAGGIVNFSVKGSSDPFIGLYFTK 85
           SP + P              S++   R  H      L  G++            G +F  
Sbjct: 38  SPIYNPQITVTDRLNAAFLRSVSRSRRFNHQLSQTDLQSGLIGAD---------GEFFMS 97

Query: 86  VKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQSSGLGVELNLFDATKSSSARVVPCSDP 145
           + +G P  +     DTGSD+ WV C PC  C + +G      +FD  KSS+ +  PC   
Sbjct: 98  ITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENG-----PIFDKKKSSTYKSEPCDSR 157

Query: 146 ICAAIPTTTDQCLSQADHCGYTFHYRDRSGTSGFYVTDLMHFDILLGESTIANSSAPIVF 205
            C A+ +T   C    + C Y + Y D+S + G   T+ +  D   G      S    VF
Sbjct: 158 NCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPV---SFPGTVF 217

Query: 206 GCSVYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL---KGGENGGGI 265
           GC  Y  G     T +  GI G G G  S+ISQL S     K FS+CL       NG  +
Sbjct: 218 GCG-YNNGGTFDETGS--GIIGLGGGHLSLISQLGSS--ISKKFSYCLSHKSATTNGTSV 277

Query: 266 LVLGEILEPS-------IVYSPLIPSQP--HYTLNLQSIAL--------SGQFFASPTVF 325
           + LG    PS       +V +PL+  +P  +Y L L++I++           +  +    
Sbjct: 278 INLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGI 337

Query: 326 PISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVSTSIAEI- 385
               +G  IIDSGTTL  +    +D   S +  +V+ +   +  +G   +   +  AEI 
Sbjct: 338 LSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEIG 397

Query: 386 FPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKDKIIIYDL 441
            P ++ +F G A + ++P     F  +   + C+      + V I G+    D ++ YDL
Sbjct: 398 LPEITVHFTG-ADVRLSPIN--AFVKLSEDMVCLSMVPTTE-VAIYGNFAQMDFLVGYDL 447

BLAST of Clc04G09950 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 131.3 bits (329), Expect = 3.1e-29
Identity = 115/404 (28.47%), Postives = 185/404 (45.79%), Query Frame = 0

Query: 54  RHSRVLRRLAGGIVNFS-VKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSP 113
           R SR L+RL   +   S V+ S     G Y   + +G P + F+  +DTGSD++W  C P
Sbjct: 66  RGSRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQP 125

Query: 114 CDGCPQSSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRD 173
           C  C   S       +F+   SSS   +PCS  +C A+ + T       + C YT+ Y D
Sbjct: 126 CTQCFNQS-----TPIFNPQGSSSFSTLPCSSQLCQALSSPT----CSNNFCQYTYGYGD 185

Query: 174 RSGTSGFYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGE 233
            S T G   T+ + F    G  +I N    I FGC     G   +   A  G+ G G+G 
Sbjct: 186 GSETQGSMGTETLTF----GSVSIPN----ITFGCGENNQG-FGQGNGA--GLVGMGRGP 245

Query: 234 FSVISQLSSRGITPKVFSHCLKG-GENGGGILVLGEIL--------EPSIVYSPLIPSQP 293
            S+ SQL    +T   FS+C+   G +    L+LG +           +++ S  IP+  
Sbjct: 246 LSLPSQLD---VTK--FSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFY 305

Query: 294 HYTLNLQSIALSGQFFASPTVFPISN---AGETIIDSGTTLAYIVEEVYDWIISVITSAV 353
           + TLN  S+  S +    P+ F +++    G  IIDSGTTL Y V   Y    SV    +
Sbjct: 306 YITLNGLSVG-STRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQ---SVRQEFI 365

Query: 354 SQSATPTISRGNQ----CYRVSTSIAEI-FPVVSFNFEGTASMVVTPEEYLQFDSIEPAL 413
           SQ   P ++  +     C++  +  + +  P    +F+G   + +  E Y  F S    L
Sbjct: 366 SQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDG-GDLELPSENY--FISPSNGL 425

Query: 414 WCIGFQKAEDGVNILGDLVLKDKIIIYDLARQRIGWANYDCSSS 440
            C+    +  G++I G++  ++ +++YD     + +A+  C +S
Sbjct: 426 ICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQCGAS 437

BLAST of Clc04G09950 vs. ExPASy TrEMBL
Match: A0A1S3CH65 (aspartic proteinase-like protein 2 OS=Cucumis melo OX=3656 GN=LOC103500850 PE=3 SV=1)

HSP 1 Score: 857.1 bits (2213), Expect = 4.0e-245
Identity = 416/458 (90.83%), Postives = 438/458 (95.63%), Query Frame = 0

Query: 1   MRLFFCLISALASVVVVVVTGTAATSPNHFPLYRAFPHFPTPHFHSLTARDRLRHSRVLR 60
           MRLFFC I AL SVV V V GTAA SPNHF L+RAFPHFP+P FHSL ARDRLRHSR+LR
Sbjct: 1   MRLFFCFIYALVSVVAVAVAGTAAISPNHFLLHRAFPHFPSPQFHSLKARDRLRHSRLLR 60

Query: 61  RLAGGIVNFSVKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQSS 120
           RLAGGIVNFSVKGSS+PF+GLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCP++S
Sbjct: 61  RLAGGIVNFSVKGSSNPFVGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPETS 120

Query: 121 GLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTSGFY 180
           GLG+ELNLFD TKSSSARV+PC+DPICAA+ TTTDQCLSQ DHC YTFHYRDRSGTSGFY
Sbjct: 121 GLGIELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLSQIDHCSYTFHYRDRSGTSGFY 180

Query: 181 VTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVISQLS 240
           VTD MHFDILLGESTIANSSA IVFGCS+YQYGDLTRATKALDGIFGFG+GEFSVISQLS
Sbjct: 181 VTDSMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGRGEFSVISQLS 240

Query: 241 SRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQFFA 300
           SRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQ F 
Sbjct: 241 SRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQLFP 300

Query: 301 SPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVSTS 360
           +PT FPISNAGETIIDSGTTLAY+VEEVYDWI+SVITSAVSQSATPTISRG+QC+RVSTS
Sbjct: 301 NPTTFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSTS 360

Query: 361 IAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKDKII 420
           +AEIFPV+SFNFEG ASMVVTPEEYLQFDSIEPALWCIGFQKAEDG+NILGDLVLKDKII
Sbjct: 361 VAEIFPVLSFNFEGVASMVVTPEEYLQFDSIEPALWCIGFQKAEDGLNILGDLVLKDKII 420

Query: 421 IYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 459
           +YDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS
Sbjct: 421 VYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 458

BLAST of Clc04G09950 vs. ExPASy TrEMBL
Match: A0A5D3C7C6 (Aspartic proteinase-like protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold202G00740 PE=3 SV=1)

HSP 1 Score: 849.4 bits (2193), Expect = 8.3e-243
Identity = 415/460 (90.22%), Postives = 437/460 (95.00%), Query Frame = 0

Query: 1   MRLFFCLISALASV--VVVVVTGTAATSPNHFPLYRAFPHFPTPHFHSLTARDRLRHSRV 60
           MRLFFC I AL SV  V V V GTAA  PNHF L+RAFPHFP+P FHSL ARDRLRHSR+
Sbjct: 1   MRLFFCFIYALVSVVAVAVAVAGTAAIFPNHFLLHRAFPHFPSPQFHSLKARDRLRHSRL 60

Query: 61  LRRLAGGIVNFSVKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQ 120
           LRRLAGGIVNFSVKGSS+PF+GLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCP+
Sbjct: 61  LRRLAGGIVNFSVKGSSNPFVGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPE 120

Query: 121 SSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTSG 180
           +SGLG+ELNLFD TKSSSARV+PC+DPICAA+ TT DQCLSQ DHC YTFHYRDRSGTSG
Sbjct: 121 TSGLGIELNLFDTTKSSSARVLPCTDPICAAVSTTADQCLSQIDHCSYTFHYRDRSGTSG 180

Query: 181 FYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVISQ 240
           FYVTD MHFDILLGESTIANSSA IVFGCS+YQYGDLTRATKALDGIFGFGQGEFSVISQ
Sbjct: 181 FYVTDSMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQ 240

Query: 241 LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQF 300
           LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQ 
Sbjct: 241 LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQL 300

Query: 301 FASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVS 360
           F +PT+FPISNAGETIIDSGTTLAY+VEEVYDWI+SVITSAVSQSATPTISRG+QC+RVS
Sbjct: 301 FPNPTMFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVS 360

Query: 361 TSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKDK 420
           TS+AEIFPV+SFNFEG ASMVVTPEEYLQFDSIEPALWCIGFQKAEDG+NILGDLVLKDK
Sbjct: 361 TSVAEIFPVLSFNFEGIASMVVTPEEYLQFDSIEPALWCIGFQKAEDGLNILGDLVLKDK 420

Query: 421 IIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 459
           II+YDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS
Sbjct: 421 IIVYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 460

BLAST of Clc04G09950 vs. ExPASy TrEMBL
Match: A0A0A0KF78 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G448710 PE=3 SV=1)

HSP 1 Score: 842.4 bits (2175), Expect = 1.0e-240
Identity = 410/460 (89.13%), Postives = 435/460 (94.57%), Query Frame = 0

Query: 1   MRLFFCLISALASVVVVVVTGTAATS--PNHFPLYRAFPHFPTPHFHSLTARDRLRHSRV 60
           MRLFFC I ALASVV + + GTA  S  PNHF L+RAFPHFP+PHFHSL ARDRLRHSR+
Sbjct: 1   MRLFFCFIYALASVVALTLAGTAVISPGPNHFLLHRAFPHFPSPHFHSLKARDRLRHSRL 60

Query: 61  LRRLAGGIVNFSVKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQ 120
           LRRLAGGIVNFSVKGSS+PF+GLYFTKVKLGNP REFNVQIDTGSDILWVTCSPCDGCP 
Sbjct: 61  LRRLAGGIVNFSVKGSSNPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPD 120

Query: 121 SSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTSG 180
           SSGLG+ELNLFD TKSSSARV+PC+DPICAA+ TTTDQCL+Q DHC Y+FHYRDRSGTSG
Sbjct: 121 SSGLGIELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSG 180

Query: 181 FYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVISQ 240
           FYVTD MHFDILLGESTIANSSA IVFGCS+YQYGDLTRATKALDGIFGFGQGEFSVISQ
Sbjct: 181 FYVTDSMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQ 240

Query: 241 LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQF 300
           LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTL LQSIALSGQ 
Sbjct: 241 LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQL 300

Query: 301 FASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVS 360
           F +PT+FPISNAGETIIDSGTTLAY+VEEVYDWI+SVITSAVSQSATPTISRG+QC+RVS
Sbjct: 301 FPNPTMFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVS 360

Query: 361 TSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKDK 420
            S+A+IFPV+ FNFEG ASMVVTPEEYLQFDSIEPALWCIGFQKAEDG+NILGDLVLKDK
Sbjct: 361 MSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIEPALWCIGFQKAEDGLNILGDLVLKDK 420

Query: 421 IIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 459
           II+YDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS
Sbjct: 421 IIVYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 460

BLAST of Clc04G09950 vs. ExPASy TrEMBL
Match: A0A6J1HFZ7 (aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111463213 PE=3 SV=1)

HSP 1 Score: 783.1 bits (2021), Expect = 7.3e-223
Identity = 386/461 (83.73%), Postives = 421/461 (91.32%), Query Frame = 0

Query: 1   MRLFFCLISALASVVV---VVVTGTAATSPNHFPLYRAFPHFPTPHFHSLTARDRLRHSR 60
           MRLFFCLIS L S  +   V V+  AA S +HFPL+RAFPH PTPHFHSL ARDRLRHSR
Sbjct: 1   MRLFFCLISFLLSFFLTGTVPVSAAAAFSAHHFPLHRAFPHPPTPHFHSLRARDRLRHSR 60

Query: 61  VLRRLAGGIVNFSVKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCP 120
           VLRRL GGIV+FSVKGSS+ F+GLY+TKVKLGNP+REFNVQIDTGSDILWV CSPCDGCP
Sbjct: 61  VLRRLRGGIVDFSVKGSSEQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCP 120

Query: 121 QSSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTS 180
           QSSGLG+ELNLFD   SSSAR+V CSDPIC+A PTTT+QCLSQ D+C YTF YRDRS TS
Sbjct: 121 QSSGLGIELNLFDTAMSSSARLVSCSDPICSAAPTTTNQCLSQNDNCNYTFQYRDRSATS 180

Query: 181 GFYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVIS 240
           GFYVTD M+FD++LGES IANSSA IVFGCS+YQYGDLTR T ALDGIFGFG+GEFSVIS
Sbjct: 181 GFYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVIS 240

Query: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQ 300
           QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIA+SGQ
Sbjct: 241 QLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQ 300

Query: 301 FFASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRV 360
            F +PTVF ISNAG TIIDSGTTLAY+VE+VY+WI+SVITSAVSQS TPTISRG+QCYRV
Sbjct: 301 PFPNPTVFSISNAGGTIIDSGTTLAYLVEDVYNWIVSVITSAVSQSTTPTISRGSQCYRV 360

Query: 361 STSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKD 420
           STS++E+FPV+SFNFEG ASMV+ PEEYLQFDSIEPAL CIGFQKAEDG+NILGDLVLKD
Sbjct: 361 STSVSEVFPVISFNFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKD 420

Query: 421 KIIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 459
           KI++YDLARQRIGWANYDCSSSVNVSVTSGKDVFI +GQLS
Sbjct: 421 KIVVYDLARQRIGWANYDCSSSVNVSVTSGKDVFI-DGQLS 460

BLAST of Clc04G09950 vs. ExPASy TrEMBL
Match: A0A6J1IB21 (aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111471277 PE=3 SV=1)

HSP 1 Score: 775.8 bits (2002), Expect = 1.2e-220
Identity = 382/460 (83.04%), Postives = 417/460 (90.65%), Query Frame = 0

Query: 1   MRLFFCLISALASVVV--VVVTGTAATSPNHFPLYRAFPHFPTPHFHSLTARDRLRHSRV 60
           MRLFFCLIS L S  +   V    AA S +HFPL+RA PH PTPHF+SL ARDRLRHSRV
Sbjct: 1   MRLFFCLISFLLSFFLTGTVPVSAAAFSAHHFPLHRALPHSPTPHFYSLRARDRLRHSRV 60

Query: 61  LRRLAGGIVNFSVKGSSDPFIGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQ 120
           LRRL GGIV+FSVKGSSD F+GLY+TKVKLGNP+REFNVQIDTGSDILWV CSPCDGCPQ
Sbjct: 61  LRRLRGGIVDFSVKGSSDQFVGLYYTKVKLGNPQREFNVQIDTGSDILWVNCSPCDGCPQ 120

Query: 121 SSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTSG 180
           SSGLG+ELNLFD   SSSAR+V CSDPIC+A+PTT +QCLSQ D+C YTF YRDRS TSG
Sbjct: 121 SSGLGIELNLFDTAMSSSARLVSCSDPICSAVPTTANQCLSQNDNCNYTFQYRDRSATSG 180

Query: 181 FYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVISQ 240
           FYVTD M+FD++LGES IANSSA IVFGCS+YQYGDLTR T ALDGIFGFG+GEFSVISQ
Sbjct: 181 FYVTDSMYFDMILGESVIANSSAAIVFGCSIYQYGDLTRTTVALDGIFGFGRGEFSVISQ 240

Query: 241 LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQF 300
           LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIA+SGQ 
Sbjct: 241 LSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIAISGQP 300

Query: 301 FASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVS 360
           F +PTVF IS+AG TIIDSGTTLAY+VEEVY+WI+SVITSAVSQS TPTISRG+QCYRVS
Sbjct: 301 FPNPTVFSISSAGGTIIDSGTTLAYLVEEVYNWIVSVITSAVSQSTTPTISRGSQCYRVS 360

Query: 361 TSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQKAEDGVNILGDLVLKDK 420
           TSI+E+FPV+SF FEG ASMV+ PEEYLQFDSIEPAL CIGFQKAEDG+NILGDLVLKDK
Sbjct: 361 TSISEVFPVISFKFEGIASMVLKPEEYLQFDSIEPALRCIGFQKAEDGINILGDLVLKDK 420

Query: 421 IIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQLS 459
           I++YDLARQR+GWANYDCSSSVNVSVTSGKDVFI +GQLS
Sbjct: 421 IVVYDLARQRVGWANYDCSSSVNVSVTSGKDVFI-DGQLS 459

BLAST of Clc04G09950 vs. TAIR 10
Match: AT2G36670.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 540.4 bits (1391), Expect = 1.6e-153
Identity = 279/477 (58.49%), Postives = 346/477 (72.54%), Query Frame = 0

Query: 1   MRLFFCLISALASVVVVVVTGTAATS-PNHF-----------PLYRAFPHFPTPHFHSLT 60
           MR    L+ A A  V + VTG AA+  P+ +           PL RAFP         L 
Sbjct: 1   MRTLRSLMLAAALAVALAVTGFAASPLPSAYAKYAAGPTKILPLQRAFPLDELVELSELR 60

Query: 61  ARDRLRHSRVL-----RRLAGGIVNFSVKGSSDPF-IGLYFTKVKLGNPEREFNVQIDTG 120
           ARDR+RH+R+L     +   GG+V+F V+GSSDP+ +GLYFTKVKLG+P  EFNVQIDTG
Sbjct: 61  ARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTG 120

Query: 121 SDILWVTCSPCDGCPQSSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQAD 180
           SDILWVTCS C  CP SSGLG++L+ FDA  S +A  V CSDPIC+++  TT    S+ +
Sbjct: 121 SDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQCSENN 180

Query: 181 HCGYTFHYRDRSGTSGFYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRATKAL 240
            CGY+F Y D SGTSG+Y+TD  +FD +LGES +ANSSAPIVFGCS YQ GDLT++ KA+
Sbjct: 181 QCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAV 240

Query: 241 DGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQ 300
           DGIFGFG+G+ SV+SQLSSRGITP VFSHCLKG  +GGG+ VLGEIL P +VYSPL+PSQ
Sbjct: 241 DGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLVPSQ 300

Query: 301 PHYTLNLQSIALSGQFF-ASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVS 360
           PHY LNL SI ++GQ       VF  SN   TI+D+GTTL Y+V+E YD  ++ I+++VS
Sbjct: 301 PHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVS 360

Query: 361 QSATPTISRGNQCYRVSTSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSI--EPALWCIG 420
           Q  TP IS G QCY VSTSI+++FP VS NF G ASM++ P++YL    I    ++WCIG
Sbjct: 361 QLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIG 420

Query: 421 FQKAEDGVNILGDLVLKDKIIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINEGQ 457
           FQKA +   ILGDLVLKDK+ +YDLARQRIGWA+YDCS SVNVS+TSGKD+ +N GQ
Sbjct: 421 FQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCSMSVNVSITSGKDI-VNSGQ 476

BLAST of Clc04G09950 vs. TAIR 10
Match: AT2G36670.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 534.6 bits (1376), Expect = 8.7e-152
Identity = 278/482 (57.68%), Postives = 345/482 (71.58%), Query Frame = 0

Query: 1   MRLFFCLISALASVVVVVVTGTAATS-PNHF-----------PLYRAFPHFPTPHFHSLT 60
           MR    L+ A A  V + VTG AA+  P+ +           PL RAFP         L 
Sbjct: 1   MRTLRSLMLAAALAVALAVTGFAASPLPSAYAKYAAGPTKILPLQRAFPLDELVELSELR 60

Query: 61  ARDRLRHSRVL-----RRLAGGIVNFSVKGSSDPFI------GLYFTKVKLGNPEREFNV 120
           ARDR+RH+R+L     +   GG+V+F V+GSSDP++       LYFTKVKLG+P  EFNV
Sbjct: 61  ARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPPTEFNV 120

Query: 121 QIDTGSDILWVTCSPCDGCPQSSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQC 180
           QIDTGSDILWVTCS C  CP SSGLG++L+ FDA  S +A  V CSDPIC+++  TT   
Sbjct: 121 QIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQ 180

Query: 181 LSQADHCGYTFHYRDRSGTSGFYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTR 240
            S+ + CGY+F Y D SGTSG+Y+TD  +FD +LGES +ANSSAPIVFGCS YQ GDLT+
Sbjct: 181 CSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTK 240

Query: 241 ATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSP 300
           + KA+DGIFGFG+G+ SV+SQLSSRGITP VFSHCLKG  +GGG+ VLGEIL P +VYSP
Sbjct: 241 SDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSP 300

Query: 301 LIPSQPHYTLNLQSIALSGQFF-ASPTVFPISNAGETIIDSGTTLAYIVEEVYDWIISVI 360
           L+PSQPHY LNL SI ++GQ       VF  SN   TI+D+GTTL Y+V+E YD  ++ I
Sbjct: 301 LVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAI 360

Query: 361 TSAVSQSATPTISRGNQCYRVSTSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSI--EPA 420
           +++VSQ  TP IS G QCY VSTSI+++FP VS NF G ASM++ P++YL    I    +
Sbjct: 361 SNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGAS 420

Query: 421 LWCIGFQKAEDGVNILGDLVLKDKIIIYDLARQRIGWANYDCSSSVNVSVTSGKDVFINE 457
           +WCIGFQKA +   ILGDLVLKDK+ +YDLARQRIGWA+YDCS SVNVS+TSGKD+ +N 
Sbjct: 421 MWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDCSMSVNVSITSGKDI-VNS 480

BLAST of Clc04G09950 vs. TAIR 10
Match: AT5G22850.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 486.5 bits (1251), Expect = 2.7e-137
Identity = 245/458 (53.49%), Postives = 327/458 (71.40%), Query Frame = 0

Query: 12  ASVVVVVVTGTAATS---PNHFPLYRAFPHFPTPHFHSLTARDRLRHSRVLRRLAGGIVN 71
           A++++  +   A  S   P    L R  P         L ARD  RH R+L+ L GG+++
Sbjct: 8   AAILICCLLPAAVLSYGFPAALKLERVIPANHEMELSQLKARDEARHGRLLQSL-GGVID 67

Query: 72  FSVKGSSDPF-IGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQSSGLGVELN 131
           F V G+ DPF +GLY+TK++LG P R+F VQ+DTGSD+LWV+C+ C+GCPQ+SGL ++LN
Sbjct: 68  FPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLN 127

Query: 132 LFDATKSSSARVVPCSDPICA-AIPTTTDQCLSQADHCGYTFHYRDRSGTSGFYVTDLMH 191
            FD   S +A  + CSD  C+  I ++   C  Q + C YTF Y D SGTSGFYV+D++ 
Sbjct: 128 FFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQ 187

Query: 192 FDILLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITP 251
           FD+++G S + NS+AP+VFGCS  Q GDL ++ +A+DGIFGFGQ   SVISQL+S+GI P
Sbjct: 188 FDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAP 247

Query: 252 KVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQFF-ASPTVF 311
           +VFSHCLKG   GGGILVLGEI+EP++V++PL+PSQPHY +NL SI+++GQ    +P+VF
Sbjct: 248 RVFSHCLKGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVF 307

Query: 312 PISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVSTSIAEIF 371
             SN   TIID+GTTLAY+ E  Y   +  IT+AVSQS  P +S+GNQCY ++TS+ +IF
Sbjct: 308 STSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITTSVGDIF 367

Query: 372 PVVSFNFEGTASMVVTPEEYL--QFDSIEPALWCIGFQKAED-GVNILGDLVLKDKIIIY 431
           P VS NF G ASM + P++YL  Q +    A+WCIGFQ+ ++ G+ ILGDLVLKDKI +Y
Sbjct: 368 PPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVY 427

Query: 432 DLARQRIGWANYDCSSSVNVSVT--SGKDVFINEGQLS 459
           DL  QRIGWANYDCS+SVNVS T  SG+  ++N GQ S
Sbjct: 428 DLVGQRIGWANYDCSTSVNVSATSSSGRSEYVNAGQFS 464

BLAST of Clc04G09950 vs. TAIR 10
Match: AT1G08210.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 454.5 bits (1168), Expect = 1.1e-127
Identity = 237/453 (52.32%), Postives = 305/453 (67.33%), Query Frame = 0

Query: 11  LASVVVVVVTGTAATSPNHFPLYRAFPHFPTPHFHSLTARDRLRHSRVLRRLAGGIVNFS 70
           +A+V+++  T  A  S     L R  P         L A D  RH R+L+   GG+VNF 
Sbjct: 12  IAAVLLLAATTLACGSDAVLKLERLIPPNHELGLTELRAFDSARHGRLLQSPVGGVVNFP 71

Query: 71  VKGSSDPF-IGLYFTKVKLGNPEREFNVQIDTGSDILWVTCSPCDGCPQSSGLGVELNLF 130
           V G+SDPF +GLY+TKVKLG P REFNVQIDTGSD+LWV+C+ C+GCP++S L ++L+ F
Sbjct: 72  VDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFF 131

Query: 131 DATKSSSARVVPCSDPICAAIPTTTDQCLSQADHCGYTFHYRDRSGTSGFYVTDLMHFDI 190
           D   SSSA +V CSD  C +   T   C S  + C Y+F Y D SGTSG+Y++D M FD 
Sbjct: 132 DPGVSSSASLVSCSDRRCYSNFQTESGC-SPNNLCSYSFKYGDGSGTSGYYISDFMSFDT 191

Query: 191 LLGESTIANSSAPIVFGCSVYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVF 250
           ++  +   NSSAP VFGCS  Q GDL R  +A+DGIFG GQG  SVISQL+ +G+ P+VF
Sbjct: 192 VITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVF 251

Query: 251 SHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLNLQSIALSGQFF-ASPTVFPIS 310
           SHCLKG ++GGGI+VLG+I  P  VY+PL+PSQPHY +NLQSIA++GQ     P+VF I+
Sbjct: 252 SHCLKGDKSGGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIA 311

Query: 311 NAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQSATPTISRGNQCYRVSTSIAEIFPVV 370
               TIID+GTTLAY+ +E Y   I  + +AVSQ   P      QC+ ++    ++FP V
Sbjct: 312 TGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQV 371

Query: 371 SFNFEGTASMVVTPEEYLQ-FDSIEPALWCIGFQK-AEDGVNILGDLVLKDKIIIYDLAR 430
           S +F G ASMV+ P  YLQ F S   ++WCIGFQ+ +   + ILGDLVLKDK+++YDL R
Sbjct: 372 SLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVR 431

Query: 431 QRIGWANYDCSSSVNVSVTSG---KDVFINEGQ 457
           QRIGWA YDCS  VNVS + G   KDV IN GQ
Sbjct: 432 QRIGWAEYDCSLEVNVSASRGGRSKDV-INTGQ 462

BLAST of Clc04G09950 vs. TAIR 10
Match: AT1G05840.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 297.0 bits (759), Expect = 3.0e-80
Identity = 155/406 (38.18%), Postives = 235/406 (57.88%), Query Frame = 0

Query: 46  SLTARDRLRHSRVLRRLAGGIVNFSVKGSSDPFI-GLYFTKVKLGNPEREFNVQIDTGSD 105
           SLTA       R L  LAG  ++  + G+  P I GLY+ K+ +G P + + VQ+DTGSD
Sbjct: 45  SLTALKEHDDRRQLTILAG--IDLPLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSD 104

Query: 106 ILWVTCSPCDGCPQSSGLGVELNLFDATKSSSARVVPCSDPICAAIPTTTDQCLSQADHC 165
           I+WV C  C  CP+ S LG+EL L++  +S S ++V C D  C  I             C
Sbjct: 105 IMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKANMSC 164

Query: 166 GYTFHYRDRSGTSGFYVTDLMHFDILLGESTIANSSAPIVFGCSVYQYGDLTRAT-KALD 225
            Y   Y D S T+G++V D++ +D + G+     ++  ++FGC   Q GDL  +  +ALD
Sbjct: 165 PYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALD 224

Query: 226 GIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQP 285
           GI GFG+   S+ISQL+S G   K+F+HCL  G NGGGI  +G +++P +  +PL+P+QP
Sbjct: 225 GILGFGKANSSMISQLASSGRVKKIFAHCL-DGRNGGGIFAIGRVVQPKVNMTPLVPNQP 284

Query: 286 HYTLNLQSIALSGQFFASPT-VFPISNAGETIIDSGTTLAYIVEEVYDWIISVITSAVSQ 345
           HY +N+ ++ +  +F   P  +F   +    IIDSGTTLAY+ E +Y+ ++  ITS    
Sbjct: 285 HYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPA 344

Query: 346 SATPTISRGNQCYRVSTSIAEIFPVVSFNFEGTASMVVTPEEYLQFDSIEPALWCIGFQK 405
                + +  +C++ S  + E FP V+F+FE +  + V P +YL        +WCIG+Q 
Sbjct: 345 LKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYL---FPHEGMWCIGWQN 404

Query: 406 A------EDGVNILGDLVLKDKIIIYDLARQRIGWANYDCSSSVNV 443
           +         + +LGDLVL +K+++YDL  Q IGW  Y+CSSS+ V
Sbjct: 405 SAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCSSSIKV 444

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038880817.12.6e-25194.10aspartic proteinase 39-like [Benincasa hispida][more]
XP_008462514.18.2e-24590.83PREDICTED: aspartic proteinase-like protein 2 [Cucumis melo][more]
KAG6595193.13.4e-24382.59Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. soror... [more]
TYK07355.11.7e-24290.22aspartic proteinase-like protein 2 [Cucumis melo var. makuwa][more]
XP_011657680.12.1e-24089.13aspartic proteinase-like protein 2 [Cucumis sativus] >KGN48213.1 hypothetical pr... [more]
Match NameE-valueIdentityDescription
Q4V3D22.0e-7637.47Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1[more]
Q9S9K47.6e-7638.21Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2[more]
Q9M9A82.5e-3127.56Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1[more]
Q3EBM51.8e-2926.38Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561... [more]
Q766C33.1e-2928.47Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Match NameE-valueIdentityDescription
A0A1S3CH654.0e-24590.83aspartic proteinase-like protein 2 OS=Cucumis melo OX=3656 GN=LOC103500850 PE=3 ... [more]
A0A5D3C7C68.3e-24390.22Aspartic proteinase-like protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E56... [more]
A0A0A0KF781.0e-24089.13Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G44871... [more]
A0A6J1HFZ77.3e-22383.73aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111463213... [more]
A0A6J1IB211.2e-22083.04aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111471277 P... [more]
Match NameE-valueIdentityDescription
AT2G36670.21.6e-15358.49Eukaryotic aspartyl protease family protein [more]
AT2G36670.18.7e-15257.68Eukaryotic aspartyl protease family protein [more]
AT5G22850.12.7e-13753.49Eukaryotic aspartyl protease family protein [more]
AT1G08210.11.1e-12752.32Eukaryotic aspartyl protease family protein [more]
AT1G05840.13.0e-8038.18Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 313..324
score: 40.66
coord: 408..423
score: 35.03
coord: 88..108
score: 48.95
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 5..455
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 82..266
e-value: 1.3E-40
score: 139.5
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 66..267
e-value: 1.1E-46
score: 161.3
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 268..443
e-value: 1.5E-36
score: 127.7
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 78..440
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 284..432
e-value: 4.1E-19
score: 68.9
NoneNo IPR availablePANTHERPTHR13683:SF851EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEINcoord: 5..455
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 82..432
score: 40.71846
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 82..436
e-value: 7.68354E-55
score: 183.618

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc04G09950.2Clc04G09950.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity