Clc01G19160 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G19160
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: aspartic proteinase-like protein 1
Location: ClcChr01: 31517989 .. 31521956 (-)
RNA-Seq Expression: Clc01G19160
Synteny: Clc01G19160
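The Location field gives a chromosome, a start and end coordinate, and a strand sign. The following is a minimal sketch of how such a string could be parsed and used. The function names (`parse_location`, `span_length`, `reverse_complement`) are illustrative and not part of any database API, and the code assumes the coordinates are 1-based and inclusive, which the page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a string like 'ClcChr01: 31517989 .. 31521956 (-)' into its parts."""
    m = re.match(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$", text.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (GFF-style).
    return end - start + 1

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Return the reverse complement, as needed for a feature on the (-) strand."""
    return seq.translate(_COMPLEMENT)[::-1]

chrom, start, end, strand = parse_location("ClcChr01: 31517989 .. 31521956 (-)")
print(chrom, strand, span_length(start, end))  # ClcChr01 - 3968
```

Under the stated coordinate assumption the gene spans 3968 bp. Because the feature lies on the minus strand, a slice cut directly from the chromosome sequence would have to be passed through `reverse_complement` to read in transcript orientation.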
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCTCTCTCTCTCTCTCTCTCGCTGCTTCTTCTTTACAGTTCACTTCACTTCTTCTTTACAACTTCCGGCTGAACTCGCCGACCTTTTTTCTGCGATCTCTCTTCCGTAATTTGGGTTTCGCCATAGCCCTCGGATTCTTTCTGTTTCTGTTTCCCATATTGGTCCCCCGGTGGGTAATTGAGGCTCCGAATTGTTGGCAGCTTTCTTCTGAAGCTCCAGATCTCGTCTTCGAGCTAATTCGTTCGTCCTTTTAAGGACTGAGTTGCAATGTCGCTTCGGAATCTGATTTTGTTGCTGCTGGTGGTGATTGGCGTTCACCAGGCGGTGTCGATTACGTTCACATCCAGGATACTTCACAGGTTCTCTGAGGAGATGAAGGCGCTTAGGGTTGCAGGGAGTACAAATACGAGTGTACGAGCACTATGGCCTGAGAAGGGGAGCATGGAGTATTATCAGGAGCTTGTGAGTGGTGACTTCCAGAGGCAGAAGATGAAGCTTGGCTCTCGGTTTCAGTTGCTTTTCCCGTCTGAAGGCAGCAAAACCATTGCGCTGGGAAATGACTTTGGCTGGTCACCATTACTCTACCTTCTCCTTTTCTTTTCTTTTCTTTTTTTTTTTTTCTCCTTCCCTGTTCTTGCTATTTTTGTTCTTCCGTTTTGCATTTACTTTTCTGTTTGTTTTTCTTTTGTCGCCGTGATTCTCTGCAAACCTTTCCTTTGAGTAATATCTCTCTCTCACCCCCACACACACCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATCCCCACCTCCTAATAGCCTTTTGCATAGTGGCACCTTTATCAGAAAATTTGTGTTATAAAATGTTATCCTACGCACTTGAAAACCATCTTGTTATTTACGTATTACAGGTTGCATTACACCTGGATCGATATCGGGACACCGAGTGTTTCATTTCTGGTTGCATTGGATGCTGGGAGTGATCTACTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCCTTGTCTGCAAGTTACTATGGCAGTCTGGTAGGCCTGCTTTTCAGAATCTGTTCTTTCTACATAGGCATTAATTACTTCTAATCCACGACTTACGCTTTATCGCCAAACATATTACCTTTTCGTATCCGTTAGAGCGAGGGTCATACTTGGACAAGTTGTGTTCTGAAATTCAAAGCTTTTTGTAGTGCTTTTTTGCGGGAAAGAAAGGCATGGTCGGTTCCAACGGGAAAAAAAAAAAGAAAAAATTGCCCGAGAGGAGACGGACATGAAAGACTCTATCATCTTGCATCTAAAGAAAAACGATTGGCTAAATCATTTTTGGCTCAAATGTATTCGTGGATTCTGAAGTCTTTTTAAATAAGACAGCAAGTTTAGCACTTATTTCCGATGATTGATTGGCAGGTGCTCCTGCTCCCTCATGTTTGGGGCATTATTTATAATGCAGGATAAAGATCTCAACGAATACCGTCCATCCAGTTCAAGCACGAGCAAGCATATATCTTGCAGCCATGATTTGTGCGAGTCAGGCCAAAGCTGCCAAAGTCCGAAGCAGTCATGTCCTTATGTTATTGATTACATTACTGAAAATACTTCAAGTTCAGGACTACTAATTCAGGATGTGTTGCATCTTTCATCTGGTTGTGATAATTCATCTAATTGTATGATTCAGGCTCCGATCATTTTAGGGTACAACCCATTGATCCTTTATTTTGTTGTTCTAATTGTTTGCTCGGCATGCCAACTGCAGCAATGAGGAAGGATGCTGTATTTATATTACCCTTTTTTGCTCTCTAATACAGGTGTGGAATGAAACAAAGTGGCGGTTATCTAAGTGGAGTCGCTCCAGATGGTCTTTTTGGATTGGGGCTAGGAGAAATTTCTGTTCTTAGTTCCCTTGCAAAAGAAGAATTGGTGCAGAACTCTTTCTCACTGTGTTTTAATGAGGATGGATCCGGCAGAATTTTTTTTGGAGACGAGGGACCAGCAAGTCAACA
AACAACTTCATTTGTGCCATTAGATGGGAAATAGTAATTACCATACTGAATTATCTTATTGTTTATGATCATGGTTTGTTATATATGCTGACTGAAGTCCTTGTCATACCTGAATCTCTCAGTGAAACCTACATTGTCGGGGTCGAAGCATGTTGTATTGAGAATTCATGCCTCAAGCAGACAAGTTTTAAAGCATTGATAGATAGTGGAACATCGTTTACATATCTTCCAGAGGAAGCATATGAAAATATTGTGATGGAGGTACATTCTTTTTGTAAGCTTTGTACTTCATTTATAGTTTGATTGTCCTTTAACCAGTTCCCCCTTTGATAGTTTGATAAAAGGTTAAACACTACAAGCACCGTCTCCTTTAAAGGATATCCATGGAAGTATTGCTATAAGATCAGGTGACATCAATGAGATGGAAAACCTTTAAAATCCTTTTCCCTTTATAACTCCCTACCTTGTGCATGTCTAATGTTAATGTTGTGGGTTTTTCTAACAGTGCAGATGCAATGCCGAAGGTTCCATCTGTGACATTGTTGTTCCCACAAAACAATAGCTTTGTGGTTCATGATCCTGTGTTCCCTATCTATGGCGATCAGGTAGATGAAGCTTCCTTCTGAGAATGTTCAAGTTTACATGATTTGATTTAATGAATGTGGATGAAGGTGTGGACTTGATCTTTGAAATTTGTTATAATTTGCCATTAATGGAATAACTTCCACTTTTAGTGGGCATAAAAGTACCCTCAAACATCAAACTTGGAATCATAGTAGTGAATTTGATAGCTTCTGAATACTCCAGAAAATAGAGCCCCCAAGGAAATGAAGAATAAGAACAACTAAAACCATAAAAGAGGAAAAGAATGTGTGCTTAAACCATCAAGATGAGGTAGTGCCCTCTCTTTGATACAGGTTAGCAGTTACTCTCTAGTTCTAACTATCTAATAAATGATATAATTGATTTAACTATCCTAAAGAAGTCACATGTTATGCATCACTTTAGTACAAGAAAATAATACCTGTGGTAATACTAAAACAGTACTATGCATGCAGGGGTTAGCTGGATTTTGTTTTGCAGTTCTCCCTGCTGATGGAGATATTGGAATACTGGGACGTAAGCTCACAATGTCTCCTCAAATTTAGATTGCTAATATATGATATGTGTGCTATATGCATAGTAAGTTGTTCATCTAATGAAAACGGTAATTTTGTACGTTGTCTCGTGCAGAAAATTACATGACTGGATACCGGATGGTATTTGATAGGGACAATTTGAAGTTGGGTTGGTCACGCGCAAATTGTAAGATCTCCAACTCCTAGTACTTGTCATGAATTCGCAGCCTCTTAGCAGATGCTCACATCTTGTGCATGGTAGACCTAAAGAAAAATTTTTGAACTGTCTTAACCACCATTTTTGTTCTGGCTCCGTTGTGTGTGGGTTGGTTATTTGTGTTTCTCATTTGTTAAATGAAAGTACGGTTTTTCCATTTTATTGTAAACTCGTTTCTTGGACATATGTAGAGTTTCCTTTATCATGTGAACTTCCATTCTTATTTCTACCATGTCCATGTCAAAGGTCAAGATCTCAGTAACGAAAAGAAAATGCCTCTTGCTCCTGCAAAAGAGACGCCGCCCAACCCATTACCAGCCAATGAGCAGCAGAGCGCTCCAGGGGGGCACGCGGTGGCTCCTGCTGTAGCTGGAAGGGCCCCCTCTAAACCATCAGGTGCTGCCCCTTGCTTCATCATCCCATCCAGCTTTTATTCAATCAGATTGCCGCACCTGCTCCTTCTGGTACTCTACCTTGTTTCTTCTTGCGTGTGATGTAGATTCTGAGTTTATCTAAGTTTTACATGTAAATGAAGGTCGTCTCTTTACATCGTCCCCCCAGCAAGGACATGTAGATCTTTGTGTCCTTGCAAGCTGCATCAATGGCATTTGCCTTTTTTTTTTTTTTTTTCTT

mRNA sequence

TCTCTCTCTCTCTCTCTCTCGCTGCTTCTTCTTTACAGTTCACTTCACTTCTTCTTTACAACTTCCGGCTGAACTCGCCGACCTTTTTTCTGCGATCTCTCTTCCGTAATTTGGGTTTCGCCATAGCCCTCGGATTCTTTCTGTTTCTGTTTCCCATATTGGTCCCCCGGTGGGTAATTGAGGCTCCGAATTGTTGGCAGCTTTCTTCTGAAGCTCCAGATCTCGTCTTCGAGCTAATTCGTTCGTCCTTTTAAGGACTGAGTTGCAATGTCGCTTCGGAATCTGATTTTGTTGCTGCTGGTGGTGATTGGCGTTCACCAGGCGGTGTCGATTACGTTCACATCCAGGATACTTCACAGGTTCTCTGAGGAGATGAAGGCGCTTAGGGTTGCAGGGAGTACAAATACGAGTGTACGAGCACTATGGCCTGAGAAGGGGAGCATGGAGTATTATCAGGAGCTTGTGAGTGGTGACTTCCAGAGGCAGAAGATGAAGCTTGGCTCTCGGTTTCAGTTGCTTTTCCCGTCTGAAGGCAGCAAAACCATTGCGCTGGGAAATGACTTTGGCTGGTTGCATTACACCTGGATCGATATCGGGACACCGAGTGTTTCATTTCTGGTTGCATTGGATGCTGGGAGTGATCTACTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCCTTGTCTGCAAGTTACTATGGCAGTCTGAGCGAGGGTCATACTTGGACAAGTTGTGTTCTGAAATTCAAAGCTTTTTGTAGTGCTTTTTTGCGGGAAAGAAAGGCATGGTGCTCCTGCTCCCTCATGTTTGGGGCATTATTTATAATGCAGGATAAAGATCTCAACGAATACCGTCCATCCAGTTCAAGCACGAGCAAGCATATATCTTGCAGCCATGATTTGTGCGAGTCAGGCCAAAGCTGCCAAAGTCCGAAGCAGTCATGTCCTTATGTTATTGATTACATTACTGAAAATACTTCAAGTTCAGGACTACTAATTCAGGATGTGTTGCATCTTTCATCTGGTTGTGATAATTCATCTAATTGTATGATTCAGGCTCCGATCATTTTAGGGTGTGGAATGAAACAAAGTGGCGGTTATCTAAGTGGAGTCGCTCCAGATGGTCTTTTTGGATTGGGGCTAGGAGAAATTTCTGTTCTTAGTTCCCTTGCAAAAGAAGAATTGGTGCAGAACTCTTTCTCACTGTGTTTTAATGAGGATGGATCCGGCAGAATTTTTTTTGGAGACGAGGGACCAGCAAGTCAACAAACAACTTCATTTGTGCCATTAGATGGGAAATATGAAACCTACATTGTCGGGGTCGAAGCATGTTGTATTGAGAATTCATGCCTCAAGCAGACAAGTTTTAAAGCATTGATAGATAGTGGAACATCGTTTACATATCTTCCAGAGGAAGCATATGAAAATATTGTGATGGAGTTTGATAAAAGGTTAAACACTACAAGCACCGTCTCCTTTAAAGGATATCCATGGAAGTATTGCTATAAGATCAGTGCAGATGCAATGCCGAAGGTTCCATCTGTGACATTGTTGTTCCCACAAAACAATAGCTTTGTGGTTCATGATCCTGTGTTCCCTATCTATGGCGATCAGGGGTTAGCTGGATTTTGTTTTGCAGTTCTCCCTGCTGATGGAGATATTGGAATACTGGGACAAAATTACATGACTGGATACCGGATGGTATTTGATAGGGACAATTTGAAGTTGGGTTGGTCACGCGCAAATTGTCAAGATCTCAGTAACGAAAAGAAAATGCCTCTTGCTCCTGCAAAAGAGACGCCGCCCAACCCATTACCAGCCAATGAGCAGCAGAGCGCTCCAGGGGGGCACGCGGTGGCTCCTGCTGTAGCTGGAAGGGCCCCCTCTAAACCATCAGGTGCTGCCCCTTGCTTCATCATCCCATCCAGCTTTTATTCAATCAGATTGCCGCACCTGCTCCTTCTGGTACTCTACCTTGTTTCTTCTTGCGTGTGATGTAG
ATTCTGAGTTTATCTAAGTTTTACATGTAAATGAAGGTCGTCTCTTTACATCGTCCCCCCAGCAAGGACATGTAGATCTTTGTGTCCTTGCAAGCTGCATCAATGGCATTTGCCTTTTTTTTTTTTTTTTTCTT

Coding sequence (CDS)

ATGTCGCTTCGGAATCTGATTTTGTTGCTGCTGGTGGTGATTGGCGTTCACCAGGCGGTGTCGATTACGTTCACATCCAGGATACTTCACAGGTTCTCTGAGGAGATGAAGGCGCTTAGGGTTGCAGGGAGTACAAATACGAGTGTACGAGCACTATGGCCTGAGAAGGGGAGCATGGAGTATTATCAGGAGCTTGTGAGTGGTGACTTCCAGAGGCAGAAGATGAAGCTTGGCTCTCGGTTTCAGTTGCTTTTCCCGTCTGAAGGCAGCAAAACCATTGCGCTGGGAAATGACTTTGGCTGGTTGCATTACACCTGGATCGATATCGGGACACCGAGTGTTTCATTTCTGGTTGCATTGGATGCTGGGAGTGATCTACTTTGGGTTCCTTGTGATTGCATACAATGTGCTCCCTTGTCTGCAAGTTACTATGGCAGTCTGAGCGAGGGTCATACTTGGACAAGTTGTGTTCTGAAATTCAAAGCTTTTTGTAGTGCTTTTTTGCGGGAAAGAAAGGCATGGTGCTCCTGCTCCCTCATGTTTGGGGCATTATTTATAATGCAGGATAAAGATCTCAACGAATACCGTCCATCCAGTTCAAGCACGAGCAAGCATATATCTTGCAGCCATGATTTGTGCGAGTCAGGCCAAAGCTGCCAAAGTCCGAAGCAGTCATGTCCTTATGTTATTGATTACATTACTGAAAATACTTCAAGTTCAGGACTACTAATTCAGGATGTGTTGCATCTTTCATCTGGTTGTGATAATTCATCTAATTGTATGATTCAGGCTCCGATCATTTTAGGGTGTGGAATGAAACAAAGTGGCGGTTATCTAAGTGGAGTCGCTCCAGATGGTCTTTTTGGATTGGGGCTAGGAGAAATTTCTGTTCTTAGTTCCCTTGCAAAAGAAGAATTGGTGCAGAACTCTTTCTCACTGTGTTTTAATGAGGATGGATCCGGCAGAATTTTTTTTGGAGACGAGGGACCAGCAAGTCAACAAACAACTTCATTTGTGCCATTAGATGGGAAATATGAAACCTACATTGTCGGGGTCGAAGCATGTTGTATTGAGAATTCATGCCTCAAGCAGACAAGTTTTAAAGCATTGATAGATAGTGGAACATCGTTTACATATCTTCCAGAGGAAGCATATGAAAATATTGTGATGGAGTTTGATAAAAGGTTAAACACTACAAGCACCGTCTCCTTTAAAGGATATCCATGGAAGTATTGCTATAAGATCAGTGCAGATGCAATGCCGAAGGTTCCATCTGTGACATTGTTGTTCCCACAAAACAATAGCTTTGTGGTTCATGATCCTGTGTTCCCTATCTATGGCGATCAGGGGTTAGCTGGATTTTGTTTTGCAGTTCTCCCTGCTGATGGAGATATTGGAATACTGGGACAAAATTACATGACTGGATACCGGATGGTATTTGATAGGGACAATTTGAAGTTGGGTTGGTCACGCGCAAATTGTCAAGATCTCAGTAACGAAAAGAAAATGCCTCTTGCTCCTGCAAAAGAGACGCCGCCCAACCCATTACCAGCCAATGAGCAGCAGAGCGCTCCAGGGGGGCACGCGGTGGCTCCTGCTGTAGCTGGAAGGGCCCCCTCTAAACCATCAGGTGCTGCCCCTTGCTTCATCATCCCATCCAGCTTTTATTCAATCAGATTGCCGCACCTGCTCCTTCTGGTACTCTACCTTGTTTCTTCTTGCGTGTGA
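Since the CDS is a contiguous part of the spliced mRNA, the untranslated regions can be derived by locating one inside the other. The sketch below illustrates this on a toy sequence; `utr_lengths` is a hypothetical helper name, and the approach assumes the CDS occurs exactly once in the mRNA and that both are given in the same orientation.

```python
def utr_lengths(mrna, cds):
    """Return (five_prime_utr_len, three_prime_utr_len) for a CDS found inside an mRNA."""
    mrna, cds = mrna.upper(), cds.upper()
    offset = mrna.find(cds)  # 0-based index of the first CDS base, -1 if absent
    if offset < 0:
        raise ValueError("CDS not found in mRNA")
    five_prime = offset
    three_prime = len(mrna) - offset - len(cds)
    return five_prime, three_prime

# Toy example: 3 nt of 5' UTR, a 9 nt CDS (ATG AAA TGA), 2 nt of 3' UTR.
print(utr_lengths("GGGATGAAATGACC", "ATGAAATGA"))  # (3, 2)
```

Applied to the full mRNA and CDS strings of this record, the same call would report the lengths of the five_prime_UTR and three_prime_UTR segments.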

Protein sequence

MSLRNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLMFGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVFDRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPSKPSGAAPCFIIPSSFYSIRLPHLLLLVLYLVSSCV
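The protein sequence can be cross-checked against the CDS by translating it with the standard genetic code. The following is a self-contained sketch, not the pipeline the database used; the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGTCGCTTCGGAATCTGATT"))  # MSLRNLI
```

The output matches the first seven residues of the protein sequence, and the last four codons of the CDS (TCT TGC GTG TGA) translate to SCV followed by a stop, matching its C-terminus.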
Homology
BLAST of Clc01G19160 vs. NCBI nr
Match: XP_038882807.1 (aspartic proteinase-like protein 1 isoform X1 [Benincasa hispida])

HSP 1 Score: 1009.2 bits (2608), Expect = 1.4e-290
Identity = 510/575 (88.70%), Positives = 522/575 (90.78%), Query Frame = 0

Query: 1   MSLRNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSME 60
           MSL+NLILLLL+VI VHQAVSITFTSRILHRFSEEMKALRV+GSTNTSVRA WPEKGSME
Sbjct: 1   MSLQNLILLLLMVIAVHQAVSITFTSRILHRFSEEMKALRVSGSTNTSVRASWPEKGSME 60

Query: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120
           YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL
Sbjct: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLM 180
           DAGSDLLWVPCDCIQCAPLSASYYGSL                                 
Sbjct: 121 DAGSDLLWVPCDCIQCAPLSASYYGSL--------------------------------- 180

Query: 181 FGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSS 240
                   DKDLNEYRPSSSSTSKHISCSH+LCESGQSCQSPKQSCPYVIDYITENTSSS
Sbjct: 181 --------DKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYITENTSSS 240

Query: 241 GLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300
           GLLIQDVLHLS GC NSSNCMIQAP+ILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS
Sbjct: 241 GLLIQDVLHLSFGCGNSSNCMIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300

Query: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 360
           LAKEELVQNSFSLCFNEDGSGRIFFGD+GPASQQ TSFVPLDGKYETY+VGVEACCIENS
Sbjct: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDKGPASQQMTSFVPLDGKYETYVVGVEACCIENS 360

Query: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 420
           CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM
Sbjct: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 420

Query: 421 PKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVF 480
           PKVP+VTLLFP NNSFVVHDPVFP+YGD+GLAGFCFA+LPADGDIGILGQNYMTGYRMVF
Sbjct: 421 PKVPTVTLLFPLNNSFVVHDPVFPVYGDEGLAGFCFAILPADGDIGILGQNYMTGYRMVF 480

Query: 481 DRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS 540
           DRD+LKLGWSRANC DLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS
Sbjct: 481 DRDDLKLGWSRANCLDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS 533

Query: 541 KPSGAAPCFIIPSSFYSIRLPHLLLLVLYLVSSCV 576
           KPS A PCF I SSFYSIRLPHLLLLV YLVSSCV
Sbjct: 541 KPSAAVPCF-ITSSFYSIRLPHLLLLVFYLVSSCV 533
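The Identity figure in an HSP header is the number of alignment columns with identical residues divided by the total number of columns. As a hedged illustration, the sketch below counts identities and gaps for one pair of aligned rows and reproduces the percentages from the raw counts; `alignment_stats` and `percent` are hypothetical helper names. The positives count is not recomputed here because it depends on the substitution matrix used in the search.

```python
def alignment_stats(query_row, sbjct_row):
    """Count identical columns and gapped columns in two aligned rows of equal length."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have the same length")
    pairs = list(zip(query_row, sbjct_row))
    identities = sum(q == s and q != "-" for q, s in pairs)
    gaps = sum(q == "-" or s == "-" for q, s in pairs)
    return identities, gaps, len(pairs)

def percent(count, total):
    """Format a ratio as a percentage with two decimals, as in the HSP header."""
    return f"{100 * count / total:.2f}"

# Final block of the alignment against XP_038882807.1.
print(alignment_stats("KPSGAAPCFIIPSSFYSIRLPHLLLLVLYLVSSCV",
                      "KPSAAVPCF-ITSSFYSIRLPHLLLLVFYLVSSCV"))  # (30, 1, 35)
print(percent(510, 575), percent(522, 575))  # 88.70 90.78
```

Summing the per-block identity counts over all blocks of a hit should give the numerator reported in its header.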

BLAST of Clc01G19160 vs. NCBI nr
Match: XP_004143563.2 (aspartic proteinase-like protein 1 isoform X1 [Cucumis sativus] >KAE8647563.1 hypothetical protein Csa_003286 [Cucumis sativus])

HSP 1 Score: 998.0 bits (2579), Expect = 3.3e-287
Identity = 505/575 (87.83%), Positives = 517/575 (89.91%), Query Frame = 0

Query: 1   MSLRNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSME 60
           MSLRNL+LLLL+VI VHQ VSITFTSRILHRFSEEMKALR +GSTNTSVR  WPEKGSME
Sbjct: 1   MSLRNLLLLLLMVIFVHQVVSITFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSME 60

Query: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120
           YYQELVSGDF+RQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL
Sbjct: 61  YYQELVSGDFRRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLM 180
           DAGSDLLWVPC+CIQCAPLSASYYGSL                                 
Sbjct: 121 DAGSDLLWVPCNCIQCAPLSASYYGSL--------------------------------- 180

Query: 181 FGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSS 240
                   DKDLNEYRPSSSSTSKHISCSH+LC+SGQSCQSPKQSCPYVIDYITENTSSS
Sbjct: 181 --------DKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSS 240

Query: 241 GLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300
           GLLIQDVLHLSSGC+NSSNC IQAP+ILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS
Sbjct: 241 GLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300

Query: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 360
           LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS
Sbjct: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 360

Query: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 420
           CLKQTSFKALIDSGTSFTYLPEEAYENIV+EFDKRLNTTS VSFKGYPWKYCYKISADAM
Sbjct: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAM 420

Query: 421 PKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVF 480
           PKVPSVTLLFP NNSFVVHDPVFPIYGDQGLAGFCFA+LPADGDIGILGQNYMTGYRMVF
Sbjct: 421 PKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVF 480

Query: 481 DRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS 540
           DRDNLKLGWS ANCQDLSNEKKMPL PAKETPPNPLPA+EQQSA GGHAVAPAVAGRAPS
Sbjct: 481 DRDNLKLGWSHANCQDLSNEKKMPLTPAKETPPNPLPADEQQSASGGHAVAPAVAGRAPS 533

Query: 541 KPSGAAPCFIIPSSFYSIRLPHLLLLVLYLVSSCV 576
           KPS A PCF IPS FYSIRLPHLLLL L LVSSCV
Sbjct: 541 KPSAATPCF-IPSRFYSIRLPHLLLLALCLVSSCV 533

BLAST of Clc01G19160 vs. NCBI nr
Match: XP_008440641.1 (PREDICTED: aspartic proteinase-like protein 1 isoform X1 [Cucumis melo] >KAA0036277.1 aspartic proteinase-like protein 1 isoform X1 [Cucumis melo var. makuwa] >TYK12671.1 aspartic proteinase-like protein 1 isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 993.0 bits (2566), Expect = 1.0e-285
Identity = 505/576 (87.67%), Positives = 518/576 (89.93%), Query Frame = 0

Query: 1   MSLRNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSME 60
           MSLRNL++LLL+VI VHQAVSITFTSRILHRFSEEMKALRV+ STNTSVR  WPEKGSME
Sbjct: 1   MSLRNLVMLLLMVIFVHQAVSITFTSRILHRFSEEMKALRVSVSTNTSVRVSWPEKGSME 60

Query: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120
           YYQELVSGDF+RQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL
Sbjct: 61  YYQELVSGDFRRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLM 180
           DAGSDLLW+PC+CIQCAPLSASYYGSL                                 
Sbjct: 121 DAGSDLLWIPCNCIQCAPLSASYYGSL--------------------------------- 180

Query: 181 FGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSS 240
                   DKDLNEYRPSSSSTSKHISCSH+LC+SGQSCQSPKQSCPYVIDYITENTSSS
Sbjct: 181 --------DKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSS 240

Query: 241 GLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300
           GLLIQDVLHLSSGC+NSSNC IQAP+ILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS
Sbjct: 241 GLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300

Query: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 360
           LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQ TSFVPLDGKYETYIVGVEACCIENS
Sbjct: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENS 360

Query: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 420
           CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTS VSFKGYPWKYCYKISADAM
Sbjct: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSAVSFKGYPWKYCYKISADAM 420

Query: 421 PKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVF 480
           PKVPSVTLLFP NNSFVVHDPVFPIYGDQGLAGFCFA+LPADGDIGILGQNYMTGYRMVF
Sbjct: 421 PKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVF 480

Query: 481 DRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS 540
           DRDNLKLGWS ANCQDLSNEKKMPL PAKETPPNPLPANEQQSA GGHAVAPAVAGRAPS
Sbjct: 481 DRDNLKLGWSHANCQDLSNEKKMPLTPAKETPPNPLPANEQQSASGGHAVAPAVAGRAPS 534

Query: 541 KPSGAA-PCFIIPSSFYSIRLPHLLLLVLYLVSSCV 576
           KPS AA PCF IPS FYSIRLP+LLLL L LVSSCV
Sbjct: 541 KPSAAATPCF-IPSKFYSIRLPYLLLLALCLVSSCV 534

BLAST of Clc01G19160 vs. NCBI nr
Match: XP_022950779.1 (aspartic proteinase-like protein 1 isoform X1 [Cucurbita moschata] >XP_022950781.1 aspartic proteinase-like protein 1 isoform X1 [Cucurbita moschata])

HSP 1 Score: 979.5 bits (2531), Expect = 1.2e-281
Identity = 496/575 (86.26%), Positives = 513/575 (89.22%), Query Frame = 0

Query: 1   MSLRNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSME 60
           MSLRNLILLLL+VI VHQAVSITFTSR+LHRFSE+MKALRV+GST T VRA WPEKGSME
Sbjct: 1   MSLRNLILLLLMVIAVHQAVSITFTSRLLHRFSEDMKALRVSGST-TGVRASWPEKGSME 60

Query: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120
           YYQELVSGDFQRQKMKLGSRFQ LFPSEGSKTI LGNDFGWLHYTWIDIGTPSVSFLVAL
Sbjct: 61  YYQELVSGDFQRQKMKLGSRFQWLFPSEGSKTIELGNDFGWLHYTWIDIGTPSVSFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLM 180
           DAGSDLLWVPCDCIQCAPLSASYYGSL                                 
Sbjct: 121 DAGSDLLWVPCDCIQCAPLSASYYGSL--------------------------------- 180

Query: 181 FGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSS 240
                   DKDLNEYRPS SSTSKHISCSH+LC+SGQSCQSPKQSCPYVIDY TENTSSS
Sbjct: 181 --------DKDLNEYRPSKSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYTTENTSSS 240

Query: 241 GLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300
           GLLIQDVLHLSSGC+NSSNC IQAP+ILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS
Sbjct: 241 GLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300

Query: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 360
           LAKE LV NSFSLCFNEDGSGRIFFGDEGPASQQ TSFVPLD KYE YIVGVEACCI NS
Sbjct: 301 LAKEGLVPNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDEKYEAYIVGVEACCIGNS 360

Query: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 420
           CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLN TSTVSFKGYPWKYCYKIS DAM
Sbjct: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNVTSTVSFKGYPWKYCYKISTDAM 420

Query: 421 PKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVF 480
           PKVPSVTLLFP NNSFVVHDPVFPIYGDQGLAGFCF++LPADGDIGILGQNYMTGYRMVF
Sbjct: 421 PKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFSILPADGDIGILGQNYMTGYRMVF 480

Query: 481 DRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS 540
           DRDNLKLGWSRANCQDLSN+K+MP+APAKETPPNPLPANEQQSAPGGHAVAPA+AGRAPS
Sbjct: 481 DRDNLKLGWSRANCQDLSNDKEMPIAPAKETPPNPLPANEQQSAPGGHAVAPAIAGRAPS 532

Query: 541 KPSGAAPCFIIPSSFYSIRLPHLLLLVLYLVSSCV 576
           KPS AAPC I+PSSFYSIRLPHL+LLVL LVS+CV
Sbjct: 541 KPSAAAPC-IMPSSFYSIRLPHLILLVLCLVSTCV 532

BLAST of Clc01G19160 vs. NCBI nr
Match: XP_022978453.1 (aspartic proteinase-like protein 1 isoform X1 [Cucurbita maxima] >XP_022978454.1 aspartic proteinase-like protein 1 isoform X1 [Cucurbita maxima])

HSP 1 Score: 976.1 bits (2522), Expect = 1.3e-280
Identity = 494/575 (85.91%), Positives = 511/575 (88.87%), Query Frame = 0

Query: 1   MSLRNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSME 60
           MSLRNLILLLL+VI VHQ VSITFTSR+LHRFS++MKA RV+GST T VRA WPEKGSME
Sbjct: 1   MSLRNLILLLLMVIAVHQVVSITFTSRLLHRFSKDMKAFRVSGST-TGVRASWPEKGSME 60

Query: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120
           YYQELVSGDFQRQKMKLGSRFQ LFPSEGSKTI LGNDFGWLHYTWIDIGTPSVSFLVAL
Sbjct: 61  YYQELVSGDFQRQKMKLGSRFQWLFPSEGSKTIELGNDFGWLHYTWIDIGTPSVSFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLM 180
           DAGSDLLWVPCDCIQCAPLSASYYGSL                                 
Sbjct: 121 DAGSDLLWVPCDCIQCAPLSASYYGSL--------------------------------- 180

Query: 181 FGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSS 240
                   DKDLNEYRPS SSTSKHISCSH+LC+SGQSCQSPKQSCPYVIDY TENTSSS
Sbjct: 181 --------DKDLNEYRPSKSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYTTENTSSS 240

Query: 241 GLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300
           GLLIQDVLHLSSGC+NSSNC IQAP+ILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS
Sbjct: 241 GLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300

Query: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 360
           LAKE LV NSFSLCFNEDGSGRIFFGDEGPASQQ TSFVPLD KYE YIVGVEACCI NS
Sbjct: 301 LAKEGLVPNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDEKYEAYIVGVEACCIGNS 360

Query: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 420
           CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLN TSTVSFKGYPWKYCYKIS DAM
Sbjct: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNVTSTVSFKGYPWKYCYKISTDAM 420

Query: 421 PKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVF 480
           PKVPSVTLLFP NNSFVVHDPVFPIYGDQGLAGFCF++LPADGDIGILGQNYMTGYRMVF
Sbjct: 421 PKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFSILPADGDIGILGQNYMTGYRMVF 480

Query: 481 DRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS 540
           DRDNLKLGWSRANCQDLSN+K+MP+APAKETPPNPLPANEQQSAPGGHAVAPA+AGRAPS
Sbjct: 481 DRDNLKLGWSRANCQDLSNDKEMPIAPAKETPPNPLPANEQQSAPGGHAVAPAIAGRAPS 532

Query: 541 KPSGAAPCFIIPSSFYSIRLPHLLLLVLYLVSSCV 576
           KPS AAPC IIPSSFYSIRLPHL+LLVL LVS+CV
Sbjct: 541 KPSAAAPC-IIPSSFYSIRLPHLILLVLCLVSTCV 532

BLAST of Clc01G19160 vs. ExPASy Swiss-Prot
Match: Q9LX20 (Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At5g10080 PE=2 SV=1)

HSP 1 Score: 554.3 bits (1427), Expect = 1.6e-156
Identity = 294/572 (51.40%), Positives = 386/572 (67.48%), Query Frame = 0

Query: 7   ILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSMEYYQELV 66
           +L  ++ +   + ++  F+SR++HRFS+E +A     S++ S+    P K S+EYY+ L 
Sbjct: 8   LLFCVLFLATEETLASLFSSRLIHRFSDEGRASIKTPSSSDSL----PNKQSLEYYRLLA 67

Query: 67  SGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDL 126
             DF+RQ+M LG++ Q L PSEGSKTI+ GNDFGWLHYTWIDIGTPSVSFLVALD GS+L
Sbjct: 68  ESDFRRQRMNLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNL 127

Query: 127 LWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLMFGALFI 186
           LW+PC+C+QCAPL+++YY SL+                                      
Sbjct: 128 LWIPCNCVQCAPLTSTYYSSLA-------------------------------------- 187

Query: 187 MQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQD 246
              KDLNEY PSSSSTSK   CSH LC+S   C+SPK+ CPY ++Y++ NTSSSGLL++D
Sbjct: 188 --TKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVED 247

Query: 247 VLHLSSGCDN---SSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAK 306
           +LHL+   +N   + +  ++A +++GCG KQSG YL GVAPDGL GLG  EISV S L+K
Sbjct: 248 ILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSK 307

Query: 307 EELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLD-GKYETYIVGVEACCIENSCL 366
             L++NSFSLCF+E+ SGRI+FGD GP+ QQ+T F+ LD  KY  YIVGVEACCI NSCL
Sbjct: 308 AGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCL 367

Query: 367 KQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPK 426
           KQTSF   IDSG SFTYLPEE Y  + +E D+ +N TS  +F+G  W+YCY+ SA+  PK
Sbjct: 368 KQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSK-NFEGVSWEYCYESSAE--PK 427

Query: 427 VPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGD-IGILGQNYMTGYRMVFD 486
           VP++ L F  NN+FV+H P+F     QGL  FC  + P+  + IG +GQNYM GYRMVFD
Sbjct: 428 VPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSGQEGIGSIGQNYMRGYRMVFD 487

Query: 487 RDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPSK 546
           R+N+KLGWS + CQ+   E     +P   + PNPLP +EQQS  GGHAV+PA+AG+ PSK
Sbjct: 488 RENMKLGWSPSKCQEDKIEPPQ-ASPGSTSSPNPLPTDEQQSR-GGHAVSPAIAGKTPSK 526

Query: 547 PSGAAPCFIIPSSFYSI-RLPHLLLLVLYLVS 573
              ++  +    SF SI RL + LLL+ +L S
Sbjct: 548 TPSSSSSY----SFSSIMRLFNSLLLLHWLAS 526

BLAST of Clc01G19160 vs. ExPASy Swiss-Prot
Match: Q8VYV9 (Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 SV=1)

HSP 1 Score: 278.1 bits (710), Expect = 2.3e-73
Identity = 181/502 (36.06%), Positives = 261/502 (51.99%), Query Frame = 0

Query: 30  HRFSEEMKALRVAGSTNTSVRALWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFP-SE 89
           HRFS+++  +              P + S +YY+ +   D   +  +L +  Q L   S+
Sbjct: 39  HRFSDQVVGVLPGDGL--------PNRDSSKYYRVMAHRDRLIRGRRLANEDQSLVTFSD 98

Query: 90  GSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLS 149
           G++T+ + +  G+LHY  + +GTPS  F+VALD GSDL W+PCDC  C            
Sbjct: 99  GNETVRV-DALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNC------------ 158

Query: 150 EGHTWTSCVLKFKAFCSAFLRERKAWCSCSLMFGALFIMQDKDLNEYRPSSSSTSKHISC 209
                              +RE KA    SL           DLN Y P++SSTS  + C
Sbjct: 159 -------------------VRELKAPGGSSL-----------DLNIYSPNASSTSTKVPC 218

Query: 210 SHDLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCDNSSNCMIQAPIIL 269
           +  LC  G  C SP+  CPY I Y++  TSS+G+L++DVLHL S  ++ S+  I A +  
Sbjct: 219 NSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVS--NDKSSKAIPARVTF 278

Query: 270 GCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDE 329
           GCG  Q+G +  G AP+GLFGLGL +ISV S LAKE +  NSFS+CF  DG+GRI FGD+
Sbjct: 279 GCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDK 338

Query: 330 GPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENI 389
           G   Q+ T  + +   + TY + V    +  +      F A+ DSGTSFTYL + AY  I
Sbjct: 339 GSVDQRETP-LNIRQPHPTYNITVTKISVGGN-TGDLEFDAVFDSGTSFTYLTDAAYTLI 398

Query: 390 VMEF-----DKRLNTTSTVSFKGYPWKYCYKISADAMP-KVPSVTLLFPQNNSFVVHDPV 449
              F     DKR  TT +      P++YCY +S +    + P+V L     +S+ V+ P+
Sbjct: 399 SESFNSLALDKRYQTTDS----ELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPL 458

Query: 450 FPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVFDRDNLKLGWSRANCQDLSNEKK 509
             +   +    +C A++  + DI I+GQN+MTGYR+VFDR+ L LGW  ++C        
Sbjct: 459 V-VIPMKDTDVYCLAIMKIE-DISIIGQNFMTGYRVVFDREKLILGWKESDCY------- 467

Query: 510 MPLAPAKETPPNPLPANEQQSA 525
                  ET    LP+N   S+
Sbjct: 519 -----TGETSARTLPSNRSSSS 467

BLAST of Clc01G19160 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 114.0 bits (284), Expect = 5.6e-24
Identity = 123/505 (24.36%), Positives = 202/505 (40.00%), Query Frame = 0

Query: 4   RNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSMEYYQ 63
           R L +++ V + V +  S  F  +  H+F+ + K                    ++E+++
Sbjct: 5   RKLCIVVAVFVIVIEFASANFVFKAQHKFAGKKK--------------------NLEHFK 64

Query: 64  ELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAG 123
              S D +R    L S   +  P  G   +    D   L++T I +G+P   + V +D G
Sbjct: 65  ---SHDTRRHSRMLAS---IDLPLGGDSRV----DSVGLYFTKIKLGSPPKEYHVQVDTG 124

Query: 124 SDLLWVPC-DCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLMFG 183
           SD+LW+ C  C +C                 T   L F+                     
Sbjct: 125 SDILWINCKPCPKCP----------------TKTNLNFR--------------------- 184

Query: 184 ALFIMQDKDLNEYRPSSSSTSKHISCSHDLC---ESGQSCQSPKQSCPYVIDYITENTSS 243
                    L+ +  ++SSTSK + C  D C       SCQ P   C Y I Y  E+T S
Sbjct: 185 ---------LSLFDMNASSTSKKVGCDDDFCSFISQSDSCQ-PALGCSYHIVYADEST-S 244

Query: 244 SGLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSG-VAPDGLFGLGLGEISVL 303
            G  I+D+L L     +     +   ++ GCG  QSG   +G  A DG+ G G    SVL
Sbjct: 245 DGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVL 304

Query: 304 SSLAKEELVQNSFSLCF-NEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVE---- 363
           S LA     +  FS C  N  G G    G       +TT  VP    Y   ++G++    
Sbjct: 305 SQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGT 364

Query: 364 ACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCY 423
           +  +  S ++      ++DSGT+  Y P+  Y++++     R      +  + +    C+
Sbjct: 365 SLDLPRSIVRNGG--TIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVEETF---QCF 424

Query: 424 KISADAMPKVPSVTLLFPQNNSFVV--HDPVFPIYGDQGLAGFCFAVLPAD--GDIGILG 483
             S +     P V+  F  +    V  HD +F +  +    G+    L  D   ++ +LG
Sbjct: 425 SFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLG 426

Query: 484 QNYMTGYRMVFDRDNLKLGWSRANC 495
              ++   +V+D DN  +GW+  NC
Sbjct: 485 DLVLSNKLVVYDLDNEVIGWADHNC 426

BLAST of Clc01G19160 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 107.1 bits (266), Expect = 6.9e-22
Identity = 124/506 (24.51%), Positives = 201/506 (39.72%), Query Frame = 0

Query: 8   LLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSMEYYQELVS 67
           ++ +V + V Q VS  F   + H+F+ + K L                        EL S
Sbjct: 13  IVAVVFVLVIQVVSGNFVFNVTHKFAGKEKQL-----------------------SELKS 72

Query: 68  GDFQRQKMKLGSRFQLLFPSEG-SKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDL 127
            D  R    L +   +  P  G S+  ++G     L++T I +G+P   + V +D GSD+
Sbjct: 73  HDSFRHARMLAN---IDLPLGGDSRADSIG-----LYFTKIKLGSPPKEYYVQVDTGSDI 132

Query: 128 LWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLMFGALFI 187
           LWV      CAP                                    C      G    
Sbjct: 133 LWV-----NCAPCPK---------------------------------CPVKTDLGI--- 192

Query: 188 MQDKDLNEYRPSSSSTSKHISCSHDLCE---SGQSCQSPKQSCPYVIDYITENTSSSGLL 247
                L+ Y   +SSTSK++ C  D C      ++C   K+ C Y + Y  + ++S G  
Sbjct: 193 ----PLSLYDSKTSSTSKNVGCEDDFCSFIMQSETC-GAKKPCSYHVVY-GDGSTSDGDF 252

Query: 248 IQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGY-LSGVAPDGLFGLGLGEISVLSSLA 307
           I+D + L     N     +   ++ GCG  QSG    +  A DG+ G G    S++S LA
Sbjct: 253 IKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLA 312

Query: 308 KEELVQNSFSLCF-NEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVG--VEACCIE- 367
                +  FS C  N +G G    G+      +TT  VP    Y   + G  V+   I+ 
Sbjct: 313 AGGSTKRIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDL 372

Query: 368 NSCLKQTSFK--ALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKIS 427
              L  T+     +IDSGT+  YLP+  Y +++ +   +      +  + +    C+  +
Sbjct: 373 PPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQETFA---CFSFT 432

Query: 428 ADAMPKVPSVTLLFPQNNSFVV--HDPVFPIYGDQGLAGFCF-----AVLPADG-DIGIL 487
           ++     P V L F  +    V  HD +F +  D     +CF      +   DG D+ +L
Sbjct: 433 SNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDM----YCFGWQSGGMTTQDGADVILL 433

Query: 488 GQNYMTGYRMVFDRDNLKLGWSRANC 495
           G   ++   +V+D +N  +GW+  NC
Sbjct: 493 GDLVLSNKLVVYDLENEVIGWADHNC 433

BLAST of Clc01G19160 vs. ExPASy Swiss-Prot
Match: Q9LEW3 (Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1)

HSP 1 Score: 90.5 bits (223), Expect = 6.7e-17
Identity = 87/331 (26.28%), Positives = 144/331 (43.50%), Query Frame = 0

Query: 174 WCSCSLMFGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYI 233
           W  C    G+ +  ++   N   PSSSST +++SCS  +CE  +SC +   +C Y I Y 
Sbjct: 157 WTQCEPCLGSCYSQKEPKFN---PSSSSTYQNVSCSSPMCEDAESCSA--SNCVYSIVY- 216

Query: 234 TENTSSSGLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLG 293
            + + + G L ++   L       +N  +   +  GCG + + G   GVA  GL GLG G
Sbjct: 217 GDKSFTQGFLAKEKFTL-------TNSDVLEDVYFGCG-ENNQGLFDGVA--GLLGLGPG 276

Query: 294 EISVLSSLAKEELVQNSFSLC---FNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIV 353
           ++S+ +         N FS C   F  + +G + FG  G    ++  F P+      +  
Sbjct: 277 KLSLPAQTT--TTYNNIFSYCLPSFTSNSTGHLTFGSAG--ISESVKFTPISSFPSAFNY 336

Query: 354 GVEACCI----ENSCLKQTSFK---ALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVS 413
           G++   I    +   +   SF    A+IDSGT FT LP + Y  +   F +++++  + S
Sbjct: 337 GIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTS 396

Query: 414 FKGYPWKYCYKISADAMPKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADG 473
             G  +  CY  +   +  V   T+ F    S VV      I     ++  C A    D 
Sbjct: 397 GYGL-FDTCYDFT--GLDTVTYPTIAFSFAGSTVVELDGSGISLPIKISQVCLAFAGNDD 456

Query: 474 DIGILGQNYMTGYRMVFDRDNLKLGWSRANC 495
              I G    T   +V+D    ++G++   C
Sbjct: 457 LPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464

BLAST of Clc01G19160 vs. ExPASy TrEMBL
Match: A0A5D3CLH5 (Aspartic proteinase-like protein 1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G002590 PE=3 SV=1)

HSP 1 Score: 993.0 bits (2566), Expect = 5.1e-286
Identity = 505/576 (87.67%), Positives = 518/576 (89.93%), Query Frame = 0

Query: 1   MSLRNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSME 60
           MSLRNL++LLL+VI VHQAVSITFTSRILHRFSEEMKALRV+ STNTSVR  WPEKGSME
Sbjct: 1   MSLRNLVMLLLMVIFVHQAVSITFTSRILHRFSEEMKALRVSVSTNTSVRVSWPEKGSME 60

Query: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120
           YYQELVSGDF+RQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL
Sbjct: 61  YYQELVSGDFRRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLM 180
           DAGSDLLW+PC+CIQCAPLSASYYGSL                                 
Sbjct: 121 DAGSDLLWIPCNCIQCAPLSASYYGSL--------------------------------- 180

Query: 181 FGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSS 240
                   DKDLNEYRPSSSSTSKHISCSH+LC+SGQSCQSPKQSCPYVIDYITENTSSS
Sbjct: 181 --------DKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSS 240

Query: 241 GLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300
           GLLIQDVLHLSSGC+NSSNC IQAP+ILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS
Sbjct: 241 GLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300

Query: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 360
           LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQ TSFVPLDGKYETYIVGVEACCIENS
Sbjct: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENS 360

Query: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 420
           CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTS VSFKGYPWKYCYKISADAM
Sbjct: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSAVSFKGYPWKYCYKISADAM 420

Query: 421 PKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVF 480
           PKVPSVTLLFP NNSFVVHDPVFPIYGDQGLAGFCFA+LPADGDIGILGQNYMTGYRMVF
Sbjct: 421 PKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVF 480

Query: 481 DRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS 540
           DRDNLKLGWS ANCQDLSNEKKMPL PAKETPPNPLPANEQQSA GGHAVAPAVAGRAPS
Sbjct: 481 DRDNLKLGWSHANCQDLSNEKKMPLTPAKETPPNPLPANEQQSASGGHAVAPAVAGRAPS 534

Query: 541 KPSGAA-PCFIIPSSFYSIRLPHLLLLVLYLVSSCV 576
           KPS AA PCF IPS FYSIRLP+LLLL L LVSSCV
Sbjct: 541 KPSAAATPCF-IPSKFYSIRLPYLLLLALCLVSSCV 534

BLAST of Clc01G19160 vs. ExPASy TrEMBL
Match: A0A1S3B270 (aspartic proteinase-like protein 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103484997 PE=3 SV=1)

HSP 1 Score: 993.0 bits (2566), Expect = 5.1e-286
Identity = 505/576 (87.67%), Positives = 518/576 (89.93%), Query Frame = 0

Query: 1   MSLRNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSME 60
           MSLRNL++LLL+VI VHQAVSITFTSRILHRFSEEMKALRV+ STNTSVR  WPEKGSME
Sbjct: 1   MSLRNLVMLLLMVIFVHQAVSITFTSRILHRFSEEMKALRVSVSTNTSVRVSWPEKGSME 60

Query: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120
           YYQELVSGDF+RQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL
Sbjct: 61  YYQELVSGDFRRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLM 180
           DAGSDLLW+PC+CIQCAPLSASYYGSL                                 
Sbjct: 121 DAGSDLLWIPCNCIQCAPLSASYYGSL--------------------------------- 180

Query: 181 FGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSS 240
                   DKDLNEYRPSSSSTSKHISCSH+LC+SGQSCQSPKQSCPYVIDYITENTSSS
Sbjct: 181 --------DKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSS 240

Query: 241 GLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300
           GLLIQDVLHLSSGC+NSSNC IQAP+ILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS
Sbjct: 241 GLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300

Query: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 360
           LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQ TSFVPLDGKYETYIVGVEACCIENS
Sbjct: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDGKYETYIVGVEACCIENS 360

Query: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 420
           CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTS VSFKGYPWKYCYKISADAM
Sbjct: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSAVSFKGYPWKYCYKISADAM 420

Query: 421 PKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVF 480
           PKVPSVTLLFP NNSFVVHDPVFPIYGDQGLAGFCFA+LPADGDIGILGQNYMTGYRMVF
Sbjct: 421 PKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVF 480

Query: 481 DRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS 540
           DRDNLKLGWS ANCQDLSNEKKMPL PAKETPPNPLPANEQQSA GGHAVAPAVAGRAPS
Sbjct: 481 DRDNLKLGWSHANCQDLSNEKKMPLTPAKETPPNPLPANEQQSASGGHAVAPAVAGRAPS 534

Query: 541 KPSGAA-PCFIIPSSFYSIRLPHLLLLVLYLVSSCV 576
           KPS AA PCF IPS FYSIRLP+LLLL L LVSSCV
Sbjct: 541 KPSAAATPCF-IPSKFYSIRLPYLLLLALCLVSSCV 534

BLAST of Clc01G19160 vs. ExPASy TrEMBL
Match: A0A6J1GFS3 (aspartic proteinase-like protein 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111453783 PE=3 SV=1)

HSP 1 Score: 979.5 bits (2531), Expect = 5.8e-282
Identity = 496/575 (86.26%), Positives = 513/575 (89.22%), Query Frame = 0

Query: 1   MSLRNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSME 60
           MSLRNLILLLL+VI VHQAVSITFTSR+LHRFSE+MKALRV+GST T VRA WPEKGSME
Sbjct: 1   MSLRNLILLLLMVIAVHQAVSITFTSRLLHRFSEDMKALRVSGST-TGVRASWPEKGSME 60

Query: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120
           YYQELVSGDFQRQKMKLGSRFQ LFPSEGSKTI LGNDFGWLHYTWIDIGTPSVSFLVAL
Sbjct: 61  YYQELVSGDFQRQKMKLGSRFQWLFPSEGSKTIELGNDFGWLHYTWIDIGTPSVSFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLM 180
           DAGSDLLWVPCDCIQCAPLSASYYGSL                                 
Sbjct: 121 DAGSDLLWVPCDCIQCAPLSASYYGSL--------------------------------- 180

Query: 181 FGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSS 240
                   DKDLNEYRPS SSTSKHISCSH+LC+SGQSCQSPKQSCPYVIDY TENTSSS
Sbjct: 181 --------DKDLNEYRPSKSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYTTENTSSS 240

Query: 241 GLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300
           GLLIQDVLHLSSGC+NSSNC IQAP+ILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS
Sbjct: 241 GLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300

Query: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 360
           LAKE LV NSFSLCFNEDGSGRIFFGDEGPASQQ TSFVPLD KYE YIVGVEACCI NS
Sbjct: 301 LAKEGLVPNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDEKYEAYIVGVEACCIGNS 360

Query: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 420
           CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLN TSTVSFKGYPWKYCYKIS DAM
Sbjct: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNVTSTVSFKGYPWKYCYKISTDAM 420

Query: 421 PKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVF 480
           PKVPSVTLLFP NNSFVVHDPVFPIYGDQGLAGFCF++LPADGDIGILGQNYMTGYRMVF
Sbjct: 421 PKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFSILPADGDIGILGQNYMTGYRMVF 480

Query: 481 DRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS 540
           DRDNLKLGWSRANCQDLSN+K+MP+APAKETPPNPLPANEQQSAPGGHAVAPA+AGRAPS
Sbjct: 481 DRDNLKLGWSRANCQDLSNDKEMPIAPAKETPPNPLPANEQQSAPGGHAVAPAIAGRAPS 532

Query: 541 KPSGAAPCFIIPSSFYSIRLPHLLLLVLYLVSSCV 576
           KPS AAPC I+PSSFYSIRLPHL+LLVL LVS+CV
Sbjct: 541 KPSAAAPC-IMPSSFYSIRLPHLILLVLCLVSTCV 532

BLAST of Clc01G19160 vs. ExPASy TrEMBL
Match: A0A6J1IU36 (aspartic proteinase-like protein 1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111478432 PE=3 SV=1)

HSP 1 Score: 976.1 bits (2522), Expect = 6.4e-281
Identity = 494/575 (85.91%), Positives = 511/575 (88.87%), Query Frame = 0

Query: 1   MSLRNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSME 60
           MSLRNLILLLL+VI VHQ VSITFTSR+LHRFS++MKA RV+GST T VRA WPEKGSME
Sbjct: 1   MSLRNLILLLLMVIAVHQVVSITFTSRLLHRFSKDMKAFRVSGST-TGVRASWPEKGSME 60

Query: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120
           YYQELVSGDFQRQKMKLGSRFQ LFPSEGSKTI LGNDFGWLHYTWIDIGTPSVSFLVAL
Sbjct: 61  YYQELVSGDFQRQKMKLGSRFQWLFPSEGSKTIELGNDFGWLHYTWIDIGTPSVSFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLM 180
           DAGSDLLWVPCDCIQCAPLSASYYGSL                                 
Sbjct: 121 DAGSDLLWVPCDCIQCAPLSASYYGSL--------------------------------- 180

Query: 181 FGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSS 240
                   DKDLNEYRPS SSTSKHISCSH+LC+SGQSCQSPKQSCPYVIDY TENTSSS
Sbjct: 181 --------DKDLNEYRPSKSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYTTENTSSS 240

Query: 241 GLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300
           GLLIQDVLHLSSGC+NSSNC IQAP+ILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS
Sbjct: 241 GLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300

Query: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 360
           LAKE LV NSFSLCFNEDGSGRIFFGDEGPASQQ TSFVPLD KYE YIVGVEACCI NS
Sbjct: 301 LAKEGLVPNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVPLDEKYEAYIVGVEACCIGNS 360

Query: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 420
           CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLN TSTVSFKGYPWKYCYKIS DAM
Sbjct: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNVTSTVSFKGYPWKYCYKISTDAM 420

Query: 421 PKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVF 480
           PKVPSVTLLFP NNSFVVHDPVFPIYGDQGLAGFCF++LPADGDIGILGQNYMTGYRMVF
Sbjct: 421 PKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFSILPADGDIGILGQNYMTGYRMVF 480

Query: 481 DRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS 540
           DRDNLKLGWSRANCQDLSN+K+MP+APAKETPPNPLPANEQQSAPGGHAVAPA+AGRAPS
Sbjct: 481 DRDNLKLGWSRANCQDLSNDKEMPIAPAKETPPNPLPANEQQSAPGGHAVAPAIAGRAPS 532

Query: 541 KPSGAAPCFIIPSSFYSIRLPHLLLLVLYLVSSCV 576
           KPS AAPC IIPSSFYSIRLPHL+LLVL LVS+CV
Sbjct: 541 KPSAAAPC-IIPSSFYSIRLPHLILLVLCLVSTCV 532

BLAST of Clc01G19160 vs. ExPASy TrEMBL
Match: A0A6J1HE55 (aspartic proteinase-like protein 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111463362 PE=3 SV=1)

HSP 1 Score: 970.7 bits (2508), Expect = 2.7e-279
Identity = 488/575 (84.87%), Positives = 508/575 (88.35%), Query Frame = 0

Query: 1   MSLRNLILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSME 60
           MSLRNL+L+LL++I  HQA+SI FTSRILHRFSEEMKALRV+ STNTSVR  WPEKGSME
Sbjct: 1   MSLRNLVLVLLMMISAHQAMSIAFTSRILHRFSEEMKALRVSRSTNTSVRVSWPEKGSME 60

Query: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVAL 120
           YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHY WIDIGTPSVSFLVAL
Sbjct: 61  YYQELVSGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYAWIDIGTPSVSFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLM 180
           DAGSDLLW+PCDCIQCAPLSASYYGSL                                 
Sbjct: 121 DAGSDLLWIPCDCIQCAPLSASYYGSL--------------------------------- 180

Query: 181 FGALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSS 240
                   DKDLNEYRPSSSSTSKHISCSH+LCESGQSCQSPKQSCPYVIDY+TENTSSS
Sbjct: 181 --------DKDLNEYRPSSSSTSKHISCSHNLCESGQSCQSPKQSCPYVIDYLTENTSSS 240

Query: 241 GLLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300
           GLLIQDVLHLSSGC+NSSNC IQAP++LGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS
Sbjct: 241 GLLIQDVLHLSSGCENSSNCTIQAPVVLGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSS 300

Query: 301 LAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 360
           LAKE LVQNSFSLCFNEDGSGRIFFGDEGPASQQ TSFV LDGKYE YIVGVEACCI NS
Sbjct: 301 LAKEGLVQNSFSLCFNEDGSGRIFFGDEGPASQQMTSFVLLDGKYEAYIVGVEACCIGNS 360

Query: 361 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 420
           CL+QTSFKALIDSGTSFTYLPEE YEN+VMEFDKRLNTTSTV+FKGYPWKYCYKISADAM
Sbjct: 361 CLEQTSFKALIDSGTSFTYLPEEVYENVVMEFDKRLNTTSTVTFKGYPWKYCYKISADAM 420

Query: 421 PKVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVF 480
           PKVPSVTLLFP NNSFVVHDPVFPIYGDQGLAGFCFAVLP DGDIGILGQNYMTGYRMVF
Sbjct: 421 PKVPSVTLLFPLNNSFVVHDPVFPIYGDQGLAGFCFAVLPTDGDIGILGQNYMTGYRMVF 480

Query: 481 DRDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPS 540
           DR+NLKL WSRANCQDLSNEKKMPLAP+KETPPNPLPANEQQS   GHAVAPAVAGRAPS
Sbjct: 481 DRENLKLSWSRANCQDLSNEKKMPLAPSKETPPNPLPANEQQSVSEGHAVAPAVAGRAPS 533

Query: 541 KPSGAAPCFIIPSSFYSIRLPHLLLLVLYLVSSCV 576
           KPS A PCF IPS FY++RL HLLLLV YLVS+CV
Sbjct: 541 KPSAATPCF-IPSCFYTVRLLHLLLLVFYLVSTCV 533

BLAST of Clc01G19160 vs. TAIR 10
Match: AT5G10080.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 554.3 bits (1427), Expect = 1.2e-157
Identity = 294/572 (51.40%), Positives = 386/572 (67.48%), Query Frame = 0

Query: 7   ILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSMEYYQELV 66
           +L  ++ +   + ++  F+SR++HRFS+E +A     S++ S+    P K S+EYY+ L 
Sbjct: 8   LLFCVLFLATEETLASLFSSRLIHRFSDEGRASIKTPSSSDSL----PNKQSLEYYRLLA 67

Query: 67  SGDFQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDL 126
             DF+RQ+M LG++ Q L PSEGSKTI+ GNDFGWLHYTWIDIGTPSVSFLVALD GS+L
Sbjct: 68  ESDFRRQRMNLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNL 127

Query: 127 LWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLMFGALFI 186
           LW+PC+C+QCAPL+++YY SL+                                      
Sbjct: 128 LWIPCNCVQCAPLTSTYYSSLA-------------------------------------- 187

Query: 187 MQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQD 246
              KDLNEY PSSSSTSK   CSH LC+S   C+SPK+ CPY ++Y++ NTSSSGLL++D
Sbjct: 188 --TKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVED 247

Query: 247 VLHLSSGCDN---SSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAK 306
           +LHL+   +N   + +  ++A +++GCG KQSG YL GVAPDGL GLG  EISV S L+K
Sbjct: 248 ILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSK 307

Query: 307 EELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLD-GKYETYIVGVEACCIENSCL 366
             L++NSFSLCF+E+ SGRI+FGD GP+ QQ+T F+ LD  KY  YIVGVEACCI NSCL
Sbjct: 308 AGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCL 367

Query: 367 KQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAMPK 426
           KQTSF   IDSG SFTYLPEE Y  + +E D+ +N TS  +F+G  W+YCY+ SA+  PK
Sbjct: 368 KQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSK-NFEGVSWEYCYESSAE--PK 427

Query: 427 VPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGD-IGILGQNYMTGYRMVFD 486
           VP++ L F  NN+FV+H P+F     QGL  FC  + P+  + IG +GQNYM GYRMVFD
Sbjct: 428 VPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSGQEGIGSIGQNYMRGYRMVFD 487

Query: 487 RDNLKLGWSRANCQDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGRAPSK 546
           R+N+KLGWS + CQ+   E     +P   + PNPLP +EQQS  GGHAV+PA+AG+ PSK
Sbjct: 488 RENMKLGWSPSKCQEDKIEPPQ-ASPGSTSSPNPLPTDEQQSR-GGHAVSPAIAGKTPSK 526

Query: 547 PSGAAPCFIIPSSFYSI-RLPHLLLLVLYLVS 573
              ++  +    SF SI RL + LLL+ +L S
Sbjct: 548 TPSSSSSY----SFSSIMRLFNSLLLLHWLAS 526

BLAST of Clc01G19160 vs. TAIR 10
Match: AT4G35880.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 298.5 bits (763), Expect = 1.1e-80
Identity = 181/502 (36.06%), Positives = 276/502 (54.98%), Query Frame = 0

Query: 7   ILLLLVVIGVHQAVSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSMEYYQELV 66
           ++ +L+++         FT  + HRFS+E+K      S +T   A +P KGS EY+  LV
Sbjct: 12  LIPILMLLSFGSCNGRIFTFEMHHRFSDEVK----QWSDSTGRFAKFPPKGSFEYFNALV 71

Query: 67  SGDFQRQKMKLG-----SRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALD 126
             D+  +  +L      S   L F S+G+ T  + +  G+LHYT + +GTP + F+VALD
Sbjct: 72  LRDWLIRGRRLSESESESESSLTF-SDGNSTSRI-SSLGFLHYTTVKLGTPGMRFMVALD 131

Query: 127 AGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLMF 186
            GSDL WVPCDC +CAP         +EG T+ S                          
Sbjct: 132 TGSDLFWVPCDCGKCAP---------TEGATYAS-------------------------- 191

Query: 187 GALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSSG 246
                  + +L+ Y P  S+T+K ++C++ LC     C     +CPY++ Y++  TS+SG
Sbjct: 192 -------EFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSG 251

Query: 247 LLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSL 306
           +L++DV+HL++   N     ++A +  GCG  QSG +L   AP+GLFGLG+ +ISV S L
Sbjct: 252 ILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVL 311

Query: 307 AKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSC 366
           A+E LV +SFS+CF  DG GRI FGD+G + Q+ T F  L+  +  Y + V    +  + 
Sbjct: 312 AREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRV-GTT 371

Query: 367 LKQTSFKALIDSGTSFTYLPEEAYENIVMEF-----DKRLNTTSTVSFKGYPWKYCYKIS 426
           L    F AL D+GTSFTYL +  Y  +   F     DKR +  S +     P++YCY +S
Sbjct: 372 LIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRI-----PFEYCYDMS 431

Query: 427 ADAMPK-VPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADGDIGILGQNYMTG 486
            DA    +PS++L    N+ F ++DP+  +   +G   +C A++ +  ++ I+GQNYMTG
Sbjct: 432 NDANASLIPSLSLTMKGNSHFTINDPII-VISTEGELVYCLAIVKS-SELNIIGQNYMTG 454

Query: 487 YRMVFDRDNLKLGWSRANCQDL 498
           YR+VFDR+ L L W + +C D+
Sbjct: 492 YRVVFDREKLVLAWKKFDCYDI 454

BLAST of Clc01G19160 vs. TAIR 10
Match: AT2G17760.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 278.1 bits (710), Expect = 1.6e-74
Identity = 181/502 (36.06%), Positives = 261/502 (51.99%), Query Frame = 0

Query: 30  HRFSEEMKALRVAGSTNTSVRALWPEKGSMEYYQELVSGDFQRQKMKLGSRFQLLFP-SE 89
           HRFS+++  +              P + S +YY+ +   D   +  +L +  Q L   S+
Sbjct: 39  HRFSDQVVGVLPGDGL--------PNRDSSKYYRVMAHRDRLIRGRRLANEDQSLVTFSD 98

Query: 90  GSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASYYGSLS 149
           G++T+ + +  G+LHY  + +GTPS  F+VALD GSDL W+PCDC  C            
Sbjct: 99  GNETVRV-DALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNC------------ 158

Query: 150 EGHTWTSCVLKFKAFCSAFLRERKAWCSCSLMFGALFIMQDKDLNEYRPSSSSTSKHISC 209
                              +RE KA    SL           DLN Y P++SSTS  + C
Sbjct: 159 -------------------VRELKAPGGSSL-----------DLNIYSPNASSTSTKVPC 218

Query: 210 SHDLCESGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCDNSSNCMIQAPIIL 269
           +  LC  G  C SP+  CPY I Y++  TSS+G+L++DVLHL S  ++ S+  I A +  
Sbjct: 219 NSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVS--NDKSSKAIPARVTF 278

Query: 270 GCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIFFGDE 329
           GCG  Q+G +  G AP+GLFGLGL +ISV S LAKE +  NSFS+CF  DG+GRI FGD+
Sbjct: 279 GCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDK 338

Query: 330 GPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFKALIDSGTSFTYLPEEAYENI 389
           G   Q+ T  + +   + TY + V    +  +      F A+ DSGTSFTYL + AY  I
Sbjct: 339 GSVDQRETP-LNIRQPHPTYNITVTKISVGGN-TGDLEFDAVFDSGTSFTYLTDAAYTLI 398

Query: 390 VMEF-----DKRLNTTSTVSFKGYPWKYCYKISADAMP-KVPSVTLLFPQNNSFVVHDPV 449
              F     DKR  TT +      P++YCY +S +    + P+V L     +S+ V+ P+
Sbjct: 399 SESFNSLALDKRYQTTDS----ELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPL 458

Query: 450 FPIYGDQGLAGFCFAVLPADGDIGILGQNYMTGYRMVFDRDNLKLGWSRANCQDLSNEKK 509
             +   +    +C A++  + DI I+GQN+MTGYR+VFDR+ L LGW  ++C        
Sbjct: 459 V-VIPMKDTDVYCLAIMKIE-DISIIGQNFMTGYRVVFDREKLILGWKESDCY------- 467

Query: 510 MPLAPAKETPPNPLPANEQQSA 525
                  ET    LP+N   S+
Sbjct: 519 -----TGETSARTLPSNRSSSS 467

BLAST of Clc01G19160 vs. TAIR 10
Match: AT3G51330.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 257.3 bits (656), Expect = 2.9e-68
Identity = 185/552 (33.51%), Positives = 276/552 (50.00%), Query Frame = 0

Query: 6   LILLLLVVIGVHQA-VSITFTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSMEYYQE 65
           L+ LL+V  G+ +   S  F+  + H FS+ +K        +  +  L PEKGS+EY++ 
Sbjct: 10  LLSLLVVCWGLERCEASGKFSFEVHHMFSDRVK-------QSLGLDDLVPEKGSLEYFKV 69

Query: 66  LVSGDFQRQKMKLGSRFQ---LLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALD 125
           L   D   +   L S  +   + F   G++TI++ +  G+LHY  + +GTP+  FLVALD
Sbjct: 70  LAQRDRLIRGRGLASNNEETPITF-MRGNRTISI-DLLGFLHYANVSVGTPATWFLVALD 129

Query: 126 AGSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLMF 185
            GSDL W+PC+C                    ++C+   K                    
Sbjct: 130 TGSDLFWLPCNC-------------------GSTCIRDLK-------------------- 189

Query: 186 GALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSSG 245
             + + Q + LN Y P++SSTS  I CS D C     C SP  SCPY I Y++++T ++G
Sbjct: 190 -EVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTTG 249

Query: 246 LLIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSL 305
            L +DVLHL +  ++     ++A I LGCG  Q+G   S  A +GL GLGL + SV S L
Sbjct: 250 TLFEDVLHLVT--EDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSIL 309

Query: 306 AKEELVQNSFSLCFNE--DGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIEN 365
           AK ++  NSFS+CF    D  GRI FGD+G   Q  T  +P +    TY V V    +  
Sbjct: 310 AKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPS-PTYAVSVTEVSVGG 369

Query: 366 SCLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADA 425
             +      AL D+GTSFT+L E  Y  I   FD  +           P+++CY +S + 
Sbjct: 370 DAV-GVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNK 429

Query: 426 MPKV-PSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPA-DGDIGILGQNYMTGYR 485
              + P V + F   +   + +P+F ++ +   A +C  +L + D  I I+GQN+M+GYR
Sbjct: 430 TTILFPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGYR 489

Query: 486 MVFDRDNLKLGWSRANC-QDLSNEKKMPLAPAKETP----PNPLPANEQQSAPGGHAVAP 545
           +VFDR+ + LGW R++C +D S E   P  P  E P      PLP+      P   A  P
Sbjct: 490 IVFDRERMILGWKRSDCFEDESLESTTPPPPETEAPSPSASTPLPS---LLPPPAAATPP 505

BLAST of Clc01G19160 vs. TAIR 10
Match: AT3G51350.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 224.6 bits (571), Expect = 2.1e-58
Identity = 167/542 (30.81%), Positives = 256/542 (47.23%), Query Frame = 0

Query: 6   LILLLLVVIGVHQAVSIT--FTSRILHRFSEEMKALRVAGSTNTSVRALWPEKGSMEYYQ 65
           ++L +LVV    +    T  F   + H FS+ +K     G        L PE+GS+EY++
Sbjct: 9   VLLSVLVVCWGFERCEATGKFGFEVHHIFSDSVKQSLGLGD-------LVPEQGSLEYFK 68

Query: 66  ELVSGD-FQRQKMKLGSRFQLLFPSEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDA 125
            L   D   R +    +  +     +G          G L+Y  + +GTP  SFLVALD 
Sbjct: 69  VLAHRDRLIRGRGLASNNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDT 128

Query: 126 GSDLLWVPCDCIQCAPLSASYYGSLSEGHTWTSCVLKFKAFCSAFLRERKAWCSCSLMFG 185
           GSDL W+PC+C                    T+C+   +                     
Sbjct: 129 GSDLFWLPCNC-------------------GTTCIRDLE--------------------- 188

Query: 186 ALFIMQDKDLNEYRPSSSSTSKHISCSHDLCESGQSCQSPKQSCPYVIDYITENTSSSGL 245
            + + Q   LN Y P++S+TS  I CS   C   + C SP   CPY I Y + +T + G 
Sbjct: 189 DIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-SNSTGTKGT 248

Query: 246 LIQDVLHLSSGCDNSSNCMIQAPIILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLA 305
           L+QDVLHL++  +N +   ++A + LGCG KQ+G +    + +G+ GLG+   SV S LA
Sbjct: 249 LLQDVLHLATEDENLT--PVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLA 308

Query: 306 KEELVQNSFSLCFNE--DGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENS 365
           K  +  NSFS+CF       GRI FGD G   Q+ T F+ +      Y V +    +   
Sbjct: 309 KANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISV-APSTAYGVNISGVSVAGD 368

Query: 366 CLKQTSFKALIDSGTSFTYLPEEAYENIVMEFDKRLNTTSTVSFKGYPWKYCYKISADAM 425
            +    F A  D+G+SFT+L E AY  +   FD+ +           P+++CY +S +A 
Sbjct: 369 PVDIRLF-AKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLSPNAT 428

Query: 426 P-KVPSVTLLFPQNNSFVVHDPVFPIYGDQGLAGFCFAVLPADG-DIGILGQNYMTGYRM 485
             + P V + F   +  ++++P F     +G   +C  VL + G  I ++GQN++ GYR+
Sbjct: 429 TIQFPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGYRI 488

Query: 486 VFDRDNLKLGWSRANC-QDLSNEKKMPLAPAKETPPNPLPANEQQSAPGGHAVAPAVAGR 540
           VFDR+ + LGW ++ C +D S E   P  P  E P   +      SAP   ++ P V+  
Sbjct: 489 VFDRERMILGWKQSLCFEDESLESTTPPPPEVEAPAPSV------SAPPPRSLPPTVSAT 492

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038882807.1	1.4e-290	88.70	aspartic proteinase-like protein 1 isoform X1 [Benincasa hispida]
XP_004143563.2	3.3e-287	87.83	aspartic proteinase-like protein 1 isoform X1 [Cucumis sativus] >KAE8647563.1 hy...
XP_008440641.1	1.0e-285	87.67	PREDICTED: aspartic proteinase-like protein 1 isoform X1 [Cucumis melo] >KAA0036...
XP_022950779.1	1.2e-281	86.26	aspartic proteinase-like protein 1 isoform X1 [Cucurbita moschata] >XP_022950781...
XP_022978453.1	1.3e-280	85.91	aspartic proteinase-like protein 1 isoform X1 [Cucurbita maxima] >XP_022978454.1...

Match Name	E-value	Identity	Description
Q9LX20	1.6e-156	51.40	Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At5g10080 ...
Q8VYV9	2.3e-73	36.06	Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 ...
Q9S9K4	5.6e-24	24.36	Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2
Q4V3D2	6.9e-22	24.51	Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1
Q9LEW3	6.7e-17	26.28	Aspartyl protease AED1 OS=Arabidopsis thaliana OX=3702 GN=AED1 PE=2 SV=1

Match Name	E-value	Identity	Description
A0A5D3CLH5	5.1e-286	87.67	Aspartic proteinase-like protein 1 isoform X1 OS=Cucumis melo var. makuwa OX=119...
A0A1S3B270	5.1e-286	87.67	aspartic proteinase-like protein 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC1034...
A0A6J1GFS3	5.8e-282	86.26	aspartic proteinase-like protein 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=L...
A0A6J1IU36	6.4e-281	85.91	aspartic proteinase-like protein 1 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC...
A0A6J1HE55	2.7e-279	84.87	aspartic proteinase-like protein 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=L...

Match Name	E-value	Identity	Description
AT5G10080.1	1.2e-157	51.40	Eukaryotic aspartyl protease family protein
AT4G35880.1	1.1e-80	36.06	Eukaryotic aspartyl protease family protein
AT2G17760.1	1.6e-74	36.06	Eukaryotic aspartyl protease family protein
AT3G51330.1	2.9e-68	33.51	Eukaryotic aspartyl protease family protein
AT3G51350.1	2.1e-58	30.81	Eukaryotic aspartyl protease family protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001461	Aspartic peptidase A1 family	PRINTS	PR00792	PEPSIN	coord: 466..481, score: 36.46; coord: 109..129, score: 49.89; coord: 369..380, score: 43.68
IPR001461	Aspartic peptidase A1 family	PANTHER	PTHR13683	ASPARTYL PROTEASES	coord: 9..511
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 92..327, e-value: 4.7E-37, score: 129.8
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 330..496, e-value: 8.6E-27, score: 96.0
IPR021109	Aspartic peptidase domain superfamily	SUPERFAMILY	50630	Acid proteases	coord: 98..497
IPR032861	Xylanase inhibitor, N-terminal	PFAM	PF14543	TAXi_N	coord: 104..327, e-value: 1.4E-34, score: 119.9
IPR032799	Xylanase inhibitor, C-terminal	PFAM	PF14541	TAXi_C	coord: 363..490, e-value: 5.2E-13, score: 49.1
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 504..538
None	No IPR available	PANTHER	PTHR13683:SF743	ASPARTIC PROTEINASE-LIKE PROTEIN 1	coord: 9..511
IPR001969	Aspartic peptidase, active site	PROSITE	PS00141	ASP_PROTEASE	coord: 369..380
IPR033121	Peptidase family A1 domain	PROSITE	PS51767	PEPTIDASE_A1	coord: 103..490, score: 37.891792
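
The `coord:` fields above give start and end positions on the protein. As a rough illustration, the Python sketch below parses a few of those fields, computes each span length, and checks that the PROSITE active-site hit falls inside the Peptidase A1 domain hit. It assumes the coordinates are 1-based and inclusive, which this page does not state; the helper names `parse_coord`, `span_length` and `contains` are hypothetical.

```python
import re

def parse_coord(field):
    """Turn a string such as 'coord: 104..327' into a (start, end) tuple."""
    m = re.fullmatch(r"coord:\s*(\d+)\.\.(\d+)", field.strip())
    if m is None:
        raise ValueError(f"unrecognised coord field: {field!r}")
    return int(m.group(1)), int(m.group(2))

def span_length(start, end):
    # Assumption: positions are 1-based and inclusive, so both ends count.
    return end - start + 1

def contains(outer, inner):
    """True if the inner (start, end) span lies entirely within the outer one."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

# Coordinates copied from the InterPro table above.
hits = {
    "PF14543 TAXi_N": "coord: 104..327",
    "PF14541 TAXi_C": "coord: 363..490",
    "PS51767 PEPTIDASE_A1": "coord: 103..490",
    "PS00141 ASP_PROTEASE": "coord: 369..380",
}
spans = {name: parse_coord(field) for name, field in hits.items()}
lengths = {name: span_length(*span) for name, span in spans.items()}

for name, length in lengths.items():
    print(f"{name}: {spans[name][0]}..{spans[name][1]} ({length} residues)")

active_site_in_domain = contains(
    spans["PS51767 PEPTIDASE_A1"], spans["PS00141 ASP_PROTEASE"]
)
print("active site inside Peptidase A1 domain:", active_site_in_domain)
```

Under that assumption, the two Pfam hits (104..327 and 363..490) both sit within the 103..490 Peptidase A1 domain reported by PROSITE.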

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Clc01G19160.2	Clc01G19160.2	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006508	proteolysis
cellular_component	GO:0016021	integral component of membrane
molecular_function	GO:0004190	aspartic-type endopeptidase activity