CmoCh04G024170 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh04G024170
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: aspartic proteinase-like protein 1
Location: Cmo_Chr04: 17967740 .. 17975273 (+)
RNA-Seq Expression: CmoCh04G024170
Synteny: CmoCh04G024170
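The Location field can be turned into a genomic span with a few lines of code. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF3 convention; the page itself does not state this), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

# Location string as shown in the Overview table.
location = "Cmo_Chr04: 17967740 .. 17975273 (+)"

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location(location)

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 7,534 bp on the plus strand of Cmo_Chr04, introns and UTRs included.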
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGCGCAGCGTACGCACAAACAGCGCATTATAGCACTCTCACAGTCTCACCCAATTTGATTTATCGTCGTCACCATTCATGATTCACGCTTCAATCTCTACTCCGTTTGCAGAAGGCTCGGTTCCTTGCCTTTTCCCCTAATCCGGATGCTCTCTCGCCGTTGCAGAAGTTCATTCTTACGAAATCTGATCTTCATCTTCAGTTTCCATGGCGAATTTCTTGGTAGTTTTGCTCTTTGTAGCTTGTTTCTTTGTGGACAGTTCTGTTGCGCTTAGGCTCTCGTCGAGGCTCATTCATCGATTCTCCGATGAGGCGAAAGCGCTTTGGAAGTCCAGGAACGGTAATGCGTCTGGAAAATTTTGGCCGAGGAGGAATAGCTTGAAGTATTTTGAAACGCTTAAGGATTATGACTTGAAGAGGCGGAGACTGAAGATCGGATCGAAGTACGAGGTGATTTTTCCTTCTGAAGGAAACGAGGTCGTGTTCTTTGGGAACGAATTTGATTGGTTAGTATGCTTGAGCTTAACTGTTCCAGGACGTGATCGCTTTATTGACTAGCTAACTTCTAAGTTCTACCAATTGTGTAGTTTTCTTTACTTTCACAGCAACGATAATGTTTAATTTTATCAGCATTTCCTTCACCATTAAAAATTTGAAGCAGCATGGAGGGAGTATTTTCTTTACCTTGTTTGTTAGAGCGATTAAAGAATTCAGTGGAACTAGTTGTCCAGGTTACATTACACATGGATTGATATAGGAACACCGAGTGTTTCGTTTCTCGTTGCATTGGATGCTGGAAGTGATCTGCTCTGGGTTCCGTGTGATTGCATTCAATGTGCTCCCTTGTCTGCAAGTCATTATAGCTCACTGGTATGACTGCAATCACCTACATTAATTTCTTCATTTCTTTAATGCCTCTTTTAACATATTCCAAGTCGTATTTGGGATGTGAGCTGCTGAACTAGTTCATGAACTGGATAACTAGTCTCCTTTTCCTCCAAATTCAAATAAGAAGTTAAATTACTGTGCAAGAATCCTAATCATCGAATCATAAAAGATTGAGGTTTACTGATGTGGATATCGAAACATTCATATGTTCCGTATCCTTCAAAGTTGAAACCTATCGTTAGTCTGTTGTAAAGCGACAATCTCTCGTTCTATCAGTTAAATTTGCCAAATAACTTGCGAAATGTTACTTCTAAAGAGAGATCTGTATTCAAAATCCTTCCACTCAGATATATCATTAACGGAATAAAAGAAAAAAAAACAAAAAGACGAACTAATGTACATCAAAAGTAAGCTCTTATCTAAATCTATACTGATAATTAAATGGACACCGCATATATATACGATTGTTTTCAATTAATTACCTCTTTCTTTTATGTTCCGATCTATCTTAGTATGTCTCATTACTTGTCTAGTAGTCAAATAAGAACTGTGACAGGTTTTTCAGTTAATTTCACCAATTGACTGGCAGGTGCTCTCTTCTGTTTGGGGCTGTCTATGATGCAGGATAGGGATCTGAGTGCGTACAATCCAGCTTTATCAAACACCAGCCAGTACCTTTCCTGCAGTCATCAATTATGTGCTTGGAGCACAACTTGCAAAAGTCCTGATGATCCCTGCACTTATAAAAGAGATTATTACACAGATAATACATCAACTTCTGGATTTATGATTGAAGATAAATTGCATTTGGCATCTTTCAGCAAACATGGTACACAACGTCTTTTGCAGGCCTCAGTTGTATTAGGGTGGGCTCTTACTGCGTCACATACTTTGATGCTTTTCTTTGATGTTTTCTTCTTCTTGAACAATGTTGAAGTTGAGATATTGGAATGCCACCTGAATCTTTGCCTGTGATCTCCCCCTCTCCCTCTTGATGCAGCTGTGGTAGGAAACAGAGTGGTTACTATTTGGATGGGGCTGCTCCTGATGGTGTTATGGGGTTGGGTCCTGGAAACATTTCAGTGCCAACCTTATTGGCAAAAGCAGGATTGG
TTAGAAATACATTCTCGCTTTGTTTTGATAATAATGGTTCTGGGAGAATCCTCTTTGGGGACAATGGTCCTGCCACCCAGCAAACAACACAATTTTTGCCATTATTTGGTGAATTGTATGTGATCTAGCAACTAAATTATTAGCTAAGCCAATTCTAACCTTTATTTTCTCTTGAAGTTATCTATTATCACCAACACCATTAAGCCATCCATTCCTTGTCTCAGTGATGCCTATTTTGTCGAGGTGGAGTCCTTTTGTGTTGGGAGTTCCTGTCTGCAGAAAAGTGGATTCCACGCATTGGTGGACAGTGGCTCGTCTTTTACATATCTTCCTACAGAAATCTATAAAAAGATAGTCTTTGAGGTGATACTGGAATATAATGATGTGTGATATTTCAATTTTGTTTGGCCTAAAGTCTCGAATTACTTAATGCAGTTTGACAAACAAGTAAAATTAAATGCTACCAGGATAATTCTCCAAGAATTTCCCTGGAATTACTGCTATAATTCCAGGTAGGTGCTTGAAAAAGGTAAAATAAATATTGTATTTTCCTCATTTTTTAAGGGTTGACAGACAGTTCTTTTATTGTGGAACAGTTCGCTGGAGTCCTCTTATATTCCTAGTATGAAACTCGTGTTTCCTCTGAATCAAAGCTTTATACATGATCCTGTGTATACCCTCCCTGACAGCCAAGTAAGTTAGACATCTTGCTCTTTCTTCCCAAGTAATTTTACCCAAATGATTATTCCCAGTGGGCCCTTAAAAATATTGATTTCTTTCCTCAATTTGTTTGTTTGGGACTGGGCAAGCTAGAAATGTTTTTGAACACAGTTTGAGGACACGAAGTCATGTTTGGGCAAGTGCTGAAATATTTGTTCCCTTCTCTGGCTTTTCTTTTATTGCTATGGATGGAGGCTATTGCATTCAAAATACCCCTTTTCTAGAATGTCAACAACTTGACTTTAAAGGACTTCATATCCTTTTTGCCTCGGTGGTTACGATTGATCTGTTTTGGTCGAAGTATGAACTTTATAAAATTTAATGTGCAGGGATATAAACTGTTTTGTTTAACCTTAGAAGAGACAGATGATGATTACGGTGTAATTGGACGTAAGTAGGACTTTTAATTCTGTGGTTGTATATGCTACTTTTTCAATGATTTTCTATAATATAGTTCCTTTTCTTCTCCTACCACACGATTACACATAGGAGCTTGGATTCGTAAATGCATTTAATGATAAAAACTTGTGATATCTTCTTCCATTTGTGCTACAGAAAACTTGATGGTGGGTTACCGGCTGGTTTTTGACAGAGAAAATCTTCAGTTGGGTTGGTCCAAGTCCAAATGTAAGCATGTTTTCATTATTATTTACACTTTGAATTTTCTTCTAACAGTGTGCATTAGCTGATCTTCCATCACTCGAAAGATTATCTGGAAGAAAGAGTTTGGTTAAGGCAAAAATGAAATCTTGCTTTTTGAATTTTCTCAGCAACTTCTGATTTCAAGGATATTCAATTCTTCCTGTGTCGAGATTTAAGTTTCTGTGCATTGCTTATATACTTCTGGCCTTTTTATTTATTTATTTTATATCAGCATGGAAGAAGAACTTATATTGAGGCTTGAGCATGATTCAATTCCCTTTTCTTCATTCGACATTTATCACTGTATAAAATTGCAGGCCTAGATATCAACCATGGCGAAGCAGGCCATGCCAAACCACCTTCAAATGACGGATCACCAACTGCATTACCAACCGATGGACATCTAAGCCCTCCAAATAGGCAAGAAATTGCACCCACTGCTGCTAGGGCGTTTTCCAAATCGTCCCTAACTGCACCCCATTTTTCTCTCTTCAGCTACTGTTGTTTGAGGTTATTCTTGTTGCTTTTTGAGTTTGTTGAGTCCATGCTGTAAATATTGCCTTTTCCGACTGGCATTTCAATTACTTAATAATTTACCCCTTCCTGTAAGTTGCTGTATCATATAGGAAAAAATAAGT
TACATTTTTTTTACTGGGATTTGAGTATTATATAGACGTTTTTTTCAATTCAAGCATCAGTGCAAGTGCCAGTTCTACGTGTAGATTATTTGTAAACTCTATCATCCCTTATTTATTTTTATTTTTTGAAGTGGCATGCTTAAACATGTTAACAAATTACCAAGGTTCTTCTGAATATGTTGGAAAGATTTAAGTTTTATGTTTGTCCAGAATTTCAGCACAGGCACAAAAGAAAGTGAACTTGTTGGATGGCAGCTGACCAAGTCCAGTACCTCTCCTACTAATTGCTATGCTGCTGTATAATGAGACAGGCACTTACCTCCTCTATTTGTGTTATTTACCGTTGAAATTTCTACAGAAATCTCTTTGAAAGTGATGAAAACTATCTGACAGGTTCAATCAACTGGAGATGGGAATCTTTTTTGGATAACATGAAAGTTTTTAGTACGCAGGTACTTGCTCTTTTTCTTCACACCATGTCTTTCATCTTTTATTCAAGAATTATTTGAATTCTTATTGAAATAAAAAAAAATTAAAAAAAATTAAAAAATTAAAACTACTTTGCATTTCCAAAACAAAACTAAAATTTGAATACATGTTTATATTTATAAAGTAGATCACAGAACAAATAAGCTCCCTGTGTGAAAACTAAAATTCAAATAGTTATCAAAATGAACCCTAGTTTTGATTAAGATTCACTTGTTGATTATTTCATAACTGCCACCATAGTTATGTTTAGTGTAATGATAATAACTTGGTGGTTATGTTTAGTGTAATGATAATAACTTGGTGGTGATTACGTATTAGTTACTTTAAAACTACCATCAATCCTTTTTAGTGTGATGATAACAATTCCTTTTTAGTGTGATGATAACGATTTGGTGGTTGTCGCATGTTGGTTGTTTCATAGCTACCACCATAGTTGCCCTAAGGTGATGAAAAGGATTTTGTGGAAAGTTTACAGGGAGTATGCCTAGGTAACCATTAGTCTCTCAATTTTACCTTAACTATATAATATATGTCGTAGATGGGTATTTTATTGAGTATTTTAAATATTCATCGTTTTCCTAAAATGATTTTTAAGTGAGGGCAAATTTATGGCACTAGAGTTACAAATAGACTAGAGAAGCCAATTATGGAGCAACTCTAATCCTTATGTTTTCTTCATTTCAAACACTTTCTTTTGAGTTAGTTTATTGATTTAATTGGCAGTAGAAAACTCTTAGAAATTAAAATGATGGTGCTTATTGAAAATTGCATTATTGAATTATTTAGTTAAATTACAAGCATTCTAAAAATCATTTAAATGGATATCTAGAAGATGATTTTACTTTGCTATATATGGGTATTCTTTCATTTGTTTTTAAATTACAAAACAATTTATAAGTAAGCTAAACTTAGACCAGTACGTTTACAAAAGAAAATCGAATCGTTACACTTGGGTACGCATGATTGCACTAAGTAAAGTAGCGCCTATAACAATGGACACTACACATCATGATTTTTATTCCCACATGAAAAACATTTCATCATGTAAGGACCTAGTAAAGGAAAAAACACAACATTTTGTTAACATGTGATTTGTAAACTAGAGAAGACAAATGCATATCATAAAGTTCATATGAGGCTAACAGCATAGGACAAATTAGCGCCTATATTTGCATATACATCTTTGTCTAGAACCAATTGGGTCAAGTCATTCCTTTTGCACAGATTCATATATGAGGGTAGCATTTTGGGCGTAAGAACTAATACTTTGTATGAAGGATATGCTGATTAAGATACAAGGTAGCATTTTAAAGGACAGACACATTTCTCAAGAAACAACAAAAAGTGGTACTAAAACACCAACCTCAATATTGAGTGTTGTCAAAAACCCACGGTTGGTGCAAGAAAAGCTCTCGCCCATTCATTTGCACTGCTCCTGAGGCTATATCCTCCGCACGCTTCCTTGCTGTTTCATCTCTTTTAGCAGATGCCTCGGCATCCTGCAAATAAGA
GGAGATAGTTGGAGATAAATGTCGAACAGCAACACAACTCATTTACGTCACTGATGCAGAAATTGCCTCATAATTGACTAGTGGAATGGTTCAGGTTCTTTGCCGTTCAGTGATTAGTTAACTTGAACGTACAAGAACCAACCTGTGTTTCTGCCACTGACAATCAATTGTGAAGCATGTGAGTTTAGAACGTTAATAGCCTTATAATTTCAAGATAGCTAAGCTCAAGCCTACCGCTAGCAGATATTGTTACGTATCGTATCGTTGTCAGCCTCACAATTTTAAAATGTGTCTGCTAGGGAGAGATTTCCACACTCTTATAGGGAATGTTTTGTTCCCCTCTCCAATAGATGTAGGATCTCACAATCCACCCCTCTTAGGATCATAGAGTCCTCGCAGGCATACTGTCCAGTGTCTGGCTCTGATACCATTTGTAATAGCTCAAGCCTACCGCTAATAGATATTGTTTGCTTTGGTCCGTTACATATCGTCAATCGTTCTGTCAACCTAAGTATAGCTCAATCGACTAGAAGCTCTGATACCATTTGTAATAGCTCAAGCCTACCGCTAATAGATATTGTTTGCTTTGGTCCGTTACATATCGTCAATCGTTCTGTCAACCTAAGTATAGCTCAATCGACTAGAAGCTCTGATACCATTTGTAATAGCTCAAGCCTACCGCTAATAGATATTGTTTGCTTTGGTCCGTTACATATCGTCAATCGTTCTGTCAACCTAAGTATAGCTCAATCGACTAGAAGCTCTGATACCATTTGTAATAGCTCAAGCCTACCGCTAATAGATATTGTTTGCTTTGGTCCGTTACATATCGTCAATCGTTCTGTCAACCTAAGTATAGCTCAATCGACTAGAAGCTCTGATACCATTTGTAATAGCTCAAGCCTACCGCTAATAGATATTGTTTGCTTTGGTCCGTTACATATCGTCAATCGTTCTGTCAACCTAAGTATAGCTCAATCGACTAGAAGTCCAGTCTCAGCCTCAGGTCTATTTGGCACGAGCCAGATAAAGTACATTCATACATGTTACAATCCCAAAATTTAAGCCTCATCTTCGCACAACAGAGGAAACTATTCCTGAATACCATCCTAATTGAACGAATCCTTGAGATTTAAAATTGTGATAATTTCAATTTTCAAACAACTCGCCCAATCTATGCAACCCTAATTCTCGAATTTCACTGATTTGGCAGGAAATACAAGAAATCGGCACCTAAAAGAACTCATACAGATCAATCGCACATCAAAGAGCAGAAATGGAAAGCGGCCAAAGAAAAGCTTGGAAGAGATTGATGAAGAAGAAGAAGAAGGGAAAGTACCTTGTGGCGCTTCCACGATAGGAACTGAGCCTCCGTGATGGGATTCGCGCTCGGATCATGAGACGAGAGCTTCTCTCGCTCCACAACCGCTTCTTCTGCCCCCATTTTGTTGAGCGTGGATCAGATTCCTGGATTTGATAGCCCCTCTCCAATGGATTGCTGCTTCAACCTCTGTTATATTCGAGATGGAGAGAG

mRNA sequence

CGCGCAGCGTACGCACAAACAGCGCATTATAGCACTCTCACAGTCTCACCCAATTTGATTTATCGTCGTCACCATTCATGATTCACGCTTCAATCTCTACTCCGTTTGCAGAAGGCTCGGTTCCTTGCCTTTTCCCCTAATCCGGATGCTCTCTCGCCGTTGCAGAAGTTCATTCTTACGAAATCTGATCTTCATCTTCAGTTTCCATGGCGAATTTCTTGGTAGTTTTGCTCTTTGTAGCTTGTTTCTTTGTGGACAGTTCTGTTGCGCTTAGGCTCTCGTCGAGGCTCATTCATCGATTCTCCGATGAGGCGAAAGCGCTTTGGAAGTCCAGGAACGGTAATGCGTCTGGAAAATTTTGGCCGAGGAGGAATAGCTTGAAGTATTTTGAAACGCTTAAGGATTATGACTTGAAGAGGCGGAGACTGAAGATCGGATCGAAGTACGAGGTGATTTTTCCTTCTGAAGGAAACGAGGTCGTGTTCTTTGGGAACGAATTTGATTGGTTACATTACACATGGATTGATATAGGAACACCGAGTGTTTCGTTTCTCGTTGCATTGGATGCTGGAAGTGATCTGCTCTGGGTTCCGTGTGATTGCATTCAATGTGCTCCCTTGTCTGCAAGTCATTATAGCTCACTGGATAGGGATCTGAGTGCGTACAATCCAGCTTTATCAAACACCAGCCAGTACCTTTCCTGCAGTCATCAATTATGTGCTTGGAGCACAACTTGCAAAAGTCCTGATGATCCCTGCACTTATAAAAGAGATTATTACACAGATAATACATCAACTTCTGGATTTATGATTGAAGATAAATTGCATTTGGCATCTTTCAGCAAACATGGTACACAACGTCTTTTGCAGGCCTCAGTTGTATTAGGCTGTGGTAGGAAACAGAGTGGTTACTATTTGGATGGGGCTGCTCCTGATGGTGTTATGGGGTTGGGTCCTGGAAACATTTCAGTGCCAACCTTATTGGCAAAAGCAGGATTGGTTAGAAATACATTCTCGCTTTGTTTTGATAATAATGGTTCTGGGAGAATCCTCTTTGGGGACAATGGTCCTGCCACCCAGCAAACAACACAATTTTTGCCATTATTTGGTGAATTTGATGCCTATTTTGTCGAGGTGGAGTCCTTTTGTGTTGGGAGTTCCTGTCTGCAGAAAAGTGGATTCCACGCATTGGTGGACAGTGGCTCGTCTTTTACATATCTTCCTACAGAAATCTATAAAAAGATAGTCTTTGAGTTTGACAAACAAGTAAAATTAAATGCTACCAGGATAATTCTCCAAGAATTTCCCTGGAATTACTGCTATAATTCCAGTTCGCTGGAGTCCTCTTATATTCCTAGTATGAAACTCGTGTTTCCTCTGAATCAAAGCTTTATACATGATCCTGTGTATACCCTCCCTGACAGCCAAGGATATAAACTGTTTTGTTTAACCTTAGAAGAGACAGATGATGATTACGGTGTAATTGGACAAAACTTGATGGTGGGTTACCGGCTGGTTTTTGACAGAGAAAATCTTCAGTTGGGTTGGTCCAAGTCCAAATGCCTAGATATCAACCATGGCGAAGCAGGCCATGCCAAACCACCTTCAAATGACGGATCACCAACTGCATTACCAACCGATGGACATCTAAGCCCTCCAAATAGGCAAGAAATTGCACCCACTGCTGCTAGGGCGTTTTCCAAATCGTCCCTAACTGCACCCCATTTTTCTCTCTTCAGCTACTGTTGTTTGAGGTTATTCTTGTTGCTTTTTGAGTTTGTTGAGTCCATGCTGTAAATATTGCCTTTTCCGACTGGCATTTCAATTACTTAATAATTTACCCCTTCCTGTAAGTTGCTGTATCATATAGGAAAAAATAAGTTACATTTTTTTTACTGGGATTTGAGTATTATATAGACGTTTTTTTCAATTCAAGCATCAGTGCAAGTGCCAGTTCTACGTGTAGATTATTTGTAAACTCTATCATCCCTTATTTATTTT
TATTTTTTGAAGTGGCATGCTTAAACATGTTAACAAATTACCAAGGTTCTTCTGAATATGTTGGAAAGATTTAAGTTTTATGTTTGTCCAGAATTTCAGCACAGGCACAAAAGAAAGTGAACTTGTTGGATGGCAGCTGACCAAGTCCAGTACCTCTCCTACTAATTGCTATGCTGCTGTATAATGAGACAGGTTCAATCAACTGGAGATGGGAATCTTTTTTGGATAACATGAAAGTTTTTAGTACGCAGGAAATACAAGAAATCGGCACCTAAAAGAACTCATACAGATCAATCGCACATCAAAGAGCAGAAATGGAAAGCGGCCAAAGAAAAGCTTGGAAGAGATTGATGAAGAAGAAGAAGAAGGGAAAGTACCTTGTGGCGCTTCCACGATAGGAACTGAGCCTCCGTGATGGGATTCGCGCTCGGATCATGAGACGAGAGCTTCTCTCGCTCCACAACCGCTTCTTCTGCCCCCATTTTGTTGAGCGTGGATCAGATTCCTGGATTTGATAGCCCCTCTCCAATGGATTGCTGCTTCAACCTCTGTTATATTCGAGATGGAGAGAG

Coding sequence (CDS)

ATGGCGAATTTCTTGGTAGTTTTGCTCTTTGTAGCTTGTTTCTTTGTGGACAGTTCTGTTGCGCTTAGGCTCTCGTCGAGGCTCATTCATCGATTCTCCGATGAGGCGAAAGCGCTTTGGAAGTCCAGGAACGGTAATGCGTCTGGAAAATTTTGGCCGAGGAGGAATAGCTTGAAGTATTTTGAAACGCTTAAGGATTATGACTTGAAGAGGCGGAGACTGAAGATCGGATCGAAGTACGAGGTGATTTTTCCTTCTGAAGGAAACGAGGTCGTGTTCTTTGGGAACGAATTTGATTGGTTACATTACACATGGATTGATATAGGAACACCGAGTGTTTCGTTTCTCGTTGCATTGGATGCTGGAAGTGATCTGCTCTGGGTTCCGTGTGATTGCATTCAATGTGCTCCCTTGTCTGCAAGTCATTATAGCTCACTGGATAGGGATCTGAGTGCGTACAATCCAGCTTTATCAAACACCAGCCAGTACCTTTCCTGCAGTCATCAATTATGTGCTTGGAGCACAACTTGCAAAAGTCCTGATGATCCCTGCACTTATAAAAGAGATTATTACACAGATAATACATCAACTTCTGGATTTATGATTGAAGATAAATTGCATTTGGCATCTTTCAGCAAACATGGTACACAACGTCTTTTGCAGGCCTCAGTTGTATTAGGCTGTGGTAGGAAACAGAGTGGTTACTATTTGGATGGGGCTGCTCCTGATGGTGTTATGGGGTTGGGTCCTGGAAACATTTCAGTGCCAACCTTATTGGCAAAAGCAGGATTGGTTAGAAATACATTCTCGCTTTGTTTTGATAATAATGGTTCTGGGAGAATCCTCTTTGGGGACAATGGTCCTGCCACCCAGCAAACAACACAATTTTTGCCATTATTTGGTGAATTTGATGCCTATTTTGTCGAGGTGGAGTCCTTTTGTGTTGGGAGTTCCTGTCTGCAGAAAAGTGGATTCCACGCATTGGTGGACAGTGGCTCGTCTTTTACATATCTTCCTACAGAAATCTATAAAAAGATAGTCTTTGAGTTTGACAAACAAGTAAAATTAAATGCTACCAGGATAATTCTCCAAGAATTTCCCTGGAATTACTGCTATAATTCCAGTTCGCTGGAGTCCTCTTATATTCCTAGTATGAAACTCGTGTTTCCTCTGAATCAAAGCTTTATACATGATCCTGTGTATACCCTCCCTGACAGCCAAGGATATAAACTGTTTTGTTTAACCTTAGAAGAGACAGATGATGATTACGGTGTAATTGGACAAAACTTGATGGTGGGTTACCGGCTGGTTTTTGACAGAGAAAATCTTCAGTTGGGTTGGTCCAAGTCCAAATGCCTAGATATCAACCATGGCGAAGCAGGCCATGCCAAACCACCTTCAAATGACGGATCACCAACTGCATTACCAACCGATGGACATCTAAGCCCTCCAAATAGGCAAGAAATTGCACCCACTGCTGCTAGGGCGTTTTCCAAATCGTCCCTAACTGCACCCCATTTTTCTCTCTTCAGCTACTGTTGTTTGAGGTTATTCTTGTTGCTTTTTGAGTTTGTTGAGTCCATGCTGTAA

Protein sequence

MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLFGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETDDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML
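The relationship between the coding sequence and the protein sequence above can be spot-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; `translate` is an illustrative helper, and only the first ten codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
cds_start = "ATGGCGAATTTCTTGGTAGTTTTGCTCTTT"
protein_start = translate(cds_start)
print(protein_start)  # MANFLVVLLF
```

The result matches the first ten residues of the protein sequence. Passing the complete CDS to the same function should reproduce the full polypeptide.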
Homology
BLAST of CmoCh04G024170 vs. ExPASy Swiss-Prot
Match: Q9LX20 (Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At5g10080 PE=2 SV=1)

HSP 1 Score: 461.1 bits (1185), Expect = 1.7e-128
Identity = 250/530 (47.17%), Positives = 346/530 (65.28%), Query Frame = 0

Query: 8   LLFVACFF-VDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETLKD 67
           LLF   F   + ++A   SSRLIHRFSDE +A  K+ + + S    P + SL+Y+  L +
Sbjct: 8   LLFCVLFLATEETLASLFSSRLIHRFSDEGRASIKTPSSSDS---LPNKQSLEYYRLLAE 67

Query: 68  YDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLL 127
            D +R+R+ +G+K + + PSEG++ +  GN+F WLHYTWIDIGTPSVSFLVALD GS+LL
Sbjct: 68  SDFRRQRMNLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNLL 127

Query: 128 WVPCDCIQCAPLSASHYSSL-DRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCT 187
           W+PC+C+QCAPL++++YSSL  +DL+ YNP+ S+TS+   CSH+LC  ++ C+SP + C 
Sbjct: 128 WIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCP 187

Query: 188 YKRDYYTDNTSTSGFMIEDKLHLASFSKH---GTQRLLQASVVLGCGRKQSGYYLDGAAP 247
           Y  +Y + NTS+SG ++ED LHL   + +        ++A VV+GCG+KQSG YLDG AP
Sbjct: 188 YTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAP 247

Query: 248 DGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPL-FG 307
           DG+MGLGP  ISVP+ L+KAGL+RN+FSLCFD   SGRI FGD GP+ QQ+T FL L   
Sbjct: 248 DGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNN 307

Query: 308 EFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRI 367
           ++  Y V VE+ C+G+SCL+++ F   +DSG SFTYLP EIY+K+  E D+ +  NAT  
Sbjct: 308 KYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHI--NATSK 367

Query: 368 ILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTLEET- 427
             +   W YCY SS+     +P++KL F  N +F IH P++    SQG   FCL +  + 
Sbjct: 368 NFEGVSWEYCYESSA--EPKVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSG 427

Query: 428 DDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDG 487
            +  G IGQN M GYR+VFDREN++LGWS SKC + +  E   A P S   SP  LPTD 
Sbjct: 428 QEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE-DKIEPPQASPGST-SSPNPLPTDE 487

Query: 488 HLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML 530
             S          A +  SK+  ++  +S  S   L   LLL  ++ S++
Sbjct: 488 QQSRGGHAVSPAIAGKTPSKTPSSSSSYSFSSIMRLFNSLLLLHWLASLM 528
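The percentages on each HSP summary line are the identical and positive counts divided by the alignment length. The sketch below parses one such line and recomputes them; `parse_hsp_stats` is a hypothetical helper, and the pattern also accepts the "Postives" spelling that some reports use for this label.

```python
import re

# Matches the identity/positives part of an HSP summary line.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, aligned, _, positives, _, _ = m.groups()
    identical, aligned, positives = int(identical), int(aligned), int(positives)
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        "pct_identity": round(100 * identical / aligned, 2),
        "pct_positives": round(100 * positives / aligned, 2),
    }

# Summary line of the Q9LX20 hit.
stats = parse_hsp_stats(
    "Identity = 250/530 (47.17%), Positives = 346/530 (65.28%), Query Frame = 0"
)
print(stats)
```

For the Q9LX20 hit this gives 47.17% identity and 65.28% positives over 530 aligned columns, in agreement with the reported values.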

BLAST of CmoCh04G024170 vs. ExPASy Swiss-Prot
Match: Q8VYV9 (Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 SV=1)

HSP 1 Score: 280.8 bits (717), Expect = 3.2e-74
Identity = 186/493 (37.73%), Positives = 266/493 (53.96%), Query Frame = 0

Query: 30  HRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETL--KDYDLKRRRLKIGSKYEVIFPSE 89
           HRFSD+         G   G   P R+S KY+  +  +D  ++ RRL    +  V F S+
Sbjct: 39  HRFSDQVV-------GVLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSLVTF-SD 98

Query: 90  GNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCA-PLSASHYSSL 149
           GNE V   +   +LHY  + +GTPS  F+VALD GSDL W+PCDC  C   L A   SSL
Sbjct: 99  GNETVRV-DALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSL 158

Query: 150 DRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCTYKRDYYTDNTSTSGFMIEDKL 209
             DL+ Y+P  S+TS  + C+  LC     C SP+  C Y+  Y ++ TS++G ++ED L
Sbjct: 159 --DLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVL 218

Query: 210 HLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGAAPDGVMGLGPGNISVPTLLAKAGLVR 269
           HL S  K  + + + A V  GCG+ Q+G + DGAAP+G+ GLG  +ISVP++LAK G+  
Sbjct: 219 HLVSNDK--SSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAA 278

Query: 270 NTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLFGEFDAYFVEVESFCVGSSCLQKSGFH 329
           N+FS+CF N+G+GRI FGD G   Q+ T  L +      Y + V    VG +      F 
Sbjct: 279 NSFSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNTGDLE-FD 338

Query: 330 ALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRIILQEFPWNYCYNSSSLESSY-IPSM 389
           A+ DSG+SFTYL    Y  I   F+        +    E P+ YCY  S  + S+  P++
Sbjct: 339 AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 398

Query: 390 KLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTLEETDDDYGVIGQNLMVGYRLVFDRENLQ 449
            L      S+ ++ P+  +P  +   ++CL + +  +D  +IGQN M GYR+VFDRE L 
Sbjct: 399 NLTMKGGSSYPVYHPLVVIP-MKDTDVYCLAIMKI-EDISIIGQNFMTGYRVVFDREKLI 458

Query: 450 LGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGHLSP-----PNRQEIAPTAARAFSK 509
           LGW +S C     GE      PSN  S +A P      P     P+++    T + A+S 
Sbjct: 459 LGWKESDCYT---GETSARTLPSNRSSSSARPPASSFDPEATNIPSQRPNTSTTSAAYSL 511

Query: 510 S-SLTAPHFSLFS 512
           S SL+   FS+ +
Sbjct: 519 SISLSLFFFSILA 511

BLAST of CmoCh04G024170 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 2.7e-28
Identity = 104/371 (28.03%), Positives = 179/371 (48.25%), Query Frame = 0

Query: 101 LHYTWIDIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSASHYSSLDRDLSAYNPALSN 160
           L++T I +G+P   + V +D GSD+LWV C  C +C P+     + L   LS Y+   S+
Sbjct: 77  LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC-PVK----TDLGIPLSLYDSKTSS 136

Query: 161 TSQYLSCSHQLCAW---STTCKSPDDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGT 220
           TS+ + C    C++   S TC     PC+Y    Y D +++ G  I+D + L   + +  
Sbjct: 137 TSKNVGCEDDFCSFIMQSETC-GAKKPCSY-HVVYGDGSTSDGDFIKDNITLEQVTGNLR 196

Query: 221 QRLLQASVVLGCGRKQSGYY-LDGAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDN 280
              L   VV GCG+ QSG      +A DG+MG G  N S+ + LA  G  +  FS C DN
Sbjct: 197 TAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDN 256

Query: 281 -NGSGRILFGDNGPATQQTTQFLPLFGEFDAYF----VEVESFCVGSSCLQKSG-FHALV 340
            NG G    G+      +TT  +P    ++       V+ +   +  S    +G    ++
Sbjct: 257 MNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTII 316

Query: 341 DSGSSFTYLPTEIYKKIVFEF--DKQVKLNATRIILQEFPWNYCYNSSSLESSYIPSMKL 400
           DSG++  YLP  +Y  ++ +    +QVKL+   ++ + F    C++ +S      P + L
Sbjct: 317 DSGTTLAYLPQNLYNSLIEKITAKQQVKLH---MVQETFA---CFSFTSNTDKAFPVVNL 376

Query: 401 VFPLN---QSFIHDPVYTLPDSQ---GYKLFCLTLEETDDDYGVIGQNLMVGYRLVFDRE 453
            F  +     + HD +++L +     G++   +T ++   D  ++G  ++    +V+D E
Sbjct: 377 HFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQD-GADVILLGDLVLSNKLVVYDLE 433

BLAST of CmoCh04G024170 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 124.4 bits (311), Expect = 3.8e-27
Identity = 122/441 (27.66%), Positives = 205/441 (46.49%), Query Frame = 0

Query: 47  ASGKFWPRRNSLKYFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWI 106
           A  KF  ++ +L++F   K +D +R    + S   +  P  G+  V    +   L++T I
Sbjct: 29  AQHKFAGKKKNLEHF---KSHDTRRHSRMLAS---IDLPLGGDSRV----DSVGLYFTKI 88

Query: 107 DIGTPSVSFLVALDAGSDLLWVPC-DCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLS 166
            +G+P   + V +D GSD+LW+ C  C +C        ++L+  LS ++   S+TS+ + 
Sbjct: 89  KLGSPPKEYHVQVDTGSDILWINCKPCPKCPT-----KTNLNFRLSLFDMNASSTSKKVG 148

Query: 167 CSHQLCAW---STTCKSPDDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQA 226
           C    C++   S +C+ P   C+Y    Y D +++ G  I D L L   +       L  
Sbjct: 149 CDDDFCSFISQSDSCQ-PALGCSY-HIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQ 208

Query: 227 SVVLGCGRKQSGYYLDG-AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDN-NGSGR 286
            VV GCG  QSG   +G +A DGVMG G  N SV + LA  G  +  FS C DN  G G 
Sbjct: 209 EVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGI 268

Query: 287 ILFGDNGPATQQTTQFLPLFGEFDAYF----VEVESFCVGSSCLQKSGFHALVDSGSSFT 346
              G       +TT  +P    ++       V+  S  +  S ++  G   +VDSG++  
Sbjct: 269 FAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGG--TIVDSGTTLA 328

Query: 347 YLPTEIYKKIVFEF--DKQVKLNATRIILQEFPWNYCYNSSSLESSYIP-------SMKL 406
           Y P  +Y  ++      + VKL+      Q F +     S++++ ++ P       S+KL
Sbjct: 329 YFPKVLYDSLIETILARQPVKLHIVEETFQCFSF-----STNVDEAFPPVSFEFEDSVKL 388

Query: 407 -VFPLNQSFIHDPVYTLPDSQ---GYKLFCLTLEETDDDYGVIGQNLMVGYRLVFDRENL 462
            V+P      HD ++TL +     G++   LT +E  +   ++G  ++    +V+D +N 
Sbjct: 389 TVYP------HDYLFTLEEELYCFGWQAGGLTTDERSEVI-LLGDLVLSNKLVVYDLDNE 438

BLAST of CmoCh04G024170 vs. ExPASy Swiss-Prot
Match: Q9M9A8 (Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1)

HSP 1 Score: 115.2 bits (287), Expect = 2.3e-24
Identity = 112/409 (27.38%), Positives = 177/409 (43.28%), Query Frame = 0

Query: 83  IFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVS--FLVALDAGSDLLWVPCD--CIQCAPL 142
           IFP  GN         D L+YT I +G P     + + +D GS+L W+ CD  C  CA  
Sbjct: 190 IFPVGGNVYP------DGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKG 249

Query: 143 SASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCTYKRDYYTDNTSTS 202
           +   Y     +L   + A     Q     +QL      C      C Y+ + Y D++ + 
Sbjct: 250 ANQLYKPRKDNLVRSSEAFCVEVQ----RNQLTEHCENCHQ----CDYEIE-YADHSYSM 309

Query: 203 GFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDG-AAPDGVMGLGPGNISVPT 262
           G + +DK HL    K     L ++ +V GCG  Q G  L+     DG++GL    IS+P+
Sbjct: 310 GVLTKDKFHL----KLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPS 369

Query: 263 LLAKAGLVRNTFSLCF--DNNGSGRILFGDNGPATQQTTQFLPLF--GEFDAYFVEVESF 322
            LA  G++ N    C   D NG G I  G +   +   T ++P+      DAY ++V   
Sbjct: 370 QLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMT-WVPMLHDSRLDAYQMQVTKM 429

Query: 323 CVGSSCLQKSGFH-----ALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRIILQE--- 382
             G   L   G +      L D+GSS+TY P + Y ++V    +   L  TR    E   
Sbjct: 430 SYGQGMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLP 489

Query: 383 FPW----NYCYNS-SSLESSYIP-----SMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLT 442
             W    N+ ++S S ++  + P       K +    +  I    Y +  ++G    CL 
Sbjct: 490 ICWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGN--VCLG 549

Query: 443 LEE----TDDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCL---DINH 458
           + +     D    ++G   M G+ +V+D    ++GW KS C+   +I+H
Sbjct: 550 ILDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVRPREIDH 576

BLAST of CmoCh04G024170 vs. ExPASy TrEMBL
Match: A0A6J1HK50 (aspartic proteinase-like protein 1 OS=Cucurbita moschata OX=3662 GN=LOC111463738 PE=3 SV=1)

HSP 1 Score: 1082.0 bits (2797), Expect = 0.0e+00
Identity = 529/529 (100.00%), Positives = 529/529 (100.00%), Query Frame = 0

Query: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60
           MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKY
Sbjct: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60

Query: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120
           FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD
Sbjct: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120

Query: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180
           AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP
Sbjct: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180

Query: 181 DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240
           DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA
Sbjct: 181 DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240

Query: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300
           APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF
Sbjct: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300

Query: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360
           GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR
Sbjct: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360

Query: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420
           IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD
Sbjct: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420

Query: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH 480
           DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH
Sbjct: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH 480

Query: 481 LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML 530
           LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML
Sbjct: 481 LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML 529

BLAST of CmoCh04G024170 vs. ExPASy TrEMBL
Match: A0A6J1CJM3 (aspartic proteinase-like protein 1 OS=Momordica charantia OX=3673 GN=LOC111011771 PE=3 SV=1)

HSP 1 Score: 866.7 bits (2238), Expect = 5.0e-248
Identity = 424/509 (83.30%), Positives = 462/509 (90.77%), Query Frame = 0

Query: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60
           MAN +VVLL +ACF  DSSVA   SS+LIHRFS+EAKALW+SR+GN S KFWPRR+SLKY
Sbjct: 1   MANCVVVLLLMACFLADSSVAHTFSSKLIHRFSEEAKALWESRSGNVSRKFWPRRDSLKY 60

Query: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120
           FE L D DLKRRRLKIGSK E++ PSEG+EV+FFGNEFDWLHYTWIDIGTPSVSFLVALD
Sbjct: 61  FELLMDNDLKRRRLKIGSKDEMLLPSEGSEVLFFGNEFDWLHYTWIDIGTPSVSFLVALD 120

Query: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180
            GSDLLWVPCDCIQCAPLSAS+YS LDRDLS YNPALS+TS++LSC HQLCAWS TCK P
Sbjct: 121 VGSDLLWVPCDCIQCAPLSASYYSVLDRDLSEYNPALSSTSKHLSCDHQLCAWSATCKRP 180

Query: 181 DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240
           D+PCTYKRDYY+DNTS+SGFMIEDKLHLASFSKH  + L+QASVVLGCGRKQSG YLDGA
Sbjct: 181 DEPCTYKRDYYSDNTSSSGFMIEDKLHLASFSKHAKRSLVQASVVLGCGRKQSGGYLDGA 240

Query: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300
           APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFD NGSGRILFGDNG ATQQTT+FLPLF
Sbjct: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDGNGSGRILFGDNGSATQQTTRFLPLF 300

Query: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360
           GEFDAYFV VESFCVGSSCLQKSGF ALVDSGSSFTYLP E+Y++IVFEFDKQVKLNATR
Sbjct: 301 GEFDAYFVGVESFCVGSSCLQKSGFQALVDSGSSFTYLPAEVYRRIVFEFDKQVKLNATR 360

Query: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420
           I LQEFPW+YCYN SSLES+ IPSMKLVFPLNQSFIHDPVY LP +QGYK+FCLTLEETD
Sbjct: 361 ITLQEFPWSYCYNVSSLESANIPSMKLVFPLNQSFIHDPVYVLPVNQGYKIFCLTLEETD 420

Query: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH 480
           DDYG+IGQNLMVGYR+VFDRENL+LGWSKSKCLDIN  ++ +AKPPSNDGSP A+P++  
Sbjct: 421 DDYGIIGQNLMVGYRMVFDRENLKLGWSKSKCLDIN-SKSYNAKPPSNDGSPIAIPSNEQ 480

Query: 481 LSPPNRQEIAPTAAR-AFSKSSLTAPHFS 509
            SPPNRQ IAPTA+R   SKSS TA HFS
Sbjct: 481 KSPPNRQAIAPTASRTTSSKSSPTASHFS 508

BLAST of CmoCh04G024170 vs. ExPASy TrEMBL
Match: A0A5A7V9R0 (Aspartic proteinase-like protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold313G001730 PE=3 SV=1)

HSP 1 Score: 848.2 bits (2190), Expect = 1.9e-242
Identity = 416/524 (79.39%), Positives = 456/524 (87.02%), Query Frame = 0

Query: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKS-RNGNASGKFWPRRNSLK 60
           MAN  ++LL +AC FVD S+ L LS +L+HRFSDEAK+LWKS R GN S KFWP RNSLK
Sbjct: 1   MANCALLLLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWKSTRTGNVSAKFWPPRNSLK 60

Query: 61  YFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVAL 120
           YF+ L DYDLKRRRLKIGSKY+++FPSEG++V+FFGNEF+WLHYTWIDIGTP V FLVAL
Sbjct: 61  YFQMLLDYDLKRRRLKIGSKYDMLFPSEGSQVIFFGNEFNWLHYTWIDIGTPRVPFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKS 180
           D GSDLLWVPCDC+QCAPLSAS+YS LDRDLS YNPALS+TS++L C HQLCAWSTTCKS
Sbjct: 121 DVGSDLLWVPCDCVQCAPLSASYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKS 180

Query: 181 PDDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDG 240
           P+DPCTYKRDYY+DNTSTSG+MIEDKLHL SFSKHGT  LLQASVVLGCGRKQSG YLDG
Sbjct: 181 PNDPCTYKRDYYSDNTSTSGYMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDG 240

Query: 241 AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPL 300
           AAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFDNNGSGRI+FGD+GPATQQTTQFLPL
Sbjct: 241 AAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRIIFGDDGPATQQTTQFLPL 300

Query: 301 FGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNAT 360
           FGEF AYF+ VESFCVGSSCLQ+SGF ALVDSGSSFTYLP E+YKKIVFEFDKQVK NAT
Sbjct: 301 FGEFAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNAT 360

Query: 361 RIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEET 420
           RI+LQE PWNYCYN S+L S  IPSMKLVFPLNQ FIHDPVY LP +QGYK+FCLTLEET
Sbjct: 361 RIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQIFIHDPVYILPANQGYKVFCLTLEET 420

Query: 421 DDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDG---SPTALP 480
           D+DYGVIGQNLMVGYR+VFDRENL+LGWSKSKCLDIN     HAKPPSN+G   SP ALP
Sbjct: 421 DEDYGVIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSSTTEHAKPPSNNGNAKSPIALP 480

Query: 481 TDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLL 521
                 P NRQ IAPTAAR  SKSSL+A +FS      L  FL+
Sbjct: 481 ------PTNRQAIAPTAARTSSKSSLSASYFSPLLLLLLAAFLV 518

BLAST of CmoCh04G024170 vs. ExPASy TrEMBL
Match: A0A1S3CD36 (aspartic proteinase-like protein 1 OS=Cucumis melo OX=3656 GN=LOC103499293 PE=3 SV=1)

HSP 1 Score: 847.8 bits (2189), Expect = 2.4e-242
Identity = 416/524 (79.39%), Positives = 456/524 (87.02%), Query Frame = 0

Query: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKS-RNGNASGKFWPRRNSLK 60
           MAN  ++LL +AC FVD S+ L LS +L+HRFSDEAK+LWKS R GN S KFWP RNSLK
Sbjct: 1   MANCSLLLLLIACLFVDCSLGLTLSLKLVHRFSDEAKSLWKSTRTGNVSAKFWPPRNSLK 60

Query: 61  YFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVAL 120
           YF+ L DYDLKRRRLKIGSKY+++FPSEG++V+FFGNEF+WLHYTWIDIGTP V FLVAL
Sbjct: 61  YFQMLLDYDLKRRRLKIGSKYDMLFPSEGSQVIFFGNEFNWLHYTWIDIGTPRVPFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKS 180
           D GSDLLWVPCDC+QCAPLSAS+YS LDRDLS YNPALS+TS++L C HQLCAWSTTCKS
Sbjct: 121 DVGSDLLWVPCDCVQCAPLSASYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKS 180

Query: 181 PDDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDG 240
           P+DPCTYKRDYY+DNTSTSG+MIEDKLHL SFSKHGT  LLQASVVLGCGRKQSG YLDG
Sbjct: 181 PNDPCTYKRDYYSDNTSTSGYMIEDKLHLTSFSKHGTHSLLQASVVLGCGRKQSGSYLDG 240

Query: 241 AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPL 300
           AAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFDNNGSGRI+FGD+GPATQQTTQFLPL
Sbjct: 241 AAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRIIFGDDGPATQQTTQFLPL 300

Query: 301 FGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNAT 360
           FGEF AYF+ VESFCVGSSCLQ+SGF ALVDSGSSFTYLP E+YKKIVFEFDKQVK NAT
Sbjct: 301 FGEFAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKFNAT 360

Query: 361 RIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEET 420
           RI+LQE PWNYCYN S+L S  IPSMKLVFPLNQ FIHDPVY LP +QGYK+FCLTLEET
Sbjct: 361 RIVLQELPWNYCYNLSTLVSFNIPSMKLVFPLNQIFIHDPVYILPANQGYKVFCLTLEET 420

Query: 421 DDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDG---SPTALP 480
           D+DYGVIGQNLMVGYR+VFDRENL+LGWSKSKCLDIN     HAKPPSN+G   SP ALP
Sbjct: 421 DEDYGVIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSSTTEHAKPPSNNGNAKSPIALP 480

Query: 481 TDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLL 521
                 P NRQ IAPTAAR  SKSSL+A +FS      L  FL+
Sbjct: 481 ------PTNRQAIAPTAARTSSKSSLSASYFSPLLLLLLAAFLV 518

BLAST of CmoCh04G024170 vs. ExPASy TrEMBL
Match: A0A0A0KN37 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G407090 PE=3 SV=1)

HSP 1 Score: 835.9 bits (2158), Expect = 9.5e-239
Identity = 412/524 (78.63%), Positives = 453/524 (86.45%), Query Frame = 0

Query: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKS-RNGNASGKFWPRRNSLK 60
           MAN  ++LLF+A  FV+ S+AL LS  L+HRFSDEAK+LW+S R GN S KFWP  NSLK
Sbjct: 1   MANCALLLLFIASLFVNCSLALTLSLNLVHRFSDEAKSLWESRRTGNVSAKFWPPTNSLK 60

Query: 61  YFETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVAL 120
           YF+ L DYDLKRRRL IGSKY+V+FPSEG++V+FFGNEF+WLHYTWID+GTPSV FLVAL
Sbjct: 61  YFQMLMDYDLKRRRLNIGSKYDVLFPSEGSQVIFFGNEFNWLHYTWIDLGTPSVPFLVAL 120

Query: 121 DAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKS 180
           D GSDLLWVPCDCIQCAPLSA++YS LDRDLS YNPALS+TS++L C HQLCAWSTTCKS
Sbjct: 121 DVGSDLLWVPCDCIQCAPLSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKS 180

Query: 181 PDDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDG 240
            +DPCTYKRDYY+DNTSTSGFMIEDKL L SFSKHGT  LLQASVV GCGRKQSG YLDG
Sbjct: 181 ANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGRKQSGSYLDG 240

Query: 241 AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPL 300
           AAPDGVMGLGPGNISVPTLLA+ GLVRNTFSLCFDNNGSGRILFGD+GPATQQTTQFLPL
Sbjct: 241 AAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPATQQTTQFLPL 300

Query: 301 FGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNAT 360
           FGEF AYF+ VESFCVGSSCLQ+SGF ALVDSGSSFTYLP E+YKKIVFEFDKQVK+NAT
Sbjct: 301 FGEFAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNAT 360

Query: 361 RIILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEET 420
           RI+L+E PWNYCYN S+L S  IPSM+LVFPLNQ FIHDPVY LP +QGYK+FCLTLEET
Sbjct: 361 RIVLRELPWNYCYNISTLVSFNIPSMQLVFPLNQIFIHDPVYVLPANQGYKVFCLTLEET 420

Query: 421 DDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDG---SPTALP 480
           D+DYGVIGQNLMVGYR+VFDRENL+LGWSKSKCLDIN     HAKPPSN+G   SP ALP
Sbjct: 421 DEDYGVIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSSTTEHAKPPSNNGNAKSPIALP 480

Query: 481 TDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLL 521
                 P NRQ IAPTAAR  SKSSL+A HFS      L  FL+
Sbjct: 481 ------PTNRQAIAPTAARTSSKSSLSASHFSPLLLLLLAAFLV 518

BLAST of CmoCh04G024170 vs. NCBI nr
Match: XP_022963449.1 (aspartic proteinase-like protein 1 [Cucurbita moschata])

HSP 1 Score: 1082.0 bits (2797), Expect = 0.0e+00
Identity = 529/529 (100.00%), Positives = 529/529 (100.00%), Query Frame = 0

Query: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60
           MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKY
Sbjct: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60

Query: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120
           FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD
Sbjct: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120

Query: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180
           AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP
Sbjct: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180

Query: 181 DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240
           DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA
Sbjct: 181 DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240

Query: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300
           APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF
Sbjct: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300

Query: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360
           GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR
Sbjct: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360

Query: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420
           IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD
Sbjct: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420

Query: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH 480
           DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH
Sbjct: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH 480

Query: 481 LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML 530
           LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML
Sbjct: 481 LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML 529

BLAST of CmoCh04G024170 vs. NCBI nr
Match: KAG7032744.1 (Aspartic proteinase-like protein 1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 525/529 (99.24%), Positives = 526/529 (99.43%), Query Frame = 0

Query: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60
           MANFLVVLLFV CF VDSSVALRLSSRL+HRFSDEAKALWKSRNGNASGKFWPRRNSLKY
Sbjct: 1   MANFLVVLLFVGCFLVDSSVALRLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60

Query: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120
           FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD
Sbjct: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120

Query: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180
           AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP
Sbjct: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180

Query: 181 DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240
           DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA
Sbjct: 181 DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240

Query: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300
           APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF
Sbjct: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300

Query: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360
           GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR
Sbjct: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360

Query: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420
           IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD
Sbjct: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420

Query: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH 480
           DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH
Sbjct: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH 480

Query: 481 LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML 530
           LSPPNRQEIAPTAARAF KSSLTAPHFSLFSYCCLRLFLLLFEFVESML
Sbjct: 481 LSPPNRQEIAPTAARAFYKSSLTAPHFSLFSYCCLRLFLLLFEFVESML 529

BLAST of CmoCh04G024170 vs. NCBI nr
Match: XP_023545003.1 (aspartic proteinase-like protein 1 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1071.6 bits (2770), Expect = 2.0e-309
Identity = 524/529 (99.05%), Positives = 526/529 (99.43%), Query Frame = 0

Query: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60
           MANFLVVLLFVACF VDSSVALRLSSRL+HRFSDEAKALWKSRNGNASGKFWPRRNSLKY
Sbjct: 1   MANFLVVLLFVACFLVDSSVALRLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60

Query: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120
           FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD
Sbjct: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120

Query: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180
           AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP
Sbjct: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180

Query: 181 DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240
           DDPC+YKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA
Sbjct: 181 DDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240

Query: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300
           APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF
Sbjct: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300

Query: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360
           GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR
Sbjct: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360

Query: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420
           IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD
Sbjct: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420

Query: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH 480
           DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEA HAKPPSNDGSPTALPTDGH
Sbjct: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDGSPTALPTDGH 480

Query: 481 LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML 530
           LSPPNRQEIAPTAARAFSKSSLTAPHFSLFS CCLRLFLLLFEFVESML
Sbjct: 481 LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSCCCLRLFLLLFEFVESML 529

BLAST of CmoCh04G024170 vs. NCBI nr
Match: KAG6602050.1 (Aspartic proteinase-like protein 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 995.0 bits (2571), Expect = 2.5e-286
Identity = 481/486 (98.97%), Positives = 482/486 (99.18%), Query Frame = 0

Query: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60
           MANFLVVLLFV CF VDSSVALRLSSRL+HRFSDEAKALWKSRNGNASGKFWPRRNSLKY
Sbjct: 1   MANFLVVLLFVGCFLVDSSVALRLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60

Query: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120
           FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD
Sbjct: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120

Query: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180
           AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP
Sbjct: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180

Query: 181 DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240
           DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA
Sbjct: 181 DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240

Query: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300
           APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF
Sbjct: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300

Query: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360
           GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR
Sbjct: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360

Query: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420
           IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD
Sbjct: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420

Query: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH 480
           DDYGVIGQNLM GYRLVFDRENLQLGWSKSKCLDINHGEA HAKPPSNDGSPTALPTDGH
Sbjct: 421 DDYGVIGQNLMAGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDGSPTALPTDGH 480

Query: 481 LSPPNR 487
           LSPPNR
Sbjct: 481 LSPPNR 486

BLAST of CmoCh04G024170 vs. NCBI nr
Match: XP_023545012.1 (aspartic proteinase-like protein 1 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 955.3 bits (2468), Expect = 2.2e-274
Identity = 481/529 (90.93%), Positives = 484/529 (91.49%), Query Frame = 0

Query: 1   MANFLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60
           MANFLVVLLFVACF VDSSVALRLSSRL+HRFSDEAKALWKSRNGNASGKFWPRRNSLKY
Sbjct: 1   MANFLVVLLFVACFLVDSSVALRLSSRLVHRFSDEAKALWKSRNGNASGKFWPRRNSLKY 60

Query: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 120
           FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDW             S L+ L 
Sbjct: 61  FETLKDYDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDW------------CSLLLGLS 120

Query: 121 AGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180
                                     DRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP
Sbjct: 121 -----------------------MMQDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSP 180

Query: 181 DDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240
           DDPC+YKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA
Sbjct: 181 DDPCSYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGA 240

Query: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300
           APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF
Sbjct: 241 APDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLF 300

Query: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360
           GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR
Sbjct: 301 GEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATR 360

Query: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420
           IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD
Sbjct: 361 IILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSFIHDPVYTLPDSQGYKLFCLTLEETD 420

Query: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGH 480
           DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEA HAKPPSNDGSPTALPTDGH
Sbjct: 421 DDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEADHAKPPSNDGSPTALPTDGH 480

Query: 481 LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML 530
           LSPPNRQEIAPTAARAFSKSSLTAPHFSLFS CCLRLFLLLFEFVESML
Sbjct: 481 LSPPNRQEIAPTAARAFSKSSLTAPHFSLFSCCCLRLFLLLFEFVESML 494

BLAST of CmoCh04G024170 vs. TAIR 10
Match: AT5G10080.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 461.1 bits (1185), Expect = 1.2e-129
Identity = 250/530 (47.17%), Positives = 346/530 (65.28%), Query Frame = 0

Query: 8   LLFVACFF-VDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETLKD 67
           LLF   F   + ++A   SSRLIHRFSDE +A  K+ + + S    P + SL+Y+  L +
Sbjct: 8   LLFCVLFLATEETLASLFSSRLIHRFSDEGRASIKTPSSSDS---LPNKQSLEYYRLLAE 67

Query: 68  YDLKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLL 127
            D +R+R+ +G+K + + PSEG++ +  GN+F WLHYTWIDIGTPSVSFLVALD GS+LL
Sbjct: 68  SDFRRQRMNLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTGSNLL 127

Query: 128 WVPCDCIQCAPLSASHYSSL-DRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCT 187
           W+PC+C+QCAPL++++YSSL  +DL+ YNP+ S+TS+   CSH+LC  ++ C+SP + C 
Sbjct: 128 WIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCP 187

Query: 188 YKRDYYTDNTSTSGFMIEDKLHLASFSKH---GTQRLLQASVVLGCGRKQSGYYLDGAAP 247
           Y  +Y + NTS+SG ++ED LHL   + +        ++A VV+GCG+KQSG YLDG AP
Sbjct: 188 YTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAP 247

Query: 248 DGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPL-FG 307
           DG+MGLGP  ISVP+ L+KAGL+RN+FSLCFD   SGRI FGD GP+ QQ+T FL L   
Sbjct: 248 DGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNN 307

Query: 308 EFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRI 367
           ++  Y V VE+ C+G+SCL+++ F   +DSG SFTYLP EIY+K+  E D+ +  NAT  
Sbjct: 308 KYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHI--NATSK 367

Query: 368 ILQEFPWNYCYNSSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTLEET- 427
             +   W YCY SS+     +P++KL F  N +F IH P++    SQG   FCL +  + 
Sbjct: 368 NFEGVSWEYCYESSA--EPKVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSG 427

Query: 428 DDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDG 487
            +  G IGQN M GYR+VFDREN++LGWS SKC + +  E   A P S   SP  LPTD 
Sbjct: 428 QEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE-DKIEPPQASPGST-SSPNPLPTDE 487

Query: 488 HLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLFEFVESML 530
             S          A +  SK+  ++  +S  S   L   LLL  ++ S++
Sbjct: 488 QQSRGGHAVSPAIAGKTPSKTPSSSSSYSFSSIMRLFNSLLLLHWLASLM 528

BLAST of CmoCh04G024170 vs. TAIR 10
Match: AT4G35880.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 285.0 bits (728), Expect = 1.2e-76
Identity = 179/459 (39.00%), Positives = 264/459 (57.52%), Query Frame = 0

Query: 4   FLVVLLFVACFFVDSSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFE- 63
           FL+ +L +  F   S      +  + HRFSDE K  W    G  + KF P + S +YF  
Sbjct: 11  FLIPILMLLSF--GSCNGRIFTFEMHHRFSDEVKQ-WSDSTGRFA-KF-PPKGSFEYFNA 70

Query: 64  -TLKDYDLKRRRL---KIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVA 123
             L+D+ ++ RRL   +  S+  + F S+GN      +   +LHYT + +GTP + F+VA
Sbjct: 71  LVLRDWLIRGRRLSESESESESSLTF-SDGNSTSRI-SSLGFLHYTTVKLGTPGMRFMVA 130

Query: 124 LDAGSDLLWVPCDCIQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCK 183
           LD GSDL WVPCDC +CAP   + Y+S + +LS YNP +S T++ ++C++ LCA    C 
Sbjct: 131 LDTGSDLFWVPCDCGKCAPTEGATYAS-EFELSIYNPKVSTTNKKVTCNNSLCAQRNQCL 190

Query: 184 SPDDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLD 243
                C Y   Y +  TSTSG ++ED +HL +  K+  +  ++A V  GCG+ QSG +LD
Sbjct: 191 GTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLD 250

Query: 244 GAAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLP 303
            AAP+G+ GLG   ISVP++LA+ GLV ++FS+CF ++G GRI FGD G + Q+ T F  
Sbjct: 251 IAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-N 310

Query: 304 LFGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNA 363
           L      Y + V    VG++ +    F AL D+G+SFTYL   +Y  +   F  Q + + 
Sbjct: 311 LNPSHPNYNITVTRVRVGTTLIDDE-FTALFDTGTSFTYLVDPMYTTVSESFHSQAQ-DK 370

Query: 364 TRIILQEFPWNYCYN-SSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTL 423
                   P+ YCY+ S+   +S IPS+ L    N  F I+DP+  +  ++G  ++CL +
Sbjct: 371 RHSPDSRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDPIIVI-STEGELVYCLAI 430

Query: 424 EETDDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDI 456
            ++  +  +IGQN M GYR+VFDRE L L W K  C DI
Sbjct: 431 VKS-SELNIIGQNYMTGYRVVFDREKLVLAWKKFDCYDI 454

BLAST of CmoCh04G024170 vs. TAIR 10
Match: AT2G17760.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 280.8 bits (717), Expect = 2.3e-75
Identity = 186/493 (37.73%), Positives = 266/493 (53.96%), Query Frame = 0

Query: 30  HRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETL--KDYDLKRRRLKIGSKYEVIFPSE 89
           HRFSD+         G   G   P R+S KY+  +  +D  ++ RRL    +  V F S+
Sbjct: 39  HRFSDQVV-------GVLPGDGLPNRDSSKYYRVMAHRDRLIRGRRLANEDQSLVTF-SD 98

Query: 90  GNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCA-PLSASHYSSL 149
           GNE V   +   +LHY  + +GTPS  F+VALD GSDL W+PCDC  C   L A   SSL
Sbjct: 99  GNETVRV-DALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSL 158

Query: 150 DRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCTYKRDYYTDNTSTSGFMIEDKL 209
             DL+ Y+P  S+TS  + C+  LC     C SP+  C Y+  Y ++ TS++G ++ED L
Sbjct: 159 --DLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVL 218

Query: 210 HLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGAAPDGVMGLGPGNISVPTLLAKAGLVR 269
           HL S  K  + + + A V  GCG+ Q+G + DGAAP+G+ GLG  +ISVP++LAK G+  
Sbjct: 219 HLVSNDK--SSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAA 278

Query: 270 NTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLFGEFDAYFVEVESFCVGSSCLQKSGFH 329
           N+FS+CF N+G+GRI FGD G   Q+ T  L +      Y + V    VG +      F 
Sbjct: 279 NSFSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNTGDLE-FD 338

Query: 330 ALVDSGSSFTYLPTEIYKKIVFEFDKQVKLNATRIILQEFPWNYCYNSSSLESSY-IPSM 389
           A+ DSG+SFTYL    Y  I   F+        +    E P+ YCY  S  + S+  P++
Sbjct: 339 AVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 398

Query: 390 KLVFPLNQSF-IHDPVYTLPDSQGYKLFCLTLEETDDDYGVIGQNLMVGYRLVFDRENLQ 449
            L      S+ ++ P+  +P  +   ++CL + +  +D  +IGQN M GYR+VFDRE L 
Sbjct: 399 NLTMKGGSSYPVYHPLVVIP-MKDTDVYCLAIMKI-EDISIIGQNFMTGYRVVFDREKLI 458

Query: 450 LGWSKSKCLDINHGEAGHAKPPSNDGSPTALPTDGHLSP-----PNRQEIAPTAARAFSK 509
           LGW +S C     GE      PSN  S +A P      P     P+++    T + A+S 
Sbjct: 459 LGWKESDCYT---GETSARTLPSNRSSSSARPPASSFDPEATNIPSQRPNTSTTSAAYSL 511

Query: 510 S-SLTAPHFSLFS 512
           S SL+   FS+ +
Sbjct: 519 SISLSLFFFSILA 511

BLAST of CmoCh04G024170 vs. TAIR 10
Match: AT3G51330.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 240.0 bits (611), Expect = 4.4e-63
Identity = 179/536 (33.40%), Positives = 267/536 (49.81%), Query Frame = 0

Query: 4   FLVVLLFVACFFVDSSVAL-RLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFE 63
           F+++ L V C+ ++   A  + S  + H FSD  K               P + SL+YF+
Sbjct: 8   FVLLSLLVVCWGLERCEASGKFSFEVHHMFSDRVK------QSLGLDDLVPEKGSLEYFK 67

Query: 64  TLKDYD--LKRRRLKIGSKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALD 123
            L   D  ++ R L   ++   I    GN  +   +   +LHY  + +GTP+  FLVALD
Sbjct: 68  VLAQRDRLIRGRGLASNNEETPITFMRGNRTISI-DLLGFLHYANVSVGTPATWFLVALD 127

Query: 124 AGSDLLWVPCDC-IQCAPLSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKS 183
            GSDL W+PC+C   C         S  R L+ Y+P  S+TS  + CS   C  S+ C S
Sbjct: 128 TGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSS 187

Query: 184 PDDPCTYKRDYYTDNTSTSGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDG 243
           P   C Y+  Y + +T T+G + ED LHL +    G +  ++A++ LGCG+ Q+G+    
Sbjct: 188 PASSCPYQIQYLSKDTFTTGTLFEDVLHLVT-EDEGLEP-VKANITLGCGKNQTGFLQSS 247

Query: 244 AAPDGVMGLGPGNISVPTLLAKAGLVRNTFSLCFDN--NGSGRILFGDNGPATQQTTQFL 303
           AA +G++GLG  + SVP++LAKA +  N+FS+CF N  +  GRI FGD G   Q  T  L
Sbjct: 248 AAVNGLLGLGLKDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLL 307

Query: 304 PLFGEFDAYFVEVESFCVGSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQVKLN 363
           P       Y V V    VG   +      AL D+G+SFT+L    Y  I   FD  V  +
Sbjct: 308 PT-EPSPTYAVSVTEVSVGGDAVGVQ-LLALFDTGTSFTHLLEPEYGLITKAFDDHV-TD 367

Query: 364 ATRIILQEFPWNYCYNSSSLESSYI-PSMKLVFP-LNQSFIHDPVYTLPDSQGYKLFCL- 423
             R I  E P+ +CY+ S  +++ + P + + F   +Q F+ +P++ + +     ++CL 
Sbjct: 368 KRRPIDPELPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLG 427

Query: 424 TLEETDDDYGVIGQNLMVGYRLVFDRENLQLGWSKSKCLDINHGEAGHAKPPSNDGS--- 483
            L+  D    +IGQN M GYR+VFDRE + LGW +S C +    E+    PP  +     
Sbjct: 428 ILKSVDFKINIIGQNFMSGYRIVFDRERMILGWKRSDCFEDESLESTTPPPPETEAPSPS 487

Query: 484 -----PTALPTDGHLSPPNRQEIAPTAARAFSKSSLTAPHFSLFSYCCLRLFLLLF 523
                P+ LP     +PP   +I P  +   S +   A    L S   L L LL F
Sbjct: 488 ASTPLPSLLPPPAAATPP---QIDPRNSTRNSGTGTAANLVPLASQLLLLLPLLAF 528

BLAST of CmoCh04G024170 vs. TAIR 10
Match: AT3G51360.1 (Eukaryotic aspartyl protease family protein)

HSP 1 Score: 235.3 bits (599), Expect = 1.1e-61
Identity = 159/476 (33.40%), Positives = 241/476 (50.63%), Query Frame = 0

Query: 18  SSVALRLSSRLIHRFSDEAKALWKSRNGNASGKFWPRRNSLKYFETLKDYDLKRRRLKIG 77
           SSV+  LS  + HRFS++ K +         G   P   SL Y++ L   D  R+     
Sbjct: 16  SSVSGSLSFEIHHRFSEQVKTV-------LGGHGLPEMGSLDYYKALVHRDRGRQLTSNN 75

Query: 78  SKYEVIFPSEGNEVVFFGNEFDWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCDCIQCAP 137
           +    I  ++GN       E  +LHY  + IGTP+  FLVALD GSDL W+PC+C     
Sbjct: 76  NNQTTISFAQGNST----EEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCV 135

Query: 138 LSASHYSSLDRDLSAYNPALSNTSQYLSCSHQLCAWSTTCKSPDDPCTYKRDYYTDNTST 197
            S          L+ YNP+ S +S  ++C+  LCA    C SP   C Y+  Y +  + +
Sbjct: 136 RSMETDQGERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKS 195

Query: 198 SGFMIEDKLHLASFSKHGTQRLLQASVVLGCGRKQSGYYLDGAAPDGVMGLGPGNISVPT 257
           +G ++ED +H++  ++ G  R   A +  GC   Q G + +  A +G+MGL   +I+VP 
Sbjct: 196 TGVLVEDVIHMS--TEEGEAR--DARITFGCSESQLGLFKE-VAVNGIMGLAIADIAVPN 255

Query: 258 LLAKAGLVRNTFSLCFDNNGSGRILFGDNGPATQQTTQFLPLFGEFDAYF--VEVESFCV 317
           +L KAG+  ++FS+CF  NG G I FGD G + Q  T   PL G     F  V +  F V
Sbjct: 256 MLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQLET---PLSGTISPMFYDVSITKFKV 315

Query: 318 GSSCLQKSGFHALVDSGSSFTYLPTEIYKKIVFEFDKQV---KLNATRIILQEFPWNYCY 377
           G   +  + F A  DSG++ T+L    Y  +   F   V   +L+ +     + P+ +CY
Sbjct: 316 GKVTVD-TEFTATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKS----VDSPFEFCY 375

Query: 378 -NSSSLESSYIPSMKLVFPLNQSF-IHDPVYTLPDSQG-YKLFCL-TLEETDDDYGVIGQ 437
             +S+ +   +PS+        ++ +  P+     S G ++++CL  L++ + D+ +IGQ
Sbjct: 376 IITSTSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQ 435

Query: 438 NLMVGYRLVFDRENLQLGWSKSKCLDIN--HGEAGHAKPPSNDGSPTALPTDGHLS 483
           N M  YR+V DRE   LGW KS C D N   G    AKPPS   +PT+ P   +LS
Sbjct: 436 NFMTNYRIVHDRERRILGWKKSNCNDTNGFTGPTALAKPPSM--APTSSPRTINLS 465
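
The Identity and Positives percentages in the HSP headers above are the ratio of matching (or conservatively substituted) positions to the alignment length. As a minimal sketch of that arithmetic, the Python snippet below parses one header line and recomputes both values. The helper name `parse_hsp_line` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Hypothetical helper for illustration; it is not part of the database page.
# It matches the "Identity = a/b (x%), Positives = c/d (y%)" header line and
# also tolerates the variant spelling "Postives".
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_line(line):
    """Return counts plus recomputed and reported percentages for one HSP."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP identity line: " + line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned": int(ident_len),
        # Percentages recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the header, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


# Header of the Cucumis sativus (A0A0A0KN37) hit from this record.
hit = parse_hsp_line(
    "Identity = 412/524 (78.63%), Positives = 453/524 (86.45%), Query Frame = 0"
)
print(hit["identity_pct"], hit["positives_pct"])
```

Comparing the recomputed and reported fields is a quick consistency check when these headers are copied or re-typed by hand.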

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q9LX20	1.7e-128	47.17	Aspartic proteinase-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=At5g10080 ...
Q8VYV9	3.2e-74	37.73	Aspartyl protease family protein 1 OS=Arabidopsis thaliana OX=3702 GN=APF1 PE=2 ...
Q4V3D2	2.7e-28	28.03	Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1
Q9S9K4	3.8e-27	27.66	Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2
Q9M9A8	2.3e-24	27.38	Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1

Match Name	E-value	Identity	Description
A0A6J1HK50	0.0e+00	100.00	aspartic proteinase-like protein 1 OS=Cucurbita moschata OX=3662 GN=LOC111463738...
A0A6J1CJM3	5.0e-248	83.30	aspartic proteinase-like protein 1 OS=Momordica charantia OX=3673 GN=LOC11101177...
A0A5A7V9R0	1.9e-242	79.39	Aspartic proteinase-like protein 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E56...
A0A1S3CD36	2.4e-242	79.39	aspartic proteinase-like protein 1 OS=Cucumis melo OX=3656 GN=LOC103499293 PE=3 ...
A0A0A0KN37	9.5e-239	78.63	Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G40709...

Match Name	E-value	Identity	Description
XP_022963449.1	0.0e+00	100.00	aspartic proteinase-like protein 1 [Cucurbita moschata]
KAG7032744.1	0.0e+00	99.24	Aspartic proteinase-like protein 1 [Cucurbita argyrosperma subsp. argyrosperma]
XP_023545003.1	2.0e-309	99.05	aspartic proteinase-like protein 1 isoform X1 [Cucurbita pepo subsp. pepo]
KAG6602050.1	2.5e-286	98.97	Aspartic proteinase-like protein 1, partial [Cucurbita argyrosperma subsp. soror...
XP_023545012.1	2.2e-274	90.93	aspartic proteinase-like protein 1 isoform X2 [Cucurbita pepo subsp. pepo]

Match Name	E-value	Identity	Description
AT5G10080.1	1.2e-129	47.17	Eukaryotic aspartyl protease family protein
AT4G35880.1	1.2e-76	39.00	Eukaryotic aspartyl protease family protein
AT2G17760.1	2.3e-75	37.73	Eukaryotic aspartyl protease family protein
AT3G51330.1	4.4e-63	33.40	Eukaryotic aspartyl protease family protein
AT3G51360.1	1.1e-61	33.40	Eukaryotic aspartyl protease family protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001461	Aspartic peptidase A1 family	PRINTS	PR00792	PEPSIN	coord: 424..439, score: 26.71; coord: 327..338, score: 39.92; coord: 108..128, score: 49.89
IPR001461	Aspartic peptidase A1 family	PANTHER	PTHR13683	ASPARTYL PROTEASES	coord: 5..475
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 286..457, e-value: 1.7E-25, score: 91.4
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 93..285, e-value: 2.2E-43, score: 150.5
IPR021109	Aspartic peptidase domain superfamily	SUPERFAMILY	50630	Acid proteases	coord: 97..454
IPR032799	Xylanase inhibitor, C-terminal	PFAM	PF14541	TAXi_C	coord: 326..448, e-value: 1.6E-13, score: 50.7
IPR032861	Xylanase inhibitor, N-terminal	PFAM	PF14543	TAXi_N	coord: 103..285, e-value: 8.8E-37, score: 127.0
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 461..486
None	No IPR available	PANTHER	PTHR13683:SF339	EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEIN	coord: 5..475
IPR001969	Aspartic peptidase, active site	PROSITE	PS00141	ASP_PROTEASE	coord: 327..338
IPR033121	Peptidase family A1 domain	PROSITE	PS51767	PEPTIDASE_A1	coord: 102..448, score: 36.120651

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmoCh04G024170.1	CmoCh04G024170.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006508	proteolysis
cellular_component	GO:0016021	integral component of membrane
molecular_function	GO:0004190	aspartic-type endopeptidase activity