CmoCh02G000010 (gene) Cucurbita moschata (Rifu)

Name: CmoCh02G000010
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Eukaryotic aspartyl protease family protein
Location: Cmo_Chr02 : 13644 .. 20357 (+)
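
The location line can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF convention; this page does not state it), so the span is end - start + 1. The helper names `parse_location` and `span_length` are hypothetical and exist only for this illustration.

```python
import re

def parse_location(text):
    """Split a location such as 'Cmo_Chr02 : 13644 .. 20357 (+)' into its parts."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr02 : 13644 .. 20357 (+)")
print(seqid, strand, span_length(start, end))  # Cmo_Chr02 + 6714
```

If the assumption holds, the gene sequence listed for this feature should be 6714 bases long.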
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, CDS
CAATTAGTTAGCCCAAAAGAAATACCCAAGCCCAACCCAACTCGTCTCCCTCGCTTCAACTCTCTCTCTCTCTCTCTCTCTCTCATCTTACACCGCCGGCAACTTCTCGAATTGCTTCTCTGCAATGGCGACCACAGCTTGTTTCATCATCGTCAGTAGGAACAACATCCCTATTTATGAAGCTGAAGTCGGATCTGCTGCTAAAGTATTTCCTTCTTTTTCATTTTCTATCGTTTCACGTTTAGATCGCATTTTTCCTTCACGAGATCGCCAGAAAAGAAAAAAAAGATGATTTCATTTTGGTTCTTTAATTACTTTAGTTTTGACTGTCATTGGTCGCTCATCTTTTTGGCTTGACATGAAAGAGAGAGGATTCTGCTCAGCTGCATCAATTTATATTGCACGCGTCCCTTGACATTGTTCAAGACCTGGCATGGACTACTAGTGCTATGTGAGGTCTTATTCCTTCAGTTTTATTATTAGATCATCGAGATACCTCTTTTTTCTTCTATTTCCTTTGAAATTCCCTTTAGATTCTTTTTCTTGGTGCGTTTTTTGCAGGTTCTTGAAAGCAGTCGATAGATTCAATGATCTGGTGGTGTCTGTTTATGTAACCGCTGGCCATATCCTTTAATATCGAGGGGGGTTTTTCGGGAACAATGTGATGGGAATTGAGTCAAATTTCATCAATTTATTTACTTTTAAAAGAGCCTCTTTGGTCTTTGGTCTTAAGAATGCTGGTTTTAAGAATTAATGATTGCCCTTCTCTTTTTTTTCCCGGTCCAATATTTGTTCTCGAGAATCTATACCCTCACCACCCCGATTGCATTACTGAAGTATTTATACTGATAAAACGGTCCCAATCCTTTTTTAAGATATAGCAGGCAAAAATTTTCATTGTAATGGGATCAATGTTGTACTATGATTCTGGAATTCTATATACATTTGAGAGATTAAATTCAGTTTATAGTTCCTATTATTTCCTTTTCTGGTTTTTCTTTGACTATTCACTACATACGCGATTGATGTTACTTCATGACTCTCGCAATGATGATGGAATCAAGAGTTTTTTTCAAGAGGTTCACGAGCTTTACATAAAGGTAAGTTTGACATGTTTCTTAGGTATTATTATTATTATTATTGCTATATTGGAAGGTGTAATGCTTGATTTACTATGGTATATTACTTAGGTTGAGTGAACTGTACCTCATTTGAATAGATCCCACTAGATAGAGTTCATTCTTTTCCCTCAAATATACTGAATTTTGACATTTGAATGGTACAGTTGTTAAATAAATAAATAAATTGTATCCCTTGAGTCTTAACTGTATATATTGACCAAAAAGAGTAAGTGGTTGATTTGTTATGAGTTCAGGTTGTCATCTTGAGTTTGCATGAGGCTTGAGATAGGGGTGGCAAGGATGGTTCATTTTTTGGATCCCTTGTTGTTTGGTCGTGTCAGGATGCTTAGATTCATGGCTCCTCTAGATTCCTTCAATGTGTTATTATTGTCTCTGTAGCACAGGATTGTGACCTATTTCTTCTAGGGAAATTAATTGGCTTTTGGAAATAAAATTGATAGAATGATTTGAAATCGTGTAGTAAAGTTACAAAATATTGTTAATAATTTATCGAGTGGATTTACGTGTTTGTCCCATGAAATTGTTGAGGTGCACTTAAGAGGGTCCGGACACTCACAGATATATGAAGAAAAAAAATCTAATGGGTTAAGATTTCATCATCATAATTTAATTGCTGAAAGAACTATTTGTAGAAAAGCTATGTTTGGAACATACCAAGATTTAATATCCTCTAACAGTGTACTTCAGCATTATGCTCAACTTGATTACATCACATTGAACAAAATGTTAGTGATAAAGATTTCCCGACTTGTCTGTCATCTTCTCCTGATATATGATTCAATTAGCCAATCAAACTTTGAATGCTTGAGGCTTTCTTGAATGGTAAGAAAGTGGTGTCAAGAGGGGAGTGTAATTGTT
GTTCTGCTGATTTGTATCTGCTGTCATAATAGTATGATCCGGAGTACCACCCATCCCCAGCACTACGTACTTTTCCGTTGGAAAAAGTTTACCAATCAATGTATGGTATGCCTTTAATCAGCTATTCTGCAAATGAAATTAAATATTTCCTTGATTATTCTAGATTGAGTTTCCTATACGAGAACTTCTGTGACTCCTCTCCTTCATTTATTATTATTTTCTCTTTTGGTTTTTTAGTGCACTGTTAGTTGGTATTCCTATCAAGTTTTGTAAGTTTCATCTAGTTAGGATTTCAAAACCTTTAATTCACGTTTAGCACTACATTAGATAAAAGGTTTGGCACTTTGGTTTGGTCTCCGTTAGTCTACTCTGTACTATATTTGAAACTTAGTGAATGATGTAACATTGTTGTTGCAAAGATTTTGACGAGAGTGATTAAATTTTGTGGAATATTTTTATATTATTTATCCATTCCAAATTGCTTTGTGAATTTGCAGACTATACTTAATCCCCTCTACTTGCCTGGATCCCGCATCACATCTTCACATTTCGACACAAAAGTCCGTGCACTTGCGAGGAAGTATCTCTAGTGTTCACCTTGATTGCAGACCAACTGTGCTTTGTGACAATGAGCTTCAAAGTTTATAGCAGCATTGATTTTTCTATAATTGTGTTACATTTTGATCTTTTCTTTGCTATTTGTGGTCCTGACATTAAACCACTCAGTAGTTAAATATTGCTCAATGTCAATTATGGACTTTGTATGGCTTGATTACGAGCAGGAATAATAGCACTTAACAGCATTGATGTAATGCCCTCCCCGCTCCCTTTTACCCATCTTTTTATTGTAGGGAGTCCTATTGTGTTCTATCTTTTACAAGACAGGAATACTAAAATCATGCAAGAATGCGTTTTGAAGAAAGCTAAAGAGACAGCCCATTCCTTTTTTGAACTTTTGTGGTTGTTCCTTTCGATTTCCTTTACTTGCAATTAATGATTTGTATGGCACAAAGCAATATAAAAGGATCTCTTCAACAAAATGCCATCTATCATCCAGGGCCCGAGGAAAAGAATAAGAATCCCTCTCGTGAAGTCTCCTCCTCAGCGGGGGAAAAGAATAACAAACAAATCCATGAAAATGTTGCGAAAGAAAGATGTGATTCTTGGCGGGATAGGAAGGTTATATTTATTATAAGAGAACCGAAATTGTGGTTGAAAAGGTCTAGTGTAACCCTTGGAAAAGGAAGGTAATAAGCTTATATGAATCCGGGTCCCGAAACAAAACAGAACAGAAAAAAAGAAAAAAACTCCAAGCCCAAGAGACCACAACCAAATCTCCACAAATTCCATCAACCATTTTTCAAGTTTTGGTTCCATTCCTTAGAGATGTCCCGAACCCACAACAAGGGTGCAAGAAACTGATTTTATATGGGCTTCGATTGTCGAATCATGGGGAAAGGGGTATTGATGATATTGGTGCCGATGGTGTTCTCCATAAGCTGTTTGGCTCCATGTTCAGCTTCTTCCTTCTTTAAGGATAAGCTATGGGAGAGGAGGAGGCCAACTCTGTCGGTGCCGATCGCATCCGCATCCTCTTCGATTCCTTCATCCTCTATTGTGCTGCCTCTTCAAGGGAACGTCTTTCCAAATGGGTAAGCCACAACTCTGCTCACAGCTCCACCATGTTTCAGCTTTTGTTTTTCCTATTTCGATTTTACTCTACTGTTTCGAAGTACGGAAGTGGCTGAAAAGGGAGTTCAATGTGATTGTGATGCCTCTTAGTATCACATTGATGTCGTTTATCTTAGTGTGTTGTAACAACCTGGAAAAGAATGGGAATTGGAGTTTGGAAGGCCGTTGCAAATCTTATCTTAATGTATCTTCTCCCCCCGTGTTAAATGGGAATTGAGAAGACTACGAGACTCTGTTTCCCATGTTTTGACTTTCTGCCCTAAGTTCTTATTCTGTTTCTGTACAATTGCATTTGAGGTGTATTTATG
AGTGCTTTTCAAAATGGCTTTTTAATTTTGAAGAAAAGGAACACATCCCCAAAAGAAGTGTTTCTATAATAAGACCCATTTGAAGATATATATCTCTGTTCTTGTTGAGGCTCACTTTCTTTACTATCAGAGCATAAATAATTCTTTTCATCAATGTGTTTGAGATCATATAATTTGTATCCTTTTTCAGGTTCTATAACGTTACCCTTTTTATAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCTGACACCGGTAGTGACCTCACTTGGCTTCAATGTGACGCTCCATGTCAGCAGTGCACTGAGGTAAATTGTTATTTGATTCAGCAGTTAATGTTCATTGCAAGGGTATTCTTGAAAATATGATGACCAAATGAACGAAGAAAGTTACTAAACTACACCTTCGGCTTCAGTTGAGTTAGTTCTCATCACAAGTAATCTGAATTCCCATCTTTGAATGTATGTTTTATTTGAAGTCTCCTCAATATTCTTCTTCTTGATATGCTTATTGCTTCTGTCATTTCTTTTGGTGTTGAGAGTTGAGATTGAATGTATCAAATTGATGACCTTGTTTCCCTTTCAGACACCTCATCCGCTCTATCAACCAAGCAACGATCTTGTCCCGTGTAAGGACCCTCTGTGTATGTCCTTGCACTCATCTATTGACCACAGATGTGAGAACCCAGATCAATGTGACTACGAGGTTGAGTATGCAGATGGAGGTTCGTCTCTTGGAGTCCTTGTCAGGGATATTTTTCCTCTCAACTTAACCAATGGGGATCCAATTAGACCCCGTTTGACCCTCGGGTAAGCCGCTTGACTGCCTCAGTTATTCTTCTTAATCAAATTACTTTGTTGAGTTCCTGGCAACCTAAACCAACCTTGCGTTTTCCAATTAAAACCATACATGAGAGCTATGCAAAAATAAACACCTTTTCATGCTGCCATATATTTTCTTTTGGCGTGCAACACAACGGATAGCATAATGTAAAGCTAGAGCTTATGCTGAAAACGGAACATATGTATTCATGAAGAACCCAGATAATTAAGGTCGCAACAGGCACTAGATTTCGTGACAGAATGATTCTGATTAGTATTTTCAACAGTCATCAATAAAGACTATATAGTACGAGAGATTTATATTCGGTGTGCTTCTAACCGTAGTTTCCGGTTCGTGTGGTTAAATAAAATGCATGTAGAGTCAGTTGCATGTCAGCTATTTGAATCTGTAGTACTGAGTTTTGTTTCATTTGATTACACTCAGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAGTACTTGGCCTTGGAAAGGGAGCAGTAAGCATTGTTTCACAGCTGCACAATCAGGGCATTGTCCGTAACGTTGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGATATTTATGATCCGTATCGCTTAGCTTGGACGCCCATGTCACGGGACTACCCGTAAGTAACCTACCTTGATCATACATTTATCTTGTTATGATTCATGTCGACTCCTAATCTCACGAAACATGCTCTATGTGATTGTATCAGGAAGCACTACTCCCCTGGGTTTGGGGACCTATTCTTCAATGGAAGAAGTACTGGACTCAGAAACCTCTTCGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAATTATAACATCTTTGGTAAATCATCCGTAAAAAAAACCTTCGATTCTTTTTCTCAATGATTCTTTAGTTCATGTTGGTACTGTTCTTGTAGTTGAATAGAGAACTAACTGGGAAACCGCTAAGAGAAGCCAAGGATGATGACACGCTGCCGCTCTGCTGGAGAGGGCGGAATCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCATTGAGCTTTTCCAGTGGTAGAAGAAGCAAAGCAGTGTTCGAAATGCCAATGGAAAGTTATCTTATAATATCG
GTAAGCTCCAAGCCTCTACAACTACAATTTATAAGTTGAATGATTTAACTTATAAGTTTTGGATTGGCAGTCCAAGGGGAATGTTTGCTTGGGAATTCTGAACGGCAGTGAAGTTGGGCTTGAGAACTCCAATATCATTGGTGGTACGTACGTTATTTTGCATGCATTGCATGTTATTTCTCATAAAAACGAATGCCGTCATAGCCATGCAAAATTATTCTTTTATTAAAGAGAAAAAGGGAAAAAAAGAAGCCTCATCTTGTGTGGGGTTTGGTTGACAGATATTTCGATGCAAGATAAGATGGTAGTGTACAACAACGAGAAGCAAGCAATTGGATGGGCTACTGCTATCTGTGATCGGGTGCCCAAGTCTAGTGTTGGTAGCTTGTGAAGATACATGATCAGAGATCTTATTCTGAAAGGAGTGTTTCAGGAGAAGTCCTTGGAGTTGGCAATAGGATATAGATAAAATTTGTATTAAAAGCTAATGTATGTATACACAACAATCATAGCAACCGAACAAGTTGTTTGTAATGTAACATGAAATACACTTGGATTGATTTACCAAGTGCAGTAATAGAATTGGAACTGACATTATTTATTAATTCTGCTTTTGGAATTACAAAAATCAATCTGATTTCATTCATGTCTGGTGCAGCAGCATCATTTGGTTTGTCGCAGTTGCAGGCAGCCTTGGGATTATACGAGGCTTGA

mRNA sequence

CAATTAGTTAGCCCAAAAGAAATACCCAAGCCCAACCCAACTCGTCTCCCTCGCTTCAACTCTCTCTCTCTCTCTCTCTCTCTCATCTTACACCGCCGGCAACTTCTCGAATTGCTTCTCTGCAATGGCGACCACAGCTTGTTTCATCATCGTCAGTAGGAACAACATCCCTATTTATGAAGCTGAAGTCGGATCTGCTGCTAAATATGATCCGGAGTACCACCCATCCCCAGCACTACGTACTTTTCCGTTGGAAAAAGTTTACCAATCAATGTATGCTTCTTCCTTCTTTAAGGATAAGCTATGGGAGAGGAGGAGGCCAACTCTGTCGGTGCCGATCGCATCCGCATCCTCTTCGATTCCTTCATCCTCTATTGTGCTGCCTCTTCAAGGGAACGTCTTTCCAAATGGGTTCTATAACGTTACCCTTTTTATAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCTGACACCGGTAGTGACCTCACTTGGCTTCAATGTGACGCTCCATGTCAGCAGTGCACTGAGACACCTCATCCGCTCTATCAACCAAGCAACGATCTTGTCCCGTGTAAGGACCCTCTGTGTATGTCCTTGCACTCATCTATTGACCACAGATGTGAGAACCCAGATCAATGTGACTACGAGGTTGAGTATGCAGATGGAGGTTCGTCTCTTGGAGTCCTTGTCAGGGATATTTTTCCTCTCAACTTAACCAATGGGGATCCAATTAGACCCCGTTTGACCCTCGGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAGTACTTGGCCTTGGAAAGGGAGCAGTAAGCATTGTTTCACAGCTGCACAATCAGGGCATTGTCCGTAACGTTGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGATATTTATGATCCGTATCGCTTAGCTTGGACGCCCATGTCACGGGACTACCCGAAGCACTACTCCCCTGGGTTTGGGGACCTATTCTTCAATGGAAGAAGTACTGGACTCAGAAACCTCTTCGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAATTATAACATCTTTGTTGAATAGAGAACTAACTGGGAAACCGCTAAGAGAAGCCAAGGATGATGACACGCTGCCGCTCTGCTGGAGAGGGCGGAATCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCATTGAGCTTTTCCAGTGGTAGAAGAAGCAAAGCAGTGTTCGAAATGCCAATGGAAAGTTATCTTATAATATCGTCCAAGGGGAATGTTTGCTTGGGAATTCTGAACGGCAGTGAAGTTGGGCTTGAGAACTCCAATATCATTGGTGATATTTCGATGCAAGATAAGATGGTAGTGTACAACAACGAGAAGCAAGCAATTGGATGGGCTACTGCTATCTGTGATCGGGTGCCCAAGTCTAGTGTTGCAGCATCATTTGGTTTGTCGCAGTTGCAGGCAGCCTTGGGATTATACGAGGCTTGA

Coding sequence (CDS)

ATGGCGACCACAGCTTGTTTCATCATCGTCAGTAGGAACAACATCCCTATTTATGAAGCTGAAGTCGGATCTGCTGCTAAATATGATCCGGAGTACCACCCATCCCCAGCACTACGTACTTTTCCGTTGGAAAAAGTTTACCAATCAATGTATGCTTCTTCCTTCTTTAAGGATAAGCTATGGGAGAGGAGGAGGCCAACTCTGTCGGTGCCGATCGCATCCGCATCCTCTTCGATTCCTTCATCCTCTATTGTGCTGCCTCTTCAAGGGAACGTCTTTCCAAATGGGTTCTATAACGTTACCCTTTTTATAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCTGACACCGGTAGTGACCTCACTTGGCTTCAATGTGACGCTCCATGTCAGCAGTGCACTGAGACACCTCATCCGCTCTATCAACCAAGCAACGATCTTGTCCCGTGTAAGGACCCTCTGTGTATGTCCTTGCACTCATCTATTGACCACAGATGTGAGAACCCAGATCAATGTGACTACGAGGTTGAGTATGCAGATGGAGGTTCGTCTCTTGGAGTCCTTGTCAGGGATATTTTTCCTCTCAACTTAACCAATGGGGATCCAATTAGACCCCGTTTGACCCTCGGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAGTACTTGGCCTTGGAAAGGGAGCAGTAAGCATTGTTTCACAGCTGCACAATCAGGGCATTGTCCGTAACGTTGTTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGATATTTATGATCCGTATCGCTTAGCTTGGACGCCCATGTCACGGGACTACCCGAAGCACTACTCCCCTGGGTTTGGGGACCTATTCTTCAATGGAAGAAGTACTGGACTCAGAAACCTCTTCGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAATTATAACATCTTTGTTGAATAGAGAACTAACTGGGAAACCGCTAAGAGAAGCCAAGGATGATGACACGCTGCCGCTCTGCTGGAGAGGGCGGAATCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCATTGAGCTTTTCCAGTGGTAGAAGAAGCAAAGCAGTGTTCGAAATGCCAATGGAAAGTTATCTTATAATATCGTCCAAGGGGAATGTTTGCTTGGGAATTCTGAACGGCAGTGAAGTTGGGCTTGAGAACTCCAATATCATTGGTGATATTTCGATGCAAGATAAGATGGTAGTGTACAACAACGAGAAGCAAGCAATTGGATGGGCTACTGCTATCTGTGATCGGGTGCCCAAGTCTAGTGTTGCAGCATCATTTGGTTTGTCGCAGTTGCAGGCAGCCTTGGGATTATACGAGGCTTGA
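
The mRNA and CDS records above are related in a simple way: the CDS is a contiguous stretch of the mRNA, and whatever precedes it is the 5' UTR. A minimal sketch of how this could be checked is given below. The function names are hypothetical, the stop-codon set assumes the standard genetic code, and the sequences are short toy strings rather than the full records.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumption)

def check_cds(cds):
    """Report basic sanity checks for a coding sequence."""
    cds = cds.upper()
    # Split into complete codons, ignoring any trailing partial codon.
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    return {
        "starts_with_atg": cds.startswith("ATG"),
        "length_multiple_of_3": len(cds) % 3 == 0,
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        "internal_stops": sum(c in STOP_CODONS for c in codons[:-1]),
    }

def five_prime_utr_length(mrna, cds):
    """Number of mRNA bases before the CDS, or -1 if the CDS is not found."""
    return mrna.upper().find(cds.upper())

# Toy example only; paste the real mRNA and CDS strings to check this record.
toy_cds = "ATGGCGACCTGA"
toy_mrna = "CAATTAG" + toy_cds
print(check_cds(toy_cds))
print(five_prime_utr_length(toy_mrna, toy_cds))  # 7
```

`str.find` is enough here because a spliced CDS appears in the mRNA as one unbroken substring. The same test against the gene sequence would fail wherever introns interrupt the coding exons.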
BLAST of CmoCh02G000010 vs. Swiss-Prot
Match: ASP1_ORYSJ (Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica GN=ASP1 PE=2 SV=1)

HSP 1 Score: 344.7 bits (883), Expect = 1.6e-93
Identity = 183/392 (46.68%), Positives = 259/392 (66.07%), Query Frame = 1

Query: 80  PSSSIVLPLQGNVFPNGFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHP 139
           PSS++VL L GNV+P G + +T+ IG P K YFLD DTGS LTWLQCDAPC  C   PH 
Sbjct: 20  PSSAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHV 79

Query: 140 LYQPS-NDLVPCKDPLCMSLHSSI--DHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLN 199
           LY+P+   LV C D LC  L++ +    RC +  QCDY ++Y D  SS+GVLV D F L+
Sbjct: 80  LYKPTPKKLVTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLS 139

Query: 200 LTNGDPIRPRLTLGCGYDQDPGSSSYH-PMDGVLGLGKGAVSIVSQLHNQGIV-RNVVGH 259
            +NG      +  GCGYDQ   + +   P+D +LGL +G V+++SQL +QG++ ++V+GH
Sbjct: 140 ASNGTN-PTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGH 199

Query: 260 CFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYPKHYSPGFGDLFF--NGRSTGLRNLFVVF 319
           C SSKGGG+LFFGD       + WTPM+R++ K+YSPG G L F  N ++     + V+F
Sbjct: 200 CISSKGGGFLFFGDAQVPTSGVTWTPMNREH-KYYSPGHGTLHFDSNSKAISAAPMAVIF 259

Query: 320 DSGSSYTYFNAQAYQ----IITSLLNRELTGKPLREAKDDD-TLPLCWRGRNPFKSLRDV 379
           DSG++YTYF AQ YQ    ++ S LN E   K L E  + D  L +CW+G++   ++ +V
Sbjct: 260 DSGATYTYFAAQPYQATLSVVKSTLNSEC--KFLTEVTEKDRALTVCWKGKDKIVTIDEV 319

Query: 380 RKYFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLGILNGSE--VGLENSNIIGDI 439
           +K F+ L+L F+ G + KA  E+P E YLIIS +G+VCLGIL+GS+  + L  +N+IG I
Sbjct: 320 KKCFRSLSLEFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGI 379

Query: 440 SMQDKMVVYNNEKQAIGWATAICDRVPKSSVA 458
           +M D+MV+Y++E+  +GW    CDR+P+S  A
Sbjct: 380 TMLDQMVIYDSERSLLGWVNYQCDRIPRSESA 405

BLAST of CmoCh02G000010 vs. Swiss-Prot
Match: ASP1_ORYSI (Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica GN=ASP1 PE=2 SV=2)

HSP 1 Score: 330.1 bits (845), Expect = 4.0e-89
Identity = 175/390 (44.87%), Positives = 255/390 (65.38%), Query Frame = 1

Query: 80  PSSSIVLPLQGNVFPNGFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHP 139
           PSS++VL L GNV+P G + VT+ IG P KPYFLD DTGS LTWLQCD PC  C + PH 
Sbjct: 20  PSSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHG 79

Query: 140 LYQPS-NDLVPCKDPLCMSLHSSI--DHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLN 199
           LY+P     V C +  C  L++ +    +C   +QC Y ++Y  GGSS+GVL+ D F L 
Sbjct: 80  LYKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLP 139

Query: 200 LTNGDPIRPRLTLGCGYDQDPGSSSY-HPMDGVLGLGKGAVSIVSQLHNQGIV-RNVVGH 259
            +NG      +  GCGY+Q   + +   P++G+LGLG+G V+++SQL +QG++ ++V+GH
Sbjct: 140 ASNGTN-PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGH 199

Query: 260 CFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYPKHYSPGFGDLFFNGRSTGL--RNLFVVF 319
           C SSKG G+LFFGD       + W+PM+R++ KHYSP  G L FN  S  +    + V+F
Sbjct: 200 CISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLQFNSNSKPISAAPMEVIF 259

Query: 320 DSGSSYTYFNAQAYQIITSLLNRELTG--KPLREAKDDD-TLPLCWRGRNPFKSLRDVRK 379
           DSG++YTYF  Q Y    S++   L+   K L E K+ D  L +CW+G++  +++ +V+K
Sbjct: 260 DSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKK 319

Query: 380 YFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLGILNGSE--VGLENSNIIGDISM 439
            F+ L+L F+ G + KA  E+P E YLIIS +G+VCLGIL+GS+    L  +N+IG I+M
Sbjct: 320 CFRSLSLKFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGITM 379

Query: 440 QDKMVVYNNEKQAIGWATAICDRVPKSSVA 458
            D+MV+Y++E+  +GW    CDR+P+S+ A
Sbjct: 380 LDQMVIYDSERSLLGWVNYQCDRIPRSASA 405

BLAST of CmoCh02G000010 vs. Swiss-Prot
Match: APCB1_ARATH (Aspartyl protease APCB1 OS=Arabidopsis thaliana GN=APCB1 PE=1 SV=1)

HSP 1 Score: 307.4 bits (786), Expect = 2.8e-82
Identity = 171/399 (42.86%), Positives = 243/399 (60.90%), Query Frame = 1

Query: 72  IASASSSIPSSSIVLPLQGNVFPNGFYNVTLFIGQPP--KPYFLDPDTGSDLTWLQCDAP 131
           +++++ SI SS+ + P+ GNV+P+G Y   + +G+P   + Y LD DTGS+LTW+QCDAP
Sbjct: 177 LSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAP 236

Query: 132 CQQCTETPHPLYQPSND-LVPCKDPLCMSLH-SSIDHRCENPDQCDYEVEYADGGSSLGV 191
           C  C +  + LY+P  D LV   +  C+ +  + +   CEN  QCDYE+EYAD   S+GV
Sbjct: 237 CTSCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGV 296

Query: 192 LVRDIFPLNLTNGDPIRPRLTLGCGYDQDPGS-SSYHPMDGVLGLGKGAVSIVSQLHNQG 251
           L +D F L L NG      +  GCGYDQ     ++    DG+LGL +  +S+ SQL ++G
Sbjct: 297 LTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRG 356

Query: 252 IVRNVVGHCFSS--KGGGYLFFGDDIYDPYRLAWTPMSRD--------YPKHYSPGFGDL 311
           I+ NVVGHC +S   G GY+F G D+   + + W PM  D             S G G L
Sbjct: 357 IISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGML 416

Query: 312 FFNGRSTGLRNLFVVFDSGSSYTYFNAQAY-QIITSLLNRELTGKPLREAKDDDTLPLCW 371
             +G +  +    V+FD+GSSYTYF  QAY Q++TSL  +E++G  L     D+TLP+CW
Sbjct: 417 SLDGENGRVGK--VLFDTGSSYTYFPNQAYSQLVTSL--QEVSGLELTRDDSDETLPICW 476

Query: 372 RGRN--PFKSLRDVRKYFKPLALSFSSGRR--SKAVFEMPMESYLIISSKGNVCLGILNG 431
           R +   PF SL DV+K+F+P+ L   S     S+ +   P E YLIIS+KGNVCLGIL+G
Sbjct: 477 RAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQP-EDYLIISNKGNVCLGILDG 536

Query: 432 SEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAICDR 451
           S V   ++ I+GDISM+  ++VY+N K+ IGW  + C R
Sbjct: 537 SSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVR 570

BLAST of CmoCh02G000010 vs. Swiss-Prot
Match: ASPL2_ARATH (Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=2)

HSP 1 Score: 139.8 bits (351), Expect = 7.6e-32
Identity = 113/391 (28.90%), Positives = 172/391 (43.99%), Query Frame = 1

Query: 82  SSIVLPLQGN--VFPNGFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHP 141
           +SI LPL G+  V   G Y   + +G PPK Y +  DTGSD+ W+ C  PC +C    + 
Sbjct: 56  ASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNL 115

Query: 142 LYQPS---------NDLVPCKDPLCMSLHSSIDHRCENPDQCDYEVEYADGGSSLGVLVR 201
            ++ S         +  V C D  C  +  S    C+    C Y + YAD  +S G  +R
Sbjct: 116 NFRLSLFDMNASSTSKKVGCDDDFCSFISQS--DSCQPALGCSYHIVYADESTSDGKFIR 175

Query: 202 DIFPLNLTNGD----PIRPRLTLGCGYDQDPG-SSSYHPMDGVLGLGKGAVSIVSQLHNQ 261
           D+  L    GD    P+   +  GCG DQ     +    +DGV+G G+   S++SQL   
Sbjct: 176 DMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAAT 235

Query: 262 GIVRNVVGHCFSS-KGGGYLFFGDDIYDPYRLAWTPMSRDYPKHYSPGFGDLFFNGRSTG 321
           G  + V  HC  + KGGG   F   + D  ++  TPM  +   HY+     +  +G S  
Sbjct: 236 GDAKRVFSHCLDNVKGGG--IFAVGVVDSPKVKTTPMVPN-QMHYNVMLMGMDVDGTSLD 295

Query: 322 L-----RNLFVVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAKDDDTLPLCWRGRN 381
           L     RN   + DSG++  YF    Y    SL+   L  +P++    ++T         
Sbjct: 296 LPRSIVRNGGTIVDSGTTLAYFPKVLYD---SLIETILARQPVKLHIVEETFQC------ 355

Query: 382 PFKSLRDVRKYFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLGILNGSEVGLENS 441
            F    +V + F P++  F    +      +    YL    +   C G   G     E S
Sbjct: 356 -FSFSTNVDEAFPPVSFEFEDSVK----LTVYPHDYLFTLEEELYCFGWQAGGLTTDERS 415

Query: 442 NII--GDISMQDKMVVYNNEKQAIGWATAIC 449
            +I  GD+ + +K+VVY+ + + IGWA   C
Sbjct: 416 EVILLGDLVLSNKLVVYDLDNEVIGWADHNC 426

BLAST of CmoCh02G000010 vs. Swiss-Prot
Match: APF1_ARATH (Aspartyl protease family protein 1 OS=Arabidopsis thaliana GN=APF1 PE=1 SV=1)

HSP 1 Score: 118.2 bits (295), Expect = 2.4e-25
Identity = 109/371 (29.38%), Positives = 159/371 (42.86%), Query Frame = 1

Query: 97  FYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAPCQQCT-ETPHP--------LYQPS--- 156
           + NVT  +G P   + +  DTGSDL WL CD  C  C  E   P        +Y P+   
Sbjct: 105 YANVT--VGTPSDWFMVALDTGSDLFWLPCD--CTNCVRELKAPGGSSLDLNIYSPNASS 164

Query: 157 -NDLVPCKDPLCMSLHSSIDHRCENPDQ-CDYEVEY-ADGGSSLGVLVRDIFPL--NLTN 216
            +  VPC   LC         RC +P+  C Y++ Y ++G SS GVLV D+  L  N  +
Sbjct: 165 TSTKVPCNSTLCTR-----GDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKS 224

Query: 217 GDPIRPRLTLGCGYDQDPGSSSYHPMDGVLGLGKGAVSIVSQLHNQGIVRNVVGHCFSSK 276
              I  R+T GCG  Q          +G+ GLG   +S+ S L  +GI  N    CF + 
Sbjct: 225 SKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGND 284

Query: 277 GGGYLFFGDDIYDPYRLAWTPMSRDYPKHYSPGFGDLFFNGRSTGLRNLFVVFDSGSSYT 336
           G G + FGD      R   TP++   P             G +TG      VFDSG+S+T
Sbjct: 285 GAGRISFGDKGSVDQR--ETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFT 344

Query: 337 YFNAQAYQIITSLLNRELTGKPLREAKDDDTLPLCWRGRNPFKSLRDVRKYFKPLALSFS 396
           Y    AY +I+   N     K  +    +     C+       +L   +  F+  A++ +
Sbjct: 345 YLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCY-------ALSPNKDSFQYPAVNLT 404

Query: 397 SGRRSKAVFEMPMESYLIISSKGN--VCLGILNGSEVGLENSNIIGDISMQDKMVVYNNE 449
               S      P+   ++I  K     CL I+      +E+ +IIG   M    VV++ E
Sbjct: 405 MKGGSSYPVYHPL---VVIPMKDTDVYCLAIMK-----IEDISIIGQNFMTGYRVVFDRE 449

BLAST of CmoCh02G000010 vs. TrEMBL
Match: A0A0A0LKB0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G338820 PE=3 SV=1)

HSP 1 Score: 696.4 bits (1796), Expect = 2.4e-197
Identity = 323/352 (91.76%), Positives = 337/352 (95.74%), Query Frame = 1

Query: 107 PPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYQPSNDLVPCKDPLCMSLHSSIDHRC 166
           PPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLYQPSNDLVPCKDPLCMSLHSS+DHRC
Sbjct: 35  PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRC 94

Query: 167 ENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLTLGCGYDQDPGSSSYHPMD 226
           ENPDQCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRL LGCGYDQDPGSSSYHPMD
Sbjct: 95  ENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD 154

Query: 227 GVLGLGKGAVSIVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYP 286
           G+LGLG+GAVSIVSQLHNQGIVRNVVGHCF+SKGGGYLFFGD IYDPYRL WTPMSRDYP
Sbjct: 155 GILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVWTPMSRDYP 214

Query: 287 KHYSPGFGDLFFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAK 346
           KHYSPGFG+L FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ++TSLLNREL GKPLREA 
Sbjct: 215 KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAM 274

Query: 347 DDDTLPLCWRGRNPFKSLRDVRKYFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCL 406
           DDDTLPLCWRGR P KSLRDVRKYFKPLALSFSSG RSKAVFE+P E Y+IISS GNVCL
Sbjct: 275 DDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPTEGYMIISSMGNVCL 334

Query: 407 GILNGSEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAICDRVPKSSVAA 459
           GILNG++VGLENSNIIGDISMQDKMVVYNNEKQAIGWATA CDRVPKS V++
Sbjct: 335 GILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQVSS 386

BLAST of CmoCh02G000010 vs. TrEMBL
Match: A0A165Z0G7_DAUCA (Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_013175 PE=4 SV=1)

HSP 1 Score: 595.1 bits (1533), Expect = 7.4e-167
Identity = 274/384 (71.35%), Positives = 328/384 (85.42%), Query Frame = 1

Query: 73  ASASSSIPSS---SIVLPLQGNVFPNGFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAP 132
           + ASSS+ SS   S+VLPL GNV+P+G+Y+V   IGQPPKPYFLDPDTGSDLTWLQCDAP
Sbjct: 39  SGASSSVVSSVGSSVVLPLYGNVYPSGYYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAP 98

Query: 133 CQQCTETPHPLYQPSNDLVPCKDPLCMSLHSSIDHRCENPDQCDYEVEYADGGSSLGVLV 192
           C QCT  PHPLYQP+NDLV CKDP+C SLH   ++RC++PDQCDYEVEYADGGSS+GVLV
Sbjct: 99  CIQCTPAPHPLYQPTNDLVVCKDPICASLHPD-NYRCDDPDQCDYEVEYADGGSSIGVLV 158

Query: 193 RDIFPLNLTNGDPIRPRLTLGCGYDQDPGSSSYHPMDGVLGLGKGAVSIVSQLHNQGIVR 252
            D+FP+NLT+G   RPRLT+GCGYDQ PG  +YHP+DGVLGLG+G+ SIV+QL +QG+VR
Sbjct: 159 NDLFPVNLTSGMRARPRLTIGCGYDQLPGI-AYHPLDGVLGLGRGSSSIVAQLSSQGLVR 218

Query: 253 NVVGHCFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYPKHYSPGFGDLFFNGRSTGLRNLF 312
           NVVGHCFS +GGGYLFFGDDIYD  ++ WTPMSRDY KHY+PGF +L  NGRS+GL+NL 
Sbjct: 219 NVVGHCFSRRGGGYLFFGDDIYDSSKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLL 278

Query: 313 VVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAKDDDTLPLCWRGRNPFKSLRDVRK 372
           VVFDSGSSYTYFN Q YQ + S + ++L GKPL+EA +DDTLP+CWRG+ PFKS+RD +K
Sbjct: 279 VVFDSGSSYTYFNTQTYQTLLSFIKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKK 338

Query: 373 YFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLGILNGSEVGLENSNIIGDISMQD 432
           YFKPLALSF SG ++K+ FE+  ESYLIISSKG+VCLGILNG+EVGL+N NIIGDISMQ+
Sbjct: 339 YFKPLALSFGSGWKTKSQFEIQQESYLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQE 398

Query: 433 KMVVYNNEKQAIGWATAICDRVPK 454
           K+V+Y+NEKQ IGW  + CDR PK
Sbjct: 399 KLVIYDNEKQVIGWQPSNCDRPPK 420

BLAST of CmoCh02G000010 vs. TrEMBL
Match: Q5NT86_DAUCA (Nucellin-like protein OS=Daucus carota GN=DcNLP PE=3 SV=1)

HSP 1 Score: 595.1 bits (1533), Expect = 7.4e-167
Identity = 274/384 (71.35%), Positives = 328/384 (85.42%), Query Frame = 1

Query: 73  ASASSSIPSS---SIVLPLQGNVFPNGFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAP 132
           + ASSS+ SS   S+VLPL GNV+P+G+Y+V   IGQPPKPYFLDPDTGSDLTWLQCDAP
Sbjct: 39  SGASSSVVSSVGSSVVLPLYGNVYPSGYYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAP 98

Query: 133 CQQCTETPHPLYQPSNDLVPCKDPLCMSLHSSIDHRCENPDQCDYEVEYADGGSSLGVLV 192
           C QCT  PHPLYQP+NDLV CKDP+C SLH   ++RC++PDQCDYEVEYADGGSS+GVLV
Sbjct: 99  CIQCTPAPHPLYQPTNDLVVCKDPICASLHPD-NYRCDDPDQCDYEVEYADGGSSIGVLV 158

Query: 193 RDIFPLNLTNGDPIRPRLTLGCGYDQDPGSSSYHPMDGVLGLGKGAVSIVSQLHNQGIVR 252
            D+FP+NLT+G   RPRLT+GCGYDQ PG  +YHP+DGVLGLG+G+ SIV+QL +QG+VR
Sbjct: 159 NDLFPVNLTSGMRARPRLTIGCGYDQLPGI-AYHPLDGVLGLGRGSSSIVAQLSSQGLVR 218

Query: 253 NVVGHCFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYPKHYSPGFGDLFFNGRSTGLRNLF 312
           NVVGHCFS +GGGYLFFGDDIYD  ++ WTPMSRDY KHY+PGF +L  NGRS+GL+NL 
Sbjct: 219 NVVGHCFSRRGGGYLFFGDDIYDSSKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLL 278

Query: 313 VVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAKDDDTLPLCWRGRNPFKSLRDVRK 372
           VVFDSGSSYTYFN Q YQ + S + ++L GKPL+EA +DDTLP+CWRG+ PFKS+RD +K
Sbjct: 279 VVFDSGSSYTYFNTQTYQTLLSFIKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKK 338

Query: 373 YFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLGILNGSEVGLENSNIIGDISMQD 432
           YFKPLALSF SG ++K+ FE+  ESYLIISSKG+VCLGILNG+EVGL+N NIIGDISMQ+
Sbjct: 339 YFKPLALSFGSGWKTKSQFEIQQESYLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQE 398

Query: 433 KMVVYNNEKQAIGWATAICDRVPK 454
           K+V+Y+NEKQ IGW  + CDR PK
Sbjct: 399 KLVIYDNEKQVIGWQPSNCDRPPK 420

BLAST of CmoCh02G000010 vs. TrEMBL
Match: M5WHY2_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005961mg PE=3 SV=1)

HSP 1 Score: 594.7 bits (1532), Expect = 9.6e-167
Identity = 280/405 (69.14%), Positives = 327/405 (80.74%), Query Frame = 1

Query: 52  ASSFFKDKLWERRRPTLSVPIASASSSI--PSSSIVLPLQGNVFPNGFYNVTLFIGQPPK 111
           +S+ F D+    RR T+    A++S  +   +SSIVLP+ GNV+P G YNVTL IGQPPK
Sbjct: 26  SSASFGDQYHRGRRKTMLPDEATSSLGLNRAASSIVLPVHGNVYPIGSYNVTLNIGQPPK 85

Query: 112 PYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYQPSNDLVPCKDPLCMSLHSSIDHRCENP 171
           PYFLDPDTGSDLTWLQCDAPC +CTE PHP Y+P+NDLV CKDPLC +LH+   H+C+NP
Sbjct: 86  PYFLDPDTGSDLTWLQCDAPCVRCTEAPHPFYRPNNDLVVCKDPLCEALHAPGSHKCDNP 145

Query: 172 DQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLTLGCGYDQDPGSSSYHPMDGVL 231
           +QCDYEVEYADGGSSLGVLVRD F LN TNG+     L LGCGYDQ PGS SYHP+DGVL
Sbjct: 146 EQCDYEVEYADGGSSLGVLVRDAFLLNFTNGNQRTTHLALGCGYDQLPGS-SYHPIDGVL 205

Query: 232 GLGKGAVSIVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYPKHY 291
           GLGKG  SIVSQL NQG+VR+V+GHC S +GGG+ F GD +YD  R+ WTPMS DY KHY
Sbjct: 206 GLGKGKSSIVSQLSNQGLVRHVIGHCLSGRGGGFFFLGDGLYDSSRIVWTPMSPDYAKHY 265

Query: 292 SPGFGDLFFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAKDDD 351
           SPG  +L   G+STG RNL +VFDSGSSYTY N+QAYQ +TS L RELTGKPL+EA DD 
Sbjct: 266 SPGLAELIVGGKSTGFRNLVMVFDSGSSYTYLNSQAYQFLTSWLKRELTGKPLKEALDDR 325

Query: 352 TLPLCWRGRNPFKSLRDVRKYFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLGIL 411
           TLPLCW+GR PF+++RDV+ YFKPLAL F+SGR+    FE+P E+YLIISSKGNVCLGIL
Sbjct: 326 TLPLCWKGRKPFRNIRDVKTYFKPLALRFASGRKDTTQFELPPEAYLIISSKGNVCLGIL 385

Query: 412 NGSEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAICDRVPKS 455
           NGSEVGL+NSNIIGDISMQDKMV+Y+NEKQ IGW    CD++PKS
Sbjct: 386 NGSEVGLQNSNIIGDISMQDKMVIYDNEKQMIGWGPGNCDKLPKS 429

BLAST of CmoCh02G000010 vs. TrEMBL
Match: W9SFH5_9ROSA (Aspartic proteinase Asp1 OS=Morus notabilis GN=L484_027908 PE=3 SV=1)

HSP 1 Score: 588.2 bits (1515), Expect = 9.0e-165
Identity = 274/394 (69.54%), Positives = 326/394 (82.74%), Query Frame = 1

Query: 63  RRRPTLSVP-IASASSSIPSSSIVLPLQGNVFPNGFYNVTLFIGQPPKPYFLDPDTGSDL 122
           RR+ T  VP  +S   +   SS+V P+ GNV+P GFYNVTL IGQPPKPYFLDPDTGSDL
Sbjct: 34  RRKSTHPVPGTSSFELNRVGSSVVFPIHGNVYPIGFYNVTLNIGQPPKPYFLDPDTGSDL 93

Query: 123 TWLQCDAPCQQCTETPHPLYQPSNDLVPCKDPLCMSLHSSIDHRCENPDQCDYEVEYADG 182
           TWLQCDAPC QCTETPHPLY+PSNDLV C+DPLC++LH     +C+NP+QCDYEVEYADG
Sbjct: 94  TWLQCDAPCVQCTETPHPLYRPSNDLVGCRDPLCIALHLPGTPKCDNPEQCDYEVEYADG 153

Query: 183 GSSLGVLVRDIFPLNLTNGDPIRPRLTLGCGYDQDPGSSSYHPMDGVLGLGKGAVSIVSQ 242
           GSSLGVLV+D F  N T GD ++PRL LGCGYDQ PGSS   P+DGVLGLG+G  SIVSQ
Sbjct: 154 GSSLGVLVKDAFYFNSTKGDQLKPRLALGCGYDQVPGSSHPLPLDGVLGLGRGKTSIVSQ 213

Query: 243 LHNQGIVRNVVGHCFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYPKHYSPGFGDLFFNGR 302
           LH+QG++RNVVGHC S +GGG+LFFGD++YD  R+ WTPMS DY KHYSPG  +L F+G+
Sbjct: 214 LHSQGLMRNVVGHCLSGRGGGFLFFGDNVYDSSRVDWTPMSSDYLKHYSPGSAELRFDGK 273

Query: 303 STGLRNLFVVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAKDDDTLPLCWRGRNPF 362
            TGL+NL  VFDSGSSYTY  +QAYQ +T L+ REL  K LREA DD TLPLCW+G+ PF
Sbjct: 274 PTGLKNLLTVFDSGSSYTYLTSQAYQTLTFLIKRELPRKVLREATDDQTLPLCWKGKRPF 333

Query: 363 KSLRDVRKYFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLGILNGSEVGLENSNI 422
           K + DVRKYFKPLAL F++G ++K  +E+P E+YLI+SSKGNVCLGILNGSE+GL+NSNI
Sbjct: 334 KRVSDVRKYFKPLALDFTTGGKTK-TYELPPEAYLIVSSKGNVCLGILNGSEIGLQNSNI 393

Query: 423 IGDISMQDKMVVYNNEKQAIGWATAICDRVPKSS 456
           IGDISMQDKMV+Y+NEKQ IGWA+A CD++PK+S
Sbjct: 394 IGDISMQDKMVIYDNEKQMIGWASANCDKLPKTS 426

BLAST of CmoCh02G000010 vs. TAIR10
Match: AT4G33490.2 (AT4G33490.2 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 582.8 bits (1501), Expect = 1.9e-166
Identity = 261/377 (69.23%), Positives = 317/377 (84.08%), Query Frame = 1

Query: 82  SSIVLPLQGNVFPNGFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLY 141
           SS+V P+ GNV+P G+YNVT+ IGQPP+PY+LD DTGSDLTWLQCDAPC +C E PHPLY
Sbjct: 44  SSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 103

Query: 142 QPSNDLVPCKDPLCMSLHSSIDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGD 201
           QPS+DL+PC DPLC +LH + + RCE P+QCDYEVEYADGGSSLGVLVRD+F +N T G 
Sbjct: 104 QPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGL 163

Query: 202 PIRPRLTLGCGYDQDPGSSSYHPMDGVLGLGKGAVSIVSQLHNQGIVRNVVGHCFSSKGG 261
            + PRL LGCGYDQ PG+SS+HP+DGVLGLG+G VSI+SQLH+QG V+NV+GHC SS GG
Sbjct: 164 RLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 223

Query: 262 GYLFFGDDIYDPYRLAWTPMSRDYPKHYSPGF-GDLFFNGRSTGLRNLFVVFDSGSSYTY 321
           G LFFGDD+YD  R++WTPMSR+Y KHYSP   G+L F GR+TGL+NL  VFDSGSSYTY
Sbjct: 224 GILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTY 283

Query: 322 FNAQAYQIITSLLNRELTGKPLREAKDDDTLPLCWRGRNPFKSLRDVRKYFKPLALSFSS 381
           FN++AYQ +T LL REL+GKPL+EA+DD TLPLCW+GR PF S+ +V+KYFKPLALSF +
Sbjct: 284 FNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKT 343

Query: 382 GRRSKAVFEMPMESYLIISSKGNVCLGILNGSEVGLENSNIIGDISMQDKMVVYNNEKQA 441
           G RSK +FE+P E+YLIIS KGNVCLGILNG+E+GL+N N+IGDISMQD+M++Y+NEKQ+
Sbjct: 344 GWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQS 403

Query: 442 IGWATAICDRVPKSSVA 458
           IGW    CD +     A
Sbjct: 404 IGWMPVDCDELASLKAA 420

BLAST of CmoCh02G000010 vs. TAIR10
Match: AT1G44130.1 (AT1G44130.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 439.1 bits (1128), Expect = 3.4e-123
Identity = 203/376 (53.99%), Positives = 278/376 (73.94%), Query Frame = 1

Query: 82  SSIVLPLQGNVFPNGFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLY 141
           SS+V PL GNVFP G+Y+V + IG PPK +  D DTGSDLTW+QCDAPC  CT  P+  Y
Sbjct: 33  SSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCTLPPNLQY 92

Query: 142 QPSNDLVPCKDPLCMSLHSSIDHRCENP-DQCDYEVEYADGGSSLGVLVRDIFPLNLTNG 201
           +P  +++PC +P+C +LH      C NP +QCDYEV+YAD GSS+G LV D FPL L NG
Sbjct: 93  KPKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGSSMGALVTDQFPLKLVNG 152

Query: 202 DPIRPRLTLGCGYDQDPGSSSYHPMD-GVLGLGKGAVSIVSQLHNQGIVRNVVGHCFSSK 261
             ++P +  GCGYDQ   S+   P   GVLGLG+G + +++QL + G+ RNVVGHC SSK
Sbjct: 153 SFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSK 212

Query: 262 GGGYLFFGDDIYDPYRLAWTPM-SRDYPKHYSPGFGDLFFNGRSTGLRNLFVVFDSGSSY 321
           GGG+LFFGD++     +AWTP+ S+D   HY+ G  DL FNG+ TGL+ L ++FD+GSSY
Sbjct: 213 GGGFLFFGDNLVPSIGVAWTPLLSQD--NHYTTGPADLLFNGKPTGLKGLKLIFDTGSSY 272

Query: 322 TYFNAQAYQIITSLLNRELTGKPLREAKDDDTLPLCWRGRNPFKSLRDVRKYFKPLALSF 381
           TYFN++AYQ I +L+  +L   PL+ AK+D TLP+CW+G  PFKS+ +V+ +FK + ++F
Sbjct: 273 TYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINF 332

Query: 382 SSGRRSKAVFEMPMESYLIISSKGNVCLGILNGSEVGLENSNIIGDISMQDKMVVYNNEK 441
           ++GRR+  ++  P E YLI+S  GNVCLG+LNGSEVGL+NSN+IGDISMQ  M++Y+NEK
Sbjct: 333 TNGRRNTQLYLAP-ELYLIVSKTGNVCLGLLNGSEVGLQNSNVIGDISMQGLMMIYDNEK 392

Query: 442 QAIGWATAICDRVPKS 455
           Q +GW ++ C+++PK+
Sbjct: 393 QQLGWVSSDCNKLPKT 405

BLAST of CmoCh02G000010 vs. TAIR10
Match: AT1G77480.1 (AT1G77480.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 406.8 bits (1044), Expect = 1.9e-113
Identity = 194/375 (51.73%), Positives = 262/375 (69.87%), Query Frame = 1

Query: 81  SSSIVLPLQGNVFPNGFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPL 140
           SS++V P+ GNV+P G+Y V L IG PPK + LD DTGSDLTW+QCDAPC  CT+     
Sbjct: 50  SSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ 109

Query: 141 YQPSNDLVPCKDPLCMSLHSSIDHRCENP-DQCDYEVEYADGGSSLGVLVRDIFPLNLTN 200
           Y+P+++ +PC   LC  L    D  C +P DQCDYE+ Y+D  SS+G LV D  PL L N
Sbjct: 110 YKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLAN 169

Query: 201 GDPIRPRLTLGCGYDQ-DPGSSSYHPMDGVLGLGKGAVSIVSQLHNQGIVRNVVGHCFSS 260
           G  +  RLT GCGYDQ +PG     P  G+LGLG+G V + +QL + GI +NV+ HC S 
Sbjct: 170 GSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSH 229

Query: 261 KGGGYLFFGDDIYDPYRLAWTPMSRDYP-KHYSPGFGDLFFNGRSTGLRNLFVVFDSGSS 320
            G G+L  GD++     + WT ++ + P K+Y  G  +L FN ++TG++ + VVFDSGSS
Sbjct: 230 TGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSS 289

Query: 321 YTYFNAQAYQIITSLLNRELTGKPLREAKDDDTLPLCWRGRNPFKSLRDVRKYFKPLALS 380
           YTYFNA+AYQ I  L+ ++L GKPL + KDD +LP+CW+G+ P KSL +V+KYFK + L 
Sbjct: 290 YTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLR 349

Query: 381 FSSGRRSKAVFEMPMESYLIISSKGNVCLGILNGSEVGLENSNIIGDISMQDKMVVYNNE 440
           F + +++  +F++P ESYLII+ KG VCLGILNG+E+GLE  NIIGDIS Q  MV+Y+NE
Sbjct: 350 FGN-QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNE 409

Query: 441 KQAIGWATAICDRVP 453
           KQ IGW ++ CD++P
Sbjct: 410 KQRIGWISSDCDKLP 423

BLAST of CmoCh02G000010 vs. TAIR10
Match: AT1G49050.1 (AT1G49050.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 307.4 bits (786), Expect = 1.6e-83
Identity = 171/399 (42.86%), Positives = 243/399 (60.90%), Query Frame = 1

Query: 72  IASASSSIPSSSIVLPLQGNVFPNGFYNVTLFIGQPP--KPYFLDPDTGSDLTWLQCDAP 131
           +++++ SI SS+ + P+ GNV+P+G Y   + +G+P   + Y LD DTGS+LTW+QCDAP
Sbjct: 177 LSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAP 236

Query: 132 CQQCTETPHPLYQPSND-LVPCKDPLCMSLH-SSIDHRCENPDQCDYEVEYADGGSSLGV 191
           C  C +  + LY+P  D LV   +  C+ +  + +   CEN  QCDYE+EYAD   S+GV
Sbjct: 237 CTSCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGV 296

Query: 192 LVRDIFPLNLTNGDPIRPRLTLGCGYDQDPGS-SSYHPMDGVLGLGKGAVSIVSQLHNQG 251
           L +D F L L NG      +  GCGYDQ     ++    DG+LGL +  +S+ SQL ++G
Sbjct: 297 LTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRG 356

Query: 252 IVRNVVGHCFSS--KGGGYLFFGDDIYDPYRLAWTPMSRD--------YPKHYSPGFGDL 311
           I+ NVVGHC +S   G GY+F G D+   + + W PM  D             S G G L
Sbjct: 357 IISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGML 416

Query: 312 FFNGRSTGLRNLFVVFDSGSSYTYFNAQAY-QIITSLLNRELTGKPLREAKDDDTLPLCW 371
             +G +  +    V+FD+GSSYTYF  QAY Q++TSL  +E++G  L     D+TLP+CW
Sbjct: 417 SLDGENGRVGK--VLFDTGSSYTYFPNQAYSQLVTSL--QEVSGLELTRDDSDETLPICW 476

Query: 372 RGRN--PFKSLRDVRKYFKPLALSFSSGRR--SKAVFEMPMESYLIISSKGNVCLGILNG 431
           R +   PF SL DV+K+F+P+ L   S     S+ +   P E YLIIS+KGNVCLGIL+G
Sbjct: 477 RAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQP-EDYLIISNKGNVCLGILDG 536

Query: 432 SEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAICDR 451
           S V   ++ I+GDISM+  ++VY+N K+ IGW  + C R
Sbjct: 537 SSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVR 570

BLAST of CmoCh02G000010 vs. TAIR10
Match: AT5G22850.1 (AT5G22850.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 141.4 bits (355), Expect = 1.5e-33
Identity = 106/408 (25.98%), Positives = 178/408 (43.63%), Query Frame = 1

Query: 84  IVLPLQGNVFPN--GFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPH--- 143
           I  P+ G   P   G Y   L +G PP+ +++  DTGSD+ W+ C A C  C +T     
Sbjct: 65  IDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGCPQTSGLQI 124

Query: 144 --PLYQPSNDL----VPCKDPLCMSLHSSIDHRCENPDQ-CDYEVEYADGGSSLGVLVRD 203
               + P + +    + C D  C     S D  C   +  C Y  +Y DG  + G  V D
Sbjct: 125 QLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSD 184

Query: 204 IFPLNLTNGDPIRPRLT----LGCGYDQDPGS-SSYHPMDGVLGLGKGAVSIVSQLHNQG 263
           +   ++  G  + P  T     GC   Q      S   +DG+ G G+  +S++SQL +QG
Sbjct: 185 VLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQG 244

Query: 264 IVRNVVGHCFSSK--GGGYLFFGDDIYDPYRLAWTPMSRDYPKHYSPGFGDLFFNGRSTG 323
           I   V  HC   +  GGG L  G +I +P  + +TP+    P HY+     +  NG++  
Sbjct: 245 IAPRVFSHCLKGENGGGGILVLG-EIVEP-NMVFTPLVPSQP-HYNVNLLSISVNGQALP 304

Query: 324 LR-NLF-------VVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAKDDDTLPLCWR 383
           +  ++F        + D+G++  Y +  AY             + +  A      P+  +
Sbjct: 305 INPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV---------EAITNAVSQSVRPVVSK 364

Query: 384 GRNPFKSLRDVRKYFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLGILNGSEVGL 443
           G   +     V   F P++L+F+ G    ++F  P +  +  ++ G   +  +    +  
Sbjct: 365 GNQCYVITTSVGDIFPPVSLNFAGG---ASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQN 424

Query: 444 ENSNIIGDISMQDKMVVYNNEKQAIGWATAICDRVPKSSVAASFGLSQ 465
           +   I+GD+ ++DK+ VY+   Q IGWA   C      S  +S G S+
Sbjct: 425 QGITILGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNVSATSSSGRSE 456

BLAST of CmoCh02G000010 vs. NCBI nr
Match: gi|778670347|ref|XP_004147327.2| (PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis sativus])

HSP 1 Score: 780.8 bits (2015), Expect = 1.4e-222
Identity = 367/407 (90.17%), Positives = 386/407 (94.84%), Query Frame = 1

Query: 52  ASSFFKDKLWERRRPTLSVPIASASSSIPSSSIVLPLQGNVFPNGFYNVTLFIGQPPKPY 111
           ASSFFKDK WER+RP LSVP  +ASSS  SSSIVLPLQGNV+PNGFYNVTL++GQPPKPY
Sbjct: 24  ASSFFKDKPWERKRPILSVP--TASSSFASSSIVLPLQGNVYPNGFYNVTLYVGQPPKPY 83

Query: 112 FLDPDTGSDLTWLQCDAPCQQCTETPHPLYQPSNDLVPCKDPLCMSLHSSIDHRCENPDQ 171
           FLDPDTGSDLTWLQCDAPCQQCTET HPLYQPSNDLVPCKDPLCMSLHSS+DHRCENPDQ
Sbjct: 84  FLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQ 143

Query: 172 CDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLTLGCGYDQDPGSSSYHPMDGVLGL 231
           CDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRL LGCGYDQDPGSSSYHPMDG+LGL
Sbjct: 144 CDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGL 203

Query: 232 GKGAVSIVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYPKHYSP 291
           G+GAVSIVSQLHNQGIVRNVVGHCF+SKGGGYLFFGD IYDPYRL WTPMSRDYPKHYSP
Sbjct: 204 GRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSP 263

Query: 292 GFGDLFFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAKDDDTL 351
           GFG+L FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ++TSLLNREL GKPLREA DDDTL
Sbjct: 264 GFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTL 323

Query: 352 PLCWRGRNPFKSLRDVRKYFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLGILNG 411
           PLCWRGR P KSLRDVRKYFKPLALSFSSG RSKAVFE+P E Y+IISS GNVCLGILNG
Sbjct: 324 PLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPTEGYMIISSMGNVCLGILNG 383

Query: 412 SEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAICDRVPKSSVAA 459
           ++VGLENSNIIGDISMQDKMVVYNNEKQAIGWATA CDRVPKS V++
Sbjct: 384 TDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQVSS 428

BLAST of CmoCh02G000010 vs. NCBI nr
Match: gi|659121807|ref|XP_008460823.1| (PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo])

HSP 1 Score: 778.5 bits (2009), Expect = 6.8e-222
Identity = 366/407 (89.93%), Positives = 386/407 (94.84%), Query Frame = 1

Query: 52  ASSFFKDKLWERRRPTLSVPIASASSSIPSSSIVLPLQGNVFPNGFYNVTLFIGQPPKPY 111
           ASSFFKDK WER+RP LSVP  +ASSS  SSSIVLPLQGNV+PNGFYNVTL++GQPPKPY
Sbjct: 24  ASSFFKDKPWERKRPILSVP--TASSSFASSSIVLPLQGNVYPNGFYNVTLYVGQPPKPY 83

Query: 112 FLDPDTGSDLTWLQCDAPCQQCTETPHPLYQPSNDLVPCKDPLCMSLHSSIDHRCENPDQ 171
           FLDPDTGSDLTWLQCDAPCQQCTET HPLYQPSNDLVPCKDPLCMSLHSS+DHRCENPDQ
Sbjct: 84  FLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQ 143

Query: 172 CDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLTLGCGYDQDPGSSSYHPMDGVLGL 231
           CDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRL LGCGYDQDPGSSSYHPMDG+LGL
Sbjct: 144 CDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGL 203

Query: 232 GKGAVSIVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYPKHYSP 291
           G+GAVSIVSQLHNQGIVRNVVGHCF+SKGGGYLFFGD IYDPYRL WTPMSRDYPKHYSP
Sbjct: 204 GRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSP 263

Query: 292 GFGDLFFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAKDDDTL 351
           GFG+L FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ++TSLLNREL GKPLREA DDDTL
Sbjct: 264 GFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTL 323

Query: 352 PLCWRGRNPFKSLRDVRKYFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLGILNG 411
           PLCWR R P KSLRDVRKYFKPLALSFSSG RSKAVFE+P+E Y+IISS GNVCLGILNG
Sbjct: 324 PLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPIEGYMIISSMGNVCLGILNG 383

Query: 412 SEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAICDRVPKSSVAA 459
           ++VGLENSNIIGDISMQDKMVVYNNEKQAIGWATA CDRVPKS V++
Sbjct: 384 TDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQVSS 428

BLAST of CmoCh02G000010 vs. NCBI nr
Match: gi|778670345|ref|XP_011649449.1| (PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis sativus])

HSP 1 Score: 775.0 bits (2000), Expect = 7.5e-221
Identity = 367/411 (89.29%), Positives = 386/411 (93.92%), Query Frame = 1

Query: 52  ASSFFKDKLWERRRPTLSVPIASASSSIPSSSIVLPLQGNVFPNGFYNVTLFIGQPPKPY 111
           ASSFFKDK WER+RP LSVP  +ASSS  SSSIVLPLQGNV+PNGFYNVTL++GQPPKPY
Sbjct: 24  ASSFFKDKPWERKRPILSVP--TASSSFASSSIVLPLQGNVYPNGFYNVTLYVGQPPKPY 83

Query: 112 FLDPDTGSDLTWLQCDAPCQQCTETPHPLYQPSNDLVPCKDPLCMSLHSSIDHRCENPDQ 171
           FLDPDTGSDLTWLQCDAPCQQCTET HPLYQPSNDLVPCKDPLCMSLHSS+DHRCENPDQ
Sbjct: 84  FLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQ 143

Query: 172 CDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLTLG----CGYDQDPGSSSYHPMDG 231
           CDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRL LG    CGYDQDPGSSSYHPMDG
Sbjct: 144 CDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCQLICGYDQDPGSSSYHPMDG 203

Query: 232 VLGLGKGAVSIVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYPK 291
           +LGLG+GAVSIVSQLHNQGIVRNVVGHCF+SKGGGYLFFGD IYDPYRL WTPMSRDYPK
Sbjct: 204 ILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVWTPMSRDYPK 263

Query: 292 HYSPGFGDLFFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAKD 351
           HYSPGFG+L FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ++TSLLNREL GKPLREA D
Sbjct: 264 HYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMD 323

Query: 352 DDTLPLCWRGRNPFKSLRDVRKYFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLG 411
           DDTLPLCWRGR P KSLRDVRKYFKPLALSFSSG RSKAVFE+P E Y+IISS GNVCLG
Sbjct: 324 DDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPTEGYMIISSMGNVCLG 383

Query: 412 ILNGSEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAICDRVPKSSVAA 459
           ILNG++VGLENSNIIGDISMQDKMVVYNNEKQAIGWATA CDRVPKS V++
Sbjct: 384 ILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQVSS 432

BLAST of CmoCh02G000010 vs. NCBI nr
Match: gi|659121805|ref|XP_008460822.1| (PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis melo])

HSP 1 Score: 772.7 bits (1994), Expect = 3.7e-220
Identity = 366/411 (89.05%), Positives = 386/411 (93.92%), Query Frame = 1

Query: 52  ASSFFKDKLWERRRPTLSVPIASASSSIPSSSIVLPLQGNVFPNGFYNVTLFIGQPPKPY 111
           ASSFFKDK WER+RP LSVP  +ASSS  SSSIVLPLQGNV+PNGFYNVTL++GQPPKPY
Sbjct: 24  ASSFFKDKPWERKRPILSVP--TASSSFASSSIVLPLQGNVYPNGFYNVTLYVGQPPKPY 83

Query: 112 FLDPDTGSDLTWLQCDAPCQQCTETPHPLYQPSNDLVPCKDPLCMSLHSSIDHRCENPDQ 171
           FLDPDTGSDLTWLQCDAPCQQCTET HPLYQPSNDLVPCKDPLCMSLHSS+DHRCENPDQ
Sbjct: 84  FLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQ 143

Query: 172 CDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLTLG----CGYDQDPGSSSYHPMDG 231
           CDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRL LG    CGYDQDPGSSSYHPMDG
Sbjct: 144 CDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCQLICGYDQDPGSSSYHPMDG 203

Query: 232 VLGLGKGAVSIVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYPK 291
           +LGLG+GAVSIVSQLHNQGIVRNVVGHCF+SKGGGYLFFGD IYDPYRL WTPMSRDYPK
Sbjct: 204 ILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVWTPMSRDYPK 263

Query: 292 HYSPGFGDLFFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAKD 351
           HYSPGFG+L FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ++TSLLNREL GKPLREA D
Sbjct: 264 HYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMD 323

Query: 352 DDTLPLCWRGRNPFKSLRDVRKYFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCLG 411
           DDTLPLCWR R P KSLRDVRKYFKPLALSFSSG RSKAVFE+P+E Y+IISS GNVCLG
Sbjct: 324 DDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPIEGYMIISSMGNVCLG 383

Query: 412 ILNGSEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAICDRVPKSSVAA 459
           ILNG++VGLENSNIIGDISMQDKMVVYNNEKQAIGWATA CDRVPKS V++
Sbjct: 384 ILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQVSS 432

BLAST of CmoCh02G000010 vs. NCBI nr
Match: gi|700207119|gb|KGN62238.1| (hypothetical protein Csa_2G338820 [Cucumis sativus])

HSP 1 Score: 696.4 bits (1796), Expect = 3.4e-197
Identity = 323/352 (91.76%), Positives = 337/352 (95.74%), Query Frame = 1

Query: 107 PPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYQPSNDLVPCKDPLCMSLHSSIDHRC 166
           PPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLYQPSNDLVPCKDPLCMSLHSS+DHRC
Sbjct: 35  PPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRC 94

Query: 167 ENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLTLGCGYDQDPGSSSYHPMD 226
           ENPDQCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRL LGCGYDQDPGSSSYHPMD
Sbjct: 95  ENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD 154

Query: 227 GVLGLGKGAVSIVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDDIYDPYRLAWTPMSRDYP 286
           G+LGLG+GAVSIVSQLHNQGIVRNVVGHCF+SKGGGYLFFGD IYDPYRL WTPMSRDYP
Sbjct: 155 GILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVWTPMSRDYP 214

Query: 287 KHYSPGFGDLFFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQIITSLLNRELTGKPLREAK 346
           KHYSPGFG+L FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ++TSLLNREL GKPLREA 
Sbjct: 215 KHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAM 274

Query: 347 DDDTLPLCWRGRNPFKSLRDVRKYFKPLALSFSSGRRSKAVFEMPMESYLIISSKGNVCL 406
           DDDTLPLCWRGR P KSLRDVRKYFKPLALSFSSG RSKAVFE+P E Y+IISS GNVCL
Sbjct: 275 DDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEIPTEGYMIISSMGNVCL 334

Query: 407 GILNGSEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATAICDRVPKSSVAA 459
           GILNG++VGLENSNIIGDISMQDKMVVYNNEKQAIGWATA CDRVPKS V++
Sbjct: 335 GILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQVSS 386
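
Each HSP header above gives identity and positives as raw counts over the alignment length, followed by the percentage. As a rough check, the percentages can be recomputed from the counts. The following is a minimal Python sketch, not part of the database output; the function names `parse_hsp_stats` and `percent` are hypothetical, and the pattern assumes the header layout shown on this page (it also accepts the "Postives" spelling that some report generators emit).

```python
import re

# Matches the statistics line of an HSP header as laid out on this page.
# "Posi?tives" tolerates the common "Postives" misspelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, length, identity_pct, positives, _, positives_pct = match.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "length": int(length),
        "identity_pct": float(identity_pct),
        "positives_pct": float(positives_pct),
    }


def percent(count, length):
    """Recompute a percentage from a count and the alignment length."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    # Statistics line of the KGN62238.1 hit reported above.
    stats = parse_hsp_stats(
        "Identity = 323/352 (91.76%), Positives = 337/352 (95.74%), Query Frame = 1"
    )
    print(percent(stats["identical"], stats["length"]))  # 91.76
    print(percent(stats["positives"], stats["length"]))  # 95.74
```

Note that the denominator is the alignment length including gaps, not the length of the query protein, so hits that cover different stretches of the query are not directly comparable on identity alone.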

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
ASP1_ORYSJ   1.6e-93  46.68         Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica GN=ASP1 PE=2 SV=1
ASP1_ORYSI   4.0e-89  44.87         Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica GN=ASP1 PE=2 SV=2
APCB1_ARATH  2.8e-82  42.86         Aspartyl protease APCB1 OS=Arabidopsis thaliana GN=APCB1 PE=1 SV=1
ASPL2_ARATH  7.6e-32  28.90         Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=...
APF1_ARATH   2.4e-25  29.38         Aspartyl protease family protein 1 OS=Arabidopsis thaliana GN=APF1 PE=1 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0LKB0_CUCSA  2.4e-197  91.76         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G338820 PE=3 SV=1
A0A165Z0G7_DAUCA  7.4e-167  71.35         Uncharacterized protein OS=Daucus carota subsp. sativus GN=DCAR_013175 PE=4 SV=1
Q5NT86_DAUCA      7.4e-167  71.35         Nucellin-like protein OS=Daucus carota GN=DcNLP PE=3 SV=1
M5WHY2_PRUPE      9.6e-167  69.14         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005961mg PE=3 SV=1
W9SFH5_9ROSA      9.0e-165  69.54         Aspartic proteinase Asp1 OS=Morus notabilis GN=L484_027908 PE=3 SV=1

Match Name   E-value   Identity (%)  Description
AT4G33490.2  1.9e-166  69.23         Eukaryotic aspartyl protease family protein
AT1G44130.1  3.4e-123  53.99         Eukaryotic aspartyl protease family protein
AT1G77480.1  1.9e-113  51.73         Eukaryotic aspartyl protease family protein
AT1G49050.1  1.6e-83   42.86         Eukaryotic aspartyl protease family protein
AT5G22850.1  1.5e-33   25.98         Eukaryotic aspartyl protease family protein

Match Name                        E-value   Identity (%)  Description
gi|778670347|ref|XP_004147327.2|  1.4e-222  90.17         PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis sativus]
gi|659121807|ref|XP_008460823.1|  6.8e-222  89.93         PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo]
gi|778670345|ref|XP_011649449.1|  7.5e-221  89.29         PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis sativus]
gi|659121805|ref|XP_008460822.1|  3.7e-220  89.05         PREDICTED: aspartic proteinase Asp1 isoform X1 [Cucumis melo]
gi|700207119|gb|KGN62238.1|       3.4e-197  91.76         hypothetical protein Csa_2G338820 [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001461   Aspartic_peptidase_A1
IPR021109   Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh02G000010.1  CmoCh02G000010.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description               Source   Source Term       Source Description                Alignment
IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683         ASPARTYL PROTEASES                coord: 44..456, score: 2.0E
IPR021109  Aspartic peptidase domain     GENE3D   G3DSA:2.40.70.10                                    coord: 276..450, score: 3.6E-22; coord: 83..268, score: 1.0
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases                    coord: 90..451, score: 9.32
None       No IPR available              PANTHER  PTHR13683:SF227   ASPARTYL PROTEASE FAMILY PROTEIN  coord: 44..456, score: 2.0E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None