CmoCh04G021150 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G021150
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Eukaryotic aspartyl protease family protein
Location: Cmo_Chr04: 13724947..13726314 (-)
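
The location above can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention; the record itself does not state this). The function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re

# Pattern for "<seqid> : <start> .. <end> (<strand>)"; spacing is optional.
LOCATION = re.compile(r"\s*([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION.match(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr04 : 13724947 .. 13726314 (-)")
print(seqid, strand, span_length(start, end))  # Cmo_Chr04 - 1368
```

Under that assumption the feature spans 1368 bp on the minus strand, which is a multiple of three.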
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCGTGTGTTCCCCAATTCCTCCTCCTCCTCCTCCTCCTCCTCTCATCCCTCGCAGATCTCTCCAATGCCATTCCCTCTCAATACCTGAAGTTCCCATTACTCCACACTAACCCCTTCTCCTCCCCTTCTCAAGCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCCGCCCACCGCCACAGCCCCACCCTCAAGTCCCCCCTCATCTCCGGCGCCTCCACCGGCTCTGGTCAGTACTTCGTCAATCTCCACCTCGGCACCCCTCCTCAAAGCCTCCTCCTCGTCGTCGATACCGGCAGCGACCTCGTCTGGGTCAAATGCTCCCCCTGCCGCAACTGCTCCCACCACCCTCCCTCCTCTGCTTTCTTCCCCCGCCACTCATCCTCCTTCTCCCCTTTCCACTGCTTCGACCCCCACTGCCGCCTCCTTCCCCACCCCCCTTCTCACCGCTGCAACCACACCCACCTCCACTCCCCCTGCTCCTTCCTCTACTCCTACGCCGACTCCTCCCTCTCCTCCGGCTTCTTCTCCAAAGATGTCACCACCTTCAACACCTTCTCCGGGACCCACACCCAGACCCGTCTCAACGACCTTTCCTTTGGATGTGGCTTTCGGATCTCGGGTCCTAGCGTTTCGGGCGCCCGATTCACTGGAGCACGTGGAGTCATGGGATTGGGCAGAGGCCCGATCTCCTTCTCCTCCCAACTCGGCCACCGATTTGGCAACACTTTTTCTTATTGTCTTATGGATTATACACTCTCTCCGCCGCCCACCAGCTACCTCATGATCGGCGGCGGCCTCCGTAGCCTACCGGTGACGAACGCCTCAAAAATCAGCTACACCCCGTTGCAGATTAACCCTCTGTCCCCGACATTCTACTACATTGTCGTGAAGAGCATCACCGTGGACGGCGTGAAATTGCCCATCAATCCCAAGGTGTGGGCCATCGACGAGCAAGGCAACGGCGGCACGGTGGTGGATTCCGGCACGACATTGACGTATCTAGCTGAGGCAGCATACGAGGAAGTGTTGAAGGCCATGAGACGGCGAGTGAAGCTGCCGAGAGCGCTCCAGTTGAGTCCAGGGTTCGATCTGTGCGTGAACGCATCGAGCGAGTCGCGGATGCGGAGTCTGCCGCAAATAAGATTCCGGGTAGGCGGCGGAGGTGTGTTTGCGCCGCCGGCAAGGAACTATTTTGTGGAGACAGAGGAGGGGGTGATGTGCTTGGCGATCCGACCCGTGGATTCGGGAAATGGGTTTTCGGTGATTGGGAATCTGATGCAACAAGGATTCTTGTTGGAGTTCGATAGAGAGAAGTCGAGGATGGGGTTTTCAAGGCGTGGCTGTGGGCTTCCTTGA

mRNA sequence

ATGCCGTGTGTTCCCCAATTCCTCCTCCTCCTCCTCCTCCTCCTCTCATCCCTCGCAGATCTCTCCAATGCCATTCCCTCTCAATACCTGAAGTTCCCATTACTCCACACTAACCCCTTCTCCTCCCCTTCTCAAGCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCCGCCCACCGCCACAGCCCCACCCTCAAGTCCCCCCTCATCTCCGGCGCCTCCACCGGCTCTGGTCAGTACTTCGTCAATCTCCACCTCGGCACCCCTCCTCAAAGCCTCCTCCTCGTCGTCGATACCGGCAGCGACCTCGTCTGGGTCAAATGCTCCCCCTGCCGCAACTGCTCCCACCACCCTCCCTCCTCTGCTTTCTTCCCCCGCCACTCATCCTCCTTCTCCCCTTTCCACTGCTTCGACCCCCACTGCCGCCTCCTTCCCCACCCCCCTTCTCACCGCTGCAACCACACCCACCTCCACTCCCCCTGCTCCTTCCTCTACTCCTACGCCGACTCCTCCCTCTCCTCCGGCTTCTTCTCCAAAGATGTCACCACCTTCAACACCTTCTCCGGGACCCACACCCAGACCCGTCTCAACGACCTTTCCTTTGGATGTGGCTTTCGGATCTCGGGTCCTAGCGTTTCGGGCGCCCGATTCACTGGAGCACGTGGAGTCATGGGATTGGGCAGAGGCCCGATCTCCTTCTCCTCCCAACTCGGCCACCGATTTGGCAACACTTTTTCTTATTGTCTTATGGATTATACACTCTCTCCGCCGCCCACCAGCTACCTCATGATCGGCGGCGGCCTCCGTAGCCTACCGGTGACGAACGCCTCAAAAATCAGCTACACCCCGTTGCAGATTAACCCTCTGTCCCCGACATTCTACTACATTGTCGTGAAGAGCATCACCGTGGACGGCGTGAAATTGCCCATCAATCCCAAGGTGTGGGCCATCGACGAGCAAGGCAACGGCGGCACGGTGGTGGATTCCGGCACGACATTGACGTATCTAGCTGAGGCAGCATACGAGGAAGTGTTGAAGGCCATGAGACGGCGAGTGAAGCTGCCGAGAGCGCTCCAGTTGAGTCCAGGGTTCGATCTGTGCGTGAACGCATCGAGCGAGTCGCGGATGCGGAGTCTGCCGCAAATAAGATTCCGGGTAGGCGGCGGAGGTGTGTTTGCGCCGCCGGCAAGGAACTATTTTGTGGAGACAGAGGAGGGGGTGATGTGCTTGGCGATCCGACCCGTGGATTCGGGAAATGGGTTTTCGGTGATTGGGAATCTGATGCAACAAGGATTCTTGTTGGAGTTCGATAGAGAGAAGTCGAGGATGGGGTTTTCAAGGCGTGGCTGTGGGCTTCCTTGA

Coding sequence (CDS)

ATGCCGTGTGTTCCCCAATTCCTCCTCCTCCTCCTCCTCCTCCTCTCATCCCTCGCAGATCTCTCCAATGCCATTCCCTCTCAATACCTGAAGTTCCCATTACTCCACACTAACCCCTTCTCCTCCCCTTCTCAAGCCCTCTCCTCCGACACCCACCGCCTCTCCCTCCTCTTCTCCGCCCACCGCCACAGCCCCACCCTCAAGTCCCCCCTCATCTCCGGCGCCTCCACCGGCTCTGGTCAGTACTTCGTCAATCTCCACCTCGGCACCCCTCCTCAAAGCCTCCTCCTCGTCGTCGATACCGGCAGCGACCTCGTCTGGGTCAAATGCTCCCCCTGCCGCAACTGCTCCCACCACCCTCCCTCCTCTGCTTTCTTCCCCCGCCACTCATCCTCCTTCTCCCCTTTCCACTGCTTCGACCCCCACTGCCGCCTCCTTCCCCACCCCCCTTCTCACCGCTGCAACCACACCCACCTCCACTCCCCCTGCTCCTTCCTCTACTCCTACGCCGACTCCTCCCTCTCCTCCGGCTTCTTCTCCAAAGATGTCACCACCTTCAACACCTTCTCCGGGACCCACACCCAGACCCGTCTCAACGACCTTTCCTTTGGATGTGGCTTTCGGATCTCGGGTCCTAGCGTTTCGGGCGCCCGATTCACTGGAGCACGTGGAGTCATGGGATTGGGCAGAGGCCCGATCTCCTTCTCCTCCCAACTCGGCCACCGATTTGGCAACACTTTTTCTTATTGTCTTATGGATTATACACTCTCTCCGCCGCCCACCAGCTACCTCATGATCGGCGGCGGCCTCCGTAGCCTACCGGTGACGAACGCCTCAAAAATCAGCTACACCCCGTTGCAGATTAACCCTCTGTCCCCGACATTCTACTACATTGTCGTGAAGAGCATCACCGTGGACGGCGTGAAATTGCCCATCAATCCCAAGGTGTGGGCCATCGACGAGCAAGGCAACGGCGGCACGGTGGTGGATTCCGGCACGACATTGACGTATCTAGCTGAGGCAGCATACGAGGAAGTGTTGAAGGCCATGAGACGGCGAGTGAAGCTGCCGAGAGCGCTCCAGTTGAGTCCAGGGTTCGATCTGTGCGTGAACGCATCGAGCGAGTCGCGGATGCGGAGTCTGCCGCAAATAAGATTCCGGGTAGGCGGCGGAGGTGTGTTTGCGCCGCCGGCAAGGAACTATTTTGTGGAGACAGAGGAGGGGGTGATGTGCTTGGCGATCCGACCCGTGGATTCGGGAAATGGGTTTTCGGTGATTGGGAATCTGATGCAACAAGGATTCTTGTTGGAGTTCGATAGAGAGAAGTCGAGGATGGGGTTTTCAAGGCGTGGCTGTGGGCTTCCTTGA
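
The protein used as the BLAST query in the hits that follow corresponds to a translation of the coding sequence above. The sketch below shows one way to do that translation, assuming the standard genetic code (NCBI translation table 1) and reading frame 1; `translate` is a hypothetical helper, not a function from the source database. Only the first and last codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

print(translate("ATGCCGTGTGTTCCCCAATTCCTC"))  # MPCVPQFL
print(translate("GGCTGTGGGCTTCCTTGA"))        # GCGLP*
```

The two outputs agree with the start (`VPQFL...`) and end (`...GCGLP`) of the query lines in the alignments.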
BLAST of CmoCh04G021150 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 228.8 bits (582), Expect = 1.2e-58
Identity = 143/396 (36.11%), Positives = 210/396 (53.03%), Query Frame = 1

Query: 63  HSPT---LKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHH 122
           H+P      S ++SG S GSG+YF  L +GTP + + +V+DTGSD+VW++C+PCR C + 
Sbjct: 120 HAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRC-YS 179

Query: 123 PPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFF 182
                F PR S +++   C  PHCR L    S  CN       C +  SY D S + G F
Sbjct: 180 QSDPIFDPRKSKTYATIPCSSPHCRRL---DSAGCNTR--RKTCLYQVSYGDGSFTVGDF 239

Query: 183 SKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQL 242
           S +  TF        + R+  ++ GCG    G       F GA G++GLG+G +SF  Q 
Sbjct: 240 STETLTFR-------RNRVKGVALGCGHDNEG------LFVGAAGLLGLGKGKLSFPGQT 299

Query: 243 GHRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIV 302
           GHRF   FSYCL+D + S  P+S +     +  +         +TPL  NP   TFYY+ 
Sbjct: 300 GHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRI-------ARFTPLLSNPKLDTFYYVG 359

Query: 303 VKSITVDGVKLP-INPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVK-LP 362
           +  I+V G ++P +   ++ +D+ GNGG ++DSGT++T L   AY  +  A R   K L 
Sbjct: 360 LLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLK 419

Query: 363 RALQLSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETE-EGVMCLAIRP 422
           RA   S  FD C + S+ + ++ +P +     G  V + PA NY +  +  G  C A   
Sbjct: 420 RAPDFSL-FDTCFDLSNMNEVK-VPTVVLHFRGADV-SLPATNYLIPVDTNGKFCFAF-- 479

Query: 423 VDSGNGFSVIGNLMQQGFLLEFDREKSRMGFSRRGC 453
             +  G S+IGN+ QQGF + +D   SR+GF+  GC
Sbjct: 480 AGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484
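
The identity and positive percentages on each HSP summary line are plain ratios of the counts shown next to them. The following sketch parses such a line and recomputes the percentages; `parse_hsp_stats` is a hypothetical helper, and the regular expression is an assumption based only on the layout of the summary lines in this record.

```python
import re

# Matches e.g. "Identity = 143/396 (36.11%), Positives = 210/396 (53.03%)".
# "Pos\w*" tolerates spelling variants of the second label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return the counts and reported percentages from an HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 143/396 (36.11%), Positives = 210/396 (53.03%), Query Frame = 1"
)
# Recompute the percentages from the raw counts.
print(round(100 * stats["identities"] / stats["length"], 2))  # 36.11
print(round(100 * stats["positives"] / stats["length"], 2))   # 53.03
```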

BLAST of CmoCh04G021150 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 209.5 bits (532), Expect = 7.5e-53
Identity = 134/409 (32.76%), Positives = 205/409 (50.12%), Query Frame = 1

Query: 45  QALSSDTHRLSLLFSAHRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSD 104
           +A+     R+  + +  + S  +++P+ +G     G+Y +N+ +GTP  S   ++DTGSD
Sbjct: 63  RAIKRGERRMRSINAMLQSSSGIETPVYAG----DGEYLMNVAIGTPDSSFSAIMDTGSD 122

Query: 105 LVWVKCSPCRNCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCS 164
           L+W +C PC  C    P+  F P+ SSSFS   C   +C+ L   PS  CN    ++ C 
Sbjct: 123 LIWTQCEPCTQC-FSQPTPIFNPQDSSSFSTLPCESQYCQDL---PSETCN----NNECQ 182

Query: 165 FLYSYADSSLSSGFFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARG 224
           + Y Y D S + G+ + +  TF T S       + +++FGC     G    G       G
Sbjct: 183 YTYGYGDGSTTQGYMATETFTFETSS-------VPNIAFGC-----GEDNQGFGQGNGAG 242

Query: 225 VMGLGRGPISFSSQLGHRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYT 284
           ++G+G GP+S  SQLG      FSYC+  Y  S P T  L +G     +P  + S    T
Sbjct: 243 LIGMGWGPLSLPSQLG---VGQFSYCMTSYGSSSPST--LALGSAASGVPEGSPS----T 302

Query: 285 PLQINPLSPTFYYIVVKSITVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYE 344
            L  + L+PT+YYI ++ ITV G  L I    + + + G GG ++DSGTTLTYL + AY 
Sbjct: 303 TLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYN 362

Query: 345 EVLKAMRRRVKLPRALQLSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVE 404
            V +A   ++ LP   + S G   C    S+     +P+I  +   GGV     +N  + 
Sbjct: 363 AVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQF-DGGVLNLGEQNILIS 422

Query: 405 TEEGVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDREKSRMGFSRRGCG 454
             EGV+CLA+    S  G S+ GN+ QQ   + +D +   + F    CG
Sbjct: 423 PAEGVICLAMGS-SSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQCG 436

BLAST of CmoCh04G021150 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 205.3 bits (521), Expect = 1.4e-51
Identity = 130/377 (34.48%), Positives = 192/377 (50.93%), Query Frame = 1

Query: 78  GSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPPSSAFFPRHSSSFSPFH 137
           G G+Y +NL +GTP Q    ++DTGSDL+W +C PC  C  +  +  F P+ SSSFS   
Sbjct: 91  GDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQC-FNQSTPIFNPQGSSSFSTLP 150

Query: 138 CFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFFSKDVTTFNTFSGTHTQTR 197
           C    C+ L  P       T  ++ C + Y Y D S + G    +  TF + S       
Sbjct: 151 CSSQLCQALSSP-------TCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVS------- 210

Query: 198 LNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGHRFGNTFSYCLMDYTLS 257
           + +++FGCG    G      +  GA G++G+GRGP+S  SQL       FSYC+     S
Sbjct: 211 IPNITFGCGENNQG----FGQGNGA-GLVGMGRGPLSLPSQLD---VTKFSYCMTPIGSS 270

Query: 258 PPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSITVDGVKLPINPKVW 317
            P  S L++G    S+    A   + T +Q + + PTFYYI +  ++V   +LPI+P  +
Sbjct: 271 TP--SNLLLGSLANSV---TAGSPNTTLIQSSQI-PTFYYITLNGLSVGSTRLPIDPSAF 330

Query: 318 AID-EQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQLSPGFDLCVNASSES 377
           A++   G GG ++DSGTTLTY    AY+ V +    ++ LP     S GFDLC    S+ 
Sbjct: 331 ALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDP 390

Query: 378 RMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGNGFSVIGNLMQQGFLL 437
               +P       GG +   P+ NYF+    G++CLA+    S  G S+ GN+ QQ  L+
Sbjct: 391 SNLQIPTFVMHFDGGDL-ELPSENYFISPSNGLICLAMG--SSSQGMSIFGNIQQQNMLV 435

Query: 438 EFDREKSRMGFSRRGCG 454
            +D   S + F+   CG
Sbjct: 451 VYDTGNSVVSFASAQCG 435

BLAST of CmoCh04G021150 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 3.0e-49
Identity = 127/391 (32.48%), Positives = 198/391 (50.64%), Query Frame = 1

Query: 67  LKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPPSSAFF 126
           L +P++SGAS GSG+YF  + +GTP + + LV+DTGSD+ W++C PC +C +      F 
Sbjct: 147 LTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADC-YQQSDPVFN 206

Query: 127 PRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFFSKDVTTF 186
           P  SS++    C  P C LL    +  C      + C +  SY D S + G  + D  TF
Sbjct: 207 PTSSSTYKSLTCSAPQCSLL---ETSACR----SNKCLYQVSYGDGSFTVGELATDTVTF 266

Query: 187 NTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGHRFGNT 246
              SG     ++N+++ GCG    G       FTGA G++GLG G +S ++Q+      +
Sbjct: 267 GN-SG-----KINNVALGCGHDNEG------LFTGAAGLLGLGGGVLSITNQMK---ATS 326

Query: 247 FSYCLMDYTLSPPPT---SYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSI 306
           FSYCL+D       +   + + +GGG  + P+    KI            TFYY+ +   
Sbjct: 327 FSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKID-----------TFYYVGLSGF 386

Query: 307 TVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKA-MRRRVKLPRALQL 366
           +V G K+ +   ++ +D  G+GG ++D GT +T L   AY  +  A ++  V L +    
Sbjct: 387 SVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSS 446

Query: 367 SPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEE-GVMCLAIRPVDSGN 426
              FD C + SS S ++ +P + F   GG     PA+NY +  ++ G  C A  P  S  
Sbjct: 447 ISLFDTCYDFSSLSTVK-VPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSS-- 500

Query: 427 GFSVIGNLMQQGFLLEFDREKSRMGFSRRGC 453
             S+IGN+ QQG  + +D  K+ +G S   C
Sbjct: 507 SLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmoCh04G021150 vs. Swiss-Prot
Match: ASPG2_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 195.7 bits (496), Expect = 1.1e-48
Identity = 128/386 (33.16%), Positives = 195/386 (50.52%), Query Frame = 1

Query: 69  SPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPPSSAFFPR 128
           S ++SG   GSG+YFV + +G+PP+   +V+D+GSD+VWV+C PC+ C +      F P 
Sbjct: 118 SDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLC-YKQSDPVFDPA 177

Query: 129 HSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFFSKDVTTFNT 188
            S S++   C    C  + +   H          C +   Y D S + G  + +  TF  
Sbjct: 178 KSGSYTGVSCGSSVCDRIENSGCH-------SGGCRYEVMYGDGSYTKGTLALETLTF-- 237

Query: 189 FSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGHRFGNTFS 248
                 +T + +++ GCG R  G       F GA G++G+G G +SF  QL  + G  F 
Sbjct: 238 -----AKTVVRNVAMGCGHRNRG------MFIGAAGLLGIGGGSMSFVGQLSGQTGGAFG 297

Query: 249 YCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSITVDGV 308
           YCL+  +     T  L+   G  +LPV      S+ PL  NP +P+FYY+ +K + V GV
Sbjct: 298 YCLV--SRGTDSTGSLVF--GREALPV----GASWVPLVRNPRAPSFYYVGLKGLGVGGV 357

Query: 309 KLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMR-RRVKLPRALQLSPGFD 368
           ++P+   V+ + E G+GG V+D+GT +T L  AAY       + +   LPRA  +S  FD
Sbjct: 358 RIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSI-FD 417

Query: 369 LCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEE-GVMCLAIRPVDSGNGFSVI 428
            C + S    +R +P + F    G V   PARN+ +  ++ G  C A     S  G S+I
Sbjct: 418 TCYDLSGFVSVR-VPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAF--AASPTGLSII 470

Query: 429 GNLMQQGFLLEFDREKSRMGFSRRGC 453
           GN+ Q+G  + FD     +GF    C
Sbjct: 478 GNIQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of CmoCh04G021150 vs. TrEMBL
Match: A0A0A0KNH6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G174650 PE=3 SV=1)

HSP 1 Score: 732.3 bits (1889), Expect = 3.7e-208
Identity = 363/458 (79.26%), Positives = 395/458 (86.24%), Query Frame = 1

Query: 4   VPQFLLLLLLLLSSLADLSN------AIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLL 63
           +  F LL+LL    L  L N      A P+ +LK PLLH  PFSSPSQ+LSSDTHRLSLL
Sbjct: 6   ISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLL 65

Query: 64  FSAHRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCS 123
           FS  R +PTLKSPLISGASTGSGQYFV++ LGTPPQSLLLV DTGSDLVWVKCS CRNCS
Sbjct: 66  FS--RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCS 125

Query: 124 HHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSG 183
           HHPPSSAF PRHSSSFSPFHCFDPHCRLLPH P H CNHT LHSPC FLYSYAD SLSSG
Sbjct: 126 HHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSG 185

Query: 184 FFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSS 243
           FFSK+ TT  + SG+  +  L  LSFGCGFRISGPSVSGA+F GARGVMGLGRG ISFSS
Sbjct: 186 FFSKETTTLKSLSGS--EIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSS 245

Query: 244 QLGHRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYY 303
           QLG RFGN FSYCLMDYTLSPPPTS+LMIGGGL SLP+TNA+KISYTPLQINPLSPTFYY
Sbjct: 246 QLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYY 305

Query: 304 IVVKSITVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLP 363
           I + SIT+DGVKLPINP VW IDEQGNGGTVVDSGTTLTYL + AYEEVLK++RRRVKLP
Sbjct: 306 ITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLP 365

Query: 364 RALQLSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPV 423
            A +L+PGFDLCVNAS ESR  SLP++RFR+GGG VFAPP RNYF+ETEEGVMCLAIR V
Sbjct: 366 NAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAV 425

Query: 424 DSGNGFSVIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 456
           +SGNGFSVIGNLMQQGFLLEFD+E+SR+GF+RRGCGLP
Sbjct: 426 ESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of CmoCh04G021150 vs. TrEMBL
Match: F6HF17_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g02930 PE=3 SV=1)

HSP 1 Score: 579.3 bits (1492), Expect = 4.0e-162
Identity = 284/452 (62.83%), Positives = 346/452 (76.55%), Query Frame = 1

Query: 10  LLLLLLSSLADLSNAIPS------QYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSAHRH 69
           LLLLL+    D+ NA+P       +YLK  LLH  PF++PSQALS D+HRLS  FSA   
Sbjct: 11  LLLLLIFFFTDICNALPIAQNGTVEYLKLRLLHIKPFTTPSQALSFDSHRLSFFFSALHT 70

Query: 70  SPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPPSS 129
             +LKSP++SGASTGSGQYFV+L LGTPPQ LLLV DTGSDLVWVKCS CRNC+ H P S
Sbjct: 71  PQSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGS 130

Query: 130 AFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFFSKDV 189
           AF  RHS++FSP HC+D  C+L+P P  HRCNH  LHSPC + YSY D S +SGFFSK+ 
Sbjct: 131 AFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKET 190

Query: 190 TTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGHRF 249
           TT NT SG   + +L  ++FGC FRISGPSVSGA F GA GVMGLGRGPIS SSQLGHRF
Sbjct: 191 TTLNTSSG--REAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRF 250

Query: 250 GNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSI 309
           GN FSYCLMD+ +SP PTSYL+IG     +      ++ +TPL INPLSPTFYYI ++S+
Sbjct: 251 GNKFSYCLMDHDISPSPTSYLLIGSTQNDV-APGKRRMRFTPLHINPLSPTFYYIGIESV 310

Query: 310 TVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQLS 369
           +VDG+KLPINP VWA+DE GNGGT+VDSGTTLT+L E AY ++L  ++RRV+LP   + +
Sbjct: 311 SVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPT 370

Query: 370 PGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGNGF 429
           PGFDLCVN S     R LP++ F++GG  VF+PP RNYFV+T+E V CLA++ V + +GF
Sbjct: 371 PGFDLCVNVSEIEHPR-LPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGF 430

Query: 430 SVIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 456
           SVIGNLMQQGFLLEFD++++R+GFSR GC LP
Sbjct: 431 SVIGNLMQQGFLLEFDKDRTRLGFSRHGCALP 458

BLAST of CmoCh04G021150 vs. TrEMBL
Match: A0A067K2U7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18279 PE=3 SV=1)

HSP 1 Score: 578.9 bits (1491), Expect = 5.3e-162
Identity = 290/454 (63.88%), Positives = 351/454 (77.31%), Query Frame = 1

Query: 7   FLLLLLLL-----LSSLADLSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSAH 66
           FLLLLLL      +S    +++    +YLK PLLH  PF SP+QAL  D  RLSLL   H
Sbjct: 9   FLLLLLLTDLCYSISLRTTVNSTATKEYLKLPLLHRTPFKSPAQALPFDIRRLSLL---H 68

Query: 67  RHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPP 126
           R   +LKSP+ISGASTGSGQYFV+L LG+P Q+LLLV DTGSDLVWVKCS C+NCS++ P
Sbjct: 69  RQRTSLKSPVISGASTGSGQYFVSLRLGSPAQTLLLVADTGSDLVWVKCSACKNCSNYSP 128

Query: 127 SSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFFSK 186
            SAF  RHSS+FS  HCF+  CRL+PHP  + CN T LHSPC + YSYAD S +SGFFSK
Sbjct: 129 GSAFLARHSSTFSLIHCFNSQCRLVPHPRPNPCNRTRLHSPCRYEYSYADGSSTSGFFSK 188

Query: 187 DVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGH 246
           + TT NT +G   + +L +L+FGCGFRISGPS++GA F GA GV+GLGR PISFSSQLG 
Sbjct: 189 ETTTLNTSAGR--EKKLKNLAFGCGFRISGPSLTGASFAGAHGVIGLGRAPISFSSQLGR 248

Query: 247 RFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVK 306
           RFGN FSYCLMDYTLSPPPTSYLMIGG   S  V+    +++TPL +N LSPTFYYI +K
Sbjct: 249 RFGNKFSYCLMDYTLSPPPTSYLMIGGHQNSA-VSRKRILNFTPLLVNSLSPTFYYIGIK 308

Query: 307 SITVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQ 366
           S++VDGVKLPINP VW+ID+ GNGGT++DSGTTLT+L E AY E+L A++RRVKLP   +
Sbjct: 309 SVSVDGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFLVEPAYREILSAIKRRVKLPGPGE 368

Query: 367 LSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGN 426
           L+PGFDLCVN S   R    P++   + G  VF+PP RNYF++T EGV CLAI+PV+SG+
Sbjct: 369 LTPGFDLCVNVSG-VRRPVFPRMSLELAGNSVFSPPPRNYFIDTSEGVKCLAIQPVNSGS 428

Query: 427 GFSVIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 456
           GFSVIGNLMQQG+LLEFDR++SR+GF+R GC LP
Sbjct: 429 GFSVIGNLMQQGYLLEFDRDRSRLGFARSGCALP 455

BLAST of CmoCh04G021150 vs. TrEMBL
Match: B9SEI2_RICCO (Basic 7S globulin 2 small subunit, putative OS=Ricinus communis GN=RCOM_0705030 PE=3 SV=1)

HSP 1 Score: 575.9 bits (1483), Expect = 4.5e-161
Identity = 279/454 (61.45%), Positives = 351/454 (77.31%), Query Frame = 1

Query: 7   FLLLLLLLLSSLADLSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSAHRHSP- 66
           F  LL+ L  S +  +N   ++YLK PLLH  PF+SPS+AL+ D +R   L   HRH   
Sbjct: 7   FFFLLITLCPSSSAAANTT-TEYLKLPLLHKTPFTSPSEALAFDINRRLSLLHHHRHQQQ 66

Query: 67  ----TLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPP 126
               + +SP+ISGAS+GSGQYFV+L +GTPPQ+LLLV DTGSDL+WVKCSPCRNCSH  P
Sbjct: 67  HKQNSFRSPVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSP 126

Query: 127 SSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFFSK 186
            SAFF RHS+++S  HC+ P C+L+PHP  + CN T LHSPC + Y+YADSS ++GFFSK
Sbjct: 127 GSAFFARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSK 186

Query: 187 DVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGH 246
           +  T NT +G     +LN LSFGCGFRISGPS++GA F GA+GVMGLGR PISFSSQLG 
Sbjct: 187 EALTLNTSTG--KVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGR 246

Query: 247 RFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVK 306
           RFG+ FSYCLMDYTLSPPPTS+L IGG  +++ V+    +S+TPL INPLSPTFYYI +K
Sbjct: 247 RFGSKFSYCLMDYTLSPPPTSFLTIGGA-QNVAVSKKGIMSFTPLLINPLSPTFYYIAIK 306

Query: 307 SITVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQ 366
            + V+GVKLPINP VW+ID+ GNGGT++DSGTTLT++ E AY E+LKA ++RVKLP   +
Sbjct: 307 GVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAE 366

Query: 367 LSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGN 426
            +PGFDLC+N S  +R  +LP++ F + GG VF+PP RNYF+ET + + CLA++PV    
Sbjct: 367 PTPGFDLCMNVSGVTR-PALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDG 426

Query: 427 GFSVIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 456
           GFSV+GNLMQQGFLLEFDR+KSR+GF+RRGC LP
Sbjct: 427 GFSVLGNLMQQGFLLEFDRDKSRLGFTRRGCALP 455

BLAST of CmoCh04G021150 vs. TrEMBL
Match: Q9LI73_ARATH (Aspartyl protease family protein OS=Arabidopsis thaliana GN=At3g25700 PE=2 SV=1)

HSP 1 Score: 564.7 bits (1454), Expect = 1.0e-157
Identity = 283/451 (62.75%), Positives = 344/451 (76.27%), Query Frame = 1

Query: 7   FLLLLLLLLSSLADLSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSAHRHSPT 66
           FL L LL  S++A +SN   ++YLK PLL  +PF SP+QAL+ DT RL  L    +  P 
Sbjct: 11  FLSLFLLPPSNIAAVSNH--NKYLKLPLLRKSPFPSPTQALALDTRRLHFLSLRRKPIPF 70

Query: 67  LKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPPSSAFF 126
           +KSP++SGA++GSGQYFV+L +G PPQSLLL+ DTGSDLVWVKCS CRNCSHH P++ FF
Sbjct: 71  VKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFF 130

Query: 127 PRHSSSFSPFHCFDPHCRLLPHPP-SHRCNHTHLHSPCSFLYSYADSSLSSGFFSKDVTT 186
           PRHSS+FSP HC+DP CRL+P P  +  CNHT +HS C + Y YAD SL+SG F+++ T+
Sbjct: 131 PRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTS 190

Query: 187 FNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGHRFGN 246
             T SG   + RL  ++FGCGFRISG SVSG  F GA GVMGLGRGPISF+SQLG RFGN
Sbjct: 191 LKTSSG--KEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGN 250

Query: 247 TFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSITV 306
            FSYCLMDYTLSPPPTSYL+IG G   +     SK+ +TPL  NPLSPTFYY+ +KS+ V
Sbjct: 251 KFSYCLMDYTLSPPPTSYLIIGNGGDGI-----SKLFFTPLLTNPLSPTFYYVKLKSVFV 310

Query: 307 DGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQLSPG 366
           +G KL I+P +W ID+ GNGGTVVDSGTTL +LAE AY  V+ A+RRRVKLP A  L+PG
Sbjct: 311 NGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPG 370

Query: 367 FDLCVNASSESR-MRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGNGFS 426
           FDLCVN S  ++  + LP+++F   GG VF PP RNYF+ETEE + CLAI+ VD   GFS
Sbjct: 371 FDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFS 430

Query: 427 VIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 456
           VIGNLMQQGFL EFDR++SR+GFSRRGC LP
Sbjct: 431 VIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 452

BLAST of CmoCh04G021150 vs. TAIR10
Match: AT3G25700.1 (AT3G25700.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 564.7 bits (1454), Expect = 5.2e-161
Identity = 283/451 (62.75%), Positives = 344/451 (76.27%), Query Frame = 1

Query: 7   FLLLLLLLLSSLADLSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSAHRHSPT 66
           FL L LL  S++A +SN   ++YLK PLL  +PF SP+QAL+ DT RL  L    +  P 
Sbjct: 11  FLSLFLLPPSNIAAVSNH--NKYLKLPLLRKSPFPSPTQALALDTRRLHFLSLRRKPIPF 70

Query: 67  LKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPPSSAFF 126
           +KSP++SGA++GSGQYFV+L +G PPQSLLL+ DTGSDLVWVKCS CRNCSHH P++ FF
Sbjct: 71  VKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFF 130

Query: 127 PRHSSSFSPFHCFDPHCRLLPHPP-SHRCNHTHLHSPCSFLYSYADSSLSSGFFSKDVTT 186
           PRHSS+FSP HC+DP CRL+P P  +  CNHT +HS C + Y YAD SL+SG F+++ T+
Sbjct: 131 PRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTS 190

Query: 187 FNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGHRFGN 246
             T SG   + RL  ++FGCGFRISG SVSG  F GA GVMGLGRGPISF+SQLG RFGN
Sbjct: 191 LKTSSG--KEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGN 250

Query: 247 TFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSITV 306
            FSYCLMDYTLSPPPTSYL+IG G   +     SK+ +TPL  NPLSPTFYY+ +KS+ V
Sbjct: 251 KFSYCLMDYTLSPPPTSYLIIGNGGDGI-----SKLFFTPLLTNPLSPTFYYVKLKSVFV 310

Query: 307 DGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQLSPG 366
           +G KL I+P +W ID+ GNGGTVVDSGTTL +LAE AY  V+ A+RRRVKLP A  L+PG
Sbjct: 311 NGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPG 370

Query: 367 FDLCVNASSESR-MRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGNGFS 426
           FDLCVN S  ++  + LP+++F   GG VF PP RNYF+ETEE + CLAI+ VD   GFS
Sbjct: 371 FDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFS 430

Query: 427 VIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 456
           VIGNLMQQGFL EFDR++SR+GFSRRGC LP
Sbjct: 431 VIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 452

BLAST of CmoCh04G021150 vs. TAIR10
Match: AT2G42980.1 (AT2G42980.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 242.7 bits (618), Expect = 4.5e-64
Identity = 153/408 (37.50%), Positives = 226/408 (55.39%), Query Frame = 1

Query: 51  THRLSLLFSAHRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKC 110
           T  +SL+ +       L + L SG + GSG+YF+++ +GTPP+   L++DTGSDL W++C
Sbjct: 129 TSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQC 188

Query: 111 SPCRNCSHHPPSSAFF-PRHSSSFSPFHCFDPHCRLLPHP-PSHRCNHTHLHSPCSFLYS 170
            PC +C H   +  F+ P+ S+SF    C DP C L+  P P  +C   +    C + Y 
Sbjct: 189 LPCYDCFHQ--NGMFYDPKTSASFKNITCNDPRCSLISSPDPPVQCESDN--QSCPYFYW 248

Query: 171 YADSSLSSGFFSKDVTTFN--TFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVM 230
           Y D S ++G F+ +  T N  T  G  ++ ++ ++ FGCG    G       F+GA G++
Sbjct: 249 YGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGHWNRG------LFSGASGLL 308

Query: 231 GLGRGPISFSSQLGHRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPL 290
           GLGRGP+SFSSQL   +G++FSYCL+D   +   +S L+ G     L  TN +  S+   
Sbjct: 309 GLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNG 368

Query: 291 QINPLSPTFYYIVVKSITVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEV 350
           + N +  TFYYI +KSI V G  L I  + W I   G+GGT++DSGTTL+Y AE AYE +
Sbjct: 369 KENSVE-TFYYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEII 428

Query: 351 LKAMRRRVKLPRAL-QLSPGFDLCVNASS-ESRMRSLPQIRFRVGGGGVFAPPARNYFVE 410
                 ++K    + +  P  D C N S  E     LP++      G V+  PA N F+ 
Sbjct: 429 KNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIW 488

Query: 411 TEEGVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDREKSRMGFSRRGC 453
             E ++CLAI        FS+IGN  QQ F + +D ++SR+GF+   C
Sbjct: 489 LSEDLVCLAILGTPKST-FSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524

BLAST of CmoCh04G021150 vs. TAIR10
Match: AT1G01300.1 (AT1G01300.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 228.8 bits (582), Expect = 6.8e-60
Identity = 143/396 (36.11%), Positives = 210/396 (53.03%), Query Frame = 1

Query: 63  HSPT---LKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHH 122
           H+P      S ++SG S GSG+YF  L +GTP + + +V+DTGSD+VW++C+PCR C + 
Sbjct: 120 HAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRC-YS 179

Query: 123 PPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFF 182
                F PR S +++   C  PHCR L    S  CN       C +  SY D S + G F
Sbjct: 180 QSDPIFDPRKSKTYATIPCSSPHCRRL---DSAGCNTR--RKTCLYQVSYGDGSFTVGDF 239

Query: 183 SKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQL 242
           S +  TF        + R+  ++ GCG    G       F GA G++GLG+G +SF  Q 
Sbjct: 240 STETLTFR-------RNRVKGVALGCGHDNEG------LFVGAAGLLGLGKGKLSFPGQT 299

Query: 243 GHRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIV 302
           GHRF   FSYCL+D + S  P+S +     +  +         +TPL  NP   TFYY+ 
Sbjct: 300 GHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRI-------ARFTPLLSNPKLDTFYYVG 359

Query: 303 VKSITVDGVKLP-INPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVK-LP 362
           +  I+V G ++P +   ++ +D+ GNGG ++DSGT++T L   AY  +  A R   K L 
Sbjct: 360 LLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLK 419

Query: 363 RALQLSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETE-EGVMCLAIRP 422
           RA   S  FD C + S+ + ++ +P +     G  V + PA NY +  +  G  C A   
Sbjct: 420 RAPDFSL-FDTCFDLSNMNEVK-VPTVVLHFRGADV-SLPATNYLIPVDTNGKFCFAF-- 479

Query: 423 VDSGNGFSVIGNLMQQGFLLEFDREKSRMGFSRRGC 453
             +  G S+IGN+ QQGF + +D   SR+GF+  GC
Sbjct: 480 AGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of CmoCh04G021150 vs. TAIR10
Match: AT3G59080.1 (AT3G59080.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 228.0 bits (580), Expect = 1.2e-59
Identity = 144/399 (36.09%), Positives = 215/399 (53.88%), Query Frame = 1

Query: 59  SAHRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSH 118
           S    +  L + L SG + GSG+YF+++ +G+PP+   L++DTGSDL W++C PC +C  
Sbjct: 147 SVEEQAGQLVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQ 206

Query: 119 HPPSSAFF-PRHSSSFSPFHCFDPHCRLLPHP-PSHRCNHTHLHSPCSFLYSYADSSLSS 178
              + AF+ P+ S+S+    C D  C L+  P P   C   +    C + Y Y DSS ++
Sbjct: 207 Q--NGAFYDPKASASYKNITCNDQRCNLVSSPDPPMPCKSDN--QSCPYYYWYGDSSNTT 266

Query: 179 GFFSKDVTTFN--TFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPIS 238
           G F+ +  T N  T  G+     + ++ FGCG    G       F GA G++GLGRGP+S
Sbjct: 267 GDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRG------LFHGAAGLLGLGRGPLS 326

Query: 239 FSSQLGHRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPT 298
           FSSQL   +G++FSYCL+D       +S L+ G     L   N +  S+   + N L  T
Sbjct: 327 FSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKEN-LVDT 386

Query: 299 FYYIVVKSITVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRV 358
           FYY+ +KSI V G  L I  + W I   G GGT++DSGTTL+Y AE AYE +   +  + 
Sbjct: 387 FYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKA 446

Query: 359 KLPRALQLS-PGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLA 418
           K    +    P  D C N S    ++ LP++      G V+  P  N F+   E ++CLA
Sbjct: 447 KGKYPVYRDFPILDPCFNVSGIHNVQ-LPELGIAFADGAVWNFPTENSFIWLNEDLVCLA 506

Query: 419 IRPVDSGNGFSVIGNLMQQGFLLEFDREKSRMGFSRRGC 453
           +      + FS+IGN  QQ F + +D ++SR+G++   C
Sbjct: 507 MLGTPK-SAFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 532

BLAST of CmoCh04G021150 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 220.3 bits (560), Expect = 2.4e-57
Identity = 162/465 (34.84%), Positives = 237/465 (50.97%), Query Frame = 1

Query: 17  SLADLSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSAHR-------------H 76
           SL D S +  +  L   L H +  SS S A  +D   L L   + R              
Sbjct: 48  SLTDESLSESTTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGR 107

Query: 77  SPTLKSP---------LISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCR 136
           + T ++P         +ISG S GSG+YF+ L +GTP  ++ +V+DTGSD+VW++CSPC+
Sbjct: 108 NATKRTPRTAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCK 167

Query: 137 NCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSL 196
            C ++   + F P+ S +F+   C    CR L    S  C  T     C +  SY D S 
Sbjct: 168 AC-YNQTDAIFDPKKSKTFATVPCGSRLCRRL--DDSSEC-VTRRSKTCLYQVSYGDGSF 227

Query: 197 SSGFFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPIS 256
           + G FS +  TF+         R++ +  GCG    G       F GA G++GLGRG +S
Sbjct: 228 TEGDFSTETLTFH-------GARVDHVPLGCGHDNEG------LFVGAAGLLGLGRGGLS 287

Query: 257 FSSQLGHRFGNTFSYCLMDYT----LSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINP 316
           F SQ  +R+   FSYCL+D T     S PP++ +    G  ++P T+     +TPL  NP
Sbjct: 288 FPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVF---GNAAVPKTSV----FTPLLTNP 347

Query: 317 LSPTFYYIVVKSITVDGVKLP-INPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKA 376
              TFYY+ +  I+V G ++P ++   + +D  GNGG ++DSGT++T L + AY  +  A
Sbjct: 348 KLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDA 407

Query: 377 MR-RRVKLPRALQLSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETE-E 436
            R    KL RA   S  FD C + S  + ++ +P + F  GGG V + PA NY +    E
Sbjct: 408 FRLGATKLKRAPSYSL-FDTCFDLSGMTTVK-VPTVVFHFGGGEV-SLPASNYLIPVNTE 467

Query: 437 GVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDREKSRMGFSRRGC 453
           G  C A     +    S+IGN+ QQGF + +D   SR+GF  R C
Sbjct: 468 GRFCFAF--AGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483
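
The percentages on each HSP statistics line follow directly from the counts printed beside them (identical or positive positions divided by alignment length). The following is a minimal sketch, assuming the line layout shown in this record; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and not part of any BLAST tool.

```python
import re

# Layout of the statistics line printed under each "HSP n Score" line.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling
# that appears in some exports of this record.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract the counts and recompute both percentages from them."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: " + line)
    ident, length, _, pos, pos_length, _ = m.groups()
    ident, length = int(ident), int(length)
    pos, pos_length = int(pos), int(pos_length)
    return {
        "identical": ident,
        "positive": pos,
        "aligned_length": length,
        "identity_pct": round(100.0 * ident / length, 2),
        "positive_pct": round(100.0 * pos / pos_length, 2),
    }


# Statistics line of the AT3G61820.1 hit above.
stats = parse_hsp_stats(
    "Identity = 162/465 (34.84%), Positives = 237/465 (50.97%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, is a cheap consistency check when these lines are scraped from a web page.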

BLAST of CmoCh04G021150 vs. NCBI nr
Match: gi|449451908|ref|XP_004143702.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus])

HSP 1 Score: 732.3 bits (1889), Expect = 5.3e-208
Identity = 363/458 (79.26%), Positives = 395/458 (86.24%), Query Frame = 1

Query: 4   VPQFLLLLLLLLSSLADLSN------AIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLL 63
           +  F LL+LL    L  L N      A P+ +LK PLLH  PFSSPSQ+LSSDTHRLSLL
Sbjct: 6   ISPFFLLILLFSFFLTHLPNPNATAVAAPADFLKLPLLHKPPFSSPSQSLSSDTHRLSLL 65

Query: 64  FSAHRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCS 123
           FS  R +PTLKSPLISGASTGSGQYFV++ LGTPPQSLLLV DTGSDLVWVKCS CRNCS
Sbjct: 66  FS--RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCS 125

Query: 124 HHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSG 183
           HHPPSSAF PRHSSSFSPFHCFDPHCRLLPH P H CNHT LHSPC FLYSYAD SLSSG
Sbjct: 126 HHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSG 185

Query: 184 FFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSS 243
           FFSK+ TT  + SG+  +  L  LSFGCGFRISGPSVSGA+F GARGVMGLGRG ISFSS
Sbjct: 186 FFSKETTTLKSLSGS--EIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSS 245

Query: 244 QLGHRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYY 303
           QLG RFGN FSYCLMDYTLSPPPTS+LMIGGGL SLP+TNA+KISYTPLQINPLSPTFYY
Sbjct: 246 QLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYY 305

Query: 304 IVVKSITVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLP 363
           I + SIT+DGVKLPINP VW IDEQGNGGTVVDSGTTLTYL + AYEEVLK++RRRVKLP
Sbjct: 306 ITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLP 365

Query: 364 RALQLSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPV 423
            A +L+PGFDLCVNAS ESR  SLP++RFR+GGG VFAPP RNYF+ETEEGVMCLAIR V
Sbjct: 366 NAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAV 425

Query: 424 DSGNGFSVIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 456
           +SGNGFSVIGNLMQQGFLLEFD+E+SR+GF+RRGCGLP
Sbjct: 426 ESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of CmoCh04G021150 vs. NCBI nr
Match: gi|659073000|ref|XP_008467208.1| (PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo])

HSP 1 Score: 732.3 bits (1889), Expect = 5.3e-208
Identity = 365/461 (79.18%), Positives = 395/461 (85.68%), Query Frame = 1

Query: 1   MPCVPQFLLLLLLLLSSLADLSN------AIPSQYLKFPLLHTNPFSSPSQALSSDTHRL 60
           M  +  F LL+ L    L  LSN      A  + +LK PLLH  PFSSPSQ+LSSDTHRL
Sbjct: 3   MLSISPFFLLIPLFFFFLTHLSNPNATAVAAAADFLKLPLLHKPPFSSPSQSLSSDTHRL 62

Query: 61  SLLFSAHRHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCR 120
           SLLFS  R +PTLKSPLISGASTGSGQYFV++ LGTPPQSLLLV DTGSDLVWVKCS CR
Sbjct: 63  SLLFS--RPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACR 122

Query: 121 NCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSL 180
           NCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPH P H CNHT LHSPC FLYSYAD SL
Sbjct: 123 NCSHHPPSSAFFPRHSSSFSPFHCFDPHCRLLPHAPPHHCNHTLLHSPCRFLYSYADGSL 182

Query: 181 SSGFFSKDVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPIS 240
           SSGFFSK+ TT  T SG+  +  L  LSFGCGFRISGPSVSGA+F GARGVMGLGRG IS
Sbjct: 183 SSGFFSKETTTLKTLSGS--EIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSIS 242

Query: 241 FSSQLGHRFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPT 300
           FSSQLG RFGN FSYCLMDYTLSPPPTS+LMIGGGL SLPV NA+KISYTPLQINPLSPT
Sbjct: 243 FSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPVNNATKISYTPLQINPLSPT 302

Query: 301 FYYIVVKSITVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRV 360
           FYYI + SIT+DGVKLPINP VW IDEQGNGGTVVDSGTTLTYL + AYEEVLK++RRRV
Sbjct: 303 FYYITINSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRV 362

Query: 361 KLPRALQLSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAI 420
           KLP A +L+PGFDLCVNAS ESR  SLP++RFR+GGG VFAPP RNYF+ETEEGVMCLAI
Sbjct: 363 KLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAI 422

Query: 421 RPVDSGNGFSVIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 456
           R V+SGNGFSVIGNLMQQGFLLEFD+E+SR+GF+RRGCGLP
Sbjct: 423 RAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459

BLAST of CmoCh04G021150 vs. NCBI nr
Match: gi|359473000|ref|XP_002278677.2| (PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera])

HSP 1 Score: 579.3 bits (1492), Expect = 5.8e-162
Identity = 284/452 (62.83%), Positives = 346/452 (76.55%), Query Frame = 1

Query: 10  LLLLLLSSLADLSNAIPS------QYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSAHRH 69
           LLLLL+    D+ NA+P       +YLK  LLH  PF++PSQALS D+HRLS  FSA   
Sbjct: 11  LLLLLIFFFTDICNALPIAQNGTVEYLKLRLLHIKPFTTPSQALSFDSHRLSFFFSALHT 70

Query: 70  SPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPPSS 129
             +LKSP++SGASTGSGQYFV+L LGTPPQ LLLV DTGSDLVWVKCS CRNC+ H P S
Sbjct: 71  PQSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGS 130

Query: 130 AFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFFSKDV 189
           AF  RHS++FSP HC+D  C+L+P P  HRCNH  LHSPC + YSY D S +SGFFSK+ 
Sbjct: 131 AFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKET 190

Query: 190 TTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGHRF 249
           TT NT SG   + +L  ++FGC FRISGPSVSGA F GA GVMGLGRGPIS SSQLGHRF
Sbjct: 191 TTLNTSSG--REAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRF 250

Query: 250 GNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSI 309
           GN FSYCLMD+ +SP PTSYL+IG     +      ++ +TPL INPLSPTFYYI ++S+
Sbjct: 251 GNKFSYCLMDHDISPSPTSYLLIGSTQNDV-APGKRRMRFTPLHINPLSPTFYYIGIESV 310

Query: 310 TVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQLS 369
           +VDG+KLPINP VWA+DE GNGGT+VDSGTTLT+L E AY ++L  ++RRV+LP   + +
Sbjct: 311 SVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPT 370

Query: 370 PGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGNGF 429
           PGFDLCVN S     R LP++ F++GG  VF+PP RNYFV+T+E V CLA++ V + +GF
Sbjct: 371 PGFDLCVNVSEIEHPR-LPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGF 430

Query: 430 SVIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 456
           SVIGNLMQQGFLLEFD++++R+GFSR GC LP
Sbjct: 431 SVIGNLMQQGFLLEFDKDRTRLGFSRHGCALP 458

BLAST of CmoCh04G021150 vs. NCBI nr
Match: gi|802680767|ref|XP_012082020.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Jatropha curcas])

HSP 1 Score: 578.9 bits (1491), Expect = 7.5e-162
Identity = 290/454 (63.88%), Positives = 351/454 (77.31%), Query Frame = 1

Query: 7   FLLLLLLL-----LSSLADLSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSAH 66
           FLLLLLL      +S    +++    +YLK PLLH  PF SP+QAL  D  RLSLL   H
Sbjct: 9   FLLLLLLTDLCYSISLRTTVNSTATKEYLKLPLLHRTPFKSPAQALPFDIRRLSLL---H 68

Query: 67  RHSPTLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPP 126
           R   +LKSP+ISGASTGSGQYFV+L LG+P Q+LLLV DTGSDLVWVKCS C+NCS++ P
Sbjct: 69  RQRTSLKSPVISGASTGSGQYFVSLRLGSPAQTLLLVADTGSDLVWVKCSACKNCSNYSP 128

Query: 127 SSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFFSK 186
            SAF  RHSS+FS  HCF+  CRL+PHP  + CN T LHSPC + YSYAD S +SGFFSK
Sbjct: 129 GSAFLARHSSTFSLIHCFNSQCRLVPHPRPNPCNRTRLHSPCRYEYSYADGSSTSGFFSK 188

Query: 187 DVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGH 246
           + TT NT +G   + +L +L+FGCGFRISGPS++GA F GA GV+GLGR PISFSSQLG 
Sbjct: 189 ETTTLNTSAGR--EKKLKNLAFGCGFRISGPSLTGASFAGAHGVIGLGRAPISFSSQLGR 248

Query: 247 RFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVK 306
           RFGN FSYCLMDYTLSPPPTSYLMIGG   S  V+    +++TPL +N LSPTFYYI +K
Sbjct: 249 RFGNKFSYCLMDYTLSPPPTSYLMIGGHQNSA-VSRKRILNFTPLLVNSLSPTFYYIGIK 308

Query: 307 SITVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQ 366
           S++VDGVKLPINP VW+ID+ GNGGT++DSGTTLT+L E AY E+L A++RRVKLP   +
Sbjct: 309 SVSVDGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFLVEPAYREILSAIKRRVKLPGPGE 368

Query: 367 LSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGN 426
           L+PGFDLCVN S   R    P++   + G  VF+PP RNYF++T EGV CLAI+PV+SG+
Sbjct: 369 LTPGFDLCVNVSG-VRRPVFPRMSLELAGNSVFSPPPRNYFIDTSEGVKCLAIQPVNSGS 428

Query: 427 GFSVIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 456
           GFSVIGNLMQQG+LLEFDR++SR+GF+R GC LP
Sbjct: 429 GFSVIGNLMQQGYLLEFDRDRSRLGFARSGCALP 455

BLAST of CmoCh04G021150 vs. NCBI nr
Match: gi|255566835|ref|XP_002524401.1| (PREDICTED: aspartic proteinase nepenthesin-1 [Ricinus communis])

HSP 1 Score: 575.9 bits (1483), Expect = 6.4e-161
Identity = 279/454 (61.45%), Positives = 351/454 (77.31%), Query Frame = 1

Query: 7   FLLLLLLLLSSLADLSNAIPSQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSAHRHSP- 66
           F  LL+ L  S +  +N   ++YLK PLLH  PF+SPS+AL+ D +R   L   HRH   
Sbjct: 7   FFFLLITLCPSSSAAANTT-TEYLKLPLLHKTPFTSPSEALAFDINRRLSLLHHHRHQQQ 66

Query: 67  ----TLKSPLISGASTGSGQYFVNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPP 126
               + +SP+ISGAS+GSGQYFV+L +GTPPQ+LLLV DTGSDL+WVKCSPCRNCSH  P
Sbjct: 67  HKQNSFRSPVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSP 126

Query: 127 SSAFFPRHSSSFSPFHCFDPHCRLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFFSK 186
            SAFF RHS+++S  HC+ P C+L+PHP  + CN T LHSPC + Y+YADSS ++GFFSK
Sbjct: 127 GSAFFARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSK 186

Query: 187 DVTTFNTFSGTHTQTRLNDLSFGCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGH 246
           +  T NT +G     +LN LSFGCGFRISGPS++GA F GA+GVMGLGR PISFSSQLG 
Sbjct: 187 EALTLNTSTG--KVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGR 246

Query: 247 RFGNTFSYCLMDYTLSPPPTSYLMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVK 306
           RFG+ FSYCLMDYTLSPPPTS+L IGG  +++ V+    +S+TPL INPLSPTFYYI +K
Sbjct: 247 RFGSKFSYCLMDYTLSPPPTSFLTIGGA-QNVAVSKKGIMSFTPLLINPLSPTFYYIAIK 306

Query: 307 SITVDGVKLPINPKVWAIDEQGNGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQ 366
            + V+GVKLPINP VW+ID+ GNGGT++DSGTTLT++ E AY E+LKA ++RVKLP   +
Sbjct: 307 GVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAE 366

Query: 367 LSPGFDLCVNASSESRMRSLPQIRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGN 426
            +PGFDLC+N S  +R  +LP++ F + GG VF+PP RNYF+ET + + CLA++PV    
Sbjct: 367 PTPGFDLCMNVSGVTR-PALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDG 426

Query: 427 GFSVIGNLMQQGFLLEFDREKSRMGFSRRGCGLP 456
           GFSV+GNLMQQGFLLEFDR+KSR+GF+RRGC LP
Sbjct: 427 GFSVLGNLMQQGFLLEFDRDKSRLGFTRRGCALP 455

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
APF2_ARATH1.2e-5836.11Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
NEP2_NEPGR7.5e-5332.76Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
NEP1_NEPGR1.4e-5134.48Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
ASPG1_ARATH3.0e-4932.48Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
ASPG2_ARATH1.1e-4833.16Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana GN=ASPG2 PE=2 ... [more]
Match NameE-valueIdentityDescription
A0A0A0KNH6_CUCSA3.7e-20879.26Uncharacterized protein OS=Cucumis sativus GN=Csa_5G174650 PE=3 SV=1[more]
F6HF17_VITVI4.0e-16262.83Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g02930 PE=3 SV=... [more]
A0A067K2U7_JATCU5.3e-16263.88Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18279 PE=3 SV=1[more]
B9SEI2_RICCO4.5e-16161.45Basic 7S globulin 2 small subunit, putative OS=Ricinus communis GN=RCOM_0705030 ... [more]
Q9LI73_ARATH1.0e-15762.75Aspartyl protease family protein OS=Arabidopsis thaliana GN=At3g25700 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
AT3G25700.15.2e-16162.75 Eukaryotic aspartyl protease family protein[more]
AT2G42980.14.5e-6437.50 Eukaryotic aspartyl protease family protein[more]
AT1G01300.16.8e-6036.11 Eukaryotic aspartyl protease family protein[more]
AT3G59080.11.2e-5936.09 Eukaryotic aspartyl protease family protein[more]
AT3G61820.12.4e-5734.84 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|449451908|ref|XP_004143702.1|5.3e-20879.26PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis sativus][more]
gi|659073000|ref|XP_008467208.1|5.3e-20879.18PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo][more]
gi|359473000|ref|XP_002278677.2|5.8e-16262.83PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera][more]
gi|802680767|ref|XP_012082020.1|7.5e-16263.88PREDICTED: aspartic proteinase nepenthesin-1 [Jatropha curcas][more]
gi|255566835|ref|XP_002524401.1|6.4e-16161.45PREDICTED: aspartic proteinase nepenthesin-1 [Ricinus communis][more]
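
As an illustration of how the summary rows can be used, the sketch below ranks the five NCBI nr hits by E-value (smaller is better) and breaks ties by percent identity. The values are copied from the rows above; the function name `rank_hits` and the default cutoff of 1e-50 are assumptions made for this example, not settings used by the database.

```python
# (match name, E-value, percent identity), copied from the NCBI nr rows.
hits = [
    ("gi|449451908|ref|XP_004143702.1|", 5.3e-208, 79.26),
    ("gi|659073000|ref|XP_008467208.1|", 5.3e-208, 79.18),
    ("gi|359473000|ref|XP_002278677.2|", 5.8e-162, 62.83),
    ("gi|802680767|ref|XP_012082020.1|", 7.5e-162, 63.88),
    ("gi|255566835|ref|XP_002524401.1|", 6.4e-161, 61.45),
]


def rank_hits(rows, max_evalue=1e-50):
    """Keep hits at or below the cutoff, best first.

    Sort key: ascending E-value, then descending percent identity.
    The cutoff is a hypothetical choice for this sketch.
    """
    kept = [row for row in rows if row[1] <= max_evalue]
    return sorted(kept, key=lambda row: (row[1], -row[2]))


ranked = rank_hits(hits)
for name, evalue, identity in ranked:
    print(f"{name}\t{evalue:.1e}\t{identity:.2f}")
```

Note that a hit with a slightly worse E-value can still have higher identity (the Jatropha curcas hit versus the Vitis vinifera hit), which is why the two columns are best read together.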
The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term       Definition
IPR001461  Aspartic_peptidase_A1
IPR001969  Aspartic_peptidase_AS
IPR021109  Peptidase_aspartic_dom_sf

Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity

Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G021150.1  CmoCh04G021150.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                  Source   Source Term        Source Description               Alignment
IPR001461  Aspartic peptidase A1 family     PRINTS   PR00792            PEPSIN                           coord: 327..338, score: 1.0E-7; coord: 88..108, score: 1.0E-7; coord: 424..439, score: 1. ...
IPR001461  Aspartic peptidase A1 family     PANTHER  PTHR13683          ASPARTYL PROTEASES               coord: 7..453, score: 6.9E ...
IPR001969  Aspartic peptidase, active site  PROSITE  PS00141            ASP_PROTEASE                     coord: 97..108, score: ...
IPR021109  Aspartic peptidase domain        GENE3D   G3DSA:2.40.70.10   -                                coord: 70..267, score: 2.2E-38; coord: 277..452, score: 1.1 ...
IPR021109  Aspartic peptidase domain        unknown  SSF50630           Acid proteases                   coord: 76..452, score: 2.6 ...
None       No IPR available                 PANTHER  PTHR13683:SF354    ASPARTYL PROTEASE FAMILY PROTEIN  coord: 7..453, score: 6.9E ...
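
The `coord: start..end` entries in the alignment column can be turned into span lengths with a short script. This is a sketch only: it assumes the coordinates are 1-based, inclusive residue positions on the protein, which the record does not state, and the name `span_lengths` is hypothetical.

```python
import re

# Matches entries such as "coord: 70..267".
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_lengths(text):
    """Return the length of each span, assuming inclusive coordinates."""
    return [int(end) - int(start) + 1 for start, end in COORD.findall(text)]


# The two GENE3D (G3DSA:2.40.70.10) spans listed for IPR021109.
gene3d = "coord: 70..267 score: 2.2E-38 coord: 277..452"
print(span_lengths(gene3d))  # [198, 176]
```

Under that assumption, the two GENE3D spans cover 198 and 176 positions, leaving a gap of nine positions (268 to 276) between them.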