CmoCh02G006780 (gene) Cucurbita moschata (Rifu)

Name: CmoCh02G006780
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Eukaryotic aspartyl protease family protein, putative
Location: Cmo_Chr02 : 4302239 .. 4303967 (-)
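
As a quick sanity check on the coordinates above, the span of the feature can be computed from the location string. This is a minimal sketch, not part of the database record: `parse_location` is a hypothetical helper, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a location such as 'Cmo_Chr02 : 4302239 .. 4303967 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cmo_Chr02 : 4302239 .. 4303967 (-)")

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
length = end - start + 1
print(seqid, strand, length)
```

Under that assumption the feature spans 1729 bp on the minus strand of Cmo_Chr02.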
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, exon, three_prime_UTR
ATGTCGCCGATTTCTCATCTTTTAATCCTTTTCTTCGTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCCGAAACAAGAAAGCGATGCCAATAATGAAGAAAAAGAATTCGTGAGGCTGGATCTGATACACCGCCACCATCCGGAAGTGGTTAAGAGGCTTCATGATGAAATTAAGGTGGATAAGATGGAGGATCGCATCAAGGATATTCGATATCACGATCAATCTCGCCTCCGAGCCATCTCCGTCCACATGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGAGAAGAAGGAGGCGTCGAGTTCGAACCTTCCTCCACAGTCGCAGACTCCAATAGCATTGAAAACATACCCCGGCGCTGATTTCGGTAGCAGTGAGTTTTTCGTGCAATTGAAATTGGGAACGCCGCCGCAGAAGTTCACGATGATTGCAGATACCGGAAGTGACCTATTGTGGACGAGATGCAGATACCGGCGGTGCAGGGGAGATTGCAGCAACCCCTCTCCGATCCATAAAATGCGTAACAGAATGAGAGAGAGATTCATTTACGCGCTTTATGCGAATCAGTCATCTTCTTTCTCCCCAATTCCTTGTTCCTCCAAGCAGTGTATCCAGGATTTCTCTGAGCTCGGCGGCCAACCCGATTGTCCAACCCCTAACTCCCCTTGTTCCTATACCTACAGGTATTAATAGTAATTAATATTATTATTATTATTTTTTTAAATAAAATTTTGGTGGGGACCATTATAACAATAAATGGATGTTGGTTGGTTTGTTTAAAAAAAAAATAACAGCTACTTAAGTGGGGACCGCGCGATGGGAATATTCGCAACCGAGACGGTAACGGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATACGGCTGCACCGAAGAAATAACTGACAGCCAGTTCTTGGACGGAGCCGATGGCCTCATTGGCCTAGGCTCTAGCATCTACTCCTTCGTTTACAAAGCGGCCGAAAACAACGTCGGCGGCGGCTTCTCCTACTGCCTCGCCGACCACCTCCGCAATATAACCGCCATTAGCTACTTCGTCTTCGGCACCCCCTCCCCCAAGACCTTCGCCGCCTCCACATCCTCTCCCATCGGCCCCCCCGCCACCACTAGACTCATCACCGGCGGCCGATACAGCTGCTACTACGGCGTCCAACTGGCCGGAATCTCCGTGGACGGACAGATCCTGAACATCCCCCCTCACGTCTGGAACATCAAGTCCGGTTGCGGCACCATCTTGGACACCGGCACCAGCCTGACGATGCTGACGGCGCCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCCAAGATCGAAAAATTCGGAAGAATGGAAAGGGATGTAAAGGGTGAAAGGGAGAAGAACTTCAAACTTTGCTTCAATGACACGCAGTGGAATTTTGGTATGTTGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGCGTCATACCAATGTAGCTGTATTGCCATAACTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACCTTTGGCAATTTGATTTACTCAAGGGATCCGTCACTTTTGCTCCCTCCGACTGCGCCTAGAACTTCTCCATTTTCTTTCATTTATTACTTCCTTCTTATTAAT

mRNA sequence

ATGTCGCCGATTTCTCATCTTTTAATCCTTTTCTTCGTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCCGAAACAAGAAAGCGATGCCAATAATGAAGAAAAAGAATTCGTGAGGCTGGATCTGATACACCGCCACCATCCGGAAGTGGTTAAGAGGCTTCATGATGAAATTAAGGTGGATAAGATGGAGGATCGCATCAAGGATATTCGATATCACGATCAATCTCGCCTCCGAGCCATCTCCGTCCACATGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGAGAAGAAGGAGGCGTCGAGTTCGAACCTTCCTCCACAGTCGCAGACTCCAATAGCATTGAAAACATACCCCGGCGCTGATTTCGGTAGCAGTGAGTTTTTCGTGCAATTGAAATTGGGAACGCCGCCGCAGAAGTTCACGATGATTGCAGATACCGGAAGTGACCTATTGTGGACGAGATGCAGATACCGGCGGTGCAGGGGAGATTGCAGCAACCCCTCTCCGATCCATAAAATGCGTAACAGAATGAGAGAGAGATTCATTTACGCGCTTTATGCGAATCAGTCATCTTCTTTCTCCCCAATTCCTTGTTCCTCCAAGCAGTGTATCCAGGATTTCTCTGAGCTCGGCGGCCAACCCGATTGTCCAACCCCTAACTCCCCTTGTTCCTATACCTACAGCTACTTAAGTGGGGACCGCGCGATGGGAATATTCGCAACCGAGACGGTAACGGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATACGGCTGCACCGAAGAAATAACTGACAGCCAGTTCTTGGACGGAGCCGATGGCCTCATTGGCCTAGGCTCTAGCATCTACTCCTTCGTTTACAAAGCGGCCGAAAACAACGTCGGCGGCGGCTTCTCCTACTGCCTCGCCGACCACCTCCGCAATATAACCGCCATTAGCTACTTCGTCTTCGGCACCCCCTCCCCCAAGACCTTCGCCGCCTCCACATCCTCTCCCATCGGCCCCCCCGCCACCACTAGACTCATCACCGGCGGCCGATACAGCTGCTACTACGGCGTCCAACTGGCCGGAATCTCCGTGGACGGACAGATCCTGAACATCCCCCCTCACGTCTGGAACATCAAGTCCGGTTGCGGCACCATCTTGGACACCGGCACCAGCCTGACGATGCTGACGGCGCCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCCAAGATCGAAAAATTCGGAAGAATGGAAAGGGATGTAAAGGGTGAAAGGGAGAAGAACTTCAAACTTTGCTTCAATGACACGCAGTGGAATTTTGGTATGTTGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGCGTCATACCAATGTAGCTGTATTGCCATAACTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACCTTTGGCAATTTGATTTACTCAAGGGATCCGTCACTTTTGCTCCCTCCGACTGCGCCTAGAACTTCTCCATTTTCTTTCATTTATTACTTCCTTCTTATTAAT

Coding sequence (CDS)

ATGTCGCCGATTTCTCATCTTTTAATCCTTTTCTTCGTCTTCTTCTCTCCTCTCACCGTCGCAGTCGCCGATCAAAGCAATGCCAATAATCCGAAACAAGAAAGCGATGCCAATAATGAAGAAAAAGAATTCGTGAGGCTGGATCTGATACACCGCCACCATCCGGAAGTGGTTAAGAGGCTTCATGATGAAATTAAGGTGGATAAGATGGAGGATCGCATCAAGGATATTCGATATCACGATCAATCTCGCCTCCGAGCCATCTCCGTCCACATGAATTGGACCAAAGTTGTGGAGAATGCGGAGGAGAAGGAGAAGAAGGAGGCGTCGAGTTCGAACCTTCCTCCACAGTCGCAGACTCCAATAGCATTGAAAACATACCCCGGCGCTGATTTCGGTAGCAGTGAGTTTTTCGTGCAATTGAAATTGGGAACGCCGCCGCAGAAGTTCACGATGATTGCAGATACCGGAAGTGACCTATTGTGGACGAGATGCAGATACCGGCGGTGCAGGGGAGATTGCAGCAACCCCTCTCCGATCCATAAAATGCGTAACAGAATGAGAGAGAGATTCATTTACGCGCTTTATGCGAATCAGTCATCTTCTTTCTCCCCAATTCCTTGTTCCTCCAAGCAGTGTATCCAGGATTTCTCTGAGCTCGGCGGCCAACCCGATTGTCCAACCCCTAACTCCCCTTGTTCCTATACCTACAGCTACTTAAGTGGGGACCGCGCGATGGGAATATTCGCAACCGAGACGGTAACGGTAAGACTAACAAACGGAAAAGAAAAGCAACTGAAGGACATTCTATACGGCTGCACCGAAGAAATAACTGACAGCCAGTTCTTGGACGGAGCCGATGGCCTCATTGGCCTAGGCTCTAGCATCTACTCCTTCGTTTACAAAGCGGCCGAAAACAACGTCGGCGGCGGCTTCTCCTACTGCCTCGCCGACCACCTCCGCAATATAACCGCCATTAGCTACTTCGTCTTCGGCACCCCCTCCCCCAAGACCTTCGCCGCCTCCACATCCTCTCCCATCGGCCCCCCCGCCACCACTAGACTCATCACCGGCGGCCGATACAGCTGCTACTACGGCGTCCAACTGGCCGGAATCTCCGTGGACGGACAGATCCTGAACATCCCCCCTCACGTCTGGAACATCAAGTCCGGTTGCGGCACCATCTTGGACACCGGCACCAGCCTGACGATGCTGACGGCGCCGGCTCACGATGCGGTGATAGAAGCGATGGCTCCCAAGATCGAAAAATTCGGAAGAATGGAAAGGGATGTAAAGGGTGAAAGGGAGAAGAACTTCAAACTTTGCTTCAATGACACGCAGTGGAATTTTGGTATGTTGCCGAAGCTTGGATTCCATTTCGAAGGCGGGGCGGTGTTCGAACCGCCGGATAGGAGCTACATCGTTTCGGCGTCATACCAATGTAGCTGTATTGCCATAACTTCTCTGCCCTTTCCGTCAATCAATATCTTAGGGAATATTATTCAGCAAACTTACCTTTGGCAATTTGATTTACTCAAGGGATCCGTCACTTTTGCTCCCTCCGACTGCGCCTAG
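
The coding sequence above can be translated to protein with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code (NCBI translation table 1) and reading frame 1; `translate` and `CODON_TABLE` are illustrative names, and only the first 36 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 nt of the CDS listed above.
cds_start = "ATGTCGCCGATTTCTCATCTTTTAATCCTTTTCTTC"
print(translate(cds_start))  # MSPISHLLILFF
```

The result, MSPISHLLILFF, agrees with the first residues of the Query line in the A0A0A0KG92_CUCSA alignment in the TrEMBL results of this record.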
BLAST of CmoCh02G006780 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 160.6 bits (405), Expect = 4.6e-38
Identity = 136/448 (30.36%), Positives = 205/448 (45.76%), Query Frame = 1

Query: 85  LRAISVHMNWTK--VVENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLK 144
           L  +    N TK  +++ A ++ ++   S N   QS + I    Y G      E+ + + 
Sbjct: 46  LEQVDSGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAG----DGEYLMNVA 105

Query: 145 LGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSS 204
           +GTP   F+ I DTGSDL+WT+C    C    S P+PI   ++              SSS
Sbjct: 106 IGTPDSSFSAIMDTGSDLIWTQC--EPCTQCFSQPTPIFNPQD--------------SSS 165

Query: 205 FSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGK 264
           FS +PC S+ C QD       P     N+ C YTY Y  G    G  ATET T   ++  
Sbjct: 166 FSTLPCESQYC-QDL------PSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSS-- 225

Query: 265 EKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVG-GGFSYCLADHLR 324
              + +I +GC E+       +GA GLIG+G    S       + +G G FSYC+     
Sbjct: 226 ---VPNIAFGCGEDNQGFGQGNGA-GLIGMGWGPLSL-----PSQLGVGQFSYCMTS--- 285

Query: 325 NITAISYFVFGTPSPKTFA---ASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQI 384
                    +G+ SP T A   A++  P G P+TT LI       YY + L GI+V G  
Sbjct: 286 ---------YGSSSPSTLALGSAASGVPEGSPSTT-LIHSSLNPTYYYITLQGITVGGDN 345

Query: 385 LNIPPHVWNIKSG--CGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGERE 444
           L IP   + ++     G I+D+GT+LT L   A++AV +A   +I            E  
Sbjct: 346 LGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQI------NLPTVDESS 405

Query: 445 KNFKLCFND-TQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSIN 504
                CF   +  +   +P++   F+GG V    +++ ++S +    C+A+ S     I+
Sbjct: 406 SGLSTCFQQPSDGSTVQVPEISMQFDGG-VLNLGEQNILISPAEGVICLAMGSSSQLGIS 435

Query: 505 ILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
           I GNI QQ     +DL   +V+F P+ C
Sbjct: 466 IFGNIQQQETQVLYDLQNLAVSFVPTQC 435
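
The percentages in each HSP summary line are simply the identity and positive counts divided by the alignment length. The sketch below recomputes them for the NEP2_NEPGR hit above; `hsp_percentages` is a hypothetical helper written for this illustration, not part of any BLAST tool.

```python
import re

def hsp_percentages(line):
    """Recompute identity/positives percentages from a BLAST HSP summary line."""
    # The optional 'i' lets the misspelt 'Postives' variant parse as well.
    m = re.search(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, n1, pos, n2 = map(int, m.groups())
    return round(100 * ident / n1, 2), round(100 * pos / n2, 2)

summary = "Identity = 136/448 (30.36%), Positives = 205/448 (45.76%), Query Frame = 1"
print(hsp_percentages(summary))  # (30.36, 45.76)
```

Both recomputed values match the percentages printed in the report: 136 of 448 aligned positions are identical and 205 of 448 are scored as positive.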

BLAST of CmoCh02G006780 vs. Swiss-Prot
Match: CDR1_ARATH (Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 1.2e-33
Identity = 110/391 (28.13%), Positives = 174/391 (44.50%), Query Frame = 1

Query: 134 SSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIY 193
           S E+ + + +GTPP     IADTGSDLLWT+C    C    +   P+   +         
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQC--APCDDCYTQVDPLFDPKT-------- 146

Query: 194 ALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATET 253
                 SS++  + CSS QC    + L  Q  C T ++ CSY+ SY       G  A +T
Sbjct: 147 ------SSTYKDVSCSSSQC----TALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDT 206

Query: 254 VTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFS 313
           +T+  ++ +  QLK+I+ GC        F     G++GLG    S + K   +++ G FS
Sbjct: 207 LTLGSSDTRPMQLKNIIIGCGHN-NAGTFNKKGSGIVGLGGGPVSLI-KQLGDSIDGKFS 266

Query: 314 YCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGIS 373
           YCL          S   FGT +  + +   S+P        LI       +Y + L  IS
Sbjct: 267 YCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTP--------LIAKASQETFYYLTLKSIS 326

Query: 374 VDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKG 433
           V  + +           G   I+D+GT+LT+L    +  + +A+A  I      + + K 
Sbjct: 327 VGSKQIQYSGSDSESSEG-NIIIDSGTTLTLLPTEFYSELEDAVASSI------DAEKKQ 386

Query: 434 EREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPS 493
           + +    LC++ T      +P +  HF+G  V      ++ V  S    C A      PS
Sbjct: 387 DPQSGLSLCYSAT--GDLKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFAFRG--SPS 435

Query: 494 INILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
            +I GN+ Q  +L  +D +  +V+F P+DCA
Sbjct: 447 FSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CmoCh02G006780 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 142.5 bits (358), Expect = 1.3e-32
Identity = 119/395 (30.13%), Positives = 172/395 (43.54%), Query Frame = 1

Query: 133 GSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFI 192
           G  E+ + L +GTP Q F+ I DTGSDL+WT+C  + C    +  +PI   +        
Sbjct: 91  GDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQC--QPCTQCFNQSTPIFNPQG------- 150

Query: 193 YALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATE 252
                  SSSFS +PCSS+ C     +    P C   N+ C YTY Y  G    G   TE
Sbjct: 151 -------SSSFSTLPCSSQLC-----QALSSPTC--SNNFCQYTYGYGDGSETQGSMGTE 210

Query: 253 TVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGF 312
           T+T    +     + +I +GC E        +GA GL+G+G    S   +         F
Sbjct: 211 TLTFGSVS-----IPNITFGCGENNQGFGQGNGA-GLVGMGRGPLSLPSQLDVTK----F 270

Query: 313 SYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGI 372
           SYC+       T I      TPS     +  +S       T LI   +   +Y + L G+
Sbjct: 271 SYCM-------TPIG---SSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGL 330

Query: 373 SVDGQILNIPPHVWNIKSGCGT---ILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMER 432
           SV    L I P  + + S  GT   I+D+GT+LT     A+ +V      + E   ++  
Sbjct: 331 SVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSV------RQEFISQINL 390

Query: 433 DVKGEREKNFKLCFNDTQWNFGM-LPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITS 492
            V       F LCF        + +P    HF+GG + E P  +Y +S S    C+A+ S
Sbjct: 391 PVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGDL-ELPSENYFISPSNGLICLAMGS 434

Query: 493 LPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
                ++I GNI QQ  L  +D     V+FA + C
Sbjct: 451 -SSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmoCh02G006780 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 139.0 bits (349), Expect = 1.4e-31
Identity = 118/399 (29.57%), Positives = 171/399 (42.86%), Query Frame = 1

Query: 129 GADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMR 188
           G   GS E+F +L +GTP +   M+ DTGSD++W +C    CR   S   PI   R    
Sbjct: 134 GLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC--APCRRCYSQSDPIFDPR---- 193

Query: 189 ERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGI 248
                     +S +++ IPCSS  C +  S       C T    C Y  SY  G   +G 
Sbjct: 194 ----------KSKTYATIPCSSPHCRRLDS-----AGCNTRRKTCLYQVSYGDGSFTVGD 253

Query: 249 FATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNV 308
           F+TET+T R       ++K +  GC  +  +     GA GL+GLG    SF  +   +  
Sbjct: 254 FSTETLTFR-----RNRVKGVALGCGHD--NEGLFVGAAGLLGLGKGKLSFPGQTG-HRF 313

Query: 309 GGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQ 368
              FSYCL D   +           PS   F  +  S I     T L++  +   +Y V 
Sbjct: 314 NQKFSYCLVDRSAS---------SKPSSVVFGNAAVSRIA--RFTPLLSNPKLDTFYYVG 373

Query: 369 LAGISVDG-QILNIPPHVWNIK--SGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFG 428
           L GISV G ++  +   ++ +      G I+D+GTS+T L  PA+ A+ +A     +   
Sbjct: 374 LLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLK 433

Query: 429 RMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIA 488
           R            F  CF+ +  N   +P +  HF G  V  P     I   +    C A
Sbjct: 434 R------APDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKFCFA 485

Query: 489 ITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
                   ++I+GNI QQ +   +DL    V FAP  CA
Sbjct: 494 FAG-TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CmoCh02G006780 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 127.5 bits (319), Expect = 4.3e-28
Identity = 111/450 (24.67%), Positives = 193/450 (42.89%), Query Frame = 1

Query: 81  DQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQ 140
           D SR+  I   + +   VE  +  + K   + +   Q++  +      GA  GS E+F +
Sbjct: 109 DSSRVAGIVAKIRFA--VEGVDRSDLKPVYNEDTRYQTED-LTTPVVSGASQGSGEYFSR 168

Query: 141 LKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS 200
           + +GTP ++  ++ DTGSD+ W +C    C        P+                   S
Sbjct: 169 IGVGTPAKEMYLVLDTGSDVNWIQC--EPCADCYQQSDPVFN--------------PTSS 228

Query: 201 SSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTN 260
           S++  + CS+ QC      L     C   ++ C Y  SY  G   +G  AT+TVT     
Sbjct: 229 STYKSLTCSAPQC-----SLLETSAC--RSNKCLYQVSYGDGSFTVGELATDTVTF---- 288

Query: 261 GKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHL 320
           G   ++ ++  GC  +  +     GA GL+GLG  + S   +    +    FSYCL D  
Sbjct: 289 GNSGKINNVALGCGHD--NEGLFTGAAGLLGLGGGVLSITNQMKATS----FSYCLVDR- 348

Query: 321 RNITAISYFVFGTPSPKTFAASTSSPI--GPPATTRLITGGRYSCYYGVQLAGISVDGQI 380
                         S K+ +   +S    G  AT  L+   +   +Y V L+G SV G+ 
Sbjct: 349 -------------DSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEK 408

Query: 381 LNIPPHVWNI--KSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGERE 440
           + +P  ++++      G ILD GT++T L   A++++ +A         +    +     
Sbjct: 409 VVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSI----- 468

Query: 441 KNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIV---SASYQCSCIAITSLPFPS 500
             F  C++ +  +   +P + FHF GG   + P ++Y++    +   C   A TS    S
Sbjct: 469 SLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTS---SS 500

Query: 501 INILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
           ++I+GN+ QQ     +DL K  +  + + C
Sbjct: 529 LSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmoCh02G006780 vs. TrEMBL
Match: A0A0A0KG92_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G134390 PE=3 SV=1)

HSP 1 Score: 516.9 bits (1330), Expect = 2.8e-143
Identity = 269/539 (49.91%), Positives = 355/539 (65.86%), Query Frame = 1

Query: 1   MSPISHLLILFFVFF--------SPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHR 60
           MSPIS+    FF F         S    A+ D+ N  N     + + +E+E ++ DL+HR
Sbjct: 8   MSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHR 67

Query: 61  HHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEAS-- 120
           HHP+V +++H ++K+  + +R+KDI  HD +R R+IS  MN  K VE+A  + + EA+  
Sbjct: 68  HHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRHRSISKSMN-QKQVEDARLRAEAEAATE 127

Query: 121 -----SSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRC 180
                S+ LPP + TPI ++   GADFGSSE+FV+LK+GTP Q F +IADTGSDL W +C
Sbjct: 128 EEVAKSAILPPATSTPIGMRMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKC 187

Query: 181 RYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPD 240
           RYRRC G+CS+ +  HK +N  ++RF +A  AN SSSF  + CSS  C  D ++L    +
Sbjct: 188 RYRRCFGNCSS-NVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVRE 247

Query: 241 CPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDG 300
           C  P SPC Y YSY  G  A GIFA ET+TV LTNGKEKQL + + GCTE +  S F  G
Sbjct: 248 CHNPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVF-GG 307

Query: 301 ADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSS 360
           ADG++GLG+S YS  YKAAEN  GGGFSYCL DHL +  AISYFV G P+P T A+++S+
Sbjct: 308 ADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSA 367

Query: 361 PIGPPAT-TRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTM 420
            +    T T+L  G  YS +YGV L GIS +G +LNIP  VW+I SG GTI+D+GTSLT+
Sbjct: 368 KLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTI 427

Query: 421 LTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGA 480
           L APA D V+EA+ P+++KF ++E +        F  CFN++Q+   M PKL FHF  G 
Sbjct: 428 LAAPAFDMVMEALTPRLKKFQQLEIE-------PFDFCFNNSQYTHEMAPKLRFHFGDGT 487

Query: 481 VFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
           VFEPP +SYIVS     SCI   S+PFP+ NI+GNI+QQ +LWQFD  K  V FAPS+C
Sbjct: 488 VFEPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC 536

BLAST of CmoCh02G006780 vs. TrEMBL
Match: A5BLS9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_015630 PE=3 SV=1)

HSP 1 Score: 338.2 bits (866), Expect = 1.8e-89
Identity = 186/479 (38.83%), Positives = 275/479 (57.41%), Query Frame = 1

Query: 45  VRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEK 104
           +RL+LIHRH P+V+ R   +++      R+K++ + D  R   I   +   ++      +
Sbjct: 1   MRLELIHRHSPQVMGRPKTQLQ------RLKELVHSDSVRQLMILHKLRGGQI----PRR 60

Query: 105 EKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTR 164
           + KE  SS+    S   I +  +P AD+G  ++FV  K+GTP QKF ++ADTGSDL W  
Sbjct: 61  KAKEVLSSSSGRGSDDAIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMS 120

Query: 165 CRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQP 224
           C+Y     +CSN     +   R+R + ++  +AN SSSF  IPC +  C  +  +L    
Sbjct: 121 CKYHCRSRNCSN-----RKARRIRHKRVF--HANLSSSFKTIPCLTDMCKIELMDLFSLT 180

Query: 225 DCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLD 284
           +CPTP +PC Y Y Y  G  A+G FA ETVTV L  G++ +L ++L GC+E      F  
Sbjct: 181 NCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSF-Q 240

Query: 285 GADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTS 344
            ADG++GLG S YSF  KAAE   GG FSYCL DHL +    +Y  FG+      + S  
Sbjct: 241 AADGVMGLGYSKYSFAIKAAEK-FGGKFSYCLVDHLSHKNVSNYLTFGS------SRSKE 300

Query: 345 SPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTM 404
           + +     T L+ G   + +Y V + GIS+ G +L IP  VW++K   GTILD+G+SLT 
Sbjct: 301 ALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTF 360

Query: 405 LTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGA 464
           LT PA+  V+ A+   + KF ++E D+        + CFN T +   ++P+L FHF  GA
Sbjct: 361 LTEPAYQPVMAALRVSLLKFRKVEMDIGP-----LEYCFNSTGFEESLVPRLVFHFADGA 420

Query: 465 VFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
            FEPP +SY++SA+    C+   S+ +P  +++GNI+QQ +LW+FDL    + FAPS C
Sbjct: 421 EFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448

BLAST of CmoCh02G006780 vs. TrEMBL
Match: F6H9S0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0085g01110 PE=3 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 1.5e-88
Identity = 185/479 (38.62%), Positives = 274/479 (57.20%), Query Frame = 1

Query: 45  VRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEK 104
           +RL+LIHRH P+V+ R   +++      R+K++ + D  R   I   +   ++      +
Sbjct: 41  MRLELIHRHSPQVMGRPKTQLQ------RLKELVHSDSVRQLMILHKLRGGQI----PRR 100

Query: 105 EKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTR 164
           + KE  SS+    S   I +  +P AD+G  ++ V  K+GTP QKF ++ADTGSDL W  
Sbjct: 101 KAKEVLSSSSGRGSDDAIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMS 160

Query: 165 CRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQP 224
           C+Y     +CSN     +   R+R + ++  +AN SSSF  IPC +  C  +  +L    
Sbjct: 161 CKYHCRSRNCSN-----RKARRIRHKRVF--HANLSSSFKTIPCLTDMCKIELMDLFSLT 220

Query: 225 DCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLD 284
           +CPTP +PC Y Y Y  G  A+G FA ETVTV L  G++ +L ++L GC+E      F  
Sbjct: 221 NCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSF-Q 280

Query: 285 GADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTS 344
            ADG++GLG S YSF  KAAE   GG FSYCL DHL +    +Y  FG+      + S  
Sbjct: 281 AADGVMGLGYSKYSFAIKAAEK-FGGKFSYCLVDHLSHKNVSNYLTFGS------SRSKE 340

Query: 345 SPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTM 404
           + +     T L+ G   + +Y V + GIS+ G +L IP  VW++K   GTILD+G+SLT 
Sbjct: 341 ALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTF 400

Query: 405 LTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGA 464
           LT PA+  V+ A+   + KF ++E D+        + CFN T +   ++P+L FHF  GA
Sbjct: 401 LTEPAYQPVMAALRVSLLKFRKVEMDIGP-----LEYCFNSTGFEESLVPRLVFHFADGA 460

Query: 465 VFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
            FEPP +SY++SA+    C+   S+ +P  +++GNI+QQ +LW+FDL    + FAPS C
Sbjct: 461 EFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 488

BLAST of CmoCh02G006780 vs. TrEMBL
Match: A0A0B0NTS3_GOSAR (Asparticase nepenthesin-1 OS=Gossypium arboreum GN=F383_00615 PE=3 SV=1)

HSP 1 Score: 325.5 bits (833), Expect = 1.2e-85
Identity = 194/519 (37.38%), Positives = 278/519 (53.56%), Query Frame = 1

Query: 7   LLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIK 66
           +L+ F V FS     V  Q + +  + + D+N+     + L+LIHRH P+          
Sbjct: 7   ILVPFMVLFS----MVVAQQHVDQMQHQHDSNS-----ITLELIHRHAPQFTNN-----N 66

Query: 67  VDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQTPIALKT 126
                 R+ D+ YHD  R   I  H         A+E++   AS           I +  
Sbjct: 67  PITQHQRLVDLLYHDIIR-HGIMSHRR------RAKEEDPLTAS-----------IKMPL 126

Query: 127 YPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGD--CSNPSPIHKMR 186
             G DFG  ++    K+GTP QKF +I DTGSDL W RCRYR  RGD  C++   I++ R
Sbjct: 127 ASGRDFGIGQYITSFKVGTPSQKFWLIVDTGSDLTWIRCRYRCSRGDRSCTSKGRINRKR 186

Query: 187 NRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDR 246
                      +A  SSSF+P+PC S+ C  +   L     CPTP +PC+Y Y Y  G  
Sbjct: 187 ---------VFHAPLSSSFNPVPCFSEMCKVELMNLFSLTTCPTPITPCAYDYRYSDGSA 246

Query: 247 AMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAA 306
           AMG+FA ETV+  LTNG++ +L ++L GCT+       L   DG++GL ++ YSF   AA
Sbjct: 247 AMGVFANETVSAGLTNGRKTRLHNVLIGCTDSF-QGPTLQNVDGIMGLANTKYSFATNAA 306

Query: 307 ENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCY 366
               GG FSYCL DHL ++ A +Y +FGT   +   +      G    T+L      S +
Sbjct: 307 AT-FGGKFSYCLVDHLSHLNATNYIIFGTNRNQVKVS------GNTRHTKLELDAIPS-F 366

Query: 367 YGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKF 426
           Y V + GISV  ++L IP  VW+   G GTI+D+GTSLT L  PA+ AV+EA+   + K+
Sbjct: 367 YAVNVIGISVGNKMLEIPMQVWDASEGGGTIIDSGTSLTFLADPAYQAVMEALKVSVSKY 426

Query: 427 GRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCI 486
            R++ D         + CFN T +N  ++PKL  HF+ GA FEP   SY+++A+ +  C+
Sbjct: 427 QRVKLD-----GVPMEYCFNSTGFNGSLVPKLIIHFDDGARFEPHWNSYVIAAAAEVRCL 470

Query: 487 AITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
                 FP+++++GNI+QQ YLW+FDL    + FAPS C
Sbjct: 487 GFLPARFPALSVIGNIMQQNYLWEFDLKGKRLVFAPSSC 470

BLAST of CmoCh02G006780 vs. TrEMBL
Match: W9QQY3_9ROSA (Aspartic proteinase nepenthesin-1 OS=Morus notabilis GN=L484_019203 PE=3 SV=1)

HSP 1 Score: 322.8 bits (826), Expect = 7.8e-85
Identity = 190/480 (39.58%), Positives = 266/480 (55.42%), Query Frame = 1

Query: 46  RLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKE 105
           RL+L+HR+ P++ ++   +I    ME   K I +H +  LR   V          +  + 
Sbjct: 25  RLELLHRNSPKLSEKW--QIPETTME---KLIEFHRRDVLRHRMV----------SHRRM 84

Query: 106 KKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRC 165
             E +SS     S + IA+    GAD+G  E+FV + +GTP Q+F ++ADTGSDL W  C
Sbjct: 85  GIETASS-----SASSIAMPMNAGADYGVGEYFVHVTVGTPGQRFMLVADTGSDLTWMHC 144

Query: 166 RYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPD 225
           R       C      HK R   R  F    +A++SSSF  IPC S+ C  + + L     
Sbjct: 145 R-------CGRRCGTHKGRLNNRRVF----HADRSSSFKTIPCLSEMCKVELANLFSLSK 204

Query: 226 CPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEI--TDSQFL 285
           CPTP +PC+Y Y YL G  A+G FA ET++VRL NGK+++L+D+L GCTE +   +    
Sbjct: 205 CPTPLTPCAYDYRYLEGSSAIGFFANETISVRLANGKKRKLRDVLVGCTESVQGAEESGF 264

Query: 286 DGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAAST 345
            GADG++GLG   ++F  KAA+   GG FSYCL DHL      +Y +FG    K   AS 
Sbjct: 265 KGADGVLGLGFGNHTFTRKAAQ-YFGGKFSYCLVDHLSPKNLSNYIIFG--HDKADKASC 324

Query: 346 SSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLT 405
           SS +     T L+ GG Y  +YGV L+GIS+ G +L IP   WN   G G IL++GTSLT
Sbjct: 325 SSSL---QHTDLVLGGDYGPFYGVNLSGISIGGVLLRIPSVAWNASLGGGAILESGTSLT 384

Query: 406 MLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGG 465
            LT P +  V   +     +FG +     G     F+ CFN T ++   +P L  HF  G
Sbjct: 385 FLTDPVYGPVTSELNKFTSRFGTLLPPGGGP----FEFCFNSTGYDESKMPPLRIHFSNG 444

Query: 466 AVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
           A+FEPP +SYI+  + +  C+   S  +P  +I+GNI+QQ +LW+FDL    + FAPS C
Sbjct: 445 AIFEPPVKSYILDIAPEKKCLGFVSASWPGTSIIGNIMQQNHLWEFDLENTRLGFAPSTC 463

BLAST of CmoCh02G006780 vs. TAIR10
Match: AT3G12700.1 (AT3G12700.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 277.7 bits (709), Expect = 1.5e-74
Identity = 156/427 (36.53%), Positives = 234/427 (54.80%), Query Frame = 1

Query: 98  VENAEEKEKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTG 157
           +E+    ++K  S  +    S   + +    G D+G++++F ++++GTP +KF ++ DTG
Sbjct: 67  IEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTG 126

Query: 158 SDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDF 217
           S+L W  CRYR    D           NR   R      A++S SF  + C ++ C  D 
Sbjct: 127 SELTWVNCRYRARGKD-----------NRRVFR------ADESKSFKTVGCLTQTCKVDL 186

Query: 218 SELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEI 277
             L     CPTP++PCSY Y Y  G  A G+FA ET+TV LTNG+  +L   L GC+   
Sbjct: 187 MNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSF 246

Query: 278 TDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPK 337
           T   F  GADG++GL  S +SF    A +  G  FSYCL DHL N    +Y +FG+    
Sbjct: 247 TGQSF-QGADGVLGLAFSDFSFT-STATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRST 306

Query: 338 TFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILD 397
             A   ++P+        +T  R   +Y + + GIS+   +L+IP  VW+  SG GTILD
Sbjct: 307 KTAFRRTTPLD-------LT--RIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILD 366

Query: 398 TGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQ-WNFGMLPKL 457
           +GTSLT+L   A+  V+  +A  + +  R++ +         + CF+ T  +N   LP+L
Sbjct: 367 SGTSLTLLADAAYKQVVTGLARYLVELKRVKPE-----GVPIEYCFSFTSGFNVSKLPQL 426

Query: 458 GFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSV 517
            FH +GGA FEP  +SY+V A+    C+   S   P+ N++GNI+QQ YLW+FDL+  ++
Sbjct: 427 TFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTL 460

Query: 518 TFAPSDC 524
           +FAPS C
Sbjct: 487 SFAPSAC 460

BLAST of CmoCh02G006780 vs. TAIR10
Match: AT3G25700.1 (AT3G25700.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 208.8 bits (530), Expect = 8.3e-54
Identity = 137/411 (33.33%), Positives = 201/411 (48.91%), Query Frame = 1

Query: 129 GADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMR 188
           GA  GS ++FV L++G PPQ   +IADTGSDL+W +C    CR +CS+ SP         
Sbjct: 76  GAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCS--ACR-NCSHHSPAT------- 135

Query: 189 ERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPD-CPTPN-----SPCSYTYSYLSG 248
                  +   SS+FSP  C    C      L  +PD  P  N     S C Y Y Y  G
Sbjct: 136 -----VFFPRHSSTFSPAHCYDPVC-----RLVPKPDRAPICNHTRIHSTCHYEYGYADG 195

Query: 249 DRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQF----LDGADGLIGLGSSIYS 308
               G+FA ET +++ ++GKE +LK + +GC   I+         +GA+G++GLG    S
Sbjct: 196 SLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPIS 255

Query: 309 FVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITG 368
           F  +      G  FSYCL D+  +    SY + G         +    I     T L+T 
Sbjct: 256 FASQLGRR-FGNKFSYCLMDYTLSPPPTSYLIIG---------NGGDGISKLFFTPLLTN 315

Query: 369 GRYSCYYGVQLAGISVDGQILNIPPHVWNI--KSGCGTILDTGTSLTMLTAPAHDAVIEA 428
                +Y V+L  + V+G  L I P +W I      GT++D+GT+L  L  PA+ +VI A
Sbjct: 316 PLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAA 375

Query: 429 MAPKIEKFGRMERDVKGEREKNFKLCFN--DTQWNFGMLPKLGFHFEGGAVFEPPDRSYI 488
           +        R++  +       F LC N         +LP+L F F GGAVF PP R+Y 
Sbjct: 376 VR------RRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYF 435

Query: 489 VSASYQCSCIAITSL-PFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           +    Q  C+AI S+ P    +++GN++QQ +L++FD  +  + F+   CA
Sbjct: 436 IETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450

BLAST of CmoCh02G006780 vs. TAIR10
Match: AT2G42980.1 (AT2G42980.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 169.9 bits (429), Expect = 4.3e-42
Identity = 147/518 (28.38%), Positives = 227/518 (43.82%), Query Frame = 1

Query: 23  ADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQ 82
           A  S +N+    S  ++  KE  R  +     P+   R+  E K  +    + D++  D 
Sbjct: 52  ASSSTSNDCGFSSKEHDPSKEHTRESV----KPQ--SRIKQETK--RTTHSVVDLQIQDL 111

Query: 83  SRLRAISVHMNWTKVVENAEEKEK--KEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQ 142
           +R++ +    N +K  +N + ++K   + S    P  S   +      G   GS E+F+ 
Sbjct: 112 TRIKTLHARFNKSKKQKNEKVRKKITSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMD 171

Query: 143 LKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQS 202
           + +GTPP+ F++I DTGSDL W +C    C  DC + + +                   S
Sbjct: 172 VLVGTPPKHFSLILDTGSDLNWLQC--LPCY-DCFHQNGMF-------------YDPKTS 231

Query: 203 SSFSPIPCSSKQCIQDFSELGGQPD----CPTPNSPCSYTYSYLSGDRAMGIFATETVTV 262
           +SF  I C+  +C      L   PD    C + N  C Y Y Y       G FA ET TV
Sbjct: 232 ASFKNITCNDPRC-----SLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTV 291

Query: 263 RLT----NGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGF 322
            LT       E ++ ++++GC     +     GA GL+GLG    SF     ++  G  F
Sbjct: 292 NLTTTEGGSSEYKVGNMMFGCGH--WNRGLFSGASGLLGLGRGPLSF-SSQLQSLYGHSF 351

Query: 323 SYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYS--CYYGVQLA 382
           SYCL D   N    S  +FG    K     T+        T  + G   S   +Y +Q+ 
Sbjct: 352 SYCLVDRNSNTNVSSKLIFG--EDKDLLNHTNLNF-----TSFVNGKENSVETFYYIQIK 411

Query: 383 GISVDGQILNIPPHVWNIKS--GCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRME 442
            I V G+ L+IP   WNI S    GTI+D+GT+L+    PA++ +    A K+++   + 
Sbjct: 412 SILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIF 471

Query: 443 RDVKGEREKNFKLCFN--DTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAI 502
           RD           CFN    + N   LP+LG  F  G V+  P  +  +  S    C+AI
Sbjct: 472 RDF-----PVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAI 525

Query: 503 TSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
              P  + +I+GN  QQ +   +D  +  + F P+ CA
Sbjct: 532 LGTPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCA 525

BLAST of CmoCh02G006780 vs. TAIR10
Match: AT3G59080.1 (AT3G59080.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 162.9 bits (411), Expect = 5.2e-40
Identity = 145/530 (27.36%), Positives = 231/530 (43.58%), Query Frame = 1

Query: 14  FFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIKVDKMEDR 73
           F +P+    A  S +N+    S      KE    +   + H   +KR           + 
Sbjct: 43  FPNPMRFGSASSSTSNDCGFSSPEKEPTKERTGENKTVKFH---LKRRETTTTEKATTNS 102

Query: 74  IKDIRYHDQSRLRAIS---VHMNWTKVVENAEEKEKKEASS-----SNLPPQSQTPIALK 133
           + +++  D +R++ +    +  N    V   ++K  KE  +     S++  Q+   +A  
Sbjct: 103 VLELQIRDLTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVTTTPVASSVEEQAGQLVATL 162

Query: 134 TYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRN 193
              G   GS E+F+ + +G+PP+ F++I DTGSDL W +C    C  DC   +       
Sbjct: 163 E-SGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQC--LPCY-DCFQQNG------ 222

Query: 194 RMRERFIYALY-ANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTP----NSPCSYTYSYL 253
                   A Y    S+S+  I C+ ++C      L   PD P P    N  C Y Y Y 
Sbjct: 223 --------AFYDPKASASYKNITCNDQRC-----NLVSSPDPPMPCKSDNQSCPYYYWYG 282

Query: 254 SGDRAMGIFATETVTVRL-TNGKEKQL---KDILYGCTEEITDSQFLDGADGLIGLGSSI 313
                 G FA ET TV L TNG   +L   +++++GC     +     GA GL+GLG   
Sbjct: 283 DSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGH--WNRGLFHGAAGLLGLGRGP 342

Query: 314 YSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLI 373
            SF     ++  G  FSYCL D   +    S  +FG         + +          L+
Sbjct: 343 LSF-SSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLV 402

Query: 374 TGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKS--GCGTILDTGTSLTMLTAPAHDAVI 433
                  +Y VQ+  I V G++LNIP   WNI S    GTI+D+GT+L+    PA++ + 
Sbjct: 403 -----DTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIK 462

Query: 434 EAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYI 493
             +A K +    + RD           CFN +  +   LP+LG  F  GAV+  P  +  
Sbjct: 463 NKIAEKAKGKYPVYRDF-----PILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSF 522

Query: 494 VSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
           +  +    C+A+   P  + +I+GN  QQ +   +D  +  + +AP+ CA
Sbjct: 523 IWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCA 533
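
The percentages in an HSP header follow directly from the reported counts: for the AT3G59080.1 hit above, 145 identical positions over an alignment length of 530 gives 27.36%. The sketch below, a minimal illustration and not part of the database's own tooling, parses one such header line and recomputes both percentages. The names `HSP_STATS`, `parse_hsp_stats`, and `percent` are hypothetical helpers introduced only for this example, and the pattern assumes the header layout shown on this page.

```python
import re

# Pattern for the HSP statistics line as it appears on this page.
# "Posi?tives" also tolerates the "Postives" spelling found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    if length != pos_length:
        raise ValueError("identity and positives use different lengths")
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of alignment columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    header = "Identity = 145/530 (27.36%), Positives = 231/530 (43.58%), Query Frame = 1"
    stats = parse_hsp_stats(header)
    print(percent(stats["identities"], stats["length"]), stats["identity_pct"])
    print(percent(stats["positives"], stats["length"]), stats["positives_pct"])
```

The denominator is the alignment length including gap columns, not the length of either protein, which is why the same query yields different denominators (530, 391, 538, ...) across hits.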

BLAST of CmoCh02G006780 vs. TAIR10
Match: AT5G33340.1 (AT5G33340.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 146.0 bits (367), Expect = 6.6e-35
Identity = 110/391 (28.13%), Positives = 174/391 (44.50%), Query Frame = 1

Query: 134 SSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGDCSNPSPIHKMRNRMRERFIY 193
           S E+ + + +GTPP     IADTGSDLLWT+C    C    +   P+   +         
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQC--APCDDCYTQVDPLFDPKT-------- 146

Query: 194 ALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDRAMGIFATET 253
                 SS++  + CSS QC    + L  Q  C T ++ CSY+ SY       G  A +T
Sbjct: 147 ------SSTYKDVSCSSSQC----TALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDT 206

Query: 254 VTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAAENNVGGGFS 313
           +T+  ++ +  QLK+I+ GC        F     G++GLG    S + K   +++ G FS
Sbjct: 207 LTLGSSDTRPMQLKNIIIGCGHN-NAGTFNKKGSGIVGLGGGPVSLI-KQLGDSIDGKFS 266

Query: 314 YCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCYYGVQLAGIS 373
           YCL          S   FGT +  + +   S+P        LI       +Y + L  IS
Sbjct: 267 YCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTP--------LIAKASQETFYYLTLKSIS 326

Query: 374 VDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKFGRMERDVKG 433
           V  + +           G   I+D+GT+LT+L    +  + +A+A  I      + + K 
Sbjct: 327 VGSKQIQYSGSDSESSEG-NIIIDSGTTLTLLPTEFYSELEDAVASSI------DAEKKQ 386

Query: 434 EREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCIAITSLPFPS 493
           + +    LC++ T      +P +  HF+G  V      ++ V  S    C A      PS
Sbjct: 387 DPQSGLSLCYSAT--GDLKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFAFRG--SPS 435

Query: 494 INILGNIIQQTYLWQFDLLKGSVTFAPSDCA 525
            +I GN+ Q  +L  +D +  +V+F P+DCA
Sbjct: 447 FSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435

BLAST of CmoCh02G006780 vs. NCBI nr
Match: gi|659112547|ref|XP_008456273.1| (PREDICTED: aspartic proteinase CDR1 [Cucumis melo])

HSP 1 Score: 524.6 bits (1350), Expect = 1.9e-145
Identity = 274/538 (50.93%), Positives = 360/538 (66.91%), Query Frame = 1

Query: 1   MSPISH-----LLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHP 60
           MSPIS+     LL+ F  F S    A+ D++N  N   + D    E++ +R DL+HRHHP
Sbjct: 8   MSPISNFCFFFLLLFFLSFSSSFLFALGDEANNYNNNDDED----EQQTIRFDLLHRHHP 67

Query: 61  EVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEAS----- 120
           +V ++L+ ++K+  + +R+KDI  HD++R R+IS  MN  K +E+A  + + EA+     
Sbjct: 68  QVSEKLNGDMKIQDLHERMKDIHEHDRNRHRSISKSMN-QKQIEDARLRAEAEAATQVEV 127

Query: 121 --SSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYR 180
             S+ LPP + TPI +K   GADFGSSE+FVQLK+GTP Q F +IADTGSDL W +CRYR
Sbjct: 128 AKSAILPPATSTPIGMKMISGADFGSSEYFVQLKVGTPAQTFMLIADTGSDLTWLKCRYR 187

Query: 181 RCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPT 240
           RC G+CS  +  HK +N  ++RF +AL ANQSS+F  + CSS  C  + +EL    +C T
Sbjct: 188 RCFGNCSG-NVNHKSKNEKKQRFRHALLANQSSTFKTVSCSSTMCTNNLAELFAVAECDT 247

Query: 241 PNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADG 300
           P SPC Y YSY  G  A GIFA ET+TV LTNGKEKQL++ + GCT EI      DGADG
Sbjct: 248 PTSPCVYDYSYAGGASAKGIFAWETLTVGLTNGKEKQLRNSIIGCT-EIVQGNVFDGADG 307

Query: 301 LIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIG 360
           ++GLG+S YS  YKAAEN  GGGFSYCL DHL +  A+SYFV G P+P T A+++S+   
Sbjct: 308 VMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAVSYFVLGVPTPSTSASTSSAK-- 367

Query: 361 PPAT---TRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTML 420
           PPA    T+L  G  YS +YGV L GIS DGQ+LNIPP VW+   GCGTI+D+GTSLT+L
Sbjct: 368 PPAKMSYTKLYVGDPYSSFYGVDLIGISADGQMLNIPPRVWDSYKGCGTIIDSGTSLTVL 427

Query: 421 TAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAV 480
             PA D V+E +  ++++F ++E +        F  CFN++Q+   M PKL FHF  G V
Sbjct: 428 ATPAFDVVMEVLTSRLKQFQQIEIE-------PFNFCFNNSQYTHDMAPKLRFHFGDGTV 487

Query: 481 FEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
           FEPP +SYIVS     SCI I S+PFPS+NI+GNI+QQ +LWQFD  K  V FA S+C
Sbjct: 488 FEPPTKSYIVSVGEFISCIGIVSMPFPSLNIIGNILQQNHLWQFDFQKRRVGFAASEC 529

BLAST of CmoCh02G006780 vs. NCBI nr
Match: gi|778713001|ref|XP_004140022.2| (PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus])

HSP 1 Score: 516.9 bits (1330), Expect = 4.1e-143
Identity = 269/539 (49.91%), Positives = 355/539 (65.86%), Query Frame = 1

Query: 1   MSPISHLLILFFVFF--------SPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHR 60
           MSPIS+    FF F         S    A+ D+ N  N     + + +E+E ++ DL+HR
Sbjct: 8   MSPISNFCFFFFFFLLFFFLSFSSSFLFALGDEDNNFNNNNNINDDEDEQEIIKFDLLHR 67

Query: 61  HHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEAS-- 120
           HHP+V +++H ++K+  + +R+KDI  HD +R R+IS  MN  K VE+A  + + EA+  
Sbjct: 68  HHPQVAEKIHGDMKIQDVSERMKDIHEHDHNRHRSISKSMN-QKQVEDARLRAEAEAATE 127

Query: 121 -----SSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRC 180
                S+ LPP + TPI ++   GADFGSSE+FV+LK+GTP Q F +IADTGSDL W +C
Sbjct: 128 EEVAKSAILPPATSTPIGMRMISGADFGSSEYFVELKVGTPAQTFMLIADTGSDLTWMKC 187

Query: 181 RYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPD 240
           RYRRC G+CS+ +  HK +N  ++RF +A  AN SSSF  + CSS  C  D ++L    +
Sbjct: 188 RYRRCFGNCSS-NVNHKSKNEKKQRFRHAFLANHSSSFKTVSCSSTMCTNDLADLFAVRE 247

Query: 241 CPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDG 300
           C  P SPC Y YSY  G  A GIFA ET+TV LTNGKEKQL + + GCTE +  S F  G
Sbjct: 248 CHNPTSPCVYDYSYTGGASAKGIFAWETLTVGLTNGKEKQLHNSIIGCTESVQGSVF-GG 307

Query: 301 ADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSS 360
           ADG++GLG+S YS  YKAAEN  GGGFSYCL DHL +  AISYFV G P+P T A+++S+
Sbjct: 308 ADGVMGLGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSA 367

Query: 361 PIGPPAT-TRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTM 420
            +    T T+L  G  YS +YGV L GIS +G +LNIP  VW+I SG GTI+D+GTSLT+
Sbjct: 368 KLPAKMTYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINSGGGTIIDSGTSLTI 427

Query: 421 LTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGA 480
           L APA D V+EA+ P+++KF ++E +        F  CFN++Q+   M PKL FHF  G 
Sbjct: 428 LAAPAFDMVMEALTPRLKKFQQLEIE-------PFDFCFNNSQYTHEMAPKLRFHFGDGT 487

Query: 481 VFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
           VFEPP +SYIVS     SCI   S+PFP+ NI+GNI+QQ +LWQFD  K  V FAPS+C
Sbjct: 488 VFEPPTKSYIVSVGKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC 536

BLAST of CmoCh02G006780 vs. NCBI nr
Match: gi|147814824|emb|CAN65806.1| (hypothetical protein VITISV_015630 [Vitis vinifera])

HSP 1 Score: 338.2 bits (866), Expect = 2.6e-89
Identity = 186/479 (38.83%), Positives = 275/479 (57.41%), Query Frame = 1

Query: 45  VRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEK 104
           +RL+LIHRH P+V+ R   +++      R+K++ + D  R   I   +   ++      +
Sbjct: 1   MRLELIHRHSPQVMGRPKTQLQ------RLKELVHSDSVRQLMILHKLRGGQI----PRR 60

Query: 105 EKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTR 164
           + KE  SS+    S   I +  +P AD+G  ++FV  K+GTP QKF ++ADTGSDL W  
Sbjct: 61  KAKEVLSSSSGRGSDDAIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMS 120

Query: 165 CRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQP 224
           C+Y     +CSN     +   R+R + ++  +AN SSSF  IPC +  C  +  +L    
Sbjct: 121 CKYHCRSRNCSN-----RKARRIRHKRVF--HANLSSSFKTIPCLTDMCKIELMDLFSLT 180

Query: 225 DCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLD 284
           +CPTP +PC Y Y Y  G  A+G FA ETVTV L  G++ +L ++L GC+E      F  
Sbjct: 181 NCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSF-Q 240

Query: 285 GADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTS 344
            ADG++GLG S YSF  KAAE   GG FSYCL DHL +    +Y  FG+      + S  
Sbjct: 241 AADGVMGLGYSKYSFAIKAAEK-FGGKFSYCLVDHLSHKNVSNYLTFGS------SRSKE 300

Query: 345 SPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTM 404
           + +     T L+ G   + +Y V + GIS+ G +L IP  VW++K   GTILD+G+SLT 
Sbjct: 301 ALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTF 360

Query: 405 LTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGA 464
           LT PA+  V+ A+   + KF ++E D+        + CFN T +   ++P+L FHF  GA
Sbjct: 361 LTEPAYQPVMAALRVSLLKFRKVEMDIGP-----LEYCFNSTGFEESLVPRLVFHFADGA 420

Query: 465 VFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
            FEPP +SY++SA+    C+   S+ +P  +++GNI+QQ +LW+FDL    + FAPS C
Sbjct: 421 EFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448

BLAST of CmoCh02G006780 vs. NCBI nr
Match: gi|731434480|ref|XP_002265771.3| (PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera])

HSP 1 Score: 335.1 bits (858), Expect = 2.2e-88
Identity = 185/479 (38.62%), Positives = 274/479 (57.20%), Query Frame = 1

Query: 45  VRLDLIHRHHPEVVKRLHDEIKVDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEK 104
           +RL+LIHRH P+V+ R   +++      R+K++ + D  R   I   +   ++      +
Sbjct: 41  MRLELIHRHSPQVMGRPKTQLQ------RLKELVHSDSVRQLMILHKLRGGQI----PRR 100

Query: 105 EKKEASSSNLPPQSQTPIALKTYPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTR 164
           + KE  SS+    S   I +  +P AD+G  ++ V  K+GTP QKF ++ADTGSDL W  
Sbjct: 101 KAKEVLSSSSGRGSDDAIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMS 160

Query: 165 CRYRRCRGDCSNPSPIHKMRNRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQP 224
           C+Y     +CSN     +   R+R + ++  +AN SSSF  IPC +  C  +  +L    
Sbjct: 161 CKYHCRSRNCSN-----RKARRIRHKRVF--HANLSSSFKTIPCLTDMCKIELMDLFSLT 220

Query: 225 DCPTPNSPCSYTYSYLSGDRAMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLD 284
           +CPTP +PC Y Y Y  G  A+G FA ETVTV L  G++ +L ++L GC+E      F  
Sbjct: 221 NCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSF-Q 280

Query: 285 GADGLIGLGSSIYSFVYKAAENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTS 344
            ADG++GLG S YSF  KAAE   GG FSYCL DHL +    +Y  FG+      + S  
Sbjct: 281 AADGVMGLGYSKYSFAIKAAEK-FGGKFSYCLVDHLSHKNVSNYLTFGS------SRSKE 340

Query: 345 SPIGPPATTRLITGGRYSCYYGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTM 404
           + +     T L+ G   + +Y V + GIS+ G +L IP  VW++K   GTILD+G+SLT 
Sbjct: 341 ALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTF 400

Query: 405 LTAPAHDAVIEAMAPKIEKFGRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGA 464
           LT PA+  V+ A+   + KF ++E D+        + CFN T +   ++P+L FHF  GA
Sbjct: 401 LTEPAYQPVMAALRVSLLKFRKVEMDIGP-----LEYCFNSTGFEESLVPRLVFHFADGA 460

Query: 465 VFEPPDRSYIVSASYQCSCIAITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
            FEPP +SY++SA+    C+   S+ +P  +++GNI+QQ +LW+FDL    + FAPS C
Sbjct: 461 EFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 488

BLAST of CmoCh02G006780 vs. NCBI nr
Match: gi|728835766|gb|KHG15209.1| (Asparticase nepenthesin-1 [Gossypium arboreum])

HSP 1 Score: 325.5 bits (833), Expect = 1.7e-85
Identity = 194/519 (37.38%), Positives = 278/519 (53.56%), Query Frame = 1

Query: 7   LLILFFVFFSPLTVAVADQSNANNPKQESDANNEEKEFVRLDLIHRHHPEVVKRLHDEIK 66
           +L+ F V FS     V  Q + +  + + D+N+     + L+LIHRH P+          
Sbjct: 7   ILVPFMVLFS----MVVAQQHVDQMQHQHDSNS-----ITLELIHRHAPQFTNN-----N 66

Query: 67  VDKMEDRIKDIRYHDQSRLRAISVHMNWTKVVENAEEKEKKEASSSNLPPQSQTPIALKT 126
                 R+ D+ YHD  R   I  H         A+E++   AS           I +  
Sbjct: 67  PITQHQRLVDLLYHDIIR-HGIMSHRR------RAKEEDPLTAS-----------IKMPL 126

Query: 127 YPGADFGSSEFFVQLKLGTPPQKFTMIADTGSDLLWTRCRYRRCRGD--CSNPSPIHKMR 186
             G DFG  ++    K+GTP QKF +I DTGSDL W RCRYR  RGD  C++   I++ R
Sbjct: 127 ASGRDFGIGQYITSFKVGTPSQKFWLIVDTGSDLTWIRCRYRCSRGDRSCTSKGRINRKR 186

Query: 187 NRMRERFIYALYANQSSSFSPIPCSSKQCIQDFSELGGQPDCPTPNSPCSYTYSYLSGDR 246
                      +A  SSSF+P+PC S+ C  +   L     CPTP +PC+Y Y Y  G  
Sbjct: 187 ---------VFHAPLSSSFNPVPCFSEMCKVELMNLFSLTTCPTPITPCAYDYRYSDGSA 246

Query: 247 AMGIFATETVTVRLTNGKEKQLKDILYGCTEEITDSQFLDGADGLIGLGSSIYSFVYKAA 306
           AMG+FA ETV+  LTNG++ +L ++L GCT+       L   DG++GL ++ YSF   AA
Sbjct: 247 AMGVFANETVSAGLTNGRKTRLHNVLIGCTDSF-QGPTLQNVDGIMGLANTKYSFATNAA 306

Query: 307 ENNVGGGFSYCLADHLRNITAISYFVFGTPSPKTFAASTSSPIGPPATTRLITGGRYSCY 366
               GG FSYCL DHL ++ A +Y +FGT   +   +      G    T+L      S +
Sbjct: 307 AT-FGGKFSYCLVDHLSHLNATNYIIFGTNRNQVKVS------GNTRHTKLELDAIPS-F 366

Query: 367 YGVQLAGISVDGQILNIPPHVWNIKSGCGTILDTGTSLTMLTAPAHDAVIEAMAPKIEKF 426
           Y V + GISV  ++L IP  VW+   G GTI+D+GTSLT L  PA+ AV+EA+   + K+
Sbjct: 367 YAVNVIGISVGNKMLEIPMQVWDASEGGGTIIDSGTSLTFLADPAYQAVMEALKVSVSKY 426

Query: 427 GRMERDVKGEREKNFKLCFNDTQWNFGMLPKLGFHFEGGAVFEPPDRSYIVSASYQCSCI 486
            R++ D         + CFN T +N  ++PKL  HF+ GA FEP   SY+++A+ +  C+
Sbjct: 427 QRVKLD-----GVPMEYCFNSTGFNGSLVPKLIIHFDDGARFEPHWNSYVIAAAAEVRCL 470

Query: 487 AITSLPFPSINILGNIIQQTYLWQFDLLKGSVTFAPSDC 524
                 FP+++++GNI+QQ YLW+FDL    + FAPS C
Sbjct: 487 GFLPARFPALSVIGNIMQQNYLWEFDLKGKRLVFAPSSC 470

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
NEP2_NEPGR   4.6e-38  30.36         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
CDR1_ARATH   1.2e-33  28.13         Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
NEP1_NEPGR   1.3e-32  30.13         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
APF2_ARATH   1.4e-31  29.57         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1
ASPG1_ARATH  4.3e-28  24.67         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...

Match Name        E-value   Identity (%)  Description
A0A0A0KG92_CUCSA  2.8e-143  49.91         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G134390 PE=3 SV=1
A5BLS9_VITVI      1.8e-89   38.83         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_015630 PE=3 SV=1
F6H9S0_VITVI      1.5e-88   38.62         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0085g01110 PE=3 SV=...
A0A0B0NTS3_GOSAR  1.2e-85   37.38         Asparticase nepenthesin-1 OS=Gossypium arboreum GN=F383_00615 PE=3 SV=1
W9QQY3_9ROSA      7.8e-85   39.58         Aspartic proteinase nepenthesin-1 OS=Morus notabilis GN=L484_019203 PE=3 SV=1

Match Name   E-value  Identity (%)  Description
AT3G12700.1  1.5e-74  36.53         Eukaryotic aspartyl protease family protein
AT3G25700.1  8.3e-54  33.33         Eukaryotic aspartyl protease family protein
AT2G42980.1  4.3e-42  28.38         Eukaryotic aspartyl protease family protein
AT3G59080.1  5.2e-40  27.36         Eukaryotic aspartyl protease family protein
AT5G33340.1  6.6e-35  28.13         Eukaryotic aspartyl protease family protein

Match Name                        E-value   Identity (%)  Description
gi|659112547|ref|XP_008456273.1|  1.9e-145  50.93         PREDICTED: aspartic proteinase CDR1 [Cucumis melo]
gi|778713001|ref|XP_004140022.2|  4.1e-143  49.91         PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2 [Cucumis sativus]
gi|147814824|emb|CAN65806.1|      2.6e-89   38.83         hypothetical protein VITISV_015630 [Vitis vinifera]
gi|731434480|ref|XP_002265771.3|  2.2e-88   38.62         PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|728835766|gb|KHG15209.1|       1.7e-85   37.38         Asparticase nepenthesin-1 [Gossypium arboreum]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001461  Aspartic_peptidase_A1
IPR021109  Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
biological_process  GO:0044238      primary metabolic process
biological_process  GO:0030163      protein catabolic process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0004190      aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh02G006780.1  CmoCh02G006780.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description               Source   Source Term       Source Description               Alignment
IPR001461  Aspartic peptidase A1 family  PRINTS   PR00792           PEPSIN                           coord: 495..510, score: 1.1E-9; coord: 394..405, score: 1.1E-9; coord: 143..163, score: 1.
IPR001461  Aspartic peptidase A1 family  PANTHER  PTHR13683         ASPARTYL PROTEASES               coord: 123..523, score: 1.1E-111; coord: 7..106, score: 1.1E
IPR021109  Aspartic peptidase domain     GENE3D   G3DSA:2.40.70.10                                   coord: 200..332, score: 1.9E-34; coord: 134..167, score: 1.9E-34; coord: 352..524, score: 3.4
IPR021109  Aspartic peptidase domain     unknown  SSF50630          Acid proteases                   coord: 129..523, score: 1.02
None       No IPR available              unknown  Coil              Coil                             coord: 523..524, scor
None       No IPR available              PANTHER  PTHR13683:SF280   ASPARTYL PROTEASE FAMILY PROTEIN coord: 123..523, score: 1.1E-111; coord: 7..106, score: 1.1E