CmaCh01G003360 (gene) Cucurbita maxima (Rimu)

NameCmaCh01G003360
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionEukaryotic aspartyl protease family protein
LocationCma_Chr01 : 1651637 .. 1652893 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCTCTCTCTCTCCTCTTTCTGGCCCTATTCTCTCTCCTATTCTCCCAATCAAACTCCCTCTCTCTCTCCTTCCCTCTCACCTCCCTTCCTCTCTCCCAAGAACCCTCCTTTTCCCTCTCCTCCAAAACCTCACCCCATGACTCCACGAAGCTCCCCTTCAAATACTCCAACGCCCTGGTGGTCTCTTTCCCCGTTGGGTCGCCGCCGCAGCCGATGGACATGGTGCTAGACACCGGCAGCCAACTCTCTTGGGTTCAATGCCACGGAAAAGTAAGGACAGAGTCAGTGATTAATCGGTTTGACCCTTATCTCTCCTCCACTTTCTCTCACCTCCCCTGCAACAACTCTCTCTGCAAACCCCGAATTCCCGATTTTACCCTTCCTACTTCTTGTGACCGACATCGCCACTGCCACTACTCATACTTCTACGGCGATGGGACCTTGGCTGAGGGTAATCTCGTCACTGAAAAAATCACCTTCTCTAATTCCTTAACTACCCTCTCCCTCGTTCTCGGCTGCGCTACGACCTCCACCGAAAACAGGGGTATGTTGGGAATGAATAAGGGACGCCTCTCCTTCATCTCCCAGGCCAGAATAACCAAGTTTTCTTATTGTGTCCCGGATCGAATCGGGTCGGATCCAACCGGGTTGTTCTACCTCGGAGACAACCCAAATTCGGGTAAATTCAAATACGTCAAAATGTTGACTTTCACCAAAAGTCGACGCTCCACGAATCTTGACAAGTTGGCCTACACCCTCCCAATGAATGGGATTAGAATAGGCAAAAACCACCTCAACATCTCGCGGGCCGTTTTCAAACCGGACCCATCTGGCGCCGGTCAGACCATGATCGACTCCGGCTCGGATTTGACTTATTTGGTAGATGAAGCTTACAGCAAGGTTAGAGCAGAGATAGTGAGATTAGTGGGGCCCATGATGAAAAAAGCATATGAATACGCCGCCGTCGACATGTGTTTCGACGGCGCAGAGGCGGCGGTGGTGGGTCGGAGAATTGGCAACATGTGGTTCAAGTTTGAGAATGGGGTGGAGATATTGGTCGGGAAAGGAGAAGGGTTATTGACGGAAGTGGAAAAAGGAGTGAAGTGCGTGGGGATCGGACGGTCAGATAGACTTGTGATTGAGAGTAATATAATCGGGATCATTCATCAAAAGAATATGTGGGTGGAGTACGATCTGGCCAATAAGAGAGTTGGGTTCGGTGAAGCTGAGTGTAGTAGATTGAAGGCCTGGTGA

mRNA sequence

ATGCCTCTCTCTCTCCTCTTTCTGGCCCTATTCTCTCTCCTATTCTCCCAATCAAACTCCCTCTCTCTCTCCTTCCCTCTCACCTCCCTTCCTCTCTCCCAAGAACCCTCCTTTTCCCTCTCCTCCAAAACCTCACCCCATGACTCCACGAAGCTCCCCTTCAAATACTCCAACGCCCTGGTGGTCTCTTTCCCCGTTGGGTCGCCGCCGCAGCCGATGGACATGGTGCTAGACACCGGCAGCCAACTCTCTTGGGTTCAATGCCACGGAAAAGTAAGGACAGAGTCAGTGATTAATCGGTTTGACCCTTATCTCTCCTCCACTTTCTCTCACCTCCCCTGCAACAACTCTCTCTGCAAACCCCGAATTCCCGATTTTACCCTTCCTACTTCTTGTGACCGACATCGCCACTGCCACTACTCATACTTCTACGGCGATGGGACCTTGGCTGAGGGTAATCTCGTCACTGAAAAAATCACCTTCTCTAATTCCTTAACTACCCTCTCCCTCGTTCTCGGCTGCGCTACGACCTCCACCGAAAACAGGGGTATGTTGGGAATGAATAAGGGACGCCTCTCCTTCATCTCCCAGGCCAGAATAACCAAGTTTTCTTATTGTGTCCCGGATCGAATCGGGTCGGATCCAACCGGGTTGTTCTACCTCGGAGACAACCCAAATTCGGGTAAATTCAAATACGTCAAAATGTTGACTTTCACCAAAAGTCGACGCTCCACGAATCTTGACAAGTTGGCCTACACCCTCCCAATGAATGGGATTAGAATAGGCAAAAACCACCTCAACATCTCGCGGGCCGTTTTCAAACCGGACCCATCTGGCGCCGGTCAGACCATGATCGACTCCGGCTCGGATTTGACTTATTTGGTAGATGAAGCTTACAGCAAGGTTAGAGCAGAGATAGTGAGATTAGTGGGGCCCATGATGAAAAAAGCATATGAATACGCCGCCGTCGACATGTGTTTCGACGGCGCAGAGGCGGCGGTGGTGGGTCGGAGAATTGGCAACATGTGGTTCAAGTTTGAGAATGGGGTGGAGATATTGGTCGGGAAAGGAGAAGGGTTATTGACGGAAGTGGAAAAAGGAGTGAAGTGCGTGGGGATCGGACGGTCAGATAGACTTGTGATTGAGAGTAATATAATCGGGATCATTCATCAAAAGAATATGTGGGTGGAGTACGATCTGGCCAATAAGAGAGTTGGGTTCGGTGAAGCTGAGTGTAGTAGATTGAAGGCCTGGTGA

Coding sequence (CDS)

ATGCCTCTCTCTCTCCTCTTTCTGGCCCTATTCTCTCTCCTATTCTCCCAATCAAACTCCCTCTCTCTCTCCTTCCCTCTCACCTCCCTTCCTCTCTCCCAAGAACCCTCCTTTTCCCTCTCCTCCAAAACCTCACCCCATGACTCCACGAAGCTCCCCTTCAAATACTCCAACGCCCTGGTGGTCTCTTTCCCCGTTGGGTCGCCGCCGCAGCCGATGGACATGGTGCTAGACACCGGCAGCCAACTCTCTTGGGTTCAATGCCACGGAAAAGTAAGGACAGAGTCAGTGATTAATCGGTTTGACCCTTATCTCTCCTCCACTTTCTCTCACCTCCCCTGCAACAACTCTCTCTGCAAACCCCGAATTCCCGATTTTACCCTTCCTACTTCTTGTGACCGACATCGCCACTGCCACTACTCATACTTCTACGGCGATGGGACCTTGGCTGAGGGTAATCTCGTCACTGAAAAAATCACCTTCTCTAATTCCTTAACTACCCTCTCCCTCGTTCTCGGCTGCGCTACGACCTCCACCGAAAACAGGGGTATGTTGGGAATGAATAAGGGACGCCTCTCCTTCATCTCCCAGGCCAGAATAACCAAGTTTTCTTATTGTGTCCCGGATCGAATCGGGTCGGATCCAACCGGGTTGTTCTACCTCGGAGACAACCCAAATTCGGGTAAATTCAAATACGTCAAAATGTTGACTTTCACCAAAAGTCGACGCTCCACGAATCTTGACAAGTTGGCCTACACCCTCCCAATGAATGGGATTAGAATAGGCAAAAACCACCTCAACATCTCGCGGGCCGTTTTCAAACCGGACCCATCTGGCGCCGGTCAGACCATGATCGACTCCGGCTCGGATTTGACTTATTTGGTAGATGAAGCTTACAGCAAGGTTAGAGCAGAGATAGTGAGATTAGTGGGGCCCATGATGAAAAAAGCATATGAATACGCCGCCGTCGACATGTGTTTCGACGGCGCAGAGGCGGCGGTGGTGGGTCGGAGAATTGGCAACATGTGGTTCAAGTTTGAGAATGGGGTGGAGATATTGGTCGGGAAAGGAGAAGGGTTATTGACGGAAGTGGAAAAAGGAGTGAAGTGCGTGGGGATCGGACGGTCAGATAGACTTGTGATTGAGAGTAATATAATCGGGATCATTCATCAAAAGAATATGTGGGTGGAGTACGATCTGGCCAATAAGAGAGTTGGGTTCGGTGAAGCTGAGTGTAGTAGATTGAAGGCCTGGTGA

Protein sequence

MPLSLLFLALFSLLFSQSNSLSLSFPLTSLPLSQEPSFSLSSKTSPHDSTKLPFKYSNALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKVRTESVINRFDPYLSSTFSHLPCNNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVLGCATTSTENRGMLGMNKGRLSFISQARITKFSYCVPDRIGSDPTGLFYLGDNPNSGKFKYVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMIDSGSDLTYLVDEAYSKVRAEIVRLVGPMMKKAYEYAAVDMCFDGAEAAVVGRRIGNMWFKFENGVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRVGFGEAECSRLKAW
BLAST of CmaCh01G003360 vs. Swiss-Prot
Match: PCS1L_ARATH (Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1)

HSP 1 Score: 239.6 bits (610), Expect = 6.2e-62
Identity = 153/428 (35.75%), Postives = 230/428 (53.74%), Query Frame = 1

Query: 16  SQSNSLSLSFP-LTSLPLSQEPSFSLSSKTSPHD---STKLPFKYSNALVVSFPVGSPPQ 75
           S S+S S SF   +S   SQ     L ++ +P D   + KL F ++  L V+  VG+PPQ
Sbjct: 25  SSSSSSSFSFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLHFHHNVTLTVTLTVGTPPQ 84

Query: 76  PMDMVLDTGSQLSWVQCHGKVRTESVINRFDPYLSSTFSHLPCNNSLCKPRIPDFTLPTS 135
            + MV+DTGS+LSW++C+ +    + +N FDP  SS++S +PC++  C+ R  DF +P S
Sbjct: 85  NISMVIDTGSELSWLRCN-RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPAS 144

Query: 136 CDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVLGCATT--------STENRG 195
           CD  + CH +  Y D + +EGNL  E   F NS    +L+ GC  +         T+  G
Sbjct: 145 CDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTG 204

Query: 196 MLGMNKGRLSFISQARITKFSYCVPDRIGSDPTGLFYLGDNPNSGKFKYVKMLTFTK-SR 255
           +LGMN+G LSFISQ    KFSYC+      D  G   LGD+     F ++  L +T   R
Sbjct: 205 LLGMNRGSLSFISQMGFPKFSYCISGT--DDFPGFLLLGDS----NFTWLTPLNYTPLIR 264

Query: 256 RSTNL---DKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMIDSGSDLTYLVDEAY 315
            ST L   D++AYT+ + GI++    L I ++V  PD +GAGQTM+DSG+  T+L+   Y
Sbjct: 265 ISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVY 324

Query: 316 SKVRAEIVRLVGPMMKKAYE------YAAVDMCFDGAEAAV---VGRRIGNMWFKFENGV 375
           + +R+  +     ++   YE         +D+C+  +   +   +  R+  +   FE G 
Sbjct: 325 TALRSHFLNRTNGIL-TVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFE-GA 384

Query: 376 EILVGKGEGLLTEV------EKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKR 413
           EI V  G+ LL  V         V C   G SD + +E+ +IG  HQ+NMW+E+DL   R
Sbjct: 385 EIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSR 442

BLAST of CmaCh01G003360 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 156.4 bits (394), Expect = 6.9e-37
Identity = 111/360 (30.83%), Postives = 171/360 (47.50%), Query Frame = 1

Query: 61  VVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKVRTESVINR-FDPYLSSTFSHLPCNNSLC 120
           +++  +G+P QP   ++DTGS L W QC    +  +     F+P  SS+FS LPC++ LC
Sbjct: 96  LMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLC 155

Query: 121 KPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVLGCATTST 180
           +       L +    +  C Y+Y YGDG+  +G++ TE +TF  S++  ++  GC   + 
Sbjct: 156 Q------ALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFG-SVSIPNITFGCGENNQ 215

Query: 181 -----ENRGMLGMNKGRLSFISQARITKFSYCVPDRIGSDPTGLFYLGDNPNSGKFKYVK 240
                   G++GM +G LS  SQ  +TKFSYC+   IGS       LG   NS       
Sbjct: 216 GFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTP-IGSSTPSNLLLGSLANSVTAGSPN 275

Query: 241 MLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDP-SGAGQTMIDSGSDLTY 300
                 S+  T      Y + +NG+ +G   L I  + F  +  +G G  +IDSG+ LTY
Sbjct: 276 TTLIQSSQIPT-----FYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTY 335

Query: 301 LVDEAYSKVRAEIVRLVG-PMMKKAYEYAAVDMCFDGAEAAVVGRRIGNMWFKFENGVEI 360
            V+ AY  VR E +  +  P++  +   +  D+CF    +     +I      F+ G   
Sbjct: 336 FVNNAYQSVRQEFISQINLPVVNGS--SSGFDLCFQ-TPSDPSNLQIPTFVMHFDGG--D 395

Query: 361 LVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRVGFGEAEC 413
           L    E        G+ C+ +G S +     +I G I Q+NM V YD  N  V F  A+C
Sbjct: 396 LELPSENYFISPSNGLICLAMGSSSQ---GMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of CmaCh01G003360 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 143.7 bits (361), Expect = 4.7e-33
Identity = 97/352 (27.56%), Postives = 162/352 (46.02%), Query Frame = 1

Query: 66  VGSPPQPMDMVLDTGSQLSWVQCHGKVRTESVINR-FDPYLSSTFSHLPCNNSLCKPRIP 125
           VG+P + M +VLDTGS ++W+QC          +  F+P  SST+  L C+   C     
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCS---- 227

Query: 126 DFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVLGCATTS----TE 185
              L TS  R   C Y   YGDG+   G L T+ +TF NS    ++ LGC   +    T 
Sbjct: 228 --LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTG 287

Query: 186 NRGMLGMNKGRLSFISQARITKFSYCVPDRIGSDPTGLFYLGDNPNSGKFKYVKMLTFTK 245
             G+LG+  G LS  +Q + T FSYC+ DR     + L +       G      +     
Sbjct: 288 AAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLL----- 347

Query: 246 SRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMIDSGSDLTYLVDEAYS 305
             R+  +D   Y + ++G  +G   + +  A+F  D SG+G  ++D G+ +T L  +AY+
Sbjct: 348 --RNKKIDTFYY-VGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYN 407

Query: 306 KVRAEIVRLVGPMMKKAYEYAAVDMCFDGAEAAVVGRRIGNMWFKFENGVEILVGKGEGL 365
            +R   ++L   + K +   +  D C+D +  + V  ++  + F F  G  + +     L
Sbjct: 408 SLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTV--KVPTVAFHFTGGKSLDLPAKNYL 467

Query: 366 LTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRVGFGEAEC 413
           +   + G  C     +   +   +IIG + Q+   + YDL+   +G    +C
Sbjct: 468 IPVDDSGTFCFAFAPTSSSL---SIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmaCh01G003360 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 143.7 bits (361), Expect = 4.7e-33
Identity = 108/360 (30.00%), Postives = 177/360 (49.17%), Query Frame = 1

Query: 61  VVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKVRTESVINR-FDPYLSSTFSHLPCNNSLC 120
           +++  +G+P      ++DTGS L W QC    +  S     F+P  SS+FS LPC +  C
Sbjct: 97  LMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYC 156

Query: 121 KPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVLGCATTST 180
           +       LP+    +  C Y+Y YGDG+  +G + TE  TF  S +  ++  GC   + 
Sbjct: 157 QD------LPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETS-SVPNIAFGCGEDNQ 216

Query: 181 -----ENRGMLGMNKGRLSFISQARITKFSYCVPDRIGSDPTGLFYLGDNPNSGKFKYVK 240
                   G++GM  G LS  SQ  + +FSYC+     S P+ L  LG +  SG  +   
Sbjct: 217 GFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTL-ALG-SAASGVPEGSP 276

Query: 241 MLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMIDSGSDLTYL 300
             T   S    +L+   Y + + GI +G ++L I  + F+    G G  +IDSG+ LTYL
Sbjct: 277 STTLIHS----SLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYL 336

Query: 301 VDEAYSKVRAEIVRLVG-PMMKKAYEYAAVDMCF-DGAEAAVVGRRIGNMWFKFENGVEI 360
             +AY+ V       +  P + ++   + +  CF   ++ + V  ++  +  +F+ GV +
Sbjct: 337 PQDAYNAVAQAFTDQINLPTVDES--SSGLSTCFQQPSDGSTV--QVPEISMQFDGGV-L 396

Query: 361 LVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRVGFGEAEC 413
            +G+ + +L    +GV C+ +G S +L I  +I G I Q+   V YDL N  V F   +C
Sbjct: 397 NLGE-QNILISPAEGVICLAMGSSSQLGI--SIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of CmaCh01G003360 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 136.0 bits (341), Expect = 9.7e-31
Identity = 109/357 (30.53%), Postives = 171/357 (47.90%), Query Frame = 1

Query: 66  VGSPPQPMDMVLDTGSQLSWVQCHGKVRTESVINR-FDPYLSSTFSHLPCNNSLCKPRIP 125
           VG+P + + MVLDTGS + W+QC    R  S  +  FDP  S T++ +PC++  C+ R+ 
Sbjct: 148 VGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCR-RLD 207

Query: 126 DFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVLGCATTS----TE 185
                T   R + C Y   YGDG+   G+  TE +TF  +     + LGC   +      
Sbjct: 208 SAGCNT---RRKTCLYQVSYGDGSFTVGDFSTETLTFRRN-RVKGVALGCGHDNEGLFVG 267

Query: 186 NRGMLGMNKGRLSFISQAR---ITKFSYCVPDRIGSDPTGLFYLGDNPNSGKFKYVKMLT 245
             G+LG+ KG+LSF  Q       KFSYC+ DR  S        G+   S      ++  
Sbjct: 268 AAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVS------RIAR 327

Query: 246 FTKSRRSTNLDKLAYTLPMNGIRIGKNHL-NISRAVFKPDPSGAGQTMIDSGSDLTYLVD 305
           FT    +  LD   Y + + GI +G   +  ++ ++FK D  G G  +IDSG+ +T L+ 
Sbjct: 328 FTPLLSNPKLDTF-YYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIR 387

Query: 306 EAYSKVRAEIVRLVGPMMKKAYEYAAVDMCFDGAEAAVVGRRIGNMWFKFENGVEILVGK 365
            AY  +R +  R+    +K+A +++  D CFD +    V  ++  +   F  G ++ +  
Sbjct: 388 PAYIAMR-DAFRVGAKTLKRAPDFSLFDTCFDLSNMNEV--KVPTVVLHF-RGADVSLPA 447

Query: 366 GEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRVGFGEAECS 414
              L+     G  C     +   +   +IIG I Q+   V YDLA+ RVGF    C+
Sbjct: 448 TNYLIPVDTNGKFCFAFAGT---MGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGCA 485

BLAST of CmaCh01G003360 vs. TrEMBL
Match: A0A0L9UW35_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan07g076900 PE=3 SV=1)

HSP 1 Score: 492.3 bits (1266), Expect = 5.9e-136
Identity = 261/426 (61.27%), Postives = 318/426 (74.65%), Query Frame = 1

Query: 2   PLSLLFLALFSLLFSQSNS-----LSLSFPLTSLPLSQ----EPSFSLSSKTSPHDSTKL 61
           P+SL  L    LLFS S+S     +S SFPL SLP+S     + +  L S +S   + KL
Sbjct: 6   PVSLFSLLCTVLLFSASSSAKHDSVSFSFPLRSLPISAGKPLKTNPKLRSLSSASYNVKL 65

Query: 62  PFKYSNALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKVRTESVINRFDPYLSSTFSHL 121
           PFKYS ALVVS P+G+PPQ   MVLDTGSQLSW+QC  K  T   ++ FDP LSS+F  +
Sbjct: 66  PFKYSMALVVSLPIGTPPQHQQMVLDTGSQLSWIQCRNK--TPPTVS-FDPSLSSSFYVI 125

Query: 122 PCNNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVL 181
           PC + LCKPR+PDFTLPT+CD++R CHYSYFY DGT AEGNLV EK+TFS S TT  L L
Sbjct: 126 PCTHPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLTFSPSQTTPPLTL 185

Query: 182 GCATTSTENRGMLGMNKGRLSFISQARITKFSYCVPDRI---GSDPTGLFYLGDNPNSGK 241
           GCAT S +  G+LGMN GRLSF SQA+ITKFSYCVP R+   G+ PTG FYLG+NPNS +
Sbjct: 186 GCATESRDAGGILGMNLGRLSFPSQAKITKFSYCVPTRVSGSGNLPTGSFYLGNNPNSAR 245

Query: 242 FKYVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMIDSGS 301
           F+YV MLTF++S+R  NLD LAYT+PM GIRIG   LNI+ +VF+PD  G+GQTMIDSGS
Sbjct: 246 FRYVSMLTFSQSQRMPNLDPLAYTVPMQGIRIGGKRLNINPSVFRPDAGGSGQTMIDSGS 305

Query: 302 DLTYLVDEAYSKVRAEIVRLVGPMMKKAYEYAAV-DMCFDGAEAAVVGRRIGNMWFKFEN 361
           + T+LVDEAY +VR E+VR+VGP +KK Y Y  V DMCFDG+    +GR IG++  +FE 
Sbjct: 306 EFTFLVDEAYDRVREEVVRVVGPRIKKGYVYGGVADMCFDGSARESIGRLIGDVVLEFEK 365

Query: 362 GVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRVGFG 415
           GVEI+V K E +L +V  GV CVGIGRS+RL   SNIIG IHQ+N+WVE+DLAN R+GFG
Sbjct: 366 GVEIVVPK-ERVLADVGGGVHCVGIGRSERLGAASNIIGNIHQQNLWVEFDLANHRIGFG 425

BLAST of CmaCh01G003360 vs. TrEMBL
Match: A0A0S3RKY8_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.03G096000 PE=3 SV=1)

HSP 1 Score: 492.3 bits (1266), Expect = 5.9e-136
Identity = 261/426 (61.27%), Postives = 318/426 (74.65%), Query Frame = 1

Query: 2   PLSLLFLALFSLLFSQSNS-----LSLSFPLTSLPLSQ----EPSFSLSSKTSPHDSTKL 61
           P+SL  L    LLFS S+S     +S SFPL SLP+S     + +  L S +S   + KL
Sbjct: 6   PVSLFSLLCTVLLFSASSSAKHDSVSFSFPLRSLPISAGKPLKTNPKLRSLSSASYNVKL 65

Query: 62  PFKYSNALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKVRTESVINRFDPYLSSTFSHL 121
           PFKYS ALVVS P+G+PPQ   MVLDTGSQLSW+QC  K  T   ++ FDP LSS+F  +
Sbjct: 66  PFKYSMALVVSLPIGTPPQHQQMVLDTGSQLSWIQCRNK--TPPTVS-FDPSLSSSFYVI 125

Query: 122 PCNNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVL 181
           PC + LCKPR+PDFTLPT+CD++R CHYSYFY DGT AEGNLV EK+TFS S TT  L L
Sbjct: 126 PCTHPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLTFSPSQTTPPLTL 185

Query: 182 GCATTSTENRGMLGMNKGRLSFISQARITKFSYCVPDRI---GSDPTGLFYLGDNPNSGK 241
           GCAT S +  G+LGMN GRLSF SQA+ITKFSYCVP R+   G+ PTG FYLG+NPNS +
Sbjct: 186 GCATESRDAGGILGMNLGRLSFPSQAKITKFSYCVPTRVSGSGNLPTGSFYLGNNPNSAR 245

Query: 242 FKYVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMIDSGS 301
           F+YV MLTF++S+R  NLD LAYT+PM GIRIG   LNI+ +VF+PD  G+GQTMIDSGS
Sbjct: 246 FRYVSMLTFSQSQRMPNLDPLAYTVPMQGIRIGGKRLNINPSVFRPDAGGSGQTMIDSGS 305

Query: 302 DLTYLVDEAYSKVRAEIVRLVGPMMKKAYEYAAV-DMCFDGAEAAVVGRRIGNMWFKFEN 361
           + T+LVDEAY +VR E+VR+VGP +KK Y Y  V DMCFDG+    +GR IG++  +FE 
Sbjct: 306 EFTFLVDEAYDRVREEVVRVVGPRIKKGYVYGGVADMCFDGSARESIGRLIGDVVLEFEK 365

Query: 362 GVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRVGFG 415
           GVEI+V K E +L +V  GV CVGIGRS+RL   SNIIG IHQ+N+WVE+DLAN R+GFG
Sbjct: 366 GVEIVVPK-ERVLADVGGGVHCVGIGRSERLGAASNIIGNIHQQNLWVEFDLANHRIGFG 425

BLAST of CmaCh01G003360 vs. TrEMBL
Match: B9T2R1_RICCO (Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0593500 PE=3 SV=1)

HSP 1 Score: 489.6 bits (1259), Expect = 3.9e-135
Identity = 260/434 (59.91%), Postives = 311/434 (71.66%), Query Frame = 1

Query: 5   LLFLALFSLLFSQSN-----------SLSLSFPLTSLPLSQEPSFSLSSKTSPHDSTKLP 64
           LLFL LF    S S+           S S SFPL SLP S     S   ++S    TK P
Sbjct: 7   LLFLFLFFTFVSSSSPNPERTHLNTSSFSFSFPLRSLPASSPSKPSSPFRSSFVAQTKQP 66

Query: 65  -------FKYSNALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKV--RTESVINRFDPY 124
                  FKYS AL+VS P+G+PPQ   MVLDTGSQLSW+QCH K   +       FDP 
Sbjct: 67  SYNYRSSFKYSMALIVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPS 126

Query: 125 LSSTFSHLPCNNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNS 184
           LSS+FS LPCN+ LCKPRIPDFTLPT+CD++R CHYSYFY DGT AEG+LV EKITFS+S
Sbjct: 127 LSSSFSVLPCNHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSS 186

Query: 185 LTTLSLVLGCATTSTENRGMLGMNKGRLSFISQARITKFSYCVPDR---IGSDPTGLFYL 244
            +T  L+LGCA  ST+ +G+LGMN GR SF SQA+I+KFSYCVP R    G   TG FYL
Sbjct: 187 QSTPPLILGCAEASTDEKGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSFYL 246

Query: 245 GDNPNSGKFKYVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAG 304
           G+NPNSG+F+Y+ +LTFT S+RS NLD LAYT+PM GIR+G   LNIS  +F+PDPSGAG
Sbjct: 247 GNNPNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAG 306

Query: 305 QTMIDSGSDLTYLVDEAYSKVRAEIVRLVGPMMKKAYEYAAV-DMCFDGAEAAVVGRRIG 364
           QT+IDSGS+ TYLVDEAY+KVR E+VRLVGP +KK Y Y  V DMCFDG     +GR IG
Sbjct: 307 QTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSDMCFDG-NPMEIGRLIG 366

Query: 365 NMWFKFENGVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDL 415
           NM F+FE GVEI++ K   +L +V  GV C+GIGRS+ L   SNIIG  HQ+N+WVEYDL
Sbjct: 367 NMVFEFEKGVEIVIDKWR-VLADVGGGVHCIGIGRSEMLGAASNIIGNFHQQNLWVEYDL 426

BLAST of CmaCh01G003360 vs. TrEMBL
Match: U5FLT4_POPTR (Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0017s02710g PE=3 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 5.0e-135
Identity = 268/442 (60.63%), Postives = 322/442 (72.85%), Query Frame = 1

Query: 1   MPLSLLFLALFSLLFS------QSNSLSLSFPLTSLPLSQE------PSF---------- 60
           M L  LFL L S   S      +++SLS SFPLTSLP S +      PSF          
Sbjct: 1   MLLFYLFLLLTSCSLSAQETQHKNDSLSFSFPLTSLPRSPQASPNFYPSFISQTKKASTL 60

Query: 61  -SLSSKTSPHDSTKLPFKYSNALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKV-RTES 120
            S S  +SP++  +  FKYS  L+VS P+G+PPQ   M+LDTGSQLSW+QCH KV R   
Sbjct: 61  KSSSFSSSPYNY-RSGFKYSMILLVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPP 120

Query: 121 VINRFDPYLSSTFSHLPCNNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVT 180
             + FDP LSS+FS LPCN+ LCKPRIPDFTLPTSCD++R CHYSYFY DGTLAEGNLV 
Sbjct: 121 PSSVFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVR 180

Query: 181 EKITFSNSLTTLSLVLGCATTSTENRGMLGMNKGRLSFISQARITKFSYCVPDRI---GS 240
           EKITFS S +T  L+LGCA  S++ +G+LGMN GRLSF SQA++TKFSYCVP R    G 
Sbjct: 181 EKITFSRSQSTPPLILGCAEESSDAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGF 240

Query: 241 DPTGLFYLGDNPNSGKFKYVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVF 300
            PTG FYLG+NPNSG F+Y+ +LTF++S+R  NLD LAYT+ M GIRIG   LNI  + F
Sbjct: 241 TPTGSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAF 300

Query: 301 KPDPSGAGQTMIDSGSDLTYLVDEAYSKVRAEIVRLVGPMMKKAYEYAAV-DMCFDGAEA 360
           +PDPSGAGQTMIDSGS+ TYLVDEAY+KVR E+VRLVG  +KK Y Y  V DMCF+G  A
Sbjct: 301 RPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDMCFNG-NA 360

Query: 361 AVVGRRIGNMWFKFENGVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQK 415
             +GR IGNM F+F+ GVEI+V K E +L +V  GV CVGIGRS+ L   SNIIG  HQ+
Sbjct: 361 IEIGRLIGNMVFEFDKGVEIVVEK-ERVLADVGGGVHCVGIGRSEMLGAASNIIGNFHQQ 420

BLAST of CmaCh01G003360 vs. TrEMBL
Match: V7D2B7_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_001G229200g PE=3 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 6.6e-135
Identity = 265/435 (60.92%), Postives = 318/435 (73.10%), Query Frame = 1

Query: 3   LSLLFLALFSLLFSQSNSLSLSFPLTSLPLS--------QEPSFSLSSKTSPHD------ 62
           LSLLF  L SL  + + S SLSFPLTSLPLS        ++   + S++T+ H       
Sbjct: 13  LSLLFSPLLSLQLNHT-SFSLSFPLTSLPLSTNTASKMLRDSLIAYSNRTTNHTRLTSPS 72

Query: 63  ---STKLPFKYSNALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKV-RTESVINRFDPY 122
              + +L FKYS  L+V+ P+G+PPQ   M+LDTGSQLSW+QCH K  +       FDP 
Sbjct: 73  SPYNYRLTFKYSMVLIVNLPIGTPPQVQPMLLDTGSQLSWIQCHRKPPKVPPPTASFDPS 132

Query: 123 LSSTFSHLPCNNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNS 182
           LSSTFS LPC + +CKPRIPDFTLPTSCD++R CHYSYFY DGT AEGNLV EK TFS S
Sbjct: 133 LSSTFSILPCTHPVCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRS 192

Query: 183 LTTLSLVLGCATTSTENRGMLGMNKGRLSFISQARITKFSYCVP---DRIGSDPTGLFYL 242
           L T  L+LGCAT ST+ RG+LGMN+GRLSF SQ++ITKFSYCVP    R GS  +G FYL
Sbjct: 193 LFTPPLILGCATESTDPRGILGMNRGRLSFASQSKITKFSYCVPTRETRPGSTSSGSFYL 252

Query: 243 GDNPNSGKFKYVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAG 302
           G NPNS +FK+V+MLTFT+S+R  NLD LAYT+ + GIRIG   LNIS AVF+ D  G+G
Sbjct: 253 GHNPNSLRFKFVQMLTFTQSQRMPNLDPLAYTVALLGIRIGGRKLNISPAVFRADAGGSG 312

Query: 303 QTMIDSGSDLTYLVDEAYSKVRAEIVRLVGPMMKKAYEYAAV-DMCFDGAEAAVVGRRIG 362
           QTMIDSGSD TYLV+EAY KVRA++VR VGP MKK Y Y+ V DMCFDG  A  +GR IG
Sbjct: 313 QTMIDSGSDFTYLVNEAYDKVRAQVVRAVGPRMKKDYVYSGVADMCFDG-NAVEIGRLIG 372

Query: 363 NMWFKFENGVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDL 416
           +M F+FE GVEI++ K E +L  VE GV CVGIG SD+L   SNIIG  HQ+N+WVE+DL
Sbjct: 373 DMVFEFEKGVEIVIPK-ERVLASVEGGVHCVGIGNSDKLGAASNIIGNFHQQNLWVEFDL 432

BLAST of CmaCh01G003360 vs. TAIR10
Match: AT1G66180.1 (AT1G66180.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 475.7 bits (1223), Expect = 2.9e-134
Identity = 251/424 (59.20%), Postives = 305/424 (71.93%), Query Frame = 1

Query: 6   LFLALFSLLFSQSNSLSLSFPLTSLPLSQEPSF-----SLSSKTSPHDST-----KLPFK 65
           LF   F    S S SLSL  PLTSLP+S   +      SL S+ +P  S+     +  FK
Sbjct: 8   LFFFFFLNYVSLSTSLSLHLPLTSLPISTTTNSHRFTTSLLSRKNPSPSSPPYNFRSRFK 67

Query: 66  YSNALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKVRTESVINRFDPYLSSTFSHLPCN 125
           YS AL++S P+G+PPQ   MVLDTGSQLSW+QCH K         FDP LSS+FS LPC+
Sbjct: 68  YSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDPSLSSSFSTLPCS 127

Query: 126 NSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVLGCA 185
           + LCKPRIPDFTLPTSCD +R CHYSYFY DGT AEGNLV EKITFSN+  T  L+LGCA
Sbjct: 128 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCA 187

Query: 186 TTSTENRGMLGMNKGRLSFISQARITKFSYCVP---DRIGSDPTGLFYLGDNPNSGKFKY 245
           T S+++RG+LGMN+GRLSF+SQA+I+KFSYC+P   +R G  PTG FYLGDNPNS  FKY
Sbjct: 188 TESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKY 247

Query: 246 VKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMIDSGSDLT 305
           V +LTF +S+R  NLD LAYT+PM GIR G   LNIS +VF+PD  G+GQTM+DSGS+ T
Sbjct: 248 VSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSEFT 307

Query: 306 YLVDEAYSKVRAEIVRLVGPMMKKAYEY-AAVDMCFDGAEAAVVGRRIGNMWFKFENGVE 365
           +LVD AY KVRAEI+  VG  +KK Y Y    DMCFDG   A++ R IG++ F F  GVE
Sbjct: 308 HLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDG-NVAMIPRLIGDLVFVFTRGVE 367

Query: 366 ILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRVGFGEAE 416
           ILV K E +L  V  G+ CVGIGRS  L   SNIIG +HQ+N+WVE+D+ N+RVGF +A+
Sbjct: 368 ILVPK-ERVLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKAD 427

BLAST of CmaCh01G003360 vs. TAIR10
Match: AT5G37540.1 (AT5G37540.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 469.2 bits (1206), Expect = 2.7e-132
Identity = 249/430 (57.91%), Postives = 308/430 (71.63%), Query Frame = 1

Query: 3   LSLLFLALFSLLFSQSNSLSLSFPLTSL---PLSQEPSF--SLSSKTSPHDST-----KL 62
           L + F   +S+  S S+SLSL FPLTSL   P +   SF  SL S+ +P   +     + 
Sbjct: 13  LYIFFFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTFRS 72

Query: 63  PFKYSNALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKVRTESV---INRFDPYLSSTF 122
             KYS AL++S P+G+P Q  ++VLDTGSQLSW+QCH K   + +      FDP LSS+F
Sbjct: 73  NIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSF 132

Query: 123 SHLPCNNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLS 182
           S LPC++ LCKPRIPDFTLPTSCD +R CHYSYFY DGT AEGNLV EK TFSNS TT  
Sbjct: 133 SDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPP 192

Query: 183 LVLGCATTSTENRGMLGMNKGRLSFISQARITKFSYCVP---DRIGSDPTGLFYLGDNPN 242
           L+LGCA  ST+ +G+LGMN GRLSFISQA+I+KFSYC+P   +R G   TG FYLGDNPN
Sbjct: 193 LILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDNPN 252

Query: 243 SGKFKYVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMID 302
           S  FKYV +LTF +S+R  NLD LAYT+P+ GIRIG+  LNI  +VF+PD  G+GQTM+D
Sbjct: 253 SRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVD 312

Query: 303 SGSDLTYLVDEAYSKVRAEIVRLVGPMMKKAYEY-AAVDMCFDGAEAAVVGRRIGNMWFK 362
           SGS+ T+LVD AY KV+ EIVRLVG  +KK Y Y +  DMCFDG  +  +GR IG++ F+
Sbjct: 313 SGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVFE 372

Query: 363 FENGVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRV 416
           F  GVEILV K + LL  V  G+ CVGIGRS  L   SNIIG +HQ+N+WVE+D+ N+RV
Sbjct: 373 FGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRV 432

BLAST of CmaCh01G003360 vs. TAIR10
Match: AT5G02190.1 (AT5G02190.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 239.6 bits (610), Expect = 3.5e-63
Identity = 153/428 (35.75%), Postives = 230/428 (53.74%), Query Frame = 1

Query: 16  SQSNSLSLSFP-LTSLPLSQEPSFSLSSKTSPHD---STKLPFKYSNALVVSFPVGSPPQ 75
           S S+S S SF   +S   SQ     L ++ +P D   + KL F ++  L V+  VG+PPQ
Sbjct: 25  SSSSSSSFSFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLHFHHNVTLTVTLTVGTPPQ 84

Query: 76  PMDMVLDTGSQLSWVQCHGKVRTESVINRFDPYLSSTFSHLPCNNSLCKPRIPDFTLPTS 135
            + MV+DTGS+LSW++C+ +    + +N FDP  SS++S +PC++  C+ R  DF +P S
Sbjct: 85  NISMVIDTGSELSWLRCN-RSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPAS 144

Query: 136 CDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVLGCATT--------STENRG 195
           CD  + CH +  Y D + +EGNL  E   F NS    +L+ GC  +         T+  G
Sbjct: 145 CDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTG 204

Query: 196 MLGMNKGRLSFISQARITKFSYCVPDRIGSDPTGLFYLGDNPNSGKFKYVKMLTFTK-SR 255
           +LGMN+G LSFISQ    KFSYC+      D  G   LGD+     F ++  L +T   R
Sbjct: 205 LLGMNRGSLSFISQMGFPKFSYCISGT--DDFPGFLLLGDS----NFTWLTPLNYTPLIR 264

Query: 256 RSTNL---DKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMIDSGSDLTYLVDEAY 315
            ST L   D++AYT+ + GI++    L I ++V  PD +GAGQTM+DSG+  T+L+   Y
Sbjct: 265 ISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVY 324

Query: 316 SKVRAEIVRLVGPMMKKAYE------YAAVDMCFDGAEAAV---VGRRIGNMWFKFENGV 375
           + +R+  +     ++   YE         +D+C+  +   +   +  R+  +   FE G 
Sbjct: 325 TALRSHFLNRTNGIL-TVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFE-GA 384

Query: 376 EILVGKGEGLLTEV------EKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKR 413
           EI V  G+ LL  V         V C   G SD + +E+ +IG  HQ+NMW+E+DL   R
Sbjct: 385 EIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSR 442

BLAST of CmaCh01G003360 vs. TAIR10
Match: AT2G39710.1 (AT2G39710.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 234.6 bits (597), Expect = 1.1e-61
Identity = 157/439 (35.76%), Postives = 231/439 (52.62%), Query Frame = 1

Query: 15  FSQSNSLSLSFPLTSLPLS---QEPSFSLSSKTSPHDST-KLPFKYSNALVVSFPVGSPP 74
           F + + L L FPLT    S   Q   FSL ++  P  S+ KL F+++  L V+  VG PP
Sbjct: 16  FLRISVLLLIFPLTFCKTSSTNQTLLFSLKTQKLPQSSSDKLSFRHNVTLTVTLAVGDPP 75

Query: 75  QPMDMVLDTGSQLSWVQCHGKVRTESVINRFDPYLSSTFSHLPCNNSLCKPRIPDFTLPT 134
           Q + MVLDTGS+LSW+ C       SV   F+P  SST+S +PC++ +C+ R  D  +P 
Sbjct: 76  QNISMVLDTGSELSWLHCKKSPNLGSV---FNPVSSSTYSPVPCSSPICRTRTRDLPIPA 135

Query: 135 SCDRHRH-CHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVLGCATTS--------TEN 194
           SCD   H CH +  Y D T  EGNL  E      S+T    + GC  +          ++
Sbjct: 136 SCDPKTHLCHVAISYADATSIEGNLAHETFVIG-SVTRPGTLFGCMDSGLSSNSEEDAKS 195

Query: 195 RGMLGMNKGRLSFISQARITKFSYCVPDRIGSDPTGLFYLGDNPNS--GKFKYVKMLTFT 254
            G++GMN+G LSF++Q   +KFSYC+    GSD +G   LGD   S  G  +Y  ++   
Sbjct: 196 TGLMGMNRGSLSFVNQLGFSKFSYCIS---GSDSSGFLLLGDASYSWLGPIQYTPLVL-- 255

Query: 255 KSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMIDSGSDLTYLVDEAY 314
           +S      D++AYT+ + GIR+G   L++ ++VF PD +GAGQTM+DSG+  T+L+   Y
Sbjct: 256 QSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVY 315

Query: 315 SKVRAE-------IVRLVGP-------MMKKAYEYAA-----------VDMCFDGAEAAV 374
           + ++ E       ++RLV          M   Y+  +           V + F GAE +V
Sbjct: 316 TALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFRGAEMSV 375

Query: 375 VGRRIGNMWFKFENGVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNM 413
            G++              L+ +  G  +E ++ V C   G SD L IE+ +IG  HQ+N+
Sbjct: 376 SGQK--------------LLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNV 431

BLAST of CmaCh01G003360 vs. TAIR10
Match: AT3G61820.1 (AT3G61820.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 145.2 bits (365), Expect = 9.0e-35
Identity = 118/360 (32.78%), Postives = 170/360 (47.22%), Query Frame = 1

Query: 66  VGSPPQPMDMVLDTGSQLSWVQCHG-KVRTESVINRFDPYLSSTFSHLPCNNSLCKPRIP 125
           VG+P   + MVLDTGS + W+QC   K         FDP  S TF+ +PC + LC+ R+ 
Sbjct: 141 VGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCR-RLD 200

Query: 126 DFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVLGCATTS----TE 185
           D +   +  R + C Y   YGDG+  EG+  TE +TF  +     + LGC   +      
Sbjct: 201 DSSECVT-RRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVD-HVPLGCGHDNEGLFVG 260

Query: 186 NRGMLGMNKGRLSFISQARIT---KFSYCVPDRIGSDPTGLFYLGDNPNS----GKFKYV 245
             G+LG+ +G LSF SQ +     KFSYC+ DR  S  +        P S    G     
Sbjct: 261 AAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSS------KPPSTIVFGNAAVP 320

Query: 246 KMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHL-NISRAVFKPDPSGAGQTMIDSGSDLT 305
           K   FT    +  LD   Y L + GI +G + +  +S + FK D +G G  +IDSG+ +T
Sbjct: 321 KTSVFTPLLTNPKLDTFYY-LQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVT 380

Query: 306 YLVDEAYSKVRAEIVRLVGPMMKKAYEYAAVDMCFDGAEAAVVGRRIGNMWFKFENGVEI 365
            L   AY  +R +  RL    +K+A  Y+  D CFD +    V  ++  + F F  G E+
Sbjct: 381 RLTQPAYVALR-DAFRLGATKLKRAPSYSLFDTCFDLSGMTTV--KVPTVVFHFGGG-EV 440

Query: 366 LVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRVGFGEAEC 413
            +     L+    +G  C     +   +   +IIG I Q+   V YDL   RVGF    C
Sbjct: 441 SLPASNYLIPVNTEGRFCFAFAGT---MGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

BLAST of CmaCh01G003360 vs. NCBI nr
Match: gi|659114575|ref|XP_008457122.1| (PREDICTED: aspartic proteinase PCS1 [Cucumis melo])

HSP 1 Score: 574.3 bits (1479), Expect = 1.7e-160
Identity = 298/432 (68.98%), Postives = 342/432 (79.17%), Query Frame = 1

Query: 1   MPLSLLFLALFSLLFSQSNSLSLSFPLTSLPLSQEPSFS--------LSSKTSPHDSTKL 60
           M L L  L+LF+L FSQSNS+SL FPL+   LS++PS           + K S H S KL
Sbjct: 1   MLLILFSLSLFTLPFSQSNSVSLPFPLS---LSEKPSNISPIYGSQLYAKKPSSHGSFKL 60

Query: 61  PFKYSN-ALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKVRTE------SVINRFDPYL 120
           PFKYS+ ALVVS P+G+PPQP D+VLDTGSQLSW+QCH KV+ +           FDP L
Sbjct: 61  PFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKVKKKLPPLPKPKTASFDPSL 120

Query: 121 SSTFSHLPCNNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSL 180
           SS+FS LPCN+ +CKPRIPDFTLPTSCD++R CHYSYFY DGTLAEGNLV EK + SNSL
Sbjct: 121 SSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFSLSNSL 180

Query: 181 TTLSLVLGCATTSTENRGMLGMNKGRLSFISQARITKFSYCVPDRIGSDPTGLFYLGDNP 240
           +T  ++LGCA  STENRG+LGMNKGRLSFISQA+I+KFSYCVP R GS+PTGLFYLGDNP
Sbjct: 181 STPPVILGCAQASTENRGILGMNKGRLSFISQAKISKFSYCVPARTGSNPTGLFYLGDNP 240

Query: 241 NSGKFKYVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMI 300
           NS +FKYV MLTF +S+ S NLD LAYTLPM GI+I    LNIS A FKPD  G+GQTMI
Sbjct: 241 NSSRFKYVTMLTFPESQSSPNLDPLAYTLPMKGIKIAGKRLNISPAAFKPDAGGSGQTMI 300

Query: 301 DSGSDLTYLVDEAYSKVRAEIVRLVGPMMKKAYEYAAV-DMCFDGAEAAVVGRRIGNMWF 360
           DSGSDLTYLVDEAY KV+ E+VRLVG  MKK Y YAAV DMCFD    A VGRRIG + F
Sbjct: 301 DSGSDLTYLVDEAYEKVKEEVVRLVGAKMKKGYVYAAVADMCFDARVTAEVGRRIGGISF 360

Query: 361 KFENGVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKR 417
           +F+NGVEILVG+GEG+LTEVEKGVKCVG GRS+RL I SNIIG +HQ+NMWVEYDL N+R
Sbjct: 361 EFDNGVEILVGRGEGVLTEVEKGVKCVGFGRSERLGIGSNIIGTVHQQNMWVEYDLTNRR 420

BLAST of CmaCh01G003360 vs. NCBI nr
Match: gi|778679913|ref|XP_004140731.2| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 573.5 bits (1477), Expect = 2.9e-160
Identity = 299/433 (69.05%), Postives = 342/433 (78.98%), Query Frame = 1

Query: 1   MPLSLLFLALFSLLFSQSNSLSLSFPLTSLPLSQEPSFSLSS--------KTSPHDSTKL 60
           M L L  L+LF+L FSQSNSLSL FPL+   LS++PS ++ S        + S + S KL
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLS---LSEKPSNTIPSYSSQLYAKRPSSYGSFKL 60

Query: 61  PFKYSN-ALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKVRTESV-------INRFDPY 120
           PFKYS+ ALVVS P+G+PPQP D+VLDTGSQLSW+QCH K   + +          FDP 
Sbjct: 61  PFKYSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPS 120

Query: 121 LSSTFSHLPCNNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNS 180
           LSS+FS LPCN+ +CKPRIPDFTLPTSCD++R CHYSYFY DGTLAEGNLV EK TFS S
Sbjct: 121 LSSSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKS 180

Query: 181 LTTLSLVLGCATTSTENRGMLGMNKGRLSFISQARITKFSYCVPDRIGSDPTGLFYLGDN 240
           L+T  ++LGCA  STENRG+LGMN+GRLSFISQA+I+KFSYCVP R GS+PTGLFYLGDN
Sbjct: 181 LSTPPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDN 240

Query: 241 PNSGKFKYVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTM 300
           PNS KFKYV MLTF +S+ S NLD LAYTLPM  I+I    LNI  A FKPD  G+GQTM
Sbjct: 241 PNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTM 300

Query: 301 IDSGSDLTYLVDEAYSKVRAEIVRLVGPMMKKAYEYAAV-DMCFDGAEAAVVGRRIGNMW 360
           IDSGSDLTYLVDEAY KV+ E+VRLVG MMKK Y YA V DMCFD    A VGRRIG + 
Sbjct: 301 IDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGIS 360

Query: 361 FKFENGVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANK 417
           F+F+NGVEI VG+GEG+LTEVEKGVKCVGIGRS+RL I SNIIG +HQ+NMWVEYDLANK
Sbjct: 361 FEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANK 420

BLAST of CmaCh01G003360 vs. NCBI nr
Match: gi|778679910|ref|XP_011651212.1| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 573.2 bits (1476), Expect = 3.8e-160
Identity = 301/431 (69.84%), Postives = 337/431 (78.19%), Query Frame = 1

Query: 1   MPLSLLFLALFSLLFSQSNSLSLSFPL--TSLPLSQEPSFSLSS----KTSPHDSTKLPF 60
           M L L  L+LF+L FSQSNSLSL FPL  T  P +  P +  S     K S H   KLPF
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60

Query: 61  KYSN-ALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKVRTESV-------INRFDPYLS 120
           KYS+ ALVVS P+G+PPQP D+VLDTGSQLSW+QCH K   + +          FDP LS
Sbjct: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLS 120

Query: 121 STFSHLPCNNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLT 180
           S+FS LPCN+ +CKPRIPDFTLPTSCD++R CHYSYFY DGTLAEGNLV EK TFSNSL+
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180

Query: 181 TLSLVLGCATTSTENRGMLGMNKGRLSFISQARITKFSYCVPDRIGSDPTGLFYLGDNPN 240
           T  ++LGCA  STENRG+LGMN GRLSFISQA+I+KFSYCVP R GS+PTGLFYLGDNPN
Sbjct: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240

Query: 241 SGKFKYVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMID 300
           S KFKYV MLTF +S+ S NLD LAYTLPM  I+I    LNI  A FKPD  G+GQTMID
Sbjct: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYSKVRAEIVRLVGPMMKKAYEYAAV-DMCFDGAEAAVVGRRIGNMWFK 360
           SGSDLTYLVDEAY KV+ E+VRLVG MMKK Y YAAV DMCFD      VGRRIG+M F+
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360

Query: 361 FENGVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRV 417
           F+NGVEI VG+GEG+LTEVEKGVKCVGIGRS RL I SNIIG +HQ+NMWVEYDLANKRV
Sbjct: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420

BLAST of CmaCh01G003360 vs. NCBI nr
Match: gi|802732601|ref|XP_012086431.1| (PREDICTED: aspartic proteinase PCS1 [Jatropha curcas])

HSP 1 Score: 499.6 bits (1285), Expect = 5.3e-138
Identity = 264/436 (60.55%), Postives = 314/436 (72.02%), Query Frame = 1

Query: 1   MPLSLLFLALFSLL--------FSQSNSLSLSFPLTSLPLSQEPSFSLS----SKTSPHD 60
           MP  L F   FS L           +++ S SFPL S P SQ PSF  S    +K + H 
Sbjct: 1   MPFFLFFFLFFSFLSLPTKQTQIKNTSNFSFSFPLNSFPRSQNPSFQSSFISQTKRNQHV 60

Query: 61  ST-----KLPFKYSNALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKV-RTESVINRFD 120
            T     +  FKYS AL+VS P+G+PPQ   MVLDTGSQLSW+QCH K  R       FD
Sbjct: 61  QTSGYNYRSRFKYSMALIVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKAPRKLPPTTSFD 120

Query: 121 PYLSSTFSHLPCNNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFS 180
           P LSS+FS LPCN+ LCKPRIPDFTLPT+CD++R CHYSYFY DGTLAEG+LV EK TFS
Sbjct: 121 PSLSSSFSVLPCNHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTLAEGSLVREKFTFS 180

Query: 181 NSLTTLSLVLGCATTSTENRGMLGMNKGRLSFISQARITKFSYCVP---DRIGSDPTGLF 240
           N+ +T  L+LGCA  S +++G+LGMN GR SF SQA+I+KFSYCVP   +R G  PTGLF
Sbjct: 181 NTQSTPPLILGCAEDSGDDKGILGMNLGRRSFASQAKISKFSYCVPTRGNRAGLSPTGLF 240

Query: 241 YLGDNPNSGKFKYVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSG 300
           YLGDNPNSG F Y+ +LTFT S+RS NLD LAYT+PM GIRIG   LNI  +VF+PDPSG
Sbjct: 241 YLGDNPNSGGFHYINLLTFTPSQRSPNLDPLAYTVPMQGIRIGNTRLNIPASVFRPDPSG 300

Query: 301 AGQTMIDSGSDLTYLVDEAYSKVRAEIVRLVGPMMKKAYEYAAV-DMCFDGAEAAVVGRR 360
           +GQTM+DSGS+ TYLVDEAY+KVR EIVR+ G  +KK Y Y  V DMCFDG     +GR 
Sbjct: 301 SGQTMVDSGSEFTYLVDEAYNKVREEIVRVAGTKLKKNYVYGGVSDMCFDG-NPVEIGRL 360

Query: 361 IGNMWFKFENGVEILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEY 415
           IGNM F+FE GVEI+V + E +L  V  GV CVGIGRS+ L   SNIIG  HQ+N+WVE+
Sbjct: 361 IGNMVFEFEKGVEIVVDR-ERVLANVGNGVHCVGIGRSEMLGAASNIIGNFHQQNLWVEF 420

BLAST of CmaCh01G003360 vs. NCBI nr
Match: gi|950983173|ref|XP_014502710.1| (PREDICTED: aspartic proteinase PCS1-like [Vigna radiata var. radiata])

HSP 1 Score: 494.6 bits (1272), Expect = 1.7e-136
Identity = 261/424 (61.56%), Postives = 316/424 (74.53%), Query Frame = 1

Query: 6   LFLALFSLL-------FSQSNSLSLSFPLTSLPLSQ----EPSFSLSSKTSPHDSTKLPF 65
           LF  LF+LL       F++ +S+S SFPL SLPLS     + +  L S +S   + K PF
Sbjct: 9   LFSLLFALLLLFSASSFAKHDSVSFSFPLRSLPLSAGKPLKTNPKLRSLSSASYNVKWPF 68

Query: 66  KYSNALVVSFPVGSPPQPMDMVLDTGSQLSWVQCHGKVRTESVINRFDPYLSSTFSHLPC 125
           KYS ALVVS P+G+PPQ   MVLDTGSQLSW+QCH K    +    FDP LSS+F  +PC
Sbjct: 69  KYSMALVVSLPIGTPPQHQQMVLDTGSQLSWIQCHNKTPPTA---SFDPSLSSSFYVIPC 128

Query: 126 NNSLCKPRIPDFTLPTSCDRHRHCHYSYFYGDGTLAEGNLVTEKITFSNSLTTLSLVLGC 185
            + LCKPR+PDFTLPT+CD++R CHYSYFY DGT AEGNLV EK+TFS S TT  L LGC
Sbjct: 129 THPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLTFSPSQTTPPLTLGC 188

Query: 186 ATTSTENRGMLGMNKGRLSFISQARITKFSYCVP---DRIGSDPTGLFYLGDNPNSGKFK 245
           AT S +  G+LGMN GRLSF SQA+ITKFSYCVP    R G+ PTG FYLG+NPNSG+F+
Sbjct: 189 ATESRDAGGILGMNLGRLSFPSQAKITKFSYCVPTRQSRSGNLPTGSFYLGNNPNSGRFR 248

Query: 246 YVKMLTFTKSRRSTNLDKLAYTLPMNGIRIGKNHLNISRAVFKPDPSGAGQTMIDSGSDL 305
           YV MLTF++S+R  NLD LAYT+PM GIRIG   LNI+ +VF+PD  G+GQTMIDSGS+ 
Sbjct: 249 YVSMLTFSQSQRMPNLDPLAYTVPMQGIRIGGKRLNINPSVFRPDAGGSGQTMIDSGSEF 308

Query: 306 TYLVDEAYSKVRAEIVRLVGPMMKKAYEYAAV-DMCFDGAEAAVVGRRIGNMWFKFENGV 365
           T+LVDEAY +VR E+VR+VGP +KK Y Y  V DMCFDG+    +GR IG++  +FE GV
Sbjct: 309 TFLVDEAYDRVREEVVRVVGPRIKKGYVYGGVADMCFDGSARESIGRVIGDVVLEFEKGV 368

Query: 366 EILVGKGEGLLTEVEKGVKCVGIGRSDRLVIESNIIGIIHQKNMWVEYDLANKRVGFGEA 415
           EI+V K E +L +V  GV CVGIGRS+RL   SNIIG IHQ+N+WVE+DLAN RVGFGEA
Sbjct: 369 EIVVPK-ERVLADVGGGVHCVGIGRSERLGAASNIIGNIHQQNLWVEFDLANHRVGFGEA 428

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PCS1L_ARATH6.2e-6235.75Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1[more]
NEP1_NEPGR6.9e-3730.83Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1[more]
ASPG1_ARATH4.7e-3327.56Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ... [more]
NEP2_NEPGR4.7e-3330.00Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1[more]
APF2_ARATH9.7e-3130.53Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0L9UW35_PHAAN5.9e-13661.27Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan07g076900 PE=3 SV=1[more]
A0A0S3RKY8_PHAAN5.9e-13661.27Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.03G096000 PE=... [more]
B9T2R1_RICCO3.9e-13559.91Aspartic proteinase nepenthesin-1, putative OS=Ricinus communis GN=RCOM_0593500 ... [more]
U5FLT4_POPTR5.0e-13560.63Aspartyl protease family protein OS=Populus trichocarpa GN=POPTR_0017s02710g PE=... [more]
V7D2B7_PHAVU6.6e-13560.92Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_001G229200g PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G66180.12.9e-13459.20 Eukaryotic aspartyl protease family protein[more]
AT5G37540.12.7e-13257.91 Eukaryotic aspartyl protease family protein[more]
AT5G02190.13.5e-6335.75 Eukaryotic aspartyl protease family protein[more]
AT2G39710.11.1e-6135.76 Eukaryotic aspartyl protease family protein[more]
AT3G61820.19.0e-3532.78 Eukaryotic aspartyl protease family protein[more]
Match NameE-valueIdentityDescription
gi|659114575|ref|XP_008457122.1|1.7e-16068.98PREDICTED: aspartic proteinase PCS1 [Cucumis melo][more]
gi|778679913|ref|XP_004140731.2|2.9e-16069.05PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus][more]
gi|778679910|ref|XP_011651212.1|3.8e-16069.84PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus][more]
gi|802732601|ref|XP_012086431.1|5.3e-13860.55PREDICTED: aspartic proteinase PCS1 [Jatropha curcas][more]
gi|950983173|ref|XP_014502710.1|1.7e-13661.56PREDICTED: aspartic proteinase PCS1-like [Vigna radiata var. radiata][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001461Aspartic_peptidase_A1
IPR001969Aspartic_peptidase_AS
IPR021109Peptidase_aspartic_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0004190aspartic-type endopeptidase activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh01G003360.1CmaCh01G003360.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 3..415
score: 6.2E
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 75..86
scor
IPR021109Aspartic peptidase domainGENE3DG3DSA:2.40.70.10coord: 59..223
score: 3.7E-30coord: 227..415
score: 5.3
IPR021109Aspartic peptidase domainunknownSSF50630Acid proteasescoord: 61..415
score: 4.78
NoneNo IPR availablePANTHERPTHR13683:SF327ASPARTYL PROTEASE FAMILY PROTEINcoord: 3..415
score: 6.2E

The following gene(s) are paralogous to this gene:

None