CmaCh14G008700 (gene) Cucurbita maxima (Rimu)

Name: CmaCh14G008700
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: Eukaryotic aspartyl protease family protein
Location: Cma_Chr14:4487903..4489162 (-)
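The location above can be turned into a feature length with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but an assumption here); the helper name `feature_length` is illustrative and not part of the database.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


length = feature_length(4487903, 4489162)
print(length)           # 1260
print(length % 3 == 0)  # True: the span divides into whole codons
# If the whole span is coding and ends in a stop codon,
# the protein would have length // 3 - 1 residues.
print(length // 3 - 1)  # 419
```

The 419-residue figure matches the protein sequence listed for this feature, which is consistent with the gene, mRNA and CDS sequences being identical (a single coding exon with no annotated UTRs).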
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTCTCTCTCTCTTCTACATCTCTCTACTCTCTCTCTCCTTCTCCCACTCCCATTCTCTCCCCCTCTCCTTCCCTCTTTCTCTCTCTAAACGCCCCTCCCCTGTTTCTCCTCTCTTCTCTTCTCTCTCCTCCTACGGCTCCGTCAAGCTCCCCTTCAAATACTCCACCGCCCTTGTCGTTTCTCTTCCTATTGGAACGCCGCCGCAGCCCACGGACTTGGTTTTGGACACCGGCAGCCAGCTTTCTTGGATTCAATGTCACCGCAAACTTCATAAGAAATTGCTGAAGCCCAAGACGGCGTCGTTTGACCCTTCTCTCTCCTCCTCTTTCTCTCTCCTCCCTTGTAATCATCCTCTCTGCAAACCCAGAATTCCTGATTTTACCCTTCCCACTTCTTGTGACCAAAATCGCCTCTGTCACTACTCCTACTTCTACGCTGATGGCACCTTGGCCGAGGGTAATCTCGTAAGAGAAAAATTCACCTTCTCAAATTCCCGTACTACCCCTCCTGTCATCCTTGGCTGTGCTCAAGCCTCCACTGAAAACAGGGGTATTTTGGGAATGAACACTGGACGTCTCTCCTTCGTCTCCCAAGCCAAAATCTCCAAATTCTCCTACTGCGTTCCCGGTCGAACCGGACCGGATTCAACCGGGTTGTTCTACCTTGGAGACAACCCGAATTCTGCCAATTTCAAATATATATCCTTGTTGACTTTCCCCAAAAGTCAACGCTCCCCGAATCTCGATCCACTGGCGTACACCCTCCCACTTAAGGGCATAAAAATTGCCGGGAACCGTCTCAATATCTCATCGGCCGTTTTCAAACCGGATAGGAGCGGGTCCGGTCAAACCATGATCGACTCCGGTTCGGACCTCACTTACCTAGTGGACGAAGCGTACGAGAAGGTTAAAGAAGAGATAGTCAAATTAGTGGGGCCCTTAATGAAGAAAGGGTACGAATACGCCGCCGTGGCCGACATGTGTTTCAACGACGGCGAGACAGCAGAGGTGGGTCGGAGGATTCGCGACATGTCGTTCGAGTTTGAGAATGGGGTGGAGATTTCGGTGGGGAAAGGAGAGGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGTTTGGACGGTCGGGAAGGCTTGGAATTGCGAGTAACATAATCGGAACCGTCCATCAGAAGAATACGTGGGTAGAATATGATTTGGCCAATAGGAGAATAGGGTTTGGTGGAGCCGACTGTAGCAGATTGAAGTGA

mRNA sequence

ATGCTTCTCTCTCTCTTCTACATCTCTCTACTCTCTCTCTCCTTCTCCCACTCCCATTCTCTCCCCCTCTCCTTCCCTCTTTCTCTCTCTAAACGCCCCTCCCCTGTTTCTCCTCTCTTCTCTTCTCTCTCCTCCTACGGCTCCGTCAAGCTCCCCTTCAAATACTCCACCGCCCTTGTCGTTTCTCTTCCTATTGGAACGCCGCCGCAGCCCACGGACTTGGTTTTGGACACCGGCAGCCAGCTTTCTTGGATTCAATGTCACCGCAAACTTCATAAGAAATTGCTGAAGCCCAAGACGGCGTCGTTTGACCCTTCTCTCTCCTCCTCTTTCTCTCTCCTCCCTTGTAATCATCCTCTCTGCAAACCCAGAATTCCTGATTTTACCCTTCCCACTTCTTGTGACCAAAATCGCCTCTGTCACTACTCCTACTTCTACGCTGATGGCACCTTGGCCGAGGGTAATCTCGTAAGAGAAAAATTCACCTTCTCAAATTCCCGTACTACCCCTCCTGTCATCCTTGGCTGTGCTCAAGCCTCCACTGAAAACAGGGGTATTTTGGGAATGAACACTGGACGTCTCTCCTTCGTCTCCCAAGCCAAAATCTCCAAATTCTCCTACTGCGTTCCCGGTCGAACCGGACCGGATTCAACCGGGTTGTTCTACCTTGGAGACAACCCGAATTCTGCCAATTTCAAATATATATCCTTGTTGACTTTCCCCAAAAGTCAACGCTCCCCGAATCTCGATCCACTGGCGTACACCCTCCCACTTAAGGGCATAAAAATTGCCGGGAACCGTCTCAATATCTCATCGGCCGTTTTCAAACCGGATAGGAGCGGGTCCGGTCAAACCATGATCGACTCCGGTTCGGACCTCACTTACCTAGTGGACGAAGCGTACGAGAAGGTTAAAGAAGAGATAGTCAAATTAGTGGGGCCCTTAATGAAGAAAGGGTACGAATACGCCGCCGTGGCCGACATGTGTTTCAACGACGGCGAGACAGCAGAGGTGGGTCGGAGGATTCGCGACATGTCGTTCGAGTTTGAGAATGGGGTGGAGATTTCGGTGGGGAAAGGAGAGGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGTTTGGACGGTCGGGAAGGCTTGGAATTGCGAGTAACATAATCGGAACCGTCCATCAGAAGAATACGTGGGTAGAATATGATTTGGCCAATAGGAGAATAGGGTTTGGTGGAGCCGACTGTAGCAGATTGAAGTGA

Coding sequence (CDS)

ATGCTTCTCTCTCTCTTCTACATCTCTCTACTCTCTCTCTCCTTCTCCCACTCCCATTCTCTCCCCCTCTCCTTCCCTCTTTCTCTCTCTAAACGCCCCTCCCCTGTTTCTCCTCTCTTCTCTTCTCTCTCCTCCTACGGCTCCGTCAAGCTCCCCTTCAAATACTCCACCGCCCTTGTCGTTTCTCTTCCTATTGGAACGCCGCCGCAGCCCACGGACTTGGTTTTGGACACCGGCAGCCAGCTTTCTTGGATTCAATGTCACCGCAAACTTCATAAGAAATTGCTGAAGCCCAAGACGGCGTCGTTTGACCCTTCTCTCTCCTCCTCTTTCTCTCTCCTCCCTTGTAATCATCCTCTCTGCAAACCCAGAATTCCTGATTTTACCCTTCCCACTTCTTGTGACCAAAATCGCCTCTGTCACTACTCCTACTTCTACGCTGATGGCACCTTGGCCGAGGGTAATCTCGTAAGAGAAAAATTCACCTTCTCAAATTCCCGTACTACCCCTCCTGTCATCCTTGGCTGTGCTCAAGCCTCCACTGAAAACAGGGGTATTTTGGGAATGAACACTGGACGTCTCTCCTTCGTCTCCCAAGCCAAAATCTCCAAATTCTCCTACTGCGTTCCCGGTCGAACCGGACCGGATTCAACCGGGTTGTTCTACCTTGGAGACAACCCGAATTCTGCCAATTTCAAATATATATCCTTGTTGACTTTCCCCAAAAGTCAACGCTCCCCGAATCTCGATCCACTGGCGTACACCCTCCCACTTAAGGGCATAAAAATTGCCGGGAACCGTCTCAATATCTCATCGGCCGTTTTCAAACCGGATAGGAGCGGGTCCGGTCAAACCATGATCGACTCCGGTTCGGACCTCACTTACCTAGTGGACGAAGCGTACGAGAAGGTTAAAGAAGAGATAGTCAAATTAGTGGGGCCCTTAATGAAGAAAGGGTACGAATACGCCGCCGTGGCCGACATGTGTTTCAACGACGGCGAGACAGCAGAGGTGGGTCGGAGGATTCGCGACATGTCGTTCGAGTTTGAGAATGGGGTGGAGATTTCGGTGGGGAAAGGAGAGGGGGTTTTGACGGAAGTGGAAAAAGGAGTGAAGTGTGTGGGGTTTGGACGGTCGGGAAGGCTTGGAATTGCGAGTAACATAATCGGAACCGTCCATCAGAAGAATACGTGGGTAGAATATGATTTGGCCAATAGGAGAATAGGGTTTGGTGGAGCCGACTGTAGCAGATTGAAGTGA

Protein sequence

MLLSLFYISLLSLSFSHSHSLPLSFPLSLSKRPSPVSPLFSSLSSYGSVKLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGFGGADCSRLK
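The protein sequence above should follow from the CDS by translation with the standard genetic code. The sketch below is one way to check that in plain Python, assuming the standard table (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative, and only the first 30 and last 21 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCTTCTCTCTCTCTTCTACATCTCTCTA"  # first 30 nt of the CDS
cds_end = "GACTGTAGCAGATTGAAGTGA"            # last 21 nt of the CDS
print(translate(cds_start))  # MLLSLFYISL
print(translate(cds_end))    # DCSRLK
```

Both fragments agree with the start (MLLSLFYISL) and end (DCSRLK) of the listed protein; passing the full CDS to `translate` performs the same check over the whole sequence.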
BLAST of CmaCh14G008700 vs. Swiss-Prot
Match: PCS1L_ARATH (Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 6.7e-64
Identity = 155/443 (34.99%), Positives = 233/443 (52.60%), Query Frame = 1

Query: 1   MLLSLFYISLLSLSFSHSHSLPLSFPLSLSKRPSPVSPLFSSLSSYG---SVKLPFKYST 60
           ++LS+     +S S S S S   S   S S   + V PL + ++      + KL F ++ 
Sbjct: 12  LVLSVRTYKCVSSSSSSSSSFSFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLHFHHNV 71

Query: 61  ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPCN 120
            L V+L +GTPPQ   +V+DTGS+LSW++C+R  +   +     +FDP+ SSS+S +PC+
Sbjct: 72  TLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSNPNPVN----NFDPTRSSSYSPIPCS 131

Query: 121 HPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC- 180
            P C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F NS     +I GC 
Sbjct: 132 SPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCM 191

Query: 181 -------AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNS 240
                   +  T+  G+LGMN G LSF+SQ    KFSYC+ G    D  G   LGD    
Sbjct: 192 GSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGT--DDFPGFLLLGD---- 251

Query: 241 ANFKYISLLTFPK----SQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQT 300
           +NF +++ L +      S   P  D +AYT+ L GIK+ G  L I  +V  PD +G+GQT
Sbjct: 252 SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQT 311

Query: 301 MIDSGSDLTYLVDEAYEKVKEEIVKLVGPLM----KKGYEYAAVADMCFNDGET---AEV 360
           M+DSG+  T+L+   Y  ++   +     ++       + +    D+C+        + +
Sbjct: 312 MVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGI 371

Query: 361 GRRIRDMSFEFENGVEISVGKGEGVLTEV------EKGVKCVGFGRSGRLGIASNIIGTV 416
             R+  +S  FE G EI+V  G+ +L  V         V C  FG S  +G+ + +IG  
Sbjct: 372 LHRLPTVSLVFE-GAEIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHH 431
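The percentages in each HSP header are the identical and positive-scoring column counts divided by the alignment length. The sketch below recomputes them from a header line; it is a minimal example, and the regular expression and the function name `parse_hsp_stats` are assumptions written for this page's layout, not part of BLAST. The pattern accepts both "Positives" and the "Postives" spelling that appears in some exports.

```python
import re

# Matches e.g. "Identity = 155/443 (34.99%), Positives = 233/443 (52.60%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract the counts from an HSP header line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("no identity/positives statistics found")
    identical, length, _, positives, length2, _ = match.groups()
    identical, positives = int(identical), int(positives)
    length, length2 = int(length), int(length2)
    if length != length2:
        raise ValueError("inconsistent alignment lengths")
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


header = "Identity = 155/443 (34.99%), Positives = 233/443 (52.60%), Query Frame = 1"
stats = parse_hsp_stats(header)
print(stats["identity_pct"])   # 34.99
print(stats["positives_pct"])  # 52.6
```

The recomputed values agree with the percentages printed in the header for the PCS1L_ARATH hit.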

BLAST of CmaCh14G008700 vs. Swiss-Prot
Match: NEP2_NEPGR (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 7.9e-41
Identity = 116/364 (31.87%), Positives = 175/364 (48.08%), Query Frame = 1

Query: 60  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPCNHP 119
           ++++ IGTP      ++DTGS L W QC      +     T  F+P  SSSFS LPC   
Sbjct: 97  LMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCT--QCFSQPTPIFNPQDSSSFSTLPCESQ 156

Query: 120 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQA 179
            C+       LP+    N  C Y+Y Y DG+  +G +  E FTF  S + P +  GC + 
Sbjct: 157 YCQD------LPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETS-SVPNIAFGCGED 216

Query: 180 ST-----ENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKY 239
           +         G++GM  G LS  SQ  + +FSYC+    G  S     LG   +      
Sbjct: 217 NQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSY-GSSSPSTLALGSAASGVPEGS 276

Query: 240 ISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLT 299
            S      S     L+P  Y + L+GI + G+ L I S+ F+    G+G  +IDSG+ LT
Sbjct: 277 PSTTLIHSS-----LNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 336

Query: 300 YLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCF---NDGETAEVGRRIRDMSFEFEN 359
           YL  +AY  V +     +   +    E ++    CF   +DG T +V     ++S +F+ 
Sbjct: 337 YLPQDAYNAVAQAFTDQIN--LPTVDESSSGLSTCFQQPSDGSTVQV----PEISMQFDG 396

Query: 360 GVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGFG 416
           GV +++G+ + +L    +GV C+  G S +LGI  +I G + Q+ T V YDL N  + F 
Sbjct: 397 GV-LNLGE-QNILISPAEGVICLAMGSSSQLGI--SIFGNIQQQETQVLYDLQNLAVSFV 435

BLAST of CmaCh14G008700 vs. Swiss-Prot
Match: NEP1_NEPGR (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 2.4e-37
Identity = 110/363 (30.30%), Positives = 166/363 (45.73%), Query Frame = 1

Query: 60  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPCNHP 119
           +++L IGTP QP   ++DTGS L W QC      +     T  F+P  SSSFS LPC+  
Sbjct: 96  LMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCT--QCFNQSTPIFNPQGSSSFSTLPCSSQ 155

Query: 120 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQA 179
           LC+       L +    N  C Y+Y Y DG+  +G++  E  TF  S + P +  GC + 
Sbjct: 156 LCQ------ALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFG-SVSIPNITFGCGEN 215

Query: 180 ST-----ENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKY 239
           +         G++GM  G LS  SQ  ++KFSYC+    G  +     LG   NS     
Sbjct: 216 NQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTP-IGSSTPSNLLLGSLANSVTAGS 275

Query: 240 ISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPD-RSGSGQTMIDSGSDL 299
            +      SQ      P  Y + L G+ +   RL I  + F  +  +G+G  +IDSG+ L
Sbjct: 276 PNTTLIQSSQI-----PTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTL 335

Query: 300 TYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENG- 359
           TY V+ AY+ V++E +  +   +  G   ++  D+CF          +I      F+ G 
Sbjct: 336 TYFVNNAYQSVRQEFISQINLPVVNG--SSSGFDLCFQTPSDPS-NLQIPTFVMHFDGGD 395

Query: 360 VEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGFGG 416
           +E+     E        G+ C+  G S +     +I G + Q+N  V YD  N  + F  
Sbjct: 396 LEL---PSENYFISPSNGLICLAMGSSSQ---GMSIFGNIQQQNMLVVYDTGNSVVSFAS 434

BLAST of CmaCh14G008700 vs. Swiss-Prot
Match: ASPG1_ARATH (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 153.7 bits (387), Expect = 4.5e-36
Identity = 106/359 (29.53%), Positives = 166/359 (46.24%), Query Frame = 1

Query: 65  IGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPCNHPLCKPR 124
           +GTP +   LVLDTGS ++WIQC         +     F+P+ SS++  L C+ P C   
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQCEPCAD--CYQQSDPVFNPTSSSTYKSLTCSAPQCS-- 227

Query: 125 IPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQAS---- 184
                L TS  ++  C Y   Y DG+   G L  +  TF NS     V LGC   +    
Sbjct: 228 ----LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLF 287

Query: 185 TENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFY----LGDNPNSANFKYIS 244
           T   G+LG+  G LS  +Q K + FSYC+  R    S+ L +    LG    +A      
Sbjct: 288 TGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLL--- 347

Query: 245 LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL 304
                   R+  +D   Y + L G  + G ++ +  A+F  D SGSG  ++D G+ +T L
Sbjct: 348 --------RNKKIDTFYY-VGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRL 407

Query: 305 VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS 364
             +AY  +++  +KL   L KKG    ++ D C++    + V  ++  ++F F  G  + 
Sbjct: 408 QTQAYNSLRDAFLKLTVNL-KKGSSSISLFDTCYDFSSLSTV--KVPTVAFHFTGGKSLD 467

Query: 365 VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGFGGADC 416
           +     ++   + G  C  F  +     + +IIG V Q+ T + YDL+   IG  G  C
Sbjct: 468 LPAKNYLIPVDDSGTFCFAFAPTSS---SLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmaCh14G008700 vs. Swiss-Prot
Match: APF2_ARATH (Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1)

HSP 1 Score: 139.0 bits (349), Expect = 1.2e-31
Identity = 105/362 (29.01%), Positives = 176/362 (48.62%), Query Frame = 1

Query: 63  LPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPCNHPLCK 122
           L +GTP +   +VLDTGS + W+QC     ++        FDP  S +++ +PC+ P C+
Sbjct: 146 LGVGTPARYVYMVLDTGSDIVWLQC--APCRRCYSQSDPIFDPRKSKTYATIPCSSPHCR 205

Query: 123 PRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQAS-- 182
            R+      T   + + C Y   Y DG+   G+   E  TF  +R    V LGC   +  
Sbjct: 206 -RLDSAGCNT---RRKTCLYQVSYGDGSFTVGDFSTETLTFRRNR-VKGVALGCGHDNEG 265

Query: 183 --TENRGILGMNTGRLSFVSQAK---ISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYI 242
                 G+LG+  G+LSF  Q       KFSYC+  R+          G+   S   ++ 
Sbjct: 266 LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFT 325

Query: 243 SLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRL-NISSAVFKPDRSGSGQTMIDSGSDLT 302
            LL+ PK      LD   Y + L GI + G R+  +++++FK D+ G+G  +IDSG+ +T
Sbjct: 326 PLLSNPK------LDTF-YYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVT 385

Query: 303 YLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVE 362
            L+  AY  +++   ++    +K+  ++ ++ D CF+     EV  ++  +   F  G +
Sbjct: 386 RLIRPAYIAMRDAF-RVGAKTLKRAPDF-SLFDTCFDLSNMNEV--KVPTVVLHF-RGAD 445

Query: 363 ISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGFGGAD 417
           +S+     ++     G  C  F  +G +G   +IIG + Q+   V YDLA+ R+GF    
Sbjct: 446 VSLPATNYLIPVDTNGKFCFAF--AGTMG-GLSIIGNIQQQGFRVVYDLASSRVGFAPGG 485

BLAST of CmaCh14G008700 vs. TrEMBL
Match: A0A0B2S2W1_GLYSO (Aspartic proteinase nepenthesin-1 OS=Glycine soja GN=glysoja_013398 PE=3 SV=1)

HSP 1 Score: 540.4 bits (1391), Expect = 1.9e-150
Identity = 276/411 (67.15%), Positives = 325/411 (79.08%), Query Frame = 1

Query: 11  LSLSFSHSHSLPLSFPLSLSKRPSPVSPLFSSLSSYGSVKLPFKYSTALVVSLPIGTPPQ 70
           LSLSF  + SLPLS    L++ P+ +  L SS SSY ++K  FKYS ALVV+LPIGTPPQ
Sbjct: 27  LSLSFPLT-SLPLSTAKPLNRNPN-LRTLSSSSSSY-NIKSSFKYSMALVVTLPIGTPPQ 86

Query: 71  PTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPCNHPLCKPRIPDFTL 130
              +VLDTGSQLSWIQCH K       P TASFDPSLSSSF +LPC HPLCKPR+PDFTL
Sbjct: 87  HQQMVLDTGSQLSWIQCHNKT------PPTASFDPSLSSSFYILPCTHPLCKPRVPDFTL 146

Query: 131 PTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQASTENRGILGMN 190
           PT+CDQNRLCHYSYFYADGT AEGNLVREK TFS S+TTPP+ILGCA  S++ RGILGMN
Sbjct: 147 PTTCDQNRLCHYSYFYADGTYAEGNLVREKLTFSPSQTTPPLILGCATESSDARGILGMN 206

Query: 191 TGRLSFVSQAKISKFSYCVPGRTGPDS----TGLFYLGDNPNSANFKYISLLTFPKSQRS 250
            GRLSF SQAK++KFSYCVP R   +     TG FYLG+NPNSA F+Y+S+LTFP+SQR 
Sbjct: 207 LGRLSFPSQAKVTKFSYCVPTRQAANDNNLPTGSFYLGNNPNSARFRYVSMLTFPQSQRM 266

Query: 251 PNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYLVDEAYEKVKE 310
           PNLDPLAYT+P++GI+I G +LNI  +VF+P+  GSGQTM+DSGS+ T+LVD AY+ V+E
Sbjct: 267 PNLDPLAYTVPMQGIRIGGKKLNIPPSVFRPNAGGSGQTMVDSGSEFTFLVDAAYDAVRE 326

Query: 311 EIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEISVGKGEGVLTE 370
           E++++VGP +KKGY Y  VADMCFN G   E+GR I D++FEFE GVEI V K E VL +
Sbjct: 327 EVIRVVGPRVKKGYVYGGVADMCFN-GSAMEIGRLIGDVAFEFEKGVEIVVPK-ERVLAD 386

Query: 371 VEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGFGGADCSR 418
           V  GV C+G GRS RLG ASNIIG  HQ+N WVE+DLANRRIGFG ADCSR
Sbjct: 387 VGGGVHCLGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVADCSR 426

BLAST of CmaCh14G008700 vs. TrEMBL
Match: Q9FGI3_ARATH (AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana GN=At5g37540 PE=2 SV=1)

HSP 1 Score: 540.0 bits (1390), Expect = 2.5e-150
Identity = 280/431 (64.97%), Positives = 324/431 (75.17%), Query Frame = 1

Query: 2   LLSLFYISLLSLSFSHSHSLPLSFPL-SLSKRPSPVSPLF----------SSLSSYGSVK 61
           LL +F+    S+S S S SL L FPL SL   P+  S  F          S  SS  + +
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTFR 71

Query: 62  LPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSS 121
              KYS AL++SLPIGTP Q  +LVLDTGSQLSWIQCH K  KK L P T SFDPSLSSS
Sbjct: 72  SNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSS 131

Query: 122 FSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTP 181
           FS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSNS+TTP
Sbjct: 132 FSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTP 191

Query: 182 PVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFYLGDNP 241
           P+ILGCA+ ST+ +GILGMN GRLSF+SQAKISKFSYC+P    R G  STG FYLGDNP
Sbjct: 192 PLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDNP 251

Query: 242 NSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMI 301
           NS  FKY+SLLTFP+SQR PNLDPLAYT+PL+GI+I   RLNI  +VF+PD  GSGQTM+
Sbjct: 252 NSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMV 311

Query: 302 DSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSF 361
           DSGS+ T+LVD AY+KVKEEIV+LVG  +KKGY Y + ADMCF+   + E+GR I D+ F
Sbjct: 312 DSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVF 371

Query: 362 EFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRR 419
           EF  GVEI V K + +L  V  G+ CVG GRS  LG ASNIIG VHQ+N WVE+D+ NRR
Sbjct: 372 EFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRR 431

BLAST of CmaCh14G008700 vs. TrEMBL
Match: D7MID8_ARALL (Aspartyl protease family protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_493732 PE=3 SV=1)

HSP 1 Score: 539.3 bits (1388), Expect = 4.3e-150
Identity = 282/432 (65.28%), Positives = 325/432 (75.23%), Query Frame = 1

Query: 1   MLLSLFYISLLSLSFSHSHSLPLSFPL-SLSKRPSPVSPLF----------SSLSSYGSV 60
           +L   F+    S+S S S SL L FPL SL   P+  S  F          S  SS  + 
Sbjct: 12  LLYIFFFFFCNSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPSSSPYTF 71

Query: 61  KLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSS 120
           +  FKYS AL++SLPIGTP Q  +LVLDTGSQLSWIQCH K  KK L P T SFDPSLSS
Sbjct: 72  RSNFKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSS 131

Query: 121 SFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTT 180
           SFS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSNS+TT
Sbjct: 132 SFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTT 191

Query: 181 PPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFYLGDN 240
           PP+ILGCA+ ST+ +GILGMN GRLSF+SQAKISKFSYC+P    R G  STG FYLG+N
Sbjct: 192 PPLILGCAKESTDVKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGEN 251

Query: 241 PNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTM 300
           PNS  FKY+SLLTFP+SQR PNLDPLAYT+PL GI+I   RLNI S+VF+PD  GSGQTM
Sbjct: 252 PNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTM 311

Query: 301 IDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMS 360
           +DSGS+ T+LVD AY+KVKEEIV+LVG  +KKGY Y + ADMCF+      +GR I D+ 
Sbjct: 312 VDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLV 371

Query: 361 FEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANR 419
           FEF  GVEI V K + +L  V  G+ CVG GRS  LG ASNIIG VHQ+N WVE+D+ANR
Sbjct: 372 FEFGRGVEILVEK-QRLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVANR 431

BLAST of CmaCh14G008700 vs. TrEMBL
Match: V4LXR6_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10028350mg PE=3 SV=1)

HSP 1 Score: 539.3 bits (1388), Expect = 4.3e-150
Identity = 276/408 (67.65%), Positives = 320/408 (78.43%), Query Frame = 1

Query: 14  SFSHSHSLPLSFPLSLSKRPSPVSPLFSSLSSYGSVKLPFKYSTALVVSLPIGTPPQPTD 73
           S S S S   SF  SL+ R +P S  +S  S+       FKYS AL++SLPIGTP Q  +
Sbjct: 39  SSSSSSSSSSSFQTSLASRRTPSSLPYSFRSN-------FKYSMALILSLPIGTPAQTQE 98

Query: 74  LVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPCNHPLCKPRIPDFTLPTS 133
           LVLDTGSQLSWIQCH K  KK  KP T SFDPSLSSSFS LPC+HPLCKPRIPDFTLPT+
Sbjct: 99  LVLDTGSQLSWIQCHPKKKKK--KP-TTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTT 158

Query: 134 CDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQASTENRGILGMNTGR 193
           CD NRLCHYSYFYADGT AEGNLV+EKFTFSN++ TPP+ILGCA  ST+++GILGMN GR
Sbjct: 159 CDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNTQITPPLILGCAAESTDDKGILGMNLGR 218

Query: 194 LSFVSQAKISKFSYCVPGRT---GPDSTGLFYLGDNPNSANFKYISLLTFPKSQRSPNLD 253
           LSFVSQAKISKFSYC+P R+   G  STG FYLG+NP+S  FKY+SLLTFP+SQR PNLD
Sbjct: 219 LSFVSQAKISKFSYCIPTRSNQPGLSSTGSFYLGENPSSRGFKYVSLLTFPQSQRMPNLD 278

Query: 254 PLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYLVDEAYEKVKEEIVK 313
           PLAYT+PL+GI+I   RLNIS++VF+PD  GSGQTM+DSGS+ T+LVD AY+KVKEEIV+
Sbjct: 279 PLAYTVPLQGIRIGQKRLNISASVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVR 338

Query: 314 LVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEISVGKGEGVLTEVEKG 373
           LVGP +KKGY Y A ADMCF+     E+GR I D+ FEF  GVEI V K + +L  V  G
Sbjct: 339 LVGPRLKKGYVYGATADMCFDGNNPVEIGRLIGDLVFEFGRGVEILVEK-QRLLVNVGGG 398

Query: 374 VKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGFGGADCSRL 419
           V C+G GRS  LG ASNIIG VHQ+N WVE+D+ANRR+GF  ADCSRL
Sbjct: 399 VHCLGIGRSSMLGAASNIIGNVHQQNLWVEFDVANRRVGFSKADCSRL 435

BLAST of CmaCh14G008700 vs. TrEMBL
Match: V7BBS3_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_008G237800g PE=3 SV=1)

HSP 1 Score: 535.4 bits (1378), Expect = 6.1e-149
Identity = 278/425 (65.41%), Positives = 327/425 (76.94%), Query Frame = 1

Query: 1   MLLSLFYISLLSLSFSH---SHSLPL-SFPLSLSKRPSPVSPLFSSLSSYG-SVKLPFKY 60
           +L SL  +   S S +H   S S PL S PLS  K P   +P   SLSS   +VK  FKY
Sbjct: 12  LLFSLLLLFSASSSANHDSVSFSFPLRSLPLSTEK-PLKTNPKLRSLSSASYNVKWSFKY 71

Query: 61  STALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLP 120
           S ALVVSLPIGTPPQ   +VLDTGSQLSWIQCH K       P TASFDPSLSSSF ++P
Sbjct: 72  SMALVVSLPIGTPPQHQQMVLDTGSQLSWIQCHNKT------PPTASFDPSLSSSFYVIP 131

Query: 121 CNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILG 180
           C HPLCKPR+PDFTLPT+CDQNRLCHYSYFYADGT AEGNLVREK TFS S+TTPP+ LG
Sbjct: 132 CTHPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTFAEGNLVREKLTFSPSQTTPPLTLG 191

Query: 181 CAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGR-TGPDS--TGLFYLGDNPNSANF 240
           CA  S +  GILGMN GRLSF SQAKI+KFSYCVP R TGP +  TG FY+G+NPNSA F
Sbjct: 192 CATESRDASGILGMNLGRLSFPSQAKITKFSYCVPTRKTGPGNVPTGSFYIGNNPNSARF 251

Query: 241 KYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSD 300
           +Y+++LTF +SQR PNLDPLAYT+P++GI+I G RLNI+ +VF+PD SGSGQTMIDSGS+
Sbjct: 252 RYVNMLTFSQSQRMPNLDPLAYTVPMQGIRIGGKRLNINPSVFRPDASGSGQTMIDSGSE 311

Query: 301 LTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENG 360
            T+LVD+AY++V+EE+V++VGP +KKGY Y  VADMCF+D     +GR + D+  EFE G
Sbjct: 312 FTFLVDQAYDRVREEVVRVVGPRLKKGYVYGGVADMCFDDSARETIGRLLGDVVLEFEKG 371

Query: 361 VEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGFGG 418
           VEI V K E VL +V  GV CVG GRS RLG ASNIIG +HQ+N W+E+DLAN R+GFG 
Sbjct: 372 VEIVVPK-ERVLADVGGGVHCVGIGRSERLGAASNIIGNIHQQNMWMEFDLANHRVGFGE 428

BLAST of CmaCh14G008700 vs. TAIR10
Match: AT5G37540.1 (AT5G37540.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 540.0 bits (1390), Expect = 1.3e-153
Identity = 280/431 (64.97%), Positives = 324/431 (75.17%), Query Frame = 1

Query: 2   LLSLFYISLLSLSFSHSHSLPLSFPL-SLSKRPSPVSPLF----------SSLSSYGSVK 61
           LL +F+    S+S S S SL L FPL SL   P+  S  F          S  SS  + +
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTFR 71

Query: 62  LPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSS 121
              KYS AL++SLPIGTP Q  +LVLDTGSQLSWIQCH K  KK L P T SFDPSLSSS
Sbjct: 72  SNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSS 131

Query: 122 FSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTP 181
           FS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSNS+TTP
Sbjct: 132 FSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTP 191

Query: 182 PVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFYLGDNP 241
           P+ILGCA+ ST+ +GILGMN GRLSF+SQAKISKFSYC+P    R G  STG FYLGDNP
Sbjct: 192 PLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDNP 251

Query: 242 NSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMI 301
           NS  FKY+SLLTFP+SQR PNLDPLAYT+PL+GI+I   RLNI  +VF+PD  GSGQTM+
Sbjct: 252 NSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMV 311

Query: 302 DSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSF 361
           DSGS+ T+LVD AY+KVKEEIV+LVG  +KKGY Y + ADMCF+   + E+GR I D+ F
Sbjct: 312 DSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVF 371

Query: 362 EFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRR 419
           EF  GVEI V K + +L  V  G+ CVG GRS  LG ASNIIG VHQ+N WVE+D+ NRR
Sbjct: 372 EFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRR 431

BLAST of CmaCh14G008700 vs. TAIR10
Match: AT1G66180.1 (AT1G66180.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 518.5 bits (1334), Expect = 3.9e-147
Identity = 276/434 (63.59%), Positives = 318/434 (73.27%), Query Frame = 1

Query: 2   LLSLFYISLLSLSFSHSHSLPL-SFPLSLS-------------KRPSPVSPLFSSLSSYG 61
           L   F+++ +SLS S S  LPL S P+S +             K PSP SP ++  S   
Sbjct: 8   LFFFFFLNYVSLSTSLSLHLPLTSLPISTTTNSHRFTTSLLSRKNPSPSSPPYNFRSR-- 67

Query: 62  SVKLPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSL 121
                FKYS AL++SLPIGTPPQ   +VLDTGSQLSWIQCHRK  K   KPKT SFDPSL
Sbjct: 68  -----FKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRK--KLPPKPKT-SFDPSL 127

Query: 122 SSSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSR 181
           SSSFS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EK TFSN+ 
Sbjct: 128 SSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE 187

Query: 182 TTPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFYLG 241
            TPP+ILGCA  S+++RGILGMN GRLSFVSQAKISKFSYC+P    R G   TG FYLG
Sbjct: 188 ITPPLILGCATESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTGSFYLG 247

Query: 242 DNPNSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQ 301
           DNPNS  FKY+SLLTFP+SQR PNLDPLAYT+P+ GI+    +LNIS +VF+PD  GSGQ
Sbjct: 248 DNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQ 307

Query: 302 TMIDSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRD 361
           TM+DSGS+ T+LVD AY+KV+ EI+  VG  +KKGY Y   ADMCF DG  A + R I D
Sbjct: 308 TMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCF-DGNVAMIPRLIGD 367

Query: 362 MSFEFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLA 419
           + F F  GVEI V K E VL  V  G+ CVG GRS  LG ASNIIG VHQ+N WVE+D+ 
Sbjct: 368 LVFVFTRGVEILVPK-ERVLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVT 427

BLAST of CmaCh14G008700 vs. TAIR10
Match: AT2G39710.1 (AT2G39710.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 249.2 bits (635), Expect = 4.4e-66
Identity = 153/431 (35.50%), Positives = 227/431 (52.67%), Query Frame = 1

Query: 11  LSLSFSHSHSLPLSFPLSLSKRPSPVSPLFSSLSSY-----GSVKLPFKYSTALVVSLPI 70
           LS +F     L L FPL+  K  S    L  SL +       S KL F+++  L V+L +
Sbjct: 12  LSKNFLRISVLLLIFPLTFCKTSSTNQTLLFSLKTQKLPQSSSDKLSFRHNVTLTVTLAV 71

Query: 71  GTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTAS-FDPSLSSSFSLLPCNHPLCKPR 130
           G PPQ   +VLDTGS+LSW+ C +        P   S F+P  SS++S +PC+ P+C+ R
Sbjct: 72  GDPPQNISMVLDTGSELSWLHCKKS-------PNLGSVFNPVSSSTYSPVPCSSPICRTR 131

Query: 131 IPDFTLPTSCD-QNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC------- 190
             D  +P SCD +  LCH +  YAD T  EGNL  E F    S T P  + GC       
Sbjct: 132 TRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIG-SVTRPGTLFGCMDSGLSS 191

Query: 191 -AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSANFKYI 250
            ++   ++ G++GMN G LSFV+Q   SKFSYC+   +G DS+G   LGD    A++ ++
Sbjct: 192 NSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI---SGSDSSGFLLLGD----ASYSWL 251

Query: 251 SLLTFP----KSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGS 310
             + +     +S   P  D +AYT+ L+GI++    L++  +VF PD +G+GQTM+DSG+
Sbjct: 252 GPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGT 311

Query: 311 DLTYLVDEAYEKVKEEIVKLVGPLMK----KGYEYAAVADMCFNDGETAEVGRRIRDMSF 370
             T+L+   Y  +K E +     +++      + +    D+C+  G T         M  
Sbjct: 312 QFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVS 371

Query: 371 EFENGVEISVG------KGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEY 413
               G E+SV       +  G  +E ++ V C  FG S  LGI + +IG  HQ+N W+E+
Sbjct: 372 LMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEF 427

BLAST of CmaCh14G008700 vs. TAIR10
Match: AT5G02190.1 (AT5G02190.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 246.1 bits (627), Expect = 3.8e-65
Identity = 155/443 (34.99%), Positives = 233/443 (52.60%), Query Frame = 1

Query: 1   MLLSLFYISLLSLSFSHSHSLPLSFPLSLSKRPSPVSPLFSSLSSYG---SVKLPFKYST 60
           ++LS+     +S S S S S   S   S S   + V PL + ++      + KL F ++ 
Sbjct: 12  LVLSVRTYKCVSSSSSSSSSFSFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLHFHHNV 71

Query: 61  ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPCN 120
            L V+L +GTPPQ   +V+DTGS+LSW++C+R  +   +     +FDP+ SSS+S +PC+
Sbjct: 72  TLTVTLTVGTPPQNISMVIDTGSELSWLRCNRSSNPNPVN----NFDPTRSSSYSPIPCS 131

Query: 121 HPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGC- 180
            P C+ R  DF +P SCD ++LCH +  YAD + +EGNL  E F F NS     +I GC 
Sbjct: 132 SPTCRTRTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCM 191

Query: 181 -------AQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNS 240
                   +  T+  G+LGMN G LSF+SQ    KFSYC+ G    D  G   LGD    
Sbjct: 192 GSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGT--DDFPGFLLLGD---- 251

Query: 241 ANFKYISLLTFPK----SQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQT 300
           +NF +++ L +      S   P  D +AYT+ L GIK+ G  L I  +V  PD +G+GQT
Sbjct: 252 SNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQT 311

Query: 301 MIDSGSDLTYLVDEAYEKVKEEIVKLVGPLM----KKGYEYAAVADMCFNDGET---AEV 360
           M+DSG+  T+L+   Y  ++   +     ++       + +    D+C+        + +
Sbjct: 312 MVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGI 371

Query: 361 GRRIRDMSFEFENGVEISVGKGEGVLTEV------EKGVKCVGFGRSGRLGIASNIIGTV 416
             R+  +S  FE G EI+V  G+ +L  V         V C  FG S  +G+ + +IG  
Sbjct: 372 LHRLPTVSLVFE-GAEIAV-SGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHH 431

BLAST of CmaCh14G008700 vs. TAIR10
Match: AT3G18490.1 (AT3G18490.1 Eukaryotic aspartyl protease family protein)

HSP 1 Score: 153.7 bits (387), Expect = 2.5e-37
Identity = 106/359 (29.53%), Positives = 166/359 (46.24%), Query Frame = 1

Query: 65  IGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPCNHPLCKPR 124
           +GTP +   LVLDTGS ++WIQC         +     F+P+ SS++  L C+ P C   
Sbjct: 168 VGTPAKEMYLVLDTGSDVNWIQCEPCAD--CYQQSDPVFNPTSSSTYKSLTCSAPQCS-- 227

Query: 125 IPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQAS---- 184
                L TS  ++  C Y   Y DG+   G L  +  TF NS     V LGC   +    
Sbjct: 228 ----LLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLF 287

Query: 185 TENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFY----LGDNPNSANFKYIS 244
           T   G+LG+  G LS  +Q K + FSYC+  R    S+ L +    LG    +A      
Sbjct: 288 TGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLL--- 347

Query: 245 LLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYL 304
                   R+  +D   Y + L G  + G ++ +  A+F  D SGSG  ++D G+ +T L
Sbjct: 348 --------RNKKIDTFYY-VGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRL 407

Query: 305 VDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEIS 364
             +AY  +++  +KL   L KKG    ++ D C++    + V  ++  ++F F  G  + 
Sbjct: 408 QTQAYNSLRDAFLKLTVNL-KKGSSSISLFDTCYDFSSLSTV--KVPTVAFHFTGGKSLD 467

Query: 365 VGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGFGGADC 416
           +     ++   + G  C  F  +     + +IIG V Q+ T + YDL+   IG  G  C
Sbjct: 468 LPAKNYLIPVDDSGTFCFAFAPTSS---SLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500

BLAST of CmaCh14G008700 vs. NCBI nr
Match: gi|778679910|ref|XP_011651212.1| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 691.8 bits (1784), Expect = 7.4e-196
Identity = 348/431 (80.74%), Positives = 379/431 (87.94%), Query Frame = 1

Query: 1   MLLSLFYISLLSLSFSHSHSLPLSFPLSLSKRPSPVSPLFSSL-------SSYGSVKLPF 60
           MLL LF +SL +LSFS S+SL L FPLSL+++PS ++PL+ S        SS+G  KLPF
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLTEKPSNITPLYYSSQLYVKKPSSHGPFKLPF 60

Query: 61  KYST-ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLL----KPKTASFDPSLS 120
           KYS+ ALVVSLPIGTPPQPTDLVLDTGSQLSWIQCH K  KK L    KPKTASFDPSLS
Sbjct: 61  KYSSSALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLS 120

Query: 121 SSFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRT 180
           SSFSLLPCNHP+CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNS +
Sbjct: 121 SSFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLS 180

Query: 181 TPPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPN 240
           TPPVILGCAQ STENRGILGMN GRLSF+SQAKISKFSYCVP RTG + TGLFYLGDNPN
Sbjct: 181 TPPVILGCAQGSTENRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPN 240

Query: 241 SANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMID 300
           S+ FKY+++LTFP+SQ SPNLDPLAYTLP+K IKIAG RLNI  A FKPD  GSGQTMID
Sbjct: 241 SSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMID 300

Query: 301 SGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFE 360
           SGSDLTYLVDEAYEKVKEE+V+LVG +MKKGY YAAVADMCF+ G T EVGRRI DMSFE
Sbjct: 301 SGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAAVADMCFDAGVTVEVGRRIGDMSFE 360

Query: 361 FENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRI 420
           F+NGVEI VG+GEGVLTEVEKGVKCVG GRSGRLGI SNIIGTVHQ+N WVEYDLAN+R+
Sbjct: 361 FDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSGRLGIGSNIIGTVHQQNMWVEYDLANKRV 420
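
The percentages in an HSP summary are the matching columns divided by the alignment length (for the hit above, 348 of 431 aligned columns). A minimal sketch of reading those two summary lines and recomputing the percentages, assuming the plain-text layout shown on this page; `parse_hsp` and both regular expressions are hypothetical helpers, not part of any BLAST tool:

```python
import re

# Hypothetical patterns for the two-line HSP summary as it appears on this page.
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*Expect\s*=\s*(?P<evalue>\S+)"
)
# "Posi?tives" also tolerates the "Postives" spelling found in some exports.
FRACTION_RE = re.compile(
    r"(?P<label>Identity|Posi?tives)\s*=\s*(?P<num>\d+)/(?P<den>\d+)"
)


def parse_hsp(text):
    """Return bit score, raw score, E-value and recomputed percentages."""
    m = SCORE_RE.search(text)
    if m is None:
        raise ValueError("no HSP score line found")
    result = {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
    }
    for f in FRACTION_RE.finditer(text):
        key = "identity" if f["label"] == "Identity" else "positives"
        # Percentage = matching columns / alignment length, two decimals.
        result[key + "_pct"] = round(100.0 * int(f["num"]) / int(f["den"]), 2)
    return result


if __name__ == "__main__":
    summary = (
        "HSP 1 Score: 691.8 bits (1784), Expect = 7.4e-196\n"
        "Identity = 348/431 (80.74%), Positives = 379/431 (87.94%), Query Frame = 1"
    )
    print(parse_hsp(summary))
```

The recomputed values can be compared against the percentages printed in parentheses to catch a damaged or truncated summary line.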

BLAST of CmaCh14G008700 vs. NCBI nr
Match: gi|659114575|ref|XP_008457122.1| (PREDICTED: aspartic proteinase PCS1 [Cucumis melo])

HSP 1 Score: 691.0 bits (1782), Expect = 1.3e-195
Identity = 349/429 (81.35%), Positives = 379/429 (88.34%), Query Frame = 1

Query: 1   MLLSLFYISLLSLSFSHSHSLPLSFPLSLSKRPSPVSPLFSSL------SSYGSVKLPFK 60
           MLL LF +SL +L FS S+S+ L FPLSLS++PS +SP++ S       SS+GS KLPFK
Sbjct: 1   MLLILFSLSLFTLPFSQSNSVSLPFPLSLSEKPSNISPIYGSQLYAKKPSSHGSFKLPFK 60

Query: 61  YS-TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLL---KPKTASFDPSLSSS 120
           YS TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCH K+ KKL    KPKTASFDPSLSSS
Sbjct: 61  YSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKVKKKLPPLPKPKTASFDPSLSSS 120

Query: 121 FSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTP 180
           FSLLPCNHP+CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKF+ SNS +TP
Sbjct: 121 FSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFSLSNSLSTP 180

Query: 181 PVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNSA 240
           PVILGCAQASTENRGILGMN GRLSF+SQAKISKFSYCVP RTG + TGLFYLGDNPNS+
Sbjct: 181 PVILGCAQASTENRGILGMNKGRLSFISQAKISKFSYCVPARTGSNPTGLFYLGDNPNSS 240

Query: 241 NFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSG 300
            FKY+++LTFP+SQ SPNLDPLAYTLP+KGIKIAG RLNIS A FKPD  GSGQTMIDSG
Sbjct: 241 RFKYVTMLTFPESQSSPNLDPLAYTLPMKGIKIAGKRLNISPAAFKPDAGGSGQTMIDSG 300

Query: 301 SDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFE 360
           SDLTYLVDEAYEKVKEE+V+LVG  MKKGY YAAVADMCF+   TAEVGRRI  +SFEF+
Sbjct: 301 SDLTYLVDEAYEKVKEEVVRLVGAKMKKGYVYAAVADMCFDARVTAEVGRRIGGISFEFD 360

Query: 361 NGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGF 420
           NGVEI VG+GEGVLTEVEKGVKCVGFGRS RLGI SNIIGTVHQ+N WVEYDL NRRIGF
Sbjct: 361 NGVEILVGRGEGVLTEVEKGVKCVGFGRSERLGIGSNIIGTVHQQNMWVEYDLTNRRIGF 420

BLAST of CmaCh14G008700 vs. NCBI nr
Match: gi|778679913|ref|XP_004140731.2| (PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus])

HSP 1 Score: 688.0 bits (1774), Expect = 1.1e-194
Identity = 349/430 (81.16%), Positives = 376/430 (87.44%), Query Frame = 1

Query: 1   MLLSLFYISLLSLSFSHSHSLPLSFPLSLSKRPSPVSPLFSSL------SSYGSVKLPFK 60
           MLL LF +SL +LSFS S+SL L FPLSLS++PS   P +SS       SSYGS KLPFK
Sbjct: 1   MLLILFSLSLFTLSFSQSNSLSLPFPLSLSEKPSNTIPSYSSQLYAKRPSSYGSFKLPFK 60

Query: 61  YS-TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLL----KPKTASFDPSLSS 120
           YS TALVVSLPIGTPPQPTDLVLDTGSQLSWIQCH K  KK L    KPKTASFDPSLSS
Sbjct: 61  YSSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSS 120

Query: 121 SFSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTT 180
           SFSLLPCNHP+CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFS S +T
Sbjct: 121 SFSLLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLST 180

Query: 181 PPVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVPGRTGPDSTGLFYLGDNPNS 240
           PPVILGCAQASTENRGILGMN GRLSF+SQAKISKFSYCVP RTG + TGLFYLGDNPNS
Sbjct: 181 PPVILGCAQASTENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNS 240

Query: 241 ANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDS 300
           + FKY+++LTFP+SQ SPNLDPLAYTLP+K IKIAG RLNI  A FKPD  GSGQTMIDS
Sbjct: 241 SKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDS 300

Query: 301 GSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEF 360
           GSDLTYLVDEAYEKVKEE+V+LVG +MKKGY YA VADMCF+ G TAEVGRRI  +SFEF
Sbjct: 301 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFEF 360

Query: 361 ENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIG 420
           +NGVEI VG+GEGVLTEVEKGVKCVG GRS RLGI SNIIGTVHQ+N WVEYDLAN+R+G
Sbjct: 361 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVG 420

BLAST of CmaCh14G008700 vs. NCBI nr
Match: gi|734420860|gb|KHN41056.1| (Aspartic proteinase nepenthesin-1 [Glycine soja])

HSP 1 Score: 540.4 bits (1391), Expect = 2.7e-150
Identity = 276/411 (67.15%), Positives = 325/411 (79.08%), Query Frame = 1

Query: 11  LSLSFSHSHSLPLSFPLSLSKRPSPVSPLFSSLSSYGSVKLPFKYSTALVVSLPIGTPPQ 70
           LSLSF  + SLPLS    L++ P+ +  L SS SSY ++K  FKYS ALVV+LPIGTPPQ
Sbjct: 27  LSLSFPLT-SLPLSTAKPLNRNPN-LRTLSSSSSSY-NIKSSFKYSMALVVTLPIGTPPQ 86

Query: 71  PTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSSFSLLPCNHPLCKPRIPDFTL 130
              +VLDTGSQLSWIQCH K       P TASFDPSLSSSF +LPC HPLCKPR+PDFTL
Sbjct: 87  HQQMVLDTGSQLSWIQCHNKT------PPTASFDPSLSSSFYILPCTHPLCKPRVPDFTL 146

Query: 131 PTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTPPVILGCAQASTENRGILGMN 190
           PT+CDQNRLCHYSYFYADGT AEGNLVREK TFS S+TTPP+ILGCA  S++ RGILGMN
Sbjct: 147 PTTCDQNRLCHYSYFYADGTYAEGNLVREKLTFSPSQTTPPLILGCATESSDARGILGMN 206

Query: 191 TGRLSFVSQAKISKFSYCVPGRTGPDS----TGLFYLGDNPNSANFKYISLLTFPKSQRS 250
            GRLSF SQAK++KFSYCVP R   +     TG FYLG+NPNSA F+Y+S+LTFP+SQR 
Sbjct: 207 LGRLSFPSQAKVTKFSYCVPTRQAANDNNLPTGSFYLGNNPNSARFRYVSMLTFPQSQRM 266

Query: 251 PNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMIDSGSDLTYLVDEAYEKVKE 310
           PNLDPLAYT+P++GI+I G +LNI  +VF+P+  GSGQTM+DSGS+ T+LVD AY+ V+E
Sbjct: 267 PNLDPLAYTVPMQGIRIGGKKLNIPPSVFRPNAGGSGQTMVDSGSEFTFLVDAAYDAVRE 326

Query: 311 EIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSFEFENGVEISVGKGEGVLTE 370
           E++++VGP +KKGY Y  VADMCFN G   E+GR I D++FEFE GVEI V K E VL +
Sbjct: 327 EVIRVVGPRVKKGYVYGGVADMCFN-GSAMEIGRLIGDVAFEFEKGVEIVVPK-ERVLAD 386

Query: 371 VEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRRIGFGGADCSR 418
           V  GV C+G GRS RLG ASNIIG  HQ+N WVE+DLANRRIGFG ADCSR
Sbjct: 387 VGGGVHCLGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVADCSR 426

BLAST of CmaCh14G008700 vs. NCBI nr
Match: gi|18421660|ref|NP_568551.1| (aspartyl protease family protein [Arabidopsis thaliana])

HSP 1 Score: 540.0 bits (1390), Expect = 3.6e-150
Identity = 280/431 (64.97%), Positives = 324/431 (75.17%), Query Frame = 1

Query: 2   LLSLFYISLLSLSFSHSHSLPLSFPL-SLSKRPSPVSPLF----------SSLSSYGSVK 61
           LL +F+    S+S S S SL L FPL SL   P+  S  F          S  SS  + +
Sbjct: 12  LLYIFFFFCYSVSLSWSSSLSLHFPLTSLRLTPTTNSSSFKTSLLSRRNPSPPSSPYTFR 71

Query: 62  LPFKYSTALVVSLPIGTPPQPTDLVLDTGSQLSWIQCHRKLHKKLLKPKTASFDPSLSSS 121
              KYS AL++SLPIGTP Q  +LVLDTGSQLSWIQCH K  KK L P T SFDPSLSSS
Sbjct: 72  SNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSS 131

Query: 122 FSLLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSRTTP 181
           FS LPC+HPLCKPRIPDFTLPTSCD NRLCHYSYFYADGT AEGNLV+EKFTFSNS+TTP
Sbjct: 132 FSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTP 191

Query: 182 PVILGCAQASTENRGILGMNTGRLSFVSQAKISKFSYCVP---GRTGPDSTGLFYLGDNP 241
           P+ILGCA+ ST+ +GILGMN GRLSF+SQAKISKFSYC+P    R G  STG FYLGDNP
Sbjct: 192 PLILGCAKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGDNP 251

Query: 242 NSANFKYISLLTFPKSQRSPNLDPLAYTLPLKGIKIAGNRLNISSAVFKPDRSGSGQTMI 301
           NS  FKY+SLLTFP+SQR PNLDPLAYT+PL+GI+I   RLNI  +VF+PD  GSGQTM+
Sbjct: 252 NSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMV 311

Query: 302 DSGSDLTYLVDEAYEKVKEEIVKLVGPLMKKGYEYAAVADMCFNDGETAEVGRRIRDMSF 361
           DSGS+ T+LVD AY+KVKEEIV+LVG  +KKGY Y + ADMCF+   + E+GR I D+ F
Sbjct: 312 DSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVF 371

Query: 362 EFENGVEISVGKGEGVLTEVEKGVKCVGFGRSGRLGIASNIIGTVHQKNTWVEYDLANRR 419
           EF  GVEI V K + +L  V  G+ CVG GRS  LG ASNIIG VHQ+N WVE+D+ NRR
Sbjct: 372 EFGRGVEILVEK-QSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRR 431

The following BLAST results are available for this feature:
Match Name   E-value  Identity (%)  Description
PCS1L_ARATH  6.7e-64  34.99         Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1
NEP2_NEPGR   7.9e-41  31.87         Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2 PE=1 SV=1
NEP1_NEPGR   2.4e-37  30.30         Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1 PE=1 SV=1
ASPG1_ARATH  4.5e-36  29.53         Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana GN=ASPG1 PE=1 ...
APF2_ARATH   1.2e-31  29.01         Aspartyl protease family protein 2 OS=Arabidopsis thaliana GN=APF2 PE=2 SV=1

Match Name        E-value   Identity (%)  Description
A0A0B2S2W1_GLYSO  1.9e-150  67.15         Aspartic proteinase nepenthesin-1 OS=Glycine soja GN=glysoja_013398 PE=3 SV=1
Q9FGI3_ARATH      2.5e-150  64.97         AT5g37540/mpa22_p_70 OS=Arabidopsis thaliana GN=At5g37540 PE=2 SV=1
D7MID8_ARALL      4.3e-150  65.28         Aspartyl protease family protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRA...
V4LXR6_EUTSA      4.3e-150  67.65         Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10028350mg PE=3 SV=1
V7BBS3_PHAVU      6.1e-149  65.41         Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_008G237800g PE=3 SV=1

Match Name   E-value   Identity (%)  Description
AT5G37540.1  1.3e-153  64.97         Eukaryotic aspartyl protease family protein
AT1G66180.1  3.9e-147  63.59         Eukaryotic aspartyl protease family protein
AT2G39710.1  4.4e-66   35.50         Eukaryotic aspartyl protease family protein
AT5G02190.1  3.8e-65   34.99         Eukaryotic aspartyl protease family protein
AT3G18490.1  2.5e-37   29.53         Eukaryotic aspartyl protease family protein

Match Name                        E-value   Identity (%)  Description
gi|778679910|ref|XP_011651212.1|  7.4e-196  80.74         PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus]
gi|659114575|ref|XP_008457122.1|  1.3e-195  81.35         PREDICTED: aspartic proteinase PCS1 [Cucumis melo]
gi|778679913|ref|XP_004140731.2|  1.1e-194  81.16         PREDICTED: aspartic proteinase PCS1-like [Cucumis sativus]
gi|734420860|gb|KHN41056.1|       2.7e-150  67.15         Aspartic proteinase nepenthesin-1 [Glycine soja]
gi|18421660|ref|NP_568551.1|      3.6e-150  64.97         aspartyl protease family protein [Arabidopsis thaliana]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001461  Aspartic_peptidase_A1
IPR001969  Aspartic_peptidase_AS
IPR021109  Peptidase_aspartic_dom_sf

Vocabulary: Molecular Function
Term        Definition
GO:0004190  aspartic-type endopeptidase activity

Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0016740 transferase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh14G008700.1  CmaCh14G008700.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                 Source   Source Term       Source Description               Alignment
IPR001461  Aspartic peptidase A1 family    PRINTS   PR00792           PEPSIN                           coord: 285..296 score: 2.4E-5; coord: 387..402 score: 2.4E-5; coord: 65..85 score: 2.
IPR001461  Aspartic peptidase A1 family    PANTHER  PTHR13683         ASPARTYL PROTEASES               coord: 5..418 score: 9.1E
IPR001969  Aspartic peptidase, active site PROSITE  PS00141           ASP_PROTEASE                     coord: 74..85 scor
IPR021109  Aspartic peptidase domain       GENE3D   G3DSA:2.40.70.10                                   coord: 60..225 score: 1.0E-31; coord: 228..417 score: 3.0
IPR021109  Aspartic peptidase domain       unknown  SSF50630          Acid proteases                   coord: 60..417 score: 6.99
None       No IPR available                PANTHER  PTHR13683:SF327   ASPARTYL PROTEASE FAMILY PROTEIN coord: 5..418 score: 9.1E
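
The `coord: start..end` entries in the Alignment column give the residue range of each domain match on the protein. A minimal sketch of pulling those ranges out of the text and computing their lengths, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page); `coord_spans` and `span_length` are hypothetical helper names. The score values are deliberately ignored because several of them are cut off in this record.

```python
import re

# Matches entries such as "coord: 285..296" anywhere in a line of text.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def coord_spans(text):
    """Return every (start, end) residue range found in the text, in order."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]


def span_length(span):
    """Length of a range, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


if __name__ == "__main__":
    alignment = "coord: 60..225 score: 1.0E-31; coord: 228..417"
    for span in coord_spans(alignment):
        print(span, span_length(span))
```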