MC04g0861 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC04g0861
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Eukaryotic aspartyl protease family protein
Location: MC04: 14153939 .. 14155222 (-)
RNA-Seq Expression: MC04g0861
Synteny: MC04g0861
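
The Location field gives the span of the feature on chromosome MC04. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome annotations, not stated on this page), the feature length and the number of whole codons it could hold can be worked out as follows. The function name feature_length is hypothetical and used only for illustration.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


# Coordinates taken from the Location field: MC04: 14153939 .. 14155222 (-)
length_bp = feature_length(14153939, 14155222)

# Number of complete codons that fit in the span, if the whole span were coding.
codons = length_bp // 3

print(length_bp, codons)
```

Under that assumption the span is 1284 bp, which corresponds to 428 codons; this is consistent with the query coordinates in the alignments further down, which end at position 428.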
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GCCGCCACGCAGTTCCTGAAGCTCCCTCTGCTTCACAGCAATCCATTCTCCTCCCCTTCTCACGCCCTCGCCTTCGACGCCCACCGTCTCTCCATTCTCTTCTCCTCCCGCAACCGCAACCGCAATCGCAATGCCGTCAAATCCCCTCTCATTTCTGGCGCCTCCACCGGCTCCGGCCAGTACTTTGTCGATCTCCGCCTCGGCACCCCTCCCCAGAGCCTCCTCCTCGTTGCCGATACCGGCAGCGACCTGGTCTGGGTCAAGTGCTCCCCCTGCAGAAACTGCTCCCACCACGCGCCTTCCTCCGCCTTCCTCCCCCGCCACTCCTCCTCCTTCTCCCCCCACCACTGCTTCGACCCGGCCTGCCGCCTCGTCCCCCACGCGCGCCTCTGCAACCACACGCGCCTCCACTCCCCCTGCCCCTACCACTACTCCTACGCCGACGGCTCCCTCTCCGCCGGCTTCTTCTCCAAGGATACCACTACCTTGAAGACCCTCTCCGGCGCCGAGGCCCGCCTCCCGAACCTCTCCTTCGGCTGCGGCTTTCGGATCTCCGGGCCGAGCGTCTCCGGCTCCACATTCAATGGAGCACGTGGCGTCATGGGATTGGGCAGAGGCCCCATTTCCTTCTCCTCCCAACTCGGCCGTCGATTTGGGAACAAGTTTTCGTATTGCCTTATGGACTACACGTTGTCTCCGCCGCCCACCAGCTACCTCATGATCGGCGCCCGCCGTAGCCCCGCCGTCTCCAACGCCTCCAGAATCAGCTACACCCCGCTCCAGATTAACCCCCTCTCCCCCACATTCTACTATATTCGAGTCAAAAGCATCGCCGTCGACGGCGTCACTTTGCCGATTAACCCTGCCGTCTGGGCCATCGACGAGCTCGGCAACGGCGGGACCGTCGTCGACTCCGGCACGACTCTCACCTTCCTGGCGGAGCCGGCTTACGATCAGGTGCTGGCGGCGTTTAGGCGGCAAGTGAAGCTGCCGAGGCCGGCCGAGTTGACTCCGGGGTTCGATCTGTGCTTGAACGCGTCGGGCGAGTCGAGGCCGAGTCTTCCGAGGCTGAGTTTCCGGCTAGCCGGAGGGGCCGTGTTGTCGCCGCCGCCGAGGAACTACTTTCTGGAGACGGAGGAGCGGGTCATGTGCTTGGCGATCCGACCCGTGGACTCGGGAAACGGGTTTTCGGTAATCGGGAATCTGATGCAACAAGGATTCTTGTTGGAGTTCGACAAGGACGCGTCACGGCTCGGGTTTTCAAGGCGTGGCTGCGGCCTTCCT

mRNA sequence

GCCGCCACGCAGTTCCTGAAGCTCCCTCTGCTTCACAGCAATCCATTCTCCTCCCCTTCTCACGCCCTCGCCTTCGACGCCCACCGTCTCTCCATTCTCTTCTCCTCCCGCAACCGCAACCGCAATCGCAATGCCGTCAAATCCCCTCTCATTTCTGGCGCCTCCACCGGCTCCGGCCAGTACTTTGTCGATCTCCGCCTCGGCACCCCTCCCCAGAGCCTCCTCCTCGTTGCCGATACCGGCAGCGACCTGGTCTGGGTCAAGTGCTCCCCCTGCAGAAACTGCTCCCACCACGCGCCTTCCTCCGCCTTCCTCCCCCGCCACTCCTCCTCCTTCTCCCCCCACCACTGCTTCGACCCGGCCTGCCGCCTCGTCCCCCACGCGCGCCTCTGCAACCACACGCGCCTCCACTCCCCCTGCCCCTACCACTACTCCTACGCCGACGGCTCCCTCTCCGCCGGCTTCTTCTCCAAGGATACCACTACCTTGAAGACCCTCTCCGGCGCCGAGGCCCGCCTCCCGAACCTCTCCTTCGGCTGCGGCTTTCGGATCTCCGGGCCGAGCGTCTCCGGCTCCACATTCAATGGAGCACGTGGCGTCATGGGATTGGGCAGAGGCCCCATTTCCTTCTCCTCCCAACTCGGCCGTCGATTTGGGAACAAGTTTTCGTATTGCCTTATGGACTACACGTTGTCTCCGCCGCCCACCAGCTACCTCATGATCGGCGCCCGCCGTAGCCCCGCCGTCTCCAACGCCTCCAGAATCAGCTACACCCCGCTCCAGATTAACCCCCTCTCCCCCACATTCTACTATATTCGAGTCAAAAGCATCGCCGTCGACGGCGTCACTTTGCCGATTAACCCTGCCGTCTGGGCCATCGACGAGCTCGGCAACGGCGGGACCGTCGTCGACTCCGGCACGACTCTCACCTTCCTGGCGGAGCCGGCTTACGATCAGGTGCTGGCGGCGTTTAGGCGGCAAGTGAAGCTGCCGAGGCCGGCCGAGTTGACTCCGGGGTTCGATCTGTGCTTGAACGCGTCGGGCGAGTCGAGGCCGAGTCTTCCGAGGCTGAGTTTCCGGCTAGCCGGAGGGGCCGTGTTGTCGCCGCCGCCGAGGAACTACTTTCTGGAGACGGAGGAGCGGGTCATGTGCTTGGCGATCCGACCCGTGGACTCGGGAAACGGGTTTTCGGTAATCGGGAATCTGATGCAACAAGGATTCTTGTTGGAGTTCGACAAGGACGCGTCACGGCTCGGGTTTTCAAGGCGTGGCTGCGGCCTTCCT

Coding sequence (CDS)

GCCGCCACGCAGTTCCTGAAGCTCCCTCTGCTTCACAGCAATCCATTCTCCTCCCCTTCTCACGCCCTCGCCTTCGACGCCCACCGTCTCTCCATTCTCTTCTCCTCCCGCAACCGCAACCGCAATCGCAATGCCGTCAAATCCCCTCTCATTTCTGGCGCCTCCACCGGCTCCGGCCAGTACTTTGTCGATCTCCGCCTCGGCACCCCTCCCCAGAGCCTCCTCCTCGTTGCCGATACCGGCAGCGACCTGGTCTGGGTCAAGTGCTCCCCCTGCAGAAACTGCTCCCACCACGCGCCTTCCTCCGCCTTCCTCCCCCGCCACTCCTCCTCCTTCTCCCCCCACCACTGCTTCGACCCGGCCTGCCGCCTCGTCCCCCACGCGCGCCTCTGCAACCACACGCGCCTCCACTCCCCCTGCCCCTACCACTACTCCTACGCCGACGGCTCCCTCTCCGCCGGCTTCTTCTCCAAGGATACCACTACCTTGAAGACCCTCTCCGGCGCCGAGGCCCGCCTCCCGAACCTCTCCTTCGGCTGCGGCTTTCGGATCTCCGGGCCGAGCGTCTCCGGCTCCACATTCAATGGAGCACGTGGCGTCATGGGATTGGGCAGAGGCCCCATTTCCTTCTCCTCCCAACTCGGCCGTCGATTTGGGAACAAGTTTTCGTATTGCCTTATGGACTACACGTTGTCTCCGCCGCCCACCAGCTACCTCATGATCGGCGCCCGCCGTAGCCCCGCCGTCTCCAACGCCTCCAGAATCAGCTACACCCCGCTCCAGATTAACCCCCTCTCCCCCACATTCTACTATATTCGAGTCAAAAGCATCGCCGTCGACGGCGTCACTTTGCCGATTAACCCTGCCGTCTGGGCCATCGACGAGCTCGGCAACGGCGGGACCGTCGTCGACTCCGGCACGACTCTCACCTTCCTGGCGGAGCCGGCTTACGATCAGGTGCTGGCGGCGTTTAGGCGGCAAGTGAAGCTGCCGAGGCCGGCCGAGTTGACTCCGGGGTTCGATCTGTGCTTGAACGCGTCGGGCGAGTCGAGGCCGAGTCTTCCGAGGCTGAGTTTCCGGCTAGCCGGAGGGGCCGTGTTGTCGCCGCCGCCGAGGAACTACTTTCTGGAGACGGAGGAGCGGGTCATGTGCTTGGCGATCCGACCCGTGGACTCGGGAAACGGGTTTTCGGTAATCGGGAATCTGATGCAACAAGGATTCTTGTTGGAGTTCGACAAGGACGCGTCACGGCTCGGGTTTTCAAGGCGTGGCTGCGGCCTTCCT

Protein sequence

AATQFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQYFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDPACRLVPHARLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSFGCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGARRSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDELGNGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGESRPSLPRLSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDASRLGFSRRGCGLP
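
The protein sequence above corresponds to a translation of the coding sequence. The following is a minimal sketch of that step, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the helper name translate is hypothetical, and only the first 24 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 24 nucleotides of the coding sequence listed above.
cds_prefix = "GCCGCCACGCAGTTCCTGAAGCTC"
print(translate(cds_prefix))  # AATQFLKL
```

The output matches the first eight residues of the protein sequence listed above. For real work an established library would normally be preferable to a hand-built table.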
Homology
BLAST of MC04g0861 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 241.1 bits (614), Expect = 2.3e-62
Identity = 154/418 (36.84%), Positives = 223/418 (53.35%), Query Frame = 0

Query: 20  SHALAFDAHRLSILFSSRNRNRNRNAVKSP--------LISGASTGSGQYFVDLRLGTPP 79
           S  L  D+ R+  + +   +   RN   +P        ++SG S GSG+YF  L +GTP 
Sbjct: 93  SSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPA 152

Query: 80  QSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDPACRLVPHARLC 139
           + + +V DTGSD+VW++C+PCR C +      F PR S +++   C  P CR +  A  C
Sbjct: 153 RYVYMVLDTGSDIVWLQCAPCRRC-YSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAG-C 212

Query: 140 NHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSFGCGFRISGPSVSG 199
           N  R    C Y  SY DGS + G FS +T T +       R+  ++ GCG    G     
Sbjct: 213 NTRR--KTCLYQVSYGDGSFTVGDFSTETLTFR-----RNRVKGVALGCGHDNEG----- 272

Query: 200 STFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGARRSPAVSN 259
             F GA G++GLG+G +SF  Q G RF  KFSYCL+D + S  P+S +   A    AVS 
Sbjct: 273 -LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNA----AVSR 332

Query: 260 ASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLP-INPAVWAIDELGNGGTVVDSGTTLT 319
            +R  +TPL  NP   TFYY+ +  I+V G  +P +  +++ +D++GNGG ++DSGT++T
Sbjct: 333 IAR--FTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVT 392

Query: 320 FLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGESRPSLPR--LSFRLAGGAVLS 379
            L  PAY  +  AFR   K  + A     FD C + S  +   +P   L FR   GA +S
Sbjct: 393 RLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR---GADVS 452

Query: 380 PPPRNYFLETEER-VMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDASRLGFSRRGC 426
            P  NY +  +     C A     +  G S+IGN+ QQGF + +D  +SR+GF+  GC
Sbjct: 453 LPATNYLIPVDTNGKFCFAF--AGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of MC04g0861 vs. ExPASy Swiss-Prot
Match: Q9LTW4 (Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE=1 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 1.2e-55
Identity = 159/434 (36.64%), Positives = 219/434 (50.46%), Query Frame = 0

Query: 6   LKLPLLHSN-----PFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQ 65
           ++L L H +     P S     +  D  R S++  SR RN +   VK  L SG   G+ Q
Sbjct: 49  VRLKLAHRDTLLPKPLSRIEDVIGADQKRHSLI--SRKRN-STVGVKMDLGSGIDYGTAQ 108

Query: 66  YFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSS----AFLPRHSSSFSPHH 125
           YF ++R+GTP +   +V DTGS+L WV      NC + A        F    S SF    
Sbjct: 109 YFTEIRVGTPAKKFRVVVDTGSELTWV------NCRYRARGKDNRRVFRADESKSFKTVG 168

Query: 126 CFDPACR--LVPHARLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLP 185
           C    C+  L+    L       +PC Y Y YADGS + G F+K+T T+   +G  ARLP
Sbjct: 169 CLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMARLP 228

Query: 186 NLSFGCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPP 245
               GC       S +G +F GA GV+GL     SF+S     +G KFSYCL+D+  +  
Sbjct: 229 GHLIGC-----SSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKN 288

Query: 246 PTSYLMIGARRSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAID 305
            ++YL+ G+ RS     A R + TPL +  + P FY I V  I++    L I   VW  D
Sbjct: 289 VSNYLIFGSSRS--TKTAFRRT-TPLDLTRI-PPFYAINVIGISLGYDMLDIPSQVW--D 348

Query: 306 ELGNGGTVVDSGTTLTFLAEPAYDQVLAAFRRQ-VKLPRPAELTPGFDLCLN-ASGESRP 365
               GGT++DSGT+LT LA+ AY QV+    R  V+L R        + C +  SG +  
Sbjct: 349 ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVS 408

Query: 366 SLPRLSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGN-GFSVIGNLMQQGFLLEF 425
            LP+L+F L GGA   P  ++Y ++    V CL    V +G    +VIGN+MQQ +L EF
Sbjct: 409 KLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGF--VSAGTPATNVIGNIMQQNYLWEF 460

BLAST of MC04g0861 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 210.3 bits (534), Expect = 4.3e-53
Identity = 140/392 (35.71%), Positives = 203/392 (51.79%), Query Frame = 0

Query: 39  RNRNRNAV---KSPLISGASTGSGQYFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNC 98
           R R+ NA+    S + +    G G+Y +++ +GTP  S   + DTGSDL+W +C PC  C
Sbjct: 71  RMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQC 130

Query: 99  SHHAPSSAFLPRHSSSFSPHHCFDPACRLVPHARLCNHTRLHSPCPYHYSYADGSLSAGF 158
               P+  F P+ SSSFS   C    C+ +P +  CN    ++ C Y Y Y DGS + G+
Sbjct: 131 -FSQPTPIFNPQDSSSFSTLPCESQYCQDLP-SETCN----NNECQYTYGYGDGSTTQGY 190

Query: 159 FSKDTTTLKTLSGAEARLPNLSFGCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLG 218
            + +T T +T S     +PN++FGCG    G        NGA G++G+G GP+S  SQLG
Sbjct: 191 MATETFTFETSS-----VPNIAFGCGEDNQGFGQG----NGA-GLIGMGWGPLSLPSQLG 250

Query: 219 RRFGNKFSYCLMDYTLSPPPTSYLMIGARRSPAVSNASRISYTPLQINPLSPTFYYIRVK 278
                +FSYC+  Y  S P T  L   A   P  S +     T L  + L+PT+YYI ++
Sbjct: 251 ---VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPS-----TTLIHSSLNPTYYYITLQ 310

Query: 279 SIAVDGVTLPINPAVWAIDELGNGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAE 338
            I V G  L I  + + + + G GG ++DSGTTLT+L + AY+ V  AF  Q+ LP   E
Sbjct: 311 GITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDE 370

Query: 339 LTPGFDLCL-NASGESRPSLPRLSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGN 398
            + G   C    S  S   +P +S +  GG VL+   +N  +   E V+CLA+    S  
Sbjct: 371 SSSGLSTCFQQPSDGSTVQVPEISMQFDGG-VLNLGEQNILISPAEGVICLAMGS-SSQL 430

Query: 399 GFSVIGNLMQQGFLLEFDKDASRLGFSRRGCG 427
           G S+ GN+ QQ   + +D     + F    CG
Sbjct: 431 GISIFGNIQQQETQVLYDLQNLAVSFVPTQCG 436

BLAST of MC04g0861 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 209.5 bits (532), Expect = 7.3e-53
Identity = 132/384 (34.38%), Positives = 196/384 (51.04%), Query Frame = 0

Query: 44  NAVKSPLISGASTGSGQYFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSA 103
           N   S ++SG   GSG+YFV + +G+PP+   +V D+GSD+VWV+C PC+ C +      
Sbjct: 114 NDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLC-YKQSDPV 173

Query: 104 FLPRHSSSFSPHHCFDPACRLVPHARLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTL 163
           F P  S S++   C    C      R+ N       C Y   Y DGS     ++K T  L
Sbjct: 174 FDPAKSGSYTGVSCGSSVC-----DRIENSGCHSGGCRYEVMYGDGS-----YTKGTLAL 233

Query: 164 KTLSGAEARLPNLSFGCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFS 223
           +TL+ A+  + N++ GCG R  G       F GA G++G+G G +SF  QL  + G  F 
Sbjct: 234 ETLTFAKTVVRNVAMGCGHRNRG------MFIGAAGLLGIGGGSMSFVGQLSGQTGGAFG 293

Query: 224 YCLMDYTLSPPPTSYLMIGARRSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVT 283
           YCL+  +     T  L+ G    P  +     S+ PL  NP +P+FYY+ +K + V GV 
Sbjct: 294 YCLV--SRGTDSTGSLVFGREALPVGA-----SWVPLVRNPRAPSFYYVGLKGLGVGGVR 353

Query: 284 LPINPAVWAIDELGNGGTVVDSGTTLTFLAEPAYDQVLAAFRRQ-VKLPRPAELTPGFDL 343
           +P+   V+ + E G+GG V+D+GT +T L   AY      F+ Q   LPR + ++  FD 
Sbjct: 354 IPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSI-FDT 413

Query: 344 CLNASGESRPSLPRLSFRLAGGAVLSPPPRNYFLETEER-VMCLAIRPVDSGNGFSVIGN 403
           C + SG     +P +SF    G VL+ P RN+ +  ++    C A     S  G S+IGN
Sbjct: 414 CYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAF--AASPTGLSIIGN 470

Query: 404 LMQQGFLLEFDKDASRLGFSRRGC 426
           + Q+G  + FD     +GF    C
Sbjct: 474 IQQEGIQVSFDGANGFVGFGPNVC 470

BLAST of MC04g0861 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 209.1 bits (531), Expect = 9.6e-53
Identity = 140/397 (35.26%), Positives = 199/397 (50.13%), Query Frame = 0

Query: 36  SRNRNRNRNAVKSPLISGAST----GSGQYFVDLRLGTPPQSLLLVADTGSDLVWVKCSP 95
           SR   R    +  P  SG  T    G G+Y ++L +GTP Q    + DTGSDL+W +C P
Sbjct: 68  SRRLQRLEAMLNGP--SGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQP 127

Query: 96  CRNCSHHAPSSAFLPRHSSSFSPHHCFDPACRLVPHARLCNHTRLHSPCPYHYSYADGSL 155
           C  C + + +  F P+ SSSFS   C    C+      L + T  ++ C Y Y Y DGS 
Sbjct: 128 CTQCFNQS-TPIFNPQGSSSFSTLPCSSQLCQ-----ALSSPTCSNNFCQYTYGYGDGSE 187

Query: 156 SAGFFSKDTTTLKTLSGAEARLPNLSFGCGFRISGPSVSGSTFNGARGVMGLGRGPISFS 215
           + G    +T T  ++S     +PN++FGCG    G        NGA G++G+GRGP+S  
Sbjct: 188 TQGSMGTETLTFGSVS-----IPNITFGCGENNQGFGQG----NGA-GLVGMGRGPLSLP 247

Query: 216 SQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGARRSPAVSNASRISYTPLQINPLSPTFYY 275
           SQL      KFSYC+     S P  S L++G+  +   + +     T L  +   PTFYY
Sbjct: 248 SQLD---VTKFSYCMTPIGSSTP--SNLLLGSLANSVTAGSPN---TTLIQSSQIPTFYY 307

Query: 276 IRVKSIAVDGVTLPINPAVWAID-ELGNGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKL 335
           I +  ++V    LPI+P+ +A++   G GG ++DSGTTLT+    AY  V   F  Q+ L
Sbjct: 308 ITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINL 367

Query: 336 PRPAELTPGFDLCLNA-SGESRPSLPRLSFRLAGGAVLSPPPRNYFLETEERVMCLAIRP 395
           P     + GFDLC    S  S   +P       GG  L  P  NYF+     ++CLA+  
Sbjct: 368 PVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGD-LELPSENYFISPSNGLICLAMG- 427

Query: 396 VDSGNGFSVIGNLMQQGFLLEFDKDASRLGFSRRGCG 427
             S  G S+ GN+ QQ  L+ +D   S + F+   CG
Sbjct: 428 -SSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQCG 435

BLAST of MC04g0861 vs. NCBI nr
Match: XP_038907006.1 (aspartyl protease family protein 2 [Benincasa hispida])

HSP 1 Score: 701 bits (1809), Expect = 1.11e-251
Identity = 352/431 (81.67%), Positives = 378/431 (87.70%), Query Frame = 0

Query: 2   ATQFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQY 61
           A  +LKLPLLH  PFSSPS AL+ D HRLS+LFS     R    +KSPLISGASTGSGQY
Sbjct: 24  AADYLKLPLLHKPPFSSPSQALSSDTHRLSLLFS-----RPNPTLKSPLISGASTGSGQY 83

Query: 62  FVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDPA 121
           FVDLRLGTPPQSLLLVADTGSDLVWVKCS CRNCSHH PS+AFLPRHSSSFSP HCFDP 
Sbjct: 84  FVDLRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSTAFLPRHSSSFSPFHCFDPH 143

Query: 122 CRLVPHA--RLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSFG 181
           CRL+PHA   LCNHTR HSPC + YSYADGSLS+GFFSK+TTTLKTLSG+E  L  LSFG
Sbjct: 144 CRLLPHAPPHLCNHTRFHSPCRFLYSYADGSLSSGFFSKETTTLKTLSGSEIHLKGLSFG 203

Query: 182 CGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYL 241
           CGFRISGPSVSG+ F+GARGVMGLGRG ISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYL
Sbjct: 204 CGFRISGPSVSGAQFSGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYL 263

Query: 242 MIGA-RRSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDELGN 301
           MIG   RS  V+NA++ISYTPLQINPLSPTFYYI + SI VDGV LPINPAVWA+D+ GN
Sbjct: 264 MIGGGHRSLPVTNATKISYTPLQINPLSPTFYYIAIHSITVDGVKLPINPAVWAMDKQGN 323

Query: 302 GGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGESR-PSLPRL 361
           GGTVVDSGTTLT+LA+ AYD+VL A RR+VKLP  +ELTPGFDLC+NAS  SR PSLPRL
Sbjct: 324 GGTVVDSGTTLTYLAKAAYDEVLKAVRRRVKLPSASELTPGFDLCVNASDSSRRPSLPRL 383

Query: 362 SFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDASR 421
            FRL GGAV +PPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDA+R
Sbjct: 384 RFRLGGGAVFAPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDAAR 443

Query: 422 LGFSRRGCGLP 428
           LGFSRRGCGLP
Sbjct: 444 LGFSRRGCGLP 449

BLAST of MC04g0861 vs. NCBI nr
Match: XP_023549997.1 (aspartyl protease family protein 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 694 bits (1792), Expect = 6.25e-249
Identity = 349/432 (80.79%), Positives = 378/432 (87.50%), Query Frame = 0

Query: 1   AATQFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQ 60
           AA  +LKLPLLH NPFSSPS AL+ D HRLS+LFS+  R  N   +KSPLISGASTGSGQ
Sbjct: 29  AAADYLKLPLLHKNPFSSPSQALSSDTHRLSLLFSALRRRPNPT-LKSPLISGASTGSGQ 88

Query: 61  YFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDP 120
           YFVDLR+GTPPQSLLLVADTGSDLVWVKCS CRNCSHH PSSAFLPRHSSSFSP HCFDP
Sbjct: 89  YFVDLRIGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDP 148

Query: 121 ACRLVPHA--RLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSF 180
            CRL+PHA   LCNHTRLHSPC + Y+YADGS S+GFFSK+TTTLKTLSG+E RL +LSF
Sbjct: 149 HCRLLPHAPPHLCNHTRLHSPCRFLYTYADGSTSSGFFSKETTTLKTLSGSETRLKDLSF 208

Query: 181 GCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSY 240
           GCGFRISGPSVSG+ FNGARGVMGLGRGPISFS+QLGRRFGNKFSYCLMDYTLSPPPTSY
Sbjct: 209 GCGFRISGPSVSGAQFNGARGVMGLGRGPISFSTQLGRRFGNKFSYCLMDYTLSPPPTSY 268

Query: 241 LMIGAR-RSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDELG 300
           LMIG   RS  V+NA++ISYTPL INPLSPTFYYI VKSI VDGV LPINP VWAIDE G
Sbjct: 269 LMIGGGLRSLPVTNATKISYTPLLINPLSPTFYYIAVKSITVDGVKLPINPTVWAIDEQG 328

Query: 301 NGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGES-RPSLPR 360
           NGGTVVDSGTTLT+LAE AY +VL A R++VKLP  AELTPGFDLC+N S ES RPSLPR
Sbjct: 329 NGGTVVDSGTTLTYLAEEAYKEVLKAVRQRVKLPAAAELTPGFDLCVNVSNESQRPSLPR 388

Query: 361 LSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDAS 420
           + F+L  GAV +PP RNYFLETEE VMCL+IR V+ GNGFSVIGNLMQQGFLLEFDK+AS
Sbjct: 389 VRFQLGNGAVFAPPARNYFLETEEGVMCLSIRAVEGGNGFSVIGNLMQQGFLLEFDKEAS 448

Query: 421 RLGFSRRGCGLP 428
           RLGFSRRGCGLP
Sbjct: 449 RLGFSRRGCGLP 459

BLAST of MC04g0861 vs. NCBI nr
Match: KAG7017139.1 (Aspartic proteinase nepenthesin-2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 692 bits (1787), Expect = 3.11e-248
Identity = 348/432 (80.56%), Positives = 378/432 (87.50%), Query Frame = 0

Query: 1   AATQFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQ 60
           AA  +LKLPLLH NPFSSPS AL+ D HRLS+LFS+  R  N   +KSPLISGASTGSGQ
Sbjct: 25  AAADYLKLPLLHKNPFSSPSQALSSDTHRLSLLFSALRRRPNPT-LKSPLISGASTGSGQ 84

Query: 61  YFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDP 120
           YFVDLR+GTPPQSLLLVADTGSDLVWVKCS CRNCSHH PSSAFLPRHSSSFSP HCFDP
Sbjct: 85  YFVDLRIGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDP 144

Query: 121 ACRLVPHA--RLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSF 180
            CRL+PHA   LCNHTRLHSPC + Y+YADGS S+GFFSK+TTTLKTLSG+E RL +LSF
Sbjct: 145 HCRLLPHAPPHLCNHTRLHSPCRFLYTYADGSTSSGFFSKETTTLKTLSGSETRLKDLSF 204

Query: 181 GCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSY 240
           GCGFRISGPSVSG+ FNGARGVMGLGRGPISFS+QLGRRFGNKFSYCLMDYTLSPPPTSY
Sbjct: 205 GCGFRISGPSVSGAQFNGARGVMGLGRGPISFSTQLGRRFGNKFSYCLMDYTLSPPPTSY 264

Query: 241 LMIGAR-RSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDELG 300
           LMIG   RS  V+NA++ISYTPL INPLSPTFYYI VKSI VDGV LPINP VWAI+E G
Sbjct: 265 LMIGGGLRSLPVTNATKISYTPLLINPLSPTFYYIAVKSITVDGVKLPINPIVWAINEQG 324

Query: 301 NGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGES-RPSLPR 360
           NGGTVVDSGTTLT+LAE AY +VL A R++VKLP  AELTPGFDLC+N S ES RPSLPR
Sbjct: 325 NGGTVVDSGTTLTYLAEEAYKEVLKAMRQRVKLPAAAELTPGFDLCVNVSNESQRPSLPR 384

Query: 361 LSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDAS 420
           + F+L  GAV +PP RNYFL+TEE VMCLAIR V+ GNGFSVIGNLMQQGFLLEFDK+AS
Sbjct: 385 VRFQLGNGAVFAPPARNYFLDTEEGVMCLAIRAVEGGNGFSVIGNLMQQGFLLEFDKEAS 444

Query: 421 RLGFSRRGCGLP 428
           RLGFSRRGCGLP
Sbjct: 445 RLGFSRRGCGLP 455

BLAST of MC04g0861 vs. NCBI nr
Match: XP_022928946.1 (aspartyl protease family protein 2-like [Cucurbita moschata])

HSP 1 Score: 692 bits (1787), Expect = 3.60e-248
Identity = 349/433 (80.60%), Positives = 378/433 (87.30%), Query Frame = 0

Query: 1   AATQFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQ 60
           AA  +LKLPLLH NPFSSPS AL+ D HRLS+LFS+  R  N   +KSPLISGASTGSGQ
Sbjct: 29  AAADYLKLPLLHKNPFSSPSQALSSDTHRLSLLFSALRRRPNPT-LKSPLISGASTGSGQ 88

Query: 61  YFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDP 120
           YFVDLR+GTPPQSLLLVADTGSDLVWVKCS CRNCSHH PSSAFLPRHSSSFSP HCFDP
Sbjct: 89  YFVDLRIGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDP 148

Query: 121 ACRLVPHA--RLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSF 180
            CRL+PHA   LCNHTRLHSPC + Y+YADGS S+GFFSK+TTTLKTL+G+E RL +LSF
Sbjct: 149 HCRLLPHAPPHLCNHTRLHSPCRFLYTYADGSTSSGFFSKETTTLKTLTGSETRLKDLSF 208

Query: 181 GCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSY 240
           GCGFRISGPSVSG+ FNGARGVMGLGRGPISFS+QLGRRFGNKFSYCLMDYTLSPPPTSY
Sbjct: 209 GCGFRISGPSVSGAQFNGARGVMGLGRGPISFSTQLGRRFGNKFSYCLMDYTLSPPPTSY 268

Query: 241 LMIGA--RRSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDEL 300
           LMIG   RR P V+NA++ISYTPL INPLSPTFYYI VKSI VDGV LPINP +WAIDE 
Sbjct: 269 LMIGGGLRRLP-VTNATKISYTPLLINPLSPTFYYIAVKSITVDGVKLPINPTLWAIDEQ 328

Query: 301 GNGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGES-RPSLP 360
           GNGGTVVDSGTTLT+LAE AY +VL A R++VKLP  AELTPGFDLC+N S ES RPSLP
Sbjct: 329 GNGGTVVDSGTTLTYLAEEAYKEVLKAMRQRVKLPAAAELTPGFDLCVNVSNESQRPSLP 388

Query: 361 RLSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDA 420
           R+ F+L  GAV  PP RNYFLETEE VMCLAIR V+ GNGFSVIGNLMQQGFLLEFDK+A
Sbjct: 389 RVRFQLGNGAVFPPPARNYFLETEEGVMCLAIRAVEGGNGFSVIGNLMQQGFLLEFDKEA 448

Query: 421 SRLGFSRRGCGLP 428
           SRLGFSRRGCGLP
Sbjct: 449 SRLGFSRRGCGLP 459

BLAST of MC04g0861 vs. NCBI nr
Match: XP_022969957.1 (aspartyl protease family protein 2 [Cucurbita maxima])

HSP 1 Score: 692 bits (1787), Expect = 3.60e-248
Identity = 349/432 (80.79%), Positives = 377/432 (87.27%), Query Frame = 0

Query: 1   AATQFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQ 60
           AA  +LKLPLLH NPFSSPS AL+ D HRLS+LFS+  R  N   +KSPLISGASTGSGQ
Sbjct: 29  AAADYLKLPLLHKNPFSSPSQALSSDTHRLSLLFSALRRRPNPT-LKSPLISGASTGSGQ 88

Query: 61  YFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDP 120
           YFVDLR+GTPPQSLLLVADTGSDLVWVKCS CRNCSHH PSSAFLPRHSSSFSP HCFDP
Sbjct: 89  YFVDLRIGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDP 148

Query: 121 ACRLVPHA--RLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSF 180
            CRL+PHA   LCNHTRLHSPC + Y+YADGS S+GFFSK+TTTLKTLSG+E RL +LSF
Sbjct: 149 HCRLLPHAPPHLCNHTRLHSPCRFLYTYADGSTSSGFFSKETTTLKTLSGSETRLKDLSF 208

Query: 181 GCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSY 240
           GCGFRISGPSVSG+ FNGARGVMGLGRGPISFS+QLGRRFGNKFSYCLMDYTLSPPPTSY
Sbjct: 209 GCGFRISGPSVSGAQFNGARGVMGLGRGPISFSTQLGRRFGNKFSYCLMDYTLSPPPTSY 268

Query: 241 LMIGAR-RSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDELG 300
           LMIG   RS  V+NA++ISYTPL INPLSPTFYYI VKSI VDGV LPINP VWAIDE G
Sbjct: 269 LMIGGGLRSLPVTNATKISYTPLLINPLSPTFYYIAVKSITVDGVKLPINPTVWAIDEQG 328

Query: 301 NGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGES-RPSLPR 360
           NGGTVVDSGTTLT+LAE AY +VL A R++VKLP  AELTPGFDLC+N S ES RPSLPR
Sbjct: 329 NGGTVVDSGTTLTYLAEEAYKEVLKAVRQRVKLPAAAELTPGFDLCVNVSKESQRPSLPR 388

Query: 361 LSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDAS 420
           + FR+  GAV +PP RNYFLET E VMCLAIR V+ GNGFSVIGNLMQQGFLLEFDK+AS
Sbjct: 389 VRFRVGNGAVFAPPARNYFLETVEGVMCLAIRAVEGGNGFSVIGNLMQQGFLLEFDKEAS 448

Query: 421 RLGFSRRGCGLP 428
           RLGFSRRGCGLP
Sbjct: 449 RLGFSRRGCGLP 459

BLAST of MC04g0861 vs. ExPASy TrEMBL
Match: A0A6J1HXS2 (aspartyl protease family protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111469001 PE=3 SV=1)

HSP 1 Score: 692 bits (1787), Expect = 1.74e-248
Identity = 349/432 (80.79%), Positives = 377/432 (87.27%), Query Frame = 0

Query: 1   AATQFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQ 60
           AA  +LKLPLLH NPFSSPS AL+ D HRLS+LFS+  R  N   +KSPLISGASTGSGQ
Sbjct: 29  AAADYLKLPLLHKNPFSSPSQALSSDTHRLSLLFSALRRRPNPT-LKSPLISGASTGSGQ 88

Query: 61  YFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDP 120
           YFVDLR+GTPPQSLLLVADTGSDLVWVKCS CRNCSHH PSSAFLPRHSSSFSP HCFDP
Sbjct: 89  YFVDLRIGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDP 148

Query: 121 ACRLVPHA--RLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSF 180
            CRL+PHA   LCNHTRLHSPC + Y+YADGS S+GFFSK+TTTLKTLSG+E RL +LSF
Sbjct: 149 HCRLLPHAPPHLCNHTRLHSPCRFLYTYADGSTSSGFFSKETTTLKTLSGSETRLKDLSF 208

Query: 181 GCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSY 240
           GCGFRISGPSVSG+ FNGARGVMGLGRGPISFS+QLGRRFGNKFSYCLMDYTLSPPPTSY
Sbjct: 209 GCGFRISGPSVSGAQFNGARGVMGLGRGPISFSTQLGRRFGNKFSYCLMDYTLSPPPTSY 268

Query: 241 LMIGAR-RSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDELG 300
           LMIG   RS  V+NA++ISYTPL INPLSPTFYYI VKSI VDGV LPINP VWAIDE G
Sbjct: 269 LMIGGGLRSLPVTNATKISYTPLLINPLSPTFYYIAVKSITVDGVKLPINPTVWAIDEQG 328

Query: 301 NGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGES-RPSLPR 360
           NGGTVVDSGTTLT+LAE AY +VL A R++VKLP  AELTPGFDLC+N S ES RPSLPR
Sbjct: 329 NGGTVVDSGTTLTYLAEEAYKEVLKAVRQRVKLPAAAELTPGFDLCVNVSKESQRPSLPR 388

Query: 361 LSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDAS 420
           + FR+  GAV +PP RNYFLET E VMCLAIR V+ GNGFSVIGNLMQQGFLLEFDK+AS
Sbjct: 389 VRFRVGNGAVFAPPARNYFLETVEGVMCLAIRAVEGGNGFSVIGNLMQQGFLLEFDKEAS 448

Query: 421 RLGFSRRGCGLP 428
           RLGFSRRGCGLP
Sbjct: 449 RLGFSRRGCGLP 459

BLAST of MC04g0861 vs. ExPASy TrEMBL
Match: A0A6J1ELQ4 (aspartyl protease family protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC111435697 PE=3 SV=1)

HSP 1 Score: 692 bits (1787), Expect = 1.74e-248
Identity = 349/433 (80.60%), Positives = 378/433 (87.30%), Query Frame = 0

Query: 1   AATQFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQ 60
           AA  +LKLPLLH NPFSSPS AL+ D HRLS+LFS+  R  N   +KSPLISGASTGSGQ
Sbjct: 29  AAADYLKLPLLHKNPFSSPSQALSSDTHRLSLLFSALRRRPNPT-LKSPLISGASTGSGQ 88

Query: 61  YFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDP 120
           YFVDLR+GTPPQSLLLVADTGSDLVWVKCS CRNCSHH PSSAFLPRHSSSFSP HCFDP
Sbjct: 89  YFVDLRIGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDP 148

Query: 121 ACRLVPHA--RLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSF 180
            CRL+PHA   LCNHTRLHSPC + Y+YADGS S+GFFSK+TTTLKTL+G+E RL +LSF
Sbjct: 149 HCRLLPHAPPHLCNHTRLHSPCRFLYTYADGSTSSGFFSKETTTLKTLTGSETRLKDLSF 208

Query: 181 GCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSY 240
           GCGFRISGPSVSG+ FNGARGVMGLGRGPISFS+QLGRRFGNKFSYCLMDYTLSPPPTSY
Sbjct: 209 GCGFRISGPSVSGAQFNGARGVMGLGRGPISFSTQLGRRFGNKFSYCLMDYTLSPPPTSY 268

Query: 241 LMIGA--RRSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDEL 300
           LMIG   RR P V+NA++ISYTPL INPLSPTFYYI VKSI VDGV LPINP +WAIDE 
Sbjct: 269 LMIGGGLRRLP-VTNATKISYTPLLINPLSPTFYYIAVKSITVDGVKLPINPTLWAIDEQ 328

Query: 301 GNGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGES-RPSLP 360
           GNGGTVVDSGTTLT+LAE AY +VL A R++VKLP  AELTPGFDLC+N S ES RPSLP
Sbjct: 329 GNGGTVVDSGTTLTYLAEEAYKEVLKAMRQRVKLPAAAELTPGFDLCVNVSNESQRPSLP 388

Query: 361 RLSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDA 420
           R+ F+L  GAV  PP RNYFLETEE VMCLAIR V+ GNGFSVIGNLMQQGFLLEFDK+A
Sbjct: 389 RVRFQLGNGAVFPPPARNYFLETEEGVMCLAIRAVEGGNGFSVIGNLMQQGFLLEFDKEA 448

Query: 421 SRLGFSRRGCGLP 428
           SRLGFSRRGCGLP
Sbjct: 449 SRLGFSRRGCGLP 459

BLAST of MC04g0861 vs. ExPASy TrEMBL
Match: A0A0A0KNH6 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G174650 PE=3 SV=1)

HSP 1 Score: 689 bits (1778), Expect = 4.09e-247
Identity = 345/432 (79.86%), Positives = 375/432 (86.81%), Query Frame = 0

Query: 1   AATQFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQ 60
           A   FLKLPLLH  PFSSPS +L+ D HRLS+LFS     R    +KSPLISGASTGSGQ
Sbjct: 33  APADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFS-----RPNPTLKSPLISGASTGSGQ 92

Query: 61  YFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDP 120
           YFVD+RLGTPPQSLLLVADTGSDLVWVKCS CRNCSHH PSSAFLPRHSSSFSP HCFDP
Sbjct: 93  YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDP 152

Query: 121 ACRLVPHA--RLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSF 180
            CRL+PHA   LCNHTRLHSPC + YSYADGSLS+GFFSK+TTTLK+LSG+E  L  LSF
Sbjct: 153 HCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSF 212

Query: 181 GCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSY 240
           GCGFRISGPSVSG+ FNGARGVMGLGRG ISFSSQLGRRFGNKFSYCLMDYTLSPPPTS+
Sbjct: 213 GCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSF 272

Query: 241 LMIGAR-RSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDELG 300
           LMIG    S  ++NA++ISYTPLQINPLSPTFYYI + SI +DGV LPINPAVW IDE G
Sbjct: 273 LMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQG 332

Query: 301 NGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGESR-PSLPR 360
           NGGTVVDSGTTLT+L + AY++VL + RR+VKLP  AELTPGFDLC+NASGESR PSLPR
Sbjct: 333 NGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPR 392

Query: 361 LSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDAS 420
           L FRL GGAV +PPPRNYFLETEE VMCLAIR V+SGNGFSVIGNLMQQGFLLEFDK+ S
Sbjct: 393 LRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEES 452

Query: 421 RLGFSRRGCGLP 428
           RLGF+RRGCGLP
Sbjct: 453 RLGFTRRGCGLP 459

BLAST of MC04g0861 vs. ExPASy TrEMBL
Match: A0A1S3CSZ8 (aspartyl protease family protein 2 OS=Cucumis melo OX=3656 GN=LOC103504612 PE=3 SV=1)

HSP 1 Score: 687 bits (1773), Expect = 2.36e-246
Identity = 345/432 (79.86%), Positives = 373/432 (86.34%), Query Frame = 0

Query: 1   AATQFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQ 60
           AA  FLKLPLLH  PFSSPS +L+ D HRLS+LFS     R    +KSPLISGASTGSGQ
Sbjct: 33  AAADFLKLPLLHKPPFSSPSQSLSSDTHRLSLLFS-----RPNPTLKSPLISGASTGSGQ 92

Query: 61  YFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDP 120
           YFVD+RLGTPPQSLLLVADTGSDLVWVKCS CRNCSHH PSSAF PRHSSSFSP HCFDP
Sbjct: 93  YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFFPRHSSSFSPFHCFDP 152

Query: 121 ACRLVPHA--RLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSF 180
            CRL+PHA    CNHT LHSPC + YSYADGSLS+GFFSK+TTTLKTLSG+E  L  LSF
Sbjct: 153 HCRLLPHAPPHHCNHTLLHSPCRFLYSYADGSLSSGFFSKETTTLKTLSGSEIHLKGLSF 212

Query: 181 GCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSY 240
           GCGFRISGPSVSG+ FNGARGVMGLGRG ISFSSQLGRRFGNKFSYCLMDYTLSPPPTS+
Sbjct: 213 GCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSF 272

Query: 241 LMIGAR-RSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDELG 300
           LMIG    S  V+NA++ISYTPLQINPLSPTFYYI + SI +DGV LPINPAVW IDE G
Sbjct: 273 LMIGGGLHSLPVNNATKISYTPLQINPLSPTFYYITINSITIDGVKLPINPAVWEIDEQG 332

Query: 301 NGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGESR-PSLPR 360
           NGGTVVDSGTTLT+L + AY++VL + RR+VKLP  AELTPGFDLC+NASGESR PSLPR
Sbjct: 333 NGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPR 392

Query: 361 LSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDAS 420
           L FRL GGAV +PPPRNYFLETEE VMCLAIR V+SGNGFSVIGNLMQQGFLLEFDK+ S
Sbjct: 393 LRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEES 452

Query: 421 RLGFSRRGCGLP 428
           RLGF+RRGCGLP
Sbjct: 453 RLGFTRRGCGLP 459

BLAST of MC04g0861 vs. ExPASy TrEMBL
Match: A0A6J1EPZ3 (aspartyl protease family protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC111436585 PE=3 SV=1)

HSP 1 Score: 671 bits (1730), Expect = 7.14e-240
Identity = 336/432 (77.78%), Positives = 369/432 (85.42%), Query Frame = 0

Query: 3   TQFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQYF 62
           +Q+LK PLLH+NPFSSPS AL+ D HRLS+LFS+    R+   +KSPLISGASTGSGQYF
Sbjct: 27  SQYLKFPLLHTNPFSSPSQALSSDTHRLSLLFSAH---RHSPTLKSPLISGASTGSGQYF 86

Query: 63  VDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDPAC 122
           V+L LGTPPQSLLLV DTGSDLVWVKCSPCRNCSHH PSSAF PRHSSSFSP HCFDP C
Sbjct: 87  VNLHLGTPPQSLLLVVDTGSDLVWVKCSPCRNCSHHPPSSAFFPRHSSSFSPFHCFDPHC 146

Query: 123 RLVPH--ARLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGA--EARLPNLSF 182
           RL+PH  +  CNHT LHSPC + YSYAD SLS+GFFSKD TT  T SG   + RL +LSF
Sbjct: 147 RLLPHPPSHRCNHTHLHSPCSFLYSYADSSLSSGFFSKDVTTFNTFSGTHTQTRLNDLSF 206

Query: 183 GCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSY 242
           GCGFRISGPSVSG+ F GARGVMGLGRGPISFSSQLG RFGN FSYCLMDYTLSPPPTSY
Sbjct: 207 GCGFRISGPSVSGARFTGARGVMGLGRGPISFSSQLGHRFGNTFSYCLMDYTLSPPPTSY 266

Query: 243 LMIGAR-RSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDELG 302
           LMIG   RS  V+NAS+ISYTPLQINPLSPTFYYI VKSI VDGV LPINP VWAIDE G
Sbjct: 267 LMIGGGLRSLPVTNASKISYTPLQINPLSPTFYYIVVKSITVDGVKLPINPKVWAIDEQG 326

Query: 303 NGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGESRP-SLPR 362
           NGGTVVDSGTTLT+LAE AY++VL A RR+VKLPR  +L+PGFDLC+NAS ESR  SLP+
Sbjct: 327 NGGTVVDSGTTLTYLAEAAYEEVLKAMRRRVKLPRALQLSPGFDLCVNASSESRMRSLPQ 386

Query: 363 LSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDAS 422
           + FR+ GG V +PP RNYF+ETEE VMCLAIRPVDSGNGFSVIGNLMQQGFLLEFD++ S
Sbjct: 387 IRFRVGGGGVFAPPARNYFVETEEGVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDREKS 446

Query: 423 RLGFSRRGCGLP 428
           R+GFSRRGCGLP
Sbjct: 447 RMGFSRRGCGLP 455
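
The percentages on each HSP summary line are the raw counts divided by the alignment length, for example 336/432 for identity in the A0A6J1EPZ3 hit. A minimal sketch of that arithmetic follows. The helper name `check_hsp` and the regular expression are illustrative assumptions and are not part of the database or of any BLAST tool; the pattern accepts either spelling of the positives label.

```python
import re

# Hypothetical helper (not part of the database page): parse one HSP summary
# line and recompute the percentages from the raw counts it reports.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp(line):
    """Return recomputed and reported identity/positives percentages."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "identity_reported": float(ident_pct),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        "positives_reported": float(pos_pct),
    }


# Summary line of the A0A6J1EPZ3 hit shown above.
result = check_hsp(
    "Identity = 336/432 (77.78%), Positives = 369/432 (85.42%), Query Frame = 0"
)
print(result)
```

The same check can be run on any other summary line in this section to confirm that the reported percentages agree with the counts.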

BLAST of MC04g0861 vs. TAIR 10
Match: AT3G25700.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 599.4 bits (1544), Expect = 2.3e-171
Identity = 291/430 (67.67%), Positives = 345/430 (80.23%), Query Frame = 0

Query: 4   QFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQYFV 63
           ++LKLPLL  +PF SP+ ALA D  RL  L     R +    VKSP++SGA++GSGQYFV
Sbjct: 30  KYLKLPLLRKSPFPSPTQALALDTRRLHFL---SLRRKPIPFVKSPVVSGAASGSGQYFV 89

Query: 64  DLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDPACR 123
           DLR+G PPQSLLL+ADTGSDLVWVKCS CRNCSHH+P++ F PRHSS+FSP HC+DP CR
Sbjct: 90  DLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCR 149

Query: 124 LVP---HARLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSFGC 183
           LVP    A +CNHTR+HS C Y Y YADGSL++G F+++TT+LKT SG EARL +++FGC
Sbjct: 150 LVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGC 209

Query: 184 GFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYLM 243
           GFRISG SVSG++FNGA GVMGLGRGPISF+SQLGRRFGNKFSYCLMDYTLSPPPTSYL+
Sbjct: 210 GFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLI 269

Query: 244 IGARRSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDELGNGG 303
           IG          S++ +TPL  NPLSPTFYY+++KS+ V+G  L I+P++W ID+ GNGG
Sbjct: 270 IG----NGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGG 329

Query: 304 TVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGESRPS--LPRLS 363
           TVVDSGTTL FLAEPAY  V+AA RR+VKLP    LTPGFDLC+N SG ++P   LPRL 
Sbjct: 330 TVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLK 389

Query: 364 FRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDASRL 423
           F  +GGAV  PPPRNYF+ETEE++ CLAI+ VD   GFSVIGNLMQQGFL EFD+D SRL
Sbjct: 390 FEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRL 449

Query: 424 GFSRRGCGLP 429
           GFSRRGC LP
Sbjct: 450 GFSRRGCALP 452

BLAST of MC04g0861 vs. TAIR 10
Match: AT3G25700.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 410.2 bits (1053), Expect = 2.0e-114
Identity = 221/430 (51.40%), Positives = 259/430 (60.23%), Query Frame = 0

Query: 4   QFLKLPLLHSNPFSSPSHALAFDAHRLSILFSSRNRNRNRNAVKSPLISGASTGSGQYFV 63
           ++LKLPLL  +PF SP+ ALA D  RL  L     R +    VKSP++SGA++GSGQYFV
Sbjct: 30  KYLKLPLLRKSPFPSPTQALALDTRRLHFL---SLRRKPIPFVKSPVVSGAASGSGQYFV 89

Query: 64  DLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDPACR 123
           DLR+G PPQSLLL+ADTGSDLVWVKCS CRNCSHH+P++ F PRHSS+FSP HC+DP CR
Sbjct: 90  DLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCR 149

Query: 124 LVP---HARLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSFGC 183
           LVP    A +CNHTR+HS C Y Y YADGSL++G F+++TT+LKT SG EARL +++FGC
Sbjct: 150 LVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGC 209

Query: 184 GFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYLM 243
           GFRISG SVS                                                  
Sbjct: 210 GFRISGQSVS-------------------------------------------------- 269

Query: 244 IGARRSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLPINPAVWAIDELGNGG 303
                                                                   GNGG
Sbjct: 270 --------------------------------------------------------GNGG 329

Query: 304 TVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGESRPS--LPRLS 363
           TVVDSGTTL FLAEPAY  V+AA RR+VKLP    LTPGFDLC+N SG ++P   LPRL 
Sbjct: 330 TVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLK 350

Query: 364 FRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDASRL 423
           F  +GGAV  PPPRNYF+ETEE++ CLAI+ VD   GFSVIGNLMQQGFL EFD+D SRL
Sbjct: 390 FEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRL 350

Query: 424 GFSRRGCGLP 429
           GFSRRGC LP
Sbjct: 450 GFSRRGCALP 350

BLAST of MC04g0861 vs. TAIR 10
Match: AT2G42980.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 243.4 bits (620), Expect = 3.3e-64
Identity = 152/388 (39.18%), Positives = 211/388 (54.38%), Query Frame = 0

Query: 50  LISGASTGSGQYFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHS 109
           L SG + GSG+YF+D+ +GTPP+   L+ DTGSDL W++C PC +C  H     + P+ S
Sbjct: 149 LESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDC-FHQNGMFYDPKTS 208

Query: 110 SSFSPHHCFDPACRLVPHAR---LCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTT--LK 169
           +SF    C DP C L+        C     +  CPY Y Y D S + G F+ +T T  L 
Sbjct: 209 ASFKNITCNDPRCSLISSPDPPVQCESD--NQSCPYFYWYGDRSNTTGDFAVETFTVNLT 268

Query: 170 TLSG--AEARLPNLSFGCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKF 229
           T  G  +E ++ N+ FGCG    G       F+GA G++GLGRGP+SFSSQL   +G+ F
Sbjct: 269 TTEGGSSEYKVGNMMFGCGHWNRG------LFSGASGLLGLGRGPLSFSSQLQSLYGHSF 328

Query: 230 SYCLMDYTLSPPPTSYLMIGARRSPAVSNASRISYTPLQINPLS--PTFYYIRVKSIAVD 289
           SYCL+D   +   +S L+ G  +   + N + +++T       +   TFYYI++KSI V 
Sbjct: 329 SYCLVDRNSNTNVSSKLIFGEDKD--LLNHTNLNFTSFVNGKENSVETFYYIQIKSILVG 388

Query: 290 GVTLPINPAVWAIDELGNGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPA-ELTPG 349
           G  L I    W I   G+GGT++DSGTTL++ AEPAY+ +   F  ++K   P     P 
Sbjct: 389 GKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPV 448

Query: 350 FDLCLNASG--ESRPSLPRLSFRLAGGAVLSPPPRNYFLETEERVMCLAIRPVDSGNGFS 409
            D C N SG  E+   LP L      G V + P  N F+   E ++CLAI        FS
Sbjct: 449 LDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKST-FS 508

Query: 410 VIGNLMQQGFLLEFDKDASRLGFSRRGC 426
           +IGN  QQ F + +D   SRLGF+   C
Sbjct: 509 IIGNYQQQNFHILYDTKRSRLGFTPTKC 524

BLAST of MC04g0861 vs. TAIR 10
Match: AT1G01300.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 241.1 bits (614), Expect = 1.6e-63
Identity = 154/418 (36.84%), Positives = 223/418 (53.35%), Query Frame = 0

Query: 20  SHALAFDAHRLSILFSSRNRNRNRNAVKSP--------LISGASTGSGQYFVDLRLGTPP 79
           S  L  D+ R+  + +   +   RN   +P        ++SG S GSG+YF  L +GTP 
Sbjct: 93  SSRLQRDSRRVKSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLGVGTPA 152

Query: 80  QSLLLVADTGSDLVWVKCSPCRNCSHHAPSSAFLPRHSSSFSPHHCFDPACRLVPHARLC 139
           + + +V DTGSD+VW++C+PCR C +      F PR S +++   C  P CR +  A  C
Sbjct: 153 RYVYMVLDTGSDIVWLQCAPCRRC-YSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAG-C 212

Query: 140 NHTRLHSPCPYHYSYADGSLSAGFFSKDTTTLKTLSGAEARLPNLSFGCGFRISGPSVSG 199
           N  R    C Y  SY DGS + G FS +T T +       R+  ++ GCG    G     
Sbjct: 213 NTRR--KTCLYQVSYGDGSFTVGDFSTETLTFR-----RNRVKGVALGCGHDNEG----- 272

Query: 200 STFNGARGVMGLGRGPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGARRSPAVSN 259
             F GA G++GLG+G +SF  Q G RF  KFSYCL+D + S  P+S +   A    AVS 
Sbjct: 273 -LFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNA----AVSR 332

Query: 260 ASRISYTPLQINPLSPTFYYIRVKSIAVDGVTLP-INPAVWAIDELGNGGTVVDSGTTLT 319
            +R  +TPL  NP   TFYY+ +  I+V G  +P +  +++ +D++GNGG ++DSGT++T
Sbjct: 333 IAR--FTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVT 392

Query: 320 FLAEPAYDQVLAAFRRQVKLPRPAELTPGFDLCLNASGESRPSLPR--LSFRLAGGAVLS 379
            L  PAY  +  AFR   K  + A     FD C + S  +   +P   L FR   GA +S
Sbjct: 393 RLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR---GADVS 452

Query: 380 PPPRNYFLETEER-VMCLAIRPVDSGNGFSVIGNLMQQGFLLEFDKDASRLGFSRRGC 426
            P  NY +  +     C A     +  G S+IGN+ QQGF + +D  +SR+GF+  GC
Sbjct: 453 LPATNYLIPVDTNGKFCFAF--AGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

BLAST of MC04g0861 vs. TAIR 10
Match: AT3G61820.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 237.3 bits (604), Expect = 2.3e-62
Identity = 163/449 (36.30%), Positives = 230/449 (51.22%), Query Frame = 0

Query: 2   ATQFLKLPLLHSNPFSSPSHA---------LAFDAHRLSILFSSRNRNRNRNAVK----- 61
           +T  L + L H +  SS S A         L  D+ R+  + S    +  RNA K     
Sbjct: 57  STTSLSVHLSHVDALSSFSDASPADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTPRT 116

Query: 62  -----SPLISGASTGSGQYFVDLRLGTPPQSLLLVADTGSDLVWVKCSPCRNCSHHAPSS 121
                  +ISG S GSG+YF+ L +GTP  ++ +V DTGSD+VW++CSPC+ C ++   +
Sbjct: 117 AGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKAC-YNQTDA 176

Query: 122 AFLPRHSSSFSPHHCFDPACRLVPHARLCNHTRLHSPCPYHYSYADGSLSAGFFSKDTTT 181
            F P+ S +F+   C    CR +  +  C  TR    C Y  SY DGS + G FS +T T
Sbjct: 177 IFDPKKSKTFATVPCGSRLCRRLDDSSEC-VTRRSKTCLYQVSYGDGSFTEGDFSTETLT 236

Query: 182 LKTLSGAEARLPNLSFGCGFRISGPSVSGSTFNGARGVMGLGRGPISFSSQLGRRFGNKF 241
                   AR+ ++  GCG    G       F GA G++GLGRG +SF SQ   R+  KF
Sbjct: 237 FH-----GARVDHVPLGCGHDNEG------LFVGAAGLLGLGRGGLSFPSQTKNRYNGKF 296

Query: 242 SYCLMDYT---LSPPPTSYLMIGARRSPAVSNASRISYTPLQINPLSPTFYYIRVKSIAV 301
           SYCL+D T    S  P S ++ G    P  S      +TPL  NP   TFYY+++  I+V
Sbjct: 297 SYCLVDRTSSGSSSKPPSTIVFGNAAVPKTS-----VFTPLLTNPKLDTFYYLQLLGISV 356

Query: 302 DGVTLP-INPAVWAIDELGNGGTVVDSGTTLTFLAEPAYDQVLAAFRRQVKLPRPAELTP 361
            G  +P ++ + + +D  GNGG ++DSGT++T L +PAY  +  AFR      + A    
Sbjct: 357 GGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYS 416

Query: 362 GFDLCLNASGESRPSLPRLSFRLAGGAVLSPPPRNYFL--ETEERVMCLAIRPVDSGNGF 421
            FD C + SG +   +P + F   GG V S P  NY +   TE R  C A     +    
Sbjct: 417 LFDTCFDLSGMTTVKVPTVVFHFGGGEV-SLPASNYLIPVNTEGR-FCFAF--AGTMGSL 476

Query: 422 SVIGNLMQQGFLLEFDKDASRLGFSRRGC 426
           S+IGN+ QQGF + +D   SR+GF  R C
Sbjct: 477 SIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q9LNJ3	2.3e-62	36.84	Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ...
Q9LTW4	1.2e-55	36.64	Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE...
Q766C2	4.3e-53	35.71	Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S...
Q9LHE3	7.3e-53	34.38	Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP...
Q766C3	9.6e-53	35.26	Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S...

Match Name	E-value	Identity	Description
XP_038907006.1	1.11e-251	81.67	aspartyl protease family protein 2 [Benincasa hispida]
XP_023549997.1	6.25e-249	80.79	aspartyl protease family protein 2 [Cucurbita pepo subsp. pepo]
KAG7017139.1	3.11e-248	80.56	Aspartic proteinase nepenthesin-2, partial [Cucurbita argyrosperma subsp. argyro...
XP_022928946.1	3.60e-248	80.60	aspartyl protease family protein 2-like [Cucurbita moschata]
XP_022969957.1	3.60e-248	80.79	aspartyl protease family protein 2 [Cucurbita maxima]

Match Name	E-value	Identity	Description
A0A6J1HXS2	1.74e-248	80.79	aspartyl protease family protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111469001 P...
A0A6J1ELQ4	1.74e-248	80.60	aspartyl protease family protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC1114...
A0A0A0KNH6	4.09e-247	79.86	Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G17465...
A0A1S3CSZ8	2.36e-246	79.86	aspartyl protease family protein 2 OS=Cucumis melo OX=3656 GN=LOC103504612 PE=3 ...
A0A6J1EPZ3	7.14e-240	77.78	aspartyl protease family protein 2-like OS=Cucurbita moschata OX=3662 GN=LOC1114...

Match Name	E-value	Identity	Description
AT3G25700.1	2.3e-171	67.67	Eukaryotic aspartyl protease family protein
AT3G25700.2	2.0e-114	51.40	Eukaryotic aspartyl protease family protein
AT2G42980.1	3.3e-64	39.18	Eukaryotic aspartyl protease family protein
AT1G01300.1	1.6e-63	36.84	Eukaryotic aspartyl protease family protein
AT3G61820.1	2.3e-62	36.30	Eukaryotic aspartyl protease family protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001461	Aspartic peptidase A1 family	PRINTS	PR00792	PEPSIN	coord: 67..87, score: 49.53; coord: 301..312, score: 40.02; coord: 397..412, score: 18.99
IPR032799	Xylanase inhibitor, C-terminal	PFAM	PF14541	TAXi_C	coord: 269..421, e-value: 6.6E-38, score: 130.0
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 33..242, e-value: 1.5E-49, score: 170.6
IPR021109	Aspartic peptidase domain superfamily	GENE3D	2.40.70.10	Acid Proteases	coord: 245..426, e-value: 8.6E-52, score: 177.5
IPR021109	Aspartic peptidase domain superfamily	SUPERFAMILY	50630	Acid proteases	coord: 55..425
IPR032861	Xylanase inhibitor, N-terminal	PFAM	PF14543	TAXi_N	coord: 61..242, e-value: 9.5E-51, score: 172.5
None	No IPR available	PANTHER	PTHR47967	OS07G0603500 PROTEIN-RELATED	coord: 4..426
None	No IPR available	PANTHER	PTHR47967:SF72	BNAA02G27600D PROTEIN	coord: 4..426
IPR033121	Peptidase family A1 domain	PROSITE	PS51767	PEPTIDASE_A1	coord: 61..421, score: 41.219387
IPR034161	Pepsin-like domain, plant	CDD	cd05476	pepsin_A_like_plant	coord: 60..425, e-value: 4.54785E-94, score: 282.229
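
The `coord` ranges in the InterPro annotations can be compared directly to see how the predicted domains sit on the protein. The sketch below assumes the ranges are 1-based, inclusive residue positions, which is the usual InterPro convention but is not stated on this page. The function names `parse_coord`, `span_length` and `contains` are hypothetical helpers introduced only for illustration.

```python
def parse_coord(coord):
    """Split a 'start..end' string into two integers."""
    start, end = (int(x) for x in coord.split(".."))
    return start, end


def span_length(coord):
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    start, end = parse_coord(coord)
    return end - start + 1


def contains(outer, inner):
    """True if the 'inner' range lies entirely within the 'outer' range."""
    o_start, o_end = parse_coord(outer)
    i_start, i_end = parse_coord(inner)
    return o_start <= i_start and i_end <= o_end


# Coordinates copied from the InterPro annotations listed above.
taxi_n = "61..242"        # PF14543 TAXi_N
taxi_c = "269..421"       # PF14541 TAXi_C
peptidase_a1 = "61..421"  # PS51767 PEPTIDASE_A1

print(span_length(taxi_n), span_length(taxi_c), span_length(peptidase_a1))
print(contains(peptidase_a1, taxi_n), contains(peptidase_a1, taxi_c))
```

Under that assumption, both Pfam xylanase-inhibitor halves fall inside the PROSITE peptidase A1 domain span.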

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
MC04g0861.1	MC04g0861.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006508	proteolysis
cellular_component	GO:0005576	extracellular region
molecular_function	GO:0004190	aspartic-type endopeptidase activity