Moc01g15010 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc01g15010
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionaspartic proteinase nepenthesin-1
Locationchr1: 9378141 .. 9380948 (+)
RNA-Seq ExpressionMoc01g15010
SyntenyMoc01g15010
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTGGATTCTTTATGTTCGGTTCGATATCCGATCGTCCTTGTTGCGGTGTTGGCTACGTTGTTCTTCATCGATTTGACGGTTTCGAGCTCGACGCTCTCGCGGCGAGCTCTGCAGCAACAGAAGCTGTTGAATAATGGCTTTAGAATGAGGCTTCATCACGTGGACCATCATGTCAAGAACTTGACAAGATTCGAGCGGCTGCAGCGAGGGGCGGCGCGTGGGAGGAACAGGCTGCAGAGGCTAAATGCCATGGTGTTGGCTGCCGCGGGGGGCGCTGCCGTGGGAGACCAAGTGCAGGCGCCGGTGGTGGCTGGAAACGGCGAGTTTCTTATGAAGTTGGCAATCGGATCTCCGCCGAAAAGCTTCTCGGCGATCATGGACACTGGCAGCGATCTGATATGGACGCAGTGTAAACCTTGTCAGCAGTGTTTCGATCAATCGACACCGATTTTCGACCCAAAAGAATCTTCTTCTTTCTCTAAGCTTTCTTGCTCGAGCGAGCTCTGTGGAGCTCTCCCGACGTCGGCATGCAGCAACGATGGCTGCGAGTATTTATACACGTATGGTGATTATTCCTCCACCCATGGCCTTTTGGGTGCCGAGACCTTCACGTTCGGAGATCCCGGCGAAGATCAGGTTGTTAAATCACTGCTAAATCCGGAAAAAAGAAATAGAGCTTGAGTTAGGATAGATTTGTTATTATTTTTCAATTGCTTTGCTGGTTTCTGCAACTTTCTCATAATCATTTTGTTCTCTATTTTCTATTTTACAAAATAAGCATTAAAATATTATTTTCACCTCTATTTTTTATATAGTCTTTATAAACATTTTTAAAATCTAGGCAAAATTTTAAAAAAGAAAATGAATAATTTTCAAGAACTCATTCGTGTTTCTTAATTAATTGCCTAAAAATTCATAAATATGATTTCAACACTAGTGAAAGACTTATTAATGAAATTGTATTAAAACAAACGTAATATTTTTTAAAACATAAAATTTCCTTTTTAGTTCAAAACTTTCAAAGTCTCCAATTTGTTTAAAGATAACCATTTTAGTTTTCACGTTATTTTGCTATTTCTTTTGATAAGTATTATATCTTTTAGATGTCAAAAGATTATAACACATATTTTTAAGTAAAAGGAATGATATAACTTCAAAGACTGCATGAACATGTTAATATAGGCATATATTTAGATAGATATTTTGAAAATTTAATAACAAAAATATTTTATTTTATTCTAAATATAAATATTTTAAAATAAAATCAGATTTGAACTTAAAATTATTTTGCCAAAGCAACCTCAACGTGTACCTCCAACATGTTGTACTAAAAAAACTTAAAATTATTTTATGGATATTTTTTCAACATCGTATGGGTCGTATTTGAAAATGCAAAGTGATTATGAATATTTTTTTTTAATACAACATCGTGTGGGAAAGAAGATCCGAATTCACGATATTTTGGTTGAAAATAGGTGCCTTAACCCGTTGAACGTAACTCAAGTTGACGTGATTAATTAATACTAACATCTCATTTTTTCTCAATATTTCGTACATATATTCTTAACACAAACAGGTCTCAATCTCGGAGATCGGGTTCGGTTGTGGAGACGACAATGAAGGAGATGGGTTCAGCCAAGGCGCGGGGCTGGTGGGACTGGGGCGCGGGCCCCTATCGTTGGTTTCTCAACTGAAAGAGCAAAAGTTTGCTTATTGCTTAACCCCCATTGATGACTCAAAGCCAAGCTCACTCTTGATGGGATCTCTAGCAAACGTGAAGCCCAAAAAATCACAAGATGAAATCAAAACGACCCCTTTGATCAGAAACCCTTCTCAGCCCTCTTTTTACTATCTTTCTCTTGAAGGGATCTCTGTGGGTGGCTCTCATTTGGCCATTCCACAGCCAACTTTTGAGCTTCGTGATGATGGGAGTGGAGGAGTGATCATAGACTCAGGTACAACAATCACATACATTGACAAAAGTGCTTTTACTCTACTCAAGAAGGAGTTCATTGCTCAGATGAAGCTTCCAGTCGACGACTCTGGCACCGGTGGCCTCGACCTCTGCTTCAAATTGCCCTCCGAGGCAACTCAGGTGCGTTGAGTTACCATTGAAACCTGAACGTTAAAGTTCGTGTTTTCTTTGAAAAATTACTAATTTAGAGCGTGTTTAGTGTATGGATTACTAAGGGGTAATTATTTAAACGACTAAAAGTACTTTCTTAAAAGAAAAATCGATAATGTTTGGTAATAAATGAAAAATATGTTTCATAAATCAGATTGATTTAAAACTCTTTAGAAAGAAATACTTGATGGATAACTATGATTTTAAATTAGTATTTGACCGAAAATCGTTTACTACTAAGAGAAAAGTATATATTTGTGATTAGTAGAATAACGTTGTTTGAATGCATATAGTGCAACCAAATAAATTTGTTATATTGTAACAAAATGTAATCAAATGCATGATTAAAAGCATTTATATTAAAAGTAGTTTAACAAGAAACGCGCTTAAACTAAAAGGTATATCAAACCCGCCTCAAATTAATGAGATAGGTATGTTGAATTTGTTTTAACAAGGTGGAGGTTCCGAAGCTGACTTTCCATTTCAAAGACGCCGATCTAGAGCTCCCCGGCGAGAACTACATGATCGGTGATTCTGCGGCCGGGTTGTTATGCTTGGCCATTGGAAGTTCCAGTGGCATGTCTATATTTGGGAATCTTCAGCAGCAAAACTTCATGGTTGTTCATGATCTTCAAGAGGAAACCATTTCGTTTGTGCCTACTCAATGTGATGGAATATAA

mRNA sequence

ATGTTGGATTCTTTATGTTCGGTTCGATATCCGATCGTCCTTGTTGCGGTGTTGGCTACGTTGTTCTTCATCGATTTGACGGTTTCGAGCTCGACGCTCTCGCGGCGAGCTCTGCAGCAACAGAAGCTGTTGAATAATGGCTTTAGAATGAGGCTTCATCACGTGGACCATCATGTCAAGAACTTGACAAGATTCGAGCGGCTGCAGCGAGGGGCGGCGCGTGGGAGGAACAGGCTGCAGAGGCTAAATGCCATGGTGTTGGCTGCCGCGGGGGGCGCTGCCGTGGGAGACCAAGTGCAGGCGCCGGTGGTGGCTGGAAACGGCGAGTTTCTTATGAAGTTGGCAATCGGATCTCCGCCGAAAAGCTTCTCGGCGATCATGGACACTGGCAGCGATCTGATATGGACGCAGTGTAAACCTTGTCAGCAGTGTTTCGATCAATCGACACCGATTTTCGACCCAAAAGAATCTTCTTCTTTCTCTAAGCTTTCTTGCTCGAGCGAGCTCTGTGGAGCTCTCCCGACGTCGGCATGCAGCAACGATGGCTGCGAGTATTTATACACGTATGGTGATTATTCCTCCACCCATGGCCTTTTGGGTGCCGAGACCTTCACGTTCGGAGATCCCGGCGAAGATCAGGTCTCAATCTCGGAGATCGGGTTCGGTTGTGGAGACGACAATGAAGGAGATGGGTTCAGCCAAGGCGCGGGGCTGGTGGGACTGGGGCGCGGGCCCCTATCGTTGGTTTCTCAACTGAAAGAGCAAAAGTTTGCTTATTGCTTAACCCCCATTGATGACTCAAAGCCAAGCTCACTCTTGATGGGATCTCTAGCAAACGTGAAGCCCAAAAAATCACAAGATGAAATCAAAACGACCCCTTTGATCAGAAACCCTTCTCAGCCCTCTTTTTACTATCTTTCTCTTGAAGGGATCTCTGTGGGTGGCTCTCATTTGGCCATTCCACAGCCAACTTTTGAGCTTCGTGATGATGGGAGTGGAGGAGTGATCATAGACTCAGGTACAACAATCACATACATTGACAAAAGTGCTTTTACTCTACTCAAGAAGGAGTTCATTGCTCAGATGAAGCTTCCAGTCGACGACTCTGGCACCGGTGGCCTCGACCTCTGCTTCAAATTGCCCTCCGAGGCAACTCAGGTGGAGGTTCCGAAGCTGACTTTCCATTTCAAAGACGCCGATCTAGAGCTCCCCGGCGAGAACTACATGATCGGTGATTCTGCGGCCGGGTTGTTATGCTTGGCCATTGGAAGTTCCAGTGGCATGTCTATATTTGGGAATCTTCAGCAGCAAAACTTCATGGTTGTTCATGATCTTCAAGAGGAAACCATTTCGTTTGTGCCTACTCAATGTGATGGAATATAA

Coding sequence (CDS)

ATGTTGGATTCTTTATGTTCGGTTCGATATCCGATCGTCCTTGTTGCGGTGTTGGCTACGTTGTTCTTCATCGATTTGACGGTTTCGAGCTCGACGCTCTCGCGGCGAGCTCTGCAGCAACAGAAGCTGTTGAATAATGGCTTTAGAATGAGGCTTCATCACGTGGACCATCATGTCAAGAACTTGACAAGATTCGAGCGGCTGCAGCGAGGGGCGGCGCGTGGGAGGAACAGGCTGCAGAGGCTAAATGCCATGGTGTTGGCTGCCGCGGGGGGCGCTGCCGTGGGAGACCAAGTGCAGGCGCCGGTGGTGGCTGGAAACGGCGAGTTTCTTATGAAGTTGGCAATCGGATCTCCGCCGAAAAGCTTCTCGGCGATCATGGACACTGGCAGCGATCTGATATGGACGCAGTGTAAACCTTGTCAGCAGTGTTTCGATCAATCGACACCGATTTTCGACCCAAAAGAATCTTCTTCTTTCTCTAAGCTTTCTTGCTCGAGCGAGCTCTGTGGAGCTCTCCCGACGTCGGCATGCAGCAACGATGGCTGCGAGTATTTATACACGTATGGTGATTATTCCTCCACCCATGGCCTTTTGGGTGCCGAGACCTTCACGTTCGGAGATCCCGGCGAAGATCAGGTCTCAATCTCGGAGATCGGGTTCGGTTGTGGAGACGACAATGAAGGAGATGGGTTCAGCCAAGGCGCGGGGCTGGTGGGACTGGGGCGCGGGCCCCTATCGTTGGTTTCTCAACTGAAAGAGCAAAAGTTTGCTTATTGCTTAACCCCCATTGATGACTCAAAGCCAAGCTCACTCTTGATGGGATCTCTAGCAAACGTGAAGCCCAAAAAATCACAAGATGAAATCAAAACGACCCCTTTGATCAGAAACCCTTCTCAGCCCTCTTTTTACTATCTTTCTCTTGAAGGGATCTCTGTGGGTGGCTCTCATTTGGCCATTCCACAGCCAACTTTTGAGCTTCGTGATGATGGGAGTGGAGGAGTGATCATAGACTCAGGTACAACAATCACATACATTGACAAAAGTGCTTTTACTCTACTCAAGAAGGAGTTCATTGCTCAGATGAAGCTTCCAGTCGACGACTCTGGCACCGGTGGCCTCGACCTCTGCTTCAAATTGCCCTCCGAGGCAACTCAGGTGGAGGTTCCGAAGCTGACTTTCCATTTCAAAGACGCCGATCTAGAGCTCCCCGGCGAGAACTACATGATCGGTGATTCTGCGGCCGGGTTGTTATGCTTGGCCATTGGAAGTTCCAGTGGCATGTCTATATTTGGGAATCTTCAGCAGCAAAACTTCATGGTTGTTCATGATCTTCAAGAGGAAACCATTTCGTTTGTGCCTACTCAATGTGATGGAATATAA

Protein sequence

MLDSLCSVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVKNLTRFERLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPPKSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSNDGCEYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYLSLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPVDDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCLAIGSSSGMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI
Homology
BLAST of Moc01g15010 vs. NCBI nr
Match: XP_022154910.1 (aspartic proteinase nepenthesin-1 [Momordica charantia])

HSP 1 Score: 910.6 bits (2352), Expect = 5.5e-261
Identity = 460/460 (100.00%), Postives = 460/460 (100.00%), Query Frame = 0

Query: 1   MLDSLCSVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVK 60
           MLDSLCSVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVK
Sbjct: 1   MLDSLCSVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVK 60

Query: 61  NLTRFERLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPP 120
           NLTRFERLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPP
Sbjct: 61  NLTRFERLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPP 120

Query: 121 KSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSN 180
           KSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSN
Sbjct: 121 KSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSN 180

Query: 181 DGCEYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVG 240
           DGCEYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVG
Sbjct: 181 DGCEYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVG 240

Query: 241 LGRGPLSLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQ 300
           LGRGPLSLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQ
Sbjct: 241 LGRGPLSLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQ 300

Query: 301 PSFYYLSLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIA 360
           PSFYYLSLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIA
Sbjct: 301 PSFYYLSLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIA 360

Query: 361 QMKLPVDDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCL 420
           QMKLPVDDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCL
Sbjct: 361 QMKLPVDDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCL 420

Query: 421 AIGSSSGMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI 461
           AIGSSSGMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI
Sbjct: 421 AIGSSSGMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI 460

BLAST of Moc01g15010 vs. NCBI nr
Match: XP_022928703.1 (aspartic proteinase nepenthesin-1 [Cucurbita moschata])

HSP 1 Score: 735.7 bits (1898), Expect = 2.4e-208
Identity = 373/454 (82.16%), Postives = 405/454 (89.21%), Query Frame = 0

Query: 7   SVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVKNLTRFE 66
           S+RY I+ V +  T+ FI  + SSS+LSRRAL Q KL ++GFR+ L+HVD HVKNLTRFE
Sbjct: 4   SLRYLIISVVLSITMLFIHTSASSSSLSRRALWQPKLPSDGFRVSLNHVD-HVKNLTRFE 63

Query: 67  RLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPPKSFSAI 126
           RLQRG ARG+ RL RLNAMVLAA  G  VG +VQAPVVAGNGEFLMKLAIGSPP+SFSAI
Sbjct: 64  RLQRGVARGKTRLHRLNAMVLAANVG--VGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 123

Query: 127 MDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSNDGCEYL 186
           MDTGSDLIWTQCKPCQQCFDQ+TPIFDPKESSSFSK+SCSSELC ALPTS CS+D CEY 
Sbjct: 124 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 183

Query: 187 YTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPL 246
           YTYGDYSSTHG+L AETFTFGD  +DQVSI  +GFGCGDDNEGDGFSQG GLVGLGRGPL
Sbjct: 184 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 243

Query: 247 SLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYL 306
           SLVSQLKEQKF+YCLT IDD+KPSSLLMGSLANVKPK S+ EIKTTPLIRNPSQPSFYYL
Sbjct: 244 SLVSQLKEQKFSYCLTAIDDTKPSSLLMGSLANVKPKASEGEIKTTPLIRNPSQPSFYYL 303

Query: 307 SLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPV 366
           SL+GISVGG+ L IP+ TFEL DDGSGGVIIDSGTTITYI+K+AFTLLKKEF++QMKLPV
Sbjct: 304 SLQGISVGGTQLPIPKATFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 363

Query: 367 DDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCLAIGSSS 426
           DDSGT GLDLCF LP E TQVEVPKLTFHFK ADLELPGENYMIGDS A L+CLAIGSSS
Sbjct: 364 DDSGTSGLDLCFNLPPETTQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLAIGSSS 423

Query: 427 GMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI 461
           GMSIFGNLQQQN MVVHDLQEET+SF+PTQC  I
Sbjct: 424 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSEI 454

BLAST of Moc01g15010 vs. NCBI nr
Match: XP_038883313.1 (aspartic proteinase nepenthesin-1 [Benincasa hispida])

HSP 1 Score: 732.3 bits (1889), Expect = 2.7e-207
Identity = 375/461 (81.34%), Postives = 411/461 (89.15%), Query Frame = 0

Query: 1   MLDSLCSVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQ-QKLLNNGFRMRLHHVDHHV 60
           M  SL S R  I+LV VL T  FID    +  LSRRALQ+  +L +NGF+++L+HVD HV
Sbjct: 1   MTVSLHSYRCLILLVIVLITTLFID----TLALSRRALQKSNELPSNGFKVKLNHVD-HV 60

Query: 61  KNLTRFERLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSP 120
           KNLTRFERL+RG ARG+NRL RLNAMVLAA   AAVG+QVQAPVVAGNGEFLMKLAIG+P
Sbjct: 61  KNLTRFERLRRGVARGKNRLHRLNAMVLAA--NAAVGEQVQAPVVAGNGEFLMKLAIGTP 120

Query: 121 PKSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACS 180
           PKSFSAIMDTGSDLIWTQCKPCQQCFDQ+TPIFDPK SSSFSK+SCSSELCG LPTS CS
Sbjct: 121 PKSFSAIMDTGSDLIWTQCKPCQQCFDQATPIFDPKASSSFSKISCSSELCGPLPTSTCS 180

Query: 181 NDGCEYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLV 240
           +DGCEYLYTYGD SST G+L  ETFTFGD  +DQVSI+ +GFGCGDDNEGDGFSQGAGLV
Sbjct: 181 SDGCEYLYTYGDSSSTQGVLALETFTFGDSSDDQVSITGLGFGCGDDNEGDGFSQGAGLV 240

Query: 241 GLGRGPLSLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPS 300
           GLGRGPLSLVSQLKEQKFAYCLT IDDSKPSSLL+GSLAN+KPK ++DE+KTTPLIRNPS
Sbjct: 241 GLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLLLGSLANIKPKTTKDEMKTTPLIRNPS 300

Query: 301 QPSFYYLSLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFI 360
           QPSFYYLSL+GISVG + L+IP+ TFEL DDGSGGVIIDSGTTITYI+ +AFTLLK EFI
Sbjct: 301 QPSFYYLSLQGISVGSTQLSIPKTTFELHDDGSGGVIIDSGTTITYIENTAFTLLKNEFI 360

Query: 361 AQMKLPVDDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLC 420
           AQM LPVDDSGTGGLDLCF LP+EATQVEVPKLTFHF+ ADLELPGENYMIGDS  GL+C
Sbjct: 361 AQMNLPVDDSGTGGLDLCFNLPAEATQVEVPKLTFHFEGADLELPGENYMIGDSRTGLIC 420

Query: 421 LAIGSSSGMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI 461
           LAIGSS GMSIFGNLQQQNFMVVHDLQEET+SF+PTQCD I
Sbjct: 421 LAIGSSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 454

BLAST of Moc01g15010 vs. NCBI nr
Match: KAG7033562.1 (Aspartic proteinase nepenthesin-1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 731.5 bits (1887), Expect = 4.5e-207
Identity = 370/454 (81.50%), Postives = 404/454 (88.99%), Query Frame = 0

Query: 7   SVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVKNLTRFE 66
           S+RY I+ V +  T+ FI  + SSS+LSRRAL Q KL ++GFR+ L+HVD HVKNLTRFE
Sbjct: 4   SLRYLIISVVLSITMLFIHTSASSSSLSRRALWQPKLPSDGFRVSLNHVD-HVKNLTRFE 63

Query: 67  RLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPPKSFSAI 126
           RLQRG ARG+ RL RLNAM+LAA  G  VG +VQAPVVAGNGEFLMKLAIGSPP+SFSAI
Sbjct: 64  RLQRGVARGKTRLHRLNAMMLAANVG--VGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 123

Query: 127 MDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSNDGCEYL 186
           MDTGSDLIWTQCKPCQQCFDQ+TPIFDPKESSSFSK+SCSSELC ALPTS CS+D CEY 
Sbjct: 124 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 183

Query: 187 YTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPL 246
           YTYGDYSSTHG+L AETFTFGD  +DQVSI  +GFGCGDDNEGDGFSQG GLVGLGRGPL
Sbjct: 184 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 243

Query: 247 SLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYL 306
           SLVSQLKEQKF+YCLT IDD+KPSSLL+GSLANVKPK S+ EIKTTPLI NPSQPSFYYL
Sbjct: 244 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASEGEIKTTPLISNPSQPSFYYL 303

Query: 307 SLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPV 366
           SL+GISVGG+ L IP+ TFEL DDGSGGVIIDSGTTITYI+K+AFTLLKKEF++QMKLPV
Sbjct: 304 SLQGISVGGTQLPIPKATFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 363

Query: 367 DDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCLAIGSSS 426
           DDSGT GLDLCF LP E TQVEVPKLTFHFK ADLELPGENYMIGDS A L+CLAIGSSS
Sbjct: 364 DDSGTSGLDLCFNLPPETTQVEVPKLTFHFKGADLELPGENYMIGDSRAELICLAIGSSS 423

Query: 427 GMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI 461
           GMSIFGNLQQQN MVVHDLQEET+SF+PTQC  I
Sbjct: 424 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 454

BLAST of Moc01g15010 vs. NCBI nr
Match: XP_023543561.1 (aspartic proteinase nepenthesin-1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 731.1 bits (1886), Expect = 5.9e-207
Identity = 370/454 (81.50%), Postives = 405/454 (89.21%), Query Frame = 0

Query: 7   SVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVKNLTRFE 66
           S+RY IV V +  T+ FI  + SSS+ SRRAL+Q KL ++GFR+ L+HVD HVKNLTRFE
Sbjct: 4   SLRYLIVSVVLSITMLFIHTSASSSSHSRRALRQPKLPSDGFRVSLNHVD-HVKNLTRFE 63

Query: 67  RLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPPKSFSAI 126
           +LQRG ARG+ RL RLNAM+LAA  G  VG +VQAPVVAGNGEFLMKLAIGSPP+SFSAI
Sbjct: 64  QLQRGVARGKTRLHRLNAMMLAANVG--VGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 123

Query: 127 MDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSNDGCEYL 186
           MDTGSDLIWTQCKPCQQCFDQ+TPIFDPKESSSFSK+SCSSELC ALPTS CS+D CEY 
Sbjct: 124 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 183

Query: 187 YTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPL 246
           YTYGDYSSTHG+L AETFTFGD  +DQVSI  +GFGCGDDNEGDGFSQG GLVGLGRGPL
Sbjct: 184 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 243

Query: 247 SLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYL 306
           SLVSQLKEQKF+YCLT IDD+KPSSLL+GSLANVKPK S+ EIKTTPLIRNPSQPSFYYL
Sbjct: 244 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASEGEIKTTPLIRNPSQPSFYYL 303

Query: 307 SLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPV 366
           SL+GISVGG+ L IP+ TFEL DDGSGGVIIDSGTTITYI+K+AFTLLKKEF++QMKLPV
Sbjct: 304 SLQGISVGGTQLPIPKATFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 363

Query: 367 DDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCLAIGSSS 426
           DDSGT GLDLCF LP E TQVEVPKLTFHFK ADLELPGENYMIGDS A L+CLAIGSSS
Sbjct: 364 DDSGTSGLDLCFNLPPETTQVEVPKLTFHFKGADLELPGENYMIGDSRAELICLAIGSSS 423

Query: 427 GMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI 461
           GMSIFGNLQQQN MVVHDLQEET+SF+PTQC  I
Sbjct: 424 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 454

BLAST of Moc01g15010 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 418.7 bits (1075), Expect = 8.5e-116
Identity = 229/446 (51.35%), Postives = 300/446 (67.26%), Query Frame = 0

Query: 14  LVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVKNLTRFERLQRGAA 73
           L+A+     F+  T S+S  +     + K+   GF++ L HVD   KNLT+F+ L+R   
Sbjct: 9   LLALSIVYIFVAPTHSTSRTALNHRHEAKV--TGFQIMLEHVDSG-KNLTKFQLLERAIE 68

Query: 74  RGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPPKSFSAIMDTGSDL 133
           RG  RLQRL AM+   +G       V+  V AG+GE+LM L+IG+P + FSAIMDTGSDL
Sbjct: 69  RGSRRLQRLEAMLNGPSG-------VETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDL 128

Query: 134 IWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSNDGCEYLYTYGDYS 193
           IWTQC+PC QCF+QSTPIF+P+ SSSFS L CSS+LC AL +  CSN+ C+Y Y YGD S
Sbjct: 129 IWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGS 188

Query: 194 STHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPLSLVSQLK 253
            T G +G ET TFG      VSI  I FGCG++N+G G   GAGLVG+GRGPLSL SQL 
Sbjct: 189 ETQGSMGTETLTFG-----SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLD 248

Query: 254 EQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYLSLEGISV 313
             KF+YC+TPI  S PS+LL+GSLAN     S +    T LI++   P+FYY++L G+SV
Sbjct: 249 VTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPN----TTLIQSSQIPTFYYITLNGLSV 308

Query: 314 GGSHLAIPQPTFELR-DDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPVDDSGTG 373
           G + L I    F L  ++G+GG+IIDSGTT+TY   +A+  +++EFI+Q+ LPV +  + 
Sbjct: 309 GSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSS 368

Query: 374 GLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCLAIGSSS-GMSIF 433
           G DLCF+ PS+ + +++P    HF   DLELP ENY I  S  GL+CLA+GSSS GMSIF
Sbjct: 369 GFDLCFQTPSDPSNLQIPTFVMHFDGGDLELPSENYFISPS-NGLICLAMGSSSQGMSIF 428

Query: 434 GNLQQQNFMVVHDLQEETISFVPTQC 458
           GN+QQQN +VV+D     +SF   QC
Sbjct: 429 GNIQQQNMLVVYDTGNSVVSFASAQC 434

BLAST of Moc01g15010 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 397.5 bits (1020), Expect = 2.0e-109
Identity = 213/456 (46.71%), Postives = 301/456 (66.01%), Query Frame = 0

Query: 5   LCSVRYPIVL-VAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVKNLT 64
           + S  Y +VL +A+++ +     + S  TL       QK    G R+ L  VD   KNLT
Sbjct: 1   MASPLYSVVLGLAIVSAIVAPTSSTSRGTLLHHG---QKRPQPGLRVDLEQVDSG-KNLT 60

Query: 65  RFERLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPPKSF 124
           ++E ++R   RG  R++ +NAM+ +++G       ++ PV AG+GE+LM +AIG+P  SF
Sbjct: 61  KYELIKRAIKRGERRMRSINAMLQSSSG-------IETPVYAGDGEYLMNVAIGTPDSSF 120

Query: 125 SAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSNDGC 184
           SAIMDTGSDLIWTQC+PC QCF Q TPIF+P++SSSFS L C S+ C  LP+  C+N+ C
Sbjct: 121 SAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNNEC 180

Query: 185 EYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVGLGR 244
           +Y Y YGD S+T G +  ETFTF     +  S+  I FGCG+DN+G G   GAGL+G+G 
Sbjct: 181 QYTYGYGDGSTTQGYMATETFTF-----ETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGW 240

Query: 245 GPLSLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSF 304
           GPLSL SQL   +F+YC+T    S PS+L +GS A+  P+ S     +T LI +   P++
Sbjct: 241 GPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASGVPEGS----PSTTLIHSSLNPTY 300

Query: 305 YYLSLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMK 364
           YY++L+GI+VGG +L IP  TF+L+DDG+GG+IIDSGTT+TY+ + A+  + + F  Q+ 
Sbjct: 301 YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN 360

Query: 365 LPVDDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCLAIG 424
           LP  D  + GL  CF+ PS+ + V+VP+++  F    L L  +N +I   A G++CLA+G
Sbjct: 361 LPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILI-SPAEGVICLAMG 420

Query: 425 SSS--GMSIFGNLQQQNFMVVHDLQEETISFVPTQC 458
           SSS  G+SIFGN+QQQ   V++DLQ   +SFVPTQC
Sbjct: 421 SSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435

BLAST of Moc01g15010 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 260.8 bits (665), Expect = 3.0e-68
Identity = 143/368 (38.86%), Postives = 208/368 (56.52%), Query Frame = 0

Query: 97  DQVQAPVVA----GNGEFLMKLAIGSPPKSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIF 156
           + +  PVV+    G+GE+  ++ +G+P K    ++DTGSD+ W QC+PC  C+ QS P+F
Sbjct: 145 EDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVF 204

Query: 157 DPKESSSFSKLSCSSELCGALPTSACSNDGCEYLYTYGDYSSTHGLLGAETFTFGDPGED 216
           +P  SS++  L+CS+  C  L TSAC ++ C Y  +YGD S T G L  +T TFG+ G+ 
Sbjct: 205 NPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK- 264

Query: 217 QVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTPIDDSKPSSL 276
              I+ +  GCG DNEG  F+  AGL+GLG G LS+ +Q+K   F+YCL   D  K SSL
Sbjct: 265 ---INNVALGCGHDNEG-LFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSL 324

Query: 277 LMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYLSLEGISVGGSHLAIPQPTFELRDDGS 336
              S+      +      T PL+RN    +FYY+ L G SVGG  + +P   F++   GS
Sbjct: 325 DFNSV------QLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGS 384

Query: 337 GGVIIDSGTTITYIDKSAFTLLKKEFI-AQMKLPVDDSGTGGLDLCFKLPSEATQVEVPK 396
           GGVI+D GT +T +   A+  L+  F+   + L    S     D C+   S +T V+VP 
Sbjct: 385 GGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST-VKVPT 444

Query: 397 LTFHFKDA-DLELPGENYMIGDSAAGLLCLAIG-SSSGMSIFGNLQQQNFMVVHDLQEET 456
           + FHF     L+LP +NY+I    +G  C A   +SS +SI GN+QQQ   + +DL +  
Sbjct: 445 VAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNV 500

Query: 457 ISFVPTQC 458
           I     +C
Sbjct: 505 IGLSGNKC 500

BLAST of Moc01g15010 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 251.5 bits (641), Expect = 1.8e-65
Identity = 145/368 (39.40%), Postives = 205/368 (55.71%), Query Frame = 0

Query: 98  QVQAPVVAGNGEFLMKLAIGSPPKSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKES 157
           Q Q  + + +GE+LM ++IG+PP    AI DTGSDL+WTQC PC  C+ Q  P+FDPK S
Sbjct: 78  QPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTS 137

Query: 158 SSFSKLSCSSELCGALPTSA-CS-NDG-CEYLYTYGDYSSTHGLLGAETFTFGDPGEDQV 217
           S++  +SCSS  C AL   A CS ND  C Y  +YGD S T G +  +T T G      +
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPM 197

Query: 218 SISEIGFGCGDDNEGDGFSQGAGLVGLGRGPLSLVSQLKEQ---KFAYCLTPIDDSK--P 277
            +  I  GCG +N G    +G+G+VGLG GP+SL+ QL +    KF+YCL P+   K   
Sbjct: 198 QLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQT 257

Query: 278 SSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYLSLEGISVGGSHLAIPQPTFELRD 337
           S +  G+ A V    S   + +TPLI   SQ +FYYL+L+ ISVG   +   Q +    +
Sbjct: 258 SKINFGTNAIV----SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQI---QYSGSDSE 317

Query: 338 DGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPVDDSGTGGLDLCFKLPSEATQVEV 397
              G +IIDSGTT+T +    ++ L+    + +          GL LC+   S    ++V
Sbjct: 318 SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY---SATGDLKV 377

Query: 398 PKLTFHFKDADLELPGENYMIGDSAAGLLCLAIGSSSGMSIFGNLQQQNFMVVHDLQEET 457
           P +T HF  AD++L   N  +   +  L+C A   S   SI+GN+ Q NF+V +D   +T
Sbjct: 378 PVITMHFDGADVKLDSSNAFV-QVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKT 434

BLAST of Moc01g15010 vs. ExPASy Swiss-Prot
Match: Q7XV21 (Aspartyl protease 37 OS=Oryza sativa subsp. japonica OX=39947 GN=AP37 PE=3 SV=2)

HSP 1 Score: 250.8 bits (639), Expect = 3.1e-65
Identity = 153/447 (34.23%), Postives = 229/447 (51.23%), Query Frame = 0

Query: 48  FRMRLHHVD---HHVKNLTRFERLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVV 107
           FR+ L  VD       NLT  E L+R   R R RL  +  M    A  A      + P++
Sbjct: 25  FRLELASVDASAADAANLTEHELLRRAIQRSRYRLAGI-GMARGEAASARKAVVAETPIM 84

Query: 108 AGNGEFLMKLAIGSPPKSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLS 167
              GE+L+KL IG+PP  F+A +DT SDLIWTQC+PC  C+ Q  P+F+P+ SS+++ L 
Sbjct: 85  PAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALP 144

Query: 168 CSSELCGALPTSACSND---GCEYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGF 227
           CSS+ C  L    C +D    C+Y YTY   ++T G L  +    G+      +   + F
Sbjct: 145 CSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAF 204

Query: 228 GCGDDNEGDG-FSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANV 287
           GC   + G     Q +G+VGLGRGPLSLVSQL  ++FAYCL P     P  L++G  A+ 
Sbjct: 205 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPPASRIPGKLVLG--ADA 264

Query: 288 KPKKSQDEIKTTPLIRNPSQPSFYYLSLEGISVGGSHLAIPQPT---------------- 347
              ++       P+ R+P  PS+YYL+L+G+ +G   +++P  T                
Sbjct: 265 DAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPT 324

Query: 348 -------FELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPVDDSGTGGLDLC 407
                    + D    G+IID  +TIT+++ S +  L  +   +++LP     + GLDLC
Sbjct: 325 PSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLC 384

Query: 408 FKLPSEAT--QVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCLAIG--SSSGMSIFGN 461
           F LP      +V VP +   F    L L        D  +G++CL +G   +  +SI GN
Sbjct: 385 FILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGN 444

BLAST of Moc01g15010 vs. ExPASy TrEMBL
Match: A0A6J1DKZ2 (aspartic proteinase nepenthesin-1 OS=Momordica charantia OX=3673 GN=LOC111022058 PE=3 SV=1)

HSP 1 Score: 910.6 bits (2352), Expect = 2.6e-261
Identity = 460/460 (100.00%), Postives = 460/460 (100.00%), Query Frame = 0

Query: 1   MLDSLCSVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVK 60
           MLDSLCSVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVK
Sbjct: 1   MLDSLCSVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVK 60

Query: 61  NLTRFERLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPP 120
           NLTRFERLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPP
Sbjct: 61  NLTRFERLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPP 120

Query: 121 KSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSN 180
           KSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSN
Sbjct: 121 KSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSN 180

Query: 181 DGCEYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVG 240
           DGCEYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVG
Sbjct: 181 DGCEYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVG 240

Query: 241 LGRGPLSLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQ 300
           LGRGPLSLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQ
Sbjct: 241 LGRGPLSLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQ 300

Query: 301 PSFYYLSLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIA 360
           PSFYYLSLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIA
Sbjct: 301 PSFYYLSLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIA 360

Query: 361 QMKLPVDDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCL 420
           QMKLPVDDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCL
Sbjct: 361 QMKLPVDDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCL 420

Query: 421 AIGSSSGMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI 461
           AIGSSSGMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI
Sbjct: 421 AIGSSSGMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI 460

BLAST of Moc01g15010 vs. ExPASy TrEMBL
Match: A0A6J1EPU6 (aspartic proteinase nepenthesin-1 OS=Cucurbita moschata OX=3662 GN=LOC111435534 PE=3 SV=1)

HSP 1 Score: 735.7 bits (1898), Expect = 1.2e-208
Identity = 373/454 (82.16%), Postives = 405/454 (89.21%), Query Frame = 0

Query: 7   SVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVKNLTRFE 66
           S+RY I+ V +  T+ FI  + SSS+LSRRAL Q KL ++GFR+ L+HVD HVKNLTRFE
Sbjct: 4   SLRYLIISVVLSITMLFIHTSASSSSLSRRALWQPKLPSDGFRVSLNHVD-HVKNLTRFE 63

Query: 67  RLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPPKSFSAI 126
           RLQRG ARG+ RL RLNAMVLAA  G  VG +VQAPVVAGNGEFLMKLAIGSPP+SFSAI
Sbjct: 64  RLQRGVARGKTRLHRLNAMVLAANVG--VGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 123

Query: 127 MDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSNDGCEYL 186
           MDTGSDLIWTQCKPCQQCFDQ+TPIFDPKESSSFSK+SCSSELC ALPTS CS+D CEY 
Sbjct: 124 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 183

Query: 187 YTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPL 246
           YTYGDYSSTHG+L AETFTFGD  +DQVSI  +GFGCGDDNEGDGFSQG GLVGLGRGPL
Sbjct: 184 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 243

Query: 247 SLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYL 306
           SLVSQLKEQKF+YCLT IDD+KPSSLLMGSLANVKPK S+ EIKTTPLIRNPSQPSFYYL
Sbjct: 244 SLVSQLKEQKFSYCLTAIDDTKPSSLLMGSLANVKPKASEGEIKTTPLIRNPSQPSFYYL 303

Query: 307 SLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPV 366
           SL+GISVGG+ L IP+ TFEL DDGSGGVIIDSGTTITYI+K+AFTLLKKEF++QMKLPV
Sbjct: 304 SLQGISVGGTQLPIPKATFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 363

Query: 367 DDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCLAIGSSS 426
           DDSGT GLDLCF LP E TQVEVPKLTFHFK ADLELPGENYMIGDS A L+CLAIGSSS
Sbjct: 364 DDSGTSGLDLCFNLPPETTQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLAIGSSS 423

Query: 427 GMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI 461
           GMSIFGNLQQQN MVVHDLQEET+SF+PTQC  I
Sbjct: 424 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSEI 454

BLAST of Moc01g15010 vs. ExPASy TrEMBL
Match: A0A6J1HXS0 (aspartic proteinase nepenthesin-1 OS=Cucurbita maxima OX=3661 GN=LOC111467205 PE=3 SV=1)

HSP 1 Score: 727.6 bits (1877), Expect = 3.2e-206
Identity = 368/454 (81.06%), Postives = 403/454 (88.77%), Query Frame = 0

Query: 7   SVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNNGFRMRLHHVDHHVKNLTRFE 66
           S+RY IV V +  T+ FI  + SSS+LSRRAL+Q KL ++GFR+ L+HVD HVKNLTRFE
Sbjct: 4   SLRYLIVSVVLSITMLFIHTSASSSSLSRRALRQPKLPSDGFRVSLNHVD-HVKNLTRFE 63

Query: 67  RLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPPKSFSAI 126
           RLQRG ARG+ RL RLNAM+LAA  G  VG +VQAPVVAGNGEFLMKLAIGSPP+SFSAI
Sbjct: 64  RLQRGVARGKTRLHRLNAMMLAANIG--VGGRVQAPVVAGNGEFLMKLAIGSPPRSFSAI 123

Query: 127 MDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSNDGCEYL 186
           MDTGSDLIWTQCKPCQQCFDQ+TPIFDPKESSSFSK+SCSSELC ALPTS CS+D CEY 
Sbjct: 124 MDTGSDLIWTQCKPCQQCFDQATPIFDPKESSSFSKISCSSELCDALPTSTCSSDECEYF 183

Query: 187 YTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPL 246
           YTYGDYSSTHG+L AETFTFGD  +DQVSI  +GFGCGDDNEGDGFSQG GLVGLGRGPL
Sbjct: 184 YTYGDYSSTHGVLAAETFTFGDSSQDQVSIPGLGFGCGDDNEGDGFSQGEGLVGLGRGPL 243

Query: 247 SLVSQLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYL 306
           SLVSQLKEQKF+YCLT IDD+KPSSLL+GSLANVKPK S  EIKTTPLIRNPSQPSFYYL
Sbjct: 244 SLVSQLKEQKFSYCLTAIDDTKPSSLLLGSLANVKPKASDGEIKTTPLIRNPSQPSFYYL 303

Query: 307 SLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPV 366
           SL+GISVGG+ L IP+ TFEL DDGSGGVIIDSGTTITYI+K+AFTLLKKEF++QMKLPV
Sbjct: 304 SLQGISVGGTQLPIPKNTFELHDDGSGGVIIDSGTTITYIEKNAFTLLKKEFVSQMKLPV 363

Query: 367 DDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCLAIGSSS 426
           DDSGT GLDLCF LP +  QVEVPKLTFHFK ADLELPGENYMIGDS A L+CL IGSS+
Sbjct: 364 DDSGTSGLDLCFNLPPKTNQVEVPKLTFHFKGADLELPGENYMIGDSKAELICLTIGSSN 423

Query: 427 GMSIFGNLQQQNFMVVHDLQEETISFVPTQCDGI 461
           GMSIFGNLQQQN MVVHDLQEET+SF+PTQC  I
Sbjct: 424 GMSIFGNLQQQNIMVVHDLQEETVSFLPTQCSDI 454

BLAST of Moc01g15010 vs. ExPASy TrEMBL
Match: A0A0A0KYT9 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G554680 PE=3 SV=1)

HSP 1 Score: 726.5 bits (1874), Expect = 7.1e-206
Identity = 368/449 (81.96%), Postives = 402/449 (89.53%), Query Frame = 0

Query: 13  VLVAVLATLFFIDLTVSSSTLSRRALQQ-QKLLNNGFRMRLHHVDHHVKNLTRFERLQRG 72
           +L+ +L T  FI+    SS+LSRRALQ+  KL ++GFR+RL HVD HVKNLTRFERL+RG
Sbjct: 15  LLLIILITTLFINTLAFSSSLSRRALQKPNKLPSHGFRVRLKHVD-HVKNLTRFERLRRG 74

Query: 73  AARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPPKSFSAIMDTGS 132
            ARG+NRL RLNAMVLAAA  A VGDQV+APVVAGNGEFLMKLAIGSPP+SFSAIMDTGS
Sbjct: 75  VARGKNRLHRLNAMVLAAA-NATVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGS 134

Query: 133 DLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSNDGCEYLYTYGD 192
           DLIWTQCKPCQQCFDQSTPIFDPK+SSSF K+SCSSELCGALPTS CS+DGCEYLYTYGD
Sbjct: 135 DLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGD 194

Query: 193 YSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPLSLVSQ 252
            SST G+L  ETFTFGD  EDQ+SI  +GFGCG+DN GDGFSQGAGLVGLGRGPLSLVSQ
Sbjct: 195 SSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQ 254

Query: 253 LKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYLSLEGI 312
           LKEQKFAYCLT IDDSKPSSLL+GSLAN+ PK S+DE+KTTPLI+NPSQPSFYYLSL+GI
Sbjct: 255 LKEQKFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGI 314

Query: 313 SVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPVDDSGT 372
           SVGG+ L+IP+ TFEL DDGSGGVIIDSGTTITY++ SAFT LK EFIAQM LPVDDSGT
Sbjct: 315 SVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGT 374

Query: 373 GGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCLAIGSSSGMSIF 432
           GGLDLCF LP+   QVEVPKLTFHFK ADLELPGENYMIGDS AGLLCLAIGSS GMSIF
Sbjct: 375 GGLDLCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIF 434

Query: 433 GNLQQQNFMVVHDLQEETISFVPTQCDGI 461
           GNLQQQNFMVVHDLQEET+SF+PTQCD I
Sbjct: 435 GNLQQQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of Moc01g15010 vs. ExPASy TrEMBL
Match: A0A5A7TD10 (Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold128G001020 PE=3 SV=1)

HSP 1 Score: 718.8 bits (1854), Expect = 1.5e-203
Identity = 364/450 (80.89%), Postives = 401/450 (89.11%), Query Frame = 0

Query: 12  IVLVAVLATLFFIDLTVSSSTLSRRALQQ-QKLLNNGFRMRLHHVDHHVKNLTRFERLQR 71
           ++L+ V  T  FI+    SS+LS RALQ+  KL ++GFR+RL HVD HVKNLTRFERL+R
Sbjct: 14  LLLLIVFITTLFINTLAFSSSLSTRALQKPNKLPSHGFRVRLKHVD-HVKNLTRFERLRR 73

Query: 72  GAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGSPPKSFSAIMDTG 131
           G ARG+NRL RLNAMVLAAA  A+VGDQV+APVVAGNGEFLMKLAIGSPP+SFSAIMDTG
Sbjct: 74  GVARGKNRLHRLNAMVLAAA-NASVGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTG 133

Query: 132 SDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSNDGCEYLYTYG 191
           SDLIWTQCKPCQQCFDQ+TPIFDPK+SSSFSK+SC SELCGALPTS CS+DGCEYLYTYG
Sbjct: 134 SDLIWTQCKPCQQCFDQATPIFDPKQSSSFSKISCRSELCGALPTSTCSSDGCEYLYTYG 193

Query: 192 DYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPLSLVS 251
           D SST G+L  ETFTFGD  EDQ+SI  +GFGCG+DN GDGFSQGAGLVGLGRGPLSLVS
Sbjct: 194 DSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVS 253

Query: 252 QLKEQKFAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYLSLEG 311
           QLKEQKFAYCLT IDDSKPSSLL+GSLAN+ PK S+DE+K TPLI+NPSQPSFYYLSL+G
Sbjct: 254 QLKEQKFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKATPLIKNPSQPSFYYLSLQG 313

Query: 312 ISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPVDDSG 371
           ISVGG+ L+IP+ TFEL DDGSGGVIIDSGTTITYI+ +AF+ LK EFIAQM LPVDDSG
Sbjct: 314 ISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYIESTAFSSLKNEFIAQMNLPVDDSG 373

Query: 372 TGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMIGDSAAGLLCLAIGSSSGMSI 431
           TGGLDLCF LP+  TQVEVPKLTFHFK ADLELPGENYMIGDS  GLLCLAIGSS GMSI
Sbjct: 374 TGGLDLCFNLPAGTTQVEVPKLTFHFKGADLELPGENYMIGDSKTGLLCLAIGSSRGMSI 433

Query: 432 FGNLQQQNFMVVHDLQEETISFVPTQCDGI 461
           FGNLQQQNFMVVHDLQEET+SF+PTQCD I
Sbjct: 434 FGNLQQQNFMVVHDLQEETLSFLPTQCDSI 461

BLAST of Moc01g15010 vs. TAIR 10
Match: AT2G03200.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 530.8 bits (1366), Expect = 1.1e-150
Identity = 283/467 (60.60%), Postives = 354/467 (75.80%), Query Frame = 0

Query: 4   SLCSVRYPIVLVAVLATLFFIDLTVSSSTLSRRALQQQKLLNN----GFRMRLHHVDHHV 63
           S  S+ +P  L+     LF   ++VSS   SRR+L  + L  N    GFR+ L HVD   
Sbjct: 5   SSSSLLFPFFLI-----LFSCLISVSS---SRRSLIDRTLPKNLPRSGFRLSLRHVDSG- 64

Query: 64  KNLTRFERLQRGAARGRNRLQRLNAM-VLAAAGGAAVGDQVQAPVVAGNGEFLMKLAIGS 123
           KNLT+ +++QRG  RG +RL RL A+ VLA A      + ++AP   G+GEFLM+L+IG+
Sbjct: 65  KNLTKIQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGN 124

Query: 124 PPKSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSAC 183
           P   +SAI+DTGSDLIWTQCKPC +CFDQ TPIFDP++SSS+SK+ CSS LC ALP S C
Sbjct: 125 PAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNC 184

Query: 184 S--NDGCEYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGA 243
           +   D CEYLYTYGDYSST GLL  ETFTF    ED+ SIS IGFGCG +NEGDGFSQG+
Sbjct: 185 NEDKDACEYLYTYGDYSSTRGLLATETFTF----EDENSISGIGFGCGVENEGDGFSQGS 244

Query: 244 GLVGLGRGPLSLVSQLKEQKFAYCLTPIDDSK-PSSLLMGSLANVKPKKSQDEI-----K 303
           GLVGLGRGPLSL+SQLKE KF+YCLT I+DS+  SSL +GSLA+    K+   +     K
Sbjct: 245 GLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTK 304

Query: 304 TTPLIRNPSQPSFYYLSLEGISVGGSHLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSA 363
           T  L+RNP QPSFYYL L+GI+VG   L++ + TFEL +DG+GG+IIDSGTTITY++++A
Sbjct: 305 TMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETA 364

Query: 364 FTLLKKEFIAQMKLPVDDSGTGGLDLCFKLPSEATQVEVPKLTFHFKDADLELPGENYMI 423
           F +LK+EF ++M LPVDDSG+ GLDLCFKLP  A  + VPK+ FHFK ADLELPGENYM+
Sbjct: 365 FKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMV 424

Query: 424 GDSAAGLLCLAIGSSSGMSIFGNLQQQNFMVVHDLQEETISFVPTQC 458
            DS+ G+LCLA+GSS+GMSIFGN+QQQNF V+HDL++ET+SFVPT+C
Sbjct: 425 ADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTEC 458

BLAST of Moc01g15010 vs. TAIR 10
Match: AT3G18490.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 260.8 bits (665), Expect = 2.1e-69
Identity = 143/368 (38.86%), Postives = 208/368 (56.52%), Query Frame = 0

Query: 97  DQVQAPVVA----GNGEFLMKLAIGSPPKSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIF 156
           + +  PVV+    G+GE+  ++ +G+P K    ++DTGSD+ W QC+PC  C+ QS P+F
Sbjct: 145 EDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVF 204

Query: 157 DPKESSSFSKLSCSSELCGALPTSACSNDGCEYLYTYGDYSSTHGLLGAETFTFGDPGED 216
           +P  SS++  L+CS+  C  L TSAC ++ C Y  +YGD S T G L  +T TFG+ G+ 
Sbjct: 205 NPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK- 264

Query: 217 QVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTPIDDSKPSSL 276
              I+ +  GCG DNEG  F+  AGL+GLG G LS+ +Q+K   F+YCL   D  K SSL
Sbjct: 265 ---INNVALGCGHDNEG-LFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSL 324

Query: 277 LMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYLSLEGISVGGSHLAIPQPTFELRDDGS 336
              S+      +      T PL+RN    +FYY+ L G SVGG  + +P   F++   GS
Sbjct: 325 DFNSV------QLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGS 384

Query: 337 GGVIIDSGTTITYIDKSAFTLLKKEFI-AQMKLPVDDSGTGGLDLCFKLPSEATQVEVPK 396
           GGVI+D GT +T +   A+  L+  F+   + L    S     D C+   S +T V+VP 
Sbjct: 385 GGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST-VKVPT 444

Query: 397 LTFHFKDA-DLELPGENYMIGDSAAGLLCLAIG-SSSGMSIFGNLQQQNFMVVHDLQEET 456
           + FHF     L+LP +NY+I    +G  C A   +SS +SI GN+QQQ   + +DL +  
Sbjct: 445 VAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNV 500

Query: 457 ISFVPTQC 458
           I     +C
Sbjct: 505 IGLSGNKC 500

BLAST of Moc01g15010 vs. TAIR 10
Match: AT1G25510.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 255.4 bits (651), Expect = 8.9e-68
Identity = 164/443 (37.02%), Postives = 229/443 (51.69%), Query Frame = 0

Query: 39  QQQKLLNNGFRMRLH---------HVDHHVKNLTRFERLQRGAARGRNRLQRLNAMV--L 98
           +Q    ++ F ++LH         H D+  K+LT   RL R  AR ++ + RL+  +  +
Sbjct: 58  EQTHSASSSFSLQLHSRVSVRGTEHSDY--KSLT-LARLNRDTARVKSLITRLDLAINNI 117

Query: 99  AAAGGAAVG-------DQVQAPVVA----GNGEFLMKLAIGSPPKSFSAIMDTGSDLIWT 158
           + A    +          ++AP+++    G+GE+  ++ IG P +    ++DTGSD+ W 
Sbjct: 118 SKADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWL 177

Query: 159 QCKPCQQCFDQSTPIFDPKESSSFSKLSCSSELCGALPTSACSNDGCEYLYTYGDYSSTH 218
           QC PC  C+ Q+ PIF+P  SSS+  LSC +  C AL  S C N  C Y  +YGD S T 
Sbjct: 178 QCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTV 237

Query: 219 GLLGAETFTFGDPGEDQVSISEIGFGCGDDNEGDGFSQGAGLVGLGRGPLSLVSQLKEQK 278
           G    ET T G        +  +  GCG  NEG  F   AGL+GLG G L+L SQL    
Sbjct: 238 GDFATETLTIG-----STLVQNVAVGCGHSNEG-LFVGAAGLLGLGGGLLALPSQLNTTS 297

Query: 279 FAYCLTPIDDSKPSSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYLSLEGISVGGS 338
           F+YCL   D    S++  G+        S D +   PL+RN    +FYYL L GISVGG 
Sbjct: 298 FSYCLVDRDSDSASTVDFGT------SLSPDAV-VAPLLRNHQLDTFYYLGLTGISVGGE 357

Query: 339 HLAIPQPTFELRDDGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPVDDSGTGGLDL 398
            L IPQ +FE+ + GSGG+IIDSGT +T +    +  L+  F+         +G    D 
Sbjct: 358 LLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDT 417

Query: 399 CFKLPSEATQVEVPKLTFHFKDAD-LELPGENYMIGDSAAGLLCLAIG-SSSGMSIFGNL 458
           C+ L S  T VEVP + FHF     L LP +NYMI   + G  CLA   ++S ++I GN+
Sbjct: 418 CYNL-SAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNV 477

BLAST of Moc01g15010 vs. TAIR 10
Match: AT1G64830.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 251.5 bits (641), Expect = 1.3e-66
Identity = 152/419 (36.28%), Postives = 230/419 (54.89%), Query Frame = 0

Query: 46  NGFRMRLHHVDHHVKNLTRFERLQRGAARGRNRLQRLNAMVLAAAGGAAVGDQVQAPVVA 105
           +GF + L H D        +   +  + R RN ++R     L  +   A  +  Q+ + +
Sbjct: 24  DGFTIDLIHRDSPKSPF--YNSAETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITS 83

Query: 106 GNGEFLMKLAIGSPPKSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKESSSFSKLSC 165
             GE+LM ++IG+PP    AI DTGSDLIWTQC PC+ C+ Q++P+FDPKESS++ K+SC
Sbjct: 84  NRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSC 143

Query: 166 SSELCGALPTSACSND--GCEYLYTYGDYSSTHGLLGAETFTFGDPGEDQVSISEIGFGC 225
           SS  C AL  ++CS D   C Y  TYGD S T G +  +T T G  G   VS+  +  GC
Sbjct: 144 SSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGC 203

Query: 226 GDDNEGDGFSQGAGLVGLGRGPLSLVSQLKEQ---KFAYCLTPI--DDSKPSSLLMGSLA 285
           G +N G     G+G++GLG G  SLVSQL++    KF+YCL P   +    S +  G+  
Sbjct: 204 GHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNG 263

Query: 286 NVKPKKSQDEIKTTPLIRNPSQPSFYYLSLEGISVGGSHLAIPQPTFELRDDGSGGVIID 345
            V    S D + +T +++     ++Y+L+LE ISVG   +   Q T  +   G G ++ID
Sbjct: 264 IV----SGDGVVSTSMVKK-DPATYYFLNLEAISVGSKKI---QFTSTIFGTGEGNIVID 323

Query: 346 SGTTITYIDKSAFTLLKKEFIAQMKLPVDDSGTGGLDLCFKLPSEATQVEVPKLTFHFKD 405
           SGTT+T +  + +  L+    + +K        G L LC++   +++  +VP +T HFK 
Sbjct: 324 SGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYR---DSSSFKVPDITVHFKG 383

Query: 406 ADLELPGENYMIGDSAAGLLCLAIGSSSGMSIFGNLQQQNFMVVHDLQEETISFVPTQC 458
            D++L   N  +  S   + C A  ++  ++IFGNL Q NF+V +D    T+SF  T C
Sbjct: 384 GDVKLGNLNTFVAVS-EDVSCFAFAANEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDC 428

BLAST of Moc01g15010 vs. TAIR 10
Match: AT5G33340.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 251.5 bits (641), Expect = 1.3e-66
Identity = 145/368 (39.40%), Postives = 205/368 (55.71%), Query Frame = 0

Query: 98  QVQAPVVAGNGEFLMKLAIGSPPKSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKES 157
           Q Q  + + +GE+LM ++IG+PP    AI DTGSDL+WTQC PC  C+ Q  P+FDPK S
Sbjct: 78  QPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTS 137

Query: 158 SSFSKLSCSSELCGALPTSA-CS-NDG-CEYLYTYGDYSSTHGLLGAETFTFGDPGEDQV 217
           S++  +SCSS  C AL   A CS ND  C Y  +YGD S T G +  +T T G      +
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPM 197

Query: 218 SISEIGFGCGDDNEGDGFSQGAGLVGLGRGPLSLVSQLKEQ---KFAYCLTPIDDSK--P 277
            +  I  GCG +N G    +G+G+VGLG GP+SL+ QL +    KF+YCL P+   K   
Sbjct: 198 QLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQT 257

Query: 278 SSLLMGSLANVKPKKSQDEIKTTPLIRNPSQPSFYYLSLEGISVGGSHLAIPQPTFELRD 337
           S +  G+ A V    S   + +TPLI   SQ +FYYL+L+ ISVG   +   Q +    +
Sbjct: 258 SKINFGTNAIV----SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQI---QYSGSDSE 317

Query: 338 DGSGGVIIDSGTTITYIDKSAFTLLKKEFIAQMKLPVDDSGTGGLDLCFKLPSEATQVEV 397
              G +IIDSGTT+T +    ++ L+    + +          GL LC+   S    ++V
Sbjct: 318 SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY---SATGDLKV 377

Query: 398 PKLTFHFKDADLELPGENYMIGDSAAGLLCLAIGSSSGMSIFGNLQQQNFMVVHDLQEET 457
           P +T HF  AD++L   N  +   +  L+C A   S   SI+GN+ Q NF+V +D   +T
Sbjct: 378 PVITMHFDGADVKLDSSNAFV-QVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKT 434

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022154910.15.5e-261100.00aspartic proteinase nepenthesin-1 [Momordica charantia][more]
XP_022928703.12.4e-20882.16aspartic proteinase nepenthesin-1 [Cucurbita moschata][more]
XP_038883313.12.7e-20781.34aspartic proteinase nepenthesin-1 [Benincasa hispida][more]
KAG7033562.14.5e-20781.50Aspartic proteinase nepenthesin-1 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_023543561.15.9e-20781.50aspartic proteinase nepenthesin-1 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Q766C38.5e-11651.35Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q766C22.0e-10946.71Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q9LS403.0e-6838.86Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Q6XBF81.8e-6539.40Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Q7XV213.1e-6534.23Aspartyl protease 37 OS=Oryza sativa subsp. japonica OX=39947 GN=AP37 PE=3 SV=2[more]
Match NameE-valueIdentityDescription
A0A6J1DKZ22.6e-261100.00aspartic proteinase nepenthesin-1 OS=Momordica charantia OX=3673 GN=LOC111022058... [more]
A0A6J1EPU61.2e-20882.16aspartic proteinase nepenthesin-1 OS=Cucurbita moschata OX=3662 GN=LOC111435534 ... [more]
A0A6J1HXS03.2e-20681.06aspartic proteinase nepenthesin-1 OS=Cucurbita maxima OX=3661 GN=LOC111467205 PE... [more]
A0A0A0KYT97.1e-20681.96Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G55468... [more]
A0A5A7TD101.5e-20380.89Aspartic proteinase nepenthesin-1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... [more]
Match NameE-valueIdentityDescription
AT2G03200.11.1e-15060.60Eukaryotic aspartyl protease family protein [more]
AT3G18490.12.1e-6938.86Eukaryotic aspartyl protease family protein [more]
AT1G25510.18.9e-6837.02Eukaryotic aspartyl protease family protein [more]
AT1G64830.11.3e-6636.28Eukaryotic aspartyl protease family protein [more]
AT5G33340.11.3e-6639.40Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 429..444
score: 26.0
coord: 335..346
score: 38.11
coord: 116..136
score: 43.19
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 303..453
e-value: 1.3E-33
score: 116.0
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 277..459
e-value: 1.4E-54
score: 186.6
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 96..275
e-value: 7.9E-55
score: 187.8
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 103..457
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 110..276
e-value: 1.6E-50
score: 171.8
NoneNo IPR availablePANTHERPTHR47967OS07G0603500 PROTEIN-RELATEDcoord: 31..459
NoneNo IPR availablePANTHERPTHR47967:SF23OS08G0469000 PROTEINcoord: 31..459
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 335..346
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 110..453
score: 42.364368
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 109..457
e-value: 1.32386E-109
score: 323.061

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc01g15010.1Moc01g15010.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005576 extracellular region
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity