MC02g1075 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC02g1075
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionaspartic proteinase-like protein 2 isoform X1
LocationMC02: 9055050 .. 9060806 (+)
RNA-Seq ExpressionMC02g1075
SyntenyMC02g1075
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptideutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
CTCCTCATGCATTCCACTTTCTCCGCCGATCCCATCGCCGCCAATCTTCTCCTCACCCCTCCCCATCGCGCCATGGTCTTGCCTCTTTACCTCTCTTCCCCTAATTCCTCCAGATTGATCTCCAAGCCTCGCCGCCATCTCCGGGAATCAAATTCCTACAATCTCTCCAACGCTCGCATGCGGCTCTACGACGACCTCCTCCTCAATGGGTGTGTTCCCCTGTTTAGAGGTCTAATGATTTTGATTTGTTTCTGATAAAATGTGTGTTGTTTAAGGTATTATACGACGCGGCTATGGATCGGGACTCCGCCGCAGCAGTTCGCGCTCATAGTTGATACCGGGAGTTCGGTTACCTATGTTCCGTGCGCAAATTGCGAACAATGTGGGAGGCACCAGGTTAAAATGTTTAGAAATTAATGATCGTAGGACCGTTTATACGGTCTTAATATTCTTATTAGATCATCTGCTCTGTTTTCCTGTAATTCAATTACCTTTCAATATATCCCTTATCGTTCTTATTTGACTGGCCTCTATAAATACCGATATGGAATTTGGTAATCTAATACATCAACCTGGTGGGGGGGGGGGGGGGGGGAATAGATCATACAGATCCTAGAATGATACATGTTTATCTGGACAAATTGTTTTTCTTGGAACAAGTGATCTTGGTATCCGACTCTGGATACTCTTGAATATTGGTACATTTCCCTCCTATTTGGTGAAAATTTTCGGTATTCTGTGTGGGTCTTGAAGGTTGTCGACATGTTAGCTTTAAGTATAAAATTTGTCCTGTCATTATTGTATTTATATTAAAATATTTCTAGTTTTTTCTTCATAGGTACCTTGGAATGTGCTGTAGTTGGTACTTAAACTTTTATTTCTATCAGGACCCGAAGTTTGATCCAGATTTGTCAAGCACGTTCCGACCTGTCAAATGCAATCTTGATTGCAGTTGTGACGATGATGGACTGCTGTGTGTCTACGAGAGGCAGTATGCTGAAATGAGCACTAGCAGTGGTATCCTTGGTGAAGATATTATATCCTTTGGCAATCAGAGTGAACTCGTACCCCAGCGTGCTACGTTTGGTTGTGAGACTGTGGAAACTGGTGATCTTTATAGTCAACGTGCTGATGGAATTATGGGTTTGGGCAGTGGTGAACTTAGTATAGTCGATCAACTCGTTGAAAAAGGTGTGATTAATGATACTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGTATCTCTACTCCATCAGATATGGCTTTTAGCTTCTCAGACCGTATGAGAAGGTGTGTACAAATGTCGTTTAACCTTACAAATTGTACTTCATGCTCATTCAGCACAATTACAATTGATTCATCTCACACATTTACTTTAGTTTCGGGATGGCAACAGTCCATATTACAATGTTGATTTGAAGGAGATACGTGTTGCGGGTAAAAAATTGCTTCTGAATCCAAGTGTTTTTGATGGAAAATTTGGAACTGTCTTGGATAGTGGTACAACTTATGCTTACCTACCACAGCCAGCGTTTGGAGCTTTTAAGGATGCTGTAAGCTTCCATTTTTCTGAATTAACATTCACTGCCAGTAGTATATTTACTTTTGTCTGAATTTGAAGAGTGTTGCATCCTTTCCGGAGGACACTGTAAGATGATTTAGTTCTTACTTTTCCAGGATTATCTTTTTGCAACTTCTGTCTCAGTCTTATAATTTTTTTAATTGCATTTTTCTGTACTTCTTCTAGCTGTTTTATATTTTAATAATTTGATATGGCTTACGCTCGATTACATTATTTCCTTGACGACCTTTATAAATGGGATCATTTGATATATTATCGGATGATTTCAATTCATTGTGTGCAGTGGAATGGCAAGATCTTGTGTGATCCGAAAAATATTATTTATACAATGAAATAGGGAAACGAACAGAATGAAATGAAGAAACTTTTAGTTCCTTATCATAAAAATGAAGGATTGATCCCTGTAGTCATTCTATGACACGGCTCATAACCTGTGTCTCTTTCTCTTGCTCTCTCTAATGTTGTTGCCTTGTATGTCTTCTTTGCCATCGACTGTGCAAGTAGTCTCTAGGTCAAGGGCAATAGATCTTGTTCTTTTTTAAGAGTCTTGATTTTAATGTTACTCGTGCAATCTTGAGTTGTTAACTGTTGGAATCCTGTTTTTGACATTTTTATAGAACTCATAATGAACTTAAAAAGTCCTTCAGATTATGGACGAGGTTCATTCTTTGAAGAAGATTGGTGGTCCTGACCCAAATTTTAACGATATATGTTTTTCTGGCGCTGGAAGGTATGGTTAGCTAACACGTTTTGTCAGTTCTAAAAGTGATTTACATCAATGAGTAATTAATCCGTCTCATTATGTTAGTGATGTTGCTGAATTGTCGAAGACATTCCCGGCAGTTGACATGGTATTCGAAAATGGGCAAAAGTTGTCTCTAGCACCAGAAAATTACTTGTTTCGGGTAAGAAATTCCTCTGTGATTTTTTATTCAATTAAGAAAAGTTGTCCCGGCAGTTTATGTAGTTGCTAAGCATTGATCAAGTTAACGGATTCTTCTTTACTGTGAATTCTGCATCATTACTTCAGCACTCAAAGGTACATGGTGCATATTGTCTGGGCATTTTTGAGAACGGAAATGATCAAACTACTCTTCTAGGAGGTACCCTTTTTTTAATGGCCCACATGATTTTTCCCATGAATGCTTATTCTATGGGAAATTTGGCTTTTCTATCTGTTTGTTTTATGAAGTTGTATAATTTTTAATTTGATCACATATTTACATTTTTGCATAAATATTTTTTGTGATAGGAATTATTGTCCGCAACACTCTAGTGATGTATGACAGAGAGAATTCAAAAATTGGATTTTGGAAAACAAACTGTTCTGAGTTATGGGAAAGACTTCACATTTCTAATGATACTGCCCATGCTCCCTCTGTTTCAAATACATCACATGATACTGAAATGGCACCGGCATCTGCTCCGAGCGAGGCACCGGCATCTGCTCCGAGCGAGGCGCCGGCCTCTGCTCCGAGCGAGGCGCCGGCCTCTGCTCCGAGCGAGGCGCCGGCCACTGCTCCAAGCGAGGCACCGGCATCTGCTCCAAGCGAGGCACCACATTACATGATTCCAGGTATAGTTAAGTGCAAGGTTTCTTTTTGATTTTATGCTAGCATGGCAGCTTGACAGCATACGTTGATTCTTTAGTAATTTTCTTTGTGTGAATCATTGTTTTATTCAGTTAACCTTGATTTACTTCGACCTTTTTTTTTTTTAAAGGAGCTAATGTTTCATGCCATTCTTCATGTCATTAGGACAATTGAGGTTCTTTCAGAATTTGTGACAAGAGCAATGGCTTTTTAACATTTTAGTTTACTTTGACTATTTATGAGAACTGTTGGGTGAAGAAAATTATGTAACCAGCATTTTGGAAATATCATTTAGCGGAATTTGATGTAGATATTATAAAATTGCAATTTTCATAAGATGAATTCATGATCTTTTCAGTCTCTACCTTACATGGCCCGTTTGTGAAGTTTTCTTTTGTATTACGTGCAGGAGAGCTCCAGGTTGGACGTATCACATTTGAAATCCTGTTGAACATAAGCTACGAAGATCTGGAGCCTCATATTACAGAACTTTCTGACCTTATTGCTCAAGAGTTAAATGTTAGTTATTCACAGGTTAGTGCACATGACACATCTGGGATTGAAAATTCTTTTGGCAGTATTTGATGATAATTACGGAACAATCTTTAACATTTGCATTTTCAACAGTTCAAATGAATCGTGATGGTAATAACATATAGGTTATATTATAAATTTAATGCATGATTGTGCTTATTGGACTATAAACTATAAAAGCTTTTAATTGTGTCCCTAAATTATGAATTGCGATTCTATACCATTCTTATAGTCAGTATTATTAGCTATATAATAACATTTTATACCTAAATATCAGTTTAGCATGTGTAGCATACTTTTTTCATTAAACAACATATAAATTCCTCTTGTATAGAGATCATACTACTCTATTTTGTCCAGTTATCCACTCCATTCTGCCATTTGTCTTTTATGTATGCTACCCTTATGATTTTTGTTTTTTATGTCATATCAAATTAACCAACGTGGGGTCAATAGGCATGTCATATTATCATCTTGTCAACATTATTATCGGTAGGAACTACAATTCAAAGTTTTTGAGACAATTTAAATGTATTAAAATTTAAGGAATAAATAGTTGCCACCATCAAAATATACGGGCTAAATTTGTAATTTACCCCCACAAATTTCTTTCATTCTTTTCTCGCACCTAACTTTTTACTGCATGTCATTTCACCATTATTTCAGGTTCGTTTATTGAATTTTACCATGCAAGGAAACGATTCACTTATTCAGCTGGCCATAATCCCTGGTGGATCTTCAGAATTTTTCTCACATGCGACTGCCACTGTAAGTGAATAGACTTGCAACTGTAAACTGCATAACAACAGAAAATTCACAAGCATCAAGCCTGCCTTTTAAAATCTTGTTTTACTCTGGCAGACGATAATTGCCCAGATTGTGGAGCATCACATGCAACTACCTCCAACATTTGGAAGTTATCAGGTCGTTCAATGGAATGTCGAGCCTCTAATAAAAAGGTAAATATGTATCAATATATATGTTAAACTATAAGTTTAGTTACCAAACTTTGATGGTTGGGTCTATTTAATTATAAATTTTTGTATATGTTCCTGATTTTTTTTTTTTCTAACAAGTGCATGTCATCAATCCCAGTGTGTTAGTTGGACTAACGAGTGATAATTTTGTTAAAGAGTAAGCTAGTCAGATGAAATAAAGATTGAGTGTGCTAAAAAATTCAATGAAGCAATAGTATTCTGACCCACCTGCCTTTTTTTCATCATTTTGCCAAATTATCACTCATTGATACAATAACAACAGTTAAGTAATAGTGCTGACACATGAGCTTATTAGAATCAAAATCAAAGTTTATGAATATAATTAAAACTTTTTAAAGTTCAGGAATTAAATAATAGACGTAACCATCAAAGTTTAGGAACTAAATTTACAATTTAACTATACACATATTACTTTCCCTAACCAAAGTAGTGTATTCATGGATGGGTATCACCTCTACATTGTTCAGGTCATTGTGGAAGCAACTTTATGTTATGGTGATTGTAGCTGTTATTGTCACGCTTCTTCTTGGGTTGTCAGCATTGGGAGTGTGGCTTATTTGGAGAAGGAGACACCAATCCTTCAATTCTTATAAGCCTGTCAATGCAGCAGCTCCTGAGCAAGAACTCCAGCCCCTGTAAACAAGCTTAACCAACATATTTTTGCTTTCTGCTCCCTCTTTTCATCTTCTTCCTTCCTTTCTTTCTTTCCTGGTCAAATTTTCAGTGGGATATACAGGAAGATTCATTATTTTTCAGGTTTTTTTTATTCATTTATGCTGAAATGTTATTACATTATTTAAAAAAAAAAAAAAAAACTTGGGTGTAATGTTTTTAATTTTGTGGTGGCAGTGGTTCAAGGAATATGTATATAGGACACCCAATTCTTGGAGTTGTCCTTATTTAGAATCTTATATAAGATTGAGAATTAATTTTAATTTTTTGAGGGAAGAGCAAAATGTCTTAACTAATGGCTAAGCCAAGCTCATGTTACTCAGTGAGATGTTAATAACAATAATAGTATTTGAC

mRNA sequence

CTCCTCATGCATTCCACTTTCTCCGCCGATCCCATCGCCGCCAATCTTCTCCTCACCCCTCCCCATCGCGCCATGGTCTTGCCTCTTTACCTCTCTTCCCCTAATTCCTCCAGATTGATCTCCAAGCCTCGCCGCCATCTCCGGGAATCAAATTCCTACAATCTCTCCAACGCTCGCATGCGGCTCTACGACGACCTCCTCCTCAATGGGTATTATACGACGCGGCTATGGATCGGGACTCCGCCGCAGCAGTTCGCGCTCATAGTTGATACCGGGAGTTCGGTTACCTATGTTCCGTGCGCAAATTGCGAACAATGTGGGAGGCACCAGGACCCGAAGTTTGATCCAGATTTGTCAAGCACGTTCCGACCTGTCAAATGCAATCTTGATTGCAGTTGTGACGATGATGGACTGCTGTGTGTCTACGAGAGGCAGTATGCTGAAATGAGCACTAGCAGTGGTATCCTTGGTGAAGATATTATATCCTTTGGCAATCAGAGTGAACTCGTACCCCAGCGTGCTACGTTTGGTTGTGAGACTGTGGAAACTGGTGATCTTTATAGTCAACGTGCTGATGGAATTATGGGTTTGGGCAGTGGTGAACTTAGTATAGTCGATCAACTCGTTGAAAAAGGTGTGATTAATGATACTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGTATCTCTACTCCATCAGATATGGCTTTTAGCTTCTCAGACCGTATGAGAAGTCCATATTACAATGTTGATTTGAAGGAGATACGTGTTGCGGGTAAAAAATTGCTTCTGAATCCAAGTGTTTTTGATGGAAAATTTGGAACTGTCTTGGATAGTGGTACAACTTATGCTTACCTACCACAGCCAGCGTTTGGAGCTTTTAAGGATGCTATTATGGACGAGGTTCATTCTTTGAAGAAGATTGGTGGTCCTGACCCAAATTTTAACGATATATGTTTTTCTGGCGCTGGAAGGTATGTAATTAATCCGTCTCATTATGTTAGTGATGTTGCTGAATTGTCGAAGACATTCCCGGCAGTTGACATGGTATTCGAAAATGGGCAAAAGTTGTCTCTAGCACCAGAAAATTACTTGTTTCGGCACTCAAAGGTACATGGTGCATATTGTCTGGGCATTTTTGAGAACGGAAATGATCAAACTACTCTTCTAGGAGGAATTATTGTCCGCAACACTCTAGTGATGTATGACAGAGAGAATTCAAAAATTGGATTTTGGAAAACAAACTGTTCTGAGTTATGGGAAAGACTTCACATTTCTAATGATACTGCCCATGCTCCCTCTGTTTCAAATACATCACATGATACTGAAATGGCACCGGCATCTGCTCCGAGCGAGGCACCGGCATCTGCTCCGAGAGAGCTCCAGGTTGGACGTATCACATTTGAAATCCTGTTGAACATAAGCTACGAAGATCTGGAGCCTCATATTACAGAACTTTCTGACCTTATTGCTCAAGAGTTAAATGTTAGTTATTCACAGGTTCGTTTATTGAATTTTACCATGCAAGGAAACGATTCACTTATTCAGCTGGCCATAATCCCTGGTGGATCTTCAGAATTTTTCTCACATGCGACTGCCACTACGATAATTGCCCAGATTGTGGAGCATCACATGCAACTACCTCCAACATTTGGAAGTTATCAGGTCGTTCAATGGAATGTCGAGCCTCTAATAAAAAGGTCATTGTGGAAGCAACTTTATGTTATGGTGATTGTAGCTGTTATTGTCACGCTTCTTCTTGGGTTGTCAGCATTGGGAGTGTGGCTTATTTGGAGAAGGAGACACCAATCCTTCAATTCTTATAAGCCTGTCAATGCAGCAGCTCCTGAGCAAGAACTCCAGCCCCTGTAAACAAGCTTAACCAACATATTTTTGCTTTCTGCTCCCTCTTTTCATCTTCTTCCTTCCTTTCTTTCTTTCCTGGTCAAATTTTCAGTGGGATATACAGGAAGATTCATTATTTTTCAGGTTTTTTTTATTCATTTATGCTGAAATGTTATTACATTATTTAAAAAAAAAAAAAAAAACTTGGGTGTAATGTTTTTAATTTTGTGGTGGCAGTGGTTCAAGGAATATGTATATAGGACACCCAATTCTTGGAGTTGTCCTTATTTAGAATCTTATATAAGATTGAGAATTAATTTTAATTTTTTGAGGGAAGAGCAAAATGTCTTAACTAATGGCTAAGCCAAGCTCATGTTACTCAGTGAGATGTTAATAACAATAATAGTATTTGAC

Coding sequence (CDS)

CTCCTCATGCATTCCACTTTCTCCGCCGATCCCATCGCCGCCAATCTTCTCCTCACCCCTCCCCATCGCGCCATGGTCTTGCCTCTTTACCTCTCTTCCCCTAATTCCTCCAGATTGATCTCCAAGCCTCGCCGCCATCTCCGGGAATCAAATTCCTACAATCTCTCCAACGCTCGCATGCGGCTCTACGACGACCTCCTCCTCAATGGGTATTATACGACGCGGCTATGGATCGGGACTCCGCCGCAGCAGTTCGCGCTCATAGTTGATACCGGGAGTTCGGTTACCTATGTTCCGTGCGCAAATTGCGAACAATGTGGGAGGCACCAGGACCCGAAGTTTGATCCAGATTTGTCAAGCACGTTCCGACCTGTCAAATGCAATCTTGATTGCAGTTGTGACGATGATGGACTGCTGTGTGTCTACGAGAGGCAGTATGCTGAAATGAGCACTAGCAGTGGTATCCTTGGTGAAGATATTATATCCTTTGGCAATCAGAGTGAACTCGTACCCCAGCGTGCTACGTTTGGTTGTGAGACTGTGGAAACTGGTGATCTTTATAGTCAACGTGCTGATGGAATTATGGGTTTGGGCAGTGGTGAACTTAGTATAGTCGATCAACTCGTTGAAAAAGGTGTGATTAATGATACTTTCTCATTATGCTATGGTGGTATGGATATTGGTGGTGGTGCTATGGTTCTTGGTGGTATCTCTACTCCATCAGATATGGCTTTTAGCTTCTCAGACCGTATGAGAAGTCCATATTACAATGTTGATTTGAAGGAGATACGTGTTGCGGGTAAAAAATTGCTTCTGAATCCAAGTGTTTTTGATGGAAAATTTGGAACTGTCTTGGATAGTGGTACAACTTATGCTTACCTACCACAGCCAGCGTTTGGAGCTTTTAAGGATGCTATTATGGACGAGGTTCATTCTTTGAAGAAGATTGGTGGTCCTGACCCAAATTTTAACGATATATGTTTTTCTGGCGCTGGAAGGTATGTAATTAATCCGTCTCATTATGTTAGTGATGTTGCTGAATTGTCGAAGACATTCCCGGCAGTTGACATGGTATTCGAAAATGGGCAAAAGTTGTCTCTAGCACCAGAAAATTACTTGTTTCGGCACTCAAAGGTACATGGTGCATATTGTCTGGGCATTTTTGAGAACGGAAATGATCAAACTACTCTTCTAGGAGGAATTATTGTCCGCAACACTCTAGTGATGTATGACAGAGAGAATTCAAAAATTGGATTTTGGAAAACAAACTGTTCTGAGTTATGGGAAAGACTTCACATTTCTAATGATACTGCCCATGCTCCCTCTGTTTCAAATACATCACATGATACTGAAATGGCACCGGCATCTGCTCCGAGCGAGGCACCGGCATCTGCTCCGAGAGAGCTCCAGGTTGGACGTATCACATTTGAAATCCTGTTGAACATAAGCTACGAAGATCTGGAGCCTCATATTACAGAACTTTCTGACCTTATTGCTCAAGAGTTAAATGTTAGTTATTCACAGGTTCGTTTATTGAATTTTACCATGCAAGGAAACGATTCACTTATTCAGCTGGCCATAATCCCTGGTGGATCTTCAGAATTTTTCTCACATGCGACTGCCACTACGATAATTGCCCAGATTGTGGAGCATCACATGCAACTACCTCCAACATTTGGAAGTTATCAGGTCGTTCAATGGAATGTCGAGCCTCTAATAAAAAGGTCATTGTGGAAGCAACTTTATGTTATGGTGATTGTAGCTGTTATTGTCACGCTTCTTCTTGGGTTGTCAGCATTGGGAGTGTGGCTTATTTGGAGAAGGAGACACCAATCCTTCAATTCTTATAAGCCTGTCAATGCAGCAGCTCCTGAGCAAGAACTCCAGCCCCTGTAA

Protein sequence

LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSSTFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCETVETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTPSDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFGAFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFENGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFWKTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPRELQVGRITFEILLNISYEDLEPHITELSDLIAQELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTFGSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPVNAAAPEQELQPL
Homology
BLAST of MC02g1075 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 160.2 bits (404), Expect = 7.5e-38
Identity = 133/426 (31.22%), Postives = 198/426 (46.48%), Query Frame = 0

Query: 35  NSSRLISKPRRHLRESNSY-NLSNARMRLYDDLLLN--------GYYTTRLWIGTPPQQF 94
           N +   +   + L E  S+ +  +ARM    DL L         G Y T++ +G+PP+++
Sbjct: 32  NVTHKFAGKEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLGSPPKEY 91

Query: 95  ALIVDTGSSVTYVPCANCEQCGRHQD-----PKFDPDLSSTFRPVKCNLD-CS----CDD 154
            + VDTGS + +V CA C +C    D       +D   SST + V C  D CS     + 
Sbjct: 92  YVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSFIMQSET 151

Query: 155 DGLL--CVYERQYAEMSTSSGILGEDIISF----GN-QSELVPQRATFGCETVETGDL-- 214
            G    C Y   Y + STS G   +D I+     GN ++  + Q   FGC   ++G L  
Sbjct: 152 CGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQ 211

Query: 215 YSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTPSDMAFS 274
                DGIMG G    SI+ QL   G     FS C   M+ GGG   +G + +P  +  +
Sbjct: 212 TDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMN-GGGIFAVGEVESP--VVKT 271

Query: 275 FSDRMRSPYYNVDLKEIRVAGKKLLLNPSV--FDGKFGTVLDSGTTYAYLPQPAFGAFKD 334
                   +YNV LK + V G  + L PS+   +G  GT++DSGTT AYLPQ  +    +
Sbjct: 272 TPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLY----N 331

Query: 335 AIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFENGQK 394
           ++++++ + +++          CF              S  +   K FP V++ FE+  K
Sbjct: 332 SLIEKITAKQQVKLHMVQETFACF--------------SFTSNTDKAFPVVNLHFEDSLK 391

Query: 395 LSLAPENYLFRHSKVHGAYCLGIFENG-----NDQTTLLGGIIVRNTLVMYDRENSKIGF 426
           LS+ P +YLF  S     YC G    G          LLG +++ N LV+YD EN  IG+
Sbjct: 392 LSVYPHDYLF--SLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGW 434

BLAST of MC02g1075 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 147.9 bits (372), Expect = 3.9e-34
Identity = 118/406 (29.06%), Postives = 181/406 (44.58%), Query Frame = 0

Query: 44  RRHLRESNSYNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANC 103
           RRH R   S +L        D +   G Y T++ +G+PP+++ + VDTGS + ++ C  C
Sbjct: 49  RRHSRMLASIDLPLGGDSRVDSV---GLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPC 108

Query: 104 EQCGRHQD-----PKFDPDLSSTFRPVKCNLD-CS------CDDDGLLCVYERQYAEMST 163
            +C    +       FD + SST + V C+ D CS           L C Y   YA+ ST
Sbjct: 109 PKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADEST 168

Query: 164 SSGILGEDIISFGN-----QSELVPQRATFGCETVETGDLYS--QRADGIMGLGSGELSI 223
           S G    D+++        ++  + Q   FGC + ++G L +     DG+MG G    S+
Sbjct: 169 SDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSV 228

Query: 224 VDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTPSDMAFSFSDRMRSPYYNVDLKEIR 283
           + QL   G     FS C   +  GGG   +G + +P              +YNV L  + 
Sbjct: 229 LSQLAATGDAKRVFSHCLDNVK-GGGIFAVGVVDSPKVKTTPMVPNQM--HYNVMLMGMD 288

Query: 284 VAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFGAFKDAIMDEVHSLKKIGGPDPNFN 343
           V G  L L  S+     GT++DSGTT AY P+  + +  + I+       K+   +  F 
Sbjct: 289 VDGTSLDLPRSIVRNG-GTIVDSGTTLAYFPKVLYDSLIETIL--ARQPVKLHIVEETFQ 348

Query: 344 DICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFENGQKLSLAPENYLFRHSKVHGAYC 403
             CFS +                + + FP V   FE+  KL++ P +YLF  +     YC
Sbjct: 349 --CFSFS--------------TNVDEAFPPVSFEFEDSVKLTVYPHDYLF--TLEEELYC 408

Query: 404 LGIFENG-----NDQTTLLGGIIVRNTLVMYDRENSKIGFWKTNCS 426
            G    G       +  LLG +++ N LV+YD +N  IG+   NCS
Sbjct: 409 FGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427

BLAST of MC02g1075 vs. ExPASy Swiss-Prot
Match: Q9LS40 (Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASPG1 PE=1 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 3.3e-33
Identity = 104/372 (27.96%), Postives = 168/372 (45.16%), Query Frame = 0

Query: 69  NGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSSTFRPVKCN 128
           +G Y +R+ +GTP ++  L++DTGS V ++ C  C  C +  DP F+P  SST++ + C+
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCS 218

Query: 129 L-DCSCDDDGLL----CVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCETVET 188
              CS  +        C+Y+  Y + S + G L  D ++FGN  ++       GC     
Sbjct: 219 APQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKI--NNVALGCGHDNE 278

Query: 189 GDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMV------LGGI 248
           G L++  A G++GLG G LSI +Q+        +FS C    D G  + +      LGG 
Sbjct: 279 G-LFTGAA-GLLGLGGGVLSITNQMKA-----TSFSYCLVDRDSGKSSSLDFNSVQLGGG 338

Query: 249 STPSDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFD----GKFGTVLDSGTTYAY 308
              + +     ++    +Y V L    V G+K++L  ++FD    G  G +LD GT    
Sbjct: 339 DATAPL---LRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTR 398

Query: 309 LPQPAFGAFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKT-F 368
           L   A+ + +DA +    +LKK G    +  D C+               D + LS    
Sbjct: 399 LQTQAYNSLRDAFLKLTVNLKK-GSSSISLFDTCY---------------DFSSLSTVKV 458

Query: 369 PAVDMVFENGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDR 425
           P V   F  G+ L L  +NYL       G +C   F   +   +++G +  + T + YD 
Sbjct: 459 PTVAFHFTGGKSLDLPAKNYLIPVDD-SGTFCFA-FAPTSSSLSIIGNVQQQGTRITYDL 500

BLAST of MC02g1075 vs. ExPASy Swiss-Prot
Match: Q9LHE3 (Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASPG2 PE=2 SV=1)

HSP 1 Score: 136.0 bits (341), Expect = 1.5e-30
Identity = 104/373 (27.88%), Postives = 161/373 (43.16%), Query Frame = 0

Query: 69  NGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSSTFRPVKCN 128
           +G Y  R+ +G+PP+   +++D+GS + +V C  C+ C +  DP FDP  S ++  V C 
Sbjct: 128 SGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCG 187

Query: 129 L-------DCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCETV 188
                   +  C   G  C YE  Y + S + G L  + ++F   ++ V +    GC   
Sbjct: 188 SSVCDRIENSGCHSGG--CRYEVMYGDGSYTKGTLALETLTF---AKTVVRNVAMGCGHR 247

Query: 189 ETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCY--GGMDIGGGAMVLGGIST 248
             G      A G++G+G G +S V QL   G     F  C    G D   G++V G  + 
Sbjct: 248 NRGMFIG--AAGLLGIGGGSMSFVGQL--SGQTGGAFGYCLVSRGTD-STGSLVFGREAL 307

Query: 249 PSDMAFSFSDRMRSP----YYNVDLKEIRVAGKKLLLNPSVFD----GKFGTVLDSGTTY 308
           P  +  S+   +R+P    +Y V LK + V G ++ L   VFD    G  G V+D+GT  
Sbjct: 308 P--VGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAV 367

Query: 309 AYLPQPAFGAFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKT 368
             LP  A+ AF+D    +  +L +  G   +  D C+  +G               +S  
Sbjct: 368 TRLPTAAYVAFRDGFKSQTANLPRASG--VSIFDTCYDLSG--------------FVSVR 427

Query: 369 FPAVDMVFENGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYD 425
            P V   F  G  L+L   N+L       G YC   F       +++G I      V +D
Sbjct: 428 VPTVSFYFTEGPVLTLPARNFLMPVDD-SGTYCFA-FAASPTGLSIIGNIQQEGIQVSFD 470

BLAST of MC02g1075 vs. ExPASy Swiss-Prot
Match: Q6XBF8 (Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1)

HSP 1 Score: 134.8 bits (338), Expect = 3.4e-30
Identity = 101/383 (26.37%), Postives = 163/383 (42.56%), Query Frame = 0

Query: 69  NGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSSTFRPVKCN 128
           +G Y   + IGTPP     I DTGS + +  CA C+ C    DP FDP  SST++ V C+
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCS 146

Query: 129 L--------DCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQ--RATFGC 188
                      SC  +   C Y   Y + S + G +  D ++ G+      Q      GC
Sbjct: 147 SSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGC 206

Query: 189 ETVETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCY----------GGMDIG 248
                G  ++++  GI+GLG G +S++ QL +   I+  FS C             ++ G
Sbjct: 207 GHNNAG-TFNKKGSGIVGLGGGPVSLIKQLGDS--IDGKFSYCLVPLTSKKDQTSKINFG 266

Query: 249 GGAMVLGG--ISTPSDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGT-VL 308
             A+V G   +STP       +   +  +Y + LK I V  K++  + S  +   G  ++
Sbjct: 267 TNAIVSGSGVVSTP-----LIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIII 326

Query: 309 DSGTTYAYLPQPAFGAFKDAIMDEVHSLKKIGGPDPNFN-DICFSGAGRYVINPSHYVSD 368
           DSGTT   LP   +   +DA+   + + KK    DP     +C+S  G   +        
Sbjct: 327 DSGTTLTLLPTEFYSELEDAVASSIDAEKK---QDPQSGLSLCYSATGDLKV-------- 386

Query: 369 VAELSKTFPAVDMVFENGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVR 428
                   P + M F+ G  + L   N   + S+     C      G+   ++ G +   
Sbjct: 387 --------PVITMHFD-GADVKLDSSNAFVQVSE--DLVCFAF--RGSPSFSIYGNVAQM 437

BLAST of MC02g1075 vs. NCBI nr
Match: XP_022149434.1 (aspartic proteinase-like protein 2 isoform X1 [Momordica charantia])

HSP 1 Score: 1214 bits (3142), Expect = 0.0
Identity = 620/672 (92.26%), Postives = 620/672 (92.26%), Query Frame = 0

Query: 1   LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM 60
           LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM
Sbjct: 17  LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM 76

Query: 61  RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS 120
           RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS
Sbjct: 77  RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS 136

Query: 121 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 180
           TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET
Sbjct: 137 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 196

Query: 181 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 240
           VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP
Sbjct: 197 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 256

Query: 241 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 300
           SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG
Sbjct: 257 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 316

Query: 301 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFE 360
           AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAG          SDVAELSKTFPAVDMVFE
Sbjct: 317 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAG----------SDVAELSKTFPAVDMVFE 376

Query: 361 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW 420
           NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW
Sbjct: 377 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW 436

Query: 421 KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPRE------------ 480
           KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAP E            
Sbjct: 437 KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPSEAPASAPSEAPAS 496

Query: 481 -----------------------------LQVGRITFEILLNISYEDLEPHITELSDLIA 540
                                        LQVGRITFEILLNISYEDLEPHITELSDLIA
Sbjct: 497 APSEAPATAPSEAPASAPSEAPHYMIPGELQVGRITFEILLNISYEDLEPHITELSDLIA 556

Query: 541 QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF 600
           QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF
Sbjct: 557 QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF 616

Query: 601 GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV 631
           GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV
Sbjct: 617 GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV 676

BLAST of MC02g1075 vs. NCBI nr
Match: XP_022149435.1 (aspartyl protease family protein 1-like isoform X2 [Momordica charantia])

HSP 1 Score: 1113 bits (2878), Expect = 0.0
Identity = 584/672 (86.90%), Postives = 588/672 (87.50%), Query Frame = 0

Query: 1   LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM 60
           LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM
Sbjct: 17  LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM 76

Query: 61  RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS 120
           RLYDDLLLNGY      +G                  + C+       +QDPKFDPDLSS
Sbjct: 77  RLYDDLLLNGY------LG------------------MCCSWYLNFYFYQDPKFDPDLSS 136

Query: 121 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 180
           TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET
Sbjct: 137 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 196

Query: 181 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 240
           VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP
Sbjct: 197 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 256

Query: 241 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 300
           SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG
Sbjct: 257 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 316

Query: 301 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFE 360
           AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAG          SDVAELSKTFPAVDMVFE
Sbjct: 317 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAG----------SDVAELSKTFPAVDMVFE 376

Query: 361 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW 420
           NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW
Sbjct: 377 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW 436

Query: 421 KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPRE------------ 480
           KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAP E            
Sbjct: 437 KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPSEAPASAPSEAPAS 496

Query: 481 -----------------------------LQVGRITFEILLNISYEDLEPHITELSDLIA 540
                                        LQVGRITFEILLNISYEDLEPHITELSDLIA
Sbjct: 497 APSEAPATAPSEAPASAPSEAPHYMIPGELQVGRITFEILLNISYEDLEPHITELSDLIA 556

Query: 541 QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF 600
           QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF
Sbjct: 557 QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF 616

Query: 601 GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV 631
           GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV
Sbjct: 617 GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV 654

BLAST of MC02g1075 vs. NCBI nr
Match: XP_038902862.1 (aspartic proteinase CDR1-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1041 bits (2691), Expect = 0.0
Identity = 514/633 (81.20%), Postives = 567/633 (89.57%), Query Frame = 0

Query: 1   LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRE-SNSYNLSNAR 60
           +L+H    ADPI++N LLTP HRAMVLPLYLSSPNSS+LIS P RHLR+  +S N SNAR
Sbjct: 11  ILLHFLLFADPISSNPLLTPSHRAMVLPLYLSSPNSSKLISNPHRHLRQFPSSNNRSNAR 70

Query: 61  MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLS 120
           MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGS+VTYVPC+ CE+CGRHQDPKF+P+ S
Sbjct: 71  MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEECGRHQDPKFEPESS 130

Query: 121 STFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCE 180
           ST+ PVKCN+DC+CD+DGL CVYERQYAEMSTSSG+LGED+ISFGNQSEL+PQRA FGCE
Sbjct: 131 STYEPVKCNIDCTCDNDGLQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCE 190

Query: 181 TVETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGIST 240
            VETGDL+SQRADGIMGLG+G+LSIVDQLVEKGVIND+FSLCYGGMDIGGGAMVLGGIS 
Sbjct: 191 NVETGDLFSQRADGIMGLGTGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISP 250

Query: 241 PSDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAF 300
           PSDM FS+SD +RSPYYNVDLKEI VAGK+LLL PS+FDG++GTVLDSGTTYAYLP  AF
Sbjct: 251 PSDMIFSYSDPVRSPYYNVDLKEIHVAGKRLLLTPSIFDGRYGTVLDSGTTYAYLPVEAF 310

Query: 301 GAFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVF 360
           GAFKDAIMDE+HSLKKI GPDPNF DICFSGAG          SD AELS  FP VDMVF
Sbjct: 311 GAFKDAIMDELHSLKKIDGPDPNFKDICFSGAG----------SDAAELSNIFPTVDMVF 370

Query: 361 ENGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGF 420
           +NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGI+VRNTLVMYDR +SKIGF
Sbjct: 371 DNGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRAHSKIGF 430

Query: 421 WKTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPA-SAPRELQVGRITFEI 480
           WKTNCSELWERLHIS+D AHAPSVSNTSHDT++APASAP E+P  + P ELQ+GRITFEI
Sbjct: 431 WKTNCSELWERLHISDDHAHAPSVSNTSHDTDIAPASAPDESPHYTIPGELQIGRITFEI 490

Query: 481 LLNISYEDLEPHITELSDLIAQELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSH 540
           LLNISY DLEPHITELSD IA ELNVS+SQV LLNFTM+GNDSLIQLAI+P   SEFFSH
Sbjct: 491 LLNISYTDLEPHITELSDHIAHELNVSHSQVLLLNFTMRGNDSLIQLAILPNEPSEFFSH 550

Query: 541 ATATTIIAQIVEHHMQLPPTFGSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLS 600
           ATA TII+ IVEHHMQLPPTFGSYQV+QW +EPL++RSLWK+LY+MV +A+IVTL+LGLS
Sbjct: 551 ATAITIISLIVEHHMQLPPTFGSYQVLQWKIEPLMERSLWKRLYIMVGLAIIVTLILGLS 610

Query: 601 ALGVWLIWRRRHQSFNSYKPVNAAAPEQELQPL 631
           ALG W I RRR  +FNSY PVNAA PEQELQPL
Sbjct: 611 ALGAWFILRRRQAAFNSYMPVNAAVPEQELQPL 633

BLAST of MC02g1075 vs. NCBI nr
Match: XP_022985603.1 (aspartic proteinase nepenthesin-1-like [Cucurbita maxima])

HSP 1 Score: 1030 bits (2662), Expect = 0.0
Identity = 510/633 (80.57%), Postives = 569/633 (89.89%), Query Frame = 0

Query: 2   LMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRE-SNSYNLSNARM 61
           L H T SADPI++N LLTP HRAMVLPLY SSPNSS+LISKP R LR   NS N SNARM
Sbjct: 18  LTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPHRRLRGFPNSNNRSNARM 77

Query: 62  RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS 121
           RLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGS+VTYVPC+ CE CG+HQDPKFDP+LSS
Sbjct: 78  RLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSS 137

Query: 122 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 181
           T++PVKCN DC+CD+DG+ CVYERQYAEMSTSSG+LG+D+ISFGNQS LVPQRA FGCE 
Sbjct: 138 TYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCEN 197

Query: 182 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 241
            ETGDLYSQRADGIMGLGSG+LSIVDQLVEKGVIND+FSLCYGGMDIGGGAMVLGGIS P
Sbjct: 198 EETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPP 257

Query: 242 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 301
           S+M FS+SD +RSPYYNVDLKEI VAGKKL L PSVFDG++G+VLDSGTTY+YLPQ AFG
Sbjct: 258 SEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFG 317

Query: 302 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFE 361
            FK+AIM+ +HSLKKIGGPDPNF D CFSGAG          SD AELSKTFP VD++F+
Sbjct: 318 PFKNAIMNALHSLKKIGGPDPNFKDTCFSGAG----------SDAAELSKTFPTVDLIFD 377

Query: 362 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGN-DQTTLLGGIIVRNTLVMYDRENSKIGF 421
           NGQKLSLAPENYLFRHSKVHGAYCLGIFENGN DQTTLLGGIIVRNTLVMYDRE+SKIGF
Sbjct: 378 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGF 437

Query: 422 WKTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASA-PRELQVGRITFEI 481
           WKTNCSELWERLHIS++ AHAPSVSNTSHDT+ APASAPSE+P    P ++Q+GRITF+I
Sbjct: 438 WKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQIGRITFDI 497

Query: 482 LLNISYEDLEPHITELSDLIAQELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSH 541
           LLNISY+ LEPHIT LSD IAQELNVS+SQVRLLNFTM+GN SLIQLAI+P GSSEFFSH
Sbjct: 498 LLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSH 557

Query: 542 ATATTIIAQIVEHHMQLPPTFGSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLS 601
           ATATTII+ IVEHHM+LPP +GSYQV++WNVEPL+ RSLWK+LYV+V +A++VTL+LGLS
Sbjct: 558 ATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLS 617

Query: 602 ALGVWLIWRRRHQSFNSYKPVNAAAPEQELQPL 631
           A+GVW IWRRR Q+F+SYKPVNAAAPEQELQ L
Sbjct: 618 AVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of MC02g1075 vs. NCBI nr
Match: XP_023512095.1 (aspartic proteinase nepenthesin-1-like [Cucurbita pepo subsp. pepo] >XP_023522201.1 aspartic proteinase nepenthesin-1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1027 bits (2656), Expect = 0.0
Identity = 510/633 (80.57%), Postives = 568/633 (89.73%), Query Frame = 0

Query: 2   LMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRE-SNSYNLSNARM 61
           L H T SADPI++N LLTP HRAMVLPLY SSPNSS+LISKP R LR   NS N SNARM
Sbjct: 18  LTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPHRRLRGFPNSNNRSNARM 77

Query: 62  RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS 121
           RLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGS+VTYVPC+ CE CG+HQDPKFDP+LSS
Sbjct: 78  RLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSS 137

Query: 122 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 181
           T++PVKCN DC+CD DG+ CVYERQYAEMSTSSG+LG+D+ISFGNQS LVPQRA FGCE 
Sbjct: 138 TYQPVKCNSDCTCDGDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCEN 197

Query: 182 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 241
            ETGDLYSQRADGIMGLGSG+LSIVDQLVEKGVIND+FSLCYGGMDIGGGAMVLGGIS P
Sbjct: 198 EETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPP 257

Query: 242 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 301
           S+M FS+SD +RSPYYNVDLKEI VAGKKL L PSVFDG++G+VLDSGTTY+YLPQ AFG
Sbjct: 258 SEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFG 317

Query: 302 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFE 361
            FK+AIM+ +HSLKKIGGPDPNF D CFSGAG          SD AELSKTFP VD+VF+
Sbjct: 318 PFKNAIMNALHSLKKIGGPDPNFKDTCFSGAG----------SDAAELSKTFPTVDLVFD 377

Query: 362 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGN-DQTTLLGGIIVRNTLVMYDRENSKIGF 421
           NGQKLSLAPENYLFRHSKVHGAYCLGIFENGN DQTTLLGGIIVRNTLVMYDRE+SKIGF
Sbjct: 378 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGF 437

Query: 422 WKTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASA-PRELQVGRITFEI 481
           WKTNCSELWERLHIS+D AHAPSVSNTSHDT+MAPASAPSE+P    P +LQ+GRITF+I
Sbjct: 438 WKTNCSELWERLHISDDNAHAPSVSNTSHDTDMAPASAPSESPYDMIPEDLQIGRITFDI 497

Query: 482 LLNISYEDLEPHITELSDLIAQELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSH 541
           LLNISY+ LEPHIT+LSD IA ELNVS+SQVRLLNFTM+GN SLIQLAI+P GSSEFFS 
Sbjct: 498 LLNISYKHLEPHITQLSDHIAHELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSP 557

Query: 542 ATATTIIAQIVEHHMQLPPTFGSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLS 601
           ATATTII+ IVEHHM+LPP +GSY+V++WNVEPL+ RSLWK+LY++V +A++VTL+LGLS
Sbjct: 558 ATATTIISLIVEHHMKLPPKYGSYRVIRWNVEPLMDRSLWKRLYILVGLAIMVTLILGLS 617

Query: 602 ALGVWLIWRRRHQSFNSYKPVNAAAPEQELQPL 631
           A+GVW IWRRR Q+F+SYKPVNAAAPEQELQ L
Sbjct: 618 AMGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of MC02g1075 vs. ExPASy TrEMBL
Match: A0A6J1D718 (aspartic proteinase-like protein 2 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111017863 PE=3 SV=1)

HSP 1 Score: 1214 bits (3142), Expect = 0.0
Identity = 620/672 (92.26%), Postives = 620/672 (92.26%), Query Frame = 0

Query: 1   LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM 60
           LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM
Sbjct: 17  LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM 76

Query: 61  RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS 120
           RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS
Sbjct: 77  RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS 136

Query: 121 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 180
           TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET
Sbjct: 137 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 196

Query: 181 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 240
           VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP
Sbjct: 197 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 256

Query: 241 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 300
           SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG
Sbjct: 257 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 316

Query: 301 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFE 360
           AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAG          SDVAELSKTFPAVDMVFE
Sbjct: 317 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAG----------SDVAELSKTFPAVDMVFE 376

Query: 361 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW 420
           NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW
Sbjct: 377 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW 436

Query: 421 KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPRE------------ 480
           KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAP E            
Sbjct: 437 KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPSEAPASAPSEAPAS 496

Query: 481 -----------------------------LQVGRITFEILLNISYEDLEPHITELSDLIA 540
                                        LQVGRITFEILLNISYEDLEPHITELSDLIA
Sbjct: 497 APSEAPATAPSEAPASAPSEAPHYMIPGELQVGRITFEILLNISYEDLEPHITELSDLIA 556

Query: 541 QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF 600
           QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF
Sbjct: 557 QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF 616

Query: 601 GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV 631
           GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV
Sbjct: 617 GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV 676

BLAST of MC02g1075 vs. ExPASy TrEMBL
Match: A0A6J1D5P8 (aspartyl protease family protein 1-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC111017863 PE=3 SV=1)

HSP 1 Score: 1113 bits (2878), Expect = 0.0
Identity = 584/672 (86.90%), Postives = 588/672 (87.50%), Query Frame = 0

Query: 1   LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM 60
           LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM
Sbjct: 17  LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRESNSYNLSNARM 76

Query: 61  RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS 120
           RLYDDLLLNGY      +G                  + C+       +QDPKFDPDLSS
Sbjct: 77  RLYDDLLLNGY------LG------------------MCCSWYLNFYFYQDPKFDPDLSS 136

Query: 121 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 180
           TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET
Sbjct: 137 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 196

Query: 181 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 240
           VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP
Sbjct: 197 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 256

Query: 241 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 300
           SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG
Sbjct: 257 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 316

Query: 301 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFE 360
           AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAG          SDVAELSKTFPAVDMVFE
Sbjct: 317 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAG----------SDVAELSKTFPAVDMVFE 376

Query: 361 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW 420
           NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW
Sbjct: 377 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFW 436

Query: 421 KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPRE------------ 480
           KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAP E            
Sbjct: 437 KTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPSEAPASAPSEAPAS 496

Query: 481 -----------------------------LQVGRITFEILLNISYEDLEPHITELSDLIA 540
                                        LQVGRITFEILLNISYEDLEPHITELSDLIA
Sbjct: 497 APSEAPATAPSEAPASAPSEAPHYMIPGELQVGRITFEILLNISYEDLEPHITELSDLIA 556

Query: 541 QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF 600
           QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF
Sbjct: 557 QELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQLPPTF 616

Query: 601 GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV 631
           GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV
Sbjct: 617 GSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFNSYKPV 654

BLAST of MC02g1075 vs. ExPASy TrEMBL
Match: A0A6J1JE38 (aspartic proteinase nepenthesin-1-like OS=Cucurbita maxima OX=3661 GN=LOC111483616 PE=3 SV=1)

HSP 1 Score: 1030 bits (2662), Expect = 0.0
Identity = 510/633 (80.57%), Postives = 569/633 (89.89%), Query Frame = 0

Query: 2   LMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRE-SNSYNLSNARM 61
           L H T SADPI++N LLTP HRAMVLPLY SSPNSS+LISKP R LR   NS N SNARM
Sbjct: 18  LTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPHRRLRGFPNSNNRSNARM 77

Query: 62  RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS 121
           RLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGS+VTYVPC+ CE CG+HQDPKFDP+LSS
Sbjct: 78  RLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSS 137

Query: 122 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 181
           T++PVKCN DC+CD+DG+ CVYERQYAEMSTSSG+LG+D+ISFGNQS LVPQRA FGCE 
Sbjct: 138 TYQPVKCNSDCTCDNDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCEN 197

Query: 182 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 241
            ETGDLYSQRADGIMGLGSG+LSIVDQLVEKGVIND+FSLCYGGMDIGGGAMVLGGIS P
Sbjct: 198 EETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPP 257

Query: 242 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 301
           S+M FS+SD +RSPYYNVDLKEI VAGKKL L PSVFDG++G+VLDSGTTY+YLPQ AFG
Sbjct: 258 SEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFG 317

Query: 302 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFE 361
            FK+AIM+ +HSLKKIGGPDPNF D CFSGAG          SD AELSKTFP VD++F+
Sbjct: 318 PFKNAIMNALHSLKKIGGPDPNFKDTCFSGAG----------SDAAELSKTFPTVDLIFD 377

Query: 362 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGN-DQTTLLGGIIVRNTLVMYDRENSKIGF 421
           NGQKLSLAPENYLFRHSKVHGAYCLGIFENGN DQTTLLGGIIVRNTLVMYDRE+SKIGF
Sbjct: 378 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGF 437

Query: 422 WKTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASA-PRELQVGRITFEI 481
           WKTNCSELWERLHIS++ AHAPSVSNTSHDT+ APASAPSE+P    P ++Q+GRITF+I
Sbjct: 438 WKTNCSELWERLHISDENAHAPSVSNTSHDTDTAPASAPSESPHDMIPEDIQIGRITFDI 497

Query: 482 LLNISYEDLEPHITELSDLIAQELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSH 541
           LLNISY+ LEPHIT LSD IAQELNVS+SQVRLLNFTM+GN SLIQLAI+P GSSEFFSH
Sbjct: 498 LLNISYKHLEPHITHLSDHIAQELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSH 557

Query: 542 ATATTIIAQIVEHHMQLPPTFGSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLS 601
           ATATTII+ IVEHHM+LPP +GSYQV++WNVEPL+ RSLWK+LYV+V +A++VTL+LGLS
Sbjct: 558 ATATTIISLIVEHHMKLPPRYGSYQVIRWNVEPLMDRSLWKRLYVLVGLAIMVTLILGLS 617

Query: 602 ALGVWLIWRRRHQSFNSYKPVNAAAPEQELQPL 631
           A+GVW IWRRR Q+F+SYKPVNAAAPEQELQ L
Sbjct: 618 AVGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of MC02g1075 vs. ExPASy TrEMBL
Match: A0A6J1EYA5 (aspartic proteinase nepenthesin-1-like OS=Cucurbita moschata OX=3662 GN=LOC111439463 PE=3 SV=1)

HSP 1 Score: 1021 bits (2641), Expect = 0.0
Identity = 508/633 (80.25%), Postives = 566/633 (89.42%), Query Frame = 0

Query: 2   LMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRE-SNSYNLSNARM 61
           L H T SADPI++N LLTP HRAMVLPLY SSPNSS+LISKP R LR   NS N SNARM
Sbjct: 18  LTHFTLSADPISSNPLLTPSHRAMVLPLYRSSPNSSKLISKPHRRLRGFPNSNNRSNARM 77

Query: 62  RLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSS 121
           RLYDDLLLNGYYTTRLWIGTPPQ+FALIVDTGS+VTYVPC+ CE CG+HQDPKFDP+LSS
Sbjct: 78  RLYDDLLLNGYYTTRLWIGTPPQKFALIVDTGSTVTYVPCSTCELCGKHQDPKFDPELSS 137

Query: 122 TFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCET 181
           T++PVKCN DC+CD DG+ CVYERQYAEMSTSSG+LG+D+ISFGNQS LVPQRA FGCE 
Sbjct: 138 TYQPVKCNSDCTCDGDGVQCVYERQYAEMSTSSGVLGDDVISFGNQSALVPQRAVFGCEN 197

Query: 182 VETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTP 241
            ETGDLYSQRADGIMGLGSG+LSIVDQLVEKGVIND+FSLCYGGMDIGGGAMVLGGIS P
Sbjct: 198 EETGDLYSQRADGIMGLGSGDLSIVDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGISPP 257

Query: 242 SDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFG 301
           S+M FS+SD +RSPYYNVDLKEI VAGKKL L PSVFDG++G+VLDSGTTY+YLPQ AFG
Sbjct: 258 SEMIFSYSDPVRSPYYNVDLKEIHVAGKKLPLEPSVFDGRYGSVLDSGTTYSYLPQEAFG 317

Query: 302 AFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFE 361
            FK+AI++ +HSLKKIGGPDPNF D CFSGAG          SD AELSKTFP VD+VF+
Sbjct: 318 PFKNAILNALHSLKKIGGPDPNFKDTCFSGAG----------SDAAELSKTFPTVDLVFD 377

Query: 362 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGN-DQTTLLGGIIVRNTLVMYDRENSKIGF 421
           NGQKLSLAPENYLFRHSKVHGAYCLGIFENGN DQTTLLGGIIVRNTLVMYDRE+SKIGF
Sbjct: 378 NGQKLSLAPENYLFRHSKVHGAYCLGIFENGNNDQTTLLGGIIVRNTLVMYDREHSKIGF 437

Query: 422 WKTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASA-PRELQVGRITFEI 481
           WKTNCSELWERLHIS+D A APSVSNTSHDT+MAPASAPSE+P    P +LQ+GRITF+I
Sbjct: 438 WKTNCSELWERLHISDDNADAPSVSNTSHDTDMAPASAPSESPHDMIPEDLQIGRITFDI 497

Query: 482 LLNISYEDLEPHITELSDLIAQELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSH 541
           LLNISY+ LEPHIT+LSD IA ELNVS+SQVRLLNFTM+GN SLIQLAI+P GSSEFFS 
Sbjct: 498 LLNISYKHLEPHITQLSDHIAHELNVSHSQVRLLNFTMRGNHSLIQLAILPNGSSEFFSP 557

Query: 542 ATATTIIAQIVEHHMQLPPTFGSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLS 601
           ATATTII+ IV HHM+LPP +GSYQV++WNVEPL+ RSLWK+LY++V +A++VTL+LGLS
Sbjct: 558 ATATTIISLIVGHHMKLPPKYGSYQVIRWNVEPLMDRSLWKRLYILVGLAIMVTLILGLS 617

Query: 602 ALGVWLIWRRRHQSFNSYKPVNAAAPEQELQPL 631
           A+GVW IWRRR Q+F+SYKPVNAAAPEQELQ L
Sbjct: 618 AMGVWFIWRRRQQAFHSYKPVNAAAPEQELQTL 640

BLAST of MC02g1075 vs. ExPASy TrEMBL
Match: A0A5A7TUH1 (Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold243G003080 PE=3 SV=1)

HSP 1 Score: 1010 bits (2611), Expect = 0.0
Identity = 499/633 (78.83%), Postives = 555/633 (87.68%), Query Frame = 0

Query: 1   LLMHSTFSADPIAANLLLTPPHRAMVLPLYLSSPNSSRLISKPRRHLRE-SNSYNLSNAR 60
           +L+H   SADPI+ N L+TP HRAMVLPLYLSS NSS+ IS P RHLR+   S N SNAR
Sbjct: 12  ILLHFFLSADPISPNPLITPSHRAMVLPLYLSSSNSSKFISNPHRHLRQFPTSDNRSNAR 71

Query: 61  MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLS 120
           MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGS+VTYVPC+ CEQCGRHQDPKFDP+ S
Sbjct: 72  MRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESS 131

Query: 121 STFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCE 180
           ST++P+KCN+DC+CD DG+ CVYERQYAEMSTSSG+LGED+ISFGNQSEL+PQRA FGCE
Sbjct: 132 STYKPIKCNIDCTCDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCE 191

Query: 181 TVETGDLYSQRADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGIST 240
            +ETGDL+SQRADGIMGLG+G+LS+VDQLVEKG IND+FSLCYGGMDIGGGAMVLGGIS 
Sbjct: 192 NMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISP 251

Query: 241 PSDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAF 300
           PSDM F++SD +RSPYYNVDLKEI VAGKKL L+ S+FDG++GTVLDSGTTYAYLP  AF
Sbjct: 252 PSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSSIFDGRYGTVLDSGTTYAYLPAEAF 311

Query: 301 GAFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVF 360
           GAFKDAIMDE+HSLKKI GPDPNF DICFSGAG          SD AELS  FP VDMVF
Sbjct: 312 GAFKDAIMDELHSLKKIDGPDPNFKDICFSGAG----------SDAAELSNIFPTVDMVF 371

Query: 361 ENGQKLSLAPENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGF 420
           ENGQKLSLAPENY FRHSKVHGAYCLGIFENGNDQTTLLGGI+VRNTLVMYDR +SKIGF
Sbjct: 372 ENGQKLSLAPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRAHSKIGF 431

Query: 421 WKTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPA-SAPRELQVGRITFEI 480
           WKTNCSELWERL  S+D AHAPS+S  SH ++MAPASAP E+P  + P ELQ+GRITFEI
Sbjct: 432 WKTNCSELWERLRTSDDNAHAPSISTKSHGSDMAPASAPIESPHYTIPGELQIGRITFEI 491

Query: 481 LLNISYEDLEPHITELSDLIAQELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSH 540
           LLN SY DLEPHITELSD IAQELNVS+SQV LLNFTM+GNDSLI+LAIIP GSSE FSH
Sbjct: 492 LLNKSYTDLEPHITELSDHIAQELNVSHSQVLLLNFTMRGNDSLIKLAIIPYGSSEIFSH 551

Query: 541 ATATTIIAQIVEHHMQLPPTFGSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLS 600
           AT  TII++IVEHHMQLPPTFGSYQVV+WNVEP ++RS+WK+LYV+V +A+IV  +LGLS
Sbjct: 552 ATVNTIISKIVEHHMQLPPTFGSYQVVRWNVEPPMERSMWKRLYVLVGLAIIVIFILGLS 611

Query: 601 ALGVWLIWRRRHQSFNSYKPVNAAAPEQELQPL 631
           ALG W I R R Q+ NSYKPVNAA PEQELQPL
Sbjct: 612 ALGAWFILRSRQQAINSYKPVNAAVPEQELQPL 634

BLAST of MC02g1075 vs. TAIR 10
Match: AT3G50050.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 731.5 bits (1887), Expect = 5.8e-211
Identity = 367/623 (58.91%), Postives = 469/623 (75.28%), Query Frame = 0

Query: 15  NLLLTPP----HRAMVLPLYLSSPN-SSRLISKPRRHLRESNSYNLSNARMRLYDDLLLN 74
           NLL   P     R MV PL+LS PN SSR IS P R L +S+S +L ++RMRLYDDLL+N
Sbjct: 31  NLLHQSPTARSRRPMVFPLFLSQPNSSSRSISIPHRKLHKSDSKSLPHSRMRLYDDLLIN 90

Query: 75  GYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSSTFRPVKCNL 134
           GYYTTRLWIGTPPQ FALIVD+GS+VTYVPC++CEQCG+HQDPKF P++SST++PVKCN+
Sbjct: 91  GYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNM 150

Query: 135 DCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCETVETGDLYSQ 194
           DC+CDDD   CVYER+YAE S+S G+LGED+ISFGN+S+L PQRA FGCETVETGDLYSQ
Sbjct: 151 DCNCDDDREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQ 210

Query: 195 RADGIMGLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTPSDMAFSFSD 254
           RADGI+GLG G+LS+VDQLV+KG+I+++F LCYGGMD+GGG+M+LGG   PSDM F+ SD
Sbjct: 211 RADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMVFTDSD 270

Query: 255 RMRSPYYNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFGAFKDAIMDE 314
             RSPYYN+DL  IRVAGK+L L+  VFDG+ G VLDSGTTYAYLP  AF AF++A+M E
Sbjct: 271 PDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYLPDAAFAAFEEAVMRE 330

Query: 315 VHSLKKIGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFENGQKLSLAP 374
           V +LK+I GPDPNF D CF  A       S+YVS   ELSK FP+V+MVF++GQ   L+P
Sbjct: 331 VSTLKQIDGPDPNFKDTCFQVAA------SNYVS---ELSKIFPSVEMVFKSGQSWLLSP 390

Query: 375 ENYLFRHSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFWKTNCSELWE 434
           ENY+FRHSKVHGAYCLG+F NG D TTLLGGI+VRNTLV+YDRENSK+GFW+TNCSEL +
Sbjct: 391 ENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSELSD 450

Query: 435 RLHISNDTAHAPSVSNTSHDTEMAPASAPSEAPASAPRELQVGRITFEILLNISYEDLEP 494
           RLHI      A   SN S+ +  + ++             QVG+I  +I L ++   L+P
Sbjct: 451 RLHIDGAPPPATLPSNDSNPSHNSSSNLSGVT--------QVGQINLDIQLTVNSSYLKP 510

Query: 495 HITELSDLIAQELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIV 554
            I +LS + ++EL+V  SQV L N T +GN+SL+++ ++P   S +FS+ TAT I+++  
Sbjct: 511 RIEDLSKIFSKELDVKSSQVSLSNLTSKGNESLVRMVVLPPEPSTWFSNVTATNIVSRFT 570

Query: 555 EHHMQLPPTFGSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRR 614
            H ++LP  FG+YQ+V + +EP  KR+      ++VI   I+ +++GLSA G WLIW+R+
Sbjct: 571 NHQIKLPEIFGNYQLVNYKLEPPRKRT---NNNIVVIAIGIIAVIVGLSAYGAWLIWKRK 630

Query: 615 HQSFNSYKPVN-AAAPEQELQPL 632
             S   YKPV+ A   EQELQP+
Sbjct: 631 QTSI-PYKPVDEAIVAEQELQPI 632

BLAST of MC02g1075 vs. TAIR 10
Match: AT5G43100.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 710.3 bits (1832), Expect = 1.4e-204
Identity = 354/617 (57.37%), Postives = 454/617 (73.58%), Query Frame = 0

Query: 17  LLTPPHRAMVLPL-YLSSPNSSRLISKPRRHLRESNSYNLSNARMRLYDDLLLNGYYTTR 76
           L T     M+ PL Y S P   R+    RR L +S    L NA M+LYDDLL NGYYTTR
Sbjct: 23  LTTADESPMIFPLSYSSLPPRPRVEDFRRRRLHQS---QLPNAHMKLYDDLLSNGYYTTR 82

Query: 77  LWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPKFDPDLSSTFRPVKCNLDCSCDD 136
           LWIGTPPQ+FALIVDTGS+VTYVPC+ C+QCG+HQDPKF P+LS++++ +KCN DC+CDD
Sbjct: 83  LWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCNPDCNCDD 142

Query: 137 DGLLCVYERQYAEMSTSSGILGEDIISFGNQSELVPQRATFGCETVETGDLYSQRADGIM 196
           +G LCVYER+YAEMS+SSG+L ED+ISFGN+S+L PQRA FGCE  ETGDL+SQRADGIM
Sbjct: 143 EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEETGDLFSQRADGIM 202

Query: 197 GLGSGELSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTPSDMAFSFSDRMRSPY 256
           GLG G+LS+VDQLV+KGVI D FSLCYGGM++GGGAMVLG IS P  M FS SD  RSPY
Sbjct: 203 GLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPPGMVFSHSDPFRSPY 262

Query: 257 YNVDLKEIRVAGKKLLLNPSVFDGKFGTVLDSGTTYAYLPQPAFGAFKDAIMDEVHSLKK 316
           YN+DLK++ VAGK L LNP VF+GK GTVLDSGTTYAY P+ AF A KDA++ E+ SLK+
Sbjct: 263 YNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKR 322

Query: 317 IGGPDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFENGQKLSLAPENYLFR 376
           I GPDPN++D+CFSGAGR          DVAE+   FP + M F NGQKL L+PENYLFR
Sbjct: 323 IHGPDPNYDDVCFSGAGR----------DVAEIHNFFPEIAMEFGNGQKLILSPENYLFR 382

Query: 377 HSKVHGAYCLGIFENGNDQTTLLGGIIVRNTLVMYDRENSKIGFWKTNCSELWERLHISN 436
           H+KV GAYCLGIF +  D TTLLGGI+VRNTLV YDREN K+GF KTNCS++W RL    
Sbjct: 383 HTKVRGAYCLGIFPD-RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCSDIWRRLAAPE 442

Query: 437 DTAHAPSVSNTSHDTEMAPASAPSEAPAS-APRELQVGRITFEILLNISYEDLEPHITEL 496
             A    +S  +  + ++P+ A SE+P S  P   +VG ITFE+ ++++   L+P  +E+
Sbjct: 443 SPAPTSPISQ-NKSSNISPSPATSESPTSHLPGVFRVGVITFEVSISVNNSSLKPKFSEI 502

Query: 497 SDLIAQELNVSYSQVRLLNFTMQGNDSLIQLAIIPGGSSEFFSHATATTIIAQIVEHHMQ 556
           +D IA EL++  +QVRLLNF+  GN+  ++  + P  SSE+ S+ TA  I+  + E+ ++
Sbjct: 503 ADFIAHELDIQSAQVRLLNFSSSGNEYRLKWGVFPPQSSEYISNTTALNIMLLLKENRLR 562

Query: 557 LPPTFGSYQVVQWNVEPLIKRSLWKQLYVMVIVAVIVTLLLGLSALGVWLIWRRRHQSFN 616
           LP  FGSY++++W  E   K+S W++  + V+   +++LL+    + + L+WRRR Q   
Sbjct: 563 LPGQFGSYKLLEWKAEQKKKQSWWEKHLLGVVGGAMISLLVTSVMIKLALVWRRRKQEEA 622

Query: 617 SYKPVNAAAPEQELQPL 632
           +Y+PVNAA  EQELQPL
Sbjct: 623 TYEPVNAAIKEQELQPL 624

BLAST of MC02g1075 vs. TAIR 10
Match: AT5G22850.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 201.4 bits (511), Expect = 2.1e-51
Identity = 148/443 (33.41%), Postives = 219/443 (49.44%), Query Frame = 0

Query: 65  DLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPK-----FDPDLS 124
           D  + G Y T+L +GTPP+ F + VDTGS V +V CA+C  C +    +     FDP  S
Sbjct: 74  DPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSS 133

Query: 125 STFRPVKC----------NLDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGN--QS 184
            T  P+ C          + D  C     LC Y  QY + S +SG    D++ F     S
Sbjct: 134 VTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGS 193

Query: 185 ELVPQR---ATFGCETVETGDLY-SQRA-DGIMGLGSGELSIVDQLVEKGVINDTFSLCY 244
            LVP       FGC T +TGDL  S RA DGI G G   +S++ QL  +G+    FS C 
Sbjct: 194 SLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253

Query: 245 GGMDIGGGAMVLGGISTPSDMAFSFSDRMRS-PYYNVDLKEIRVAGKKLLLNPSVF--DG 304
            G + GGG +VLG I  P+     F+  + S P+YNV+L  I V G+ L +NPSVF    
Sbjct: 254 KGENGGGGILVLGEIVEPN---MVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSN 313

Query: 305 KFGTVLDSGTTYAYLPQPAFGAFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPS 364
             GT++D+GTT AYL + A+  F +AI + V           +   +   G   YVI  S
Sbjct: 314 GQGTIIDTGTTLAYLSEAAYVPFVEAITNAV---------SQSVRPVVSKGNQCYVITTS 373

Query: 365 HYVSDVAELSKTFPAVDMVFENGQKLSLAPENYLFRHSKVHG--AYCLGIFENGNDQTTL 424
             V D+      FP V + F  G  + L P++YL + + V G   +C+G     N   T+
Sbjct: 374 --VGDI------FPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITI 433

Query: 425 LGGIIVRNTLVMYDRENSKIGFWKTNCSELWERLHISNDTAHAPSVSNTSHDTEMAPASA 481
           LG +++++ + +YD    +IG+   +CS           T+   S +++S  +E   A  
Sbjct: 434 LGDLVLKDKIFVYDLVGQRIGWANYDCS-----------TSVNVSATSSSGRSEYVNAGQ 484

BLAST of MC02g1075 vs. TAIR 10
Match: AT1G08210.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 186.8 bits (473), Expect = 5.3e-47
Identity = 125/386 (32.38%), Postives = 189/386 (48.96%), Query Frame = 0

Query: 65  DLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQCGRHQDPK-----FDPDLS 124
           D  L G Y T++ +GTPP++F + +DTGS V +V C +C  C +  + +     FDP +S
Sbjct: 77  DPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVS 136

Query: 125 STFRPVKCN---------LDCSCDDDGLLCVYERQYAEMSTSSGILGEDIISFGN--QSE 184
           S+   V C+          +  C  +  LC Y  +Y + S +SG    D +SF     S 
Sbjct: 137 SSASLVSCSDRRCYSNFQTESGCSPNN-LCSYSFKYGDGSGTSGYYISDFMSFDTVITST 196

Query: 185 LVPQRA---TFGCETVETGDLYSQR--ADGIMGLGSGELSIVDQLVEKGVINDTFSLCYG 244
           L    +    FGC  +++GDL   R   DGI GLG G LS++ QL  +G+    FS C  
Sbjct: 197 LAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLK 256

Query: 245 GMDIGGGAMVLGGISTPSDMAFSFSDRMRSPYYNVDLKEIRVAGKKLLLNPSVFD--GKF 304
           G   GGG MVLG I  P  +          P+YNV+L+ I V G+ L ++PSVF      
Sbjct: 257 GDKSGGGIMVLGQIKRPDTVYTPLVP--SQPHYNVNLQSIAVNGQILPIDPSVFTIATGD 316

Query: 305 GTVLDSGTTYAYLPQPAFGAFKDAIMDEVHSLKKIGGPDPNFNDICFSGAGRYVINPSHY 364
           GT++D+GTT AYLP  A+  F  A+ + V                  S  GR +   S+ 
Sbjct: 317 GTIIDTGTTLAYLPDEAYSPFIQAVANAV------------------SQYGRPITYESYQ 376

Query: 365 VSDV-AELSKTFPAVDMVFENGQKLSLAPENYL-FRHSKVHGAYCLGIFENGNDQTTLLG 424
             ++ A     FP V + F  G  + L P  YL    S     +C+G     + + T+LG
Sbjct: 377 CFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILG 436

Query: 425 GIIVRNTLVMYDRENSKIGFWKTNCS 426
            +++++ +V+YD    +IG+ + +CS
Sbjct: 437 DLVLKDKVVVYDLVRQRIGWAEYDCS 441

BLAST of MC02g1075 vs. TAIR 10
Match: AT2G36670.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 176.4 bits (446), Expect = 7.2e-44
Identity = 140/469 (29.85%), Postives = 224/469 (47.76%), Query Frame = 0

Query: 7   FSADPIAANLLLTPPHRAMVLPLYLSSP-----NSSRLISKPR-RHL-------RESNSY 66
           F+A P+ +           +LPL  + P       S L ++ R RH        R+S+  
Sbjct: 22  FAASPLPSAYAKYAAGPTKILPLQRAFPLDELVELSELRARDRVRHARILLGGGRQSSVG 81

Query: 67  NLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSSVTYVPCANCEQC----GRH 126
            + +  ++   D  L G Y T++ +G+PP +F + +DTGS + +V C++C  C    G  
Sbjct: 82  GVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLG 141

Query: 127 QDPKF---------------DPDLSSTFRPVKCNLDCSCDDDGLLCVYERQYAEMSTSSG 186
            D  F               DP  SS F+       CS ++    C Y  +Y + S +SG
Sbjct: 142 IDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAA--QCSENNQ---CGYSFRYGDGSGTSG 201

Query: 187 -----------ILGEDIISFGNQSELVPQRATFGCETVETGDL--YSQRADGIMGLGSGE 246
                      ILGE +++  N S  +     FGC T ++GDL    +  DGI G G G+
Sbjct: 202 YYMTDTFYFDAILGESLVA--NSSAPI----VFGCSTYQSGDLTKSDKAVDGIFGFGKGK 261

Query: 247 LSIVDQLVEKGVINDTFSLCYGGMDIGGGAMVLGGISTPSDMAFSFSDRMRS-PYYNVDL 306
           LS+V QL  +G+    FS C  G   GGG  VLG I  P      +S  + S P+YN++L
Sbjct: 262 LSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVP---GMVYSPLVPSQPHYNLNL 321

Query: 307 KEIRVAGKKLLLNPSVFDGK--FGTVLDSGTTYAYLPQPAFGAFKDAIMDEVHSLKKIGG 366
             I V G+ L L+ +VF+     GT++D+GTT  YL + A+  F +AI + V  L     
Sbjct: 322 LSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQL----- 381

Query: 367 PDPNFNDICFSGAGRYVINPSHYVSDVAELSKTFPAVDMVFENGQKLSLAPENYLFRHSK 426
                  I  +G   Y+++ S        +S  FP+V + F  G  + L P++YLF +  
Sbjct: 382 ----VTPIISNGEQCYLVSTS--------ISDMFPSVSLNFAGGASMMLRPQDYLFHYGI 441

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q4V3D27.5e-3831.22Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1[more]
Q9S9K43.9e-3429.06Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2[more]
Q9LS403.3e-3327.96Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Q9LHE31.5e-3027.88Protein ASPARTIC PROTEASE IN GUARD CELL 2 OS=Arabidopsis thaliana OX=3702 GN=ASP... [more]
Q6XBF83.4e-3026.37Aspartic proteinase CDR1 OS=Arabidopsis thaliana OX=3702 GN=CDR1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
XP_022149434.10.092.26aspartic proteinase-like protein 2 isoform X1 [Momordica charantia][more]
XP_022149435.10.086.90aspartyl protease family protein 1-like isoform X2 [Momordica charantia][more]
XP_038902862.10.081.20aspartic proteinase CDR1-like isoform X1 [Benincasa hispida][more]
XP_022985603.10.080.57aspartic proteinase nepenthesin-1-like [Cucurbita maxima][more]
XP_023512095.10.080.57aspartic proteinase nepenthesin-1-like [Cucurbita pepo subsp. pepo] >XP_02352220... [more]
Match NameE-valueIdentityDescription
A0A6J1D7180.092.26aspartic proteinase-like protein 2 isoform X1 OS=Momordica charantia OX=3673 GN=... [more]
A0A6J1D5P80.086.90aspartyl protease family protein 1-like isoform X2 OS=Momordica charantia OX=367... [more]
A0A6J1JE380.080.57aspartic proteinase nepenthesin-1-like OS=Cucurbita maxima OX=3661 GN=LOC1114836... [more]
A0A6J1EYA50.080.25aspartic proteinase nepenthesin-1-like OS=Cucurbita moschata OX=3662 GN=LOC11143... [more]
A0A5A7TUH10.078.83Aspartic proteinase CDR1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sc... [more]
Match NameE-valueIdentityDescription
AT3G50050.15.8e-21158.91Eukaryotic aspartyl protease family protein [more]
AT5G43100.11.4e-20457.37Eukaryotic aspartyl protease family protein [more]
AT5G22850.12.1e-5133.41Eukaryotic aspartyl protease family protein [more]
AT1G08210.15.3e-4732.38Eukaryotic aspartyl protease family protein [more]
AT2G36670.27.2e-4429.85Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 78..98
score: 53.59
coord: 396..411
score: 25.16
coord: 230..243
score: 23.29
coord: 283..294
score: 34.29
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 21..470
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 47..235
e-value: 2.0E-47
score: 163.7
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 244..428
e-value: 1.2E-42
score: 147.6
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 68..429
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 72..236
e-value: 1.4E-35
score: 123.1
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 256..419
e-value: 8.3E-25
score: 87.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 439..464
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 439..453
NoneNo IPR availablePANTHERPTHR13683:SF847EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEINcoord: 21..470
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 87..98
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 72..420
score: 47.409431
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 71..424
e-value: 2.9995E-75
score: 239.857

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC02g1075.1MC02g1075.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045944 positive regulation of transcription by RNA polymerase II
biological_process GO:0006508 proteolysis
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005634 nucleus
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0046983 protein dimerization activity
molecular_function GO:0000977 RNA polymerase II transcription regulatory region sequence-specific DNA binding