MC02g1238 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC02g1238
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: aspartic proteinase Asp1-like
Location: MC02: 11330113 .. 11339672 (-)
RNA-Seq Expression: MC02g1238
Synteny: MC02g1238
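The location field above packs the chromosome, start, end, and strand into one string. The sketch below is one hedged way to unpack it and derive the span length. It assumes the coordinates are 1-based and inclusive (the usual GFF3 convention, which this page does not state), and the function name `parse_location` is hypothetical.

```python
import re

# Pattern for strings such as "MC02: 11330113 .. 11339672 (-)".
# Assumption: the coordinates are 1-based and inclusive (GFF3 style).
LOCATION_RE = re.compile(r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into seqid, start, end, strand, and length."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive coordinates, so the span covers end - start + 1 bases.
        "length": end - start + 1,
    }


if __name__ == "__main__":
    print(parse_location("MC02: 11330113 .. 11339672 (-)"))
```

Under that assumption the gene spans 9560 bases on the minus strand of MC02.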
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAAGAGTTTTCCAAATTATGAGTTGTTGAACTAACATTTCTGTGAATTCTAATCGAGTTAATTTTCACCTCTACTCGATTAAAATCAGTTTCACCTCCTACGATCAAACTTTCACGGACGACAACTTTAACCATGGACATTGTGACATTCAGAATCAACTATGCTAGAAACTTTTTTTTTTTAGCAGATGACTATGTTAGTTGAAATAGACATACAAATAGGTGCAAATTCTTTGGGGAAAAAAAAGGAACTGCCCAACCGGTTCAAACCGACCGATTGGTGTGGTTCGGTATATCGATCCCCTCTCTCGCCATTACCACCAGATTCAGATCTTCCCCTCTCGCCGTCGTCCACTCGCCGTCGCGATCAGATCTCCCTCTCGCCGCCGTGTTCAGATCCCTCACTCGCCGTCGTCCGTTCGCCGATCGTGTGAAGTAATCAAAGGGAGAGACCCAATCGCGGGAATCTCCGCAATTGCTTCTCAGCAATGGCGACCACTGCTTGTTTCATCATCGTCAGTAGGAACGATATCCCCATTTATGAAGCTGAAGTTGGATCCGCTGCTAAAGTACTCCTTTTCTTTTAACCATTGCGTTTCCTTTCTATTTTGTATTTCTCTAGATTTCGAGGTTTATATCTTCGTTTTCATGATCCACGTTTCTTTCGAATTTTTATTTCACAAGTAACATTTCAAAACATGACGATTTGTTCGGAACTGTAGTTATGCCGTTAGAAATGAGCAGAAATTCACTTTTTCCATGTTTAGATTAGATTGCATAACCATGATTTTCCTTTTGGTTCTTTAATTGCTTTAGTTTTGACTGTCATTAGTTGCTCATCTTTTTGGCTTGACATGAAAGAGAGAGGATTCTGCTCAGCTTCATCAGTTTATATTGCATGCGTCCCTTGACATTGTTCAAGACCTGGCATGGACTACTAGCGCTATGTGAGTTCTTATTCCTTTTGTTTTTTGTTCATGTTCATTTATGTAGATCAGCGAGATACCTTTGAATTTCCCACTAGATTATTTCTATTGGTGCTTCTTCTTATGCAGGTTCTTGAAAGCAGTCGATAGGTTCAATGATTTGGTGGTGTCTGTATATGTAACCGCTGGTCATATCCTTTTATGTCGAGGGGGTTTGTTGGGAACAATGTGTTGGCAATTGAGTTGAATTTCTTCAATTTACTTGCCTATAAAAAATAACTCTAGGTTGATAAAAATTTTACATTTGAATCTATACCCCCACCACCACCCGGTTGCCTTACCGAGGCCAAGGTATTTTAACTGATGATATGGTCGTAATCTTTTTATAAGATCATTGTACGGGCATATGTATTTCAGAGATTCAATTCAGTATATAGTTCTTCTTATTTCCTTTTGTGCCAATGTTCTTGTTGGTTTTTCTTTGACCATTCACTACATACACGATTAATGTTACTTCATGACTCTCGCAATGATGATGGAATCAAGAGCTTTTTTCAAGAGGTTCATGAGCTCTATATAAAGGTAAGTGGAAACTTTACATGACTCTTAGGTATTATTATTGTTATTCCCATTATTTGAAGATGTGGTGCTTGATTTACTATGTTATATTGTTGAGGTTGATTGAAATATACTTTAGTTGATTAGTTCTTACTAGATAGCGGACATTCTTTTTTCTCAAATATACTGAATTGGACCTTCAAACAGTACAGTTGTTAAGTTAATTTATTGTATCACATGAGTTTGTTTGTATATATCCAAGAATGAGTGGTTGATTTGTTATGAGATCAGGTGATCGTGTTCAGATTGCATGAATCTTGAGGGAGGGGTTAGGGGAGGTGGGTTCATTTTCTGTGACCCCTTGTTGTTTGGTCTTGTCAGAATGCTTAGATTTAAGGCTCCTGTAGATCCCTTCTTTGTGTGATGATTGTTTTTGTGGCACAGGATTGTGACCCTCTTCGTCTAGGAAAATTAATTCCTTTTTGGAACTAAAATTTATAGAATGACTTGAAATCAT
GTGGTAGATTTGCACGTTAATAATTTATCTAGTGGGTTAAAGATTTTATTTCATCATCATAATTTAATAATTTAATTGCTGAAACAACTATTTGTAGAAAAATATGTTAGGGACATGCCAAGATTTAATTTACTTTAAAGTTTCCTTCAGCATAAGTAAATTCACGTTAACTTGTTTGTCATCTTCTCCTGATTTTGATGAAAATTTGGTCTATTGTAGATCTAAGTTTTTTTCAGTAGCCAATCAGAGCTTGAATGCTTAAAGCTTGCTTGAAGAGCAGAAAAGTGGGTTCAAGAGGGAAATTTAATTGTTGTTTTGAATTTATTCGTATCTGCTCATAATAGTATGTGCTTCCCAGTCCTAGCACTGCTTACTTTTCCGTTAGCAAAATTTCCTAATGAGTGTTTGCTATGCCTTTAAAGCTATTATGCAATTGAAATTTGAAACTTCCTCGATGATTCTAGATGGAGTTTCCTGTATAGCTACTCTCCTTCATTGATTATTATAATCTCTATCGGCTTTTAGTGCACCTTCTTAGATCAAAGTTAGCATAGCTATAAAATTTGTATAGCATAAGTTAGGTTTTCAAGATCTTAAATTTTCCTGTTTAGTATAACATAAGATAAAAGGTTTTGCTCTTTTGTTTGATGACCTACAGCCCACTCTTTCCATGATGTAGCATTGTTGGTGCAAAGATTTGACGAGTGTTGGAACCACTTTTATATAATTGCATATTCTTATCCATTTTATGTTGATAAGCAAGGATCCTAAGCTCTTAACTCTTAGCAATTGCCGCCGTATCTAAGTTTTCATAGCAGAGATTTTTTATTTATCTCGAGAAAGATTCCTAGAAAATCCTTGGAAATCTTTTTATAACCTTGCCTCACCTTCGCTATGTTATGTGCATGTGTAGGTACGCCATATCAAATATTTGGAAACTGTATCATAGTCAACCTATTTGAGAGAAATTGCACGACTGAAGTTTATAAACATTGGATAATTATTCAAGTTATGATGACGATGTATATAGACTTTCTTTATTTCCTTGGATGTGATTTTTCAGCTTTATATCAAATTTACTTCATCTATCGGAAAAATTCTAAGTTAAATAATTCAAGACCGGTTGTTGTAAGAGTTTGTCGGGTGACCATTTGTTCCCCATGTTTTCGTTGAATGGCTGTGGCATAAACCTTCTTGATTTCCCTGTTTTTTTCCTTTTTTCTCCGCTGATGCCAAGGGTTTTCATTTGGTATCAATTGTTGACTTGATTATGCTCATGCTGAAAATTGGCTTTGTGAATTTGCAGACTATACTAAATCCCCTCTACTTGCCTGGATCCCGCATCACATCTTCACATTTCGACACAAAAGTCCGTGCGCTCGCAAGGAAGTATCTCTAGTGCTCACCGTGATTGCAGACCAGACGGGCTTGTGACGATGGGCTTCAAAGTTTCTAGCAGCCTCAAGTTTCCTTCAATTCCCTATCATTTTGATCTTTGATATATAGTCCCGACATTATTCGACTCGGCTCAGTAGTTAAAGTATTATTGCTCATTGTGTATTATTGTCTTCAATATGACTTGATTACGAGTATGAAATTGCACTTTACAGCATTGATGTAATAATGTCCTCCCCTCTCCCTTCCACTCATCTCTTTATAGTTGAGAGTATTGTGTTTCATATTCCACAAGGCATGGATATTAAAATCATGCCACAAATATGCTCAGGAGACAGCCCTTTTCTTTTAATTTCTGTTACTTGCAAAGAATGTTTTGTATGGCGCAAGGAATATAAAGTTCTCTTCAGCAAAACTGCCATATTTCATCCAAGGCAGAGGAGGAAAAGAATAAGAAAACCCCTCATGTGAAGTCTCCTCCTAACTGGAGGAAAAGAATAACAAACCCACCAAAACTTAGCTAGAGAAATATGTGATTCCTAGTTAGGAAGGCTATAGAAGAACCAAAATTGTGGTAAAAGGCTCTGTAGTAAACCCTTGGAAAA
GAAAATTCATAGTTCCCAAGCAAAACAGGCCCTCTTAAACACCAACTCCAATAGAGCAACCGGTTCCATTATTCAATTACAGCCATTTTGGCCGTACACTTTGATCTTTTGTTTTTTAGTTCCATTCCCGAGATACCCAGAAGACAGAAACGGTGAAAGAAACTGATTCTTTATAAGGTTCAGTTGTCCAATCATGGGGACAGGGCTGTTGAAGATATTGGTGCTGATGGTGGCCTCCATGAACTGTTTGGCTCCATCTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGGAAGCCAATTCTGTCGGTTTCGGCCACATCCTCTTCATTTGCTTCCTCTTCCATTGTGTTGCCTCTTCAAGGAAACGTCTATCCAAATGGGTAAGCTATATTTCTGCTCAGTTTCAGGTTTTATTATTTCCCATTTTGATTTCTTTTACTGTAGTGCTACAGAATATGGAAATGGCTGAAGAGAAAGTTCAATCTGAATGCTGTGGCTTAAAATATGCTACAGATTACAAGATCACGTTATCCTCTGTTTTCCCATTTTCTTTCTGACCTACGTTTCTTCTTATCCTGTTCCAGTAAAATTGCTTTGGAGTGTATTTGTGAGTGCTTTTCAGAGTAGGGTTTTCTCTGTTCTTATTTGAGGGTCACTTTCTTTACTGTCAGATCCTATAATTGTTTTCATCAATATATTTTTGAGGTCGTATAATCTGTCTTTTTCAGGTTCTATAATGTTACTCTCTATGTAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCAGACACCGGTAGTGATCTCACTTGGCTTCAATGTGATGCTCCATGTCAGCAGTGCACTGAGGTAATTTTTTGTTTGATTCAGCAGTTAATGGTTCATAGGAAGGAGATTCTTGAAAAATATGATGACCAAAATGAAAGAAGAAAGTTACTTTAACTACACCTTTGGCTTTACTTGAGTTATTTTCACAATAAGGAATCCAAATTTTCCATGTTTGATTGTTGTCTAAGTTTCCTCTATATTCCTTCTTGATGTGCTTATTGCTTCAGTTTTCGCTCTCTTAAAAATCCTTTTCATTTCTTTGGAGTTGAAGAGTTAAGAATGCATCAATTTGTTGGCATCGTTTACCCTTTCAGACACCTCATCCACTGTACCGACCGAGTGACGATCTTGTGCCGTGTAAAGACCCGCTGTGTATGTCCTTGCACTCATCTGTTGATCATAGATGTGAGAACCCAGATCAATGCGACTATGAGGTTGAGTATGCAGATGGCGGTTCGTCTCTCGGAGTCCTTGTCAGGGATATATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGGCCCCGTTTGGCCCTCGGGTAAAAACCACTCGACTGGCTTCCATAATTCATCTTAATCTCCTAGCAATTTAAATCGAACTTACTAATTCCAATTACAACAATATACTTTAGAGCTTTTGCTAAAACAAACATCATGTCGTGCTGTCATATCTTACTTTTTTGCTCTCTCCAGATATTACACAAGATAATTTTAGCATCAAGAAGCCACGTAATTAAGAAAAGATATAGCAGCAGAATCTAGATTTTGTAATTAACATAATGATTCTGATTTGTATTTTTCAAAGTCATCAATAAAAAAGTATACTAATGACTATATTGTTCAAGAGATTGTTATTCACTTTGCTAACAAATTTCATCCATATGGTTAAATAAAATCATAAGGAGCAAGTAATGTAGGTCAGCTGTTTGAATCTGTGGTGCTGAGTTTTGTTTCATTTGATTACTCAGATGTGGTTATGATCAAATTCCTGGATCATCTTATTATCACCCCATGGATGGAATACTCGGCCTTGGAAAGGGAGCAGTAAGCATCGTCTCGCAGCTGCATAATCAGGGTATCATCCGGAATGTCATCGGTCACTGTTTCAGTAGCAGAGGAGGAGGATACCTTTTCTTTGGGGATGACATTTACGATTCTCATCGTGT
AGTTTGGACGCCAATGTCACGCGATTACCCGTAAATAACCCGCCTTGATCCTATATACATCTTAGTTATGATTCATGTTTGGCTCCTGTTTTTACAAAATTTTCTTCATGTGGTTGTATCAGGAAGCACTACTCCCCTGGGCTTGGAGAATTAATCTTCAATGGAAGAAGCACTGGACTGAGAAATTTGTTTGCAGTCTTTGACAGTGGCAGCTCTTACACCTACTTCAACGCTCAGGCTTATCAAGTTTTAACATCATTGGTAAGACTCATCTGTAAAGTAAAAGCCACTGATTCTTCCATGAATCTTTATTAATTGTTGCCATTGGCACTCTTCTTGTAGTTGAATAGAGAACTGGCAGGAAAACCGCTAAGAGAAGCCATGGACGACGACACGCTTCCACTTTGCTGGAGAGGTCGGAAGCCATTCAAAAGCTTACGCGATGTGAGAAAGTATTTCAAGCCATTGGCATTGGGCTTCTCCAGTGGCGGAAGAAGCAAAGCAGTGTTTGAAATACCAATGGAAGGCTATCTGATATTATCGGTAAGACTCCCTCCAAGCCTCCAACTAAGATTCATTACCATTTTCAACTGTGGAGCATTTGGTTCGATTTAAATTTCATTTAAGATTATTCTATTTTGGCCATTGAATTTTCATATTTGTTCTATTTTGACCCTGAACTTTCAAAAGATTTATTTTAGTTCTTAAACTTTAAAAAGGTAACTATTTTGGTCCAAACCTTTATTTTGTTTTACAAATTAACAACACAACTTAATGCATTTAAATCCACGTCGGTTGGAAGTGTGTATTGGGTAAGCAAGTATAACAAGGACCAAAATATCCTTCTTTTTTTTTTGTAAATCCAAAGATTAAAATTGACCTTTCGAAAGTTTATGTTCCAAAATGAAACAAACTTGAAAGTTGAAGATAAAAATAGGATCTAAAAGATTATATAAAAAAGTTACTTCTGAAGTTTTCCCTACAAAATTCTGCCTCTCCTGTAAAAAATTTCTAGCTGGAATATTTGTTTTTGAGATTTTTAAGTTGAATCAAATTGCAGTCCATGGGAAATGTTTGCTTGGGAATTTTAAACGGCACTGAAGTTGGGCTCCAAAACTCCAATATCATTGGTGGTACGTACTTTATATTTGCATGTTTTTTTTTCTTGAACAATAACGCTATATTTGGATTTTGGATGTTATTTGAAATACAAATTTCTTTTTTCTGAATACAACAAAGGAGTAGATATTCGAACTCACGACTTCTTAATTGGAGATCGGTGCTTAAACTTGGAGATATGTGCTTAAACTAGTCGAGCCGTTCATTCTCACGAAATATGAATTATTCCTTGCCTCATTTTATGCCATCTTTTTTTTGACAGATATATCAATGCATGATAAGATTGTAATATATAACAACGAGAAGCAAGCCATTGGATGGGCTACCGCTAACTGCGATCGAGTTCCCAAGTCCAGAGCTGCTGCCACCATGTGAGAATATATGACAGAAACTCGTAACAACGACGTCGAAAAACCGATTTATCTCCCACTAATGTATGTATAAACATAGTAACCAAACAAGTTGGGAAATAGGATTAGAATTTTTACATTGTTTATTGAACACTGCATTGAAATAACGAAGATCATCATCAGTTTCATTTGGAGAGTCCAAAGTTCATGAACAGAGAGTAAGTACTCCGGGAGTATAAATTTCCCGGAAAAGGAGTGTCGTGTCGTGGCCGAGTAGGATGAAGAAGACCACCAAGTTTAGAAACAGAGATCAATTCTTCTAACTACATTCTTCCATTCCAAGAAAATAAACAAATAAAAGGAAAGATAACAATGTTTTTGGCATATTGGTCTGTTTTCCAAAGAATTTGCTACCCTTAATCCTTATATGTAAAAATATGAATACAGCCTTCTTCTATTATGGTAGAGGAAGCCTTGGATACGCCATGGCCAATGCCCTTATACCTTCGCCTTTACAATGCTGTTGTTGAT
GGTGCTCGACTAAAGCTTCGTTCGGAATGACTTTTAGAAGAGTGAAAAATGCTTTTCGACGGGTCAATAAACACTTCCTCAAATCCTTATCCTTCGATTTAGATAATCAAAGGATGCTTTGAAATTTCTATTGACATAGCACTTGGTCTGTGCTTGAAAAGCATTTTGTTGTTCAAAAGTCATCCCAAACTCACTCTAAATTGGAGTCTGCAAATAAGCGAACATGTCGGGCAATGGTGAGAGTGGATCCATTTGGAACTGAACCTGATTCCCAAATGGGTCATCATCTGTCATGGAGAATGAATCCATATAATTGAACTGAAAAAAGTCCAAATCTGGCACTCCTTCTCCAAATCCCCATTTGGGTTGGCTCTGGACCTCCTTCTCCCAGGTGACCTCCGTCGAGGTGGCCAGCTCCGACCCTGATCCCGACCCGCTCGAGTCGGTGTGCATTCTCGGGACCGAATCCGACGTGTCCATCTGCAAATGGTTGTGGATGGGGAACTGTACCATGTTCTTGTTATTAGGCATCAAAATATTGGGCTTCTCCTCCTCAAAATCAGGGAATTCTGCATCCTTCTCGTCGTTGGCTTCGAAATGCTTCTCTATACATCCCTTCTTGTTGTAAATTCGACACAAAACCCAATCATCCAGCTGCAGAAAGCCCCAAATCGAGATCAGCAACCGATTTCCAAATCATAATTTAGGGTTAGGGATTTGAATTTCAATTACTTACTCTCAAATTGTTGTTCTTCTTGGCGGCGGATCTATCCACATTCGCGAGGCGATACTCGTGCATAATCCAATTGGTCTTGACACCACGAGGAGCCTTCCCGGCGTAGAACACGAGCGCCTTCTTGATTCCCAGAGCCTTAGGGCGGCCGATCGGCTTGTCGGCGCCGGTGGCTTTCCAGTAGCCGGTTCCGGCAGCCCGGTTCGGACGGGATCCGTTAGGGTACTTCCGATCCCTAGGAGAGAAGAAATACCACTCTTTCTCACCACAAACAGCCAATTCTGCGAATCGAATCGAATTGAATCACCGTCAGATTCAGATACACAAAAACGAAAAGAACAACACAAAACTCAGATTTCAAGAAAACCGAACCCGTACCAGGGAGCTGCCAGGGGTTGTACTTGTAGAGATCGATTTCCTTGATGATCGGAACGGCGATGGGCTGAGAAGAGCACTTCCGGCAGAGGTAGTGAAGAACTAGCTCCTCATCGGTGGGGTGAAATCTGAAGCCTGGAGGGAGCTCAAGTCCAGCTACGGTCATTTGCTAGCGGTTTCTGCTGAGTGCGGCGGTTGCTGTGAGTTGCTGCGGCGGCAGATACGGCGGCGGTTGGTCGGAGAGGGAATCGGTGAGCAGAGATGAAGAAGCGGAGCATATATATAAGAAGGAGAACACGTGGGGCTTATGGGGGAGGTGGTATTCACGCTTCTTCATGTTCTAGAAGACTATTAAACTAATAAATAATAAATAATAAAAATAAAAATCGAACGCCGTGGGACCCACGCGCAGCCCGACAAATTCCAAATATTCCATGATAATTAGAATTTAG

mRNA sequence

GAAGAGTTTTCCAAATTATGAGTTGTTGAACTAACATTTCTGTGAATTCTAATCGAGTTAATTTTCACCTCTACTCGATTAAAATCAGTTTCACCTCCTACGATCAAACTTTCACGGACGACAACTTTAACCATGGACATTGTGACATTCAGAATCAACTATGCTAGAAACTTTTTTTTTTTAGCAGATGACTATGTTAGTTGAAATAGACATACAAATAGGTGCAAATTCTTTGGGGAAAAAAAAGGAACTGCCCAACCGGTTCAAACCGACCGATTGGTGTGGTTCGGTATATCGATCCCCTCTCTCGCCATTACCACCAGATTCAGATCTTCCCCTCTCGCCGTCGTCCACTCGCCGTCGCGATCAGATCTCCCTCTCGCCGCCGTGTTCAGATCCCTCACTCGCCGTCGTCCGTTCGCCGATCGTGTGAAGTAATCAAAGGGAGAGACCCAATCGCGGGAATCTCCGCAATTGCTTCTCAGCAATGGCGACCACTGCTTGTTTCATCATCGTCAGTAGGAACGATATCCCCATTTATGAAGCTGAAGTTGGATCCGCTGCTAAAAGAGAGGATTCTGCTCAGCTTCATCAGTTTATATTGCATGCGTCCCTTGACATTGTTCAAGACCTGGCATGGACTACTAGCGCTATGTTCTTGAAAGCAGTCGATAGGTTCAATGATTTGGTGGTGTCTGTATATGTAACCGCTGGTCATACACGATTAATGTTACTTCATGACTCTCGCAATGATGATGGAATCAAGAGCTTTTTTCAAGAGGTTCATGAGCTCTATATAAAGACTATACTAAATCCCCTCTACTTGCCTGGATCCCGCATCACATCTTCACATTTCGACACAAAAGTCCGTGCGCTCGCAAGGAAGTATCTCTAGTGCTCACCGTGATTGCAGACCAGACGGGCTTGTGACGATGGGCTTCAAAGTTTCTAGCAGCCTCAAGTTTCCTTCAATTCCCTATCATTTTGATCTTTGATATATAGTCCCGACATTATTCGACTCGGCTCAGTAGTTAAAGTATTATTGCTCATTGTGTATTATTGTCTTCAATATGACTTGATTACGAGTATGAAATTGCACTTTACAGCATTGATGTAATAATGTCCTCCCCTCTCCCTTCCACTCATCTCTTTATAGTTGAGAGTATTGTGTTTCATATTCCACAAGGCATGGATATTAAAATCATGCCACAAATATGCTCAGGAGACAGCCCTTTTCTTTTAATTTCTGTTACTTGCAAAGAATGTTTTGTATGGCGCAAGGAATATAAAGTTCTCTTCAGCAAAACTGCCATATTTCATCCAAGGCAGAGGAGGAAAAGAATAAGAAAACCCCTCATGTGAAGTCTCCTCCTAACTGGAGGAAAAGAATAACAAACCCACCAAAACTTAGCTAGAGAAATATGTGATTCCTAGTTAGGAAGGCTATAGAAGAACCAAAATTGTGGTAAAAGGCTCTGTAGTAAACCCTTGGAAAAGAAAATTCATAGTTCCCAAGCAAAACAGGCCCTCTTAAACACCAACTCCAATAGAGCAACCGGTTCCATTATTCAATTACAGCCATTTTGGCCGTACACTTTGATCTTTTGTTTTTTAGTTCCATTCCCGAGATACCCAGAAGACAGAAACGGTGAAAGAAACTGATTCTTTATAAGGTTCAGTTGTCCAATCATGGGGACAGGGCTGTTGAAGATATTGGTGCTGATGGTGGCCTCCATGAACTGTTTGGCTCCATCTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGGAAGCCAATTCTGTCGGTTTCGGCCACATCCTCTTCATTTGCTTCCTCTTCCATTGTGTTGCCTCTTCAAGGAAACGTCTATCCAAATGGGTTCTATAATGTTACTCTCTATGTAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCAGACACCGGTAGTGATCTCACTTGGCTTCAATGTGATGCTCCATGTCAGCAGTGCAC
TGAGACACCTCATCCACTGTACCGACCGAGTGACGATCTTGTGCCGTGTAAAGACCCGCTGTGTATGTCCTTGCACTCATCTGTTGATCATAGATGTGAGAACCCAGATCAATGCGACTATGAGGTTGAGTATGCAGATGGCGGTTCGTCTCTCGGAGTCCTTGTCAGGGATATATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGGCCCCGTTTGGCCCTCGGATGTGGTTATGATCAAATTCCTGGATCATCTTATTATCACCCCATGGATGGAATACTCGGCCTTGGAAAGGGAGCAGTAAGCATCGTCTCGCAGCTGCATAATCAGGGTATCATCCGGAATGTCATCGGTCACTGTTTCAGTAGCAGAGGAGGAGGATACCTTTTCTTTGGGGATGACATTTACGATTCTCATCGTGTAGTTTGGACGCCAATGTCACGCGATTACCCGAAGCACTACTCCCCTGGGCTTGGAGAATTAATCTTCAATGGAAGAAGCACTGGACTGAGAAATTTGTTTGCAGTCTTTGACAGTGGCAGCTCTTACACCTACTTCAACGCTCAGGCTTATCAAGTTTTAACATCATTGTTGAATAGAGAACTGGCAGGAAAACCGCTAAGAGAAGCCATGGACGACGACACGCTTCCACTTTGCTGGAGAGGTCGGAAGCCATTCAAAAGCTTACGCGATGTGAGAAAGTATTTCAAGCCATTGGCATTGGGCTTCTCCAGTGGCGGAAGAAGCAAAGCAGTGTTTGAAATACCAATGGAAGGCTATCTGATATTATCGTCCATGGGAAATGTTTGCTTGGGAATTTTAAACGGCACTGAAGTTGGGCTCCAAAACTCCAATATCATTGGTGATATATCAATGCATGATAAGATTGTAATATATAACAACGAGAAGCAAGCCATTGGATGGGCTACCGCTAACTGCGATCGAGTTCCCAAGTCCAGAGCTGCTGCCACCATGTGAGAATATATGACAGAAACTCGTAACAACGACGTCGAAAAACCGATTTATCTCCCACTAATGTATGTATAAACATAGTAACCAAACAAGTTGGGAAATAGGATTAGAATTTTTACATTGTTTATTGAACACTGCATTGAAATAACGAAGATCATCATCAGTTTCATTTGGAGAGTCCAAAGTTCATGAACAGAGAGTAAGTACTCCGGGAGTATAAATTTCCCGGAAAAGGAGTGTCGTGTCGTGGCCGAGTAGGATGAAGAAGACCACCAAGTTTAGAAACAGAGATCAATTCTTCTAACTACATTCTTCCATTCCAAGAAAATAAACAAATAAAAGGAAAGATAACAATGTTTTTGGCATATTGGTCTGTTTTCCAAAGAATTTGCTACCCTTAATCCTTATATGTAAAAATATGAATACAGCCTTCTTCTATTATGGTAGAGGAAGCCTTGGATACGCCATGGCCAATGCCCTTATACCTTCGCCTTTACAATGCTGTTGTTGATGGTGCTCGACTAAAGCTTCGTTCGGAATGACTTTTAGAAGAGTGAAAAATGCTTTTCGACGGGTCAATAAACACTTCCTCAAATCCTTATCCTTCGATTTAGATAATCAAAGGATGCTTTGAAATTTCTATTGACATAGCACTTGGTCTGTGCTTGAAAAGCATTTTGTTGTTCAAAAGTCATCCCAAACTCACTCTAAATTGGAGTCTGCAAATAAGCGAACATGTCGGGCAATGGTGAGAGTGGATCCATTTGGAACTGAACCTGATTCCCAAATGGGTCATCATCTGTCATGGAGAATGAATCCATATAATTGAACTGAAAAAAGTCCAAATCTGGCACTCCTTCTCCAAATCCCCATTTGGGTTGGCTCTGGACCTCCTTCTCCCAGGTGACCTCCGTCGAGGTGGCCAGCTCCGACCCTGATCCCGACCCGCTCGAGTCGGTGTGCATTCTCGGGACCGAATCCGACGTGTCCATCTGCAAATGGTTGTGGATGGGGAACTGTACCA
TGTTCTTGTTATTAGGCATCAAAATATTGGGCTTCTCCTCCTCAAAATCAGGGAATTCTGCATCCTTCTCGTCGTTGGCTTCGAAATGCTTCTCTATACATCCCTTCTTGTTGTAAATTCGACACAAAACCCAATCATCCAGCTGCAGAAAGCCCCAAATCGAGATCAGCAACCGATTTCCAAATCATAATTTAGGGTTAGGGATTTGAATTTCAATTACTTACTCTCAAATTGTTGTTCTTCTTGGCGGCGGATCTATCCACATTCGCGAGGCGATACTCGTGCATAATCCAATTGGTCTTGACACCACGAGGAGCCTTCCCGGCGTAGAACACGAGCGCCTTCTTGATTCCCAGAGCCTTAGGGCGGCCGATCGGCTTGTCGGCGCCGGTGGCTTTCCAGTAGCCGGTTCCGGCAGCCCGGTTCGGACGGGATCCGTTAGGGTACTTCCGATCCCTAGGAGAGAAGAAATACCACTCTTTCTCACCACAAACAGCCAATTCTGCGAATCGAATCGAATTGAATCACCGTCAGATTCAGATACACAAAAACGAAAAGAACAACACAAAACTCAGATTTCAAGAAAACCGAACCCGTACCAGGGAGCTGCCAGGGGTTGTACTTGTAGAGATCGATTTCCTTGATGATCGGAACGGCGATGGGCTGAGAAGAGCACTTCCGGCAGAGGTAGTGAAGAACTAGCTCCTCATCGGTGGGGTGAAATCTGAAGCCTGGAGGGAGCTCAAGTCCAGCTACGGTCATTTGCTAGCGGTTTCTGCTGAGTGCGGCGGTTGCTGTGAGTTGCTGCGGCGGCAGATACGGCGGCGGTTGGTCGGAGAGGGAATCGGTGAGCAGAGATGAAGAAGCGGAGCATATATATAAGAAGGAGAACACGTGGGGCTTATGGGGGAGGTGGTATTCACGCTTCTTCATGTTCTAGAAGACTATTAAACTAATAAATAATAAATAATAAAAATAAAAATCGAACGCCGTGGGACCCACGCGCAGCCCGACAAATTCCAAATATTCCATGATAATTAGAATTTAG

Coding sequence (CDS)

ATGGGGACAGGGCTGTTGAAGATATTGGTGCTGATGGTGGCCTCCATGAACTGTTTGGCTCCATCTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGGAAGCCAATTCTGTCGGTTTCGGCCACATCCTCTTCATTTGCTTCCTCTTCCATTGTGTTGCCTCTTCAAGGAAACGTCTATCCAAATGGGTTCTATAATGTTACTCTCTATGTAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCAGACACCGGTAGTGATCTCACTTGGCTTCAATGTGATGCTCCATGTCAGCAGTGCACTGAGACACCTCATCCACTGTACCGACCGAGTGACGATCTTGTGCCGTGTAAAGACCCGCTGTGTATGTCCTTGCACTCATCTGTTGATCATAGATGTGAGAACCCAGATCAATGCGACTATGAGGTTGAGTATGCAGATGGCGGTTCGTCTCTCGGAGTCCTTGTCAGGGATATATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGGCCCCGTTTGGCCCTCGGATGTGGTTATGATCAAATTCCTGGATCATCTTATTATCACCCCATGGATGGAATACTCGGCCTTGGAAAGGGAGCAGTAAGCATCGTCTCGCAGCTGCATAATCAGGGTATCATCCGGAATGTCATCGGTCACTGTTTCAGTAGCAGAGGAGGAGGATACCTTTTCTTTGGGGATGACATTTACGATTCTCATCGTGTAGTTTGGACGCCAATGTCACGCGATTACCCGAAGCACTACTCCCCTGGGCTTGGAGAATTAATCTTCAATGGAAGAAGCACTGGACTGAGAAATTTGTTTGCAGTCTTTGACAGTGGCAGCTCTTACACCTACTTCAACGCTCAGGCTTATCAAGTTTTAACATCATTGTTGAATAGAGAACTGGCAGGAAAACCGCTAAGAGAAGCCATGGACGACGACACGCTTCCACTTTGCTGGAGAGGTCGGAAGCCATTCAAAAGCTTACGCGATGTGAGAAAGTATTTCAAGCCATTGGCATTGGGCTTCTCCAGTGGCGGAAGAAGCAAAGCAGTGTTTGAAATACCAATGGAAGGCTATCTGATATTATCGTCCATGGGAAATGTTTGCTTGGGAATTTTAAACGGCACTGAAGTTGGGCTCCAAAACTCCAATATCATTGGTGATATATCAATGCATGATAAGATTGTAATATATAACAACGAGAAGCAAGCCATTGGATGGGCTACCGCTAACTGCGATCGAGTTCCCAAGTCCAGAGCTGCTGCCACCATGTGA

Protein sequence

MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALGCGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALGFSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDRVPKSRAAATM
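
The protein sequence above corresponds to the coding sequence read codon by codon. As a minimal sketch of that relationship, the Python below translates a DNA string with the standard genetic code. The standard code is an assumption here (the page does not name a translation table), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the CDS listed above.
    print(translate("ATGGGGACAGGGCTGTTGAAG"))  # MGTGLLK
```

Applied to the first seven codons of the CDS, this gives MGTGLLK, which matches the start of the protein sequence.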
Homology
BLAST of MC02g1238 vs. ExPASy Swiss-Prot
Match: Q0IU52 (Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica OX=39947 GN=ASP1 PE=2 SV=1)

HSP 1 Score: 342.4 bits (877), Expect = 7.3e-93
Identity = 185/393 (47.07%), Positives = 258/393 (65.65%), Query Frame = 0

Query: 51  SSSIVLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPL 110
           SS++VL L GNVYP G + +T+ +G P K YFLD DTGS LTWLQCDAPC  C   PH L
Sbjct: 21  SSAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVL 80

Query: 111 YRPS-DDLVPCKDPLCMSLHSSV--DHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNL 170
           Y+P+   LV C D LC  L++ +    RC +  QCDY ++Y D  SS+GVLV D F L+ 
Sbjct: 81  YKPTPKKLVTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSA 140

Query: 171 TNGDPIRPRLALGCGYDQ-IPGSSYYHPMDGILGLGKGAVSIVSQLHNQGII-RNVIGHC 230
           +NG      +A GCGYDQ     +   P+D ILGL +G V+++SQL +QG+I ++V+GHC
Sbjct: 141 SNGTN-PTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 200

Query: 231 FSSRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIF--NGRSTGLRNLFAVFD 290
            SS+GGG+LFFGD    +  V WTPM+R++ K+YSPG G L F  N ++     +  +FD
Sbjct: 201 ISSKGGGFLFFGDAQVPTSGVTWTPMNREH-KYYSPGHGTLHFDSNSKAISAAPMAVIFD 260

Query: 291 SGSSYTYFNAQAYQ----VLTSLLNRELAGKPLREAMDDD-TLPLCWRGRKPFKSLRDVR 350
           SG++YTYF AQ YQ    V+ S LN E   K L E  + D  L +CW+G+    ++ +V+
Sbjct: 261 SGATYTYFAAQPYQATLSVVKSTLNSEC--KFLTEVTEKDRALTVCWKGKDKIVTIDEVK 320

Query: 351 KYFKPLALGFSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTE--VGLQNSNIIGDIS 410
           K F+ L+L F+ G + KA  EIP E YLI+S  G+VCLGIL+G++  + L  +N+IG I+
Sbjct: 321 KCFRSLSLEFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGIT 380

Query: 411 MHDKIVIYNNEKQAIGWATANCDRVPKSRAAAT 430
           M D++VIY++E+  +GW    CDR+P+S +A T
Sbjct: 381 MLDQMVIYDSERSLLGWVNYQCDRIPRSESAIT 407
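
The percentages reported for each HSP are the matching column count divided by the alignment length. The short Python sketch below recomputes them for the Q0IU52 hit above. The helper name `percent` is hypothetical, and rounding to two decimals is an assumption chosen to match the figures shown.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / aligned_length, 2)


if __name__ == "__main__":
    # Values taken from the Q0IU52 HSP: 185 identical and 258 positive
    # columns out of 393 aligned columns.
    print(percent(185, 393))  # 47.07
    print(percent(258, 393))  # 65.65
```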

BLAST of MC02g1238 vs. ExPASy Swiss-Prot
Match: A2ZC67 (Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica OX=39946 GN=ASP1 PE=2 SV=2)

HSP 1 Score: 328.2 bits (840), Expect = 1.4e-88
Identity = 176/393 (44.78%), Positives = 255/393 (64.89%), Query Frame = 0

Query: 51  SSSIVLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPL 110
           SS++VL L GNVYP G + VT+ +G P KPYFLD DTGS LTWLQCD PC  C + PH L
Sbjct: 21  SSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGL 80

Query: 111 YRPS-DDLVPCKDPLCMSLHSSV--DHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNL 170
           Y+P     V C +  C  L++ +    +C   +QC Y ++Y  GGSS+GVL+ D F L  
Sbjct: 81  YKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLPA 140

Query: 171 TNGDPIRPRLALGCGYDQIPGSSYYH---PMDGILGLGKGAVSIVSQLHNQGII-RNVIG 230
           +NG      +A GCGY+Q  G + ++   P++GILGLG+G V+++SQL +QG+I ++V+G
Sbjct: 141 SNGTN-PTSIAFGCGYNQ--GKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLG 200

Query: 231 HCFSSRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGL--RNLFAV 290
           HC SS+G G+LFFGD    +  V W+PM+R++ KHYSP  G L FN  S  +    +  +
Sbjct: 201 HCISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLQFNSNSKPISAAPMEVI 260

Query: 291 FDSGSSYTYFNAQAYQVLTSLLNRELAG--KPLREAMDDD-TLPLCWRGRKPFKSLRDVR 350
           FDSG++YTYF  Q Y    S++   L+   K L E  + D  L +CW+G+   +++ +V+
Sbjct: 261 FDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVK 320

Query: 351 KYFKPLALGFSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTE--VGLQNSNIIGDIS 410
           K F+ L+L F+ G + KA  EIP E YLI+S  G+VCLGIL+G++    L  +N+IG I+
Sbjct: 321 KCFRSLSLKFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGIT 380

Query: 411 MHDKIVIYNNEKQAIGWATANCDRVPKSRAAAT 430
           M D++VIY++E+  +GW    CDR+P+S +A T
Sbjct: 381 MLDQMVIYDSERSLLGWVNYQCDRIPRSASAIT 407

BLAST of MC02g1238 vs. ExPASy Swiss-Prot
Match: Q9M9A8 (Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1)

HSP 1 Score: 304.3 bits (778), Expect = 2.2e-81
Identity = 172/400 (43.00%), Positives = 242/400 (60.50%), Query Frame = 0

Query: 42  VSATSSSFASSSIVLPLQGNVYPNGFYNVTLYVGQPP--KPYFLDPDTGSDLTWLQCDAP 101
           +S ++ S  SS+ + P+ GNVYP+G Y   + VG+P   + Y LD DTGS+LTW+QCDAP
Sbjct: 177 LSTSAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAP 236

Query: 102 CQQCTETPHPLYRP-SDDLVPCKDPLCMSL-HSSVDHRCENPDQCDYEVEYADGGSSLGV 161
           C  C +  + LY+P  D+LV   +  C+ +  + +   CEN  QCDYE+EYAD   S+GV
Sbjct: 237 CTSCAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGV 296

Query: 162 LVRDIFPLNLTNGDPIRPRLALGCGYDQIPG--SSYYHPMDGILGLGKGAVSIVSQLHNQ 221
           L +D F L L NG      +  GCGYDQ  G   +     DGILGL +  +S+ SQL ++
Sbjct: 297 LTKDKFHLKLHNGSLAESDIVFGCGYDQ-QGLLLNTLLKTDGILGLSRAKISLPSQLASR 356

Query: 222 GIIRNVIGHCFSS--RGGGYLFFGDDIYDSHRVVWTPMSRD--------YPKHYSPGLGE 281
           GII NV+GHC +S   G GY+F G D+  SH + W PM  D             S G G 
Sbjct: 357 GIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGM 416

Query: 282 LIFNGRSTGLRNLFAVFDSGSSYTYFNAQAY-QVLTSLLNRELAGKPLREAMDDDTLPLC 341
           L  +G +  +  +  +FD+GSSYTYF  QAY Q++TSL  +E++G  L     D+TLP+C
Sbjct: 417 LSLDGENGRVGKV--LFDTGSSYTYFPNQAYSQLVTSL--QEVSGLELTRDDSDETLPIC 476

Query: 342 WRGRK--PFKSLRDVRKYFKPLALGFSSGGR--SKAVFEIPMEGYLILSSMGNVCLGILN 401
           WR +   PF SL DV+K+F+P+ L   S     S+ +  I  E YLI+S+ GNVCLGIL+
Sbjct: 477 WRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLL-IQPEDYLIISNKGNVCLGILD 536

Query: 402 GTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR 421
           G+ V   ++ I+GDISM   +++Y+N K+ IGW  ++C R
Sbjct: 537 GSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVR 570

BLAST of MC02g1238 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 146.0 bits (367), Expect = 1.0e-33
Identity = 127/438 (29.00%), Positives = 192/438 (43.84%), Query Frame = 0

Query: 8   ILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQGN--VYPN 67
           ++V+  AS N +  +      K K  E  K   S      S   +SI LPL G+  V   
Sbjct: 15  VIVIEFASANFVFKAQHKFAGKKKNLEHFK---SHDTRRHSRMLASIDLPLGGDSRVDSV 74

Query: 68  GFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPS---------DD 127
           G Y   + +G PPK Y +  DTGSD+ W+ C  PC +C    +  +R S           
Sbjct: 75  GLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNASSTSK 134

Query: 128 LVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGD----P 187
            V C D  C  +  S    C+    C Y + YAD  +S G  +RD+  L    GD    P
Sbjct: 135 KVGCDDDFCSFISQS--DSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGP 194

Query: 188 IRPRLALGCGYDQI----PGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSS 247
           +   +  GCG DQ      G S    +DG++G G+   S++SQL   G  + V  HC  +
Sbjct: 195 LGQEVVFGCGSDQSGQLGNGDS---AVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDN 254

Query: 248 -RGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGL-----RNLFAVF 307
            +GGG   F   + DS +V  TPM  +   HY+  L  +  +G S  L     RN   + 
Sbjct: 255 VKGGG--IFAVGVVDSPKVKTTPMVPN-QMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIV 314

Query: 308 DSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFK 367
           DSG++  YF    Y    SL+   LA +P++  + ++T        + F    +V + F 
Sbjct: 315 DSGTTLAYFPKVLYD---SLIETILARQPVKLHIVEETF-------QCFSFSTNVDEAFP 374

Query: 368 PLALGFSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNII--GDISMHDK 419
           P++  F    +      +    YL        C G   G     + S +I  GD+ + +K
Sbjct: 375 PVSFEFEDSVK----LTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNK 426

BLAST of MC02g1238 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 130.2 bits (326), Expect = 5.7e-29
Identity = 107/393 (27.23%), Positives = 169/393 (43.00%), Query Frame = 0

Query: 52  SSIVLPLQGNVYPN--GFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQC---TET 111
           ++I LPL G+   +  G Y   + +G PPK Y++  DTGSD+ W+ C APC +C   T+ 
Sbjct: 60  ANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDL 119

Query: 112 PHPL------YRPSDDLVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVR 171
             PL         +   V C+D  C  +  S    C     C Y V Y DG +S G  ++
Sbjct: 120 GIPLSLYDSKTSSTSKNVGCEDDFCSFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFIK 179

Query: 172 DIFPLNLTNGD----PIRPRLALGCGYDQIPG-SSYYHPMDGILGLGKGAVSIVSQLHNQ 231
           D   L    G+    P+   +  GCG +Q          +DGI+G G+   SI+SQL   
Sbjct: 180 DNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAG 239

Query: 232 GIIRNVIGHCFSSRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGL 291
           G  + +  HC  +  GG +F   ++ +S  V  TP+  +   HY+  L  +  +G    L
Sbjct: 240 GSTKRIFSHCLDNMNGGGIFAVGEV-ESPVVKTTPIVPN-QVHYNVILKGMDVDGDPIDL 299

Query: 292 RNLFA--------VFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRG 351
               A        + DSG++  Y     Y    SL+ +  A + ++  M  +T       
Sbjct: 300 PPSLASTNGDGGTIIDSGTTLAYLPQNLY---NSLIEKITAKQQVKLHMVQETFAC---- 359

Query: 352 RKPFKSLRDVRKYFKPLALGFSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQ 411
              F    +  K F  + L F    +      +    YL        C G  +G      
Sbjct: 360 ---FSFTSNTDKAFPVVNLHFEDSLK----LSVYPHDYLFSLREDMYCFGWQSGGMTTQD 419

Query: 412 NSNII--GDISMHDKIVIYNNEKQAIGWATANC 419
            +++I  GD+ + +K+V+Y+ E + IGWA  NC
Sbjct: 420 GADVILLGDLVLSNKLVVYDLENEVIGWADHNC 433

BLAST of MC02g1238 vs. NCBI nr
Match: XP_022157721.1 (aspartic proteinase Asp1 isoform X2 [Momordica charantia])

HSP 1 Score: 892 bits (2306), Expect = 0.0
Identity = 429/430 (99.77%), Positives = 429/430 (99.77%), Query Frame = 0

Query: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60
           MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG
Sbjct: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60

Query: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC 120
           NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC 120

Query: 121 KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI 240
           CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI
Sbjct: 181 CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI 240

Query: 241 YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT 300
           YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALGFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLAL FSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR 420
           PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR
Sbjct: 361 PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR 420

Query: 421 VPKSRAAATM 430
           VPKSRAAATM
Sbjct: 421 VPKSRAAATM 430

BLAST of MC02g1238 vs. NCBI nr
Match: XP_022157720.1 (aspartic proteinase Asp1 isoform X1 [Momordica charantia])

HSP 1 Score: 883 bits (2282), Expect = 0.0
Identity = 429/443 (96.84%), Positives = 429/443 (96.84%), Query Frame = 0

Query: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60
           MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG
Sbjct: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60

Query: 61  NVYPNG-------------FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETP 120
           NVYPNG             FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETP
Sbjct: 61  NVYPNGKIALECICECFSEFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETP 120

Query: 121 HPLYRPSDDLVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNL 180
           HPLYRPSDDLVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNL
Sbjct: 121 HPLYRPSDDLVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNL 180

Query: 181 TNGDPIRPRLALGCGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFS 240
           TNGDPIRPRLALGCGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFS
Sbjct: 181 TNGDPIRPRLALGCGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFS 240

Query: 241 SRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSS 300
           SRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSS
Sbjct: 241 SRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSS 300

Query: 301 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALG 360
           YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLAL 
Sbjct: 301 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALS 360

Query: 361 FSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNE 420
           FSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNE
Sbjct: 361 FSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNE 420

Query: 421 KQAIGWATANCDRVPKSRAAATM 430
           KQAIGWATANCDRVPKSRAAATM
Sbjct: 421 KQAIGWATANCDRVPKSRAAATM 443

BLAST of MC02g1238 vs. NCBI nr
Match: XP_038900559.1 (aspartic proteinase Asp1 isoform X2 [Benincasa hispida])

HSP 1 Score: 822 bits (2124), Expect = 5.91e-300
Identity = 385/420 (91.67%), Positives = 405/420 (96.43%), Query Frame = 0

Query: 6   LKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQGNVYPN 65
           L ILVLMVASM+CLAP SASSFFKDKPWERR+PILSV   SSSFASSSIV+PLQGNVYPN
Sbjct: 6   LMILVLMVASMSCLAPCSASSFFKDKPWERRRPILSVPIASSSFASSSIVMPLQGNVYPN 65

Query: 66  GFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPCKDPLC 125
           GFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLY+PS+DLVPCKDPLC
Sbjct: 66  GFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLC 125

Query: 126 MSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALGCGYDQ 185
           MSLHSS+DHRCENPDQCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRLALGCGYDQ
Sbjct: 126 MSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQ 185

Query: 186 IPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDIYDSHR 245
            PGSS YHPMDG+LGLG+GAVS+VSQLHNQGI+RNV+GHCFSS+GGGYLFFGD IYD +R
Sbjct: 186 DPGSSSYHPMDGVLGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGIYDPYR 245

Query: 246 VVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLTSLLNR 305
           +VWTPMSRDYPKHYSPG GELIFNGRSTGLRNLF VFDSGSSYTYFNAQAYQVLTSLLNR
Sbjct: 246 IVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNR 305

Query: 306 ELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALGFSSGGRSKAVFEIPMEGY 365
           ELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLAL FSSGGRSKAVFEIPMEGY
Sbjct: 306 ELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEIPMEGY 365

Query: 366 LILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDRVPKSR 425
           LI+SSMGN CLGILNGT+VGL+NSNIIGDISM DK+V+YNNEKQAIGWATANCDRVPKSR
Sbjct: 366 LIISSMGNACLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSR 425

BLAST of MC02g1238 vs. NCBI nr
Match: XP_004147327.2 (aspartic proteinase Asp1 isoform X1 [Cucumis sativus] >KAE8651999.1 hypothetical protein Csa_016941 [Cucumis sativus])

HSP 1 Score: 819 bits (2115), Expect = 1.39e-298
Identity = 384/425 (90.35%), Positives = 407/425 (95.76%), Query Frame = 0

Query: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60
           MG  +L +LVLMVASM+CLAP SASSFFKDKPWER++PILSV   SSSFASSSIVLPLQG
Sbjct: 1   MGKRVLVVLVLMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC 120
           NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLY+PS+DLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSS+DHRCENPDQCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI 240
           CGYDQ PGSS YHPMDGILGLG+GAVSIVSQLHNQGI+RNV+GHCF+S+GGGYLFFGD I
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT 300
           YD +R+VWTPMSRDYPKHYSPG GELIFNGRSTGLRNLF VFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALGFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWRGRKP KSLRDVRKYFKPLAL FSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR 420
           P EGY+I+SSMGNVCLGILNGT+VGL+NSNIIGDISM DK+V+YNNEKQAIGWATANCDR
Sbjct: 361 PTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSR 425
           VPKS+
Sbjct: 421 VPKSQ 425
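
The Identity and Positives figures in an HSP header follow from the midline of the alignment: a letter marks an identical residue, a "+" marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below is illustrative only and is not part of the BLAST output; the function names `midline_stats` and `percent` are hypothetical. It counts both values for the final alignment row above (midline `VPKS+`) and reproduces the header percentages from the reported counts.

```python
def midline_stats(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for a BLAST protein midline string."""
    # A letter in the midline means query and subject residues are identical.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions marked "+".
    positives = identities + midline.count("+")
    return identities, positives


def percent(count: int, length: int) -> float:
    """Express a count as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * count / length, 2)


# Last row of the alignment above: query VPKSR vs. subject VPKSQ.
identities, positives = midline_stats("VPKS+")
print(identities, positives)

# Counts reported in the HSP header: 384/425 identities, 407/425 positives.
print(percent(384, 425), percent(407, 425))
```

Summing the per-row counts over all rows of an HSP should give the totals shown in its header.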

BLAST of MC02g1238 vs. NCBI nr
Match: XP_008460823.1 (PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo])

HSP 1 Score: 814 bits (2102), Expect = 1.33e-296
Identity = 381/425 (89.65%), Positives = 406/425 (95.53%), Query Frame = 0

Query: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60
           MG  +L +L LMVASM+CLAP SASSFFKDKPWER++PILSV   SSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC 120
           NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLY+PS+DLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSS+DHRCENPDQCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI 240
           CGYDQ PGSS YHPMDGILGLG+GAVSIVSQLHNQGI+RNV+GHCF+S+GGGYLFFGD I
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT 300
           YD +R+VWTPMSRDYPKHYSPG GEL+FNGRSTGLRNLF VFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALGFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLAL FSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR 420
           P+EGY+I+SSMGNVCLGILNGT+VGL+NSNIIGDISM DK+V+YNNEKQAIGWATANCDR
Sbjct: 361 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSR 425
           VPKS+
Sbjct: 421 VPKSQ 425

BLAST of MC02g1238 vs. ExPASy TrEMBL
Match: A0A6J1DZ15 (aspartic proteinase Asp1 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111024375 PE=4 SV=1)

HSP 1 Score: 892 bits (2306), Expect = 0.0
Identity = 429/430 (99.77%), Positives = 429/430 (99.77%), Query Frame = 0

Query: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60
           MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG
Sbjct: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60

Query: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC 120
           NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC 120

Query: 121 KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI 240
           CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI
Sbjct: 181 CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI 240

Query: 241 YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT 300
           YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALGFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLAL FSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR 420
           PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR
Sbjct: 361 PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR 420

Query: 421 VPKSRAAATM 430
           VPKSRAAATM
Sbjct: 421 VPKSRAAATM 430

BLAST of MC02g1238 vs. ExPASy TrEMBL
Match: A0A6J1DTW3 (aspartic proteinase Asp1 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111024375 PE=4 SV=1)

HSP 1 Score: 883 bits (2282), Expect = 0.0
Identity = 429/443 (96.84%), Positives = 429/443 (96.84%), Query Frame = 0

Query: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60
           MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG
Sbjct: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60

Query: 61  NVYPNG-------------FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETP 120
           NVYPNG             FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETP
Sbjct: 61  NVYPNGKIALECICECFSEFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETP 120

Query: 121 HPLYRPSDDLVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNL 180
           HPLYRPSDDLVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNL
Sbjct: 121 HPLYRPSDDLVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNL 180

Query: 181 TNGDPIRPRLALGCGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFS 240
           TNGDPIRPRLALGCGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFS
Sbjct: 181 TNGDPIRPRLALGCGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFS 240

Query: 241 SRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSS 300
           SRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSS
Sbjct: 241 SRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSS 300

Query: 301 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALG 360
           YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLAL 
Sbjct: 301 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALS 360

Query: 361 FSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNE 420
           FSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNE
Sbjct: 361 FSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNE 420

Query: 421 KQAIGWATANCDRVPKSRAAATM 430
           KQAIGWATANCDRVPKSRAAATM
Sbjct: 421 KQAIGWATANCDRVPKSRAAATM 443

BLAST of MC02g1238 vs. ExPASy TrEMBL
Match: A0A1S3CDB2 (aspartic proteinase Asp1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=4 SV=1)

HSP 1 Score: 814 bits (2102), Expect = 6.45e-297
Identity = 381/425 (89.65%), Positives = 406/425 (95.53%), Query Frame = 0

Query: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60
           MG  +L +L LMVASM+CLAP SASSFFKDKPWER++PILSV   SSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC 120
           NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLY+PS+DLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSS+DHRCENPDQCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI 240
           CGYDQ PGSS YHPMDGILGLG+GAVSIVSQLHNQGI+RNV+GHCF+S+GGGYLFFGD I
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT 300
           YD +R+VWTPMSRDYPKHYSPG GEL+FNGRSTGLRNLF VFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALGFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLAL FSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR 420
           P+EGY+I+SSMGNVCLGILNGT+VGL+NSNIIGDISM DK+V+YNNEKQAIGWATANCDR
Sbjct: 361 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSR 425
           VPKS+
Sbjct: 421 VPKSQ 425

BLAST of MC02g1238 vs. ExPASy TrEMBL
Match: A0A6J1DU28 (aspartic proteinase Asp1 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111024375 PE=4 SV=1)

HSP 1 Score: 813 bits (2101), Expect = 8.82e-297
Identity = 396/417 (94.96%), Positives = 398/417 (95.44%), Query Frame = 0

Query: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60
           MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG
Sbjct: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60

Query: 61  NVYPNG-------------FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETP 120
           NVYPNG             FYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETP
Sbjct: 61  NVYPNGKIALECICECFSEFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETP 120

Query: 121 HPLYRPSDDLVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNL 180
           HPLYRPSDDLVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNL
Sbjct: 121 HPLYRPSDDLVPCKDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNL 180

Query: 181 TNGDPIRPRLALGCGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFS 240
           TNGDPIRPRLALGCGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFS
Sbjct: 181 TNGDPIRPRLALGCGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFS 240

Query: 241 SRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSS 300
           SRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSS
Sbjct: 241 SRGGGYLFFGDDIYDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSS 300

Query: 301 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALG 360
           YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLAL 
Sbjct: 301 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALS 360

Query: 361 FSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIG--DISMHDKIV 402
           FSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIG  DI  HD ++
Sbjct: 361 FSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGGVDIRTHDFLI 417

BLAST of MC02g1238 vs. ExPASy TrEMBL
Match: A0A5D3BS69 (Aspartic proteinase Asp1 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold680G00070 PE=4 SV=1)

HSP 1 Score: 814 bits (2102), Expect = 6.92e-296
Identity = 381/425 (89.65%), Positives = 406/425 (95.53%), Query Frame = 0

Query: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60
           MG  +L +L LMVASM+CLAP SASSFFKDKPWER++PILSV   SSSFASSSIVLPLQG
Sbjct: 42  MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 101

Query: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC 120
           NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLY+PS+DLVPC
Sbjct: 102 NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 161

Query: 121 KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSS+DHRCENPDQCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRLALG
Sbjct: 162 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 221

Query: 181 CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI 240
           CGYDQ PGSS YHPMDGILGLG+GAVSIVSQLHNQGI+RNV+GHCF+S+GGGYLFFGD I
Sbjct: 222 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 281

Query: 241 YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT 300
           YD +R+VWTPMSRDYPKHYSPG GEL+FNGRSTGLRNLF VFDSGSSYTYFNAQAYQVLT
Sbjct: 282 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 341

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALGFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLAL FSSGGRSKAVFEI
Sbjct: 342 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 401

Query: 361 PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR 420
           P+EGY+I+SSMGNVCLGILNGT+VGL+NSNIIGDISM DK+V+YNNEKQAIGWATANCDR
Sbjct: 402 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 461

Query: 421 VPKSR 425
           VPKS+
Sbjct: 462 VPKSQ 466

BLAST of MC02g1238 vs. TAIR 10
Match: AT4G33490.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 590.1 bits (1520), Expect = 1.4e-168
Identity = 277/423 (65.48%), Positives = 337/423 (79.67%), Query Frame = 0

Query: 8   ILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSF--ASSSIVLPLQGNVYPN 67
           ++VLMV S+  L  SSA  F     W +       +  S  F  A SS+V P+ GNVYP 
Sbjct: 9   MIVLMVMSL-VLGFSSAVDF----RWRK------TAGFSDRFTRAVSSVVFPVHGNVYPL 68

Query: 68  GFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPCKDPLC 127
           G+YNVT+ +GQPP+PY+LD DTGSDLTWLQCDAPC +C E PHPLY+PS DL+PC DPLC
Sbjct: 69  GYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLC 128

Query: 128 MSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALGCGYDQ 187
            +LH + + RCE P+QCDYEVEYADGGSSLGVLVRD+F +N T G  + PRLALGCGYDQ
Sbjct: 129 KALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYDQ 188

Query: 188 IPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDIYDSHR 247
           IPG+S +HP+DG+LGLG+G VSI+SQLH+QG ++NVIGHC SS GGG LFFGDD+YDS R
Sbjct: 189 IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSSR 248

Query: 248 VVWTPMSRDYPKHYSPGL-GELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLTSLLN 307
           V WTPMSR+Y KHYSP + GEL+F GR+TGL+NL  VFDSGSSYTYFN++AYQ +T LL 
Sbjct: 249 VSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLK 308

Query: 308 RELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALGFSSGGRSKAVFEIPMEG 367
           REL+GKPL+EA DD TLPLCW+GR+PF S+ +V+KYFKPLAL F +G RSK +FEIP E 
Sbjct: 309 RELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEA 368

Query: 368 YLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDRVPKS 427
           YLI+S  GNVCLGILNGTE+GLQN N+IGDISM D+++IY+NEKQ+IGW   +CD +   
Sbjct: 369 YLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCDELASL 420

BLAST of MC02g1238 vs. TAIR 10
Match: AT4G33490.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 549.3 bits (1414), Expect = 2.8e-156
Identity = 259/389 (66.58%), Positives = 311/389 (79.95%), Query Frame = 0

Query: 8   ILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSF--ASSSIVLPLQGNVYPN 67
           ++VLMV S+  L  SSA  F     W +       +  S  F  A SS+V P+ GNVYP 
Sbjct: 6   MIVLMVMSL-VLGFSSAVDF----RWRK------TAGFSDRFTRAVSSVVFPVHGNVYPL 65

Query: 68  GFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPCKDPLC 127
           G+YNVT+ +GQPP+PY+LD DTGSDLTWLQCDAPC +C E PHPLY+PS DL+PC DPLC
Sbjct: 66  GYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCNDPLC 125

Query: 128 MSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALGCGYDQ 187
            +LH + + RCE P+QCDYEVEYADGGSSLGVLVRD+F +N T G  + PRLALGCGYDQ
Sbjct: 126 KALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCGYDQ 185

Query: 188 IPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDIYDSHR 247
           IPG+S +HP+DG+LGLG+G VSI+SQLH+QG ++NVIGHC SS GGG LFFGDD+YDS R
Sbjct: 186 IPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYDSSR 245

Query: 248 VVWTPMSRDYPKHYSPGL-GELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLTSLLN 307
           V WTPMSR+Y KHYSP + GEL+F GR+TGL+NL  VFDSGSSYTYFN++AYQ +T LL 
Sbjct: 246 VSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLK 305

Query: 308 RELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALGFSSGGRSKAVFEIPMEG 367
           REL+GKPL+EA DD TLPLCW+GR+PF S+ +V+KYFKPLAL F +G RSK +FEIP E 
Sbjct: 306 RELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEA 365

Query: 368 YLILSSMGNVCLGILNGTEVGLQNSNIIG 394
           YLI+S  GNVCLGILNGTE+GLQN N+IG
Sbjct: 366 YLIISMKGNVCLGILNGTEIGLQNLNLIG 383

BLAST of MC02g1238 vs. TAIR 10
Match: AT1G44130.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 433.3 bits (1113), Expect = 2.2e-121
Identity = 198/387 (51.16%), Positives = 282/387 (72.87%), Query Frame = 0

Query: 41  SVSATSSSFASSSIVLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPC 100
           S+  T    + SS+V PL GNV+P G+Y+V + +G PPK +  D DTGSDLTW+QCDAPC
Sbjct: 22  SIFKTFIKSSPSSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDAPC 81

Query: 101 QQCTETPHPLYRPSDDLVPCKDPLCMSLHSSVDHRCENP-DQCDYEVEYADGGSSLGVLV 160
             CT  P+  Y+P  +++PC +P+C +LH      C NP +QCDYEV+YAD GSS+G LV
Sbjct: 82  SGCTLPPNLQYKPKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGSSMGALV 141

Query: 161 RDIFPLNLTNGDPIRPRLALGCGYDQIPGSSYYHPMD-GILGLGKGAVSIVSQLHNQGII 220
            D FPL L NG  ++P +A GCGYDQ   S++  P   G+LGLG+G + +++QL + G+ 
Sbjct: 142 TDQFPLKLVNGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLT 201

Query: 221 RNVIGHCFSSRGGGYLFFGDDIYDSHRVVWTP-MSRDYPKHYSPGLGELIFNGRSTGLRN 280
           RNV+GHC SS+GGG+LFFGD++  S  V WTP +S+D   HY+ G  +L+FNG+ TGL+ 
Sbjct: 202 RNVVGHCLSSKGGGFLFFGDNLVPSIGVAWTPLLSQD--NHYTTGPADLLFNGKPTGLKG 261

Query: 281 LFAVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDV 340
           L  +FD+GSSYTYFN++AYQ + +L+  +L   PL+ A +D TLP+CW+G KPFKS+ +V
Sbjct: 262 LKLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEV 321

Query: 341 RKYFKPLALGFSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISM 400
           + +FK + + F++G R+  ++  P E YLI+S  GNVCLG+LNG+EVGLQNSN+IGDISM
Sbjct: 322 KNFFKTITINFTNGRRNTQLYLAP-ELYLIVSKTGNVCLGLLNGSEVGLQNSNVIGDISM 381

Query: 401 HDKIVIYNNEKQAIGWATANCDRVPKS 425
              ++IY+NEKQ +GW +++C+++PK+
Sbjct: 382 QGLMMIYDNEKQQLGWVSSDCNKLPKT 405

BLAST of MC02g1238 vs. TAIR 10
Match: AT1G77480.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 406.4 bits (1043), Expect = 2.9e-113
Identity = 195/382 (51.05%), Positives = 260/382 (68.06%), Query Frame = 0

Query: 51  SSSIVLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPL 110
           SS++V P+ GNVYP G+Y V L +G PPK + LD DTGSDLTW+QCDAPC  CT+     
Sbjct: 50  SSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ 109

Query: 111 YRPSDDLVPCKDPLCMSLHSSVDHRCENP-DQCDYEVEYADGGSSLGVLVRDIFPLNLTN 170
           Y+P+ + +PC   LC  L    D  C +P DQCDYE+ Y+D  SS+G LV D  PL L N
Sbjct: 110 YKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLAN 169

Query: 171 GDPIRPRLALGCGYD-QIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSS 230
           G  +  RL  GCGYD Q PG     P  GILGLG+G V + +QL + GI +NVI HC S 
Sbjct: 170 GSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSH 229

Query: 231 RGGGYLFFGDDIYDSHRVVWTPMSRDYP-KHYSPGLGELIFNGRSTGLRNLFAVFDSGSS 290
            G G+L  GD++  S  V WT ++ + P K+Y  G  EL+FN ++TG++ +  VFDSGSS
Sbjct: 230 TGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSS 289

Query: 291 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALG 350
           YTYFNA+AYQ +  L+ ++L GKPL +  DD +LP+CW+G+KP KSL +V+KYFK + L 
Sbjct: 290 YTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLR 349

Query: 351 FSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNE 410
           F +  ++  +F++P E YLI++  G VCLGILNGTE+GL+  NIIGDIS    +VIY+NE
Sbjct: 350 FGN-QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNE 409

Query: 411 KQAIGWATANCDRVPKSRAAAT 430
           KQ IGW +++CD++PKS    T
Sbjct: 410 KQRIGWISSDCDKLPKSEPLFT 430

BLAST of MC02g1238 vs. TAIR 10
Match: AT1G77480.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 403.3 bits (1035), Expect = 2.5e-112
Identity = 192/375 (51.20%), Positives = 257/375 (68.53%), Query Frame = 0

Query: 51  SSSIVLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPL 110
           SS++V P+ GNVYP G+Y V L +G PPK + LD DTGSDLTW+QCDAPC  CT+     
Sbjct: 50  SSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ 109

Query: 111 YRPSDDLVPCKDPLCMSLHSSVDHRCENP-DQCDYEVEYADGGSSLGVLVRDIFPLNLTN 170
           Y+P+ + +PC   LC  L    D  C +P DQCDYE+ Y+D  SS+G LV D  PL L N
Sbjct: 110 YKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLAN 169

Query: 171 GDPIRPRLALGCGYD-QIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSS 230
           G  +  RL  GCGYD Q PG     P  GILGLG+G V + +QL + GI +NVI HC S 
Sbjct: 170 GSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSH 229

Query: 231 RGGGYLFFGDDIYDSHRVVWTPMSRDYP-KHYSPGLGELIFNGRSTGLRNLFAVFDSGSS 290
            G G+L  GD++  S  V WT ++ + P K+Y  G  EL+FN ++TG++ +  VFDSGSS
Sbjct: 230 TGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSS 289

Query: 291 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALG 350
           YTYFNA+AYQ +  L+ ++L GKPL +  DD +LP+CW+G+KP KSL +V+KYFK + L 
Sbjct: 290 YTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLR 349

Query: 351 FSSGGRSKAVFEIPMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNE 410
           F +  ++  +F++P E YLI++  G VCLGILNGTE+GL+  NIIGDIS    +VIY+NE
Sbjct: 350 FGN-QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNE 409

Query: 411 KQAIGWATANCDRVP 423
           KQ IGW +++CD++P
Sbjct: 410 KQRIGWISSDCDKLP 423

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
Q0IU52          7.3e-93    47.07     Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica OX=39947 GN=ASP1 PE=2 S...
A2ZC67          1.4e-88    44.78     Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica OX=39946 GN=ASP1 PE=2 SV=...
Q9M9A8          2.2e-81    43.00     Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1
Q9S9K4          1.0e-33    29.00     Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2
Q4V3D2          5.7e-29    27.23     Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1

Match Name      E-value    Identity  Description
XP_022157721.1  0.0        99.77     aspartic proteinase Asp1 isoform X2 [Momordica charantia]
XP_022157720.1  0.0        96.84     aspartic proteinase Asp1 isoform X1 [Momordica charantia]
XP_038900559.1  5.91e-300  91.67     aspartic proteinase Asp1 isoform X2 [Benincasa hispida]
XP_004147327.2  1.39e-298  90.35     aspartic proteinase Asp1 isoform X1 [Cucumis sativus] >KAE8651999.1 hypothetical...
XP_008460823.1  1.33e-296  89.65     PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo]

Match Name      E-value    Identity  Description
A0A6J1DZ15      0.0        99.77     aspartic proteinase Asp1 isoform X2 OS=Momordica charantia OX=3673 GN=LOC1110243...
A0A6J1DTW3      0.0        96.84     aspartic proteinase Asp1 isoform X1 OS=Momordica charantia OX=3673 GN=LOC1110243...
A0A1S3CDB2      6.45e-297  89.65     aspartic proteinase Asp1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=4...
A0A6J1DU28      8.82e-297  94.96     aspartic proteinase Asp1 isoform X3 OS=Momordica charantia OX=3673 GN=LOC1110243...
A0A5D3BS69      6.92e-296  89.65     Aspartic proteinase Asp1 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5...

Match Name      E-value    Identity  Description
AT4G33490.2     1.4e-168   65.48     Eukaryotic aspartyl protease family protein
AT4G33490.1     2.8e-156   66.58     Eukaryotic aspartyl protease family protein
AT1G44130.1     2.2e-121   51.16     Eukaryotic aspartyl protease family protein
AT1G77480.2     2.9e-113   51.05     Eukaryotic aspartyl protease family protein
AT1G77480.1     2.5e-112   51.20     Eukaryotic aspartyl protease family protein

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                        Source       Source Term      Source Description                           Alignment
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10       Acid Proteases                               coord: 238..422; e-value: 3.2E-26; score: 93.8
IPR021109  Aspartic peptidase domain superfamily  GENE3D       2.40.70.10       Acid Proteases                               coord: 42..237; e-value: 1.0E-44; score: 154.8
IPR021109  Aspartic peptidase domain superfamily  SUPERFAMILY  50630            Acid proteases                               coord: 60..421
IPR032799  Xylanase inhibitor, C-terminal         PFAM         PF14541          TAXi_C                                       coord: 280..414; e-value: 1.1E-12; score: 48.1
IPR032861  Xylanase inhibitor, N-terminal         PFAM         PF14543          TAXi_N                                       coord: 68..238; e-value: 1.5E-46; score: 158.9
IPR001461  Aspartic peptidase A1 family           PANTHER      PTHR13683        ASPARTYL PROTEASES                           coord: 8..423
None       No IPR available                       PANTHER      PTHR13683:SF800  EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEIN  coord: 8..423
IPR033121  Peptidase family A1 domain             PROSITE      PS51767          PEPTIDASE_A1                                 coord: 68..414; score: 34.188499
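
The alignment coordinates in the InterPro annotations above are residue ranges on the predicted protein, so they can be compared directly. The Python sketch below is illustrative only and is not part of the InterPro output; the helper names `parse_coord` and `contains` are hypothetical. It extracts the start and end from a `coord: start..end` string and checks that the two Pfam domains (TAXi_N, 68..238, and TAXi_C, 280..414) fall inside the PROSITE PEPTIDASE_A1 region (68..414).

```python
import re


def parse_coord(text: str) -> tuple[int, int]:
    """Extract (start, end) from a string containing 'coord: start..end'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def contains(outer: tuple[int, int], inner: tuple[int, int]) -> bool:
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Ranges copied from the annotation rows above.
peptidase_a1 = parse_coord("coord: 68..414")
taxi_n = parse_coord("coord: 68..238")
taxi_c = parse_coord("coord: 280..414")

print(contains(peptidase_a1, taxi_n), contains(peptidase_a1, taxi_c))
```

The regular expression only requires the `coord:` prefix, so it also works on rows where the coordinate is fused to the preceding column.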

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
MC02g1238.1   MC02g1238.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006508      proteolysis
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0004190      aspartic-type endopeptidase activity