MS000634 (gene) Bitter gourd (TR) v1

Overview
NameMS000634
Typegene
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionaspartic proteinase-like protein 2
Locationscaffold93: 599180 .. 610017 (-)
RNA-Seq ExpressionMS000634
SyntenyMS000634
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCACCGCCGGCATGGGTACTTCACGCCCTTTGACTCTGTTGCTCCTTGTAATCTTCAACTTGCTTTCCAACACCATCACCGGCGGCGGCGGTGTCTCTGCTGAAAGCGGCGTGTTCAGCGTCAAATACAAGTATGCCGGCCGGGAGCGTTCTCTTAGTACCCTTAAGGCGCACGACATCAGCCGTCAGCTTCGATTTCTTGCCGGCGTCGATATCCCACTCGGTGGCTCCGGCCGACCTGATGCCGTCGGGTTAGTGATATATTTTTGAATTTCGTTCAGTATTGTGTGTGTTGATCGATTCTGCTGTGGTCTTAATTGAAGGGTTCACTTCTTTTCCAGGCTTTACTATGCAAAAATTGGAATTGGAACTCCTCCCAAGGACTATTATGTTCAAGTTGATACAGGGAGTGACATCGTGTGGGTGAATTGTATTCAGTGTATGGAGTGTCCGAGGAAAAGCACTCTTGGGGTATTGTTTCAGTAAATGTCTTCTTATTTTCAGTCTCTATGGTTGGTTTTTGATTGGTATTGCGATGGGGTTAATTGGAGTGGTTTGAATTTGGCATCAGATGGAGCTCACTCCGTATAATATAGAGGAATCCACAACTGGGAAATTGGTTTCCTGCGATGAACAATTTTGCTTGGAGGTGAATGGGGGTCCACTTTCAGGCTGCACTGGTAATATGTCCTGTCCATACCTTCAGATATACGGCGACGGGAGCTCAACTGCTGGATACTTCATCAAGGACTACGTACAATATGATCGGGTTTCTGGAGACCTTGAAACTACAGCTGCTAATGGAAGTATCAAGTTTGGGTGAGTTTTTTTTTTAACCTACGTACTATTATGTACTCTGTAGCCCCAATATAGGGAATGATGATCAAGGACGGTGGCATATAATATAGAAAGAAGAAGCTTCAGGGAGATCACCTTGCAAAACTTTGTCCCTTGCAAAACTCAAAAGGATAGGAGCCTTTATGTAGAGAACAAAGTTTTTGGAATAGATATCCAAGGGGAGTGCAAACTGTCCCTTTACCTACGAGTTTGAGCTCTAACCATGCTTTTCAAGCCATAGTACTAGTAGTAAGTGGAATGACAAAAGAGAATTAGGCACAATTGTAAATCTCCGCAACCAATTATTAAGTAATGGAGCTTCCTTTCTCCCCACCCCCCACCACCTTACTTGAAGCCTCCCTAAGAGTTGTGCGAAGAAGAGTCTTCCAATTAACCATATGACATCTGCCTTCAACCGTCCAAAATGGTAATTAGCTCTTTGTTGGCATTTCATTTTGGTCATTTTATCCAATCTAATGCAAAAGCACCAGAGTCTCTATTCTTGAGGTTTATAATAGTGTATGTATGGAAGGGAAATTTTCTTGCCGAGGTGACTGGAGACAAATTTATTGTATGATGAAGGTATTCGAGGAACTACCGATGGAATATACCTTCCATTAGCCATTTATCTAAATCTTCCTCTAGTGCATTCTTGCGCTGAAAAGGATCATGCCTAATGAAAGTACCAAACTAGAAGGTGGAGGGTTTGGATCTAGAGGAAGAGAGCCCTCGTTAAAGGCACAATTATTAGAGCTGGTCTCTATTTGGTTATCCTCGAAGAATCAAAGTTGCACAATGTGGACGAGAGGCTGGTCAGATTGATTTGGGGTTTCTACCACAGTGCTTGAATGGCCTCTGAGTATCGTAGCTTCAAGAGATATATGTTGGTTTTATGGGGTGGCCCTTATTTTAATGTCCTTGTTGCTATGGATGGTTCTTTTTCTTTTTCTTAAAGGAAAAATTTGATAAATGTACTACTGGTACCCATTGGTATTCCTGTGAGGAATAAAATGTGTCATTTTACTGAATCAACTTCAATGACAAAAATGAAATTGTTTCATAGGGTTTTAAGTAAACCCAACACAAAGTAAATCGTGAGATCCTCCCAAATATCTTTTGGTATTGGTAAATGTGTACGTAATCTAGTGTTCTAAACCTCTTTTGTACACCTGACATATAGGACGTTCCTTCACAAAATGCGCATCTTTTCTTAACTGAAACCAATAAAATCGAGAACTAATCTGGTCCAGTGTTTTATAACGGCTAAAGTGTTCCGTCCTCTAAGTTCTCTGGCATAAATGTCCATTAAAGTTTTCTGTCCAAGGAATGTATGGGCGCCGTGCATAATTTATTGTCCTTATACACAAAAATCCATCTTTCACGCTACATCTTCTTAAAATCCTTCATTTTTTTTCACTTATCTAAAATGCTGGCAAAATCTTCATCAAAATTAACCCCTAACTTTCCTACATCAATAGGTAAATCATCCAATGAAGTATCATTTATATCACCCACATTTGCCGTCTCATATTGAATCAAATGGTTGAAAAGTGAGTAAATTAGAGCTCTATTGCGATTAGGGGAAAATAAACCGACTGTTGGAAGTTATTAAACTTGGATTAGTAGTTAGTAATTTTCTTGGGGTCAAAGTAGTGCAAGTCGTTGTACCAGTGGTTGGCCTGCTGATTGGCAGCTATATTCACTGGTTTCAGGCATATAGCTCCCGATACCACCTGATAGTTCTAGATTTGTTATCCCATTCAAAATCCAATGGCGGGTGTTTAGTCCATAAAAGACACTATTGGCATAATTGCAGTAAATAACTCCCGTGCAGTAAATATTATTAAATTTCTTGCTTGCTCTCTTTCTTCTTCTTCCCCTTCTTTATTATTTATTATTTATTATTTTTAAATATATATATATATATTTTTTTTCCATAATGATGTTCACTTTTTTATGATGGAAAGTGGTTGATAACATTTGATTGGTTGCTTGATGACTGTCTTTTAATTTAGCTTTGGTGGGTATTTTTCTATGTCACTAGTTTTTAAGGAAGCTTAGCTCACGTGGATTGGAACTTTTTAGATAAGATTTTTGGGAAGAAAGGTTTTGGGTATAAATGGAGGGCTTGGATTTGGAGTTGTATTAGGACGGTTAATTACTGGTGAATGGGAGACCTAGAGGTAAAATCTTCGCTTCTAGAGGCCTTCGTCTGAGGGATCCCTTGTCTCCTTTCCTGTTTATTTTGGTGGTGGACATTCTTAGCTGGTTAGTCTCCGTTGGGGTCGAGAAAGGGAGTCTGAGAGCCTTTGAAGTGGGTAGGGAGAAAGTGACCCTCTCTCACCTTCAGTTTGCAGATGATACTGTCTTTTTCTGTTCTGGTAAGGACAATGCTTTCAATAATCTTAACTGCCTCCTTGGTTTTTTTGAACCTCTTTTTGGCCTTAAGATTAATCGAGGCAAAAGCTCTCTTTTGGGGTGAATTGTGACGTGGGTAAGCTTAGAGAGTGGGCTTCCCTGGTTGGGTGCGAAGTTGGTGAGTTTCCTTCCTTCTATCTTGGTCTTCCTTTAGGTCATAATCCTAGGAGTGCCTTTCTTCTTGGAAGAAAGCGTATTTTTCTAAAGGAGGGAGACTTACATTAATTCAATCTATTTTGAGGGGAATTCCGTCTTACTTTATGTTTCTATTTAAGATCCCGGTGGCTGTTATTGAGGGTCTAGAAAAGATCATGAGGGATTTTATTTGGGAGGGAGTAGAGGAGGGGGGTGGTACTCATCTAGTCAATTGGAAGGTGGTTACAAAGCCTCTAGATGCGGGGGGTTTGGGCATTGGAAACTTGAGGCTTAGTAATGAGGCTTTTCTAGCTAAATGGTTATGGCGCTTCTTCTATGAACCTAGGGCCCTGTGGCGAAGGGTCATTGTGAGTAAATACGGGCCTCACCCGTTTGACTGGGTCCTTGCAGGTGGTCTTAAGGTGAGCAATAACAATCCTTGGAAGGCTATCGCTTCGTGTTTCCCTACTTTTACTCGTTCTCTTCATGCTGTTGTGGGGGATGGTCAAAATACTTATTTCTGGGAGGATTCGTGGGTTGGTGATAAACCTTTGCGCCTTGCTTTCCCTCGGCTTTATTTGCTATCTGAAAATAGGCTTCTTTCGGTGGCCAAGATTTTGAATCCTGTGGGGGAGGTTAATTCCATTTCTTTGGGCCTCTCTCATCCTTTGACGGATCAGGAGTCTTTAGAGGTGTCCGAGTTGTTAGTGTTGTTGTCTGAGGTGTCTTTATTTGCTGGGAGGGAGGATGTGAGGGTGTGGTCCCCCAACCCCTCGATAGGTTTTTCTTGTAAATCTTATTTCGATATTTTGATTTCCCCTTCTCCTTCTTCCCCTTCGACTTGTTGGTTCAACTCTCTTTGGAAGATTAAGATTCTGAAGAAGATTAAATTCTTTGTGTGGCAAGTTTTACTTGGGAAAGTCAATACTTTAGATCGTATCCAAAGGATGTTTTCCTTTTGTCTGGGTGCGCAGTGATGCGTTCTTTGTAGGAATGCCTCTGAAGATATGGAGCACTTGTTATGGAGTTGTCAGTTCGCTCAGGATGTTTGGCTCCATTTCTTTTATTGCTTTGGCTTGTCTTGGGCCCGTTCCAGAGATTGTAGGGGGATGATGGAGGAGGTGCTGTTACACCCCCCTTTTCGTGACAGGGGTCGTTTTTTGTGGCAGGCCTGTTTTCGGGCTGCTATTTGGGGGATTTGGTTTGAGAGAAATAATAGGCTGTTTAGGGAGGTGGAGAAATATGTTGATTATGTTTGGGATCTCATTAGATTTAATACTTTTTTATGGGGCTCTGTTTCTAAGGCCTTTTGTTATTATCCGTTAGGTCTCATTCTTTTGGATTGGTGTCCGTTTCTGTAAGGGACTCCTCTCGGGGTTGGTTTTTTGTAGGCCCCTTTGGTTTCTTTTGTTTCTCTTTTCATTCTTCTCAATGAAAGTATCGTTTCTTATCAAAAAAAAAAAAAAAAAAAAGCTTGGCTCATCTATTTGAAGCTTGAATTTTTCTAGTGAGAAAACTCTTGACTTATTTAGGTTGCACTTATCCACTGTTAGAAGATGATCGAACGAATTCTGATTGTAACTATTATCGTCTAAGAGATGAAAATTGCCACTTTGTAGCTTTTGTTAACATGGACTTATAAGTTCATCCTCCTGTTAATTTCATAATTATCCCACTCAGAACAATTTCATTTAAATTATTTGGTTCTTATGATTATTAAAATTTAAAATTTATTTATTTACTATTATCATTTGGTAGATGTGGTGCCCGACAATCTGGGGATCTAGGCTCATCCGGCGAAGAAGCACTTGATGGCATACTTGGATTTGGGAAATCAAATTCATCTATTATTTCACAACTAGCCTCCTCTAGAAAAGTGAAAAAGATGTTTGCTCATTGCCTAGATGGAATAAATGGCGGTGGTATCTTTGCAATGGGACATGTTGTTCAGCCAAAAGTTAACATGACTCCGTTGGTACCAAATCAGTATGTAATGTGCTCTGTAGCTCTTAATTGTTGTTTTCTAACAAACTTTTGGATTTCATCATTTTCCTATCTGTATAATTTAGCTGGTTTCCCTTCATTAACTTGTTGGTTTTACTTCTGCGAGTTCTAAAAATAAAAAAATAAGAAAGATATAATATGTTGTTCTAATGAAGTATAATAGCTATTAAAATCTAACCCAAAACTATGGTTCAGTGAAGTCAATAATTAGTTTTTTTTTTTTCTTTTTTTTTTTTTTTNNNNNNNNAGTGCTTTCAATAAATTCTTTAAAGAACACATAAAAATTGACTATTTATGGTTTAGAACTATTAACTTTTTGCACTACTTGAGAATTCTTCTTTTGATGTCATATTTTGACTTTCTTCTTTTTATTATATATGTGATTTATCTCTTATTAATACCCCGTAATTAAAATTTAGGCCACATTACAATGTCAATATGACGGGAGTACAAGTTGGTCGTGTCATGTTAAATATTTCGGCTGATGTATTTGAGGCAGGAGACAGAAAAGGGACAATCATTGATATTGGTACAACTTTGGCATATCTTCCAGAATTGATTTACGGACCATTAGTAACCATGGTACATGATAATGCAAGAGCTCTATCCATTGGAGAACTTTATTACAAATGAATATAGCTGATCCTCTTGATTTTGGTCTTGTAGATAATCTCACGGCAACCTAATTTGGAAGTTCAAACCATTCATGGAGAGTATAAATGTTTTCAGTACTCAGAAAGGTATGTAGCACTCTGACTTTGATTACCTGTCGTTGTAATTTATATGACTGTCTTTTTGTGAAATTACGTTCCGGCCGAGCCAGAAGATTGGCTTGATTGGTCTATACTTTTGTAAGAAATCCAAATAAAATCCTAGAGCTAGAGCCTAGAAACGATCCAGCAAGATAGACCTCTACTTGTAATTGGGCATAGAATTGATGCTCCATCATTGTTTACTTGTTCATTTGAGATACTGAGGCCAAGAATCATAGATAAAAGTAGTTGATCTTGTTCTATGCAATTGTTCTCCTGATATGTCTGATTATTTTTACTGGAAAAGTTTGCCTGATAAAATTTTTCTGCTCCATTTTTTAGTGTCAGAAAAGATGTTAAATCCTTTTTGTTTCATTAAGATAGATATGGAGAGGTTTAAGTTGTTCTATATTGTATTTGACCTGCGGCTTCATTTAATATATTTTGATTTATTTGCAGTGTCGATGATGGATTTCCTCCAGTTATTTTCCATTTCGAGAATTCACTCTTGTTGAAGGTTTACCCTCATGAGTATCTATTCCAATATGTAAGTTTGCTGGACTGAGATTGTGTGTGCACTAACCCTTGAGCTTGCTGCAGGAATGGGTGATTTTTAAAATTTCAACGTCTAAGATTAGGCTGCGCTTGTTAATTGCTTACGTTCACAGTACTTCTAGATTCATAATGTATTAACGTTTTAAGATCCAATTGAATGTTTTCGGACTGCAATATTTCAACTTAGGACTTGACACTGTTCATAATCAATTTGTGCTTTTGTCCTTTGGATGGGAAATTTTCCCCTTTTTTGTAATTTTATACTACTAATGAAATTGTTTATCCAAATAAAAAAAAAGTGTTGTCTTGTTCTCATCTCATGAGATTTAGATGAGATTTCCAAACATGAATTTAATACCTAGTGAAGTACTTGGGATTATAGTCACTATTTTCCCTAACATTCATGATTAAGTTTGACGATTAAATTTATTATACTATTGAAAATCAAATGAAATGACATTGTGAGCAAAACAACCTTGATGGGCCAGTTCTTTATTTTGTTGATAAGAAACGACAAATTGTAGTCCACAAAACAAAGGAACAGCCTAAGGGCTGGGCAGAGAATCCCCACCCAAGAGCTACCTTTACAAAAGGATCGCCTCCTAATTGTATCTTAAGAGGACAAGCCAGTTCTTTTTTTTTTTCTCGAAATCCGTGTGTCCAGGCCAGCTTGGACGCACCTTGACTATCACGGGTCAACCGCCCGACCCTACTATATTTGGTTGACAAGGTAACTCGTAGGATATTAAATCCTAGGTAGGTGGTCACCATGGTTTGAACTCATTCCGTCTAAGTCCTTTGCTATTTACATTGGCCCTTATTGACCACTTGAGCTATCTCATGATGGTTAGACGGGCCAGTTCTTTATAATCTAACTTTGTAGACCAGATAGACATTTTTATTTATTAGTATTATTCTTTTGTAGCTTTCTATTTTAGTTTCATTCAATGTAGTTCCTGCATATATATATATATGATCAATAGTCTAGTCGGCTTGCGAAAGACGATTGTGTTTACATGCTTCATCAGACCTTACAATTAGCCTTATCTGCTCTGAAACATATTTGATGCTTTTGCACCCTGAAAGGTTTTGATGCTTTTGCACCCAAACGATTTCCATAGCAGTGTTTAGCCTGTGTACTGAAATTTTGTTTTCTAATCAGTTTATTTCTCTCCTTGGGGTAGATGCATGAGTTGGTTTACTAATATATACTTCTACAGGAGGGCTTGTGGTGTATGGGTTGGCAAAACAGTGGGATGCAATCTAGGGATAGGAAGAACGTTACCCTCTTTGGAGGTAAACTGCTAAACTCCAAATTTCTTCCATGTGAATATTAATTCATGTCTAAGAATTCTAATTCTGCTGCTATTGGGAAGAAGGCTGCTAACTATTGTTGTATTCATAATTCCTCACCTTTCTGTTGATCGTAGGACCACGAGTTGTGTATTCAGGCAGTTTGCTTTTGAGAGCTCTTTATTACTGAATATTTTATGGGTTAAAAATCATGCCACAAAAAAACTAAACTAAAGGCTCGCATGATAACCATTTTTTAAGTGACGTACTTTTTAAAATTAAGCTTATAACACTATTTCCACCTATTTACTTTTTTGTTTTGTAAATTAATAGTAGTTTTCTTTAAAATTAAGCTTATAAACACTATTTCCACCTATTTACTTTTTTGTTTCGTAAACTAATAGTAGTTTTCAAAAACTAGTTTTTGTTTATAAAATTTGACTAAGAATTCAAATGTACTTGTAAGAAACCGTGAAAACCCTACCAAAGAAATTGAGAAGAAGTAGGTATAATTGGGGGTGTGATCATCGGCCGGTTGGATTTGGTTTTTGGCCAAAATTGACTATTTTACTTCTCTTCTTTGTACTGAGCGACCGATCGATCGGTTCAGTCGGTTGGTTTTTTGGTTTAAAGTTTCTTTTTTTTTTAATTTTTATAATTATTGTTATTATTAGAATGTTTAAGAGCATGAAAAAGAATGCTAAAAAACTTGTAAAATAACTAAAAAAAGAACAAAAAATGAGTCGAGGAAGAACGTAAGGAAAAGTCTTACTATGGCATCTAACGTGAATGAATGGGGAGAAAGAAGAAAGGGGATGAAAAGAAAACCTTTCAAGAAAGATTGGAAAAAAAATAATTAAAAAGGAAAACGTTGAAAACGAATAAAAAAAATTAAAAGAAAGGGGATGAAGAAAACCCTTCGAGAAAGATTGGAAAAAATTAATTAAAAAGGAAAACATTGAAAACGAATAAAAAAAAAAATAAAAGAAACATTGAAAACCATGGCTGTTGGTTTTCAAGGTTGCTGCAGTTTGCATAGTATTCATTTTTTAAAAATAAAATATTAAATTTTTTTAACAAATTATTGGAAGTCGGTCAGTTGTCGGAAATTACTTTTATGACCAGATCAACCGACTGATGAGTCAGTTTCGTAAATTACAAAACCGACTCCGACCACAATCATGATTAAACCGATCAACTGATGTCGGTTCGGTTGGTTTTGATCGGTCGGATCGGTTTTTTGAAGTTCTTTCCTCACCCTAGGCATAATTTTTAGAAACAAGTAACTAAAAATGGGGCCTGAGTGATGGTTTGGTACTATGCCTTTTTCTTGCTATGTTGCTATGATGAATGATGACATATTTCATTTTTTGAGAATATAAGAATATAGAGTGGAGATTTGAACCAGGAAAATTATTTAATGTATCTCCTTTGCTTGCAGATCTAGTGCTATCAAATAAGCTAGTTTTATATGATCTTGAAAACCAGACAATCGGGTGGACCGAGTACAACTGTGAGTATAAAGTTTTTGCTTTACTGTCTTTGAAGTTTAATATCTCTGAATGTCATTAGTATTACATGTTTTGCTGCACATTTTTTCCCGACATTTACTGTATTTTGTCTCAACTTCGGTGTTTTTTTTTCATTGTTTTTTTTTACGAGAAAGAAAACTTTCCAATAACAAATCAAAGAATACAAGAGGGAAATACTGACGATTTTTTATTATTTTTTACCCTAGCCTCATTATGGAAGATTTTGATAAAATCCAATTGCACGAACACAAACACATGCATACAGGCTAGGTTTGATCTCGCTAGATTGTAGCCCGTTCTTTCAGTTTGGCTCTTTTTTGGGGCTGTTTTTGTATGTCCTTTTGTTTTTCTTTTCTTTTTTTTTTTTTTCATTTTTCTAAATGAAAGCTTGGTTTTTTCATAAAAGGAAAAAAAAAGGCCATAAGGCATATTTTATATGTTCACAAATCTCATATCTCAATGATTATTGGTAAAGCCTGATAGGTTCAAATCCTAGCATGGTAAATAATACCTGAACTTTCAAATATTTCACTTTAGTTTCTCAATTTTCAAAATTTCCACTATAGTTCCTAAACTTCATATAGAACTCATGTTAGTCTTTGCGATTAACTTTTCATTAATCATTAGAAAAATCTAAATTTGTTTCTTGAAACCTCTTCTACTAGATGTAGCTTAAAAAATTTACCAACACAAACATCTCCTCATCAAATAAATGAACAAAAGATACATTTATGAAAAGGAATTAACAATTATGTAAAGTTTGGAAACAAAATATTTAAAAGTTTAGTCCCCAAACAGCCCAGTCGACAAGGGCTTGGGATCTCTTGGTCATATTGGCTTAGAGGTCTCAATCTTAATCTTCTAGTGAGCTTAATACCAAAAACCTTTGATGTCTTTTGGATCTGGACCTTTGAGCGGGCACGGATGCCCTTGGGTATAGGGGAGCGAAACTCCGACTCCCAGTTCTAAAATATATATATATTTAAAAGTTTAGGAACCAAAATAAGATTTAAATCCTAATAATATTTCTTATTTCTTTTGATAGTTAGTGGACAAACTTATTGACAGTCTTTGAGACATCTTCAGAGCCCACTGATTCACACTGGTCAATCAATACCATTTTCACATTTCCAGTAATTGATTTTCAGGTTCTTCAAACATCAAAGTGCAGGATGAACAGACTGGAACAGTTCATTTAGTTGGTTCACATTCCATTTCTTCAGCCTGCAGATTGAATTCCCAATGGGCTGTGATCTTGTTGTTCCTGATCTTGCTTATGCACTGGTCAGCTCATTTCAGATGTTTCAGC

mRNA sequence

ATGGCCACCGCCGGCATGGGTACTTCACGCCCTTTGACTCTGTTGCTCCTTGTAATCTTCAACTTGCTTTCCAACACCATCACCGGCGGCGGCGGTGTCTCTGCTGAAAGCGGCGTGTTCAGCGTCAAATACAAGTATGCCGGCCGGGAGCGTTCTCTTAGTACCCTTAAGGCGCACGACATCAGCCGTCAGCTTCGATTTCTTGCCGGCGTCGATATCCCACTCGGTGGCTCCGGCCGACCTGATGCCGTCGGGCTTTACTATGCAAAAATTGGAATTGGAACTCCTCCCAAGGACTATTATGTTCAAGTTGATACAGGGAGTGACATCGTGTGGGTGAATTGTATTCAGTGTATGGAGTGTCCGAGGAAAAGCACTCTTGGGATGGAGCTCACTCCGTATAATATAGAGGAATCCACAACTGGGAAATTGGTTTCCTGCGATGAACAATTTTGCTTGGAGGTGAATGGGGGTCCACTTTCAGGCTGCACTGGTAATATGTCCTGTCCATACCTTCAGATATACGGCGACGGGAGCTCAACTGCTGGATACTTCATCAAGGACTACGTACAATATGATCGGGTTTCTGGAGACCTTGAAACTACAGCTGCTAATGGAAGTATCAAGTTTGGATGTGGTGCCCGACAATCTGGGGATCTAGGCTCATCCGGCGAAGAAGCACTTGATGGCATACTTGGATTTGGGAAATCAAATTCATCTATTATTTCACAACTAGCCTCCTCTAGAAAAGTGAAAAAGATGTTTGCTCATTGCCTAGATGGAATAAATGGCGGTGGTATCTTTGCAATGGGACATGTTGTTCAGCCAAAAGTTAACATGACTCCGTTGGTACCAAATCAGCCACATTACAATGTCAATATGACGGGAGTACAAGTTGGTCGTGTCATGTTAAATATTTCGGCTGATGTATTTGAGGCAGGAGACAGAAAAGGGACAATCATTGATATTGGTACAACTTTGGCATATCTTCCAGAATTGATTTACGGACCATTAGTAACCATGATAATCTCACGGCAACCTAATTTGGAAGTTCAAACCATTCATGGAGAGTATAAATGTTTTCAGTACTCAGAAAGTGTCGATGATGGATTTCCTCCAGTTATTTTCCATTTCGAGAATTCACTCTTGTTGAAGGTTTACCCTCATGAGTATCTATTCCAATATGAGGGCTTGTGGTGTATGGGTTGGCAAAACAGTGGGATGCAATCTAGGGATAGGAAGAACGTTACCCTCTTTGGAGATCTAGTGCTATCAAATAAGCTAGTTTTATATGATCTTGAAAACCAGACAATCGGGTGGACCGAGTACAACTGTTCTTCAAACATCAAAGTGCAGGATGAACAGACTGGAACAGTTCATTTAGTTGGTTCACATTCCATTTCTTCAGCCTGCAGATTGAATTCCCAATGGGCTGTGATCTTGTTGTTCCTGATCTTGCTTATGCACTGGTCAGCTCATTTCAGATGTTTCAGC

Coding sequence (CDS)

ATGGCCACCGCCGGCATGGGTACTTCACGCCCTTTGACTCTGTTGCTCCTTGTAATCTTCAACTTGCTTTCCAACACCATCACCGGCGGCGGCGGTGTCTCTGCTGAAAGCGGCGTGTTCAGCGTCAAATACAAGTATGCCGGCCGGGAGCGTTCTCTTAGTACCCTTAAGGCGCACGACATCAGCCGTCAGCTTCGATTTCTTGCCGGCGTCGATATCCCACTCGGTGGCTCCGGCCGACCTGATGCCGTCGGGCTTTACTATGCAAAAATTGGAATTGGAACTCCTCCCAAGGACTATTATGTTCAAGTTGATACAGGGAGTGACATCGTGTGGGTGAATTGTATTCAGTGTATGGAGTGTCCGAGGAAAAGCACTCTTGGGATGGAGCTCACTCCGTATAATATAGAGGAATCCACAACTGGGAAATTGGTTTCCTGCGATGAACAATTTTGCTTGGAGGTGAATGGGGGTCCACTTTCAGGCTGCACTGGTAATATGTCCTGTCCATACCTTCAGATATACGGCGACGGGAGCTCAACTGCTGGATACTTCATCAAGGACTACGTACAATATGATCGGGTTTCTGGAGACCTTGAAACTACAGCTGCTAATGGAAGTATCAAGTTTGGATGTGGTGCCCGACAATCTGGGGATCTAGGCTCATCCGGCGAAGAAGCACTTGATGGCATACTTGGATTTGGGAAATCAAATTCATCTATTATTTCACAACTAGCCTCCTCTAGAAAAGTGAAAAAGATGTTTGCTCATTGCCTAGATGGAATAAATGGCGGTGGTATCTTTGCAATGGGACATGTTGTTCAGCCAAAAGTTAACATGACTCCGTTGGTACCAAATCAGCCACATTACAATGTCAATATGACGGGAGTACAAGTTGGTCGTGTCATGTTAAATATTTCGGCTGATGTATTTGAGGCAGGAGACAGAAAAGGGACAATCATTGATATTGGTACAACTTTGGCATATCTTCCAGAATTGATTTACGGACCATTAGTAACCATGATAATCTCACGGCAACCTAATTTGGAAGTTCAAACCATTCATGGAGAGTATAAATGTTTTCAGTACTCAGAAAGTGTCGATGATGGATTTCCTCCAGTTATTTTCCATTTCGAGAATTCACTCTTGTTGAAGGTTTACCCTCATGAGTATCTATTCCAATATGAGGGCTTGTGGTGTATGGGTTGGCAAAACAGTGGGATGCAATCTAGGGATAGGAAGAACGTTACCCTCTTTGGAGATCTAGTGCTATCAAATAAGCTAGTTTTATATGATCTTGAAAACCAGACAATCGGGTGGACCGAGTACAACTGTTCTTCAAACATCAAAGTGCAGGATGAACAGACTGGAACAGTTCATTTAGTTGGTTCACATTCCATTTCTTCAGCCTGCAGATTGAATTCCCAATGGGCTGTGATCTTGTTGTTCCTGATCTTGCTTATGCACTGGTCAGCTCATTTCAGATGTTTCAGC

Protein sequence

MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCMECPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSSTAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKCFQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVILLFLILLMHWSAHFRCFS
Homology
BLAST of MS000634 vs. NCBI nr
Match: XP_022150427.1 (aspartic proteinase-like protein 2 [Momordica charantia])

HSP 1 Score: 1018.8 bits (2633), Expect = 1.5e-293
Identity = 498/498 (100.00%), Postives = 498/498 (100.00%), Query Frame = 0

Query: 1   MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHD 60
           MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHD
Sbjct: 1   MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHD 60

Query: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME 120
           ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME
Sbjct: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME 120

Query: 121 CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS 180
           CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS
Sbjct: 121 CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS 180

Query: 181 TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240
           TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS
Sbjct: 181 TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240

Query: 241 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300
           IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG
Sbjct: 241 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300

Query: 301 RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC 360
           RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC
Sbjct: 301 RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC 360

Query: 361 FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG 420
           FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG
Sbjct: 361 FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG 420

Query: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI 480
           DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI
Sbjct: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI 480

Query: 481 LLFLILLMHWSAHFRCFS 499
           LLFLILLMHWSAHFRCFS
Sbjct: 481 LLFLILLMHWSAHFRCFS 498

BLAST of MS000634 vs. NCBI nr
Match: XP_038886489.1 (aspartic proteinase 39 [Benincasa hispida] >XP_038886490.1 aspartic proteinase 39 [Benincasa hispida])

HSP 1 Score: 963.8 bits (2490), Expect = 5.9e-277
Identity = 469/498 (94.18%), Postives = 484/498 (97.19%), Query Frame = 0

Query: 1   MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHD 60
           MATAG+GTSRPLTLLL VI N LSNTITGGGGV A++GVFSVKYKYAGRERSLSTLKAHD
Sbjct: 1   MATAGIGTSRPLTLLLFVIINSLSNTITGGGGVYADNGVFSVKYKYAGRERSLSTLKAHD 60

Query: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME 120
           ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQC E
Sbjct: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCRE 120

Query: 121 CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS 180
           CPRKS+LGMELTPY++EESTTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIYGDGSS
Sbjct: 121 CPRKSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSS 180

Query: 181 TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240
           TAGYF+KDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS
Sbjct: 181 TAGYFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240

Query: 241 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300
           IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQP+VNMTPLVPNQPHYNVNMTGVQVG
Sbjct: 241 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPRVNMTPLVPNQPHYNVNMTGVQVG 300

Query: 301 RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC 360
           RVMLNISADVFEAGDRKGTIID GTTLAYLPELIY PLVTMI+S+Q NLEVQTIHGEYKC
Sbjct: 301 RVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYKPLVTMILSQQHNLEVQTIHGEYKC 360

Query: 361 FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG 420
           FQYSE VDDGFPPVIFHFENSLLLKVYPHEYLFQYE LWC+GWQNSGMQSRDRKNVTLFG
Sbjct: 361 FQYSERVDDGFPPVIFHFENSLLLKVYPHEYLFQYESLWCIGWQNSGMQSRDRKNVTLFG 420

Query: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI 480
           DLVLSNKLVLYDLENQ+IGWTEYNCSS+IKVQDEQTGTVHLVGSH ISSA RLN++W VI
Sbjct: 421 DLVLSNKLVLYDLENQSIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSANRLNTKWGVI 480

Query: 481 LLFLILLMHWSAHFRCFS 499
           LLFLILLMHWSAHFRCFS
Sbjct: 481 LLFLILLMHWSAHFRCFS 498

BLAST of MS000634 vs. NCBI nr
Match: XP_016900131.1 (PREDICTED: aspartic proteinase-like protein 2 isoform X1 [Cucumis melo])

HSP 1 Score: 943.7 bits (2438), Expect = 6.3e-271
Identity = 460/498 (92.37%), Postives = 476/498 (95.58%), Query Frame = 0

Query: 1   MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHD 60
           MATAG+GTSRPLTLLL +I NLLSNTITGGG V A++GVFSVKYKYAGRERSLSTLKAHD
Sbjct: 1   MATAGIGTSRPLTLLLFLIINLLSNTITGGGRVYADNGVFSVKYKYAGRERSLSTLKAHD 60

Query: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME 120
           ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTP KDYYVQVDTGSDIVWVNCIQC E
Sbjct: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRE 120

Query: 121 CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS 180
           CPR S+LGMELTPY++EESTTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIYGDGSS
Sbjct: 121 CPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSS 180

Query: 181 TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240
           TAGYF+KDYVQY+RVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS
Sbjct: 181 TAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240

Query: 241 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300
           IISQLASSRKVKKMFAHCLDG NGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG
Sbjct: 241 IISQLASSRKVKKMFAHCLDGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300

Query: 301 RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC 360
            VMLNISADVFEAGDRKGTIID GTTLAYLPELIY PLV  I+S+Q NLEVQTIHGEYKC
Sbjct: 301 HVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKC 360

Query: 361 FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG 420
           FQYSE VDDGFPPVIFHFENSLLLKVYPHEYLFQYE LWC+GWQNSGMQSRDRKNVTLFG
Sbjct: 361 FQYSERVDDGFPPVIFHFENSLLLKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLFG 420

Query: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI 480
           DLVLSNKLVLYDLENQTIGWTEYNCSS+IKVQDEQTGTVHLVGSH +SSA RLN++W VI
Sbjct: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYLSSAKRLNTKWGVI 480

Query: 481 LLFLILLMHWSAHFRCFS 499
            LFLILLMHWSAH RCFS
Sbjct: 481 FLFLILLMHWSAHSRCFS 498

BLAST of MS000634 vs. NCBI nr
Match: XP_004140876.1 (aspartic proteinase-like protein 2 [Cucumis sativus] >KGN45989.1 hypothetical protein Csa_005288 [Cucumis sativus])

HSP 1 Score: 943.3 bits (2437), Expect = 8.2e-271
Identity = 457/498 (91.77%), Postives = 477/498 (95.78%), Query Frame = 0

Query: 1   MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHD 60
           MATAG+GTSRPLTLLL +I NLLSNTI GGGGV A++G+FSVKYKYAGRERSLSTLKAHD
Sbjct: 1   MATAGIGTSRPLTLLLFLIINLLSNTINGGGGVYADNGIFSVKYKYAGRERSLSTLKAHD 60

Query: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME 120
           ISRQLRFLAG+DIPLGGSGRPDAVGLYYAKIGIGTP KDYYVQVDTGSDIVWVNCIQC E
Sbjct: 61  ISRQLRFLAGIDIPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRE 120

Query: 121 CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS 180
           CPR S+LGMELTPY++EESTTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIYGDGSS
Sbjct: 121 CPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSS 180

Query: 181 TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240
           TAGYF+KDYVQY+RVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS
Sbjct: 181 TAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240

Query: 241 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300
           IISQLAS+RKVKKMFAHCLDG NGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG
Sbjct: 241 IISQLASTRKVKKMFAHCLDGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300

Query: 301 RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC 360
            ++LNISADVFEAGDRKGTIID GTTLAYLPELIY PLV  I+S+Q NLEVQTIHGEYKC
Sbjct: 301 HIILNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKC 360

Query: 361 FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG 420
           FQYSE VDDGFPPVIFHFENSLLLKVYPHEYLFQYE LWC+GWQNSGMQSRDRKNVTLFG
Sbjct: 361 FQYSERVDDGFPPVIFHFENSLLLKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLFG 420

Query: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI 480
           DLVLSNKLVLYDLENQTIGWTEYNCSS+IKVQDEQTGTVHLVGSH ISSA RLN++W VI
Sbjct: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAKRLNTKWGVI 480

Query: 481 LLFLILLMHWSAHFRCFS 499
           LLFLILLMHWSAH RCFS
Sbjct: 481 LLFLILLMHWSAHSRCFS 498

BLAST of MS000634 vs. NCBI nr
Match: XP_023005003.1 (aspartic proteinase-like protein 2 [Cucurbita maxima] >XP_023005004.1 aspartic proteinase-like protein 2 [Cucurbita maxima] >XP_023005005.1 aspartic proteinase-like protein 2 [Cucurbita maxima])

HSP 1 Score: 934.5 bits (2414), Expect = 3.8e-268
Identity = 457/503 (90.85%), Postives = 479/503 (95.23%), Query Frame = 0

Query: 1   MATAGMGTSRPLTLLLLVIFNLLSNTITGGG-----GVSAESGVFSVKYKYAGRERSLST 60
           MA+A +GTSRP T+LL VI NLLS+TI GGG     GV A++GVFSVKYKYAGRERSLST
Sbjct: 1   MASAAIGTSRPFTVLLFVIINLLSSTILGGGGGVAVGVYADNGVFSVKYKYAGRERSLST 60

Query: 61  LKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNC 120
           LKAHDI+RQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPK+YYVQVDTGSDIVWVNC
Sbjct: 61  LKAHDINRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKNYYVQVDTGSDIVWVNC 120

Query: 121 IQCMECPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIY 180
           IQC ECPR+S+LGMELT Y++E+STTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIY
Sbjct: 121 IQCKECPRRSSLGMELTTYDLEQSTTGKLVSCDEQFCLEVNGGPLSGCTANMSCPYLQIY 180

Query: 181 GDGSSTAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG 240
           GDGSSTAG F+KDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG
Sbjct: 181 GDGSSTAGIFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG 240

Query: 241 KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMT 300
           KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKV MTPLVPNQPHYNVNMT
Sbjct: 241 KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMT 300

Query: 301 GVQVGRVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIH 360
           GVQVGRVMLNISADVFEAGDRKGTIID GTTLAYLPELIY PLVTMI+SRQ NLEVQ+IH
Sbjct: 301 GVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQSIH 360

Query: 361 GEYKCFQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKN 420
           GEYKCFQYS SVDDGFPPV FHFENSLLLKVYPHEYLFQ+EGLWC+GWQNSGMQSRDRKN
Sbjct: 361 GEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSRDRKN 420

Query: 421 VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNS 480
           VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSS+IKVQDEQTGTVHLVGSH ISSA RLN+
Sbjct: 421 VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAYRLNT 480

Query: 481 QWAVILLFLILLMHWSAHFRCFS 499
           +WAVILLFLIL+MHWSAHFRC S
Sbjct: 481 KWAVILLFLILVMHWSAHFRCLS 503

BLAST of MS000634 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 439.9 bits (1130), Expect = 3.9e-122
Identity = 207/436 (47.48%), Postives = 305/436 (69.95%), Query Frame = 0

Query: 39  VFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPK 98
           VF  ++K+AG++++L   K+HD  R  R LA +D+PLGG  R D+VGLY+ KI +G+PPK
Sbjct: 26  VFKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPK 85

Query: 99  DYYVQVDTGSDIVWVNCIQCMECPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGG 158
           +Y+VQVDTGSDI+W+NC  C +CP K+ L   L+ +++  S+T K V CD+ FC  ++  
Sbjct: 86  EYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQS 145

Query: 159 PLSGCTGNMSCPYLQIYGDGSSTAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSG 218
               C   + C Y  +Y D S++ G FI+D +  ++V+GDL+T      + FGCG+ QSG
Sbjct: 146 --DSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSG 205

Query: 219 DLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKV 278
            LG +G+ A+DG++GFG+SN+S++SQLA++   K++F+HCLD + GGGIFA+G V  PKV
Sbjct: 206 QLG-NGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKV 265

Query: 279 NMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPL 338
             TP+VPNQ HYNV + G+ V    L++   +   G   GTI+D GTTLAY P+++Y  L
Sbjct: 266 KTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNG---GTIVDSGTTLAYFPKVLYDSL 325

Query: 339 VTMIISRQPNLEVQTIHGEYKCFQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQY-EG 398
           +  I++RQP +++  +   ++CF +S +VD+ FPPV F FE+S+ L VYPH+YLF   E 
Sbjct: 326 IETILARQP-VKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEE 385

Query: 399 LWCMGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTG 458
           L+C GWQ  G+ + +R  V L GDLVLSNKLV+YDL+N+ IGW ++NCSS+IK++D  +G
Sbjct: 386 LYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSSIKIKD-GSG 445

Query: 459 TVHLVGSHSISSACRL 474
            V+ VG+ ++SSA RL
Sbjct: 446 GVYSVGADNLSSAPRL 453

BLAST of MS000634 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 439.5 bits (1129), Expect = 5.1e-122
Identity = 219/481 (45.53%), Postives = 317/481 (65.90%), Query Frame = 0

Query: 12  LTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGV 71
           ++ ++ V+F L+   ++G       + VF+V +K+AG+E+ LS LK+HD  R  R LA +
Sbjct: 10  ISRIVAVVFVLVIQVVSG-------NFVFNVTHKFAGKEKQLSELKSHDSFRHARMLANI 69

Query: 72  DIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCMECPRKSTLGMEL 131
           D+PLGG  R D++GLY+ KI +G+PPK+YYVQVDTGSDI+WVNC  C +CP K+ LG+ L
Sbjct: 70  DLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPL 129

Query: 132 TPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSSTAGYFIKDYVQ 191
           + Y+ + S+T K V C++ FC  +       C     C Y  +YGDGS++ G FIKD + 
Sbjct: 130 SLYDSKTSSTSKNVGCEDDFCSFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFIKDNIT 189

Query: 192 YDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKV 251
            ++V+G+L T      + FGCG  QSG LG + + A+DGI+GFG+SN+SIISQLA+    
Sbjct: 190 LEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQT-DSAVDGIMGFGQSNTSIISQLAAGGST 249

Query: 252 KKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF 311
           K++F+HCLD +NGGGIFA+G V  P V  TP+VPNQ HYNV + G+ V    +++   + 
Sbjct: 250 KRIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLA 309

Query: 312 EAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKCFQYSESVDDGF 371
                 GTIID GTTLAYLP+ +Y  L+  I ++Q  +++  +   + CF ++ + D  F
Sbjct: 310 STNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQ-QVKLHMVQETFACFSFTSNTDKAF 369

Query: 372 PPVIFHFENSLLLKVYPHEYLFQY-EGLWCMGWQNSGMQSRDRKNVTLFGDLVLSNKLVL 431
           P V  HFE+SL L VYPH+YLF   E ++C GWQ+ GM ++D  +V L GDLVLSNKLV+
Sbjct: 370 PVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVV 429

Query: 432 YDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACR--LNSQWAVILLFLILLM 490
           YDLEN+ IGW ++NCSS+IKV+D  +G  + +G+ ++ SA    +N     +L  LI + 
Sbjct: 430 YDLENEVIGWADHNCSSSIKVKD-GSGAAYQLGAENLISAASSVMNGTLVTLLSILIWVF 478

BLAST of MS000634 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 8.3e-32
Identity = 111/426 (26.06%), Postives = 184/426 (43.19%), Query Frame = 0

Query: 36  ESGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGT 95
           +SG    KY+   R       +   I+  L+  +G++ P+         G Y   + IGT
Sbjct: 50  DSGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGD-----GEYLMNVAIGT 109

Query: 96  PPKDYYVQVDTGSDIVWVNCIQCMECPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEV 155
           P   +   +DTGSD++W  C  C +C  + T       +N ++S++   + C+ Q+C ++
Sbjct: 110 PDSSFSAIMDTGSDLIWTQCEPCTQCFSQPT-----PIFNPQDSSSFSTLPCESQYCQDL 169

Query: 156 NGGPLSGCTGNMSCPYLQIYGDGSSTAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGAR 215
              P   C  N  C Y   YGDGS+T GY   +   ++        T++  +I FGCG  
Sbjct: 170 ---PSETCNNN-ECQYTYGYGDGSTTQGYMATETFTFE--------TSSVPNIAFGCGED 229

Query: 216 QSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLD--GINGGGIFAMGHV 275
             G    +G     G++G G    S+ SQL   +     F++C+   G +     A+G  
Sbjct: 230 NQGFGQGNGA----GLIGMGWGPLSLPSQLGVGQ-----FSYCMTSYGSSSPSTLALGSA 289

Query: 276 V------QPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGD--RKGTIIDIGT 335
                   P   +     N  +Y + + G+ VG   L I +  F+  D    G IID GT
Sbjct: 290 ASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGT 349

Query: 336 TLAYLPELIYGPLVTMIISRQPNLEV--QTIHGEYKCFQY-SESVDDGFPPVIFHFENSL 395
           TL YLP+  Y   V    + Q NL    ++  G   CFQ  S+      P +   F+  +
Sbjct: 350 TLTYLPQDAYN-AVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGV 409

Query: 396 LLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTE 449
           L     +  +   EG+ C+      M S  +  +++FG++      VLYDL+N  + +  
Sbjct: 410 LNLGEQNILISPAEGVICL-----AMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVP 438

BLAST of MS000634 vs. ExPASy Swiss-Prot
Match: Q3EBM5 (Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g35615 PE=3 SV=1)

HSP 1 Score: 125.6 bits (314), Expect = 1.6e-27
Identity = 112/414 (27.05%), Postives = 179/414 (43.24%), Query Frame = 0

Query: 61  ISRQLRF---LAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQ 120
           +SR  RF   L+  D+    SG   A G ++  I IGTPP   +   DTGSD+ WV C  
Sbjct: 59  VSRSRRFNHQLSQTDLQ---SGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKP 118

Query: 121 CMECPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGD 180
           C +C +++        ++ ++S+T K   CD + C  ++         N  C Y   YGD
Sbjct: 119 CQQCYKENG-----PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGD 178

Query: 181 GSSTAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKS 240
            S + G    + V  D  SG     +  G++ FGCG    G    +G     GI+G G  
Sbjct: 179 QSFSKGDVATETVSIDSASG--SPVSFPGTV-FGCGYNNGGTFDETG----SGIIGLGGG 238

Query: 241 NSSIISQLASSRKVKKMFAHCLD----GINGGGIFAMGHVVQPK-------VNMTPLVPN 300
           + S+ISQL SS  + K F++CL       NG  +  +G    P        V  TPLV  
Sbjct: 239 HLSLISQLGSS--ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDK 298

Query: 301 QP--HYNVNMTGVQVGRVMLNISADVFEAGD-------RKGTIIDIGTTLAYLPELIYGP 360
           +P  +Y + +  + VG+  +  +   +   D           IID GTTL  L    +  
Sbjct: 299 EPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDK 358

Query: 361 LVTMIISRQPNLE-VQTIHGEYK-CFQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQY 420
             + +       + V    G    CF+ S S + G P +  HF  + +     + ++   
Sbjct: 359 FSSAVEESVTGAKRVSDPQGLLSHCFK-SGSAEIGLPEITVHFTGADVRLSPINAFVKLS 418

Query: 421 EGLWCMGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSNI 450
           E + C+    +         V ++G+    + LV YDLE +T+ +   +CS+N+
Sbjct: 419 EDMVCLSMVPT-------TEVAIYGNFAQMDFLVGYDLETRTVSFQHMDCSANL 447

BLAST of MS000634 vs. ExPASy Swiss-Prot
Match: Q9LTW4 (Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE=1 SV=1)

HSP 1 Score: 123.2 bits (308), Expect = 8.0e-27
Identity = 110/405 (27.16%), Postives = 178/405 (43.95%), Query Frame = 0

Query: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME 120
           ISR+     GV + L GSG       Y+ +I +GTP K + V VDTGS++ WVNC     
Sbjct: 81  ISRKRNSTVGVKMDL-GSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNC----- 140

Query: 121 CPRKSTLGME-LTPYNIEESTTGKLVSCDEQFCLE--VNGGPLSGC-TGNMSCPYLQIYG 180
             R    G +    +  +ES + K V C  Q C    +N   L+ C T +  C Y   Y 
Sbjct: 141 --RYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYA 200

Query: 181 DGSSTAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGK 240
           DGS+  G F K+ +     +G +     +     GC +  +G       +  DG+LG   
Sbjct: 201 DGSAAQGVFAKETITVGLTNGRMARLPGH---LIGCSSSFTG----QSFQGADGVLGLAF 260

Query: 241 SNSSIISQLASSRKVKKMFAHCL------DGINGGGIFAMGHVVQPKVNMT---PLVPNQ 300
           S+ S  S   S    K  F++CL        ++   IF      +     T    L    
Sbjct: 261 SDFSFTSTATSLYGAK--FSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIP 320

Query: 301 PHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQP 360
           P Y +N+ G+ +G  ML+I + V++A    GTI+D GT+L  L +  Y  +VT +   + 
Sbjct: 321 PFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGL--ARY 380

Query: 361 NLEVQTIHGEYKCFQYSESVDDGF-----PPVIFHFENSLLLKVYPHEYLFQ-YEGLWCM 420
            +E++ +  E    +Y  S   GF     P + FH +     + +   YL     G+ C+
Sbjct: 381 LVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCL 440

Query: 421 GWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCS 447
           G+ ++G  +       + G+++  N L  +DL   T+ +    C+
Sbjct: 441 GFVSAGTPA-----TNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461

BLAST of MS000634 vs. ExPASy TrEMBL
Match: A0A6J1DAP8 (aspartic proteinase-like protein 2 OS=Momordica charantia OX=3673 GN=LOC111018586 PE=3 SV=1)

HSP 1 Score: 1018.8 bits (2633), Expect = 7.5e-294
Identity = 498/498 (100.00%), Postives = 498/498 (100.00%), Query Frame = 0

Query: 1   MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHD 60
           MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHD
Sbjct: 1   MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHD 60

Query: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME 120
           ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME
Sbjct: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME 120

Query: 121 CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS 180
           CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS
Sbjct: 121 CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS 180

Query: 181 TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240
           TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS
Sbjct: 181 TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240

Query: 241 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300
           IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG
Sbjct: 241 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300

Query: 301 RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC 360
           RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC
Sbjct: 301 RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC 360

Query: 361 FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG 420
           FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG
Sbjct: 361 FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG 420

Query: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI 480
           DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI
Sbjct: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI 480

Query: 481 LLFLILLMHWSAHFRCFS 499
           LLFLILLMHWSAHFRCFS
Sbjct: 481 LLFLILLMHWSAHFRCFS 498

BLAST of MS000634 vs. ExPASy TrEMBL
Match: A0A1S4DVW7 (aspartic proteinase-like protein 2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103488659 PE=3 SV=1)

HSP 1 Score: 943.7 bits (2438), Expect = 3.1e-271
Identity = 460/498 (92.37%), Postives = 476/498 (95.58%), Query Frame = 0

Query: 1   MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHD 60
           MATAG+GTSRPLTLLL +I NLLSNTITGGG V A++GVFSVKYKYAGRERSLSTLKAHD
Sbjct: 1   MATAGIGTSRPLTLLLFLIINLLSNTITGGGRVYADNGVFSVKYKYAGRERSLSTLKAHD 60

Query: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME 120
           ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTP KDYYVQVDTGSDIVWVNCIQC E
Sbjct: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRE 120

Query: 121 CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS 180
           CPR S+LGMELTPY++EESTTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIYGDGSS
Sbjct: 121 CPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSS 180

Query: 181 TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240
           TAGYF+KDYVQY+RVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS
Sbjct: 181 TAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240

Query: 241 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300
           IISQLASSRKVKKMFAHCLDG NGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG
Sbjct: 241 IISQLASSRKVKKMFAHCLDGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300

Query: 301 RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC 360
            VMLNISADVFEAGDRKGTIID GTTLAYLPELIY PLV  I+S+Q NLEVQTIHGEYKC
Sbjct: 301 HVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKC 360

Query: 361 FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG 420
           FQYSE VDDGFPPVIFHFENSLLLKVYPHEYLFQYE LWC+GWQNSGMQSRDRKNVTLFG
Sbjct: 361 FQYSERVDDGFPPVIFHFENSLLLKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLFG 420

Query: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI 480
           DLVLSNKLVLYDLENQTIGWTEYNCSS+IKVQDEQTGTVHLVGSH +SSA RLN++W VI
Sbjct: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYLSSAKRLNTKWGVI 480

Query: 481 LLFLILLMHWSAHFRCFS 499
            LFLILLMHWSAH RCFS
Sbjct: 481 FLFLILLMHWSAHSRCFS 498

BLAST of MS000634 vs. ExPASy TrEMBL
Match: A0A0A0KAX9 (Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G041740 PE=3 SV=1)

HSP 1 Score: 943.3 bits (2437), Expect = 4.0e-271
Identity = 457/498 (91.77%), Postives = 477/498 (95.78%), Query Frame = 0

Query: 1   MATAGMGTSRPLTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHD 60
           MATAG+GTSRPLTLLL +I NLLSNTI GGGGV A++G+FSVKYKYAGRERSLSTLKAHD
Sbjct: 1   MATAGIGTSRPLTLLLFLIINLLSNTINGGGGVYADNGIFSVKYKYAGRERSLSTLKAHD 60

Query: 61  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME 120
           ISRQLRFLAG+DIPLGGSGRPDAVGLYYAKIGIGTP KDYYVQVDTGSDIVWVNCIQC E
Sbjct: 61  ISRQLRFLAGIDIPLGGSGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRE 120

Query: 121 CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS 180
           CPR S+LGMELTPY++EESTTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIYGDGSS
Sbjct: 121 CPRTSSLGMELTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSS 180

Query: 181 TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240
           TAGYF+KDYVQY+RVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS
Sbjct: 181 TAGYFVKDYVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 240

Query: 241 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300
           IISQLAS+RKVKKMFAHCLDG NGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG
Sbjct: 241 IISQLASTRKVKKMFAHCLDGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 300

Query: 301 RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC 360
            ++LNISADVFEAGDRKGTIID GTTLAYLPELIY PLV  I+S+Q NLEVQTIHGEYKC
Sbjct: 301 HIILNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKC 360

Query: 361 FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG 420
           FQYSE VDDGFPPVIFHFENSLLLKVYPHEYLFQYE LWC+GWQNSGMQSRDRKNVTLFG
Sbjct: 361 FQYSERVDDGFPPVIFHFENSLLLKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLFG 420

Query: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI 480
           DLVLSNKLVLYDLENQTIGWTEYNCSS+IKVQDEQTGTVHLVGSH ISSA RLN++W VI
Sbjct: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAKRLNTKWGVI 480

Query: 481 LLFLILLMHWSAHFRCFS 499
           LLFLILLMHWSAH RCFS
Sbjct: 481 LLFLILLMHWSAHSRCFS 498

BLAST of MS000634 vs. ExPASy TrEMBL
Match: A0A6J1KXX5 (aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111498125 PE=3 SV=1)

HSP 1 Score: 934.5 bits (2414), Expect = 1.9e-268
Identity = 457/503 (90.85%), Postives = 479/503 (95.23%), Query Frame = 0

Query: 1   MATAGMGTSRPLTLLLLVIFNLLSNTITGGG-----GVSAESGVFSVKYKYAGRERSLST 60
           MA+A +GTSRP T+LL VI NLLS+TI GGG     GV A++GVFSVKYKYAGRERSLST
Sbjct: 1   MASAAIGTSRPFTVLLFVIINLLSSTILGGGGGVAVGVYADNGVFSVKYKYAGRERSLST 60

Query: 61  LKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNC 120
           LKAHDI+RQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPK+YYVQVDTGSDIVWVNC
Sbjct: 61  LKAHDINRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKNYYVQVDTGSDIVWVNC 120

Query: 121 IQCMECPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIY 180
           IQC ECPR+S+LGMELT Y++E+STTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIY
Sbjct: 121 IQCKECPRRSSLGMELTTYDLEQSTTGKLVSCDEQFCLEVNGGPLSGCTANMSCPYLQIY 180

Query: 181 GDGSSTAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG 240
           GDGSSTAG F+KDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG
Sbjct: 181 GDGSSTAGIFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFG 240

Query: 241 KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMT 300
           KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKV MTPLVPNQPHYNVNMT
Sbjct: 241 KSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMT 300

Query: 301 GVQVGRVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIH 360
           GVQVGRVMLNISADVFEAGDRKGTIID GTTLAYLPELIY PLVTMI+SRQ NLEVQ+IH
Sbjct: 301 GVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQSIH 360

Query: 361 GEYKCFQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKN 420
           GEYKCFQYS SVDDGFPPV FHFENSLLLKVYPHEYLFQ+EGLWC+GWQNSGMQSRDRKN
Sbjct: 361 GEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSRDRKN 420

Query: 421 VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNS 480
           VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSS+IKVQDEQTGTVHLVGSH ISSA RLN+
Sbjct: 421 VTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAYRLNT 480

Query: 481 QWAVILLFLILLMHWSAHFRCFS 499
           +WAVILLFLIL+MHWSAHFRC S
Sbjct: 481 KWAVILLFLILVMHWSAHFRCLS 503

BLAST of MS000634 vs. ExPASy TrEMBL
Match: A0A6J1H6J5 (aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111460583 PE=3 SV=1)

HSP 1 Score: 928.3 bits (2398), Expect = 1.3e-266
Identity = 453/498 (90.96%), Postives = 475/498 (95.38%), Query Frame = 0

Query: 2   ATAGMGTSRPLTLLLLVIFNLLSNTIT-GGGGVSAESGVFSVKYKYAGRERSLSTLKAHD 61
           A A +GTSRP T+LL VI NLLS+TI  GGGGV A++GVFSVKYKYAGR+RSLSTLKAHD
Sbjct: 3   AAAAIGTSRPFTVLLFVIINLLSSTILGGGGGVYADNGVFSVKYKYAGRQRSLSTLKAHD 62

Query: 62  ISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCME 121
           I+RQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPK+YYVQVDTGSDIVWVNCIQC E
Sbjct: 63  INRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPKNYYVQVDTGSDIVWVNCIQCKE 122

Query: 122 CPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSS 181
           CPR+S+LGMELT Y++E+STTGKLVSCDEQFCLEVNGGPLSGCT NMSCPYLQIYGDGSS
Sbjct: 123 CPRRSSLGMELTTYDLEQSTTGKLVSCDEQFCLEVNGGPLSGCTANMSCPYLQIYGDGSS 182

Query: 182 TAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSS 241
           TAG F+KDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGS GEEALDGILGFGKSNSS
Sbjct: 183 TAGIFVKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLGSPGEEALDGILGFGKSNSS 242

Query: 242 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVG 301
           IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKV MTPLVPNQPHYNVNMTGVQVG
Sbjct: 243 IISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVG 302

Query: 302 RVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKC 361
           RVMLNISADVFEAGDRKGTIID GTTLAYLPELIY PLVTMI+SRQ NLEVQTIHGEYKC
Sbjct: 303 RVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKC 362

Query: 362 FQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYEGLWCMGWQNSGMQSRDRKNVTLFG 421
           FQYS SVDDGFPPV FHFENSLLLKVYPHEYLFQ+EGLWC+GWQNSGMQSRDRKNVTLFG
Sbjct: 363 FQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSRDRKNVTLFG 422

Query: 422 DLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACRLNSQWAVI 481
           DLVLSNKLVLYDLENQTIGWTEYNCSS+IKVQDEQTGTVHLVGSH ISSA RLN++WAV+
Sbjct: 423 DLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLVGSHYISSAYRLNTKWAVM 482

Query: 482 LLFLILLMHWSAHFRCFS 499
           LLFLIL+MHWSAH RC S
Sbjct: 483 LLFLILVMHWSAHSRCLS 500

BLAST of MS000634 vs. TAIR 10
Match: AT1G05840.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 702.6 bits (1812), Expect = 2.3e-202
Identity = 335/456 (73.46%), Postives = 388/456 (85.09%), Query Frame = 0

Query: 33  VSAESGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIG 92
           VS   GVF+VKY+Y   + SL+ LK HD  RQL  LAG+D+PLGG+GRPD  GLYYAKIG
Sbjct: 26  VSCNPGVFNVKYRYPRLQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPDIPGLYYAKIG 85

Query: 93  IGTPPKDYYVQVDTGSDIVWVNCIQCMECPRKSTLGMELTPYNIEESTTGKLVSCDEQFC 152
           IGTP K YYVQVDTGSDI+WVNCIQC +CPR+STLG+ELT YNI+ES +GKLVSCD+ FC
Sbjct: 86  IGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDDFC 145

Query: 153 LEVNGGPLSGCTGNMSCPYLQIYGDGSSTAGYFIKDYVQYDRVSGDLETTAANGSIKFGC 212
            +++GGPLSGC  NMSCPYL+IYGDGSSTAGYF+KD VQYD V+GDL+T  ANGS+ FGC
Sbjct: 146 YQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGC 205

Query: 213 GARQSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGH 272
           GARQSGDL SS EEALDGILGFGK+NSS+ISQLASS +VKK+FAHCLDG NGGGIFA+G 
Sbjct: 206 GARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGR 265

Query: 273 VVQPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDIGTTLAYLPE 332
           VVQPKVNMTPLVPNQPHYNVNMT VQVG+  L I AD+F+ GDRKG IID GTTLAYLPE
Sbjct: 266 VVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPE 325

Query: 333 LIYGPLVTMIISRQPNLEVQTIHGEYKCFQYSESVDDGFPPVIFHFENSLLLKVYPHEYL 392
           +IY PLV  I S++P L+V  +  +YKCFQYS  VD+GFP V FHFENS+ L+VYPH+YL
Sbjct: 326 IIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYL 385

Query: 393 FQYEGLWCMGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQ 452
           F +EG+WC+GWQNS MQSRDR+N+TL GDLVLSNKLVLYDLENQ IGWTEYNCSS+IKV+
Sbjct: 386 FPHEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCSSSIKVK 445

Query: 453 DEQTGTVHLVGSHSISSACRLNSQWAVILLFLILLM 489
           DE TGTVHLVGSH ISSA  L++  ++ LLF +LL+
Sbjct: 446 DEGTGTVHLVGSHFISSALPLDT--SMCLLFSLLLL 479

BLAST of MS000634 vs. TAIR 10
Match: AT3G02740.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 496.9 bits (1278), Expect = 1.9e-140
Identity = 234/455 (51.43%), Postives = 327/455 (71.87%), Query Frame = 0

Query: 34  SAESGVFSVKYKYAG-RERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIG 93
           ++E+ VF V+ K+AG R + L  L+AHD+ R  R L+ +DIPLGG  +P+++GLY+AKIG
Sbjct: 31  ASENLVFEVRSKFAGKRVKDLGALRAHDVHRHSRLLSAIDIPLGGDSQPESIGLYFAKIG 90

Query: 94  IGTPPKDYYVQVDTGSDIVWVNCIQCMECPRKSTLGMELTPYNIEESTTGKLVSCDEQFC 153
           +GTP +D++VQVDTGSDI+WVNC  C+ CPRKS L +ELTPY+++ S+T K VSC + FC
Sbjct: 91  LGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL-VELTPYDVDASSTAKSVSCSDNFC 150

Query: 154 LEVNGGPLSGCTGNMSCPYLQIYGDGSSTAGYFIKDYVQYDRVSGDLETTAANGSIKFGC 213
             VN    S C    +C Y+ +YGDGSST GY +KD V  D V+G+ +T + NG+I FGC
Sbjct: 151 SYVN--QRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGC 210

Query: 214 GARQSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGH 273
           G++QSG LG S + A+DGI+GFG+SNSS ISQLAS  KVK+ FAHCLD  NGGGIFA+G 
Sbjct: 211 GSKQSGQLGES-QAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGE 270

Query: 274 VVQPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDIGTTLAYLPE 333
           VV PKV  TP++    HY+VN+  ++VG  +L +S++ F++GD KG IID GTTL YLP+
Sbjct: 271 VVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPD 330

Query: 334 LIYGPLVTMIISRQPNLEVQTIHGEYKCFQYSESVDDGFPPVIFHFENSLLLKVYPHEYL 393
            +Y PL+  I++  P L + T+   + CF Y++ + D FP V F F+ S+ L VYP EYL
Sbjct: 331 AVYNPLLNEILASHPELTLHTVQESFTCFHYTDKL-DRFPTVTFQFDKSVSLAVYPREYL 390

Query: 394 FQY-EGLWCMGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSNIKV 453
           FQ  E  WC GWQN G+Q++   ++T+ GD+ LSNKLV+YD+ENQ IGWT +NCS  I+V
Sbjct: 391 FQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCSGGIQV 450

Query: 454 QDEQTGTVHLVGSHSISSACRLNSQWAVILLFLIL 487
           +DE++G ++ VG+H++S +  L     + L+ L++
Sbjct: 451 KDEESGAIYTVGAHNLSWSSSLAITKLLTLVSLLI 480

BLAST of MS000634 vs. TAIR 10
Match: AT1G65240.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 439.9 bits (1130), Expect = 2.8e-123
Identity = 207/436 (47.48%), Postives = 305/436 (69.95%), Query Frame = 0

Query: 39  VFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAKIGIGTPPK 98
           VF  ++K+AG++++L   K+HD  R  R LA +D+PLGG  R D+VGLY+ KI +G+PPK
Sbjct: 26  VFKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPK 85

Query: 99  DYYVQVDTGSDIVWVNCIQCMECPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGG 158
           +Y+VQVDTGSDI+W+NC  C +CP K+ L   L+ +++  S+T K V CD+ FC  ++  
Sbjct: 86  EYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQS 145

Query: 159 PLSGCTGNMSCPYLQIYGDGSSTAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSG 218
               C   + C Y  +Y D S++ G FI+D +  ++V+GDL+T      + FGCG+ QSG
Sbjct: 146 --DSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSG 205

Query: 219 DLGSSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKV 278
            LG +G+ A+DG++GFG+SN+S++SQLA++   K++F+HCLD + GGGIFA+G V  PKV
Sbjct: 206 QLG-NGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKV 265

Query: 279 NMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPL 338
             TP+VPNQ HYNV + G+ V    L++   +   G   GTI+D GTTLAY P+++Y  L
Sbjct: 266 KTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNG---GTIVDSGTTLAYFPKVLYDSL 325

Query: 339 VTMIISRQPNLEVQTIHGEYKCFQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQY-EG 398
           +  I++RQP +++  +   ++CF +S +VD+ FPPV F FE+S+ L VYPH+YLF   E 
Sbjct: 326 IETILARQP-VKLHIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEE 385

Query: 399 LWCMGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSNIKVQDEQTG 458
           L+C GWQ  G+ + +R  V L GDLVLSNKLV+YDL+N+ IGW ++NCSS+IK++D  +G
Sbjct: 386 LYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSSIKIKD-GSG 445

Query: 459 TVHLVGSHSISSACRL 474
            V+ VG+ ++SSA RL
Sbjct: 446 GVYSVGADNLSSAPRL 453

BLAST of MS000634 vs. TAIR 10
Match: AT5G36260.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 439.5 bits (1129), Expect = 3.6e-123
Identity = 219/481 (45.53%), Postives = 317/481 (65.90%), Query Frame = 0

Query: 12  LTLLLLVIFNLLSNTITGGGGVSAESGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGV 71
           ++ ++ V+F L+   ++G       + VF+V +K+AG+E+ LS LK+HD  R  R LA +
Sbjct: 10  ISRIVAVVFVLVIQVVSG-------NFVFNVTHKFAGKEKQLSELKSHDSFRHARMLANI 69

Query: 72  DIPLGGSGRPDAVGLYYAKIGIGTPPKDYYVQVDTGSDIVWVNCIQCMECPRKSTLGMEL 131
           D+PLGG  R D++GLY+ KI +G+PPK+YYVQVDTGSDI+WVNC  C +CP K+ LG+ L
Sbjct: 70  DLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPL 129

Query: 132 TPYNIEESTTGKLVSCDEQFCLEVNGGPLSGCTGNMSCPYLQIYGDGSSTAGYFIKDYVQ 191
           + Y+ + S+T K V C++ FC  +       C     C Y  +YGDGS++ G FIKD + 
Sbjct: 130 SLYDSKTSSTSKNVGCEDDFCSFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFIKDNIT 189

Query: 192 YDRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASSRKV 251
            ++V+G+L T      + FGCG  QSG LG + + A+DGI+GFG+SN+SIISQLA+    
Sbjct: 190 LEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQT-DSAVDGIMGFGQSNTSIISQLAAGGST 249

Query: 252 KKMFAHCLDGINGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF 311
           K++F+HCLD +NGGGIFA+G V  P V  TP+VPNQ HYNV + G+ V    +++   + 
Sbjct: 250 KRIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLA 309

Query: 312 EAGDRKGTIIDIGTTLAYLPELIYGPLVTMIISRQPNLEVQTIHGEYKCFQYSESVDDGF 371
                 GTIID GTTLAYLP+ +Y  L+  I ++Q  +++  +   + CF ++ + D  F
Sbjct: 310 STNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQ-QVKLHMVQETFACFSFTSNTDKAF 369

Query: 372 PPVIFHFENSLLLKVYPHEYLFQY-EGLWCMGWQNSGMQSRDRKNVTLFGDLVLSNKLVL 431
           P V  HFE+SL L VYPH+YLF   E ++C GWQ+ GM ++D  +V L GDLVLSNKLV+
Sbjct: 370 PVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVV 429

Query: 432 YDLENQTIGWTEYNCSSNIKVQDEQTGTVHLVGSHSISSACR--LNSQWAVILLFLILLM 490
           YDLEN+ IGW ++NCSS+IKV+D  +G  + +G+ ++ SA    +N     +L  LI + 
Sbjct: 430 YDLENEVIGWADHNCSSSIKVKD-GSGAAYQLGAENLISAASSVMNGTLVTLLSILIWVF 478

BLAST of MS000634 vs. TAIR 10
Match: AT5G22850.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 326.2 bits (835), Expect = 4.4e-89
Identity = 171/416 (41.11%), Postives = 239/416 (57.45%), Query Frame = 0

Query: 47  AGRERSLSTLKAHDISRQLRFLAG----VDIPLGGSGRPDAVGLYYAKIGIGTPPKDYYV 106
           A  E  LS LKA D +R  R L      +D P+ G+  P  VGLYY K+ +GTPP+D+YV
Sbjct: 37  ANHEMELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYV 96

Query: 107 QVDTGSDIVWVNCIQCMECPRKSTLGMELTPYNIEESTTGKLVSCDEQFCLEVNGGPLSG 166
           QVDTGSD++WV+C  C  CP+ S L ++L  ++   S T   +SC +Q C        SG
Sbjct: 97  QVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSG 156

Query: 167 CT-GNMSCPYLQIYGDGSSTAGYFIKDYVQYDRVSGDLETTAANGSIKFGCGARQSGDLG 226
           C+  N  C Y   YGDGS T+G+++ D +Q+D + G      +   + FGC   Q+GDL 
Sbjct: 157 CSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLV 216

Query: 227 SSGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGIN-GGGIFAMGHVVQPKVNM 286
            S + A+DGI GFG+   S+ISQLAS     ++F+HCL G N GGGI  +G +V+P +  
Sbjct: 217 KS-DRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNMVF 276

Query: 287 TPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDIGTTLAYLPELIYGPLVT 346
           TPLVP+QPHYNVN+  + V    L I+  VF   + +GTIID GTTLAYL E  Y P V 
Sbjct: 277 TPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVE 336

Query: 347 MIISRQPNLEVQTIHGEYKCFQYSESVDDGFPPVIFHFENSLLLKVYPHEYLFQYE---- 406
            I +         +    +C+  + SV D FPPV  +F     + + P +YL Q      
Sbjct: 337 AITNAVSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGG 396

Query: 407 -GLWCMGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSNIKV 452
             +WC+G+Q         + +T+ GDLVL +K+ +YDL  Q IGW  Y+CS+++ V
Sbjct: 397 TAVWCIGFQRI-----QNQGITILGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNV 446

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022150427.11.5e-293100.00aspartic proteinase-like protein 2 [Momordica charantia][more]
XP_038886489.15.9e-27794.18aspartic proteinase 39 [Benincasa hispida] >XP_038886490.1 aspartic proteinase 3... [more]
XP_016900131.16.3e-27192.37PREDICTED: aspartic proteinase-like protein 2 isoform X1 [Cucumis melo][more]
XP_004140876.18.2e-27191.77aspartic proteinase-like protein 2 [Cucumis sativus] >KGN45989.1 hypothetical pr... [more]
XP_023005003.13.8e-26890.85aspartic proteinase-like protein 2 [Cucurbita maxima] >XP_023005004.1 aspartic p... [more]
Match NameE-valueIdentityDescription
Q9S9K43.9e-12247.48Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2[more]
Q4V3D25.1e-12245.53Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1[more]
Q766C28.3e-3226.06Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q3EBM51.6e-2727.05Probable aspartic protease At2g35615 OS=Arabidopsis thaliana OX=3702 GN=At2g3561... [more]
Q9LTW48.0e-2727.16Aspartic proteinase NANA, chloroplast OS=Arabidopsis thaliana OX=3702 GN=NANA PE... [more]
Match NameE-valueIdentityDescription
A0A6J1DAP87.5e-294100.00aspartic proteinase-like protein 2 OS=Momordica charantia OX=3673 GN=LOC11101858... [more]
A0A1S4DVW73.1e-27192.37aspartic proteinase-like protein 2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC1034... [more]
A0A0A0KAX94.0e-27191.77Peptidase A1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G04174... [more]
A0A6J1KXX51.9e-26890.85aspartic proteinase-like protein 2 OS=Cucurbita maxima OX=3661 GN=LOC111498125 P... [more]
A0A6J1H6J51.3e-26690.96aspartic proteinase-like protein 2 OS=Cucurbita moschata OX=3662 GN=LOC111460583... [more]
Match NameE-valueIdentityDescription
AT1G05840.12.3e-20273.46Eukaryotic aspartyl protease family protein [more]
AT3G02740.11.9e-14051.43Eukaryotic aspartyl protease family protein [more]
AT1G65240.12.8e-12347.48Eukaryotic aspartyl protease family protein [more]
AT5G36260.13.6e-12345.53Eukaryotic aspartyl protease family protein [more]
AT5G22850.14.4e-8941.11Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001461Aspartic peptidase A1 familyPRINTSPR00792PEPSINcoord: 417..432
score: 25.0
coord: 93..113
score: 48.74
coord: 319..330
score: 40.66
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 13..463
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 289..441
e-value: 2.7E-16
score: 59.7
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 272..451
e-value: 2.7E-40
score: 140.0
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 78..271
e-value: 1.7E-46
score: 160.7
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 82..449
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 87..271
e-value: 2.4E-36
score: 125.6
NoneNo IPR availablePANTHERPTHR13683:SF685EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEINcoord: 13..463
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 87..441
score: 37.551876
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 87..445
e-value: 8.54002E-57
score: 188.241

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MS000634.1MS000634.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0004190 aspartic-type endopeptidase activity