CcUC02G017150 (gene) Watermelon (PI 537277) v1

Overview
NameCcUC02G017150
Typegene
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionaspartic proteinase Asp1-like
LocationCicolChr02: 74907 .. 85000 (+)
RNA-Seq ExpressionCcUC02G017150
SyntenyCcUC02G017150
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAAAAGTTGAAGGAGACCGAGGCAGAATACATCAAATGCTTTTTTTAAGTGTATGTGTTTTAGACAATGGAAGACTTTAAAAAATCTTCATTTAATGGTTGCAGGTAATTATTTCTTTTGATGTATATTGTTTTTTCTTCGATACACTTAATCTGTATGGGCTGGTTTTCATTATTTTGAAGTTTAACGGTTTTAATTTTTTACATTTTTTTCTCAATACTAAATGTCTAGGAGGTTGGAAATAGAATATATACTCGCATATAGCTACTTAGTTAATAGGTAAATATAATATTCCATGGAGGTAATTTTGTCTTACGAAAACTAAGAAATTCTAGATTTAAGTATTTTTACAACGACTATTTATTTTAAGGATTTTTGAAATGCAACCATTTAAAGGTAGCTATTTTTGTGATCGACCCTAAAGCTGAAGGATACACAAACCCAGCCCATCTCGCGCCCCTCTCGCGTCCGAGCCCAGCCCGCCCCCTCCCCTTAACTCTCTGTCATCTCACTCCACCGGAAACCTCCCGAACTGCTTCTCAGAATGGCGACCACAGCTTGTTTCATCATCGTCAGTAGGAACAATATCCCTATCTATGAAGCTGAAGTCGGATCTGCTGTTAAAGTATTTTCTTCTTCTTCATTTTCCATTTCTTTTCATGTTTAGGCTGAATATTTCCTTCACAAGATCGCCAATAATAGGTGATCTCGTTTTGATTCTTTAATTACTTTAGTTTTGACTGTCATTAGTCGCTCATCTTTTTGGCTTGACATGAAAGAGAGAGGATTCCGCTCAGCTGCATCAGTTTATATTGCATGCGTCCCTCGACATTGTTCAAGACCTGGCATGGACAACTAGTGCTATGTGAGTTTGTATTCCTTTTAGTTTTTTACTTAATCATCGAGACATCTCCTTTCTCTTCTTTTTCTTTTTTGTTTCTTCCTTTGAATTTCGCCTTAGATCCCTTTTCTTGTTGCGGTTTATGCAGGTTCTTGAAAGCAGTCGATAGGTTCAATGACTTGGTCGTGTCTGTATATGTAACCGCCGGTCATATCCTTTAATGTCGAGGGGATTTGTCAAGCACACTGTGCTGGAAATTGAGTCAAATTTCATTAATTCACTCACTCGTAAAACTCAACTCCAAATTAATAAAAGTTTTATCTTTGAATCTTTTGCATCTATACAACACAACCTCTCCCCCCCCCCCCCCCCCCCCCCCCGCCCGTTTCAGTTCATAGTTCCTGTTGTTTCCTTTTGTACTATTATTCGGATTTTCCTTTGACTATTCACTACATACGCGATTGATGTTACTTCACGACTCTCGCAACGATGATGGAATCAAGAGCTTTTTTCAAGAGGTTCATGAGCTTTACATAAAGGTAAGTTGGACATGTTTCTTATGTATTATTACTTTATTGCCAATTTTTGAAGGTGTCGTGCTTATTGTTTTAGTTCAAGAACATGTGGGGTTGGGGAGTGTGGTGCTCGATTTACTATGTTATGTTATTGGTTGATCGAAGTATACCTTAGTTGATTAGGTTACTTTTCCAGAGCTGGGAGATTTACTTGATTCGGTCAGTTTGGAGTGGTATTTCCATTTACTTCTTTTCTTTGTTGGGACCCCTAATAAGGTTGAGGACATTGAAAAACTTATGTGTAACTTCCTTTGGGAAGAGGTGGAAGAAGGAATGGATTGCACCGGGTTAGATGGGAGATCAGTGGATCAGGTCGATTAAAAATTGGTAACTTAAGGACACAAAACAAAACCCTATTACCTAAATGGCTTTGTCATTTTGCTCTTGATTCTAATACCCTTTGGCACAAGGATATTGCTAGCAAGTATGGGCCTCATCCCTTTGAATGATTGTCGGCTGGGGTTAAAGGCACCAACTAGAACTTGTGGAAGGATCTAACGAGTGCCCTTCTTTCTCCCATTTTGTTCGTTGTGTGTGTTGGGGGGGGGGGGGGGGGGGTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGGGGGGGGGATAGACCTCTGCTTCCGTTTCCCCCATATTTATCATTTATCCTCCTTTAAAAATCATTTTGTGTCTGACATTTTTATTTGGTTGGGGAGCTTGGTTTCCTTTTCCTTCAGGTTTAGTTGCTTTTTGATTGATAGGGAAGCGACAAATTTCATCTCTTTGTTGTCGTTGTTGGGGGAGTTTAAGGATTCGCATGTTTGGAATCTTGGCCCTCTTGAGGGATTCTACTGACTCTTGTAATTCCTTTTTCCGTTGTCGGCTGGATACTTCTCCTTCCAATGAGGGTTTTTCCCCCTCCTCTAGTTAGAGGATTAAGATCCCCAAGAAGGAGAGATTCTTTACCTAGCCAGTTCTACATGGACGTGTGAACACCTTGTATTGGCTCTCCAGGAGGAAGAGGATGTTGTCGTTGGGGGGCCCTTTCCTGTATTATGTCAGAAGGTGGAGGAAGACCTAAATCACATCCTTTGGAGATGCAAGTTTCGCACTCTGTTTGGAATTTTTTCTTCCAGAAATTTTGTTTTGTGGTTGCCTGTCAAAGGGACATTAGTGTTATGATCGGAGAGTCCCTCCCCATCCTCCTTTTTGTGATAAAGGTCATATGCCCTTGTATCTTTTAATATTTTCTCATTGAAAGTGCTTGCATTATCTTAATACAAAAGTTCTTGCTCAATGGAGTTATATATTTTTACTCGATAACACTTTGACCTTTGAAACAGTACAGTTAAATAAATAAATTGTATCACACTAATGTGAGCATAGATAAAAGGCACTAGTCACCATTTCAAAGGTCGATGGTTTGATCTCTTACCCCACAATTGATGAACTAAAAAGAAATCGCATTCCTTGAGTCTTGATTGTATACATTCATAAAGAAACATAACTGGTCGATTTGTTATGAGATGAGGTTGTCATCGCGAGTTTGCGTGAAGCATGAGATGGGGTGGTGAGGATGGTTTATCTTTTTGAGATCCCTTGTTGTTTAGTCATGTCAGGATGCTCAGATTTAAGGCTCCAATAGATCCCTTCTAAGTGTTATTATTGTTTCTGTAGCACTGGATTGTAACCTATATCTTCTACGAATATTAATTGGTTTTTGGAAATAAAATTGATAGAGTGACTTGAAATCATGTGGTAGAGTTACAAAATATTGTTATTAATTATCTAGTGCAATTGTAACAAGATTTAATCGTAATAATAATTTAACTGCTGAATGAGCTATTTGCAGAAAAACTATGTTTGGGACATACCAAGATTTAATTCCCTCTGAAGTGTCCTTTAGCGTTATGCTTTAACTTGATTGCACCACAGAAAAAATGGCCTAGAATAAGTGATAAATATTTCATGTTAACTTGTCTGTCATCTTCTCCTGATTAATGATTTTAATGTTGTTGTTTTTCCCCCTTCTATTGTGGACTTTTTTTCTTTCATTAGCCAATAAGACCTTTTCCCCTCCCTAAGAAACCAATAAGAGTTTCTGTTTGAATTGAGGAAAAGTGGTGTCAAGAGGGGAGTGTAATTGTTGTTCTGCTTATTTGTAACTGCTATCATAATAGTACGTTCCTGAGTGCTGCCTCCCATCCCCAGCACTACTTACGTTCTATTGGCAGAATTTCCCAATCAATGTATAGGCTATTCCTTTAGTCGGCTATTCTGCCAATTTCCACTGTATATAATTTTTCATAGCAAAGTAACTCCTTGCAGGTGATTTTTCAGCTTTCTTGATTCTCATCTATCGGTAGAATTCTAAGTTAATTAATGAGACAATGAAACGCTGGGTAACCATTTGTTAGTTGGTGTGGTGGGCTTTGGTCTTTTTGGTGTTCCCCATGTTTTCAATGAAGGGCTGTGGCATGAACCTTGTTGTCTTCCAGATATTTTTTCTCTCTATCGTTGCCATGAGTTTTCATTTGATGACAATTGTCAGCTTGATAATGCATGGACATGACTGAAATTGGCTTTGTGAATTTGCAGACTATACTCAATCCCCTCTACTTGCCTGGATCCCGCATCACGTCTTCACATTTTGACACAAAAGTCCGTGCACTCGCAAGGAAGTATCTCTAGCGATCACCTTCATCGCAGACCAAATGTGCTTTATGACACAATAGGCTTCAAAGTTTATAGCAGCCTCAAGTTTTCTTTAATTGCTTAACATTTTAATATGTGCTTTTTGTGGGCCTGATATCAATCAATTCGGTAGTTAAATATCTCTCAACGTGTATTATTGACTTTGGTATGGCTTGGTTACTAGTTGGTATTGCACTTTACACAGTAATGCATTCTCCGCTCCCTTTTATCATCTTTTATCGTAGGGAGGATTATCATTATTTGTTTTTATATTCTAGAAGGCAGTCAATATTAAATTTATGCAAAATATGCATTTTTGAAAAGCTCAAGAGACAGCCCAGTCCTTTTTAACTTTTTTGGCCGTTGGTCCTTTCAATTACCTTTACTTGAAATGAATGATTAGTATGGCGCCAAGGAATCTAAAAGATCACTTCAGCAAATTGCCATCTTTCATCCAAGGCGTGAGGGAAAGAATAAGAATACCTCTTCTGAAGTCTTAACCTTAGTGGAGGAAAAGAATAACAAATGCACCAGAACGTTGGAAAAGAAAGAGTGATCCTGGGCAGGGAGGGAAAGGGAATAACTTATAAGAAAACCAAAATAAGGTTTTGTAGTAAACCCTCGGAAAAGAAAGGTGATTGGGTGATGTGAATCCCTGGTCCCCAAACAAAACTATGGCCTCCTAAACTCCAAATCTCCACTTAGAACAACAAATCAATCATTGCCATTTTACCGTACACTTGCATTTTTTTAAAAAATCTTTTTTGGTTCCATTCCCAAAATACCCAGACGACAGAGCAAGGGTGAAAGACAAAAGGAACTGATTTTATAAGCCTCAGTTGTCAAATCATGGGGAAAAGGGTATTGATGATACTGGTTCTGATGGTGGCCTCCATGAGCTGTTTGGCTCTCTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGGAGGCCAATTCTGTCGGTTCCGGCCGCATCTTCTTCGTTTGCTTCATCCTCCATCGTGTTGCCTCTTCAAGGGAACGTCTTCCCAAATGGGTAAGCCATAACTCTGCTCAAAGATACGACGAAGGTTTAGCTTTTGTTTTTCCCATTTCGATTTTACTCTACTGTTACAGAGTATGAAGTGGCTGAAGAGGAAGTTCAATGTGAATTTGATGACTCCTGATATACTTTTCACATTGATAGTATTGTAATAACCGCAGTAGGCCTTCCCCATTCTTAATCTTATTGTATCTTCTCTCACGTGTTAAATTGGAGAAAAGATGATTACGAAACTCTGTTCTCTCATGTTTGCTTCTCTCGCGTGTTCTATTCCTGTGCAATAGGTTTGGAGGAGTGTATTTATGACTGCTAGCAAAATAACACATCCCCAAATTAAGTACTATTAAAATCTTCATACCCATTTTACTGAAGTTTTTAATTAGTTGCAAACAGTTTGAAAATAAAATATATATTTCTTTGTTTTGTTCGAGGCTAACTTATTTGCTTCAGATCCTAAATACTGATTTTCATCAATATGTTTTGAGGTCATTTAATCCGTGTCTTTTTTCAGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCAGACACTGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAACAGTGCACTGAGGTAAATTTTTATTTGATGCTAATTTGGAGATTCTTGTAAATATGATGACCCAGATGATAGAAGATACTTACTAAACTACAGCTTTTCCTTCAGTTAAGTTAGTTTTGATCATGAGTAATTCAAAATTTCATTTTTGAATAGTCTATGTTCTATGTTCCTCCTCTATTTCTTCTTGATGTGCTTAGTGCTTCTTTTTTCCCTCGGTCTCTCACTACATTAAATAAACGTTGCATTTCTTTTGGGATTGAGAGTTAAGGTTGATTGCTTCAAATTGATGGCCTTGTTTCCTTTTCAGACACTTCATCCTCTTTATCAACCAAGCAACGATCTTGTGCCGTGTAAGGACCCTCTGTGTATGTCCTTGCACTCATCTATGGACCACAGATGCGAGAACCCAGATCAATGTGACTACGAGGTTGAGTATGCAGATGGTGGTTCGTCTCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCCCGTTTGGCCCTTGGGTAAACTGCTCGACTGCCTTCAGTAATTCATCTTAATTCAGTGTACTGCTGAGTTCCTAACTCTCTAAATCAACCTGGCTACTTCCGATTACAACCACACATGAAAGTTTTGAGAATGAAACATCATATCATGCTGTCATATCTCATGTTTTGCACTTTCCTGATATTGCAAAGTAATGATTAGCATGAATGTAAAGATAGAGTTTATGTTGTAAAACAAACATGTTCATTCATGCTGACGGGTTACTTCAAGAAAACAATTAAGATAGCAACGGGCACTAGATTTTGTGGACATAATGATTCGGATTTGTATTTTTCATAGACATCAATAAAGTTTATCTTATACAAGAATTTCACGCATTTCTAGTCCTTACAAACCGTTGATGTCAGCTCATGTGGTCAAATAGCATGCATGTAAAGTCAGTATGTAAGCTAGCTATTTGAATCTATAGTACTGAGTTTGGTTGGTTCTGTTTGATTACTCAGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATACTTGGCCTTGGAAGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATGTCATTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGGCATTTATGATCCCTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGTAAACAACCTGCCTTGATCATATGTATATATATATATCTCTTGTTATGATTCATGTTAGCATCTGTTTTAATGAAACCTACCTCATGTGATTGTATCAGGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGATCTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGGTAAATCATCTTTAAAAATAACTGTTGACTATTCTGGGAAAAATAGATCTTTCTGGCTCTTGAAAACTGATGGATAATACACAATGATTATTCAATTCATGTTGGCACTGTTCTTGTAGTTGAATAGAGAACTAGCTGGAAAACCGCTAAGAGAAGCCATGGACGATGATACACTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAAGCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGGTAAAAGCTCCGTGCCTTAAACTCGAACTGCTATTCACTAAACATTTTCTCTTTGGATTCTCAAGTTGAATAATTTCACTTATAAGCTATAATTTTGGAATCGCAGTCTATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAAATTCGAATATCATTGGTGGTACGTTAGTGCATGCATTGCATGTTAATATTTTCTCGCAATTGCAAATACCAATGCGTTTAAAGCCATGCAAAATTATTCTTTTATTAGAGAGGAAAAAAGCCTCATTTCTGTATGATTTTGTTGACAGATATATCAATGCAAGATAAGATGGTAGTATACAACAACGAGAAGCAAGCAATTGGCTGGGCTACGGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAAATGACATATATGTGAAATACATATGCGATCAGAAAACAGGTTTAGTTTGAGGGAAGTCCTTGCATATGGCAATAGGGTAGAAGAAACTCGTAACAACAACGCCATTTAACATGTAACTAATGTATTCGTAAAGCAAGCAACCGAACAAGTTGCTGTATTGTAACATGATGAAATACAGTTTGATGTACCAAATGAATTAATAGAATTAGAACTGACATTTTTTTATTAACTCTGCTTTTGGAATTACAAAGATGAGATGAGTTAGATTTCACATCCATGACTGGTGCAGCATTATTCGGTTTGTTGATTGCAGGCAGCCTTGGGAATATACAATGCTTAAATGACAGTGGGTCATGGTGTGGTGTAGACATGGGTTGGCATTTGTAATGTTGTGTTGCTAACTACACACTACCATCGCTCCTTTTGGTTTTGGGTTGAGGAGGATGAATTGAAGATCATCAACTTTGGAAGCGGAGCTCAGTTCTCTTGTAATCAACTACATTCTTCATTACAACAAAAGAAGGGAGTTCCTCATAATACTTTCTGGGCATATTGGTTTCGGTTTATCAAAGAATCTGACTGTAAAAATATGAATATTATGTCTATATATAGTAGAGAGAGGAAGCCTGGCAATGGGGTTGAATGCATCCCCTAAATCCATGCCTTTCCGGACGCCGCCACGGTGGTCAAATCTGCCTCTGTAGATACGAAAACATGTCCTGCAATGGCGACAGATGGTCCATCTGGACCTGAACCTGGCCTCCAAATGGATCGTCCTCTGGCATGGAGAAAGAATCCATGTAATTGAACTCAAAGAAGTCTAAATCTGCCGCTTCTCCCCATTTGGGTTGACTTTGGACCTCCTTATCCCAGGTGAGCTCCGGCGACGTCACTGGATCGGAGCCACTGGAGGAGTCGGTGTGCATTCTGGGAATGGAATCGGACGTGTCCATTTGCAAGTGATTCTGGATTGGGCGCTGTACCATGTTGTTGTTCGTAATATCGGGTTTCTCATCCTCGAAATCAGGGAATTCGGCTGCCTTGTCGTCGGTGGATTGGTAATGCTTCTCTATGCATCCCTTCTTGTTGTAGATACGACATAGCACCCAGTCGTCCAACTGTAAATTGATTAAGCAAAGTAACCCATCAGCATAATAATCCAATTGAAATTGATTGAAGATATGTTGTTGTTGTTGTCACTTACTCTCAAGTTGTTGGTTTTGTTGGCAGCGGATCTGTCGACGTTGGCGAGTCGATACTCGTGCATAATCCAATTGGTCTTGACGCCCCTTGGGGCCTTTCCGGCGTAGAAGACGAGTGCCTTCTTGATACCAAGAGTTTTGGGGCGGCCGATGGGCTTGTCGGCGCCGGTGGCCTTCCAATAGCCGGTACCGGCGGCCCGGTTGGGGCGGGAGCCGTTGGGGTACTTACGGTCCCTGGGGGAGAAGAAATACCACTCCTTTTCACCACAGACGGCCAATTCTGGAGAAGGAAAAAGAAGGGAAGTGAGAATATGAATGAATGAATGAAAATGATAGAGAAGAGAATAGGGAAAGTAAAATAGAATACCAGGGAGGTGCCAGGGGTCGTATTTGTAAAGGTCAATCTCCTTGATAATGGGGACGGCGATGGGCTGAGAGGAGCACTTTCTGCAGAGGTAATGAAGGACTAGCTCCTCATCAGTGGGGTGAAATCTGAAGCCTGGAGGTAACTCGATCCCAGCTACGGTCATTTGCTAGCGATTTGGCTGATTCTTTCACACTTGGCCTAAAGCCTCTCTCTCTCTCTCTCTCTCTCTCTCAGTTGATTTGCGGCGGCGGGGAATCTGATCTGATCTCCGATGAGCGGCGGCTGAGAATTGGGAATTTTGTGAACCGGAGAATTGTGGGATAGATATATATAGAAGAAGGAACACGTGGGGGCTCATGGGGTGGAGGCAGTATCCACGCTCCCCCATCTTCTAGAAGTTCATCTCATTTTATATATTTATTAATTATTACTGGAACGCCATTGGATGGGACCCCATACTAGAAAGAAAAAGATTAAAATGTC

mRNA sequence

GAAAAAGTTGAAGGAGACCGAGGCAGAATACATCAAATGCTTTTTTTAAGTGTATGTGTTTTAGACAATGGAAGACTTTAAAAAATCTTCATTTAATGGTTGCAGCTGAAGGATACACAAACCCAGCCCATCTCGCGCCCCTCTCGCGTCCGAGCCCAGCCCGCCCCCTCCCCTTAACTCTCTGTCATCTCACTCCACCGGAAACCTCCCGAACTGCTTCTCAGAATGGCGACCACAGCTTGTTTCATCATCGTCAGTAGGAACAATATCCCTATCTATGAAGCTGAAGTCGGATCTGCTGTTAAAAGAGAGGATTCCGCTCAGCTGCATCAGTTTATATTGCATGCGTCCCTCGACATTGTTCAAGACCTGGCATGGACAACTAGTGCTATGTTCTTGAAAGCAGTCGATAGGTTCAATGACTTGGTCGTGTCTGTATATGTAACCGCCGGTCATATCCTTTAATGTCGAGGGGATTTGTCAAGCACACTGTGCTGGAAATTGAGTCAAATTTCATTAATTCACTCACTCGTAAAACTCAACTCCAAATTAATAAAAGTTTTATCTTTGAATCTTTTGCATCTATACAACACAACCTCTCCCCCCCCCCCCCCCCCCCCCCCCGCCCGTTTCAGTTCATAGTTCCTGTTGTTTCCTTTTGTACTATTATTCGGATTTTCCTTTGACTATTCACTACATACGCGATTGATGTTACTTCACGACTCTCGCAACGATGATGGAATCAAGAGCTTTTTTCAAGAGGTTCATGAGCTTTACATAAAGACTATACTCAATCCCCTCTACTTGCCTGGATCCCGCATCACGTCTTCACATTTTGACACAAAAGTCCGTGCACTCGCAAGGAAGTATCTCTAGCGATCACCTTCATCGCAGACCAAATGTGCTTTATGACACAATAGGCTTCAAAGTTTATAGCAGCCTCAAGTTTTCTTTAATTGCTTAACATTTTAATATGTGCTTTTTGTGGGCCTGATATCAATCAATTCGGTAGTTAAATATCTCTCAACGTGTATTATTGACTTTGGTATGGCTTGGTTACTAGTTGGTATTGCACTTTACACAGTAATGCATTCTCCGCTCCCTTTTATCATCTTTTATCGTAGGGAGGATTATCATTATTTGTTTTTATATTCTAGAAGGCAGTCAATATTAAATTTATGCAAAATATGCATTTTTGAAAAGCTCAAGAGACAGCCCAGTCCTTTTTAACTTTTTTGGCCGTTGGTCCTTTCAATTACCTTTACTTGAAATGAATGATTAGTATGGCGCCAAGGAATCTAAAAGATCACTTCAGCAAATTGCCATCTTTCATCCAAGGCGTGAGGGAAAGAATAAGAATACCTCTTCTGAAGTCTTAACCTTAGTGGAGGAAAAGAATAACAAATGCACCAGAACGTTGGAAAAGAAAGAGTGATCCTGGGCAGGGAGGGAAAGGGAATAACTTATAAGAAAACCAAAATAAGGTTTTGTAGTAAACCCTCGGAAAAGAAAGGTGATTGGGTGATGTGAATCCCTGGTCCCCAAACAAAACTATGGCCTCCTAAACTCCAAATCTCCACTTAGAACAACAAATCAATCATTGCCATTTTACCGTACACTTGCATTTTTTTAAAAAATCTTTTTTGGTTCCATTCCCAAAATACCCAGACGACAGAGCAAGGGTGAAAGACAAAAGGAACTGATTTTATAAGCCTCAGTTGTCAAATCATGGGGAAAAGGGTATTGATGATACTGGTTCTGATGGTGGCCTCCATGAGCTGTTTGGCTCTCTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGGAGGCCAATTCTGTCGGTTCCGGCCGCATCTTCTTCGTTTGCTTCATCCTCCATCGTGTTGCCTCTTCAAGGGAACGTCTTCCCAAATGGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCAGACACTGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAACAGTGCACTGAGACACTTCATCCTCTTTATCAACCAAGCAACGATCTTGTGCCGTGTAAGGACCCTCTGTGTATGTCCTTGCACTCATCTATGGACCACAGATGCGAGAACCCAGATCAATGTGACTACGAGGTTGAGTATGCAGATGGTGGTTCGTCTCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCCCGTTTGGCCCTTGGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATACTTGGCCTTGGAAGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATGTCATTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGGCATTTATGATCCCTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGATCTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGTTGAATAGAGAACTAGCTGGAAAACCGCTAAGAGAAGCCATGGACGATGATACACTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAAGCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGTCTATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAAATTCGAATATCATTGGTGATATATCAATGCAAGATAAGATGGTAGTATACAACAACGAGAAGCAAGCAATTGGCTGGGCTACGGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAAATGACATATATGTGAAATACATATGCGATCAGAAAACAGGTTTAGTTTGAGGGAAGTCCTTGCATATGGCAATAGGGTAGAAGAAACTCGTAACAACAACGCCATTTAACATGTAACTAATGTATTCGTAAAGCAAGCAACCGAACAAGTTGCTGTATTGTAACATGATGAAATACAGTTTGATGTACCAAATGAATTAATAGAATTAGAACTGACATTTTTTTATTAACTCTGCTTTTGGAATTACAAAGATGAGATGAGTTAGATTTCACATCCATGACTGGTGCAGCATTATTCGGTTTGTTGATTGCAGGCAGCCTTGGGAATATACAATGCTTAAATGACAGTGGGTCATGGTGTGGTGTAGACATGGGTTGGCATTTGTAATGTTGTGTTGCTAACTACACACTACCATCGCTCCTTTTGGTTTTGGGTTGAGGAGGATGAATTGAAGATCATCAACTTTGGAAGCGGAGCTCAGTTCTCTTGTAATCAACTACATTCTTCATTACAACAAAAGAAGGGAGTTCCTCATAATACTTTCTGGGCATATTGGTTTCGGTTTATCAAAGAATCTGACTGTAAAAATATGAATATTATGTCTATATATAGTAGAGAGAGGAAGCCTGGCAATGGGGTTGAATGCATCCCCTAAATCCATGCCTTTCCGGACGCCGCCACGGTGGTCAAATCTGCCTCTGTAGATACGAAAACATGTCCTGCAATGGCGACAGATGGTCCATCTGGACCTGAACCTGGCCTCCAAATGGATCGTCCTCTGGCATGGAGAAAGAATCCATGTAATTGAACTCAAAGAAGTCTAAATCTGCCGCTTCTCCCCATTTGGGTTGACTTTGGACCTCCTTATCCCAGGTGAGCTCCGGCGACGTCACTGGATCGGAGCCACTGGAGGAGTCGGTGTGCATTCTGGGAATGGAATCGGACGTGTCCATTTGCAAGTGATTCTGGATTGGGCGCTGTACCATGTTGTTGTTCGTAATATCGGGTTTCTCATCCTCGAAATCAGGGAATTCGGCTGCCTTGTCGTCGGTGGATTGGTAATGCTTCTCTATGCATCCCTTCTTGTTGTAGATACGACATAGCACCCAGTCGTCCAATCTCAAGTTGTTGGTTTTGTTGGCAGCGGATCTGTCGACGTTGGCGAGTCGATACTCGTGCATAATCCAATTGGTCTTGACGCCCCTTGGGGCCTTTCCGGCGTAGAAGACGAGTGCCTTCTTGATACCAAGAGTTTTGGGGCGGCCGATGGGCTTGTCGGCGCCGGTGGCCTTCCAATAGCCGGTACCGGCGGCCCGGTTGGGGCGGGAGCCGTTGGGGTACTTACGGTCCCTGGGGGAGAAGAAATACCACTCCTTTTCACCACAGACGGCCAATTCAGGGAGGTGCCAGGGGTCGTATTTGTAAAGGTCAATCTCCTTGATAATGGGGACGGCGATGGGCTGAGAGGAGCACTTTCTGCAGAGGTAATGAAGGACTAGCTCCTCATCAGTGGGGTGAAATCTGAAGCCTGGAGTTGATTTGCGGCGGCGGGGAATCTGATCTGATCTCCGATGAGCGGCGGCTGAGAATTGGGAATTTTGTGAACCGGAGAATTGTGGGATAGATATATATAGAAGAAGGAACACGTGGGGGCTCATGGGGTGGAGGCAGTATCCACGCTCCCCCATCTTCTAGAAGTTCATCTCATTTTATATATTTATTAATTATTACTGGAACGCCATTGGATGGGACCCCATACTAGAAAGAAAAAGATTAAAATGTC

Coding sequence (CDS)

ATGGGGAAAAGGGTATTGATGATACTGGTTCTGATGGTGGCCTCCATGAGCTGTTTGGCTCTCTGTTCAGCTTCTTCGTTCTTTAAGGATAAGCCATGGGAGAGAAGGAGGCCAATTCTGTCGGTTCCGGCCGCATCTTCTTCGTTTGCTTCATCCTCCATCGTGTTGCCTCTTCAAGGGAACGTCTTCCCAAATGGGTTCTATAACGTTACTCTTTATGTAGGGCAGCCTCCAAAGCCTTACTTTCTAGATCCAGACACTGGTAGTGATCTCACTTGGCTTCAATGTGACGCTCCATGTCAACAGTGCACTGAGACACTTCATCCTCTTTATCAACCAAGCAACGATCTTGTGCCGTGTAAGGACCCTCTGTGTATGTCCTTGCACTCATCTATGGACCACAGATGCGAGAACCCAGATCAATGTGACTACGAGGTTGAGTATGCAGATGGTGGTTCGTCTCTTGGAGTCCTTGTCAGGGATGTATTTCCTCTCAACTTAACCAATGGAGATCCAATTAGACCCCGTTTGGCCCTTGGATGTGGTTATGATCAAGATCCTGGATCATCATCTTATCACCCCATGGATGGAATACTTGGCCTTGGAAGGGGAGCAGTAAGCATGGTCTCACAACTGCATAATCAAGGCATTGTCCGTAATGTCATTGGTCACTGTTTCAGCAGCAAAGGAGGAGGATATCTTTTCTTTGGGGATGGCATTTATGATCCCTATCGCTTAGTTTGGACGCCCATGTCACGGGACTACCCGAAGCACTACTCCCCTGGGTTTGGAGAACTAATCTTCAATGGAAGATCTACTGGACTCAGAAACCTGTTTGTAGTTTTTGACAGTGGGAGCTCTTACACATACTTCAATGCTCAGGCTTATCAAGTTTTAACATCTTTGTTGAATAGAGAACTAGCTGGAAAACCGCTAAGAGAAGCCATGGACGATGATACACTTCCGCTCTGTTGGAGAGGGCGGAAGCCATTCAAAAGCTTACGTGATGTGAGAAAATATTTCAAGCCATTGGCCTTGAGCTTTTCCAGTGGTGGAAGAAGCAAAGCAGTGTTTGAAATACCAATGGAAGGTTATCTGATAATATCGTCTATGGGAAATGTTTGCTTAGGAATTCTGAACGGCACCGACGTTGGGCTTGAAAATTCGAATATCATTGGTGATATATCAATGCAAGATAAGATGGTAGTATACAACAACGAGAAGCAAGCAATTGGCTGGGCTACGGCTAACTGTGATCGGGTTCCCAAGTCTCGAGTTGGTAGCATGTAA

Protein sequence

MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGSM
Homology
BLAST of CcUC02G017150 vs. NCBI nr
Match: XP_038900559.1 (aspartic proteinase Asp1 isoform X2 [Benincasa hispida])

HSP 1 Score: 881.3 bits (2276), Expect = 3.3e-252
Identity = 420/429 (97.90%), Postives = 425/429 (99.07%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           M KRVLMILVLMVASMSCLA CSASSFFKDKPWERRRPILSVP ASSSFASSSIV+PLQG
Sbjct: 1   MEKRVLMILVLMVASMSCLAPCSASSFFKDKPWERRRPILSVPIASSSFASSSIVMPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDG+LGLGRGAVSMVSQLHNQGIVRNV+GHCFSSKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGVLGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYR+VWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRIVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           PMEGYLIISSMGN CLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PMEGYLIISSMGNACLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGSM 430
           VPKSRVGSM
Sbjct: 421 VPKSRVGSM 429

BLAST of CcUC02G017150 vs. NCBI nr
Match: XP_004147327.2 (aspartic proteinase Asp1 isoform X1 [Cucumis sativus] >KAE8651999.1 hypothetical protein Csa_016941 [Cucumis sativus])

HSP 1 Score: 870.9 bits (2249), Expect = 4.5e-249
Identity = 414/428 (96.73%), Postives = 423/428 (98.83%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGKRVL++LVLMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 1   MGKRVLVVLVLMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNV+GHCF+SKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWRGRKP KSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 421 VPKSQVSS 428

BLAST of CcUC02G017150 vs. NCBI nr
Match: XP_038900558.1 (aspartic proteinase Asp1 isoform X1 [Benincasa hispida])

HSP 1 Score: 863.6 bits (2230), Expect = 7.1e-247
Identity = 420/464 (90.52%), Postives = 425/464 (91.59%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           M KRVLMILVLMVASMSCLA CSASSFFKDKPWERRRPILSVP ASSSFASSSIV+PLQG
Sbjct: 1   MEKRVLMILVLMVASMSCLAPCSASSFFKDKPWERRRPILSVPIASSSFASSSIVMPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDG+LGLGRGAVSMVSQLHNQGIVRNV+GHCFSSKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGVLGLGRGAVSMVSQLHNQGIVRNVVGHCFSSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYR+VWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRIVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIIS-----------------------------------SMGNVCLGILNGTDVG 420
           PMEGYLIIS                                   SMGN CLGILNGTDVG
Sbjct: 361 PMEGYLIISVKAPCLQFHLLFTEHFFLWILNLNDFTYKLISEWQSMGNACLGILNGTDVG 420

Query: 421 LENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGSM 430
           LENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGSM
Sbjct: 421 LENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGSM 464

BLAST of CcUC02G017150 vs. NCBI nr
Match: XP_008460823.1 (PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo])

HSP 1 Score: 862.8 bits (2228), Expect = 1.2e-246
Identity = 410/428 (95.79%), Postives = 421/428 (98.36%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNV+GHCF+SKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 421 VPKSQVSS 428

BLAST of CcUC02G017150 vs. NCBI nr
Match: TYK02025.1 (aspartic proteinase Asp1 isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 862.8 bits (2228), Expect = 1.2e-246
Identity = 410/428 (95.79%), Postives = 421/428 (98.36%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 42  MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 101

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 102 NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 161

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 162 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 221

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNV+GHCF+SKGGGYLFFGDGI
Sbjct: 222 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 281

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 282 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 341

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 342 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 401

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 402 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 461

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 462 VPKSQVSS 469

BLAST of CcUC02G017150 vs. ExPASy Swiss-Prot
Match: Q0IU52 (Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica OX=39947 GN=ASP1 PE=2 SV=1)

HSP 1 Score: 340.9 bits (873), Expect = 2.1e-92
Identity = 183/388 (47.16%), Postives = 254/388 (65.46%), Query Frame = 0

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++VL L GNV+P G + +T+ +G P K YFLD DTGS LTWLQCDAPC  C    H L
Sbjct: 21  SSAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVL 80

Query: 111 YQPS-NDLVPCKDPLCMSLHSSM--DHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNL 170
           Y+P+   LV C D LC  L++ +    RC +  QCDY ++Y D  SS+GVLV D F L+ 
Sbjct: 81  YKPTPKKLVTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSA 140

Query: 171 TNGDPIRPRLALGCGYDQDPGSSSYH-PMDGILGLGRGAVSMVSQLHNQGIV-RNVIGHC 230
           +NG      +A GCGYDQ   + +   P+D ILGL RG V+++SQL +QG++ ++V+GHC
Sbjct: 141 SNGTN-PTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 200

Query: 231 FSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIF--NGRSTGLRNLFVVFD 290
            SSKGGG+LFFGD       + WTPM+R++ K+YSPG G L F  N ++     + V+FD
Sbjct: 201 ISSKGGGFLFFGDAQVPTSGVTWTPMNREH-KYYSPGHGTLHFDSNSKAISAAPMAVIFD 260

Query: 291 SGSSYTYFNAQAYQ----VLTSLLNRELAGKPLREAMDDD-TLPLCWRGRKPFKSLRDVR 350
           SG++YTYF AQ YQ    V+ S LN E   K L E  + D  L +CW+G+    ++ +V+
Sbjct: 261 SGATYTYFAAQPYQATLSVVKSTLNSEC--KFLTEVTEKDRALTVCWKGKDKIVTIDEVK 320

Query: 351 KYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGT--DVGLENSNIIGDIS 410
           K F+ L+L F+ G + KA  EIP E YLIIS  G+VCLGIL+G+   + L  +N+IG I+
Sbjct: 321 KCFRSLSLEFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGIT 380

Query: 411 MQDKMVVYNNEKQAIGWATANCDRVPKS 425
           M D+MV+Y++E+  +GW    CDR+P+S
Sbjct: 381 MLDQMVIYDSERSLLGWVNYQCDRIPRS 402

BLAST of CcUC02G017150 vs. ExPASy Swiss-Prot
Match: A2ZC67 (Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica OX=39946 GN=ASP1 PE=2 SV=2)

HSP 1 Score: 325.1 bits (832), Expect = 1.2e-87
Identity = 173/386 (44.82%), Postives = 248/386 (64.25%), Query Frame = 0

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++VL L GNV+P G + VT+ +G P KPYFLD DTGS LTWLQCD PC  C +  H L
Sbjct: 21  SSAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGL 80

Query: 111 YQPS-NDLVPCKDPLCMSLHSSM--DHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNL 170
           Y+P     V C +  C  L++ +    +C   +QC Y ++Y  GGSS+GVL+ D F L  
Sbjct: 81  YKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLPA 140

Query: 171 TNGDPIRPRLALGCGYDQDPGSSSY-HPMDGILGLGRGAVSMVSQLHNQGIV-RNVIGHC 230
           +NG      +A GCGY+Q   + +   P++GILGLGRG V+++SQL +QG++ ++V+GHC
Sbjct: 141 SNGTN-PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHC 200

Query: 231 FSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGL--RNLFVVFD 290
            SSKG G+LFFGD       + W+PM+R++ KHYSP  G L FN  S  +    + V+FD
Sbjct: 201 ISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLQFNSNSKPISAAPMEVIFD 260

Query: 291 SGSSYTYFNAQAYQVLTSLLNRELAG--KPLREAMDDD-TLPLCWRGRKPFKSLRDVRKY 350
           SG++YTYF  Q Y    S++   L+   K L E  + D  L +CW+G+   +++ +V+K 
Sbjct: 261 SGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVKKC 320

Query: 351 FKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGT--DVGLENSNIIGDISMQ 410
           F+ L+L F+ G + KA  EIP E YLIIS  G+VCLGIL+G+     L  +N+IG I+M 
Sbjct: 321 FRSLSLKFADGDK-KATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGITML 380

Query: 411 DKMVVYNNEKQAIGWATANCDRVPKS 425
           D+MV+Y++E+  +GW    CDR+P+S
Sbjct: 381 DQMVIYDSERSLLGWVNYQCDRIPRS 402

BLAST of CcUC02G017150 vs. ExPASy Swiss-Prot
Match: Q9M9A8 (Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1)

HSP 1 Score: 299.3 bits (765), Expect = 7.0e-80
Identity = 169/396 (42.68%), Postives = 238/396 (60.10%), Query Frame = 0

Query: 45  ASSSFASSSIVLPLQGNVFPNGFYNVTLYVGQPP--KPYFLDPDTGSDLTWLQCDAPCQQ 104
           ++ S  SS+ + P+ GNV+P+G Y   + VG+P   + Y LD DTGS+LTW+QCDAPC  
Sbjct: 180 SAGSIDSSTTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTS 239

Query: 105 CTETLHPLYQPSND-LVPCKDPLCMSL-HSSMDHRCENPDQCDYEVEYADGGSSLGVLVR 164
           C +  + LY+P  D LV   +  C+ +  + +   CEN  QCDYE+EYAD   S+GVL +
Sbjct: 240 CAKGANQLYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTK 299

Query: 165 DVFPLNLTNGDPIRPRLALGCGYDQDP-GSSSYHPMDGILGLGRGAVSMVSQLHNQGIVR 224
           D F L L NG      +  GCGYDQ     ++    DGILGL R  +S+ SQL ++GI+ 
Sbjct: 300 DKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIIS 359

Query: 225 NVIGHCFSS--KGGGYLFFGDGIYDPYRLVWTPMSRD--------YPKHYSPGFGELIFN 284
           NV+GHC +S   G GY+F G  +   + + W PM  D             S G G L  +
Sbjct: 360 NVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLD 419

Query: 285 GRSTGLRNLFVVFDSGSSYTYFNAQAY-QVLTSLLNRELAGKPLREAMDDDTLPLCWRGR 344
           G +  +    V+FD+GSSYTYF  QAY Q++TSL  +E++G  L     D+TLP+CWR +
Sbjct: 420 GENGRVGK--VLFDTGSSYTYFPNQAYSQLVTSL--QEVSGLELTRDDSDETLPICWRAK 479

Query: 345 K--PFKSLRDVRKYFKPLALSFSSGGR--SKAVFEIPMEGYLIISSMGNVCLGILNGTDV 404
              PF SL DV+K+F+P+ L   S     S+ +  I  E YLIIS+ GNVCLGIL+G+ V
Sbjct: 480 TNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLL-IQPEDYLIISNKGNVCLGILDGSSV 539

Query: 405 GLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 421
              ++ I+GDISM+  ++VY+N K+ IGW  ++C R
Sbjct: 540 HDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVR 570

BLAST of CcUC02G017150 vs. ExPASy Swiss-Prot
Match: Q9S9K4 (Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2)

HSP 1 Score: 150.2 bits (378), Expect = 5.3e-35
Identity = 118/401 (29.43%), Postives = 180/401 (44.89%), Query Frame = 0

Query: 52  SSIVLPLQGN--VFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHP 111
           +SI LPL G+  V   G Y   + +G PPK Y +  DTGSD+ W+ C  PC +C    + 
Sbjct: 56  ASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNL 115

Query: 112 LYQPS---------NDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVR 171
            ++ S         +  V C D  C  +  S    C+    C Y + YAD  +S G  +R
Sbjct: 116 NFRLSLFDMNASSTSKKVGCDDDFCSFI--SQSDSCQPALGCSYHIVYADESTSDGKFIR 175

Query: 172 DVFPLNLTNGD----PIRPRLALGCGYDQDPG-SSSYHPMDGILGLGRGAVSMVSQLHNQ 231
           D+  L    GD    P+   +  GCG DQ     +    +DG++G G+   S++SQL   
Sbjct: 176 DMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAAT 235

Query: 232 GIVRNVIGHCFSS-KGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTG 291
           G  + V  HC  + KGGG   F  G+ D  ++  TPM  +   HY+     +  +G S  
Sbjct: 236 GDAKRVFSHCLDNVKGGG--IFAVGVVDSPKVKTTPMVPN-QMHYNVMLMGMDVDGTSLD 295

Query: 292 L-----RNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRK 351
           L     RN   + DSG++  YF    Y    SL+   LA +P++  + ++T        +
Sbjct: 296 LPRSIVRNGGTIVDSGTTLAYFPKVLYD---SLIETILARQPVKLHIVEETF-------Q 355

Query: 352 PFKSLRDVRKYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENS 411
            F    +V + F P++  F    +      +    YL        C G   G     E S
Sbjct: 356 CFSFSTNVDEAFPPVSFEFEDSVK----LTVYPHDYLFTLEEELYCFGWQAGGLTTDERS 415

Query: 412 NII--GDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGS 429
            +I  GD+ + +K+VVY+ + + IGWA  NC    K + GS
Sbjct: 416 EVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSSIKIKDGS 436

BLAST of CcUC02G017150 vs. ExPASy Swiss-Prot
Match: Q4V3D2 (Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 9.7e-29
Identity = 107/403 (26.55%), Postives = 170/403 (42.18%), Query Frame = 0

Query: 52  SSIVLPLQGNVFPN--GFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQC---TET 111
           ++I LPL G+   +  G Y   + +G PPK Y++  DTGSD+ W+ C APC +C   T+ 
Sbjct: 60  ANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDL 119

Query: 112 LHPL------YQPSNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVR 171
             PL         ++  V C+D  C  +  S    C     C Y V Y DG +S G  ++
Sbjct: 120 GIPLSLYDSKTSSTSKNVGCEDDFCSFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFIK 179

Query: 172 DVFPLNLTNGD----PIRPRLALGCGYDQDPG-SSSYHPMDGILGLGRGAVSMVSQLHNQ 231
           D   L    G+    P+   +  GCG +Q      +   +DGI+G G+   S++SQL   
Sbjct: 180 DNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAG 239

Query: 232 GIVRNVIGHCFSSKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGL 291
           G  + +  HC  +  GG +F    +  P  +V T        HY+     +  +G    L
Sbjct: 240 GSTKRIFSHCLDNMNGGGIFAVGEVESP--VVKTTPIVPNQVHYNVILKGMDVDGDPIDL 299

Query: 292 RNLF--------VVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRG 351
                        + DSG++  Y     Y    SL+ +  A + ++  M  +T       
Sbjct: 300 PPSLASTNGDGGTIIDSGTTLAYLPQNLY---NSLIEKITAKQQVKLHMVQETFAC---- 359

Query: 352 RKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLE 411
              F    +  K F  + L F    +      +    YL        C G  +G     +
Sbjct: 360 ---FSFTSNTDKAFPVVNLHFEDSLK----LSVYPHDYLFSLREDMYCFGWQSGGMTTQD 419

Query: 412 NSNII--GDISMQDKMVVYNNEKQAIGWATANCDRVPKSRVGS 429
            +++I  GD+ + +K+VVY+ E + IGWA  NC    K + GS
Sbjct: 420 GADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVKDGS 443

BLAST of CcUC02G017150 vs. ExPASy TrEMBL
Match: A0A5D3BS69 (Aspartic proteinase Asp1 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold680G00070 PE=4 SV=1)

HSP 1 Score: 862.8 bits (2228), Expect = 5.9e-247
Identity = 410/428 (95.79%), Postives = 421/428 (98.36%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 42  MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 101

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 102 NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 161

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 162 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 221

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNV+GHCF+SKGGGYLFFGDGI
Sbjct: 222 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 281

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 282 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 341

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 342 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 401

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 402 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 461

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 462 VPKSQVSS 469

BLAST of CcUC02G017150 vs. ExPASy TrEMBL
Match: A0A1S3CDB2 (aspartic proteinase Asp1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=4 SV=1)

HSP 1 Score: 862.8 bits (2228), Expect = 5.9e-247
Identity = 410/428 (95.79%), Postives = 421/428 (98.36%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGDGI 240
           CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNV+GHCF+SKGGGYLFFGDGI
Sbjct: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           P+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR
Sbjct: 361 PIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKS+V S
Sbjct: 421 VPKSQVSS 428

BLAST of CcUC02G017150 vs. ExPASy TrEMBL
Match: A0A1S3CDB4 (aspartic proteinase Asp1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=4 SV=1)

HSP 1 Score: 857.1 bits (2213), Expect = 3.2e-245
Identity = 410/432 (94.91%), Postives = 421/432 (97.45%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MGK VL++L LMVASMSCLA CSASSFFKDKPWER+RPILSVP ASSSFASSSIVLPLQG
Sbjct: 1   MGKWVLVVLALMVASMSCLAPCSASSFFKDKPWERKRPILSVPTASSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180

Query: 181 ----CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFF 240
               CGYDQDPGSSSYHPMDGILGLGRGAVS+VSQLHNQGIVRNV+GHCF+SKGGGYLFF
Sbjct: 181 CQLICGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFF 240

Query: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300
           GDGIYDPYRLVWTPMSRDYPKHYSPGFGEL+FNGRSTGLRNLFVVFDSGSSYTYFNAQAY
Sbjct: 241 GDGIYDPYRLVWTPMSRDYPKHYSPGFGELMFNGRSTGLRNLFVVFDSGSSYTYFNAQAY 300

Query: 301 QVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKA 360
           QVLTSLLNRELAGKPLREAMDDDTLPLCWR RKP KSLRDVRKYFKPLALSFSSGGRSKA
Sbjct: 301 QVLTSLLNRELAGKPLREAMDDDTLPLCWRERKPIKSLRDVRKYFKPLALSFSSGGRSKA 360

Query: 361 VFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA 420
           VFEIP+EGY+IISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA
Sbjct: 361 VFEIPIEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATA 420

Query: 421 NCDRVPKSRVGS 429
           NCDRVPKS+V S
Sbjct: 421 NCDRVPKSQVSS 432

BLAST of CcUC02G017150 vs. ExPASy TrEMBL
Match: A0A6J1DZ15 (aspartic proteinase Asp1 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111024375 PE=4 SV=1)

HSP 1 Score: 825.5 bits (2131), Expect = 1.0e-235
Identity = 391/428 (91.36%), Postives = 411/428 (96.03%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSFASSSIVLPLQG 60
           MG  +L ILVLMVASM+CLA  SASSFFKDKPWERR+PILSV A SSSFASSSIVLPLQG
Sbjct: 1   MGTGLLKILVLMVASMNCLAPSSASSFFKDKPWERRKPILSVSATSSSFASSSIVLPLQG 60

Query: 61  NVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPC 120
           NV+PNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLY+PS+DLVPC
Sbjct: 61  NVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYRPSDDLVPC 120

Query: 121 KDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALG 180
           KDPLCMSLHSS+DHRCENPDQCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRLALG
Sbjct: 121 KDPLCMSLHSSVDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLALG 180

Query: 181 CGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGDGI 240
           CGYDQ PGSS YHPMDGILGLG+GAVS+VSQLHNQGI+RNVIGHCFSS+GGGYLFFGD I
Sbjct: 181 CGYDQIPGSSYYHPMDGILGLGKGAVSIVSQLHNQGIIRNVIGHCFSSRGGGYLFFGDDI 240

Query: 241 YDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLT 300
           YD +R+VWTPMSRDYPKHYSPG GELIFNGRSTGLRNLF VFDSGSSYTYFNAQAYQVLT
Sbjct: 241 YDSHRVVWTPMSRDYPKHYSPGLGELIFNGRSTGLRNLFAVFDSGSSYTYFNAQAYQVLT 300

Query: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360
           SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI
Sbjct: 301 SLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEI 360

Query: 361 PMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDR 420
           PMEGYLI+SSMGNVCLGILNGT+VGL+NSNIIGDISM DK+V+YNNEKQAIGWATANCDR
Sbjct: 361 PMEGYLILSSMGNVCLGILNGTEVGLQNSNIIGDISMHDKIVIYNNEKQAIGWATANCDR 420

Query: 421 VPKSRVGS 429
           VPKSR  +
Sbjct: 421 VPKSRAAA 428

BLAST of CcUC02G017150 vs. ExPASy TrEMBL
Match: A0A6J1I590 (aspartic proteinase Asp1-like OS=Cucurbita maxima OX=3661 GN=LOC111469733 PE=4 SV=1)

HSP 1 Score: 824.7 bits (2129), Expect = 1.8e-235
Identity = 391/431 (90.72%), Postives = 410/431 (95.13%), Query Frame = 0

Query: 1   MGKRVLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVP--AASSSFASSSIVLPL 60
           MGK VLMILVLMV+S+SCLA CSASSFFKDK WERRRP LSVP  +ASSS AS SIVLPL
Sbjct: 1   MGKGVLMILVLMVSSISCLAPCSASSFFKDKLWERRRPTLSVPIASASSSIASPSIVLPL 60

Query: 61  QGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLV 120
           QGNVFPNGFYNVTL++GQPPKPYFLDPDTGSDLTWLQCDAPCQQCTET HPLYQPSNDLV
Sbjct: 61  QGNVFPNGFYNVTLFIGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETPHPLYQPSNDLV 120

Query: 121 PCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLA 180
           PCKDPLCMSLHSS+DHRCENPDQCDYEVEYADGGSSLGVLVRD+FPLNLTNGDPIRPRL 
Sbjct: 121 PCKDPLCMSLHSSIDHRCENPDQCDYEVEYADGGSSLGVLVRDIFPLNLTNGDPIRPRLT 180

Query: 181 LGCGYDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGD 240
           LGCGYDQ+PGSSSYHPMDG+LGLG+GAVS+VSQLHNQGIVRNV+GHCFSSKGGGYLFFGD
Sbjct: 181 LGCGYDQNPGSSSYHPMDGVLGLGKGAVSIVSQLHNQGIVRNVVGHCFSSKGGGYLFFGD 240

Query: 241 GIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQV 300
            IYDPYRL WTPMSRDYPKHYSPGFG+L FNGRSTGLRNLFVVFDSGSSYTYFNAQAYQ+
Sbjct: 241 DIYDPYRLAWTPMSRDYPKHYSPGFGDLFFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQI 300

Query: 301 LTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVF 360
           +TSLLNREL GKPLREA DDDTLPLCWRGR PFKSLRDVRKYFKPLALSFSSG RSKAVF
Sbjct: 301 ITSLLNRELTGKPLREAKDDDTLPLCWRGRNPFKSLRDVRKYFKPLALSFSSGRRSKAVF 360

Query: 361 EIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANC 420
           E+P E YLIISS GNVCLGILNG++VGLENSNIIGDISMQDKMVVYNNEKQAIGWATANC
Sbjct: 361 EMPTESYLIISSKGNVCLGILNGSEVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANC 420

Query: 421 DRVPKSRVGSM 430
           DRVPKS VGS+
Sbjct: 421 DRVPKSSVGSL 431

BLAST of CcUC02G017150 vs. TAIR 10
Match: AT4G33490.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 577.8 bits (1488), Expect = 7.3e-165
Identity = 274/420 (65.24%), Postives = 332/420 (79.05%), Query Frame = 0

Query: 5   VLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSF--ASSSIVLPLQGNV 64
           V  ++VLMV S+  L   SA  F     W +          S  F  A SS+V P+ GNV
Sbjct: 6   VRFMIVLMVMSL-VLGFSSAVDF----RWRK------TAGFSDRFTRAVSSVVFPVHGNV 65

Query: 65  FPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKD 124
           +P G+YNVT+ +GQPP+PY+LD DTGSDLTWLQCDAPC +C E  HPLYQPS+DL+PC D
Sbjct: 66  YPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCND 125

Query: 125 PLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCG 184
           PLC +LH + + RCE P+QCDYEVEYADGGSSLGVLVRDVF +N T G  + PRLALGCG
Sbjct: 126 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCG 185

Query: 185 YDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGDGIYD 244
           YDQ PG+SS+HP+DG+LGLGRG VS++SQLH+QG V+NVIGHC SS GGG LFFGD +YD
Sbjct: 186 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 245

Query: 245 PYRLVWTPMSRDYPKHYSPGF-GELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTS 304
             R+ WTPMSR+Y KHYSP   GEL+F GR+TGL+NL  VFDSGSSYTYFN++AYQ +T 
Sbjct: 246 SSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTY 305

Query: 305 LLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEIP 364
           LL REL+GKPL+EA DD TLPLCW+GR+PF S+ +V+KYFKPLALSF +G RSK +FEIP
Sbjct: 306 LLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIP 365

Query: 365 MEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRV 422
            E YLIIS  GNVCLGILNGT++GL+N N+IGDISMQD+M++Y+NEKQ+IGW   +CD +
Sbjct: 366 PEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 414

BLAST of CcUC02G017150 vs. TAIR 10
Match: AT4G33490.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 535.8 bits (1379), Expect = 3.2e-152
Identity = 257/392 (65.56%), Postives = 308/392 (78.57%), Query Frame = 0

Query: 5   VLMILVLMVASMSCLALCSASSFFKDKPWERRRPILSVPAASSSF--ASSSIVLPLQGNV 64
           V  ++VLMV S+  L   SA  F     W +          S  F  A SS+V P+ GNV
Sbjct: 3   VRFMIVLMVMSL-VLGFSSAVDF----RWRK------TAGFSDRFTRAVSSVVFPVHGNV 62

Query: 65  FPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDLVPCKD 124
           +P G+YNVT+ +GQPP+PY+LD DTGSDLTWLQCDAPC +C E  HPLYQPS+DL+PC D
Sbjct: 63  YPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLYQPSSDLIPCND 122

Query: 125 PLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPRLALGCG 184
           PLC +LH + + RCE P+QCDYEVEYADGGSSLGVLVRDVF +N T G  + PRLALGCG
Sbjct: 123 PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLTPRLALGCG 182

Query: 185 YDQDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSSKGGGYLFFGDGIYD 244
           YDQ PG+SS+HP+DG+LGLGRG VS++SQLH+QG V+NVIGHC SS GGG LFFGD +YD
Sbjct: 183 YDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGGGILFFGDDLYD 242

Query: 245 PYRLVWTPMSRDYPKHYSPGF-GELIFNGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTS 304
             R+ WTPMSR+Y KHYSP   GEL+F GR+TGL+NL  VFDSGSSYTYFN++AYQ +T 
Sbjct: 243 SSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTYFNSKAYQAVTY 302

Query: 305 LLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALSFSSGGRSKAVFEIP 364
           LL REL+GKPL+EA DD TLPLCW+GR+PF S+ +V+KYFKPLALSF +G RSK +FEIP
Sbjct: 303 LLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIP 362

Query: 365 MEGYLIISSMGNVCLGILNGTDVGLENSNIIG 394
            E YLIIS  GNVCLGILNGT++GL+N N+IG
Sbjct: 363 PEAYLIISMKGNVCLGILNGTEIGLQNLNLIG 383

BLAST of CcUC02G017150 vs. TAIR 10
Match: AT1G44130.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 432.6 bits (1111), Expect = 3.8e-121
Identity = 199/396 (50.25%), Postives = 285/396 (71.97%), Query Frame = 0

Query: 39  ILSVPAASSSF-------ASSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDL 98
           ++ VP + SS        + SS+V PL GNVFP G+Y+V + +G PPK +  D DTGSDL
Sbjct: 13  LVIVPLSKSSIFKTFIKSSPSSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDL 72

Query: 99  TWLQCDAPCQQCTETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENP-DQCDYEVEYAD 158
           TW+QCDAPC  CT   +  Y+P  +++PC +P+C +LH      C NP +QCDYEV+YAD
Sbjct: 73  TWVQCDAPCSGCTLPPNLQYKPKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYAD 132

Query: 159 GGSSLGVLVRDVFPLNLTNGDPIRPRLALGCGYDQDPGSSSYHPMD-GILGLGRGAVSMV 218
            GSS+G LV D FPL L NG  ++P +A GCGYDQ   S+   P   G+LGLGRG + ++
Sbjct: 133 QGSSMGALVTDQFPLKLVNGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLL 192

Query: 219 SQLHNQGIVRNVIGHCFSSKGGGYLFFGDGIYDPYRLVWTP-MSRDYPKHYSPGFGELIF 278
           +QL + G+ RNV+GHC SSKGGG+LFFGD +     + WTP +S+D   HY+ G  +L+F
Sbjct: 193 TQLVSAGLTRNVVGHCLSSKGGGFLFFGDNLVPSIGVAWTPLLSQD--NHYTTGPADLLF 252

Query: 279 NGRSTGLRNLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGR 338
           NG+ TGL+ L ++FD+GSSYTYFN++AYQ + +L+  +L   PL+ A +D TLP+CW+G 
Sbjct: 253 NGKPTGLKGLKLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGA 312

Query: 339 KPFKSLRDVRKYFKPLALSFSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLEN 398
           KPFKS+ +V+ +FK + ++F++G R+  ++  P E YLI+S  GNVCLG+LNG++VGL+N
Sbjct: 313 KPFKSVLEVKNFFKTITINFTNGRRNTQLYLAP-ELYLIVSKTGNVCLGLLNGSEVGLQN 372

Query: 399 SNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKS 425
           SN+IGDISMQ  M++Y+NEKQ +GW +++C+++PK+
Sbjct: 373 SNVIGDISMQGLMMIYDNEKQQLGWVSSDCNKLPKT 405

BLAST of CcUC02G017150 vs. TAIR 10
Match: AT1G77480.2 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 410.2 bits (1053), Expect = 2.0e-114
Identity = 195/377 (51.72%), Postives = 261/377 (69.23%), Query Frame = 0

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++V P+ GNV+P G+Y V L +G PPK + LD DTGSDLTW+QCDAPC  CT+     
Sbjct: 50  SSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ 109

Query: 111 YQPSNDLVPCKDPLCMSLHSSMDHRCENP-DQCDYEVEYADGGSSLGVLVRDVFPLNLTN 170
           Y+P+++ +PC   LC  L    D  C +P DQCDYE+ Y+D  SS+G LV D  PL L N
Sbjct: 110 YKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLAN 169

Query: 171 GDPIRPRLALGCGYD-QDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSS 230
           G  +  RL  GCGYD Q+PG     P  GILGLGRG V + +QL + GI +NVI HC S 
Sbjct: 170 GSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSH 229

Query: 231 KGGGYLFFGDGIYDPYRLVWTPMSRDYP-KHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 290
            G G+L  GD +     + WT ++ + P K+Y  G  EL+FN ++TG++ + VVFDSGSS
Sbjct: 230 TGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSS 289

Query: 291 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALS 350
           YTYFNA+AYQ +  L+ ++L GKPL +  DD +LP+CW+G+KP KSL +V+KYFK + L 
Sbjct: 290 YTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLR 349

Query: 351 FSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNE 410
           F +  ++  +F++P E YLII+  G VCLGILNGT++GLE  NIIGDIS Q  MV+Y+NE
Sbjct: 350 FGN-QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNE 409

Query: 411 KQAIGWATANCDRVPKS 425
           KQ IGW +++CD++PKS
Sbjct: 410 KQRIGWISSDCDKLPKS 425

BLAST of CcUC02G017150 vs. TAIR 10
Match: AT1G77480.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 406.8 bits (1044), Expect = 2.2e-113
Identity = 193/375 (51.47%), Postives = 259/375 (69.07%), Query Frame = 0

Query: 51  SSSIVLPLQGNVFPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPL 110
           SS++V P+ GNV+P G+Y V L +G PPK + LD DTGSDLTW+QCDAPC  CT+     
Sbjct: 50  SSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ 109

Query: 111 YQPSNDLVPCKDPLCMSLHSSMDHRCENP-DQCDYEVEYADGGSSLGVLVRDVFPLNLTN 170
           Y+P+++ +PC   LC  L    D  C +P DQCDYE+ Y+D  SS+G LV D  PL L N
Sbjct: 110 YKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLAN 169

Query: 171 GDPIRPRLALGCGYD-QDPGSSSYHPMDGILGLGRGAVSMVSQLHNQGIVRNVIGHCFSS 230
           G  +  RL  GCGYD Q+PG     P  GILGLGRG V + +QL + GI +NVI HC S 
Sbjct: 170 GSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSH 229

Query: 231 KGGGYLFFGDGIYDPYRLVWTPMSRDYP-KHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 290
            G G+L  GD +     + WT ++ + P K+Y  G  EL+FN ++TG++ + VVFDSGSS
Sbjct: 230 TGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSS 289

Query: 291 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPFKSLRDVRKYFKPLALS 350
           YTYFNA+AYQ +  L+ ++L GKPL +  DD +LP+CW+G+KP KSL +V+KYFK + L 
Sbjct: 290 YTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLR 349

Query: 351 FSSGGRSKAVFEIPMEGYLIISSMGNVCLGILNGTDVGLENSNIIGDISMQDKMVVYNNE 410
           F +  ++  +F++P E YLII+  G VCLGILNGT++GLE  NIIGDIS Q  MV+Y+NE
Sbjct: 350 FGN-QKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGYNIIGDISFQGIMVIYDNE 409

Query: 411 KQAIGWATANCDRVP 423
           KQ IGW +++CD++P
Sbjct: 410 KQRIGWISSDCDKLP 423

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038900559.13.3e-25297.90aspartic proteinase Asp1 isoform X2 [Benincasa hispida][more]
XP_004147327.24.5e-24996.73aspartic proteinase Asp1 isoform X1 [Cucumis sativus] >KAE8651999.1 hypothetical... [more]
XP_038900558.17.1e-24790.52aspartic proteinase Asp1 isoform X1 [Benincasa hispida][more]
XP_008460823.11.2e-24695.79PREDICTED: aspartic proteinase Asp1 isoform X2 [Cucumis melo][more]
TYK02025.11.2e-24695.79aspartic proteinase Asp1 isoform X2 [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
Q0IU522.1e-9247.16Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica OX=39947 GN=ASP1 PE=2 S... [more]
A2ZC671.2e-8744.82Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica OX=39946 GN=ASP1 PE=2 SV=... [more]
Q9M9A87.0e-8042.68Aspartyl protease APCB1 OS=Arabidopsis thaliana OX=3702 GN=APCB1 PE=1 SV=1[more]
Q9S9K45.3e-3529.43Aspartic proteinase 39 OS=Arabidopsis thaliana OX=3702 GN=A39 PE=1 SV=2[more]
Q4V3D29.7e-2926.55Aspartic proteinase 36 OS=Arabidopsis thaliana OX=3702 GN=A36 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A5D3BS695.9e-24795.79Aspartic proteinase Asp1 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
A0A1S3CDB25.9e-24795.79aspartic proteinase Asp1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=4... [more]
A0A1S3CDB43.2e-24594.91aspartic proteinase Asp1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499584 PE=4... [more]
A0A6J1DZ151.0e-23591.36aspartic proteinase Asp1 isoform X2 OS=Momordica charantia OX=3673 GN=LOC1110243... [more]
A0A6J1I5901.8e-23590.72aspartic proteinase Asp1-like OS=Cucurbita maxima OX=3661 GN=LOC111469733 PE=4 S... [more]
Match NameE-valueIdentityDescription
AT4G33490.27.3e-16565.24Eukaryotic aspartyl protease family protein [more]
AT4G33490.13.2e-15265.56Eukaryotic aspartyl protease family protein [more]
AT1G44130.13.8e-12150.25Eukaryotic aspartyl protease family protein [more]
AT1G77480.22.0e-11451.72Eukaryotic aspartyl protease family protein [more]
AT1G77480.12.2e-11351.47Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 47..237
e-value: 3.5E-45
score: 156.3
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 242..422
e-value: 2.2E-27
score: 97.6
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 60..421
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 280..414
e-value: 5.9E-15
score: 55.4
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 68..238
e-value: 3.2E-48
score: 164.3
IPR001461Aspartic peptidase A1 familyPANTHERPTHR13683ASPARTYL PROTEASEScoord: 8..423
NoneNo IPR availablePANTHERPTHR13683:SF800EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEINcoord: 8..423
NoneNo IPR availablePROSITEPS51257PROKAR_LIPOPROTEINcoord: 1..18
score: 6.0
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 68..414
score: 34.367401

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CcUC02G017150.1CcUC02G017150.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0003677 DNA binding