CaUC01G012220 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC01G012220
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: metalloendoproteinase 4-MMP
Location: Ciama_Chr01: 23388332 .. 23423039 (+)
RNA-Seq Expression: CaUC01G012220
Synteny: CaUC01G012220
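
The Location field above packs the chromosome, start, end, and strand into one string. As a minimal sketch, the snippet below splits that string into its parts and computes the length of the gene span. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this page does not state explicitly.

```python
import re

# Hypothetical helper: split a location string such as
# "Ciama_Chr01: 23388332 .. 23423039 (+)" into its components.
_LOCATION_RE = re.compile(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(location):
    """Return (seqid, start, end, strand) parsed from a location string."""
    match = _LOCATION_RE.fullmatch(location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Ciama_Chr01: 23388332 .. 23423039 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene is 34,708 bp. If the coordinates were instead half-open, the length would be one base shorter.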
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATTATGGTTATACTCCCAGATATTGCTCCTCATATAATATACAAGTCTTACAATCACAACAACTACATCAAGTTTCACTATTTACAATTCACACATAATTTTAGGGAATATACAATTTAATTTAATTTAATTTAACTTTTATCTAGCTCGTGTAATTATGAACAAATAAATGTAGAGCATTGTCTAAAAGCTATATATATATATATATATATATATATTTCAAGGAGTACTCCATATTCTTGTGTTAACATATAAATCTTTGTTCTATTTCTTCTTTTAGGGTAGATATTTCTGATTAGTTGAATATTTAAAAAAATACAAGGAGAAAAGGAAAAGTGAAAGAGAAGGTTGTTCCACCAAACTTCCAGTGTCTGTCAAAGTTCAATCATCAAGTAAATAGCAGAGAAAATTGAAATTCCAACAATATAAGATCAACATTCACATATCAAATATCCAAAAACGGAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAGAGAAAAAGAAATAACCCCATTCATAAATTTTACTTTTTCCTCATGTATATAGCAAACACAAGCTTCCATTTCTCTGAACTGTTTGTATGCCAATGGTCTCACCGCCACCGCCATGTTTCTGCCCCTTCTATTGGTCCTCTTCTTCATCCTCTGTACGCCGCCCTGCCTTCCATCCCCGACAACACTGCCCCAAAACCCCACCGCCACGGTGATTTACCGGTCAAATTCCACTCTACTGCGGTGGCGCGTGTCCGACACATTGCATAGACCCCATGCGACGGCGCATTATGTATTTTTTCCTGGGAAGCCACGGTGGATGCGG
CGAATGCCGATGGTGCTGACGTACGCATTTTTCAAGGAGAATTGGATGAGCAACTTGAGTTTAAAAGACGTGAAAGCTGTATTTCGTAAGGCGTTTTCGAAATGGGCCCATGTGATTCCGGTGAGTTTCGAGGAGACCGACGACTATGGATACGCGGATATTAAAATCGGGTTTTATAGAGGCGATCATAGCGACGGGCAGGCGTTCGACGGCGTGTTGGGGGTTTTGGCACATTCTTTTTCCCCTGAGATTGGGAGGCTCCACCTAGATGCGGCGGAGACGTGGGCGGTTGATTTCGAGAAGGAGAAATCGGCGGTGGCGGTTGATTTGGAATCGGTGGCGACGCATGAGATTGGTCATTTGTTGGGGCTGGGTCACAGCTCTGTGAAGGAATCGGTAATGTATCCAAGCTTGAAGCCGAGGGAAAAGAAGGCAAATTTAGAAGTGGATGACGTGGAAGGCGTGCAAGCCTTGTACGGATCGAATCCAAATTTCAAGTTTGAGTCTTTGGAATTGAAGTCCGACACTTCTGCCAGTCAGCCACACGTTTCCACGTGTACAATAGCTTGCCATTTTTCCCTCGCCTTCTTCCTTGTTTCCACTTACCATTTTATTCTTTCTCCATAAATATATATGTATAAATAGTTTCTTAAAATGACATAAAGAATGATCACAGACAAATGAGAATTTCTGATTGATAAGAAGGTGATCAAATATAGGCCTAAAACTAATATATAAGTTGTAACAATACAAATATATAATACCCAGCTCAAAAGTGAAGTATGTTTTACAACTTTTTACCATTTGAAATCATGTTTAAAAAACTATCAAACATCAGTGATTCCAAAAATAATGTCTAACGGCTAGTTTTTTCTTTTTTTCTTTTCTTTTACTCCTTAGTTTAAACAAATCTATGTAATTGACTCTTACAACATATAAAAGAGTTGAGAAATGAGCAAGGAATAGTAGATAAAAAAGATTATAAAGAGACTTATCTTCCTATTAACTCCAAGAAAGCTGTCCTTCCTGTGGATGTAGGCTGATCTTAGCCGAACCACGTAAAACACTGTCTTAGTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTTTTCTTTGATTTCTTCAAGCTGCAAGGTTTCCTCTTCAATTTAGTTTCTGTCCGACATTAGAGAGAGGAGAAAGAGAGGAAGTATTTTCTACAAATCATAAAGAAAACTAAAGAGGAAAGGAAAGGATATTTCCTCACCTCTTCTACATCTCCTTCACGTGGTTTAAGCTCAGCTACACGTGGAGAATTCTCTTCCACATCTGTCACCAAATTTAGGTGTTCAGACTTCAAGAAGGATCAAATTGGGCTCATCTTCCGCCTGCTGTTTTTGATTAATTTGCGCTTTCACCTTGCAAAACTGGTATTAGAGCTTGCATTCAATCAAGAAAGAGTTTCTTCAAAAATTCAAGATTCAAAACAGAGAAAAAAGAATGGCGGCTGTAAGATTTGAGATTGAAAAATTTGATGGCAAAGGTGATTTTGGTCTTTGGAAAGCTCACAAAAGCTCACAAGTACCTTTTGGATCCATCAATTCTGCCCAACACAGCTACAACTTAAGAAAAAGAAGATATGGAGTTGGCAGCTTATGGTACACTTGTGTTAAACCTAAGTGATAGTGTACTAAGACAATTGATAAACCAAGATACAACATACAAAATGTGGACAAAGTTGGAGACAATTGATAAACCAAGATACAACTAAAGATCTTCCAAACAAGATGTTCCTAAGGGAGAAGTTCTTTACATTTAGGATGGATTCCTCCAAGACTCTTGCAGAGAATTTAGATGAATCCAAGAAATTTTTTTTTGAGTTTAAAAATCTTGGAGAAAGACTGGGTGATGACAATGAGGCTTGTATTCTTTTGAACTCACTATCGGATTCTTACAAGAATGTGAAGAATGCTTTGAAATATGGCAGAGACACAATTACAACAGATGCTATCATTTCAGCCT
GAAGGACAAAAGATTTAGAGCTTCAAGTCACCAAGATGGAACAACGAAGTGCAGAGGGTTTGTTTGTTAAAGGTAAAAGCAAATTTAATCCCTCTAAAGGAGGAAAATCAACAAACAAATGAAGAGCTTACCCAAGACTAAACTCAGGTGAAATTATTGCAAGAAGAAAGGTCATCTAATGAAGGATTGCTATAGCTTAAAAATGAAGAATCAAGAGAAGGAGAAAGACTACAAGAACAAACAACCGGAAGCCTCTATAGTAGAAGGCTCATACATCTACTCGGATGCCCTTGCTACAGCTCAAGACAAAGCTAACCAAGTGAGTCCTCTAGGGAAGCATGATTAGGTACTAGATTCCGGATGTACCAATCACATGAGCCCTTTTAGAACTTCGTTCAACACCTGCAAAGAAATTGATGGAGAGCTAGTATAAATGGGGAACAATGAAGCTTGCAAGATCAAAGGAATTGGCTCGATCTCATTAAAGCTAACAAATGGGACAGTGAAGCTTCTAAGGAATGTGAGGTATGTACCTTTGCTAAAAAGAAACTTGATTTCTTTAGGCATATTACACTCCATTGGGTGTGAATACAAAGGAAAATGAGGTACCCTAGAAGTGATCAAGGATTCTAAAGTAGTTATGGTTGGAGAAAAGGTGAATGATTTGTTTGTAGTCAGGGGAGTAGAAATGGTAAATGGTGCTTATACAGTGAACAAAGTAGCCTTAACCGAAGCTGATTTATGGCATAAAAGACTCCCTCATATTAGTCATAAAGGTATGCAAGCACTATCAAATCAAGGTATATTACCTAAAGGTATTTGTGAGCATCTCTCATTCTGTGAACACTGTGTTTTTGGGAAAGCTACAAGGAAAAACTTCACCAAAGCCCAGCACACTACCAAGGGAATTATAGACTACATTCACTCAAATTTGTGGGGGCCTGCTACAACACCAAGTCTAAGTGGTTCAAGGTATTTTCTATCTTTAATTGATGACTATTCTAAGAAAAGTTGGCTATATTTCCTTAAAACTAAAGACCAAGTGTTTAACAAAATTAAAGACTGGAAAACCTTAGTGGAAACTCAAACCAACAGAAAGATAAAATATCTTAGAACTGATAATGACTTGGAGTTCTGTAGTGAAGAGTTTAATAAATTTTGTAGAGATAATGGTATCACAAGGCACAAAACTGTTAGACACACTCCACAACAGAATGGTGTGGCTGAGAGGTTAAATAGAACAATAATGGAAAGGGTTAGGGGTGTACACGGGTTGGGTTGGGTTGGGTTGAGGGTATTTTTTGGACCAACCCGAAAATTCGGATTGGTCAGGTCGGCTACCCGAATAGGGTACCCAACCCAACCCAACCCAACCCTTAAATTTCGGGTTGGGTTGGGTTGGGTTGGGTTGTCGGGTTGTGTGTGTGTGTGTGTTTTTTTTTTCTTTTCTTTTCCCCCTATTTTTTGGTAAAACACATTGGCATATCAATAAATACTAATTGACATATCAATTGATCCAAGATAAAGTATAAAATACATTGATCCAAGAAAAAAATATAAAATAGTTCTTAACACTAACTTAAACTAATATATATATATATATATATATATATATATATATATATATATATATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATATATATATATATATATATATATATATATATATATATATATATATTTTTAATATTAATATTAGAAATTCGGGTTGGGTTAGGTCAACCCGAATTTTCAAAATTGCACCCCGAGACCCAACCCAACCCGAAAAATACAAAAAAAATAAAAAATTAACCCAACCCAACCCTTATTATTCGGGTTGGGTAGTCCGGGTTGTTCGAGTTGTTGGGTCATTTGAACACCCCTAGAAAGCGTAAGGGGGCAGAGGCAGCAGCTTACACTGTTTTCACCTTAAATACGTGTCCTCACACCTTTTCAGCTTTATTAACCCCTGAAGAAAAATGGTCTAATCACCCACCTAATTTAGATAATCTAAGAGTTTTTGGATGTGTTGGTAATGTTCATCAAAATCAGGGGAAGCTTAAAGCTAGAGTAGCTAGTGAAAGAAGGTTCATTGTGAGTAGGGATGTTACTTTCAGAGAAGATTAGATGTTCATGCACAAAGAGGGTATTATAAAGGAGAAGGCAGAGTCATCCAATTCTAAAATTGAGGTGGAGAAAACACCTGAACCTATTGCTCAAGGTGAAACAGAAGATGAAGGTGAAACTTTAATAGAACAACCTGAAATCTCAGAACCTCAACTAGATTTGAGTCAGTACTCACTACGTAGAGATAGACAAAGGAGTATTATAGTTCCACCAGCTAGGTATGTTGATATTAACTACATGAACCTAGTTTTAAGTGTTACTATTGCTCCTAATGATCATGAACCTACTACCTTTGAGGAAGGAATAAGTGGCAAGATGCAATACAGTGGATTGAGGCAATGAATGAGGAAATTCATTCTCTAAATATAAATGAGACCTGGACTTTAGCAACTTTACCAAAAGGGTATAAACCTATTGCCTCAAAGTGGATCTACAAACTAAAGGAAAAAATATCTAAGATCATCCTCCTAGGTACAAGGCAAGGGTTTTACTCAAAGATAAGGCATAGATTACTCAGAGATTTTCTCTCCAGTTGTAAAGCAAACCTCAATCAGACTTCTCCTATCCTTAGTGGCTCAAAACAACCTAGAATTGAACCAACTTGATGTTAAAATAGCTTTTCTTCATGGCTACCTAGAAGAAACCATCTATATGGAACAACCTAGGGGATATAAGGTAAAGGGAAAGGAGAATCTATATTGTTTGCTAAATAAGTCCATCTATGGGCTTAAGCAATTTCCTAGATGCTGGTATAGGAAGTTTAATGACTACCTTGGAGGTATGGGTTTCAAGAGAAGATCCTATGACATGTGTGTCTATGTGAATCAAACTTCATTCAAAGATAAGGTATATCTTCTACTATATGTAGATGATATGTTATTGGTAGGAAGACCTCTCCCATGTCAAAACCCTTTTAAATAAAGAGTTTGACATGAAAGACTTAGGAGAATGTAAAAGGATACTTGGCATAGACATCACAAGGAGCAGGGACCAAAACTTCCTAACCATAAGTCAAACATCCTACTATGGGAAACTAATCAAGAGGTTCAACCTAAGTCAAGCAAAGCCAGTAACTATCCTGTGGGAAACTAATCAAGAGGTTCAACCTAAGTCAGGAAGCAGTAACTATCCCTATTGCTCAACATTTCAAACTCTCTTCTGCTAACTCTCCTAATGAAACTAACCTATAGAGTATAAA
CAACAAATGGAAAATGTTCCCTATAGTCAAGTTGTGGGCTTAGAAAGTCTATGCCTTTTGAAAACCTATTAATTAATAATTAAGTTAAGGCATGCTCTCGAAAAAGATACCTTCCATTCTCTTGTCTCCGGCGATTAACCTATTCCAGAGCAGTCTGGTGATTAGCCTATCTCGCACTACTCGAAGTCTTCGTAAACTCATTGTCCGTAGTGGAGGTTGGAGGCTTCGTTGTCCGGCTCCTCATAAATGATGTTATCGGCAGTCAGAGACGGGCTATTGGAAGCCGGTGTTGATCCCTATAATGGGTTAGTCTGATGTACTTAATGATCTCTACAAGACTCGACCTATCCTACTCAACAAGTCTAGTCAGTAGGTACATGTCTAATTCGGGTAGAAGGCATTGGGAAGCTACAAAGCGGGTTATCAGATATCTAAAGTGGTCGATGGGTGCCAAGCTTTACTATTAAAAGATTGATAAATCAAAATTAGAGCTAATTGGTTTTGTTGGTTCTGATTTTGCAAGAGATTTAGATAAAAGAAGATCTCTAACTGGCTACATTTTTCTATATGGTCCTAACCTAATAAGCTGGAAGGCTACACTTCAATCTATTGTTGCATTATCAACTACTGAAGCAGAATACATAGCTCTAACAGAAGCACTCAAGGAGGCACTATGGATCAAAGGATTGATGAAAGACTTTGGAATAGATTAGTCATCGGTAAAGATCTATTGTGATAACCAAAGCACTATTCATCTCTCCAAAAACCCACAATTCTACAACAGAACAAAAACACAAAAATATCAAATTCCATTTTGCATGCAACATGATAGAAAAAGGGGAAATAGAAGTTGTAAAGGTTCATACCTCTGAAAATGTAGCTAACATGCTCATGAAGCCGGTTCCAAAACTCAAACTCCTCAAGTGTTTGGACATGATTGGCTTCGAGTTACTTGAGACAAAATAGAAATATGAGATTGGAAGGGCAGTTGTGTGCATGCATTGGAATCGGAATCAAGGTGGAGATTTGAAGGTGGAGATTCAAAAAATAATGCCTAATGGCTAGTTTTTTCTTTTTTATTTTCTTTTACTCCTTAGTTTAAACAAATCTATGTAATTGACTCTTACAATAGAGTTCAGAAATGAGCAAGGAGTAGTAGAGAAAAAAAAATGTAAAGAGACTTAACTTCCCATTAACTCCAAGAAAGTTGTCCTTCCCATGGATGTAGACTGATCTTAGCTGAACCACATTAAACTCTGTCTTAGTTCTTCTTCTTCTTTGATTTCTTCAAGCTGCAAGGTTTACTCTTCAATTTTAGTTTCTGTCCGACATCAGAGAACCGCCCGTTGTTGACTCGCGTTCTGTCGCATGCCATTGCACCTCTTGAAATCGGCCACAAACCGCCAAATTAAAATATTTGCTTACTCTCTTGGCTTTTTAAGCCCTGGACTCAAGATCTTCCCTTTGAATTTTTTTTTTTTTTTTTTTTTTCTTTCATACTCCTCCGGATGGCTTAAATGAATTCCTAAAACTCCACACCGAAATTTCCCTTTTAATACAGACTTAAATAAATCATTCCCTTTTTATATTACTTGAATAAATTCCCCTTTAACTATTTCAATTTCATGGAAAATAGGGACTTTATTTGGTGGATGGGTATTTGAATTTAAATCTCAATTTTGCTGATCTTATGAATTTTAACACTTATTTTCAGTTCCTAGTGAGACAATTTTAGCTTTTTGATTCCTAGGACGTTAGCCTAATTTAGAGCTCAAACTTCAATCGTTGTTTCCCATGTAAAAATTTTGTAGGAATTGAAAAGCAAAGAAAATGTGAAAGATCATAACAGAAAGGTTACTAGGGCGAAAATCAAGAAAGCTTTGACAACAATAAATGAAAAAATAAGTATTTTGTCTTTGAATTTGTGCTTTCTGGTTATGAGAGGGAGATATGAAGAAGTATTTGGAGATGAGAAAATTGATCACTCCAGCGTTAGCCAC
AACCCCCATCACTTGGAAGCAGGGAGGAACAATCAAACTGATTTGGACAAGGAGATTTTTAATATTCTAAATCATTTTATTTTTATTCAAAAGGATTTACTTTTTAAATAAAACTTTAATCTATTTGTCACATAATCAAACAAAGTGGCATGTTAGATAATCTATGTCAACAAATAACTGTTACATCACTATTCTGTTAGTTTAAAGTTAATGAAAAACCATTTTGAATCATTTTATGAATGTGGAGATTAAAATGTTTTTTTTTTTTTTTTTTTAACTTTAGTGATCAAATAGACAACAATCCCAAACCTCGAAACCAAAAGTGCATTTTGCTTATAAATGTTATCGCTTTAGTTAAATGGGGGAAAAAGAATAATTGCTTATATAAATACACATATATTTAAGGGCTATTTTTAAAGATAGCAAAATGAACCAAAATATTAACAGATATAACAAAATGTCTGTCACACCCCGCCCCAATTACCCTACCTGAACCTAAGACAGGTTGTAACGACAGTGGACCCCAAATGTGCAATGGACCTTACCGTCAAACTCTCCTTGTACTTGAAACTTAACTTGAATTAAATATCTTGTTTACCTGCATGTTGCAAGAAACTGATCAGTTGGGCAGCTTTTGATACTTACTTCCTACTAAAACTCCTAGTGGGGAAAAATAAAAATGAAAGGTTGAGCTACAAAGCCCAGTGATTTGTAAGTTTATAGGAAAATAATTTGAGAAAATCATCATGATTTTAAATAGAAATTCAAGTTTCGTAAATTGATTTCAAACATGAGCAGTTCAACAACTTGGGAGAATTATCACCTATAAAATATCATAAATATACTATTGCAAACGTAATCATAAACCGCAAATCGATATAAAACATGAACAAATCAAAAACTTGAGAATCATCCCATAAATAAAATCATGAGCATTTTCTTGCAAAACATGAGCATGAACTTGTAAAAACCTTCACTAAATCTGTACTTGAACACGGTCATACTATGTACGGAGGAATTCGACGCATATAGCAGGGCAACATGGTCCTACTTTACATTGAGCAAGCTTCAGGTGTATAGTAGAGCAAACATGGTCAAACTACTCACTGAGTACAAAGCAAACATGGTTATATGTGCATGCAGAGTAGCCCCCAAAAGGCAAAGTACCTCAACATACGACCCAAAACCTATGTGTGTGCCCAAATGGGTCCAAAAGACCTCATGTAATGAAGGAATCGAATCCTGGTGGAAGTGTGTTATCATGTTCTTTGTTTTCTTGAAAGAATTAGAAATGCCCAAAACTAGAAGAAAAAGGTTAAGCAACATACCTTTGTTGATCCACAACTGTAGCAAAATCTACTCCTCAAACCAGTAGTCTTCTCTTTGAAGTTGTCCTTTTCCACGAGATTTTCCTTGTTATGACTCTGGCAAAATTTTGGCTTGTGGGAACCGCTCTCAAAAGAATATTGGGAATGTGTAGAATGATTTTTCTAGAACTTTTCTTAGGAAAATTGAGAGAATTTCTAGAAGACGAGCTTTAGAGTAATTGCTGAGATTATTCTTATAAAGAATGTGTTCTATGAGGATCTCCCAAGGGCCCTATTGGGATAATCCCTATGCAGAGACTTTAAATTAAATATTTAATAATATCCAATATTAATAAACATCTACTCAAATAATATCTTATATTAAATGGTTTAAAATATTTAAATCATATTTAAATATTTATCTTCTGCAAAATCTTTAATTTGAATCTCATCCAATTTAAATTAACTAGGGTCATTTTCAAATATAGAAAAAATGAGTCAAACTATTTACCAATATAGAAAAATTTCACCGTCTATCAAACCACTATAGACTTCTATCACCCAAGCGATAGAAATCTTCCTACCGCGGTCTATTATTGATAGAGAGTGACATTTTTCTATATGTGTAAATAGTTTGATATTTTTTCTATTTATAATAATTTCTCAATTAATTAATTAATTTTTCCTCTATTT
ATTTATATGGATTTCATCGAAATAAATAATAAATTATTTGAATCATATTCAAATAAATTAATTCTCCATATTATAAGGTTATAATTTGAATCTTATTCAAAGTTGGGAAATTATCTTAAATGGCAAAACTGCTACAAATATTTACAACTAATAGCAAAATACACAAGATAGACCGTGATATATCTCAATAGACTATTATCTGTGTCTATCGTGATACAGATAGTAGTCTATCACGATCTATCATAGTCTATCATAGATAGACAGTGAAATTAATTAAACTCTTTCATTAACTTAATTAATATTCATTAACCGTGGGTCACTCCACTAAAGACCCACTGTTGTACTCTTCGCTCTATAGATATATTATCGTTTTACCCCTATAATTACTTTTTTCTCCTTAAGTCTCACTAATCCTCTAATGAACAACCTGTTTATGGTCATACCAAAACAAATCCCTTTCTGGTCAGTGAGTGAGGATGAGCTCCTTGTTCAAGACCTGAATTCAACACTTAAGAGAGTCATTTATCTTCTTGATGAGAAGGAGTGAACTTCATCTTGTGTAATTATGTTCCCAACTCTCTACTCGGACAAATCCCTAAAATGGTAGGCTTATTAAGTCGACGACATAAGTCACTCTCACTTATACAAATGAAAGGATTGTCCTCATAGACAGGAGTCCATAACTCAGTTAGGGTCGAGATCAAGTTACCTACAGTTATTCTTAATGAGAAGTTAATCTTTTCAATCAATAGTGTTATAAAGAGAGATTAATCATTTTATGGTCCAGTCTTATATAAACTCTTTATACAGGATACACTCACTCACATGTCTCCACATGAACGATTTGGATCAAATCATTTGTAACACTTACAACTCTTGTAACAATTGCTCGTATCCATATACACCCAACTTTATCTATACACTATAGACCTTTAGGTTATTTCTTGAACATGATTCACCTGTAGGTCAACTACATACATGTTTAAGACTTCATGCAATAACCTTGAATTTTTCCAGTTAATGGATAAATCAATATTGTATAATTAATAATTAAATTAATCAACTAATAACTTACGAGGCTTTAGGACATAGATCCTAACATGTAAATCATGACTATACAATGGAAACACATTCACTTCAATGTAGTGCTGATATATAAAACTTGCCGTTTCTGAAATATTTGCTTTCATATAATTCGTAAAACATTCCAAAGGTCATAATCTCTTTTTTTTTTTTTTCTCTTTTTTGAACAATATTTCATAAATCCAGAATTCTCACATGAAACACATCTTTTCTAAATGCTTTAGAACATAAAACCTTGCTAAAAGCATGACTATACTTGAAAATCTTGAATCCATAAACGTTTTTTTTTTTTAAATATTTTAATTCTTCCAAAACCATACATCATAGTTTTCATGAAATCGCATAATAAGTTAACTAATAATAAAATATTTCATAAAATAAACTACTCATAGAGTTATTATCGACCATGCTTCGAATTGATCGGGAATTTAAACTTCGTCCGTCCTGATATTCCAATACCCACTTGGATTAGGTTAGGCAAATATTCCTAAAAATAATTATAAGTTCACCTTTAATAGCTAGTTTGAAATATTTTATCCAACAATTCCAATTATACTGTCTCTCTCAATTTTTCTTATCTTGGATACATTTTGTCGTTTTATTCCTCCTATTGCATTCCACTCTTTACTCAAACCTCTCTCCCTTTTAGACTTCCCAAAACACACTTTTTCTCTCTTAAACACACTTAAATAGCCAACCAACTATACCAATCCAACACTATTTTGTTTCACACATACACACACATATATTCTTTTATTCATATCTTCTTCTTCTTCTGTCTTCTACTCCCCGTTCTTTCTATTTTATTTCTTTTCATCTTCTTCACTCACTCCCACTTCATCTTCTTCCTCCTACCTTAATTTATCTTTCTCTCTAATCTTCTTTATCTCTCTACCCTTTAGATTCTCTTAATTTAA
ACTCAAAACTATGTTGGGTATATATCACTCCAACAAATTAGGCCTTTATATATCACTAATGGATCTTTAATGTCAAACCGAGTGAAAAAAGAAAAATGGATCTTTAATGTCGATTTTCGATTGACATTAAAAGTACATTTAAATGAGGCTACAATGTCGATTTGAAATCGACATTAAAGATCCTCCCTAAATTTTTCTATATCACTTTTCTTTTTGTGAAACCCTGTCCCTCTCTCTCACAACTCCATTTCTCTCTCAAACTTTACTATCTTCCTTTAGCCAATTCTTCTTCTCCACGCCATCCCCTTTTGTCTTTTGTCCGCGCCGTCCGTCCTTGTCGTTCGAGTCGTCGTTCAAGTTTCGTTCTCGTCGTCCAACCTGTCCAGCCCGTTGTTCGAAGTCTGTCGTAGTCCGACATTCAAGGTTAGTCCAGCCAGTCATTTGAGGTCTATCGCAGCCTGTCTTAAATCTGTTCAGCGTGTCTTCTAGATTTGAGAGATTCGTCTAGCCCATGTCATATATGTGAGTTTCACCTTAGATTATCGTTCAGCCTATTGCAGCCCGTCGTTCGAGGTCCATCCGATTTCTTTAGGGTCACTTTCTCAAAACCCAAAAAATCTTCTCTACTTCCTTGCTTTATCTCTTCTTAGTCCGTTCCATGGCTGCTTCTGCTTCTTCTCTTTTCATGAAAATCCAAATCTAGAGGGATGGCACAGTGGGTTCACTCCCTTTTCCCTCTTTGATCTTTCCCATCACTTTGAGTTTTGCCTTCGATTTCATCTACTCTTTGTTTCTTTGTCTAGGTATGATTCTATTTACTGTTTGGTCTTTGGAAAACGTTAGTGGCACTGTTTATTGGGGAATTCGAGTAGTTTCTTTTAATAGGTCAATGTGATTCTTGTCGTTGTGTTTACTTGAGGGAAAATGAAGTCAACCTTTTTGCCTAATTAGAGTCGTGGAGAAATTGTGATGTCTCTAGGGTTATTTTGATAAATAATTTTATTTTAATAAATAATAACAAAAATAGGGTAGCGAAATCGTGGATGGAGTCCACGATTTCCACATGACATGTAAATCATGGACCCGGTTCACGATCTACTCTCCACATGGCATGCAAATCTTGGACTCGGTCCATGATTTTTCCTCCATATGGCATGCAAATTGTGGACCTGATCAACAATTTGCCAATTTGTTTTCTTTTTTCTTTTTTCTCTTCTTCGTAACTTCCTCTCTTTCCCTTTTTTTTTCCTTTTCTTCTTCTTCTTCTCCTGCATCATCATCATCATCATCATCTTCTTCTTCTTCTTCTTCTGGTCCAAAATCTGACGTGACCAACTCTGGTGAGCGAGAGTGAGAGATAAGACCGAGTGAGTGAGGAAGGAAGTGAGACGAGCGAGACGAGCGAGAGACGAAAGAGACGAAAGCAAGAGCGAGAGCGAGACGTCTTCTTCTTCTTCTCGTGCTCTCGCCTTCGTCTCGGTTGTCTTTGGCAAGAGCAATAGTGCTGTGCAATTTTTCTTAGGCTAAACTTAGTCAGTCACTTCTAGAAAGGCTAGAAGAAAGTCCTATTGATCGAATAGATTAATATGCTTAGATGGAAGGCATCTTAGACATGCAAACCTATTAGCATTAAGTTCAAAGGCAATTGCTAATAGGAAAAGCCTAATTAAGTGTTCGGTCCGAATCTCTCCTAAGTTGGTTAGATGAATGCAATCGGATTGGAATAGCCAAATCAATTTACAACCCTAATTCAACCTTTGTTACTTGGCGATCTATAAACCTAACAATTACTTATTAAGCATAGATCTTTAAATACCAATTAAAATGCTTGACTAATTTGTCAGATAATCTCAAGCGAATTTAATTGAAACATTAAACAAGGTGATAATTCATTAACAAGAAAATTGCAATACAAAAATTGTTCAAAACCTCATGAGATACAAACTAAGAATTGAACCTCAAACCCGAGCTAGATGATAGTACCTTAGCCACGATACATG
CTTGTGCTCACCCAAAAATTGTTCAAAGAAATTCTCCTCAAGATGAAAAGAAGGAAAAATGCAGGGAAAAACTCCAAATACAGACTTTGGACTTGTGCTGCTCTTTTTTTCGAATGGTTGGTGGCTAGGGCATTTTATGCCAAAAATTGGACTGGCGCCGCAATGCTATAGGGCAGTGCCACTGTCACACTCTACTTTTTAAACCTATGCACTTAAGAACTTATAAGGAAAAGAAGTAAGACAAGAGAGATGGAATTAGATCACAAGTTTGTGGAGTTGGCCGAGAAATGTTGCCAAAATAGAAGTAAGTATGCGTTGAACTTAAGGGAAAAGAGAGGTTTGTGCTTAAGTAGGGTTAAGAGTAAGCGGTAGTTAAGTTTGGATTAGAAGGTTTAGATTGGAAGGTTGCGTTGAGTGATTTCTGGATCTTGGAATTAAAGTAATTCTTGGAGTTAAAAAGTCTTCTAAGTAGAATAACCAGACTAGGAGAACACAACTAAGTTTTTTAAGCTTGGAAGTATGATCTAAGGAAATCTCACCAAGTGGCGAAAGAGTTAAAAAGCATAGCCGAGAGTTAGTTGAGTTGGACGCATATTTGGTAAGTAAGAGACGATGTTGGACGCATGGTCAAGACTAAGAAAACACATGATTAGGGTAACCATTGTGGAAAAGAAAATGGGTAAGTGTTCAAGTATTGTAAGGAAACCCAAGAGGTCGGGGAAAAGAGGTAAACGCATAGGAGGTAATTAAGGAAGGAAATGTAAGCTTGGCTTAAGTTGGACATACGAGTAAGAGGACTAAAGTTGAGTAGTGGTGCGTTGGAAGGAAATCTAAGTCTCGAAAAGAAGGATAAAACGCATAAGGTGAAATCAAGGAAAGATGCTATTATAATTTAAGGTAAACGCATAACTTCCATCAAGGGAGATTTTGTTGAAGAATTGGATAATTTAGAAGGTTGAGAAGTAAGGCTAAAGGAAGGAGGCTGAAGATGGACGTATGCTAGGATTTATCGTTGAGAAAATGGCTAAGCATAAGGAAGAGGTTAGAAGCATAATCTTGGATTACAACTCAAGAACGCATAGAGGGACAATCAAGGGATCCTTAAGGATGGAAGGAAGGTCATGGACGCATAATAAAGTAGAGATAAACGCATGAAGTACAAACGCATGAAGGAAGATCTTGAGATGATTTTGGTTAACAGAATAACAAGCATAGTCGGATGTTTATTACTCAAGAGAAGGGGTTGTAGCTAAGATATGGATATAAGGAAAGAAGTAATGGACGTATGATAGTATGCGGTTAAGGCACATACGCATGGTAGTATGCGAAGGAGAAAAACCTGGTCAAAAGGAGGAAGGCCGGGAGGAAGAAAGAGAATAAAGAAGCTAAAACTTAAAAAGGAGAAGTAGATGGACGCATAGGGGCACTAAGTTGTATGCATAGTTCGTATCTCAGAACAAGTCAAATGCATAAGATAAGTTTATAAAGGTTACGTGGTTTCATTTAGACTTAACACTTTATAAGGAGTAGGTGTCAGGTTGCATGTCAGATGTGTCATACAGGGTGACAGGTGGCAAACATGTGGCAAAAAGAGAAGAGAAGTTGGATTCATCTCAACACTATAAGTAGGACCTTGAGGGTAAAGGAGCGGGAGATGATCAAAAAGTCTGAGTTCTGGATCGAAAGTAGAGAGAAGGTAGAAAAGGGTAATCCTTGCGTCCGTAGTGGAAGTCTATAGCCAAGAGATGAGGTGTTCTTAACAACCTAACTATCTTGCGTTCAACAAGTTTTTAGCGTTTAAGAAATCAGCTTGCGTTCAAAGATAGACTATGTGTTCAAAGTGAACGTCGCATTCAAGATCAACCCTGCGTTTAAGATAGAACCTTGCGTTCAAGGAGACCTGTGCTTCCAAGGGGTGAAGCCAGAAGAGTTGGTTGGAAGGTTGCACCTTGCGTTCAAGGTGTAGGGCTGCGTCCAACCGAGCTGACAAAAGGGAATAA
GTTGCCTTGCGTTTCAGACTTCAAAACTTTGCGTCCAAGAAGCTCGGGAAAGAGAGTTAGAAGGAAGAAAAGGGAACTTCCCTTGAGCAAGTATTATGTGGGTAACACTAATATTTTCGAAATATGAATGCGTTCTCTGTGATTCATATCCCTGGTTTTGGAATATGAATAAGCATTCATGAGTTTTCAACGTATAGAGCATGCGTTTATATTTAAAAAGCATTTGAGCATGCTAGTTGTTTGAAATGAGTTTATTGAAAAACATGCTGTTTGATATTATGATTTTTATGGAAGATTTGGAACTGAGTTTATTGAAAAAGCATGTTGTTTGATATTATATGATTTTTATGGAAGATTTGGAAATGAGTTTATTGAAAAAGCATGCTGTTTGATATTATATGATTTCTATGGAAGATTTGGAATTGAGTTTATTGAAAAGTATGCTGTTTGATATAATATGATTTTATGAAAAATTTGGAAAAGTTTAACTTATGAATTTATGATATGATGTTTCCAAAACATGTTGTATATTTTATAAAGTCTTGAAATGGCACCCAAAGCTCAAGGACTTGGAGAAGGTCAATGGGTAGCTGGAATATAGAAAAGGTACCCAATGCTGAGGGACTCAAAGAAGGCAAATGGGTAACCTGGATATAGAAATGGGACCCTAATGCTCAAGGACTTGAAGAAGGCATATGGGCAGCCCAGATTATAATACCTATGTGCACATAGAGGAATATAAGACGTTGATGTGTTGAGATTACTCCACACAGCTAAGAATAATCGACGTTGAGAAGCGTAGAGCTTGGCTCCGCCTCAACTAAGGAAAATGATATGAAATGTTTTCGACGTTGAGTAGCGTAGAGCACCACTCCGCCTCAACTAAGGAGAATGATTTGAAACAAACAGAATTTCTGAGTATTTTTCTGAAATAAGCATGTATGATTTAACAATAGCATTTGAGCGTGTTTTGCAAGAATATGATTTAAATCTTTATTTTATATGCATTATTGACAATGTTTATATTATGATCAGTTCTATAGTTAAAGTAAGATATTTCCAAAACCGTTACTGGACTATTAGCTCATCCTTTCATATGTTTTCCATTTTTCAGGTAGCATGACAGTTCCCATTGCCGGTCGAAGTCGAGGTTTGTCAAGTATTAATGGTATCATTGTCCTAGAAAGTGTGGTCAAATGGAAAGCTAAAGTAAGTTCGACAGTGGAAGTCGACCGGGAAGTCGATGACTGTTGTCCTCACGCTCCGTCTTGGGCTACGGGAGGGTAGTTCGAGATGGGGTGTGATAGCCGCGATGCTGCGACGCGCATGTGGTACCTCGTCTGTCACCAATCGTAGCCGCGCGTTTTAGTGTAGCACCACGACGCTGTCTCCAGATCAAAATCGGGTGCCTTCTGGAAAAAGTGTAGTGTTGGCGTCGTGCTAGCGTCGCGGCGCCAGGCTGCTTTGTTCACTTCCGCCTCCATTTCGTTTGATTTGCTTGCTAATAATCTCACTTAAACCTGAAGTGCCTATAAACCGTCCCAAGCGCTCCATATGCTCATAACCTGTCTAAATCAAGAAATAAATCGGTAAAAAGGTAAGAACGATCCCGAATTAAGCGAAGTGAGATAGCACAATTTGAGTGCTATCAATGAACATATGGAAAATCTGTATATTTCTACATCAATTGTGATATTTTAGCCAATATATACATGTTTAATTGTTTGGGTTCTTTCATTAGAGAAAAAAGAGGAGTTTTGTTGAGGAAAACCTTTCAAGTGGCTTTATAGAATTGAAGAAAAAAAGAAAAGAAATTTTAGGGTGGAGATGTCTTGACAAGGTTCGGCCTATGTTATAGGTCAATATTCTTTTGGTCACACTTAAGAGTTTTATTATTTATACTTTTGTTACCACACTCTCACCCATTGAATTCCCCAAATTTGCAAACATTGAGTTAATTCTTTACAAATTTTGATATTGCCTTAAAATCTTTTTCATAA
CTTAGGCACATACCTAATCAAGAGGATGCTAGTAACTTGTTAGAGAATCTAAATGTAAGGCCTCCTATATTAGGATAGATTTTTGGGTTATAAATAGTGGGAAAGAGTATGCCTAGGGGTGTGAAGAATTTCATGTGGGATTTCCTAATTGGGAATTTGGGAGAGAATTCTCAGCCCTCTAAAATGAGCTGAGGTATATTGTAATTTCTTCATTGATATTGCAATATAATTCTATCTTTCAGTTTTCTTGTTTGTTTTTTGTGTTCTTGAGTTCTCTTGTTAGGAGGTATCCTAACACTAAACCAATACTATGTTTCTCTTTTAAATTTTGTGTTGATTGTTGTGTGTCGGAGTATGTGCCTTGCCATGAAAGAGTTACTTCAAAGTTGAGTTGGGAATAGAGCAGCTTGGTTAGGAGTTTTAGTGGTAGATCTTTTATTGGATAATCCATATGTACTATGGGGTAGTTCACAATCTTGTACAACTTGGCCGACTATATTGAATATTTATGCTACAACAAATATTGGGAATGTGAATGAGTTCATTTGCACAATCTTTTGTATTATTTTGTATATTCAAACATTGTATTTGGTTTGTTGTAATCCATATTCTAACATCCACAAATGTGAAAATTGAGTTGTGTATACAAAAGCAATTGAAATTTCATTTTCTGAATGCACAAGTGAGAGGATGCACCTCTGCGAATAACCAATTTTTATTTTTTAATTTTTTTTTTATTTTTGAATAACTAAATGATGACAAGTATTTCCAAACACAGAAATAGATATTCGATGTCAAGAAAATGTTAGACCAACATCAAAGTGACTAACATCAAAAAAGGTGTTATTAAAGGTGTTCAATGTCGGTTGAAAACTGACATTAATGTCATCTACAATGTCAAGTGAAAACCGACATCAAAGGTGACCGACATTGAACACCTTTAACAATACCTTCTTTAATGTCGGTCACTTGACCAACATCGTACTCCTATAATGTCGGTTTTCAACTGACATTTTAGCCTAATTTTATTGTAGCGTATGGTGAAAGAAAGTCAAAGTTTGGTGAATATTTGAAACATAAAAATGGATTTAATTGGGTAGATTAAATTCATTGTTTATAAACTTTCAAATTGGATTGAAACTTCAAAATAAAATGATGCCTCATGTACAAATTTAACATGTATGAAAGTTAATTTATAGTTTCTTCAATTTGAACGAGTTATGTTTGATTTTGTCAATTAACAATTAAATTGAGTTTAATTTGACAAATGTTTCAAATTTTATATTTATTCGATTTTATTTTGTACTTGGAAGAGTTACTAAGTTGCTAGCCTATAAATGGCAACTTGTTCCTTCATTTGTAGCATCCCACTTCAAATTGAAGAATTCTCTCTCTTCTTTCTTTCTTTTGGTCTTGAGTTAAGTTAAGAGTAAGTATCCAAGATTTCTGCCAGATTTTAGTAGTTCGAGGTTGCAGTTTTACCGGAGTTAGTCGTTGTATTCTGTTAAGACGATTGTTGCAATCTAGTTGCATCCGGTGTAGAGCGAATAATTGTCTTTAGGACAATGCTTATCAGAAGCATGCCTCAACTATCTATTGACGCCTCCAGATTATCTTGAGTGTGCTAGTATATTCGGTTCACATATATTATTTTTTCCCCAACAATCTGAAGACAATTAATAATATCTTAACTAATGGCTACAAACTCCTCTACCAATTCTTCTACTTCTGTCGTGACGAGATCACCCATTGTGAAACCCCATGTTGAAAAATCGGAGAAGTACAAAGGAGACAACTTCAAGAGATGACAACAGAAGATGATCTTCTACCTCACCTCGTTGAATCTTGCTCATATTTTGAAGGAAGAATGCCTAGTTACCCCATCAGATGTTGTAACTCTCAAAACGAAAGCAGCAAAGCAAGTGTAGATGCATTCAGACTTCCTATGTTGTAATTACATACTTAGTGGTCTTGAGGACATTTTGTATAATGTCCATTGC
AATGCCTATAATACATCAAAACTATTGTAGGAGGCAATAGACAAGTACAAGCTAGAAAATGTTGGTACTAAGAAATTCCTTGTGGGAAAATTCTTAGATTACAAGATGATTGATACCAAGTTGGCAGTCAATCAGATAGAAGAATTACAAATTATCATCAGTTATTTGCAAAGTGAAGGATTGGACATCATCGAGCCATTCCAAGTTGTTAATGTGATTGAGAAGTTTCCTCCTTCCTAGAAGGACTTCAAATGTTATCACAAGCACAAGTGAAAGGAGTTATCCATGGAGAATCTCACAATAAAACTTTGTATATAGAAGAGGATAATAGAAAAGAAGATTCTCCACTGAAAGTTGAAGGCAAAGCTCATATTGCTGAAGCTTTAAAACATAAATCCAAGAAGGAGAATTTCAAGAAGAAAACTGTAAACCATGAGTTGAAGAAAGATACTAGCAAGCGCATTCGAGGAAACTGTTGGGTTTGTGGTACGTGGCCATACCGCTGCAATGTGCAAACACAAGAAGGGACATAGTTCCAACAATCAGGCTAATGTTGTGGAAAATGACGATCCCGTTCCCATTATTTCTAAGATAAATATGGCTTCAAACATGAAAGATGTCACACCCCCTCCCAGATTACCCACGTAACCTGAAAGGGAATGTGAAGACAATATAACATCACCCCTTTTGTGACGCATATTGTCTCTCTTACATTCCTCCCTGGGCATACATTTAATCATATACATACATAGAACCCAGTCATACTACACATGCTTATAACATAATATTATTCCCTGTATAGAGTACACTTTTATTAGATAACATAATTTAACTTCCATATCCATCACACAATGTACAAAACTCCACTGCAATTCCCACTCAAAACTACAAGGTCTACATGCCATCTACATAAAGTCATATACAACGTACAACTCTCAGACTTGACTAAATTTTGAGAACTCGAGACTTAGGTAGGGTGAGCTGAAGTAGGCTAAGCACCGGGAACGTCCTTGCTACCTGCGGGGAGAAAAGGAAAACATTGGAAAACATAAGCTAACAGCCCAGTGAGTGACAAATCTTTGTAAAACGAGCATTCATTGAAAATAGGCATGTTACGTATAACAAATAAGTAAGAATTTAACATATTGTTCAATTTAAACATACTTTCCACAAAGTCTATTAACCTCATAAATCAAATCTCAAATTATACATGAGTGTTTTCCAAGGAAGTTAACAATAACAATTTAGTTTTTCAACTTTTCCCTTAGTCGAGGCGGAGCAATGCTTCAACGCACTCAACGTCGAAAACAACCTTAGTTGGGGTGGAGCAAAGCTCAACACACTCAACGTCGAAATCACCCTTGGTTCAATGCGGAGCAAACTCAACGCACCAGCCAAATTAACCTTTATGTGCACATAGGAATATGACGGGTTGCCCAAGTACCTTCTCAAGTCCTTCAGTACTGGGTCCCGGGGAAACTCACGTGCCTTCTCGTGTCCTTGAGCACTGAGCCTCCTCTAAAACGTCTTAGAAATCCGTTCACATCACAGTTCACATGAAAATCCTTTCATAGAAATCATACAAATCTTTATTCAATAACATTTCATAAAAATGCTTTCTATCAACTTTAGAAACATATAACATGCCTTGTGAGTAATACCATCAAATATTATGCACATTAAACAGTTCATTGTAAAATCATACTTTATAAGAACGTTTTTAATAAAATAGAATTGAAGAATAAAATAGTCACTCACAACTCTTTCTAGGCTTTACGTGGATTATACGCTTTCCCTATTTCTTCTTGGCCTGCAATAGACATTTACCTTGGTTACTTAACATTAAACCTTCCTCCTTTTCTAAACTAACATTATAAGCGAGAATCCTGGCGATACCAACTAACAGAAGCTGTTACTAGCGATACTTGATTGAGCGACACTAGCGTTACTTTACCTTAGCAGCGACACTCAACGATATCTAGTGAGATTTGTGAGACGAA
AAACTCCAGCCGTCTCACTATCAGTGCCCAGACAGTGTGCTGATACTTGATTCTTAGCGGCACTCTGCACTGTAATTTCTTACTTTCCTCTTGTTCTTACTTCAATCTATTAAGAGTCATAATTTCATACGGATTTAGGACTTCTTCTCTTCTACAAACTTATAGAGAATTGAGTTCTATAGCTTACCTCAAAGTTTCAAGGCTTAACTTTTCCTCGTTCTCTCGGTATTTCGAAATCACCTCATGCTGTTCCGGTTTGCTTATCAATCTTGTAAGTGCTGCTTTCCCTAAAGATTCACATAGTTAATCTTCAATTTCAGCATTACCATCCTTCACAAAAGTTGTTAGGAATTATGTTAACTTACTAACCTCCAAATTTCAGCTCTAAACTCGGCTTCCTCTATATGTTCTACTCTGAATATCAACCACTGCTCAATCTGTGCGACTTCAATTACTCCTCTGTTCCAGCTCCGTTTTCAACATCAATATGAGTAGAATTTCGTCCAAAATTTGCAAGAATACCTTCCTCAATCTTCTTCTGAAGTTTGTTTGAAATGTGGAACCGACTTAAGCTCTCGAAATGTCTTAAAAATGATTAAACATTTAATTAATCCAACTATATATACTACCGACTTAGCCTTTACAAACACCAGGATTTATGTTGCCGACTTACCCTTCACCCGCAAACTAGCGCAGGGTTCTTATCTCAACGCGAGGCACTCATGTTAACTTCTCCTCCAACGCCTCTAGCAACTGACCACACCCAACGCATGGTGTCGTCCTCAATGCCTCCTATTTTTGCTGCATACATTCTCGTCATACAGCACTATTTTCGTCGCGTCAAGACCACTCTGTCTTCGACGCGAAGTGACTTCAACGCATAACTGTCCTCAACGCAGAGTGACTTAAACGCATGGTTCAAGCACATTTCTTCCCCCGCCAATATCACCTATCCTTCAACGCGGGTAGCTACTTCAGCTCACTACCCTCCATGAGTCAACCAACAACGCATGGACTTGGCCAACACTTAGCCAAAATTTCACTTTCCAAAAATTCTTGCTCTCGCTCACAACCACTTCCCCCTTTGAAACGTATAGCCACAATCGCATTGCCGCACCCCCAACGCTTGGTAATGCAAGTCAGTGCTTACAAAAGATTGGTGGATTGATACCAGTGTTACTAGGAATATTTGAGAAGCTAAATGACAACGATAAGCTGTACATGGGAAATGCTTCAATTGCCTTTGTGGAAGGCAAAGTGAGGATCCTGCTGCAATGGACTTTTGGAAAAAGACTTACCCTCAATGAAGTGCAACATGTGCCAAATATTCGAAATATGGGTTCAAGCTTGTTTTTGAATCCGATAAATTTATGTTGGGATTTGTGTCCTAAAGCCTCGTAATTGATTAGTTAATTAATTTAACTATTAATTATTAATTTTGCAATATTAATTTATCCATTAACGAAAAAATCCAAGGTTATTGCATGAAACCTTTAAACATGTATGTAGTTGACATACGGATGGATCATGTTCAAGAAATAACCTAAAGATCTATAGTATGTGGATAAGGTTGGGTGCTTTATCCTGTTGACACTATGGATACAACTCACTTAGTAATAGTTTCAAATGGTTTGATCCAAACCATTCATGTGGAGACATGTGAGTGGGGATATCCTATACAAAGAGTTTGTATAAGACTGAACCACAAAATATTTAATCTCTCTTTGTAAATCCTTTAATTGATGAGATTAATATTTCATAGGATGATCATGTGCGACTCGATCTTAATCTTGGGTGACTTATGAACTCCTTTCTATGAGGTTGATCCTTTGATTTGCATGGGTGAGCGTGGCCTGAGTTGCCGACTCAATAAGCCTACCATTTTGAGGATTTGACTAAATAGGGAGTTGAGAACATAATTTCACAAGATAAAATTCACTTCTTTCTAGATAGGGTAAGTAGATAAATAGTTCCCTTAAGTGCTGATTTCGGGTTTTGAA
CAAGGGACCTCACCCTCTCACTGGCCCAAAAGGGACTTAGTTTATGGTTGGACCATAAACAAATTGTTCGTTAGAAAAATCAGTGGTACTTAAGGATAAAAGGTAATTATAGGGGTAGAATGGACATTTGACCTAGATGTAACTATGACCAACTCGTGAAGGATTAACTTGCTTGATTATATCCATGGACACAACTTGTCCTACACTGCATAAGAGTTCAACTATAGGTCTATAGTGGTTTGCCATATATTTAATGAATGAAGATTAATTTAATTATAGAGTTTAATTAATTAATTTTGGGTCATTAGAGCTTATAATCTGTAGGTCCATAAGGTCCCTTTGCTAGCTCACCACGGACCAACAAGGATCAACTATGTTGAAAGAAAATTTTGAAATGTTATGAGGGAAAATATTAATTGTATTGAATACAATTAATTACATAATTTATTATAAAGTTTATTTGAATAAGATTCAAACTATAACTTTATTATATGGAGAATTAATTTATTTGAATATGATTTAAATAATTAATTATTTATTTGGATGAAATCCATATAAATAAATAGATGGAGAATTAATTAATTAATTTGAATTGGATTCAAATAATTAATTATTTAATTTGGATGATATTCAAATTAAAGATTCTACAGAAAATAAATATTTAAACAAGATTTAAATATTTTAAACCATTTAATATGAGATATTATTTGGATAAATGTTTATTAATATTGGATATTATTAAATATCTCTGCAGAGGGTTATCCTAATAGAATTAGGAGTAACCTATAGAGTTCTATAAATAGGGCCCCCATAGAATAGATACACAATCTCCACAAGAATAATCTCTACGATTACTCTCAAAGTCTTTCTCTAGAAATTCTCTCAATTTTCTTATAAAACTACTAAGAAAATTCTTTTTCCCGTTCCCTCTATTCTTTCAAGAAGTGGTTCCCACAAATCGGATCCTTGCCAGAGTCATAGCGTAAGCACTGGCTTATAATACTAAGTGTTGGGAGGAGGCAAGTGAAATTGTGGCTATGCGTTGTTAGAGGAAGAGTAGTCGAGGGTGGAAGAGCAAAAATTTTGGCTAAGTGTTGGCCAAGACTATGAGTTGATGAAAGAAGAGAAGAGTTAGCATTTTTCTTTTAGTGGCCATGCGTTGAAGGAGGAATCACGAAGGGTTGTTGTGAGCTGAAGCAGTCACTCGCGTTGGAGGAGAAGTTGATCATTGGCAGAGGAAGAAACGTACTTGGCTTACCATGCGTTGGTAAGGTTGTCATGCGTTATGGTCCCTATGCGTCGAAGACAGAGTGGTTCCTATGCGTCGAAGACAGAGTGGTCCCTATGCGTGGAAGACAGTGGTGTAATTGGTCGTTGTAAGCGTTGAAGGTAGGAGGCGTTGAGGGCGCCATGTGTTGAGTGAGGATGATTGAGTTGTATGCATCAAAGGTAGGAGGCGTTGAGGGCGTCATGCGCTGAGTGGGGATGATTGCTGAAGGCATTAGAGTAGGTTGCGTTATGGGTGCTTTGTGTTGAAGTAAGAGAAAATTTGGTGCCTTGCGTTGAAGTAAGAATGAGAACCTTGCGCGCCAGCCTGCTTTTGAGGGTTAAGTCGGCAGTATATATATAGGTGCTTTTGAGGGTTAAGTCGGCAGTATATATAGGTAAGTGAATTAAATGTTTAATCATTTTTAGACACTTCGAGAGCTTAAGTTGGTTCCATTTTTCAGACAAACAACAAGGGAAGGTAGAAGAAGGTGTTCTTGTGGATTTTGGGTGGAATCCTACTCGCTTTGGTGTTGAAATCTAAGTTGGAACAGAAGATTGTTTGTGGTCGCACAAACTGGACAGTGTCTGATGTTTGGAGTAGAACGAATCGAGGATATCGAGTCTAGGCTGAAATTTTAAGGTCAGTAAGTTAATATAATTCTTAACAATTCTTGTGATGGATGGTAAAGCTGGAATTGAAGTTAATTATGCGAATTTTTAGGTAAAAGCAA
CGGTGACAGAATTGTTGTGCAATTTGGAGCAACGCGGAGTGATTTTGGAATAACTGGAGAACGAAGGGGAGTTAAGCCTTGAAATTTTGAGGTAATCTATGAAACTCAATTCTTTACAAGTTTGTAGAAGGGAAGAAATCTTAGTTTGGGCTGGAAATGTGTTTTTAGCAGGGGTGAATAAGAAGAACCGAAGCTGCAGCAGTCAGCTTTGTTAAGGAGGAAAGTAAGCTAGAAACGTTTGTGTTGCAGAGGGCTTGCGTTGCACACAAATTCTTCTTGAAATCAAGGTATAACAGAAAGGTTAAGGAGCAGCTTTGTAGCAATAGCATTAGTGTTTAAAGCTTGAGAATGAACCAATGGAAACTTAGCCGCATGATGAAATTTAAAGACATCGTGCATATGTTTAGTTGTTGTTTATGTTAAAGTGAATGGAAAAGAAGAATAGTAAAGAAATATGTTTATTTCAGGCCAAGAGGAGATAGGGAAGACGTATAATCCACTCAAGGCCTAGAAAGAGTTGTGAGTGACACTTTTATTGCTTCAAAACGATTTTGTTGTATATGTTAGTTCAGTTTTCTAATTGAATACGATTTTGTGATCAATCGATTTACAAAGCATACTTTATGTTTCTTAAAAGGTATTGAAAAGTATGTTTATGAAATGTTATTGATTTAAGAGTTGCCCTGTTTTCTTATGGAAGATTTGCATATGAATTGTTTGTATGAAGTATTGCTTTCTTTAAAGGAAAATGTTTTAGAGGAGACTCAGTGCTGAGGGACATGAGAAGGCACGTGAGTTTCCCCGGGACCCAGTATTGAGGGACATACAGAAGGTACTTGGGCAACCCGCCATGTTCCTATGTGCACATAGAGGTTATATGCTGGTGCGTTGAGTTCTGCTCCGCATCAAGCCTTGTTTGTTTTCGACGTTGGGGTGTTGAGTTTTGCTCCGCCTTAACTAAGGAGAAGTATTGCAACTAAACTGATTACTTTATGATCTGCTGGAAAATCATGTATGTTTTGATAAGGGCATTTAAGCATGTTTTATCTAATGAATATGAATTGATTCCATGTTTTACACATGACGTTGAACTTAATTTTTATCCTAATGATTTCCGAACTAAAAGCAGTTTGTTAAACTTAGTCACTCACTAGGCTGCTAGTTCATGTTTTCAATGTTTTCTTTTCCTCCCAGGTAGCAAGGACGTTCCCGGTGCTTAGCCTACTTCAGCTCGCCCTGCTTAAGCTTCGAGTTTCCAGAAACAGTTTATTTCTTGTTGTTGTACGTGTAAATAACTTAACACTGGTGGCATATAGATTTAGTTGTATACAAGTGGGTTGTGCGAGAGTATGTTGTAAAAGTGTACTCTGTACAGGATGGATTGTTTATAGTATGATCACGCGTTAGTCGTTAGTATGGGTGAATTCTATGAAAGTATATGTATGGATTGTTTATACAGGAAGTTATATAAGATGGGGCAATATGCAGGGGGTGATGTTTATTGTCCTCACATCTCCTTTCAGGTTAAATAGGTAATCAAGGAGGGGGTGTGACACATAGCAAGGAAGATCTCTTGGAAAAGAAAGACTACAAGGAGAAGACTATCGGTTCGAGGAGAGATTATGCTGAAATTAAGGATTGACAAAGGTATGTTGCTTAACCTTTTATTAATTATTTGTATTTTTTAATTTTTCAAGAAATCAAAGAACGCGATCACACACTTCCACCGGGATGCAATTTCCTTCAATTCTGACAAAGGGGATATGTGTATGGGAAAATGTTATCTAAGTGATGACATGTTCAAACTGAATGTACTGACACAAATGCCAAAGAATAATAATAATATCAACTCTTCTGCTTATATTGTTGAGTTGTGTGATGTGTGGCATTCTAAATTGAAGCATGTCAATTATCATTCTTTAAATAGAAGGGTAAGTTTAGGACTGTTGTTGAATTTTAACATTGCTAATAGACACAAATGTGAAGTTTGCATTGAATCA
AAGTTTACTAAGAAGTCATTTTCATATGTGAAAAGAAGTAATGAGTTACTTGATTTAATCCACATTGATGTATGTGACATGAAAAGTACTCCCACAAGAGGTGGTAAGAGATATTTTGTCACATTCATTAATGATTGTAGTAAATATTGCTATGTATATTTATTACATAGTAAAGTTGAGACCTTTAATGTTTTAAAAGCATATAAGGCAAAGGTTGAAAATTAACTTGAGAAGAAAGTTAAGGTGCTTCGATCAGATAGAGGTGGTGAATATGAATCATCATCACTTTATAAATTTTTTGATGTTAATGGTATAACCCATCAAATTACAACATCTTAAACACCATAACAAAATGGTGTAGTTGAAAGAAAGAACGAAACTTTAAAGGACATGGTTAATTGCATGTTAAATAGTTTAGGTTTACCTCATAATTTGTGCAAGGGAAGCTTTGCTTACGAAATATTTTATACTATATAGAATTCTACATAAAAATACTAATAAGTCACTTTATGAAGTGTGGAATGAAAAATTGTCAACATTTAAAATGCTGAAAGTATGGGAGTGTTTGGCAAAGGTACAAGTTTCTTTACCAAAAAGGACCAAACTTGGACCTAAGACAATTGATTGTGTCTTTATAGGCTATGCAATGCATAGTGCTGCTTACAGATTCATGGTGATTAAATCAGAAGTATCTTATATTAATAATAATACCACCATGGAATCTATTGAAGTTGAATTCGAGGATATATTTCCTTTAAAGGAAGAAATACATGAAAATAATGATACCAAAAGAAATTATGAGGCTAGCTCTTTGAGAGGTTAAGACGATCAAAATACTGAGTTTGAGCCTAGAAGAAGTAAAAGAATTAAGAAAACTACTTCTTTTGGACCTGATTTCATAATTGATATGATAGAAGGTGCGCCTCAAACTTTTCAAGAAGCTGTATCATCTCTTGATGCACCTTATTGGAAGGAAGCTATCAATAGTGAGCTAGAATCTATTTTATAGAACAATACTGGGTATTGGTTGATTTACCTCTTAGAAGTAAACAAATTGGGAGTAAATGGATTTTTAAGAAAAAGCTAAAGGTTGACGACACTATTGATGAGTATAAAGGCTGCTTAGTAGCTAAAGGCTTTCGCCAAAAAGAAGGGTTAGATTTCTTTGATACCTATTCTCTAGTGATGAGGATTACATCTATAAGGATGTTAATTGCAATAGCAGCTTTACAAAATTTGGAAATACACCAAATGGATGTGAAAAGTGCATTCTTAAATGGAGATTTAGATGAAGAAATTTATATGGAACAACCTAAATGGTTTGTTGTTAAATGACATGAGGACAAAGTTTGTAAACTTCTCAAGTCTCTTTATGGGTTCAAACAAGCACCCAAAAAATTACATGAAAAGTTTGATCACACTATGTTATCAAATGGATATAAGATAAACAAGTGTGATGAATGTGCTTATGTTAAAACCATAGATCAAGAATATGTCATTGTTTGCTTGTCTGTGGATGATATGTTAATAATGAGCACAAATAGAAGTGTCATTGATTCCACGAAAAGGATGTTGAAATCCAACTTTGATATGAAAGACCTTGGTCTTGCTATTGTTATCCTTGGAATTCAAATAACAAGAAAGTCAAGTGGATACATACTTTCTTAATCTCACTATATAGAGAAAATGTTGAGAAAATTTGATCATTTTGAGAGTAAGCCGGTTGTTACTTCATTTGATCCAAATTGTAAGTTGAACAAAAACAATGATAGATGTGTCTTCTCTAGAATATTCTAGAGTTATTGGTAGCTTAATGTACATAATGAATTGTACAAGACTTGATATTGCTTATTAAGTAGGAAGGTTAGCAAGATACACTAGCAATCCAGGACATGATCATTGGAATGCTTTAGTCAGAGTTCTAAGTTACTTGAAGTACACTCTAGACTATGGATTGCATTATACAAGGTATCCACATGTATTAAAAGGATTTAGTGATG
CTAATTGGATTTCTGATAGCATGGAGACTAAATCAACTAGTTGGTATATTTTTACCTTAGATGGAGTAGTTGTTTCATGGAAAAATCTTAAAAATAGATGTGTATAGCTCGTTGTATTATGGAATCAAAGTTTATAACTCTTAATAAAGTAGGAGAAGAAGTTGAAAGGCTTTGAAGTTTTCTTGAAGATATTCCATTATGACCAAAGCCTGGAAGTGTGATATGTATTTATTGTGACAATTTAGCAACTTCTATGAGAAGCAAGAATAATATCTATAATGGAAAGTCACAACATATTAGACAAAGTCATAATTCGACAAGGCAATTGCTCTCTAATGGAATAATGCAAATCCTTTGATGAAAGAATTGTCAAGAGAGCAAGTTGCATATACATCGACGGAAATGGGGCTAAAGCCTACGAAATGAGTTATCTAGTGGTAACCTAACCTAGTTGACTGGAGATCCCAAGATCTAGGTTTAAAAGGCAAACTAACTGGATAATTCATACTGGACACACTAATTAGAAAATTATAGTTTCTTCCCATTTCTATGATGTTGTTCAGTGTGTAATTTATTAATGTTAAGGTCATATTGATATCTTGTTTTACAAGAGGAGTATTGACATAATACACTTTATCAATATTACCTATGTGAGAGTAGAAGTCGGGTCGCTTCTATGAGAACTTTTCAAATGCCTTATGTTCTCTAAAGCTCTCATGAATTCAAGATTTTGTCCAGGACCAAAATGAACACAAACGTATAGAACTCAAAGATAAAAGATGAGTTGTGTGTGATATGTATTGTCTCAATTCACCGATGAGAAGGACAGTTCTAGAGCTTGGTCTCACTATTTCTTCGGTAAATTTGATGGATATTCAATAAGGAAGGTTCAAGTCTAGAAGATACCTCTCCTGATGCATAGTTTGTCTGTTTGCTATCAACAATTTTGTTTGATTTTTCACTTCCTTTCATGTGGGATTGTTGGATATATATAATGAAAAAAAATGAAAGTTTGATGAATATTTGAAACATAAAAATGGATTTAATTGTGTAGATTAAATTCATTGTTTAGAAACTTTCAAATTGGATTGAAACTTCAAAATAAAATAATGTTTCATGTATAAATTTGTAGTTTTTTCAATTTGAATGAGTTATGTTTGATTTCGTCAAGTAACAACTTAAGTTGAGTTTAATTTGACAAATATTTCAAATTTTATATTTATTTTATTTTGTATTTTGTATTTTGGAGAGTTAGTAAGTTGCTAGCCTATAAATAGCAACTTGTTTCTTCATTTGTAGCATCCTACTTCAAATTGAAGAATTCTCTCATTTCTTTCTTTCTTTTGGTCTTGAATTGAGTTGAGAGCAAATATCCAAGAGTTCTGCTAGATCTGAGTAGTTCGAGATTGTAGTTCTACTTGAGTTAGTCCTTGTATCCTGCTGAGACGATTGATTAGTCTGCTTGCATGGGGTGCAGAGCGAATAATTATCTTTAGGACAGCTCTTATCAGAAGTGTGTCTCGCCTATCTATTGACATCTCAACAATATATTGAGTTTGCTAGTGTATTTAGTTCACATATATTATTTTTTCTCAACAAACTATATAAATATCCACTAAATGGTAACACCATTGAATTTAATTTCACCAAAATTCTTTCCTCAAACTAACAAAACTTACCAATTGCATCAACAACTAATTCAAGCATAAAACCAACTTATTTGTACTGTGCTAACATTCTAAAAAAAAAAATCTCATGAACTTTCTCACCTTAAAGGAAGTATTTCTTCATGATTTTTGGCATTCAAGGAACATTCCCTTCCATCTCCATGTTTCTTCGGCGAGCTTTCAACAACTTTCAAATATTTTTTCTTTCCTGGCGGAGTGGTCCTTCATCCCTTCTTCCTTTTCTGACTATGCTAGACCATTTTCATTCTCTCTTTTTCACCCCCCTTTTCTATTTTCATCTACATAACCAAGTTATTTATTTATTATTATAAA
TTGTTTTTTAAACCCACATGGCACTATACTATTAATGCAATAAAAGGACCATCATACCCTTCTAATTTAAAGACCTTTTTCCTTTTTTCCTTTAATTTGACTTTTTTTTTCTTAATCAAACAAAAATCAATTGAGTTTCACAATGTCACAATAGATAATAATAGATATTGATAAACTACTATCTTTCATTTATGATAGATATTGATAGATGTCTATCAGTATCTATCACTAATAGAGAGTGACATTTTGCTATATATATTTTAAAATATTTTCAACAGTTTCCTATATTTAAATAGGTTGAAAACTTTACCAACGGACTTAGAATATTAATTATAAAATTTTCACTCAAAAATTCAACCTTTATCTAAAGTTATACCGATATTCCACTAAGTATGGGTTAAATGTAAATTTATTGCAAGTTAATTTGCACATTTACTTGTAATATATGTGTAATTGGAAAAACATTAATTTTAAATGTAAAGACATTATAATTAAATGAATTCACATCTTTTATCATTGTTGTTCGGTGCTATTGTTCACACATATGGATGTATAAAGTTTTATTTAATATGTATATAACATTGCTCAATGGTTATAATTAAGCATCCTAAACCATGTAGAGATAAATTTCCTTTCCGAAACTTACTATCTTATTATTGTTATTTTTTTCTTTTTTTTATGTAGTATACAAATTTTATTGGTCTTTAA

mRNA sequence

ATGATTATGCAAACACAAGCTTCCATTTCTCTGAACTGTTTGTATGCCAATGGTCTCACCGCCACCGCCATGTTTCTGCCCCTTCTATTGGTCCTCTTCTTCATCCTCTGTACGCCGCCCTGCCTTCCATCCCCGACAACACTGCCCCAAAACCCCACCGCCACGGTGATTTACCGGTCAAATTCCACTCTACTGCGGTGGCGCGTGTCCGACACATTGCATAGACCCCATGCGACGGCGCATTATGTATTTTTTCCTGGGAAGCCACGGTGGATGCGGCGAATGCCGATGGTGCTGACGTACGCATTTTTCAAGGAGAATTGGATGAGCAACTTGAGTTTAAAAGACGTGAAAGCTGTATTTCGTAAGGCGTTTTCGAAATGGGCCCATGTGATTCCGGTGAGTTTCGAGGAGACCGACGACTATGGATACGCGGATATTAAAATCGGGTTTTATAGAGGCGATCATAGCGACGGGCAGGCGTTCGACGGCGTGTTGGGGGTTTTGGCACATTCTTTTTCCCCTGAGATTGGGAGGCTCCACCTAGATGCGGCGGAGACGTGGGCGGTTGATTTCGAGAAGGAGAAATCGGCGGTGGCGGTTGATTTGGAATCGGTGGCGACGCATGAGATTGGTCATTTGTTGGGGCTGGGTCACAGCTCTGTGAAGGAATCGGTAATGTATCCAAGCTTGAAGCCGAGGGAAAAGAAGGCAAATTTAGAAGTGGATGACGTGGAAGGCGTGCAAGCCTTGTACGGATCGAATCCAAATTTCAAGTTTGAGTCTTTGGAATTGAAGTCCGACACTTCTGCCAGTCAGCCACACCCAATTCTTCTTCTCCACGCCATCCCCTTTTGTCTTTTGTCCGCGCCGTCCGTCCTTGTCGTTCGAGTCGTCGTTCAAGTTTCGTTCTCGTCGTCCAACCTGTCCAGCCCGTTGTTCGAAGTCTGTCGTAGTCCGACATTCAAGTATACAAATTTTATTGGTCTTTAA

Coding sequence (CDS)

ATGATTATGCAAACACAAGCTTCCATTTCTCTGAACTGTTTGTATGCCAATGGTCTCACCGCCACCGCCATGTTTCTGCCCCTTCTATTGGTCCTCTTCTTCATCCTCTGTACGCCGCCCTGCCTTCCATCCCCGACAACACTGCCCCAAAACCCCACCGCCACGGTGATTTACCGGTCAAATTCCACTCTACTGCGGTGGCGCGTGTCCGACACATTGCATAGACCCCATGCGACGGCGCATTATGTATTTTTTCCTGGGAAGCCACGGTGGATGCGGCGAATGCCGATGGTGCTGACGTACGCATTTTTCAAGGAGAATTGGATGAGCAACTTGAGTTTAAAAGACGTGAAAGCTGTATTTCGTAAGGCGTTTTCGAAATGGGCCCATGTGATTCCGGTGAGTTTCGAGGAGACCGACGACTATGGATACGCGGATATTAAAATCGGGTTTTATAGAGGCGATCATAGCGACGGGCAGGCGTTCGACGGCGTGTTGGGGGTTTTGGCACATTCTTTTTCCCCTGAGATTGGGAGGCTCCACCTAGATGCGGCGGAGACGTGGGCGGTTGATTTCGAGAAGGAGAAATCGGCGGTGGCGGTTGATTTGGAATCGGTGGCGACGCATGAGATTGGTCATTTGTTGGGGCTGGGTCACAGCTCTGTGAAGGAATCGGTAATGTATCCAAGCTTGAAGCCGAGGGAAAAGAAGGCAAATTTAGAAGTGGATGACGTGGAAGGCGTGCAAGCCTTGTACGGATCGAATCCAAATTTCAAGTTTGAGTCTTTGGAATTGAAGTCCGACACTTCTGCCAGTCAGCCACACCCAATTCTTCTTCTCCACGCCATCCCCTTTTGTCTTTTGTCCGCGCCGTCCGTCCTTGTCGTTCGAGTCGTCGTTCAAGTTTCGTTCTCGTCGTCCAACCTGTCCAGCCCGTTGTTCGAAGTCTGTCGTAGTCCGACATTCAAGTATACAAATTTTATTGGTCTTTAA

Protein sequence

MIMQTQASISLNCLYANGLTATAMFLPLLLVLFFILCTPPCLPSPTTLPQNPTATVIYRSNSTLLRWRVSDTLHRPHATAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFEETDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVAVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESLELKSDTSASQPHPILLLHAIPFCLLSAPSVLVVRVVVQVSFSSSNLSSPLFEVCRSPTFKYTNFIGL
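
The protein sequence above can be reproduced from the CDS by translating it in frame 0. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The `translate` helper is illustrative and not part of the database record; the two test strings are the first 21 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters outside ACGT (for example N) become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATTATGCAAACACAAGCT"))  # MIMQTQA
print(translate("ATTGGTCTTTAA"))           # IGL
```

Passing the full CDS string to `translate` should give the protein sequence listed above, provided the assumption about the genetic code holds.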
Homology
BLAST of CaUC01G012220 vs. NCBI nr
Match: XP_038903989.1 (metalloendoproteinase 4-MMP-like [Benincasa hispida])

HSP 1 Score: 480.3 bits (1235), Expect = 1.3e-131
Identity = 237/264 (89.77%), Positives = 245/264 (92.80%), Query Frame = 0

Query: 24  MFLPLLLVLFFILCTPPCLPSPTTLPQNPTATVIYRSNSTLLRWRVSDTLHRPHATAHYV 83
           MFLPLLL+LFF LCTPPCLPSPTTLPQNPTA VI+RSNST +RWRVSDTLH PHATA YV
Sbjct: 1   MFLPLLLLLFFFLCTPPCLPSPTTLPQNPTAAVIFRSNSTSMRWRVSDTLHTPHATARYV 60

Query: 84  FFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFEETDDYG 143
           FFPGKPRW RRMPMVLTYAFF+ENW+SNLSLKDV+AVFRKAFSKWAHVIPV FEETDDYG
Sbjct: 61  FFPGKPRWTRRMPMVLTYAFFEENWISNLSLKDVRAVFRKAFSKWAHVIPVRFEETDDYG 120

Query: 144 YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVAVDL 203
           YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETW VDFEKEKSAVAVDL
Sbjct: 121 YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWTVDFEKEKSAVAVDL 180

Query: 204 ESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL 263
           ESVATHEIGHLLGLGHSSV+ESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL
Sbjct: 181 ESVATHEIGHLLGLGHSSVRESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL 240

Query: 264 ELKSDTSASQPHPILLLHAIPFCL 288
            L+SDTSASQP         PF L
Sbjct: 241 ALESDTSASQPPVSTCTVTWPFSL 264
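
Each HSP statistics line reports identities and positives as counts over the alignment length, followed by the rounded percentage. The sketch below, in Python, parses such a line and recomputes the percentages from the counts. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical; the pattern accepts both "Positives" and the "Postives" spelling that this page uses.

```python
import re

# Accept both "Positives" and the "Postives" spelling used on this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identities, length, _, positives, _, _ = match.groups()
    identities, length, positives = int(identities), int(length), int(positives)
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        "pct_identity": 100.0 * identities / length,
        "pct_positives": 100.0 * positives / length,
    }


stats = parse_hsp_stats(
    "Identity = 237/264 (89.77%), Postives = 245/264 (92.80%), Query Frame = 0"
)
print(round(stats["pct_identity"], 2))   # 89.77
print(round(stats["pct_positives"], 2))  # 92.8
```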

BLAST of CaUC01G012220 vs. NCBI nr
Match: XP_008437790.1 (PREDICTED: metalloendoproteinase 4-MMP [Cucumis melo] >KAA0048832.1 metalloendoproteinase 4-MMP [Cucumis melo var. makuwa] >TYK20784.1 metalloendoproteinase 4-MMP [Cucumis melo var. makuwa])

HSP 1 Score: 458.8 bits (1179), Expect = 4.1e-125
Identity = 233/267 (87.27%), Positives = 243/267 (91.01%), Query Frame = 0

Query: 24  MFLPLLLVLFFILCTPPCLPSPTTLPQNPTATV-IYRSNSTLL--RWRVSDTLHRPHATA 83
           MFLPLLL+LFF LCTP CLP PTTLPQN TA V I+RSNS+ L  R R+++TLH PHAT 
Sbjct: 1   MFLPLLLLLFFFLCTPSCLPYPTTLPQNSTAAVIIFRSNSSSLQGRRRLTETLHNPHATE 60

Query: 84  HYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFEETD 143
            YVFFPGKPRW RRMPMVLTYAFFKENWMSNLSLKDV+AVFRKAFSKWAHVIPVSFEETD
Sbjct: 61  RYVFFPGKPRWARRMPMVLTYAFFKENWMSNLSLKDVRAVFRKAFSKWAHVIPVSFEETD 120

Query: 144 DYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVA 203
           DYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVA
Sbjct: 121 DYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVA 180

Query: 204 VDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKF 263
           VDLESVATHEIGHLLGLGHSSV+ESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKF
Sbjct: 181 VDLESVATHEIGHLLGLGHSSVRESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKF 240

Query: 264 ESLELKSDTSASQPHPILLLHAIPFCL 288
           ESLEL+SDTS +QP       A PF L
Sbjct: 241 ESLELESDTSDTQPPVSTCTMACPFSL 267

BLAST of CaUC01G012220 vs. NCBI nr
Match: XP_011650638.1 (metalloendoproteinase 4-MMP [Cucumis sativus])

HSP 1 Score: 454.1 bits (1167), Expect = 1.0e-123
Identity = 228/264 (86.36%), Positives = 239/264 (90.53%), Query Frame = 0

Query: 24  MFLPLLLVLFFILCTPPCLPSPTTLPQNPTATVIYRSNSTLLRWRVSDTLHRPHATAHYV 83
           MFLPLLL+LFF LCTP CLP PTTLPQNPT+ VI+RS+S+        +LH PHAT  YV
Sbjct: 1   MFLPLLLLLFFFLCTPSCLPYPTTLPQNPTSAVIFRSDSS--------SLHNPHATERYV 60

Query: 84  FFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFEETDDYG 143
           FFPGKPRW RRMPMVLTYAFFK+NWMSNLSL+DV+AVFRKAFSKWAHVIPVSFEETDDYG
Sbjct: 61  FFPGKPRWTRRMPMVLTYAFFKDNWMSNLSLEDVRAVFRKAFSKWAHVIPVSFEETDDYG 120

Query: 144 YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVAVDL 203
           YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVAVDL
Sbjct: 121 YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVAVDL 180

Query: 204 ESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL 263
           ESVATHEIGHLLGLGHSSV+ESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL
Sbjct: 181 ESVATHEIGHLLGLGHSSVRESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL 240

Query: 264 ELKSDTSASQPHPILLLHAIPFCL 288
           EL+SDTSASQP       A PF L
Sbjct: 241 ELESDTSASQPPVSTCTLACPFSL 256

BLAST of CaUC01G012220 vs. NCBI nr
Match: KAE8650215.1 (hypothetical protein Csa_010775 [Cucumis sativus])

HSP 1 Score: 453.4 bits (1165), Expect = 1.7e-123
Identity = 224/251 (89.24%), Positives = 235/251 (93.63%), Query Frame = 0

Query: 24  MFLPLLLVLFFILCTPPCLPSPTTLPQNPTATVIYRSNSTLLRWRVSDTLHRPHATAHYV 83
           MFLPLLL+LFF LCTP CLP PTTLPQNPT+ VI+RS+S+        +LH PHAT  YV
Sbjct: 1   MFLPLLLLLFFFLCTPSCLPYPTTLPQNPTSAVIFRSDSS--------SLHNPHATERYV 60

Query: 84  FFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFEETDDYG 143
           FFPGKPRW RRMPMVLTYAFFK+NWMSNLSL+DV+AVFRKAFSKWAHVIPVSFEETDDYG
Sbjct: 61  FFPGKPRWTRRMPMVLTYAFFKDNWMSNLSLEDVRAVFRKAFSKWAHVIPVSFEETDDYG 120

Query: 144 YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVAVDL 203
           YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVAVDL
Sbjct: 121 YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVAVDL 180

Query: 204 ESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL 263
           ESVATHEIGHLLGLGHSSV+ESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL
Sbjct: 181 ESVATHEIGHLLGLGHSSVRESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL 240

Query: 264 ELKSDTSASQP 275
           EL+SDTSASQP
Sbjct: 241 ELESDTSASQP 243

BLAST of CaUC01G012220 vs. NCBI nr
Match: KAG7028506.1 (Metalloendoproteinase 4-MMP, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 415.2 bits (1066), Expect = 5.2e-112
Identity = 208/257 (80.93%), Positives = 222/257 (86.38%), Query Frame = 0

Query: 24  MFLPL------LLVLFFILCTPPCLPSPTTLPQNPTATVIYRSNSTLLRWRVSDTLHRPH 83
           MFLPL      LL+  F LC PP +PS         + V +R NST  +WRVSDTLH PH
Sbjct: 1   MFLPLIPRSRYLLLALFFLCLPPIIPS---------SAVTFRPNSTARQWRVSDTLHGPH 60

Query: 84  ATAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFE 143
           ATA YV+FPGKPRW RRMPMVLTYAF KENW+SNLSLKDVK VFR AFSKWAHVIPV FE
Sbjct: 61  ATARYVYFPGKPRWTRRMPMVLTYAFLKENWVSNLSLKDVKFVFRTAFSKWAHVIPVRFE 120

Query: 144 ETDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKS 203
           ET+DYG+ADIKIGFYRGDH DGQ FDGVLGVLAHSFSPEIGRLHLDAAETWAVDFE+EKS
Sbjct: 121 ETEDYGFADIKIGFYRGDHGDGQPFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEEEKS 180

Query: 204 AVAVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNPN 263
           AVAVDLESVATHEIGHLLGLGHSSV+ESVMYPSLKPREKKANLEVDDVEGVQALYGSNPN
Sbjct: 181 AVAVDLESVATHEIGHLLGLGHSSVRESVMYPSLKPREKKANLEVDDVEGVQALYGSNPN 240

Query: 264 FKFESLELKSDTSASQP 275
           F+FESLEL+SDTSA+ P
Sbjct: 241 FRFESLELESDTSATHP 248

BLAST of CaUC01G012220 vs. ExPASy Swiss-Prot
Match: Q8GWW6 (Metalloendoproteinase 4-MMP OS=Arabidopsis thaliana OX=3702 GN=4MMP PE=1 SV=1)

HSP 1 Score: 278.9 bits (712), Expect = 7.6e-74
Identity = 134/196 (68.37%), Positives = 159/196 (81.12%), Query Frame = 0

Query: 77  HATAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSF 136
           H    YV+FPG+PRW R +P+ LTYAF +EN    L+  D++ VFR+AF KWA VIPVSF
Sbjct: 120 HTGKKYVYFPGRPRWTRDVPLKLTYAFSQENLTPYLAPTDIRRVFRRAFGKWASVIPVSF 179

Query: 137 EETDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEK 196
            ET+DY  ADIKIGF+ GDH DG+ FDGVLGVLAH+FSPE GRLHLD AETWAVDF++EK
Sbjct: 180 IETEDYVIADIKIGFFNGDHGDGEPFDGVLGVLAHTFSPENGRLHLDKAETWAVDFDEEK 239

Query: 197 SAVAVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNP 256
           S+VAVDLESVA HEIGH+LGLGHSSVK++ MYP+LKPR KK NL +DDV GVQ+LYG+NP
Sbjct: 240 SSVAVDLESVAVHEIGHVLGLGHSSVKDAAMYPTLKPRSKKVNLNMDDVVGVQSLYGTNP 299

Query: 257 NFKFESLELKSDTSAS 273
           NF   SL L S+TS +
Sbjct: 300 NFTLNSL-LASETSTN 314

BLAST of CaUC01G012220 vs. ExPASy Swiss-Prot
Match: O23507 (Metalloendoproteinase 1-MMP OS=Arabidopsis thaliana OX=3702 GN=1MMP PE=1 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 5.4e-64
Identity = 130/221 (58.82%), Positives = 156/221 (70.59%), Query Frame = 0

Query: 77  HATAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSF 136
           H TAHY +F GKP+W R     LTYA  K + +  L+ +DVK VFR+AFS+W+ VIPVSF
Sbjct: 145 HTTAHYTYFNGKPKWNRD---TLTYAISKTHKLDYLTSEDVKTVFRRAFSQWSSVIPVSF 204

Query: 137 EETDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFE-KE 196
           EE DD+  AD+KIGFY GDH DG  FDGVLG LAH+F+PE GRLHLDAAETW VD + K 
Sbjct: 205 EEVDDFTTADLKIGFYAGDHGDGLPFDGVLGTLAHAFAPENGRLHLDAAETWIVDDDLKG 264

Query: 197 KSAVAVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSN 256
            S VAVDLESVATHEIGHLLGLGHSS + +VMYPSL+PR KK +L VDDV GV  LYG N
Sbjct: 265 SSEVAVDLESVATHEIGHLLGLGHSSQESAVMYPSLRPRTKKVDLTVDDVAGVLKLYGPN 324

Query: 257 PNFKFESLELKSDT--SASQPHPILLLHAIPFCLLSAPSVL 295
           P  + +SL    D+  + +  H  L  + I + LL    +L
Sbjct: 325 PKLRLDSLTQSEDSIKNGTVSHRFLSGNFIGYVLLVVGLIL 362

BLAST of CaUC01G012220 vs. ExPASy Swiss-Prot
Match: O04529 (Metalloendoproteinase 2-MMP OS=Arabidopsis thaliana OX=3702 GN=2MMP PE=1 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 2.2e-49
Identity = 98/185 (52.97%), Positives = 128/185 (69.19%), Query Frame = 0

Query: 77  HATAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSF 136
           HA   Y  FPG+PRW R     LTYAF  +N ++    ++VK+VF +AF +W+ V  ++F
Sbjct: 150 HAVKRYTLFPGEPRWPRNR-RDLTYAFDPKNPLT----EEVKSVFSRAFGRWSDVTALNF 209

Query: 137 EETDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKE- 196
             ++ +  +DI IGFY GDH DG+ FDGVLG LAH+FSP  G+ HLDA E W V  + + 
Sbjct: 210 TLSESFSTSDITIGFYTGDHGDGEPFDGVLGTLAHAFSPPSGKFHLDADENWVVSGDLDS 269

Query: 197 --KSAVAVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYG 256
                 AVDLESVA HEIGHLLGLGHSSV+ES+MYP++   ++K +L  DDVEG+Q LYG
Sbjct: 270 FLSVTAAVDLESVAVHEIGHLLGLGHSSVEESIMYPTITTGKRKVDLTNDDVEGIQYLYG 329

Query: 257 SNPNF 259
           +NPNF
Sbjct: 330 ANPNF 329

BLAST of CaUC01G012220 vs. ExPASy Swiss-Prot
Match: Q5XF51 (Metalloendoproteinase 3-MMP OS=Arabidopsis thaliana OX=3702 GN=3MMP PE=1 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 6.5e-49
Identity = 98/188 (52.13%), Positives = 125/188 (66.49%), Query Frame = 0

Query: 75  RPHATAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPV 134
           R HA  HY FFPG+PRW R     LTYAF   N ++    ++VK+VF +AF++W  V P+
Sbjct: 152 RFHAVKHYSFFPGEPRWPRNR-RDLTYAFDPRNALT----EEVKSVFSRAFTRWEEVTPL 211

Query: 135 SFEETDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEK 194
           +F   + +  +DI IGFY G+H DG+ FDG +  LAH+FSP  G  HLD  E W V  E 
Sbjct: 212 TFTRVERFSTSDISIGFYSGEHGDGEPFDGPMRTLAHAFSPPTGHFHLDGEENWIVSGEG 271

Query: 195 EKSAV----AVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQA 254
               +    AVDLESVA HEIGHLLGLGHSSV+ S+MYP+++   +K +L  DDVEGVQ 
Sbjct: 272 GDGFISVSEAVDLESVAVHEIGHLLGLGHSSVEGSIMYPTIRTGRRKVDLTTDDVEGVQY 331

Query: 255 LYGSNPNF 259
           LYG+NPNF
Sbjct: 332 LYGANPNF 334

BLAST of CaUC01G012220 vs. ExPASy Swiss-Prot
Match: Q9ZUJ5 (Metalloendoproteinase 5-MMP OS=Arabidopsis thaliana OX=3702 GN=5MMP PE=1 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 9.7e-45
Identity = 90/184 (48.91%), Positives = 122/184 (66.30%), Query Frame = 0

Query: 79  TAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFEE 138
           T  Y FFPGKPRW +R    LTYAF  +N +++    +VK VF +AF++WA V P++F  
Sbjct: 140 TEKYSFFPGKPRWPKR-KRDLTYAFAPQNNLTD----EVKRVFSRAFTRWAEVTPLNFTR 199

Query: 139 TDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSA 198
           ++    ADI IGF+ G+H DG+ FDG +G LAH+ SP  G LHLD  E W +   +    
Sbjct: 200 SESILRADIVIGFFSGEHGDGEPFDGAMGTLAHASSPPTGMLHLDGDEDWLISNGEISRR 259

Query: 199 V-----AVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYG 258
           +      VDLESVA HEIGHLLGLGHSSV++++M+P++   ++K  L  DD+EG+Q LYG
Sbjct: 260 ILPVTTVVDLESVAVHEIGHLLGLGHSSVEDAIMFPAISGGDRKVELAKDDIEGIQHLYG 318

BLAST of CaUC01G012220 vs. ExPASy TrEMBL
Match: A0A5D3DB58 (Metalloendoproteinase 4-MMP OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold291G00450 PE=4 SV=1)

HSP 1 Score: 458.8 bits (1179), Expect = 2.0e-125
Identity = 233/267 (87.27%), Positives = 243/267 (91.01%), Query Frame = 0

Query: 24  MFLPLLLVLFFILCTPPCLPSPTTLPQNPTATV-IYRSNSTLL--RWRVSDTLHRPHATA 83
           MFLPLLL+LFF LCTP CLP PTTLPQN TA V I+RSNS+ L  R R+++TLH PHAT 
Sbjct: 1   MFLPLLLLLFFFLCTPSCLPYPTTLPQNSTAAVIIFRSNSSSLQGRRRLTETLHNPHATE 60

Query: 84  HYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFEETD 143
            YVFFPGKPRW RRMPMVLTYAFFKENWMSNLSLKDV+AVFRKAFSKWAHVIPVSFEETD
Sbjct: 61  RYVFFPGKPRWARRMPMVLTYAFFKENWMSNLSLKDVRAVFRKAFSKWAHVIPVSFEETD 120

Query: 144 DYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVA 203
           DYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVA
Sbjct: 121 DYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVA 180

Query: 204 VDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKF 263
           VDLESVATHEIGHLLGLGHSSV+ESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKF
Sbjct: 181 VDLESVATHEIGHLLGLGHSSVRESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKF 240

Query: 264 ESLELKSDTSASQPHPILLLHAIPFCL 288
           ESLEL+SDTS +QP       A PF L
Sbjct: 241 ESLELESDTSDTQPPVSTCTMACPFSL 267

BLAST of CaUC01G012220 vs. ExPASy TrEMBL
Match: A0A1S3AUH8 (metalloendoproteinase 4-MMP OS=Cucumis melo OX=3656 GN=LOC103483123 PE=4 SV=1)

HSP 1 Score: 458.8 bits (1179), Expect = 2.0e-125
Identity = 233/267 (87.27%), Positives = 243/267 (91.01%), Query Frame = 0

Query: 24  MFLPLLLVLFFILCTPPCLPSPTTLPQNPTATV-IYRSNSTLL--RWRVSDTLHRPHATA 83
           MFLPLLL+LFF LCTP CLP PTTLPQN TA V I+RSNS+ L  R R+++TLH PHAT 
Sbjct: 1   MFLPLLLLLFFFLCTPSCLPYPTTLPQNSTAAVIIFRSNSSSLQGRRRLTETLHNPHATE 60

Query: 84  HYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFEETD 143
            YVFFPGKPRW RRMPMVLTYAFFKENWMSNLSLKDV+AVFRKAFSKWAHVIPVSFEETD
Sbjct: 61  RYVFFPGKPRWARRMPMVLTYAFFKENWMSNLSLKDVRAVFRKAFSKWAHVIPVSFEETD 120

Query: 144 DYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVA 203
           DYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVA
Sbjct: 121 DYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVA 180

Query: 204 VDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKF 263
           VDLESVATHEIGHLLGLGHSSV+ESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKF
Sbjct: 181 VDLESVATHEIGHLLGLGHSSVRESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKF 240

Query: 264 ESLELKSDTSASQPHPILLLHAIPFCL 288
           ESLEL+SDTS +QP       A PF L
Sbjct: 241 ESLELESDTSDTQPPVSTCTMACPFSL 267

BLAST of CaUC01G012220 vs. ExPASy TrEMBL
Match: A0A0A0L2Y8 (ZnMc domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G117950 PE=4 SV=1)

HSP 1 Score: 454.1 bits (1167), Expect = 4.9e-124
Identity = 228/264 (86.36%), Positives = 239/264 (90.53%), Query Frame = 0

Query: 24  MFLPLLLVLFFILCTPPCLPSPTTLPQNPTATVIYRSNSTLLRWRVSDTLHRPHATAHYV 83
           MFLPLLL+LFF LCTP CLP PTTLPQNPT+ VI+RS+S+        +LH PHAT  YV
Sbjct: 1   MFLPLLLLLFFFLCTPSCLPYPTTLPQNPTSAVIFRSDSS--------SLHNPHATERYV 60

Query: 84  FFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFEETDDYG 143
           FFPGKPRW RRMPMVLTYAFFK+NWMSNLSL+DV+AVFRKAFSKWAHVIPVSFEETDDYG
Sbjct: 61  FFPGKPRWTRRMPMVLTYAFFKDNWMSNLSLEDVRAVFRKAFSKWAHVIPVSFEETDDYG 120

Query: 144 YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVAVDL 203
           YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVAVDL
Sbjct: 121 YADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVAVDL 180

Query: 204 ESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL 263
           ESVATHEIGHLLGLGHSSV+ESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL
Sbjct: 181 ESVATHEIGHLLGLGHSSVRESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKFESL 240

Query: 264 ELKSDTSASQPHPILLLHAIPFCL 288
           EL+SDTSASQP       A PF L
Sbjct: 241 ELESDTSASQPPVSTCTLACPFSL 256

BLAST of CaUC01G012220 vs. ExPASy TrEMBL
Match: A0A6J1IFQ0 (metalloendoproteinase 4-MMP-like OS=Cucurbita maxima OX=3661 GN=LOC111472903 PE=4 SV=1)

HSP 1 Score: 411.4 bits (1056), Expect = 3.6e-111
Identity = 208/256 (81.25%), Positives = 222/256 (86.72%), Query Frame = 0

Query: 24  MFLP-------LLLVLFFILCTPPCLPSPTTLPQNPTATVIYRSNSTLLRWRVSDTLHRP 83
           MFLP       LLL LFF LC PP +PS         + V +RSNST  RWRV+DTLH P
Sbjct: 1   MFLPSIPPPRYLLLALFF-LCLPPIIPS---------SAVTFRSNSTARRWRVADTLHGP 60

Query: 84  HATAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSF 143
           H TA Y++FPGKPRW RRMPMVLTYAF KENW+SNLSLKDVK VFR AFSKWAHVIPV F
Sbjct: 61  HVTARYLYFPGKPRWTRRMPMVLTYAFLKENWISNLSLKDVKFVFRTAFSKWAHVIPVRF 120

Query: 144 EETDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEK 203
           EET+DYG+ADIKIGFYRGDH DGQ FDGVLGVLAHSFSPEIGRLHLDAAETWAVDFE+EK
Sbjct: 121 EETEDYGFADIKIGFYRGDHGDGQPFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEEEK 180

Query: 204 SAVAVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNP 263
           SAVAVDLESVATHEIGHLLGLGHSSV+ESVMYPSLKPREKKANLEVDDVEGVQALYGSNP
Sbjct: 181 SAVAVDLESVATHEIGHLLGLGHSSVRESVMYPSLKPREKKANLEVDDVEGVQALYGSNP 240

Query: 264 NFKFESLELKSDTSAS 273
           NF+FESLEL+SDTSA+
Sbjct: 241 NFRFESLELESDTSAT 246

BLAST of CaUC01G012220 vs. ExPASy TrEMBL
Match: A0A6J1DB68 (metalloendoproteinase 4-MMP OS=Momordica charantia OX=3673 GN=LOC111018517 PE=4 SV=1)

HSP 1 Score: 391.7 bits (1005), Expect = 3.0e-105
Identity = 202/253 (79.84%), Positives = 218/253 (86.17%), Query Frame = 0

Query: 24  MFLPLLLVLFFILCTPPCLPSPTT-LPQNPTATVIYRSN--STLLRWRVSDTLHRPHATA 83
           MF  LLL LFF L +PPC+PS T  + +N TA +       +    WRVSDTLH PHATA
Sbjct: 1   MFRSLLLALFF-LGSPPCIPSRTVPVLKNRTAGLGLGLPIITAAAGWRVSDTLHGPHATA 60

Query: 84  HYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFEETD 143
            YV FPGKPRW RRMPMVLTYAF ++NWMSNLS+KDVK VFRK+FSKWA VIPVSFEETD
Sbjct: 61  RYVLFPGKPRWTRRMPMVLTYAFLEDNWMSNLSVKDVKRVFRKSFSKWASVIPVSFEETD 120

Query: 144 DYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSAVA 203
           DY YADIKIGFY+GDHSDGQ FDGVLGVLAHSFSPEIGRLHLDAAETWAVDFE+EKSAVA
Sbjct: 121 DYDYADIKIGFYKGDHSDGQPFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEREKSAVA 180

Query: 204 VDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNPNFKF 263
           VDLESVATHEIGHLLGLGHS V+ESVMYPSLK REKKANLEVDDVEGVQALYGSNPNFK 
Sbjct: 181 VDLESVATHEIGHLLGLGHSRVRESVMYPSLKAREKKANLEVDDVEGVQALYGSNPNFKL 240

Query: 264 ESLELKSDTSASQ 274
           ESLEL+S+TS +Q
Sbjct: 241 ESLELESETSTNQ 252

BLAST of CaUC01G012220 vs. TAIR 10
Match: AT2G45040.1 (Matrixin family protein)

HSP 1 Score: 278.9 bits (712), Expect = 5.4e-75
Identity = 134/196 (68.37%), Positives = 159/196 (81.12%), Query Frame = 0

Query: 77  HATAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSF 136
           H    YV+FPG+PRW R +P+ LTYAF +EN    L+  D++ VFR+AF KWA VIPVSF
Sbjct: 120 HTGKKYVYFPGRPRWTRDVPLKLTYAFSQENLTPYLAPTDIRRVFRRAFGKWASVIPVSF 179

Query: 137 EETDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEK 196
            ET+DY  ADIKIGF+ GDH DG+ FDGVLGVLAH+FSPE GRLHLD AETWAVDF++EK
Sbjct: 180 IETEDYVIADIKIGFFNGDHGDGEPFDGVLGVLAHTFSPENGRLHLDKAETWAVDFDEEK 239

Query: 197 SAVAVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSNP 256
           S+VAVDLESVA HEIGH+LGLGHSSVK++ MYP+LKPR KK NL +DDV GVQ+LYG+NP
Sbjct: 240 SSVAVDLESVAVHEIGHVLGLGHSSVKDAAMYPTLKPRSKKVNLNMDDVVGVQSLYGTNP 299

Query: 257 NFKFESLELKSDTSAS 273
           NF   SL L S+TS +
Sbjct: 300 NFTLNSL-LASETSTN 314
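
The Arabidopsis 4-MMP protein is reported twice on this page with the same bit score (278.9) but different E-values: 7.6e-74 in the Swiss-Prot search (Q8GWW6) and 5.4e-75 in the TAIR 10 search (AT2G45040.1). Under the usual BLAST statistics the E-value is E = m·n·2^(-S'), where S' is the bit score and m·n the effective search space, so the same alignment gets a larger E-value in a larger database. The Python sketch below assumes that formula applies to both searches and ignores rounding in the reported scores; `effective_search_space` is a hypothetical helper name.

```python
def effective_search_space(bit_score: float, e_value: float) -> float:
    """Solve E = mn * 2**(-S') for the effective search space mn."""
    return e_value * 2.0 ** bit_score


# Values copied from the two hits for the Arabidopsis 4-MMP protein.
swissprot = effective_search_space(278.9, 7.6e-74)
tair = effective_search_space(278.9, 5.4e-75)

# Ratio of the implied search spaces (Swiss-Prot search vs. TAIR 10 search).
print(f"{swissprot / tair:.1f}")  # 14.1
```

Because the bit scores are identical, the ratio reduces to the ratio of the two E-values; it is only a rough estimate of the relative search-space sizes.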

BLAST of CaUC01G012220 vs. TAIR 10
Match: AT4G16640.1 (Matrixin family protein)

HSP 1 Score: 246.1 bits (627), Expect = 3.9e-65
Identity = 130/221 (58.82%), Positives = 156/221 (70.59%), Query Frame = 0

Query: 77  HATAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSF 136
           H TAHY +F GKP+W R     LTYA  K + +  L+ +DVK VFR+AFS+W+ VIPVSF
Sbjct: 145 HTTAHYTYFNGKPKWNRD---TLTYAISKTHKLDYLTSEDVKTVFRRAFSQWSSVIPVSF 204

Query: 137 EETDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFE-KE 196
           EE DD+  AD+KIGFY GDH DG  FDGVLG LAH+F+PE GRLHLDAAETW VD + K 
Sbjct: 205 EEVDDFTTADLKIGFYAGDHGDGLPFDGVLGTLAHAFAPENGRLHLDAAETWIVDDDLKG 264

Query: 197 KSAVAVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYGSN 256
            S VAVDLESVATHEIGHLLGLGHSS + +VMYPSL+PR KK +L VDDV GV  LYG N
Sbjct: 265 SSEVAVDLESVATHEIGHLLGLGHSSQESAVMYPSLRPRTKKVDLTVDDVAGVLKLYGPN 324

Query: 257 PNFKFESLELKSDT--SASQPHPILLLHAIPFCLLSAPSVL 295
           P  + +SL    D+  + +  H  L  + I + LL    +L
Sbjct: 325 PKLRLDSLTQSEDSIKNGTVSHRFLSGNFIGYVLLVVGLIL 362

BLAST of CaUC01G012220 vs. TAIR 10
Match: AT1G70170.1 (matrix metalloproteinase)

HSP 1 Score: 197.6 bits (501), Expect = 1.6e-50
Identity = 98/185 (52.97%), Positives = 128/185 (69.19%), Query Frame = 0

Query: 77  HATAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSF 136
           HA   Y  FPG+PRW R     LTYAF  +N ++    ++VK+VF +AF +W+ V  ++F
Sbjct: 150 HAVKRYTLFPGEPRWPRNR-RDLTYAFDPKNPLT----EEVKSVFSRAFGRWSDVTALNF 209

Query: 137 EETDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKE- 196
             ++ +  +DI IGFY GDH DG+ FDGVLG LAH+FSP  G+ HLDA E W V  + + 
Sbjct: 210 TLSESFSTSDITIGFYTGDHGDGEPFDGVLGTLAHAFSPPSGKFHLDADENWVVSGDLDS 269

Query: 197 --KSAVAVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYG 256
                 AVDLESVA HEIGHLLGLGHSSV+ES+MYP++   ++K +L  DDVEG+Q LYG
Sbjct: 270 FLSVTAAVDLESVAVHEIGHLLGLGHSSVEESIMYPTITTGKRKVDLTNDDVEGIQYLYG 329

Query: 257 SNPNF 259
           +NPNF
Sbjct: 330 ANPNF 329

BLAST of CaUC01G012220 vs. TAIR 10
Match: AT1G24140.1 (Matrixin family protein)

HSP 1 Score: 196.1 bits (497), Expect = 4.6e-50
Identity = 98/188 (52.13%), Positives = 125/188 (66.49%), Query Frame = 0

Query: 75  RPHATAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPV 134
           R HA  HY FFPG+PRW R     LTYAF   N ++    ++VK+VF +AF++W  V P+
Sbjct: 152 RFHAVKHYSFFPGEPRWPRNR-RDLTYAFDPRNALT----EEVKSVFSRAFTRWEEVTPL 211

Query: 135 SFEETDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEK 194
           +F   + +  +DI IGFY G+H DG+ FDG +  LAH+FSP  G  HLD  E W V  E 
Sbjct: 212 TFTRVERFSTSDISIGFYSGEHGDGEPFDGPMRTLAHAFSPPTGHFHLDGEENWIVSGEG 271

Query: 195 EKSAV----AVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQA 254
               +    AVDLESVA HEIGHLLGLGHSSV+ S+MYP+++   +K +L  DDVEGVQ 
Sbjct: 272 GDGFISVSEAVDLESVAVHEIGHLLGLGHSSVEGSIMYPTIRTGRRKVDLTTDDVEGVQY 331

Query: 255 LYGSNPNF 259
           LYG+NPNF
Sbjct: 332 LYGANPNF 334

BLAST of CaUC01G012220 vs. TAIR 10
Match: AT1G59970.1 (Matrixin family protein)

HSP 1 Score: 182.2 bits (461), Expect = 6.9e-46
Identity = 90/184 (48.91%), Positives = 122/184 (66.30%), Query Frame = 0

Query: 79  TAHYVFFPGKPRWMRRMPMVLTYAFFKENWMSNLSLKDVKAVFRKAFSKWAHVIPVSFEE 138
           T  Y FFPGKPRW +R    LTYAF  +N +++    +VK VF +AF++WA V P++F  
Sbjct: 140 TEKYSFFPGKPRWPKR-KRDLTYAFAPQNNLTD----EVKRVFSRAFTRWAEVTPLNFTR 199

Query: 139 TDDYGYADIKIGFYRGDHSDGQAFDGVLGVLAHSFSPEIGRLHLDAAETWAVDFEKEKSA 198
           ++    ADI IGF+ G+H DG+ FDG +G LAH+ SP  G LHLD  E W +   +    
Sbjct: 200 SESILRADIVIGFFSGEHGDGEPFDGAMGTLAHASSPPTGMLHLDGDEDWLISNGEISRR 259

Query: 199 V-----AVDLESVATHEIGHLLGLGHSSVKESVMYPSLKPREKKANLEVDDVEGVQALYG 258
           +      VDLESVA HEIGHLLGLGHSSV++++M+P++   ++K  L  DD+EG+Q LYG
Sbjct: 260 ILPVTTVVDLESVAVHEIGHLLGLGHSSVEDAIMFPAISGGDRKVELAKDDIEGIQHLYG 318

The following BLAST results are available for this feature:

CaUC01G012220 vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038903989.1  1.3e-131  89.77         metalloendoproteinase 4-MMP-like [Benincasa hispida]
XP_008437790.1  4.1e-125  87.27         PREDICTED: metalloendoproteinase 4-MMP [Cucumis melo] >KAA0048832.1 metalloendop...
XP_011650638.1  1.0e-123  86.36         metalloendoproteinase 4-MMP [Cucumis sativus]
KAE8650215.1    1.7e-123  89.24         hypothetical protein Csa_010775 [Cucumis sativus]
KAG7028506.1    5.2e-112  80.93         Metalloendoproteinase 4-MMP, partial [Cucurbita argyrosperma subsp. argyrosperma...

CaUC01G012220 vs. ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q8GWW6          7.6e-74   68.37         Metalloendoproteinase 4-MMP OS=Arabidopsis thaliana OX=3702 GN=4MMP PE=1 SV=1
O23507          5.4e-64   58.82         Metalloendoproteinase 1-MMP OS=Arabidopsis thaliana OX=3702 GN=1MMP PE=1 SV=1
O04529          2.2e-49   52.97         Metalloendoproteinase 2-MMP OS=Arabidopsis thaliana OX=3702 GN=2MMP PE=1 SV=1
Q5XF51          6.5e-49   52.13         Metalloendoproteinase 3-MMP OS=Arabidopsis thaliana OX=3702 GN=3MMP PE=1 SV=1
Q9ZUJ5          9.7e-45   48.91         Metalloendoproteinase 5-MMP OS=Arabidopsis thaliana OX=3702 GN=5MMP PE=1 SV=1

CaUC01G012220 vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A5D3DB58      2.0e-125  87.27         Metalloendoproteinase 4-MMP OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaf...
A0A1S3AUH8      2.0e-125  87.27         metalloendoproteinase 4-MMP OS=Cucumis melo OX=3656 GN=LOC103483123 PE=4 SV=1
A0A0A0L2Y8      4.9e-124  86.36         ZnMc domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G117950 PE=4 S...
A0A6J1IFQ0      3.6e-111  81.25         metalloendoproteinase 4-MMP-like OS=Cucurbita maxima OX=3661 GN=LOC111472903 PE=...
A0A6J1DB68      3.0e-105  79.84         metalloendoproteinase 4-MMP OS=Momordica charantia OX=3673 GN=LOC111018517 PE=4 ...

CaUC01G012220 vs. TAIR 10
Match Name      E-value   Identity (%)  Description
AT2G45040.1     5.4e-75   68.37         Matrixin family protein
AT4G16640.1     3.9e-65   58.82         Matrixin family protein
AT1G70170.1     1.6e-50   52.97         matrix metalloproteinase
AT1G24140.1     4.6e-50   52.13         Matrixin family protein
AT1G59970.1     6.9e-46   48.91         Matrixin family protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                                 Source       Source Term      Source Description                              Alignment
IPR021190  Peptidase M10A                                  PRINTS       PR00138          MATRIXIN                                        coord: 206..231, score: 55.72; coord: 121..136, score: 41.31; coord: 240..253, score: 58.8; coord: 145..173, score: 53.24
IPR006026  Peptidase, metallopeptidase                     SMART        SM00235          col_5                                           coord: 87..254, e-value: 3.7E-41, score: 152.7
IPR024079  Metallopeptidase, catalytic domain superfamily  GENE3D       3.40.390.10      Collagenase (Catalytic Domain)                  coord: 49..259, e-value: 5.4E-55, score: 188.4
IPR001818  Peptidase M10, metallopeptidase                 PFAM         PF00413          Peptidase_M10                                   coord: 98..253, e-value: 8.5E-48, score: 162.2
None       No IPR available                                PANTHER      PTHR10201:SF245  METALLOENDOPROTEINASE 4-MMP                     coord: 69..275
None       No IPR available                                PANTHER      PTHR10201        MATRIX METALLOPROTEINASE                        coord: 69..275
None       No IPR available                                SUPERFAMILY  55486            Metalloproteases ("zincins"), catalytic domain  coord: 84..258
IPR033739  Peptidase M10A, catalytic domain                CDD          cd04278          ZnMc_MMP                                        coord: 90..253, e-value: 6.57467E-65, score: 200.123
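
The domain coordinates above can be cross-checked against the sequence. The Python sketch below locates a zinc-binding motif in the query fragment that the BLAST alignments number from residue 204, then lists which annotated ranges contain it. The pattern `HE..H..G..H` is an assumption: it is the consensus commonly cited for the catalytic zinc site of matrixin-type metalloproteases, and it is not stated in this record. The names `FRAGMENT`, `DOMAIN_HITS`, and `covering` are illustrative.

```python
import re

# Query residues 204..233, copied from the BLAST alignments above.
FRAGMENT = "ESVATHEIGHLLGLGHSSVKESVMYPSLKP"
FRAGMENT_START = 204  # 1-based coordinate of the first residue in FRAGMENT

# Assumed consensus for the catalytic zinc-binding site (HExxHxxGxxH).
ZINC_MOTIF = re.compile(r"HE..H..G..H")

# Coordinate ranges taken from the InterPro annotation table above.
DOMAIN_HITS = {
    "PRINTS PR00138 (206..231)": (206, 231),
    "SMART SM00235": (87, 254),
    "GENE3D 3.40.390.10": (49, 259),
    "PFAM PF00413": (98, 253),
    "PANTHER PTHR10201": (69, 275),
    "SUPERFAMILY 55486": (84, 258),
    "CDD cd04278": (90, 253),
}

match = ZINC_MOTIF.search(FRAGMENT)
start = FRAGMENT_START + match.start()    # 1-based, inclusive
end = FRAGMENT_START + match.end() - 1    # 1-based, inclusive

# Keep every annotated range that fully contains the motif.
covering = [
    name for name, (lo, hi) in DOMAIN_HITS.items() if lo <= start and end <= hi
]

print(match.group(), start, end)  # HEIGHLLGLGH 209 219
print(covering)
```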

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CaUC01G012220.1  CaUC01G012220.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0030574      collagen catabolic process
biological_process  GO:0030198      extracellular matrix organization
biological_process  GO:0006457      protein folding
biological_process  GO:0006508      proteolysis
cellular_component  GO:0031225      anchored component of membrane
cellular_component  GO:0005737      cytoplasm
cellular_component  GO:0031012      extracellular matrix
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0016887      ATP hydrolysis activity
molecular_function  GO:0004222      metalloendopeptidase activity
molecular_function  GO:0051082      unfolded protein binding
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0008237      metallopeptidase activity
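
For downstream use, the GO rows above can be grouped by category. A minimal Python sketch follows, assuming each row has the form category, GO accession, term name separated by whitespace; `group_go_terms` is a hypothetical helper and `GO_LINES` holds only a subset of the rows for brevity.

```python
import re
from collections import defaultdict

# A few rows copied from the GO table above.
GO_LINES = """\
biological_process GO:0006508 proteolysis
cellular_component GO:0031012 extracellular matrix
molecular_function GO:0004222 metalloendopeptidase activity
molecular_function GO:0008270 zinc ion binding
""".splitlines()

# Category, seven-digit GO accession, then the free-text term name.
GO_ROW = re.compile(r"^(\w+)\s+(GO:\d{7})\s+(.+)$")


def group_go_terms(lines):
    """Group (accession, name) pairs by GO category, skipping non-data lines."""
    grouped = defaultdict(list)
    for line in lines:
        row = GO_ROW.match(line.strip())
        if row is None:
            continue  # header or blank line
        category, accession, name = row.groups()
        grouped[category].append((accession, name))
    return dict(grouped)


for category, terms in group_go_terms(GO_LINES).items():
    print(category, terms)
```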