Tan0020931 (gene) Snake gourd v1

Overview
NameTan0020931
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionubiquitin-like-specific protease 1D
LocationLG04: 11628938 .. 11661977 (+)
RNA-Seq ExpressionTan0020931
SyntenyTan0020931
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAAAATTAAAAAAACCCAAAACTCTGTTCTGCGACTTGTGTCAAAGGTGTGTTGTCAAGAACAGAGAGGGGAAACTGAGAGACAGAGAGAGGGAGACGAAATAATAGCTTAAATTTCTGCTGCACCTGCACCAGATGCAAATCCATGGTGATGGAGCAAGAGAAAACGACAAAGAAACCTCTGAATATTGACTGGGACGAACTGTTCGGCTGTAAAGACGAGGAGCCTCCACTGGAAATAGTCATTTTGCCCGCCATCGCGAACTCGAAACACTTTGAAATGGAATCCGATCGCCAACATTTGGTAAGAGAAGAATACCAGAAGCTTAGTGATAGCGAACTGGAAGAAAAGATTCGCAGAATGCATCAATTCTACGAGTCCACGGCTTGCAAATTGCCAGACAAGGGGCAAAAGTATCTTCGCAATCTGGAGCTGTCCATGGAAGAGAGGGAATCCAGAAAGCTCCGTCGAGTTGAAAAGGTTTTTCTGTTACGTTTTGGTTCCATCCCTTCGATTCATAAACTCGTGTTAATTACTCGATTATTTTTTTTCTGAGAAAATTGAAAGAATGGGCAATTGGTCGTATTAGATTTTGCTCTTCTTTTTTTTTTTTTGTATCATTGGGGGGAAATTTTGTATGTTCCGACCCTGGTTCTAATGTTGATTATGTACTAAATTATTTTTTTTCCTTTCCGTTTTTTTAAAATTTTTATTTTATATAAATCGCCGAACTTGGTTTTGCGTTCGATCATCGAGATGCTGCATATCAGTTAAACTCCCCCTTTCAGAATTCGCGTTTGTTGTTGTGACACTGGTCAGTATTTTAGTAGTTCGTAAAAAAACAAAGAAGCTATCTTGCGAAGACTTTTCGTTAACAAATGCTTGCTGCTCGATAAAATGTGGTTGTTCCGCCTCAGTGTGTGGAGGTTGATTTATTTAGTTATTTTTTTATTAGGAAGCCTCCCAACTCCAGACGAGTTAAGAAAAACTCTCCAGTTGGAAGAGATAGAAGAGGGAAAGTGATATGACCAATTTCTTTTGAGCAAGAACTCGATAAAAGAAGCTTCATATAAAAAGTATCATAACCCAACTTCTAGTCTATTCGCGTTCTGAAAATTTAAAAAGTCTTGTTCCTCTCGGTTACACACTTGCGGATGACACCATACTGTTTTCTTCTCCTCAGGCAGCATTTATTGACAGTCTATTTAATATTATCAAAGATTTCGAGGATGGATCGGGGTTGAATATTAATAGGCAAAAATCAGAATTCCTAAATGTTCTTGTTGACAATACTGTTGCTTCTTCCATGGCTTCAAATCTGGGATGTAGTCTTGGAAACTGGCCAGCAACTTATTTGGGTCTACTTCTACATGGTAAGCCTAAAACTATGGCCTTTTGGGTCCCCATTGTAGAGTAAGTCCAAAAAAGGCTCCTATCGTGGGGGAGAAAATCCAAAAAAGGCTCCTATCTTGGGGGCACTCACACATATCCAAAGGGGGGTGATTGACTCTTATACAAGCCACTTTAGTCAACTTCCAATTTATTACCTTTCCCTCCTTTCCATTCCCTCCAAGGTCGCATCAATCATTGAGAGGCTTCTTCAGAATTTTTTTATGGAGAGGAAATTCTGATAAAGAGAGATTACACAACCTGAGTTGGGAGAAAGTTAAACTCCCATTGGAAGAAGGAGGCTTGCGGATCATTGATCGTAAACAGATAAATACTTACATTGTTGGCTAAGTGGATTTGGAGATTCCATAGAGAAAACAACGCTGTGGAGGAAAGTCATTGCTGCAAAATTTGGTTCCCAACACTTTGATCTGAAGCCCAAGACCTTTTCCCTAAGCTTTCTAAGGGACCTTGGAAAACCATTCATAAAGTCAGCAATAGTGTCTATTCTCATATATGCTGCAAAGTTGGAAATGGTCTTGGCACCAGTTTTTGGTCTGACCTATGGTTGGGAAATAGGGCTTTAAAGGATAAATTCCCCTTTTTATTTTCCATTGCCCTCAAAAGGGAGGGTTCTATGGTGGGGATGTGGAGCTCTAACACATTGTCCTGGAATATTCACCCGAGGAGACACATGATCGAGAAAGAAATTGAAGAATGGGGTATTTTCTTCTGAATCTCTCTCACAAGCCCTATCATTGGAAGGCGATAAGAGTTGCAAAGATCTATACAATTCTGTTTGGAAACGTCACTACCCTAAAAGAGTGAAGTTCTTCGTGTGAGAGGTGAGTCACATGTGTTTAAATACTTTTGAGAGGCTGAAAAGGCGCTCCCCCCGGATGTCCATATCTCCTGGCAAAAGGGATGGGGTCTCAAAATCATCTCTTTATATGGTGCAAGTAGGCTAGAACCCTTTGGGACTTTATTTTGGAAGCATTTGGATGGCAGACTGTATTTCCCTTCAGTATAAGGGATCTTCTTCCTTTGATACTCATTGGTCATCCATTCAAGAAAGAAAAGAAGGTGCTTTGGCTGAATTTTATTGGGGCATTCTTTTGGTCCTTATGATTGGAGCGCAAATCACGTGTCTTACCGGATAAAGAACAAGACCCTTGTTCTCTACACTCTTGTATCATTTTCTTATCCCTTAACTGGTGTAAATTGCACTCTCTCTTTTGTAATTATAGTCTTACCAATCTTATGATCAATTGGAAGTGTTTTTTGTAATCCACTGGATAAGATATCTTCTTTGTAATTTCATCTCATCAATGAAATGTTACTATGTTTCCTATAAAAAAACATAAGAAGTCTTGTTCCTTTCTGACCATGTTCTTCTGCAAGATGGCATCACCTGCATTGAGCCACCACTTTGTAACCTTTACTTTCAAGAACTCACAAAGACAAAATCTGGAAGCTCATAGTACTGCTCAGGAAAATAAAATGACTATTTAAAGGGTCAAAAAGTCTTCAATATATATATATGATGTGAAAGGACTGCCCATGAGAAGGTGGGCTACCTTTCAAGTCTTTCTTGCTCATAGCATCCAGGGAGAGGACTCTATATTGGTTCCTCAGGTCCATGTCTATAGTAGAACCTTCACTTTCTCAGTAAACTTGTGCTTCCTAATTACATTACCTACCTCTTCCAGCTAGGAGGAGTTAGATGCCCAAATGATCCAAAGGGGGCATTTGTGTAGAAGTCCCACTTTTGGCCACAAGTTCACGGCCTCATGTCTACGACATTGGCAATGAGCACCTCCAACTTGAAGTTTAATTGAACCACTCCTCAGAATCACATTGTGATTTTGAAGGCCTCTGGTTGGTGTTTGACTATGCTTGGGAATGTCTTCGACTTTTTATCGCTGATTTTTGTGAGACATCCTTTCAAGAATGACAAGGAGGCTTTGTGGCTTGCATTTGATCACTATTTCTTTTAGTCCCTTTGGTGTGAACGGAACGGATGGGTCTTCGATGATAAGTACTCCACATTCGAAAGCTTTATGGATTTGGTTATTTTTAGTGCTTTGTATTGGTGTATATGTTCACACCTTTTTAAAAACCGTCTTTCTACTTTAATTTGCGATTGGAAATCCTTTTTGTAATCCACATTGAGTGATTATTTCATTTATCAATGAAATATTTGTTTTTCTCTCAAAATTCCAAATCATTTGAGATCCCTGTAGACCCTTGAGTCTTGCATCTCAATTCTGTGAGTCTTGCATCTCAATTCTGAATTCCAACTTTGAGATTTGGTATTGAATTTGGGTCTGGTCATATAATAAATTATGAGGAAAGAAGTATGAAGAGAGGCACTTTTGGCTATTTATCTTTGCAAAATCTAGCAAGACGAGGTCTTTAACATTTTAATGTAGGTACTTATTTGATTTCTTCCAAAATCTCATAAGAAGTATGAGGTCTTTAACATTTTAGGGGTTGTTTAGCCCATTGGTATGAGTTGGGTTGAGTTGGTTTAAAAAACCAACTCAGTGTTTGAGCCATTGAGTTGGGTTGAGTTCCCAACTCCCCACTTCCCAAAATATCTCCCCCAATTTTCTCAACTTACATCTCCCCACAAAATCAATTCACACACCTCTCCACCAATTTCCCTAAACCTTGATTCCCTTCCGAAGGTCCGGCGGTGGTTCGATGACTTTCCGACGGTGGTCCGGCGAAACTCTGCGCGATGGTCCGACGGGATGAAGAGTGAGATGAAAAGAAAAGAACGAGAAGAAGATGAGAGTTTTGAAGTACACCCTTTAAGTTGCAATTACTCTAGTTTAAATAGGTAGCAAAAAGTCCTGATCCTTTTTAGAACAAAAAATCCCATTTTTTTAAAAAGGAAATCCTAATCTTTTTTTTTTTAAAAAGAAATCTTAATTCCCTTTTTTTAAAACATAAATCCTAATCCTTTTTAGACTTTGAAACACCATTTCCTAAAAAATCTCTAGATTCATAATCCTTATCCGATTATCCAAAAATTAAAATTTAAATCAAATACAATTAATTCATTTGATTTTACGAGGCGTTATAGAAAAAGAGGGGGTTTAAGTTGAATTATCCATAAGTTCCGTTGTTGAAACGAAAAGAAAAGTTATAGGGAAGTGATTAAGAAAAGAAAAATATGAAATGGAATCCGCTAGACGTAAAGATGTTGTAGACCAACTGTATGCAATAGAAGGATTGACCAAGGATGATCCTCTTAGTCGCAGACATCTAGAAGACGGACTGCTTCCTCCAAGTCCCATCACAATCCAAGAAGAGATACTGCTTGCATCTTCTTAGTAGGACTAGTTGAACACTTACTCCACCTATCTTTTACTAAACCAAGATGACCCATACACTTCATATTTATTTCGTTAGATATATTTTGTGACCGATAATATTTTTTGTATTCTTTAATTTTTTGAGGGCATGCGAGATAGGATTTGATAGCATGTCATTAATTCTTTTGTCATGAATGAATGTATTGTCAATGATGGGAAATAGTAAACACTCTTTTGACAATCCTTCATGAAGTTTGATTTAGACATTGATCGTAAGTTATCTGCCCAATTGCATTATTTTTTCTTTTTCTGTAAACCTTACAAATATTAAATTGATTCGATTAATTACATACTAATTGGAGAATTCAATTGATAAGGAAAAAAATAACAAAGCTTGATAACACACCATCATGTTCAATCATTATGCAAGTCAATATTATACACATTCATTACACTGAAGTTATCATACAACACAAGTTCGATGTTTACACCTGAATTCAATGTTGAAACTTGGTGGTATTGTGTTTTGAAGATAAATGTTTCAGATAGAATATCTAAAAACTATCAGTAAGATAAAAACTGACTATTTGATCTTAAAAAAAAACTGCACCAGACACCCCTGTTCAACTCAACTCAACTCTGCACCCCAAACACAAACTTCCTGCACTCAACTCAACTCTGCACAACTCAACCCAACTCTGCCCAACTCAACTCAACTCTGCGCGTCAAAAGCCCCTTTAGTGTGTTTGACTGCTTTCTTGTCTCTCCCAGCTAGCTTGAGAAGTTTGAGTATGTTGAGGTGAAAAGATTAGAAAGATCAACGTCTAATTATTTCCCTATTTTATTAAATGTTGGCAACATAAGTTGGGGTCCCACCCTTTTAAAGTCTGATAACATGTGGCTTGACCACTGGGAGTATGCATTTCATCGAGCAATTTTGAAGTCTCTTCCCATCCATAGGCATAATGCATAGGTTCACAAGGTCATTCTTTCATAATGAAGTTAAGTGTTCCCATAGGTTGGCCTCTCATCTTGTTAGCTCATAAAATGTTTCATGTTCTCTTCTCTTTAGTAGCTTGGGGATTGAATACATCCTCCATCAAGATCTTGTCTTTTTTGCTAATTGAAACTTCTGATTTGCCCTTGATAATGATTCCTTTTGGACAAAACCAAAAGGAATTGATACTGAATTTGGCCATACATGACGTGAAAGAGTTCTCGATAGTGAAAAAAATGCTCCCTTCCCTCTCCTCTCTTCCTTTCACCACCTGAACTCTGGCCTCTGACCATCTCCGACCACCATGTCCACCGAAAACAACCAACATCCTTCTCCTTGGAATTACTCCACCCGATCAATTCAAATTGAGCGGAAGACCTTTACAATTGCTTTTGATGGCCTTTCTCGAGGAAGCAAAGCAAAGATAACTGAGCATGGGAAACATTCCTCACATTCCATATCCTTATCATCGAGTTCCCTAAACTGGTTAGCCTCTTCCTTTAACTCTCTCTACAAAGATCCTTGCTCCTACAAGTTTTTCAAGAAGTTGCACTCAGCCGATGACACTCTTTGGGTAGAGAAGTTAAAGAACAAGTTTGACTACTTTGTTGAGTTCACTCAACTCACTCACAATGGTGAAAGGAGGAAAATTCTTATCCCATCTAAAGATAAGAAACAGGGTTGGTTTTCTTTTTTCTCCCTTATCTCAGATTATTCAGGCGAAATTCATAAAACATTTTCCTCTAAAGATGAACTTATCAGAAAGCCGCAACATTTGTTGGATCTTAAACTAGATGACAATCCCACCCTATCAACCACCTCCCAGCCATCGGTTCCTGTGGGTTCTTGGAATGGAATTCTGATTTGCCAACGGTTTTCTCATACAGATTCTTGGCCGAATATTCGGGTTGCACTGCAAGATTGTCTCTCACCTCGTTGCACCATCAATCCTTTTCAAGAAAATAAGGCGTTGATCCAAATTTACGACTCCAAACTCCCTGATCATTTTTGCATTACCTCTGTTTGGCATCTGGTTGGTGAGTTTAAATTAAAATTCCATGCCTTCTCCATTTTTTCCTTTTTGCAGGACAAAATGATCGTTTCTTATGGAGGTTGGATTGAAGTTGTGGATCTTCCTCTTAGCCTTTGGTCAGTTGATGTCTTCCGGTTTATTGGTGATAAATGTGGTGGTTTCCTTCAGACCTCGGATCACACTGATCGTTGATTATCTCTCCACGCTGCCCGCTTGAAAGTTCAAAGAAACCTCATCGGATCCATCCCGGAGTTTGTCGAGCTGCTGGAATCCATTGCCGAAATCACTCTCTCCGTTCGAATTAGGCCGGTTACTGTCATTAATCGCTTACAGGTCAGCAAAGCATCGGTTTTGGATTCTTTAATTGCAGATGAGGTCACTGAATAGTTTCCTTGTGAAGTGTCTGATAGAAAAGGAAAAAGGCATCTAATTGTTGCCGATAAGGAGGATATTTTACTCCAACCTTTATTTCCGCAAGAATCTGTTGCGATCTGCCCTCCTAAAATCCCTCCCTTGCCCGTTTTGCCAATTAATCTGATGATGGTGGATTTGAAAGTTTCTTTAACGGAATCAGTGCCAGATATCTTGGATTCCTCGATTGTCGCGTGTGGACCTTCAAATCCGCATTCCTTAGCTTTAATTCAACCAACTCTTTCCTCCATATCCTTTGCTGCGGGTCCCACCAATGAGGAAAGTCTTCCCATTTTACACATTGGCGTTAAACACTCCACTTTGGATGGGAATTCGATTCTGCCTTTTAATGCTTCAGATATGGAGGCTTATCTCTCTAGCCCTAATTCTCTTCCTTCAAATCACGAGTTTGACTTTCAACCTAAACCTTTTGACCCCACAATTTTCGGGGATAATCCTTCCCCTGATACCGCAACCCCCTTAGATTTTTTACCTCCAATTCCGTGTGACTTAACAAACCATCCGATTGTTGCTCAGCATCCGGAAAACCCCCTAATTAATTCTTTTGTGCTGTCTTCTCCCTCTAATGAAAAAGTCCCTCACAACTCTAATACTCAGCCAACTGTTGGCCCTTTTCCCCTTGAGTTGCGTGACATTACTCATTTTCTCACAGAGCATGGCTTATGCATTTTACCAATCCCAACCCTACCCTAACCAGCCAAGGCCAAAAAAGTTCAATCGACATATGGGAGGAAGAACAAATTACAGAGAGAGTTACAAAACTTGAAATCCACTGTCCATTATGATAAATTTTCCACTTTGGCTATGATGGAGGGCTCTTCGGTTGATCAATGAAATTTTTGACTTGGAATGTTCGGGGCCTAGGTTATTGGAAGAAGCAGGCTTTAACTAAGAGATCAATTCTTCAACATAACCAGGGAATTGTTCTTTTACAGGAAACTAAATTAACTTCTGACACGCATTAGTTGGTAAAGTCTATTTGGAGTTCATCACACATTGGTTGGGCATCTCTTGATGCAATTAATTCTGCTGGTGGTATTTTAATTCTTTGGAGTGAGCCAAATTCTATGGTTAAAGAAATAATCCAAGGTTTGTATACAGTCTCCATTCATGTTTTTTTAACTGATGGCTTTTCCTTCTGGCTTACATCAGTTTATGGCCCCTTTGGTAACGGTTTTTACGAGGATTTTTGGCGGGAGTTAGATGACTTGACTGGCCTAGGTGGTGATCGTTGGATTATTGGAGGAGATTTTAATGTTACTCGATGGTACTTGGGAGAAGTCACATGATCCTTTAATCAACCACAACATGAACACCTTTAACCGATGAATTTATACTTATAATTTGTTTGATGTTCCTTTGCAGAATGGTAACTTCACTTGGTCGAGCTTTGGTCCTACGCATTACTTATCTCTATGGGATAGGTTTTTAATTACTGATGGTTGTTCTAGAAAGTTTGGCTCTGCCACTCTTCGACGATTGGATCGTTATCACTTCTGACCATTTTCCCCTTGCTCTTTTCTTTGGTAATGTAGTATGGGGTCCTTGCCCATTCCGTTTTGAGAACTCTTGGTTGTTGATTGACTCAATTAAGGAGGTTGTCTCGAGTTGGTGGTCTCAAAATCCCTTAATTGGATGGTCAGGCCCTGGTTTAATGATGAAGTTAAAGGGTCTAAAGGTGTTTTTGAAGAATTGGAGCAAATCTCATCGTTTTGAGGCTAATAGATTGCAATCCTTTATTTCTCAGTTGCAGATTCTGGATAATTTGGAGGATACTACACCCCTATCGACTTCTCAGATTGAGTGTCGTCATCTCTTTAGGGAACAAATTGAGGTTTTAACAACCCAAGATCATATCTTTTGGAGACATAGGTGTAAATTGAAATGGCTTCAAGAAGGTGATGAGAACACCAAGCTCTTCCATAAAATCTTGGCTACAAGGAGAAGGAAGAACTCAATTACTGAGATCCTATCTAGAGATGGGATTAGCCTTATTACTGCTGATGATATTGAACACGAGTCTCTTGATTTTTATTCTAATTTGTTCACTAGAGATGAGGGTCTCCGCTTTCTTCCTGCTAATATTGATTGGAGTCCTATTTCTCTAGAGCAGACAGGTCGTCTTGAAGGCTTTCTCACAGAGGAGGAAGTTCGTAAGGCTGTTTTTTCTCTGGGTTCTGGTAAAACTCCAGGCCCTGATGGCTTTACTGCCGAATTCTTTAGGTTCTTCTAGGATACGATTAAACAGGATGTTATGACCATGATTCAGGAATTTTATTCCTCTGGGGTTATTAATGCATCTATGAATGAAACATATATTTGCTTTGTACCTAAAAAGTTGGCTTCCGAATCTGTTAATGAGTTTCGACCCATAAGCCTCATCTCCTGTGTTTATAAGATTGTTGCTCGGGTTTTATCTGACCGTTTGAAGCCTATTTTGGCCACTACAACACTACTATTACAGATAACCAACTTGCTTTTGTTTCGCAAAGACAGATCCTTAATGCTTCTTTAATGGCGAATGAATTGATTAATGATTGGACAAACCGTAATATAAAAGGTGTGGTCTTGAAGCTGGATCTTGAAAAGACATTTGATCTGATTGGTTGAGGATTTTCTTGTGTGATACTACGAGGCAAAGGGATTTGGGAGTCTTTGGAGGAAATGGATTCGTGGTTGTATTACTAGTGCAAACTATTCTATTATTTTAAATGGTAGGGCCCGTGGAAAAATTATTCCTAGTCGTGGTATTCGACAAGGTGATCCCTCATCACCTTTTCTCTTTATTTTGGTCTCTGACTATCTCAGCCGACTGTTGAACCATAGTGCTAATAGGGGCCTTATTTCAACCCATCCAATTGGTGCATCATCCTTCTGTTTAAACCATTAGCAGTTTGCGGATGATACTTTACTCTTTTCTACATTTGATCATTTGGCGCTGAAACATTTGTTTGAAGTGGTTGAAATTTTTGAGAGGGCTTCAGGGTTAAAAGTTAACCTTGCTAAGAGTGAGATTTTGGGTGTTCATGTTGATGATTCTGAATTGGATTGGATGCTGTCGACCTTTGGTTGCAAACAAGGTATGTGGCCGACTATCTATCTTGGCTTGCCTTTGGGAGGGAACTTAAAAAGAATTTCTTTTTGGAATCCAATTATTGAACGTTTGTAGCAGAAGCTCCATAATTGGAAGCATGCTTTTATTTCTAAGGGTGGGAGACATACCCTTATTCAAGCTACTCTCTCAAGTATGGCCTCTATATTATATGTCCTTGTTCAAACTTCCGCCAAAAGTCATTGCGAATCTAGATAAGATTATTAGGGATTTCTTGTGGGAAGGTGCAAGAGGTGATGGTGGAGTACATAATGTGAAATGGACAACCACTCAACTCCCTAAACTTTTGGGCGGTCTTGGGATTGACAACTTTAAGCAGCGTAATATGGCTCTCTTAGCAAAATGGGTTTGGCGGTTTTCTCAGGAACATGATTCCTTTGGAGGAAACTTATTGTTGCAAAGTATTATGATCATGCACAAGCAACCTTTTGGCCTCCTTTTTCTCCTAATACTCCTCATAACTCCCCATGGCGGTACATTTGTGATGTTTGAGATTTGGTTTCTTCTCGCTCCTTTCGGTGTATTGGGAATGGTTGTGATACTCGATTTTGGATTGATTCTTGGCTCAACAGTGGTTCTCTTGAAACTATCTGCTCACGACTCTTTCGACTTTCTTTGTTCCCTGATTCTAGAGTTGCAAGTCTTTGGAATTCCACGAATTCAACATTGGACCTAAAACTTCAGCGAAACTTAACAGATTTAGAGACTATTGAATGAGCCTATCTTTCTCAACTACTCACCTCTGTAAACCTTTCGAATATTCCTGATTCTTGAGTTTGGTCTTTAGAGCCTTCTGGATGTTTCTCTGGAAAATCTCTTACAGAGAGTTTGCTAGTTTCTGCTGATTTGGTGACACGAGACATCTATATGGTAATTTGGAAGGATAAATATCCAAAGAAGATTAAGATTTTCCTTTGGGAACTTAATCTGGGAGCCGTTAACACTTCTGATAGACTCCAAAGAAGAATGCCTTTTATTAGTCTCTCCCCCTCATGGTGTCCCATGTGTCATTTGGGATCAGAGTTTGCGGGCCATCTGTTCGTGCATTGTTTGTTTGCTCAACGTTTCTGGTCGTGTGTTCTGGATGCTTTCAGCTGGGTCATGCCTTTCTCTAGTAATATTTTTTATTTTCTTTCCTCCGTTTTGGTGGGCCATCCGTTCACAGGTTCAAAGAAGGTTCTTTGGTTGGCTTTTGTTCGTGCTTTCTTGTGGCATCTTTGGATCGAGAGGAATGGTCGTTTGTTCAAGGATGTTTCCACTCCTTTTGAACGTTTTATGGATCATGTTCTCTCAATTGTTTATTCATGGTGCAAGCATGTTCAACCTTTTGGTCTTTATAGCTTATCGTTTTTAATGTCAAATTGGCGATCATTCTTGTAATTCACCTTTTGGTGCTCGGAGTTATCTCCATTTCATTTATCAATGAAATAATTTCTTATACCAAAAAAGATACTAAATTTGGCCATTTTCCTAGAGATTGATGACTTAGTGATCATGGTTTTAGACTTTTTGATGACCCTTGGAGAGATATTTGGAAATCTAAGGCTTTTATAACCATGAGAGTCATTAAAGCCTTTAGGGGCCGTTTGGTTTAAGGTTTTATTGGATGAGAATGGGTAATAGATGCCTGGGAATAAGATGTTTGAGAATAATAAATGCCTGGGAATAAGATAACTGTGTTTGGTTATACATGGGAATATTATTAGATTTTTATGTAAAAAATGATATTTTATATTTATTTGACGAAATTAATATATAAAACTTATGTAATATATACTTACGATCTAATAAATAATGTGCAAATTTATTTTATGATATTTATCTTATAAATTTTATATATTATTATCTATTATTAATAAAATAATAAAAATGCATTTAAAAAATAATAATTAATTAATATAATAATAATTAGTGTTTTAATATTTTTATAATGAATATTTATGCTTGTTTGATGAAATTAATCTTAAAAACATTATTTTATTAATTAATCAAAATTAATGTCAACTAATAAAAATAAATTAATTATTTGTAAATATTTTTAATTAAATAATTAATTATGATTATTGAAAATAAATAAATAAATAAATAAAAATGTATAAATATTAATAAAGAAGAGAGAGATATTAATAAAGAAGAGAGAGAAAGAGAGAGATAGATGGGTATAAAGAAAACCCATCATTCTTGAACGGTATAGGAAACCCATCCACCATGGGAAAAAGTGATGCATGAGAATAAGTGATACCCATGCCATGCTTGCAAACCTTGTTCCAAACAAGGAGATTTAAAACCCTTACCCATTCCCATCCACAAAACCCTGCACCAAACGACCCCTTAGTAGCATTCAAGTCTCCTTTAAAGTTGGGGCTGGGAAGATTCTTGCATGGAAAATGCCTCTCTTGCCTCCCTGACCTTTATGTTGAATTGTTGATTCCTCTAAATCCATAGGTCATATGTGATGTGTGGACTCATCAATTTGGGACCTTAAATTTAGAAGAAACTTGAAAAATGTTGAGTTTTTGGATTGGCTTTCTCTTATGGAATATATTGAAATTTTTTCTTGGTTCTTGGATCCTCGAGGTCCTTCTCTTGTAGCTCCCTTCTGCAAGACCTCAAATCTCCTAAAAGCAGAGTGTATGTGAGTAAGTGTGAAAGGTTTTGCATTTTCTTGGTTCTTCGATCCTCGAGGTCCTTCTCTTGTAGCTCCCTTCTGCAAGACCTCAAATCTCCTAAAAGCAGAGTGTATGTGAGTTAGTGTGAAAGGTTTTGCATTTTCTTTTTCGTTTACTAATTTGAGCTCCTCTTGTCAATCAGTGCGACTCCAAAACCTACATACTCCGCTTTCTCTTATTAGTGCAATTGCAATATCAATATACAGTGTAGTTGCAAGGCTTTCTTTCGTGAGCTAGCCTTATTTACTTCCTCTTTTCCTCTTCGTTTCGGATGTCCATTCAATCTTTGATTCCAGTCCATAACTTCCTTGAATATAGCTCCTGTTGCTCTTCTCTAGCTAAGATGTCTATTAGCCAAGATATGTGCCCTGCAGTTTGGAGGATAGATACCCCCCGGTTGGAAAGTTTTACTGAAGAGTATTAACACCCAAGAGAAGTTCTAAAGAAAAAAAATCCAACCAACTCTCTCTCCAAATAGATGCATTCTCTGTTAGAGAGCTGAAGAATTACAAGACGACGTTTTCTTACTCAATTGTCAACAAGTCTCCTCCTTTTGGAACGTGCTTTTCTCCACTTTTGGGTTGTTTTGTCTGTTAGGTGGGAAGGTGAAAGCAAATATTCACTCTCTTTTGACTTATTAACCTTTCAAGAAGAAGGCTAGGAAGCTTCGGCTCACAAGTTTCCTTTGTTGCACTTTGATGGAGAGAAATCACCATATTGTGGGCAAAGATACAACTCTACTTTACTTTCTTGAGAATGCTATTTTTGTTGCTTACAGAAGTTTGAATACTCATGAGAGACTTCAAAAGAGGAGTCCAAATCTTGCCCTTTGCCCTTCCATTTGCCTGTTATGTTTGAGTAGTGAGGAGTCTTTGGATCACATTTTTCTGCTGGGAAAGGCTGCTGATTATTTTTGAGTTGCAAATTTGTCTCCTGAATAGGGCTGATGATTGATTGTTGGAGTCTTTGAGGGCTGGAAGTTGAAAAGTAAAGCTAGAATTCTTTGGAAGTTTGCCTCTAGAGCGTTTTTATGGAGTTTGTGGTTGAAATGAAATAGAAGAATTTTTTAGGATAAGTCTGCTTCTTTTGATTCCTTTTGGGATTTTGTACAGCTCAATGCATCTTGGTGGTGCCATAGCTATAGTCAATTCTTTTGCTCCCCCCTTTCCTTGATTCTTATTGATCGGAAGGCTGTGTGTAAATAGTTTCTTTGGGTGGGGGCTCTCTCATCCTCCAGGCAGCCCTTAGGCTGTGGCCCTTCTGTTTGTTCTTAATGTGAAGGTCTCTTATAAAAAAATGTTTTTTGATTGTTGCTGCCTCTTGGTGTAGGTTTAGTAATTACTTTTGTAATTATTTTATTTCAGATACTGGGCTTATTGGAGGGCTTTTTTGTCCTCTCTTGGCCTTGGGCTAACGCAGTTTCCTATCAATAATAATAATAAAACCAGAGCCTCATTTATTTTCTTAAGCTAGGGCCTCCTCCACCTTGAGGCAGAGAAACAACACTCCTAATGAGATGATTAGCCCCCCACTCTCCACTTTTCCCCATGAGAGAGTTGTTTGATTAAAGCAATATGGACAAGAGCCATGAAGACATATAGCAAGTGGGTGTTAGCTAAAGCTGCTTGATGTAAGATGAGCCTCTGTTGTGGTGGGCATAAACTTACTATGTTAGGATGGAATGATCCTATTTTAAGAGGAAGGAGGCTCATCTTACATCAAGCAACTTTAGCTAACACCCACTTGATTATAGGCCTGAAGCTTGTCTGAAAATCTTGATTACTCATCTCTTTAATCTTTCTACAATATTACAAAACGGATACACATGCAAGGGTGAACATTCAAACCTACAATTCCTTAACCACTAAGCCATACTCATGTCAATTCATCCATGAAATTTTTACTCTTGAGAAAAGGACGGTATCATTTGCAACCTGAAGGTGATTAATAATCATGTTGTTTTTTTTTGACAAGGCAACAAAGATTTCATTGAAGTAATGAAGGGAGACTACACTCCAAGTACAAGTTTTACAAAGATAAAAGAAAAAACAACCCAAATCCAATACCCAAGAGAACATTAAGAAATTCAACAACACAACAACAAACAATAACAAACGAAGCGAAAGGAACAAGAAGGAACCCAAGAAGCCACATTAACCACAACAAAAACAGACAATCACCGAACTGACGTAAAACCAGTTGAAACCTATGAAGATAAACTGCTAAAAAAGCCGACTAAAGAACTATCTCAGCAACCAACCAACAAGAGCACATAACAAAGCCTCCAAGCACTGCCCAACTGCAAATCGACCGAAATCTTGAGGACAAAACCCCACGCAAAAACACAGTCTTCAACAAAGATCTGATACTTAAAAACAAAAAAATCAACTATGATTCCTGAAGCAACACCACAATTAAACAAAATCTTTAGTCCAACCAACTCAGAAACCCTTCCCAAACCAACCCCGAGAGAACTCAACGAGTCACCACGACCTCAGAAGTAGGCCCAATGCCTCTAAACTGCAAACCACTAGCTTTGATTAACGATTCAAATCTACTTAATTCAGGAGGCACAATTGGATCTGCAACAGATGTCACACAAACCTTTTCCACCACTTTTGAACTGCCTGGAGAATCAAACAAACTAGCCAGATTGATAATAGGTGGATCCACCATCAAAGTCTTTGATAAGGAGAAGCCATTTGAACCTAACTCTGGACTGCTCAAACTAACCACCGAATCCACCACCGAGTCAAGATCAACCAAAGGGGAGAGTCCATCAACATTTATAGGAATTGGCATAGGATGCTGAAAAGAAGCTTTAATGAAAGAAACTTCCTTACTACCAATCTGGCATTCCTCAATAGTATAGGAAGGAAAGCTTGAACCTTTTTTGGCAAATGCCCTAACATTAGGCAGAGGAGAGATTTGTGCCTGCAAGCTACACACCGGACGCTCCTGGAAATCAGGAAAAGCGCCCTCCTGCTCCACAAGCTTCATCGCACTAACCTTTCCTTTCCTTGAATAAAACTTTCCACAGAATGTCTTTTTCACTCCCGTGGCAGGAGTAGCCAACAATGGAGAAGAGAGTTTACGGCGGTGTAAGACTCTCGCACCATCAGCCAAGCAAGCCTCAGTCCTACAACCCTCTACTAACGCCCCTCCTGACAAAACGGTTGCATCTCCATCCAAGCCACAAACCAAATTTCCCTCCCGAACCTCGACCAACGGAGCATCCCTTCCCAAGCGCGACCTCATCGAGCCTCCCACCAAATCCCCAGAGGCCGTCCCAACCACTGGATTCAAACAAACCTGTGAGGATGAGTAATCAACAACAGGGGAAGAGTACAAATTAAATCCAATCTCATCAACCATAACCGTAGAGATACGAGTCAGATCAAGTGAATTTGAAACATCCTTTAATACCAAGGAACCTGATAACACAGGAGGAGGATCCACACATTCCACATCACCAAACTTCAAAGAAAACTCACCTAGAAGTGGATCAATCACAGTAATTGAAGCCGGTAGAAAGCCACACAAATTTTGAGCCACCTTGATTTTAGCCCCTGAAAGATCAAGAAGATTCATGGTCTCCAAAGAAATATCCAACAAACCCCCCAGTTGCAACCCAATTGCTTCAAAAACCTCCTTTTTCCAGAGCTTCAAAGGCAAATTCCTTATTATTAACCAACCACCATACCCCCCAATTACGTTCGGTACACCATGTAAGTGATCTGTCCATCTCTCAATTTTCAAATGCAAATTACTTATCACTTGCCATTTCCCAATACTGGAAACAGAGAAGTCACTCGCAATATCATCAATCAAAAGAATGGCCTTGTCGGCCATGAATGGATTGATTTGAAACAACACTTGAAAATGAGCCTCCAACCCCTTCTTGACAGTCGACCAAGGAATATGGGCAAACAGACGAGTCACCACAAAGAGCTGTGGGAAATCCACCTCAACAACCTCTTCAGACTTTCGAACCCACCACCTTAATACCGAAGTTGAAACAGTACCCTGCAACGAGCAATCCCTCGCAACCCCCAAACCAAACCTGCCTCGATCTTCAACAGGAAGTCGCACCCCTTCGGGACTACTAAATTCGTCTCCAACCAACTCCGGAGAAGAAAAACGCACACCACAGGTTTCTGACGCCCCTGAATCCTCTAAAGAAGTGATAAAGTCACCAAGCATATCTCTAAAGACTTTCCAACCATTCAAAAACCTCCCCAAAGGAACACGGAAAGATTTCTTACCACCCAAAGGGGGCCACACAACACCAACCGCATCCCAATCTGCGTGCGAACGCACCTTGAATAATCCCAAAGTTGCTGCATCATCTCTGAATTTCCTTCGAACAAAGTAAGATGAAGACATTGAAACAATCGCCGAGAAGGACTTCAAAAACCAGTGGACATGAGCCAAGGACGACGATACCACCGAACCTCTACTGCCATCTTGAAGAAAGACGAAAGAAAAATCCTTCCAAATCCTAAACAGACGGTTGCCGATGCTACAAGAATGAAAAGGCATTGGAGTGCTCGGGAGGCAAACCGACATTAACCAAAACAGACTTCTTTTAATGGAAAAAAGAAAAAGAAACCCAGCCGAAGCTTGACGGGAACACTCCAACACGAGCGCTGAAAGATTCACCGCTGCGAGTCTCAAGGTTCCTCCAAAAGCTTCCCCTTAGCCAACGTTCGACGAGGAGGAAGGCAGCAACGAAACCCGGCGGTCGAACCGGCGGTGGGGCCCTCGGCGGCGCTAACCCGAGAGAAGAGAGAGAGATTGGAGACGGAGAGGGAGATGGATCGCGACTTCGGGGGAGCTCTTGGCGTCGTAGGTACAGGCGTGCGGACGAAGAAGAAGACGGCGCGGACGACTCGGCAACGACACCAGCGACGAGGACTGCTCACGATCGGAAGTGGGACAGTGGCCGGTTGGAGACGAAAGTTATCGGGGATTAGGGCTAGACGAGTGGAGAGAGAAGAGAGAAAAAAGGGGGGGGGTCCCACTGGGTGCGCTTAATTATTAATAATCATGTTGTTAACCTATATGTTCTAGTATTTGCAACTGCTCTTGCAAGTCAATTTTTGTGGGAGCAAATTCGATCTTGAAGTTCGTAAGAATAAGGCTAAGTTTTTAGCGAGTGAGCAGATAACTTCATATGAGGTTTTACTCTCCTAACATGGTCCTCATTTAAGATTCTTCTGCTTTTATTATTTATCAGGAAGCTACTGGATGCAAAAATCATTCACAGCCAACTTCCTCAAGTATAGTTGGTAAGGTTCTGAATCTGGCATTATAATTCCCTTTTGATTTATTCGTATTATTTTCTATACTTACCTTAAAACTCCAAATCATATATTGAATAGTATTGCCACACTTTGATAAAATGGAAAAATTAATGGATCTGGAATGGTTGAATGCTTGAACTGTGATGTGGAATGTTTTCTTTTATGTAATGCTCAATATAATCTTCTCTCTAGTATGAAATTGAGCACATTTGTGTTCGCAGACATTTGGGTGTTTTTTTCTTCGGAAAAAAAGAGGGAAGGAAAAAAAACTATTTATGTTCTCTTCTTCATCATGGCCACCCTTTATTGAAAAACATGAAAGAATACAAAAGGCCAAAGGGCATAAAAGACAACCCAAAAAAAGGAGCCAAGCCAAACTAACTATACAAATGTAGAAAAGCATCATTGTTTGCCCACTCAGTAGTTGATATGCAATCACACACTCACGTGGGGTATGCACAAGTTAAAAAGCATGGATTTTTTCTTGCTCTTGGTGACAACAGTAAAATTTAGGGTGATTAATTAGAAAAAAAATGTACAAATTCTTGAGTTGAACTATCTTCTTCATGCATTAGTAGGGTATCTATGAAGAATGGCATGTACTTGCCACTTTTGGTAAGAAACTGAGCTTACATCGAGAAAAATGAAAGATGGGTCCTCAACTACATGAAAGGACTCTAATCTAAAAGAATGATACCAAGTTGATAATTATAAAAAATCCGATTAACAGATGCCCATAAAGACACATTAAATCTCACAACCTCCCACACCTCCTCTAAAAACCTCTCTACCCCTCTAAAAATTCTGCAATTTCTCTCGAGCCACACCCTCCATAAGATAACAAAACAACAGGGCCTGTGGCAAGACTCTCCCTTTGTCTCAAAAAGGAGAGTTCATGATGATCTCCTCAATCATAGAGCGACAATCCTTATATTACACACCTTGTACTTGCAGTTAATGAAAGGAAGATATAGATTGTGTTGGAGATCATCACAGAGCTTTAAAATCATATTTAGACCAGATAGTGGAGGTTGACAAGGAGCTCAATTTAGATTTTTTTCCCCTGGTATCGAACATAAGTTTCTGCTTTATTTACTTTTCATATTTGAAGGTCTAGTGCCTTTTTCTAATTATAGTCGATAAATTAACTGTGGAGGTATATTGTGTGCATAAATAGGAGAAGGAGAAGGAGAAGGAGAAGTCATAAGAGACAATTTCTCTAATTGTTGAATCATCACAAAGGTTTCTTATAAAAAAAAGGAAAAATTGAAAAATGAAAAAAGTGACAGTTAGGTTATACGATCCACTACAAAGAAACTGGTTCATAATTCAAAATTTGTCAATTAATCATGCCCAAGGGCTCATATCTTTTTTGTAGGCTCGTCTTATGTAGCTAGGGAAAAGACTGCTTCATCTTCAGCTGATTCATTGTCCCCATTCACTTCTTGTTTTTACCAGAAGTTGGAAGAAAAAGTACTGTGCTGTGCCATATAAATATCTGTAACTTTTCAATTTCCTTCAGTCCTCTACATAAAATAAATACCTTTAACTTTTTGATTTACTTCAGTCCTTAATGTAATTCTTTTATGTATGTTAAATTATCTGAGCAGACAGAATGCAATAATAGTGCCTTTGGTCAAGAGCTATCAATTTTGGGACACTGTGATAATCGGAGACAAAGATTTAATGGTAGATTATCACCAAAAGTAAAACAGAAGGGTCAAACATCAGCAAGACAGCAGCCATTCAAATGTGCCAACAATCTTTCAACTGATGTACGTAAAAAAGTTTCTTCAGTTGCCGCTCAAAATTGTAAAAGTCCCGATCATTTAGATTCTCACGCCAGTGAAAGGCTGCCTGGCTGTTTTGGGAAGTAAGTTCTACTTTAACCATCCTTCTTTGACTACTTTTTAAGTTTTCCCCACCGTAATCCCAATGTGGTTCTGATGTAGAAACTGGACGCTGGCTTGAGATGACACTTTTTCATTTTTTTCCTTGTGAAATATACAATTTTATGCAATACAAGAATCTACTATAGCAGCAGAGATGGTTGGTCATGAAGATTCGTTACATTCTTTGTTTGTTTCAATACTTTATCCTTGCATTTAGATAAGCTAAAGTGATCTCTATCCTGCAATTAACTTGAATAAAGGACTTAGATGTCAATATCAATGAATATTCATGTCATGAACTGACGAAAGTGTCAATGGGGATTTGTAAAAAAAAAAAAAGGAAACAATAACAAACTTCGACAACAGTTCAAATAGGCCATAAAACATTACTCTTTAACACTTATATCCAAGAATATAAGATTGTAGCTCTATGAAGTTATGCAAATTATGGAAAGAGATTTCACAAGGGAATTGAAGAAAGTGATAAAGAAATTTAACTAAAAACTATAAATAGAAGACATGGAGTGTTTAGTCAGAAACTTCTTCTAGAATGGTGGTGCTATCACCCAAGCAGTCATTTGATGAAATGGGATGGAGCCTCTTTGCTTTGGCAAGAGGAGGCTTGGGGTTAGGCTCTTCAAACAAAGAAACAACACTTTTCTTCTTTAAGTGGCTGGAGAGATTCATGCAAACGAAAGGGGTTTTGTGGAAACGGGTTTGGCCAGTATCTGTGATATTATGTGAATTTTATTTTTGATATATGTGAGTGTCTAGACCCAACTTGCACTCACCATAACTATTCTCAAGGGACAACTGCCTCACCCTATTGTATTTGGTTGCAAAGGAAACACGTAGGATATTTAATCTCAAGTAGGTGACCACTATGACTTGAACCCACCCCCTTTAAGCTCGTTATTATTTACATGGCCCTTATTGATCGTTAGGCCAGCCTATAGTGGTTGATATTATGTAGATTATATCACAAGCTATGTTCATTTGAATGCTCAAAAGCCTTTTATGTTCTTTCTTCCTCTTTGTTCTTTACACGACTCTAGACTAAAGTCCATTGGATGTGTGTGTATTATTGGTTAGCTGTCCCATGCTAGGAAAAGATGGAAATAGAAAAAAAATTTGCGATTTGTGCAGGGTGTCTAATGGCTAGATGAGAGGTAGTTACATGTTTTTCTTTAAACCTATGGTCCTTTTACCTCACAGAATGCTACCGTCCAATATATACACAATGCGATTGATGACATTTCTTCAGGATTCCATCCATTTCATGAGTATGAAAGGGGATATATGTGATGCTCTCATGTATACATCTCTCTGTTTGAAGCCGTTTCTTTGAGGAGTTCAACTTATGCTTAGCTAGACTCAAGGGTTGCAGAGAGATGATTGCAGAGTTCATTTTCTACACTCTTTTTCGTGAGAAAGGAAGGATTTTGATGTAGGCATACTGTTGTTAGAGTGGGGGTTGTTGTACTGGATCTTTATTTCTGATATACAGTAACTTTTATTATTGTTAGAGTTACTGTTTTAGTTAGTCTTGGCTTTATTGTTTCATAACTTGTATAAAAGGCTGGTACATCTTATGTTTTGGAATTAATAAAAAGATCAGTAACTTGTTTTCTTGGAGATTTCGGTTCATCAGATTTTGTGGTGTGCCTATGTGTTGTTTTTTGGAGCCTTTGAGGAGTGAGGAATAACTGAGTCTTTGGAGAGCTTGAAAGGGCTCTAAGTGATGTTTGGAACCCTTATTAGGTTCAATATCTCTTGGGGTTCAGTGACGAAGTCTTTTTGTAATTACCCTTAGTTTGTCTTCTTTTGTGGACTTTATTTTTGTAATTACCCTTAGTATGACCTTGTTTTCTTTTATTTTTTTCCAATGAAAGTTGAGTTATTCATAAAACAAGTGTTATGCTCTCACGTAATAGATGTATATTTTAGTTTTAGAAATCTCTTATGACTTCTTTTTATTATTAATACTATCTTTCTAAAGAAACAGTATTTCATTGATGGAATGAAAAATACAATTTGAGATGAAAGTTTCCTAATCTGAAATGAGTTGCAGAAAGACGGTCAATTGGAAGTAATGGAAGGGAGACTAGAGAGTAACTACCAAAAGCTTTTGACATCTGCAGTATCCAAACCCCATTATCAAGCCATCTCACATCCTGAAAAATTTTTCCCTCTCCAACCAAATTTCCTACGAAACATCCCTTCCAAAAAATGCTGAGCCACGACACTTTGGCATTCCTTGTAGAAGGGAAGGCCAACAAAGAAAATGAATAAAATTAAATAAGTATCCAGTTGAGCTAAAAAAAATAAAATGTTGGCTCACCCAACTGTCGTTCTTAAAAATGAGTGATCTCAAAGTTGACCTAAATAATTTAATGACCTTTCATAACTTCTCTCAGTTAATTTTTCAGACTGCACTAGGCCTTCCTCGGTTCATGTGTTTACATTATTTTGGTCCCTTCTTTGCAGGAAGGATGCTTCTCAAGTTCAGCATTCAGATAGTCCAATGCCTAGAAAGGTACATTTTGAATGTTACCATTTGTTGGGTTTATGTTTTTCATCCTTGTGTCTTTGATGCGTAAGGCAAGATGTAATACCCATTTGTTACAATATTCTTACCATGTCAACATAATAGTTTGGTATAGAGCTTTGTGGGTGTTTTATCTCTCTGTTTTATGGAAAATCGCTCTCTAAGAAGGTCTTATTTTCTAAGTAGAAACAAAAAAGATCATGTTATAAGATGGCATTTGTGTTGCCCTTACATTGGATACTTATGTTCGGTTCTTTATGATACTTTGTTCATCTAATTGATGAGATTTCTCTGGACAGGGGCAGACCATTGTAGTTGTAGATGAGGAGGAAGCTTTGGCAACGAAAATACCAGAGCATGATGACAAATGGTAAACTTTACGTGCTTGTTTGTTTAGTCCATTGACATGCAGATTAGCTTGGACAATTTTATGATTCAAGCTTTGATTTACATTTTTGTGGAACTTTTCAGCATGAAAGAGTCCAAGATCTACTACCCATCGAGGTATGTGACACTTTTCCATATTATCAATGTTTGAGTATCTTTTCTTTGAATTTATGATTCTATGGAAGTGGTCTTTTATTATATATTCTACTAACTGTTGGCAGGGATGATCCAGAATCCGTTGAAATTTGTTTTGAGGACATAAAGTGTCTTGATCCAGAGGGTTATTTGACATCAACGATCATGAACTTTTATATTCGGTTAGTAGAGCACTATTTGGACCTGTGGAAGTGTTTAACTTGACGTGTGATTAATTGAACTCCAAATTTTTTTTCTCATCCTCTATGAACTTTGATATGGCATATGATACATTGTTATCACAAAAACTCATGGGGTTTAGTTTTCTTTTCTTCTAATTTGAAATAAGTTCTTGTTGTTTTGGCTGGAACTTGGACTGTTTTCTATGTTTTTTTTTTTTTTGACAAGAAACAGCCAAAATTTTCATTGAAATGATGAACTGAGAGACTAATGCTCCAAAAACAAGATAAAAAAAGAACCAATAACTAACAAGAACAAATAAAAGCATTCCAGTTTAAATTAATATCAAAAAGAGAAAGCCCCTAAAAAAATATTGGAAAGAGCACACTAGGAAGAGGAACTAGGAAGAGGAAAAGTTCGCTGATTCCGCTTGAACCAAATATATGATAAAATAGCTTTGACTGCATTTCCCCAAAGAACTGGGTTGCGTTGGTAAAGGAGGCCCCACATTGATCTGTAAAACATTGTCTTTAACACAACTCGTAAACACCCAACTCACATTGAAGATATGGAATAACTTGGTCCAACACTTCCCAGCATATTCACAGCTAAAAAATAAATGAAAATCGAATTCTGACTTCTTTTGACTAAGCAAACAAATCGATGGCTGCAAATTCATAAGATCCTTTGCAATACATCACTTGTGTTCAAATATCCAAATAGAAGAATCCATAATAAAACTTTAATCCTCTTGGGGCATTTAGACTTCCAAATAGCATTCTAAACTAGTTTGTCCAAGGGCGAGTGGTTAGCTAAATACTTAGACAGCAAGCTGACGGAAAACAAACCAGAGGAATCTGCTAACCACCTCCTTGAATCCTCCAACGAAGAAGGCTTAAAACTGCTCAACACTGCCAACAAATCTGAAACACATCAATTTCCTCTTCCTTCGACGATCTCCTGAATGTTGGGCTCCAAGAACCAGTATTAACATCCCAGACATCAACCACACTAACTGATTGAGAAGAGGATAACAAAAATAAAGAGTGAAAGTTATGGCTGATGGCTTTGATTTCTGGTTGACAACCATTTATGGGCCAACCCAACATACATTGCGTGAGAATTTTTGGCAAGAACATCACGATCTTGCTGGTTTGGGAGGTGATCGATTGATCTTAGGTGGTGATCTTAATGTTACTAGATGGTTTTGGGAGAAGCCACATGGTTGACTGACTACTCGTAACATGAGAACTTCCAATTAGTGGATTGCAACCTATCAATTAAGTGACATCCCTATGCAAAATGGCTGCTTTACTTGGTCTAGAACAAGGAAAAATAGAACTTAATCACTGCTGGATAGATTTTTGGTTACAAATGAATGCCCCCAGAAATTTGGGTTGGCTCACCTTATCCGTCTAGAGAGGGTAGTTTTAGACCATTTTCCTTAAGCTCTTAATTTTGGAGATTTATCATGGGTCCTAGTCCTTTTCGATTTGAAAATTCATGGTTGATGCATAAAGATTTTAATCTCATAGTAGATTCTTGGTGGAATCACAATCCTATGCAAGGTTGGCGAGGCCATGGATTAATGATGAAGCTTAAAGGGTTAAAGTTGGAGCTTCTCAATTGGAACAAAAGTTTGAAGGCGAATTCAACTAATATTCCTAACTTGGTTGTTCAGTTGAATTCTTTGGATGTTGTTGATGACCAGGGAAGTCTTTCTCCTATACAAAGAGCTCAAAGAATCCACCTCCGTGAACAGATTGACGACTTTACAATGAAAGAACATATCTTTTGGCGACAACGTTGTAAGCTTCAATGGTTAAAAGAAGGTGATGAAAATTCTAAATTTTTTCATCGATACTTGGCTGCGGGAAAGTGGAAAAACTCTATTACTGAATTATTATCACGAGATGGGGTGAGTTTATTAACGGTCACTGATATTGAGAGGGAGTTCATTGATTTCTATCAGACTCTTTATACCAAAGATGATAATACACAATTTCTTCCAACCAACCTTGATTGGAGTCGGATTAGTGTTGAACAAGCAACTGATTTGGAAATTCCTTTTTCTAAAGAAGAGGTTTATCTTGTTGTCCACTTGTTAGGATTAGGAAAATCTCCTGGGTCAGATGGTTTTGCTGCAGAATTCTTTAAACACTCTTGGAAGTAAAACAAGATGTGATGGTAATGATGATAACGTCTTTCATAATGGGATTGTTAATGTGGCATTGAATGAACCATATATCTGTCTTATCCCAAAGTGGGTGGTTTCTAAAACAGTTAATGATTCAACCAATTAGCCTCATTCCATGTGCATATAAAATTATTGCCAGTGTCTTGTCTAACCCGTTGAAAAATGTTTGCCCATCTACCATAGCGGAGTATCAAATAGCTTTTGTGTCTAATCGAAAAATATTGGATGCTTCTTTGATTGTAAATGAATTGATTGATGATTGACTTTTTGCTAAGAAAAAAGAAGTAGTTATCAAACTGGATCTGGAAAAGGCTTTTGATAAAGTGGATTGGAAATTCTTGGATGCAGTTTTATATGTTAAAGGCTTCGGTTCTACATGGAGGAAATGGATTAGAGGATGTGTATCTAGTGTTAATTATTCCATTATCATTAAAGTTAGACCTCGTGGTAAAATCATTCTAACCAGAGGTATTAGACAAGGTGATCTACTCTCTCCATTTCTTTTTATATTGGTCACTGATTGCCTTAGTTGATTATTGACTCATAGTGAAAGATTGGGAAATTTTTTTATTCATCCTATTGGTAACTCTTCTTTCCACTTGAACCATTTATGGTTTGCTGTTGATACACTTTTGTTGTCAACTTTTGATACATTGGCCATGGATAATTTATTTGCTGTTATTAAAATTTTTGAGCTTGCTTCTGGATTAGACATCGATTTTGGCCAAAGCGAATTATTGGGTAAATACGAGTGATTTGCATATGGAGGAATTGACAGCGAAATTTGGTTGTAAACAAGGTGTTTGGTCGACAACTTACCTTGGTCTTTCATAAGGAGGTAATCCTAAGTCATCAGCTTTTTGGAATGTTATCTAATACAAGCTATTCTTTCAAGTCAAAAAAAAAAAAAAAAAAGCTACTCCTTCAAGTCTGCCCATTTATTATCTCTCTTTATTTAAACTATCAACAAAGTGGCTAAATCTTTGGATAAGCTTGTTAGAGACTTCTTTTGGGAAGGATCCAAAGGTGAAGGTGGACTACATAATATAAATTGGGAGAAGACTCAACTTCCCCACTTAATGGGAGGCCTTGGTGTTGGAAATTTTAATTATAGGAATGCTGCACTTATGTCTAAGTAATATAAATTGGGAGAAGACTCAACTTCCCCACTTAATGGGAGGCCTTGGTGTTTGGTCGACAACTTACCTTGGTCTTTCATAAGGAGGTAATCCTAAGTCATCAGCTTTTGGAATGTTATCTAATACAAGCTATTCTTTCAAGTCAAAAAAAAAAAAAAAAAAGCTACTCCTTCAAGTCTGCCCATTTATTATCTCTCTTTATTTAAACTATCAACAAAGTGGCTAAATCTTTGGATAAGCTTGTTAGAGACTTCTTTTGGGAAGGATCCAAAGGTGAAGGTGGACTACATAATATAAATTGGGAGAAGACTCAACTTCCCCACTTAATGGGAGGCCTTGGTGTTGGAAATTTTAATTATAGGAATGCTGCACTTATGTCTAAGTAGATTTGGAGATTTCTGCACGAACAAAATGCTTTATGGAGAAAGACTATAGTGGCAAAGTACTATGATAGTCATGATTTTAATGGTTAGCCTTCATCCATTACAAGAGCGTATTCCAAATCACTTTGGAAACCCAATTGTTGCTATGTGGAATTGTGGGTTGCCGTATTCAAAAGGTTGGGAATGGTTTGTGTACCTTTTTTTGGAAAGTTGTTTGGCTTAATGGAACTTCTCTATCAACAAGTTACCCAAGGCTTTATCCACTGTACCAATGATTACTACTGCACAAGCATGGAAATCGACGGATGTTTTTGTAGGCTTGAGTTTCAGACGAAACCTAAACGATAATGAAACTATAGAATGAGCTAGTCTTTTGCATCTCTTATCGATGGTTAATTCATTCAAATGGTCCAATTACAATGGAAGGATTTTTACCCAAAGAAGATAAAAATCTTCTTATGGGAGCTTAGTCATGGTGGAGTTAATACAACTGATAAACTTCAGAGAATAAAGCCACATTTATCTATCTCTCCATCTTGGTGTGTCGTGTTGCACAAGTGCTGAAAGTCTTAGTAATTTATTTGATCATTGCTCCTTTGCTTTACAATATTGGTCAATTATGAAGGAGCTTTTGGTTGGTTGCTGGTCTTGCCAAATTCTATCTTTGATATTCTTGCATTGGTTTTTGTGGGACACCCTTTTAAGGCGGCAATAAAGACCCTATGGTTGGCTATCGATCATGCTTTCTTCTGGTATCTTTGGTGTGAGCGCAATGGCAAAATTTTCAGGGATGTTTCTTTGACTTTTAATGCTTTCATGGATTTTGTATTATTCAATGCTTTTTATTTGTGTAAATGTACAAAATGTAAACACCCTTTTAATGCTTATAGTCTCTCTACTCTTCTTTCCATTTGGAAAACATTTCTATCACACTTTTAGTATTTTGAGATTCTTCCCAACATTTCTCTCTGTCCTTTGTCAAAAAATAAAAAATAAAAAATAAAAAAATTCCCTCAAGATTTAGTTGAAAATCAAATTTTAAATTAAGGGCTGTCGGATCCTAATTTGAGTAAGTTCTCCTATTTAATCAAGGCTAGTGGATTACAATTTTGAGAAATTACCCCTTTAGCTTCCAAGGGGATAGCCAGAAAGTTTAAGCATTGATTATTTGTGCTGGTATTCCCATGAAAGATCTTTTCTTGGAATACAACGGGTGGTCTAGGAGATGGTTCTTAAAGAGTTGGTCTGGAAAGTTTATGTTAGGTATCCAAACCAGAACTAAGAACTCAAAGGAACTAATATATTGCAATATCAATAAAGAAAATTACAATATCCAATAGCCTTTCGAGAGAGCTATCTCTCGCCTTAAGTCCCACAATAGACACTACCAAAATGTTGCACCCCAAAATAAGGACCCTAACACTACTATTTATAACCAAGACACCCTAGTTAAAATCCCCAAAATATCCCTTACTGATATACTACTAATCTCCCACATAGACTCTCTACTATCCCATACTAGTACTCTCACAGTTTATTGCAGAAACGTTGTCCTGGTTTGTTATTGATCCAGAATCCAATTTGTTTACTCTAGATTTGTAAAATTTATAAGGAGCTTTGGAAGATTTAAATCGGGTTCATGTTGAATTGTATGGTTGCCCGGGAGGATTGCTGATCATGTGTGATGAGATTAAAGTGAATGCTAATTGCTATAGAGGCTTTAAAGGGTGGGTTCTCTCTTTTGGTAAAAGCATCGATTCGATAAGATAAATTTTGCCTTAATGATTACAGAGTAAGATTGGAATTTCATGGTCTAGAAAAAGGAAAGCTTGCTCTCTGTTTTTAATAGTCCGTTTCTTCTCTAACAAGGCTTGGGATGTTCCATTTGATAGCACAAGAGCTTTGAGGCAGGACTTGGGCTTGGGGATTAATTAAATTGTTGATGTTCTGGAGTTTGTGCCTTCTCATGTGGCAGTTGTGTTCGACCATGATGAGATTTCTGTTCACATGCCAAAAGCATTATTCTTTAGTTTCTTTTGGAACCCTCTTTGCCATGGGGCTCTCATTTGCTTTGGGTCAATGTAGTAAAAGTTTTGATTTCAGATCGTTGAAAAGAAATCTGAAGCTTGTTCATGGTGTTCTCTTTCCAAGTCTTTTGCTGGTTTTCTTTGTTAGATTTTATTCTTAATTGGGATGATTTTGTTTTTCTACTTTGCTATGAGTTTCAGGTTCTTATTTTCACTCCCTTGCGGAGCTTGTATCTTTGAGCATTAGTCTCTTTTCATTTCATCAATGAAAAGTTTCGTTTCTGGTTCAAAATAATCAAATCCTTTTATTCGAATCATTCAATCACTGAGATTAACCCTAATTGGAATCTTTTCTGTATATTTTATGCTTTTATCACGTAAAAACTAAAGAGCCTTATTTTGCAATTTCAAGTCTGTTTCAATGTTAAATTGTAGCTGGGATGGTTTATAATTAAAAAAAAAAATAATAATAATAATAATAATAATAATTCTTATTATTGTTGAAAGTTGACTGTAACTTTTCAAATCTCCAATAAAAAGTTTGTTTCTTGTAAAAAAAACGAATTCCACTTTTTTACTTGCTATATATGCAGGATTTCATTGCCGAAGTCTGTTTTTCTCTCCCTTTACATTTTTTCTTCTGTTTTAGTTGAATTTTTAGGTTTTGATGCAGTTTTGGGTTCCATGGGCAACTTCCCTTTTTATGTAGGTATTTACAGCAGCGAGCGTTTTTAACAAATAAGGTGATATGCAATTATCACTTTTTCAATACCTATTTTTATGAAAAGCTAAAGGAGGCTGTATCAAACAAGGTAGTCCTATACACTATTATCAAGGGTGTTTCACTACTTTTACTCCCCTCTTCTGAGTGATTGTTTCATTTTTTGTCGAGAGAAGTCTATAGTATTATGGCGCCAAAGTTTCTATTAACCTTAACCCAAGGTGAGGTCAGAGCCTGCCTAGATGCATAATTTAAATTTCAAATAATACATCCGGAAAAGATAATTCTTTTAGGAGTATAAATATAGAAAATTATTTAATCGTATATATGAACTGAAGTCATTCATAAGAAAAAGAGGCAAAGAAGAAATTGTAGATTTTTTTTAAAGTTAAAAAATAGTATTGCATTTAGGGAATTAGGGACAAAATGTTAGTGTGTAAAGAAAAATTGTAAACCTTATTGTGAGAAGGTTCACACCTAAGCTAGACTGCCTCGTCAGACATGTGGCAAAATTTCCAAGGTGTATCTACGTGACACACCTAGGTGTATGCTTTTAATCAATGACCTACATATATTGTTTTTATAAACTTTATTTACTGGATGTGGAAACACGTGTTGGCATGTGATGCGAAATGATATTGCAAAAATGACCTGTAACAGACACGGCGATATATCATATGCTGTTATATGTCAAACCCAGACACAAAATAGCTCCTTGGGATTTTTGTTTGGGATGTCTTATCTTCATTTATAAGATGTCGAAGTTTTTTTGTAACTATTCTTTAAGTCTTATCTTGCTCGACTAGAGCCTCTTTTTGTAGTTGGCTTCCCCCACTTTTTGTCGGCTTCTCCTTTTGTATGCCCGTATATTCTTTCATTTTTTTCTCAATGAAAGTTTGGTGTTTTATAAAAAAAAATCATATAATTTGAATAGTTTTTCTGTAATACAGTATTTCCTCCTGCAAGATATTCTTGATTTTGAATATTTTGACATGTTAAGGGGAAGGACAGAGACAATTTTTTTGTCAAGTTCAGAAGATGGTGGAGGGGCGTTAATATATTTCAGAAAGCATATGTACTGATTCCAATCCATGAGGAGTAAGTTCCTATAATCATCTCTTTTATATCTGGATGTGCAATAACTTGTCTTTTTGAGTTAATGGCACCATCAGTGCATGCTATTATTTATTTAGCAGTTCTCGTAGCAGGACAGTCTCCATGTGAATGTGTAGAATCTCATCATATTGATAATTGATTATGCTATTTATTCCCTGTAGCTCTTACATTGTCTATAGCTTTTCAATTACTCCTGTTACTGCCCACGTTGGGCGGGTCCCATGGCCAAAGGGAAAGCTTTAAATCAGATGCCTTGGTCACCTAGTATATTATATTCTATGCATTTTTTTGTTGGTCTGGTACGTTTTTTTAAAAAAAATCTTCTGTGTTGACCATAATTTTGATTTTATCTTTTGGTTTCTGATGTGTTCAGTCTCCACTGGAGTTTGGTGATTATATGTTTTCCACAGAAGGAAGATGAGTCAGGACCCATTATACTTCATTTGGATTCTCTGGGACTACACTCTAGCCGGTCAATTTTTGATAACATTAAAAGGTATGTAAGCCCCTTTCTGACGCAACGAAAATTCCTTAACCATTTCTGGAAATTTTTGGATCGTCTACGTTGGTGAAATTTATATTATTGTCTGTTTTGTCTTCTGGACCTTCTTTTAGATATACCGTAAAGATGTTCTTGAGAAAGCGATTAAGATCTAATTGCATAAAATGTCATCTGTAGTTTTATAAAAGAGGAATGGTGCTACTTGGATCGAGAAGTTGCTGATTCAGATCTTCCAATGCCGCATAGAATATGGAAAAATATCTCTAGGAGAATTGAAGAGAAAATAATCCAGGTAGTCTCCTCACTCTTATTATAGTCATGTTTTCTGTGTATTTTGTTAACCACATATGGTAATTTGTCCAGATTTACATGTATTATGTAAACCAATGGGAACCAGAAATCGAAAGGGAAACTTACGGAAAATAGTTTTTGAATTTAAGTCGTCAAACGTTTGATTGAGCAATTGGATTAGTAAGTCATTTACAGCACAAAATAGTTTTGCTACAATTTGGAAGTGTAAGAAACTCTGCCACACAGATGTAAAAATCAATAACTCATCAACTTTCCTTCATGAACCCAATAATTTAAAATCATTTCTTGTGTTGTGGATTTGAGATTTTACCTTTCAATTTAATGAATGGGCCATCACCAGGTTCCACAGCAAAAGAACGACTGTGATTGTGGTCTCTTTGTTCTATACTTCATAGAACGTTTCATTGAAGAGGCTCCTGATAGACTAAAAAGGACGGACCTAGATATGGTATTTATGCTTATGAATCGACTTTTATAATACTAACTGTGCTCCTGGCATGCATCATTGGCAATGACATGATGTGATGTTTTGCTCATGAAATATCAGTTTGGCAAGCGGTGGTTCAAACCCCAGGAAGCTTCCAGCTTGAGGACGAGGATAAAATGTCTGCTCAAAATAGAGTTCCAAGATGTGAAAAAACAATGCCTATCTGGTTCTGTGGAGAGCTCTTCCTCAGATCAGGCTCCAAACCAATGAATTGGCAGCTCTTTGTACGCAAAATGACATGTTGTTTCAGTTCACTTGGGATTGGTCGAGCTGTGGTATCGTCTATCGTGCTTGGCTAGTGAAGTTAAAGCCATCTCACTGTTCTCGTACTAGCAACCAACCGATCTTAGGTATGATTTTCTAAAAGATTCAGAAATGTAAAGCCTCAAAAAGTGTTTGCAATAGGTTGAAGTTATATGGTGGGTTGTTAGGGTCACATCCAAACACTAAGAAATCAAATTAGACGAGCTAGAAAAGCTCCATAACTTGGCTCGATTTCCAAGGACATGAGGAACATACCATAGGCTTAGGCTTTAGACCATTGCATGCAGTGGACTTGGAGTAAATTTGTCCATAGGAATTGTGTTTCTATTGTGAGTGACCGCTCCTAGCCTCCTGGGCATGATGCCTATTTACATTTTTTACTGAGTAAAAAGTGCTGTTGGTTAATTTCTTCTACCTTTCCCTGTTCTCTATGACTCGACTCTATACACTGTAGATATTTTTTGTATCAGTTAATGAACATTTAATTA

mRNA sequence

GAAAAATTAAAAAAACCCAAAACTCTGTTCTGCGACTTGTGTCAAAGGTGTGTTGTCAAGAACAGAGAGGGGAAACTGAGAGACAGAGAGAGGGAGACGAAATAATAGCTTAAATTTCTGCTGCACCTGCACCAGATGCAAATCCATGGTGATGGAGCAAGAGAAAACGACAAAGAAACCTCTGAATATTGACTGGGACGAACTGTTCGGCTGTAAAGACGAGGAGCCTCCACTGGAAATAGTCATTTTGCCCGCCATCGCGAACTCGAAACACTTTGAAATGGAATCCGATCGCCAACATTTGGTAAGAGAAGAATACCAGAAGCTTAGTGATAGCGAACTGGAAGAAAAGATTCGCAGAATGCATCAATTCTACGAGTCCACGGCTTGCAAATTGCCAGACAAGGGGCAAAAGTATCTTCGCAATCTGGAGCTGTCCATGGAAGAGAGGGAATCCAGAAAGCTCCGTCGAGTTGAAAAGGAAGCTACTGGATGCAAAAATCATTCACAGCCAACTTCCTCAAGTATAGTTGGCTCGTCTTATGTAGCTAGGGAAAAGACTGCTTCATCTTCAGCTGATTCATTGTCCCCATTCACTTCTTGTTTTTACCAGAAGTTGGAAGAAAAAACAGAATGCAATAATAGTGCCTTTGGTCAAGAGCTATCAATTTTGGGACACTGTGATAATCGGAGACAAAGATTTAATGGTAGATTATCACCAAAAGTAAAACAGAAGGGTCAAACATCAGCAAGACAGCAGCCATTCAAATGTGCCAACAATCTTTCAACTGATGTACGTAAAAAAGTTTCTTCAGTTGCCGCTCAAAATTGTAAAAGTCCCGATCATTTAGATTCTCACGCCAGTGAAAGGCTGCCTGGCTGTTTTGGGAAGAAGGATGCTTCTCAAGTTCAGCATTCAGATAGTCCAATGCCTAGAAAGGGGCAGACCATTGTAGTTGTAGATGAGGAGGAAGCTTTGGCAACGAAAATACCAGAGCATGATGACAAATGCATGAAAGAGTCCAAGATCTACTACCCATCGAGGGATGATCCAGAATCCGTTGAAATTTGTTTTGAGGACATAAAGTGTCTTGATCCAGAGGGTTATTTGACATCAACGATCATGAACTTTTATATTCGGTATTTACAGCAGCGAGCGTTTTTAACAAATAAGGTGATATGCAATTATCACTTTTTCAATACCTATTTTTATGAAAAGCTAAAGGAGGCTGTATCAAACAAGGGGAAGGACAGAGACAATTTTTTTGTCAAGTTCAGAAGATGGTGGAGGGGCGTTAATATATTTCAGAAAGCATATGTACTGATTCCAATCCATGAGGATCTCCACTGGAGTTTGGTGATTATATGTTTTCCACAGAAGGAAGATGAGTCAGGACCCATTATACTTCATTTGGATTCTCTGGGACTACACTCTAGCCGGTCAATTTTTGATAACATTAAAAGTTTTATAAAAGAGGAATGGTGCTACTTGGATCGAGAAGTTGCTGATTCAGATCTTCCAATGCCGCATAGAATATGGAAAAATATCTCTAGGAGAATTGAAGAGAAAATAATCCAGGTTCCACAGCAAAAGAACGACTGTGATTGTGGTCTCTTTGTTCTATACTTCATAGAACGTTTCATTGAAGAGGCTCCTGATAGACTAAAAAGGACGGACCTAGATATGTTTGGCAAGCGGTGGTTCAAACCCCAGGAAGCTTCCAGCTTGAGGACGAGGATAAAATGTCTGCTCAAAATAGAGTTCCAAGATGTGAAAAAACAATGCCTATCTGGTTCTGTGGAGAGCTCTTCCTCAGATCAGGCTCCAAACCAATGAATTGGCAGCTCTTTGTACGCAAAATGACATGTTGTTTCAGTTCACTTGGGATTGGTCGAGCTGTGGTATCGTCTATCGTGCTTGGCTAGTGAAGTTAAAGCCATCTCACTGTTCTCGTACTAGCAACCAACCGATCTTAGGTATGATTTTCTAAAAGATTCAGAAATGTAAAGCCTCAAAAAGTGTTTGCAATAGGTTGAAGTTATATGGTGGGTTGTTAGGGTCACATCCAAACACTAAGAAATCAAATTAGACGAGCTAGAAAAGCTCCATAACTTGGCTCGATTTCCAAGGACATGAGGAACATACCATAGGCTTAGGCTTTAGACCATTGCATGCAGTGGACTTGGAGTAAATTTGTCCATAGGAATTGTGTTTCTATTGTGAGTGACCGCTCCTAGCCTCCTGGGCATGATGCCTATTTACATTTTTTACTGAGTAAAAAGTGCTGTTGGTTAATTTCTTCTACCTTTCCCTGTTCTCTATGACTCGACTCTATACACTGTAGATATTTTTTGTATCAGTTAATGAACATTTAATTA

Coding sequence (CDS)

ATGGTGATGGAGCAAGAGAAAACGACAAAGAAACCTCTGAATATTGACTGGGACGAACTGTTCGGCTGTAAAGACGAGGAGCCTCCACTGGAAATAGTCATTTTGCCCGCCATCGCGAACTCGAAACACTTTGAAATGGAATCCGATCGCCAACATTTGGTAAGAGAAGAATACCAGAAGCTTAGTGATAGCGAACTGGAAGAAAAGATTCGCAGAATGCATCAATTCTACGAGTCCACGGCTTGCAAATTGCCAGACAAGGGGCAAAAGTATCTTCGCAATCTGGAGCTGTCCATGGAAGAGAGGGAATCCAGAAAGCTCCGTCGAGTTGAAAAGGAAGCTACTGGATGCAAAAATCATTCACAGCCAACTTCCTCAAGTATAGTTGGCTCGTCTTATGTAGCTAGGGAAAAGACTGCTTCATCTTCAGCTGATTCATTGTCCCCATTCACTTCTTGTTTTTACCAGAAGTTGGAAGAAAAAACAGAATGCAATAATAGTGCCTTTGGTCAAGAGCTATCAATTTTGGGACACTGTGATAATCGGAGACAAAGATTTAATGGTAGATTATCACCAAAAGTAAAACAGAAGGGTCAAACATCAGCAAGACAGCAGCCATTCAAATGTGCCAACAATCTTTCAACTGATGTACGTAAAAAAGTTTCTTCAGTTGCCGCTCAAAATTGTAAAAGTCCCGATCATTTAGATTCTCACGCCAGTGAAAGGCTGCCTGGCTGTTTTGGGAAGAAGGATGCTTCTCAAGTTCAGCATTCAGATAGTCCAATGCCTAGAAAGGGGCAGACCATTGTAGTTGTAGATGAGGAGGAAGCTTTGGCAACGAAAATACCAGAGCATGATGACAAATGCATGAAAGAGTCCAAGATCTACTACCCATCGAGGGATGATCCAGAATCCGTTGAAATTTGTTTTGAGGACATAAAGTGTCTTGATCCAGAGGGTTATTTGACATCAACGATCATGAACTTTTATATTCGGTATTTACAGCAGCGAGCGTTTTTAACAAATAAGGTGATATGCAATTATCACTTTTTCAATACCTATTTTTATGAAAAGCTAAAGGAGGCTGTATCAAACAAGGGGAAGGACAGAGACAATTTTTTTGTCAAGTTCAGAAGATGGTGGAGGGGCGTTAATATATTTCAGAAAGCATATGTACTGATTCCAATCCATGAGGATCTCCACTGGAGTTTGGTGATTATATGTTTTCCACAGAAGGAAGATGAGTCAGGACCCATTATACTTCATTTGGATTCTCTGGGACTACACTCTAGCCGGTCAATTTTTGATAACATTAAAAGTTTTATAAAAGAGGAATGGTGCTACTTGGATCGAGAAGTTGCTGATTCAGATCTTCCAATGCCGCATAGAATATGGAAAAATATCTCTAGGAGAATTGAAGAGAAAATAATCCAGGTTCCACAGCAAAAGAACGACTGTGATTGTGGTCTCTTTGTTCTATACTTCATAGAACGTTTCATTGAAGAGGCTCCTGATAGACTAAAAAGGACGGACCTAGATATGTTTGGCAAGCGGTGGTTCAAACCCCAGGAAGCTTCCAGCTTGAGGACGAGGATAAAATGTCTGCTCAAAATAGAGTTCCAAGATGTGAAAAAACAATGCCTATCTGGTTCTGTGGAGAGCTCTTCCTCAGATCAGGCTCCAAACCAATGA

Protein sequence

MVMEQEKTTKKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQKLSDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRVEKEATGCKNHSQPTSSSIVGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSILGHCDNRRQRFNGRLSPKVKQKGQTSARQQPFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHASERLPGCFGKKDASQVQHSDSPMPRKGQTIVVVDEEEALATKIPEHDDKCMKESKIYYPSRDDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKLKEAVSNKGKDRDNFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPIILHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQVPQQKNDCDCGLFVLYFIERFIEEAPDRLKRTDLDMFGKRWFKPQEASSLRTRIKCLLKIEFQDVKKQCLSGSVESSSSDQAPNQ
Homology
BLAST of Tan0020931 vs. ExPASy Swiss-Prot
Match: Q2PS26 (Ubiquitin-like-specific protease 1D OS=Arabidopsis thaliana OX=3702 GN=ULP1D PE=1 SV=1)

HSP 1 Score: 364.8 bits (935), Expect = 1.8e-99
Identity = 230/576 (39.93%), Postives = 325/576 (56.42%), Query Frame = 0

Query: 2   VMEQEKTTKKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQKL 61
           V++ + + KK   IDW      +DE P LEIV              SD Q    +  + L
Sbjct: 8   VIDVDCSEKKDFVIDWSSAMDKEDEVPELEIVNTTKPTPPPPPTFFSDDQ---TDSPKLL 67

Query: 62  SDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRV-EKEATGCKNH 121
           +D +L+E++ R      +    LPDKG+K    + L + + E  K RRV E         
Sbjct: 68  TDRDLDEQLERKKAIL-TLGPGLPDKGEK----IRLKIADLEEEKQRRVLEGSKMEVDRS 127

Query: 122 SQPTSSSIVGSSYV--------------------AREKTASSSADSLSPFTSCFYQKLEE 181
           S+  SS+  GS  +                    +R+  A S   S S F++ F    + 
Sbjct: 128 SKVVSSTSSGSDVLPQGNAVSKDTSRGNADSKDTSRQGNADSKEVSRSTFSAVF---SKP 187

Query: 182 KTEC-NNSAFGQELSILGHCDNRRQR---------FNG-RLSPKVKQKGQTSARQ----Q 241
           KT+  +  AFG+EL  LG C+ R+ +          NG RL P V  K + SA+Q     
Sbjct: 188 KTDSQSKKAFGKELEDLG-CERRKHKAGRKPVTRLSNGWRLLPDV-GKAEHSAKQFDSGL 247

Query: 242 PFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHASERLPGCFGKKDASQVQHSDSP---- 301
                N  S +   K   + +      D  D    +      G +   +     SP    
Sbjct: 248 KESKGNKKSKEPYGKKRPMESSTYSLIDDDDDDDDDDDNDTSGHETPREWSWEKSPSQSS 307

Query: 302 --MPRKGQTIVVVDEEEALATKIPEHDDKCMK--ESKIYYPSRDDPESVEICFEDIKCLD 361
               +   T++ VDEEEA  + + E   +  +  +  I YP+RDDP  V++C +D++CL 
Sbjct: 308 RRRKKSEDTVINVDEEEAQPSTVAEQAAELPEGLQEDICYPTRDDPHFVQVCLKDLECLA 367

Query: 362 PEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKLKEAVSNKGKDRDNFFVKF 421
           P  YLTS +MNFY+R+LQQ+   +N++  + HFFNTYFY+KL +AV+ KG D+D FFV+F
Sbjct: 368 PREYLTSPVMNFYMRFLQQQISSSNQISADCHFFNTYFYKKLSDAVTYKGNDKDAFFVRF 427

Query: 422 RRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPIILHLDSLGLHSSRSIFDN 481
           RRWW+G+++F+KAY+ IPIHEDLHWSLVI+C P K+DESG  ILHLDSLGLHS +SI +N
Sbjct: 428 RRWWKGIDLFRKAYIFIPIHEDLHWSLVIVCIPDKKDESGLTILHLDSLGLHSRKSIVEN 487

Query: 482 IKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQVPQQKNDCDCGLFVLYFIE 534
           +K F+K+EW YL+++    DLP+  ++WKN+ RRI E ++QVPQQKND DCG FVL+FI+
Sbjct: 488 VKRFLKDEWNYLNQDDYSLDLPISEKVWKNLPRRISEAVVQVPQQKNDFDCGPFVLFFIK 547

BLAST of Tan0020931 vs. ExPASy Swiss-Prot
Match: Q8RWN0 (Ubiquitin-like-specific protease 1C OS=Arabidopsis thaliana OX=3702 GN=ULP1C PE=1 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 2.4e-88
Identity = 218/566 (38.52%), Postives = 306/566 (54.06%), Query Frame = 0

Query: 3   MEQEKTTKKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQKLS 62
           +E ++  K  LNIDWD+  G  +E P LEI+    I   +      +    VR     L 
Sbjct: 7   IELDRVKKTMLNIDWDDALG-DEEVPELEIIATDKIPPREPTLSGYEPAVSVR----SLR 66

Query: 63  DSELEEKIRRMHQFYESTACKLPDKGQKYLRN----LELSMEERESRKLRRVEKEATGCK 122
           D+EL++ ++R          KL DKG+K +RN    LE   + R  ++  +++    GC+
Sbjct: 67  DNELDDHLKRQRSLLTRLGDKLADKGEK-IRNRIGELEYEKQRRMFQQRTKMQDADNGCQ 126

Query: 123 NHSQPTSSSI-----VGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQEL 182
              +P SS +       S   + + T+ S   S S F + F   L+   +        +L
Sbjct: 127 ILEKPKSSDVFMRASTASKDTSGQGTSGSKDVSRSTFAAHFSDNLKMGPQ-PVKLVNDKL 186

Query: 183 SILGHCD-------------NRRQRFNGRLSP-KVKQKGQTSARQQPFKCANNLSTDVRK 242
             LG                N   R   RLS  KV  K   S  + P         D R 
Sbjct: 187 QDLGRGSWISKANRDSIIEKNNVWRSLPRLSKCKVSLKNFYSESKDP-------KGDRRP 246

Query: 243 KVSSVAAQNCKSPDHL----DSHASERLPGCFGKK----DASQVQHSDSPMPRKGQTIVV 302
             +    +  +S  +L    D    +++ G    +     AS +Q S S   +    ++ 
Sbjct: 247 NEAYGKGKPNESSPYLLVDDDDGDDDKVIGYETPRHWSLKASPLQ-SSSCRKKSDDKVIN 306

Query: 303 VDEEEALATKIPEHDDKCMK--ESKIYYPSRDDPES---VEICFEDIKCLDPEGYLTSTI 362
           +DE+E L+  + E   +  +     IYYPS D  +    V++  +D+KCL P  YLTS +
Sbjct: 307 LDEDEPLSPMVVEEACELPEGLPEDIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPV 366

Query: 363 MNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKLKEAVSNKGKDRDNFFVKFRRWWRGVNI 422
           +NFYIRY+Q   F  +K   N HFFNT+FY+KL EAVS KG DRD +FVKFRRWW+G ++
Sbjct: 367 INFYIRYVQHHVFSADKTAANCHFFNTFFYKKLTEAVSYKGNDRDAYFVKFRRWWKGFDL 426

Query: 423 FQKAYVLIPIHEDLHWSLVIICFPQKEDESGPIILHLDSLGLHSSRSIFDNIKSFIKEEW 482
           F K+Y+ IPIHEDLHWSLVIIC P KEDESG  I+HLDSLGLH    IF+N+K F++EEW
Sbjct: 427 FCKSYIFIPIHEDLHWSLVIICIPDKEDESGLTIIHLDSLGLHPRNLIFNNVKRFLREEW 486

Query: 483 CYLDREVADSDLPMPHRIWKNISRRIEEKIIQVPQQKNDCDCGLFVLYFIERFIEEAPDR 533
            YL+++ A  DLP+  ++W+++   I E  +QVPQQKND DCGLF+L+FI RFIEEAP R
Sbjct: 487 NYLNQD-APLDLPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQR 546

BLAST of Tan0020931 vs. ExPASy Swiss-Prot
Match: Q8L7S0 (Probable ubiquitin-like-specific protease 2B OS=Arabidopsis thaliana OX=3702 GN=ULP2B PE=1 SV=3)

HSP 1 Score: 148.7 bits (374), Expect = 2.0e-34
Identity = 92/300 (30.67%), Postives = 161/300 (53.67%), Query Frame = 0

Query: 290 MKESKIYYPSRD-----------DPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRA 349
           + + K Y+PS D           DP++V IC  D++ L PE ++  TI++FYI YL+ + 
Sbjct: 367 LNQQKRYFPSFDEPFEDVVYPKGDPDAVSICKRDVELLQPETFVNDTIIDFYINYLKNQI 426

Query: 350 FLTNKVICNYHFFNTYFYEKLKEAVSNKGKDRDN--FFVKFRRWWRGVNIFQKAYVLIPI 409
               K    +HFFN++F+ KL +   +     D    F++ R+W R V++F K Y+ +P+
Sbjct: 427 QTEEK--HRFHFFNSFFFRKLADLDKDPSSIADGKAAFLRVRKWTRKVDMFGKDYIFVPV 486

Query: 410 HEDLHWSLVIICFPQK----------EDESGPIILHLDSL-GLHSSRSIFDNIKSFIKEE 469
           + +LHWSL++IC P +          + +  P ILH+DS+ G H+   + + +++++ EE
Sbjct: 487 NYNLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDSIKGSHA--GLKNLVQTYLCEE 546

Query: 470 WCYLDREVADSDLPMPHRIWKNISRRIEEKIIQVPQQKNDCDCGLFVLYFIERFIEEAPD 529
           W    +E +D D+       + +S       +++PQQ+N  DCGLF+L+++E F+ EAP 
Sbjct: 547 WKERHKETSD-DISSRFMNLRFVS-------LELPQQENSFDCGLFLLHYLELFLAEAPL 606

Query: 530 RLKRTDL----DMFGKRWFKPQEASSLRTRIKCLLKIEFQDVKKQCLSGSVESSSSDQAP 562
                 +    +     WF P EAS  RT I+   K+ F+ ++ +    S E + S ++P
Sbjct: 607 NFSPFKIYNASNFLYLNWFPPAEASLKRTLIQ---KLIFELLENRSREVSNEQNQSCESP 651

BLAST of Tan0020931 vs. ExPASy Swiss-Prot
Match: Q0WKV8 (Probable ubiquitin-like-specific protease 2A OS=Arabidopsis thaliana OX=3702 GN=ULP2A PE=2 SV=2)

HSP 1 Score: 134.8 bits (338), Expect = 3.0e-30
Identity = 79/257 (30.74%), Postives = 136/257 (52.92%), Query Frame = 0

Query: 295 IYYPSRDDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTY 354
           + YP + +P++V +  +DI+ L P  ++  TI++FYI+YL+ R  ++ K    +HFFN +
Sbjct: 295 LVYP-QGEPDAVVVRKQDIELLKPRRFINDTIIDFYIKYLKNR--ISPKERGRFHFFNCF 354

Query: 355 FYEKLKEAVSNKGKDRD----NFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFP 414
           F+ KL  A  +KG          + + ++W + V++F+K Y+ IPI+   HWSLVIIC P
Sbjct: 355 FFRKL--ANLDKGTPSTCGGREAYQRVQKWTKNVDLFEKDYIFIPINCSFHWSLVIICHP 414

Query: 415 -------QKEDESGPIILHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHR 474
                   +  +  P ILHLDS+       + +   S+++EEW        +     P+ 
Sbjct: 415 GELVPSHVENPQRVPCILHLDSIKGSHKGGLINIFPSYLREEWKARHENTTNDSSRAPN- 474

Query: 475 IWKNISRRIEEKIIQVPQQKNDCDCGLFVLYFIERFIEEAPDR----LKRTDLDMFGKRW 534
                   ++   +++PQQ+N  DCGLF+L++++ F+ +AP +    L     +   + W
Sbjct: 475 --------MQSISLELPQQENSFDCGLFLLHYLDLFVAQAPAKFNPSLISRSANFLTRNW 534

Query: 535 FKPQEASSLRTRIKCLL 537
           F  +EAS  R  I  LL
Sbjct: 535 FPAKEASLKRRNILELL 537

BLAST of Tan0020931 vs. ExPASy Swiss-Prot
Match: A7MBJ2 (Sentrin-specific protease 7 OS=Bos taurus OX=9913 GN=SENP7 PE=2 SV=1)

HSP 1 Score: 100.1 bits (248), Expect = 8.2e-20
Identity = 93/403 (23.08%), Postives = 165/403 (40.94%), Query Frame = 0

Query: 213  LSTDVRKKVSSVAAQNCKSPDHLDSHASERLPGCFGKKDASQVQHSDSPMPRKGQTIVVV 272
            L  ++  K SS     C S     +  +E +      K  SQ  ++D+  P    T++  
Sbjct: 662  LFQNLSSKESSFIHYYCASTCSFPAATTEEMK----MKSVSQPSNTDTAKPT--YTLLQK 721

Query: 273  DEEEALATKIPEHDDKCMKESK--------IYYPSRDDPESVEICFEDIKCLDPEGYLTS 332
                  +  I  + D+  +E +        I YP       + +  ED++CL+   +L  
Sbjct: 722  QSSGCYSLSITSNPDEEWREVRHTGPVQKLIVYPPPPTKGGLGVTNEDLECLEEGEFLND 781

Query: 333  TIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKLKEAVSNKGKDRDNFFVKFRR----- 392
             I++FY++YL      +++++   H F+++FY+ L    +N  +D  N  +  RR     
Sbjct: 782  VIIDFYLKYLILEK-ASDELVERSHIFSSFFYKCLTRKENNLTEDNPNLSMAQRRHKRVR 841

Query: 393  -WWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKED------------------------ 452
             W R +NIF K Y+ +P++E  HW L +ICFP  E+                        
Sbjct: 842  TWTRHINIFNKDYIFVPVNESSHWYLAVICFPWLEEVVYEDFPQTIPQYSQAEESHHDSR 901

Query: 453  ------------ESG--------------------PIILHLDSLGLHSSRSIFDNIKSFI 512
                         SG                    P IL LDSL   S ++   N++ ++
Sbjct: 902  TIDNDLHTSSALSSGTEDSQSPEMNVTVPKKMCKRPCILILDSLKAASIQNTVQNLREYL 961

Query: 513  KEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQVPQQKNDCDCGLFVLYFIERFIEE 545
            + EW     EV        HR +   +  + +   +VP+Q N  DCG+++L ++E F + 
Sbjct: 962  EVEW-----EVKRK----THREFSKTN--MVDLCPKVPKQDNSSDCGVYLLQYVESFFK- 1021

BLAST of Tan0020931 vs. NCBI nr
Match: XP_022966911.1 (ubiquitin-like-specific protease 1D [Cucurbita maxima])

HSP 1 Score: 981.5 bits (2536), Expect = 3.1e-282
Identity = 487/564 (86.35%), Postives = 520/564 (92.20%), Query Frame = 0

Query: 1   MVMEQ-EKTTKKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQ 60
           M MEQ E T KKPL+IDWD+LFG K+ EPPLEI++ P+I  SKHFEM+SDRQHL+R+E Q
Sbjct: 1   MAMEQDEMTKKKPLDIDWDKLFGGKETEPPLEIIVQPSITISKHFEMDSDRQHLLRDECQ 60

Query: 61  KLSDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRVEKEATGCKN 120
           KL+D+EL+EKIRRM QFYE+ A  LPDKG KYLRNL+LSMEERESRKLRRVEKEA  C+N
Sbjct: 61  KLNDAELDEKIRRMKQFYETKARYLPDKGHKYLRNLDLSMEERESRKLRRVEKEAATCEN 120

Query: 121 HSQPTSSSIVGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSILGHC 180
            SQPT+SS VGSSYVA EKTASSSADSLS FTSCFYQKLEEKTECN+SAF Q+LSILGHC
Sbjct: 121 CSQPTTSSTVGSSYVASEKTASSSADSLSTFTSCFYQKLEEKTECNSSAFSQDLSILGHC 180

Query: 181 DNRRQRFNGRLSPKVKQKGQTSARQQPFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHA 240
           DN+RQR NGRLSPKVK KGQ SARQQPFKC+ NLSTDV KKVSSV AQNCK PD LDSH 
Sbjct: 181 DNQRQRCNGRLSPKVKHKGQASARQQPFKCSINLSTDVNKKVSSVTAQNCKIPDDLDSHV 240

Query: 241 SERLPGCFGKKDASQVQHSDSPMPRKGQTIVVVDEEEALATKIPEHDDKCMKESKIYYPS 300
           SERLPGCF KKDASQVQHSD+ MPRKGQTIVVVDEEEALA KIPEHDDKCMKE+KIYYPS
Sbjct: 241 SERLPGCFRKKDASQVQHSDNSMPRKGQTIVVVDEEEALAMKIPEHDDKCMKEAKIYYPS 300

Query: 301 RDDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKL 360
           RDDPESVEICFEDI CLDPEGYLTSTIMNFYIRYLQQ+AF T KVICNYHFFNTYFYEKL
Sbjct: 301 RDDPESVEICFEDINCLDPEGYLTSTIMNFYIRYLQQQAFSTKKVICNYHFFNTYFYEKL 360

Query: 361 KEAVSNKGKDRDNFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 420
           KEAVSNKGKDR+NFFVKFRRWW+G+NIFQKAYVLIPIHEDLHWSLVIICFP+KEDESGPI
Sbjct: 361 KEAVSNKGKDRENFFVKFRRWWKGINIFQKAYVLIPIHEDLHWSLVIICFPRKEDESGPI 420

Query: 421 ILHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQV 480
           ILHLDSLGLHSSRS+FDNIKSFIKEEWCYLDREVA  DLPMP+RIWKNISRRIEEKIIQV
Sbjct: 421 ILHLDSLGLHSSRSVFDNIKSFIKEEWCYLDREVACLDLPMPYRIWKNISRRIEEKIIQV 480

Query: 481 PQQKNDCDCGLFVLYFIERFIEEAPDRLKRTDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540
           PQQKND DCGLFVLYFIERFIEEAPDRLKR DLDMFGKRWFKPQEASSLRTRIKCLLKIE
Sbjct: 481 PQQKNDYDCGLFVLYFIERFIEEAPDRLKRKDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540

Query: 541 FQDVKKQCLSGSVESSSSDQAPNQ 564
           FQ +K+QCL+GSV+SSSSD AP Q
Sbjct: 541 FQILKRQCLAGSVDSSSSDHAPKQ 564

BLAST of Tan0020931 vs. NCBI nr
Match: XP_022945620.1 (ubiquitin-like-specific protease 1D [Cucurbita moschata])

HSP 1 Score: 976.1 bits (2522), Expect = 1.3e-280
Identity = 485/564 (85.99%), Postives = 518/564 (91.84%), Query Frame = 0

Query: 1   MVMEQEKTT-KKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQ 60
           M MEQ++TT KKPLNIDWD+LFG K+ EPPLEI++ P+I  SKHFEM+SDRQHL+R+E Q
Sbjct: 1   MAMEQDQTTKKKPLNIDWDKLFGGKETEPPLEIIVQPSITISKHFEMDSDRQHLLRDECQ 60

Query: 61  KLSDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRVEKEATGCKN 120
           KL+D+EL+EKIRRM QFYE+ A  LPD+G KYLRNL+LSMEERESRKLRRVEKEA  C+N
Sbjct: 61  KLNDAELDEKIRRMKQFYETKARYLPDEGHKYLRNLDLSMEERESRKLRRVEKEAAACEN 120

Query: 121 HSQPTSSSIVGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSILGHC 180
            SQPT+SS VGSSYVA EKTAS SADSLS FTSCFYQKLEEKTECN+SAF Q+LSILGHC
Sbjct: 121 CSQPTTSSTVGSSYVASEKTASPSADSLSTFTSCFYQKLEEKTECNSSAFSQDLSILGHC 180

Query: 181 DNRRQRFNGRLSPKVKQKGQTSARQQPFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHA 240
           DN+RQR NGRLSPKVK KGQ SA QQPFKCA NLSTDV +KVSSV AQNCK PD LDSH 
Sbjct: 181 DNQRQRCNGRLSPKVKHKGQASAGQQPFKCAINLSTDVHEKVSSVTAQNCKIPDDLDSHV 240

Query: 241 SERLPGCFGKKDASQVQHSDSPMPRKGQTIVVVDEEEALATKIPEHDDKCMKESKIYYPS 300
           SERLPGC  KKDASQVQ SD+ MPRKGQTIVVVDEEEALA KIPEHDDKCMKE+KIYYPS
Sbjct: 241 SERLPGCLRKKDASQVQQSDNSMPRKGQTIVVVDEEEALAMKIPEHDDKCMKEAKIYYPS 300

Query: 301 RDDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKL 360
           RDDPESVEICFEDI CLDPEGYLTSTIMNFYIRYLQQ+AF T KVICNYHFFNTYFYEKL
Sbjct: 301 RDDPESVEICFEDINCLDPEGYLTSTIMNFYIRYLQQQAFSTKKVICNYHFFNTYFYEKL 360

Query: 361 KEAVSNKGKDRDNFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 420
           KEAVSNKGKDR+NFFVKFRRWW+GVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI
Sbjct: 361 KEAVSNKGKDRENFFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 420

Query: 421 ILHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQV 480
           ILHLDSLGLHSSRS+FDNIKSFIKEEWCYLDREVA  DLPMP+RIWKNISRRIEEKIIQV
Sbjct: 421 ILHLDSLGLHSSRSVFDNIKSFIKEEWCYLDREVACLDLPMPYRIWKNISRRIEEKIIQV 480

Query: 481 PQQKNDCDCGLFVLYFIERFIEEAPDRLKRTDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540
           PQQKND DCGLFVLYFIERFIEEAPDRLKR DLDMFGKRWFKPQEASSLRTRIKCLLKIE
Sbjct: 481 PQQKNDYDCGLFVLYFIERFIEEAPDRLKRKDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540

Query: 541 FQDVKKQCLSGSVESSSSDQAPNQ 564
           FQ +K+QCL+GSV+SSSSD AP Q
Sbjct: 541 FQILKRQCLAGSVDSSSSDHAPKQ 564

BLAST of Tan0020931 vs. NCBI nr
Match: KAG7012818.1 (Ubiquitin-like-specific protease 1D [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 970.7 bits (2508), Expect = 5.4e-279
Identity = 484/564 (85.82%), Postives = 517/564 (91.67%), Query Frame = 0

Query: 1   MVMEQEKTT-KKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQ 60
           M MEQ++TT KKPLNIDWD+LFG K+ EPPLEI++ P+I  SKHFEM+SDRQHL+R+E Q
Sbjct: 1   MAMEQDQTTKKKPLNIDWDKLFGGKETEPPLEIIVQPSITISKHFEMDSDRQHLLRDECQ 60

Query: 61  KLSDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRVEKEATGCKN 120
           KL+D+EL+EKIRRM QFYE+ A  LPD+G KYLRNL+LSMEERESRKLRRVEKEA  C+N
Sbjct: 61  KLNDAELDEKIRRMKQFYETKARYLPDEGHKYLRNLDLSMEERESRKLRRVEKEAAACEN 120

Query: 121 HSQPTSSSIVGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSILGHC 180
            SQPT+SS VGSSYVA EKTAS SADSLS FTSCFYQKLEEKTECN+SAF Q+LSILGHC
Sbjct: 121 CSQPTTSSTVGSSYVASEKTASPSADSLSTFTSCFYQKLEEKTECNSSAFSQDLSILGHC 180

Query: 181 DNRRQRFNGRLSPKVKQKGQTSARQQPFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHA 240
           DN+RQR NGRLSPKVK KGQ SARQQPFKC  NLSTDV +KVSSV AQNCK PD LDSH 
Sbjct: 181 DNQRQRCNGRLSPKVKHKGQASARQQPFKCI-NLSTDVHEKVSSVTAQNCKIPDDLDSHV 240

Query: 241 SERLPGCFGKKDASQVQHSDSPMPRKGQTIVVVDEEEALATKIPEHDDKCMKESKIYYPS 300
           SERLP C  KKDASQVQ SD+ MPRKGQTIVVVDEEEALA KIPEHDDKCMKE+KIYYPS
Sbjct: 241 SERLPDCLRKKDASQVQQSDNSMPRKGQTIVVVDEEEALAMKIPEHDDKCMKEAKIYYPS 300

Query: 301 RDDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKL 360
           RDDPESVEICFEDI CLDPEGYLTSTIMNFYIRYLQQ+AF T KVICNYHFFNTYFYEKL
Sbjct: 301 RDDPESVEICFEDINCLDPEGYLTSTIMNFYIRYLQQQAFSTKKVICNYHFFNTYFYEKL 360

Query: 361 KEAVSNKGKDRDNFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 420
           KEAVSNKGKDR+NFFVKFRRWW+GVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI
Sbjct: 361 KEAVSNKGKDRENFFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 420

Query: 421 ILHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQV 480
           ILHLDSLGLHSSRS+FDNIKSFIKEEWCYLDREVA  DLPMP+RIWKNISRRIEEKIIQV
Sbjct: 421 ILHLDSLGLHSSRSVFDNIKSFIKEEWCYLDREVACLDLPMPYRIWKNISRRIEEKIIQV 480

Query: 481 PQQKNDCDCGLFVLYFIERFIEEAPDRLKRTDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540
           PQQKND DCGLFVLYFIERFIEEAPDRLKR DLDMFGKRWFKPQEASSLRTRIKCLLKIE
Sbjct: 481 PQQKNDYDCGLFVLYFIERFIEEAPDRLKRKDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540

Query: 541 FQDVKKQCLSGSVESSSSDQAPNQ 564
           FQ +K+QCL+GSV+SSSSD AP Q
Sbjct: 541 FQILKRQCLAGSVDSSSSDHAPKQ 563

BLAST of Tan0020931 vs. NCBI nr
Match: XP_023541910.1 (ubiquitin-like-specific protease 1D [Cucurbita pepo subsp. pepo])

HSP 1 Score: 970.7 bits (2508), Expect = 5.4e-279
Identity = 483/563 (85.79%), Postives = 518/563 (92.01%), Query Frame = 0

Query: 2   VMEQEKTTK-KPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQK 61
           +MEQ++TTK KPLNIDWD+LFG K+ EPPLEI++ P+I  SKHFEM+SDRQHL+R+E QK
Sbjct: 3   MMEQDETTKNKPLNIDWDKLFGGKETEPPLEIIVQPSITISKHFEMDSDRQHLLRDECQK 62

Query: 62  LSDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRVEKEATGCKNH 121
           L+D+EL+EKIRRM QFYE+ A  LPD G KYLRNL+LSMEERESRKLRRVEKEA  C+N 
Sbjct: 63  LNDAELDEKIRRMKQFYETKARYLPDNGHKYLRNLDLSMEERESRKLRRVEKEAAACENC 122

Query: 122 SQPTSSSIVGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSILGHCD 181
           SQPT+SS VGSSYVA EKTASSSADSLS FTSCFYQKLEEKTECN+SAF Q+LSILGHCD
Sbjct: 123 SQPTTSSTVGSSYVASEKTASSSADSLSTFTSCFYQKLEEKTECNSSAFSQDLSILGHCD 182

Query: 182 NRRQRFNGRLSPKVKQKGQTSARQQPFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHAS 241
           N+RQR NGRLSP+VK KGQ SARQQPFKCA NLSTDV +KVSSV AQNCK P+ LDSH S
Sbjct: 183 NQRQRCNGRLSPRVKHKGQASARQQPFKCAINLSTDVHEKVSSVTAQNCKIPNDLDSHVS 242

Query: 242 ERLPGCFGKKDASQVQHSDSPMPRKGQTIVVVDEEEALATKIPEHDDKCMKESKIYYPSR 301
           ERLPGC  KKDASQVQ SD+ MPRKGQTIVVVDEEEALA KIPEHDDK MKE+KIYYPSR
Sbjct: 243 ERLPGCLRKKDASQVQDSDNSMPRKGQTIVVVDEEEALAMKIPEHDDKRMKEAKIYYPSR 302

Query: 302 DDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKLK 361
           DDPESVEICFEDI CLDPEGYLTSTIMNFYIRYLQQ+AF T KVICNYHFFNTYFYEKLK
Sbjct: 303 DDPESVEICFEDINCLDPEGYLTSTIMNFYIRYLQQQAFSTKKVICNYHFFNTYFYEKLK 362

Query: 362 EAVSNKGKDRDNFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPII 421
           EAVSNKGKDR+NFFVKFRRWW+GVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPII
Sbjct: 363 EAVSNKGKDRENFFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPII 422

Query: 422 LHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQVP 481
           LHLDSLGLHSSRS+FDNIKSFIKEEWCYLDREVA  DLPMP+RIWKNISRRIEEKIIQVP
Sbjct: 423 LHLDSLGLHSSRSVFDNIKSFIKEEWCYLDREVACLDLPMPYRIWKNISRRIEEKIIQVP 482

Query: 482 QQKNDCDCGLFVLYFIERFIEEAPDRLKRTDLDMFGKRWFKPQEASSLRTRIKCLLKIEF 541
           QQKND DCGLFVLYFIERFIEEAPDRLKR DLDMFGKRWFKPQEASSLRTRIKCLLKIEF
Sbjct: 483 QQKNDYDCGLFVLYFIERFIEEAPDRLKRKDLDMFGKRWFKPQEASSLRTRIKCLLKIEF 542

Query: 542 QDVKKQCLSGSVESSSSDQAPNQ 564
           Q +K+QCL+GSV+SSSSD AP Q
Sbjct: 543 QILKRQCLAGSVDSSSSDHAPKQ 565

BLAST of Tan0020931 vs. NCBI nr
Match: KAG6573744.1 (Ubiquitin-like-specific protease 1D, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 969.9 bits (2506), Expect = 9.3e-279
Identity = 484/564 (85.82%), Postives = 517/564 (91.67%), Query Frame = 0

Query: 1   MVMEQEKTT-KKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQ 60
           M MEQ++TT KKPLNIDWD+LFG K+ EPPLEI++ P+I  SKHFEM+SDRQHL+R+E Q
Sbjct: 1   MAMEQDQTTKKKPLNIDWDKLFGGKETEPPLEIIVQPSITISKHFEMDSDRQHLLRDECQ 60

Query: 61  KLSDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRVEKEATGCKN 120
           KL+D+EL+EKIRRM QFYE+ A  LPD+G KYLRNL+LSMEERESRKLRRVEKEA  C+N
Sbjct: 61  KLNDAELDEKIRRMKQFYETKARYLPDEGHKYLRNLDLSMEERESRKLRRVEKEAAACEN 120

Query: 121 HSQPTSSSIVGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSILGHC 180
            SQPT+SS VGSSYVA EKTAS SADSLS FTSCFYQKLEEKTECN+SAF Q+LSILGHC
Sbjct: 121 CSQPTTSSTVGSSYVASEKTASPSADSLSTFTSCFYQKLEEKTECNSSAFSQDLSILGHC 180

Query: 181 DNRRQRFNGRLSPKVKQKGQTSARQQPFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHA 240
           DN+RQR NGRLSPKVK KGQ SARQQPFKC  NLSTDV +KVSSV AQNCK PD LDSH 
Sbjct: 181 DNQRQRCNGRLSPKVKHKGQASARQQPFKCI-NLSTDVHEKVSSVTAQNCKIPDDLDSHV 240

Query: 241 SERLPGCFGKKDASQVQHSDSPMPRKGQTIVVVDEEEALATKIPEHDDKCMKESKIYYPS 300
           SERLPGC  KKDASQVQ SD+ MPRKGQTIVVVDEEEALA KIPEHDDKCMKE+KIYYPS
Sbjct: 241 SERLPGCLRKKDASQVQQSDNSMPRKGQTIVVVDEEEALAMKIPEHDDKCMKEAKIYYPS 300

Query: 301 RDDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKL 360
           RDDPESVEICFEDI CLDPEGYLTSTIMNFYIRYLQQ+AF T KVICNYHFF TYFYEKL
Sbjct: 301 RDDPESVEICFEDINCLDPEGYLTSTIMNFYIRYLQQQAFSTKKVICNYHFFITYFYEKL 360

Query: 361 KEAVSNKGKDRDNFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 420
           KEAVSNKGKDR+NFFVKFRRWW+GVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI
Sbjct: 361 KEAVSNKGKDRENFFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 420

Query: 421 ILHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQV 480
           ILHLDSLGLHSSRS+FDNIKSFIKEEWCYLDREVA  DLPMP+RIWKNISRRIEEKIIQV
Sbjct: 421 ILHLDSLGLHSSRSVFDNIKSFIKEEWCYLDREVACLDLPMPYRIWKNISRRIEEKIIQV 480

Query: 481 PQQKNDCDCGLFVLYFIERFIEEAPDRLKRTDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540
           PQQKND DCGLFVLYFIERFIEEAPDRLKR DLDMFGKRWFKPQEASSLRTRIKCLLKIE
Sbjct: 481 PQQKNDYDCGLFVLYFIERFIEEAPDRLKRKDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540

Query: 541 FQDVKKQCLSGSVESSSSDQAPNQ 564
           FQ +K+QCL+GSV+SSSSD AP Q
Sbjct: 541 FQILKRQCLAGSVDSSSSDHAPKQ 563

BLAST of Tan0020931 vs. ExPASy TrEMBL
Match: A0A6J1HQL5 (ubiquitin-like-specific protease 1D OS=Cucurbita maxima OX=3661 GN=LOC111466476 PE=3 SV=1)

HSP 1 Score: 981.5 bits (2536), Expect = 1.5e-282
Identity = 487/564 (86.35%), Postives = 520/564 (92.20%), Query Frame = 0

Query: 1   MVMEQ-EKTTKKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQ 60
           M MEQ E T KKPL+IDWD+LFG K+ EPPLEI++ P+I  SKHFEM+SDRQHL+R+E Q
Sbjct: 1   MAMEQDEMTKKKPLDIDWDKLFGGKETEPPLEIIVQPSITISKHFEMDSDRQHLLRDECQ 60

Query: 61  KLSDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRVEKEATGCKN 120
           KL+D+EL+EKIRRM QFYE+ A  LPDKG KYLRNL+LSMEERESRKLRRVEKEA  C+N
Sbjct: 61  KLNDAELDEKIRRMKQFYETKARYLPDKGHKYLRNLDLSMEERESRKLRRVEKEAATCEN 120

Query: 121 HSQPTSSSIVGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSILGHC 180
            SQPT+SS VGSSYVA EKTASSSADSLS FTSCFYQKLEEKTECN+SAF Q+LSILGHC
Sbjct: 121 CSQPTTSSTVGSSYVASEKTASSSADSLSTFTSCFYQKLEEKTECNSSAFSQDLSILGHC 180

Query: 181 DNRRQRFNGRLSPKVKQKGQTSARQQPFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHA 240
           DN+RQR NGRLSPKVK KGQ SARQQPFKC+ NLSTDV KKVSSV AQNCK PD LDSH 
Sbjct: 181 DNQRQRCNGRLSPKVKHKGQASARQQPFKCSINLSTDVNKKVSSVTAQNCKIPDDLDSHV 240

Query: 241 SERLPGCFGKKDASQVQHSDSPMPRKGQTIVVVDEEEALATKIPEHDDKCMKESKIYYPS 300
           SERLPGCF KKDASQVQHSD+ MPRKGQTIVVVDEEEALA KIPEHDDKCMKE+KIYYPS
Sbjct: 241 SERLPGCFRKKDASQVQHSDNSMPRKGQTIVVVDEEEALAMKIPEHDDKCMKEAKIYYPS 300

Query: 301 RDDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKL 360
           RDDPESVEICFEDI CLDPEGYLTSTIMNFYIRYLQQ+AF T KVICNYHFFNTYFYEKL
Sbjct: 301 RDDPESVEICFEDINCLDPEGYLTSTIMNFYIRYLQQQAFSTKKVICNYHFFNTYFYEKL 360

Query: 361 KEAVSNKGKDRDNFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 420
           KEAVSNKGKDR+NFFVKFRRWW+G+NIFQKAYVLIPIHEDLHWSLVIICFP+KEDESGPI
Sbjct: 361 KEAVSNKGKDRENFFVKFRRWWKGINIFQKAYVLIPIHEDLHWSLVIICFPRKEDESGPI 420

Query: 421 ILHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQV 480
           ILHLDSLGLHSSRS+FDNIKSFIKEEWCYLDREVA  DLPMP+RIWKNISRRIEEKIIQV
Sbjct: 421 ILHLDSLGLHSSRSVFDNIKSFIKEEWCYLDREVACLDLPMPYRIWKNISRRIEEKIIQV 480

Query: 481 PQQKNDCDCGLFVLYFIERFIEEAPDRLKRTDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540
           PQQKND DCGLFVLYFIERFIEEAPDRLKR DLDMFGKRWFKPQEASSLRTRIKCLLKIE
Sbjct: 481 PQQKNDYDCGLFVLYFIERFIEEAPDRLKRKDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540

Query: 541 FQDVKKQCLSGSVESSSSDQAPNQ 564
           FQ +K+QCL+GSV+SSSSD AP Q
Sbjct: 541 FQILKRQCLAGSVDSSSSDHAPKQ 564

BLAST of Tan0020931 vs. ExPASy TrEMBL
Match: A0A6J1G1G7 (ubiquitin-like-specific protease 1D OS=Cucurbita moschata OX=3662 GN=LOC111449805 PE=3 SV=1)

HSP 1 Score: 976.1 bits (2522), Expect = 6.3e-281
Identity = 485/564 (85.99%), Postives = 518/564 (91.84%), Query Frame = 0

Query: 1   MVMEQEKTT-KKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQ 60
           M MEQ++TT KKPLNIDWD+LFG K+ EPPLEI++ P+I  SKHFEM+SDRQHL+R+E Q
Sbjct: 1   MAMEQDQTTKKKPLNIDWDKLFGGKETEPPLEIIVQPSITISKHFEMDSDRQHLLRDECQ 60

Query: 61  KLSDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRVEKEATGCKN 120
           KL+D+EL+EKIRRM QFYE+ A  LPD+G KYLRNL+LSMEERESRKLRRVEKEA  C+N
Sbjct: 61  KLNDAELDEKIRRMKQFYETKARYLPDEGHKYLRNLDLSMEERESRKLRRVEKEAAACEN 120

Query: 121 HSQPTSSSIVGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSILGHC 180
            SQPT+SS VGSSYVA EKTAS SADSLS FTSCFYQKLEEKTECN+SAF Q+LSILGHC
Sbjct: 121 CSQPTTSSTVGSSYVASEKTASPSADSLSTFTSCFYQKLEEKTECNSSAFSQDLSILGHC 180

Query: 181 DNRRQRFNGRLSPKVKQKGQTSARQQPFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHA 240
           DN+RQR NGRLSPKVK KGQ SA QQPFKCA NLSTDV +KVSSV AQNCK PD LDSH 
Sbjct: 181 DNQRQRCNGRLSPKVKHKGQASAGQQPFKCAINLSTDVHEKVSSVTAQNCKIPDDLDSHV 240

Query: 241 SERLPGCFGKKDASQVQHSDSPMPRKGQTIVVVDEEEALATKIPEHDDKCMKESKIYYPS 300
           SERLPGC  KKDASQVQ SD+ MPRKGQTIVVVDEEEALA KIPEHDDKCMKE+KIYYPS
Sbjct: 241 SERLPGCLRKKDASQVQQSDNSMPRKGQTIVVVDEEEALAMKIPEHDDKCMKEAKIYYPS 300

Query: 301 RDDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKL 360
           RDDPESVEICFEDI CLDPEGYLTSTIMNFYIRYLQQ+AF T KVICNYHFFNTYFYEKL
Sbjct: 301 RDDPESVEICFEDINCLDPEGYLTSTIMNFYIRYLQQQAFSTKKVICNYHFFNTYFYEKL 360

Query: 361 KEAVSNKGKDRDNFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 420
           KEAVSNKGKDR+NFFVKFRRWW+GVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI
Sbjct: 361 KEAVSNKGKDRENFFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 420

Query: 421 ILHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQV 480
           ILHLDSLGLHSSRS+FDNIKSFIKEEWCYLDREVA  DLPMP+RIWKNISRRIEEKIIQV
Sbjct: 421 ILHLDSLGLHSSRSVFDNIKSFIKEEWCYLDREVACLDLPMPYRIWKNISRRIEEKIIQV 480

Query: 481 PQQKNDCDCGLFVLYFIERFIEEAPDRLKRTDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540
           PQQKND DCGLFVLYFIERFIEEAPDRLKR DLDMFGKRWFKPQEASSLRTRIKCLLKIE
Sbjct: 481 PQQKNDYDCGLFVLYFIERFIEEAPDRLKRKDLDMFGKRWFKPQEASSLRTRIKCLLKIE 540

Query: 541 FQDVKKQCLSGSVESSSSDQAPNQ 564
           FQ +K+QCL+GSV+SSSSD AP Q
Sbjct: 541 FQILKRQCLAGSVDSSSSDHAPKQ 564

BLAST of Tan0020931 vs. ExPASy TrEMBL
Match: A0A6J1DC74 (ubiquitin-like-specific protease 1D isoform X1 OS=Momordica charantia OX=3673 GN=LOC111019156 PE=3 SV=1)

HSP 1 Score: 907.1 bits (2343), Expect = 3.6e-260
Identity = 456/565 (80.71%), Postives = 494/565 (87.43%), Query Frame = 0

Query: 3   MEQEKTTKKPLNIDWDELFGCKDEEPPLEIVILPAIA---NSKHFEMESDRQHLVREEYQ 62
           MEQEKTTK+PLNIDWD+L GC+DEEPP ++VI P  A      H +M+SDRQHL REEYQ
Sbjct: 1   MEQEKTTKRPLNIDWDKLLGCEDEEPPPDVVIEPPTAAPHPQNHLDMDSDRQHLAREEYQ 60

Query: 63  KLSDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRVEKEATGCKN 122
           K SD ELE+KIRRM  FYE+ ACKLPDKGQKY+R LEL  EERE RKLRRVEKEATGC+N
Sbjct: 61  KFSDVELEDKIRRMKSFYETKACKLPDKGQKYIRTLELCEEEREYRKLRRVEKEATGCEN 120

Query: 123 HSQPTSSSIVGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSILGHC 182
            SQPT+SS+VG+SY AREKTASSSADSLS FTSCF +KLEEK  CNN +F QELS+LGHC
Sbjct: 121 LSQPTTSSMVGTSYGAREKTASSSADSLSTFTSCFSRKLEEKAGCNNGSFSQELSVLGHC 180

Query: 183 DNRRQRFNGRLSPKVKQKGQTSARQQPFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHA 242
           DNRRQR+NGRLSPK+KQKGQTS+RQQPFK ANNLS DV K        N K+ D  D   
Sbjct: 181 DNRRQRYNGRLSPKLKQKGQTSSRQQPFKSANNLSFDVHK--------NGKNSDQFDFDV 240

Query: 243 SERLPGCFGKKDASQVQHSDSPMPRKGQTIVVVDEEEALATKIPEHDDKCMKESKIYYPS 302
            E+LPGCFGKKDASQVQHSD+   R+GQTIVVVDEEEALA KI E DDKCMKE+KIYYPS
Sbjct: 241 IEKLPGCFGKKDASQVQHSDNLRLREGQTIVVVDEEEALAIKISERDDKCMKETKIYYPS 300

Query: 303 RDDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKL 362
           R DPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRA  TN+VICN HFFNTYFYEKL
Sbjct: 301 RHDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRACSTNRVICNSHFFNTYFYEKL 360

Query: 363 KEAVSNKGKDRDNFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 422
           KEAVSNKGKD++ FFVKFRRWW+GVNIFQKAYVLIPIHEDLHWSLVIICFP KEDE+GPI
Sbjct: 361 KEAVSNKGKDKEIFFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICFPHKEDEAGPI 420

Query: 423 ILHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQV 482
           ILHLDSLGLHSSRSIFDNIK++IKEEWCYLDREV+DSDLPMP RIWKNISRRIE+KII+V
Sbjct: 421 ILHLDSLGLHSSRSIFDNIKNYIKEEWCYLDREVSDSDLPMPFRIWKNISRRIEDKIIEV 480

Query: 483 PQQKNDCDCGLFVLYFIERFIEEAPDRLKRTDLDMFGKRWFKPQEASSLRTRIKCLLKIE 542
           PQQKND DCGLFVLYFIERFIEEAPDRLKR DLDMFGKRWFKPQEAS LRTRI+ LLK+E
Sbjct: 481 PQQKNDYDCGLFVLYFIERFIEEAPDRLKRKDLDMFGKRWFKPQEASGLRTRIRSLLKME 540

Query: 543 FQDVKKQCLSGSVES-SSSDQAPNQ 564
           FQ VK+QCLSGSVE  SSSDQ P Q
Sbjct: 541 FQIVKRQCLSGSVEKISSSDQPPKQ 557

BLAST of Tan0020931 vs. ExPASy TrEMBL
Match: A0A6J1DAG0 (ubiquitin-like-specific protease 1D isoform X2 OS=Momordica charantia OX=3673 GN=LOC111019156 PE=3 SV=1)

HSP 1 Score: 898.3 bits (2320), Expect = 1.7e-257
Identity = 454/565 (80.35%), Postives = 492/565 (87.08%), Query Frame = 0

Query: 3   MEQEKTTKKPLNIDWDELFGCKDEEPPLEIVILPAIA---NSKHFEMESDRQHLVREEYQ 62
           MEQEKTTK+PLNIDWD+L GC+DEEPP ++VI P  A      H +M+SDRQHL REEYQ
Sbjct: 1   MEQEKTTKRPLNIDWDKLLGCEDEEPPPDVVIEPPTAAPHPQNHLDMDSDRQHLAREEYQ 60

Query: 63  KLSDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRVEKEATGCKN 122
           K SD ELE+KIRRM  FYE+ ACKLPDKGQKY+R LEL  EERE RKLRRVEKEATGC+N
Sbjct: 61  KFSDVELEDKIRRMKSFYETKACKLPDKGQKYIRTLELCEEEREYRKLRRVEKEATGCEN 120

Query: 123 HSQPTSSSIVGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSILGHC 182
            SQPT+SS+VG+SY AREKTASSSADSLS FTSCF +KLEEK  CNN +F QELS+LGHC
Sbjct: 121 LSQPTTSSMVGTSYGAREKTASSSADSLSTFTSCFSRKLEEKAGCNNGSFSQELSVLGHC 180

Query: 183 DNRRQRFNGRLSPKVKQKGQTSARQQPFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHA 242
           DNRRQR+NGRLSPK+KQKGQTS+RQQPFK ANNLS DV K        N K+ D  D   
Sbjct: 181 DNRRQRYNGRLSPKLKQKGQTSSRQQPFKSANNLSFDVHK--------NGKNSDQFDFDV 240

Query: 243 SERLPGCFGKKDASQVQHSDSPMPRKGQTIVVVDEEEALATKIPEHDDKCMKESKIYYPS 302
            E+LPGCFGKKDASQVQHSD+   R  +TIVVVDEEEALA KI E DDKCMKE+KIYYPS
Sbjct: 241 IEKLPGCFGKKDASQVQHSDN--LRLRETIVVVDEEEALAIKISERDDKCMKETKIYYPS 300

Query: 303 RDDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKL 362
           R DPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRA  TN+VICN HFFNTYFYEKL
Sbjct: 301 RHDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRACSTNRVICNSHFFNTYFYEKL 360

Query: 363 KEAVSNKGKDRDNFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 422
           KEAVSNKGKD++ FFVKFRRWW+GVNIFQKAYVLIPIHEDLHWSLVIICFP KEDE+GPI
Sbjct: 361 KEAVSNKGKDKEIFFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICFPHKEDEAGPI 420

Query: 423 ILHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQV 482
           ILHLDSLGLHSSRSIFDNIK++IKEEWCYLDREV+DSDLPMP RIWKNISRRIE+KII+V
Sbjct: 421 ILHLDSLGLHSSRSIFDNIKNYIKEEWCYLDREVSDSDLPMPFRIWKNISRRIEDKIIEV 480

Query: 483 PQQKNDCDCGLFVLYFIERFIEEAPDRLKRTDLDMFGKRWFKPQEASSLRTRIKCLLKIE 542
           PQQKND DCGLFVLYFIERFIEEAPDRLKR DLDMFGKRWFKPQEAS LRTRI+ LLK+E
Sbjct: 481 PQQKNDYDCGLFVLYFIERFIEEAPDRLKRKDLDMFGKRWFKPQEASGLRTRIRSLLKME 540

Query: 543 FQDVKKQCLSGSVES-SSSDQAPNQ 564
           FQ VK+QCLSGSVE  SSSDQ P Q
Sbjct: 541 FQIVKRQCLSGSVEKISSSDQPPKQ 555

BLAST of Tan0020931 vs. ExPASy TrEMBL
Match: A0A6J1DDS3 (ubiquitin-like-specific protease 1D isoform X3 OS=Momordica charantia OX=3673 GN=LOC111019156 PE=3 SV=1)

HSP 1 Score: 897.1 bits (2317), Expect = 3.7e-257
Identity = 454/565 (80.35%), Postives = 491/565 (86.90%), Query Frame = 0

Query: 3   MEQEKTTKKPLNIDWDELFGCKDEEPPLEIVILPAIA---NSKHFEMESDRQHLVREEYQ 62
           MEQEKTTK+PLNIDWD+L GC+DEEPP ++VI P  A      H +M+SDRQHL REEYQ
Sbjct: 1   MEQEKTTKRPLNIDWDKLLGCEDEEPPPDVVIEPPTAAPHPQNHLDMDSDRQHLAREEYQ 60

Query: 63  KLSDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRVEKEATGCKN 122
           K SD ELE+KIRRM  FYE+ ACKLPDKGQKY+R LEL  EERE RKLRRVEKEATGC+N
Sbjct: 61  KFSDVELEDKIRRMKSFYETKACKLPDKGQKYIRTLELCEEEREYRKLRRVEKEATGCEN 120

Query: 123 HSQPTSSSIVGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSILGHC 182
            SQPT+SS   +SY AREKTASSSADSLS FTSCF +KLEEK  CNN +F QELS+LGHC
Sbjct: 121 LSQPTTSS---TSYGAREKTASSSADSLSTFTSCFSRKLEEKAGCNNGSFSQELSVLGHC 180

Query: 183 DNRRQRFNGRLSPKVKQKGQTSARQQPFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHA 242
           DNRRQR+NGRLSPK+KQKGQTS+RQQPFK ANNLS DV K        N K+ D  D   
Sbjct: 181 DNRRQRYNGRLSPKLKQKGQTSSRQQPFKSANNLSFDVHK--------NGKNSDQFDFDV 240

Query: 243 SERLPGCFGKKDASQVQHSDSPMPRKGQTIVVVDEEEALATKIPEHDDKCMKESKIYYPS 302
            E+LPGCFGKKDASQVQHSD+   R+GQTIVVVDEEEALA KI E DDKCMKE+KIYYPS
Sbjct: 241 IEKLPGCFGKKDASQVQHSDNLRLREGQTIVVVDEEEALAIKISERDDKCMKETKIYYPS 300

Query: 303 RDDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKL 362
           R DPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRA  TN+VICN HFFNTYFYEKL
Sbjct: 301 RHDPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRACSTNRVICNSHFFNTYFYEKL 360

Query: 363 KEAVSNKGKDRDNFFVKFRRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPI 422
           KEAVSNKGKD++ FFVKFRRWW+GVNIFQKAYVLIPIHEDLHWSLVIICFP KEDE+GPI
Sbjct: 361 KEAVSNKGKDKEIFFVKFRRWWKGVNIFQKAYVLIPIHEDLHWSLVIICFPHKEDEAGPI 420

Query: 423 ILHLDSLGLHSSRSIFDNIKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQV 482
           ILHLDSLGLHSSRSIFDNIK++IKEEWCYLDREV+DSDLPMP RIWKNISRRIE+KII+V
Sbjct: 421 ILHLDSLGLHSSRSIFDNIKNYIKEEWCYLDREVSDSDLPMPFRIWKNISRRIEDKIIEV 480

Query: 483 PQQKNDCDCGLFVLYFIERFIEEAPDRLKRTDLDMFGKRWFKPQEASSLRTRIKCLLKIE 542
           PQQKND DCGLFVLYFIERFIEEAPDRLKR DLDMFGKRWFKPQEAS LRTRI+ LLK+E
Sbjct: 481 PQQKNDYDCGLFVLYFIERFIEEAPDRLKRKDLDMFGKRWFKPQEASGLRTRIRSLLKME 540

Query: 543 FQDVKKQCLSGSVES-SSSDQAPNQ 564
           FQ VK+QCLSGSVE  SSSDQ P Q
Sbjct: 541 FQIVKRQCLSGSVEKISSSDQPPKQ 554

BLAST of Tan0020931 vs. TAIR 10
Match: AT1G60220.1 (UB-like protease 1D )

HSP 1 Score: 364.8 bits (935), Expect = 1.3e-100
Identity = 230/576 (39.93%), Postives = 325/576 (56.42%), Query Frame = 0

Query: 2   VMEQEKTTKKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQKL 61
           V++ + + KK   IDW      +DE P LEIV              SD Q    +  + L
Sbjct: 8   VIDVDCSEKKDFVIDWSSAMDKEDEVPELEIVNTTKPTPPPPPTFFSDDQ---TDSPKLL 67

Query: 62  SDSELEEKIRRMHQFYESTACKLPDKGQKYLRNLELSMEERESRKLRRV-EKEATGCKNH 121
           +D +L+E++ R      +    LPDKG+K    + L + + E  K RRV E         
Sbjct: 68  TDRDLDEQLERKKAIL-TLGPGLPDKGEK----IRLKIADLEEEKQRRVLEGSKMEVDRS 127

Query: 122 SQPTSSSIVGSSYV--------------------AREKTASSSADSLSPFTSCFYQKLEE 181
           S+  SS+  GS  +                    +R+  A S   S S F++ F    + 
Sbjct: 128 SKVVSSTSSGSDVLPQGNAVSKDTSRGNADSKDTSRQGNADSKEVSRSTFSAVF---SKP 187

Query: 182 KTEC-NNSAFGQELSILGHCDNRRQR---------FNG-RLSPKVKQKGQTSARQ----Q 241
           KT+  +  AFG+EL  LG C+ R+ +          NG RL P V  K + SA+Q     
Sbjct: 188 KTDSQSKKAFGKELEDLG-CERRKHKAGRKPVTRLSNGWRLLPDV-GKAEHSAKQFDSGL 247

Query: 242 PFKCANNLSTDVRKKVSSVAAQNCKSPDHLDSHASERLPGCFGKKDASQVQHSDSP---- 301
                N  S +   K   + +      D  D    +      G +   +     SP    
Sbjct: 248 KESKGNKKSKEPYGKKRPMESSTYSLIDDDDDDDDDDDNDTSGHETPREWSWEKSPSQSS 307

Query: 302 --MPRKGQTIVVVDEEEALATKIPEHDDKCMK--ESKIYYPSRDDPESVEICFEDIKCLD 361
               +   T++ VDEEEA  + + E   +  +  +  I YP+RDDP  V++C +D++CL 
Sbjct: 308 RRRKKSEDTVINVDEEEAQPSTVAEQAAELPEGLQEDICYPTRDDPHFVQVCLKDLECLA 367

Query: 362 PEGYLTSTIMNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKLKEAVSNKGKDRDNFFVKF 421
           P  YLTS +MNFY+R+LQQ+   +N++  + HFFNTYFY+KL +AV+ KG D+D FFV+F
Sbjct: 368 PREYLTSPVMNFYMRFLQQQISSSNQISADCHFFNTYFYKKLSDAVTYKGNDKDAFFVRF 427

Query: 422 RRWWRGVNIFQKAYVLIPIHEDLHWSLVIICFPQKEDESGPIILHLDSLGLHSSRSIFDN 481
           RRWW+G+++F+KAY+ IPIHEDLHWSLVI+C P K+DESG  ILHLDSLGLHS +SI +N
Sbjct: 428 RRWWKGIDLFRKAYIFIPIHEDLHWSLVIVCIPDKKDESGLTILHLDSLGLHSRKSIVEN 487

Query: 482 IKSFIKEEWCYLDREVADSDLPMPHRIWKNISRRIEEKIIQVPQQKNDCDCGLFVLYFIE 534
           +K F+K+EW YL+++    DLP+  ++WKN+ RRI E ++QVPQQKND DCG FVL+FI+
Sbjct: 488 VKRFLKDEWNYLNQDDYSLDLPISEKVWKNLPRRISEAVVQVPQQKNDFDCGPFVLFFIK 547

BLAST of Tan0020931 vs. TAIR 10
Match: AT1G10570.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 327.8 bits (839), Expect = 1.7e-89
Identity = 218/566 (38.52%), Postives = 306/566 (54.06%), Query Frame = 0

Query: 3   MEQEKTTKKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQKLS 62
           +E ++  K  LNIDWD+  G  +E P LEI+    I   +      +    VR     L 
Sbjct: 7   IELDRVKKTMLNIDWDDALG-DEEVPELEIIATDKIPPREPTLSGYEPAVSVR----SLR 66

Query: 63  DSELEEKIRRMHQFYESTACKLPDKGQKYLRN----LELSMEERESRKLRRVEKEATGCK 122
           D+EL++ ++R          KL DKG+K +RN    LE   + R  ++  +++    GC+
Sbjct: 67  DNELDDHLKRQRSLLTRLGDKLADKGEK-IRNRIGELEYEKQRRMFQQRTKMQDADNGCQ 126

Query: 123 NHSQPTSSSI-----VGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQEL 182
              +P SS +       S   + + T+ S   S S F + F   L+   +        +L
Sbjct: 127 ILEKPKSSDVFMRASTASKDTSGQGTSGSKDVSRSTFAAHFSDNLKMGPQ-PVKLVNDKL 186

Query: 183 SILGHCD-------------NRRQRFNGRLSP-KVKQKGQTSARQQPFKCANNLSTDVRK 242
             LG                N   R   RLS  KV  K   S  + P         D R 
Sbjct: 187 QDLGRGSWISKANRDSIIEKNNVWRSLPRLSKCKVSLKNFYSESKDP-------KGDRRP 246

Query: 243 KVSSVAAQNCKSPDHL----DSHASERLPGCFGKK----DASQVQHSDSPMPRKGQTIVV 302
             +    +  +S  +L    D    +++ G    +     AS +Q S S   +    ++ 
Sbjct: 247 NEAYGKGKPNESSPYLLVDDDDGDDDKVIGYETPRHWSLKASPLQ-SSSCRKKSDDKVIN 306

Query: 303 VDEEEALATKIPEHDDKCMK--ESKIYYPSRDDPES---VEICFEDIKCLDPEGYLTSTI 362
           +DE+E L+  + E   +  +     IYYPS D  +    V++  +D+KCL P  YLTS +
Sbjct: 307 LDEDEPLSPMVVEEACELPEGLPEDIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPV 366

Query: 363 MNFYIRYLQQRAFLTNKVICNYHFFNTYFYEKLKEAVSNKGKDRDNFFVKFRRWWRGVNI 422
           +NFYIRY+Q   F  +K   N HFFNT+FY+KL EAVS KG DRD +FVKFRRWW+G ++
Sbjct: 367 INFYIRYVQHHVFSADKTAANCHFFNTFFYKKLTEAVSYKGNDRDAYFVKFRRWWKGFDL 426

Query: 423 FQKAYVLIPIHEDLHWSLVIICFPQKEDESGPIILHLDSLGLHSSRSIFDNIKSFIKEEW 482
           F K+Y+ IPIHEDLHWSLVIIC P KEDESG  I+HLDSLGLH    IF+N+K F++EEW
Sbjct: 427 FCKSYIFIPIHEDLHWSLVIICIPDKEDESGLTIIHLDSLGLHPRNLIFNNVKRFLREEW 486

Query: 483 CYLDREVADSDLPMPHRIWKNISRRIEEKIIQVPQQKNDCDCGLFVLYFIERFIEEAPDR 533
            YL+++ A  DLP+  ++W+++   I E  +QVPQQKND DCGLF+L+FI RFIEEAP R
Sbjct: 487 NYLNQD-APLDLPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQR 546

BLAST of Tan0020931 vs. TAIR 10
Match: AT1G10570.2 (Cysteine proteinases superfamily protein )

HSP 1 Score: 327.4 bits (838), Expect = 2.3e-89
Identity = 218/564 (38.65%), Postives = 305/564 (54.08%), Query Frame = 0

Query: 3   MEQEKTTKKPLNIDWDELFGCKDEEPPLEIVILPAIANSKHFEMESDRQHLVREEYQKLS 62
           +E ++  K  LNIDWD+  G  +E P LEI+    I   +      +    VR     L 
Sbjct: 7   IELDRVKKTMLNIDWDDALG-DEEVPELEIIATDKIPPREPTLSGYEPAVSVR----SLR 66

Query: 63  DSELEEKIRRMHQFYESTACKLPDKGQKYLRNL-ELSMEERESRKLRRVEKEA-TGCKNH 122
           D+EL++ ++R          KL DKG+K    + EL  E++     +R + +A  GC+  
Sbjct: 67  DNELDDHLKRQRSLLTRLGDKLADKGEKIRNRIGELEYEKQRRMFQQRTKMDADNGCQIL 126

Query: 123 SQPTSSSI-----VGSSYVAREKTASSSADSLSPFTSCFYQKLEEKTECNNSAFGQELSI 182
            +P SS +       S   + + T+ S   S S F + F   L+   +        +L  
Sbjct: 127 EKPKSSDVFMRASTASKDTSGQGTSGSKDVSRSTFAAHFSDNLKMGPQ-PVKLVNDKLQD 186

Query: 183 LGHCD-------------NRRQRFNGRLSP-KVKQKGQTSARQQPFKCANNLSTDVRKKV 242
           LG                N   R   RLS  KV  K   S  + P         D R   
Sbjct: 187 LGRGSWISKANRDSIIEKNNVWRSLPRLSKCKVSLKNFYSESKDP-------KGDRRPNE 246

Query: 243 SSVAAQNCKSPDHL----DSHASERLPGCFGKK----DASQVQHSDSPMPRKGQTIVVVD 302
           +    +  +S  +L    D    +++ G    +     AS +Q S S   +    ++ +D
Sbjct: 247 AYGKGKPNESSPYLLVDDDDGDDDKVIGYETPRHWSLKASPLQ-SSSCRKKSDDKVINLD 306

Query: 303 EEEALATKIPEHDDKCMK--ESKIYYPSRDDPES---VEICFEDIKCLDPEGYLTSTIMN 362
           E+E L+  + E   +  +     IYYPS D  +    V++  +D+KCL P  YLTS ++N
Sbjct: 307 EDEPLSPMVVEEACELPEGLPEDIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPVIN 366

Query: 363 FYIRYLQQRAFLTNKVICNYHFFNTYFYEKLKEAVSNKGKDRDNFFVKFRRWWRGVNIFQ 422
           FYIRY+Q   F  +K   N HFFNT+FY+KL EAVS KG DRD +FVKFRRWW+G ++F 
Sbjct: 367 FYIRYVQHHVFSADKTAANCHFFNTFFYKKLTEAVSYKGNDRDAYFVKFRRWWKGFDLFC 426

Query: 423 KAYVLIPIHEDLHWSLVIICFPQKEDESGPIILHLDSLGLHSSRSIFDNIKSFIKEEWCY 482
           K+Y+ IPIHEDLHWSLVIIC P KEDESG  I+HLDSLGLH    IF+N+K F++EEW Y
Sbjct: 427 KSYIFIPIHEDLHWSLVIICIPDKEDESGLTIIHLDSLGLHPRNLIFNNVKRFLREEWNY 486

Query: 483 LDREVADSDLPMPHRIWKNISRRIEEKIIQVPQQKNDCDCGLFVLYFIERFIEEAPDRLK 533
           L+++ A  DLP+  ++W+++   I E  +QVPQQKND DCGLF+L+FI RFIEEAP RL 
Sbjct: 487 LNQD-APLDLPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLT 546

BLAST of Tan0020931 vs. TAIR 10
Match: AT1G09730.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 148.7 bits (374), Expect = 1.4e-35
Identity = 92/300 (30.67%), Postives = 161/300 (53.67%), Query Frame = 0

Query: 290 MKESKIYYPSRD-----------DPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRA 349
           + + K Y+PS D           DP++V IC  D++ L PE ++  TI++FYI YL+ + 
Sbjct: 399 LNQQKRYFPSFDEPFEDVVYPKGDPDAVSICKRDVELLQPETFVNDTIIDFYINYLKNQI 458

Query: 350 FLTNKVICNYHFFNTYFYEKLKEAVSNKGKDRDN--FFVKFRRWWRGVNIFQKAYVLIPI 409
               K    +HFFN++F+ KL +   +     D    F++ R+W R V++F K Y+ +P+
Sbjct: 459 QTEEK--HRFHFFNSFFFRKLADLDKDPSSIADGKAAFLRVRKWTRKVDMFGKDYIFVPV 518

Query: 410 HEDLHWSLVIICFPQK----------EDESGPIILHLDSL-GLHSSRSIFDNIKSFIKEE 469
           + +LHWSL++IC P +          + +  P ILH+DS+ G H+   + + +++++ EE
Sbjct: 519 NYNLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDSIKGSHA--GLKNLVQTYLCEE 578

Query: 470 WCYLDREVADSDLPMPHRIWKNISRRIEEKIIQVPQQKNDCDCGLFVLYFIERFIEEAPD 529
           W    +E +D D+       + +S       +++PQQ+N  DCGLF+L+++E F+ EAP 
Sbjct: 579 WKERHKETSD-DISSRFMNLRFVS-------LELPQQENSFDCGLFLLHYLELFLAEAPL 638

Query: 530 RLKRTDL----DMFGKRWFKPQEASSLRTRIKCLLKIEFQDVKKQCLSGSVESSSSDQAP 562
                 +    +     WF P EAS  RT I+   K+ F+ ++ +    S E + S ++P
Sbjct: 639 NFSPFKIYNASNFLYLNWFPPAEASLKRTLIQ---KLIFELLENRSREVSNEQNQSCESP 683

BLAST of Tan0020931 vs. TAIR 10
Match: AT1G09730.2 (Cysteine proteinases superfamily protein )

HSP 1 Score: 148.7 bits (374), Expect = 1.4e-35
Identity = 92/300 (30.67%), Postives = 161/300 (53.67%), Query Frame = 0

Query: 290 MKESKIYYPSRD-----------DPESVEICFEDIKCLDPEGYLTSTIMNFYIRYLQQRA 349
           + + K Y+PS D           DP++V IC  D++ L PE ++  TI++FYI YL+ + 
Sbjct: 367 LNQQKRYFPSFDEPFEDVVYPKGDPDAVSICKRDVELLQPETFVNDTIIDFYINYLKNQI 426

Query: 350 FLTNKVICNYHFFNTYFYEKLKEAVSNKGKDRDN--FFVKFRRWWRGVNIFQKAYVLIPI 409
               K    +HFFN++F+ KL +   +     D    F++ R+W R V++F K Y+ +P+
Sbjct: 427 QTEEK--HRFHFFNSFFFRKLADLDKDPSSIADGKAAFLRVRKWTRKVDMFGKDYIFVPV 486

Query: 410 HEDLHWSLVIICFPQK----------EDESGPIILHLDSL-GLHSSRSIFDNIKSFIKEE 469
           + +LHWSL++IC P +          + +  P ILH+DS+ G H+   + + +++++ EE
Sbjct: 487 NYNLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDSIKGSHA--GLKNLVQTYLCEE 546

Query: 470 WCYLDREVADSDLPMPHRIWKNISRRIEEKIIQVPQQKNDCDCGLFVLYFIERFIEEAPD 529
           W    +E +D D+       + +S       +++PQQ+N  DCGLF+L+++E F+ EAP 
Sbjct: 547 WKERHKETSD-DISSRFMNLRFVS-------LELPQQENSFDCGLFLLHYLELFLAEAPL 606

Query: 530 RLKRTDL----DMFGKRWFKPQEASSLRTRIKCLLKIEFQDVKKQCLSGSVESSSSDQAP 562
                 +    +     WF P EAS  RT I+   K+ F+ ++ +    S E + S ++P
Sbjct: 607 NFSPFKIYNASNFLYLNWFPPAEASLKRTLIQ---KLIFELLENRSREVSNEQNQSCESP 651

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q2PS261.8e-9939.93Ubiquitin-like-specific protease 1D OS=Arabidopsis thaliana OX=3702 GN=ULP1D PE=... [more]
Q8RWN02.4e-8838.52Ubiquitin-like-specific protease 1C OS=Arabidopsis thaliana OX=3702 GN=ULP1C PE=... [more]
Q8L7S02.0e-3430.67Probable ubiquitin-like-specific protease 2B OS=Arabidopsis thaliana OX=3702 GN=... [more]
Q0WKV83.0e-3030.74Probable ubiquitin-like-specific protease 2A OS=Arabidopsis thaliana OX=3702 GN=... [more]
A7MBJ28.2e-2023.08Sentrin-specific protease 7 OS=Bos taurus OX=9913 GN=SENP7 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
XP_022966911.13.1e-28286.35ubiquitin-like-specific protease 1D [Cucurbita maxima][more]
XP_022945620.11.3e-28085.99ubiquitin-like-specific protease 1D [Cucurbita moschata][more]
KAG7012818.15.4e-27985.82Ubiquitin-like-specific protease 1D [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_023541910.15.4e-27985.79ubiquitin-like-specific protease 1D [Cucurbita pepo subsp. pepo][more]
KAG6573744.19.3e-27985.82Ubiquitin-like-specific protease 1D, partial [Cucurbita argyrosperma subsp. soro... [more]
Match NameE-valueIdentityDescription
A0A6J1HQL51.5e-28286.35ubiquitin-like-specific protease 1D OS=Cucurbita maxima OX=3661 GN=LOC111466476 ... [more]
A0A6J1G1G76.3e-28185.99ubiquitin-like-specific protease 1D OS=Cucurbita moschata OX=3662 GN=LOC11144980... [more]
A0A6J1DC743.6e-26080.71ubiquitin-like-specific protease 1D isoform X1 OS=Momordica charantia OX=3673 GN... [more]
A0A6J1DAG01.7e-25780.35ubiquitin-like-specific protease 1D isoform X2 OS=Momordica charantia OX=3673 GN... [more]
A0A6J1DDS33.7e-25780.35ubiquitin-like-specific protease 1D isoform X3 OS=Momordica charantia OX=3673 GN... [more]
Match NameE-valueIdentityDescription
AT1G60220.11.3e-10039.93UB-like protease 1D [more]
AT1G10570.11.7e-8938.52Cysteine proteinases superfamily protein [more]
AT1G10570.22.3e-8938.65Cysteine proteinases superfamily protein [more]
AT1G09730.11.4e-3530.67Cysteine proteinases superfamily protein [more]
AT1G09730.21.4e-3530.67Cysteine proteinases superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.40.395.10Adenoviral Proteinase; Chain Acoord: 271..390
e-value: 7.6E-14
score: 53.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 117..131
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 112..131
NoneNo IPR availablePANTHERPTHR46915:SF2UBIQUITIN-LIKE PROTEASE 4coord: 56..368
NoneNo IPR availablePANTHERPTHR46915UBIQUITIN-LIKE PROTEASE 4-RELATEDcoord: 56..368
IPR003653Ulp1 protease family, C-terminal catalytic domainPFAMPF02902Peptidase_C48coord: 321..365
e-value: 4.0E-6
score: 26.8
IPR003653Ulp1 protease family, C-terminal catalytic domainPROSITEPS50600ULP_PROTEASEcoord: 306..409
score: 9.682877
IPR038765Papain-like cysteine peptidase superfamilySUPERFAMILY54001Cysteine proteinasescoord: 299..360

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0020931.1Tan0020931.1mRNA
Tan0020931.2Tan0020931.2mRNA
Tan0020931.3Tan0020931.3mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
molecular_function GO:0008234 cysteine-type peptidase activity