CSPI07G05910 (gene) Wild cucumber (PI 183967)

NameCSPI07G05910
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionUlp1 protease family protein
LocationChr7 : 4388043 .. 4407738 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGTTACTGGCTGCCTTTGGCTTCATAACCATGACTCGGACCTCCAGCTCCAAGCCATTTCCGTCGACAAGGAGAAAGGAGAGAGACATAGGTGACGGTGGAGGCAAGAGATTTTCCGTTTTCGACTTCAGCGAAGAGGACGCTCGTGTTGAGAAAGTCTCTCGAAGGTTACTCGGCAAGTTTTCTGCCCGCAGGAGCTCTCCCGTTACCAAGCATCAGTTTCTCCACTGCTGTGAGTGCCTTTGTACTACTACTTTTCAATCGCTATGGATTATTGTGGTGAAAACGGGATTTGTTTGCTTTTGGTGCTTGGGAGGCGTGAATATGATCATTTTGGTTATGGGAGACGAGAATCAGGACGGCTATTGAGTGAGAAAGGATGTTGTCTTATTTAGGATTTGGAAGGATAGATTGTTCTTGTGTGTGGAGAAGTTACCGTTAGTGGAAGGATCGTTCGGATAAATGGTTGAAATTAGTTTGCCGGTTTTCTTGTTTGCTAGACCGTAGAACAGAGCCCAGTTCGGTGGCTGGTAGAAATGAAAAACATGTAGTGCAAAAATTTAAACATAAGTTATGGAGGAAATTGTGCGTTGAGTAAATGTATGATTTGAAAAGTGAAAAAAGCAACAGATATGTGTTGTTTTAATGTTAGAAACATCTATGCAATTCTGTAAATAGAAAATTAAATACATTGATTTAATCGAACATCATTGATATCGAATTGTTTCCTTGGAATTTCCTTTCTTGTATGGAGCTTTTTTGTCTAATGAATATATGTACAGGCAGGGGTTGGTATCTCCATTAACTATCAGCTAATTTTGAACATTGCAACAAAGTGTTTTGGTTATGAACTATGATTATTCAGTTTGGCTTGTTTAGGACTCAGAAATATTCTACAAATATGTTATCATCAATAATCACGACAATGTATGACTGCGAGTTTTTATGTTGGTTTAAAGTTTCCCTTTAGATCAAACTTCAAAATGCAACCTATGTGCGCATATGTTTTGTGTTCATAATTTGGGAACTATATTAAATGTGCCGTCTGAACTATTTCATGTGTCTTCTATCCAGTTGGAAAAGGTGCGAAAAGTGTAAGCAGGAATCTTAGCGATGAGCTCATTGATATCGATGCTGAAGGTAAGGTAATATCCTTTTTTGTTCGTTTCCTTCGAGCATCACCAGTTATTTTCTTTCATATATATATATATATATATATATATATAATATTCTTTTTTTTTTTAGTTTCCTTATTACTACATTTTCTCTGGGTTTTAATTAGTTATTTATTTTGTTATAAATTTGTTTCTCCTTATCTATAGATAATACACGTGAAGTGTCTCATATTTTTTCCCACCCCTCCTTTCCTTCTCATGGGTTGACATGACTCCATGAAAATTATAACATGTTTGGCATCTGAAATTTGTTTTCTTCCTTTACCTCTGCTTCAGTTGGAAAAAGTGCCAACATTGATAGTTTCTCTGAAGATGTTAGCCACGAACTGATTCACATTGATTCTGAAGGTAAGGGTATTATCTGATTTTCTATTCTTAATTGTTAGCACTCACATTGCATCTGCTTTTAGCTTTAAAATAATCCTCGGGAATAATGTTTAAAGATTATTTAACAATCTTGTTTTATCTAGATCCAAGAGCAATTCAAATGTACACAACAGTAATTTCCAGTCTTTTGGGTAATGACAGAACAAATGAGATTGATTTTCATCTGTTAAGTGGCACAAAACACATGCCGAAGGAGACAAAGCCATTCCTAAACGTTTTCTTTGGGTTTTATTGGCTGAATTAATGCCACCTCATGCACCATTGATGTAGAAATATTCACTTGGTTTTTGAACTTTTTGATTTTCAACATGCCTTGAATCATCTCAATCTACTTTTGGAAGAATAGACAGAGTTTGAACTAAATTTTTCATTGAAAATTCATTCACTAGAGTTTCATTTCTGCATTCTTCTATTCCTCACAATGACTTCATAATTGTTGCCAACAAACCTAAGGAATTCCTCCCAGCCAAAATCTTTCTCAAGCGAAATTTAAAATATATATACCATCTATGACATCTAGTTTCTTATGAAATAGAATGTGGTAGGATAGAAAAAGATGGAGCATTTGTGAACAAGAATGGATGATCTCTCGTGCAACTATTTCCACAGTTCTTTCATGTTAACTGAAAGCTCCTCATGTTTAGTTTTTATGTGTAAATGAATCAGATTCTGTGCATTCAGATTACTTCTATTTTTATTTGTTATAATTTGTTTGTCTAACTGAAGTTCGAACCTCCCATTCAGTTTCTATCTATACTCTAACTCCAGGTCATCAAATTAGCCCGGGACCAACTGATCTTAATGCTGAAGATACATGTGCAGGTGATCCCACTTTTATGGTTTTTCTTAACATTTAAATTTATTTATGTTTTATTCCGTCTCCTAATGGTTAAGCTGGAAAAGAAGAAAAATGTTCTAGAAAGATGTAGCATTACATCTCTTCTTCCTTTTTCATTAAAATATTATTTACATACATACATATATGTATGCTGTGGGTTATTTGATGCTTCTGTTACAGTTTTTAAAATTGCACATCTTTTGTTGCATTTGCTGTAGATACCCCACTTGAAGGTGGTGGATCTATGAAGCAGGAGATTCTTGAAACCAATGATATATTGCTTTCTCGTTCTTCAACTAATGAGGTAGACACTCCAGCTTATTATGGATTTTGAAGTTTTCTCCAACTTAATATCATTTTAATCACCTGGCAAATATTTAATTCTTATAGTTTTCATGTGGGATTAACATCCTACTTATACTGGGGTGATCAAATAGCAGTTAATTTTGATATTTCATTAAAAAGGTATGCTAAATGTACTTACGCAAGTTATTTAATTTGTCTATCATGCCTGATGAGCAGTCTATTAATGAGGTTAGCTTTTTGTTTGTTCTTGTATTCTTTCATTTTTTTCTCTCAATTAAAGCAGTTTTTCTATCAAAGAAGTATTAATGAGGTTGGTGGACATGAACAAAAGGAAATTATTTTCATGGTTATACATGGGAGCATTTCATCTTCGAGAAATGAGAAATTAGGTTTGAAAAATGTCACATTATCACATGTATTTTGAAGACATTGTAACTATTTTATAGGAACTATCTTGTATAGTTGGAGGAAACCCCACTTTAGCAAGGCTTCCTTCTTTTGTGGATTTGCTCATGTATTCTTTAATGAAAGTTGTTGTTTGAATAAAAGAGCATGAAGATCATATTAATCCTAATGACCCTTTTGTCATGAAGGAACGAAATTTGAGCATCATTTCTGATTTTCAATAAAAAAAAAAAGCAAAAAAATAAAAAATAAAAATACTGTCGTGGAGAGGAATAATGGAAGTAGCTGAAAGGGTGAGGTGATTAGGGATGTTATGTCTGGTGGTTTGGTGCAAATTTCGTGGAAGAGGTATATCATTTGCCTAGCATGAAAACCCTTGAATTGCCCTATTGTTGATCTTGATGGTTTTGTACTCATTAATAGGGTTAGGTTGATTTAGGAGATTTTGGTTAAAGACTTCAATGCCGTTTCTTTGCATGGCAAGGAAAATTTTTGGAAATTTGAGGAATTAAGAACTTGAAATCTCCTTTCCAAATAGTAACTTTAAATGATCTTATAATGAGGCCTTTTCAATTGTTTATAGTGGAAGATATGCATAGACTTTCCACTATAAAACGATATTAAAGGAAGAACTTTGTTTAATGTCTTTGATGAGTACTTGTTCTTGAAGATTAATGCTATTAAAGATATATTAGTAATTAGTCAGGGAGTTTGTTATGGTATTTAGTCATAAATAGAGGGAATGAGAAAGGAGGAAGTTAGGAAATGTGTGAACAAAGGCCTGAGAGATATCTCAAGAGAGGAGGGTCTATATACCTTGAATACTTGGGGAGTTATATTGTAACTGTTTTTTCTTTTATATTTCAATATATTTGATTCTATTAAATTAGCTTTAGGGTAGTTCGATTCTGACCATGGGGGTTAGAATTAAGTGATAGTGTATATAATGGTATTACATGAAATAATGTACAAATTACATAATCAATTGAGTCATTTTATTGATGTCAATGAACAAATCAATTTGAGGAGAAGATTGAATCTGATGTAGGATACAAATGGGCATAAATTCAGAGAAGTCAAAGAAAGTTTTTATGCGGTTTAAAATACTCTCATCATTCTTGGTTTGACATATTTGCCAAAGTTATGATTAAAAGTGGTTATCATTATTGTCAAGTTGATCATACTCTATTTGTGAAATATGCAAACAACAAAATTGCAGTCCTGATTGTATAGTTAGATGATATCATCATGGCAGGAAATAATCTTTGGGAGATCCTTAATTTGAAAAGAATTCTTTGCAACTGAATTTGAAATCAAGGACTTAGGAAGTTTGAGATATTTTTTCAGAATGGATGTGGTGAGGTCTATGGATGGTATTGTTATTTCTCAACAAAAATATATCCTAGACTTGCTAAAGGAGATTGGAAATCTTAGGTTTGGACTTGTTGAAACACCCATGGGTCCAAATCCAGGTGAAGACCATAGTGAAGAGATTAGTCTAATTGGCAAGGTCATGTATCGAACGCTTGTGGGAAAGTTAATTTACTTGTCACATACCAGATCATACATCATATTCTACAAGCACTGTTAGTTAGCACATGAATAGTACAATTGAACACCATCTTAGGATTTGTTAACATGATTTTAAGATCTGAAAAAGTACTGTACATCATGATCTATTGTTACAAAGTCACCGAATAGATTGGTAGAACTTTATCTTGATGCTAGTTGTGTTGGAGACTTAACTGATCAAAGATCCACATTTGGATATTGCTCTTATGTATGAGGCAATGTAGTCTCTTGAAGAAGCAAAAAGCAGGCAGTTGTAGCTAGAAGCAATGTATAGGTTGAATATTGAGCTTTAGCTGGAAATTTGTGAAGGGGTGTGGATTTGATGGCTTTTGGAAGAACTAAGGATAGAAACTCAAAGTTCCTTTAAAATGCTTAGTGACAGTCAAACAACAATCGGCATAGCGAAGAATCTTATACATCATGACATGACAAATAACATAGAGATGGATTGACGTTTCATATCTGAGATCAACAATAATGCAGTGGAACCAAACTGTTTCACCATCAAATTTCCGACATTCTCACCAAAGCTTTATCAAGCTTGAATTGTATTTTATAACTAGTTTGCCTAGTTTGCCTAGTTTGAAGTTTTCTTTTATTATATTTACCAATTATAGGATCTTTTCCAGAAACAAGAAAAAATAAATTATTTCTCCAAACATAACACAAATACATTGATCAAGCATCTCTACTTGAATCTCCTGGAATTTTGAACCTTCAATATGTACTGGATAATAACATTATGAAGAATGGGGGATCCTAGTTTTCGAAGGGAGAGCTCATGGAGAGCTCCGTCCTTTTTGAGAGAAAATGCCATCTGGAAGCAGGGCCATGTCTTAAAGAGTTACATCCATTTCTGAAAAGAAAGTAAATAATCTTAAACTTTAGGCTCTTGGTTTCTAATGCTGCTTCTGAAATTGGGTTTTGAATGCACGTTATCTCATTTTGAAAAATATATGTATGATGGTGATGCACTAAAGGACTATAGATTAAAAGTTCAAGTTAAGTATCCATTGCATTCAGTAGTTCTGTGGTTAATCTTGATATCAGGACTTATAAGTTGAATAAAATATGTAATTTACTACCTTTTCTTTTGTTCTCTTAATCTTGAGAATTGATATCTTACTTAACTTCCTTTCAAAGAAGTGAGATGCTTGTATTAACCTTTGTGTTTTTGTAAAATTATTTTTCAGGATGATGTCACGGTTATTTTCCCTGATTTTGTCATCTATGAAGGTAACTGGTGTACAACATCGAAGTTAATATTTTCTTGTAGCTGCATTAAGTTCCGAGGTTCAGCACTGAGTGGGTTGCAAAGAACATTTGATTCCGAATGGGCTATCTCCGACATTATTGGCATTGAGTCAGAATGGTGCAGTAGGGTATAACATTGATTTTTTTTACTTCATTGTTTTATGTGTTTCTGTGTAGATCTATTTTTAAGTATTTTTCTCATGTAGGTTGAAACTGCAATTGTTAATCTTCGTCTCAAAGGAAAGCATTTCACGAGGGCTGAAAATTCAAAGGACATTTCAGGTATTTTTTATGTTGGAATCAAGCTTTTTATGGGTAACTGAAGGCTGATTATGTAAAAATATTAAACTTTTATTAGAAATACAAATCCGAACTATGTGTATGAGGTTTTGGTCTTGTTTCTAAAACACCATTTTGGGTAATCATACATATTTTTTTCTTTTTTTCCTTTCTTTCTATCTGGAGGGGAATTGGAAGACTTATTCCTACATTCAGAGATAGGAAGATGTAAACATGAAAAAGATGTCAAGCCTATAGACAGTCACCCCAACTAGGAGAGGATGTGTTACACAGTACTTTCATGAAATGGTTTGATCATGTTGTTAGGGTTAAATAGGTTTCTACAAAATGGCCTTGAGGAGGTAATGGAGGTGGCCTAATGTTGGGAGGATTAAGTCTAGGCTGTCATTGACACCCAAAATTTAATACCGCAAGCCCATTAAACTTGGTCTACGCTTTTTGGTATACCTACTGATTATCTTTTATCTTCTTCTTCTTCTTACTTAGCCCCAATGGAGGATAGTTTCATTTCTGGATGTTTTTTGGACTTCTGTTCTTGGTACTAGAATGCACGTAATCTAAGGTGACATCATTCCTGTCTTTCTTAACATTTGTGCAGGCATAGAGTTATTGAAATTTTCTGTTTGTGACCCTCTTTGGTCTGAAAGTGAAAAAGCAATCAGAACATTGAATCTTAGATATAATGACTTGTGGAGTGCAGATCATGAGTGAGTTTTTTTTCTTCTTTCTGTTTTTCTTTGGGCTTGCCTAGACTATATCTTTTCTTACATCATTTTGCTTACAATAATAATATAATATCAAGAAACTGCTTAAAGAATTGAATTATTATTATGATTATTATTTGAAGATAAGAATGGGTTAAGGTTGGAGACTGATAGCAATACAATACAATTTGATGCCAGAAAATCCTATGATTGGAATTCTGCATTTGTTTACCACCTCTTTATTTATTTATTTATTATTATTATTAATTTTAAAAAAATGTATGTTGAATTAATCAAAGTTAGGAGAGCATGAACAAATATATTGAAGTATCTTTTTCAAAAAGAAGAAGAAGAAGAAGAAAGAGAAGAAGAGGCACGAATACATTATCTAGCCATTACCTTACTTGAACAAAATCTGTTCTATAAGTTGTACAAATTTATTCATCTTATCTAGCCACTGAAAAATCTGGCATGTATTGAGCGTTTAATTTATACACTACCTGACTATTTTCAGCAGGGATGTTGCTTATCTGATCTATATTGAATTAGTTAATCTGAGAGGAAATTCCTCATTTCGAGGTTTAGATCACTATAGACAATTAACTATTTTTTATTTAACATTCTTGGCAGTGACAATGACAAGGTCAATGGGGAGGAAATTGTTTCCTGGAGGCACAGCGATGTGTTTTCTCCCAAGAATTGCTTTTCTGAGTGAGTAGAATTGTAACTTTTACATTATTCCTTCCATTCCCTCTGTTAAATACTGGAAATGCTGTTAATCTAATCTGGAAATGCATTAGATTTTCATGGGGGCCAAAGCCCGTCTGGATAACAGGAAGCTGTTTACTTCATCCAATTTACATTGATAATTGTAGACTGATGGAAATGGGTTGGGTTGTGTGGGTTTTGTAGAGAAAGACTCATCTCGTGGGTCAGATTAGGGGTTCAATTGCAAGCCTTATAGGTATTGACATCAACATTATTTTGGGAAATTATAGAATGGGGTTGCTATTGCTTTTTCCCCCTCATACCTAATTTTTTTTTATTAATGGTGTTTCTTAATGTGTCCAAACATATTAACTCTCTAGGGCTTATACTCAAAGTCCAGCTTATTTATTCAGAAATGCTCTGGAGCAACTATAAACTGTTTAAAATCCATCAAAATTGAATGCAAATTGAACCCAGGGGAATTGTCATAGTGCTAAATATCGTTGAAGCTTTTATCAAAGATTCAAGGGACTTTAAATGTTAGACTCCACATAAGTGAAGTATAGTAAAAAGAGAATCAAGACTTGGATTTTCTCCCCCTGTCATGGTTGGCATTAGACTCATAGTAAATTGTGAAAGTACCTGTAGAGAATTAGATGTATTTGAAAATGAAATTTTCTACGGTTTCTTCATGTTCCATTTCTTTTCAATTACTTGAACTTTTATCATGTTCAGGACAAAATATTAGGACTGCCTAATTTCTTTGTTTCTCTATGAGATGATAGGATGCCTCTGTTTGCATGTCTTTGAATCCCAAGCATGCATAAGAAAAAGCAAATGTATTGTGGGGAATCGTTATTTTTTCTAATAAATACTTTTGCTGCTTAGATCTTGGATAGACAGATCATTAGTTTATTTAGGGTGAAAATTGGATAATTGAAAAATTTATTCGTAGACCTTTTCAATTGTTACTTCAGTGTCTTTCATTGTTTTTATTTCCGTAGTAAAGAAAGATCTTTTTAATTGTTAATTCTTTTGGTTCTTTGGCTTTATAATGCTAGTATGCTTCTACAAAATTGTTTCTTTCATTTATTCATTTGGGTTAATGAGATCCTCCCATGTTTACTGTATGTTTTACAAATTTAACTGATCTATTCAGATTTGTCGATACTTTTGAAGAGGTCATCTATCCAATGGGGGATCCTGATGCTGTGACCATTAGTAAGAGAGACCTTGAGCTTCTGAAGCCAGGGATGTTTATTAACGATACTATCATTGACTTTTATGTTAAGTAAGTTTGAGTTTTCTCCTGTCTCTAGCAGAAGTTTGTTTCTCTTGGTTTTTGTTTGTCCTTATCTTAGAAATTGGCTGCATATATTATATTTCTTCATTTTAATCCCTATACCATGAATTGATTACCATTGGAAAACCTCTTTTTATTTTTGCTTCTTTTATTTTAGTGAGGATATTTATAGCTGAAATAAATAGTGCAGATAATTTTAGTGTACTTTCGAAATAGAAAGGCAAGAAACATCCAAATCATAATGTGAGATAGTTGCAATTTTTAGTTCTACGGTTACTTGGGATTTCATATCTTTTAACTTTCAGCCGATTAGACTGATAATGGATGGGAAATATTTCATTGATAGAATGAAACCCTTATATAAATAGGAATTCCCATCCGAGGACGTTGCAAAAATTCCTTCAGTAGGATATGAAATTGTTGAGACTATAGTCGCAAAAAGGAGAAACTTGCTTACATCAATAAAGCTTTGAATTGAAGACTCAAGAAATACAGTTCAGTGAAGAGCAAGTTGTGGGCCAAGTAAAAGGGAGTCCTGAAGTTTAGGAAGCGTCTGGGACAATGAACTATATTTAGAAAAGTCAATAGTAGAGCTAAAATGTAGAAAAGTTGTTATGAAAAAGTTTGAATGTTATTGAATTATTTCTCACTTTGTTGTTTTTACAAGAGTACAAAGCCCTAACTTAAATATAATGCATATCTAATAACATAAAAGGAAAGACTCCTACAAAAAAAAATTACATAATAATTACAGGGAATTAAATCAAATAAGTATAAATCAACAGTTATAGTTAGCTTACTTGACTTAGTAGAGTCTCCCACAACCTATGAATTCTGGAAAGTGATTGATGGAGGATTTGTACAGACGGGAGAGGTCCAACTTGTAAAAGCATCTCATTCATGCACAAAGTCTCTTGTGTAAATAAACGTAGATATTGTAACTGAAAAACTTGGGGATTGATCCCCTTGGCCAATTTGTTTGTTTGTGTCTTGTGCATTGGTGCTTCTTTAGCAAAACCATCTTATGAGCTTTTGGACGAACAAGAAGACTGACAAAGCGGAGCCTCTTAGGTCACATGGATAGGTGAAAAATTTTACTTGAGTTTTTACCTTTTGATAGTTACACTTTCACCGGCCATGGGTTCATCTTCGCTGTTAACTTGCTCTTCCTGAATGACAATGTAGTTTGAGGACTTTAATTGGTGGAGGTTTGGTTACAACCTTGCATCTTGATGCAAAAGACTTACACAAGCAGGCTGAAGTGGGTTGGGAAGCACACTTGTGCTGGTGGGTAGGGGTGTAGGTTGACGCTCACCCCAAAGAAAAGCAGGCATAGGTGTGTGCCCTGACGAATGTTGCACATGCAATGGGCGTGTGCACATACACTCAAGCTAGTAGAAATGCAGACAGGCGAGTGTGTGTGCACATGCGTATAGGGCTCAAATAGACACATGCGAGCGGCAACAAAAGTTGTGTCGGTCAGCTGGGCACCTTTGTCCAACCCTTCTCCCTCTTCACCTTTTATTGTTCGCATTTTGTGGTTGGGAAAGAGTTTCCTTTAAAGATTGGCTTTGTGTGACTTTGTCGTCTGCAAGTGCTTGGTTACTTGTCCAACGATAGCTGTGTTACCCTTTGAGAGGACTTGTGAGCCACGCCACTGTCATAGGGTAAAAGCTTAAGTAAAACCTTATTGATGCAGCCAAAGAGGTTACTCTTAGTCAACTTTCTCATTTGCCTAAAAGGATTCCAAAACTGTTTTGTTTGAAGAAGCAAGGGTGTTTTTTTGGAACAATTACAAAGTTTCAAATTCCATGGGAAAAATTGAAACCTGCCCAAATTTTAAGGTATTTTGTATGATTTAACCTGTTACTAACTAAAAAAAACAAATTGCTGACTCGATGACCAACTAACAAACTTATCTATTTAAGTAGTAGGATTGTTTATTTGATTTCAATTAGACACTTGCTTTATCTCTGTTACCACTGCCAATTCTTTCACCTCATATTTGAGCGCTTTTCCCTCCCTTTTAATTAGCAAACTGAGAATTATTACATGAAATGTTTGTAGGTATCTAAAAAATAAATTCCTTTCGGAGAAAAACAATAGGTTCTACTTCTTCAATAGTTTTTTCTTCCGAAAGCTTGTGGACCTAGACAAAGATTTATCAAGTGCTCGTGGAGGAAGGGATGCATTTCAGCGTGTTCACAAATGGACAAAGAAAGTAAATCTTTTTCAAAAGGATTATCTCTTCATTCCTGTTAACTACAGGTTATAACTTATAACTTATATATATTAAGCATTCTATTTTAATTGACTATCCTGCATTATGTACTCTGTTCTAATTCTAATTACAAGTCTTGATGACAGTCTCCATTGGAGTTTGGTTGTCATCTGCCATCCTGGTGAAGTGGTAAATTTGAAAGGCAAGATGTTTAGCACTTAATGAAAATGTCATTGGTACTAAGATAATCTTTTGTTGATTATTTCCTTTATTGTTAATAGTTTCATAGTTTTTAAAGATATATTTGTGTATTTTTTTTTTCCTTATTTTGTATTTTCCTACTTTCTATTGGGAATCTCTTCTATTTAAGAAAACCCCTTTCTTTCTATGTGAAATAAGAGATATAGTTTACACATCTAATTCATGTAATTTTTCATAATATCTTGCAGATAAAAAACATGATAACTTATCCAAGGTACCGTGCATCTTGCATATGGATTCTATCAAAGGGAGCCACAGAGGGCTGAAGTCTCTTTTTCAAAGGTAATATGCTCATATATCCATTTGATTGGTTCCAAGGAGATCTTTATATTTTTATTTTTGATGTCGTGAAGGTTGTTTCGTGAGTTCCTTGTGTTCAATGGTTTCCTGTTTCTAAGTAGGTCTCTTTGCATCATTCAAGTTTGTTTGCAGCAGTTCCTCCAAAAAAATTATGAACTAACAAAAAAAGGAGGTGATGAATTGGGATTTTTTTAAAAAAAATATTTTTATCACAAGATTACATTATTTCAATAGATTTCTGGTTTCTAACAAGTGGGTGGAACCATTTTGAGAATTCAAGAGAGCAAGGCAAGTGCGTGCCATCACTTAGCCAAACAGTTCGCGAAGAACTTCAGGCTCCCTCTCTTCCGAACATAACGACATTTCCCTTCTGTATTCTCCAACTCATTCAAAATGCATTTGATGTAATATAGAATAGTTAGTAAAACAGTTTAATTGTTTAAACTGTTAACTGTTAAAACTGTTTCATAAAAACAGTTATATCAACTACTTGTAACTAGTTACACATTCTTAATAAACACCCCTTCTCTTAGCCATTAGAGGCAGATAATTCATTTTGAGAAATAATTCTACTTTGATTTACATCAGCATTGGCTCATAGATTTAGGTGAGAAATCCATGATCTGTTTTAGCAAGGCTAAAGATCTGTTTGGAATTAAAAGGGCTCCTAATTAAACTTAAGTTGATGGCTCCTAAATAAAGTTAAGCTGGGATAAAAGGTAGGTGACAAAATAAAACCTCGTTCTCTGTATTAGGTGACTACCTAGATTGAAAAAAATTGTACAATTTACAGTACTATACTAATGGTCTTTAGAACTGACTGGAAGTTGTTTTGGAACTTTCGTTTTGCATTGTGAATCAATGTTATCTTCTTTGTTGTCATCAAGCATTTCACCAGGAATATCTTTCATGGTATTTGAGTTAATTTTAATGATTTTTTTTCTGCCTTTTGTTCTCCTTTATTCATGTTTTCTCTCTGAATCTTACATCTTAGTTTATCTAATCATTATAAATTTCTTGTGATGTTAGTTTAAAAGCCTATATTTTGTAATGTTCTTCGTACTTTGAAATTCTAATGTCAAAGGTTACCGATAATTATTTTAGATTTTAACGTGACAGTTACTTATGTGAAGAGTGGAAAGAGAGGTATGGTGATGGAGATTTCAAAGATATTTCTGCAGTGTTCTTAACCTTGCCATTTATCCCTCTAGAGGTATGGAAGTATTAAAAAATTTAGTAGAGTCCAGGGAGACTAAAATTTTGACTGTGTTGCCATTGTCAAGATCCTTCATTTTCTCTGCCCCCCCAACCCCCCAACCTCTCACACCTCAACACAATATTAATAACAATAATAATAAAATAGAGGAAAAAAGAAATTAAAAGAACAGTAAATTGAATAATACTTTTCATATTTTATGTTATCGTATAATCTTTCTTTTTATTTCTGAATTTTTATTGTAGGTTGAGAAATTTCTTTTTGAGGTTGAGTGCATGTCTCAAATGGTACAGGATAGGAAATGTCCCCACTTTTAAATGTTGGTTAATATGCGAAGGATATGGAGTTGGGCCTCATTCTCCTAGGTTCCGTTTTTTCCTCCTCCTGAACTGCACTGTTGATTTAGATCTCCAGGTGTCTAATTAAGCAACCAAATATCTCTTGCTGCATACATACTTGTAGCTTGAGAATGATAAACGATATTTATCTTATATTGAGATGCCTTACTGGTCAGGCACTCACGATTATCAATAAAACCTTGATTAATTTGGGGTTCTATGATGGAAGATTTCCCAGGTTTTGTGTTTAATTGATGCTCTAGGCGAAGAGCTTAATAAATAAAACTTCTTGATATCTCTTTTTATTAGTGTTTTATCACATGGGGTGGAGCATAATGCCACAATGACCAAATAACGATATGTTACCTTTGTACGCCTGAAATGTGTATAGCATATACACAATGATTTTTTTCTCTTTTTCTGGGTGAGGGGAGGAGGGGGCTGTTGAGAGTTAGTCAACTAAAATATAAACACTCTGTGCGTCAACAATTTTTTGGAAGATTCGTCTGATAGTAGATGTTTGTGTAATCTGCGTTCCCATGTCTAACATTGACTTTTGTTTTTCCTGCTTGGTCAGTTGCCACAACAAGAAAATTCATTTGATTGTGGTCTCTTCTTACTCCATTATGTGGAACTTTTTCTGGAGGGTGCGCCAGTGAACTTCAGCTCTCTCAAAATCTTGAAGTTCTCGAATTTTGTAGGTTCTTTATCCTTAAATTACAAATTCTCTATATGTGGCGTATCCTGTTGGTTCTATCCACTTACTAAACATTTGTCTTTTTAATATTTGCTACCTTATATTCAAGATTGTCTACTCATGAGTCAAAATGTTTATCGGAATTAAGATTTAGTATATCATGTATCCTAAATGTTGAGGAAAGTGCAAAATCTATCTCCACATTGGTGTGGTATTATCTACGTTGAGCCTAAACTCTTACCATTTTCTTTTGATTTCATCTAAAATGACTCATTTACCCATTGAGATAATTGTCCTTACTTATATACCATGGACCTCTATCTTTTCCAATCAATGTGGGACTTCAATTGGACCTAAGAATCCTCTCCTCAAACCAATTTTCACACTTGAGAGCCTCCCCTCAAAAAATCCATCCACCCTGACAAAGCTTGGTCTTTACCTGGTGGCACAACCAATCAATTCCAAGGAAACATTTGAACAGTGCATCAGCATTGTTCACACTGACGATGAGAGCTTTTTTCTCTTATCAAGGTGAGTTGAGAACATCCAGAGAGAGCAGAGCATAGATCACATTGTGGCACCTTCTCTCTCAAACAAAGATATATCACACAATCTTGCCAATAGAGATAATTGTCCTCACTCATATACCATTAATGTCTTTTTTCGCTAGCTAATGTGAAGCAATTTGAGGACTGATCTAAGATGTACGTGATATCTTCAAGAGAGACAATTTTCTGCATATGCAACGGGAAGAATTCAATCAAGCAGAATAAAAGAGGAATACCAAGCCCACAAAGAAGAAGAAAAATTGTGCAATAAAGGAGAATTAGATGATTAGATGAAAACATGAATGCGATATGACTGAAGACAAAGGTGAATCATAACTATTGTTAAAAAAGTTATAGGCGTTGTTGCATTAGACGTGATGACAAAATCTTTTGGGTTATTGTATGGGACGGGGAGGATGAAAACACGTCTAAAAGGAATAACTGGGCGATTCTTTTAGACCCATTACCCCAACCCGATCAATAGATTTTGAGGTTCCCATGGAAAGTCAATGGGATGTAGATGAGGAAGAGACATACTAAGAAATTTGTGTTGTTGCTTGTGAAGACGTAGTAAAACAGAAAAAAACTGGAACTGCACATTTGACATGAGCGACGAGTCTTTTACCCCAAACCCTACACAACCCGACTTGTAGCCAATTCTCTTTTCCTTTTCTTAATTGAACCAAACCCAATGCAAGCCAAGCCCAACTTTTTTTGTGGAATCAACTCAACCCATTTTCAAAATTGTTCAATTAAGCAAATTTTCCTCAAGTCATACTACTATTAGCGATTAATTGGCTTGAAGATGAGTTTCTTTTACATTTTGCATCGTTCTAACTATAAAAGGTTGTTTGAAGAACTGAGTTGAGATATGATGTCTGAAATTCATATGAATGTGGAGTTCATGTGTCTGGAGAGTTCACATTACTGTGTTTAGAGTGCAGAGTTGTTTTGAGTTCGTGTTTGCGTTTGTGGTGCAGAGTTGTTTTGAGTTCATGTTTGTGTTTGTGGTGCAGAGTTGTGTTCATATGTGAGGGATGTCTATTACATTTTTTTGAGTTCTAAATATTCTATTTTTATGATGGAAATTTTTAGTTTTTAAACTTTTTTATTCAATAATTTTACCATTCTTTTTTTTTAATGTTTGAATAAAATATGAATTGCATTTTTTCTTTCTTATGGTGATGTAAACAATTAACCCACGACATTTCATGCACAAACATGGGTAAATTTAGTGAGAAACAAAAGAGAAATCAAAACTTTTGATTCCTAATTTTTCTCTATGAATTTCATCAGCATCATCTTCATTTTTTTTCTTCCTACTGTTGCTTTGTATTTTTCTCATGAAACTTTTTTGAATTGAATCCCCCACGTCATCTTAGCAACCAACGAAATGCCACCACATTTCTCCAACAATTTCACATCTAATGCCTCAAAACCTGTATTTAAACACACATTCGCTAGCCAAGTTTTTTTATTTCTTCTTTTTAGAGCATATTACTGGTCGAATTGGTAGATAGAGTTAGTTGTTGAAGATGAACAACAAAGTTGGTGGTTTGAGTTAGTTGCTCAAAGTGATAGTTGGAGTTAGCCGCCAAATGTGATCGTTGTTAAGGTGTATTGTTAGAGATGGTCTTCGCTGTGGTTAGTTGCCTGCAGTGAATGTTGGTATTGAGCATTAGAGGTGATTGTCATAGGATTTGATCATCAGAGGTGATTTTTGGAGCTCGAAGTTGAAAGTGGTTGTCAAAGTTGATTATCAAATGTGACTTTTGTAAAGAATTGTAGTAGTAGGTGGTTATCGGAGCCAAAAGTTATCTTGGATTTTAACAGAGGAAGCAGAATTTGGGAGAGAAAACTTACTGACTGCAGTAAATTCCTTTATTTCTGTATGTGAAGAATATTTACATCAACCAAGCTTGGTATTTATAGATGCTTGGCAGCAACTAATAACTGTCTAACAGATTAACTAATACAACTAATAACTATCTAACAGATTAACTAATACAACTAATAACTATCTAACAGATTAACTAATACAACTAATAACTATCTAACAGATTAACTAATACAACTAATAACTATCTAACAGATTAACTAATACAACTAATAACTATCTAACAGATTAACTAATACAACTAATAACTATCTAACAGATTAACTAATACAACTAATAACTATCTAACAGATTAACTAATACAACAGAAAATAACAGTCTAACTACTTTTATTGTTAATAGATTTGGTTGTGGGAGTTTTTCGTCGAAGCCGGTTGTTGGTTGCATTGAGGTGGTTGTCAAAGTTGATCACTGAAGGTGATTTTAGTCCAAGTTTGTTGTTGGAGGTGGTTAGTGAAGCCCATAATTGGTTTGTTGAGTTGGTCATATGAAAGTTGAAGTCGAAGTGGTCACAGAAGTTCGTCGTCGGAGTTGGTTATCGTGGCACGAAGTTGGACGTCGTTTGAGTTGACCGTGTGTAAGTGGTCATCGAAGAGGGTTCTTGGAGTGCGGCAATGGAAGTCGTCAACAATGGACGATGGGTGGACCCATTGAAAACATTGGAAGAAGGGAGAGTTGAGTGTGTCTGTATAGGATACCAACTCAAGTCCACCAACATTTAAAGTTGGTGGGCAAACATTGAGTTGGTATGAGTCAAACACCTCATACATGTTATCCTATTGCAAAGGAATTATGCAAAATGTGAACCCAATAGGGGCCGAGCTCGATATATGCAAACTCTATGGCTAGACTTGGGCTAGTGATTTATAGGATTGAGGAAATAAGTTGTAACCATATTTGTATATGGAGAATATTTGCCACTGGCTTCTGTATCCAGGCACCCCAAGATCTCCAATAAATACAACAAATAGACTCCCAATCTTGGTACGCAGATACTTCAATACCTCGGTATGTCTTGTATATAGTTATGAATATGGACTCTCGAGCTTTGTATTTGACAATCTGACTAAACGTCCAAGTTCCAATTTCGTAACCTCCACACTTACTCAACTTAATCATAAGGACAGTTCACAGCACCTTTCTTATTCTCCATTCCCATAAGCCAACACTTCTCATGATAACTTAATGCAAGCACAACCCAATTTTAATCTTCTCACAAGCATACTTAACATCATCAATCAAATCATGTTGCATTCTTAGTCAATCCAAACTATCAAGTGTACATTAGACATAAGAATCATGCATTCTCTCACAAGATTTCTCCCATACTGGCACACTGGTTCTTAAACAACATTTGACATCAAGAACCCATAAGAACAGGGTTGTAACTCACCAATTTATGCATTCTATCAGTTACTATAACATGCTATTTCATCGATAATGTCTGTCATGCATTTTCTTAACACTATATTTATTTAGCACTTGGGCAGTTGTACCTTAATGTGCACCTCCTCTATTTTCAGTGCCTTTATCGTTTTGTAGATTTTTTATATTTATGATGCAAGTACTGAACCTTATTTTTCTCTAATGGCCTTACATTGGTAACAGTTACTGTTTTGCCTTTATTTGTGTAAGTTTCTGCATGTCTAACATTAATACGATCTCCAGCTAAGCCAGGACTGGTTCCATCCTGCGGAGGCTTCTCTAAAACGTGCACATATCCTAAAGTTAATTTATGAAATCATGGTCTGTAATCAAGCAAAGGAACTCTCTGGTAGCATTGGTAAATATCCTTCTTCTGATGCTAATGACTCGGACAATGATTTATCAAAACACGTGTCTGGACAGGCACATATTTTCACAATGACCCATTCTGACAACTTTTCGAGTGTTGGAGAAGAAGTTGGATCAGTGTCCAAAGTATCATCTGACACAAATTATCAACGAATAGGAAGATGGGAGAGTGTCATGCCACCCATTGAGGTAATTTATTGTTTTCTTTCATATAATGATTATACTTCTCGTTTGCTTACTGATTCACAGTCATCGGTTATGGAGGTGATTGTGTGATATACTAGTTTTCGTTTGGCTCGTCTCTACCAGCATAATATGTGTTCTTCAATTTAGAGGCTGTGATAGTTTTTGGGATGGAGTCGAAAATGGGTGTTAAATACGTCATTATAATGTCCCTGAATTGTAGTGTAGATTTGAACAATGTGCCAGTGTGAGGGAAACTAGAACATATACAAACATTGATAAGTCAACAAATGGGTGTCTTATGTCTAGGAGAAACAATAGTATAGAAAAGAAAAAGCGCCATCCTTCGATTCTCATATTGTTCTCTCTTGGGTGTTAAAAAGGAAAAAAAAAGAAAAAAAAAGTTAAAATGCCATTTTTCTCTATTTTAGTCTTCGTAAAATGAGAAAGCAGCCCAAACAATTTGTCTTTTGTAGAGAGTAGTTATAAGTCTTTACTATTATATTTACCATAGTGATGTTTACATACAAATTATGCAATTTTTCATATTATATTATCTAAAAGAAGTGCATTTGCTTTTGTTTTCATTTCCTTTTCTGGGGGCGCAGGAAGATGAGAATGGTGAAAGACCTGATTCACCACAATTCTTAGAAGATCGCCCCCAAGCTTCGGCAGTTTCTGAATGTTCATCGGCCTTCAGTTTCGGCCAACAATTTACGGAATTAGAAATATGTTGGGAAGGAAGATATTCTAAAAACGTAAAAGAAATGTGTAGAAAACCTTCTCCACGGCTATCACTTCATGAGTTGCAAACGCCATTGGAATTGGGACAACCAGAGATCTTAACCAGCTCAAGTGATGAACTTATCAATTGTGTAGTAGAGGACTCAGAGGAGGAAGGAAATGAAAGGAATGAGAGAATCGAGATCCAAGTTTCTTCCTCCTCCTCCTCCTCCTCCTCAAGGAACAACTTGTTCCTATCAAGGCAAGTGGTTGAATCTCCTGCAAACTTTAGTGATAATAATAGACAACATGAACATAAATAATCACATAACTAAGAAATGAACATCCATCCAACACACATGATTAGGGTTTGTAGGCGAAATTAATCAAAACTCATTAGAAGCCATATTTTGAAATGACTTAAGATAGGTGGTTGTGTATAATATTAAATCTCCTTTTTCACAGCCTTTGTAAAGTTATTGTCTGTTTGTGAATATATATATATGTGTGTGTGTGTAGGAACCTTAGCATACTTTCCACCACCTCCCATCTTTTCATGACTCCATC

mRNA sequence

ATGACTCGGACCTCCAGCTCCAAGCCATTTCCGTCGACAAGGAGAAAGGAGAGAGACATAGGTGACGGTGGAGGCAAGAGATTTTCCGTTTTCGACTTCAGCGAAGAGGACGCTCGTGTTGAGAAAGTCTCTCGAAGGTTACTCGGCAAGTTTTCTGCCCGCAGGAGCTCTCCCGTTACCAAGCATCAGTTTCTCCACTGCTTTGGAAAAGGTGCGAAAAGTGTAAGCAGGAATCTTAGCGATGAGCTCATTGATATCGATGCTGAAGTTGGAAAAAGTGCCAACATTGATAGTTTCTCTGAAGATGTTAGCCACGAACTGATTCACATTGATTCTGAAGTTTCTATCTATACTCTAACTCCAGGTCATCAAATTAGCCCGGGACCAACTGATCTTAATGCTGAAGATACATGTGCAGATACCCCACTTGAAGGTGGTGGATCTATGAAGCAGGAGATTCTTGAAACCAATGATATATTGCTTTCTCGTTCTTCAACTAATGAGGATGATGTCACGGTTATTTTCCCTGATTTTGTCATCTATGAAGGTAACTGGTGTACAACATCGAAGTTAATATTTTCTTGTAGCTGCATTAAGTTCCGAGGTTCAGCACTGAGTGGGTTGCAAAGAACATTTGATTCCGAATGGGCTATCTCCGACATTATTGGCATTGAGTCAGAATGGTGCAGTAGGGTTGAAACTGCAATTGTTAATCTTCGTCTCAAAGGAAAGCATTTCACGAGGGCTGAAAATTCAAAGGACATTTCAGGCATAGAGTTATTGAAATTTTCTGTTTGTGACCCTCTTTGGTCTGAAAGTGAAAAAGCAATCAGAACATTGAATCTTAGATATAATGACTTGTGGAGTGCAGATCATGATGACAATGACAAGGTCAATGGGGAGGAAATTGTTTCCTGGAGGCACAGCGATGTGTTTTCTCCCAAGAATTGCTTTTCTGAATTTGTCGATACTTTTGAAGAGGTCATCTATCCAATGGGGGATCCTGATGCTGTGACCATTAGTAAGAGAGACCTTGAGCTTCTGAAGCCAGGGATGTTTATTAACGATACTATCATTGACTTTTATGTTAAGTATCTAAAAAATAAATTCCTTTCGGAGAAAAACAATAGGTTCTACTTCTTCAATAGTTTTTTCTTCCGAAAGCTTGTGGACCTAGACAAAGATTTATCAAGTGCTCGTGGAGGAAGGGATGCATTTCAGCGTGTTCACAAATGGACAAAGAAAGTAAATCTTTTTCAAAAGGATTATCTCTTCATTCCTGTTAACTACAGTCTCCATTGGAGTTTGGTTGTCATCTGCCATCCTGGTGAAGTGGTAAATTTGAAAGATAAAAAACATGATAACTTATCCAAGGTACCGTGCATCTTGCATATGGATTCTATCAAAGGGAGCCACAGAGGGCTGAAGTCTCTTTTTCAAAGTTACTTATGTGAAGAGTGGAAAGAGAGGTATGGTGATGGAGATTTCAAAGATATTTCTGCAGTGTTCTTAACCTTGCCATTTATCCCTCTAGAGTTGCCACAACAAGAAAATTCATTTGATTGTGGTCTCTTCTTACTCCATTATGTGGAACTTTTTCTGGAGGGTGCGCCAGTGAACTTCAGCTCTCTCAAAATCTTGAAGTTCTCGAATTTTCTAAGCCAGGACTGGTTCCATCCTGCGGAGGCTTCTCTAAAACGTGCACATATCCTAAAGTTAATTTATGAAATCATGGTCTGTAATCAAGCAAAGGAACTCTCTGGTAGCATTGGTAAATATCCTTCTTCTGATGCTAATGACTCGGACAATGATTTATCAAAACACGTGTCTGGACAGGCACATATTTTCACAATGACCCATTCTGACAACTTTTCGAGTGTTGGAGAAGAAGTTGGATCAGTGTCCAAAGTATCATCTGACACAAATTATCAACGAATAGGAAGATGGGAGAGTGTCATGCCACCCATTGAGGAAGATGAGAATGGTGAAAGACCTGATTCACCACAATTCTTAGAAGATCGCCCCCAAGCTTCGGCAGTTTCTGAATGTTCATCGGCCTTCAGTTTCGGCCAACAATTTACGGAATTAGAAATATGTTGGGAAGGAAGATATTCTAAAAACGTAAAAGAAATGTGTAGAAAACCTTCTCCACGGCTATCACTTCATGAGTTGCAAACGCCATTGGAATTGGGACAACCAGAGATCTTAACCAGCTCAAGTGATGAACTTATCAATTGTGTAGTAGAGGACTCAGAGGAGGAAGGAAATGAAAGGAATGAGAGAATCGAGATCCAAGTTTCTTCCTCCTCCTCCTCCTCCTCCTCAAGGAACAACTTGTTCCTATCAAGGCAAGTGGTTGAATCTCCTGCAAACTTTAGTGATAATAATAGACAACATGAACATAAATAA

Coding sequence (CDS)

ATGACTCGGACCTCCAGCTCCAAGCCATTTCCGTCGACAAGGAGAAAGGAGAGAGACATAGGTGACGGTGGAGGCAAGAGATTTTCCGTTTTCGACTTCAGCGAAGAGGACGCTCGTGTTGAGAAAGTCTCTCGAAGGTTACTCGGCAAGTTTTCTGCCCGCAGGAGCTCTCCCGTTACCAAGCATCAGTTTCTCCACTGCTTTGGAAAAGGTGCGAAAAGTGTAAGCAGGAATCTTAGCGATGAGCTCATTGATATCGATGCTGAAGTTGGAAAAAGTGCCAACATTGATAGTTTCTCTGAAGATGTTAGCCACGAACTGATTCACATTGATTCTGAAGTTTCTATCTATACTCTAACTCCAGGTCATCAAATTAGCCCGGGACCAACTGATCTTAATGCTGAAGATACATGTGCAGATACCCCACTTGAAGGTGGTGGATCTATGAAGCAGGAGATTCTTGAAACCAATGATATATTGCTTTCTCGTTCTTCAACTAATGAGGATGATGTCACGGTTATTTTCCCTGATTTTGTCATCTATGAAGGTAACTGGTGTACAACATCGAAGTTAATATTTTCTTGTAGCTGCATTAAGTTCCGAGGTTCAGCACTGAGTGGGTTGCAAAGAACATTTGATTCCGAATGGGCTATCTCCGACATTATTGGCATTGAGTCAGAATGGTGCAGTAGGGTTGAAACTGCAATTGTTAATCTTCGTCTCAAAGGAAAGCATTTCACGAGGGCTGAAAATTCAAAGGACATTTCAGGCATAGAGTTATTGAAATTTTCTGTTTGTGACCCTCTTTGGTCTGAAAGTGAAAAAGCAATCAGAACATTGAATCTTAGATATAATGACTTGTGGAGTGCAGATCATGATGACAATGACAAGGTCAATGGGGAGGAAATTGTTTCCTGGAGGCACAGCGATGTGTTTTCTCCCAAGAATTGCTTTTCTGAATTTGTCGATACTTTTGAAGAGGTCATCTATCCAATGGGGGATCCTGATGCTGTGACCATTAGTAAGAGAGACCTTGAGCTTCTGAAGCCAGGGATGTTTATTAACGATACTATCATTGACTTTTATGTTAAGTATCTAAAAAATAAATTCCTTTCGGAGAAAAACAATAGGTTCTACTTCTTCAATAGTTTTTTCTTCCGAAAGCTTGTGGACCTAGACAAAGATTTATCAAGTGCTCGTGGAGGAAGGGATGCATTTCAGCGTGTTCACAAATGGACAAAGAAAGTAAATCTTTTTCAAAAGGATTATCTCTTCATTCCTGTTAACTACAGTCTCCATTGGAGTTTGGTTGTCATCTGCCATCCTGGTGAAGTGGTAAATTTGAAAGATAAAAAACATGATAACTTATCCAAGGTACCGTGCATCTTGCATATGGATTCTATCAAAGGGAGCCACAGAGGGCTGAAGTCTCTTTTTCAAAGTTACTTATGTGAAGAGTGGAAAGAGAGGTATGGTGATGGAGATTTCAAAGATATTTCTGCAGTGTTCTTAACCTTGCCATTTATCCCTCTAGAGTTGCCACAACAAGAAAATTCATTTGATTGTGGTCTCTTCTTACTCCATTATGTGGAACTTTTTCTGGAGGGTGCGCCAGTGAACTTCAGCTCTCTCAAAATCTTGAAGTTCTCGAATTTTCTAAGCCAGGACTGGTTCCATCCTGCGGAGGCTTCTCTAAAACGTGCACATATCCTAAAGTTAATTTATGAAATCATGGTCTGTAATCAAGCAAAGGAACTCTCTGGTAGCATTGGTAAATATCCTTCTTCTGATGCTAATGACTCGGACAATGATTTATCAAAACACGTGTCTGGACAGGCACATATTTTCACAATGACCCATTCTGACAACTTTTCGAGTGTTGGAGAAGAAGTTGGATCAGTGTCCAAAGTATCATCTGACACAAATTATCAACGAATAGGAAGATGGGAGAGTGTCATGCCACCCATTGAGGAAGATGAGAATGGTGAAAGACCTGATTCACCACAATTCTTAGAAGATCGCCCCCAAGCTTCGGCAGTTTCTGAATGTTCATCGGCCTTCAGTTTCGGCCAACAATTTACGGAATTAGAAATATGTTGGGAAGGAAGATATTCTAAAAACGTAAAAGAAATGTGTAGAAAACCTTCTCCACGGCTATCACTTCATGAGTTGCAAACGCCATTGGAATTGGGACAACCAGAGATCTTAACCAGCTCAAGTGATGAACTTATCAATTGTGTAGTAGAGGACTCAGAGGAGGAAGGAAATGAAAGGAATGAGAGAATCGAGATCCAAGTTTCTTCCTCCTCCTCCTCCTCCTCCTCAAGGAACAACTTGTTCCTATCAAGGCAAGTGGTTGAATCTCCTGCAAACTTTAGTGATAATAATAGACAACATGAACATAAATAA
BLAST of CSPI07G05910 vs. Swiss-Prot
Match: ULP2A_ARATH (Probable ubiquitin-like-specific protease 2A OS=Arabidopsis thaliana GN=ULP2A PE=2 SV=2)

HSP 1 Score: 456.8 bits (1174), Expect = 4.8e-127
Identity = 249/567 (43.92%), Postives = 348/567 (61.38%), Query Frame = 1

Query: 26  KRFSVFDFSEEDARVEKVSRRLLGKFSA----RRSSPVTKHQFLHCFGKGAKSVSRNLSD 85
           K   VFD+S+ED RVE+ S++LL KF +    +    + K++FL CF K  +S S+ L  
Sbjct: 13  KPIDVFDYSDEDDRVEEESKKLLRKFDSPVTKKHHCAIDKYEFLRCFAKDTQSESKVLQH 72

Query: 86  ELIDIDAEVGKSANIDSFSEDVSHELIHIDSEVSIYTLTPGHQISPGPTDLNAEDTCADT 145
            +ID++  V +  +    S D + +LI + S  S   +                      
Sbjct: 73  IVIDVEVPVKEEPSRCELSGDGNSDLIDVISNGSHRRI---------------------- 132

Query: 146 PLEGGGSMKQEILETNDILLSRSSTN----------EDDVTVIFPDFVIYEGNWCTTSKL 205
              G  S+    L  ND + +  +TN          E+   +I PD +IY   +CT SKL
Sbjct: 133 ---GIDSLTSSSLSENDEVSTGEATNPASDPHEVDPENAQVLIIPDVIIYGDIYCTNSKL 192

Query: 206 IFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLRLKGKHFTRAEN 265
            FS +C+    S+++  + TF  +W I DII IES+WC  VETA VN+ LK +     + 
Sbjct: 193 TFSRNCMNVESSSVNATKGTFSCQWTIEDIIKIESQWCLEVETAFVNVLLKSRKPEGVDI 252

Query: 266 SKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLWSADHDDNDKVNGEEIVSWRHSDV 325
           +KDISGI+LLKFSV DP WS+  + IR+L+ RY ++W       D +   E +++   D+
Sbjct: 253 AKDISGIDLLKFSVYDPKWSKEVETIRSLDSRYKNIWF------DTITESEEIAFSGHDL 312

Query: 326 FSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIIDFYVKYLKNKFL 385
            +     +   D+FE+++YP G+PDAV + K+D+ELLKP  FINDTIIDFY+KYLKN+  
Sbjct: 313 GTS---LTNLADSFEDLVYPQGEPDAVVVRKQDIELLKPRRFINDTIIDFYIKYLKNRIS 372

Query: 386 SEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQKDYLFIPVNYS 445
            ++  RF+FFN FFFRKL +LDK   S  GGR+A+QRV KWTK V+LF+KDY+FIP+N S
Sbjct: 373 PKERGRFHFFNCFFFRKLANLDKGTPSTCGGREAYQRVQKWTKNVDLFEKDYIFIPINCS 432

Query: 446 LHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHR-GLKSLFQSYLCEEWKER 505
            HWSLV+ICHPGE+V       +N  +VPCILH+DSIKGSH+ GL ++F SYL EEWK R
Sbjct: 433 FHWSLVIICHPGELV---PSHVENPQRVPCILHLDSIKGSHKGGLINIFPSYLREEWKAR 492

Query: 506 YGDGDFKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPVNFSSLKILKFS 565
           + +    D S     +  I LELPQQENSFDCGLFLLHY++LF+  AP  F+   I + +
Sbjct: 493 H-ENTTNDSSRA-PNMQSISLELPQQENSFDCGLFLLHYLDLFVAQAPAKFNPSLISRSA 540

Query: 566 NFLSQDWFHPAEASLKRAHILKLIYEI 578
           NFL+++WF   EASLKR +IL+L+Y +
Sbjct: 553 NFLTRNWFPAKEASLKRRNILELLYNL 540

BLAST of CSPI07G05910 vs. Swiss-Prot
Match: ULP2B_ARATH (Probable ubiquitin-like-specific protease 2B OS=Arabidopsis thaliana GN=ULP2B PE=1 SV=3)

HSP 1 Score: 421.8 bits (1083), Expect = 1.7e-116
Identity = 211/415 (50.84%), Postives = 285/415 (68.67%), Query Frame = 1

Query: 173 VIFPDFVIYEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRV 232
           ++  ++VI +   C  S +IFSC+ IK +    +  +  F  E+ + DI+ I+  W   V
Sbjct: 243 IMTSEYVILKDMHCAASLVIFSCNGIKIKSFLANNEEVPFSCEFGVEDIVSIQYNWYQNV 302

Query: 233 ETAIVNLRLKGKHFTRAENSKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLWSADH 292
              I+ +R+      + EN  +   +E LK +V +  W   ++ I +L+++Y  +W+ D 
Sbjct: 303 GLIILRIRV----LLKDENCHE--DMEELKIAVKEHNWPNKQQKINSLHVKYPAVWNTDL 362

Query: 293 DDNDKVNGEEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGM 352
           +D+ +V+G  +           K  F  F + FE+V+YP GDPDAV+I KRD+ELL+P  
Sbjct: 363 EDDVEVSGYNLNQ--------QKRYFPSFDEPFEDVVYPKGDPDAVSICKRDVELLQPET 422

Query: 353 FINDTIIDFYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKW 412
           F+NDTIIDFY+ YLKN+  +E+ +RF+FFNSFFFRKL DLDKD SS   G+ AF RV KW
Sbjct: 423 FVNDTIIDFYINYLKNQIQTEEKHRFHFFNSFFFRKLADLDKDPSSIADGKAAFLRVRKW 482

Query: 413 TKKVNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSH 472
           T+KV++F KDY+F+PVNY+LHWSL+VICHPGEV N  D   D+  KVPCILHMDSIKGSH
Sbjct: 483 TRKVDMFGKDYIFVPVNYNLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDSIKGSH 542

Query: 473 RGLKSLFQSYLCEEWKERYGDGDFKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVEL 532
            GLK+L Q+YLCEEWKER+ +    DIS+ F+ L F+ LELPQQENSFDCGLFLLHY+EL
Sbjct: 543 AGLKNLVQTYLCEEWKERHKETS-DDISSRFMNLRFVSLELPQQENSFDCGLFLLHYLEL 602

Query: 533 FLEGAPVNFSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELS 588
           FL  AP+NFS  KI   SNFL  +WF PAEASLKR  I KLI+E++  N+++E+S
Sbjct: 603 FLAEAPLNFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELLE-NRSREVS 641

BLAST of CSPI07G05910 vs. Swiss-Prot
Match: ULP1C_ARATH (Ubiquitin-like-specific protease 1C OS=Arabidopsis thaliana GN=ULP1C PE=1 SV=1)

HSP 1 Score: 140.6 bits (353), Expect = 7.6e-32
Identity = 89/266 (33.46%), Postives = 143/266 (53.76%), Query Frame = 1

Query: 326 EEVIYPMGDP----DAVTISKRDLELLKPGMFINDTIIDFYVKYLKNKFLSEKNN--RFY 385
           E++ YP  D     D V +S +DL+ L PG ++   +I+FY++Y+++   S        +
Sbjct: 315 EDIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPVINFYIRYVQHHVFSADKTAANCH 374

Query: 386 FFNSFFFRKLVDLDKDLSSARGGRDA-FQRVHKWTKKVNLFQKDYLFIPVNYSLHWSLVV 445
           FFN+FF++KL +    +S     RDA F +  +W K  +LF K Y+FIP++  LHWSLV+
Sbjct: 375 FFNTFFYKKLTEA---VSYKGNDRDAYFVKFRRWWKGFDLFCKSYIFIPIHEDLHWSLVI 434

Query: 446 ICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGL-KSLFQSYLCEEWKERYGDG--D 505
           IC P       DK+ ++      I+H+DS+    R L  +  + +L EEW     D   D
Sbjct: 435 ICIP-------DKEDES---GLTIIHLDSLGLHPRNLIFNNVKRFLREEWNYLNQDAPLD 494

Query: 506 FKDISAVFLTLPFI----PLELPQQENSFDCGLFLLHYVELFLEGAPVNFSSLKILKFSN 565
               + V+  LP +     +++PQQ+N FDCGLFLL ++  F+E AP   +    L+   
Sbjct: 495 LPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLT----LQDLK 554

Query: 566 FLSQDWFHPAEASLKRAHILKLIYEI 578
            + + WF P EAS  R  I  ++ ++
Sbjct: 555 MIHKKWFKPEEASALRIKIWNILVDL 563

BLAST of CSPI07G05910 vs. Swiss-Prot
Match: ULP2_SCHPO (Ubiquitin-like-specific protease 2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=ulp2 PE=1 SV=2)

HSP 1 Score: 139.4 bits (350), Expect = 1.7e-31
Identity = 99/310 (31.94%), Postives = 148/310 (47.74%), Query Frame = 1

Query: 328 VIYPMGDPDAVTISKRDLELLKPGMFINDTIIDFYVKYLKNKFLSEKN---NRFYFFNSF 387
           ++YP    +++ I+  DL  L  G F+NDTI+DFY++YL  K  ++     N  + FN+F
Sbjct: 337 LVYPFSGTNSIAITNTDLTRLNEGEFLNDTIVDFYLRYLYCKLQTQNPSLANDTHIFNTF 396

Query: 388 FFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQKDYLFIPVNYSLHWSLVVICHPGE 447
           F+ +L   DKD     G R   + V KWT+KV+LF K Y+ +P+N + HW L +IC+   
Sbjct: 397 FYNRLTSKDKD-----GKRLGHRGVRKWTQKVDLFHKKYIIVPINETFHWYLAIICNIDR 456

Query: 448 V--VNLKDKKHDNL-------------------SKVPCILHMDSIKGSHRGLKSLFQSYL 507
           +  V+ K ++ D +                   S  P IL  DS+   H+G  +  + YL
Sbjct: 457 LMPVDTKLEEQDEIVMSSVEQPSASKTRQAELTSNSPAILIFDSLANLHKGALNYLREYL 516

Query: 508 CEEWKERYGDGDFKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEG-----AP 567
            EE  ER      K++      +     ++PQQ N  DCG++ LH+VELFLE      A 
Sbjct: 517 LEEAFER------KNVHLKSTDIRGFHAKVPQQSNFSDCGIYALHFVELFLETPEQVIAN 576

Query: 568 VNFSSLKILKFSNFLSQDW----FHPAEASLKRAHILKLIYEIMVCNQAKELSGSIGKYP 605
               SL+     NF  Q W     +     LK   I +L  E    N+ + LS       
Sbjct: 577 TLDKSLRRTDAKNF-DQQWNLQKINTMRCDLK-GLIRRLSTEWSSNNERQSLSSG----- 628

BLAST of CSPI07G05910 vs. Swiss-Prot
Match: ULP1D_ARATH (Ubiquitin-like-specific protease 1D OS=Arabidopsis thaliana GN=ULP1D PE=1 SV=1)

HSP 1 Score: 132.1 bits (331), Expect = 2.7e-29
Identity = 84/264 (31.82%), Postives = 139/264 (52.65%), Query Frame = 1

Query: 326 EEVIYPM-GDPDAVTISKRDLELLKPGMFINDTIIDFYVKYLKNKFLSEK--NNRFYFFN 385
           E++ YP   DP  V +  +DLE L P  ++   +++FY+++L+ +  S    +   +FFN
Sbjct: 330 EDICYPTRDDPHFVQVCLKDLECLAPREYLTSPVMNFYMRFLQQQISSSNQISADCHFFN 389

Query: 386 SFFFRKLVDLDKDLSSARGGRDAF-QRVHKWTKKVNLFQKDYLFIPVNYSLHWSLVVICH 445
           ++F++KL D    ++     +DAF  R  +W K ++LF+K Y+FIP++  LHWSLV++C 
Sbjct: 390 TYFYKKLSDA---VTYKGNDKDAFFVRFRRWWKGIDLFRKAYIFIPIHEDLHWSLVIVCI 449

Query: 446 PGEVVNLKDKKHDNLSKVPCILHMDSI-KGSHRGLKSLFQSYLCEEWKERYGDGDFKDI- 505
           P       DKK ++      ILH+DS+   S + +    + +L +EW     D    D+ 
Sbjct: 450 P-------DKKDES---GLTILHLDSLGLHSRKSIVENVKRFLKDEWNYLNQDDYSLDLP 509

Query: 506 --SAVFLTLP----FIPLELPQQENSFDCGLFLLHYVELFLEGAPVNFSSLKILKFSNFL 565
               V+  LP       +++PQQ+N FDCG F+L +++ F+E AP         K     
Sbjct: 510 ISEKVWKNLPRRISEAVVQVPQQKNDFDCGPFVLFFIKRFIEEAPQRLKR----KDLGMF 569

Query: 566 SQDWFHPAEASLKRAHILKLIYEI 578
            + WF P EAS  R  I   + E+
Sbjct: 570 DKKWFRPDEASALRIKIRNTLIEL 576

BLAST of CSPI07G05910 vs. TrEMBL
Match: A0A0A0K633_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G065160 PE=4 SV=1)

HSP 1 Score: 1544.6 bits (3998), Expect = 0.0e+00
Identity = 778/802 (97.01%), Postives = 782/802 (97.51%), Query Frame = 1

Query: 1   MTRTSSSKPFPSTRRKERDIGDGGGKRFSVFDFSEEDARVEKVSRRLLGKFSARRSSPVT 60
           MTRTSSSKPFPSTRRKERDIGDGGGKRFSVFDFSEEDARVEKVSRRLLGKFSARRSSPVT
Sbjct: 1   MTRTSSSKPFPSTRRKERDIGDGGGKRFSVFDFSEEDARVEKVSRRLLGKFSARRSSPVT 60

Query: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKSANIDSFSEDVSHELIHIDSEVSIYTLT 120
           KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKSANIDSFSEDVS ELIHIDSE       
Sbjct: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKSANIDSFSEDVSDELIHIDSE------- 120

Query: 121 PGHQISPGPTDLNAEDTCADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVI 180
            GHQISPGPTDLNAEDTCADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVI
Sbjct: 121 -GHQISPGPTDLNAEDTCADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVI 180

Query: 181 YEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLR 240
           YEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNL 
Sbjct: 181 YEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLC 240

Query: 241 LKGKHFTRAENSKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLWSADHDDNDKVNG 300
           LKGKHFTRAENSKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLW+ADHDDNDKVNG
Sbjct: 241 LKGKHFTRAENSKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLWNADHDDNDKVNG 300

Query: 301 EEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIID 360
           EEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIID
Sbjct: 301 EEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIID 360

Query: 361 FYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQ 420
           FYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQ
Sbjct: 361 FYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQ 420

Query: 421 KDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQ 480
           KDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQ
Sbjct: 421 KDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQ 480

Query: 481 SYLCEEWKERYGDGDFKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPVN 540
           SYLCEEWKERYGDGD+KDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPVN
Sbjct: 481 SYLCEEWKERYGDGDYKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPVN 540

Query: 541 FSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSGSIGKYPSSDAND 600
           FSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIM CNQAKELSGSIGKYPSSDAND
Sbjct: 541 FSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIMACNQAKELSGSIGKYPSSDAND 600

Query: 601 SDNDLSKHVSGQAHIFTMTHSDNFSSVGEEVGSVSKVSSDTNYQRIGRWESVMPPIEEDE 660
           SDNDLSKHVSGQAHIFTMTHSDNFSSVG+EVGSVSKVSSDTNYQ IGRWESVMPPIEEDE
Sbjct: 601 SDNDLSKHVSGQAHIFTMTHSDNFSSVGKEVGSVSKVSSDTNYQPIGRWESVMPPIEEDE 660

Query: 661 NGERPDSPQFLEDRPQASAVSECSSAFSFGQQFTELEICWEGRYSKNVKEMCRKPSPRLS 720
           NGER DSPQ LEDRPQAS VSECSSAFSFGQQFTELEICWEGRYSKNVKEMCRKPSPRLS
Sbjct: 661 NGERADSPQCLEDRPQASTVSECSSAFSFGQQFTELEICWEGRYSKNVKEMCRKPSPRLS 720

Query: 721 LHELQTPLELGQPEILTSSSDELINCVVEDSEEEGNERNERIEIQVSSSSSSSSSRNNLF 780
           LHELQTPLELGQPEILTSSSDELINCVVEDSEEEGNERNERIEIQV SSSSSSSSRNNLF
Sbjct: 721 LHELQTPLELGQPEILTSSSDELINCVVEDSEEEGNERNERIEIQV-SSSSSSSSRNNLF 780

Query: 781 LSRQVVESPANFSDNNRQHEHK 803
           LSRQVVESPA FS   RQH+HK
Sbjct: 781 LSRQVVESPAKFS---RQHQHK 790

BLAST of CSPI07G05910 vs. TrEMBL
Match: B9S9I8_RICCO (Sentrin/sumo-specific protease, putative OS=Ricinus communis GN=RCOM_0886160 PE=4 SV=1)

HSP 1 Score: 535.0 bits (1377), Expect = 1.5e-148
Identity = 301/643 (46.81%), Postives = 392/643 (60.96%), Query Frame = 1

Query: 26  KRFSVFDFSEEDARVEKVSRRLLGKFSARR--------------------SSPVTKHQFL 85
           KR SVFDFSE+D R+E  S++L+ +F  R                     SS + K++FL
Sbjct: 14  KRLSVFDFSEDDGRIETASKKLINRFRNRNDDNNNNNKNNYVKRKRHSFFSSSIDKYKFL 73

Query: 86  HCFG---KGAKSVSRN----LSDELIDIDAE------------------VGKSANIDSFS 145
            CF    K  +S SRN    + DE ID+D +                     +A+    +
Sbjct: 74  ECFAGWNKAPESESRNEPIDVDDEPIDVDTDRGMTADCEEIGVGLVDIDANSAAHCHKLT 133

Query: 146 EDVSHELIHIDSEVS------IYTLTPGHQISPGPTDLNAEDTCADTPLEGGGSMKQEIL 205
                 +I  DS V       ++ L+   +    P  + ++D   D       S    +L
Sbjct: 134 VSSPISMIQEDSAVKEISGLDVHVLSSSSKYENVPRGMISDD--GDKSGMSSSSTSICML 193

Query: 206 ETNDILLSRSSTNE----------DDVTVIFPDFVIYEGNWCTTSKLIFSCSCIKFRGSA 265
           E N++  +   T            ++  V+FPDF++Y   +CT S L FS S I+  G  
Sbjct: 194 EENEVPSTEPETEYCSLGHKIDILNNAVVVFPDFILYGDIYCTESCLTFSSSHIRVEGLT 253

Query: 266 LSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLRLKGKHFTRAENSKDISGIELLKFS 325
           ++G + +F++EWAI+DI+ IESEWC RVETA++ L LK        NS + SGI+ LK S
Sbjct: 254 INGSKGSFNAEWAIADIVSIESEWCGRVETAMIKLHLKPNVSESVGNSNESSGIDELKVS 313

Query: 326 VCDPLWSESEKAIRTLNLRYNDLWS----ADHDDNDKVNGEEIVSWRHSDVFSPKNCFSE 385
           V DP WSE ++AI++L++RY D+W+    +D + +DK   E         V  PK     
Sbjct: 314 VYDPCWSEGQEAIKSLDVRYRDIWNVIIDSDQEKDDKAFAESY------SVAFPKPFLHV 373

Query: 386 FVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIIDFYVKYLKNKFLSEKNNRFYF 445
             +TFE+VIYP GDPDAV+ISKRD+ELL+P  FINDTIIDFY+K+LKNK   E  +R++F
Sbjct: 374 LDETFEDVIYPEGDPDAVSISKRDVELLRPETFINDTIIDFYIKFLKNKIQPEDQHRYHF 433

Query: 446 FNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQKDYLFIPVNYSLHWSLVVIC 505
           FNSFFFRKL DLDKD S A  GR AFQRV KWTKKVNLF+KD++FIPVNYSLHWSL+VIC
Sbjct: 434 FNSFFFRKLADLDKDPSGACEGRAAFQRVRKWTKKVNLFEKDFIFIPVNYSLHWSLIVIC 493

Query: 506 HPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQSYLCEEWKERYGDGDFKDIS 565
           HPGEV + +D++ +   KVPCILHMDSI+GSHRGLK+L QSYLCEEWKER+ +    D S
Sbjct: 494 HPGEVAHFRDEECEIAPKVPCILHMDSIRGSHRGLKNLIQSYLCEEWKERHSE-ILDDAS 553

Query: 566 AVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPVNFSSLKILKFSNFLSQDWFHP 603
           + F  L F+PLELPQQENSFDCGLFLLHYVELFLEG P+NFS  KI + SNFL+++WF P
Sbjct: 554 SKFSCLRFVPLELPQQENSFDCGLFLLHYVELFLEGVPINFSPFKITESSNFLNRNWFPP 613

BLAST of CSPI07G05910 vs. TrEMBL
Match: A0A067KDN9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18741 PE=4 SV=1)

HSP 1 Score: 516.9 bits (1330), Expect = 4.3e-143
Identity = 275/546 (50.37%), Postives = 365/546 (66.85%), Query Frame = 1

Query: 69  GKGAKSVSRNLSDELIDIDA-EVGKS------ANIDSFSEDVS-HELIHIDSEVSIYTLT 128
           G+G K++S  + +ELIDIDA ++G S        +    ED    E+  +D+ V   +  
Sbjct: 118 GRGMKTLSEEVVNELIDIDANDLGNSHKPSISLPMCMVQEDGDVKEICCLDAHVPSSSSN 177

Query: 129 PGHQISPGPTDLNAEDTCADTPLEGGGSMKQEILETNDILLSRSSTNE-----DDVTVIF 188
               +     D + +   + + +      ++E+  + D++    S        ++  VIF
Sbjct: 178 DEKTVDMILDDDDEKSEMSSSSISVSTLGEKEV-PSKDLVPECCSVGHKIDILNNAVVIF 237

Query: 189 PDFVIYEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETA 248
           PDF++Y+  +CT S+L FS SCI   GS ++G + +F ++WAI DI+ IESEW  RVETA
Sbjct: 238 PDFILYDDIYCTESRLTFSSSCISVEGSTVNGAKGSFKAKWAIGDIMSIESEWWGRVETA 297

Query: 249 IVNLRLKGKHFTRAENSKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLWSADHDDN 308
           ++NL LK K    +  + +I GI+ LKFSV DP W E ++AI++L+ +Y D+W+   D +
Sbjct: 298 MLNLSLKSKVSKGSGTANEIPGIDKLKFSVYDPDWFEGQEAIKSLDSKYRDIWNVIFDTD 357

Query: 309 DKVNGEEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFIN 368
            + +G+  +  ++  +  PK       +TFEEV+YP GDPDAV+ISKRD+ELL+P  FIN
Sbjct: 358 QETDGDAFLGSKNMSI--PKPHLYILDETFEEVVYPRGDPDAVSISKRDMELLRPETFIN 417

Query: 369 DTIIDFYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKK 428
           DTIIDFY+KYLKNK   E  +RF+FFNSFFFRKL D DKD  SA  GR AFQRV KWTKK
Sbjct: 418 DTIIDFYIKYLKNKIQPEDQHRFHFFNSFFFRKLADFDKDPRSACEGRAAFQRVRKWTKK 477

Query: 429 VNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGL 488
           VNLF+KDY+FIPVNYSLHWSL+V+CHPG+V    D + +   +VPCILHMDSIKGSHRGL
Sbjct: 478 VNLFEKDYIFIPVNYSLHWSLIVVCHPGDVACFIDDESERALRVPCILHMDSIKGSHRGL 537

Query: 489 KSLFQSYLCEEWKERYGDGDFKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLE 548
           K+L QSYLCEEWKER+ D    D+S  F  L F+PLE+PQQ NSFDCGLFLLHYVELFLE
Sbjct: 538 KNLIQSYLCEEWKERHNDTP-DDVSLKFSRLRFVPLEMPQQANSFDCGLFLLHYVELFLE 597

Query: 549 GAPVNFSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIM-----VCNQAKELSGSI 597
            AP+NFS  KI +FSNFL+++WF PAEASLKRAHI KLI EI+      C+Q +    S 
Sbjct: 598 EAPINFSPFKITEFSNFLNRNWFLPAEASLKRAHIQKLICEILEEQTQKCSQVE----ST 655

BLAST of CSPI07G05910 vs. TrEMBL
Match: A0A0D2V6L3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_012G143900 PE=4 SV=1)

HSP 1 Score: 509.6 bits (1311), Expect = 6.9e-141
Identity = 268/520 (51.54%), Postives = 347/520 (66.73%), Query Frame = 1

Query: 78  NLSDELIDIDAEVGKSANIDSFSEDVSHELIHIDSEVSIYTLTPGHQISPGPTDLNAEDT 137
           N+ DE+ D+D       ++ SFS +  +E I I S  +            G  ++++   
Sbjct: 156 NMGDEIPDLDN------SLQSFSSNDENEQIDIISNDN------------GLIEMSSSSA 215

Query: 138 CADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVIYEGNWCTTSKLIFSCSC 197
            A + +E G S ++++      +       ED   ++ PDF++Y G +CT  +L FS + 
Sbjct: 216 FASSHVEFGDSPEEQVSANGSDV--HEIEKEDVEIIVSPDFIMYRGMYCTEGQLTFSKTF 275

Query: 198 IKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLRLKGKHFTRAENSKDISG 257
           +KF   +++G +      WA+ DII I++EWC RVETAI+N  L+ K+  RAEN+ +IS 
Sbjct: 276 LKFEDFSVNGTKTKISFIWAVGDIISIDAEWCQRVETAIMNFVLQSKNSKRAENANEISV 335

Query: 258 IELLKFSVCDPLWSESEKAIRTLNLRYNDLWSADHDDNDKVNGEEIVSWRHSDVFSPKNC 317
           IE LKFSV D  WSE + +I++L++RY D+W+   D N     EE    R +  FS K  
Sbjct: 336 IESLKFSVYDTCWSERQDSIKSLSVRYRDVWNTLSDKN-----EENTLMRQNGRFSSKPY 395

Query: 318 FSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIIDFYVKYLKNKFLSEKNNR 377
           F +F + FEEVIYP GDPDA++ISKRD+ELL P  FINDTIIDFY+KYLKNK   E+ +R
Sbjct: 396 FYDFHEHFEEVIYPKGDPDAISISKRDVELLCPETFINDTIIDFYIKYLKNKIKPEEQHR 455

Query: 378 FYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQKDYLFIPVNYSLHWSLV 437
           F+FF+SFFF KL DLDK LS     + AFQRVHKWT+KV++F+KDY+FIPVNYSLHWSL+
Sbjct: 456 FHFFSSFFFLKLADLDKGLSDECQAKSAFQRVHKWTRKVDIFEKDYIFIPVNYSLHWSLI 515

Query: 438 VICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQSYLCEEWKERYGDGDFK 497
           VICHPGEV  LKD   +NL KVPCILHMDSI+GSHRGLK+LFQSYL EEWK+R+ +    
Sbjct: 516 VICHPGEVAKLKDDATENLLKVPCILHMDSIRGSHRGLKNLFQSYLTEEWKQRHKEA-AD 575

Query: 498 DISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPVNFSSLKILKFSNFLSQDW 557
           D+ + FL L F+PLELPQQENSFDCGLFLLHYVE FL  AP+NFS  K    SNFL+ +W
Sbjct: 576 DVPSKFLNLQFVPLELPQQENSFDCGLFLLHYVERFLLQAPINFSPSKTTGSSNFLNMNW 635

Query: 558 FHPAEASLKRAHILKLIYEIMVCNQAKELS-GSIGKYPSS 597
           F PAEASLKR HI +LIYEI+        S   I KY SS
Sbjct: 636 FPPAEASLKRCHIKRLIYEILEEQSCSSPSVDGIYKYSSS 649

BLAST of CSPI07G05910 vs. TrEMBL
Match: A0A0D2S378_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_012G143900 PE=4 SV=1)

HSP 1 Score: 509.6 bits (1311), Expect = 6.9e-141
Identity = 268/520 (51.54%), Postives = 347/520 (66.73%), Query Frame = 1

Query: 78  NLSDELIDIDAEVGKSANIDSFSEDVSHELIHIDSEVSIYTLTPGHQISPGPTDLNAEDT 137
           N+ DE+ D+D       ++ SFS +  +E I I S  +            G  ++++   
Sbjct: 157 NMGDEIPDLDN------SLQSFSSNDENEQIDIISNDN------------GLIEMSSSSA 216

Query: 138 CADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVIYEGNWCTTSKLIFSCSC 197
            A + +E G S ++++      +       ED   ++ PDF++Y G +CT  +L FS + 
Sbjct: 217 FASSHVEFGDSPEEQVSANGSDV--HEIEKEDVEIIVSPDFIMYRGMYCTEGQLTFSKTF 276

Query: 198 IKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLRLKGKHFTRAENSKDISG 257
           +KF   +++G +      WA+ DII I++EWC RVETAI+N  L+ K+  RAEN+ +IS 
Sbjct: 277 LKFEDFSVNGTKTKISFIWAVGDIISIDAEWCQRVETAIMNFVLQSKNSKRAENANEISV 336

Query: 258 IELLKFSVCDPLWSESEKAIRTLNLRYNDLWSADHDDNDKVNGEEIVSWRHSDVFSPKNC 317
           IE LKFSV D  WSE + +I++L++RY D+W+   D N     EE    R +  FS K  
Sbjct: 337 IESLKFSVYDTCWSERQDSIKSLSVRYRDVWNTLSDKN-----EENTLMRQNGRFSSKPY 396

Query: 318 FSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIIDFYVKYLKNKFLSEKNNR 377
           F +F + FEEVIYP GDPDA++ISKRD+ELL P  FINDTIIDFY+KYLKNK   E+ +R
Sbjct: 397 FYDFHEHFEEVIYPKGDPDAISISKRDVELLCPETFINDTIIDFYIKYLKNKIKPEEQHR 456

Query: 378 FYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQKDYLFIPVNYSLHWSLV 437
           F+FF+SFFF KL DLDK LS     + AFQRVHKWT+KV++F+KDY+FIPVNYSLHWSL+
Sbjct: 457 FHFFSSFFFLKLADLDKGLSDECQAKSAFQRVHKWTRKVDIFEKDYIFIPVNYSLHWSLI 516

Query: 438 VICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQSYLCEEWKERYGDGDFK 497
           VICHPGEV  LKD   +NL KVPCILHMDSI+GSHRGLK+LFQSYL EEWK+R+ +    
Sbjct: 517 VICHPGEVAKLKDDATENLLKVPCILHMDSIRGSHRGLKNLFQSYLTEEWKQRHKEA-AD 576

Query: 498 DISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPVNFSSLKILKFSNFLSQDW 557
           D+ + FL L F+PLELPQQENSFDCGLFLLHYVE FL  AP+NFS  K    SNFL+ +W
Sbjct: 577 DVPSKFLNLQFVPLELPQQENSFDCGLFLLHYVERFLLQAPINFSPSKTTGSSNFLNMNW 636

Query: 558 FHPAEASLKRAHILKLIYEIMVCNQAKELS-GSIGKYPSS 597
           F PAEASLKR HI +LIYEI+        S   I KY SS
Sbjct: 637 FPPAEASLKRCHIKRLIYEILEEQSCSSPSVDGIYKYSSS 650

BLAST of CSPI07G05910 vs. TAIR10
Match: AT4G33620.1 (AT4G33620.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 457.2 bits (1175), Expect = 2.1e-128
Identity = 250/573 (43.63%), Postives = 351/573 (61.26%), Query Frame = 1

Query: 26  KRFSVFDFSEEDARVEKVSRRLLGKFSA----RRSSPVTKHQFLHCFGKGAKSVSRNLSD 85
           K   VFD+S+ED RVE+ S++LL KF +    +    + K++FL CF K  +S S+ L  
Sbjct: 13  KPIDVFDYSDEDDRVEEESKKLLRKFDSPVTKKHHCAIDKYEFLRCFAKDTQSESKVLQH 72

Query: 86  ELIDIDAEVGKSANIDSFSEDVSHELIHIDSEVSIYTLTPGHQISPGPTDLNAEDTCADT 145
            +ID++  V +  +    S D + +LI + S  S   +                      
Sbjct: 73  IVIDVEVPVKEEPSRCELSGDGNSDLIDVISNGSHRRI---------------------- 132

Query: 146 PLEGGGSMKQEILETNDILLSRSSTN----------EDDVTVIFPDFVIYEGNWCTTSKL 205
              G  S+    L  ND + +  +TN          E+   +I PD +IY   +CT SKL
Sbjct: 133 ---GIDSLTSSSLSENDEVSTGEATNPASDPHEVDPENAQVLIIPDVIIYGDIYCTNSKL 192

Query: 206 IFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLRLKGKHFTRAEN 265
            FS +C+    S+++  + TF  +W I DII IES+WC  VETA VN+ LK +     + 
Sbjct: 193 TFSRNCMNVESSSVNATKGTFSCQWTIEDIIKIESQWCLEVETAFVNVLLKSRKPEGVDI 252

Query: 266 SKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLWSADHDDNDKVNGEEIVSWRHSDV 325
           +KDISGI+LLKFSV DP WS+  + IR+L+ RY ++W       D +   E +++   D+
Sbjct: 253 AKDISGIDLLKFSVYDPKWSKEVETIRSLDSRYKNIWF------DTITESEEIAFSGHDL 312

Query: 326 FSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIIDFYVKYLKNKFL 385
            +     +   D+FE+++YP G+PDAV + K+D+ELLKP  FINDTIIDFY+KYLKN+  
Sbjct: 313 GTS---LTNLADSFEDLVYPQGEPDAVVVRKQDIELLKPRRFINDTIIDFYIKYLKNRIS 372

Query: 386 SEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQKDYLFIPVNYS 445
            ++  RF+FFN FFFRKL +LDK   S  GGR+A+QRV KWTK V+LF+KDY+FIP+N S
Sbjct: 373 PKERGRFHFFNCFFFRKLANLDKGTPSTCGGREAYQRVQKWTKNVDLFEKDYIFIPINCS 432

Query: 446 LHWSLVVICHPGEVV------NLKDKKHDNLSKVPCILHMDSIKGSHR-GLKSLFQSYLC 505
            HWSLV+ICHPGE+V      +  D + +N  +VPCILH+DSIKGSH+ GL ++F SYL 
Sbjct: 433 FHWSLVIICHPGELVPSHVNFHSFDDEVENPQRVPCILHLDSIKGSHKGGLINIFPSYLR 492

Query: 506 EEWKERYGDGDFKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPVNFSSL 565
           EEWK R+ +    D S     +  I LELPQQENSFDCGLFLLHY++LF+  AP  F+  
Sbjct: 493 EEWKARH-ENTTNDSSRA-PNMQSISLELPQQENSFDCGLFLLHYLDLFVAQAPAKFNPS 549

Query: 566 KILKFSNFLSQDWFHPAEASLKRAHILKLIYEI 578
            I + +NFL+++WF   EASLKR +IL+L+Y +
Sbjct: 553 LISRSANFLTRNWFPAKEASLKRRNILELLYNL 549

BLAST of CSPI07G05910 vs. TAIR10
Match: AT1G09730.1 (AT1G09730.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 421.8 bits (1083), Expect = 9.6e-118
Identity = 211/415 (50.84%), Postives = 285/415 (68.67%), Query Frame = 1

Query: 173 VIFPDFVIYEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRV 232
           ++  ++VI +   C  S +IFSC+ IK +    +  +  F  E+ + DI+ I+  W   V
Sbjct: 275 IMTSEYVILKDMHCAASLVIFSCNGIKIKSFLANNEEVPFSCEFGVEDIVSIQYNWYQNV 334

Query: 233 ETAIVNLRLKGKHFTRAENSKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLWSADH 292
              I+ +R+      + EN  +   +E LK +V +  W   ++ I +L+++Y  +W+ D 
Sbjct: 335 GLIILRIRV----LLKDENCHE--DMEELKIAVKEHNWPNKQQKINSLHVKYPAVWNTDL 394

Query: 293 DDNDKVNGEEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGM 352
           +D+ +V+G  +           K  F  F + FE+V+YP GDPDAV+I KRD+ELL+P  
Sbjct: 395 EDDVEVSGYNLNQ--------QKRYFPSFDEPFEDVVYPKGDPDAVSICKRDVELLQPET 454

Query: 353 FINDTIIDFYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKW 412
           F+NDTIIDFY+ YLKN+  +E+ +RF+FFNSFFFRKL DLDKD SS   G+ AF RV KW
Sbjct: 455 FVNDTIIDFYINYLKNQIQTEEKHRFHFFNSFFFRKLADLDKDPSSIADGKAAFLRVRKW 514

Query: 413 TKKVNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSH 472
           T+KV++F KDY+F+PVNY+LHWSL+VICHPGEV N  D   D+  KVPCILHMDSIKGSH
Sbjct: 515 TRKVDMFGKDYIFVPVNYNLHWSLIVICHPGEVANRTDLDLDDSKKVPCILHMDSIKGSH 574

Query: 473 RGLKSLFQSYLCEEWKERYGDGDFKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVEL 532
            GLK+L Q+YLCEEWKER+ +    DIS+ F+ L F+ LELPQQENSFDCGLFLLHY+EL
Sbjct: 575 AGLKNLVQTYLCEEWKERHKETS-DDISSRFMNLRFVSLELPQQENSFDCGLFLLHYLEL 634

Query: 533 FLEGAPVNFSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELS 588
           FL  AP+NFS  KI   SNFL  +WF PAEASLKR  I KLI+E++  N+++E+S
Sbjct: 635 FLAEAPLNFSPFKIYNASNFLYLNWFPPAEASLKRTLIQKLIFELLE-NRSREVS 673

BLAST of CSPI07G05910 vs. TAIR10
Match: AT1G10570.1 (AT1G10570.1 Cysteine proteinases superfamily protein)

HSP 1 Score: 140.6 bits (353), Expect = 4.3e-33
Identity = 89/266 (33.46%), Postives = 143/266 (53.76%), Query Frame = 1

Query: 326 EEVIYPMGDP----DAVTISKRDLELLKPGMFINDTIIDFYVKYLKNKFLSEKNN--RFY 385
           E++ YP  D     D V +S +DL+ L PG ++   +I+FY++Y+++   S        +
Sbjct: 315 EDIYYPSSDQSDGRDLVQVSLKDLKCLSPGEYLTSPVINFYIRYVQHHVFSADKTAANCH 374

Query: 386 FFNSFFFRKLVDLDKDLSSARGGRDA-FQRVHKWTKKVNLFQKDYLFIPVNYSLHWSLVV 445
           FFN+FF++KL +    +S     RDA F +  +W K  +LF K Y+FIP++  LHWSLV+
Sbjct: 375 FFNTFFYKKLTEA---VSYKGNDRDAYFVKFRRWWKGFDLFCKSYIFIPIHEDLHWSLVI 434

Query: 446 ICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGL-KSLFQSYLCEEWKERYGDG--D 505
           IC P       DK+ ++      I+H+DS+    R L  +  + +L EEW     D   D
Sbjct: 435 ICIP-------DKEDES---GLTIIHLDSLGLHPRNLIFNNVKRFLREEWNYLNQDAPLD 494

Query: 506 FKDISAVFLTLPFI----PLELPQQENSFDCGLFLLHYVELFLEGAPVNFSSLKILKFSN 565
               + V+  LP +     +++PQQ+N FDCGLFLL ++  F+E AP   +    L+   
Sbjct: 495 LPISAKVWRDLPNMINEAEVQVPQQKNDFDCGLFLLFFIRRFIEEAPQRLT----LQDLK 554

Query: 566 FLSQDWFHPAEASLKRAHILKLIYEI 578
            + + WF P EAS  R  I  ++ ++
Sbjct: 555 MIHKKWFKPEEASALRIKIWNILVDL 563

BLAST of CSPI07G05910 vs. TAIR10
Match: AT1G60220.1 (AT1G60220.1 UB-like protease 1D)

HSP 1 Score: 132.1 bits (331), Expect = 1.5e-30
Identity = 84/264 (31.82%), Postives = 139/264 (52.65%), Query Frame = 1

Query: 326 EEVIYPM-GDPDAVTISKRDLELLKPGMFINDTIIDFYVKYLKNKFLSEK--NNRFYFFN 385
           E++ YP   DP  V +  +DLE L P  ++   +++FY+++L+ +  S    +   +FFN
Sbjct: 330 EDICYPTRDDPHFVQVCLKDLECLAPREYLTSPVMNFYMRFLQQQISSSNQISADCHFFN 389

Query: 386 SFFFRKLVDLDKDLSSARGGRDAF-QRVHKWTKKVNLFQKDYLFIPVNYSLHWSLVVICH 445
           ++F++KL D    ++     +DAF  R  +W K ++LF+K Y+FIP++  LHWSLV++C 
Sbjct: 390 TYFYKKLSDA---VTYKGNDKDAFFVRFRRWWKGIDLFRKAYIFIPIHEDLHWSLVIVCI 449

Query: 446 PGEVVNLKDKKHDNLSKVPCILHMDSI-KGSHRGLKSLFQSYLCEEWKERYGDGDFKDI- 505
           P       DKK ++      ILH+DS+   S + +    + +L +EW     D    D+ 
Sbjct: 450 P-------DKKDES---GLTILHLDSLGLHSRKSIVENVKRFLKDEWNYLNQDDYSLDLP 509

Query: 506 --SAVFLTLP----FIPLELPQQENSFDCGLFLLHYVELFLEGAPVNFSSLKILKFSNFL 565
               V+  LP       +++PQQ+N FDCG F+L +++ F+E AP         K     
Sbjct: 510 ISEKVWKNLPRRISEAVVQVPQQKNDFDCGPFVLFFIKRFIEEAPQRLKR----KDLGMF 569

Query: 566 SQDWFHPAEASLKRAHILKLIYEI 578
            + WF P EAS  R  I   + E+
Sbjct: 570 DKKWFRPDEASALRIKIRNTLIEL 576

BLAST of CSPI07G05910 vs. TAIR10
Match: AT3G06910.1 (AT3G06910.1 UB-like protease 1A)

HSP 1 Score: 94.7 bits (234), Expect = 2.7e-19
Identity = 73/271 (26.94%), Postives = 123/271 (45.39%), Query Frame = 1

Query: 278 RTLNLRYNDLWSADHDDNDKVNGEEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDA 337
           R L    +  W  D +  + V  E  V     +  + +  FS      +  I        
Sbjct: 244 RALLRSLSSFWRQDEEPVEVVQREAFVPLSREEETAVRRAFS----ANDSNILVTHKNSN 303

Query: 338 VTISKRDLELLKPGMFINDTIIDFYVKYLKNKFLSEKNN--RFYFFNSFFFRKLVDLDKD 397
           + I+ + L  LKPG ++ND +I+ Y+  LK +   E     + +FFN+FFF KLV+    
Sbjct: 304 IDITGKILRCLKPGKWLNDEVINLYMVLLKEREAREPKKFLKCHFFNTFFFTKLVN---- 363

Query: 398 LSSARGGRDAFQRVHKWT--KKVNLFQKDY--LFIPVNYSLHWSLVVICHPGEVVNLKDK 457
             SA G    +  V +WT  K++    KD   +FIP++ ++HW+L VI       N+KD+
Sbjct: 364 --SATGYN--YGAVRRWTSMKRLGYHLKDCDKIFIPIHMNIHWTLAVI-------NIKDQ 423

Query: 458 KHDNLSKVPCILHMDSIKGSHRGLKSLFQSYLCEEWKERYGDGDFKDISAVFLTLPFIPL 517
           K           ++DS KG    +      Y  +E +++       D+        F+  
Sbjct: 424 KFQ---------YLDSFKGREPKILDALARYFVDEVRDK----SEVDLDVSRWRQEFVQ- 481

Query: 518 ELPQQENSFDCGLFLLHYVELFLEGAPVNFS 543
           +LP Q N FDCG+F++ Y++ +  G  + F+
Sbjct: 484 DLPMQRNGFDCGMFMVKYIDFYSRGLDLCFT 481

BLAST of CSPI07G05910 vs. NCBI nr
Match: gi|778724518|ref|XP_011658819.1| (PREDICTED: probable ubiquitin-like-specific protease 2A [Cucumis sativus])

HSP 1 Score: 1544.6 bits (3998), Expect = 0.0e+00
Identity = 778/802 (97.01%), Postives = 782/802 (97.51%), Query Frame = 1

Query: 1   MTRTSSSKPFPSTRRKERDIGDGGGKRFSVFDFSEEDARVEKVSRRLLGKFSARRSSPVT 60
           MTRTSSSKPFPSTRRKERDIGDGGGKRFSVFDFSEEDARVEKVSRRLLGKFSARRSSPVT
Sbjct: 1   MTRTSSSKPFPSTRRKERDIGDGGGKRFSVFDFSEEDARVEKVSRRLLGKFSARRSSPVT 60

Query: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKSANIDSFSEDVSHELIHIDSEVSIYTLT 120
           KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKSANIDSFSEDVS ELIHIDSE       
Sbjct: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKSANIDSFSEDVSDELIHIDSE------- 120

Query: 121 PGHQISPGPTDLNAEDTCADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVI 180
            GHQISPGPTDLNAEDTCADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVI
Sbjct: 121 -GHQISPGPTDLNAEDTCADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVI 180

Query: 181 YEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLR 240
           YEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNL 
Sbjct: 181 YEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLC 240

Query: 241 LKGKHFTRAENSKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLWSADHDDNDKVNG 300
           LKGKHFTRAENSKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLW+ADHDDNDKVNG
Sbjct: 241 LKGKHFTRAENSKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLWNADHDDNDKVNG 300

Query: 301 EEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIID 360
           EEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIID
Sbjct: 301 EEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIID 360

Query: 361 FYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQ 420
           FYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQ
Sbjct: 361 FYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQ 420

Query: 421 KDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQ 480
           KDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQ
Sbjct: 421 KDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQ 480

Query: 481 SYLCEEWKERYGDGDFKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPVN 540
           SYLCEEWKERYGDGD+KDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPVN
Sbjct: 481 SYLCEEWKERYGDGDYKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPVN 540

Query: 541 FSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSGSIGKYPSSDAND 600
           FSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIM CNQAKELSGSIGKYPSSDAND
Sbjct: 541 FSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIMACNQAKELSGSIGKYPSSDAND 600

Query: 601 SDNDLSKHVSGQAHIFTMTHSDNFSSVGEEVGSVSKVSSDTNYQRIGRWESVMPPIEEDE 660
           SDNDLSKHVSGQAHIFTMTHSDNFSSVG+EVGSVSKVSSDTNYQ IGRWESVMPPIEEDE
Sbjct: 601 SDNDLSKHVSGQAHIFTMTHSDNFSSVGKEVGSVSKVSSDTNYQPIGRWESVMPPIEEDE 660

Query: 661 NGERPDSPQFLEDRPQASAVSECSSAFSFGQQFTELEICWEGRYSKNVKEMCRKPSPRLS 720
           NGER DSPQ LEDRPQAS VSECSSAFSFGQQFTELEICWEGRYSKNVKEMCRKPSPRLS
Sbjct: 661 NGERADSPQCLEDRPQASTVSECSSAFSFGQQFTELEICWEGRYSKNVKEMCRKPSPRLS 720

Query: 721 LHELQTPLELGQPEILTSSSDELINCVVEDSEEEGNERNERIEIQVSSSSSSSSSRNNLF 780
           LHELQTPLELGQPEILTSSSDELINCVVEDSEEEGNERNERIEIQV SSSSSSSSRNNLF
Sbjct: 721 LHELQTPLELGQPEILTSSSDELINCVVEDSEEEGNERNERIEIQV-SSSSSSSSRNNLF 780

Query: 781 LSRQVVESPANFSDNNRQHEHK 803
           LSRQVVESPA FS   RQH+HK
Sbjct: 781 LSRQVVESPAKFS---RQHQHK 790

BLAST of CSPI07G05910 vs. NCBI nr
Match: gi|659110287|ref|XP_008455147.1| (PREDICTED: probable ubiquitin-like-specific protease 2A isoform X4 [Cucumis melo])

HSP 1 Score: 1434.9 bits (3713), Expect = 0.0e+00
Identity = 733/814 (90.05%), Postives = 752/814 (92.38%), Query Frame = 1

Query: 1   MTRTSSSKPFPSTRRKERDIGDGGGKRFSVFDFSEEDARVEKVSRRLLGKFSARRSSPVT 60
           MTRTSSSKPF STRR  R  G+GGGKRFSVFDFSEED RVEKVSR LLGKFSARRSSPVT
Sbjct: 1   MTRTSSSKPFSSTRRNGRGRGEGGGKRFSVFDFSEEDVRVEKVSRSLLGKFSARRSSPVT 60

Query: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKSANIDSFSEDVSHELIHIDSEVSIYTLT 120
           KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGK AN DSFSED+S+ELIHIDSEVSI TL 
Sbjct: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKGANTDSFSEDISYELIHIDSEVSISTLN 120

Query: 121 PGHQISPGPTDLNAEDTCADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVI 180
             HQISPGP DLNAEDTCAD  LEGGGSMKQEILETND+L SRSSTNEDD TVIFPDFVI
Sbjct: 121 TSHQISPGPIDLNAEDTCADGSLEGGGSMKQEILETNDLLRSRSSTNEDDFTVIFPDFVI 180

Query: 181 YEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLR 240
           YEGNWCTTSKLIFSCSCIKF+GSALSGLQRTFDSEWA+SDIIGIESEWCSRVETAIVNLR
Sbjct: 181 YEGNWCTTSKLIFSCSCIKFQGSALSGLQRTFDSEWAVSDIIGIESEWCSRVETAIVNLR 240

Query: 241 LKGKHFTRAENSKDISG-----------IELLKFSVCDPLWSESEKAIRTLNLRYNDLWS 300
           LKGKHFT AENS DISG           IELLKFSVCDPLWSESEKAIRTLN+RYNDLW+
Sbjct: 241 LKGKHFTGAENSNDISGLFYVGIKVFMGIELLKFSVCDPLWSESEKAIRTLNVRYNDLWN 300

Query: 301 ADHDDNDKVNGEEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLK 360
           AD+DDNDKV  EEIVSWRHSDVF PKNCFSEFVDTFEEVIYP GDPDAVTISKRDLELLK
Sbjct: 301 ADYDDNDKVKWEEIVSWRHSDVFFPKNCFSEFVDTFEEVIYPKGDPDAVTISKRDLELLK 360

Query: 361 PGMFINDTIIDFYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRV 420
           PGMFINDTIIDFYVKYLKNKFLSEKN+RFYFFNSFFFRKL DLDKDLSSA GGRDAFQRV
Sbjct: 361 PGMFINDTIIDFYVKYLKNKFLSEKNDRFYFFNSFFFRKLADLDKDLSSACGGRDAFQRV 420

Query: 421 HKWTKKVNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIK 480
           HKWTKKVNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIK
Sbjct: 421 HKWTKKVNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIK 480

Query: 481 GSHRGLKSLFQSYLCEEWKERYGDG-DFKDISAVFLTLPFIPLELPQQENSFDCGLFLLH 540
           GSHRGLKSLFQSYLCEEWKERYGDG D +DISAVFLTLPFIPLELPQQENSFDCGLFLLH
Sbjct: 481 GSHRGLKSLFQSYLCEEWKERYGDGDDDEDISAVFLTLPFIPLELPQQENSFDCGLFLLH 540

Query: 541 YVELFLEGAPVNFSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSG 600
           YVELFLEGAPVNFS LKILK SNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSG
Sbjct: 541 YVELFLEGAPVNFSPLKILKLSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSG 600

Query: 601 SIGKYPSSDANDSDNDLSKHVSGQAHIFTMTHSDNFSSVGEEVGSVSKVSSDTNYQRIGR 660
           S+GKYPSSDANDSDNDLSKHVSG+A IFTMTHSDNFSSVG+E GSVSKVSSDTNYQRIG 
Sbjct: 601 SVGKYPSSDANDSDNDLSKHVSGEADIFTMTHSDNFSSVGKEFGSVSKVSSDTNYQRIGG 660

Query: 661 WESVMPPIEEDENGERPDSPQFLEDRPQASAVSECSSAFSFGQQFTELEICWEGRYSKNV 720
            ESVMPPIEEDENGE  DSPQ L+DR QASAV E SSAFSFGQQFTELEI WEGRYS+NV
Sbjct: 661 RESVMPPIEEDENGETADSPQCLDDRLQASAVFEFSSAFSFGQQFTELEISWEGRYSQNV 720

Query: 721 KE-MCRKPSPRLSLHELQTPLELGQPEILTSSSDELINCVVEDSEEEGNERNERIEIQVS 780
           KE MCRKPSPR SLHELQT L+LGQ EILTSSSDELINCVVEDSEEEGNERN+RIEI+V 
Sbjct: 721 KEDMCRKPSPRPSLHELQTTLKLGQLEILTSSSDELINCVVEDSEEEGNERNDRIEIEV- 780

Query: 781 SSSSSSSSRNNLFLSRQVVESPANFSDNNRQHEH 802
               SSSSRNNLFLSRQVVES ANFSDNNRQHEH
Sbjct: 781 ----SSSSRNNLFLSRQVVESTANFSDNNRQHEH 809

BLAST of CSPI07G05910 vs. NCBI nr
Match: gi|659110285|ref|XP_008455146.1| (PREDICTED: probable ubiquitin-like-specific protease 2A isoform X3 [Cucumis melo])

HSP 1 Score: 1432.5 bits (3707), Expect = 0.0e+00
Identity = 733/822 (89.17%), Postives = 752/822 (91.48%), Query Frame = 1

Query: 1   MTRTSSSKPFPSTRRKERDIGDGGGKRFSVFDFSEEDARVEKVSRRLLGKFSARRSSPVT 60
           MTRTSSSKPF STRR  R  G+GGGKRFSVFDFSEED RVEKVSR LLGKFSARRSSPVT
Sbjct: 1   MTRTSSSKPFSSTRRNGRGRGEGGGKRFSVFDFSEEDVRVEKVSRSLLGKFSARRSSPVT 60

Query: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKSANIDSFSEDVSHELIHIDSEVSIYTLT 120
           KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGK AN DSFSED+S+ELIHIDSEVSI TL 
Sbjct: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKGANTDSFSEDISYELIHIDSEVSISTLN 120

Query: 121 PGHQISPGPTDLNAEDTCADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVI 180
             HQISPGP DLNAEDTCAD  LEGGGSMKQEILETND+L SRSSTNEDD TVIFPDFVI
Sbjct: 121 TSHQISPGPIDLNAEDTCADGSLEGGGSMKQEILETNDLLRSRSSTNEDDFTVIFPDFVI 180

Query: 181 YEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLR 240
           YEGNWCTTSKLIFSCSCIKF+GSALSGLQRTFDSEWA+SDIIGIESEWCSRVETAIVNLR
Sbjct: 181 YEGNWCTTSKLIFSCSCIKFQGSALSGLQRTFDSEWAVSDIIGIESEWCSRVETAIVNLR 240

Query: 241 LKGKHFTRAENSKDISGIELLKFSVCDPLWSESEKAIRTLNLRYNDLWSADHDDNDKVNG 300
           LKGKHFT AENS DISGIELLKFSVCDPLWSESEKAIRTLN+RYNDLW+AD+DDNDKV  
Sbjct: 241 LKGKHFTGAENSNDISGIELLKFSVCDPLWSESEKAIRTLNVRYNDLWNADYDDNDKVKW 300

Query: 301 EEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLKPGMFINDTIID 360
           EEIVSWRHSDVF PKNCFSEFVDTFEEVIYP GDPDAVTISKRDLELLKPGMFINDTIID
Sbjct: 301 EEIVSWRHSDVFFPKNCFSEFVDTFEEVIYPKGDPDAVTISKRDLELLKPGMFINDTIID 360

Query: 361 FYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRVHKWTKKVNLFQ 420
           FYVKYLKNKFLSEKN+RFYFFNSFFFRKL DLDKDLSSA GGRDAFQRVHKWTKKVNLFQ
Sbjct: 361 FYVKYLKNKFLSEKNDRFYFFNSFFFRKLADLDKDLSSACGGRDAFQRVHKWTKKVNLFQ 420

Query: 421 KDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQ 480
           KDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQ
Sbjct: 421 KDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIKGSHRGLKSLFQ 480

Query: 481 SYLCEEWKERYGDG-DFKDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPV 540
           SYLCEEWKERYGDG D +DISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPV
Sbjct: 481 SYLCEEWKERYGDGDDDEDISAVFLTLPFIPLELPQQENSFDCGLFLLHYVELFLEGAPV 540

Query: 541 NFSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSGSIGKYPSSDAN 600
           NFS LKILK SNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSGS+GKYPSSDAN
Sbjct: 541 NFSPLKILKLSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSGSVGKYPSSDAN 600

Query: 601 DSDNDLSKHVSGQAHIFTMTHSDNFSSVGEEVGSVSKVSSDTNYQRIGRWESVMPPIEED 660
           DSDNDLSKHVSG+A IFTMTHSDNFSSVG+E GSVSKVSSDTNYQRIG  ESVMPPIEED
Sbjct: 601 DSDNDLSKHVSGEADIFTMTHSDNFSSVGKEFGSVSKVSSDTNYQRIGGRESVMPPIEED 660

Query: 661 ENGERPDSPQFLEDRPQASAVSECSSAFSFGQQFTELEICWEGRYSKNVKE-MCRKPSPR 720
           ENGE  DSPQ L+DR QASAV E SSAFSFGQQFTELEI WEGRYS+NVKE MCRKPSPR
Sbjct: 661 ENGETADSPQCLDDRLQASAVFEFSSAFSFGQQFTELEISWEGRYSQNVKEDMCRKPSPR 720

Query: 721 LSLHELQTPLELGQP-------------------EILTSSSDELINCVVEDSEEEGNERN 780
            SLHELQT L+LGQ                    EILTSSSDELINCVVEDSEEEGNERN
Sbjct: 721 PSLHELQTTLKLGQDSTPQATKNPNHPTEADNQLEILTSSSDELINCVVEDSEEEGNERN 780

Query: 781 ERIEIQVSSSSSSSSSRNNLFLSRQVVESPANFSDNNRQHEH 802
           +RIEI+V     SSSSRNNLFLSRQVVES ANFSDNNRQHEH
Sbjct: 781 DRIEIEV-----SSSSRNNLFLSRQVVESTANFSDNNRQHEH 817

BLAST of CSPI07G05910 vs. NCBI nr
Match: gi|659110281|ref|XP_008455144.1| (PREDICTED: probable ubiquitin-like-specific protease 2A isoform X1 [Cucumis melo])

HSP 1 Score: 1424.1 bits (3685), Expect = 0.0e+00
Identity = 733/833 (88.00%), Postives = 752/833 (90.28%), Query Frame = 1

Query: 1   MTRTSSSKPFPSTRRKERDIGDGGGKRFSVFDFSEEDARVEKVSRRLLGKFSARRSSPVT 60
           MTRTSSSKPF STRR  R  G+GGGKRFSVFDFSEED RVEKVSR LLGKFSARRSSPVT
Sbjct: 1   MTRTSSSKPFSSTRRNGRGRGEGGGKRFSVFDFSEEDVRVEKVSRSLLGKFSARRSSPVT 60

Query: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKSANIDSFSEDVSHELIHIDSEVSIYTLT 120
           KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGK AN DSFSED+S+ELIHIDSEVSI TL 
Sbjct: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKGANTDSFSEDISYELIHIDSEVSISTLN 120

Query: 121 PGHQISPGPTDLNAEDTCADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVI 180
             HQISPGP DLNAEDTCAD  LEGGGSMKQEILETND+L SRSSTNEDD TVIFPDFVI
Sbjct: 121 TSHQISPGPIDLNAEDTCADGSLEGGGSMKQEILETNDLLRSRSSTNEDDFTVIFPDFVI 180

Query: 181 YEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLR 240
           YEGNWCTTSKLIFSCSCIKF+GSALSGLQRTFDSEWA+SDIIGIESEWCSRVETAIVNLR
Sbjct: 181 YEGNWCTTSKLIFSCSCIKFQGSALSGLQRTFDSEWAVSDIIGIESEWCSRVETAIVNLR 240

Query: 241 LKGKHFTRAENSKDISG-----------IELLKFSVCDPLWSESEKAIRTLNLRYNDLWS 300
           LKGKHFT AENS DISG           IELLKFSVCDPLWSESEKAIRTLN+RYNDLW+
Sbjct: 241 LKGKHFTGAENSNDISGLFYVGIKVFMGIELLKFSVCDPLWSESEKAIRTLNVRYNDLWN 300

Query: 301 ADHDDNDKVNGEEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLK 360
           AD+DDNDKV  EEIVSWRHSDVF PKNCFSEFVDTFEEVIYP GDPDAVTISKRDLELLK
Sbjct: 301 ADYDDNDKVKWEEIVSWRHSDVFFPKNCFSEFVDTFEEVIYPKGDPDAVTISKRDLELLK 360

Query: 361 PGMFINDTIIDFYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRV 420
           PGMFINDTIIDFYVKYLKNKFLSEKN+RFYFFNSFFFRKL DLDKDLSSA GGRDAFQRV
Sbjct: 361 PGMFINDTIIDFYVKYLKNKFLSEKNDRFYFFNSFFFRKLADLDKDLSSACGGRDAFQRV 420

Query: 421 HKWTKKVNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIK 480
           HKWTKKVNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIK
Sbjct: 421 HKWTKKVNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIK 480

Query: 481 GSHRGLKSLFQSYLCEEWKERYGDG-DFKDISAVFLTLPFIPLELPQQENSFDCGLFLLH 540
           GSHRGLKSLFQSYLCEEWKERYGDG D +DISAVFLTLPFIPLELPQQENSFDCGLFLLH
Sbjct: 481 GSHRGLKSLFQSYLCEEWKERYGDGDDDEDISAVFLTLPFIPLELPQQENSFDCGLFLLH 540

Query: 541 YVELFLEGAPVNFSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSG 600
           YVELFLEGAPVNFS LKILK SNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSG
Sbjct: 541 YVELFLEGAPVNFSPLKILKLSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSG 600

Query: 601 SIGKYPSSDANDSDNDLSKHVSGQAHIFTMTHSDNFSSVGEEVGSVSKVSSDTNYQRIGR 660
           S+GKYPSSDANDSDNDLSKHVSG+A IFTMTHSDNFSSVG+E GSVSKVSSDTNYQRIG 
Sbjct: 601 SVGKYPSSDANDSDNDLSKHVSGEADIFTMTHSDNFSSVGKEFGSVSKVSSDTNYQRIGG 660

Query: 661 WESVMPPIEEDENGERPDSPQFLEDRPQASAVSECSSAFSFGQQFTELEICWEGRYSKNV 720
            ESVMPPIEEDENGE  DSPQ L+DR QASAV E SSAFSFGQQFTELEI WEGRYS+NV
Sbjct: 661 RESVMPPIEEDENGETADSPQCLDDRLQASAVFEFSSAFSFGQQFTELEISWEGRYSQNV 720

Query: 721 KE-MCRKPSPRLSLHELQTPLELGQP-------------------EILTSSSDELINCVV 780
           KE MCRKPSPR SLHELQT L+LGQ                    EILTSSSDELINCVV
Sbjct: 721 KEDMCRKPSPRPSLHELQTTLKLGQDSTPQATKNPNHPTEADNQLEILTSSSDELINCVV 780

Query: 781 EDSEEEGNERNERIEIQVSSSSSSSSSRNNLFLSRQVVESPANFSDNNRQHEH 802
           EDSEEEGNERN+RIEI+V     SSSSRNNLFLSRQVVES ANFSDNNRQHEH
Sbjct: 781 EDSEEEGNERNDRIEIEV-----SSSSRNNLFLSRQVVESTANFSDNNRQHEH 828

BLAST of CSPI07G05910 vs. NCBI nr
Match: gi|659110283|ref|XP_008455145.1| (PREDICTED: probable ubiquitin-like-specific protease 2A isoform X2 [Cucumis melo])

HSP 1 Score: 1412.1 bits (3654), Expect = 0.0e+00
Identity = 729/833 (87.52%), Postives = 748/833 (89.80%), Query Frame = 1

Query: 1   MTRTSSSKPFPSTRRKERDIGDGGGKRFSVFDFSEEDARVEKVSRRLLGKFSARRSSPVT 60
           MTRTSSSKPF STRR  R  G+GGGKRFSVFDFSEED RVEKVSR LLGKFSARRSSPVT
Sbjct: 1   MTRTSSSKPFSSTRRNGRGRGEGGGKRFSVFDFSEEDVRVEKVSRSLLGKFSARRSSPVT 60

Query: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKSANIDSFSEDVSHELIHIDSEVSIYTLT 120
           KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGK AN DSFSED+S+ELIHIDSE       
Sbjct: 61  KHQFLHCFGKGAKSVSRNLSDELIDIDAEVGKGANTDSFSEDISYELIHIDSE------- 120

Query: 121 PGHQISPGPTDLNAEDTCADTPLEGGGSMKQEILETNDILLSRSSTNEDDVTVIFPDFVI 180
            GHQISPGP DLNAEDTCAD  LEGGGSMKQEILETND+L SRSSTNEDD TVIFPDFVI
Sbjct: 121 -GHQISPGPIDLNAEDTCADGSLEGGGSMKQEILETNDLLRSRSSTNEDDFTVIFPDFVI 180

Query: 181 YEGNWCTTSKLIFSCSCIKFRGSALSGLQRTFDSEWAISDIIGIESEWCSRVETAIVNLR 240
           YEGNWCTTSKLIFSCSCIKF+GSALSGLQRTFDSEWA+SDIIGIESEWCSRVETAIVNLR
Sbjct: 181 YEGNWCTTSKLIFSCSCIKFQGSALSGLQRTFDSEWAVSDIIGIESEWCSRVETAIVNLR 240

Query: 241 LKGKHFTRAENSKDISG-----------IELLKFSVCDPLWSESEKAIRTLNLRYNDLWS 300
           LKGKHFT AENS DISG           IELLKFSVCDPLWSESEKAIRTLN+RYNDLW+
Sbjct: 241 LKGKHFTGAENSNDISGLFYVGIKVFMGIELLKFSVCDPLWSESEKAIRTLNVRYNDLWN 300

Query: 301 ADHDDNDKVNGEEIVSWRHSDVFSPKNCFSEFVDTFEEVIYPMGDPDAVTISKRDLELLK 360
           AD+DDNDKV  EEIVSWRHSDVF PKNCFSEFVDTFEEVIYP GDPDAVTISKRDLELLK
Sbjct: 301 ADYDDNDKVKWEEIVSWRHSDVFFPKNCFSEFVDTFEEVIYPKGDPDAVTISKRDLELLK 360

Query: 361 PGMFINDTIIDFYVKYLKNKFLSEKNNRFYFFNSFFFRKLVDLDKDLSSARGGRDAFQRV 420
           PGMFINDTIIDFYVKYLKNKFLSEKN+RFYFFNSFFFRKL DLDKDLSSA GGRDAFQRV
Sbjct: 361 PGMFINDTIIDFYVKYLKNKFLSEKNDRFYFFNSFFFRKLADLDKDLSSACGGRDAFQRV 420

Query: 421 HKWTKKVNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIK 480
           HKWTKKVNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIK
Sbjct: 421 HKWTKKVNLFQKDYLFIPVNYSLHWSLVVICHPGEVVNLKDKKHDNLSKVPCILHMDSIK 480

Query: 481 GSHRGLKSLFQSYLCEEWKERYGDG-DFKDISAVFLTLPFIPLELPQQENSFDCGLFLLH 540
           GSHRGLKSLFQSYLCEEWKERYGDG D +DISAVFLTLPFIPLELPQQENSFDCGLFLLH
Sbjct: 481 GSHRGLKSLFQSYLCEEWKERYGDGDDDEDISAVFLTLPFIPLELPQQENSFDCGLFLLH 540

Query: 541 YVELFLEGAPVNFSSLKILKFSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSG 600
           YVELFLEGAPVNFS LKILK SNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSG
Sbjct: 541 YVELFLEGAPVNFSPLKILKLSNFLSQDWFHPAEASLKRAHILKLIYEIMVCNQAKELSG 600

Query: 601 SIGKYPSSDANDSDNDLSKHVSGQAHIFTMTHSDNFSSVGEEVGSVSKVSSDTNYQRIGR 660
           S+GKYPSSDANDSDNDLSKHVSG+A IFTMTHSDNFSSVG+E GSVSKVSSDTNYQRIG 
Sbjct: 601 SVGKYPSSDANDSDNDLSKHVSGEADIFTMTHSDNFSSVGKEFGSVSKVSSDTNYQRIGG 660

Query: 661 WESVMPPIEEDENGERPDSPQFLEDRPQASAVSECSSAFSFGQQFTELEICWEGRYSKNV 720
            ESVMPPIEEDENGE  DSPQ L+DR QASAV E SSAFSFGQQFTELEI WEGRYS+NV
Sbjct: 661 RESVMPPIEEDENGETADSPQCLDDRLQASAVFEFSSAFSFGQQFTELEISWEGRYSQNV 720

Query: 721 KE-MCRKPSPRLSLHELQTPLELGQP-------------------EILTSSSDELINCVV 780
           KE MCRKPSPR SLHELQT L+LGQ                    EILTSSSDELINCVV
Sbjct: 721 KEDMCRKPSPRPSLHELQTTLKLGQDSTPQATKNPNHPTEADNQLEILTSSSDELINCVV 780

Query: 781 EDSEEEGNERNERIEIQVSSSSSSSSSRNNLFLSRQVVESPANFSDNNRQHEH 802
           EDSEEEGNERN+RIEI+V     SSSSRNNLFLSRQVVES ANFSDNNRQHEH
Sbjct: 781 EDSEEEGNERNDRIEIEV-----SSSSRNNLFLSRQVVESTANFSDNNRQHEH 820

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
ULP2A_ARATH4.8e-12743.92Probable ubiquitin-like-specific protease 2A OS=Arabidopsis thaliana GN=ULP2A PE... [more]
ULP2B_ARATH1.7e-11650.84Probable ubiquitin-like-specific protease 2B OS=Arabidopsis thaliana GN=ULP2B PE... [more]
ULP1C_ARATH7.6e-3233.46Ubiquitin-like-specific protease 1C OS=Arabidopsis thaliana GN=ULP1C PE=1 SV=1[more]
ULP2_SCHPO1.7e-3131.94Ubiquitin-like-specific protease 2 OS=Schizosaccharomyces pombe (strain 972 / AT... [more]
ULP1D_ARATH2.7e-2931.82Ubiquitin-like-specific protease 1D OS=Arabidopsis thaliana GN=ULP1D PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0K633_CUCSA0.0e+0097.01Uncharacterized protein OS=Cucumis sativus GN=Csa_7G065160 PE=4 SV=1[more]
B9S9I8_RICCO1.5e-14846.81Sentrin/sumo-specific protease, putative OS=Ricinus communis GN=RCOM_0886160 PE=... [more]
A0A067KDN9_JATCU4.3e-14350.37Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18741 PE=4 SV=1[more]
A0A0D2V6L3_GOSRA6.9e-14151.54Uncharacterized protein OS=Gossypium raimondii GN=B456_012G143900 PE=4 SV=1[more]
A0A0D2S378_GOSRA6.9e-14151.54Uncharacterized protein OS=Gossypium raimondii GN=B456_012G143900 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G33620.12.1e-12843.63 Cysteine proteinases superfamily protein[more]
AT1G09730.19.6e-11850.84 Cysteine proteinases superfamily protein[more]
AT1G10570.14.3e-3333.46 Cysteine proteinases superfamily protein[more]
AT1G60220.11.5e-3031.82 UB-like protease 1D[more]
AT3G06910.12.7e-1926.94 UB-like protease 1A[more]
Match NameE-valueIdentityDescription
gi|778724518|ref|XP_011658819.1|0.0e+0097.01PREDICTED: probable ubiquitin-like-specific protease 2A [Cucumis sativus][more]
gi|659110287|ref|XP_008455147.1|0.0e+0090.05PREDICTED: probable ubiquitin-like-specific protease 2A isoform X4 [Cucumis melo... [more]
gi|659110285|ref|XP_008455146.1|0.0e+0089.17PREDICTED: probable ubiquitin-like-specific protease 2A isoform X3 [Cucumis melo... [more]
gi|659110281|ref|XP_008455144.1|0.0e+0088.00PREDICTED: probable ubiquitin-like-specific protease 2A isoform X1 [Cucumis melo... [more]
gi|659110283|ref|XP_008455145.1|0.0e+0087.52PREDICTED: probable ubiquitin-like-specific protease 2A isoform X2 [Cucumis melo... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR003653Peptidase_C48_C
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
Vocabulary: Molecular Function
TermDefinition
GO:0008234cysteine-type peptidase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0005575 cellular_component
molecular_function GO:0008234 cysteine-type peptidase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI07G05910.1CSPI07G05910.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003653Ulp1 protease family, C-terminal catalytic domainPFAMPF02902Peptidase_C48coord: 353..544
score: 3.8
IPR003653Ulp1 protease family, C-terminal catalytic domainPROFILEPS50600ULP_PROTEASEcoord: 338..533
score: 28
NoneNo IPR availableGENE3DG3DSA:3.30.310.130coord: 377..527
score: 7.1
NoneNo IPR availablePANTHERPTHR12606SENTRIN/SUMO-SPECIFIC PROTEASEcoord: 461..540
score: 2.8E-45coord: 558..575
score: 2.8E-45coord: 326..440
score: 2.8
NoneNo IPR availableunknownSSF54001Cysteine proteinasescoord: 327..576
score: 3.76

The following gene(s) are paralogous to this gene:

None