Clc01G02440 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G02440
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: E4 SUMO-protein ligase PIAL2 isoform X1
Location: ClcChr01: 2145170 .. 2165085 (+)
RNA-Seq Expression: Clc01G02440
Synteny: Clc01G02440
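
The Location field above gives a sequence identifier, a start and end coordinate, and a strand. As a minimal sketch, the snippet below parses that string and computes the genomic span of the feature. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function names `parse_location` and `span_length` are illustrative only.

```python
import re

# Pattern for the Location field as it appears in this record,
# e.g. "ClcChr01: 2145170 .. 2165085 (+)".
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (sequence_id, start, end, strand) parsed from a Location string."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seq_id, start, end, strand = match.groups()
    return seq_id, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seq_id, start, end, strand = parse_location("ClcChr01: 2145170 .. 2165085 (+)")
print(seq_id, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the feature spans 19916 bases on the forward strand of ClcChr01.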
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTGCATCGTCGCCACACGAGATGAACTTGAATAGAATTATCTTATACATCGATGGCTTAAATTTGCTCCTCAACCGTGTCGGCCAAATGGACCCATCCCAATTATGCAATCTCTGCTTTTCCATTGCCAGGTTTTGATTCAGTTACAAATAATTTAGGGTTTCCAAGTTTGTTAATTGTTCTTGTCGTAGAACTTCGCTTTGCATGCTTTTCTTGAAATGTTGTTAGTTGAGATGATTTGAGACTGGTAATACGAATACCTACTGTTGAGTAGTTGGTAGAAAATGGAGTGTTGTGTAGTTTGTTTGGAAGGCGGCATTTGTGTGGTTGGAACTTGGAAGTATTGTGTTTCAGCGTTTGTTTAGATTGTTCTCTTTGATTTGAAAATTTCCTGCTTTTTTACCCTGATGGAGGTGCGATGGGCGGATAGAATGTTGGGTGGGTTCATTTGAGAATTTTACTTCACTGTGATAGTGTCAAAAAATAATTAAAGAATCACATGGTTTTTAAGAGTGAAGAAGATGATTCTGAAAGTACTTTTTCTCGTAAGAAGAGATGCCTACACTGTCAATTTCTTCTTAAGGAGTAAGAACGTTTAAATAAGTGGTTTCAAAACTTTTGCACACTTCCTGTATTTTTGTTTTGAAAACTCAGTAGTTGCAAACAAGAGGAGCCAAACTAGCATACTAAGGGAGGCCATCTATATTAACACTGATTGTTTAAGGAGTATAGGAGGGAGCTCCTGATGTAAGAAGAATGGGGTCAATTGGTTAATTGGGCAATATTTTAGGTTAATTCCCTTTTTTACCCCACTTGTAATAAAACTCTATAAATAGGAGTCTTCCCCTCTCGTATGAGATGATTGTCATCACTTCAATAAAAGATGCAGAAGATTGATTCTTGGAAGATTATTCATTGAGAATACTTAGAATACATCAATTTGGAATCAAAGCGACCGCCTGGGCCAACGTTGACCGCCACTAGTGGAAGATCGGGAAACTCTTGGTGACGCCGATCACGAGAGGGAACTCTTTGAATGTCGTGCTCATTGCCAAAGAACCCACATATATTTAGGGGAACCTTTTAAAATTCGATTCAAGAAAGAAGAAAAAGAATCATCACAAATGAATTTTTACAACCTAAGAAGAATGTTCAAGGTTCAAAATTAATCTTTTGGTTATGAAACACGACCAAGAAAGGTTGATTCCTATTCAAGAAAATCAAAGAATTGGTCCTATACCTATTATGAGATGGATACATACTCAAGAACCTAACAAAATGACTACTTTGATCCTAGGAGAATATAAAGAAAAGAGCAAAAGAAATCCAATGTTCATTGGGAAAATGACCCAAAGTATAGAACAAATATAAAATCATTTACAAGAATTCAAGAACAGATTGATGTTGTATACTGAAGAAAAAACAAAGGAATCTCAATTAGAAACTTTTCAGATGGAAAATCTTATTGAGACCCATGTCATAAATGGAGGTGATTGGGACATAAATCATGCAAACGAAGTAGATGAAAAAGAAGAATGTGAAGATGTTGATTTGGAAGTAATATCAAGTGAAGAAGAAAAATCCAAAGAAGAAGATGGAAAAGAGGTACCGAACATCAAAACTCTAAATGTCGAGGTGGAAGTAAAAAAATTGAGGAAGAACGACACACCTATTGCGTTGGGAATTGATTACCATGTCAAAGAACTTGAAGTGAATGATGAAGTAGATGTGATTTTTGAAGGGTTCTCTTCGGAAGCTAATTTTTTTGTAACAGGTCCTATGATTACAGTTTTGAATGAGCTTGATGGAAGTTTCTTTTTTGGGTCTATGGCCAATGGAGAAGTTCATCTTTTGGTAGCATTGTTGGTTTTTATGTCTTGGATTTACCTCTTTGATCATAATATTTTTGTTCAAGATGTAGGAGGTTGGTTTGGCCATATTCTGTTGAACAAGTTTATTTTATTTCTTGATTGTTTCATTTTTTCTTTTCAAAACTC
GAGGACGAGTATCTTTCTGGAGGGGTAGAATGATGTCAAAAGAATGGGGTTAGGTGTAATTGGGTAATATTCTTGACTAATTCCCTTTTTTACCCCACTTGTAATAAACCTCTAATTGGAGTCTTCCCCTCTTGTATGAGACGATTATTATCACTTCAATAAAAAATGTACAAGATTGATTATGGGAAGATTATTCATTAAGAATACTTAGACTACATCACGAGTGTTAGATTTATTATTTTAACGAATAGAGTTCTTGAGGTGGAAAAAAGAAAAAAGAGAAAGAAGAGGCCATGATCAACAATGGGGAGTGTTAGATTTATTATTTTAACGGAGAATTTGATGATGGACATATTACAACTCCTGTCAATGAGAGTTATGATGTCGGTTTTGCAAAACACAAATGGCGAGATAAGTTTTGGGGGAGAATCTCTAGGTATAGGGAGAATTTAAACCGAGAGAGAGCTATGTAGATTAAGTGATCAATAAAGAGTCGAAGTCCATAGCTTTATTCTCGAAGATGCGGTGATTATGCTCCAGCCAAGTCAACCATATGCTCCAATTGGACATGACTGGAAATTATGTACATTTGTTTGTCTCGAATGTTTTTTGTAATTATTGATTCTTACTAGAGTGAACAAAATTGGTTTTTTTGCTTTGTTTTTCTTAAATACTTTTTATGTACATGCTTTAGCTTCTGCTCATAACCATCAATTTATCATCTTCCTTGACTTTGCTTGATTAACATTGATAATACAGAAGTATCGACTATACAATTGCAAACAACATTGTTCCATCTAAAGCTCACAGTTTACCTAGTTTCATCAAACAGGTAATGTCTTTTATTCTAGATGTACATGAATTTCTGATGTTTTAAATCTATTTGTCCCTCAAGACACTCGCTTTCAAAGGCAAGTACATATTTACTCATTTCTGTTGGAATAAAAAGTACTACATTTTCTACACACACACACATACACATATATATTTTAATGAAAAAATTGTTGGTCTATTTTGAGCAGCAATTTAGGATGGTGTCATGATATGACCCTGGCCAATTGCAGGCTTTTTGGGCATAATTTACATTATTTGATTAATTTCTTGCCCCTAGAATGCCCAATTTCTGTCACATCACCTGTTTGGTTTCATTCCTTGTTCTTGTACAATCATGCAAAATGTTACACCTTCCCAAACAGTTTGGCCGTGTGATCTTCAAATTATGTTGCATTCTTGTTTTGGTTCTGCCTATGCGTTAGAGATCTTGGAAGGACATTCAATAGGTTGGGATTGGGATTGCAATTCCTATTAATTAGCTGAAACCTTAGCGTTTGTGGGATAAAGATGTTGGGCAAAGGCTTGTATTTGGATTTGGAAGATTAGTTGTTATCCTATCTGTATAATATGTAGATCGGGTTTTGGTCTCATGTTAAATGATTTTATATCAGTTATTCAGAATTTTGTATCCAGTTTCAATGTCTTTTCTATTATAAGTTTGCTTGAGCATTAGTTCACGTAGAATATTTATTTGTCCAGTTATGTCAGATGAAGCATTCTCATCGCTTAAAAGCAGCACTCATGGTACTCATGATAACCATCAAGGTATGTTTTTCTCATCTTTTACTTCACATTTGTTTATAATACAATGTTTTCTTTTTATAAGACTCAAGGTTTGATTAAATGTTTGTATGATTGAACCAAAATTTGTTTGAGATGAGTAGTAATCTGTTAAAAATAAGTAAAGACTTCAACAGCACTTGCACTTCCAAAGGTTGGAAGTTTGACGTTATTTAGGTAAGTAGCTTGCGAATGATGTGTATCTATGGAGTTTATAGTTGTAGTGTTGAAAGTACATACCATTAATGCTGAAGGTTCCTTACCTTCCATCAATCTTGCACTCAAGATTCATCCAAGAAGAATACTTTTACTGACCTACAAAATGGAAATCAAAATTTTCAGAATAGTAGAGGAAGAGGATGAGATTCTCAGAATAAAAGAGGAGG
AAGGATCTGGAACAATTGTAAAAAACCGTGTCAATTGTGTAACAAGTATGGGCATACAATTTTAAAATGTCATTTTCTATCTCATCCTTCATTTCGTTGAATTGGTAATCAAAATGGAGGTTACAATGGGAATAACAATAATCATCCTCAGGCTCAGACCACTTGACAGAATTCCCAAATGTAAGCCATTGTGTTATATCCAAGTCTTGTAGATACAAATTGGTATCTGCATAACCATGTGACAAATATTTTAAAAAAATCTCAGTGAGCACCATTGTCAAAATGGAGGCTATATCCAGGCTGCAAGCAGTGTCTATTGGCCATTTTGGTTCTTCAACTCTTTCTATGCTTGATAAAAAATGAACTTTCCACTTACATAATCTCCAAAACGTGCCTAAAATTACCAAAAATCTCCTTCATGTGCCTAAACAAACTTCAGGATAGTAAGTGCTTGGTTAATCGTAAATAGAGATCTGGTTTGATAAGATGTTTAAGTTGTGTATCTGTTTGAAAATAAATGAAAGTTCATTGACAAATGAACCTAAGTTTATAAATCTTCACCTTCTGGTCAAGAGTTTACTTTATCTCATAAGGTAATCATGCAATACAATTTTGGACAGTCCTACTATCTGATTAGCTTTGATTGTCATTACATTTGAAGATATAATTGAAGCTCAAAAAGCTAAAATACTGGAAAAGGTTTACTTGCTTATAAAGTAAGAGACTTTTCCATCTTTGGGGTCAAACTGTCTTTGATATAGGCATTGGGGTCAAACTTGTAATTTTCTTAATTTTATCAAGTCCCTAGAGTGGTTATTTTTTTCCCCTTGCTAGTAACTTATGTAATTTATCTTCTTTTCTTTAAGCCTTACCTATTTACGCAGGTGGGGATTCTCCTTTCTGTACTTTTGAAATCAATAAATAATCTTTGTTAACTAACTACCCACCTCTCTGGGTATTATAAATGTTAAGTTCAATTGAAATAATCTTCTTTAATATATTACTTGGAAATCTTTGTGACACTGATGCAGAAACGCATCTCCACAGGTTTGTTTCTCCTGCTTGAAAAACTTTTCTCCCACTTTGTTTTTATTTTTATTTTTTATTTATTTAAGTCACTCGACTTCATTTTTTGTTGTGTAGAAATCATTTTGGAATTTTATCTGGTGGTAACAAATATCTTATGTTTCTTCTACTTCTATATTCTGTATGGTAGAATGCTTGCAAGGCGAAATGGTTTTCAGAAAAGGATGCAGAGGAACTCTATAGTCTAGCCAATGAGGTTTCTATTTATTTTATTATTTATTTATCATTGTTATTATTGTTGTCTAAAATTAGTGTTCACTTGCCCCATGGTAAATTTTTGTGCACATTTATGGATTTTTAATATTTTATTTGCTGAGAAATTATTATGTAATGTAGATTGGTAACGATTTCTTCGGAGATACGAATATTGGACAAAGCAATCCCCTTGCCACGATTAGTACAGTTATGGAAAGGTCTGAATTTCATTCTTGGTTAATTTGCATTCCGTGTCTTTAATCTCCTTTTTGTAAAAGGAAGCATATCAACGTTGTTAAATTTTATTAAAGGAAATTTTCATTGAAATAATGAATGAATCTAATGCTCAAACTACATGAGAGGAAGTAGAAATAAAAAGCTTCAAACTTGCCAATACAAGGCTAGAATCAAATTACAACTAAGTGACTAAAGCTATCAATGAGAAGTTACAATGTTCAATAATCCACATTGCAGAAAAAAGGACCAGAACCCTGTCCACTTATCTCACAATCCCAAAACAGAATCCTCAAAAGATCATTTTCTAAATATACATATTGCGAACAATTTACTTTAAAAGGCTTTCATTGAAGTAACCTCTACCCAATTCCTTGAAACCTCATTATATAGCCTATAATTCCTTTCCAACCAAATATTCCATAGAAAAGATTTTACAGCATTAACCCATAGGTGCTTAAGTCTTCCTTCGAACATATTTCTAC
AAAGCAATTGCTCCACATTTGCTGTACTATGAAAACCCATGAGATCCAAAAACCTTGTAACAAAGCCATTTTTTTCTGCTCACTGAACATGAAATGGAAAGATCATTGAAATCTTCACTCTCCTTTAGATCCTAGGTGCATTGAAAAGGATTTGTCATCCAAGTCGGACTTGAATTTATCTTTTTTTTTTTTTTTTTTTTTTTTTTTGAAACGGAAACAAGACTTTGATTTTATTGTAGGAGATTTCTCTCTTTTTTATCCTTTAGGCTACATCAATTTGGTATCAGAGCAGTTGCCTAGGCCAACGTTGACTACTGCTGGTGGAAGATTGGGAAACTTTTGGTGACACTGATCGCAAGAGGGAGCTTCGTGAGTTATGTTCGTTGCCGAACAACGCACATCTAATTAGGGAGAACCCTTCAAATTCTAAGAAAGAAAGAAAATACTGAAGATTTTTTTAAAGACTTTGCAAGAGTGGACTTATTACTGATCAAGGAAAATGCCCAAGAAAAAATGCAGCAGTTGGTATAGGCAAGAAAGTCAAATTTACTCCCAAAAACAATGCTGGAATTATGACTTAAGTGGTTTCAATGTTACATCGCTCATGCTTGATTTTATCAAACAGCCTATCAATTTACATTAAGGATTTCCTGGCTAGAAATGTGGGGTTGGGCTAATCGATCTTTTTATGTGATAATTTAGTGTTAGATGATATAATTAAACTTGGCTTCACCCATCAACTTAAGCTTTTGGGTCAATCGGCGATTTAACATGGTATCAAAGTTGGTGGTGGTTGTTTCCTCCCCAATTAAAATTGATTTCCACATGTTGGGTCTTCTTCATATTTCAAGCTCACAGTTGAAGGGAGTGTTAGATAATATAATTAAATTTGTTTTCACTCATCAACTTAAGCTTTTGGGCCAATCGACGATCTAACATTTATTGTAGGGTTGTGTGGGATTAGAAAGGAGGCAAAGGCACTACTTCAAAGGGGTAGGTTGGAAGGATTTTTTGCTCCATGGTTGGGATGTTCAAAGGGACGCAAACTTGCTTTAGATGGGGTGGGTTATGGTGCTCGGTGCCCATCTCTTTCTTAGTACAAAAAATTTTAATTTGTCAATTGTCAGCTTGTGTGGGGGTATTTCAAATCAGGATGACCATGTTGGGGAGGTCTCGTGGACAGAGGATGTCTCTTTCAAAGCAAATGGAAACACAGACGAGTTCATGCCCTTGGTTTGTTACCTTTAGCATGAGAAAACCTTTCTTTCGGTAAGATTTGTTGTTGACTAGAATAAGGGAGTTTTTTCTCGAAAGATGGTAGACTTACCCTACTTCGATTGGTTTTGAGTGGGATTCATGTATACCTGTTTATCCTTATTTAAAATCCCCGGGTTGGTGAGTAAAATTTCGGAAAAACGGGGCTCTTTTGGTGAAATGGATTTGGCAGTTTCATCAAGGGGCTGATACCTTATGGTTTAGGGTCATTGTGATCAAATATGGGTCACCCCTTTTGAATGGATCTTGAGGGGATTAAATGCATTTCCAAAAACCCTTGGAGAGGCAATGCTGAGGAGCACCCTTTCTATCCTCTTTTTATCCAATGTTTTGTAGGGATGAGAGGATTTTTTTTTTTTGGAAGAATAAGTGGATTGGAGCTAGCCCACTCTTCTAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCTTTTGCATATTTACTTTTTATCCTCAATGAGGAATCAGTCTTAGGCTGGTGGCCTCTCTCATTCACATCCTCCAACTCATGATTCTTCAATAATTGTATGGTTTGTGAGCCTTCAGTTGCATCTGTATGAGTTTGGATATTTTATATGTATATATATACTAGCATTATAGTTTGTTATTATTATGACTTAGATCTTAGTTGATTCTCCCCATTTTGTACTTTATCTTTGCCATTGTTTTCTAGCCTTTTTGCCGTTGGACAGTCCTCCCGCCT
TGGTTCATGAGGTGTTGCCTGGGTTTGTTTGGGGTTTGAATTCAACTACTTCATCTTCTTCCCTCTTTTATCCCATCAATCTCGTGTGGAATTTCTTTTTGTTTCACCTCTCCACTTAATTGCCTTTTTGATGAACTTGGATGAAATAATACATACATTCTTCCAGCAAGTGAGAAATTTCTACTGACCACCATTATGTTTTTTACAACTAGGAGAGAAGATTGCAAAAAGGAGGAGAGCAGAGAACGTATTTTCCTTTTCCATTTCTTTATTTTTTAAAAGAATAAATAACTGTCCAGTCTTTATTCTTTAATGGTGGTCAGTAGGAAAATTTCTACCGGACTTGTCAGTAGGCAAGTAGTATTCACTTGGATGGCTTCCCACTGCCTTTGCCCCTTTTCCATAGCCTTGAGTTTCAATTTGGTCTCCTTTTTTTCTTCAATATAATTGCTACAAACTCATTCATGATCTCCTGCAGAATATTTCTCATTTTTTTGTAATACTCTTCCAGACGTAGAATCAATTTTCAAGAAATAATTATCTGGTACCAATTTTTAGGAACCGGAACTTAAAATAGTAATCATTAATGAATAATATAGGGGTTCAGATGGGGTTCAAATATATATATATATATAGGGGTTCAGATGTATTGTATTGTCAAAAGAAAATACAAATGCTTGCTAGACACTAGAAAGCCTTAGTTCTCTCCAAAAAAGGCTACTCCAAAATATGCCTCCCCCACTTTTTCTAACTCAAATTATTTATTACTATTGATCACCAACAGACTTACCATTTAATTACTAATATACCCCAGCTGGCATTAATCTCAGCAGTCTTTTGGGAATGCTGCATTTGAGTTGATTATTTTGCAATCTTTTACTTTCTATTTGCATGTGCACTCAGATTTTTTCCTCGCTTGAAGCTGGGTCAGATTGTTGCATCTGTAGAAGTTAAGGTGGTTACTTGTCTCTCTGCTGAACTATTTCCTTTAAAAGAGTTCTTTTTAACTTTCTTGGATATCCAATGACATTTACGTTTTCACTGTGCTAGCCTGGATATGGAGTATATGCTATTGATTTCAATATTTTAAAGACAATCCAATATGCACCGCAGGAGAAACTAGTAAGTCGCTATTTATTTGAGAGGCTAGTGGAACTGGGAAAATTTTGTAACCAGTGAATTCTTTTAAATTACGTATTTAGTAGCTTGTGTTCTTTCATTTTGACTGTTGGTCTGCATAAAGAGGAACCTTATTTGTGCGAATTGTGGAGTGGAATATACCTTTGAGCCAAATAGTATCTACTAGAGTTTTTGGGATTCAAATGGATGTGATGTGACCATGTATTACCCTTTAGTAGAATGAAATAGAGAAGCTTAGTGGCCAATATTGCTGTTTATAGTACGTCAGTGATAACTCTATCAAACTTCAAATATTTCTTATTGTTATTGATAGATGTAGTTAAACCGGTATTCTTCATCTCAAATTCTGTGTTTGCAAATTGATCTCCCATTGCAGCGACTGTTTGTTGCTCAAAAAGATAATACTGAGACGTCTGCATGCATAATCAGTCCTCCACAAGTTAAGTATGTTTAATTCGTAAATCCCTCCCTTTTAGATGGAATGAAGTTCAATTATCTCTGTGCTACATTAATTCCTTTTTTTTATTATCGTTTTTTTTTTATATATAGCTTCCTTGTCAATGGGAGGGGAGTCAACGGAAGGACAAATGTATACGTGGTTAGTTTAGATGCACTCAAAAGACGATTTGTACAGGACTCTTCTATTAATTGAATCTTCTTACCGGCATCATTTCTTTTTTAATAAAGGATACTGGACCCCAACTTCCAACAAATATAACTCATATGCTTAAATTAGGATCAAATCTTCTCCAAGCAATTGGGAGCTTCAACGGTAAGTTGAGGTCTAATATCAAGTTTAAGCATCTTGTATTTACTATTCTAGTCTTTAGAATATACTCCTAAAAGGGCCTTTAATCTTT
GTTTAGGTCATTATGTTATAGCAGTTGCCATCATGGGTACCGCACCATCACCTGACTCCTCTGTTCTGCAAGATCATATACAGCCAGTTGTTTCTACTGTGGATTCAGGTACAGTTGCAATCATGAAGGGCAGGTGGAAATTTCTTTTGTTCTTTGTTTCATATTCTATTTATTTATCATTACAATTCGTTATTCAGTTAGGATGGCCTGGCTTAAATGTATTCACATTTCTTTATGCTATAGTTTTAAATGTTAGTAACTTAGTGCTTTGCATAATATAATTTTATTTTTGTTTACAGATTCTGATATAATTGAGGGCCCATCACGAATATCCCTTAATTGCCCCATAAGGTTACATAGACATTTTCTGCATACATTTCTGATACTTCCAGCAACTGCATTGCTTTAATTTTAGTATGCTATTTATATAATTTATTAGTGCTTTAATTTATATATTCCTTCAGATGAAGTTTATGATAATTTTCAGTTTATGTACTCAATTATTGACATGGTGATTGTTTCATCCCGTCCTCAATATGAAACAGCTATACCCGAATCAAGGTTCCTGTGAAAGGTCGTTCTTGTAAACATCTTCAGGTGAGCGTTCCCTGCCATGGCATCTGAAAAGTGTTGATGTACCTTGAATCATAGCATGATCGTCAAAAATCTTTCCCAGATGCTTATCTTTTCTGTCAAGAACATGGGGCAAGAATATTTTGTGCCGAGGCTACTGTAGATGCGTCTTGGCCCTCGACATGTGTACCTATTCAGTTCTTTTAGTTAGACTGTCTTGTAGTTCATATGAATTTACTGTATGCTATAGGACATCTTAGTAAGTTAAATATCAGTGTCTTCTTTCTGTATGCTACAGTGCTTTGATTTCCACAACTTTATTGACATAAATTCAAGAAGACCATCCTGGCGATGTCCACATTGTAATCAGTACATCTGCTTTTTGGATATTCGTGTTGATCAAAATATGTTGAAGGCAAGTAGAGTGATCGGCCTTTTCTAAATTTATTGTGTATTTGCAAATTTTTATTACTTCACTCCATTCACTGAATCCATTTTATTTCCGTTAGGTCATTAGAGAAGTGGCTGAGAACGTTACTGAAGTTATTATCTCAGCAGATGGATCATGGAAGGCTATCCTTGAAAATGATCATGGGGATGGTCGGCCATTGGATGATTCTCTTAACCATCAGAAAGAAAGGGCACAAGAAGAGTCTACTGCCCCTCCTGATGTGTTAGATCTTACTGAGGTTGATGATGATATGGACATCTGCAATTTGGAGACTGAAGACAGGAAAACTTGCCTTGATAATAAAAACCAACCGGTTTCTTCGAGTTTAGATATATCATCTGGAATGAACATGAATAGCTTGAATCAAAATCTTGCTGCTGTCTTGGATGACGACTTTTGGTCTGGAATAGTTACTGATGGGATTTTGACCTCAAGTGCTGGGTCAGATGCTCCAATGGGTAATAGCACGCCTCCACCTGGTTTTGCTGGTATTATGCAATCCACTGTCTTTACTGATGTTGTCCCACCTGTTCTTAATCATGGTGCGGGGGTTCCAGGGCATGCCAACTTTTCATCTTCTGCATTCTATGATCAAAACAATTTGCAGATTCAAGTTCCGAATTCAAATGAAAATAATCAATATGGGAGGATGCCATTAATAGCAAGACCTGTAAGCAGGACGCCAATTGCAGTTCAAGCCCTTCCTGCTCAATCCCAAGTAGCAGGCCAGCAATATAGTTCAAGAACTCCGATCATTTCCTCTGCTCCTCAAGTTGGACAAAGCATTCCGATTAACAGAGATGGTTTAAATACATTATCTCGTGATTTAGAAAGGAGACAACAATTTTCGAGACATCATGGAGATTCACATCACGCAACAAACCTAGCTCCATTTCACCACCCACAAACCGTGCAGGTGCTATTTTATTTTGTTTCCATGGTCTTCATGTGGTAATCTATTTGATGTTTTTCTTTC
CAATGAAACATGACTTTTTGTATTGTGTTAACTTCCATTACGTATACATCTTGTGGAGATTTGTGCCAGGTGAACATAAGTTTTCTAACACGATTTGGGATGAAATATTTGTTTTCATTATTGAGTCATGACCTCAAGAATTTAAATAAAAGCAACACACTTTTTAAGTCTTAGGCCTTATCATTGAACCAACTCTCAGGGCAGTTGACGTTATTTGATCAACGGCCTAAGGGGAGGGCCCACATTTTGTTTTTCATTCTCAACTTGTGTTCTACATATCTCCGAGATGATATTATGTGCATACTCTATTATTTTACTATGTATATTAACCTTTAACTTATCTACTTGAACTCCCAGAACCGGGATTCTCAAGATCGTTCCTTCACTCCTGGTCAATCTGTTCAAGCATCGACTGCTCTAAGGCCATCCACGGGGCTATTAACCGATTTCCAAAATCCTCACCTTCAGCAAGCTCTCAATATGAGGATATCCCACCTCCGGAATCAGAACTCCAGCAGTGTCCGGCCATCTTTGCCATTCTCAAGACCTATGAGTCAAGCAGGAGGTGGATATGCTTATACAGCAGTAACACCTAATAGTCAACATGCAAGAATGGTGGCTGCTTCCCAGCGAGCTGAGATGATGAGACAATCTTCAGCCATGTCATTACAAAATCAAACTTCCAGATCCGCGCATTCTCTCCAAACTACCCCTGATGGGCTTAGGAGACCAGCTGGGGAGATGAGAAATGTTGGAGTATCTCCATCTGTTACTATGGCTCCAGGTTCGGTAGATCTATCTGTAGAACAGAACTGGCAGCCTGTAGGTCGAATGCGTGGCAGTCTTTCCGGTCGAGCTTATTCTGATGCTTATGGTGTAATTATTCAGCCAACTCAAGCTGCACAGTCTGCTCGACTGCCATCTCATTTGTCACCTACTCAACCCAGTGCTCCATCGACGCAGGCTCAAAGGTCCAATGGATTGGATACACACGTTCCAAGAACATAATATTACACGTTGAAGGATTTCTTGGTCTGACAAGTATTGACTGGATGTACAGGTTCTCTTCCGTTTCCTATGGCTCCATTTATCAAGTTCGTAAAAATTTTCTGAGGTATTTTTTCATTATACCATGCCCAATTCTGTCCACTGCAGGTCAGTTGTAAAGTTTTCCTTGTAATTAGCTAGAAGCTACCACTGTTTTATACAGTCTGTCCTGTAAATGGGTTGATTTAGTTCCCTCCATCATTGCAAACAATCTTTGGCTCTGAATCCATGAAACTGAAATTTTGAACAGGAAAATTGGGTCATGGAGATGCTTTGCATTAATATTTGCACAATGAAAATGTACAACTCTAGTTTCTAACTAGAAGGTTTAGTTGAAATTTCTAATCCCACTATTATCTATAAACCATTTATATATTAGATATGGTTGTCTTGTTCCTATTCTTGTTTGTCCACCTGCATTTTTGATAAATTTGAATAGAAAAGAAACTATGATGAGATTGGGCATATTTATTTGTTATTATGTTTCTGGGAACTCAAATCTCAATTAACTCCCACCAAAGTTCCAATCTTCATTCCACAATATTGAGTTATTGAATTCTCTTTCTATATATAGTCATAACTACTAAGTCTACTATAAGTTTGCTTGTCTATCTACTTTCTAACGATGTTTTCAACATCCTTGAAAAGTTTTGAAAATTAAAAAAAAAATATATATATTTAAACACTTATTTTTGTTTTTGAAATTTGATTTAAGATTGAAATGTCTACTATAGAAGATTGAAACCCATTGTAAAGAAATGAAGGGAAAATAAGCATTATTTTTAAAAAATCATAAAAATAAAATGATTGTCAAGCAGATTAATTTCTTTTTTTCTTTTTCCAATTTTTGAAGCAATGAAAATGGAATTTTATTTTATATGTTCAAATATGTTTATAGTAGCATATTTAGAAATTGGATTCAAGTTTAAATAATTAGTCATAATGTATTTGAT
AATATAATTAGTCATAATGTATTTGATAATATATTTATAAACTATAAAATTAGTTGTAGTTATCCACTAAATATTATGTTGAATATATATAGTCTATTATTAATTAATTTATGAACATATGTTATGGGTTTATATACTTAATTCTTTTACACTATCGTATAATTTATAATATAGTATATCATACATATTTTAATTTAACAAAAATGAATTAAGTTTATAACATAAAAAAATAGCTTAAGTGAACACAATTCAATTAGACATACACGGTACAACATGTGTTATCGGCCACGAGGTTTGAATCCTCCCCATCATTTATTGTTGAACTAAACTAAATATGAAATACTATATATTCATTGAAGTTTTTTTTTTTTTTTTTTTTTTCAAAACGTTCATACATTTAAAATATATACACATTAACAAAGTTTACATTATTTTGACGAGAGTCTAAAAGCAACTTTCCAAAATCAAATTTATTGAACGCATATTCCGATTGTCATAAAAAATAGGGTACGGTTTGAAAGTTTTTTTGAAATTTAATTTTTGTATATCCTTTGGTAATTTGAATACAATGAAAAACAAAGTAACTCAGCGCAAAATGAACTAACTATCAACTCTGTCTCCAAGTTACCTGTTTGATTTGTTTACTCAAATTATTTTGGAAATTACCAATTTGAATTTGCCATTCTAACTCAAGAATTTGGCAGAGAAACTTAAAGATTTTATTTATTTATTTTTAACAGCATGTTTGGCCTAACTTTTAGGTGATTTTGGAGGGCTAAAAGTAAATTTCTTAAAAACATTTTTCAGAAATACTTTAAAAATTATTAACCGAAAATTATTGTGCCAATTATATACTTGTATAATTATTTTTAGATGCAAATGTAACCAAACTACACTTGTAATTAAAATCACTTTAAAAATATGCAATCAAACAAGGAGTTAGTATTATTATTATTTAATGTTTAAACTAAAGGTATACGGAAAACTCACTCTACATTTTAGGACCTATAAACTTGCAAGTTGCAGATTAATCCATATTTATCTTTGAATTTTCAAAATTCCTATTTTAGTAATTAACTTTTAAAAATATAATTACTATTTTGGTTGTTGTGTTTTAGTACTAATATCTACTCCATAATTTTCCCAACTAACCTACATGATTCCAATACATGCTATATTTGCAAGCACTATTTATTTAATTAAACAATTAACTCTAAAATAACAACTAAAGTCAAATTATTATGTTTTGTTAAAAGGTAAGGAGAAATTTTTGAAAGTTTTAGAGATATAATTAGTTTTATTAATAATAATATAAAAAAAAGATTAGAAACTGAATCTTATAATCATAATTGTCAAACTTTAGCAATATATAGTACAGAAAGAAAGAATGAGAAATAGGAAATTATTGCATATATACACGTTGTTACTGTTACTTTTTTTTTTTAGAACTATATTATTTTATTCTTTCTTTTAAAATGAAAGACAGAGTAGTTGTAATATATAATTTTTTTGGGGTAAAAATATTATATATATATATATGTTTGAAGTTAGTTTTATTTTAGTTGTTATATTTTCAATAAATTTATTAGAAATAACAAACTTCCAAATATGTCTAATGATATGATATTGTGTAATTTATGCAGAAATAGAAAATTAGGTTGTTGCATATAGAAAATGGAAATGTATGTAAATGAGGTTAAGGAAATGGCTTTGAAATTATGCACTCTCTCTCTCTAAATTCAAACATAACTTAATTCATCATTTCCATCCAACCCGCCATTGCCCTTTCCCACACTATCTATCTCTATTATTCTCATCTCTTATCTTCTACCTATATAAATATTCTTCTTCCTTCTATATCTCTTCCAACTTCATCTTCATTCATAATCATATTGCAAAAACACATGGAGTTAAGTCGATTACTAATTTGGGGTACTTGTATCCTCTCTACTGCTGCCTTCGCTTTAACTATACTGATGCACTCTCCGATGGAAGCTATG
AAAATTATGTACGAGGAAAACCTGTAATTTCTGTGTGGTTGGTTGTTAATTAAGATTATATACAAATGATTACATTTTAAACTCAATCTTCTGTTTGAATTTTTGACAATGGAATCATTACATTTGGGAATGTGAAAAACTGAAATAGTCAAAATCATTTTTTTTTTTTTTTTTTTCAAAATCACTCTCAACTGTGTCTTTAATTTCTAGGAATTAGTCCCCTTCACAAATAAAGTTCGGTTTCTTAAGGTCAAAACCTACTTAAGTTCAACTTAAACCTTAATCATCCCAAATGTGGTGTAATGATGCCATTTCTTTCTAAACTAAATACCCATTATAAGTAATTAACAATTATATAAACAAAAGATTATATAAGCTAAGCTATAACTCATTAGTTTAAGCTGTTTTGGAGGGGGTGGTCTTTGGCAGGTGCATTTGGTGGGAGGCTATTACGACGCTGGCGACAACGTGAAGTTTGGGTGGCCGATGGCATTTACAGTGACATTATGTTGAGTTGGACCCCCATTGAATATGAAAAAGAGATAGCATCTGTGATGCAGCTAGAACACCTCCGGAGCTCAGTCAGGTGGGGCGCCGACTTCATATTGCGGGCTCATGTTTCACCCACCACACTCTACACTCAGGTTTGTGTCATTGTATCCTCTTTTGTTTATTTTTTAATATATTTATATATAGTATTGGTTTTAAGATGACATAATAACCTTTGACTTGCCTTTTTTTGGTACAACATGGAAGGTGGGAAAATTTCAAATATTAGATCTTTTTTATTGATAGTATACATTTCATTTGAGCTCTGCTCCCTTTTAACAGCTTTAATACTATATATAACAATTAATTAATGTATTAAAAAGTATAACAGATTAAAAAAAGGGTTATATGTTACATTTTATTCGTCTATGTCATAACCTTTCAAGTTAAATTTCTACCATGTTTTTATTTTTTATTTTTTATTTTTATTTATGAACAGTACATAAATTCAATAGCTTAAACATTTTAATGGAGTTATTAATATCTTGAATAATAATAATAATAATAATAATGTGAGAGTGTAGATTCAAATCTCTCACTACTTTGATCGATGAAATATGTTTAACTGATCAAGTTATCCCTACGTCAACAATAATCTCTATGAATATTCTATTTGTTTTCTTTCAAAAACTAAAAACAAAGAAAAATCAAAACGTTATCCTTCAGAGTCTAATTTTATTTTATTTTTTGCATGGAATACACACAGGTAGGAGATGGAAATGGCGATCATCAGTGTTGGGAGCGGCCGCAGGACATGGACACCCCTCGAACGCTTTACAAGATAACGCCTAATTCTCCTGGCACCGAAGCTGCCGCTGAGGCTGCCGCCGCGCTTGCCGCTGCTTCCATTGTGTTCCACCATGTCGACGCCAATTACTCAAGAAGCCTTCTTCAACACTCCAAATCCGTGATAATTTAATTTAATTCCAAATATGAATGTAGTTTTAATTGTGTATTATATTGATCAAACGTCGTCGTTTTTGTTGCAATTGTCCAGCTCTTTCAGTTTGCTGATCAGTTTAGAGGATCTTACTCTGCTTCTTGTCCATTCTACTGTTCTTTTTCTGGGTACCAGGTAATTAATCCAACTAAGCAACATTATTTCTAACCCATCATTTTTATTTTCAATAAATCTTAAAATTAGTCTACACGACTAGTTTATTGTTAATTATAAAAAAATATATATAAAAAGAAATATTAGCCCATCTCCAAATTGTGGAAGCATATACACACGGAAAAAGTTTTAAGTCTTTTTATAGAGCATTAGAGTTTAAATGAAAATTAATGCATGTGTTGGAGTGATTTTAAAATGATTAAAATCACTTTTCTCATGTTTAAAATTACTTCGAAACAATATATTTCATTACCAAAAATCAATTTAATATTTAATTTTCATGTTTAATTGCATTTTTGTATAGTATTAAAATTAATTTTGAATGAATAAAAACTAA
TGGAGTAATATTTCTTGTAATTAATTGTATATGTATTGTTACATTATGTAAAATCATCTAAAGTTCAAGAAGTTGAACAATGTCATTATCATATCTATATTCGAACCATCTTATATTCTGAACAGATCGAAGATAAAGATGATTAGAAATCTCTAAGATACTTCATAAAATATTTGTAGAGTCTATAGTATTATCCTTTTTTGGCTGGTTCATTTTTGATTGTTTTGTAGGATGAGTTGCTGTGGGCAGCGGCTTGGGTATACAAAGCAAGTGGAAATAGCAAATATTTGAGCTATATTTTAAGCAACCAAGGGTGGAGTCAACCCACATCCCAGTTTAGTTGGGACAACAAATTTGTTGGAGCTCAGACACTGTTAGCAAAGGTGTTTAAACAAATATTTTTTAAACAAATCTCTAATAATTATAACTTTTTTTCTCTAATCTAATCAAAATCAACTCAAATACACATGAATAAAAGTAGCAATAAAAGTTATGATTTATGTTAATCCATGCATTTAAAGAAATCTTTATGTATTAGTGGTTCAAAAATGAAATTTAATTCACTTGTGACAGTTTAACAACCCAAATTTTAAAAATATTAACTAATTTTAGTCACACACATATGGAGGTGAATTTCAAAGTTAAGGTTAAAAATACAAATTTGGCTTTCTAATTATAAATAAATTAGATGATCACTTTATATAGTTTCTCTTTTGTTCTAAAGGCTTAAACTCTACAGGAGTTATATAAAGGAAAGAAAAACTTGAGCAAGTTTAAGATCGATGCAGAATCGTTCATCTGCATGGTGATGCCTGGTGGCAGCTGTTCTAAGATTCCAACAACACCCGGTATGAATTAAGCTATGATTTGAGTGATTTTGAAATTGTTGAAAATCATTTTTGTCATTTTCAAAATCACTTTGAATGATATTTTTAGAAGTCTTTGAAAATGTTTAAGTGTAAAAATAAAGTCTCTTAATAATTCTAAATCAATTTTGATGATGTGAATTACGAATAAAATATAATACTCCCACTCGTGAAAAATTACCAATAAATGAATTATTATTGAGTTGCTAAAGCTTCTTATTTGTAGGTGGGCTTCTTTTCCTAAGAGATAATAGTAATTTGCAGTATGCATCAAGCTCTTCCATGGTGCTTTTCATGTATTCTAGACTTTTAAATAAAGCTCGTGTTGATGGAGTTCATTGTGGATCCAAATATTTCTCCTCTTCTCAAATCAAAACCTTTGCCAAATCACAGGTAAGTTTAGAGATTCTCATATATGATGATTTTTCAATAATTGCTAAAAAAAATATGACAATTTTATAATGTGTGGTATGATATAATTAATTTAGCATGACCTAGTTAACCTTTTAACTAATAGATATTTCATAAGCAATATAAAATTTAAAATCTTAATGATGATTATTCAACTATGGTAACGTAATAATGTTATTTCACTTCTCTATTTTATTAGAAACTATATGTGTTGTACCTTGTATATCAAAATTAGTATTTAAGTACAACCATTTTGTTTGAAATTATGATGTTCTAGAAAAGACACTCAAAAGCTTAGGTCAATTTTCTTCTATTTAGGTGGATTACATATTGGGGAAAAATCCCATGAAATGGTCATACATGGTAGGATTTGGCAACAAATATCCATTACAATTGCATCATAGAGCTTCATCCATCCCTTCAATAAAAGTGCACTCAACAAAGGTTGGTTGTAATGATGGTTACTCACACTATTTTTATTCAAATAATCCAAATCCAAACGTACACATCGGTGCCATAGTAGGAGGTCCTAATTCAAACGATCAGTTCAGTGATTTGAGATCAGACCACTCTCATTCTGAACCTACAACTTATATGAATGCTGCTTTTGTTGGTTCAGTAGCTGCCTTAGTTGCATAA

mRNA sequence

ATGGGTGCATCGTCGCCACACGAGATGAACTTGAATAGAATTATCTTATACATCGATGGCTTAAATTTGCTCCTCAACCGTGTCGGCCAAATGGACCCATCCCAATTATGCAATCTCTGCTTTTCCATTGCCAGAAGTATCGACTATACAATTGCAAACAACATTGTTCCATCTAAAGCTCACAGTTTACCTAGTTTCATCAAACAGTTATGTCAGATGAAGCATTCTCATCGCTTAAAAGCAGCACTCATGGTACTCATGATAACCATCAAGAATGCTTGCAAGGCGAAATGGTTTTCAGAAAAGGATGCAGAGGAACTCTATAGTCTAGCCAATGAGATTGGTAACGATTTCTTCGGAGATACGAATATTGGACAAAGCAATCCCCTTGCCACGATTAGTACAGTTATGGAAAGATTTTTTCCTCGCTTGAAGCTGGGTCAGATTGTTGCATCTGTAGAAGTTAAGCCTGGATATGGAGTATATGCTATTGATTTCAATATTTTAAAGACAATCCAATATGCACCGCAGGAGAAACTACGACTGTTTGTTGCTCAAAAAGATAATACTGAGACGTCTGCATGCATAATCAGTCCTCCACAAGTTAACTTCCTTGTCAATGGGAGGGGAGTCAACGGAAGGACAAATGTATACGTGGATACTGGACCCCAACTTCCAACAAATATAACTCATATGCTTAAATTAGGATCAAATCTTCTCCAAGCAATTGGGAGCTTCAACGGTCATTATGTTATAGCAGTTGCCATCATGGGTACCGCACCATCACCTGACTCCTCTGTTCTGCAAGATCATATACAGCCAGTTGTTTCTACTGTGGATTCAGATTCTGATATAATTGAGGGCCCATCACGAATATCCCTTAATTGCCCCATAAGCTATACCCGAATCAAGGTTCCTGTGAAAGGTCGTTCTTGTAAACATCTTCAGTGCTTTGATTTCCACAACTTTATTGACATAAATTCAAGAAGACCATCCTGGCGATGTCCACATTGTAATCAGTACATCTGCTTTTTGGATATTCGTGTTGATCAAAATATGTTGAAGGCAAGTAGAGTCATTAGAGAAGTGGCTGAGAACGTTACTGAAGTTATTATCTCAGCAGATGGATCATGGAAGGCTATCCTTGAAAATGATCATGGGGATGGTCGGCCATTGGATGATTCTCTTAACCATCAGAAAGAAAGGGCACAAGAAGAGTCTACTGCCCCTCCTGATGTGTTAGATCTTACTGAGGTTGATGATGATATGGACATCTGCAATTTGGAGACTGAAGACAGGAAAACTTGCCTTGATAATAAAAACCAACCGGTTTCTTCGAGTTTAGATATATCATCTGGAATGAACATGAATAGCTTGAATCAAAATCTTGCTGCTGTCTTGGATGACGACTTTTGGTCTGGAATAGTTACTGATGGGATTTTGACCTCAAGTGCTGGGTCAGATGCTCCAATGGGTAATAGCACGCCTCCACCTGGTTTTGCTGGTATTATGCAATCCACTGTCTTTACTGATGTTGTCCCACCTGTTCTTAATCATGGTGCGGGGGTTCCAGGGCATGCCAACTTTTCATCTTCTGCATTCTATGATCAAAACAATTTGCAGATTCAAGTTCCGAATTCAAATGAAAATAATCAATATGGGAGGATGCCATTAATAGCAAGACCTGTAAGCAGGACGCCAATTGCAGTTCAAGCCCTTCCTGCTCAATCCCAAGTAGCAGGCCAGCAATATAGTTCAAGAACTCCGATCATTTCCTCTGCTCCTCAAGTTGGACAAAGCATTCCGATTAACAGAGATGGTTTAAATACATTATCTCGTGATTTAGAAAGGAGACAACAATTTTCGAGACATCATGGAGATTCACATCACGCAACAAACCTAGCTCCATTTCACCACCCACAAACCGTGCAGAACCGGGATTCTCAAGATCGTTCCTTCACTCCTGGTCAATCTGTTCAAGCATCGACTGCTCTAAGGCC
ATCCACGGGGCTATTAACCGATTTCCAAAATCCTCACCTTCAGCAAGCTCTCAATATGAGGATATCCCACCTCCGGAATCAGAACTCCAGCAGTGTCCGGCCATCTTTGCCATTCTCAAGACCTATGAGTCAAGCAGGAGGTGGATATGCTTATACAGCAGTAACACCTAATAGTCAACATGCAAGAATGGTGGCTGCTTCCCAGCGAGCTGAGATGATGAGACAATCTTCAGCCATGTCATTACAAAATCAAACTTCCAGATCCGCGCATTCTCTCCAAACTACCCCTGATGGGCTTAGGAGACCAGCTGGGGAGATGAGAAATGTTGGAGTATCTCCATCTGTTACTATGGCTCCAGGTTCGGTAGATCTATCTGTAGAACAGAACTGGCAGCCTGTAGGTCGAATGCGTGGCAGTCTTTCCGGTCGAGCTTATTCTGATGCTTATGGTGTAATTATTCAGCCAACTCAAGCTGCACAGTCTGCTCGACTGCCATCTCATTTGTCACCTACTCAACCCAGTGCTCCATCGACGCAGGCTCAAAGGTTCTCTTCCGTTTCCTATGGCTCCATTTATCAAGTTCGTAAAAATTTTCTGAGGTATTTTTTCATTATACCATGCCCAATTCTGTCCACTGCAGGTGCATTTGGTGGGAGGCTATTACGACGCTGGCGACAACGTGAAGTTTGGGTGGCCGATGGCATTTACAGTGACATTATGTTGAGTTGGACCCCCATTGAATATGAAAAAGAGATAGCATCTGTGATGCAGCTAGAACACCTCCGGAGCTCAGTCAGGTGGGGCGCCGACTTCATATTGCGGGCTCATGTTTCACCCACCACACTCTACACTCAGGTAGGAGATGGAAATGGCGATCATCAGTGTTGGGAGCGGCCGCAGGACATGGACACCCCTCGAACGCTTTACAAGATAACGCCTAATTCTCCTGGCACCGAAGCTGCCGCTGAGGCTGCCGCCGCGCTTGCCGCTGCTTCCATTGTGTTCCACCATGTCGACGCCAATTACTCAAGAAGCCTTCTTCAACACTCCAAATCCCTCTTTCAGTTTGCTGATCAGTTTAGAGGATCTTACTCTGCTTCTTGTCCATTCTACTGTTCTTTTTCTGGGTACCAGGATGAGTTGCTGTGGGCAGCGGCTTGGGTATACAAAGCAAGTGGAAATAGCAAATATTTGAGCTATATTTTAAGCAACCAAGGGTGGAGTCAACCCACATCCCAGTTTAGTTGGGACAACAAATTTGTTGGAGCTCAGACACTGTTAGCAAAGGAGTTATATAAAGGAAAGAAAAACTTGAGCAAGTTTAAGATCGATGCAGAATCGTTCATCTGCATGGTGATGCCTGGTGGCAGCTGTTCTAAGATTCCAACAACACCCGGTGGGCTTCTTTTCCTAAGAGATAATAGTAATTTGCAGTATGCATCAAGCTCTTCCATGGTGCTTTTCATGTATTCTAGACTTTTAAATAAAGCTCGTGTTGATGGAGTTCATTGTGGATCCAAATATTTCTCCTCTTCTCAAATCAAAACCTTTGCCAAATCACAGGTGGATTACATATTGGGGAAAAATCCCATGAAATGGTCATACATGGTAGGATTTGGCAACAAATATCCATTACAATTGCATCATAGAGCTTCATCCATCCCTTCAATAAAAGTGCACTCAACAAAGGTTGGTTGTAATGATGGTTACTCACACTATTTTTATTCAAATAATCCAAATCCAAACGTACACATCGGTGCCATAGTAGGAGGTCCTAATTCAAACGATCAGTTCAGTGATTTGAGATCAGACCACTCTCATTCTGAACCTACAACTTATATGAATGCTGCTTTTGTTGGTTCAGTAGCTGCCTTAGTTGCATAA

Coding sequence (CDS)

ATGGGTGCATCGTCGCCACACGAGATGAACTTGAATAGAATTATCTTATACATCGATGGCTTAAATTTGCTCCTCAACCGTGTCGGCCAAATGGACCCATCCCAATTATGCAATCTCTGCTTTTCCATTGCCAGAAGTATCGACTATACAATTGCAAACAACATTGTTCCATCTAAAGCTCACAGTTTACCTAGTTTCATCAAACAGTTATGTCAGATGAAGCATTCTCATCGCTTAAAAGCAGCACTCATGGTACTCATGATAACCATCAAGAATGCTTGCAAGGCGAAATGGTTTTCAGAAAAGGATGCAGAGGAACTCTATAGTCTAGCCAATGAGATTGGTAACGATTTCTTCGGAGATACGAATATTGGACAAAGCAATCCCCTTGCCACGATTAGTACAGTTATGGAAAGATTTTTTCCTCGCTTGAAGCTGGGTCAGATTGTTGCATCTGTAGAAGTTAAGCCTGGATATGGAGTATATGCTATTGATTTCAATATTTTAAAGACAATCCAATATGCACCGCAGGAGAAACTACGACTGTTTGTTGCTCAAAAAGATAATACTGAGACGTCTGCATGCATAATCAGTCCTCCACAAGTTAACTTCCTTGTCAATGGGAGGGGAGTCAACGGAAGGACAAATGTATACGTGGATACTGGACCCCAACTTCCAACAAATATAACTCATATGCTTAAATTAGGATCAAATCTTCTCCAAGCAATTGGGAGCTTCAACGGTCATTATGTTATAGCAGTTGCCATCATGGGTACCGCACCATCACCTGACTCCTCTGTTCTGCAAGATCATATACAGCCAGTTGTTTCTACTGTGGATTCAGATTCTGATATAATTGAGGGCCCATCACGAATATCCCTTAATTGCCCCATAAGCTATACCCGAATCAAGGTTCCTGTGAAAGGTCGTTCTTGTAAACATCTTCAGTGCTTTGATTTCCACAACTTTATTGACATAAATTCAAGAAGACCATCCTGGCGATGTCCACATTGTAATCAGTACATCTGCTTTTTGGATATTCGTGTTGATCAAAATATGTTGAAGGCAAGTAGAGTCATTAGAGAAGTGGCTGAGAACGTTACTGAAGTTATTATCTCAGCAGATGGATCATGGAAGGCTATCCTTGAAAATGATCATGGGGATGGTCGGCCATTGGATGATTCTCTTAACCATCAGAAAGAAAGGGCACAAGAAGAGTCTACTGCCCCTCCTGATGTGTTAGATCTTACTGAGGTTGATGATGATATGGACATCTGCAATTTGGAGACTGAAGACAGGAAAACTTGCCTTGATAATAAAAACCAACCGGTTTCTTCGAGTTTAGATATATCATCTGGAATGAACATGAATAGCTTGAATCAAAATCTTGCTGCTGTCTTGGATGACGACTTTTGGTCTGGAATAGTTACTGATGGGATTTTGACCTCAAGTGCTGGGTCAGATGCTCCAATGGGTAATAGCACGCCTCCACCTGGTTTTGCTGGTATTATGCAATCCACTGTCTTTACTGATGTTGTCCCACCTGTTCTTAATCATGGTGCGGGGGTTCCAGGGCATGCCAACTTTTCATCTTCTGCATTCTATGATCAAAACAATTTGCAGATTCAAGTTCCGAATTCAAATGAAAATAATCAATATGGGAGGATGCCATTAATAGCAAGACCTGTAAGCAGGACGCCAATTGCAGTTCAAGCCCTTCCTGCTCAATCCCAAGTAGCAGGCCAGCAATATAGTTCAAGAACTCCGATCATTTCCTCTGCTCCTCAAGTTGGACAAAGCATTCCGATTAACAGAGATGGTTTAAATACATTATCTCGTGATTTAGAAAGGAGACAACAATTTTCGAGACATCATGGAGATTCACATCACGCAACAAACCTAGCTCCATTTCACCACCCACAAACCGTGCAGAACCGGGATTCTCAAGATCGTTCCTTCACTCCTGGTCAATCTGTTCAAGCATCGACTGCTCTAAGGCC
ATCCACGGGGCTATTAACCGATTTCCAAAATCCTCACCTTCAGCAAGCTCTCAATATGAGGATATCCCACCTCCGGAATCAGAACTCCAGCAGTGTCCGGCCATCTTTGCCATTCTCAAGACCTATGAGTCAAGCAGGAGGTGGATATGCTTATACAGCAGTAACACCTAATAGTCAACATGCAAGAATGGTGGCTGCTTCCCAGCGAGCTGAGATGATGAGACAATCTTCAGCCATGTCATTACAAAATCAAACTTCCAGATCCGCGCATTCTCTCCAAACTACCCCTGATGGGCTTAGGAGACCAGCTGGGGAGATGAGAAATGTTGGAGTATCTCCATCTGTTACTATGGCTCCAGGTTCGGTAGATCTATCTGTAGAACAGAACTGGCAGCCTGTAGGTCGAATGCGTGGCAGTCTTTCCGGTCGAGCTTATTCTGATGCTTATGGTGTAATTATTCAGCCAACTCAAGCTGCACAGTCTGCTCGACTGCCATCTCATTTGTCACCTACTCAACCCAGTGCTCCATCGACGCAGGCTCAAAGGTTCTCTTCCGTTTCCTATGGCTCCATTTATCAAGTTCGTAAAAATTTTCTGAGGTATTTTTTCATTATACCATGCCCAATTCTGTCCACTGCAGGTGCATTTGGTGGGAGGCTATTACGACGCTGGCGACAACGTGAAGTTTGGGTGGCCGATGGCATTTACAGTGACATTATGTTGAGTTGGACCCCCATTGAATATGAAAAAGAGATAGCATCTGTGATGCAGCTAGAACACCTCCGGAGCTCAGTCAGGTGGGGCGCCGACTTCATATTGCGGGCTCATGTTTCACCCACCACACTCTACACTCAGGTAGGAGATGGAAATGGCGATCATCAGTGTTGGGAGCGGCCGCAGGACATGGACACCCCTCGAACGCTTTACAAGATAACGCCTAATTCTCCTGGCACCGAAGCTGCCGCTGAGGCTGCCGCCGCGCTTGCCGCTGCTTCCATTGTGTTCCACCATGTCGACGCCAATTACTCAAGAAGCCTTCTTCAACACTCCAAATCCCTCTTTCAGTTTGCTGATCAGTTTAGAGGATCTTACTCTGCTTCTTGTCCATTCTACTGTTCTTTTTCTGGGTACCAGGATGAGTTGCTGTGGGCAGCGGCTTGGGTATACAAAGCAAGTGGAAATAGCAAATATTTGAGCTATATTTTAAGCAACCAAGGGTGGAGTCAACCCACATCCCAGTTTAGTTGGGACAACAAATTTGTTGGAGCTCAGACACTGTTAGCAAAGGAGTTATATAAAGGAAAGAAAAACTTGAGCAAGTTTAAGATCGATGCAGAATCGTTCATCTGCATGGTGATGCCTGGTGGCAGCTGTTCTAAGATTCCAACAACACCCGGTGGGCTTCTTTTCCTAAGAGATAATAGTAATTTGCAGTATGCATCAAGCTCTTCCATGGTGCTTTTCATGTATTCTAGACTTTTAAATAAAGCTCGTGTTGATGGAGTTCATTGTGGATCCAAATATTTCTCCTCTTCTCAAATCAAAACCTTTGCCAAATCACAGGTGGATTACATATTGGGGAAAAATCCCATGAAATGGTCATACATGGTAGGATTTGGCAACAAATATCCATTACAATTGCATCATAGAGCTTCATCCATCCCTTCAATAAAAGTGCACTCAACAAAGGTTGGTTGTAATGATGGTTACTCACACTATTTTTATTCAAATAATCCAAATCCAAACGTACACATCGGTGCCATAGTAGGAGGTCCTAATTCAAACGATCAGTTCAGTGATTTGAGATCAGACCACTCTCATTCTGAACCTACAACTTATATGAATGCTGCTTTTGTTGGTTCAGTAGCTGCCTTAGTTGCATAA

Protein sequence

MGASSPHEMNLNRIILYIDGLNLLLNRVGQMDPSQLCNLCFSIARSIDYTIANNIVPSKAHSLPSFIKQLCQMKHSHRLKAALMVLMITIKNACKAKWFSEKDAEELYSLANEIGNDFFGDTNIGQSNPLATISTVMERFFPRLKLGQIVASVEVKPGYGVYAIDFNILKTIQYAPQEKLRLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNVYVDTGPQLPTNITHMLKLGSNLLQAIGSFNGHYVIAVAIMGTAPSPDSSVLQDHIQPVVSTVDSDSDIIEGPSRISLNCPISYTRIKVPVKGRSCKHLQCFDFHNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLKASRVIREVAENVTEVIISADGSWKAILENDHGDGRPLDDSLNHQKERAQEESTAPPDVLDLTEVDDDMDICNLETEDRKTCLDNKNQPVSSSLDISSGMNMNSLNQNLAAVLDDDFWSGIVTDGILTSSAGSDAPMGNSTPPPGFAGIMQSTVFTDVVPPVLNHGAGVPGHANFSSSAFYDQNNLQIQVPNSNENNQYGRMPLIARPVSRTPIAVQALPAQSQVAGQQYSSRTPIISSAPQVGQSIPINRDGLNTLSRDLERRQQFSRHHGDSHHATNLAPFHHPQTVQNRDSQDRSFTPGQSVQASTALRPSTGLLTDFQNPHLQQALNMRISHLRNQNSSSVRPSLPFSRPMSQAGGGYAYTAVTPNSQHARMVAASQRAEMMRQSSAMSLQNQTSRSAHSLQTTPDGLRRPAGEMRNVGVSPSVTMAPGSVDLSVEQNWQPVGRMRGSLSGRAYSDAYGVIIQPTQAAQSARLPSHLSPTQPSAPSTQAQRFSSVSYGSIYQVRKNFLRYFFIIPCPILSTAGAFGGRLLRRWRQREVWVADGIYSDIMLSWTPIEYEKEIASVMQLEHLRSSVRWGADFILRAHVSPTTLYTQVGDGNGDHQCWERPQDMDTPRTLYKITPNSPGTEAAAEAAAALAAASIVFHHVDANYSRSLLQHSKSLFQFADQFRGSYSASCPFYCSFSGYQDELLWAAAWVYKASGNSKYLSYILSNQGWSQPTSQFSWDNKFVGAQTLLAKELYKGKKNLSKFKIDAESFICMVMPGGSCSKIPTTPGGLLFLRDNSNLQYASSSSMVLFMYSRLLNKARVDGVHCGSKYFSSSQIKTFAKSQVDYILGKNPMKWSYMVGFGNKYPLQLHHRASSIPSIKVHSTKVGCNDGYSHYFYSNNPNPNVHIGAIVGGPNSNDQFSDLRSDHSHSEPTTYMNAAFVGSVAALVA
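
The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The sketch below shows that relationship on the first and last few codons of the CDS only. It assumes the standard genetic code (the record does not name a translation table), and the helper name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# for each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    trailing bases that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons and last three codons of the CDS in this record.
print(translate("ATGGGTGCATCGTCGCCACACGAGATG"))
print(translate("GTTGCATAA"))
```

The two fragments translate to MGASSPHEM and VA, matching the start and the end of the protein sequence listed above.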
Homology
BLAST of Clc01G02440 vs. NCBI nr
Match: XP_011654714.1 (E4 SUMO-protein ligase PIAL2 isoform X1 [Cucumis sativus] >KAE8647846.1 hypothetical protein Csa_000250 [Cucumis sativus])

HSP 1 Score: 1375.1 bits (3558), Expect = 0.0e+00
Identity = 701/851 (82.37%), Positives = 753/851 (88.48%), Query Frame = 0

Query: 1   MGASSPHEMNLNRIILYIDGLNLLLNRVGQMDPSQLCNLCFSIARSIDYTIANNIVPSKA 60
           MGASS H+ NL +II YIDGL LL+N V Q+D + LC+LCFSI+RSIDY IANN VPSKA
Sbjct: 1   MGASSQHDTNLKKIISYIDGLTLLINHVAQIDLANLCSLCFSISRSIDYAIANNAVPSKA 60

Query: 61  HSLPSFIKQLCQMKHSHRLKAALMVLMITIKNACKAKWFSEKDAEELYSLANEIGNDFFG 120
           HSLPS +KQLCQ+KHSHR KAALMVLM+TIKNACK +WFSEKDAEEL  LANEIGNDFFG
Sbjct: 61  HSLPSLVKQLCQLKHSHRSKAALMVLMLTIKNACKVRWFSEKDAEELQRLANEIGNDFFG 120

Query: 121 DTNIGQSNPLATISTVMERFFPRLKLGQIVASVEVKPGYGVYAIDFNILKTIQYAPQEKL 180
           DTNIGQ+N L TI+TVMER+FP LKLGQIVAS+EVKPGYGVYA+DFNI +T+QYA QEKL
Sbjct: 121 DTNIGQANSLTTITTVMERYFPCLKLGQIVASLEVKPGYGVYALDFNISRTVQYASQEKL 180

Query: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNVYVDTGPQLPTNITHMLKLGSNLL 240
           RLFV QKDNTETSACIISPPQVNFLVNGRG+NGR N ++DTGPQLPTNITHMLKLGSNLL
Sbjct: 181 RLFVIQKDNTETSACIISPPQVNFLVNGRGINGRINTHMDTGPQLPTNITHMLKLGSNLL 240

Query: 241 QAIGSFNGHYVIAVAIMGTAPSPDSSVLQDHIQPVVSTVDSDSDIIEGPSRISLNCPISY 300
           QA+GSFNGHYV+A+AI GTAPSPDSSVL DHIQP+VST+DSDSDIIEGPSRISLNCPISY
Sbjct: 241 QAVGSFNGHYVLAIAITGTAPSPDSSVLHDHIQPIVSTLDSDSDIIEGPSRISLNCPISY 300

Query: 301 TRIKVPVKGRSCKHLQCFDFHNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLKASRVI 360
           TRIK+PVKG SCKHLQCFDF NFIDINSRRPSWRCPHCNQYICFLDIRVD+NMLK   VI
Sbjct: 301 TRIKIPVKGCSCKHLQCFDFDNFIDINSRRPSWRCPHCNQYICFLDIRVDRNMLK---VI 360

Query: 361 REVAENVTEVIISADGSWKAILENDHGDGRPLDDSLNHQKERAQEESTAPPDVLDLTEVD 420
           REVAENVTEVIIS DGSWKAILEND+GDGR L+DSLNHQ ERAQEES A PDVLD TEV 
Sbjct: 361 REVAENVTEVIISVDGSWKAILENDNGDGRSLNDSLNHQNERAQEESAASPDVLDHTEVG 420

Query: 421 DDMDICNLETEDRKTCLDNKNQPVSSSLDISSGMNMNSLNQNLAAVLDDDFWSGIVTDGI 480
           DDMDI N E EDRK CL NKNQ VSSSLD+SSGMNMNS +QNL+AV+DDD WS I  DG+
Sbjct: 421 DDMDIFNSEIEDRKPCLGNKNQRVSSSLDMSSGMNMNSFSQNLSAVMDDDIWSRI--DGV 480

Query: 481 LTSSAGSDAPMGNSTPPPGFAGIMQSTVFTDVVPPVLNHGAGVPGHANFSSSAFYDQNNL 540
           L S+AG DAPM NST PPGF G MQS V TD V PVLNHG GV GHANF S AFY+QNN+
Sbjct: 481 LISTAGLDAPMVNSTYPPGFTGTMQSAVLTDAVQPVLNHGVGVSGHANFPSPAFYNQNNV 540

Query: 541 QIQVPNSNENNQYGRMPLIARPVSRTPIAVQALPAQSQVAGQQYSSRTPIISSAPQVGQS 600
           QIQV NSNENNQYGR+  I+RPVSRTP+AVQALPAQS  AGQQYSSRTPIISS PQVGQS
Sbjct: 541 QIQVSNSNENNQYGRVTSISRPVSRTPVAVQALPAQSHAAGQQYSSRTPIISS-PQVGQS 600

Query: 601 IPINRDGLNTLSRDLERRQQFSRHHGDSHHATNLAPFHHPQTVQNRDSQDRSFTPGQSVQ 660
           IPINRDGLN LSRDLERRQQFSRHHGDSHH+TNLA FHHPQTVQNRD QDRSFT GQS+Q
Sbjct: 601 IPINRDGLNALSRDLERRQQFSRHHGDSHHSTNLASFHHPQTVQNRDPQDRSFTTGQSIQ 660

Query: 661 ASTALRPSTGLLTDFQNPHLQQALNMRISHLRNQNSSSVRPSLPFSRPMSQAGGGY---A 720
            S+  RPS GLL DFQNPHLQQALNMR+ HL+NQNSSSVR SL FSRPMSQ GGGY    
Sbjct: 661 TSSGARPSPGLLADFQNPHLQQALNMRMPHLQNQNSSSVRTSLSFSRPMSQVGGGYGGST 720

Query: 721 YTAVTPNSQHARMVAASQRAEMMRQSSAMSLQNQTSRSAHSLQTTPDGLRRPAGEMRNVG 780
           YT VTPNSQHARM+AASQR EMMRQS  MSL NQTSRSAHSLQTTPDGLRRP+G++RNVG
Sbjct: 721 YTTVTPNSQHARMLAASQRVEMMRQSPPMSLHNQTSRSAHSLQTTPDGLRRPSGDLRNVG 780

Query: 781 VSPSVTMAPGSVDLSVEQNWQPVGRMRGSLSGRAYSDAYGVIIQPTQAAQSARLPSHLSP 840
           VS SVTMA GSVDLS EQNWQP GRMRGSLSGR YSDAYGVIIQPTQAAQSAR PS+L+P
Sbjct: 781 VSQSVTMAAGSVDLSAEQNWQPAGRMRGSLSGRVYSDAYGVIIQPTQAAQSARPPSNLTP 840

Query: 841 TQPSAPSTQAQ 849
           TQP APSTQAQ
Sbjct: 841 TQPIAPSTQAQ 845

BLAST of Clc01G02440 vs. NCBI nr
Match: XP_008437346.1 (PREDICTED: E4 SUMO-protein ligase PIAL2 isoform X1 [Cucumis melo])

HSP 1 Score: 1370.9 bits (3547), Expect = 0.0e+00
Identity = 707/852 (82.98%), Positives = 758/852 (88.97%), Query Frame = 0

Query: 1   MGASSPHEMNLNRIILYIDGLNLLLNRVGQMDPSQLCNLCFSIARSIDYTIANNIVPSKA 60
           MGAS+  + NLN+II YIDGL LL+N V Q+D + LC LCFSI+RSIDY IANN VPSKA
Sbjct: 1   MGASAQQDTNLNKIIAYIDGLTLLINHVAQVDLAHLCTLCFSISRSIDYAIANNAVPSKA 60

Query: 61  HSLPSFIKQLCQMKHSHRLKAALMVLMITIKNACKAKWFSEKDAEELYSLANEIGNDFFG 120
           HSLP   KQLCQ+KHSHR KAALMVLM+TIKNACK +WFSEKDAEEL  LANEIGNDFFG
Sbjct: 61  HSLPILFKQLCQLKHSHRSKAALMVLMLTIKNACKVRWFSEKDAEELQRLANEIGNDFFG 120

Query: 121 DTNIGQSNPLATISTVMERFFPRLKLGQIVASVEVKPGYGVYAIDFNILKTIQYAPQEKL 180
           D NIGQ+N L TI+TVMER+FP LKLGQIVAS+EVKPGYGVYA+DFNI +TIQ APQEKL
Sbjct: 121 DMNIGQANSLTTITTVMERYFPCLKLGQIVASLEVKPGYGVYALDFNISRTIQCAPQEKL 180

Query: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNVYVDTGPQLPTNITHMLKLGSNLL 240
           RLFV QKDNTETSAC+ISPPQVNFLVNGRGVNGR N ++DTGPQLPTNITHMLKLGSNLL
Sbjct: 181 RLFVIQKDNTETSACMISPPQVNFLVNGRGVNGRINTHMDTGPQLPTNITHMLKLGSNLL 240

Query: 241 QAIGSFNGHYVIAVAIMGTAPSPDSSVLQDHIQPVVSTVDSDSDIIEGPSRISLNCPISY 300
           QAIGSFNGHYV+AVAI GTAPSP+SSVLQDH+QPVVST+DSDSDIIEGPSRISLNCPISY
Sbjct: 241 QAIGSFNGHYVLAVAITGTAPSPNSSVLQDHVQPVVSTLDSDSDIIEGPSRISLNCPISY 300

Query: 301 TRIKVPVKGRSCKHLQCFDFHNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLKASRVI 360
           TRIK+PVKG SCKHLQCFDF NFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLK   VI
Sbjct: 301 TRIKIPVKGCSCKHLQCFDFDNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLK---VI 360

Query: 361 REVAENVTEVIISADGSWKAILENDHGDGRPLDDSLNHQKERAQEESTAPPDVLDLTEVD 420
           REVAENVTEVIISADGSWKAILEND+GDGR LDDSLNHQ ERAQEES+AP DVLDLTEV 
Sbjct: 361 REVAENVTEVIISADGSWKAILENDNGDGRSLDDSLNHQSERAQEESSAPSDVLDLTEVG 420

Query: 421 DDMDICNLETEDRKTCLDNKNQPVSSSLDISSGMNMNSLNQNLAAVLDDDFWSGIVTDGI 480
           DDMDI + E EDRK CL NKNQPVSSSLD+SSGMNMNS +QNL+AV++DDFWS +  DG+
Sbjct: 421 DDMDIFDSEIEDRKPCLGNKNQPVSSSLDMSSGMNMNSFSQNLSAVVEDDFWSRL--DGV 480

Query: 481 LTSSAGSDAPMGNSTPPPGFAGIMQSTVFTDVVPPVLNHGAGVPGHANFSSSAFYDQNNL 540
           L SSAG DAPM NST PPGF  IMQS V TDVV PVLNHG GV GHANFSS AFY+QNN+
Sbjct: 481 LISSAGLDAPMVNSTYPPGFTNIMQSAVLTDVVQPVLNHGVGVLGHANFSSPAFYNQNNM 540

Query: 541 QIQVPNSNENNQYGRMPLIARPVSRTPIAVQALPAQSQVAGQQYSSRTPIISSAPQVGQS 600
            IQV NSNENNQYGR+  I+ P SRTPIAVQALPAQS  AGQQYSSRTPIISS PQVGQS
Sbjct: 541 HIQVSNSNENNQYGRVTSISIPASRTPIAVQALPAQSHAAGQQYSSRTPIISS-PQVGQS 600

Query: 601 IPINRDGLNTLSRDLERRQQFSRHHGDSHHATNLAPFHHPQTVQNRDSQDRSFTPGQSVQ 660
           IPINRDGLN LSRDLERRQQFSRHHGDSHHATNLA FHHPQTVQNRD QDRSFT GQS+Q
Sbjct: 601 IPINRDGLNALSRDLERRQQFSRHHGDSHHATNLASFHHPQTVQNRDPQDRSFTTGQSIQ 660

Query: 661 ASTALRPSTGLLTDFQNPHLQQALNMRISHLRNQNSSSVRPSLPFSRPMSQAGGGY---A 720
            S   RPSTGLLTDFQNPHLQQAL+ R+ HL+NQNSSSVRPSL FSRPMSQ GGGY    
Sbjct: 661 TSNGARPSTGLLTDFQNPHLQQALS-RMPHLQNQNSSSVRPSLSFSRPMSQVGGGYGGST 720

Query: 721 YTAVTPNSQHARMVAASQRAEMMRQSSAMSLQNQTSRSAHSLQTTPDGLRRPAGEMRNV- 780
           YT VTPNSQH+RM+AASQRAEMMRQS +MSLQNQTSRSAHSLQTTPDGLRRP+G++RNV 
Sbjct: 721 YTTVTPNSQHSRMMAASQRAEMMRQSPSMSLQNQTSRSAHSLQTTPDGLRRPSGDLRNVG 780

Query: 781 GVSPSVTMAPGSVDLSVEQNWQPVGRMRGSLSGRAYSDAYGVIIQPTQAAQSARLPSHLS 840
           GVS SVT+A GSVDLS EQNWQP GRMRGSLSGR YSDAYGVIIQPTQAAQSAR PS+L+
Sbjct: 781 GVSQSVTIAAGSVDLSAEQNWQPAGRMRGSLSGRVYSDAYGVIIQPTQAAQSARPPSNLT 840

Query: 841 PTQPSAPSTQAQ 849
           PTQP APSTQAQ
Sbjct: 841 PTQPIAPSTQAQ 845

BLAST of Clc01G02440 vs. NCBI nr
Match: KAG6579533.1 (E4 SUMO-protein ligase PIAL2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1352.8 bits (3500), Expect = 0.0e+00
Identity = 694/853 (81.36%), Positives = 749/853 (87.81%), Query Frame = 0

Query: 1   MGASSPHEMNLNRIILYIDGLNLLLNRVGQMDPSQLCNLCFSIARSIDYTIANNIVPSKA 60
           MGA++P+EM L+RI  YID L L +NRV Q+DP QLCN+CFS+ARSID+ IAN+ VPSKA
Sbjct: 1   MGATTPYEMKLDRISSYIDSLTLYVNRVDQIDPVQLCNICFSLARSIDFAIANDFVPSKA 60

Query: 61  HSLPSFIKQLCQMKHSHRLKAALMVLMITIKNACKAKWFSEKDAEELYSLANEIGNDFFG 120
             LPS +KQ+CQ KHSH LKAA+MVLMI  KNACK KWFSEK+AEELYSLANEIG+DFF 
Sbjct: 61  QGLPSLLKQICQKKHSHHLKAAIMVLMIAAKNACKVKWFSEKEAEELYSLANEIGSDFFV 120

Query: 121 DTNIGQSNPLATISTVMERFFPRLKLGQIVASVEVKPGYGVYAIDFNILKTIQYAPQEKL 180
           DTN G SN L TI+TVMERFFPRLKLGQIV S EVKPGYGV+A DFNI KTIQYAPQEK+
Sbjct: 121 DTNTGPSNSLTTITTVMERFFPRLKLGQIVISAEVKPGYGVFAFDFNISKTIQYAPQEKI 180

Query: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNVYVDTGPQLPTNITHMLKLGSNLL 240
           RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTN+Y+DTGPQLPTN+THMLKLGSNLL
Sbjct: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNIYMDTGPQLPTNVTHMLKLGSNLL 240

Query: 241 QAIGSFNGHYVIAVAIMGTAPSPDSSVLQDHIQPVVSTVDSDSDIIEGPSRISLNCPISY 300
           Q IGSFNGHYVIAVA+MG+APSPDSSVLQDH QPVVSTVDSDSDIIEGPSRISLNCPISY
Sbjct: 241 QVIGSFNGHYVIAVAVMGSAPSPDSSVLQDHEQPVVSTVDSDSDIIEGPSRISLNCPISY 300

Query: 301 TRIKVPVKGRSCKHLQCFDFHNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLKASRVI 360
           TRIKVPVKGRSCKHLQCFDF+NFIDINSRRPSWRCPHCNQYICFLDI +DQNMLK   VI
Sbjct: 301 TRIKVPVKGRSCKHLQCFDFYNFIDINSRRPSWRCPHCNQYICFLDICIDQNMLK---VI 360

Query: 361 REVAENVTEVIISADGSWKAILENDHGDGRPLDDSLNHQKERAQEESTAPPDVLDLTEVD 420
           REVAENVTEVIISADGSWKAILEND GDGRPLDDSLN Q ERAQ+ESTAPPDVLDLTEVD
Sbjct: 361 REVAENVTEVIISADGSWKAILENDCGDGRPLDDSLNQQNERAQQESTAPPDVLDLTEVD 420

Query: 421 DDMDICNLETEDRKTCLDNKNQPVSSSLDISSGMNMNSLNQNLAAVLDDDFWSGIVTDGI 480
           DDM+ICNLETEDRK CL NKNQPVSSSL+I SGMN NSLNQN +A LDDDFWSG+VTD +
Sbjct: 421 DDMNICNLETEDRKPCLGNKNQPVSSSLNILSGMNRNSLNQNFSAALDDDFWSGMVTDRL 480

Query: 481 LTSSAGSDAPMGNSTPPPGFAGIMQSTVFTDVVPPVLNHGAGVPGHANFSSSAFYDQNNL 540
           LTSS  SDAPMG+ST  P FAG+ QS   TD V PVLNH  GVPG  NF   AFYDQNN+
Sbjct: 481 LTSSIRSDAPMGSSTAAPSFAGLTQSAGLTDAVSPVLNHDVGVPGQVNFPFPAFYDQNNV 540

Query: 541 QIQVPNSNENNQYGRMPLIARPVSRTPIAVQALPAQSQVAGQQYSSRTPIISSAPQVGQS 600
           Q+QV NSNE+NQYGRM  IARPVSRT +A Q LPAQSQ +GQQYSSRT  ISSAPQVGQS
Sbjct: 541 QVQVSNSNESNQYGRMTSIARPVSRT-LAGQVLPAQSQTSGQQYSSRTSTISSAPQVGQS 600

Query: 601 IPINRDGLNTLSRDLERRQQFSRHHGDSHHATNLAPFHHPQTVQNRDSQDRSFTPGQSVQ 660
           IPI+RDGLNT+SRD ERRQ F RHHGD HHATNLAPF  P  VQNR+ QDRSFTPGQSV+
Sbjct: 601 IPISRDGLNTISRDSERRQPFPRHHGDLHHATNLAPFLRPPIVQNREPQDRSFTPGQSVR 660

Query: 661 ASTALRPSTGLLTDFQNPHLQQALNMRISHLRNQNSSSVRPSLPFSRPMSQAGGGY---A 720
           ASTA RPS G+LTDFQNPHLQQALN+RISHLRNQN SSVRPSLPFSRP SQ GGGY   A
Sbjct: 661 ASTAQRPSAGILTDFQNPHLQQALNLRISHLRNQNPSSVRPSLPFSRPTSQVGGGYGGSA 720

Query: 721 YTAVTPNSQHARMVAASQRAEMMRQSSAMSLQNQTSRSAHSLQTTPDGLRRPAGEMRNV- 780
           Y AVTP++QHARM+ ASQRAEMMRQSSAMSLQNQTSRS H LQTTPDGLRRPAG++RNV 
Sbjct: 721 YPAVTPHNQHARMMVASQRAEMMRQSSAMSLQNQTSRSPHPLQTTPDGLRRPAGDLRNVG 780

Query: 781 GVSPSVTMAPGSVDLSVEQNWQPVGRMRGSLSGRAYSDAYGVIIQPTQAAQSARLPSHLS 840
           G++ SVTMA   +D SVEQN QP+GRMRGSLSGRAYSDAYGVIIQPTQ  QSAR PS+L+
Sbjct: 781 GMTQSVTMASDLLDPSVEQNRQPIGRMRGSLSGRAYSDAYGVIIQPTQPVQSARPPSNLT 840

Query: 841 PTQPSAPSTQAQR 850
            TQ SAPST AQR
Sbjct: 841 TTQSSAPSTHAQR 849

BLAST of Clc01G02440 vs. NCBI nr
Match: XP_022928990.1 (E4 SUMO-protein ligase PIAL2-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 1346.6 bits (3484), Expect = 0.0e+00
Identity = 690/853 (80.89%), Positives = 747/853 (87.57%), Query Frame = 0

Query: 1   MGASSPHEMNLNRIILYIDGLNLLLNRVGQMDPSQLCNLCFSIARSIDYTIANNIVPSKA 60
           MGA++P+EM L+RI  YID L L +NRV Q+DP QLCN+CFS+ARSID+ IAN+ VPSKA
Sbjct: 1   MGATTPYEMKLDRISSYIDSLTLYVNRVDQIDPVQLCNICFSLARSIDFAIANDFVPSKA 60

Query: 61  HSLPSFIKQLCQMKHSHRLKAALMVLMITIKNACKAKWFSEKDAEELYSLANEIGNDFFG 120
             LPS +KQ+CQ KHSH LKAA+MVLMI  KNACK KWFSEK+AEELYSLANEIG+DFF 
Sbjct: 61  QGLPSLLKQICQKKHSHHLKAAIMVLMIAAKNACKVKWFSEKEAEELYSLANEIGSDFFV 120

Query: 121 DTNIGQSNPLATISTVMERFFPRLKLGQIVASVEVKPGYGVYAIDFNILKTIQYAPQEKL 180
           DTN G SN L TI+TVMERFFPRLKLGQIV S EVKPGYGV+A DFNI KTIQYAPQEK+
Sbjct: 121 DTNTGPSNSLTTITTVMERFFPRLKLGQIVISAEVKPGYGVFAFDFNISKTIQYAPQEKI 180

Query: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNVYVDTGPQLPTNITHMLKLGSNLL 240
           RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTN+Y+DTGPQLPTN+THMLKLGSNLL
Sbjct: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNIYMDTGPQLPTNVTHMLKLGSNLL 240

Query: 241 QAIGSFNGHYVIAVAIMGTAPSPDSSVLQDHIQPVVSTVDSDSDIIEGPSRISLNCPISY 300
           Q IGSFNGHYVIAVA+MG+APSPDSSVLQDH QPVVSTVDSDSDIIEGPSRISLNCPISY
Sbjct: 241 QVIGSFNGHYVIAVAVMGSAPSPDSSVLQDHEQPVVSTVDSDSDIIEGPSRISLNCPISY 300

Query: 301 TRIKVPVKGRSCKHLQCFDFHNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLKASRVI 360
           TRIKVPVKGRSCKHLQCFDF+NFIDINSRRPSWRCPHCNQYICFLDI +DQNMLK   VI
Sbjct: 301 TRIKVPVKGRSCKHLQCFDFYNFIDINSRRPSWRCPHCNQYICFLDICIDQNMLK---VI 360

Query: 361 REVAENVTEVIISADGSWKAILENDHGDGRPLDDSLNHQKERAQEESTAPPDVLDLTEVD 420
           REVAENVTEVIISADGSWKAILEND GDGRPLDDSLN Q ERAQ+ESTAPPDVLDLTEVD
Sbjct: 361 REVAENVTEVIISADGSWKAILENDCGDGRPLDDSLNQQNERAQQESTAPPDVLDLTEVD 420

Query: 421 DDMDICNLETEDRKTCLDNKNQPVSSSLDISSGMNMNSLNQNLAAVLDDDFWSGIVTDGI 480
           DDM+ICNLETEDRK CL NKNQPVSSSL+I SGMN NSLNQN +A LDDDFWSG+VTD +
Sbjct: 421 DDMNICNLETEDRKPCLGNKNQPVSSSLNILSGMNRNSLNQNFSAALDDDFWSGMVTDRL 480

Query: 481 LTSSAGSDAPMGNSTPPPGFAGIMQSTVFTDVVPPVLNHGAGVPGHANFSSSAFYDQNNL 540
           L SS  SDAPMG+ST  P FAG+ QS   TD V PVLNH  GVPG  NF   AFYDQNN+
Sbjct: 481 LISSIRSDAPMGSSTAAPSFAGLTQSAGLTDAVSPVLNHDVGVPGQVNFPFPAFYDQNNV 540

Query: 541 QIQVPNSNENNQYGRMPLIARPVSRTPIAVQALPAQSQVAGQQYSSRTPIISSAPQVGQS 600
           Q+QV NSNE+NQYGRM  IARPVSRT +A Q LPAQSQ +GQQYSSRT  ISSAPQVGQS
Sbjct: 541 QVQVSNSNESNQYGRMTSIARPVSRT-LAGQVLPAQSQTSGQQYSSRTSTISSAPQVGQS 600

Query: 601 IPINRDGLNTLSRDLERRQQFSRHHGDSHHATNLAPFHHPQTVQNRDSQDRSFTPGQSVQ 660
           IPI+RDGLNT+SRD ERRQ F RHHGD HHATNLAPF  P  VQNR+ QDRSFTPGQSV+
Sbjct: 601 IPISRDGLNTISRDSERRQPFPRHHGDLHHATNLAPFLRPPIVQNREPQDRSFTPGQSVR 660

Query: 661 ASTALRPSTGLLTDFQNPHLQQALNMRISHLRNQNSSSVRPSLPFSRPMSQAGGGY---A 720
           ASTA RPS G+LTDFQNPHLQQ+LN+RISHLRNQN SSVRPSLPFSRP SQ GGGY   A
Sbjct: 661 ASTAQRPSAGILTDFQNPHLQQSLNLRISHLRNQNPSSVRPSLPFSRPTSQVGGGYGGSA 720

Query: 721 YTAVTPNSQHARMVAASQRAEMMRQSSAMSLQNQTSRSAHSLQTTPDGLRRPAGEMRNV- 780
           Y AVTP++QHARM+ ASQRAEMMRQSSAMSLQNQTSRS H LQTTPDGLRRPAG++RNV 
Sbjct: 721 YPAVTPHNQHARMMVASQRAEMMRQSSAMSLQNQTSRSPHPLQTTPDGLRRPAGDLRNVG 780

Query: 781 GVSPSVTMAPGSVDLSVEQNWQPVGRMRGSLSGRAYSDAYGVIIQPTQAAQSARLPSHLS 840
           G++ SVTMA   +D SVEQN QP+GRMRGSLSGRAYSDAYGVIIQPTQ  QS R PS+L+
Sbjct: 781 GMTQSVTMASDLLDPSVEQNRQPIGRMRGSLSGRAYSDAYGVIIQPTQPVQSTRPPSNLT 840

Query: 841 PTQPSAPSTQAQR 850
            TQ +APST AQR
Sbjct: 841 TTQSNAPSTHAQR 849

BLAST of Clc01G02440 vs. NCBI nr
Match: XP_022969988.1 (E4 SUMO-protein ligase PIAL2-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 1341.6 bits (3471), Expect = 0.0e+00
Identity = 687/853 (80.54%), Positives = 746/853 (87.46%), Query Frame = 0

Query: 1   MGASSPHEMNLNRIILYIDGLNLLLNRVGQMDPSQLCNLCFSIARSIDYTIANNIVPSKA 60
           MGA++P+EM L+RI  YID L L +NRV Q+DP QLCN+CFS+ARSID+ IAN+ VPSKA
Sbjct: 1   MGATTPYEMKLDRISSYIDSLTLYVNRVDQIDPVQLCNICFSLARSIDFAIANDFVPSKA 60

Query: 61  HSLPSFIKQLCQMKHSHRLKAALMVLMITIKNACKAKWFSEKDAEELYSLANEIGNDFFG 120
             LPS +KQ+CQ KHSH LKAA+MVLMI  KNACK KWFSEK+AEELYSLANEIG+DFF 
Sbjct: 61  QGLPSLLKQICQKKHSHHLKAAIMVLMIAAKNACKVKWFSEKEAEELYSLANEIGSDFFV 120

Query: 121 DTNIGQSNPLATISTVMERFFPRLKLGQIVASVEVKPGYGVYAIDFNILKTIQYAPQEKL 180
           DTN G SN L TI+ VMERFFPRLKLGQIV S EVKPGYGV+A DFNI KTIQYAPQEK+
Sbjct: 121 DTNTGPSNSLTTITKVMERFFPRLKLGQIVISAEVKPGYGVFAFDFNISKTIQYAPQEKI 180

Query: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNVYVDTGPQLPTNITHMLKLGSNLL 240
           RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTN+Y+DTGPQLPTN+THMLKLGSNLL
Sbjct: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNIYMDTGPQLPTNVTHMLKLGSNLL 240

Query: 241 QAIGSFNGHYVIAVAIMGTAPSPDSSVLQDHIQPVVSTVDSDSDIIEGPSRISLNCPISY 300
           Q IGSFNGHYVI+VA+MG+APSPDSSVLQDH QP VSTVDSDSDIIEGPSRISLNCPISY
Sbjct: 241 QVIGSFNGHYVISVAVMGSAPSPDSSVLQDHEQPAVSTVDSDSDIIEGPSRISLNCPISY 300

Query: 301 TRIKVPVKGRSCKHLQCFDFHNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLKASRVI 360
           TRIKVPVKGRSCKHLQCFDF+NFIDINSRRPSWRCPHCNQYICFLDI +DQNMLK   VI
Sbjct: 301 TRIKVPVKGRSCKHLQCFDFYNFIDINSRRPSWRCPHCNQYICFLDICIDQNMLK---VI 360

Query: 361 REVAENVTEVIISADGSWKAILENDHGDGRPLDDSLNHQKERAQEESTAPPDVLDLTEVD 420
           REVAENVTEVIISADGSWKAILEND GDGRPLDDSLN Q ERAQ+ESTAPPDVLDLTEVD
Sbjct: 361 REVAENVTEVIISADGSWKAILENDCGDGRPLDDSLNQQNERAQQESTAPPDVLDLTEVD 420

Query: 421 DDMDICNLETEDRKTCLDNKNQPVSSSLDISSGMNMNSLNQNLAAVLDDDFWSGIVTDGI 480
           DDM+ICNLETEDRK CL NKNQPVSSSL+I SGMN NSLNQN +A LDDDFWS +VTD +
Sbjct: 421 DDMNICNLETEDRKPCLGNKNQPVSSSLNILSGMNRNSLNQNFSAALDDDFWSRMVTDRL 480

Query: 481 LTSSAGSDAPMGNSTPPPGFAGIMQSTVFTDVVPPVLNHGAGVPGHANFSSSAFYDQNNL 540
           LTSS  SDAPMG+ST  P FAG+ QS   TD V PVLNH  GVPG  NF   +FYDQNN+
Sbjct: 481 LTSSIRSDAPMGSSTAAPSFAGLTQSAGLTDAVSPVLNHDVGVPGQVNFPFPSFYDQNNV 540

Query: 541 QIQVPNSNENNQYGRMPLIARPVSRTPIAVQALPAQSQVAGQQYSSRTPIISSAPQVGQS 600
           Q+QV NSNE+NQYGRM  IARPVSRT +A Q LPAQSQ +GQQYSSRT  +SSAPQVGQS
Sbjct: 541 QVQVSNSNESNQYGRMTSIARPVSRT-LAGQVLPAQSQTSGQQYSSRTSTVSSAPQVGQS 600

Query: 601 IPINRDGLNTLSRDLERRQQFSRHHGDSHHATNLAPFHHPQTVQNRDSQDRSFTPGQSVQ 660
           IPI+RDGLNT+SRD E RQ F RHHGD HHATNLAPF  P  VQNR+ QDRSFTPGQSV+
Sbjct: 601 IPISRDGLNTISRDSEMRQPFPRHHGDLHHATNLAPFLRPPIVQNREPQDRSFTPGQSVR 660

Query: 661 ASTALRPSTGLLTDFQNPHLQQALNMRISHLRNQNSSSVRPSLPFSRPMSQAGGGY---A 720
           ASTA RPS G+LTDFQNPHLQQALN+RISHL+NQN SSVRPSLPFSRP SQ GGGY   A
Sbjct: 661 ASTAQRPSVGILTDFQNPHLQQALNLRISHLQNQNPSSVRPSLPFSRPTSQVGGGYGGSA 720

Query: 721 YTAVTPNSQHARMVAASQRAEMMRQSSAMSLQNQTSRSAHSLQTTPDGLRRPAGEMRNV- 780
           YTAVTP++QHARM+ ASQRAEMMRQSSAMSLQNQTSRS H LQTTPDGLRRPAGE+RNV 
Sbjct: 721 YTAVTPHNQHARMMVASQRAEMMRQSSAMSLQNQTSRSPHPLQTTPDGLRRPAGELRNVG 780

Query: 781 GVSPSVTMAPGSVDLSVEQNWQPVGRMRGSLSGRAYSDAYGVIIQPTQAAQSARLPSHLS 840
           G++ SVTMA   +D SVEQN QP+GRMRGSLSGRAYSDA+GVIIQPTQ  QSAR PS+L+
Sbjct: 781 GMTQSVTMASNLLDPSVEQNRQPIGRMRGSLSGRAYSDAFGVIIQPTQPVQSARPPSNLT 840

Query: 841 PTQPSAPSTQAQR 850
            TQ SAPST AQR
Sbjct: 841 TTQSSAPSTHAQR 849

BLAST of Clc01G02440 vs. ExPASy Swiss-Prot
Match: P22503 (Endoglucanase OS=Phaseolus vulgaris OX=3885 PE=2 SV=2)

HSP 1 Score: 606.3 bits (1562), Expect = 8.1e-172
Identity = 285/392 (72.70%), Positives = 338/392 (86.22%), Query Frame = 0

Query: 903  YSDIMLSWTPIEYEKEIASVMQLEHLRSSVRWGADFILRAHVSPTTLYTQVGDGNGDHQC 962
            +S  +LSW  +EYE EI+SV QL +L+S++RWGADF+LRAH SPTTLYTQVGDGN DH C
Sbjct: 103  FSTSLLSWAAVEYESEISSVNQLGYLQSAIRWGADFMLRAHTSPTTLYTQVGDGNADHNC 162

Query: 963  WERPQDMDTPRTLYKITPNSPGTEAAAEAAAALAAASIVFHHVDANYSRSLLQHSKSLFQ 1022
            WERP+DMDTPRT+YKI  NSPGTE AAE AAAL+AASIVF  +DA YS +LL HSKSLF 
Sbjct: 163  WERPEDMDTPRTVYKIDANSPGTEVAAEYAAALSAASIVFKKIDAKYSSTLLSHSKSLFD 222

Query: 1023 FADQFRGSYSASCPFYCSFSGYQDELLWAAAWVYKASGNSKYLSYILSNQGWSQPTSQFS 1082
            FAD+ RGSYS SCPFYCS+SGYQDELLWAAAW+YKASG SKYLSYI+SNQGWSQ  S+FS
Sbjct: 223  FADKNRGSYSGSCPFYCSYSGYQDELLWAAAWLYKASGESKYLSYIISNQGWSQTVSEFS 282

Query: 1083 WDNKFVGAQTLLAKELYKGKKNLSKFKIDAESFICMVMPGGSCSKIPTTPGGLLFLRDNS 1142
            WDNKFVGAQTLL +E Y GKK+L+K K DAESFIC VMPG +  +I TTPGGLLF RD+S
Sbjct: 283  WDNKFVGAQTLLTEEFYGGKKDLAKIKTDAESFICAVMPGSNSRQIKTTPGGLLFTRDSS 342

Query: 1143 NLQYASSSSMVLFMYSRLLNKARVDGVHCGSKYFSSSQIKTFAKSQVDYILGKNPMKWSY 1202
            NLQY +SS+MVLF++SR+LN+  ++G++CGS +F++SQI+ FAK+QV+YILGKNPMK SY
Sbjct: 343  NLQYTTSSTMVLFIFSRILNRNHINGINCGSSHFTASQIRGFAKTQVEYILGKNPMKMSY 402

Query: 1203 MVGFGNKYPLQLHHRASSIPSIKVHSTKVGCNDGYSHYFYSNNPNPNVHIGAIVGGPNSN 1262
            MVGFG+KYP QLHHR SSIPSIKVH  KVGCN G S Y+ S NPNPN H+GAIVGGP+SN
Sbjct: 403  MVGFGSKYPKQLHHRGSSIPSIKVHPAKVGCNAGLSDYYNSANPNPNTHVGAIVGGPDSN 462

Query: 1263 DQFSDLRSDHSHSEPTTYMNAAFVGSVAALVA 1295
            D+F+D RSD+SH+EPTTY+NAAFV S++AL+A
Sbjct: 463  DRFNDARSDYSHAEPTTYINAAFVASISALLA 494

BLAST of Clc01G02440 vs. ExPASy Swiss-Prot
Match: Q9SUS0 (Endoglucanase 20 OS=Arabidopsis thaliana OX=3702 GN=At4g23560 PE=2 SV=1)

HSP 1 Score: 553.1 bits (1424), Expect = 8.2e-156
Identity = 258/392 (65.82%), Positives = 319/392 (81.38%), Query Frame = 0

Query: 903  YSDIMLSWTPIEYEKEIASVMQLEHLRSSVRWGADFILRAHVSPTTLYTQVGDGNGDHQC 962
            ++  +LSW  IEY+ EI+SV QL +LRS+++WG DFILRAH SP  LYTQVGDGN DH C
Sbjct: 86   FTTTLLSWAAIEYQNEISSVNQLGYLRSTIKWGTDFILRAHTSPNMLYTQVGDGNSDHSC 145

Query: 963  WERPQDMDTPRTLYKITPNSPGTEAAAEAAAALAAASIVFHHVDANYSRSLLQHSKSLFQ 1022
            WERP+DMDT RTLY I+ +SPG+EAA EAAAALAAAS+VF  VD+ YS +LL H+K+LF+
Sbjct: 146  WERPEDMDTSRTLYSISSSSPGSEAAGEAAAALAAASLVFKSVDSTYSSTLLNHAKTLFE 205

Query: 1023 FADQFRGSYSASCPFYCSFSGYQDELLWAAAWVYKASGNSKYLSYILSNQGWSQPTSQFS 1082
            FAD++RGSY ASCPFYCS+SGYQDELLWAAAW+YKA+G+  Y++Y++SN+ WSQ  ++FS
Sbjct: 206  FADKYRGSYQASCPFYCSYSGYQDELLWAAAWLYKATGDKIYINYVISNKDWSQAVNEFS 265

Query: 1083 WDNKFVGAQTLLAKELYKGKKNLSKFKIDAESFICMVMPGGSCSKIPTTPGGLLFLRDNS 1142
            WDNKFVGAQ LL  E Y G  +L+KFK D ESF+C +MPG S  +I  TPGGLLF+RD+S
Sbjct: 266  WDNKFVGAQALLVSEFYNGANDLAKFKSDVESFVCAMMPGSSSQQIKPTPGGLLFIRDSS 325

Query: 1143 NLQYASSSSMVLFMYSRLLNKARVDGVHCGSKYFSSSQIKTFAKSQVDYILGKNPMKWSY 1202
            NLQY ++++ VLF YS+ L KA V  + CGS  F+ SQI+ FAKSQVDYILG NPMK SY
Sbjct: 326  NLQYVTTATTVLFHYSKTLTKAGVGSIQCGSTKFTVSQIRNFAKSQVDYILGNNPMKMSY 385

Query: 1203 MVGFGNKYPLQLHHRASSIPSIKVHSTKVGCNDGYSHYFYSNNPNPNVHIGAIVGGPNSN 1262
            MVGFG KYP Q HHR SS+PSI+    K+ CN GYS Y+ S+ PNPNVHIGAIVGGPNS+
Sbjct: 386  MVGFGTKYPTQPHHRGSSLPSIQSKPEKIDCNGGYS-YYNSDTPNPNVHIGAIVGGPNSS 445

Query: 1263 DQFSDLRSDHSHSEPTTYMNAAFVGSVAALVA 1295
            DQ+SD +SD+SH+EPTTY+NAAF+G VAAL++
Sbjct: 446  DQYSDKKSDYSHAEPTTYINAAFIGPVAALIS 476

BLAST of Clc01G02440 vs. ExPASy Swiss-Prot
Match: Q9SZ90 (Endoglucanase 18 OS=Arabidopsis thaliana OX=3702 GN=At4g09740 PE=3 SV=2)

HSP 1 Score: 537.3 bits (1383), Expect = 4.6e-151
Identity = 249/392 (63.52%), Positives = 312/392 (79.59%), Query Frame = 0

Query: 903  YSDIMLSWTPIEYEKEIASVMQLEHLRSSVRWGADFILRAHVSPTTLYTQVGDGNGDHQC 962
            ++  +LSW  +EY+ EI  V QL +LRS+++WG +FILRAH S   LYTQVGDGN DH C
Sbjct: 86   FTTTLLSWAALEYQNEITFVNQLGYLRSTIKWGTNFILRAHTSTNMLYTQVGDGNSDHSC 145

Query: 963  WERPQDMDTPRTLYKITPNSPGTEAAAEAAAALAAASIVFHHVDANYSRSLLQHSKSLFQ 1022
            WERP+DMDTPRTLY I+ +SPG+EAA EAAAALAAAS+VF  VD+ YS  LL ++KSLF+
Sbjct: 146  WERPEDMDTPRTLYSISSSSPGSEAAGEAAAALAAASLVFKLVDSTYSSKLLNNAKSLFE 205

Query: 1023 FADQFRGSYSASCPFYCSFSGYQDELLWAAAWVYKASGNSKYLSYILSNQGWSQPTSQFS 1082
            FAD++RGSY ASCPFYCS SGYQDELLWAAAW+YKA+G   YL+Y++SN+ WS+  ++FS
Sbjct: 206  FADKYRGSYQASCPFYCSHSGYQDELLWAAAWLYKATGEKSYLNYVISNKDWSKAINEFS 265

Query: 1083 WDNKFVGAQTLLAKELYKGKKNLSKFKIDAESFICMVMPGGSCSKIPTTPGGLLFLRDNS 1142
            WDNKF G Q LLA E Y G  +L KFK D ESF+C +MPG S  +I  TPGG+LF+RD+S
Sbjct: 266  WDNKFAGVQALLASEFYNGANDLEKFKTDVESFVCALMPGSSSQQIKPTPGGILFIRDSS 325

Query: 1143 NLQYASSSSMVLFMYSRLLNKARVDGVHCGSKYFSSSQIKTFAKSQVDYILGKNPMKWSY 1202
            NLQY ++++ +LF YS+ L KA V  + CGS  F+ SQI+ FAKSQVDYILG NP+K SY
Sbjct: 326  NLQYVTTATTILFYYSKTLTKAGVGSIQCGSTQFTVSQIRNFAKSQVDYILGNNPLKMSY 385

Query: 1203 MVGFGNKYPLQLHHRASSIPSIKVHSTKVGCNDGYSHYFYSNNPNPNVHIGAIVGGPNSN 1262
            MVGFG KYP Q HHR SS+PSI+    K+ CN G+S+Y + + PNPNVH GAIVGGPNS+
Sbjct: 386  MVGFGTKYPTQPHHRGSSLPSIQSKPEKIDCNGGFSYYNF-DTPNPNVHTGAIVGGPNSS 445

Query: 1263 DQFSDLRSDHSHSEPTTYMNAAFVGSVAALVA 1295
            DQ+SD R+D+SH+EPTTY+NAAF+GSVAAL++
Sbjct: 446  DQYSDKRTDYSHAEPTTYINAAFIGSVAALIS 476

BLAST of Clc01G02440 vs. ExPASy Swiss-Prot
Match: Q6ZA06 (Endoglucanase 20 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU15 PE=2 SV=1)

HSP 1 Score: 484.2 bits (1245), Expect = 4.7e-135
Identity = 227/386 (58.81%), Positives = 301/386 (77.98%), Query Frame = 0

Query: 907  MLSWTPIEYEKEIASVMQLEHLRSSVRWGADFILRAHVSPTTLYTQVGDGNGDHQCWERP 966
            +L W+ +EY   +A+  +L +LR+++RWGADF+LRAH SPTTLYTQVGDGN DHQCWERP
Sbjct: 107  LLGWSAVEYGAAVAAAGELGNLRAAIRWGADFLLRAHASPTTLYTQVGDGNADHQCWERP 166

Query: 967  QDMDTPRTLYKITPNSPGTEAAAEAAAALAAASIVF-HHVDANYSRSLLQHSKSLFQFAD 1026
            +DMDTPRTLYKIT +SPG+EAAAEA+AALAAA +      D  +S  LL  S+SLF FA+
Sbjct: 167  EDMDTPRTLYKITADSPGSEAAAEASAALAAAYVALKDDGDTAFSSRLLAASRSLFDFAN 226

Query: 1027 QFRGSYSASCPFYCSFSGYQDELLWAAAWVYKASGNSKYLSYILSNQGWSQPTSQFSWDN 1086
             +RGS+ +SCPFYCS+SG+QDELLWA+AW++KA+ ++KYL ++ +NQG S P ++FSWDN
Sbjct: 227  NYRGSFQSSCPFYCSYSGFQDELLWASAWLFKATRDAKYLDFLTNNQGSSNPVNEFSWDN 286

Query: 1087 KFVGAQTLLAKELYKGKKNLSKFKIDAESFICMVMPGGSCSKIPTTPGGLLFLRDNSNLQ 1146
            K+ GAQ L A+E   G+  L+++K + +SF+C +MP     +I TTPGGLLF RD+ NLQ
Sbjct: 287  KYAGAQMLAAQEYLGGRTQLARYKDNLDSFVCALMPNSGNVQIRTTPGGLLFTRDSVNLQ 346

Query: 1147 YASSSSMVLFMYSRLLNKARVDGVHCGSKYFSSSQIKTFAKSQVDYILGKNPMKWSYMVG 1206
            Y +++++VL +YS++L  +   GV C +  FS +QI +FA SQVDYILGKNP+  SYMVG
Sbjct: 347  YTTTATLVLSIYSKVLKSSGSRGVRCSAATFSPNQISSFATSQVDYILGKNPLGMSYMVG 406

Query: 1207 FGNKYPLQLHHRASSIPSIKVHSTKVGCNDGYSHYFYSNNPNPNVHIGAIVGGPNSNDQF 1266
            F  K+P ++HHR SSIPSIKV S KV C +G+S +  +++PNPN+H+GAIVGGP+ NDQF
Sbjct: 407  FSTKFPRRIHHRGSSIPSIKVLSRKVTCKEGFSSWLPTSDPNPNIHVGAIVGGPDGNDQF 466

Query: 1267 SDLRSDHSHSEPTTYMNAAFVGSVAA 1292
            SD R D SHSEP TY+NAAFVG+ AA
Sbjct: 467  SDNRGDSSHSEPATYINAAFVGACAA 492

BLAST of Clc01G02440 vs. ExPASy Swiss-Prot
Match: Q9SRX3 (Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1)

HSP 1 Score: 407.1 bits (1045), Expect = 7.2e-112
Identity = 206/399 (51.63%), Positives = 281/399 (70.43%), Query Frame = 0

Query: 903  YSDIMLSWTPIEYEKEIASVMQLEHLRSSVRWGADFILRAHVSPTTLYTQVGDGNGDHQC 962
            ++  MLSW+ IE+   + S  +L + + ++RW  DF+L+A   P T+Y QVGD N DH C
Sbjct: 106  FTTTMLSWSLIEFGGLMKS--ELPNAKDAIRWATDFLLKATSHPDTIYVQVGDPNMDHAC 165

Query: 963  WERPQDMDTPRTLYKITPNSPGTEAAAEAAAALAAASIVFHHVDANYSRSLLQHSKSLFQ 1022
            WERP+DMDTPR+++K+  N+PG++ A E AAALAAASIVF   D +YS  LLQ + ++F 
Sbjct: 166  WERPEDMDTPRSVFKVDKNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITVFT 225

Query: 1023 FADQFRGSYSAS-----CPFYCSFSGYQDELLWAAAWVYKASGNSKYLSYILSN---QGW 1082
            FAD++RG YSA      CPFYCS+SGYQDELLW AAW+ KA+ N  YL+YI +N    G 
Sbjct: 226  FADKYRGPYSAGLAPEVCPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQILGA 285

Query: 1083 SQPTSQFSWDNKFVGAQTLLAKE-LYKGKKNLSKFKIDAESFICMVMPGGSCSKIPTTPG 1142
             +  + FSWDNK VGA+ LL+KE L +  K+L ++K  A+SFIC V+PG S S+   TPG
Sbjct: 286  DEFDNMFSWDNKHVGARILLSKEFLIQKVKSLEEYKEHADSFICSVLPGASSSQY--TPG 345

Query: 1143 GLLFLRDNSNLQYASSSSMVLFMYSRLLNKARVDGVHCGSKYFSSSQIKTFAKSQVDYIL 1202
            GLLF    SN+QY +S+S +L  Y++ L  AR    +CG    + +++++ AK QVDY+L
Sbjct: 346  GLLFKMGESNMQYVTSTSFLLLTYAKYLTSART-VAYCGGSVVTPARLRSIAKKQVDYLL 405

Query: 1203 GKNPMKWSYMVGFGNKYPLQLHHRASSIPSIKVHSTKVGCNDGYSHYFYSNNPNPNVHIG 1262
            G NP+K SYMVG+G KYP ++HHR SS+PS+ VH T++ C+DG+S  F S +PNPN  +G
Sbjct: 406  GGNPLKMSYMVGYGLKYPRRIHHRGSSLPSVAVHPTRIQCHDGFS-LFTSQSPNPNDLVG 465

Query: 1263 AIVGGPNSNDQFSDLRSDHSHSEPTTYMNAAFVGSVAAL 1293
            A+VGGP+ NDQF D RSD+  SEP TY+NA  VG++A L
Sbjct: 466  AVVGGPDQNDQFPDERSDYGRSEPATYINAPLVGALAYL 498

BLAST of Clc01G02440 vs. ExPASy TrEMBL
Match: A0A0A0KMH4 (SP-RING-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G152190 PE=4 SV=1)

HSP 1 Score: 1375.1 bits (3558), Expect = 0.0e+00
Identity = 701/851 (82.37%), Positives = 753/851 (88.48%), Query Frame = 0

Query: 1   MGASSPHEMNLNRIILYIDGLNLLLNRVGQMDPSQLCNLCFSIARSIDYTIANNIVPSKA 60
           MGASS H+ NL +II YIDGL LL+N V Q+D + LC+LCFSI+RSIDY IANN VPSKA
Sbjct: 77  MGASSQHDTNLKKIISYIDGLTLLINHVAQIDLANLCSLCFSISRSIDYAIANNAVPSKA 136

Query: 61  HSLPSFIKQLCQMKHSHRLKAALMVLMITIKNACKAKWFSEKDAEELYSLANEIGNDFFG 120
           HSLPS +KQLCQ+KHSHR KAALMVLM+TIKNACK +WFSEKDAEEL  LANEIGNDFFG
Sbjct: 137 HSLPSLVKQLCQLKHSHRSKAALMVLMLTIKNACKVRWFSEKDAEELQRLANEIGNDFFG 196

Query: 121 DTNIGQSNPLATISTVMERFFPRLKLGQIVASVEVKPGYGVYAIDFNILKTIQYAPQEKL 180
           DTNIGQ+N L TI+TVMER+FP LKLGQIVAS+EVKPGYGVYA+DFNI +T+QYA QEKL
Sbjct: 197 DTNIGQANSLTTITTVMERYFPCLKLGQIVASLEVKPGYGVYALDFNISRTVQYASQEKL 256

Query: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNVYVDTGPQLPTNITHMLKLGSNLL 240
           RLFV QKDNTETSACIISPPQVNFLVNGRG+NGR N ++DTGPQLPTNITHMLKLGSNLL
Sbjct: 257 RLFVIQKDNTETSACIISPPQVNFLVNGRGINGRINTHMDTGPQLPTNITHMLKLGSNLL 316

Query: 241 QAIGSFNGHYVIAVAIMGTAPSPDSSVLQDHIQPVVSTVDSDSDIIEGPSRISLNCPISY 300
           QA+GSFNGHYV+A+AI GTAPSPDSSVL DHIQP+VST+DSDSDIIEGPSRISLNCPISY
Sbjct: 317 QAVGSFNGHYVLAIAITGTAPSPDSSVLHDHIQPIVSTLDSDSDIIEGPSRISLNCPISY 376

Query: 301 TRIKVPVKGRSCKHLQCFDFHNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLKASRVI 360
           TRIK+PVKG SCKHLQCFDF NFIDINSRRPSWRCPHCNQYICFLDIRVD+NMLK   VI
Sbjct: 377 TRIKIPVKGCSCKHLQCFDFDNFIDINSRRPSWRCPHCNQYICFLDIRVDRNMLK---VI 436

Query: 361 REVAENVTEVIISADGSWKAILENDHGDGRPLDDSLNHQKERAQEESTAPPDVLDLTEVD 420
           REVAENVTEVIIS DGSWKAILEND+GDGR L+DSLNHQ ERAQEES A PDVLD TEV 
Sbjct: 437 REVAENVTEVIISVDGSWKAILENDNGDGRSLNDSLNHQNERAQEESAASPDVLDHTEVG 496

Query: 421 DDMDICNLETEDRKTCLDNKNQPVSSSLDISSGMNMNSLNQNLAAVLDDDFWSGIVTDGI 480
           DDMDI N E EDRK CL NKNQ VSSSLD+SSGMNMNS +QNL+AV+DDD WS I  DG+
Sbjct: 497 DDMDIFNSEIEDRKPCLGNKNQRVSSSLDMSSGMNMNSFSQNLSAVMDDDIWSRI--DGV 556

Query: 481 LTSSAGSDAPMGNSTPPPGFAGIMQSTVFTDVVPPVLNHGAGVPGHANFSSSAFYDQNNL 540
           L S+AG DAPM NST PPGF G MQS V TD V PVLNHG GV GHANF S AFY+QNN+
Sbjct: 557 LISTAGLDAPMVNSTYPPGFTGTMQSAVLTDAVQPVLNHGVGVSGHANFPSPAFYNQNNV 616

Query: 541 QIQVPNSNENNQYGRMPLIARPVSRTPIAVQALPAQSQVAGQQYSSRTPIISSAPQVGQS 600
           QIQV NSNENNQYGR+  I+RPVSRTP+AVQALPAQS  AGQQYSSRTPIISS PQVGQS
Sbjct: 617 QIQVSNSNENNQYGRVTSISRPVSRTPVAVQALPAQSHAAGQQYSSRTPIISS-PQVGQS 676

Query: 601 IPINRDGLNTLSRDLERRQQFSRHHGDSHHATNLAPFHHPQTVQNRDSQDRSFTPGQSVQ 660
           IPINRDGLN LSRDLERRQQFSRHHGDSHH+TNLA FHHPQTVQNRD QDRSFT GQS+Q
Sbjct: 677 IPINRDGLNALSRDLERRQQFSRHHGDSHHSTNLASFHHPQTVQNRDPQDRSFTTGQSIQ 736

Query: 661 ASTALRPSTGLLTDFQNPHLQQALNMRISHLRNQNSSSVRPSLPFSRPMSQAGGGY---A 720
            S+  RPS GLL DFQNPHLQQALNMR+ HL+NQNSSSVR SL FSRPMSQ GGGY    
Sbjct: 737 TSSGARPSPGLLADFQNPHLQQALNMRMPHLQNQNSSSVRTSLSFSRPMSQVGGGYGGST 796

Query: 721 YTAVTPNSQHARMVAASQRAEMMRQSSAMSLQNQTSRSAHSLQTTPDGLRRPAGEMRNVG 780
           YT VTPNSQHARM+AASQR EMMRQS  MSL NQTSRSAHSLQTTPDGLRRP+G++RNVG
Sbjct: 797 YTTVTPNSQHARMLAASQRVEMMRQSPPMSLHNQTSRSAHSLQTTPDGLRRPSGDLRNVG 856

Query: 781 VSPSVTMAPGSVDLSVEQNWQPVGRMRGSLSGRAYSDAYGVIIQPTQAAQSARLPSHLSP 840
           VS SVTMA GSVDLS EQNWQP GRMRGSLSGR YSDAYGVIIQPTQAAQSAR PS+L+P
Sbjct: 857 VSQSVTMAAGSVDLSAEQNWQPAGRMRGSLSGRVYSDAYGVIIQPTQAAQSARPPSNLTP 916

Query: 841 TQPSAPSTQAQ 849
           TQP APSTQAQ
Sbjct: 917 TQPIAPSTQAQ 921

BLAST of Clc01G02440 vs. ExPASy TrEMBL
Match: A0A1S3AUE1 (E4 SUMO-protein ligase PIAL2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103482791 PE=4 SV=1)

HSP 1 Score: 1370.9 bits (3547), Expect = 0.0e+00
Identity = 707/852 (82.98%), Positives = 758/852 (88.97%), Query Frame = 0

Query: 1   MGASSPHEMNLNRIILYIDGLNLLLNRVGQMDPSQLCNLCFSIARSIDYTIANNIVPSKA 60
           MGAS+  + NLN+II YIDGL LL+N V Q+D + LC LCFSI+RSIDY IANN VPSKA
Sbjct: 1   MGASAQQDTNLNKIIAYIDGLTLLINHVAQVDLAHLCTLCFSISRSIDYAIANNAVPSKA 60

Query: 61  HSLPSFIKQLCQMKHSHRLKAALMVLMITIKNACKAKWFSEKDAEELYSLANEIGNDFFG 120
           HSLP   KQLCQ+KHSHR KAALMVLM+TIKNACK +WFSEKDAEEL  LANEIGNDFFG
Sbjct: 61  HSLPILFKQLCQLKHSHRSKAALMVLMLTIKNACKVRWFSEKDAEELQRLANEIGNDFFG 120

Query: 121 DTNIGQSNPLATISTVMERFFPRLKLGQIVASVEVKPGYGVYAIDFNILKTIQYAPQEKL 180
           D NIGQ+N L TI+TVMER+FP LKLGQIVAS+EVKPGYGVYA+DFNI +TIQ APQEKL
Sbjct: 121 DMNIGQANSLTTITTVMERYFPCLKLGQIVASLEVKPGYGVYALDFNISRTIQCAPQEKL 180

Query: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNVYVDTGPQLPTNITHMLKLGSNLL 240
           RLFV QKDNTETSAC+ISPPQVNFLVNGRGVNGR N ++DTGPQLPTNITHMLKLGSNLL
Sbjct: 181 RLFVIQKDNTETSACMISPPQVNFLVNGRGVNGRINTHMDTGPQLPTNITHMLKLGSNLL 240

Query: 241 QAIGSFNGHYVIAVAIMGTAPSPDSSVLQDHIQPVVSTVDSDSDIIEGPSRISLNCPISY 300
           QAIGSFNGHYV+AVAI GTAPSP+SSVLQDH+QPVVST+DSDSDIIEGPSRISLNCPISY
Sbjct: 241 QAIGSFNGHYVLAVAITGTAPSPNSSVLQDHVQPVVSTLDSDSDIIEGPSRISLNCPISY 300

Query: 301 TRIKVPVKGRSCKHLQCFDFHNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLKASRVI 360
           TRIK+PVKG SCKHLQCFDF NFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLK   VI
Sbjct: 301 TRIKIPVKGCSCKHLQCFDFDNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLK---VI 360

Query: 361 REVAENVTEVIISADGSWKAILENDHGDGRPLDDSLNHQKERAQEESTAPPDVLDLTEVD 420
           REVAENVTEVIISADGSWKAILEND+GDGR LDDSLNHQ ERAQEES+AP DVLDLTEV 
Sbjct: 361 REVAENVTEVIISADGSWKAILENDNGDGRSLDDSLNHQSERAQEESSAPSDVLDLTEVG 420

Query: 421 DDMDICNLETEDRKTCLDNKNQPVSSSLDISSGMNMNSLNQNLAAVLDDDFWSGIVTDGI 480
           DDMDI + E EDRK CL NKNQPVSSSLD+SSGMNMNS +QNL+AV++DDFWS +  DG+
Sbjct: 421 DDMDIFDSEIEDRKPCLGNKNQPVSSSLDMSSGMNMNSFSQNLSAVVEDDFWSRL--DGV 480

Query: 481 LTSSAGSDAPMGNSTPPPGFAGIMQSTVFTDVVPPVLNHGAGVPGHANFSSSAFYDQNNL 540
           L SSAG DAPM NST PPGF  IMQS V TDVV PVLNHG GV GHANFSS AFY+QNN+
Sbjct: 481 LISSAGLDAPMVNSTYPPGFTNIMQSAVLTDVVQPVLNHGVGVLGHANFSSPAFYNQNNM 540

Query: 541 QIQVPNSNENNQYGRMPLIARPVSRTPIAVQALPAQSQVAGQQYSSRTPIISSAPQVGQS 600
            IQV NSNENNQYGR+  I+ P SRTPIAVQALPAQS  AGQQYSSRTPIISS PQVGQS
Sbjct: 541 HIQVSNSNENNQYGRVTSISIPASRTPIAVQALPAQSHAAGQQYSSRTPIISS-PQVGQS 600

Query: 601 IPINRDGLNTLSRDLERRQQFSRHHGDSHHATNLAPFHHPQTVQNRDSQDRSFTPGQSVQ 660
           IPINRDGLN LSRDLERRQQFSRHHGDSHHATNLA FHHPQTVQNRD QDRSFT GQS+Q
Sbjct: 601 IPINRDGLNALSRDLERRQQFSRHHGDSHHATNLASFHHPQTVQNRDPQDRSFTTGQSIQ 660

Query: 661 ASTALRPSTGLLTDFQNPHLQQALNMRISHLRNQNSSSVRPSLPFSRPMSQAGGGY---A 720
            S   RPSTGLLTDFQNPHLQQAL+ R+ HL+NQNSSSVRPSL FSRPMSQ GGGY    
Sbjct: 661 TSNGARPSTGLLTDFQNPHLQQALS-RMPHLQNQNSSSVRPSLSFSRPMSQVGGGYGGST 720

Query: 721 YTAVTPNSQHARMVAASQRAEMMRQSSAMSLQNQTSRSAHSLQTTPDGLRRPAGEMRNV- 780
           YT VTPNSQH+RM+AASQRAEMMRQS +MSLQNQTSRSAHSLQTTPDGLRRP+G++RNV 
Sbjct: 721 YTTVTPNSQHSRMMAASQRAEMMRQSPSMSLQNQTSRSAHSLQTTPDGLRRPSGDLRNVG 780

Query: 781 GVSPSVTMAPGSVDLSVEQNWQPVGRMRGSLSGRAYSDAYGVIIQPTQAAQSARLPSHLS 840
           GVS SVT+A GSVDLS EQNWQP GRMRGSLSGR YSDAYGVIIQPTQAAQSAR PS+L+
Sbjct: 781 GVSQSVTIAAGSVDLSAEQNWQPAGRMRGSLSGRVYSDAYGVIIQPTQAAQSARPPSNLT 840

Query: 841 PTQPSAPSTQAQ 849
           PTQP APSTQAQ
Sbjct: 841 PTQPIAPSTQAQ 845

BLAST of Clc01G02440 vs. ExPASy TrEMBL
Match: A0A6J1ESZ6 (E4 SUMO-protein ligase PIAL2-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111435722 PE=4 SV=1)

HSP 1 Score: 1346.6 bits (3484), Expect = 0.0e+00
Identity = 690/853 (80.89%), Positives = 747/853 (87.57%), Query Frame = 0

Query: 1   MGASSPHEMNLNRIILYIDGLNLLLNRVGQMDPSQLCNLCFSIARSIDYTIANNIVPSKA 60
           MGA++P+EM L+RI  YID L L +NRV Q+DP QLCN+CFS+ARSID+ IAN+ VPSKA
Sbjct: 1   MGATTPYEMKLDRISSYIDSLTLYVNRVDQIDPVQLCNICFSLARSIDFAIANDFVPSKA 60

Query: 61  HSLPSFIKQLCQMKHSHRLKAALMVLMITIKNACKAKWFSEKDAEELYSLANEIGNDFFG 120
             LPS +KQ+CQ KHSH LKAA+MVLMI  KNACK KWFSEK+AEELYSLANEIG+DFF 
Sbjct: 61  QGLPSLLKQICQKKHSHHLKAAIMVLMIAAKNACKVKWFSEKEAEELYSLANEIGSDFFV 120

Query: 121 DTNIGQSNPLATISTVMERFFPRLKLGQIVASVEVKPGYGVYAIDFNILKTIQYAPQEKL 180
           DTN G SN L TI+TVMERFFPRLKLGQIV S EVKPGYGV+A DFNI KTIQYAPQEK+
Sbjct: 121 DTNTGPSNSLTTITTVMERFFPRLKLGQIVISAEVKPGYGVFAFDFNISKTIQYAPQEKI 180

Query: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNVYVDTGPQLPTNITHMLKLGSNLL 240
           RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTN+Y+DTGPQLPTN+THMLKLGSNLL
Sbjct: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNIYMDTGPQLPTNVTHMLKLGSNLL 240

Query: 241 QAIGSFNGHYVIAVAIMGTAPSPDSSVLQDHIQPVVSTVDSDSDIIEGPSRISLNCPISY 300
           Q IGSFNGHYVIAVA+MG+APSPDSSVLQDH QPVVSTVDSDSDIIEGPSRISLNCPISY
Sbjct: 241 QVIGSFNGHYVIAVAVMGSAPSPDSSVLQDHEQPVVSTVDSDSDIIEGPSRISLNCPISY 300

Query: 301 TRIKVPVKGRSCKHLQCFDFHNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLKASRVI 360
           TRIKVPVKGRSCKHLQCFDF+NFIDINSRRPSWRCPHCNQYICFLDI +DQNMLK   VI
Sbjct: 301 TRIKVPVKGRSCKHLQCFDFYNFIDINSRRPSWRCPHCNQYICFLDICIDQNMLK---VI 360

Query: 361 REVAENVTEVIISADGSWKAILENDHGDGRPLDDSLNHQKERAQEESTAPPDVLDLTEVD 420
           REVAENVTEVIISADGSWKAILEND GDGRPLDDSLN Q ERAQ+ESTAPPDVLDLTEVD
Sbjct: 361 REVAENVTEVIISADGSWKAILENDCGDGRPLDDSLNQQNERAQQESTAPPDVLDLTEVD 420

Query: 421 DDMDICNLETEDRKTCLDNKNQPVSSSLDISSGMNMNSLNQNLAAVLDDDFWSGIVTDGI 480
           DDM+ICNLETEDRK CL NKNQPVSSSL+I SGMN NSLNQN +A LDDDFWSG+VTD +
Sbjct: 421 DDMNICNLETEDRKPCLGNKNQPVSSSLNILSGMNRNSLNQNFSAALDDDFWSGMVTDRL 480

Query: 481 LTSSAGSDAPMGNSTPPPGFAGIMQSTVFTDVVPPVLNHGAGVPGHANFSSSAFYDQNNL 540
           L SS  SDAPMG+ST  P FAG+ QS   TD V PVLNH  GVPG  NF   AFYDQNN+
Sbjct: 481 LISSIRSDAPMGSSTAAPSFAGLTQSAGLTDAVSPVLNHDVGVPGQVNFPFPAFYDQNNV 540

Query: 541 QIQVPNSNENNQYGRMPLIARPVSRTPIAVQALPAQSQVAGQQYSSRTPIISSAPQVGQS 600
           Q+QV NSNE+NQYGRM  IARPVSRT +A Q LPAQSQ +GQQYSSRT  ISSAPQVGQS
Sbjct: 541 QVQVSNSNESNQYGRMTSIARPVSRT-LAGQVLPAQSQTSGQQYSSRTSTISSAPQVGQS 600

Query: 601 IPINRDGLNTLSRDLERRQQFSRHHGDSHHATNLAPFHHPQTVQNRDSQDRSFTPGQSVQ 660
           IPI+RDGLNT+SRD ERRQ F RHHGD HHATNLAPF  P  VQNR+ QDRSFTPGQSV+
Sbjct: 601 IPISRDGLNTISRDSERRQPFPRHHGDLHHATNLAPFLRPPIVQNREPQDRSFTPGQSVR 660

Query: 661 ASTALRPSTGLLTDFQNPHLQQALNMRISHLRNQNSSSVRPSLPFSRPMSQAGGGY---A 720
           ASTA RPS G+LTDFQNPHLQQ+LN+RISHLRNQN SSVRPSLPFSRP SQ GGGY   A
Sbjct: 661 ASTAQRPSAGILTDFQNPHLQQSLNLRISHLRNQNPSSVRPSLPFSRPTSQVGGGYGGSA 720

Query: 721 YTAVTPNSQHARMVAASQRAEMMRQSSAMSLQNQTSRSAHSLQTTPDGLRRPAGEMRNV- 780
           Y AVTP++QHARM+ ASQRAEMMRQSSAMSLQNQTSRS H LQTTPDGLRRPAG++RNV 
Sbjct: 721 YPAVTPHNQHARMMVASQRAEMMRQSSAMSLQNQTSRSPHPLQTTPDGLRRPAGDLRNVG 780

Query: 781 GVSPSVTMAPGSVDLSVEQNWQPVGRMRGSLSGRAYSDAYGVIIQPTQAAQSARLPSHLS 840
           G++ SVTMA   +D SVEQN QP+GRMRGSLSGRAYSDAYGVIIQPTQ  QS R PS+L+
Sbjct: 781 GMTQSVTMASDLLDPSVEQNRQPIGRMRGSLSGRAYSDAYGVIIQPTQPVQSTRPPSNLT 840

Query: 841 PTQPSAPSTQAQR 850
            TQ +APST AQR
Sbjct: 841 TTQSNAPSTHAQR 849
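
The percentages in each HSP summary line can be recomputed from the raw counts, which is a quick sanity check when copying values into other tables. The following is a minimal sketch, not part of the database output: the helper name `parse_hsp_stats` and the returned dictionary keys are assumptions made for illustration, and the pattern accepts both the "Postives" spelling emitted by some BLAST report templates and the corrected "Positives".

```python
import re

# Matches the summary line printed under each "HSP n Score" header.
# "Posi?tives" tolerates the misspelt label "Postives" as well.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Percentages recomputed from the counts, rounded like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 690/853 (80.89%), Postives = 747/853 (87.57%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the Cucurbita moschata isoform X2 hit above, the recomputed values agree with the reported 80.89% identity and 87.57% positives over an aligned length of 853 columns (gaps included).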

BLAST of Clc01G02440 vs. ExPASy TrEMBL
Match: A0A6J1I2J0 (E4 SUMO-protein ligase PIAL2-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111469015 PE=4 SV=1)

HSP 1 Score: 1341.6 bits (3471), Expect = 0.0e+00
Identity = 687/853 (80.54%), Positives = 746/853 (87.46%), Query Frame = 0

Query: 1   MGASSPHEMNLNRIILYIDGLNLLLNRVGQMDPSQLCNLCFSIARSIDYTIANNIVPSKA 60
           MGA++P+EM L+RI  YID L L +NRV Q+DP QLCN+CFS+ARSID+ IAN+ VPSKA
Sbjct: 1   MGATTPYEMKLDRISSYIDSLTLYVNRVDQIDPVQLCNICFSLARSIDFAIANDFVPSKA 60

Query: 61  HSLPSFIKQLCQMKHSHRLKAALMVLMITIKNACKAKWFSEKDAEELYSLANEIGNDFFG 120
             LPS +KQ+CQ KHSH LKAA+MVLMI  KNACK KWFSEK+AEELYSLANEIG+DFF 
Sbjct: 61  QGLPSLLKQICQKKHSHHLKAAIMVLMIAAKNACKVKWFSEKEAEELYSLANEIGSDFFV 120

Query: 121 DTNIGQSNPLATISTVMERFFPRLKLGQIVASVEVKPGYGVYAIDFNILKTIQYAPQEKL 180
           DTN G SN L TI+ VMERFFPRLKLGQIV S EVKPGYGV+A DFNI KTIQYAPQEK+
Sbjct: 121 DTNTGPSNSLTTITKVMERFFPRLKLGQIVISAEVKPGYGVFAFDFNISKTIQYAPQEKI 180

Query: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNVYVDTGPQLPTNITHMLKLGSNLL 240
           RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTN+Y+DTGPQLPTN+THMLKLGSNLL
Sbjct: 181 RLFVAQKDNTETSACIISPPQVNFLVNGRGVNGRTNIYMDTGPQLPTNVTHMLKLGSNLL 240

Query: 241 QAIGSFNGHYVIAVAIMGTAPSPDSSVLQDHIQPVVSTVDSDSDIIEGPSRISLNCPISY 300
           Q IGSFNGHYVI+VA+MG+APSPDSSVLQDH QP VSTVDSDSDIIEGPSRISLNCPISY
Sbjct: 241 QVIGSFNGHYVISVAVMGSAPSPDSSVLQDHEQPAVSTVDSDSDIIEGPSRISLNCPISY 300

Query: 301 TRIKVPVKGRSCKHLQCFDFHNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLKASRVI 360
           TRIKVPVKGRSCKHLQCFDF+NFIDINSRRPSWRCPHCNQYICFLDI +DQNMLK   VI
Sbjct: 301 TRIKVPVKGRSCKHLQCFDFYNFIDINSRRPSWRCPHCNQYICFLDICIDQNMLK---VI 360

Query: 361 REVAENVTEVIISADGSWKAILENDHGDGRPLDDSLNHQKERAQEESTAPPDVLDLTEVD 420
           REVAENVTEVIISADGSWKAILEND GDGRPLDDSLN Q ERAQ+ESTAPPDVLDLTEVD
Sbjct: 361 REVAENVTEVIISADGSWKAILENDCGDGRPLDDSLNQQNERAQQESTAPPDVLDLTEVD 420

Query: 421 DDMDICNLETEDRKTCLDNKNQPVSSSLDISSGMNMNSLNQNLAAVLDDDFWSGIVTDGI 480
           DDM+ICNLETEDRK CL NKNQPVSSSL+I SGMN NSLNQN +A LDDDFWS +VTD +
Sbjct: 421 DDMNICNLETEDRKPCLGNKNQPVSSSLNILSGMNRNSLNQNFSAALDDDFWSRMVTDRL 480

Query: 481 LTSSAGSDAPMGNSTPPPGFAGIMQSTVFTDVVPPVLNHGAGVPGHANFSSSAFYDQNNL 540
           LTSS  SDAPMG+ST  P FAG+ QS   TD V PVLNH  GVPG  NF   +FYDQNN+
Sbjct: 481 LTSSIRSDAPMGSSTAAPSFAGLTQSAGLTDAVSPVLNHDVGVPGQVNFPFPSFYDQNNV 540

Query: 541 QIQVPNSNENNQYGRMPLIARPVSRTPIAVQALPAQSQVAGQQYSSRTPIISSAPQVGQS 600
           Q+QV NSNE+NQYGRM  IARPVSRT +A Q LPAQSQ +GQQYSSRT  +SSAPQVGQS
Sbjct: 541 QVQVSNSNESNQYGRMTSIARPVSRT-LAGQVLPAQSQTSGQQYSSRTSTVSSAPQVGQS 600

Query: 601 IPINRDGLNTLSRDLERRQQFSRHHGDSHHATNLAPFHHPQTVQNRDSQDRSFTPGQSVQ 660
           IPI+RDGLNT+SRD E RQ F RHHGD HHATNLAPF  P  VQNR+ QDRSFTPGQSV+
Sbjct: 601 IPISRDGLNTISRDSEMRQPFPRHHGDLHHATNLAPFLRPPIVQNREPQDRSFTPGQSVR 660

Query: 661 ASTALRPSTGLLTDFQNPHLQQALNMRISHLRNQNSSSVRPSLPFSRPMSQAGGGY---A 720
           ASTA RPS G+LTDFQNPHLQQALN+RISHL+NQN SSVRPSLPFSRP SQ GGGY   A
Sbjct: 661 ASTAQRPSVGILTDFQNPHLQQALNLRISHLQNQNPSSVRPSLPFSRPTSQVGGGYGGSA 720

Query: 721 YTAVTPNSQHARMVAASQRAEMMRQSSAMSLQNQTSRSAHSLQTTPDGLRRPAGEMRNV- 780
           YTAVTP++QHARM+ ASQRAEMMRQSSAMSLQNQTSRS H LQTTPDGLRRPAGE+RNV 
Sbjct: 721 YTAVTPHNQHARMMVASQRAEMMRQSSAMSLQNQTSRSPHPLQTTPDGLRRPAGELRNVG 780

Query: 781 GVSPSVTMAPGSVDLSVEQNWQPVGRMRGSLSGRAYSDAYGVIIQPTQAAQSARLPSHLS 840
           G++ SVTMA   +D SVEQN QP+GRMRGSLSGRAYSDA+GVIIQPTQ  QSAR PS+L+
Sbjct: 781 GMTQSVTMASNLLDPSVEQNRQPIGRMRGSLSGRAYSDAFGVIIQPTQPVQSARPPSNLT 840

Query: 841 PTQPSAPSTQAQR 850
            TQ SAPST AQR
Sbjct: 841 TTQSSAPSTHAQR 849

BLAST of Clc01G02440 vs. ExPASy TrEMBL
Match: A0A6J1EMF7 (E4 SUMO-protein ligase PIAL2-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435722 PE=4 SV=1)

HSP 1 Score: 1302.0 bits (3368), Expect = 0.0e+00
Identity = 675/858 (78.67%), Positives = 734/858 (85.55%), Query Frame = 0

Query: 1   MGASSPHEMNLNRIILYIDGLNLLLNRVGQMDPSQLCNLCFSIARSIDYTIANNIVPSKA 60
           MGA++P+EM L+RI  YID L L +NRV Q+DP QLCN+CFS+ARSID+ IAN+ VPSKA
Sbjct: 1   MGATTPYEMKLDRISSYIDSLTLYVNRVDQIDPVQLCNICFSLARSIDFAIANDFVPSKA 60

Query: 61  HSLPSFIKQLCQMKHSHRLKAALMVLMITIKNACKAKWFSEKDAEELYSLANEIGNDFFG 120
             LPS +KQ+CQ KHSH LKAA+MVLMI  KNACK KWFSEK+AEELYSLANEIG+DFF 
Sbjct: 61  QGLPSLLKQICQKKHSHHLKAAIMVLMIAAKNACKVKWFSEKEAEELYSLANEIGSDFFV 120

Query: 121 DTNIGQSNPLATISTVMERFFPRLKLGQIVASVEVKPGYGVYAIDFNILKTIQYAPQEKL 180
           DTN G SN L TI+TVMERFFPRLKLGQIV S EVKPGYGV+A DFNI KTIQYAPQEK+
Sbjct: 121 DTNTGPSNSLTTITTVMERFFPRLKLGQIVISAEVKPGYGVFAFDFNISKTIQYAPQEKI 180

Query: 181 RLFVAQKDNTETSACIISPPQVNFLVNG-----RGVNGRTNVYVDTGPQLPTNITHMLKL 240
           RLFVAQKDNTETSACIISPPQ+     G     + ++       DTGPQLPTN+THMLKL
Sbjct: 181 RLFVAQKDNTETSACIISPPQLPCQWEGSQWKDKYIHASFLFNKDTGPQLPTNVTHMLKL 240

Query: 241 GSNLLQAIGSFNGHYVIAVAIMGTAPSPDSSVLQDHIQPVVSTVDSDSDIIEGPSRISLN 300
           GSNLLQ IGSFNGHYVIAVA+MG+APSPDSSVLQDH QPVVSTVDSDSDIIEGPSRISLN
Sbjct: 241 GSNLLQVIGSFNGHYVIAVAVMGSAPSPDSSVLQDHEQPVVSTVDSDSDIIEGPSRISLN 300

Query: 301 CPISYTRIKVPVKGRSCKHLQCFDFHNFIDINSRRPSWRCPHCNQYICFLDIRVDQNMLK 360
           CPISYTRIKVPVKGRSCKHLQCFDF+NFIDINSRRPSWRCPHCNQYICFLDI +DQNMLK
Sbjct: 301 CPISYTRIKVPVKGRSCKHLQCFDFYNFIDINSRRPSWRCPHCNQYICFLDICIDQNMLK 360

Query: 361 ASRVIREVAENVTEVIISADGSWKAILENDHGDGRPLDDSLNHQKERAQEESTAPPDVLD 420
              VIREVAENVTEVIISADGSWKAILEND GDGRPLDDSLN Q ERAQ+ESTAPPDVLD
Sbjct: 361 ---VIREVAENVTEVIISADGSWKAILENDCGDGRPLDDSLNQQNERAQQESTAPPDVLD 420

Query: 421 LTEVDDDMDICNLETEDRKTCLDNKNQPVSSSLDISSGMNMNSLNQNLAAVLDDDFWSGI 480
           LTEVDDDM+ICNLETEDRK CL NKNQPVSSSL+I SGMN NSLNQN +A LDDDFWSG+
Sbjct: 421 LTEVDDDMNICNLETEDRKPCLGNKNQPVSSSLNILSGMNRNSLNQNFSAALDDDFWSGM 480

Query: 481 VTDGILTSSAGSDAPMGNSTPPPGFAGIMQSTVFTDVVPPVLNHGAGVPGHANFSSSAFY 540
           VTD +L SS  SDAPMG+ST  P FAG+ QS   TD V PVLNH  GVPG  NF   AFY
Sbjct: 481 VTDRLLISSIRSDAPMGSSTAAPSFAGLTQSAGLTDAVSPVLNHDVGVPGQVNFPFPAFY 540

Query: 541 DQNNLQIQVPNSNENNQYGRMPLIARPVSRTPIAVQALPAQSQVAGQQYSSRTPIISSAP 600
           DQNN+Q+QV NSNE+NQYGRM  IARPVSRT +A Q LPAQSQ +GQQYSSRT  ISSAP
Sbjct: 541 DQNNVQVQVSNSNESNQYGRMTSIARPVSRT-LAGQVLPAQSQTSGQQYSSRTSTISSAP 600

Query: 601 QVGQSIPINRDGLNTLSRDLERRQQFSRHHGDSHHATNLAPFHHPQTVQNRDSQDRSFTP 660
           QVGQSIPI+RDGLNT+SRD ERRQ F RHHGD HHATNLAPF  P  VQNR+ QDRSFTP
Sbjct: 601 QVGQSIPISRDGLNTISRDSERRQPFPRHHGDLHHATNLAPFLRPPIVQNREPQDRSFTP 660

Query: 661 GQSVQASTALRPSTGLLTDFQNPHLQQALNMRISHLRNQNSSSVRPSLPFSRPMSQAGGG 720
           GQSV+ASTA RPS G+LTDFQNPHLQQ+LN+RISHLRNQN SSVRPSLPFSRP SQ GGG
Sbjct: 661 GQSVRASTAQRPSAGILTDFQNPHLQQSLNLRISHLRNQNPSSVRPSLPFSRPTSQVGGG 720

Query: 721 Y---AYTAVTPNSQHARMVAASQRAEMMRQSSAMSLQNQTSRSAHSLQTTPDGLRRPAGE 780
           Y   AY AVTP++QHARM+ ASQRAEMMRQSSAMSLQNQTSRS H LQTTPDGLRRPAG+
Sbjct: 721 YGGSAYPAVTPHNQHARMMVASQRAEMMRQSSAMSLQNQTSRSPHPLQTTPDGLRRPAGD 780

Query: 781 MRNV-GVSPSVTMAPGSVDLSVEQNWQPVGRMRGSLSGRAYSDAYGVIIQPTQAAQSARL 840
           +RNV G++ SVTMA   +D SVEQN QP+GRMRGSLSGRAYSDAYGVIIQPTQ  QS R 
Sbjct: 781 LRNVGGMTQSVTMASDLLDPSVEQNRQPIGRMRGSLSGRAYSDAYGVIIQPTQPVQSTRP 840

Query: 841 PSHLSPTQPSAPSTQAQR 850
           PS+L+ TQ +APST AQR
Sbjct: 841 PSNLTTTQSNAPSTHAQR 854

BLAST of Clc01G02440 vs. TAIR 10
Match: AT4G23560.1 (glycosyl hydrolase 9B15)

HSP 1 Score: 553.1 bits (1424), Expect = 5.8e-157
Identity = 258/392 (65.82%), Positives = 319/392 (81.38%), Query Frame = 0

Query: 903  YSDIMLSWTPIEYEKEIASVMQLEHLRSSVRWGADFILRAHVSPTTLYTQVGDGNGDHQC 962
            ++  +LSW  IEY+ EI+SV QL +LRS+++WG DFILRAH SP  LYTQVGDGN DH C
Sbjct: 86   FTTTLLSWAAIEYQNEISSVNQLGYLRSTIKWGTDFILRAHTSPNMLYTQVGDGNSDHSC 145

Query: 963  WERPQDMDTPRTLYKITPNSPGTEAAAEAAAALAAASIVFHHVDANYSRSLLQHSKSLFQ 1022
            WERP+DMDT RTLY I+ +SPG+EAA EAAAALAAAS+VF  VD+ YS +LL H+K+LF+
Sbjct: 146  WERPEDMDTSRTLYSISSSSPGSEAAGEAAAALAAASLVFKSVDSTYSSTLLNHAKTLFE 205

Query: 1023 FADQFRGSYSASCPFYCSFSGYQDELLWAAAWVYKASGNSKYLSYILSNQGWSQPTSQFS 1082
            FAD++RGSY ASCPFYCS+SGYQDELLWAAAW+YKA+G+  Y++Y++SN+ WSQ  ++FS
Sbjct: 206  FADKYRGSYQASCPFYCSYSGYQDELLWAAAWLYKATGDKIYINYVISNKDWSQAVNEFS 265

Query: 1083 WDNKFVGAQTLLAKELYKGKKNLSKFKIDAESFICMVMPGGSCSKIPTTPGGLLFLRDNS 1142
            WDNKFVGAQ LL  E Y G  +L+KFK D ESF+C +MPG S  +I  TPGGLLF+RD+S
Sbjct: 266  WDNKFVGAQALLVSEFYNGANDLAKFKSDVESFVCAMMPGSSSQQIKPTPGGLLFIRDSS 325

Query: 1143 NLQYASSSSMVLFMYSRLLNKARVDGVHCGSKYFSSSQIKTFAKSQVDYILGKNPMKWSY 1202
            NLQY ++++ VLF YS+ L KA V  + CGS  F+ SQI+ FAKSQVDYILG NPMK SY
Sbjct: 326  NLQYVTTATTVLFHYSKTLTKAGVGSIQCGSTKFTVSQIRNFAKSQVDYILGNNPMKMSY 385

Query: 1203 MVGFGNKYPLQLHHRASSIPSIKVHSTKVGCNDGYSHYFYSNNPNPNVHIGAIVGGPNSN 1262
            MVGFG KYP Q HHR SS+PSI+    K+ CN GYS Y+ S+ PNPNVHIGAIVGGPNS+
Sbjct: 386  MVGFGTKYPTQPHHRGSSLPSIQSKPEKIDCNGGYS-YYNSDTPNPNVHIGAIVGGPNSS 445

Query: 1263 DQFSDLRSDHSHSEPTTYMNAAFVGSVAALVA 1295
            DQ+SD +SD+SH+EPTTY+NAAF+G VAAL++
Sbjct: 446  DQYSDKKSDYSHAEPTTYINAAFIGPVAALIS 476

BLAST of Clc01G02440 vs. TAIR 10
Match: AT4G09740.1 (glycosyl hydrolase 9B14)

HSP 1 Score: 537.3 bits (1383), Expect = 3.3e-152
Identity = 249/392 (63.52%), Positives = 312/392 (79.59%), Query Frame = 0

Query: 903  YSDIMLSWTPIEYEKEIASVMQLEHLRSSVRWGADFILRAHVSPTTLYTQVGDGNGDHQC 962
            ++  +LSW  +EY+ EI  V QL +LRS+++WG +FILRAH S   LYTQVGDGN DH C
Sbjct: 86   FTTTLLSWAALEYQNEITFVNQLGYLRSTIKWGTNFILRAHTSTNMLYTQVGDGNSDHSC 145

Query: 963  WERPQDMDTPRTLYKITPNSPGTEAAAEAAAALAAASIVFHHVDANYSRSLLQHSKSLFQ 1022
            WERP+DMDTPRTLY I+ +SPG+EAA EAAAALAAAS+VF  VD+ YS  LL ++KSLF+
Sbjct: 146  WERPEDMDTPRTLYSISSSSPGSEAAGEAAAALAAASLVFKLVDSTYSSKLLNNAKSLFE 205

Query: 1023 FADQFRGSYSASCPFYCSFSGYQDELLWAAAWVYKASGNSKYLSYILSNQGWSQPTSQFS 1082
            FAD++RGSY ASCPFYCS SGYQDELLWAAAW+YKA+G   YL+Y++SN+ WS+  ++FS
Sbjct: 206  FADKYRGSYQASCPFYCSHSGYQDELLWAAAWLYKATGEKSYLNYVISNKDWSKAINEFS 265

Query: 1083 WDNKFVGAQTLLAKELYKGKKNLSKFKIDAESFICMVMPGGSCSKIPTTPGGLLFLRDNS 1142
            WDNKF G Q LLA E Y G  +L KFK D ESF+C +MPG S  +I  TPGG+LF+RD+S
Sbjct: 266  WDNKFAGVQALLASEFYNGANDLEKFKTDVESFVCALMPGSSSQQIKPTPGGILFIRDSS 325

Query: 1143 NLQYASSSSMVLFMYSRLLNKARVDGVHCGSKYFSSSQIKTFAKSQVDYILGKNPMKWSY 1202
            NLQY ++++ +LF YS+ L KA V  + CGS  F+ SQI+ FAKSQVDYILG NP+K SY
Sbjct: 326  NLQYVTTATTILFYYSKTLTKAGVGSIQCGSTQFTVSQIRNFAKSQVDYILGNNPLKMSY 385

Query: 1203 MVGFGNKYPLQLHHRASSIPSIKVHSTKVGCNDGYSHYFYSNNPNPNVHIGAIVGGPNSN 1262
            MVGFG KYP Q HHR SS+PSI+    K+ CN G+S+Y + + PNPNVH GAIVGGPNS+
Sbjct: 386  MVGFGTKYPTQPHHRGSSLPSIQSKPEKIDCNGGFSYYNF-DTPNPNVHTGAIVGGPNSS 445

Query: 1263 DQFSDLRSDHSHSEPTTYMNAAFVGSVAALVA 1295
            DQ+SD R+D+SH+EPTTY+NAAF+GSVAAL++
Sbjct: 446  DQYSDKRTDYSHAEPTTYINAAFIGSVAALIS 476

BLAST of Clc01G02440 vs. TAIR 10
Match: AT1G02800.1 (cellulase 2)

HSP 1 Score: 407.1 bits (1045), Expect = 5.1e-113
Identity = 206/399 (51.63%), Positives = 281/399 (70.43%), Query Frame = 0

Query: 903  YSDIMLSWTPIEYEKEIASVMQLEHLRSSVRWGADFILRAHVSPTTLYTQVGDGNGDHQC 962
            ++  MLSW+ IE+   + S  +L + + ++RW  DF+L+A   P T+Y QVGD N DH C
Sbjct: 106  FTTTMLSWSLIEFGGLMKS--ELPNAKDAIRWATDFLLKATSHPDTIYVQVGDPNMDHAC 165

Query: 963  WERPQDMDTPRTLYKITPNSPGTEAAAEAAAALAAASIVFHHVDANYSRSLLQHSKSLFQ 1022
            WERP+DMDTPR+++K+  N+PG++ A E AAALAAASIVF   D +YS  LLQ + ++F 
Sbjct: 166  WERPEDMDTPRSVFKVDKNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITVFT 225

Query: 1023 FADQFRGSYSAS-----CPFYCSFSGYQDELLWAAAWVYKASGNSKYLSYILSN---QGW 1082
            FAD++RG YSA      CPFYCS+SGYQDELLW AAW+ KA+ N  YL+YI +N    G 
Sbjct: 226  FADKYRGPYSAGLAPEVCPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQILGA 285

Query: 1083 SQPTSQFSWDNKFVGAQTLLAKE-LYKGKKNLSKFKIDAESFICMVMPGGSCSKIPTTPG 1142
             +  + FSWDNK VGA+ LL+KE L +  K+L ++K  A+SFIC V+PG S S+   TPG
Sbjct: 286  DEFDNMFSWDNKHVGARILLSKEFLIQKVKSLEEYKEHADSFICSVLPGASSSQY--TPG 345

Query: 1143 GLLFLRDNSNLQYASSSSMVLFMYSRLLNKARVDGVHCGSKYFSSSQIKTFAKSQVDYIL 1202
            GLLF    SN+QY +S+S +L  Y++ L  AR    +CG    + +++++ AK QVDY+L
Sbjct: 346  GLLFKMGESNMQYVTSTSFLLLTYAKYLTSART-VAYCGGSVVTPARLRSIAKKQVDYLL 405

Query: 1203 GKNPMKWSYMVGFGNKYPLQLHHRASSIPSIKVHSTKVGCNDGYSHYFYSNNPNPNVHIG 1262
            G NP+K SYMVG+G KYP ++HHR SS+PS+ VH T++ C+DG+S  F S +PNPN  +G
Sbjct: 406  GGNPLKMSYMVGYGLKYPRRIHHRGSSLPSVAVHPTRIQCHDGFS-LFTSQSPNPNDLVG 465

Query: 1263 AIVGGPNSNDQFSDLRSDHSHSEPTTYMNAAFVGSVAAL 1293
            A+VGGP+ NDQF D RSD+  SEP TY+NA  VG++A L
Sbjct: 466  AVVGGPDQNDQFPDERSDYGRSEPATYINAPLVGALAYL 498

BLAST of Clc01G02440 vs. TAIR 10
Match: AT5G41580.1 (RING/U-box superfamily protein)

HSP 1 Score: 395.2 bits (1014), Expect = 2.0e-109
Identity = 212/474 (44.73%), Positives = 299/474 (63.08%), Query Frame = 0

Query: 30  QMDPSQLCNLCFSIARSIDYTIANNIVPSKAHSLPSFIKQLCQMKHSHRLKAALMVLMIT 89
           ++DP +    C S A+ ID+ IANN +P K    P  +KQLC+       K ALMVLMI+
Sbjct: 45  KVDPKEFQICCISFAKGIDFAIANNDIPKKVEEFPWLLKQLCRHGTDVYTKTALMVLMIS 104

Query: 90  IKNACKAKWFSEKDAEELYSLANEIGNDF--FGDTNIGQSNPLATISTVMERFFPRLKLG 149
           +K+AC   WFS+ +++EL +LA+EI   F   G T+ G  +P +T S +MERF+P +KLG
Sbjct: 105 VKHACHLGWFSDSESQELIALADEIRTCFGSSGSTSPGIKSPGSTFSQIMERFYPFVKLG 164

Query: 150 QIVASVEVKPGYGVYAIDFNILKTIQYAPQEKLRLFVAQKDNTETSACIISPPQVNFLVN 209
            ++ S EVK GY + A DF I K + ++ QEK+RLFVAQ DN +TSACI +PP+V+FL+N
Sbjct: 165 HVLVSFEVKAGYTMLAHDFYISKNMPHSLQEKIRLFVAQTDNIDTSACISNPPEVSFLLN 224

Query: 210 GRGVNGRTNVYVDTGPQLPTNITHMLKLGSNLLQAIGSFNGHYVIAVAIMGTAPSPDSSV 269
           G+GV  R N+ +DTGPQLPTN+T  LK G+NLLQ +G+F G+Y+I +A  G    P+  V
Sbjct: 225 GKGVEKRVNIAMDTGPQLPTNVTAQLKYGTNLLQVMGNFKGNYIIIIAFTGLVVPPEKPV 284

Query: 270 LQDHIQPVVSTVDSDSDIIEGPSRISLNCPISYTRIKVPVKGRSCKHLQCFDFHNFIDIN 329
           L+D++Q  V     DSDIIEGPSR+SL+CPIS  RIK+PVKG+ CKHLQCFDF N++ IN
Sbjct: 285 LKDYLQSGVIEASPDSDIIEGPSRVSLSCPISRKRIKLPVKGQLCKHLQCFDFSNYVHIN 344

Query: 330 SRRPSWRCPHCNQYICFLDIRVDQNMLKASRVIREVAENVTEVIISADGSWKAILENDHG 389
            R P+WRCPHCNQ +C+ DIR+DQNM K   ++++V  N  +VII A G+WK + +N   
Sbjct: 345 MRNPTWRCPHCNQPVCYPDIRLDQNMAK---ILKDVEHNAADVIIDAGGTWK-VTKNTGE 404

Query: 390 DGRPLDDSLNHQKERAQEESTAPPDVLDLTEVDDDMDI---CNLETEDRKTCLDNKNQPV 449
              P+ + + H  E       + P V DLT  DDD ++    + + EDRK C+       
Sbjct: 405 TPEPVREII-HDLEDPMSLLNSGPVVFDLTG-DDDAELEVFGDNKVEDRKPCMS------ 464

Query: 450 SSSLDISSGMNMNSLNQNLAAVLDDDFWSGIVTDGILTSSAGSDAPMGNSTPPP 499
               D     N N+ N++ +   +DD+ S      ++       + +GN+ P P
Sbjct: 465 ----DAQGQSNNNNTNKHPS---NDDYSSIFDISDVIALDPEILSALGNTAPQP 499

BLAST of Clc01G02440 vs. TAIR 10
Match: AT4G02290.1 (glycosyl hydrolase 9B13)

HSP 1 Score: 384.4 bits (986), Expect = 3.6e-106
Identity = 189/397 (47.61%), Positives = 273/397 (68.77%), Query Frame = 0

Query: 903  YSDIMLSWTPIEYEKEIASVMQLEHLRSSVRWGADFILRAHVSPTTLYTQVGDGNGDHQC 962
            ++  MLSW+ IE+   + S  +L++ + ++RW  D++L+A   P T+Y QVGD N DH C
Sbjct: 115  FTTTMLSWSVIEFGGLMKS--ELQNAKIAIRWATDYLLKATSQPDTIYVQVGDANKDHSC 174

Query: 963  WERPQDMDTPRTLYKITPNSPGTEAAAEAAAALAAASIVFHHVDANYSRSLLQHSKSLFQ 1022
            WERP+DMDT R+++K+  N PG++ AAE AAALAAA+IVF   D +YS+ LL+ + S+F 
Sbjct: 175  WERPEDMDTVRSVFKVDKNIPGSDVAAETAAALAAAAIVFRKSDPSYSKVLLKRAISVFA 234

Query: 1023 FADQFRGSYSAS-----CPFYCSFSGYQDELLWAAAWVYKASGNSKYLSYILSN---QGW 1082
            FAD++RG+YSA      CPFYCS+SGYQDELLW AAW+ KA+ N KYL+YI  N    G 
Sbjct: 235  FADKYRGTYSAGLKPDVCPFYCSYSGYQDELLWGAAWLQKATKNIKYLNYIKINGQILGA 294

Query: 1083 SQPTSQFSWDNKFVGAQTLLAKE-LYKGKKNLSKFKIDAESFICMVMPGGSCSKIPTTPG 1142
            ++  + F WDNK  GA+ LL K  L +  K L ++K  A++FIC V+PG   S    TPG
Sbjct: 295  AEYDNTFGWDNKHAGARILLTKAFLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPG 354

Query: 1143 GLLFLRDNSNLQYASSSSMVLFMYSRLLNKARVDGVHCGSKYFSSSQIKTFAKSQVDYIL 1202
            GLLF   ++N+QY +S+S +L  Y++ L  A+   VHCG   ++  ++++ AK QVDY+L
Sbjct: 355  GLLFKMADANMQYVTSTSFLLLTYAKYLTSAKT-VVHCGGSVYTPGRLRSIAKRQVDYLL 414

Query: 1203 GKNPMKWSYMVGFGNKYPLQLHHRASSIPSIKVHSTKVGCNDGYSHYFYSNNPNPNVHIG 1262
            G NP++ SYMVG+G K+P ++HHR SS+P +  H  K+ C+ G++    S +PNPN  +G
Sbjct: 415  GDNPLRMSYMVGYGPKFPRRIHHRGSSLPCVASHPAKIQCHQGFA-IMNSQSPNPNFLVG 474

Query: 1263 AIVGGPNSNDQFSDLRSDHSHSEPTTYMNAAFVGSVA 1291
            A+VGGP+ +D+F D RSD+  SEP TY+N+  VG++A
Sbjct: 475  AVVGGPDQHDRFPDERSDYEQSEPATYINSPLVGALA 507

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_011654714.1 | 0.0e+00 | 82.37 | E4 SUMO-protein ligase PIAL2 isoform X1 [Cucumis sativus] >KAE8647846.1 hypothet...
XP_008437346.1 | 0.0e+00 | 82.98 | PREDICTED: E4 SUMO-protein ligase PIAL2 isoform X1 [Cucumis melo]
KAG6579533.1 | 0.0e+00 | 81.36 | E4 SUMO-protein ligase PIAL2, partial [Cucurbita argyrosperma subsp. sororia]
XP_022928990.1 | 0.0e+00 | 80.89 | E4 SUMO-protein ligase PIAL2-like isoform X2 [Cucurbita moschata]
XP_022969988.1 | 0.0e+00 | 80.54 | E4 SUMO-protein ligase PIAL2-like isoform X2 [Cucurbita maxima]

Match Name | E-value | Identity (%) | Description
P22503 | 8.1e-172 | 72.70 | Endoglucanase OS=Phaseolus vulgaris OX=3885 PE=2 SV=2
Q9SUS0 | 8.2e-156 | 65.82 | Endoglucanase 20 OS=Arabidopsis thaliana OX=3702 GN=At4g23560 PE=2 SV=1
Q9SZ90 | 4.6e-151 | 63.52 | Endoglucanase 18 OS=Arabidopsis thaliana OX=3702 GN=At4g09740 PE=3 SV=2
Q6ZA06 | 4.7e-135 | 58.81 | Endoglucanase 20 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU15 PE=2 SV=1
Q9SRX3 | 7.2e-112 | 51.63 | Endoglucanase 1 OS=Arabidopsis thaliana OX=3702 GN=CEL2 PE=2 SV=1

Match Name | E-value | Identity (%) | Description
A0A0A0KMH4 | 0.0e+00 | 82.37 | SP-RING-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G15219...
A0A1S3AUE1 | 0.0e+00 | 82.98 | E4 SUMO-protein ligase PIAL2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103482791 ...
A0A6J1ESZ6 | 0.0e+00 | 80.89 | E4 SUMO-protein ligase PIAL2-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LO...
A0A6J1I2J0 | 0.0e+00 | 80.54 | E4 SUMO-protein ligase PIAL2-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC1...
A0A6J1EMF7 | 0.0e+00 | 78.67 | E4 SUMO-protein ligase PIAL2-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LO...

Match Name | E-value | Identity (%) | Description
AT4G23560.1 | 5.8e-157 | 65.82 | glycosyl hydrolase 9B15
AT4G09740.1 | 3.3e-152 | 63.52 | glycosyl hydrolase 9B14
AT1G02800.1 | 5.1e-113 | 51.63 | cellulase 2
AT5G41580.1 | 2.0e-109 | 44.73 | RING/U-box superfamily protein
AT4G02290.1 | 3.6e-106 | 47.61 | glycosyl hydrolase 9B13

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR013083 | Zinc finger, RING/FYVE/PHD-type | GENE3D | 3.30.40.10 | Zinc/RING finger domain, C3HC4 (zinc finger) | coord: 259..380; e-value: 5.9E-37; score: 128.3
IPR012341 | Six-hairpin glycosidase-like superfamily | GENE3D | 1.50.10.10 | | coord: 897..1294; e-value: 4.8E-123; score: 413.4
IPR001701 | Glycoside hydrolase family 9 | PFAM | PF00759 | Glyco_hydro_9 | coord: 902..1287; e-value: 8.6E-108; score: 361.6
IPR004181 | Zinc finger, MIZ-type | PFAM | PF02891 | zf-MIZ | coord: 292..340; e-value: 2.2E-20; score: 72.1
IPR004181 | Zinc finger, MIZ-type | PROSITE | PS51044 | ZF_SP_RING | coord: 281..358; score: 38.800083
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 748..771
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 619..664
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 748..762
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 635..664
None | No IPR available | PANTHER | PTHR22298 | ENDO-1,4-BETA-GLUCANASE | coord: 906..1293
None | No IPR available | PANTHER | PTHR22298:SF22 | ENDOGLUCANASE 18-RELATED | coord: 906..1293
None | No IPR available | CDD | cd16650 | SP-RING_PIAS_like | coord: 293..340; e-value: 8.83665E-25; score: 96.1741
IPR018221 | Glycoside hydrolase family 9, His active site | PROSITE | PS00592 | GH9_2 | coord: 1191..1217
IPR008928 | Six-hairpin glycosidase superfamily | SUPERFAMILY | 48208 | Six-hairpin glycosidases | coord: 903..1293
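
The InterPro hits above overlap heavily, so it can help to collapse their coordinates into the distinct stretches of the protein they cover. The sketch below is only an illustration: `merge_intervals` is a hypothetical helper, the coordinate list is copied by hand from the "coord:" values in the table, and coordinates are treated as inclusive residue positions.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) ranges with inclusive coordinates."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the current region: extend it if this hit ends later.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# "coord:" values from the InterPro table, in the order listed there.
coords = [
    (259, 380), (897, 1294), (902, 1287), (292, 340), (281, 358),
    (748, 771), (619, 664), (748, 762), (635, 664), (906, 1293),
    (906, 1293), (293, 340), (1191, 1217), (903, 1293),
]

regions = merge_intervals(coords)
for start, end in regions:
    print(f"{start}..{end}")
```

Under these assumptions the fourteen hits collapse into four stretches: 259..380 (zinc finger and SP-RING hits), 619..664 and 748..771 (predicted disorder), and 897..1294 (glycoside hydrolase family 9 hits).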

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Clc01G02440.1 | Clc01G02440.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0005975 carbohydrate metabolic process
biological_process GO:0016925 protein sumoylation
molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
molecular_function GO:0016874 ligase activity
molecular_function GO:0019789 SUMO transferase activity
molecular_function GO:0008270 zinc ion binding
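
For downstream use, the GO assignments can be grouped by category. This is a minimal sketch that assumes each row has the shape "category, GO accession, term name" as listed above; the variable names and the regular expression are illustrative and not part of any database API.

```python
import re
from collections import defaultdict

# One row per GO assignment, copied from the listing above.
rows = [
    "biological_process GO:0005975 carbohydrate metabolic process",
    "biological_process GO:0016925 protein sumoylation",
    "molecular_function GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds",
    "molecular_function GO:0016874 ligase activity",
    "molecular_function GO:0019789 SUMO transferase activity",
    "molecular_function GO:0008270 zinc ion binding",
]

# Category, then a GO accession (GO: plus seven digits), then the term name.
GO_ROW = re.compile(r"^(\S+)\s+(GO:\d{7})\s+(.+)$")

by_category = defaultdict(list)
for row in rows:
    match = GO_ROW.match(row)
    if match is None:
        continue  # skip header or malformed lines
    category, accession, name = match.groups()
    by_category[category].append((accession, name))

for category, terms in by_category.items():
    print(category)
    for accession, name in terms:
        print(f"  {accession}  {name}")
```

Grouped this way, the listing contains two biological_process terms and four molecular_function terms.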