Cla97C03G053340 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C03G053340
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: E3 SUMO-protein ligase SIZ1
Location: Cla97Chr03: 2503407 .. 2521127 (-)
RNA-Seq Expression: Cla97C03G053340
Synteny: Cla97C03G053340
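
The Location field packs chromosome, start, end, and strand into one string. The sketch below shows one way to split it apart and derive the genomic span. It assumes the coordinates are 1-based and inclusive (a common convention for GFF-style annotations, not something this record states), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'Cla97Chr03: 2503407 .. 2521127 (-)' into fields.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "chrom": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,  # inclusive span, under the stated assumption
    }

loc = parse_location("Cla97Chr03: 2503407 .. 2521127 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 17721 bp on the minus strand of Cla97Chr03.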
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTGGTTGGGGTTTCGGTCAAAAAGAAACACTAGCGCTAAAGGACAGGGTCGAGTGGGGAAGTCGGCGTGGGCGACCCGAGCCTCTGTTCATCAATATACCGCTTTCAATTCTTCTTCCACTTTGTACAAATCCCAAATTTTCCTTTAGGTAAATTTGCAATCTGTTTATACGCCATCTGTTGTTCAATTGGTAGTTTGTCTTTTGTTGATTAAATTTTGCTTAAATTTTTGTATGTGAAATCTCATTTTGTTTTGTTTTTGTTTTTAATTTTGGTCGTTTTCATGAGATCAATCGCAGTTTTGAGGACCATGCGTTGGTGGTGTGTGTTGGGTTACTGTTAAAATCGTCTTTTTGGCATACCATCGTTAGGGATTTTGTGGGTTGTTGCTTGGAAGGGATGGACTTGGTTGCTAATTGCAAGGTACTTTTTAATGTTGGTTTTTAGTTCAATTTTTGATTTTCAGTGCATGTTTTTCTTATTTGAATTTGGGGATACTATTTGCCTTGTCACTAGGCTTGCTCCAGGATTTTGTTGATGGAATGTAAGGTTAATGTGGATTTTCACTTGTGAAATTCCGTGTAGCTAAATTCATAGGAGATTTGCTTCTCTGCTTTAGTTACTTTTTTCTCTGTTAGCTTGAGATGATCAGGATATCACAAGTTTAATTTAGAATTTGTTTGTTACATTTTTTCGGTACCCTCTGGATGTTTACTTTGATTGTTGAACAGTGTTAAAATCCCCTTGAATGCAATCAAAGTGGCTAGCCTTGTGCGTGATTCTGTCTTTCAAGCTCTCACATCTTTCAGGGATTACTGTTGAACTGGTTGTTGACTATCAAATATAAATGTTCAGATGTAAAATATTGTTGATTAGACAATCGCCTCGCCTCTTGAACTAAGAGATCATAGAAAGTCAGGGGGACTTTTGTTTTTATATAAAGGAAAAAAATTAGATGTAGCAGGATTTGGCCAATTGAGTATTTAGGAAGGAATTTCTGTAGTTTGATTAGGTAGTCGTAGTACTATACCATCTATCTGTTTAACTAATTTATCTACATCATCTTTTCAGCTCATTTTATATTTCAATGTGATATATGATTGGGAAGTTGAGAAGACGTCTAGTTTTGATGTTAACTGAATGCAGCTCATCTAAAACAATACTTTGATCTTGTATAGTATAAGATTTGAACATGTGTTTAATGATTTTAGTCCTGGTTTGTATATCCACCCATAAAAATTTACGACAACGATACTCTAAGACAGGCAAAGTATTTTCATGCTGTCTCTTATATTTTCCTTTTGAGAAGGGAACCACAAATTGATTATCATTTACTAGTTAAGAACAGGAGGATGGTAATTGGTATTCAATACTATACTTGTTTATCTTCATATTGGATAGTTTCTTAAGCACATTGCCTTGAATACATATTATGTATGCATTTGAGATGATATCTAGGAATGAATTAGAGTCCCTTCCCTTTGTCTTATTTGTTAGCTATGATGTATTTTATGTTTTACTTAGTTTCTTGTTAGTGCGATGTCTTTACCATTTAGTTTTTCTGTCTAATAAAATGAAGTATACTTTCTGCAATCTGCTGTAGTTGATAGATTGATTGTTCCATCTATTTTGATTTAGTCTTAGTTGTCCAAAAGTTTCGTATGTGATTTCCGTTTTCTATTTGCTGGTTTATCTTTTTCTTGAATTTCAAACTTCTAATTCATCATTTGAGAAACTCCAAAATTAGCATTAATAATTAAATTGGATTTTTAGGTGGTTGTTGCTTTGGTTAGACAGATTATATCTTTTTCCATCCTTGCTCAGATTTTATTAGATCAGTCTGGATCTTGTGAAGAAATATTAATGCTACTGATTTGAATTCTGACTGTGCTCTTTGATTTTCACTTGTAGGATAAATTGGCTTATTTTCGAATAAAAGAGCTCAAGGATATTCTCACTCAACTAGGTCTCTCAAAGCAAGGAAAGAAGCAGGTGAATTGT
CTGAATGTTGACTGTTATTTTTTTCTTTTTCTTTTCGTAAGAAATGGTCTCTTGTTCAGAGAAATGTTTAGTAATACACAAAAAATCAAAGAGGGTCTAGGCCAATCAATACAAAGGTTCAAAAGACAAAAAGCAAGCCTACAATAAAGAGCCCTAATTGTTGGTTTCGGCATTGATTCAAATTGTATAGTGGTGCATTTAGATAATTGTACCTGCATTTGGAAACGTCTGATGATGGTTTTTGTCTTCTTCTACTCTTTGCTCAGCATGCTACATGAACATTATTAACTTTTGCTGGAGACTGGATGTCACTGAAAAAAAAAAAAAAGTATATAAGTTTTTTGAGGTGCATGATAAAAAACTGTAATTTTTTTTATTTATGAAAAAATATAATATTATTAACCACGCTTTTGTAAAAGATTGTTTAGCTTTAGAGAAATGAGGGTTTTTGGATATTTCTTCATTTATTTATATATGTTTTTTTTTCTGTGTTTTTAAACAAAAAAAAGAAGGCGAGGTTGTGAGGGAGCCTTTTTGTCGAGGCACAATTTTTTAAAGCACTGTTTGGTGATTTATGAAAGTTGTATTCTTTATACAAATGTTGAGGTTAATTAGGGGAAACAATGGTTTTGTCTAGGTAAAAGTGTTTTCATATGACTAGCACGAGATCTTTGATGAGTCCAGAGAGAACTTTTTTTCAGATTATCATTAGTCATGCCAACCAGATAGAGTTCTTTTAGGCTGTCACTTGGAATAAGAAAGTAGAGTCGTAGCAACTGGGGGTGTGTCTTCTTAGAGGGCTTTTGTATTAAATTTGAAAAATGCGATTAAGAGGTCTTTGGGTGAATCTCTGTCATTCCCAACTGTTGAGGCTGGGTTAAAGTTTCATATTCCTTTGAACTTGTGGAAGGAGGTCTTGCAATCCTTATGATGACACATGTGAGGATTTCGTTTATTGTACTATAAATCCTTTAATTGAACTGATCTGAATGTATGGGAGATATTATCGAAGTCCAAAGGGATTATCTTGGGTTTCTGCCAGCAGAATCCATTTGCTCATCTTGATTCCCTTTATGTTGCTAAGTCTTTATGATGAATTTGAACTCCTAATGAACTACAAATCTCAAATTCATGAATGCTTCTTTAAGATCATCTTTGAGGAAAATTCTTCCTCTTGGGGTTTGTATGTCAGCAGCACAGTTTCAACTTCTGGGAGAAGGGAGTCTTGACAAGAGTTGTGTGACTTATTCGGGATTTGTGACCTATTAGGTGTTTGTAACAGTCCTTCGTGTATTAGTGGGAATTCAAATATGGTTAGATGAGTTTATTTTATTTTATATATTGGGTTATAGTTTCTCTATCAAGTGGCTTGAATGGGTTTATCAAAGACCATGGGGGTGCGCACTTTTATTTGTTCTGATATATTTTATTGTAAGCTCAAATCTTTTATTTTATTGACTTATGGTAGAAGGTTTAGATACCTTTTGGATAGTTGGCTTTTAGTTGAGTTTCTTCTTTGTAAGTTGCATTAGCTACATTGTGGCTTTTCATTTTATCCCTGAAAAGTTCTCTCTTATGACCTTGCTTGAACATGATCGGTTCTATGGGATGTACAACTGTGTCCTTGAGTTCTAATGAAAAATTCTCTCTTCTTTCGCCTTTTTTTTGGTTTGTTTGTTTGTTTCAAATGAAGTTTACCAGCTAGCACAGGACATATGAAAAAGTTTAACAAGCTTGTCCCATAGCGCTCCCTCATTGATATTCCCTTGTCAAATGGAGGTTCATTATGAGAGAGCAACTTGTTTTTGTTTTCATTGATTGTATTCTTTGCATCAGTGCCTGGTTCACCAAATACACTCTCCTGAAGGCTGAAAGTGAGAAATTTTTTTGGTCACTTCCCTATTCTCCTTGGCACTTGTAGGGTCTAATGCCCTTCAAAAGAAAAGAAAAGAAATAATTCTGGCTTCATCAATGCACACTCATCAAGAATTGGGGCGGGG
GTCAGGGCGGGGATGGGGGGGATCTAATTGGGTGGAAGGTTTGCAAGGCTTACGGGCCAATTGGGAATTGAAATCCTTCTGATAATTCTTTGATTTTAGGGAGAATTCGGTGTCATTTCACTCTTTTTTTGCACACTGACTTCTCTTACTGAAAATTTCTGTTGCCAAAGAGGAAAGACCGGAAAAGTTCAACCAAAATATGATTCCATGATTATCAAGTTGAGTTGAGCATTGATATATATTCCTTCTGACTCCTACCTTATGAATTACAAGTTGAATATGAGACGTGTTCTAGTGACTGTCTGGCATGCTTTTGTAGGACCTTGTTGAACGGATACTAGCCATTCTTTCTGATGAACAAGGTATGTTGATACATTTTGATAATATTCTTCTTTATAACCATACTTAACTTGTACAATTTAACACGATATGGTCCCCTTTGTTCGTGAAGAAACCTTAGTACATTGAATTCTGTTTCATAATTCTTTGCTGGAAAGCCTGGAATACATTGAAGTCTTAAATATATGCAATGCTTTTATGTTGATTGTTTCATTTGCATGTCTTGGTTGAGAGGGACTAAGGGAAAGCCAGTTGGTTGAGGGTTTGGTAGATTGTTATTAAGGTCTCACGTTCAATTCTTTTTGGTTAACTAGAACCAAAATTTCCTTGGTTTCTTCTGGTATTTGGCATTGTATGACCCCATTATTTTATGTCCGTGAACTCAAATTTCAGTTAGTTTATGGCTTTTATAAGGCTGGTCTTTTTTTTTCTTTTATTATTTATTTTTCGAGACAAATGCTTATGCATAATCGATATATGTTGCAATTATTTGTTTCAGTTTCAAAAATGTGGGCAAAGAAAAATGCTGTTGGAAAGGATCAAGTAGCAAAACTAGTCGATGACACATACAGGTTTGTCTTCTGGTGTCATATTTTTCTTCAAGGTTCTTGTTGACAAATTTCCTCTAAATTCAATAATTATTCACATGTTTCTGTTTCATAAATATATCTTATCTTGTGGTGTTTCTTGTAATCATCTCAAGTGGTGCTTCTTTTACACATTCTCTTATTCCTTTTGTATGCCCAACTGTCATCTTTTCTTTATTTCATCTTCTTACTTGTATGATTTTCTTATACAGAAAGATGCAGGTTTCTGGGGCCACGGATTTAGCTTCCAAGGGTCAAGGTGTCTCAGACAGCAGTACTGTGCAGGTAAAAGGTGAAACGGATGATTCTTTGCAATTAGATACAAAGGTTCGGTGTCTTTGTGGAAATGCTTTGCAAACAGAATCAATGATCAAGGTATATCAAACCATTCCTTTTATGCATTTTTCTTCATAAGTTCCTCCCATGTTTATTTATTCTGATCCCTTCCTGGTTCCAGCTGCCTTGAAACATCAGTTACTTTAGTTCACCTGCCTTTTTTTTTTAATAAAATTTTTGGCAGTGTGAGGATCCACGGTGCCAAGTGTGGCAGCACATTAGTTGTGTTATAGTTCCGGAGAAACCTACAGAAGGAAATCCGCCATACCCTGAACACTTCTATTGTGAGATCTGTCGACTCAATCGTGCTGACCCGTACGTATCAACTACCGTTATCTTTCGTTTAGACTATAGGCTTTCTTTTAAGGATATTTAGTCTATAGGCTTCTTATTTGTGGAATTGGTGGGTACTGCACACAAGTTTGCATGTGTTAATGTCACAATAATTATCCTCTTTTATTTTGTCCTTCCGATGTGAATGTCTGAATGGATTTGTGCAGTGGAGGGCTGTGAAGTATAGTGTTTGCTTGAGAGTATCTTCTTCATATGGTTTGATATATTTGGGGTTGTTTGGTATATCTCACATAGAGAGAAAAAGTATTAGTAGCTTGTCTCGTAGATATAAAGTGGGTGTATAGAAAGCAGAAATCTTTGACAACGAAATATTAAGATACTTCGGTGAAGGAAGTGATTTTATAATACTGGAATAATTAAGATTTGATCAGTTACAGTAATTTG
TATATTCGTTAAAGAAATTATAAACAGTTTGAAGAGGTTTATAAGAATACCAAGTAGGGACTTGTGGCTTAATGGTTTCTGCAGGATATTCTGTTGAATTATCATATTCTACCGACTATGACTATGAATGAGGTTTAGCCCCTTTAACCTTTGTTGATAAACTATTCACTTGTATATATCAAAATGGCGACTGAAAAGACTATGAATGAGTTAATTTTTTGGGGTGCCCAGATTGATCCTGTCAATTTTCTAACAAGAGTTAAATAACTAATGTTGCTTTGTTTCTTCTTTCATGGTGTATTTTCTAACAATTTTTTAATACAGTTTTTGGGTTTCAGTTGCACATCCTCTGTTTCCCGTAAAGCTGATAACCACAATGTCCACAAACATTCCAACGGATGGGTGAGTTCACATTTATTGACTTCTGATTCAAGATATTCTGTATCCCATCTCTTGAAAAGAAAAGGAAATCTGAAAACACTGTTTTTACTTGTGTGGATAGAATTTATTAGCTTAAGCTTAAACGTTTCTGTTGCTGCTTTTTGGTGTTGACTTTATAATTTCTTATGTAATTATTCAATTTATGATATTTGGACTAATTTCAGCGCTTTCTTGTAATATCACTTAGCCATGGGTTGAGGCTATCCTCTCCATTTTTTGCAGATTTCATTATGTCAGTGAAATGTCTAGTTTCTTTTCAAATAAGTGGAGATGATTATATTGACATTTTAAGAGTTGGAAGTAACTAAACGAATGGATGAAGGATGAAATGCTGGAGATGATTATCTCTATTATTGAAAAGTGCAATTTCTTTGATCAACACTGGAGCTAGGTGCTTTCTCTTTTTGTTTTATACATTTGATATTTGATGCAAGCAGTTTTCCTGGCAGAAGGTCTGATGTCTTGTTTGTCTATTAGTTTTTTTGTAACTATACTTGTTTTATGATTTATGAGAATTTGAAAAGTGTTTTTAGCGTTGAGAGAATATGTATGCCTTGTCCTTCTTCATTCCTTCAGGACACTTCAAATTGTTAATGGAATGTTACCTTTTTGGGAAATTGAAATACATTGACTTGTGAGTTCTCTTTAATCATGTATTTCTATGGCTGGGTAACCTTAACTTGCAAGTCAGTCTATTTGTCTATGGTGCTTATTTCAAACCACTTCTATATTTGTATGGCCTGTAATTGTGTTGTTTGCACCCCTTGCCGATGCCTCTAGAAGCCCAAATTTTTCCTTAGGGCGATCGTTTATTACAATGGAAAAGTTTGAGTTGGAGCAACACCCACTTATCAATCTTCTCCATCTTTTGCCAATCCTATTATATGCCTTTTCAAAATTGAGGTAGCTCTAGTCAACCCTATTGTACGCCTTTTTTTTTTTTTTTTGAAAAAGGAAACAATCTCTTCATTAAGTGAATGAAAAGGTACAACTAAGATAATGGATACAGATAGAAACTTACAAAATAGAGAGAAGATGGATCACATATAGTCATTCACAATATGGATCACATCTAAAATTTATCTTCCTTCAACAAAGGCCAATTGAGAGTTGTTGATTGTCAAAGGGAAAACTTTTCTTAAGCATTCAAACTTTTGAGGTAACTTTTTTGAGTGAAGACACAAGGTTGATAGGTCTAAATCTATTAACTTTGCTATCCACCTTCTTTCTGGATTCAAACCCAGGTTTCAGATTCATTGGTCTGTTTATTAATGATGCTGTTAGGGAAAAATTCTTGGAACACCTTAAGAAGGTCAACTTTAATAATAAAAAGAAGAAAATGGGTTAACATGAACAAGGTTTTTTGGTTTTCCTTTTTGGACATCTATACGGGTTATAAACTCCTTACCATGAAGATAAATCCAAAAAATGCTTTGCAACCCTGGTTGTAGCAGTCGAACCCAGTGCTAGTCGCCGCTGTCATTGGACCCAGCTCTAGTCATGCTGTCACTAGTTCTTTTCTTTGCTGTGCGCTTGCCTTCGTCTAGCATCCCCATTTG
CCTACTCTCTCTCCCTGAAGGTTTCAGCCTGAGTCAACCCTATCCACGGTCAACTAACCCTAGCTGCCGCTATTGCTAGGTTTTTGCTTTGCCGTCATCACCGAACATAGTAGCACAGTTGCTGGCCCTTATCGTCGTCGGTAACTATCTTCGTTTGTTTGGTCATGATTTTTGCTCCTTAATCCGTCACCACTGAACAACTGTTAGACACTTCTTCTATGTCGTTCTTCATTGTTGTTGCCACCTTCACCATTGCTGCCATCCTCCACTGTCGTTGCCCCCTAACAATCATCAGATTTTGACATACTTTTAGTTTCGTTTGTTATTTTTCAGATCTCCACTTTGTCTCCATGGATATGGAGAATTGTTGCATCCGGGACATCTTATACAGTTGTTGGGGTGAAAATTGTATGATTTTATTTTTATGGAAGTGGTGGAAAGTAATCGACTTGTTCCTTTTTCAGCTCCTCTTCCTTTGGTTGAATTACAACTTCCCGTTCACTCACTTGAGAAGAAATTTAAGGATGATTTTGGTGTCATTAGCTTATCCAACTTTAGATCATCTTTTGGATGACATTTTGAATGCTCTGTTTGGTCTTCCATAGGAGGAAGAAAGACTGATAGTTCCTGTCGACATTGAAAAGGTCGGATGGTTTCCCTAGAAGGATATGGATGGAGATTTGGCTTTCAACTGGTCCGTCACTTTTCCTAACGACACACAAAGTTGGCTCAGCTTACTGATTAGTTGGACTCCTTTCAAAGCAGATAAAAGTCATCTATGGCTTAGTCTTGTTTGTGCTGTTTTGTGGAAGATTTGGGAAGAAAGAAACTCTAGAATCTTCTGGGATAAGAGCAGGACTGCAAAGGATATTTTGATACTATCATTCATGATGCTTTTTTTGGTGCGAAGATTTACCGGCTTTACACAATCATAGCTTTTCTTTCCTCATTGCAAATTGGAAACACCTTTTATTTTCCCTCTAAATGGGTTCTTTTTGTATACCCTATGAATGGGCTCCATATGTTCCCTTGGATATTTCATTCAATCAATGAAATTGTTTCTTACCCAACAAAAAGAAAAGGTTGGATGGTTTTTTGGGAAATGGTAAATGATTTCCTCTAATCTCTTGAGAACAAGATGCATTCGGAGTCTGAATCTTTTCATCAGTTGAAAGTAGCAGGAAACTCTCAGGATGATGTTTATTTTCTCTTTGGGTAAGATAAGAAAATAAAAAGAAGTATGGATTTGAATTTTATCTCCATTTTGGTGGCATCTCATTTGCTTGCTCATTACTCTTGGTAGTTCAAGTTTCCTTAGAAGATTTTTTTCCGTCTAAAGTTCACACGAAGATTTTTCAGACCTCAACTTTAATGGTAAGCGGAAGCTTTTTGGAGGCTTTCATATGAAACTTGAGCATTGGTCTAATAAGAATCACTGGAGGGAGTTTGTCAATTTCAAATTTACCTTTAGAATACTGGAAGAATCGTGTTTACAGCCAGAATTTGTCAACTTTGAGCTAAAGTAAGCTGCTTCATGATTGTGATCTGCTGGAATATGTGAATCTGATTATGTGTGTGATAGGCTTGATTGCTGTATTCAAGCAAAGCAACCTCTACCTAGTAATCTGTGCTCAAATGGACTACTAGCAAAGTAAAGACATTCATTTTTTGTTCAATCCTTGATCTAGCAAAATTGTTACAGTCAAGTGAACTTTTATCTTATTTGACCATCCTTAACCATTTTGTATGTGGTCAAGAAGATATATATGTATATATACAAACCCTTCGACAAATCTCGCATTAGTTACAACAGCAAGTTTTTTCATACATGAGGTGTTGAGTTATTACTATTTTTTTTTTTATTTTTTATTTTTGGGGTAAGAAGCTACGCTCGTGTCAACTTTGTTTCCTTTGGAACAAGGTAAGATATCAGGTTAAAAAATGGGAGATCAATTTTAACGTTAGAAGTTATGTTTTCATCTGAAGTATGTGGTCTTTTG
AGCCTACATCTACGTAATGGGGAAGTTCTATAGACGCATCATTATTATTATATTTCTGCATAGTGCATTCACATTATTTCTTTACTTCTAATCTTGGAAATACATTTTTTCTTGCTGATCTTTGTGAGTAACCATTGTTTATCCAAGTGATTTCCGATTCTTGGAGGTTTGAACTTTTGCATGTGGTACCCCAACGATTTATTTGAGTTTTAATATTACATGCTTCCTCATGTTCCATCACATTTATAAAATGATCTCCTGTGTTTTGCATGATGGGCCTCTTGTACAAATTTGTAGTACAAATCCGATGCAGAGTGTGGATAGAACGTTTCAACTCACAAGGGCAGACAAGGACCTACTTTCTAAACAGGAATACGATGTGCAGGTTGGTTAACGCATCATATGGTGAATCTCTTTCCCTGTAAACTGATTGTTTTCTTGAACTAAGTATGTTTTTATTATGGTAGGCCTGGTGTATGCTTTTGAATGATAAGGTTCCATTCAGGATGCAATGGCCTCAATATGCAGATTTACAAATTAATGGTATGATATTTTTTTAAAAGAAAATTAAGGTTAAGTTATTAATTTATTGCTTTATGTATCAAAACTTTAGGCCTTGTTTGATAATGTTCATGTTTCCTATTTTCTGTTTTTGTTTCTAGTATTTTATGAAACATACTTGTTTGATAATCCATCCCCTTTCTTGTTTCCAAAATTTGAAAAATGTTTCTAAAATTGAGGATGAATTTCAGAAACTACTAAAAGCAATTTCTTTCTATTCCATTTCCTTTTTTCACTCTTATATTTACGTTCCTTCATAATATCAAAACTAGTCTTTGGTTTGTTAGCACGGGGGGCGAATAGTGATTGCCTTTGCTCTCTAGTAGGTGTGATCATCAGTTGGTCTGCGCCTGTGCTATGCTAAACCAATACCGACCACTGATGTTGTTTTTTGTCAGTCGGGTTTGATCAGTTCTTGTGATGTTGGTGACGCAGACCATCTAATGGTCAGTTGGTTGGTTTTGGTTGGTTTTTCCAACTCCAATGAACTAATCACAAAAGATGGCATGAACTAAATGGAATAAATGTGAGAAAGGGAGAAGAAAATAAGAAAAGTGGGAAGAAAGGGAGAAGAGAGAACAGATGAAAGAAAGAAGAAAATGGGAGAAGGAAGAGAGAAGGTAACACTAAAAGAAGGAAGAATAGGAAAAGAGGTTACGGAAGGGAGAAGGAGAAAAAAAGCAGTTCACGGGGAGCTTGGCGGCGCTCGGTTCTTCCAGTGGAAGACACTAAAAAGGAAAAAAAAAACAGAAGAGAGAGGAAAGCGAGGAGAGAAGAAGAGAGAGCAAAGGAGGAAGAAGAAAAAAAATGAAAGATGGAGGAAACAGAGAAGAAAATAGATTTTTTTTTTTTAAAGAGGTAATGATTATCTAAGGATAAAAACAAATTGGCCTACATGCTGCCGCGTAATGCAATTGTTTCTTCTTTTAATATACATATTGGCCAGATCAGTCGGTTTGAGCTTTTCTGGGCTCGCCGACCGGCCAATTGATTGGCTGGTTTTTCTATGGGCCAACGAAAAATCGACTCTGACTGACCAAAGTCAGTTTGGTCGGTTTTGGTTGGCTGGCTTGGCTTTTGGTCTTTGTTGCTCACCCTACTCTCTAGTTCCTGCACACTCACACACACACTATTTGAATTCTGAGTTTTACATTTTGCAAAATTTATATTTATTTACTATTTTGATTTCTGGTTCCAATTTTGAATTTAAATTTTGCATAATTTATATTTATATACTATTTTAATTTTTAGTACACTTTATAGAATTTTGTAAAATATAAATTATCCTCACTAAAAGATGCATTTAATCTCACTTTAATATGTAGAATATGTATATTAATGTACTAATTTAATTTTGAGTTTTCATTTTTGCACAATGTATATTATATACTAATTTTTATTACACATGGAACCGGTTAAAATTTTGCAATATATAAATT
TTACTCACTAAAACATGCACTTAGCCTCAATTTAATATTTTAAAACTACAACATTTATAGAGCAATATAATTGAGTTGGATTTGAGTTGACATCATTTAAGGCAAATCACTCAATATTGCTGATCAAATTGGGACTAGATTTGAAGGATAGGCAATTTTATTATTGTTGTTAATTTTACCTCAGGAAGATAGTGTAAGCTTTGATAATTGCTTTTTATATGTTTTTATGTATATTATTTGTTATTATTTATTACATTATATAACTAAATTCTATTTAAACTTTAAAAAGGGAAAAAGGGAAATGAAGGAAGAAGAAATAGTTATCATGAGTTTATATTTTTGTTTCTTTCTCCAAGAAACAAGGAATAGTAATAGTTATCAAACACATTTTTGTTTCTAATCTTTGAAAAACAAGAACAAGGAAACAGAAACGTTATCAAACAGAACCATAATTTCTAGTCTTGCATTTGTATTTATCTATTTATATATTAAACCTATAATTGCCAGTTTTGTGACTGTATTTATTTATCCTTTTTTTGGGTTTTGTTTTGTTTTGTAAGTTTGTTATTAAGTAGGTCTGCGTTTAATTCTGGATAAATTCATTGTTGCATTATGCATGTATGATAGAGGATCTCTACTATGCAGTAGGACAGTTCATTTAGTGTTGAATAACAAATTTTAATGACTTCACAGGTTTGGCTGTTCGTGCCATTAATAGGCCTGGCTCTCAACTTTTGGGGGCGAATGGTCGTGATGATGGCCCAATTGTAAGTTCTCATGCTAAATTGCAATTTTCCTGTATTCTAGTCGAGCAATTATATTTTCTCTTACTCCTATCTTTCTGTCTCCTTTGTTCTGTCAGATCACAGCATGCACGAAAGATGGAATGAATAAGATAACCTTAACAGGGTGTGATGCTCGATCTTTTTGTTTAGGGGTTCGGATTGTGAAACGACGGACTGTTCAACAGGTAACTTTTGTTGGATCAATGAAATGGTCTGTTTCCTATTAAAAAATGGAAAAATTAGTTGTTGAATGTCTGACTAGGTTTTGGATATCATTTTGGAGCACCTTTTATGTTTTTAGCCATATCTAGAATATGAAGCACCACATTTTTCTAGTGGTGCAGTTTTCTAAGACATAAGTACGTCTTTGTTCTTATACGTTCTTATACAGTGTCATTTCTTGTTATGTATGCATCTTCATAGAACTAAACATACCTTTCTAATTAAGTTACTAGTTTCTGTAGCCAAAAAGATAGGTAGAGATGTTGAGAAAATTTTGTAGAGGGGGACGGATGGTGATGCAGGCTTCTTGAGTTGATGGGGTGTGGTCGCTAAACCTTAAACTGGTGGTGGTTTAGGCATTTGAGATTTTATTTATAAGAATGTTGCTGTTAGGAAAGTGGTGCTTTAGAGTATCTTGAGGGATGAACTTTTTGGGCGTTATTTGTGAGAGGTAAATTGAAATTGGGTGCATGAGAAAGTGGGAGATGGCGGGAACTAGAAAAGCAAAAAAGAAGGTTGGAGGCTTTTCCCACATTAGAAAGGAGAGGGGAGTGAGACATGTTACAAATGCATGTGTTTGTATTCATTTTCATTTCTACTGTTATTGTCCCTCACAAGTCTTGTCTTTATTATTATCATTATTATTATTTTTTAATGAGAAATTTCAAAAGTTATATTCCAAAAGATCACAATAGCCTAGGGGCGGGGTTGAGGCAATCCCCCAAAGAGATCCAAGGAGGGCCTTGCAATAAAAAAAATCATAAAAAAGGCTATAATTACAAAAGAATTTCTTGTGCGAAGATATCTATCAAGAGACCGTATTTTGTATGCTATCACCAAAATTTAAAAGGCTAAAAGATATCACAAGTTTTGTCTTTAATTATCATTAGTATTTTTGTTTCTACTTTTTTATGTCTTTTATGTTTTTTCCTCCCTAGATTCTAAGCATGATTCCTAAGGAGTCCGAGGGTGAACATTTTCAAGATGCTCTTG
CCCGTATTTGCCGTTGCATCGGTGGTGGAAACACGGCAGATAATGCTGATAGTGATAGTGACTTGGAGGTGGTTGCAGAATTTTTTGGGGTTAATCTACGCTGTCCAGTAAGTCTCATGCTATTTAATGAAAGACATTAGTCTCTTATTAAAAAAAGAAAAATTAAAGACGTTAGTCTCCTACATTGTATTATCATGCTTATGAATTGAGATATTTGGGAGAAGGGGGAGGAGAAGTGTTTCTCAATCCTTTAACTTTTCATCGGCACTGGAGTGGGACTAATGAAATTACTACAGAAGGGGAAGGATAACCAAACCCCAAAAGAAGATTACCTAAAACTCTTCTAGCTTGCACTAAGAGAACTTAGACTATCATTGTTTTGTTAATTTGGTAAGAAACCAAGCTTTCTTAGAGAAAAATGAAAAAAACATAAGAGGGAATACAAAAAACAAGCCTACAAAAAAGGGCCCAATATAAGCTATAAGAACTTCAATCTAAGAGAATGTGACCAAGCTGACAATTATAAAATGTTTGATTGACACTGACAGACAGCGGTAAAGAAGAATCGAACATAAACTTCTTCTCTAAAAATACTGTGTAATCGAGGAAAGATGGTTGAAGCTAGAGTGAAAATTCACTTTTCAGCAAGAAAGTTATTAAATCTTTTTGGAGACCAGCATACTGTGTAATCGAACATATGTATTTTTCCTCTGTGCTCTGTTTTTAGATGAGTGGTTCAAGGATGAAGATTGCTGGACGATTTAAACCGTGTGCTCATATGGGATGCTTTGATCTTGAAGTGTTTGTAGAATTGAATCAACGTTCAAGGAAGGCAAGTTGTTAAAAAAATTTCTTATGCAAATCTTGTTCTCATCTCATATCTGACTGTTTTTGTATTTTTACTGCAGTGGCAGTGTCCAATTTGTCTGAAGAACTATGCACTGGAGAATGTTATCATAGACCCTTATTTCAACCGTATAACAGCTATGGTACTGCCTTATGCATTCTTCATCCACATTTTCTGAATGGAAGTGATTTAATGATATCGTATTATGGTTGTACAGATGAGACATTGTGGAGAAGATGTTACAGAAATTGAGGTTAAACCTGATGGTTCCTGGCGTGTAAGGTCAAAAACTGAAAGTGAGCGCAGGGAACTTGGTGATCTTTGCATGTGGCACTCCCCTGAGGGCAATTTGTGTGTTTCAAATGAGGAAGCTAAACCAAAGATGGAAGCATTGAAGCAGATAAAGCAGGAAGGTGGATCAGATCGTGGTCTGAAACTTGGCATCAGGAAGAATAGTAATGGGGTCTGGGAAGTCAGCAGACCTGAAGATATCAATACTTTCACTTCTGGTAGTAGATTACCGGAGAACTATGGGAGCCATGATCAGAAGATTATCCCAATGAGCAGCAGTGCCACCGGCAGTAGGGATGGTGAAGATCCAAGTGTAAACCAAGATGGTGGTGTAAACTTTGATTTCTCTACCAATAATGGGATTGAGATGGATTCTCTTTCCTTAAATGTTGACTCAGCATATGGATTTACTGAACAAAACCCTATTGCTCCAGTAGGTGAAGTCATTGTTCTAAGTGATTCAGATGATGAAAATGACATTTTAATTTCATCTGGAACTGTTTATCCGAGCAACCATACAGATGCAGGTGAAGTTCCTTTCTCCATGCCACCTTCTGGACTCACAGATGCATATCCTGAAGATCCAACTCTACTATCTGCTGGAAATTCATGCTTGGGTCTTTTTAATTCCCATGATGATGAATTTGGCATGCCTGTTTGGTCATTACCCCCTGGAACTCAAGGAGGTGCTGGATTCCAATTGTTTAGCTCTGATGCAGATGTATCAGATGCTCTAGTTGACTTGCAGCATAATTCCATTAATTGTTCCACCATGAATGGATATGCAGCTACCCCAGAGGCAGCTATAAGTCCGGCTTCTCTTGTTCCAGGTTCTTCAATTGGTCGTACTGATGGTGACATG
AATGACAGCTTGGTTGATAATACCTTAGCTTTTGCTGGTGATGATCCATCTCTTCAAATATTTCTTCCTACAAGGCCTTCAGATGCACCTATGCAGTCTGATTTCAGAGATGAGGCTGATGTATCAAATGGTGTTCATACTGAAGATTGGATTTCTCTCAGGCTTGGGGGTGATGCAGGAGGTAGCAATGGGGAGTCCACAACCTCCAAGGGATTAAATTCAAGACAACATGTTCCATCAACAGGAGGTGAAATCAATTCATTGTCTGATACTGGTTAGTTATTTTCTGTTTTCCCTCGTGTTGCATTTGTGAAGATTGGTTAAATTTTATGATGGTTCCAGATCTGAGGAGCATGAATTGCTGAATAGGCCTTTTGTAGTTACTGCAATGTTTTGCTTGCAAAGTTTTTTATTTTTCTTCTTGTTCTTCACTATGTTTTATTACGACTTTTTACTATAATCAGTGAAAAGTGTTATCTATATATTTGTTTTCCCTTTCTTTTGCAACTTCACTTTACTGGTCATGTCCATTTTCGTTTGGATTTACCAATTCTTTCAAAGAGCTGTTAGTTATTCTATATTTTTAGTATTTAGCGTTTGAAAGATTGGAAAGCTATGAAAAAAGCTACCTGGACATAGCTCAACCAATCGAGACATTTTCAATCTCTTGGTTTATTGGTTAAGGTATATACATCCTCCACGAATAGATTAGAGGTCTGAATTCCCTCTCCATGTGTTTAAATTAAAAAAAGGTCTTAATTTACACTCATTACACTCGGTGAAGGGTCTTTTGGTAACTCTCATTAAGCCTTTGCGGTGGCTTTACTCCCCCTTTTGTACAATTCATCAAATTAATGAAATAATGTTTCTGATTTTATTTTATTCTATAAAAAACCTTCTTAATTCCAGCATCTCTCGTTTCTGCATTTGTTTTTATTCGAAATTCTCTGGCGTATTCTTGATTTTGAGGCTAACTATATCCTTGCATCAGCTTCTCTGCTTCTTGGAATGAATGATGTTAGACATGACAAAGCAAGTAGGCAAAGATCAGATAGTCCTTTCTCCTTCCCTCGCCAGAAACGTTCTGTCAGACCGCGGATGTGCTTTTCTATAGACTCCGAGTCAGAGTAGGCTTTGGACTTTAGGTGACCAAGAATTGTTGCTGCTCTCATTTGATTTTTTTCTGGGGATAAGCAAGACCGTAAATATTCATTTTGGTCCATCTCATCTTCAAACTTCAAAAGTATGTTCAATATTTTGTGGGTAAGAAAGTTGATGACACTGATTTGCTCATAAGCGAGCCATCATCGTGGAACGAAGCTAGCGGAAGGAATCTGGCCAGTCATTTTGGAAATAGGTAAGGGATATATATCATACCCTTAATTTCAATTTTCTTGGTCCTGACGCCCAGTCCTAATTTGACTTGTGAGATAGCAGGTTGGTCGGATGACTTGTGTGCCTCTAGTTCCAGTGCGGGTTAAAACTGTACAGAGAGATGTGGTCATATGAATTTGGTAGCCGTCATGTAATACAAACAAATTTTTTCCTCTACGATGAGAGCAATTAGCTGTAAATAATCTTTGATTGAGATCTAGTTTATTTATAGAAATTAATTTTACACCAGAAAGAAAATTTTAATGTTTAATTGGTCTAACATTTCTAGATTTCCATGATATGGTAGTTTATTGTCATGTCAAAGCATTGGTAATGTAATGAGTGGT

mRNA sequence

CTGGTTGGGGTTTCGGTCAAAAAGAAACACTAGCGCTAAAGGACAGGGTCGAGTGGGGAAGTCGGCGTGGGCGACCCGAGCCTCTGTTCATCAATATACCGCTTTCAATTCTTCTTCCACTTTGTACAAATCCCAAATTTTCCTTTAGTTTTGAGGACCATGCGTTGGTGGTGTGTGTTGGGTTACTGTTAAAATCGTCTTTTTGGCATACCATCGTTAGGGATTTTGTGGGTTGTTGCTTGGAAGGGATGGACTTGGTTGCTAATTGCAAGGATAAATTGGCTTATTTTCGAATAAAAGAGCTCAAGGATATTCTCACTCAACTAGGTCTCTCAAAGCAAGGAAAGAAGCAGGACCTTGTTGAACGGATACTAGCCATTCTTTCTGATGAACAAGTTTCAAAAATGTGGGCAAAGAAAAATGCTGTTGGAAAGGATCAAGTAGCAAAACTAGTCGATGACACATACAGAAAGATGCAGGTTTCTGGGGCCACGGATTTAGCTTCCAAGGGTCAAGGTGTCTCAGACAGCAGTACTGTGCAGGTAAAAGGTGAAACGGATGATTCTTTGCAATTAGATACAAAGGTTCGGTGTCTTTGTGGAAATGCTTTGCAAACAGAATCAATGATCAAGTGTGAGGATCCACGGTGCCAAGTGTGGCAGCACATTAGTTGTGTTATAGTTCCGGAGAAACCTACAGAAGGAAATCCGCCATACCCTGAACACTTCTATTGTGAGATCTGTCGACTCAATCGTGCTGACCCTTGCACATCCTCTGTTTCCCGTAAAGCTGATAACCACAATGTCCACAAACATTCCAACGGATGGATCTCCACTTTGTCTCCATGGATATGGAGAATTGTTGCATCCGGGACATCTTATACAGTTGTTGGGGTGAAAATTAGTGTGGATAGAACGTTTCAACTCACAAGGGCAGACAAGGACCTACTTTCTAAACAGGAATACGATGTGCAGGCCTGGTGTATGCTTTTGAATGATAAGGTTCCATTCAGGATGCAATGGCCTCAATATGCAGATTTACAAATTAATGGTTTGGCTGTTCGTGCCATTAATAGGCCTGGCTCTCAACTTTTGGGGGCGAATGGTCGTGATGATGGCCCAATTATCACAGCATGCACGAAAGATGGAATGAATAAGATAACCTTAACAGGGTGTGATGCTCGATCTTTTTGTTTAGGGGTTCGGATTGTGAAACGACGGACTGTTCAACAGATTCTAAGCATGATTCCTAAGGAGTCCGAGGGTGAACATTTTCAAGATGCTCTTGCCCGTATTTGCCGTTGCATCGGTGGTGGAAACACGGCAGATAATGCTGATAGTGATAGTGACTTGGAGGTGGTTGCAGAATTTTTTGGGGTTAATCTACGCTGTCCAATGAGTGGTTCAAGGATGAAGATTGCTGGACGATTTAAACCGTGTGCTCATATGGGATGCTTTGATCTTGAATGGCAGTGTCCAATTTGTCTGAAGAACTATGCACTGGAGAATGTTATCATAGACCCTTATTTCAACCGTATAACAGCTATGATGAGACATTGTGGAGAAGATGTTACAGAAATTGAGGTTAAACCTGATGGTTCCTGGCGTGTAAGGTCAAAAACTGAAAGTGAGCGCAGGGAACTTGGTGATCTTTGCATGTGGCACTCCCCTGAGGGCAATTTGTGTGTTTCAAATGAGGAAGCTAAACCAAAGATGGAAGCATTGAAGCAGATAAAGCAGGAAGGTGGATCAGATCGTGGTCTGAAACTTGGCATCAGGAAGAATAGTAATGGGGTCTGGGAAGTCAGCAGACCTGAAGATATCAATACTTTCACTTCTGGTAGTAGATTACCGGAGAACTATGGGAGCCATGATCAGAAGATTATCCCAATGAGCAGCAGTGCCACCGGCAGTAGGGATGGTGAAGATCCAAGTGTAAACCAAGATGGTGGTGTAAACTTTGATTTCTCTACCAATAATGGGATTGAGATGGATTCTCTT
TCCTTAAATGTTGACTCAGCATATGGATTTACTGAACAAAACCCTATTGCTCCAGTAGGTGAAGTCATTGTTCTAAGTGATTCAGATGATGAAAATGACATTTTAATTTCATCTGGAACTGTTTATCCGAGCAACCATACAGATGCAGGTGAAGTTCCTTTCTCCATGCCACCTTCTGGACTCACAGATGCATATCCTGAAGATCCAACTCTACTATCTGCTGGAAATTCATGCTTGGGTCTTTTTAATTCCCATGATGATGAATTTGGCATGCCTGTTTGGTCATTACCCCCTGGAACTCAAGGAGGTGCTGGATTCCAATTGTTTAGCTCTGATGCAGATGTATCAGATGCTCTAGTTGACTTGCAGCATAATTCCATTAATTGTTCCACCATGAATGGATATGCAGCTACCCCAGAGGCAGCTATAAGTCCGGCTTCTCTTGTTCCAGGTTCTTCAATTGGTCGTACTGATGGTGACATGAATGACAGCTTGGTTGATAATACCTTAGCTTTTGCTGGTGATGATCCATCTCTTCAAATATTTCTTCCTACAAGGCCTTCAGATGCACCTATGCAGTCTGATTTCAGAGATGAGGCTGATGTATCAAATGGTGTTCATACTGAAGATTGGATTTCTCTCAGGCTTGGGGGTGATGCAGGAGGTAGCAATGGGGAGTCCACAACCTCCAAGGGATTAAATTCAAGACAACATGTTCCATCAACAGGAGGTGAAATCAATTCATTGTCTGATACTGCTTCTCTGCTTCTTGGAATGAATGATGTTAGACATGACAAAGCAAGTAGGCAAAGATCAGATAGTCCTTTCTCCTTCCCTCGCCAGAAACGTTCTGTCAGACCGCGGATGTGCTTTTCTATAGACTCCGAGTCAGAGTAGGCTTTGGACTTTAGGTGACCAAGAATTGTTGCTGCTCTCATTTGATTTTTTTCTGGGGATAAGCAAGACCGTAAATATTCATTTTGGTCCATCTCATCTTCAAACTTCAAAAGTATGTTCAATATTTTGTGGGTAAGAAAGTTGATGACACTGATTTGCTCATAAGCGAGCCATCATCGTGGAACGAAGCTAGCGGAAGGAATCTGGCCAGTCATTTTGGAAATAGGTTGGTCGGATGACTTGTGTGCCTCTAGTTCCAGTGCGGGTTAAAACTGTACAGAGAGATGTGGTCATATGAATTTGGTAGCCGTCATGTAATACAAACAAATTTTTTCCTCTACGATGAGAGCAATTAGCTGTAAATAATCTTTGATTGAGATCTAGTTTATTTATAGAAATTAATTTTACACCAGAAAGAAAATTTTAATGTTTAATTGGTCTAACATTTCTAGATTTCCATGATATGGTAGTTTATTGTCATGTCAAAGCATTGGTAATGTAATGAGTGGT

Coding sequence (CDS)

ATGGACTTGGTTGCTAATTGCAAGGATAAATTGGCTTATTTTCGAATAAAAGAGCTCAAGGATATTCTCACTCAACTAGGTCTCTCAAAGCAAGGAAAGAAGCAGGACCTTGTTGAACGGATACTAGCCATTCTTTCTGATGAACAAGTTTCAAAAATGTGGGCAAAGAAAAATGCTGTTGGAAAGGATCAAGTAGCAAAACTAGTCGATGACACATACAGAAAGATGCAGGTTTCTGGGGCCACGGATTTAGCTTCCAAGGGTCAAGGTGTCTCAGACAGCAGTACTGTGCAGGTAAAAGGTGAAACGGATGATTCTTTGCAATTAGATACAAAGGTTCGGTGTCTTTGTGGAAATGCTTTGCAAACAGAATCAATGATCAAGTGTGAGGATCCACGGTGCCAAGTGTGGCAGCACATTAGTTGTGTTATAGTTCCGGAGAAACCTACAGAAGGAAATCCGCCATACCCTGAACACTTCTATTGTGAGATCTGTCGACTCAATCGTGCTGACCCTTGCACATCCTCTGTTTCCCGTAAAGCTGATAACCACAATGTCCACAAACATTCCAACGGATGGATCTCCACTTTGTCTCCATGGATATGGAGAATTGTTGCATCCGGGACATCTTATACAGTTGTTGGGGTGAAAATTAGTGTGGATAGAACGTTTCAACTCACAAGGGCAGACAAGGACCTACTTTCTAAACAGGAATACGATGTGCAGGCCTGGTGTATGCTTTTGAATGATAAGGTTCCATTCAGGATGCAATGGCCTCAATATGCAGATTTACAAATTAATGGTTTGGCTGTTCGTGCCATTAATAGGCCTGGCTCTCAACTTTTGGGGGCGAATGGTCGTGATGATGGCCCAATTATCACAGCATGCACGAAAGATGGAATGAATAAGATAACCTTAACAGGGTGTGATGCTCGATCTTTTTGTTTAGGGGTTCGGATTGTGAAACGACGGACTGTTCAACAGATTCTAAGCATGATTCCTAAGGAGTCCGAGGGTGAACATTTTCAAGATGCTCTTGCCCGTATTTGCCGTTGCATCGGTGGTGGAAACACGGCAGATAATGCTGATAGTGATAGTGACTTGGAGGTGGTTGCAGAATTTTTTGGGGTTAATCTACGCTGTCCAATGAGTGGTTCAAGGATGAAGATTGCTGGACGATTTAAACCGTGTGCTCATATGGGATGCTTTGATCTTGAATGGCAGTGTCCAATTTGTCTGAAGAACTATGCACTGGAGAATGTTATCATAGACCCTTATTTCAACCGTATAACAGCTATGATGAGACATTGTGGAGAAGATGTTACAGAAATTGAGGTTAAACCTGATGGTTCCTGGCGTGTAAGGTCAAAAACTGAAAGTGAGCGCAGGGAACTTGGTGATCTTTGCATGTGGCACTCCCCTGAGGGCAATTTGTGTGTTTCAAATGAGGAAGCTAAACCAAAGATGGAAGCATTGAAGCAGATAAAGCAGGAAGGTGGATCAGATCGTGGTCTGAAACTTGGCATCAGGAAGAATAGTAATGGGGTCTGGGAAGTCAGCAGACCTGAAGATATCAATACTTTCACTTCTGGTAGTAGATTACCGGAGAACTATGGGAGCCATGATCAGAAGATTATCCCAATGAGCAGCAGTGCCACCGGCAGTAGGGATGGTGAAGATCCAAGTGTAAACCAAGATGGTGGTGTAAACTTTGATTTCTCTACCAATAATGGGATTGAGATGGATTCTCTTTCCTTAAATGTTGACTCAGCATATGGATTTACTGAACAAAACCCTATTGCTCCAGTAGGTGAAGTCATTGTTCTAAGTGATTCAGATGATGAAAATGACATTTTAATTTCATCTGGAACTGTTTATCCGAGCAACCATACAGATGCAGGTGAAGTTCCTTTCTCCATGCCACCTTCTGGACTCACAGATGCATATCCTGAAGATCCAACTCTACTATCTGCTGGAAATTCATGCTTGGGTCTTTTTAA
TTCCCATGATGATGAATTTGGCATGCCTGTTTGGTCATTACCCCCTGGAACTCAAGGAGGTGCTGGATTCCAATTGTTTAGCTCTGATGCAGATGTATCAGATGCTCTAGTTGACTTGCAGCATAATTCCATTAATTGTTCCACCATGAATGGATATGCAGCTACCCCAGAGGCAGCTATAAGTCCGGCTTCTCTTGTTCCAGGTTCTTCAATTGGTCGTACTGATGGTGACATGAATGACAGCTTGGTTGATAATACCTTAGCTTTTGCTGGTGATGATCCATCTCTTCAAATATTTCTTCCTACAAGGCCTTCAGATGCACCTATGCAGTCTGATTTCAGAGATGAGGCTGATGTATCAAATGGTGTTCATACTGAAGATTGGATTTCTCTCAGGCTTGGGGGTGATGCAGGAGGTAGCAATGGGGAGTCCACAACCTCCAAGGGATTAAATTCAAGACAACATGTTCCATCAACAGGAGGTGAAATCAATTCATTGTCTGATACTGCTTCTCTGCTTCTTGGAATGAATGATGTTAGACATGACAAAGCAAGTAGGCAAAGATCAGATAGTCCTTTCTCCTTCCCTCGCCAGAAACGTTCTGTCAGACCGCGGATGTGCTTTTCTATAGACTCCGAGTCAGAGTAG

Protein sequence

MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAVGKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNALQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRKADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYDVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDGMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTADNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLEWQCPICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLCMWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFTSGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEMDSLSLNVDSAYGFTEQNPIAPVGEVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSGLTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALVDLQHNSINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQHVPSTGGEINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE
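
The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, the function below translates a DNA string with the standard genetic code and stops at the first stop codon. It assumes the standard code applies to this nuclear gene; `translate` is an illustrative helper, not a tool used by the database. The CDS is wrapped across lines in this record, so the lines would need to be joined before translating the full sequence.

```python
# Standard genetic code in compact form: codons enumerated with
# base order T, C, A, G at each position ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove whitespace/line breaks and normalize case.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGGACTTGGTTGCTAATTGCAAGGATAAA"))  # MDLVANCKDK
```

The output matches the first ten residues of the protein sequence, and the final codons of the CDS (`TCAGAGTAG`) translate to `SE` followed by a stop, consistent with the protein's C-terminal `...SESE`.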
Homology
BLAST of Cla97C03G053340 vs. NCBI nr
Match: XP_038894176.1 (E3 SUMO-protein ligase SIZ1 isoform X2 [Benincasa hispida])

HSP 1 Score: 1620.5 bits (4195), Expect = 0.0e+00
Identity = 816/894 (91.28%), Positives = 833/894 (93.18%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV
Sbjct: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60

Query: 61  GKDQVAKLVDDTYRKMQV-SGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGN 120
           GKDQVAKLVDDTYRKMQV SGATDLASKGQGVSDSS VQVKGETDDSLQLDTKVRCLCGN
Sbjct: 61  GKDQVAKLVDDTYRKMQVSSGATDLASKGQGVSDSSNVQVKGETDDSLQLDTKVRCLCGN 120

Query: 121 ALQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSR 180
            LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADP   SV+ 
Sbjct: 121 GLQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPFWVSVA- 180

Query: 181 KADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEY 240
                    H    +  ++     I   GT+        SVDRTFQLTRADKDLLSKQEY
Sbjct: 181 ---------HPLFPVKLITTMSANIPTDGTN-----PMQSVDRTFQLTRADKDLLSKQEY 240

Query: 241 DVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKD 300
           DVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKD
Sbjct: 241 DVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKD 300

Query: 301 GMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTA 360
           GMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPK+SEGE FQDALARICRCIGGG+TA
Sbjct: 301 GMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKDSEGERFQDALARICRCIGGGSTA 360

Query: 361 DNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQ 420
           DNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE           WQ
Sbjct: 361 DNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLEVFVELNQRSRKWQ 420

Query: 421 CPICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDL 480
           CPICLKNYALENVIIDPYFNRITA+MRHCGEDVTEIEVKPDGSWRVRSKTESERR+LGDL
Sbjct: 421 CPICLKNYALENVIIDPYFNRITALMRHCGEDVTEIEVKPDGSWRVRSKTESERRDLGDL 480

Query: 481 CMWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTF 540
           CMWHSPEG +CVSNEE KPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSR EDINTF
Sbjct: 481 CMWHSPEGIVCVSNEEVKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRTEDINTF 540

Query: 541 TSGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEMDSLSLNV 600
           TSGSRLP+NYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIE+DSLSLNV
Sbjct: 541 TSGSRLPDNYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIELDSLSLNV 600

Query: 601 DSAYGFTEQNPIAPVGEVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSGLTDA 660
           DS YG TEQ PIAPVGEVIVLSDSDD+N+ILIS GTVYPSNHTDAGEVPFSMPPSGLTDA
Sbjct: 601 DSTYGLTEQTPIAPVGEVIVLSDSDDDNNILISPGTVYPSNHTDAGEVPFSMPPSGLTDA 660

Query: 661 YPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALVDLQH 720
           YPEDPTLLSAGNSCLGLFNSHDDEFGMPVW LPPGTQGGAGFQLFSSD DVSDALVDLQH
Sbjct: 661 YPEDPTLLSAGNSCLGLFNSHDDEFGMPVW-LPPGTQGGAGFQLFSSDPDVSDALVDLQH 720

Query: 721 NSINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLP 780
           NSINCSTMNGYAAT EAAISPASLVPGSSIGRTDGD+NDSLVDNTLAFAGDDPSLQIFLP
Sbjct: 721 NSINCSTMNGYAATQEAAISPASLVPGSSIGRTDGDINDSLVDNTLAFAGDDPSLQIFLP 780

Query: 781 TRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQHVPSTGG 840
           TRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGES TSKGLNSRQH+PSTGG
Sbjct: 781 TRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESRTSKGLNSRQHIPSTGG 840

Query: 841 EINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE 883
           EINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPR+CFSIDSESE
Sbjct: 841 EINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRICFSIDSESE 878
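
The percentages in an HSP header are the match counts divided by the alignment length (gaps included), which is why the denominator (894 above) can exceed the 883-residue query. The sketch below recomputes them from a header line. `hsp_fractions` is a hypothetical helper, and the regular expression is only an assumption about this page's header layout, not a general BLAST parser.

```python
import re

def hsp_fractions(header_line):
    """Return {label: (matches, aligned_length, percent)} for 'X = a/b' fields."""
    result = {}
    for label, matches, total in re.findall(r"(\w+) = (\d+)/(\d+)", header_line):
        matches, total = int(matches), int(total)
        result[label] = (matches, total, round(100.0 * matches / total, 2))
    return result

line = "Identity = 816/894 (91.28%), Positives = 833/894 (93.18%), Query Frame = 0"
for label, (matches, total, pct) in hsp_fractions(line).items():
    print(f"{label}: {matches}/{total} = {pct}%")
```

For the header above this reproduces the reported 91.28% identity and 93.18% positives.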

BLAST of Cla97C03G053340 vs. NCBI nr
Match: XP_008463667.1 (PREDICTED: E3 SUMO-protein ligase SIZ1 [Cucumis melo])

HSP 1 Score: 1619.4 bits (4192), Expect = 0.0e+00
Identity = 813/894 (90.94%), Positives = 832/894 (93.06%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLV+RIL ILSDEQVSKMWAKKNAV
Sbjct: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVQRILDILSDEQVSKMWAKKNAV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
           GKDQVAKLVDDTYRKMQVSGATDLA+KGQGVSDSS VQVKGETDDSLQLDTKVRCLCGN 
Sbjct: 61  GKDQVAKLVDDTYRKMQVSGATDLATKGQGVSDSSNVQVKGETDDSLQLDTKVRCLCGNG 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADP   SV+  
Sbjct: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPFWVSVA-- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                   H    +  ++     I   GT+        SVDRTFQLTRADKDLLSKQEYD
Sbjct: 181 --------HPLFPVKLITTMSTNIPTDGTN-----PMQSVDRTFQLTRADKDLLSKQEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG
Sbjct: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKES+GE FQDALARICRCIGGGNTAD
Sbjct: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESDGERFQDALARICRCIGGGNTAD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE           WQC
Sbjct: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLEVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNYALENVIIDPYFNRIT+MMRHCGEDVTEIEVKPDG WRVRSKTESERR+LGDLC
Sbjct: 421 PICLKNYALENVIIDPYFNRITSMMRHCGEDVTEIEVKPDGFWRVRSKTESERRDLGDLC 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFT 540
           MWHSPEG LCVSNEE KPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFT
Sbjct: 481 MWHSPEGTLCVSNEEVKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFT 540

Query: 541 SGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFST-NNGIEMDSLSLNV 600
           SGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFST NNGIE+DSLSLNV
Sbjct: 541 SGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNNGIELDSLSLNV 600

Query: 601 DSAYGFTEQNPIAPVGEVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSGLTDA 660
           DSAYGFTEQNPIAPVGEVIVLSDSDD+NDILISSGTV+PSNHTDA EVPF MPPSGLTDA
Sbjct: 601 DSAYGFTEQNPIAPVGEVIVLSDSDDDNDILISSGTVFPSNHTDASEVPFPMPPSGLTDA 660

Query: 661 YPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALVDLQH 720
           YPEDPTLL A NSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLF SDADVSDALVDLQH
Sbjct: 661 YPEDPTLLPA-NSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFGSDADVSDALVDLQH 720

Query: 721 NSINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLP 780
           NSINCST+NGYAATPEAAISPAS+VPGSSIGRTDGDMNDSLVDNTLAFA +DPSLQIFLP
Sbjct: 721 NSINCSTINGYAATPEAAISPASIVPGSSIGRTDGDMNDSLVDNTLAFASEDPSLQIFLP 780

Query: 781 TRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQHVPSTGG 840
           TRPSDAPMQSDFR+EADVSNGVHTEDWISLRLGGDAGGSNGEST SKGLNSRQH+PSTGG
Sbjct: 781 TRPSDAPMQSDFREEADVSNGVHTEDWISLRLGGDAGGSNGESTASKGLNSRQHIPSTGG 840

Query: 841 EINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE 883
           EINSLSDTASLLLGMNDVRH+KASRQRSDSPFSFPRQKRSVRPRMC SIDSESE
Sbjct: 841 EINSLSDTASLLLGMNDVRHEKASRQRSDSPFSFPRQKRSVRPRMCLSIDSESE 878

BLAST of Cla97C03G053340 vs. NCBI nr
Match: XP_038894174.1 (E3 SUMO-protein ligase SIZ1 isoform X1 [Benincasa hispida] >XP_038894175.1 E3 SUMO-protein ligase SIZ1 isoform X1 [Benincasa hispida])

HSP 1 Score: 1610.5 bits (4169), Expect = 0.0e+00
Identity = 816/909 (89.77%), Positives = 833/909 (91.64%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQ---------------DLVERILAIL 60
           MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQ               DLVERILAIL
Sbjct: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQVNCPNVDGYFLFFILDLVERILAIL 60

Query: 61  SDEQVSKMWAKKNAVGKDQVAKLVDDTYRKMQV-SGATDLASKGQGVSDSSTVQVKGETD 120
           SDEQVSKMWAKKNAVGKDQVAKLVDDTYRKMQV SGATDLASKGQGVSDSS VQVKGETD
Sbjct: 61  SDEQVSKMWAKKNAVGKDQVAKLVDDTYRKMQVSSGATDLASKGQGVSDSSNVQVKGETD 120

Query: 121 DSLQLDTKVRCLCGNALQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEI 180
           DSLQLDTKVRCLCGN LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEI
Sbjct: 121 DSLQLDTKVRCLCGNGLQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEI 180

Query: 181 CRLNRADPCTSSVSRKADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTF 240
           CRLNRADP   SV+          H    +  ++     I   GT+        SVDRTF
Sbjct: 181 CRLNRADPFWVSVA----------HPLFPVKLITTMSANIPTDGTN-----PMQSVDRTF 240

Query: 241 QLTRADKDLLSKQEYDVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGA 300
           QLTRADKDLLSKQEYDVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGA
Sbjct: 241 QLTRADKDLLSKQEYDVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGA 300

Query: 301 NGRDDGPIITACTKDGMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQD 360
           NGRDDGPIITACTKDGMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPK+SEGE FQD
Sbjct: 301 NGRDDGPIITACTKDGMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKDSEGERFQD 360

Query: 361 ALARICRCIGGGNTADNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFD 420
           ALARICRCIGGG+TADNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFD
Sbjct: 361 ALARICRCIGGGSTADNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFD 420

Query: 421 LE-----------WQCPICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWR 480
           LE           WQCPICLKNYALENVIIDPYFNRITA+MRHCGEDVTEIEVKPDGSWR
Sbjct: 421 LEVFVELNQRSRKWQCPICLKNYALENVIIDPYFNRITALMRHCGEDVTEIEVKPDGSWR 480

Query: 481 VRSKTESERRELGDLCMWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDRGLKLGIRKNS 540
           VRSKTESERR+LGDLCMWHSPEG +CVSNEE KPKMEALKQIKQEGGSDRGLKLGIRKNS
Sbjct: 481 VRSKTESERRDLGDLCMWHSPEGIVCVSNEEVKPKMEALKQIKQEGGSDRGLKLGIRKNS 540

Query: 541 NGVWEVSRPEDINTFTSGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDF 600
           NGVWEVSR EDINTFTSGSRLP+NYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDF
Sbjct: 541 NGVWEVSRTEDINTFTSGSRLPDNYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDF 600

Query: 601 STNNGIEMDSLSLNVDSAYGFTEQNPIAPVGEVIVLSDSDDENDILISSGTVYPSNHTDA 660
           STNNGIE+DSLSLNVDS YG TEQ PIAPVGEVIVLSDSDD+N+ILIS GTVYPSNHTDA
Sbjct: 601 STNNGIELDSLSLNVDSTYGLTEQTPIAPVGEVIVLSDSDDDNNILISPGTVYPSNHTDA 660

Query: 661 GEVPFSMPPSGLTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLF 720
           GEVPFSMPPSGLTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVW LPPGTQGGAGFQLF
Sbjct: 661 GEVPFSMPPSGLTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVW-LPPGTQGGAGFQLF 720

Query: 721 SSDADVSDALVDLQHNSINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNT 780
           SSD DVSDALVDLQHNSINCSTMNGYAAT EAAISPASLVPGSSIGRTDGD+NDSLVDNT
Sbjct: 721 SSDPDVSDALVDLQHNSINCSTMNGYAATQEAAISPASLVPGSSIGRTDGDINDSLVDNT 780

Query: 781 LAFAGDDPSLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTT 840
           LAFAGDDPSLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGES T
Sbjct: 781 LAFAGDDPSLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESRT 840

Query: 841 SKGLNSRQHVPSTGGEINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRM 883
           SKGLNSRQH+PSTGGEINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPR+
Sbjct: 841 SKGLNSRQHIPSTGGEINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRI 893

BLAST of Cla97C03G053340 vs. NCBI nr
Match: XP_031746035.1 (E3 SUMO-protein ligase SIZ1 [Cucumis sativus] >KAE8652738.1 hypothetical protein Csa_013889 [Cucumis sativus])

HSP 1 Score: 1589.3 bits (4114), Expect = 0.0e+00
Identity = 799/893 (89.47%), Positives = 821/893 (91.94%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLV+RIL ILSDEQVSKMWAKKNAV
Sbjct: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVQRILDILSDEQVSKMWAKKNAV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
           GKDQVAKLVDDTYRKMQVSG  DLA+KGQGVSDSS VQVKGETDDSLQLDTKVRCLCGN 
Sbjct: 61  GKDQVAKLVDDTYRKMQVSG-VDLATKGQGVSDSSNVQVKGETDDSLQLDTKVRCLCGNG 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADP   SV+  
Sbjct: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPFWVSVA-- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                   H    +  ++     I   GT+        SVDR+FQLTRADKDLLSKQEYD
Sbjct: 181 --------HPLFPVKLITTMSTNIPTDGTN-----PMQSVDRSFQLTRADKDLLSKQEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG
Sbjct: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           MNKI LTGCDARSFCLGVRIVKRRTVQQILSMIPKES+GE FQDALARICRCIGGGNTAD
Sbjct: 301 MNKIALTGCDARSFCLGVRIVKRRTVQQILSMIPKESDGERFQDALARICRCIGGGNTAD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE           WQC
Sbjct: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLEVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNYALENVIIDPYFNRIT+MMRHCGEDVTEIEVKPDG WRVRSK+ESERR+LGDLC
Sbjct: 421 PICLKNYALENVIIDPYFNRITSMMRHCGEDVTEIEVKPDGFWRVRSKSESERRDLGDLC 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFT 540
           MWHSPEG LCVSNEE KPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDIN FT
Sbjct: 481 MWHSPEGTLCVSNEEVKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINNFT 540

Query: 541 SGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEMDSLSLNVD 600
                  NYG HDQKIIPMSSSATGSRDGEDPSVNQD G+NFDFS NNGIE+DSLSLNVD
Sbjct: 541 -------NYGCHDQKIIPMSSSATGSRDGEDPSVNQD-GLNFDFSNNNGIELDSLSLNVD 600

Query: 601 SAYGFTEQNPIAPVGEVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSGLTDAY 660
           SAYGFTEQNPIAPVGEVIVLSDSDD+NDILISSGTV+PSNHTD  EVPF MPPSGLTDAY
Sbjct: 601 SAYGFTEQNPIAPVGEVIVLSDSDDDNDILISSGTVFPSNHTDPSEVPFPMPPSGLTDAY 660

Query: 661 PEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALVDLQHN 720
           PEDPT+LSAGNSCLGLFNSH+DEFGMPVW LPPGTQGGAGFQLF SDADVSDALVDLQHN
Sbjct: 661 PEDPTILSAGNSCLGLFNSHEDEFGMPVWPLPPGTQGGAGFQLFGSDADVSDALVDLQHN 720

Query: 721 SINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLPT 780
           SINCST+NGYAATPEAAISPAS+VPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLPT
Sbjct: 721 SINCSTINGYAATPEAAISPASIVPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLPT 780

Query: 781 RPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQHVPSTGGE 840
           RPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQH+PSTGGE
Sbjct: 781 RPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQHIPSTGGE 840

Query: 841 INSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE 883
           INSLSDTASLLLGMNDVRH+KASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE
Sbjct: 841 INSLSDTASLLLGMNDVRHEKASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE 869

BLAST of Cla97C03G053340 vs. NCBI nr
Match: XP_022964655.1 (E3 SUMO-protein ligase SIZ1-like [Cucurbita moschata] >XP_022964656.1 E3 SUMO-protein ligase SIZ1-like [Cucurbita moschata] >XP_022964657.1 E3 SUMO-protein ligase SIZ1-like [Cucurbita moschata])

HSP 1 Score: 1582.4 bits (4096), Expect = 0.0e+00
Identity = 796/894 (89.04%), Positives = 819/894 (91.61%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV
Sbjct: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
           GKDQVAKLV+DTYRKMQVSGATDLASKGQGVSDSS VQVKGETDDSLQLDTKVRCLCG+A
Sbjct: 61  GKDQVAKLVEDTYRKMQVSGATDLASKGQGVSDSSNVQVKGETDDSLQLDTKVRCLCGSA 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGN PYPEHFYCEICRLNRADP   SV+  
Sbjct: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNSPYPEHFYCEICRLNRADPFWVSVA-- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                   H    +  ++     I   GT+        SVDRTFQLTRADKDLLSK EYD
Sbjct: 181 --------HPLFPVKLITTMSTNIPTDGTN-----PMQSVDRTFQLTRADKDLLSKPEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG
Sbjct: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           MNKITLTGCDAR+FCLGVRIVKRRTVQQIL MIPKESEGE FQDALARICRCIGGGNTAD
Sbjct: 301 MNKITLTGCDARTFCLGVRIVKRRTVQQILGMIPKESEGERFQDALARICRCIGGGNTAD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRF PCAHMGCFDLE           WQC
Sbjct: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFSPCAHMGCFDLEVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNYALENVIIDPYFNRIT+MMRHCGEDVTEIEVKPDGSWRVRS+TESERRELGDLC
Sbjct: 421 PICLKNYALENVIIDPYFNRITSMMRHCGEDVTEIEVKPDGSWRVRSRTESERRELGDLC 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFT 540
           +WHS +G  CV+NEE KPKMEA KQIKQEGGSDRGLKLGIRKNSNG WEVSRPEDINTFT
Sbjct: 481 LWHSSDGTSCVTNEEVKPKMEASKQIKQEGGSDRGLKLGIRKNSNGFWEVSRPEDINTFT 540

Query: 541 SGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEMDSLSLNVD 600
           SGSRLPENYG HDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEM+SLSL+VD
Sbjct: 541 SGSRLPENYGGHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEMNSLSLHVD 600

Query: 601 SAYGFTEQNPIAPVGEVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSGLTDAY 660
           S YGFTEQNPIAPVGEVIVLSDSD+ENDIL+SSGTVY SNHTDAGE+ FSMPP GL DAY
Sbjct: 601 SEYGFTEQNPIAPVGEVIVLSDSDEENDILVSSGTVYQSNHTDAGEISFSMPPPGLADAY 660

Query: 661 PEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALVDLQHN 720
           PEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPG QGGAGFQLFSSDADVS+ALVDLQH+
Sbjct: 661 PEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGAQGGAGFQLFSSDADVSEALVDLQHD 720

Query: 721 SINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLPT 780
           SINCSTMNGY ATPEAAISPASLVPGSSIG TDG+MNDSLVDN LAFAGDDPSLQIFLPT
Sbjct: 721 SINCSTMNGYLATPEAAISPASLVPGSSIGHTDGEMNDSLVDNPLAFAGDDPSLQIFLPT 780

Query: 781 RPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSN-GESTTSKGLNSRQHVPSTGG 840
           RPS APMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSN GEST S+GLNSRQH+PSTGG
Sbjct: 781 RPSVAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGGESTASRGLNSRQHIPSTGG 840

Query: 841 EINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE 883
           EINSLSDTASLLLGMNDVRHDKASRQRS SPFSFPRQKRSVR RM  SIDSESE
Sbjct: 841 EINSLSDTASLLLGMNDVRHDKASRQRSGSPFSFPRQKRSVRQRMFLSIDSESE 879

BLAST of Cla97C03G053340 vs. ExPASy Swiss-Prot
Match: Q680Q4 (E3 SUMO-protein ligase SIZ1 OS=Arabidopsis thaliana OX=3702 GN=SIZ1 PE=1 SV=2)

HSP 1 Score: 1053.5 bits (2723), Expect = 1.3e-306
Identity = 547/887 (61.67%), Positives = 672/887 (75.76%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDL ANCK+KL+YFRIKELKD+LTQLGLSKQGKKQ+LV+RIL +LSDEQ +++ +KKN V
Sbjct: 1   MDLEANCKEKLSYFRIKELKDVLTQLGLSKQGKKQELVDRILTLLSDEQAARLLSKKNTV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
            K+ VAKLVDDTYRKMQVSGA+DLASKGQ  SD+S ++VKGE +D  Q + KVRC+CGN+
Sbjct: 61  AKEAVAKLVDDTYRKMQVSGASDLASKGQVSSDTSNLKVKGEPEDPFQPEIKVRCVCGNS 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           L+T+SMI+CEDPRC VWQH+ CVI+P+KP +GNPP PE FYCEICRL RADP   +V+  
Sbjct: 121 LETDSMIQCEDPRCHVWQHVGCVILPDKPMDGNPPLPESFYCEICRLTRADPFWVTVAH- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                           LSP   R+ A+           SV+RTFQ+TRADKDLL+K EYD
Sbjct: 181 ---------------PLSP--VRLTATTIPNDGASTMQSVERTFQITRADKDLLAKPEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKV FRMQWPQYADLQ+NG+ VRAINRPG QLLG NGRDDGPIIT+C +DG
Sbjct: 241 VQAWCMLLNDKVLFRMQWPQYADLQVNGVPVRAINRPGGQLLGVNGRDDGPIITSCIRDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           +N+I+L+G D R FC GVR+VKRRT+QQ+L++IP+E +GE F+DALAR+ RCIGGG   D
Sbjct: 301 VNRISLSGGDVRIFCFGVRLVKRRTLQQVLNLIPEEGKGETFEDALARVRRCIGGGGGDD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSD+EVVA+FFGVNLRCPMSGSR+K+AGRF PC HMGCFDL+           WQC
Sbjct: 361 NADSDSDIEVVADFFGVNLRCPMSGSRIKVAGRFLPCVHMGCFDLDVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNY++E+VI+DPYFNRIT+ M+HC E+VTEIEVKPDGSWRV+ K ESERRELG+L 
Sbjct: 421 PICLKNYSVEHVIVDPYFNRITSKMKHCDEEVTEIEVKPDGSWRVKFKRESERRELGELS 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSD--RGLKLGIRKNSNGVWEVSRPEDINT 540
            WH+P+G+LC S  + K KME L  +KQEG SD    LKLGIRKN NG+WEVS+P + N 
Sbjct: 481 QWHAPDGSLCPSAVDIKRKMEML-PVKQEGYSDGPAPLKLGIRKNRNGIWEVSKP-NTNG 540

Query: 541 FTSGSRLPENYGSHDQKIIPMSSSATGS-RDGEDPSVNQDGGVNFDFSTNNGIEMDSLSL 600
            +S +R  E  G  ++ IIPMSSSATGS RDG+D SVNQD    FDF   NG+E+DS+S+
Sbjct: 541 LSSSNR-QEKVGYQEKNIIPMSSSATGSGRDGDDASVNQDAIGTFDF-VANGMELDSISM 600

Query: 601 NVDSAYGFTEQNPIAPVG--EVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSG 660
           NVDS Y F ++N     G  EVIVLSDSDDEND++I+ G  Y    TD G + F + P G
Sbjct: 601 NVDSGYNFPDRNQSGEGGNNEVIVLSDSDDENDLVITPGPAYSGCQTDGG-LTFPLNPPG 660

Query: 661 LTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALV 720
           + ++Y EDP  ++ G+S LGLFN  DDEF  P+WS P  T    GFQLF SDADVS  LV
Sbjct: 661 IINSYNEDPHSIAGGSSGLGLFND-DDEFDTPLWSFPSETPEAPGFQLFRSDADVSGGLV 720

Query: 721 DLQHNS-INCS--TMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDP 780
            L H+S +NCS     GY   PE +++   +VPGS+ GR++   ND LVDN LAF  DDP
Sbjct: 721 GLHHHSPLNCSPEINGGYTMAPETSMASVPVVPGST-GRSEA--NDGLVDNPLAFGRDDP 780

Query: 781 SLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQ 840
           SLQIFLPT+P DA  QS F+++AD+SNG+ +EDWISLRLG  A G++G+  T+ G+NS  
Sbjct: 781 SLQIFLPTKP-DASAQSGFKNQADMSNGLRSEDWISLRLGDSASGNHGDPATTNGINSSH 840

Query: 841 HVPSTGGEINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRS 869
            + +  G +++ ++TASLLLGMND R DKA +QRSD+PFSFPRQKRS
Sbjct: 841 QMSTREGSMDTTTETASLLLGMNDSRQDKAKKQRSDNPFSFPRQKRS 859

BLAST of Cla97C03G053340 vs. ExPASy Swiss-Prot
Match: Q6L4L4 (E3 SUMO-protein ligase SIZ1 OS=Oryza sativa subsp. japonica OX=39947 GN=SIZ1 PE=1 SV=1)

HSP 1 Score: 841.6 bits (2173), Expect = 7.8e-243
Identity = 457/907 (50.39%), Positives = 599/907 (66.04%), Query Frame = 0

Query: 2   DLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKM--WAKKNA 61
           DLV++CKDKLAYFRIKELKDIL QLGL KQGKKQDL++R+LA+L+DEQ  +   W +KN+
Sbjct: 3   DLVSSCKDKLAYFRIKELKDILNQLGLPKQGKKQDLIDRVLALLTDEQGQRHHGWGRKNS 62

Query: 62  VGKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGN 121
           + K+ VAK+VDDTYRKMQ+  A DLA++    SD S   ++ E  DS Q + KVRC+C +
Sbjct: 63  LTKEAVAKIVDDTYRKMQIQCAPDLATRSHSGSDFSFRPIE-EAYDSFQPEAKVRCICSS 122

Query: 122 ALQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSR 181
            +  +SMI+CED RCQVWQH++CV++P+KP E +   P  FYCE+CRL+RADP       
Sbjct: 123 TMVNDSMIQCEDQRCQVWQHLNCVLIPDKPGE-SAEVPPVFYCELCRLSRADPF------ 182

Query: 182 KADNHNVHKHSNGWISTLSPWI-WRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQE 241
                        W++  +P +  + V+SG +     V  SV+++FQL+R+D++ + +QE
Sbjct: 183 -------------WVTAGNPLLPVKFVSSGVTNDGTSVPQSVEKSFQLSRSDRETVQRQE 242

Query: 242 YDVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTK 301
           YD+Q WCMLLNDKV FRMQWPQYA+L +NG++VR + RPGSQLLG NGRDDGP+IT C++
Sbjct: 243 YDLQVWCMLLNDKVQFRMQWPQYAELHVNGISVRVVTRPGSQLLGINGRDDGPLITTCSR 302

Query: 302 DGMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNT 361
           +G+NKI L+  DAR+FC GVRI KRRTV Q+L+++PKE+EGE F+ ALAR+ RC+GGG+T
Sbjct: 303 EGINKICLSRVDARTFCFGVRIAKRRTVAQVLNLVPKEAEGESFEHALARVRRCLGGGDT 362

Query: 362 ADNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------W 421
           A+NADSDSDLEVVAE   VNLRCP SGSRM+IAGRFKPC HMGCFDLE           W
Sbjct: 363 AENADSDSDLEVVAESVTVNLRCPNSGSRMRIAGRFKPCIHMGCFDLETFVELNQRSRKW 422

Query: 422 QCPICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGD 481
           QCPICLKNY+LE+++IDPYFNRIT+++R+C EDV E++VKPDGSWRV+    S      +
Sbjct: 423 QCPICLKNYSLESLMIDPYFNRITSLLRNCNEDVNEVDVKPDGSWRVKGDAASR-----E 482

Query: 482 LCMWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSD--RGLKLGIRKNSNGVWEV-SRPED 541
           L  WH P+G LC   E+ KP M+   +   EG SD  + LK+GI++N NG+WEV S+ +D
Sbjct: 483 LSQWHMPDGTLCNPKEDVKPAMQNGNEQMMEGTSDGQKSLKIGIKRNPNGIWEVSSKADD 542

Query: 542 INTFTSGSRLPENYGSHD-QKIIPMSSSATGS-RDGEDPSVNQDGGVNFDFSTNNG-IEM 601
                 G+R+  N G      I+ MS+S T S RDGEDPSVNQ+   + D S NNG  E 
Sbjct: 543 KKPSVVGNRMQNNSGFRALNNIMHMSNSPTSSYRDGEDPSVNQESNRHVDLSLNNGNNEF 602

Query: 602 DSLSLNVDSAYGFTEQNPIAP--VGEVIVLSDSDDENDILISSGTVYPSNHTDAGE-VPF 661
           DS SLN   A   T+  P       +VIVLSDSD+END ++    VY +  T  G   PF
Sbjct: 603 DSFSLNFGQACN-TDDRPQQQHNATDVIVLSDSDEENDAMVCPPAVYDNTTTANGSGFPF 662

Query: 662 SMPPSGLTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPG-TQGGAGFQLFSSDA 721
           +    G T+ Y ED      G S LGL +++ D+F M  W +     Q   GFQ F +D 
Sbjct: 663 TTNGIGYTERYQED---AGVGTSGLGLLSNNVDDFEMNNWQMHSSYQQPEQGFQFFGNDT 722

Query: 722 DVSDALVDLQHNSINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFA 781
           DV +  V   HNS   +  N Y+      +  AS+ P  S+ R   +M+ SLVDN LA  
Sbjct: 723 DVHNTFVG-SHNSFGLAP-NDYSLDCNVGVEEASVTPALSVCRNSNEMHGSLVDNPLALV 782

Query: 782 GDDPSLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGL 841
           GDDPSLQIFLP++PS  P+Q +  + A+  NGV ++DWISL L   AGG   E      +
Sbjct: 783 GDDPSLQIFLPSQPSSVPLQEELSERANAPNGVQSDDWISLTLA--AGGGGNEEPAPADV 842

Query: 842 NSRQHVPSTGGEINSLSDTASLLLGMNDVRHDKA--SRQRSDSPFSFPRQKRSVRPRMCF 883
           NS+  +PST   I  L+D AS  L  N  R   A  + +R ++ FS PRQ RSVRPR+C 
Sbjct: 843 NSQPQIPSTETGIEPLTDAASAFLSTNIERRSGADLNPRRIENIFSHPRQPRSVRPRLCL 875

BLAST of Cla97C03G053340 vs. ExPASy Swiss-Prot
Match: Q6ASW7 (E3 SUMO-protein ligase SIZ2 OS=Oryza sativa subsp. japonica OX=39947 GN=SIZ2 PE=2 SV=1)

HSP 1 Score: 610.5 bits (1573), Expect = 2.9e-173
Identity = 358/789 (45.37%), Positives = 489/789 (61.98%), Query Frame = 0

Query: 3   LVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSD--EQVSKM--WAKKN 62
           L+A+CK KL +FRIKELKD+L QLGL KQG+KQ+LV++I+A+LSD  EQ S++     K 
Sbjct: 10  LLADCKYKLNHFRIKELKDVLHQLGLPKQGRKQELVDKIIAVLSDQQEQDSRLNGLPNKK 69

Query: 63  AVGKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSS-TVQVKGETDDSLQLDTKVRCLC 122
            VGK+ VAK+VDDT+ KM  +G+T+     +  +DS   V+ K ++DDS QLD KVRC C
Sbjct: 70  MVGKETVAKIVDDTFAKM--NGSTNAVPASRNQTDSGHIVKPKRKSDDSAQLDVKVRCPC 129

Query: 123 GNALQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPP-YPEHFYCEICRLNRADPCTSS 182
           G ++  +SMIKCE P+C   QH+ CVI+ EKP +  PP  P HFYC++CR+ RADP   +
Sbjct: 130 GYSMANDSMIKCEGPQCNTQQHVGCVIISEKPADSVPPELPPHFYCDMCRITRADPFWVT 189

Query: 183 VSRKADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSK 242
           V     NH V   S      ++P     VAS  SY V       ++TF L+RA+ ++L K
Sbjct: 190 V-----NHPVLPVS------ITPC---KVASDGSYAVQ----YFEKTFPLSRANWEMLQK 249

Query: 243 QEYDVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITAC 302
            EYD+Q WC+L ND VPFRMQWP ++D+QING+ +R +NR  +Q LG NGRDDGP++TA 
Sbjct: 250 DEYDLQVWCILFNDSVPFRMQWPLHSDIQINGIPIRVVNRQPTQQLGVNGRDDGPVLTAY 309

Query: 303 TKDGMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGG 362
            ++G NKI L+  D+R+FCLGVRI KRR+V+Q+LS++PKE +GE+F +ALAR+ RC+GGG
Sbjct: 310 VREGSNKIVLSRSDSRTFCLGVRIAKRRSVEQVLSLVPKEQDGENFDNALARVRRCVGGG 369

Query: 363 NTADNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE---------- 422
             ADNADSDSD+EVVA+   VNLRCPM+GSR+KIAGRFKPC HMGCFDLE          
Sbjct: 370 TEADNADSDSDIEVVADSVSVNLRCPMTGSRIKIAGRFKPCVHMGCFDLEAFVELNQRSR 429

Query: 423 -WQCPICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERREL 482
            WQCPICLKNY+L+N+IIDPYFNRITA+++ CG+DV+EI+VKPDGSWRV+        EL
Sbjct: 430 KWQCPICLKNYSLDNIIIDPYFNRITALVQSCGDDVSEIDVKPDGSWRVKGGA-----EL 489

Query: 483 GDLCMWHSPEGNLCV-SNEEAKPKMEALKQ-IKQEGGSDR---GLKLGIRKNSNGVWEVS 542
             L  WH P+G LC+ ++  +KP +  +KQ IK+E  S+     LKLGIR+N+NG WE++
Sbjct: 490 KGLAQWHLPDGTLCMPTDTRSKPNIRIVKQEIKEEPLSEETGGRLKLGIRRNNNGQWEIN 549

Query: 543 RPEDINTFTSGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIE 602
           +  D N   +G    EN          +S+S T   + ++   N + G  FD  T+N  +
Sbjct: 550 KRLDSNNGQNGYIEDEN--------CVVSASNTDDENSKNGIYNPEPG-QFDQLTSNIYD 609

Query: 603 MDSLSLNVDSAYGFTEQNPIAPVGEVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSM 662
           +DS  ++       TEQ       +VIVLSDSDD+N +++S G V  S+  D G      
Sbjct: 610 LDSSPMDAHFPPAPTEQ-------DVIVLSDSDDDNVMVLSPGDVNFSSAHDNGNAFPPN 669

Query: 663 PP--SGLTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDAD 722
           PP  SG+    P       AG       +  DD   +P W     +Q  AG Q+  +  +
Sbjct: 670 PPEASGICGEQPR-----GAGPDVTSFLDGFDD-LELPFWE-SSSSQDAAGTQVTDNQCE 729

Query: 723 VSDALVDLQ--HNSINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAF 766
           + + +V+ Q  H  I    + G AA+          +        DGD N +  D     
Sbjct: 730 MQNFIVNHQFLHEPILGVNLGGTAASNTLECEHDGALQACQSSDQDGDQNQTCHD---GH 747

BLAST of Cla97C03G053340 vs. ExPASy Swiss-Prot
Match: F1R4C4 (E3 SUMO-protein ligase PIAS4-A OS=Danio rerio OX=7955 GN=pias4a PE=2 SV=2)

HSP 1 Score: 67.4 bits (163), Expect = 9.3e-10
Identity = 58/215 (26.98%), Positives = 88/215 (40.93%), Query Frame = 0

Query: 302 NKITLT-GCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 361
           N++T+T G   + + + V +V+  T  ++ + + K    E+      RI          D
Sbjct: 241 NRVTITWGNFGKRYSVAVYLVRVFTSGELFNQL-KHCSVENPDRCRERI---------QD 300

Query: 362 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDL-----------EWQC 421
               D + E+      V+L CP+   R+ +  R   CAH+ CFD             W C
Sbjct: 301 KLRFDPESEIATTGLRVSLICPLVKMRLGVPCRVLTCAHLQCFDAVFFLQMNEKKPTWTC 360

Query: 422 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 481
           P+C K    E + ID   + I   ++   EDV EIE   DGSWR     + + RE  +  
Sbjct: 361 PVCDKPAPFELLTIDGLLSEI---LKETPEDVEEIEYLTDGSWRPIRDDKEKERERENSR 420

Query: 482 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDRG 505
               P  ++CV   EA     A     Q G S  G
Sbjct: 421 TPDYPVVDICV--PEANGHSPAHSGTNQTGKSGSG 440

BLAST of Cla97C03G053340 vs. ExPASy Swiss-Prot
Match: Q12216 (E3 SUMO-protein ligase SIZ2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=NFI1 PE=1 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 1.6e-09
Identity = 30/102 (29.41%), Positives = 52/102 (50.98%), Query Frame = 0

Query: 363 DSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDL-----------EWQCPI 422
           +   D +++     ++L+CP+S +RMK   +   C H+ CFD             WQCPI
Sbjct: 320 NEQDDDDIITTSTVLSLQCPISCTRMKYPAKTDQCKHIQCFDALWFLHSQSQVPTWQCPI 379

Query: 423 CLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWR 454
           C      + + I  + + I   +++C EDV ++E+  DGSW+
Sbjct: 380 CQHPIKFDQLKISEFVDNI---IQNCNEDVEQVEISVDGSWK 418

BLAST of Cla97C03G053340 vs. ExPASy TrEMBL
Match: A0A1S3CK96 (E3 SUMO-protein ligase SIZ1 OS=Cucumis melo OX=3656 GN=LOC103501758 PE=3 SV=1)

HSP 1 Score: 1619.4 bits (4192), Expect = 0.0e+00
Identity = 813/894 (90.94%), Positives = 832/894 (93.06%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLV+RIL ILSDEQVSKMWAKKNAV
Sbjct: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVQRILDILSDEQVSKMWAKKNAV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
           GKDQVAKLVDDTYRKMQVSGATDLA+KGQGVSDSS VQVKGETDDSLQLDTKVRCLCGN 
Sbjct: 61  GKDQVAKLVDDTYRKMQVSGATDLATKGQGVSDSSNVQVKGETDDSLQLDTKVRCLCGNG 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADP   SV+  
Sbjct: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPFWVSVA-- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                   H    +  ++     I   GT+        SVDRTFQLTRADKDLLSKQEYD
Sbjct: 181 --------HPLFPVKLITTMSTNIPTDGTN-----PMQSVDRTFQLTRADKDLLSKQEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG
Sbjct: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKES+GE FQDALARICRCIGGGNTAD
Sbjct: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESDGERFQDALARICRCIGGGNTAD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE           WQC
Sbjct: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLEVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNYALENVIIDPYFNRIT+MMRHCGEDVTEIEVKPDG WRVRSKTESERR+LGDLC
Sbjct: 421 PICLKNYALENVIIDPYFNRITSMMRHCGEDVTEIEVKPDGFWRVRSKTESERRDLGDLC 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFT 540
           MWHSPEG LCVSNEE KPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFT
Sbjct: 481 MWHSPEGTLCVSNEEVKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFT 540

Query: 541 SGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFST-NNGIEMDSLSLNV 600
           SGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFST NNGIE+DSLSLNV
Sbjct: 541 SGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNNGIELDSLSLNV 600

Query: 601 DSAYGFTEQNPIAPVGEVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSGLTDA 660
           DSAYGFTEQNPIAPVGEVIVLSDSDD+NDILISSGTV+PSNHTDA EVPF MPPSGLTDA
Sbjct: 601 DSAYGFTEQNPIAPVGEVIVLSDSDDDNDILISSGTVFPSNHTDASEVPFPMPPSGLTDA 660

Query: 661 YPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALVDLQH 720
           YPEDPTLL A NSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLF SDADVSDALVDLQH
Sbjct: 661 YPEDPTLLPA-NSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFGSDADVSDALVDLQH 720

Query: 721 NSINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLP 780
           NSINCST+NGYAATPEAAISPAS+VPGSSIGRTDGDMNDSLVDNTLAFA +DPSLQIFLP
Sbjct: 721 NSINCSTINGYAATPEAAISPASIVPGSSIGRTDGDMNDSLVDNTLAFASEDPSLQIFLP 780

Query: 781 TRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQHVPSTGG 840
           TRPSDAPMQSDFR+EADVSNGVHTEDWISLRLGGDAGGSNGEST SKGLNSRQH+PSTGG
Sbjct: 781 TRPSDAPMQSDFREEADVSNGVHTEDWISLRLGGDAGGSNGESTASKGLNSRQHIPSTGG 840

Query: 841 EINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE 883
           EINSLSDTASLLLGMNDVRH+KASRQRSDSPFSFPRQKRSVRPRMC SIDSESE
Sbjct: 841 EINSLSDTASLLLGMNDVRHEKASRQRSDSPFSFPRQKRSVRPRMCLSIDSESE 878

BLAST of Cla97C03G053340 vs. ExPASy TrEMBL
Match: A0A0A0LUR9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G086920 PE=3 SV=1)

HSP 1 Score: 1589.3 bits (4114), Expect = 0.0e+00
Identity = 799/893 (89.47%), Positives = 821/893 (91.94%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLV+RIL ILSDEQVSKMWAKKNAV
Sbjct: 92  MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVQRILDILSDEQVSKMWAKKNAV 151

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
           GKDQVAKLVDDTYRKMQVSG  DLA+KGQGVSDSS VQVKGETDDSLQLDTKVRCLCGN 
Sbjct: 152 GKDQVAKLVDDTYRKMQVSG-VDLATKGQGVSDSSNVQVKGETDDSLQLDTKVRCLCGNG 211

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADP   SV+  
Sbjct: 212 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPFWVSVA-- 271

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                   H    +  ++     I   GT+        SVDR+FQLTRADKDLLSKQEYD
Sbjct: 272 --------HPLFPVKLITTMSTNIPTDGTN-----PMQSVDRSFQLTRADKDLLSKQEYD 331

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG
Sbjct: 332 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 391

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           MNKI LTGCDARSFCLGVRIVKRRTVQQILSMIPKES+GE FQDALARICRCIGGGNTAD
Sbjct: 392 MNKIALTGCDARSFCLGVRIVKRRTVQQILSMIPKESDGERFQDALARICRCIGGGNTAD 451

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE           WQC
Sbjct: 452 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLEVFVELNQRSRKWQC 511

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNYALENVIIDPYFNRIT+MMRHCGEDVTEIEVKPDG WRVRSK+ESERR+LGDLC
Sbjct: 512 PICLKNYALENVIIDPYFNRITSMMRHCGEDVTEIEVKPDGFWRVRSKSESERRDLGDLC 571

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFT 540
           MWHSPEG LCVSNEE KPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDIN FT
Sbjct: 572 MWHSPEGTLCVSNEEVKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINNFT 631

Query: 541 SGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEMDSLSLNVD 600
                  NYG HDQKIIPMSSSATGSRDGEDPSVNQD G+NFDFS NNGIE+DSLSLNVD
Sbjct: 632 -------NYGCHDQKIIPMSSSATGSRDGEDPSVNQD-GLNFDFSNNNGIELDSLSLNVD 691

Query: 601 SAYGFTEQNPIAPVGEVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSGLTDAY 660
           SAYGFTEQNPIAPVGEVIVLSDSDD+NDILISSGTV+PSNHTD  EVPF MPPSGLTDAY
Sbjct: 692 SAYGFTEQNPIAPVGEVIVLSDSDDDNDILISSGTVFPSNHTDPSEVPFPMPPSGLTDAY 751

Query: 661 PEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALVDLQHN 720
           PEDPT+LSAGNSCLGLFNSH+DEFGMPVW LPPGTQGGAGFQLF SDADVSDALVDLQHN
Sbjct: 752 PEDPTILSAGNSCLGLFNSHEDEFGMPVWPLPPGTQGGAGFQLFGSDADVSDALVDLQHN 811

Query: 721 SINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLPT 780
           SINCST+NGYAATPEAAISPAS+VPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLPT
Sbjct: 812 SINCSTINGYAATPEAAISPASIVPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLPT 871

Query: 781 RPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQHVPSTGGE 840
           RPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQH+PSTGGE
Sbjct: 872 RPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQHIPSTGGE 931

Query: 841 INSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE 883
           INSLSDTASLLLGMNDVRH+KASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE
Sbjct: 932 INSLSDTASLLLGMNDVRHEKASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE 960

BLAST of Cla97C03G053340 vs. ExPASy TrEMBL
Match: A0A6J1HJJ8 (E3 SUMO-protein ligase SIZ1-like OS=Cucurbita moschata OX=3662 GN=LOC111464667 PE=3 SV=1)

HSP 1 Score: 1582.4 bits (4096), Expect = 0.0e+00
Identity = 796/894 (89.04%), Positives = 819/894 (91.61%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV
Sbjct: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
           GKDQVAKLV+DTYRKMQVSGATDLASKGQGVSDSS VQVKGETDDSLQLDTKVRCLCG+A
Sbjct: 61  GKDQVAKLVEDTYRKMQVSGATDLASKGQGVSDSSNVQVKGETDDSLQLDTKVRCLCGSA 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGN PYPEHFYCEICRLNRADP   SV+  
Sbjct: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNSPYPEHFYCEICRLNRADPFWVSVA-- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                   H    +  ++     I   GT+        SVDRTFQLTRADKDLLSK EYD
Sbjct: 181 --------HPLFPVKLITTMSTNIPTDGTN-----PMQSVDRTFQLTRADKDLLSKPEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG
Sbjct: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           MNKITLTGCDAR+FCLGVRIVKRRTVQQIL MIPKESEGE FQDALARICRCIGGGNTAD
Sbjct: 301 MNKITLTGCDARTFCLGVRIVKRRTVQQILGMIPKESEGERFQDALARICRCIGGGNTAD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRF PCAHMGCFDLE           WQC
Sbjct: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFSPCAHMGCFDLEVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNYALENVIIDPYFNRIT+MMRHCGEDVTEIEVKPDGSWRVRS+TESERRELGDLC
Sbjct: 421 PICLKNYALENVIIDPYFNRITSMMRHCGEDVTEIEVKPDGSWRVRSRTESERRELGDLC 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFT 540
           +WHS +G  CV+NEE KPKMEA KQIKQEGGSDRGLKLGIRKNSNG WEVSRPEDINTFT
Sbjct: 481 LWHSSDGTSCVTNEEVKPKMEASKQIKQEGGSDRGLKLGIRKNSNGFWEVSRPEDINTFT 540

Query: 541 SGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEMDSLSLNVD 600
           SGSRLPENYG HDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEM+SLSL+VD
Sbjct: 541 SGSRLPENYGGHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEMNSLSLHVD 600

Query: 601 SAYGFTEQNPIAPVGEVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSGLTDAY 660
           S YGFTEQNPIAPVGEVIVLSDSD+ENDIL+SSGTVY SNHTDAGE+ FSMPP GL DAY
Sbjct: 601 SEYGFTEQNPIAPVGEVIVLSDSDEENDILVSSGTVYQSNHTDAGEISFSMPPPGLADAY 660

Query: 661 PEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALVDLQHN 720
           PEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPG QGGAGFQLFSSDADVS+ALVDLQH+
Sbjct: 661 PEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGAQGGAGFQLFSSDADVSEALVDLQHD 720

Query: 721 SINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLPT 780
           SINCSTMNGY ATPEAAISPASLVPGSSIG TDG+MNDSLVDN LAFAGDDPSLQIFLPT
Sbjct: 721 SINCSTMNGYLATPEAAISPASLVPGSSIGHTDGEMNDSLVDNPLAFAGDDPSLQIFLPT 780

Query: 781 RPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSN-GESTTSKGLNSRQHVPSTGG 840
           RPS APMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSN GEST S+GLNSRQH+PSTGG
Sbjct: 781 RPSVAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGGESTASRGLNSRQHIPSTGG 840

Query: 841 EINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE 883
           EINSLSDTASLLLGMNDVRHDKASRQRS SPFSFPRQKRSVR RM  SIDSESE
Sbjct: 841 EINSLSDTASLLLGMNDVRHDKASRQRSGSPFSFPRQKRSVRQRMFLSIDSESE 879
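
The percentages in each HSP statistics line follow directly from the counts printed beside them. As a minimal sketch (the helper name parse_hsp_stats and the returned field names are illustrative assumptions, not part of the database or of BLAST), the line can be parsed and the percentages recomputed like this. The pattern also accepts the "Postives" spelling that some BLAST report renderings emit.

```python
import re

# Hypothetical helper, not part of any BLAST toolkit: parses one HSP
# statistics line of the form shown in the reports above.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals as in the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 796/894 (89.04%), Positives = 819/894 (91.61%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the Cucurbita moschata hit, 796 identical positions over an aligned length of 894 give 89.04% identity, matching the reported value.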

BLAST of Cla97C03G053340 vs. ExPASy TrEMBL
Match: A0A6J1I0I8 (E3 SUMO-protein ligase SIZ1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111469401 PE=3 SV=1)

HSP 1 Score: 1578.9 bits (4087), Expect = 0.0e+00
Identity = 794/894 (88.81%), Positives = 818/894 (91.50%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV
Sbjct: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
           GKDQVAKLV+DTYRKMQVSGATDLASKGQGVSDSS VQVKGETDDSLQLDTKVRCLCG+A
Sbjct: 61  GKDQVAKLVEDTYRKMQVSGATDLASKGQGVSDSSNVQVKGETDDSLQLDTKVRCLCGSA 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGN PYPEHFYCEICRLNRADP   SV+  
Sbjct: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNSPYPEHFYCEICRLNRADPFWVSVA-- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                   H    +  ++     I   GT+        SVDRTFQLTRADKDLLSK EYD
Sbjct: 181 --------HPLFPVKLITTMSTNIPTDGTN-----PMQSVDRTFQLTRADKDLLSKPEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG
Sbjct: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           MNKITLTGCDAR+FCLGVRIVKRRTVQQILSMIPKESEGE FQDALARICRCIGGGNT D
Sbjct: 301 MNKITLTGCDARTFCLGVRIVKRRTVQQILSMIPKESEGERFQDALARICRCIGGGNTTD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRF PCAHMGCFDLE           WQC
Sbjct: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFSPCAHMGCFDLEVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNYALENVIIDPYFNRIT+MMRHCGEDVTEIEVKPDGSWRVRS+TESERRELG+LC
Sbjct: 421 PICLKNYALENVIIDPYFNRITSMMRHCGEDVTEIEVKPDGSWRVRSRTESERRELGELC 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDRGLKLGIRKNSNGVWEVSRPEDINTFT 540
           +WHS +G  CV+NEE KPKMEALKQIKQEGGSDRGLKLGIRKNSNG WEVSRPEDINTFT
Sbjct: 481 LWHSSDGTSCVTNEEVKPKMEALKQIKQEGGSDRGLKLGIRKNSNGFWEVSRPEDINTFT 540

Query: 541 SGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEMDSLSLNVD 600
           SGSRLPENYG HDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEM+SLSL+VD
Sbjct: 541 SGSRLPENYGGHDQKIIPMSSSATGSRDGEDPSVNQDGGVNFDFSTNNGIEMNSLSLHVD 600

Query: 601 SAYGFTEQNPIAPVGEVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSGLTDAY 660
           S YGFTEQNPIAP GEVIVLSDSD+ENDIL+SSGTVY SNH DAGE+ FSMPP GL DAY
Sbjct: 601 SEYGFTEQNPIAPEGEVIVLSDSDEENDILVSSGTVYQSNHADAGEISFSMPPPGLADAY 660

Query: 661 PEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALVDLQHN 720
           PEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPG QGGAGFQLFSSDADVS+ALVDLQH+
Sbjct: 661 PEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGAQGGAGFQLFSSDADVSEALVDLQHD 720

Query: 721 SINCSTMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDPSLQIFLPT 780
           SINCSTMNGY ATPEAAISPASLVPGSSIG TDG+MNDSLVDN LAFAGDDPSLQIFLPT
Sbjct: 721 SINCSTMNGYLATPEAAISPASLVPGSSIGHTDGEMNDSLVDNPLAFAGDDPSLQIFLPT 780

Query: 781 RPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSN-GESTTSKGLNSRQHVPSTGG 840
           RPS APMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSN GEST S+GLNSRQH+PSTGG
Sbjct: 781 RPSVAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGGESTASRGLNSRQHIPSTGG 840

Query: 841 EINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSESE 883
           EINSLSDTASLLLGMNDVRHDKASRQRS SPFSFPRQKRSVR RM  SIDSESE
Sbjct: 841 EINSLSDTASLLLGMNDVRHDKASRQRSGSPFSFPRQKRSVRQRMFLSIDSESE 879

BLAST of Cla97C03G053340 vs. ExPASy TrEMBL
Match: A0A5D3DCL3 (E3 SUMO-protein ligase SIZ1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold778G00230 PE=4 SV=1)

HSP 1 Score: 1552.0 bits (4017), Expect = 0.0e+00
Identity = 778/860 (90.47%), Positives = 798/860 (92.79%), Query Frame = 0

Query: 35  QDLVERILAILSDEQVSKMWAKKNAVGKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDS 94
           +DLV+RIL ILSDEQVSKMWAKKNAVGKDQVAKLVDDTYRKMQVSGATDLA+KGQGVSDS
Sbjct: 74  KDLVQRILDILSDEQVSKMWAKKNAVGKDQVAKLVDDTYRKMQVSGATDLATKGQGVSDS 133

Query: 95  STVQVKGETDDSLQLDTKVRCLCGNALQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNP 154
           S VQVKGETDDSLQLDTKVRCLCGN LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNP
Sbjct: 134 SNVQVKGETDDSLQLDTKVRCLCGNGLQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNP 193

Query: 155 PYPEHFYCEICRLNRADPCTSSVSRKADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVV 214
           PYPEHFYCEICRLNRADP   SV+          H    +  ++     I   GT+    
Sbjct: 194 PYPEHFYCEICRLNRADPFWVSVA----------HPLFPVKLITTMSTNIPTDGTN---- 253

Query: 215 GVKISVDRTFQLTRADKDLLSKQEYDVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAI 274
               SVDRTFQLTRADKDLLSKQEYDVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAI
Sbjct: 254 -PMQSVDRTFQLTRADKDLLSKQEYDVQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAI 313

Query: 275 NRPGSQLLGANGRDDGPIITACTKDGMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIP 334
           NRPGSQLLGANGRDDGPIITACTKDGMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIP
Sbjct: 314 NRPGSQLLGANGRDDGPIITACTKDGMNKITLTGCDARSFCLGVRIVKRRTVQQILSMIP 373

Query: 335 KESEGEHFQDALARICRCIGGGNTADNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRF 394
           KES+GE FQDALARICRCIGGGNTADNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRF
Sbjct: 374 KESDGERFQDALARICRCIGGGNTADNADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRF 433

Query: 395 KPCAHMGCFDLE-----------WQCPICLKNYALENVIIDPYFNRITAMMRHCGEDVTE 454
           KPCAHMGCFDLE           WQCPICLKNYALENVIIDPYFNRIT+MMRHCGEDVTE
Sbjct: 434 KPCAHMGCFDLEVFVELNQRSRKWQCPICLKNYALENVIIDPYFNRITSMMRHCGEDVTE 493

Query: 455 IEVKPDGSWRVRSKTESERRELGDLCMWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSDR 514
           IEVKPDG WRVRSKTESERR+LGDLCMWHSPEG LCVSNEE KPKMEALKQIKQEGGSDR
Sbjct: 494 IEVKPDGFWRVRSKTESERRDLGDLCMWHSPEGTLCVSNEEVKPKMEALKQIKQEGGSDR 553

Query: 515 GLKLGIRKNSNGVWEVSRPEDINTFTSGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSV 574
           GLKLGIRKNSNGVWEVSRPEDINTFTSGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSV
Sbjct: 554 GLKLGIRKNSNGVWEVSRPEDINTFTSGSRLPENYGSHDQKIIPMSSSATGSRDGEDPSV 613

Query: 575 NQDGGVNFDFST-NNGIEMDSLSLNVDSAYGFTEQNPIAPVGEVIVLSDSDDENDILISS 634
           NQDGGVNFDFST NNGIE+DSLSLNVDSAYGFTEQNPIAPVGEVIVLSDSDD+NDILISS
Sbjct: 614 NQDGGVNFDFSTNNNGIELDSLSLNVDSAYGFTEQNPIAPVGEVIVLSDSDDDNDILISS 673

Query: 635 GTVYPSNHTDAGEVPFSMPPSGLTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPP 694
           GTV+PSNHTDA EVPF MPPSGLTDAYPEDPTLL A NSCLGLFNSHDDEFGMPVWSLPP
Sbjct: 674 GTVFPSNHTDASEVPFPMPPSGLTDAYPEDPTLLPA-NSCLGLFNSHDDEFGMPVWSLPP 733

Query: 695 GTQGGAGFQLFSSDADVSDALVDLQHNSINCSTMNGYAATPEAAISPASLVPGSSIGRTD 754
           GTQGGAGFQLF SDADVSDALVDLQHNSINCST+NGYAATPEAAISPAS+VPGSSIGRTD
Sbjct: 734 GTQGGAGFQLFGSDADVSDALVDLQHNSINCSTINGYAATPEAAISPASIVPGSSIGRTD 793

Query: 755 GDMNDSLVDNTLAFAGDDPSLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGG 814
           GDMNDSLVDNTLAFA +DPSLQIFLPTRPSDAPMQSDFR+EADVSNGVHTEDWISLRLGG
Sbjct: 794 GDMNDSLVDNTLAFASEDPSLQIFLPTRPSDAPMQSDFREEADVSNGVHTEDWISLRLGG 853

Query: 815 DAGGSNGESTTSKGLNSRQHVPSTGGEINSLSDTASLLLGMNDVRHDKASRQRSDSPFSF 874
           DAGGSNGEST SKGLNSRQH+PSTGGEINSLSDTASLLLGMNDVRH+KASRQRSDSPFSF
Sbjct: 854 DAGGSNGESTASKGLNSRQHIPSTGGEINSLSDTASLLLGMNDVRHEKASRQRSDSPFSF 913

Query: 875 PRQKRSVRPRMCFSIDSESE 883
           PRQKRSVRPRMC SIDSESE
Sbjct: 914 PRQKRSVRPRMCLSIDSESE 917

BLAST of Cla97C03G053340 vs. TAIR 10
Match: AT5G60410.2 (DNA-binding protein with MIZ/SP-RING zinc finger, PHD-finger and SAP domain )

HSP 1 Score: 1073.9 bits (2776), Expect = 6.3e-314
Identity = 558/901 (61.93%), Positives = 684/901 (75.92%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDL ANCK+KL+YFRIKELKD+LTQLGLSKQGKKQ+LV+RIL +LSDEQ +++ +KKN V
Sbjct: 1   MDLEANCKEKLSYFRIKELKDVLTQLGLSKQGKKQELVDRILTLLSDEQAARLLSKKNTV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
            K+ VAKLVDDTYRKMQVSGA+DLASKGQ  SD+S ++VKGE +D  Q + KVRC+CGN+
Sbjct: 61  AKEAVAKLVDDTYRKMQVSGASDLASKGQVSSDTSNLKVKGEPEDPFQPEIKVRCVCGNS 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           L+T+SMI+CEDPRC VWQH+ CVI+P+KP +GNPP PE FYCEICRL RADP   +V+  
Sbjct: 121 LETDSMIQCEDPRCHVWQHVGCVILPDKPMDGNPPLPESFYCEICRLTRADPFWVTVAH- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                           LSP   R+ A+           SV+RTFQ+TRADKDLL+K EYD
Sbjct: 181 ---------------PLSP--VRLTATTIPNDGASTMQSVERTFQITRADKDLLAKPEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKV FRMQWPQYADLQ+NG+ VRAINRPG QLLG NGRDDGPIIT+C +DG
Sbjct: 241 VQAWCMLLNDKVLFRMQWPQYADLQVNGVPVRAINRPGGQLLGVNGRDDGPIITSCIRDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           +N+I+L+G D R FC GVR+VKRRT+QQ+L++IP+E +GE F+DALAR+ RCIGGG   D
Sbjct: 301 VNRISLSGGDVRIFCFGVRLVKRRTLQQVLNLIPEEGKGETFEDALARVRRCIGGGGGDD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSD+EVVA+FFGVNLRCPMSGSR+K+AGRF PC HMGCFDL+           WQC
Sbjct: 361 NADSDSDIEVVADFFGVNLRCPMSGSRIKVAGRFLPCVHMGCFDLDVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNY++E+VI+DPYFNRIT+ M+HC E+VTEIEVKPDGSWRV+ K ESERRELG+L 
Sbjct: 421 PICLKNYSVEHVIVDPYFNRITSKMKHCDEEVTEIEVKPDGSWRVKFKRESERRELGELS 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSD--RGLKLGIRKNSNGVWEVSRPEDINT 540
            WH+P+G+LC S  + K KME L  +KQEG SD    LKLGIRKN NG+WEVS+P + N 
Sbjct: 481 QWHAPDGSLCPSAVDIKRKMEML-PVKQEGYSDGPAPLKLGIRKNRNGIWEVSKP-NTNG 540

Query: 541 FTSGSRLPENYGSHDQKIIPMSSSATGS-RDGEDPSVNQDGGVNFDFSTNNGIEMDSLSL 600
            +S +R  E  G  ++ IIPMSSSATGS RDG+D SVNQD    FDF   NG+E+DS+S+
Sbjct: 541 LSSSNR-QEKVGYQEKNIIPMSSSATGSGRDGDDASVNQDAIGTFDF-VANGMELDSISM 600

Query: 601 NVDSAYGFTEQNPIAPVG--EVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSG 660
           NVDS Y F ++N     G  EVIVLSDSDDEND++I+ G  Y    TD G + F + P G
Sbjct: 601 NVDSGYNFPDRNQSGEGGNNEVIVLSDSDDENDLVITPGPAYSGCQTDGG-LTFPLNPPG 660

Query: 661 LTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALV 720
           + ++Y EDP  ++ G+S LGLFN  DDEF  P+WS P  T    GFQLF SDADVS  LV
Sbjct: 661 IINSYNEDPHSIAGGSSGLGLFND-DDEFDTPLWSFPSETPEAPGFQLFRSDADVSGGLV 720

Query: 721 DLQHNS-INCS--TMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDP 780
            L H+S +NCS     GY   PE +++   +VPGS+ GR++   ND LVDN LAF  DDP
Sbjct: 721 GLHHHSPLNCSPEINGGYTMAPETSMASVPVVPGST-GRSEA--NDGLVDNPLAFGRDDP 780

Query: 781 SLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQ 840
           SLQIFLPT+P DA  QS F+++AD+SNG+ +EDWISLRLG  A G++G+  T+ G+NS  
Sbjct: 781 SLQIFLPTKP-DASAQSGFKNQADMSNGLRSEDWISLRLGDSASGNHGDPATTNGINSSH 840

Query: 841 HVPSTGGEINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSES 883
            + +  G +++ ++TASLLLGMND R DKA +QRSD+PFSFPRQKRSVRPRM  SIDS+S
Sbjct: 841 QMSTREGSMDTTTETASLLLGMNDSRQDKAKKQRSDNPFSFPRQKRSVRPRMYLSIDSDS 873

BLAST of Cla97C03G053340 vs. TAIR 10
Match: AT5G60410.1 (DNA-binding protein with MIZ/SP-RING zinc finger, PHD-finger and SAP domain )

HSP 1 Score: 1073.9 bits (2776), Expect = 6.3e-314
Identity = 558/901 (61.93%), Positives = 684/901 (75.92%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDL ANCK+KL+YFRIKELKD+LTQLGLSKQGKKQ+LV+RIL +LSDEQ +++ +KKN V
Sbjct: 1   MDLEANCKEKLSYFRIKELKDVLTQLGLSKQGKKQELVDRILTLLSDEQAARLLSKKNTV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
            K+ VAKLVDDTYRKMQVSGA+DLASKGQ  SD+S ++VKGE +D  Q + KVRC+CGN+
Sbjct: 61  AKEAVAKLVDDTYRKMQVSGASDLASKGQVSSDTSNLKVKGEPEDPFQPEIKVRCVCGNS 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           L+T+SMI+CEDPRC VWQH+ CVI+P+KP +GNPP PE FYCEICRL RADP   +V+  
Sbjct: 121 LETDSMIQCEDPRCHVWQHVGCVILPDKPMDGNPPLPESFYCEICRLTRADPFWVTVAH- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                           LSP   R+ A+           SV+RTFQ+TRADKDLL+K EYD
Sbjct: 181 ---------------PLSP--VRLTATTIPNDGASTMQSVERTFQITRADKDLLAKPEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKV FRMQWPQYADLQ+NG+ VRAINRPG QLLG NGRDDGPIIT+C +DG
Sbjct: 241 VQAWCMLLNDKVLFRMQWPQYADLQVNGVPVRAINRPGGQLLGVNGRDDGPIITSCIRDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           +N+I+L+G D R FC GVR+VKRRT+QQ+L++IP+E +GE F+DALAR+ RCIGGG   D
Sbjct: 301 VNRISLSGGDVRIFCFGVRLVKRRTLQQVLNLIPEEGKGETFEDALARVRRCIGGGGGDD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSD+EVVA+FFGVNLRCPMSGSR+K+AGRF PC HMGCFDL+           WQC
Sbjct: 361 NADSDSDIEVVADFFGVNLRCPMSGSRIKVAGRFLPCVHMGCFDLDVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNY++E+VI+DPYFNRIT+ M+HC E+VTEIEVKPDGSWRV+ K ESERRELG+L 
Sbjct: 421 PICLKNYSVEHVIVDPYFNRITSKMKHCDEEVTEIEVKPDGSWRVKFKRESERRELGELS 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSD--RGLKLGIRKNSNGVWEVSRPEDINT 540
            WH+P+G+LC S  + K KME L  +KQEG SD    LKLGIRKN NG+WEVS+P + N 
Sbjct: 481 QWHAPDGSLCPSAVDIKRKMEML-PVKQEGYSDGPAPLKLGIRKNRNGIWEVSKP-NTNG 540

Query: 541 FTSGSRLPENYGSHDQKIIPMSSSATGS-RDGEDPSVNQDGGVNFDFSTNNGIEMDSLSL 600
            +S +R  E  G  ++ IIPMSSSATGS RDG+D SVNQD    FDF   NG+E+DS+S+
Sbjct: 541 LSSSNR-QEKVGYQEKNIIPMSSSATGSGRDGDDASVNQDAIGTFDF-VANGMELDSISM 600

Query: 601 NVDSAYGFTEQNPIAPVG--EVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSG 660
           NVDS Y F ++N     G  EVIVLSDSDDEND++I+ G  Y    TD G + F + P G
Sbjct: 601 NVDSGYNFPDRNQSGEGGNNEVIVLSDSDDENDLVITPGPAYSGCQTDGG-LTFPLNPPG 660

Query: 661 LTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALV 720
           + ++Y EDP  ++ G+S LGLFN  DDEF  P+WS P  T    GFQLF SDADVS  LV
Sbjct: 661 IINSYNEDPHSIAGGSSGLGLFND-DDEFDTPLWSFPSETPEAPGFQLFRSDADVSGGLV 720

Query: 721 DLQHNS-INCS--TMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDP 780
            L H+S +NCS     GY   PE +++   +VPGS+ GR++   ND LVDN LAF  DDP
Sbjct: 721 GLHHHSPLNCSPEINGGYTMAPETSMASVPVVPGST-GRSEA--NDGLVDNPLAFGRDDP 780

Query: 781 SLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQ 840
           SLQIFLPT+P DA  QS F+++AD+SNG+ +EDWISLRLG  A G++G+  T+ G+NS  
Sbjct: 781 SLQIFLPTKP-DASAQSGFKNQADMSNGLRSEDWISLRLGDSASGNHGDPATTNGINSSH 840

Query: 841 HVPSTGGEINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSES 883
            + +  G +++ ++TASLLLGMND R DKA +QRSD+PFSFPRQKRSVRPRM  SIDS+S
Sbjct: 841 QMSTREGSMDTTTETASLLLGMNDSRQDKAKKQRSDNPFSFPRQKRSVRPRMYLSIDSDS 873

BLAST of Cla97C03G053340 vs. TAIR 10
Match: AT5G60410.5 (DNA-binding protein with MIZ/SP-RING zinc finger, PHD-finger and SAP domain )

HSP 1 Score: 1073.9 bits (2776), Expect = 6.3e-314
Identity = 558/901 (61.93%), Positives = 684/901 (75.92%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDL ANCK+KL+YFRIKELKD+LTQLGLSKQGKKQ+LV+RIL +LSDEQ +++ +KKN V
Sbjct: 1   MDLEANCKEKLSYFRIKELKDVLTQLGLSKQGKKQELVDRILTLLSDEQAARLLSKKNTV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
            K+ VAKLVDDTYRKMQVSGA+DLASKGQ  SD+S ++VKGE +D  Q + KVRC+CGN+
Sbjct: 61  AKEAVAKLVDDTYRKMQVSGASDLASKGQVSSDTSNLKVKGEPEDPFQPEIKVRCVCGNS 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           L+T+SMI+CEDPRC VWQH+ CVI+P+KP +GNPP PE FYCEICRL RADP   +V+  
Sbjct: 121 LETDSMIQCEDPRCHVWQHVGCVILPDKPMDGNPPLPESFYCEICRLTRADPFWVTVAH- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                           LSP   R+ A+           SV+RTFQ+TRADKDLL+K EYD
Sbjct: 181 ---------------PLSP--VRLTATTIPNDGASTMQSVERTFQITRADKDLLAKPEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKV FRMQWPQYADLQ+NG+ VRAINRPG QLLG NGRDDGPIIT+C +DG
Sbjct: 241 VQAWCMLLNDKVLFRMQWPQYADLQVNGVPVRAINRPGGQLLGVNGRDDGPIITSCIRDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           +N+I+L+G D R FC GVR+VKRRT+QQ+L++IP+E +GE F+DALAR+ RCIGGG   D
Sbjct: 301 VNRISLSGGDVRIFCFGVRLVKRRTLQQVLNLIPEEGKGETFEDALARVRRCIGGGGGDD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSD+EVVA+FFGVNLRCPMSGSR+K+AGRF PC HMGCFDL+           WQC
Sbjct: 361 NADSDSDIEVVADFFGVNLRCPMSGSRIKVAGRFLPCVHMGCFDLDVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNY++E+VI+DPYFNRIT+ M+HC E+VTEIEVKPDGSWRV+ K ESERRELG+L 
Sbjct: 421 PICLKNYSVEHVIVDPYFNRITSKMKHCDEEVTEIEVKPDGSWRVKFKRESERRELGELS 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSD--RGLKLGIRKNSNGVWEVSRPEDINT 540
            WH+P+G+LC S  + K KME L  +KQEG SD    LKLGIRKN NG+WEVS+P + N 
Sbjct: 481 QWHAPDGSLCPSAVDIKRKMEML-PVKQEGYSDGPAPLKLGIRKNRNGIWEVSKP-NTNG 540

Query: 541 FTSGSRLPENYGSHDQKIIPMSSSATGS-RDGEDPSVNQDGGVNFDFSTNNGIEMDSLSL 600
            +S +R  E  G  ++ IIPMSSSATGS RDG+D SVNQD    FDF   NG+E+DS+S+
Sbjct: 541 LSSSNR-QEKVGYQEKNIIPMSSSATGSGRDGDDASVNQDAIGTFDF-VANGMELDSISM 600

Query: 601 NVDSAYGFTEQNPIAPVG--EVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSG 660
           NVDS Y F ++N     G  EVIVLSDSDDEND++I+ G  Y    TD G + F + P G
Sbjct: 601 NVDSGYNFPDRNQSGEGGNNEVIVLSDSDDENDLVITPGPAYSGCQTDGG-LTFPLNPPG 660

Query: 661 LTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALV 720
           + ++Y EDP  ++ G+S LGLFN  DDEF  P+WS P  T    GFQLF SDADVS  LV
Sbjct: 661 IINSYNEDPHSIAGGSSGLGLFND-DDEFDTPLWSFPSETPEAPGFQLFRSDADVSGGLV 720

Query: 721 DLQHNS-INCS--TMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDP 780
            L H+S +NCS     GY   PE +++   +VPGS+ GR++   ND LVDN LAF  DDP
Sbjct: 721 GLHHHSPLNCSPEINGGYTMAPETSMASVPVVPGST-GRSEA--NDGLVDNPLAFGRDDP 780

Query: 781 SLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQ 840
           SLQIFLPT+P DA  QS F+++AD+SNG+ +EDWISLRLG  A G++G+  T+ G+NS  
Sbjct: 781 SLQIFLPTKP-DASAQSGFKNQADMSNGLRSEDWISLRLGDSASGNHGDPATTNGINSSH 840

Query: 841 HVPSTGGEINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRSVRPRMCFSIDSES 883
            + +  G +++ ++TASLLLGMND R DKA +QRSD+PFSFPRQKRSVRPRM  SIDS+S
Sbjct: 841 QMSTREGSMDTTTETASLLLGMNDSRQDKAKKQRSDNPFSFPRQKRSVRPRMYLSIDSDS 873

BLAST of Cla97C03G053340 vs. TAIR 10
Match: AT5G60410.4 (DNA-binding protein with MIZ/SP-RING zinc finger, PHD-finger and SAP domain )

HSP 1 Score: 1053.5 bits (2723), Expect = 9.3e-308
Identity = 547/887 (61.67%), Positives = 672/887 (75.76%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDL ANCK+KL+YFRIKELKD+LTQLGLSKQGKKQ+LV+RIL +LSDEQ +++ +KKN V
Sbjct: 1   MDLEANCKEKLSYFRIKELKDVLTQLGLSKQGKKQELVDRILTLLSDEQAARLLSKKNTV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
            K+ VAKLVDDTYRKMQVSGA+DLASKGQ  SD+S ++VKGE +D  Q + KVRC+CGN+
Sbjct: 61  AKEAVAKLVDDTYRKMQVSGASDLASKGQVSSDTSNLKVKGEPEDPFQPEIKVRCVCGNS 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           L+T+SMI+CEDPRC VWQH+ CVI+P+KP +GNPP PE FYCEICRL RADP   +V+  
Sbjct: 121 LETDSMIQCEDPRCHVWQHVGCVILPDKPMDGNPPLPESFYCEICRLTRADPFWVTVAH- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                           LSP   R+ A+           SV+RTFQ+TRADKDLL+K EYD
Sbjct: 181 ---------------PLSP--VRLTATTIPNDGASTMQSVERTFQITRADKDLLAKPEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKV FRMQWPQYADLQ+NG+ VRAINRPG QLLG NGRDDGPIIT+C +DG
Sbjct: 241 VQAWCMLLNDKVLFRMQWPQYADLQVNGVPVRAINRPGGQLLGVNGRDDGPIITSCIRDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           +N+I+L+G D R FC GVR+VKRRT+QQ+L++IP+E +GE F+DALAR+ RCIGGG   D
Sbjct: 301 VNRISLSGGDVRIFCFGVRLVKRRTLQQVLNLIPEEGKGETFEDALARVRRCIGGGGGDD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSD+EVVA+FFGVNLRCPMSGSR+K+AGRF PC HMGCFDL+           WQC
Sbjct: 361 NADSDSDIEVVADFFGVNLRCPMSGSRIKVAGRFLPCVHMGCFDLDVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNY++E+VI+DPYFNRIT+ M+HC E+VTEIEVKPDGSWRV+ K ESERRELG+L 
Sbjct: 421 PICLKNYSVEHVIVDPYFNRITSKMKHCDEEVTEIEVKPDGSWRVKFKRESERRELGELS 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSD--RGLKLGIRKNSNGVWEVSRPEDINT 540
            WH+P+G+LC S  + K KME L  +KQEG SD    LKLGIRKN NG+WEVS+P + N 
Sbjct: 481 QWHAPDGSLCPSAVDIKRKMEML-PVKQEGYSDGPAPLKLGIRKNRNGIWEVSKP-NTNG 540

Query: 541 FTSGSRLPENYGSHDQKIIPMSSSATGS-RDGEDPSVNQDGGVNFDFSTNNGIEMDSLSL 600
            +S +R  E  G  ++ IIPMSSSATGS RDG+D SVNQD    FDF   NG+E+DS+S+
Sbjct: 541 LSSSNR-QEKVGYQEKNIIPMSSSATGSGRDGDDASVNQDAIGTFDF-VANGMELDSISM 600

Query: 601 NVDSAYGFTEQNPIAPVG--EVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSG 660
           NVDS Y F ++N     G  EVIVLSDSDDEND++I+ G  Y    TD G + F + P G
Sbjct: 601 NVDSGYNFPDRNQSGEGGNNEVIVLSDSDDENDLVITPGPAYSGCQTDGG-LTFPLNPPG 660

Query: 661 LTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALV 720
           + ++Y EDP  ++ G+S LGLFN  DDEF  P+WS P  T    GFQLF SDADVS  LV
Sbjct: 661 IINSYNEDPHSIAGGSSGLGLFND-DDEFDTPLWSFPSETPEAPGFQLFRSDADVSGGLV 720

Query: 721 DLQHNS-INCS--TMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDP 780
            L H+S +NCS     GY   PE +++   +VPGS+ GR++   ND LVDN LAF  DDP
Sbjct: 721 GLHHHSPLNCSPEINGGYTMAPETSMASVPVVPGST-GRSEA--NDGLVDNPLAFGRDDP 780

Query: 781 SLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQ 840
           SLQIFLPT+P DA  QS F+++AD+SNG+ +EDWISLRLG  A G++G+  T+ G+NS  
Sbjct: 781 SLQIFLPTKP-DASAQSGFKNQADMSNGLRSEDWISLRLGDSASGNHGDPATTNGINSSH 840

Query: 841 HVPSTGGEINSLSDTASLLLGMNDVRHDKASRQRSDSPFSFPRQKRS 869
            + +  G +++ ++TASLLLGMND R DKA +QRSD+PFSFPRQKRS
Sbjct: 841 QMSTREGSMDTTTETASLLLGMNDSRQDKAKKQRSDNPFSFPRQKRS 859

BLAST of Cla97C03G053340 vs. TAIR 10
Match: AT5G60410.3 (DNA-binding protein with MIZ/SP-RING zinc finger, PHD-finger and SAP domain )

HSP 1 Score: 1002.3 bits (2590), Expect = 2.5e-292
Identity = 520/855 (60.82%), Positives = 643/855 (75.20%), Query Frame = 0

Query: 1   MDLVANCKDKLAYFRIKELKDILTQLGLSKQGKKQDLVERILAILSDEQVSKMWAKKNAV 60
           MDL ANCK+KL+YFRIKELKD+LTQLGLSKQGKKQ+LV+RIL +LSDEQ +++ +KKN V
Sbjct: 1   MDLEANCKEKLSYFRIKELKDVLTQLGLSKQGKKQELVDRILTLLSDEQAARLLSKKNTV 60

Query: 61  GKDQVAKLVDDTYRKMQVSGATDLASKGQGVSDSSTVQVKGETDDSLQLDTKVRCLCGNA 120
            K+ VAKLVDDTYRKMQVSGA+DLASKGQ  SD+S ++VKGE +D  Q + KVRC+CGN+
Sbjct: 61  AKEAVAKLVDDTYRKMQVSGASDLASKGQVSSDTSNLKVKGEPEDPFQPEIKVRCVCGNS 120

Query: 121 LQTESMIKCEDPRCQVWQHISCVIVPEKPTEGNPPYPEHFYCEICRLNRADPCTSSVSRK 180
           L+T+SMI+CEDPRC VWQH+ CVI+P+KP +GNPP PE FYCEICRL RADP   +V+  
Sbjct: 121 LETDSMIQCEDPRCHVWQHVGCVILPDKPMDGNPPLPESFYCEICRLTRADPFWVTVAH- 180

Query: 181 ADNHNVHKHSNGWISTLSPWIWRIVASGTSYTVVGVKISVDRTFQLTRADKDLLSKQEYD 240
                           LSP   R+ A+           SV+RTFQ+TRADKDLL+K EYD
Sbjct: 181 ---------------PLSP--VRLTATTIPNDGASTMQSVERTFQITRADKDLLAKPEYD 240

Query: 241 VQAWCMLLNDKVPFRMQWPQYADLQINGLAVRAINRPGSQLLGANGRDDGPIITACTKDG 300
           VQAWCMLLNDKV FRMQWPQYADLQ+NG+ VRAINRPG QLLG NGRDDGPIIT+C +DG
Sbjct: 241 VQAWCMLLNDKVLFRMQWPQYADLQVNGVPVRAINRPGGQLLGVNGRDDGPIITSCIRDG 300

Query: 301 MNKITLTGCDARSFCLGVRIVKRRTVQQILSMIPKESEGEHFQDALARICRCIGGGNTAD 360
           +N+I+L+G D R FC GVR+VKRRT+QQ+L++IP+E +GE F+DALAR+ RCIGGG   D
Sbjct: 301 VNRISLSGGDVRIFCFGVRLVKRRTLQQVLNLIPEEGKGETFEDALARVRRCIGGGGGDD 360

Query: 361 NADSDSDLEVVAEFFGVNLRCPMSGSRMKIAGRFKPCAHMGCFDLE-----------WQC 420
           NADSDSD+EVVA+FFGVNLRCPMSGSR+K+AGRF PC HMGCFDL+           WQC
Sbjct: 361 NADSDSDIEVVADFFGVNLRCPMSGSRIKVAGRFLPCVHMGCFDLDVFVELNQRSRKWQC 420

Query: 421 PICLKNYALENVIIDPYFNRITAMMRHCGEDVTEIEVKPDGSWRVRSKTESERRELGDLC 480
           PICLKNY++E+VI+DPYFNRIT+ M+HC E+VTEIEVKPDGSWRV+ K ESERRELG+L 
Sbjct: 421 PICLKNYSVEHVIVDPYFNRITSKMKHCDEEVTEIEVKPDGSWRVKFKRESERRELGELS 480

Query: 481 MWHSPEGNLCVSNEEAKPKMEALKQIKQEGGSD--RGLKLGIRKNSNGVWEVSRPEDINT 540
            WH+P+G+LC S  + K KME L  +KQEG SD    LKLGIRKN NG+WEVS+P + N 
Sbjct: 481 QWHAPDGSLCPSAVDIKRKMEML-PVKQEGYSDGPAPLKLGIRKNRNGIWEVSKP-NTNG 540

Query: 541 FTSGSRLPENYGSHDQKIIPMSSSATGS-RDGEDPSVNQDGGVNFDFSTNNGIEMDSLSL 600
            +S +R  E  G  ++ IIPMSSSATGS RDG+D SVNQD    FDF   NG+E+DS+S+
Sbjct: 541 LSSSNR-QEKVGYQEKNIIPMSSSATGSGRDGDDASVNQDAIGTFDF-VANGMELDSISM 600

Query: 601 NVDSAYGFTEQNPIAPVG--EVIVLSDSDDENDILISSGTVYPSNHTDAGEVPFSMPPSG 660
           NVDS Y F ++N     G  EVIVLSDSDDEND++I+ G  Y    TD G + F + P G
Sbjct: 601 NVDSGYNFPDRNQSGEGGNNEVIVLSDSDDENDLVITPGPAYSGCQTDGG-LTFPLNPPG 660

Query: 661 LTDAYPEDPTLLSAGNSCLGLFNSHDDEFGMPVWSLPPGTQGGAGFQLFSSDADVSDALV 720
           + ++Y EDP  ++ G+S LGLFN  DDEF  P+WS P  T    GFQLF SDADVS  LV
Sbjct: 661 IINSYNEDPHSIAGGSSGLGLFND-DDEFDTPLWSFPSETPEAPGFQLFRSDADVSGGLV 720

Query: 721 DLQHNS-INCS--TMNGYAATPEAAISPASLVPGSSIGRTDGDMNDSLVDNTLAFAGDDP 780
            L H+S +NCS     GY   PE +++   +VPGS+ GR++   ND LVDN LAF  DDP
Sbjct: 721 GLHHHSPLNCSPEINGGYTMAPETSMASVPVVPGST-GRSEA--NDGLVDNPLAFGRDDP 780

Query: 781 SLQIFLPTRPSDAPMQSDFRDEADVSNGVHTEDWISLRLGGDAGGSNGESTTSKGLNSRQ 837
           SLQIFLPT+P DA  QS F+++AD+SNG+ +EDWISLRLG  A G++G+  T+ G+NS  
Sbjct: 781 SLQIFLPTKP-DASAQSGFKNQADMSNGLRSEDWISLRLGDSASGNHGDPATTNGINSSH 827
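
The identity and positive counts in each report can be tallied from the middle line of an alignment row. The sketch below assumes the usual BLASTP midline convention (a letter for an identical pair, "+" for a conservative substitution, a space for a mismatch or gap). The function name midline_counts is a hypothetical choice for illustration.

```python
def midline_counts(midline):
    """Count identities and positives in one BLAST midline string.

    Assumed convention: a letter marks an identical residue pair, '+' marks
    a conservative substitution, and a space marks a mismatch or a gap.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives

# Midline of the Query 61..120 row from the Cucurbita moschata alignment above.
row = "GKDQVAKLV+DTYRKMQVSGATDLASKGQGVSDSS VQVKGETDDSLQLDTKVRCLCG+A"
identities, positives = midline_counts(row)
print(identities, positives, len(row))
```

Summing these per-row counts over every row of an HSP would give the numerators of the Identity and Positives fractions, provided the midline text is copied with its spacing intact.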

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038894176.1	0.0e+00	91.28	E3 SUMO-protein ligase SIZ1 isoform X2 [Benincasa hispida]
XP_008463667.1	0.0e+00	90.94	PREDICTED: E3 SUMO-protein ligase SIZ1 [Cucumis melo]
XP_038894174.1	0.0e+00	89.77	E3 SUMO-protein ligase SIZ1 isoform X1 [Benincasa hispida] >XP_038894175.1 E3 SU...
XP_031746035.1	0.0e+00	89.47	E3 SUMO-protein ligase SIZ1 [Cucumis sativus] >KAE8652738.1 hypothetical protein...
XP_022964655.1	0.0e+00	89.04	E3 SUMO-protein ligase SIZ1-like [Cucurbita moschata] >XP_022964656.1 E3 SUMO-pr...

Match Name	E-value	Identity	Description
Q680Q4	1.3e-306	61.67	E3 SUMO-protein ligase SIZ1 OS=Arabidopsis thaliana OX=3702 GN=SIZ1 PE=1 SV=2
Q6L4L4	7.8e-243	50.39	E3 SUMO-protein ligase SIZ1 OS=Oryza sativa subsp. japonica OX=39947 GN=SIZ1 PE=...
Q6ASW7	2.9e-173	45.37	E3 SUMO-protein ligase SIZ2 OS=Oryza sativa subsp. japonica OX=39947 GN=SIZ2 PE=...
F1R4C4	9.3e-10	26.98	E3 SUMO-protein ligase PIAS4-A OS=Danio rerio OX=7955 GN=pias4a PE=2 SV=2
Q12216	1.6e-09	29.41	E3 SUMO-protein ligase SIZ2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S2...

Match Name	E-value	Identity	Description
A0A1S3CK96	0.0e+00	90.94	E3 SUMO-protein ligase SIZ1 OS=Cucumis melo OX=3656 GN=LOC103501758 PE=3 SV=1
A0A0A0LUR9	0.0e+00	89.47	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G086920 PE=3 SV=1
A0A6J1HJJ8	0.0e+00	89.04	E3 SUMO-protein ligase SIZ1-like OS=Cucurbita moschata OX=3662 GN=LOC111464667 P...
A0A6J1I0I8	0.0e+00	88.81	E3 SUMO-protein ligase SIZ1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC11...
A0A5D3DCL3	0.0e+00	90.47	E3 SUMO-protein ligase SIZ1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaf...

Match Name	E-value	Identity	Description
AT5G60410.2	6.3e-314	61.93	DNA-binding protein with MIZ/SP-RING zinc finger, PHD-finger and SAP domain
AT5G60410.1	6.3e-314	61.93	DNA-binding protein with MIZ/SP-RING zinc finger, PHD-finger and SAP domain
AT5G60410.5	6.3e-314	61.93	DNA-binding protein with MIZ/SP-RING zinc finger, PHD-finger and SAP domain
AT5G60410.4	9.3e-308	61.67	DNA-binding protein with MIZ/SP-RING zinc finger, PHD-finger and SAP domain
AT5G60410.3	2.5e-292	60.82	DNA-binding protein with MIZ/SP-RING zinc finger, PHD-finger and SAP domain
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR003034	SAP domain	SMART	SM00513	sap_9	coord: 11..45; e-value: 1.4E-7; score: 41.2
IPR003034	SAP domain	PFAM	PF02037	SAP	coord: 14..42; e-value: 4.1E-8; score: 32.8
IPR003034	SAP domain	PROSITE	PS50800	SAP	coord: 11..45; score: 11.923741
IPR001965	Zinc finger, PHD-type	SMART	SM00249	PHD_3	coord: 114..166; e-value: 4.2E-9; score: 46.3
IPR013083	Zinc finger, RING/FYVE/PHD-type	GENE3D	3.30.40.10	Zinc/RING finger domain, C3HC4 (zinc finger)	coord: 352..454; e-value: 3.7E-24; score: 86.8
IPR013083	Zinc finger, RING/FYVE/PHD-type	GENE3D	3.30.40.10	Zinc/RING finger domain, C3HC4 (zinc finger)	coord: 103..171; e-value: 1.1E-31; score: 110.4
IPR036361	SAP domain superfamily	GENE3D	1.10.720.30	SAP domain	coord: 1..60; e-value: 7.0E-8; score: 33.8
IPR036361	SAP domain superfamily	SUPERFAMILY	68906	SAP domain	coord: 1..63
None	No IPR available	PIRSR	PIRSR003033-1	PIRSR003033-1	coord: 8..43; e-value: 0.0013; score: 14.8
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 848..882
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 806..831
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 528..567
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 528..560
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 799..831
None	No IPR available	PANTHER	PTHR10782:SF71	E3 SUMO-PROTEIN LIGASE SIZ1	coord: 9..813
None	No IPR available	PANTHER	PTHR10782	ZINC FINGER MIZ DOMAIN-CONTAINING PROTEIN	coord: 9..813
None	No IPR available	CDD	cd15570	PHD_Bye1p_SIZ1_like	coord: 114..165; e-value: 1.23956E-21; score: 86.7452
IPR019786	Zinc finger, PHD-type, conserved site	PROSITE	PS01359	ZF_PHD_1	coord: 115..165
IPR004181	Zinc finger, MIZ-type	PROSITE	PS51044	ZF_SP_RING	coord: 366..432; score: 21.700901
IPR011011	Zinc finger, FYVE/PHD-type	SUPERFAMILY	57903	FYVE/PHD zinc finger	coord: 110..169
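
The coord values in the InterPro table can be read as residue ranges on the 883-residue query protein. The sketch below assumes they are 1-based and inclusive, which is an assumption about the export rather than something the page states. The names DOMAINS, domains_at and domain_length are hypothetical. Only three representative rows are copied from the table.

```python
# Three rows copied from the InterPro table above. Coordinates are assumed
# to be 1-based, inclusive residue positions on the query protein.
DOMAINS = [
    ("SAP domain (SMART SM00513)", 11, 45),
    ("Zinc finger, PHD-type (SMART SM00249)", 114, 166),
    ("Zinc finger, MIZ-type (PROSITE PS51044)", 366, 432),
]

def domains_at(position, domains=DOMAINS):
    """Return the names of all domains whose range contains the residue."""
    return [name for name, start, end in domains if start <= position <= end]

def domain_length(start, end):
    """Length in residues of an inclusive coordinate range."""
    return end - start + 1

print(domains_at(120))        # residue 120 lies inside the PHD-type zinc finger
print(domain_length(11, 45))  # length of the SMART SAP domain hit
```

A lookup like this makes it easy to see, for instance, which annotated domain a substitution in one of the alignments above would fall into.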

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C03G053340.2	Cla97C03G053340.2	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0016925	protein sumoylation
molecular_function	GO:0016874	ligase activity
molecular_function	GO:0019789	SUMO transferase activity
molecular_function	GO:0008270	zinc ion binding