Csor.00g102970 (gene) Silver-seed gourd (wild; sororia) v1

Overview
Name: Csor.00g102970
Type: gene
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: E1 ubiquitin-activating enzyme
Location: Csor_Chr20: 1694051 .. 1706528 (+)
RNA-Seq Expression: Csor.00g102970
Synteny: Csor.00g102970
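
The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a location string such as 'Csor_Chr20: 1694051 .. 1706528 (+)'
    into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("Csor_Chr20: 1694051 .. 1706528 (+)")
print(seqid, strand, span_length(start, end))  # Csor_Chr20 + 12478
```

Under that assumption the gene spans 12,478 bp on the forward strand of Csor_Chr20, introns included.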
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGACGGCATTGGAGTGCTGGTCGAGTAGGGCTAGTACTGATGAGGATTTGGTGGAGCAGGTGCTGATGAGGACGCAGGATAGATCGGAAGGCTCCAAGCCGGAGAGTTCGTTGGCCGTCGGAGAGAAGGAGTCGTCGGCGATGCAGAAACGGTTGCAGAGATTTAGTCGGAACGTGTCGGAGGCGGTAGCGTCGCTTAAAAACTCCTTGAATCTGGACTCTGTTCGCGATCCCTCGCCTACGAAAACCGAGGGGTCTAAGAAGGCTGTCTGGGGGAGTGTTGTGCGGAATCTTACTCAGCTCTATCCTGGCAGTCAGTTGCCGGAAAAGCTCGTCTCCAATATTCGCAAGCATTACGATTCATTGCCTCTTAGGTATGGACTCTGATTAGTAATGTGTTGTCGTAAGTTTAAATTCGTGTTAACTGCAGAGATGGATTGAAAAAGAACGATATATATTATCGCGAACTGATTGGAGCTAGTCTAGTAACTGAGCGTCTTAATGGACGATAGATCAGATATCTGTTTCGTAAACATTTTCTAATCCTCTTAATTCTGTATTTTATTATCTGTTGTTTTTCTATTATGAACAGTTATGCTCAGGCGGGGTTTGAGATGAAAGATGTCTTTCTCCACATCAAATTGATAGAGCAGGCATCTGTTTATGATCACCCTGCCATCTTATTTCAAGAAGTGACGAATCATGACGTTCAAAAACCTACAATAAAGCTCACGTTTGCTTGCAACTCTTCTGTTTCATGGTCAGCGATGTCTGGAGCGTTGGAGAGCGCTGGCATTCGCTGTGAGAAAATACAGATTTTTGAGAAGAAGAAATTTAGTCTTGGAGTCATCCTTTTTGTAAATCTAGATGCTCAGGAGAAACTCTTCAAATCCAAGGTTGAAAATGCTCTTAAATTGGCTATTAAGAAGCCGAAAACTAATACAGTGAAGCTCCCATTTGGATTTTGTGGATGCCAAGAAGGTAACACTGGGGGGAAAGATCTGAGAGAAATCGAGGAGGATGCTGTTGATCAAAATTGCAGAAGTGGTTTCGAGAACTCGAATTTGAACGAAAATTTACAGATTGAAATGCCCTTATCGACTTCATCCTTTACTGTAACTGTCGATGAATGGCAAACAGTCCAATCTGGTGGACATGAATTGGGTAAATGGCTGCTAAGCTCTGAGAATCTTGAATTCACCGATCAGATTGGACCCAACTCATTCAAGGGAGTCTACAAGGGCAGAAGAGTTGCTATAGAGAAGATTAAAGGGTGTGAAAAGGGAGTTTCTTACAAGTTTGAGCTCCGAAAGGACTTGTTGGAGCTGATGACATGTGGGCACAAGAACATTCTGATGTTCTATGGTGTTTGTATTGATGAAAATCATGGCCTATGTGTGGTAACCAAACTAATGGAGGGCGGATCAGTCCATGAATTGATGCTGAAAAACAAAAGGCTTCAAATGAAAGAAATAACCAGGATTGCTGTTGATGTTGTAGAAGGGATCAAATTCATAAATGACCACGGTGTTGCTTATCGTGATCTTAACACACACAGAATTTTGTTAGATAAGAACGGCAATGCTTGTTTGGGAGACATGGGCATACTCACTGCATGCCGAAACTTAGGTGAGGCAATGGAATATGAGACCGATGGGTATCGATGGCTAGCTCCCGAGGTCTGTTCGCTATAACTCTGATTGCACTTGGTATTATCTAAAGCTAGTTTCAACTATAGTATGCATTGAAAAGAACCAAACACGTGTCTCTTTGTTTTTAGCTGAATAGCTCAATATCTACTCGTTTTTTTTTTTAATGAATTTATTTCTCTTTGGATTCATTACACTAAACCGCATGGGATGAGTCCCTTTCCAATTAAAGAGACTTTATGTCACTTTCTCCATGTTGGTCCAGTGTGTATAGCATACCGTCATGATCGAAGATGCTTTACCCATCAGTATATATATTTGTCTAGGACCTCTCATTTTCTTTAGACC
ATTAAGATTAGATTTCACTGCACCATCTGTTTGGCCGAAACTTTGAATTGTTGCCAGTTTGTGGAGCCCGCCATCTTAAAGCACTGCTCCTTTCTTCCAAGAAATTTGAAATTCCTTCAGATGTCATCTCCAATTGTCATTACTATCATTTAGAGTAATGACAGAGACAGGTTTTATTGAATTGTATTTGTTGCTCTATCTGGCTTGGCACAGATTATTGCTGGTGACCCGGAGAGTGTTAACGAGACATGGATGAGCAATGTATACAGCTTTGGAATGGTAATATGGGAGATGGTGACTGGCGAGGCAGCATATGGCGCATACTCGCCAGTGCAAGCGGCAGTTGGTATAGCTGCGTGTGGTCTGAGACCTGATGTTCCAAAGGACTGCCCGTCCACCCTGAAAGCTCTGATGATCAGGTGCTGGAACAATTGCCCATCGAAGCGCCCACAATTCTCCGAAGTTTTATCAATGCTGCTGGACTCCAACAATAACAATCATAAATGAAGTAAGTATCTTCCTCTTAATCTAAATTAATTCAAAATGTTTACTCAAGTTATTGTAAAGGGTTCCTTCTGCTCTGCCTTCAATGTAGATGTGAGAAGTTTGTGTTGTTAGGATATTAATCCATCTACTTTTCATAAAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACCCAATATCACCCTAGGAAAACCAACCCTCTTACTCAGTTACCACCACCCCGAAAGCAGGAAGATGACAAAGAAAGATCTTGAGAGAAAGTTTATATCTGTATTGTATGAGATGGAGAATGGATTCTTAAGAGATCATTTAATGTATGACTGAAGCAGATTCTGTGTAGCATGGGGTAGAGATAGGGAAAGAAATGATATTTACAGAGATTCCAATTGTTAAATCACTGCCAGAATAGGTTAATGTAAATTTAATCTTATATGATATCGTTTTATGTGGAAAATCTGTAGCAACGAAGAAAGGGTAGTCTGACTTCTCCTAGTTCTTACCTTTATGCTTTAAAACCATATGAGAGAGCACACAAGTTACTTTTGTTGCTTTCTTCTTCTATTTCTAACATGTTAATATGAATATTTATATGCCAACTTTATTGAATTTATGAGGAAGAAAAAGAACCGGGCAGGTTATGACTTACAATCGATTTATGGTTCGTTAAGTGGGAACATTTTGTGTTGACGAGTGGCATATTGACATGATCTCCACGAGTCGAATTACCAACAAGGACTTGGGGCATAAGTGTTAATGTGACCCGTGTTTCAAGTCACTCCAAAACATGTTGCCATGATATATGCAATGCCCTTGATGATATTATGACCCGTGTGTCAAAGTCATGCCAAAACATATTGACATGATCTCGATGACACCTTTGCTGATAAATCAAATTTAATAGGTATGAAAATCACATTTAAAACCGTCTTTTACATTGATTTTTATTTGACAAAAATGATTTTAATAAATTTTAAATGAGCTGGAAACATGTTAGTTGGATTTATTATGGTTGATTGTCTACCTAAGAATTGCTTGATTTTTCTTGTATTAAAATTACTCTTTGGTGAGTTGAATTTGTCTTTTATTATCTTGGTTGACTTTAATCGGCTTGGTTCATTTTTTGGAAACCTCAATTTTTTAGGTCGTTAATACATACAGACATACATAATTTAAAATTAAAAATAAAAAACAAAATGGCACTGAATTAGTGCCTGCATTTATGATTTATGGGTCGTGCTATTTTCTATTTTAAAATATAATTTACAAGTTTTTTTTTTATAAACTTTTTTTAAATGTTTTCTTATTATGTTCTTGTTTATTTATTCCATTTACATTAAATTAGAAACTGTGTGTAGTCATTTATAGGCACGTCGTTTTGGTAAGCATTAAATAGCACACGTGAAAACAAAGATTCATTTTCTGTTTTATTTATTTTACGAATGTATATCTTTTATTTAATTATAAAAACAAATTATTTAAGTTC
ACAAATAGGAAAAAATAAAAAAATAAAAAAAAACCTTCGAATTCAAATATTATTTTAAGTTAGGCTTTAGGGTTTATCCATCGAGAGCCCCACATTTTTTTGAAACAAAGTATTTTGGTTCTTGTCACTCTCTAGAGATCGAGAAAATTATAAGCATCAATATTAGCTAGAAACTCCGAGGATTAATCGTGATTTTAAGCTTCCACTTTCACAAATTATTTAAATTTAATTAATTATATAATTATCAATGTAATCGTAGTTGAACATATCTTGACTACGTATGTCATAAACTCGAAAAGATATTCTTAAAAAAAGAAAACAAAAAGTGTCCAATAGCAACTTGTAATAAAATAATATTTTGACATCCAATTAATTAGCGTATAATAATTAAAAAATGAAGTAAAAATAAATAAATAAATAAAGGAAAATAAAGGGCAGATGTAGCGAGAGTAGCGTGTTCCTTACCGTACCGAAGCATTTCTCTTTTTCCTCTTTTTCTCCCGCAATGGAGAGAGGAATAAACAGCATCTGTCCCCAATCTTCCCGAGGCCCCAAACCCTAGAGATCGATAAACCATGTCCCTGCAAAAAACCATATTTGCCCTTCTTCTCTTCCATATATGAACCCTTTTCACCTTCCATTTCCTCTCCAAGAAACCTCTGGATTCCACCTCAATTCCAAGAGCTTTAACCGTCTTCATCGCCACTGCGCCACATCCCCCATTTCCCCCTTGAGATTCCTTCCTTCATCTTCGTTTACGCTCCTAATCGGTACGTTTCCTTCTCCCTTTTTCTCTTAATTAATCTCTTTTTCTCTCTCTTTTTTGTTTTCGTTTCGTTGTTTCTTTCCACTCTTAAATCTGTGCCCATTTCCTGGATCATGTGGCCTGAGCTAGGGTTTATCGGTCTTCCTCCGATCGGTGATCTTTTAGTGTTTCGAGAGGTTTATGGCGGAGTTCTTGATGAATTATTTCTGGGTTTTGTTGTATTGCTTGTTTTCTTCATCATGCTGAGGTCTTTTGGGGTTTTCAGCAGCCTACTGCACTTTATGCTTCCCAGGAAGAGAGCTGGTGAAGAAGGTGTGGCTGTAGAAGAAAAAACTGATAACAGCAGCAGCAGCAGTAACAACAACAACAACAACAACAGCAACAACAGCGTTCGCAACGAGGGTGCGTCTCTGATCAAGAAGCAGCGGATCGATTCCGACTCCAACGCCAACAGTAAGGTAGCTGCCGTTGCCACCGGTGCTAACAACATCGTGTACGACGGTGCATCGTTGATCATGGCCTCGGCTAATTCTAATCCGCCCGATATTGATGAGGATCTGCACAGCCGACAACTTGCTGTGTATGGCCGCGAAACCATGCGGAAGCTTTTTGCGTCCAATGTCCTCATTTCGGGGATGCAAGGTCTTGGCGCTGAGATTGGTAAATACTGACGGAACTCCTTTTTCTTTTGCTAATTTTGCCTATACTGATCCGCTCCCTTGCATGTTTAGGGTCTGGTACCTGTGTCTTTTTGTTTGTGGCCGTGCTAGATGTTCATTAATAACTAACGAGCATCTGACGGTCGTCCTTTTTGGCTAAAATGCCCACACCTTCTATATATATGTGTGTATATATATTTACATTGAAGCGCATTCTCTTGGTCTTTCATTAGAGCTGATTGTGCTTTCTGTTTTTGCTTAATATTGTGATCTTATTTTGATTTGATTTGCGAGGGGATTTTCTTCATGCTTGCTTACTTTTGGCTCTTAATGTTTAGTTCTTAAGTTTTCTATTTTAACGATGATTTATTGTATTTTGCGTCTCTTCCTGCAGCAAAGAATGTTATTCTTGCTGGGGTGAAGTCTGTGACCTTGCATGACGAAGGTATAGTGGAGCTATGGGATCTGTCTAGTAATTTTGTGTTCTCAGAGAGTGATATTGGCAAAAACAGAGCACTTGCCTCTGCTCAGAAGTTGCAAGATCTCAACAATTCTGTTATTGTACACACCTTGAC
GACTGAGTTAGTTATAGAACAACTTTCCAAATTTGAGGTATTCTATCTTTAAGCATATTTTCTACTCATTTCTTTAAGGAGTGGTTAGATTCATCATATACTACATGTGTTCCTAAAAAAACGCATCCATTGGAATTGGAAGCCTAGCTTATTAAAACCATTTGATACATTGTATGCGAAATAAGAAGTTTAGTGACAAGGCCTACACTTCACTTCCAAATGTTCATATATTATTTTGTCTGGAATTTTGTATAGAACTGTAGAAGTTATGGTCATCTAATTTGAAAATCCCACATGCCCTTCTGATGAAGGTCAACTGGTATTCTCCTCACTTTATTTGTTTATAATTGTGAGCTTATGTCTTAGATTCTTCTATCATGTATCATTCTTATGTTACGTAGTAGACTGTTTCTTAATATTAGCCTATTGAGCACAAGATTCATGGCAGATCATCTCCAAGGCTTCCATTTGAAATGGTGATTGATACCTTCTCATACTCCTGAACTGCCTCTTGATGTCAAAGTAAAATTTGGTTTCAGAGTTCCCACCAAATTCTTCAAGGCACAAAAACAATCCAATGAGAGAGCCTTTTGTTTTCTTGTGAATATTACACCAATTTGGTCAGAATGTGCATTTTGGATGTATATTTTTGCAAGTGTTGAAGCCTCAATTAGAAAACTCTAAACGACCACCGTTAACTCTTTGAGGGAGTTGCGTTATCAAAGTGTTGTTAGGAAGTCTATTGGCATTTGAAGTTTTCTTCAGAGAGGCCATGGTTGGTAATTTTGCACGTCATATCATTTAGACATACCATGAGACCGTTGCCTGAACCTTTGCTAGTAGTTGAAAGAACACCGTGATAGGTTGCCATAAGGTTTAGTCTCTCTATCTTGTAGCCATTGGAGTTCTTGTGGAAGTAAAAGAAAGGGGGTCCTTACTATGACGTTTCCAATATTTTAGAAATGTGACAATTTTTTTTTATTAATATAGACGACTATGGTGGTATGTATGAGCTCGAGTAGTATCCATAAATGGTGGTGGGCAAATAGTGTTGACTTTGTTTGGAAGAGCTTTTTCCTAGATTTTGTTGGGGTCCATAGTTAAGCTTAAACCTTTATTTGGTAAGAGAGTTGGAAAGGTTTCCTAGTAGTAGCATTGGCAATATCACAACCATACTATGTAGGTTGTGACCCTCATGTCAAGACAAGCAAAGCGGCGGCTCTCAAGTGGTGTTCGTTTGGGCACGCAAGGGTCAATATAGAAAAATGTCATGATGTGACTGAGGAGGAGGGTGCATTATCCAACACACCATATAATGTGTGCATTGAAGAGGAAACGAGACATAAGAAAGATAGGGACAATGAGTGGACATGAGCGTCAAAGATAGGCTTGTTGAAGGGTCGGGTAGGGCCTTGAGCCTTAACCTTGAAAATTGGGGTTTTGGAGGGAACTTGGGCATGCGGGTCTTGAGTCAAGTCCATCCATAGGAAATATGAAAGCTCATCAAATTTGGTGTCAATCGGACCCTGAACGAACGCTCCACGACACCATATGTCCATTCACCAAAATCATGTCTGCACTTGCCACTTTGGAAATGCCTCAAGATGTGCTATAGGGGGAGGCATGGTGTTACCTCTCCACCACCTCTGGGCCTGGTGACCCTAAGCGAGCTACCCTGTGGCATGTGTGTCACATTGAGGGCAAGGATTTTGAGATGAGTGTCTCGGGCGACTTCCTTTGAAATGGGTCTTTCAAAACTTGGGTTGACGTCAAGACACATGGTTGTTGGGAACCAAGATGACAAGATGAGATGTGCATTGAGAAACCTTGGCCTTGAGTAGAGGTAAGTTGGTTGTGTGGTAGATGATTGAACATGCACTTGCAAGCAGTGTGTGGTCGAACAAGCTCGGATTCTGCTTGAGCAATGTTTGGTTGGTGAAGACAAACCTTGCCTAAGGTTTGGTGAAGCCACAAAGCCGGACGAAAAATGAATGGGTCTT
GAGTTCTCCTTAAGGGTGGTCGTGAGCATGTCAAGATCATGATGCTGCACAGTTAAAATAGTACGACCGTGAAACCTAGCATCTAGATATCAAAGCCCGAATTCCTGCATTTATGGAAACTATCATTTAATTGTAGTACTTGACTTTGAGAACGTGTTGAAGGTGAACGGGGATAGGGGGTTGATTATTTGAGAATTTTGAAATGAGTGATCTTTATTAGATTGCTCAAGGAAGGTTGGTTTGAGACAGATATTGGAGAGGAAAAGAACGTGTCAACAATACGAGATGTTTACAGTGATTTTTTCCTAGTGGTGGTTGGTTAAGATTTTTGACCATTAGGGTCCGCTCTAGACAAAATGACCCGCTCTGAAATGCTCCAGTGTTCTTGAAGGCATTGAAGAGTACCTAGATTGTTCAAAACGTATTCCGTTTGCAAAGCGATTTACTGATTTCCATTTTCTCTCTTACTGGTGAGGGTGTTAGACATAAGCTCCGGGCATGAGTGCGTTCAGATCTAGACACTTGTACCTGAAGGTTACATAATATTAATGAAGTCTATGCCAAGTGCTGCTTAAGATTTTACATATATTTAAATATAAACAGTGTCAAGTGAGACTGGTTACAGGAAATTGTTCCTGATTGATTATATTGGATTGATAAATTTCAGCCAATTCACAATATTATATTGGATTGATAAATTTCAGCCAATTCACAATATTATCTTTCTACCTCTCTACCTCAGCTTCCTTGTCTCAAAATAGAAATTCCATACTAAAAATGTGGATTGGAGCTCATCAAACTACGAGGAAAACGGCTCATAAGCTCTTTCTTATGATCTAAAAATGTTAAAAAAAAGTTTCTTTGGGATAATTTTATTACAGATTTCTCCTGGAATTCTTTGCAAGAGAGAAACAGGTGTATCTTCCAAGATAAAGTCTCTCATTTCGTAGTTTTTTCGAGGCTTTAGTTAATGTGGTTGTTTGTGGCTCTTTTCCATACCGATAGCTACTCTTCTCCTCTTTAATTTATGAGGCCTTTTGTAATCTTGCCCTTGGAGTTCCCCCCTTTCAATATCATGCTAAGAACGAAATTCTTTCTTATTAAAAAGAAAAAGGAAAGGCAAACTGAAAATTTAAACGAAAAATTTAGTTTTGAGATGATAAAATCACCATAGAATTTTTTTATCGAATTTTCAACTGGTTGCAAATTCATAGAAAGGATCATGGCTTCTGCTAGTTAAAATATATAATAGTTGGAGTTAACATCAATTATGTGTGCTTGTTTATCATGTATATGAGTTTCATGCGCATGATGTAACTTCTCTGTGGATGCAGGCTGTTGTTTTCACTGATACTGGCCTTGACAAGGCCATGGAATTCAATGACTTCTGTCATAACCACCAGCCTCCTATTGCATTTATCAAGACTGAAGTTAGAGGGCTCTTTGGTTCGGTATTTTGTGACTTTGGTCCCGAGTTCACTGTTTATGATGTGTATGGAGAGGACCCACACACTGGCATAATTGCGTCCATTAGCAATGACAATCCTGCACTCGTTTCCTGTGTTGATGATGAGAGGCTTGAGTTTCAGGATGGAGATCTTGTGGTGTTTTCTGAAGTTCATGGTATGACAGAGCTGAATGATGGGAAGCCGAGAAGGATTAAAAATTGCAGGGCCTATTCATTTACTCTCGAGGAGGATACTACAAACTTTGGTAGCTATGAGAAAGGTGGCATTGTCACACAGGTGAAAGAGCCCAAGATGTTGAACTTCAAGCCATTAAGAGAAGCAATCAACGATCCTGGTGACTTCCTTCTCAGTGATTTCTCCAAGTTCGATCGTCCTCCTCTCCTACACTTGGCATTCCTGGCCTTGGATAAATTCGTGACTGAGTTGGGTCGCCTACCAGTTGCTGGTTCTGAGGAGGATGCTCAAAAGCTGATTTCTGTTGCCAGTAATGTTAACGAGAGTCTAGGAGACGGGAGAGTCGAAGATATTAAT
CCTAAGCTTTTGAGACATTTTGCATTTGGTGCCAAGGCAGTACTGAATCCGATGGCTGCCATGTTTGGTGGTATTGTAGCTCAAGAGGTTCTCAAAGCGTGCTCCGGAAAGTTTCATCCACTTGTCCAGGTTAGTAGATTACTTATGTTTTGAGTCTGATAAATATGCCTTAGCGACCTTCGTGTTGCAACGCCTTGGACCTTTTTGTTCAATGAAGTACATACAAGTTTGCATCTCTCTCTGCTTATATGAAATAGATCATAATCTATGGGAACAGTGAAGTTATTGCAATAACTCTCCCTGCATGCTGGGATTGTCATTTGGTTACTGTTTCTTGCATCCTTACGGTATTTTCTTTCGAATATGCAGTTCTTCTATTTCGACTCGTTGGAGTCTCTTCCCACAGAGTCATTGGAAGCCAGTGATTTTAGACCCTTGAATAGTCGTTATGATGCACAGATTTCTGTGTTTGGGTCTAAACTTCAGAAGAAACTGGAAAATGCCAAAGTCTTTATGGTTGGATCTGGTGCTCTAGGTTGTGAGTTCTTGAAGAACCTTGCACTTATGGGAGTTTCATGTAGCAACGAAGGGAAGCTAACGATTACGGATGATGATGTAATTGAAAAAAGCAACCTTAGTCGGCAGTTCCTTTTCCGTGACTGGAACATTGGGCAGGCGAAATCTACTGTCGCTGCTTCTGCTGCTGTTGCAATTAATAAGCACCTCAACATCGAAGCTTTGCAGAATCGTGTTAGTCCCGAGACTGAAAACGTGTTCGATGATAGCTTTTGGGAGAATTTGAATGTCGTGGTTAACGCACTAGACAATGTAAATGCAAGGCTCTATGTCGATCAAAGGTGCTTATACTTCCAGAAACCACTTCTAGAATCTGGAACTCTTGGTGCTAAATGCAATACTCAAATGGTCATCCCTCACCTGACTGAAAACTATGGGGCATCAAGAGACCCCCCTGAGAAACAAGCGCCCATGTGCACTGTGCATTCGTTTCCACATAATATCGACCACTGTTTGACATGGGCTCGATCTGAGTTCGAGGGCTTGCTTGAGAAGACTCCTACTGATGTGAACGCTTATCTATCAAATCCCAGTGAATACACTTCTGCAATGATGAATGCTGGTGATGCTCAGTCTAGGGACACTTTAGAGCGCATTCTCGAGTGTCTTGATAGAGAAAGATGCGAGACATTTGAAGACTGCATCACGTGGGCTCGCTTGAAGTACTGTTCTTCATATTCTCAACTCTCATCTGTCTCCATTTAACTTTTGATGTTCAGTTGTTTAATGTTATTGATTCCTGTTGTGATATTATAACTTCGACAGGTTTGAAGATTATTTTTCGAACCGTGTGAAGCAGTTGATATACACATTTCCTGAAGATGCTGTAACTAGCAATGGGGCACCATTCTGGTCTGCCCCAAAGAGATTTCCCCATCCACTCCAGTTTTCAACTGCCGATCAAAGCTACCTTCATTTTGTTTTGGCGGCGGCTATACTAAGAGCAGAATCATATGCCATTCCGATTCCTGACTGGGTTAAGAACCCCACGAAATTGGCCGATGCAGTTGACCGAGTTATAGTACCAGATTTTATGCCCAAAAAAGATGCCAAGATTGTGACTGATGAGAAGGCAACCAGTCTCTCTACAGCATCGGTTGACGATGCAGCCATTATCCACGACCTGGTGAACAAATTGGAGGATACAAGCAGGAAGCTACCAGAGGGATTCAGGATGAAACCGATCCAGTTTGAGAAGGTCGTTCGTTAGAGGATTTATTGATATTGTTATGTGTGATCAAAATTATTTCTGGAGATCTTAGAATCAATGGTGTAAATCTAACTAGCTTAGTTTGGATGGTGCAGGATGATGATTCGAATTTCCATATGGATCTCATAGCTGGGCTGGCTAACATGAGAGCGAGGAATTACAGCATTCCTGAAGTAGACAAGTTGAAAGCCAAGTTCATTGCTGGAAGGATCAT
CCCCGCCATTGCTACCTCTACTGCAATGGCTACAGGTCTCGTCTGCCTCGAACTGTACAAAGTTCTAGATGGCGGTCACAAGGTGGAGGATTACCGTAACACGTTTGCGAACCTTGCATTGCCTCTGTTCTCCATGGCCGAGCCAGTCCCGCCCAAGGTCATTAAGCACCGGGACATGAGCTGGACTGTCTGGGACAGATGGATCATCAAAGACAACCCTACACTCAGACAACTTATTGAATGGTTGAAGAACAAGGGATTGAATGCCTACAGCATCTCGTGCGGTAGCTGTCTCCTGTACAATAGTATGTTCCCTCGACACAGAGATCGAATGGACAAGAAGGTAGTTGATTTAGCTCGAGATGTTGCCAAGGTGGAACTGCCTCCATACCGTCGACATTTGGATGTTGTCGTAGCATGCGAGGATGACGAGGATAATGATATTGACATCCCTCTGGTGTCGGTTTACTTCCGTTAG

mRNA sequence

ATGGCGACGGCATTGGAGTGCTGGTCGAGTAGGGCTAGTACTGATGAGGATTTGGTGGAGCAGGTGCTGATGAGGACGCAGGATAGATCGGAAGGCTCCAAGCCGGAGAGTTCGTTGGCCGTCGGAGAGAAGGAGTCGTCGGCGATGCAGAAACGGTTGCAGAGATTTAGTCGGAACGTGTCGGAGGCGGTAGCGTCGCTTAAAAACTCCTTGAATCTGGACTCTGTTCGCGATCCCTCGCCTACGAAAACCGAGGGGTCTAAGAAGGCTGTCTGGGGGAGTGTTGTGCGGAATCTTACTCAGCTCTATCCTGGCAGTCAGTTGCCGGAAAAGCTCGTCTCCAATATTCGCAAGCATTACGATTCATTGCCTCTTAGTTATGCTCAGGCGGGGTTTGAGATGAAAGATGTCTTTCTCCACATCAAATTGATAGAGCAGGCATCTGTTTATGATCACCCTGCCATCTTATTTCAAGAAGTGACGAATCATGACGTTCAAAAACCTACAATAAAGCTCACGTTTGCTTGCAACTCTTCTGTTTCATGGTCAGCGATGTCTGGAGCGTTGGAGAGCGCTGGCATTCGCTGTGAGAAAATACAGATTTTTGAGAAGAAGAAATTTAGTCTTGGAGTCATCCTTTTTGTAAATCTAGATGCTCAGGAGAAACTCTTCAAATCCAAGGTTGAAAATGCTCTTAAATTGGCTATTAAGAAGCCGAAAACTAATACAGTGAAGCTCCCATTTGGATTTTGTGGATGCCAAGAAGGTAACACTGGGGGGAAAGATCTGAGAGAAATCGAGGAGGATGCTGTTGATCAAAATTGCAGAAGTGGTTTCGAGAACTCGAATTTGAACGAAAATTTACAGATTGAAATGCCCTTATCGACTTCATCCTTTACTGTAACTGTCGATGAATGGCAAACAGTCCAATCTGGTGGACATGAATTGGGTAAATGGCTGCTAAGCTCTGAGAATCTTGAATTCACCGATCAGATTGGACCCAACTCATTCAAGGGAGTCTACAAGGGCAGAAGAGTTGCTATAGAGAAGATTAAAGGGTGTGAAAAGGGAGTTTCTTACAAGTTTGAGCTCCGAAAGGACTTGTTGGAGCTGATGACATGTGGGCACAAGAACATTCTGATGTTCTATGGTGTTTGTATTGATGAAAATCATGGCCTATGTGTGGTAACCAAACTAATGGAGGGCGGATCAGTCCATGAATTGATGCTGAAAAACAAAAGGCTTCAAATGAAAGAAATAACCAGGATTGCTGTTGATGTTGTAGAAGGGATCAAATTCATAAATGACCACGGTGTTGCTTATCGTGATCTTAACACACACAGAATTTTGTTAGATAAGAACGGCAATGCTTGTTTGGGAGACATGGGCATACTCACTGCATGCCGAAACTTAGGTGAGGCAATGGAATATGAGACCGATGGGTATCGATGGCTAGCTCCCGAGATTATTGCTGGTGACCCGGAGAGTGTTAACGAGACATGGATGAGCAATGTATACAGCTTTGGAATGGTAATATGGGAGATGGTGACTGGCGAGGCAGCATATGGCGCATACTCGCCAGTGCAAGCGGCAGTTGGTATAGCTGCGTGTGGTCTGAGACCTGATGTTCCAAAGGACTGCCCGTCCACCCTGAAAGCTCTGATGATCAGGTGCTGGAACAATTGCCCATCGAAGCGCCCACAATTCTCCGAAATTCCTTCCTTCATCTTCGTTTACGCTCCTAATCGCAGCCTACTGCACTTTATGCTTCCCAGGAAGAGAGCTGGTGAAGAAGGTGTGGCTGTAGAAGAAAAAACTGATAACAGCAGCAGCAGCAGTAACAACAACAACAACAACAACAGCAACAACAGCGTTCGCAACGAGGGTGCGTCTCTGATCAAGAAGCAGCGGATCGATTCCGACTCCAACGCCAACAGTAAGGTAGCTGCCGTTGCCACCGGTGCTAACAACATCGTGTACGACGGTGCATCGTTGATCAT
GGCCTCGGCTAATTCTAATCCGCCCGATATTGATGAGGATCTGCACAGCCGACAACTTGCTGTGTATGGCCGCGAAACCATGCGGAAGCTTTTTGCGTCCAATGTCCTCATTTCGGGGATGCAAGGTCTTGGCGCTGAGATTGCAAAGAATGTTATTCTTGCTGGGGTGAAGTCTGTGACCTTGCATGACGAAGGTATAGTGGAGCTATGGGATCTGTCTAGTAATTTTGTGTTCTCAGAGAGTGATATTGGCAAAAACAGAGCACTTGCCTCTGCTCAGAAGTTGCAAGATCTCAACAATTCTGTTATTGTACACACCTTGACGACTGAGTTAGTTATAGAACAACTTTCCAAATTTGAGGCTGTTGTTTTCACTGATACTGGCCTTGACAAGGCCATGGAATTCAATGACTTCTGTCATAACCACCAGCCTCCTATTGCATTTATCAAGACTGAAGTTAGAGGGCTCTTTGGTTCGGTATTTTGTGACTTTGGTCCCGAGTTCACTGTTTATGATGTGTATGGAGAGGACCCACACACTGGCATAATTGCGTCCATTAGCAATGACAATCCTGCACTCGTTTCCTGTGTTGATGATGAGAGGCTTGAGTTTCAGGATGGAGATCTTGTGGTGTTTTCTGAAGTTCATGGTATGACAGAGCTGAATGATGGGAAGCCGAGAAGGATTAAAAATTGCAGGGCCTATTCATTTACTCTCGAGGAGGATACTACAAACTTTGGTAGCTATGAGAAAGGTGGCATTGTCACACAGGTGAAAGAGCCCAAGATGTTGAACTTCAAGCCATTAAGAGAAGCAATCAACGATCCTGGTGACTTCCTTCTCAGTGATTTCTCCAAGTTCGATCGTCCTCCTCTCCTACACTTGGCATTCCTGGCCTTGGATAAATTCGTGACTGAGTTGGGTCGCCTACCAGTTGCTGGTTCTGAGGAGGATGCTCAAAAGCTGATTTCTGTTGCCAGTAATGTTAACGAGAGTCTAGGAGACGGGAGAGTCGAAGATATTAATCCTAAGCTTTTGAGACATTTTGCATTTGGTGCCAAGGCAGTACTGAATCCGATGGCTGCCATGTTTGGTGGTATTGTAGCTCAAGAGGTTCTCAAAGCGTGCTCCGGAAAGTTTCATCCACTTGTCCAGTTCTTCTATTTCGACTCGTTGGAGTCTCTTCCCACAGAGTCATTGGAAGCCAGTGATTTTAGACCCTTGAATAGTCGTTATGATGCACAGATTTCTGTGTTTGGGTCTAAACTTCAGAAGAAACTGGAAAATGCCAAAGTCTTTATGGTTGGATCTGGTGCTCTAGGTTGTGAGTTCTTGAAGAACCTTGCACTTATGGGAGTTTCATGTAGCAACGAAGGGAAGCTAACGATTACGGATGATGATGTAATTGAAAAAAGCAACCTTAGTCGGCAGTTCCTTTTCCGTGACTGGAACATTGGGCAGGCGAAATCTACTGTCGCTGCTTCTGCTGCTGTTGCAATTAATAAGCACCTCAACATCGAAGCTTTGCAGAATCGTGTTAGTCCCGAGACTGAAAACGTGTTCGATGATAGCTTTTGGGAGAATTTGAATGTCGTGGTTAACGCACTAGACAATGTAAATGCAAGGCTCTATGTCGATCAAAGGTGCTTATACTTCCAGAAACCACTTCTAGAATCTGGAACTCTTGGTGCTAAATGCAATACTCAAATGGTCATCCCTCACCTGACTGAAAACTATGGGGCATCAAGAGACCCCCCTGAGAAACAAGCGCCCATGTGCACTGTGCATTCGTTTCCACATAATATCGACCACTGTTTGACATGGGCTCGATCTGAGTTCGAGGGCTTGCTTGAGAAGACTCCTACTGATGTGAACGCTTATCTATCAAATCCCAGTGAATACACTTCTGCAATGATGAATGCTGGTGATGCTCAGTCTAGGGACACTTTAGAGCGCATTCTCGAGTGTCTTGATAGAGAAAGATGCGAGACATTTGAAG
ACTGCATCACGTGGGCTCGCTTGAAGTTTGAAGATTATTTTTCGAACCGTGTGAAGCAGTTGATATACACATTTCCTGAAGATGCTGTAACTAGCAATGGGGCACCATTCTGGTCTGCCCCAAAGAGATTTCCCCATCCACTCCAGTTTTCAACTGCCGATCAAAGCTACCTTCATTTTGTTTTGGCGGCGGCTATACTAAGAGCAGAATCATATGCCATTCCGATTCCTGACTGGGTTAAGAACCCCACGAAATTGGCCGATGCAGTTGACCGAGTTATAGTACCAGATTTTATGCCCAAAAAAGATGCCAAGATTGTGACTGATGAGAAGGCAACCAGTCTCTCTACAGCATCGGTTGACGATGCAGCCATTATCCACGACCTGGTGAACAAATTGGAGGATACAAGCAGGAAGCTACCAGAGGGATTCAGGATGAAACCGATCCAGTTTGAGAAGGATGATGATTCGAATTTCCATATGGATCTCATAGCTGGGCTGGCTAACATGAGAGCGAGGAATTACAGCATTCCTGAAGTAGACAAGTTGAAAGCCAAGTTCATTGCTGGAAGGATCATCCCCGCCATTGCTACCTCTACTGCAATGGCTACAGGTCTCGTCTGCCTCGAACTGTACAAAGTTCTAGATGGCGGTCACAAGGTGGAGGATTACCGTAACACGTTTGCGAACCTTGCATTGCCTCTGTTCTCCATGGCCGAGCCAGTCCCGCCCAAGGTCATTAAGCACCGGGACATGAGCTGGACTGTCTGGGACAGATGGATCATCAAAGACAACCCTACACTCAGACAACTTATTGAATGGTTGAAGAACAAGGGATTGAATGCCTACAGCATCTCGTGCGGTAGCTGTCTCCTGTACAATAGTATGTTCCCTCGACACAGAGATCGAATGGACAAGAAGGTAGTTGATTTAGCTCGAGATGTTGCCAAGGTGGAACTGCCTCCATACCGTCGACATTTGGATGTTGTCGTAGCATGCGAGGATGACGAGGATAATGATATTGACATCCCTCTGGTGTCGGTTTACTTCCGTTAG

Coding sequence (CDS)

ATGGCGACGGCATTGGAGTGCTGGTCGAGTAGGGCTAGTACTGATGAGGATTTGGTGGAGCAGGTGCTGATGAGGACGCAGGATAGATCGGAAGGCTCCAAGCCGGAGAGTTCGTTGGCCGTCGGAGAGAAGGAGTCGTCGGCGATGCAGAAACGGTTGCAGAGATTTAGTCGGAACGTGTCGGAGGCGGTAGCGTCGCTTAAAAACTCCTTGAATCTGGACTCTGTTCGCGATCCCTCGCCTACGAAAACCGAGGGGTCTAAGAAGGCTGTCTGGGGGAGTGTTGTGCGGAATCTTACTCAGCTCTATCCTGGCAGTCAGTTGCCGGAAAAGCTCGTCTCCAATATTCGCAAGCATTACGATTCATTGCCTCTTAGTTATGCTCAGGCGGGGTTTGAGATGAAAGATGTCTTTCTCCACATCAAATTGATAGAGCAGGCATCTGTTTATGATCACCCTGCCATCTTATTTCAAGAAGTGACGAATCATGACGTTCAAAAACCTACAATAAAGCTCACGTTTGCTTGCAACTCTTCTGTTTCATGGTCAGCGATGTCTGGAGCGTTGGAGAGCGCTGGCATTCGCTGTGAGAAAATACAGATTTTTGAGAAGAAGAAATTTAGTCTTGGAGTCATCCTTTTTGTAAATCTAGATGCTCAGGAGAAACTCTTCAAATCCAAGGTTGAAAATGCTCTTAAATTGGCTATTAAGAAGCCGAAAACTAATACAGTGAAGCTCCCATTTGGATTTTGTGGATGCCAAGAAGGTAACACTGGGGGGAAAGATCTGAGAGAAATCGAGGAGGATGCTGTTGATCAAAATTGCAGAAGTGGTTTCGAGAACTCGAATTTGAACGAAAATTTACAGATTGAAATGCCCTTATCGACTTCATCCTTTACTGTAACTGTCGATGAATGGCAAACAGTCCAATCTGGTGGACATGAATTGGGTAAATGGCTGCTAAGCTCTGAGAATCTTGAATTCACCGATCAGATTGGACCCAACTCATTCAAGGGAGTCTACAAGGGCAGAAGAGTTGCTATAGAGAAGATTAAAGGGTGTGAAAAGGGAGTTTCTTACAAGTTTGAGCTCCGAAAGGACTTGTTGGAGCTGATGACATGTGGGCACAAGAACATTCTGATGTTCTATGGTGTTTGTATTGATGAAAATCATGGCCTATGTGTGGTAACCAAACTAATGGAGGGCGGATCAGTCCATGAATTGATGCTGAAAAACAAAAGGCTTCAAATGAAAGAAATAACCAGGATTGCTGTTGATGTTGTAGAAGGGATCAAATTCATAAATGACCACGGTGTTGCTTATCGTGATCTTAACACACACAGAATTTTGTTAGATAAGAACGGCAATGCTTGTTTGGGAGACATGGGCATACTCACTGCATGCCGAAACTTAGGTGAGGCAATGGAATATGAGACCGATGGGTATCGATGGCTAGCTCCCGAGATTATTGCTGGTGACCCGGAGAGTGTTAACGAGACATGGATGAGCAATGTATACAGCTTTGGAATGGTAATATGGGAGATGGTGACTGGCGAGGCAGCATATGGCGCATACTCGCCAGTGCAAGCGGCAGTTGGTATAGCTGCGTGTGGTCTGAGACCTGATGTTCCAAAGGACTGCCCGTCCACCCTGAAAGCTCTGATGATCAGGTGCTGGAACAATTGCCCATCGAAGCGCCCACAATTCTCCGAAATTCCTTCCTTCATCTTCGTTTACGCTCCTAATCGCAGCCTACTGCACTTTATGCTTCCCAGGAAGAGAGCTGGTGAAGAAGGTGTGGCTGTAGAAGAAAAAACTGATAACAGCAGCAGCAGCAGTAACAACAACAACAACAACAACAGCAACAACAGCGTTCGCAACGAGGGTGCGTCTCTGATCAAGAAGCAGCGGATCGATTCCGACTCCAACGCCAACAGTAAGGTAGCTGCCGTTGCCACCGGTGCTAACAACATCGTGTACGACGGTGCATCGTTGATCAT
GGCCTCGGCTAATTCTAATCCGCCCGATATTGATGAGGATCTGCACAGCCGACAACTTGCTGTGTATGGCCGCGAAACCATGCGGAAGCTTTTTGCGTCCAATGTCCTCATTTCGGGGATGCAAGGTCTTGGCGCTGAGATTGCAAAGAATGTTATTCTTGCTGGGGTGAAGTCTGTGACCTTGCATGACGAAGGTATAGTGGAGCTATGGGATCTGTCTAGTAATTTTGTGTTCTCAGAGAGTGATATTGGCAAAAACAGAGCACTTGCCTCTGCTCAGAAGTTGCAAGATCTCAACAATTCTGTTATTGTACACACCTTGACGACTGAGTTAGTTATAGAACAACTTTCCAAATTTGAGGCTGTTGTTTTCACTGATACTGGCCTTGACAAGGCCATGGAATTCAATGACTTCTGTCATAACCACCAGCCTCCTATTGCATTTATCAAGACTGAAGTTAGAGGGCTCTTTGGTTCGGTATTTTGTGACTTTGGTCCCGAGTTCACTGTTTATGATGTGTATGGAGAGGACCCACACACTGGCATAATTGCGTCCATTAGCAATGACAATCCTGCACTCGTTTCCTGTGTTGATGATGAGAGGCTTGAGTTTCAGGATGGAGATCTTGTGGTGTTTTCTGAAGTTCATGGTATGACAGAGCTGAATGATGGGAAGCCGAGAAGGATTAAAAATTGCAGGGCCTATTCATTTACTCTCGAGGAGGATACTACAAACTTTGGTAGCTATGAGAAAGGTGGCATTGTCACACAGGTGAAAGAGCCCAAGATGTTGAACTTCAAGCCATTAAGAGAAGCAATCAACGATCCTGGTGACTTCCTTCTCAGTGATTTCTCCAAGTTCGATCGTCCTCCTCTCCTACACTTGGCATTCCTGGCCTTGGATAAATTCGTGACTGAGTTGGGTCGCCTACCAGTTGCTGGTTCTGAGGAGGATGCTCAAAAGCTGATTTCTGTTGCCAGTAATGTTAACGAGAGTCTAGGAGACGGGAGAGTCGAAGATATTAATCCTAAGCTTTTGAGACATTTTGCATTTGGTGCCAAGGCAGTACTGAATCCGATGGCTGCCATGTTTGGTGGTATTGTAGCTCAAGAGGTTCTCAAAGCGTGCTCCGGAAAGTTTCATCCACTTGTCCAGTTCTTCTATTTCGACTCGTTGGAGTCTCTTCCCACAGAGTCATTGGAAGCCAGTGATTTTAGACCCTTGAATAGTCGTTATGATGCACAGATTTCTGTGTTTGGGTCTAAACTTCAGAAGAAACTGGAAAATGCCAAAGTCTTTATGGTTGGATCTGGTGCTCTAGGTTGTGAGTTCTTGAAGAACCTTGCACTTATGGGAGTTTCATGTAGCAACGAAGGGAAGCTAACGATTACGGATGATGATGTAATTGAAAAAAGCAACCTTAGTCGGCAGTTCCTTTTCCGTGACTGGAACATTGGGCAGGCGAAATCTACTGTCGCTGCTTCTGCTGCTGTTGCAATTAATAAGCACCTCAACATCGAAGCTTTGCAGAATCGTGTTAGTCCCGAGACTGAAAACGTGTTCGATGATAGCTTTTGGGAGAATTTGAATGTCGTGGTTAACGCACTAGACAATGTAAATGCAAGGCTCTATGTCGATCAAAGGTGCTTATACTTCCAGAAACCACTTCTAGAATCTGGAACTCTTGGTGCTAAATGCAATACTCAAATGGTCATCCCTCACCTGACTGAAAACTATGGGGCATCAAGAGACCCCCCTGAGAAACAAGCGCCCATGTGCACTGTGCATTCGTTTCCACATAATATCGACCACTGTTTGACATGGGCTCGATCTGAGTTCGAGGGCTTGCTTGAGAAGACTCCTACTGATGTGAACGCTTATCTATCAAATCCCAGTGAATACACTTCTGCAATGATGAATGCTGGTGATGCTCAGTCTAGGGACACTTTAGAGCGCATTCTCGAGTGTCTTGATAGAGAAAGATGCGAGACATTTGAAG
ACTGCATCACGTGGGCTCGCTTGAAGTTTGAAGATTATTTTTCGAACCGTGTGAAGCAGTTGATATACACATTTCCTGAAGATGCTGTAACTAGCAATGGGGCACCATTCTGGTCTGCCCCAAAGAGATTTCCCCATCCACTCCAGTTTTCAACTGCCGATCAAAGCTACCTTCATTTTGTTTTGGCGGCGGCTATACTAAGAGCAGAATCATATGCCATTCCGATTCCTGACTGGGTTAAGAACCCCACGAAATTGGCCGATGCAGTTGACCGAGTTATAGTACCAGATTTTATGCCCAAAAAAGATGCCAAGATTGTGACTGATGAGAAGGCAACCAGTCTCTCTACAGCATCGGTTGACGATGCAGCCATTATCCACGACCTGGTGAACAAATTGGAGGATACAAGCAGGAAGCTACCAGAGGGATTCAGGATGAAACCGATCCAGTTTGAGAAGGATGATGATTCGAATTTCCATATGGATCTCATAGCTGGGCTGGCTAACATGAGAGCGAGGAATTACAGCATTCCTGAAGTAGACAAGTTGAAAGCCAAGTTCATTGCTGGAAGGATCATCCCCGCCATTGCTACCTCTACTGCAATGGCTACAGGTCTCGTCTGCCTCGAACTGTACAAAGTTCTAGATGGCGGTCACAAGGTGGAGGATTACCGTAACACGTTTGCGAACCTTGCATTGCCTCTGTTCTCCATGGCCGAGCCAGTCCCGCCCAAGGTCATTAAGCACCGGGACATGAGCTGGACTGTCTGGGACAGATGGATCATCAAAGACAACCCTACACTCAGACAACTTATTGAATGGTTGAAGAACAAGGGATTGAATGCCTACAGCATCTCGTGCGGTAGCTGTCTCCTGTACAATAGTATGTTCCCTCGACACAGAGATCGAATGGACAAGAAGGTAGTTGATTTAGCTCGAGATGTTGCCAAGGTGGAACTGCCTCCATACCGTCGACATTTGGATGTTGTCGTAGCATGCGAGGATGACGAGGATAATGATATTGACATCCCTCTGGTGTCGGTTTACTTCCGTTAG

Protein sequence

MATALECWSSRASTDEDLVEQVLMRTQDRSEGSKPESSLAVGEKESSAMQKRLQRFSRNVSEAVASLKNSLNLDSVRDPSPTKTEGSKKAVWGSVVRNLTQLYPGSQLPEKLVSNIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHDVQKPTIKLTFACNSSVSWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKVENALKLAIKKPKTNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENLQIEMPLSTSSFTVTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAIEKIKGCEKGVSYKFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKNKRLQMKEITRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTACRNLGEAMEYETDGYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAVGIAACGLRPDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFVYAPNRSLLHFMLPRKRAGEEGVAVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQRIDSDSNANSKVAAVATGANNIVYDGASLIMASANSNPPDIDEDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVELWDLSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDKAMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEKGGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLPVAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQEVLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAKVFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDCITWARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAAILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAIIHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPKVIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMDKKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYFR
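
The protein sequence is the conceptual translation of the CDS. As a small illustration, the sketch below translates the first six and the last four codons of the CDS listed above, assuming the standard nuclear genetic code; the `translate` function is written here for illustration and is not taken from any particular library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCGACGGCATTGGAG"))  # MATALE (start of the protein)
print(translate("TACTTCCGTTAG"))        # YFR (end of the protein; TAG is the stop)
```

Both results agree with the first and last residues of the protein sequence shown above.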
Homology
BLAST of Csor.00g102970 vs. ExPASy Swiss-Prot
Match: P93028 (Ubiquitin-activating enzyme E1 1 OS=Arabidopsis thaliana OX=3702 GN=UBA1 PE=1 SV=1)

HSP 1 Score: 1729.9 bits (4479), Expect = 0.0e+00
Identity = 832/1071 (77.68%), Positives = 955/1071 (89.17%), Query Frame = 0

Query: 618  NNNSNNSVRNEGASLIKKQRIDSDSNANSKVAAVATGANNIVYDGASLI----MASANSN 677
            N+ ++N++     +  KK+RID   +++ K +++    ++  + G S++    MA  NSN
Sbjct: 10   NDKNDNTIIGSDLASSKKRRIDFTESSSDKSSSILASGSSRGFHGDSVVQQIDMAFGNSN 69

Query: 678  PPDIDEDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGI 737
              +IDEDLHSRQLAVYGRETMR+LFASNVLISGM GLGAEIAKN+ILAGVKSVTLHDE +
Sbjct: 70   RQEIDEDLHSRQLAVYGRETMRRLFASNVLISGMHGLGAEIAKNLILAGVKSVTLHDERV 129

Query: 738  VELWDLSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTD 797
            VELWDLSSNFVFSE D+GKNRA AS QKLQDLNN+V+V +LT  L  E LS F+ VVF+D
Sbjct: 130  VELWDLSSNFVFSEDDVGKNRADASVQKLQDLNNAVVVSSLTKSLNKEDLSGFQVVVFSD 189

Query: 798  TGLDKAMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASI 857
              +++A+EF+D+CH+HQPPIAF+K +VRGLFGSVFCDFGPEF V DV GE+PHTGIIASI
Sbjct: 190  ISMERAIEFDDYCHSHQPPIAFVKADVRGLFGSVFCDFGPEFAVLDVDGEEPHTGIIASI 249

Query: 858  SNDNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNF 917
            SN+N A +SCVDDERLEF+DGDLVVFSEV GMTELNDGKPR+IK+ R YSFTL+EDTTN+
Sbjct: 250  SNENQAFISCVDDERLEFEDGDLVVFSEVEGMTELNDGKPRKIKSTRPYSFTLDEDTTNY 309

Query: 918  GSYEKGGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTE 977
            G+Y KGGIVTQVK+PK+LNFKPLREA+ DPGDFL SDFSKFDRPPLLHLAF ALD F  E
Sbjct: 310  GTYVKGGIVTQVKQPKLLNFKPLREALKDPGDFLFSDFSKFDRPPLLHLAFQALDHFKAE 369

Query: 978  LGRLPVAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGG 1037
             GR PVAGSEEDAQKLIS+A+ +N   GD +VE+++ KLLRHF+FGAKAVLNPMAAMFGG
Sbjct: 370  AGRFPVAGSEEDAQKLISIATAINTGQGDLKVENVDQKLLRHFSFGAKAVLNPMAAMFGG 429

Query: 1038 IVAQEVLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKK 1097
            IV QEV+KACSGKFHPL QFFYFDS+ESLP+E +++SDF P NSRYDAQISVFG+K QKK
Sbjct: 430  IVGQEVVKACSGKFHPLFQFFYFDSVESLPSEPVDSSDFAPRNSRYDAQISVFGAKFQKK 489

Query: 1098 LENAKVFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIG 1157
            LE+AKVF VGSGALGCEFLKNLALMGVSC ++GKLT+TDDD+IEKSNLSRQFLFRDWNIG
Sbjct: 490  LEDAKVFTVGSGALGCEFLKNLALMGVSCGSQGKLTVTDDDIIEKSNLSRQFLFRDWNIG 549

Query: 1158 QAKSTVAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVD 1217
            QAKSTVAASAA  IN   NIEALQNRV  ETENVFDD+FWENL VVVNALDNVNARLYVD
Sbjct: 550  QAKSTVAASAAAVINPRFNIEALQNRVGAETENVFDDAFWENLTVVVNALDNVNARLYVD 609

Query: 1218 QRCLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCL 1277
             RCLYFQKPLLESGTLG KCNTQ VIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCL
Sbjct: 610  SRCLYFQKPLLESGTLGTKCNTQSVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCL 669

Query: 1278 TWARSEFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFE 1337
            TWARSEFEGLLEKTP +VNAYLS+P EYT++MM+AGDAQ+RDTLERI+ECL++E+CETF+
Sbjct: 670  TWARSEFEGLLEKTPAEVNAYLSSPVEYTNSMMSAGDAQARDTLERIVECLEKEKCETFQ 729

Query: 1338 DCITWARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHF 1397
            DC+TWARL+FEDYF NRVKQLIYTFPEDA TS GAPFWSAPKRFP PLQ+S++D S L+F
Sbjct: 730  DCLTWARLRFEDYFVNRVKQLIYTFPEDAATSTGAPFWSAPKRFPRPLQYSSSDPSLLNF 789

Query: 1398 VLAAAILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASV 1457
            + A AILRAE++ IPIP+W KNP + A+AVDRVIVPDF P++DAKIVTDEKAT+L+TASV
Sbjct: 790  ITATAILRAETFGIPIPEWTKNPKEAAEAVDRVIVPDFEPRQDAKIVTDEKATTLTTASV 849

Query: 1458 DDAAIIHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEV 1517
            DDAA+I DL+ K++     L   FRMKPIQFEKDDD+N+HMD+IAGLANMRARNYSIPEV
Sbjct: 850  DDAAVIDDLIAKIDQCRHNLSPDFRMKPIQFEKDDDTNYHMDVIAGLANMRARNYSIPEV 909

Query: 1518 DKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAE 1577
            DKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVE YRNTFANLALPLFSMAE
Sbjct: 910  DKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEAYRNTFANLALPLFSMAE 969

Query: 1578 PVPPKVIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRH 1637
            P+PPKV+KHRDM+WTVWDRW++K NPTLR++++WL++KGL+AYSISCGSCLL+NSMF RH
Sbjct: 970  PLPPKVVKHRDMAWTVWDRWVLKGNPTLREVLQWLEDKGLSAYSISCGSCLLFNSMFTRH 1029

Query: 1638 RDRMDKKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYFR 1685
            ++RMDKKVVDLARDVAKVELPPYR HLDVVVACED++DND+DIPLVS+YFR
Sbjct: 1030 KERMDKKVVDLARDVAKVELPPYRNHLDVVVACEDEDDNDVDIPLVSIYFR 1080
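
The percentages in each HSP header follow directly from the reported counts: matching positions divided by alignment length. A minimal sketch of that arithmetic, using the counts from the hits on this page (the function name `percent` is hypothetical):

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage string with two decimals."""
    return f"{100.0 * matches / alignment_length:.2f}"

# (identical, positive, alignment length) as reported in the HSP headers
hits = {
    "P93028": (832, 955, 1071),
    "P92974": (821, 932, 1063),
    "P20973": (795, 914, 1010),
}

for accession, (identical, positive, length) in hits.items():
    print(accession, percent(identical, length), percent(positive, length))
```

Identical positions are exact residue matches, while positive positions also include conservative substitutions, which are marked with `+` in the alignment midline.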

BLAST of Csor.00g102970 vs. ExPASy Swiss-Prot
Match: P92974 (Ubiquitin-activating enzyme E1 2 OS=Arabidopsis thaliana OX=3702 GN=UBA2 PE=1 SV=1)

HSP 1 Score: 1691.8 bits (4380), Expect = 0.0e+00
Identity = 821/1063 (77.23%), Positives = 932/1063 (87.68%), Query Frame = 0

Query: 630  ASLIKKQRID----SDSNANSKVAAVATGANNIVYDGASLIMASA-----NSNPPDIDED 689
            +S +KK+RID    +D +A +   + + G NN +  G   +M+ A     NSN  +IDED
Sbjct: 15   SSPMKKRRIDHTESADGSAINASNSSSIGLNNSI-GGNDTVMSMAEFGNDNSNNQEIDED 74

Query: 690  LHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVELWDLS 749
            LHSRQLAVYGRETMRKLFASNVLISGMQGLG EIAKN+ILAGVKSVTLHDE +VELWDLS
Sbjct: 75   LHSRQLAVYGRETMRKLFASNVLISGMQGLGVEIAKNIILAGVKSVTLHDENVVELWDLS 134

Query: 750  SNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDKAM 809
            SNFVF+E DIGKNRALAS  KLQ+LNN+V V TLT +L  EQLS F+ VVF D   +KA 
Sbjct: 135  SNFVFTEEDIGKNRALASVHKLQELNNAVAVSTLTGKLTKEQLSDFQVVVFVDISFEKAT 194

Query: 810  EFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNPAL 869
            E +D+CH+HQPPIAFIK +VRGLFGS+FCDFGP FTV DV GE+PH+GIIAS+SN+NP  
Sbjct: 195  EIDDYCHSHQPPIAFIKADVRGLFGSLFCDFGPHFTVLDVDGEEPHSGIIASVSNENPGF 254

Query: 870  VSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEKGG 929
            VSCVDDERLEF+DG+LVVFSEV GMTELNDGKPR+IKN + +SFTLEEDT+++G Y KGG
Sbjct: 255  VSCVDDERLEFEDGNLVVFSEVEGMTELNDGKPRKIKNVKPFSFTLEEDTSSYGQYMKGG 314

Query: 930  IVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLPVA 989
            IVTQVK+PK+LNFKPLREA+ DPGDFLLSDFSKFDRPPLLHLAF ALD+F ++ GR P A
Sbjct: 315  IVTQVKQPKVLNFKPLREALKDPGDFLLSDFSKFDRPPLLHLAFQALDRFSSQAGRFPFA 374

Query: 990  GSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQEVL 1049
            GSEEDAQKL+ +A ++NE LGD R+ED+N KLLRH AFG++AVLNPMAAMFGGIV QEV+
Sbjct: 375  GSEEDAQKLVEIAVDINEGLGDARLEDVNSKLLRHLAFGSRAVLNPMAAMFGGIVGQEVV 434

Query: 1050 KACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAKVF 1109
            KACSGKFHP+ QFFYFDS+ESLP E L+AS+FRP NSRYDAQISVFGS LQKKLE+A+VF
Sbjct: 435  KACSGKFHPIFQFFYFDSVESLPKEPLDASEFRPQNSRYDAQISVFGSTLQKKLEDARVF 494

Query: 1110 MVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVA 1169
            +VG+GALGCEFLKNLALMGVSC  +GKLT+TDDDVIEKSNLSRQFLFRDWNIGQAKSTVA
Sbjct: 495  VVGAGALGCEFLKNLALMGVSCGTQGKLTVTDDDVIEKSNLSRQFLFRDWNIGQAKSTVA 554

Query: 1170 ASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLYFQ 1229
            A+AA  IN  LNI+ALQNRV PETENVFDDSFWENL VVVNALDNV ARLYVD RC+YFQ
Sbjct: 555  ATAAAGINSRLNIDALQNRVGPETENVFDDSFWENLTVVVNALDNVTARLYVDSRCVYFQ 614

Query: 1230 KPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEF 1289
            KPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEF
Sbjct: 615  KPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEF 674

Query: 1290 EGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDCITWAR 1349
            EGLLEKTP +VNAYLS+P EY  AM  AGDAQ+RDTL R++ECL++E+C +F+DCITWAR
Sbjct: 675  EGLLEKTPAEVNAYLSDPVEYMKAMRTAGDAQARDTLGRVVECLEKEKCNSFQDCITWAR 734

Query: 1350 LKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAAIL 1409
            L+FEDYF+NRVKQL YTFPEDA TS GAPFWSAPKRFP PLQFS+ D S+++FV+AA+IL
Sbjct: 735  LRFEDYFANRVKQLCYTFPEDAATSTGAPFWSAPKRFPRPLQFSSTDLSHINFVMAASIL 794

Query: 1410 RAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAIIH 1469
            RAE++ IP P+W K    LA+AV+RVIVPDF PKKDA IVTDEKAT+LSTASVDDAA+I 
Sbjct: 795  RAETFGIPTPEWAKTRAGLAEAVERVIVPDFEPKKDATIVTDEKATTLSTASVDDAAVID 854

Query: 1470 DLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKAKF 1529
            +L  KL      L   FRMK IQFEKDDD+N+HMD+IAGLANMRARNYS+PEVDKLKAKF
Sbjct: 855  ELNAKLVRCRMSLQPEFRMKAIQFEKDDDTNYHMDMIAGLANMRARNYSVPEVDKLKAKF 914

Query: 1530 IAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPKVI 1589
            IAGRIIPAIATSTAMATG VCLE+YKVLDG HKVEDYRNTFANLALPLFSMAEPVPPKV+
Sbjct: 915  IAGRIIPAIATSTAMATGFVCLEMYKVLDGSHKVEDYRNTFANLALPLFSMAEPVPPKVV 974

Query: 1590 KHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMDKK 1649
            KH+D SWTVWDRW+++ NPTLR+L++WLK KGLNAYSISCGS LLYNSMF RH++RM+++
Sbjct: 975  KHQDQSWTVWDRWVMRGNPTLRELLDWLKEKGLNAYSISCGSSLLYNSMFSRHKERMNRR 1034

Query: 1650 VVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYF 1684
            VVDLARDVA VELP YRRH+DVVVACEDD D D+DIPLVSVYF
Sbjct: 1035 VVDLARDVAGVELPAYRRHVDVVVACEDDNDADVDIPLVSVYF 1076

BLAST of Csor.00g102970 vs. ExPASy Swiss-Prot
Match: P20973 (Ubiquitin-activating enzyme E1 1 OS=Triticum aestivum OX=4565 GN=UBA1 PE=1 SV=1)

HSP 1 Score: 1676.8 bits (4341), Expect = 0.0e+00
Identity = 795/1010 (78.71%), Positives = 914/1010 (90.50%), Query Frame = 0

Query: 676  DIDEDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVE 735
            +IDEDLHSRQLAVYGRETM++LF SNVL+SG+QGLGAEIAKN++LAGVKSVTLHD+G VE
Sbjct: 42   EIDEDLHSRQLAVYGRETMKRLFGSNVLVSGLQGLGAEIAKNLVLAGVKSVTLHDDGNVE 101

Query: 736  LWDLSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTG 795
            LWDLSSNF  SE+D+G+NRA A  QKLQ+LNN+V+V  LT +L  E LSKF+AVVFTD  
Sbjct: 102  LWDLSSNFFLSENDVGQNRAQACVQKLQELNNAVLVSALTGDLTKEHLSKFQAVVFTDIS 161

Query: 796  LDKAMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISN 855
            LDKA+EF+D+CH+ QPPIAFIK+EVRGLFGSVFCDFGPEFTV DV GE+PHTGI+ASISN
Sbjct: 162  LDKAIEFDDYCHSQQPPIAFIKSEVRGLFGSVFCDFGPEFTVLDVDGEEPHTGIVASISN 221

Query: 856  DNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGS 915
            DNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPR++KN R YSF LEEDT++FG+
Sbjct: 222  DNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRKVKNARPYSFFLEEDTSSFGA 281

Query: 916  YEKGGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELG 975
            Y +GGIVTQVK PK++ FKPL+EA+++PG+FL+SDFSKF+RPPLLHLAF ALDKF TEL 
Sbjct: 282  YVRGGIVTQVKPPKVIKFKPLKEAMSEPGEFLMSDFSKFERPPLLHLAFQALDKFRTELS 341

Query: 976  RLPVAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIV 1035
            R PVAGS +D Q++I  A ++N++LGD ++E+I+ KLL HFA G++AVLNPMAAMFGGIV
Sbjct: 342  RFPVAGSTDDVQRVIEYAISINDTLGDRKLEEIDKKLLHHFASGSRAVLNPMAAMFGGIV 401

Query: 1036 AQEVLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLE 1095
             QEV+KACSGKFHPL QFFYFDS+ESLP + LE  D +P NSRYDAQISVFGSKLQ KLE
Sbjct: 402  GQEVVKACSGKFHPLYQFFYFDSVESLPVDPLEPGDLKPKNSRYDAQISVFGSKLQNKLE 461

Query: 1096 NAKVFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQA 1155
             AK+FMVGSGALGCEFLKNLALMG+SCS  G LT+TDDDVIEKSNLSRQFLFRDWNIGQ 
Sbjct: 462  EAKIFMVGSGALGCEFLKNLALMGISCSQNGNLTLTDDDVIEKSNLSRQFLFRDWNIGQP 521

Query: 1156 KSTVAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQR 1215
            KSTVAA+AA+ IN  L++EALQNR SPETENVF+D+FWENL+ VVNALDNV AR+Y+D R
Sbjct: 522  KSTVAATAAMVINPKLHVEALQNRASPETENVFNDAFWENLDAVVNALDNVTARMYIDSR 581

Query: 1216 CLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTW 1275
            C+YFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTW
Sbjct: 582  CVYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTW 641

Query: 1276 ARSEFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDC 1335
            ARSEFEGLLEKTPT+VNA+LSNP+ Y SA   AGDAQ+RD LER++ECLDR++CETF+D 
Sbjct: 642  ARSEFEGLLEKTPTEVNAFLSNPTTYISAARTAGDAQARDQLERVIECLDRDKCETFQDS 701

Query: 1336 ITWARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVL 1395
            ITWARLKFEDYFSNRVKQL +TFPED++TS+GAPFWSAPKRFP P++FS++DQS L F+L
Sbjct: 702  ITWARLKFEDYFSNRVKQLTFTFPEDSMTSSGAPFWSAPKRFPRPVEFSSSDQSQLSFIL 761

Query: 1396 AAAILRAESYAIPIPDWVKNPTKL-ADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVD 1455
            AAAILRAE++ IPIP+W K P KL A+AVD+VIVPDF PK+  KIVT EKATSLS+ASVD
Sbjct: 762  AAAILRAETFGIPIPEWAKTPNKLAAEAVDKVIVPDFQPKQGVKIVTHEKATSLSSASVD 821

Query: 1456 DAAIIHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVD 1515
            DAA+I +L+ KLE+ S+ LP GF M PIQFEKDDD+NFHMD+IAG ANMRARNYSIPEVD
Sbjct: 822  DAAVIEELIAKLEEVSKTLPSGFHMNPIQFEKDDDTNFHMDVIAGFANMRARNYSIPEVD 881

Query: 1516 KLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEP 1575
            KLKAKFIAGRIIPAIATSTAMATGLVCLELYK L GGHKVEDYRNTFANLA+PLFS+AEP
Sbjct: 882  KLKAKFIAGRIIPAIATSTAMATGLVCLELYKALAGGHKVEDYRNTFANLAIPLFSIAEP 941

Query: 1576 VPPKVIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHR 1635
            VPPK IKH+++SWTVWDRW +  N TLR+L+EWLK KGLNAYSISCG+ LLYNSMFPRH+
Sbjct: 942  VPPKTIKHQELSWTVWDRWTVTGNITLRELLEWLKEKGLNAYSISCGTSLLYNSMFPRHK 1001

Query: 1636 DRMDKKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYFR 1685
            +R+D+KVVD+AR+VAK+E+P YRRHLDVVVACEDD+DND+DIPLVSVYFR
Sbjct: 1002 ERLDRKVVDVAREVAKMEVPSYRRHLDVVVACEDDDDNDVDIPLVSVYFR 1051

BLAST of Csor.00g102970 vs. ExPASy Swiss-Prot
Match: P31251 (Ubiquitin-activating enzyme E1 2 OS=Triticum aestivum OX=4565 GN=UBA2 PE=2 SV=1)

HSP 1 Score: 1674.1 bits (4334), Expect = 0.0e+00
Identity = 794/1010 (78.61%), Positives = 912/1010 (90.30%), Query Frame = 0

Query: 676  DIDEDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVE 735
            +IDEDLHSRQLAVYGRETM+ LF SNVL+SG+QGLGAEIAKN++LAGVKSVTLHD+G VE
Sbjct: 42   EIDEDLHSRQLAVYGRETMKPLFGSNVLVSGLQGLGAEIAKNLVLAGVKSVTLHDDGNVE 101

Query: 736  LWDLSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTG 795
            LWDLSSNF  SE+D+G+NRA A  QKLQ+LNN+V+V  LT +L  E LSKF+AVVFTD  
Sbjct: 102  LWDLSSNFFLSENDVGQNRAQACVQKLQELNNAVLVSALTGDLTKEHLSKFQAVVFTDIS 161

Query: 796  LDKAMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISN 855
            LDKA+EF+D+CH+HQPPIAFIK+EVRGLFGSVFCDFGPEFTV DV GE+PHTGI+ASISN
Sbjct: 162  LDKAIEFDDYCHSHQPPIAFIKSEVRGLFGSVFCDFGPEFTVLDVDGEEPHTGIVASISN 221

Query: 856  DNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGS 915
            DNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPR++KN R YSF LEEDT++FG+
Sbjct: 222  DNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRKVKNARPYSFFLEEDTSSFGA 281

Query: 916  YEKGGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELG 975
            Y +GGIVTQVK PK++ FKPL+EA+++PG+FL+SDFSKF+RPPLLHLAF ALDKF TEL 
Sbjct: 282  YVRGGIVTQVKPPKVIKFKPLKEAMSEPGEFLMSDFSKFERPPLLHLAFQALDKFRTELS 341

Query: 976  RLPVAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIV 1035
            R PVAGS +D Q++I  A ++N++LGD ++E+I+ KLL HFA G++AVLNPMAAMFGGIV
Sbjct: 342  RFPVAGSTDDVQRVIEYAISINDTLGDRKLEEIDKKLLHHFASGSRAVLNPMAAMFGGIV 401

Query: 1036 AQEVLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLE 1095
             QEV+KACSGKFHPL QFFYFDS+ESLP + LE  D +P NSRYDAQISVFGS LQ KLE
Sbjct: 402  GQEVVKACSGKFHPLYQFFYFDSVESLPVDPLEPGDLKPKNSRYDAQISVFGSTLQNKLE 461

Query: 1096 NAKVFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQA 1155
             AK+FMVGSGALGCEFLKNLALMG+SCS  G LT+TDDDVIEKSNLSRQFLFRDWNIGQ 
Sbjct: 462  EAKIFMVGSGALGCEFLKNLALMGISCSQNGNLTVTDDDVIEKSNLSRQFLFRDWNIGQP 521

Query: 1156 KSTVAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQR 1215
            KSTVAA+AA+ IN  L++EALQNR SPETENVF+D+FWENL+ VVNALDNV AR+Y+D R
Sbjct: 522  KSTVAATAAMVINPKLHVEALQNRASPETENVFNDAFWENLDAVVNALDNVTARMYIDSR 581

Query: 1216 CLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTW 1275
            C+YFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTW
Sbjct: 582  CVYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTW 641

Query: 1276 ARSEFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDC 1335
            ARSEFEGLLEKTPT+VNA+LSNP+ Y SA   AGDAQ+RD LER++ECLDR++CETF+D 
Sbjct: 642  ARSEFEGLLEKTPTEVNAFLSNPTTYISAARTAGDAQARDQLERVIECLDRDKCETFQDS 701

Query: 1336 ITWARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVL 1395
            ITWARLKFEDYFSNRVKQL +TFPED++TS+GAPFWSAPKRFP P++FS++D S L F+L
Sbjct: 702  ITWARLKFEDYFSNRVKQLTFTFPEDSMTSSGAPFWSAPKRFPRPVEFSSSDPSQLSFIL 761

Query: 1396 AAAILRAESYAIPIPDWVKNPTKL-ADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVD 1455
            AAAILRAE++ IPI +W K P KL A+AVD+VIVPDF PK+  KIVTDEKATSLS+ASVD
Sbjct: 762  AAAILRAETFGIPISEWAKTPNKLAAEAVDKVIVPDFQPKQGVKIVTDEKATSLSSASVD 821

Query: 1456 DAAIIHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVD 1515
            DAA+I +L+ KLE+ S+ LP GF M PIQFEKDDD+NFHMD+IAG ANMRARNYSIPEVD
Sbjct: 822  DAAVIEELIAKLEEVSKTLPSGFHMNPIQFEKDDDTNFHMDVIAGFANMRARNYSIPEVD 881

Query: 1516 KLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEP 1575
            KLKAKFIAGRIIPAIATSTAMATGLVCLELYK L GGHKVEDYRNTFANLA+PLFS+AEP
Sbjct: 882  KLKAKFIAGRIIPAIATSTAMATGLVCLELYKALAGGHKVEDYRNTFANLAIPLFSIAEP 941

Query: 1576 VPPKVIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHR 1635
            VPPK IKH+++SWTVWDRW +  N TLR+L+EWLK KGLNAYSISCG+ LLYNSMFPRH+
Sbjct: 942  VPPKTIKHQELSWTVWDRWTVTGNITLRELLEWLKEKGLNAYSISCGTSLLYNSMFPRHK 1001

Query: 1636 DRMDKKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYFR 1685
            +R+D+KVVD+AR+VAK+E+P YRRHLDVVVACEDD+DND+DIPLVSVYFR
Sbjct: 1002 ERLDRKVVDVAREVAKMEVPSYRRHLDVVVACEDDDDNDVDIPLVSVYFR 1051

BLAST of Csor.00g102970 vs. ExPASy Swiss-Prot
Match: P31252 (Ubiquitin-activating enzyme E1 3 OS=Triticum aestivum OX=4565 GN=UBA3 PE=1 SV=1)

HSP 1 Score: 1605.1 bits (4155), Expect = 0.0e+00
Identity = 761/1011 (75.27%), Positives = 889/1011 (87.93%), Query Frame = 0

Query: 674  PPDIDEDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGI 733
            P +IDEDLHSRQLAVYGRETMR+LFAS+VL+SG+ GLGAEIAKN+ LAGVKSVT+HD   
Sbjct: 43   PQEIDEDLHSRQLAVYGRETMRRLFASDVLVSGLNGLGAEIAKNLALAGVKSVTIHDVKT 102

Query: 734  VELWDLSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTD 793
            V++WDLS NF  SE DIGKNRA A   KLQ+LNN+V++  LT EL  E LSKF+AVVFTD
Sbjct: 103  VKMWDLSGNFFLSEDDIGKNRAAACVAKLQELNNAVLISALTEELTTEHLSKFQAVVFTD 162

Query: 794  TGLDKAMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASI 853
              LDKA EF+D+CHNHQPPI+FIK+EV GLFGSVFCDFGP+FTV DV GEDPHTGIIASI
Sbjct: 163  IDLDKAYEFDDYCHNHQPPISFIKSEVCGLFGSVFCDFGPKFTVLDVDGEDPHTGIIASI 222

Query: 854  SNDNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNF 913
            SNDNPAL+SCVDDERLEFQDGDLVVFSEVHGMTELNDGKPR++KN R +SF++EEDT+NF
Sbjct: 223  SNDNPALISCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRKVKNARPFSFSIEEDTSNF 282

Query: 914  GSYEKGGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTE 973
            G Y KGGIVTQVKEPK+L FK LR+A+ DPG+ LLSDFSKF+RPP+LHLAF ALDKF  +
Sbjct: 283  GIYVKGGIVTQVKEPKVLCFKALRDAMTDPGEVLLSDFSKFERPPVLHLAFQALDKFKKD 342

Query: 974  LGRLPVAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGG 1033
             GR P AG EEDA   + +A+ +NE+  D +++ I+ KL R FA G++AVLNPMAAMFGG
Sbjct: 343  HGRCPAAGCEEDAHSFLKIAAAINEASADRKLDTIDEKLFRQFASGSRAVLNPMAAMFGG 402

Query: 1034 IVAQEVLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKK 1093
            IV QEV+KACSGKFHPL QFFYFDS+ESLPT  LE  D +P N+RYDAQ+SVFGSKLQKK
Sbjct: 403  IVGQEVVKACSGKFHPLNQFFYFDSVESLPTYPLEPQDLKPSNNRYDAQVSVFGSKLQKK 462

Query: 1094 LENAKVFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIG 1153
            +E A  F+VGSGALGCEFLKNLALMGVSCS++GKLTITDDD+IEKSNLSRQFLFRDWNIG
Sbjct: 463  MEEANTFVVGSGALGCEFLKNLALMGVSCSSKGKLTITDDDIIEKSNLSRQFLFRDWNIG 522

Query: 1154 QAKSTVAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVD 1213
            QAKSTVAA+AA AIN  L+I+ALQNR  P+TENVF D+FWE L+VV+NALDNVNAR+Y+D
Sbjct: 523  QAKSTVAATAASAINPSLHIDALQNRACPDTENVFHDTFWEGLDVVINALDNVNARMYMD 582

Query: 1214 QRCLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCL 1273
             RCLYFQKPLLESGTLGAKCN QMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCL
Sbjct: 583  MRCLYFQKPLLESGTLGAKCNIQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCL 642

Query: 1274 TWARSEFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFE 1333
            TWARSEFEGLLEKTP +VN++LSNP++Y +AM  AGDAQ+R+ LER+ ECL+++RC TF+
Sbjct: 643  TWARSEFEGLLEKTPNEVNSFLSNPAQYAAAMRKAGDAQARELLERVSECLNKDRCSTFD 702

Query: 1334 DCITWARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHF 1393
            DCI+WARLKFEDYFSNRVKQL +TFPEDA TS GAPFWSAPKRFP  LQFS ADQS+L+F
Sbjct: 703  DCISWARLKFEDYFSNRVKQLTFTFPEDAATSMGAPFWSAPKRFPRALQFSAADQSHLNF 762

Query: 1394 VLAAAILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASV 1453
            +++A+ILRAES+ + IP+W K+ +KLAD V+++ VP F PK+   IVTDEKA++LS+ SV
Sbjct: 763  IMSASILRAESFGVAIPEWAKDTSKLADVVNKIAVPTFEPKQGVNIVTDEKASNLSSTSV 822

Query: 1454 DDAAIIHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEV 1513
            DD A+I DL+ KL++ ++ L  GF+MKPIQFEKDDD+NFHMDLI+GLANMRARNYSIPEV
Sbjct: 823  DDVAVIEDLLAKLQEYAKMLLPGFQMKPIQFEKDDDTNFHMDLISGLANMRARNYSIPEV 882

Query: 1514 DKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAE 1573
            DKLKAKFIAGRIIPAIATSTAMATGLVCLELYKV+ G H VEDYRNTFANLALPLFSMAE
Sbjct: 883  DKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVIAGEHPVEDYRNTFANLALPLFSMAE 942

Query: 1574 PVPPKVIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRH 1633
            PVPPKV+KH++ SWTVWDRW ++ N TL +L++W  +KGL AYSISCG+ LLYN+MF RH
Sbjct: 943  PVPPKVMKHKETSWTVWDRWSVQGNLTLAELLQWFADKGLTAYSISCGTSLLYNNMFARH 1002

Query: 1634 RDRMDKKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYFR 1685
            +DR+ KKVVD+AR+VAKV++P YRRHLD+ VACED+++ND+DIPLVSVYFR
Sbjct: 1003 KDRLTKKVVDIAREVAKVDVPEYRRHLDIGVACEDEDENDVDIPLVSVYFR 1053

BLAST of Csor.00g102970 vs. NCBI nr
Match: KAG6570640.1 (Ubiquitin-activating enzyme E1 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 3363 bits (8719), Expect = 0.0
Identity = 1684/1684 (100.00%), Positives = 1684/1684 (100.00%), Query Frame = 0

Query: 1    MATALECWSSRASTDEDLVEQVLMRTQDRSEGSKPESSLAVGEKESSAMQKRLQRFSRNV 60
            MATALECWSSRASTDEDLVEQVLMRTQDRSEGSKPESSLAVGEKESSAMQKRLQRFSRNV
Sbjct: 1    MATALECWSSRASTDEDLVEQVLMRTQDRSEGSKPESSLAVGEKESSAMQKRLQRFSRNV 60

Query: 61   SEAVASLKNSLNLDSVRDPSPTKTEGSKKAVWGSVVRNLTQLYPGSQLPEKLVSNIRKHY 120
            SEAVASLKNSLNLDSVRDPSPTKTEGSKKAVWGSVVRNLTQLYPGSQLPEKLVSNIRKHY
Sbjct: 61   SEAVASLKNSLNLDSVRDPSPTKTEGSKKAVWGSVVRNLTQLYPGSQLPEKLVSNIRKHY 120

Query: 121  DSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHDVQKPTIKLTFACNSSV 180
            DSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHDVQKPTIKLTFACNSSV
Sbjct: 121  DSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHDVQKPTIKLTFACNSSV 180

Query: 181  SWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKVENALKLAIKKPK 240
            SWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKVENALKLAIKKPK
Sbjct: 181  SWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKVENALKLAIKKPK 240

Query: 241  TNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENLQIEMPLSTSSFT 300
            TNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENLQIEMPLSTSSFT
Sbjct: 241  TNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENLQIEMPLSTSSFT 300

Query: 301  VTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAIEKIKGCEKGVSY 360
            VTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAIEKIKGCEKGVSY
Sbjct: 301  VTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAIEKIKGCEKGVSY 360

Query: 361  KFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKNKRLQMKEI 420
            KFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKNKRLQMKEI
Sbjct: 361  KFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKNKRLQMKEI 420

Query: 421  TRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTACRNLGEAMEYETD 480
            TRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTACRNLGEAMEYETD
Sbjct: 421  TRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTACRNLGEAMEYETD 480

Query: 481  GYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAVGIAACGLR 540
            GYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAVGIAACGLR
Sbjct: 481  GYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAVGIAACGLR 540

Query: 541  PDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFVYAPNRSLLHFMLPRKRAGEEGV 600
            PDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFVYAPNRSLLHFMLPRKRAGEEGV
Sbjct: 541  PDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFVYAPNRSLLHFMLPRKRAGEEGV 600

Query: 601  AVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQRIDSDSNANSKVAAVATGANNIVY 660
            AVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQRIDSDSNANSKVAAVATGANNIVY
Sbjct: 601  AVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQRIDSDSNANSKVAAVATGANNIVY 660

Query: 661  DGASLIMASANSNPPDIDEDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVIL 720
            DGASLIMASANSNPPDIDEDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVIL
Sbjct: 661  DGASLIMASANSNPPDIDEDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVIL 720

Query: 721  AGVKSVTLHDEGIVELWDLSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVI 780
            AGVKSVTLHDEGIVELWDLSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVI
Sbjct: 721  AGVKSVTLHDEGIVELWDLSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVI 780

Query: 781  EQLSKFEAVVFTDTGLDKAMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDV 840
            EQLSKFEAVVFTDTGLDKAMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDV
Sbjct: 781  EQLSKFEAVVFTDTGLDKAMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDV 840

Query: 841  YGEDPHTGIIASISNDNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCR 900
            YGEDPHTGIIASISNDNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCR
Sbjct: 841  YGEDPHTGIIASISNDNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCR 900

Query: 901  AYSFTLEEDTTNFGSYEKGGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLL 960
            AYSFTLEEDTTNFGSYEKGGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLL
Sbjct: 901  AYSFTLEEDTTNFGSYEKGGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLL 960

Query: 961  HLAFLALDKFVTELGRLPVAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGA 1020
            HLAFLALDKFVTELGRLPVAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGA
Sbjct: 961  HLAFLALDKFVTELGRLPVAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGA 1020

Query: 1021 KAVLNPMAAMFGGIVAQEVLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYD 1080
            KAVLNPMAAMFGGIVAQEVLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYD
Sbjct: 1021 KAVLNPMAAMFGGIVAQEVLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYD 1080

Query: 1081 AQISVFGSKLQKKLENAKVFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSN 1140
            AQISVFGSKLQKKLENAKVFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSN
Sbjct: 1081 AQISVFGSKLQKKLENAKVFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSN 1140

Query: 1141 LSRQFLFRDWNIGQAKSTVAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVV 1200
            LSRQFLFRDWNIGQAKSTVAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVV
Sbjct: 1141 LSRQFLFRDWNIGQAKSTVAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVV 1200

Query: 1201 NALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMC 1260
            NALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMC
Sbjct: 1201 NALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMC 1260

Query: 1261 TVHSFPHNIDHCLTWARSEFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERI 1320
            TVHSFPHNIDHCLTWARSEFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERI
Sbjct: 1261 TVHSFPHNIDHCLTWARSEFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERI 1320

Query: 1321 LECLDRERCETFEDCITWARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHP 1380
            LECLDRERCETFEDCITWARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHP
Sbjct: 1321 LECLDRERCETFEDCITWARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHP 1380

Query: 1381 LQFSTADQSYLHFVLAAAILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIV 1440
            LQFSTADQSYLHFVLAAAILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIV
Sbjct: 1381 LQFSTADQSYLHFVLAAAILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIV 1440

Query: 1441 TDEKATSLSTASVDDAAIIHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGL 1500
            TDEKATSLSTASVDDAAIIHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGL
Sbjct: 1441 TDEKATSLSTASVDDAAIIHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGL 1500

Query: 1501 ANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNT 1560
            ANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNT
Sbjct: 1501 ANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNT 1560

Query: 1561 FANLALPLFSMAEPVPPKVIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISC 1620
            FANLALPLFSMAEPVPPKVIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISC
Sbjct: 1561 FANLALPLFSMAEPVPPKVIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISC 1620

Query: 1621 GSCLLYNSMFPRHRDRMDKKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVS 1680
            GSCLLYNSMFPRHRDRMDKKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVS
Sbjct: 1621 GSCLLYNSMFPRHRDRMDKKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVS 1680

Query: 1681 VYFR 1684
            VYFR
Sbjct: 1681 VYFR 1684

BLAST of Csor.00g102970 vs. NCBI nr
Match: KAG6605397.1 (Ubiquitin-activating enzyme E1 1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 3002 bits (7783), Expect = 0.0
Identity = 1517/1725 (87.94%), Positives = 1588/1725 (92.06%), Query Frame = 0

Query: 1    MATALECWSSRASTDEDLVEQVLMRTQDRSEGSKPESSLAVGEKESSAMQKRLQRFSRNV 60
            MATALECWSSRASTDED+VEQVLMRTQDRSEGSK ESS  VG KESSAMQKRLQR SRNV
Sbjct: 551  MATALECWSSRASTDEDVVEQVLMRTQDRSEGSKAESSFGVGVKESSAMQKRLQRLSRNV 610

Query: 61   SEAVASLKNSLNLDSVRDPSPTKTEGSKKAVWGSVVRNLTQLYPGSQLPEKLVSNIRKHY 120
            SEAVAS+KNSLNLDSVRDPSP++TEGSKK VWGSVVRNLT LYPGSQLPEKLVS+IRKHY
Sbjct: 611  SEAVASIKNSLNLDSVRDPSPSRTEGSKKEVWGSVVRNLTLLYPGSQLPEKLVSSIRKHY 670

Query: 121  DSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHDVQKPTIKLTFACNSSV 180
            DSLP SYAQAGF+MKDVFLHIKLIEQASVY+HPAI FQEVT +DVQKPT+KLTFACNSS+
Sbjct: 671  DSLPFSYAQAGFDMKDVFLHIKLIEQASVYEHPAIFFQEVTYNDVQKPTMKLTFACNSSI 730

Query: 181  SWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKVENALKLAIKKPK 240
            SWSAMSGALE + IRCEKIQIFEKKKF+LGVILF NLDAQ++LFK KVENALKLAIKKPK
Sbjct: 731  SWSAMSGALEGSDIRCEKIQIFEKKKFTLGVILFANLDAQDELFKPKVENALKLAIKKPK 790

Query: 241  TNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENLQIEMPLSTSSFT 300
            TNTVKLPFGFCG Q+GNT GKDLREIEEDA++QNCRSGFE  N NENLQIEMPLSTSSF 
Sbjct: 791  TNTVKLPFGFCGGQDGNTRGKDLREIEEDAIEQNCRSGFERLNSNENLQIEMPLSTSSFA 850

Query: 301  VTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAIEKIKGCEKGVSY 360
            V+VDEWQTVQSGGHELGKWLLSSENLEF+DQIGPNSFKGVYKG+RV IEKIKGCEKGVSY
Sbjct: 851  VSVDEWQTVQSGGHELGKWLLSSENLEFSDQIGPNSFKGVYKGKRVCIEKIKGCEKGVSY 910

Query: 361  KFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKNKRLQMKEI 420
            KFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKNKRLQ KEI
Sbjct: 911  KFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKNKRLQTKEI 970

Query: 421  TRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTACRNLGEAMEYETD 480
            TRIA+DV EG+KF+NDHG+AYRDLNT RIL+DKNGNACLGDMGIL AC+NLGEAMEYETD
Sbjct: 971  TRIAIDVAEGMKFMNDHGIAYRDLNTQRILIDKNGNACLGDMGILIACKNLGEAMEYETD 1030

Query: 481  GYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAVGIAA---- 540
            GYRWLAPEIIAGDPESVN+TWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAV        
Sbjct: 1031 GYRWLAPEIIAGDPESVNKTWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAVDAGTIAHQ 1090

Query: 541  ----------CGLRPDVP----KDCPSTLKALMIRCWNNC----PSKRPQFSE------- 600
                      C   P +     K C +    +   C+ N        +P  S        
Sbjct: 1091 SAPSSLKFYQCCWTPTITSIDKKICSAFNVDMRSSCFENVNPITSHNKPLHSSSILSFHS 1150

Query: 601  ------IPSFIFV-YAPNRSLLHFMLPRKRAGEEGVAVEEKTDNSSSSSNNNNNNNSNNS 660
                  +P F+ + YAPNR LLHFMLPRKRAGEEG  +E++ DNSSS        N +NS
Sbjct: 1151 PSPDSFLPFFLSLCYAPNR-LLHFMLPRKRAGEEGAVLEQQIDNSSS--------NISNS 1210

Query: 661  VRNEGASLIKKQRIDS---DSNANSKV---AAVATGANNIVYDGASLIMASANSNPPDID 720
            V+N G SLIKK RID+   DSN NS     AAV TG NNIV DGASLIMAS N NP DID
Sbjct: 1211 VQNAGVSLIKKHRIDNCNVDSNVNSNTNVAAAVPTG-NNIVNDGASLIMASGNLNPQDID 1270

Query: 721  EDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVELWD 780
            EDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEG+VELWD
Sbjct: 1271 EDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGVVELWD 1330

Query: 781  LSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDK 840
            LSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTL T+LV EQLSKFE VVFTDTGLDK
Sbjct: 1331 LSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLPTKLVKEQLSKFEVVVFTDTGLDK 1390

Query: 841  AMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNP 900
            AMEFNDFCHNHQPPI+FIK+EVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNP
Sbjct: 1391 AMEFNDFCHNHQPPISFIKSEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNP 1450

Query: 901  ALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEK 960
            ALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTL+EDTTNFG YEK
Sbjct: 1451 ALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLDEDTTNFGIYEK 1510

Query: 961  GGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLP 1020
            GGIVTQVK+PK+LNFKPLREAINDPGDFLLSDFSKFDRPPL+HLAFLALDKFVTELGR P
Sbjct: 1511 GGIVTQVKQPKVLNFKPLREAINDPGDFLLSDFSKFDRPPLIHLAFLALDKFVTELGRFP 1570

Query: 1021 VAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQE 1080
            VAGSE+DAQKLISVASN+NESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQE
Sbjct: 1571 VAGSEDDAQKLISVASNMNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQE 1630

Query: 1081 VLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAK 1140
            VLKACSGKFHPLVQFFYFDS+ESLPTE+L+AS+FRPLNSRYDAQISVFGSKLQKKLENAK
Sbjct: 1631 VLKACSGKFHPLVQFFYFDSVESLPTEALDASEFRPLNSRYDAQISVFGSKLQKKLENAK 1690

Query: 1141 VFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKST 1200
            VFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQ KST
Sbjct: 1691 VFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQNKST 1750

Query: 1201 VAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLY 1260
            VAASAAVAIN+HLNIEALQNRVSPETENVF+DSFWENL+V+VNALDNVNARLYVDQRCLY
Sbjct: 1751 VAASAAVAINRHLNIEALQNRVSPETENVFNDSFWENLSVIVNALDNVNARLYVDQRCLY 1810

Query: 1261 FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS 1320
            FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS
Sbjct: 1811 FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS 1870

Query: 1321 EFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDCITW 1380
            EFEGLLEKTP+DVNAYLSNPSEY S+MMNAGDAQSRDTLER+LECLDRERCETFEDCITW
Sbjct: 1871 EFEGLLEKTPSDVNAYLSNPSEYASSMMNAGDAQSRDTLERVLECLDRERCETFEDCITW 1930

Query: 1381 ARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAA 1440
            ARLKFEDYF+NRVKQLIYTFPEDA TSNGAPFWSAPKRFPHPL FS ADQS+L FVLAAA
Sbjct: 1931 ARLKFEDYFANRVKQLIYTFPEDAATSNGAPFWSAPKRFPHPLPFSPADQSHLQFVLAAA 1990

Query: 1441 ILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAI 1500
            ILRAESYAI IPDWVKNP KL DAVDRVIVPDFMPKKDAKIVTDEKATSLS ASVDDAA+
Sbjct: 1991 ILRAESYAISIPDWVKNPRKLGDAVDRVIVPDFMPKKDAKIVTDEKATSLSAASVDDAAV 2050

Query: 1501 IHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKA 1560
            IHDLVNKLEDT R LPEGFRMKPIQFEKDDD+N+HMDLIAGLANMRARNYSIPEVDKLKA
Sbjct: 2051 IHDLVNKLEDTRRNLPEGFRMKPIQFEKDDDTNYHMDLIAGLANMRARNYSIPEVDKLKA 2110

Query: 1561 KFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPK 1620
            KFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPK
Sbjct: 2111 KFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPK 2170

Query: 1621 VIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMD 1680
            VIKHRDMSWTVWDRWIIKDNPTLR+LI+WLKNKGLNAYSISCGSCLLYNSMFPRH+DRMD
Sbjct: 2171 VIKHRDMSWTVWDRWIIKDNPTLRELIKWLKNKGLNAYSISCGSCLLYNSMFPRHKDRMD 2230

Query: 1681 KKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYF 1683
            KKVVDLARD+AKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYF
Sbjct: 2231 KKVVDLARDIAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYF 2265

BLAST of Csor.00g102970 vs. NCBI nr
Match: KAA8543715.1 (hypothetical protein F0562_021539 [Nyssa sinensis])

HSP 1 Score: 2716 bits (7039), Expect = 0.0
Identity = 1339/1726 (77.58%), Positives = 1500/1726 (86.91%), Query Frame = 0

Query: 1    MATALECWSSRASTDEDLVEQVLMRTQDRSEGSKPESSLAV------GEKESSAMQKRLQ 60
            MA ALECWSSRASTDED+VEQVLMRTQDRSEG    SS         G KESSAMQKRLQ
Sbjct: 1    MAAALECWSSRASTDEDMVEQVLMRTQDRSEGLPENSSSGAASLNGGGVKESSAMQKRLQ 60

Query: 61   RFSRNVSEAVASLKNSLNLDSVRD--PSPTKTEGSKKAVWGSVVRNLTQLYPGSQLPEKL 120
            R SRNVSEA+ASLKNSLNLDSVRD  P  ++ E  +K VWGSVVRNLTQLYPGSQLPEKL
Sbjct: 61   RLSRNVSEAIASLKNSLNLDSVRDSPPQQSRIESCRKLVWGSVVRNLTQLYPGSQLPEKL 120

Query: 121  VSNIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHDVQKPTIKL 180
            VSNIRKHYDSLPLSYAQAGF+MKDVFLHIKLIEQAS  DHPAIL QEV++++ Q    +L
Sbjct: 121  VSNIRKHYDSLPLSYAQAGFDMKDVFLHIKLIEQASAEDHPAILIQEVSDNESQGSLFRL 180

Query: 181  TFACNSSVSWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKVENAL 240
            TFACNSS+SW AMSGAL+SA I C+KIQIFEKK F+LGV+L +    QEKLFKS+ ENAL
Sbjct: 181  TFACNSSISWPAMSGALDSASICCKKIQIFEKKGFTLGVVLLLVQSGQEKLFKSRFENAL 240

Query: 241  KLAIKKPKTNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENLQIEM 300
            K A+KKPK   +KLPFG CGCQE N  G++L EIE D  +QNCRSG ENSN    +Q++M
Sbjct: 241  KSALKKPKPTAMKLPFGLCGCQEENPRGRELGEIEADCGEQNCRSGIENSNTK--VQLQM 300

Query: 301  PLSTSSFTVTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAIEKIK 360
            PLSTSSF V+VDEWQTVQSGG E+G+WLL+S+NLEF DQIGPN+FKGVYKG+RV IEK+K
Sbjct: 301  PLSTSSFVVSVDEWQTVQSGGDEIGRWLLNSDNLEFIDQIGPNTFKGVYKGKRVGIEKLK 360

Query: 361  GCEKGVSYKFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKN 420
            GC+KG SY+FELRKDLLE+MTCGHKNIL FYGVC+DENHGLC+VT+LMEGGSVH++MLKN
Sbjct: 361  GCDKGNSYEFELRKDLLEIMTCGHKNILQFYGVCVDENHGLCIVTRLMEGGSVHDVMLKN 420

Query: 421  KRLQMKEITRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTACRNLG 480
            K+ Q KEI RIA DV EGIKF+NDHG+ Y DLNTHRILLD++G+ACLGDMGI+TAC+++G
Sbjct: 421  KKFQTKEIIRIAADVAEGIKFMNDHGIVYIDLNTHRILLDRHGSACLGDMGIVTACKSVG 480

Query: 481  EAMEYETDGYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAV 540
            EAM+YETDGYRWLAPEIIAGDPESV ETWMSNVYSFGMVIWEMVTGEAAY A+SPVQAAV
Sbjct: 481  EAMDYETDGYRWLAPEIIAGDPESVTETWMSNVYSFGMVIWEMVTGEAAYSAFSPVQAAV 540

Query: 541  GIAACGLRPDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFV-------------Y 600
            GIAACGLRPD+PKDCP  L++LM++CWNNCPSKRPQFSEI S +                
Sbjct: 541  GIAACGLRPDIPKDCPQILRSLMMKCWNNCPSKRPQFSEILSILLHPGNNNNRFHSFPHQ 600

Query: 601  APNRSLLHFMLPRKRAGEEGVAVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQRID 660
            + NRSLLH+MLPRKR  E  V   + +D  S                     L KK RI 
Sbjct: 601  SSNRSLLHYMLPRKRPVEGEVVEGDSSDRES---------------------LHKKHRIG 660

Query: 661  SDSNANSKVAAVATGANNIVYDG------ASLI---------------MASANSNPPDID 720
               ++++      T  NN    G      +S+I               M+  + NPPDID
Sbjct: 661  CLISSSTNATGTTTTGNNDKKSGEVNSSSSSVIGISDSNHTSGSSLPNMSLDDGNPPDID 720

Query: 721  EDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVELWD 780
            EDLHSRQLAVYGRETMR+LFASN+L+SGMQGLGAEIAKN++LAGVKSVTL+DEG VELWD
Sbjct: 721  EDLHSRQLAVYGRETMRRLFASNILVSGMQGLGAEIAKNLVLAGVKSVTLYDEGTVELWD 780

Query: 781  LSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDK 840
            LSSNF+FSE+D+GKNRAL S QKLQ+LNN+V+V TLTT+L  EQLS F+AVVFTD  L+K
Sbjct: 781  LSSNFIFSENDVGKNRALCSVQKLQELNNAVVVSTLTTKLTKEQLSDFQAVVFTDISLEK 840

Query: 841  AMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNP 900
            A+EFND+CHNHQPPIAFIK EVRGLFG+VFCDFGPEFTV+DV GE+PHTGIIASISNDNP
Sbjct: 841  AIEFNDYCHNHQPPIAFIKAEVRGLFGNVFCDFGPEFTVFDVDGEEPHTGIIASISNDNP 900

Query: 901  ALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEK 960
            ALVSCVDDERLEFQD DLV FSEV GMTELNDGKPR+IKN R YSFTLEEDTTNFG YE+
Sbjct: 901  ALVSCVDDERLEFQDEDLVAFSEVRGMTELNDGKPRKIKNARPYSFTLEEDTTNFGMYER 960

Query: 961  GGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLP 1020
            GGIVTQ+K+PK+LNFKPLREA+N+PGDFLLSDFSKFDRPPLLHLAF ALD F++E+G  P
Sbjct: 961  GGIVTQMKQPKVLNFKPLREALNNPGDFLLSDFSKFDRPPLLHLAFQALDTFISEMGCFP 1020

Query: 1021 VAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQE 1080
            +AGSEEDAQKLIS+AS +NE+LGDG++ D+NP LLRHFAFGA+AVLNPMAAMFGGIV QE
Sbjct: 1021 IAGSEEDAQKLISIASTINENLGDGKLGDVNPNLLRHFAFGARAVLNPMAAMFGGIVGQE 1080

Query: 1081 VLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAK 1140
            V+KACSGKFHPL QFFYFDS+ESLPTE LEASDFRPLNSRYDAQISVFGSKLQKKLE+A+
Sbjct: 1081 VMKACSGKFHPLFQFFYFDSVESLPTEPLEASDFRPLNSRYDAQISVFGSKLQKKLEDAQ 1140

Query: 1141 VFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKST 1200
            +F+VGSGALGCEFLKNLALMGVSC+ +GKLTITDDDVIEKSNLSRQFLFRDWNIGQAKST
Sbjct: 1141 LFVVGSGALGCEFLKNLALMGVSCNGQGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKST 1200

Query: 1201 VAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLY 1260
            VAASAA AIN HL IEALQNRV PETENVF+D+FWENL+VVVNALDNVNARLYVDQRCLY
Sbjct: 1201 VAASAAAAINPHLRIEALQNRVGPETENVFNDTFWENLSVVVNALDNVNARLYVDQRCLY 1260

Query: 1261 FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS 1320
            FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS
Sbjct: 1261 FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS 1320

Query: 1321 EFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDCITW 1380
            EFEGLLEKTP +VNAYLSNPSEYTSAM NAGDAQ+RD LER++ECLD+ERCETF+DCITW
Sbjct: 1321 EFEGLLEKTPAEVNAYLSNPSEYTSAMKNAGDAQARDNLERVIECLDKERCETFQDCITW 1380

Query: 1381 ARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAA 1440
            ARLKFEDY+ NR+KQLI+TFPEDA TS GAPFWSAPKRFP PLQFS+AD+S L F+LAA+
Sbjct: 1381 ARLKFEDYYVNRMKQLIFTFPEDAATSTGAPFWSAPKRFPRPLQFSSADRSLLQFILAAS 1440

Query: 1441 ILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAI 1500
            ILRAE++ IPIPDW K+P K A+AV++V VP+F P++  KIVTDEKATSLSTAS+DDAA+
Sbjct: 1441 ILRAETFGIPIPDWAKDPRKFAEAVEKVRVPEFQPREGVKIVTDEKATSLSTASIDDAAV 1500

Query: 1501 IHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKA 1560
            I++L+ KLE     LP GFRMKPIQFEKDDD+N+HMDLIAGLANMRARNYSIPEVDKLKA
Sbjct: 1501 INELIMKLEQCRMNLPSGFRMKPIQFEKDDDTNYHMDLIAGLANMRARNYSIPEVDKLKA 1560

Query: 1561 KFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPK 1620
            KFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPK
Sbjct: 1561 KFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPK 1620

Query: 1621 VIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMD 1680
            VIKH+DMSWTVWDRWIIKDNPTLR+L++WL +KGL+AYSISCGSCLLYNSMFPRHR+RMD
Sbjct: 1621 VIKHQDMSWTVWDRWIIKDNPTLRELLQWLADKGLSAYSISCGSCLLYNSMFPRHRERMD 1680

Query: 1681 KKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYFR 1684
            KKVVDLAR+VAKVELPPYRRHLDVVVACED+EDNDIDIP +S+YFR
Sbjct: 1681 KKVVDLAREVAKVELPPYRRHLDVVVACEDEEDNDIDIPQISIYFR 1703

BLAST of Csor.00g102970 vs. NCBI nr
Match: KCW52517.1 (hypothetical protein EUGRSUZ_J01907 [Eucalyptus grandis])

HSP 1 Score: 2616 bits (6780), Expect = 0.0
Identity = 1287/1703 (75.57%), Positives = 1472/1703 (86.44%), Query Frame = 0

Query: 1    MATALECWSSRASTDEDLVEQVLMRTQDRSEGS-------------KPESSLAVGEKESS 60
            MA ALECWSSRASTDED+VEQVLMRT DRSEGS             +  SS +     SS
Sbjct: 1    MAAALECWSSRASTDEDMVEQVLMRTHDRSEGSPAGAAGAASSGGAREPSSSSSSSSSSS 60

Query: 61   AMQKRLQRFSRNVSEAVASLKNSLNLDSVRDPSP--TKTEGSKKAVWGSVVRNLTQLYPG 120
             MQK+LQR SRNVSEA+ASLKNSLNLDS RD +P  +K +G ++ VWGSVVR+LTQLYPG
Sbjct: 61   VMQKKLQRLSRNVSEAIASLKNSLNLDSSRDGAPQASKIDGCRRMVWGSVVRSLTQLYPG 120

Query: 121  SQLPEKLVSNIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHDV 180
            SQLPEKLVSNIRKHYDSLPLSYAQAGF+MKDVFLHIKL+EQAS  D PAIL QEV+  +V
Sbjct: 121  SQLPEKLVSNIRKHYDSLPLSYAQAGFDMKDVFLHIKLMEQASGDDRPAILIQEVSQDEV 180

Query: 181  QKPTIKLTFACNSSVSWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFK 240
                 KLTFACNSS+SWS MSGAL++A I C+KIQIFEKK F+LG++L +    QE++FK
Sbjct: 181  HGSVFKLTFACNSSISWSVMSGALDNASICCKKIQIFEKKGFTLGIVLLLVQAEQERMFK 240

Query: 241  SKVENALKLAIKKPKTNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLN 300
            +++ENALKLA+KK +  TVKL FG CGCQE     +++ + E+D  + N R+G EN  L 
Sbjct: 241  TRIENALKLAMKKHRPATVKLAFGLCGCQEETANSREVGQAEDDVGELNYRNGSEN--LY 300

Query: 301  ENLQIEMPLSTSSFTVTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRR 360
              +Q++MPL TSSF ++VDEWQT+QSGG E+ KWLL+S+NLEF DQIGP+SFKGVYKGRR
Sbjct: 301  PKVQLQMPLPTSSFVISVDEWQTIQSGGDEIAKWLLNSDNLEFIDQIGPSSFKGVYKGRR 360

Query: 361  VAIEKIKGCEKGVSYKFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSV 420
            V IEK+KGC+KG SY+FELRKD LELMTCGHKN+L F GVCI+E+HGLCVVTKLMEGGS+
Sbjct: 361  VGIEKLKGCDKGNSYEFELRKDFLELMTCGHKNVLQFIGVCIEESHGLCVVTKLMEGGSL 420

Query: 421  HELMLKNKRLQMKEITRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGIL 480
            H+LMLK+K+LQ++EI RIA+DVVEGIKF+N+HG+ YRDLNT RILLD++GNACLGDMGI+
Sbjct: 421  HDLMLKSKKLQIREIVRIAIDVVEGIKFMNEHGITYRDLNTQRILLDRHGNACLGDMGIV 480

Query: 481  TACRNLGEAMEYETDGYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAY 540
             AC+++GEAMEYETDGYRWLAPEIIAGDPESV+ET MSNVYSFGMV+WEMVTGEAAY AY
Sbjct: 481  AACKSVGEAMEYETDGYRWLAPEIIAGDPESVSETCMSNVYSFGMVLWEMVTGEAAYAAY 540

Query: 541  SPVQAAVGIAACGLRPDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFVYAPNRSL 600
            SPVQAAVGIAACGLRPD+PKDCP  L+ LM +CWNN PSKRPQFSEI S +  Y  + + 
Sbjct: 541  SPVQAAVGIAACGLRPDIPKDCPQFLRNLMTKCWNNSPSKRPQFSEIVSLLLHYINSGND 600

Query: 601  LHFMLPRKRAGEEGVAVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQRIDS----D 660
                  RKRAGEEG  VE          +      S+ +  + G S +KK R+      +
Sbjct: 601  -----NRKRAGEEGEVVEGGESGGEGVGS------SSGAASSAGVSRLKKNRVGCFGPPE 660

Query: 661  SNANSKVAAVATGANNIVYDGASLIMASANSNPPDIDEDLHSRQLAVYGRETMRKLFASN 720
              A     + A   N     GA  IMA   S P DIDEDLHSRQLAVYGRETMR+LFASN
Sbjct: 661  LTATGNGKSNADSGNGSSGSGAP-IMALGGSMPTDIDEDLHSRQLAVYGRETMRRLFASN 720

Query: 721  VLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVELWDLSSNFVFSESDIGKNRALASAQK 780
            VL+SGMQGLG EIAKN++LAGVKSVTLHDEG+V+LWDLS NF+FSE D+GKNRALAS +K
Sbjct: 721  VLVSGMQGLGVEIAKNLVLAGVKSVTLHDEGVVQLWDLSGNFLFSERDVGKNRALASVEK 780

Query: 781  LQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDKAMEFNDFCHNHQPPIAFIKTEVR 840
            LQ+LNN+V+V TLTT+L  E+LS F+AVVFTD  L KA+EF+D+CH HQPPI+FIKTEVR
Sbjct: 781  LQELNNAVVVTTLTTKLTKERLSDFQAVVFTDIDLQKAIEFDDYCHTHQPPISFIKTEVR 840

Query: 841  GLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNPALVSCVDDERLEFQDGDLVVFSE 900
            GLFGSVFCDFGPEFTV+DV GE+PHTGIIASI NDNPALVSCVDDERLEFQDGDLVVFSE
Sbjct: 841  GLFGSVFCDFGPEFTVFDVDGEEPHTGIIASIGNDNPALVSCVDDERLEFQDGDLVVFSE 900

Query: 901  VHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEKGGIVTQVKEPKMLNFKPLREAIN 960
            VHGMTELNDGKPR+IK+ R YSF LEEDTTN+G+YEKGGIVTQVK PK+L F PL+EAI 
Sbjct: 901  VHGMTELNDGKPRKIKSARPYSFILEEDTTNYGAYEKGGIVTQVKLPKVLKFNPLKEAIK 960

Query: 961  DPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLPVAGSEEDAQKLISVASNVNESLG 1020
            DPGDFLLSDFSKFDRPPLLHLAF ALDKFV+E GR PVAGSE DAQ+LISVA+++NESLG
Sbjct: 961  DPGDFLLSDFSKFDRPPLLHLAFQALDKFVSEFGRYPVAGSEVDAQRLISVANSINESLG 1020

Query: 1021 DGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQEVLKACSGKFHPLVQFFYFDSLES 1080
            DG++EDINPKLL+HFAFG++AVLNPMAAMFGGIV QEV+KACSGKFHPL QFFYFDS+ES
Sbjct: 1021 DGKLEDINPKLLQHFAFGSRAVLNPMAAMFGGIVGQEVVKACSGKFHPLFQFFYFDSVES 1080

Query: 1081 LPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAKVFMVGSGALGCEFLKNLALMGVS 1140
            LPTE L+  D +P NSRYDAQ+SVFGSKLQKK+E+AKVF+VGSGALGCEFLKN+ALMGVS
Sbjct: 1081 LPTEPLDLDDLKPRNSRYDAQVSVFGSKLQKKMEDAKVFLVGSGALGCEFLKNIALMGVS 1140

Query: 1141 CSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAAVAINKHLNIEALQNRVS 1200
            C   GKLT+TDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAA +IN  LN+EALQNRV 
Sbjct: 1141 CGKHGKLTVTDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAATSINPRLNVEALQNRVG 1200

Query: 1201 PETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQMVIPH 1260
            PETENVFDD+FWENL+VV+NALDNVNARLYVDQ+CLYFQKPLLESGTLGAKCNTQMVIPH
Sbjct: 1201 PETENVFDDTFWENLSVVINALDNVNARLYVDQKCLYFQKPLLESGTLGAKCNTQMVIPH 1260

Query: 1261 LTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTPTDVNAYLSNPSEY 1320
            LTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTP +VNAYLSNP EY
Sbjct: 1261 LTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTPAEVNAYLSNPVEY 1320

Query: 1321 TSAMMNAGDAQSRDTLERILECLDRERCETFEDCITWARLKFEDYFSNRVKQLIYTFPED 1380
            T AM+N+GDAQ++DTLE +LECLD+ERCETFEDCI+WARLKFEDYF+NRVKQL YTFPED
Sbjct: 1321 TKAMINSGDAQAKDTLEHVLECLDKERCETFEDCISWARLKFEDYFTNRVKQLTYTFPED 1380

Query: 1381 AVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAAILRAESYAIPIPDWVKNPTKLAD 1440
            A+TS GAPFWSAPKRFP  LQFS +D  +LHFV+AA+ILRAE++ IP+PDW KNP K+A 
Sbjct: 1381 ALTSTGAPFWSAPKRFPCALQFSVSDPGHLHFVMAASILRAETFGIPVPDWAKNPKKMAQ 1440

Query: 1441 AVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAIIHDLVNKLEDTSRKLPEGFRMKP 1500
            AVD+VIVP F PK++A IVTDEKATSLSTAS+DDAA+I+DL+ +LE    KLP GFRMKP
Sbjct: 1441 AVDKVIVPGFQPKENANIVTDEKATSLSTASLDDAAVINDLITRLEHCRLKLPPGFRMKP 1500

Query: 1501 IQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMATGLVC 1560
            IQFEKDDD+N+HMDLIA LANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMATGLVC
Sbjct: 1501 IQFEKDDDTNYHMDLIAALANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMATGLVC 1560

Query: 1561 LELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPKVIKHRDMSWTVWDRWIIKDNPTL 1620
            LELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPP+ +KHRD++WTVWDRWIIK+NPTL
Sbjct: 1561 LELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPQAVKHRDLTWTVWDRWIIKNNPTL 1620

Query: 1621 RQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMDKKVVDLARDVAKVELPPYRRHLD 1680
            R+L++WL++KGLNAYSISCGSCLLYNSMFPRH++RMD+KVVDLA++VAK+E+P YRRHLD
Sbjct: 1621 RELMQWLQDKGLNAYSISCGSCLLYNSMFPRHQERMDRKVVDLAKEVAKLEVPSYRRHLD 1680

Query: 1681 VVVACEDDEDNDIDIPLVSVYFR 1684
            VVVACEDDE NDIDIP +S+YFR
Sbjct: 1681 VVVACEDDEGNDIDIPQISIYFR 1689

BLAST of Csor.00g102970 vs. NCBI nr
Match: OIW16493.1 (hypothetical protein TanjilG_32163 [Lupinus angustifolius])

HSP 1 Score: 2596 bits (6729), Expect = 0.0
Identity = 1284/1707 (75.22%), Positives = 1472/1707 (86.23%), Query Frame = 0

Query: 1    MATALECWSSRAST------DEDLVEQVLMRTQDRSEGSKPESSLAVGEKESSAMQKRLQ 60
            MA+ LECWSSR +T      D+D VEQVLMR+  RSE +    S +   K+SS +QK+L+
Sbjct: 1    MASPLECWSSRTTTTTTTTTDDDTVEQVLMRSHHRSEATTT-PSFSSSTKDSSIVQKKLR 60

Query: 61   RFSRNVSEAVASLKNSLNLDSVRDPSPTKTEGSKKAVWGSVVRNLTQLYPGSQLPEKLVS 120
            +F+RNVSEA+ S KNSLNLDS RDP+ +K E S+K  WG+VV+NLTQLYPGSQLPEKL+ 
Sbjct: 61   KFARNVSEAINSFKNSLNLDSTRDPTSSKIEASRKITWGTVVKNLTQLYPGSQLPEKLMC 120

Query: 121  NIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHD--VQKPTIKL 180
            NIRKHYDSLPLSY QA F+MKDVFLHIKLIEQAS  D PAILFQE T++D   +   +KL
Sbjct: 121  NIRKHYDSLPLSYGQAEFDMKDVFLHIKLIEQASETDQPAILFQEETDNDGEFEGSFLKL 180

Query: 181  TFACNSSVSWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKVENAL 240
            TFACNS +SW AMS AL+S+ I C+K+QIFEKK F+LGV + V    Q+K  + +VENA+
Sbjct: 181  TFACNSPISWPAMSSALDSSSINCKKVQIFEKKSFTLGVAILVYQSGQDKFVRMRVENAI 240

Query: 241  KLAIKKPKTNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENLQIEM 300
            K A+KKP+++ VKLPFG CGCQE N  GK+L E EED  D      FENS   +N+Q++M
Sbjct: 241  KFAMKKPRSSAVKLPFGLCGCQEENFRGKELGESEEDGGDACFGKEFENSC--QNIQLQM 300

Query: 301  PLSTSSFTVTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAIEKIK 360
            PL +SSF V+VDEWQT+ S   E+ KWLLSS+++EFTDQ+ PNS+KG+Y G+RV +EK+K
Sbjct: 301  PLPSSSFIVSVDEWQTIHSCVDEIEKWLLSSDSVEFTDQVEPNSYKGLYIGKRVGVEKLK 360

Query: 361  GCEKGVSYKFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKN 420
            GC+KG SY+FELRKDLLELMTCGH+NIL F GVC+ +NHGLCVVTK MEGGSVH+LM KN
Sbjct: 361  GCDKGNSYEFELRKDLLELMTCGHRNILQFCGVCVHDNHGLCVVTKYMEGGSVHDLMSKN 420

Query: 421  KRLQMKEITRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTACRNLG 480
            K+LQ K+I RIAVDV EG+KF+NDHGVAYRDLNT RILLDK+GNACLGDMGI+TACR++G
Sbjct: 421  KKLQAKDIVRIAVDVAEGMKFMNDHGVAYRDLNTQRILLDKHGNACLGDMGIVTACRSVG 480

Query: 481  EAMEYETDGYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAV 540
            EAMEYETDGYRWLAPEIIAGDPESV ETWMSNVYS+GMVIWEMVT E AY A+SPVQAAV
Sbjct: 481  EAMEYETDGYRWLAPEIIAGDPESVTETWMSNVYSYGMVIWEMVTSEVAYSAFSPVQAAV 540

Query: 541  GIAACGLRPDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFVYAPNRSLLHFMLPR 600
            GIAACGLRP++PKDCP TLK LM +CWNN PSKRPQFS+I + +     N  LLH MLPR
Sbjct: 541  GIAACGLRPEIPKDCPQTLKYLMTKCWNNSPSKRPQFSDILAILLRPNNNNRLLHCMLPR 600

Query: 601  KRA-GEEGVAVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQRID-----SDSNANS 660
            KRA GE GV VE  TD              NN+V +  AS  KK R       S S A++
Sbjct: 601  KRASGEGGVVVEGDTDTI------------NNTVASVSASFSKKNRTGCFAECSGSGADT 660

Query: 661  KVAAVAT-------GANNIVYDGASL--IMASANSNPPDIDEDLHSRQLAVYGRETMRKL 720
              +AV         G +N   D  S+  ++    +N  DIDEDLHSRQLAVYG ETMR+L
Sbjct: 661  VGSAVNDKGNGSIGGVSNKYNDSDSIGKLIGGGAANMVDIDEDLHSRQLAVYGLETMRRL 720

Query: 721  FASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVELWDLSSNFVFSESDIGKNRALA 780
            FASN+LISGMQGLG EIAKN+ILAGVKSVTLHDEG VELWDLSSNFVFS++D+GKNRA+A
Sbjct: 721  FASNILISGMQGLGVEIAKNLILAGVKSVTLHDEGTVELWDLSSNFVFSQNDVGKNRAVA 780

Query: 781  SAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDKAMEFNDFCHNHQPPIAFIK 840
            S  KLQ+LNN+V+V +LTT+L  EQLS F+AVVFT+  L+KA+EF+D+CH+HQP IAFIK
Sbjct: 781  SVSKLQELNNAVLVQSLTTKLTKEQLSNFQAVVFTEISLEKAIEFDDYCHSHQPSIAFIK 840

Query: 841  TEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNPALVSCVDDERLEFQDGDLV 900
             EVRGLFGSVFCDFGPEFTV+DV GE+PHTGIIASISNDNP+LVSCVDDERLEFQDGDLV
Sbjct: 841  AEVRGLFGSVFCDFGPEFTVFDVDGEEPHTGIIASISNDNPSLVSCVDDERLEFQDGDLV 900

Query: 901  VFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEKGGIVTQVKEPKMLNFKPLR 960
            +FSEVHGM ELNDGKPR+IKN RAYSFTLEEDTTN+G++EKGGIVTQVK+PK+LNFKPL+
Sbjct: 901  IFSEVHGMKELNDGKPRKIKNARAYSFTLEEDTTNYGAHEKGGIVTQVKQPKVLNFKPLK 960

Query: 961  EAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLPVAGSEEDAQKLISVASNVN 1020
            +A+NDP DFLLSDFSKFDRPPLLHLAF ALD F++ELGR PVAGSE+DAQK+IS+ASN+N
Sbjct: 961  QALNDPSDFLLSDFSKFDRPPLLHLAFQALDTFISELGRFPVAGSEDDAQKVISIASNIN 1020

Query: 1021 ESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQEVLKACSGKFHPLVQFFYFD 1080
            E+LGDGR+ED+NPKLLR F FGA+AVLNPMAA+FGGIV QEV+KACSGKFHPL Q+FYFD
Sbjct: 1021 ENLGDGRLEDMNPKLLRQFTFGARAVLNPMAAIFGGIVGQEVVKACSGKFHPLFQYFYFD 1080

Query: 1081 SLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAKVFMVGSGALGCEFLKNLAL 1140
            S+ESLPTE L+A+DFRP+NSRYDAQISVFG KLQK LE+A+VF+VGSGALGCEFLKNLAL
Sbjct: 1081 SVESLPTEPLDANDFRPINSRYDAQISVFGQKLQKILEDAQVFVVGSGALGCEFLKNLAL 1140

Query: 1141 MGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAAVAINKHLNIEALQ 1200
            MGVSC ++GKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAA +IN  LNIEALQ
Sbjct: 1141 MGVSCGSQGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAAASINPRLNIEALQ 1200

Query: 1201 NRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQM 1260
            NRV PETENVF D+FWENL+VV+NALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQM
Sbjct: 1201 NRVGPETENVFHDTFWENLSVVINALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQM 1260

Query: 1261 VIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTPTDVNAYLSN 1320
            VIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTP +VNAYLSN
Sbjct: 1261 VIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTPAEVNAYLSN 1320

Query: 1321 PSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDCITWARLKFEDYFSNRVKQLIYT 1380
            PSEYT+AM+ AGDAQ+RD LER+LECLD+E+CETF+DCITWARLKFEDYF+NRVKQL YT
Sbjct: 1321 PSEYTNAMIKAGDAQARDNLERVLECLDKEKCETFQDCITWARLKFEDYFANRVKQLTYT 1380

Query: 1381 FPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAAILRAESYAIPIPDWVKNPT 1440
            FPEDA TS GAPFWSAPKRFP PLQFS++D+ +L FVLAA+ILRAE++ I IP+WVK+P 
Sbjct: 1381 FPEDAATSTGAPFWSAPKRFPRPLQFSSSDEGHLQFVLAASILRAETFGISIPEWVKSPN 1440

Query: 1441 KLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAIIHDLVNKLEDTSRKLPEGF 1500
            KLA+AVD+VIVP+F P+KDAKIVTDE AT+LSTASVDDAA+I+DL+ KLE     LP GF
Sbjct: 1441 KLAEAVDKVIVPNFQPRKDAKIVTDETATNLSTASVDDAAVINDLIIKLERCWATLPPGF 1500

Query: 1501 RMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMAT 1560
            RMKPI FEKDDD+N+HMD+IAGLANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMAT
Sbjct: 1501 RMKPILFEKDDDTNYHMDMIAGLANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMAT 1560

Query: 1561 GLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPKVIKHRDMSWTVWDRWIIKD 1620
            G VCLELYKVL GGHK+EDYRNTFANLALPLFS+AEPVPPKVIKH+DMSWTVWDRW++KD
Sbjct: 1561 GFVCLELYKVLAGGHKLEDYRNTFANLALPLFSIAEPVPPKVIKHQDMSWTVWDRWVVKD 1620

Query: 1621 NPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMDKKVVDLARDVAKVELPPYR 1680
            N TLR+L+EWLK KGLNAYSISCGSCLLYNSMFPRHRDRMDKKVVDLAR+VAK+E+P YR
Sbjct: 1621 NLTLRELLEWLKAKGLNAYSISCGSCLLYNSMFPRHRDRMDKKVVDLAREVAKMEIPTYR 1680

Query: 1681 RHLDVVVACEDDEDNDIDIPLVSVYFR 1684
            RH D+VVACEDDEDNDIDIP VS+YFR
Sbjct: 1681 RHFDIVVACEDDEDNDIDIPQVSIYFR 1692

BLAST of Csor.00g102970 vs. ExPASy TrEMBL
Match: A0A5J5BKP9 (E1 ubiquitin-activating enzyme OS=Nyssa sinensis OX=561372 GN=F0562_021539 PE=3 SV=1)

HSP 1 Score: 2716 bits (7039), Expect = 0.0
Identity = 1339/1726 (77.58%), Positives = 1500/1726 (86.91%), Query Frame = 0

Query: 1    MATALECWSSRASTDEDLVEQVLMRTQDRSEGSKPESSLAV------GEKESSAMQKRLQ 60
            MA ALECWSSRASTDED+VEQVLMRTQDRSEG    SS         G KESSAMQKRLQ
Sbjct: 1    MAAALECWSSRASTDEDMVEQVLMRTQDRSEGLPENSSSGAASLNGGGVKESSAMQKRLQ 60

Query: 61   RFSRNVSEAVASLKNSLNLDSVRD--PSPTKTEGSKKAVWGSVVRNLTQLYPGSQLPEKL 120
            R SRNVSEA+ASLKNSLNLDSVRD  P  ++ E  +K VWGSVVRNLTQLYPGSQLPEKL
Sbjct: 61   RLSRNVSEAIASLKNSLNLDSVRDSPPQQSRIESCRKLVWGSVVRNLTQLYPGSQLPEKL 120

Query: 121  VSNIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHDVQKPTIKL 180
            VSNIRKHYDSLPLSYAQAGF+MKDVFLHIKLIEQAS  DHPAIL QEV++++ Q    +L
Sbjct: 121  VSNIRKHYDSLPLSYAQAGFDMKDVFLHIKLIEQASAEDHPAILIQEVSDNESQGSLFRL 180

Query: 181  TFACNSSVSWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKVENAL 240
            TFACNSS+SW AMSGAL+SA I C+KIQIFEKK F+LGV+L +    QEKLFKS+ ENAL
Sbjct: 181  TFACNSSISWPAMSGALDSASICCKKIQIFEKKGFTLGVVLLLVQSGQEKLFKSRFENAL 240

Query: 241  KLAIKKPKTNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENLQIEM 300
            K A+KKPK   +KLPFG CGCQE N  G++L EIE D  +QNCRSG ENSN    +Q++M
Sbjct: 241  KSALKKPKPTAMKLPFGLCGCQEENPRGRELGEIEADCGEQNCRSGIENSNTK--VQLQM 300

Query: 301  PLSTSSFTVTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAIEKIK 360
            PLSTSSF V+VDEWQTVQSGG E+G+WLL+S+NLEF DQIGPN+FKGVYKG+RV IEK+K
Sbjct: 301  PLSTSSFVVSVDEWQTVQSGGDEIGRWLLNSDNLEFIDQIGPNTFKGVYKGKRVGIEKLK 360

Query: 361  GCEKGVSYKFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKN 420
            GC+KG SY+FELRKDLLE+MTCGHKNIL FYGVC+DENHGLC+VT+LMEGGSVH++MLKN
Sbjct: 361  GCDKGNSYEFELRKDLLEIMTCGHKNILQFYGVCVDENHGLCIVTRLMEGGSVHDVMLKN 420

Query: 421  KRLQMKEITRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTACRNLG 480
            K+ Q KEI RIA DV EGIKF+NDHG+ Y DLNTHRILLD++G+ACLGDMGI+TAC+++G
Sbjct: 421  KKFQTKEIIRIAADVAEGIKFMNDHGIVYIDLNTHRILLDRHGSACLGDMGIVTACKSVG 480

Query: 481  EAMEYETDGYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAV 540
            EAM+YETDGYRWLAPEIIAGDPESV ETWMSNVYSFGMVIWEMVTGEAAY A+SPVQAAV
Sbjct: 481  EAMDYETDGYRWLAPEIIAGDPESVTETWMSNVYSFGMVIWEMVTGEAAYSAFSPVQAAV 540

Query: 541  GIAACGLRPDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFV-------------Y 600
            GIAACGLRPD+PKDCP  L++LM++CWNNCPSKRPQFSEI S +                
Sbjct: 541  GIAACGLRPDIPKDCPQILRSLMMKCWNNCPSKRPQFSEILSILLHPGNNNNRFHSFPHQ 600

Query: 601  APNRSLLHFMLPRKRAGEEGVAVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQRID 660
            + NRSLLH+MLPRKR  E  V   + +D  S                     L KK RI 
Sbjct: 601  SSNRSLLHYMLPRKRPVEGEVVEGDSSDRES---------------------LHKKHRIG 660

Query: 661  SDSNANSKVAAVATGANNIVYDG------ASLI---------------MASANSNPPDID 720
               ++++      T  NN    G      +S+I               M+  + NPPDID
Sbjct: 661  CLISSSTNATGTTTTGNNDKKSGEVNSSSSSVIGISDSNHTSGSSLPNMSLDDGNPPDID 720

Query: 721  EDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVELWD 780
            EDLHSRQLAVYGRETMR+LFASN+L+SGMQGLGAEIAKN++LAGVKSVTL+DEG VELWD
Sbjct: 721  EDLHSRQLAVYGRETMRRLFASNILVSGMQGLGAEIAKNLVLAGVKSVTLYDEGTVELWD 780

Query: 781  LSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDK 840
            LSSNF+FSE+D+GKNRAL S QKLQ+LNN+V+V TLTT+L  EQLS F+AVVFTD  L+K
Sbjct: 781  LSSNFIFSENDVGKNRALCSVQKLQELNNAVVVSTLTTKLTKEQLSDFQAVVFTDISLEK 840

Query: 841  AMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNP 900
            A+EFND+CHNHQPPIAFIK EVRGLFG+VFCDFGPEFTV+DV GE+PHTGIIASISNDNP
Sbjct: 841  AIEFNDYCHNHQPPIAFIKAEVRGLFGNVFCDFGPEFTVFDVDGEEPHTGIIASISNDNP 900

Query: 901  ALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEK 960
            ALVSCVDDERLEFQD DLV FSEV GMTELNDGKPR+IKN R YSFTLEEDTTNFG YE+
Sbjct: 901  ALVSCVDDERLEFQDEDLVAFSEVRGMTELNDGKPRKIKNARPYSFTLEEDTTNFGMYER 960

Query: 961  GGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLP 1020
            GGIVTQ+K+PK+LNFKPLREA+N+PGDFLLSDFSKFDRPPLLHLAF ALD F++E+G  P
Sbjct: 961  GGIVTQMKQPKVLNFKPLREALNNPGDFLLSDFSKFDRPPLLHLAFQALDTFISEMGCFP 1020

Query: 1021 VAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQE 1080
            +AGSEEDAQKLIS+AS +NE+LGDG++ D+NP LLRHFAFGA+AVLNPMAAMFGGIV QE
Sbjct: 1021 IAGSEEDAQKLISIASTINENLGDGKLGDVNPNLLRHFAFGARAVLNPMAAMFGGIVGQE 1080

Query: 1081 VLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAK 1140
            V+KACSGKFHPL QFFYFDS+ESLPTE LEASDFRPLNSRYDAQISVFGSKLQKKLE+A+
Sbjct: 1081 VMKACSGKFHPLFQFFYFDSVESLPTEPLEASDFRPLNSRYDAQISVFGSKLQKKLEDAQ 1140

Query: 1141 VFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKST 1200
            +F+VGSGALGCEFLKNLALMGVSC+ +GKLTITDDDVIEKSNLSRQFLFRDWNIGQAKST
Sbjct: 1141 LFVVGSGALGCEFLKNLALMGVSCNGQGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKST 1200

Query: 1201 VAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLY 1260
            VAASAA AIN HL IEALQNRV PETENVF+D+FWENL+VVVNALDNVNARLYVDQRCLY
Sbjct: 1201 VAASAAAAINPHLRIEALQNRVGPETENVFNDTFWENLSVVVNALDNVNARLYVDQRCLY 1260

Query: 1261 FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS 1320
            FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS
Sbjct: 1261 FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS 1320

Query: 1321 EFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDCITW 1380
            EFEGLLEKTP +VNAYLSNPSEYTSAM NAGDAQ+RD LER++ECLD+ERCETF+DCITW
Sbjct: 1321 EFEGLLEKTPAEVNAYLSNPSEYTSAMKNAGDAQARDNLERVIECLDKERCETFQDCITW 1380

Query: 1381 ARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAA 1440
            ARLKFEDY+ NR+KQLI+TFPEDA TS GAPFWSAPKRFP PLQFS+AD+S L F+LAA+
Sbjct: 1381 ARLKFEDYYVNRMKQLIFTFPEDAATSTGAPFWSAPKRFPRPLQFSSADRSLLQFILAAS 1440

Query: 1441 ILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAI 1500
            ILRAE++ IPIPDW K+P K A+AV++V VP+F P++  KIVTDEKATSLSTAS+DDAA+
Sbjct: 1441 ILRAETFGIPIPDWAKDPRKFAEAVEKVRVPEFQPREGVKIVTDEKATSLSTASIDDAAV 1500

Query: 1501 IHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKA 1560
            I++L+ KLE     LP GFRMKPIQFEKDDD+N+HMDLIAGLANMRARNYSIPEVDKLKA
Sbjct: 1501 INELIMKLEQCRMNLPSGFRMKPIQFEKDDDTNYHMDLIAGLANMRARNYSIPEVDKLKA 1560

Query: 1561 KFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPK 1620
            KFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPK
Sbjct: 1561 KFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPK 1620

Query: 1621 VIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMD 1680
            VIKH+DMSWTVWDRWIIKDNPTLR+L++WL +KGL+AYSISCGSCLLYNSMFPRHR+RMD
Sbjct: 1621 VIKHQDMSWTVWDRWIIKDNPTLRELLQWLADKGLSAYSISCGSCLLYNSMFPRHRERMD 1680

Query: 1681 KKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYFR 1684
            KKVVDLAR+VAKVELPPYRRHLDVVVACED+EDNDIDIP +S+YFR
Sbjct: 1681 KKVVDLAREVAKVELPPYRRHLDVVVACEDEEDNDIDIPQISIYFR 1703

BLAST of Csor.00g102970 vs. ExPASy TrEMBL
Match: A0A059AEP6 (E1 ubiquitin-activating enzyme OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_J01907 PE=3 SV=1)

HSP 1 Score: 2616 bits (6780), Expect = 0.0
Identity = 1287/1703 (75.57%), Positives = 1472/1703 (86.44%), Query Frame = 0

Query: 1    MATALECWSSRASTDEDLVEQVLMRTQDRSEGS-------------KPESSLAVGEKESS 60
            MA ALECWSSRASTDED+VEQVLMRT DRSEGS             +  SS +     SS
Sbjct: 1    MAAALECWSSRASTDEDMVEQVLMRTHDRSEGSPAGAAGAASSGGAREPSSSSSSSSSSS 60

Query: 61   AMQKRLQRFSRNVSEAVASLKNSLNLDSVRDPSP--TKTEGSKKAVWGSVVRNLTQLYPG 120
             MQK+LQR SRNVSEA+ASLKNSLNLDS RD +P  +K +G ++ VWGSVVR+LTQLYPG
Sbjct: 61   VMQKKLQRLSRNVSEAIASLKNSLNLDSSRDGAPQASKIDGCRRMVWGSVVRSLTQLYPG 120

Query: 121  SQLPEKLVSNIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHDV 180
            SQLPEKLVSNIRKHYDSLPLSYAQAGF+MKDVFLHIKL+EQAS  D PAIL QEV+  +V
Sbjct: 121  SQLPEKLVSNIRKHYDSLPLSYAQAGFDMKDVFLHIKLMEQASGDDRPAILIQEVSQDEV 180

Query: 181  QKPTIKLTFACNSSVSWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFK 240
                 KLTFACNSS+SWS MSGAL++A I C+KIQIFEKK F+LG++L +    QE++FK
Sbjct: 181  HGSVFKLTFACNSSISWSVMSGALDNASICCKKIQIFEKKGFTLGIVLLLVQAEQERMFK 240

Query: 241  SKVENALKLAIKKPKTNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLN 300
            +++ENALKLA+KK +  TVKL FG CGCQE     +++ + E+D  + N R+G EN  L 
Sbjct: 241  TRIENALKLAMKKHRPATVKLAFGLCGCQEETANSREVGQAEDDVGELNYRNGSEN--LY 300

Query: 301  ENLQIEMPLSTSSFTVTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRR 360
              +Q++MPL TSSF ++VDEWQT+QSGG E+ KWLL+S+NLEF DQIGP+SFKGVYKGRR
Sbjct: 301  PKVQLQMPLPTSSFVISVDEWQTIQSGGDEIAKWLLNSDNLEFIDQIGPSSFKGVYKGRR 360

Query: 361  VAIEKIKGCEKGVSYKFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSV 420
            V IEK+KGC+KG SY+FELRKD LELMTCGHKN+L F GVCI+E+HGLCVVTKLMEGGS+
Sbjct: 361  VGIEKLKGCDKGNSYEFELRKDFLELMTCGHKNVLQFIGVCIEESHGLCVVTKLMEGGSL 420

Query: 421  HELMLKNKRLQMKEITRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGIL 480
            H+LMLK+K+LQ++EI RIA+DVVEGIKF+N+HG+ YRDLNT RILLD++GNACLGDMGI+
Sbjct: 421  HDLMLKSKKLQIREIVRIAIDVVEGIKFMNEHGITYRDLNTQRILLDRHGNACLGDMGIV 480

Query: 481  TACRNLGEAMEYETDGYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAY 540
             AC+++GEAMEYETDGYRWLAPEIIAGDPESV+ET MSNVYSFGMV+WEMVTGEAAY AY
Sbjct: 481  AACKSVGEAMEYETDGYRWLAPEIIAGDPESVSETCMSNVYSFGMVLWEMVTGEAAYAAY 540

Query: 541  SPVQAAVGIAACGLRPDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFVYAPNRSL 600
            SPVQAAVGIAACGLRPD+PKDCP  L+ LM +CWNN PSKRPQFSEI S +  Y  + + 
Sbjct: 541  SPVQAAVGIAACGLRPDIPKDCPQFLRNLMTKCWNNSPSKRPQFSEIVSLLLHYINSGND 600

Query: 601  LHFMLPRKRAGEEGVAVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQRIDS----D 660
                  RKRAGEEG  VE          +      S+ +  + G S +KK R+      +
Sbjct: 601  -----NRKRAGEEGEVVEGGESGGEGVGS------SSGAASSAGVSRLKKNRVGCFGPPE 660

Query: 661  SNANSKVAAVATGANNIVYDGASLIMASANSNPPDIDEDLHSRQLAVYGRETMRKLFASN 720
              A     + A   N     GA  IMA   S P DIDEDLHSRQLAVYGRETMR+LFASN
Sbjct: 661  LTATGNGKSNADSGNGSSGSGAP-IMALGGSMPTDIDEDLHSRQLAVYGRETMRRLFASN 720

Query: 721  VLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVELWDLSSNFVFSESDIGKNRALASAQK 780
            VL+SGMQGLG EIAKN++LAGVKSVTLHDEG+V+LWDLS NF+FSE D+GKNRALAS +K
Sbjct: 721  VLVSGMQGLGVEIAKNLVLAGVKSVTLHDEGVVQLWDLSGNFLFSERDVGKNRALASVEK 780

Query: 781  LQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDKAMEFNDFCHNHQPPIAFIKTEVR 840
            LQ+LNN+V+V TLTT+L  E+LS F+AVVFTD  L KA+EF+D+CH HQPPI+FIKTEVR
Sbjct: 781  LQELNNAVVVTTLTTKLTKERLSDFQAVVFTDIDLQKAIEFDDYCHTHQPPISFIKTEVR 840

Query: 841  GLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNPALVSCVDDERLEFQDGDLVVFSE 900
            GLFGSVFCDFGPEFTV+DV GE+PHTGIIASI NDNPALVSCVDDERLEFQDGDLVVFSE
Sbjct: 841  GLFGSVFCDFGPEFTVFDVDGEEPHTGIIASIGNDNPALVSCVDDERLEFQDGDLVVFSE 900

Query: 901  VHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEKGGIVTQVKEPKMLNFKPLREAIN 960
            VHGMTELNDGKPR+IK+ R YSF LEEDTTN+G+YEKGGIVTQVK PK+L F PL+EAI 
Sbjct: 901  VHGMTELNDGKPRKIKSARPYSFILEEDTTNYGAYEKGGIVTQVKLPKVLKFNPLKEAIK 960

Query: 961  DPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLPVAGSEEDAQKLISVASNVNESLG 1020
            DPGDFLLSDFSKFDRPPLLHLAF ALDKFV+E GR PVAGSE DAQ+LISVA+++NESLG
Sbjct: 961  DPGDFLLSDFSKFDRPPLLHLAFQALDKFVSEFGRYPVAGSEVDAQRLISVANSINESLG 1020

Query: 1021 DGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQEVLKACSGKFHPLVQFFYFDSLES 1080
            DG++EDINPKLL+HFAFG++AVLNPMAAMFGGIV QEV+KACSGKFHPL QFFYFDS+ES
Sbjct: 1021 DGKLEDINPKLLQHFAFGSRAVLNPMAAMFGGIVGQEVVKACSGKFHPLFQFFYFDSVES 1080

Query: 1081 LPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAKVFMVGSGALGCEFLKNLALMGVS 1140
            LPTE L+  D +P NSRYDAQ+SVFGSKLQKK+E+AKVF+VGSGALGCEFLKN+ALMGVS
Sbjct: 1081 LPTEPLDLDDLKPRNSRYDAQVSVFGSKLQKKMEDAKVFLVGSGALGCEFLKNIALMGVS 1140

Query: 1141 CSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAAVAINKHLNIEALQNRVS 1200
            C   GKLT+TDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAA +IN  LN+EALQNRV 
Sbjct: 1141 CGKHGKLTVTDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAATSINPRLNVEALQNRVG 1200

Query: 1201 PETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQMVIPH 1260
            PETENVFDD+FWENL+VV+NALDNVNARLYVDQ+CLYFQKPLLESGTLGAKCNTQMVIPH
Sbjct: 1201 PETENVFDDTFWENLSVVINALDNVNARLYVDQKCLYFQKPLLESGTLGAKCNTQMVIPH 1260

Query: 1261 LTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTPTDVNAYLSNPSEY 1320
            LTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTP +VNAYLSNP EY
Sbjct: 1261 LTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTPAEVNAYLSNPVEY 1320

Query: 1321 TSAMMNAGDAQSRDTLERILECLDRERCETFEDCITWARLKFEDYFSNRVKQLIYTFPED 1380
            T AM+N+GDAQ++DTLE +LECLD+ERCETFEDCI+WARLKFEDYF+NRVKQL YTFPED
Sbjct: 1321 TKAMINSGDAQAKDTLEHVLECLDKERCETFEDCISWARLKFEDYFTNRVKQLTYTFPED 1380

Query: 1381 AVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAAILRAESYAIPIPDWVKNPTKLAD 1440
            A+TS GAPFWSAPKRFP  LQFS +D  +LHFV+AA+ILRAE++ IP+PDW KNP K+A 
Sbjct: 1381 ALTSTGAPFWSAPKRFPCALQFSVSDPGHLHFVMAASILRAETFGIPVPDWAKNPKKMAQ 1440

Query: 1441 AVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAIIHDLVNKLEDTSRKLPEGFRMKP 1500
            AVD+VIVP F PK++A IVTDEKATSLSTAS+DDAA+I+DL+ +LE    KLP GFRMKP
Sbjct: 1441 AVDKVIVPGFQPKENANIVTDEKATSLSTASLDDAAVINDLITRLEHCRLKLPPGFRMKP 1500

Query: 1501 IQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMATGLVC 1560
            IQFEKDDD+N+HMDLIA LANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMATGLVC
Sbjct: 1501 IQFEKDDDTNYHMDLIAALANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMATGLVC 1560

Query: 1561 LELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPKVIKHRDMSWTVWDRWIIKDNPTL 1620
            LELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPP+ +KHRD++WTVWDRWIIK+NPTL
Sbjct: 1561 LELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPQAVKHRDLTWTVWDRWIIKNNPTL 1620

Query: 1621 RQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMDKKVVDLARDVAKVELPPYRRHLD 1680
            R+L++WL++KGLNAYSISCGSCLLYNSMFPRH++RMD+KVVDLA++VAK+E+P YRRHLD
Sbjct: 1621 RELMQWLQDKGLNAYSISCGSCLLYNSMFPRHQERMDRKVVDLAKEVAKLEVPSYRRHLD 1680

Query: 1681 VVVACEDDEDNDIDIPLVSVYFR 1684
            VVVACEDDE NDIDIP +S+YFR
Sbjct: 1681 VVVACEDDEGNDIDIPQISIYFR 1689

BLAST of Csor.00g102970 vs. ExPASy TrEMBL
Match: A0A1J7IUS6 (E1 ubiquitin-activating enzyme OS=Lupinus angustifolius OX=3871 GN=TanjilG_32163 PE=3 SV=1)

HSP 1 Score: 2596 bits (6729), Expect = 0.0
Identity = 1284/1707 (75.22%), Positives = 1472/1707 (86.23%), Query Frame = 0

Query: 1    MATALECWSSRAST------DEDLVEQVLMRTQDRSEGSKPESSLAVGEKESSAMQKRLQ 60
            MA+ LECWSSR +T      D+D VEQVLMR+  RSE +    S +   K+SS +QK+L+
Sbjct: 1    MASPLECWSSRTTTTTTTTTDDDTVEQVLMRSHHRSEATTT-PSFSSSTKDSSIVQKKLR 60

Query: 61   RFSRNVSEAVASLKNSLNLDSVRDPSPTKTEGSKKAVWGSVVRNLTQLYPGSQLPEKLVS 120
            +F+RNVSEA+ S KNSLNLDS RDP+ +K E S+K  WG+VV+NLTQLYPGSQLPEKL+ 
Sbjct: 61   KFARNVSEAINSFKNSLNLDSTRDPTSSKIEASRKITWGTVVKNLTQLYPGSQLPEKLMC 120

Query: 121  NIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHD--VQKPTIKL 180
            NIRKHYDSLPLSY QA F+MKDVFLHIKLIEQAS  D PAILFQE T++D   +   +KL
Sbjct: 121  NIRKHYDSLPLSYGQAEFDMKDVFLHIKLIEQASETDQPAILFQEETDNDGEFEGSFLKL 180

Query: 181  TFACNSSVSWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKVENAL 240
            TFACNS +SW AMS AL+S+ I C+K+QIFEKK F+LGV + V    Q+K  + +VENA+
Sbjct: 181  TFACNSPISWPAMSSALDSSSINCKKVQIFEKKSFTLGVAILVYQSGQDKFVRMRVENAI 240

Query: 241  KLAIKKPKTNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENLQIEM 300
            K A+KKP+++ VKLPFG CGCQE N  GK+L E EED  D      FENS   +N+Q++M
Sbjct: 241  KFAMKKPRSSAVKLPFGLCGCQEENFRGKELGESEEDGGDACFGKEFENSC--QNIQLQM 300

Query: 301  PLSTSSFTVTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAIEKIK 360
            PL +SSF V+VDEWQT+ S   E+ KWLLSS+++EFTDQ+ PNS+KG+Y G+RV +EK+K
Sbjct: 301  PLPSSSFIVSVDEWQTIHSCVDEIEKWLLSSDSVEFTDQVEPNSYKGLYIGKRVGVEKLK 360

Query: 361  GCEKGVSYKFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKN 420
            GC+KG SY+FELRKDLLELMTCGH+NIL F GVC+ +NHGLCVVTK MEGGSVH+LM KN
Sbjct: 361  GCDKGNSYEFELRKDLLELMTCGHRNILQFCGVCVHDNHGLCVVTKYMEGGSVHDLMSKN 420

Query: 421  KRLQMKEITRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTACRNLG 480
            K+LQ K+I RIAVDV EG+KF+NDHGVAYRDLNT RILLDK+GNACLGDMGI+TACR++G
Sbjct: 421  KKLQAKDIVRIAVDVAEGMKFMNDHGVAYRDLNTQRILLDKHGNACLGDMGIVTACRSVG 480

Query: 481  EAMEYETDGYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAV 540
            EAMEYETDGYRWLAPEIIAGDPESV ETWMSNVYS+GMVIWEMVT E AY A+SPVQAAV
Sbjct: 481  EAMEYETDGYRWLAPEIIAGDPESVTETWMSNVYSYGMVIWEMVTSEVAYSAFSPVQAAV 540

Query: 541  GIAACGLRPDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFVYAPNRSLLHFMLPR 600
            GIAACGLRP++PKDCP TLK LM +CWNN PSKRPQFS+I + +     N  LLH MLPR
Sbjct: 541  GIAACGLRPEIPKDCPQTLKYLMTKCWNNSPSKRPQFSDILAILLRPNNNNRLLHCMLPR 600

Query: 601  KRA-GEEGVAVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQRID-----SDSNANS 660
            KRA GE GV VE  TD              NN+V +  AS  KK R       S S A++
Sbjct: 601  KRASGEGGVVVEGDTDTI------------NNTVASVSASFSKKNRTGCFAECSGSGADT 660

Query: 661  KVAAVAT-------GANNIVYDGASL--IMASANSNPPDIDEDLHSRQLAVYGRETMRKL 720
              +AV         G +N   D  S+  ++    +N  DIDEDLHSRQLAVYG ETMR+L
Sbjct: 661  VGSAVNDKGNGSIGGVSNKYNDSDSIGKLIGGGAANMVDIDEDLHSRQLAVYGLETMRRL 720

Query: 721  FASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVELWDLSSNFVFSESDIGKNRALA 780
            FASN+LISGMQGLG EIAKN+ILAGVKSVTLHDEG VELWDLSSNFVFS++D+GKNRA+A
Sbjct: 721  FASNILISGMQGLGVEIAKNLILAGVKSVTLHDEGTVELWDLSSNFVFSQNDVGKNRAVA 780

Query: 781  SAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDKAMEFNDFCHNHQPPIAFIK 840
            S  KLQ+LNN+V+V +LTT+L  EQLS F+AVVFT+  L+KA+EF+D+CH+HQP IAFIK
Sbjct: 781  SVSKLQELNNAVLVQSLTTKLTKEQLSNFQAVVFTEISLEKAIEFDDYCHSHQPSIAFIK 840

Query: 841  TEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNPALVSCVDDERLEFQDGDLV 900
             EVRGLFGSVFCDFGPEFTV+DV GE+PHTGIIASISNDNP+LVSCVDDERLEFQDGDLV
Sbjct: 841  AEVRGLFGSVFCDFGPEFTVFDVDGEEPHTGIIASISNDNPSLVSCVDDERLEFQDGDLV 900

Query: 901  VFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEKGGIVTQVKEPKMLNFKPLR 960
            +FSEVHGM ELNDGKPR+IKN RAYSFTLEEDTTN+G++EKGGIVTQVK+PK+LNFKPL+
Sbjct: 901  IFSEVHGMKELNDGKPRKIKNARAYSFTLEEDTTNYGAHEKGGIVTQVKQPKVLNFKPLK 960

Query: 961  EAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLPVAGSEEDAQKLISVASNVN 1020
            +A+NDP DFLLSDFSKFDRPPLLHLAF ALD F++ELGR PVAGSE+DAQK+IS+ASN+N
Sbjct: 961  QALNDPSDFLLSDFSKFDRPPLLHLAFQALDTFISELGRFPVAGSEDDAQKVISIASNIN 1020

Query: 1021 ESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQEVLKACSGKFHPLVQFFYFD 1080
            E+LGDGR+ED+NPKLLR F FGA+AVLNPMAA+FGGIV QEV+KACSGKFHPL Q+FYFD
Sbjct: 1021 ENLGDGRLEDMNPKLLRQFTFGARAVLNPMAAIFGGIVGQEVVKACSGKFHPLFQYFYFD 1080

Query: 1081 SLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAKVFMVGSGALGCEFLKNLAL 1140
            S+ESLPTE L+A+DFRP+NSRYDAQISVFG KLQK LE+A+VF+VGSGALGCEFLKNLAL
Sbjct: 1081 SVESLPTEPLDANDFRPINSRYDAQISVFGQKLQKILEDAQVFVVGSGALGCEFLKNLAL 1140

Query: 1141 MGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAAVAINKHLNIEALQ 1200
            MGVSC ++GKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAA +IN  LNIEALQ
Sbjct: 1141 MGVSCGSQGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAAASINPRLNIEALQ 1200

Query: 1201 NRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQM 1260
            NRV PETENVF D+FWENL+VV+NALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQM
Sbjct: 1201 NRVGPETENVFHDTFWENLSVVINALDNVNARLYVDQRCLYFQKPLLESGTLGAKCNTQM 1260

Query: 1261 VIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTPTDVNAYLSN 1320
            VIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTP +VNAYLSN
Sbjct: 1261 VIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEKTPAEVNAYLSN 1320

Query: 1321 PSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDCITWARLKFEDYFSNRVKQLIYT 1380
            PSEYT+AM+ AGDAQ+RD LER+LECLD+E+CETF+DCITWARLKFEDYF+NRVKQL YT
Sbjct: 1321 PSEYTNAMIKAGDAQARDNLERVLECLDKEKCETFQDCITWARLKFEDYFANRVKQLTYT 1380

Query: 1381 FPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAAILRAESYAIPIPDWVKNPT 1440
            FPEDA TS GAPFWSAPKRFP PLQFS++D+ +L FVLAA+ILRAE++ I IP+WVK+P 
Sbjct: 1381 FPEDAATSTGAPFWSAPKRFPRPLQFSSSDEGHLQFVLAASILRAETFGISIPEWVKSPN 1440

Query: 1441 KLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAIIHDLVNKLEDTSRKLPEGF 1500
            KLA+AVD+VIVP+F P+KDAKIVTDE AT+LSTASVDDAA+I+DL+ KLE     LP GF
Sbjct: 1441 KLAEAVDKVIVPNFQPRKDAKIVTDETATNLSTASVDDAAVINDLIIKLERCWATLPPGF 1500

Query: 1501 RMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMAT 1560
            RMKPI FEKDDD+N+HMD+IAGLANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMAT
Sbjct: 1501 RMKPILFEKDDDTNYHMDMIAGLANMRARNYSIPEVDKLKAKFIAGRIIPAIATSTAMAT 1560

Query: 1561 GLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPKVIKHRDMSWTVWDRWIIKD 1620
            G VCLELYKVL GGHK+EDYRNTFANLALPLFS+AEPVPPKVIKH+DMSWTVWDRW++KD
Sbjct: 1561 GFVCLELYKVLAGGHKLEDYRNTFANLALPLFSIAEPVPPKVIKHQDMSWTVWDRWVVKD 1620

Query: 1621 NPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMDKKVVDLARDVAKVELPPYR 1680
            N TLR+L+EWLK KGLNAYSISCGSCLLYNSMFPRHRDRMDKKVVDLAR+VAK+E+P YR
Sbjct: 1621 NLTLRELLEWLKAKGLNAYSISCGSCLLYNSMFPRHRDRMDKKVVDLAREVAKMEIPTYR 1680

Query: 1681 RHLDVVVACEDDEDNDIDIPLVSVYFR 1684
            RH D+VVACEDDEDNDIDIP VS+YFR
Sbjct: 1681 RHFDIVVACEDDEDNDIDIPQVSIYFR 1692

BLAST of Csor.00g102970 vs. ExPASy TrEMBL
Match: A0A166EG44 (E1 ubiquitin-activating enzyme OS=Daucus carota subsp. sativus OX=79200 GN=DCAR_007353 PE=3 SV=1)

HSP 1 Score: 2579 bits (6685), Expect = 0.0
Identity = 1280/1718 (74.51%), Positives = 1470/1718 (85.56%), Query Frame = 0

Query: 1    MATALECWSSRASTDEDLVEQVLMRTQDRSEG-------SKPESSLAVGEKESSAMQKRL 60
            M+ ALECWSSRASTDED+VEQVLMRTQ RSE        S      A G KE+SAMQKR+
Sbjct: 1    MSAALECWSSRASTDEDMVEQVLMRTQHRSESLNDAVLSSAVSPGGAAGFKETSAMQKRI 60

Query: 61   QRFSRNVSEAVASLKNSLNLDSVRDPSPT-KTEGSKKAVWGSVVRNLTQLYPGSQLPEKL 120
            QR SRNVSEA+ASLKNSLNLDS   P P+ + E  +K VW  VVRNLTQLYPGSQLPEKL
Sbjct: 61   QRLSRNVSEAIASLKNSLNLDS---PGPSGRVENCRKNVWAGVVRNLTQLYPGSQLPEKL 120

Query: 121  VSNIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHDVQKPTIKL 180
            VSNIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASV DHPAIL QEV++ +VQ    KL
Sbjct: 121  VSNIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVQDHPAILIQEVSDDEVQGSVFKL 180

Query: 181  TFACNSSVSWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKVENAL 240
             FAC SS+SW  MSGAL++A I C+KIQIFEKK F+LG++L +    QEKLFK+++++AL
Sbjct: 181  VFACTSSLSWPTMSGALDNASICCKKIQIFEKKGFTLGIVLVLVQSGQEKLFKNRIDSAL 240

Query: 241  KLAIKKPKTNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENLQIEM 300
            KL +KKPK + +KLPFG CGCQE +T G++L   E D      R+G ENSN    +Q+++
Sbjct: 241  KLGLKKPKNSGMKLPFGLCGCQEESTRGRELGVGEVDEDSGESRNGSENSN--SRVQLQL 300

Query: 301  PLSTSSFTVTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAIEKIK 360
            PLS S+F V+VDEWQTV+SGG E+G+WLL+ +NLEF DQIG +++KG+YKG++V IEK+K
Sbjct: 301  PLSNSAFVVSVDEWQTVESGGDEIGRWLLNPDNLEFMDQIGSSTYKGLYKGKKVGIEKLK 360

Query: 361  GCEKGVSYKFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKN 420
            GC+KG SY+FE+RKDLLELMTCGHKNIL F GVCID+NHGLCVVTKLMEGGSVH+LML+N
Sbjct: 361  GCDKGNSYEFEIRKDLLELMTCGHKNILQFCGVCIDDNHGLCVVTKLMEGGSVHDLMLRN 420

Query: 421  KRLQMKEITRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTACRNLG 480
            K+LQ KEI RIA DV EGIKF+NDHGVAYRDLNTHRILLD++GNACLGDMG++ AC+++ 
Sbjct: 421  KKLQNKEIVRIAADVAEGIKFMNDHGVAYRDLNTHRILLDRHGNACLGDMGVVAACKSVT 480

Query: 481  EAMEYETDGYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAV 540
            EAMEYETDGYRWLAPEIIAGDPESV ETWMSNVYS+GM++WEMVTGE AY AYSPVQAAV
Sbjct: 481  EAMEYETDGYRWLAPEIIAGDPESVTETWMSNVYSYGMIVWEMVTGEVAYSAYSPVQAAV 540

Query: 541  GIAACGLRPDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFVYAPN----RSLLHF 600
             IAACGLRPD+PKDCP  L+ALM +CWNNCPSKRP FS+I S +     N     SLLH+
Sbjct: 541  EIAACGLRPDIPKDCPQLLRALMSKCWNNCPSKRPHFSDILSILTRPVNNGNNTNSLLHY 600

Query: 601  MLPRKRAGEEGVAVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQR---IDSDSNAN 660
            MLP+KR+  EGVA    + N+SS S                  ++KKQ+   + S SN N
Sbjct: 601  MLPKKRS-VEGVAGGGDSVNTSSDS----------------GKVVKKQKTGCLFSSSNEN 660

Query: 661  SKVAAVATGANNIVYDGASL---IMASANSNPPDIDEDLHSRQLAVYGRETMRKLFASNV 720
            +KV  +    + +   G+S+    MA  + N  +IDEDLHSRQLAVYGRETMR+LFASNV
Sbjct: 661  TKVGNMGGSVSGVGGVGSSVEKRSMALDDGNQQEIDEDLHSRQLAVYGRETMRRLFASNV 720

Query: 721  LISGMQGLGAEI----------------AKNVILAGVKSVTLHDEGIVELWDLSSNFVFS 780
            L+SGMQGLGAEI                AKN+ILAGVKSVTLHDEG VELWD+S NF F+
Sbjct: 721  LVSGMQGLGAEIEYAIYSWVLANDKMGSAKNLILAGVKSVTLHDEGNVELWDMSCNFNFT 780

Query: 781  ESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDKAMEFNDFC 840
            E+DIGKNRALAS QKLQ+LNN+V+V TLT +L  EQLS F+AVVFTD  L+ A+EF+D+C
Sbjct: 781  ENDIGKNRALASVQKLQELNNAVVVTTLTKKLTKEQLSDFQAVVFTDIDLETAIEFSDYC 840

Query: 841  HNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNPALVSCVDD 900
            HNHQP IAFIKTEVRGLFG+VFCDFGPEFTV DV GE+PHTGIIASISND  ALVSCVDD
Sbjct: 841  HNHQPSIAFIKTEVRGLFGNVFCDFGPEFTVVDVDGEEPHTGIIASISNDASALVSCVDD 900

Query: 901  ERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEKGGIVTQVK 960
            ERLEFQDGDLVVFSEV GMTELNDGKPR+I N R YSF LEEDT+ +G YE+GGIVTQVK
Sbjct: 901  ERLEFQDGDLVVFSEVRGMTELNDGKPRKIINARPYSFNLEEDTSEYGQYERGGIVTQVK 960

Query: 961  EPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLPVAGSEEDA 1020
            +PK+LNFKPL+EA+ DPG++LLSDFSKFDRPPLLHLAF ALDK+V+ELGR PVAGSEEDA
Sbjct: 961  QPKVLNFKPLKEALEDPGEYLLSDFSKFDRPPLLHLAFQALDKYVSELGRFPVAGSEEDA 1020

Query: 1021 QKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQEVLKACSGK 1080
            QKLIS+ S +NESLG+ +V++I+PKLLR F+FGA+AVL+PMAAMFGGIV QEV+KACSGK
Sbjct: 1021 QKLISIVSALNESLGERKVDNISPKLLRQFSFGARAVLSPMAAMFGGIVGQEVMKACSGK 1080

Query: 1081 FHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAKVFMVGSGA 1140
            FHPL QFFYFDS+ESLPTESLE  DF PLNSRYDAQISVFG+KLQKKLE+A+VF+VGSGA
Sbjct: 1081 FHPLFQFFYFDSVESLPTESLENRDFEPLNSRYDAQISVFGAKLQKKLEDAQVFVVGSGA 1140

Query: 1141 LGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVAASAAVA 1200
            LGCEFLKNLALMGVSC N+GKLT+TDDDVIEKSNLSRQFLFRDWNIGQAKSTVAA+AA  
Sbjct: 1141 LGCEFLKNLALMGVSCGNQGKLTVTDDDVIEKSNLSRQFLFRDWNIGQAKSTVAATAAAL 1200

Query: 1201 INKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLYFQKPLLES 1260
            IN  L+IEALQNRV PETENVFDD++WENL+VVVNALDNVNARLYVDQRCLYFQKPLLES
Sbjct: 1201 INPALHIEALQNRVGPETENVFDDTYWENLSVVVNALDNVNARLYVDQRCLYFQKPLLES 1260

Query: 1261 GTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEK 1320
            GTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEK
Sbjct: 1261 GTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEFEGLLEK 1320

Query: 1321 TPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDCITWARLKFEDY 1380
            TP +VNAYLSN SEYTSA++NAGDAQ+RD LER+LECLD++RC+ F+DCITWARL+FEDY
Sbjct: 1321 TPAEVNAYLSNTSEYTSAIVNAGDAQARDKLERVLECLDKDRCDAFQDCITWARLRFEDY 1380

Query: 1381 FSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAAILRAESYA 1440
            FSNRVKQLI+TFPEDA TS GAPFWSAPKRFP PLQF+T+D S+LHF++AA+ILRAE++ 
Sbjct: 1381 FSNRVKQLIFTFPEDASTSTGAPFWSAPKRFPRPLQFTTSDPSHLHFIMAASILRAETFG 1440

Query: 1441 IPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAIIHDLVNKL 1500
            IPIPDW  +P  LA+AVDRV+VP+F PKK  KI TDEKAT+LS +S+DD+A+I++L+ KL
Sbjct: 1441 IPIPDWATHPKALAEAVDRVMVPEFQPKKGVKIETDEKATNLSASSIDDSAVINELITKL 1500

Query: 1501 EDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKAKFIAGRII 1560
            E   + L  GF+MKPIQFEKDDD+N+HMD+IA LANMRARNYSIPEVDKLKAKFIAGRII
Sbjct: 1501 EQCRKNLLPGFKMKPIQFEKDDDTNYHMDMIAALANMRARNYSIPEVDKLKAKFIAGRII 1560

Query: 1561 PAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPKVIKHRDMS 1620
            PAIATSTAMATGLVCLELYKVL+GGHKVEDYRNTFANLALPLFS+AEPVPPKV  HRDM 
Sbjct: 1561 PAIATSTAMATGLVCLELYKVLNGGHKVEDYRNTFANLALPLFSIAEPVPPKVFVHRDMK 1620

Query: 1621 WTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMDKKVVDLAR 1680
            WTVWDRWI++ NPTLR+L++WL +KGLNAYSISCGSCLLYNSMFPRH+DRMDKKVVDLAR
Sbjct: 1621 WTVWDRWIVEGNPTLRELLKWLSDKGLNAYSISCGSCLLYNSMFPRHKDRMDKKVVDLAR 1680

Query: 1681 DVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYFR 1684
            DVAK+ELPPYRRH DVVVACEDD+DNDIDIP +S+YFR
Sbjct: 1681 DVAKLELPPYRRHFDVVVACEDDDDNDIDIPQISIYFR 1696

BLAST of Csor.00g102970 vs. ExPASy TrEMBL
Match: A0A5C7IYU1 (E1 ubiquitin-activating enzyme OS=Acer yangbiense OX=1000413 GN=EZV62_003014 PE=3 SV=1)

HSP 1 Score: 2576 bits (6678), Expect = 0.0
Identity = 1293/1726 (74.91%), Positives = 1435/1726 (83.14%), Query Frame = 0

Query: 24   MRTQDRSEG---SKPESSLAVG---------EKESSA-----------MQKRLQRFSRNV 83
            MRT DRSEG   S   SS AVG         E  SSA           MQKR QR SRNV
Sbjct: 1    MRTSDRSEGPSSSSSSSSAAVGAGAGAGAALETSSSANAAGQREYPSVMQKRFQRLSRNV 60

Query: 84   SEAVASLKNSLNLDSVRDP----------SPTKTEGSKKAVWGSVVRNLTQLYPGSQLPE 143
            SEA+ASLKNSLNL+S              S +K E  +K VWGSVVRNLTQLYPGSQLPE
Sbjct: 61   SEAIASLKNSLNLESASAAPGAREQTAASSSSKNESCRKVVWGSVVRNLTQLYPGSQLPE 120

Query: 144  KLVSNIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNHDVQK--P 203
            KLVSNIRKHYDSLPLSYAQAGF+MK+VFLHIKLIEQAS  D PAIL QEV++ +  K   
Sbjct: 121  KLVSNIRKHYDSLPLSYAQAGFDMKEVFLHIKLIEQASGDDRPAILIQEVSDDEEVKGCS 180

Query: 204  TIKLTFACNSSVSWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKV 263
              KLTFACNSS+SW AMSGAL+SA I C+KIQIFEKK F+LGV+L +  + QEK FKS+ 
Sbjct: 181  VFKLTFACNSSISWPAMSGALDSASICCKKIQIFEKKGFTLGVVLLLVQNGQEKSFKSRT 240

Query: 264  ENALKLAIKKPKTNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENL 323
            ENALK A+K+ K  +VKLPFG CGCQE NT G+D  EIE++  ++N RSG +N N+   +
Sbjct: 241  ENALKSAMKRSKPTSVKLPFGICGCQEENTKGRDFGEIEDEGGEENFRSGVDNPNVR--I 300

Query: 324  QIEMPLSTSSFTVTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAI 383
            Q++MPLSTSSF V+VDEWQTVQSGG E+GKWLL+S+N+EF DQIGPNSFKGVYKG++V I
Sbjct: 301  QLQMPLSTSSFVVSVDEWQTVQSGGEEIGKWLLNSDNVEFADQIGPNSFKGVYKGKKVGI 360

Query: 384  EKIKGCEKGVSYKFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHEL 443
            EK+KGC+KG SY+FELRKDLLELMTCGH+NI  FYGVC+DENHGLCVVTKLMEGGSVHEL
Sbjct: 361  EKLKGCDKGNSYEFELRKDLLELMTCGHRNIQQFYGVCVDENHGLCVVTKLMEGGSVHEL 420

Query: 444  MLKNKRLQMKEITRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTAC 503
            MLK+K+LQ K++ RIA DV EGIKF+NDHGVAYRDLNT RILLD++GN CLGDMGI+TAC
Sbjct: 421  MLKSKKLQTKDLIRIAADVAEGIKFMNDHGVAYRDLNTQRILLDRHGNTCLGDMGIVTAC 480

Query: 504  RNLGEAMEYETDGYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPV 563
            +++GEAM+YETDGYRWLAPEIIAGDPESV ETWMSNVYSFGMVIWEMV+GEAAY A+SPV
Sbjct: 481  KSVGEAMDYETDGYRWLAPEIIAGDPESVTETWMSNVYSFGMVIWEMVSGEAAYAAFSPV 540

Query: 564  QAAVGIAACGLRPDVPKDCPSTLKALMIRCWNNCPSKRPQFSEIPSFIFVYAPNRSLLHF 623
            QAAVGIAACGLRPD+PKDCP  LK+LM +CWN+CPSKRP  S   S   V A        
Sbjct: 541  QAAVGIAACGLRPDIPKDCPQVLKSLMTKCWNSCPSKRPHCSSTSSSSVVAA-------- 600

Query: 624  MLPRKRAGEEGVAVEEKTDNSSSSSNNNNNNNSNNSVRNEGASLIKKQRIDSDSNANSKV 683
                        A E +                                           
Sbjct: 601  ------------AAEPQ------------------------------------------- 660

Query: 684  AAVATGANNIVYDGASLIMASANSNPPDIDEDLHSRQLAVYGRETMRKLFASNVLISGMQ 743
                             IMA  +SN  DIDEDLHSRQLAVYGRETMR+LFASN+L+SG+Q
Sbjct: 661  -----------------IMALGDSNQNDIDEDLHSRQLAVYGRETMRRLFASNILVSGIQ 720

Query: 744  GLGAEI------------------------------AKNVILAGVKSVTLHDEGIVELWD 803
            GLG EI                              AKN+ILAGVKSVTLHDEG VELWD
Sbjct: 721  GLGVEIGMVSSFIIIFSSARCFSAPVFARVVVDLALAKNLILAGVKSVTLHDEGTVELWD 780

Query: 804  LSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDK 863
            LSSNF+FSE D+GKNRALAS QKLQ+LNN+V++ TLTT+L  E+LS F+AVVFTD   DK
Sbjct: 781  LSSNFIFSEQDVGKNRALASVQKLQELNNAVVISTLTTKLTKEKLSDFQAVVFTDISFDK 840

Query: 864  AMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNP 923
            A+EFND+CHNHQP I+FIK EVRGLFGSVFCDFGPEFTV DV GE+PHTGIIAS+SNDNP
Sbjct: 841  AIEFNDYCHNHQPSISFIKAEVRGLFGSVFCDFGPEFTVVDVDGEEPHTGIIASVSNDNP 900

Query: 924  ALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEK 983
            ALVSC+DDERLEFQDGDLVVFSEVHGMTELNDGKPR+IK+ R YSF+LEEDTTNFG+Y K
Sbjct: 901  ALVSCIDDERLEFQDGDLVVFSEVHGMTELNDGKPRKIKSARPYSFSLEEDTTNFGAYMK 960

Query: 984  GGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLP 1043
            GGIVTQVK+PK+LNFKPLREA+ DPGDFLLSDFSKFDRPPLLHLAF ALDKFV+ LGR P
Sbjct: 961  GGIVTQVKQPKLLNFKPLREALKDPGDFLLSDFSKFDRPPLLHLAFQALDKFVSVLGRFP 1020

Query: 1044 VAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQE 1103
            VAGSEEDAQKLIS+A+N+NE LGDGRV+DINPKLLR FAFGA+AVLNPMAAMFGGIV QE
Sbjct: 1021 VAGSEEDAQKLISLAANINEDLGDGRVDDINPKLLRLFAFGARAVLNPMAAMFGGIVGQE 1080

Query: 1104 VLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAK 1163
            V+KACSGKFHPL QFFYFDS+ESLPTE L++SDF+PLNSRYDAQISVFGSKLQKKLE+AK
Sbjct: 1081 VVKACSGKFHPLYQFFYFDSVESLPTEPLDSSDFKPLNSRYDAQISVFGSKLQKKLEDAK 1140

Query: 1164 VFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKST 1223
            VF+VGSGALGCEFLKN+ALMGVSC+++GKLT+TDDDVIEKSNLSRQFLFRDWNIGQAKST
Sbjct: 1141 VFIVGSGALGCEFLKNVALMGVSCADQGKLTVTDDDVIEKSNLSRQFLFRDWNIGQAKST 1200

Query: 1224 VAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLY 1283
            VAASAA +IN  LNI ALQNRV  ETENVFDD+FWENL VV+NALDNVNARLYVDQRCLY
Sbjct: 1201 VAASAAASINPRLNIVALQNRVGNETENVFDDTFWENLTVVINALDNVNARLYVDQRCLY 1260

Query: 1284 FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS 1343
            FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS
Sbjct: 1261 FQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARS 1320

Query: 1344 EFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDCITW 1403
            EFEGLLEKTP +VNAYLSNP EYT+AM+NAGDAQ+RD LERILEC D+E C TF+DCITW
Sbjct: 1321 EFEGLLEKTPAEVNAYLSNPDEYTTAMINAGDAQARDNLERILECFDKENCVTFQDCITW 1380

Query: 1404 ARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAA 1463
            ARLKFEDYF+NRVKQL YTFPEDA TS GAPFWSAPKRFP PLQFS AD SYLHFV AA+
Sbjct: 1381 ARLKFEDYFANRVKQLTYTFPEDAATSTGAPFWSAPKRFPRPLQFSAADPSYLHFVTAAS 1440

Query: 1464 ILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAI 1523
            ILRAE++ IPIPDWVKNP  LA+AVD+VIVP+F PKKDAKIVTDEKATSLSTAS+DDAA+
Sbjct: 1441 ILRAETFGIPIPDWVKNPKMLAEAVDKVIVPEFQPKKDAKIVTDEKATSLSTASIDDAAV 1500

Query: 1524 IHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKA 1583
            I+DL+ KLE   + L  GFRMKPIQFEKDDD+N+HMD+IAGLANMRARNYSIPEVDKLKA
Sbjct: 1501 INDLIIKLEQCRKNLLPGFRMKPIQFEKDDDTNYHMDMIAGLANMRARNYSIPEVDKLKA 1560

Query: 1584 KFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPK 1643
            KFIAGRIIPAIATSTAMATGLVCLELYKVL+GGHK+EDYRNTFANLALPLFS+AEPVPPK
Sbjct: 1561 KFIAGRIIPAIATSTAMATGLVCLELYKVLNGGHKLEDYRNTFANLALPLFSIAEPVPPK 1620

Query: 1644 VIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMD 1684
            +IKHRDMSWTVWDRWI+KDNPTLR+LI+WLK+KGLNAYSISCGSCLL+NSMFPRHRDRMD
Sbjct: 1621 IIKHRDMSWTVWDRWILKDNPTLRELIQWLKDKGLNAYSISCGSCLLFNSMFPRHRDRMD 1644

BLAST of Csor.00g102970 vs. TAIR 10
Match: AT2G30110.1 (ubiquitin-activating enzyme 1 )

HSP 1 Score: 1729.9 bits (4479), Expect = 0.0e+00
Identity = 832/1071 (77.68%), Positives = 955/1071 (89.17%), Query Frame = 0

Query: 618  NNNSNNSVRNEGASLIKKQRIDSDSNANSKVAAVATGANNIVYDGASLI----MASANSN 677
            N+ ++N++     +  KK+RID   +++ K +++    ++  + G S++    MA  NSN
Sbjct: 10   NDKNDNTIIGSDLASSKKRRIDFTESSSDKSSSILASGSSRGFHGDSVVQQIDMAFGNSN 69

Query: 678  PPDIDEDLHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGI 737
              +IDEDLHSRQLAVYGRETMR+LFASNVLISGM GLGAEIAKN+ILAGVKSVTLHDE +
Sbjct: 70   RQEIDEDLHSRQLAVYGRETMRRLFASNVLISGMHGLGAEIAKNLILAGVKSVTLHDERV 129

Query: 738  VELWDLSSNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTD 797
            VELWDLSSNFVFSE D+GKNRA AS QKLQDLNN+V+V +LT  L  E LS F+ VVF+D
Sbjct: 130  VELWDLSSNFVFSEDDVGKNRADASVQKLQDLNNAVVVSSLTKSLNKEDLSGFQVVVFSD 189

Query: 798  TGLDKAMEFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASI 857
              +++A+EF+D+CH+HQPPIAF+K +VRGLFGSVFCDFGPEF V DV GE+PHTGIIASI
Sbjct: 190  ISMERAIEFDDYCHSHQPPIAFVKADVRGLFGSVFCDFGPEFAVLDVDGEEPHTGIIASI 249

Query: 858  SNDNPALVSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNF 917
            SN+N A +SCVDDERLEF+DGDLVVFSEV GMTELNDGKPR+IK+ R YSFTL+EDTTN+
Sbjct: 250  SNENQAFISCVDDERLEFEDGDLVVFSEVEGMTELNDGKPRKIKSTRPYSFTLDEDTTNY 309

Query: 918  GSYEKGGIVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTE 977
            G+Y KGGIVTQVK+PK+LNFKPLREA+ DPGDFL SDFSKFDRPPLLHLAF ALD F  E
Sbjct: 310  GTYVKGGIVTQVKQPKLLNFKPLREALKDPGDFLFSDFSKFDRPPLLHLAFQALDHFKAE 369

Query: 978  LGRLPVAGSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGG 1037
             GR PVAGSEEDAQKLIS+A+ +N   GD +VE+++ KLLRHF+FGAKAVLNPMAAMFGG
Sbjct: 370  AGRFPVAGSEEDAQKLISIATAINTGQGDLKVENVDQKLLRHFSFGAKAVLNPMAAMFGG 429

Query: 1038 IVAQEVLKACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKK 1097
            IV QEV+KACSGKFHPL QFFYFDS+ESLP+E +++SDF P NSRYDAQISVFG+K QKK
Sbjct: 430  IVGQEVVKACSGKFHPLFQFFYFDSVESLPSEPVDSSDFAPRNSRYDAQISVFGAKFQKK 489

Query: 1098 LENAKVFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIG 1157
            LE+AKVF VGSGALGCEFLKNLALMGVSC ++GKLT+TDDD+IEKSNLSRQFLFRDWNIG
Sbjct: 490  LEDAKVFTVGSGALGCEFLKNLALMGVSCGSQGKLTVTDDDIIEKSNLSRQFLFRDWNIG 549

Query: 1158 QAKSTVAASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVD 1217
            QAKSTVAASAA  IN   NIEALQNRV  ETENVFDD+FWENL VVVNALDNVNARLYVD
Sbjct: 550  QAKSTVAASAAAVINPRFNIEALQNRVGAETENVFDDAFWENLTVVVNALDNVNARLYVD 609

Query: 1218 QRCLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCL 1277
             RCLYFQKPLLESGTLG KCNTQ VIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCL
Sbjct: 610  SRCLYFQKPLLESGTLGTKCNTQSVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCL 669

Query: 1278 TWARSEFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFE 1337
            TWARSEFEGLLEKTP +VNAYLS+P EYT++MM+AGDAQ+RDTLERI+ECL++E+CETF+
Sbjct: 670  TWARSEFEGLLEKTPAEVNAYLSSPVEYTNSMMSAGDAQARDTLERIVECLEKEKCETFQ 729

Query: 1338 DCITWARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHF 1397
            DC+TWARL+FEDYF NRVKQLIYTFPEDA TS GAPFWSAPKRFP PLQ+S++D S L+F
Sbjct: 730  DCLTWARLRFEDYFVNRVKQLIYTFPEDAATSTGAPFWSAPKRFPRPLQYSSSDPSLLNF 789

Query: 1398 VLAAAILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASV 1457
            + A AILRAE++ IPIP+W KNP + A+AVDRVIVPDF P++DAKIVTDEKAT+L+TASV
Sbjct: 790  ITATAILRAETFGIPIPEWTKNPKEAAEAVDRVIVPDFEPRQDAKIVTDEKATTLTTASV 849

Query: 1458 DDAAIIHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEV 1517
            DDAA+I DL+ K++     L   FRMKPIQFEKDDD+N+HMD+IAGLANMRARNYSIPEV
Sbjct: 850  DDAAVIDDLIAKIDQCRHNLSPDFRMKPIQFEKDDDTNYHMDVIAGLANMRARNYSIPEV 909

Query: 1518 DKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAE 1577
            DKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVE YRNTFANLALPLFSMAE
Sbjct: 910  DKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEAYRNTFANLALPLFSMAE 969

Query: 1578 PVPPKVIKHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRH 1637
            P+PPKV+KHRDM+WTVWDRW++K NPTLR++++WL++KGL+AYSISCGSCLL+NSMF RH
Sbjct: 970  PLPPKVVKHRDMAWTVWDRWVLKGNPTLREVLQWLEDKGLSAYSISCGSCLLFNSMFTRH 1029

Query: 1638 RDRMDKKVVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYFR 1685
            ++RMDKKVVDLARDVAKVELPPYR HLDVVVACED++DND+DIPLVS+YFR
Sbjct: 1030 KERMDKKVVDLARDVAKVELPPYRNHLDVVVACEDEDDNDVDIPLVSIYFR 1080

BLAST of Csor.00g102970 vs. TAIR 10
Match: AT5G06460.1 (ubiquitin activating enzyme 2 )

HSP 1 Score: 1691.8 bits (4380), Expect = 0.0e+00
Identity = 821/1063 (77.23%), Positives = 932/1063 (87.68%), Query Frame = 0

Query: 630  ASLIKKQRID----SDSNANSKVAAVATGANNIVYDGASLIMASA-----NSNPPDIDED 689
            +S +KK+RID    +D +A +   + + G NN +  G   +M+ A     NSN  +IDED
Sbjct: 15   SSPMKKRRIDHTESADGSAINASNSSSIGLNNSI-GGNDTVMSMAEFGNDNSNNQEIDED 74

Query: 690  LHSRQLAVYGRETMRKLFASNVLISGMQGLGAEIAKNVILAGVKSVTLHDEGIVELWDLS 749
            LHSRQLAVYGRETMRKLFASNVLISGMQGLG EIAKN+ILAGVKSVTLHDE +VELWDLS
Sbjct: 75   LHSRQLAVYGRETMRKLFASNVLISGMQGLGVEIAKNIILAGVKSVTLHDENVVELWDLS 134

Query: 750  SNFVFSESDIGKNRALASAQKLQDLNNSVIVHTLTTELVIEQLSKFEAVVFTDTGLDKAM 809
            SNFVF+E DIGKNRALAS  KLQ+LNN+V V TLT +L  EQLS F+ VVF D   +KA 
Sbjct: 135  SNFVFTEEDIGKNRALASVHKLQELNNAVAVSTLTGKLTKEQLSDFQVVVFVDISFEKAT 194

Query: 810  EFNDFCHNHQPPIAFIKTEVRGLFGSVFCDFGPEFTVYDVYGEDPHTGIIASISNDNPAL 869
            E +D+CH+HQPPIAFIK +VRGLFGS+FCDFGP FTV DV GE+PH+GIIAS+SN+NP  
Sbjct: 195  EIDDYCHSHQPPIAFIKADVRGLFGSLFCDFGPHFTVLDVDGEEPHSGIIASVSNENPGF 254

Query: 870  VSCVDDERLEFQDGDLVVFSEVHGMTELNDGKPRRIKNCRAYSFTLEEDTTNFGSYEKGG 929
            VSCVDDERLEF+DG+LVVFSEV GMTELNDGKPR+IKN + +SFTLEEDT+++G Y KGG
Sbjct: 255  VSCVDDERLEFEDGNLVVFSEVEGMTELNDGKPRKIKNVKPFSFTLEEDTSSYGQYMKGG 314

Query: 930  IVTQVKEPKMLNFKPLREAINDPGDFLLSDFSKFDRPPLLHLAFLALDKFVTELGRLPVA 989
            IVTQVK+PK+LNFKPLREA+ DPGDFLLSDFSKFDRPPLLHLAF ALD+F ++ GR P A
Sbjct: 315  IVTQVKQPKVLNFKPLREALKDPGDFLLSDFSKFDRPPLLHLAFQALDRFSSQAGRFPFA 374

Query: 990  GSEEDAQKLISVASNVNESLGDGRVEDINPKLLRHFAFGAKAVLNPMAAMFGGIVAQEVL 1049
            GSEEDAQKL+ +A ++NE LGD R+ED+N KLLRH AFG++AVLNPMAAMFGGIV QEV+
Sbjct: 375  GSEEDAQKLVEIAVDINEGLGDARLEDVNSKLLRHLAFGSRAVLNPMAAMFGGIVGQEVV 434

Query: 1050 KACSGKFHPLVQFFYFDSLESLPTESLEASDFRPLNSRYDAQISVFGSKLQKKLENAKVF 1109
            KACSGKFHP+ QFFYFDS+ESLP E L+AS+FRP NSRYDAQISVFGS LQKKLE+A+VF
Sbjct: 435  KACSGKFHPIFQFFYFDSVESLPKEPLDASEFRPQNSRYDAQISVFGSTLQKKLEDARVF 494

Query: 1110 MVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDWNIGQAKSTVA 1169
            +VG+GALGCEFLKNLALMGVSC  +GKLT+TDDDVIEKSNLSRQFLFRDWNIGQAKSTVA
Sbjct: 495  VVGAGALGCEFLKNLALMGVSCGTQGKLTVTDDDVIEKSNLSRQFLFRDWNIGQAKSTVA 554

Query: 1170 ASAAVAINKHLNIEALQNRVSPETENVFDDSFWENLNVVVNALDNVNARLYVDQRCLYFQ 1229
            A+AA  IN  LNI+ALQNRV PETENVFDDSFWENL VVVNALDNV ARLYVD RC+YFQ
Sbjct: 555  ATAAAGINSRLNIDALQNRVGPETENVFDDSFWENLTVVVNALDNVTARLYVDSRCVYFQ 614

Query: 1230 KPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEF 1289
            KPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEF
Sbjct: 615  KPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNIDHCLTWARSEF 674

Query: 1290 EGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERCETFEDCITWAR 1349
            EGLLEKTP +VNAYLS+P EY  AM  AGDAQ+RDTL R++ECL++E+C +F+DCITWAR
Sbjct: 675  EGLLEKTPAEVNAYLSDPVEYMKAMRTAGDAQARDTLGRVVECLEKEKCNSFQDCITWAR 734

Query: 1350 LKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQSYLHFVLAAAIL 1409
            L+FEDYF+NRVKQL YTFPEDA TS GAPFWSAPKRFP PLQFS+ D S+++FV+AA+IL
Sbjct: 735  LRFEDYFANRVKQLCYTFPEDAATSTGAPFWSAPKRFPRPLQFSSTDLSHINFVMAASIL 794

Query: 1410 RAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLSTASVDDAAIIH 1469
            RAE++ IP P+W K    LA+AV+RVIVPDF PKKDA IVTDEKAT+LSTASVDDAA+I 
Sbjct: 795  RAETFGIPTPEWAKTRAGLAEAVERVIVPDFEPKKDATIVTDEKATTLSTASVDDAAVID 854

Query: 1470 DLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYSIPEVDKLKAKF 1529
            +L  KL      L   FRMK IQFEKDDD+N+HMD+IAGLANMRARNYS+PEVDKLKAKF
Sbjct: 855  ELNAKLVRCRMSLQPEFRMKAIQFEKDDDTNYHMDMIAGLANMRARNYSVPEVDKLKAKF 914

Query: 1530 IAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFANLALPLFSMAEPVPPKVI 1589
            IAGRIIPAIATSTAMATG VCLE+YKVLDG HKVEDYRNTFANLALPLFSMAEPVPPKV+
Sbjct: 915  IAGRIIPAIATSTAMATGFVCLEMYKVLDGSHKVEDYRNTFANLALPLFSMAEPVPPKVV 974

Query: 1590 KHRDMSWTVWDRWIIKDNPTLRQLIEWLKNKGLNAYSISCGSCLLYNSMFPRHRDRMDKK 1649
            KH+D SWTVWDRW+++ NPTLR+L++WLK KGLNAYSISCGS LLYNSMF RH++RM+++
Sbjct: 975  KHQDQSWTVWDRWVMRGNPTLRELLDWLKEKGLNAYSISCGSSLLYNSMFSRHKERMNRR 1034

Query: 1650 VVDLARDVAKVELPPYRRHLDVVVACEDDEDNDIDIPLVSVYF 1684
            VVDLARDVA VELP YRRH+DVVVACEDD D D+DIPLVSVYF
Sbjct: 1035 VVDLARDVAGVELPAYRRHVDVVVACEDDNDADVDIPLVSVYF 1076

BLAST of Csor.00g102970 vs. TAIR 10
Match: AT5G58520.1 (Protein kinase superfamily protein )

HSP 1 Score: 800.4 bits (2066), Expect = 2.7e-231
Identity = 393/596 (65.94%), Positives = 475/596 (79.70%), Query Frame = 0

Query: 1   MATALECWSSRA----STDEDLVEQVLMRTQDRSEG---SKPESSL------AVGEKESS 60
           MA ALECWSSRA      D DLV+QVLMRT DRSE    S PE+SL       V ++ SS
Sbjct: 1   MAAALECWSSRAGDGGDPDNDLVDQVLMRTHDRSESVITSLPETSLEVEGSTTVFDQSSS 60

Query: 61  AMQKRLQRFSRNVSEAVASLKNSLNLDSVRD------PSPTKTE-----GSKKAVWGSVV 120
           AMQKR QR SRNVSEA+ SLKN+LNLDS RD          K E     G +K VW +VV
Sbjct: 61  AMQKRFQRLSRNVSEAIVSLKNTLNLDSARDNQSFGGAMTPKAEVSGGGGGRKLVWATVV 120

Query: 121 RNLTQLYPGSQLPEKLVSNIRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAIL 180
           +NL ++YPGSQLPEKLVSN++KHYDSLP SY+QA F+MK+VFLH+KLIEQA+  D+P  +
Sbjct: 121 KNLAKMYPGSQLPEKLVSNLKKHYDSLPFSYSQADFDMKEVFLHVKLIEQAAGDDNPVFM 180

Query: 181 FQEVTNHDVQKPTIKLTFACNSSVSWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVN 240
            QEV+  + +   ++LTFACNS +SWS MSG L+SA I C+KIQIFEKK  +LGV+L ++
Sbjct: 181 IQEVSTEEPRGSVLRLTFACNSFLSWSTMSGVLDSASICCKKIQIFEKKGLTLGVVLLLD 240

Query: 241 LDAQEKLFKSKVENALKLAIKKPKTNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCR 300
              Q  LFK++VEN LK+A KKPK  +VKLPFG CGCQE N G  +L  +EE+++  + R
Sbjct: 241 QSGQHSLFKTRVENTLKVATKKPKPTSVKLPFGLCGCQEQNGGVGELGGVEEESIQHSSR 300

Query: 301 SGFENSNLNENLQIEMPLSTSSFTVTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNS 360
            G E  NLN  +QI++PL +SSF V+VDEWQT+QSGG+E+GKWLL+S++ EF DQIGP S
Sbjct: 301 LGIE--NLNSTIQIQVPLPSSSFAVSVDEWQTIQSGGNEIGKWLLNSDSFEFGDQIGPTS 360

Query: 361 FKGVYKGRRVAIEKIKGCEKGVSYKFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVV 420
            KG+++G+RV IEK+KGC+KG SY+FELRKD LELM CGHK+IL FYGVCIDENHGLCVV
Sbjct: 361 LKGIFRGKRVGIEKLKGCDKGNSYEFELRKDYLELMACGHKSILQFYGVCIDENHGLCVV 420

Query: 421 TKLMEGGSVHELMLKNKRLQMKEITRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGN 480
           TKLMEGGS+HELMLKNK+LQ K+I RIA+D+ EG+KF+NDHGVAYRDLNT RILLDK+GN
Sbjct: 421 TKLMEGGSLHELMLKNKKLQTKQILRIAIDIAEGLKFVNDHGVAYRDLNTQRILLDKHGN 480

Query: 481 ACLGDMGILTACRNLGEAMEYETDGYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMV 540
           ACLG++GI+TAC++ GEA+EYETDGYRWLAPEIIAGDPE+  ETWMSN YSFGMV+WEMV
Sbjct: 481 ACLGNIGIVTACKSFGEAVEYETDGYRWLAPEIIAGDPENTTETWMSNAYSFGMVLWEMV 540

Query: 541 TGEAAYGAYSPVQAAVGIAACGLRPDVPKDCPSTLKALMIRCWNNCPSKRPQFSEI 573
           TGEAAY + SPVQAAVGIAACGLRP++PK+CP  L+ LMI CWNN PSKRP FS I
Sbjct: 541 TGEAAYASCSPVQAAVGIAACGLRPEIPKECPQVLRTLMINCWNNSPSKRPNFSHI 594

BLAST of Csor.00g102970 vs. TAIR 10
Match: AT5G07140.1 (Protein kinase superfamily protein )

HSP 1 Score: 763.1 bits (1969), Expect = 4.8e-220
Identity = 375/579 (64.77%), Positives = 462/579 (79.79%), Query Frame = 0

Query: 1   MATALECWSSR----ASTDEDLVEQVLMRTQDRSEG-SKPESSLAVGEKESSAMQKRLQR 60
           MA+ALECW++R       D+D V+QVLM ++DRSE  + P  S    ++ SSAMQKR QR
Sbjct: 1   MASALECWTTRNAAGGCADDDFVDQVLMSSEDRSESLTAPPPS----DQTSSAMQKRFQR 60

Query: 61  FSRNVSEAVASLKNSLNLDSVRDPSPTKTEGSKKAVWGSVVRNLTQLYPGSQLPEKLVSN 120
             RNVS+A+ASLKNSLNLDS RD     T G +K VW +VVRNL ++YPGSQLP+KLVSN
Sbjct: 61  LGRNVSDAIASLKNSLNLDSSRDNQNAATGGGRKLVWATVVRNLAKMYPGSQLPDKLVSN 120

Query: 121 IRKHYDSLPLSYAQAGFEMKDVFLHIKLIEQASVYDHPAILFQEVTNH--DVQKPTIKLT 180
           +RKHYDSLPLSY+Q GF+MKDVF+HIKLIEQA   D+P  + QEV +   D Q    KLT
Sbjct: 121 LRKHYDSLPLSYSQTGFDMKDVFVHIKLIEQAFGDDNPVFVIQEVCDEEADDQGSVFKLT 180

Query: 181 FACNSSVSWSAMSGALESAGIRCEKIQIFEKKKFSLGVILFVNLDAQEKLFKSKVENALK 240
           FAC  S+ WS +SG+L+ A I C+K+QIFEKK  +LGV+L +    QEK+FK KV NAL+
Sbjct: 181 FACTCSLPWSTISGSLDGALICCKKVQIFEKKGLTLGVVLLLVESGQEKMFKVKVVNALR 240

Query: 241 LAIKKPKTNTVKLPFGFCGCQEGNTGGKDLREIEEDAVDQNCRSGFENSNLNENLQIEMP 300
            A++KPK+ +VKLPFG CGC+E N G  +  +++ +++DQ  R   E  +LN  +Q++MP
Sbjct: 241 SAVRKPKSTSVKLPFGLCGCEEQNAGVGEFGDVDVESIDQCYR--HELDDLNTRIQLQMP 300

Query: 301 LSTSSFTVTVDEWQTVQSGGHELGKWLLSSENLEFTDQIGPNSFKGVYKGRRVAIEKIKG 360
             +SSF+V+VDEWQT+QSGG ++ KWLL+S++LEF+ Q+GPNSFKGVY+G +VAIEK+KG
Sbjct: 301 PPSSSFSVSVDEWQTIQSGGDDIRKWLLNSDDLEFSGQLGPNSFKGVYRGTKVAIEKLKG 360

Query: 361 CEKGVSYKFELRKDLLELMTCGHKNILMFYGVCIDENHGLCVVTKLMEGGSVHELMLKNK 420
           CEKG SY+F +RKD LELMTCGHK+IL FYGVCIDENHGLCVVTKLM+GGS+ EL+LK K
Sbjct: 361 CEKGNSYEFAIRKDFLELMTCGHKSILQFYGVCIDENHGLCVVTKLMQGGSLRELVLKKK 420

Query: 421 RLQMKEITRIAVDVVEGIKFINDHGVAYRDLNTHRILLDKNGNACLGDMGILTACRNLGE 480
           +LQ K I +IAVD+ EG+KFINDHGVAYRDLNT RILLDK  NACLGD+GI+TAC+++ E
Sbjct: 421 KLQTKLIFQIAVDIAEGMKFINDHGVAYRDLNTQRILLDKQCNACLGDLGIVTACKSVNE 480

Query: 481 AMEYETDGYRWLAPEIIAGDPESVNETWMSNVYSFGMVIWEMVTGEAAYGAYSPVQAAVG 540
           AMEYETDGYRWLAPEIIAGDPE   E+WMSN YSFGMV+WEMVTGE AYG+ SPVQAAVG
Sbjct: 481 AMEYETDGYRWLAPEIIAGDPEKTRESWMSNAYSFGMVLWEMVTGEEAYGSCSPVQAAVG 540

Query: 541 IAACGLRPDVPKDCPSTLKALMIRCWNNCPSKRPQFSEI 573
           IAACGLRPD+PK+CP  LK LMI+CWN CPS R  FS+I
Sbjct: 541 IAACGLRPDIPKECPQVLKYLMIKCWNTCPSTRLNFSQI 573

BLAST of Csor.00g102970 vs. TAIR 10
Match: AT2G21470.1 (SUMO-activating enzyme 2 )

HSP 1 Score: 167.2 bits (422), Expect = 1.2e-40
Identity = 162/545 (29.72%), Positives = 239/545 (43.85%), Query Frame = 0

Query: 1091 QKKLENAKVFMVGSGALGCEFLKNLALMGVSCSNEGKLTITDDDVIEKSNLSRQFLFRDW 1150
            Q  ++ AKV MVG+G +GCE LK LAL G        + I D D IE SNL+RQFLFR  
Sbjct: 7    QSAIKGAKVLMVGAGGIGCELLKTLALSGFE-----DIHIIDMDTIEVSNLNRQFLFRRS 66

Query: 1151 NIGQAKSTVAASAAVAINKHLNIEALQNRV-SPETENVFDDSFWENLNVVVNALDNVNAR 1210
            ++GQ+K+ VA  A +    ++NI +    V +PE    FD  F++  +VV+N LDN++AR
Sbjct: 67   HVGQSKAKVARDAVLRFRPNINIRSYHANVKNPE----FDVDFFKQFDVVLNGLDNLDAR 126

Query: 1211 LYVDQRCLYFQKPLLESGTLGAKCNTQMVIPHLTENYGASRDPPEKQAPMCTVHSFPHNI 1270
             +V++ CL    PL+ESGT G      + I   TE Y     P  K  P+CT+ S P   
Sbjct: 127  RHVNRLCLAADVPLVESGTTGFLGQVTVHIKGKTECYECQTKPAPKTYPVCTITSTPTKF 186

Query: 1271 DHCLTWARSEFEGLLEKTPTDVNAYLSNPSEYTSAMMNAGDAQSRDTLERILECLDRERC 1330
             HC+ WA+   + L  K   D N    N     S    +   ++ D  ER  +       
Sbjct: 187  VHCIVWAK---DLLFAKLFGDKNQ--DNDLNVRSNNSASSSKETEDVFERSED------- 246

Query: 1331 ETFEDCITWARLKFEDYFSNRVKQLIYTFPEDAVTSNGAPFWSAPKRFPHPLQFSTADQS 1390
               ED   + R  ++  F + +         +A  SN   + +  +R P P+        
Sbjct: 247  ---EDIEQYGRKIYDHVFGSNI---------EAALSNEETWKN--RRRPRPI-------- 306

Query: 1391 YLHFVLAAAILRAESYAIPIPDWVKNPTKLADAVDRVIVPDFMPKKDAKIVTDEKATSLS 1450
            Y   VL  ++ +               T+     D  ++   MP    K         L 
Sbjct: 307  YSKDVLPESLTQQ-----------NGSTQNCSVTDGDLMVSAMPSLGLK-----NPQELW 366

Query: 1451 TASVDDAAIIHDLVNKLEDTSRKLPEGFRMKPIQFEKDDDSNFHMDLIAGLANMRARNYS 1510
              + +    I  L  KL    RK   G     + F+KDD     ++ +   AN+RA ++ 
Sbjct: 367  GLTQNSLVFIEAL--KLFFAKRKKEIGH----LTFDKDD--QLAVEFVTAAANIRAESFG 426

Query: 1511 IPEVDKLKAKFIAGRIIPAIATSTAMATGLVCLELYKVLDGGHKVEDYRNTFA------N 1570
            IP     +AK IAG I+ A+AT+ A+  GL+ +E  KVL     V+ +R T+        
Sbjct: 427  IPLHSLFEAKGIAGNIVHAVATTNAIIAGLIVIEAIKVLK--KDVDKFRMTYCLEHPSKK 478

Query: 1571 LALPLFSMAEPVPPKVIKHRDMSWTVWDRWIIKDNPTLRQLIEWL-KNK-GLNAYSISCG 1627
            L L      EP P   +     S T     I      LR L++ + K K G+N   I  G
Sbjct: 487  LLLMPIEPYEPNPACYV----CSETPLVLEINTRKSKLRDLVDKIVKTKLGMNLPLIMHG 478

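The percentages in an HSP summary line are simply the match counts divided by the alignment length. As a minimal sketch (the function name `hsp_percentages` is hypothetical and not part of any BLAST tool), the values reported for the AT2G21470.1 hit can be recomputed from its counts:

```python
import re

# HSP summary for the AT2G21470.1 hit; the counts are copied from the report above.
hsp_line = "Identity = 162/545 (29.72%), Positives = 239/545 (43.85%), Query Frame = 0"


def hsp_percentages(line):
    """Return {label: (matches, aligned_length, percent)} recomputed from the counts."""
    stats = {}
    # Only "label = n/m" pairs are captured, so "Query Frame = 0" is ignored.
    for label, hits, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        hits, length = int(hits), int(length)
        stats[label] = (hits, length, round(100 * hits / length, 2))
    return stats


for label, (hits, length, pct) in hsp_percentages(hsp_line).items():
    print(f"{label}: {hits}/{length} = {pct}%")
```

The recomputed values agree with the reported 29.72% identity and 43.85% positives over the 545 aligned columns.
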
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
P93028 | 0.0e+00 | 77.68 | Ubiquitin-activating enzyme E1 1 OS=Arabidopsis thaliana OX=3702 GN=UBA1 PE=1 SV...
P92974 | 0.0e+00 | 77.23 | Ubiquitin-activating enzyme E1 2 OS=Arabidopsis thaliana OX=3702 GN=UBA2 PE=1 SV...
P20973 | 0.0e+00 | 78.71 | Ubiquitin-activating enzyme E1 1 OS=Triticum aestivum OX=4565 GN=UBA1 PE=1 SV=1
P31251 | 0.0e+00 | 78.61 | Ubiquitin-activating enzyme E1 2 OS=Triticum aestivum OX=4565 GN=UBA2 PE=2 SV=1
P31252 | 0.0e+00 | 75.27 | Ubiquitin-activating enzyme E1 3 OS=Triticum aestivum OX=4565 GN=UBA3 PE=1 SV=1
Match Name | E-value | Identity | Description
KAG6570640.1 | 0.0 | 100.00 | Ubiquitin-activating enzyme E1 1, partial [Cucurbita argyrosperma subsp. sororia...
KAG6605397.1 | 0.0 | 87.94 | Ubiquitin-activating enzyme E1 1, partial [Cucurbita argyrosperma subsp. sororia...
KAA8543715.1 | 0.0 | 77.58 | hypothetical protein F0562_021539 [Nyssa sinensis]
KCW52517.1 | 0.0 | 75.57 | hypothetical protein EUGRSUZ_J01907 [Eucalyptus grandis]
OIW16493.1 | 0.0 | 75.22 | hypothetical protein TanjilG_32163 [Lupinus angustifolius]
Match Name | E-value | Identity | Description
A0A5J5BKP9 | 0.0 | 77.58 | E1 ubiquitin-activating enzyme OS=Nyssa sinensis OX=561372 GN=F0562_021539 PE=3 ...
A0A059AEP6 | 0.0 | 75.57 | E1 ubiquitin-activating enzyme OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_J01907 ...
A0A1J7IUS6 | 0.0 | 75.22 | E1 ubiquitin-activating enzyme OS=Lupinus angustifolius OX=3871 GN=TanjilG_32163...
A0A166EG44 | 0.0 | 74.51 | E1 ubiquitin-activating enzyme OS=Daucus carota subsp. sativus OX=79200 GN=DCAR_...
A0A5C7IYU1 | 0.0 | 74.91 | E1 ubiquitin-activating enzyme OS=Acer yangbiense OX=1000413 GN=EZV62_003014 PE=...
Match Name | E-value | Identity | Description
AT2G30110.1 | 0.0e+00 | 77.68 | ubiquitin-activating enzyme 1
AT5G06460.1 | 0.0e+00 | 77.23 | ubiquitin activating enzyme 2
AT5G58520.1 | 2.7e-231 | 65.94 | Protein kinase superfamily protein
AT5G07140.1 | 4.8e-220 | 64.77 | Protein kinase superfamily protein
AT2G21470.1 | 1.2e-40 | 29.72 | SUMO-activating enzyme 2
InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (sororia) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR000011 | Ubiquitin/SUMO-activating enzyme E1 | PRINTS | PR01849 | UBIQUITINACT | coord: 1327..1354, score: 48.81; coord: 703..727, score: 76.83; coord: 818..845, score: 60.12; coord: 1126..1149, score: 74.83; coord: 1173..1198, score: 52.72
IPR018965 | Ubiquitin-activating enzyme E1, C-terminal | SMART | SM00985 | UBA_e1_C_a_2 | coord: 1557..1679, e-value: 2.7E-52, score: 189.7
IPR018965 | Ubiquitin-activating enzyme E1, C-terminal | PFAM | PF09358 | E1_UFD | coord: 1590..1679, e-value: 3.2E-22, score: 79.0
IPR019572 | Ubiquitin-activating enzyme, SCCH domain | PFAM | PF10585 | UBA_e1_thiolCys | coord: 1266..1519, e-value: 1.8E-83, score: 280.4
IPR042063 | Ubiquitin-activating enzyme E1, SCCH domain | GENE3D | 1.10.10.2660 | - | coord: 1251..1526, e-value: 9.4E-194, score: 646.3
IPR001245 | Serine-threonine/tyrosine-protein kinase, catalytic domain | PFAM | PF07714 | PK_Tyr_Ser-Thr | coord: 344..574, e-value: 1.7E-38, score: 132.4
IPR042449 | Ubiquitin-activating enzyme E1, inactive adenylation domain, subdomain 1 | GENE3D | 3.50.50.80 | - | coord: 673..790, e-value: 9.3E-35, score: 121.2
IPR042302 | Ubiquitin-activating enzyme E1, FCCH domain superfamily | GENE3D | 2.40.30.180 | - | coord: 844..925, e-value: 4.5E-111, score: 372.7
IPR000594 | THIF-type NAD/FAD binding fold | PFAM | PF00899 | ThiF | coord: 1079..1578, e-value: 3.7E-74, score: 356.8
IPR000594 | THIF-type NAD/FAD binding fold | PFAM | PF00899 | ThiF | coord: 683..1061, e-value: 3.2E-30, score: 356.8
IPR032420 | Ubiquitin-activating enzyme E1, four-helix bundle | PFAM | PF16191 | E1_4HB | coord: 928..997, e-value: 1.4E-21, score: 76.4
IPR038252 | Ubiquitin-activating enzyme E1, C-terminal domain superfamily | GENE3D | 3.10.290.60 | - | coord: 1592..1682, e-value: 6.9E-21, score: 76.4
None | No IPR available | GENE3D | 1.10.510.10 | Transferase(Phosphotransferase) domain 1 | coord: 319..576, e-value: 1.6E-49, score: 170.3
None | No IPR available | GENE3D | 3.40.50.12550 | - | coord: 801..1089, e-value: 4.5E-111, score: 372.7
None | No IPR available | GENE3D | 3.40.50.720 | - | coord: 1099..1577, e-value: 9.4E-194, score: 646.3
None | No IPR available | PIRSR | PIRSR039133-1 | PIRSR039133-1 | coord: 1081..1290, e-value: 1.8E-47, score: 160.0
None | No IPR available | PIRSR | PIRSR039133-2 | PIRSR039133-2 | coord: 1081..1290, e-value: 1.8E-47, score: 160.0
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 599..626
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 605..626
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 25..46
None | No IPR available | PANTHER | PTHR10953 | UBIQUITIN-ACTIVATING ENZYME E1 | coord: 660..1683
None | No IPR available | PANTHER | PTHR10953:SF215 | UBIQUITIN-ACTIVATING ENZYME E1 1 | coord: 660..1683
None | No IPR available | CDD | cd01490 | Ube1_repeat2 | coord: 1098..1637, e-value: 0.0, score: 728.702
None | No IPR available | CDD | cd01491 | Ube1_repeat1 | coord: 681..1066, e-value: 7.05987E-146, score: 447.87
IPR018075 | Ubiquitin-activating enzyme E1 | TIGRFAM | TIGR01408 | TIGR01408 | coord: 676..1684, e-value: 0.0, score: 1680.5
IPR032418 | Ubiquitin-activating enzyme E1, FCCH domain | PFAM | PF16190 | E1_FCCH | coord: 856..926, e-value: 3.1E-27, score: 94.5
IPR033127 | Ubiquitin-activating enzyme E1, Cys active site | PROSITE | PS00865 | UBIQUITIN_ACTIVAT_2 | coord: 1258..1266
IPR018074 | Ubiquitin-activating enzyme E1, conserved site | PROSITE | PS00536 | UBIQUITIN_ACTIVAT_1 | coord: 1041..1049
IPR000719 | Protein kinase domain | PROSITE | PS50011 | PROTEIN_KINASE_DOM | coord: 326..578, score: 25.954983
IPR035985 | Ubiquitin-activating enzyme | SUPERFAMILY | 69572 | Activating enzymes of the ubiquitin-like proteins | coord: 676..1063
IPR035985 | Ubiquitin-activating enzyme | SUPERFAMILY | 69572 | Activating enzymes of the ubiquitin-like proteins | coord: 1074..1574
IPR011009 | Protein kinase-like domain superfamily | SUPERFAMILY | 56112 | Protein kinase-like (PK-like) | coord: 317..589
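
In the annotations above, the protein kinase hits fall within roughly residues 317..589, while the ubiquitin-activating enzyme E1 hits lie within 660..1684. A minimal sketch of checking this from the `coord:` strings follows; the dictionary keys and the helper names `parse_coord` and `overlap` are hypothetical labels chosen for illustration, and only three of the listed hits are included:

```python
import re

# Alignment strings copied from three of the InterPro hits listed above.
# The keys are informal labels for this sketch, not database identifiers.
hits = {
    "kinase_PS50011": "coord: 326..578",
    "e1_PTHR10953": "coord: 660..1683",
    "e1_TIGR01408": "coord: 676..1684",
}


def parse_coord(text):
    """Extract (start, end) residue positions from a 'coord: a..b' string."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def overlap(a, b):
    """Number of residues shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


spans = {name: parse_coord(coord) for name, coord in hits.items()}

# The kinase-domain hit and the E1 family hit do not share any residues.
print(overlap(spans["kinase_PS50011"], spans["e1_PTHR10953"]))
# The two E1 family-level hits cover almost the same region.
print(overlap(spans["e1_PTHR10953"], spans["e1_TIGR01408"]))
```

Treating the intervals as inclusive matches the way the table reports start and end positions; whether the two regions correspond to distinct functional domains in the actual protein is not something this check can establish.
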

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Csor.00g102970.m01 | Csor.00g102970.m01 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006974 | cellular response to DNA damage stimulus
biological_process | GO:0006468 | protein phosphorylation
biological_process | GO:0016567 | protein ubiquitination
biological_process | GO:0006511 | ubiquitin-dependent protein catabolic process
biological_process | GO:0006464 | cellular protein modification process
cellular_component | GO:0005737 | cytoplasm
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0005524 | ATP binding
molecular_function | GO:0004672 | protein kinase activity
molecular_function | GO:0004839 | ubiquitin activating enzyme activity
molecular_function | GO:0008641 | ubiquitin-like modifier activating enzyme activity