Cucsa.017950 (gene) Cucumber (Gy14) v1

Name: Cucsa.017950
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: ENTH/VHS/GAT family protein
Location: scaffold00252 : 199689 .. 219917 (-)
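
The location string above can be split into scaffold, coordinates, and strand with a short script. This is a minimal sketch, not part of the database: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'scaffold : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("scaffold00252 : 199689 .. 219917 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # scaffold00252 - 20229
```

Under that assumption the gene spans 20,229 bp on the minus strand of scaffold00252.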
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
AAACGCAACAACGCCCATTCTCTCACCAAAGGACAGAGCGACACACGCATACACTTTTAGATAGAGAGAGAGAGATTCTCATTTCTTTCTCTTCTGTCGTTCAACTCCAAATCTTCATCCCCATTATTTGCTTCTTTATTATCCTTTCAATTCCTCCTCTTAATCATTCATCATTACGCCTACCCACAGAGTTTCCAGCCATTTTTCAACCCTTTTTGCCCCAACCCCATTTCTTCTTTTTTCCTCAAATGCTCATCCTCTTCCTCTTCTAGGGTTTTTCACTTCTCTTCTTTTGTCTATTCTGGTAACGTTAACTCTACTTTCTTGCTTTTCACCTGTTCTTCAGTATTGAGTTTTTATTTTGAGATCTGTGTTGCTACTCTTTTCATTTTGATTGAAACTCCTCTTTTTTTCACACATGGGTTTGCACTTTACTGGACTTGAAGTTGAGTATCAGCTGTATCGTGTTTCTTTTACTTTCATTTCATGTGTTTTTGAACTTTTCTTTTTTCTTTTCCCCAGGCACTAAATTCTCTATTATTGTTGTTAATTTGTTTGGTTTTTAAGCTTGGACTTCCTTTTTCCCCCATTACAGAGTGGCTTATGGTTCCCACATCGGAAGGCTTTCTTGATTTCTATCTCTGCTTCTATTGGTGCAGTATGCCTATAAAGTCCAAGATATTGTAGAGCATTTTGGGGTTTTGTTTACCTTGACGATTTGTTACTCCCTTGACAATTCAAATATATAGGAGGTTGCTGATCTCGGTTAGTTATTCTTTTACTCTCTACTTTGTCATTTTAGTGTTTGAGATTTTTCATTTTCTGTATAAAAAGTTATGGTAATGTGTAGTTGTTATTATAAGCATTAGTAAGTGAAAATTTCATTATTTTTACTAAATACGTATTGCACAGCAGAACATAAAGATAAGTCTGCATTTAATTCTTTCTTTAAAAAACTACAAGGCCTTTTACCATTATGCTCTGTTATTATAGATTGTTCACTTTAGTGAGCGATGTTGTTTTTCCATTGAGTTTTTCTTTTGTCATCTGTAATGAATGATAATTAGCTCTGGATTATCTTTTTTCTCTCCATTACCTTGTCAGTTGAGTAGGAACATTGCCTATATTTGGAGTAAGGATGCCAAGACATGTGTAGTATAGTTTCTTCTCTTGCAAGTTTCTGTAGTTATTTACCAATGTTACAAAATATTATAGCATATCTGTGGTAGACCATCTGTGTCACAATTGACACAAATAGTAGTCTATTGCGGTTTATCACATATAGATTGTGATATTTTGTTATATTTGTAAATATTTTGGTTCATTTTACTACATTTCTTTCAATGTTGAAGGTTTCTGTAAGAAGTTATCACTTGACTGCATTACATTTTTTAGTCAATATTTTCTGACTATTGAAATATTTTAGTTGCCCTTTCACTTCTGAATCTGCCTCTTCTATTCTTACGTTTTTTTATCGACATTTTATTTTTTAAAAAAACCAAAATTAGGGGTACTATAAAGATGGCTGCTGAACTAGTCAACTCTGCTACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATCTGTGAATTAGTTGCTCATGATCAAAGGTGTGTCTTGTATATTCTCAAAGTATATTTTCCCACTTTTTTTGGTTTGATAAGGAAGAACTCATTTGGATTCTGCCTACAAAATATGTGAGGATAGATTTGGTTTTAAGTATGAGTTTATGACTCCCAAGCCAGACTGCTTATATCAGATGTATGGATGACAATATCTCGTTATGCCTTTATAATATCTCGTTATGCCTTTATAATATCACATTGCCACAAGTTCTAGTCTTCTTTGAGGAGATGGTCTTATTACTCTTAGTTGTAAGATTTGTGCCGCAGTTTCTTCTTCTATAAAAAATGCATTTTATGAAAATGCCATTTTCCTCAGTATTGCTTTCTTTGTTCTATTTGTGCAGGCAAGCAAAAGAGGTCATA
AAAGCGATTAAAAAACGACTAGGAAATAAAAATGCAAATGCACAACTTTATGCAGTTTTGGTAAGATATTGATTCTAGTTACGCCAATTCTATTATATATTTAATATAAAAGCATGTGCCTCCTTTCATGTTTTATGTACCAAATTCATTCTTTTAGGAAAATGCATGCAATGAATATTAGGCCTTTTATCCCGCATGCTTCCAATTTTATCGGAGGTCCTTGAAGAGATGCCAGTTATCCTGCATCACAGGTGTGAAAGAGCACCAAAGAGCATGTGGATGGTGAGGTGTAATTGGCACTATTGGGAATATACCAATCTTAACATTAAGAACGGAGGAAGAGTGAGATTAGAGGAGAAAGTAAGTTTTTTCTTCTTTCTCCCCTCTCTCCCTTCTCCTACCTTCTTCTTCCCTTTGCATCATCTCATCCCCTCCAATCATCTTTCTCAGTTCTCCAGCGAGTGTCTAGCCTTTTATTTCGATCGCTCATTCAGAACTAGATATGGAAATGGAAAGTTGCAATATAGTTGACTCTCTTCACTGTATTTGGTTATGGACTATTCCAAGATGAAGACGTGGACGGCAACTAAATTCTGCCTTCATCAAATTTCGAACCGAGTTGGTTGCAAGAAGTTCTCTCAGAACTAAGCCAAAAACTAGAAAATCAATTCTTTCTCAAGAAAGAGAAGACCGATAATGGAATAACAGAAGTATCTAAATTCAGAGCCACAAAAGGTTGGATATTGAGATGCACTCTTTGGTCATGTATTGGCGGTAGGTCCTTGGTCATGTATTGGTGGTAGATCCTTCATTCAAGTTCTTTTAGGTGAGGAAAAACAGGGTTGGAAAACTTTTATTCAATTGCTAGGAGGCTTCAAATCGAAACTTGAATACTCGACATGGATTTCATCCCATTCAATGTCATCAAATATCCTTAAGAAGGACGTGAGCGCACGAAAGTCAACATGGGAAAGATATGAAGTCAAGGTGCATCCGAAAAAAAGTCACACATTCCTCTGTTTTTGACCTGGAAAAATAGAGCATGAGTTGTTACTGTATTATAAAGAATCCAAAAGTTCTCAAGACTAATTTCAACAATCTTTGGATTGTAACACGACTTTTTGAGTTTGATAAATGCTGAGCCATAGCTAGCTCACTAAAATTATACTTTGATACTGATGTCATTCTTAAAACTTTGTCTGCGGAATGTGCCTTAATCCTGTTGGATTAGGGTGAGTTGGAAGCTTTTGCTGAGTTCCCATGAACATGGCAAGAATGTGGTCAGTTTCATTTAAATAAATTGAAAAGTGGAACAAGTATCTACATGGTCGTCTGAATGTAATGAAAAGGTTCAATGGGTGGATCTCGATTAAAGATTTACCATTGGGCTGTTGGAGCTGAAAACCTTTTAAGTTACTGGGTCTTACTTGGGCGGATTAGTATCTATAGCAAATGGAACGTTGAATCTTATTAATGTTGTTGAAGCAAAAAATCAAGTTAAAAGAAACTTATGTGGTTTTATGCAATCTACCATAGAAATTTCAGATGAAAGCAAAGGTAGTATTTTTTTGAATTTTGGAGATGGCCCCCCCCCTCAAAGTTTAAGGACTAAGGTGCTTTGTTTATTAAAGATTGTTCCAACCTAATTTATTTGACTTGACTAAAGCAAGTTATGATTGATGAAGATTTAGATTCTTCTGTTTTGAATCTAGAATGGACATTTCAAGCTGCCCCGTAGTCCAAAAAATTTTCATGAAATTATTTTGCATCGAAGTAGCTTCAAGGGGAGAACCACACACAAAACCCACTAGAAATTGACAGTAGAGACTCTCCGACATGAGTCTTCCTGCTGACGATGGAAGAATCAAGGAGAAGTGAACAAACTCACGCTTTTCGAATGAGACGGACTACGACAACGCTTCGAACTGAAAGCAGTCCAAGGGATGCCTTTATTCTTGGTGCGAAAAGAAAGTTTCAAAATTTGCCAGTTGTTAGCCCTT
TAAAGGACGCCCATAAGTCTTAAGAAGATAAAGACTCAACAGTCATTAATGATTCCTGTTTCCCAATTAATGAACCCTTGCCCAAAGTCTCCTTTGTCCCATTAGCAACCTCTCTTGAAAATAACAACTCTAAGTCTCTCTCATGGGTAGGTGAATGAGGTTGGACTGTGCAAGCCTATTTTGTCTTCCACACCTCACCTTTCACCTTCATCCCTGTTGGATAAAGGGTTATATAAAGTTATATGGAAAACTAGTAGGAAAGTTAATATACTCACCTTTGGGTTGCCGAACTTATGCAAAGAAAACTCCCAAATAGTTGCCTCTTACCTTTGGTTTGCCCTCTGTGCATGAAAGAAGAGGAAGATTTACCACATCTGTCTTTTACATGTTCATATTCAACCAGTTGTTGGGGAACCTGTTCTCTTTATTCAGTGTTGCTTGGGTTTTTGGACATTTGTTTAGCTCAAAAGTTCAATAGGTTCTTTTGGGTCTTTTCTTAAAGAAAGAAAGGGCCGAGACTAATATGGGGAAACATGACTAAAGCCTTGCTTGTGGAATTATGGTTTGAATGAAATCAAAGGGGCTTCTTCAACAACAAAATCTATAATCATCCTTCAGACTTCTTCTAAAGGCCAAAGACCAAGAGGACGTATCTTGGTCCCAATGTGCTGCAACTGATCCAGTGGGCAAAAGAGCAATTCTAAAGAGTCTAGAATATTGCTCTTTAAAGGGAGTGTTACCTACCTAAATATCTGTCCAAAAGCCAATTCTTTGGCCGTTACCAAGATTAAAAGAGGCAAATGAATCAACCGACCTCCAAACTCTAGCAATGCTAACCCAAGGACTCCTTAGGCTATTACCAGACTTCCCTTTAGTGAACCAATCGGAGGGCTTCTTACCATGCTAATTCCTTTCAATTCTTAAGCTTCCAATTACTTTGAAAGACTTGGAGTTTCTTCCAAGATTCGGGTTTACATGAAGCCTGTCTTCTGCTGGGCCGTACCTGACCATGAAAGCAATACTGATATCTCTCTTGTCGAAGATCTCTAGTCACTTCCTTTGCTCCCTTCTGGTATGAAGATTCTCGTTGGATAATACGGCAAACCTACCTCTTGCATTTTCCAACACTTTCAAGATATTGGAAGGAGAGATAACGAGCACACACTAATTTACGTGGAAACCCGAGTACCGGGAGAAAAACCACGATTGTTTGTTGATATTATTTTCTAATGAATAATACAATAGGTACAAGGGAGAATAAATAGAGAATAACAAGAGAATAAAAAAGGAAAAGATATAGGAAATAAGGAAAATATTCCCATAATCTTTCCAAAAACATTCTAAGATTCTAACAAGGAAAATATTGAGAAAGTAAAGGAAGGATTCCAACAAAAATAATGGCAAACTCTAACCTTCTTTGCCAAAAGACCATAGAATTCTTCATGTCTGCAGCGCCCACTAAACAATCCCTAACACAAGCTGAGGTGCTGAAATTCAGCTCCAGTGTACCAAATATCCTTCCATTTCTTTCTTCTATGCAAATCTTCCTAGCCTTCCCCCACAAACCCAACCAAGAATTTCTTCCCATCGATCTTGAGCTTTCCCATCTCTTAGATTTTCTCAAATTTTGAGAGAAAAGTTATCAGAAGTTCTTCGGTTGAAGCTTCCTCTGAATTCAGTGTGTCTCTCAGTGGGTGCATTATTTTTGGGTGTTAGTTCACCTTTTATTTTGTTTCTTAATTTATAGTGGATTTTTGTTAGCTTGATTAAGGTCAATGGTCTCAGTTAGGTTGGGCAGTTTAGATGATTTTTCTCAGTTTTTTTATTCGCATGCATTGATAGATTTCTGTGAGGAGTTTCGCTTTGGTGAATTCATCGTGTTCTTCAAAATATTGCTTGCTTAATTTTTTAGTTTGAGTTCACCCACAGTTGTCTTTTCTTATTTTCTTTAGGCTGTTTCAATTTTTTCTTTAGATCTAATACTTTATCGCAAAAAGATT
TGTCTATCTTATTTTTTTATTTTGTGTCTCAAGCATTAGTCTCTTTTTATCATTTCATTGAAAAATTGTTTCGTAGATTTCAATTTTACGAATATATTCATAAAATACTGTTGTCAACGGTTGTTTTTCTAAAGTTAAAAACTAATAAACCATCATGGATTGACCTAAGGGTAAAAAGGGAGATGTAGTCTCAAATAGTTCATGGTTCAATCTACGGTGATCACCTACCATAGGCTTAGGTGGGTTGCCCCTTTTAGCAAAAAGAAAAAAAAAGGTAGAAAAGTAATAAATTATATTAAAGTTATTCTTACTTCCAAACTAAGTTAAATATTTTGTTATTATTATTCATATTCATATCATGTTTTTCTTTTTATTATTGAAAGATATTGGAGATATCTATTAATTCTTATTATCGATGTTGTGGCCTTTAATTTGTAGATATATTGATATATTGATGAATATTTCAATCTTTAACCTAGTCGGCACTTTATTTTATTCATAATTCAAGATTTCTTGGTTAGGAAAAAAGAGAAGCAAATAAGGTCAAGTACCATTATATATTAAATTTTGATGAAGTTTCTATCTACTAAATTTGTCCCCAGTACAAAGCATTCTTCACATACTAGTCTGCTGCTCTGTTGTTGGACACCACACCATCAATATCTTAATCTTTTGTGCAGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGGTGATTGATTCAGGGGTTCTCCCTATTCTTGTGAAGATAGTGAAGAAAAAGGTTTGAATTTGTTCTATTTATTTACGACAAATAGTTGACTATTAATCTTTCTTTAGAAGTATGGCGCAAACATGGTTATGAATATGGCGGCATGTCATATTATAGATAATAAGAACATGAATACCTCTATTAAAAGTGTTTTTTTTATTAAATTACAAATCTACCTAAAAGCTTGAGCCAATGGGTGACGACAAATTTAATATAATATCTAACCGTCTCCTCTATTTGTGGGTTTGAAATATGGAGAAAGTTCAATAAGTAGAAATAAATTTTAAATGGGGAGGGAATTCATTGCTGGGGTTTGATTCGTTTGAACACAGAACCTCCTTGACTACCTCCTTGATCACCAGCTCTGATTATATCTTAAATCACCCATCTACCCAAAAGCTTATGATGTGTATATGTGTGTTCTTGGAAAGGAAGATACAAGAGACTGGGTTCTTGAAAAATGACTGACTTTTTTATTAATTCTGGAAATATGACAAACTAACAGTATTTATACTAAAAACCTAATAACAGTCAAAGTCAAACCTAATTATAACAGCCAGTTAAAATATAACAAACTAATTACATCAGCTTAAGCTGATAGGTGAAGGGAATTTTAATATAATATGTAACACTTTTAAATGTATTTGATTAATTTTTTATATAACAACAAATTCACACTAGACCTTGGTTTTATTTTATAATTTATGAAGTAATATAAAAAGGGAGGTAATTTCTAACATTTCAAGAAGTTGACGAAACCAATAATATATATATCCAATGATTCAAGTACAGTAAACAAAGATTTTCAAATCCAAATAAAAAAATTGTGAATTGTTTTCCGCTCTTGAGAGTATTCGTAACTTTTAGTATTAGTTTTTTTTGAAATGGAGACAAGTCTCCCAAGAATCACAAAAGAAGATGGCTCAAGAAACGTTTCCAACCGGTTGAACAATTATTGTTTTTCAGCATTTTTGTGGTTTTGATTCTGCGAGGCTGGTTGTTGTCTGTGTTCCCTCTTCAATCATCTCGAAGCTTATTGTGGGCCAGTTGTCAGTTTTCTTTGGGTTCCTTTTGACAATCTGTTTTAATTACTTCATACCTAGTTTACATTATAGATTCCATTTGTTGTTTTGGGTTAGAGGATTTTGATTAGCTCGCTGTTATTGCTGAGTTTGATTTGGTTATTTTTTATATCTGTCTTAGTTTGTTTGCTCTCTGTATAATTTTAATATT
AGTCTCTTTTCATTATGTGAATTTTCTATATTATTGAATAAATTAGAGTATTTGAGAGAACATAATGACCTTTTACATAGAGGAATAAATAGACCTCAAAGAAACTACAAAAGGAAAATACATAAATGGAAAATACCAAAAAGGATAATAATAATTGAAAATAATAATAAGCCAAATCCTAATTTCCATTTAACACATAATATCAGTGAAGAGCTTTGTTTCCTTTTTTCTAAAAAAAGGAATAGAAAATTTTTTGTTAAACAATTCAACATCTTGAAAAAGTTGAAGTTTGAATTTGATTATGTTCTTACGATTCTATGCTTAGAAAATAAGTGAATTTTTTGCGTAAAGTTATTATCTCTCTTGCATTTTGATTTTGGAATAAGATACAAATTTGAACTTTTTCAAGATGTAAGAGAGGGGAGCGACATTTTCTTAAGAGAATCTTATGTTTTTCGTGCAACTTTTTCAACAAGATTCTATGATTCAACAAATTTCAACTTTTTCAAGATGTCCGGAAAATAAATTTCAAATCTTATGTATTTTACAATTTTTTTTTGCAATTTCCTGATTTTCTATTTTTGCAGTCTGATTTACCAGTGCGAGAGAGAATATTTCTTCTTCTAGATGCCACACAGACAGCTCTTGGCGGTGCTTCTGGAAAGTTCCCTCAGTATTATTCAGCATATTATGATTTGGTGGTAGGACATTGCCCCCATATTTGCTATATATTTCTTTGATTAGAAGTTGTATATTGAAGCAAATGGATATAATCATGTTAAGAGAATTAAATCCTTGTTGTTCAACACTATTTATAGTTTGAAGGAGTGGAAGAATAAAATCATCAGTAAAGTAGATCATAATAATATGTTTTCATGGTTCTAATGAGTTGAATGGCGTTGACTTTGAAGAAATGAGTTCAAGGCATGATGGGCACCATGCATAGGATATAGTAGTATATGAATTACTTGGCAACCAAACGTAATAAGGCTAGATGGCTGCCTTGTGAAAATAGTCAAGGTGTGGACAAGCTGATTAGGACACCCACAGATATCAAAAAGATAAAAGAAAAGTTTGATGTACATTGTTAGTTTCATAAAAAATTTATCATTGATATTGGTTGATGCTTGCTAAATAATAATAGTTCTTATAGGTTCAGAATGAAAGATATTTTGATTGTGTCTAATCTTCAACTTTGTAGCGTTGGGATTATATTTTTTGGGTTCAGTGAAAATGAAGAAAGAGAGGGGAAAGTATTAGATTTGATTCCCACACATTATTGTTTATGTGATTCCTTTTGTTATGTATATTCCCATTCTTCGTTTCTTGTCAGGAAAAGAAAAGAGAGAAGAAACATTGCTGCTTAGAAATTCCCCGAAGAGTTCCATTGAGAAAACCTTGGTACTTCATAAAAGAAAAGATTAGGAACTCCGGACATTGTTTTCTGTTTGTTGTTGTTTACTTGGTTTTGGTTTTTGGATTTTGGATTTTGGAGCACTAATCTCTCTTCATTTCCCTAATGAAACGTTTGGTCATTTTCGAAGAAAGGAACTTGAACATATGTTCCATCTTTTATGATTACATTCAGTAACTGTAGAATCTTTGAAGTCGGAATACTCCAAATGAATTTTTCATTTGTTTCTCAGAGTGCCGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTTCATCAAATAGTCCTACCCAGCAGCAAATTAATAATACTTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGAATGTTGCTAGAGTGGAACCTCAGATATTATCAGAATCTAGGTATATCTTGTTGAGCTTATTCTTCTGCACTTTTCCGTCTTGTTATCCACTTGTTTTTCCCTTTTTATTTTTGACATACAGAACTCTAAGTTGTTAAAAATCCTAGAAATTTGTCATGAACTATTTATGAAACGAAAAACACATTACTCCCATTATAAGTGTTGATGTTTGTGCAGCTAATGGTTTGCTAAAAGGAACT
ATTTATGGAATTCTAGTATTATTGATTTATTCCATCATATTTGATTTTTTTTGGTTTTATTTGTTATTATTTTCTAGGATTCTTTCCTTTTATGTTTATATTTTTCTTATTTCATTGAGGCTGTATTCTATATCTAATTTAGGATTTTGTTAATAAGAATAAGAAAGTGAGAAATATTCACATGGTATCAAAGCAACAAACAGAAAAACCCTAACCTTAATTATTGTGCCGCCACTGACCTCTTCAACTCTTGCCGCTGCCGTCATTGATCTACGCTAGTGCTTGCCGTTGCTAATTTCGGATCTGGCCGCCGAAGCCTTTTATTTCTCAGATCTGGTCGTTAAAAGTTTTTTCAAGATCTGGTAAAAAATCTGGCGCTCAACCTTTTTTTTTCTGATCTGGTAGTTGCTGGTTTCCAATTTTTTTTTGAAATGGAGACAAGCCTCTTTAGTAATATTAATAATAATAAGAGAGACTAACGCTCAAAGTACAAGAGAGTTATATAGAGAACAAAAGGGCTAAAACAGATACAAACAATAGCTAAATCAAACTCAACAATAATAACGAGCTAAACAAAACCCTCTATCTGGAAGCACAATTGAAAACAGAAAAGTAAACTAAGTACAAAGTAATTTTTTTTGAAAAGGAGACAAGCTTCTTTATTATTAATAAATTCAAAGTACAAGAGAGCTATACAATGAGAATAATAGGAAAGCCAAGAAATGGGGAGAGAGAGAGAGAGAGAGGATCAGTAGGTGCACTCGGACATCTCAATTAGGTTGACACTCCTATAGCACCCTCATCATATCCAAAATACAAAGAACAAGAACAATACTAAGGTCATGAAAAGACCAAAGTAACAATCAAAACAATAATAGAAACTACCTACGGTAGACAAAAACAAGGCTGAAAATAAAAACATAACGGCAGAAATACACGTTAAAGCCCATCCAAACTACAGAAGCACTAATTCTGAACTGGCAATCGGGGGAAACTACATTGAATCTCATTAGATGAAGGCGGTTCGACTGAGGCAAATGTCCTGTATGGAGTGAACTTTGAATTCTGCATTTGAAGAACACCAAGCTGCTGCGTTTCTTTTAGATATATCTACGATGTCAACCTAATTCCTTTTTTTTTATCAAGGAAGATGTGTTGGTTACATTCGAACCAAATCTCAACCAGAAGCGCTTTTGTCAAATTAACCCAAATTAGAAAATGTTTTTTTTAATAAATAAGGCCCCCTCAATAATTGGAGCACATTGGCGCTAAACGAACCATCAAAGACCCAAACTGATTTGAGTATATAAAATATGCTAAACCAACAAGTAGATGAGAAGGGACAGAAAATGAATAAGTGTATTAAGAGTTCACTGTTTTCCAAGCAAAGGAGGCACACTGAAGGCAGCAAACAGCAAACAGCTTCCTTTGCAACTTCTAGGAACAGTTTAGAGGACCAACTGCCATAATCCAAACCAAGACATTTATTCTCCTTGGACTGCTGGTTTTCCAAATTGCTTAAAGTAATTTTTTTCCAAAGGTGAAGAGGAAGAAAGGGGCTTTGAAAGGGACTTAACTGAAAAATGGCCTAAGGATTCTAATGACCAAACTCTCCTATCATCCAAGTCCGTTAGATCTTTCTGAGATATGAGTGTTAAAAGACCTTGGAAATCTTGAATTTCTTCATCTTTGAAATACTAATTGTAATCTAGAATTCAGAGAAGTATATTTCTTATTCATCTTAAGTCAAAGGAATGTTACATATTTATATAGAGAATAAACTAAACCCTAGAGACTATGTACAATTACAATAAAGGGCATATGATATAAATATAAATATATATATATATCATAACACCCCGCCTCAAGCTGAAGCAAATATGTCGATCATGCCCAGCTTGTTGCACAGATAGCTTATCCTTGCTCCATTTAAAGCTTTAGTTAGAATATAGTTCTACTAGTTGTATATGTTTATGATATTGTTATTACTGGAACTATTT
CTTTTAGGAGTTTGATTTCTTGCTTGCTAGCAGAAAGATGTTCGTTGGATGATTGGAGAGTTCTTCCATCCCTTTCAAAGAGAAAGGGCACCTTCTTTGCTTTGCAAGGGTTTGTGCTTTATTGTGGGATTTGTGGGGCGGAAGAAATAGTTGAAGTGTTTTAGGGTATGGAAAGGGACCTTAATGGGGTTTGGTCTTTGGTGAAGTTTCATGTTTCTTTGTGGGCTTTGGTTTCAAAGGCTTTTTGTAACTACACTCATGAGGCTGTTTTTTACCTAATTGGAAACTCTTTGTTTAGTTGGGTTTTTATGGGCTTGTATTTTTGTATGTCGTATTTTCTTTTCTTTCCAATGAAAGCAATTGGTTCTCTTAAAAATAGAAAAAGAAAAAAATGAAAAAAACCTTCGAATAAATCCACTTATTGCTCATCACATTTGCCCTTTTTCATGTTATTGTGCTTTCGTAGCCTCTTTTCTCACAAGTTAAATTGTTCTCATTTTAATGTTTTTTATGTAGTATAATTGAAAAGGCTGGCAATGCATTAGAAGTTCTGAAAGAAGTTCTTGATGCTGTTGATCCTCGACATCCTGAGGTATATTCTCTTGCCAGTTTTGACTCTCTCCCTCTCTCTTTATATATACCTATATTTCTCTCTTTATAAATATATGTTGGCTGTAAATATGTTAATTAGATGTAATCACCAATGGCTTATTTCATATTCTATTTCTTTCTTTTTGCATATCAATTCTTTTTTTCTTGGAAGAAAGTTGTGCAACTTACTGCTGAACTACTGTAATGAAAATATCCCTTTCATTAAAATTAAGATGGCACTTGATCATGTAAGGTTTAACTCTATACAGCCCTAAAAAATTGTTTTCACCACTTCTAGGGAAACCCTAATCACCAAGTTGTGGAAATTTTAGTTGCTTATCTTAACTAGTGCAATCTACTTCTAATCAAACTCACAAGTGAAGGTGTTCGTATTCAAGTTGTCATTGGTTGGAATGTGTTCACAAGAGAAGAGACATTTGAATGTCTTCATGATCGTTGTTGCATAAGCCTTTGCCTTCGCCTCCATTTTTGTGTGTTGCTTCTGCCGCTACACCATCACAACTTTTGGTGGTTGATGTGTTTGTGTTACCATCTGTCTGTCGTCGTGTGTTGTTCCCATTTGAGATAATTTGGGTTGATGCTATCATTTATGGGCTTCATTTTCGTGCTCAATTGTCGACCATCGTGACCGTCCAATGAAGGGTTTTTGGTTGTTGACCATAAGTGATGCTCAAACTTCATGCCTCCATTAGGTTTAAATTTTTTTTCATTTGATTTGGTTGAATTTGGTTAAGTTTAGTTCATAGTATAGTTGGTTTCTTTCTATTTTGTTCTCGTCTGTCTGCTATGTCCGGTTTGATTTGGGTTTTTGTCACAATTTGTGTGTGTGTATATATTTATCGAAACAAGCACTTTCACTGAGAAAAAGAATGAAAGAATACAACCAAGGCATGCAAAAGGCCAAACCCACAAAAGATAGAAGCCCTTGTACTAGAATGGATTCTGACTATGGAAAAGACTGCCTACGAAGTAGTTAAAATGAATTTTGAAATTGAAACTCACAAAGAAACAAGATAACGAAAAACAGACAATCTCTCACTAAGGTCCCTTTCCCTCCCTCTAAGAGTCCTATTATTTCGCTCACCCCACAAAGTCCATAAGATCACACACACCTGTAAAGCCATAAAAAGCGACTTTTCTCCCTCTGAGGTGGATTGAGGAGGAACTTCTCAATCATACTCATAGTAACCCTCTAACAAGCCACATAGAAGCCGAGAACTGCTCTTAGTAGAATTGGTATACTCACACTGCCAAAGGATGTGATCTAAATCTTCCCCGGTCTTCCAACAAAGAAACAATAGAACAACCTAACAAATGAAGGTGACTTCCTCACCAACTAATCCATTGTACTAGCACGACCATGAAAAACTTGCCCAGTAAAGAACCTC
GCTTTTTTAGAAATCTTAATGCTCTAGAGAACCAAGAAGACTGAGACACCTGTGGGAGAATGATCAACCAGACATTGAAAAAAAAATTGCACGAGAACTCCTCTGAAGGCTTGGTTTTCCAGACTCTCACATGCCTTCTCCCAAACCTAAAAAAGTGACTTTCGAGTAAGACAAGAAGAAAGGCCACATCCAACATTTCACTATTGAAAAGTGAAGGATGAAGATCAAACATAAAGAAGCACGAGCTTCTAGACCCAACAATATTATAGTTTATAAATTATAATATTGGGATGAGGTCTTTCCCCTACCAAATGACCTTTTAAAAAATATGTGCCCTTTCCCTCCCTCACCACGCATTGGTCTAAGTGGACTATTGAAGGGAGCTTTTTTGAAATATCCTTCCACAAGTCCCGTGCCTACCTCTTATTCTCTTAAGACAATTGGTATGAAAAATATTTACTTTAATCTCAAAAGAGTAAATTCTTCAAGTCTTAAAATTGCTCTTAACCCATTTACCATCTAATAGTTTTTGTTTTCCCTATGCACAAGTAGGTCATTTTCCTGGAGCATTGTTGAGTTGCAATTCTTCTTTGTGAATTATCAATTCCAAAGAGCAGTTGGCTTATTTCCACCCACTAACCCCCCTCCCCCCCCAACTATCATTATATATTCTAGGAGAGGGATAAGGATGTTGGGAAGGGGACCACGAGCAAGTTAGTTGGACATGAGAACGTTGTTAGTTCATTTGTGGGAATTATAATTGCCAGTTGTAGCATGTCTCTTTTCTCATGCTTGTATATTGAATAAAATATTAGCTCTGTGCCGGAGAGTACTCCATGCTGTTGAAGATGAAGATTCCCTGAAAGGTTAAGTTTTTTATATTCTAGTCACGTGCAAAAGAGTACACCTTGGATTGGGTCTTTTATTGTATTCTTTGTAGGAAGTTGAGGGAGGATCTTGACAACCAATGTGTTGATTTTATGGAATTAATCTCTATTTTCATTGATTATTTATTTGAGGTACAAGCCTATATATAACCATAGAGAAATACACTTAAGGAAATAATATCTCCAGAATAATATCACCTAAGAATAATCTAATTGTATTAATACCCTCCCTCAAACTCAAGGTTGAAATCACAAACTTGAGTTTGCTAAGAACTAAGAAAAGAACCAATAGATCTAGAATAGAATAAAAAACCTGAAGAACAAGCACACAAAGAGCCTTTAGGCTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAACAAGAACTCTGGGGGCAAAACGAAAGAGCAAAACTGAAGCAAAGTAGAATCAAACCAGGACGAAAACGGGGTCTGAATTGGGGAAGAGCGTTTCTTCGGATCAGAATGGAACTGAACGAACTAAACTAGTGTAAGTCAGAATCAGACTAAACAGGAAATCTGGGGTGAACTGAACGAGGCGTTGAGGCTTAGCGGATCTGAACAAAAACTGAATCGGGGCGTCTTTGGATCTGAGCTTCGATCTAAACTTCGAAGGCGACACGAAGGATGGGTCTGAAGAAATCCGACAGATTTGGAACAGATTGGAGGCGGACACAAGAATCCTGAGGGCATGGGGTTTCGTGGGTTTTGTCGAGATAGCGTATGGAGATGAAGCGACGGCAACGAAACGCAGCAGCGAGCGGAGCTTCTCGAACGACGAACAAAACTGAAACTCTTCGGTTGGGCATTCGGCTTCAACTGACAGAGGTTCAATCGGCTGCTTGGCTGATGGAAACAAGACGTCAGATGGAGAACTATGGAGGAGATGGTCGGGAAGCAAGGGCGGTGGACGCTAGTGAAAACGATTGGGGT
TTCTGGGGTTTGGATGGTGGCAGATTTGAGCAGCAATGAGCGACAATGAGCAGCAAAGGGAAGGTGCAATGACGGTGGCGGATTTGCAGATCTAAGGGTGGCGGCGGCTAATGGGGGATGATTAGGTAGATCGGACGGGATCTGGAGGCTAGAATTCAATGAGGCGAAAACCCTAGAGCTCTGATACTATGTTGATTTTATGGAATTAATTTTTATTTTCATTGATTATTTAGCAGAGGTACAAGCCTATATATACCCTAGAGAAATACACTCAAGGAAATAATATCAAATAATATCTCCAGAATATGATCACTTAAGAATTAATCTAATTATATTAATACAATGCACTGGCCACCTTTCAATCACTTATGAACCAGGTATTTTGCCTATTCCTAAGAGGCTTTGTACTTGTACTTTTTTATGATAGACTGGTCTATAGTTGTGATGTGAACGAGCACGAAAAACATTTGGGGATGGTGTTTGTTGTACTTAGAGATAATCATCTATTTGCAAACTGAAAAAATGTGTTATAGCCTACTCTAGAATCCAATATTTAGGCCACTAGATCTCAAGTAAGGGGGTGGAAGCTGATAAGGATAAAATACACTTAATAATGAATTGGCCTCAACCGAAAGATGTTATCGGATTGATGGGGTTTTTGGGATTGACAGGATACTATAGAAGGTTTGTGAGAGGATATGGTGAAATCTCTACCCCTTTGACAAAACTTCTACAGATTGGTCCAGTCTTTTTATGACTGGGTATTTTAAATGGAATGAGGAAGCTACATAGGCTTTTGAAAAGTTGAAAATAGCCATGACAACCATCCTTGTTTTAGCTTTACCAGATTGGTCCCTTCTTTTTATGACTGGAACAGATGCTTCTGGAATAGGGTTAGGGGCAGTTCTATCACAAAATGGACATCCCATTGCGTTCTTCAGTCAGAAACTAGCTCCAAGAGCCCAAGTCAAATCCATCTTTGAGAGAGAAGTGATGGATGTTGTGCTTTCAGTCCAAAAACGGAGATACTGTCTCTTGGGGAGGAAATTCACTATAATTTCATACCAGAAAGCTCTTAAATGTCTCTTAAATCAAAGAGAAGTTCAACCCCCAATTTCAAAAGTGGCTTACAAAACTGTTGGGGTATGATTTTGAGATCTTGTATCAGCTGAGACTGCAAAATAAGGCTGCTCATGCCCACTCTAGAATAGAACCACCTCTCGAACTGAATGTTATGACAACCCCGGGAATTGTTGACTTAGAACTGAATGTTACCCAAATCCATAATCACAGACTGGGATAAAATATTCCTCGGTAATTTCTAGAAGGAATTGTTCTCTACCATGGGCACGTTACTTAGGAGAAGCACAACTTTCTAATCGCAAAAAGATGAGCAAACCGAAAGAGTAAACAGATGCCTAGAGACCTATTTGAGGTGTTTTTGCAATGAGCAATCAAGCAGATGGCATAAATTCCTCCCATGGGCTGAGCTGTGGTATAACACAACCTTTCATGCATCCACGAAGGCTACCCCTTTCCAACTTGTGTTTGGTAGACCCCACCACCCTTAATATCATATGGGGATAAGAAATCCTCCAACAATGAAGTGGAAGTGATGATGAAGGAAAGAGACTTAGCCATAAATGCTCTCAAGGGGAATTTTATCGTGGCTCAATACCAGATGAAGAAAATGGCTAGATTTAAAGAGAAGGGAGCTGAAGTTTAAAGTAGGAGAAGAAGTTTACCTTAAACTGAGACCCTATAGACAGCGGTCACTAGCTAGAAAAAGATGTGAAAAACTTGCTCCTAAGTTTTATGGACCTTATAAATTAATTGAGGAAATTGGGGACATGGCATACCGATTACAACTACCACCAGAAGGAGCAATCCACAATGTCTTCCATGTGTCTCAATTGAAGCTCAAATTGGGAAAGCAACAAGTGCAGCACCAGCCCAAGACTGTTATGGGAATTCGTTGGAGTAAAGAGTTAGGAGCAAAT
GAACGGCTAATTAAATGGAAGGTTTTACATGGGAATCAATATATCAGATGAATCAGCAGTTCCCTACATTTCACCTTGAGGACAAGGTGATTTGAAACACAAGGGTGTTGTAAGGCCACCTATAATCCATACGTATAAAAGAAAGGGCAGAAAGGGAATTATCCAGCAAGCCAATGATGAGGGAATGATTGCAGAAAAGATTGCGAGTAATGGGCCCACCGGTAGGAGAGAGAGTGTCCTATAAAAAAATCTTTATGGGCAATGTGTAGGGAGGGTTTCTTTTTGTGTAAAAGCTGCAGAAAGCTTTAGGAGACGAATTTCCCAGCCTCCTTGTAAAGTTGCTGGGTAAGCTATTGTAATTTTCCTTTATTTTCGTTGTTGAACTCTGTGTGATCTTATCTTCGGCTGAAAGTAATATAAATAACAGAGTGCGGCCTCTGTTTTAGACATTTTGTTAGGATTGATTGGTATTTTTGAATTAGAATCCTAACAGTGGCAGTTTAGTTGTTTGCAAGGGTCGTCAAATAAGAGAAGCATGTCATTTTAACTGCTACATTGCTATTTTCCAGGGGGCAAGAGATGAGTTTACTCTTGATCTTGTAGAACAGTGTTCGTTTCAGAAGCAGAAACTAATGCATCTTGTGCTGTCTTCTCGGTATGGTTCCATCTTCGCCTGTTTCTTCCACAATGCATCAGTAATTGATTCCATTCAATGAATGGATGTTTTTCCTACTCAATGCTTTGTGAAACGATAACGTACTGTGTTAAGTAAATGAAAAGGAGACATGAAGCTACAAAATATTGTTGATATCATAATGGACAAATTTTGTAAATGAAAAAATGTGCAGTAATGTTTTTGTTTTCGTAACAACTTTGCGTGTAACTGAAAGAATCTCTTATATTGCTCACCCTGTTTTAACAATTGTAGATAGCTTCAAGGATGCTGAATTTGACACATCTATAGATACCTCATAAATTAGTACATTATGAGAAATAAGCTTAATACAACGTTGCTTCCTTTTTCCAATTTACTGACCTTGAACTCTGCTATGTGACTGTTGTTTCGATTTTTCTTTTCTATTTCTATACTGTGGTTGTAACTCTTGTTAGCCTACCATAGATATCCTTTTTTAATATTGCATGAGATAGTCATTGTGTATTCTTTTTCACTTCTTAGTAAAACACGGTTATCATTATAATTTATTTAATTGAAACATAATTGCTGTCACAAGAAAATTTAAAAATTGAAACATACTTACAATAACTTGTAATTCTATTGGTCTATCATTTTGATTTCCAGTTGCTTTCACTGTGTTTTAGGGACGAGAAGATTGTCTGCGGTGCCATTGAATTGAATGAGAAGCTCCAAAAGGTCCTTGCAAGACATGATGCCCTCCTCTCTGGTCAGTTTATGTCGACTCAAAATCAGTTCAACGGCGAAGAAGTTGGTATGTCCAGATTGCCTGCTAATCATTATAACCATGACGAAGGAGAAGATGAAGAAGAGGCCGATCAACTTTTCCGAAGGTAATACACTACATTCGATGTTGATTTCAGTATGGAGTGGTGAAAATATAAGCAAACTAAGTAGTGAGTGATATGGTCAATAACAGGTTGCGAAAAGGAAAGGCTTGTGTAAGGCCTGAAGACGAAGAGGATTCTTCAGAGGAGCGGCCGTCGTTGGGTTTGCTAGGATTGTCAATTCCAGTTGAACGAGCAAACCGTCCAATCATTCGACCTATTGACGAGAAGGTGTCAACGACATTGGAAATACAGCATGGTCAGGGTGTTTCAATACCACCACCACCAGTAAAGCATGCAGAAAGGGAGAAGTTCTTCAAGGATAAAAAAATAGATGTTGGAGTTGGACATATGAGAGGGCTTTCTTTACACAGTCGTAATGCTAGCAGCTCTCGCAGTGGAAGCATAGATTTCAACGAGTCATGAAGGAGTGGTGTGAAGAAGTGTTTGGGAGTGCAATTTTATTTATTTTCACTTCA
ACTCATTATTTGTAAAGTTGAGTCATTTTTCTTTCTTTCTTTTCAATGAGTTTGATATCTTTAAATTAAGGTGTAAGGGCCAGGGACTTACTGAATATATCCTTTATTCCATTCGTGTATAATTTTGTAATTTAAAGGTGGTTAACTATTTCAAGTTTGCTGAAGGTGAAAAAAGCCAAAGAAGGGCAATCGAAAGACCTTTATTTTTTCTTTTTTTATTATAATGCCC

mRNA sequence

AAACGCAACAACGCCCATTCTCTCACCAAAGGACAGAGCGACACACGCATACACTTTTAGATAGAGAGAGAGAGATTCTCATTTCTTTCTCTTCTGTCGTTCAACTCCAAATCTTCATCCCCATTATTTGCTTCTTTATTATCCTTTCAATTCCTCCTCTTAATCATTCATCATTACGCCTACCCACAGAGTTTCCAGCCATTTTTCAACCCTTTTTGCCCCAACCCCATTTCTTCTTTTTTCCTCAAATGCTCATCCTCTTCCTCTTCTAGGGTTTTTCACTTCTCTTCTTTTGTCTATTCTGAGTGGCTTATGGTTCCCACATCGGAAGGCTTTCTTGATTTCTATCTCTGCTTCTATTGGTGCAGTATGCCTATAAAGTCCAAGATATTGTAGAGCATTTTGGGGTTTTGTTTACCTTGACGATTTGTTACTCCCTTGACAATTCAAATATATAGGAGGTTGCTGATCTCGGGGTACTATAAAGATGGCTGCTGAACTAGTCAACTCTGCTACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATCTGTGAATTAGTTGCTCATGATCAAAGGCAAGCAAAAGAGGTCATAAAAGCGATTAAAAAACGACTAGGAAATAAAAATGCAAATGCACAACTTTATGCAGTTTTGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGTCTGATTTACCAGTGCGAGAGAGAATATTTCTTCTTCTAGATGCCACACAGACAGCTCTTGGCGGTGCTTCTGGAAAGTTCCCTCAGTATTATTCAGCATATTATGATTTGGTGAGTGCCGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTTCATCAAATAGTCCTACCCAGCAGCAAATTAATAATACTTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGAATGTTGCTAGAGTGGAACCTCAGATATTATCAGAATCTAGTATAATTGAAAAGGCTGGCAATGCATTAGAAGTTCTGAAAGAAGTTCTTGATGCTGTTGATCCTCGACATCCTGAGGGGGCAAGAGATGAGTTTACTCTTGATCTTGTAGAACAGTGTTCGTTTCAGAAGCAGAAACTAATGCATCTTGTGCTGTCTTCTCGGGACGAGAAGATTGTCTGCGGTGCCATTGAATTGAATGAGAAGCTCCAAAAGGTCCTTGCAAGACATGATGCCCTCCTCTCTGGTCAGTTTATGTCGACTCAAAATCAGTTCAACGGCGAAGAAGTTGGTATGTCCAGATTGCCTGCTAATCATTATAACCATGACGAAGGAGAAGATGAAGAAGAGGCCGATCAACTTTTCCGAAGGTTGCGAAAAGGAAAGGCTTGTGTAAGGCCTGAAGACGAAGAGGATTCTTCAGAGGAGCGGCCGTCGTTGGGTTTGCTAGGATTGTCAATTCCAGTTGAACGAGCAAACCGTCCAATCATTCGACCTATTGACGAGAAGGTGTCAACGACATTGGAAATACAGCATGGTCAGGGTGTTTCAATACCACCACCACCAGTAAAGCATGCAGAAAGGGAGAAGTTCTTCAAGGATAAAAAAATAGATGTTGGAGTTGGACATATGAGAGGGCTTTCTTTACACAGTCGTAATGCTAGCAGCTCTCGCAGTGGAAGCATAGATTTCAACGAGTCATGAAGGAGTGGTGTGAAGAAGTGTTTGGGAGTGCAATTTTATTTATTTTCACTTCAACTCATTATTTGTAAAGTTGAGTCATTTTTCTTTCTTTCTTTTCAATGAGTTTGATATCTTTAAATTAAGGTGTAAGGGCCAGGGACTTACTGAATATATCCTTTATTCCATTCGTGTATAATTTTGTAATTTAAAGGTGGTTAACTATTTCAAGTTTGCTGAAGGTGAAAAAAGCCAAAGAAGGGCAATCGAAAGACCTTTATTTTTTCTTTTTTTATTATAATGCCC

Coding sequence (CDS)

ATGGCTGCTGAACTAGTCAACTCTGCTACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATCTGTGAATTAGTTGCTCATGATCAAAGGCAAGCAAAAGAGGTCATAAAAGCGATTAAAAAACGACTAGGAAATAAAAATGCAAATGCACAACTTTATGCAGTTTTGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGTCTGATTTACCAGTGCGAGAGAGAATATTTCTTCTTCTAGATGCCACACAGACAGCTCTTGGCGGTGCTTCTGGAAAGTTCCCTCAGTATTATTCAGCATATTATGATTTGGTGAGTGCCGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTTCATCAAATAGTCCTACCCAGCAGCAAATTAATAATACTTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGAATGTTGCTAGAGTGGAACCTCAGATATTATCAGAATCTAGTATAATTGAAAAGGCTGGCAATGCATTAGAAGTTCTGAAAGAAGTTCTTGATGCTGTTGATCCTCGACATCCTGAGGGGGCAAGAGATGAGTTTACTCTTGATCTTGTAGAACAGTGTTCGTTTCAGAAGCAGAAACTAATGCATCTTGTGCTGTCTTCTCGGGACGAGAAGATTGTCTGCGGTGCCATTGAATTGAATGAGAAGCTCCAAAAGGTCCTTGCAAGACATGATGCCCTCCTCTCTGGTCAGTTTATGTCGACTCAAAATCAGTTCAACGGCGAAGAAGTTGGTATGTCCAGATTGCCTGCTAATCATTATAACCATGACGAAGGAGAAGATGAAGAAGAGGCCGATCAACTTTTCCGAAGGTTGCGAAAAGGAAAGGCTTGTGTAAGGCCTGAAGACGAAGAGGATTCTTCAGAGGAGCGGCCGTCGTTGGGTTTGCTAGGATTGTCAATTCCAGTTGAACGAGCAAACCGTCCAATCATTCGACCTATTGACGAGAAGGTGTCAACGACATTGGAAATACAGCATGGTCAGGGTGTTTCAATACCACCACCACCAGTAAAGCATGCAGAAAGGGAGAAGTTCTTCAAGGATAAAAAAATAGATGTTGGAGTTGGACATATGAGAGGGCTTTCTTTACACAGTCGTAATGCTAGCAGCTCTCGCAGTGGAAGCATAGATTTCAACGAGTCATGA

Protein sequence

MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLLLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES*
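
The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: it assumes the standard code (NCBI translation table 1) applies to this nuclear gene, the helper `translate` is a hypothetical name, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons containing N or other ambiguity codes are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )

first = translate("ATGGCTGCTGAACTAGTCAAC")  # first 7 codons of the CDS
last = translate("GATTTCAACGAGTCATGA")      # last 6 codons of the CDS
print(first, last)  # MAAELVN DFNES*
```

Both fragments agree with the start (MAAELVN...) and end (...DFNES*) of the protein sequence listed above.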
BLAST of Cucsa.017950 vs. Swiss-Prot
Match: TOM1_MOUSE (Target of Myb protein 1 OS=Mus musculus GN=Tom1 PE=1 SV=1)

HSP 1 Score: 74.3 bits (181), Expect = 3.3e-12
Identity = 67/284 (23.59%), Positives = 115/284 (40.49%), Query Frame = 1

Query: 6   VNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKR-LGNKNANAQLYAVLLLEM 65
           +  AT   L   DW  N++IC+++   +   K+  +A+KKR +GNKN +  + A+ +LE 
Sbjct: 17  IEKATDGSLQSEDWALNMEICDIINETEEGPKDAFRAVKKRIMGNKNFHEVMLALTVLET 76

Query: 66  LMNNIGEAIH-------------KQSDLP-------VRERIFLLLDATQTALGGASGKFP 125
            + N G   H              ++ LP       V +++  L+ +   A   +S    
Sbjct: 77  CVKNCGHRFHVLVANQDFVENVLVRTILPKNNPPTIVHDKVLNLIQSWADAF-RSSPDLT 136

Query: 126 QYYSAYYDLVSAGVQFP----------QRPPAVSSNSPTQQQIN----NTSQNGVIR--- 185
              + Y DL   G++FP            P     NS T  + N    NTSQ G +    
Sbjct: 137 GVVAVYEDLRRKGLEFPMTDLDMLSPIHTPQRTVFNSETPSRQNSVSSNTSQRGDLSQHA 196

Query: 186 --------LSEQENVARVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTL 244
                   L     +     QI    S +E     + V+ E+L  + P   E A  E   
Sbjct: 197 TPLPTPAVLPGDSPITPTPEQIGKLRSELEMVSGNVRVMSEMLTELVPTQVEPADLELLQ 256

BLAST of Cucsa.017950 vs. Swiss-Prot
Match: TOM1_HUMAN (Target of Myb protein 1 OS=Homo sapiens GN=TOM1 PE=1 SV=2)

HSP 1 Score: 73.2 bits (178), Expect = 7.4e-12
Identity = 65/284 (22.89%), Positives = 122/284 (42.96%), Query Frame = 1

Query: 6   VNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRL-GNKNANAQLYAVLLLEM 65
           +  AT   L   DW  N++IC+++   +   K+ ++A+KKR+ GNKN +  + A+ +LE 
Sbjct: 17  IEKATDGSLQSEDWALNMEICDIINETEEGPKDALRAVKKRIVGNKNFHEVMLALTVLET 76

Query: 66  LMNNIGEAIH-------------KQSDLP-------VRERIFLLLDATQTALGGASGKFP 125
            + N G   H              ++ LP       V +++  L+ +   A   +S    
Sbjct: 77  CVKNCGHRFHVLVASQDFVESVLVRTILPKNNPPTIVHDKVLNLIQSWADAF-RSSPDLT 136

Query: 126 QYYSAYYDLVSAGVQFPQRPPAVSS--NSPTQQQINNTSQNGVIRL----SEQEN----- 185
              + Y DL   G++FP     + S  ++P +   N+ +Q+G   +    S+QE+     
Sbjct: 137 GVVTIYEDLRRKGLEFPMTDLDMLSPIHTPQRTVFNSETQSGQDSVGTDSSQQEDSGQHA 196

Query: 186 --------------VARVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTL 244
                         +A    QI    S +E     + V+ E+L  + P   E A  E   
Sbjct: 197 APLPAPPILSGDTPIAPTPEQIGKLRSELEMVSGNVRVMSEMLTELVPTQAEPADLELLQ 256

BLAST of Cucsa.017950 vs. Swiss-Prot
Match: TOM1_CHICK (Target of Myb protein 1 OS=Gallus gallus GN=TOM1 PE=2 SV=2)

HSP 1 Score: 71.2 bits (173), Expect = 2.8e-11
Identity = 66/284 (23.24%), Positives = 117/284 (41.20%), Query Frame = 1

Query: 6   VNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRL-GNKNANAQLYAVLLLEM 65
           +  AT   L   DW  N++IC+++   +   K+  +AIKKR+ GNKN +  + A+ +LE 
Sbjct: 17  IERATDGSLRGEDWSLNMEICDIINETEEGPKDAFRAIKKRIVGNKNFHEVMLALTVLET 76

Query: 66  LMNNIGEAIH-------------KQSDLP-------VRERIFLLLDATQTALGGASGKFP 125
            + N G   H              ++ LP       V +++  L+ +   A   +S    
Sbjct: 77  CVKNCGHRFHILVASQDFVESVLVRTILPKNNPPAIVHDKVLTLIQSWADAF-RSSPDLT 136

Query: 126 QYYSAYYDLVSAGVQFPQ------------RPPAVSSNSPTQQQ---INNTSQNGVI--- 185
              + Y DL   G++FP             R    SSNS + Q    +N+  Q   I   
Sbjct: 137 GVVAVYEDLRRKGLEFPMTDLDMLSPIHTPRRSVYSSNSQSGQNSPAVNSPQQMESILHP 196

Query: 186 -------RLSEQENVARVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTL 244
                    S    +   + QI    S +E     ++V+ E+L  + P   E +  E   
Sbjct: 197 VTLPSGRDTSSNVPITPTQEQIKKLRSELEVVNGNVKVMSEMLTELVPSQAETSDLELLQ 256

BLAST of Cucsa.017950 vs. Swiss-Prot
Match: HGS_MOUSE (Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Mus musculus GN=Hgs PE=1 SV=2)

HSP 1 Score: 67.8 bits (164), Expect = 3.1e-10
Identity = 40/135 (29.63%), Positives = 67/135 (49.63%), Query Frame = 1

Query: 5   LVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLLLEM 64
           L++ ATS+ L ETDW   +QIC+L+     QAK  + +IKK++ +KN +  LYA+ ++E 
Sbjct: 11  LLDKATSQLLLETDWESILQICDLIRQGDTQAKYAVNSIKKKVNDKNPHVALYALEVMES 70

Query: 65  LMNNIGEAIH-----------------KQSDLPVRERIFLLLDATQTALGGASGKFPQYY 123
           ++ N G+ +H                 +Q ++ VR +I  L+ A   A      K+    
Sbjct: 71  VVKNCGQTVHDEVANKQTMEELKELLKRQVEVNVRNKILYLIQAWAHAFRN-EPKYKVVQ 130

BLAST of Cucsa.017950 vs. Swiss-Prot
Match: HGS_RAT (Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Rattus norvegicus GN=Hgs PE=1 SV=1)

HSP 1 Score: 67.8 bits (164), Expect = 3.1e-10
Identity = 40/135 (29.63%), Positives = 67/135 (49.63%), Query Frame = 1

Query: 5   LVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLLLEM 64
           L++ ATS+ L ETDW   +QIC+L+     QAK  + +IKK++ +KN +  LYA+ ++E 
Sbjct: 11  LLDKATSQLLLETDWESILQICDLIRQGDTQAKYAVNSIKKKVNDKNPHVALYALEVMES 70

Query: 65  LMNNIGEAIH-----------------KQSDLPVRERIFLLLDATQTALGGASGKFPQYY 123
           ++ N G+ +H                 +Q ++ VR +I  L+ A   A      K+    
Sbjct: 71  VVKNCGQTVHDEVANKQTMEELKELLKRQVEVNVRNKILYLIQAWAHAFRN-EPKYKVVQ 130

BLAST of Cucsa.017950 vs. TrEMBL
Match: W9RQ74_9ROSA (TOM1-like protein 2 OS=Morus notabilis GN=L484_013104 PE=4 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 4.1e-142
Identity = 278/416 (66.83%), Positives = 317/416 (76.20%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVN ATS+KL E DW KNI+ICELVA DQRQAK+VIKAIKKRLG+K+ N QLYAVL
Sbjct: 1   MAAELVNCATSDKLPEMDWTKNIEICELVARDQRQAKDVIKAIKKRLGSKHTNTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQ                 SDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENIHKQVIDTGIIPILVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVAR-VEPQIL 180
           PQYY+AYY+LVSAGVQFPQRPPAVSS+ PT Q  NN   NG +  S  E  AR  EPQ +
Sbjct: 121 PQYYNAYYELVSAGVQFPQRPPAVSSDHPTPQPNNNNLPNGELASSRHEGFARQAEPQDV 180

Query: 181 SESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDE 240
            ESSII+KAGN LEVLKEVLDAVD +HPEGA+DEFTLDLVEQCSFQKQ++MHLV++SRDE
Sbjct: 181 PESSIIQKAGNVLEVLKEVLDAVDSQHPEGAKDEFTLDLVEQCSFQKQRVMHLVMTSRDE 240

Query: 241 KIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEE 300
           K+V  AIELNE+LQKVLARH+ALLS +  ST                NH+N +E E+EEE
Sbjct: 241 KVVSRAIELNEQLQKVLARHEALLSAKPRST---------------VNHFNQEEAEEEEE 300

Query: 301 ADQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEI 360
           A+QLFRRLRKGKAC RPEDEE  + E P LG LG +IP E  NRP+IRP+  + S+    
Sbjct: 301 AEQLFRRLRKGKACARPEDEERPA-EHPHLGFLGSAIPGEMLNRPLIRPLSLEPSSREPN 360

Query: 361 QHGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNE 399
            H   V+IPPPP KH EREK+F++ K D   GH+RGLSLHSRNASSSRS SID ++
Sbjct: 361 GH---VAIPPPPAKHVEREKYFQENKADGLAGHVRGLSLHSRNASSSRSESIDSSD 397

BLAST of Cucsa.017950 vs. TrEMBL
Match: A0A067KK37_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_09342 PE=4 SV=1)

HSP 1 Score: 508.1 bits (1307), Expect = 1.0e-140
Identity = 269/418 (64.35%), Positives = 321/418 (76.79%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVN ATS+KL E DW KNI+ICELV  DQRQA++V+KAIKKRLG+KN NAQLYAV+
Sbjct: 1   MAAELVNLATSDKLPEVDWTKNIEICELVGRDQRQARDVVKAIKKRLGSKNPNAQLYAVM 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE +HK+                 +DLPVRERIFLLLDATQT+LGG+SGKF
Sbjct: 61  LLEMLMNNIGEPVHKEVIDTGVLPILVKIVKKKTDLPVRERIFLLLDATQTSLGGSSGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVA-RVEPQIL 180
           PQYYSAYYDLVSAGVQFPQRPP   +N+PT Q    ++ NG +  S  E VA + E Q++
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPETQTNNPTSQSNKRSTLNGELVASRHEEVAQKAEAQVV 180

Query: 181 SESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDE 240
            ESSII+KA N LEV KEVLDAVD ++PEGA+DEFTLDLVEQCSFQKQ++MHLV++SRDE
Sbjct: 181 PESSIIQKASNVLEVFKEVLDAVDSQNPEGAKDEFTLDLVEQCSFQKQRVMHLVMTSRDE 240

Query: 241 KIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEE 300
           K+V  AIELNE+L KVLAR+DA +SG+ M +      ++   +   ANH+ HDE E+EEE
Sbjct: 241 KVVSRAIELNEQLHKVLARYDAFISGRSMPSDRSAVSDKTTST---ANHFIHDEEEEEEE 300

Query: 301 ADQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEI 360
           A+QLFRRLRKGKAC RP DE  +SEER S+GLLG SI  ER NRP+IRPI         +
Sbjct: 301 AEQLFRRLRKGKACARPVDEV-NSEERLSMGLLGSSIAGERLNRPLIRPIISSDQPHEPL 360

Query: 361 QHGQGVSIPPPPVKHAEREKFFKDKKIDVG--VGHMRGLSLHSRNASSSRSGSIDFNE 399
            +   V+IPPPP KH ERE+FF++KK+D     GHMRGLSLHSRNASSSRSGSIDF++
Sbjct: 361 VNPPPVAIPPPPAKHVERERFFQEKKVDGSGVTGHMRGLSLHSRNASSSRSGSIDFSD 414

BLAST of Cucsa.017950 vs. TrEMBL
Match: A0A061DN59_THECC (ENTH/VHS/GAT family protein isoform 1 OS=Theobroma cacao GN=TCM_003342 PE=4 SV=1)

HSP 1 Score: 504.6 bits (1298), Expect = 1.1e-139
Identity = 275/422 (65.17%), Positives = 320/422 (75.83%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKL E DW KNI+ICELVA DQRQAK+V+KAIKKRLG+KN N QLY+VL
Sbjct: 1   MAAELVNSATSEKLTEMDWTKNIEICELVARDQRQAKDVVKAIKKRLGSKNPNTQLYSVL 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE +HKQ                 SDLP+RERIFLLLDATQT+LGG+SGKF
Sbjct: 61  LLEMLMNNIGENVHKQVIDSGILPILVKIVKKKSDLPIRERIFLLLDATQTSLGGSSGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVAR-VEPQIL 180
           PQYYSAYYDLVSAGVQFPQRP A  SN PT     N + NG +  +  E +A+  EPQI+
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPHATPSNPPTSLPNKNNTLNGELAAARHEAIAQQTEPQIV 180

Query: 181 SESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDE 240
            ESSII+KA NALEVLKEVLDAVDP++P G +DEFTLDLVEQCSFQKQ++MHLV+SS+DE
Sbjct: 181 PESSIIQKASNALEVLKEVLDAVDPQNPLGVKDEFTLDLVEQCSFQKQRVMHLVMSSQDE 240

Query: 241 KIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNH--DEGEDE 300
           K+V  AIELNE+LQKVL RHDALLSG+              +S  P +  NH   E E+E
Sbjct: 241 KVVSRAIELNEQLQKVLVRHDALLSGR------------TSVSSRPTSTINHFDPEEEEE 300

Query: 301 EEADQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIP--VERANRPIIRPIDEKVST 360
           EE +QLFRR+RKGKAC RPEDEE  S ERP LGLLG SIP   ER NRP+IRP+  + S 
Sbjct: 301 EEPEQLFRRIRKGKACARPEDEE-CSRERPHLGLLGSSIPGEKERLNRPLIRPLSLEPSC 360

Query: 361 TLEIQHGQGVSIPPPPVKHAEREKFFKDKKIDVG--VGHMRGLSLHSRNASSSRSGSIDF 399
                +  GV+IPPPP KH ERE++F++KK+D     GHMRG+SLHSRNASSSRSGS+DF
Sbjct: 361 E-NNANPSGVAIPPPPAKHMERERYFQEKKVDGSALAGHMRGMSLHSRNASSSRSGSMDF 408

BLAST of Cucsa.017950 vs. TrEMBL
Match: A0A067F6C8_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g0143901mg PE=4 SV=1)

HSP 1 Score: 499.6 bits (1285), Expect = 3.6e-138
Identity = 269/427 (63.00%), Positives = 316/427 (74.00%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MA ELVNSATSEKLA+ DW KNI+ICELVA DQR AK+VIKAIKKRLG+KN N QLYAV+
Sbjct: 1   MATELVNSATSEKLADVDWTKNIEICELVARDQRHAKDVIKAIKKRLGSKNTNVQLYAVM 60

Query: 61  LLEMLMNNIGEAIHK-----------------QSDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIG+ IHK                 +SDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGDHIHKLVIDTGILPILVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVA---RVEPQ 180
           PQYY+AYY+LVSAGVQFPQRP  + S+ P+       + NG +  S  E V    + EPQ
Sbjct: 121 PQYYTAYYELVSAGVQFPQRPRTIPSSHPSSDANKKVTLNGELASSRNEGVTLAQQPEPQ 180

Query: 181 ILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSR 240
           I+ ESSII+KA NALEVLK+VLDAV  ++PEGA+DEFTLDLVEQCSFQKQ++MHLV++SR
Sbjct: 181 IVPESSIIQKASNALEVLKDVLDAVGTQNPEGAKDEFTLDLVEQCSFQKQRVMHLVMTSR 240

Query: 241 DEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMS-------RLPANHYN 300
           DEK+V  AI+LNE+LQ VLARHD LLS +  ST N  N ++  +S          ANH +
Sbjct: 241 DEKVVSQAIDLNEQLQNVLARHDVLLSERSTSTANHVNHQDGHLSTRSTTTANHSANHAD 300

Query: 301 HDEGEDEEEADQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPID 360
           H E E+EEEA+QL RR+RKGK C RPEDEE   E  P LG+LG SIP  R NRP+IRP+ 
Sbjct: 301 HAEEEEEEEAEQLSRRMRKGKGCARPEDEEHLPERHP-LGILGSSIPAARLNRPLIRPVQ 360

Query: 361 EKVSTTLEIQHGQGVSIPPPPVKHAEREKFFKDKKID--VGVGHMRGLSLHSRNASSSRS 399
            +        H Q V+IPPPP KH EREKFF++KK+D     GHMRGLSLHSRNASSSRS
Sbjct: 361 AEPPHETN-AHPQPVTIPPPPAKHVEREKFFQEKKVDASAAAGHMRGLSLHSRNASSSRS 420

BLAST of Cucsa.017950 vs. TrEMBL
Match: B9IP43_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0019s00820g PE=4 SV=2)

HSP 1 Score: 496.1 bits (1276), Expect = 3.9e-137
Identity = 268/420 (63.81%), Positives = 317/420 (75.48%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELV+SATS+KL E DW KNI+ICELVA D+RQA++V+KAIKKRLG+KNAN QLYAV+
Sbjct: 50  MAAELVSSATSDKLTEVDWTKNIEICELVARDERQARDVVKAIKKRLGSKNANTQLYAVM 109

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE +H+Q                 ++LPVRERIFLLLDATQTALGGASGKF
Sbjct: 110 LLEMLMNNIGEQVHRQVIDTGILPILVKIVKKKTELPVRERIFLLLDATQTALGGASGKF 169

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVAR---VEPQ 180
           PQYYSAYYDLV AGVQFPQRP    SN    Q+    + NG +  +  E  A    VEPQ
Sbjct: 170 PQYYSAYYDLVCAGVQFPQRPRERPSNHQATQESKKNTLNGELAAARHEVGAHPVPVEPQ 229

Query: 181 ILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSR 240
           ++ ESSII+KA NALEVLKEVLDAVD ++PEGA+DEFTLDLVEQCSFQKQ++MHLV++SR
Sbjct: 230 VVPESSIIQKASNALEVLKEVLDAVDSQNPEGAKDEFTLDLVEQCSFQKQRVMHLVMTSR 289

Query: 241 DEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDE 300
           DEK+V  AIELNE+LQKVLARHD+LLSG+   +      +    +   ANH+NH+E E+E
Sbjct: 290 DEKLVSQAIELNEQLQKVLARHDSLLSGRSTVSDTTTISDR---TTTTANHFNHEESEEE 349

Query: 301 EEADQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTL 360
           EE +QLFRRLRKGKAC RPED E +SEER  LGLLG +IP +R NRP+IRP+  +     
Sbjct: 350 EEPEQLFRRLRKGKACARPED-EGNSEERLPLGLLGSTIPGDRLNRPLIRPLPSEQPQDP 409

Query: 361 EIQHGQGVSIPPPPVKHAEREKFFKDKKIDVGV--GHMRGLSLHSRNASSSRSGSIDFNE 399
                  V IPPPP KH ER+KFF++KK D     GHMRGLSLHSRNASSS SGSIDF++
Sbjct: 410 NANCAP-VVIPPPPAKHMERQKFFQEKKADGSAVSGHMRGLSLHSRNASSSCSGSIDFSD 464

BLAST of Cucsa.017950 vs. TAIR10
Match: AT5G63640.1 (AT5G63640.1 ENTH/VHS/GAT family protein)

HSP 1 Score: 434.1 bits (1115), Expect = 9.3e-122
Identity = 255/451 (56.54%), Positives = 308/451 (68.29%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELV+SATSEKLA+ DW KNI+ICEL A D+RQAK+VIKAIKKRLG+KN N QLYAV 
Sbjct: 1   MAAELVSSATSEKLADVDWAKNIEICELAARDERQAKDVIKAIKKRLGSKNPNTQLYAVQ 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQ                 SDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENIHKQVIDTGVLPTLVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARV---EPQ 180
           PQYY+AYY+LV+AGV+F QRP A      T Q +   + N  +  +  E  A     E Q
Sbjct: 121 PQYYTAYYELVNAGVKFTQRPNATPV-VVTAQAVPRNTLNEQLASARNEGPATTQQRESQ 180

Query: 181 ILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSR 240
            +S SSI++KA  ALE+LKEVLDAVD ++PEGA+DEFTLDLVEQCSFQK+++MHLV++SR
Sbjct: 181 SVSPSSILQKASTALEILKEVLDAVDSQNPEGAKDEFTLDLVEQCSFQKERVMHLVMTSR 240

Query: 241 DEKIVCGAIELNEKLQKVLARHDALLSGQFM-----STQN-----------QFNGEE--- 300
           DEK V  AIELNE+LQ++L RH+ LLSG+       +T N             NG++   
Sbjct: 241 DEKAVSKAIELNEQLQRILNRHEDLLSGRITVPSRSTTSNGYHSNLEPVRPISNGDQKRE 300

Query: 301 -------VGMSRLPAN--HYNHDEGEDEEEADQLFRRLRKGKACVRPEDEEDSSEERPSL 360
                     S   +N  H   +E ++EEE +QLFRRLRKGKA  RPEDEE+ S   P  
Sbjct: 301 LKASNANTESSSFISNRAHLKLEEEDEEEEPEQLFRRLRKGKARARPEDEEEPS---PPQ 360

Query: 361 GLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQG--VSIPPPPVKHAEREKFFKDKKID 399
           GL G +I  ER NRP+IRP+  + ++     H Q   V IPPPP KH EREKFFK+ K D
Sbjct: 361 GLPGSAIHNERLNRPLIRPLPSEEASRGGDSHSQSPPVVIPPPPAKHVEREKFFKENKGD 420

BLAST of Cucsa.017950 vs. TAIR10
Match: AT1G21380.1 (AT1G21380.1 Target of Myb protein 1)

HSP 1 Score: 188.7 bits (478), Expect = 6.8e-48
Identity = 114/322 (35.40%), Positives = 174/322 (54.04%), Query Frame = 1

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLL 61
           AA     AT++ L   DW  NI++C+++  +  QAKE +K +KKRLG+KN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDIINMEPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIH-----------------KQSDLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE+++                 K+ DL VRE+I  LLD  Q A GG+ G+FP
Sbjct: 65  LETLSKNCGESVYQLIVDRDILPDMVKIVKKKPDLTVREKILSLLDTWQEAFGGSGGRFP 124

Query: 122 QYYSAYYDLVSAGVQFPQR---------PPAVSSNSPTQQQINNTSQNGVIRLSEQENVA 181
           QYY+AY +L SAG++FP R         PP      P   Q   + ++  I+ S Q + A
Sbjct: 125 QYYNAYNELRSAGIEFPPRTESSVPFFTPP---QTQPIVAQATASDEDAAIQASLQSDDA 184

Query: 182 RVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHL 241
                 LS   I + A  +++VL ++L A+DP HPEG ++E  +DLVEQC   ++++M L
Sbjct: 185 SA----LSMEEI-QSAQGSVDVLTDMLGALDPSHPEGLKEELIVDLVEQCRTYQRRVMAL 244

Query: 242 VLSSRDEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHD 296
           V ++ DE+++C  + LN+ LQ+VL  HD    G  +             + +P    NHD
Sbjct: 245 VNTTSDEELMCQGLALNDNLQRVLQHHDDKAKGNSVPA--------TAPTPIPLVSINHD 304

BLAST of Cucsa.017950 vs. TAIR10
Match: AT3G08790.1 (AT3G08790.1 ENTH/VHS/GAT family protein)

HSP 1 Score: 187.2 bits (474), Expect = 2.0e-47
Identity = 122/396 (30.81%), Positives = 200/396 (50.51%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           M   LV+ ATS+ L   DW  N++IC+++ H+  Q +EV+  IKKRL ++ +  QL A+ 
Sbjct: 1   MVHPLVDRATSDMLIGPDWAMNLEICDMLNHEPGQTREVVSGIKKRLTSRTSKVQLLALT 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N GE IH Q                  ++ V+E+I +L+D  Q +  G  G+ 
Sbjct: 61  LLETIITNCGELIHMQVAEKDILHKMVKMAKRKPNIQVKEKILILIDTWQESFSGPQGRH 120

Query: 121 PQYYSAYYDLVSAGVQFPQRP---PAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQ 180
           PQYY+AY +L+ AG+ FPQRP   P+   N P+ +   N S+N      +    +     
Sbjct: 121 PQYYAAYQELLRAGIVFPQRPQITPSSGQNGPSTRYPQN-SRNARQEAIDTSTESEFPTL 180

Query: 181 ILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSR 240
            L+E   I+ A   ++VL E+++A+D  + EG + E  +DLV QC   KQ+++HLV S+ 
Sbjct: 181 SLTE---IQNARGIMDVLAEMMNAIDGNNKEGLKQEVVVDLVSQCRTYKQRVVHLVNSTS 240

Query: 241 DEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDE 300
           DE ++C  + LN+ LQ++LA+H+A+ SG  M  + + + +EV            D G  E
Sbjct: 241 DESMLCQGLALNDDLQRLLAKHEAIASGNSMIKKEEKSKKEVPKDTTQI----IDVGSSE 300

Query: 301 EEADQLFRRLRKG-KACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTT 360
            +   +      G K  +   D+ ++     SL L+ L  P  + + P+ +P D  +   
Sbjct: 301 TKNGSVVAYTTNGPKIDLLSGDDFETPNADNSLALVPLGPP--QPSSPVAKP-DNSIVLI 360

Query: 361 LEIQHGQGVSIPPPPVKHAEREKFFKDKKIDVGVGH 376
             +      S  P    HA  +K  ++     G GH
Sbjct: 361 DMLSDNNCESSTPTSNPHANHQKVQQNYSNGFGPGH 385

BLAST of Cucsa.017950 vs. TAIR10
Match: AT1G76970.1 (AT1G76970.1 Target of Myb protein 1)

HSP 1 Score: 180.6 bits (457), Expect = 1.9e-45
Identity = 126/376 (33.51%), Positives = 190/376 (50.53%), Query Frame = 1

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVLL 61
           AA     AT++ L   DW  NI++C+L+  D  QAKE +K +KKRLG+KN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDLINMDPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIH-----------------KQSDLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE ++                 K+ +L VRE+I  LLD  Q A GG  G++P
Sbjct: 65  LETLSKNCGENVYQLIIDRGLLNDMVKIVKKKPELNVREKILTLLDTWQEAFGGRGGRYP 124

Query: 122 QYYSAYYDLVSAGVQFPQRPPA-VSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 181
           QYY+AY DL SAG++FP R  + +S  +P Q Q     ++  I+ S Q + A      L 
Sbjct: 125 QYYNAYNDLRSAGIEFPPRTESSLSFFTPPQTQ---PDEDAAIQASLQGDDA--SSLSLE 184

Query: 182 ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 241
           E   I+ A  +++VL ++L A DP +PE  ++E  +DLVEQC   ++++M LV ++ DE+
Sbjct: 185 E---IQSAEGSVDVLMDMLGAHDPGNPESLKEEVIVDLVEQCRTYQRRVMTLVNTTTDEE 244

Query: 242 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 301
           ++C  + LN+ LQ VL RHD + +   + +  +       +  +  NH + D+  D+E  
Sbjct: 245 LLCQGLALNDNLQHVLQRHDDIANVGSVPSNGRNTRAPPPVQIVDINHDDEDDESDDE-- 304

Query: 302 DQLFRRL--RKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLE 358
              F RL  R      RP    DS               V+  +  + +P     S    
Sbjct: 305 ---FARLAHRSSTPTRRPVHGSDSG-------------MVDILSGDVYKPQGNSSS---- 346


HSP 2 Score: 29.6 bits (65), Expect = 5.3e+00
Identity = 15/40 (37.50%), Positives = 22/40 (55.00%), Query Frame = 1

Query: 349 SIPPPPVKHAEREKFFKDKKIDVG-----VGHMRGLSLHS 384
           ++PPPP +H +R++FF+      G      G  R LSL S
Sbjct: 372 NLPPPPSRHNQRQQFFEHHHSSSGSDSSYEGQTRNLSLTS 411

BLAST of Cucsa.017950 vs. TAIR10
Match: AT4G32760.2 (AT4G32760.2 ENTH/VHS/GAT family protein)

HSP 1 Score: 177.6 bits (449), Expect = 1.6e-44
Identity = 108/311 (34.73%), Positives = 165/311 (53.05%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           M   +V  ATSE L   DW  N++IC+++  D  QAK+V+K IKKR+G++N  AQL A+ 
Sbjct: 1   MVNAMVERATSEMLIGPDWAMNLEICDMLNSDPAQAKDVVKGIKKRIGSRNPKAQLLALT 60

Query: 61  LLEMLMNNIGEAIH-----------------KQSDLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N G+ +H                 K+ D  V+E+I +L+D  Q A GG   ++
Sbjct: 61  LLETIVKNCGDMVHMHVAEKGVIHEMVRIVKKKPDFHVKEKILVLIDTWQEAFGGPRARY 120

Query: 121 PQYYSAYYDLVSAGVQFPQR---------PPAVSSNSPTQQQINNTSQNGVIRLSEQENV 180
           PQYY+ Y +L+ AG  FPQR         PP     +     + N      +     E  
Sbjct: 121 PQYYAGYQELLRAGAVFPQRSERSAPVFTPPQTQPLTSYPPNLRNAGPGNDV----PEPS 180

Query: 181 ARVEPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMH 240
           A  E   LS S  I+ A   ++VL E+L A++P + E  + E  +DLVEQC   KQ+++H
Sbjct: 181 AEPEFPTLSLSE-IQNAKGIMDVLAEMLSALEPGNKEDLKQEVMVDLVEQCRTYKQRVVH 240

Query: 241 LVLSSRDEKIVCGAIELNEKLQKVLARHDALLSG-QFMSTQNQFNGEEVGMSRLPANHYN 285
           LV S+ DE ++C  + LN+ LQ+VL  ++A+ SG    S+Q +    E G S +  +   
Sbjct: 241 LVNSTSDESLLCQGLALNDDLQRVLTNYEAIASGLPGTSSQIEKPKSETGKSLVDVDGPL 300

BLAST of Cucsa.017950 vs. NCBI nr
Match: gi|778694136|ref|XP_011653749.1| (PREDICTED: target of Myb protein 1 isoform X2 [Cucumis sativus])

HSP 1 Score: 783.5 bits (2022), Expect = 1.8e-223
Identity = 399/399 (100.00%), Positives = 399/399 (100.00%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF 120
           LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF
Sbjct: 61  LLEMLMNNIGEAIHKQSDLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVSAGVQF 120

Query: 121 PQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSESSIIEKAGNALEVLKE 180
           PQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSESSIIEKAGNALEVLKE
Sbjct: 121 PQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILSESSIIEKAGNALEVLKE 180

Query: 181 VLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA 240
           VLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA
Sbjct: 181 VLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEKIVCGAIELNEKLQKVLA 240

Query: 241 RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPE 300
           RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPE
Sbjct: 241 RHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPE 300

Query: 301 DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAER 360
           DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAER
Sbjct: 301 DEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAER 360

Query: 361 EKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES 400
           EKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES
Sbjct: 361 EKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES 399

BLAST of Cucsa.017950 vs. NCBI nr
Match: gi|449449813|ref|XP_004142659.1| (PREDICTED: target of Myb protein 1 isoform X1 [Cucumis sativus])

HSP 1 Score: 772.7 bits (1994), Expect = 3.1e-220
Identity = 399/416 (95.91%), Positives = 399/416 (95.91%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQ                 SDLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240
           ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK
Sbjct: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240

Query: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300
           IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA
Sbjct: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300

Query: 301 DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360
           DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ
Sbjct: 301 DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360

Query: 361 HGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES 400
           HGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES
Sbjct: 361 HGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES 416

BLAST of Cucsa.017950 vs. NCBI nr
Match: gi|659069348|ref|XP_008449359.1| (PREDICTED: LOW QUALITY PROTEIN: TOM1-like protein 1 [Cucumis melo])

HSP 1 Score: 751.5 bits (1939), Expect = 7.5e-214
Identity = 386/416 (92.79%), Positives = 392/416 (94.23%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAK+VIKAIKKRLGNKNAN QLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKDVIKAIKKRLGNKNANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQ                 SDLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARVEPQILS 180
           PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQ Q NNTSQNG+IRLSEQENVARVEPQIL 
Sbjct: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQLQTNNTSQNGIIRLSEQENVARVEPQILP 180

Query: 181 ESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240
           ESSIIEKAGNALEVLKEVLDAVDP+HPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK
Sbjct: 181 ESSIIEKAGNALEVLKEVLDAVDPQHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDEK 240

Query: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300
           IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA
Sbjct: 241 IVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEEA 300

Query: 301 DQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360
           +QL+RRLRKGKACV PEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ
Sbjct: 301 EQLYRRLRKGKACVMPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEIQ 360

Query: 361 HGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNES 400
           HGQGV IPPPPVKHAEREKFFK+KK DVGVGHMRGLSLHSRNASSSRSGSIDFNES
Sbjct: 361 HGQGVGIPPPPVKHAEREKFFKEKKXDVGVGHMRGLSLHSRNASSSRSGSIDFNES 416

BLAST of Cucsa.017950 vs. NCBI nr
Match: gi|703098599|ref|XP_010096423.1| (TOM1-like protein 2 [Morus notabilis])

HSP 1 Score: 512.7 bits (1319), Expect = 5.8e-142
Identity = 278/416 (66.83%), Positives = 317/416 (76.20%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVN ATS+KL E DW KNI+ICELVA DQRQAK+VIKAIKKRLG+K+ N QLYAVL
Sbjct: 1   MAAELVNCATSDKLPEMDWTKNIEICELVARDQRQAKDVIKAIKKRLGSKHTNTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQ                 SDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENIHKQVIDTGIIPILVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVAR-VEPQIL 180
           PQYY+AYY+LVSAGVQFPQRPPAVSS+ PT Q  NN   NG +  S  E  AR  EPQ +
Sbjct: 121 PQYYNAYYELVSAGVQFPQRPPAVSSDHPTPQPNNNNLPNGELASSRHEGFARQAEPQDV 180

Query: 181 SESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDE 240
            ESSII+KAGN LEVLKEVLDAVD +HPEGA+DEFTLDLVEQCSFQKQ++MHLV++SRDE
Sbjct: 181 PESSIIQKAGNVLEVLKEVLDAVDSQHPEGAKDEFTLDLVEQCSFQKQRVMHLVMTSRDE 240

Query: 241 KIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEE 300
           K+V  AIELNE+LQKVLARH+ALLS +  ST                NH+N +E E+EEE
Sbjct: 241 KVVSRAIELNEQLQKVLARHEALLSAKPRST---------------VNHFNQEEAEEEEE 300

Query: 301 ADQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEI 360
           A+QLFRRLRKGKAC RPEDEE  + E P LG LG +IP E  NRP+IRP+  + S+    
Sbjct: 301 AEQLFRRLRKGKACARPEDEERPA-EHPHLGFLGSAIPGEMLNRPLIRPLSLEPSSREPN 360

Query: 361 QHGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNE 399
            H   V+IPPPP KH EREK+F++ K D   GH+RGLSLHSRNASSSRS SID ++
Sbjct: 361 GH---VAIPPPPAKHVEREKYFQENKADGLAGHVRGLSLHSRNASSSRSESIDSSD 397

BLAST of Cucsa.017950 vs. NCBI nr
Match: gi|1009125549|ref|XP_015879670.1| (PREDICTED: TOM1-like protein 2 isoform X2 [Ziziphus jujuba])

HSP 1 Score: 508.8 bits (1309), Expect = 8.4e-141
Identity = 278/416 (66.83%), Positives = 319/416 (76.68%), Query Frame = 1

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60
           MAAELVNSATSEKL E DW KNI+ICE VAHDQRQAK+VIKAIKKRLG+K+AN+QLYAV+
Sbjct: 6   MAAELVNSATSEKLNEIDWTKNIEICEFVAHDQRQAKDVIKAIKKRLGSKHANSQLYAVM 65

Query: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQ                 SDLPVRERIFLLLDATQT+LGGASGKF
Sbjct: 66  LLEMLMNNIGENIHKQVIDTGILPLLVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 125

Query: 121 PQYYSAYYDLVSAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRL-SEQENVARVEPQIL 180
           PQYY+AYYDLVSAGVQFPQRPPAV SN PT Q   N   NG   + S + +  +VEPQ +
Sbjct: 126 PQYYNAYYDLVSAGVQFPQRPPAVPSNRPTSQPNQNNLSNGHGEVASSRHDTKQVEPQNV 185

Query: 181 SESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVLSSRDE 240
            ESSII+KA NALEVLKEVLDAV  ++PEGARDEFTLDLVEQCSFQKQ++MHLV++SRDE
Sbjct: 186 PESSIIQKASNALEVLKEVLDAVSSQNPEGARDEFTLDLVEQCSFQKQRVMHLVMTSRDE 245

Query: 241 KIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGMSRLPANHYNHDEGEDEEE 300
           K+V  AIELNE+LQKVLARHDALLS +  S                ANH+N +E E+EEE
Sbjct: 246 KVVSQAIELNEQLQKVLARHDALLSARSTSI---------------ANHFNEEETEEEEE 305

Query: 301 ADQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERANRPIIRPIDEKVSTTLEI 360
           A+QLFRRLRKGKAC RPED+E S+ ERP LGLL  S+  ER NRP+IRP+  + S  +  
Sbjct: 306 AEQLFRRLRKGKACARPEDDEHSA-ERPHLGLLASSLSGERLNRPLIRPLSLEPSGEMN- 365

Query: 361 QHGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSRNASSSRSGSIDFNE 399
            H   V+IPPPP KH ERE++F++ K  V  GHMRGLSLHSRNASSS S SID ++
Sbjct: 366 GHSPPVAIPPPPAKHVERERYFQENKDGVS-GHMRGLSLHSRNASSSHSESIDSSD 403
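Each HSP header above reports identity and positives as a fraction followed by a percentage, so the percentages can be rechecked from the counts. The following is a minimal sketch of that check, not part of the original report: the function name `parse_hsp_stats` and the returned field names are hypothetical, and the pattern assumes the header layout shown in the alignments above (it accepts both the "Positives" and "Postives" spellings).

```python
import re

# Matches "Identity = a/b (p%), Positives = c/d (q%)".
# The optional "i" tolerates the "Postives" spelling found in some reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    ident, ident_len = int(ident), int(ident_len)
    pos, pos_len = int(pos), int(pos_len)
    return {
        "identical": ident,
        "positive": pos,
        "alignment_length": ident_len,
        # Recomputed from the counts, rounded like the report (two decimals).
        "identity_pct": round(100 * ident / ident_len, 2),
        "positive_pct": round(100 * pos / pos_len, 2),
        # Values as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    # Header of the Jatropha curcas TrEMBL hit shown earlier in this record.
    stats = parse_hsp_stats(
        "Identity = 269/418 (64.35%), Positives = 321/418 (76.79%), Query Frame = 1"
    )
    print(stats["identity_pct"], stats["positive_pct"])
```

The alignment length (418 in this example) counts alignment columns including gaps, which is why it can exceed the 399-residue query.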

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
TOM1_MOUSE  3.3e-12  23.59         Target of Myb protein 1 OS=Mus musculus GN=Tom1 PE=1 SV=1
TOM1_HUMAN  7.4e-12  22.89         Target of Myb protein 1 OS=Homo sapiens GN=TOM1 PE=1 SV=2
TOM1_CHICK  2.8e-11  23.24         Target of Myb protein 1 OS=Gallus gallus GN=TOM1 PE=2 SV=2
HGS_MOUSE   3.1e-10  29.63         Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Mus musculus GN=...
HGS_RAT     3.1e-10  29.63         Hepatocyte growth factor-regulated tyrosine kinase substrate OS=Rattus norvegicu...
Match Name        E-value   Identity (%)  Description
W9RQ74_9ROSA      4.1e-142  66.83         TOM1-like protein 2 OS=Morus notabilis GN=L484_013104 PE=4 SV=1
A0A067KK37_JATCU  1.0e-140  64.35         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_09342 PE=4 SV=1
A0A061DN59_THECC  1.1e-139  65.17         ENTH/VHS/GAT family protein isoform 1 OS=Theobroma cacao GN=TCM_003342 PE=4 SV=1
A0A067F6C8_CITSI  3.6e-138  63.00         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g0143901mg PE=4 SV=1
B9IP43_POPTR      3.9e-137  63.81         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0019s00820g PE=4 SV=2
Match Name   E-value   Identity (%)  Description
AT5G63640.1  9.3e-122  56.54         ENTH/VHS/GAT family protein
AT1G21380.1  6.8e-48   35.40         Target of Myb protein 1
AT3G08790.1  2.0e-47   30.81         ENTH/VHS/GAT family protein
AT1G76970.1  1.9e-45   33.51         Target of Myb protein 1
AT4G32760.2  1.6e-44   34.73         ENTH/VHS/GAT family protein
Match Name                         E-value   Identity (%)  Description
gi|778694136|ref|XP_011653749.1|   1.8e-223  100.00        PREDICTED: target of Myb protein 1 isoform X2 [Cucumis sativus]
gi|449449813|ref|XP_004142659.1|   3.1e-220  95.91         PREDICTED: target of Myb protein 1 isoform X1 [Cucumis sativus]
gi|659069348|ref|XP_008449359.1|   7.5e-214  92.79         PREDICTED: LOW QUALITY PROTEIN: TOM1-like protein 1 [Cucumis melo]
gi|703098599|ref|XP_010096423.1|   5.8e-142  66.83         TOM1-like protein 2 [Morus notabilis]
gi|1009125549|ref|XP_015879670.1|  8.4e-141  66.83         PREDICTED: TOM1-like protein 2 isoform X2 [Ziziphus jujuba]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002014  VHS_dom
IPR004152  GAT_dom
IPR008942  ENTH_VHS
Vocabulary: Biological Process
Term        Definition
GO:0006886  intracellular protein transport
Vocabulary: Cellular Component
Term        Definition
GO:0005622  intracellular
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006886      intracellular protein transport
cellular_component  GO:0005622      intracellular
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.017950.1  Cucsa.017950.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description   Source   Source Term        Source Description                    Alignment
IPR002014  VHS domain        PFAM     PF00790            VHS                                   coord: 4..76; score: 1.6
IPR002014  VHS domain        SMART    SM00288            VHS_2                                 coord: 2..117; score: 6.9
IPR002014  VHS domain        PROFILE  PS50179            VHS                                   coord: 9..121; score: 23
IPR004152  GAT domain        GENE3D   G3DSA:1.20.58.160  -                                     coord: 166..248; score: 5.7
IPR004152  GAT domain        PFAM     PF03127            GAT                                   coord: 174..248; score: 6.9
IPR004152  GAT domain        PROFILE  PS50909            GAT                                   coord: 159..247; score: 1
IPR008942  ENTH/VHS          GENE3D   G3DSA:1.25.40.90   -                                     coord: 4..122; score: 7.4
IPR008942  ENTH/VHS          unknown  SSF48464           ENTH/VHS domain                       coord: 4..122; score: 1.1
None       No IPR available  PANTHER  PTHR13856          VHS DOMAIN CONTAINING PROTEIN FAMILY  coord: 1..360; score: 4.3E
None       No IPR available  PANTHER  PTHR13856:SF81     ENTH/VHS/GAT FAMILY PROTEIN           coord: 1..360; score: 4.3E
None       No IPR available  unknown  SSF89009           GAT-like domain                       coord: 129..249; score: 4.97
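
The "coord" values in the InterPro annotations give each match as a start..end range on the protein. As a rough illustration, the sketch below converts such a range into a match length. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the function name `span_length` is hypothetical.

```python
import re

# Matches the "coord: start..end" part of an Alignment entry.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment):
    """Return the number of residues covered by a 'coord: start..end' entry.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    match = COORD.search(alignment)
    if match is None:
        raise ValueError(f"no coordinate range found in {alignment!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError(f"end precedes start in {alignment!r}")
    return end - start + 1


if __name__ == "__main__":
    # PFAM VHS and PROFILE GAT matches from the annotations above.
    print(span_length("coord: 4..76"))
    print(span_length("coord: 159..247"))
```

Under that assumption the PFAM VHS match (4..76) spans 73 residues and the PROFILE GAT match (159..247) spans 89 residues.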

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene          Organism                   Block
Cucsa.017950  Cucurbita pepo (Zucchini)  cgycpeB0693
Cucsa.017950  Cucurbita maxima (Rimu)    cgycmaB0738
Cucsa.017950  Cucurbita moschata (Rifu)  cgycmoB0728