MS017824 (gene) Bitter gourd (TR) v1

Overview
Name: MS017824
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: DNA-binding protein SMUBP-2
Location: scaffold373: 2930260 .. 2939469 (+)
RNA-Seq Expression: MS017824
Synteny: MS017824
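
The Location field gives a scaffold name, a start and end coordinate, and a strand. A minimal sketch of how the genomic span could be derived from it follows; it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page), and the function names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("scaffold373: 2930260 .. 2939469 (+)")
span = span_length(start, end)
print(seqid, strand, span)
```

Under that assumption the gene spans 9210 bp on the forward strand of scaffold373; with 0-based half-open coordinates the `+ 1` would be dropped.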
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAACGCCACAACATCGATCCACCTGTTTCGCCAGAATCAGACTGCAGTTACAGTTGCTTTCCAGCAGTTTGTTCAGACTATTAATGGCGCCAATCATCATCATCATCCTCCCAGTGGTACGCAGAGGAGGAGGATTCGTGTTGTTAAAACCAATAAAAATGTGAAGAAGCCGAATAGTCTCGAGATTTCTTCTCCTTCTACTGGCAACCTCGCTGCTGCTGCTAAAATTAGCATCAGTACTGGGAGTTTAGTCGGCGCTGAGACGGAGGCGCAACCGAAGCGGCCACCTCCGGGTGAACTGGAAGGGAAGAGGGATAATAGGTCGGTTAACGTGCGTGGTATCTATCAGAATGGGGATCCTCTTGGGCGGAGGGAACTGGGGAAGAGCGTGGTCCGGTGGATTGGGCAGGCCATGCGAGCTATGGCCTCAGATTTCGCTTCTGCGGAGGTTCAGGGCGATTTCTCCGAGCTACGGCAGCGGATGGGCACGGGGCTTACTTTTGTAATTCAAGCCCAACCGTATCTCAATGCGGTGCCTATGCCTCTTGGACTTGAAGCCGTCTGCTTGAAAGCTTGTACTCACTATCCCACTCTCTTTGACCATTTCCAGAGGGAGCTTAGGGATGTTCTCCAAGATCTCCAAAGCAAATCACTTTTTCCTGATTGGCGCCGCACTCAATCATGGAAGCTCCTCAAGGAGCTCGCTAATTCAGGTTCCTTCAACTACAATTGTATTTCTTGCGTGAAAATAATGTTTTTGAGTTTATTCTGGTAGAATTCGATCGAAGAACTAAATTGAATTCTATTGGAAGGCTAAAAAGTATTCATGTAACAAAGATTTCAATGTTTATTTATCAAATGATTTTGAAATATAAGTAACGAATTTCAAATCTTATAATTCTAAATGGACCCCATGAAAGTGTATTTCTTGTTTGTTCGATTAGGAACAAGTTGCTTGACTGGGAACTGACATGTTGCATTTTCAGCTCAGCATAAAGCTATAGCACGTAAGATAAGCCAGCCAAAGGCTGTTCAAGGTGTTTTAGGTATGGACCTGGAGAAGGCCAAGGCCATACAGAACAGGATTGATGAGTTCGCAAACCGCATGTCTGAATTACTTCGCATTGAGAGAGATTCCGAGTTGGAGTTTACACAAGAGGAGTTGGATGCAGTTCCTACACCAGATGAGGGTTCGGATCTTTCCAAACCCATCGAGTTCTTAGTCCATGGCCAAGCTCAGCAGGAACTCTGTGACACTATATGCAATTTGAATGCAGTTAGCACGTCTACAGGTCTAGCTAATGATTTCATTTTATAAAGGTGTTTCAAGCAATTTGCTTTCTTCAAATTTTGATCTCGGCTTTACTTCAATCACTTGTTCTCCAACATGCTTGAAAAATTAACTTTTATTGGTATTCAGGATTGGGGGGGATGCATTTGGTATTATTCAGGGTTGAAGGAACCCATAGATTACCGCCTACAACCCTTTCGCCAGGAGATATGGTTTGTGTGAGAGTTTGTGATAGCAGGGGTGCCGGTGCAACTTCTTGCATGCAAGGATTTGTGAACAATCTGGGGGACGATGGTTGCAGCATTACCATAGCCCTAGAATCTCGTCATGGTGACCCTACCTTTTCTAAGCTCTTTGGAAAGTCCGTGCGTATTGATCGTATTCCAGGATTAGCTGACACTCTCACTTATGAGGTACATGGTGAATTTTTCTTGCTTCACTCATTGAATTACTCGACCTGCATTGTTATCTTCAAAAAAATTTCATGAGATTTTATTTACTTAAAAAGTTATATCTGAACTAAGTATAAAATCTTTTACAGCGCAACTGTGAAGCTTTGATGTTGCTTCAGAGAAATGGTTTGCAAAAGAAAAATCCTTCTATTGCTGTAGTGGCTACATTATTTGGTGATAAAGAAGACATCAAGTGGATGGAAGAGAATGACTTGATAGATTTGGCTGAAACCAACATGGATGGCATCATGCTCA
ATGGAGATTTTGATAATTCACAAAGAAGTGCAATTTCTCATGCTTTGAATAAAAAGCGGCCTATATTGATAATCCAAGGGCCACCTGGTACTGGAAAGACAGGTTTGCTAAAGGAACTTATTGCACTTGCTGTTCAACAGGGTGAAAGGGTGCTTGTAACAGCACCCACTAATGCAGCTGTTGACAACATGGTTGAAAAACTCTCAAACATTGGGATAAACACTGTTAGGGTAGGAAATCCAGCACGGATATCTTCAAGTGTTGCATCCAAGTCTTTGGCTGAAATTGTAAACTCTAAACTTGCAAGTTTTAGAACAGAGTTTGAAAGGAAGAAGGCAGATCTAAGGAAAGACTTGAGACATTGTTTAAAGGATGACTCATTGGCTGCTGGCATACGCCAGCTTCTGAAGCAGCTTGGAAAGACACTAAAAAAGAAAGAGAAGGAAACTGTGAAGGAAGTACTCTCAAGTGCCCAAGTTGTTCTTGCCACCAACACTGGAGCGGCTGATCCTTTAATTCGTAGGTTGGAGAAATTTGATCTAGTTGTTATAGATGAGGCGGGTCAGGCGATTGAACCTTCTTGCTGGATTCCAATATTACAGGGATCCCGTTGTATTCTTGCAGGTGATCAATGCCAACTTGCTCCTGTGATCTTGTCTAGAAAAGCCTTGGAAGGTGGTCTTGGAGTGTCATTGCTGGAGCGGGCTGCAACCTTGCATCAGGAGACTCTAACCACTATGTTAACCATACAGTACCGTATGAATGATGCAATAGCTAGTTGGGCTTCAAAGGAGATGTATGGCGGAATGTTGAAGTCCTCATCAACAGTCTCTTCTCATCTTCTTGTAAACTCTCCATTTGTCAAGGTATATGCTCCAAAAATTACTGTATTGTGTTTTTGTATAACAGTCTTGACATTCATGCTGATGGAGGTTTTTGAGTTATATTTCTGAGAACATGATTCTTGATTCTATGTAAAAAATCTTCTGTCTTATCATTTTGTTACTAGTTTCCTTCAAGTAATCTGTATGAGTTTGAGGTCTGCATGTGTAGAGTCTATTTCAGTTCCATTGTTAATAAATTTAGTTCTAAAAATGGGTCCAAAGACTTCAAAGTTAAATAAGGATGGGTAAGTCGGTAAGGTTCAGTTAGCTTATCTGTAGCAGCTCAAGAGACTGTATATTCTGGTTAATTGGTTAACAATAATTAGATCCATGGCTTCAAGGATATTATACTAGGAAAATTTCCCCCATAACTTAACTTTAGGGCTATTGTAGCTAAAATGAACTATGAGGTGATCTTTAAAAAGTTTAATATTTCTTCAATCATGCTATAATTTGTTCCTCTCGTCATTTGCCTCCACCACCTCGTGTAGAGAAGGTCTGTATTTGTATATTTGTAAAAATAGTAGCCTTTTCCACGTGGAGGACCTGGTCACCCACCTGGAGGTCCGCAGGGCCAAAAAGATCAGGAAGCTAACCGCTTCTGCATCCATTTGTTTTTTCTCTTGCTCATGAAAATTTGTGTGTATTTGTTTTTTACATTCATTGAAATTTTTAAGTATTTCAATACTGGAAATTTATTTTGTGATATAAGTGTGTGTATTCTTATATGGAGTCAAAATAGCTATTATGTATGTCTCCGTTTCTTATGTATAAAAAAATGAAGTGTATGTATCTTATTGTATAATAAAATTTCGTGAGTCTCCATAAGTTGCCAAAGATTACGGTCCAGGTTCTTTTACATCTTTTCTGAACCTTGTCTGTGTGGTACTTAGTAACTATTTTTCTTCTTTGCATCGATGCAGCCCACATGGATAACCCAGTGCCCCTTGCTATTGCTCGACACTAGAATGCCATATGGAAGTTTGTCCGTTGGTTGTGAAGAGCACTTAGATCCAGCTGGTACAGGTTCATTATACAATGAAGGCGAGGCAGATATTGTCGTGCAACATGTCTGCTCTTTGATTTATTCTGGTAATGTTCTTTACTTGTCAATAT
TGATAATGTTTTGCTGTAGTGCCTTTTAAAAGAAGATTAGGGAAGCGAATGATGAAAAATATAGAGGTTTGGTGAATGGAAGTGGGGGATACTGAAAAGGTGGATTCTCTCTGAACTTTCAACCTTTTAGGAGCCAAGAGAAACTAGAGCAGATATATTATTTTTATTACAATGTTTAATAGGGGAAGATTGTTGTTTGGTCATAGATCTGAAAAAAGGATTATCTTTTTAACACAGAAAATAAGCAAATATAAAGGACGTACATTTAGCAGCATTCAATCTACATATTGTGGAATGACAACAACTACTTATAACTTATAGGTATAACTCCAATATAAAATTTGTTTCATATAAACAAAATCAGGTGAAGTTTTAAAACTTTCAGAGTTTGTGACCCGTATGATTTGTGATTGAAAACTGTATTCTTCACAACATTGAAAAGGAGGGAGTGAATCATTTGAAAACTTTTTTAAATTTTTGAAACAGAATATTTTAGAACATCCTTCATTTTTTGAATGAAAGAGTATTAGGATAAGAAATGATATATAAATTGAAAAACATAATTTTTCATTGATGAAATGAAAAATTACATAAGAGTTACAACAGTTTTCTTTAGCTTTAAGGTTTAGGGTTTACAAATGATATATAAATTGAAAAACTTGTCATTAAAAATGGAACCAAAAGAACAGTATAAAGGCTTGGGGATGAGAAGCCCCGGCCTTACAAAAAACTAATGAAAAACAACTTGTCAATCNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAAACAAAAAACTAAAAACAAAAAACAAAAAAAAAAAAAAAAAAAAAAAAAATCCTGTAGTTTAGACACCAAGTAGCAGCAGTAATTTGTACCGTATTTCAAAAAGAATCAGGAGAACGTATCTTGTTGAAAACCCAATCATTTCCCTCTTTCCAAAGGCACCACCAAAGAGCCCTAACTGCACAACCCCAAAGAATTTGTGTTTTTCCTTTGAAATGCCACCTCTGAAGCACCTCTAGAAGCATTCTTCCACACTTCTAGGCAAACAGCCAACAAAACCAAACTCTCTAAACATGAGGGACCAAAAACCAGAAGCAGTGGGCAATGAAGAAAAAGATGGCCCCGTTTTTCTGAGTTTTTGTAACAAAAACAGCGCACCCAAGGCAACAAACTCCAGTTTGAAACATTTTTTTGTAGGATCTCATGGGTGTTTAGGCTTCTATAAGCTAAGCTCCACAAAAAAAATAAAATAAAATAAAAATAAAGAACTTTAACTTCCATTGGATAATTGAACTTCCAAATAAGATTAATAAGGGGAGCCTTGATCTTTGGGGAGCTTTCTCTTAATTTGAGGAAGAGAGATTAGTTGAGAACATCTTCCTACTTTCGAGATTCCATAGAATTTTGTCTTCCCTGTTACCCTTCCGAAAAGAGTTCAAATTCTCATCACGGAAATCAAGCTAGCTGTATCCCGTAAAACCTCTCCTGAAAGAGGTTGGTGTTCTCATCTCCATTTAAAATCCATTGTTGTTTACAGTTTTGAGCCCACGGCCTTACTTCCTTCAAAATAATTTCAGCTAGTTCCTCCCTAACTGTTCTCATCTGTTATCAC
CACTCTAACAAAGCAGTTTTCATCCCACACTCAAATGCCCGCAATCCATGCATGAAGTTTGGTGTTCATGCAAGGATGCTCTATATGGGCACATCTGCTGTGATTTAGTTTAATACAATCATAGAAGGTATTTCTGGATCTTGTACATGTTTTAATTTTTTATAAGCTCTCTCCTCCCCCATCTCTGCCAAGCATCTCACATTTGTCATTGGATTGTCTATTGGTCGGTGTCGGAAAGAGTGCACTGCCTGAGATTTCTTAACTTCACCCATCAGCATCTTGGAGGAATTGGGTCATTAGATAGAACTTTTTATTTCTTCTTTTAGTAGGCATTTATGATATTCAAATGGCTTTAATCTTCTCCTACTTCTATCTTGGATGAAAGATTTTATTGAGATTTTCTGTTCTAAATCCTGCCACAGGCGTCAGTCCAAGAGCAATTGCAGTCCAATCTCCTTATGTTGCTCAGGTACAACTTTTGAGGAACAAGCTCGACGAAATTCCCGAAGCTGCTGGCATTGAGGTAGCCACCATTGATAGCTTCCAAGGGCGGGAGGCGGATGCAGTAATCATATCAATGGTAGGATATACATATACATAATTTGCAACTACATTATTTTCTTGACCATGGGTATGAATTACATGTCTACATGTATATGGGAATTCTTTTTCTTTATTATTTAAATCTTGTATTTTCTCTCTTCCACTAATTAATTATATCAAGCTGACTGCCTTTAAGTAGATTAATTGAGGTTGGCTAGATATAGCTTCTTCAAGAGGTAGAACCATTCTAAGTTTCTCAACATTGATAATTTCCTTCGTTTTTTATTAAAAGGACCATGGGCGTCTGAGTGGGCTTATTACACATTTTTCAATGACAATTGTACTCTTTAGAACTTATAGAAGATTTAGTACCTTCCCTTTATCTTCTTATCAAATATCAAAGCCGTATACAACTTCCATTTTTTGGTTACAAATAATTCAAGCCTCTTCTCTGTGTTTGGCATGAACTCAGAATTTATTTGCTTTGATTCTGCCCATGCTCTTATGAAAAAAAAAAAAATTAAAACAAAATAAGAAGGCATGCAAGCTCCTACTTCTACTCCGTCTCCTCTTTCATTGTCTTTGGTCCTTGGTTTGTAGTGGTATAAAGTGAGATCTTGCAAATGAATTAAGACAGATATACTAGTTCCCATTTTAGGGACAGGCTTTGCTGCTAGTGTTATATTGGTGAATTAACTTCGAGAGGACAAAGGATACATGCGTATCATTATTATGGCTCATGTTGGTGAACTAATCAATTCTGCTGGTATGTCATTGTGTATTTGAATAATTTGTCTTGTGATTTCGTTCCAGTAATCTAATGGTGAGGATGTTGATGTAATATATTTAAGGAATGAGTCCCATTCCGATTGCTTGGGCCAGACCCATTTTTGAACCTCGGCTCAAAGGTTGTAGCATCCTTACTTCTGCCTGGATTTTTATCTTCAACCCATTGACAAACTCTGCAACAAGATCTAACTAACACGTTTGCAGTTGACTTCGTTGTACCATCATAGTTTTCCAATGAGCGGGAAATTTTTCACACAACGTTCCCTCCCTGCAGGTCTAAATTGACTCAAAAAGTTGGTGATGTAAGTCCACCTAGATTTAGAATGGTCTTTGGCCTTCATCCTTATGAAAGTGGGGAAACACTTCTCCTTTTAAACAAATTGCTACGCACCAATCATCTCTACCTCTTATTTGCTTAGAATTACCTCTCTGCTCAGATCGAATAGTCCCCTAGCTCTTCGTTTCTCCATTAAATATTCACGTCTTGAATCACCTCACTAGTTTTCCACCATTACCCTATCATCACTATCCTCCTTTGCTCCAGCAGAAGTAGCTCCATTAGGAGCTTTTCCTTTCTGTTCTTGTCTCGATGTTTGTTTCCTGCTGATTTGGAATTGCACCACATCTCCATTTATTTCATCACAGTTTTTTTGTTGGGTAAGAAACAA
AAGATTATATAGATCTGTGAGAGTAGTACAAAGGGTTTATTTCATCACAGTTCTAAAGCATCTCATCCCTTGTTTTCATTTAGTCTTGTATCCTTCACACTGTTGACACTAGAGATTGTTGTAATTCTGAAAGATCTATCTCCATTCCCTTGATTCTGCACTCCATTTTATTTTAGGAACTCTCACATCTTTGATAAGATTTGAAGTAGTAGTTTTGTTGGTTACCTGAAAAACAAGTACAATCAAGGAAATTTAGGAGCTCCTCTATCTTTCCAATCTGTCCTAATTAGTCACATGGGTCCAATCCAACATTACATTTCTTCCAAGCTTCCCCCATTTTTTGTCCTCGTAATTCTATTACCTAGCCCTCTCTTAGATCTCATCTTAGGAAAAGCTTATCAGATGATCGTTGATTAGATGACGTTTCATTAAAAATTCCCTTTCCCATGTTGCATCAGTTCGGAAGTTCCAAAAGGTTTGTTGTTAAAGTTTGCCAGAATGGTTAATAGGGTGAGATTTTGGAGAGGACTCTTGACTTCTTCTTTCTCCTTTGTGCATTGTGCGGCCGTCATTGCTTATCGGAAGGATGAACCATTAGTATTATCTTTTTGGATTGGAGCCCCTGTCTTTGAGTAGGTCTCCTTTCATTAGGCTGTTGTTTTTTAATGTCCTGAAATATTCTTTTTTTTTTCCTATTTTTCTCAATAAAAATTTCGGTTTCTCATTAAAAATATATATATATAGGGAGTGTATAGAATGGTTAGCCCTAACTCCTAAGCTAAGCGTTAATAATTGATTGGTATTTTTGTAAGGTTTGCAGGATGGTTCTGCAGTGTCATTTTCTTTAACAGAATCATGTTTGTGGCAAAACTCTGTTTTGATTTTATGAAAGGTCATACGGAGCATTCAAGATGTCAATAATGTTTTTACAATGTAAAATTTATATAATGTTCTGTGCTTATTCTATCAGGTAAGGTCAAACAATCTTGGAGCTGTCGGGTTTCTGGGAGACAGTCGGCGGATGAACGTGGCCATAACAAGGGCAAGAAAACACGTAGCGGTGGTCTGCGATAGCTCGACAATATGTCAAAATACCTTCTTGGCGAGGCTACTGCGTCACATACGATATTTTGGAAGAGTGAAGCATGCAGAACCAGGTACTTTTGGAGGATCTGGTCTTGGAATGAATCCAATGTTGCCATCCATTAAT

mRNA sequence

ATGAACGCCACAACATCGATCCACCTGTTTCGCCAGAATCAGACTGCAGTTACAGTTGCTTTCCAGCAGTTTGTTCAGACTATTAATGGCGCCAATCATCATCATCATCCTCCCAGTGGTACGCAGAGGAGGAGGATTCGTGTTGTTAAAACCAATAAAAATGTGAAGAAGCCGAATAGTCTCGAGATTTCTTCTCCTTCTACTGGCAACCTCGCTGCTGCTGCTAAAATTAGCATCAGTACTGGGAGTTTAGTCGGCGCTGAGACGGAGGCGCAACCGAAGCGGCCACCTCCGGGTGAACTGGAAGGGAAGAGGGATAATAGGTCGGTTAACGTGCGTGGTATCTATCAGAATGGGGATCCTCTTGGGCGGAGGGAACTGGGGAAGAGCGTGGTCCGGTGGATTGGGCAGGCCATGCGAGCTATGGCCTCAGATTTCGCTTCTGCGGAGGTTCAGGGCGATTTCTCCGAGCTACGGCAGCGGATGGGCACGGGGCTTACTTTTGTAATTCAAGCCCAACCGTATCTCAATGCGGTGCCTATGCCTCTTGGACTTGAAGCCGTCTGCTTGAAAGCTTGTACTCACTATCCCACTCTCTTTGACCATTTCCAGAGGGAGCTTAGGGATGTTCTCCAAGATCTCCAAAGCAAATCACTTTTTCCTGATTGGCGCCGCACTCAATCATGGAAGCTCCTCAAGGAGCTCGCTAATTCAGCTCAGCATAAAGCTATAGCACGTAAGATAAGCCAGCCAAAGGCTGTTCAAGGTGTTTTAGGTATGGACCTGGAGAAGGCCAAGGCCATACAGAACAGGATTGATGAGTTCGCAAACCGCATGTCTGAATTACTTCGCATTGAGAGAGATTCCGAGTTGGAGTTTACACAAGAGGAGTTGGATGCAGTTCCTACACCAGATGAGGGTTCGGATCTTTCCAAACCCATCGAGTTCTTAGTCCATGGCCAAGCTCAGCAGGAACTCTGTGACACTATATGCAATTTGAATGCAGTTAGCACGTCTACAGGATTGGGGGGGATGCATTTGGTATTATTCAGGGTTGAAGGAACCCATAGATTACCGCCTACAACCCTTTCGCCAGGAGATATGGTTTGTGTGAGAGTTTGTGATAGCAGGGGTGCCGGTGCAACTTCTTGCATGCAAGGATTTGTGAACAATCTGGGGGACGATGGTTGCAGCATTACCATAGCCCTAGAATCTCGTCATGGTGACCCTACCTTTTCTAAGCTCTTTGGAAAGTCCGTGCGTATTGATCGTATTCCAGGATTAGCTGACACTCTCACTTATGAGCGCAACTGTGAAGCTTTGATGTTGCTTCAGAGAAATGGTTTGCAAAAGAAAAATCCTTCTATTGCTGTAGTGGCTACATTATTTGGTGATAAAGAAGACATCAAGTGGATGGAAGAGAATGACTTGATAGATTTGGCTGAAACCAACATGGATGGCATCATGCTCAATGGAGATTTTGATAATTCACAAAGAAGTGCAATTTCTCATGCTTTGAATAAAAAGCGGCCTATATTGATAATCCAAGGGCCACCTGGTACTGGAAAGACAGGTTTGCTAAAGGAACTTATTGCACTTGCTGTTCAACAGGGTGAAAGGGTGCTTGTAACAGCACCCACTAATGCAGCTGTTGACAACATGGTTGAAAAACTCTCAAACATTGGGATAAACACTGTTAGGGTAGGAAATCCAGCACGGATATCTTCAAGTGTTGCATCCAAGTCTTTGGCTGAAATTGTAAACTCTAAACTTGCAAGTTTTAGAACAGAGTTTGAAAGGAAGAAGGCAGATCTAAGGAAAGACTTGAGACATTGTTTAAAGGATGACTCATTGGCTGCTGGCATACGCCAGCTTCTGAAGCAGCTTGGAAAGACACTAAAAAAGAAAGAGAAGGAAACTGTGAAGGAAGTACTCTCAAGTGCCCAAGTTGTTCTTGCCACCAACACTGGAGCGGCTGATCCTTTAATTCGTAGGTTGGA
GAAATTTGATCTAGTTGTTATAGATGAGGCGGGTCAGGCGATTGAACCTTCTTGCTGGATTCCAATATTACAGGGATCCCGTTGTATTCTTGCAGGTGATCAATGCCAACTTGCTCCTGTGATCTTGTCTAGAAAAGCCTTGGAAGGTGGTCTTGGAGTGTCATTGCTGGAGCGGGCTGCAACCTTGCATCAGGAGACTCTAACCACTATGTTAACCATACAGTACCGTATGAATGATGCAATAGCTAGTTGGGCTTCAAAGGAGATGTATGGCGGAATGTTGAAGTCCTCATCAACAGTCTCTTCTCATCTTCTTGTAAACTCTCCATTTGTCAAGCCCACATGGATAACCCAGTGCCCCTTGCTATTGCTCGACACTAGAATGCCATATGGAAGTTTGTCCGTTGGTTGTGAAGAGCACTTAGATCCAGCTGGTACAGGTTCATTATACAATGAAGGCGAGGCAGATATTGTCGTGCAACATGTCTGCTCTTTGATTTATTCTGGCGTCAGTCCAAGAGCAATTGCAGTCCAATCTCCTTATGTTGCTCAGGTACAACTTTTGAGGAACAAGCTCGACGAAATTCCCGAAGCTGCTGGCATTGAGGTAGCCACCATTGATAGCTTCCAAGGGCGGGAGGCGGATGCAGTAATCATATCAATGGTAAGGTCAAACAATCTTGGAGCTGTCGGGTTTCTGGGAGACAGTCGGCGGATGAACGTGGCCATAACAAGGGCAAGAAAACACGTAGCGGTGGTCTGCGATAGCTCGACAATATGTCAAAATACCTTCTTGGCGAGGCTACTGCGTCACATACGATATTTTGGAAGAGTGAAGCATGCAGAACCAGGTACTTTTGGAGGATCTGGTCTTGGAATGAATCCAATGTTGCCATCCATTAAT

Coding sequence (CDS)

ATGAACGCCACAACATCGATCCACCTGTTTCGCCAGAATCAGACTGCAGTTACAGTTGCTTTCCAGCAGTTTGTTCAGACTATTAATGGCGCCAATCATCATCATCATCCTCCCAGTGGTACGCAGAGGAGGAGGATTCGTGTTGTTAAAACCAATAAAAATGTGAAGAAGCCGAATAGTCTCGAGATTTCTTCTCCTTCTACTGGCAACCTCGCTGCTGCTGCTAAAATTAGCATCAGTACTGGGAGTTTAGTCGGCGCTGAGACGGAGGCGCAACCGAAGCGGCCACCTCCGGGTGAACTGGAAGGGAAGAGGGATAATAGGTCGGTTAACGTGCGTGGTATCTATCAGAATGGGGATCCTCTTGGGCGGAGGGAACTGGGGAAGAGCGTGGTCCGGTGGATTGGGCAGGCCATGCGAGCTATGGCCTCAGATTTCGCTTCTGCGGAGGTTCAGGGCGATTTCTCCGAGCTACGGCAGCGGATGGGCACGGGGCTTACTTTTGTAATTCAAGCCCAACCGTATCTCAATGCGGTGCCTATGCCTCTTGGACTTGAAGCCGTCTGCTTGAAAGCTTGTACTCACTATCCCACTCTCTTTGACCATTTCCAGAGGGAGCTTAGGGATGTTCTCCAAGATCTCCAAAGCAAATCACTTTTTCCTGATTGGCGCCGCACTCAATCATGGAAGCTCCTCAAGGAGCTCGCTAATTCAGCTCAGCATAAAGCTATAGCACGTAAGATAAGCCAGCCAAAGGCTGTTCAAGGTGTTTTAGGTATGGACCTGGAGAAGGCCAAGGCCATACAGAACAGGATTGATGAGTTCGCAAACCGCATGTCTGAATTACTTCGCATTGAGAGAGATTCCGAGTTGGAGTTTACACAAGAGGAGTTGGATGCAGTTCCTACACCAGATGAGGGTTCGGATCTTTCCAAACCCATCGAGTTCTTAGTCCATGGCCAAGCTCAGCAGGAACTCTGTGACACTATATGCAATTTGAATGCAGTTAGCACGTCTACAGGATTGGGGGGGATGCATTTGGTATTATTCAGGGTTGAAGGAACCCATAGATTACCGCCTACAACCCTTTCGCCAGGAGATATGGTTTGTGTGAGAGTTTGTGATAGCAGGGGTGCCGGTGCAACTTCTTGCATGCAAGGATTTGTGAACAATCTGGGGGACGATGGTTGCAGCATTACCATAGCCCTAGAATCTCGTCATGGTGACCCTACCTTTTCTAAGCTCTTTGGAAAGTCCGTGCGTATTGATCGTATTCCAGGATTAGCTGACACTCTCACTTATGAGCGCAACTGTGAAGCTTTGATGTTGCTTCAGAGAAATGGTTTGCAAAAGAAAAATCCTTCTATTGCTGTAGTGGCTACATTATTTGGTGATAAAGAAGACATCAAGTGGATGGAAGAGAATGACTTGATAGATTTGGCTGAAACCAACATGGATGGCATCATGCTCAATGGAGATTTTGATAATTCACAAAGAAGTGCAATTTCTCATGCTTTGAATAAAAAGCGGCCTATATTGATAATCCAAGGGCCACCTGGTACTGGAAAGACAGGTTTGCTAAAGGAACTTATTGCACTTGCTGTTCAACAGGGTGAAAGGGTGCTTGTAACAGCACCCACTAATGCAGCTGTTGACAACATGGTTGAAAAACTCTCAAACATTGGGATAAACACTGTTAGGGTAGGAAATCCAGCACGGATATCTTCAAGTGTTGCATCCAAGTCTTTGGCTGAAATTGTAAACTCTAAACTTGCAAGTTTTAGAACAGAGTTTGAAAGGAAGAAGGCAGATCTAAGGAAAGACTTGAGACATTGTTTAAAGGATGACTCATTGGCTGCTGGCATACGCCAGCTTCTGAAGCAGCTTGGAAAGACACTAAAAAAGAAAGAGAAGGAAACTGTGAAGGAAGTACTCTCAAGTGCCCAAGTTGTTCTTGCCACCAACACTGGAGCGGCTGATCCTTTAATTCGTAGGTTGGA
GAAATTTGATCTAGTTGTTATAGATGAGGCGGGTCAGGCGATTGAACCTTCTTGCTGGATTCCAATATTACAGGGATCCCGTTGTATTCTTGCAGGTGATCAATGCCAACTTGCTCCTGTGATCTTGTCTAGAAAAGCCTTGGAAGGTGGTCTTGGAGTGTCATTGCTGGAGCGGGCTGCAACCTTGCATCAGGAGACTCTAACCACTATGTTAACCATACAGTACCGTATGAATGATGCAATAGCTAGTTGGGCTTCAAAGGAGATGTATGGCGGAATGTTGAAGTCCTCATCAACAGTCTCTTCTCATCTTCTTGTAAACTCTCCATTTGTCAAGCCCACATGGATAACCCAGTGCCCCTTGCTATTGCTCGACACTAGAATGCCATATGGAAGTTTGTCCGTTGGTTGTGAAGAGCACTTAGATCCAGCTGGTACAGGTTCATTATACAATGAAGGCGAGGCAGATATTGTCGTGCAACATGTCTGCTCTTTGATTTATTCTGGCGTCAGTCCAAGAGCAATTGCAGTCCAATCTCCTTATGTTGCTCAGGTACAACTTTTGAGGAACAAGCTCGACGAAATTCCCGAAGCTGCTGGCATTGAGGTAGCCACCATTGATAGCTTCCAAGGGCGGGAGGCGGATGCAGTAATCATATCAATGGTAAGGTCAAACAATCTTGGAGCTGTCGGGTTTCTGGGAGACAGTCGGCGGATGAACGTGGCCATAACAAGGGCAAGAAAACACGTAGCGGTGGTCTGCGATAGCTCGACAATATGTCAAAATACCTTCTTGGCGAGGCTACTGCGTCACATACGATATTTTGGAAGAGTGAAGCATGCAGAACCAGGTACTTTTGGAGGATCTGGTCTTGGAATGAATCCAATGTTGCCATCCATTAAT

Protein sequence

MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNSLEISSPSTGNLAAAAKISISTGSLVGAETEAQPKRPPPGELEGKRDNRSVNVRGIYQNGDPLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAVPMPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSAQHKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELDAVPTPDEGSDLSKPIEFLVHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRLPPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLIDLAETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEFERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAADPLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGLGMNPMLPSIN
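
As a quick consistency check, the start of the coding sequence can be translated and compared with the start of the protein sequence. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly; `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed: table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nucleotides of the CDS listed above.
cds_start = "ATGAACGCCACAACATCGATCCACCTGTTTCGCCAG"
peptide_start = translate(cds_start)
print(peptide_start)
```

The result, `MNATTSIHLFRQ`, matches the first twelve residues of the protein sequence. The same function could be applied to the full CDS once its lines are joined.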
Homology
BLAST of MS017824 vs. NCBI nr
Match: XP_022157125.1 (DNA-binding protein SMUBP-2 [Momordica charantia])

HSP 1 Score: 1885.9 bits (4884), Expect = 0.0e+00
Identity = 965/968 (99.69%), Positives = 966/968 (99.79%), Query Frame = 0

Query: 1   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 60
           MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS
Sbjct: 7   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 66

Query: 61  LEISSPSTGNLAAAAKISISTGSLVGAETEAQPKRPPPGELEGKRDNRSVNVRGIYQNGD 120
           LEISSPSTGNLAAAAKISISTGS VGAETEAQPKRPPPGELEGKRDNRSVNVRGIYQNGD
Sbjct: 67  LEISSPSTGNLAAAAKISISTGSSVGAETEAQPKRPPPGELEGKRDNRSVNVRGIYQNGD 126

Query: 121 PLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAVP 180
           PLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAVP
Sbjct: 127 PLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAVP 186

Query: 181 MPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSAQ 240
           MPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSAQ
Sbjct: 187 MPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSAQ 246

Query: 241 HKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELDA 300
           HKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELDA
Sbjct: 247 HKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELDA 306

Query: 301 VPTPDEGSDLSKPIEFLVHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRLPP 360
           VPTPDEGSDLSKPIEFLVHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRLPP
Sbjct: 307 VPTPDEGSDLSKPIEFLVHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRLPP 366

Query: 361 TTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGKSV 420
           TTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGKSV
Sbjct: 367 TTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGKSV 426

Query: 421 RIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLIDL 480
           RIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLIDL
Sbjct: 427 RIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLIDL 486

Query: 481 AETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGER 540
           AETNMDGI+LNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGER
Sbjct: 487 AETNMDGIVLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGER 546

Query: 541 VLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEFER 600
           VLVTAPTNAAVDNMVEKLSNIGIN VRVGNPARISSSVASKSLAEIVNSKLASFRTEFER
Sbjct: 547 VLVTAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNSKLASFRTEFER 606

Query: 601 KKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAAD 660
           KKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAAD
Sbjct: 607 KKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAAD 666

Query: 661 PLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGV 720
           PLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGV
Sbjct: 667 PLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGV 726

Query: 721 SLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKP 780
           SLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKP
Sbjct: 727 SLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKP 786

Query: 781 TWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPR 840
           TWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPR
Sbjct: 787 TWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPR 846

Query: 841 AIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFL 900
           AIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFL
Sbjct: 847 AIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFL 906

Query: 901 GDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGLGM 960
           GDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGLGM
Sbjct: 907 GDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGLGM 966

Query: 961 NPMLPSIN 969
           NPMLPSIN
Sbjct: 967 NPMLPSIN 974
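
The percentages in each HSP header follow directly from the match counts: matched columns divided by alignment length. A minimal sketch that recomputes them from a header line is given below; the regular expression and the function name `check_hsp_percentages` are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches fields such as "Identity = 965/968 (99.69%)" with any label.
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_percentages(line):
    """Return {label: (recomputed percent, reported percent)} for one line."""
    results = {}
    for label, matched, length, reported in HSP_FIELD.findall(line):
        computed = round(100 * int(matched) / int(length), 2)
        results[label] = (computed, float(reported))
    return results


header = "Identity = 965/968 (99.69%), Positives = 966/968 (99.79%), Query Frame = 0"
stats = check_hsp_percentages(header)
for label, (computed, reported) in stats.items():
    print(label, computed, reported)
```

For the hit above, 965 identical columns out of 968 give 99.69% identity, in agreement with the reported value.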

BLAST of MS017824 vs. NCBI nr
Match: XP_038906929.1 (DNA-binding protein SMUBP-2 [Benincasa hispida])

HSP 1 Score: 1724.5 bits (4465), Expect = 0.0e+00
Identity = 892/970 (91.96%), Positives = 922/970 (95.05%), Query Frame = 0

Query: 1   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 60
           M A TSIHLFRQN TAVTVAFQQFVQTING NH    PSG Q RRIRVVKT KNVKKPN 
Sbjct: 1   MTALTSIHLFRQNHTAVTVAFQQFVQTINGVNH----PSGAQ-RRIRVVKTKKNVKKPNI 60

Query: 61  LEISSPSTGNLAAAAKISISTGSLVGAETEAQPKRPPPGELEGKRDN-RSVNVRGIYQNG 120
           LE+SSPST     AAKIS+ST   + +ET+AQPKR PPGE E K+ N R VNV+GIYQNG
Sbjct: 61  LEVSSPST-----AAKISVSTSGSLASETKAQPKRLPPGESEKKKKNDREVNVQGIYQNG 120

Query: 121 DPLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAV 180
           DPLGRRELGK VVRWIGQAM+AMASDFASAEVQGDFSELRQRMG GLTFVIQAQPYLNAV
Sbjct: 121 DPLGRRELGKCVVRWIGQAMQAMASDFASAEVQGDFSELRQRMGPGLTFVIQAQPYLNAV 180

Query: 181 PMPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSA 240
           PMPLGLEAVCLKA THYPTLFDHFQRELRDVLQDLQ KSLF DWR TQSWKLLKELANS 
Sbjct: 181 PMPLGLEAVCLKASTHYPTLFDHFQRELRDVLQDLQRKSLFLDWRETQSWKLLKELANSV 240

Query: 241 QHKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELD 300
           QHKAIARKISQPKAVQGVLGM+LEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEEL+
Sbjct: 241 QHKAIARKISQPKAVQGVLGMNLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELN 300

Query: 301 AVPTPDEGSDLSKPIEFLV-HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRL 360
           AVPTPDE SD SKPIEFLV HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEG HRL
Sbjct: 301 AVPTPDESSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRL 360

Query: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGK 420
           PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSIT+ALESRHGDPTFSKLFGK
Sbjct: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGK 420

Query: 421 SVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLI 480
           +VRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWME+N+LI
Sbjct: 421 TVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEDNNLI 480

Query: 481 DLAETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG 540
           DLA+TN++GI+LNGDFD+SQ+SAISHALNKKRPILIIQGPPGTGKTGLLKELI LAVQQG
Sbjct: 481 DLADTNLNGIVLNGDFDDSQKSAISHALNKKRPILIIQGPPGTGKTGLLKELIVLAVQQG 540

Query: 541 ERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEF 600
           ERVLVTAPTNAAVDNMVEKLSN+GIN VRVGNPARISSSVASKSLAEIVNSKLASFRT+ 
Sbjct: 541 ERVLVTAPTNAAVDNMVEKLSNVGINIVRVGNPARISSSVASKSLAEIVNSKLASFRTDI 600

Query: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGA 660
           ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGK+LKKKEKETVKEVLS+AQVVLATNTGA
Sbjct: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKSLKKKEKETVKEVLSNAQVVLATNTGA 660

Query: 661 ADPLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGL 720
           ADPLIR+LEKFDLVVIDEAGQAIEP+CWIPILQG RCILAGDQCQLAPVILSRKALEGGL
Sbjct: 661 ADPLIRKLEKFDLVVIDEAGQAIEPACWIPILQGRRCILAGDQCQLAPVILSRKALEGGL 720

Query: 721 GVSLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFV 780
           GVSLLERA+TLHQ  LTTMLTIQYRMNDAIASWASKEMY GMLKSS TVSSHLLVNSPFV
Sbjct: 721 GVSLLERASTLHQGALTTMLTIQYRMNDAIASWASKEMYDGMLKSSPTVSSHLLVNSPFV 780

Query: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVS 840
           KPTWITQCPLLLLDTRMPYGSLS GCEE+LDPAGTGSLYNEGEADIVVQHVCSLIYSGVS
Sbjct: 781 KPTWITQCPLLLLDTRMPYGSLSAGCEEYLDPAGTGSLYNEGEADIVVQHVCSLIYSGVS 840

Query: 841 PRAIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900
           PRAIAVQSPYVAQVQLLRN+LDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG
Sbjct: 841 PRAIAVQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900

Query: 901 FLGDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGL 960
           FLGDSRRMNVAITRARKHVA+VCDSSTICQNTFLARLLRHIRYFGRVKHAEPG FGGSGL
Sbjct: 901 FLGDSRRMNVAITRARKHVALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGL 960

Query: 961 GMNPMLPSIN 969
           GMNPMLPSIN
Sbjct: 961 GMNPMLPSIN 960

BLAST of MS017824 vs. NCBI nr
Match: XP_022995943.1 (DNA-binding protein SMUBP-2-like [Cucurbita maxima])

HSP 1 Score: 1701.4 bits (4405), Expect = 0.0e+00
Identity = 875/970 (90.21%), Positives = 918/970 (94.64%), Query Frame = 0

Query: 1   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 60
           MNA TSI LFRQN TAVTV+FQQFVQT+N ANH    PSG Q +R+RVVK+ KNVKKPN 
Sbjct: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANH----PSGAQ-KRVRVVKSKKNVKKPNI 60

Query: 61  LEISSPSTGNLAAAAKISISTGSLVGAETEAQPKRPPPGELEG-KRDNRSVNVRGIYQNG 120
           LE+SSPST N +A A+ISIST   VG+E +A+PKR P GE EG K+ +R+VN+ GIYQNG
Sbjct: 61  LEVSSPSTANRSAGARISISTSGSVGSEMKARPKRSPLGEQEGKKKSDRAVNLHGIYQNG 120

Query: 121 DPLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAV 180
           DPLGRRELGKSVV+WIGQAM+AMASDFASA+V GDFSELRQ+MG GLTFVIQAQPYLNAV
Sbjct: 121 DPLGRRELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAV 180

Query: 181 PMPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSA 240
           PMPLGLEAVCLKA THYPTLFDHFQRELRD LQDLQSKSL  DWR TQSWKLLKELANSA
Sbjct: 181 PMPLGLEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSA 240

Query: 241 QHKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELD 300
           QHKAIARKISQPKAVQG LGMDLEKAKA+Q+RIDEF NRMSELLRIERDSELEFTQEEL+
Sbjct: 241 QHKAIARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELN 300

Query: 301 AVPTPDEGSDLSKPIEFLV-HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRL 360
           AVPTPDEGSD SKPIEFLV HGQAQQELCDTICNLNAVST TGLGGMHLVLFRVEG HRL
Sbjct: 301 AVPTPDEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRL 360

Query: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGK 420
           PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSIT+ALESRHGDPTFSKLFGK
Sbjct: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGK 420

Query: 421 SVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLI 480
           SVRIDRIPGLADTLTYERNCEALMLLQ+NGL+KKNPS AVVATLFGD+EDIKWME+N+LI
Sbjct: 421 SVRIDRIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDIKWMEDNNLI 480

Query: 481 DLAETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG 540
           DLA TN++ I+LNGDFD+SQ+ AIS ALNKKRPILI+QGPPGTGKTGLLKELIALAVQQG
Sbjct: 481 DLAHTNLNDIVLNGDFDDSQKGAISFALNKKRPILIVQGPPGTGKTGLLKELIALAVQQG 540

Query: 541 ERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEF 600
           ERVLVTAPTNAAVDNMVEKLSNIGIN VRVGNPARISSSVASKSLAEIVN+KLASFRT+ 
Sbjct: 541 ERVLVTAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDI 600

Query: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGA 660
           ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKE+LS+AQVVLATNTGA
Sbjct: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEILSNAQVVLATNTGA 660

Query: 661 ADPLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGL 720
           ADPLIR LEKFDLVVIDEAGQAIEP+CWIPILQG RCILAGDQCQLAPVILSRKALEGGL
Sbjct: 661 ADPLIRTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGL 720

Query: 721 GVSLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFV 780
           GVSLLERA+TLHQ TLT MLTIQYRMNDAIASWASKEMYGGMLKSS TVSSHLLVNSPFV
Sbjct: 721 GVSLLERASTLHQGTLTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFV 780

Query: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVS 840
           KPTWITQCPLLLLDTRMPYGSLSVGCEEHLD AGTGSLYNEGEADIVVQHVCSLIYSGVS
Sbjct: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVS 840

Query: 841 PRAIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900
           PRAIAVQSPYVAQVQLLRN+LDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG
Sbjct: 841 PRAIAVQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900

Query: 901 FLGDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGL 960
           FLGDSRRMNVAITRARKH+A+VCDSSTICQNTFLARLLRHIRYFGRVKHAEPG FGGSGL
Sbjct: 901 FLGDSRRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGL 960

Query: 961 GMNPMLPSIN 969
           GMNPMLPSIN
Sbjct: 961 GMNPMLPSIN 965

BLAST of MS017824 vs. NCBI nr
Match: XP_023533963.1 (DNA-binding protein SMUBP-2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1700.6 bits (4403), Expect = 0.0e+00
Identity = 875/970 (90.21%), Positives = 917/970 (94.54%), Query Frame = 0

Query: 1   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 60
           MNA TSI LFRQN TAVTV+FQQFVQT+N ANH    PSG Q +R+RVVK+ KNVKKPN 
Sbjct: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANH----PSGAQ-KRVRVVKSKKNVKKPNI 60

Query: 61  LEISSPSTGNLAAAAKISISTGSLVGAETEAQPKRPPPGELEG-KRDNRSVNVRGIYQNG 120
           LE+SSPST N +A A+ISIST   VG+ET+A+PKR P GE EG K+ +R+VN+ GIYQNG
Sbjct: 61  LEVSSPSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNG 120

Query: 121 DPLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAV 180
           DPLGRRELGKSVV+WIGQAM+AMASDFASA+V GDFSELRQ+MG GLTFVIQAQPYLNAV
Sbjct: 121 DPLGRRELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAV 180

Query: 181 PMPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSA 240
           PMPLGLEAVCLKA THYPTLFDHFQRELRD LQDLQSKSL  DWR TQSWKLLKELANSA
Sbjct: 181 PMPLGLEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSA 240

Query: 241 QHKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELD 300
           QHKAIARKISQPKAVQG LGMDLEKAKA+Q+RIDEF NRMSELLRIERDSELEFTQEEL+
Sbjct: 241 QHKAIARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELN 300

Query: 301 AVPTPDEGSDLSKPIEFLV-HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRL 360
           AVPTPDEGSD SKPIEFLV HGQAQQELCDTICNLNAVST TGLGGMHLVLFRVEG HRL
Sbjct: 301 AVPTPDEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRL 360

Query: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGK 420
           PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSIT+ALESRHGDPTFSKLFGK
Sbjct: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGK 420

Query: 421 SVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLI 480
           SVRIDRIPGLADTLTYERNCEALMLLQ+NGL+KKNPS AVVATLFGD+ED+KWME+N+LI
Sbjct: 421 SVRIDRIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLI 480

Query: 481 DLAETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG 540
           DLA TN++ I+LNGDFD+SQ+ AIS ALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG
Sbjct: 481 DLAHTNLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG 540

Query: 541 ERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEF 600
           ERVLVTAPTNAAVDNMVEKLSNIGIN VRVGNPARISSSVASKSLAEIVN+KLASFRT+ 
Sbjct: 541 ERVLVTAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDI 600

Query: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGA 660
           ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLS+AQVVLATNTGA
Sbjct: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGA 660

Query: 661 ADPLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGL 720
           ADPLIR LEKFDLVVIDEAGQAIEP+CWIPILQG RCILAGDQCQLAPVILSRKALEGGL
Sbjct: 661 ADPLIRTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGL 720

Query: 721 GVSLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFV 780
           GVSLLERA+TLHQ  LT MLTIQYRMNDAIASWASKEMYGGMLKSS TVSSHLLVNSPFV
Sbjct: 721 GVSLLERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFV 780

Query: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVS 840
           KPTWITQCPLLLLDTRMPYGSLSVGCEEHLD AGTGSLYNEGEADIVVQHVCSLIYSGVS
Sbjct: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVS 840

Query: 841 PRAIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900
           PRAIAVQSPYVAQVQLLRN+LDEIPEA GIEVATIDSFQGREADAVIISMVRSNNLGAVG
Sbjct: 841 PRAIAVQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900

Query: 901 FLGDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGL 960
           FLGDSRRMNVAITRARKH+A+VCDSSTICQNTFLARLLRHIRYFGRVKHAEPG FGGSGL
Sbjct: 901 FLGDSRRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGL 960

Query: 961 GMNPMLPSIN 969
           GMNPMLPSIN
Sbjct: 961 GMNPMLPSIN 965
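
The three-row layout used in these alignments (Query, match line, Sbjct) can be tallied programmatically. The following is a minimal Python sketch, assuming the usual BLAST protein convention that the middle row shows a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The function name `midline_counts` is hypothetical and is not part of any BLAST tool.

```python
def midline_counts(midline):
    """Return (identities, positives, columns) for one BLAST match line.

    Assumes the match line is passed with its original width, including
    any trailing spaces (these are easily lost when copying from a page).
    """
    identities = sum(1 for ch in midline if ch.isalpha())  # identical residues
    conservative = midline.count("+")                      # similar residues
    # "Positives" in BLAST output includes the identical positions.
    return identities, identities + conservative, len(midline)


# First ten columns of the Query 481 block (Query DLAETNMDGI vs. Sbjct DLAHTNLNDI).
partial = midline_counts("DLA TN++ I")

# Final block of the alignment (Query 961), where every column is identical.
last_block = midline_counts("GMNPMLPSIN")
```

Summing these per-block counts over a whole alignment should give the numerators reported in the corresponding HSP summary line, provided the match lines are intact.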

BLAST of MS017824 vs. NCBI nr
Match: XP_022958504.1 (DNA-binding protein SMUBP-2-like [Cucurbita moschata])

HSP 1 Score: 1699.9 bits (4401), Expect = 0.0e+00
Identity = 875/970 (90.21%), Positives = 917/970 (94.54%), Query Frame = 0

Query: 1   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 60
           MNA TSI LFRQN  AVTV+FQQFVQT+N ANH    PSG Q +R+RVVK+ KNVKKPN 
Sbjct: 1   MNAPTSIPLFRQNHIAVTVSFQQFVQTVNDANH----PSGAQ-KRVRVVKSKKNVKKPNI 60

Query: 61  LEISSPSTGNLAAAAKISISTGSLVGAETEAQPKRPPPGELEG-KRDNRSVNVRGIYQNG 120
           LE+SSPST N +A A+ISIST   +G+ET+A+PKR P GE EG K+ +R+VN+ GIYQNG
Sbjct: 61  LEVSSPSTANRSAGARISISTSGSIGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNG 120

Query: 121 DPLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAV 180
           DPLGRRELGKSVV+WIGQAM+AMASDFASA+V GDFSELRQ+MG GLTFVIQAQPYLNAV
Sbjct: 121 DPLGRRELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAV 180

Query: 181 PMPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSA 240
           PMPLGLEAVCLKA THYPTLFDHFQRELRD LQDLQSKSL  DWR TQSWKLLKELANSA
Sbjct: 181 PMPLGLEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSA 240

Query: 241 QHKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELD 300
           QHKAIARKISQPKAVQG LGMDLEKAKA+Q+RIDEF NRMSELLRIERDSELEFTQEEL+
Sbjct: 241 QHKAIARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELN 300

Query: 301 AVPTPDEGSDLSKPIEFLV-HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRL 360
           AVPTPDEGSD SKPIEFLV HGQAQQELCDTICNLNAVST TGLGGMHLVLFRVEG HRL
Sbjct: 301 AVPTPDEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRL 360

Query: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGK 420
           PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSIT+ALESRHGDPTFSKLFGK
Sbjct: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGK 420

Query: 421 SVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLI 480
           SVRIDRIPGLADTLTYERNCEALMLLQ+NGL+KKNPS AVVATLFGD+EDIKWME+N+LI
Sbjct: 421 SVRIDRIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDIKWMEDNNLI 480

Query: 481 DLAETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG 540
           DLA TN++ I+LNGDFD+SQ+ AIS ALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG
Sbjct: 481 DLAHTNLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG 540

Query: 541 ERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEF 600
           ERVLVTAPTNAAVDNMVEKLSNIGIN VRVGNPARISSSVASKSLAEIVN+KLASFRT+ 
Sbjct: 541 ERVLVTAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDI 600

Query: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGA 660
           ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLS+AQVVLATNTGA
Sbjct: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGA 660

Query: 661 ADPLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGL 720
           ADPLIR LEKFDLVVIDEAGQAIEP+CWIPILQG RCILAGDQCQLAPVILSRKALEGGL
Sbjct: 661 ADPLIRTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGL 720

Query: 721 GVSLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFV 780
           GVSLLERA+TLHQ  LT MLTIQYRMNDAIASWASKEMYGGMLKSS TVSSHLLVNSPFV
Sbjct: 721 GVSLLERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFV 780

Query: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVS 840
           KPTWITQCPLLLLDTRMPYGSLSVGCEEHLD AGTGSLYNEGEADIVVQHVCSLIYSGVS
Sbjct: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVS 840

Query: 841 PRAIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900
           PRAIAVQSPYVAQVQLLRN+LDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG
Sbjct: 841 PRAIAVQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900

Query: 901 FLGDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGL 960
           FLGDSRRMNVAITRARKH+A+VCDSSTICQNTFLARLLRHIRYFGRVKHAEPG FGGSGL
Sbjct: 901 FLGDSRRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGL 960

Query: 961 GMNPMLPSIN 969
           GMNPMLPSIN
Sbjct: 961 GMNPMLPSIN 965
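
Each HSP summary line reports its percentages as a count divided by the alignment length. As a hedged illustration, the sketch below parses such a line and recomputes both percentages. The helper name `parse_hsp_stats` and the returned dictionary keys are assumptions made for this example, and the pattern accepts either spelling of the positives label.

```python
import re

# Matches e.g. "Identity = 875/970 (90.21%), Positives = 917/970 (94.54%)".
# "Posi?tives" tolerates a label spelled without the second "i".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 875/970 (90.21%), Postives = 917/970 (94.54%), Query Frame = 0"
)
```

For the Cucurbita moschata hit above, 875/970 and 917/970 recompute to the reported 90.21% and 94.54%.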

BLAST of MS017824 vs. ExPASy Swiss-Prot
Match: P38935 (DNA-binding protein SMUBP-2 OS=Homo sapiens OX=9606 GN=IGHMBP2 PE=1 SV=3)

HSP 1 Score: 352.1 bits (902), Expect = 2.1e-95
Identity = 259/693 (37.37%), Positives = 368/693 (53.10%), Query Frame = 0

Query: 272 IDEFANRMSELLRIERDSELE---FTQEELDAVPTPDEGSDLSKPIEFLVHGQAQQELCD 331
           ++ F  +  +LL +ERD+E+E     QE +        G  L K                
Sbjct: 6   VESFVTKQLDLLELERDAEVEERRSWQENISLKELQSRGVCLLK---------------- 65

Query: 332 TICNLNAVSTSTGLGGMHLVLF---RVEGTHRLPPTTLSPGDMVCVRVCDSRGAGATSCM 391
               L   S  TGL G  LV F   R      LP  + + GD+V +    + G+   +  
Sbjct: 66  ----LQVSSQRTGLYGRLLVTFEPRRYGSAAALPSNSFTSGDIVGLYDAANEGSQLAT-- 125

Query: 392 QGFVNNLGDDGCSITIALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQ 451
            G +  +     S+T+A +  H D   S     S R+ +   LA+ +TY R  +AL+ L+
Sbjct: 126 -GILTRVTQK--SVTVAFDESH-DFQLSLDRENSYRLLK---LANDVTYRRLKKALIALK 185

Query: 452 RNGLQKKNPSIAVVATLFGDKEDIKWMEENDLIDLAETNMDGIMLNGDFDNSQRSAISHA 511
           +       P+ +++  LFG        E + L             N   D SQ+ A+  A
Sbjct: 186 K---YHSGPASSLIEVLFGRSAPSPASEIHPL----------TFFNTCLDTSQKEAVLFA 245

Query: 512 LNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINT 571
           L++K  + II GPPGTGKT  + E+I  AV+QG +VL  AP+N AVDN+VE+L+      
Sbjct: 246 LSQKE-LAIIHGPPGTGKTTTVVEIILQAVKQGLKVLCCAPSNIAVDNLVERLALCKQRI 305

Query: 572 VRVGNPARISSSVASKSLAEIVNSKLASFRTEFERKKADLRKDLRHCL------KDDSLA 631
           +R+G+PAR+  S+   SL  ++       R++  +  AD+RKD+          +D    
Sbjct: 306 LRLGHPARLLESIQQHSLDAVL------ARSDSAQIVADIRKDIDQVFVKNKKTQDKREK 365

Query: 632 AGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGA-ADPLIRRLEK--FDLVVIDE 691
           +  R  +K L K LK++E+  + E L+SA VVLATNTGA AD  ++ L +  FD+VVIDE
Sbjct: 366 SNFRNEIKLLRKELKEREEAAMLESLTSANVVLATNTGASADGPLKLLPESYFDVVVIDE 425

Query: 692 AGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHQETLTT 751
             QA+E SCWIP+L+  +CILAGD  QL P  +S KA   GL +SL+ER A  +   +  
Sbjct: 426 CAQALEASCWIPLLKARKCILAGDHKQLPPTTVSHKAALAGLSLSLMERLAEEYGARVVR 485

Query: 752 MLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKPTWITQCPLLLLDTRMP 811
            LT+QYRM+ AI  WAS  MY G L + S+V+ HLL + P V  T  T  PLLL+DT   
Sbjct: 486 TLTVQYRMHQAIMRWASDTMYLGQLTAHSSVARHLLRDLPGVAATEETGVPLLLVDT--- 545

Query: 812 YGSLSVGCE-EHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLL 871
                 GC    L+     S  N GE  +V  H+ +L+ +GV  R IAV SPY  QV LL
Sbjct: 546 -----AGCGLFELEEEDEQSKGNPGEVRLVSLHIQALVDAGVPARDIAVVSPYNLQVDLL 605

Query: 872 RNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARK 931
           R  L  +     +E+ ++D FQGRE +AVI+S VRSN  G VGFL + RR+NVA+TRAR+
Sbjct: 606 RQSL--VHRHPELEIKSVDGFQGREKEAVILSFVRSNRKGEVGFLAEDRRINVAVTRARR 639

Query: 932 HVAVVCDSSTICQNTFLARLLRHIRYFGRVKHA 949
           HVAV+CDS T+  + FL  L+ +    G V+ A
Sbjct: 666 HVAVICDSRTVNNHAFLKTLVEYFTQHGEVRTA 639

BLAST of MS017824 vs. ExPASy Swiss-Prot
Match: Q60560 (DNA-binding protein SMUBP-2 OS=Mesocricetus auratus OX=10036 GN=IGHMBP2 PE=1 SV=1)

HSP 1 Score: 346.3 bits (887), Expect = 1.1e-93
Identity = 251/691 (36.32%), Positives = 366/691 (52.97%), Query Frame = 0

Query: 272 IDEFANRMSELLRIERDSELEFTQEELDAVPTPDEGSDLSKPIEFLVHGQAQQELCDTIC 331
           ++ F  +  ELL +ERD+E+E  +       +  E S L +           + +C  + 
Sbjct: 6   VESFVAQQLELLELERDAEVEERR-------SWQEHSSLKE--------LQSRGVC--LL 65

Query: 332 NLNAVSTSTGLGGMHLVLF---RVEGTHRLPPTTLSPGDMVCVRVCDSRGAGATSCMQGF 391
            L   S  TGL G  LV F   ++     LP  + + GD+V +   +     AT  +   
Sbjct: 66  KLQVSSQCTGLYGQRLVTFEPRKLGPVVVLPSNSFTSGDIVGLYDANESSQLATGVLTRI 125

Query: 392 VNNLGDDGCSITIALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQRNG 451
                    S+T+A +  H      +L        R+  LA+ +TY+R  +ALM L++  
Sbjct: 126 TQK------SVTVAFDESHD----FQLNLDRENTYRLLKLANDVTYKRLKKALMTLKK-- 185

Query: 452 LQKKNPSIAVVATLFGDKEDIKWMEENDLIDLAETNMDGIMLNGDFDNSQRSAISHALNK 511
                P+ +++  L G        E                 N   D SQ+ A+S AL +
Sbjct: 186 -YHSGPASSLIDVLLGGSSPSPTTEIPPF----------TFYNTALDPSQKEAVSFALAQ 245

Query: 512 KRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINTVRV 571
           K  + II GPPGTGKT  + E+I  AV+QG ++L  AP+N AVDN+VE+L+      +R+
Sbjct: 246 KE-VAIIHGPPGTGKTTTVVEIILQAVKQGLKILCCAPSNVAVDNLVERLALCKKRILRL 305

Query: 572 GNPARISSSVASKSLAEIVNSKLASFRTEFERKKADLRKDLRHCL------KDDSLAAGI 631
           G+PAR+  S    SL  ++       R++  +  AD+RKD+          +D    +  
Sbjct: 306 GHPARLLESAQQHSLDAVL------ARSDNAQIVADIRKDIDQVFGKNKKTQDKREKSNF 365

Query: 632 RQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGA-ADPLIRRLEK--FDLVVIDEAGQ 691
           R  +K L K LK++E+  + + L++A VVLATNTGA +D  ++ L +  FD+VV+DE  Q
Sbjct: 366 RNEIKLLRKELKEREEAAIVQSLTAADVVLATNTGASSDGPLKLLPENHFDVVVVDECAQ 425

Query: 692 AIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHQETLTTMLT 751
           A+E SCWIP+L+  +CILAGD  QL P  +S KA   GL  SL+ER    H      MLT
Sbjct: 426 ALEASCWIPLLKAPKCILAGDHRQLPPTTISHKAALAGLSRSLMERLVEKHGAGAVRMLT 485

Query: 752 IQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYGS 811
           +QYRM+ AI  WAS+ MY G L +  +V+ HLL + P V  T  T  PLLL+DT      
Sbjct: 486 VQYRMHQAITRWASEAMYHGQLTAHPSVAGHLLKDLPGVADTEETSVPLLLIDT------ 545

Query: 812 LSVGCE-EHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRNK 871
              GC    LD   + S  N GE  +V  H+ +L+ +GV    IAV +PY  QV LLR  
Sbjct: 546 --AGCGLLELDEEDSQSKGNPGEVRLVTLHIQALVDAGVHAGDIAVIAPYNLQVDLLRQS 605

Query: 872 L-DEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHV 931
           L ++ PE   +E+ ++D FQGRE +AVI++ VRSN  G VGFL + RR+NVA+TRAR+HV
Sbjct: 606 LSNKHPE---LEIKSVDGFQGREKEAVILTFVRSNRKGEVGFLAEDRRINVAVTRARRHV 638

Query: 932 AVVCDSSTICQNTFLARLLRHIRYFGRVKHA 949
           AV+CDS T+  + FL  L+ +    G V+ A
Sbjct: 666 AVICDSRTVNNHAFLKTLVDYFTEHGEVRTA 638

BLAST of MS017824 vs. ExPASy Swiss-Prot
Match: P40694 (DNA-binding protein SMUBP-2 OS=Mus musculus OX=10090 GN=Ighmbp2 PE=1 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 1.9e-93
Identity = 249/693 (35.93%), Positives = 366/693 (52.81%), Query Frame = 0

Query: 272 IDEFANRMSELLRIERDSELEFTQEELDAVPTPDEGSDLSKPIEFLVHGQAQQELCDTIC 331
           ++ F  +  +LL +ERD+E+E  +                    +  H   ++     +C
Sbjct: 6   VESFVAQQLQLLELERDAEVEERR-------------------SWQEHSSLRELQSRGVC 65

Query: 332 --NLNAVSTSTGLGGMHLVLF---RVEGTHRLPPTTLSPGDMVCVRVCDSRGAGATSCMQ 391
              L   S  TGL G  LV F   +      LP  + + GD+V +   +     AT  + 
Sbjct: 66  LLKLQVSSQRTGLYGQRLVTFEPRKFGPAVVLPSNSFTSGDIVGLYDTNENSQLATGVLT 125

Query: 392 GFVNNLGDDGCSITIALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQR 451
                      S+T+A +  H      +L        R+  LA+ +TY+R  +ALM L++
Sbjct: 126 RITQK------SVTVAFDESHD----LQLNLDRENTYRLLKLANDVTYKRLKKALMTLKK 185

Query: 452 NGLQKKNPSIAVVATLFGDKEDIKWMEENDLIDLAETNMDGIMLNGDFDNSQRSAISHAL 511
                  P+ +++  L G       ME   L             N   D SQ+ A+S AL
Sbjct: 186 ---YHSGPASSLIDILLGSSTPSPAMEIPPL----------SFYNTTLDLSQKEAVSFAL 245

Query: 512 NKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINTV 571
            +K  + II GPPGTGKT  + E+I  AV+QG +VL  AP+N AVDN+VE+L+      +
Sbjct: 246 AQKE-LAIIHGPPGTGKTTTVVEIILQAVKQGLKVLCCAPSNIAVDNLVERLALCKKRIL 305

Query: 572 RVGNPARISSSVASKSLAEIVNSKLASFRTEFERKKADLRKDLRHCL------KDDSLAA 631
           R+G+PAR+  SV   SL  ++       R++  +  AD+R+D+          +D     
Sbjct: 306 RLGHPARLLESVQHHSLDAVL------ARSDNAQIVADIRRDIDQVFGKNKKTQDKREKG 365

Query: 632 GIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGA-ADPLIRRL--EKFDLVVIDEA 691
             R  +K L K LK++E+  + + L++A VVLATNTGA +D  ++ L  + FD+VV+DE 
Sbjct: 366 NFRSEIKLLRKELKEREEAAIVQSLTAADVVLATNTGASSDGPLKLLPEDYFDVVVVDEC 425

Query: 692 GQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHQETLTTM 751
            QA+E SCWIP+L+  +CILAGD  QL P  +S +A   GL  SL+ER A  H   +  M
Sbjct: 426 AQALEASCWIPLLKAPKCILAGDHRQLPPTTVSHRAALAGLSRSLMERLAEKHGAGVVRM 485

Query: 752 LTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPY 811
           LT+QYRM+ AI  WAS+ MY G   S  +V+ HLL + P V  T  T+ PLLL+DT    
Sbjct: 486 LTVQYRMHQAIMCWASEAMYHGQFTSHPSVAGHLLKDLPGVTDTEETRVPLLLIDT---- 545

Query: 812 GSLSVGCE-EHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLR 871
                GC    L+   + S  N GE  +V  H+ +L+ +GV    IAV +PY  QV LLR
Sbjct: 546 ----AGCGLLELEEEDSQSKGNPGEVRLVTLHIQALVDAGVQAGDIAVIAPYNLQVDLLR 605

Query: 872 NKL-DEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARK 931
             L ++ PE   +E+ ++D FQGRE +AV+++ VRSN  G VGFL + RR+NVA+TRAR+
Sbjct: 606 QSLSNKHPE---LEIKSVDGFQGREKEAVLLTFVRSNRKGEVGFLAEDRRINVAVTRARR 638

Query: 932 HVAVVCDSSTICQNTFLARLLRHIRYFGRVKHA 949
           HVAV+CDS T+  + FL  L+ +    G V+ A
Sbjct: 666 HVAVICDSHTVNNHAFLETLVDYFTEHGEVRTA 638

BLAST of MS017824 vs. ExPASy Swiss-Prot
Match: Q9EQN5 (DNA-binding protein SMUBP-2 OS=Rattus norvegicus OX=10116 GN=Ighmbp2 PE=1 SV=1)

HSP 1 Score: 341.3 bits (874), Expect = 3.6e-92
Identity = 249/691 (36.03%), Positives = 366/691 (52.97%), Query Frame = 0

Query: 272 IDEFANRMSELLRIERDSELEFTQEELDAVPTPDEGSDLSKPIEFLVHGQAQQELCDTIC 331
           ++ F  +  +LL +ERD+E+E  +       +  E S L +           + +C  + 
Sbjct: 6   VESFVAQQLQLLELERDAEVEERR-------SWQEHSSLKE--------LQSRGVC--LL 65

Query: 332 NLNAVSTSTGLGGMHLVLF---RVEGTHRLPPTTLSPGDMVCVRVCDSRGAGATSCMQGF 391
            L      TGL G  LV F   +      LP  + + GD+V +   +     AT  +   
Sbjct: 66  KLQVSGQRTGLYGQRLVTFEPRKFGPAVVLPSNSFTSGDIVGLYDTNESSQLATGVLTRI 125

Query: 392 VNNLGDDGCSITIALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQRNG 451
                    S+ +A +  H      +L        R+  LA+ +TY+R  +AL+ L++  
Sbjct: 126 TQK------SVIVAFDESHD----FQLNLDRENTYRLLKLANDVTYKRLKKALLTLKK-- 185

Query: 452 LQKKNPSIAVVATLFGDKEDIKWMEENDLIDLAETNMDGIMLNGDFDNSQRSAISHALNK 511
                P+ +++  L G        E   L             N   D SQ+ A+S AL +
Sbjct: 186 -YHSGPASSLIDVLLGGSTPSPATEIPPL----------TFYNTTLDPSQKEAVSFALAQ 245

Query: 512 KRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINTVRV 571
           K  + II GPPGTGKT  + E+I  AV+QG +VL  AP+N AVDN+VE+L+      +R+
Sbjct: 246 KE-VAIIHGPPGTGKTTTVVEIILQAVKQGLKVLCCAPSNIAVDNLVERLALCKKQILRL 305

Query: 572 GNPARISSSVASKSLAEIVNSKLASFRTEFERKKADLRKDLRHCL------KDDSLAAGI 631
           G+PAR+  SV   SL  ++       R++  +  AD+R+D+          +D    +  
Sbjct: 306 GHPARLLESVQQHSLDAVL------ARSDNAQIVADIRRDIDQVFGKNKKTQDKREKSNF 365

Query: 632 RQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAA-DPLIRRL--EKFDLVVIDEAGQ 691
           R  +K L K LK++E+  + + LS+A VVLATNTGA+ D  ++ L  + FD+VV+DE  Q
Sbjct: 366 RNEIKLLRKELKEREEAAIVQSLSAADVVLATNTGASTDGPLKLLPEDYFDVVVVDECAQ 425

Query: 692 AIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHQETLTTMLT 751
           A+E SCWIP+L+  +CILAGD  QL P  +S KA   GL  SL+ER A  H   +  ML 
Sbjct: 426 ALEASCWIPLLKAPKCILAGDHKQLPPTTVSHKAALAGLSRSLMERLAEKHGAAVVRMLA 485

Query: 752 IQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYGS 811
           +QYRM+ AI  WAS+ MY G L +  +V+ HLL + P V  T  T  PLLL+DT      
Sbjct: 486 VQYRMHQAITRWASEAMYHGQLTAHPSVAGHLLKDLPGVADTEETSVPLLLIDT------ 545

Query: 812 LSVGCE-EHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRNK 871
              GC    L+   + S  N GE  +V  H+ +L+ +GV    IAV +PY  QV LLR  
Sbjct: 546 --AGCGLLELEEEDSQSKGNPGEVRLVTLHIQALVDAGVQAGDIAVIAPYNLQVDLLRQS 605

Query: 872 L-DEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHV 931
           L ++ PE   +E+ ++D FQGRE +AVI++ VRSN  G VGFL + RR+NVA+TRAR+HV
Sbjct: 606 LSNKHPE---LEIKSVDGFQGREKEAVILTFVRSNRKGEVGFLAEDRRINVAVTRARRHV 638

Query: 932 AVVCDSSTICQNTFLARLLRHIRYFGRVKHA 949
           AV+CDS T+  + FL  L+ +    G V+ A
Sbjct: 666 AVICDSHTVNNHAFLKTLVDYFTEHGEVRTA 638

BLAST of MS017824 vs. ExPASy Swiss-Prot
Match: O94247 (DNA polymerase alpha-associated DNA helicase A OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=hcs1 PE=3 SV=1)

HSP 1 Score: 266.2 bits (679), Expect = 1.5e-69
Identity = 205/654 (31.35%), Positives = 331/654 (50.61%), Query Frame = 0

Query: 288 DSELEFTQEELDAVPTPDEGSDLSKPIEFLVHGQAQQELCDTICNLNAVSTSTGLGGMHL 347
           D E+EF  E   +     E S    P+  L      Q     + NL      TG GG  +
Sbjct: 20  DREIEFVDEAQKSEVDETEKSIKRFPLSVL------QRKGLALINLRIGVVKTGFGGKTI 79

Query: 348 VLFRVE----GTHRLPPTTLSPGDMVCVR-----VCDSRGAGATSCMQGFVNNLGDDGCS 407
           + F  +        LP  + SPGD+V +R         R       ++G V  + +    
Sbjct: 80  IDFEKDPAFSNGEELPANSFSPGDVVSIRQDFQSSKKKRPNETDISVEGVVTRVHER--H 139

Query: 408 ITIALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAV 467
           I++AL+S    P+       SV    +  L + +TYER    ++  +R+  + +N   ++
Sbjct: 140 ISVALKSEEDIPS-------SVTRLSVVKLVNRVTYERMRHTMLEFKRSIPEYRN---SL 199

Query: 468 VATLFGDKEDIKWMEENDLIDLAETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGP 527
             TL G K+    +++  + D+          N + + SQ+ A+  ++  K  + +I GP
Sbjct: 200 FYTLIGRKKADVSIDQKLIGDIK-------YFNKELNASQKKAVKFSIAVKE-LSLIHGP 259

Query: 528 PGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSV 587
           PGTGKT  L E+I   V + +R+LV   +N AVDN+V++LS+ GI  VR+G+PAR+  S+
Sbjct: 260 PGTGKTHTLVEIIQQLVLRNKRILVCGASNLAVDNIVDRLSSSGIPMVRLGHPARLLPSI 319

Query: 588 ASKSLAEIVNSKLASFRTEFERKKADLRKDLRHCL------KDDSLAAGIRQLLKQLGKT 647
              SL       + S   +       + +D+  CL      K+      I + +++L K 
Sbjct: 320 LDHSL------DVLSRTGDNGDVIRGISEDIDVCLSKITKTKNGRERREIYKNIRELRKD 379

Query: 648 LKKKEKETVKEVLSSAQVVLATNTGAADPLIRRLEKFDLVVIDEAGQAIEPSCWIPILQG 707
            +K E +TV  ++S+++VV  T  GA    ++  ++FD V+IDEA QA+EP CWIP+L  
Sbjct: 380 YRKYEAKTVANIVSASKVVFCTLHGAGSRQLKG-QRFDAVIIDEASQALEPQCWIPLLGM 439

Query: 708 SRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHQETLTTMLTIQYRMNDAIASWA 767
           ++ ILAGD  QL+P + S++       +S+ ER      + +   L IQYRM++ I+ + 
Sbjct: 440 NKVILAGDHMQLSPNVQSKRPY-----ISMFERLVKSQGDLVKCFLNIQYRMHELISKFP 499

Query: 768 SKEMYGGMLKSSSTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAG 827
           S   Y   L  +  V   LL++   V+ T +T  P+   DT   Y        E +    
Sbjct: 500 SDTFYDSKLVPAEEVKKRLLMDLENVEETELTDSPIYFYDTLGNY--QEDDRSEDMQNFY 559

Query: 828 TGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRNKLDEIPEAAGIEVAT 887
             S  N  EA IV  H+  L+ +G+  + IAV +PY AQV L+R  L E  +   +E+ +
Sbjct: 560 QDSKSNHWEAQIVSYHISGLLEAGLEAKDIAVVTPYNAQVALIRQLLKE--KGIEVEMGS 619

Query: 888 IDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTI 927
           +D  QGRE +A+I S+VRSN++  VGFL + RR+NVAITR ++H+ V+ DS+T+
Sbjct: 620 VDKVQGREKEAIIFSLVRSNDVREVGFLAEKRRLNVAITRPKRHLCVIGDSNTV 631

BLAST of MS017824 vs. ExPASy TrEMBL
Match: A0A6J1DS82 (DNA-binding protein SMUBP-2 OS=Momordica charantia OX=3673 GN=LOC111023920 PE=3 SV=1)

HSP 1 Score: 1885.9 bits (4884), Expect = 0.0e+00
Identity = 965/968 (99.69%), Positives = 966/968 (99.79%), Query Frame = 0

Query: 1   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 60
           MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS
Sbjct: 7   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 66

Query: 61  LEISSPSTGNLAAAAKISISTGSLVGAETEAQPKRPPPGELEGKRDNRSVNVRGIYQNGD 120
           LEISSPSTGNLAAAAKISISTGS VGAETEAQPKRPPPGELEGKRDNRSVNVRGIYQNGD
Sbjct: 67  LEISSPSTGNLAAAAKISISTGSSVGAETEAQPKRPPPGELEGKRDNRSVNVRGIYQNGD 126

Query: 121 PLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAVP 180
           PLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAVP
Sbjct: 127 PLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAVP 186

Query: 181 MPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSAQ 240
           MPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSAQ
Sbjct: 187 MPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSAQ 246

Query: 241 HKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELDA 300
           HKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELDA
Sbjct: 247 HKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELDA 306

Query: 301 VPTPDEGSDLSKPIEFLVHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRLPP 360
           VPTPDEGSDLSKPIEFLVHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRLPP
Sbjct: 307 VPTPDEGSDLSKPIEFLVHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRLPP 366

Query: 361 TTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGKSV 420
           TTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGKSV
Sbjct: 367 TTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGKSV 426

Query: 421 RIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLIDL 480
           RIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLIDL
Sbjct: 427 RIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLIDL 486

Query: 481 AETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGER 540
           AETNMDGI+LNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGER
Sbjct: 487 AETNMDGIVLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGER 546

Query: 541 VLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEFER 600
           VLVTAPTNAAVDNMVEKLSNIGIN VRVGNPARISSSVASKSLAEIVNSKLASFRTEFER
Sbjct: 547 VLVTAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNSKLASFRTEFER 606

Query: 601 KKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAAD 660
           KKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAAD
Sbjct: 607 KKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAAD 666

Query: 661 PLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGV 720
           PLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGV
Sbjct: 667 PLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGV 726

Query: 721 SLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKP 780
           SLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKP
Sbjct: 727 SLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKP 786

Query: 781 TWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPR 840
           TWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPR
Sbjct: 787 TWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPR 846

Query: 841 AIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFL 900
           AIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFL
Sbjct: 847 AIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFL 906

Query: 901 GDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGLGM 960
           GDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGLGM
Sbjct: 907 GDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGLGM 966

Query: 961 NPMLPSIN 969
           NPMLPSIN
Sbjct: 967 NPMLPSIN 974

BLAST of MS017824 vs. ExPASy TrEMBL
Match: A0A6J1K9F5 (DNA-binding protein SMUBP-2-like OS=Cucurbita maxima OX=3661 GN=LOC111491308 PE=3 SV=1)

HSP 1 Score: 1701.4 bits (4405), Expect = 0.0e+00
Identity = 875/970 (90.21%), Positives = 918/970 (94.64%), Query Frame = 0

Query: 1   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 60
           MNA TSI LFRQN TAVTV+FQQFVQT+N ANH    PSG Q +R+RVVK+ KNVKKPN 
Sbjct: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANH----PSGAQ-KRVRVVKSKKNVKKPNI 60

Query: 61  LEISSPSTGNLAAAAKISISTGSLVGAETEAQPKRPPPGELEG-KRDNRSVNVRGIYQNG 120
           LE+SSPST N +A A+ISIST   VG+E +A+PKR P GE EG K+ +R+VN+ GIYQNG
Sbjct: 61  LEVSSPSTANRSAGARISISTSGSVGSEMKARPKRSPLGEQEGKKKSDRAVNLHGIYQNG 120

Query: 121 DPLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAV 180
           DPLGRRELGKSVV+WIGQAM+AMASDFASA+V GDFSELRQ+MG GLTFVIQAQPYLNAV
Sbjct: 121 DPLGRRELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAV 180

Query: 181 PMPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSA 240
           PMPLGLEAVCLKA THYPTLFDHFQRELRD LQDLQSKSL  DWR TQSWKLLKELANSA
Sbjct: 181 PMPLGLEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSA 240

Query: 241 QHKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELD 300
           QHKAIARKISQPKAVQG LGMDLEKAKA+Q+RIDEF NRMSELLRIERDSELEFTQEEL+
Sbjct: 241 QHKAIARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELN 300

Query: 301 AVPTPDEGSDLSKPIEFLV-HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRL 360
           AVPTPDEGSD SKPIEFLV HGQAQQELCDTICNLNAVST TGLGGMHLVLFRVEG HRL
Sbjct: 301 AVPTPDEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRL 360

Query: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGK 420
           PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSIT+ALESRHGDPTFSKLFGK
Sbjct: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGK 420

Query: 421 SVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLI 480
           SVRIDRIPGLADTLTYERNCEALMLLQ+NGL+KKNPS AVVATLFGD+EDIKWME+N+LI
Sbjct: 421 SVRIDRIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDIKWMEDNNLI 480

Query: 481 DLAETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG 540
           DLA TN++ I+LNGDFD+SQ+ AIS ALNKKRPILI+QGPPGTGKTGLLKELIALAVQQG
Sbjct: 481 DLAHTNLNDIVLNGDFDDSQKGAISFALNKKRPILIVQGPPGTGKTGLLKELIALAVQQG 540

Query: 541 ERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEF 600
           ERVLVTAPTNAAVDNMVEKLSNIGIN VRVGNPARISSSVASKSLAEIVN+KLASFRT+ 
Sbjct: 541 ERVLVTAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDI 600

Query: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGA 660
           ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKE+LS+AQVVLATNTGA
Sbjct: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEILSNAQVVLATNTGA 660

Query: 661 ADPLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGL 720
           ADPLIR LEKFDLVVIDEAGQAIEP+CWIPILQG RCILAGDQCQLAPVILSRKALEGGL
Sbjct: 661 ADPLIRTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGL 720

Query: 721 GVSLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFV 780
           GVSLLERA+TLHQ TLT MLTIQYRMNDAIASWASKEMYGGMLKSS TVSSHLLVNSPFV
Sbjct: 721 GVSLLERASTLHQGTLTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFV 780

Query: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVS 840
           KPTWITQCPLLLLDTRMPYGSLSVGCEEHLD AGTGSLYNEGEADIVVQHVCSLIYSGVS
Sbjct: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVS 840

Query: 841 PRAIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900
           PRAIAVQSPYVAQVQLLRN+LDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG
Sbjct: 841 PRAIAVQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900

Query: 901 FLGDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGL 960
           FLGDSRRMNVAITRARKH+A+VCDSSTICQNTFLARLLRHIRYFGRVKHAEPG FGGSGL
Sbjct: 901 FLGDSRRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGL 960

Query: 961 GMNPMLPSIN 969
           GMNPMLPSIN
Sbjct: 961 GMNPMLPSIN 965

BLAST of MS017824 vs. ExPASy TrEMBL
Match: A0A6J1H5A4 (DNA-binding protein SMUBP-2-like OS=Cucurbita moschata OX=3662 GN=LOC111459712 PE=3 SV=1)

HSP 1 Score: 1699.9 bits (4401), Expect = 0.0e+00
Identity = 875/970 (90.21%), Positives = 917/970 (94.54%), Query Frame = 0

Query: 1   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 60
           MNA TSI LFRQN  AVTV+FQQFVQT+N ANH    PSG Q +R+RVVK+ KNVKKPN 
Sbjct: 1   MNAPTSIPLFRQNHIAVTVSFQQFVQTVNDANH----PSGAQ-KRVRVVKSKKNVKKPNI 60

Query: 61  LEISSPSTGNLAAAAKISISTGSLVGAETEAQPKRPPPGELEG-KRDNRSVNVRGIYQNG 120
           LE+SSPST N +A A+ISIST   +G+ET+A+PKR P GE EG K+ +R+VN+ GIYQNG
Sbjct: 61  LEVSSPSTANRSAGARISISTSGSIGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNG 120

Query: 121 DPLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAV 180
           DPLGRRELGKSVV+WIGQAM+AMASDFASA+V GDFSELRQ+MG GLTFVIQAQPYLNAV
Sbjct: 121 DPLGRRELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAV 180

Query: 181 PMPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSA 240
           PMPLGLEAVCLKA THYPTLFDHFQRELRD LQDLQSKSL  DWR TQSWKLLKELANSA
Sbjct: 181 PMPLGLEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSA 240

Query: 241 QHKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELD 300
           QHKAIARKISQPKAVQG LGMDLEKAKA+Q+RIDEF NRMSELLRIERDSELEFTQEEL+
Sbjct: 241 QHKAIARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELN 300

Query: 301 AVPTPDEGSDLSKPIEFLV-HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRL 360
           AVPTPDEGSD SKPIEFLV HGQAQQELCDTICNLNAVST TGLGGMHLVLFRVEG HRL
Sbjct: 301 AVPTPDEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRL 360

Query: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGK 420
           PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSIT+ALESRHGDPTFSKLFGK
Sbjct: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGK 420

Query: 421 SVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLI 480
           SVRIDRIPGLADTLTYERNCEALMLLQ+NGL+KKNPS AVVATLFGD+EDIKWME+N+LI
Sbjct: 421 SVRIDRIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDIKWMEDNNLI 480

Query: 481 DLAETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG 540
           DLA TN++ I+LNGDFD+SQ+ AIS ALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG
Sbjct: 481 DLAHTNLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG 540

Query: 541 ERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEF 600
           ERVLVTAPTNAAVDNMVEKLSNIGIN VRVGNPARISSSVASKSLAEIVN+KLASFRT+ 
Sbjct: 541 ERVLVTAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDI 600

Query: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGA 660
           ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLS+AQVVLATNTGA
Sbjct: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGA 660

Query: 661 ADPLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGL 720
           ADPLIR LEKFDLVVIDEAGQAIEP+CWIPILQG RCILAGDQCQLAPVILSRKALEGGL
Sbjct: 661 ADPLIRTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGL 720

Query: 721 GVSLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFV 780
           GVSLLERA+TLHQ  LT MLTIQYRMNDAIASWASKEMYGGMLKSS TVSSHLLVNSPFV
Sbjct: 721 GVSLLERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFV 780

Query: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVS 840
           KPTWITQCPLLLLDTRMPYGSLSVGCEEHLD AGTGSLYNEGEADIVVQHVCSLIYSGVS
Sbjct: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVS 840

Query: 841 PRAIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900
           PRAIAVQSPYVAQVQLLRN+LDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG
Sbjct: 841 PRAIAVQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900

Query: 901 FLGDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGL 960
           FLGDSRRMNVAITRARKH+A+VCDSSTICQNTFLARLLRHIRYFGRVKHAEPG FGGSGL
Sbjct: 901 FLGDSRRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGL 960

Query: 961 GMNPMLPSIN 969
           GMNPMLPSIN
Sbjct: 961 GMNPMLPSIN 965

BLAST of MS017824 vs. ExPASy TrEMBL
Match: A0A5A7UKQ5 (DNA-binding protein SMUBP-2 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1728G00510 PE=3 SV=1)

HSP 1 Score: 1683.3 bits (4358), Expect = 0.0e+00
Identity = 872/970 (89.90%), Positives = 915/970 (94.33%), Query Frame = 0

Query: 1   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 60
           M A TSIHLFRQN TAVTVAF QFVQTING N     PSG Q RRIRVVK+ KNVKKPN 
Sbjct: 1   MTAPTSIHLFRQNHTAVTVAFHQFVQTINGVNQ----PSGAQ-RRIRVVKSKKNVKKPNV 60

Query: 61  LEISSPSTGNLAAAAKISISTGSLVGAETEAQPKRPPPGELEGKRDN-RSVNVRGIYQNG 120
           LE+SSPST     AAKIS+ST   + +ET+A+PKR    ELE K+ N R VNV+GIYQNG
Sbjct: 61  LEVSSPST-----AAKISVSTSGSLASETKARPKR---RELEEKKKNDREVNVQGIYQNG 120

Query: 121 DPLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAV 180
           DPLGRRELGKSVVRWIGQAM+AMASDFA+AEVQGDFSEL+QRMG GLTFVIQAQ YLNAV
Sbjct: 121 DPLGRRELGKSVVRWIGQAMQAMASDFAAAEVQGDFSELQQRMGPGLTFVIQAQRYLNAV 180

Query: 181 PMPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSA 240
           PMPLGLEAVCLKA THYPTLFDHFQRELRDVLQDLQ +SLF DWR TQSWKLLKELANS 
Sbjct: 181 PMPLGLEAVCLKASTHYPTLFDHFQRELRDVLQDLQRQSLFLDWRETQSWKLLKELANSV 240

Query: 241 QHKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELD 300
           QHKAIARKIS+PK VQG LGMDL+KAKAIQNRIDEFANRMSELLRIERDSELEFTQEEL+
Sbjct: 241 QHKAIARKISEPKVVQGALGMDLKKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELN 300

Query: 301 AVPTPDEGSDLSKPIEFLV-HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRL 360
           AVPTPDEGSD SKPIEFLV HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEG HRL
Sbjct: 301 AVPTPDEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRL 360

Query: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGK 420
           PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSIT+ALESRHGDPTFSKLFGK
Sbjct: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGK 420

Query: 421 SVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLI 480
           +VRIDRIPGLADTLTYERNCEALMLLQ+NGL KKNPSIAVVATLFGDK+DIKWME+N++I
Sbjct: 421 TVRIDRIPGLADTLTYERNCEALMLLQKNGLHKKNPSIAVVATLFGDKDDIKWMEDNNVI 480

Query: 481 DLAETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG 540
            LA+TN+DGI+LNGDFD+SQ+SAIS ALNKKRPILIIQGPPGTGKTGLLK+LIALAVQQG
Sbjct: 481 GLADTNLDGIVLNGDFDDSQKSAISRALNKKRPILIIQGPPGTGKTGLLKDLIALAVQQG 540

Query: 541 ERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEF 600
           ERVLVTAPTNAAVDNMVEKLSN+GIN VRVGNPARISSSVASKSLAEIVNS+L+SFRT+ 
Sbjct: 541 ERVLVTAPTNAAVDNMVEKLSNVGINIVRVGNPARISSSVASKSLAEIVNSELSSFRTDI 600

Query: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGA 660
           ERKKADLRKDLR CLKDDSLAAGIRQLLKQLGK+LKKKEKETVKEVLS+AQVVLATNTGA
Sbjct: 601 ERKKADLRKDLRQCLKDDSLAAGIRQLLKQLGKSLKKKEKETVKEVLSNAQVVLATNTGA 660

Query: 661 ADPLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGL 720
           ADPLIR+L+KFDLVVIDEAGQAIEP+CWIPILQG RCILAGDQCQLAPVILSRKALEGGL
Sbjct: 661 ADPLIRKLDKFDLVVIDEAGQAIEPACWIPILQGRRCILAGDQCQLAPVILSRKALEGGL 720

Query: 721 GVSLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFV 780
           GVSLLERAATLH+  LTTMLTIQYRMNDAIASWASKEMY G+LKSS TVSSHLLVNSPFV
Sbjct: 721 GVSLLERAATLHEGALTTMLTIQYRMNDAIASWASKEMYDGILKSSPTVSSHLLVNSPFV 780

Query: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVS 840
           KPTWITQCPLLLLDTRMPYGSLSVGCEE+LDPAGTGSLYNEGEADIVVQHVCSLIYSGVS
Sbjct: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEYLDPAGTGSLYNEGEADIVVQHVCSLIYSGVS 840

Query: 841 PRAIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900
           PRAIAVQSPYVAQVQLLRN+LDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG
Sbjct: 841 PRAIAVQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900

Query: 901 FLGDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGL 960
           FLGDSRRMNVAITRARKHVA+VCDSSTICQNTFLARLLRHIRYFGRVKHAEPG FGGSGL
Sbjct: 901 FLGDSRRMNVAITRARKHVALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGL 957

Query: 961 GMNPMLPSIN 969
           GMNPMLPSIN
Sbjct: 961 GMNPMLPSIN 957

BLAST of MS017824 vs. ExPASy TrEMBL
Match: A0A1S3CT28 (DNA-binding protein SMUBP-2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103504640 PE=3 SV=1)

HSP 1 Score: 1683.3 bits (4358), Expect = 0.0e+00
Identity = 872/970 (89.90%), Positives = 915/970 (94.33%), Query Frame = 0

Query: 1   MNATTSIHLFRQNQTAVTVAFQQFVQTINGANHHHHPPSGTQRRRIRVVKTNKNVKKPNS 60
           M A TSIHLFRQN TAVTVAF QFVQTING N     PSG Q RRIRVVK+ KNVKKPN 
Sbjct: 1   MTAPTSIHLFRQNHTAVTVAFHQFVQTINGVNQ----PSGAQ-RRIRVVKSKKNVKKPNV 60

Query: 61  LEISSPSTGNLAAAAKISISTGSLVGAETEAQPKRPPPGELEGKRDN-RSVNVRGIYQNG 120
           LE+SSPST     AAKIS+ST   + +ET+A+PKR    ELE K+ N R VNV+GIYQNG
Sbjct: 61  LEVSSPST-----AAKISVSTSGSLASETKARPKR---RELEEKKKNDREVNVQGIYQNG 120

Query: 121 DPLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQRMGTGLTFVIQAQPYLNAV 180
           DPLGRRELGKSVVRWIGQAM+AMASDFA+AEVQGDFSEL+QRMG GLTFVIQAQ YLNAV
Sbjct: 121 DPLGRRELGKSVVRWIGQAMQAMASDFAAAEVQGDFSELQQRMGPGLTFVIQAQRYLNAV 180

Query: 181 PMPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLFPDWRRTQSWKLLKELANSA 240
           PMPLGLEAVCLKA THYPTLFDHFQRELRDVLQDLQ +SLF DWR TQSWKLLKELANS 
Sbjct: 181 PMPLGLEAVCLKASTHYPTLFDHFQRELRDVLQDLQRQSLFLDWRETQSWKLLKELANSV 240

Query: 241 QHKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELD 300
           QHKAIARKIS+PK VQG LGMDL+KAKAIQNRIDEFANRMSELLRIERDSELEFTQEEL+
Sbjct: 241 QHKAIARKISEPKVVQGALGMDLKKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELN 300

Query: 301 AVPTPDEGSDLSKPIEFLV-HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGTHRL 360
           AVPTPDEGSD SKPIEFLV HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEG HRL
Sbjct: 301 AVPTPDEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRL 360

Query: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITIALESRHGDPTFSKLFGK 420
           PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSIT+ALESRHGDPTFSKLFGK
Sbjct: 361 PPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGK 420

Query: 421 SVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVVATLFGDKEDIKWMEENDLI 480
           +VRIDRIPGLADTLTYERNCEALMLLQ+NGL KKNPSIAVVATLFGDK+DIKWME+N++I
Sbjct: 421 TVRIDRIPGLADTLTYERNCEALMLLQKNGLHKKNPSIAVVATLFGDKDDIKWMEDNNVI 480

Query: 481 DLAETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQG 540
            LA+TN+DGI+LNGDFD+SQ+SAIS ALNKKRPILIIQGPPGTGKTGLLK+LIALAVQQG
Sbjct: 481 GLADTNLDGIVLNGDFDDSQKSAISRALNKKRPILIIQGPPGTGKTGLLKDLIALAVQQG 540

Query: 541 ERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEF 600
           ERVLVTAPTNAAVDNMVEKLSN+GIN VRVGNPARISSSVASKSLAEIVNS+L+SFRT+ 
Sbjct: 541 ERVLVTAPTNAAVDNMVEKLSNVGINIVRVGNPARISSSVASKSLAEIVNSELSSFRTDI 600

Query: 601 ERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGA 660
           ERKKADLRKDLR CLKDDSLAAGIRQLLKQLGK+LKKKEKETVKEVLS+AQVVLATNTGA
Sbjct: 601 ERKKADLRKDLRQCLKDDSLAAGIRQLLKQLGKSLKKKEKETVKEVLSNAQVVLATNTGA 660

Query: 661 ADPLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGL 720
           ADPLIR+L+KFDLVVIDEAGQAIEP+CWIPILQG RCILAGDQCQLAPVILSRKALEGGL
Sbjct: 661 ADPLIRKLDKFDLVVIDEAGQAIEPACWIPILQGRRCILAGDQCQLAPVILSRKALEGGL 720

Query: 721 GVSLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFV 780
           GVSLLERAATLH+  LTTMLTIQYRMNDAIASWASKEMY G+LKSS TVSSHLLVNSPFV
Sbjct: 721 GVSLLERAATLHEGALTTMLTIQYRMNDAIASWASKEMYDGILKSSPTVSSHLLVNSPFV 780

Query: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVS 840
           KPTWITQCPLLLLDTRMPYGSLSVGCEE+LDPAGTGSLYNEGEADIVVQHVCSLIYSGVS
Sbjct: 781 KPTWITQCPLLLLDTRMPYGSLSVGCEEYLDPAGTGSLYNEGEADIVVQHVCSLIYSGVS 840

Query: 841 PRAIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900
           PRAIAVQSPYVAQVQLLRN+LDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG
Sbjct: 841 PRAIAVQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVG 900

Query: 901 FLGDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGTFGGSGL 960
           FLGDSRRMNVAITRARKHVA+VCDSSTICQNTFLARLLRHIRYFGRVKHAEPG FGGSGL
Sbjct: 901 FLGDSRRMNVAITRARKHVALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGL 957

Query: 961 GMNPMLPSIN 969
           GMNPMLPSIN
Sbjct: 961 GMNPMLPSIN 957

BLAST of MS017824 vs. TAIR 10
Match: AT5G35970.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein)

HSP 1 Score: 1401.7 bits (3627), Expect = 0.0e+00
Identity = 696/866 (80.37%), Positives = 781/866 (90.18%), Query Frame = 0

Query: 101 LEGKRDNRSVNVRGIYQNGDPLGRRELGKSVVRWIGQAMRAMASDFASAEVQGDFSELRQ 160
           +E  ++++ +++R + QNGDPLGRR+LG++VV+WI QAM+AMASDFA+AEVQG+FSELRQ
Sbjct: 93  VEEPKNDKELSLRALNQNGDPLGRRDLGRNVVKWISQAMKAMASDFATAEVQGEFSELRQ 152

Query: 161 RMGTGLTFVIQAQPYLNAVPMPLGLEAVCLKACTHYPTLFDHFQRELRDVLQDLQSKSLF 220
            +G+GLTFVIQAQPYLNA+PMPLG E +CLKACTHYPTLFDHFQRELRDVLQDL+ K++ 
Sbjct: 153 NVGSGLTFVIQAQPYLNAIPMPLGSEVICLKACTHYPTLFDHFQRELRDVLQDLERKNIM 212

Query: 221 PDWRRTQSWKLLKELANSAQHKAIARKISQPKAVQGVLGMDLEKAKAIQNRIDEFANRMS 280
             W+ ++SWKLLKE+ANSAQH+ +ARK +Q K VQGVLGMD EK KAIQ RIDEF ++MS
Sbjct: 213 ESWKESESWKLLKEIANSAQHREVARKAAQAKPVQGVLGMDSEKVKAIQERIDEFTSQMS 272

Query: 281 ELLRIERDSELEFTQEELDAVPTPDEGSDLSKPIEFLV-HGQAQQELCDTICNLNAVSTS 340
           +LL++ERD+ELE TQEELD VPTPDE SD SKPIEFLV HG A QELCDTICNL AVSTS
Sbjct: 273 QLLQVERDTELEVTQEELDVVPTPDESSDSSKPIEFLVRHGDAPQELCDTICNLYAVSTS 332

Query: 341 TGLGGMHLVLFRVEGTHRLPPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSI 400
           TGLGGMHLVLF+V G HRLPPTTLSPGDMVC+RVCDSRGAGAT+C QGFV+NLG+DGCSI
Sbjct: 333 TGLGGMHLVLFKVGGNHRLPPTTLSPGDMVCIRVCDSRGAGATACTQGFVHNLGEDGCSI 392

Query: 401 TIALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQRNGLQKKNPSIAVV 460
            +ALESRHGDPTFSKLFGKSVRIDRI GLAD LTYERNCEALMLLQ+NGLQKKNPSI+VV
Sbjct: 393 GVALESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALMLLQKNGLQKKNPSISVV 452

Query: 461 ATLFGDKEDIKWMEENDLIDLAETNMDGIMLNGDFDNSQRSAISHALNKKRPILIIQGPP 520
           ATLFGD EDI W+E+ND +D +E  +    ++  FD+SQR AI+  +NKKRP++I+QGPP
Sbjct: 453 ATLFGDGEDITWLEQNDYVDWSEAELSDEPVSKLFDSSQRRAIALGVNKKRPVMIVQGPP 512

Query: 521 GTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNPARISSSVA 580
           GTGKTG+LKE+I LAVQQGERVLVTAPTNAAVDNMVEKL ++G+N VRVGNPARISS+VA
Sbjct: 513 GTGKTGMLKEVITLAVQQGERVLVTAPTNAAVDNMVEKLLHLGLNIVRVGNPARISSAVA 572

Query: 581 SKSLAEIVNSKLASFRTEFERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKE 640
           SKSL EIVNSKLASFR E ERKK+DLRKDLR CL+DD LAAGIRQLLKQLGKTLKKKEKE
Sbjct: 573 SKSLGEIVNSKLASFRAELERKKSDLRKDLRQCLRDDVLAAGIRQLLKQLGKTLKKKEKE 632

Query: 641 TVKEVLSSAQVVLATNTGAADPLIRRLEKFDLVVIDEAGQAIEPSCWIPILQGSRCILAG 700
           TVKE+LS+AQVV ATN GAADPLIRRLE FDLVVIDEAGQ+IEPSCWIPILQG RCIL+G
Sbjct: 633 TVKEILSNAQVVFATNIGAADPLIRRLETFDLVVIDEAGQSIEPSCWIPILQGKRCILSG 692

Query: 701 DQCQLAPVILSRKALEGGLGVSLLERAATLHQETLTTMLTIQYRMNDAIASWASKEMYGG 760
           D CQLAPV+LSRKALEGGLGVSLLERAA+LH   L T LT QYRMND IA WASKEMYGG
Sbjct: 693 DPCQLAPVVLSRKALEGGLGVSLLERAASLHDGVLATKLTTQYRMNDVIAGWASKEMYGG 752

Query: 761 MLKSSSTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNE 820
            LKS+ +V+SHLL++SPFVK TWITQCPL+LLDTRMPYGSLSVGCEE LDPAGTGSLYNE
Sbjct: 753 WLKSAPSVASHLLIDSPFVKATWITQCPLVLLDTRMPYGSLSVGCEERLDPAGTGSLYNE 812

Query: 821 GEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRNKLDEIPEAAGIEVATIDSFQGR 880
           GEADIVV HV SLIY+GVSP AIAVQSPYVAQVQLLR +LD+ P A G+EVATIDSFQGR
Sbjct: 813 GEADIVVNHVISLIYAGVSPMAIAVQSPYVAQVQLLRERLDDFPVADGVEVATIDSFQGR 872

Query: 881 EADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHI 940
           EADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTIC NTFLARLLRHI
Sbjct: 873 EADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTFLARLLRHI 932

Query: 941 RYFGRVKHAEPGTFGGSGLGMNPMLP 966
           RYFGRVKHA+PG+ GGSGLG++PMLP
Sbjct: 933 RYFGRVKHADPGSLGGSGLGLDPMLP 958

BLAST of MS017824 vs. TAIR 10
Match: AT2G03270.1 (DNA-binding protein, putative)

HSP 1 Score: 362.1 bits (928), Expect = 1.4e-99
Identity = 246/688 (35.76%), Positives = 380/688 (55.23%), Query Frame = 0

Query: 272 IDEFANRMSELLRIERDSELEFTQEELDAVPTPDEGSDLSKPIEFLVHGQAQQELCDTIC 331
           ++ F + M+ L+ +E+++E+  +             S  S+ IE        Q+   TI 
Sbjct: 7   LEAFVSTMAPLIDMEKEAEISMSLT-----------SGASRNIE------TAQKKGTTIL 66

Query: 332 NLNAVSTSTGLGGMHLVLFRVEGTHRLPPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNN 391
           NL  V   TGL G  L+ F+      LP       D+V +++ +    G++   QG V  
Sbjct: 67  NLKCVDVQTGLMGKSLIEFQSNKGDVLPAHKFGNHDVVVLKL-NKSDLGSSPLAQGVVYR 126

Query: 392 LGDDGCSITIALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQRNGLQK 451
           L D   SIT+       D    +    S+R+++   LA+ +TY R  + L+ L +  L  
Sbjct: 127 LKDS--SITVVF-----DEVPEEGLNTSLRLEK---LANEVTYRRMKDTLIQLSKGVL-- 186

Query: 452 KNPSIAVVATLFGDKEDIKWMEENDLIDLAETNMDGIMLNGDFDNSQRSAISHALNKKRP 511
           + P+  +V  LFG+++    + + D+             N + D SQ+ AI+ AL+ K  
Sbjct: 187 RGPASDLVPVLFGERQPS--VSKKDVKSFTP-------FNKNLDQSQKDAITKALSSK-D 246

Query: 512 ILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINTVRVGNP 571
           + ++ GPPGTGKT  + E++   V++G ++L  A +N AVDN+VE+L    +  VRVG+P
Sbjct: 247 VFLLHGPPGTGKTTTVVEIVLQEVKRGSKILACAASNIAVDNIVERLVPHKVKLVRVGHP 306

Query: 572 ARISSSVASKSL-AEIV---NSKLAS-FRTEFERKKADLRKDLRHCLKDDSLAAGIRQLL 631
           AR+   V   +L A+++   NS LA+  R E +     L K      KD +    I++ L
Sbjct: 307 ARLLPQVLDSALDAQVLKGDNSGLANDIRKEMKALNGKLLK-----AKDKNTRRLIQKEL 366

Query: 632 KQLGKTLKKKEKETVKEVLSSAQVVLATNTGAADPLIRRLEK--FDLVVIDEAGQAIEPS 691
           + LGK  +K+++  V +V+ +A V+L T TGA   L R+L+   FDLV+IDE  QA+E +
Sbjct: 367 RTLGKEERKRQQLAVSDVIKNADVILTTLTGA---LTRKLDNRTFDLVIIDEGAQALEVA 426

Query: 692 CWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHQETLTTMLTIQYRM 751
           CWI +L+GSRCILAGD  QL P I S +A   GLG +L ER A L+ + + +MLT+QYRM
Sbjct: 427 CWIALLKGSRCILAGDHLQLPPTIQSAEAERKGLGRTLFERLADLYGDEIKSMLTVQYRM 486

Query: 752 NDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYGSLSVGC 811
           ++ I +W+SKE+Y   + + S+V+SH+L +   V  +  T+  LLL+DT         GC
Sbjct: 487 HELIMNWSSKELYDNKITAHSSVASHMLFDLENVTKSSSTEATLLLVDT--------AGC 546

Query: 812 EEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRNKLDEIPE 871
           +         S YNEGEA++ + H   L+ SGV P  I + +PY AQV LLR    +  +
Sbjct: 547 DMEEKKDEEESTYNEGEAEVAMAHAKRLMESGVQPSDIGIITPYAAQVMLLRILRGKEEK 606

Query: 872 AAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAVVCDSS 931
              +E++T+D FQGRE +A+IISMVRSN+   VGFL D RRMNVA+TR+R+   +VCD+ 
Sbjct: 607 LKDMEISTVDGFQGREKEAIIISMVRSNSKKEVGFLKDQRRMNVAVTRSRRQCCIVCDTE 638

Query: 932 TICQNTFLARLLRHIRYFGRVKHAEPGT 953
           T+  + FL R++ +    G    A   T
Sbjct: 667 TVSSDAFLKRMIEYFEEHGEYLSASEYT 638

BLAST of MS017824 vs. TAIR 10
Match: AT5G47010.1 (RNA helicase, putative)

HSP 1 Score: 205.3 bits (521), Expect = 2.2e-52
Identity = 158/457 (34.57%), Positives = 234/457 (51.20%), Query Frame = 0

Query: 493 DFDNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGE-RVLVTAPTNAAV 552
           + + SQ +A+   L K  PI +IQGPPGTGKT     ++    +QG+ +VLV AP+N AV
Sbjct: 488 ELNASQVNAVKSVLQK--PISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVLVCAPSNVAV 547

Query: 553 DNMVEKLSNIGINTVRVGNPAR--ISSSVASKSLAEIVNSKLASFRTEFERKKADLRKDL 612
           D + EK+S  G+  VR+   +R  +SS V   +L   V     S ++E  + +       
Sbjct: 548 DQLAEKISATGLKVVRLCAKSREAVSSPVEYLTLHYQVRHLDTSEKSELHKLQQ------ 607

Query: 613 RHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAADPLIRRLEKF 672
              LKD+       +L     K  K  ++ T +E+  SA V+  T  GAAD  +    +F
Sbjct: 608 ---LKDEQ-----GELSSSDEKKYKNLKRATEREITQSADVICCTCVGAADLRLSNF-RF 667

Query: 673 DLVVIDEAGQAIEPSCWIPILQG-SRCILAGDQCQLAPVILSRKALEGGLGVSLLERAAT 732
             V+IDE+ QA EP C IP++ G  + +L GD CQL PVI+ +KA   GL  SL ER  T
Sbjct: 668 RQVLIDESTQATEPECLIPLVLGVKQVVLVGDHCQLGPVIMCKKAARAGLAQSLFERLVT 727

Query: 733 LHQETLTTMLTIQYRMNDAIASWASKEMYGGMLKSSSTVSSHLLVNSPFVKPTWITQCPL 792
           L  + +   L +QYRM+ A++ + S   Y G L++  T+         F  P        
Sbjct: 728 LGIKPI--RLQVQYRMHPALSEFPSNSFYEGTLQNGVTIIERQTTGIDFPWP-------- 787

Query: 793 LLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPY 852
             +  R  +  + +G +E +  +GT S  N  EA  V + V + + SGV P  I V +PY
Sbjct: 788 --VPNRPMFFYVQLG-QEEISASGT-SYLNRTEAANVEKLVTAFLKSGVVPSQIGVITPY 847

Query: 853 VAQVQLLRNKLDEIPEA-----AGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 912
             Q   + N +             IEVA++DSFQGRE D +I+S VRSN    +GFL D 
Sbjct: 848 EGQRAYIVNYMARNGSLRQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDP 907

Query: 913 RRMNVAITRARKHVAVVCDSSTICQNTFLARLLRHIR 941
           RR+NVA+TRAR  + ++ +   + +      LL H +
Sbjct: 908 RRLNVALTRARYGIVILGNPKVLSKQPLWNGLLTHYK 913

BLAST of MS017824 vs. TAIR 10
Match: AT1G08840.1 (DNA replication helicase, putative)

HSP 1 Score: 162.2 bits (409), Expect = 2.2e-39
Identity = 126/462 (27.27%), Positives = 212/462 (45.89%), Query Frame = 0

Query: 495  DNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNM 554
            +N QR AI   L  K   LI+ G PGTGKT  +   +   + +G  +L+ + TN+AVDN+
Sbjct: 889  NNDQRQAILKILTAKDYALIL-GMPGTGKTSTMVHAVKALLIRGSSILLASYTNSAVDNL 948

Query: 555  VEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEFERKKADLRKDLRHCLK 614
            + KL   GI  +R+G    +   V     +                        +  C  
Sbjct: 949  LIKLKAQGIEFLRIGRDEAVHEEVRESCFSA-----------------------MNMCSV 1008

Query: 615  DDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAADPLIRRLEKFDLVVI 674
            +D                        +K+ L   +VV +T  G   PL+    +FD+ +I
Sbjct: 1009 ED------------------------IKKKLDQVKVVASTCLGINSPLLVN-RRFDVCII 1068

Query: 675  DEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHQETL 734
            DEAGQ   P    P+L  S  +L GD  QL P++ S +A E G+G+SL  R +  H + +
Sbjct: 1069 DEAGQIALPVSIGPLLFASTFVLVGDHYQLPPLVQSTEARENGMGISLFRRLSEAHPQAI 1128

Query: 735  TTMLTIQYRMNDAIASWASKEMYGGML--KSSSTVSSHLLVNSPFVKPTWITQCPLLLLD 794
             ++L  QYRM   I   ++  +YG  L   S+    + L++++      W+ +    +L+
Sbjct: 1129 -SVLQNQYRMCRGIMELSNALIYGDRLCCGSAEVADATLVLSTSSSTSPWLKK----VLE 1188

Query: 795  TRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQV 854
                   ++       +     ++ N  EA I+ + V  L+ +GV  + I + +PY +Q 
Sbjct: 1189 PTRTVVFVNTDMLRAFEARDQNAINNPVEASIIAEIVEELVNNGVDSKDIGIITPYNSQA 1248

Query: 855  QLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSN---NLGAVGFLGDSRRMNVA 914
             L+++ +   P    +E+ TID +QGR+ D +++S VRS       A   LGD  R+NVA
Sbjct: 1249 SLIQHAIPTTP----VEIHTIDKYQGRDKDCILVSFVRSREKPRSSASSLLGDWHRINVA 1292

Query: 915  ITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPG 952
            +TRA+K + +V    T+ +   L  LL  ++    + +  PG
Sbjct: 1309 LTRAKKKLIMVGSQRTLSRVPLLMLLLNKVKEQSGILNLLPG 1292

BLAST of MS017824 vs. TAIR 10
Match: AT1G08840.2 (DNA replication helicase, putative)

HSP 1 Score: 162.2 bits (409), Expect = 2.2e-39
Identity = 126/462 (27.27%), Positives = 212/462 (45.89%), Query Frame = 0

Query: 495  DNSQRSAISHALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNM 554
            +N QR AI   L  K   LI+ G PGTGKT  +   +   + +G  +L+ + TN+AVDN+
Sbjct: 908  NNDQRQAILKILTAKDYALIL-GMPGTGKTSTMVHAVKALLIRGSSILLASYTNSAVDNL 967

Query: 555  VEKLSNIGINTVRVGNPARISSSVASKSLAEIVNSKLASFRTEFERKKADLRKDLRHCLK 614
            + KL   GI  +R+G    +   V     +                        +  C  
Sbjct: 968  LIKLKAQGIEFLRIGRDEAVHEEVRESCFSA-----------------------MNMCSV 1027

Query: 615  DDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSSAQVVLATNTGAADPLIRRLEKFDLVVI 674
            +D                        +K+ L   +VV +T  G   PL+    +FD+ +I
Sbjct: 1028 ED------------------------IKKKLDQVKVVASTCLGINSPLLVN-RRFDVCII 1087

Query: 675  DEAGQAIEPSCWIPILQGSRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHQETL 734
            DEAGQ   P    P+L  S  +L GD  QL P++ S +A E G+G+SL  R +  H + +
Sbjct: 1088 DEAGQIALPVSIGPLLFASTFVLVGDHYQLPPLVQSTEARENGMGISLFRRLSEAHPQAI 1147

Query: 735  TTMLTIQYRMNDAIASWASKEMYGGML--KSSSTVSSHLLVNSPFVKPTWITQCPLLLLD 794
             ++L  QYRM   I   ++  +YG  L   S+    + L++++      W+ +    +L+
Sbjct: 1148 -SVLQNQYRMCRGIMELSNALIYGDRLCCGSAEVADATLVLSTSSSTSPWLKK----VLE 1207

Query: 795  TRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQV 854
                   ++       +     ++ N  EA I+ + V  L+ +GV  + I + +PY +Q 
Sbjct: 1208 PTRTVVFVNTDMLRAFEARDQNAINNPVEASIIAEIVEELVNNGVDSKDIGIITPYNSQA 1267

Query: 855  QLLRNKLDEIPEAAGIEVATIDSFQGREADAVIISMVRSN---NLGAVGFLGDSRRMNVA 914
             L+++ +   P    +E+ TID +QGR+ D +++S VRS       A   LGD  R+NVA
Sbjct: 1268 SLIQHAIPTTP----VEIHTIDKYQGRDKDCILVSFVRSREKPRSSASSLLGDWHRINVA 1311

Query: 915  ITRARKHVAVVCDSSTICQNTFLARLLRHIRYFGRVKHAEPG 952
            +TRA+K + +V    T+ +   L  LL  ++    + +  PG
Sbjct: 1328 LTRAKKKLIMVGSQRTLSRVPLLMLLLNKVKEQSGILNLLPG 1311
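
The percentages in each HSP header follow directly from the residue counts, for example 872/970 = 89.90% identity. The following is a minimal Python sketch for recomputing them, assuming the header layout shown in the alignments above. The helper name `parse_hsp_stats` is hypothetical and is not part of BLAST or of this database.

```python
import re

# Matches e.g. "Identity = 872/970 (89.90%), Positives = 915/970 (94.33%)".
# "Posi?tives" also tolerates the "Postives" spelling found in some reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Recompute identity and positive percentages from an HSP header line.

    Hypothetical helper: the line format is assumed from the report above.
    """
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "alignment_length": int(length),
        "identity": round(100 * int(ident) / int(length), 2),
        "positives": round(100 * int(pos) / int(pos_length), 2),
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }

# Header of the Cucumis melo (A0A1S3CT28) hit.
stats = parse_hsp_stats(
    "Identity = 872/970 (89.90%), Positives = 915/970 (94.33%), Query Frame = 0"
)
```

Comparing `stats["identity"]` with `stats["reported_identity"]` is a quick consistency check when copying values out of a report by hand.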

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022157125.1 | 0.0e+00 | 99.69 | DNA-binding protein SMUBP-2 [Momordica charantia]
XP_038906929.1 | 0.0e+00 | 91.96 | DNA-binding protein SMUBP-2 [Benincasa hispida]
XP_022995943.1 | 0.0e+00 | 90.21 | DNA-binding protein SMUBP-2-like [Cucurbita maxima]
XP_023533963.1 | 0.0e+00 | 90.21 | DNA-binding protein SMUBP-2-like [Cucurbita pepo subsp. pepo]
XP_022958504.1 | 0.0e+00 | 90.21 | DNA-binding protein SMUBP-2-like [Cucurbita moschata]

Match Name | E-value | Identity | Description
P38935 | 2.1e-95 | 37.37 | DNA-binding protein SMUBP-2 OS=Homo sapiens OX=9606 GN=IGHMBP2 PE=1 SV=3
Q60560 | 1.1e-93 | 36.32 | DNA-binding protein SMUBP-2 OS=Mesocricetus auratus OX=10036 GN=IGHMBP2 PE=1 SV=...
P40694 | 1.9e-93 | 35.93 | DNA-binding protein SMUBP-2 OS=Mus musculus OX=10090 GN=Ighmbp2 PE=1 SV=1
Q9EQN5 | 3.6e-92 | 36.03 | DNA-binding protein SMUBP-2 OS=Rattus norvegicus OX=10116 GN=Ighmbp2 PE=1 SV=1
O94247 | 1.5e-69 | 31.35 | DNA polymerase alpha-associated DNA helicase A OS=Schizosaccharomyces pombe (str...

Match Name | E-value | Identity | Description
A0A6J1DS82 | 0.0e+00 | 99.69 | DNA-binding protein SMUBP-2 OS=Momordica charantia OX=3673 GN=LOC111023920 PE=3 ...
A0A6J1K9F5 | 0.0e+00 | 90.21 | DNA-binding protein SMUBP-2-like OS=Cucurbita maxima OX=3661 GN=LOC111491308 PE=...
A0A6J1H5A4 | 0.0e+00 | 90.21 | DNA-binding protein SMUBP-2-like OS=Cucurbita moschata OX=3662 GN=LOC111459712 P...
A0A5A7UKQ5 | 0.0e+00 | 89.90 | DNA-binding protein SMUBP-2 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A1S3CT28 | 0.0e+00 | 89.90 | DNA-binding protein SMUBP-2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103504640 P...

Match Name | E-value | Identity | Description
AT5G35970.1 | 0.0e+00 | 80.37 | P-loop containing nucleoside triphosphate hydrolases superfamily protein
AT2G03270.1 | 1.4e-99 | 35.76 | DNA-binding protein, putative
AT5G47010.1 | 2.2e-52 | 34.57 | RNA helicase, putative
AT1G08840.1 | 2.2e-39 | 27.27 | DNA replication helicase, putative
AT1G08840.2 | 2.2e-39 | 27.27 | DNA replication helicase, putative

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 265..285
None | No IPR available | COILS | Coil | Coil | coord: 619..639
None | No IPR available | GENE3D | 2.40.30.270 |  | coord: 298..428; e-value: 1.4E-109; score: 368.9
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 95..109
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 84..109
None | No IPR available | PANTHER | PTHR43788 | DNA2/NAM7 HELICASE FAMILY MEMBER | coord: 35..968
None | No IPR available | PANTHER | PTHR43788:SF3 | P-LOOP CONTAINING NUCLEOSIDE TRIPHOSPHATE HYDROLASES SUPERFAMILY PROTEIN | coord: 35..968
None | No IPR available | CDD | cd18044 | DEXXQc_SMUBP2 | coord: 494..743; e-value: 5.55168E-88; score: 277.568
IPR014001 | Helicase superfamily 1/2, ATP-binding domain | SMART | SM00487 | ultradead3 | coord: 490..732; e-value: 7.5E-4; score: 28.8
IPR003593 | AAA+ ATPase domain | SMART | SM00382 | AAA_5 | coord: 509..724; e-value: 1.0E-4; score: 31.7
IPR041679 | DNA2/NAM7 helicase-like, C-terminal | PFAM | PF13087 | AAA_12 | coord: 719..923; e-value: 1.8E-50; score: 171.4
IPR041679 | DNA2/NAM7 helicase-like, C-terminal | CDD | cd18808 | SF1_C_Upf1 | coord: 744..940; e-value: 3.00479E-59; score: 198.612
IPR041677 | DNA2/NAM7 helicase, helicase domain | PFAM | PF13086 | AAA_11 | coord: 494..710; e-value: 1.5E-54; score: 185.5
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | GENE3D | 3.40.50.300 |  | coord: 278..726; e-value: 1.4E-109; score: 368.9
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | GENE3D | 3.40.50.300 |  | coord: 742..950; e-value: 1.5E-55; score: 189.8
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | SUPERFAMILY | 52540 | P-loop containing nucleoside triphosphate hydrolases | coord: 493..931
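
Each `coord: start..end` entry gives the span of a predicted domain on the protein. The Python sketch below shows how a span length can be derived from such an entry. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly, and the helper name `span_length` is hypothetical.

```python
import re

# Matches the "coord: 494..710" fragments used in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def span_length(text):
    """Length in residues of a 'coord: start..end' range.

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1.
    """
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coord range found")
    start, end = int(m.group(1)), int(m.group(2))
    if end < start:
        raise ValueError("end precedes start")
    return end - start + 1

# The two Pfam helicase domains reported above.
aaa_11_len = span_length("PF13086 AAA_11 coord: 494..710")
aaa_12_len = span_length("PF13087 AAA_12 coord: 719..923")
```

Under the inclusive-coordinate assumption, the two Pfam domains span 217 and 205 residues respectively and do not overlap.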

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MS017824.1 | MS017824.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0004386 | helicase activity