Cp4.1LG05g07010 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG05g07010
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: DNA-binding protein SMUBP-2-like
Location: Cp4.1LG05: 4222708 .. 4231411 (-)
RNA-Seq Expression: Cp4.1LG05g07010
Synteny: Cp4.1LG05g07010
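
The Location field above can be read programmatically. The following is a minimal sketch, not part of the database itself: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the GFF convention), which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Split 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG05: 4222708 .. 4231411 (-)")
print(seqid, span_length(start, end), strand)
```

Under that assumption the gene spans 8704 bp on the minus strand of Cp4.1LG05.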
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGCCACGAAAGCGACCTCGCTGGCGCTTTCGAGCTCCCGAAGAAAATCTCTCGTCGAAAGAATCGTCGATGTGTTCTTCTCTTCCTCACCTCAAATTCACTGCGCGGCTTAAATTTGCAGAACCCATTTCCCGTCCGCCATTGCCTTGTCTTGAAGATTTTTCCGCCGCTTCTACGTAATGCAACGGTTTCCTCACGTTTTCTTGTTTCAACTTCGATTGATTTCTTCTCTTTGAGTGTTTTTCTCTTCGATTTTGAGTGTTGGTGCGATGAATGCGCCAACGTCGATTCCGTTGTTTCGCCAGAATCACACGGCCGTAACTGTTTCTTTCCAGCAGTTTGTTCAGACTGTTAATGACGCTAATCATCCCAGTGGAGCGCAGAAGAGGGTTCGTGTTGTCAAAAGTAAGAAGAATGTGAAGAAACCCAATATTCTCGAGGTTTCTTCTCCGTCTACTGCTAATCGATCTGCTGGTGCTAGAATCAGCATCAGTACCAGTGGTTCAGTCGGTTCGGAGACGAAGGCGCGACCGAAGCGGTCGCCTCTGGGTGAACAGGAAGGGAAGAAGAAGAGTGATCGGGCGGTTAACTTGCATGGTATCTATCAGAACGGGGATCCTCTAGGGCGGAGGGAGCTGGGGAAGAGTGTAGTTCAGTGGATTGGGCAGGCCATGCAAGCAATGGCCTCTGATTTCGCTTCTGCGGACGTCAACGGAGATTTCTCCGAGCTCCGGCAGCAGATGGGACCGGGGCTTACTTTTGTGATTCAAGCCCAACCGTATCTGAATGCGGTGCCTATGCCCCTTGGACTTGAAGCCGTCTGTTTGAAAGCTTCTACTCACTATCCCACTCTCTTTGACCATTTCCAGAGGGAGCTCAGGGATGCCCTCCAAGATCTCCAAAGCAAGTCGCTGATTCTAGATTGGCGCGAAACTCAATCATGGAAGCTCCTCAAGGAGCTTGCTAATTCAGGTTTCTTCTCTTACTCCCAAGTAATATCTGTACCAAGTTACATATCATTTCTTGCAAGAAAATGATGCTTTTTAATCTGTTTTATAGACTTCTTTTTTATCTCTAAGTGTTTCCTTGAAGTAAAGAATTAAATTGAATTCTATTGGAACACTAAGTAACCATCTAACAAGATTTTACTTTTCGTTATCAAATCATTTTGAAATATAAGTTACGAATTTCAAATCCTGTGTTTCTAATGGATGCATAAAAAGATATATATATATTTTGTTCGATTAGGAACAAATTGCTTGACTGGGAACTGACATGGTGCATTTGCAGCTCAGCATAAAGCTATAGCACGTAAGATAAGCCAGCCGAAGGCTGTCCAAGGTGCTTTAGGGATGGACCTGGAGAAGGCCAAGGCTTTACAGAGCAGGATTGATGAGTTTGTAAACCGCATGTCTGAGTTACTTCGCATTGAGAGAGATTCCGAATTGGAGTTTACACAAGAGGAGTTGAATGCGGTCCCTACACCAGATGAGGGTTCGGATAATTCTAAACCCATTGAGTTCTTAGTCAGTCATGGCCAAGCACAGCAAGAACTCTGTGACACGATATGCAATTTGAATGCAGTTAGCACGTTTACAGGTCCATCTAATGATTTCATTTTATAAGAGCGATTCAATCAATTATTTTTCTTCAGATTCTGAACTCAGCTCAACTTCAATATTTATTCTCTAAAGTTCGAGAGAAGTCCTTAATCGTATGATCAACTTTTATTGGTATTCAGGATTAGGGGGGATGCATTTAGTATTATTCAGAGTTGAAGGAAACCATAGATTACCACCTACAACCCTTTCGCCGGGAGATATGGTTTGTGTGAGGGTTTGCGATAGCAGGGGTGCTGGTGCAACTTCTTGCATGCAAGGGTTTGTGAACAATCTGGGGGATGATGGGTGCAGCATCACTGTTGCTCTAGAATCTCGTCATGGTGACCCTACCTTTTCAAAGCTCTTTGGAAAGTCCGTGCGTATTGATCGTATTCCAGGG
TTAGCTGACACTCTCACTTATGAGGTACTAGGTGAATGTTTGTAGCTTGATTCAATAAATATACTTGTCCTGCATTGTTATCTTCAATTTTTCTCCATGAGATTATATTCAAGGAAAACTGCTATGTCTGAACCAAGTATCGATTCATTTACAGCGCAATTGTGAAGCCTTAATGTTGCTTCAGAAAAATGGTTTGCGAAAGAAAAATCCTTCTTCTGCTGTAGTGGCTACATTATTTGGCGATGAAGAAGACGTCAAGTGGATGGAAGATAATAACTTAATAGATCTAGCTCATACCAACCTGAATGACATAGTCCTCAATGGAGATTTTGATGATTCACAAAAAGGAGCAATTTCGTTTGCTTTGAATAAAAAGCGGCCGATATTGATAATCCAAGGGCCACCTGGCACTGGAAAGACAGGTCTGCTAAAGGAGCTTATTGCACTTGCTGTTCAACAAGGTGAAAGGGTGCTTGTAACAGCACCTACTAATGCAGCTGTTGACAACATGGTTGAAAAACTCTCAAACATTGGGATAAACATCGTTAGGGTAGGAAATCCGGCACGGATATCTTCAAGTGTTGCATCCAAGTCCTTGGCTGAAATTGTCAACACTAAACTTGCAAGTTTTAGAACAGATATTGAAAGGAAGAAGGCAGACCTAAGGAAAGACTTGAGACACTGTTTAAAGGACGATTCATTGGCTGCTGGCATACGCCAGCTTCTGAAGCAGCTTGGGAAGACATTAAAAAAGAAAGAGAAGGAAACTGTGAAGGAAGTACTTTCAAATGCCCAAGTTGTTCTTGCTACCAACACTGGAGCGGCTGATCCTTTAATTCGGACGCTGGAGAAATTTGATCTAGTTGTTATAGATGAGGCAGGTCAGGCAATTGAGCCAGCTTGCTGGATTCCAATATTACAGGGACACCGTTGTATTCTTGCAGGCGATCAATGCCAACTTGCTCCTGTGATTTTGTCTAGAAAAGCCCTGGAAGGTGGTCTTGGAGTATCATTGCTGGAGCGGGCTTCAACCTTGCATCAGGGGGCTTTAACCAAAATGTTAACAATACAATACCGGATGAATGACGCAATAGCTAGTTGGGCTTCAAAGGAAATGTATGGTGGAATGTTGAAGTCCTCACCAACAGTCTCTTCTCATCTTCTTGTAAACTCTCCGTTTGTCAAGGTACCATTCTCCAAAATTACTGTCGGTAGTGTACTTTGGCATTCCAATCTAGACATGCATGTTGGGGAGGTTATTATTTATTAAGTTATATTTCTAAGAACATGAAGATCTTTTGACTTACCATTTTGATACTAGTTCCCTTACAGTAACCTGTATGAAATTGAGGTTACATTGCAGAGTCTATTTTGACTCAACGTTGATAAATTTAGTTCTAAAAATGAGTCCAAAGCCTTCAAAGTAATGGATAGGTCTGTGAGAAGCTTCTAATGGTAAGCTTCTTTATTGATGCAGCCAACATGGATAACCCAGTGTCCCTTGCTGTTGCTTGACACTAGAATGCCATATGGCAGTTTGTCAGTTGGTTGTGAAGAGCACTTAGATCGAGCTGGTACAGGCTCATTATACAATGAAGGCGAGGCAGATATTGTCGTGCAACATGTCTGCTCATTGATTTATTCTGGTAAGTGTTCTTTATATGTCAATATCGATAATGTTATGTGATAGTGTCTATTATAAGTAGATTAGGGAAGCAAATGACAAAGGTGAATGTTAGTGGGGATACTGAAAAGATTGAATCTCTNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTAGTGAAGATTGGCTATTTTTGATATTCTCCAATTTGTACGCATCTTGACTACCTTCACCAGACAACAGCTTGACCCTACTATAGTTTGTTGACATGGAGACATGTAGGATATTAGATCCTAGGTAGATGACCACCATGGATCGAACTAATTCCTTCTTATTTTCTCAAGGCCCTTATAGCCACTAGGTGAACCCATGATGG
TTGATTTTCTCTATGATGTTGATGGGGAAGATTGTTGTTTGAGAAATGATTCATTTTTTGACACTGAAAATATAAACGATGTACATCTAAAGGATGTGCATGAAGTAGCATCTGAAGAATGGAAACTAATTTTAGATTCAACTCTAATACAAAATTTGTTTCATATAAACAAAATCAGGTCAAGTTTTAAAAATGTCAGAGTTTAGGACCCGTATGATTTGTCACTCAAAACTGTATGATTGACAACATAGAAAACAAAGGAATGAAGCATTGGAAAACTAATAAAATCTTGAAATATATTTTGTGAGAACATCCTATATTCTTTCTCCAACGAAAGAGTAATTATGATAAGAAATGACATGTCCAAAACAAAGGGGTACAAAATAAAGAGATGAGATCTAACTCCAAAGCTTGCCATGGTTGGCTTACAAAATGGCGTTCTATTTTGTAATAACTGGAGCTGGGGAAATGCCTAACATAGCCACAAATAAATTATAGGATGGCCAAAACTTCTGTTCCTTTCACCCATATAACCCAAGTAATAGCCATAGCTTGGTTAGCAAAAACAATCTTAGCTTTTGCTTTGAATTTATGTTGGAAGATTAGGTGGGTAAGAATAATTTAAAAAGCGCTAGGTGTTATTCATTTATAATTATTTATGGGTATGCCTGTGTATATTTTTTCATGTTTCTCAATGAAAGTTCGGCTTCTTATTAAAAATAATAATAATTAAAGAGCTGGTTAAATAGCCTTTACAGTTAAATTTGTCTAGATTCTTATCCAAAAATGGAAGATGGTGAAAATAGGTTACTCTATCCTGCTAGTGAGAGTTCAACTGAGAAATATATAATCTTAGATTTCATTATAATTGAAGCAAGAAGGGCATTAGTTCAGATACAAGAATACCGTCAGATTTCTCTTTCACCCATATCCATGGATTGATTTTGTGCCATGGCTTAGAGACCAATGAAAAAAATTAAATTAATTGGAATCTTGAGCTAGGGTAGAAATGTAAGGGCAATATGTATCTGCGACTTGTTGAGGAGTTAATAGAACGTGAAATTAACAAGGAGACTATCAGATTCTTCTTTCCAGCCTTTATTAAGGAGGAACAATCCAATATCATAAGGAGAGATGGCCATTCTACAATTTATCAAATATTTAGATTGAATACCTGCCCAATCCTTTTGATCTCATTGATAATCAATAATGGTTTCCAGCTAAGAAACCCTAGGGATTTTGGGGTTGAATGATGTATCACAGATTTGCATTCGTAAAACTTCCCAAAATTTGCTCTCTTCACTGAAGTAGTCCTGCAGTCAGCAACGGTGGATCATGAACTTCTGAAGTTGATTTTAGTTAAGTGACTGGTGTTTTGATCAAAGGGCAGGAAAACTATTAGACTCTTTAGATATTTAATCCTCATAGCTGCCAAGCTTCCATTAGAAATCCGGCATATGCTTGGAGAAGGATACCACGCAGTTTATCTTATTCAAAAGCAACTGGAGGTGGAGCAATGAATACTTGTGAGGTCTAGTATGAAGATGCCATTGAACTACAAGTTGGGCTGAGAAGCCAATCACTTTAGTTTAAGTAGCAAACTCCCTGCCGTTGGATCAAGTAGTCAAATTTCCTTAAAAAGAGACACATTACGGAAGGGACTTCAATGAGTCCAGTTAATACTCTCATACTTCTGGTACTAGCTTTGTCATAAGATTTCTTGAGGTTAATCTTAGTAATCTTACAATGAGGTATATTTATTGTCATCTCCTTTTTTTTTTTTAAAAAAAAAAAAAATCATTTTAACTTTAACTATTTCGATTTTTTTTTTATTGGTTACCTTGGCTTTTCTTGGGTTGATGTCTCTCCCCTCGTGTTGTTTGTATAAACCCTTTTTATTAATAAGAACTGAATGAGTTTGTTTCTAATCTTATAAAATATCATTTACATTAAAAACTGTGAAATAGAGAACCCCAAGATCCTCAATTTCCCTCAAGGAT
GAATTACTAAAGTTTGGTGTATATGTAGCAATCCTCCGATATTTGAATACCTGCAATCCTTGTATGAAGTTTGGTGTATATATAAGGATAACTTAAGCTTGGTCCCTTGGATGTACTTTTCTGTGATTTAATTGATACAATCATGGAAGATATGTTATTGAAATTATCTGTTACAGGTGTCAGTCCAAGAGCAATTGCAGTCCAGTCTCCTTATGTTGCTCAGGTACAGCTATTGAGGAACAGGCTTGATGAAATTCCTGAAGCTACTGGCATTGAGGTAGCGACTATTGATAGCTTCCAAGGCCGAGAGGCAGATGCAGTAATCATATCAATGGTAAGATACATATGATTTGCAACACGTTTTTTTTCCTGACTGGGTGTGAATTACACGTCTACACCCTGATACACCTACATGGGCATTTTTTTGATCTTTTCTCTCTTCCACTTATTAATTGACACTGGTATAATTGGAATGCTAGCTCTGGCTGACTCATTTTATCCACATTATTTGAGACTGCTACATGTATCTTCTTCAAGGCAGAACCATTCTCAGTATTGAGAATTTCCTTTTTAAAAAAAGAACCATATGTACCTGAGTGGGCTTATTACATGTTTTCCAACAATCCTGACTTTAAAACATTTTCTGGAGAACCTATAGGATATTGTTACCTTCCCTTTATCTTCTTGCCCAATATCAAACCGTTGTACTAGTTCCATTTGTTGGTTGCAAATAATTCTAGCCTCTTCTCTGTATTAGCCACTACTGGTGTGAATGTTCTCAATTCACTACATAATAGGCTTAGACCAAGTCTAAAGTGAGATCATGCAAATGAATTAAGGCAGATACAATAGTTTCCATTTTAGGGACAGGCTTTGCTGCTGGTATTATATTGGTGAATTAACTTTGCGAGGATAAAGGATATTCATGTGCGTGAGCTAATCAATTTTCTGGTGTTTCATTGTAGCATCCTTATTTCTGCCTAGATTTTTATCTTCGACCCATTGACAAACTCTCCAAAAAAGCGTTCAATCTATATGTTGTCGACATCCTCGTACCATCACAGTTGTCCAATGGGCGTGGAATATTTCACATAATGTTCCCTCCCATGCAGTCTAAACTGACTCAAAAGTTGGTGATGTAGGTTCACCTAGCTTCAGGATGGTCTTTCGCCTTCCTCATGAAAGCAGAACACTTCTCCTTGAAGCAATTGTTATTGCTTCAATCATGCCTCTTTTTGCCCAGAATTACCTCTCTGCTCTAAACAGATAAACACCTGGCTCTTCTCCATTAAACATTCACGTCCAATCAGCTGATCAGTTATCGACCAATTCTCTCATCAGCTATAACCATCTCCTTCGCTTCAGCAGACGTTTGTTCAATAGGAGCTTTTCCTTTGTCTTCTTGTCTCGGTGTAGCTTCTCTTACCAATTTTGAATTGTGTGACGTCGTCATTTATTTCATCACAACTGCCATGTGCTTCTCATCATCAGGCTCTAGTGCATTCATCCCTTATAGTTTGGTCTGGGTCACACTAATGGCACTAGAGATTTTCTCTTGTATTTTTTATTTTTTAAAAGCTTTTCTCCCATTGCCTGGGTTCTGCTCTCCATCTTATTTAAGGAATTCAAAATTTTGATACCACTACAGTAAGATTCGAAGTAGTGGTTTTGTCAATTACCCAAAACTGCTCCTCACTCTTCCTTTCAAAGCTGTTTACACACTCTCATTCTAGTTAGTCACATGGGTCCAGCATTATATTTGCTGTAACTACTTTCCGTTTCTATCTCCTATTTCCCTTTACCTAGCACTTTCTTAGATCTCATCTAAAGAGTAGACAATGTTCCATTACCTTTGTGTACAGAGCTATCAAAATTCTATGCCATTGCTTGATCTAAATGATGAGCTACTCGTGACTGCTGGAAATTAAAAAACCTCAACCAGAAGCTGAATTAAAGAAAAAAATTGTTTGATTGGAGTTTAAAAAATTGTTAGCTCCAAC
TGCTAAGCTAAAGGATATTTGTTTGATCGTATATTGTAACTGTATGGAGGCTTCTCCACGCTTGCAGGATGGTTCTGCCCTATCATTCTTTTCACAAAATCATGTTTATGGCAAAAATCTGTCTTGATTTTAATTTTTATGAAGGGTCATATATGGAGCATTCAAGGTGTCAATAATGCTTTTACAATGGAAAGTTTTTATAATGTTCTTTGCATATTCTAACAGGTAAGGTCAAACAATCTTGGAGCTGTTGGATTTTTAGGAGACAGTCGACGGATGAACGTGGCCATAACTAGGGCAAGAAAACACATAGCACTGGTATGTGATAGCTCGACGATTTGTCAAAATACGTTCTTGGCAAGGCTATTACGTCATATTCGTTATTTTGGAAGAGTCAAGCATGCAGAACCAGGTAATTTTGGAGGATCTGGTCTTGGAATGAATCCAATGTTGCCATCCATTAATTAGCATTGCGTGACCATTGAACTTTGCTTTCAAGCGAAGGAAAGAAAAATTGTGTTCCATACTTGATTATTGTACAGCATTCCTTTTGCCTTAGGAAAGAAAATCACCCAAAACTATTGATAACTTGTATATAAATTTTGAAAGTATAGTGATGATACATTTTTACATGAACATTGTTTAATCTCATCATGATATTCAGAAATGAGTCAATGCATTGATCCAAGTGGTCGTTTGTTTGA

mRNA sequence

TGCCACGAAAGCGACCTCGCTGGCGCTTTCGAGCTCCCGAAGAAAATCTCTCGTCGAAAGAATCGTCGATGTGTTCTTCTCTTCCTCACCTCAAATTCACTGCGCGGCTTAAATTTGCAGAACCCATTTCCCGTCCGCCATTGCCTTGTCTTGAAGATTTTTCCGCCGCTTCTACGTAATGCAACGGTTTCCTCACGTTTTCTTGTTTCAACTTCGATTGATTTCTTCTCTTTGAGTGTTTTTCTCTTCGATTTTGAGTGTTGGTGCGATGAATGCGCCAACGTCGATTCCGTTGTTTCGCCAGAATCACACGGCCGTAACTGTTTCTTTCCAGCAGTTTGTTCAGACTGTTAATGACGCTAATCATCCCAGTGGAGCGCAGAAGAGGGTTCGTGTTGTCAAAAGTAAGAAGAATGTGAAGAAACCCAATATTCTCGAGGTTTCTTCTCCGTCTACTGCTAATCGATCTGCTGGTGCTAGAATCAGCATCAGTACCAGTGGTTCAGTCGGTTCGGAGACGAAGGCGCGACCGAAGCGGTCGCCTCTGGGTGAACAGGAAGGGAAGAAGAAGAGTGATCGGGCGGTTAACTTGCATGGTATCTATCAGAACGGGGATCCTCTAGGGCGGAGGGAGCTGGGGAAGAGTGTAGTTCAGTGGATTGGGCAGGCCATGCAAGCAATGGCCTCTGATTTCGCTTCTGCGGACGTCAACGGAGATTTCTCCGAGCTCCGGCAGCAGATGGGACCGGGGCTTACTTTTGTGATTCAAGCCCAACCGTATCTGAATGCGGTGCCTATGCCCCTTGGACTTGAAGCCGTCTGTTTGAAAGCTTCTACTCACTATCCCACTCTCTTTGACCATTTCCAGAGGGAGCTCAGGGATGCCCTCCAAGATCTCCAAAGCAAGTCGCTGATTCTAGATTGGCGCGAAACTCAATCATGGAAGCTCCTCAAGGAGCTTGCTAATTCAGCTCAGCATAAAGCTATAGCACGTAAGATAAGCCAGCCGAAGGCTGTCCAAGGTGCTTTAGGGATGGACCTGGAGAAGGCCAAGGCTTTACAGAGCAGGATTGATGAGTTTGTAAACCGCATGTCTGAGTTACTTCGCATTGAGAGAGATTCCGAATTGGAGTTTACACAAGAGGAGTTGAATGCGGTCCCTACACCAGATGAGGGTTCGGATAATTCTAAACCCATTGAGTTCTTAGTCAGTCATGGCCAAGCACAGCAAGAACTCTGTGACACGATATGCAATTTGAATGCAGTTAGCACGTTTACAGGATTAGGGGGGATGCATTTAGTATTATTCAGAGTTGAAGGAAACCATAGATTACCACCTACAACCCTTTCGCCGGGAGATATGGTTTGTGTGAGGGTTTGCGATAGCAGGGGTGCTGGTGCAACTTCTTGCATGCAAGGGTTTGTGAACAATCTGGGGGATGATGGGTGCAGCATCACTGTTGCTCTAGAATCTCGTCATGGTGACCCTACCTTTTCAAAGCTCTTTGGAAAGTCCGTGCGTATTGATCGTATTCCAGGGTTAGCTGACACTCTCACTTATGAGCGCAATTGTGAAGCCTTAATGTTGCTTCAGAAAAATGGTTTGCGAAAGAAAAATCCTTCTTCTGCTGTAGTGGCTACATTATTTGGCGATGAAGAAGACGTCAAGTGGATGGAAGATAATAACTTAATAGATCTAGCTCATACCAACCTGAATGACATAGTCCTCAATGGAGATTTTGATGATTCACAAAAAGGAGCAATTTCGTTTGCTTTGAATAAAAAGCGGCCGATATTGATAATCCAAGGGCCACCTGGCACTGGAAAGACAGGTCTGCTAAAGGAGCTTATTGCACTTGCTGTTCAACAAGGTGAAAGGGTGCTTGTAACAGCACCTACTAATGCAGCTGTTGACAACATGGTTGAAAAACTCTCAAACATTGGGATAAACATCGTTAGGGTAGGAAATCCGGCACGGATATCTTCAAGTGTTGCATCCA
AGTCCTTGGCTGAAATTGTCAACACTAAACTTGCAAGTTTTAGAACAGATATTGAAAGGAAGAAGGCAGACCTAAGGAAAGACTTGAGACACTGTTTAAAGGACGATTCATTGGCTGCTGGCATACGCCAGCTTCTGAAGCAGCTTGGGAAGACATTAAAAAAGAAAGAGAAGGAAACTGTGAAGGAAGTACTTTCAAATGCCCAAGTTGTTCTTGCTACCAACACTGGAGCGGCTGATCCTTTAATTCGGACGCTGGAGAAATTTGATCTAGTTGTTATAGATGAGGCAGGTCAGGCAATTGAGCCAGCTTGCTGGATTCCAATATTACAGGGACACCGTTGTATTCTTGCAGGCGATCAATGCCAACTTGCTCCTGTGATTTTGTCTAGAAAAGCCCTGGAAGGTGGTCTTGGAGTATCATTGCTGGAGCGGGCTTCAACCTTGCATCAGGGGGCTTTAACCAAAATGTTAACAATACAATACCGGATGAATGACGCAATAGCTAGTTGGGCTTCAAAGGAAATGTATGGTGGAATGTTGAAGTCCTCACCAACAGTCTCTTCTCATCTTCTTGTAAACTCTCCGTTTGTCAAGCCAACATGGATAACCCAGTGTCCCTTGCTGTTGCTTGACACTAGAATGCCATATGGCAGTTTGTCAGTTGGTTGTGAAGAGCACTTAGATCGAGCTGGTACAGGCTCATTATACAATGAAGGCGAGGCAGATATTGTCGTGCAACATGTCTGCTCATTGATTTATTCTGGTGTCAGTCCAAGAGCAATTGCAGTCCAGTCTCCTTATGTTGCTCAGGTACAGCTATTGAGGAACAGGCTTGATGAAATTCCTGAAGCTACTGGCATTGAGGTAGCGACTATTGATAGCTTCCAAGGCCGAGAGGCAGATGCAGTAATCATATCAATGGTAAGGTCAAACAATCTTGGAGCTGTTGGATTTTTAGGAGACAGTCGACGGATGAACGTGGCCATAACTAGGGCAAGAAAACACATAGCACTGGTATGTGATAGCTCGACGATTTGTCAAAATACGTTCTTGGCAAGGCTATTACGTCATATTCGTTATTTTGGAAGAGTCAAGCATGCAGAACCAGGTAATTTTGGAGGATCTGGTCTTGGAATGAATCCAATGTTGCCATCCATTAATTAGCATTGCGTGACCATTGAACTTTGCTTTCAAGCGAAGGAAAGAAAAATTGTGTTCCATACTTGATTATTGTACAGCATTCCTTTTGCCTTAGGAAAGAAAATCACCCAAAACTATTGATAACTTGTATATAAATTTTGAAAGTATAGTGATGATACATTTTTACATGAACATTGTTTAATCTCATCATGATATTCAGAAATGAGTCAATGCATTGATCCAAGTGGTCGTTTGTTTGA

Coding sequence (CDS)

ATGAATGCGCCAACGTCGATTCCGTTGTTTCGCCAGAATCACACGGCCGTAACTGTTTCTTTCCAGCAGTTTGTTCAGACTGTTAATGACGCTAATCATCCCAGTGGAGCGCAGAAGAGGGTTCGTGTTGTCAAAAGTAAGAAGAATGTGAAGAAACCCAATATTCTCGAGGTTTCTTCTCCGTCTACTGCTAATCGATCTGCTGGTGCTAGAATCAGCATCAGTACCAGTGGTTCAGTCGGTTCGGAGACGAAGGCGCGACCGAAGCGGTCGCCTCTGGGTGAACAGGAAGGGAAGAAGAAGAGTGATCGGGCGGTTAACTTGCATGGTATCTATCAGAACGGGGATCCTCTAGGGCGGAGGGAGCTGGGGAAGAGTGTAGTTCAGTGGATTGGGCAGGCCATGCAAGCAATGGCCTCTGATTTCGCTTCTGCGGACGTCAACGGAGATTTCTCCGAGCTCCGGCAGCAGATGGGACCGGGGCTTACTTTTGTGATTCAAGCCCAACCGTATCTGAATGCGGTGCCTATGCCCCTTGGACTTGAAGCCGTCTGTTTGAAAGCTTCTACTCACTATCCCACTCTCTTTGACCATTTCCAGAGGGAGCTCAGGGATGCCCTCCAAGATCTCCAAAGCAAGTCGCTGATTCTAGATTGGCGCGAAACTCAATCATGGAAGCTCCTCAAGGAGCTTGCTAATTCAGCTCAGCATAAAGCTATAGCACGTAAGATAAGCCAGCCGAAGGCTGTCCAAGGTGCTTTAGGGATGGACCTGGAGAAGGCCAAGGCTTTACAGAGCAGGATTGATGAGTTTGTAAACCGCATGTCTGAGTTACTTCGCATTGAGAGAGATTCCGAATTGGAGTTTACACAAGAGGAGTTGAATGCGGTCCCTACACCAGATGAGGGTTCGGATAATTCTAAACCCATTGAGTTCTTAGTCAGTCATGGCCAAGCACAGCAAGAACTCTGTGACACGATATGCAATTTGAATGCAGTTAGCACGTTTACAGGATTAGGGGGGATGCATTTAGTATTATTCAGAGTTGAAGGAAACCATAGATTACCACCTACAACCCTTTCGCCGGGAGATATGGTTTGTGTGAGGGTTTGCGATAGCAGGGGTGCTGGTGCAACTTCTTGCATGCAAGGGTTTGTGAACAATCTGGGGGATGATGGGTGCAGCATCACTGTTGCTCTAGAATCTCGTCATGGTGACCCTACCTTTTCAAAGCTCTTTGGAAAGTCCGTGCGTATTGATCGTATTCCAGGGTTAGCTGACACTCTCACTTATGAGCGCAATTGTGAAGCCTTAATGTTGCTTCAGAAAAATGGTTTGCGAAAGAAAAATCCTTCTTCTGCTGTAGTGGCTACATTATTTGGCGATGAAGAAGACGTCAAGTGGATGGAAGATAATAACTTAATAGATCTAGCTCATACCAACCTGAATGACATAGTCCTCAATGGAGATTTTGATGATTCACAAAAAGGAGCAATTTCGTTTGCTTTGAATAAAAAGCGGCCGATATTGATAATCCAAGGGCCACCTGGCACTGGAAAGACAGGTCTGCTAAAGGAGCTTATTGCACTTGCTGTTCAACAAGGTGAAAGGGTGCTTGTAACAGCACCTACTAATGCAGCTGTTGACAACATGGTTGAAAAACTCTCAAACATTGGGATAAACATCGTTAGGGTAGGAAATCCGGCACGGATATCTTCAAGTGTTGCATCCAAGTCCTTGGCTGAAATTGTCAACACTAAACTTGCAAGTTTTAGAACAGATATTGAAAGGAAGAAGGCAGACCTAAGGAAAGACTTGAGACACTGTTTAAAGGACGATTCATTGGCTGCTGGCATACGCCAGCTTCTGAAGCAGCTTGGGAAGACATTAAAAAAGAAAGAGAAGGAAACTGTGAAGGAAGTACTTTCAAATGCCCAAGTTGTTCTTGCTACCAACACTGGAGCGGCTGATCCTTTAATTCGGACGCTGGAGAAATTTGA
TCTAGTTGTTATAGATGAGGCAGGTCAGGCAATTGAGCCAGCTTGCTGGATTCCAATATTACAGGGACACCGTTGTATTCTTGCAGGCGATCAATGCCAACTTGCTCCTGTGATTTTGTCTAGAAAAGCCCTGGAAGGTGGTCTTGGAGTATCATTGCTGGAGCGGGCTTCAACCTTGCATCAGGGGGCTTTAACCAAAATGTTAACAATACAATACCGGATGAATGACGCAATAGCTAGTTGGGCTTCAAAGGAAATGTATGGTGGAATGTTGAAGTCCTCACCAACAGTCTCTTCTCATCTTCTTGTAAACTCTCCGTTTGTCAAGCCAACATGGATAACCCAGTGTCCCTTGCTGTTGCTTGACACTAGAATGCCATATGGCAGTTTGTCAGTTGGTTGTGAAGAGCACTTAGATCGAGCTGGTACAGGCTCATTATACAATGAAGGCGAGGCAGATATTGTCGTGCAACATGTCTGCTCATTGATTTATTCTGGTGTCAGTCCAAGAGCAATTGCAGTCCAGTCTCCTTATGTTGCTCAGGTACAGCTATTGAGGAACAGGCTTGATGAAATTCCTGAAGCTACTGGCATTGAGGTAGCGACTATTGATAGCTTCCAAGGCCGAGAGGCAGATGCAGTAATCATATCAATGGTAAGGTCAAACAATCTTGGAGCTGTTGGATTTTTAGGAGACAGTCGACGGATGAACGTGGCCATAACTAGGGCAAGAAAACACATAGCACTGGTATGTGATAGCTCGACGATTTGTCAAAATACGTTCTTGGCAAGGCTATTACGTCATATTCGTTATTTTGGAAGAGTCAAGCATGCAGAACCAGGTAATTTTGGAGGATCTGGTCTTGGAATGAATCCAATGTTGCCATCCATTAATTAG

Protein sequence

MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSSPSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGRRELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLGLEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAIARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTPDEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHTNLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLIRTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPMLPSIN
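
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is only a sketch: it assumes the standard genetic code (NCBI translation table 1), which the page does not name, and the function name `translate` is hypothetical. The two test strings are the first 30 and the last 15 nucleotides of the CDS shown above.

```python
# Standard genetic code (NCBI table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing anything other than A, C, G, T become 'X'.
    Trailing bases that do not fill a codon are ignored.
    """
    protein = []
    usable = len(cds) - len(cds) % 3
    for pos in range(0, usable, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3].upper(), "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAATGCGCCAACGTCGATTCCGTTGTTT"))  # start of the CDS
print(translate("CCATCCATTAATTAG"))                 # end of the CDS, with stop
```

The first call yields MNAPTSIPLF and the second PSIN, matching the first ten and last four residues of the protein sequence listed above.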
Homology
BLAST of Cp4.1LG05g07010 vs. ExPASy Swiss-Prot
Match: P38935 (DNA-binding protein SMUBP-2 OS=Homo sapiens OX=9606 GN=IGHMBP2 PE=1 SV=3)

HSP 1 Score: 360.1 bits (923), Expect = 7.6e-98
Identity = 260/691 (37.63%), Positives = 376/691 (54.41%), Query Frame = 0

Query: 268 IDEFVNRMSELLRIERDSELEFTQEELNAVPTPDEGSDNSKPIEFLVSHGQAQQELCDTI 327
           ++ FV +  +LL +ERD+E+E  +     +            ++ L S G     +C  +
Sbjct: 6   VESFVTKQLDLLELERDAEVEERRSWQENI-----------SLKELQSRG-----VC--L 65

Query: 328 CNLNAVSTFTGLGGMHLVLF---RVEGNHRLPPTTLSPGDMVCVRVCDSRGAGATSCMQG 387
             L   S  TGL G  LV F   R      LP  + + GD+V +    + G+   +   G
Sbjct: 66  LKLQVSSQRTGLYGRLLVTFEPRRYGSAAALPSNSFTSGDIVGLYDAANEGSQLAT---G 125

Query: 388 FVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQKN 447
            +  +     S+TVA +  H D   S     S R+ +   LA+ +TY R  +AL+ L+K 
Sbjct: 126 ILTRVTQK--SVTVAFDESH-DFQLSLDRENSYRLLK---LANDVTYRRLKKALIALKK- 185

Query: 448 GLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHTNLNDIVLNGDFDDSQKGAISFALN 507
                 P+S+++  LFG        E + L             N   D SQK A+ FAL+
Sbjct: 186 --YHSGPASSLIEVLFGRSAPSPASEIHPL----------TFFNTCLDTSQKEAVLFALS 245

Query: 508 KKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINIVR 567
           +K  + II GPPGTGKT  + E+I  AV+QG +VL  AP+N AVDN+VE+L+     I+R
Sbjct: 246 QKE-LAIIHGPPGTGKTTTVVEIILQAVKQGLKVLCCAPSNIAVDNLVERLALCKQRILR 305

Query: 568 VGNPARISSSVASKSLAEIVNTKLASFRTDIERKKADLRKDLRHCL------KDDSLAAG 627
           +G+PAR+  S+   SL  ++       R+D  +  AD+RKD+          +D    + 
Sbjct: 306 LGHPARLLESIQQHSLDAVL------ARSDSAQIVADIRKDIDQVFVKNKKTQDKREKSN 365

Query: 628 IRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGA-ADPLIRTLEK--FDLVVIDEAG 687
            R  +K L K LK++E+  + E L++A VVLATNTGA AD  ++ L +  FD+VVIDE  
Sbjct: 366 FRNEIKLLRKELKEREEAAMLESLTSANVVLATNTGASADGPLKLLPESYFDVVVIDECA 425

Query: 688 QAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHQGALTKML 747
           QA+E +CWIP+L+  +CILAGD  QL P  +S KA   GL +SL+ER +  +   + + L
Sbjct: 426 QALEASCWIPLLKARKCILAGDHKQLPPTTVSHKAALAGLSLSLMERLAEEYGARVVRTL 485

Query: 748 TIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYG 807
           T+QYRM+ AI  WAS  MY G L +  +V+ HLL + P V  T  T  PLLL+DT     
Sbjct: 486 TVQYRMHQAIMRWASDTMYLGQLTAHSSVARHLLRDLPGVAATEETGVPLLLVDT----- 545

Query: 808 SLSVGCE-EHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRN 867
               GC    L+     S  N GE  +V  H+ +L+ +GV  R IAV SPY  QV LLR 
Sbjct: 546 ---AGCGLFELEEEDEQSKGNPGEVRLVSLHIQALVDAGVPARDIAVVSPYNLQVDLLRQ 605

Query: 868 RLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHI 927
            L  +     +E+ ++D FQGRE +AVI+S VRSN  G VGFL + RR+NVA+TRAR+H+
Sbjct: 606 SL--VHRHPELEIKSVDGFQGREKEAVILSFVRSNRKGEVGFLAEDRRINVAVTRARRHV 639

Query: 928 ALVCDSSTICQNTFLARLLRHIRYFGRVKHA 946
           A++CDS T+  + FL  L+ +    G V+ A
Sbjct: 666 AVICDSRTVNNHAFLKTLVEYFTQHGEVRTA 639

BLAST of Cp4.1LG05g07010 vs. ExPASy Swiss-Prot
Match: Q60560 (DNA-binding protein SMUBP-2 OS=Mesocricetus auratus OX=10036 GN=IGHMBP2 PE=1 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 7.6e-98
Identity = 258/694 (37.18%), Positives = 370/694 (53.31%), Query Frame = 0

Query: 266 SRIDEFVNRMSELLRIERDSELEFTQEELNAVPTPDEGSDNSKPIEFLVSHGQAQQELCD 325
           S ++ FV +  ELL +ERD+E+E                     ++ L S G     +C 
Sbjct: 4   STVESFVAQQLELLELERDAEVE-----------ERRSWQEHSSLKELQSRG-----VC- 63

Query: 326 TICNLNAVSTFTGLGGMHLVLF---RVEGNHRLPPTTLSPGDMVCVRVCDSRGAGATSCM 385
            +  L   S  TGL G  LV F   ++     LP  + + GD+V +   +     AT  +
Sbjct: 64  -LLKLQVSSQCTGLYGQRLVTFEPRKLGPVVVLPSNSFTSGDIVGLYDANESSQLATGVL 123

Query: 386 QGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQ 445
                       S+TVA +  H      +L        R+  LA+ +TY+R  +ALM L+
Sbjct: 124 TRITQK------SVTVAFDESHD----FQLNLDRENTYRLLKLANDVTYKRLKKALMTLK 183

Query: 446 KNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHTNLNDIVLNGDFDDSQKGAISFA 505
           K       P+S+++  L G        E                 N   D SQK A+SFA
Sbjct: 184 K---YHSGPASSLIDVLLGGSSPSPTTEIPPF----------TFYNTALDPSQKEAVSFA 243

Query: 506 LNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINI 565
           L +K  + II GPPGTGKT  + E+I  AV+QG ++L  AP+N AVDN+VE+L+     I
Sbjct: 244 LAQKE-VAIIHGPPGTGKTTTVVEIILQAVKQGLKILCCAPSNVAVDNLVERLALCKKRI 303

Query: 566 VRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKADLRKDLRHCL------KDDSLA 625
           +R+G+PAR+  S    SL  ++       R+D  +  AD+RKD+          +D    
Sbjct: 304 LRLGHPARLLESAQQHSLDAVL------ARSDNAQIVADIRKDIDQVFGKNKKTQDKREK 363

Query: 626 AGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGA-ADPLIRTLEK--FDLVVIDE 685
           +  R  +K L K LK++E+  + + L+ A VVLATNTGA +D  ++ L +  FD+VV+DE
Sbjct: 364 SNFRNEIKLLRKELKEREEAAIVQSLTAADVVLATNTGASSDGPLKLLPENHFDVVVVDE 423

Query: 686 AGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHQGALTK 745
             QA+E +CWIP+L+  +CILAGD  QL P  +S KA   GL  SL+ER    H     +
Sbjct: 424 CAQALEASCWIPLLKAPKCILAGDHRQLPPTTISHKAALAGLSRSLMERLVEKHGAGAVR 483

Query: 746 MLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWITQCPLLLLDTRMP 805
           MLT+QYRM+ AI  WAS+ MY G L + P+V+ HLL + P V  T  T  PLLL+DT   
Sbjct: 484 MLTVQYRMHQAITRWASEAMYHGQLTAHPSVAGHLLKDLPGVADTEETSVPLLLIDT--- 543

Query: 806 YGSLSVGCE-EHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLL 865
                 GC    LD   + S  N GE  +V  H+ +L+ +GV    IAV +PY  QV LL
Sbjct: 544 -----AGCGLLELDEEDSQSKGNPGEVRLVTLHIQALVDAGVHAGDIAVIAPYNLQVDLL 603

Query: 866 RNRL-DEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRAR 925
           R  L ++ PE   +E+ ++D FQGRE +AVI++ VRSN  G VGFL + RR+NVA+TRAR
Sbjct: 604 RQSLSNKHPE---LEIKSVDGFQGREKEAVILTFVRSNRKGEVGFLAEDRRINVAVTRAR 638

Query: 926 KHIALVCDSSTICQNTFLARLLRHIRYFGRVKHA 946
           +H+A++CDS T+  + FL  L+ +    G V+ A
Sbjct: 664 RHVAVICDSRTVNNHAFLKTLVDYFTEHGEVRTA 638

BLAST of Cp4.1LG05g07010 vs. ExPASy Swiss-Prot
Match: P40694 (DNA-binding protein SMUBP-2 OS=Mus musculus OX=10090 GN=Ighmbp2 PE=1 SV=1)

HSP 1 Score: 357.1 bits (915), Expect = 6.4e-97
Identity = 257/694 (37.03%), Positives = 372/694 (53.60%), Query Frame = 0

Query: 266 SRIDEFVNRMSELLRIERDSELEFTQEELNAVPTPDEGSDNSKPIEFLVSHGQAQQELCD 325
           S ++ FV +  +LL +ERD+E+E                     +  L S G     +C 
Sbjct: 4   STVESFVAQQLQLLELERDAEVE-----------ERRSWQEHSSLRELQSRG-----VC- 63

Query: 326 TICNLNAVSTFTGLGGMHLVLF---RVEGNHRLPPTTLSPGDMVCVRVCDSRGAGATSCM 385
            +  L   S  TGL G  LV F   +      LP  + + GD+V +   +     AT  +
Sbjct: 64  -LLKLQVSSQRTGLYGQRLVTFEPRKFGPAVVLPSNSFTSGDIVGLYDTNENSQLATGVL 123

Query: 386 QGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQ 445
                       S+TVA +  H      +L        R+  LA+ +TY+R  +ALM L+
Sbjct: 124 TRITQK------SVTVAFDESHD----LQLNLDRENTYRLLKLANDVTYKRLKKALMTLK 183

Query: 446 KNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHTNLNDIVLNGDFDDSQKGAISFA 505
           K       P+S+++  L G       ME   L             N   D SQK A+SFA
Sbjct: 184 K---YHSGPASSLIDILLGSSTPSPAMEIPPL----------SFYNTTLDLSQKEAVSFA 243

Query: 506 LNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINI 565
           L +K  + II GPPGTGKT  + E+I  AV+QG +VL  AP+N AVDN+VE+L+     I
Sbjct: 244 LAQKE-LAIIHGPPGTGKTTTVVEIILQAVKQGLKVLCCAPSNIAVDNLVERLALCKKRI 303

Query: 566 VRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKADLRKDLRHCL------KDDSLA 625
           +R+G+PAR+  SV   SL  ++       R+D  +  AD+R+D+          +D    
Sbjct: 304 LRLGHPARLLESVQHHSLDAVL------ARSDNAQIVADIRRDIDQVFGKNKKTQDKREK 363

Query: 626 AGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGA-ADPLIRTL--EKFDLVVIDE 685
              R  +K L K LK++E+  + + L+ A VVLATNTGA +D  ++ L  + FD+VV+DE
Sbjct: 364 GNFRSEIKLLRKELKEREEAAIVQSLTAADVVLATNTGASSDGPLKLLPEDYFDVVVVDE 423

Query: 686 AGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHQGALTK 745
             QA+E +CWIP+L+  +CILAGD  QL P  +S +A   GL  SL+ER +  H   + +
Sbjct: 424 CAQALEASCWIPLLKAPKCILAGDHRQLPPTTVSHRAALAGLSRSLMERLAEKHGAGVVR 483

Query: 746 MLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWITQCPLLLLDTRMP 805
           MLT+QYRM+ AI  WAS+ MY G   S P+V+ HLL + P V  T  T+ PLLL+DT   
Sbjct: 484 MLTVQYRMHQAIMCWASEAMYHGQFTSHPSVAGHLLKDLPGVTDTEETRVPLLLIDT--- 543

Query: 806 YGSLSVGCE-EHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLL 865
                 GC    L+   + S  N GE  +V  H+ +L+ +GV    IAV +PY  QV LL
Sbjct: 544 -----AGCGLLELEEEDSQSKGNPGEVRLVTLHIQALVDAGVQAGDIAVIAPYNLQVDLL 603

Query: 866 RNRL-DEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRAR 925
           R  L ++ PE   +E+ ++D FQGRE +AV+++ VRSN  G VGFL + RR+NVA+TRAR
Sbjct: 604 RQSLSNKHPE---LEIKSVDGFQGREKEAVLLTFVRSNRKGEVGFLAEDRRINVAVTRAR 638

Query: 926 KHIALVCDSSTICQNTFLARLLRHIRYFGRVKHA 946
           +H+A++CDS T+  + FL  L+ +    G V+ A
Sbjct: 664 RHVAVICDSHTVNNHAFLETLVDYFTEHGEVRTA 638

BLAST of Cp4.1LG05g07010 vs. ExPASy Swiss-Prot
Match: Q9EQN5 (DNA-binding protein SMUBP-2 OS=Rattus norvegicus OX=10116 GN=Ighmbp2 PE=1 SV=1)

HSP 1 Score: 354.8 bits (909), Expect = 3.2e-96
Identity = 255/692 (36.85%), Positives = 370/692 (53.47%), Query Frame = 0

Query: 268 IDEFVNRMSELLRIERDSELEFTQEELNAVPTPDEGSDNSKPIEFLVSHGQAQQELCDTI 327
           ++ FV +  +LL +ERD+E+E                     ++ L S G     +C  +
Sbjct: 6   VESFVAQQLQLLELERDAEVE-----------ERRSWQEHSSLKELQSRG-----VC--L 65

Query: 328 CNLNAVSTFTGLGGMHLVLF---RVEGNHRLPPTTLSPGDMVCVRVCDSRGAGATSCMQG 387
             L      TGL G  LV F   +      LP  + + GD+V +   +     AT  +  
Sbjct: 66  LKLQVSGQRTGLYGQRLVTFEPRKFGPAVVLPSNSFTSGDIVGLYDTNESSQLATGVLTR 125

Query: 388 FVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQKN 447
                     S+ VA +  H      +L        R+  LA+ +TY+R  +AL+ L+K 
Sbjct: 126 ITQK------SVIVAFDESHD----FQLNLDRENTYRLLKLANDVTYKRLKKALLTLKK- 185

Query: 448 GLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHTNLNDIVLNGDFDDSQKGAISFALN 507
                 P+S+++  L G        E   L             N   D SQK A+SFAL 
Sbjct: 186 --YHSGPASSLIDVLLGGSTPSPATEIPPL----------TFYNTTLDPSQKEAVSFALA 245

Query: 508 KKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINIVR 567
           +K  + II GPPGTGKT  + E+I  AV+QG +VL  AP+N AVDN+VE+L+     I+R
Sbjct: 246 QKE-VAIIHGPPGTGKTTTVVEIILQAVKQGLKVLCCAPSNIAVDNLVERLALCKKQILR 305

Query: 568 VGNPARISSSVASKSLAEIVNTKLASFRTDIERKKADLRKDLRHCL------KDDSLAAG 627
           +G+PAR+  SV   SL  ++       R+D  +  AD+R+D+          +D    + 
Sbjct: 306 LGHPARLLESVQQHSLDAVL------ARSDNAQIVADIRRDIDQVFGKNKKTQDKREKSN 365

Query: 628 IRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAA-DPLIRTL--EKFDLVVIDEAG 687
            R  +K L K LK++E+  + + LS A VVLATNTGA+ D  ++ L  + FD+VV+DE  
Sbjct: 366 FRNEIKLLRKELKEREEAAIVQSLSAADVVLATNTGASTDGPLKLLPEDYFDVVVVDECA 425

Query: 688 QAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHQGALTKML 747
           QA+E +CWIP+L+  +CILAGD  QL P  +S KA   GL  SL+ER +  H  A+ +ML
Sbjct: 426 QALEASCWIPLLKAPKCILAGDHKQLPPTTVSHKAALAGLSRSLMERLAEKHGAAVVRML 485

Query: 748 TIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYG 807
            +QYRM+ AI  WAS+ MY G L + P+V+ HLL + P V  T  T  PLLL+DT     
Sbjct: 486 AVQYRMHQAITRWASEAMYHGQLTAHPSVAGHLLKDLPGVADTEETSVPLLLIDT----- 545

Query: 808 SLSVGCE-EHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRN 867
               GC    L+   + S  N GE  +V  H+ +L+ +GV    IAV +PY  QV LLR 
Sbjct: 546 ---AGCGLLELEEEDSQSKGNPGEVRLVTLHIQALVDAGVQAGDIAVIAPYNLQVDLLRQ 605

Query: 868 RL-DEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKH 927
            L ++ PE   +E+ ++D FQGRE +AVI++ VRSN  G VGFL + RR+NVA+TRAR+H
Sbjct: 606 SLSNKHPE---LEIKSVDGFQGREKEAVILTFVRSNRKGEVGFLAEDRRINVAVTRARRH 638

Query: 928 IALVCDSSTICQNTFLARLLRHIRYFGRVKHA 946
           +A++CDS T+  + FL  L+ +    G V+ A
Sbjct: 666 VAVICDSHTVNNHAFLKTLVDYFTEHGEVRTA 638

BLAST of Cp4.1LG05g07010 vs. ExPASy Swiss-Prot
Match: O94247 (DNA polymerase alpha-associated DNA helicase A OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=hcs1 PE=3 SV=1)

HSP 1 Score: 275.0 bits (702), Expect = 3.2e-72
Identity = 212/656 (32.32%), Positives = 336/656 (51.22%), Query Frame = 0

Query: 284 DSELEFTQEELNAVPTPDEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMH 343
           D E+EF  E   +     E S    P+  L   G A       + NL      TG GG  
Sbjct: 20  DREIEFVDEAQKSEVDETEKSIKRFPLSVLQRKGLA-------LINLRIGVVKTGFGGKT 79

Query: 344 LVLFRVE----GNHRLPPTTLSPGDMVCVR-----VCDSRGAGATSCMQGFVNNLGDDGC 403
           ++ F  +        LP  + SPGD+V +R         R       ++G V  + +   
Sbjct: 80  IIDFEKDPAFSNGEELPANSFSPGDVVSIRQDFQSSKKKRPNETDISVEGVVTRVHER-- 139

Query: 404 SITVALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQKNGLRKKNPSSA 463
            I+VAL+S    P+       SV    +  L + +TYER    ++  +++    +N   +
Sbjct: 140 HISVALKSEEDIPS-------SVTRLSVVKLVNRVTYERMRHTMLEFKRSIPEYRN---S 199

Query: 464 VVATLFGDEEDVKWMEDNNLIDLAHTNLNDIVLNGDFDDSQKGAISFALNKKRPILIIQG 523
           +  TL G ++    ++   + D+ +        N + + SQK A+ F++  K  + +I G
Sbjct: 200 LFYTLIGRKKADVSIDQKLIGDIKY-------FNKELNASQKKAVKFSIAVKE-LSLIHG 259

Query: 524 PPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINIVRVGNPARISSS 583
           PPGTGKT  L E+I   V + +R+LV   +N AVDN+V++LS+ GI +VR+G+PAR+  S
Sbjct: 260 PPGTGKTHTLVEIIQQLVLRNKRILVCGASNLAVDNIVDRLSSSGIPMVRLGHPARLLPS 319

Query: 584 VASKSLAEIVNTKLASFRTDIERKKADLRKDLRHCL------KDDSLAAGIRQLLKQLGK 643
           +   SL  +  T       D+ R    + +D+  CL      K+      I + +++L K
Sbjct: 320 ILDHSLDVLSRT---GDNGDVIR---GISEDIDVCLSKITKTKNGRERREIYKNIRELRK 379

Query: 644 TLKKKEKETVKEVLSNAQVVLATNTGAADPLIRTLEKFDLVVIDEAGQAIEPACWIPILQ 703
             +K E +TV  ++S ++VV  T  GA    ++  ++FD V+IDEA QA+EP CWIP+L 
Sbjct: 380 DYRKYEAKTVANIVSASKVVFCTLHGAGSRQLKG-QRFDAVIIDEASQALEPQCWIPLLG 439

Query: 704 GHRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHQGALTK-MLTIQYRMNDAIAS 763
            ++ ILAGD  QL+P + S++       +S+ ER     QG L K  L IQYRM++ I+ 
Sbjct: 440 MNKVILAGDHMQLSPNVQSKRPY-----ISMFERL-VKSQGDLVKCFLNIQYRMHELISK 499

Query: 764 WASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDR 823
           + S   Y   L  +  V   LL++   V+ T +T  P+   DT   Y        E +  
Sbjct: 500 FPSDTFYDSKLVPAEEVKKRLLMDLENVEETELTDSPIYFYDTLGNY--QEDDRSEDMQN 559

Query: 824 AGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRNRLDEIPEATGIEV 883
               S  N  EA IV  H+  L+ +G+  + IAV +PY AQV L+R  L E  +   +E+
Sbjct: 560 FYQDSKSNHWEAQIVSYHISGLLEAGLEAKDIAVVTPYNAQVALIRQLLKE--KGIEVEM 619

Query: 884 ATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHIALVCDSSTI 924
            ++D  QGRE +A+I S+VRSN++  VGFL + RR+NVAITR ++H+ ++ DS+T+
Sbjct: 620 GSVDKVQGREKEAIIFSLVRSNDVREVGFLAEKRRLNVAITRPKRHLCVIGDSNTV 631

BLAST of Cp4.1LG05g07010 vs. NCBI nr
Match: XP_023533963.1 (DNA-binding protein SMUBP-2-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1877 bits (4863), Expect = 0.0
Identity = 965/965 (100.00%), Positives = 965/965 (100.00%), Query Frame = 0

Query: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60
           MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS
Sbjct: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60

Query: 61  PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120
           PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR
Sbjct: 61  PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120

Query: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180
           RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG
Sbjct: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180

Query: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240
           LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI
Sbjct: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240

Query: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300
           ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP
Sbjct: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300

Query: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360
           DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL
Sbjct: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360

Query: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420
           SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID
Sbjct: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420

Query: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT 480
           RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT
Sbjct: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT 480

Query: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540
           NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV
Sbjct: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540

Query: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600
           TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA
Sbjct: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600

Query: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660
           DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI
Sbjct: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660

Query: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720
           RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL
Sbjct: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720

Query: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780
           ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI
Sbjct: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780

Query: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840
           TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA
Sbjct: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840

Query: 841 VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900
           VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS
Sbjct: 841 VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900

Query: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960
           RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM
Sbjct: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960

Query: 961 LPSIN 965
           LPSIN
Sbjct: 961 LPSIN 965

BLAST of Cp4.1LG05g07010 vs. NCBI nr
Match: XP_022958504.1 (DNA-binding protein SMUBP-2-like [Cucurbita moschata])

HSP 1 Score: 1872 bits (4850), Expect = 0.0
Identity = 961/965 (99.59%), Positives = 963/965 (99.79%), Query Frame = 0

Query: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60
           MNAPTSIPLFRQNH AVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS
Sbjct: 1   MNAPTSIPLFRQNHIAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60

Query: 61  PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120
           PSTANRSAGARISISTSGS+GSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR
Sbjct: 61  PSTANRSAGARISISTSGSIGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120

Query: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180
           RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG
Sbjct: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180

Query: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240
           LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI
Sbjct: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240

Query: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300
           ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP
Sbjct: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300

Query: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360
           DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL
Sbjct: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360

Query: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420
           SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID
Sbjct: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420

Query: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT 480
           RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEED+KWMEDNNLIDLAHT
Sbjct: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDIKWMEDNNLIDLAHT 480

Query: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540
           NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV
Sbjct: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540

Query: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600
           TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA
Sbjct: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600

Query: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660
           DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI
Sbjct: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660

Query: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720
           RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL
Sbjct: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720

Query: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780
           ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI
Sbjct: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780

Query: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840
           TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA
Sbjct: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840

Query: 841 VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900
           VQSPYVAQVQLLRNRLDEIPEA GIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS
Sbjct: 841 VQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900

Query: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960
           RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM
Sbjct: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960

Query: 961 LPSIN 965
           LPSIN
Sbjct: 961 LPSIN 965

BLAST of Cp4.1LG05g07010 vs. NCBI nr
Match: XP_022995943.1 (DNA-binding protein SMUBP-2-like [Cucurbita maxima])

HSP 1 Score: 1870 bits (4845), Expect = 0.0
Identity = 959/965 (99.38%), Positives = 962/965 (99.69%), Query Frame = 0

Query: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60
           MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS
Sbjct: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60

Query: 61  PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120
           PSTANRSAGARISISTSGSVGSE KARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR
Sbjct: 61  PSTANRSAGARISISTSGSVGSEMKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120

Query: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180
           RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG
Sbjct: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180

Query: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240
           LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI
Sbjct: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240

Query: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300
           ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP
Sbjct: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300

Query: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360
           DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL
Sbjct: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360

Query: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420
           SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID
Sbjct: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420

Query: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT 480
           RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEED+KWMEDNNLIDLAHT
Sbjct: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDIKWMEDNNLIDLAHT 480

Query: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540
           NLNDIVLNGDFDDSQKGAISFALNKKRPILI+QGPPGTGKTGLLKELIALAVQQGERVLV
Sbjct: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIVQGPPGTGKTGLLKELIALAVQQGERVLV 540

Query: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600
           TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA
Sbjct: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600

Query: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660
           DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKE+LSNAQVVLATNTGAADPLI
Sbjct: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEILSNAQVVLATNTGAADPLI 660

Query: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720
           RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL
Sbjct: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720

Query: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780
           ERASTLHQG LTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI
Sbjct: 721 ERASTLHQGTLTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780

Query: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840
           TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA
Sbjct: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840

Query: 841 VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900
           VQSPYVAQVQLLRNRLDEIPEA GIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS
Sbjct: 841 VQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900

Query: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960
           RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM
Sbjct: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960

Query: 961 LPSIN 965
           LPSIN
Sbjct: 961 LPSIN 965

BLAST of Cp4.1LG05g07010 vs. NCBI nr
Match: KAG7035600.1 (DNA-binding protein SMUBP-2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1849 bits (4789), Expect = 0.0
Identity = 951/965 (98.55%), Positives = 956/965 (99.07%), Query Frame = 0

Query: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60
           MNAPTSIPLFRQNH AVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS
Sbjct: 1   MNAPTSIPLFRQNHIAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60

Query: 61  PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120
           PSTANRSAGARISISTSGS+GSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR
Sbjct: 61  PSTANRSAGARISISTSGSIGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120

Query: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180
           RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG
Sbjct: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180

Query: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240
           LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI
Sbjct: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240

Query: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300
           ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP
Sbjct: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300

Query: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360
           DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL
Sbjct: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360

Query: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420
           SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID
Sbjct: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420

Query: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT 480
           RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEED+KWMEDNNLIDLAHT
Sbjct: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDIKWMEDNNLIDLAHT 480

Query: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540
           NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV
Sbjct: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540

Query: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600
           TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA
Sbjct: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600

Query: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660
           DLRKDLRHCLKDDSLAAGIRQLLKQLGK+LKKKEKETVKEVLSNAQVVLATNTGAADPLI
Sbjct: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKSLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660

Query: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720
           R LEKFDLVVIDEAGQAIEPACWIPILQG RCILAGD+CQLAPVILSRKALEGGLGVSLL
Sbjct: 661 RKLEKFDLVVIDEAGQAIEPACWIPILQGRRCILAGDRCQLAPVILSRKALEGGLGVSLL 720

Query: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780
           ERASTLHQG LT MLTIQYRMN+AIASWASKEMY GMLKSSPTVSSHLLVNS FVKPTWI
Sbjct: 721 ERASTLHQGTLTTMLTIQYRMNNAIASWASKEMYDGMLKSSPTVSSHLLVNSLFVKPTWI 780

Query: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840
           TQCPLLLLDTRMPYGSLSV CEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA
Sbjct: 781 TQCPLLLLDTRMPYGSLSVDCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840

Query: 841 VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900
           VQSPYVAQVQLLRNRLDEIPEA GIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS
Sbjct: 841 VQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900

Query: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960
           RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM
Sbjct: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960

Query: 961 LPSIN 965
           LPSIN
Sbjct: 961 LPSIN 965

BLAST of Cp4.1LG05g07010 vs. NCBI nr
Match: KAG6605692.1 (DNA-binding protein SMUBP-2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1837 bits (4758), Expect = 0.0
Identity = 944/948 (99.58%), Positives = 946/948 (99.79%), Query Frame = 0

Query: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60
           MNAPTSIPLFRQNH AVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS
Sbjct: 1   MNAPTSIPLFRQNHIAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60

Query: 61  PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120
           PSTANRSAGARISISTSGS+GSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR
Sbjct: 61  PSTANRSAGARISISTSGSIGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120

Query: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180
           RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG
Sbjct: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180

Query: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240
           LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI
Sbjct: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240

Query: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300
           ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP
Sbjct: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300

Query: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360
           DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL
Sbjct: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360

Query: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420
           SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID
Sbjct: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420

Query: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT 480
           RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEED+KWMEDNNLIDLAHT
Sbjct: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDIKWMEDNNLIDLAHT 480

Query: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540
           NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV
Sbjct: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540

Query: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600
           TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA
Sbjct: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600

Query: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660
           DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI
Sbjct: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660

Query: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720
           RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL
Sbjct: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720

Query: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780
           ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI
Sbjct: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780

Query: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840
           TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA
Sbjct: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840

Query: 841 VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900
           VQSPYVAQVQLLRNRLDEIPEA GIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS
Sbjct: 841 VQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900

Query: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPG 948
           RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPG
Sbjct: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPG 948

BLAST of Cp4.1LG05g07010 vs. ExPASy TrEMBL
Match: A0A6J1H5A4 (DNA-binding protein SMUBP-2-like OS=Cucurbita moschata OX=3662 GN=LOC111459712 PE=3 SV=1)

HSP 1 Score: 1872 bits (4850), Expect = 0.0
Identity = 961/965 (99.59%), Positives = 963/965 (99.79%), Query Frame = 0

Query: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60
           MNAPTSIPLFRQNH AVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS
Sbjct: 1   MNAPTSIPLFRQNHIAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60

Query: 61  PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120
           PSTANRSAGARISISTSGS+GSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR
Sbjct: 61  PSTANRSAGARISISTSGSIGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120

Query: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180
           RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG
Sbjct: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180

Query: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240
           LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI
Sbjct: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240

Query: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300
           ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP
Sbjct: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300

Query: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360
           DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL
Sbjct: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360

Query: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420
           SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID
Sbjct: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420

Query: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT 480
           RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEED+KWMEDNNLIDLAHT
Sbjct: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDIKWMEDNNLIDLAHT 480

Query: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540
           NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV
Sbjct: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540

Query: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600
           TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA
Sbjct: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600

Query: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660
           DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI
Sbjct: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660

Query: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720
           RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL
Sbjct: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720

Query: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780
           ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI
Sbjct: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780

Query: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840
           TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA
Sbjct: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840

Query: 841 VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900
           VQSPYVAQVQLLRNRLDEIPEA GIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS
Sbjct: 841 VQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900

Query: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960
           RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM
Sbjct: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960

Query: 961 LPSIN 965
           LPSIN
Sbjct: 961 LPSIN 965

BLAST of Cp4.1LG05g07010 vs. ExPASy TrEMBL
Match: A0A6J1K9F5 (DNA-binding protein SMUBP-2-like OS=Cucurbita maxima OX=3661 GN=LOC111491308 PE=3 SV=1)

HSP 1 Score: 1870 bits (4845), Expect = 0.0
Identity = 959/965 (99.38%), Positives = 962/965 (99.69%), Query Frame = 0

Query: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60
           MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS
Sbjct: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60

Query: 61  PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120
           PSTANRSAGARISISTSGSVGSE KARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR
Sbjct: 61  PSTANRSAGARISISTSGSVGSEMKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120

Query: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180
           RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG
Sbjct: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180

Query: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240
           LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI
Sbjct: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240

Query: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300
           ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP
Sbjct: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300

Query: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360
           DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL
Sbjct: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360

Query: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420
           SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID
Sbjct: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420

Query: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT 480
           RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEED+KWMEDNNLIDLAHT
Sbjct: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDIKWMEDNNLIDLAHT 480

Query: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540
           NLNDIVLNGDFDDSQKGAISFALNKKRPILI+QGPPGTGKTGLLKELIALAVQQGERVLV
Sbjct: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIVQGPPGTGKTGLLKELIALAVQQGERVLV 540

Query: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600
           TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA
Sbjct: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600

Query: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660
           DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKE+LSNAQVVLATNTGAADPLI
Sbjct: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEILSNAQVVLATNTGAADPLI 660

Query: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720
           RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL
Sbjct: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720

Query: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780
           ERASTLHQG LTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI
Sbjct: 721 ERASTLHQGTLTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780

Query: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840
           TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA
Sbjct: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840

Query: 841 VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900
           VQSPYVAQVQLLRNRLDEIPEA GIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS
Sbjct: 841 VQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900

Query: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960
           RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM
Sbjct: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960

Query: 961 LPSIN 965
           LPSIN
Sbjct: 961 LPSIN 965

BLAST of Cp4.1LG05g07010 vs. ExPASy TrEMBL
Match: A0A5A7UKQ5 (DNA-binding protein SMUBP-2 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1728G00510 PE=3 SV=1)

HSP 1 Score: 1721 bits (4457), Expect = 0.0
Identity = 884/965 (91.61%), Positives = 922/965 (95.54%), Query Frame = 0

Query: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60
           M APTSI LFRQNHTAVTV+F QFVQT+N  N PSGAQ+R+RVVKSKKNVKKPN+LEVSS
Sbjct: 1   MTAPTSIHLFRQNHTAVTVAFHQFVQTINGVNQPSGAQRRIRVVKSKKNVKKPNVLEVSS 60

Query: 61  PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120
           PSTA     A+IS+STSGS+ SETKARPKR  L E   KKK+DR VN+ GIYQNGDPLGR
Sbjct: 61  PSTA-----AKISVSTSGSLASETKARPKRRELEE---KKKNDREVNVQGIYQNGDPLGR 120

Query: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180
           RELGKSVV+WIGQAMQAMASDFA+A+V GDFSEL+Q+MGPGLTFVIQAQ YLNAVPMPLG
Sbjct: 121 RELGKSVVRWIGQAMQAMASDFAAAEVQGDFSELQQRMGPGLTFVIQAQRYLNAVPMPLG 180

Query: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240
           LEAVCLKASTHYPTLFDHFQRELRD LQDLQ +SL LDWRETQSWKLLKELANS QHKAI
Sbjct: 181 LEAVCLKASTHYPTLFDHFQRELRDVLQDLQRQSLFLDWRETQSWKLLKELANSVQHKAI 240

Query: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300
           ARKIS+PK VQGALGMDL+KAKA+Q+RIDEF NRMSELLRIERDSELEFTQEELNAVPTP
Sbjct: 241 ARKISEPKVVQGALGMDLKKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELNAVPTP 300

Query: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360
           DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVST TGLGGMHLVLFRVEGNHRLPPTTL
Sbjct: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTL 360

Query: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420
           SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGK+VRID
Sbjct: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKTVRID 420

Query: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT 480
           RIPGLADTLTYERNCEALMLLQKNGL KKNPS AVVATLFGD++D+KWMEDNN+I LA T
Sbjct: 421 RIPGLADTLTYERNCEALMLLQKNGLHKKNPSIAVVATLFGDKDDIKWMEDNNVIGLADT 480

Query: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540
           NL+ IVLNGDFDDSQK AIS ALNKKRPILIIQGPPGTGKTGLLK+LIALAVQQGERVLV
Sbjct: 481 NLDGIVLNGDFDDSQKSAISRALNKKRPILIIQGPPGTGKTGLLKDLIALAVQQGERVLV 540

Query: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600
           TAPTNAAVDNMVEKLSN+GINIVRVGNPARISSSVASKSLAEIVN++L+SFRTDIERKKA
Sbjct: 541 TAPTNAAVDNMVEKLSNVGINIVRVGNPARISSSVASKSLAEIVNSELSSFRTDIERKKA 600

Query: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660
           DLRKDLR CLKDDSLAAGIRQLLKQLGK+LKKKEKETVKEVLSNAQVVLATNTGAADPLI
Sbjct: 601 DLRKDLRQCLKDDSLAAGIRQLLKQLGKSLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660

Query: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720
           R L+KFDLVVIDEAGQAIEPACWIPILQG RCILAGDQCQLAPVILSRKALEGGLGVSLL
Sbjct: 661 RKLDKFDLVVIDEAGQAIEPACWIPILQGRRCILAGDQCQLAPVILSRKALEGGLGVSLL 720

Query: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780
           ERA+TLH+GALT MLTIQYRMNDAIASWASKEMY G+LKSSPTVSSHLLVNSPFVKPTWI
Sbjct: 721 ERAATLHEGALTTMLTIQYRMNDAIASWASKEMYDGILKSSPTVSSHLLVNSPFVKPTWI 780

Query: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840
           TQCPLLLLDTRMPYGSLSVGCEE+LD AGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA
Sbjct: 781 TQCPLLLLDTRMPYGSLSVGCEEYLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840

Query: 841 VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900
           VQSPYVAQVQLLRNRLDEIPEA GIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS
Sbjct: 841 VQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900

Query: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960
           RRMNVAITRARKH+ALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM
Sbjct: 901 RRMNVAITRARKHVALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 957

Query: 961 LPSIN 965
           LPSIN
Sbjct: 961 LPSIN 957

BLAST of Cp4.1LG05g07010 vs. ExPASy TrEMBL
Match: A0A1S3CT28 (DNA-binding protein SMUBP-2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103504640 PE=3 SV=1)

HSP 1 Score: 1721 bits (4457), Expect = 0.0
Identity = 884/965 (91.61%), Positives = 922/965 (95.54%), Query Frame = 0

Query: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60
           M APTSI LFRQNHTAVTV+F QFVQT+N  N PSGAQ+R+RVVKSKKNVKKPN+LEVSS
Sbjct: 1   MTAPTSIHLFRQNHTAVTVAFHQFVQTINGVNQPSGAQRRIRVVKSKKNVKKPNVLEVSS 60

Query: 61  PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120
           PSTA     A+IS+STSGS+ SETKARPKR  L E   KKK+DR VN+ GIYQNGDPLGR
Sbjct: 61  PSTA-----AKISVSTSGSLASETKARPKRRELEE---KKKNDREVNVQGIYQNGDPLGR 120

Query: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180
           RELGKSVV+WIGQAMQAMASDFA+A+V GDFSEL+Q+MGPGLTFVIQAQ YLNAVPMPLG
Sbjct: 121 RELGKSVVRWIGQAMQAMASDFAAAEVQGDFSELQQRMGPGLTFVIQAQRYLNAVPMPLG 180

Query: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240
           LEAVCLKASTHYPTLFDHFQRELRD LQDLQ +SL LDWRETQSWKLLKELANS QHKAI
Sbjct: 181 LEAVCLKASTHYPTLFDHFQRELRDVLQDLQRQSLFLDWRETQSWKLLKELANSVQHKAI 240

Query: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300
           ARKIS+PK VQGALGMDL+KAKA+Q+RIDEF NRMSELLRIERDSELEFTQEELNAVPTP
Sbjct: 241 ARKISEPKVVQGALGMDLKKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELNAVPTP 300

Query: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360
           DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVST TGLGGMHLVLFRVEGNHRLPPTTL
Sbjct: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTL 360

Query: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420
           SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGK+VRID
Sbjct: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKTVRID 420

Query: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT 480
           RIPGLADTLTYERNCEALMLLQKNGL KKNPS AVVATLFGD++D+KWMEDNN+I LA T
Sbjct: 421 RIPGLADTLTYERNCEALMLLQKNGLHKKNPSIAVVATLFGDKDDIKWMEDNNVIGLADT 480

Query: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540
           NL+ IVLNGDFDDSQK AIS ALNKKRPILIIQGPPGTGKTGLLK+LIALAVQQGERVLV
Sbjct: 481 NLDGIVLNGDFDDSQKSAISRALNKKRPILIIQGPPGTGKTGLLKDLIALAVQQGERVLV 540

Query: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600
           TAPTNAAVDNMVEKLSN+GINIVRVGNPARISSSVASKSLAEIVN++L+SFRTDIERKKA
Sbjct: 541 TAPTNAAVDNMVEKLSNVGINIVRVGNPARISSSVASKSLAEIVNSELSSFRTDIERKKA 600

Query: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660
           DLRKDLR CLKDDSLAAGIRQLLKQLGK+LKKKEKETVKEVLSNAQVVLATNTGAADPLI
Sbjct: 601 DLRKDLRQCLKDDSLAAGIRQLLKQLGKSLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660

Query: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720
           R L+KFDLVVIDEAGQAIEPACWIPILQG RCILAGDQCQLAPVILSRKALEGGLGVSLL
Sbjct: 661 RKLDKFDLVVIDEAGQAIEPACWIPILQGRRCILAGDQCQLAPVILSRKALEGGLGVSLL 720

Query: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780
           ERA+TLH+GALT MLTIQYRMNDAIASWASKEMY G+LKSSPTVSSHLLVNSPFVKPTWI
Sbjct: 721 ERAATLHEGALTTMLTIQYRMNDAIASWASKEMYDGILKSSPTVSSHLLVNSPFVKPTWI 780

Query: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840
           TQCPLLLLDTRMPYGSLSVGCEE+LD AGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA
Sbjct: 781 TQCPLLLLDTRMPYGSLSVGCEEYLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840

Query: 841 VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900
           VQSPYVAQVQLLRNRLDEIPEA GIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS
Sbjct: 841 VQSPYVAQVQLLRNRLDEIPEAAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900

Query: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960
           RRMNVAITRARKH+ALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM
Sbjct: 901 RRMNVAITRARKHVALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 957

Query: 961 LPSIN 965
           LPSIN
Sbjct: 961 LPSIN 957
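
The percentages on an HSP statistics line follow directly from the residue counts it reports. The sketch below is illustrative only and is not part of the database output: the function name `parse_hsp_stats` and the regular expression are assumptions, and the pattern tolerates both spellings of the "Positives" label. It re-derives the values for the A0A1S3CT28 hit above.

```python
import re

# Hypothetical helper, not part of any BLAST tool.
# The optional "i" accepts both "Positives" and the "Postives" spelling.
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Recompute identity/positives percentages from the raw counts."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        # Percentages derived from the counts, rounded to two decimals
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the report, for comparison
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


# Statistics line copied from the A0A1S3CT28 alignment
STATS_LINE = "Identity = 884/965 (91.61%), Positives = 922/965 (95.54%), Query Frame = 0"
stats = parse_hsp_stats(STATS_LINE)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a quick consistency check when copying hit statistics into other tables.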

BLAST of Cp4.1LG05g07010 vs. ExPASy TrEMBL
Match: A0A0A0KL45 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G172850 PE=3 SV=1)

HSP 1 Score: 1706 bits (4419), Expect = 0.0
Identity = 878/965 (90.98%), Positives = 916/965 (94.92%), Query Frame = 0

Query: 1   MNAPTSIPLFRQNHTAVTVSFQQFVQTVNDANHPSGAQKRVRVVKSKKNVKKPNILEVSS 60
           M APTSI LFRQNHTAVTV+F QFVQT+N  N PSGAQ+R+RVVKSKKNVKKPN+LEVSS
Sbjct: 1   MTAPTSIHLFRQNHTAVTVAFHQFVQTINGVNQPSGAQRRIRVVKSKKNVKKPNVLEVSS 60

Query: 61  PSTANRSAGARISISTSGSVGSETKARPKRSPLGEQEGKKKSDRAVNLHGIYQNGDPLGR 120
           PSTA      +IS+STSGS+ SETKARPKR  L E   KKK DR VN+ GIYQNGDPLGR
Sbjct: 61  PSTA-----PKISVSTSGSLASETKARPKRRELEE---KKKKDREVNVQGIYQNGDPLGR 120

Query: 121 RELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGPGLTFVIQAQPYLNAVPMPLG 180
           RELGKSVV+WIG AM+AMASDFA+A+V GDF EL+Q+MG GLTFVIQAQPYLNAVPMPLG
Sbjct: 121 RELGKSVVRWIGLAMRAMASDFAAAEVQGDFPELQQRMGQGLTFVIQAQPYLNAVPMPLG 180

Query: 181 LEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWRETQSWKLLKELANSAQHKAI 240
           LEAVCLKASTHYPTLFDHFQRELRD LQDLQ +SL LDWRETQSWKLLK+LA+S QHKAI
Sbjct: 181 LEAVCLKASTHYPTLFDHFQRELRDVLQDLQRQSLFLDWRETQSWKLLKKLAHSVQHKAI 240

Query: 241 ARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTP 300
           ARKIS+PK VQGALGMDL+KAKA+Q+RIDEF NRMSELLRIERDSELEFTQEELNAVPTP
Sbjct: 241 ARKISEPKVVQGALGMDLKKAKAIQNRIDEFANRMSELLRIERDSELEFTQEELNAVPTP 300

Query: 301 DEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTL 360
           DE SDNSKPIEFLVSHGQAQQELCDTICNLNAVST TGLGGMHLVLFRVEG+HRLPPTTL
Sbjct: 301 DESSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGSHRLPPTTL 360

Query: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRID 420
           SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGK+VRID
Sbjct: 361 SPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVALESRHGDPTFSKLFGKTVRID 420

Query: 421 RIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHT 480
           RIPGLADTLTYERNCEALMLLQKNGL KKNPS AVVATLFGD+ED+KWMEDNNLI LA T
Sbjct: 421 RIPGLADTLTYERNCEALMLLQKNGLHKKNPSIAVVATLFGDKEDIKWMEDNNLIGLADT 480

Query: 481 NLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540
           NL+ IV NGDFDDSQK AIS ALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV
Sbjct: 481 NLDGIVFNGDFDDSQKSAISRALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLV 540

Query: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKA 600
           TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVN++L+SFRTDIERKKA
Sbjct: 541 TAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSLAEIVNSELSSFRTDIERKKA 600

Query: 601 DLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660
           DLRKDLR CLKDDSLAAGIRQLLKQLGK+LKKKEKETVKEVLSNAQVVLATNTGAADPLI
Sbjct: 601 DLRKDLRQCLKDDSLAAGIRQLLKQLGKSLKKKEKETVKEVLSNAQVVLATNTGAADPLI 660

Query: 661 RTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLL 720
           R LEKFDLVVIDEAGQAIEPACWIPILQG RCILAGDQCQLAPVILSRKALEGGLGVSLL
Sbjct: 661 RKLEKFDLVVIDEAGQAIEPACWIPILQGRRCILAGDQCQLAPVILSRKALEGGLGVSLL 720

Query: 721 ERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWI 780
           ERA+TLH+GALT MLTIQYRMNDAIASWASKEMY G+L+SSPTVSSHLLVNSPFVKPTWI
Sbjct: 721 ERAATLHEGALTTMLTIQYRMNDAIASWASKEMYDGILESSPTVSSHLLVNSPFVKPTWI 780

Query: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840
           TQCPLLLLDTRMPYGSLSVGCEEHLD AGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA
Sbjct: 781 TQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIA 840

Query: 841 VQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900
           VQSPYVAQVQLLRNRLDEIPE+ GIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS
Sbjct: 841 VQSPYVAQVQLLRNRLDEIPESAGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 900

Query: 901 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGNFGGSGLGMNPM 960
           RRMNVAITRARKH+ALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPG+FGGSGLGMNPM
Sbjct: 901 RRMNVAITRARKHVALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGSFGGSGLGMNPM 957

Query: 961 LPSIN 965
           LPSIN
Sbjct: 961 LPSIN 957

BLAST of Cp4.1LG05g07010 vs. TAIR 10
Match: AT5G35970.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein )

HSP 1 Score: 1390.6 bits (3598), Expect = 0.0e+00
Identity = 685/862 (79.47%), Positives = 775/862 (89.91%), Query Frame = 0

Query: 101 KSDRAVNLHGIYQNGDPLGRRELGKSVVQWIGQAMQAMASDFASADVNGDFSELRQQMGP 160
           K+D+ ++L  + QNGDPLGRR+LG++VV+WI QAM+AMASDFA+A+V G+FSELRQ +G 
Sbjct: 97  KNDKELSLRALNQNGDPLGRRDLGRNVVKWISQAMKAMASDFATAEVQGEFSELRQNVGS 156

Query: 161 GLTFVIQAQPYLNAVPMPLGLEAVCLKASTHYPTLFDHFQRELRDALQDLQSKSLILDWR 220
           GLTFVIQAQPYLNA+PMPLG E +CLKA THYPTLFDHFQRELRD LQDL+ K+++  W+
Sbjct: 157 GLTFVIQAQPYLNAIPMPLGSEVICLKACTHYPTLFDHFQRELRDVLQDLERKNIMESWK 216

Query: 221 ETQSWKLLKELANSAQHKAIARKISQPKAVQGALGMDLEKAKALQSRIDEFVNRMSELLR 280
           E++SWKLLKE+ANSAQH+ +ARK +Q K VQG LGMD EK KA+Q RIDEF ++MS+LL+
Sbjct: 217 ESESWKLLKEIANSAQHREVARKAAQAKPVQGVLGMDSEKVKAIQERIDEFTSQMSQLLQ 276

Query: 281 IERDSELEFTQEELNAVPTPDEGSDNSKPIEFLVSHGQAQQELCDTICNLNAVSTFTGLG 340
           +ERD+ELE TQEEL+ VPTPDE SD+SKPIEFLV HG A QELCDTICNL AVST TGLG
Sbjct: 277 VERDTELEVTQEELDVVPTPDESSDSSKPIEFLVRHGDAPQELCDTICNLYAVSTSTGLG 336

Query: 341 GMHLVLFRVEGNHRLPPTTLSPGDMVCVRVCDSRGAGATSCMQGFVNNLGDDGCSITVAL 400
           GMHLVLF+V GNHRLPPTTLSPGDMVC+RVCDSRGAGAT+C QGFV+NLG+DGCSI VAL
Sbjct: 337 GMHLVLFKVGGNHRLPPTTLSPGDMVCIRVCDSRGAGATACTQGFVHNLGEDGCSIGVAL 396

Query: 401 ESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQKNGLRKKNPSSAVVATLF 460
           ESRHGDPTFSKLFGKSVRIDRI GLAD LTYERNCEALMLLQKNGL+KKNPS +VVATLF
Sbjct: 397 ESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALMLLQKNGLQKKNPSISVVATLF 456

Query: 461 GDEEDVKWMEDNNLIDLAHTNLNDIVLNGDFDDSQKGAISFALNKKRPILIIQGPPGTGK 520
           GD ED+ W+E N+ +D +   L+D  ++  FD SQ+ AI+  +NKKRP++I+QGPPGTGK
Sbjct: 457 GDGEDITWLEQNDYVDWSEAELSDEPVSKLFDSSQRRAIALGVNKKRPVMIVQGPPGTGK 516

Query: 521 TGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINIVRVGNPARISSSVASKSL 580
           TG+LKE+I LAVQQGERVLVTAPTNAAVDNMVEKL ++G+NIVRVGNPARISS+VASKSL
Sbjct: 517 TGMLKEVITLAVQQGERVLVTAPTNAAVDNMVEKLLHLGLNIVRVGNPARISSAVASKSL 576

Query: 581 AEIVNTKLASFRTDIERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKE 640
            EIVN+KLASFR ++ERKK+DLRKDLR CL+DD LAAGIRQLLKQLGKTLKKKEKETVKE
Sbjct: 577 GEIVNSKLASFRAELERKKSDLRKDLRQCLRDDVLAAGIRQLLKQLGKTLKKKEKETVKE 636

Query: 641 VLSNAQVVLATNTGAADPLIRTLEKFDLVVIDEAGQAIEPACWIPILQGHRCILAGDQCQ 700
           +LSNAQVV ATN GAADPLIR LE FDLVVIDEAGQ+IEP+CWIPILQG RCIL+GD CQ
Sbjct: 637 ILSNAQVVFATNIGAADPLIRRLETFDLVVIDEAGQSIEPSCWIPILQGKRCILSGDPCQ 696

Query: 701 LAPVILSRKALEGGLGVSLLERASTLHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKS 760
           LAPV+LSRKALEGGLGVSLLERA++LH G L   LT QYRMND IA WASKEMYGG LKS
Sbjct: 697 LAPVVLSRKALEGGLGVSLLERAASLHDGVLATKLTTQYRMNDVIAGWASKEMYGGWLKS 756

Query: 761 SPTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEAD 820
           +P+V+SHLL++SPFVK TWITQCPL+LLDTRMPYGSLSVGCEE LD AGTGSLYNEGEAD
Sbjct: 757 APSVASHLLIDSPFVKATWITQCPLVLLDTRMPYGSLSVGCEERLDPAGTGSLYNEGEAD 816

Query: 821 IVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRNRLDEIPEATGIEVATIDSFQGREADA 880
           IVV HV SLIY+GVSP AIAVQSPYVAQVQLLR RLD+ P A G+EVATIDSFQGREADA
Sbjct: 817 IVVNHVISLIYAGVSPMAIAVQSPYVAQVQLLRERLDDFPVADGVEVATIDSFQGREADA 876

Query: 881 VIISMVRSNNLGAVGFLGDSRRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIRYFG 940
           VIISMVRSNNLGAVGFLGDSRRMNVAITRARKH+A+VCDSSTIC NTFLARLLRHIRYFG
Sbjct: 877 VIISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTFLARLLRHIRYFG 936

Query: 941 RVKHAEPGNFGGSGLGMNPMLP 963
           RVKHA+PG+ GGSGLG++PMLP
Sbjct: 937 RVKHADPGSLGGSGLGLDPMLP 958

BLAST of Cp4.1LG05g07010 vs. TAIR 10
Match: AT2G03270.1 (DNA-binding protein, putative )

HSP 1 Score: 368.6 bits (945), Expect = 1.5e-101
Identity = 247/681 (36.27%), Positives = 374/681 (54.92%), Query Frame = 0

Query: 263 ALQSRIDEFVNRMSELLRIERDSELEFTQEELNAVPTPDEGSDNSKPIEFLVSHGQAQQE 322
           A +  ++ FV+ M+ L+ +E+++E+  +             S  S+ IE         Q+
Sbjct: 2   ARKMSLEAFVSTMAPLIDMEKEAEISMSLT-----------SGASRNIE-------TAQK 61

Query: 323 LCDTICNLNAVSTFTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRVCDSRGAGATSCM 382
              TI NL  V   TGL G  L+ F+      LP       D+V +++ +    G++   
Sbjct: 62  KGTTILNLKCVDVQTGLMGKSLIEFQSNKGDVLPAHKFGNHDVVVLKL-NKSDLGSSPLA 121

Query: 383 QGFVNNLGDDGCSITVALESRHGDPTFSKLFGKSVRIDRIPGLADTLTYERNCEALMLLQ 442
           QG V  L D   SITV       D    +    S+R+++   LA+ +TY R  + L+ L 
Sbjct: 122 QGVVYRLKDS--SITVVF-----DEVPEEGLNTSLRLEK---LANEVTYRRMKDTLIQLS 181

Query: 443 KNGLRKKNPSSAVVATLFGDEEDVKWMEDNNLIDLAHTNLNDIVLNGDFDDSQKGAISFA 502
           K  LR   P+S +V  LFG+ +     +D       + NL         D SQK AI+ A
Sbjct: 182 KGVLR--GPASDLVPVLFGERQPSVSKKDVKSFTPFNKNL---------DQSQKDAITKA 241

Query: 503 LNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGINI 562
           L+ K  + ++ GPPGTGKT  + E++   V++G ++L  A +N AVDN+VE+L    + +
Sbjct: 242 LSSK-DVFLLHGPPGTGKTTTVVEIVLQEVKRGSKILACAASNIAVDNIVERLVPHKVKL 301

Query: 563 VRVGNPARISSSVASKSL-AEIVNTKLASFRTDIERKKADLRKDLRHCLKDDSLAAGIRQ 622
           VRVG+PAR+   V   +L A+++    +    DI ++   L   L    KD +    I++
Sbjct: 302 VRVGHPARLLPQVLDSALDAQVLKGDNSGLANDIRKEMKALNGKLLKA-KDKNTRRLIQK 361

Query: 623 LLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLIRTLEK--FDLVVIDEAGQAIE 682
            L+ LGK  +K+++  V +V+ NA V+L T TGA   L R L+   FDLV+IDE  QA+E
Sbjct: 362 ELRTLGKEERKRQQLAVSDVIKNADVILTTLTGA---LTRKLDNRTFDLVIIDEGAQALE 421

Query: 683 PACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHQGALTKMLTIQY 742
            ACWI +L+G RCILAGD  QL P I S +A   GLG +L ER + L+   +  MLT+QY
Sbjct: 422 VACWIALLKGSRCILAGDHLQLPPTIQSAEAERKGLGRTLFERLADLYGDEIKSMLTVQY 481

Query: 743 RMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWITQCPLLLLDTRMPYGSLSV 802
           RM++ I +W+SKE+Y   + +  +V+SH+L +   V  +  T+  LLL+DT         
Sbjct: 482 RMHELIMNWSSKELYDNKITAHSSVASHMLFDLENVTKSSSTEATLLLVDT--------A 541

Query: 803 GCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQVQLLRNRLDEI 862
           GC+    +    S YNEGEA++ + H   L+ SGV P  I + +PY AQV LLR    + 
Sbjct: 542 GCDMEEKKDEEESTYNEGEAEVAMAHAKRLMESGVQPSDIGIITPYAAQVMLLRILRGKE 601

Query: 863 PEATGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHIALVCD 922
            +   +E++T+D FQGRE +A+IISMVRSN+   VGFL D RRMNVA+TR+R+   +VCD
Sbjct: 602 EKLKDMEISTVDGFQGREKEAIIISMVRSNSKKEVGFLKDQRRMNVAVTRSRRQCCIVCD 629

Query: 923 SSTICQNTFLARLLRHIRYFG 941
           + T+  + FL R++ +    G
Sbjct: 662 TETVSSDAFLKRMIEYFEEHG 629

BLAST of Cp4.1LG05g07010 vs. TAIR 10
Match: AT5G47010.1 (RNA helicase, putative )

HSP 1 Score: 205.7 bits (522), Expect = 1.7e-52
Identity = 158/457 (34.57%), Positives = 234/457 (51.20%), Query Frame = 0

Query: 490 DFDDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGE-RVLVTAPTNAAV 549
           + + SQ  A+   L K  PI +IQGPPGTGKT     ++    +QG+ +VLV AP+N AV
Sbjct: 488 ELNASQVNAVKSVLQK--PISLIQGPPGTGKTVTSAAIVYHMAKQGQGQVLVCAPSNVAV 547

Query: 550 DNMVEKLSNIGINIVRVGNPAR--ISSSVASKSLAEIVNTKLASFRTDIERKKADLRKDL 609
           D + EK+S  G+ +VR+   +R  +SS V   +L   V     S ++++ + +       
Sbjct: 548 DQLAEKISATGLKVVRLCAKSREAVSSPVEYLTLHYQVRHLDTSEKSELHKLQQ------ 607

Query: 610 RHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLIRTLEKF 669
              LKD+       +L     K  K  ++ T +E+  +A V+  T  GAAD  +    +F
Sbjct: 608 ---LKDEQ-----GELSSSDEKKYKNLKRATEREITQSADVICCTCVGAADLRLSNF-RF 667

Query: 670 DLVVIDEAGQAIEPACWIPILQG-HRCILAGDQCQLAPVILSRKALEGGLGVSLLERAST 729
             V+IDE+ QA EP C IP++ G  + +L GD CQL PVI+ +KA   GL  SL ER  T
Sbjct: 668 RQVLIDESTQATEPECLIPLVLGVKQVVLVGDHCQLGPVIMCKKAARAGLAQSLFERLVT 727

Query: 730 LHQGALTKMLTIQYRMNDAIASWASKEMYGGMLKSSPTVSSHLLVNSPFVKPTWITQCPL 789
           L  G     L +QYRM+ A++ + S   Y G L++  T+         F  P        
Sbjct: 728 L--GIKPIRLQVQYRMHPALSEFPSNSFYEGTLQNGVTIIERQTTGIDFPWP-------- 787

Query: 790 LLLDTRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPY 849
             +  R  +  + +G +E +  +GT S  N  EA  V + V + + SGV P  I V +PY
Sbjct: 788 --VPNRPMFFYVQLG-QEEISASGT-SYLNRTEAANVEKLVTAFLKSGVVPSQIGVITPY 847

Query: 850 VAQVQLLRNRLDEIPEA-----TGIEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDS 909
             Q   + N +             IEVA++DSFQGRE D +I+S VRSN    +GFL D 
Sbjct: 848 EGQRAYIVNYMARNGSLRQQLYKEIEVASVDSFQGREKDYIILSCVRSNEHQGIGFLNDP 907

Query: 910 RRMNVAITRARKHIALVCDSSTICQNTFLARLLRHIR 938
           RR+NVA+TRAR  I ++ +   + +      LL H +
Sbjct: 908 RRLNVALTRARYGIVILGNPKVLSKQPLWNGLLTHYK 913

BLAST of Cp4.1LG05g07010 vs. TAIR 10
Match: AT1G08840.1 (DNA replication helicase, putative )

HSP 1 Score: 161.0 bits (406), Expect = 4.8e-39
Identity = 125/463 (27.00%), Positives = 212/463 (45.79%), Query Frame = 0

Query: 492  DDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNM 551
            ++ Q+ AI   L  K   LI+ G PGTGKT  +   +   + +G  +L+ + TN+AVDN+
Sbjct: 889  NNDQRQAILKILTAKDYALIL-GMPGTGKTSTMVHAVKALLIRGSSILLASYTNSAVDNL 948

Query: 552  VEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKADLRKDLRHCLK 611
            + KL   GI  +R+G    +   V     +                        +  C  
Sbjct: 949  LIKLKAQGIEFLRIGRDEAVHEEVRESCFSA-----------------------MNMCSV 1008

Query: 612  DDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLIRTLEKFDLVVI 671
            +D                        +K+ L   +VV +T  G   PL+    +FD+ +I
Sbjct: 1009 ED------------------------IKKKLDQVKVVASTCLGINSPLL-VNRRFDVCII 1068

Query: 672  DEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHQGAL 731
            DEAGQ   P    P+L     +L GD  QL P++ S +A E G+G+SL  R S  H  A+
Sbjct: 1069 DEAGQIALPVSIGPLLFASTFVLVGDHYQLPPLVQSTEARENGMGISLFRRLSEAHPQAI 1128

Query: 732  TKMLTIQYRMNDAIASWASKEMYGGML--KSSPTVSSHLLVNSPFVKPTWITQCPLLLLD 791
            + +L  QYRM   I   ++  +YG  L   S+    + L++++      W+ +    +L+
Sbjct: 1129 S-VLQNQYRMCRGIMELSNALIYGDRLCCGSAEVADATLVLSTSSSTSPWLKK----VLE 1188

Query: 792  TRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQV 851
                   ++       +     ++ N  EA I+ + V  L+ +GV  + I + +PY +Q 
Sbjct: 1189 PTRTVVFVNTDMLRAFEARDQNAINNPVEASIIAEIVEELVNNGVDSKDIGIITPYNSQA 1248

Query: 852  QLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSN---NLGAVGFLGDSRRMNVA 911
             L+++ +   P    +E+ TID +QGR+ D +++S VRS       A   LGD  R+NVA
Sbjct: 1249 SLIQHAIPTTP----VEIHTIDKYQGRDKDCILVSFVRSREKPRSSASSLLGDWHRINVA 1293

Query: 912  ITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGN 950
            +TRA+K + +V    T+ +   L  LL  ++    + +  PG+
Sbjct: 1309 LTRAKKKLIMVGSQRTLSRVPLLMLLLNKVKEQSGILNLLPGD 1293

BLAST of Cp4.1LG05g07010 vs. TAIR 10
Match: AT1G08840.2 (DNA replication helicase, putative )

HSP 1 Score: 161.0 bits (406), Expect = 4.8e-39
Identity = 125/463 (27.00%), Positives = 212/463 (45.79%), Query Frame = 0

Query: 492  DDSQKGAISFALNKKRPILIIQGPPGTGKTGLLKELIALAVQQGERVLVTAPTNAAVDNM 551
            ++ Q+ AI   L  K   LI+ G PGTGKT  +   +   + +G  +L+ + TN+AVDN+
Sbjct: 908  NNDQRQAILKILTAKDYALIL-GMPGTGKTSTMVHAVKALLIRGSSILLASYTNSAVDNL 967

Query: 552  VEKLSNIGINIVRVGNPARISSSVASKSLAEIVNTKLASFRTDIERKKADLRKDLRHCLK 611
            + KL   GI  +R+G    +   V     +                        +  C  
Sbjct: 968  LIKLKAQGIEFLRIGRDEAVHEEVRESCFSA-----------------------MNMCSV 1027

Query: 612  DDSLAAGIRQLLKQLGKTLKKKEKETVKEVLSNAQVVLATNTGAADPLIRTLEKFDLVVI 671
            +D                        +K+ L   +VV +T  G   PL+    +FD+ +I
Sbjct: 1028 ED------------------------IKKKLDQVKVVASTCLGINSPLL-VNRRFDVCII 1087

Query: 672  DEAGQAIEPACWIPILQGHRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHQGAL 731
            DEAGQ   P    P+L     +L GD  QL P++ S +A E G+G+SL  R S  H  A+
Sbjct: 1088 DEAGQIALPVSIGPLLFASTFVLVGDHYQLPPLVQSTEARENGMGISLFRRLSEAHPQAI 1147

Query: 732  TKMLTIQYRMNDAIASWASKEMYGGML--KSSPTVSSHLLVNSPFVKPTWITQCPLLLLD 791
            + +L  QYRM   I   ++  +YG  L   S+    + L++++      W+ +    +L+
Sbjct: 1148 S-VLQNQYRMCRGIMELSNALIYGDRLCCGSAEVADATLVLSTSSSTSPWLKK----VLE 1207

Query: 792  TRMPYGSLSVGCEEHLDRAGTGSLYNEGEADIVVQHVCSLIYSGVSPRAIAVQSPYVAQV 851
                   ++       +     ++ N  EA I+ + V  L+ +GV  + I + +PY +Q 
Sbjct: 1208 PTRTVVFVNTDMLRAFEARDQNAINNPVEASIIAEIVEELVNNGVDSKDIGIITPYNSQA 1267

Query: 852  QLLRNRLDEIPEATGIEVATIDSFQGREADAVIISMVRSN---NLGAVGFLGDSRRMNVA 911
             L+++ +   P    +E+ TID +QGR+ D +++S VRS       A   LGD  R+NVA
Sbjct: 1268 SLIQHAIPTTP----VEIHTIDKYQGRDKDCILVSFVRSREKPRSSASSLLGDWHRINVA 1312

Query: 912  ITRARKHIALVCDSSTICQNTFLARLLRHIRYFGRVKHAEPGN 950
            +TRA+K + +V    T+ +   L  LL  ++    + +  PG+
Sbjct: 1328 LTRAKKKLIMVGSQRTLSRVPLLMLLLNKVKEQSGILNLLPGD 1312

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
P38935 | 7.6e-98 | 37.63 | DNA-binding protein SMUBP-2 OS=Homo sapiens OX=9606 GN=IGHMBP2 PE=1 SV=3
Q60560 | 7.6e-98 | 37.18 | DNA-binding protein SMUBP-2 OS=Mesocricetus auratus OX=10036 GN=IGHMBP2 PE=1 SV=...
P40694 | 6.4e-97 | 37.03 | DNA-binding protein SMUBP-2 OS=Mus musculus OX=10090 GN=Ighmbp2 PE=1 SV=1
Q9EQN5 | 3.2e-96 | 36.85 | DNA-binding protein SMUBP-2 OS=Rattus norvegicus OX=10116 GN=Ighmbp2 PE=1 SV=1
O94247 | 3.2e-72 | 32.32 | DNA polymerase alpha-associated DNA helicase A OS=Schizosaccharomyces pombe (str...

Match Name | E-value | Identity | Description
XP_023533963.1 | 0.0 | 100.00 | DNA-binding protein SMUBP-2-like [Cucurbita pepo subsp. pepo]
XP_022958504.1 | 0.0 | 99.59 | DNA-binding protein SMUBP-2-like [Cucurbita moschata]
XP_022995943.1 | 0.0 | 99.38 | DNA-binding protein SMUBP-2-like [Cucurbita maxima]
KAG7035600.1 | 0.0 | 98.55 | DNA-binding protein SMUBP-2, partial [Cucurbita argyrosperma subsp. argyrosperma...
KAG6605692.1 | 0.0 | 99.58 | DNA-binding protein SMUBP-2, partial [Cucurbita argyrosperma subsp. sororia]

Match Name | E-value | Identity | Description
A0A6J1H5A4 | 0.0 | 99.59 | DNA-binding protein SMUBP-2-like OS=Cucurbita moschata OX=3662 GN=LOC111459712 P...
A0A6J1K9F5 | 0.0 | 99.38 | DNA-binding protein SMUBP-2-like OS=Cucurbita maxima OX=3661 GN=LOC111491308 PE=...
A0A5A7UKQ5 | 0.0 | 91.61 | DNA-binding protein SMUBP-2 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A1S3CT28 | 0.0 | 91.61 | DNA-binding protein SMUBP-2 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103504640 P...
A0A0A0KL45 | 0.0 | 90.98 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G172850 PE=3 SV=1

Match Name | E-value | Identity | Description
AT5G35970.1 | 0.0e+00 | 79.47 | P-loop containing nucleoside triphosphate hydrolases superfamily protein
AT2G03270.1 | 1.5e-101 | 36.27 | DNA-binding protein, putative
AT5G47010.1 | 1.7e-52 | 34.57 | RNA helicase, putative
AT1G08840.1 | 4.8e-39 | 27.00 | DNA replication helicase, putative
AT1G08840.2 | 4.8e-39 | 27.00 | DNA replication helicase, putative

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 616..636
None | No IPR available | COILS | Coil | Coil | coord: 261..281
None | No IPR available | GENE3D | 2.40.30.270 | | coord: 294..425; e-value: 1.3E-110; score: 372.2
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 58..84
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 58..106
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 88..106
None | No IPR available | PANTHER | PTHR43788:SF3 | P-LOOP CONTAINING NUCLEOSIDE TRIPHOSPHATE HYDROLASES SUPERFAMILY PROTEIN | coord: 43..965
None | No IPR available | PANTHER | PTHR43788 | DNA2/NAM7 HELICASE FAMILY MEMBER | coord: 43..965
None | No IPR available | CDD | cd18044 | DEXXQc_SMUBP2 | coord: 491..740; e-value: 2.64392E-90; score: 283.732
IPR014001 | Helicase superfamily 1/2, ATP-binding domain | SMART | SM00487 | ultradead3 | coord: 487..729; e-value: 0.0025; score: 23.5
IPR003593 | AAA+ ATPase domain | SMART | SM00382 | AAA_5 | coord: 506..721; e-value: 0.002; score: 27.4
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | GENE3D | 3.40.50.300 | | coord: 739..947; e-value: 1.5E-55; score: 189.8
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | GENE3D | 3.40.50.300 | | coord: 274..723; e-value: 1.3E-110; score: 372.2
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | SUPERFAMILY | 52540 | P-loop containing nucleoside triphosphate hydrolases | coord: 490..927
IPR041679 | DNA2/NAM7 helicase-like, C-terminal | PFAM | PF13087 | AAA_12 | coord: 716..920; e-value: 1.6E-49; score: 168.2
IPR041679 | DNA2/NAM7 helicase-like, C-terminal | CDD | cd18808 | SF1_C_Upf1 | coord: 741..937; e-value: 3.68079E-59; score: 198.612
IPR041677 | DNA2/NAM7 helicase, helicase domain | PFAM | PF13086 | AAA_11 | coord: 491..707; e-value: 1.1E-54; score: 186.0
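
The `coord:` values above give the start and end residue of each annotated region on the 965-residue protein. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page), the length of a region can be derived as shown below. The helper name `span_length` is hypothetical.

```python
import re

# Matches the "coord: START..END" fragment of an alignment entry
COORD_RE = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_length(alignment):
    """Return the number of residues covered by a 'coord: a..b' entry.

    Assumes 1-based, inclusive coordinates, hence the +1.
    """
    m = COORD_RE.search(alignment)
    if m is None:
        raise ValueError("no 'coord: a..b' fragment in %r" % alignment)
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


# PF13086 (AAA_11) and PF13087 (AAA_12) regions from the table above
print(span_length("coord: 491..707"))
print(span_length("coord: 716..920"))
```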

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG05g07010.1 | Cp4.1LG05g07010.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032508 DNA duplex unwinding
cellular_component GO:0005681 spliceosomal complex
molecular_function GO:0043139 5'-3' DNA helicase activity
molecular_function GO:0003677 DNA binding
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0003729 mRNA binding
molecular_function GO:0004386 helicase activity
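
For downstream use, the GO assignments listed above can be grouped by category. The following is a minimal sketch that assumes each row has the form `category accession term name`, separated by whitespace, as in the listing. The function name `group_go_terms` is illustrative.

```python
from collections import defaultdict

# GO rows copied from the listing above (header row omitted)
GO_ROWS = """\
biological_process GO:0032508 DNA duplex unwinding
cellular_component GO:0005681 spliceosomal complex
molecular_function GO:0043139 5'-3' DNA helicase activity
molecular_function GO:0003677 DNA binding
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0003729 mRNA binding
molecular_function GO:0004386 helicase activity
"""


def group_go_terms(text):
    """Group (accession, term name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.strip().splitlines():
        # Split into three fields only; the term name may contain spaces
        category, accession, name = line.split(maxsplit=2)
        grouped[category].append((accession, name))
    return dict(grouped)


go_terms = group_go_terms(GO_ROWS)
for category, terms in go_terms.items():
    print(category, len(terms))
```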