Spg032687 (gene) Sponge gourd (cylindrica) v1

Overview
Name: Spg032687
Type: gene
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: transcription factor MYB3R-2 isoform X3
Location: scaffold11: 5168112 .. 5190721 (+)
RNA-Seq Expression: Spg032687
Synteny: Spg032687
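
As an illustration of how the Location field can be read, the sketch below splits the string into scaffold, start, end and strand, and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and the names parse_location and span_length are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'scaffold11: 5168112 .. 5190721 (+)'.

    Returns (seqid, start, end, strand). The format is inferred from
    this record only; other records may be formatted differently.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints count.
    return end - start + 1

seqid, start, end, strand = parse_location("scaffold11: 5168112 .. 5190721 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span on scaffold11 is 22,610 bp on the forward strand.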
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGCACAGCTCGCCCTGGACAAGCATGCTCAAGACACCTAACATGTTGATTGTGTTGTTTGCTGCTTGTTGCTTATCATAACCAGGTTGTACTAACCCCTCCCCTTTCTCCCCCAACATTTTCAGATTGCAACCTTGATTCATCCATGTCTCGCCGCAACCATGATGATGGCGGTGGCGAGGAGCCTTCTGCCGGCGAGGAAGACGATGAAGCTGATGTGTCTGACGAGGACATCGAAGCCCTTCGACGAGCCTGTAGGCTTGTTGGAGTCAATCCTGAGGAGTATATTAATCTTCCGTCGTCGTCGAATGCTGGCAGAGATGACTCTTTCGACGGCATAAAACCTGGCTCTGATTCTGATGATATTGATGATCTCGAACTTGTTCGGAATATTCGGAAACGGTTCTCGACTGCGGCCGGTGAGCAGGACCTGACTTTATCTACGCAGCCGTTGAATACTCTCCCACCGGTGTCACCAGACGAGGAGGAAGACGAGTTTGAGACGCTCCGTCGGATTCGGCAGCGCTTTGCGGCGTATGAAAGTGGTAGGTCTTTCGTTATTGTAAAGATTGCTTTTATAAATGTCATGACTGTATTGGCTGTTTATGAATTATGATGACGTTGCTAGTCTTGAAGTCCTTGGGTTGTGATAATAATTACTCCTTCTGAGCTGCCCATCGTATAGGACATTTTCTAATGGCTTTGGATTGGGAAGAATTTTACATTCCTGCCCCAATCTGGGTTAAATTTATGTTACCTCACTGCGATTGTGATGATGATGTCGGTTGATGTTTCTTTAACTTCAAACTACTTCCTTCTGTTAGTATGGTTGATGTGGAGTATCTTTTGTGATTTTAGATCTTTAGATTCTTTAATAATTCTTGTTTTATCTTTATTTCTTGCAAGATCTTGTATATTTTGTCTTTAGTTTCCCCATGGTTACTATTATTTACTCTTTTGGCCGAATTGGTCAAATTAGTTGGAATTACTTGGGTTAGTCTTGTGTTTTTACCGGTTTGTGGCCTTTAAATAGCCGGCTTCTAGTTTGTAAATTGTAATTACATGTTTTGAATCACATCAATTGGTATCAGAGCAATCGGTTTTGGGCCTAATGGCTGGAAAACAAGTTGCTATGGGCAATGACGCAGATCACCCCACCCAAGATGCGGAAGCATCTATTCTCCCCCAAACCCTTAACACTAGACTACAAGCTGTGGAAGCTTCTATCGAAGAAATGAAGATAAGCATCCCCCAAAACCTTAACACTAGGCTACAAGCCATGGAAGCTTGTGCTGAAAAGATGAAGGTTAGCATGTTAGGAATGCAACAAATGATGGATGAAATCTCTCAGCAACTAGAGAGGATGAATACCAATCAACCAAGAATCCAAGAAGATGTTAATACTCCAAAACCACAAAGACAGAGGTTCCTATCCAATAACTAGGAAGACAAAAATCACCAAAGATACTCCACATCAATTGTAACTAATATTGAGTGCATGTTTAGTTGTTGAATCATGATTAACACTTGCATTGTTTTCGACGTGTGAATATCTTCTGCTAACTCATTGTTACCTAGATAAATAGTTTGTTGGAAGGGGGGACAAGGATGTTGGTAATTCTTCGGTAATATGTTTAGAACTATCCTACACTAGGTGTGCTCTTCATGAGCCTGGTGAGTTTTGTGGGCATGTTGATCAAGCTAGCATATCTTTGAAAGGTTTTAAATGCTACCATTTTAGAGTTGGCTTATGGATGATATATTCATATTCTTAATCTGTTACAATTTTTGACACCTAGTGCTATAATTTTTGGGGCTTCTATAGGTATTTTCTTGTTATATCATTTGCATTTATATATATGTTTTGCAACTCGTTTCACTTTAGCTCTAAACAGTTCGTGCTCTTTTAACTCTTTTAAAAAAAGTAGTCGATCATTATTGGAAGGATCCTTTGGAGAATGATTCTAGTGAGTTGTTATGAGGAGAGAATGAGATTGAGGA
GGAGGTCTTTTCCTTTTATTCTTCTTTTTATGCTCCACCAATTCTTCCTAGACCATTCAAGGAGGGTTTAGATTAGAGTCCTATGTCTTTCCTTGATAAGAGCATAAATGCTCCTTTCACTTTAGAGGAAATGAGGAAGGTGGTTTTCAGGTTTGATAGGAGTAAGGCTCCTAGCCTTGGTGGTTTTTTATGGCCTTTTCTAGTATAGTTGGGAGTGTATTTATTTTTTTAAAATCTTTTTAATGTGAAACATAATAAAATATATTCATCAAAATAAAAGAATGTAGCTTAAAGGCTTTAGGGATGAGAAGTCTTTAACCCTAATTTGGAAAGCTGACAATGAAAGACTTCTAATCTAAGTTAACGAGATAATGCTATGTATTAAAGAAATAGTTATGTATGGCACATCACGTATGTACATTTTCACAAAAAAAAAAAAAAAAAAACACATGAAGATGACTTATCTAAGATCAATTTGTTGTTCCTTTCTTTTCAAATTGATCAGAGGTGAGCTCTAGTAGTGTTGCTCTGCAAAATCCTTGCCACCTTCTTAAACCACCAACTAGACAACAACGCGAAAAGACCATCTTCCACCTTTTTTGAAAGGCGGCCAAATCAAAAAGATTCATGAGGAACATCCACACCTCATAGGTAAATTAGTTGCGTGGGAAAATATGGTGGTTGTTTTCAGAACTTACATGGCAAAGGATGCAAATGTTTGCAAAAGATAGTTGGGAGTGTAGTAAAGGAGATCTTTGAAAGGTTTTTTGTGAGTTCTATGAGTGAGAGACGTCTTAAAGATCTCTTTAAATCTGATAAAATGGGAAGTTGTTTCTAATCCTGAGGAGTTGGGGGGGGGGGGGGTTAGGTATTGGCAACTTGAGGCTTCGCAACGAGGTCCTTTCGGGGAAATGGGTGTGGCGCTTCTTTAATGAGCCTGACATCCTGTGGTGGAAGGTCATTGCGAGTTTGCATGGTCTGCACCCTTTCGAGTGGATCTCTAGTGGTAGGCTGAAAGGATCTAGTAAAAACCTTTGGTCTGCTATTGCTTTGGGATTCCCTCGCTTCTTTCAGTTTGTCCATTGTTTTATTGGGGATGGTTCCAATACATATTTCTGGGAGGATTCTTGGGTTGGGGATAAACCATTGAGCGTTCTATTCCCTCGTCTTTATCACTTATCTGAGAACAGGTTGCATTCGATAGCCTCGGTCTTCCCTGATTCCGACTCCTCCTCTCCTGCCCTTGGCTTCTGTCGTTCCCTTACTGATAGAGAGGTGGCAGAGGTCATCAGTCTCCTTTCCCTTTTCGCTGTCCAATCATTTCATCGTGGGGGAAGGGACTCTCGGTTCTGGACCCCGAAGCCTTCTGAGGGCTTCTCTTGTGCTTCTTTTTTCCGTTCTCTTTCTATGTCTCCCCCTCCTCCAGGAGCCTCCATTTTCTCCTCGATTTGGAAGGTCAAAATTCCCAAGAAAGTGAACTTCTTTGCATGGGAAGTTTTACATGGTCGGGTGAATACTCAGGATCGGGTCCAAAGGTTCTCCTCCTTCGTGTTGCGGCCACAGTGGTGTGTCCTCTGTAGGAGTCAGGAGGAGGACCTTGATCATTTGCTTTGGGATTGTCCTTTCGTTTGCTCTATTTGGAGTCAGTTCTTCAGGGTGTTTGGGGTTGCTTCGGCTCGTAACATTGACTGCTTCTCTATGTTTTTTTTTTGATAAAGAACCAACCTTTCATTGAGAAAAAATGAAAGAATACAAGGGCATACAAAAAAACCAAGCCCCAAAAGGAGAACCCCTTAAAGGAAGGGATGCCAATCTAACAAAATAAGACCTATAGAATAATTACAAAACAAACGCGTCACTGACGCCCATAACGAGACGGAGTATTTAATAAAGTCCCAAGTATCCTCCGCATTCCTCTCAATTCCTCTAAAAATTCTATTATTTCTCTCCCCCCACAAACCCCAAATGATAGCGCAAATCCCAGCTTGCCATAAGAATTT
CCCCTAGTCCCGAAACGGAGGATGGGGAAGGAACTCCTCGATCATCTCCCTAACATCCCTGTGTCTAGCAAACTGAAGCCCAAAAATCTCGAAAAAAGCGTTCCAAACTGAACGTGCAAAGTCACAGTGCCACAACATATGATCGAGGTCTTCCTCCGCCATCCTACAGAGAATACAACAGAATGGGCCTATCAAATTAGGCATCTTTCTCGTAAGCCGATCAAGAGTGTTTGCTCTTCCATGGAGGACCTGCCAAGTAAAAAACTGCACCTTCGTTGGAATTTTCACTTTCCAAACCAGAGAATAAACTGATTTCTCAGGAAGAGAAGAAGGGTCAAGAAGACAATGAAAGAACGAGCGGCAAGAGAAACCCTTAGAGGGGTTAGGGTTCCACGAACGAATATCCCTCCTATCCGGTCTAAAGGTGATCTCTTCAATCAAAGAAAGAAGAGATATGACATCATAAGTTTCCCTCTCGGATAAAGAGCGAGAAAAACCAAAAGAAAAAGAAATGGAGCTTCCCGAAGGAATCAACACCTTGGCCACAAAGCGGTTCTTCATCGAAGATAAATGATAAAGGCGAGGGTAAGCAACACAAAGGGGTCTATTTCCCACCCAACGGTCTTCCCAAAAATAGGTTTCCTCCCCATTCCCCACAAAATGATGAATGAATAAAGAGAGAGAGGGAAGCTCAACAAAAATTTCTTTCCAAGGATTCCTGGAAGTGCCTTTAAACCCACCCGACACCCACTCAAAAGGATGACGACCGTATTTGCTTACAATTATCCTATGCCATAGGGTGTTGGACTCAGAGGAGAAGCGCCATAACCATTTAGCCAACAGAGCTTTGTTGCGAATTCTCAAGTTACCCACCTCTAGACCTCCTCTACTCAAAGGCTTTCCCATAGTTTCCCAACTAACCAAGTGAGCACTCCGCCCTTCATCCACACCTTCCCACAGGAAGTCTCTCATAAGCTTCTCAATAGCTTTGCAAACAGAGGCAGGAGCCCTAAACAGGGAGAAAAAATAAACAGGCATGCCACTCAGAACAGATTTTATAAGGGTAAGCCTCCCCCCTTAGAAAAGAAGCTCCTTTTCCAGCTTGCAAGTCTCTTTCTAACCTTGCCCACCACTGGCTCCCAAAATAAAGAAGTCTTTGGATTGTGGCCCAACGGGAGTCCTAAGTACGAGGAGGGGAAGGAGCCCACCTCGCAACCAACCAACGAAGCCCATCTATTAACTTTGCTCTCATCACTATTAATACCCAGAAGCTGACATTTCCCCCTGTTAATTCTAAGGCCCGACATGGATTCAAAAAAGGCTAGCATATGATTCAAGACAGAAAATGATTCCTCCTTCCCAGAGCAGAAGAAAATCGTGTCATCAGCAAACTGAAGGTGAGATAGAGGAATCCTGTTTCGACCCACCTCGAATCCTTCAATAATTTTACCCTCAACCCCTTTTGAGACCAACCTACTGAGCGTGTCAACAACCAAAAGAAACAGGAACGGAGATAGGGGGTCTCCCTGGCGAAGGCCACGGGTAGCTCGAATTCTCCCACTGGGTCTACCATTAATGAGGATGGAATAGGTCACTGATCTTGTACAACTCCAAATCCACGATCTCCATTTGTACCCAAAACCTTTTTTCTCCAAAATCTTGTCCAAGAAGTTCCAGTCAACGTGATCATAAGCTTTCTCAAAATCAATCTTAAAGATAACTCCTTCCCGACCACTTCGTCTATACTCTTCAATAGCCTCATTAGCAATGAGGGCTTGATCCAAGATTTGCCTCCCTTCGATAAAAGCACCTGGGCCTCCGAGATCGTAGAGGGGAGGATCTTTTTCAACCTATTAGCCAAAACTTCGATGATTTTATAAACACTTGTGATCAAACTGATAGGTTTGAAGTCTTTAACCTTGTTCGCTCTCTCTTTCTTAGGAATAAGGCAGACGAAAGTTTCATTTAGCGAACTATCAAGGATGCCCCTTTCGTAGAAT
TCCTTGAAGACCCTCTCGAAGTCCACCTTGATCTCTTTCCAATTATCCTGAAAAAAGGCCATTGAGAAACCATCAGGACCCGGGGCCTTTTCTCTATCACACGGGAGTCCACACATTTTTGTCTCTAACTGAGATGGGGCTCCACTCCACCCCTTCCACAAAAGGCTTGGGGCTTACTTGGGAGGCATAGATACCGGAGAAAAAAGAAAGAATTTCGTCCTCAATTTCTTTCTCACCTTGAATCAGAATCCCTCTATTGTTCTCCAAGGAACCAATAAAGTTCTTGCTCCTTCTCCCACTCGCCACCCTATGGAAAAAAGCTGAGTTACAGTCCCCTTCTTTAGCCCATCGCACTTTGGACTTTTGTCTCCAACTTATCGTCTCTTTTTCCACTAGCTTAGCAAAATCAATCTTAAGCGTGGTTCTTTCTCTTTTGAGGGATTCATGAATAAGACCTTCCTCTTCAAGATTGTCGATGTGACTGATTCTTCTCAACACTTCCTGCTTCTTGACCCTGATATCCCCGAAAACCTCCTTATTCCATCTTTTCAGAGAACTTTTAAGGGCTCTGAGCTTTCCCATAAGCCTATAGCCTTCCCACTTACCTCCCACCTCCACATTCCACCAACCTGGAAAAGATGCTTTAAATGAAGGGTGATCTAGCCACATGTTCTCAAAACGAAAGGGACAAGGGCCCTATCTTTGAGATCCAGAGACAAAGATCAAGGGCCAGTGGTCCGAAACTACTCTAGGGCCTAAAGATTGTCGAATGTTAACAAAGGCCTCAAGCCAACCCTTTGAGCAAAGGGCTCTGTCAATTCTGCTCCGGGCTCTACTATTGGCCCACGTAAATCTCCCATTAGTCAGCGGAGGGTCATAAAGACTACAACCCTCGATCAGATCCTTGAAAATTTTCATGGATTTCGTCGTTCTCCCACCTGACGCCTTCTCAATAGGAAAGCGAGTGACATTAAAGTCACCCACCACACACCAACAATGGCCACACAAACCGAACAAATCACCTAATTTGTTCCAAAACGCCTGCCTGCCTTTGGGCCTCGAAGGACCATATGTACCCGTGACCCAGCCAACTTCCTTATTCTGCCACTTGATTCTAATAGACACAGAAAAAACCCCTTTCAAAACCTCCAAACACTCCACTGAGCTACTATCCCATAAAACCAAAACTCCTCCTGAGGCCCCGATGGACTCAATACCCGCCCACTCCACTTTTCTTTTACCCCAAACACTCTTAACAAACTCCCGATTATATCTATAGCGCTTAGTCTCTAACAAAACCACAATCTCTGGATCCTCATGGGATAAGAAGTCTTTCACCGTCCTACGCTTATCCCTACCCCCAAGACCCCTCACATTCCAAGAGATGATCTTCATTGATTACCTGAGTCATCCCCAGCCTCCTTCAAGGCTCTCCTACGATCATAGTTCACAGACGATTCCAGTTTTTTCAATTCCCTATCCACCCTTTTGGAGTCTCCTTTTTTGCAAGATTTCTTCCCTTTTTTCTCATTTTCCTCCTTTTTCTCATTAACTGATGGGGCCTTAGGGGTAGATTCTGCCGGAATCTCATGGATACCTATGGCATTCAATGTTGCCCACCCTGGTTGCCAAAAACCATTAGTCTCTCCGTTGGGTTGAAATCCTTGAATACCTAGAGAACTATAGATGGGTAAAGGGACTCGACCAACTAACCCATTTGGTAAAGGATGACCTTCCCCCCCAGTTCCACAACCCGAGACATAGGATAAAGGAGCAAAAGACGAGACATTCTCTCCAAAAGAGTAAACAGACGGGTGGGACAAGGCACTTCCCGAAGGATGAAAAAAATTGAAAATAACGGGAGATAGGAAGTTGTGATCAAGACTCAAAGACCCTCCCCCCTCCACTGGCAAAGATAAAGACATTGGATAGGAATAAGCCATCCCCCCCCCCCCCCCCGGATAAAATAACCCTCCCTTTTCAAACGGAACCGAGTTGGA
AAGTGAAAAGTCCAAAGGTTGTGAGCGTACCTGGTTGGTGTTCCCTTTCGAGTTAACCTTGATTAGCGATTGAGAATGAAGAATTGGACTCGGCTCCTTCTGCTGAATATCTGTCGGCACGAAAGAAGGCCCCACAGCTTTGATAGAATACTTAGGGGGGGGAGTAAGATCTTCCGACGGCTTCTCTCCCTCTTCAATAAAAACATTTCCGAGATTCAGAATCTCCAACTCCTTGGTAACCGGCGTAGACAAGAAAGATTCATCTTCCGAATAGCTCACTACCGATGGGTCAGACCCAATTTCATCCTTGGCAAGAAGAAAATCAGTAGCATTGATCTCCGTATTCCATTCCAAGTCCTGATCTTTTTCATTATTCTCCCCATTCTGTATAACATTCTGAGTAAAATCTGGTTGATAATGAACAAGCCCAACCGGCCCACAATCTCTAACAAAGTCCTCGCACACTTCTTTGCCTTTCACGCTGTCTTTTTTTAAGGAGGATGCTGCTGTAGATTTAAAATCATTAAGCCCACAAGGCCCATCAGAAAGAGGCCCACTATCTACTCCAGTCACCAACCCAACTTCCTTATTAATATCCACGCTATCTTCTTTGCCTTTGCCTTTGTCTCTCACGCTGGACTTTTTTGAAGGAATTGATGCTAGAGGTCTTTTATCAACAAGCCCAGATGGCCCATCAGAAGTAGGCCCAATACCTAGATTAAAAACATCACCACCGTTTGGCCCCTCCTTATACTCCCCAGGATTTATCTTCACGTGATTCCTCGAGGGATGAACCGCCTTCCCTGCTACCAAAGCTGATTTCGGCAGATGAAGTACAACTTTTTCCGATTGTCTTGAATAACCGCTGGCTCGCCGGATTCTTTGACGAATCTCCCCCTACCGACAATAATTCTCACCGGAAAGAAGATGTTATCGTGATTAAGAAAGCAATTAGTCGGAATAAAACCTTCGTCATTACCTTTGGCTTTGAAGCACACTTCGTCCGTCCAACACGTCTCGCCTAAGATGTCTTCTTCTTCCGCTATACCTCCGCACAAATTCGCTATCGACTGAATCATTTGCTTCGTCCTGAGGAATGGAGGAACGTCAAAAGCAGTGACCCAACCACCGTTGAACAGAACATTTCTTCTCTGCCATCATTTTGATGTCCCATGGGTGGATAATAAAATCCTTAGAGACCATTGAATCCTTTGAGTGCGACACAGAGCTTATGTTAACAAGCTCCTCCACTTCCTTCAAGCTTCCACATAAACCCACCGCCAAGTTATCGCAAAACGGTTTCAGACTAAAGCCTATCTTGGCCATTCTCAGACACACCTCTTTTATTCCCATCCAATCCGATTTGAAGCTCCTTCGTTCTACAATAACTGCTGAAGAACATTCGAACGCAAAGTCGATATCCTCCGAAGACAAAACGTTCTCCTCCTTCATTTGTTTTGGCCCATCTCGCACAGCCTCCATATACGATCTACAAGTACTAGATTCTTGAGCTTCTACCCCTGCTTTCTTCTTTACCGTCTGCTCTAAAACCACCGGAGGGAGAATAGTTGTCATAGCCGTGGCCAGCAATCCCCATCCTTTCACTTTCGAGCCTTCTGGGATGAATATCACAGGCCTTCTACCCCTTTTTGAATCGATAGAAAGCAACCCAAATCGCCCCTTAGCATTCGCAAGGACTTGAAAGAAAATAATCCCTATATCTAATCGTCTTCTCCTCCAAAACCTAAAAGGATTCGACGAAGATGAAACAGCAACCAAGCAATCTCTAACCCACGCTACAGTCCCCAAATCCAGATCTATGCGACTCACTCTCGCCCCCCTCCGTTCGTCTAGAACTACCTTCTTTCCTATCTCGATTTTGCGCAGATGAAAGACATTACGGTCGATGTTCACCGTCCTCGACCGCTCCATAGCCAGACGGTTACATTTTCAGGCGAGAGGGAGAGACGAGAGAGAATAATAGACTTTTTAGAGGGCTG
GAGAAATCCACGAAAAAAAAAAGTCAATTTTTAGAGGGCTGGAGAAATCCATGAATGATAGCAGGGCGAGACGAGAGAGAATAATAGACTTTTTTGTCATTACCCGCTCGGTCTTATTATTTTGGACTGGGATCGGTTCCTATAGTCTAGGTGGGTGTTCTTGGCGAGCTTGTTTTTTGTATGCCCTTTTGTATTCTTTCATTTTTTCTCAATGAAAGCATGGTTTCCTATAAAAAAAAAAGAAGAGCTCTTTGAATGAAACATTTGTGTTCTTCATCCCTAAGAAGGATGGAGCTTGGAGGGTTAGTGATTTTAGACCTATAAGTTTCGTGATTAGTTGCTACTGTTTTATGGAGGGGCATTAGATCTTGGATGCGGTTTTAGGTGCTGCTGAAACAGTTAGGGACTGTAGGTCTAGCGGGAAAAGTGGTATCCTCCTTAAGTTAGATTATGAAAAGGCCTATGACAAAGTTGACTAGGACTATTTGGAGTTTATTTTGAAGAAGGGTTTTGGTAACAGGTGGAGGGCTTGGATTAGAGGCTGTCTCTCGTCAAAACATTTCTCCGTTTTCCTAAATGGGAGACCTCGGGGGAAGATTTTGGCTAAAAGAGGTCTTCGTCAAGGGGATCCCTTGTCTCATTTTCTCTTTATGTTGATGGGTGATGCTTTCAGCAGACTGGTTCAATTTTGCTGTGAAAGAAGGATTTTACAGGGGATCTAAGTTGGTAAGGGAGGGGTTTCTATTACCCATCTCCAATATGCAGACAATACTATCATCTTCGGTCCAGCTGACAAAAAATTTCTCAAAAGATGGTGGATGATTATCTCCCTTTTTCTTCAATGCTCTGGGTATCTTTGAACATTTTAAAAACTTCTATTATAGGCATAAATGTGCTCGATAGTTGGCTGCCAGGTTTTGCTTCAAAGATTGGGTGTAGATTGGAACCTCTACCTTTTTTGTATTTAGGCTTCCCCTTCGGAGGTAATCACAAAGCCATATCTTTTTGGGAGCCAATGGTGGACAAGTTTAAACAAAAGCTTGACAAATGGAGATCTATAAACATATCGAAAGGTGGCCTTTTGATCTTAGCCCAATTGGTGCTCAATAGTATCCCGATGTATTTTTTCTCCTTATTAAAGGCCCCTCTAGGGGTCATTAAACATTTGGAGAAGCTTGTTCGAGACTTCTTTTGGGGCAATGTGGAGAGGCACGGGTGTAGCCATTTAGTAAATTGGGGCAAGACTTCCCTCCCTTTGAAGTATGGGGGTCTCTGTATTGGTGCTTTCAAGCAGAGAAATAATGTGTTGCTTCTGAAATGGTTGTGACGTTTTAGTGGTGAGGAAGAAGCCCTTTGGAGAAGGTCATTGTCAGAATTTACGGTATAGACTTTAGAGGGTGGTGCACCCTCCCCCCGAAAGGGAAAGCTAGAGGCAGACCGTGGTTTGATATTATGAAGAATGTTCGATGGTTTAGAAACTTCATAAAATTTAAAGTGGATCAGGGCAGCAGAATCAGTTTCTGGAAGAATGTGTGGATTGGAGACACCGATTTGGCTTCCACCTTTCCAGATTTGTATAATATCTCGATGAAACAAAATGCTACGATTGCTTAATGCTGGAACTCTGAGGCTAACGATTGGGATTTGGGTTTTAGAAGAGGCCTTTTTGATTGGGAGATTCACAGTTGGCTCGGACTCATTGAGAAGATTGATGCTATTCGGCTGGGGGTGGTAAAGATGAGATTGCCTGGACCCTAGATAAAACGGGGTCTTTCATGACCAACTCTACTTTCATGAAGCTATCAGAATCGACTTCCTCACTAGAAATCCCTCTAATTGAAGCTATATGGAAAAATGGTTCCCCTAAAAAGGTAAAGTTTTTCCTTTGGCCCCTAGCTCATAGAAGCCTTAACAACCATGACTTGCTCCAAAGGAAGTGTCAGAATTGGGCCCTTCTCCCCTTGGTGTGCCCCTTGTGTCATAAGGCTGAGGGGACTATC
GGTCATCTCTTCCTTCAATGCCCCTTTGCTGCCTCGGGGTGGAGATGGTTGATGGAGGATTTCAATTTAGCTATTTGTCTCCCGAACAGGATCGAGGACTGGATGGCAGAATGTCCTGCTGGGTTGGAAAGATGAAGAGCAGGGCCAAGATCGTGTGGAATATGACGATCAGAGCTTTGCTTTGGAACCTTTGGATCGAAAGGAATCATCGGATTTTTGAAGATAAGAATACTCATTTTACTTCTTTTTGTTCAAATACTAAATTCTTAGCCTCTTGGTGGTGTACAAAGCACAAGAAATTCTTTTGTAATTACAGCTTGTCTATGATCCGATCAAATTGGAATGTTCTCACTTAGCTTTTTGCTTCGGGAGGGGTTCTCTCTTCCCCCTGCCCTTCGGCTGTATTTTGTGGCATTGTGCATAATTACCTAAATAACTTTTTAGCTTTCTTTGAGGCCAGTTGAGGTTGAATTGTGATCTAGGTAAATTGAGTGGGTGGGTTGCTTTGATTGGGTGTGATGTTTGATCATTCCATTTTTCTTACTTAGGTCATAACCCTAGGTGCCGTTTTGACCCTATTGTAAGATTTCAAGAGATTGTCGTCGTGGGAGAGGACTCTTTTCTTGAAAGATGGCTTACTCTTATCCGGTCTGGATTGAGTAAGATCTCAATTTATTTCTTAAGCCCTTTTGAGGATCCCTGTGTCAGTGAGCAAAATCCTAGAGAAGCTTATGAGGGATTTTCTTTTGGAAGGGGTTGAGGAGGGAAGAGTAGGCTCACTTAGTCAAGTGGGAGGTGGTCACGAAGTCGGTGGACTTTGGGAGATTAAGTATTGGTAATTTGAGAGCCCATAAAAAAGGCCTTTTGGCTAAATGGCTTTGGTGATTTCATCATGAGTCCAATTCCTTGGTTGTTGTGAGCAAATTTGGGTTGTCCACTTTTGAATGGATCTCAAGTGGGGTCAAAGGCACTTCCAAAATCCCTTAGAAAGGCATCACTGAGGAGCTTCCTTTCTTCTCTCTCTTTGTTCGTTGTTTGGTTGGAGTTGAGATGAACATTTTTTTTTTGGAGAGAATAAGTGATTGGGAGATAGACCCCTATGCTTTTGGTTTCCTCATCTATTTCAATTATCTTCGATAAGAAATCATTTGGTGGTCAAACCATGTCCTCTCTTTCACCTTGTTTTGGGTTTGTTGTGCTTTGTTCGATAGGAAAATGTTAGATGTTGTTGACCTTTTTTCTGTGCTTGGGAGTTTTGCTGTATTTAAAGAGGGGAGGGATGTTCGTATTTGGATTCCTACTTCGTTTGAAGGGTTCTTGTGTAAATCCTTTTTTATTTGTCTATTTAGTCATTCTCCCTCGAGTGATTTGGTATTTGCCTTGTTGTGGAAGGTGAAAATCCACAAGAAGGTTAAGGTTTTTTCTTTTTTTTTTTATTTGGTAGGTCCTCCACGGATGAATTAACACCTTGGATGGGGTGTTGAGGTGGATGCCCAACTTGATGGGGTTGTTGTGTTGCATCCTTTGTAGGAGGGTGGCTAAGGATCTTGATCATATTATTTGGAGTTATGATTTTCCTTACGTAGTCGAGAACCAATTTTATGAGGAGTTCGATGTACGCTTGGCCAGACATAAGGGATGTAGAGAGATGAACGAAGAGTTCAAGTTCCTTCTCCATCATTCTTTTCATGAGAAAGGTTTTTATGGCAAATGGGTGTGTGTGCCATGATGTGGTGATTGTAGGGTTAGAGGAACAATAGAATTTTTAGGAAGTTTGAGAGATCTCTAAGTGAGATTTGACCATTTGGTGGATTCTATGTTTATTTGTGGGCTTCTGTAGTTTGCCTTCTTTTTTTTTTTTTGCTCTCTTTTTGTATGCGTTGTATTATTTCTTCTTTTTTTCCCCCTCAATCGTTGATTTAGAAAATTTTTGTTTAATTATTTCTCAGTATGTATTGGAAGTTTTCTTTTAAGGTTTTTTTTTTCACTCCTACCATTTT
CCAAAATATGTAACTGCTATGCAGATACTTTGAGCAGTAAACCCGATCAGTCTTGTGAACTTCATGGGCCTCTGGAGATGGATTCTAACAACATAGTTGTTGAGAGACAGTCATCCTTAGAGAGGTACCAACATATTCTAGAATGTGCAATAGTAATGCTTCTTAGTAATATTTTGATGCGAAACTGCCTCTTCCTTATATGCTGGTGCTGTATATCCTTGAGGTTTTGTTTTATTAAGTGGTGATTACCCAATGCTGTAGGTCATCTATGCTAGCCTTTGAAAAGGGAAGCTTGCCAAAGGCTGCGTTGGCATTTATTGATGCCATCAAGAAGAATAGGTCTCAGCAGAAGTATATTCGTAGTAAGATGATTCATGTTGAAGCTAGAATTGAGGAGAACAAAAAGCTCAGAAAACGTTTCAAAATTCTCAAAGATTTCCGGGGTTCTTGTAGAAGAAAAACAAGTTGTGCACTATCTCAGATGATTGATCCTCGAGTCCAGTTAATGTCGGCTGGAAAACCACAGGCTAAGGATTCGACAAAGGTTAGATGATAATTTATGCTTGTGACTGTATTACTTTTGAACTATTTGTACTTCCATTTTCACTAGGAGTTGATTTGTGTATTATTTGTTTACAGCTTTGAAGAATGCTTGTCTTCTTATTTTGAACAATAGACAAGCTTTTCATTAATTTGATAGAAGTTGTATAGCCAATCTTATCTTCTTTCATTTTTCTAAGTGAAAGCTTGGTTTCTTATCAAAATATACATACAATATGTATAAACAAATAAATATATATACACACATATATATAGCCAGTCTTATCATCTCAAAGCCCCCTATTTGAGCTCATAGCAAAAGGTGTATAGTTATGGAAAGAATTAGAGAATGTCATGAGGAGCATTTGGCCAAATCCAGTAATTTTTTCCCATTGTTTTTGCTTTTAAGAGAAACCCTTCCCTATAGCATTACAAGTAGTCTTTTCCAACAGGTATAAAAATCAAGTTATTGTAGAATCGAGAACCTTGAAAAAAAAATAAAAGCTTCCCGGTACAAATTAATCCTAGAGAGTGAATAGATCTTAAATTCTTTAGGACATTTAATCTTCGAGTATTTCTCTGCATGGAACTTGCTTTTCTAAATTTTTTCTAGGTTTTTGTCTATTTAAAATGCCAGGAACTAGCTTTAACCTAGTTGAACAAAAAATGTTAGCTCTCAGCCCCTGCCATATGCTGTAAAGCAGCCTAAAGTATTGTTTTATGCTAATAAATTGAAATTCCATGCAAGCCTAAAGCCTTGAATTTGAGAGTCCCACTTTGTGTTTTTGTTTTTTTTTTTGGTTATAAGAACAATGGAAAAGCAGCCTCTCGAATTGCTGGGTATCATGTAGAGGAAGTAGCAGACTGTTTGGCTATCTTTTGAGTATTATCTCAGTGAAATAACCCCTCTGTTTAGGGAAGAACTCTCCTTAAAGCAGAGCTAGTTCATGATCTTGTGTGTGGTGCGAGTATATTTGTGTTGTTCTTGTGTTGTTTGGCTGTTTTGGTGCAGGTTATTGAAGAAGATACCGGACATAATGAAATGCTGTCTGTTACTTCCTCAAGCAAGGATTTCACTGAAATATTCCATCCATTTCCAAATTTCACCCTCTTATATCCTCGCTTAAATAATTGAGGTTCAACCTTTCCCTCTTAGAAATGGTAGTGGCTCATCTGCTTTTCCAATCTATTAGTTCCTTTTGTTTAGAGGTTATTGCATATTTGGTTACTGTTAAAAGACTAACAGTAATGCTGGCGAACCTGAGAGCTCTATATTTTACAGCAACACTGATAAACCTGAGAGCTCTATGTTCTTATGTCGACCATATCAAAAAAGTCTTTGTAATATCATTTTCTATGCACGTTGGTCAGTGGATTATCTGCTTACTTTGAGCTTTATGCACCATTGTTGATTTGTTTTTTTCTAACATATCTGTTCTCTTGTTTCTGAGATTTGTTTC
TCAGTTCTTGATCTGAAAATCTTCCTCTGTTTTTTGTTTCTTTTTTTCTCTTGCACCTTCTGTTTGTGCTTTACCCCTTTGATTCTTATGTGGAGTTTTTACTTTTTTTCTGCAGAAGGACAAACGACTTTCTGCAATGTATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCTACAAAATGGCATTGACAAAGTTTCCACCTTCTGTAGATCGAAAAAAATGGTCCAATGTAGAAAGGGAGAATCTAGGGAAGGGAATAAGACAGCAATTTCAAGAGATGGTGCTTCAGATTTCAGTGGATCAAATTAGGTAAGTTTTTTTCTATTGTGTTAGCCTCATGCAATCATTTTGAGCAGTCGTGTGGATAGGCAATGAATATGTATTACAATGATGCCATGAGAGTCTTTCATTCTTATTATTGTAAAATATATTTACTGTTTTAACTATTAAGAAAATCGACTAAATCCCATGTCACTCTAAAGATTTTATTCGTTCACAGATCCTTTGTTTCTCAGTTTGCTCTTAAATAGGAGAGATGGAATTTTACAAATTAAAGAAAATGGTTTACTAGACATTTCCATTATAAAATTTCAACTTAACTTTACATCCTAGATAGTAGTGTTTTCATTATTGCTATGGTTTCCTGGTCAGAACATGATGGCAAAGTTGGAAACATGATGAGGTTATAGAAAGGTTCAGGCAACTGTGCTTTTGCTCAATGCTATTGGGGCTCTTTCGCAACAGATGTGGTTGGAGAGAAATAGGAAAAATTTTCCAGCCGGGAGAGGGAGGGGGGTGGTTATGGCTGGAGATTTGGGATGTAATTGTTCTTGAGTGTTTTATCCAACTCTTTTTGTAACTACTTATCCTCTTTCTACCCGCTTGAGGTCCATTTGTGACTCCTTTGCTATTTTGGGAGGATGATTCCTCTCCTTGTAATTTGATAGGATACCACCTTCCCTTTTTTCTTTTCCTTTTTCCAGAAGCCTTATTGCAATTATTGTGGTATTTGGCAGTGGGCTACAAGGATTTTCAGCAGATCTTTGCTATTTTGGGAGGATGATTCCTCTCCTTGTAATTTGATAGGATACCCCCTTCCCTTTTTTCTTTTCCTTTTTCCAGAAGCCTTATTGCAATTATTGTGGTATTTGGCAGTGGGCTACAAGGATTTTCAGCAGATTCAAATGATCTGGATAACATTCTTGCATCAATAAAAGACCTTGATATTACCCCTGAAAAGATTAGGGAATTTCTACCAAAAGTTAATTGGGAGAAACTGGCTTCCATGTATCTTCGAGGTCGCTCAGGGGCAGAATGTGAAACAAGGTATGTACTTCAGACAAACGTCTTTGCTTTGCATATAACAAATATGGTTGAAGAAACTATTTTGTGAATTATCTTTCTATAAATTCATCTCTAGTTATTTTATGTATGTATATATTAAAATCCATTTCTAGTCTTTTAAAAGTTTCAATTTCAACAAGTCATTACTGTCTTACTAAATTGAGTACATCAACCAAGTGATGATTTGACGTTAGAGTACAATTTTATTAATTGTGCTAGTTAGATATGACAATCTATAAAATTGAAAGCTTGCCCACTTGAAATAAGCGGGTCACATTACCTTATTTAAAAAACAGTGCATTTTTTTTTTTTTTTAATTTCTAGACCATTACTAACTTTTATGGATAGTTAGGACTTGCTTGAGAAAACGTAATGCATAAAAGAATAAAGGATAATGTTGAGACTTCTAAGACATAAGAGACCAAATTGAAAATGAGATATACATTAGAAACCAAATGATATATTTAGCCCAAGTTTTTTTAATGGAGCAGATAATATTTCAAGTTAAGCACCATAATTTGCTAGTACTTTACCCAATTTATCTTGATGTGAAAACAATACTATAGAAGATCTTTTCAAGAAATATTTTAAAGATGAGCCTGTCATCTTTCGAGATTTTTGGTGCTTGAATATTATTCAAGCTGTTATTCATA
TGAATTTGAGGCTACTTTGTATGCCACAGATTTCTTTAACCAGACTTGAATGAAATTGTATGTTGGTAATGGTATTATTTTACCTTTAATCTGGTAGATTATTATTTTCTCAATACAGAGGCTTTTGCTATTGGTATATGTTCAGTTCTTGACTGGGTCCATATTGATTTTTTGTGAAGCATGTCAACTTCAATCGGCATTGCAGACGTCTATGACGATCATCATCACAGATATATTGACTATGATTGGTTTTGAATAATAGTACGAGGTCTTCATATTATAGTACAGGTGGGACCAACTTGTAACAAGTGCCTTATTCAAACAGTGGTCTTCAATATGTCTGTGATGACAATCGTCACAGACATCTGTGATGTCTTATATTTCCTTGAATAACACATTTTCAATGTATATTTTCTGACTTGATGATTGTCATTGCTAATCCATGTAGGTGGTTGAATTTTGAAGACCCCCTAATTAATCGGAATCCATGGACTACAAGTGAGGACAAGAATCTTTTGTTTACCATCCAACAGAAAGGGTTGAATAACTGGATTGACATAGCAGTTTCATTGGGTACAAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAAACTTCGATCTGCTGTTGCTATATTTGGTGTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACCGGTCCACAGTGCTCTAATAGGTTGGTCGTAGTTACAATAATCGGACCAGTTCAGTTTTCATCTTTGCGTTTCATACTTCAATCTTTTCTCAAAGGAGAGAAAAATGTTCATAGTGAGTGCTTTGAGATGCTTTTAGAGATTTTTCCATGTTTGATAGGCTTATGTTTTCTGTTCACCATTATATTTTATTCATTTTCAGTTGTATTTTTGAAAAGAAAAAAAAAATCAAAACACCAAGTTATTTGAATGTTACTGTGGTTTAAATCATAGTGCTGTTATGTGCCATTTGGTTGATTTTCAAGTGCAGTCGTTCACTTGCTATAATGAATTTCTTCTATTATGCAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGCTATTTCACTCCAGATGAAGACAGTCGCTTGACAATTGCGGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTTTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGGTGTGCTAAGTTGGCCTATTTTATACGTCATTTTTATTAGCTAAGTGCCATTTTTTTTTTTTTGGGTGAAGGGGAGGGAGTGTAACGAATAATTTCCATTAACTCAAGCCTGAATGCAAAAATGAACAAAAGAAAGCGTCCAGAAAGTATACACTCTGCAATTGGCTTTGATTTATATATATATATGTATAACATATATAAAAGTTAAGTATTTACTAAAGGATTTAGTTTGAGATGCCAATGCACAACCATAAAACAGCATCAAACCAACAATCACTTCCACTCCCCTATCTTCAATTAAATTTTAGTGTCTCCGTAACTAGAGTATAAAAAAAAGCTTAATGCATATTGTTTATCTGATAAACAAACAATTAGTTATTAGTCTGATTTGACGTATTATTAATTGTGATGGCAGATGGTTCAATTGTTTAGATCCTTCCTTGAGAAGATGTCAATGGACAGAAGAGGAGGATTTAAGGCTAGAGATAGCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCGTCACGAACAGATAATGAGTGCCGGAGGTAATGGTGTGTGGATTGAGTCTGCAAATTCCGTTTGCTTCAAAACCGTTATTCAATAGAAAGAAATTTGTACATGCCAGATGCTACTTTAAAATGTTCGGCATTCTCTTATTTCAATTATTTTTAATATATTGTTCTTCATATAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCTTTGCTCCA
GGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCCACTGACTTTCGGCCTCGACTGGATACAGATTTATTATGTAATACTGATGATCCAAAACCTGCCCCGAAAAGAAATGTGAAGACGAGGTTAGTTTGAATATGACCCTTCTAATTTATTTATCTTGTACATAAAAAGAAAGAAAGAAAACTATTTGTCTTGATTTCAAGTTATATTCTAGCTTTAGCGTGTAATACCTTGGTCTACAGAAGGATTCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGGTAGGTTATGCTTTTCCTGTATTGTTCTATAGTCGGGTCTTCTTTTTCTTTTCTGTTTTCTTTTTCCTGAAGGAAATCTGTAGTCAGGCATTGAAGCTCTTGTTATTTCTTTCTTCATGTTTCCTGCATTAGCATGCATGTTAGAAATTGCAAGTTGATAGAGCTTCATATTTTTCAACAGTGATGCTCCAAAGAAGAGGAGATCAAATAACCAGAGGAATCAAGCTGATGCCACTGCTCAGGTAAGTATTGCAAATATTTCGTCTTCTGTCCCGGAGGAGGTTAAATCTATAAAACCTCAAAGAAAACGAAGTAGAAATGGAGCTCATACCACTAAGAGGAAGAAGGGAGTTCTGGAGCCACGTTCTAACAGTGAGAGACGTGTCGAACAGAATTTGGATACTCAGAGCCTCGAGGTGCAGCTGAATAGTAAGGAATCAGAGAGGACCAGCAGTGACTGCACTGAGACTGTTGATGAGAATGGTATGGAGTCGTGTGAGAACAAAGTTGCAGATAAGCTTTCTAAAAGAGATGTATGCTTTTCAGAGCAAGAAGAAAATCAAAACTCTACTGGATCTTCTGGAGTATCAGTATTGTCAGAAATGACCAATGACATGGACGAGTATAATCCCTCTATTATCCTTCCAGCAGATACAGCATTGCTGGCCAGCACTATTGGGGATGATATTATAGAAAGGAATGGGGAGAGTGTTGCAGACAAAGATCTGGATGACAGTAACAGTTTCTCGTTACCGCACAGTTGCTTAGAACTCAGGGCAACTGACAGGGAAGGTGTCAATAGCTATTCTGTGGATGAAGACACAGATAAAGGCCATGGGGTTTGCAAGTACCAAGGCAGAAGGAAGAAAGATAGTAAAACATCAAATAAGAGCCAGGATTCATTGGTTTCTAGTCGACAAGTGGAGCTGGAGAGGTCGGGGGCGAAGGAGCTTCATCGTCATAATCAATCAAAGAAGAGAAAGCATAACAGTACAAACACGACGAGTTTATTAGGAACATCGGAGACTGTTGAAGAGGTCGATGAGTGCACTCTTCTGGGCTTTTTGCAGAAAAGAGTGAAGAGGACGAACACGAGACATGACAAGAAAGTTGATGGCAGCTCTAGTTCATTAGAAGTTGATAATGATGATAATGATCCTTCCATTGCCTCGCTTCTCAAGGATAAATGTAAGAGAAAAAGGCATGAAGCGGCCAGTGGTGGAGGCTAAAACTTTCAATTCAATGATGTTGATGGAGTTTGGAGAAGCTGTACAAATCTCTACGCAATTGTGAGAAAGTGATAGACCACCAATTAAGCATGTGTATAAGAGAAAGAAGAGAAACAAACAGGGAATGAATACTTAGTCAGTTGTAAGACTGTTAGTCTGGTTAGGTTGTTAAGTCAGTTACTAACCATATCCCCTATATATATTTCTATCTTGTAAGAGGGAGGGGTACCTTTTGGCTAAGAAGTGTAGTTGGGTAAAGGGTGGAGAGATAGAGGGAACTCTCTAATATTCCCTTGGGTGTATTGTGTTGTGCATAAATAATACCAAGTTTCTATCAATTTGGTATCAGGGCAGTGTAATTTTTTTACCAAATTGATAGGAACTTGGTATTATTTATGCACAACACAATACATCCAAGGGAATATTTGAGAGTTCCCTCTATCTCTC
CACCCTTTACCCAACTACAGTTCTTAGCCAAAAGGTACCCGTCCCTCTTACAAGATAGAACTATATAGGGGATATGGTTAGTACCGACTTAACAACCTAACTAGACTAACAGTCTTACAACTGACTAAGTATTGATTCCCTGTTCGTTTCTCTTCTTTCTCTTATACACATGTTTAATTTGTGGTCTATCAGAAAGAGTATCAAATAAAGCTACTCCCAAGGTCGCACAGGCACAATGGATAAATGCACAACTAACTGTATAGGTTTGATTTTATAGTATGCGAAATTCTGATCCTAATAGCTGAAGCTGAGAGAGGTTTGGCAATTTTGTTTTGTTTTATGGGTCATTTTTTAAGCTAATTCCCTTTGAGAAGGGTGGTGGTTGCCCCATAATCACAGCGAGCAAATTTTGATGAGTTGCTCGGTAGTGATACTCTTGAGTGATAATCCTCTCTTTGTATATTTCTCACCATCTTAAATGAAATCGTAGGGATGTGTATAGGTTTTTTATTATTTTCTTTTATACTTTTAACCACATTGTTAGAAATTGTTATATTCTTTTTGTGTTCTCAAAATTGATAGTCTTCTTTATGTCTGGATTCGTTTTCTC

mRNA sequence

ATGTATCTTCGAGGTCGCTCAGGGGCAGAATGTGAAACAAGGTGGTTGAATTTTGAAGACCCCCTAATTAATCGGAATCCATGGACTACAAGTGAGGACAAGAATCTTTTGTTTACCATCCAACAGAAAGGGTTGAATAACTGGATTGACATAGCAGTTTCATTGGGTACAAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAAACTTCGATCTGCTGTTGCTATATTTGGTGTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACCGGTCCACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGCTATTTCACTCCAGATGAAGACAGTCGCTTGACAATTGCGGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTTTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTTCAATTGTTTAGATCCTTCCTTGAGAAGATGTCAATGGACAGAAGAGGAGGATTTAAGGCTAGAGATAGCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCGTCACGAACAGATAATGAGTGCCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCTTTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCCACTGACTTTCGGCCTCGACTGGATACAGATTTATTATGTAATACTGATGATCCAAAACCTGCCCCGAAAAGAAATGTGAAGACGAGAAGGATTCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAGAAGAGGAGATCAAATAACCAGAGGAATCAAGCTGATGCCACTGCTCAGGTAAGTATTGCAAATATTTCGTCTTCTGTCCCGGAGGAGGTTAAATCTATAAAACCTCAAAGAAAACGAAGTAGAAATGGAGCTCATACCACTAAGAGGAAGAAGGGAGTTCTGGAGCCACGTTCTAACAGTGAGAGACGTGTCGAACAGAATTTGGATACTCAGAGCCTCGAGGTGCAGCTGAATAGTAAGGAATCAGAGAGGACCAGCAGTGACTGCACTGAGACTGTTGATGAGAATGGTATGGAGTCGTGTGAGAACAAAGTTGCAGATAAGCTTTCTAAAAGAGATGTATGCTTTTCAGAGCAAGAAGAAAATCAAAACTCTACTGGATCTTCTGGAGTATCAGTATTGTCAGAAATGACCAATGACATGGACGAGTATAATCCCTCTATTATCCTTCCAGCAGATACAGCATTGCTGGCCAGCACTATTGGGGATGATATTATAGAAAGGAATGGGGAGAGTGTTGCAGACAAAGATCTGGATGACAGTAACAGTTTCTCGTTACCGCACAGTTGCTTAGAACTCAGGGCAACTGACAGGGAAGGTGTCAATAGCTATTCTGTGGATGAAGACACAGATAAAGGCCATGGGGTTTGCAAGTACCAAGGCAGAAGGAAGAAAGATAGTAAAACATCAAATAAGAGCCAGGATTCATTGGTTTCTAGTCGACAAGTGGAGCTGGAGAGGTCGGGGGCGAAGGAGCTTCATCGTCATAATCAATCAAAGAAGAGAAAGCATAACAGTACAAACACGACGAGTTTATTAGGAACATCGGAGACTGTTGAAGAGGTCGATGAGTGCACTCTTCTGGGCTTTTTGCAGAAAAGAGTGAAGAGGACGAACACGAGACATGACAAGAAAGTTGATGGCAGCTCTAGTTCATTAGAAGTTGATAATGATGATAATGATCCTTCCATTGCCTCGCTTCTCAAGGATAAATGTAAGAGAAAAAGGCATGAAGCGGCCAGTGGTGGAGGCTAA

Coding sequence (CDS)

ATGTATCTTCGAGGTCGCTCAGGGGCAGAATGTGAAACAAGGTGGTTGAATTTTGAAGACCCCCTAATTAATCGGAATCCATGGACTACAAGTGAGGACAAGAATCTTTTGTTTACCATCCAACAGAAAGGGTTGAATAACTGGATTGACATAGCAGTTTCATTGGGTACAAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAAACTTCGATCTGCTGTTGCTATATTTGGTGTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACCGGTCCACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGCTATTTCACTCCAGATGAAGACAGTCGCTTGACAATTGCGGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTTTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTTCAATTGTTTAGATCCTTCCTTGAGAAGATGTCAATGGACAGAAGAGGAGGATTTAAGGCTAGAGATAGCAATTCAGGAACATGGATATAGCTGGGCTAAGGTAGCTGCATGTGTGCCGTCACGAACAGATAATGAGTGCCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCTTTGCTCCAGGAAGCTAGAAAGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCCACTGACTTTCGGCCTCGACTGGATACAGATTTATTATGTAATACTGATGATCCAAAACCTGCCCCGAAAAGAAATGTGAAGACGAGAAGGATTCCAGTGTCAAGGAATGAAAAGAGTGCTACTGGTGATGCTCCAAAGAAGAGGAGATCAAATAACCAGAGGAATCAAGCTGATGCCACTGCTCAGGTAAGTATTGCAAATATTTCGTCTTCTGTCCCGGAGGAGGTTAAATCTATAAAACCTCAAAGAAAACGAAGTAGAAATGGAGCTCATACCACTAAGAGGAAGAAGGGAGTTCTGGAGCCACGTTCTAACAGTGAGAGACGTGTCGAACAGAATTTGGATACTCAGAGCCTCGAGGTGCAGCTGAATAGTAAGGAATCAGAGAGGACCAGCAGTGACTGCACTGAGACTGTTGATGAGAATGGTATGGAGTCGTGTGAGAACAAAGTTGCAGATAAGCTTTCTAAAAGAGATGTATGCTTTTCAGAGCAAGAAGAAAATCAAAACTCTACTGGATCTTCTGGAGTATCAGTATTGTCAGAAATGACCAATGACATGGACGAGTATAATCCCTCTATTATCCTTCCAGCAGATACAGCATTGCTGGCCAGCACTATTGGGGATGATATTATAGAAAGGAATGGGGAGAGTGTTGCAGACAAAGATCTGGATGACAGTAACAGTTTCTCGTTACCGCACAGTTGCTTAGAACTCAGGGCAACTGACAGGGAAGGTGTCAATAGCTATTCTGTGGATGAAGACACAGATAAAGGCCATGGGGTTTGCAAGTACCAAGGCAGAAGGAAGAAAGATAGTAAAACATCAAATAAGAGCCAGGATTCATTGGTTTCTAGTCGACAAGTGGAGCTGGAGAGGTCGGGGGCGAAGGAGCTTCATCGTCATAATCAATCAAAGAAGAGAAAGCATAACAGTACAAACACGACGAGTTTATTAGGAACATCGGAGACTGTTGAAGAGGTCGATGAGTGCACTCTTCTGGGCTTTTTGCAGAAAAGAGTGAAGAGGACGAACACGAGACATGACAAGAAAGTTGATGGCAGCTCTAGTTCATTAGAAGTTGATAATGATGATAATGATCCTTCCATTGCCTCGCTTCTCAAGGATAAATGTAAGAGAAAAAGGCATGAAGCGGCCAGTGGTGGAGGCTAA

Protein sequence

MYLRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNEKSATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLEVQLNSKESERTSSDCTETVDENGMESCENKVADKLSKRDVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSIILPADTALLASTIGDDIIERNGESVADKDLDDSNSFSLPHSCLELRATDREGVNSYSVDEDTDKGHGVCKYQGRRKKDSKTSNKSQDSLVSSRQVELERSGAKELHRHNQSKKRKHNSTNTTSLLGTSETVEEVDECTLLGFLQKRVKRTNTRHDKKVDGSSSSLEVDNDDNDPSIASLLKDKCKRKRHEAASGGG
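
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly, and the function name translate is a hypothetical helper. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First six codons of the CDS listed above.
print(translate("ATGTATCTTCGAGGTCGC"))  # MYLRGR
# Last five codons of the CDS, ending in the TAA stop codon.
print(translate("AGTGGTGGAGGCTAA"))     # SGGG
```

Both fragments agree with the start (MYLRGR) and end (SGGG) of the protein sequence shown above.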
Homology
BLAST of Spg032687 vs. NCBI nr
Match: XP_022921860.1 (uncharacterized protein LOC111430000 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1027.7 bits (2656), Expect = 4.3e-296
Identity = 539/647 (83.31%), Positives = 572/647 (88.41%), Query Frame = 0

Query: 1   MYLRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRT 60
           MYL+GRSGAECE RWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRT
Sbjct: 358 MYLQGRSGAECEARWLNFEDPLINRNSWTTSEDKNLLFTIQQKGLNNWIELAVSLGTNRT 417

Query: 61  PFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRW 120
           PFQCLSRYQRSLNASILK EWTKDEDDKLRSAVAIFG GDWQAVASTLEGRTGPQCSNRW
Sbjct: 418 PFQCLSRYQRSLNASILKSEWTKDEDDKLRSAVAIFGEGDWQAVASTLEGRTGPQCSNRW 477

Query: 121 KKSLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 180
           KKSLDPARTKRGYFTPDEDSRL IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS
Sbjct: 478 KKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 537

Query: 181 LRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK 240
           LRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARK
Sbjct: 538 LRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNQVPLLQEARK 597

Query: 241 IQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNEK 300
           IQK ALISNFVDRESERPALGPTDFRP  ++ LLCNTDDP+ APKRNV+ RR+PVSRNEK
Sbjct: 598 IQKVALISNFVDRESERPALGPTDFRPVPNSHLLCNTDDPETAPKRNVRMRRMPVSRNEK 657

Query: 301 SATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTTKRKKG 360
           SA GDAPKK +SNNQRNQAD TAQV  AN +SSVP EVKS KPQRKR+R+GA+TT R+KG
Sbjct: 658 SANGDAPKKMKSNNQRNQADETAQVDFANNTSSVP-EVKSTKPQRKRTRHGAYTT-RRKG 717

Query: 361 VLEPRSNSERRVEQNLDTQSLEVQLNSKE-SERTSSDCTETVDENGMESCENKVADKLSK 420
             +   NSER  EQN DT+SLEVQLN KE +ER +SDC ETVDENGME  ENK A+  S+
Sbjct: 718 APKIGCNSERCAEQNSDTRSLEVQLNCKEPAERINSDCPETVDENGMEVFENKAAEMHSE 777

Query: 421 RDVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSIILPADTALLASTIGDDIIERNGE 480
             VCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS   P DT LLAS   DDIIE  G 
Sbjct: 778 GVVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS--TPPDTTLLASITADDIIETKGV 837

Query: 481 SVADKDLDDSNSFSLPHSCLELRATDREGVNSYSVDEDTDKGHGVCKYQGRRKKDSKTSN 540
           +VADKDLDDSNSFSLP SCLELR TD EGV+SYSVDE TDK HGVCK QGRRKK+SK SN
Sbjct: 838 NVADKDLDDSNSFSLPQSCLELRTTDSEGVDSYSVDEFTDKSHGVCKPQGRRKKNSKRSN 897

Query: 541 KSQDSLVSSRQVELERSGAKELHRHNQSKKRKHNSTNTTSLLGTSETVEEVDECTLLGFL 600
           KSQDSLVS +Q ELE SG  ELHR NQSKKRKH+ TN TS LGT E VEEVD+CTL GFL
Sbjct: 898 KSQDSLVSCQQAELEMSGMNELHRCNQSKKRKHSGTN-TSPLGTMEAVEEVDDCTLQGFL 957

Query: 601 QKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPSIASLLKDKCKRKRH 646
           QKR+KRT T HDKKVDGSSS+  EVDNDDNDP++A LLKDK KRK+H
Sbjct: 958 QKRLKRTTTTHDKKVDGSSSTPPEVDNDDNDPTLALLLKDKLKRKKH 999
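
The percentages in the HSP header for this match are ratios of matching alignment columns to the alignment length. The sketch below recomputes them from the counts reported above (539 identical and 572 positive columns out of 647); the helper name percent is hypothetical, and rounding to two decimals is an assumption chosen to match the figures as displayed.

```python
def percent(matches, alignment_length):
    """Return matches as a percentage of alignment_length, to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the HSP header of the XP_022921860.1 match.
print(percent(539, 647))  # identity
print(percent(572, 647))  # positives (identical or conservatively substituted)
```

The results, 83.31 and 88.41, agree with the values printed in the HSP header.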

BLAST of Spg032687 vs. NCBI nr
Match: XP_023515735.1 (uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1027.3 bits (2655), Expect = 5.7e-296
Identity = 536/647 (82.84%), Positives = 573/647 (88.56%), Query Frame = 0

Query: 1   MYLRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRT 60
           MYLRGRSGAECE RWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRT
Sbjct: 358 MYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIELAVSLGTNRT 417

Query: 61  PFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRW 120
           PFQCLSRYQRSLNASILK EWTKDEDDKLRSAVA+FG GDWQAVASTLEGRTGPQCSNRW
Sbjct: 418 PFQCLSRYQRSLNASILKSEWTKDEDDKLRSAVAVFGEGDWQAVASTLEGRTGPQCSNRW 477

Query: 121 KKSLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 180
           KKSLDPARTKRGYFTPDEDSRL IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS
Sbjct: 478 KKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 537

Query: 181 LRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK 240
           LRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARK
Sbjct: 538 LRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNQVPLLQEARK 597

Query: 241 IQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNEK 300
           IQK ALISNFVDRESERPALGPTDFRP  ++ LLCNTDDP+ APKRNV+TRR+PVSRNEK
Sbjct: 598 IQKVALISNFVDRESERPALGPTDFRPVPNSHLLCNTDDPETAPKRNVRTRRMPVSRNEK 657

Query: 301 SATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTTKRKKG 360
           SA GDAPK+R+SNNQRN+AD TAQV   N +SSVP EVKS KPQRKR+R+GA+TT R+KG
Sbjct: 658 SANGDAPKRRKSNNQRNRADETAQVDFGNNTSSVP-EVKSTKPQRKRTRHGAYTT-RRKG 717

Query: 361 VLEPRSNSERRVEQNLDTQSLEVQLNSKE-SERTSSDCTETVDENGMESCENKVADKLSK 420
             +   NSER  EQN DT+S+EVQLN KE +ER +SDC ETVDENGME  ENK A+  S+
Sbjct: 718 APKIGCNSERCAEQNSDTRSVEVQLNCKEPAERINSDCPETVDENGMEVFENKAAEMHSE 777

Query: 421 RDVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSIILPADTALLASTIGDDIIERNGE 480
             VCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS +   DT LLAS   DDIIE  G 
Sbjct: 778 GVVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSTL--PDTTLLASITADDIIETKGV 837

Query: 481 SVADKDLDDSNSFSLPHSCLELRATDREGVNSYSVDEDTDKGHGVCKYQGRRKKDSKTSN 540
           +VADKDLDDSNSFSLP SCLELR TD EGV+SYSVDE TDK HGVCK QGRRKK+SK SN
Sbjct: 838 NVADKDLDDSNSFSLPQSCLELRTTDSEGVDSYSVDEFTDKSHGVCKPQGRRKKNSKRSN 897

Query: 541 KSQDSLVSSRQVELERSGAKELHRHNQSKKRKHNSTNTTSLLGTSETVEEVDECTLLGFL 600
           KSQDSLVS +Q ELE SG  ELHR NQSKKRKH+ TN TS LGT E VEEVD+CTL GFL
Sbjct: 898 KSQDSLVSCQQAELEMSGTNELHRCNQSKKRKHSGTN-TSPLGTMEAVEEVDDCTLQGFL 957

Query: 601 QKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPSIASLLKDKCKRKRH 646
           QKR+KRT T HDKKVDGSSS+  EVDNDDNDP++A LL DK KRK+H
Sbjct: 958 QKRLKRTTTTHDKKVDGSSSTPPEVDNDDNDPTLALLLNDKLKRKKH 999

BLAST of Spg032687 vs. NCBI nr
Match: KAG7023314.1 (Myb-like protein L [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1021.9 bits (2641), Expect = 2.4e-294
Identity = 539/647 (83.31%), Positives = 572/647 (88.41%), Query Frame = 0

Query: 1    MYLRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRT 60
            MYLRGRSGAECE RWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRT
Sbjct: 368  MYLRGRSGAECEARWLNFEDPLINRNSWTTSEDKNLLFTIQQKGLNNWIELAVSLGTNRT 427

Query: 61   PFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRW 120
            PFQCLSRYQRSLNASILK EWTKDEDDKLRSAVAIFG GDWQAVASTLEGRTGPQCSNRW
Sbjct: 428  PFQCLSRYQRSLNASILKSEWTKDEDDKLRSAVAIFGEGDWQAVASTLEGRTGPQCSNRW 487

Query: 121  KKSLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 180
            KKSLDPARTKRGYFTPDEDSRL IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS
Sbjct: 488  KKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 547

Query: 181  LRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK 240
            LRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARK
Sbjct: 548  LRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNQVPLLQEARK 607

Query: 241  IQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNEK 300
            IQK ALISNFVDRESERPALGPTDFRP  ++ LLCNTDDP+ APKRNV+ RR+PVSRNEK
Sbjct: 608  IQKVALISNFVDRESERPALGPTDFRPVPNSHLLCNTDDPETAPKRNVRMRRMPVSRNEK 667

Query: 301  SATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTTKRKKG 360
            SA GDAPKKR+SNNQRN+AD TAQV  AN +SSVP EVKS KPQRKR+R+GA+TT R+KG
Sbjct: 668  SANGDAPKKRKSNNQRNRADETAQVDFANNTSSVP-EVKSTKPQRKRTRHGAYTT-RRKG 727

Query: 361  VLEPRSNSERRVEQNLDTQSLEVQLNSKE-SERTSSDCTETVDENGMESCENKVADKLSK 420
              +   NSER  EQN DTQSLEVQLN KE +ER +SDC ETVDENGME  ENK A+  S+
Sbjct: 728  APKIGCNSERCAEQNSDTQSLEVQLNCKEPAERINSDCPETVDENGMEVFENKAAEMHSE 787

Query: 421  RDVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSIILPADTALLASTIGDD-IIERNG 480
              VCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS   P DT LLAS   DD IIE  G
Sbjct: 788  GVVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS--TPPDTTLLASITADDIIIETKG 847

Query: 481  ESVADKDLDDSNSFSLPHSCLELRATDREGVNSYSVDEDTDKGHGVCKYQGRRKKDSKTS 540
             +VADKDLDDSNSFSLP SCLELR TD EGV+SYSVDE TDK HGVCK QGRRKK+SK S
Sbjct: 848  VNVADKDLDDSNSFSLPQSCLELRTTDSEGVDSYSVDEFTDKSHGVCKPQGRRKKNSKRS 907

Query: 541  NKSQDSLVSSRQVELERSGAKELHRHNQSKKRKHNSTNTTSLLGTSETVEEVDECTLLGF 600
            NKSQDSLVS +Q ELE SG  ELHR NQSKKRKH+ TN TS LGT E VEEVD+CTL GF
Sbjct: 908  NKSQDSLVSCQQAELEMSGMNELHRCNQSKKRKHSGTN-TSPLGTMEAVEEVDDCTLQGF 967

Query: 601  LQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPSIASLLKDKCKRKR 645
            LQKR+KRT T HDKKVDGSSS+  EVDNDDNDP++A LLKDK KR++
Sbjct: 968  LQKRLKRTTTTHDKKVDGSSSTPPEVDNDDNDPTLALLLKDKLKREK 1009

BLAST of Spg032687 vs. NCBI nr
Match: XP_022921861.1 (uncharacterized protein LOC111430000 isoform X2 [Cucurbita moschata])

HSP 1 Score: 1021.1 bits (2639), Expect = 4.1e-294
Identity = 538/647 (83.15%), Positives = 571/647 (88.25%), Query Frame = 0

Query: 1   MYLRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRT 60
           MYL+GRSGAECE RWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRT
Sbjct: 358 MYLQGRSGAECEARWLNFEDPLINRNSWTTSEDKNLLFTIQQKGLNNWIELAVSLGTNRT 417

Query: 61  PFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRW 120
           PFQCLSRYQRSLNASILK EWTKDEDDKLRSAVAIFG GDWQAVASTLEGRTGPQCSNRW
Sbjct: 418 PFQCLSRYQRSLNASILKSEWTKDEDDKLRSAVAIFGEGDWQAVASTLEGRTGPQCSNRW 477

Query: 121 KKSLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 180
           KKSLDPARTKRGYFTPDEDSRL IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS
Sbjct: 478 KKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 537

Query: 181 LRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK 240
           LRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARK
Sbjct: 538 LRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC-RRWKKLFPNQVPLLQEARK 597

Query: 241 IQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNEK 300
           IQK ALISNFVDRESERPALGPTDFRP  ++ LLCNTDDP+ APKRNV+ RR+PVSRNEK
Sbjct: 598 IQKVALISNFVDRESERPALGPTDFRPVPNSHLLCNTDDPETAPKRNVRMRRMPVSRNEK 657

Query: 301 SATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTTKRKKG 360
           SA GDAPKK +SNNQRNQAD TAQV  AN +SSVP EVKS KPQRKR+R+GA+TT R+KG
Sbjct: 658 SANGDAPKKMKSNNQRNQADETAQVDFANNTSSVP-EVKSTKPQRKRTRHGAYTT-RRKG 717

Query: 361 VLEPRSNSERRVEQNLDTQSLEVQLNSKE-SERTSSDCTETVDENGMESCENKVADKLSK 420
             +   NSER  EQN DT+SLEVQLN KE +ER +SDC ETVDENGME  ENK A+  S+
Sbjct: 718 APKIGCNSERCAEQNSDTRSLEVQLNCKEPAERINSDCPETVDENGMEVFENKAAEMHSE 777

Query: 421 RDVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSIILPADTALLASTIGDDIIERNGE 480
             VCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS   P DT LLAS   DDIIE  G 
Sbjct: 778 GVVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS--TPPDTTLLASITADDIIETKGV 837

Query: 481 SVADKDLDDSNSFSLPHSCLELRATDREGVNSYSVDEDTDKGHGVCKYQGRRKKDSKTSN 540
           +VADKDLDDSNSFSLP SCLELR TD EGV+SYSVDE TDK HGVCK QGRRKK+SK SN
Sbjct: 838 NVADKDLDDSNSFSLPQSCLELRTTDSEGVDSYSVDEFTDKSHGVCKPQGRRKKNSKRSN 897

Query: 541 KSQDSLVSSRQVELERSGAKELHRHNQSKKRKHNSTNTTSLLGTSETVEEVDECTLLGFL 600
           KSQDSLVS +Q ELE SG  ELHR NQSKKRKH+ TN TS LGT E VEEVD+CTL GFL
Sbjct: 898 KSQDSLVSCQQAELEMSGMNELHRCNQSKKRKHSGTN-TSPLGTMEAVEEVDDCTLQGFL 957

Query: 601 QKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPSIASLLKDKCKRKRH 646
           QKR+KRT T HDKKVDGSSS+  EVDNDDNDP++A LLKDK KRK+H
Sbjct: 958 QKRLKRTTTTHDKKVDGSSSTPPEVDNDDNDPTLALLLKDKLKRKKH 998

BLAST of Spg032687 vs. NCBI nr
Match: XP_023515736.1 (uncharacterized protein LOC111779809 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1020.8 bits (2638), Expect = 5.3e-294
Identity = 535/647 (82.69%), Positives = 572/647 (88.41%), Query Frame = 0

Query: 1   MYLRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRT 60
           MYLRGRSGAECE RWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRT
Sbjct: 358 MYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIELAVSLGTNRT 417

Query: 61  PFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRW 120
           PFQCLSRYQRSLNASILK EWTKDEDDKLRSAVA+FG GDWQAVASTLEGRTGPQCSNRW
Sbjct: 418 PFQCLSRYQRSLNASILKSEWTKDEDDKLRSAVAVFGEGDWQAVASTLEGRTGPQCSNRW 477

Query: 121 KKSLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 180
           KKSLDPARTKRGYFTPDEDSRL IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS
Sbjct: 478 KKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 537

Query: 181 LRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK 240
           LRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARK
Sbjct: 538 LRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC-RRWKKLFPNQVPLLQEARK 597

Query: 241 IQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNEK 300
           IQK ALISNFVDRESERPALGPTDFRP  ++ LLCNTDDP+ APKRNV+TRR+PVSRNEK
Sbjct: 598 IQKVALISNFVDRESERPALGPTDFRPVPNSHLLCNTDDPETAPKRNVRTRRMPVSRNEK 657

Query: 301 SATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTTKRKKG 360
           SA GDAPK+R+SNNQRN+AD TAQV   N +SSVP EVKS KPQRKR+R+GA+TT R+KG
Sbjct: 658 SANGDAPKRRKSNNQRNRADETAQVDFGNNTSSVP-EVKSTKPQRKRTRHGAYTT-RRKG 717

Query: 361 VLEPRSNSERRVEQNLDTQSLEVQLNSKE-SERTSSDCTETVDENGMESCENKVADKLSK 420
             +   NSER  EQN DT+S+EVQLN KE +ER +SDC ETVDENGME  ENK A+  S+
Sbjct: 718 APKIGCNSERCAEQNSDTRSVEVQLNCKEPAERINSDCPETVDENGMEVFENKAAEMHSE 777

Query: 421 RDVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSIILPADTALLASTIGDDIIERNGE 480
             VCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS +   DT LLAS   DDIIE  G 
Sbjct: 778 GVVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSTL--PDTTLLASITADDIIETKGV 837

Query: 481 SVADKDLDDSNSFSLPHSCLELRATDREGVNSYSVDEDTDKGHGVCKYQGRRKKDSKTSN 540
           +VADKDLDDSNSFSLP SCLELR TD EGV+SYSVDE TDK HGVCK QGRRKK+SK SN
Sbjct: 838 NVADKDLDDSNSFSLPQSCLELRTTDSEGVDSYSVDEFTDKSHGVCKPQGRRKKNSKRSN 897

Query: 541 KSQDSLVSSRQVELERSGAKELHRHNQSKKRKHNSTNTTSLLGTSETVEEVDECTLLGFL 600
           KSQDSLVS +Q ELE SG  ELHR NQSKKRKH+ TN TS LGT E VEEVD+CTL GFL
Sbjct: 898 KSQDSLVSCQQAELEMSGTNELHRCNQSKKRKHSGTN-TSPLGTMEAVEEVDDCTLQGFL 957

Query: 601 QKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPSIASLLKDKCKRKRH 646
           QKR+KRT T HDKKVDGSSS+  EVDNDDNDP++A LL DK KRK+H
Sbjct: 958 QKRLKRTTTTHDKKVDGSSSTPPEVDNDDNDPTLALLLNDKLKRKKH 998

BLAST of Spg032687 vs. ExPASy Swiss-Prot
Match: Q54NA6 (Myb-like protein L OS=Dictyostelium discoideum OX=44689 GN=mybL PE=3 SV=1)

HSP 1 Score: 215.7 bits (548), Expect = 1.6e-54
Identity = 125/319 (39.18%), Positives = 175/319 (54.86%), Query Frame = 0

Query: 6   RSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCL 65
           RS  E   RW N +DP IN+ P+T  EDK LL   ++   + W  I++ LGTNRTP  C+
Sbjct: 525 RSPLEAYLRWKNHDDPSINKGPFTKEEDKKLLTLAKKYDGHEWEKISIELGTNRTPLACI 584

Query: 66  SRYQRSLNASILKREWTKDEDDKLRSAVAIFGVG---DWQAVASTLEGRTGPQCSNRWKK 125
            RYQRSLN+ ++KREWTK+ED+ L   + +   G   DWQ +   + GRTG QC +RW K
Sbjct: 585 QRYQRSLNSKMMKREWTKEEDEVLAGVIKLHMHGERIDWQEITEYIPGRTGHQCLHRWHK 644

Query: 126 SLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR 185
           +LDP+  K+G ++P+ED  L  AV  +G  NW      + GR  VQCRER+ N LDP L 
Sbjct: 645 TLDPS-IKKGRWSPEEDQCLINAVNAYGKGNWILIKNHVKGRTDVQCRERYCNVLDPQLT 704

Query: 186 RCQWTEEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEAR 245
           + +WT +ED RL     + G   W+ VA  + +RTDN+C RRWK+L    N +   QE  
Sbjct: 705 KIRWTPQEDKRLFDITNKVGIGKWSDVAKLMENRTDNQCWRRWKQLNKSSNVLKDYQEKV 764

Query: 246 KIQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNE 305
             +K   +SNF  R+ ER  L   D              + K  PK N KT+ +  +   
Sbjct: 765 SKKKEICVSNFSGRKHERSELTVDD----------VIEIEEKLNPKSNKKTKTL--TSTT 824

Query: 306 KSATGDAPKKRRSNNQRNQ 319
            ++T       +++N  NQ
Sbjct: 825 TTSTNPTTNNNKTDNIDNQ 830

BLAST of Spg032687 vs. ExPASy Swiss-Prot
Match: Q5SXM2 (snRNA-activating protein complex subunit 4 OS=Homo sapiens OX=9606 GN=SNAPC4 PE=1 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 1.6e-38
Identity = 85/239 (35.56%), Positives = 136/239 (56.90%), Query Frame = 0

Query: 6   RSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCL 65
           RS  E    W N E P IN+  W+  E++ L       G   W  IA  LGT+R+ FQCL
Sbjct: 275 RSAEEIRKFWQNSEHPSINKQEWSREEEERLQAIAAAHGHLEWQKIAEELGTSRSAFQCL 334

Query: 66  SRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGD---WQAVASTLEGRTGPQCSNRWKK 125
            ++Q+  N ++ ++EWT++ED  L   V    VG    ++ +   +EGR   Q   RW K
Sbjct: 335 QKFQQH-NKALKRKEWTEEEDRMLTQLVQEMRVGSHIPYRRIVYYMEGRDSMQLIYRWTK 394

Query: 126 SLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR 185
           SLDP   K+GY+ P+ED++L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL+
Sbjct: 395 SLDPG-LKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLHFSLK 454

Query: 186 RCQWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK 241
           + +W  +E+ +L   I+++G   WAK+A+ +P R+ ++C  +WK +   +  L +  R+
Sbjct: 455 KGRWNLKEEEQLIELIEKYGVGHWAKIASELPHRSGSQCLSKWKIMMGKKQGLRRRRRR 511

BLAST of Spg032687 vs. ExPASy Swiss-Prot
Match: Q8BP86 (snRNA-activating protein complex subunit 4 OS=Mus musculus OX=10090 GN=Snapc4 PE=1 SV=2)

HSP 1 Score: 157.5 bits (397), Expect = 5.0e-37
Identity = 81/226 (35.84%), Positives = 133/226 (58.85%), Query Frame = 0

Query: 6   RSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCL 65
           RS  E    W + E P I++  W+T E + L       G   W  +A  LGT+R+ FQCL
Sbjct: 275 RSAEEIRKFWQSSEHPSISKQEWSTEEVERLKAIAATHGHLEWHLVAEELGTSRSAFQCL 334

Query: 66  SRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGD---WQAVASTLEGRTGPQCSNRWKK 125
            ++Q+  N ++ ++EWT++ED  L   V    VG+   ++ +   +EGR   Q   RW K
Sbjct: 335 QKFQQ-YNKTLKRKEWTEEEDHMLTQLVQEMRVGNHIPYRKIVYFMEGRDSMQLIYRWTK 394

Query: 126 SLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR 185
           SLDP+  KRG++ P+ED++L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL+
Sbjct: 395 SLDPS-LKRGFWAPEEDAKLLQAVAKYGAQDWFKIREEVPGRSDAQCRDRYIRRLHFSLK 454

Query: 186 RCQWTEEEDLRLEIAIQEHGYS-WAKVAACVPSRTDNECRRRWKKL 228
           + +W  +E+ +L   I+++G   WA++A+ +P R+ ++C  +WK L
Sbjct: 455 KGRWNAKEEQQLIQLIEKYGVGHWARIASELPHRSGSQCLSKWKIL 498

BLAST of Spg032687 vs. ExPASy Swiss-Prot
Match: P46200 (Transcriptional activator Myb OS=Bos taurus OX=9913 GN=MYB PE=2 SV=1)

HSP 1 Score: 141.7 bits (356), Expect = 2.9e-32
Identity = 70/176 (39.77%), Positives = 102/176 (57.95%), Query Frame = 0

Query: 78  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPD 137
           K  WT++ED+KL+  V   G  DW+ +A+ L  RT  QC +RW+K L+P   K G +T +
Sbjct: 40  KTRWTREEDEKLKKLVEQNGTDDWKVIANYLPNRTDVQCQHRWQKVLNPELIK-GPWTKE 99

Query: 138 EDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIA 197
           ED R+   V  +GPK W+  A+ L GR   QCRERW N L+P +++  WTEEED  +  A
Sbjct: 100 EDQRVIELVQKYGPKRWSVIAKHLKGRIGKQCRERWHNHLNPEVKKTSWTEEEDRIIYQA 159

Query: 198 IQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF 251
            +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Sbjct: 160 HKRLGNRWAEIAKLLPGRTDNAIKNHWNSTMRRKVEQEGYLQESSKASQPAVTTSF 214

BLAST of Spg032687 vs. ExPASy Swiss-Prot
Match: P10242 (Transcriptional activator Myb OS=Homo sapiens OX=9606 GN=MYB PE=1 SV=2)

HSP 1 Score: 141.7 bits (356), Expect = 2.9e-32
Identity = 70/176 (39.77%), Positives = 102/176 (57.95%), Query Frame = 0

Query: 78  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPD 137
           K  WT++ED+KL+  V   G  DW+ +A+ L  RT  QC +RW+K L+P   K G +T +
Sbjct: 40  KTRWTREEDEKLKKLVEQNGTDDWKVIANYLPNRTDVQCQHRWQKVLNPELIK-GPWTKE 99

Query: 138 EDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIA 197
           ED R+   V  +GPK W+  A+ L GR   QCRERW N L+P +++  WTEEED  +  A
Sbjct: 100 EDQRVIELVQKYGPKRWSVIAKHLKGRIGKQCRERWHNHLNPEVKKTSWTEEEDRIIYQA 159

Query: 198 IQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF 251
            +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Sbjct: 160 HKRLGNRWAEIAKLLPGRTDNAIKNHWNSTMRRKVEQEGYLQESSKASQPAVATSF 214

BLAST of Spg032687 vs. ExPASy TrEMBL
Match: A0A6J1E6Z7 (uncharacterized protein LOC111430000 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111430000 PE=4 SV=1)

HSP 1 Score: 1027.7 bits (2656), Expect = 2.1e-296
Identity = 539/647 (83.31%), Positives = 572/647 (88.41%), Query Frame = 0

Query: 1   MYLRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRT 60
           MYL+GRSGAECE RWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRT
Sbjct: 358 MYLQGRSGAECEARWLNFEDPLINRNSWTTSEDKNLLFTIQQKGLNNWIELAVSLGTNRT 417

Query: 61  PFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRW 120
           PFQCLSRYQRSLNASILK EWTKDEDDKLRSAVAIFG GDWQAVASTLEGRTGPQCSNRW
Sbjct: 418 PFQCLSRYQRSLNASILKSEWTKDEDDKLRSAVAIFGEGDWQAVASTLEGRTGPQCSNRW 477

Query: 121 KKSLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 180
           KKSLDPARTKRGYFTPDEDSRL IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS
Sbjct: 478 KKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 537

Query: 181 LRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK 240
           LRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARK
Sbjct: 538 LRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNQVPLLQEARK 597

Query: 241 IQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNEK 300
           IQK ALISNFVDRESERPALGPTDFRP  ++ LLCNTDDP+ APKRNV+ RR+PVSRNEK
Sbjct: 598 IQKVALISNFVDRESERPALGPTDFRPVPNSHLLCNTDDPETAPKRNVRMRRMPVSRNEK 657

Query: 301 SATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTTKRKKG 360
           SA GDAPKK +SNNQRNQAD TAQV  AN +SSVP EVKS KPQRKR+R+GA+TT R+KG
Sbjct: 658 SANGDAPKKMKSNNQRNQADETAQVDFANNTSSVP-EVKSTKPQRKRTRHGAYTT-RRKG 717

Query: 361 VLEPRSNSERRVEQNLDTQSLEVQLNSKE-SERTSSDCTETVDENGMESCENKVADKLSK 420
             +   NSER  EQN DT+SLEVQLN KE +ER +SDC ETVDENGME  ENK A+  S+
Sbjct: 718 APKIGCNSERCAEQNSDTRSLEVQLNCKEPAERINSDCPETVDENGMEVFENKAAEMHSE 777

Query: 421 RDVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSIILPADTALLASTIGDDIIERNGE 480
             VCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS   P DT LLAS   DDIIE  G 
Sbjct: 778 GVVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS--TPPDTTLLASITADDIIETKGV 837

Query: 481 SVADKDLDDSNSFSLPHSCLELRATDREGVNSYSVDEDTDKGHGVCKYQGRRKKDSKTSN 540
           +VADKDLDDSNSFSLP SCLELR TD EGV+SYSVDE TDK HGVCK QGRRKK+SK SN
Sbjct: 838 NVADKDLDDSNSFSLPQSCLELRTTDSEGVDSYSVDEFTDKSHGVCKPQGRRKKNSKRSN 897

Query: 541 KSQDSLVSSRQVELERSGAKELHRHNQSKKRKHNSTNTTSLLGTSETVEEVDECTLLGFL 600
           KSQDSLVS +Q ELE SG  ELHR NQSKKRKH+ TN TS LGT E VEEVD+CTL GFL
Sbjct: 898 KSQDSLVSCQQAELEMSGMNELHRCNQSKKRKHSGTN-TSPLGTMEAVEEVDDCTLQGFL 957

Query: 601 QKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPSIASLLKDKCKRKRH 646
           QKR+KRT T HDKKVDGSSS+  EVDNDDNDP++A LLKDK KRK+H
Sbjct: 958 QKRLKRTTTTHDKKVDGSSSTPPEVDNDDNDPTLALLLKDKLKRKKH 999

BLAST of Spg032687 vs. ExPASy TrEMBL
Match: A0A6J1E2J4 (uncharacterized protein LOC111430000 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111430000 PE=4 SV=1)

HSP 1 Score: 1021.1 bits (2639), Expect = 2.0e-294
Identity = 538/647 (83.15%), Positives = 571/647 (88.25%), Query Frame = 0

Query: 1   MYLRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRT 60
           MYL+GRSGAECE RWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRT
Sbjct: 358 MYLQGRSGAECEARWLNFEDPLINRNSWTTSEDKNLLFTIQQKGLNNWIELAVSLGTNRT 417

Query: 61  PFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRW 120
           PFQCLSRYQRSLNASILK EWTKDEDDKLRSAVAIFG GDWQAVASTLEGRTGPQCSNRW
Sbjct: 418 PFQCLSRYQRSLNASILKSEWTKDEDDKLRSAVAIFGEGDWQAVASTLEGRTGPQCSNRW 477

Query: 121 KKSLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 180
           KKSLDPARTKRGYFTPDEDSRL IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS
Sbjct: 478 KKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 537

Query: 181 LRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK 240
           LRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARK
Sbjct: 538 LRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC-RRWKKLFPNQVPLLQEARK 597

Query: 241 IQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNEK 300
           IQK ALISNFVDRESERPALGPTDFRP  ++ LLCNTDDP+ APKRNV+ RR+PVSRNEK
Sbjct: 598 IQKVALISNFVDRESERPALGPTDFRPVPNSHLLCNTDDPETAPKRNVRMRRMPVSRNEK 657

Query: 301 SATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTTKRKKG 360
           SA GDAPKK +SNNQRNQAD TAQV  AN +SSVP EVKS KPQRKR+R+GA+TT R+KG
Sbjct: 658 SANGDAPKKMKSNNQRNQADETAQVDFANNTSSVP-EVKSTKPQRKRTRHGAYTT-RRKG 717

Query: 361 VLEPRSNSERRVEQNLDTQSLEVQLNSKE-SERTSSDCTETVDENGMESCENKVADKLSK 420
             +   NSER  EQN DT+SLEVQLN KE +ER +SDC ETVDENGME  ENK A+  S+
Sbjct: 718 APKIGCNSERCAEQNSDTRSLEVQLNCKEPAERINSDCPETVDENGMEVFENKAAEMHSE 777

Query: 421 RDVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSIILPADTALLASTIGDDIIERNGE 480
             VCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS   P DT LLAS   DDIIE  G 
Sbjct: 778 GVVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS--TPPDTTLLASITADDIIETKGV 837

Query: 481 SVADKDLDDSNSFSLPHSCLELRATDREGVNSYSVDEDTDKGHGVCKYQGRRKKDSKTSN 540
           +VADKDLDDSNSFSLP SCLELR TD EGV+SYSVDE TDK HGVCK QGRRKK+SK SN
Sbjct: 838 NVADKDLDDSNSFSLPQSCLELRTTDSEGVDSYSVDEFTDKSHGVCKPQGRRKKNSKRSN 897

Query: 541 KSQDSLVSSRQVELERSGAKELHRHNQSKKRKHNSTNTTSLLGTSETVEEVDECTLLGFL 600
           KSQDSLVS +Q ELE SG  ELHR NQSKKRKH+ TN TS LGT E VEEVD+CTL GFL
Sbjct: 898 KSQDSLVSCQQAELEMSGMNELHRCNQSKKRKHSGTN-TSPLGTMEAVEEVDDCTLQGFL 957

Query: 601 QKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPSIASLLKDKCKRKRH 646
           QKR+KRT T HDKKVDGSSS+  EVDNDDNDP++A LLKDK KRK+H
Sbjct: 958 QKRLKRTTTTHDKKVDGSSSTPPEVDNDDNDPTLALLLKDKLKRKKH 998

BLAST of Spg032687 vs. ExPASy TrEMBL
Match: A0A6J1JKV7 (uncharacterized protein LOC111485355 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111485355 PE=4 SV=1)

HSP 1 Score: 1018.5 bits (2632), Expect = 1.3e-293
Identity = 536/647 (82.84%), Positives = 571/647 (88.25%), Query Frame = 0

Query: 1    MYLRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRT 60
            MYLRGRSGAECE RWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWID+AVSLGTNRT
Sbjct: 359  MYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDLAVSLGTNRT 418

Query: 61   PFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRW 120
            PFQ LSRYQRSLNASILK EWTKDEDDKLRSAVAIFG GDWQAVASTLEGRTGPQCSNRW
Sbjct: 419  PFQWLSRYQRSLNASILKSEWTKDEDDKLRSAVAIFGEGDWQAVASTLEGRTGPQCSNRW 478

Query: 121  KKSLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 180
            KKSLDPARTKRGYFTPDEDSRL IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS
Sbjct: 479  KKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 538

Query: 181  LRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK 240
            LRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARK
Sbjct: 539  LRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNQVPLLQEARK 598

Query: 241  IQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNEK 300
            IQK ALISNFVDRESERPALGPTDFRP  ++ LLCNTDDP+ APKRNV+TRR+PVSRNEK
Sbjct: 599  IQKVALISNFVDRESERPALGPTDFRPVPNSHLLCNTDDPETAPKRNVRTRRMPVSRNEK 658

Query: 301  SATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTTKRKKG 360
            SA GDAPKKR+SNNQRN+ D TAQV  A+ +SSVP EVKS KPQRKR+R+GA+TT R+KG
Sbjct: 659  SANGDAPKKRKSNNQRNRVDETAQVDFASNTSSVP-EVKSTKPQRKRTRHGAYTT-RRKG 718

Query: 361  VLEPRSNSERRVEQNLDTQSLEVQLNSKE-SERTSSDCTETVDENGMESCENKVADKLSK 420
              +   NSER  EQN DT++LEVQLN KE +ER +SDC ETVDENGME  ENK A+  S+
Sbjct: 719  APKIGCNSERCAEQNSDTRNLEVQLNCKEPAERINSDCPETVDENGMEVFENKAAEMHSE 778

Query: 421  RDVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSIILPADTALLASTIGDDIIERNGE 480
              VCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS +   DT LLAS   DDIIE  G 
Sbjct: 779  GVVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSTL--PDTTLLASITADDIIETKGV 838

Query: 481  SVADKDLDDSNSFSLPHSCLELRATDREGVNSYSVDEDTDKGHGVCKYQGRRKKDSKTSN 540
            +VADKDLD SNSFSLP SCLELR TD EGV+SYSVDE TDK H VCK QGRRKK+SK SN
Sbjct: 839  NVADKDLDGSNSFSLPQSCLELRTTDSEGVDSYSVDEFTDKSHVVCKPQGRRKKNSKRSN 898

Query: 541  KSQDSLVSSRQVELERSGAKELHRHNQSKKRKHNSTNTTSLLGTSETVEEVDECTLLGFL 600
            KSQDSLVS +Q ELE SG  ELHR NQ KKRKH+STN TS LGT E VEEVD+CTLLGFL
Sbjct: 899  KSQDSLVSCQQAELEMSGTNELHRCNQLKKRKHSSTN-TSPLGTMEAVEEVDDCTLLGFL 958

Query: 601  QKRVKRTNTRHDKKVDGSSS-SLEVDNDDNDPSIASLLKDKCKRKRH 646
            QKR+KRT T H KKVDGSSS S EVDNDDNDP++A LLK+K KRK+H
Sbjct: 959  QKRLKRTTTTHGKKVDGSSSTSPEVDNDDNDPTLALLLKEKLKRKKH 1000

BLAST of Spg032687 vs. ExPASy TrEMBL
Match: A0A6J1JK98 (uncharacterized protein LOC111485355 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111485355 PE=4 SV=1)

HSP 1 Score: 1011.9 bits (2615), Expect = 1.2e-291
Identity = 535/647 (82.69%), Positives = 570/647 (88.10%), Query Frame = 0

Query: 1   MYLRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRT 60
           MYLRGRSGAECE RWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWID+AVSLGTNRT
Sbjct: 359 MYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDLAVSLGTNRT 418

Query: 61  PFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRW 120
           PFQ LSRYQRSLNASILK EWTKDEDDKLRSAVAIFG GDWQAVASTLEGRTGPQCSNRW
Sbjct: 419 PFQWLSRYQRSLNASILKSEWTKDEDDKLRSAVAIFGEGDWQAVASTLEGRTGPQCSNRW 478

Query: 121 KKSLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 180
           KKSLDPARTKRGYFTPDEDSRL IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS
Sbjct: 479 KKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 538

Query: 181 LRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK 240
           LRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARK
Sbjct: 539 LRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNEC-RRWKKLFPNQVPLLQEARK 598

Query: 241 IQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNEK 300
           IQK ALISNFVDRESERPALGPTDFRP  ++ LLCNTDDP+ APKRNV+TRR+PVSRNEK
Sbjct: 599 IQKVALISNFVDRESERPALGPTDFRPVPNSHLLCNTDDPETAPKRNVRTRRMPVSRNEK 658

Query: 301 SATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTTKRKKG 360
           SA GDAPKKR+SNNQRN+ D TAQV  A+ +SSVP EVKS KPQRKR+R+GA+TT R+KG
Sbjct: 659 SANGDAPKKRKSNNQRNRVDETAQVDFASNTSSVP-EVKSTKPQRKRTRHGAYTT-RRKG 718

Query: 361 VLEPRSNSERRVEQNLDTQSLEVQLNSKE-SERTSSDCTETVDENGMESCENKVADKLSK 420
             +   NSER  EQN DT++LEVQLN KE +ER +SDC ETVDENGME  ENK A+  S+
Sbjct: 719 APKIGCNSERCAEQNSDTRNLEVQLNCKEPAERINSDCPETVDENGMEVFENKAAEMHSE 778

Query: 421 RDVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSIILPADTALLASTIGDDIIERNGE 480
             VCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPS +   DT LLAS   DDIIE  G 
Sbjct: 779 GVVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSTL--PDTTLLASITADDIIETKGV 838

Query: 481 SVADKDLDDSNSFSLPHSCLELRATDREGVNSYSVDEDTDKGHGVCKYQGRRKKDSKTSN 540
           +VADKDLD SNSFSLP SCLELR TD EGV+SYSVDE TDK H VCK QGRRKK+SK SN
Sbjct: 839 NVADKDLDGSNSFSLPQSCLELRTTDSEGVDSYSVDEFTDKSHVVCKPQGRRKKNSKRSN 898

Query: 541 KSQDSLVSSRQVELERSGAKELHRHNQSKKRKHNSTNTTSLLGTSETVEEVDECTLLGFL 600
           KSQDSLVS +Q ELE SG  ELHR NQ KKRKH+STN TS LGT E VEEVD+CTLLGFL
Sbjct: 899 KSQDSLVSCQQAELEMSGTNELHRCNQLKKRKHSSTN-TSPLGTMEAVEEVDDCTLLGFL 958

Query: 601 QKRVKRTNTRHDKKVDGSSS-SLEVDNDDNDPSIASLLKDKCKRKRH 646
           QKR+KRT T H KKVDGSSS S EVDNDDNDP++A LLK+K KRK+H
Sbjct: 959 QKRLKRTTTTHGKKVDGSSSTSPEVDNDDNDPTLALLLKEKLKRKKH 999

BLAST of Spg032687 vs. ExPASy TrEMBL
Match: A0A0A0L2R2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G113280 PE=4 SV=1)

HSP 1 Score: 956.1 bits (2470), Expect = 7.8e-275
Identity = 506/653 (77.49%), Positives = 565/653 (86.52%), Query Frame = 0

Query: 1    MYLRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRT 60
            MYL+GRSGAECE RWLNFEDPLINR+PWTTSEDK+LLFTIQQKGLNNWI++AVSLGTNRT
Sbjct: 353  MYLQGRSGAECEARWLNFEDPLINRDPWTTSEDKSLLFTIQQKGLNNWIEMAVSLGTNRT 412

Query: 61   PFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRW 120
            PFQCLSRYQRSLNASILKREWTK+EDD+LRSAVA FGV DWQAVASTLEGR G QCSNRW
Sbjct: 413  PFQCLSRYQRSLNASILKREWTKEEDDRLRSAVATFGVRDWQAVASTLEGRAGTQCSNRW 472

Query: 121  KKSLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 180
            KKSLDPART++GYFTPDED RL IAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS
Sbjct: 473  KKSLDPARTRKGYFTPDEDIRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPS 532

Query: 181  LRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARK 240
            LRRC+WTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP+EVPLLQEARK
Sbjct: 533  LRRCEWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPDEVPLLQEARK 592

Query: 241  IQKAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNEK 300
            IQKAALISNFVDRE+ERPALGP DFRPR +TD LCNTD P PAPKRNVKTR++PVSRNEK
Sbjct: 593  IQKAALISNFVDRETERPALGPADFRPRPNTDSLCNTDGPIPAPKRNVKTRKMPVSRNEK 652

Query: 301  SATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTTKRKKG 360
            SATGDAPKKR+SN QR Q DATAQV IA  +S VPEEV+S KPQRKR+R GA+T KR  G
Sbjct: 653  SATGDAPKKRKSNYQRFQTDATAQVGIAYNTSFVPEEVQSSKPQRKRNRRGAYTAKR-IG 712

Query: 361  VLEPRSNSERRVEQNLDTQSLEVQLNSKESERTSSDCTETVDENGMESCENKVADKLSKR 420
            V E RS+SE   +QNLDT+SL +QLNSKESER++S+CTETVDEN ME  ENKVA+KL++ 
Sbjct: 713  VPELRSDSEWCAKQNLDTESLGLQLNSKESERSNSNCTETVDENIMEVLENKVAEKLTEE 772

Query: 421  DVCFSEQEENQNSTGSSGVSVLSEMTNDMDEYNPSIILPADTALLASTIGDDIIERNGES 480
            + CFSE E+NQNSTGSSGVSVLSEMTND+ +YNPSI+   DT L AST  DDI E  G+S
Sbjct: 773  NACFSEPEKNQNSTGSSGVSVLSEMTNDLVDYNPSIL--TDTTLFASTTVDDIEELKGKS 832

Query: 481  VADKDLDDSNSFSLPHSCLELRATDREGVNSYSVDEDTDKGHGVCK-YQGRRKKDSKTSN 540
             AD+DLDDSNSFSL HSCLELR  D EGV+SYSVDE T K +GVC   QGRRKK+SKTSN
Sbjct: 833  AADRDLDDSNSFSLAHSCLELRTVDSEGVDSYSVDEYTAKSNGVCNPTQGRRKKNSKTSN 892

Query: 541  KSQDSLVSSR-QVELERSGAKELHRHNQSKKRKHNSTNTTSLLGTSETVEEVDECTLLGF 600
             S D+L+  R Q+  E  G K+   HNQSKKRKH++T   S L TSE VEEVD+CTL+GF
Sbjct: 893  NSHDNLLIPRQQIVQETLGTKKPLHHNQSKKRKHSNTG-PSTLKTSEAVEEVDDCTLVGF 952

Query: 601  LQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPSIASLLKDKCKRKRHEAASG 651
            LQKR+KRT   H++ VD SS++ L+VDNDDN+P+IAS L +K KRK+H+  SG
Sbjct: 953  LQKRLKRTAMTHNETVDCSSNAPLKVDNDDNEPTIASFL-NKLKRKKHQRPSG 1000

BLAST of Spg032687 vs. TAIR 10
Match: AT3G18100.2 (myb domain protein 4r1)

HSP 1 Score: 366.3 bits (939), Expect = 5.1e-101
Identity = 193/397 (48.61%), Positives = 261/397 (65.74%), Query Frame = 0

Query: 3   LRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPF 62
           ++ RS AECE RW++ EDPLIN  PWT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPF
Sbjct: 207 IKDRSAAECEARWMSSEDPLINHGPWTAAEDKNLLRTIEQTSLTDWVDIAVSLGTNRTPF 266

Query: 63  QCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKK 122
           QCL+RYQRSLN SILK+EWT +EDD+LR+AV +FG  DWQ+VA+ L+GRTG QCSNRWKK
Sbjct: 267 QCLARYQRSLNPSILKKEWTAEEDDQLRTAVELFGEKDWQSVANVLKGRTGTQCSNRWKK 326

Query: 123 SLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR 182
           SL P  T++G ++ +ED R+ +AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + 
Sbjct: 327 SLRP--TRKGTWSLEEDKRVKVAVTLFGSQNWHKISQFVPGRTQTQCRERWLNCLDPKVN 386

Query: 183 RCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ 242
           R +WTEEED +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++Q
Sbjct: 387 RGKWTEEEDEKLREAIAEHGYSWSKVATNLSCRTDNQCLRRWKRLYPHQVALLQEARRLQ 446

Query: 243 KAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNE--- 302
           K A + NFVDRESERPAL  +      D  L    D      KR  K ++    R     
Sbjct: 447 KEASVGNFVDRESERPALVTSPILALPDISLEPEPDSVALKKKRKAKQKKSDAERQPKRR 506

Query: 303 ----KSATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTT 362
               K+ +GD  ++       N+ +   +  +  +      +  + +  ++R ++ A T 
Sbjct: 507 RKGLKNCSGDVCRQENETVCENEPNNGGEERMLALECHDEIQDNAKEKPKQRRKSVAETV 566

Query: 363 KRKKGVLEPRSNSERRVEQNLDTQSLEVQLNSKESER 393
                  EP +  E R+   L+  + E+Q N+KE  +
Sbjct: 567 CEN----EPNNGGEERM-LALECDN-EIQDNAKEKRK 595

BLAST of Spg032687 vs. TAIR 10
Match: AT3G18100.1 (myb domain protein 4r1)

HSP 1 Score: 366.3 bits (939), Expect = 5.1e-101
Identity = 193/397 (48.61%), Positives = 261/397 (65.74%), Query Frame = 0

Query: 3   LRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPF 62
           ++ RS AECE RW++ EDPLIN  PWT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPF
Sbjct: 420 IKDRSAAECEARWMSSEDPLINHGPWTAAEDKNLLRTIEQTSLTDWVDIAVSLGTNRTPF 479

Query: 63  QCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKK 122
           QCL+RYQRSLN SILK+EWT +EDD+LR+AV +FG  DWQ+VA+ L+GRTG QCSNRWKK
Sbjct: 480 QCLARYQRSLNPSILKKEWTAEEDDQLRTAVELFGEKDWQSVANVLKGRTGTQCSNRWKK 539

Query: 123 SLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR 182
           SL P  T++G ++ +ED R+ +AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + 
Sbjct: 540 SLRP--TRKGTWSLEEDKRVKVAVTLFGSQNWHKISQFVPGRTQTQCRERWLNCLDPKVN 599

Query: 183 RCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ 242
           R +WTEEED +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++Q
Sbjct: 600 RGKWTEEEDEKLREAIAEHGYSWSKVATNLSCRTDNQCLRRWKRLYPHQVALLQEARRLQ 659

Query: 243 KAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNE--- 302
           K A + NFVDRESERPAL  +      D  L    D      KR  K ++    R     
Sbjct: 660 KEASVGNFVDRESERPALVTSPILALPDISLEPEPDSVALKKKRKAKQKKSDAERQPKRR 719

Query: 303 ----KSATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTT 362
               K+ +GD  ++       N+ +   +  +  +      +  + +  ++R ++ A T 
Sbjct: 720 RKGLKNCSGDVCRQENETVCENEPNNGGEERMLALECHDEIQDNAKEKPKQRRKSVAETV 779

Query: 363 KRKKGVLEPRSNSERRVEQNLDTQSLEVQLNSKESER 393
                  EP +  E R+   L+  + E+Q N+KE  +
Sbjct: 780 CEN----EPNNGGEERM-LALECDN-EIQDNAKEKRK 808

BLAST of Spg032687 vs. TAIR 10
Match: AT3G18100.3 (myb domain protein 4r1)

HSP 1 Score: 366.3 bits (939), Expect = 5.1e-101
Identity = 193/397 (48.61%), Positives = 261/397 (65.74%), Query Frame = 0

Query: 3   LRGRSGAECETRWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPF 62
           ++ RS AECE RW++ EDPLIN  PWT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPF
Sbjct: 372 IKDRSAAECEARWMSSEDPLINHGPWTAAEDKNLLRTIEQTSLTDWVDIAVSLGTNRTPF 431

Query: 63  QCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKK 122
           QCL+RYQRSLN SILK+EWT +EDD+LR+AV +FG  DWQ+VA+ L+GRTG QCSNRWKK
Sbjct: 432 QCLARYQRSLNPSILKKEWTAEEDDQLRTAVELFGEKDWQSVANVLKGRTGTQCSNRWKK 491

Query: 123 SLDPARTKRGYFTPDEDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR 182
           SL P  T++G ++ +ED R+ +AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + 
Sbjct: 492 SLRP--TRKGTWSLEEDKRVKVAVTLFGSQNWHKISQFVPGRTQTQCRERWLNCLDPKVN 551

Query: 183 RCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQ 242
           R +WTEEED +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++Q
Sbjct: 552 RGKWTEEEDEKLREAIAEHGYSWSKVATNLSCRTDNQCLRRWKRLYPHQVALLQEARRLQ 611

Query: 243 KAALISNFVDRESERPALGPTDFRPRLDTDLLCNTDDPKPAPKRNVKTRRIPVSRNE--- 302
           K A + NFVDRESERPAL  +      D  L    D      KR  K ++    R     
Sbjct: 612 KEASVGNFVDRESERPALVTSPILALPDISLEPEPDSVALKKKRKAKQKKSDAERQPKRR 671

Query: 303 ----KSATGDAPKKRRSNNQRNQADATAQVSIANISSSVPEEVKSIKPQRKRSRNGAHTT 362
               K+ +GD  ++       N+ +   +  +  +      +  + +  ++R ++ A T 
Sbjct: 672 RKGLKNCSGDVCRQENETVCENEPNNGGEERMLALECHDEIQDNAKEKPKQRRKSVAETV 731

Query: 363 KRKKGVLEPRSNSERRVEQNLDTQSLEVQLNSKESER 393
                  EP +  E R+   L+  + E+Q N+KE  +
Sbjct: 732 CEN----EPNNGGEERM-LALECDN-EIQDNAKEKRK 760

BLAST of Spg032687 vs. TAIR 10
Match: AT3G09370.1 (myb domain protein 3r-3 )

HSP 1 Score: 128.3 bits (321), Expect = 2.3e-29
Identity = 61/147 (41.50%), Positives = 86/147 (58.50%), Query Frame = 0

Query: 78  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPD 137
           K  WT +ED+ LR AV  F    W+ +A +   RT  QC +RW+K L+P   K G +T +
Sbjct: 78  KGGWTPEEDETLRQAVDTFKGKSWKNIAKSFPDRTEVQCLHRWQKVLNPDLIK-GPWTHE 137

Query: 138 EDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIA 197
           ED ++   V  +GP  W+  A+ LPGR   QCRERW N L+P + +  WT EE++ L  A
Sbjct: 138 EDEKIVELVEKYGPAKWSIIAQSLPGRIGKQCRERWHNHLNPDINKDAWTTEEEVALMNA 197

Query: 198 IQEHGYSWAKVAACVPSRTDNECRRRW 225
            + HG  WA++A  +P RTDN  +  W
Sbjct: 198 HRSHGNKWAEIAKVLPGRTDNAIKNHW 223

BLAST of Spg032687 vs. TAIR 10
Match: AT3G09370.2 (myb domain protein 3r-3 )

HSP 1 Score: 128.3 bits (321), Expect = 2.3e-29
Identity = 61/147 (41.50%), Positives = 86/147 (58.50%), Query Frame = 0

Query: 78  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPD 137
           K  WT +ED+ LR AV  F    W+ +A +   RT  QC +RW+K L+P   K G +T +
Sbjct: 83  KGGWTPEEDETLRQAVDTFKGKSWKNIAKSFPDRTEVQCLHRWQKVLNPDLIK-GPWTHE 142

Query: 138 EDSRLTIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIA 197
           ED ++   V  +GP  W+  A+ LPGR   QCRERW N L+P + +  WT EE++ L  A
Sbjct: 143 EDEKIVELVEKYGPAKWSIIAQSLPGRIGKQCRERWHNHLNPDINKDAWTTEEEVALMNA 202

Query: 198 IQEHGYSWAKVAACVPSRTDNECRRRW 225
            + HG  WA++A  +P RTDN  +  W
Sbjct: 203 HRSHGNKWAEIAKVLPGRTDNAIKNHW 228
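
The percentages in each HSP summary line can be recomputed from the match counts as a consistency check. The following is a minimal Python sketch, assuming the summary line keeps the layout shown in the alignments above; the names `HSP_STATS`, `parse_hsp_stats` and `percent` are illustrative and not part of any BLAST tool. The pattern accepts both "Positives" and the spelling "Postives" that some report renderers emit.

```python
import re

# Matches the per-HSP summary line, e.g.
# "Identity = 61/147 (41.50%), Positives = 86/147 (58.50%), Query Frame = 0"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches, length):
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
        # Recomputed values, for comparison with the reported ones.
        "identity_pct_check": percent(int(ident), int(length)),
        "positive_pct_check": percent(int(pos), int(pos_length)),
    }


if __name__ == "__main__":
    line = "Identity = 193/397 (48.61%), Positives = 261/397 (65.74%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

For the AT3G18100 hits, 193 identical positions over an alignment length of 397 gives 48.61%, and 261 conservative-or-identical positions gives 65.74%, matching the reported values.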

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_022921860.1	4.3e-296	83.31	uncharacterized protein LOC111430000 isoform X1 [Cucurbita moschata]
XP_023515735.1	5.7e-296	82.84	uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]
KAG7023314.1	2.4e-294	83.31	Myb-like protein L [Cucurbita argyrosperma subsp. argyrosperma]
XP_022921861.1	4.1e-294	83.15	uncharacterized protein LOC111430000 isoform X2 [Cucurbita moschata]
XP_023515736.1	5.3e-294	82.69	uncharacterized protein LOC111779809 isoform X2 [Cucurbita pepo subsp. pepo]

Match Name	E-value	Identity (%)	Description
Q54NA6	1.6e-54	39.18	Myb-like protein L OS=Dictyostelium discoideum OX=44689 GN=mybL PE=3 SV=1
Q5SXM2	1.6e-38	35.56	snRNA-activating protein complex subunit 4 OS=Homo sapiens OX=9606 GN=SNAPC4 PE=...
Q8BP86	5.0e-37	35.84	snRNA-activating protein complex subunit 4 OS=Mus musculus OX=10090 GN=Snapc4 PE...
P46200	2.9e-32	39.77	Transcriptional activator Myb OS=Bos taurus OX=9913 GN=MYB PE=2 SV=1
P10242	2.9e-32	39.77	Transcriptional activator Myb OS=Homo sapiens OX=9606 GN=MYB PE=1 SV=2

Match Name	E-value	Identity (%)	Description
A0A6J1E6Z7	2.1e-296	83.31	uncharacterized protein LOC111430000 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1E2J4	2.0e-294	83.15	uncharacterized protein LOC111430000 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JKV7	1.3e-293	82.84	uncharacterized protein LOC111485355 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1JK98	1.2e-291	82.69	uncharacterized protein LOC111485355 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A0A0L2R2	7.8e-275	77.49	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G113280 PE=4 SV=1

Match Name	E-value	Identity (%)	Description
AT3G18100.2	5.1e-101	48.61	myb domain protein 4r1
AT3G18100.1	5.1e-101	48.61	myb domain protein 4r1
AT3G18100.3	5.1e-101	48.61	myb domain protein 4r1
AT3G09370.1	2.3e-29	41.50	myb domain protein 3r-3
AT3G09370.2	2.3e-29	41.50	myb domain protein 3r-3

InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001005	SANT/Myb domain	SMART	SM00717	sant	coord: 77..126; e-value: 1.8E-12; score: 57.4
IPR001005	SANT/Myb domain	SMART	SM00717	sant	coord: 182..230; e-value: 3.1E-13; score: 60.0
IPR001005	SANT/Myb domain	SMART	SM00717	sant	coord: 130..179; e-value: 2.0E-8; score: 44.0
IPR001005	SANT/Myb domain	SMART	SM00717	sant	coord: 24..74; e-value: 1.4E-5; score: 34.5
IPR001005	SANT/Myb domain	PROSITE	PS50090	MYB_LIKE	coord: 178..228; score: 11.346723
IPR001005	SANT/Myb domain	PROSITE	PS50090	MYB_LIKE	coord: 126..177; score: 8.641033
IPR001005	SANT/Myb domain	PROSITE	PS50090	MYB_LIKE	coord: 73..124; score: 10.557079
IPR001005	SANT/Myb domain	PROSITE	PS50090	MYB_LIKE	coord: 20..72; score: 9.361002
IPR001005	SANT/Myb domain	CDD	cd00167	SANT	coord: 80..124; e-value: 3.76689E-11; score: 56.4298
IPR001005	SANT/Myb domain	CDD	cd00167	SANT	coord: 186..228; e-value: 7.36044E-12; score: 58.3558
IPR001005	SANT/Myb domain	CDD	cd00167	SANT	coord: 134..175; e-value: 1.02775E-7; score: 46.7998
None	No IPR available	GENE3D	1.10.10.60		coord: 22..72; e-value: 3.0E-11; score: 44.9
None	No IPR available	GENE3D	1.10.10.60		coord: 77..127; e-value: 1.3E-16; score: 62.3
None	No IPR available	GENE3D	1.10.10.60		coord: 185..231; e-value: 9.4E-15; score: 56.5
None	No IPR available	GENE3D	1.10.10.60		coord: 128..183; e-value: 1.6E-16; score: 62.0
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 256..406
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 511..539
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 377..391
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 631..652
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 608..652
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 316..338
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 356..376
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 511..584
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 282..315
None	No IPR available	PANTHER	PTHR46621	SNRNA-ACTIVATING PROTEIN COMPLEX SUBUNIT 4	coord: 1..587
IPR017930	Myb domain	PFAM	PF00249	Myb_DNA-binding	coord: 184..227; e-value: 1.2E-15; score: 57.4
IPR017930	Myb domain	PFAM	PF00249	Myb_DNA-binding	coord: 131..177; e-value: 1.4E-8; score: 34.8
IPR017930	Myb domain	PFAM	PF00249	Myb_DNA-binding	coord: 78..122; e-value: 1.2E-11; score: 44.7
IPR017930	Myb domain	PFAM	PF00249	Myb_DNA-binding	coord: 25..70; e-value: 1.1E-8; score: 35.1
IPR017930	Myb domain	PROSITE	PS51294	HTH_MYB	coord: 73..128; score: 20.136158
IPR017930	Myb domain	PROSITE	PS51294	HTH_MYB	coord: 130..177; score: 14.403814
IPR017930	Myb domain	PROSITE	PS51294	HTH_MYB	coord: 178..232; score: 23.919092
IPR017884	SANT domain	PROSITE	PS51293	SANT	coord: 76..125; score: 9.254794
IPR009057	Homeobox-like domain superfamily	SUPERFAMILY	46689	Homeodomain-like	coord: 2..69
IPR009057	Homeobox-like domain superfamily	SUPERFAMILY	46689	Homeodomain-like	coord: 128..224
IPR009057	Homeobox-like domain superfamily	SUPERFAMILY	46689	Homeodomain-like	coord: 57..127
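
Several of the mobidb-lite disorder predictions listed above overlap or are nested within one another. As a rough illustration, the following Python sketch merges them into non-overlapping regions. It assumes the `coord` values are 1-based, inclusive residue positions; the helper name `merge_intervals` is illustrative and not part of InterProScan or MobiDB-lite.

```python
def merge_intervals(intervals):
    """Merge overlapping or nested (start, end) intervals, ends inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this one ends later.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite coordinates copied from the InterPro annotation table.
DISORDER = [
    (256, 406), (511, 539), (377, 391), (631, 652), (608, 652),
    (316, 338), (356, 376), (511, 584), (282, 315),
]

if __name__ == "__main__":
    regions = merge_intervals(DISORDER)
    total = sum(end - start + 1 for start, end in regions)
    print(regions)  # [(256, 406), (511, 584), (608, 652)]
    print(total)    # 270 residues predicted as disordered
```

Under these assumptions the nine predictions collapse to three regions, all located after the last annotated Myb repeat (which ends near residue 232).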

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Spg032687.1	Spg032687.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006357	regulation of transcription by RNA polymerase II
cellular_component	GO:0005634	nucleus
molecular_function	GO:0000981	DNA-binding transcription factor activity, RNA polymerase II-specific
molecular_function	GO:0000978	RNA polymerase II cis-regulatory region sequence-specific DNA binding