Cp4.1LG14g06030 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG14g06030
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionHeat shock transcription factor
LocationCp4.1LG14 : 621995 .. 629995 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAGGCGAGTAGCGAAAACGCTTTATGTGTTAATCTGTACTCGCGTTTGAGAGAGGGGAAAAAGGGGAAGGCAGCGAAAAGGAGAGGGGAACCTCTTCTTCCAACTCCTGTTTGGTTCTTGATTTTGTATACTCCTACCCATGGAGTCGCACGTGCAATTCAGCGAAACGAAGCCGTTTTGGACCAATGAGAAGCACATGCACTTCTTGAATTCCATGGAAGCTTCTTTTGTGCGATCAATGTTCAAAAATCGGAACCATCGCCGCCTTCTACGGCTGGACCGCCACCTGTCGGACACCGCAGACTCAACCCTAGATTCGCCTCATAAGAACAGAAAAAAGCATTCCATCTCCGGTGAGTTTAATGCTAGCGTCTTTTTGTCAGTCATTTTGCAGGTCTTTACGCCTTGTTTGGATGCACGAATATAATGAACGCAAATATAGGATTAGGGTTGGAGTAGTTCTTGGTTAACGTTAAAGACGCATTATTTTCCTATATTTGGGGTTTTTTTTAATTTTATTTTATTTTATTTTATTTTATTTTGACTCAAATGGACCAACCAACCAAACAGCGCCTAAACAACACGACTAAAGCGGGAATAGTGAATATTCCCTGTGAAGATATTTATGTGTGATTTATATATCACAATAAAAATCAATTTTCTTTTTTGTTATTTATTTATTTATTTATTTTCTTTTTAATGACTTACGGCTATGATCCAGTAGGTACCAGAATGGACGGTAGATCTAGCAGGAGATCAAGACGGATACCATCTCTACCGCACACTTCAATTCAAGATCAGGTCATTATAAATATGTCCAACTCTCTTGTCTTTATTTCCTTTTAATCTTATCTTTTTCAATCAAAATTACGCTCTTTTTGCGGCTGGTTTCAGCATTTGTTCTAATGCCACGTGGATACAAACTATTAACTTATTTAGGTTTAATTAAATTTAAATACAAATTTTTTGATATTTTAAACAATAATATAGATATTTATAGTTAAATTATTAAAATAAAATGGCATGCTTATGTGGTATGGCAGGTGGTCCCACAGATGGAAGAGAGAGCAGTTGAAGACGAAGATGAGAGAGACCATCCCATGTCACCTGTCAACTAATAAATATAATATTAATTATTATTATTATTTCAGCTGCATTTTTTTCCCATTTAGTTTCTTCATTTAATCCTCGTGTCTTTTCACTTAGTTTGTTAATCTTCAATTAATGATGGGAAATGAGTGAAAGCGTTAATTTAATTGGCTACCTTAGGTAGGTTAAATACAATAAACGAAGACGAGGGTAAAATTGTAAATAAATGGAATATTCAAAGATATATATATATTTTTTTTAATTTATGTTTTTTATTTTCCAAGATCATAAAAGACCATGTGGGGTTATATTAATAGGAAAAGGGAGTCCATCGCCAATTCTGATTGGGAATATAATATAAGATAATATAATATAAATGATAGTATAGTGAAAGGAAGATATTTTATTTGGCAATGTGGGGTTGATTTGATTATGATTGGTTTGTTTTTTCCTTTTTTTTAAAAATTTTTTCTATTGGGTACAATGGAAAGTCATAAGAGTGTATGTGGATTATGTGGGGATAATGAAAGGGATTCATGAATATAAAGTTTGTTGCCTAATCTTATCTTATATATATAGGGGCAATTGTGGAAAAAAAAAACCATAAAGACACTGGAAGTGAACCCAAATAAAGTGGACCTTGGGCCGAGCCGATTCAAGCCCAATGTTGGAGAAATTAAAAATAAACAAATAAGAAGAATGAAATAATCTACGTGGCCGCCTACGTGGAATGATTTGGATTTGGGGTAACCTTTTTAGAAGGACGGTGACAGCTTAAATGTAATTTATTTATTTTTTTAAAAAAGAGAGGCTTAGCCTTTCGAGGAAGTTTCTCAAATCGCCGCACATTCTATTTCTCGTCGCTTCCAAGATTAAAAATATCTTTTCGTTCTCTTCCACGAAGCTTCTAGAATTGCAATCCCTCCAATGCCATTGCCGGAAAATTCGCTCATAGAAGCTTCTTTAGGGCTTGGGAGCTGCCTGCGATTGTTCCGGGATCTGATTTGGGTGATTCTTGTGGCGTCTCCCATGGGAGAGCAGAGCGCCAAACCGTCGCCTAATGGAGGTGATTCACAGAGATCTATGCCCACTCCTTTCTTGACCAAAACGTATCAGCTTGTTGACGACGAGGCAATCGACCATGTCATTTCATGGAACCACGACGGATCTACCTTCATCGTTTGGAACACCATCACTTTTGCTACAGATTTGCTGCCCAAGTATTTCAAGCACAATAACTTCACTAGCTTTCTCCGCCAGCTTAATACTTATGTAATCATCAATCCTCTGCTGGATTTCAATTTCGTTGCTCTTTCCTTCCAAGCCATGGTGGATTTGTTTTTTGATTGCTGGATTTTTTTTTTCAGGGATTTAAAAAGGTTGTATCGGACCGTTGGGAGTTCGCGAATGAGTGTTTTCGCAAAGGTAAGAAACAGCTTCTTTGTGAGATTCAGCGGCGGAAGTTTCACAGTCCGGTGCCGTCCACGACTCTGCCTGCGCATCTACTAGCATTGACGGGCAATTCTTCTAGTGAAGAACAAGTGATTTCGTCTGATGAGACTCCGACGGGAGCTTTTGCGGAGCTGATCGATGAGAACGATCGGCTTAGAAAAGAGAAAGCAAAGCTTTCGGAACAATTGGTTGAGATGAAATCTCTGTGCAACAACATCTTCTCGATGATGTCGAGCTTTGTTGAATGTCAATTCAAGAGCAGTTTCAAAGTAAGAGACAGCGTTTTAACACCGGCAAAATCGCTCGATCTTTTTCCAGTGAAGCGGCCTTCCGGCGAAGACGAAGCGGGAAAGACGAATCAGATCGGCGCGGCCATCGGAGCGAAGCGGCCGAGGGAATACAGAGAGTGGGCGACGGAGATAGCGGAGGACGATACTACTTTGAGACTCCAACCACCGGATAGATCGGAAGTCAAATCAGAACGGGTAAATTGTCATAAAAAAGTTGATGATCAGAAAACGTGGCGTAATCAAGTCCACTGAGAGCCATCAAATGGATCTGTAATTAACGGCTAGGATAATAATGTATCGACTTTTCAGGTTCTTGTTCAAGTAAAATCAGAGATGCAGGGAATCATTCCATCCCCAAATCTGGAAAAAGATGGCACGTGTGCCGCCATTACAGGGGGCAAATTAAGAAAGTTTCAAAGCACATTAAGCGTTAATGAAACTTCATCACCATTTTCTCCTTCGCTTTAGATTAGATTAGCAGCAGATAAAATGTTAACACAAATGTGAACATTTTGAGCATATACTCCTAAATACTTCTTTACAGTGTTATTCCCAGCTCAAAGTAAGAACCTTTCGTGTTACCCTTTACACATCTTTGATCCCTTTAAGAAACTTCAATACCTGAAGCATGGAAGGTCTGTTAGCAGGATTTTCTGATAAGCAAACACAAGCAATCTGAAGGGTTTGAAGCATCATATGCTTGGAATCAGCATTCAGTACTGTGGCGTCAAGAACGTCTGCAGCCTGCCCCTTGTTGATCTTCTGAAACACCCAACCAACCAGGTTTCCGCCCTCAATCTCTTTAAAGTCGGGTCCTGTTGGTTCCTTCCCAGTTACCAATTCCAGTAGGATTACACCATAGCTATAAACGTCCCCCTTTGTAGTAGACCTCCCACTCTGCCCGTACTCCGGCGGGATGTAACCAAAAGTTCCAGCAATCTCAGTTGTGACATGAGTCTCACAAGCACTGATCAGTCTAGCCAACCCGAAGTCGGCGACTTTTGGTTCGAAGTCTTGGTTGAGGAGTATATTGCTTGCTTTAACATCCCTATGAATGATGTGGGGGATGAATCCATGATGAAGAAATGCCAATCCACGAGCTGCGCCTGAAGCGACTTTAAATCGAGTCTCCCAGTTAAGGATTTCGAGAGTACCGATTCGGTTTCTTAGCCAAAGATCCAAACTACCATTCACCATATATTCATAGACTAGGAGCTTCTCCTCCCCAAGAGAACAGTAGCCAAGTAGTGGTACAAGATTATTGTGCTTCACTTTGCCTATGGTTTCCATTTCAGCTATAAATTCTCTGTGCCCCTGTGTTTTTGCTTCGCTTAGTTTCTTCACGGCAACAATTTTTCCATCAGGTAATGTGGCCTTGTACACTGTCCCGAATCCTCCATCTCCAATGATGTTTGTTTTACAGAAGTTATTGGTTGCTTCGAGAATATCAGCCAAAGTTAATTTCAGAAGGGGTTGCTCGAACGTGGCTACATTGATGCTTAAAGGCTCTCTCGATCTGCTGCTGCTGCTTAAGAAATAGAGATTGGGGTCTATGAAACATTTTAATTTGCTTTCCTCCATTTCCTCTGGATCGTTCTCTCTCTGGGTTCTAATGATCCATCTCCGCATGGCGAATGTGATAGTTAGAACGATAAGAACACTAACAATGATGATCCCTGCAACGCTCCAAGCATTCAATGCTGCTGATCTCTCCAAGATTTTGATCCGGCAATTCAAACCCATGATTCTTCCACAAAGGCCTTTGTTACCCACAAGTGAACTTTTGGATAGATTCTGGCAAATGCCACTTCTCGGAATTGGCCCTTCCAGGCTGTTGTCTGCCATATTCAGGTAAACCATATTGACAAGGCTGCATATTTTCTCTGGAATTTCTCCTGAGAGCTTGTTATTTGAAACATCCAAGTATTCAAGTTGCATAAGATCCCCAAAATCTGAAGGGATTTGCCCTGTGAATTTATTTCCATGAAGATCCAAAGTTGTCAAGTATGAAAGGTTGCCCAATGTTCGTGGAAGGACACCCTCAAAATAGTTATTACTCAAATTCAAAGTTTCAACCTTCCATGTCATGGAACTTGGGAAAAGTTCAACAACCTGACCAGAAAGCCTGTTCTCCTGTACATAAAGCCCGACAAGATTCAACATGTTGGACAGAGAAGAAGGAAGATCACCATCCAACTCATTAGAACTTAAATCCAAATGAGTTAGAGCTTTCAGATCACCAAGAGTTCTTGGAATTGAACCAGATAATTTATTTCCAGTCAAGTTCAACTTTACCAAGCTACTCAAATGACTGAAGCTTTCGGGGATTGTACCCACCAGGTGATTATTCCATAGATACAGGCCTTGGAGCTTGAGAGCATTGCCAATCTCTGTGGGAATAGGACCAGTAAGCATATTGCTAGACAAATCCAAGGTTGTCAGGTTCGTTAACTGAGAGAGAGATCTAGGAATCTCTCCAGAAAGTAGATTATTATTCAGCAAAAGATCAACTACAACAATACAATTCCCCAGTTCATCAGGTATGGTACCAGACAATCTATTATGAGACAGATCGAAAACACCATGATGCTGAACAAAGCTCAAATCTGGAATAGTAACCTGTCGAAAATAAGCAGAGGGCTTGGAAGGTATGGCTCCAGATAACATGTTGTGTGACAGAACTAGACACTGTAATTCAGTAAGGTTAGCAAGTCTTTCAGGAATCGACCCTTTTAGACTATTGTTTCCAAGGTCCAACGTGGTAAGTGAAGTGCAATCTGCAAGCATGGTAGGAATAGTTCCTTCAAGCAGATTGGAATTCAAATTTAGAACTGAAAGAGCTGTGAGATTTCCAATCTCATCTGGTATACCGCCTGTCAACCTATTGCTGCTGAGAACAAGCCTCTCAAGTGAAGCTGCATAGCCAATTTCTGAAGGGAGATGACCCTCCAACAGGTTATTTGCGGCAGAAAACTCCATTAAATCCACTGAGTTCCATATATTTCTAGGTAAACAACCGGTAAAATTATTAGAGTCAAGGTTGATTACCATTAAGGAAAGGTTTGAGAAGTACTCTGGTATTGCCCCAACAATCTGATTGTCTACCAAAACCAGCTGCGTAAGGTTTCTACACAGCACAAATGTATCATCAATCGGACCCGAGAGGAAATTGCTGTCAAGATCAATCTCCATCAAGGATGCAGCATTACAAATTTCTTTAGGTATTGGACCTGTCAACAAGTTATTGCTCAAACTCAGGTGATTAAGCATCGAGCAATTTCCAATCTCAGGAGGGATTTCCCCCATGAGATGATTACTCGAGAGTAAAATAGAATCAACATGATCCCATTTGCCAAGCCAGGAAGGTAATGACCCAGAAAGCTGATTCTTCTCAGCAGAAAATGTCAACATGGGAAGCTCTGAAAGCTCTTGTGGCAACACCCCAGATAGAAAGTTGAATGAAAGCATCAATGTTTTCAAATTTCTGCACCTCCCGAGCTCAGCAGGAATAGAACCATTAAGCTGAGTGTAAACCAGATTCAATATAGTTAAGTTCTGTAACTCACCAATCGATTTCGGGATAGAACACCCAAGTGGGTTGTATGAAAGGTCCAGTTTGCTCAATGATTTCAACTTGGATAGTTCGTCAGGCAATGGACCAGTTAAAGAACAAGAAGGCGAAAAAAAGTTCTCCAGCAATACAAGGTTACCAACTTCAGGAGGCAACTCACCGGAAAAGTGGTTAATGCCGATATAAAGATCAGTCAAATGCTGTAGGTTTCCAATTTCAGGTGGGATTGAACCCGAAAACGAATTGTTTGAAATGTCCAGAGACGTTAAAGATTTAAGGTCAGTATAGATAGTCAATGGGAGTGAACCTGATAAAAGATTGTTACCGAGGTCCAATGATAAAATCCTCGTCAGGTTTCCGATGTGGGTCGGGACATTTCCGGCGAAGGCATTGCTGGAGAGGTCAAGGGTCCGTAGCAGCTTCAAATTTCCAAGCTCCGGCGGGATTTTACCTGTGAATAAATTAGTCCCCAACTTGAGATTCTCCAACCGAGTCAACTCGGTGAGCTCGATTGGGAAATCACCGGAAAACTGATTATCGCCGAGAGCGAGCACCTTCAAGCTCCGAAGATTGGATATCTGAGGTGGGATTGAGCCATAGAGAAAGTTGTTTGAAAGGTCAAGAACAGAGAGACTCGAAATGTTGAAAAGCGACTGAGAGAGTCGGCCTTTGAGCGAAAGAGATGAAAGAGAGAGCTCTGTAACTCGACCGAGCCGGCAAGAAACTCCAGTCCAAAAGCAGTGAGGAAGTGATGAGTTCCATGGCAGAATTTCGGAGGTCTCAAGCGAAGCTTTGAAAGAAATCAAGCTTTCTCTGTCGATAATAATCTCATTCTGGTGTATAATACCATTGGAGCTCAAAATGCAGAGCTCGAAGTAGCCGATGAAAATGAGAAGGAAGCGTGTCAACTCCATACCCATATGAAAGATTATGATATCTCAATATTTCAAACCACGACTGCAAAAAAAATGAAGATGGGCAACCTTAACTTCCATGGCTGTTGCAGAGAGAGATGAAAAGAAGGGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGGTTACGACCCATAGCTATAACGCAGCATTGAAAGGACTATGGCAAAAGAAGGTTGGTCAGAGAGGAGGCGCAGCTCACATGACTTGAAGGATGGTATGGCAACGGCAATTCTGTTTTTCTTCTACTTTAGGCCCTCCAAATTCCCTGATATTACTTTTGGGTATTTATTTTGAAGTCCTCCAAGTCCTCAAGAAATTTGAAAAGGGGTCCCTCATAACACTAGACCAAGTCATAAATTCAAGGTTTTGTGGAAGACAAACAAAACAAAGCTGTGTATTGGGTGATTAAAAAAGCAGTCTTTTTGTCACTTCCATTCCCTTTTCTTTGTTCTTTATTCATATTTTGATAAAAGTAATGGTTTTTAGGGTTATGGAATAGGGAATAGAGAATAAAAATGAAAGCTTTTTAGGGTTTGTGGGTTTTGCAAATGCTCCCACCATTGTTGGCTGTGATTTCAACTCTCTAAATACCTTATTCCTATATGCATTTATTCCAATCTTTTTTGTTAAATTATTGTGGAAAGAATCAAAATTGGT

mRNA sequence

GAAGGCGAGTAGCGAAAACGCTTTATGTGTTAATCTGTACTCGCGTTTGAGAGAGGGGAAAAAGGGGAAGGCAGCGAAAAGGAGAGGGGAACCTCTTCTTCCAACTCCTGTTTGGTTCTTGATTTTGTATACTCCTACCCATGGAGTCGCACGTGCAATTCAGCGAAACGAAGCCGTTTTGGACCAATGAGAAGCACATGCACTTCTTGAATTCCATGGAAGCTTCTTTTGTGCGATCAATGTTCAAAAATCGGAACCATCGCCGCCTTCTACGGCTGGACCGCCACCTGTCGGACACCGCAGACTCAACCCTAGATTCGCCTCATAAGAACAGAAAAAAGCATTCCATCTCCGGTACCAGAATGGACGGTAGATCTAGCAGGAGATCAAGACGGATACCATCTCTACCGCACACTTCAATTCAAGATCAGGTGGTCCCACAGATGGAAGAGAGAGCAGTTGAAGACGAAGATGAGAGAGACCATCCCATGTCACCTGTCAACTAATAAATATAATATTAATTATTATTATTATTTCAGCTGCATTTTTTTCCCATTTAGTTTCTTCATTTAATCCTCGTGTCTTTTCACTTAGTTTGTTAATCTTCAATTAATGATGGGAAATGAGTGAAAGCGTTAATTTAATTGGCTACCTTAGGTAGGTTAAATACAATAAACGAAGACGAGGGTAAAATTGTAAATAAATGGAATATTCAAAGATATATATATATTTTTTTTAATTTATGTTTTTTATTTTCCAAGATCATAAAAGACCATGTGGGGTTATATTAATAGGAAAAGGGAGTCCATCGCCAATTCTGATTGGGAATATAATATAAGATAATATAATATAAATGATAGTATAGTGAAAGGAAGATATTTTATTTGGCAATGTGGGGTTGATTTGATTATGATTGGTTTGTTTTTTCCTTTTTTTTAAAAATTTTTTCTATTGGGTACAATGGAAAGTCATAAGAGTGTATGTGGATTATGTGGGGATAATGAAAGGGATTCATGAATATAAAGTTTGTTGCCTAATCTTATCTTATATATATAGGGGCAATTGTGGAAAAAAAAAACCATAAAGACACTGGAAGTGAACCCAAATAAAGTGGACCTTGGGCCGAGCCGATTCAAGCCCAATGTTGGAGAAATTAAAAATAAACAAATAAGAAGAATGAAATAATCTACGTGGCCGCCTACGTGGAATGATTTGGATTTGGGGTAACCTTTTTAGAAGGACGGTGACAGCTTAAATGTAATTTATTTATTTTTTTAAAAAAGAGAGGCTTAGCCTTTCGAGGAAGTTTCTCAAATCGCCGCACATTCTATTTCTCGTCGCTTCCAAGATTAAAAATATCTTTTCGTTCTCTTCCACGAAGCTTCTAGAATTGCAATCCCTCCAATGCCATTGCCGGAAAATTCGCTCATAGAAGCTTCTTTAGGGCTTGGGAGCTGCCTGCGATTGTTCCGGGATCTGATTTGGGTGATTCTTGTGGCGTCTCCCATGGGAGAGCAGAGCGCCAAACCGTCGCCTAATGGAGGTGATTCACAGAGATCTATGCCCACTCCTTTCTTGACCAAAACGTATCAGCTTGTTGACGACGAGGCAATCGACCATGTCATTTCATGGAACCACGACGGATCTACCTTCATCGTTTGGAACACCATCACTTTTGCTACAGATTTGCTGCCCAAGTATTTCAAGCACAATAACTTCACTAGCTTTCTCCGCCAGCTTAATACTTATGGATTTAAAAAGGTTGTATCGGACCGTTGGGAGTTCGCGAATGAGTGTTTTCGCAAAGGTAAGAAACAGCTTCTTTGTGAGATTCAGCGGCGGAAGTTTCACAGTCCGGTGCCGTCCACGACTCTGCCTGCGCATCTACTAGCATTGACGGGCAATTCTTCTAGTGAAGAACAAGTGATTTCGTCTGATGAGACTCCGACGGGAGCTTTTGCGGAGCTGATCGATGAGAACGATCGGCTTAGAAAAGAGAAAGCAAAGCTTTCGGAACAATTGGTTGAGATGAAATCTCTGTGCAACAACATCTTCTCGATGATGTCGAGCTTTGTTGAATGTCAATTCAAGAGCAGTTTCAAAGTAAGAGACAGCGTTTTAACACCGGCAAAATCGCTCGATCTTTTTCCAGTGAAGCGGCCTTCCGGCGAAGACGAAGCGGGAAAGACGAATCAGATCGGCGCGGCCATCGGAGCGAAGCGGCCGAGGGAATACAGAGAGTGGGCGACGGAGATAGCGGAGGACGATACTACTTTGAGACTCCAACCACCGGATAGATCGGAAGTCAAATCAGAACGGGTAAATTGTCATAAAAAAGTTGATGATCAGAAAACGTGGCGTAATCAAGTCCACTGAGAGCCATCAAATGGATCTGTAATTAACGGCTAGGATAATAATGTATCGACTTTTCAGGTTCTTGTTCAAGTAAAATCAGAGATGCAGGGAATCATTCCATCCCCAAATCTGGAAAAAGATGGCACGTGTGCCGCCATTACAGGGGGCAAATTAAGAAAGTTTCAAAGCACATTAAGCGTTAATGAAACTTCATCACCATTTTCTCCTTCGCTTTAGATTAGATTAGCAGCAGATAAAATGTTAACACAAATGTGAACATTTTGAGCATATACTCCTAAATACTTCTTTACAGTGTTATTCCCAGCTCAAAGTAAGAACCTTTCGTGTTACCCTTTACACATCTTTGATCCCTTTAAGAAACTTCAATACCTGAAGCATGGAAGGTCTGTTAGCAGGATTTTCTGATAAGCAAACACAAGCAATCTGAAGGGTTTGAAGCATCATATGCTTGGAATCAGCATTCAGTACTGTGGCGTCAAGAACGTCTGCAGCCTGCCCCTTGTTGATCTTCTGAAACACCCAACCAACCAGGTTTCCGCCCTCAATCTCTTTAAAGTCGGGTCCTGTTGGTTCCTTCCCAGTTACCAATTCCAGTAGGATTACACCATAGCTATAAACGTCCCCCTTTGTAGTAGACCTCCCACTCTGCCCGTACTCCGGCGGGATGTAACCAAAAGTTCCAGCAATCTCAGTTGTGACATGAGTCTCACAAGCACTGATCAGTCTAGCCAACCCGAAGTCGGCGACTTTTGGTTCGAAGTCTTGGTTGAGGAGTATATTGCTTGCTTTAACATCCCTATGAATGATGTGGGGGATGAATCCATGATGAAGAAATGCCAATCCACGAGCTGCGCCTGAAGCGACTTTAAATCGAGTCTCCCAGTTAAGGATTTCGAGAGTACCGATTCGGTTTCTTAGCCAAAGATCCAAACTACCATTCACCATATATTCATAGACTAGGAGCTTCTCCTCCCCAAGAGAACAGTAGCCAAGTAGTGGTACAAGATTATTGTGCTTCACTTTGCCTATGGTTTCCATTTCAGCTATAAATTCTCTGTGCCCCTGTGTTTTTGCTTCGCTTAGTTTCTTCACGGCAACAATTTTTCCATCAGGTAATGTGGCCTTGTACACTGTCCCGAATCCTCCATCTCCAATGATGTTTGTTTTACAGAAGTTATTGGTTGCTTCGAGAATATCAGCCAAAGTTAATTTCAGAAGGGGTTGCTCGAACGTGGCTACATTGATGCTTAAAGGCTCTCTCGATCTGCTGCTGCTGCTTAAGAAATAGAGATTGGGGTCTATGAAACATTTTAATTTGCTTTCCTCCATTTCCTCTGGATCGTTCTCTCTCTGGGTTCTAATGATCCATCTCCGCATGGCGAATGTGATAGTTAGAACGATAAGAACACTAACAATGATGATCCCTGCAACGCTCCAAGCATTCAATGCTGCTGATCTCTCCAAGATTTTGATCCGGCAATTCAAACCCATGATTCTTCCACAAAGGCCTTTGTTACCCACAAGTGAACTTTTGGATAGATTCTGGCAAATGCCACTTCTCGGAATTGGCCCTTCCAGGCTGTTGTCTGCCATATTCAGGTAAACCATATTGACAAGGCTGCATATTTTCTCTGGAATTTCTCCTGAGAGCTTGTTATTTGAAACATCCAAGTATTCAAGTTGCATAAGATCCCCAAAATCTGAAGGGATTTGCCCTGTGAATTTATTTCCATGAAGATCCAAAGTTGTCAAGTATGAAAGGTTGCCCAATGTTCGTGGAAGGACACCCTCAAAATAGTTATTACTCAAATTCAAAGTTTCAACCTTCCATGTCATGGAACTTGGGAAAAGTTCAACAACCTGACCAGAAAGCCTGTTCTCCTGTACATAAAGCCCGACAAGATTCAACATGTTGGACAGAGAAGAAGGAAGATCACCATCCAACTCATTAGAACTTAAATCCAAATGAGTTAGAGCTTTCAGATCACCAAGAGTTCTTGGAATTGAACCAGATAATTTATTTCCAGTCAAGTTCAACTTTACCAAGCTACTCAAATGACTGAAGCTTTCGGGGATTGTACCCACCAGGTGATTATTCCATAGATACAGGCCTTGGAGCTTGAGAGCATTGCCAATCTCTGTGGGAATAGGACCAGTAAGCATATTGCTAGACAAATCCAAGGTTGTCAGGTTCGTTAACTGAGAGAGAGATCTAGGAATCTCTCCAGAAAGTAGATTATTATTCAGCAAAAGATCAACTACAACAATACAATTCCCCAGTTCATCAGGTATGGTACCAGACAATCTATTATGAGACAGATCGAAAACACCATGATGCTGAACAAAGCTCAAATCTGGAATAGTAACCTGTCGAAAATAAGCAGAGGGCTTGGAAGGTATGGCTCCAGATAACATGTTGTGTGACAGAACTAGACACTGTAATTCAGTAAGGTTAGCAAGTCTTTCAGGAATCGACCCTTTTAGACTATTGTTTCCAAGGTCCAACGTGGTAAGTGAAGTGCAATCTGCAAGCATGGTAGGAATAGTTCCTTCAAGCAGATTGGAATTCAAATTTAGAACTGAAAGAGCTGTGAGATTTCCAATCTCATCTGGTATACCGCCTGTCAACCTATTGCTGCTGAGAACAAGCCTCTCAAGTGAAGCTGCATAGCCAATTTCTGAAGGGAGATGACCCTCCAACAGGTTATTTGCGGCAGAAAACTCCATTAAATCCACTGAGTTCCATATATTTCTAGGTAAACAACCGGTAAAATTATTAGAGTCAAGGTTGATTACCATTAAGGAAAGGTTTGAGAAGTACTCTGGTATTGCCCCAACAATCTGATTGTCTACCAAAACCAGCTGCGTAAGGTTTCTACACAGCACAAATGTATCATCAATCGGACCCGAGAGGAAATTGCTGTCAAGATCAATCTCCATCAAGGATGCAGCATTACAAATTTCTTTAGGTATTGGACCTGTCAACAAGTTATTGCTCAAACTCAGGTGATTAAGCATCGAGCAATTTCCAATCTCAGGAGGGATTTCCCCCATGAGATGATTACTCGAGAGTAAAATAGAATCAACATGATCCCATTTGCCAAGCCAGGAAGGTAATGACCCAGAAAGCTGATTCTTCTCAGCAGAAAATGTCAACATGGGAAGCTCTGAAAGCTCTTGTGGCAACACCCCAGATAGAAAGTTGAATGAAAGCATCAATGTTTTCAAATTTCTGCACCTCCCGAGCTCAGCAGGAATAGAACCATTAAGCTGAGTGTAAACCAGATTCAATATAGTTAAGTTCTGTAACTCACCAATCGATTTCGGGATAGAACACCCAAGTGGGTTGTATGAAAGGTCCAGTTTGCTCAATGATTTCAACTTGGATAGTTCGTCAGGCAATGGACCAGTTAAAGAACAAGAAGGCGAAAAAAAGTTCTCCAGCAATACAAGGTTACCAACTTCAGGAGGCAACTCACCGGAAAAGTGGTTAATGCCGATATAAAGATCAGTCAAATGCTGTAGGTTTCCAATTTCAGGTGGGATTGAACCCGAAAACGAATTGTTTGAAATGTCCAGAGACGTTAAAGATTTAAGGTCAGTATAGATAGTCAATGGGAGTGAACCTGATAAAAGATTGTTACCGAGGTCCAATGATAAAATCCTCGTCAGGTTTCCGATGTGGGTCGGGACATTTCCGGCGAAGGCATTGCTGGAGAGGTCAAGGGTCCGTAGCAGCTTCAAATTTCCAAGCTCCGGCGGGATTTTACCTGTGAATAAATTAGTCCCCAACTTGAGATTCTCCAACCGAGTCAACTCGGTGAGCTCGATTGGGAAATCACCGGAAAACTGATTATCGCCGAGAGCGAGCACCTTCAAGCTCCGAAGATTGGATATCTGAGGTGGGATTGAGCCATAGAGAAAGTTGTTTGAAAGGTCAAGAACAGAGAGACTCGAAATGTTGAAAAGCGACTGAGAGAGTCGGCCTTTGAGCGAAAGAGATGAAAGAGAGAGCTCTGTAACTCGACCGAGCCGGCAAGAAACTCCAGTCCAAAAGCAGTGAGGAAGTGATGAGTTCCATGGCAGAATTTCGGAGGTCTCAAGCGAAGCTTTGAAAGAAATCAAGCTTTCTCTGTCGATAATAATCTCATTCTGGTGTATAATACCATTGGAGCTCAAAATGCAGAGCTCGAAGTAGCCGATGAAAATGAGAAGGAAGCGTGTCAACTCCATACCCATATGAAAGATTATGATATCTCAATATTTCAAACCACGACTGCAAAAAAAATGAAGATGGGCAACCTTAACTTCCATGGCTGTTGCAGAGAGAGATGAAAAGAAGGGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGGTTACGACCCATAGCTATAACGCAGCATTGAAAGGACTATGGCAAAAGAAGGTTGGTCAGAGAGGAGGCGCAGCTCACATGACTTGAAGGATGGTATGGCAACGGCAATTCTGTTTTTCTTCTACTTTAGGCCCTCCAAATTCCCTGATATTACTTTTGGGTATTTATTTTGAAGTCCTCCAAGTCCTCAAGAAATTTGAAAAGGGGTCCCTCATAACACTAGACCAAGTCATAAATTCAAGGTTTTGTGGAAGACAAACAAAACAAAGCTGTGTATTGGGTGATTAAAAAAGCAGTCTTTTTGTCACTTCCATTCCCTTTTCTTTGTTCTTTATTCATATTTTGATAAAAGTAATGGTTTTTAGGGTTATGGAATAGGGAATAGAGAATAAAAATGAAAGCTTTTTAGGGTTTGTGGGTTTTGCAAATGCTCCCACCATTGTTGGCTGTGATTTCAACTCTCTAAATACCTTATTCCTATATGCATTTATTCCAATCTTTTTTGTTAAATTATTGTGGAAAGAATCAAAATTGGT

Coding sequence (CDS)

ATGCCATTGCCGGAAAATTCGCTCATAGAAGCTTCTTTAGGGCTTGGGAGCTGCCTGCGATTGTTCCGGGATCTGATTTGGGTGATTCTTGTGGCGTCTCCCATGGGAGAGCAGAGCGCCAAACCGTCGCCTAATGGAGGTGATTCACAGAGATCTATGCCCACTCCTTTCTTGACCAAAACGTATCAGCTTGTTGACGACGAGGCAATCGACCATGTCATTTCATGGAACCACGACGGATCTACCTTCATCGTTTGGAACACCATCACTTTTGCTACAGATTTGCTGCCCAAGTATTTCAAGCACAATAACTTCACTAGCTTTCTCCGCCAGCTTAATACTTATGGATTTAAAAAGGTTGTATCGGACCGTTGGGAGTTCGCGAATGAGTGTTTTCGCAAAGGTAAGAAACAGCTTCTTTGTGAGATTCAGCGGCGGAAGTTTCACAGTCCGGTGCCGTCCACGACTCTGCCTGCGCATCTACTAGCATTGACGGGCAATTCTTCTAGTGAAGAACAAGTGATTTCGTCTGATGAGACTCCGACGGGAGCTTTTGCGGAGCTGATCGATGAGAACGATCGGCTTAGAAAAGAGAAAGCAAAGCTTTCGGAACAATTGGTTGAGATGAAATCTCTGTGCAACAACATCTTCTCGATGATGTCGAGCTTTGTTGAATGTCAATTCAAGAGCAGTTTCAAAGTAAGAGACAGCGTTTTAACACCGGCAAAATCGCTCGATCTTTTTCCAGTGAAGCGGCCTTCCGGCGAAGACGAAGCGGGAAAGACGAATCAGATCGGCGCGGCCATCGGAGCGAAGCGGCCGAGGGAATACAGAGAGTGGGCGACGGAGATAGCGGAGGACGATACTACTTTGAGACTCCAACCACCGGATAGATCGGAAGTCAAATCAGAACGGGTAAATTGTCATAAAAAAGTTGATGATCAGAAAACGTGGCGTAATCAAGTCCACTGA

Protein sequence

MPLPENSLIEASLGLGSCLRLFRDLIWVILVASPMGEQSAKPSPNGGDSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLPKYFKHNNFTSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFHSPVPSTTLPAHLLALTGNSSSEEQVISSDETPTGAFAELIDENDRLRKEKAKLSEQLVEMKSLCNNIFSMMSSFVECQFKSSFKVRDSVLTPAKSLDLFPVKRPSGEDEAGKTNQIGAAIGAKRPREYREWATEIAEDDTTLRLQPPDRSEVKSERVNCHKKVDDQKTWRNQVH
BLAST of Cp4.1LG14g06030 vs. Swiss-Prot
Match: HFB2B_ORYSJ (Heat stress transcription factor B-2b OS=Oryza sativa subsp. japonica GN=HSFB2B PE=2 SV=1)

HSP 1 Score: 217.2 bits (552), Expect = 2.6e-55
Identity = 118/232 (50.86%), Postives = 147/232 (63.36%), Query Frame = 1

Query: 32  ASPMGEQSAKPSPNG--------GDSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTF 91
           A+ MGE S  P            G  QR++PTPFLTKTYQLVDD A+D VISWN DGSTF
Sbjct: 16  AATMGEPSPPPPAPAAEAAGVGVGQQQRTVPTPFLTKTYQLVDDPAVDDVISWNDDGSTF 75

Query: 92  IVWNTITFATDLLPKYFKHNNFTSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEI 151
           +VW    FA DLLPKYFKHNNF+SF+RQLNTYGF+K+V DRWEFAN+CFR+G+++LLCEI
Sbjct: 76  VVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKIVPDRWEFANDCFRRGERRLLCEI 135

Query: 152 QRRKFHSPVPSTT-------LPAHLLALTGNS-----SSEEQVISSD----------ETP 211
            RRK   P P+ T       +P  L   T        S EEQVISS           + P
Sbjct: 136 HRRKVTPPAPAATTAAVAAAIPMALPVTTTRDGSPVLSGEEQVISSSSSPEPPLVLPQAP 195

Query: 212 TG------AFAELIDENDRLRKEKAKLSEQLVEMKSLCNNIFSMMSSFVECQ 228
           +G      A  ++ DEN+RLR+E A+L+ +L +M+ LCNNI  +MS +   Q
Sbjct: 196 SGSGSGGVASGDVGDENERLRRENAQLARELSQMRKLCNNILLLMSKYASTQ 247

BLAST of Cp4.1LG14g06030 vs. Swiss-Prot
Match: HFB2B_ARATH (Heat stress transcription factor B-2b OS=Arabidopsis thaliana GN=HSFB2B PE=2 SV=1)

HSP 1 Score: 211.1 bits (536), Expect = 1.8e-53
Identity = 130/329 (39.51%), Postives = 177/329 (53.80%), Query Frame = 1

Query: 46  GGDSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLPKYFKHNNF 105
           GGDSQRS+PTPFLTKTYQLV+D   D +ISWN DG+TFIVW    FA DLLPKYFKHNNF
Sbjct: 49  GGDSQRSIPTPFLTKTYQLVEDPVYDELISWNEDGTTFIVWRPAEFARDLLPKYFKHNNF 108

Query: 106 TSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFHSPVPSTTLPAHLLALT 165
           +SF+RQLNTYGF+KVV DRWEF+N+CF++G+K LL +IQRRK   P  +    A   A+ 
Sbjct: 109 SSFVRQLNTYGFRKVVPDRWEFSNDCFKRGEKILLRDIQRRKISQPAMAAAAAAAAAAVA 168

Query: 166 G-----------------NSSSEEQVISSDETPTGAFA---------------------E 225
                             ++S EEQVISS+ +P  A A                     E
Sbjct: 169 ASAVTVAAVPVVAHIVSPSNSGEEQVISSNSSPAAAAAAIGGVVGGGSLQRTTSCTTAPE 228

Query: 226 LIDENDRLRKEKAKLSEQLVEMKSLCNNIFSMMSSFVECQFKSSFKVRDSVLTPAKSLDL 285
           L++EN+RLRK+  +L +++ ++K L  NI+++M++F   Q   +      +L   K LDL
Sbjct: 229 LVEENERLRKDNERLRKEMTKLKGLYANIYTLMANFTPGQEDCA-----HLLPEGKPLDL 288

Query: 286 FPVKRP------SGEDEAG---------KTNQIGAAIGAKRPREYREWATEIAEDD---T 319
            P ++       + E E G              G +IG KR R   E      EDD    
Sbjct: 289 LPERQEMSEAIMASEIETGIGLKLGEDLTPRLFGVSIGVKRARREEELGAAEEEDDDRRE 348

BLAST of Cp4.1LG14g06030 vs. Swiss-Prot
Match: HFB2A_ARATH (Heat stress transcription factor B-2a OS=Arabidopsis thaliana GN=HSFB2A PE=2 SV=1)

HSP 1 Score: 206.1 bits (523), Expect = 5.9e-52
Identity = 121/252 (48.02%), Postives = 163/252 (64.68%), Query Frame = 1

Query: 49  SQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLPKYFKHNNFTSF 108
           SQRS+PTPFLTKT+ LV+D +ID VISWN DGS+FIVWN   FA DLLPK+FKHNNF+SF
Sbjct: 16  SQRSIPTPFLTKTFNLVEDSSIDDVISWNEDGSSFIVWNPTDFAKDLLPKHFKHNNFSSF 75

Query: 109 LRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKF---HSPV--PSTTLPAHLLA 168
           +RQLNTYGFKKVV DRWEF+N+ F++G+K+LL EIQRRK    H  V  PS+      + 
Sbjct: 76  VRQLNTYGFKKVVPDRWEFSNDFFKRGEKRLLREIQRRKITTTHQTVVAPSSEQRNQTMV 135

Query: 169 LTGNSSSEE----QVISSD-------ETPT----GAFAELIDENDRLRKEKAKLSEQLVE 228
           ++ ++S E+    QV+SS        +T T    G   EL++EN++LR +  +L+ +L +
Sbjct: 136 VSPSNSGEDNNNNQVMSSSPSSWYCHQTKTTGNGGLSVELLEENEKLRSQNIQLNRELTQ 195

Query: 229 MKSLCNNIFSMMSSFVECQ-FKSSFKVRDSVLTPAKSLDLFPVKRPS----GEDEAGKTN 276
           MKS+C+NI+S+MS++V  Q    S+    S   P   ++  P KR S     E+E     
Sbjct: 196 MKSICDNIYSLMSNYVGSQPTDRSYSPGGSSSQP---MEFLPAKRFSEMEIEEEEEASPR 255

BLAST of Cp4.1LG14g06030 vs. Swiss-Prot
Match: HFB2C_ORYSJ (Heat stress transcription factor B-2c OS=Oryza sativa subsp. japonica GN=HSFB2C PE=2 SV=1)

HSP 1 Score: 198.0 bits (502), Expect = 1.6e-49
Identity = 117/263 (44.49%), Postives = 147/263 (55.89%), Query Frame = 1

Query: 50  QRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLPKYFKHNNFTSFL 109
           QRS+PTPFLTKTYQLV+D A+D VISWN DGSTF+VW    FA DLLPKYFKHNNF+SF+
Sbjct: 32  QRSLPTPFLTKTYQLVEDPAVDDVISWNEDGSTFVVWRPAEFARDLLPKYFKHNNFSSFV 91

Query: 110 RQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFHS------PVPSTTLPAHLLA 169
           RQLNTYGF+K+V DRWEFAN+CFR+G+K+LLC+I RRK  +      P PS  +     A
Sbjct: 92  RQLNTYGFRKIVPDRWEFANDCFRRGEKRLLCDIHRRKVVAAAAAAPPPPSPGMATAAAA 151

Query: 170 LTGNS----------------------SSEEQVISSD--------------ETPTG---- 229
           +   +                      SSEEQV+SS+                P G    
Sbjct: 152 VASGAVTVAAAPIPMALPVTRAGSPAHSSEEQVLSSNSGSGEEHRQASGSGSAPGGGGGG 211

Query: 230 --AFAELIDENDRLRKEKAKLSEQLVEMKSLCNNIFSMM---------------SSFVEC 250
             +  ++ +EN+RLR+E A+L+ +L  MK LCNNI  +M               SS   C
Sbjct: 212 SASGGDMGEENERLRRENARLTRELGHMKKLCNNILLLMSKYAATQHVEGSAGISSIANC 271

BLAST of Cp4.1LG14g06030 vs. Swiss-Prot
Match: HSF24_SOLPE (Heat shock factor protein HSF24 OS=Solanum peruvianum GN=HSF24 PE=2 SV=1)

HSP 1 Score: 180.6 bits (457), Expect = 2.7e-44
Identity = 95/191 (49.74%), Postives = 126/191 (65.97%), Query Frame = 1

Query: 49  SQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLPKYFKHNNFTSF 108
           SQR+ P PFL KTYQLVDD A D VISWN  G+TF+VW T  FA DLLPKYFKHNNF+SF
Sbjct: 2   SQRTAPAPFLLKTYQLVDDAATDDVISWNEIGTTFVVWKTAEFAKDLLPKYFKHNNFSSF 61

Query: 109 LRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRK--FHSPVPSTTLPAHLLALTG 168
           +RQLNTYGF+K+V D+WEFANE F++G+K+LL  I+RRK    +P    ++ A   A   
Sbjct: 62  VRQLNTYGFRKIVPDKWEFANENFKRGQKELLTAIRRRKTVTSTPAGGKSVAAGASASPD 121

Query: 169 N----------SSSEEQVISSDETP--TGAFAELIDENDRLRKEKAKLSEQLVEMKSLCN 226
           N          SS + +   S +TP     F +L DEN++L+K+   LS +LV+ K  CN
Sbjct: 122 NSGDDIGSSSTSSPDSKNPGSVDTPGKLSQFTDLSDENEKLKKDNQMLSSELVQAKKQCN 181

BLAST of Cp4.1LG14g06030 vs. TrEMBL
Match: A0A0A0L6G8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G181940 PE=3 SV=1)

HSP 1 Score: 411.0 bits (1055), Expect = 1.3e-111
Identity = 216/293 (73.72%), Postives = 235/293 (80.20%), Query Frame = 1

Query: 35  MGEQSAKPSPNGGDSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATD 94
           M E +A P+P   DS RS+PTPFLTKTYQLVDD +IDHVISWN DGSTFIVWNT+ FA D
Sbjct: 1   MEEPTANPTPTPIDSYRSVPTPFLTKTYQLVDDRSIDHVISWNDDGSTFIVWNTMAFAKD 60

Query: 95  LLPKYFKHNNFTSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFHSPVPS 154
           LLPKYFKHNNFTSFLRQLNTYGF+KVVSDRWEFANECFRKGKKQLLCEIQRRK   PVPS
Sbjct: 61  LLPKYFKHNNFTSFLRQLNTYGFRKVVSDRWEFANECFRKGKKQLLCEIQRRKLVGPVPS 120

Query: 155 TTLPA--------------HLLALTGNSSSEEQVISSDETPTGAFAELIDENDRLRKEKA 214
           T   A               +L LTGNSS EEQVISSDETPT A AELIDENDRLR+EK 
Sbjct: 121 TASNAAVVTTVGASAIPSVQVLTLTGNSSGEEQVISSDETPTRALAELIDENDRLRREKV 180

Query: 215 KLSEQLVEMKSLCNNIFSMMSSFVECQFKSSFKVRDSVLTPAKSLDLFPVKRPSGED--- 274
           +L+EQL E+KSLCNNIFS+MSSFVE QFK+SFKVR+SVL  AKSLDLFPVKRP+GE+   
Sbjct: 181 QLTEQLDEVKSLCNNIFSLMSSFVESQFKNSFKVRESVLESAKSLDLFPVKRPAGEEGTA 240

Query: 275 EAGKTNQIGAAIGAKRPREYREWATEIAEDDTTLRLQPPDRSEVKSERVNCHK 311
           E  +  +    IGAKR REYRE ATE AEDDTTLRLQPPDR  VKSER+NC K
Sbjct: 241 EVKEEEEERNQIGAKRAREYREGATERAEDDTTLRLQPPDRWVVKSERINCQK 293

BLAST of Cp4.1LG14g06030 vs. TrEMBL
Match: W9RWR0_9ROSA (Heat stress transcription factor B-2a OS=Morus notabilis GN=L484_015761 PE=3 SV=1)

HSP 1 Score: 288.5 bits (737), Expect = 1.0e-74
Identity = 162/314 (51.59%), Postives = 205/314 (65.29%), Query Frame = 1

Query: 30  LVASPMGEQSAKPSPNGGDSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTI 89
           +   P G+ ++      G+SQRS+PTPFLTKTYQLVDD+AID VISWN DGSTFIVWN  
Sbjct: 1   MATEPAGDSAS------GESQRSLPTPFLTKTYQLVDDQAIDDVISWNDDGSTFIVWNPT 60

Query: 90  TFATDLLPKYFKHNNFTSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFH 149
            FA DLLPKYFKHNNF+SF+RQLNTYGF+KVV DRWEF+NE FR+ +K+LLCEIQRRK  
Sbjct: 61  VFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFSNEYFRRSEKRLLCEIQRRKIA 120

Query: 150 SP--------------VPSTTLPAHLLALTGNSSSEEQVISSDETPTGAFAELIDENDRL 209
           +P              V    +P  +  ++ ++S EEQVISS+ +PT   AEL+DEN+RL
Sbjct: 121 TPGATPSAAATTTTVAVTVAAIPTAMPIISPSNSGEEQVISSNSSPTRGPAELVDENERL 180

Query: 210 RKEKAKLSEQLVEMKSLCNNIFSMMSSFVECQFKSSFKVRDSVLTPAKSLDLFPVKRPSG 269
           RKE  +LS++L EMKS+C+NIFSM+S++   Q +S F+ R+S     + LDL P KR SG
Sbjct: 181 RKENLQLSKELAEMKSICSNIFSMVSNYACLQSESVFQARESGFREVEPLDLMPAKRFSG 240

Query: 270 EDEAGKTNQ------IGAAIGAKRPREYREWATEIAEDDTTLRLQPPDRSEVKSERVNCH 324
           E E     +       G  IGAKR RE      E AED T L L+ P   +VKSE ++  
Sbjct: 241 EGEDATAEEKTSPKLFGVTIGAKRAREGGN-DIEPAEDQTDLLLRQPAGVDVKSEPLDV- 300

BLAST of Cp4.1LG14g06030 vs. TrEMBL
Match: A0A0F7G5V9_9ROSI (Heat stress transcription factor B-2a-like protein (Fragment) OS=Betula luminifera GN=HsfB2a PE=2 SV=1)

HSP 1 Score: 287.0 bits (733), Expect = 2.9e-74
Identity = 163/310 (52.58%), Postives = 196/310 (63.23%), Query Frame = 1

Query: 26  IWVIL--VASPMGEQSAKPSPNGGDSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTF 85
           +W +L  V  P  EQ+ + +   GDSQRS+PTPFLTKTYQLVDD  ID VISWN DGSTF
Sbjct: 31  VWFVLGFVMVPPAEQNCEST--SGDSQRSLPTPFLTKTYQLVDDHTIDEVISWNDDGSTF 90

Query: 86  IVWNTITFATDLLPKYFKHNNFTSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEI 145
           IVWN   FA DLLPKYFKHNNF+SF+RQLNTYGF+KVV DRWEF+NE FR+G+KQLL EI
Sbjct: 91  IVWNPTVFAKDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFSNEYFRRGEKQLLREI 150

Query: 146 QRRKFHSP---------VPSTTLPAHLLALTGNSSSEEQVISSDETPTGAFAELIDENDR 205
           QRRK  SP         V    +P     ++ ++S EEQVIS++ +PT   A+L DEN+R
Sbjct: 151 QRRKITSPMATAPAPVMVAMAAIPTAKPMISPSNSGEEQVISTNSSPTTEPADLADENER 210

Query: 206 LRKEKAKLSEQLVEMKSLCNNIFSMMSSFVECQFKSSFKVRDSVLTPAKSLDLFPVKRPS 265
           LRKE  +LS++L EMKSLCN IF++MS+F                   K LDL P KR +
Sbjct: 211 LRKENVQLSKELAEMKSLCNKIFTLMSNFASSNQSE---------IAGKPLDLLPSKRLA 270

Query: 266 GEDEAGKTNQIGAAIGAKRPREYREWATEIAEDDTTLRLQPPDRSEVKSERVNCHKKVDD 324
            E+        G AIG KR RE    AT   ED+T LRL  P     KSE + C   VDD
Sbjct: 271 REETETSARLFGVAIGGKRARESEGEAT---EDETELRLHQPGAGIAKSEPLECQSNVDD 326

BLAST of Cp4.1LG14g06030 vs. TrEMBL
Match: I1MPZ9_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_16G196200 PE=3 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 3.2e-73
Identity = 165/330 (50.00%), Postives = 213/330 (64.55%), Query Frame = 1

Query: 36  GEQSAKPSPNGG-DSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATD 95
           G+ +A  S +   +SQRS+PTPFLTKTYQLVDD++ID VISWN DGSTFIVWN   FA D
Sbjct: 11  GDSAATASASASAESQRSIPTPFLTKTYQLVDDQSIDDVISWNDDGSTFIVWNPTVFARD 70

Query: 96  LLPKYFKHNNFTSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFHSPVPS 155
           LLPK+FKHNNF+SF+RQLNTYGF+KVV DRWEF+N+ FR+G+K+LLCEIQRRK  SP PS
Sbjct: 71  LLPKFFKHNNFSSFVRQLNTYGFRKVVPDRWEFSNDYFRRGEKRLLCEIQRRKISSPAPS 130

Query: 156 TTLPAHLLA---------LTGNSSSEEQVISSDETPTGAFAELIDENDRLRKEKAKLSEQ 215
            T P  +           ++ ++S EEQVISS+ +P  A AEL+DEN+RLRKE  +L+++
Sbjct: 131 PTAPTTVTVPMPLTAIPIISPSNSGEEQVISSNSSPLRAPAELLDENERLRKENVQLTKE 190

Query: 216 LVEMKSLCNNIFSMMSSFVECQFKS--SFKV----------RDSVLTPAKSLDLFPVKRP 275
           L EM+SLCNNI+S+MSS+      S  S++           R+S +T  K LDL PVKR 
Sbjct: 191 LAEMRSLCNNIYSLMSSYGNKNGNSNGSYQTDGGAGGAQGSRESGMTAVKPLDLMPVKRS 250

Query: 276 SGEDEAGKTNQ----------IGAAIGAKRPREYREWATE---------IAEDDTTLRLQ 324
           SGED A    +           G AIGAKR RE    +             E+DT LRL 
Sbjct: 251 SGEDAADTVPKEINLIPNPKLFGVAIGAKRAREGGGGSGSGRDCGGGGGGGEEDTLLRLH 310

BLAST of Cp4.1LG14g06030 vs. TrEMBL
Match: M5X1X7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009180mg PE=3 SV=1)

HSP 1 Score: 280.0 bits (715), Expect = 3.6e-72
Identity = 159/301 (52.82%), Postives = 201/301 (66.78%), Query Frame = 1

Query: 44  PNGGD------SQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLP 103
           PNGG+      SQR++PTPFLTKTYQLVDD  ID VISWN DGS+F+VWN   FA DLLP
Sbjct: 8   PNGGESTSGESSQRALPTPFLTKTYQLVDDPTIDDVISWNDDGSSFVVWNPTVFARDLLP 67

Query: 104 KYFKHNNFTSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFHSPVPSTTL 163
           KYFKHNNF+SF+RQLNTYGF+KV+ DRWEF+N+CFR+G+K+LLCEIQRR+   P PS  +
Sbjct: 68  KYFKHNNFSSFVRQLNTYGFRKVIPDRWEFSNDCFRRGEKRLLCEIQRRRIMPPAPSVAV 127

Query: 164 PAHLLA---------LTGNSSSEEQVISSDETPTGAFAELIDENDRLRKEKAKLSEQLVE 223
                A         ++ ++S EEQVISS  +P  A +EL+DEN++LRKE  +L+++L E
Sbjct: 128 SPMATAAVVPNAKPMISPSNSGEEQVISSSSSPIRAPSELMDENEKLRKENMQLTKELAE 187

Query: 224 MKSLCNNIFSMMSSFVECQFKSSFKVRDSVLTPAKSLDLFPVKRPSG----EDEAGKTNQ 283
           +KSLCNNIFSM+S++   Q +S F          K LDL P KR SG    E+E      
Sbjct: 188 VKSLCNNIFSMVSNYAYAQSESGFPY-------VKPLDLMPEKRFSGDGEKEEEEASPKL 247

Query: 284 IGAAIGAKRPREYREWATEIAEDDTTLRLQPPD-RSEVKSERVNCHKKVDDQKT-WRNQV 324
            G AIGAKR RE       + ED+T LRLQ P    +VKSE ++    +D Q+T W NQ 
Sbjct: 248 FGVAIGAKRARE--TVGDGVEEDETGLRLQQPSGGGDVKSEPID----MDRQETPWLNQR 295

BLAST of Cp4.1LG14g06030 vs. TAIR10
Match: AT4G11660.1 (AT4G11660.1 winged-helix DNA-binding transcription factor family protein)

HSP 1 Score: 211.1 bits (536), Expect = 1.0e-54
Identity = 130/329 (39.51%), Postives = 177/329 (53.80%), Query Frame = 1

Query: 46  GGDSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLPKYFKHNNF 105
           GGDSQRS+PTPFLTKTYQLV+D   D +ISWN DG+TFIVW    FA DLLPKYFKHNNF
Sbjct: 49  GGDSQRSIPTPFLTKTYQLVEDPVYDELISWNEDGTTFIVWRPAEFARDLLPKYFKHNNF 108

Query: 106 TSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFHSPVPSTTLPAHLLALT 165
           +SF+RQLNTYGF+KVV DRWEF+N+CF++G+K LL +IQRRK   P  +    A   A+ 
Sbjct: 109 SSFVRQLNTYGFRKVVPDRWEFSNDCFKRGEKILLRDIQRRKISQPAMAAAAAAAAAAVA 168

Query: 166 G-----------------NSSSEEQVISSDETPTGAFA---------------------E 225
                             ++S EEQVISS+ +P  A A                     E
Sbjct: 169 ASAVTVAAVPVVAHIVSPSNSGEEQVISSNSSPAAAAAAIGGVVGGGSLQRTTSCTTAPE 228

Query: 226 LIDENDRLRKEKAKLSEQLVEMKSLCNNIFSMMSSFVECQFKSSFKVRDSVLTPAKSLDL 285
           L++EN+RLRK+  +L +++ ++K L  NI+++M++F   Q   +      +L   K LDL
Sbjct: 229 LVEENERLRKDNERLRKEMTKLKGLYANIYTLMANFTPGQEDCA-----HLLPEGKPLDL 288

Query: 286 FPVKRP------SGEDEAG---------KTNQIGAAIGAKRPREYREWATEIAEDD---T 319
            P ++       + E E G              G +IG KR R   E      EDD    
Sbjct: 289 LPERQEMSEAIMASEIETGIGLKLGEDLTPRLFGVSIGVKRARREEELGAAEEEDDDRRE 348

BLAST of Cp4.1LG14g06030 vs. TAIR10
Match: AT5G62020.1 (AT5G62020.1 heat shock transcription factor B2A)

HSP 1 Score: 206.1 bits (523), Expect = 3.3e-53
Identity = 121/252 (48.02%), Postives = 163/252 (64.68%), Query Frame = 1

Query: 49  SQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLPKYFKHNNFTSF 108
           SQRS+PTPFLTKT+ LV+D +ID VISWN DGS+FIVWN   FA DLLPK+FKHNNF+SF
Sbjct: 16  SQRSIPTPFLTKTFNLVEDSSIDDVISWNEDGSSFIVWNPTDFAKDLLPKHFKHNNFSSF 75

Query: 109 LRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKF---HSPV--PSTTLPAHLLA 168
           +RQLNTYGFKKVV DRWEF+N+ F++G+K+LL EIQRRK    H  V  PS+      + 
Sbjct: 76  VRQLNTYGFKKVVPDRWEFSNDFFKRGEKRLLREIQRRKITTTHQTVVAPSSEQRNQTMV 135

Query: 169 LTGNSSSEE----QVISSD-------ETPT----GAFAELIDENDRLRKEKAKLSEQLVE 228
           ++ ++S E+    QV+SS        +T T    G   EL++EN++LR +  +L+ +L +
Sbjct: 136 VSPSNSGEDNNNNQVMSSSPSSWYCHQTKTTGNGGLSVELLEENEKLRSQNIQLNRELTQ 195

Query: 229 MKSLCNNIFSMMSSFVECQ-FKSSFKVRDSVLTPAKSLDLFPVKRPS----GEDEAGKTN 276
           MKS+C+NI+S+MS++V  Q    S+    S   P   ++  P KR S     E+E     
Sbjct: 196 MKSICDNIYSLMSNYVGSQPTDRSYSPGGSSSQP---MEFLPAKRFSEMEIEEEEEASPR 255

BLAST of Cp4.1LG14g06030 vs. TAIR10
Match: AT1G46264.1 (AT1G46264.1 heat shock transcription factor B4)

HSP 1 Score: 166.4 bits (420), Expect = 2.9e-41
Identity = 91/205 (44.39%), Postives = 118/205 (57.56%), Query Frame = 1

Query: 51  RSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLPKYFKHNNFTSFLR 110
           +++P PFLTKTYQLVDD A DHV+SW  D +TF+VW    FA DLLP YFKHNNF+SF+R
Sbjct: 29  KAVPAPFLTKTYQLVDDPATDHVVSWGDDDTTFVVWRPPEFARDLLPNYFKHNNFSSFVR 88

Query: 111 QLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKF-------HSPVPSTTLPAHLLA 170
           QLNTYGF+K+V DRWEFANE F++G+K LLCEI RRK        HSP  S       + 
Sbjct: 89  QLNTYGFRKIVPDRWEFANEFFKRGEKHLLCEIHRRKTSQMIPQQHSPFMSHHHAPPQIP 148

Query: 171 LTGNS----------SSEEQVISSDETP-------------TGAFAELIDENDRLRKEKA 226
            +G S          + EE     D++P                   L ++N+RLR+   
Sbjct: 149 FSGGSFFPLPPPRVTTPEEDHYWCDDSPPSRPRVIPQQIDTAAQVTALSEDNERLRRSNT 208

BLAST of Cp4.1LG14g06030 vs. TAIR10
Match: AT4G36990.1 (AT4G36990.1 heat shock factor 4)

HSP 1 Score: 159.8 bits (403), Expect = 2.7e-39
Identity = 83/189 (43.92%), Postives = 122/189 (64.55%), Query Frame = 1

Query: 49  SQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLPKYFKHNNFTSF 108
           +QRS+P PFL+KTYQLVDD + D V+SWN +G+ F+VW T  FA DLLP+YFKHNNF+SF
Sbjct: 7   AQRSVPAPFLSKTYQLVDDHSTDDVVSWNEEGTAFVVWKTAEFAKDLLPQYFKHNNFSSF 66

Query: 109 LRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFHSPVPSTTLPAHLLALTGNS 168
           +RQLNTYGF+K V D+WEFAN+ FR+G + LL +I+RRK  S + ST     ++     S
Sbjct: 67  IRQLNTYGFRKTVPDKWEFANDYFRRGGEDLLTDIRRRK--SVIASTAGKCVVVGSPSES 126

Query: 169 SS---EEQVISSDETPTGA---------FAELIDENDRLRKEKAKLSEQLVEMKSLCNNI 226
           +S   ++   SS  +P  +          A+L  EN++L++E   LS +L   K   + +
Sbjct: 127 NSGGGDDHGSSSTSSPGSSKNPGSVENMVADLSGENEKLKRENNNLSSELAAAKKQRDEL 186

BLAST of Cp4.1LG14g06030 vs. TAIR10
Match: AT2G41690.1 (AT2G41690.1 heat shock transcription factor B3)

HSP 1 Score: 141.0 bits (354), Expect = 1.3e-33
Identity = 78/182 (42.86%), Postives = 106/182 (58.24%), Query Frame = 1

Query: 54  PTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLPKYFKHNNFTSFLRQLN 113
           P PFL KTY++V+D   D VISWN  G+ F+VW    FA DLLP  FKH NF+SF+RQLN
Sbjct: 38  PPPFLVKTYKVVEDPTTDGVISWNEYGTGFVVWQPAEFARDLLPTLFKHCNFSSFVRQLN 97

Query: 114 TYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKF---------HSPVPSTTL---PAHL 173
           TYGF+KV + RWEF+NE FRKG+++L+  I+RRK          H  VP+TT+     H 
Sbjct: 98  TYGFRKVTTIRWEFSNEMFRKGQRELMSNIRRRKSQHWSHNKSNHQVVPTTTMVNQEGH- 157

Query: 174 LALTGNSSSEEQVISSDETPTGAFAELIDENDRLRKEKAKLSEQLVEMKSLCNNIFSMMS 224
               G     E   SS  + +  +  L+DEN  L+ E   LS +L + K  C  +  ++ 
Sbjct: 158 -QRIGIDHHHEDQQSSATSSSFVYTALLDENKCLKNENELLSCELGKTKKKCKQLMELVE 217

BLAST of Cp4.1LG14g06030 vs. NCBI nr
Match: gi|659077411|ref|XP_008439190.1| (PREDICTED: heat stress transcription factor B-2b-like [Cucumis melo])

HSP 1 Score: 417.9 bits (1073), Expect = 1.6e-113
Identity = 223/301 (74.09%), Postives = 242/301 (80.40%), Query Frame = 1

Query: 29  ILVASPMGEQSAKPSPNGGDSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNT 88
           +LVA  M E + K +  G DS RS+PTPFLTKTYQLVDDE+IDHVISWN DGSTFIVWNT
Sbjct: 1   MLVAPLMEEPTTKSTTTGNDSYRSVPTPFLTKTYQLVDDESIDHVISWNDDGSTFIVWNT 60

Query: 89  ITFATDLLPKYFKHNNFTSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKF 148
           + FA D LPKYFKHNNFTSFLRQLNTYGF+KVVSDRWEFANECFRKGKKQLLCEIQRRK 
Sbjct: 61  MAFAKDFLPKYFKHNNFTSFLRQLNTYGFRKVVSDRWEFANECFRKGKKQLLCEIQRRKL 120

Query: 149 HSPVPS--------TTLPA------HLLALTGNSSSEEQVISSDETPTGAFAELIDENDR 208
             PVPS        TT+ A       LL LTGNSS EEQVISSDETPT A AELIDENDR
Sbjct: 121 AGPVPSSASNAAVVTTVGASAIPSVQLLPLTGNSSGEEQVISSDETPTRALAELIDENDR 180

Query: 209 LRKEKAKLSEQLVEMKSLCNNIFSMMSSFVECQFKSSFKVRDSVLTPAKSLDLFPVKRPS 268
           LR+EK +L+EQLVE+KSLCNNIFS+MSSFVE QFKSSFKVR+SVL  AKSLDLFPVKRP+
Sbjct: 181 LRREKVQLTEQLVEVKSLCNNIFSLMSSFVESQFKSSFKVRESVLASAKSLDLFPVKRPA 240

Query: 269 GEDEAG-----KTNQIGAAIGAKRPREYREWATEIAEDDTTLRLQPPDRSEVKSERVNCH 311
           GE+E       +TNQ    IG KR REYRE ATE AE+DTTLRLQPPDR  VKSER+NC 
Sbjct: 241 GEEERAEVKEEETNQ----IGVKRAREYREGATETAENDTTLRLQPPDRWVVKSERINCQ 297

BLAST of Cp4.1LG14g06030 vs. NCBI nr
Match: gi|449446047|ref|XP_004140783.1| (PREDICTED: heat stress transcription factor B-2b-like [Cucumis sativus])

HSP 1 Score: 411.0 bits (1055), Expect = 1.9e-111
Identity = 216/293 (73.72%), Postives = 235/293 (80.20%), Query Frame = 1

Query: 35  MGEQSAKPSPNGGDSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATD 94
           M E +A P+P   DS RS+PTPFLTKTYQLVDD +IDHVISWN DGSTFIVWNT+ FA D
Sbjct: 1   MEEPTANPTPTPIDSYRSVPTPFLTKTYQLVDDRSIDHVISWNDDGSTFIVWNTMAFAKD 60

Query: 95  LLPKYFKHNNFTSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFHSPVPS 154
           LLPKYFKHNNFTSFLRQLNTYGF+KVVSDRWEFANECFRKGKKQLLCEIQRRK   PVPS
Sbjct: 61  LLPKYFKHNNFTSFLRQLNTYGFRKVVSDRWEFANECFRKGKKQLLCEIQRRKLVGPVPS 120

Query: 155 TTLPA--------------HLLALTGNSSSEEQVISSDETPTGAFAELIDENDRLRKEKA 214
           T   A               +L LTGNSS EEQVISSDETPT A AELIDENDRLR+EK 
Sbjct: 121 TASNAAVVTTVGASAIPSVQVLTLTGNSSGEEQVISSDETPTRALAELIDENDRLRREKV 180

Query: 215 KLSEQLVEMKSLCNNIFSMMSSFVECQFKSSFKVRDSVLTPAKSLDLFPVKRPSGED--- 274
           +L+EQL E+KSLCNNIFS+MSSFVE QFK+SFKVR+SVL  AKSLDLFPVKRP+GE+   
Sbjct: 181 QLTEQLDEVKSLCNNIFSLMSSFVESQFKNSFKVRESVLESAKSLDLFPVKRPAGEEGTA 240

Query: 275 EAGKTNQIGAAIGAKRPREYREWATEIAEDDTTLRLQPPDRSEVKSERVNCHK 311
           E  +  +    IGAKR REYRE ATE AEDDTTLRLQPPDR  VKSER+NC K
Sbjct: 241 EVKEEEEERNQIGAKRAREYREGATERAEDDTTLRLQPPDRWVVKSERINCQK 293

BLAST of Cp4.1LG14g06030 vs. NCBI nr
Match: gi|950925928|ref|XP_014499538.1| (PREDICTED: heat stress transcription factor B-2a-like [Vigna radiata var. radiata])

HSP 1 Score: 292.7 bits (748), Expect = 7.7e-76
Identity = 163/306 (53.27%), Postives = 211/306 (68.95%), Query Frame = 1

Query: 48  DSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTITFATDLLPKYFKHNNFTS 107
           DSQRS+PTPFLTKT+QLVDD++ID VISWN DGSTFIVWN   FA DLLPKYFKHNNF+S
Sbjct: 22  DSQRSIPTPFLTKTFQLVDDQSIDDVISWNDDGSTFIVWNPTVFARDLLPKYFKHNNFSS 81

Query: 108 FLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFHSPV----PSTTLPAHLLA 167
           F+RQLNTYGF+KVV DRWEF+N+CFR+G+KQLL EIQRRK   PV    PSTT PA +  
Sbjct: 82  FVRQLNTYGFRKVVPDRWEFSNDCFRRGEKQLLSEIQRRKISLPVASTSPSTTAPATVSV 141

Query: 168 -----------LTGNSSSEEQVISSDETPTGAFAELIDENDRLRKEKAKLSEQLVEMKSL 227
                      ++ ++S EEQVISS+ +P+ A AELIDEN+RLRKE  +L+++L EM+SL
Sbjct: 142 PSSMPLTAIPIISPSNSGEEQVISSNSSPSLAPAELIDENERLRKENVQLTKELAEMRSL 201

Query: 228 CNNIFSMMSSFVECQFKSSFKV----------RDSVLTPAKSLDLFPVKRPSGEDEAGKT 287
           CNNI+++MS++      +S++           R+S +T  + LDL P KR SGED A   
Sbjct: 202 CNNIYALMSNYANANGNASYQTDGGAGGAQGSRESGMTAVRPLDLMPTKRISGEDAAELN 261

Query: 288 NQI-GAAIGAKRPREYREWATEIA---EDDTTLRLQPPDRSEVKSERVNCHKKVDDQKT- 324
            ++ G AIGAKR RE      E     + DT LRL  P   +VKSE ++C  ++++Q+  
Sbjct: 262 PKLFGVAIGAKRAREGGGSGGEEGGGPKKDTLLRLHHPGPGDVKSEPLDCQNQLENQEAP 321

BLAST of Cp4.1LG14g06030 vs. NCBI nr
Match: gi|703141083|ref|XP_010107420.1| (Heat stress transcription factor B-2a [Morus notabilis])

HSP 1 Score: 288.5 bits (737), Expect = 1.4e-74
Identity = 162/314 (51.59%), Postives = 205/314 (65.29%), Query Frame = 1

Query: 30  LVASPMGEQSAKPSPNGGDSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTFIVWNTI 89
           +   P G+ ++      G+SQRS+PTPFLTKTYQLVDD+AID VISWN DGSTFIVWN  
Sbjct: 1   MATEPAGDSAS------GESQRSLPTPFLTKTYQLVDDQAIDDVISWNDDGSTFIVWNPT 60

Query: 90  TFATDLLPKYFKHNNFTSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEIQRRKFH 149
            FA DLLPKYFKHNNF+SF+RQLNTYGF+KVV DRWEF+NE FR+ +K+LLCEIQRRK  
Sbjct: 61  VFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFSNEYFRRSEKRLLCEIQRRKIA 120

Query: 150 SP--------------VPSTTLPAHLLALTGNSSSEEQVISSDETPTGAFAELIDENDRL 209
           +P              V    +P  +  ++ ++S EEQVISS+ +PT   AEL+DEN+RL
Sbjct: 121 TPGATPSAAATTTTVAVTVAAIPTAMPIISPSNSGEEQVISSNSSPTRGPAELVDENERL 180

Query: 210 RKEKAKLSEQLVEMKSLCNNIFSMMSSFVECQFKSSFKVRDSVLTPAKSLDLFPVKRPSG 269
           RKE  +LS++L EMKS+C+NIFSM+S++   Q +S F+ R+S     + LDL P KR SG
Sbjct: 181 RKENLQLSKELAEMKSICSNIFSMVSNYACLQSESVFQARESGFREVEPLDLMPAKRFSG 240

Query: 270 EDEAGKTNQ------IGAAIGAKRPREYREWATEIAEDDTTLRLQPPDRSEVKSERVNCH 324
           E E     +       G  IGAKR RE      E AED T L L+ P   +VKSE ++  
Sbjct: 241 EGEDATAEEKTSPKLFGVTIGAKRAREGGN-DIEPAEDQTDLLLRQPAGVDVKSEPLDV- 300

BLAST of Cp4.1LG14g06030 vs. NCBI nr
Match: gi|819321000|gb|AKG50130.1| (heat stress transcription factor B-2a-like protein, partial [Betula luminifera])

HSP 1 Score: 287.0 bits (733), Expect = 4.2e-74
Identity = 163/310 (52.58%), Postives = 196/310 (63.23%), Query Frame = 1

Query: 26  IWVIL--VASPMGEQSAKPSPNGGDSQRSMPTPFLTKTYQLVDDEAIDHVISWNHDGSTF 85
           +W +L  V  P  EQ+ + +   GDSQRS+PTPFLTKTYQLVDD  ID VISWN DGSTF
Sbjct: 31  VWFVLGFVMVPPAEQNCEST--SGDSQRSLPTPFLTKTYQLVDDHTIDEVISWNDDGSTF 90

Query: 86  IVWNTITFATDLLPKYFKHNNFTSFLRQLNTYGFKKVVSDRWEFANECFRKGKKQLLCEI 145
           IVWN   FA DLLPKYFKHNNF+SF+RQLNTYGF+KVV DRWEF+NE FR+G+KQLL EI
Sbjct: 91  IVWNPTVFAKDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFSNEYFRRGEKQLLREI 150

Query: 146 QRRKFHSP---------VPSTTLPAHLLALTGNSSSEEQVISSDETPTGAFAELIDENDR 205
           QRRK  SP         V    +P     ++ ++S EEQVIS++ +PT   A+L DEN+R
Sbjct: 151 QRRKITSPMATAPAPVMVAMAAIPTAKPMISPSNSGEEQVISTNSSPTTEPADLADENER 210

Query: 206 LRKEKAKLSEQLVEMKSLCNNIFSMMSSFVECQFKSSFKVRDSVLTPAKSLDLFPVKRPS 265
           LRKE  +LS++L EMKSLCN IF++MS+F                   K LDL P KR +
Sbjct: 211 LRKENVQLSKELAEMKSLCNKIFTLMSNFASSNQSE---------IAGKPLDLLPSKRLA 270

Query: 266 GEDEAGKTNQIGAAIGAKRPREYREWATEIAEDDTTLRLQPPDRSEVKSERVNCHKKVDD 324
            E+        G AIG KR RE    AT   ED+T LRL  P     KSE + C   VDD
Sbjct: 271 REETETSARLFGVAIGGKRARESEGEAT---EDETELRLHQPGAGIAKSEPLECQSNVDD 326

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
HFB2B_ORYSJ2.6e-5550.86Heat stress transcription factor B-2b OS=Oryza sativa subsp. japonica GN=HSFB2B ... [more]
HFB2B_ARATH1.8e-5339.51Heat stress transcription factor B-2b OS=Arabidopsis thaliana GN=HSFB2B PE=2 SV=... [more]
HFB2A_ARATH5.9e-5248.02Heat stress transcription factor B-2a OS=Arabidopsis thaliana GN=HSFB2A PE=2 SV=... [more]
HFB2C_ORYSJ1.6e-4944.49Heat stress transcription factor B-2c OS=Oryza sativa subsp. japonica GN=HSFB2C ... [more]
HSF24_SOLPE2.7e-4449.74Heat shock factor protein HSF24 OS=Solanum peruvianum GN=HSF24 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0L6G8_CUCSA1.3e-11173.72Uncharacterized protein OS=Cucumis sativus GN=Csa_3G181940 PE=3 SV=1[more]
W9RWR0_9ROSA1.0e-7451.59Heat stress transcription factor B-2a OS=Morus notabilis GN=L484_015761 PE=3 SV=... [more]
A0A0F7G5V9_9ROSI2.9e-7452.58Heat stress transcription factor B-2a-like protein (Fragment) OS=Betula luminife... [more]
I1MPZ9_SOYBN3.2e-7350.00Uncharacterized protein OS=Glycine max GN=GLYMA_16G196200 PE=3 SV=1[more]
M5X1X7_PRUPE3.6e-7252.82Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009180mg PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT4G11660.11.0e-5439.51 winged-helix DNA-binding transcription factor family protein[more]
AT5G62020.13.3e-5348.02 heat shock transcription factor B2A[more]
AT1G46264.12.9e-4144.39 heat shock transcription factor B4[more]
AT4G36990.12.7e-3943.92 heat shock factor 4[more]
AT2G41690.11.3e-3342.86 heat shock transcription factor B3[more]
Match NameE-valueIdentityDescription
gi|659077411|ref|XP_008439190.1|1.6e-11374.09PREDICTED: heat stress transcription factor B-2b-like [Cucumis melo][more]
gi|449446047|ref|XP_004140783.1|1.9e-11173.72PREDICTED: heat stress transcription factor B-2b-like [Cucumis sativus][more]
gi|950925928|ref|XP_014499538.1|7.7e-7653.27PREDICTED: heat stress transcription factor B-2a-like [Vigna radiata var. radiat... [more]
gi|703141083|ref|XP_010107420.1|1.4e-7451.59Heat stress transcription factor B-2a [Morus notabilis][more]
gi|819321000|gb|AKG50130.1|4.2e-7452.58heat stress transcription factor B-2a-like protein, partial [Betula luminifera][more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0043565sequence-specific DNA binding
GO:0003700transcription factor activity, sequence-specific DNA binding
Vocabulary: Cellular Component
TermDefinition
GO:0005634nucleus
Vocabulary: Biological Process
TermDefinition
GO:0006355regulation of transcription, DNA-templated
Vocabulary: INTERPRO
TermDefinition
IPR027725HSF_fam
IPR011991Winged helix-turn-helix DNA-binding domain
IPR000232HSF_DNA-bd
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0009408 response to heat
cellular_component GO:0005634 nucleus
cellular_component GO:0005667 transcription factor complex
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG14g06030.1Cp4.1LG14g06030.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000232Heat shock factor (HSF)-type, DNA-bindingPRINTSPR00056HSFDOMAINcoord: 95..107
score: 9.2E-19coord: 57..80
score: 9.2E-19coord: 108..120
score: 9.2
IPR000232Heat shock factor (HSF)-type, DNA-bindingPFAMPF00447HSF_DNA-bindcoord: 57..146
score: 4.5
IPR000232Heat shock factor (HSF)-type, DNA-bindingSMARTSM00415hsfneu3coord: 53..146
score: 1.1
IPR000232Heat shock factor (HSF)-type, DNA-bindingPROSITEPS00434HSF_DOMAINcoord: 96..120
scor
IPR011991Winged helix-turn-helix DNA-binding domainGENE3DG3DSA:1.10.10.10coord: 49..141
score: 8.6
IPR011991Winged helix-turn-helix DNA-binding domainunknownSSF46785"Winged helix" DNA-binding domaincoord: 54..146
score: 2.04
IPR027725Heat shock transcription factor familyPANTHERPTHR10015HEAT SHOCK TRANSCRIPTION FACTORcoord: 46..276
score: 1.1E
NoneNo IPR availableunknownCoilCoilcoord: 185..212
scor
NoneNo IPR availablePANTHERPTHR10015:SF168HEAT STRESS TRANSCRIPTION FACTOR B-2Acoord: 46..276
score: 1.1E

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG14g06030Cp4.1LG09g00900Cucurbita pepo (Zucchini)cpecpeB023
The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG14g06030Cucurbita pepo (Zucchini)cpecpeB232
Cp4.1LG14g06030Cucurbita pepo (Zucchini)cpecpeB242
Cp4.1LG14g06030Cucurbita pepo (Zucchini)cpecpeB245
Cp4.1LG14g06030Cucumber (Gy14) v1cgycpeB0673
Cp4.1LG14g06030Cucurbita maxima (Rimu)cmacpeB608
Cp4.1LG14g06030Cucurbita moschata (Rifu)cmocpeB557
Cp4.1LG14g06030Melon (DHL92) v3.6.1cpemedB225
Cp4.1LG14g06030Silver-seed gourdcarcpeB0998
Cp4.1LG14g06030Cucumber (Chinese Long) v3cpecucB0252