Cla97C04G070320 (gene) Watermelon (97103) v2

NameCla97C04G070320
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionBSD transcription factor
LocationCla97Chr04 : 11075035 .. 11089225 (+)
   



The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAACCAAGTATGTCCATAAGAGTGCCAAGTACAAGACCTCAGTTAAGGATCCTGGCATGCCTGGCGTTTTGGAAATGGTATTGGTCTAAAGTTTATCATTTTTGTGTCTGATTTTCTACCTATTTATCACCTTTGTTTTTGTTCTCTTTCAATTAATTTTTTTATGAAAACAACTTTTATTGAGAAAGAATGAAATACATATACATACAAAAAACCAGATCCTAAAAAAGGAAGCACCCTCTAAAGAAAGGGACTCCAGCTATGCACAATAGTGCCTAAAGAGTAGTTACAAAAGGTCTTCGAAATCGACGTCCACAAAGATACTTCTCTCAATTAATTACATACTTCTGATTTATTGTATTTGATGACCTAGTCTACTATTTTCTTAAGAGTATATGTATTTTATTGAAATGAGTTTGTAAGACAATATTCGAATGGTGCAAGTGGTACATGAGTGATTTGTTTTAGAGTTTTTCAGGTTAATTTTTCTAACTTTTGTTTCTGGAACATCTTCTTATGTTGTTTACTAGGAAGCATGGATGCATGTTGTACACAAATATGTTTGTAAATGCTTCAGATTCGTGTTCGACATGGACAATTTTGTGCCCCTTTTCCTTTTTTTTTTTTTTTTTTTTTCTTCTTCTTTAAAACAAAAATTTGAACTCAAAGAACATGCCATAGACATGCCATGTGCACATAAGATATTAATTGTAGTTGGGAGAGTTTCATAGTTCCTTTTTTATGGATCTCTCCTACATGTCTGTTTAATTTATATTAATGTTGTTACTTTTTATGGTTATTTTTGGTGTTTTCTTGTACATTGTATCTTGGAGCATTAGTCTTTTTTCATTTTCTTAAATAATAATAATAATAATAATAATAATAATAGTAGTAATAGTAATAATAGTAATAGTAGTAGTAATAATAATAATAGTAATAGTAGTAGTAATAATAATAATAATAATAATAATAGTAATAGTAATAGGAATAGTAATAGTAATTTCAAGTTTATTAATCCCTCTCCCCTCTCCATATTTATGATTTACACAAAAGTCTTCTTGTTCATGTATAGTTAATTATTTGAAATATTTTGATGTGCATCTGTGATTGGTTCGACCATTTAAGATCTAATTTAATTGACCACATAATTTGCTCAACTAATTTGGTTTTACTGTTTTCATTCTGTGCAGACAGAGCACAAGTTTGTATTTAGACCCAGTGATCCCACTTCAGCTTCTAAGCTTGATGTAGAGTTTAGATTTATTAAAGGTAACTACTTGATATATTTCTGGGGTTTCTACTTAATATATTTCTGCTACAATATTTATGGTATGTTCTATATTCCCGTACCTGTAGTATTTGTACTTTTGGAAAATGGCCTTTTCTTTTCATGTTGAGCACCTTCTGTAAGTAGAATTGAACTGTTTGGATGTTTGATAATTAGAACTTTTGAGTAAAGTGGTGCATTGTTGGCACAATCACAAATGAGGAATTCTTTACTCTTTGCACTCCACTATTCACTTATTATAACTCTGAGTCTCTGACTAACATTTAAGCCCATCTGTAACCTCTATTTCCTGTAACCTTATGTTTGAATAAGATCTTGTTATATCTAAACTTCCCGTATTTTTCAGGCCACAAAAACACTAAGGAAGGATCAAATAAACCACCGTGGCTTAATCTCACCAGGGACCAGGTTTCTAATATTTTGCTAGACATTATCTATGGATTTGGGTAAAGAAAAAGGGTCCAATTAAAAGGCACCATAAGTTAGATCATTGGAAATATTCCATGATTAAATCTTTGTACATATGATTTTAAAATAGAATATAATATATAAAAATTTATATGTTCTAAGAAGTCCATTATGTAATTTTCACCCTCAAGCTTATCTTCAAATTCTCTGTCAGTAGTTCTTTATTGAAAATACTTTGTAATTGATGAGGTCTCTTAAGTGATTTCATCAAGGGCCTTTGGATATCAATCTCAAACATCATTTCCACAAAGCATATTGTGAGTCTTGAATTTATCTTATGAGATGACAATTAAAGACATGTTACATTTTTCAGGGTGGAAGTTACATTTTTGAGTTCAAAAATTTCTCTGATCTTCATGTTTGCCGCGAGTTTGTAGGTAAGCCAGGATTCAAACTTATAAGCAATCTTTCTTTTTTAAGCTGTAATGCAATTTACGTCATTTATGCTTTTAGTTATATAAGAACTGTAGGTTCATTGATCTGAACGATGGTGAATAGTTGTGAATTTTGGTGATGGATTGATAAGGCCCCCTTGTTTGGTCTAATTATTCATGAAAGGCAATTAGGACAGAGGGATGAAGGGTGAATATCATAATCCCTAAACTTTATAAGTGCACCTCATAACCATCATGAGTTGGGCCTAGTGGTAAAAATGAGACTTTTTTTTTTCCCTTCTTTTCTCTTCTCTTTTTGATATATGTGAGTGTCCAGGCTAGCTTACGCGCACCTCAACTAATCTCACGGAACAATCTGCCTGACCCTACAACATTTGGGTGTCAAGGAAACCCGTAGGATATTAAATCCTAGGTAGGTGGCCACTATGGATTGAACCCTTACCCCTTGGCCCTTTGGCCTCTTTAAGATATTCCCACTACCACTAGGCCAACTTATGATGGGTTCAATCTATGGTGGTAAAAAAGAGACAGTCTCAATAAATAACTAAGAGGTCATGGGTTCAATCTATGGTTGTCACCTACCTAGGATTTAATATCCCACGAGTTTCCTTGATACCCAAATATTGTAGGGTCAGACGGCGTGTCCTATGAGATTAGTCGAGGTGCGCGTAAGCTGGTCCGGATACTCATGGATATAAAAAAAGTAAAAATAAGGGCACCTTAGACACACTACTTGGTAATAAGTTAGAGGGGTAAGACAAGGAAGTTGGGAGGTTGAATAATATGTTAAGATCAGGCAGGCAATTGCTTTGTTAAAGCTTTCTTTTGGCTTATCTGGCCTGAATAAAATCGGTGCATCTTCGACAACAAGGCGACATATTTCTCCACCTTTTTTGAACATCTAACATACATTACTTTCTTTTGGTGTAAATTTAACACATTCTCTTGTGATTATAGTCTTACATCTCTTATTTCACACTGGAACTGTATCATGTAATCATCATGGCAATTCTTGCCCTTTTGTAATTTCATTTAATCAATGAAATCTTCTTTGATGTTTCTCATAGAAAAAAGGTAGGGAATCATTTTGTAGTTTTGTCCAATTAACTTGGATTTGAAAGTGTAGGAGGATTTTTGAATATCTTCCTAGGAATTTGTACTATCTAATTGCGATATGAGGAACACTGATAACCTATTCTGTTCTCCATTGCAATCATTCTAATAATTTTCGTAATATTTCTCCATTCTTTGTCTGAGGTTGTACTTATACCCTTTTCCTGTATTTGTTGGATTTTCTAGCCTGATGGTTTGGCTTATTGTGCCTTCTCCAGGAAGTGCTTTAGCAAAGTCAGGAGAGGCTGCACAAGCTGCTCCCTCTGAGAGGCCTGTGGCGGCATTTCCTCATGAACAACTCAGTAAATCAGAAATGGAACTTCGGATGAGATGTTTGCAAGAGGATAGGTAGACTGATGACCCATATTATTGTAAACAAGAAGTCAACCTCTATTGTTTCGTGCTTAACAAGTCACGTAACATCTCTCAGCTCATCATTTATTTTGCAGTGAACTGCAAAAACTCCATAAACAATTTGTGATTGGTGGTGTGTTGACAGAATCTGAATTTTGGGCAGCAAGGAAGGTGAGGAGAGGACCTTTTTTCCGTTTATCTATTCTGCATTTGTAGCTGACCCTTTTAGAACTCTGATTGTATATTTAATTTTATCCAGTTCACAAAGTATATTCTGAAGAAATATGACATTTTCTGCAATTAATTAATACAGAAATTACTGGAACGAGACAGCTCCAAAAAATCAAAACAACTGATTGGTTTTAAGAGTTCAATGGTTTTGGATACCAAACCAATGTCTGATGGTCGGGTACAGCATCATTACCTTTTTTCCTACCAAATCAAACTTAGTTTATTAATCATTAATATCTTATTTGTCAAATCCTTTCCCCTGCTTGGTTTAATTCCACCGGATTTAACATTGTTATCTTTTTTTCTACTTTCAGACAAACAAGGTTACATTTAATTTGACACCGGAGATCAAATATCAGGCATGATTTGATTCTTTGTTCCTTCCTCACTCCCTCCCCAATTTTTTATATAATCAAGTATAGGTTGTTTCTTAACTTATTTATTTTATTCAAACAACATGTGTGAATGGGGAGAATTGAACTTTTGACCTCTTGATCGAGGGTATATGTCTTAACCAGTTGAGATGTGTTAAGGTTGATCTTTTCGTTTCTTTTGTTATTCCATTGAATTTGTACATTATGTACCTTTAGGCTTTATTTATTTATATATATATATATATTGTTACTTTTTATTTTTTTGAAAGGAAGCAAAAACTTCTCATAGTCAACCTCTAAGGTAAGGAAAGTTAAAGGTCTAGAGAAGCCTGAGGGAAGGAAGTAAAAAATTTTGGGAACTTGAGAAGCTTGATTGTTATGTTAACTATGGAGGGAAGAGCTGTCCAGTCTTATCTTATGTTGAAGATTTTATCTTGGAATGTGCAGGGGTTAAGAGGGAAGGTTAAGCGAAGTCTTGTTGAAGAAATTATTACCCAAGAGATTCCAAATTTGTTATTCTCGTTGAAACTAAGCGTCATTTGTATGATTGTCAGCTGATTAAAAGTCTTTGGGTAGTAGAAGGAAGGTCGCGTGGTTAGCCTTGATTCAGCGAGCTCCAGGGATCTTGTTAATGTGGATGGTAGTTTTTTCAGCCCTCATGAAGTGATCAAAGGTTTATTCTTGGTTATTCTTGGTTTCAATCTCCTTTACAAATTAAAAAACGAGGTTGTTTTGGATATCAGGGGTGTATGGCCCCTCTAGGATTGGATGTAGAAAGAATTTTTTGGATGCATTGGACAATCTGTACGATCTTTGTTGCCCTATATGATGTGTGGTGGGGGACTTCAACTTTTCCAAATCTTCGACTGAGAAGGATTTTGTAGGTGGAGCTACTAGGTCCATGGAACTCTTCAACGCCTTCATTGAAGAAGGTAATCTTGTTGACCGAAAATTAAATGGCAAGTTAACCTTGGCCTGTAGTAGGGTTCACAGTAGAATTGATAGATTTTTGCTGTCTAAGGAGTGGCTGAATTGTGTTGGTGAAGTAGACAAGTGTTAGGTTCGAGAGGGACTTTGGGTGGAATTTAACCTACCTAGGTGACGATGTGACATCTTCTTCCTTCTTTCATGTGTCCTAATGGGTCACCCTTTCAAAGATAAGAGAAGAACTCTTTAGTTGAACTTCAGGAGGACTTGCTCTTGGAGCCTTTGGTTTGAGTGTAATGTGGTTTCAACAAAAGGATGCTTGACACTCCGACCTTTATTGATTCCATCACTTTCTTAACATTTTCATGGTGCAGATTGTCTCCTTTTTGTAGTTACAACTTAACTACGCTCACTACACAATGGACTCTTCTAATTATTACCTCATGATAGGACCATTTGTACTCATTTCTCTACATCAGTGAAATATTTGCTTTCGATAAAATAAATAAATAATTAAACAATTCTTGTACATCCTTGGCACAACATGTACCTATCCTAAGGCTTTTTAGCGTAGCAATTCATCTCCATGGTTAATGGTTTATGGAGCTTTCAATCATATGACAAGTCCCTCACATCTATTTGACTTGTACACTTCTTCGCTAAGAAGCACAGTCACTTTAGTTCGGCAAGTATGTTTGTGTCTGACACGTGTCGGACACTTGAACAGTTGTTAAACATGTATCGGACACTTATTAGTACATCAAAATGTGTTAGATATGCATAGAACACTTGTTGAGTAGACTATAAAGGATAGATATATGACAATAATAATTACTTTTGAGCATGAAGTACATCAAACTAAGTTTTTTAAGCATATAAATGCATCAACCTATTTACTTTGAATTTTCTTTTGGTATAAAAATGATATAGATTTTAAAAAATATATATTTTAATAAATGTATCCTTCTATGTTGTATCCTAGATTTTAAAAATATGGCGTGTCACTGTATCTGTGTTGTGTCGTATTCGTGTCTCATATTTGTGTGCTTCTTAGCTTGTTAGTACCATCTCAGTTTCTTTACTTTCCATCCTTTTATTTTGTCTATCTCACTCTCCTCCTCTCTCTCGTCTCACTCCTCCTGTCTTTACTTTGTCTTCATCTCATGCTTCACTATATTAAATTCTCTCTCTTTTTGGTATGTTTTATGCCTCACCTCTCTAAGGCTCAGTTAATTTTATCTTTTCTGTGGAGAAATTATTTTTGGTATACTATGTGCCTCACCACACTCAAGCGCGCGCACTTCATTTTTATTTTTCTATCTTGCATTAGGCTAAGGACTAGTGCTTGAGGAAATCCTTTGTTTTAAAAGATAATGGTTTCAACATTGTTACTTCTTCCAGGATTCCTCCTATTTTTATTCAGTCATGTATTGGGTTTGTTATCTATTATCAAATAATTTGGTCTTTTTTTCTGTTTTCTTTTGTATAATCTGATATTGGTCCCAACATCCACACGTTAGTCAAATTGTGCAACCCTTTTATCTAGTTGAGTTATAATATGCTAACATAGTTAACCGACTTAGTGGTAAAAAAGAGACATAGTCTCAATAAATGGCTAAGAGATCATGGGTTCAATCTATGGTGGCCAACCTACCTAGGATTTAATATCCTACGAGTTTCCTTGACACTCAAATGTTGTAGGGTCAGGCAGGTTGTCTTGTGAGATTAGTCGAGGTGCACGTAAGCTAGCTCAGACATTCACAGAGATAAAAAAAAAAATGCTAACATAGTTTCAACATTCTTTTTAATATATTTTTCTTCTTTATCTTGTTCAGATTTTTGCTCTGAAACCAGCTGTTCACCAGGCCTTCCTTAATCACGTTCCCGATAAGGTACTGCCAATACTGAAAAAAGTAACTAAGTTATCCTCTTCTTTCTTGTTGCTTTTAACCATGTAGTTGATGAAGGTAATTGAATAAAGCAAATATAGAAAATGTTAATATATTTTATCATTTTCCTTATATTTAATAACAGATGTCAGAGAAAGACTTTTGGACAAAATATTTTAGAGCGGAGTACCTTCATAGTACCAAAAATTCTATTGCAGCTGCAGCAGAGGCTGCTGAAGACGAAGAACTTGCCCTTTTTCTGAAGGATGACGAGATATTGGCTGCTGAAACTCGGAAAAAGGTGGATTTTTGAAAAAATGTCAACCTCTTAGAGTTTTCTCTCCTATTAAAATACTTCGGGTGGTGTTTTTCTGATCACTTTTGTCATCCCATATTAAATTCTATTTATTTTAGACCTACCGAAATTTCAGCAACTTCGATTATCATTTGGTTCATGTTATTTCTAGATTCGGCATGTTGATCCTACATTGGATTTGGAAGCGGATCTAGGAGATGATTACACACACCTTCCAGTATGTTGGCCTTTATTTTGGTGTTTTGTTTGGATTCCTTTCTTGGTTCTTCATGGAGGATTTATCTTTATTTAAACTGTTATTATTCCTTTTGTATTCATTCTTATTTGTTCTTCTATTTTTTTGGTGAAGCTTTGTACATTTGAGGCTTTAGTCTCTTTCATTATTTCAATGAAACTTTCTTGTTTATGGTTTTAGAAAGAACAGATAGGGAAGACCTTTTAGATAATAGATCTATTTTTTGTTTTAAAGGGTTATATTTAGATTTGTGGCTTTCTTATCATACTGGTTGATGCAAATTTTCTTAAATCTAGGATCATGGAATCTTTCGTGATGGTGGCAAGGAGATAACTGAATCACAAAATGAGCACTATAAAAGGACTTTGTCACAAGACCTTAATCGTCAAGGTGCAGTTGTTCTTGAAGGCAGAACTATAGGTTAGTTTCCAACTTGCGGTAAATTATATTCTTCAGTTGCAGTTATCAGGTAAAGGTCTTAACTACTTTACAGATACTCCTTGTTGAATATATCAAGGTCAAATATACCCTAACCTATTATGGTCTGACTTATATGATAGGTAGATTACTTAGAGTGGAGATTCCAGTTTGTAGCTACTGTTTTACAAATATACCTTGCCTATCATTTTATTATTGTTCTTCCATTATTATTATTATTACCATTTCCTTGTCTGCATCATTTGTCGCTCCTCAAAAATCATTTTGTTGAAGATTTTCTCGTGTGGTCTAGGAGCTTTTGTTTCTTTTCCTTTGGGTTCCGGCGCTCTTGTTCTGATAGGGAGGCGATAGATGTGGTTGTTCTTCTTTCTTTACTCGAGGGTCACCCTTTAGAAGAGGAAGGAGGGATGTTAGAGTTTGGCGTCCCAATCCTTTGGAAGAGTTCTTGTGCAAGTCTTTCTTTTAGTGCTTGTTTGATCCTTCTCCCTTAGGAGTGTTGGTCTTTACGGTTCGTTGGAGGATTAAGATTCCTAGGAAAGTGAGGTTCTTTACTTGGCAAGTCCTTCATAGCCATGCTAACACAATGAATCAGTTTGCAAGGAGGTTTCCTTCGTTTTGGGTCTTTTTGTTGTATTCTTTGTTAGAAGGTGGAGGAAGGATTGGACCATATTTTCTGGCGCTGTGATTTTGCGAGTAGTGGTTGGGATTTATTCCTCCAAATGTTTGATATGATGTATGTTAGTCAAGAGTTATGTTCCTCCAATCGGCTTCATTCCAAAATGGAAGTTATCAAGAGTTGTGGTGGCTGGATAAAAATAAAAAAAAATCTTCCATTACCTTTTTGGAAAAAATCAGTTTTTGAAGCCATTGGCAAGCAATTAGGAGGTTTATTAGCCATTTCTTCTCAAACTCTCAATGGTCTTGATTATTCTTCAGCATTAATAAAGATTGAGGAAAATTTATGTGGGTTCATTCCAGCTATGAATATTAAAGATATTAATCTTGGTGTTTTCTCTATATATTTTTAAGATTCTGATCTTGAGGAAATCCAAAGGTCAGAATTTTCCAGTGTTGATAATTTCTCAAATTCATTAGATATTTCCCGAGCCAATTTAATTCCCAAAGATGTTGATACATTCCTGTTTTTTTGGGCAAGTCAAACGGCTCCCCCTTTAATGGAAAGGGCATTTATTGTGATGATAAAATCTTGGAGACCAATCAACAGACTTTTCCCTCCCCTGAATTCAATATTTTGTATGTCCAATGGTCAGATTCTCTCCCTTCTTTAATACCGGGTGCAGCAGCCATAATTGCTTCTCCTTCATCTGCTCCCAAGGAGAATTTAATTCCTTCCTTTTCAAAAAAAGCAAAGAGGAATTTGAAAAGTCTCGAAGACTTTTTTCAAAAGCGTTGCCTTTTCCCGAGATGGAAGGAAACTCAAGATTTTAATTGGAAAATAATCCTCCCCCGCTAGACGAGAACAATCAATTAATGCAACCTTCATTTTCCCCCGGTTTTAATGAAGTAGATTTATCTTCAGAAAATTACTATAAATAGAAAAAATATCAAACTATTTACAAATATAGAAAAATTTCACTGTCTATCAGTGATAGATCGCGATAGACTTCTATTACTTAAGTGATAGAAGTCTAGAAAGCGATAGAAGTCTATCATAATCTATCACAATCGATCATTGATAGATAGTGAGATTTTTCTATATTTGTAAATAGTTTGACTCATTCGCTATATTTGAAAAAAGCCCTTTATCTTTAGCATTTAAATTGGTGGCAAATAGCCGTAAAATTTCGGTGATGGAGAAGGATAGGTCAAATTCTCAAGCTCTCAAAACTCCTCATCAAAAATTAAAAAGACAACCATGTTGTCACTAGAACGATTATTCCAATTCCAAATGGAAAATTCAATTTAACGAGAGAAGTTCCATCCACATTCCAAGTCCAAAAAGTCAAGTTTTAGAGGAGGATTTTGATGACGAATCTTCAGCAAGTGTTAGTAGTGAAGAACCATAAATTTTGGCTCCAAATTCGGCCTCAGAACAGGAAGATATTATTTTCGGGAAAGAGGTTGTAGATCTATTTCGAACTCCTAGCTTCAAACAGTCACCATAGATGCTTTCTTGTAAGTTCTCCTTAATTAAGTCTCATGAAATCCCTTCAAAGTTTTCCTCCTTAATTGAAGCTTACTGTTTGAAGTTTGGAAGGGTGATTCATCACAATTCACTTCAATTTTTCTGAAATGGGATTCTTAGTTTTTTCTATATCCAGAAAAGTTATGTGGTTGTTCTTCTGAAGCTACGGGTTTTGATTAAATTGGTTTGAAGTGTTTGGTTTTGTGGTTGGAGTTTGATTCATGTCCCAGCTAATATGGAGCTTTTCTTCTTCAAAGCTTATTTGGCCTCATTCTTCTCTTTTAACTGCTGGTCGCTAGAGTATTCCTTGTTCTATCAAGCTCAAAGTTTAGGGTTCTTACCTTATCAGTATTCGGGTTGTTGCAGCAATTGTTCTCTTCTTCTTTGGGTTTGATTTTTCATCAATCATGCTTGATGGGCTCTTTGTTAGTCTCAAGTCCAACTTGTGCAAGATATTGTGGTGGAAGCTTTTCTTCTATTTGTTTCAGGCTTGTTCGAGTTTAAGTCTTCTGAAATTTGGATTTCTCGAGTTTTATGTTTAATTGGACTATTTTACTTTTTCTTTAGAGTTTTAGTCTTTATGATGTAATTTGGTTCTAGCCTTGTTTTGGCATGTTTATAGTTTTATCTTTGTTTCCTCTCATGTACTTTGAGCATTAGACTTATTCATCATATCAATGAATAGTCTTGTTTCTGTTTAAAAAAAATGATGTATGTTAGTCACAAAGATGCTAGTGTTATGATCGAGGAGTTCCTCTTCGATTTGCCATTTGGGAAGGAGGGGTGTTTTTTTTTATAGATATAGGGGGTGTAGGATCGGTATGACAACCCTAAAGGGGAGGGGGTGAATAGGGTTTATTAATTAATTGGAACTTTTTTCCTAAGTGGGCCCAATTAAACAATTTAATATACTTTTTGCTAATAAATTATTTAAACAATAATGTTATACAAATAAATTAAACTCAATAATCTAAAAATTAATAATAAAACTTCTAAACAACCTATTTGATTCTAATAACAAGATTGGTAAATGATATAGTTTAAGAACTACCCATCAAAATAAACCAATACAAGCAATTAATTTCAAGGAAAAAAGTTGCAATTAATTTCAAGCATATGAATTAATTGCAAAATTAAATATGAAAGGGATAGAGAAATTGACATTGGAAATTTTATAGTGTTTGGCACAACCTGATCTACATCCACTTCCCTAGCTCCTCTTAGATATGTCACCAGAAACCTTTGACTCTTTCCATGGTTTAGAGTCAAATTGCTACAATGTCTTTTTTTGGGGGCAAGATCAAATCTGATCCTTTCCATGATCGTGGATCAAACCGTTACAACACTCTTGTTGTGGGTTCAAGGGTAACCCCTTATAATAGGTTTAGAAATGAAGGACAACTTGACAAACTTTCTTAAAGAGTAGATATACAAATTTTTAGCTCACAACAAATCAATCTTGACACTATATAAAAATCTCTCTCAAGAATAAGATGAAAAAAAGAAAACGGAAGCTTGGAGAGAGCAACAATGGAAGCTTTGTGTTTTTGGAGAGGGTTGAAAATTATAAAAATTGTTGTGAAAATGTGGAGAAGATGAAGAGTTTCAAAAGAGAGGAAAGTTGTTGGAAACCACATTAAATTTGGACGATTGGATGTTGTTAAAATTCAGATTTAAAAGTAGTAGTAATAATAATACTAATGACAAAAAGCTTTAAATGGTACTATGCATTAAATACATTTTTAAGAAAAATAAACTAGCCGTTAGACAGACATATAAAATAATATAATAATATATATATTTTAAATCATTTCTTTAAAAACCCAACAAAAACTCAAAAGGCCACCACGTGTCACTCCTCCATTGCTCCAAGTGGCATCTTTCAATTTGTCCACTTTTATTAAAAAAAATTGTGATGTCATCCGTTTGAAATTTTTTCGTTAAGCTTGTTTGGAATATATCTTCTTTGTTTGAGTTCGTATTTGAGTAATTTAAATTGCGTTGGAATTATTATTTTGAGCTCTATGCAATGGACACTTCAAAACACTACAATTGTTAGTGAATAAAAGCAAGTTTGTTATCATCAAACTATTAATTAATTTAAATTAATCCATTTGAGATTAAGGGCCAAAAATCCGGTGGGGGTGCGATTTTGGGGTTCTTGTGGGGTGGTTAGGGGTGTGGAAATGGACCCCAAGGAGACTTGGTCCCTTGCTCCCTTCCATGTCATTTTGTGGGCTTCGATTTTGAATATCTTTTGTAACTATTCTATAGGCACTATTCTCAAAGTTGGAGTCCCTTCTTGTGAGGGAGCTCCCCTATTTTCTGTGGCTCATTTCTATATGCCCGTGTATTGTTTCATTTTTTTCCCTATGAAAGTTCTCATTTAAAAAAAATATATATACCAAAATGCTATTCAGCGAGCCCTTCCAAACTTGAAAATTGCTTGAAAAGCCAAAACTTCGTTAAAGCAAAACAGTTAGTCAAAGTCTCAAGATTATGAGAATTCCCAAAATTAAGTTCAGCCCTAAACAAGCCATCCAGTTAGACTTTAACTGATGATGAACGAGTTGGGATTGATTTCACAAGAATAATGGGTGACTTATCTGCTTGAATCTTTTTTGACATTTTGAAATAAGGTGTAAAAACTATCTGTGAAATAAACCTTTGAATTATGATCAACATATTCTAGGTGTGCCTAAAAATCCACAAGTTCCATATACTTGCACTAACCACTGATTCTCCATCTGAATCATAAATAGAACAGAATGGTTATCCTTAGGAGGTGAATGGCTAGACAATTAGGAGGTTCCTTTTGGGACCTCTTTTGCTTTGTACACTTTCTTTGACTTCGTAGAACTTTTTTCATCCTAAGAAACTGATTGTTTAATTTTCTGCATTGCTGCTTTACAAAATTGAGAGTAGACATAGACTTGGATGATCCAAGGACAGTAGCAGAAGCACTTGTGCGGTCTAGACATGGTTAGTATGGCTTATAAAGATTTCTTTTTCCAATTCTTATTCTTCCACCAATTGCACCCTGCGCTTGTCACTAACTTTACTTCTATCATGCATTGAGAAATTAATTACGGGAACTGATTTTACAGCGGTAGAAGGAAATGAGAAGCAGACAGCACTTGATAGGATCTCTAGGATGACAGCGATTGAGGATCTTCAAGCACCTCATAGCCATCCTTTTGCTCCTCTTTGTATCAAGGTTTGATTTTTATTGGTGAAAGATGCTTAAGATCTACAATCATTGTAGCAGAGGACTATTTGATGCCATATTTCCATAATAGGATCCTCGAGATTATTTTGATGCTCAACAAGCAAATGCCATCAAGACATTGGATGATACACGAGCAGGAATGCAACAAACTAAATGCAGTTTGGGTACTACAGAAGCATATTGCTCACTGAGGGAATCCATATCTGAGATCAAATCTTCCGGATTGAAGCTTCCCATAATTAAACCTGAAGTTGCTCTTATGGTAAAAGCACCTGCAGCATTCTCTTGCCAAAAACACGCATTTCTGCTATGTATTGACAACTCTTCTTCTGCTACCAAACTTCTAATTCTTCTTTTGTAGGTTTATAACGGATTAACTCAAAATATTTCTAGTACTAAATATCAACTAGGGAAAAACCCTCAGGAGAGTATTCTGGAGAGTTTACCAAACCCTACTAAGGAGGAACTTCTACACGTGAGTTGGTGCTGCTCTTGCTGTAATATTTGAATATATTTTGATGGCTGATTTTCAGGAGATGTTAAAATCACTCAATTTACCATCTCAGTAACCTTTATTGGCCATAATAATCATGTTTTGGACTTTGAAGTTGGTTTGCTGCCATACGTTAGTGTTCTCCAGTAACTTGTTTCTTCAGATATTACTGTTTAGTGAGGTATTGTTTTAAGAAAAGCTTATTGTTTCCAGCACTGGATCTCGATTCAAGAATTACTCAGGCATTTTTGGTCGTCTTACCCAATCACTACATCATATCTTTATACTAAA

mRNA sequence

ATGGGAACCAAGTATGTCCATAAGAGTGCCAAGTACAAGACCTCAGTTAAGGATCCTGGCATGCCTGGCGTTTTGGAAATGACAGAGCACAAGTTTGTATTTAGACCCAGTGATCCCACTTCAGCTTCTAAGCTTGATGTAGAGTTTAGATTTATTAAAGGCCACAAAAACACTAAGGAAGGATCAAATAAACCACCGTGGCTTAATCTCACCAGGGACCAGGGTGGAAGTTACATTTTTGAGTTCAAAAATTTCTCTGATCTTCATGTTTGCCGCGAGTTTGTAGGAAGTGCTTTAGCAAAGTCAGGAGAGGCTGCACAAGCTGCTCCCTCTGAGAGGCCTGTGGCGGCATTTCCTCATGAACAACTCAGTAAATCAGAAATGGAACTTCGGATGAGATGTTTGCAAGAGGATAGTGAACTGCAAAAACTCCATAAACAATTTGTGATTGGTGGTGTGTTGACAGAATCTGAATTTTGGGCAGCAAGGAAGAAATTACTGGAACGAGACAGCTCCAAAAAATCAAAACAACTGATTGGTTTTAAGAGTTCAATGGTTTTGGATACCAAACCAATGTCTGATGGTCGGACAAACAAGGTTACATTTAATTTGACACCGGAGATCAAATATCAGATTTTTGCTCTGAAACCAGCTGTTCACCAGGCCTTCCTTAATCACGTTCCCGATAAGATGTCAGAGAAAGACTTTTGGACAAAATATTTTAGAGCGGAGTACCTTCATAGTACCAAAAATTCTATTGCAGCTGCAGCAGAGGCTGCTGAAGACGAAGAACTTGCCCTTTTTCTGAAGGATGACGAGATATTGGCTGCTGAAACTCGGAAAAAGATTCGGCATGTTGATCCTACATTGGATTTGGAAGCGGATCTAGGAGATGATTACACACACCTTCCAGATCATGGAATCTTTCGTGATGGTGGCAAGGAGATAACTGAATCACAAAATGAGCACTATAAAAGGACTTTGTCACAAGACCTTAATCGTCAAGGTGCAGTTGTTCTTGAAGGCAGAACTATAGACATAGACTTGGATGATCCAAGGACAGTAGCAGAAGCACTTGTGCGGTCTAGACATGCGGTAGAAGGAAATGAGAAGCAGACAGCACTTGATAGGATCTCTAGGATGACAGCGATTGAGGATCTTCAAGCACCTCATAGCCATCCTTTTGCTCCTCTTTGTATCAAGGATCCTCGAGATTATTTTGATGCTCAACAAGCAAATGCCATCAAGACATTGGATGATACACGAGCAGGAATGCAACAAACTAAATGCAGTTTGGGTACTACAGAAGCATATTGCTCACTGAGGGAATCCATATCTGAGATCAAATCTTCCGGATTGAAGCTTCCCATAATTAAACCTGAAGTTGCTCTTATGGTTTATAACGGATTAACTCAAAATATTTCTAGTACTAAATATCAACTAGGGAAAAACCCTCAGGAGAGTATTCTGGAGAGTTTACCAAACCCTACTAAGGAGGAACTTCTACACCACTGGATCTCGATTCAAGAATTACTCAGGCATTTTTGGTCGTCTTACCCAATCACTACATCATATCTTTATACTAAA

Coding sequence (CDS)

ATGGGAACCAAGTATGTCCATAAGAGTGCCAAGTACAAGACCTCAGTTAAGGATCCTGGCATGCCTGGCGTTTTGGAAATGACAGAGCACAAGTTTGTATTTAGACCCAGTGATCCCACTTCAGCTTCTAAGCTTGATGTAGAGTTTAGATTTATTAAAGGCCACAAAAACACTAAGGAAGGATCAAATAAACCACCGTGGCTTAATCTCACCAGGGACCAGGGTGGAAGTTACATTTTTGAGTTCAAAAATTTCTCTGATCTTCATGTTTGCCGCGAGTTTGTAGGAAGTGCTTTAGCAAAGTCAGGAGAGGCTGCACAAGCTGCTCCCTCTGAGAGGCCTGTGGCGGCATTTCCTCATGAACAACTCAGTAAATCAGAAATGGAACTTCGGATGAGATGTTTGCAAGAGGATAGTGAACTGCAAAAACTCCATAAACAATTTGTGATTGGTGGTGTGTTGACAGAATCTGAATTTTGGGCAGCAAGGAAGAAATTACTGGAACGAGACAGCTCCAAAAAATCAAAACAACTGATTGGTTTTAAGAGTTCAATGGTTTTGGATACCAAACCAATGTCTGATGGTCGGACAAACAAGGTTACATTTAATTTGACACCGGAGATCAAATATCAGATTTTTGCTCTGAAACCAGCTGTTCACCAGGCCTTCCTTAATCACGTTCCCGATAAGATGTCAGAGAAAGACTTTTGGACAAAATATTTTAGAGCGGAGTACCTTCATAGTACCAAAAATTCTATTGCAGCTGCAGCAGAGGCTGCTGAAGACGAAGAACTTGCCCTTTTTCTGAAGGATGACGAGATATTGGCTGCTGAAACTCGGAAAAAGATTCGGCATGTTGATCCTACATTGGATTTGGAAGCGGATCTAGGAGATGATTACACACACCTTCCAGATCATGGAATCTTTCGTGATGGTGGCAAGGAGATAACTGAATCACAAAATGAGCACTATAAAAGGACTTTGTCACAAGACCTTAATCGTCAAGGTGCAGTTGTTCTTGAAGGCAGAACTATAGACATAGACTTGGATGATCCAAGGACAGTAGCAGAAGCACTTGTGCGGTCTAGACATGCGGTAGAAGGAAATGAGAAGCAGACAGCACTTGATAGGATCTCTAGGATGACAGCGATTGAGGATCTTCAAGCACCTCATAGCCATCCTTTTGCTCCTCTTTGTATCAAGGATCCTCGAGATTATTTTGATGCTCAACAAGCAAATGCCATCAAGACATTGGATGATACACGAGCAGGAATGCAACAAACTAAATGCAGTTTGGGTACTACAGAAGCATATTGCTCACTGAGGGAATCCATATCTGAGATCAAATCTTCCGGATTGAAGCTTCCCATAATTAAACCTGAAGTTGCTCTTATGGTTTATAACGGATTAACTCAAAATATTTCTAGTACTAAATATCAACTAGGGAAAAACCCTCAGGAGAGTATTCTGGAGAGTTTACCAAACCCTACTAAGGAGGAACTTCTACACCACTGGATCTCGATTCAAGAATTACTCAGGCATTTTTGGTCGTCTTACCCAATCACTACATCATATCTTTATACTAAA

Protein sequence

MGTKYVHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEALVRSRHAVEGNEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK
BLAST of Cla97C04G070320 vs. NCBI nr
Match: XP_008457278.1 (PREDICTED: probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 [Cucumis melo])

HSP 1 Score: 1020.4 bits (2637), Expect = 2.2e-294
Identity = 513/529 (96.98%), Postives = 521/529 (98.49%), Query Frame = 0

Query: 1   MGTKYVHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60
           MGTKYVHKSAKYKTSVKDPG PGVLEMTE KFVFRPSDPTSASKLDVEFRFIKGHKNTKE
Sbjct: 1   MGTKYVHKSAKYKTSVKDPGTPGVLEMTECKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60

Query: 61  GSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPSERPVAAFPH 120
           GSNKPPWLNLT+DQGGSYIFEFKNFSDLHVCREFVGSALAK GEAAQ APSERPVAAFPH
Sbjct: 61  GSNKPPWLNLTKDQGGSYIFEFKNFSDLHVCREFVGSALAKLGEAAQ-APSERPVAAFPH 120

Query: 121 EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIG 180
           EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIG
Sbjct: 121 EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIG 180

Query: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKY 240
           FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVP+KMSEKDFWTKY
Sbjct: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPNKMSEKDFWTKY 240

Query: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY 300
           FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAA+TRKKIRHVDPTLDLEADLGDDY
Sbjct: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAADTRKKIRHVDPTLDLEADLGDDY 300

Query: 301 THLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEALV 360
           THLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTID+DL+DPRTVAEALV
Sbjct: 301 THLPDHGIFRDGGKEITESQNEHYRRTLSQDLNRQGAVVLEGRTIDVDLEDPRTVAEALV 360

Query: 361 RSRHAVEGNEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD 420
           RSRHAVEGNE+QTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD
Sbjct: 361 RSRHAVEGNERQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD 420

Query: 421 TRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKYQ 480
           TRAGMQQTKCSL TTEAYCSLRESISEIKSSG   PIIKPEVALMVYNGLTQNISSTKYQ
Sbjct: 421 TRAGMQQTKCSLSTTEAYCSLRESISEIKSSGFNHPIIKPEVALMVYNGLTQNISSTKYQ 480

Query: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 530
           LGKNPQESILESLPNPTKEELLHHWISIQELL+HFWSSYPITTSYLYTK
Sbjct: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLKHFWSSYPITTSYLYTK 528

BLAST of Cla97C04G070320 vs. NCBI nr
Match: XP_011653743.1 (PREDICTED: probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 [Cucumis sativus] >XP_011653744.1 PREDICTED: probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 [Cucumis sativus])

HSP 1 Score: 1005.0 bits (2597), Expect = 9.6e-290
Identity = 506/529 (95.65%), Postives = 517/529 (97.73%), Query Frame = 0

Query: 1   MGTKYVHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60
           MGTKYVHKSAKYKTSVKDPG PGVLEMTE KFVFRPSDPTSASKLDVEFRFIKGHKNTKE
Sbjct: 1   MGTKYVHKSAKYKTSVKDPGTPGVLEMTECKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60

Query: 61  GSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPSERPVAAFPH 120
           GSNKPPWLNLT+DQGGSYIFEFKNFSDLHVCRE VGSALAK GEAAQ APSERPVAAFPH
Sbjct: 61  GSNKPPWLNLTKDQGGSYIFEFKNFSDLHVCRELVGSALAKLGEAAQ-APSERPVAAFPH 120

Query: 121 EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIG 180
           EQLSK EMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLE+D+SKKSKQLIG
Sbjct: 121 EQLSKLEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLEQDNSKKSKQLIG 180

Query: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKY 240
           FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVP+K+SEKDFWTKY
Sbjct: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPNKISEKDFWTKY 240

Query: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY 300
           FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY
Sbjct: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY 300

Query: 301 THLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEALV 360
           THLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTID+DL+DPRTVA+ALV
Sbjct: 301 THLPDHGIFRDGGKEITESQNEHYRRTLSQDLNRQGAVVLEGRTIDVDLEDPRTVADALV 360

Query: 361 RSRHAVEGNEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD 420
           RS+HAVEGNE QTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD
Sbjct: 361 RSKHAVEGNESQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD 420

Query: 421 TRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKYQ 480
           TRAGMQQTKCSL TTEAY SLRESISEIKSSG   PIIKPEVALMVYNGLTQNISSTKYQ
Sbjct: 421 TRAGMQQTKCSLSTTEAYGSLRESISEIKSSGFNHPIIKPEVALMVYNGLTQNISSTKYQ 480

Query: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 530
           LGKNPQESILESLPNPTKEELLHHWISIQELL+HFWSSYPITTSYLYTK
Sbjct: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLKHFWSSYPITTSYLYTK 528

BLAST of Cla97C04G070320 vs. NCBI nr
Match: XP_022932684.1 (probable RNA polymerase II transcription factor B subunit 1-1 [Cucurbita moschata])

HSP 1 Score: 994.6 bits (2570), Expect = 1.3e-286
Identity = 503/529 (95.09%), Postives = 511/529 (96.60%), Query Frame = 0

Query: 1   MGTKYVHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60
           MGTKYV KSAKYKTSVKDPG PGVLEMTE KFVFRPSDPTSASKLDVEFRFIKGHKNTKE
Sbjct: 1   MGTKYVQKSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60

Query: 61  GSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPSERPVAAFPH 120
           GSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA Q APSE+ VA FPH
Sbjct: 61  GSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAPQ-APSEKLVATFPH 120

Query: 121 EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIG 180
           EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD S KSKQL+G
Sbjct: 121 EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDYSTKSKQLVG 180

Query: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKY 240
           FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVP+KMSEKDFWTKY
Sbjct: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPNKMSEKDFWTKY 240

Query: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY 300
           FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIR VDPTLDLEADLGDDY
Sbjct: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRRVDPTLDLEADLGDDY 300

Query: 301 THLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEALV 360
           THLPDHGIFRDGGKEITES NE  +RTLSQDLNRQGAVVLEGRTID+DL+DPRTVAEALV
Sbjct: 301 THLPDHGIFRDGGKEITESHNEQCRRTLSQDLNRQGAVVLEGRTIDVDLEDPRTVAEALV 360

Query: 361 RSRHAVEGNEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD 420
           RSRHAVEGNEKQ  LDRI+RMTAIEDLQAPHS+PFAPLCIKDPRDYFDAQQANAIKTLDD
Sbjct: 361 RSRHAVEGNEKQIGLDRIARMTAIEDLQAPHSYPFAPLCIKDPRDYFDAQQANAIKTLDD 420

Query: 421 TRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKYQ 480
           TRAGMQQTKCSL TTEAYCSLRESISEIKSSGL  PIIKPEVALMVYNGLTQNISSTKYQ
Sbjct: 421 TRAGMQQTKCSLSTTEAYCSLRESISEIKSSGLNHPIIKPEVALMVYNGLTQNISSTKYQ 480

Query: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 530
           LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK
Sbjct: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 528

BLAST of Cla97C04G070320 vs. NCBI nr
Match: XP_023539702.1 (probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 990.7 bits (2560), Expect = 1.9e-285
Identity = 500/529 (94.52%), Postives = 510/529 (96.41%), Query Frame = 0

Query: 1   MGTKYVHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60
           MGTKYV KSAKYKTSVKDPG PGVLEMTE KFVFRPSDPTSASKLDV+FRFIKGHKNTKE
Sbjct: 1   MGTKYVQKSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSASKLDVDFRFIKGHKNTKE 60

Query: 61  GSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPSERPVAAFPH 120
           GSNKPPWLNLT+DQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA Q APSE+ VA FPH
Sbjct: 61  GSNKPPWLNLTKDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAPQ-APSEKLVATFPH 120

Query: 121 EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIG 180
           EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD S KSKQL+G
Sbjct: 121 EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDYSTKSKQLVG 180

Query: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKY 240
           FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVP+KMSEKDFWTKY
Sbjct: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPNKMSEKDFWTKY 240

Query: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY 300
           FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIR VDPTLDLEADLGDDY
Sbjct: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRRVDPTLDLEADLGDDY 300

Query: 301 THLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEALV 360
           THLPDHGIFRDGGKEITES NE  +RTLSQDLNRQGAVVLEGRTID+DL+DPRTVAEALV
Sbjct: 301 THLPDHGIFRDGGKEITESNNEQCRRTLSQDLNRQGAVVLEGRTIDVDLEDPRTVAEALV 360

Query: 361 RSRHAVEGNEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD 420
           RSRHA EGNEKQ  LDRI+RMTAIEDLQAPHS+PFAPLCIKDPRDYFDAQQANAIKTLDD
Sbjct: 361 RSRHAAEGNEKQIGLDRIARMTAIEDLQAPHSYPFAPLCIKDPRDYFDAQQANAIKTLDD 420

Query: 421 TRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKYQ 480
           TRAGMQQTKCSL TTEAYCSLRESISEIKSSGL  PIIKPEVALMVYNGLTQNISSTKYQ
Sbjct: 421 TRAGMQQTKCSLSTTEAYCSLRESISEIKSSGLNHPIIKPEVALMVYNGLTQNISSTKYQ 480

Query: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 530
           LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK
Sbjct: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 528

BLAST of Cla97C04G070320 vs. NCBI nr
Match: XP_022972269.1 (probable RNA polymerase II transcription factor B subunit 1-1 [Cucurbita maxima])

HSP 1 Score: 990.3 bits (2559), Expect = 2.5e-285
Identity = 501/529 (94.71%), Postives = 510/529 (96.41%), Query Frame = 0

Query: 1   MGTKYVHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60
           MGTKYV KSAKYKTSVKDPG PGVLEMTE KFVFRPSDPTSASKLDVEFRFIKGHKNTKE
Sbjct: 1   MGTKYVQKSAKYKTSVKDPGTPGVLEMTERKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60

Query: 61  GSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPSERPVAAFPH 120
           GSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEA Q APSE+ VA FPH
Sbjct: 61  GSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAPQ-APSEKLVATFPH 120

Query: 121 EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIG 180
           EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERD S KSKQL+G
Sbjct: 121 EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDYSTKSKQLVG 180

Query: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKY 240
           FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVP+KMSEKDFWTKY
Sbjct: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPNKMSEKDFWTKY 240

Query: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY 300
           FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIR VDPTLDLEADLGDDY
Sbjct: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRRVDPTLDLEADLGDDY 300

Query: 301 THLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEALV 360
            HLPDHGIFRDGGKEITES NE  +RTLSQDLNRQGAVVLEGRTID+DL+DPRTVAEALV
Sbjct: 301 IHLPDHGIFRDGGKEITESHNEQCRRTLSQDLNRQGAVVLEGRTIDVDLEDPRTVAEALV 360

Query: 361 RSRHAVEGNEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD 420
           RSRHAVEG+EKQ  LDRI+RMTAIEDLQAPHS+PFAPLCIKDPRDYFDAQQANAIKTLDD
Sbjct: 361 RSRHAVEGSEKQIGLDRIARMTAIEDLQAPHSYPFAPLCIKDPRDYFDAQQANAIKTLDD 420

Query: 421 TRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKYQ 480
           TRAGMQQTKCSL TTEAYCSLRESISEIKSSGL  PIIKPEVALMVYNGLTQNISSTKYQ
Sbjct: 421 TRAGMQQTKCSLSTTEAYCSLRESISEIKSSGLNHPIIKPEVALMVYNGLTQNISSTKYQ 480

Query: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 530
           LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK
Sbjct: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 528

BLAST of Cla97C04G070320 vs. TrEMBL
Match: tr|A0A1S3C6E8|A0A1S3C6E8_CUCME (probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497008 PE=4 SV=1)

HSP 1 Score: 1020.4 bits (2637), Expect = 1.5e-294
Identity = 513/529 (96.98%), Postives = 521/529 (98.49%), Query Frame = 0

Query: 1   MGTKYVHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60
           MGTKYVHKSAKYKTSVKDPG PGVLEMTE KFVFRPSDPTSASKLDVEFRFIKGHKNTKE
Sbjct: 1   MGTKYVHKSAKYKTSVKDPGTPGVLEMTECKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60

Query: 61  GSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPSERPVAAFPH 120
           GSNKPPWLNLT+DQGGSYIFEFKNFSDLHVCREFVGSALAK GEAAQ APSERPVAAFPH
Sbjct: 61  GSNKPPWLNLTKDQGGSYIFEFKNFSDLHVCREFVGSALAKLGEAAQ-APSERPVAAFPH 120

Query: 121 EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIG 180
           EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIG
Sbjct: 121 EQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIG 180

Query: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKY 240
           FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVP+KMSEKDFWTKY
Sbjct: 181 FKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPNKMSEKDFWTKY 240

Query: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDY 300
           FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAA+TRKKIRHVDPTLDLEADLGDDY
Sbjct: 241 FRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAADTRKKIRHVDPTLDLEADLGDDY 300

Query: 301 THLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEALV 360
           THLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQGAVVLEGRTID+DL+DPRTVAEALV
Sbjct: 301 THLPDHGIFRDGGKEITESQNEHYRRTLSQDLNRQGAVVLEGRTIDVDLEDPRTVAEALV 360

Query: 361 RSRHAVEGNEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD 420
           RSRHAVEGNE+QTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD
Sbjct: 361 RSRHAVEGNERQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDD 420

Query: 421 TRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKYQ 480
           TRAGMQQTKCSL TTEAYCSLRESISEIKSSG   PIIKPEVALMVYNGLTQNISSTKYQ
Sbjct: 421 TRAGMQQTKCSLSTTEAYCSLRESISEIKSSGFNHPIIKPEVALMVYNGLTQNISSTKYQ 480

Query: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 530
           LGKNPQESILESLPNPTKEELLHHWISIQELL+HFWSSYPITTSYLYTK
Sbjct: 481 LGKNPQESILESLPNPTKEELLHHWISIQELLKHFWSSYPITTSYLYTK 528

BLAST of Cla97C04G070320 vs. TrEMBL
Match: tr|A0A1S4E1L0|A0A1S4E1L0_CUCME (probable RNA polymerase II transcription factor B subunit 1-1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103497008 PE=4 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 1.1e-236
Identity = 421/434 (97.00%), Postives = 428/434 (98.62%), Query Frame = 0

Query: 96  GSALAKSGEAAQAAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLT 155
           GSALAK GEAAQ APSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLT
Sbjct: 10  GSALAKLGEAAQ-APSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLT 69

Query: 156 ESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFAL 215
           ESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFAL
Sbjct: 70  ESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFAL 129

Query: 216 KPAVHQAFLNHVPDKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEIL 275
           KPAVHQAFLNHVP+KMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEIL
Sbjct: 130 KPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEIL 189

Query: 276 AAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQ 335
           AA+TRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQ
Sbjct: 190 AADTRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLSQDLNRQ 249

Query: 336 GAVVLEGRTIDIDLDDPRTVAEALVRSRHAVEGNEKQTALDRISRMTAIEDLQAPHSHPF 395
           GAVVLEGRTID+DL+DPRTVAEALVRSRHAVEGNE+QTALDRISRMTAIEDLQAPHSHPF
Sbjct: 250 GAVVLEGRTIDVDLEDPRTVAEALVRSRHAVEGNERQTALDRISRMTAIEDLQAPHSHPF 309

Query: 396 APLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKL 455
           APLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSL TTEAYCSLRESISEIKSSG   
Sbjct: 310 APLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYCSLRESISEIKSSGFNH 369

Query: 456 PIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELLRHF 515
           PIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELL+HF
Sbjct: 370 PIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELLKHF 429

Query: 516 WSSYPITTSYLYTK 530
           WSSYPITTSYLYTK
Sbjct: 430 WSSYPITTSYLYTK 442

BLAST of Cla97C04G070320 vs. TrEMBL
Match: tr|A0A1S4E1K9|A0A1S4E1K9_CUCME (probable RNA polymerase II transcription factor B subunit 1-1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103497008 PE=4 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 1.1e-236
Identity = 421/434 (97.00%), Postives = 428/434 (98.62%), Query Frame = 0

Query: 96  GSALAKSGEAAQAAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLT 155
           GSALAK GEAAQ APSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLT
Sbjct: 14  GSALAKLGEAAQ-APSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLT 73

Query: 156 ESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFAL 215
           ESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFAL
Sbjct: 74  ESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFAL 133

Query: 216 KPAVHQAFLNHVPDKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEIL 275
           KPAVHQAFLNHVP+KMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEIL
Sbjct: 134 KPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEIL 193

Query: 276 AAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQ 335
           AA+TRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHY+RTLSQDLNRQ
Sbjct: 194 AADTRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLSQDLNRQ 253

Query: 336 GAVVLEGRTIDIDLDDPRTVAEALVRSRHAVEGNEKQTALDRISRMTAIEDLQAPHSHPF 395
           GAVVLEGRTID+DL+DPRTVAEALVRSRHAVEGNE+QTALDRISRMTAIEDLQAPHSHPF
Sbjct: 254 GAVVLEGRTIDVDLEDPRTVAEALVRSRHAVEGNERQTALDRISRMTAIEDLQAPHSHPF 313

Query: 396 APLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKL 455
           APLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSL TTEAYCSLRESISEIKSSG   
Sbjct: 314 APLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYCSLRESISEIKSSGFNH 373

Query: 456 PIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELLRHF 515
           PIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELL+HF
Sbjct: 374 PIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELLKHF 433

Query: 516 WSSYPITTSYLYTK 530
           WSSYPITTSYLYTK
Sbjct: 434 WSSYPITTSYLYTK 446

BLAST of Cla97C04G070320 vs. TrEMBL
Match: tr|A0A1S4E1L2|A0A1S4E1L2_CUCME (probable RNA polymerase II transcription factor B subunit 1-1 isoform X4 OS=Cucumis melo OX=3656 GN=LOC103497008 PE=4 SV=1)

HSP 1 Score: 826.2 bits (2133), Expect = 4.1e-236
Identity = 420/435 (96.55%), Postives = 428/435 (98.39%), Query Frame = 0

Query: 95  VGSALAKSGEAAQAAPSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVL 154
           + SALAK GEAAQ APSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVL
Sbjct: 1   MASALAKLGEAAQ-APSERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVL 60

Query: 155 TESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFA 214
           TESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFA
Sbjct: 61  TESEFWAARKKLLERDSSKKSKQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFA 120

Query: 215 LKPAVHQAFLNHVPDKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEI 274
           LKPAVHQAFLNHVP+KMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEI
Sbjct: 121 LKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEI 180

Query: 275 LAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNR 334
           LAA+TRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHY+RTLSQDLNR
Sbjct: 181 LAADTRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLSQDLNR 240

Query: 335 QGAVVLEGRTIDIDLDDPRTVAEALVRSRHAVEGNEKQTALDRISRMTAIEDLQAPHSHP 394
           QGAVVLEGRTID+DL+DPRTVAEALVRSRHAVEGNE+QTALDRISRMTAIEDLQAPHSHP
Sbjct: 241 QGAVVLEGRTIDVDLEDPRTVAEALVRSRHAVEGNERQTALDRISRMTAIEDLQAPHSHP 300

Query: 395 FAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLK 454
           FAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSL TTEAYCSLRESISEIKSSG  
Sbjct: 301 FAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYCSLRESISEIKSSGFN 360

Query: 455 LPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELLRH 514
            PIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELL+H
Sbjct: 361 HPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELLKH 420

Query: 515 FWSSYPITTSYLYTK 530
           FWSSYPITTSYLYTK
Sbjct: 421 FWSSYPITTSYLYTK 434

BLAST of Cla97C04G070320 vs. TrEMBL
Match: tr|A0A2P6PCY5|A0A2P6PCY5_ROSCH (Putative transcription factor BSD family OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr7g0221221 PE=4 SV=1)

HSP 1 Score: 750.4 bits (1936), Expect = 2.8e-213
Identity = 376/530 (70.94%), Postives = 435/530 (82.08%), Query Frame = 0

Query: 1   MGTKYVHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKE 60
           M T  V K AKYKTS+KDPG PG+L M E++FVFRP+DPTS SKLDV F  IKG KNTKE
Sbjct: 1   MSTPQVIKRAKYKTSIKDPGTPGLLTMMENRFVFRPNDPTSPSKLDVGFHQIKGQKNTKE 60

Query: 61  GSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSAL-AKSGEAAQAAPSERPVAAFP 120
           GSNKPPWLN++ ++ GSYIFEF+++ DLH CRE + +AL A  G +A             
Sbjct: 61  GSNKPPWLNISNNKDGSYIFEFESYPDLHACREIIANALEAAKGSSAATXXXXXXXXXXX 120

Query: 121 HEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLI 180
             Q S +E+ELRM+ +QEDSELQKLHKQFV  GVLTESEFWA RKKLL+ DS KKSKQ +
Sbjct: 121 XXQYSTAELELRMKLMQEDSELQKLHKQFVGSGVLTESEFWATRKKLLDGDSRKKSKQRV 180

Query: 181 GFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTK 240
           GFKSSM+LD KPM+DGR NKVTFNLT EIKYQIFALKPAVHQA+++ VP KM+EKDFW K
Sbjct: 181 GFKSSMILDIKPMTDGRMNKVTFNLTDEIKYQIFALKPAVHQAYIDLVPSKMTEKDFWNK 240

Query: 241 YFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDD 300
           YFRAEYLHSTKN +AAAAEAAEDEELA+FLK+DEILA E R+KI+ VDPTLD+EAD GDD
Sbjct: 241 YFRAEYLHSTKNVVAAAAEAAEDEELAIFLKEDEILAREARQKIQRVDPTLDMEADQGDD 300

Query: 301 YTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEAL 360
           Y HLPDHGIFRDG K+ITE QN+ Y+RTLSQDLNRQGAVVL+GR +D+D +DPRT+AEAL
Sbjct: 301 YIHLPDHGIFRDGTKDITELQNDQYRRTLSQDLNRQGAVVLQGRNLDVDPEDPRTIAEAL 360

Query: 361 VRSRHAVEGNEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLD 420
           +RSR     +  Q  LDR++RMT I+DL+  H HP APLCIKDPRDYFD QQANA++TLD
Sbjct: 361 MRSRQEPSDSAVQERLDRLTRMTEIDDLRESHDHPVAPLCIKDPRDYFDTQQANALRTLD 420

Query: 421 DTRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKY 480
           D+R G +QTK  L T EAY +L+ESIS+IKS GL    +KPEVAL V+NGLTQNISSTKY
Sbjct: 421 DSRTGTEQTKRKLTTEEAYGALKESISKIKSIGLSNSTVKPEVALTVFNGLTQNISSTKY 480

Query: 481 QLGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 530
           QLGKNP+ESIL+ LPN TKEELLHHW+SIQELLRHFWSSYPITTS LYTK
Sbjct: 481 QLGKNPRESILDGLPNRTKEELLHHWMSIQELLRHFWSSYPITTSSLYTK 530

BLAST of Cla97C04G070320 vs. Swiss-Prot
Match: sp|Q3ECP0|TFB1A_ARATH (General transcription and DNA repair factor IIH subunit TFB1-1 OS=Arabidopsis thaliana OX=3702 GN=TFB1-1 PE=2 SV=1)

HSP 1 Score: 596.3 bits (1536), Expect = 3.4e-169
Identity = 313/530 (59.06%), Postives = 390/530 (73.58%), Query Frame = 0

Query: 6   VHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKEGSNKP 65
           + K  KYK++VKDPG PG L + E   +F P+DP S SKL V  + IK  K TKEGSNKP
Sbjct: 6   IEKLVKYKSTVKDPGTPGFLRIREGMLLFVPNDPKSDSKLKVLTQNIKSQKYTKEGSNKP 65

Query: 66  PWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPSERPVAAFPHEQLSK 125
           PWLNLT  Q  S+IFEF+N+ D+H CR+F+  ALAK     +  P+ + V +   EQLS 
Sbjct: 66  PWLNLTNKQAKSHIFEFENYPDMHACRDFITKALAK----CELEPN-KSVVSTSSEQLSI 125

Query: 126 SEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSM 185
            E+ELR + L+E+SELQ+LHKQFV   VLTE EFWA RKKLL +DS +KSKQ +G KS M
Sbjct: 126 KELELRFKLLRENSELQRLHKQFVESKVLTEDEFWATRKKLLGKDSIRKSKQQLGLKSMM 185

Query: 186 VLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKYFRAEY 245
           V   KP +DGRTN+VTFNLTPEI +QIFA KPAV QAF+N+VP KM+EKDFWTKYFRAEY
Sbjct: 186 VSGIKPSTDGRTNRVTFNLTPEIIFQIFAEKPAVRQAFINYVPSKMTEKDFWTKYFRAEY 245

Query: 246 LHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPD 305
           L+STKN+  AAAEAAEDEELA+FLK DEILA ETR KIR VDPTLD+EAD GDDYTHL D
Sbjct: 246 LYSTKNTAVAAAEAAEDEELAVFLKPDEILARETRHKIRRVDPTLDMEADQGDDYTHLMD 305

Query: 306 HGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEALVRSRHA 365
           HGI RDG  ++ E QN+ +KR+L QDLNR  AVVLEGR+ID++ +D R VAEAL R +  
Sbjct: 306 HGIQRDGTMDVVEPQNDQFKRSLLQDLNRHAAVVLEGRSIDVESEDTRIVAEALTRVKQV 365

Query: 366 VEG------NEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLD 425
            +       +  Q  L+R+SR+  +EDLQAP + P APL IKDPRDYF++QQ N +    
Sbjct: 366 SKADGETTKDANQERLERMSRVAGMEDLQAPQNFPLAPLSIKDPRDYFESQQGNVLNVPR 425

Query: 426 DTRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKY 485
             + G+++        EAY  L+ESI EI+++GL  P+IKPEV+  V++ LT+ I++ K 
Sbjct: 426 GAK-GLKR-----NVHEAYGLLKESILEIRATGLSDPLIKPEVSFEVFSSLTRTIATAKN 485

Query: 486 QLGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 530
             GKNP+ES L+ LP  TK+E+LHHW SIQELL+HFWSSYPITT+YL+TK
Sbjct: 486 INGKNPRESFLDRLPKSTKDEVLHHWTSIQELLKHFWSSYPITTTYLHTK 524

BLAST of Cla97C04G070320 vs. Swiss-Prot
Match: sp|Q9M322|TFB1C_ARATH (General transcription and DNA repair factor IIH subunit TFB1-3 OS=Arabidopsis thaliana OX=3702 GN=TFB1-3 PE=2 SV=2)

HSP 1 Score: 589.0 bits (1517), Expect = 5.4e-167
Identity = 312/530 (58.87%), Postives = 381/530 (71.89%), Query Frame = 0

Query: 6   VHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKEGSNKP 65
           + K  KYK+ VKDPG  G LE++E   +F P+DP S  KL V+   IK  K TKEGSNKP
Sbjct: 1   MEKRVKYKSFVKDPGTLGSLELSEVMLLFVPNDPKSDLKLKVQTHNIKSQKYTKEGSNKP 60

Query: 66  PWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPSERPVAAFPHEQLSK 125
           PWLNLT  QG S+IFEF+N+ D+H CR+F+  ALAK  E        + V   P EQLS 
Sbjct: 61  PWLNLTSKQGRSHIFEFENYPDMHACRDFITKALAKCEE-----EPNKLVVLTPAEQLSM 120

Query: 126 SEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSM 185
           +E ELR + L+E+SELQKLHKQFV   VLTE EFW+ RKKLL +DS +KSKQ +G KS M
Sbjct: 121 AEFELRFKLLRENSELQKLHKQFVESKVLTEDEFWSTRKKLLGKDSIRKSKQQMGLKSMM 180

Query: 186 VLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKYFRAEY 245
           V   KP +DGRTN+VTFNLT EI +QIFA KPAV QAF+N+VP KM+EKDFWTKYFRAEY
Sbjct: 181 VSGIKPSTDGRTNRVTFNLTSEIIFQIFAEKPAVRQAFINYVPKKMTEKDFWTKYFRAEY 240

Query: 246 LHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPD 305
           L+STKN+  AAAEAAEDEELA+FLK DEILA E R+K+R VDPTLD++AD GDDYTHL D
Sbjct: 241 LYSTKNTAVAAAEAAEDEELAVFLKPDEILAQEARQKMRRVDPTLDMDADEGDDYTHLMD 300

Query: 306 HGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEALVRSRHA 365
           HGI RDG  +I E QN+  KR+L QDLNR  AVVLEGR I++  +D R VAEAL R++  
Sbjct: 301 HGIQRDGTNDIIEPQNDQLKRSLLQDLNRHAAVVLEGRCINVQSEDTRIVAEALTRAKQV 360

Query: 366 ------VEGNEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLD 425
                 +  +  Q  L+R+SR T +EDLQAP + P APL IKDPRDYF++QQ N +    
Sbjct: 361 SKADGEITKDANQERLERMSRATEMEDLQAPQNFPLAPLSIKDPRDYFESQQGNILSEPR 420

Query: 426 DTRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKY 485
             +A  +         EAY  L+ESI  I+ +GL  P+IKPEV+  V++ LT+ IS+ K 
Sbjct: 421 GAKASKRNVH------EAYGLLKESILVIRMTGLSDPLIKPEVSFEVFSSLTRTISTAKN 480

Query: 486 QLGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 530
            LGKNPQES L+ LP  TK+E++HHW SIQEL+RHFWSSYPITT+YL TK
Sbjct: 481 ILGKNPQESFLDRLPKSTKDEVIHHWTSIQELVRHFWSSYPITTTYLSTK 519

BLAST of Cla97C04G070320 vs. Swiss-Prot
Match: sp|P32780|TF2H1_HUMAN (General transcription factor IIH subunit 1 OS=Homo sapiens OX=9606 GN=GTF2H1 PE=1 SV=1)

HSP 1 Score: 121.3 bits (303), Expect = 3.2e-26
Identity = 132/488 (27.05%), Postives = 212/488 (43.44%), Query Frame = 0

Query: 52  IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPS 111
           IK  K + EG  K   L L    G +  F F N S     R+ V   L       Q  P 
Sbjct: 50  IKCQKISPEGKAKIQ-LQLVLHAGDTTNFHFSNESTAVKERDAVKDLL------QQLLPK 109

Query: 112 ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDS 171
                    ++ +  E+E + R LQED  L +L+K  V+  V++  EFWA R  +   DS
Sbjct: 110 --------FKRKANKELEEKNRMLQEDPVLFQLYKDLVVSQVISAEEFWANRLNVNATDS 169

Query: 172 SKKS--KQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPD 231
           S  S  KQ +G  ++ + D +P +DG  N + +NLT +I   IF   PAV   +  +VP 
Sbjct: 170 SSTSNHKQDVGISAAFLADVRPQTDG-CNGLRYNLTSDIIESIFRTYPAVKMKYAENVPH 229

Query: 232 KMSEKDFWTKYFRAEYLHSTKNSIAAA---AEAAEDEELALFLKDDEILAAETRKKIRHV 291
            M+EK+FWT++F++ Y H  + +  +    AE A+ +E  L          +T   +   
Sbjct: 230 NMTEKEFWTRFFQSHYFHRDRLNTGSKDLFAECAKIDEKGL----------KTMVSLGVK 289

Query: 292 DPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTID 351
           +P LDL A   +D      +GI        ++S  E+    + +  N   A+VL      
Sbjct: 290 NPLLDLTA--LEDKPLDEGYGISSVPSASNSKSIKENSNAAIIKRFNHHSAMVLAAGLRK 349

Query: 352 IDLDDPRTVAEALVRSRHAVEGNEKQTALDRISRMTAI--EDLQAPHSHPFAPLCIKDPR 411
            +  + +T +E      ++ + +  Q A+ R     +I  EDL   +S     L +K   
Sbjct: 350 QEAQNEQT-SEPSNMDGNSGDADCFQPAVKRAKLQESIEYEDLGKNNSVKTIALNLKKSD 409

Query: 412 DYFDAQ---QANAIKTLDDTRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPE 471
            Y+      Q+    T  D     Q  +  +   EAY      +    ++   +  + P 
Sbjct: 410 RYYHGPTPIQSLQYATSQDIINSFQSIRQEM---EAYTPKLTQVLSSSAASSTITALSPG 469

Query: 472 VALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPI 530
            ALM   G T              Q++I + +PN  + EL H ++++ ELLRHFWS +P+
Sbjct: 470 GALM--QGGT--------------QQAINQMVPNDIQSELKHLYVAVGELLRHFWSCFPV 489

BLAST of Cla97C04G070320 vs. Swiss-Prot
Match: sp|Q9DBA9|TF2H1_MOUSE (General transcription factor IIH subunit 1 OS=Mus musculus OX=10090 GN=Gtf2h1 PE=1 SV=2)

HSP 1 Score: 116.3 bits (290), Expect = 1.0e-24
Identity = 133/503 (26.44%), Postives = 210/503 (41.75%), Query Frame = 0

Query: 52  IKGHKNTKEGSNKPPWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPS 111
           IK  K + EG  K   L L    G +  F F N S     R+ V   L       Q  P 
Sbjct: 50  IKCQKISPEGKAKIQ-LQLVLHAGDTTNFHFSNESTAVKERDAVKDLL------QQLLPK 109

Query: 112 ERPVAAFPHEQLSKSEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDS 171
                    ++ +  E+E + R LQED  L +L+K  V+  V++  EFWA R  +   DS
Sbjct: 110 --------FKRKANKELEEKNRMLQEDPVLFQLYKDLVVSQVISAEEFWANRLNVNATDS 169

Query: 172 SKKS-KQLIGFKSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDK 231
           S  S KQ +G  ++ + D +P +DG  N + +NLT +I   IF   PAV   +   VP  
Sbjct: 170 STSSHKQDVGISAAFLADVRPQTDG-CNGLRYNLTSDIIESIFRTYPAVKMKYAETVPHN 229

Query: 232 MSEKDFWTKYFRAEYLHSTKNSIAAA---AEAAEDEELALFLKDDEILAAETRKKIRHVD 291
           M+EK+FWT++F++ Y H  + +  +    AE A+ +E  L          +T   +   +
Sbjct: 230 MTEKEFWTRFFQSHYFHRDRLNTGSKDLFAECAKIDEKGL----------KTMVSLGVKN 289

Query: 292 PTLDLEA------DLGDDYTHLP----DHGIFRDGGKEITESQNEHYKRTLSQDLNRQGA 351
           P LDL +      D G   + +P       I  +    I +  N H    L+  L +Q A
Sbjct: 290 PMLDLTSLEDKPLDEGYSISSVPSTSNSKSIKENSNAAIIKRFNHHSAMVLAAGLRKQQA 349

Query: 352 --------VVLEGRTIDIDLDDPRTVAEALVRSRHAVEGNEKQTALDRISRMTAIEDLQA 411
                     ++G + D D   P            AV+  + Q +++        EDL  
Sbjct: 350 QNGQNGEPSSVDGNSGDTDCFQP------------AVKRAKLQESIE-------YEDLGN 409

Query: 412 PHSHPFAPLCIKDPRDYFDAQ---QANAIKTLDDTRAGMQQTKCSLGTTEAYCSLRESIS 471
            +S     L +K    Y+      Q+    T  D     Q  +  +   EAY      + 
Sbjct: 410 NNSVKTIALNLKKSDRYYHGPTPIQSLQYATSQDIINSFQSIRQEM---EAYTPKLTQVL 469

Query: 472 EIKSSGLKLPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWI 530
              ++   +  + P  ALM   G T              Q+++ + +PN  + EL H ++
Sbjct: 470 SSSAASSTITALSPGGALM--QGGT--------------QQAVNQMVPNDIQSELKHLYV 488

BLAST of Cla97C04G070320 vs. Swiss-Prot
Match: sp|Q55FP1|TF2H1_DICDI (General transcription factor IIH subunit 1 OS=Dictyostelium discoideum OX=44689 GN=gtf2h1 PE=3 SV=1)

HSP 1 Score: 107.5 bits (267), Expect = 4.8e-22
Identity = 66/224 (29.46%), Postives = 128/224 (57.14%), Query Frame = 0

Query: 123 LSKSEMELRMRCLQEDSELQKLHKQFV-IGGVLTESEFWAARKKLLERDSSKKSKQLIGF 182
           LS+ +++ R+  LQ + EL++L++Q V    V++ES+FW +RK +L+ DS++  KQ  G 
Sbjct: 182 LSEQQIKQRVILLQSNKELRELYEQMVNKDRVISESDFWESRKSMLKNDSTRSEKQHTGM 241

Query: 183 KSSMVLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKYF 242
            S+++ D +P S+   N V +  TP + +QIF   P+V +A+  +VP K+SE++FW KY 
Sbjct: 242 PSNLLADVRPSSE-TPNAVHYRFTPTVIHQIFIQHPSVEKAYKANVPLKISEQNFWKKYV 301

Query: 243 RAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYT 302
           +++Y +  ++S  A A   +D+  + +  D++      ++K+  ++P +DL +  G D  
Sbjct: 302 QSKYFYRDRSS--ANAPPVDDDLFSKYETDEQNKIRILKRKLIDINPLVDLSSTDGFDTD 361

Query: 303 HLPDHGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTI 346
               +G+  D  ++  + +       L +  NR  A+VL  + +
Sbjct: 362 VHSGYGVLLDQSQDPNKLEK---ALPLLRKFNRHSALVLGSKDL 399

BLAST of Cla97C04G070320 vs. TAIR10
Match: AT1G55750.1 (BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins))

HSP 1 Score: 596.3 bits (1536), Expect = 1.9e-170
Identity = 313/530 (59.06%), Postives = 390/530 (73.58%), Query Frame = 0

Query: 6   VHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKEGSNKP 65
           + K  KYK++VKDPG PG L + E   +F P+DP S SKL V  + IK  K TKEGSNKP
Sbjct: 6   IEKLVKYKSTVKDPGTPGFLRIREGMLLFVPNDPKSDSKLKVLTQNIKSQKYTKEGSNKP 65

Query: 66  PWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPSERPVAAFPHEQLSK 125
           PWLNLT  Q  S+IFEF+N+ D+H CR+F+  ALAK     +  P+ + V +   EQLS 
Sbjct: 66  PWLNLTNKQAKSHIFEFENYPDMHACRDFITKALAK----CELEPN-KSVVSTSSEQLSI 125

Query: 126 SEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSM 185
            E+ELR + L+E+SELQ+LHKQFV   VLTE EFWA RKKLL +DS +KSKQ +G KS M
Sbjct: 126 KELELRFKLLRENSELQRLHKQFVESKVLTEDEFWATRKKLLGKDSIRKSKQQLGLKSMM 185

Query: 186 VLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKYFRAEY 245
           V   KP +DGRTN+VTFNLTPEI +QIFA KPAV QAF+N+VP KM+EKDFWTKYFRAEY
Sbjct: 186 VSGIKPSTDGRTNRVTFNLTPEIIFQIFAEKPAVRQAFINYVPSKMTEKDFWTKYFRAEY 245

Query: 246 LHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPD 305
           L+STKN+  AAAEAAEDEELA+FLK DEILA ETR KIR VDPTLD+EAD GDDYTHL D
Sbjct: 246 LYSTKNTAVAAAEAAEDEELAVFLKPDEILARETRHKIRRVDPTLDMEADQGDDYTHLMD 305

Query: 306 HGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEALVRSRHA 365
           HGI RDG  ++ E QN+ +KR+L QDLNR  AVVLEGR+ID++ +D R VAEAL R +  
Sbjct: 306 HGIQRDGTMDVVEPQNDQFKRSLLQDLNRHAAVVLEGRSIDVESEDTRIVAEALTRVKQV 365

Query: 366 VEG------NEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLD 425
            +       +  Q  L+R+SR+  +EDLQAP + P APL IKDPRDYF++QQ N +    
Sbjct: 366 SKADGETTKDANQERLERMSRVAGMEDLQAPQNFPLAPLSIKDPRDYFESQQGNVLNVPR 425

Query: 426 DTRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKY 485
             + G+++        EAY  L+ESI EI+++GL  P+IKPEV+  V++ LT+ I++ K 
Sbjct: 426 GAK-GLKR-----NVHEAYGLLKESILEIRATGLSDPLIKPEVSFEVFSSLTRTIATAKN 485

Query: 486 QLGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 530
             GKNP+ES L+ LP  TK+E+LHHW SIQELL+HFWSSYPITT+YL+TK
Sbjct: 486 INGKNPRESFLDRLPKSTKDEVLHHWTSIQELLKHFWSSYPITTTYLHTK 524

BLAST of Cla97C04G070320 vs. TAIR10
Match: AT3G61420.1 (BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins))

HSP 1 Score: 589.0 bits (1517), Expect = 3.0e-168
Identity = 312/530 (58.87%), Postives = 381/530 (71.89%), Query Frame = 0

Query: 6   VHKSAKYKTSVKDPGMPGVLEMTEHKFVFRPSDPTSASKLDVEFRFIKGHKNTKEGSNKP 65
           + K  KYK+ VKDPG  G LE++E   +F P+DP S  KL V+   IK  K TKEGSNKP
Sbjct: 1   MEKRVKYKSFVKDPGTLGSLELSEVMLLFVPNDPKSDLKLKVQTHNIKSQKYTKEGSNKP 60

Query: 66  PWLNLTRDQGGSYIFEFKNFSDLHVCREFVGSALAKSGEAAQAAPSERPVAAFPHEQLSK 125
           PWLNLT  QG S+IFEF+N+ D+H CR+F+  ALAK  E        + V   P EQLS 
Sbjct: 61  PWLNLTSKQGRSHIFEFENYPDMHACRDFITKALAKCEE-----EPNKLVVLTPAEQLSM 120

Query: 126 SEMELRMRCLQEDSELQKLHKQFVIGGVLTESEFWAARKKLLERDSSKKSKQLIGFKSSM 185
           +E ELR + L+E+SELQKLHKQFV   VLTE EFW+ RKKLL +DS +KSKQ +G KS M
Sbjct: 121 AEFELRFKLLRENSELQKLHKQFVESKVLTEDEFWSTRKKLLGKDSIRKSKQQMGLKSMM 180

Query: 186 VLDTKPMSDGRTNKVTFNLTPEIKYQIFALKPAVHQAFLNHVPDKMSEKDFWTKYFRAEY 245
           V   KP +DGRTN+VTFNLT EI +QIFA KPAV QAF+N+VP KM+EKDFWTKYFRAEY
Sbjct: 181 VSGIKPSTDGRTNRVTFNLTSEIIFQIFAEKPAVRQAFINYVPKKMTEKDFWTKYFRAEY 240

Query: 246 LHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPD 305
           L+STKN+  AAAEAAEDEELA+FLK DEILA E R+K+R VDPTLD++AD GDDYTHL D
Sbjct: 241 LYSTKNTAVAAAEAAEDEELAVFLKPDEILAQEARQKMRRVDPTLDMDADEGDDYTHLMD 300

Query: 306 HGIFRDGGKEITESQNEHYKRTLSQDLNRQGAVVLEGRTIDIDLDDPRTVAEALVRSRHA 365
           HGI RDG  +I E QN+  KR+L QDLNR  AVVLEGR I++  +D R VAEAL R++  
Sbjct: 301 HGIQRDGTNDIIEPQNDQLKRSLLQDLNRHAAVVLEGRCINVQSEDTRIVAEALTRAKQV 360

Query: 366 ------VEGNEKQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLD 425
                 +  +  Q  L+R+SR T +EDLQAP + P APL IKDPRDYF++QQ N +    
Sbjct: 361 SKADGEITKDANQERLERMSRATEMEDLQAPQNFPLAPLSIKDPRDYFESQQGNILSEPR 420

Query: 426 DTRAGMQQTKCSLGTTEAYCSLRESISEIKSSGLKLPIIKPEVALMVYNGLTQNISSTKY 485
             +A  +         EAY  L+ESI  I+ +GL  P+IKPEV+  V++ LT+ IS+ K 
Sbjct: 421 GAKASKRNVH------EAYGLLKESILVIRMTGLSDPLIKPEVSFEVFSSLTRTISTAKN 480

Query: 486 QLGKNPQESILESLPNPTKEELLHHWISIQELLRHFWSSYPITTSYLYTK 530
            LGKNPQES L+ LP  TK+E++HHW SIQEL+RHFWSSYPITT+YL TK
Sbjct: 481 ILGKNPQESFLDRLPKSTKDEVIHHWTSIQELVRHFWSSYPITTTYLSTK 519

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008457278.12.2e-29496.98PREDICTED: probable RNA polymerase II transcription factor B subunit 1-1 isoform... [more]
XP_011653743.19.6e-29095.65PREDICTED: probable RNA polymerase II transcription factor B subunit 1-1 isoform... [more]
XP_022932684.11.3e-28695.09probable RNA polymerase II transcription factor B subunit 1-1 [Cucurbita moschat... [more]
XP_023539702.11.9e-28594.52probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 [Cucurb... [more]
XP_022972269.12.5e-28594.71probable RNA polymerase II transcription factor B subunit 1-1 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
tr|A0A1S3C6E8|A0A1S3C6E8_CUCME1.5e-29496.98probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 OS=Cucu... [more]
tr|A0A1S4E1L0|A0A1S4E1L0_CUCME1.1e-23697.00probable RNA polymerase II transcription factor B subunit 1-1 isoform X3 OS=Cucu... [more]
tr|A0A1S4E1K9|A0A1S4E1K9_CUCME1.1e-23697.00probable RNA polymerase II transcription factor B subunit 1-1 isoform X2 OS=Cucu... [more]
tr|A0A1S4E1L2|A0A1S4E1L2_CUCME4.1e-23696.55probable RNA polymerase II transcription factor B subunit 1-1 isoform X4 OS=Cucu... [more]
tr|A0A2P6PCY5|A0A2P6PCY5_ROSCH2.8e-21370.94Putative transcription factor BSD family OS=Rosa chinensis OX=74649 GN=RchiOBHm_... [more]
Match NameE-valueIdentityDescription
sp|Q3ECP0|TFB1A_ARATH3.4e-16959.06General transcription and DNA repair factor IIH subunit TFB1-1 OS=Arabidopsis th... [more]
sp|Q9M322|TFB1C_ARATH5.4e-16758.87General transcription and DNA repair factor IIH subunit TFB1-3 OS=Arabidopsis th... [more]
sp|P32780|TF2H1_HUMAN3.2e-2627.05General transcription factor IIH subunit 1 OS=Homo sapiens OX=9606 GN=GTF2H1 PE=... [more]
sp|Q9DBA9|TF2H1_MOUSE1.0e-2426.44General transcription factor IIH subunit 1 OS=Mus musculus OX=10090 GN=Gtf2h1 PE... [more]
sp|Q55FP1|TF2H1_DICDI4.8e-2229.46General transcription factor IIH subunit 1 OS=Dictyostelium discoideum OX=44689 ... [more]
Match NameE-valueIdentityDescription
AT1G55750.11.9e-17059.06BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS... [more]
AT3G61420.13.0e-16858.87BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS... [more]
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0006289nucleotide-excision repair
GO:0006351transcription, DNA-templated
Vocabulary: Cellular Component
TermDefinition
GO:0000439core TFIIH complex
Vocabulary: INTERPRO
TermDefinition
IPR027079Tfb1/GTF2H1
IPR005607BSD_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006289 nucleotide-excision repair
biological_process GO:0006351 transcription, DNA-templated
cellular_component GO:0000439 core TFIIH complex
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C04G070320.1Cla97C04G070320.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005607BSD domainSMARTSM00751wurzfinal6coord: 117..171
e-value: 0.045
score: 18.7
coord: 196..248
e-value: 5.2E-8
score: 42.6
IPR005607BSD domainPFAMPF03909BSDcoord: 199..251
e-value: 1.5E-11
score: 44.0
IPR005607BSD domainPROSITEPS50858BSDcoord: 196..248
score: 12.277
IPR027079TFIIH subunit Tfb1/GTF2H1PANTHERPTHR12856TRANSCRIPTION INITIATION FACTOR IIH-RELATEDcoord: 5..529
NoneNo IPR availableSUPERFAMILYSSF140383BSD domain-likecoord: 121..167
NoneNo IPR availableSUPERFAMILYSSF140383BSD domain-likecoord: 204..249
NoneNo IPR availableSUPERFAMILYSSF50729PH domain-likecoord: 21..101

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla97C04G070320Cla006088Watermelon (97103) v1wmwmbB293
Cla97C04G070320ClCG04G003010Watermelon (Charleston Gray)wcgwmbB200
Cla97C04G070320Lsi01G020130Bottle gourd (USVL1VR-Ls)lsiwmbB136
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C04G070320Silver-seed gourdcarwmbB0500
Cla97C04G070320Cucumber (Gy14) v2cgybwmbB283
Cla97C04G070320Cucumber (Gy14) v2cgybwmbB288
Cla97C04G070320Cucumber (Gy14) v1cgywmbB308
Cla97C04G070320Cucurbita maxima (Rimu)cmawmbB893
Cla97C04G070320Cucurbita moschata (Rifu)cmowmbB863
Cla97C04G070320Wild cucumber (PI 183967)cpiwmbB312
Cla97C04G070320Cucumber (Chinese Long) v3cucwmbB308
Cla97C04G070320Cucumber (Chinese Long) v2cuwmbB303
Cla97C04G070320Melon (DHL92) v3.6.1medwmbB537
Cla97C04G070320Melon (DHL92) v3.5.1mewmbB543
Cla97C04G070320Watermelon (97103) v1wmwmbB296
Cla97C04G070320Wax gourdwgowmbB434