Sed0003361 (gene) Chayote v1

Overview
Name: Sed0003361
Type: gene
Organism: Sechium edule (Chayote v1)
Description: RNA polymerase II transcription factor B subunit 2
Location: LG14: 22990603 .. 22999243 (-)
RNA-Seq Expression: Sed0003361
Synteny: Sed0003361
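The location given in the overview (linkage group, start, end, strand) can be turned into a genomic span length. The following is a minimal sketch, not part of the database record: `parse_location` and `span_length` are hypothetical helper names, and the coordinates are assumed to be 1-based and inclusive, which the page itself does not state.

```python
import re

# Hypothetical helper: parses a location string of the form
# "<seqid>: <start> .. <end> (<strand>)" as it appears in the overview.
LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text):
    """Return (seqid, start, end, strand) from a location string."""
    m = LOCATION_RE.match(text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("LG14: 22990603 .. 22999243 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 8641 bp on the minus strand of LG14.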
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATTCATTTAGGTTTGTGGATAGTTGTTTTGGTCGAAGACCGGTAATCCAATTGAACCCTTAGAAGTGATATTAAGGAAAAAATGATGGAAAATAAATGCATAGGTATCGTCAATTGATCTCTCCAGCTCCGGTACGAGTGTTTACAAATTTCATCTCTGAATTTCTTCCCGATTTGAGTTCCACCAAAGAATTCCTCAAAAGAAGAAAAGATCGATTGATTGAGAAAATGCCTCAAGTAAAGATCATAGCGAAGAATTTCATGGACATGGTGGCCTCCTTGCCCGCCATGAAGCTCGACAAGCTCTATCACAACGCGTTCATCTGCGAAGCCATTCTCAGGTTCAATTCGTCTTTTCTTTCATCTCAAATGTTCTACATTTCATATCGATTCTCAATTCCTTCTTGCTCATTTTTTACTTACTCGGGGTTTCGTTGCCCGGTCAGCGAGCGGGGATCTATGGTAGTGCTCGATAAACTTTCTTAAAACCCTAGCTACACTTAGTAGTCATTATGTATTTCGTATTCTCCTTAATAAAACCATTATCTGCGGTCCCGTCGGATAATTATTTAATTTTTGAAATTTAAGTTTATTTTTACTCACATTTATAATATTTCTGTCTTTCCTACCATGTATCCATCTTTTCTTAGGAAGAATGTGAATATTATCCATTTCAAAAATAAAAAACAAGTTTTTGGAAACTATTTTGATTTGGTTTATAAAATTACAGGTAAGACATAGATATCCAAGAAATAACCTGAGGTAAATTAGGTGTTGTAGGCTTAAATTTCAAAAATAAAAAACTAAAAGCAAAATGGTTATGAAACGGGATGTACATAACTCGCCAGTGGATGCAGCCAACCCATCATTAGTGAACCAAGTAAATCTCTATCCGTTTTCTATTTTTCTGTTTGATTGTTTTTCCGTTTTCCTAACAGTTTATAACTTATAAATGGATTGTTTTTTTCTAGTTCCGGAGGTTTCAGTTATAAAGAAGAGTACTGATCATTGTTTTTTGTAATAGGTCACTTCCACCACTGGCCAAGAAGTTTGTTATACAAATGTTGTACATTGATGCTCCAGTAACAGCCAAGTCCATGGAGGAGTGGGTGCTTCCAGATGGAGTCTCGAAGTTTAAGGTTGCTGTCGATCGATTGATTCAATTGCGAGTATTTATTGAGACTGTGGATAGGTGTTTGTTCTCTTCTCTCGTTTTCTTAGAATTCGACATTCTTTCACAATGCTTTTAGATGCATTCTAGTTAAATAGGAAAGCTGAAATTTAAGTGTTCAAGAGTAAGGCCAAGTTTGTTGGGTTTGTTTTATCTGAGGTTTCGTTTCCATCAGTAGATAGATTACGTACATGTTCATATCACGATATCATCGGCATTGGAAGGTTTATGCAAGAGAAATTTAATTAAGAACTTGTGGTGCAATATGTAAATTCTTTTGCTTTGTTTATGCTTTGATCTCATTGATATCGTTTATAATTCTATCAGTAGTTCTGCTTGGCTTGGTTTATAGTTCTCAATCTAATCCTATATTGTAATATTTTCTATGTATCCCTGTTTTATTGATAGGAAAAGAGAAACAACCTACAGGCTAAATCCAACATTCCAAGCCAACCTCCAAAAGCTTTTAATACATGGGTAGGCTTTCTCTCACTTGGCTCATTAAATGCATTTTTAAACCAGATATGCAAGGAGGGATGTATGATTTCTTTAATAATTAATACTCAGCACAAACATTCTTGTACATGGATCTTTCTTAATATATGCACATTTTCTTTCGACTCGCTACCTTATCTGTGTTCATCAAGTTCTACTAACCTTCTATAAAAAGAAGCTGATCCTGGAGTTAGTCTCATTGATTTTAACATTTTAAACTGTTTTACTATTGGTTGTTTAGAGAATGACCTTGAATTTGGGATCTTGCAGTGAAGTTCTAGCCAGAGAACCAATGCCTTCTAATATAACCGTGAGGCTTCCAAGTCTGGAAGAGC
TTGAGGCTTATGCTCTCGGTCAATGGGAGGTTGAACATTATCAAACTTTTCAGTTGTTTATTATATGATTTTTGATGGTGAATATTCTTCTGACTTGCTGTTATGTTAATTCAGTGCTTCTTGCTACAATTGATAAACTCAGGCCAAGCTGAGAAGCCATCAAATATTAGTTCTTCTATGATGAAAGTTTTCCAGAAAGGCCTTTTAAGTCAAAGGTTAAAAATTTTGTTCAGTGTATACTAGCTTATAGACAGCAATCACTTCTTATATCTGATAAAATTTCAGTATGGTATCCAGGGATAAAGAAGCTCCACGATTAACTGCGAGTGGTTTTCAGTTCTTGGTATGAATCACATGAACCATACACCTTTTCAAATACTCACTTATCTGTCTCACCAGGGGTTCTTGTGATATATGTAAACTTTAGTTGATGGAAACAAATGCACAACTTTGGTATATTATCAGAGAATATATATCTAACTCCGAGGTTTGTGTTTTTATTTTTTATTGTTATATTCTATTGCTTTTCTCCGAGGAAAACTGTATATTTTCTGTCATCAATTTGAGAACAATTTCTATTTGAATCCAAATTATGAACTATTCTTATCAAAAGAAAAGACAACTATTGTCAATTGTAAGGCTATTTTCTTTTAACACGTTTATGTTTATATTAGTTGGTCCGTGGAGATTTGTTATTAATAACTCATATTCCTCTTTTGTTTTTTGTTTTTTTGAAAAAGATATTTCTCCTTTAATAATGCTCTATCAATATGGTGCAGGAGCGAGGTGTGGATCCCGCAGATTTGATTTCTTTTATGCTAGAGCTTAGTTTTCACGTTACAGGAGAGGTATCAATTCTTGGATCATTTCAATTTCTTATTTTTCATCTTTTTCTTGTATTTTTTTATTAGTTATCTCACTACAGAACTATTTTTTAAGCCTTGTTGCTTACTACTTTGGGTTGGTTCTGAAGTATAAATTCTTGCACAAGATTGATTATATTTAGTTTGTTATTTGAAAACCCATATAAGGGGAAAAAATCGTAATGTTACATAGAAGCCTATCGAAACTACAATAAAGGTCAATAAATGAGCAAAATACAAAGATAAATCGGTAAATACAAACTCGTAAATATGAGATACGAAGAAACGTGTGGGTATTGGGTGAACTTGTAGGTCATTGTCAACGTAGTCATGAACTTGTAGGTCCTTTATATTTTGCTCGCAGATGAGAAAAATGGTACTTGTGTGTCAAGAACCGATTCCATCAGATATGCCATATGACTAGAGGAAGGAGAAACAACCAAAATTAAGGGGAATTCAAAGAGACACCAACTTAGGGGATGTTGAGGATCCCACATTGAAAAAGTGGGGATACATCACAACATTTTGGTGCCTTTTTGGGATCCTCAACATGCCCCCAAGTTGGTGCCTCTTTGGATTCATCATTTATGAATCGAATCTCAATTTATTTTTATTGGACGAAATACCTGTTTGGACTACATCGGCTCTGATACAATATTAGATAATACGAAGTTCCATCTCAAAACTAATTGCTAATTGAGTAGTTCATCAATCTTATAAATGTTGTAAAATAACCGATTTTCCTTCCACTCCTATAGCTTCAAGTGGCTGTTTACCTGTTCACAGATTCTTCCCTTTGAAACTCAAGTAAGAAAAAAAATATCGAACACACAAGCTAGGGATGCCACAATTAACTCTAAAACTCTTCTCATCCCTCTCGCTCGTCTCATAATCAATAACAATTGTACACTAATCGTCTGAAAACTATAACAAACTCTAATTATCCATATAACTGTTAGAAAACTTGTGCATATACTATTAACTGCACAATGCCAACATGGTTCTTCTTTGTTCTGTAGGCTTATGATATTGATACACTGACAGAAGAACAGAGATATGCTATCAAGGACCTTGCGGATCTAGGACTGGTTAAGCTTCAGCAGGTATTCGATTCGCCTCTGCTACTTTTTAAATTGA
ACATGACCTTTGCTTTAGTTATTTTATTTTAACTTCACTTTTTTTCCTTTTGTTGTGTGTGGGTGTGGGGTACAAGAAAGCTCTCCAATTAGAGATCAATGAAAAAACCTATAATCGATAAAAAAAAAAAAGGGTTTACATTTGCACAAGCATGGAAATAATCGAACCTATAAAGTTGTCTAAGCTTGAGAAAGCGTCTGTGAAAATTCTTGGTTTTCGCTGTTAATTGTTTCCAGGGTAGAAAAGACCGCTGGTTCATACCTACTAAATTAGCCACAAATCTTTCAATGAGTTTGGCAGATTCTTCTTCTAGGAAACAGGTGAGTGTTTCTGAATTATATTTAAATTTGATTAATTTCTGTTAGGTATCCAAGTATAGAATATATATATATATATATGAAAACTGCAAAGATAACTTTATTGCAATCTCAATGAGGAAATTACAATATCAATAGTCTAATTAATAAGGACGATTCACGATCTTCCACAATACAAATGCATAAACTCTTCTCAAGGCTAATAATTGTCTAACTAATCCGTTCCCAATACTAATAAATGCCTAATCATTTCGGACTCAATTTCTCTAGGTTCATTTTCTTTACGGCTACTAAGTTTTTGAGTGGTTTTTTATTTTTGCATAATAGCTTTTCCACAATTGTTTTATAGCTATTGATATACGCTATAGTATCCTCTGCTATATATATATATATATTATCCTCGTTACTTCCGTATTTTTTATTTGCAGATCCTTGTTTCTCGTATTATTAACTCAGTCCATTAATCCCTTTTTTTTAGTTTTCTATAGGCTAGTTGGAGGCTCTGATGAACGTTTTTGTTCTTCCTAATTTCTTCCTGTTGTATACCCAGGGTTTTGTCGTCGTGGAGACAAATTTCAGGCTGTATGCTTATTCTTCTTCCAAACTACATTGTGAAATATTACGTCTTTTTTCAAGGTACGTGTCATGATTTAATGTTTTATTACATTATATAACCAATGTTGAACTAGTTAGTTTTAATGACTCTGGATTGACCTATTGGATAAACCTGTGGCCCAAATTTCACACAACCAGGGCCAATGTAAACAGCAAAGGGTTAGAAGGAAAAATTGTGGTAACCACTTACACAAGATTAATATCCTACCTTGACAACTAAATTTAGTAGGGTCTAAGAGAATAGTTGAAATGTGTGCAAGATGGCCGAACACTGAAGAATATTAAAAAAAAATTACAAAGTCGATGGGTTCTGGATACATTTAATATTGACATAAAAAGTTTGCCTTTTCTTCTTCTTGAATGCTAAATATATTTGAGTTCCCCACCCCTGATAAACCCCTCTTGGAACTTCACATATGTGAGGAGGAAGATGTTAAGACACCTCTAATAATCAAACATACAATAGAACATAACAATATAATAGAGATTACAATGTCAACAGCCTTGCAAGAGGGCTATGTCTCTCTCAAAATCTCTACCAAAGACAATACCAAAACATTCTCTCCTTGTTTAGAACACAAACTTCCCTATATGACTAATACGCTAATAACTAATTACTACTATGCCATTCCTATGGGGTGACATAGTGGTTGGAGAGTTGGGCTTTGAGGGTATGCTCCCCTTAAGGTCCCGGATTCGAAACTCACCTGTGACATTACTCCTTCGATGGCTCCGGTGCCTGACCTAGGGATAGGTGTGGTTACCCTTGTTTAAAAAAAAAAGCTAATTACTACTATGCCCTTATTAATATAATACTAAAACTCCAAACCTACCCTTCCTTAGTACTCTTAACAGAAGATAGATTCTGTAAGGTCATTGGAAGGGCAAGCTGAGGGATAACTTGGCCATTCGTAGTTTCTTGAGAGTGCAAGCATTAAGCAGCTAGTGATAGTGTAAGGTCAGTAGGGCCAAGCATAGAATACTGTTACATAATTTGTTAATGATAATTCGATAATTATAGTTTTATACTTGTGCAAATTAGAAATGCCGTTACTTTTGTTACTATTG
ATAATTTTTATTTTACTTGTGTCAAGTTTTTACAACAAATTCGTTTTATTATAAAAGGTTTTTTTTTTAATTTTTTATAATTGTATTCTTTTCTTAAGCATTGCACATGTAATAGCCGTCGATTAAATAGGGTTGTGTTTGTTTTTGCGAAAGGAAAGATTCAAGTTCAACATTATACTATTTACCTACTTGCATAATGAATGCGGTCATTGCCTTAATAGATAATGCAGCAATCTGATCACACATCAATTTTCTATTTGGTTCAAGAATTATAAAATAAATTCTGTAACTGTCATGCAGGATTGAGTATCAACTTCCAAATCTTACCGTTGCAGCAATAACGAAAGAAAGCTTGTATAATGCTTTTAAGAATGGAATTACGGCAGATCAGGCATGTTTCTTGTCACCCTGGCATATGTTACTCTTAGTCTTATGAATTAAGGATACCATTATGTAATTTTCATATTGTAAATAGAATTTAGTAGATCAGATCTTCTTTGCGTTTTATTAGTTCATAATATCTTGCATTCTTAGTGTTCATTTTTGCTAATGGTGGAACTTTTTTTTTGAAACTGGGAAATCGGAGCTTCGCTCTACTATACCCGAGGTAACCGGGAGTTATTCCATTTTATTTTTTTGCACAAGAAGTGTTGGAGCATGGGGATTTGAGCCTCAAGATTATTTCTAAATCATCATAGGTTTTTGGCCTAGTGGTTAGGTCTTGAAAATCCAAGAAGTTATGAGTGCAATTCATGAGAATCCAAGAGGTTCAGAATCATAATTTTCTACGAATATATCAAATGTTGTAATGTTAGACAGTATGCCCTGCAATAATAGTCGAGGTACGTGGAATTTGGCTCCAACACTCAGATTAAAAAAAAAAGGATAATTTCTAGTAAGCAGATATCAATTGTCAATCCCATTATCAGTCAGTCACCGAGGGACTTCAACTAGCTTTGGTTCCTAATTCACAAAAAAAAAAAACCTATTTTTCTTGCGTGCATCAATCCCAATTCTCTCATAAACTTAATGTTAAACTCCGAATAATTCTTTATGTTATGAAAACGGTAACAACAATCAAACAGAAAACTATAAAGACAGAATACATAGAATTTACGTGGTTCACTAATGGTGTGTTAGCTCATCCACGGGCAGAGAAGAGAACTTTTTATTAAGGGGAGAACAAATAAAGAATGTCAATCGGAGAGGGCGAAGGATTTAAAAGAGTTTATATAGTGCACTCTCATAAAACTCTATTATAGAATAGACCCAAGTACAAAAACCTAATTACATATTTCATCTAGGTTTCGGGGCATTGCCCCCGAGCCCCCACCATGGGCCTCGCCCTTGGGACCTCGACTCGCTGACCGAGTAGGGAGACCCCGACTATGTAATCATGGGGCATCAAAAGGCATATCTCAACACTTTCAGAGTTGAAAACAATTAAAACGAGCAACCCATAGGGTTTTTTTGGAGTATGACAACAATAGGGGGATGGGAATTCAAACCCCTAACCTCTTGGTTTATGGGCGAACATTCATGTTGTTGAGCTATTGTTATGATTGGCAATCCATAGGTGAATATTACATTAAATATTAATTGATTTTCCATGCGGGTTGTCTCATTTCAAATTGCTTAAGATCTGACTGCTTTTTTATTTTGCAGATGGTTACTTTTCTACAGCAGAATGCACATCCTCGTGTTGCAGAGAGAATACCATCAGTCCCTGAAAATGTCACGGATCAGGTTAGGATTGCTTTATTAAGTTCTTTGATGTTTGGTAAGGCATTAGCTTCTATCTGTAAATCACCTGAAATTTTTTTACATTTGGTTCTCATGAGGCAATTTGATCAAATCTTTGTCCTGTTCCTGATTTTCCCCTTGCTTTCTATTCCATGATAGATTAGGTTATGGGAATCAGATCTTAATCGGGTCGAATTTACTTCCGCACATTTTTACGATGAGTTTCCTTCCAAGGTGCGAATTTCAAATGTTGCTTC
TTTTCACTTTGAGGATTATTTGTACATTCATTTGAAAAATTGCCTGAGAATGAACTTTGGCATGTCAGGATGTTTTCGAGGCTGCTTGCGACTATGCACGAGAATGGAATGGGCTATTATGGGAGGACTCGAAAAGCATGCGATTGGTAGTGAAGGCAGACGTTCACACGCATATGCGCGAACATCTTCGCCGCCAAAAGTAGATCGTTAAGAGATGATACAATGCTGCTGCCAAGATCTGTTTATAAACCATGGCCTCTGCTACTTCAGTTTTTGTTGCTTCAGGGGGCATGGCTTAAAGTATGAAGAACAGTGAGAATTTTGTGGGGAAAGAATAGTGTCAGTGACTGTTCATTATGGTGTAGGCTTATTTTGCTAAGTCAAATTCCCCTTTTTTTTAGTACAACAACAATCTTTGGGGGCTAGGGGATTTGACGATGCGGCCTCTAGTCGAGAATACATGCCAATTACCATTGAGTTTAGTCCTCTTTGGCATCAATTTCCCTTAAAGATACAGTAAGCTCTGCTAGGTTGATTTCCCTTCTGGGCATAGATTTCATTCTAACACTTGTTTATAACTGGGACTAGACATTCTAGAGGAAATATAAGAATAATGACATGTATTTTTTTGAACGTGGAAA

mRNA sequence

ATTCATTTAGGTTTGTGGATAGTTGTTTTGGTCGAAGACCGGTAATCCAATTGAACCCTTAGAAGTGATATTAAGGAAAAAATGATGGAAAATAAATGCATAGGTATCGTCAATTGATCTCTCCAGCTCCGGTACGAGTGTTTACAAATTTCATCTCTGAATTTCTTCCCGATTTGAGTTCCACCAAAGAATTCCTCAAAAGAAGAAAAGATCGATTGATTGAGAAAATGCCTCAAGTAAAGATCATAGCGAAGAATTTCATGGACATGGTGGCCTCCTTGCCCGCCATGAAGCTCGACAAGCTCTATCACAACGCGTTCATCTGCGAAGCCATTCTCAGGTCACTTCCACCACTGGCCAAGAAGTTTGTTATACAAATGTTGTACATTGATGCTCCAGTAACAGCCAAGTCCATGGAGGAGTGGGTGCTTCCAGATGGAGTCTCGAAGTTTAAGGTTGCTGTCGATCGATTGATTCAATTGCGAGTATTTATTGAGACTGTGGATAGGAAAAGAGAAACAACCTACAGGCTAAATCCAACATTCCAAGCCAACCTCCAAAAGCTTTTAATACATGGTGAAGTTCTAGCCAGAGAACCAATGCCTTCTAATATAACCGTGAGGCTTCCAAGTCTGGAAGAGCTTGAGGCTTATGCTCTCGGTCAATGGGAGTGCTTCTTGCTACAATTGATAAACTCAGGCCAAGCTGAGAAGCCATCAAATATTAGTTCTTCTATGATGAAAGTTTTCCAGAAAGGCCTTTTAAGTCAAAGGGATAAAGAAGCTCCACGATTAACTGCGAGTGGTTTTCAGTTCTTGTTGATGGAAACAAATGCACAACTTTGGTATATTATCAGAGAATATATATCTAACTCCGAGGAGCGAGGTGTGGATCCCGCAGATTTGATTTCTTTTATGCTAGAGCTTAGTTTTCACGTTACAGGAGAGGCTTATGATATTGATACACTGACAGAAGAACAGAGATATGCTATCAAGGACCTTGCGGATCTAGGACTGGTTAAGCTTCAGCAGGGTAGAAAAGACCGCTGGTTCATACCTACTAAATTAGCCACAAATCTTTCAATGAGTTTGGCAGATTCTTCTTCTAGGAAACAGGGTTTTGTCGTCGTGGAGACAAATTTCAGGCTGTATGCTTATTCTTCTTCCAAACTACATTGTGAAATATTACGTCTTTTTTCAAGGATTGAGTATCAACTTCCAAATCTTACCGTTGCAGCAATAACGAAAGAAAGCTTGTATAATGCTTTTAAGAATGGAATTACGGCAGATCAGATGGTTACTTTTCTACAGCAGAATGCACATCCTCGTGTTGCAGAGAGAATACCATCAGTCCCTGAAAATGTCACGGATCAGGTTATGGGAATCAGATCTTAATCGGGTCGAATTTACTTCCGCACATTTTTACGATGAGTTTCCTTCCAAGGATGTTTTCGAGGCTGCTTGCGACTATGCACGAGAATGGAATGGGCTATTATGGGAGGACTCGAAAAGCATGCGATTGGTAGTGAAGGCAGACGTTCACACGCATATGCGCGAACATCTTCGCCGCCAAAAGTAGATCGTTAAGAGATGATACAATGCTGCTGCCAAGATCTGTTTATAAACCATGGCCTCTGCTACTTCAGTTTTTGTTGCTTCAGGGGGCATGGCTTAAAGTATGAAGAACAGTGAGAATTTTGTGGGGAAAGAATAGTGTCAGTGACTGTTCATTATGGTGTAGGCTTATTTTGCTAAGTCAAATTCCCCTTTTTTTTAGTACAACAACAATCTTTGGGGGCTAGGGGATTTGACGATGCGGCCTCTAGTCGAGAATACATGCCAATTACCATTGAGTTTAGTCCTCTTTGGCATCAATTTCCCTTAAAGATACAGTAAGCTCTGCTAGGTTGATTTCCCTTCTGGGCATAGATTTCATTCTAACACTTGTTTATAACTGGGACTAGACATTCTAGAGGAAATATAAGAATAATGACATGTATT
TTTTTGAACGTGGAAA

Coding sequence (CDS)

ATGCATAGGTATCGTCAATTGATCTCTCCAGCTCCGGTACGAGTGTTTACAAATTTCATCTCTGAATTTCTTCCCGATTTGAGTTCCACCAAAGAATTCCTCAAAAGAAGAAAAGATCGATTGATTGAGAAAATGCCTCAAGTAAAGATCATAGCGAAGAATTTCATGGACATGGTGGCCTCCTTGCCCGCCATGAAGCTCGACAAGCTCTATCACAACGCGTTCATCTGCGAAGCCATTCTCAGGTCACTTCCACCACTGGCCAAGAAGTTTGTTATACAAATGTTGTACATTGATGCTCCAGTAACAGCCAAGTCCATGGAGGAGTGGGTGCTTCCAGATGGAGTCTCGAAGTTTAAGGTTGCTGTCGATCGATTGATTCAATTGCGAGTATTTATTGAGACTGTGGATAGGAAAAGAGAAACAACCTACAGGCTAAATCCAACATTCCAAGCCAACCTCCAAAAGCTTTTAATACATGGTGAAGTTCTAGCCAGAGAACCAATGCCTTCTAATATAACCGTGAGGCTTCCAAGTCTGGAAGAGCTTGAGGCTTATGCTCTCGGTCAATGGGAGTGCTTCTTGCTACAATTGATAAACTCAGGCCAAGCTGAGAAGCCATCAAATATTAGTTCTTCTATGATGAAAGTTTTCCAGAAAGGCCTTTTAAGTCAAAGGGATAAAGAAGCTCCACGATTAACTGCGAGTGGTTTTCAGTTCTTGTTGATGGAAACAAATGCACAACTTTGGTATATTATCAGAGAATATATATCTAACTCCGAGGAGCGAGGTGTGGATCCCGCAGATTTGATTTCTTTTATGCTAGAGCTTAGTTTTCACGTTACAGGAGAGGCTTATGATATTGATACACTGACAGAAGAACAGAGATATGCTATCAAGGACCTTGCGGATCTAGGACTGGTTAAGCTTCAGCAGGGTAGAAAAGACCGCTGGTTCATACCTACTAAATTAGCCACAAATCTTTCAATGAGTTTGGCAGATTCTTCTTCTAGGAAACAGGGTTTTGTCGTCGTGGAGACAAATTTCAGGCTGTATGCTTATTCTTCTTCCAAACTACATTGTGAAATATTACGTCTTTTTTCAAGGATTGAGTATCAACTTCCAAATCTTACCGTTGCAGCAATAACGAAAGAAAGCTTGTATAATGCTTTTAAGAATGGAATTACGGCAGATCAGATGGTTACTTTTCTACAGCAGAATGCACATCCTCGTGTTGCAGAGAGAATACCATCAGTCCCTGAAAATGTCACGGATCAGGTTATGGGAATCAGATCTTAA

Protein sequence

MHRYRQLISPAPVRVFTNFISEFLPDLSSTKEFLKRRKDRLIEKMPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTAKSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVLAREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLSQRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFLQQNAHPRVAERIPSVPENVTDQVMGIRS
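
The protein sequence above corresponds to the translation of the coding sequence. As a rough sketch of that relationship, the snippet below translates a DNA string codon by codon. It assumes the standard genetic code (the page does not name a translation table), and `translate` is a hypothetical helper name; only the first and last few codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code (an assumption), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First ten and last five codons of the CDS shown above.
print(translate("ATGCATAGGTATCGTCAATTGATCTCTCCA"))
print(translate("GGAATCAGATCTTAA"))
```

The two fragments translate to `MHRYRQLISP` and `GIRS`, matching the start and end of the listed protein sequence.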
Homology
BLAST of Sed0003361 vs. NCBI nr
Match: XP_022977542.1 (RNA polymerase II transcription factor B subunit 2 [Cucurbita maxima])

HSP 1 Score: 717.6 bits (1851), Expect = 6.4e-203
Identity = 362/383 (94.52%), Positives = 374/383 (97.65%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPAMKLD+LY NAFICEAILRSLPPLAKKFV+QMLYIDAPVTA
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
           KSMEEWVLPDGVSK+KVAVDRLIQLRVFIET DRKRETTY+LNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIHGEVL 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
           AREPMPSNITVRLP+LEELEAYAL QWECFLLQLINSGQA+KPSNISSS+MKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDKE PRLT SGFQFLLMETNAQLWYIIREYISN+EERGVDPADLISF+LELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKD WFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           VETNFR+YAYSSSKLHCEILRLFSRIEYQLPNL V AITKESLYNAFKNGITA Q+VTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQNAHPRVAERIPSVPENVTDQ+
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQI 383
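
The percentages in an HSP summary line follow from the match counts divided by the alignment length. The sketch below reproduces the two figures for this first hit; `percent` is a hypothetical helper name, and two-decimal rounding is assumed because that is what the report displays.

```python
def percent(matches, alignment_length):
    """Format matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100.0 * matches / alignment_length:.2f}"


# Counts taken from the HSP summary for XP_022977542.1.
print("Identity :", percent(362, 383))   # identical residues
print("Positives:", percent(374, 383))   # identical + conservative substitutions
```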

BLAST of Sed0003361 vs. NCBI nr
Match: KAG6604444.1 (General transcription and DNA repair factor IIH subunit TFB2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 716.1 bits (1847), Expect = 1.9e-202
Identity = 364/393 (92.62%), Positives = 379/393 (96.44%), Query Frame = 0

Query: 35  KRRKDRLIEKMPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQ 94
           +RRK +   +MPQVKIIAKNFMDMVASLPAMKLD+LY NAFICEAILRSLPPLAKKFV+Q
Sbjct: 24  ERRKAQ--SRMPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQ 83

Query: 95  MLYIDAPVTAKSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANL 154
           MLYIDAPVTAKSMEEWVLPDGVSK+KVAVDRLIQLRVFIET DRKRETTY+LNPTFQANL
Sbjct: 84  MLYIDAPVTAKSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANL 143

Query: 155 QKLLIHGEVLAREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSM 214
           QKLLIHGEVLAREPMPSNITVRLP+LEELEAYAL QWECFLLQLINSGQA+KPSNISSS+
Sbjct: 144 QKLLIHGEVLAREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSV 203

Query: 215 MKVFQKGLLSQRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFM 274
           MKVFQKGLLSQRDKE PRLT SGFQFLLMETNAQLWYIIREYISN+EER VDPADLISF+
Sbjct: 204 MKVFQKGLLSQRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERDVDPADLISFL 263

Query: 275 LELSFHVTGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLAD 334
           LELSFHVTGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKD WFIPTKLATNLSMSLAD
Sbjct: 264 LELSFHVTGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLAD 323

Query: 335 SSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNG 394
           SSSRKQGFVVVETNFR+YAYSSSKLHCEILRLFSRIEYQLPNL V AITKESLYNAFKNG
Sbjct: 324 SSSRKQGFVVVETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNG 383

Query: 395 ITADQMVTFLQQNAHPRVAERIPSVPENVTDQV 428
           ITA Q+VTFLQQNAHPRVAERIPSVPENVTDQ+
Sbjct: 384 ITAQQIVTFLQQNAHPRVAERIPSVPENVTDQI 414

BLAST of Sed0003361 vs. NCBI nr
Match: TYK13130.1 (RNA polymerase II transcription factor B subunit 2 [Cucumis melo var. makuwa])

HSP 1 Score: 715.3 bits (1845), Expect = 3.2e-202
Identity = 360/383 (93.99%), Positives = 376/383 (98.17%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPAMKLD+LY NAFICEAILRSLPPLAKKFV+QMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
           KSMEEWVLPDGVSK+KVAVDRLIQLRVFIET DRKRETTYRLNPTFQANLQKLLIHGEV+
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVV 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
           AREPMPSNITVRLPSLE+LEAYAL QWECFLLQLINSGQAEKPSNISSS+MKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDKEAPRLT SGFQFLLMETNAQLWYIIREYISN+EERGVDPADLISF+LELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AYDIDTL++EQRYAIKDLADLGLVKLQQGRK+ WFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           VETNFR+YAYSSSKLHCEILRLFSRIEYQLPNL V AITKESLYNAFKNGITA+Q+VTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQNAHPRVAERIPSVPENVTDQ+
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQI 383

BLAST of Sed0003361 vs. NCBI nr
Match: XP_023543805.1 (RNA polymerase II transcription factor B subunit 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 714.9 bits (1844), Expect = 4.1e-202
Identity = 361/383 (94.26%), Positives = 373/383 (97.39%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPAMKLD+LY NAFICEAILRSLPPLAKKFV+QMLYIDAPVTA
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
           KSMEEWVLPDGVSK+KVAVDRLIQLRVFIET DRKRETTY+LNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIHGEVL 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
           AREPMPSNITVRLP+LEELEAYAL QWECFLLQLINSGQA+KPSNISSS+MKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDKE PRLT SGFQFLLMETNAQLWYIIREYISN+EER VDPADLISF+LELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERDVDPADLISFLLELSFHVTGE 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKD WFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           VETNFR+YAYSSSKLHCEILRLFSRIEYQLPNL V AITKESLYNAFKNGITA Q+VTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQNAHPRVAERIPSVPENVTDQ+
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQI 383

BLAST of Sed0003361 vs. NCBI nr
Match: XP_038882460.1 (general transcription and DNA repair factor IIH subunit TFB2 isoform X1 [Benincasa hispida])

HSP 1 Score: 713.8 bits (1841), Expect = 9.2e-202
Identity = 360/383 (93.99%), Positives = 374/383 (97.65%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLP MKLD+LY NAFICEAILRSLPPLAKKFV+QMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPPMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
           KSMEEWVLPDGVSK+KVAVDRLIQLRVFIET DRKRETTYRLNP FQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPMFQANLQKLLIHGEVL 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
           AREPMP+NITVRLPSLEEL+AYAL QWECFLLQLINSGQAEKPSNISSS+MKVFQKGLLS
Sbjct: 121 AREPMPANITVRLPSLEELKAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDKEAPRLT SGFQFLLMETNAQLWYIIREYISN+EERGVDPADLISF+LELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AYDIDTLT+EQRYAIKDLADLGLVKLQQGRKD WFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLTDEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           VETNFR+YAYSSSKLHCEILRLFSRIEYQLPNL V AITKESLYNAFKNGITA+Q+VTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQNAHPRVAERIPSVPENVTDQ+
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQI 383

BLAST of Sed0003361 vs. ExPASy Swiss-Prot
Match: Q680U9 (General transcription and DNA repair factor IIH subunit TFB2 OS=Arabidopsis thaliana OX=3702 GN=TFB2 PE=2 SV=1)

HSP 1 Score: 611.3 bits (1575), Expect = 8.4e-174
Identity = 302/383 (78.85%), Positives = 342/383 (89.30%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPA+KLDKLY+N FICEAILRSLPPLAKK+V+QMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
             MEEWVL DG SK +VA+DRLIQLR+F E  DRKR T+Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
            REPM S+  ++LPSL+ELE YAL QWECFLLQLINSGQ EK + ISSSMMK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDK+ PRLT SGFQFLLM+TNAQLWYIIREYI N+EER VDPADLISF+LELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AY+++TLTE Q   +KDLADLGLVKLQQGRKD WFIPTKLATNLS+SLADSS+RK+GFVV
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           +ETNFR+YAYS+SKL CEILRLF+RIEYQLPNL   AITKESLYNAF NGIT+DQ++TFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQN+HPR A+R+PS+PENVTDQ+
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQI 383

BLAST of Sed0003361 vs. ExPASy Swiss-Prot
Match: Q92759 (General transcription factor IIH subunit 4 OS=Homo sapiens OX=9606 GN=GTF2H4 PE=1 SV=1)

HSP 1 Score: 223.0 bits (567), Expect = 6.5e-57
Identity = 129/385 (33.51%), Positives = 220/385 (57.14%), Query Frame = 0

Query: 47  QVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTAKS 106
           +V +  +N  + +  L    LD+LY +   C A+ R LP LAK +V++ML+++ P+   +
Sbjct: 11  RVHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAA 70

Query: 107 MEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQ-KLLIHGEVLA 166
           +  WV  +     + +   L  LR++   +         LNP F+ NL+  LL  G+  +
Sbjct: 71  VALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWS 130

Query: 167 REPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLL-S 226
            +            +  L+ YA  +WE  L  ++ S  A    +++  +    Q GL+ S
Sbjct: 131 DDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKS 190

Query: 227 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 286
               E P +T++GFQFLL++T AQLWY + +Y+  ++ RG+D  +++SF+ +LSF   G+
Sbjct: 191 TEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGK 250

Query: 287 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLS--MSLADSSSRKQGF 346
            Y ++ +++     ++ L + GLV  Q+ RK R + PT+LA NLS  +S A  +  + GF
Sbjct: 251 DYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGF 310

Query: 347 VVVETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVT 406
           +VVETN+RLYAY+ S+L   ++ LFS + Y+ PN+ VA +T+ES+  A  +GITA Q++ 
Sbjct: 311 IVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIH 370

Query: 407 FLQQNAHPRVAERIPSVPENVTDQV 428
           FL+  AHP + ++ P +P  +TDQ+
Sbjct: 371 FLRTRAHPVMLKQTPVLPPTITDQI 391

BLAST of Sed0003361 vs. ExPASy Swiss-Prot
Match: P60027 (General transcription factor IIH subunit 4 OS=Pan troglodytes OX=9598 GN=GTF2H4 PE=3 SV=1)

HSP 1 Score: 223.0 bits (567), Expect = 6.5e-57
Identity = 129/385 (33.51%), Positives = 220/385 (57.14%), Query Frame = 0

Query: 47  QVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTAKS 106
           +V +  +N  + +  L    LD+LY +   C A+ R LP LAK +V++ML+++ P+   +
Sbjct: 11  RVHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAA 70

Query: 107 MEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQ-KLLIHGEVLA 166
           +  WV  +     + +   L  LR++   +         LNP F+ NL+  LL  G+  +
Sbjct: 71  VALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWS 130

Query: 167 REPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLL-S 226
            +            +  L+ YA  +WE  L  ++ S  A    +++  +    Q GL+ S
Sbjct: 131 DDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKS 190

Query: 227 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 286
               E P +T++GFQFLL++T AQLWY + +Y+  ++ RG+D  +++SF+ +LSF   G+
Sbjct: 191 TEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGK 250

Query: 287 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLS--MSLADSSSRKQGF 346
            Y ++ +++     ++ L + GLV  Q+ RK R + PT+LA NLS  +S A  +  + GF
Sbjct: 251 DYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGF 310

Query: 347 VVVETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVT 406
           +VVETN+RLYAY+ S+L   ++ LFS + Y+ PN+ VA +T+ES+  A  +GITA Q++ 
Sbjct: 311 IVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIH 370

Query: 407 FLQQNAHPRVAERIPSVPENVTDQV 428
           FL+  AHP + ++ P +P  +TDQ+
Sbjct: 371 FLRTRAHPVMLKQTPVLPPTITDQI 391

BLAST of Sed0003361 vs. ExPASy Swiss-Prot
Match: O70422 (General transcription factor IIH subunit 4 OS=Mus musculus OX=10090 GN=Gtf2h4 PE=1 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 1.4e-56
Identity = 128/379 (33.77%), Positives = 217/379 (57.26%), Query Frame = 0

Query: 53  KNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTAKSMEEWVL 112
           +N  + +  L    LD+LY +   C A+ R LP LAK +V++ML+++ P+   ++  WV 
Sbjct: 18  RNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAVALWVK 77

Query: 113 PDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQ-KLLIHGEVLAREPMPS 172
            +     + +   L  LR++   +         LNP F+ NL+  LL  G+  + +    
Sbjct: 78  KEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPVFRQNLRIALLGGGKAWSDDTSQL 137

Query: 173 NITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLL-SQRDKEA 232
                   +  L+ YA  +WE  L  ++ S  A    +++  +    Q GL+ S    E 
Sbjct: 138 GPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSTEPGEP 197

Query: 233 PRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGEAYDIDT 292
           P +T++GFQFLL++T AQLWY + +Y+  ++ RG+D  +++SF+ +LSF   G+ Y ++ 
Sbjct: 198 PCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVEG 257

Query: 293 LTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLS--MSLADSSSRKQGFVVVETN 352
           +++     ++ L + GLV  Q+ RK R + PT+LA NLS  +S A  +  + GF+VVETN
Sbjct: 258 MSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETN 317

Query: 353 FRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFLQQNA 412
           +RLYAY+ S+L   ++ LFS + Y+ PN+ VA +T+ES+  A  +GITA Q++ FL+  A
Sbjct: 318 YRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRA 377

Query: 413 HPRVAERIPSVPENVTDQV 428
           HP + ++ P +P  +TDQ+
Sbjct: 378 HPVMLKQNPVLPPTITDQI 392

BLAST of Sed0003361 vs. ExPASy Swiss-Prot
Match: Q54C29 (General transcription factor IIH subunit 4 OS=Dictyostelium discoideum OX=44689 GN=gtf2h4 PE=3 SV=1)

HSP 1 Score: 200.3 bits (508), Expect = 4.5e-50
Identity = 125/413 (30.27%), Positives = 224/413 (54.24%), Query Frame = 0

Query: 59  VASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTAKSMEEWVLPDGVSK 118
           +ASL +  L++LY + + C+AILRSLPP +K+++++ML +D        ++W     + +
Sbjct: 10  LASLDSKDLEELYKDPWTCQAILRSLPPRSKQYILKMLLVDT-YPLSLAKDWSTQASIQQ 69

Query: 119 FKVAVDRLIQLR-VFIETVDR--------------------------KRETTYRLNPTFQ 178
            K ++ +L  L+ +F++ +++                          + E T RLNP FQ
Sbjct: 70  HKESLKKLFDLKIIFLDKINKPIQPQQQQSSQQSSSQQQQQQQQQQQQTEQTIRLNPLFQ 129

Query: 179 ANLQKLLIH-GEVLAREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNI 238
            N+++ L+   +V+           + PS+++L++Y+  QWE  L  L  S    +PS +
Sbjct: 130 DNIKRSLVQVNQVIFSNNSSIKDNHKPPSIDDLDSYSKSQWEKVLYFL--SDDTVQPSKL 189

Query: 239 SSSMMKVFQKGLLSQRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEER----GVD 298
            S ++       L++++ +   +T+ GF+FLL +   Q+W ++  Y+ + E++       
Sbjct: 190 ISELL---LSSNLTKQEGDGLSITSEGFKFLLKDVYTQIWTLLIVYLDDLEKKKGKGSGS 249

Query: 299 PADLISFMLELSFHVTGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTK--- 358
             DL+SF+  LSF   G  Y +  L+E+Q+  +  L   GL+ ++       F PT+   
Sbjct: 250 RNDLLSFLFRLSFLNLGRGYLVSELSEQQKEYLFALKQFGLIYMRTD-SSILFYPTRLII 309

Query: 359 -LATNLSMSLADSSS-------RKQGFVVVETNFRLYAYSSSKLHCEILRLFSRIEYQLP 418
            L T  ++SL  S S       ++QG++V+ETN+RLYAY+SS L   +L LF ++ Y+LP
Sbjct: 310 SLTTGKTLSLIQSISSERTQTQKEQGYIVLETNYRLYAYTSSSLQISLLSLFVKMLYRLP 369

Query: 419 NLTVAAITKESLYNAFKNGITADQMVTFLQQNAHPRVAERIPSVPENVTDQVM 429
           NL V  IT+ES+  A  +GITADQ++ F++ N+HP  A     +P+ V +Q++
Sbjct: 370 NLAVGIITRESIRTALIHGITADQIIDFVRHNSHPNAANSGQPIPDVVAEQIL 415

BLAST of Sed0003361 vs. ExPASy TrEMBL
Match: A0A6J1IIT5 (RNA polymerase II transcription factor B subunit 2 OS=Cucurbita maxima OX=3661 GN=LOC111477843 PE=3 SV=1)

HSP 1 Score: 717.6 bits (1851), Expect = 3.1e-203
Identity = 362/383 (94.52%), Positives = 374/383 (97.65%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPAMKLD+LY NAFICEAILRSLPPLAKKFV+QMLYIDAPVTA
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
           KSMEEWVLPDGVSK+KVAVDRLIQLRVFIET DRKRETTY+LNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIHGEVL 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
           AREPMPSNITVRLP+LEELEAYAL QWECFLLQLINSGQA+KPSNISSS+MKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDKE PRLT SGFQFLLMETNAQLWYIIREYISN+EERGVDPADLISF+LELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKD WFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           VETNFR+YAYSSSKLHCEILRLFSRIEYQLPNL V AITKESLYNAFKNGITA Q+VTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQNAHPRVAERIPSVPENVTDQ+
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQI 383

BLAST of Sed0003361 vs. ExPASy TrEMBL
Match: A0A5D3CNX3 (RNA polymerase II transcription factor B subunit 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G007400 PE=3 SV=1)

HSP 1 Score: 715.3 bits (1845), Expect = 1.5e-202
Identity = 360/383 (93.99%), Positives = 376/383 (98.17%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPAMKLD+LY NAFICEAILRSLPPLAKKFV+QMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
           KSMEEWVLPDGVSK+KVAVDRLIQLRVFIET DRKRETTYRLNPTFQANLQKLLIHGEV+
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVV 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
           AREPMPSNITVRLPSLE+LEAYAL QWECFLLQLINSGQAEKPSNISSS+MKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDKEAPRLT SGFQFLLMETNAQLWYIIREYISN+EERGVDPADLISF+LELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AYDIDTL++EQRYAIKDLADLGLVKLQQGRK+ WFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           VETNFR+YAYSSSKLHCEILRLFSRIEYQLPNL V AITKESLYNAFKNGITA+Q+VTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQNAHPRVAERIPSVPENVTDQ+
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQI 383

BLAST of Sed0003361 vs. ExPASy TrEMBL
Match: A0A0A0KK55 (RNA polymerase II transcription factor B subunit 2 OS=Cucumis sativus OX=3659 GN=Csa_6G517270 PE=3 SV=1)

HSP 1 Score: 711.1 bits (1834), Expect = 2.9e-201
Identity = 358/383 (93.47%), Positives = 374/383 (97.65%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPAMKLD+LY NAFICEAILRSLPPLAKKFV+QMLYID PV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDGPVSA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
           KSMEEWVLPDGVSK+KVAVDRLIQLRVFIET DRKRETTYRLNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
           AREPMPSNITVRLPSLE+LEAYAL QWECFLLQLINSGQAEKPSNISSS+MKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDKEAPRLT SGFQFLLMETNAQLWYIIREYISN+EERGVDPADLISF+LELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AYDIDTL++EQRYAIKDLADLGLVKLQQGRK+ WFIPTKLATNLSMSLADSSSRK GFVV
Sbjct: 241 AYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTKLATNLSMSLADSSSRKLGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           VETNFR+YAYS+SKLHCEILRLFSRIEYQLPNL V AITKESLYNAFKNGITA+Q+VTFL
Sbjct: 301 VETNFRMYAYSTSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQNAHPRVAERIPSVPENVTDQ+
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQI 383

BLAST of Sed0003361 vs. ExPASy TrEMBL
Match: A0A6J1EGA7 (RNA polymerase II transcription factor B subunit 2 OS=Cucurbita moschata OX=3662 GN=LOC111433104 PE=3 SV=1)

HSP 1 Score: 711.1 bits (1834), Expect = 2.9e-201
Identity = 360/383 (93.99%), Positives = 372/383 (97.13%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPAMKLD+LY NAFICEAILRSLPPLAKKFV+QMLYIDAPVTA
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYGNAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
           KSMEEWVLPDGVSK+KVAVDRLIQLRVFIET DRKRETTY+LNPTFQANLQKLLI GEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIQGEVL 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
           AREPMPSNITVRLP+LEELEAYAL QWECFLLQLINSGQA+KPSNISSS+MKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDKE PRLT SGFQFLLMETNAQLWYIIREYISN+EER VDPADLISF+LELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERDVDPADLISFLLELSFHVTGE 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKD WFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           VETNFR+YAYSSSKLHCEILRLFSRIEYQLPNL V AITKESLYNAFKNGITA Q+VTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQNAHPRVAERIPSVPENVTDQ+
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQI 383

BLAST of Sed0003361 vs. ExPASy TrEMBL
Match: A0A5A7UBE2 (RNA polymerase II transcription factor B subunit 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold120G003070 PE=3 SV=1)

HSP 1 Score: 702.6 bits (1812), Expect = 1.0e-198
Identity = 361/408 (88.48%), Positives = 376/408 (92.16%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPAMKLD+LY NAFICEAILRSLPPLAKKFV+QMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
           KSMEEWVLPDGVSK+KVAVDRLIQLRVFIET DRKRETTYRLNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
           AREPMPSNITVRLPSLE+LEAYAL QWECFLLQLINSGQAEKPSNISSS+MKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 225 Q-------------------------RDKEAPRLTASGFQFLLMETNAQLWYIIREYISN 284
           Q                         RDKEAPRLT SGFQFLLMETNAQLWYIIREYISN
Sbjct: 181 QRLKSCSMYSSLYFARYLIKCYVWYSRDKEAPRLTESGFQFLLMETNAQLWYIIREYISN 240

Query: 285 SEERGVDPADLISFMLELSFHVTGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWF 344
           +EERGVDPADLISF+LELSFHVTGEAYDIDTL++EQRYAIKDLADLGLVKLQQGRK+ WF
Sbjct: 241 AEERGVDPADLISFLLELSFHVTGEAYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWF 300

Query: 345 IPTKLATNLSMSLADSSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTV 404
           IPTKLATNLSMSLADSSSRKQGFVVVETNFR+YAYSSSKLHCEILRLFSRIEYQLPNL V
Sbjct: 301 IPTKLATNLSMSLADSSSRKQGFVVVETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIV 360

Query: 405 AAITKESLYNAFKNGITADQMVTFLQQNAHPRVAERIPSVPENVTDQV 428
            AITKESLYNAFKNGITA+Q+VTFLQQNAHPRVAERIPSVPENVTDQ+
Sbjct: 361 GAITKESLYNAFKNGITAEQIVTFLQQNAHPRVAERIPSVPENVTDQI 408

BLAST of Sed0003361 vs. TAIR 10
Match: AT4G17020.2 (transcription factor-related)

HSP 1 Score: 611.3 bits (1575), Expect = 6.0e-175
Identity = 302/383 (78.85%), Positives = 342/383 (89.30%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPA+KLDKLY+N FICEAILRSLPPLAKK+V+QMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
             MEEWVL DG SK +VA+DRLIQLR+F E  DRKR T+Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
            REPM S+  ++LPSL+ELE YAL QWECFLLQLINSGQ EK + ISSSMMK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDK+ PRLT SGFQFLLM+TNAQLWYIIREYI N+EER VDPADLISF+LELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AY+++TLTE Q   +KDLADLGLVKLQQGRKD WFIPTKLATNLS+SLADSS+RK+GFVV
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           +ETNFR+YAYS+SKL CEILRLF+RIEYQLPNL   AITKESLYNAF NGIT+DQ++TFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQN+HPR A+R+PS+PENVTDQ+
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQI 383

BLAST of Sed0003361 vs. TAIR 10
Match: AT4G17020.1 (transcription factor-related)

HSP 1 Score: 611.3 bits (1575), Expect = 6.0e-175
Identity = 302/383 (78.85%), Positives = 342/383 (89.30%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPA+KLDKLY+N FICEAILRSLPPLAKK+V+QMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
             MEEWVL DG SK +VA+DRLIQLR+F E  DRKR T+Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
            REPM S+  ++LPSL+ELE YAL QWECFLLQLINSGQ EK + ISSSMMK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDK+ PRLT SGFQFLLM+TNAQLWYIIREYI N+EER VDPADLISF+LELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AY+++TLTE Q   +KDLADLGLVKLQQGRKD WFIPTKLATNLS+SLADSS+RK+GFVV
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           +ETNFR+YAYS+SKL CEILRLF+RIEYQLPNL   AITKESLYNAF NGIT+DQ++TFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQN+HPR A+R+PS+PENVTDQ+
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQI 383

BLAST of Sed0003361 vs. TAIR 10
Match: AT4G17020.3 (transcription factor-related)

HSP 1 Score: 611.3 bits (1575), Expect = 6.0e-175
Identity = 302/383 (78.85%), Positives = 342/383 (89.30%), Query Frame = 0

Query: 45  MPQVKIIAKNFMDMVASLPAMKLDKLYHNAFICEAILRSLPPLAKKFVIQMLYIDAPVTA 104
           MPQVKIIAKNFMDMVASLPA+KLDKLY+N FICEAILRSLPPLAKK+V+QMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 105 KSMEEWVLPDGVSKFKVAVDRLIQLRVFIETVDRKRETTYRLNPTFQANLQKLLIHGEVL 164
             MEEWVL DG SK +VA+DRLIQLR+F E  DRKR T+Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 165 AREPMPSNITVRLPSLEELEAYALGQWECFLLQLINSGQAEKPSNISSSMMKVFQKGLLS 224
            REPM S+  ++LPSL+ELE YAL QWECFLLQLINSGQ EK + ISSSMMK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 225 QRDKEAPRLTASGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFMLELSFHVTGE 284
           QRDK+ PRLT SGFQFLLM+TNAQLWYIIREYI N+EER VDPADLISF+LELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 285 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDRWFIPTKLATNLSMSLADSSSRKQGFVV 344
           AY+++TLTE Q   +KDLADLGLVKLQQGRKD WFIPTKLATNLS+SLADSS+RK+GFVV
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 345 VETNFRLYAYSSSKLHCEILRLFSRIEYQLPNLTVAAITKESLYNAFKNGITADQMVTFL 404
           +ETNFR+YAYS+SKL CEILRLF+RIEYQLPNL   AITKESLYNAF NGIT+DQ++TFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 405 QQNAHPRVAERIPSVPENVTDQV 428
           QQN+HPR A+R+PS+PENVTDQ+
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQI 383
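
Each HSP header above reports identity and positives as a count over the aligned length, followed by a percentage. As a minimal sketch (not part of the database output), the percentages can be recomputed from the counts to sanity-check a header line. The function names `parse_hsp_stats` and `recomputed_pct` are hypothetical, and the pattern assumes the header layout shown on this page, tolerating either spelling of the positives label.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" part of an HSP header.
# "Posi?tives" also accepts the misspelled label "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned_length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def recomputed_pct(count, aligned_length):
    """Percentage of aligned columns, rounded to two decimals like the report."""
    return round(100.0 * count / aligned_length, 2)


# Header values copied from the TAIR 10 hit AT4G17020.1 above.
stats = parse_hsp_stats(
    "Identity = 302/383 (78.85%), Positives = 342/383 (89.30%), Query Frame = 0"
)
identity_check = recomputed_pct(stats["identical"], stats["aligned_length"])
positive_check = recomputed_pct(stats["positive"], stats["aligned_length"])
```

Here `identity_check` and `positive_check` should agree with the reported `identity_pct` and `positive_pct` to two decimals; a mismatch would suggest a copy or extraction error in the header line.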

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022977542.1  6.4e-203  94.52     RNA polymerase II transcription factor B subunit 2 [Cucurbita maxima]
KAG6604444.1    1.9e-202  92.62     General transcription and DNA repair factor IIH subunit TFB2, partial [Cucurbita...
TYK13130.1      3.2e-202  93.99     RNA polymerase II transcription factor B subunit 2 [Cucumis melo var. makuwa]
XP_023543805.1  4.1e-202  94.26     RNA polymerase II transcription factor B subunit 2 [Cucurbita pepo subsp. pepo]
XP_038882460.1  9.2e-202  93.99     general transcription and DNA repair factor IIH subunit TFB2 isoform X1 [Beninca...
Match Name  E-value   Identity  Description
Q680U9      8.4e-174  78.85     General transcription and DNA repair factor IIH subunit TFB2 OS=Arabidopsis thal...
Q92759      6.5e-57   33.51     General transcription factor IIH subunit 4 OS=Homo sapiens OX=9606 GN=GTF2H4 PE=...
P60027      6.5e-57   33.51     General transcription factor IIH subunit 4 OS=Pan troglodytes OX=9598 GN=GTF2H4 ...
O70422      1.4e-56   33.77     General transcription factor IIH subunit 4 OS=Mus musculus OX=10090 GN=Gtf2h4 PE...
Q54C29      4.5e-50   30.27     General transcription factor IIH subunit 4 OS=Dictyostelium discoideum OX=44689 ...
Match Name  E-value   Identity  Description
A0A6J1IIT5  3.1e-203  94.52     RNA polymerase II transcription factor B subunit 2 OS=Cucurbita maxima OX=3661 G...
A0A5D3CNX3  1.5e-202  93.99     RNA polymerase II transcription factor B subunit 2 OS=Cucumis melo var. makuwa O...
A0A0A0KK55  2.9e-201  93.47     RNA polymerase II transcription factor B subunit 2 OS=Cucumis sativus OX=3659 GN...
A0A6J1EGA7  2.9e-201  93.99     RNA polymerase II transcription factor B subunit 2 OS=Cucurbita moschata OX=3662...
A0A5A7UBE2  1.0e-198  88.48     RNA polymerase II transcription factor B subunit 2 OS=Cucumis melo var. makuwa O...
Match Name   E-value   Identity  Description
AT4G17020.2  6.0e-175  78.85     transcription factor-related
AT4G17020.1  6.0e-175  78.85     transcription factor-related
AT4G17020.3  6.0e-175  78.85     transcription factor-related
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                              Source   Source Term  Source Description    Alignment
IPR004598  Transcription factor TFIIH subunit p52/Tfb2  PFAM     PF03849      Tfb2                  coord: 58..399; e-value: 3.3E-103; score: 345.4
IPR004598  Transcription factor TFIIH subunit p52/Tfb2  PANTHER  PTHR13152    TFIIH, POLYPEPTIDE 4  coord: 49..399
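
The two InterPro matches above are given as residue ranges on the protein. As a rough sketch, assuming the `coord` values are 1-based and inclusive (a common convention that this page does not state explicitly), the length of each match and the region both matches share can be computed as follows. The helper names `span_length` and `overlap` are hypothetical.

```python
def span_length(start, end):
    """Number of residues in a 1-based, inclusive range start..end."""
    return end - start + 1


def overlap(a, b):
    """Shared region of two inclusive (start, end) ranges, or None if disjoint."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return (start, end) if start <= end else None


# Coordinates copied from the InterPro table above.
pfam_match = (58, 399)      # PF03849, Tfb2
panther_match = (49, 399)   # PTHR13152, TFIIH, POLYPEPTIDE 4

pfam_length = span_length(*pfam_match)
panther_length = span_length(*panther_match)
shared_region = overlap(pfam_match, panther_match)
```

Under that assumption the Pfam match spans 342 residues, the PANTHER match spans 351, and the Pfam range lies entirely inside the PANTHER range.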

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Sed0003361.1  Sed0003361.1  mRNA
Sed0003361.3  Sed0003361.3  mRNA
Sed0003361.2  Sed0003361.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006289      nucleotide-excision repair
biological_process  GO:0070816      phosphorylation of RNA polymerase II C-terminal domain
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0000439      transcription factor TFIIH core complex
cellular_component  GO:0005675      transcription factor TFIIH holo complex
molecular_function  GO:0001671      ATPase activator activity
molecular_function  GO:0003690      double-stranded DNA binding