Sgr029333 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr029333
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionRNA polymerase II transcription factor B subunit 2
Locationtig00153293: 1311060 .. 1320263 (+)
RNA-Seq ExpressionSgr029333
SyntenySgr029333
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCCCAAGTAAAGATCATTGCGAAGAATTTCATGGACATGGTGGCCTCCTTGCCCGCCATGAAGCTCGACAAACTCTATGAGAATGCATTCATCTGCGAAGCCATTCTCAGGTCCTTGCTTGCGAGCTCTAAATGCTTTAATTATTTCGGGTCGTATGTATAACTGCAAAGCAAAAATACAATAAACGACTAAAACCTGATGTTGGATTAATTATGAATTGAATATTTTTTTGGGATTGCTCTGTATGTAATTGTGTCAACTGGTGCTCTTGGACATCTGAGCATGATGTTCATAAAATGATGTCACTGAAGCTGATTTTTTTTTGTTTTTGTTTTTTTGGCTCTAAAATGTTTTAGATTTGATATTCATGCTGAATTTCCATATTTCTTAATTAGCTGTTTTGAAAATTTTCCTTCTCGTTCTTTGAGTTAGAGGACTGTGCCCAAAGTTCTTGGCATTTGTTAATGCGTTTAACTTAAAAGTTGCTAGTTAATCCTAGTTGCGGAGGTTTTCATTGATAAACAAGGGTGCCAATCATCTTTTGTTATCAGGTCACTTCCACCACTGGCCAAGAAGTTTGTTTTACAAATGTTATACATTGATGCTCCAGTAACAGCTAAGTCCATGGAGGAGTGGGTGCTTTCAGATGGAATATCAAAGCATAAGGTTGCCATTGATCGGTTGATTCAGTTGAGAGTATTTATTGAGACTGCGGATAGGTATGTATTCTCTTTTCTTCATTTTATTAAATAATTGATATTCTTTCACAATGCTTTTTAAATGTATCCTTAATTAAATGCGAGATAGAGCCGAGAAAGCTGATATTCGAGTGTTTAAGAGTAAGGCCAAGTTTGTGGGGTTTTTATCTTCATTTTTTTCTCCGATGTTTCTCTTCCATTAGTAGAGAGATGGCCTACATGTTCATATCATGATATCATTGGAATAATCGAGAGGTTCATGCAATGAGAACTTCAATTATTAACTTGTGGTGCAATACATAAATGCTTTTGCAATGTTTCTGTTTTAATCTCACTGAAATTGTTCATAATTCTATTAGTAGTTTTGCACAGTTGATAGTCCTCAACCCAATCCTATATTGTAACCTTTTCTATGGATCTCTATTTCATTGACAGGAAAAGAGAAACAACCTACAGGCTAAATCCAACGTTCCAAGCAAACCTCCAAAAGCTTTTAATACATGGGTAAGTTTTCTCTCATTTGGCTCAATAAATGATTTTACTTACGGGATATGCAAGGAGGGATACATGATCTGTCTCATTTCCTTTCTACTCAGCACAAACATTCTAATTCCTGGATCTTTCTTAATATATATATACATTTTTCTTTCTGCTTACTTTCTTGCTACCTTATTTGCATTCATCAAGTCCAAGTAGATGATTACTCATCTTCTATAAAGAAGCTGATCCTGGAGTCGAGACTCCTTGATTTTAACATTTAAAAGTGATTTACTCATTTAGTTCGGAGAATGACCATGAATTTGGATCTTGCAGTGAAGTTCTAGCCAGAGAACCAATGCCTTCTAATATAACTGTGAGGCTTCCAAGTTTGGAAGATCTTGCGGCTTATGCTCTTGATCAATGGGAGGTTGGACAATTATCAAACAATAATTTTCATCATCTGTTTTTTGGTTGCATCCATTCTAGTTTATTCTTCGGCTATTATTTGTTCCATCCTCTTGTCTTGGACCCAATCAAAGGCAACCAGTCTTCTCAGTTGCTTACTGTTCAAGCTAATCTTGAATATTCTTCTGACCTGTTATTGTGTTAGTTCAGTGCTTCTTGCTGCAATTGATAAACTCAGGCCAATCAGAGAAGCCATCAAATATTAGTTCTTCTGTGATGAAAGTTTTCCAGAAAGGTCTTTTAAGTCAGAGGTTAAAATCTTGTTCAATGTACAATAGCTTATATATAGCGATCACTTCTTCTATCTGATATAATGTTATGTATGTTATCCAGGGATAAAGAAACTCCACGATTAACTGAGAGTGGTTTTCAGTTCTTGGTATGATTCACATGAACCATAAACCTCTTGAAATCCTCATTGATCTGTCTCACCAGGAATTCTTGTGATATATGTAAACTTCAGTTGATGGAAACAAATGCACAACTTTGGTATATCATCAGAGAATATATAACTAACTCTGAGGTTTGTGTTTTTTTATTTTTTTATTTTTATTGTTATATCCTATTGTCTTCCTCTAAGGAAAAATTATATATATTCTGGCATTATTTGGAACACAATTTCTAATTGAATCCAAATGGTGGACTATTCTTACACAAAATGAAAAGCTACTATTGTTAATTTTAAGACTATCTTCTTTTAGCATGTGTAAGTTTAGATTCATAGGTCAGTGTAATTTTATTATTTATTTTCTTGTAATAACTACTATTCTGATAGGATTGTGTCATTGGACTAGTTAGAAATATTATTAGGTTAGTAAGGGCAAATTAGTAATTGGTTAGAGAGTTTGTTAGTAGGGATTTGTTATAAATACAGTGAGTGGGAAGGGAAGAGGCAGACAATTTGTTTAGTGTGTTTAGGCTTGAGTGAGAATACTCAAGAGAGGGGGGAGGTTCCAAGTGCCTCGAATACTTGGGTTATTGTACTGTTTTAGTCTTTATATTTCAATACTATTCTAGTTATTTGGGTTCTATCAATTGGTATCAGAGCAGTTCGATTCTGGCCATGGAGTTGGATGTTCTACGTGAAGAGATATTGACAATTTTACGTAAAGAGATGGATCGGTTGAAGATTGAGTTGAAGGAACATGTGAATTTGATATGCGATTCAGTATCCCAACCTATGAGTTTTAGAACAGTGACCACGGCAAATACTCCGAAAGGAACAGAGGAAAAGATTGAGAGAAAGCCCAACACATTTGAACCGGATGTCTCATCTAACAAGAGATTGAGCTTAGTGAAAGAAGGGCAACATGCACAGATTGAAGAACGACGACGATTGATTACCAAGGGAGTGATGCGACCACAAAGAACGAAGTATCGTCATCGGAGGAAGAATGGTCGTATCAGGAGCAAAGGCACGAAGGTTCGAAACAGGCTAAAGAAGGAAGAGTTCGTGCGGACGCAAAAGAAAGAGGGTAGAAAGAGTAGAAAAGGAACTGAAAGATGGGGAGGCTGTCGAAAAATACCAAAGAGGAAGGTCCGAGTACGATTGAGGGGAGACCGACTCGAAATCTGTCGGAGTTCGAAAAGCTGGTGCGAGAGGCCGTGGAAGAGGTTGGGTGGGTCGACGGCTGGATTCGCGGAGAGAGATCGATGGCCGGTGATGATTTTGTGGCGGGAGGGGAAGACCCGGAGGAGGAGGTGCCGACGTGATGGGGCGGCCATGGAGGCAGTAGGTGCTCAAAGAGAAGACAGGTCACGTAAGAAAGAGGGTAAAATTAGCCCTAATCACGTGAATGGCAGAGAAAATTCTGGAAGAAAATGGGGCCAAGGGCTTGAAAAGGATGGACCATTTGGGCTGAATTTCAATATAATTGTTAAGGCCCATTCCCAGATAAAGGCAGAAAATAAAATGGCCCATATTATAGAAGATGGGTTTTGGAGCAAGTTGGTCCTCTGTTTTAATCCCACAACCCGACCCGATTTGGCAACCTTCTACCCATTTGAAGAGAGAAACCCTAGCCGTCGCAAGGAACGCCATCTCTTTTTCATCTACGTCACCGACACCTTTGTGTACTGGGCCAACTCTCGTCGTCGGCGTTTCCACCGTTGTCCCCCATCGGCGTCGTCCTCTGCCACGCGACTGCATGTCGAATGCGCCGCGCACACCCACACCTCTATACAGTTTCAAAATCGTTCCTCATCCCACACTCCCTCCGTTCCTGTCATCGCCCATCGTCCGTTGTAGTTCGTCGCCGACAGCTGCACATCCCAGTCAGAGCACCAAACCACCTGGGTTTTTCTATTACATTTTGGTTCAAGGCTAGGTTTCAAATTTTGCGGGTGACGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTCACCTTGGGGACAAGGTGTGTTTTGGGGGAGGGGTAATGATAGGATTGTGTCATTGGACTAGTTAGAAATATTATTAGGTTAGTAAGGGCAAATTAGTAATTGGTTAGAGAGTTTGTTAGTAGGGATTTGTTATAAATACAGTGAGTGGGAAGGGAAGAGGCAGACAATTTGTTTAGTGTGTTTAGGCTTGAGTGAGAATACTCAAGAGAGGGGGGAGGTTCCAAGTGCCTCGAATACTTGGGTTATTGTACTGTTTTAGTCTTTATATTTCAATACTATTCTAGTTATTTGGGTTCTATCATATTCCTCGTTTGTAAATGCTTTATCAATATGGTGCAGGAGCGAGGTGTGGATCCTGCAGATTTGATATCGTTTCTGCTAGAGCTTAGTTTTCATGTGACAGGAGAGGTATCAATTCTTTGGTTATCTTTTAATTCCTTATTTTCATCATTATCTTTTATTGTTGTGTTCTTTTATCTCACTTCATATCATTTTAATACTGGATATGAACTAGTTTGGGTAGGTTCAGGAGTATATTTTCTAATGCAAATTGATTATATTTAGTTTGTTATTATGAAAGCCTACACAATACTGTTTTATGACGAGTTTGTATGTTTTAGTGAATAAACAAATTTCTTTAGGGTTTTTTTTCCCATCTTCATCGTAAGAGAGAATAATTGTATTAATGATGAAGAATAAAGTACAAGGGGAGGTAAGGTATCAACCCAATTAAGAGCCACTGGGATTAAAAAAAATCCCTCCCGTTGACATTGACAAATAAGAATAATAATTACAAAAAAGTGTGGAATAGGAGCTCCGTATAGATGCTTTGAGTTTGATCATCTTACACAAGCATCTTTCCCTCTTAACTTATCCTGAAAATGTCCACAGTTTCTTTCATCCACGCTTGCTGCATTGTGGTCTTAACACCACAATCCCATACAACTTTAGGGGATAACAGCAAATGAGGTGATCTAATGATCTGGACAAGATTATCTGGTTTTGAGTTTCCATGTTTCCTGTGCACGTGACACACAAGGAGAGAGGATTCATATGGGTTTCTCCTTTAAGTAACCTTGGCCGTATTTATCCCCATCATTGCAAGCTGCAAAATGAAAAATATAATTCTCTTTGGATTGCCATGCCTCCAGGTAACTTAGATAGGTTGCTACTACTAGTAGGAGCTCTATCAGATAAAAAAGTTAATAAAGATTTACAAGTCTAGACTTTCAGAATTGTCCAAATTCCATCTCATTCTGTCAGGTTGTTCATGGATGTGGTTTTTGACAATCCTCAGCAGCTGACTAAGATCAGTAGCCTCTCTATCATTTTAGAGGTCTACTGAAATCTATCTTTTAGTTCTCAATTCCACCAGTCAGCTAACTCACACTCAGCATTTCTAGAGCTACGCAGCTGAACTCTACAGAACCCTAGCTTCAGAGGAATTTTCATGGCCTATGTACTTTCCTAGAGGCTGATTTTGGTTCCGTCACCCATTTTGAAGGAGAAAACTCCTTATTTTCTCTTTGAACCTCACCACAAACTTCCAAGGGCTGCTGCTTCTCATGTTCCTATGGTGACAAGTAAATGTAAATCAATTGTCTCTATCAATGCCATACTTGGTGGAGATTATAGAATACAGGAGGGATGCAAGAGGCTCCATAATACTCAGTTAGACACAAGTGCTAAATTTCTCTCCTTCAAATTGCCTACTCCAAATCCACGCTCTTTTGGTGGTTAGACACCCTCCTCCAATCAGCTACATGGCTGCCATTGTCAATGCCCTCTCTAGCCCATAGGAAATTTCTCATCATACTCTCTAAAAGCTTAGCCACTTTACTTGGATTTTAAATAGACACATCAAATAAATAAGCATACTCAACATTGTAAATTGCCCTGAGGTTAATCTTCCACCTTTAGACAAGTAGCACTTTTACACTTATTAAACCGTTTCCCTTTTCTTTGTGAAACCTCTGACTGATTTGCTTCCATTCCTATAGCTCCAAGTGGGTGTTAACCTGTTTGCAAAATTTTTCCCTTTGAAACTCAACTCCAGTAAAGAAAACTAGGGATGCCACCAATCCATTCAAACTTAGGACACATGCCCAGTGTTCTGCCTTCCCCCACTGGTGATTTTTGGCTATTCCATTGTGGACCTTTTGTACTCTGATCAAATTTTCAAAGCTTGATAGCATTTTGAGAGGGCTATATTTCTCCCACAAGCTGACCCCAAAACCTACTTCATTTCTCTCTAGTCTCATCTCTTAATTAATAATAGCTGCCCACTAGTCCTCTAATAAATTTTATATAACAAATTCTGATCATCGATTGAACTAACTTTTGTACTACTAATCTGTGTACTATTTACAGCCCAATAATTAACATGTTTCTTCTTTGTTTTGTAGGCTTATGATATTGATACACTGACAGAAGAACAGAGATATGCGATCAAGGACCTTGCTGATCTGGGACTAGTTAAGCTTCAACAGGTTTTTTGCTAGCCTCTGCTGCTTTTTAAATTGAACATGTTGACCTTTGATCAAGTTTTAACCCTATTTTAGCATCACCTTTTTTTTCCCTTGGGGGGCTTATCGCTATTAATTTGTCCAGGGTAGAAAAGACAGTTGGTTCATACCTACTAAATTAGCTACAAATCTTTCAATGAGTTTGGCAGATTCTTCTTCAAGGAAACAAGTATGTCTGAATTATGTTTAAATTTGATATATTTCTCTGAGGTTTATTTTCTCTTCGGCTACTAGGTTTTTTAGTGGTTTGTGCCTTTGCATGGTAGCTTTTCCATAATTATTTTATTTTTAATTATATATGTAGAGTATCCTCTGGTATATGAATGGATTAGCTGTTTTTTAAGTTTTGCTTTATAGTAATCAAATTTCAGCATTAGCTTATTTTGGCCTCTACATAGGAAGTAGATTAATCATGAGGGCTTTTTGTTTTTAAATAAATTATTTATTATTTATTATTATTATTATTATTTTGTCTAATTTGATGGTTGTTATCTTGTATTTTGGTTCAAGATATATATCTTATCCTCGTTCCTTTTGTAATTTTTATTTCATAAGTCTGGTTTCCGGTATTAAAAAACTCAGCCCGTGAAGTCCTCTGTTCTTTGTTGCTCTATGGGTTTGGAGGCTCTGTGAAAATTTTTTATTCTTCCTAATTTTCTTCCTGTTGTTTACCCAGGGATTTGTTGTCGTGGAGACAAATTTCAGGATGTATGCTTATTCTTCTTCCAAACTACATTGTGAAATATTACGTCTTTTTTCAAGGTACATGTCATTATATAATGTTGTATTACCTTATATAACCAATGTTGGAGAAGAATGAGTTCTAGTAAGCTTTACAAAACTGATTGGCTACTAGATTCATTTAATAAAAAGGTCTACTTTTTCTTCTTGAATGCTAAAAATATTTGAAGTACCCGCCCCTAATAGACCCCGATTGAAAGTTTACATATGTTAGGAGGAAGAAAGAGGCGCAAGCTGAGGGGTAACTTAGTCATTAGCACAGCTTCTTGAGAGTGCAAGCAACTAGTGATAGATTCTGTAAGGTCAGTAGGGCCAAATAATCATTGGAATATTATTACATAATTTGTTGATGATAATTCAATAATTATAGTGTATATTTGTATATAAAATAGAAATGCTGCTATTTTTGCAACTATAGAAAGTTTCTCTCAAGTTTTGCGTCAAATTTCTACAACAGATTCATTTGATTATAAAAGGTTTGTTTTTTATTGTATTCTAACCAAAAGCACTTGTAAGTTACTTGATTAAATAGGGATATGATTCAACATTCTACATTTTATCTGCTGCATAATGAATGCAGTTATTACCTTTTGTGTTGAATTGAACCAGATAATGCTTTAAAGCAGTCTAATCACATCGTCAATTTTCTATTTGGTTCAAGTATTGTAAAATAAAACAAAATTGTCTTGCAGGATTGAGTATCAACTTCCAAATCTTATAGTTGGAGCAATAACGAAGGAAAGCTTGTATAATGCTTTTAAGAATGGAATTACGGCAGAGCAGGCAAGTTTTGCTGCTAGTTCACCCTGGCATAGGTTACGCTTATGAAGTAAGGGTATCTATCATTATATTATGTTCAAATTGGCAATGGAATGCAGGAGATCAGTTCTTGGTGCTCCGTTGGGTCACAATATCTTTTCTTAAAACTGTTCATTCTTGCTGGTGGTGGAATATTTAGAGAGTCCTCTCTATATCTAATGAACTTAGTTCATGCAAGGCATTATCATTCTGCACAAGTAGCATAGAAAATATATGGGGCTTGGCCCAAGAGGGTTATTTCTAGTAAGCAGGGAGCAAGTGCCAAACCCATTATTGTGGGGACCTGCCACAAAGGGACTTAAACTAGCGTTGCTTCTAATCCACAAGAAATAATATTTCTATTTTTCTTTCGTGCATCAATCAAAACTCTCATAAATTTATTGTAAACTTGAACACCTCTTTCAAGCTCAGCACATAGGTATTCTATACATTATATACCTATTTCATTTCTATGCAGGTTTTCTCATTTCAGATTGCTTGATATCTGACTGTTTGATTTGCAGATAATTACTTTTCTACAGCAGAATGCACATCCTCGTGTTGCAGATAGAGTACCATCAGTCCCTGAAAATGTCACAGATCAGGTTTGGGTTGCTTTGTTATGTCCCTTGATGTTTAGCCAGCAATAGCTCCTATCTGCAGATCACCCGTGCTTAGCATGGAAACCAAGAACATCAATATATGTTTTCATCAGACAATTATATCAAATCTTTGTCCTGTTCCTGACTGTTTCCATCGCTCTCTATTCCATGACAGATTAGGTTATGGGAATCAGATCTTAATAGAGTCGATATTACTCCTGCACATTTTTACGACGAATTCCCTTCCAGGGTACGGATTTTGAATGTTGCATCTTTTAACTTTGAAGATTACTTGTCCATTCCTCTGAATAATCACCTGAGAAGGGACTTCTTTGGCATGCCAGGAAGTTTTCGAGGCTGCTTGCGACTATGCACGAGAATGGAATGGGCTGCTATGGGAGGACTCGAAAAATATGCGACTCGTAGTGAAGGCAGACATACACACACACATGCGGGAACATCTTCGCCGACAAAAATAGATCTGCAACTGCAACTAGATTCCTGA

mRNA sequence

ATGCCCCAAGTAAAGATCATTGCGAAGAATTTCATGGACATGGTGGCCTCCTTGCCCGCCATGAAGCTCGACAAACTCTATGAGAATGCATTCATCTGCGAAGCCATTCTCAGGTCACTTCCACCACTGGCCAAGAAGTTTGTTTTACAAATGTTATACATTGATGCTCCAGTAACAGCTAAGTCCATGGAGGAGTGGGTGCTTTCAGATGGAATATCAAAGCATAAGGTTGCCATTGATCGGTTGATTCAGTTGAGAGTATTTATTGAGACTGCGGATAGGAAAAGAGAAACAACCTACAGGCTAAATCCAACGTTCCAAGCAAACCTCCAAAAGCTTTTAATACATGGTGAAGTTCTAGCCAGAGAACCAATGCCTTCTAATATAACTGTGAGGCTTCCAAGTTTGGAAGATCTTGCGGCTTATGCTCTTGATCAATGGGAGTGCTTCTTGCTGCAATTGATAAACTCAGGCCAATCAGAGAAGCCATCAAATATTAGTTCTTCTGTGATGAAAGTTTTCCAGAAAGGTCTTTTAAGTCAGAGGGATAAAGAAACTCCACGATTAACTGAGAGTGGTTTTCAGTTCTTGTTGATGGAAACAAATGCACAACTTTGGTATATCATCAGAGAATATATAACTAACTCTGAGGAGCGAGGTGTGGATCCTGCAGATTTGATATCGTTTCTGCTAGAGCTTAGTTTTCATGTGACAGGAGAGGCTTATGATATTGATACACTGACAGAAGAACAGAGATATGCGATCAAGGACCTTGCTGATCTGGGACTAGTTAAGCTTCAACAGGGTAGAAAAGACAGTTGGTTCATACCTACTAAATTAGCTACAAATCTTTCAATGAGTTTGGCAGATTCTTCTTCAAGGAAACAAGGATTTGTTGTCGTGGAGACAAATTTCAGGATGTATGCTTATTCTTCTTCCAAACTACATTGTGAAATATTACGTCTTTTTTCAAGGATTGAGTATCAACTTCCAAATCTTATAGTTGGAGCAATAACGAAGGAAAGCTTGTATAATGCTTTTAAGAATGGAATTACGGCAGAGCAGATAATTACTTTTCTACAGCAGAATGCACATCCTCGTGTTGCAGATAGAGTACCATCAGTCCCTGAAAATGTCACAGATCAGATTAGGTTATGGGAATCAGATCTTAATAGAGTCGATATTACTCCTGCACATTTTTACGACGAATTCCCTTCCAGGGTACGGATTTTGAATGTTGCATCTTTTAACTTTGAAGATTACTTGTCCATTCCTCTGAATAATCACCTGAGAAGGGACTTCTTTGGCATGCCAGGAAGTTTTCGAGGCTGCTTGCGACTATGCACGAGAATGGAATGGGCTGCTATGGGAGGACTCGAAAAATATGCGACTCGTAGTGAAGGCAGACATACACACACACATGCGGGAACATCTTCGCCGACAAAAATAGATCTGCAACTGCAACTAGATTCCTGA

Coding sequence (CDS)

ATGCCCCAAGTAAAGATCATTGCGAAGAATTTCATGGACATGGTGGCCTCCTTGCCCGCCATGAAGCTCGACAAACTCTATGAGAATGCATTCATCTGCGAAGCCATTCTCAGGTCACTTCCACCACTGGCCAAGAAGTTTGTTTTACAAATGTTATACATTGATGCTCCAGTAACAGCTAAGTCCATGGAGGAGTGGGTGCTTTCAGATGGAATATCAAAGCATAAGGTTGCCATTGATCGGTTGATTCAGTTGAGAGTATTTATTGAGACTGCGGATAGGAAAAGAGAAACAACCTACAGGCTAAATCCAACGTTCCAAGCAAACCTCCAAAAGCTTTTAATACATGGTGAAGTTCTAGCCAGAGAACCAATGCCTTCTAATATAACTGTGAGGCTTCCAAGTTTGGAAGATCTTGCGGCTTATGCTCTTGATCAATGGGAGTGCTTCTTGCTGCAATTGATAAACTCAGGCCAATCAGAGAAGCCATCAAATATTAGTTCTTCTGTGATGAAAGTTTTCCAGAAAGGTCTTTTAAGTCAGAGGGATAAAGAAACTCCACGATTAACTGAGAGTGGTTTTCAGTTCTTGTTGATGGAAACAAATGCACAACTTTGGTATATCATCAGAGAATATATAACTAACTCTGAGGAGCGAGGTGTGGATCCTGCAGATTTGATATCGTTTCTGCTAGAGCTTAGTTTTCATGTGACAGGAGAGGCTTATGATATTGATACACTGACAGAAGAACAGAGATATGCGATCAAGGACCTTGCTGATCTGGGACTAGTTAAGCTTCAACAGGGTAGAAAAGACAGTTGGTTCATACCTACTAAATTAGCTACAAATCTTTCAATGAGTTTGGCAGATTCTTCTTCAAGGAAACAAGGATTTGTTGTCGTGGAGACAAATTTCAGGATGTATGCTTATTCTTCTTCCAAACTACATTGTGAAATATTACGTCTTTTTTCAAGGATTGAGTATCAACTTCCAAATCTTATAGTTGGAGCAATAACGAAGGAAAGCTTGTATAATGCTTTTAAGAATGGAATTACGGCAGAGCAGATAATTACTTTTCTACAGCAGAATGCACATCCTCGTGTTGCAGATAGAGTACCATCAGTCCCTGAAAATGTCACAGATCAGATTAGGTTATGGGAATCAGATCTTAATAGAGTCGATATTACTCCTGCACATTTTTACGACGAATTCCCTTCCAGGGTACGGATTTTGAATGTTGCATCTTTTAACTTTGAAGATTACTTGTCCATTCCTCTGAATAATCACCTGAGAAGGGACTTCTTTGGCATGCCAGGAAGTTTTCGAGGCTGCTTGCGACTATGCACGAGAATGGAATGGGCTGCTATGGGAGGACTCGAAAAATATGCGACTCGTAGTGAAGGCAGACATACACACACACATGCGGGAACATCTTCGCCGACAAAAATAGATCTGCAACTGCAACTAGATTCCTGA

Protein sequence

MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTAKSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVLAREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLSQRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVVVETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFLQQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSRVRILNVASFNFEDYLSIPLNNHLRRDFFGMPGSFRGCLRLCTRMEWAAMGGLEKYATRSEGRHTHTHAGTSSPTKIDLQLQLDS
Homology
BLAST of Sgr029333 vs. NCBI nr
Match: TYK13130.1 (RNA polymerase II transcription factor B subunit 2 [Cucumis melo var. makuwa])

HSP 1 Score: 814.3 bits (2102), Expect = 5.7e-232
Identity = 411/461 (89.15%), Postives = 431/461 (93.49%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPAMKLD+LYENAFICEAILRSLPPLAKKFVLQMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVL DG+SK+KVA+DRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEV+
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVV 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLPSLEDL AYALDQWECFLLQLINSGQ+EKPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDKE PRLTESGFQFLLMETNAQLWYIIREYI+N+EERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AYDIDTL++EQRYAIKDLADLGLVKLQQGRK+SWFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQI+TFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSRVRILNVASFNFED 420
           QQNAHPRVA+R+PSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSRVRI NVASFNFED
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSRVRISNVASFNFED 420

Query: 421 YLSIPLNNHLRRDFFGMPGSFRGCLRLCTRMEWAAMGGLEK 462
           YL I L   L ++F      F   L +         G ++K
Sbjct: 421 YLYIHLKKPLEKEFLWHSRKFSRLLAIMHESGMGCYGRIQK 461

BLAST of Sgr029333 vs. NCBI nr
Match: XP_022977542.1 (RNA polymerase II transcription factor B subunit 2 [Cucurbita maxima])

HSP 1 Score: 781.2 bits (2016), Expect = 5.3e-222
Identity = 390/407 (95.82%), Postives = 405/407 (99.51%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPAMKLD+LYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVL DG+SK+KVA+DRLIQLRVFIETADRKRETTY+LNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLP+LE+L AYALDQWECFLLQLINSGQ++KPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDKETPRLTESGFQFLLMETNAQLWYIIREYI+N+EERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITA+QI+TFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 408
           QQNAHPRVA+R+PSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 407

BLAST of Sgr029333 vs. NCBI nr
Match: XP_023543805.1 (RNA polymerase II transcription factor B subunit 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 778.5 bits (2009), Expect = 3.5e-221
Identity = 389/407 (95.58%), Postives = 404/407 (99.26%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPAMKLD+LYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVL DG+SK+KVA+DRLIQLRVFIETADRKRETTY+LNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLP+LE+L AYALDQWECFLLQLINSGQ++KPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDKETPRLTESGFQFLLMETNAQLWYIIREYI+N+EER VDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERDVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITA+QI+TFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 408
           QQNAHPRVA+R+PSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 407

BLAST of Sgr029333 vs. NCBI nr
Match: KAG6604444.1 (General transcription and DNA repair factor IIH subunit TFB2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 778.5 bits (2009), Expect = 3.5e-221
Identity = 389/407 (95.58%), Postives = 404/407 (99.26%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPAMKLD+LYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA
Sbjct: 32  MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 91

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVL DG+SK+KVA+DRLIQLRVFIETADRKRETTY+LNPTFQANLQKLLIHGEVL
Sbjct: 92  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIHGEVL 151

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLP+LE+L AYALDQWECFLLQLINSGQ++KPSNISSSVMKVFQKGLLS
Sbjct: 152 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 211

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDKETPRLTESGFQFLLMETNAQLWYIIREYI+N+EER VDPADLISFLLELSFHVTGE
Sbjct: 212 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERDVDPADLISFLLELSFHVTGE 271

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 272 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 331

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITA+QI+TFL
Sbjct: 332 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 391

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 408
           QQNAHPRVA+R+PSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR
Sbjct: 392 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 438

BLAST of Sgr029333 vs. NCBI nr
Match: XP_038882460.1 (general transcription and DNA repair factor IIH subunit TFB2 isoform X1 [Benincasa hispida])

HSP 1 Score: 775.8 bits (2002), Expect = 2.2e-220
Identity = 388/407 (95.33%), Postives = 402/407 (98.77%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLP MKLD+LYENAFICEAILRSLPPLAKKFVLQMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPPMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVL DG+SK+KVA+DRLIQLRVFIETADRKRETTYRLNP FQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPMFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
           AREPMP+NITVRLPSLE+L AYALDQWECFLLQLINSGQ+EKPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPANITVRLPSLEELKAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDKE PRLTESGFQFLLMETNAQLWYIIREYI+N+EERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AYDIDTLT+EQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLTDEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQI+TFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 408
           QQNAHPRVA+R+PSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 407

BLAST of Sgr029333 vs. ExPASy Swiss-Prot
Match: Q680U9 (General transcription and DNA repair factor IIH subunit TFB2 OS=Arabidopsis thaliana OX=3702 GN=TFB2 PE=2 SV=1)

HSP 1 Score: 663.3 bits (1710), Expect = 2.1e-189
Identity = 328/407 (80.59%), Postives = 368/407 (90.42%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPA+KLDKLY N FICEAILRSLPPLAKK+VLQMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
             MEEWVL+DG SKH+VAIDRLIQLR+F E +DRKR T+Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
            REPM S+  ++LPSL++L  YAL QWECFLLQLINSGQ EK + ISSS+MK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDK+ PRLTESGFQFLLM+TNAQLWYIIREYI N+EER VDPADLISFLLELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AY+++TLTE Q   +KDLADLGLVKLQQGRKDSWFIPTKLATNLS+SLADSS+RK+GFVV
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           +ETNFRMYAYS+SKL CEILRLF+RIEYQLPNLI  AITKESLYNAF NGIT++QIITFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 408
           QQN+HPR ADRVPS+PENVTDQIRLWE+DL R+++T AHFYDEFPS+
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSK 407

BLAST of Sgr029333 vs. ExPASy Swiss-Prot
Match: Q92759 (General transcription factor IIH subunit 4 OS=Homo sapiens OX=9606 GN=GTF2H4 PE=1 SV=1)

HSP 1 Score: 243.0 bits (619), Expect = 6.9e-63
Identity = 141/413 (34.14%), Postives = 235/413 (56.90%), Query Frame = 0

Query: 3   QVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTAKS 62
           +V +  +N  + +  L    LD+LY +   C A+ R LP LAK +V++ML+++ P+   +
Sbjct: 11  RVHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAA 70

Query: 63  MEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVLAR 122
           +  WV  +     + +   L  LR++             LNP F+ NL+  L+ G     
Sbjct: 71  VALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWS 130

Query: 123 EPM----PSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGL 182
           +      P      +PSL+    YA ++WE  L  ++ S  +    +++  +    Q GL
Sbjct: 131 DDTSQLGPDKHARDVPSLD---KYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGL 190

Query: 183 L-SQRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHV 242
           + S    E P +T +GFQFLL++T AQLWY + +Y+  ++ RG+D  +++SFL +LSF  
Sbjct: 191 MKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFST 250

Query: 243 TGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLS--MSLADSSSRK 302
            G+ Y ++ +++     ++ L + GLV  Q+ RK   + PT+LA NLS  +S A  +  +
Sbjct: 251 LGKDYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQ 310

Query: 303 QGFVVVETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQ 362
            GF+VVETN+R+YAY+ S+L   ++ LFS + Y+ PN++V  +T+ES+  A  +GITA+Q
Sbjct: 311 PGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQ 370

Query: 363 IITFLQQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSRV 409
           II FL+  AHP +  + P +P  +TDQIRLWE + +R+  T    Y++F S+V
Sbjct: 371 IIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 416

BLAST of Sgr029333 vs. ExPASy Swiss-Prot
Match: P60027 (General transcription factor IIH subunit 4 OS=Pan troglodytes OX=9598 GN=GTF2H4 PE=3 SV=1)

HSP 1 Score: 243.0 bits (619), Expect = 6.9e-63
Identity = 141/413 (34.14%), Postives = 235/413 (56.90%), Query Frame = 0

Query: 3   QVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTAKS 62
           +V +  +N  + +  L    LD+LY +   C A+ R LP LAK +V++ML+++ P+   +
Sbjct: 11  RVHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAA 70

Query: 63  MEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVLAR 122
           +  WV  +     + +   L  LR++             LNP F+ NL+  L+ G     
Sbjct: 71  VALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWS 130

Query: 123 EPM----PSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGL 182
           +      P      +PSL+    YA ++WE  L  ++ S  +    +++  +    Q GL
Sbjct: 131 DDTSQLGPDKHARDVPSLD---KYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGL 190

Query: 183 L-SQRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHV 242
           + S    E P +T +GFQFLL++T AQLWY + +Y+  ++ RG+D  +++SFL +LSF  
Sbjct: 191 MKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFST 250

Query: 243 TGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLS--MSLADSSSRK 302
            G+ Y ++ +++     ++ L + GLV  Q+ RK   + PT+LA NLS  +S A  +  +
Sbjct: 251 LGKDYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQ 310

Query: 303 QGFVVVETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQ 362
            GF+VVETN+R+YAY+ S+L   ++ LFS + Y+ PN++V  +T+ES+  A  +GITA+Q
Sbjct: 311 PGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQ 370

Query: 363 IITFLQQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSRV 409
           II FL+  AHP +  + P +P  +TDQIRLWE + +R+  T    Y++F S+V
Sbjct: 371 IIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 416

BLAST of Sgr029333 vs. ExPASy Swiss-Prot
Match: O70422 (General transcription factor IIH subunit 4 OS=Mus musculus OX=10090 GN=Gtf2h4 PE=1 SV=1)

HSP 1 Score: 241.5 bits (615), Expect = 2.0e-62
Identity = 140/407 (34.40%), Postives = 232/407 (57.00%), Query Frame = 0

Query: 9   KNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTAKSMEEWVL 68
           +N  + +  L    LD+LY +   C A+ R LP LAK +V++ML+++ P+   ++  WV 
Sbjct: 18  RNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAVALWVK 77

Query: 69  SDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVLAREPM--- 128
            +     + +   L  LR++             LNP F+ NL+  L+ G     +     
Sbjct: 78  KEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPVFRQNLRIALLGGGKAWSDDTSQL 137

Query: 129 -PSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLL-SQRD 188
            P      +PSL+    YA ++WE  L  ++ S  +    +++  +    Q GL+ S   
Sbjct: 138 GPDKHARDVPSLD---KYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSTEP 197

Query: 189 KETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGEAYD 248
            E P +T +GFQFLL++T AQLWY + +Y+  ++ RG+D  +++SFL +LSF   G+ Y 
Sbjct: 198 GEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYS 257

Query: 249 IDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLS--MSLADSSSRKQGFVVV 308
           ++ +++     ++ L + GLV  Q+ RK   + PT+LA NLS  +S A  +  + GF+VV
Sbjct: 258 VEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVV 317

Query: 309 ETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFLQ 368
           ETN+R+YAY+ S+L   ++ LFS + Y+ PN++V  +T+ES+  A  +GITA+QII FL+
Sbjct: 318 ETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLR 377

Query: 369 QNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSRV 409
             AHP +  + P +P  +TDQIRLWE + +R+  T    Y++F S+V
Sbjct: 378 TRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417

BLAST of Sgr029333 vs. ExPASy Swiss-Prot
Match: Q54C29 (General transcription factor IIH subunit 4 OS=Dictyostelium discoideum OX=44689 GN=gtf2h4 PE=3 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 7.8e-59
Identity = 143/437 (32.72%), Postives = 239/437 (54.69%), Query Frame = 0

Query: 15  VASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTAKSMEEWVLSDGISK 74
           +ASL +  L++LY++ + C+AILRSLPP +K+++L+ML +D        ++W     I +
Sbjct: 10  LASLDSKDLEELYKDPWTCQAILRSLPPRSKQYILKMLLVDT-YPLSLAKDWSTQASIQQ 69

Query: 75  HKVAIDRLIQLRVFI---------------------------ETADRKRETTYRLNPTFQ 134
           HK ++ +L  L++                             +   ++ E T RLNP FQ
Sbjct: 70  HKESLKKLFDLKIIFLDKINKPIQPQQQQSSQQSSSQQQQQQQQQQQQTEQTIRLNPLFQ 129

Query: 135 ANLQKLLIH-GEVLAREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNI 194
            N+++ L+   +V+           + PS++DL +Y+  QWE  L  L  S  + +PS +
Sbjct: 130 DNIKRSLVQVNQVIFSNNSSIKDNHKPPSIDDLDSYSKSQWEKVLYFL--SDDTVQPSKL 189

Query: 195 SSSVMKVFQKGLLSQRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEER----GVD 254
            S ++       L++++ +   +T  GF+FLL +   Q+W ++  Y+ + E++       
Sbjct: 190 ISELL---LSSNLTKQEGDGLSITSEGFKFLLKDVYTQIWTLLIVYLDDLEKKKGKGSGS 249

Query: 255 PADLISFLLELSFHVTGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDS--WFIPTK- 314
             DL+SFL  LSF   G  Y +  L+E+Q+  +  L   GL+ +   R DS   F PT+ 
Sbjct: 250 RNDLLSFLFRLSFLNLGRGYLVSELSEQQKEYLFALKQFGLIYM---RTDSSILFYPTRL 309

Query: 315 ---LATNLSMSLADSSS-------RKQGFVVVETNFRMYAYSSSKLHCEILRLFSRIEYQ 374
              L T  ++SL  S S       ++QG++V+ETN+R+YAY+SS L   +L LF ++ Y+
Sbjct: 310 IISLTTGKTLSLIQSISSERTQTQKEQGYIVLETNYRLYAYTSSSLQISLLSLFVKMLYR 369

Query: 375 LPNLIVGAITKESLYNAFKNGITAEQIITFLQQNAHPRVADRVPSVPENVTDQIRLWESD 407
           LPNL VG IT+ES+  A  +GITA+QII F++ N+HP  A+    +P+ V +QI LWE++
Sbjct: 370 LPNLAVGIITRESIRTALIHGITADQIIDFVRHNSHPNAANSGQPIPDVVAEQILLWEAE 429

BLAST of Sgr029333 vs. ExPASy TrEMBL
Match: A0A5D3CNX3 (RNA polymerase II transcription factor B subunit 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G007400 PE=3 SV=1)

HSP 1 Score: 814.3 bits (2102), Expect = 2.8e-232
Identity = 411/461 (89.15%), Postives = 431/461 (93.49%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPAMKLD+LYENAFICEAILRSLPPLAKKFVLQMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVL DG+SK+KVA+DRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEV+
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVV 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLPSLEDL AYALDQWECFLLQLINSGQ+EKPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDKE PRLTESGFQFLLMETNAQLWYIIREYI+N+EERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AYDIDTL++EQRYAIKDLADLGLVKLQQGRK+SWFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQI+TFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSRVRILNVASFNFED 420
           QQNAHPRVA+R+PSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSRVRI NVASFNFED
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSRVRISNVASFNFED 420

Query: 421 YLSIPLNNHLRRDFFGMPGSFRGCLRLCTRMEWAAMGGLEK 462
           YL I L   L ++F      F   L +         G ++K
Sbjct: 421 YLYIHLKKPLEKEFLWHSRKFSRLLAIMHESGMGCYGRIQK 461

BLAST of Sgr029333 vs. ExPASy TrEMBL
Match: A0A6J1IIT5 (RNA polymerase II transcription factor B subunit 2 OS=Cucurbita maxima OX=3661 GN=LOC111477843 PE=3 SV=1)

HSP 1 Score: 781.2 bits (2016), Expect = 2.6e-222
Identity = 390/407 (95.82%), Postives = 405/407 (99.51%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPAMKLD+LYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVL DG+SK+KVA+DRLIQLRVFIETADRKRETTY+LNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLP+LE+L AYALDQWECFLLQLINSGQ++KPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDKETPRLTESGFQFLLMETNAQLWYIIREYI+N+EERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITA+QI+TFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 408
           QQNAHPRVA+R+PSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 407

BLAST of Sgr029333 vs. ExPASy TrEMBL
Match: A0A0A0KK55 (RNA polymerase II transcription factor B subunit 2 OS=Cucumis sativus OX=3659 GN=Csa_6G517270 PE=3 SV=1)

HSP 1 Score: 774.2 bits (1998), Expect = 3.2e-220
Identity = 387/407 (95.09%), Postives = 402/407 (98.77%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPAMKLD+LYENAFICEAILRSLPPLAKKFVLQMLYID PV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDGPVSA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVL DG+SK+KVA+DRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLPSLEDL AYALDQWECFLLQLINSGQ+EKPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDKE PRLTESGFQFLLMETNAQLWYIIREYI+N+EERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AYDIDTL++EQRYAIKDLADLGLVKLQQGRK+SWFIPTKLATNLSMSLADSSSRK GFVV
Sbjct: 241 AYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTKLATNLSMSLADSSSRKLGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           VETNFRMYAYS+SKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQI+TFL
Sbjct: 301 VETNFRMYAYSTSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 408
           QQNAHPRVA+R+PSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 407

BLAST of Sgr029333 vs. ExPASy TrEMBL
Match: A0A6J1EGA7 (RNA polymerase II transcription factor B subunit 2 OS=Cucurbita moschata OX=3662 GN=LOC111433104 PE=3 SV=1)

HSP 1 Score: 772.7 bits (1994), Expect = 9.2e-220
Identity = 387/407 (95.09%), Postives = 402/407 (98.77%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPAMKLD+LY NAFICEAILRSLPPLAKKFVLQMLYIDAPVTA
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYGNAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVL DG+SK+KVA+DRLIQLRVFIETADRKRETTY+LNPTFQANLQKLLI GEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIQGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLP+LE+L AYALDQWECFLLQLINSGQ++KPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDKETPRLTESGFQFLLMETNAQLWYIIREYI+N+EER VDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERDVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITA+QI+TFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 408
           QQNAHPRVA+R+PSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 407

BLAST of Sgr029333 vs. ExPASy TrEMBL
Match: A0A5A7UBE2 (RNA polymerase II transcription factor B subunit 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold120G003070 PE=3 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 1.1e-217
Identity = 390/432 (90.28%), Postives = 404/432 (93.52%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPAMKLD+LYENAFICEAILRSLPPLAKKFVLQMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVL DG+SK+KVA+DRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLPSLEDL AYALDQWECFLLQLINSGQ+EKPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 181 Q-------------------------RDKETPRLTESGFQFLLMETNAQLWYIIREYITN 240
           Q                         RDKE PRLTESGFQFLLMETNAQLWYIIREYI+N
Sbjct: 181 QRLKSCSMYSSLYFARYLIKCYVWYSRDKEAPRLTESGFQFLLMETNAQLWYIIREYISN 240

Query: 241 SEERGVDPADLISFLLELSFHVTGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWF 300
           +EERGVDPADLISFLLELSFHVTGEAYDIDTL++EQRYAIKDLADLGLVKLQQGRK+SWF
Sbjct: 241 AEERGVDPADLISFLLELSFHVTGEAYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWF 300

Query: 301 IPTKLATNLSMSLADSSSRKQGFVVVETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIV 360
           IPTKLATNLSMSLADSSSRKQGFVVVETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIV
Sbjct: 301 IPTKLATNLSMSLADSSSRKQGFVVVETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIV 360

Query: 361 GAITKESLYNAFKNGITAEQIITFLQQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDI 408
           GAITKESLYNAFKNGITAEQI+TFLQQNAHPRVA+R+PSVPENVTDQIRLWESDLNRVDI
Sbjct: 361 GAITKESLYNAFKNGITAEQIVTFLQQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDI 420

BLAST of Sgr029333 vs. TAIR 10
Match: AT4G17020.2 (transcription factor-related )

HSP 1 Score: 663.3 bits (1710), Expect = 1.5e-190
Identity = 328/407 (80.59%), Postives = 368/407 (90.42%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPA+KLDKLY N FICEAILRSLPPLAKK+VLQMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
             MEEWVL+DG SKH+VAIDRLIQLR+F E +DRKR T+Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
            REPM S+  ++LPSL++L  YAL QWECFLLQLINSGQ EK + ISSS+MK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDK+ PRLTESGFQFLLM+TNAQLWYIIREYI N+EER VDPADLISFLLELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AY+++TLTE Q   +KDLADLGLVKLQQGRKDSWFIPTKLATNLS+SLADSS+RK+GFVV
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           +ETNFRMYAYS+SKL CEILRLF+RIEYQLPNLI  AITKESLYNAF NGIT++QIITFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 408
           QQN+HPR ADRVPS+PENVTDQIRLWE+DL R+++T AHFYDEFPS+
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSK 407

BLAST of Sgr029333 vs. TAIR 10
Match: AT4G17020.1 (transcription factor-related )

HSP 1 Score: 663.3 bits (1710), Expect = 1.5e-190
Identity = 328/407 (80.59%), Postives = 368/407 (90.42%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPA+KLDKLY N FICEAILRSLPPLAKK+VLQMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
             MEEWVL+DG SKH+VAIDRLIQLR+F E +DRKR T+Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
            REPM S+  ++LPSL++L  YAL QWECFLLQLINSGQ EK + ISSS+MK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDK+ PRLTESGFQFLLM+TNAQLWYIIREYI N+EER VDPADLISFLLELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AY+++TLTE Q   +KDLADLGLVKLQQGRKDSWFIPTKLATNLS+SLADSS+RK+GFVV
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           +ETNFRMYAYS+SKL CEILRLF+RIEYQLPNLI  AITKESLYNAF NGIT++QIITFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 408
           QQN+HPR ADRVPS+PENVTDQIRLWE+DL R+++T AHFYDEFPS+
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSK 407

BLAST of Sgr029333 vs. TAIR 10
Match: AT4G17020.3 (transcription factor-related )

HSP 1 Score: 663.3 bits (1710), Expect = 1.5e-190
Identity = 328/407 (80.59%), Postives = 368/407 (90.42%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDKLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60
           MPQVKIIAKNFMDMVASLPA+KLDKLY N FICEAILRSLPPLAKK+VLQMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 61  KSMEEWVLSDGISKHKVAIDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
             MEEWVL+DG SKH+VAIDRLIQLR+F E +DRKR T+Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 121 AREPMPSNITVRLPSLEDLAAYALDQWECFLLQLINSGQSEKPSNISSSVMKVFQKGLLS 180
            REPM S+  ++LPSL++L  YAL QWECFLLQLINSGQ EK + ISSS+MK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYITNSEERGVDPADLISFLLELSFHVTGE 240
           QRDK+ PRLTESGFQFLLM+TNAQLWYIIREYI N+EER VDPADLISFLLELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300
           AY+++TLTE Q   +KDLADLGLVKLQQGRKDSWFIPTKLATNLS+SLADSS+RK+GFVV
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIITFL 360
           +ETNFRMYAYS+SKL CEILRLF+RIEYQLPNLI  AITKESLYNAF NGIT++QIITFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 361 QQNAHPRVADRVPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSR 408
           QQN+HPR ADRVPS+PENVTDQIRLWE+DL R+++T AHFYDEFPS+
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSK 407

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
TYK13130.15.7e-23289.15RNA polymerase II transcription factor B subunit 2 [Cucumis melo var. makuwa][more]
XP_022977542.15.3e-22295.82RNA polymerase II transcription factor B subunit 2 [Cucurbita maxima][more]
XP_023543805.13.5e-22195.58RNA polymerase II transcription factor B subunit 2 [Cucurbita pepo subsp. pepo][more]
KAG6604444.13.5e-22195.58General transcription and DNA repair factor IIH subunit TFB2, partial [Cucurbita... [more]
XP_038882460.12.2e-22095.33general transcription and DNA repair factor IIH subunit TFB2 isoform X1 [Beninca... [more]
Match NameE-valueIdentityDescription
Q680U92.1e-18980.59General transcription and DNA repair factor IIH subunit TFB2 OS=Arabidopsis thal... [more]
Q927596.9e-6334.14General transcription factor IIH subunit 4 OS=Homo sapiens OX=9606 GN=GTF2H4 PE=... [more]
P600276.9e-6334.14General transcription factor IIH subunit 4 OS=Pan troglodytes OX=9598 GN=GTF2H4 ... [more]
O704222.0e-6234.40General transcription factor IIH subunit 4 OS=Mus musculus OX=10090 GN=Gtf2h4 PE... [more]
Q54C297.8e-5932.72General transcription factor IIH subunit 4 OS=Dictyostelium discoideum OX=44689 ... [more]
Match NameE-valueIdentityDescription
A0A5D3CNX32.8e-23289.15RNA polymerase II transcription factor B subunit 2 OS=Cucumis melo var. makuwa O... [more]
A0A6J1IIT52.6e-22295.82RNA polymerase II transcription factor B subunit 2 OS=Cucurbita maxima OX=3661 G... [more]
A0A0A0KK553.2e-22095.09RNA polymerase II transcription factor B subunit 2 OS=Cucumis sativus OX=3659 GN... [more]
A0A6J1EGA79.2e-22095.09RNA polymerase II transcription factor B subunit 2 OS=Cucurbita moschata OX=3662... [more]
A0A5A7UBE21.1e-21790.28RNA polymerase II transcription factor B subunit 2 OS=Cucumis melo var. makuwa O... [more]
Match NameE-valueIdentityDescription
AT4G17020.21.5e-19080.59transcription factor-related [more]
AT4G17020.11.5e-19080.59transcription factor-related [more]
AT4G17020.31.5e-19080.59transcription factor-related [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004598Transcription factor TFIIH subunit p52/Tfb2TIGRFAMTIGR00625TIGR00625coord: 13..409
e-value: 7.7E-97
score: 323.0
IPR004598Transcription factor TFIIH subunit p52/Tfb2PFAMPF03849Tfb2coord: 14..366
e-value: 3.3E-111
score: 371.7
IPR004598Transcription factor TFIIH subunit p52/Tfb2PANTHERPTHR13152TFIIH, POLYPEPTIDE 4coord: 5..412
IPR040662Transcription factor Tfb2, C-terminal domainPFAMPF18307Tfb2_Ccoord: 381..408
e-value: 3.3E-7
score: 30.6
NoneNo IPR availableGENE3D3.30.70.2610coord: 377..424
e-value: 2.3E-9
score: 39.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 466..491
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 473..491

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr029333.1Sgr029333.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006289 nucleotide-excision repair
biological_process GO:0070816 phosphorylation of RNA polymerase II C-terminal domain
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0000439 transcription factor TFIIH core complex
cellular_component GO:0005675 transcription factor TFIIH holo complex
molecular_function GO:0001671 ATPase activator activity
molecular_function GO:0003690 double-stranded DNA binding