HG10016088 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10016088
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionRNA polymerase II transcription factor B subunit 2
LocationChr03: 2715108 .. 2722751 (+)
RNA-Seq ExpressionHG10016088
SyntenyHG10016088
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCTCAAGTAAAGATCATAGCGAAGAATTTTATGGACATGGTGGCCTCCTTGCCCGCCATGAAGCTCGATCAACTCTATGAGAATGCATTCATCTGCGAAGCCATTCTCAGGTCTTTGCTTGCGATTCTAAATGCTTTAATTGGTTGGGGTCGTATTTATTATTGAAATCACTAAACCATGAGTTTGTATGTAATTTGTTGGACATTTGAGCATGATTTTCATATCATTGAAGCTGATATTATTATTATTTTTAATTATTTTTTTAATTCATTGAAATGTTCTAGATTTGATATTGATGTTGATTTCCATGTTTCTTTATTAGTTGTTTTTTCATTTTTCCTTCTCGTTCATTGAGTTAGAGGACCGTGCCAAAGTTAGGTATTTGCAAGTGAATTTAACTTAAAAGTAGCTAGTTAATCCTAGTTTTGGAGGTTCTCATTTATAAAAGTGCCAATCATCTTTTGTAATCAGGTCACTTCCACCACTGGCTAAGAAGTTTGTTTTACAAATGCTGTACATTGATGCTCCAGTTTCAGCCAAGTCCATGGAGGAGTGGGTTCTCCCAGATGGAGTCTCAAAGTATAAGGTTGCTGTTGATCGATTGATTCAATTGAGAGTATTTATTGAGACTGCGGATAGGTGTTTATTCTCTTCTCGCATTTTCTAAAGAATTTGATATTTTTTCTCAATTATTTTCAACTGCATTCTTAGTTAAATAGATCCAAGAAGGCTGACATTCAAGAGTTCAAGAGTAAGGCCAAGTTTGTGGAGTGTTTTTTATTTTTTTATATTCATCTCTCTTGCCGATGTTTCTTTCCAATAGTTGATAGATTGCCTTCGTGTTCATATGATGATATCATCAGAATTGAGAGGTTCCTGCAAGGAGAACTTTAATTATGAACTTGTGGTGCAATATATAATTGCTTTTGCATTATTTCTGTTTTGATCTCACTTTATATTATCAGCAGTTTGTATGATTTTAGTCCTCAACCCAATCCTATATTGTAATCATTTCTATGAATCCCTATTTTATTGATAGGAAAAGAGAAATGACCTACAGGCTAAATCCAACGTTCCAAGCAAACCTCCAAAAGCTTTTAATACATGGGTAAGTTTTCTCTCAATTGGTTCAATGAATGCATTTCTAAACGAGATATGCAAGGAGGGATACATGATTTGTCTCATGTCCTTTCTACTCAGCCCAAACATTCTGATGCATGGATCTTTCTTAATATATGCACTTTTTCTTTTTACTTGCTCCCTTATTTGCATTCATTAAGTTCAAGTATATGATTACTCATCTTCTATAAAAAGATGCTGATCCAGGAGTTTAGACTCCTTAATTTTAACATTTAAGAGTGATTAACTCTTGGTTTAAAAAATGACAATGAATTTGGGATTTTGCAGTGAAGTTCTAGCCAGAGAACCAATGCCTTCAAATATAACCGTGAGGCTCCCAAGTTTGGAAGATCTTGAGGCTTATGCTCTAGATCAGTGGGAGGTCGGACAGTCATCAAAATAATTTTAATCATTTGTTGTTTGGTTGCATCCTTTCTAGTTGATTCTTCAGCTAGTATTGACTCCATCCTCTTGGACTCAATCGAAGGCAACCAGTCTTCTAATTTGCTTACTGTACAATGATCGTGGATATTCTTCTGACCTATTATTTTATTAGTTCAGTGCTTCTTGCTGCAATTGATAAACTCGGGCCAAGCAGAGAAGCCATCAAATATTAGTTCTTCTGTGATGAAAGTTTTCCAGAAAGGCCTTTTAAGTCAGAGGTTAAAATCTTGTACAATGTACTCTAGTCTCTAGCTTATATATAACGATCACTTCTTATATATGTTAACATGTTATGTATGGTATCTAGGGATAAAGAAGCTCCACGATTAACTGAGAGTGGTTTCCAGTTCTTGGTATGAATCACATGAACCATAGACTTCTTGAAATCCTCATCGATCTGTCTCACCAGGGATTCTTGTGATATATGTAAACTTCAGTTGATGGAAACAAATGCACAACTTTGGTATATCATCAGAGAATATATATCTAACGCTGAGGTTTGTGTTTTCCATTTTTATTGCATATCTGATTGCGTTCCTTCGAGGAAAAATTATATATATTCTATTTCTGCCATTAATTTTCTATTTGAATCTGAATGATGAGCTACTGTAACTCATAACGAAAAGCAACTATTGTTAATTTTAAGACCATTTTCTTTAGATTCATTGGACCGTGTAATTTTATCATTTATTTTCTTCTAATAATTCATATTCCTCTCTTGTAAATGCATGCTTAATCAATGTGGTGCAGGAGCGAGGCGTGGATCCTGCAGATTTGATTTCTTTTCTGCTAGAGCTTAGTTTTCACGTGACAGGAGAGGTATCAATTCCTTGATTGTCTTTTACATCTTTATCTTTTATTGTTGTGTTAATCTTTTATCCCATTGTAGAACTCTCTTTTAAGTCTTGCTGCCTCATATCATGCTAATACTGAAACTACTTTGGGTTGGTTCAGGAGTGTATATATATTTATCACAGATCGATTATCTTTAGTTCGTTACGTTGTGCACCAATCCTAATTTACCATAGATTGTTTATGACCAGTTTGAGTTTGATTGGAATTTTATCCATACTCTTTTTCATCTGTATAATGTCTGTCATTCTTTTGTGACGAGTTTGTGTGTTCTAGACAATAAACAAAGGGCTTGGTTCTGTTTCCCTCCATCATAAGAGTGAATAATTAGTTATGGAGAATAAAGTACAAGTTAAGGTAAGGCATCAACCCAATTAAGAACCCCTGGGATTACAATAAATACCTCCCATTGACAATGACAAATAAGAATCATAAATGCGTTGAGTTTGATCATCTTACATAATCATCTTTCCCTCTAAACTTATTCTGAAAATGTCTACTGTTTCTTTTATCAACGCTTCATTGTGGTCAAGTTTCCATATTTCCTGTGCATGTGATGCACAAGGAGAGAGGGATCATATGGGTTTTCCTTTGAGAAAGTTTGGTTTGTATTAATTCCCCTCATAGCAAGATGCAAAATGAAAATATAATTTTCTTTGGGTTTCCACGCCTCCACTTCAGGTAACCTTAAACAAGTTACTACTCTATTATATAAGAACAGTAATGATTTCAAGTCTAGAGTTTCCCCAAATTCCATCTCAATCTGTCAGGGTGCCAATTTGGATCGTGGTTTTATTTCAATCTTTTAATTCCTCTAACCCTAGCAGTCAGCTAACTCTTCTTTTTTAAAACTTGGCAATTGAATTCAGTAGAAAACTAGCTTTAGAGGAATATTGAAAGCCTATGTACTTTTATGGAGGTTTCCGTTGCCAATTTTGAGGGAAAAACTCCTAATTCTTTTTGCACCTTACCACAAACTTCCAAGGGCTGCTGCTTCTCATTTTCCCATGGTGAACAAGTAGATGTAAATCAATTGTCACTATCAGTGCCATATAGAACAAGTGCTAAATTTATCTCCTTCAGACTGCCTTCTCCAAATCCACGCTCTTTTGGTAGTTAGACACCCTCCTCCAATCAGCTAGAGATGGCTGGCATTGTCAATATTCTCTCTGACCCATAGGAGGTTTCTCAACATACTCTCCAAAAGTTTAGCCACTCGATTTTCAATAGATACATCAAATAAATAGGAGCATTGTAAATTGCACTAAGGTGAGTCTTTCCGCGTCAGACAAAAAGGCACTTACTCATTTTTTCTTTGAGAAACCTCTGAAACTGATTTTGCTTCTATTCCTATAGCTCCAAGTGGATATTAACCTATATACAAATTCTCCCCTTTGAAACTTCAGTAAGGGAAAAAAAAAAAAAAAAAGAAACAAACGTGGGATGCCACAATCCATTCAACCTTAAGACACATGCCCTAAATGTGTTCTACCTTCCCCCACTGGCAATTTTTGGATATTCCATTGTGGACCTTTTGTACCTTGATTTTGAGTGGGCTATATTACTTCAACAATCTAACTCCAAAACCTTCTTCATCCCTCTCTTCCGTCTCTTAATCAATAACAGCTGGTCACTAATCCTCTGATAACTATAATAAATACTAATAATCCATATTTCTAAATTTTATAAAACTAATCCGTATACTTTTTACTGTCCCGTTACCAACATGTTTATTCTTTTTTTTGTAGGCTTATGATATTGATACACTGACAGAAGAACAGAGGTATGCTATTAAGGACCTTGCTGATCTGGGACTAGTTAAGCTTCAGCAGGTTTTTGGCTAGCCTCTGCTGCTTTTTAAATTGCACATGACTTTACTCAAATTTTTACTTTTTTAACTTTGCCTTTTTCTTTTGCTGTTGTGTCTTTCGCTGTTAATTTGTCCAGGGTAGAAAGGACAGTTGGTTTATACCTACTAAATTAGCCACAAATCTTTCAATGAGTTTAGCAGATTCTTCTTCAAGGAAACAGGTGAGTGTTTCTGAATGTGAATTATATTTAAATTTGATATATTTCTCTTAAGTTTCATTTTTTTTATGGTTACTAAGTTTTCTAGGGGTGTTTGGCTCAAAGAGGTGGAGTTGAGTGAGTGTTTGGCCCAAGGAGTTTGTGGGTACCACGACTAAAAAACATCAACTTCATGCCTTATTAACTCCTTACACTATATTTTAAAAAAGAACTCTTTACACTGTGGGCCCCGTGACTTCACAACTCCTTAGATTTCAAACTCTCCAGATTTTACAACTCCACTCCTTGCCCGAGGCTTTATGAAATTGTTTGCACTTCCTAATTTTCTTCCTGTTGTATGCCCAGGGATTTGTTGTCGTGGAGACAAATTTCAGGATGTATGCTTATTCTTCTTCCAAACTACATTGTGAAATATTACGCCTTTTTTCAAGGTATATGTCATCGATTAATGTTATATTAAATTATATAACGAATGTTCAAGTTAGTTTTAACCACAATGGGTTGACCTACTAGGTCAATAAAGGTCCATGTAAATAAGACATGCTTTAGAGGGAACTACACTAGATTTGCATCTAATGAGTTACTTTGATAACGTAATGTAGTAGGGTTAGATAATTGTTTCGTGAAAATAGTTAGAGGGTATACAAGCTCAAATACTAAAGAATATTAGAAAAAAGCTTTACAAAATTAACTGGCTACCGGATACATTCAATATTTTCATCATCCCTTTTCTTCTTCTTGAATGCTAAATATATTTGAATTCCCACCCCTGATAGACCCTTGAAATTTCACATATGTTATAAGGAAGAAAGAGGGACATGCTGTGGGATAACTTAGTCATTAGCGTTGCTTCTTGAGAGTGCAAGCAACTAGTGATAGATTCTGTAAGGTCAGTAGGGCCAATAGTTGATTAAATAGGGATATGATTCAAGTTCAACATTCTACATTTTACCTATTGCGTCATAACGCAGTTATTACCTTTTCTGTAATGCAGCAATCTAATCACACATATCAATATTTGGTTCAAGAATTGTAAAATAAGACGAAGCTTTCCTTCTGCAGGATTGAGTATCAACTTCCAAATCTTATAGTTGGAGCAATAACAAAAGAAAGCTTGTATAATGCTTTTAAGAATGGGATTACGGCAGAGCAGGCAAGTTTTCTTTTTGTTCACCCTGGCATATGTTACTCTTAGTCTTATGACGTATGGGATTGGGTACCATTATATTATGTTCAAATTGCCAGTGGAATGCAGTAGACCAGTTCTTCTTTGCGTTTCATTTAATTCACAACAATTTGGGTTCGTAGCACTGTTAATTCTTGCTAGTGGTGGAAATGGAATACTTGGAGAATTCTCCCCAATTCTAAATATTAATTGCGTACAATTAGCATGAAAGATTTTGGGCTTGGCTCAAGGGGGTTATTTCTAGTTAGCAGGGATCAAGTGCCAAACCCACTGTAATCAGCCAGCCACAAAGGGACTTCAACTAGCATTGGCTCCTAATCCACAAGAAACAATATTTCTATTTTTCTTGTATGCATCAATCTAAATTCTCTCATAAACTTAACATAAACTCTGAATATTTTTCTTAGCAAAAAAAAATAGGAGCAAAATACACCTTTGGTCTCTGAAGTTCGAGTTTAGTGTTTGTTTAGTCCCTAAGTTTTCAAAATGAACATTTTAGTCCGAAAGATTTGGTAAATGGTTTTAAATGGTCCTTAGACTATTAGTTGATAACAGAAAATGACAATATGCGTGAGTAAGAAATAATGAGCCACCCCAGAAAAAAAATTAATATTAAATTTATAGTTGGGCTTCTTTGTCTCATCTTTTGTCTATCGTTCGTTTTCGATCTTCTCCTGATTCTTGGTTATGGCCCATTGATTCTTCATCGGTGTTTACTATTAAGTCTCTTATTGCTGATTTGGTGGGTTCGGCTGATTTGCTTTTGTCTGATCTTTATTCGATTATTTGGAAGGATTCTTATCCAAAGAAAATTAAAATTTTTCTTTGGGAATTGAGTTTGGGTGCCATCAATATTGTGGACCATCTTCAACGTTGAATGCCTTATATGTCGCTCTCACCTTCCTTGTGTTTTATCTGCTGGCATCATCTTAAATCACCTACTCATCTATTTATGCATTGCCCTTTCACGTCCAAATTTTGGAATATCATCTTTGAGGCTTTTGGTTGTTCTATGTCGTGCCCTAATACTATCTATGATCTTCTTGCTATATTGTTGGTGGGTCATCCTTTCGGTGGTACCAAGAAGATGATTTGGCTGGCTCTGCTGCACGTTTGGTCGTTATGGGGGAAGAAATGAACACCTTTTTCGTGACTCTTTTTCCTCTTTTGATCGATTTTTTAATTTTATTCTTTATACTACCTTTTCGTGGTGCAAAAATAAGCACCTCTTTAGACATTACAGTTTATCCTTTTTAGTTGTTAATATTGACAATCTCTCTTGTAACACTTTTAGGCTTTAAGTGCTTTGGGATTTTCCCCTATTTCATTTATAAATGAAATGTTTCTTCATCTTAAAAAAAAATACACAGGCATTCAATTACATTGAATACGTTACTACACTTTTTTATACTGGTTTTCTCATTTCAAATTGCTTGATATCTGACTATTGTTTTGCAGATAGTTACTTTTCTACAGCAGAATGCACATCCTCGTGTTGCAGAGAGAATACCATCAGTCCCTGAAAATGTCACGGATCAGGTTAGGGTAGCTTTAGTAAATTCTTTGATGTTTGTCAAGCCAATGGTTTCTGATCTACAAATCAACCCGTGCTTGTCATGGGAACCAATATATATGTTCTCATCTTACAATTTTATCAAATCTTTGTCATATTCCTGATTTTTTTCAATTGCTTTATATTCCATGACAGATTAGGTTATGGGAATCAGATCTTAATAGAGTCGATATTACTCCTGCACATTTTTACGATGAATTTCCTTCCAGGGTATGAATTTCAAATGTTGCTACTTTTAACTTTGAAGATTACTTGTACATTCACCTGAAAAACCACTTGAGAAGGAACTTCTTTGGCATTCCAGGAAGTTTTTGAGGCTGCTTGCGACTACGCACGAGAATGGAATGGGTTGCTATGGGAGGATTCGAAAAACATGCGACTGGTAGTGAAGGCAGACATACACACACATATGCGGGAACATCTTCGGCGTCAAAAGTAG

mRNA sequence

ATGCCTCAAGTAAAGATCATAGCGAAGAATTTTATGGACATGGTGGCCTCCTTGCCCGCCATGAAGCTCGATCAACTCTATGAGAATGCATTCATCTGCGAAGCCATTCTCAGGTCACTTCCACCACTGGCTAAGAAGTTTGTTTTACAAATGCTGTACATTGATGCTCCAGTTTCAGCCAAGTCCATGGAGGAGTGGGTTCTCCCAGATGGAGTCTCAAAGTATAAGGTTGCTGTTGATCGATTGATTCAATTGAGAGTATTTATTGAGACTGCGGATAGGAAAAGAGAAATGACCTACAGGCTAAATCCAACGTTCCAAGCAAACCTCCAAAAGCTTTTAATACATGGTGAAGTTCTAGCCAGAGAACCAATGCCTTCAAATATAACCGTGAGGCTCCCAAGTTTGGAAGATCTTGAGGCTTATGCTCTAGATCAGTGGGAGTGCTTCTTGCTGCAATTGATAAACTCGGGCCAAGCAGAGAAGCCATCAAATATTAGTTCTTCTGTGATGAAAGTTTTCCAGAAAGGCCTTTTAAGTCAGAGGGATAAAGAAGCTCCACGATTAACTGAGAGTGGTTTCCAGTTCTTGTTGATGGAAACAAATGCACAACTTTGGTATATCATCAGAGAATATATATCTAACGCTGAGGAGCGAGGCGTGGATCCTGCAGATTTGATTTCTTTTCTGCTAGAGCTTAGTTTTCACGTGACAGGAGAGGCTTATGATATTGATACACTGACAGAAGAACAGAGGTATGCTATTAAGGACCTTGCTGATCTGGGACTAGTTAAGCTTCAGCAGGGTAGAAAGGACAGTTGGTTTATACCTACTAAATTAGCCACAAATCTTTCAATGAGTTTAGCAGATTCTTCTTCAAGGAAACAGATAGTTACTTTTCTACAGCAGAATGCACATCCTCGTGTTGCAGAGAGAATACCATCAGTCCCTGAAAATGTCACGGATCAGATTAGGTTATGGGAATCAGATCTTAATAGAGTCGATATTACTCCTGCACATTTTTACGATGAATTTCCTTCCAGGGAAGTTTTTGAGGCTGCTTGCGACTACGCACGAGAATGGAATGGGTTGCTATGGGAGGATTCGAAAAACATGCGACTGGTAGTGAAGGCAGACATACACACACATATGCGGGAACATCTTCGGCGTCAAAAGTAG

Coding sequence (CDS)

ATGCCTCAAGTAAAGATCATAGCGAAGAATTTTATGGACATGGTGGCCTCCTTGCCCGCCATGAAGCTCGATCAACTCTATGAGAATGCATTCATCTGCGAAGCCATTCTCAGGTCACTTCCACCACTGGCTAAGAAGTTTGTTTTACAAATGCTGTACATTGATGCTCCAGTTTCAGCCAAGTCCATGGAGGAGTGGGTTCTCCCAGATGGAGTCTCAAAGTATAAGGTTGCTGTTGATCGATTGATTCAATTGAGAGTATTTATTGAGACTGCGGATAGGAAAAGAGAAATGACCTACAGGCTAAATCCAACGTTCCAAGCAAACCTCCAAAAGCTTTTAATACATGGTGAAGTTCTAGCCAGAGAACCAATGCCTTCAAATATAACCGTGAGGCTCCCAAGTTTGGAAGATCTTGAGGCTTATGCTCTAGATCAGTGGGAGTGCTTCTTGCTGCAATTGATAAACTCGGGCCAAGCAGAGAAGCCATCAAATATTAGTTCTTCTGTGATGAAAGTTTTCCAGAAAGGCCTTTTAAGTCAGAGGGATAAAGAAGCTCCACGATTAACTGAGAGTGGTTTCCAGTTCTTGTTGATGGAAACAAATGCACAACTTTGGTATATCATCAGAGAATATATATCTAACGCTGAGGAGCGAGGCGTGGATCCTGCAGATTTGATTTCTTTTCTGCTAGAGCTTAGTTTTCACGTGACAGGAGAGGCTTATGATATTGATACACTGACAGAAGAACAGAGGTATGCTATTAAGGACCTTGCTGATCTGGGACTAGTTAAGCTTCAGCAGGGTAGAAAGGACAGTTGGTTTATACCTACTAAATTAGCCACAAATCTTTCAATGAGTTTAGCAGATTCTTCTTCAAGGAAACAGATAGTTACTTTTCTACAGCAGAATGCACATCCTCGTGTTGCAGAGAGAATACCATCAGTCCCTGAAAATGTCACGGATCAGATTAGGTTATGGGAATCAGATCTTAATAGAGTCGATATTACTCCTGCACATTTTTACGATGAATTTCCTTCCAGGGAAGTTTTTGAGGCTGCTTGCGACTACGCACGAGAATGGAATGGGTTGCTATGGGAGGATTCGAAAAACATGCGACTGGTAGTGAAGGCAGACATACACACACATATGCGGGAACATCTTCGGCGTCAAAAGTAG

Protein sequence

MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSAKSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVLAREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLSQRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQIVTFLQQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREWNGLLWEDSKNMRLVVKADIHTHMREHLRRQK
Homology
BLAST of HG10016088 vs. NCBI nr
Match: XP_004134713.1 (general transcription and DNA repair factor IIH subunit TFB2 isoform X1 [Cucumis sativus] >KGN49219.1 hypothetical protein Csa_003950 [Cucumis sativus])

HSP 1 Score: 742.3 bits (1915), Expect = 2.2e-210
Identity = 386/451 (85.59%), Postives = 390/451 (86.47%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYID PVSA
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDGPVSA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRE TYRLNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AYDIDTL++EQRYAIKDLADLGLVKLQQGRK+SWFIPTKLATNLSMSLADSSSRK     
Sbjct: 241 AYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTKLATNLSMSLADSSSRKLGFVV 300

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QIVTFL
Sbjct: 301 VETNFRMYAYSTSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 393
           QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 420

BLAST of HG10016088 vs. NCBI nr
Match: XP_022977542.1 (RNA polymerase II transcription factor B subunit 2 [Cucurbita maxima])

HSP 1 Score: 738.0 bits (1904), Expect = 4.1e-209
Identity = 383/451 (84.92%), Postives = 389/451 (86.25%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRE TY+LNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLP+LE+LEAYALDQWECFLLQLINSGQA+KPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDKE PRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK     
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QIVTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 393
           QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 420

BLAST of HG10016088 vs. NCBI nr
Match: XP_038882460.1 (general transcription and DNA repair factor IIH subunit TFB2 isoform X1 [Benincasa hispida])

HSP 1 Score: 737.3 bits (1902), Expect = 7.1e-209
Identity = 384/451 (85.14%), Postives = 388/451 (86.03%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLP MKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA
Sbjct: 1   MPQVKIIAKNFMDMVASLPPMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRE TYRLNP FQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPMFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
           AREPMP+NITVRLPSLE+L+AYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPANITVRLPSLEELKAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AYDIDTLT+EQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK     
Sbjct: 241 AYDIDTLTDEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QIVTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 393
           QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 420

BLAST of HG10016088 vs. NCBI nr
Match: XP_023543805.1 (RNA polymerase II transcription factor B subunit 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 735.3 bits (1897), Expect = 2.7e-208
Identity = 382/451 (84.70%), Postives = 388/451 (86.03%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRE TY+LNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLP+LE+LEAYALDQWECFLLQLINSGQA+KPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDKE PRLTESGFQFLLMETNAQLWYIIREYISNAEER VDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERDVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK     
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QIVTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 393
           QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 420

BLAST of HG10016088 vs. NCBI nr
Match: KAG6604444.1 (General transcription and DNA repair factor IIH subunit TFB2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 735.3 bits (1897), Expect = 2.7e-208
Identity = 382/451 (84.70%), Postives = 388/451 (86.03%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPV+A
Sbjct: 32  MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 91

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRE TY+LNPTFQANLQKLLIHGEVL
Sbjct: 92  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIHGEVL 151

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLP+LE+LEAYALDQWECFLLQLINSGQA+KPSNISSSVMKVFQKGLLS
Sbjct: 152 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 211

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDKE PRLTESGFQFLLMETNAQLWYIIREYISNAEER VDPADLISFLLELSFHVTGE
Sbjct: 212 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERDVDPADLISFLLELSFHVTGE 271

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK     
Sbjct: 272 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 331

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QIVTFL
Sbjct: 332 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 391

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 393
           QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW
Sbjct: 392 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 451

BLAST of HG10016088 vs. ExPASy Swiss-Prot
Match: Q680U9 (General transcription and DNA repair factor IIH subunit TFB2 OS=Arabidopsis thaliana OX=3702 GN=TFB2 PE=2 SV=1)

HSP 1 Score: 600.5 bits (1547), Expect = 1.4e-170
Identity = 305/450 (67.78%), Postives = 348/450 (77.33%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPA+KLD+LY N FICEAILRSLPPLAKK+VLQMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
             MEEWVL DG SK++VA+DRLIQLR+F E +DRKR  +Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
            REPM S+  ++LPSL++LE YAL QWECFLLQLINSGQ EK + ISSS+MK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDK+ PRLTESGFQFLLM+TNAQLWYIIREYI NAEER VDPADLISFLLELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AY+++TLTE Q   +KDLADLGLVKLQQGRKDSWFIPTKLATNLS+SLADSS+RK     
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QI+TFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 392
           QQN+HPR A+R+PS+PENVTDQIRLWE+DL R+++T AHFYDEFPS++VFEAACD+AREW
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREW 420

BLAST of HG10016088 vs. ExPASy Swiss-Prot
Match: Q92759 (General transcription factor IIH subunit 4 OS=Homo sapiens OX=9606 GN=GTF2H4 PE=1 SV=1)

HSP 1 Score: 179.1 bits (453), Expect = 9.7e-44
Identity = 124/453 (27.37%), Postives = 213/453 (47.02%), Query Frame = 0

Query: 3   QVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSAKS 62
           +V +  +N  + +  L    LD+LY +   C A+ R LP LAK +V++ML+++ P+   +
Sbjct: 11  RVHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAA 70

Query: 63  MEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQ-KLLIHGEVLA 122
           +  WV  +     + +   L  LR++             LNP F+ NL+  LL  G+  +
Sbjct: 71  VALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWS 130

Query: 123 REPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLL-S 182
            +            +  L+ YA ++WE  L  ++ S  A    +++  +    Q GL+ S
Sbjct: 131 DDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKS 190

Query: 183 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 242
               E P +T +GFQFLL++T AQLWY + +Y+  A+ RG+D  +++SFL +LSF   G+
Sbjct: 191 TEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGK 250

Query: 243 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLS--------------- 302
            Y ++ +++     ++ L + GLV  Q+ RK   + PT+LA NLS               
Sbjct: 251 DYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGF 310

Query: 303 ----------------------------------------------MSLADSSSRKQIVT 362
                                                          ++A   + +QI+ 
Sbjct: 311 IVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIH 370

Query: 363 FLQQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAR 393
           FL+  AHP + ++ P +P  +TDQIRLWE + +R+  T    Y++F S+  FE    +AR
Sbjct: 371 FLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHAR 430

BLAST of HG10016088 vs. ExPASy Swiss-Prot
Match: P60027 (General transcription factor IIH subunit 4 OS=Pan troglodytes OX=9598 GN=GTF2H4 PE=3 SV=1)

HSP 1 Score: 179.1 bits (453), Expect = 9.7e-44
Identity = 124/453 (27.37%), Postives = 213/453 (47.02%), Query Frame = 0

Query: 3   QVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSAKS 62
           +V +  +N  + +  L    LD+LY +   C A+ R LP LAK +V++ML+++ P+   +
Sbjct: 11  RVHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAA 70

Query: 63  MEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQ-KLLIHGEVLA 122
           +  WV  +     + +   L  LR++             LNP F+ NL+  LL  G+  +
Sbjct: 71  VALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWS 130

Query: 123 REPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLL-S 182
            +            +  L+ YA ++WE  L  ++ S  A    +++  +    Q GL+ S
Sbjct: 131 DDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKS 190

Query: 183 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 242
               E P +T +GFQFLL++T AQLWY + +Y+  A+ RG+D  +++SFL +LSF   G+
Sbjct: 191 TEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGK 250

Query: 243 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLS--------------- 302
            Y ++ +++     ++ L + GLV  Q+ RK   + PT+LA NLS               
Sbjct: 251 DYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGF 310

Query: 303 ----------------------------------------------MSLADSSSRKQIVT 362
                                                          ++A   + +QI+ 
Sbjct: 311 IVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIH 370

Query: 363 FLQQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAR 393
           FL+  AHP + ++ P +P  +TDQIRLWE + +R+  T    Y++F S+  FE    +AR
Sbjct: 371 FLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHAR 430

BLAST of HG10016088 vs. ExPASy Swiss-Prot
Match: O70422 (General transcription factor IIH subunit 4 OS=Mus musculus OX=10090 GN=Gtf2h4 PE=1 SV=1)

HSP 1 Score: 177.9 bits (450), Expect = 2.2e-43
Identity = 123/447 (27.52%), Postives = 210/447 (46.98%), Query Frame = 0

Query: 9   KNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSAKSMEEWVL 68
           +N  + +  L    LD+LY +   C A+ R LP LAK +V++ML+++ P+   ++  WV 
Sbjct: 18  RNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAVALWVK 77

Query: 69  PDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQ-KLLIHGEVLAREPMPS 128
            +     + +   L  LR++             LNP F+ NL+  LL  G+  + +    
Sbjct: 78  KEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPVFRQNLRIALLGGGKAWSDDTSQL 137

Query: 129 NITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLL-SQRDKEA 188
                   +  L+ YA ++WE  L  ++ S  A    +++  +    Q GL+ S    E 
Sbjct: 138 GPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSTEPGEP 197

Query: 189 PRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGEAYDIDT 248
           P +T +GFQFLL++T AQLWY + +Y+  A+ RG+D  +++SFL +LSF   G+ Y ++ 
Sbjct: 198 PCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVEG 257

Query: 249 LTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLS--------------------- 308
           +++     ++ L + GLV  Q+ RK   + PT+LA NLS                     
Sbjct: 258 MSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETN 317

Query: 309 ----------------------------------------MSLADSSSRKQIVTFLQQNA 368
                                                    ++A   + +QI+ FL+  A
Sbjct: 318 YRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRA 377

Query: 369 HPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREWNGLL 393
           HP + ++ P +P  +TDQIRLWE + +R+  T    Y++F S+  FE    +ARE   L+
Sbjct: 378 HPVMLKQNPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLV 437

BLAST of HG10016088 vs. ExPASy Swiss-Prot
Match: Q6CLR2 (General transcription and DNA repair factor IIH subunit TFB2 OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) OX=284590 GN=TFB2 PE=3 SV=1)

HSP 1 Score: 168.7 bits (426), Expect = 1.3e-40
Identity = 119/487 (24.44%), Postives = 209/487 (42.92%), Query Frame = 0

Query: 13  DMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSAKSMEEWVLPDGV 72
           D +  LP     +LYE+   C AI R L P+AK F++ ML+ D  VS + +++WV PD  
Sbjct: 12  DYLEGLPEQVQSRLYESPATCLAIYRLLSPMAKFFIMSMLFQDHDVSLRDLDKWVKPDAK 71

Query: 73  SKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVLAREPMPSNITVR 132
            + + ++  +  L + IE  + K+ +  RLNP F+ + + +L  GE+       ++    
Sbjct: 72  YQLQYSIKSMKSLNLIIE-GESKQPLLIRLNPIFKKSFKNVLTGGEINNSFGDVADDDTN 131

Query: 133 LPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQ-KGLLSQRDKEAPRLTE 192
             S   L+ Y+ ++WE  L  ++ +     P      V+ + Q  GL+ + +    ++T 
Sbjct: 132 PVSTATLDQYSAEKWETILHYMVGTPNTNTP---GGKVLDLLQHSGLMEEAEYGELKITN 191

Query: 193 SGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGEAYDIDTLTEEQ 252
            GFQFLL + NAQ+W ++ +Y+  AE   +DP D+++F+  L     G+AY  D L+  Q
Sbjct: 192 QGFQFLLQDVNAQMWTLLLQYLKMAESLQMDPVDVLNFIFMLGALQLGKAYKCDQLSNTQ 251

Query: 253 RYAIKDLADLGLVKLQQGRKD-SWFIPTKLATNLSM------------------------ 312
           R  ++D+ D GL+   Q + D + F PT+LAT L+                         
Sbjct: 252 RTMLQDMRDYGLI--YQNQSDYAKFYPTRLATLLTSDTKAFRSASVALDSVLNKANETTA 311

Query: 313 ------------------------------------------------------------ 372
                                                                       
Sbjct: 312 VEGDSGQDETTERTQDGALIIETNFKLYSYSNSPLQIAILSLFVHLKSRFANMVTGQLTR 371

Query: 373 -----SLADSSSRKQIVTFLQQNAHPR------------------VAERIPSVPENVTDQ 391
                +L +  + +QI+ +L+ +AHPR                  V E +  +P  V DQ
Sbjct: 372 ESVRNALLNGITAEQIIAYLETHAHPRMRRLAEENLSKKLELDPTVKETLQVLPPTVVDQ 431

BLAST of HG10016088 vs. ExPASy TrEMBL
Match: A0A0A0KK55 (RNA polymerase II transcription factor B subunit 2 OS=Cucumis sativus OX=3659 GN=Csa_6G517270 PE=3 SV=1)

HSP 1 Score: 742.3 bits (1915), Expect = 1.1e-210
Identity = 386/451 (85.59%), Postives = 390/451 (86.47%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYID PVSA
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDGPVSA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRE TYRLNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AYDIDTL++EQRYAIKDLADLGLVKLQQGRK+SWFIPTKLATNLSMSLADSSSRK     
Sbjct: 241 AYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTKLATNLSMSLADSSSRKLGFVV 300

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QIVTFL
Sbjct: 301 VETNFRMYAYSTSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 393
           QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 420

BLAST of HG10016088 vs. ExPASy TrEMBL
Match: A0A6J1IIT5 (RNA polymerase II transcription factor B subunit 2 OS=Cucurbita maxima OX=3661 GN=LOC111477843 PE=3 SV=1)

HSP 1 Score: 738.0 bits (1904), Expect = 2.0e-209
Identity = 383/451 (84.92%), Postives = 389/451 (86.25%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRE TY+LNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLP+LE+LEAYALDQWECFLLQLINSGQA+KPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDKE PRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK     
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QIVTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 393
           QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 420

BLAST of HG10016088 vs. ExPASy TrEMBL
Match: A0A6J1EGA7 (RNA polymerase II transcription factor B subunit 2 OS=Cucurbita moschata OX=3662 GN=LOC111433104 PE=3 SV=1)

HSP 1 Score: 729.6 bits (1882), Expect = 7.1e-207
Identity = 380/451 (84.26%), Postives = 386/451 (85.59%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPAMKLDQLY NAFICEAILRSLPPLAKKFVLQMLYIDAPV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYGNAFICEAILRSLPPLAKKFVLQMLYIDAPVTA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRE TY+LNPTFQANLQKLLI GEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYKLNPTFQANLQKLLIQGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLP+LE+LEAYALDQWECFLLQLINSGQA+KPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPNLEELEAYALDQWECFLLQLINSGQADKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDKE PRLTESGFQFLLMETNAQLWYIIREYISNAEER VDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKETPRLTESGFQFLLMETNAQLWYIIREYISNAEERDVDPADLISFLLELSFHVTGE 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK     
Sbjct: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVV 300

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QIVTFL
Sbjct: 301 VETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAQQIVTFL 360

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 393
           QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 420

BLAST of HG10016088 vs. ExPASy TrEMBL
Match: A0A6J1CK16 (RNA polymerase II transcription factor B subunit 2 OS=Momordica charantia OX=3673 GN=LOC111012326 PE=3 SV=1)

HSP 1 Score: 716.8 bits (1849), Expect = 4.8e-203
Identity = 383/491 (78.00%), Postives = 389/491 (79.23%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPAMKLD LYENAFICEAILRSLPPLAKKFVLQMLYI+APV+A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDNLYENAFICEAILRSLPPLAKKFVLQMLYIEAPVAA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVL DGVSKYKVA+DRLIQLRVFIETADRKRE TYRLNPTFQANLQKLLI+GEVL
Sbjct: 61  KSMEEWVLSDGVSKYKVALDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIYGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISN+EERGVDPADLISFLLELSFHVTGE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNSEERGVDPADLISFLLELSFHVTGE 240

Query: 241 ----------------------------------------AYDIDTLTEEQRYAIKDLAD 300
                                                   AYDIDTLTEEQRYAIKDLAD
Sbjct: 241 LQWLFGDADLLLRFVCLFRGALFFIVLSGFLYFWEFISFLAYDIDTLTEEQRYAIKDLAD 300

Query: 301 LGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK------------------------- 360
           LGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK                         
Sbjct: 301 LGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRKQGFVVVETNFRMYAYSSSRLHCEIL 360

Query: 361 ----------------------------------QIVTFLQQNAHPRVAERIPSVPENVT 393
                                             QIVTFLQQNAHPRVAERIPSVPENVT
Sbjct: 361 RLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFLQQNAHPRVAERIPSVPENVT 420

BLAST of HG10016088 vs. ExPASy TrEMBL
Match: A0A5A7UBE2 (RNA polymerase II transcription factor B subunit 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold120G003070 PE=3 SV=1)

HSP 1 Score: 694.5 bits (1791), Expect = 2.5e-196
Identity = 369/458 (80.57%), Postives = 373/458 (81.44%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA
Sbjct: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
           KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRE TYRLNPTFQANLQKLLIHGEVL
Sbjct: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
           AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180

Query: 181 Q-------------------------RDKEAPRLTESGFQFLLMETNAQLWYIIREYISN 240
           Q                         RDKEAPRLTESGFQFLLMETNAQLWYIIREYISN
Sbjct: 181 QRLKSCSMYSSLYFARYLIKCYVWYSRDKEAPRLTESGFQFLLMETNAQLWYIIREYISN 240

Query: 241 AEERGVDPADLISFLLELSFHVTGEAYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWF 300
           AEERGVDPADLISFLLELSFHVTGEAYDIDTL++EQRYAIKDLADLGLVKLQQGRK+SWF
Sbjct: 241 AEERGVDPADLISFLLELSFHVTGEAYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWF 300

Query: 301 IPTKLATNLSMSLADSSSRK---------------------------------------- 360
           IPTKLATNLSMSLADSSSRK                                        
Sbjct: 301 IPTKLATNLSMSLADSSSRKQGFVVVETNFRMYAYSSSKLHCEILRLFSRIEYQLPNLIV 360

Query: 361 -------------------QIVTFLQQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDI 375
                              QIVTFLQQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDI
Sbjct: 361 GAITKESLYNAFKNGITAEQIVTFLQQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDI 420

BLAST of HG10016088 vs. TAIR 10
Match: AT4G17020.2 (transcription factor-related )

HSP 1 Score: 600.5 bits (1547), Expect = 9.6e-172
Identity = 305/450 (67.78%), Postives = 348/450 (77.33%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPA+KLD+LY N FICEAILRSLPPLAKK+VLQMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
             MEEWVL DG SK++VA+DRLIQLR+F E +DRKR  +Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
            REPM S+  ++LPSL++LE YAL QWECFLLQLINSGQ EK + ISSS+MK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDK+ PRLTESGFQFLLM+TNAQLWYIIREYI NAEER VDPADLISFLLELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AY+++TLTE Q   +KDLADLGLVKLQQGRKDSWFIPTKLATNLS+SLADSS+RK     
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QI+TFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 392
           QQN+HPR A+R+PS+PENVTDQIRLWE+DL R+++T AHFYDEFPS++VFEAACD+AREW
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREW 420

BLAST of HG10016088 vs. TAIR 10
Match: AT4G17020.1 (transcription factor-related )

HSP 1 Score: 600.5 bits (1547), Expect = 9.6e-172
Identity = 305/450 (67.78%), Postives = 348/450 (77.33%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPA+KLD+LY N FICEAILRSLPPLAKK+VLQMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
             MEEWVL DG SK++VA+DRLIQLR+F E +DRKR  +Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
            REPM S+  ++LPSL++LE YAL QWECFLLQLINSGQ EK + ISSS+MK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDK+ PRLTESGFQFLLM+TNAQLWYIIREYI NAEER VDPADLISFLLELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AY+++TLTE Q   +KDLADLGLVKLQQGRKDSWFIPTKLATNLS+SLADSS+RK     
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QI+TFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 392
           QQN+HPR A+R+PS+PENVTDQIRLWE+DL R+++T AHFYDEFPS++VFEAACD+AREW
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREW 420

BLAST of HG10016088 vs. TAIR 10
Match: AT4G17020.3 (transcription factor-related )

HSP 1 Score: 600.5 bits (1547), Expect = 9.6e-172
Identity = 305/450 (67.78%), Postives = 348/450 (77.33%), Query Frame = 0

Query: 1   MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDAPVSA 60
           MPQVKIIAKNFMDMVASLPA+KLD+LY N FICEAILRSLPPLAKK+VLQMLYID PV A
Sbjct: 1   MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60

Query: 61  KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKREMTYRLNPTFQANLQKLLIHGEVL 120
             MEEWVL DG SK++VA+DRLIQLR+F E +DRKR  +Y LNPTFQ NLQK +I G VL
Sbjct: 61  TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120

Query: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
            REPM S+  ++LPSL++LE YAL QWECFLLQLINSGQ EK + ISSS+MK+FQ+GLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180

Query: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
           QRDK+ PRLTESGFQFLLM+TNAQLWYIIREYI NAEER VDPADLISFLLELSFHVTG+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240

Query: 241 AYDIDTLTEEQRYAIKDLADLGLVKLQQGRKDSWFIPTKLATNLSMSLADSSSRK----- 300
           AY+++TLTE Q   +KDLADLGLVKLQQGRKDSWFIPTKLATNLS+SLADSS+RK     
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300

Query: 301 ------------------------------------------------------QIVTFL 360
                                                                 QI+TFL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360

Query: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEAACDYAREW 392
           QQN+HPR A+R+PS+PENVTDQIRLWE+DL R+++T AHFYDEFPS++VFEAACD+AREW
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREW 420

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004134713.12.2e-21085.59general transcription and DNA repair factor IIH subunit TFB2 isoform X1 [Cucumis... [more]
XP_022977542.14.1e-20984.92RNA polymerase II transcription factor B subunit 2 [Cucurbita maxima][more]
XP_038882460.17.1e-20985.14general transcription and DNA repair factor IIH subunit TFB2 isoform X1 [Beninca... [more]
XP_023543805.12.7e-20884.70RNA polymerase II transcription factor B subunit 2 [Cucurbita pepo subsp. pepo][more]
KAG6604444.12.7e-20884.70General transcription and DNA repair factor IIH subunit TFB2, partial [Cucurbita... [more]
Match NameE-valueIdentityDescription
Q680U91.4e-17067.78General transcription and DNA repair factor IIH subunit TFB2 OS=Arabidopsis thal... [more]
Q927599.7e-4427.37General transcription factor IIH subunit 4 OS=Homo sapiens OX=9606 GN=GTF2H4 PE=... [more]
P600279.7e-4427.37General transcription factor IIH subunit 4 OS=Pan troglodytes OX=9598 GN=GTF2H4 ... [more]
O704222.2e-4327.52General transcription factor IIH subunit 4 OS=Mus musculus OX=10090 GN=Gtf2h4 PE... [more]
Q6CLR21.3e-4024.44General transcription and DNA repair factor IIH subunit TFB2 OS=Kluyveromyces la... [more]
Match NameE-valueIdentityDescription
A0A0A0KK551.1e-21085.59RNA polymerase II transcription factor B subunit 2 OS=Cucumis sativus OX=3659 GN... [more]
A0A6J1IIT52.0e-20984.92RNA polymerase II transcription factor B subunit 2 OS=Cucurbita maxima OX=3661 G... [more]
A0A6J1EGA77.1e-20784.26RNA polymerase II transcription factor B subunit 2 OS=Cucurbita moschata OX=3662... [more]
A0A6J1CK164.8e-20378.00RNA polymerase II transcription factor B subunit 2 OS=Momordica charantia OX=367... [more]
A0A5A7UBE22.5e-19680.57RNA polymerase II transcription factor B subunit 2 OS=Cucumis melo var. makuwa O... [more]
Match NameE-valueIdentityDescription
AT4G17020.29.6e-17267.78transcription factor-related [more]
AT4G17020.19.6e-17267.78transcription factor-related [more]
AT4G17020.39.6e-17267.78transcription factor-related [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004598Transcription factor TFIIH subunit p52/Tfb2PFAMPF03849Tfb2coord: 14..287
e-value: 1.3E-72
score: 244.7
IPR004598Transcription factor TFIIH subunit p52/Tfb2PANTHERPTHR13152TFIIH, POLYPEPTIDE 4coord: 295..390
coord: 5..293
NoneNo IPR availableGENE3D3.30.70.2610coord: 318..392
e-value: 2.7E-27
score: 96.9
IPR040662Transcription factor Tfb2, C-terminal domainPFAMPF18307Tfb2_Ccoord: 322..389
e-value: 5.4E-23
score: 81.2

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10016088.1HG10016088.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006289 nucleotide-excision repair
biological_process GO:0070816 phosphorylation of RNA polymerase II C-terminal domain
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0000439 transcription factor TFIIH core complex
cellular_component GO:0005675 transcription factor TFIIH holo complex
molecular_function GO:0001671 ATPase activator activity
molecular_function GO:0003690 double-stranded DNA binding