Cla97C03G053320 (gene) Watermelon (97103) v2

Name: Cla97C03G053320
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: general transcription factor IIH subunit 2
Location: Cla97Chr03 : 2500917 .. 2502417 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGTTGTGCAAACTTGTATGTCAGGAACTTCTTTGATCAATATCCACTTAGTCAATTTGGTTGGTGACTATAAAAAATGGAGTGGCTCATTGCTTAACAGATCTTGGCGGAAGTCCGGAGTCTTAAAGCTTTAGTGGGTAAACTGGAATACTTAGGTGAAGCATCCTTGCAGAATGGTATGGAACTTGTCCACAGCTATCTAAATCAAATTCCATCATATGGGCATAGAGAAGTTTTAGTCTTGTATCCTGCTCCTAATTCCTGTGATCTTGGGGACATCGTGGACTTTGCCACAATGAGAGAAATTAAAATGGCTATTTATTCTATTTTTTCAGTTGACCAGCTAATAGAACTGACCACAAAGGAAGCTTAACAGAGAATACAGTCATAAGACAGAAGTTCAAAGTAAAATTACCATAGCAAGATTCTATTTGACGTTTGTATATATGGAAGTTCTGTTTCCTAAATCTTTGTGAAATGCTTTCCTTTTGTGTACAGTCTCGCTTCAAAGAGTTGCTATTGGAGCATGCCCCTCCACTCCAGCAATGGCAGAATCTGCGATGCCCAATTTAATCAAGGTGGGCCTTGAAGAAAGAGCGGCGGAGAGTTCTATTGCAATACGTTCATGCCACCAGGAAGCTAAAGCTGGAGGGGGATATACTTGCCCTCGATACAAAGCACAAGTTTGTGAGCTGCTAACGAAGTATCAATCTGTGGATTCACACTTATCTCCTCACCCCATTTGGCTAGGTCGAATCATCATCTCTGTCCAATTATACTACTTGATGAAGTCTATGATAAAGTAGTTCATGATCCACATGATCCGCGACATCAACTACCAAAAGTTTGCTTTGGCTGCCAAGAAAGGCTCATGGACCCTGGTAAGAACTCAAATAAACATGATATTTGGAAGCTTTCAAATGAAATCTCTTTATGGTAATTATTGTCTTCTAATAGTATAATATGCACATCGAGTTAGACAGCATTTTGTGTCAGGGGATGTCTTAAAATCAGGTAAAGTGAAAATTAAGAAGATACTCATTTTTTCTTTCCTTCAAGGAGCTTTTTTAATGTTTGGTCATTCTTCAATACTAATTATGTGCTGCATTAGTACTGCAACATGATGACTTTGCAAACTGATTTTGCTCTTTCTGTGTTGAGTGACTTCTTTTTGTTACATTGTTCATATGTGATTATGTTTAAGATAATAGAAGCAACTTATATTTGTGGCATGCAAGTATGAACAAATGTGGATGAGATTTTGCCCCTTTGTAATTTGCTGAATTGAACTTTTGACCACCCTTCAATCATTGCATACTCATTCAAAAGTACTTTTCATTTTAGGCACAGGTAACAACCGAGGCATCTGTATTTCTTGCACAAAGGGCAAACAACACTTCCTATCTTGATTTTGATATTTAGATTCACAAGAGCTTGCACAATTGTCCTGGCTGCGAGAGTTTCAGGCGTCCCAAATTGGCGACTTCTGACAAATGA

mRNA sequence

ATGGCTATCTTGGCGGAAGTCCGGAGTCTTAAAGCTTTAGTGGGTAAACTGGAATACTTAGGTGAAGCATCCTTGCAGAATGTCTCGCTTCAAAGAGTTGCTATTGGAGCATGCCCCTCCACTCCAGCAATGGCAGAATCTGCGATGCCCAATTTAATCAAGGTGGGCCTTGAAGAAAGAGCGGCGGAGAGTTCTATTGCAATACGTTCATGCCACCAGGAAGCTAAAGCTGGAGGGGGATATACTTGCCCTCGATACAAAGCACAATTCATGATCCACATGATCCGCGACATCAACTACCAAAAGTTTGCTTTGGCTGCCAAGAAAGGCTCATGGACCCTGATTCACAAGAGCTTGCACAATTGTCCTGGCTGCGAGAGTTTCAGGCGTCCCAAATTGGCGACTTCTGACAAATGA

Coding sequence (CDS)

ATGGCTATCTTGGCGGAAGTCCGGAGTCTTAAAGCTTTAGTGGGTAAACTGGAATACTTAGGTGAAGCATCCTTGCAGAATGTCTCGCTTCAAAGAGTTGCTATTGGAGCATGCCCCTCCACTCCAGCAATGGCAGAATCTGCGATGCCCAATTTAATCAAGGTGGGCCTTGAAGAAAGAGCGGCGGAGAGTTCTATTGCAATACGTTCATGCCACCAGGAAGCTAAAGCTGGAGGGGGATATACTTGCCCTCGATACAAAGCACAATTCATGATCCACATGATCCGCGACATCAACTACCAAAAGTTTGCTTTGGCTGCCAAGAAAGGCTCATGGACCCTGATTCACAAGAGCTTGCACAATTGTCCTGGCTGCGAGAGTTTCAGGCGTCCCAAATTGGCGACTTCTGACAAATGA

Protein sequence

MAILAEVRSLKALVGKLEYLGEASLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEERAAESSIAIRSCHQEAKAGGGYTCPRYKAQFMIHMIRDINYQKFALAAKKGSWTLIHKSLHNCPGCESFRRPKLATSDK
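
The protein sequence above should correspond to the CDS read in frame 0. As a minimal sketch for checking that, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the CDS can be translated in a few lines of Python. The helper name `translate` is hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this table is appropriate for the nuclear gene shown here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGCTATCTTGGCGGAAGTC"))  # MAILAEV
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.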
BLAST of Cla97C03G053320 vs. NCBI nr
Match: XP_004143721.1 (PREDICTED: general transcription factor IIH subunit 2 [Cucumis sativus] >KGN50372.1 hypothetical protein Csa_5G169080 [Cucumis sativus])

HSP 1 Score: 99.4 bits (246), Expect = 1.0e-17
Identity = 65/179 (36.31%), Positives = 83/179 (46.37%), Query Frame = 0

Query: 24  SLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEERAAESSIAIRSCHQEAKAGGGYTC 83
           +L     + + +   P  PA+A+SAMPNLIK+G  +RAAESSIAI SCH+EAK GGGYTC
Sbjct: 245 ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTC 304

Query: 84  PRYKAQF----------------MIHMIRDINY--------------------------- 139
           PR KA+                   H+ R  ++                           
Sbjct: 305 PRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCF 364
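
The percentages in the HSP header line are the match counts divided by the alignment length. A minimal sketch that reproduces the figures reported for this hit (the function name `percent` is hypothetical):

```python
def percent(matches: int, alignment_length: int) -> str:
    """Return matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


print(percent(65, 179))  # 36.31 (identical residues)
print(percent(83, 179))  # 46.37 (identical or conservatively substituted residues)
```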

BLAST of Cla97C03G053320 vs. NCBI nr
Match: XP_008467294.1 (PREDICTED: general transcription factor IIH subunit 2 [Cucumis melo])

HSP 1 Score: 94.0 bits (232), Expect = 4.3e-16
Identity = 64/183 (34.97%), Positives = 79/183 (43.17%), Query Frame = 0

Query: 24  SLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEERAAESSIAIRSCHQEAKAGGGYTC 83
           +L     + + +   P  PA+A+SAMPNLIK+G  +RAAESSIAI SCH+EAK GGGYTC
Sbjct: 245 ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTC 304

Query: 84  PRYKAQF----------------MIHMIRDINY--------------------------- 139
           PR KA+                   H+ R  ++                           
Sbjct: 305 PRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCF 364

BLAST of Cla97C03G053320 vs. NCBI nr
Match: XP_022949453.1 (general transcription factor IIH subunit 2 [Cucurbita moschata] >XP_023525764.1 general transcription factor IIH subunit 2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 94.0 bits (232), Expect = 4.3e-16
Identity = 64/182 (35.16%), Positives = 78/182 (42.86%), Query Frame = 0

Query: 24  SLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEERAAESSIAIRSCHQEAKAGGGYTC 83
           +L     + + +   P  PA+A+SAMPNLIK+G  +RA ESSIAI SCH+EAK GGGYTC
Sbjct: 246 ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGGGYTC 305

Query: 84  PRYKAQF----------------MIHMIRDINY--------------------------- 138
           PR KA+                   H+ R  ++                           
Sbjct: 306 PRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCF 365

BLAST of Cla97C03G053320 vs. NCBI nr
Match: XP_022998882.1 (general transcription factor IIH subunit 2 [Cucurbita maxima])

HSP 1 Score: 94.0 bits (232), Expect = 4.3e-16
Identity = 64/182 (35.16%), Positives = 78/182 (42.86%), Query Frame = 0

Query: 24  SLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEERAAESSIAIRSCHQEAKAGGGYTC 83
           +L     + + +   P  PA+A+SAMPNLIK+G  +RA ESSIAI SCH+EAK GGGYTC
Sbjct: 246 ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAGESSIAICSCHKEAKVGGGYTC 305

Query: 84  PRYKAQF----------------MIHMIRDINY--------------------------- 138
           PR KA+                   H+ R  ++                           
Sbjct: 306 PRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKLFHDPRHQLPKVCF 365

BLAST of Cla97C03G053320 vs. NCBI nr
Match: XP_022157930.1 (general transcription factor IIH subunit 2 [Momordica charantia])

HSP 1 Score: 93.2 bits (230), Expect = 7.4e-16
Identity = 62/179 (34.64%), Positives = 82/179 (45.81%), Query Frame = 0

Query: 24  SLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEERAAESSIAIRSCHQEAKAGGGYTC 83
           +L     + + +   P  PA+A+SAMPNLIK+G  +RAAESSIAI SCH+EAK GGGYTC
Sbjct: 246 ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTC 305

Query: 84  PRYKAQF----------------MIHMIRDINY--------------------------- 139
           PR KA+                   H+ R  ++                           
Sbjct: 306 PRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVLNDPRHRLPKVCF 365

BLAST of Cla97C03G053320 vs. TrEMBL
Match: tr|A0A0A0KPM4|A0A0A0KPM4_CUCSA (General transcription factor IIH subunit OS=Cucumis sativus OX=3659 GN=Csa_5G169080 PE=3 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 6.8e-18
Identity = 65/179 (36.31%), Positives = 83/179 (46.37%), Query Frame = 0

Query: 24  SLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEERAAESSIAIRSCHQEAKAGGGYTC 83
           +L     + + +   P  PA+A+SAMPNLIK+G  +RAAESSIAI SCH+EAK GGGYTC
Sbjct: 245 ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTC 304

Query: 84  PRYKAQF----------------MIHMIRDINY--------------------------- 139
           PR KA+                   H+ R  ++                           
Sbjct: 305 PRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCF 364

BLAST of Cla97C03G053320 vs. TrEMBL
Match: tr|A0A1S3CUH8|A0A1S3CUH8_CUCME (General transcription factor IIH subunit OS=Cucumis melo OX=3656 GN=LOC103504674 PE=3 SV=1)

HSP 1 Score: 94.0 bits (232), Expect = 2.9e-16
Identity = 64/183 (34.97%), Positives = 79/183 (43.17%), Query Frame = 0

Query: 24  SLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEERAAESSIAIRSCHQEAKAGGGYTC 83
           +L     + + +   P  PA+A+SAMPNLIK+G  +RAAESSIAI SCH+EAK GGGYTC
Sbjct: 245 ALDESHFKELLLEHAPPPPAIADSAMPNLIKMGFPQRAAESSIAICSCHKEAKVGGGYTC 304

Query: 84  PRYKAQF----------------MIHMIRDINY--------------------------- 139
           PR KA+                   H+ R  ++                           
Sbjct: 305 PRCKARVCELPTECRICGLTLISSPHLARSYHHLFPIIPFDEVSDKVFHDPRHQLPKVCF 364

BLAST of Cla97C03G053320 vs. TrEMBL
Match: tr|A0A2I4EJC1|A0A2I4EJC1_9ROSI (General transcription factor IIH subunit OS=Juglans regia OX=51240 GN=LOC108990087 PE=3 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 1.5e-12
Identity = 58/181 (32.04%), Positives = 75/181 (41.44%), Query Frame = 0

Query: 24  SLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEERAAESSIAIRSCHQEAKAGGGYTC 83
           +L    L+ + +   P  PA+AE A+ NL+K+G  +RAAESSIAI SCH+EAK GGGYTC
Sbjct: 246 ALDEAHLKELILEHAPPPPAIAEFAIANLLKMGFPQRAAESSIAICSCHKEAKVGGGYTC 305

Query: 84  PRYKAQF----------------MIHMIRDINY--------------------------- 139
           PR KA+                   H+ R  ++                           
Sbjct: 306 PRCKARVCELPTECRVCGLTLISSPHLARSYHHLFPVVPFDEVPPSFLNDAHNRSPRSCF 365

BLAST of Cla97C03G053320 vs. TrEMBL
Match: tr|A0A2P5AFI3|A0A2P5AFI3_PARAD (General transcription factor IIH subunit OS=Parasponia andersonii OX=3476 GN=PanWU01x14_337440 PE=3 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 2.5e-12
Identity = 62/195 (31.79%), Positives = 82/195 (42.05%), Query Frame = 0

Query: 1   MAILAEVRSLKALVGKLEYLGEASLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEER 60
           + + AE+   K L  +       +L     + + +   P  PA+AE A+ NLIK+G  +R
Sbjct: 220 IGLSAEIFICKHLCQETGGSYSVALDESHFKELILEHAPPPPAIAEFAIANLIKMGFPQR 279

Query: 61  AAESSIAIRSCHQEAKAGGGYTCPRYKA-------------------------------- 120
           AAESSIAI SCH+EAKAGGGYTCPR KA                                
Sbjct: 280 AAESSIAICSCHKEAKAGGGYTCPRCKARVCELPTECRTCGLTLISSPHLARSYHHLFPI 339

Query: 121 ----QFMIHMIRDINYQKFALAAKKGSWTL---------------------------IHK 133
               +  + ++ D+ Y+K   A      TL                           IH+
Sbjct: 340 VPFDEVSLSLLNDL-YRKLPRACFGCQQTLLGAGNKPCPRVSCPKCKQHFCLDCDIYIHE 399

BLAST of Cla97C03G053320 vs. TrEMBL
Match: tr|A0A2P5E4E9|A0A2P5E4E9_9ROSA (General transcription factor IIH subunit OS=Trema orientalis OX=63057 GN=TorRG33x02_234280 PE=3 SV=1)

HSP 1 Score: 80.5 bits (197), Expect = 3.3e-12
Identity = 63/194 (32.47%), Positives = 81/194 (41.75%), Query Frame = 0

Query: 1   MAILAEVRSLKALVGKLEYLGEASLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEER 60
           + + AE+   K L  +       +L     + + +   P  PA+AE A+ NLIK+G  +R
Sbjct: 220 IGLSAEIFICKHLCQETGGSYSVALDESHFKELILEHAPPPPAIAEFAIANLIKMGFPQR 279

Query: 61  AAESSIAIRSCHQEAKAGGGYTCPRYKAQF----------------MIHMIRDIN----- 120
           AAESSIAI SCH+EAKAGGGYTCPR KA+                   H+ R  +     
Sbjct: 280 AAESSIAICSCHKEAKAGGGYTCPRCKARVCELPTECRTCGLTLISSPHLARSYHHLFPI 339

Query: 121 --------------YQKFALAAKKGSWTL---------------------------IHKS 133
                         Y+K   A      TL                           IH+S
Sbjct: 340 VPFDEVSLSLLNDPYRKLPRACFGCQHTLLGAGNKPGPRVSCPKCKQHFCLDCDIYIHES 399

BLAST of Cla97C03G053320 vs. Swiss-Prot
Match: sp|Q9ZVN9|TF2H2_ARATH (General transcription factor IIH subunit 2 OS=Arabidopsis thaliana OX=3702 GN=GTF2H2 PE=1 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 3.4e-12
Identity = 58/195 (29.74%), Positives = 78/195 (40.00%), Query Frame = 0

Query: 1   MAILAEVRSLKALVGKLEYLGEASLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEER 60
           + + AE+   K L  +   L   ++  V L+ + +   P  PA+AE A+ NLIK+G  +R
Sbjct: 220 IGLSAEMFICKHLCQETGGLYSVAVDEVHLKDLLLEHAPPPPAIAEFAIANLIKMGFPQR 279

Query: 61  AAESSIAIRSCHQEAKAGGGYTCPRYKAQF----------------MIHMIRDINY---- 120
           AAE S+AI SCH+E K G GY CPR KA+                   H+ R  ++    
Sbjct: 280 AAEGSMAICSCHKEVKIGAGYMCPRCKARVCDLPTECTICGLTLVSSPHLARSYHHLFPI 339

Query: 121 ---------------------------QKFALAAKK----------------GSWTLIHK 133
                                      Q    A  K                     IH+
Sbjct: 340 APFDEVPALSSLNDNRRKLGKSCFGCQQSLIGAGNKPVPCVTCRKCKHYFCLDCDIYIHE 399

BLAST of Cla97C03G053320 vs. TAIR10
Match: AT1G05055.1 (general transcription factor II H2)

HSP 1 Score: 72.8 bits (177), Expect = 1.9e-13
Identity = 58/195 (29.74%), Positives = 78/195 (40.00%), Query Frame = 0

Query: 1   MAILAEVRSLKALVGKLEYLGEASLQNVSLQRVAIGACPSTPAMAESAMPNLIKVGLEER 60
           + + AE+   K L  +   L   ++  V L+ + +   P  PA+AE A+ NLIK+G  +R
Sbjct: 220 IGLSAEMFICKHLCQETGGLYSVAVDEVHLKDLLLEHAPPPPAIAEFAIANLIKMGFPQR 279

Query: 61  AAESSIAIRSCHQEAKAGGGYTCPRYKAQF----------------MIHMIRDINY---- 120
           AAE S+AI SCH+E K G GY CPR KA+                   H+ R  ++    
Sbjct: 280 AAEGSMAICSCHKEVKIGAGYMCPRCKARVCDLPTECTICGLTLVSSPHLARSYHHLFPI 339

Query: 121 ---------------------------QKFALAAKK----------------GSWTLIHK 133
                                      Q    A  K                     IH+
Sbjct: 340 APFDEVPALSSLNDNRRKLGKSCFGCQQSLIGAGNKPVPCVTCRKCKHYFCLDCDIYIHE 399

The following BLAST results are available for this feature:
Match Name      E-value  Identity  Description
XP_004143721.1  1.0e-17  36.31     PREDICTED: general transcription factor IIH subunit 2 [Cucumis sativus] >KGN5037...
XP_008467294.1  4.3e-16  34.97     PREDICTED: general transcription factor IIH subunit 2 [Cucumis melo]
XP_022949453.1  4.3e-16  35.16     general transcription factor IIH subunit 2 [Cucurbita moschata] >XP_023525764.1 ...
XP_022998882.1  4.3e-16  35.16     general transcription factor IIH subunit 2 [Cucurbita maxima]
XP_022157930.1  7.4e-16  34.64     general transcription factor IIH subunit 2 [Momordica charantia]

Match Name                      E-value  Identity  Description
tr|A0A0A0KPM4|A0A0A0KPM4_CUCSA  6.8e-18  36.31     General transcription factor IIH subunit OS=Cucumis sativus OX=3659 GN=Csa_5G169...
tr|A0A1S3CUH8|A0A1S3CUH8_CUCME  2.9e-16  34.97     General transcription factor IIH subunit OS=Cucumis melo OX=3656 GN=LOC103504674...
tr|A0A2I4EJC1|A0A2I4EJC1_9ROSI  1.5e-12  32.04     General transcription factor IIH subunit OS=Juglans regia OX=51240 GN=LOC1089900...
tr|A0A2P5AFI3|A0A2P5AFI3_PARAD  2.5e-12  31.79     General transcription factor IIH subunit OS=Parasponia andersonii OX=3476 GN=Pan...
tr|A0A2P5E4E9|A0A2P5E4E9_9ROSA  3.3e-12  32.47     General transcription factor IIH subunit OS=Trema orientalis OX=63057 GN=TorRG33...

Match Name             E-value  Identity  Description
sp|Q9ZVN9|TF2H2_ARATH  3.4e-12  29.74     General transcription factor IIH subunit 2 OS=Arabidopsis thaliana OX=3702 GN=GT...

Match Name   E-value  Identity  Description
AT1G05055.1  1.9e-13  29.74     general transcription factor II H2

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006281 DNA repair
biological_process GO:0006289 nucleotide-excision repair
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0008150 biological_process
biological_process GO:0006357 regulation of transcription from RNA polymerase II promoter
biological_process GO:0000394 RNA splicing, via endonucleolytic cleavage and ligation
cellular_component GO:0000439 core TFIIH complex
cellular_component GO:0005575 cellular_component
cellular_component GO:0005675 holo TFIIH complex
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C03G053320.1  Cla97C03G053320.1  mRNA


The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                      Block
Cla97C03G053320  ClCG03G002310   Watermelon (Charleston Gray)  wcgwmbB186
Cla97C03G053320  CmaCh01G005360  Cucurbita maxima (Rimu)       cmawmbB467
Cla97C03G053320  CmoCh01G005670  Cucurbita moschata (Rifu)     cmowmbB448
Cla97C03G053320  Carg12311       Silver-seed gourd             carwmbB0467
Cla97C03G053320  Bhi08G000283    Wax gourd                     wgowmbB476

The following gene(s) are paralogous to this gene:

None