CSPI04G16980 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI04G16980
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: general transcription and DNA repair factor IIH subunit TFB1-1-like
Location: Chr4: 14386959 .. 14397361 (-)
RNA-Seq Expression: CSPI04G16980
Synteny: CSPI04G16980
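
The location field packs the chromosome, start, end, and strand into one string. A minimal Python sketch of parsing it and computing the genomic span is shown here. It assumes the coordinates are 1-based and inclusive, which is conventional for GFF-style annotations but is not stated on this page; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr4: 14386959 .. 14397361 (-)'.

    Returns (chromosome, start, end, strand, length).
    Assumption: coordinates are 1-based and inclusive, so
    length = end - start + 1.
    """
    match = re.search(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return chrom, start, end, strand, end - start + 1

chrom, start, end, strand, length = parse_location("Chr4: 14386959 .. 14397361 (-)")
print(chrom, start, end, strand, length)
```

Under that assumption the feature spans 10,403 bp on the minus strand of Chr4.
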
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGTACGTTAATTGAGTTATGCAAACCTTTTACGGAGTTCAGTTATAATATGGTAACGTAGTTTGAACATTCTTTTTTAATATGCTTTTCTTATCTATCTAGTTCAGATTTTTGCTCTGAAACCAGCTGTTCACCAAGCATTTCTTAATCATGTTCCCAATAAGGTACTGCTAAATACTGGAAATAGTAACACGTTATCCTCTTTCTTCTTGGTTTTAATCATTTAGTTTGGAGGTAGTTAAATAAGCAAATATAAAAAACGTTAATTTATTTAAGCATTTTCTTTTTGTTTAATGACAGATATCGGAGAAAGACTTTTGGACAAAATATTTTAGAGCGGAGTACCTTCATAGTACAAAAAATTCTATTGCAGCTGCAGCAGAGGCTGCTGAAGATGAAGAACTTGCCCTTTTTCTGAAGGACGATGAGATATTGGCTGCTGAAACTCGGAAAAAGGTATATTTTTTAAAATGTCAACCTCTTAGAGATTTCTCGCTTATTAAAATACTTAGCGTGTTTTTTTTCTTCTGATTATTGTCATTTCATATTAAATTCTATTTATTTTGGTTCATGTTGTTTCTAGATTCGGCATGTTGATCCTACATTGGATTTGGAAGCTGATCTAGGGGATGATTACACACACCTTCCAGTATGTTATTGGTCTTTGTTACGGTATCTTGTTTGGATTTTTTTTTTTTGGTTCTTCATGGAAGACTTATCCTTATTTGCAGTGTTATTTATTCCTGGTGCGCTGAGAGTAAGAAAGAGTATTAGTTGTAATAGAAAGAGAGTTATTTGTATTAGAAAGGCGGTTAGTTGTGAGAATTCTTTAGGACCTTGTACTCATATTTTCCTATATATACAACACTTCTTCTCTCATTTGAGGAGGATAATCATTTGATAAAAAATTAGACTTTTGGTTTACACCAATGCCCTTGGAGGATTTATCTTTGGATCTTTAGCCTCATTCATATTTCAATGAAGCTTTTCTGTTTCTTGTTTAAAAAGAATAGACAGGAAAATCAATCTTTTTTGTTTTAGTACAATTCTTATGTACTTTGAGTATTAGTCTCTTTTATTAATACCTTTAATAAAGAGGCTCGTCTCCGTTTAGAAAAAATTAAGATACAGGAAGACCTTTCAGATAATAAAACTAATTTTTGTTATATTCAGATTGGTGGCTTTCTTATCTTACTGACTGATGCAAAATTTCTTAAATCTAGGATCATGGAATCTTTCGTGATGGTGGCAAGGAAATAACTGAATCACAAAATGAGCACTATAGAAGGACTTTGTCACAAGACCTTAATCGTCAAGGTGCAGTTGTTCTTGAAGGCAGAACTATTGGTTAGTTTCCAACTTGTGATAAATTTTATTCTTATGTTGCAGTTAATCAGGTAAACGTCTGAACTAATTTATAGGTAGTCCTTGTTGAATATATATAATGGTGAAATATACCCCCAAGCCATTGTGGTCCGTCTTATAAGGTAGATTACTTAGAATGGAGATTCCGTTTGTAGACTACTGTTTTACAAATATACCTTAGCCCATGATTTTATGTTCCTTGTCATTCTAGTGAAGGTCATACCTCTTTATTAATTAGTGATAATAATAAAGAGCCAAAGCTCAGAGTACATGAGAGATATACATGAACAAAAACCTAAGGATCAGTATGCGCACTAGGCCATCTCAGCTAGGTTGACACACTCCCTTGGCGCCCTCATCATGTCCAAACGCTAGTTACAAAAAACGGATTACAAGAAAATACCAAACTAAAGCAAGCCTTTCCCCATAGGCTAAACAGATATATTCTAATTATAATAAAAGAAAAAATATACACTTCCTAACGCTTGAAATTTTATAGGAAATCACAGATAGCATGAAAAGGACATTTTGTGTTGTTTAGAAGATGAAGACCTGCCAATTCAGTTGTATCTCTTGAAGGGAATACTCTTAGAAGGTTTTGGATAAGTAACACCATGACAAGGTGTTTAATTC
TACTAGTAATGACTTGACCATGATGCACCATAAGAGGAGAGCGTTTTCTTCATACTAAACCCATCATAATTTACAGCACATTATTTCTGATATTATTGGCGAATATTCGCAGCAGCAGGTTCTGCCGGTTTACAGTATAAGGACAGCCAAAAGTAGATGCTGAAGATTTTCCTGGTAATTGGGGGGGGAGGACAAGTGGGGAGAAGGGTAATGGGATGAGCTTTTTCTGTGATACTGAAGCACAGTTCGGACTTCCAAACAGCATGATCCACACTGAAATATTTACTCTCTTCAGACATTTAGTTTTCCAAAAACTCCTTTGCAGTGCATTGTCATTAGGTGAAGCCATGGATAGATGATTGATTAGAGATTTAATAGCGAAGGAGCTGCTTGTCTCCAGGAACCAAACTCTTTGATCTGAGGATGTATTTAACATTTTTCCTTCCAGTGAACTGAACAAAGCGTTTACTCCGCTAGGTCTTAATTAGATTAATGTCGATTGCTGCTTGCTTCGTTTCCTGAATCAGAACCATGTCTGGATTTGAGCTCTTTAATAGTCTGTTCCCCTGCATCTATGGAACGAGGAGGTGTTTAAGAGGGTTGGTGATAAGCTTGGAGGCTTCATTGAGTTTGCGGAAGAAAATTCGAGCCTCATTAACTGCTTGGAAGTGAATATCAAAGTTAGAGGGAATTATTGCGGCTTCATTCCGGCGGATTTTGAGCTGATAGAAGGATCCGATTCTTATCTTGTGCAGGTGGCTTCCTTCCAAGATCCCAATCTTCTTATTGATAAAGTTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGCCCGGATCCGAACCCGATTGATATTTGGCGTGTGGAGAGGCCGAACTCAAGGTTGGCAGTTTACGCTTTTCAGAACTCTGGGATGGTGAGTACGGCGAGAGAAGAAGCTGAGAATATTTGTCCTTCTCTGCAAACTGACGGAGACATTAGAGGAACTAATTCAAATTTACCGCCCAAGCAAACCCTAATGCGTTTGACTAATAAAGTGGGCTTGACCAATGAAATGGGCCTCACTACGGATGTGGGCTTCCCTGATTTAAAAGGGAAAGGTATAGCAGTGGATGAAGTATTTACTCAAAAGCCCACCAATATTAAAAAGCAAACTGGGCTTTTTATTAGAGAAAATTCCCAAGCTACCCGTGCCAACCAACCAGTGTGATTTGCTGCACAAACCAACCTGGAGGCTCTTCAACTTCAACGTCCATATCAGACTGCATTCATGGAGAAATAGATCCAAGTTCGGACGTTTCTATTTCAAGCTCAGGCAGCCTCATCTCTGAAAACAAATTTGCTCCTCTTGAAGCCGAAGACGTGCAACACATCTCTGATTTTTCTTCCGACGTCCAAAGACTCTTCCACGAGGGCTGGTATGGGATAAGTAAAATAATTTACCCACAAGGTGCTGATATTGTTCTTTATCAACTGGAACTTGATCATCAGAGTTTCCAAGTTTACAATTAAATTCAATAGGAGTATGAACATGACGACATCCCAACATACCTGTCTCGTTTAGCAAATCAAGGGTGTATTTTCTCTGAGACATGGAGATACTTTCTTTAGATCTAGCCACCTCCATCCCAAGGAAATATTTCAGATTTCCCAAATCCTTGATTTCAAATTCATCACCCATTCTCTGCTTTAGTCGACTGATTTCCGCCTGATCATCTCCAGTCAAAACAATGTCATCCACATAAACTATTAGAACAACAATCTTCCCTGTCTTGGAAACCTTTGTAAATAAAGTATGATCAGAGTGCCCCTGACTGTATCCTTGGGACTTGAC
AAAGGTAGTAAATCTATCTAATCATGCTCGGGGAGACTGTTTCAGACCATATAGGGATTTCTGGAGTTTACACACCTGCTGACCAAACTGGGCTTCAAATCCAGGCGGGGGGCTCATGTAGACTTCCTCTACAAGGTCTCCATTCAAAAAAGCATTTTTAACATCCAGCTGGTATAGAGGCTAATCTTTGTTCACAGCAACAGATAGCAAAACTCTAATAGTATTCAACTTAGCAACTAGAGAAAAAGTTTCTGAATAGTCAATACCATATGTTCGAGTAAATCCTTTTGCAACTAACCTTGTCTTGTGTCTGTCAAGAGTACCATCTGCTTTGTATTTGAGAGAGAACACCCATTTGCATCCACAGTTTTGTGCCCCTTGGGTGGAGTACAAATGTCCCAAGTACTATTCTTTTCAAGAGCTTTCATCTCTTCCATGACAGCATTCTTCCATTTAGGACACTCTAAATTAGTGTATATATTTTTTGGTATTATGGTAGAGTCAAGGCTTGCTGTAAAAGCTCTGAACTGAGGAGAGAGATTATCGTAGGAAACATAGTTGCAAATGGGATGTTTAGTACAAACCTGGTACCTTTTCTCAGAGCAATGGGAATGTCAAGAGAGGAATCATACTCATTAGGTTTTCCTGTATGACCCTGTTCTACTTCATTATTACTGGTTTCTATTATGACCTCAGTCTCATCACCACTGTTATTTTCTTCCACATTTTCAAGGACAGCAACATTAGATGACAGTCACCCTACATTAGATGATAATCAACTTCGATGACCACGTCTTCTTCAGGCTCCAACAACTACTACCCACGATAAACTCCAACAAAAGCCACCTTCAATGATCAACTTCGATGACAGTCACCCGACAACTTTGATTATCACCTTGATCAACTAACTCCACCGACAAATTTTGTTCACATCCAACAGCTAACTCTATCGATCTACTCAACCGATAATATACTATAAAAAGAAAAAGGAAAGTTAGGCTAATGGAAGTGTATTTAAATACAAATTTATAACTATTAGTTTATAGTGTTTAATAGTTGTTTTTGTTTTTTCATAGTAATTTGATGTTGTGAGATCATCTGAGGATCAACTCATCCCTATTGTTATGCTCACCCCTCTAAAGTCCTACCTGTAAACTGTTGTGCTGTTTGGTCTTTGTGGGGTGAGCAGAACAGTAGGACTTTTAAAAGTTGGGATACGGACCCTAGTGAGACTTGGTCCTTCGTTCGTTATTATTTTTCTTTATGGGCTGTGATTTTGAAAACCTTTTGTAACTACTCCATTGGCATTCTTTTGCATACTTGGAGCCTTTTATGAGTTGAGTTCCCTTTTTCTGGGCTTGGTTTTTTATATGTTTGTGCTTGTATTCGTTCATTTTTTTTCAATGAAGTGGTTGCTTTTTTTAAAAAAAAAAGAAGCTATGAAATAAACATATGAATAATCAACATGTTCTAAGGTGAGCCTGAAATTCTCCATTTGAATCATAAATAGAAGCAATCGTACATTTTCTTTGAGTTAGTATTTCTTCCTAAAAAGCTGACTGTTTAATTTTCTGAATTGTTGCTTTGCAAAATTGAGAGTAGATGTAGACTTGGAAGATCCAAGGACCGTTGCAGATGCACTTGTGCGGTCTAAACATGGTTAGTATGGCTTATGAAGGTTTCTTTTCTCAATTGCACTCCGGGTTTGTCACTAACTTTGCTTCTATTCTATCAAGCGTTGAGCAATTAAATACTAGTAACTGATTTTACAGCGGTAGAAGGAAATGAGAGCCAGACAGCACTTGATAGGATCTCTAGGATGACAGCAATTGAGGATCTTCAAGCTCCTCATAGCCATCCTTTTGCTCCTCTTTGTATCAAGGTTTGATTTTTATTGGTGAAGGACACTAAGATCTACAATCATTTTAGTAGAAGACAATTTGATGCCATATTTCCATGGCAGGATCCTCGAGATTATTTTGATGCTCAACAAGCAAATG
CCATTAAGACATTGGATGATACACGAGCTGGAATGCAACAAACTAAATGCAGTTTAAGCACTACAGAAGCATATGGCTCACTGAGGGAATCCATATCTGAGATCAAGTCATCTGGATTCAATCATCCCATAATTAAACCGGAAGTTGCACTTATGGTAAAAGCACCTGCAGCATTATCTTGCTAAAAACAGCTATTTTTGCTATCTACATTGACAACTCTTCAGCTACTGAGCATTGATTTCTTTTTTTTGTAGGTTTATAATGGATTAACTCAAAATATTTCTAGTACCAAATATCAACTTGGGAAAAACCCTCAGGAGAGTATTCTGGAGAGTTTGCCAAACCCTACTAAGGAGGAACTGCTACACGTGAGTTGGTGCTCCTCTTGCCGTAATATTTGAATATATATATATATTGAAAGGAAAACAAGTCTCTTTTGTTAATTATAAGAGAGAATAAAGCTCATAGTATAAGAGGTTTATACAATGAGCAAAAGAAAACATAGGATCAGCAGACATACCCAGACATCTCAATTAGGTTAACACTCCTTTAGTGTCCTTGTCATATCCAAACAAAACTAAGAAAAGACTTGTCAAAGAAGGCAACCATACTGCCTTTACAGCATATAATAGTGATATTATAATATATTTTGATGGCTGGTTTTCAGGAGATGTTAAAATCACCAATTTAACCATCTCACTAATATTTATTGGCCACAATAATTATATTTTAGACATTGAAGTTGGTTGACTGCATACGTTATTGTTCTCCTGTCACTTGTCTCTTCAGATATTACTGTTTATTGAGGTATTGTTTAAGAAAGCTTATTGTTTCCAGCATTGGATCTCGATTCAAGAATTACTCAAGCATTTTTGGTCGTCTTATCCAATCACCACATCATATCTTTATACCAAAGTAAGTTCTTATTCTTATTGCATTAACAAGACGATTCTTCATTTGATACTTTTTATCCAATCACATCATCTTATCTTTATCCATTTATTTCTATATTGAATAAGTAGTTCAAGTTCTACTTCTTTAATACTTAACATTTTATGAACTACTGTTGTGCAAACGATTCACAATATTTACATTGTCGGGAACTTAAGGTTATGCAACGAAGCTCTGTTGGCCAAATGGTTGTGGCATTTTGCCCTTGAACCCGAAGCCTTCTGGTCAAAGATTATCGTGAGTAAACATGGCCCTCATCCTTCTGAGTGGGTGGTGAAAGGGCCAAAGGTACACCGTACACACCGTAATCCTTGGAAAGATATCTCCTTCTCCCTTCTTATTCCCATTTTGTATTCTGGGTTGTAGGTGAAGGTCGGGATACATTTTTCTGCAAGGATCATTGGGTGGGGGATAAACCTCTCTTCAACATTTCCTAGGTTATACCATTTGTCTTCTTTGAAAAGCTGCTTTGTATCAGATCTTTTTCTTTAGTCTGAGAACTCGGTTTCTTTATCCTTTGGTTTTCGTTGGGCTTTGTCCAATAAGGAGTCGACGGAGGTGGCTTCTCTTCTTTCTTTGATAGGGTTTTTACTTTAGATTGGGTAGAAGGGATGTTTGTGTTCGGATTACTAATCTAATGGAGGGTTTTCTTGGAAGCCTTTCTTTAGGATTTTAGTTAACCCTTCTCCCATCGTTGAATTGGCCTTTGATATTCTTTGGAGGATTGAGATTCCTAAGAAAGTTAGGTTCTTCGTCTGCCAAGTTCTGCTTGGCCGTGTGAATACAGTGGACAAGGTTTCAAGAAGGTTGCCTTTGTTAACGGGTCTTTTCTGTTGCATTTTTTGCTACAAGACGGGGGAAAGCCTGGATTGCTTTCTTTAAGAGTGTTAGTACTCGAGATCAATATGGATTTCTTTCTTGCCGGAGTTTGATGCTTCGTTTGCTTGTTCGAGGGATGTTCATCTGATGATTCCAAAATCCTTCTTCATCCGCCTTTCAGTGAGAAATGGTGTTTTCTTTGGTTTGCTGGGGTGTGCTGTGATTTGGGA
TCTGTGGGGTGAGAGAAATAATAACGGGTGTTCACTGTTCAGAGGTGTGGAAATGGACCCTTGTGAGGTTTGATTCTTGATGAGGTTTCATGTTTCTCTTTGGACTTCGGTTTCGAAGGTCTTTTGTAATTACTCTCTTGGCAGTATTTTACCTAGTTGGAAATCCTTTCTGTAGTGGGGTTCTGTTTTGGGGGCTTGATTTTTTTGGATGCCCTTCTATTCTTTCATTTTTTCTCAGTGAAAGTTGTTTCTATACACGCACATATATATATATATATATAAAGCTTCAATCAAAATTGAATAAATGATGCTAGTTGTTGCGCCAATACGTTTATGGGCTTATGTATTACTGGATTGGACTCAATTAAGGACCAGGTCCTTTGATAGATTTCAAGGGTCTAATTCATGCTTCTAAAGGCACAAGGCCAGTGGAAAACGAATAACAGAAAACCTAGACTTTGTTCTGTAGGCACAAGGCAAGGTGACCTCTTCTCTCTCGTGTTTATTATTCATAGTTGATTTATTTTGATACTCTTCAATTATGAGGCCTTGGAACAACCTAATTGAGGATTTGAACTTTGAAATGGAAGTACTGCTGTTCATACAACAAATTTTATGCTACCCTCCAATCCACTTCTTGTTAGGAGAAAATGCTTAATCTTTGTAACATTTTGCAGATCCTGGAGGTTTGTGGCATTAACTTTTGCTTCCAAACAAAGGTTAGCTTTGGAGCATCAATTGTGCCAACTCTTAACTAGGAAGAGTTCGTAAATAGTCTCAGCTACAATATCAACCTTTGGCCCTCCCCATATCTTGGTATTCTCTTGATTGTAGCTCTAGAATACAAAAAAGCCTCATTTTGTACTTTGAAAAACTGTGGTGCTGGTAGAATATTTTGTGAGGTAAAGAAACAGGGAAATAAAAACCCTTTCTAGGGTGGGTTACTTAGCTTTTGATACTGGAAAGGTTGGTGCATGATTGGAAATGTTGGATTAGAGTTTGAAACTTGGGATGAAGAATCTTCCTACTAGATGTATGATTTGACGTCAATATCGAACTAGTATTTTCCAAGTGAACAAGAAAACCAAATTTCTATTTGGAATTTGTTTTTTAGCTTTTGTTGTAGATGTAGACTGAATTTTCTCTCTTGGTATAATACTCTTGTACTTTTTGCTTAAGGCTCTTTTACTAATAATAATTTATTATTAAGAGGCTTGTCTTCGCTTAAAAAAAAGGAAACCAAATAGGTCACGTTGCCTGCTCCAACAAATCAAGGAAACCAAATAGGTCACGTTGCCTGCTCCAACAAATTGGAAAACGCTCACTTCCAAAAAAAATCCTTCTAATCAAACTCTGATTTAAAAAATGACCCTACTATTCCAGACTTGGTCTCGCATATTGTTTGAAAAATGTTTATCCTTTCCTTCTTTCTGGATGTGATTTGACCTGCTGATGCACTTTTTAGCGTCAACTTTTTATGCTTGGTATATATTGCATTTTTGGTATAAGTCGCTTGTACTCTTATTGCTTGTTGGTATGTTAATTGCTACGCAGGTGAGCAGATTGAAGGATGCAATGGCCAAAATCTATCGACAGTTAGAGGTACACAACTGCCCCCCCAATAATATATATTTGCATTATTATAGTTTTATGATACCATTTTTATTAAAAGCAGGAGATCAAGGAAACCGTGGTGGCAGATTTCCGCCACCAAGTATCTCTTTTGGTTCGTCCAATGCACCAGGTTTGTAGTGTGTTCTCTTTCAAAACTTATACAAGTACCTCAAACTTATAAAATTAACCACAATCAAACCTTGTTATAACTTTATTGATGAGTGGTTACAATTGGGTTCTAAATCCCCAATCGTAAACCACTTTTAAACTAGATCCACTTTTAAACTAGATGTAATAATACGCGACCACAAAAGTTGATGGTATTGTTGAATTAGAACCCTTTGTTTTGTGAGTTTAGGAGGATTGGATTATAATAGCCCTTTATT
TTCTTATGCATGGTCGTGAGCATGGTGGATGAGAAGATTTGTTTGATTCATGTTTTGTAGGCTCTGGATGCTGCGTTTCAGCATCATGACGCTGACATGCAGAAGAGATCAGTAAAGAGTGGAGAAAGGCTTAACGGATACACTTAGAAATCATCAAATATAGAATCTCTCTTTCAATTCCCTCACTGAGTGTTCATCTTCAGATGAAATTTTGCGGTGGACAGCCATCATCATGTGTTGGACAATATCTAATCTATTGCCTTCAGTAGTAAATTAACCTTGTTCTTCTTGCATCATTCTACCTTTAGAAAAACCTTGTTCTTGAGTTCTAAAAAAATGAAATGAAATGAACAAGGAATAAAACGAAGTCATTCAACAAAGTATGACCAAACTAACATCCTAC

mRNA sequence

ATGTTTCAGATTTTTGCTCTGAAACCAGCTGTTCACCAAGCATTTCTTAATCATGTTCCCAATAAGATATCGGAGAAAGACTTTTGGACAAAATATTTTAGAGCGGAGTACCTTCATAGTACAAAAAATTCTATTGCAGCTGCAGCAGAGGCTGCTGAAGATGAAGAACTTGCCCTTTTTCTGAAGGACGATGAGATATTGGCTGCTGAAACTCGGAAAAAGATTCGGCATGTTGATCCTACATTGGATTTGGAAGCTGATCTAGGGGATGATTACACACACCTTCCAGATCATGGAATCTTTCGTGATGGTGGCAAGGAAATAACTGAATCACAAAATGAGCACTATAGAAGGACTTTGTCACAAGACCTTAATCGTCAAGGTGCAGTTGTTCTTGAAGGCAGAACTATTGATGTAGACTTGGAAGATCCAAGGACCGTTGCAGATGCACTTGTGCGGTCTAAACATGCGGTAGAAGGAAATGAGAGCCAGACAGCACTTGATAGGATCTCTAGGATGACAGCAATTGAGGATCTTCAAGCTCCTCATAGCCATCCTTTTGCTCCTCTTTGTATCAAGGATCCTCGAGATTATTTTGATGCTCAACAAGCAAATGCCATTAAGACATTGGATGATACACGAGCTGGAATGCAACAAACTAAATGCAGTTTAAGCACTACAGAAGCATATGGCTCACTGAGGGAATCCATATCTGAGATCAAGTCATCTGGATTCAATCATCCCATAATTAAACCGGAAGTTGCACTTATGGTTTATAATGGATTAACTCAAAATATTTCTAGTACCAAATATCAACTTGGGAAAAACCCTCAGGAGAGTATTCTGGAGAGTTTGCCAAACCCTACTAAGGAGGAACTGCTACACCATTGGATCTCGATTCAAGAATTACTCAAGCATTTTTGGTCGTCTTATCCAATCACCACATCATATCTTTATACCAAAGTGAGCAGATTGAAGGATGCAATGGCCAAAATCTATCGACAGTTAGAGGAGATCAAGGAAACCGTGGTGGCAGATTTCCGCCACCAAGTATCTCTTTTGGTTCGTCCAATGCACCAGGCTCTGGATGCTGCGTTTCAGCATCATGACGCTGACATGCAGAAGAGATCAGTAAAGAGTGGAGAAAGGCTTAACGGATACACTTAGAAATCATCAAATATAGAATCTCTCTTTCAATTCCCTCACTGAGTGTTCATCTTCAGATGAAATTTTGCGGTGGACAGCCATCATCATGTGTTGGACAATATCTAATCTATTGCCTTCAGTAGTAAATTAACCTTGTTCTTCTTGCATCATTCTACCTTTAGAAAAACCTTGTTCTTGAGTTCTAAAAAAATGAAATGAAATGAACAAGGAATAAAACGAAGTCATTCAACAAAGTATGACCAAACTAACATCCTAC

Coding sequence (CDS)

ATGTTTCAGATTTTTGCTCTGAAACCAGCTGTTCACCAAGCATTTCTTAATCATGTTCCCAATAAGATATCGGAGAAAGACTTTTGGACAAAATATTTTAGAGCGGAGTACCTTCATAGTACAAAAAATTCTATTGCAGCTGCAGCAGAGGCTGCTGAAGATGAAGAACTTGCCCTTTTTCTGAAGGACGATGAGATATTGGCTGCTGAAACTCGGAAAAAGATTCGGCATGTTGATCCTACATTGGATTTGGAAGCTGATCTAGGGGATGATTACACACACCTTCCAGATCATGGAATCTTTCGTGATGGTGGCAAGGAAATAACTGAATCACAAAATGAGCACTATAGAAGGACTTTGTCACAAGACCTTAATCGTCAAGGTGCAGTTGTTCTTGAAGGCAGAACTATTGATGTAGACTTGGAAGATCCAAGGACCGTTGCAGATGCACTTGTGCGGTCTAAACATGCGGTAGAAGGAAATGAGAGCCAGACAGCACTTGATAGGATCTCTAGGATGACAGCAATTGAGGATCTTCAAGCTCCTCATAGCCATCCTTTTGCTCCTCTTTGTATCAAGGATCCTCGAGATTATTTTGATGCTCAACAAGCAAATGCCATTAAGACATTGGATGATACACGAGCTGGAATGCAACAAACTAAATGCAGTTTAAGCACTACAGAAGCATATGGCTCACTGAGGGAATCCATATCTGAGATCAAGTCATCTGGATTCAATCATCCCATAATTAAACCGGAAGTTGCACTTATGGTTTATAATGGATTAACTCAAAATATTTCTAGTACCAAATATCAACTTGGGAAAAACCCTCAGGAGAGTATTCTGGAGAGTTTGCCAAACCCTACTAAGGAGGAACTGCTACACCATTGGATCTCGATTCAAGAATTACTCAAGCATTTTTGGTCGTCTTATCCAATCACCACATCATATCTTTATACCAAAGTGAGCAGATTGAAGGATGCAATGGCCAAAATCTATCGACAGTTAGAGGAGATCAAGGAAACCGTGGTGGCAGATTTCCGCCACCAAGTATCTCTTTTGGTTCGTCCAATGCACCAGGCTCTGGATGCTGCGTTTCAGCATCATGACGCTGACATGCAGAAGAGATCAGTAAAGAGTGGAGAAAGGCTTAACGGATACACTTAG

Protein sequence

MFQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLSQDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIKSSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQALDAAFQHHDADMQKRSVKSGERLNGYT*
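
The protein sequence above corresponds to the CDS read codon by codon, with `*` marking the stop codon. The sketch below illustrates that relationship in Python using the standard genetic code (NCBI translation table 1). The helper name `translate` is hypothetical, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )

cds_start = "ATGTTTCAGATTTTTGCTCTGAAACCAGCT"  # first 30 nt of the CDS
cds_end = "GGATACACTTAG"                      # last 12 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues and the last three residues plus the stop symbol of the protein listed above.
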
Homology
BLAST of CSPI04G16980 vs. ExPASy Swiss-Prot
Match: Q3ECP0 (General transcription and DNA repair factor IIH subunit TFB1-1 OS=Arabidopsis thaliana OX=3702 GN=TFB1-1 PE=2 SV=1)

HSP 1 Score: 473.4 bits (1217), Expect = 2.5e-132
Identity = 241/393 (61.32%), Positives = 305/393 (77.61%), Query Frame = 0

Query: 1   MFQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALF 60
           +FQIFA KPAV QAF+N+VP+K++EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+F
Sbjct: 204 IFQIFAEKPAVRQAFINYVPSKMTEKDFWTKYFRAEYLYSTKNTAVAAAEAAEDEELAVF 263

Query: 61  LKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTL 120
           LK DEILA ETR KIR VDPTLD+EAD GDDYTHL DHGI RDG  ++ E QN+ ++R+L
Sbjct: 264 LKPDEILARETRHKIRRVDPTLDMEADQGDDYTHLMDHGIQRDGTMDVVEPQNDQFKRSL 323

Query: 121 SQDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEG------NESQTALDRISRMT 180
            QDLNR  AVVLEGR+IDV+ ED R VA+AL R K   +       + +Q  L+R+SR+ 
Sbjct: 324 LQDLNRHAAVVLEGRSIDVESEDTRIVAEALTRVKQVSKADGETTKDANQERLERMSRVA 383

Query: 181 AIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLR 240
            +EDLQAP + P APL IKDPRDYF++QQ N +      + G+++     +  EAYG L+
Sbjct: 384 GMEDLQAPQNFPLAPLSIKDPRDYFESQQGNVLNVPRGAK-GLKR-----NVHEAYGLLK 443

Query: 241 ESISEIKSSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELL 300
           ESI EI+++G + P+IKPEV+  V++ LT+ I++ K   GKNP+ES L+ LP  TK+E+L
Sbjct: 444 ESILEIRATGLSDPLIKPEVSFEVFSSLTRTIATAKNINGKNPRESFLDRLPKSTKDEVL 503

Query: 301 HHWISIQELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLL 360
           HHW SIQELLKHFWSSYPITT+YL+TKV +LKDAM+  Y +LE +KE+V +D RHQVSLL
Sbjct: 504 HHWTSIQELLKHFWSSYPITTTYLHTKVGKLKDAMSNTYSKLEAMKESVQSDLRHQVSLL 563

Query: 361 VRPMHQALDAAFQHHDADMQKRSVKSGERLNGY 388
           VRPM QALDAAF H++ D+Q+R+ KSGER NGY
Sbjct: 564 VRPMQQALDAAFHHYEVDLQRRTAKSGERPNGY 590

BLAST of CSPI04G16980 vs. ExPASy Swiss-Prot
Match: Q9M322 (General transcription and DNA repair factor IIH subunit TFB1-3 OS=Arabidopsis thaliana OX=3702 GN=TFB1-3 PE=2 SV=2)

HSP 1 Score: 449.9 bits (1156), Expect = 2.9e-125
Identity = 230/385 (59.74%), Positives = 292/385 (75.84%), Query Frame = 0

Query: 1   MFQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALF 60
           +FQIFA KPAV QAF+N+VP K++EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+F
Sbjct: 199 IFQIFAEKPAVRQAFINYVPKKMTEKDFWTKYFRAEYLYSTKNTAVAAAEAAEDEELAVF 258

Query: 61  LKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTL 120
           LK DEILA E R+K+R VDPTLD++AD GDDYTHL DHGI RDG  +I E QN+  +R+L
Sbjct: 259 LKPDEILAQEARQKMRRVDPTLDMDADEGDDYTHLMDHGIQRDGTNDIIEPQNDQLKRSL 318

Query: 121 SQDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHA------VEGNESQTALDRISRMT 180
            QDLNR  AVVLEGR I+V  ED R VA+AL R+K        +  + +Q  L+R+SR T
Sbjct: 319 LQDLNRHAAVVLEGRCINVQSEDTRIVAEALTRAKQVSKADGEITKDANQERLERMSRAT 378

Query: 181 AIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLR 240
            +EDLQAP + P APL IKDPRDYF++QQ N +      +A  +      +  EAYG L+
Sbjct: 379 EMEDLQAPQNFPLAPLSIKDPRDYFESQQGNILSEPRGAKASKR------NVHEAYGLLK 438

Query: 241 ESISEIKSSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELL 300
           ESI  I+ +G + P+IKPEV+  V++ LT+ IS+ K  LGKNPQES L+ LP  TK+E++
Sbjct: 439 ESILVIRMTGLSDPLIKPEVSFEVFSSLTRTISTAKNILGKNPQESFLDRLPKSTKDEVI 498

Query: 301 HHWISIQELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLL 360
           HHW SIQEL++HFWSSYPITT+YL TKV +LKDAM+  Y  L+ +K++V +D RHQVSLL
Sbjct: 499 HHWTSIQELVRHFWSSYPITTTYLSTKVGKLKDAMSNTYSLLDAMKQSVQSDLRHQVSLL 558

Query: 361 VRPMHQALDAAFQHHDADMQKRSVK 380
           VRPM QALDAAFQH+++D+Q+R+ K
Sbjct: 559 VRPMQQALDAAFQHYESDLQRRTAK 577

BLAST of CSPI04G16980 vs. ExPASy Swiss-Prot
Match: P32780 (General transcription factor IIH subunit 1 OS=Homo sapiens OX=9606 GN=GTF2H1 PE=1 SV=1)

HSP 1 Score: 73.2 bits (178), Expect = 7.4e-12
Identity = 90/390 (23.08%), Positives = 164/390 (42.05%), Query Frame = 0

Query: 4   IFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAA---AEAAEDEELALF 63
           IF   PAV   +  +VP+ ++EK+FWT++F++ Y H  + +  +    AE A+ +E  L 
Sbjct: 196 IFRTYPAVKMKYAENVPHNMTEKEFWTRFFQSHYFHRDRLNTGSKDLFAECAKIDEKGL- 255

Query: 64  LKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTL 123
                    +T   +   +P LDL A   +D      +GI        ++S  E+    +
Sbjct: 256 ---------KTMVSLGVKNPLLDLTA--LEDKPLDEGYGISSVPSASNSKSIKENSNAAI 315

Query: 124 SQDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAI--ED 183
            +  N   A+VL       + ++ +T ++      ++ + +  Q A+ R     +I  ED
Sbjct: 316 IKRFNHHSAMVLAAGLRKQEAQNEQT-SEPSNMDGNSGDADCFQPAVKRAKLQESIEYED 375

Query: 184 LQAPHSHPFAPLCIKDPRDYFDAQ---QANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRE 243
           L   +S     L +K    Y+      Q+    T  D     Q  +  +   EAY     
Sbjct: 376 LGKNNSVKTIALNLKKSDRYYHGPTPIQSLQYATSQDIINSFQSIRQEM---EAYTPKLT 435

Query: 244 SISEIKSSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLH 303
            +    ++      + P  ALM   G T              Q++I + +PN  + EL H
Sbjct: 436 QVLSSSAASSTITALSPGGALM--QGGT--------------QQAINQMVPNDIQSELKH 495

Query: 304 HWISIQELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQV---- 363
            ++++ ELL+HFWS +P+ T +L  KV ++K         LE  + T +  F+ ++    
Sbjct: 496 LYVAVGELLRHFWSCFPVNTPFLEEKVVKMKS-------NLERFQVTKLCPFQEKIRRQY 546

Query: 364 --SLLVRPMHQALDAAFQHHDADMQKRSVK 380
             + LV  + + L  A+        +R +K
Sbjct: 556 LSTNLVSHIEEMLQTAYNKLHTWQSRRLMK 546

BLAST of CSPI04G16980 vs. ExPASy Swiss-Prot
Match: Q9DBA9 (General transcription factor IIH subunit 1 OS=Mus musculus OX=10090 GN=Gtf2h1 PE=1 SV=2)

HSP 1 Score: 68.9 bits (167), Expect = 1.4e-10
Identity = 92/401 (22.94%), Positives = 160/401 (39.90%), Query Frame = 0

Query: 4   IFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAA---AEAAEDEELALF 63
           IF   PAV   +   VP+ ++EK+FWT++F++ Y H  + +  +    AE A+ +E  L 
Sbjct: 195 IFRTYPAVKMKYAETVPHNMTEKEFWTRFFQSHYFHRDRLNTGSKDLFAECAKIDEKGL- 254

Query: 64  LKDDEILAAETRKKIRHVDPTLDLEA------DLGDDYTHLPDHGIFRDGGKEITESQNE 123
                    +T   +   +P LDL +      D G   + +P         K I E+ N 
Sbjct: 255 ---------KTMVSLGVKNPMLDLTSLEDKPLDEGYSISSVPS----TSNSKSIKENSN- 314

Query: 124 HYRRTLSQDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQT-----ALDR 183
                + +  N   A+VL         ++ +           +V+GN   T     A+ R
Sbjct: 315 ---AAIIKRFNHHSAMVLAAGLRKQQAQNGQN------GEPSSVDGNSGDTDCFQPAVKR 374

Query: 184 ISRMTAI--EDLQAPHSHPFAPLCIKDPRDYFDAQ---QANAIKTLDDTRAGMQQTKCSL 243
                +I  EDL   +S     L +K    Y+      Q+    T  D     Q  +  +
Sbjct: 375 AKLQESIEYEDLGNNNSVKTIALNLKKSDRYYHGPTPIQSLQYATSQDIINSFQSIRQEM 434

Query: 244 STTEAYGSLRESISEIKSSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILES 303
              EAY      +    ++      + P  ALM   G T              Q+++ + 
Sbjct: 435 ---EAYTPKLTQVLSSSAASSTITALSPGGALM--QGGT--------------QQAVNQM 494

Query: 304 LPNPTKEELLHHWISIQELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVV 363
           +PN  + EL H ++++ ELL+HFWS +P+ T +L  KV ++K         LE  + T +
Sbjct: 495 VPNDIQSELKHLYVAVGELLRHFWSCFPVNTPFLEEKVVKMKS-------NLERFQVTKL 545

Query: 364 ADFRHQV------SLLVRPMHQALDAAFQHHDADMQKRSVK 380
             F+ ++      + LV  + + L  A+        +R +K
Sbjct: 555 CPFQEKIRRQYLSTNLVSHIEEMLQTAYNKLHTWQSRRLMK 545

BLAST of CSPI04G16980 vs. ExPASy Swiss-Prot
Match: Q55FP1 (General transcription factor IIH subunit 1 OS=Dictyostelium discoideum OX=44689 GN=gtf2h1 PE=3 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 2.0e-09
Identity = 85/432 (19.68%), Positives = 182/432 (42.13%), Query Frame = 0

Query: 3   QIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFLK 62
           QIF   P+V +A+  +VP KISE++FW KY +++Y +  ++S  A A   +D+  + +  
Sbjct: 270 QIFIQHPSVEKAYKANVPLKISEQNFWKKYVQSKYFYRDRSS--ANAPPVDDDLFSKYET 329

Query: 63  DDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKE---------ITESQN 122
           D++      ++K+  ++P +DL +  G D      +G+  D  ++         +    N
Sbjct: 330 DEQNKIRILKRKLIDINPLVDLSSTDGFDTDVHSGYGVLLDQSQDPNKLEKALPLLRKFN 389

Query: 123 EHYRRTL-SQDLNRQGAVVLE--------------------------GRTIDVDLEDPRT 182
            H    L S+DL    ++ +E                            T   +     T
Sbjct: 390 RHSALVLGSKDLLTNNSINIEKDQKNLKKTKKDENSTSTPTTTTTTTNTTNTTNTTTTTT 449

Query: 183 VADALVRSKHAVEGNESQT----ALDRI--------SRMTAIEDLQAPHSHPFAPLCIKD 242
             +  ++  +   G++ +      +++I        ++   I+DLQ  +S     L I D
Sbjct: 450 TNNTTIKDPNLYNGDDEENISVEQMEKILENHKKLVNQHIIIDDLQEENSQTLTLLKISD 509

Query: 243 PRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIKSSGFNHPIIKPEV 302
            + YF+    N I  L D     ++++        Y + + ++ ++     +   I  E 
Sbjct: 510 QKRYFEGHSTNNI--LSD----KEKSQLIDILDFDYKNWQPNLPQVFYQTHSSSSILQEP 569

Query: 303 ALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQELLKHFWSSY--- 362
            + V++ + +  +  K  +    + ++ ES     K +L   +    ELL+HFW++    
Sbjct: 570 NISVHSEIFEPYN--KAAINSKEEYNLPES---SFKRDLFQSFHHCNELLRHFWATTFTL 629

Query: 363 ---PITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQALDAAFQH 381
                 TS    K +++  A+A  Y ++EE K+ +++  +   S L  P+ ++L  A + 
Sbjct: 630 GRGAPPTSQQIDKNNKISSAIALQYDKIEEKKKMLISQNKVNQSSLFTPILESLHKAIEK 688

BLAST of CSPI04G16980 vs. ExPASy TrEMBL
Match: A0A0A0L1N5 (BSD domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G338420 PE=4 SV=1)

HSP 1 Score: 769.6 bits (1986), Expect = 6.2e-219
Identity = 388/388 (100.00%), Positives = 388/388 (100.00%), Query Frame = 0

Query: 1   MFQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALF 60
           MFQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALF
Sbjct: 1   MFQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALF 60

Query: 61  LKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTL 120
           LKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTL
Sbjct: 61  LKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTL 120

Query: 121 SQDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQ 180
           SQDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQ
Sbjct: 121 SQDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQ 180

Query: 181 APHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEI 240
           APHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEI
Sbjct: 181 APHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEI 240

Query: 241 KSSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISI 300
           KSSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISI
Sbjct: 241 KSSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISI 300

Query: 301 QELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQ 360
           QELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQ
Sbjct: 301 QELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQ 360

Query: 361 ALDAAFQHHDADMQKRSVKSGERLNGYT 389
           ALDAAFQHHDADMQKRSVKSGERLNGYT
Sbjct: 361 ALDAAFQHHDADMQKRSVKSGERLNGYT 388

BLAST of CSPI04G16980 vs. ExPASy TrEMBL
Match: A0A1S3C6E8 (probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497008 PE=4 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 7.8e-214
Identity = 377/387 (97.42%), Positives = 385/387 (99.48%), Query Frame = 0

Query: 2   FQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 61
           +QIFALKPAVHQAFLNHVPNK+SEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL
Sbjct: 209 YQIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 268

Query: 62  KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 121
           KDDEILAA+TRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS
Sbjct: 269 KDDEILAADTRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 328

Query: 122 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 181
           QDLNRQGAVVLEGRTIDVDLEDPRTVA+ALVRS+HAVEGNE QTALDRISRMTAIEDLQA
Sbjct: 329 QDLNRQGAVVLEGRTIDVDLEDPRTVAEALVRSRHAVEGNERQTALDRISRMTAIEDLQA 388

Query: 182 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 241
           PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAY SLRESISEIK
Sbjct: 389 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYCSLRESISEIK 448

Query: 242 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 301
           SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ
Sbjct: 449 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 508

Query: 302 ELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 361
           ELLKHFWSSYPITTSYLYTKVSRLKDAM+KIYRQLEEIKETVVADFRHQVSLLVRPMHQA
Sbjct: 509 ELLKHFWSSYPITTSYLYTKVSRLKDAMSKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 568

Query: 362 LDAAFQHHDADMQKRSVKSGERLNGYT 389
           LDAAFQHHDAD+QKRSVKSGER+NGYT
Sbjct: 569 LDAAFQHHDADLQKRSVKSGERVNGYT 595

BLAST of CSPI04G16980 vs. ExPASy TrEMBL
Match: A0A1S4E1L0 (probable RNA polymerase II transcription factor B subunit 1-1 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103497008 PE=4 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 7.8e-214
Identity = 377/387 (97.42%), Positives = 385/387 (99.48%), Query Frame = 0

Query: 2   FQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 61
           +QIFALKPAVHQAFLNHVPNK+SEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL
Sbjct: 123 YQIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 182

Query: 62  KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 121
           KDDEILAA+TRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS
Sbjct: 183 KDDEILAADTRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 242

Query: 122 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 181
           QDLNRQGAVVLEGRTIDVDLEDPRTVA+ALVRS+HAVEGNE QTALDRISRMTAIEDLQA
Sbjct: 243 QDLNRQGAVVLEGRTIDVDLEDPRTVAEALVRSRHAVEGNERQTALDRISRMTAIEDLQA 302

Query: 182 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 241
           PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAY SLRESISEIK
Sbjct: 303 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYCSLRESISEIK 362

Query: 242 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 301
           SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ
Sbjct: 363 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 422

Query: 302 ELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 361
           ELLKHFWSSYPITTSYLYTKVSRLKDAM+KIYRQLEEIKETVVADFRHQVSLLVRPMHQA
Sbjct: 423 ELLKHFWSSYPITTSYLYTKVSRLKDAMSKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 482

Query: 362 LDAAFQHHDADMQKRSVKSGERLNGYT 389
           LDAAFQHHDAD+QKRSVKSGER+NGYT
Sbjct: 483 LDAAFQHHDADLQKRSVKSGERVNGYT 509

BLAST of CSPI04G16980 vs. ExPASy TrEMBL
Match: A0A1S4E1K9 (probable RNA polymerase II transcription factor B subunit 1-1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103497008 PE=4 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 7.8e-214
Identity = 377/387 (97.42%), Positives = 385/387 (99.48%), Query Frame = 0

Query: 2   FQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 61
           +QIFALKPAVHQAFLNHVPNK+SEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL
Sbjct: 127 YQIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 186

Query: 62  KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 121
           KDDEILAA+TRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS
Sbjct: 187 KDDEILAADTRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 246

Query: 122 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 181
           QDLNRQGAVVLEGRTIDVDLEDPRTVA+ALVRS+HAVEGNE QTALDRISRMTAIEDLQA
Sbjct: 247 QDLNRQGAVVLEGRTIDVDLEDPRTVAEALVRSRHAVEGNERQTALDRISRMTAIEDLQA 306

Query: 182 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 241
           PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAY SLRESISEIK
Sbjct: 307 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYCSLRESISEIK 366

Query: 242 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 301
           SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ
Sbjct: 367 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 426

Query: 302 ELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 361
           ELLKHFWSSYPITTSYLYTKVSRLKDAM+KIYRQLEEIKETVVADFRHQVSLLVRPMHQA
Sbjct: 427 ELLKHFWSSYPITTSYLYTKVSRLKDAMSKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 486

Query: 362 LDAAFQHHDADMQKRSVKSGERLNGYT 389
           LDAAFQHHDAD+QKRSVKSGER+NGYT
Sbjct: 487 LDAAFQHHDADLQKRSVKSGERVNGYT 513

BLAST of CSPI04G16980 vs. ExPASy TrEMBL
Match: A0A1S4E1L2 (probable RNA polymerase II transcription factor B subunit 1-1 isoform X4 OS=Cucumis melo OX=3656 GN=LOC103497008 PE=4 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 7.8e-214
Identity = 377/387 (97.42%), Positives = 385/387 (99.48%), Query Frame = 0

Query: 2   FQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 61
           +QIFALKPAVHQAFLNHVPNK+SEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL
Sbjct: 115 YQIFALKPAVHQAFLNHVPNKMSEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 174

Query: 62  KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 121
           KDDEILAA+TRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS
Sbjct: 175 KDDEILAADTRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 234

Query: 122 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 181
           QDLNRQGAVVLEGRTIDVDLEDPRTVA+ALVRS+HAVEGNE QTALDRISRMTAIEDLQA
Sbjct: 235 QDLNRQGAVVLEGRTIDVDLEDPRTVAEALVRSRHAVEGNERQTALDRISRMTAIEDLQA 294

Query: 182 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 241
           PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAY SLRESISEIK
Sbjct: 295 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYCSLRESISEIK 354

Query: 242 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 301
           SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ
Sbjct: 355 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 414

Query: 302 ELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 361
           ELLKHFWSSYPITTSYLYTKVSRLKDAM+KIYRQLEEIKETVVADFRHQVSLLVRPMHQA
Sbjct: 415 ELLKHFWSSYPITTSYLYTKVSRLKDAMSKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 474

Query: 362 LDAAFQHHDADMQKRSVKSGERLNGYT 389
           LDAAFQHHDAD+QKRSVKSGER+NGYT
Sbjct: 475 LDAAFQHHDADLQKRSVKSGERVNGYT 501

BLAST of CSPI04G16980 vs. NCBI nr
Match: XP_011653743.1 (general transcription and DNA repair factor IIH subunit TFB1-1 isoform X2 [Cucumis sativus])

HSP 1 Score: 766.5 bits (1978), Expect = 1.1e-217
Identity = 386/387 (99.74%), Positives = 387/387 (100.00%), Query Frame = 0

Query: 2   FQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 61
           +QIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL
Sbjct: 209 YQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 268

Query: 62  KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 121
           KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS
Sbjct: 269 KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 328

Query: 122 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 181
           QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA
Sbjct: 329 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 388

Query: 182 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 241
           PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK
Sbjct: 389 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 448

Query: 242 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 301
           SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ
Sbjct: 449 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 508

Query: 302 ELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 361
           ELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQA
Sbjct: 509 ELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 568

Query: 362 LDAAFQHHDADMQKRSVKSGERLNGYT 389
           LDAAFQHHDADMQKRSVKSGERLNGYT
Sbjct: 569 LDAAFQHHDADMQKRSVKSGERLNGYT 595

BLAST of CSPI04G16980 vs. NCBI nr
Match: KGN54745.2 (hypothetical protein Csa_012761 [Cucumis sativus])

HSP 1 Score: 766.5 bits (1978), Expect = 1.1e-217
Identity = 386/387 (99.74%), Positives = 387/387 (100.00%), Query Frame = 0

Query: 2   FQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 61
           +QIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL
Sbjct: 342 YQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 401

Query: 62  KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 121
           KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS
Sbjct: 402 KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 461

Query: 122 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 181
           QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA
Sbjct: 462 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 521

Query: 182 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 241
           PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK
Sbjct: 522 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 581

Query: 242 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 301
           SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ
Sbjct: 582 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 641

Query: 302 ELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 361
           ELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQA
Sbjct: 642 ELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLLVRPMHQA 701

Query: 362 LDAAFQHHDADMQKRSVKSGERLNGYT 389
           LDAAFQHHDADMQKRSVKSGERLNGYT
Sbjct: 702 LDAAFQHHDADMQKRSVKSGERLNGYT 728

BLAST of CSPI04G16980 vs. NCBI nr
Match: XP_031739954.1 (general transcription and DNA repair factor IIH subunit TFB1-1 isoform X4 [Cucumis sativus])

HSP 1 Score: 756.9 bits (1953), Expect = 8.5e-215
Identity = 386/401 (96.26%), Positives = 387/401 (96.51%), Query Frame = 0

Query: 2   FQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 61
           +QIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL
Sbjct: 83  YQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 142

Query: 62  KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 121
           KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS
Sbjct: 143 KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 202

Query: 122 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 181
           QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA
Sbjct: 203 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 262

Query: 182 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 241
           PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK
Sbjct: 263 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 322

Query: 242 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 301
           SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ
Sbjct: 323 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 382

Query: 302 ELLKHFWSSYPITTSYLY--------------TKVSRLKDAMAKIYRQLEEIKETVVADF 361
           ELLKHFWSSYPITTSYLY              TKVSRLKDAMAKIYRQLEEIKETVVADF
Sbjct: 383 ELLKHFWSSYPITTSYLYTKILEVCGINFCFQTKVSRLKDAMAKIYRQLEEIKETVVADF 442

Query: 362 RHQVSLLVRPMHQALDAAFQHHDADMQKRSVKSGERLNGYT 389
           RHQVSLLVRPMHQALDAAFQHHDADMQKRSVKSGERLNGYT
Sbjct: 443 RHQVSLLVRPMHQALDAAFQHHDADMQKRSVKSGERLNGYT 483
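
In this hit the alignment length is 401 rather than 387 because the subject carries a 14-residue insertion, shown as a run of gap characters in the query row. The sketch below is an illustration added here, not output of the original analysis. It counts identical and gapped columns for that one row; the function name `alignment_stats` is hypothetical.

```python
def alignment_stats(query_row, sbjct_row):
    """Count identical columns, gap columns, and total columns of one
    aligned row pair. Both rows must have the same length."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    pairs = list(zip(query_row, sbjct_row))
    # A column is identical when both residues match and neither is a gap.
    identities = sum(1 for q, s in pairs if q == s and q != "-")
    # A column is a gap column when either row has a gap character.
    gaps = sum(1 for q, s in pairs if q == "-" or s == "-")
    return identities, gaps, len(pairs)


# The gapped row from the alignment above (14 gap characters in the query).
query_row = "ELLKHFWSSYPITTSYLY" + "-" * 14 + "TKVSRLKDAMAKIYRQLEEIKETVVADF"
sbjct_row = "ELLKHFWSSYPITTSYLYTKILEVCGINFCFQTKVSRLKDAMAKIYRQLEEIKETVVADF"

if __name__ == "__main__":
    print(alignment_stats(query_row, sbjct_row))
```

The 14 gap columns added to the 387 columns of the ungapped hits give the 401-column denominator used in the identity and positives percentages for this hit.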

BLAST of CSPI04G16980 vs. NCBI nr
Match: XP_031739953.1 (general transcription and DNA repair factor IIH subunit TFB1-1 isoform X3 [Cucumis sativus])

HSP 1 Score: 756.9 bits (1953), Expect = 8.5e-215
Identity = 386/401 (96.26%), Positives = 387/401 (96.51%), Query Frame = 0

Query: 2   FQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 61
           +QIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL
Sbjct: 123 YQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 182

Query: 62  KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 121
           KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS
Sbjct: 183 KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 242

Query: 122 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 181
           QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA
Sbjct: 243 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 302

Query: 182 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 241
           PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK
Sbjct: 303 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 362

Query: 242 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 301
           SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ
Sbjct: 363 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 422

Query: 302 ELLKHFWSSYPITTSYLY--------------TKVSRLKDAMAKIYRQLEEIKETVVADF 361
           ELLKHFWSSYPITTSYLY              TKVSRLKDAMAKIYRQLEEIKETVVADF
Sbjct: 423 ELLKHFWSSYPITTSYLYTKILEVCGINFCFQTKVSRLKDAMAKIYRQLEEIKETVVADF 482

Query: 362 RHQVSLLVRPMHQALDAAFQHHDADMQKRSVKSGERLNGYT 389
           RHQVSLLVRPMHQALDAAFQHHDADMQKRSVKSGERLNGYT
Sbjct: 483 RHQVSLLVRPMHQALDAAFQHHDADMQKRSVKSGERLNGYT 523

BLAST of CSPI04G16980 vs. NCBI nr
Match: XP_031739948.1 (general transcription and DNA repair factor IIH subunit TFB1-1 isoform X1 [Cucumis sativus] >XP_031739949.1 general transcription and DNA repair factor IIH subunit TFB1-1 isoform X1 [Cucumis sativus] >XP_031739950.1 general transcription and DNA repair factor IIH subunit TFB1-1 isoform X1 [Cucumis sativus] >XP_031739951.1 general transcription and DNA repair factor IIH subunit TFB1-1 isoform X1 [Cucumis sativus])

HSP 1 Score: 756.9 bits (1953), Expect = 8.5e-215
Identity = 386/401 (96.26%), Positives = 387/401 (96.51%), Query Frame = 0

Query: 2   FQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 61
           +QIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL
Sbjct: 209 YQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALFL 268

Query: 62  KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 121
           KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS
Sbjct: 269 KDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTLS 328

Query: 122 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 181
           QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA
Sbjct: 329 QDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEGNESQTALDRISRMTAIEDLQA 388

Query: 182 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 241
           PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK
Sbjct: 389 PHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLRESISEIK 448

Query: 242 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 301
           SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ
Sbjct: 449 SSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELLHHWISIQ 508

Query: 302 ELLKHFWSSYPITTSYLY--------------TKVSRLKDAMAKIYRQLEEIKETVVADF 361
           ELLKHFWSSYPITTSYLY              TKVSRLKDAMAKIYRQLEEIKETVVADF
Sbjct: 509 ELLKHFWSSYPITTSYLYTKILEVCGINFCFQTKVSRLKDAMAKIYRQLEEIKETVVADF 568

Query: 362 RHQVSLLVRPMHQALDAAFQHHDADMQKRSVKSGERLNGYT 389
           RHQVSLLVRPMHQALDAAFQHHDADMQKRSVKSGERLNGYT
Sbjct: 569 RHQVSLLVRPMHQALDAAFQHHDADMQKRSVKSGERLNGYT 609

BLAST of CSPI04G16980 vs. TAIR 10
Match: AT1G55750.1 (BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins) )

HSP 1 Score: 473.4 bits (1217), Expect = 1.8e-133
Identity = 241/393 (61.32%), Positives = 305/393 (77.61%), Query Frame = 0

Query: 1   MFQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALF 60
           +FQIFA KPAV QAF+N+VP+K++EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+F
Sbjct: 204 IFQIFAEKPAVRQAFINYVPSKMTEKDFWTKYFRAEYLYSTKNTAVAAAEAAEDEELAVF 263

Query: 61  LKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTL 120
           LK DEILA ETR KIR VDPTLD+EAD GDDYTHL DHGI RDG  ++ E QN+ ++R+L
Sbjct: 264 LKPDEILARETRHKIRRVDPTLDMEADQGDDYTHLMDHGIQRDGTMDVVEPQNDQFKRSL 323

Query: 121 SQDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHAVEG------NESQTALDRISRMT 180
            QDLNR  AVVLEGR+IDV+ ED R VA+AL R K   +       + +Q  L+R+SR+ 
Sbjct: 324 LQDLNRHAAVVLEGRSIDVESEDTRIVAEALTRVKQVSKADGETTKDANQERLERMSRVA 383

Query: 181 AIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLR 240
            +EDLQAP + P APL IKDPRDYF++QQ N +      + G+++     +  EAYG L+
Sbjct: 384 GMEDLQAPQNFPLAPLSIKDPRDYFESQQGNVLNVPRGAK-GLKR-----NVHEAYGLLK 443

Query: 241 ESISEIKSSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELL 300
           ESI EI+++G + P+IKPEV+  V++ LT+ I++ K   GKNP+ES L+ LP  TK+E+L
Sbjct: 444 ESILEIRATGLSDPLIKPEVSFEVFSSLTRTIATAKNINGKNPRESFLDRLPKSTKDEVL 503

Query: 301 HHWISIQELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLL 360
           HHW SIQELLKHFWSSYPITT+YL+TKV +LKDAM+  Y +LE +KE+V +D RHQVSLL
Sbjct: 504 HHWTSIQELLKHFWSSYPITTTYLHTKVGKLKDAMSNTYSKLEAMKESVQSDLRHQVSLL 563

Query: 361 VRPMHQALDAAFQHHDADMQKRSVKSGERLNGY 388
           VRPM QALDAAF H++ D+Q+R+ KSGER NGY
Sbjct: 564 VRPMQQALDAAFHHYEVDLQRRTAKSGERPNGY 590

BLAST of CSPI04G16980 vs. TAIR 10
Match: AT3G61420.1 (BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins) )

HSP 1 Score: 449.9 bits (1156), Expect = 2.1e-126
Identity = 230/385 (59.74%), Positives = 292/385 (75.84%), Query Frame = 0

Query: 1   MFQIFALKPAVHQAFLNHVPNKISEKDFWTKYFRAEYLHSTKNSIAAAAEAAEDEELALF 60
           +FQIFA KPAV QAF+N+VP K++EKDFWTKYFRAEYL+STKN+  AAAEAAEDEELA+F
Sbjct: 199 IFQIFAEKPAVRQAFINYVPKKMTEKDFWTKYFRAEYLYSTKNTAVAAAEAAEDEELAVF 258

Query: 61  LKDDEILAAETRKKIRHVDPTLDLEADLGDDYTHLPDHGIFRDGGKEITESQNEHYRRTL 120
           LK DEILA E R+K+R VDPTLD++AD GDDYTHL DHGI RDG  +I E QN+  +R+L
Sbjct: 259 LKPDEILAQEARQKMRRVDPTLDMDADEGDDYTHLMDHGIQRDGTNDIIEPQNDQLKRSL 318

Query: 121 SQDLNRQGAVVLEGRTIDVDLEDPRTVADALVRSKHA------VEGNESQTALDRISRMT 180
            QDLNR  AVVLEGR I+V  ED R VA+AL R+K        +  + +Q  L+R+SR T
Sbjct: 319 LQDLNRHAAVVLEGRCINVQSEDTRIVAEALTRAKQVSKADGEITKDANQERLERMSRAT 378

Query: 181 AIEDLQAPHSHPFAPLCIKDPRDYFDAQQANAIKTLDDTRAGMQQTKCSLSTTEAYGSLR 240
            +EDLQAP + P APL IKDPRDYF++QQ N +      +A  +      +  EAYG L+
Sbjct: 379 EMEDLQAPQNFPLAPLSIKDPRDYFESQQGNILSEPRGAKASKR------NVHEAYGLLK 438

Query: 241 ESISEIKSSGFNHPIIKPEVALMVYNGLTQNISSTKYQLGKNPQESILESLPNPTKEELL 300
           ESI  I+ +G + P+IKPEV+  V++ LT+ IS+ K  LGKNPQES L+ LP  TK+E++
Sbjct: 439 ESILVIRMTGLSDPLIKPEVSFEVFSSLTRTISTAKNILGKNPQESFLDRLPKSTKDEVI 498

Query: 301 HHWISIQELLKHFWSSYPITTSYLYTKVSRLKDAMAKIYRQLEEIKETVVADFRHQVSLL 360
           HHW SIQEL++HFWSSYPITT+YL TKV +LKDAM+  Y  L+ +K++V +D RHQVSLL
Sbjct: 499 HHWTSIQELVRHFWSSYPITTTYLSTKVGKLKDAMSNTYSLLDAMKQSVQSDLRHQVSLL 558

Query: 361 VRPMHQALDAAFQHHDADMQKRSVK 380
           VRPM QALDAAFQH+++D+Q+R+ K
Sbjct: 559 VRPMQQALDAAFQHYESDLQRRTAK 577

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q3ECP0 | 2.5e-132 | 61.32 | General transcription and DNA repair factor IIH subunit TFB1-1 OS=Arabidopsis th...
Q9M322 | 2.9e-125 | 59.74 | General transcription and DNA repair factor IIH subunit TFB1-3 OS=Arabidopsis th...
P32780 | 7.4e-12 | 23.08 | General transcription factor IIH subunit 1 OS=Homo sapiens OX=9606 GN=GTF2H1 PE=...
Q9DBA9 | 1.4e-10 | 22.94 | General transcription factor IIH subunit 1 OS=Mus musculus OX=10090 GN=Gtf2h1 PE...
Q55FP1 | 2.0e-09 | 19.68 | General transcription factor IIH subunit 1 OS=Dictyostelium discoideum OX=44689 ...
Match Name | E-value | Identity (%) | Description
A0A0A0L1N5 | 6.2e-219 | 100.00 | BSD domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G338420 PE=4 SV...
A0A1S3C6E8 | 7.8e-214 | 97.42 | probable RNA polymerase II transcription factor B subunit 1-1 isoform X1 OS=Cucu...
A0A1S4E1L0 | 7.8e-214 | 97.42 | probable RNA polymerase II transcription factor B subunit 1-1 isoform X3 OS=Cucu...
A0A1S4E1K9 | 7.8e-214 | 97.42 | probable RNA polymerase II transcription factor B subunit 1-1 isoform X2 OS=Cucu...
A0A1S4E1L2 | 7.8e-214 | 97.42 | probable RNA polymerase II transcription factor B subunit 1-1 isoform X4 OS=Cucu...
Match Name | E-value | Identity (%) | Description
XP_011653743.1 | 1.1e-217 | 99.74 | general transcription and DNA repair factor IIH subunit TFB1-1 isoform X2 [Cucum...
KGN54745.2 | 1.1e-217 | 99.74 | hypothetical protein Csa_012761 [Cucumis sativus]
XP_031739954.1 | 8.5e-215 | 96.26 | general transcription and DNA repair factor IIH subunit TFB1-1 isoform X4 [Cucum...
XP_031739953.1 | 8.5e-215 | 96.26 | general transcription and DNA repair factor IIH subunit TFB1-1 isoform X3 [Cucum...
XP_031739948.1 | 8.5e-215 | 96.26 | general transcription and DNA repair factor IIH subunit TFB1-1 isoform X1 [Cucum...
Match Name | E-value | Identity (%) | Description
AT1G55750.1 | 1.8e-133 | 61.32 | BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS...
AT3G61420.1 | 2.1e-126 | 59.74 | BSD domain (BTF2-like transcription factors, Synapse-associated proteins and DOS...
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 296..316
None | No IPR available | PANTHER | PTHR12856:SF1 | SUBFAMILY NOT NAMED | coord: 2..294; coord: 296..357
IPR005607 | BSD domain | PFAM | PF03909 | BSD | coord: 4..43; e-value: 4.7E-8; score: 32.9
IPR005607 | BSD domain | PROSITE | PS50858 | BSD | coord: 1..40; score: 12.213329
IPR027079 | TFIIH subunit Tfb1/GTF2H1 | PANTHER | PTHR12856 | TRANSCRIPTION INITIATION FACTOR IIH-RELATED | coord: 2..294; coord: 296..357
IPR035925 | BSD domain superfamily | SUPERFAMILY | 140383 | BSD domain-like | coord: 4..41
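
The `coord:` entries above give start and end positions of each match on the protein. As a small sketch added for illustration, the code below extracts those ranges and computes their lengths. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; the function name `span_lengths` is hypothetical.

```python
import re

# Matches entries such as "coord: 4..43" in the Alignment column.
COORD_RE = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for every coord entry in the text.

    Assumption: coordinates are 1-based and inclusive, so the length
    of a range is end - start + 1.
    """
    spans = []
    for start, end in COORD_RE.findall(text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans


if __name__ == "__main__":
    # The two PANTHER segments and the Pfam BSD match from the table above.
    print(span_lengths("coord: 2..294 coord: 296..357"))
    print(span_lengths("coord: 4..43"))
```

Under that assumption the Pfam BSD match at 4..43 covers 40 residues, and the two PANTHER segments cover 293 and 62 residues.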

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI04G16980.1 | CSPI04G16980.1 | mRNA
CSPI04G16980.2 | CSPI04G16980.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006289 | nucleotide-excision repair
biological_process | GO:0006351 | transcription, DNA-templated
cellular_component | GO:0000439 | transcription factor TFIIH core complex