Cucsa.178810.4 (mRNA) Cucumber (Gy14) v1

Name: Cucsa.178810.4
Type: mRNA
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: General transcription factor IIH subunit 3
Location: scaffold01227 : 1164629 .. 1169022 (+)
Sequence length: 1382
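The location string and the sequence length describe different things: the coordinates give the genomic span on the scaffold, whereas 1382 is presumably the length of the spliced transcript. A minimal sketch of how the genomic span can be derived from the location string, assuming 1-based inclusive coordinates (the usual GFF3 convention, not stated on this page); `parse_location` is a hypothetical helper written for this illustration.

```python
import re

def parse_location(location: str):
    """Split a 'scaffold : start .. end (strand)' string into its parts.

    Assumes 1-based, inclusive coordinates (an assumption, not confirmed
    by the record itself).
    """
    pattern = r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    scaffold, start, end, strand = match.groups()
    return scaffold, int(start), int(end), strand

scaffold, start, end, strand = parse_location("scaffold01227 : 1164629 .. 1169022 (+)")
genomic_span = end - start + 1  # inclusive coordinates, so add 1
print(scaffold, strand, genomic_span)
```

Under that assumption the gene region spans 4394 bp, so roughly 3 kb of the locus would be intronic if 1382 is the transcript length.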
The following sequences are available for this feature:

Gene sequence (with intron)

GCCCATTCTGAAGCAGCCATGGCTTCAGCTCCTTCGAAGCTTTACGCAGGTTCTCCTCCAATCCCTTTCTCCTTCCTTATGGGCACTCGGGTACAACTATCGAAATTGCAGTGGCTTATTCGTTTTTTtGATCTTTGATTGTGTTGTTGCAGATGATGTTAGCCTTTTAGTGGTTTTACTGGATACGAATCCATTTTTCTGGAGCACATCTGCTCTTCCGTTCTCCAAGTTTCTGTCTCATGTAATTGATAAATTATCTCCTTTCCTGTGAACCATTCTACCCAGTAGGTTTTTCTGATAGAACTGTAATGAACATTAGCATTTCCTTTTTCTTTCTGTTTTGGGTGTATAGGTACTTGCTTTTCTGAACTCCATTTTAGTTCTGAACCAACTTAATGAGGTTGTGGTTATTGGTACTGGATATGCTTCATGCAAGTATTTATACAACTCGTCTTCTTACTCAAATCATGGCCTTGAAGATGGTAGAATGCCTGCACTTTGTACTCGTTTATTGAAGAATTTGGAGGAGTTCGTGATTGGGGATGAGCAGTCCATCAAGGAAGATCCCAAAGGAGGGACCATGTCTTCACTTCTTTCTGGATCGCTCTCCATGGCCTTGTGTTGTATCCTTCCTTGTTGATGATTATTGGTGCTGATTATTTGCTTGTTTATGCTCATAAGTAAAGCTTGACATGTTCTACAAATGGGCCATTAATGTAGCAGTCAGGACTAAACAGTATCTATTGTGATGGGAGTGGCATTTGGTCGAAATTTGTGGTAATGAGTATGGTTGCAAAATGTTGTAAGAGTTATGGTTGGCACGATGGAATTTTCTAATAGTCACACTGTTTAAATGTAACTTGAGTAATGTAATATTCATCAACATACTGTTATGCTTCAAATGCTTGACCCTGTGATTAGATATACAGAAAGTTTTCCGCTCTGGATCTCTCCATCCCCAACCACGAGTAAGCATTTGTTTCTGTTCTGGATAATTTATTCCCAATTTGCATATCCTGATTTTTCTTGTTGCTTACATATACTATTTGTTTTTGTCCTTTTTTGAAAATGTGACTAATCTATGTGAATATATTTTACTTTGAGTGTAGATCCTTTGCTTGCAGGGATCCCCAGATGGTCCTGAACAGTAAGAATATTTTCTTTATTCTTTAAAAATATGTATATATTTCTAATGAAGTTATTTTATTTGTGCAATTTTCTTTAATAATGGGAATCAAACTTTAAAATCTTCCGTCTAGTGACTGTTTTTCTCTTCCAGATATGTTGCAATCATGAATGCCATCTTTTCTGCTCAGCGTTCAATGGTTTGTATTATTTTGTTCACTCTTCTGTGTGTTCTTATATGACTTTACTAACTTCTGTCAATCACCAACTCTCATTATTTGAACTGGGTGATATCAGTAAGTAATGAAAATGTGACCACAAGTATTTCATGGAAATGCTCGAGACAAGTTTCCTCTCTATTTAACTTGTTCTTTGTTTGGTCTTGATGGCAGAAAATGGTGCTGGATTAAACTTTTTTGTTCGTTTCTTTTCCTATTTGAATTTGAAGATATCAATTTTTTCTCCCCCTATTTCGAGATATCCCACTTGTCCCATAGGCGAATTTGAATTTGAATGCATTAACAAGACTAAGTATTTTATATTGAACTACTTTCTTGGAGTTTACAACTGAATATCTAGTCTCTACCGTTAGAGTTGAACTCCAATATTTGGTATAGATTTTATTATTTAATTTGCTTGCCCTGTGGATGGAAAGCTGATGGATCTCTGGCATCTTTTCTACTATTAGTCTAGTCAGATTCAGTTGTGAAAGTTTTCATGTTTATATGTATGCATGTAAGTGTATGTATAAAGAATAAGAAGAAGAACAAATCTAGAGAGCAAAGATTCTTCGAAAAGGACAAGGGGAGAATTGGGAGTTTCCCTGAAGGAAGATTCTGGGATAGAAGTTTATAAGAAAAACTTCAGTCCAAGTT
ATCGATAGAGTAGAGAAGTAGCAAAACCCTTATGTAAGGTACTCCAAGTAAAGGCTGTAACTGTACAAGCTCCAAAATTTAGGAGAACAGTCCTGAAAAAACCAATAAAGAGTTGTATTTTCAAATTGGAAAAATTATAAATAGAGATGAACTTTTAATTTACTCTCTCCAACAACACATCAAATCCAAAGGTGAAAACTACTTAAGATTTTTGAATTGCTAGTTTTCATTTTGGTTATGACAGCTTAACGTAACAATATTGACCAAAGTATGATTAATTTCTTGATATCAAGTAATTGCATCTGCAATGTTCATGATAATAGAAATCATTTGGTCACAAGCAGAAAGTGGTAGTCAATGAGATCGTAAAAATGTGAATCTAATCTTCATTCACATTTGCATTTATGGGTTATGTTTAAACTTTCATGTGTGCTAGTCACTCATTTGAAGTTCACAGCTGTCTCCGTTTTCCTCTCCATGTTGCTTTTAACTATAAACTACATTCAACATTGAACAATGTTTTCATTTGTATAAGTATCCTATGATGTACAATTTTTCCTATGATGTACAATTTTATCATTCTTGCTTCAGGTTCCTATAGATTCATGTTACATTGGTTCCCACAATTCTGCATTTCTTCAGCAGGTAAGCTAACTGCTCATTGTTCCTTAATGGGTTTTTtACTTTtCTTGGAATCTTACAGTTACCATCTCCGGATGTATCATCTATTCATTGCTGGTGATGATTTGATTTCTGATGCTGTATAGGCTTCTTACATAACGGGTGGAGTTTATCTGAAGCCTCAGCAAATGGATGGGCTGTTTCAGTATCTCTCTGTAAGTTTTACTTTCCTATGGGTTCCCcTGTTTTTtGTAATCTCTTGCTTAGGCTGTTGTGATATTCTATTGTTCTTTCTATGCAGACTGTTTTTGGCACCGATTTGCATTCCCGGACCTTTTTACAGCTTCCAAAATCTGTTGGTGTGGATTTTCGTGCATCGTAAGTCTGCAGTCTTGTGAATACAAAATTCTGTTGTTTTGTGCTTATTATTTTGTGTCACTTACTACGGCCTTCTACAGATTTACACCTCTCATCATCATTTTGGGAATTTGCAGGTGTTTTTGCCACAAGAAAACGATCGATATGGGCTATGTCTGTTCTGTTTGTTTATCTATATTCTGCAAGCATCACAAGAAATGTTCTACCTGTGGGTAAGCGAAAGGTCAAGCCATAATTATTATGTTTTTCCCTTTTACTTGTGGTTTCCTAAAATTTATTGAAGGTAGTTTGACATTGACTTTCGAGAATGCTGTTGTCTGTTTGGCTTTATAAGTATGAAAaGGAAGATGAAGAGGAACCTAATTTTTGAGAACTCATTATGCTAGGGCTTTTTCCTCGGCAGTTTATATAATATTTCGGTGAGACATGATGTCAACGCGAGGGGGAGCTTTGTACCTTAGTATTGCCCCTAGATAATGGTTGCCTAAATAAGATGCTAAAAAATTCTGCCACGGCTTAAGTTTCTTAGGGGTGTTTGGAAAACTGAAATGTAATTAGAGTCACTTTGAATCAGAGTGGTGTTAAAATGGGTGTTGAATTGAGGGCAAAACAAAAATGGGGTCGTAAAGCGCCAAGACCCTTCTGAAATGCTAAAAAAACTCAACAATGTTGTTTGCCTACTATTACTTCTGTCTACTTCAGAACAAGATTGAATGTATTAATATTAATGAGTTTCAACTTCATTCAGGTCAGTTTTTGGTGAGACACCAGTAGAACTCGATTCAGTGTCTAAACTGAAGAGAAAAACTCCAGAATGATTGCGCTCTTCTGTAAGTTGTGATGGATCATTTCATGAAACTCAGTGTTATTTTATGATTTTGAACCATTTGTTTTCACTTCCATGGCTATAGAACTGCTTCATTTTTCTTCCCTTTCTTGAAGCTGTTTGGAACTGGAATTCATTTGGTTCACATGGCATGAACAGAGATGACTCAAAGGAA
ACAGTTGCTGAAAGAGAGTCAATCTCAACATTGTACAATTCAGATCAATAGTTGTCGAATTCGGATGGGATAACATGGAACGATGGCTAGCAAGGTCATGCATCTGTATTTCGTCTATTCCAGTCTGATACAAATCGGGTACAAGGTTTATATGGAATTCGAGACTTTAACTAGAAAAGTTAGTCTAATGTAATTGTACCGAAAGCTTGAGTGGTTTATAAGTACATGTAGATTTATATTTCTCTCACGCTTGAAAGTACTCCAAAGCTTACCTTTGTAAAGGTAGACAGAAGGATAGGGATATTCACATTGTTGATTTCAAGACCTCATACTTGGAAATTTTACGAGTTTGAAATGTGCCAAGTGAATCATATAATACTACAATTTGAGCATA

mRNA sequence

GCCCATTCTGAAGCAGCCATGGCTTCAGCTCCTTCGAAGCTTTACGCAGATGATGTTAGCCTTTTAGTGGTTTTACTGGATACGAATCCATTTTTCTGGAGCACATCTGCTCTTCCGTTCTCCAAGTTTCTGTCTCATGTACTTGCTTTTCTGAACTCCATTTTAGTTCTGAACCAACTTAATGAGGTTGTGGTTATTGGTACTGGATATGCTTCATGCAAGTATTTATACAACTCGTCTTCTTACTCAAATCATGGCCTTGAAGATGGTAGAATGCCTGCACTTTGTACTCGTTTATTGAAGAATTTGGAGGAGTTCGTGATTGGGGATGAGCAGTCCATCAAGGAAGATCCCAAAGGAGGGACCATGTCTTCACTTCTTTCTGGATCGCTCTCCATGGCCTTGTGTTATATACAGAAAGTTTTCCGCTCTGGATCTCTCCATCCCCAACCACGAATCCTTTGCTTGCAGGGATCCCCAGATGGTCCTGAACAATATGTTGCAATCATGAATGCCATCTTTTCTGCTCAGCGTTCAATGGTTCCTATAGATTCATGTTACATTGGTTCCCACAATTCTGCATTTCTTCAGCAGGCTTCTTACATAACGGGTGGAGTTTATCTGAAGCCTCAGCAAATGGATGGGCTGTTTCAGTATCTCTCTACTGTTTTTGGCACCGATTTGCATTCCCGGACCTTTTTACAGCTTCCAAAATCTGTTGGTGTGGATTTTCGTGCATCGTGTTTTTGCCACAAGAAAACGATCGATATGGGCTATGTCTGTTCTGTTTGTTTATCTATATTCTGCAAGCATCACAAGAAATGTTCTACCTGTGGGTAAGCGAAAGGTCAGTTTTTGGTGAGACACCAGTAGAACTCGATTCAGTGTCTAAACTGAAGAGAAAAACTCCAGAATGATTGCGCTCTTCTCTGTTTGGAACTGGAATTCATTTGGTTCACATGGCATGAACAGAGATGACTCAAAGGAAACAGTTGCTGAAAGAGAGTCAATCTCAACATTGTACAATTCAGATCAATAGTTGTCGAATTCGGATGGGATAACATGGAACGATGGCTAGCAAGGTCATGCATCTGTATTTCGTCTATTCCAGTCTGATACAAATCGGGTACAAGGTTTATATGGAATTCGAGACTTTAACTAGAAAAGTTAGTCTAATGTAATTGTACCGAAAGCTTGAGTGGTTTATAAGTACATGTAGATTTATATTTCTCTCACGCTTGAAAGTACTCCAAAGCTTACCTTTGTAAAGGTAGACAGAAGGATAGGGATATTCACATTGTTGATTTCAAGACCTCATACTTGGAAATTTTACGAGTTTGAAATGTGCCAAGTGAATCATATAATACTACAATTTGAGCATA

Coding sequence (CDS)

ATGGCTTCAGCTCCTTCGAAGCTTTACGCAGATGATGTTAGCCTTTTAGTGGTTTTACTGGATACGAATCCATTTTTCTGGAGCACATCTGCTCTTCCGTTCTCCAAGTTTCTGTCTCATGTACTTGCTTTTCTGAACTCCATTTTAGTTCTGAACCAACTTAATGAGGTTGTGGTTATTGGTACTGGATATGCTTCATGCAAGTATTTATACAACTCGTCTTCTTACTCAAATCATGGCCTTGAAGATGGTAGAATGCCTGCACTTTGTACTCGTTTATTGAAGAATTTGGAGGAGTTCGTGATTGGGGATGAGCAGTCCATCAAGGAAGATCCCAAAGGAGGGACCATGTCTTCACTTCTTTCTGGATCGCTCTCCATGGCCTTGTGTTATATACAGAAAGTTTTCCGCTCTGGATCTCTCCATCCCCAACCACGAATCCTTTGCTTGCAGGGATCCCCAGATGGTCCTGAACAATATGTTGCAATCATGAATGCCATCTTTTCTGCTCAGCGTTCAATGGTTCCTATAGATTCATGTTACATTGGTTCCCACAATTCTGCATTTCTTCAGCAGGCTTCTTACATAACGGGTGGAGTTTATCTGAAGCCTCAGCAAATGGATGGGCTGTTTCAGTATCTCTCTACTGTTTTTGGCACCGATTTGCATTCCCGGACCTTTTTACAGCTTCCAAAATCTGTTGGTGTGGATTTTCGTGCATCGTGTTTTTGCCACAAGAAAACGATCGATATGGGCTATGTCTGTTCTGTTTGTTTATCTATATTCTGCAAGCATCACAAGAAATGTTCTACCTGTGGGTAA

Protein sequence

MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVIGTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSLLSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG*
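
The protein sequence above follows from the CDS by codon-by-codon translation. A minimal sketch, assuming the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper, not something provided by the database, and only the first 39 and last 9 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGGCTTCAGCTCCTTCGAAGCTTTACGCAGATGATGTT"  # first 13 codons of the CDS
cds_end = "TGTGGGTAA"                                  # last 3 codons of the CDS

print(translate(cds_start))  # MASAPSKLYADDV
print(translate(cds_end))    # CG*
```

The two fragments reproduce the first 13 residues and the final `CG*` of the listed protein.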
BLAST of Cucsa.178810.4 vs. Swiss-Prot
Match: TFB4_ARATH (RNA polymerase II transcription factor B subunit 4 OS=Arabidopsis thaliana GN=TFB4 PE=2 SV=1)

HSP 1 Score: 417.2 bits (1071), Expect = 1.4e-115
Identity = 205/277 (74.01%), Positives = 235/277 (84.84%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           M +  SK Y+DDVSLLV+LLDTNP FWST+++ FS+FLSHVLAFLN++L LNQLN+VVVI
Sbjct: 1   MPAIASKQYSDDVSLLVLLLDTNPLFWSTTSITFSQFLSHVLAFLNAVLGLNQLNQVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNHGLEDGR---MPALCTRLLKNLEEFVIGDEQSIKEDPKGGTM 120
            TGY+SC Y+Y+SS  SNHG  +     MPA+   LLK LEEFV  DE+  KE+     +
Sbjct: 61  ATGYSSCDYIYDSSLTSNHGNFESNGTGMPAIFGSLLKKLEEFVTKDEELSKEEVSEDRI 120

Query: 121 SS-LLSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 180
            S LLSGSLSMALCYIQ+VFRSG LHPQPRILCLQGSPDGPEQYVA+MN+IFSAQR MVP
Sbjct: 121 PSCLLSGSLSMALCYIQRVFRSGHLHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRLMVP 180

Query: 181 IDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGV 240
           IDSCYIG  NSAFLQQASYITGGV+  P+Q+DGLFQYL+T+F TDLHSR F+QLPK +GV
Sbjct: 181 IDSCYIGVQNSAFLQQASYITGGVHHTPKQLDGLFQYLTTIFATDLHSRGFVQLPKPIGV 240

Query: 241 DFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           DFRASCFCHKKTIDMGY+CSVCLSIFC+HHKKCSTCG
Sbjct: 241 DFRASCFCHKKTIDMGYICSVCLSIFCEHHKKCSTCG 277
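
The identity and positives percentages in each HSP header are the match counts divided by the alignment length (gaps included). A minimal sketch of that arithmetic, using the figures from the TFB4_ARATH hit above and the final alignment segment; `percent` and `count_identities` are illustrative helper names, not part of BLAST.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100 * matches / aligned_length, 2)

def count_identities(query: str, subject: str) -> int:
    """Count columns where both aligned rows carry the same residue (gaps never match)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(q == s and q != "-" for q, s in zip(query, subject))

# Whole-HSP figures as reported in the header line.
print(percent(205, 277))  # identity
print(percent(235, 277))  # positives

# Last segment of the same alignment (query 241 onward).
query_row = "DFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG"
sbjct_row = "DFRASCFCHKKTIDMGYICSVCLSIFCEHHKKCSTCG"
print(count_identities(query_row, sbjct_row), "of", len(query_row))
```

Positives additionally count conservative substitutions (the `+` marks in the middle row), which requires the scoring matrix and is not reproduced here.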

BLAST of Cucsa.178810.4 vs. Swiss-Prot
Match: TF2H3_DICDI (General transcription factor IIH subunit 3 OS=Dictyostelium discoideum GN=gtf2h3 PE=3 SV=1)

HSP 1 Score: 196.4 bits (498), Expect = 4.0e-49
Identity = 105/252 (41.67%), Positives = 149/252 (59.13%), Query Frame = 1

Query: 34  FSKFLSHVLAFLNSILVLNQLNEVVVIGTGYASCKYLYNSSSYSNHGLEDGRMPA----- 93
           F+KFL H + F+N+ L+LNQ N++ +I +      +++  S+   +  E   +       
Sbjct: 92  FNKFLEHFMVFINAYLMLNQENQLAIICSKIGESSFVFPQSNIDQYQQEQQELEQRQLNE 151

Query: 94  ---LCTRLLKNLEEFVIGDEQS----IKEDPKGGTMSSLLSGSLSMALCYIQKVFRSGSL 153
              L     K ++  ++   Q     IK D +   +SS  S S+S+ALCYI ++ R    
Sbjct: 152 NGELLPTPNKTIQGQILAKLQKLDLEIKHD-QTDILSSSFSASMSIALCYINRIKRETPT 211

Query: 154 HPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVY 213
             +PRIL    SPD   QY+++MN IFS+Q+  +P+DSC +   +S FLQQAS++T G+Y
Sbjct: 212 I-KPRILVFNISPDVSSQYISVMNCIFSSQKQSIPVDSCILSQSDSTFLQQASHLTSGIY 271

Query: 214 LKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSI 273
           LKPQ+ + L QYL T F  D  SR  L  P    VD+RASCFCHK+ +D+GYVCSVCLSI
Sbjct: 272 LKPQKQELLSQYLLTTFLLDTLSRKSLAYPTLKSVDYRASCFCHKRIVDIGYVCSVCLSI 331

BLAST of Cucsa.178810.4 vs. Swiss-Prot
Match: TF2H3_HUMAN (General transcription factor IIH subunit 3 OS=Homo sapiens GN=GTF2H3 PE=1 SV=2)

HSP 1 Score: 183.0 bits (463), Expect = 4.5e-45
Identity = 109/286 (38.11%), Positives = 162/286 (56.64%), Query Frame = 1

Query: 11  DDVSLLVVLLDTNPFFWSTSALPFSKF-----LSHVLAFLNSILVLNQLNEVVVIGTGYA 70
           D+++LLV+++D NP +W   AL  S+F     +  V+   NS L +N+ N++ VI +   
Sbjct: 6   DELNLLVIVVDANPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHIQ 65

Query: 71  SCKYLY---------------NSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSI-- 130
             ++LY               N   ++  G +DG+       LL +  E ++ + + +  
Sbjct: 66  ESRFLYPGKNGRLGDFFGDPGNPPEFNPSGSKDGKY-----ELLTSANEVIVEEIKDLMT 125

Query: 131 KEDPKGGTMSSLLSGSLSMALCYIQKVFRS--GSLHPQPRILCLQGSPDGPEQYVAIMNA 190
           K D KG    +LL+GSL+ ALCYI ++ +    +   + RIL ++ + D   QY+  MN 
Sbjct: 126 KSDIKGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNV 185

Query: 191 IFSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRT 250
           IF+AQ+  + ID+C + S +S  LQQA  ITGG+YLK  QM  L QYL  VF  D   R+
Sbjct: 186 IFAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRS 245

Query: 251 FLQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTC 273
            L LP  V VD+RA+CFCH+  I++GYVCSVCLSIFC     C+TC
Sbjct: 246 QLILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTC 285

BLAST of Cucsa.178810.4 vs. Swiss-Prot
Match: TF2H3_BOVIN (General transcription factor IIH subunit 3 OS=Bos taurus GN=GTF2H3 PE=2 SV=1)

HSP 1 Score: 181.4 bits (459), Expect = 1.3e-44
Identity = 110/285 (38.60%), Positives = 159/285 (55.79%), Query Frame = 1

Query: 11  DDVSLLVVLLDTNPFFWSTSALPFSKF-----LSHVLAFLNSILVLNQLNEVVVIGTGYA 70
           D+++LLV+++DTNP +W   AL  S+F     +  V+   NS L +N+ N++ VI +   
Sbjct: 6   DELNLLVIIVDTNPIWWGKQALKESQFTLSKCIDAVMVLGNSHLFMNRSNKLAVIASHIQ 65

Query: 71  SCKYLYN----------------SSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIK 130
             ++LY                 SS ++  G +DG+   L        EE     +   K
Sbjct: 66  ESRFLYPGKNGRLGDFFGDPGNPSSEFTPSGSKDGKYELLTAANEVIAEEI---KDLMTK 125

Query: 131 EDPKGGTMSSLLSGSLSMALCYIQKVFRS--GSLHPQPRILCLQGSPDGPEQYVAIMNAI 190
            D +G    +LL+GSL+ ALCYI ++ +    +   + RIL ++ + D   QY+  MN I
Sbjct: 126 SDIEGQHTETLLAGSLAKALCYIHRMNKEVKDNQEMKSRILVIKAAEDSALQYMNFMNVI 185

Query: 191 FSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTF 250
           F+AQ+  + ID+C + S +S  LQQA  ITGG+YLK  QM  L QYL  VF  D   R+ 
Sbjct: 186 FAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQ 245

Query: 251 LQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTC 273
           L LP  V VD+RA+CFCH+  I++GYVCSVCLSIFC     C+TC
Sbjct: 246 LILPPPVHVDYRAACFCHRNLIEIGYVCSVCLSIFCNFSPICTTC 286

BLAST of Cucsa.178810.4 vs. Swiss-Prot
Match: TF2H3_MOUSE (General transcription factor IIH subunit 3 OS=Mus musculus GN=Gtf2h3 PE=1 SV=1)

HSP 1 Score: 179.9 bits (455), Expect = 3.8e-44
Identity = 112/285 (39.30%), Positives = 159/285 (55.79%), Query Frame = 1

Query: 11  DDVSLLVVLLDTNPFFWSTSALPFSKF-----LSHVLAFLNSILVLNQLNEVVVIGTGYA 70
           D+++LLV+++DTNP +W   AL  S+F     +  V+   NS L +N+ N++ VI +   
Sbjct: 6   DELNLLVIIVDTNPIWWGKQALKESQFTLSKCMDAVMVLANSHLFMNRSNQLAVIASHIQ 65

Query: 71  SCKYLYNSSSYSNHGLED-----GRMPALCT---------RLLKNLEEFVIGDEQSI--K 130
             + LY      N GL D     G     C           LL    E +  + + +  K
Sbjct: 66  ESRLLYPGK---NGGLGDFFGDPGNALPDCNPSGSKDGKYELLTVANEVIAEEIKDLMTK 125

Query: 131 EDPKGGTMSSLLSGSLSMALCYIQKVFRS--GSLHPQPRILCLQGSPDGPEQYVAIMNAI 190
            D KG    +LL+GSL+ ALCYI +V ++   +   + RIL ++ + D   QY+  MN I
Sbjct: 126 SDIKGQHTETLLAGSLAKALCYIHRVNKAVKDNQEMKSRILVIKAAEDSALQYMNFMNVI 185

Query: 191 FSAQRSMVPIDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTF 250
           F+AQ+  + ID+C + S +S  LQQA  ITGG+YLK  QM  L QYL  VF  D   R+ 
Sbjct: 186 FAAQKQNILIDACVLDS-DSGLLQQACDITGGLYLKVPQMPSLLQYLLWVFLPDQDQRSQ 245

Query: 251 LQLPKSVGVDFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTC 273
           L LP  + VD+RA+CFCH+  I++GYVCSVCLSIFC     C+TC
Sbjct: 246 LILPPPIHVDYRAACFCHRSLIEIGYVCSVCLSIFCNFSPICTTC 286

BLAST of Cucsa.178810.4 vs. TrEMBL
Match: A0A0A0LMM2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G072450 PE=4 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 1.7e-155
Identity = 273/273 (100.00%), Positives = 273/273 (100.00%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL
Sbjct: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120

Query: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC
Sbjct: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180

Query: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA 240
           YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA
Sbjct: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA 240

Query: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG
Sbjct: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 273

BLAST of Cucsa.178810.4 vs. TrEMBL
Match: M5XYU6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009386mg PE=4 SV=1)

HSP 1 Score: 464.2 bits (1193), Expect = 1.1e-127
Identity = 229/273 (83.88%), Positives = 247/273 (90.48%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLL+VLLDTNPFFWS+S+LPFS FLSHVL FLNSIL+LNQLN+VVVI
Sbjct: 1   MASAPSKLYADDVSLLMVLLDTNPFFWSSSSLPFSVFLSHVLTFLNSILLLNQLNQVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120
            TGY SC Y+Y+SS+ +N G ++GRMPA C  LL+ LEEFVI DEQ IKE  + G  SSL
Sbjct: 61  ATGYNSCSYIYDSSTSTNQGSDNGRMPARCVNLLQKLEEFVIEDEQLIKEGLREGIASSL 120

Query: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           LSGSLSMALCYIQ+VFRSG LHPQPRILCLQGS DGPEQYVAIMNAIFSAQRS VPIDSC
Sbjct: 121 LSGSLSMALCYIQRVFRSGPLHPQPRILCLQGSSDGPEQYVAIMNAIFSAQRS-VPIDSC 180

Query: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA 240
           Y+GS NSAFLQQASYITGGVYLKPQQ +GLFQYLSTVF TDLHSR FLQLPKS+GVDFRA
Sbjct: 181 YMGSSNSAFLQQASYITGGVYLKPQQPNGLFQYLSTVFATDLHSRAFLQLPKSLGVDFRA 240

Query: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           SCFCHKKTIDMGY+CSVCLSIFCKHHKKCSTCG
Sbjct: 241 SCFCHKKTIDMGYICSVCLSIFCKHHKKCSTCG 272

BLAST of Cucsa.178810.4 vs. TrEMBL
Match: A0A061FGT5_THECC (Basal transcription factor complex subunit-related isoform 1 OS=Theobroma cacao GN=TCM_035096 PE=4 SV=1)

HSP 1 Score: 455.7 bits (1171), Expect = 4.0e-125
Identity = 219/273 (80.22%), Positives = 243/273 (89.01%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSL+VVL+DTNPFFWS S+L FS+FLSHVLAFLN+IL LNQLN+VVVI
Sbjct: 1   MASAPSKLYADDVSLVVVLVDTNPFFWSASSLSFSQFLSHVLAFLNAILTLNQLNQVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120
            TGY SC Y+++SSS  N   E+GRMP +C+ LL+ LEEF+I DEQ  KE P+G   SSL
Sbjct: 61  ATGYNSCNYIFDSSSDLNQSFENGRMPVMCSSLLQKLEEFLIKDEQLSKEVPEGRIKSSL 120

Query: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           LSGSLSMALCYIQ+VFRSG+LHP PRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC
Sbjct: 121 LSGSLSMALCYIQRVFRSGALHPHPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180

Query: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA 240
           Y+G+ NSAFLQQASYITGGV+ KPQ +DGLFQYL T+F TDLHSR+FL LPK VGVDFRA
Sbjct: 181 YMGAQNSAFLQQASYITGGVHHKPQHLDGLFQYLMTIFATDLHSRSFLHLPKPVGVDFRA 240

Query: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           SCFCHK TIDMGY+CSVCLSIFCKHHKKCSTCG
Sbjct: 241 SCFCHKNTIDMGYICSVCLSIFCKHHKKCSTCG 273

BLAST of Cucsa.178810.4 vs. TrEMBL
Match: E0CR79_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g03100 PE=4 SV=1)

HSP 1 Score: 455.7 bits (1171), Expect = 4.0e-125
Identity = 223/274 (81.39%), Positives = 245/274 (89.42%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MA  PSKLY+DDVSLLVVLLDTNPFFWST++LPFSKFLSHVLAFLNSIL++NQLN+VVVI
Sbjct: 1   MAPVPSKLYSDDVSLLVVLLDTNPFFWSTASLPFSKFLSHVLAFLNSILLINQLNQVVVI 60

Query: 61  GTGYASCKYLYNSSSY-SNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSS 120
            TG  SC ++++SSS  +N  LE+GRMPALC+ LL+ LEEFV GDE+  KE    G  SS
Sbjct: 61  ATGCNSCNFIFDSSSVPANPNLENGRMPALCSNLLQKLEEFVTGDEKLSKEVLAAGIGSS 120

Query: 121 LLSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 180
           LLSGSLSMALCYIQ+VFR+G LHPQPRILCLQGSPDGPEQYVA+MNAIFSAQRSMVPIDS
Sbjct: 121 LLSGSLSMALCYIQRVFRTGPLHPQPRILCLQGSPDGPEQYVAVMNAIFSAQRSMVPIDS 180

Query: 181 CYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFR 240
           C IG+ +SAFLQQASYITGGVYLKPQQ+DGLFQYLSTVF TDLHSR FLQLPK  GVDFR
Sbjct: 181 CVIGAQHSAFLQQASYITGGVYLKPQQLDGLFQYLSTVFATDLHSRRFLQLPKPAGVDFR 240

Query: 241 ASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           ASCFCHK TIDMGY+CSVCLSIFCKHHKKCSTCG
Sbjct: 241 ASCFCHKNTIDMGYICSVCLSIFCKHHKKCSTCG 274

BLAST of Cucsa.178810.4 vs. TrEMBL
Match: A0A067GM61_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g022374mg PE=4 SV=1)

HSP 1 Score: 454.1 bits (1167), Expect = 1.2e-124
Identity = 219/273 (80.22%), Positives = 245/273 (89.74%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLY+DDVSL+VVLLDTNPFFWS+S+L FS+FL+HVLAFLN+IL LNQLN+VVVI
Sbjct: 1   MASAPSKLYSDDVSLVVVLLDTNPFFWSSSSLSFSQFLTHVLAFLNAILTLNQLNQVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120
            TGY SC Y+Y+SSS  N  + +GRMP+LC  LL+NLEEF+  DEQ  K++P+G    SL
Sbjct: 61  ATGYNSCDYVYDSSSTGNQSVGNGRMPSLCATLLQNLEEFMNKDEQLGKQEPEGRIACSL 120

Query: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           LSGSLSMALCYIQ+VFRSG LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC
Sbjct: 121 LSGSLSMALCYIQRVFRSGLLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180

Query: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA 240
           Y+G+ NSAFLQQASYITGGV+ KPQQ+DGLFQYL T+FGTDLHSR FLQLPK VGVDFRA
Sbjct: 181 YLGAQNSAFLQQASYITGGVHHKPQQLDGLFQYLLTIFGTDLHSRNFLQLPKPVGVDFRA 240

Query: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           SCFCHK TIDMGY+CSVCLSI+CKH KKCSTCG
Sbjct: 241 SCFCHKNTIDMGYICSVCLSIYCKHLKKCSTCG 273

BLAST of Cucsa.178810.4 vs. TAIR10
Match: AT1G18340.1 (AT1G18340.1 basal transcription factor complex subunit-related)

HSP 1 Score: 417.2 bits (1071), Expect = 8.1e-117
Identity = 205/277 (74.01%), Positives = 235/277 (84.84%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           M +  SK Y+DDVSLLV+LLDTNP FWST+++ FS+FLSHVLAFLN++L LNQLN+VVVI
Sbjct: 1   MPAIASKQYSDDVSLLVLLLDTNPLFWSTTSITFSQFLSHVLAFLNAVLGLNQLNQVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNHGLEDGR---MPALCTRLLKNLEEFVIGDEQSIKEDPKGGTM 120
            TGY+SC Y+Y+SS  SNHG  +     MPA+   LLK LEEFV  DE+  KE+     +
Sbjct: 61  ATGYSSCDYIYDSSLTSNHGNFESNGTGMPAIFGSLLKKLEEFVTKDEELSKEEVSEDRI 120

Query: 121 SS-LLSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVP 180
            S LLSGSLSMALCYIQ+VFRSG LHPQPRILCLQGSPDGPEQYVA+MN+IFSAQR MVP
Sbjct: 121 PSCLLSGSLSMALCYIQRVFRSGHLHPQPRILCLQGSPDGPEQYVAVMNSIFSAQRLMVP 180

Query: 181 IDSCYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGV 240
           IDSCYIG  NSAFLQQASYITGGV+  P+Q+DGLFQYL+T+F TDLHSR F+QLPK +GV
Sbjct: 181 IDSCYIGVQNSAFLQQASYITGGVHHTPKQLDGLFQYLTTIFATDLHSRGFVQLPKPIGV 240

Query: 241 DFRASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           DFRASCFCHKKTIDMGY+CSVCLSIFC+HHKKCSTCG
Sbjct: 241 DFRASCFCHKKTIDMGYICSVCLSIFCEHHKKCSTCG 277

BLAST of Cucsa.178810.4 vs. NCBI nr
Match: gi|449470273|ref|XP_004152842.1| (PREDICTED: general transcription factor IIH subunit 3 [Cucumis sativus])

HSP 1 Score: 556.6 bits (1433), Expect = 2.4e-155
Identity = 273/273 (100.00%), Positives = 273/273 (100.00%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL
Sbjct: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120

Query: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC
Sbjct: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180

Query: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA 240
           YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA
Sbjct: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA 240

Query: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG
Sbjct: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 273

BLAST of Cucsa.178810.4 vs. NCBI nr
Match: gi|659082582|ref|XP_008441918.1| (PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH subunit 3 [Cucumis melo])

HSP 1 Score: 548.9 bits (1413), Expect = 5.0e-153
Identity = 269/273 (98.53%), Positives = 271/273 (99.27%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLLVVLLDTNPFFWSTS+LPFSKFLSHVLAFLNSILVLNQLNEVVVI
Sbjct: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSSLPFSKFLSHVLAFLNSILVLNQLNEVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120
           GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL
Sbjct: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120

Query: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMV IDSC
Sbjct: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVSIDSC 180

Query: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA 240
           YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYL+TVFGTDLHSRTFLQLPKSVGVDFRA
Sbjct: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLATVFGTDLHSRTFLQLPKSVGVDFRA 240

Query: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           SCFCHK TIDMGYVCSVCLSIFCKHHKKCSTCG
Sbjct: 241 SCFCHKXTIDMGYVCSVCLSIFCKHHKKCSTCG 273

BLAST of Cucsa.178810.4 vs. NCBI nr
Match: gi|645225617|ref|XP_008219657.1| (PREDICTED: general transcription factor IIH subunit 3 [Prunus mume])

HSP 1 Score: 470.7 bits (1210), Expect = 1.7e-129
Identity = 230/273 (84.25%), Positives = 248/273 (90.84%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLL+VLLDTNPFFWS+S+LPFS FLSHVL FLNSIL+LNQLN+VVVI
Sbjct: 1   MASAPSKLYADDVSLLMVLLDTNPFFWSSSSLPFSVFLSHVLTFLNSILLLNQLNQVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120
            TGY SC Y+Y+SS+ +N G ++GRMPA C  LL+ LEEFVI DEQ IKE  + G  SSL
Sbjct: 61  ATGYNSCSYIYDSSTSTNQGSDNGRMPARCVNLLQKLEEFVIKDEQLIKEGLREGIASSL 120

Query: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           LSGSLSMALCYIQ+VFRSG LHPQPRILCLQGS DGPEQYVAIMNAIFSAQRSMVPIDSC
Sbjct: 121 LSGSLSMALCYIQRVFRSGPLHPQPRILCLQGSSDGPEQYVAIMNAIFSAQRSMVPIDSC 180

Query: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA 240
           Y+GS NSAFLQQASYITGGVYLKPQQ +GLFQYLSTVF TDLHSR FLQLPKS+GVDFRA
Sbjct: 181 YMGSSNSAFLQQASYITGGVYLKPQQPNGLFQYLSTVFATDLHSRAFLQLPKSLGVDFRA 240

Query: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           SCFCHKKTIDMGY+CSVCLSIFCKHHKKCSTCG
Sbjct: 241 SCFCHKKTIDMGYICSVCLSIFCKHHKKCSTCG 273

BLAST of Cucsa.178810.4 vs. NCBI nr
Match: gi|694371044|ref|XP_009363173.1| (PREDICTED: general transcription factor IIH subunit 3 [Pyrus x bretschneideri])

HSP 1 Score: 465.3 bits (1196), Expect = 7.3e-128
Identity = 230/274 (83.94%), Positives = 246/274 (89.78%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           MASAPSKLYADDVSLL+VLLDTNPFFWS+S LPFSKFL HVL FLNSIL+LNQLN+VVVI
Sbjct: 1   MASAPSKLYADDVSLLMVLLDTNPFFWSSSNLPFSKFLPHVLTFLNSILLLNQLNQVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNHGLEDGR-MPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSS 120
            TGY SC Y+Y+SSS SN G + GR MPA C+ LL+ LEEFVI DEQ  KE  + G  SS
Sbjct: 61  ATGYNSCSYIYDSSSDSNQGSDHGRIMPARCSNLLQKLEEFVIKDEQLFKEGSREGISSS 120

Query: 121 LLSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDS 180
           LLSGSLSMALCYIQ+VFRSG LHPQPRILCLQGS DGPEQYVAIMN+IFSAQRSMVPIDS
Sbjct: 121 LLSGSLSMALCYIQRVFRSGPLHPQPRILCLQGSSDGPEQYVAIMNSIFSAQRSMVPIDS 180

Query: 181 CYIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFR 240
           CY+GS NSAFLQQASYITGGVYLKPQQ +GLFQYLSTVF TDLHSR FLQLPKS+GVDFR
Sbjct: 181 CYMGSSNSAFLQQASYITGGVYLKPQQPNGLFQYLSTVFATDLHSRAFLQLPKSLGVDFR 240

Query: 241 ASCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           ASCFCHKKTIDMGY+CSVCLSIFCKHHKKCSTCG
Sbjct: 241 ASCFCHKKTIDMGYICSVCLSIFCKHHKKCSTCG 274

BLAST of Cucsa.178810.4 vs. NCBI nr
Match: gi|1009136759|ref|XP_015885697.1| (PREDICTED: RNA polymerase II transcription factor B subunit 4-like [Ziziphus jujuba])

HSP 1 Score: 464.2 bits (1193), Expect = 1.6e-127
Identity = 225/273 (82.42%), Positives = 246/273 (90.11%), Query Frame = 1

Query: 1   MASAPSKLYADDVSLLVVLLDTNPFFWSTSALPFSKFLSHVLAFLNSILVLNQLNEVVVI 60
           M +   KLYADDVSL++V LDTNPFFW+TS+LPFSKFLSHVL+FLNSI++LNQ N+VVVI
Sbjct: 1   MNAVAPKLYADDVSLVMVALDTNPFFWTTSSLPFSKFLSHVLSFLNSIMLLNQFNQVVVI 60

Query: 61  GTGYASCKYLYNSSSYSNHGLEDGRMPALCTRLLKNLEEFVIGDEQSIKEDPKGGTMSSL 120
            TGY SC Y+Y+SS  ++HG E+G+MPALC+ LL+ LEEFVI D+Q  K+    G  SSL
Sbjct: 61  ATGYNSCDYIYDSSLATDHGSENGKMPALCSNLLQKLEEFVIRDQQQRKDGSGEGLPSSL 120

Query: 121 LSGSLSMALCYIQKVFRSGSLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRSMVPIDSC 180
           LSGSLSMALCYIQKVFRSG LHPQPRILCLQGSPDGPEQYVAIMNAIFSAQR MVPIDSC
Sbjct: 121 LSGSLSMALCYIQKVFRSGPLHPQPRILCLQGSPDGPEQYVAIMNAIFSAQRLMVPIDSC 180

Query: 181 YIGSHNSAFLQQASYITGGVYLKPQQMDGLFQYLSTVFGTDLHSRTFLQLPKSVGVDFRA 240
           YIGS+NSAFLQQASYITGGVYLKPQQ+DGLFQYLSTVF TDLHSR FLQLPKSVGVDFRA
Sbjct: 181 YIGSNNSAFLQQASYITGGVYLKPQQLDGLFQYLSTVFATDLHSRRFLQLPKSVGVDFRA 240

Query: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 274
           SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG
Sbjct: 241 SCFCHKKTIDMGYVCSVCLSIFCKHHKKCSTCG 273

The following BLAST results are available for this feature:
Match Name    E-value     Identity  Description
TFB4_ARATH    1.4e-115    74.01     RNA polymerase II transcription factor B subunit 4 OS=Arabidopsis thaliana GN=TF...
TF2H3_DICDI   4.0e-49     41.67     General transcription factor IIH subunit 3 OS=Dictyostelium discoideum GN=gtf2h3...
TF2H3_HUMAN   4.5e-45     38.11     General transcription factor IIH subunit 3 OS=Homo sapiens GN=GTF2H3 PE=1 SV=2
TF2H3_BOVIN   1.3e-44     38.60     General transcription factor IIH subunit 3 OS=Bos taurus GN=GTF2H3 PE=2 SV=1
TF2H3_MOUSE   3.8e-44     39.30     General transcription factor IIH subunit 3 OS=Mus musculus GN=Gtf2h3 PE=1 SV=1

Match Name          E-value     Identity  Description
A0A0A0LMM2_CUCSA    1.7e-155    100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_2G072450 PE=4 SV=1
M5XYU6_PRUPE        1.1e-127    83.88     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009386mg PE=4 SV=1
A0A061FGT5_THECC    4.0e-125    80.22     Basal transcription factor complex subunit-related isoform 1 OS=Theobroma cacao ...
E0CR79_VITVI        4.0e-125    81.39     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g03100 PE=4 SV=...
A0A067GM61_CITSI    1.2e-124    80.22     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g022374mg PE=4 SV=1

Match Name     E-value     Identity  Description
AT1G18340.1    8.1e-117    74.01     basal transcription factor complex subunit-related

Match Name                           E-value     Identity  Description
gi|449470273|ref|XP_004152842.1|     2.4e-155    100.00    PREDICTED: general transcription factor IIH subunit 3 [Cucumis sativus]
gi|659082582|ref|XP_008441918.1|     5.0e-153    98.53     PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH subunit 3 [Cucu...
gi|645225617|ref|XP_008219657.1|     1.7e-129    84.25     PREDICTED: general transcription factor IIH subunit 3 [Prunus mume]
gi|694371044|ref|XP_009363173.1|     7.3e-128    83.94     PREDICTED: general transcription factor IIH subunit 3 [Pyrus x bretschneideri]
gi|1009136759|ref|XP_015885697.1|    1.6e-127    82.42     PREDICTED: RNA polymerase II transcription factor B subunit 4-like [Ziziphus juj...
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term          Definition
IPR004600     TFIIH_Tfb4/GTF2H3
Vocabulary: Biological Process
Term          Definition
GO:0006355    regulation of transcription, DNA-templated
GO:0006289    nucleotide-excision repair
Vocabulary: Cellular Component
Term          Definition
GO:0000439    core TFIIH complex
GO Assignments
This mRNA is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0006289        nucleotide-excision repair
biological_process    GO:0006355        regulation of transcription, DNA-templated
biological_process    GO:0000394        RNA splicing, via endonucleolytic cleavage and ligation
biological_process    GO:0006366        transcription from RNA polymerase II promoter
cellular_component    GO:0000439        core TFIIH complex
molecular_function    GO:0003674        molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
Cucsa.178810    Cucsa.178810    gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name               Type
Cucsa.178810.4    Cucsa.178810.4-protein    polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                       Unique Name                        Type
Cucsa.178810.4.five_prime_UTR.1    Cucsa.178810.4.five_prime_UTR.1    five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
Cucsa.178810.4.CDS.1     Cucsa.178810.4.CDS.1     CDS
Cucsa.178810.4.CDS.2     Cucsa.178810.4.CDS.2     CDS
Cucsa.178810.4.CDS.3     Cucsa.178810.4.CDS.3     CDS
Cucsa.178810.4.CDS.4     Cucsa.178810.4.CDS.4     CDS
Cucsa.178810.4.CDS.5     Cucsa.178810.4.CDS.5     CDS
Cucsa.178810.4.CDS.6     Cucsa.178810.4.CDS.6     CDS
Cucsa.178810.4.CDS.7     Cucsa.178810.4.CDS.7     CDS
Cucsa.178810.4.CDS.8     Cucsa.178810.4.CDS.8     CDS
Cucsa.178810.4.CDS.9     Cucsa.178810.4.CDS.9     CDS
Cucsa.178810.4.CDS.10    Cucsa.178810.4.CDS.10    CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                        Unique Name                         Type
Cucsa.178810.4.three_prime_UTR.1    Cucsa.178810.4.three_prime_UTR.1    three_prime_UTR
Cucsa.178810.4.three_prime_UTR.2    Cucsa.178810.4.three_prime_UTR.2    three_prime_UTR
Cucsa.178810.4.three_prime_UTR.3    Cucsa.178810.4.three_prime_UTR.3    three_prime_UTR
Cucsa.178810.4.three_prime_UTR.3Cucsa.178810.4.three_prime_UTR.3three_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term     IPR Description           Source     Source Term    Source Description                                                   Alignment
IPR004600    TFIIH subunit Tfb4/p34    PANTHER    PTHR12831      TRANSCRIPTION INITIATION FACTOR IIH TFIIH , POLYPEPTIDE 3-RELATED    coord: 1..273, score: 7.8E
IPR004600    TFIIH subunit Tfb4/p34    PFAM       PF03850        Tfb4                                                                 coord: 13..273, score: 4.0