Cla002053 (gene) Watermelon (97103) v1

Name: Cla002053
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: DNA-directed RNA polymerase II subunit 3 (AHRD V1 **** A4RSP9_OSTLU); contains InterPro domain(s) IPR011263 DNA-directed RNA polymerase, RpoA/D/Rpb3-type
Location: Chr11 : 475880 .. 478562 (-)
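The location field packs chromosome, start, end and strand into a single string. A minimal sketch of splitting it apart follows; `parse_location` is a hypothetical helper, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'Chr11 : 475880 .. 478562 (-)' style string into its parts.

    Assumption: coordinates are 1-based and inclusive (not stated on the page).
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span under the assumption above
    }

loc = parse_location("Chr11 : 475880 .. 478562 (-)")
print(loc)
```

The `(-)` strand means the gene lies on the reverse strand of Chr11; the sequences listed on this page start with ATG, so they appear to be given in transcript orientation already.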
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGGGAGTTCGTACCAGCGGTTTCCAAAGGTGAAAATCCGGGAACTCCGAGATGATTACGCCAAATTCGAGCTCCGCGACACCGACGCCAGCATGGCCAACGCCCTCCGTCGCGTCATGATCGCCGAGGTGCCCACCATCGCCATTGACCTGGTAGAAATTGAAGTGAACTCTTCTGTTCTCAACGACGAGTTCATCGCTCATCGCCTTGGTCTCATTCCTCTCACCAGCGAGCGTGCCATGTCCATGCGCTTCTCTCGCGACTGTGATGCTTGCGATGGTGATGGCCAGTGCGAGTTTTGCTCTGTTGAGTTTCACCTTCGTGCCAAGTGCCACTCCGACCAAACCCTAGACGTCACCAGCAAGGATCTCTATAGTTCTGATCACACTGTTGTCCCCGTTGATTTCTCTGATTCTGCTGCAGCTAGTGGTGAAGCGTTGGATACCAAGTATGTCCTCTTACTCTCGTCTCTCAGACGTGTTTATTCTTGCCATTTTCTCGGCACGATTAAGTTGTTTGGCTTGTTGCTGATTATTGCATTAGTTTGTGTAGCTCTAGGTGTGATCTGTAATGTTTAGTCTACAACTCCATTTGCTATTCGGGTTACAAATGGAGGTTGAGGCTTAGGGGATTACACATCTTAGAGATAGACCATCACTAAAGCTCGGTCTCCAAAGTTCTAATTTGTTATTTCTATGCAACATTCCAACATGTTTGAGAACAATATCGTTTTAGTTTGGTGTCATTGTGGTTCATTAAGTAATTGCTCCAAAATTTGAGCTGGTTTTGTGATGATTGTAGAGGAATAATCATTGTGAAGCTACGTCGTGGTCAAGAGTTGCGGCTGAGAGCCATAGCTAGGAAGGGCATTGGTAAAGATCATGCAAAATGGTCGCCTGCAGCAACCGTGACATTCATGTATGAGCCAAACATTCACATCAATGAAGACCTAATGGAAACATTAACCCTAGAGGAAAAGAGAACTTGGGTTGAAAGTTGCCCAACCAGAGTATTTGAATTGGATCCTGTTACTCATCAGGTTCTTCTCATACAACCTCATTTTCTTTTTCTTCTTATTTGTTGCTTTACTACAATTATGCAATTGCTCAGAGGGTTGTGTTGAAGAGGTGATGTGGACGTTATTTGTCCTCCAACTGTATGTTGTTTAGAGTTACCAGATACGCCCCTCCCCCACAAAAAAAAAAAAAAAAAAAAAAAAGAAAATGATTGTCTTGGCAATTTCCTTCCACGGGCTAGAAGAACCTCTCCAAGTTTTGGAATTGATGAAGCCTCTGAAGGAGTCAACATCATGTTTGCTTTTCACGACCAACCTCCAAAAAGCGTTCATTTCATGTTCAAATCAACATAACCAGGTAAAAAGAATTCTCAAAATGAAGATTTTCCACTCCCAAACCTCCCAAGGAATGATGAGTCATGATCCCATTCCAATCTGACTAAATTGCAGCTTCCAACTCCTTGCAACTAGCTTCTATGTTTTCTCACTCTTCTGTTGATAATCGTCCTTCCGAAGGTTGTTTCGACTCAGCAAATGAGATATTTAAAGAGAAGGTGTCTTGAGATGATTTTGAGGAGTCTAATAAGGAGCCTATGGTAAGTTTTAGGCTTTAGTCGTGAAGACTTGGATCAGAATGTTTCGTTTTTCATTGGAAACAGTGAAGCTCTGAAGCTCCTGAAGAGGATTTTAATGATGAAGCTATTGGATAGGCTTTAAGCCTAAGTCTTCTCCAAATCTTCAAGCATCTTCTTCGAGAAGTGCTCTTATTTTCTTAAAGTTCGTGAATTTGAAGTGTGAGATTCGTACAAGAGCCGCTTGTCAAGGCGTTTAAGTCTCTTAATCTATAAGGCTTTTTGGTTGCAGCTGGTTTTGTGTTGTTTATTCTACTCCAGTTATGGTACTCTTAAGGTGTTTAAGTGCTCTTATTTCATTTGTTTATGTTCTTGGTGGCTCCAATTCAGACATGTCTTCCTTCCTTGT
GGTTGGTTTTTTATGGGGCTTGCATTTATAGTATTTTCCCCCATGGTTGGCTGTTTAGTTATTTTGTAGCCTATTTTGGTTTCTTTCTTTCTTTCTTTTATTTTTAAAATGTAATTGCTTCCTGTTATCTTTTCTTGTTTGGTTCCTTCGTTCTTTTTCGCATTTTTTTGATTTCTCCCTCATATTTTATGTATAACCTGAATGGTGTAAAATTTTAGGAGTGCCAATGAATTAGATTTTTTTCATTTAGTTGTATCAACATCCAAATTTTAAGATATTTTGCCAATGGCTAAAGATAGTACAAATTATCCAAACTTAAAATTATCTTGTTTCTGATAGCTAATATATATTCATTTGTGTTTTCTGTCTGTCTTTATGCTGAATTCTTGTTGCTCATTTTGTTCGTAAAGGTTATGGTGGTTGATCCTGAGGCATACACTTATGATGATGAGGTGATTAAGAAAGCAGAAGCTATGGGTAAAGCTGGACTAGTGGATATTACTGCAAAAGAAGATAGTTTTATCTTCACAGTTGAATCTACCGGTGCAATCAAAGCTTCCCAGTTGATACTCAATGCCATAGATATCCTGAAACAGAAGCTGGATGCAGTCCGTCTTTCAGACGATACTGTAGAAGCAGATGATCAGTTTGGTGAGTTAGGTGCACATATGCGAGGTGGATGA

mRNA sequence

ATGGAAGGGAGTTCGTACCAGCGGTTTCCAAAGGTGAAAATCCGGGAACTCCGAGATGATTACGCCAAATTCGAGCTCCGCGACACCGACGCCAGCATGGCCAACGCCCTCCGTCGCGTCATGATCGCCGAGGTGCCCACCATCGCCATTGACCTGGTAGAAATTGAAGTGAACTCTTCTGTTCTCAACGACGAGTTCATCGCTCATCGCCTTGGTCTCATTCCTCTCACCAGCGAGCGTGCCATGTCCATGCGCTTCTCTCGCGACTGTGATGCTTGCGATGGTGATGGCCAGTGCGAGTTTTGCTCTGTTGAGTTTCACCTTCGTGCCAAGTGCCACTCCGACCAAACCCTAGACGTCACCAGCAAGGATCTCTATAGTTCTGATCACACTGTTGTCCCCGTTGATTTCTCTGATTCTGCTGCAGCTAGTGGTGAAGCGTTGGATACCAAAGGAATAATCATTGTGAAGCTACGTCGTGGTCAAGAGTTGCGGCTGAGAGCCATAGCTAGGAAGGGCATTGGTAAAGATCATGCAAAATGGTCGCCTGCAGCAACCGTGACATTCATGTATGAGCCAAACATTCACATCAATGAAGACCTAATGGAAACATTAACCCTAGAGGAAAAGAGAACTTGGGTTGAAAGTTGCCCAACCAGAGTATTTGAATTGGATCCTGTTACTCATCAGGTTATGGTGGTTGATCCTGAGGCATACACTTATGATGATGAGGTGATTAAGAAAGCAGAAGCTATGGGTAAAGCTGGACTAGTGGATATTACTGCAAAAGAAGATAGTTTTATCTTCACAGTTGAATCTACCGGTGCAATCAAAGCTTCCCAGTTGATACTCAATGCCATAGATATCCTGAAACAGAAGCTGGATGCAGTCCGTCTTTCAGACGATACTGTAGAAGCAGATGATCAGTTTGGTGAGTTAGGTGCACATATGCGAGGTGGATGA

Coding sequence (CDS)

ATGGAAGGGAGTTCGTACCAGCGGTTTCCAAAGGTGAAAATCCGGGAACTCCGAGATGATTACGCCAAATTCGAGCTCCGCGACACCGACGCCAGCATGGCCAACGCCCTCCGTCGCGTCATGATCGCCGAGGTGCCCACCATCGCCATTGACCTGGTAGAAATTGAAGTGAACTCTTCTGTTCTCAACGACGAGTTCATCGCTCATCGCCTTGGTCTCATTCCTCTCACCAGCGAGCGTGCCATGTCCATGCGCTTCTCTCGCGACTGTGATGCTTGCGATGGTGATGGCCAGTGCGAGTTTTGCTCTGTTGAGTTTCACCTTCGTGCCAAGTGCCACTCCGACCAAACCCTAGACGTCACCAGCAAGGATCTCTATAGTTCTGATCACACTGTTGTCCCCGTTGATTTCTCTGATTCTGCTGCAGCTAGTGGTGAAGCGTTGGATACCAAAGGAATAATCATTGTGAAGCTACGTCGTGGTCAAGAGTTGCGGCTGAGAGCCATAGCTAGGAAGGGCATTGGTAAAGATCATGCAAAATGGTCGCCTGCAGCAACCGTGACATTCATGTATGAGCCAAACATTCACATCAATGAAGACCTAATGGAAACATTAACCCTAGAGGAAAAGAGAACTTGGGTTGAAAGTTGCCCAACCAGAGTATTTGAATTGGATCCTGTTACTCATCAGGTTATGGTGGTTGATCCTGAGGCATACACTTATGATGATGAGGTGATTAAGAAAGCAGAAGCTATGGGTAAAGCTGGACTAGTGGATATTACTGCAAAAGAAGATAGTTTTATCTTCACAGTTGAATCTACCGGTGCAATCAAAGCTTCCCAGTTGATACTCAATGCCATAGATATCCTGAAACAGAAGCTGGATGCAGTCCGTCTTTCAGACGATACTGTAGAAGCAGATGATCAGTTTGGTGAGTTAGGTGCACATATGCGAGGTGGATGA

Protein sequence

MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSSVLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDVTSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAKWSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYTYDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLSDDTVEADDQFGELGAHMRGG
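The protein sequence is the frame-1 translation of the CDS. The sketch below illustrates this on the first 30 and last 15 nucleotides of the CDS listed above, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# with bases in T, C, A, G order (third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Fragments copied from the CDS above; both start on a codon boundary.
cds_start = "ATGGAAGGGAGTTCGTACCAGCGGTTTCCA"
cds_end = "ATGCGAGGTGGATGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten and last four residues of the protein sequence shown above.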
BLAST of Cla002053 vs. Swiss-Prot
Match: NRPB3_ARATH (DNA-directed RNA polymerases II, IV and V subunit 3 OS=Arabidopsis thaliana GN=NRPB3 PE=1 SV=1)

HSP 1 Score: 528.9 bits (1361), Expect = 3.9e-149
Identity = 263/320 (82.19%), Positives = 296/320 (92.50%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           M+G++YQRFPK+KIREL+DDYAKFELR+TD SMANALRRVMI+EVPT+AIDLVEIEVNSS
Sbjct: 1   MDGATYQRFPKIKIRELKDDYAKFELRETDVSMANALRRVMISEVPTVAIDLVEIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEF L +KC +DQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFRLSSKCVTDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TS+DLYS+D TV PVDF+  ++ S ++ + KGIIIVKLRRGQEL+LRAIARKGIGKDHAK
Sbjct: 121 TSRDLYSADPTVTPVDFTIDSSVS-DSSEHKGIIIVKLRRGQELKLRAIARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVTFMYEP+I INED+M+TL+ EEK   +ES PT+VF +DPVT QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPDIIINEDMMDTLSDEEKIDLIESSPTKVFGMDPVTRQVVVVDPEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YD+EVIKKAEAMGK GL++I+ K+DSFIFTVESTGA+KASQL+LNAID+LKQKLDAVRLS
Sbjct: 241 YDEEVIKKAEAMGKPGLIEISPKDDSFIFTVESTGAVKASQLVLNAIDLLKQKLDAVRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           DDTVEADDQFGELGAHMRGG
Sbjct: 301 DDTVEADDQFGELGAHMRGG 319
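The Identity figure in each HSP header is the number of alignment columns where query and subject carry the same residue, divided by the alignment length. A minimal sketch, using the first 60-column block of the NRPB3_ARATH alignment above; `identity_stats` is a hypothetical helper.

```python
def identity_stats(query, subject):
    """Count identical columns in two equal-length aligned strings.

    Gap characters ('-') never count as identical.
    """
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return identical, len(query)

# First block (columns 1..60) of the NRPB3_ARATH alignment.
query = "MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS"
sbjct = "MDGATYQRFPKIKIRELKDDYAKFELRETDVSMANALRRVMISEVPTVAIDLVEIEVNSS"

identical, columns = identity_stats(query, sbjct)
print(identical, columns)

# The header percentages are the same ratio taken over the whole HSP.
identity_pct = round(100 * 263 / 320, 2)
positives_pct = round(100 * 296 / 320, 2)
print(identity_pct, positives_pct)
```

Positives additionally count conservative substitutions (the `+` marks on the middle line), which depend on the scoring matrix used for the search and are therefore not recomputed here.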

BLAST of Cla002053 vs. Swiss-Prot
Match: RPD3B_ARATH (DNA-directed RNA polymerases IV and V subunit 3B OS=Arabidopsis thaliana GN=NRPD3B PE=1 SV=2)

HSP 1 Score: 489.6 bits (1259), Expect = 2.7e-137
Identity = 246/320 (76.88%), Positives = 283/320 (88.44%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           M+G +YQRFP VKIREL+DDYAKFELR+TD SMANALRRVMI+EVPT+AI LV+IEVNSS
Sbjct: 1   MDGVTYQRFPTVKIRELKDDYAKFELRETDVSMANALRRVMISEVPTMAIHLVKIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIA RL LIPLTSERAMSMRF +DC+ C+GD  CEFCSVEF L AKC +DQTLDV
Sbjct: 61  VLNDEFIAQRLSLIPLTSERAMSMRFCQDCEDCNGDEHCEFCSVEFPLSAKCVTDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TS+DLYS+D TV PVDF+ +++ S ++ + KGIII KLRRGQEL+L+A+ARKGIGKDHAK
Sbjct: 121 TSRDLYSADPTVTPVDFTSNSSTS-DSSEHKGIIIAKLRRGQELKLKALARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVT+MYEP+I INE++M TLT EEK   +ES PT+VF +DPVT QV+VVDPEAYT
Sbjct: 181 WSPAATVTYMYEPDIIINEEMMNTLTDEEKIDLIESSPTKVFGIDPVTGQVVVVDPEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YD+EVIKKAEAMGK GL++I  K DSF+FTVESTGA+KASQL+LNAIDILKQKLDA+RLS
Sbjct: 241 YDEEVIKKAEAMGKPGLIEIHPKHDSFVFTVESTGALKASQLVLNAIDILKQKLDAIRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           D+TVEADDQFGELGAHMR G
Sbjct: 301 DNTVEADDQFGELGAHMREG 319

BLAST of Cla002053 vs. Swiss-Prot
Match: RPB3_DICDI (DNA-directed RNA polymerase II subunit rpb3 OS=Dictyostelium discoideum GN=polr2c PE=3 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 8.7e-72
Identity = 146/297 (49.16%), Positives = 197/297 (66.33%), Query Frame = 1

Query: 4   SSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSSVLN 63
           S   R P+++I E+++D   F L +TD S+ANALRRVMIAEVPT+ IDLVE E N+SVL 
Sbjct: 6   SQLTRQPELEILEIKNDSIIFILSNTDISVANALRRVMIAEVPTMCIDLVEFESNNSVLC 65

Query: 64  DEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDVTSK 123
           DEFIAHRLGLIPL S+      ++RDC   D   +C+ CSVE  L  KC  ++  DVTS 
Sbjct: 66  DEFIAHRLGLIPLVSDNIDKFCYTRDCSCSD---RCDQCSVELRLNVKCTENRPRDVTSS 125

Query: 124 DLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAKWSP 183
           DL S +  V+PV    +++ S +      I IVKLRRGQE++LRAIA+KG+GK+HAKWSP
Sbjct: 126 DLLSQNSAVIPVSSQVTSSNSEQE-----IPIVKLRRGQEIKLRAIAKKGVGKEHAKWSP 185

Query: 184 AATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDP--VTHQVMVVDPEAYTY 243
           +   T+ ++P I +N++ ++ LT ++K  WV SCPT+V+   P   T QV + DP    Y
Sbjct: 186 SCVATYQFQPIIVLNQNRIDELTDQQKEEWVGSCPTKVYSYSPHQSTQQVTIEDPLRCVY 245

Query: 244 DDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVR 299
             E  KKAE+ GK  LV +  K+D FIFTVES+GA+K   ++L AI I+K+KL  ++
Sbjct: 246 CLECKKKAESFGKPDLVHLEQKQDKFIFTVESSGALKPEDIVLYAIQIIKRKLTDIQ 294

BLAST of Cla002053 vs. Swiss-Prot
Match: RPB3_BOVIN (DNA-directed RNA polymerase II subunit RPB3 OS=Bos taurus GN=POLR2C PE=1 SV=1)

HSP 1 Score: 199.5 bits (506), Expect = 5.5e-50
Identity = 120/290 (41.38%), Positives = 165/290 (56.90%), Query Frame = 1

Query: 6   YQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSSVLNDE 65
           Y   P V+I EL D+  KF + +TD ++AN++RRV IAEVP IAID V+I+ NSSVL+DE
Sbjct: 3   YANQPTVRITELTDENVKFIIENTDLAVANSIRRVFIAEVPIIAIDWVQIDANSSVLHDE 62

Query: 66  FIAHRLGLIPLTSERAMS-MRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDVTSKD 125
           FIAHRLGLIPLTS+  +  +++SRDC     +  C  CSVEF L  +C+ DQT  VTS+D
Sbjct: 63  FIAHRLGLIPLTSDDIVDKLQYSRDCTC---EEFCPECSVEFTLDVRCNEDQTRHVTSRD 122

Query: 126 LYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAKWSPA 185
           L S+   V+PV   +      + ++   I+IVKLR+GQELRLRA A+KG GK+HAKW+P 
Sbjct: 123 LISNSPRVIPVTSRNRDNDPNDYVEQDDILIVKLRKGQELRLRAYAKKGFGKEHAKWNPT 182

Query: 186 ATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYTYDDE 245
           A V F Y+P     ++ +      +   W +S      ELD         D     YD  
Sbjct: 183 AGVAFEYDP-----DNALRHTVYPKPEEWPKS---EYSELDE--------DESQAPYDP- 242

Query: 246 VIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKL 295
                             K + F + VES G+++   ++L+A+  LK+KL
Sbjct: 243 ----------------NGKPERFYYNVESCGSLRPETIVLSALSGLKKKL 256

BLAST of Cla002053 vs. Swiss-Prot
Match: RPB3_HUMAN (DNA-directed RNA polymerase II subunit RPB3 OS=Homo sapiens GN=POLR2C PE=1 SV=2)

HSP 1 Score: 197.2 bits (500), Expect = 2.7e-49
Identity = 119/290 (41.03%), Positives = 164/290 (56.55%), Query Frame = 1

Query: 6   YQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSSVLNDE 65
           Y   P V+I EL D+  KF + +TD ++AN++RRV IAEVP IAID V+I+ NSSVL+DE
Sbjct: 3   YANQPTVRITELTDENVKFIIENTDLAVANSIRRVFIAEVPIIAIDWVQIDANSSVLHDE 62

Query: 66  FIAHRLGLIPLTSERAMS-MRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDVTSKD 125
           FIAHRLGLIPL S+  +  +++SRDC     +  C  CSVEF L  +C+ DQT  VTS+D
Sbjct: 63  FIAHRLGLIPLISDDIVDKLQYSRDCTC---EEFCPECSVEFTLDVRCNEDQTRHVTSRD 122

Query: 126 LYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAKWSPA 185
           L S+   V+PV   +      + ++   I+IVKLR+GQELRLRA A+KG GK+HAKW+P 
Sbjct: 123 LISNSPRVIPVTSRNRDNDPNDYVEQDDILIVKLRKGQELRLRAYAKKGFGKEHAKWNPT 182

Query: 186 ATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYTYDDE 245
           A V F Y+P     ++ +      +   W +S      ELD         D     YD  
Sbjct: 183 AGVAFEYDP-----DNALRHTVYPKPEEWPKS---EYSELDE--------DESQAPYDP- 242

Query: 246 VIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKL 295
                             K + F + VES G+++   ++L+A+  LK+KL
Sbjct: 243 ----------------NGKPERFYYNVESCGSLRPETIVLSALSGLKKKL 256

BLAST of Cla002053 vs. TrEMBL
Match: A0A0A0LHS1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G093850 PE=4 SV=1)

HSP 1 Score: 625.9 bits (1613), Expect = 2.6e-176
Identity = 316/320 (98.75%), Positives = 319/320 (99.69%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TSKDLYSSDHTVVPVDFSDSAAA+GEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFSDSAAATGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVTFMYEP+IHINEDLMETLTLEEKRTWVESCPTRVFELD VTHQVMVVDPEAYT
Sbjct: 181 WSPAATVTFMYEPSIHINEDLMETLTLEEKRTWVESCPTRVFELDTVTHQVMVVDPEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YDDEVIKKAEAMGKAGLVDITA+EDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS
Sbjct: 241 YDDEVIKKAEAMGKAGLVDITAREDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           DDTVEADDQFGELGAHMRGG
Sbjct: 301 DDTVEADDQFGELGAHMRGG 320

BLAST of Cla002053 vs. TrEMBL
Match: A0A061E7U0_THECC (DNA-directed RNA polymerase family protein OS=Theobroma cacao GN=TCM_007121 PE=4 SV=1)

HSP 1 Score: 567.4 bits (1461), Expect = 1.1e-158
Identity = 286/320 (89.38%), Positives = 306/320 (95.62%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           MEG SYQRFPKVKIREL+DDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGVSYQRFPKVKIRELKDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC +DQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCMTDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TSKDLYSSDHTVVPVDF+DSA    ++ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFTDSAGY--DSSEQRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVTFMYEP IHINED+METLTLEEK+++VES PTRVF++DP T QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDMMETLTLEEKQSFVESSPTRVFDIDPNTQQVVVVDPEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YDDEV+KKAEAMGK GLV+I AKEDSFIFTVESTGAIKASQL+LNAI++LKQKLDAVRLS
Sbjct: 241 YDDEVLKKAEAMGKPGLVEIYAKEDSFIFTVESTGAIKASQLVLNAIEVLKQKLDAVRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           +DTVEADDQFGELGAHMRGG
Sbjct: 301 EDTVEADDQFGELGAHMRGG 318

BLAST of Cla002053 vs. TrEMBL
Match: A0A0B0MXX2_GOSAR (DNA-directed RNA polymerase II subunit RPB3-A-like protein OS=Gossypium arboreum GN=F383_29872 PE=4 SV=1)

HSP 1 Score: 563.5 bits (1451), Expect = 1.6e-157
Identity = 285/320 (89.06%), Positives = 305/320 (95.31%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           MEG SYQRFPKVKIREL+DDYAKFELRDTDASMANALRRVMI+EVPTIAIDLVEIEVNSS
Sbjct: 1   MEGISYQRFPKVKIRELKDDYAKFELRDTDASMANALRRVMISEVPTIAIDLVEIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC SDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCISDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TSKDLYSSDHTVVPVDF+D+A    ++ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFTDNAGY--DSTEPRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVTFMYEP IHINEDLME+LTLEEK ++VES PT+VF++DP T QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDLMESLTLEEKLSFVESSPTKVFDIDPNTQQVVVVDPEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YDDEV+KKAEAMGK GLV+I AKEDSFIFTVESTGAIKASQL+LNAI+ILKQKLDAVRLS
Sbjct: 241 YDDEVLKKAEAMGKPGLVEIYAKEDSFIFTVESTGAIKASQLVLNAIEILKQKLDAVRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           +DTVEADDQFGELGAHMRGG
Sbjct: 301 EDTVEADDQFGELGAHMRGG 318

BLAST of Cla002053 vs. TrEMBL
Match: A0A0D2SZ13_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G023500 PE=4 SV=1)

HSP 1 Score: 563.5 bits (1451), Expect = 1.6e-157
Identity = 285/320 (89.06%), Positives = 305/320 (95.31%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           MEG SYQRFPKVKIREL+DDYAKFELRDTDASMANALRRVMI+EVPTIAIDLVEIEVNSS
Sbjct: 1   MEGISYQRFPKVKIRELKDDYAKFELRDTDASMANALRRVMISEVPTIAIDLVEIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC SDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCISDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TSKDLYSSDHTVVPVDF+D+A    ++ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFTDNAGY--DSTEPRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVTFMYEP IHINEDLME+LTLEEK ++VES PT+VF++DP T QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDLMESLTLEEKLSFVESSPTKVFDIDPNTQQVVVVDPEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YDDEV+KKAEAMGK GLV+I AKEDSFIFTVESTGAIKASQL+LNAI+ILKQKLDAVRLS
Sbjct: 241 YDDEVLKKAEAMGKPGLVEIYAKEDSFIFTVESTGAIKASQLVLNAIEILKQKLDAVRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           +DTVEADDQFGELGAHMRGG
Sbjct: 301 EDTVEADDQFGELGAHMRGG 318

BLAST of Cla002053 vs. TrEMBL
Match: V4SBK3_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028849mg PE=4 SV=1)

HSP 1 Score: 562.4 bits (1448), Expect = 3.6e-157
Identity = 284/320 (88.75%), Positives = 301/320 (94.06%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           M+G+SYQRFPKVKIREL+DDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MDGASYQRFPKVKIRELKDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TSKDLYSSDH+VVPVDF D A         +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHSVVPVDFVDPAGYDSTD-QQRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVTFMYEP IHINEDLME+L+LEEK++WVES PT+VF++DP T QV VVD EAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDLMESLSLEEKQSWVESSPTKVFDIDPNTGQVYVVDAEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YDDEVIKKAEAMGK GLV+I AKEDSFIFTVESTGAIKASQL+LNAI++LKQKLDAVRLS
Sbjct: 241 YDDEVIKKAEAMGKPGLVEIYAKEDSFIFTVESTGAIKASQLVLNAIEVLKQKLDAVRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           +DTVEADDQFGELGAHMRGG
Sbjct: 301 EDTVEADDQFGELGAHMRGG 319

BLAST of Cla002053 vs. NCBI nr
Match: gi|449442251|ref|XP_004138895.1| (PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3 [Cucumis sativus])

HSP 1 Score: 625.9 bits (1613), Expect = 3.8e-176
Identity = 316/320 (98.75%), Positives = 319/320 (99.69%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TSKDLYSSDHTVVPVDFSDSAAA+GEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFSDSAAATGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVTFMYEP+IHINEDLMETLTLEEKRTWVESCPTRVFELD VTHQVMVVDPEAYT
Sbjct: 181 WSPAATVTFMYEPSIHINEDLMETLTLEEKRTWVESCPTRVFELDTVTHQVMVVDPEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YDDEVIKKAEAMGKAGLVDITA+EDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS
Sbjct: 241 YDDEVIKKAEAMGKAGLVDITAREDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           DDTVEADDQFGELGAHMRGG
Sbjct: 301 DDTVEADDQFGELGAHMRGG 320

BLAST of Cla002053 vs. NCBI nr
Match: gi|659082285|ref|XP_008441761.1| (PREDICTED: LOW QUALITY PROTEIN: DNA-directed RNA polymerases II, IV and V subunit 3-like [Cucumis melo])

HSP 1 Score: 625.2 bits (1611), Expect = 6.5e-176
Identity = 315/320 (98.44%), Positives = 319/320 (99.69%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TSKDLYSSDHTVVPVDFSDSAAA+GEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFSDSAAATGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVTFMYEP+IHINEDLMETLTLEEK TWVESCPTRVFELDPVTHQVMVV+PEAYT
Sbjct: 181 WSPAATVTFMYEPSIHINEDLMETLTLEEKXTWVESCPTRVFELDPVTHQVMVVEPEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YDDEVIKKAEAMGKAGLVDITA+EDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS
Sbjct: 241 YDDEVIKKAEAMGKAGLVDITAREDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           DDTVEADDQFGELGAHMRGG
Sbjct: 301 DDTVEADDQFGELGAHMRGG 320

BLAST of Cla002053 vs. NCBI nr
Match: gi|1009149980|ref|XP_015892771.1| (PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3 [Ziziphus jujuba])

HSP 1 Score: 572.8 bits (1475), Expect = 3.8e-160
Identity = 285/320 (89.06%), Positives = 306/320 (95.62%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           MEG SYQRFPKVKIRE++DDY KFELR+TDASMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGKSYQRFPKVKIREMKDDYLKFELRETDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIAHRLGL+PLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV
Sbjct: 61  VLNDEFIAHRLGLVPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TSKDLYSSDHTVVPVDFSDSA    ++ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFSDSAGY--DSSENRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVTFMYEP+I INE+LM+TLT EEKR WV+S PT+VF++DP THQV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPDIRINEELMDTLTFEEKRNWVDSSPTKVFDIDPNTHQVVVVDPEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YD+EVIKKA+AMGK GLVDITAKEDSFIFTVESTGAIKASQL+LNAI++LKQKLDAVRLS
Sbjct: 241 YDEEVIKKADAMGKHGLVDITAKEDSFIFTVESTGAIKASQLLLNAIEVLKQKLDAVRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           DDTVEADDQFGELGAHMRGG
Sbjct: 301 DDTVEADDQFGELGAHMRGG 318

BLAST of Cla002053 vs. NCBI nr
Match: gi|590686859|ref|XP_007042502.1| (DNA-directed RNA polymerase family protein [Theobroma cacao])

HSP 1 Score: 567.4 bits (1461), Expect = 1.6e-158
Identity = 286/320 (89.38%), Positives = 306/320 (95.62%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           MEG SYQRFPKVKIREL+DDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS
Sbjct: 1   MEGVSYQRFPKVKIRELKDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC +DQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCMTDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TSKDLYSSDHTVVPVDF+DSA    ++ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFTDSAGY--DSSEQRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVTFMYEP IHINED+METLTLEEK+++VES PTRVF++DP T QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDMMETLTLEEKQSFVESSPTRVFDIDPNTQQVVVVDPEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YDDEV+KKAEAMGK GLV+I AKEDSFIFTVESTGAIKASQL+LNAI++LKQKLDAVRLS
Sbjct: 241 YDDEVLKKAEAMGKPGLVEIYAKEDSFIFTVESTGAIKASQLVLNAIEVLKQKLDAVRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           +DTVEADDQFGELGAHMRGG
Sbjct: 301 EDTVEADDQFGELGAHMRGG 318

BLAST of Cla002053 vs. NCBI nr
Match: gi|823203502|ref|XP_012436162.1| (PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3 [Gossypium raimondii])

HSP 1 Score: 563.5 bits (1451), Expect = 2.3e-157
Identity = 285/320 (89.06%), Positives = 305/320 (95.31%), Query Frame = 1

Query: 1   MEGSSYQRFPKVKIRELRDDYAKFELRDTDASMANALRRVMIAEVPTIAIDLVEIEVNSS 60
           MEG SYQRFPKVKIREL+DDYAKFELRDTDASMANALRRVMI+EVPTIAIDLVEIEVNSS
Sbjct: 1   MEGISYQRFPKVKIRELKDDYAKFELRDTDASMANALRRVMISEVPTIAIDLVEIEVNSS 60

Query: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCHSDQTLDV 120
           VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKC SDQTLDV
Sbjct: 61  VLNDEFIAHRLGLIPLTSERAMSMRFSRDCDACDGDGQCEFCSVEFHLRAKCISDQTLDV 120

Query: 121 TSKDLYSSDHTVVPVDFSDSAAASGEALDTKGIIIVKLRRGQELRLRAIARKGIGKDHAK 180
           TSKDLYSSDHTVVPVDF+D+A    ++ + +GIIIVKLRRGQELRLRAIARKGIGKDHAK
Sbjct: 121 TSKDLYSSDHTVVPVDFTDNAGY--DSTEPRGIIIVKLRRGQELRLRAIARKGIGKDHAK 180

Query: 181 WSPAATVTFMYEPNIHINEDLMETLTLEEKRTWVESCPTRVFELDPVTHQVMVVDPEAYT 240
           WSPAATVTFMYEP IHINEDLME+LTLEEK ++VES PT+VF++DP T QV+VVDPEAYT
Sbjct: 181 WSPAATVTFMYEPEIHINEDLMESLTLEEKLSFVESSPTKVFDIDPNTQQVVVVDPEAYT 240

Query: 241 YDDEVIKKAEAMGKAGLVDITAKEDSFIFTVESTGAIKASQLILNAIDILKQKLDAVRLS 300
           YDDEV+KKAEAMGK GLV+I AKEDSFIFTVESTGAIKASQL+LNAI+ILKQKLDAVRLS
Sbjct: 241 YDDEVLKKAEAMGKPGLVEIYAKEDSFIFTVESTGAIKASQLVLNAIEILKQKLDAVRLS 300

Query: 301 DDTVEADDQFGELGAHMRGG 321
           +DTVEADDQFGELGAHMRGG
Sbjct: 301 EDTVEADDQFGELGAHMRGG 318

The following BLAST results are available for this feature:
Swiss-Prot
Match Name                          E-value    Identity (%)  Description
NRPB3_ARATH                         3.9e-149   82.19         DNA-directed RNA polymerases II, IV and V subunit 3 OS=Arabidopsis thaliana GN=N...
RPD3B_ARATH                         2.7e-137   76.88         DNA-directed RNA polymerases IV and V subunit 3B OS=Arabidopsis thaliana GN=NRPD...
RPB3_DICDI                          8.7e-72    49.16         DNA-directed RNA polymerase II subunit rpb3 OS=Dictyostelium discoideum GN=polr2...
RPB3_BOVIN                          5.5e-50    41.38         DNA-directed RNA polymerase II subunit RPB3 OS=Bos taurus GN=POLR2C PE=1 SV=1
RPB3_HUMAN                          2.7e-49    41.03         DNA-directed RNA polymerase II subunit RPB3 OS=Homo sapiens GN=POLR2C PE=1 SV=2

TrEMBL
Match Name                          E-value    Identity (%)  Description
A0A0A0LHS1_CUCSA                    2.6e-176   98.75         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G093850 PE=4 SV=1
A0A061E7U0_THECC                    1.1e-158   89.38         DNA-directed RNA polymerase family protein OS=Theobroma cacao GN=TCM_007121 PE=4...
A0A0B0MXX2_GOSAR                    1.6e-157   89.06         DNA-directed RNA polymerase II subunit RPB3-A-like protein OS=Gossypium arboreum...
A0A0D2SZ13_GOSRA                    1.6e-157   89.06         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G023500 PE=4 SV=1
V4SBK3_9ROSI                        3.6e-157   88.75         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028849mg PE=4 SV=1

NCBI nr
Match Name                          E-value    Identity (%)  Description
gi|449442251|ref|XP_004138895.1|    3.8e-176   98.75         PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3 [Cucumis sativus]
gi|659082285|ref|XP_008441761.1|    6.5e-176   98.44         PREDICTED: LOW QUALITY PROTEIN: DNA-directed RNA polymerases II, IV and V subuni...
gi|1009149980|ref|XP_015892771.1|   3.8e-160   89.06         PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3 [Ziziphus jujuba]
gi|590686859|ref|XP_007042502.1|    1.6e-158   89.38         DNA-directed RNA polymerase family protein [Theobroma cacao]
gi|823203502|ref|XP_012436162.1|    2.3e-157   89.06         PREDICTED: DNA-directed RNA polymerases II, IV and V subunit 3 [Gossypium raimon...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR001514    DNA-dir_RNA_pol_30-40kDasu_CS
IPR009025    RBP11-like_dimer
IPR011262    DNA-dir_RNA_pol_insert
IPR011263    DNA-dir_RNA_pol_RpoA/D/Rpb3

Vocabulary: Biological Process
Term         Definition
GO:0006351   transcription, DNA-templated

Vocabulary: Molecular Function
Term         Definition
GO:0003677   DNA binding
GO:0003899   DNA-directed RNA polymerase activity
GO:0046983   protein dimerization activity
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0006144       purine nucleobase metabolic process
biological_process   GO:0006206       pyrimidine nucleobase metabolic process
biological_process   GO:0006351       transcription, DNA-templated
cellular_component   GO:0005730       nucleolus
molecular_function   GO:0003677       DNA binding
molecular_function   GO:0003899       DNA-directed RNA polymerase activity
molecular_function   GO:0046983       protein dimerization activity
This gene is associated with the following unigenes:
Unigene Name   Analysis Name                           Sequence type in Unigene
WMU11183       watermelon unigene v2 vs TrEMBL         transcribed_cluster
WMU31778       watermelon unigene v2 vs TrEMBL         transcribed_cluster
WMU49241       watermelon EST collection version 2.0   transcribed_cluster
WMU64940       watermelon EST collection version 2.0   transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
Cla002053      Cla002053.1   mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name   Unique Name   Type
WMU64940       WMU64940      transcribed_cluster
WMU11183       WMU11183      transcribed_cluster
WMU31778       WMU31778      transcribed_cluster
WMU49241       WMU49241      transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                                                Source   Source Term         Source Description                                Alignment
IPR001514  DNA-directed RNA polymerase, 30-40kDa subunit, conserved site  PROSITE  PS00446             RNA_POL_D_30KD                                    coord: 35..75, score: ...
IPR009025  DNA-directed RNA polymerase, RBP11-like dimerisation domain    PFAM     PF01193             RNA_pol_L                                         coord: 23..294, score: 2.7
IPR009025  DNA-directed RNA polymerase, RBP11-like dimerisation domain    unknown  SSF55257            RBP11-like subunits of RNA polymerase             coord: 214..302, score: 6.48E-35; coord: 10..51, score: 6.48
IPR011262  DNA-directed RNA polymerase, insert domain                     GENE3D   G3DSA:2.170.120.12  -                                                 coord: 47..179, score: 2.9
IPR011262  DNA-directed RNA polymerase, insert domain                     PFAM     PF01000             RNA_pol_A_bac                                     coord: 53..182, score: 3.2
IPR011262  DNA-directed RNA polymerase, insert domain                     unknown  SSF56553            Insert subdomain of RNA polymerase alpha subunit  coord: 46..183, score: 2.79
IPR011263  DNA-directed RNA polymerase, RpoA/D/Rpb3-type                  SMART    SM00662             rpoldneu2                                         coord: 21..300, score: 1.9
None       No IPR available                                               GENE3D   G3DSA:3.30.1360.10  -                                                 coord: 6..46, score: 2.7E-44; coord: 180..298, score: 2.7
None       No IPR available                                               PANTHER  PTHR11800           DNA-DIRECTED RNA POLYMERASE                       coord: 5..318, score: 1.3E
None       No IPR available                                               PANTHER  PTHR11800:SF2       DNA-DIRECTED RNA POLYMERASE II SUBUNIT RPB3       coord: 5..318, score: 1.3E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:

None