|
|
|
|
Vol. 14, No. 20, pp. 2551-2569, October 15, 2000
1 Department of Molecular and Cell Biology and 2 Howard Hughes Medical Institute, University of California Berkeley, Berkeley, California 94720, USA
An enormous body of work generated over the past
three decades has revealed that eukaryotic gene transcription is a
remarkably intricate biochemical process that is tightly regulated at
many levels. Biochemical and genetic analysis of various model
organisms has identified an astounding number of protein factors
responsible for transcriptional control. Although a large assortment of
gene-specific DNA-binding regulators was somewhat anticipated, the
sheer complexity of the general machinery relative to prokaryotes has
been a surprise. Even more unexpected were the numerous and intricate
layers of control imposed by the diversification of co-activators and
co-repressors, some of which possess enzymatic activities. Many
interactions between the identified factors and some of their
rate-limiting steps have been discerned. Despite these advances,
surprisingly little is known about the detailed mechanisms by which
individual genes are turned on or off in a cell. Recent evidence
suggests that there is an ordered progression of events leading to RNA synthesis in vivo and that a highly structured eukaryotic nucleus may
be important in orchestrating transcription. In this review, we present
our interpretation of recent findings and discuss various models that
integrate these observations with the emerging elaborate molecular
apparatus that has evolved to control gene expression.
Eukaryotic cells carry a tremendous amount of genetic information just
to encode the 6000 to 100,000 proteins necessary to perpetuate life
from yeast to animals. In addition, genomes must also contain vast
amounts of cis-regulatory DNA responsible for directing
spatial and temporal patterns of gene expression in response to
metabolic requirements, developmental programs, and a plethora of
external stimuli. To maintain and control such a large genetic load,
eukaryotes have organized co-linear DNA into discrete chromosomes each
packaged into chromatin, the minimal unit of which has been defined as
the nucleosome (Kornberg 1974 Aside from the unique physical properties imparted by chromatin to
the DNA template, the first indication that mechanisms of eukaryotic
transcription might have diverged significantly from prokaryotes was
suggested decades ago by the finding that animal cells employ three
distinct and separate enzymes for RNA synthesis (Roeder and Rutter
1969
![]()
Introduction
Top
Introduction
Diversification of...
A requirement for co-regulators
Classifying a profusion of...
Two models at opposite...
A pre-assembly model
If not stepwise nor...
Organization of the nucleus:...
A speculative model:...
Conclusions
References
; Luger et al. 1997
). Variable degrees of
DNA sequence accessibility exist within chromatin throughout the cell
cycle to accommodate essential biological processes such as DNA
replication, gene expression, and cell division. However, classically
held notions of chromatin as merely a passive DNA-packaging vehicle and
global repressor of transcription have proven to be inadequate to
explain its role in gene expression (Lorch et al. 1987
; for review, see
Grunstein 1992
). Instead, it has become clear that chromatin is a
dynamic and active participant in regulating transcription of the
eukaryotic genome. Thus, the question of how gene expression is
regulated in complex eukaryotic genomes has re-focused on the molecular machines that have evolved to navigate through chromatin and mediate transcriptional control.
![]()
Diversification of transcription factors
Top
Introduction
Diversification of...
A requirement for co-regulators
Classifying a profusion of...
Two models at opposite...
A pre-assembly model
If not stepwise nor...
Organization of the nucleus:...
A speculative model:...
Conclusions
References
). Fine structure mapping and genetic analysis of a few highly
expressed cellular genes and other transcription units harbored by DNA
tumor viruses that utilize the host cell machinery led to the discovery
of cis control elements responsible for directing the
transcription of protein-encoding genes by RNA polymerase II (Pol II;
Fig. 1A; for review, see McKnight and Tjian
1986
; Maniatis et al. 1987
). It was soon appreciated that eukaryotic
genes contain complex arrays of specific DNA sequences that combine
more commonly shared core promoter elements with broadly diverse
gene-specific enhancer elements and cooperate to define specific
expression patterns (for review, see Dynan 1989
). Because prokaryotic
core RNA polymerases require additional subunits, called
factors,
for DNA recognition (Bautz and Bautz 1970
; for review, see McClure
1985
), it was not surprising to find that animal RNA polymerases
require a cadre of accessory factors to recognize promoters and
accurately initiate transcription (Matsui et al. 1980
; Samuels et al.
1982
). Despite these early hints of diversification from prokaryotic
mechanisms, the enormous complexity of the eukaryotic transcription
machinery was unanticipated. Instead, it was hypothesized that
enhancer-bound activators would directly recruit RNA polymerase to
promoters in nucleosome-free regions of the DNA template to initiate
RNA synthesis (for review, see Ptashne 1988
; Mitchell and Tjian 1989
;
Ptashne and Gann 1997
). It is now clear that this simplistic view is no
longer tenable and falls far short of the elegant, and elaborate,
mechanisms that have evolved to regulate eukaryotic transcription.

View larger version (32K):
[in a new window]
Figure 1.
Fundamental elements of eukaryotic transcriptional
control. (A) Early studies led to the identification of
multiple eukaryotic RNA polymerases and the mapping of promoter and
enhancer DNA sequences. Core promoter elements (grey) that direct
transcription by RNA Pol II (yellow, green shading) include the TATA
box (TATA), the initiator (INR), and the downstream promoter element
(DPE). Promoter-proximal basal level enhancer elements (BLE, GC box,
CCAAT box, green) were found in many genes; however, gene-specific
signal-responsive distal enhancer elements were also identified and are
represented here as the hormone responsive element (HRE, purple) and
nuclear factor element (NFE, blue). (B) Sequence-specific
DNA-binding transcription factors that were isolated biochemically
include specificity protein-1 (Sp1, green), families of activator
proteins (i.e., AP-1, green), CCAAT enhancer-binding proteins (C/EBP),
steroid receptors (i.e., glucocorticoid or GR, purple) and
tissue-specific transcription factors (i.e., NF-
B, blue). Components
of the core initiation machinery required by RNA Pol II for promoter
recognition and basal transcription from naked DNA were also
fractionated biochemically, including TATA-binding protein, (TBP, red)
and general transcription factors (TFIIA, TFIIB, TFIID, TFIIE, TFIIF
and TFIIH, purple). (C) Because TBP is insufficient for
activated transcription, discovery of TAF subunits (orange) within
TFIID revealed a requirement for co-activators to mediate activator
responsiveness. Depicted here is one example of a specific and
functional interaction between Sp1 and Drosophila TAF110
(human TAF130). (D) Many co-activators and co-repressors were
subsequently found to be required for mediating signals between
sequence-specific transcription factors and the core machinery. For
further discussion pertaining to the specific co-regulators depicted,
see text and Table 1.
The molecular underpinnings of the eukaryotic transcriptional apparatus
have largely been dissected and reconstructed over the past two
decades. Combined with genetic approaches in yeast and
Drosophila, purification of transcription factors from
mammalian cells with sensitive biochemical assays during the first 10 years (1980-1990) revealed the existence of large families of
sequence-specific activators (Fig. 1B; i.e., Sp1, AP-1, C/EBP,
NF-
B, GR, etc.) as well as a host of accessory factors (Fig. 1B;
i.e., TFIIA, TFIIB, TFIID, TFIIE, TFIIF and TFIIH) necessary to program
a functional RNA Pol II complex (Dynan and Tjian 1983
; for review, see
McKnight and Tjian 1986
; Guarente 1987
; Levine and Hoey 1988
; Mitchell and Tjian 1989
; Conaway and Conaway 1993
). The availability of robust
in vitro transcription reactions uncovered multiple biochemical steps
and helped to establish a general order of assembly of these factors
for the formation of active pre-initiation complexes in vitro
(Buratowski et al. 1989
). The development of DNA affinity chromatography methods and various molecular cloning strategies for
isolating transcription factors enabled investigators to identify many
sequence-specific enhancer-binding proteins (Weinberger et al. 1985
;
Miesfeld et al. 1986
; Johnson et al. 1987
; Kadonaga et al. 1987
) and,
subsequently, to further refine the multi-subunit nature of the
transcription machinery (Fig. 1B).
These studies, in turn, enabled the biochemical characterization of the
first cloned eukaryotic general accessory factor, TFIID, which at the
time was thought to be equivalent to the single polypeptide identified
as the TATA-binding protein, TBP (Hahn et al. 1989
; Horikoshi et al.
1989
). However, it was soon learned that, although recombinant TBP
could replace a crude TFIID fraction to initiate basal transcription,
TBP could not support activated transcription in response to
enhancer-binding regulators (Pugh and Tjian 1990
). It was subsequently
determined that purified TFIID is, in fact, a multi-subunit complex
containing TBP and several tightly associated factors or TAFs (Dynlacht
et al. 1991
; Tanese et al. 1991
). Importantly, although critical for
mediating activator responsiveness, the TAF subunits of TFIID are
dispensable for basal transcription (Figs. 1B,C). These experiments led
to the co-activator hypothesis which posited that a new class of molecules, called co-activators or adapters, were necessary to relay
information imparted by the DNA-bound activators to the RNA Pol II
machinery (Pugh and Tjian 1990
). In the decade since (1990 to the
present), biochemical and genetic analysis from yeast to man has helped
to identify most of the components of the general initiation machinery
as well as a continually expanding array of co-activators and
co-repressors (Fig. 1D; for review, see Goodrich et al. 1996
;
Orphanides et al. 1996
; Kingston and Narlikar 1999
; Maldonado et al.
1999
; Tyler and Kadonaga 1999
; Brown et al. 2000
; Glass and Rosenfeld
2000
). These later studies have not only uncovered the amazingly
elaborate nature of the eukaryotic transcription machinery but have
also firmly established the importance of co-activators in mediating
transcriptional control.
| |
A requirement for co-regulators |
|---|
|
|
|---|
A defining characteristic that discriminates between different genes
in a genome is the unique bank of cis-regulatory sites recognized by sequence-specific DNA-binding factors. In principle, these regulators should be able to directly instruct the transcription apparatus to initiate RNA synthesis at a specific core promoter. This
view was strongly supported by findings in prokaryotes in which
sequence-specific regulators, such as the phage
cI protein, recruit
RNA polymerase to the promoter through direct contact with the
-subunit of the enzyme (Hochschild et al. 1983
; for review, see
McClure 1985
). Considering the mechanisms of transcriptional regulators
in prokaryotes, how do we rationalize the need for the diversity of
co-regulators that have evolved in eukaryotes, most dramatically in
metazoans? Given that animal cells likely utilize from 1000 to 10,000 different transcriptional regulators, it is not difficult to imagine
the need for specialized adapters employed by different classes of
activators and repressors to interface with a limited number of targets
within the general transcription apparatus. Although some of these
regulators may have similar types of activation or repression domains,
upon inspection of their primary sequences, context-dependent
structural effects imposed on these domains very likely contribute
higher orders of complexity. For example, allosteric controls on
interactions of regulators with their targets can be exerted through
subtle differences in DNA binding sites as well as through interactions between clustered trans-acting factors in specific
DNA-localized complexes, sometimes referred to as enhanceosomes
(Diamond et al. 1990
; for review, see Carey 1998
; Lefstin and Yamamoto
1998
). Considering the potential for a vast diversity of molecular
signals that must be interpreted by the transcriptional apparatus, it would appear to be untenable for each gene-selective regulator to have
a designated and unique target within the limited repertoire of the
general machinery. Instead, it is now evident that a given regulator
can partner and function with multiple types of co-activators or
co-repressors and vice versa (Fig. 1D; Onate et al. 1995
; Fondell et
al. 1996
; Kamei et al. 1996
; Mengus et al. 1997
; Nagy et al. 1997
).
Most genes are regulated by mixing and matching different types of
activators and repressors in a coordinated fashion. Consistent with the
notion of combinatorial specificity in regulator-co-regulator interactions, many transcriptional co-factors are constituents of
multi-subunit complexes, such as TFIID. Studies of the TFIID complex,
including an extensive biochemical analysis of TAF-activator and
TAF-basal factor interactions, revealed that different classes of
activators can target distinct TAFs to effect transcription (Goodrich
et al. 1993
; Chen et al. 1994
; for review, see Burley and Roeder 1996
).
While many sequence-specific DNA-binding regulators exhibit
cell-type-specific expression patterns (Lenardo and Baltimore 1989
; for
review, see Lai and Darnell 1991
), it has become increasingly evident
that some transcriptional co-regulators may also have tissue-restricted
expression. For example, some TFIID subunits apparently function in a
tissue-specific manner as exemplified by human TAF105, the first
identified cell-type-restricted TAF, which is highly expressed in B
cells (Dikstein et al. 1996b
). Moreover, at least two genes expressing
testes-specific TAFs, cannonball and no-hitter, have
been identified in Drosophila (M.T. Fuller, pers. comm.).
There will likely be other tissue-restricted TAF subunits resulting in
the assembly of different TFIID complexes in vivo, which would increase
permutations of regulator-co-regulator interactions and may play a
role in regulating cell-type-specific gene expression. Extending these
properties to other co-regulators, the requirement for transcriptional
co-factors to interface with diverse enhancer and promoter factors
could, therefore, be efficiently accommodated by a limited number of
multi-subunit co-regulator complexes, which themselves
might be alternately assembled. Together with differential
expression patterns, additional controls resulting from modification of
regulators or co-regulators via cellular signal transduction pathways
could allow the vast numbers of trans-acting factors to target
different subsets of requisite co-regulators at different genes or in
different cell types.
| |
Classifying a profusion of co-activators and co-repressors |
|---|
|
|
|---|
It is evident that studying the mechanisms by which co-activators
and co-repressors interface with gene regulators and the transcription
machinery has become essential to understanding transcriptional
regulation in eukaryotes. Broadly defined, transcriptional co-factors
could be divided into five classes (Table 1
and below). The first class includes those co-factors that are
intrinsic to or intimately associated with components of the core
machinery. For example, the TAFs of the TFIID complex would fit into
Class I along with other general co-regulators such as TFIIA, and,
possibly, PC4 and NC2 (Dynlacht et al. 1991
; Tanese et al. 1991
; Ge and Roeder 1994
; Yokomori et al. 1994
; Mermelstein et al. 1996
). A second
class of co-factors includes those that are principally associated with
activator or repressor molecules. Examples of Class II co-factors that
bind sequence-specific factors and modulate DNA occupancy or interface
with the core machinery include the cellular factors OCA-B
(OBF-1/Bob-1), Groucho, Notch, CtBP, and HCF along with viral
co-regulators E1A and VP16 (Wilson et al. 1993
; Strubin et al. 1995
). A
third class of co-factors is represented by a family of large
multi-subunit co-activators that strictly fits into neither the general
transcription factor-associated nor the activator-associated classes.
Class III co-factors include the yeast Mediator (Thompson et al. 1993
;
Kim et al. 1994
) and its recently identified albeit distantly related
metazoan counterparts: (1) CRSP and PC2 (Ryu et al. 1999
; Malik et al.
2000
); (2) ARC/DRIP/TRAP (Ito et al. 1999
; Näär et al.
1999
; Rachez et al. 1999
); and (3) NAT/SMCC/Srb-Mediator (Sun et al.
1998
; Boyer et al. 1999
; Gu et al. 1999
). The remaining two classes
include those co-factors that perform chromatin remodeling functions
involved in antagonizing or enhancing the repressive effects of
chromatin. Class IV co-factors include those that covalently modify
nucleosomes. Members of this class are represented here by the histone
acetylases CBP/p300, GCN5, P/CAF, and the SRC-1-related p160 family
(Ogryzko et al. 1996
; Yang et al. 1996
; Grant et al. 1997
; Spencer et
al. 1997
), as well as the histone deacetylases HDAC-1 and HDAC-2
(rpd3), and Sir2 (Taunton et al. 1996
; Kadosh and Struhl 1997
; Zhang et al. 1997
; Imai et al. 2000
), each in combination with their respective associated polypeptides. Of note, Sir2 is an NAD-dependent deacetylase that additionally employs ADP-ribosylation in some of its enzymatic functions (Imai et al. 2000
). Class V co-factors include those that
hydrolyze ATP in catalytic reactions that perturb or reorganize chromatin structure represented by the SWI/SNF-related and
ISWI-containing chromatin remodeling complexes (Côté et al.
1994
; Imbalzano et al. 1994
; Tsukiyama et al. 1995
).
|
It is noteworthy that, unlike components of the basal transcription
machinery, the co-regulators have diverged both in structure and
numbers when comparing metazoans with single-celled eukaryotes. Thus,
it appears that we have unveiled the molecular machinery that could
operate at a more species-specific or evolutionarily specialized level.
The yeast Mediator was originally defined as a protein complex tightly
associated with RNA Pol II via its carboxy-terminal repeat domain
(CTD). As noted above, humans appear to have adopted a set of at least
three structurally related Mediator-like complexes. While some of these
complexes were observed to associate with RNA Pol II in vitro, these
interactions apparently do not occur directly with the CTD (Sun et al.
1998
; Gu et al. 1999
; Näär et al. 1999
). In addition, like
the TAFs within TFIID, various subunits of some of these human
complexes have been observed to bind selectively to specific activators
(Boyer et al. 1999
; Gu et al. 1999
; Hittelman et al. 1999
; Rachez et
al. 1999
). TAF subunits themselves were identified as intrinsic
components of other multi-subunit co-regulators including the SAGA
complex in yeast and other diverse GCN5- and P/CAF-containing complexes
in metazoans (Grant et al. 1998
; Martinez et al. 1998
; Ogryzko et al.
1998
; Wieczorek et al. 1998
).
In addition to expanding the repertoire of commonly shared eukaryotic
co-factors, it could be expected that metazoans have evolved additional
levels of complexity for regulated transcription and may have acquired
a need for other co-activator functions. The metazoan-specific
acetyltransferases p300 and CBP have been observed to associate with
several co-regulators, including the metazoan-specific p160 family of
acetyltransferases, in seemingly distinct complexes (Kamei et al. 1996
;
Chen et al. 1997
; Nakajima et al. 1997
; Cho et al. 1998
; McKenna et al.
1998
). Are the observed co-factor complexes merely sub-complexes of a
larger master co-regulator or could structurally and functionally
distinct assemblages with multiple alternative subunits provide
greater diversity and combinatorial specificity? This question brings
us back to the larger issue of how the transcriptional machinery may be
assembled and targeted to specific promoters.
| |
Two models at opposite ends of the spectrum |
|---|
|
|
|---|
A stepwise assembly model
An ordered assembly of the transcription pre-initiation complex was
originally proposed on the basis of the formation of active transcription complexes in vitro (for review, see Buratowski 1994
). It
was observed that a stepwise addition of purified basal factors was
required for promoter binding and transcription initiation from naked
DNA templates (Fig. 2 and below). Steps
leading to Pol II transcription defined biochemically include: (1) a
metastable complex formed between TFIID, TFIIA, and TFIIB (DAB) capable
of recognizing and binding to the TATA promoter element; (2) a more stable closed complex containing DAB, hypophoshorylated RNA Pol II and
TFIIF; (3) an activated open complex formed by the further addition of
TFIIE and TFIIH, which stimulate an ATP-dependent isomerization and
promoter-melting event; and (4) promoter clearance and nascent RNA
synthesis upon hyperphosphorylation of the RNA Pol II CTD (Buratowski
et al. 1989
; Wang et al. 1992
; Tirode et al. 1999
; for review, see
Orphanides et al. 1996
). Various biochemical steps during elongation of
RNA synthesis have similarly been identified along with specific
requirements for the accessory factors and enzymatic activities
associated with transcription-coupled RNA processing such as
5'-capping, splicing, cleavage, and 3'-polyadenylation (for
review, see Colgan and Manley 1997
; McCracken et al. 1998
; Bentley
1999
; Tacke and Manley 1999
).
|
Direct or indirect interaction of activators with constituents of the
general machinery have long been observed to affect rates of complex
formation and transcription (Fig. 2B; Horikoshi et al. 1988
; Lin and
Green 1991
; Chi et al. 1995
). A stepwise model for assembly of the core
initiation machinery is consistent with the observed biochemically
defined steps and could satisfy a biological requirement for dynamic
regulation. However, we now understand that the RNA Pol II core
initiation machinery is more elaborate than previously anticipated and
contains up to 40 polypeptides comprising just the separable activities
that govern the distinct steps leading to transcription described
above. When combined with the profusion of additional co-factors
observed to interact with the core machinery and required to regulate
activated transcription (Inostroza et al. 1992
; Kim et al. 1994
;
Nakajima et al. 1997
), the assemblage that may represent an initiation
complex could be extraordinarily large (Fig. 2B,C). It would appear to
be inefficient for regulators to individually recruit and organize such
a complex assembly within the time scales necessary at each promoter in the cell. This notion is supported by the limited cellular
concentration of many of these factors relative to the number of genes
that must be transcribed and the modest binding affinities typically observed between these components and DNA targets, particularly in the
context of chromatin (Imbalzano et al. 1994
; Bushnell et al. 1996
; for
review, see Beato and Eisfeld 1997
). Furthermore, not all
sequence-specific binding factors interact directly with components of
the transcription apparatus and vice versa. The task of stepwise
recruitment of the initiation machinery by activators becomes
especially daunting considering the requirement for additional peripheral co-regulator activities necessary to navigate transcription through chromatin (Table 1 and Fig. 2A; for review, see Struhl 1998
;
Kingston and Narlikar 1999
). Are all of the reported interactions between regulators, co-regulators, and the core machinery necessarily important for the assembly of an initiation complex at a particular promoter? While many of the high-affinity interactions are certainly important, it is plausible that other interactions may occur only in
select contexts or, possibly, that some of these interactions may be
inconsequential and recruitment is indirect. Collectively, these
notions and other observations discussed below have led some
investigators to justly question the stepwise assembly model.
| |
A pre-assembly model |
|---|
|
|
|---|
Another popular but diametrically opposite model for transcription
initiation envisions the targeted recruitment of a completely pre-assembled RNA Pol II holoenzyme (Fig. 3
and below). This model was first proposed when certain preparations of
RNA Pol II were observed to co-purify with subsets of the basal
machinery along with some co-regulators, including chromatin remodeling
factors such as SWI/SNF and CBP, and even proteins involved in DNA
replication and repair (Koleske and Young 1994
; Chao et al. 1996
;
Maldonado et al. 1996
; for review, see Parvin and Young 1998
). Despite
considerable heterogeneity of these RNA Pol II preparations, one
invariant property has been the absence of TFIID in these holoenzyme
conglomerates. Consequently, minimally two targeted steps are required
to form an active pre-initiation complex with the holoenzyme model as recruitment of TFIID (or a functional equivalent) is a prerequisite for
transcription. One possible advantage to a holo-complex is the ability
to obviate the limited cellular concentration of individual transcription factors. A pre-assembled RNA polymerase complex could, in
principle, facilitate rapid responsiveness to arrayed regulators that
might cooperatively recruit the transcriptional machinery via targeting
of multiple interfaces. On the other hand, the recruitment of a
monolithic universal holoenzyme does not fit well with the observed
need for the vast diversity of co-regulators in animal cells. We
contend it would be more favorable to employ multiple regulators that
act at different stages of the transcription reaction. Such a
multi-faceted mechanism could impose controls at different barriers to
the transcription process and thereby provide greater flexibility and
fine tuning to rates of transcription. By contrast, a system that
relied on the recruitment of a holo-complex would seem to be less
effective in accommodating dynamic regulation of transcription in
response to small variations in the concentration of individual
cellular regulatory factors.
|
A holoenzyme model becomes more disadvantageous when considering the
wealth of biochemical evidence that suggests elongating RNA Pol II
complexes are distinctly different from those responsible for
initiation of transcription (Zawel et al. 1995
; for review, see Reines
et al. 1999
). If a pre-formed holo-complex were responsible for
initiation, then, presumably, the transcribing complex would have to
shed components that must then be recycled into new holoenzymes or,
otherwise, would be subject to degradation, de novo synthesis, and
re-assembly. Because eukaryotic polymerases are processive enzymes,
such a scenario would require the recruitment of additional holoenzymes
to each promoter for each re-initiation event. Such a mechanism appears
inconsistent with the significantly greater number of RNA Pol II
molecules relative to other components of the core initiation complex
in cells (Kimura et al. 1999
). Furthermore, evidence appears to be
mounting for an ordered progression of chromatin remodeling events that
temporally segregates the requisite co-regulators from the
transcription machinery. This latter aspect further erodes the notion
that a holoenzyme complex inherently containing everything needed to
both remodel chromatin and initiate transcription is pre-assembled.
Compelling evidence in support of the sequential requirement for
multiple co-regulators and their ordered interplay has come from
elegant studies of the HO endonuclease gene in yeast. Blocks to cell
cycle progression combined with chromatin immunoprecipitation (ChIP)
experiments in various genetic backgrounds revealed the epistasis of
events required for expression of this gene (Cosma et al. 1999
; Krebs
et al. 1999
). It was determined that occupation of upstream sequences
by a primary enhancer-binding factor preceded chromatin remodeling
events requiring SWI/SNF followed by SAGA, which together appeared to
promote localized histone acetylation and enabled secondary regulators
to access DNA targets. Perhaps most intriguing, chromatin remodeling
and histone acetylation at the promoter did not require the presence of
promoter proximal regulators; however, these remodeling events were
insufficient for gene expression which required further action by a set
of secondary regulators. Thus, chromatin remodeling preceded
recruitment of RNA Pol II. By inference, because transcription was
uncoupled from chromatin remodeling, SWI/SNF and other chromatin
remodeling activities are not likely to be intrinsic constituents of a
Pol II holoenzyme.
Other ATP-dependent chromatin remodeling- and histone
modifying-activities likely act in concert at targeted genes and their relative requirements may be regulator-dependent and dictated by
sequence positioning effects as well as overall chromatin organization. Consistent with this notion, proper steroid-responsiveness from the
well-characterized MMTV-LTR in mammalian cells appears to require both
SWI/SNF recruitment and genomic integration of this gene into chromatin
(Fryer and Archer 1998
). These results, together with others not
discussed here, suggest that specialized binding sites in chromatin are
accessible to primary activators but the involvement of secondary
activators is dependent on chromatin remodeling and modification by
co-regulators directly recruited by primary regulators. It is the
secondary activators that likely direct gene transcription through
multiplicative interactions with some requisite co-factors,
promoter-selectivity factors, and the core machinery. Not only have
chromatin remodeling and transcription events been uncoupled but, most
intriguingly, even recruitment of RNA polymerase to certain promoters
in vivo appears to be insufficient to trigger transcriptional
activation. It is well documented that the HSP70 promoter contains an
engaged but stalled RNA polymerase molecule that requires heat shock
factor for events subsequent to RNA Pol II recruitment to activate the gene (Rougvie and Lis 1988
; for review, see Lis and Wu 1993
). More
recent studies have proposed that one way in which glucocorticoids may
suppress immune function is through transcriptional interference by
direct association of GR with DNA-bound RelA, an NF-
B family member responsible for activation of pro-inflammatory genes. ChIP analysis determined that GR-dependent transcriptional interference occurred subsequent to RNA Pol II and TFIIH association with Rel A-occupied target genes at a step involving selective phosphorylation of RNA Pol II (Nissen and Yamamoto 2000
). Thus, although each may be
necessary, the simple binding of an activator and the subsequent recruitment of RNA Pol II to the promoter is not sufficient to adequately describe the transcription activation process in eukaryotes.
| |
If not stepwise nor holoenzyme, then what? |
|---|
|
|
|---|
Because the basal machinery and co-regulator activities are separable and can be reconstituted biochemically with distinct rate-limiting steps, it seems more likely that there are multiple stages employed by sequence-specific DNA-binding factors to exercise regulation in vivo that cannot be explained by the simple binary recruitment of an RNA Pol II holoenzyme. This does not require, nor do we suggest, that individual polypeptides of the entire pre-initiation complex be recruited to and assembled de novo at each promoter in the cell. Some components of the core machinery almost certainly pre-exist as tightly associated complexes such as TFIID, TFIIH, and core RNA Pol II itself. It seems reasonable to postulate that eukaryotes have evolved adaptable and interchangeable transcription complex modules along with attendant co-regulators incorporating subsets of multifunctional polypeptides. Selective engagement of such modules within this network at required stages in the transcription process could effectively coordinate the actions of multiple gene-specific factors in response to the often subtle regulatory signals that control the highly restricted expression of complex genomes.
Cytological studies suggest that some co-regulators and components of
the general machinery may be segregated from each other in the nucleus
(Reyes et al. 1997
). Considering the problem of limited factor
concentration in the nucleus and the important role of chromosomal
context for transcription of some genes, it is possible that gene
regulatory systems have evolved as organized compartments in the
nucleus containing increased local concentrations of select
co-regulators and subsets of the transcription machinery. To consider
how genes may be regulated in their native chromatin context and to
contemplate how these events could be coordinated, we now take into
account previous notions and recent evidence that the nucleus is highly
organized and possibly functionally subdivided in accordance with the
transcriptional requirements of a given cell type.
| |
Organization of the nucleus: Implications for compartmentalization of transcription |
|---|
|
|
|---|
It is well-documented that the nucleus contains a visible fibrous
network consisting of ribonucleoproteins, lamins, actin, and a myriad
of other uncharacterized proteins (for review, see Pederson 1998
). This
so-called nuclear matrix may then be analogous to the cytoskeleton
responsible for structural integrity and cell function. It was recently
established that individual chromosomes are epigenetically organized
into territories that condense and decondense in a cell cycle
coordinated fashion (Ferreira et al. 1997
; Croft et al. 1999
; Sadoni et
al. 1999
). It also appears that transcriptionally active genes are
generally oriented toward the center of the nucleus within individual
chromosome territories, while silenced genes are arranged near the
periphery of the nucleus (Andrulis et al. 1998
; Verschure et al. 1999
).
In addition, there is evidence to suggest that euchromatin is more
closely associated with structures of the nuclear pore complex that
extend into the nuclear space while heterochromatin is more confined to
sites distal to the pores (G. Blobel, pers. comm.). Although metaphase chromatin is highly condensed, interphase chromatin is variably de-condensed including loops of 30-nm fibers with matrix or scaffold attachment sites referred to as MARs or SARs. These
nuclease-accessible, often AT-rich sequences, can be separated by tens
or hundreds of kilobases and may encompass clusters of regional
cis-regulatory elements (Cockerill and Garrard 1986
; Gasser
and Laemmli 1986
; Forrester et al. 1994
). Some regulatory elements
suggested to be SARs/MARs have been proposed to act as insulators by
segregating transcriptionally active regions of chromatin from silent
heterochromatic regions (Girard et al. 1998
; Namciu et al. 1998
). These
or other similar regulatory elements and chromatin- or nuclear
matrix-associated factors may then serve to protect and promote an open
or accessible chromatin domain, which could be important in regulating
cell type-specific transcription (Kirillov et al. 1996
; Jenuwein et al.
1997
). Taken together, these studies are consistent with the notion
that transcriptionally competent genes might be localized to specific
areas of the nucleus.
One of the most prominent manifestations of a functional nuclear
compartment for transcription is the nucleolus, where rRNA synthesis
and ribosome biogenesis occurs. Clusters of tandemly arrayed rRNA
precursor genes from several chromosomal sites become associated with
the RNA Pol I machinery and rRNA processing apparatus to form the
nucleolus (for review, see Fakan and Hernandez-Verdun 1986
; Scheer and
Hock 1999
). Other types of cell cycle-modulated higher order nuclear
domains have also been observed including speckles, interchromatin
granule clusters, B-snurposomes, coiled or Cajal bodies, and PML bodies
or PODs (Gall et al. 1999
; for review, see Lamond and Earnshaw 1998
;
Matera 1999
). These potential nuclear compartments have been associated
with various transcription factors, co-regulators, RNA polymerases, and
RNA-processing factors. The functional significance of these bodies
remains uncertain as they could represent active enzymatic centers or,
alternatively, transcriptionally inert reservoirs for factors destined
for degradation or recycling. Some transcription and processing factors
appear to be associated with select foci and not others (Gall et al. 1999
; for review, see Matera 1999
). Unfortunately, however, the location(s) of some important but more recently identified regulatory factors has not been determined, and the exact molecular constituents of these sites remains poorly understood. Nevertheless, there is
evidence to suggest that transcription of some co-localized snRNA genes
occurs in a perinucleolar compartment while transcription of other
snRNA genes is segregated into other nuclear areas (Huang et al. 1998
;
Pombo et al. 1998
, 1999
). Furthermore, there are reports that active
RNA Pol II transcription occurs in a few thousand discrete foci in
isolated nuclei as visualized by immunofluorescence and
three-dimensional confocal microscopy (Zeng et al. 1997
; Wei et al.
1999
). It has been suggested that these foci represent individual
transcription units (Zeng et al. 1997
; Gall et al. 1999
). Considering
these observations, we speculate that there may be an overarching
network of active centers organized to help direct complex regulated
processes inherent to transcriptional activation.
If there is a systematic organization to the metazoan nucleus,
chromosomes must be arranged to accommodate differential cell type-specific and temporal patterns of gene expression.
Sequence-specific repressors and their associated co-regulators are
likely to play an important role in organizing gene expression
patterns. For example, the lymphoid-specific transcriptional regulators
Ikaros, Helios and Aiolos interact with HDAC complexes and associate
with some target genes in pericentric heterochromatin (Brown et al. 1997
; Hahm et al. 1998
; Kim et al. 1999
). Thus, these types of regulators and associated co-regulators together may actually recruit
some target genes to heterochromatin, perhaps as a way to maintain
tissue-restricted patterns of repression. Although recruitment of genes
to transcriptionally inactive space within chromosome territories seems
tangible, nuclear organization is also likely to directly influence and
perhaps participate in recruiting genes to active centers for
transcription. For example, the observed punctate sites for Pol II
transcription could represent compartments to which genes may be
actively recruited via directed mobilization of the nuclear matrix. To
consider this possibility, we return to the broader question of how a
gene is turned on in a cell.
| |
A speculative model: Integrating nuclear compartments and specialized factories for transcription |
|---|
|
|
|---|
A recent study examined activation of reporter genes stably
integrated into a homogeneously staining heterochromatic region in
response to an inducible heterologous activator in human cells. Remarkably, a 100-Mb region was observed to de-condense within minutes
upon induction of the transcriptional activator while a DNA-binding
domain alone failed to promote this large-scale chromosomal alteration
(Tumbar et al. 1999
). This extended de-condensation occurred in the
absence of active transcription, suggesting that chromatin remodeling
events may naturally take place prior to transcription. Such ordered
events are directly analogous to the chromatin remodeling events
observed with studies of the yeast HO gene described previously (Cosma
et al. 1999
; Krebs et al. 1999
). Specific HAT activities were observed
in a punctate pattern throughout the nucleoplasm and specifically
associated with this de-condensed region (Tumbar et al. 1999
). In other
studies, the ATP-dependent remodeling complex SWI/SNF was observed to
localize in a similar pattern of discrete punctate foci. Interestingly, SWI/SNF appears to selectively associate with the nuclear matrix and
nuclease accessible chromatin during the S and G2 phases of the cell
cycle when SWI/SNF is not mitotically inactivated by phosphorylation
(Reyes et al. 1997
; Sif et al. 1998
). These studies and others not
discussed here are consistent with the notion that genes might first
undergo an early chromatin rearrangement event through initial
interactions with transcriptional activators and some
chromatin-associated remodeling factors. It is also possible that some
genes may be epigentically organized to be competent for activation.
Recent studies with human
-globin transgenes in mouse cells have
found a requirement for an intact enhancer in the suppression of
silencing effects through exclusion of these transgenes from
heterochromatin and have further correlated histone acetylation of
these genes with this exclusion (Francastel et al. 1999
;
Schübeler et al. 2000
). Physical separation from heterochromatin was independent of the
-globin LCR, yet localized histone
H3-acetylation of both the promoter and the LCR appeared to be
important for gene expression (Schübeler et al. 2000
).
Accordingly, after initial remodeling events, enhancer-binding factors
could recruit other appropriate chromatin remodeling factors, some of
which may be intimately associated with the nuclear matrix (Fig.
4A). Following DNA recognition by
gene-specific regulators and subsequent chromatin remodeling events,
the proper transcription machinery must be assembled at the promoter.
Thus, the targeting of specific genes to select sites in the nuclear
matrix may, in turn, allow the formation of (or recruitment to) specialized
transcriptionally active compartments within chromosome territories.
|
Are the different bodies and particles observed by microscopy and
immunofluorescence structurally and functionally equivalent or could
there be distinct functions associated with different nuclear
addresses? It seems reasonable to postulate that gene-specific DNA
sequences will, in part, dictate the targeted recruitment of
transcriptional promoters to select functional compartments. Gene
promoters inherently contain information to distinguish functional specificity. This notion was originally established by the
characterization of
factors in prokaryotes and the separation of
three enzyme systems for transcription in eukaryotes. The finer points
of control in the mechanisms governing core promoter recognition in
eukaryotes, however, are only now coming into view. To consider how the
targeting of promoters to select areas in the nucleus might be
important for integrating the temporal assembly of the transcription
machinery, we first describe some key elements of eukaryotic promoter
recognition and the discovery of diversity and specialization of the
core machinery.
The first hint of core promoter selectivity in eukaryotes came when
TAFs in the TFIID complex were found to be involved in start site
selection and recognition of initiator and downstream promoter elements
for transcription by RNA Pol II (Pugh and Tjian 1991
; Hansen and Tjian
1995
; Burke and Kadonaga 1996
). This observation suggested that, at
some genes, TBP may not play the central role in DNA recognition by
TFIID but instead that the TAF subunits may govern promoter
recognition. Direct evidence for promoter selectivity by TAFs was
obtained with the demonstration that specific TAFs (250, 150, and 60)
recognize initiator elements and downstream promoter elements found in
many metazoan genes transcribed by RNA Pol II (Verrijzer et al. 1994
,
1995
; Burke and Kadonaga 1997
; J.T. Kadonaga, pers. comm.).
Furthermore, other distinct TAF subunits that are intrinsic components
of promoter-selectivity complexes utilized by RNA Pol I and Pol III
(i.e., SL1 and SNAPc) were also found to be important for
promoter recognition (Beckmann et al. 1995
; Henry et al. 1998
). A
growing body of evidence both in vitro and in vivo from yeast to man
support the conclusion that TAFs contribute to activator-mediated
transcription and participate in directing core promoter recognition
(Holstege et al. 1998
; Michel et al. 1998
; Komarnitsky et al. 1999
; for
review, see Goodrich et al. 1996
; Green 2000
). Because these
interactions are integral to the assembly of transcription initiation
complexes, promoter-selectivity factors and sequence-specific
DNA-binding regulators could collectively combine to designate potentially
important elements in establishing nuclear compartments for transcription.
Some gene-specific regulators are sequestered in the cytoplasm or are
otherwise inactivated and require signal transduction to instigate
transcription (Baeuerle and Baltimore 1988
; Picard et al. 1988
).
However, other sequence-specific activators that cooperate with the
signal-responsive regulators are constitutively nuclear (such as Sp1).
A subset of these types of regulators may themselves designate the
functional specificity of a given nuclear compartment by their
chromatin-limited association with DNA target sequences and their
interactions with modules of promoter selectivity factors (such as IID)
and co-regulators (such as CRSP). Therefore, by analogy to the
nucleolar organizer, perhaps genes that require Sp1, TAF130-containing
TFIID, and CRSP are co-localized to a nuclear compartment populated
with a defined subset of the transcription machinery (Fig. 4B, yellow
compartment). Other promoter-selectivity factors might be similarly
partitioned into distinct loci containing a different collection of
co-factors and transcription complexes. For example, such specialized
compartments for RNA Pol II transcription might contain TBP-TAF
complexes with tissue-restricted TAFs, such as the B-cell TAF105, to
which certain genes might be selectively recruited.
In addition to specialized TAFs, there are also the TBP-related
factors, TRF1 and TRF2 (Crowley et al. 1993
; Hansen et al. 1997
;
Rabenstein et al. 1999
), that could be pre-localized and thereby
provide additional compartment-specific activities capable of promoter
selectivity in metazoans. Reports of transcription initiation in the
absence of TBP in vitro suggest the long held notion that
TBP-containing complexes are the sole and universal promoter-recognition factors must be reconsidered (Usheva and Shenk
1994
; Wieczorek et al. 1998
; Takada et al. 2000
). Notably, a complex
containing the tissue-restricted factor TRF1 may play an important role
in directing Pol III-dependent transcription in Drosophila
(Takada et al. 2000
). Other recent results indicate that TRF1
recognizes a TC-rich DNA sequence in place of the TATA element
recognized by TBP (Holmes and Tjian 2000
). Interestingly, this study
found that one TRF1 target gene utilizes two tandem promoters
transcribed by RNA Pol II
one directed by TRF1 and another that
responds to TBP. Although considerably less well characterized, it
seems clear that TRF2 fails to recognize TATA sequences (Dantonel et
al. 1999
; Rabenstein et al. 1999
). By analogy to TBP and TRF1, it could
be expected that TRF2 and associated polypeptides recognize other as
yet undetermined DNA sequences to direct transcription at specific
classes of promoters. We hypothesize that these promoter-selectivity factors might be associated with distinct co-regulators and be segregated into different functional regions within a chromosome territory (Fig. 4B, green compartment).
Each cell may then require multiple types of transcriptionally active
nuclear domains designated by the cooperative interactions of
particular regulators and co-regulators. It is also possible that the
cell might somehow detect active centers of transcription within a
specific area of the nucleus and thus direct or modulate assembly of
the appropriate transcription components within such compartments. The
cell might even direct newly synthesized subsets of co-regulators and
specialized components of the transcriptional machinery to the targeted
sites as they are needed. Target genes competent for transcription
could themselves be oriented in such a way as to be in close proximity
to nuclear pores. In fact, multiple distinct nuclear importin or
karyopherin molecules that function as chaperones appear to selectively
associate with different transcription factors and thus could be
instrumental in ferrying them to the proper sites (Pemberton et al.
1999
; Titov and Blobel 1999
).
Transcription by modular signal-mediated transfer
In addition to recognition of specific promoter/enhancer DNA
sequences, there could be other important elements of transcriptional selectivity and assembly of initiation complexes. For many genes, the
acetylation state of chromatin has long been correlated with their
expression and these modifications have generally been proposed to
loosen chromatin structure. However, post-translational modification (acetylation, methylation, phosphorylation, etc.) of chromatin may not
only enable transcription factors to access their DNA targets, but also
may act as specific molecular signals for DNA recognition (for review,
see Strahl and Allis 2000
). Therefore, specific covalent
modification of chromatin subunits (i.e., histones) might serve as a
docking site for select transcription complexes. There is some evidence
for recognition of a particular acetylation state by co-regulators such
as the SIR complex, in part responsible for the formation and
maintenance of heterochromatin in yeast (Hecht et al. 1995
; Braunstein
et al. 1996
; Rundlett et al. 1996
). Additonally, phosphorylation of
histone H3 tails increase its recognition as a substrate for the GCN5
acetylase while both modifications are correlated with increased
transcription in yeast and mammalian cells (Cheung et al. 2000
; Lo et
al. 2000
). Recent studies suggest there may be a role for specific
histone acetylation in core promoter recognition by TFIID. X-ray
structural analysis combined with quantitative binding assays revealed
that the double bromodomain of TAF250 specifically recognized a
di-acetylated histone H4 tail peptide but not a control acetylated
peptide (Jacobson et al. 2000
). Importantly, these interactions
exhibited one to one stoichiometry and were undetectable with an
unacetylated histone H4 tail peptide. Interestingly, recent
low-resolution structures of the holo-TFIID complex revealed a cavity
large enough to accommodate the binding of a nucleosomal substrate
(Andel et al. 1999
; Brand et al. 1999
). Thus, in contrast to previous
assumptions, TFIID may not require binding to naked DNA but may instead
interact specifically and intimately with appropriately modified
chromatin templates (Fig. 4B). As a further extension of these
findings, one might envisage that such activated promoters are
recruited to transcription compartments through cooperative
protein-DNA interactions that, in the case of TFIID, may involve
recognition of a specific histone acetylation pattern (Fig. 4C). Once
localized to such compartments, transcription of an activated promoter
could be efficiently instigated by cooperative signals between
sequence-specific transcription factors, co-regulators, and the core
machinery (Fig. 4D).
There is scant evidence to support nuclear compartmentalization of RNA
Pol II transcription as proposed herein. However, there are a number of
recent observations that are more consistent with the notion of
territorial subdivision than the mass action competition for factors
inherent to stepwise assembly and holoenzyme models. One study
surprisingly revealed that ligand-dependent promoter remodeling,
co-activator association, and target gene transcription induced by
nuclear receptors are remarkably transient (minutes) despite continuous
receptor association with the target DNA (hours) (Chen et al. 1999
). A
mechanism for this de-activation was suggested to be p300
co-activator-dependent acetylation of another co-factor in the p160
family, thus preventing its association with nuclear receptors. A
similar observation was made for the attenuation of transcription
through the CBP co-activator-mediated acetylation of an architectural
transcription factor, HMGI(Y), that appears to be required for
stabilization of an enhanceosome (Munshi et al. 1998
). If, as these
studies suggest, post-translational modification and inactivation of
transcription factors are important for down-modulation of gene
expression, a nuclear compartmentalization model is most attractive and
is supported by further arguments presented below.
Biological systems are dynamic and transcription factors likely only
transiently associate with their cognate DNA recognition sites and
co-factor targets (McNally et al. 2000
). If a cell were to inactivate
the entire cellular pool of a given co-activator or activator in
response to one signal, such a mechanism would preclude responsiveness
by other activators or cooperativity at other genes in response to
additional signals. However, if transcription complexes are assembled
within segregated nuclear compartments, then inactivation, transient or
otherwise, of the few resident molecules of a particular transcription
factor within the designated compartment could take place without
affecting the same factors in other compartments associated with
different genes. Furthermore, it could be advantageous for a cell to
initiate transcription in one compartment and then, upon de-activation
signals, release the promoter or even translocate the RNA
polymerase-associated template to adjacent nuclear compartments to
facilitate elongation and RNA-processing events.
Transcription and RNA processing appear to be coupled and specific
phosphorylation signals are thought to be requisite for association of
additional factors with the RNA Pol II CTD to carry out post-initiation
events (Misteli et al. 1998
; Ho and Shuman 1999
; for review, see
McCracken et al. 1998
). Quantitative cytological studies have found
that nuclear speckle domains with high concentrations of splicing
factors are regionally stationary, but movement at their periphery
appears to be highly dynamic and dependent on active transcription
(Eils et al. 2000
). Although still poorly understood, there appears to
be gene-selective association with these speckle domains (Smith et al.
1999
; Wei et al. 1999
). Because transcription from some genes is
transient, promoter de-activation or other signals might initiate
template exchange to other nuclear areas for efficient transcript
extension and processing (Fig. 4E).
In considering localized transcription factories, we envisage the DNA template being pulled through matrix-associated RNA Pol II complexes that are themsel