|
|
|
-globin locus
1 Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK; 2 Erasmus University Rotterdam, Medical Genetics Center Department of Cell Biology and Genetics, 3000 DR Rotterdam, The Netherlands
| |
Abstract |
|---|
|
|
|---|
We have identified novel nuclear transcripts in the human
-globin locus using nuclear run-on analysis in erythroid cell lines and in situ hybridization analysis of erythroid tissue. These transcripts extend across the LCR and intergenic regions but are undetectable in nonerythroid cells. Surprisingly, transient
transfection of a
-globin gene (
,
, or
) induces
transcription of the LCR and intergenic regions from the chromosomal
-globin locus in nonerythroid cell lines. The
-globin genes
themselves, however, remain transcriptionally silent. Induction is
dependent on transcription of the globin gene in the transfected
plasmid but does not require protein expression. Using in situ
hybridization analysis, we show that the plasmid colocalizes with the
endogenous
-globin locus providing insight into the mechanism of
transinduction.
[Key Words:
Human
-globin locus; transcription; transinduction; locus control region]
| |
Introduction |
|---|
|
|
|---|
The extent of transcription by RNA polymerase II (PolII)
throughout an entire gene locus in higher eukaryotes has rarely been studied. As mature transcripts are generally mapped
by cDNA cloning, the extent of most primary transcription units is
unknown. Nuclear run-on (NRO) analysis, which measures polymerase
density, has generally been applied only to single genes and their
immediate flanking regions (Hagenbuchle et al. 1984
; Ashfield et al.
1991
; Enriquez-Harris et al. 1991
). Therefore, transcription of
complete intergenic regions in gene loci has not been investigated.
In contrast, transcription of rRNA genes by Pol I has been widely
investigated. In higher eukaryotes, the rRNA genes are arranged in
tandem arrays. Transcription of the rRNA genes terminates at a specific
region downstream of the mature end of the 28S rRNA. Additional
promoters, however, are located downstream of this termination site in
the spacer between the rDNA gene repeats. Transcription initiates at
spacer promoters, reads through the spacer DNA, and terminates upstream
of the next Pol I promoter. The role of these unstable spacer
transcripts is unknown (for review, see Reeder 1992
).
Expression of a number of genes is regulated by a locus control region
(LCR), an element that exerts a dominant transcriptional activation
function over a chromatin domain (Wolffe 1995
). Some LCRs are directly
transcribed by Pol II because of their location in gene introns, and
these include HS-40, the LCR-like element in the
-globin cluster
(Vyas et al. 1992
), the human growth hormone LCR (Bennani-Baiti et al.
1995
), and the human adenosine deaminase LCR (Aronow et al. 1989
).
The human
-globin locus extends over 70 kb of DNA and contains an
LCR and five erythroid-specific genes,
-G
-A
-
-
, arranged in the
order of their developmental expression (for review, see Orkin 1995
).
The
-globin LCR contains five erythroid-specific, developmentally
stable hypersensitive sites, HS1-HS5, situated upstream of the
-globin gene (Grosveld et al. 1987
; Zafarana et al. 1996).
Transcription has been detected both in a microlocus fragment
containing HS1-HS4 linked together and in HS2 as part of a chimeric
construct, raising the possibility that the
-globin LCR is
transcribed (Collis et al. 1990
; Tuan et al. 1992
). In addition, the
existence of large transcripts from the avian and murine
-globin
clusters, up to seven times longer than the globin mRNAs, was described
over two decades ago (Imaizumi et al. 1973
; Bastos and Aviv 1977
).
Although the human
-globin cluster represents one of the best
characterized gene loci in eukaryotes, the extent of transcription
throughout the locus has not been directly investigated. The sites of
transcription termination of these globin genes have not been mapped.
In this study we describe the analysis of nascent transcription across
the human
-globin gene cluster. Both NRO analysis in erythroid
cell lines and primary transcript in situ hybridization of transgenic
mouse fetal liver cells identify transcription of the LCR and the
intergenic regions in the
-globin locus. Analysis of transcription
of the
-globin genes by transfection into nonerythroid cell lines
reveals the surprising observation that intergenic transcription is
induced from the chromosomal
-globin locus in these cells. Insight
into the mechanism of transinduction comes from in situ hybridization
analysis of tranfected HeLa cells, which reveals that an inducing
plasmid colocalizes with the chromosomal
-globin locus.
| |
Results |
|---|
|
|
|---|
NRO analysis of the
-globin gene
To analyze transcription of the
-globin gene, single-stranded
DNA (ssDNA) M13 probes with inserts of ~250bp were made and the
position of these probes relative to the
gene is shown in Figure
1A. E1 is situated before the promoter; E2 and E3 are
within the gene. Probe E3 corresponds to the region of the
-globin
poly(A) site, and probes E3-E14 are contiguous in the 3
-flanking
region.
|
Endogenous transcription of the
gene was investigated in the
human erythroleukemic cell line K562 (Lozzio and Lozzio 1975
), and the
NRO data obtained are shown in Figure 1B. There is no signal before the
transcription start site (probe E1), but signals are detected within
the gene and in the 3
flanking region. Pretreatment of the nuclei
with a low concentration of
-amanitin abolishes all signals except
for the control 5S probe that detects Pol III transcription of 5S rRNA
genes. Thus, the
probe signals are caused by Pol II
transcription. The signals in the 3
-flanking region have been
corrected for the background level of hybridization and number of U
residues in the transcripts; these values are shown graphically in
Figure 1B. There is a drop to the background level from probes E3 to
E5, which may be attributable to transcription termination downstream
of the poly(A) site. However, this is followed by an uneven pattern of
transcription over probes E6-E9 and substantial transcription over
E10-E14.
Two possibilities exist to explain this NRO profile. The first is that
transcription terminates between E3 and E5 and there is another
transcription unit downstream from E10-E14. The pattern of
transcription from E6 to E9 could reflect pause sites involved in
transcription termination or initiation events that are nonprocessive. Alternatively, this transcription pattern could relate solely to
-globin transcription, but different speeds of Pol II through the
flanking region could influence the polymerase density and, therefore,
the NRO signal.
As manipulation of the
sequence could discriminate between these
possibilities, the
-globin gene was transiently transfected into
the nonerythroid epithelial HeLa cell line that does not transcribe
-globin endogenously. The NRO result from mock-transfected cells
confirms that there is no cross hybridization of endogenous HeLa cell
transcripts to the
NRO probes, whereas the histone signal is a
positive control for cellular transcription (Fig. 1C). The
gene
was transfected on a plasmid containing the SV40 enhancer that is
required for transcription from the globin promoter in HeLa cells
(Treisman et al. 1983
). The overall transcription pattern of
in
HeLa cells (Fig. 1C) is similar to that in K562 cells, with the
greatest difference over probe E7. Also, a signal is seen here over
probe E1 upstream of the
promoter, and this may be the result of
the transcription observed over E14 reading round the plasmid into the
promoter region.
Thus, in both K562 cells and transiently transfected HeLa cells, there
is a decrease in polymerase density after the
poly(A) site,
followed by a higher polymerase density downstream in the 3
-flanking region. As similar
transcription patterns are
observed in HeLa and K562 cells, transient transfection of manipulated
sequences can be performed to allow dissection of the NRO
profile.
NRO analysis across the LCR
The observation that there is no signal over probe E1, upstream of
the
promoter, in K562 cells is interesting because there are
reports of multiple upstream transcription start sites for
in
this cell line (Allan et al. 1983
). Also, it is possible that the LCR
is transcribed as microlocus and HS2 transcription have been detected
in constructs integrated into erythroid cells (Collis et al. 1990
; Tuan
et al. 1992
). We have therefore investigated transcription upstream of
the
gene using NRO probes with ~500-bp inserts from the LCR,
as shown in Figure 2A. Probes L1, L3, L5, and L7
contain the core regions of HS4, HS3, HS2, and HS1, respectively (Philipsen et al. 1990
; Caterina et al. 1991
; Lowrey et al. 1992
). Probes L2, L4, and L6 lie between the hypersensitive sites, and probes
L8-L12 lie between HS1 and
.
|
The NRO data obtained from K562 cells (Fig. 2B) reveal that every probe
has a signal above the M13 background. The signal for probe L4 is high
relative to the signals for the other LCR probes as the DNA in probe L4
contains an Alu repeat that hybridizes Pol III transcripts
from Alu repeats transcribed elsewhere in the genome. The
graph of the corrected signals for the remaining probes shows that
transcription is relatively even across the LCR and there is a decrease
in polymerase density to a background level upstream of the
promoter. Pretreatment with a low concentration of
-amanitin
abolishes the signals, indicating that the LCR is transcribed by Pol
II. Transcription was also analyzed on the other DNA strand away from
the
-globin gene with sense NRO probes (denoted by the suffix S)
(Fig. 2B). No signals were detected with these probes, indicating that
LCR transcription is predominantly in the same direction as
-globin transcription. In addition, no signals were obtained from
NRO analysis of LCR transcription in HeLa cells (Fig. 7A, below; data
not shown), suggesting that transcription of the LCR is erythroid
specific.
|
To investigate the reduction in signal over the
promoter region
in more detail, the region between L12 and E1 was divided into shorter
NRO probes, as indicated in Figure 2A. NRO analysis with these probes
(Fig. 2C) reveals that the four contiguous probes 5
of E1 have a
signal that decreases to the background level over probe E1. Probe E1
can hybridize RNA as shown by the NRO data from transient transfection
of the
gene into HeLa cells (Fig. 1C). Thus, it appears that
there is a decrease in polymerase density in a region within 400 bp of
the
transcription start site. Furthermore, the relative levels of
LCR and
-globin transcription can be deduced from this NRO data.
Correction of the
and LCR signals suggests that transcription of
is ~10-fold higher than that of the LCR.
NRO analysis of the
-globin genes
The
and LCR NRO analysis results reveal that there is
extensive transcription both 5
and 3
of the
-globin
gene. To investigate whether the DNA surrounding the
genes is
also transcribed, the NRO probes shown in Figure 3A
were made. Transcription of the
genes was analyzed in the human
erythroleukemic cell line HEL (Martin and Papayannopoulou 1982
).
However, this analysis is complicated by the extensive
gene
sequence homology caused by gene duplication of this region of the
-globin cluster (Shen et al. 1981
). Probes AG15-AG17 and AG1-AG3
hybridize transcripts from both
genes. Probes A4-A9 and G4-G9
contain equivalent fragments of A
and G
DNA, respectively, and weakly cross-hybridize transcripts from the
other
gene (data not shown). Transcription can be studied most
clearly in the areas covered by probes G10-G12 and A18-A22, which are
specific to the flanking regions of G
and
A
, respectively. Finally, probes G13 and G14 are
duplicated upstream of G
.
|
Analysis of A
transcription in HEL calls reveals that
transcription decreases in the A
-flanking region and
then increases again over probes A19-A22, which are specific for
A
transcription (Fig. 3B). All signals are sensitive to
a low concentration of
-amanitin (data not shown). As the signal
over AG3 has an approximately equal contribution from G
transcripts, transcription of the region equivalent to NRO probes A19-A22 appears higher than the level of A
genic
transcription. There is also substantial transcription before the
promoters as detected by the probe AG16.
Transcription in the G
-flanking region, corresponding to
the area of the A
-flanking region that is highly
transcribed, was also analyzed in HEL cells (Fig. 3C). Probes G10-G14
have signals in the same range as those for AG1-AG3. Thus, as observed
for A
, there is substantial transcription in the
G
3
-flanking region. Sensitivity to
-amanitin
shows that all of these signals are attributable to Pol II
transcription (data not shown).
Thus, the pattern of transcription of the
genes is similar to
that of the
gene in that there is a drop in signal after the
poly(A) site, followed by higher levels of transcription downstream. As
the pattern of
transcription is reproduced following transient transfection of the
-globin gene into HeLa cells, this same
approach was taken with the
genes to set up a system amenable to
sequence manipulation. This also has the advantage that transcription
of these two highly homologous
genes can be studied independently of each other.
NRO analysis of A
transiently transfected into HeLa
cells on a plasmid containing the SV40 enhancer (ASV
) reveals a
pattern very similar to that observed with HEL cells (Fig. 3D). In HeLa cells, the signal for probe AG1 is sixfold higher than that of probe
AG3, although the reason for this is unknown. Also, there is an
approximately twofold drop in signal from AG3-A4 in HEL cells, but not
in HeLa cells, and this is because AG3 hybridizes both G
and A
transcripts in HEL cells. No signals above
background were obtained for the
probes on transfection of the
test plasmid without either the SV40 enhancer (pUCA) or the
A
gene (pUCSV
). This indicates that there is no
cross hybridization of HeLa endogenous transcripts to the NRO probes
used to detect A
transcription. In these experiments,
the VA probe, which detects transcription of a cotransfected plasmid
containing the adenoviral VAI gene, serves as a positive control for
the transfection procedure.
Analysis of G
transcription in HeLa cells (Fig. 3E)
shows a similar pattern of transcription over probes G10-G14 to that
seen in HEL cells. Here, probes G4-G9 are used to investigate the
complete transcription pattern in the G
-flanking region.
NRO analysis of HeLa cells transfected with pUCSV
, the test plasmid
without the G
gene, fails to detect any signals above
background (data not shown), consistent with the NRO probes being
specific for the
genes.
In summary, transcription of the LCR,
-, and
-globin genes
has been investigated in erythroid cell lines by NRO analysis. The LCR
is transcribed, and this transcription appears to terminate within 400 bp of the
-globin start site. In addition, there are high levels
of transcription in the
, G
, and A
3
-flanking regions. Furthermore, the transcription patterns of
these genes has been reproduced following their transient transfection into HeLa cells.
Detection of LCR and intergenic transcription in erythroid tissue
As the identification of LCR and intergenic transcripts described
relied on NRO analysis of erythroid cell lines, we used a different
technique and tested whether these transcripts are present in erythroid
tissue. To this end, transcription was investigated in 13.5 day fetal
liver cells from a transgenic mouse heterozygous for a single copy of
the entire human
-globin locus (Strouboulis et al. 1992
). At this
stage in development, ~20% of loci transcribe the human
genes
and the remaining 80% transcribe the
gene (Wijgerde et al.
1996
).
Double-label primary transcript in situ hybridization was performed
with gene-specific intron probes (Wijgerde et al. 1995
) and a probe
specific to either the human LCR or G
-globin
3
-flanking region (equivalent to NRO probes G10-G14). In
addition, the
-
-intergenic region and
3
-flanking
region (equivalent to NRO probes B11 and B12; see below) were
investigated for transcriptional activity (Fig.
4a,d-f). Erythroid cells contain single
transcription foci for the human globin gene as expected for
heterozygous transgenics with a single human
-globin locus. The
intergenic probe signals colocalize with the genic signals, showing
that the intergenic probes specifically detect transcription in the
human
-globin locus. We have also detected transcription of the
mouse LCR in conjunction with an intronic mouse
-major probe (Fig.
4b). Here, two separate loci are detected in erythroid cells with a
pair of colocalizing signals corresponding to transcription of the
murine
-major globin gene and the LCR in each chromosome. The fate
of the transcripts can also be analyzed by use of in situ
hybridization. Figure 4c shows the result obtained in erythroid cells
with a 16-kb mouse LCR probe and both exonic and intronic mouse
-major probes. Here, in addition to the colocalizing signals in
the nucleus, mouse
-major mRNA is detected in the cytoplasm, which
appears as a ring around the nucleus, whereas the mouse LCR transcript
is absent. Thus, the mouse LCR transcript is nuclear and the absence of
the other intergenic transcripts in the cytoplasm suggests that they
are nuclear also.
|
Only a proportion of erythroid cells, 20%-30%, transcribe the LCR
and intergenic regions. Quantitation of cells with genic and/or intergenic foci in fetal liver preparations probed
with
-globin intronic probes and an intergenic probe shows that
most cells (73%) have only genic transcripts, similar to the left hand cell shown in Figure 4f, which is transcribing only the
gene with
no intergenic transcript present. A minority of cells (3%), such as
that in Figure 4f (right), have a
3
-flanking region signal
only and no
gene transcription signal. As mentioned previously, 20% of the human globin loci in transgenic erythroid fetal liver cells
at this stage of development are transcribing the
genes only
(Wijgerde et al. 1996
). These results, therefore, confirm that the LCR
and intergenic regions are transcribed in erythroid tissue in both the
human and mouse
-globin loci. Furthermore, as the genic and
intergenic transcripts can be detected independently, the intergenic
transcripts appear to be distinct from the globin gene primary
transcripts.
Transinduction of intergenic transcription from the HeLa cell chromosome
As transient transfection of the
- or
-globin genes into
HeLa cells generates similar transcription patterns to those observed in erythroid cells, deletion analysis was used to further investigate the intergenic transcripts. In the course of these experiments, one
type of deletion was found to give a surprising result.
, G
, and A
constructs with the
3
-flanking region downstream of the poly(A) site deleted were
transiently transfected into HeLa cells, and the NRO data are shown in
Figures 5A-C. Surprisingly, transcription of the
parts of the flanking regions found to be highly transcribed in
erythroid cell lines is still detected, even though the flanking region
DNA is absent from the transfected constructs.
|
As NRO signals are not detected from these flanking regions in
mock-transfected HeLa cells or cells transfected with derivatives of
the test plasmids (Figs. 1C and 3D), it follows that the flanking region signals observed (Fig. 5) cannot be caused by cross
hybridization of unrelated transcripts. This has been confirmed for the
G
-flanking region by Southern blot analysis of HeLa and
HEL DNA with a probe corresponding to the region of DNA from G10-G14
(Fig. 5E). As a result of the
gene duplication, this region has
weak homology to the DNA upstream of G
, and this is seen
as a weaker band in the Southern (2.6-kb BamHI fragment).
However, all of the bands are predicted from the
sequences
indicating that the NRO probes contain sequences unique to the
-globin locus.
Thus, the signals in the
- and
-flanking regions must be the
result of transcription of the
-globin locus in the HeLa cell chromosome. Furthermore, transient transfection of a plasmid containing a globin gene is necessary to induce transcription from the HeLa cell
chromosome. We refer to this phenomenon as transinduction.
The level of intergenic transcription detected when a construct with the flanking region deleted is transfected is generally lower than that seen with a full-length construct, although the levels of transcription from the flanking regions are variable (data not shown). This is consistent with flanking region transcription from both the plasmid and chromosomal sequences following transfection of the full length constructs, but only from the chromosome when the deleted constructs are transfected.
NRO analysis of transcription of the
-globin gene transiently
transfected into HeLa cells results in a similar transcription trend to
and
transcription. There is a decrease in signal after the
poly(A) site followed by higher signals in the downstream 3
-flanking region (data not shown) where a separate transcription unit was detected by in situ hybridization (Fig. 4f). Therefore, the
gene with its 3
-flanking region deleted was transfected into
HeLa cells to analyze whether transcription is also induced downstream
of the
gene from the chromosomal
-globin locus. The result
in Figure 5D shows that there is transinduction of the
-gene
3
-flanking region. Again, these signals are caused by Pol II
transcription as they are sensitive to a low concentration of
-amanitin (data not shown). In addition, no antisense
transcription in a direction toward the A
gene is
detected with sense probes (Fig. 5D; data not shown).
Transinduction does not require protein expression
To investigate the requirements for transinduction, the protein
product of the transfected globin gene was mutated. Specifically, a
frameshift mutation was made in the third exon of the
G
-globin gene by filling in a restriction site. This
mutant clone is still able to induce transcription, as shown in Figure
6A. In addition, a different frameshift mutation in
the second exon and mutation of the AUG does not affect induction of
chromosomal intergenic transcripts (data not shown), indicating that
the effect is not mediated at the protein level.
|
-Globin gene transcription is not induced from the
HeLa chromosome
Transcription in the G
-flanking region from the
HeLa chromosome is observed directly downstream of the poly(A) site
(Fig. 5B), in contrast to the other flanking regions in which there is
~2 kb of DNA between the poly(A) site and sites of downstream transcription. This raises the possibility that transcription of the
chromosomal G
gene is induced in HeLa cells. To test
this, the G
gene bearing the frameshift mutation was
transfected into HeLa cells. This generates the same pattern of
flanking region transcription from the chromosome but allows
transcription from the transfected and chromosomal copies of the
G
gene to be distinguished at the steady-state level.
RNase protection analysis was used to differentiate between
transcription of the transfected, mutant G
gene and the
chromosomal G
gene with a riboprobe that incorporates
the frameshift mutation (Fig. 6B). As a control, wild-type
G
was transfected to show that there is 100% cleavage
at the mismatch. On transfection of the mutant G
gene,
however, only a protected fragment corresponding to the transfected
gene is detected, showing that transcription of G
is not
induced at detectable levels in HeLa cells.
Thus,
intergenic, but not genic, transcripts are induced from the
chromosomal
-globin locus, indicating that the
intergenic transcripts are distinct from the
gene primary transcripts. Induction of transcription ~2 kb into the 3
-flanking regions of
the
, A
, and
genes (Fig. 5), but not
immediately downstream of the poly(A) site, is also consistent with the
intergenic transcripts being separate transcription units. This
conclusion is supported by the erythroid tissue in situ data (Fig. 4)
in which the genic and intergenic transcripts are detected
independently of each other in a proportion of the cells. Furthermore,
this suggests that the drop in polymerase density after the poly(A)
site of each gene reflects termination of globin gene transcription.
The LCR and intergenic transcripts are all induced in HeLa cells
following transfection of a
-globin gene
The results described above show that transient transfection of a
globin gene into HeLa cells induces transcription from the corresponding flanking region in the chromosomal
-globin locus. To
investigate whether there is transinduction of all the intergenic transcripts by each globin gene,
-globin was transiently
transfected into HeLa cells under the control of the inducible human
immunodeficiency virus (HIV) promoter, which requires the viral
transactivator protein Tat (for review, see Cullen 1993
). Transfection
of HIV-
into HeLa cells in the presence of Tat induces
transcription from the
-globin locus (Fig. 7A),
showing that there is no absolute requirement for a globin promoter.
The four LCR hypersensitive sites are transcribed (L1, L3, L5, L7), as
are the flanking regions of the
(E10), G
(G10),
A
(A19), and
(B11) genes. No transcription of the
(AG1, AG3) or
(B3) genes is detected, however, consistent
with a lack of induction of transcription of the globin genes
themselves. Strong signals are observed over probes PH and E3, as the
DNA in these probes is present in the HIV-
construct.
Transcription of the cotransfected VAI gene and an endogenous histone
gene is also detected.
The plasmid HIV-
was also transfected without the Tat-producing
plasmid (Fig. 7A). Short transcripts are made from the HIV promoter in
the absence of Tat (Cullen 1993
), and such weak transcription is
detected by probe PH. The increased initiation at the HIV promoter in
the presence of Tat (as observed by a higher PH signal relative to the
VA control) has been described previously (Laspia et al. 1989
).
Intergenic, but not genic, transcripts are induced extremely weakly in
the absence of Tat (data not shown). It appears that the level of
transinduction correlates with the strength of the promoter in the
plasmid.
Induction of intergenic, but not genic, transcripts by transient
transfection of HIV-
with a Tat-producing plasmid is also observed
in the nonerythroid cell lines 293 and CAK8 (Fig. 7A). Although the
transfection efficiencies of the different cell lines varies as
measured by plasmid transcription (E3) relative to cellular histone
transcription, the level of induced transcripts relative to plasmid
transcription is the same. Induced transcription is fivefold lower than
plasmid transcription.
To investigate the extent of LCR transcription upstream of HS4, ssDNA
probes were made from a 3.3-kb fragment that contained HS5.
Transcription of HS5 was analyzed following transient transfection of
HIV-
(Fig. 7B). HS5 is transcribed, suggesting that at least one
site of transcription initiation lies further 5
.
Transfection of the A
gene shows that this gene can also
induce the LCR (data not shown) and intergenic transcripts (E11-E14, B11, B12, A19) (Fig. 7C). Transcription over probes G
10 and
G
11, which lie upstream of G
(see Fig. 3A for probe
diagram) is detected, whereas G
12, which is immediately
downstream, lacks a signal. This transcription pattern over G
10 to
G
12 is also observed in HEL cells (data not shown) and may reflect
another transcription termination event. Transcription of
A
does not induce transcription of the
(E3, E4) or
(B3) genes. In general, transcription of AG1 is 5- to 10-fold
higher than that of the
poly(A) site, as discussed previously
(Fig. 3D). Quantitation of the level of induction relative to
transcription of the A
poly(A) site (in the same way as
induction was measured relative to the
poly(A) site in Fig. 7A)
reveals that induced transcription is fourfold lower (data not shown).
This is in agreement with the fivefold lower induction observed upon
transfection of HIV-
and Tat (Fig. 7A).
Similarly, transfection of the
gene induces intergenic but not
- or
-genic transcripts (data not shown). Thus, all the intergenic transcripts are induced on transient transfection of any one
of the
-globin genes. It is possible that transcription is
nonspecifically induced from many flanking regions on transfection of a
-globin gene. This is not the case for the
-globin
3
-flanking region, however, in which transcription is not induced
(data not shown).
Analysis of the specificity of transinduction
To analyze how strict the requirement is for a human
-globin
gene, the mouse
-major gene was tested for its ability to induce transcripts from the human
-globin locus in HeLa cells. The mouse
-major DNA inserted in the plasmid has 60% homology to the human
-globin gene. Transfection of the mouse
-major gene into HeLa cells under the control of the HIV promoter induces intergenic, but not
genic, transcripts (Fig. 8A). However, the level of
transinduction relative to plasmid transcription (PH) is fivefold lower
than that observed with a plasmid containing the HIV promoter and human
or
genes (Fig. 7A and data not shown).
|
Next, the
-globin gene, which resides in the separate
-globin
locus and is unrelated to the sequence of the
-globin genes, was
tested for its ability to transinduce transcripts from HeLa cell
-globin locus. A plasmid containing the HIV promoter fused to the
-globin gene was transiently transfected into HeLa cells and
plasmid transcription was detected by use of probes PH and BP (Fig.
8B). Transcription is not induced from the
-globin locus, however,
showing that
-globin gene sequences on the plasmid are required in
addition to an active promoter.
Analysis of transcription in transfected HeLa cells by in situ hybridization
Plasmid and chromosomal transcription in HeLa cells transiently
transfected with the plasmid ASV
were analyzed by double-label primary transcript in situ hybridization. Plasmid transcription was
detected with a
gene-specific intronic probe and two
transcription foci are present in transfected cells (Fig.
9a,b). The
gene probe specifically detects
plasmid transcription, as no
transcription is detectable from the
chromosomal
-globin locus by use of the sensitive RNase protection
assay (Fig. 6B). Transcription is detected from the
-globin locus
on each chromosome with LCR (Fig. 9a) or
3
(Fig. 9b) probes
and these transcription foci colocalize with those observed for the
transfected plasmid. Three or four colocalizing plasmid and chromosomal
transcription foci were observed in a minority of cells (see below). No
foci were detected in mock transfected cells or cells treated with
RNase (data not shown). This finding suggests that there is an
association between the plasmid and
-globin locus, and this was
further investigated by visualizing distribution of the plasmid at the
DNA level.
|
Plasmid and
-globin locus DNA in HeLa cells transfected with
ASV
were detected by in situ hybridization with a probe specific to
the plasmid backbone in conjunction with a human
-globin locus probe. The plasmid is not distributed throughout the nucleus, but is
instead localized to a discrete number of sites, some of which
colocalize with the endogenous locus (Fig. 9c). Such colocalization of
the plasmid and
-globin locus was observed in 95% of transfected cells. Cells with three chromosome signals were also detected (Fig. 9c)
consistent with the aneuploid nature of HeLa cells. As a control, a
probe specific to part of human chromosome 22 (M69; Mulder et al. 1995
)
was also used and the signals obtained from this probe do not
colocalize with plasmid DNA (Fig. 9d). Together, these results suggest
that there is a specific interaction between the plasmid and
-globin locus on chromosome 11 providing insight into the
mechanism of transinduction.
| |
Discussion |
|---|
|
|
|---|
These studies show that there is extensive Pol II transcription
throughout the
-globin locus in erythroid cells. Furthermore, we
have shown that it is possible to induce intergenic transcription from
the
-globin locus in nonerythroid cell lines by transient transfection of a plasmid containing an actively transcribed
,
, or
gene. Genic transcripts are not induced. The reason for this is unknown but may be due, at least in part, to the absence of
erythroid transcription factors.
Transcription in gene clusters
The pattern of transcription in the
-globin cluster appears
analogous to that of rDNA by Pol I in that gene transcription terminates and intergenic transcription initiates downstream. In the
-globin locus, the LCR is transcribed and this transcription appears to terminate within 400 bp of the
-globin transcription start site. The overall pattern of transcription of the
and
genes is similar in that there is a decrease in polymerase density after the poly(A) site, followed by a higher density of polymerases downstream in the intergenic regions. Transcription has been identified downstream of the
gene and upstream of G
, raising
the possibility that the transcription that initiates in the
3
-flanking region continues through to the G
gene.
Similarly, transcription extends through the entire
G
-A
intergenic region. Transcripts have
also been identified in the
and
3
-flanking region. It
is entirely possible that intergenic transcription is a general feature
of Pol II gene clusters, although to date this has not been
investigated.
Transcription of the LCR and intergenic regions
As all of the probes tested upstream of the
gene have a NRO
signal and these signals are all strand specific, it is possible that a
single transcription unit exists over the LCR. The LCR may be located
within a gene, as is the case for HS-40 (Vyas et al. 1992
), the growth
hormone LCR (Bennani-Baiti et al. 1995
), and the adenosine deaminase
LCR (Aronow et al. 1989
). In the same way, the intergenic transcripts
could also be genic. RNAs from the LCR and intergenic regions, however,
were detected in the nucleus, but not the cytoplasm, by in situ
hybridization and, consistent with this, a search of the EST database
derived from cytoplasmic RNAs failed to match human
-globin LCR or
intergenic sequences. Thus, it appears that the LCR and intergenic
transcripts are nuclear specific.
Alternatively, there may be multiple promoters in the LCR and
intergenic regions. Support for this comes from the observations that
the LCR microlocus (a fragment containing the four LCR HSs linked
together) and HS2 are transcribed (Collis et al. 1990
; Tuan et al.
1992
). The reason why only a proportion of erythroid cells transcribe
the intergenic regions as shown in Figure 4 is unknown, although this
could reflect a stage dependence of transcription.
Is LCR transcription relevant to its function?
As HS2 and the microlocus, which can substitute for the entire
LCR, are transcribed (Collis et al. 1990
; Tuan et al. 1992
), it is
possible that LCR transcription is important for its function. Indeed,
transcription has been correlated to LCR activity for the keratin 18 gene. Inactivation of transcription from an Alu repeat in the keratin
18 LCR results in loss of copy-number-dependent expression in the
kidney (Thorey et al. 1993
).
One way in which transcription may relate to LCR activity, is if the
function of transcription is to establish and maintain an open
chromatin conformation in the locus that is permissive for gene
transcription. Transcription of the intergenic regions could have a
similar role. It is also possible, however, that the widespread
transcription of the
-globin locus is merely a consequence of its
open chromatin conformation. In general, it is still not clear whether
transcription opens chromatin or is a consequence of an open chromatin
conformation (McKnight 1996
). The apparent strand specificity of
transcription and specificity of the sites at which transcription
initiates in the intergenic regions argues against transcription merely
being a consequence of open chromatin.
It is also possible that the role of the LCR and intergenic transcripts
is to deliver proteins and/or Pol II to the globin gene
promoters, and such a tracking mechanism has been proposed previously
(Tuan et al. 1992
).
Transinduction of the
-globin locus in nonerythroid
cells
The second feature of this study is that transcription is induced
from the chromosomal
-globin locus in nonerythroid cells following
transient transfection of a plasmid containing an
,
, or
-globin gene with an active promoter. The LCR and intergenic regions are transcribed from the locus, whereas the globin genes themselves are not. We have shown that there is specificity to transinduction as transcription is not induced by the
-globin gene
nor are
-globin-flanking region transcripts induced by the
-globin gene. However, it is difficult to directly compare the level of intergenic transcription in erythroid cells with that transinduced in nonerythroid cells. The in situ data revealed that
~50% of HeLa cells were transfected with plasmid and that ~25%
of erythroid cells transcribe the intergenic transcripts. Assuming that
the level of histone transcription is constant in HeLa and K562 cells,
then the relative levels of intergenic transcription in these cells can
be ascertained. Quantitation relative to histone (by use of the NRO
data in Figs. 1A and 7A) and correction for the number of transcribing
cells reveals that intergenic transcription is approximately fourfold
higher in HeLa cells than in K562 cells.
How are transcripts induced from the HeLa cell chromosome?
Analysis of the requirements for transinduction reveals that
protein expression is not required. Furthermore, a
gene lacking poly(A) signals and, hence, producing no detectable steady state
mRNA still induces intergenic transcription (H.L. Ashe and N.J. Proudfoot, unpubl.), implying that the effect is mediated at the DNA
level.
It is possible that transient transfection of a
-globin gene
allows titration of a repressor acting on the chromosomal
-globin locus in nonerythroid cells. The requirement for transcription of the
transfected plasmid, however, seems inconsistent with this hypothesis.
Plasmid DNA would titrate out such a DNA-binding repressor protein from
the endogenous locus whether or not the plasmid is transcribed.
An alternative mechanism of transinduction involves an association
between the globin gene in the plasmid with the corresponding globin
gene in the chromosome by sequence homology and our data provide direct
evidence for this. A plasmid containing the A
gene
colocalizes with the human
-globin locus at the DNA level and
transcripts induced from the globin locus colocalize with plasmid
transcripts. Association of the plasmid with the chromosome could
induce transcription in two ways. It is possible that a transcribed
plasmid, for example, one with the A
gene and SV40
enhancer, is located in a transcriptionally active area of the nucleus
and relocates the endogenous locus into this active region. The locus
would, therefore, be accessible to Pol II allowing LCR and intergenic
transcription. Plasmids that are introduced into HeLa cells, but are
not transcribed, such as those containing the
genes without the
SV40 enhancer, would be localized in a region of the nucleus inactive
for transcription and unable to transinduce. Alternatively, a
transcribed plasmid could bring Pol II and/or
transcription factors to the chromosomal locus with which it is
associated, thereby activating LCR and intergenic transcription.
Further experiments are required to discriminate between these two
possibilities.
Both possibilities predict that there will be a correlation between the
strength of the promoter in the plasmid and the level of
transinduction. This is supported by the greatly reduced level of
induction obtained by transfection of HIV-
in the absence of Tat.
The association model can also explain the lower level of
transinduction observed when the mouse
-major gene is present in
the plasmid. The mouse
-major and human
-globin genes are highly homologous (79%) in the 500 bp containing the first two exons
and first intron. Similarly, there is 69% homology between the third
exons, although the big, second intron is not conserved. We propose
that this weaker homology weakens the association between the plasmid
and chromosome resulting in fivefold lower transinduction.
Transcriptional effects in other systems
In this paper we propose that pairing of a plasmid with a
homologous region of the chromosome can activate chromosomal
transcription. Chromosome pairing in Drosophila has been shown
to influence gene expression in both positive and negative ways (for
review, see Henikoff 1997
). For example, the brown gene is
trans-inactivated by insertion of heterochromatin into the
brown gene on the other chromosome (Dreesen et al. 1991
). The two brown
alleles are paired and the heterochromatin insertion in one causes both
alleles to associate with the heterochromatic centromere, suppressing
brown expression (Csink and Henikoff 1996
; Dernburg et al. 1996
).
Transient pairing of mammalian chromosomes has also been described
(LaSalle and Lalande 1996
), although it remains to be determined
whether such pairing between chromosomal loci is used to regulate
transcription.
Other reports exist whereby introduction of DNA or RNA into cells
influences endogenous gene transcription. In transgenic plants, host
genes can be silenced by the introduction of another copy of the gene,
and this silencing can be mediated at the transcriptional level (Meyer
and Saedler 1996
). Inhibition of gene expression by introduction of
sense RNA into Caenorhabditis elegans has also been described
(Guo and Kemphues 1995
). Finally, cellular genes, including the human
-globin gene, are activated in embryonic fibroblasts following
retroviral infection (Groudine and Weintraub 1975
, 1980
). These
transcription effects highlight our lack of understanding of the
control of transcription in the nucleus.
If our model for transinduction of transcription from HeLa cells is correct, then this phenomenon could be a general, but hitherto undetected, feature of transient transfection studies. The challenge now is to induce genic transcripts, and ways in which this can be achieved are currently bei