|
|
|
Vol. 14, No. 10, pp. 1249-1260, May 15, 2000
1 Skirball Institute of Biomolecular Medicine and Department of Microbiology, New York University School of Medicine, New York, New York 10016 USA; 2 Department of Microbiology and Molecular Genetics, Harvard Medical School, Boston, Massachusetts 02115 USA
| |
ABSTRACT |
|---|
|
|
|---|
RNase E is a key regulatory enzyme that controls the principal pathway for mRNA degradation in Escherichia coli. The cellular concentration of this endonuclease is governed by a feedback mechanism in which RNase E tightly regulates its own synthesis. Autoregulation is mediated in cis by the 361-nucleotide 5' untranslated region (UTR) of rne (RNase E) mRNA. Here we report the determination of the secondary structure of the rne 5' UTR by phylogenetic comparison and chemical alkylation, together with dissection studies to identify the 5' UTR element that mediates autoregulation. Our findings reveal that the structure and function of the rne 5' UTRs are evolutionarily well conserved despite extensive sequence divergence. Within the rne 5' UTRs are multiple RNA secondary structure elements, two of which function in cis to mediate feedback regulation of rne gene expression. The more potent of these two elements is a stem-loop structure containing an internal loop whose sequence is the most highly conserved of any region of the rne 5' UTR. Our data show that this stem-loop functions as a sensor of cellular RNase E activity that directs autoregulation by modulating the degradation rate of rne mRNA in response to changes in RNase E activity.
[Key Words: RNase E; mRNA degradation; autoregulation; RNA secondary structure; phylogenetic comparison; E. coli]
| |
Introduction |
|---|
|
|
|---|
Messenger RNA degradation is an important mechanism for controlling
gene expression in all organisms. The lifetimes of
mRNAs can differ significantly within a single cell, and these
longevity differences have a direct effect on message concentrations.
In Escherichia coli, for example, distinct mRNAs may differ in
stability by as much as two orders of magnitude, with half-lives
ranging from a fraction of a minute to as long as an hour (Belasco
1993
). In addition, the longevity of individual transcripts can vary significantly in response to environmental cues.
The degradation of most mRNAs in E. coli is thought to begin
with cleavage at internal sites by RNase E (Apirion 1978
; Ono and
Kuwano 1979
; Mudd et al. 1990
; Babitzke and Kushner 1991
; Melefors and
von Gabain 1991
; Taraseviciene et al. 1991
). This endonuclease shows a
preference for cleaving AU-rich sequences within single-stranded
regions of RNA (Lin-Chao et al. 1994
; McDowall et al. 1994
), and most
mRNAs (both short- and long-lived) appear to contain multiple sites
where RNase E cleavage can occur. RNase E exists in E. coli as
a component of the RNA degradosome, a multienzyme RNA-degradation
complex that also contains a 3' exoribonuclease (polynucleotide
phosphorylase), an RNA helicase (RhlB), a glycolytic enzyme (enolase),
and possibly other components (Carpousis et al. 1994
; Miczak et al.
1996
; Py et al. 1996
).
RNase E is essential for cell viability, and either its underproduction
or its overproduction can impair cell growth (Apirion 1978
;
Claverie-Martin et al. 1991
; C. Jain and J. Belasco, unpubl.). To
maintain RNase E near its optimal cellular concentration, E. coli cells have evolved an autoregulatory mechanism for controlling the synthesis of this important ribonuclease (Mudd and Higgins 1993
;
Jain and Belasco 1995
). This mechanism involves changes in the
longevity of the RNase E (rne) gene transcript, whose
half-life (normally ~1 min) varies in response to the level of RNase
E activity (Jain and Belasco 1995
). Thus, a 21-fold increase in
rne gene dosage results in only a 2.8-fold increase in the
cellular concentration of RNase E due to accelerated degradation of
rne mRNA. Conversely, insufficient RNase E activity can
increase the lifetime of the rne transcript by as much as an
order of magnitude. Compared with other E. coli genes,
rne expression appears to be unusually sensitive to the level
of RNase E activity within the cell (Jain and Belasco 1995
).
Feedback regulation of rne gene expression is mediated in
cis by the rne 5' untranslated region (UTR),
which can confer this property onto heterologous transcripts to which
it is fused (Jain and Belasco 1995
). For example, whereas production of
-galactosidase from the E. coli lacZ gene is relatively
insensitive to cellular RNase E activity, its production from an
rne-lacZ gene fusion bearing the 361-nucleotide rne
5' UTR and the first 28 codons of the rne protein-coding
region is strongly influenced by the level of RNase E activity in
E. coli. This pronounced effect of RNase E on
rne-lacZ expression requires an intact rne 5'
UTR. As expected, RNase E exerts a corresponding influence on the
longevity of the rne-lacZ transcript, whose half-life
parallels that of rne mRNA in cells containing various levels
of RNase E activity (Jain and Belasco 1995
).
To begin to elucidate the mechanism by which RNase E regulates its own synthesis, we set out to identify the unique features of the rne 5' UTR that mediate this effect. Here we report an investigation of the secondary structure of the rne 5' UTR by phylogenetic analysis and chemical alkylation. In addition, we describe the mapping of regulatory elements within the rne 5' UTR by genetic dissection. These data reveal that an evolutionarily conserved RNA stem-loop within this UTR is the element principally responsible for directing feedback regulation of rne gene expression and that it acts by varying the rate of rne mRNA degradation in response to changes in cellular RNase E activity.
| |
Results |
|---|
|
|
|---|
Phylogenetic analysis of the secondary structure of the rne 5' UTR
To investigate the secondary structure of the E. coli rne
5' UTR, we elected to use phylogenetic analysis, a powerful and reliable method based on sequence comparison that has proven to be the
most useful procedure for determining the higher-order structure of
large RNAs (Noller and Woese 1981
; Schnare et al. 1996
). The principle
underlying this analytical method is that RNA base pairing with an
important biological function is maintained during evolution despite
sequence divergence. The hallmark of an important region of secondary
structure is the occurrence of covariation, in which compensatory
changes in each strand of an RNA duplex preserve the potential for base
pairing among homologous RNAs in diverse organisms. In contrast, if
related RNAs in different species do not share the potential to form a
particular base-paired structure, then this structural element either
does not form in vivo or is functionally unimportant.
We first examined rne transcriptional units whose sequence had
already been determined. Complete genomic sequences are available for
the
-purple bacteria E. coli and Haemophilus
influenzae. The amino-terminal half (residues 1-506) of the
Haemophilus RNase E protein shares a high degree of sequence
homology with its E. coli counterpart (83% amino acid
identity). However, the 5' UTRs of the E. coli and
Haemophilus rne genes have diverged too far to allow
meaningful sequence alignment. It was therefore necessary to determine
the sequence of the rne 5' UTR from several bacterial species more closely related to E. coli, yet not so closely
related as to preclude an informative degree of sequence covariation. To isolate these DNAs, we devised a strategy based on PCR, relying on
the knowledge that the protein-coding regions of the rne gene and the upstream yceC gene in E. coli and H. influenzae have diverged much more slowly during evolution than the
intergenic sequence containing the rne 5' UTR. This made
it possible to design PCR primers complementary to conserved sequences
within the rne and yceC coding regions of E. coli and Haemophilus, with the expectation that these
primers would allow the amplification of the diverged rne-yceC intergenic region from related bacterial species.
By use of this PCR strategy, the 5' noncoding region of the rne gene was amplified from genomic DNA extracted from E. coli, Serratia marcescens, Yersinia pseudotuberculosis, Erwinia carotovora, and Providencia alcalifaciens. The high degree of sequence conservation in the rne promoter and protein-coding region allowed the 5' and 3' boundaries of each rne 5' UTR to be defined by comparison with the known boundaries of the E. coli rne 5' UTR (Fig. 1). In addition, primer extension mapping with reverse transcriptase confirmed the location of the 5' terminus of the Providencia rne transcript (data not shown), whose sequence is the most evolutionarily diverged among these mRNAs. Compared with that of E. coli, considerable sequence divergence was evident for the rne 5' UTR of Serratia (29%), Yersinia (32%), Erwinia (33%), and Providencia (52%). (Entirely unrelated sequences would be expected to diverge by ~75%.)
|
Potential base-paired conformations of these 5' UTRs were examined to identify a secondary structure common to all five of them. As shown in Figure 2, all of these RNA segments can fold in a remarkably similar fashion despite their substantial sequence divergence. This phylogenetic analysis indicates that the rne 5' UTR contains three imperfect stem-loop structures (hp1, hp2, and hp3). One of the stem-loops (hp3) has a complex multipartite structure that comprises four stems emanating from a single internal loop. Each of the stem-loops is followed by a single-stranded spacer (ss1, ss2, and ss3), the last of which (ss3) contains the signals for translation initiation (the Shine-Dalgarno element and initiation codon). In addition, the exceptionally long 5' UTR of the Providencia transcript (425 nucleotides) has the potential to form a fourth base-paired structure between hp1 and hp2. Sequence alignment has identified multiple covarying base pairs in each of the three conserved stem-loop structures, including all four subdomains of hp3, evidence that strongly supports the formation of these stem-loops. With one limited exception (see below), other possible secondary structures suggested by computer-aided free-energy calculations are not phylogenetically conserved and are therefore not likely to form in vivo.
|
To test this structural model, we re-examined the 299-nucleotide 5' UTR of the rne transcript of H. influenzae. The sequence of this RNA segment has diverged so far from that of E. coli that a meaningful sequence alignment had been previously impossible. Nonetheless, it too has the potential to adopt an overall secondary structure that is strikingly similar to that of the other rne transcripts (Fig. 2). The predictive power of the secondary structure deduced by phylogenetic comparison lends strong support to this structural model.
Chemical probing of the secondary structure of the rne 5' UTR
To corroborate the structural model determined by phylogenetic
comparison, each nucleotide of the E. coli rne 5' UTR was
probed by chemical modification with alkylating agents whose reactivity is sensitive to base pairing. Dimethylsulfate (DMS) methylates unpaired
adenosine residues at N1 and unpaired cytidine residues at N3 (Moazed
et al. 1986
). Because DMS can penetrate cell membranes and enter the
cytoplasm, it can be used to probe the secondary structure of mRNAs in
their natural milieu. In contrast,
1-cyclohexyl-3-(2-morpholinoethyl)carbodiimide metho-p-toluenesulfonate (CMCT), which cannot enter cells,
alkylates unpaired uridine residues at N3 and, to a lesser extent,
unpaired guanosine residues at N1 (Moazed et al. 1986
). Nucleotides
engaged in Watson-Crick base pairing are protected from alkylation by these reagents due to the involvement of the specified nitrogen atoms
in hydrogen bonds. Sites of alkylation can be readily mapped, as these
base modifications block primer extension with reverse transcriptase by
preventing nucleotide incorporation opposite the modified base.
Chemical modification was performed on a hybrid transcript
(ez1) comprising the 5' UTR and first 181 codons of
rne fused in-frame to lacZ. The greater abundance of
the plasmid-encoded ez1 transcript made it a useful surrogate
for structural analysis. Owing to the presence of the E. coli
rne 5' UTR, this hybrid transcript is indistinguishable from
rne mRNA in its sensitivity to feedback regulation by RNase E
(Jain and Belasco 1995
). To boost the ez1 mRNA concentration
further, the experiments were performed using RNA from E. coli
host cells (CH1828) that produce a mutant form of RNase E defective in
feedback regulation. E. coli CH1828 cells containing the
ez1 gene on a multicopy plasmid (pEZ101) were grown to mid-log
phase at 37°C, and total cellular RNA was harvested either without
pretreatment or after treating the cells with DMS. RNA from the
untreated cells was alkylated in vitro with DMS or CMCT.
Primer-extension mapping of the sites of alkylation within the long
rne 5' UTR (361 nucleotides) required the use of five different end-labeled primers. As a control, an additional primer extension reaction was performed in each case with an unalkylated RNA
sample to identify natural sites of pausing or premature termination by
reverse transcriptase.
A representative primer extension gel is shown in Figure 3A. The similarity of the methylation patterns observed for RNA samples treated with DMS in vivo and in vitro (apart from expected differences in band intensities) indicates that the rne 5' UTR retains its native conformation upon extraction from cells, thereby validating the CMCT data, which could be obtained only by in vitro alkylation of extracted RNA. Natural sites of pausing by reverse transcriptase were particularly prevalent at guanosine residues, which also were inefficiently alkylated by CMCT, thereby preventing an independent assessment of the base-pairing status of these residues.
|
Superposition of the alkylation data on the phylogenetically determined
structure of the E. coli rne 5' UTR (Fig. 3B) indicates a
high degree of correlation between nucleotides that are alkylated and
those thought from phylogenetic evidence to be unpaired. Moreover, little alkylation was observed for nucleotides in double-stranded regions. As expected, a number of unpaired residues appeared resistant to alkylation. This common phenomenon probably reflects chemical inaccessibility resulting from higher order structure (e.g.,
non-Watson-Crick base pairing) or difficulty in scoring sites of
natural pausing by reverse transcriptase. Although the most reactive
bases mapped almost exclusively to single-stranded regions, some
nucleotides thought to be base paired were moderately reactive,
particularly A-U and G-U pairs adjacent to single-stranded regions.
Such reactivity is often observed for nucleotide pairs located at the
ends of double helices (Chen et al. 1991
), where breathing (transient separation) of terminal base pairs may occur. Overall, the alkylation data are consistent with the secondary structure determined
phylogenetically for the 5' UTR of the E. coli rne transcript.
Substitution and deletion analysis of the rne 5' UTR
Previous studies have indicated that the rne 5' UTR
acts in cis to mediate RNase E autoregulation (Jain and
Belasco 1995
). To determine whether not only the structure but also the
regulatory function of the rne 5' UTR is evolutionarily
conserved, we examined the effect of cellular RNase E activity on the
expression of an rne-lacZ fusion in which the E. coli
rne 5' UTR had been replaced with that of Y. pseudotuberculosis. First, as substitution endpoints, two unique
restriction sites were introduced into the E. coli rne-lacZ
reporter ez1. One was an NheI site created by a 4-bp substitution between the promoter and transcription start site of the
ez1 gene. The other was a BsrGI site created between
the 5' boundary of the ez1 ss3 segment and the
ribosome-binding site by inserting a single cytidylate residue between
nucleotides 341 and 342 of the 5' UTR. As expected, these point
mutations did not affect feedback regulation of the resulting
ez1000 chimera. This was assessed by measuring
-galactosidase synthesis following plasmid transformation into an
RNaseE-deficient lacZ strain (CH1828, which carries a
chromosomal rne missense mutation, ams-1, which reduces cellular RNase E activity) and an isogenic RNase
E-overproducing strain (CH1827+pRNE101, which contains a multicopy
plasmid clone of the wild-type rne gene). The observed
repression ratio (R, the ratio of
-galactosidase activity
in the CH1828 host vs. the CH1827+pRNE101 host) is a direct measure of
the degree to which expression of a particular rne-lacZ
fusion can be inhibited by cellular RNase E activity. This ratio was
approximately the same for ez1000 (R = 34 ± 5)
and the original ez1 transcript (R = 36 ± 6).
We then replaced the 0.34-kb NheI-BsrGI segment of
ez1000, which encodes all of the rne 5' UTR
upstream of the ribosome-binding site, with the corresponding segment
of the Y. pseudotuberculosis rne 5' UTR, which is similar
in secondary structure but markedly different in sequence (Fig. 2).
Despite resulting in 104 nucleotide substitutions, 6 deletions, and 7 insertions, this 5' UTR replacement had almost no effect on
feedback regulation in E. coli (R = 32 ± 3 for
Yez1000). In contrast, feedback regulation was virtually abolished by removing the entire 5' UTR segment upstream of the ez1000 ribosome-binding site
(ez
1-337); this 337-nucleotide
NheI-BsrGI deletion reduced the repression ratio to
just 2.0 ± 0.4, confirming the key role of the rne 5'
UTR in RNase E autoregulation. Together, these findings indicate that
autoregulation of rne gene expression is mediated in
cis by one or more evolutionarily conserved features of the
sequence and/or structure of the rne 5' UTR.
To begin to map which of the conserved features of the rne
5' UTR is responsible for feedback regulation, two complementary deletions were created within the ez1000 5' UTR, such that
the 3' endpoint of the promoter-proximal deletion coincided with
the 5' endpoint of the promoter-distal deletion. The repression
ratio of each deletion mutant was then determined (Fig.
4). Deletion of the entire 5' UTR segment
preceding hp3 (ez
1-113) impaired, but did not
abolish, regulation (R = 6 ± 1, versus 33 for
ez1000 and 2 for the baseline deletion mutant
ez
1-337). A complementary deletion that
removed hp3 and the ss3 segment upstream of the ribosome-binding site
(ez
114-337) had a more modest effect on feedback regulation (R = 12 ± 3). These findings
indicate that the first third of the rne 5' UTR
(nucleotides 1-113, comprising hp1, ss1, hp2, and ss2) and the next
two-thirds of the rne 5' UTR (nucleotides 114-337,
comprising hp3 and the first nine nucleotides of ss3) can each function
independently to mediate feedback regulation by RNase E, and that
together these two RNA segments produce the full autoregulatory effect
of the complete rne 5' UTR.
|
Proceeding further with the analysis, we constructed four additional
variants of rne-lacZ deletion mutant
ez
114-337 in which the remaining 5' UTR
structural domains upstream of the ribosome-binding site
hp1, ss1,
hp2, and ss2
were each deleted individually or pairwise (Fig. 4).
Whereas deletion of ss2 alone (ez
109-337) had
almost no effect on feedback regulation (R = 9 ± 1,
versus 12 for ez
114-337), the simultaneous
deletion of hp2 and ss2 (ez
52-337) virtually
abolished regulation (R = 2 ± 1, the same as that
observed for the baseline deletion mutant
ez
1-337 in which almost the entire
rne 5' UTR is deleted). Individual deletion of hp1
(ez
1-42/114-337) or ss1
(ez
43-51/114-337) caused only a
modest reduction in feedback regulation (R = 8-10, vs. 12 for ez
114-337). These findings indicate that
hp2 is necessary for efficient RNase E autoregulation. To test whether
this stem-loop can direct feedback regulation in the absence of all
other 5' UTR elements preceding the signals for translation
initiation, we substituted a single copy of hp2 for the entire
rne-derived segment of ez1000 mRNA upstream of the
ribosome-binding site. Expression of the resulting reporter transcript
(ez
1-51/109-337) was found to
be sensitive to cellular RNase E activity (R = 8 ± 1,
vs. 2 for the corresponding deletion mutant lacking hp2), indicating a
key role for hp2 in feedback regulation.
Despite the absence of hp2, the 5' UTR of rne-lacZ
deletion mutant ez
1-113 retains some
autoregulatory activity (R = 6 ± 1). To better define
the autoregulatory element remaining in the 5' UTR of
ez
1-113, we constructed two variants of this
deletion mutant by additionally removing either hp3 or the first nine
nucleotides of ss3, which together comprise the remainder of the 5'
UTR upstream of the ribosome-binding site (Fig. 4). Deleting this
portion of ss3 (ez
1-113/329-337) caused a
modest further reduction in feedback regulation
(R = 4 ± 1, vs. 6 ± 1 for
ez
1-113), whereas removing hp3
(ez
1-328) virtually abolished feedback
regulation (R = 2, the same as that observed for the
baseline deletion mutant ez
1-337). We conclude
that hp2 and, to a lesser extent, hp3 are the core 5' UTR elements
that mediate autoregulation of rne gene expression and that
each can function in the absence of all other RNA elements upstream of
the ribosome-binding site.
Conformation of rne hp2
Although the base-paired structure of rne hp2 shown in
Figures 2 and 3 fits the phylogenetic and alkylation data very well, an
alternative structure for the top portion of this hairpin was also
plausible on the basis of these data (cf. conformations A and B in Fig.
5). To distinguish between these possibilities, two
sets of mutations were introduced into hp2 in the context of
ez
114-337. Each set of mutations would
preserve base pairing in one hp2 conformation, via compensatory
base-pair substitutions, but disrupt base pairing in the other
(noncompensatory substitutions) (Fig. 5). When these two variants of
ez
114-337 were tested (Fig. 5), only the one
(ez
114-337var1) that preserved base pairing in
the original hp2 conformation (conformation A) was sensitive to
feedback regulation by RNase E (R = 8 ± 1); the other
(ez
114-337var2), which would disrupt
conformation A but preserve conformation B, was no more responsive than
the baseline rne-lacZ mutant lacking the entire 5' UTR
segment upstream of the ribosome binding site (R = 2 ± 1, the same as for
ez
1-337). These findings validate the original
structure proposed for rne hp2 (conformation A) and show that
it is the presence of base pairs at key positions in the upper part of
this stem-loop, rather than the sequence of these base pairs, that is
essential for hp2 function.
|
mRNA destabilization by rne hp2
Our previous studies have shown that the decay of both rne
and ez1 mRNA accelerates as cellular RNase E activity
increases. To demonstrate that rne hp2 plays a key role in
mediating this accelerated degradation, we compared the longevity of
rne-lacZ reporter mRNAs containing a wild-type copy of hp2
(ez
114-337) or a mutationally inactivated copy
(ez
114-337var2) in E. coli cells with
high or low RNase E activity.
For this purpose, we first introduced a base-pair substitution into the
promoters of ez
114-337 and
ez
114-337var2 to facilitate mRNA detection by
increasing the rate of transcription more than 10-fold. Plasmids bearing the
resulting reporter genes (ez
114-337T or
ez
114-337var2T) were introduced into E. coli strain CH1827 containing plasmid pRNE101 (multicopy
rne+; high RNase E activity) or into the isogenic strain
CH1828 (ams-1; low RNase E activity). Before measuring mRNA
decay rates, we checked the repression ratios of the two new reporter
genes. Measurements of
-galactosidase activity yielded repression
ratios of 5.1 ± 1.0 for ez
114-337T
(
-galactosidase activity = 662 ± 86 units in CH1827+pRNE101
vs. 3405 ± 459 units in CH1828) and 2.0 ± 0.5 for
ez
114-337var2T (
-galactosidase
activity = 1400 ± 102 units vs. 2807 ± 669 units). These
values resembled the repression ratios measured for the progenitor
reporter genes ez
114-337 and
ez
114-337var2 (see above), except that the
degree of repression was somewhat reduced for
ez
114-337T compared with
ez
114-337. The effect of the promoter up
mutation on the magnitude of feedback regulation by RNase E may be a
consequence of overproducing the rne-lacZ fusion protein,
which bears a large amino-terminal fragment of RNase E that might be capable
of influencing autoregulation when present at a high cellular concentration.
To measure the physical half-lives of the two reporter transcripts in the same host strains, rifampicin was added to log-phase cultures to halt further initiation of transcription, and total cellular RNA was extracted from culture samples withdrawn at time intervals thereafter. Equal amounts of each RNA extract were assayed by primer extension to determine the relative amount of rne-lacZ mRNA remaining. From these data, we calculated the half-lives of the rne-lacZ reporter mRNAs in cells with high or low RNase E activity (Fig. 6).
|
The half-life of the reporter transcript bearing a wild-type copy of
hp2 (ez
114-337T) was 5.2 times longer in
E. coli cells with low RNase E activity (half-life of
4.7 ± 0.8 min in the ams-1 strain) than in cells that
overproduce RNase E (half-life of 0.9 ± 0.1 min in the multicopy
rne+ strain). This difference in longevity closely
resembles the 5.1-fold difference in
-galactosidase activity
measured for ez
114-337T in the same two
strains. In comparison, the lifetime of the rne-lacZ transcript bearing a mutant copy of hp2
(ez
114-337var2T) was only 2.2 times longer in
the ams-1 strain (half-life of 4.4 ± 0.8 min) than in the
multicopy rne+ strain (half-life of 2.0 ± 0.3 min), a
ratio congruent with the 2.0-fold difference in
-galactosidase
activity measured for ez
114-337var2T in these
two strains. In cells with low RNase E activity,
ez
114-337T mRNA longevity and gene expression
were both independent of rne hp2. In contrast, an intact copy
of hp2 was necessary for an increase in RNase E activity to markedly
reduce the lifetime of ez
114-337T mRNA and
thereby to repress expression of this reporter gene by more than the
basal twofold effect typical of unrelated genes. These findings
indicate that hp2 directs feedback regulation of rne gene
expression by significantly accelerating RNase E-mediated degradation
of rne mRNA in response to increased cellular RNase E activity.
| |
Discussion |
|---|
|
|
|---|
The identification of the features of mRNAs that govern their
differential susceptibility to ribonuclease digestion is of great
importance for understanding how gene expression is controlled post-transcriptionally. This is particularly true for cleavage by RNase
E, the endonuclease that appears to initiate degradation via the
principal pathway for mRNA turnover in E. coli. Beside its
central role in the degradation of various unrelated mRNAs, this
important ribonuclease tightly regulates its own synthesis by
controlling the longevity of the RNase E (rne) gene transcript (Jain and Belasco 1995
). Compared with other E. coli genes,
expression of the rne gene is unusually sensitive to the
cellular level of RNase E activity (Jain and Belasco 1995
). Our data
show that feedback regulation of rne gene expression is
mediated in cis by two evolutionarily conserved secondary
structure elements within the rne 5' UTR. By targeting the
transcript that contains these regulatory elements for degradation at a
rate dependent on cytoplasmic RNase E activity, bacterial cells are
able to maintain RNase E near its optimal concentration.
We have determined the secondary structure of the 361-nucleotide
rne 5' UTR from E. coli by a combination of
phylogenetic comparison, chemical alkylation, and mutational analysis.
Our phylogenetic data indicate that, despite extensive sequence
divergence, the secondary structure of the rne 5' UTR is highly
conserved among diverse members of the
subgroup of purple bacteria,
including E. coli, S. marcescens, Y. pseudotuberculosis, E. carotovora, P. alcalifaciens, and H. influenzae. This evolutionary
conservation implies an important biological function for this long
untranslated RNA segment, a conclusion consistent with its role in
RNase E feedback regulation.
The rne 5' UTR can be subdivided into six structural domains: three stem-loops (hp1, hp2, and hp3) and three single-stranded segments (ss1, ss2, and ss3). Among these, hp2 and hp3 are the core elements that mediate autoregulation. Of the two autoregulatory elements, hp2 is the more potent. This stem-loop functions as a sensor of cellular RNase E activity that directs feedback regulation of rne gene expression by mediating the degradation of rne mRNA by RNase E at a rate that is sensitive to the level of RNase E activity in bacterial cells. At the top of hp2 is a 14-nucleotide RNA segment (GCAAUGGCGUAAGA) whose sequence is the most highly conserved of any rne 5' UTR segment more than a few nucleotides in length, showing strict conservation among E. coli, Serratia, Yersinia, Erwinia, and Providencia and deviating only slightly in Haemophilus. Together with two flanking nucleotides, this RNA segment folds to form a structural component, comprising a hairpin loop, a 2-bp stem, and an 8-nucleotide internal loop, whose integrity is critical for the ability of hp2 to modulate the rate of rne mRNA degradation in response to changes in cellular RNase E activity. A search of rne genes in bacteria for which incomplete genomic sequences are available reveals that an upstream element closely related to hp2 is also present in a number of other bacterial species, including Klebsiella pneumoniae, Vibrio cholera, and Actinobacillus actinomycetemcomitans.
The independent activity of hp2 and hp3 in mediating the inhibitory
effect of RNase E on rne gene expression and the excellent correlation between the effect of hp2 on gene expression and mRNA longevity suggest that each of these stem-loops functions in some manner as a target for RNase E. For example, in view of the preference of RNase E for cleaving RNA at sites that are AU- rich and single stranded (Lin-Chao et al. 1994
; McDowall et al. 1994
), the presence of
large AU-rich internal loops within hp3 raises the possibility that
this branched stem-loop might contain an RNase E cleavage site (Fig.
7A). In contrast, the predominantly base-paired
structure of hp2 makes it an unlikely target for RNase E cleavage, an
inference supported by in vitro cleavage experiments with the purified
catalytic domain of RNase E (data not shown). Nonetheless, binding of
hp2 by RNase E, either alone or in conjunction with a hypothetical protein cofactor, could accelerate rne mRNA degradation by
facilitating access of the bound ribonuclease to one or more
endonucleolytic cleavage sites elsewhere in the rne transcript
(Fig. 7B). In this manner, hp2 might help RNase E to overcome potential
impediments to cleavage imposed by other features of the rne
5' UTR, such as bound ribosomes (Baumeister et al. 1991
; Yarchuk et
al. 1992
; Jain and Kleckner 1993
; Arnold et al. 1998
), a
5'-proximal stem-loop (hp1) (Bouvet and Belasco 1992
), and a
5'-terminal triphosphate (Mackie 1998
), all of which have been
implicated in protecting other RNAs from cleavage by this enzyme. Such
a role for hp2 would be consistent with a number of lines of evidence
suggesting that it is often not the mere presence of RNase E cleavage
sites but rather their accessibility that is most important in
determining mRNA lifetimes.
|
What of the other rne 5' UTR domains upstream of the
ribosome binding site (hp1, ss1, and ss2)? In contrast to the marked autoregulatory effect observed for hp2 and the moderate effect of hp3,
none of these other domains appears to play a significant role in
feedback regulation, although they may slightly amplify the degree of
RNase E feedback regulation mediated by hp2 and hp3. It is worth noting
that the ss1 segment is dispensable for RNase E feedback regulation
even though it contains an apparent RNase E cleavage site (Jain and
Belasco 1995
). Either cleavage at this particular site is irrelevant to
the mechanism of feedback regulation, or this site is functionally redundant
with other RNase E cleavage sites within the rne transcript.
Phylogenetic analysis of the rne 5' UTR was greatly facilitated by the strategy used to isolate this gene segment from a number of different bacterial species. This strategy involved PCR amplification of genomic DNA using primers complementary to flanking coding region segments that were expected to be relatively well conserved at the sequence level. By carefully designing the primers and gradually stepping down the PCR annealing temperature (see Materials and Methods), it was possible to obtain a single major PCR product in good yield from a variety of bacterial genomes despite a lack of advance knowledge as to the exact degree of complementarity between the primers and their targets. This approach should be of value for rapidly determining the sequence and secondary structure of many other untranslated regions that control bacterial gene expression, as noncoding regions are generally expected to diverge in sequence more rapidly than protein-coding regions during the course of evolution.
| |
Materials and methods |
|---|
|
|
|---|
Strains and plasmids
The isogenic E. coli strains CH1827 (MC1061,
zce-726::Tn10) and CH1828 (MC1061,
zce-726::Tn10 ams-1) are derivatives of MC1061 [ara D39
(ara, leu) 7697
leuX74 galU
galK
hsr
hsm+ strA] (Mudd and Higgins 1993
). Y. pseudotuberculosis, S. marcescens, and E. carotovora were provided by Dr. A. Chatterjee (University of
Missouri, Columbia). P. alcalifaciens (ATCC 51902) was
obtained from the American Type Culture Collection.
Plasmid pRNE101 is a pACYC177 derivative containing the wild-type
E. coli rne gene (Jain and Belasco 1995
). Plasmids pEZ101 (pBR322 derivative) and pEZ201 (pSC101 derivative) each contain an
rne-lacZ reporter gene (ez1) comprising the
rne promoter, 5' UTR, and the first 181 codons of the
rne coding region fused to the tenth codon of
lacZ (Jain and Belasco 1995
). Plasmid pEZ1000 is a derivative
of pEZ201 that contains three modifications of the rne-lacZ
reporter. First, a single nucleotide change in the
10 region of the
rne promoter
(AATAAT
AATAAA) reduces the
strength of the promoter. In addition, NheI and BsrGI
restriction sites were introduced to facilitate dissection of essential
cis-acting elements in the rne 5' UTR. The
NheI site (GCTAGC) was created at the 5' boundary of the
5' UTR by replacing 4 nucleotides between the
10 re gion of
the promoter and the transcription initiation site
(GAGGCC
GCTAGC). The BsrGI
site (TGTACA) was created 7 nucleotides upstream of the Shine-Dalgarno
element by inserting one nucleotide (C) at position 340 relative to the
transcription initiation site. Plasmids pEZ
1-113, pEZ
1-328,
pEZ
1-337, pEZ
52-337, pEZ
109-337, pEZ
114-337,
pEZ
1-42/114-337, pEZ
43-51/114-337, and
pEZ
1-113/329-337 each encode an rne-lacZ
transcript with a deletion of the indicated 5' UTR nucleotides. Two
variants of pEZ
114-337 encode multiple nucleotide substitutions between
5' UTR nucleotides 69 and 91 that either preserve (pEZ
114-337var1, ACGCAGCAAUGGCGUAAGACGU
AGGCACGAAUGCGGUAAGACCU)
or disrupt (pEZ
114-337var2, ACGCAGCAAUGGCGUAAGACGU
GGCCAGCAAUGGGCCAAGACGU) the secondary structure at the top of rne hp2. Plasmids pEZ
114-337T
and pEZ
114-337var2T are identical to pEZ
114-337 and
pEZ
114-337var2, respectively, except for a base-pair substitution
in the
10 region of the rne-lacZ promoter
(AATAAA
AATAAT). In plasmid
pEZ
1-51/109-337, the 0.34-kb NheI-BsrGI fragment of pEZ1000 was replaced with a
single copy of rne hp2
(GCTAGCATTGCCCGACCGATCATCCACGCAGCAATGGCGTAAGACGTATTGATCTTTCAGGCAGTCTAGA). In plasmid pYEZ1000, the 0.34-kb NheI-BsrGI fragment
of pEZ1000 was replaced with a PCR-amplified
NheI-BsrGI fragment encoding the corresponding
segment of the Y. pseudotuberculosis rne 5' UTR
(nucleotides 1-338 flanked upstream by GCTAGC and downstream by
TGTACA). All plasmid constructions were confirmed by DNA sequencing.
Amplification and sequence analysis of the rne 5' UTR
Genomic DNA from E. coli, Y. pseudotuberculosis,
S. marcescens, E. carotovora, and P. alcalifaciens was prepared by resuspending an overnight bacterial
culture (4 ml) in Luria broth (LB, 1 ml), extracting with phenol (pH
8), ethanol-precipitating the genomic DNA, and resuspending it in TE
buffer (0.5 ml) (Mak and Ho 1992
).
Intergenic DNA upstream of the rne coding region of
Yersinia, Serratia, Erwinia, and
Providencia was amplified by PCR, using degenerate
oligonucleotide primers designed to be complementary to the
protein-coding region of the rne and yceC genes of
both E. coli and H. influenzae (Fleischmann et al.
1995
; Blattner et al. 1997
). These primers included the
oligonucleotides Drne1 (5'-ACAAGGGCAACGCGCARYTCTTC-3', complementary to rne codons 19-12), DyceC1
(5'-TAAACCACTGCCGCCATGTACNGC-3', complementary to yceC
codons 119-112), and DyceC3 (5'-AAGTTATCGATKCGTTGMCCKGCTTCGTC -3', complementary to yceC codons 25-16), where N = A/G/T/C,
R = A/G, Y = C/T, K = G/T, and M = A/C. PCR amplification
was performed using genomic DNA (typically 10-30 ng), an appropriate
primer pair (10 pmole each), 50 mM KCl, 10 mM
Tris-HCl (pH 9.0), 1.5 mM MgCl2, 1% Triton X-100,
180 µM each dNTP, and 1 unit of Taq DNA
polymerase (Promega) in a total reaction volume of 50 µl. After
denaturing the chromosomal DNA for 5 min at 94°C, step-down PCR was
performed on a thermocycler (MJ Research XT-100) using a denaturing
temperature of 94°C (30 sec), an initial annealing temperature of
65°C (30 sec), and a DNA synthesis temperature of 72°C (1 min).
After three cycles under these conditions, the annealing temperature
was reduced by 3°C for the next three cycles, reduced again by
3°C, and so on until the annealing temperature had fallen to
44°C. After 20 additional cycles at an annealing temperature of
50°C, the PCR products were purified using a High Pure PCR
purification kit (Boehringer Mannheim) and sequenced by Jackson
Laboratories (Bar Harbor, ME). Preliminary sequence data for the
rne genes of K. pneumoniae, V. cholera, and
A. actinomycetemcomitans were obtained from The Institute for
Genomic Research website (http://www.tigr.org).
Phylogenetic and biochemical analysis of rne 5' UTR secondary structure
Phylogenetic analysis of RNA secondary structure was performed by first aligning the sequences of the rne 5' UTRs of E. coli, Y. pseudotuberculosis, S. marcescens, E. carotovora, and P. alcalifaciens and then examining them visually for potential base-paired conformations common to all five. Covarying base-paired nucleotides served as hallmarks of phylogenetically conserved elements of secondary structure. The phylogenetically conserved secondary structure that we identified by visual sequence inspection was subsequently corroborated by computer analysis of sequence covariation (R.R. Gutell, University of Texas at Austin, pers. comm.). Other potential secondary structures were not phylogenetically conserved.
Chemical probing of RNA secondary structure was performed in vivo and
in vitro as described previously (Moazed et al. 1986
; Chen et al.
1991
). Total cellular RNA was isolated from an exponential culture of
E. coli strain CH1828 containing pEZ101 after treating an
aliquot of the culture with DMS (5 µl per ml of culture). In addition, samples of RNA extracted from an untreated culture were alkylated in vitro with DMS or CMCT. Sites of alkylation within the
ez1 5' UTR were mapped by primer extension using AMV reverse transcriptase and various complementary 5' end-labeled DNA primers.
Assays of
-galactosidase activity
Overnight cultures of CH1827+pRNE101 or CH1828 containing each
reporter plasmid were diluted 1:100 into fresh LB medium (2 ml) and
grown at 37°C to OD600 ~0.5.
-Galactosidase
activity was measured for each culture as described previously (Jain
and Belasco 1995
). All reported values are the average of at least six measurements.
Assays of rne-lacZ mRNA half-life
Overnight cultures of CH1827+pRNE101 or CH1828 containing plasmid
pEZ
114-337T or pEZ
114-337var2T were grown exponentially in
LB medium at 37°C to OD600 = 0.5. Rifampicin (0.2 mg/ml) was added to inhibit further transcription
initiation, and total cellular RNA was isolated at time intervals
thereafter, as described previously (Emory and Belasco 1990
). Primer
extension analysis was performed by first combining equal amounts of
each RNA extract (4 or 8 µg) with a 5' end-labeled DNA primer
(5'-CCCAGTCACGACGTTGTAAAACG-3'; 25 fmole) in reverse
transcription buffer (50 mM Tris-HCl at pH 8.3, 75 mM KCl, 3 mM MgCl2; 6.25 µl). The
RNA-DNA mixtures were denatured at 65°C and then allowed to anneal
by cooling slowly for 1 hr to ~30°C. Reverse transcription buffer
(1.25 µl) containing all four dNTPs (3 mM each),
dithiothreitol (60 mM), and Superscript reverse transcriptase
(75 units; Life Technologies) was added, and primer extension was
allowed to proceed at 45°C for 16 hr. The reactions were quenched by
adding loading buffer (5 µl) that contained 95% formamide, 20 mM EDTA (pH 8.0), 0.05% bromophenol blue, and 0.05% xylene
cyanol. Following denaturation at 90°C for 2 min, the primer
extension products were separated by electrophoresis on a 4%
polyacrylamide-urea gel. Radioactive gel bands were visualized and
quantitated with a Molecular Dynamics Storm 820 PhosphorImager. The
physical half-life of each reporter mRNA in each host strain was
calculated from the slope of a semilogarithmic plot of mRNA concentration versus time, as determined by linear regression analysis.
For each plot, a half-life error was estimated from the standard
deviation of the slope. All half-life measurements were performed at
least twice using two different RNA preparations.
| |
Acknowledgments |
|---|
We thank Robin Gutell for helping to corroborate our phylogenetically determined secondary structure by independently analyzing our sequence data for covariation. We also thank Ann Hochschild for generously sharing her laboratory facilities and Diana Bratu and Jixiang Xu for their experimental contributions. This research was funded by a grant (to J.G.B.) from the National Institutes of Health (GM35769) and by a Faculty Research Award (to J.G.B.) from the American Cancer Society (FRA-419).
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
Note added in proof |
|---|
The sequences of the rne 5' UTR from Yersinia pseudotuberculosis (accession number AF259267), Providencia alcalifaciens (AF259268), Serratia marcescens (AF259269), and Erwinia carotovora (AF259270) have been deposited at GenBank.
| |
Footnotes |
|---|
Received February 22, 2000; revised version accepted March 28, 2000.
3 These two authors contributed equally to this work.
4 Corresponding author.
E-MAIL belasco{at}saturn.med.nyu.edu; FAX (212) 263-8951.
| |
References |
|---|
|
|
|---|