|
|
|
Vol. 17, No. 4, pp. 419-437, February 15, 2003
Departments of 1 Pathology and 2 Molecular and Cellular Biology, Baylor College of Medicine, Houston, Texas 77030, USA; 3 Graduate Program in Basic and Applied Biology, ICBAS, University of Oporto, Portugal
The precision and complexity of intron removal during pre-mRNA
splicing still amazes even 26 years after the
discovery that the coding information of metazoan genes is interrupted
by introns (Berget et al. 1977 The typical human gene contains an average of 8 exons. Internal
exons average 145 nucleotides (nt) in length, and introns average more
than 10 times this size and can be much larger (Lander et al. 2001
![]()
Introduction
Top
Introduction
Constitutive splicing and the...
Alternative splicing
Human disease caused by...
Cis effects: mutations that...
Cis effects: mutations that...
Trans effects: mutations that...
Trans effects: mutations that...
Conclusions
References
; Chow et al. 1977
). Adding to this
amazement is the recent realization that most human genes express more
than one mRNA by alternative splicing, a process by which functionally diverse protein isoforms can be expressed according to different regulatory programs. Given that the vast majority of human genes contain introns and that most pre-mRNAs undergo alternative splicing, it is not surprising that disruption of normal splicing patterns can
cause or modify human disease. The purpose of this review is to
highlight the different mechanisms by which disruption of pre-mRNA
splicing play a role in human disease. Several excellent reviews
provide detailed information on splicing and the regulation of splicing
(Burge et al. 1999
; Hastings and Krainer 2001
; Black 2003
). The
potential role of splicing as a modifier of human disease has also
recently been reviewed (Nissim-Rafinia and Kerem 2002
).
![]()
Constitutive splicing and the basal splicing machinery
Top
Introduction
Constitutive splicing and the...
Alternative splicing
Human disease caused by...
Cis effects: mutations that...
Cis effects: mutations that...
Trans effects: mutations that...
Trans effects: mutations that...
Conclusions
References
).
Exons are defined by rather short and degenerate classical
splice-site sequences at the intron/exon borders (5' splice site, 3'
splice site, and branch site; Fig. 1A).
Components of the basal splicing machinery bind to the classical
splice-site sequences and promote assembly of the multicomponent
splicing complex known as the spliceosome. The spliceosome performs the two primary functions of splicing: recognition of the intron/exon boundaries and catalysis of the cut-and-paste reactions that remove introns and join exons. The spliceosome is made up of five small nuclear ribonucleoproteins (snRNPs) and >100 proteins. Each snRNP is
composed of a single uridine-rich small nuclear RNA (snRNA) and
multiple proteins. The U1 snRNP binds the 5' splice site, and the U2
snRNP binds the branch site via RNA:RNA interactions between the snRNA
and the pre-mRNA (Fig. 1B). Spliceosome assembly is highly dynamic in
that complex rearrangements of RNA:RNA, RNA:protein, and
protein:protein interactions take place within the spliceosome. Coinciding with these internal rearrangements, both splice sites are
recognized multiple times by interactions with different components during the course of spliceosome assembly (for example, see Burge et
al. 1999
; Du and Rosbash 2002
; Lallena et al. 2002
; Liu 2002
). The
catalytic component is likely to be U6 snRNP, which joins the
spliceosome as a U4/U6 · U5 tri-snRNP (Villa et al. 2002
).

View larger version (35K):
[in a new window]
Figure 1.
Classical and auxiliary splicing signals (n = G, A,
U, or C; y = pyrimidine; r = purine). (A) Classical splice
sites: The classical splicing signals found in the major class (>99%)
of human introns are required for recognition of all exons. There is
also a minor class of introns using different classical sequences
and different spliceosome components (Tarn and Steitz 1997
).
(B) Classical and auxiliary splicing elements and binding
factors: Factors that bind classical and auxiliary splicing
elements. Auxiliary elements within exons (ESEs and ESSs) and
introns (ISEs and ISSs) are commonly required for efficient splicing of
constitutive and alternative exons. Intronic elements also serve to
modulate cell-specific use of alternative exons by binding
multicomponent regulatory complexes. (C) Cis-acting
splicing mutations. Mutations that disrupt cis-acting elements
required for pre-mRNA splicing can result in defective splicing that
causes disease.
A splicing error that adds or removes even 1 nt will disrupt the open
reading frame of an mRNA; yet exons are correctly spliced from within
tens of thousands of intronic nucleotides. This remarkable precision
is, in part, built into the mechanism of intron removal because once
the spliceosome is assembled, the base-paired snRNAs target specific
phosphate bonds for cleavage. The challenge for the spliceosome comes
in recognizing the correct splice sites prior to the cut-and-paste
reactions. The short and degenerate splice sites contain only half of
the information necessary for splice-site recognition (Lim and Burge
2001
) because bona fide splice sites must be distinguished from pseudo
splice-site sequences that resemble classical splice sites but are
never used. Pseudo splice sites can outnumber bona fide splice sites
within a pre-mRNA by an order of magnitude (Sun and Chasin 2000
).
Auxiliary cis-elements, known as exonic and intronic splicing
enhancers (ESEs and ISEs) and exonic and intronic splicing silencers
(ESSs and ISSs; Fig. 1B), aid in the recognition of exons (see below).
It is now clear that exon recognition is accomplished by the
accumulated recognition of multiple weak signals, resulting in a
network of interactions across exons as well as across introns (Fig.
1B; Berget 1995
; Reed 1996
). It is also clear that different constitutive exons are recognized by different mechanisms and require
different sets of auxiliary elements in addition to the classical
splice-site sequences. The significance of these observations is
threefold. First, there are a considerable number of disease-causing mutations in exons or introns that disrupt previously unrecognized auxiliary cis-elements as well as the well-known classical
splice sites (Fig. 1C). Second, because exons differ in their
requirements for recognition, mutations that disrupt the function of
the splicing machinery will have different effects on different
subsets of exons. Third, variability in the basal splicing machinery
among different cell types could cause cell-specific sensitivities to individual splicing mutations.
| |
Alternative splicing |
|---|
|
|
|---|
Alternative splicing is the joining of different 5' and 3' splice
sites, allowing individual genes to express multiple mRNAs that encode
proteins with diverse and even antagonistic functions. Up to 59% of
human genes generate multiple mRNAs by alternative splicing (Lander
et al. 2001
), and ~80% of alternative splicing results in changes in
the encoded protein (Modrek and Lee 2002
), revealing what is likely to
be the primary source of human proteomic diversity. Alternative
splicing generates segments of mRNA variability that can insert or
remove amino acids, shift the reading frame, or introduce a termination
codon (Fig. 2). Alternative splicing also
affects gene expression by removing or inserting regulatory elements
controlling translation, mRNA stability, or localization.
|
A large fraction of alternative splicing undergoes cell-specific
regulation in which splicing pathways are modulated according to cell
type, developmental stage, gender, or in response to external stimuli.
In the best characterized models of vertebrate cell-specific alternative splicing, regulation is mediated by intronic repressor and
activator elements distinct from the classical splicing sequences. Cell
specificity emerges primarily from two features: First, the repression
of splicing in the inappropriate cell type is combined with activation
of splicing in the appropriate cell type; and, second, combinatorial
control is exerted by multiple components involving cooperative
assembly of activation and/or repression complexes on the
cis-acting elements surrounding the regulated splice sites
(Grabowski 1998
; Smith and Valcarcel 2000
). The straightforward model
is that these complexes serve to enhance or inhibit recognition of
the classical splice sites by the basal splicing machinery. Activating and repressing activities coexist within cells (Charlet et
al. 2002a
), and it remains unclear why activation dominates in one cell
type whereas repression dominates in another. Importantly, mutations
that perturb this balance can result in aberrant regulation of
alternative splicing, causing the expression of protein isoforms that
are inappropriate for a cell type or developmental stage.
| |
Human disease caused by disruption of pre-mRNA splicing |
|---|
|
|
|---|
To define the diverse mechanisms by which defects in pre-mRNA splicing result in a primary cause of disease, we have classified splicing mutations into four categories (Fig. 3). These categories are based on two criteria. First, does the mutation affect expression of a single gene by disrupting a splicing cis-element, or does the mutation have an effect in trans on multiple genes by disrupting a component of the splicing machinery or of a splicing regulatory complex? Second, does the mutation cause aberrant splicing (expression of unnatural mRNAs) by creating unnatural splicing patterns or aberrant regulation of splicing (the inappropriate expression of natural mRNAs) by disrupting use of alternatively used splice sites?
|
Cis-acting mutations can affect the use of constitutive splice sites (Fig. 3A) or alternative splice sites (Fig. 3B). Disrupted constitutive splicing most often results in loss of gene expression due to aberrant splicing (see below). On the other hand, a cis-acting mutation that inactivates (or activates) one of two alternatively used splice sites will force expression of one of the alternative splicing patterns. Although a natural mRNA is expressed, its expression in an inappropriate tissue or developmental stage might result in disease.
Trans-acting splicing mutations can affect the function of the basal splicing machinery (Fig. 3C) or factors that regulate alternative splicing (Fig. 3D). Mutations that affect the basal splicing machinery have the potential to affect splicing of all pre-mRNAs, whereas mutations that affect a regulator of alternative splicing will affect only the subset of pre-mRNAs that are targets of the regulator. Each of these four categories are described in the remainder of the review.
| |
Cis effects: mutations that disrupt use of constitutive splice sites |
|---|
|
|
|---|
The majority of mutations that disrupt splicing are single
nucleotide substitutions within the intronic or exonic segments of the
classical splice sites (Fig. 1C). These mutations result in either
complete exon skipping, use of a nearby pseudo 3' or 5' splice site, or
retention of the mutated intron. Mutations can also introduce a new
splice site within an exon or intron. In rare cases, mutations that do
not disrupt or create a splice site activate pre-existing pseudo splice
sites distal from the mutation (Pagani et al. 2002
), consistent with
the proposal that introns contain splicing-inhibitory sequences
(Fairbrother and Chasin 2000
). In most cases, use of unnatural splice
sites or intron retention introduces premature termination codons
(PTCs) into the mRNA, typically resulting in degradation by
nonsense-mediated decay (NMD) and loss of function of the mutated
allele (for recent reviews, see Hentze and Kulozik 1999
; Maquat and
Carmichael 2001
). Cis-acting mutations have been tabulated in
several reviews and the Human Gene Mutation Database (HGMD,
http://www.uwcm.ac.uk/uwcm/mg/hgmd0.html; Nakai and Sakamoto
1994
; Cooper et al. 1995
; Rogan et al. 1998
; Caceres and Kornblihtt
2002
; Cartegni et al. 2002
).
A survey performed more than a decade ago found that 15% of point
mutants that result in human genetic disease disrupted splicing (Krawczak et al. 1992
). This is likely to be an underestimate because
the analysis was limited to mutations in the classical splice-site
sequences, the only splicing elements widely recognized at the time. It
is now known that widespread aberrant splicing is also caused by
mutations that disrupt exonic splicing elements (ESEs and ESSs; Fig.
1C). Given recent predictions that the majority of human exons contain
ESEs (Liu et al. 2001
; Fairbrother et al. 2002
), one striking
realization is that a significant fraction of exonic mutations that
cause disease are unrecognized splicing mutations (for review, see
Cooper and Mattox 1997
; Caceres and Kornblihtt 2002
; Cartegni et al.
2002
). The identification of disease-causing mutations is based
primarily on linkage of the mutation with the disease phenotype. The
effect of the mutation on gene expression is generally assumed based in
its location. Because exonic mutations are assumed to cause disease by
affecting only the coding potential, silent mutations have been ignored as potential causes of disease, missense mutations have been assumed to
create a significant alteration in protein function, and nonsense mutations have been assumed to lead to expression of nonfunctional or
deleterious truncated proteins or loss of function caused by NMD. In
fact, the primary mechanism of disease in a significant fraction of
disease-causing exonic mutations is a catastrophic splicing abnormality
rather than a direct effect on coding potential (Cartegni et al. 2002
).
The definitive test of whether a disease-causing mutation affects
splicing is by direct analysis of mRNA linear structure for correct
splicing and mRNA steady-state levels to detect NMD. Ideally, RNA from
the affected tissue should be analyzed because cis-acting
splicing mutations can have cell-specific effects (Slaugenhaupt et
al. 2001
). Unfortunately, the appropriate tissues are often not
available to analyze splicing of endogenous mRNAs. As alternatives, mutations that disrupt ESEs or ESSs have been identified using transient transfection of minigenes or in vitro splicing assays comparing splicing of the mutant and wild-type exons (e.g., see McCarthy and Phillips 1998
; D'Souza et al. 1999
; Pagani et al. 2000
;
Cartegni and Krainer 2002
).
The ability to identify exonic auxiliary splicing elements based on
sequence alone would significantly enhance identification of
disease-causing mutations. Bona fide mutations could be distinguished from benign polymorphisms and the missense, and nonsense mutations that
disrupt ESEs or ESSs could be recognized. As therapies directed toward
reverting aberrant splicing patterns become practical, the relevance of
identifying splicing mutations will increase. Two major classes of ESEs
have been defined based on nucleotide composition: purine-rich and
A/C-rich (Cooper and Mattox 1997
). The purine-rich ESEs are recognized
by a conserved family of serine/arginine-rich (SR) proteins that
recruit spliceosome components (such as U2 auxiliary factor, U2AF) to
the splice sites (Fig. 1B; Blencowe 2000
). ESEs can also enhance
splicing by inhibiting adjacent ESSs (Kan and Green 1999
; Zhu et al.
2001
). The A/C-rich ESEs (ACEs) bind the cold-box protein, YB-1, and
promote splicing by an undetermined mechanism (Coulter et al. 1997
;
Stickeler et al. 2001
).
Several complementary approaches are being used to identify additional
auxiliary splicing elements (for review, see Ladd and Cooper 2002
). A
recent computational analysis of human genomic sequence identified
10 ESEs, 5 of which are novel, by analyzing hexameric sequences
enriched in exons that are flanked by weak splice sites. All 10 ESEs
functioned autonomously to enhance splicing of a weak exon in vivo
(Fairbrother et al. 2002
). In a different approach, preferred ESE
targets for four individual SR proteins were identified using
functional systematic evolution of ligands by exponential enrichment
(SELEX; Liu et al. 1998
, 2000
). The consensus sequences derived from
these experiments were used to develop an ESE prediction program (at
http://exon.cshl.org/ESE), which has subsequently been used to
identify ESE mutations that cause pathogenic splicing abnormalities in
four genes including breast cancer susceptibility genes, BRCA1
and BRCA2 (Liu et al. 2001
; Fackenthal et al. 2002
), and the
SMN2 gene, which plays a role in spinal muscular atrophy (SMA;
Cartegni and Krainer 2002
; see below). Cartegni and Krainer (2002)
predicted that 50% of exonic mutations that cause exon skipping
disrupt binding sites for one of the four SR proteins used for
functional SELEX. The corollary prediction is that the other 50%
disrupt binding sites for other proteins. Analyses of ESEs and the
mechanism of ESE-mediated splicing have focused on purine-rich SR
protein-binding sites. Given that additional ESEs continue to be
identified, it is likely that the diversity as well as the number of
ESEs relevant to human disease have been underestimated.
| |
Cis effects: mutations that disrupt use of alternative splice sites |
|---|
|
|
|---|
Pre-mRNA mutations that affect the use of an alternative splice site shift the ratio of natural protein isoforms (Fig. 3B) rather than create an aberrant splice with the usual associated loss of function. There are four well-characterized examples of such mutations associated with human disease (Fig. 4).
|
Familial isolated growth hormone deficiency type II (IGHD II)
Postnatal growth in humans requires secretion of growth hormone (GH)
from the anterior pituitary. Familial isolated GH deficiency type II
(IGHD II) is a dominantly inherited disorder caused by mutations in the
single GH gene (GH-1), in which the main symptom is short
stature (Cogan et al. 1994
). GH-1 contains five exons and
generates a small amount (5%-10%) of alternatively spliced mRNAs
(Lecomte et al. 1987
). Full-length GH protein is 22 kD, whereas use of
an alternative 3' splice site that removes the first 45 nt of exon 3 and skipping of exon 3 generate 20-kD and 17.5-kD isoforms,
respectively (Fig. 4A). All IGHD II mutations cause increased
alternative splicing of exon 3 by disrupting one of three splicing
elements: an ISE, an ESE, or the 5' splice site (Fig. 4A; Binder et al.
1996
; Cogan et al. 1997
; Moseley et al. 2002
). The natural functions of
the 17.5-kD and 20-kD proteins are unknown, but dominant inheritance is
thought to result from a dominant-negative effect of the truncated
proteins on secretion (Binder et al. 1996
). The ISE was first
identified by two independent GH-1 mutations located in the
intron downstream from exon 3 (Fig. 4A; Cogan et al. 1997
), and
analysis of transiently expressed GH-1 minigene constructs
demonstrated ISE activity in vivo (McCarthy and Phillips 1998
). One
IGHD II mutation is a G
A substitution within one of two adjacent
G triplets. The other is a deletion that removes both G triplets. The G
triplets disrupted by these mutations are similar to a regulatory
element identified in the chicken
-tropomyosin gene (Sirand-Pugnet
et al. 1995
). The association of G triplets with 5' splice sites was
identified computationally early on (Nussinov 1988
; Engelbrecht et al.
1992
), and these elements have been shown to recruit U1 (McCullough and
Berget 2000
).
A G
A substitution in the fifth nucleotide of exon 3 was recently
linked to disease in an IGHD II family and was shown to disrupt an ESE
(Fig. 4A; Moseley et al. 2002
). The ESE mutations caused exon skipping
as well as enhanced use of the alternative 3' splice site within exon
3. Finally, a G
A mutation in the first nucleotide of intron 3 disrupts the 5' splice site and causes complete exon skipping and
expression of the 17.5-kD isoform (Fig. 4A; Cogan et al. 1997
).
Frasier syndrome
Inactivation of the Wilms tumor suppressor gene (WT1) is
responsible for ~15% of Wilms tumors, a pediatric cancer of the
kidney (Call et al. 1990
; Gessler et al. 1990
). Three additional
disorders are associated with abnormalities in WT1 expression:
WAGR (Wilms tumor, aniridia, genitourinary abnormalities, mental
retardation), Denys-Drash syndrome (DDS), and Frasier syndrome (FS).
All three diseases are characterized by urogenital disorders involving
kidney and gonad developmental defects. Consistent with these defects, normal expression patterns of human WT1 during development
indicate important roles in kidney and gonad development (Armstrong et al. 1993
), and Wt1-null mice lack gonads and kidneys
(Kreidberg et al. 1993
).
The human WT1 pre-mRNA undergoes extensive alternative splicing;
however, the only alternative splice conserved among vertebrates is the
use of two alternative 5' splice sites for exon 9 separated by 9 nt
that encode lysine-threonine-serine (KTS; Fig. 4B; Miles et al.
1998
). The +KTS and
KTS isoforms are expressed at a constant ratio
favoring the +KTS isoform in all tissues and developmental stages that
express WT1 (Haber et al. 1991
). The majority of individuals with FS were found to have mutations that inactivate the downstream 5'
splice site, resulting in a shift to the
KTS isoform (Fig. 4B;
Barbaux et al. 1997
; Kohsaka et al. 1999
; Melo et al. 2002
).
The WT1 protein contains four C2H2 zinc fingers at
its C terminus and a proline/glutamine-rich N-terminal region. The
variable KTS region is located between the third and fourth zinc
fingers. Mouse models that express only the endogenous
KTS or +KTS
isoforms provide a striking demonstration of the functional differences between the two nearly identical isoforms (Hammes et al. 2001
). The
properties of the two WT1 isoforms also indicate that they perform
distinct functions. The
KTS isoform trans-activates
transcription of genes involved in early gonad development including
Sf1 and Sry (Hossain and Saunders 2001
; Wilhelm and
Englert 2002
). In contrast, the +KTS isoform binds DNA only weakly, and
is unable to activate targets of the
KTS isoform (Wilhelm and Englert
2002
). The +KTS isoform appears to function in RNA metabolism, perhaps pre-mRNA splicing. Whereas
KTS shows diffuse nuclear localization, +KTS colocalizes in nuclear speckles, which are thought to be storage
areas for components of the basal splicing machinery (Larsson et al.
1995
; Davies et al. 1998
). In addition, +KTS binds U2AF65, an
essential splicing factor involved in the early steps of exon recognition.
Because FS is dominantly inherited, affected individuals have a
wild-type allele expressing the normal ratio of +KTS/
KTS. The basis
for FS is therefore underexpression of +KTS, overexpression of
KTS,
or a combination, indicating that the ratio of the two isoforms is
critical (Barbaux et al. 1997
).
Frontotemporal dementia and Parkinsonism linked to Chromosome 17 (FTDP-17)
Aggregation of the microtubule-associated protein tau into neuronal
cytoplasmic inclusions is associated with several neuropathological conditions characterized by progressive dementia including Alzheimer's disease, Pick's disease, and frontotemporal dementia and Parkinsonism linked to Chromosome 17 (FTDP-17; Buee et al. 2000
). FTDP-17 is an
autosomal dominant disorder caused by mutations in the MAPT gene that encodes tau. Tau is required for microtubule assembly and
function and is thought to play a major role in microtubule-dependent transport in axons. Free tau, not bound to microtubules, is proposed to
be subject to hyperphosphorylation and aggregation.
Since the initial discovery in 1998 that MAPT mutations cause
FTDP-17, at least 16 mutations have been identified in 50 FTDP-17 families. MAPT mutations fall into two mechanistic classes.
One class includes mutations that alter the biochemical properties of
the protein. In vitro analysis of these mutant proteins demonstrated either altered ability to modulate microtubule polymerization or
enhanced self-aggregation into filaments that resemble neurofibrillary tangles. A second class of disease-causing mutations that affected splicing was revealed by mutations clustered in and around the alternatively spliced exon 10 (Fig. 4C). A primary role for splicing defects was indicated by the discovery of 5' splice site mutations and
the observation that not all exon 10 missense mutations altered tau
function in vitro. Subsequently, silent mutations in exon 10 were
linked to FTDP-17, ruling out expression of a mutated protein as the
pathogenic event (Hong et al. 1998
; D'Souza et al. 1999
). Exon 10 encodes the last of four microtubule-binding domains, and exon 10 inclusion is determinative for the ratio of the 4R-tau and 3R-tau
protein isoforms (4R and 3R designate four and three
microtubule-binding domains, respectively). The normal 4R/3R ratio is
1, and some FTDP-17 mutations alter this ratio by as little as twofold,
which indicates that a strict balance is required for either normal tau
function or to prevent tau aggregation.
The 4R/3R ratio is maintained by a complex set of intronic and exonic
splicing elements surrounding and within exon 10 including ESEs, ESSs,
ISSs, and a putative hairpin structure that sequesters the 5' splice
site (Fig. 4C). The vast majority of FTDP-17 mutations affect these
regulatory elements and cause disease by increasing inclusion of exon
10. As expected, the 4R-tau protein isoform predominates in the
insoluble tau aggregates in individuals with FTDP-17 (Hutton et al.
1998
; Spillantini et al. 1998
). However, not all FTDP-17 mutations are
expected to increase the 4R/3R ratio. One mutation, a 3-nt deletion
within the 5' ESE (
K280), results in complete exon skipping in a
minigene construct, presumably because it weakens an ESE (D'Souza et
al. 1999
). This mutation also decreases the 4R-tau protein function in
vitro, although the biochemical properties of the recombinant 4R
protein are irrelevant if the exon is completely skipped in affected
individuals. Unfortunately, the level of exon 10 inclusion in these
individuals is unknown because tissue samples are not available.
Atypical cystic fibrosis
Cystic fibrosis (CF) is an autosomal recessive disorder caused by
loss of function of the cystic fibrosis transmembrane conductance regulator (CFTR) gene. CFTR encodes a cAMP-dependent
transmembrane chloride channel that is expressed in secretory
epithelium. In the USA, more than two-thirds of individuals affected
with CF carry the devastating
F508 mutation, which causes a failure
of the protein to localize to the apical plasma membrane. Fifty percent of affected individuals are homozygous for this allele, resulting in
severe pulmonary and pancreatic disease. However, less frequent, "milder" mutations that retain residual CFTR function are
responsible for a range of CF-related disorders including late onset or
less severe pulmonary disease, male infertility due to congenital
bilateral absence of the vas deferens (CBAVD), and chronic idiopathic
pancreatitis (Noone and Knowles 2001
).
Two polymorphisms in the CFTR gene that contribute to atypical
CF phenotypes are located at the 3' end of intron 8 and directly affect
splicing of exon 9 (Fig. 4D). One is a variant polyuridine tract
containing 5, 7, or 9 uridines within the polypyrimidine tract of
intron 8. The second is a polymorphic poly(UG)n locus immediately upstream of the (U)n tract. Both
polymorphisms are located between the presumptive branch site for
intron 8 and the AG-terminal dinucleotide. Nearly all individuals
express a small fraction of CFTR mRNAs that lack exon 9 and express a
nonfunctional protein (Delaney et al. 1993
; Strong et al. 1993
). It is
unclear whether this alternative splice serves a purpose.
The shortest (U)n allele, 5U, can be
associated with a high level of exon skipping in respiratory epithelial
cells compared with the 7U and 9U alleles. The
frequency of 5U carriers is estimated to be 10% worldwide
(Kiesewetter et al. 1993
). Some individuals homozygous for the
5U allele skip exon 9 in >95% of CFTR mRNAs in lung
epithelium (Chu et al. 1992
). This polymorphism is rarely sufficiently
penetrant to be associated with a severe CF phenotype (Noone et al.
2000
); however, many individuals affected with CBAVD are compound
heterozygotes for the 5U allele with a severe CFTR
mutation. Some individuals with CBAVD are 5U homozygotes (Chillon et al. 1995
), indicating that the 5U allele alone can cause disease and the disease in these individuals correlates with the
level of exon 9 skipping and the subsequent loss of CFTR function
(Larriba et al. 1998
).
On the other hand, the identification of healthy 5U
homozygotes demonstrated that the penetrance of the 5U allele
is quite variable. Variable penetrance is explained in part by the
second polymorphic (UG)n, tract which ranges in size
from (UG)9 to (UG)13. Longer UG tracts are
associated with higher disease penetrance and increased skipping of
exon 9 in individuals with CBAVD. In fact, healthy fathers of
individuals affected with CBAVD have been shown to contain shorter
(UG)n polymorphisms and exhibit less exon 9 skipping
than their affected sons, explaining the variable penetrance within
some families (Cuppens et al. 1998
). Transient transfection analysis of
CFTR minigenes directly demonstrated that the longer
(UG)n tract correlates with increased exon 9 skipping, but only when combined with the 5U allele (Niksic et
al. 1999
). A protein of unknown function, TDP-43, binds to the
(UG)n tract and inhibits exon 9 inclusion (Buratti et al. 2001
). The prediction, thus far untested, is that the
polypyrimidine tract of the 5U allele binds U2AF65
poorly compared with the 7U and 9U alleles and that
this interaction is negatively affected by binding of TDP-43 to the
upstream (UG)n tract (Buratti et al. 2001
).
| |
Trans effects: mutations that affect the basal splicing machinery |
|---|
|
|
|---|
There are several genetic diseases in which a mutation disrupts the
machinery of splicing, either the constitutive components of the
spliceosome (Fig. 3C) or auxiliary factors that regulate alternative
splicing (Fig. 3D). Null mutations in spliceosome components are
generally lethal or synthetic lethal in yeast and are most often lethal
at the cellular level in metazoans. For example, four components of the
basal splicing machinery (U2AF35, Sm protein D1, SF3b subunit
4, and U1Ca) were identified as genes required for early vertebrate
development in a large scale insertional mutagenesis screen in
zebrafish (Golling et al. 2002
). All four mutations resulted in
nonspecific developmental defects that are thought to result from
cell-lethal mutations. Despite the expectation that dysfunction of the
basal splicing machinery should be cell-lethal regardless of cell type,
mutations that disrupt the assembly or function of spliceosomal snRNPs
are responsible for two human diseases in which two different subsets
of neurons are affected (Fig. 3C).
Retinitis pigmentosa
Retinitis pigmentosa (RP) is a heterogeneous disease affecting 1 in
4000 individuals characterized by progressive retinal degeneration,
night blindness, loss of peripheral vision, and ultimately total
blindness. The disease results from the specific loss of rod
photoreceptor cells. RP can be inherited as an autosomal dominant,
autosomal recessive, or X-linked disorder. More than 30 different RP
genes and loci have been identified, most of which have retina-specific
functions. However, within the past year and a half, three genes
responsible for autosomal dominant RP (PRPF31, HPRP3, and
PRPC8) have been identified as the human orthologs of the yeast
genes PRP31, PRP3, and PRP8, respectively
(McKie et al. 2001
; Vithana et al. 2001
; Chakarova et al. 2002
). All three yeast genes are involved in the function of the U4/U6 · U5
tri-snRNP, the spliceosome component required for the transition to a
catalytically active state. All three human proteins were found in
isolated functional spliceosomes (Zhou et al. 2002
).
Pathogenic mutations in PRPF31 have been identified in four RP
families and three sporadic cases of autosomal dominant RP (Vithana et
al. 2001
). The mutations include insertions, deletions, missense
mutations, and splice-site mutations. It is likely that in at least
some mutant alleles, the function of PRPF31 is severely affected, if
not completely eliminated. Therefore, PRPF31 mutations are
likely to cause autosomal dominant RP due to haploinsufficiency, although a dominant-negative effect for alleles expressing truncated proteins cannot be ruled out.
Of the three splicing-factor genes that cause RP, the function of
PRPF31 is best defined. Prp31p is an essential splicing factor
in both Saccharomyces cerevisiae (60% similarity to human) and Schizosaccharomyces pombe (68% similarity to human;
Weidenhammer et al. 1996
; Bishop et al. 2000
). Human PRPF31 is a U4/U6
snRNP-associated protein that promotes association between U4/U6 snRNP
and U5 snRNP by direct interactions with a 102-kD U5-specific protein.
In vitro splicing using HeLa cell nuclear extracts immunodepleted of
PRPF31 showed accumulation of the prespliceosome complex (containing U1
and U2 snRNPs) by preventing association of the U4/U6 · U5 tri-snRNP and assembly of the active spliceosome. Addition of recombinant PRPF31 reversed this inhibition, demonstrating that the
deficiency of PRPF31 was responsible for the block (Makarova et al.
2002
). Although several mutations in snRNP proteins inhibit the
prespliceosome-to-spliceosome transition, PRPF31 is unique in that it
directly mediates formation of the U4/U6 · U5 tri-snRNP rather than
direct interactions between the tri-snRNP and the prespliceosome. The
ability to deplete PRPF31 and then reconstitute PRPF31-dependent
splicing provides a powerful in vitro assay to test the intrinsic
activities of PRPF31-disease-causing mutations.
Mutations in HPRP3 have been shown to cause RP in three
families and two sporadic cases (Chakarova et al. 2002
). All five examples are caused by one of two missense mutations in two highly conserved adjacent codons in exon 11. This protein domain is unique in
the database, and its specific function is unknown. Like PRPF31p, HPRP3
is a component of the U4/U6 snRNP (Wang et al. 1997
). In mammals, HPRP3
is thought to recruit HPRP4 to the U4/U6 snRNP (Gonzalez-Santos et al.
2002
). HPRP3 and HPRP4 are homologs of the yeast U4/U6-snRNP-specific
proteins. In yeast, PRP3 and PRP4 genetically
interact, and physical interactions between Prp3p and Prp4p proteins
are required for association of Prp3p and Prp4p with U4/U6 (Ayadi et
al. 1998
).
Mutations in PRPC8 cause a severe form of RP (McKie et al.
2001
). Seven different mutations have been identified in three RP
families and four individuals with a history of autosomal dominant RP.
All of these mutations cluster in a highly conserved 14-amino-acid region in the last exon. PRPC8 encodes PRP8, a 220-kD core
component of the U5 snRNP. The PRP8 protein is highly conserved, being
62% identical in human and S. cerevisiae throughout its
~2300 residues. PRP8 is known to be an integral component of the
spliceosome catalytic core and makes direct contact with both the 5'-
and 3' splice sites and U6 as well as U5 snRNAs (Wyatt et al. 1992
;
Teigelkamp et al. 1995
; Vidal et al. 1999
). PRP8 is thought to provide
overall structural support for the catalytic core and to modulate the RNA helicase activities that control the extensive RNA:RNA base-pairing rearrangements required to activate the spliceosome (Collins and Guthrie 2000
). The remarkable clustering of the mutations identifies a
specific functional domain, but it remains to be determined whether
these mutations inactivate the allele or create a protein with
dominant-negative function. The possible basis for the striking cell-specific effects of the RP mutations is discussed below.
Spinal muscular atrophy
Spinal muscular atrophy (SMA) is an autosomal recessive disorder
that is one of the most common genetic causes of childhood mortality.
The main characteristic of the disease is progressive loss of spinal
cord motor neurons, resulting in skeletal muscle denervation with
subsequent weakness, atrophy, and paralysis of voluntary muscles. The
SMA locus maps to a complex inverted repeat of ~500 kb on Chromosome
5q13 that contains several genes. The cause of SMA in 96% of cases is
homozygous loss of the telomeric copy of the survivor of motor neuron
gene (SMN1) located within the inverted repeat (Wirth 2000
). A
duplicated gene within the centromeric copy of the inverted repeat
(SMN2) is also transcribed and contains only a few nucleotide
substitutions, none of which alters the protein coding sequence.
Despite the potential to encode the identical protein, the
SMN2 gene does not completely compensate for loss of
SMN1 function because one of the nucleotide substitutions disrupts an ESE in exon 7 that causes the exon to be skipped in the
majority of SMN2 mRNAs (Cartegni and Krainer 2002
). The resulting SMN2
E7 mRNA encodes a truncated protein missing the C-terminal 16 residues and is thought to be nonfunctional (Cifuentes-Diaz et al.
2001
).
SMN is a ubiquitously expressed 294-amino-acid protein that is
essential in S. pombe (Owen et al. 2000
) and is required for cell viability in vertebrates (Wang and Dreyfuss 2001
). The specific functions of SMN are unknown, but it is in a complex (Baccon et al.
2002
) that interacts with components of several RNP complexes with
diverse functions, which suggests that SMN acts as a "master assembler" of RNP complexes (Terns and Terns 2001
). The best
characterized role for the SMN complex is in the assembly of U1, U2,
U4, and U5 snRNPs, which contain a common set of seven Sm proteins
as well as sets of proteins unique to each snRNP. SnRNP assembly begins
with export of nascent snRNA to the cytoplasm, where Sm proteins
assemble as a ring around a 9-nt Sm-binding site on each snRNA, forming
the so-called core snRNP. The assembled Sm proteins plus a
trimethylguanosine (m3G) cap at the snRNA 5' end serve as a bipartite
nuclear localization signal. Once in the nucleus, snRNP-specific
proteins are added to the core snRNPs to form active snRNPs (Will and
Luhrmann 2001
).
SMN is required for the cytoplasmic assembly of the core snRNPs.
Immunodepletion of SMN plus a tightly associated integral component of
the SMN complex, Gemin2, prevented U1 snRNP assembly in
Xenopus oocyte extracts despite the presence of abundant Sm proteins. Assembly was restored by adding back purified SMN complex (Meister et al. 2001
). Overexpression of an N-terminal truncation mutant of SMN (SMN
N27) with dominant-negative activity resulted in
cytoplasmic coaccumulation of Sm proteins, SMN, and U snRNAs. The snRNA
did not contain the m3G cap, suggesting that these accumulations result
from arrested snRNP maturation (Pellizzoni et al. 1998
). Native SMN
complexes purified from cells have recently been shown to be
necessary and sufficient to promote ATP-dependent assembly of core
snRNPs in vitro. Furthermore, under the conditions used, the SMN
complex was required to prevent binding of Sm proteins to non-U
snRNAs (Pellizzoni et al. 2002
). Although purified Sm proteins assemble
on snRNAs in an ordered pathway in vitro in the absence of non-snRNP
factors (Raker et al. 1996
), the SMN complex could be required for
efficient core assembly in the complex cellular environment or to
regulate snRNP assembly in response to cellular metabolism.
Four clinical types of SMA have been defined based on age of onset
and disease severity, which ranges from intrauterine demise to mild
symptoms in older individuals. Results from individuals affected
with SMA and SMA mouse models demonstrate that there is a clear
correlation between SMN protein levels, loss of motor neurons, and
disease severity (Coovert et al. 1997
; Lefebvre et al. 1997
; Jablonka
et al. 2000
). Because both copies of SMN1 are missing in most
individuals with SMA, the only source of full-length SMN protein is
the small fraction of SMN2 mRNAs that include exon 7. Quantification of
SMN2 gene number using real-time PCR showed that
individuals with the less severe type III typically have multiple
copies of the SMN2 gene through gene replacement and duplication (Feldkotter et al. 2002
). Therefore, the effects of a
primary loss of SMN1 are ameliorated by the small amount of full-length SMN protein encoded by each copy of the SMN2 gene.
A strong correlation between the loss of motor neurons and the
reduction of nuclear staining for SM-containing snRNPs in mouse models
of SMA strongly suggests that the SMN deficiency causes disease by a
defect in pre-mRNA splicing. Unlike humans, mice have only one
Smn gene. Smn
/
mice die at the
blastocyst stage (Schrank et al. 1997
), and Smn+/
mice develop symptoms strikingly similar to SMA (Jablonka et al. 2002
).
The diffuse staining of cytoplasmic SMN was reduced in spinal neurons
of Smn+/
mice, and nuclear anti-Sm
immunofluorescence (for nuclear snRNPs) was reduced by 39% (Jablonka
et al. 2000
). In addition, Smn+/
and
Gemin2+/
double heterozygotes had a 61% reduction
in nuclear Sm staining correlating with substantially increased motor
neuron loss compared with Smn+/
mice (Jablonka et
al. 2002
).
Smn+/
mice are normal at birth but develop
SMA-like symptoms within days owing to a normal developmentally
regulated decline in which SMN protein levels in the spinal cord
drop to <50% of fetal levels, primarily between postnatal days 5 and 15 (Hsieh-Li et al. 2000
; Jablonka et al. 2000
; Monani et al.
2000
). This down-regulation also occurs in humans (Burlet et al.
1998
), and individuals with type III SMA display a worsening of
symptoms that correlates with this down-regulation of SMN protein. The
drop in SMN protein occurs in several tissues that are unaffected in
SMA despite the fact that SMN protein levels are lower in these than
in the spinal cord (Lefebvre et al. 1997
; Burlet et al. 1998
). A
muscle-specific knockout of SMN induces severe muscular
dystrophy, indicating that substantial reduction of SMN will induce
intrinsic muscle disease (Cifuentes-Diaz et al. 2001
). These results
indicate that postnatal motor neurons require higher steady-state
levels of SMN protein than other metabolically active tissues.
What is the basis for cell-specificity in RP and SMA?
For both RP and SMA, the primary defect appears to be a loss of
function of essential splicing factors, although dominant-negative function for some RP alleles cannot be ruled out. How can the loss of
ubiquitous functions result in such remarkable cell-specific sensitivity? Because exons are diverse units of recognition, different exons are likely to exhibit a wide range of sensitivities to
deficiencies of essential splicing factors. Perhaps only a subset of
pre-mRNAs (or even one pre-mRNA) required for rod cell or motor neuron
viability is affected by deficiencies in the U4/U6 · U5 tri-snRNP
or SMN function, respectively. It can also be argued that cell-specific pre-mRNAs are more likely to be affected by a deficiency of a basal
splicing factor than pre-mRNAs that are widely expressed. In contrast
to cell-specific pre-mRNAs, widely expressed pre-mRNAs must have the
ability to undergo efficient splicing in a variety of nuclear
environments and presumably contain information in cis for
more robust splicing. The few essential splicing factors that have been
examined in vertebrates show surprisingly variable levels of expression
among different tissues that do not correlate with tissue metabolic
activity. For example, SF1, a spliceosome component involved in the
initial recognition of the branch site, is barely detectable in
pancreas, kidney, and lung, whereas PRP8 is barely detectable in liver
(Luo et al. 1999
; Vervoort et al. 2000
).
Even in yeast, where intron recognition is highly homogenous,
loss-of-function phenotypes for PRP2 and CEF1 are due
to defective removal of single introns (Chen et al. 1998
; Burns et al.
2002
). For example, a screen for mutants that disrupt transport of
secretory proteins from the endoplasmic reticulum (ER) to the Golgi
identified a well-characterized essential splicing factor, PRP2. PRP2
is an RNA-dependent ATPase required for the first transesterification reaction. The protein secretion defect of the PRP2 mutant was found to be caused by inefficient splicing of the intron of the SAR1 gene, which encodes a small GTPase required for ER
vesicle formation. The PRP2 protein secretion defect was suppressed by overexpressing the SAR1 cDNA or by removing the SAR1 gene
intron (Chen et al. 1998
). Similarly, Cef1p (CDC5 in S. pombe)
was genetically identified as a cell cycle regulator (Ohi et al. 1994
),
and a role in pre-mRNA splicing was subsequently found by genetic and biochemical analyses (for review, see Burns et al. 2002
). Global analysis of splicing in Cef1p mutants using an oligonucleotide array
(Clark et al. 2002
) demonstrated a significant general splicing defect.
Despite this, the G2/M block was relieved by replacing the
-tubulin gene with the
-tubulin cDNA, demonstrating that failure
to remove the single
-tubulin intron is primarily responsible for
the CEF1 loss-of-function phenotype (Burns et al. 2002
).
The opsin pre-mRNA is one potential target of the presumed tri-snRNP
deficiency in RP. The opsin protein binds covalently to a chromophore
to form the photopigment rhodopsin, which undergoes a conformational
change in response to photons that initiates the photodetection cascade
(Bessant et al. 2001
). Rhodopsin is embedded in the extensive array of
membranous discs present in each rod cell. The discs undergo daily
renewal just prior to waking (Korenbrot and Fernald 1989
), putting
considerable demand on the splicing machinery to produce huge amounts
of opsin mRNA. Insufficient production of rhodopsin caused by opsin
gene mutations also causes dominantly inherited RP, consistent with the
proposal that PRPF31, HPRP3, and PRPC8
mutations result in a rhodopsin deficiency secondary to a splicing defect.
Analogous potential targets required for motor neuron viability in SMA
are less obvious. It is possible that one or only a few pre-mRNAs are
affected by the sequence of events resulting from reduced assembly of
core snRNPs. It is also possible, however, that the pathogenic
mechanism in SMA is more complex than a loss of snRNPs. If an SMN
deficiency results in promiscuous association of Sm proteins with
inappropriate RNAs in vivo as it does in vitro (Pellizzoni et al.
2002
), a loss of function of those RNAs or a gain of function of the
aberrant complex could contribute to pathogenesis.
| |
Trans effects: mutations that affect regulators of alternative splicing |
|---|
|
|
|---|
Three mouse models illustrate the deleterious effects resulting from
the loss of factors that regulate splicing (SC35, QKI-5, and Nova-1;
Jensen et al. 2000
; Wang et al. 2001
; Wu et al. 2002
). All three
examples illustrate that inactivation of a splicing regulator in mice
specifically affects its natural pre-mRNA targets. The same specificity
is expected in human diseases caused by disrupted function of
alternative splicing regulators.
Myotonic dystrophy
Myotonic dystrophy (DM) is the one human disease in which disease
phenotype has been directly linked to disrupted regulation of
alternative splicing (Fig. 3D). DM is an autosomal dominant disorder
and the most common form of adult-onset muscular dystrophy, with a
worldwide incidence of 1 in 8000. DM is unusual because of its
phenotypic variability even within families and the diversity of
tissues affected. Symptoms include skeletal muscle hyperexcitability (myotonia), progressive muscle wasting, cardiac conduction defects, cataracts, smooth muscle dysfunction, testicular atrophy, an unusual form of insulin resistance, and neuropsychiatric and cognitive disturbances (Harper 2001
). Two types of DM have been identified. The
most common form is type 1 (DM1), which is caused by a CTG expansion in
the 3' untranslated region (UTR) of the DM protein kinase
(DMPK) gene located on Chromosome 19q13.3. Disease severity and age of onset correlate with repeat length, which ranges from 80 to
thousands of repeats. Unaffected individuals have fewer than ~40
repeats. DM type 2 (DM2) is caused by a large CCTG expansion in intron
1 of the ZNF9 gene on Chromosome 3q21 (Liquori et al. 2001
).
Several independent lines of evidence indicate that the predominant
mechanism for DM pathogenesis is a gain of function for RNA transcribed
from the expanded alleles. First, no point mutants or deletions
within the DM1 or DM2 loci cause DM, indicating that the repeats are determinative for these diseases rather than a loss of
function associated with the DM1 or DM2 loci. Second, the fact that two different loci containing similar expanded repeats cause strikingly similar diseases strongly suggests that DM1 and DM2
share a common pathogenic mechanism that is independent of a loss of
function for the affected locus. Third, RNAs containing long tracks of
CUG or CCUG repeats are transcribed from the expanded DMPK and
ZNF9 alleles, and both repeat-containing RNAs accumulate in
discrete nuclear foci (Taneja et al. 1995
; Davis et al. 1997
; Liquori
et al. 2001
). Fourth, transgenic mice (HSALR) expressing 250 CUG repeats in the 3'-UTR of the human skeletal
-actin gene
reproduced myotonia and the histopathological features observed in DM1
muscle (Mankodi et al. 2000
), demonstrating that expression of CUG
repeats independent of the DM1 locus is sufficient to induce
major features of the disease.
According to the RNA gain-of-function hypothesis, DM pathogenesis
results from disrupted RNA processing secondary to disrupted function
of RNA-binding proteins by the expanded RNA repeats (Wang et al. 1995
).
Consistent with this hypothesis, five pre-mRNAs have been shown to
undergo aberrantly regulated splicing in DM1 tissues and/or mouse
models: cardiac troponin T (cTNT), insulin receptor (IR),
muscle-specific chloride channel (ClC-1), tau, and myotubularin-related
1 (Philips et al. 1998
; Savkur et al. 2001
; Seznec et al. 2001
;
Buj-Bello et al. 2002
; Charlet et al. 2002b
). Misregulated splicing of
IR and ClC-1 pre-mRNAs is likely to directly cause two common symptoms
in individuals affected with DM1. The IR splicing switch observed in
DM1 skeletal muscle results in expression of a lower signaling IR
isoform directly correlating with the unusual form of insulin
resistance observed in individuals with DM1 (Savkur et al. 2001
).
Similarly, loss of ClC-1 function secondary to aberrantly regulated
splicing is sufficient to account for myotonia, the delayed muscle
relaxation following voluntary contraction caused by repeated firing of
action potentials. Recent results from individuals with DM1 and
HSALR mice demonstrate that aberrantly regulated splicing of
ClC-1 pre-mRNAs introduces PTCs resulting in NMD of the ClC-1 mRNA and ultimately loss of ClC-1 function (Charlet et al. 2002b
; Mankodi et
al. 2002
).
The mechanism by which CUG-repeat RNA induces disease is likely to
involve CUG-repeat-binding proteins. Several CUG-repeat-binding proteins have been identified including muscleblind, CUG-binding protein (CUG-BP), elav-type RNA binding protein 3 (ETR-3), which is
78% identical to CUG-BP, and protein kinase R (PKR; Timchenko et al.
1996
; Lu et al. 1999
; Miller et al. 2000
; Tian et al. 2000
). The
proteins from three human muscleblind genes (Fardaei et al. 2002
) are
homologs of a protein required for development of muscle and
photoreceptor cells in Drosophila (Begemann et al. 1997
;
Artero et al. 1998
). These proteins contain two Cys3His-type
zinc finger domains found in RNA processing and transcription factors.
CUG-repeat RNA with more than ~20 repeats forms double-stranded RNA
containing U-U mismatches (Napieraa and Krzyosiak 1997
; Michalowski et
al. 1999
), and muscleblind has a strong affinity for double-stranded CUG-repeat RNA in vitro and colocalizes with the nuclear foci containing CUG- and CCUG-repeat RNA in DM cells (Michalowski et al.
1999
; Miller et al. 2000
; Fardaei et al. 2001
). Because the function of
muscleblind is unknown, the consequences of muscleblind colocalization
with CUG-repeat RNA for splicing and DM pathogenesis remain to be determined.
CUG-BP was identified as a protein that bound to a single-stranded
synthetic (CUG)8 RNA. In contrast to muscleblind, CUG-BP does
not bind double-stranded CUG-repeat RNA (Miller et al. 2000
) and does
not colocalize with the CUG-repeat RNA in nuclear foci (Michalowski et
al. 1999
; Fardaei et al. 2001
). Although the physical evidence links
muscleblind and not CUG-BP with the nuclear foci of CUG-repeat RNA,
functional analyses indicate that increased activity of CUG-BP is
responsible for the aberrant regulation of cTNT, IR, and ClC-1
alternative splicing observed in DM1. First, CUG-BP is a
well-characterized alternative splicing regulator (Ladd et al. 2001
).
It is one of six paralogs called CUG-BP and ETR-3 Like Factors (CELF;
Ladd et al. 2001
) or Bruno-like (Brunol; Good et al. 2000
) proteins.
Second, the steady-state levels of CUG-BP protein are elevated in DM1
striated muscle tissues where aberrantly regulated splicing has been
demonstrated (Savkur et al. 2001
; Timchenko et al. 2001
). Third, cTNT,
IR, and ClC-1 pre-mRNAs are known targets for CUG-BP regulation
(Philips et al. 1998
; Savkur et al. 2001
; Charlet et al. 2002b
). For
all three pre-mRNAs, CUG-BP has been shown to bind to U/G-rich motifs
in introns adjacent to the regulated splice sites. Furthermore,
overexpression of CUG-BP with cTNT, IR, and ClC-1 minigenes in normal
cells induces the splicing patterns observed in DM1 striated muscle,
which are different for the different pre-mRNAs (cTNT exon 5 inclusion, IR exon 11 skipping, and ClC-1 intron 2 retention), consistent with the
increased steady-state levels observed in DM1 striated muscle.
Pre-mRNAs containing mutated CUG-BP-binding sites are no longer
regulated by CUG-BP overexpression. The effects of elevated CUG-BP
appear to be limited to its natural targets because the ratio of
alternatively spliced isoforms of hnRNP A1 is unaffected in DM1
(Philips et al. 1998
). Fourth, overexpression of CUG-repeat RNA with
cotransfected cTNT and IR minigenes induced the aberrant splicing
patterns observed in DM1 striated muscle (Philips et al. 1998
; Savkur
et al. 2001
). Minigene pre-mRNAs containing mutated CUG-BP-binding
sites did not respond to coexpressed CUG-repeat RNA, demonstrating that
CUG-BP, or another protein that binds to the CUG-BP-binding site,
mediates the splicing switch induced by CUG-repeat RNA. Fifth, a cTNT
minigene expressed in DM1 muscle cultures reproduced the aberrant
splicing pattern of endogenous cTNT, whereas a minigene containing a
mutated CUG-BP-binding site was not aberrantly regulated in DM1 muscle
cultures (Philips et al. 1998
). Taken together, these results indicate
that the aberrant regulation of these targets observed in DM1 skeletal
muscle is mediated by CUG-BP or other CELF proteins such as ETR-3 that
bind to the intronic regulatory elements.
A general model for the pathogenic mechanism of DM is that expression
of CUG- or CCUG-repeat RNA induces overexpression of CUG-BP, resulting
in misregulated splicing of its target pre-mRNAs (Fig.
5). The mechanism by which CUG-repeat RNA
induces CUG-BP expression is unknown and could be dependent or
independent of binding of muscleblind to CUG-repeat RNA. When
CUG-repeat RNA was expressed in COS cells, the half-life of endogenous
CUG-BP protein increased greater than twofold (Timchenko et al. 2001
), consistent with the increased steady-state levels observed in DM1
striated muscle tissue. The half-life of CUG-BP in DM1 cells remains to
be determined. In addition, CUG-BP phosphorylation and
nuclear:cytoplasmic distribution are altered in DM1 striated muscle
tissues (Roberts et al. 1997
); however, the relationship between these
changes and the aberrantly regulated splicing observed in these tissues
has not yet been established.
|