ORC binding, gene amplification, and the nature of metazoan replication origins

  1. Allan C. Spradling
  1. Howard Hughes Medical Institute, Carnegie Institution of Washington, Baltimore, Maryland 21210 USA

Genomic DNA replication is a critical process shaped by strong evolutionary pressures. Rapid replication can give a viral or cellular genome an immediate advantage over its competitors as long as requisitely high accuracy is not sacrificed. Prokaryotes employ an extremely fast, accurate DNA replication mechanism utilizing sophisticated multiprotein machines (Kornberg and Baker 1992; Baker and Bell 1998). Genome duplication is controlled by the assembly and activation of bidirectional replication forks at specific replication origins, under the control of initiator proteins that bind origins sequence specifically. For example, Escherichia coli oriC binds multiple copies of the initiator protein, dnaA, at repeated motifs. In an ATP-dependent manner dnaA then locally unwinds the DNA, facilitates helicase loading, and organizes the assembly of polymerases, primases, and other components of the replication fork. Special proteins are required to assemble and maintain this unwieldy machinery. Thus, loading the ring-shaped six-protein dnaB helicase around the DNA requires dnaC, whereas proteins such as tau probably act as scaffolds, organizing replication fork components (including polymerases).

Despite the greater complexity of eukaryotic chromosomes, work carried out in Saccharomyces cerevisiae indicates that its genome replicates via a strongly similar mechanism (Leatherwood 1998). Each yeast chromosome contains multiple discrete replication origins where prereplication complexes are assembled during late M and G1 phases. Fully formed, bidirectional replication forks are then activated during S phase. Replication origins can be readily identified as autonomously replicating sequence (ARS) elements that support the propagation of extrachromosomal plasmids. Budding yeast origins are ∼100 bp in size and share an A/T-rich ARS consensus sequence (ACS) that is essential for origin function. Six proteins that form the origin recognition complex (ORC) bind to the ACS in an ATP-dependent manner throughout the cell cycle (Bell and Stillman 1992). Beginning in late mitosis, Cdc6p interacts with origin-bound ORC to bring the six-protein MCM complex, a putative replicative helicase, onto the DNA (Aparicio et al. 1997; Perkins and Diffley 1997; Tanaka et al. 1997). In a late step requiring cyclin-dependent kinase (cdk) activity, Cdc45p is added to the prereplication complex, possibly as a scaffold component (Zou and Stillman 1999). Activation of another origin-associated kinase, Cdc7p, by its regulatory subunit Dbf4p, probably serves as the final signal activating replication fork movement (Dowell et al. 1994; Hardy 1998;Brown and Kelly 1999). By the time prereplication complexes fire, high cdk activity precludes the assembly of new complexes. Because a checkpoint ensures that the cell will not divide until replication is complete, and high levels of cdk activity are not reversed until mitosis is nearly finished, the genome will be replicated once and only once per cell cycle.

The metazoan enigma

The demands on DNA replication in metazoan eukaryotes significantly exceed those of simpler organisms. The genomes of Drosophilaor Xenopus are 15 or 250 fold larger, respectively, thanS. cerevisiae and are rich in recombination-prone repetitive DNA. Yet they need to be faithfully copied during embryonic cleavage stages in only half the time required to duplicate the E. coligenome. Metazoan genomes must replicate as many as 1014 times to produce the cells for a large animal. Moreover, the replication process must be compatible with hundreds of different cell types that are undergoing diverse processes of patterning and differentiation. In particular, the intricate and specific constellations of chromatin proteins in these cells need to be preserved. Given the extraordinary demands metazoans place on their replication machinery, it would not be surprising if new mechanisms not seen in yeast and prokaryotes had evolved.

Whether metazoans actually do utilize novel mechanisms to control DNA replication has remained controversial for most of the last two decades. Homologs of virtually all the major protein components of the budding yeast cell cycle and replication machinery have been identified. For example, a six-protein ORC complex inDrosophila binds DNA and is required for DNA replication (Gossen et al. 1995; Landis et al. 1997; Chesnekov et al. 1999). All six MCM homologs have also been identified (Feger 1999). Studies of replication in Xenopus oocyte extracts reveal strong similarities between budding yeast and higher eukaryotes (Pasero and Gasser 1998). Indeed, it is hard to imagine that a complex set of interacting proteins would have evolved a new way to carry out the same basic process. Nonetheless, other evidence exists to suggest that replication in metazoans differs significantly from the budding yeast paradigm.

One of the fundamental differences between metazoan and budding yeast replication lies in the nature of the cis-acting elements required for origin initiation. Metazoan DNA synthesis initiates during S phase at multiple sites, some of which are cell-type specific and parallel differences in gene expression. However, it has been difficult to identify and study specific replication origins and to link them to particular DNA sequences. In part this may reflect the fact that plasmid maintenance assays used to identify yeast ARSs, have not yielded positive results. Even in the 15–20 cases where origins have been mapped based on the chromosomal location of replicative intermediates (DePamphlis 1999) their nature remains ambiguous. Two-dimensional gel electrophoresis methods frequently reveal less specific initiation sites and suggest that replication begins within a broad ‘initiation zone’ rather than at one or a few specific sites. In budding yeast both methods give consistent results, and the explanation for these discrepancies remains elusive. Moreover, metazoan replication appears to be influenced by chromatin structure on an even larger scale. Multiple replication origins sometimes appear to be clustered within nuclei into large ‘replication factories’ (Newport and Yan 1996). It has even been suggested that at the sequence level there are no preferred initiation sites, and that origin usage is determined in an entirely epigenetic manner by higher order chromatin structure and interactions with the nuclear matrix (Gilbert 1998). Consistent with this view, extracts from Xenopus oocytes can replicate virtually any added DNA once and only once, suggesting specificcis-acting sequences are nonessential for metazoan replication.

The question of ORC binding specificity

One key approach to assessing the importance of specific DNA sequences in initiation is to analyze the DNA binding specificity of eukaryotic ORC proteins, whose role as initiator proteins is presumably conserved. The budding yeast paradigm predicts that the ORC complex should preferentially bind to specific DNA sequences found within functional replication origins, whereas in a chromatin-based model, ORC should bind widely distributed sequences. Previous studies have failed to resolve this issue. For example, reconstituted DrosophilaORC complexes were not observed to bind to specific sequences (Chesnikov et al. 1999). The Schizosaccharomyces pombe Orc4 protein does show affinity for the ars1 origin (Chaung and Kelly 1999), but the specificity of this binding remains unclear.

Consequently, the paper in this issue by Austin et al. is of particular interest. The authors show for the first time that DrosophilaORC binds sequence specifically to regulatory elements known asACE3, ACE1, and AER-d that have been implicated genetically in controlling replication initiation during eggshell (chorion) gene amplification. Specific association between ORC and replicator sequences was observed both in vivo and, using purifiedACE3 and AER-d DNA in vitro. Moreover, binding in vitro depended on the presence of ATP, like ORC binding to yeast ARS elements. In vivo, ORC associated with sequences extending several Kb from ACE3, and formed a doughnut-shaped structure that was visualized by immunofluorescence microscopy. Finally, in the absence ofACE3, both ORC binding and DNA amplification were abolished.

This work provides the strongest evidence to date that sequence-specific recognition by ORC plays a role in activating at least some metazoan replicons. However, one might consider whether the nature of the amplification control sequences recognized byDrosophila ORC are suitable as models for normal replication origins. During a specific period of Drosophila oogenesis beginning at stage 10B, ovarian follicle cells amplify two specific genomic regions containing clusters of chorion genes by supporting extra rounds of replication initiation and bidirectional fork elongation (for review, see Orr-Weaver 1991; Calvi and Spradling 1999). Amplification of the third chromosome gene cluster is strongly dependent on ACE3, a small cis-regulatory element located about 1.5 Kb from AER-d, anothercis-regulator that probably serves as the major replication origin. A strong cis-regulatory sequence involved in amplification of the first chromosome cluster, ACE1, has been less well localized. Unlike normal replication, initiation events during amplifcation are not limited to once per cell cycle. Moreover, late-stage follicle cells are polytene and most amplification takes place after the rest of the genome has ceased replicating.

Despite these differences, recent studies have documented a close relationship between chorion gene amplification and cell cycle-regulated chromosomal replication. First, a detailed examination of the follicle cell cycles indicated that amplification occurs during a final S phase that is under the control of Cyclin E (Calvi et al. 1998). Inactivation of Cyclin E by misexpressing the p27-like genedacappo in follicle cells causes an immediate block to amplification. Thus, amplification, like normal replication, appears to require cdk activity. Second, mutations have been identified in E2F and DP that selectively block amplification, indicating that these two S phase initiator proteins required for normal replication are also needed for amplification (Royzmann et al. 1999). Third, the fact that 16 sister chromatids lie in close proximity in the polytene follicle cells allows proteins located at the amplification sites to be visualized by immunofluorescence microscopy prior to and during amplification. ORC1 and ORC2 protein can both be seen to localize to the sites where amplification will take place during stage 10A, just prior to the onset of the bulk of amplification (Asano et al. 1999;Royzmann et al. 1999), but are absent in some E2F and DP mutants.

Further evidence that amplification utilizes normal replication machinery comes from studies of additional mutations that block amplification. A class of at least ten genes have been identified in which female-sterile mutations disrupt chorion gene amplification, causing viable adult females to produce thin eggshells. Amplification mutations probably select for leaky alleles of genes that are specifically involved in replication initiation. A twofold reduction in the rate of initiation may be compensated for during normal replication by an increase in the length of S phase, but probably causes an exponentially increasing deficit in gene copy number during the short, fixed time period allotted for amplification. In this sense, amplification mutants resemble mutants selected for defective plasmid or minichromosome maintenance (Hogan and Koshland 1992; for review, seeTye 1999). The first amplification gene isolated, k43, encodes ORC2 (Landis et al. 1997). This confirms that ORC2 protein participates in amplification in vivo. Recently, another gene, chiffon, was characterized and found to contain a segment that is related to S. cerevisiae Dbf4 and its S. pombe counterpart Dfp1 (Landis and Tower 1999). Metazoan homologs related to the Cdc7 kinase had been described previously, and Chiffon is a strong candidate for its regulatory subunit. Clearly, several major trans-acting components of normal DNA replication initiation also participate in the repeated initiation events that produce local amplification.

The cis-regulatory sequences within amplification control elements also generally resemble the few eukaryotic origin sequences outside of budding yeast that have been analyzed genetically (Clyne and Kelly 1995; Dubey et al. 1996; Aladjem et al. 1998; Kim and Huberman 1998, 1999). These elements are larger than budding yeast origins: 3–4 kb in the case of the chorion locus, 0.5–2 kb in the case of S. pombe origins, and 3–4 kb in the case of the mouse globin region. Moreover, in both the chorion and globin replicators, normal flanking regions appear to contain redundant elements that act to enhance the overall efficiency of replication. In addition, one or more small zones with a strong effect on replication are found within the core region. Thus, the 430-bp ACE3 segment was essential for amplification in the context of a 7.7-kb genomic fragment (Orr-Weaver and Spradling 1986). Three separate regions of ∼50 bp in S. pombe ars3001reduced its activity at least 30 fold (Kim and Huberman 1998). Furthermore, evidence of internal redundancy was observed in both elements (Orr-Weaver et al. 1989; Kim and Huberman 1998). Finally, the activity of both types of elements is subject to position effects when tested at ectopic genomic sites (deCicco and Spradling 1984; Aladjem et al. 1998). Normally, the mouse replicator requires an upstream locus control region (LCR), but was LCR-independent at two tested sites in CV-1 tissue culture cells. Position effects on amplification can also be buffered by a chromatin boundary element, namely su(Hw) binding sites (Lu and Tower 1998). Several of these characteristics have also been reported for the small elements controlling replication of the small rDNA minichromosomes in Tetrahymena macronuclei (Blomberg et al. 1998; Resichman et al. 1999).

The sequences within ACE3 and other eukaryotic origins also exhibit some similarities. Origins in S. pombe are frequently A/T-rich, contain runs of A and T residues, and display internal repeats. The ACE3 region contains conserved sequences, including an 80-bp A/T-rich sequence that is >90%identical between Drosophila melanogaster, D. grimshawi, and D. virlis (Swimmer et al. 1990). This is particularly striking given that these species are estimated to have diverged >40 million years ago and show no other comparable regions of homology throughout the rest of the locus. This region contains several of the characteristics cited above, as the first 21 bp form a near-perfect direct repeat, and there are runs of Ts throughout the sequence. However, it is not essential for amplification (Orr-Weaver et al. 1989). Most of this highly conserved segment (Fig.) is contained within a 104-bp DNA fragment that stimulated strong ORC binding to purified DNA (Austin et al. 1999), suggesting that it might be a direct target of ATP-dependent ORC binding. Sequence-specific ORC binding was also observed with another region, AER-d, located ∼1.5 kb from ACE3. InD. melanogaster, this region also contains an A/Trich region but its sequence is not conserved in the other species. It is not known where replication initiates in theD. grimshawi and D. virilis clusters.

An alignment of DNA sequences within an 89bp conserved region located in a similar position several hundred base pairs upstream from the transcription start site of s18 gene in the chorion gene clusters of three Drosophila species,D. melanogaster (X06257), D. grimshawi (X53422), andD. virilis (X53421). Nucleotides that differ from the consensus are underlined.


Amplification: model or exception?

These observations argue that the chorion gene cluster replicates under cell cycle control using at least some homologs of budding yeast replication proteins. Moreover, the general characteristics of thecis-regulatory amplification control elements resemble the larger, more complex initiator sequences used by eukaryotes other than budding yeast. Despite this, it remains possible that the ORC binding observed by Austin et al. (1999) represents an exceptional situation. During stage 10A, ORC binding is observed only at the two major chorion loci that are about to amplify, as well as two minor loci, but not throughout the rest of the nucleus as would be expected during a normal cell cycle (Royzmann et al. 1999). ORC binding to the chorion loci just prior to amplification may require an exceptional, sequence-specific mechanism rather than the chromatin-based mechanism used during normal replication. The highly conserved sequence withinACE3 may have been preserved to help load ORC complexes under these circumstances, but not during normal replication. Even if sequence-specific ORC binding is a property of amplification control elements but not normal origins, further study will nonetheless assist in understanding how ORC helps assemble competent replication forks on metazoan chromosomes.

It seems at least as likely, however, that the control elements studied by Austin et al. (1999) are essentially normal metazoan replicator elements that are acted upon by as-yet undetermined mechanisms in late follicle cells to cause repeated rounds of initiation. In this view, the sequence-specific ORC binding observed by Austin and colleagues would be a general property of metazoan replication origins. The unusual requirement of ACE3 for ORC binding throughout the region might indicate that in larger, more complex origins, sequences that mediate initial ORC binding can reside at a larger distance from the eventual site of initiation than is commonly seen in budding yeast origins (Bielinsky and Gerbi 1998). Perhaps, ACE3 mediates the initial binding and loading of many ORC molecules that spread over several kilobases of surrounding DNA. Eventually, replication forks would be assembled and initiate at other sites within the affected region, particularly AER-d.

This model gains support from several additional observations reported by Austin et al. (1999). The strength of the immunofluoresence signals suggested that large amounts of ORC are present on the chorion loci, even in stage 10A. ORC complexes could be cross-linked in vivo to sequences that were located several kilobases distant fromACE3. Moreover, ORC protein showed a striking doughnut-like organization on amplifying chorion loci in stage 10B.

If origins used during normal S phase do bind ORC, then the work ofAustin et al. (1999) establishes an assay that should allow many additional origins to be identified elsewhere in the genome. If ORC is loaded onto an extended region surrounding sequence-specific binding sites at other metazoan replication origins, this might somehow explain the observed discrepancies between different methods of origin mapping. Whatever the answers to these particular questions, the prospect of finally resolving the enigma of metazoan DNA replication has clearly been amplified.

Footnotes

  • E-MAIL ; FAX (410) 243-6311.

References

« Previous | Next Article »Table of Contents