- I can explain why DNA in chromosomes has to be unwrapped and un-zipped into two strands for replication.
- I can explain why the two DNA strands have to be treated differently when the complementary bases are being added to return them to the 2-stranded state.
- I can use the similarities and differences in DNA replication in Bacteria, Archaea, and Eukarya to compare their evolutionary histories.
DNA replication requires three processes: the initiation of replication, the elongation of the new DNA during actual copying, and the termination of the DNA replication process. Each of these consists of reactions between enzymes and the DNA macromolecule. And each has to be performed precisely to produce two identical strands of DNA.
The importance of DNA replication has led to it being extensively studied in cells, replicated as chemical reactions in experiments, and modeled at a molecular level. The basic processes for DNA replication are similar in Bacteria, Archaea, and Eukarya, although there are significant differences in the details of the processes. The text below describes the processes of initiation, elongation and termination generally, and then discusses some key differences among the three domains of life as well as for DNA in organelles (like plastids and mitochondria), plasmids, and viruses.
Let's start, however, with two beautiful animations of the elongation process (in a eukaryotic cell) based on a very detailed molecular simulation. The first video shows the overall process, and the second video shows how the new strand of DNA is built. Each process illustrated in the videos is caused by spontaneously chemical reactions when the right molecules are present in the right places in the cells. Evolution over billions of years has produced these impressive molecular machines.
Chromosomal DNA is typically wrapped around histones (in Archaea and Eukarya) or histone-like proteins (in bacteria). When DNA is wrapped and twisted on itself, the DNA molecule is protected from interacting with most other molecules and can not be copied. However, specific enzymes change the shape and supercoiling of the chromosome, and these enzymes can expose the origin of replication, which is a specific nucleotide sequence that binds to the specific proteins that initiate the replication process. Every chromosome has at least one origin of replication so that it can be copied, and some Archaea and Eukarya have more than one.
The initiation of replication starts at the origin of replication and consists of converting the DNA into a strand that is accessible for replication. An enzyme called helicase binds to the DNA at the origin of replication and separates it into two strands by breaking the hydrogen bonds between the nitrogenous base pairs. As the DNA strands "un-zip", they are coated with special proteins to prevent the now single-stranded DNA from rewinding into a double helix, which is its more stable form. These stabilized single DNA strands extend outward from the helicase enzyme to produce a Y-shaped structure called a replication fork. A special RNA sequence, called a primer, binds to the DNA strands near the replication fork to help create the right chemical environment for building the complementary DNA strand for each of the original strands being copied. At this point, DNA replication can begin.
The complementary DNA strand is added to the original DNA strand by an enzyme called DNA polymerase, which is held near the replication fork by an enzyme that facilitates smooth passage of the single DNA strand to the polymerase. As the DNA strand slides through, the DNA polymerase adds the appropriate nucleotide to complements. The addition of these nucleotides requires energy, which comes from breaking bonds in the phosphate groups attached to each nucleotide (a triphosphate nucleotide). When the bond between the phosphates is broken, diphosphate is released along with energy that allows the formation of a covalent bond between the incoming nucleotide and the growing DNA strand. The DNA polymerase catalyzes this process.
DNA polymerase can only extend the complementary strand of DNA in one specific direction defined by the geometry of the pentose sugar and phosphate groups bound to the nitrogenous bases in DNA. However, the DNA double helix is antiparallel; that is, one strand is oriented in one direction and the other is oriented in the opposite direction (see Structure and Function of DNA). Thus, during replication, one strand of the original DNA is oriented in the right direction to be copied whereas the other one is not; the two strands have to be treated differently at the replication fork. DNA polymerase adds nucleotides to the properly oriented strand, the "leading strand", without complications. However, another DNA polymerase has to add bases to the other strand, the "lagging strand", in the opposite direction. This requires the replication fork to create loops of the DNA and attach DNA polymerases to it repeatedly. In this process, a DNA polymerase attaches to the single stranded DNA and builds the complementary strand while moving away from the replication fork. Once it hits part of the DNA strand that already has the complement, it jumps back toward the replication fork to begin adding bases to a new place in the DNA. These steps produce a double-stranded DNA molecule with RNA primers located at each jump point. These RNA primers are replaced with DNA as one of the final steps in the elongation process.
Once the complete chromosome has been replicated, DNA replication is terminated. Although much is known about initiation of replication, less is known about the termination process, and it is different for Bacteria and Eukarya. For organisms with circular chromosomes (most Bacteria and Archaea), the elongation process stops when two replication forks encounter each other. If there is only one origin of replication on the circular chromosome, the two forks will encounter each other once full chromosome has been copied. However, the new chromosomes are interlocked, so they have to be broken apart and the ends have to be joined to reform the circle. For organisms with linear chromosomes, termination happens when either two replication forks meet or the end of the chromosome is reached. At an end, there is no place to add a primer on the lagging DNA strand for the DNA replicase to attache to. Thus, the ends of linear chromosomes can remain unpaired and, over time, linear chromosomes may get progressively shorter, losing information.
Variations in DNA Replication
DNA replication follows a similar process in Bacteria, Archaea, and Eukarya: they all have a genetic marker for the origin of replication, similar mechanisms for elongation, and the need for termination. However, the actual enzymes that perform the replication show some variations related to their evolutionary history. The genes that code for the enzymes that structure the replication fork are very similar in all three domains (Bell, 2017), which suggests that the basic process for duplicating DNA evolved in the last common ancestor of all life. In contrast, most of the other genes involved in DNA replication are similar in Archaea and Eukarya, but different in Bacteria (Bell, 2017). The differences between Bacteria and Archaea/Eukarya are consistent with other evolutionary relationships between the groups that suggest Archaea and Eukarya are more closely related to each other than they are to Bacteria.
Plasmids and some viruses also require DNA replication to reproduce. They lack the genes to make their own replication enzymes, so they rely on their host cells for DNA replication.
DNA replication has been well studied in bacteria primarily because of the small size of the genome and the mutants that are available. E. coli has 4.6 million base pairs (Mbp) in a single circular chromosome and all of it is replicated in approximately 42 minutes, starting from a single origin of replication and proceeding around the circle in both directions. This means that approximately 1000 nucleotides are added per second. The process is quite rapid and occurs with few errors.
Most archaea also have a single circular chromosome, but their processes of DNA replication are less well studied. The enzymes for DNA replication in Archaea is similar to those in Eukarya, but they are in general simpler (Bell, 2017). Most archaeal chromosomes have more than one origin of replication, with four being the maximum documented to date. There are lots of unanswered research questions that can be pursued to better understand how archaea function as cells.
Eukarya Nuclear DNA
Eukaryotic genomes are much more complex and larger than bacterial and archaeal genomes and are typically composed of multiple linear chromosomes. The human genome, for example, requires the insertion of 6 billion base pairs are inserted during replication. There are multiple origins of replication on each eukaryotic chromosome; the human genome has 30,000 to 50,000 origins of replication. The rate of replication is approximately 100 nucleotides per second—10 times slower than bacterial replication.
Eukaryotic chromosomes are linear, which means that their ends sometime contain unpaired bases. Thus, over time, they may get progressively shorter as cells continue to divide. To help prevent important information from being lost, the ends of the linear chromosomes consist of noncoding repetitive sequences called telomeres. The telomeres protect coding sequences from being lost as cells continue to divide. In humans, telomeres consist of 100 to 1000 repetitions of a six base-pair sequence, TTAGGG. There is a specific enzyme that attaches to this sequence at the end of the template DNA for the lagging strand of DNA and extends it with an RNA template. Once the lagging strand template is long enough for the DNA polymerase to shift to a new primer, the polymerase can finish adding the complementary nucleotides to the end of the chromosome.
In humans, telomerase is typically active in germ cells and adult stem cells; it is not active in adult somatic cells and may be associated with the aging of these cells. Eukaryotic microbes including fungi and protozoans also produce telomerase to maintain chromosomal integrity. For her discovery of telomerase and its action, Elizabeth Blackburn (1948–) received the Nobel Prize for Medicine or Physiology in 2009.
Eukaryotes also host DNA in their organelles. This orDNA was inherited from bacteria that merged ancient ancestors to modern eukaryotes through a process called endosymbiosis. Over hundreds of millions of years, some of the bacterial genes were transferred to the host cells, some were lost, and all evolved in response to being permanently encased within another cell. One of the key processes transferred from the ancestors to the organelles to the host cells was DNA replication. The replication of orDNA is controlled by genes in the nucleus, even though orDNA replication is not necessarily coordinated with replication of nuclear DNA. The replication process is also variable among organisms, in part due to the different evolutionary paths of different eukaryotic lineages after the endosymbiosis events. Overall, orDNA replication is less well understood by scientists than nuclear DNA replication (see https://www.frontiersin.org/articles/10.3389/fpls.2015.00883/full), even though it is critical for the function of most eukaryotes.
Plasmids and Viruses
Plasmids and DNA viruses depend on host cell DNA replication processes to replicate their DNA. Plasmids are small circular molecules of double-stranded DNA that are separate from the main chromosome(s) of their host cells. They usually contain one or two non-essential genes, like antibiotic resistance, and none of the genes necessary for DNA replication. Thus, they depend on enzymes coded in genes in the host cell chromosomes to replicate their DNA. Plasmids have their own origin of replication to initiate DNA replication, but they will only be replicated if the host cell’s enzymes recognize their origin of replication. Because origins of replication have different coding sequences in Bacteria, Archaea, and Eukarya, plasmids are usually specific to one domain of life or even a smaller subset of organisms. In addition, sometimes the replication process proceeds differently than for chromosomal DNA, initiating with a nick in one of the DNA strands and elongating the two strands separately.
Viruses also depend on their host cell’s enzymes for replication. Some viruses contain DNA whereas others have RNA. DNA viruses do not include all the genes for DNA replication enzymes. Thus, they rely on their host cell’s cellular machinery to produce more viral particles and use multiple methods for tricking host cells into replicating their DNA. RNA viruses can replicate their genetic code more easily since RNA is a reactive molecule and does not require the processes of initiation, elongation, and termination essential for DNA replication. For more information on viruses, see The Viral Life Cycle.
More Information and References
Bell, S.D., 2017. Chapter 5 - Initiation of DNA Replication in the Archaea, in H. Masai, M. Foiani (eds.), DNA Replication, Advances in Experimental Medicine and Biology 1042, https://doi.org/10.1007/978-981-10-6955-0_5