Methods Enzymol. 2011; 498: 349–361. Three methods for assembling multiple, overlapping DNA molecules are described. Each method shares the same basic approach: (i) an exonuclease removes nucleotides from the ends of double-stranded (ds) DNA molecules, exposing complementary single-stranded (ss) DNA overhangs that are specifically annealed; (ii) the ssDNA gaps
of the joined molecules are filled in by DNA polymerase, and the nicks are covalently sealed by DNA ligase. The first method employs the 3′-exonuclease activity of T4 DNA polymerase (T4 pol), Taq DNA polymerase (Taq pol), and Taq DNA ligase (Taq lig) in a two-step thermocycled reaction. The second method uses 3′-exonuclease III (ExoIII), antibody-bound Taq pol, and Taq lig in a one-step thermocycled reaction. The third method employs 5′-T5
exonuclease, Phusion® DNA polymerase, and Taq lig in a one-step isothermal reaction and can be used to assemble both ssDNA and dsDNA. These assembly methods can be used to seamlessly construct synthetic and natural genes, genetic pathways, and entire genomes and could be very useful for molecular engineering tools. Keywords: In vitro recombination, Enzymatic DNA assembly, Genome synthesis, Gene synthesis, Synthetic biology, Homologous
recombination, Cloning multiple fragments For nearly 40 years, scientists have had the ability to join DNA sequences to produce combinations that are not present in nature. “Recombinant DNA technology” was initiated soon after the discovery of DNA ligase (Gellert, 1967,
Weiss and Richardson, 1967) and restriction endonucleases (Smith and Wilcox, 1970). In 1972, Berg and colleagues constructed the first recombinant DNA fragment, an SV40 hybrid DNA molecule
(Jackson et al., 1972). In the following year, Cohen et al. (1973) combined two antibiotic resistance markers in a single plasmid and showed that when bacterial cells take up the recombined plasmid, they become resistant to both antibiotics. By then, a 77-nucleotide gene
encoding a yeast alanine transfer RNA had been synthesized from 17 overlapping oligonucleotides (oligos) (Agarwal et al., 1970). It was not long before it was announced that the human genes for somatostatin (Itakura et al., 1977) and insulin
(Crea et al., 1978, Goeddel et al., 1979) were synthesized and inserted into a vector for expression in Escherichia coli. Polymerase chain reaction (PCR) allows specific genes to be amplified from a complex mixture of genomic DNA and cloned into a vector.
Restriction sites can be added to PCR primers to allow amplified products to be inserted into vector cloning sites. Overlapping DNA fragments can be assembled during PCR (Horton et al., 1990) using primers specific to the ends of the assembly; however, the size of the product cannot exceed the length that can be readily amplified by PCR. Complementary vector sequence is added to PCR
primers in a technique known as ligation-independent cloning (LIC) (Aslanidis and de Jong, 1990). An improvement on this method was made and is known as sequence and ligation-independent cloning (SLIC) (Li and Elledge, 2007), which removes the sequence constraints of LIC. Both methods
utilize exonuclease activity to expose complementary ssDNA sequences that can be specifically annealed. In these two systems, ligation of the DNA molecules is not performed in vitro but instead performed within a host organism following transformation of the annealed molecules. Type II restriction enzymes have traditionally been used to clone DNA fragments by inserting them into a vector. However, this approach is of limited value for assembling multiple pieces at once
and for producing large DNA fragments that do not contain a restriction site “scar.” On the other hand, type IIS restriction enzymes, which cleave adjacent to but outside their recognition sites to produce sticky ends specific for the sequences to be joined, have become widely used for constructing seamless DNA molecules. For example, this scheme was used to assemble a 31.5-kb full-length infectious cDNA of the group II coronavirus mouse hepatitis virus strain A59
(Yount et al., 2002) and a completely synthetic 32-kb polyketide synthase gene cluster (Kodumal et al., 2004). However, as the recombined DNA molecules get larger, it becomes increasingly difficult to find a type IIS restriction enzyme that does not cut within the
assembly. Extensive reengineering of genetic elements relies on technology that enables the assembly of small synthetic oligos. A number of in vitro enzymatic strategies are available for the assembly of single-stranded (ss) oligos into larger double-stranded (ds) DNA constructs (Czar et al., 2008,
Xiong et al., 2008a, Xiong et al., 2008b). For example, oligos can be joined by polymerase cycling assembly and subsequently amplified by PCR (Smith et al., 2003,
Stemmer et al., 1995). These dsDNA fragments can then be assembled into a vector by any of the methods described earlier and cloned in E. coli. Overlapping DNA molecules can also be joined by the action of three enzymes: (i) an exonuclease, which chews back the ends of the fragments and exposes ssDNA overhangs that are specifically annealed; (ii) a polymerase, which
fills the gaps in the annealed products; and (iii) a ligase, which covalently seals the resulting nicks. A two-step thermocycled in vitro recombination method was used to join 101 overlapping DNA cassettes into four quarter molecules of the Mycoplasma genitalium genome, each between 136 and 166 kb in size (Gibson et al., 2008). Since then, two additional in
vitro recombination methods were described that can join and clone DNA molecules larger than 300 kb, and can be carried out in a single step (Gibson et al., 2009). The simplest approach is a one-step isothermal (ISO) reaction that can be used to join both ssDNA and dsDNA
(Gibson et al., 2009, Gibson et al., 2010). All three assembly systems utilize the three enzymes above to produce DNA molecules that are covalently joined, and polyethylene glycol (PEG), a macromolecular crowding agent, to stimulate recombination. Because the DNA products are covalently
joined, PCR or rolling-circle amplification (RCA) can be performed directly from the reactions (Gibson et al., 2009, Gibson et al., 2010). Although three recombination methods are described later, the one-step ISO system is typically used due to its simplicity. We found that
all components of this assembly system can be premixed and kept frozen until needed. Thus, all that is required for DNA assembly is for input ssDNA or dsDNA to be added to this mixture, and the mixture to be briefly incubated at 50 °C. These approaches could be very useful for cloning multiple inserts into a vector without relying on the availability of restriction sites and for rapidly constructing large DNA molecules. For example, regions of DNA too large to be amplified by a single PCR
event can be divided into multiple overlapping PCR amplicons and then assembled into one piece. The ISO system is advantageous for assembling circular products, which accumulate because they are not substrates for any of the three enzymes. The one-step thermocycled method, however, can be used to generate linear assemblies because the exonuclease is inactivated during the reaction. Protocols for these recombination methods are provided later. DNA molecules are designed such that neighboring fragments contain at least 40 bp of overlapping sequence. However, as much as 500 bp can be used if the procedures are slightly modified as indicated below. If the DNA fragments will originate from PCR products, 40 bp overlapping sequences are introduced at the 5′ends of the primers used in the amplification reactions. DNA molecules can also be
assembled from overlapping restriction fragments. The noncomplementary partial restriction sites will be removed during recombination and form a contiguous piece of DNA without intervening sequence. DNA fragments are often assembled with a vector to form a circular product. PCR amplification can be used to produce a unique vector containing terminal overlaps to the ends of the DNA fragments being joined. To produce these cloning vectors, each PCR primer includes an overlap with one end of the
vector, a restriction site (e.g., NotI) not present within the insert to allow its release from the vector, and an overlap with one end of the DNA fragment assembly. An example of how to assemble a linear fragment into PCR-amplified pUC19 is shown in Fig. 15.1 . 1. E.
coli strains carrying overlapping restriction fragments, contained within a plasmid, are propagated in Luria Broth (LB) containing the appropriate antibiotic and incubated at 30 or 37 °C for 16 h. The cultures are harvested, and the DNA molecules are isolated using a commercially available kit (e.g., Qiagen's HiSpeed Plasmid Maxi Kit) or by following a standard alkaline-lysis procedure. Plasmid DNA is eluted or resuspended in Tris–EDTA buffer, pH 8.0 (TE buffer). Overlapping DNA
fragments are then released from the vector by restriction digestion. These reactions are terminated by heat inactivation or phenol–chloroform–isoamyl alcohol (PCI) extraction and ethanol precipitation. DNA is dissolved in TE buffer and then quantified by gel electrophoresis with standards. 2. Overlapping PCR fragments are produced with a high-fidelity (HF) DNA polymerase such as the hot-start Phusion® polymerase (New England Biolabs, NEB) with the HF
buffer. Results may be improved by gel purifying the PCR products prior to DNA assembly. However, this is not necessary and instead, reactions may be column purified (e.g., QIAquick PCR purification Kit, Qiagen). For the one-step thermocycled and ISO systems, which are not inhibited by the presence of dNTPs, PCR products may be directly used in assembly reactions without additional purification. PCR products are quantified by gel electrophoresis with standards. Assembly vector primer design. (A) A linear DNA sequence that is to be assembled into a vector. The first and last 40 bp of DNA sequence is underlined. (B) Two primers that could be used to PCR-amplify pUC19 to produce a vector containing overlaps to the sequence shown in (A), thus producing a circle. The primer sequences include regions that can anneal to pUC19 (nonbolded, lowercase), NotI restriction sites (bolded and italicized, uppercase) to release the insert from the vector, and 40-bp overlaps (underlined, uppercase) to the ends of the DNA sequence shown in (A). 3. Two-Step Thermocycled Assembly of Overlapping dsDNAThis two-step in vitro recombination method for assembling overlapping DNA molecules makes use of the 3′-exonuclease activity of T4 DNA polymerase (T4 pol) to produce ssDNA overhangs, and a combination of Taq DNA polymerase (Taq pol) and Taq DNA ligase (Taq lig) to repair the annealed joints (Gibson et al., 2008, Gibson et al., 2009; Fig. 15.2 ). The reaction is carried out in a thermocycler, in two steps. In the first step, the 3′-ends of the DNA fragments are digested to expose the overlap regions using T4 pol in the absence of dNTPs. The T4 pol is then inactivated by incubation at 75 °C, followed by slow cooling to anneal the complementary overlap regions. In the second step, the annealed joints are repaired using Taq pol and Taq lig at 45 °C in the presence of all four dNTPs. Taq pol is generally used as the gap-filling enzyme in this system because it does not strand-displace, which would lead to disassembly of the joined DNA fragments. It also has inherent 5′-exonuclease activity (or nick translation activity) (Chien et al., 1976), which eliminates the need to phosphorylate the input DNA (a requirement for DNA ligation). This is because 5′-phosphorylated ends are created following nick translation. Further, this activity removes any noncomplementary sequences (e.g., partial restriction sites at the ends of overlapping restriction fragments), which would otherwise end up in the final joined product.
Two-step thermocycled in vitro recombination. Two adjacent DNA fragments (magenta and green) sharing terminal sequence overlaps (thickened black line) are joined into one covalently sealed molecule by a two-step thermocycled reaction. 4. One-Step Thermocycled Assembly of Overlapping dsDNAA DNA assembly method that requires the absence of dNTPs to achieve exonuclease activity, such as the T4 pol-based system described earlier, cannot be completed in one step. This is because dNTPs are required at a later point to fill in the gapped DNA molecules. ExoIII, which removes nucleotides from the 3′-ends of dsDNA, is fully functional even in the presence of dNTPs, so it is a candidate for a one-step reaction. However, it will compete with polymerase for binding to the 3′-ends. To eliminate this competition, and allow for one-step DNA assembly, antibody-bound Taq pol (Ab-Taq pol) is used in combination with ExoIII (Fig. 15.3 ). In this assembly method, overlapping DNA fragments and all components necessary to covalently join the DNA molecules (i.e., ExoIII, Ab-Taq pol, dNTPs, Taq lig, etc.) are added in a single tube and placed in a thermocycler (Gibson et al., 2009). At 37 °C, ExoIII is active (but Ab-Taq pol remains inactive) and recesses the 3′-ends of the dsDNA molecules. The reaction is then shifted to 75 °C, which inactivates ExoIII. Annealing of the DNA molecules commences and the antibody dissociates from Taq pol, thus activating this enzyme. Further annealing, extension, and ligation are then carried out at 60 °C.
One-step thermocycled assembly of overlapping DNA segments. Two adjacent DNA fragments sharing terminal sequence overlaps are joined into one covalently sealed molecule by a one-step thermocycled reaction. 5. One-Step ISO Assembly of Overlapping dsDNAExonucleases that recess dsDNA from 5′-ends, and are not inhibited by the presence of dNTPs, are also candidates for a one-step DNA assembly reaction. Further, these exonucleases will not compete with polymerase activity. Thus, all activities required for DNA assembly can be simultaneously active in a single ISO reaction. A 50 °C ISO assembly system has been optimized using the activities of the 5′-T5 exonuclease (T5 exo), Phusion® DNA polymerase (Phusion® pol), and Taq lig (Gibson et al., 2009; Fig. 15.4 ). Taq pol can be used in place of Phusion® pol; however, Phusion® pol is preferred, as it has inherent proofreading activity for removing noncomplementary sequences from assembled molecules. In addition, Phusion® pol incorporates the incorrect nucleotide at a significantly lower rate.
One-step isothermal assembly of overlapping DNA fragments. Two adjacent DNA fragments sharing terminal sequence overlaps are joined into one covalently sealed molecule by a one-step isothermal reaction. 6. One-Step ISO DNA Assembly of Overlapping ssDNAThe one-step ISO reaction described earlier can also be used to directly assemble oligos into the pUC19 cloning vector (Gibson et al., 2010; Fig. 15.5 ). The assembled products are then cloned in E. coli, and the errors originating from the chemical synthesis of the oligos are weeded out by DNA sequencing. To ensure that error-free molecules are obtained at a reasonable efficiency, only eight to twelve 60-base oligos are assembled at one time. The resulting dsDNA molecules can then be assembled by any of the methods described earlier.
Isothermal assembly of overlapping oligonucleotides into pUC19. Eight 60-base oligos (red lines) are directly assembled into pUC19 (gray lines), in vitro, to produce a dsDNA fragment. N indicates the NotI restriction site (black line), which is added to release the fragment from the pUC19 vector. AcknowledgmentsThe author would like to thank Synthetic Genomics, Inc. (SGI) for funding this work and the synthetic biology group at JCVI for helpful discussions. References
What is a contiguous piece of DNA?A contig (as related to genomic studies; derived from the word “contiguous”) is a set of DNA segments or sequences that overlap in a way that provides a contiguous representation of a genomic region.
What are contigs and scaffolds?A scaffold is a portion of the genome sequence reconstructed from end-sequenced whole-genome shotgun clones. Scaffolds are composed of contigs and gaps. A contig is a contiguous length of genomic sequence in which the order of bases is known to a high confidence level.
How can you create overlapping fragments quizlet?How can you create overlapping fragments? - Digest multiple copies of the genome partially with a restriction enzyme.
What is a contigs file?Contigs are contiguous fragments of DNA sequence from an incomplete draft genome. Chimeric contigs are contigs that combine sequences from more than one genome. Short sequence reads from two different genomes can be incorrectly assembled into one contig due to a short region of similar sequence.
|