DOE PAGES: U.S. Department of Energy
Office of Scientific and Technical Information

Title: A systematic comparison of error correction enzymes by next-generation sequencing

Abstract

Gene synthesis, the process of assembling gene-length fragments from shorter groups of oligonucleotides (oligos), is becoming an increasingly important tool in molecular and synthetic biology. The length, quality and cost of gene synthesis are limited by errors produced during oligo synthesis and subsequent assembly. Enzymatic error correction methods are a cost-effective means to ameliorate errors in gene synthesis. Previous analyses of these methods relied on cloning and Sanger sequencing to evaluate their efficiencies, limiting quantitative assessment. Here, we develop a method to quantify errors in synthetic DNA by next-generation sequencing. We analyzed errors in model gene assemblies and systematically compared six different error correction enzymes across 11 conditions. We find that ErrASE and T7 Endonuclease I are the most effective at decreasing average error rates (up to 5.8-fold relative to the input), whereas MutS is the best for increasing the number of perfect assemblies (up to 25.2-fold). We are able to quantify differential specificities: for example, ErrASE preferentially corrects C/G transversions, whereas T7 Endonuclease I preferentially corrects A/T transversions. More generally, this experimental and computational pipeline is a fast, scalable and extensible way to analyze errors in gene assemblies, to profile error correction methods, and to benchmark DNA synthesis methods.
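
The two headline metrics in the abstract (average error rate and number of perfect assemblies, each reported as a fold change relative to the input) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the names `Summary`, `summarize` and `fold_change` are hypothetical, the reads are made-up toy data, and the sketch assumes each read has already been globally aligned to the reference at equal length, with `-` marking a deleted base. Insertions are not handled.

```python
from dataclasses import dataclass


@dataclass
class Summary:
    error_rate: float        # mismatched positions per aligned base
    perfect_fraction: float  # share of reads identical to the reference


def summarize(reads, reference):
    """Summarize pre-aligned reads against a reference sequence.

    Assumption: every read is already aligned to the reference and has the
    same length, so positions can be compared one to one.
    """
    if not reads:
        raise ValueError("need at least one read")
    errors = 0
    perfect = 0
    for read in reads:
        if len(read) != len(reference):
            raise ValueError("reads must match the reference length")
        mismatches = sum(1 for r, b in zip(read, reference) if r != b)
        errors += mismatches
        if mismatches == 0:
            perfect += 1
    total_bases = len(reads) * len(reference)
    return Summary(errors / total_bases, perfect / len(reads))


def fold_change(numerator, denominator):
    """Ratio of two rates; undefined when the denominator is zero."""
    if denominator == 0:
        raise ZeroDivisionError("fold change undefined for a zero baseline")
    return numerator / denominator


# Toy data only: an uncorrected input pool and a pool after a correction step.
reference = "ACGT"
input_reads = ["ACGT", "ACGA", "AGGA", "ACTT"]
treated_reads = ["ACGT", "ACGT", "ACGA", "ACGT"]

before = summarize(input_reads, reference)
after = summarize(treated_reads, reference)

# Fold decrease in error rate: input rate divided by treated rate.
error_fold = fold_change(before.error_rate, after.error_rate)
# Fold increase in perfect assemblies: treated fraction divided by input fraction.
perfect_fold = fold_change(after.perfect_fraction, before.perfect_fraction)
print(error_fold, perfect_fold)
```

The two fold changes are computed in opposite directions on purpose: a lower error rate and a higher perfect fraction both count as improvement, so each ratio is arranged to exceed 1 when the treatment helps.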

Authors:
Lubock, Nathan B. [1]; Zhang, Di [2]; Sidore, Angus M. [3]; Church, George M. [4]; Kosuri, Sriram [1]
  1. Univ. of California, Los Angeles, CA (United States); UCLA-DOE Inst. for Genomics and Proteomics, Los Angeles, CA (United States)
  2. Univ. of Pennsylvania, Philadelphia, PA (United States)
  3. Univ. of California, Los Angeles, CA (United States)
  4. Brigham and Women's Hospital (Harvard Medical School), Boston, MA (United States); Wyss Inst. for Biologically Inspired Engineering, Boston, MA (United States)
Publication Date:
2017-08-01
Research Org.:
Univ. of California, Los Angeles, CA (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1425978
Grant/Contract Number:  
FC02-02ER63421
Resource Type:
Accepted Manuscript
Journal Name:
Nucleic Acids Research
Additional Journal Information:
Journal Volume: 45; Journal Issue: 15; Journal ID: ISSN 0305-1048
Publisher:
Oxford University Press
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES

Citation Formats

Lubock, Nathan B., Zhang, Di, Sidore, Angus M., Church, George M., and Kosuri, Sriram. A systematic comparison of error correction enzymes by next-generation sequencing. United States: N. p., 2017. Web. doi:10.1093/nar/gkx691.
Lubock, Nathan B., Zhang, Di, Sidore, Angus M., Church, George M., & Kosuri, Sriram. A systematic comparison of error correction enzymes by next-generation sequencing. United States. https://doi.org/10.1093/nar/gkx691
Lubock, Nathan B., Zhang, Di, Sidore, Angus M., Church, George M., and Kosuri, Sriram. 2017. "A systematic comparison of error correction enzymes by next-generation sequencing". United States. https://doi.org/10.1093/nar/gkx691. https://www.osti.gov/servlets/purl/1425978.
@article{osti_1425978,
title = {A systematic comparison of error correction enzymes by next-generation sequencing},
author = {Lubock, Nathan B. and Zhang, Di and Sidore, Angus M. and Church, George M. and Kosuri, Sriram},
abstractNote = {Gene synthesis, the process of assembling gene-length fragments from shorter groups of oligonucleotides (oligos), is becoming an increasingly important tool in molecular and synthetic biology. The length, quality and cost of gene synthesis are limited by errors produced during oligo synthesis and subsequent assembly. Enzymatic error correction methods are a cost-effective means to ameliorate errors in gene synthesis. Previous analyses of these methods relied on cloning and Sanger sequencing to evaluate their efficiencies, limiting quantitative assessment. Here, we develop a method to quantify errors in synthetic DNA by next-generation sequencing. We analyzed errors in model gene assemblies and systematically compared six different error correction enzymes across 11 conditions. We find that ErrASE and T7 Endonuclease I are the most effective at decreasing average error rates (up to 5.8-fold relative to the input), whereas MutS is the best for increasing the number of perfect assemblies (up to 25.2-fold). We are able to quantify differential specificities: for example, ErrASE preferentially corrects C/G transversions, whereas T7 Endonuclease I preferentially corrects A/T transversions. More generally, this experimental and computational pipeline is a fast, scalable and extensible way to analyze errors in gene assemblies, to profile error correction methods, and to benchmark DNA synthesis methods.},
doi = {10.1093/nar/gkx691},
journal = {Nucleic Acids Research},
number = 15,
volume = 45,
place = {United States},
year = {2017},
month = {8}
}

Citation Metrics:
Cited by: 19 works
Citation information provided by
Web of Science
