Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Quantum biological insights into CRISPR-Cas9 sgRNA efficiency from explainable-AI driven feature engineering

Journal Article · · Nucleic Acids Research
DOI:https://doi.org/10.1093/nar/gkad736· OSTI ID:2004638
Abstract

CRISPR-Cas9 tools have transformed genetic manipulation capabilities in the laboratory. Empirical rules-of-thumb have been developed for only a narrow range of model organisms, and mechanistic underpinnings for sgRNA efficiency remain poorly understood. This work establishes a novel feature set and new public resource, produced with quantum chemical tensors, for interpreting and predicting sgRNA efficiency. Feature engineering for sgRNA efficiency is performed using an explainable-artificial intelligence model: iterative Random Forest (iRF). By encoding quantitative attributes of position-specific sequences for Escherichia coli sgRNAs, we identify important traits for sgRNA design in bacterial species. Additionally, we show that expanding positional encoding to quantum descriptors of base-pair, dimer, trimer, and tetramer sequences captures intricate interactions in local and neighboring nucleotides of the target DNA. These features highlight variation in CRISPR-Cas9 sgRNA dynamics between E. coli and H. sapiens genomes. These novel encodings of sgRNAs enhance our understanding of the elaborate quantum biological processes involved in CRISPR-Cas9 machinery.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Laboratory Directed Research and Development (LDRD) Program; USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
2004638
Alternate ID(s):
OSTI ID: 2007740
Journal Information:
Nucleic Acids Research, Journal Name: Nucleic Acids Research Journal Issue: 19 Vol. 51; ISSN 0305-1048
Publisher:
Oxford University PressCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (61)

Chromatin accessibility and guide sequence secondary structure affect CRISPR-Cas9 gene editing efficiency journal June 2017
Nature and magnitude of aromatic base stacking in DNA and RNA: Quantum chemistry, molecular mechanics and experiment journal June 2013
Effect of the damping function in dispersion corrected density functional theory journal March 2011
The Crystal Structure of C-C-A-T-T-A-A-T-G-G journal May 1994
RNA-Mediated Transcriptional Gene Silencing in Human Cells book January 2008
Target specificity of the CRISPR-Cas9 system journal June 2014
Crystal Structure of Cas9 in Complex with Guide RNA and Target DNA journal February 2014
High-Resolution CRISPR Screens Reveal Fitness Genes and Genotype-Specific Cancer Liabilities journal December 2015
Evolutionary systems biology: links between gene evolution and function journal October 2006
Computational approaches for effective CRISPR guide RNA design and evaluation journal January 2020
CRISPR/Cas9 and Genome Editing in Drosophila journal January 2014
CRISPR-P 2.0: An Improved CRISPR-Cas9 Tool for Genome Editing in Plants journal March 2017
Key role of the REC lobe during CRISPR–Cas9 activation by ‘sensing’, ‘regulating’, and ‘locking’ the catalytic HNH domain journal January 2018
From Quantum Chemistry to Networks in Biology: A Graph Spectral Approach to Protein Structure Analyses journal March 2019
Excitons in Solids from Periodic Equation-of-Motion Coupled-Cluster Theory journal March 2020
Reformulation of the D3(Becke–Johnson) Dispersion Correction without Resorting to Higher than C 6 Dispersion Coefficients journal June 2015
Time-Dependent Extension of the Long-Range Corrected Density Functional Based Tight-Binding Method journal March 2017
Catalytic Mechanism of Non-Target DNA Cleavage in CRISPR-Cas9 Revealed by Ab Initio Molecular Dynamics journal November 2020
Nonempirically Tuned Range-Separated DFT Accurately Predicts Both Fundamental and Excitation Gaps in DNA and RNA Nucleobases journal July 2012
Parametrization and Benchmark of DFTB3 for Organic Molecules journal November 2012
Stacking Interactions and the Twist of DNA journal January 2008
Functional Insights Revealed by the Kinetic Mechanism of CRISPR/Cas9 journal February 2018
An Analysis of the Different Behavior of DNA and RNA through the Study of the Mutual Relationship between Stacking and Hydrogen Bonding journal March 2009
Reduced HOMO−LUMO Gap as an Index of Kinetic Stability for Polycyclic Aromatic Hydrocarbons journal September 1999
Rational design of highly active sgRNAs for CRISPR-Cas9–mediated gene inactivation journal September 2014
Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9 journal January 2016
CRISPRscan: designing highly efficient sgRNAs for CRISPR-Cas9 targeting in vivo journal August 2015
Defining and improving the genome-wide specificities of CRISPR–Cas9 nucleases journal April 2016
Localized thermodynamic coupling between hydrogen bonding and microenvironment polarity substantially stabilizes proteins journal June 2009
Real-space and real-time dynamics of CRISPR-Cas9 visualized by high-speed atomic force microscopy journal November 2017
Multiplexed Cas9 targeting reveals genomic location effects and gRNA-based staggered breaks influencing mutation efficiency journal April 2019
Clarifying the quantum mechanical origin of the covalent chemical bond journal September 2020
Principles of target DNA cleavage and the role of Mg2+ in the catalysis of CRISPR–Cas9 journal October 2022
Sequence features associated with the cleavage efficiency of CRISPR/Cas9 system journal January 2016
A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-Pu journal April 2010
DFTB+, a software package for efficient approximate density functional theory based atomistic simulations journal March 2020
Iterative random forests to discover predictive and stable high-order interactions journal January 2018
Nucleosomes inhibit target cleavage by CRISPR-Cas9 in vivo journal September 2018
Delivering CRISPR: a review of the challenges and approaches journal January 2018
CRISPR-DT: designing gRNAs for the CRISPR-Cpf1 system with improved target efficiency and specificity journal January 2019
CRISPR/Cas9: a molecular Swiss army knife for simultaneous introduction of multiple genetic modifications in Saccharomyces cerevisiae journal March 2015
PubChem in 2021: new data content and improved web interfaces journal November 2020
Refined sgRNA efficacy prediction improves large- and small-scale CRISPR–Cas9 applications journal December 2017
Improved sgRNA design in bacteria via genome-wide activity profiling journal June 2018
Sequence determinants of improved CRISPR sgRNA design journal June 2015
Layer by layer 3D tissue epitaxy by cell laden hydrogel droplets conference April 2009
Multiplex Genome Engineering Using CRISPR/Cas Systems journal January 2013
Genetic Screens in Human Cells Using the CRISPR-Cas9 System journal December 2013
The new frontier of genome engineering with CRISPR-Cas9 journal November 2014
Identification of potential drug targets for tuberous sclerosis complex by synthetic screens combining CRISPR-based knockouts with RNAi journal September 2015
CRISPR–Cas9 Structures and Mechanisms journal May 2017
ViennaRNA Package 2.0 journal November 2011
Quantitative CRISPR interference screens in yeast identify chemical-genetic interactions and new rules for guide RNA design journal March 2016
DeepCRISPR: optimized CRISPR guide RNA design by deep learning journal June 2018
A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action journal October 2017
CRISPRpred: A flexible and efficient tool for sgRNAs on-target activity prediction in CRISPR/Cas9 systems journal August 2017
Using local chromatin structure to improve CRISPR/Cas9 efficiency in zebrafish journal August 2017
Are the current gRNA ranking prediction algorithms useful for genome editing in plants? journal January 2020
A Novel Hybrid CNN-SVR for CRISPR/Cas9 Guide RNA Activity Prediction journal January 2020
A High-Performance Computing Implementation of Iterative Random Forest for the Creation of Predictive Expression Networks journal December 2019
Nucleosomes impede Cas9 access to DNA in vivo and in vitro journal March 2016

Similar Records

Controlling CRISPR-Cas9 with ligand-activated and ligand-deactivated sgRNAs
Journal Article · Wed May 08 20:00:00 EDT 2019 · Nature Communications · OSTI ID:1603511

Demonstration of CRISPR/Cas9/sgRNA-mediated targeted gene modification in Arabidopsis, tobacco, sorghum and rice
Journal Article · Fri Aug 30 20:00:00 EDT 2013 · Nucleic Acids Research · OSTI ID:1904805

Related Subjects