skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Searching for fat tails in CRISPR-Cas systems: Data analysis and mathematical modeling

Journal Article · · PLoS Computational Biology (Online)
ORCiD logo [1]; ORCiD logo [2]; ORCiD logo [3]; ORCiD logo [4]
  1. Palomar College, San Marcos, CA (United States)
  2. USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States); Mammoth BioSciences, San Franisco, CA (United States)
  3. Univ. of Leicester (United Kingdom); Russian Academy of Sciences (RAS), Moscow (Russian Federation)
  4. Russian Academy of Sciences (RAS), Moscow (Russian Federation)

Understanding CRISPR-Cas systems—the adaptive defence mechanism that about half of bacterial species and most of archaea use to neutralise viral attacks—is important for explaining the biodiversity observed in the microbial world as well as for editing animal and plant genomes effectively. The CRISPR-Cas system learns from previous viral infections and integrates small pieces from phage genomes called spacers into the microbial genome. The resulting library of spacers collected in CRISPR arrays is then compared with the DNA of potential invaders. One of the most intriguing and least well understood questions about CRISPR-Cas systems is the distribution of spacers across the microbial population. Here, using empirical data, we show that the global distribution of spacer numbers in CRISPR arrays across multiple biomes worldwide typically exhibits scale-invariant power law behaviour, and the standard deviation is greater than the sample mean. We develop a mathematical model of spacer loss and acquisition dynamics which fits observed data from almost four thousand metagenomes well. In analogy to the classical ‘rich-get-richer’ mechanism of power law emergence, the rate of spacer acquisition is proportional to the CRISPR array size, which allows a small proportion of CRISPRs within the population to possess a significant number of spacers. Our study provides an alternative explanation for the rarity of all-resistant super microbes in nature and why proliferation of phages can be highly successful despite the effectiveness of CRISPR-Cas systems.

Research Organization:
USDOE Joint Genome Institute (JGI), Berkeley, CA (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1904074
Journal Information:
PLoS Computational Biology (Online), Vol. 17, Issue 3; ISSN 1553-7358
Publisher:
Public Library of ScienceCopyright Statement
Country of Publication:
United States
Language:
English

References (60)

Long-Term Dynamics of Adaptation in Asexual Populations journal November 2013
Power-law models for infectious disease spread journal September 2014
Intervening Sequences of Regularly Spaced Prokaryotic Repeats Derive from Foreign Genetic Elements journal February 2005
The size of the immune repertoire of bacteria journal February 2020
Microbial Genomics and Infectious Diseases journal July 2011
Molecular memory of prior infections activates the CRISPR/Cas adaptive bacterial immunity system journal January 2012
How adaptive immunity constrains the composition and fate of large bacterial populations journal July 2018
Priming in a permissive type I-C CRISPR–Cas system reveals distinct dynamics of spacer acquisition and loss journal July 2017
The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4) journal October 2015
Power-Law Distributions in Empirical Data journal November 2009
Psl trails guide exploration and microcolony formation in Pseudomonas aeruginosa biofilms journal May 2013
Different genetic and morphological outcomes for phages targeted by single or multiple CRISPR-Cas spacers journal March 2019
The CRISPR tool kit for genome editing and beyond journal May 2018
Adaptation of the Haloarcula hispanica CRISPR-Cas system to a purified virus strictly requires a priming process journal November 2013
Do-or-die life cycles and diverse post-infection resistance mechanisms limit the evolution of parasite host ranges journal February 2014
GenBank journal November 2012
Why put up with immunity when there is resistance: an excursion into the population and evolutionary dynamics of restriction–modification and CRISPR-Cas journal March 2019
Small CRISPR RNAs Guide Antiviral Defense in Prokaryotes journal August 2008
Short motif sequences determine the targets of the prokaryotic CRISPR defence system journal March 2009
Spatial Structure and Lamarckian Adaptation Explain Extreme Genetic Diversity at CRISPR Locus journal August 2012
CRISPR Provides Acquired Resistance Against Viruses in Prokaryotes journal March 2007
Computational approaches to predict bacteriophage–host relationships journal December 2015
CRISPR-Based Technologies and the Future of Food Science journal October 2015
The Red Queen and the Court Jester: Species Diversity and the Role of Biotic and Abiotic Factors Through Time journal February 2009
On the global CRISPR array behavior in class I systems journal August 2017
Mathematical modeling and comparison of protein size distribution in different plant, animal, fungal and microbial species reveals a negative correlation between protein size and protein number, thus providing insight into the evolution of proteomes journal February 2012
Dynamic properties of the Sulfolobus CRISPR/Cas and CRISPR/Cmr systems when challenged with vector-borne viral and plasmid genes and protospacers: Dynamics of Sulfolobus CRISPR-based systems journal November 2010
On the Frequency of Severe Terrorist Events journal February 2007
The predator-prey power law: Biomass scaling across terrestrial and aquatic biomes journal September 2015
Global distribution of microbial abundance and biomass in subseafloor sediment journal August 2012
Likelihood Ratio Tests for Model Selection and Non-Nested Hypotheses journal March 1989
CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats journal June 2007
Characterization of a Type II-A CRISPR-Cas System in Streptococcus mutans journal June 2020
Phylogenetically Novel Uncultured Microbial Cells Dominate Earth Microbiomes journal September 2018
A Programmable Dual-RNA-Guided DNA Endonuclease in Adaptive Bacterial Immunity journal June 2012
The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification journal October 2014
Probabilistic models for CRISPR spacer content evolution journal February 2013
Co-evolution within structured bacterial communities results in multiple expansion of CRISPR loci and enhanced immunity journal March 2020
The ecology and evolution of microbial CRISPR-Cas adaptive immune systems journal March 2019
Strong bias in the bacterial CRISPR elements that confer immunity to phage journal February 2013
On a Class of Skew Distribution Functions journal December 1955
The Population and Evolutionary Dynamics of Phage and Bacteria with CRISPR–Mediated Immunity journal March 2013
powerlaw: A Python Package for Analysis of Heavy-Tailed Distributions journal January 2014
Power laws, Pareto distributions and Zipf's law journal September 2005
Characterization of the transcriptome of Achromobacter sp. HZ01 with the outstanding hydrocarbon-degrading ability journal June 2016
Mechanism of Foreign DNA Selection in a Bacterial Adaptive Immune System journal June 2012
CRISPR Interference Directs Strand Specific Spacer Acquisition journal April 2012
Optimal number of spacers in CRISPR arrays journal December 2017
Type I-F CRISPR-Cas resistance against virulent phages results in abortive infection and provides population-level immunity journal December 2019
Dominance of the suppressed: Power-law size structure in tropical forests journal January 2016
CRISPR-Cas antimicrobials: Challenges and future prospects journal June 2018
The diversity-generating benefits of a prokaryotic adaptive immune system journal April 2016
Universal1/fNoise from Dissipative Self-Organized Criticality Models journal January 1999
CRISPR-Cas immunity in prokaryotes journal October 2015
IMG/M: integrated genome and metagenome comparative data analysis system journal October 2016
Phage-Resistant Phase-Variant Sub-populations Mediate Herd Immunity Against Bacteriophage Invasion of Bacterial Meta-Populations journal July 2019
CRISPR-Cas Systems Optimize Their Immune Response by Specifying the Site of Spacer Integration journal November 2016
A theory of power-law distributions in financial market fluctuations journal May 2003
Earthquake size distribution: Power-law with exponent ? journal July 2010
It is unclear how important CRISPR-Cas systems are for protecting natural populations of bacteria against infections by mobile genetic elements journal October 2020