Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

An interpretable model of pre-mRNA splicing for animal and plant genes

Journal Article · · Science Advances
 [1];  [2]
  1. Massachusetts Institute of Technology (MIT), Cambridge, MA (United States); , Massachusetts Institute of Technology, Cambridge, MA (United States)
  2. Massachusetts Institute of Technology (MIT), Cambridge, MA (United States)
Pre-mRNA splicing is a fundamental step in gene expression, conserved across eukaryotes, in which the spliceosome recognizes motifs at the 3' and 5' splice sites (SSs), excises introns, and ligates exons. SS recognition and pairing is often influenced by protein splicing factors (SFs) that bind to splicing regulatory elements (SREs). Here, we describe SMsplice, a fully interpretable model of pre-mRNA splicing that combines models of core SS motifs, SREs, and exonic and intronic length preferences. We learn models that predict SS locations with 83 to 86% accuracy in fish, insects, and plants and about 70% in mammals. Learned SRE motifs include both known SF binding motifs and unfamiliar motifs, and both motif classes are supported by genetic analyses. Our comparisons across species highlight similarities between non-mammals, increased reliance on intronic SREs in plant splicing, and a greater reliance on SREs in mammalian splicing.
Research Organization:
Krell Institute, Ames, IA (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
FG02-97ER25308
OSTI ID:
2471914
Journal Information:
Science Advances, Journal Name: Science Advances Journal Issue: 19 Vol. 10; ISSN 2375-2548
Publisher:
AAASCopyright Statement
Country of Publication:
United States
Language:
English

References (42)

Evolution of SR protein and hnRNP splicing regulatory factors journal September 2011
Exon and intron definition in pre-mRNA splicing: Exon and intron definition in pre-mRNA splicing journal October 2012
A minimal intron length but no specific internal sequence is required for splicing the large rabbit β-globin intron journal July 1984
Pre-mRNA splicing in higher plants journal April 2000
Systematic Identification and Analysis of Exonic Splicing Silencers journal December 2004
The Spliceosome: Design Principles of a Dynamic RNP Machine journal February 2009
Learning the Sequence Determinants of Alternative Splicing from Millions of Random Sequences journal October 2015
Combinatorial Genetics Reveals a Scaling Law for the Effects of Mutations on Splicing journal January 2019
Predicting Splicing from Primary Sequence with Deep Learning journal January 2019
Comparative Analysis Identifies Exonic Splicing Regulatory Sequences—The Complex Definition of Enhancers and Silencers journal June 2006
Sequence, Structure, and Context Preferences of Human RNA Binding Proteins journal June 2018
Quantitative Activity Profile and Context Dependence of All Human 5′ Splice Sites journal September 2018
Deciphering the splicing code journal May 2010
A compendium of RNA-binding motifs for decoding gene regulation journal July 2013
Lessons from non-canonical splicing journal May 2016
Alternative splicing and evolution: diversification, exon definition and function journal April 2010
Context-dependent control of alternative splicing by RNA-binding proteins journal August 2014
Intronic splicing enhancers, cognate splicing factors and context-dependent regulation rules journal September 2012
A complex network of factors with overlapping affinities represses splicing through intronic elements journal December 2012
A large-scale binding and functional map of human RNA-binding proteins journal July 2020
Prediction of protein–ligand binding affinity from sequencing data with interpretable machine learning journal May 2022
Variation in sequence and organization of splicing regulatory elements in vertebrate genes journal October 2004
A computational analysis of sequence features involved in recognition of short introns journal September 2001
Maximum Entropy Modeling of Short Sequence Motifs with Applications to RNA Splicing Signals journal March 2004
Bayesian prediction of tissue-regulated splicing using RNA sequence and cellular context journal July 2011
TimeTree 5: An Expanded Resource for Species Divergence Times journal August 2022
RNA splice junctions of different classes of eukaryotes: sequence statistics and functional implications in gene expression journal January 1987
APPRIS: annotation of principal and alternative splice isoforms journal November 2012
In silico prediction of splice-altering single nucleotide variants in the human genome journal November 2014
Splice-switching antisense oligonucleotides as therapeutic drugs journal June 2016
Saturation mutagenesis reveals manifold determinants of exon definition journal December 2017
An efficient forward-backward algorithm for an explicit-duration hidden Markov model journal January 2003
Predictive Identification of Exonic Splicing Enhancers in Human Genes journal July 2002
The human splicing code reveals new insights into the genetic determinants of disease journal December 2014
Introns and Splicing Elements of Five Diverse Fungi journal October 2004
The evolution, impact and properties of exonic splice enhancers journal December 2013
MutPred Splice: machine learning-based prediction of exonic variants that disrupt splicing journal January 2014
MMSplice: modular modeling improves the predictions of genetic variant effects on splicing journal March 2019
Improved modeling of RNA-binding protein motifs in an interpretable neural model of RNA splicing journal January 2024
CI-SpliceAI—Improving machine learning predictions of disease causing splicing variants using curated alternative splice sites journal June 2022
The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture journal December 2017
Mutations primarily alter the inclusion of alternatively spliced exons journal October 2020

Similar Records

The kinetics of pre-mRNA splicing in the Drosophila genome and the influence of gene architecture
Journal Article · Tue Dec 26 19:00:00 EST 2017 · eLife · OSTI ID:1414969