DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Improved regulatory element prediction based on tissue-specific local epigenomic signatures

Abstract

Accurate enhancer identification is critical for understanding the spatiotemporal transcriptional regulation during development as well as the functional impact of disease-related noncoding genetic variants. Computational methods have been developed to predict the genomic locations of active enhancers based on histone modifications, but the accuracy and resolution of these methods remain limited. Here, we present an algorithm, regulator y element prediction based on tissue-specific local epigenetic marks (REPTILE), which integrates histone modification and whole-genome cytosine DNA methylation profiles to identify the precise location of enhancers. We tested the ability of REPTILE to identify enhancers previously validated in reporter assays. Compared with existing methods, REPTILE shows consistently superior performance across diverse cell and tissue types, and the enhancer locations are significantly more refined. We show that, by incorporating base-resolution methylation data, REPTILE greatly improves upon current methods for annotation of enhancers across a variety of cell and tissue types.

Authors:
 [1];  [2];  [3];  [4];  [4];  [2];  [5];  [6];  [7];  [8];  [9]
  1. Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037,, Bioinformatics Program, University of California, San Diego, La Jolla, CA 92093,
  2. Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA 92093,
  3. Lawrence Berkeley National Laboratory, Berkeley, CA 94720,
  4. Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037,
  5. Institute for Human Genetics, University of California, San Francisco, CA 94143,, Department of Neurology, University of California, San Francisco, CA 94143,
  6. Lawrence Berkeley National Laboratory, Berkeley, CA 94720,, US Department of Energy Joint Genome Institute, Walnut Creek, CA 94598,, School of Natural Sciences, University of California, Merced, CA 95343,
  7. Lawrence Berkeley National Laboratory, Berkeley, CA 94720,, US Department of Energy Joint Genome Institute, Walnut Creek, CA 94598,
  8. Ludwig Institute for Cancer Research, University of California, San Diego, La Jolla, CA 92093,, Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92093,
  9. Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037,, Howard Hughes Medical Institute, The Salk Institute for Biological Studies, La Jolla, CA 92037
Publication Date:
Research Org.:
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC); National Institutes of Health (NIH); Gordon and Betty Moore Foundation
OSTI Identifier:
1343636
Alternate Identifier(s):
OSTI ID: 1393127
Grant/Contract Number:  
AC02-05CH11231; U54 HG006997; K12 GM068524; GBMF3034; R01 MH094670; U01 MH105985; GC1R-06673-B
Resource Type:
Published Article
Journal Name:
Proceedings of the National Academy of Sciences of the United States of America
Additional Journal Information:
Journal Name: Proceedings of the National Academy of Sciences of the United States of America Journal Volume: 114 Journal Issue: 9; Journal ID: ISSN 0027-8424
Publisher:
Proceedings of the National Academy of Sciences
Country of Publication:
United States
Language:
English
Subject:
60 APPLIED LIFE SCIENCES; 59 BASIC BIOLOGICAL SCIENCES; enhancer prediction; DNA methylation; bioinformatics; gene regulation; epigenetics

Citation Formats

He, Yupeng, Gorkin, David U., Dickel, Diane E., Nery, Joseph R., Castanon, Rosa G., Lee, Ah Young, Shen, Yin, Visel, Axel, Pennacchio, Len A., Ren, Bing, and Ecker, Joseph R. Improved regulatory element prediction based on tissue-specific local epigenomic signatures. United States: N. p., 2017. Web. doi:10.1073/pnas.1618353114.
He, Yupeng, Gorkin, David U., Dickel, Diane E., Nery, Joseph R., Castanon, Rosa G., Lee, Ah Young, Shen, Yin, Visel, Axel, Pennacchio, Len A., Ren, Bing, & Ecker, Joseph R. Improved regulatory element prediction based on tissue-specific local epigenomic signatures. United States. https://doi.org/10.1073/pnas.1618353114
He, Yupeng, Gorkin, David U., Dickel, Diane E., Nery, Joseph R., Castanon, Rosa G., Lee, Ah Young, Shen, Yin, Visel, Axel, Pennacchio, Len A., Ren, Bing, and Ecker, Joseph R. Mon . "Improved regulatory element prediction based on tissue-specific local epigenomic signatures". United States. https://doi.org/10.1073/pnas.1618353114.
@article{osti_1343636,
title = {Improved regulatory element prediction based on tissue-specific local epigenomic signatures},
author = {He, Yupeng and Gorkin, David U. and Dickel, Diane E. and Nery, Joseph R. and Castanon, Rosa G. and Lee, Ah Young and Shen, Yin and Visel, Axel and Pennacchio, Len A. and Ren, Bing and Ecker, Joseph R.},
abstractNote = {Accurate enhancer identification is critical for understanding the spatiotemporal transcriptional regulation during development as well as the functional impact of disease-related noncoding genetic variants. Computational methods have been developed to predict the genomic locations of active enhancers based on histone modifications, but the accuracy and resolution of these methods remain limited. Here, we present an algorithm, regulator y element prediction based on tissue-specific local epigenetic marks (REPTILE), which integrates histone modification and whole-genome cytosine DNA methylation profiles to identify the precise location of enhancers. We tested the ability of REPTILE to identify enhancers previously validated in reporter assays. Compared with existing methods, REPTILE shows consistently superior performance across diverse cell and tissue types, and the enhancer locations are significantly more refined. We show that, by incorporating base-resolution methylation data, REPTILE greatly improves upon current methods for annotation of enhancers across a variety of cell and tissue types.},
doi = {10.1073/pnas.1618353114},
journal = {Proceedings of the National Academy of Sciences of the United States of America},
number = 9,
volume = 114,
place = {United States},
year = {2017},
month = {2}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record
https://doi.org/10.1073/pnas.1618353114

Citation Metrics:
Cited by: 7 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

Cistrome and Epicistrome Features Shape the Regulatory DNA Landscape
journal, May 2016


High-Resolution Mapping and Characterization of Open Chromatin across the Genome
journal, January 2008


ChromHMM: automating chromatin-state discovery and characterization
journal, February 2012


Model-based Analysis of ChIP-Seq (MACS)
journal, January 2008


BEDOPS: high-performance genomic feature operations
journal, May 2012


Integrative analysis of haplotype-resolved epigenomes across human tissues
journal, February 2015

  • Leung, Danny; Jung, Inkyung; Rajagopal, Nisha
  • Nature, Vol. 518, Issue 7539
  • DOI: 10.1038/nature14217

Charting a dynamic DNA methylation landscape of the human genome
journal, August 2013

  • Ziller, Michael J.; Gu, Hongcang; Müller, Fabian
  • Nature, Vol. 500, Issue 7463
  • DOI: 10.1038/nature12433

Fast and accurate short read alignment with Burrows-Wheeler transform
journal, May 2009


Progress and challenges in bioinformatics approaches for enhancer identification
journal, December 2015

  • Kleftogiannis, Dimitrios; Kalnis, Panos; Bajic, Vladimir B.
  • Briefings in Bioinformatics, Vol. 17, Issue 6
  • DOI: 10.1093/bib/bbv101

DNA methylation: roles in mammalian development
journal, February 2013

  • Smith, Zachary D.; Meissner, Alexander
  • Nature Reviews Genetics, Vol. 14, Issue 3
  • DOI: 10.1038/nrg3354

Functions of DNA methylation: islands, start sites, gene bodies and beyond
journal, May 2012

  • Jones, Peter A.
  • Nature Reviews Genetics, Vol. 13, Issue 7
  • DOI: 10.1038/nrg3230

9p21 DNA variants associated with coronary artery disease impair interferon-γ signalling response
journal, February 2011

  • Harismendy, Olivier; Notani, Dimple; Song, Xiaoyuan
  • Nature, Vol. 470, Issue 7333
  • DOI: 10.1038/nature09753

Histone H3K27ac separates active from poised enhancers and predicts developmental state
journal, November 2010

  • Creyghton, M. P.; Cheng, A. W.; Welstead, G. G.
  • Proceedings of the National Academy of Sciences, Vol. 107, Issue 50
  • DOI: 10.1073/pnas.1016071107

Statistical methods for detecting differentially methylated loci and regions
journal, September 2014


DNA methylation patterns and epigenetic memory
journal, January 2002


Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome
journal, February 2007

  • Heintzman, Nathaniel D.; Stuart, Rhona K.; Hon, Gary
  • Nature Genetics, Vol. 39, Issue 3
  • DOI: 10.1038/ng1966

GENCODE: The reference human genome annotation for The ENCODE Project
journal, September 2012


Recruitment of CBP/p300 by the IFNβ Enhanceosome Is Required for Synergistic Activation of Transcription
journal, January 1998


Inferring regulatory element landscapes and transcription factor networks from cancer methylomes
journal, May 2015


Human body epigenome maps reveal noncanonical DNA methylation variation
journal, June 2015

  • Schultz, Matthew D.; He, Yupeng; Whitaker, John W.
  • Nature, Vol. 523, Issue 7559
  • DOI: 10.1038/nature14465

Dynamic DNA methylation across diverse human cell lines and tissues
journal, January 2013


C/EBPβ (CEBPB) protein binding to the C/EBP|CRE DNA 8-mer TTGC|GTCA is inhibited by 5hmC and enhanced by 5mC, 5fC, and 5caC in the CG dinucleotide
journal, June 2015

  • Khund Sayeed, Syed; Zhao, Jianfei; Sathyanarayana, Bangalore K.
  • Biochimica et Biophysica Acta (BBA) - Gene Regulatory Mechanisms, Vol. 1849, Issue 6
  • DOI: 10.1016/j.bbagrm.2015.03.002

Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues
journal, September 2013

  • Hon, Gary C.; Rajagopal, Nisha; Shen, Yin
  • Nature Genetics, Vol. 45, Issue 10
  • DOI: 10.1038/ng.2746

Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position
journal, October 2013

  • Buenrostro, Jason D.; Giresi, Paul G.; Zaba, Lisa C.
  • Nature Methods, Vol. 10, Issue 12
  • DOI: 10.1038/nmeth.2688

Erratum: Corrigendum: DNA-binding factors shape the mouse methylome at distal regulatory regions
journal, April 2012

  • Stadler, Michael B.; Murr, Rabih; Burger, Lukas
  • Nature, Vol. 484, Issue 7395
  • DOI: 10.1038/nature11086

Inducible expression of an hsp68-lacZ hybrid gene in transgenic mice
journal, April 1989


Prediction of promoters and enhancers using multiple DNA methylation-associated features
journal, June 2015


Long-Range Control of Gene Expression: Emerging Mechanisms and Disruption in Disease
journal, January 2005

  • Kleinjan, Dirk A.; van Heyningen, Veronica
  • The American Journal of Human Genetics, Vol. 76, Issue 1
  • DOI: 10.1086/426833

RFECS: A Random-Forest Based Algorithm for Enhancer Identification from Chromatin State
journal, March 2013


Human DNA methylomes at base resolution show widespread epigenomic differences
journal, October 2009

  • Lister, Ryan; Pelizzola, Mattia; Dowen, Robert H.
  • Nature, Vol. 462, Issue 7271
  • DOI: 10.1038/nature08514

Genome-wide discovery of human heart enhancers
journal, January 2010

  • Narlikar, L.; Sakabe, N. J.; Blanski, A. A.
  • Genome Research, Vol. 20, Issue 3
  • DOI: 10.1101/gr.098657.109

An integrated encyclopedia of DNA elements in the human genome
journal, September 2012


The 8q24 cancer risk variant rs6983267 shows long-range interaction with MYC in colorectal cancer
journal, June 2009

  • Pomerantz, Mark M.; Ahmadiyeh, Nasim; Jia, Li
  • Nature Genetics, Vol. 41, Issue 8
  • DOI: 10.1038/ng.403

Histone modifications at human enhancers reflect global cell-type-specific gene expression
journal, March 2009

  • Heintzman, Nathaniel D.; Hon, Gary C.; Hawkins, R. David
  • Nature, Vol. 459, Issue 7243
  • DOI: 10.1038/nature07829

The Human Genome Browser at UCSC
journal, May 2002

  • Kent, W. J.; Sugnet, C. W.; Furey, T. S.
  • Genome Research, Vol. 12, Issue 6
  • DOI: 10.1101/gr.229102

Establishing, maintaining and modifying DNA methylation patterns in plants and animals
journal, February 2010

  • Law, Julie A.; Jacobsen, Steven E.
  • Nature Reviews Genetics, Vol. 11, Issue 3
  • DOI: 10.1038/nrg2719

ChIP–seq: advantages and challenges of a maturing technology
journal, September 2009

  • Park, Peter J.
  • Nature Reviews Genetics, Vol. 10, Issue 10
  • DOI: 10.1038/nrg2641

High-throughput mapping of regulatory DNA
journal, January 2016

  • Rajagopal, Nisha; Srinivasan, Sharanya; Kooshesh, Kameron
  • Nature Biotechnology, Vol. 34, Issue 2
  • DOI: 10.1038/nbt.3468

Abnormalities in human pluripotent cells due to reprogramming mechanisms
journal, July 2014

  • Ma, Hong; Morey, Robert; O'Neil, Ryan C.
  • Nature, Vol. 511, Issue 7508
  • DOI: 10.1038/nature13551

BEDTools: a flexible suite of utilities for comparing genomic features
journal, January 2010


In vivo enhancer analysis of human conserved non-coding sequences
journal, November 2006

  • Pennacchio, Len A.; Ahituv, Nadav; Moses, Alan M.
  • Nature, Vol. 444, Issue 7118
  • DOI: 10.1038/nature05295

Transcriptional enhancers in development and disease
journal, January 2012


Non-CG Methylation in the Human Genome
journal, August 2015


‘Leveling’ the playing field for analyses of single-base resolution DNA methylomes
journal, December 2012

  • Schultz, Matthew D.; Schmitz, Robert J.; Ecker, Joseph R.
  • Trends in Genetics, Vol. 28, Issue 12
  • DOI: 10.1016/j.tig.2012.10.012

Epigenomic Analysis of Multilineage Differentiation of Human Embryonic Stem Cells
journal, May 2013


PEDLA: predicting enhancers with a deep learning-based algorithmic framework
journal, June 2016

  • Liu, Feng; Li, Hao; Ren, Chao
  • Scientific Reports, Vol. 6, Issue 1
  • DOI: 10.1038/srep28517

Integrating Diverse Datasets Improves Developmental Enhancer Prediction
journal, June 2014


Base-resolution methylation patterns accurately predict transcription factor bindings in vivo
journal, February 2015

  • Xu, Tianlei; Li, Ben; Zhao, Meng
  • Nucleic Acids Research, Vol. 43, Issue 5
  • DOI: 10.1093/nar/gkv151

VISTA Enhancer Browser--a database of tissue-specific human enhancers
journal, January 2007

  • Visel, A.; Minovitsky, S.; Dubchak, I.
  • Nucleic Acids Research, Vol. 35, Issue Database
  • DOI: 10.1093/nar/gkl822

Integrative analysis of 111 reference human epigenomes
journal, February 2015

  • Kundaje, Anshul; Meuleman, Wouter; Ernst, Jason
  • Nature, Vol. 518, Issue 7539
  • DOI: 10.1038/nature14248

A comparative encyclopedia of DNA elements in the mouse genome
journal, November 2014

  • Yue, Feng; Cheng, Yong; Breschi, Alessandra
  • Nature, Vol. 515, Issue 7527
  • DOI: 10.1038/nature13992

Estimation of False Discovery Rate Using Sequential Permutation p -Values : Sequential Permutation
journal, February 2013


CTCF: an architectural protein bridging genome topology and function
journal, March 2014

  • Ong, Chin-Tong; Corces, Victor G.
  • Nature Reviews Genetics, Vol. 15, Issue 4
  • DOI: 10.1038/nrg3663

Discover regulatory DNA elements using chromatin signatures and artificial neural network
journal, May 2010


Differential sensitivity to methylated DNA by ETS-family transcription factors is intrinsically encoded in their DNA-binding domains
journal, June 2016

  • Stephens, Dominique C.; Poon, Gregory M. K.
  • Nucleic Acids Research, Vol. 44, Issue 18
  • DOI: 10.1093/nar/gkw528

Prediction of regulatory elements in mammalian genomes using chromatin signatures
journal, January 2008


Random Forests
journal, January 2001


Unsupervised pattern discovery in human chromatin structure through genomic segmentation
journal, March 2012

  • Hoffman, Michael M.; Buske, Orion J.; Wang, Jie
  • Nature Methods, Vol. 9, Issue 5
  • DOI: 10.1038/nmeth.1937

Systematic Localization of Common Disease-Associated Variation in Regulatory DNA
journal, September 2012