Atlas of Transcription Factor Binding Sites from ENCODE DNase Hypersensitivity Data across 27 Tissue Types
more »
- Institute for Systems Biology, Seattle, WA (United States)
- Univ. of Maryland School of Medicine, Baltimore, MD (United States)
- Univ. of Chicago, IL (United States)
- Mayo Clinic, Jacksonville, FL (United States)
- Univ. of Southern California, Los Angeles, CA (United States)
- Univ. of Chicago, IL (United States); Argonne National Lab. (ANL), Argonne, IL (United States)
Characterizing the tissue-specific binding sites of transcription factors (TFs) is essential to reconstruct gene regulatory networks and predict functions for non-coding genetic variation. DNase-seq footprinting enables the prediction of genome-wide binding sites for hundreds of TFs simultaneously. Despite the public availability of high-quality DNase-seq data from hundreds of samples, a comprehensive, up-to-date resource for the locations of genomic footprints is lacking. Here, we develop a scalable footprinting workflow using two state-of-the-art algorithms: Wellington and HINT. We apply our workflow to detect footprints in 192 ENCODE DNase-seq experiments and predict the genomic occupancy of 1,515 human TFs in 27 human tissues. We validate that these footprints overlap true-positive TF binding sites from ChIP-seq. We demonstrate that the locations, depth, and tissue specificity of footprints predict effects of genetic variants on gene expression and capture a substantial proportion of genetic risk for complex traits.
- Research Organization:
- Argonne National Laboratory (ANL), Argonne, IL (United States)
- Sponsoring Organization:
- National Human Genome Research Institute (NHGRI); National Institute of General Medical Sciences (NIGMS); National Institute of Mental Health (NIMH); National Institute on Aging (NIA); National Institutes of Health (NIH); USDOE
- Grant/Contract Number:
- AC02-06CH11357
- OSTI ID:
- 1774294
- Journal Information:
- Cell Reports, Journal Name: Cell Reports Journal Issue: 7 Vol. 32; ISSN 2211-1247
- Publisher:
- ElsevierCopyright Statement
- Country of Publication:
- United States
- Language:
- English
| Reproducible Big Data Science: A Case Study In Continuous Fairness | text | January 2018 |
| Reproducible Big Data Science: A Case Study In Continuous Fairness | text | January 2018 |
Similar Records
MZF-1 and DbpA interact with DNase I hypersensitive sites that correlate with expression of the human MUC1 mucin gene
A web-based tool for the prediction of rice transcription factor function
Genome-wide Transcription Factor DNA Binding Sites and Gene Regulatory Networks in Clostridium thermocellum
Journal Article
·
Mon Aug 01 00:00:00 EDT 2005
· Experimental Cell Research
·
OSTI ID:20717632
A web-based tool for the prediction of rice transcription factor function
Journal Article
·
Wed Jun 05 20:00:00 EDT 2019
· Database
·
OSTI ID:2469652
Genome-wide Transcription Factor DNA Binding Sites and Gene Regulatory Networks in Clostridium thermocellum
Dataset
·
Mon Apr 19 00:00:00 EDT 2021
·
OSTI ID:1778964