Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

A causal data fusion method for the general exposure and outcome

Journal Article · · Statistics in Medicine
DOI:https://doi.org/10.1002/sim.9239· OSTI ID:1828989
 [1];  [2];  [1];  [1];  [3]
  1. Department of Biostatistics, School of Public Health, Cheeloo College of Medicine Shandong University Jinan Shandong P. R. China, Institute for Medical Dataology, Cheeloo College of Medicine Shandong University Jinan Shandong P. R. China
  2. Department of Biostatistics, School of Public Health Peking University Beijing P. R. China
  3. Department of Biostatistics, School of Public Health Peking University Beijing P. R. China, Shool of Mathematical sciences Peking University Beijing P. R. China
Abstract

With the advent of the big data era, the need to combine multiple individual data sets to draw causal effects arises naturally in many medical and biological applications. Especially each data set cannot measure enough confounders to infer the causal effect of an exposure on an outcome. In this article, we extend the method proposed by a previous study to causal data fusion of more than two data sets without external validation and to a more general (continuous or discrete) exposure and outcome. Theoretically, we obtain the condition for identifiability of exposure effects using multiple individual data sources for the continuous or discrete exposure and outcome. The simulation results show that our proposed causal data fusion method has unbiased causal effect estimate and higher precision than traditional regression, meta‐analysis and statistical matching methods. We further apply our method to study the causal effect of BMI on glucose level in individuals with diabetes by combining two data sets. Our method is essential for causal data fusion and provides important insights into the ongoing discourse on the empirical analysis of merging multiple individual data sources.

Sponsoring Organization:
USDOE
OSTI ID:
1828989
Journal Information:
Statistics in Medicine, Journal Name: Statistics in Medicine Journal Issue: 2 Vol. 41; ISSN 0277-6715
Publisher:
Wiley Blackwell (John Wiley & Sons)Copyright Statement
Country of Publication:
United Kingdom
Language:
English

References (14)

Causal data fusion methods using summary‐level statistics for a continuous outcome journal January 2020
A sufficient condition for pooling data journal February 2008
Causality book January 2009
The Effect of Body Mass Index on Fasting Blood Glucose After Initiation of Thiazide Therapy in Hypertensive Patients journal April 2008
Adjustment for Missing Confounders Using External Validation Data and Propensity Scores journal March 2012
Combining Multiple Observational Data Sources to Estimate Causal Effects journal June 2019
Adjusting Effect Estimates for Unmeasured Confounding with Validation Data using Propensity Score Calibration journal August 2005
Performance of Propensity Score Calibration--A Simulation Study journal March 2007
Propensity Score Calibration in the Absence of Surrogacy journal April 2012
Adjustment for Missing Confounders in Studies Based on Observational Databases: 2-Stage Calibration Combining Propensity Scores From Primary and Validation Data journal June 2014
Guided Bayesian imputation to adjust for confounding when combining heterogeneous data sources in comparative effectiveness research journal March 2017
Introduction to causal diagrams for confounder selection: Causal diagrams journal January 2014
An historical perspective on meta-analysis: Dealing quantitatively with varying study results journal December 2007
Causal inference in statistics: An overview journal January 2009

Similar Records

Feature selection and causal analysis for microbiome studies in the presence of confounding using standardization
Journal Article · Mon Jul 05 20:00:00 EDT 2021 · BMC Bioinformatics · OSTI ID:1805307

Related Subjects