skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Machine learning prediction of incidence of Alzheimer’s disease using large-scale administrative health data

Journal Article · · npj Digital Medicine
 [1]; ORCiD logo [2];  [3];  [4];  [4];  [3];  [1]; ORCiD logo [3]; ORCiD logo [5]
  1. Brookhaven National Lab. (BNL), Upton, NY (United States)
  2. Yonsei Univ. College of Medicine, Seoul (Korea)
  3. National Health Insurance Service Ilsan Hospital, Goyang (Korea)
  4. Columbia Univ., New York, NY (United States)
  5. Columbia Univ., New York, NY (United States); Seoul National Univ. (Korea)

Nationwide population-based cohort provides a new opportunity to build an automated risk prediction model based on individuals’ history of health and healthcare beyond existing risk prediction models. We tested the possibility of machine learning models to predict future incidence of Alzheimer’s disease (AD) using large-scale administrative health data. From the Korean National Health Insurance Service database between 2002 and 2010, we obtained de-identified health data in elders above 65 years (N = 40,736) containing 4,894 unique clinical features including ICD-10 codes, medication codes, laboratory values, history of personal and family illness and socio-demographics. To define incident AD we considered two operational definitions: “definite AD” with diagnostic codes and dementia medication (n = 614) and “probable AD” with only diagnosis (n = 2026). We trained and validated random forest, support vector machine and logistic regression to predict incident AD in 1, 2, 3, and 4 subsequent years. For predicting future incidence of AD in balanced samples (bootstrapping), the machine learning models showed reasonable performance in 1-year prediction with AUC of 0.775 and 0.759, based on “definite AD” and “probable AD” outcomes, respectively; in 2-year, 0.730 and 0.693; in 3-year, 0.677 and 0.644; in 4-year, 0.725 and 0.683. The results were similar when the entire (unbalanced) samples were used. Important clinical features selected in logistic regression included hemoglobin level, age and urine protein level. This study may shed a light on the utility of the data-driven machine learning model based on large-scale administrative health data in AD risk prediction, which may enable better selection of individuals at risk for AD in clinical trials or early detection in clinical settings.

Research Organization:
Brookhaven National Laboratory (BNL), Upton, NY (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (SC-21); Seoul National University; NHS Ilsan Hospital Research Support Program; National Institutes of Health (NIH); Brain Behavior Research Foundation Young Investigator Award; Korean Scientists and Engineers Association Young Investigator Grant; National Research Foundation of Korea (NRF); Ministry of Science
Grant/Contract Number:
SC0012704; K01-MH109836
OSTI ID:
1618403
Report Number(s):
BNL-215918-2020-JAAM
Journal Information:
npj Digital Medicine, Vol. 3, Issue 1; ISSN 2398-6352
Publisher:
Springer NatureCopyright Statement
Country of Publication:
United States
Language:
English

References (37)

Monetary Costs of Dementia in the United States journal April 2013
The Value of Delaying Alzheimer’s Disease Onset journal January 2015
Big data analytics in healthcare: promise and potential journal February 2014
Dementia risk prediction in the population: are screening models accurate? journal May 2010
Multiple cognitive deficits during the transition to Alzheimer's disease journal September 2004
Cognitive Deficits 3 to 6 Years Before Dementia Onset in a Population Sample: The Honolulu-Asia Aging Study: COGNITIVE DEFICITS BEFORE DEMENTIA ONSET journal March 2005
Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait—a cohort study journal January 2013
Prediction models to identify individuals at risk of metabolic syndrome who are unlikely to participate in a health intervention program journal March 2018
Ten-year prediction of suicide death using Cox regression and machine learning in a nationwide retrospective cohort study in South Korea journal April 2018
Evaluation of Machine-Learning Algorithms for Predicting Opioid Overdose Risk Among Medicare Beneficiaries With Opioid Prescriptions journal March 2019
Predicting drug-resistant epilepsy — A machine learning approach based on administrative claims data journal December 2018
Predicting the Future — Big Data, Machine Learning, and Clinical Medicine journal September 2016
On the Prospects for a (Deep) Learning Health Care System journal September 2018
Deep Learning—A Technology With the Potential to Transform Health Care journal September 2018
Current Developments in Dementia Risk Prediction Modelling: An Updated Systematic Review journal September 2015
Scalable and accurate deep learning with electronic health records journal May 2018
Anaemia increases the risk of dementia in cognitively intact elderly journal February 2006
Hemoglobin level in older persons and incident Alzheimer disease: Prospective cohort analysis journal July 2011
Anemia and risk of dementia in older adults: Findings from the Health ABC study journal July 2013
Anemia is associated with incidence of dementia: a national health screening study in Korea involving 37,900 persons journal December 2017
Diagnostic Accuracy of Urine Dipsticks for Detection of Albuminuria in the General Community journal July 2011
Short-term treatment with tolfenamic acid improves cognitive functions in Alzheimer's disease mice journal October 2013
Tolfenamic acid reduces tau and CDK5 levels: implications for dementia and tauopathies journal October 2014
Tolfenamic acid downregulates BACE1 and protects against lead-induced upregulation of Alzheimer's disease related biomarkers journal April 2014
Tolfenamic Acid: A Modifier of the Tau Protein and its Role in Cognition and Tauopathy journal May 2018
Psychotropic Medication Burden and Factors Associated with Antipsychotic Use: An Analysis of a Population-Based Sample of Community-Dwelling Older Persons with Dementia journal September 2011
Schizophrenia and risk of dementia: a meta-analysis study journal January 2018
The treatment of cognitive dysfunction in dementia: a multiple treatments meta-analysis journal March 2018
Role of Vasodilation in Cognitive Impairment journal May 2011
A Nationwide Survey on the Prevalence of Dementia and Mild Cognitive Impairment in South Korea journal February 2011
Cohort Profile: The National Health Insurance Service–National Sample Cohort (NHIS-NSC), South Korea journal January 2016
All Patient Refined-Diagnosis Related Groups’ (APR-DRGs) Severity of Illness and Risk of Mortality as predictors of in-hospital mortality journal May 2022
Risk score for the prediction of dementia risk in 20 years among middle aged people: a longitudinal, population-based study journal September 2006
Comparison of Proteinuria Determination by Urine Dipstick, Spot Urine Protein Creatinine Index, and Urine Protein 24 Hours in Lupus Patients journal January 2011
Dementia risk in renal dysfunction: A systematic review and meta-analysis of prospective studies journal December 2016
Projections of Alzheimer's disease in the United States and the public health impact of delaying disease onset. journal September 1998
Identification of Physician-Diagnosed Alzheimer’s Disease and Related Dementias in Population-Based Administrative Data: A Validation Study Using Family Physicians’ Electronic Medical Records journal August 2016

Cited By (23)

Period, birth cohort and prevalence of dementia in mainland China, Hong Kong and Taiwan: a meta‐analysis journal May 2014
Prevalence of dementia in East Asia: a synthetic review of time trends journal May 2015
Clinical Implications of Quantitative Electroencephalography and Current Source Density in Patients with Alzheimer’s Disease journal June 2012
Cognitive Function and Quality of Life in Community-Dwelling Seniors with Mild Cognitive Impairment in Taiwan journal March 2016
Approaches in methodology for population-based longitudinal study on neuroprotective model for healthy longevity (TUA) among Malaysian Older Adults journal December 2015
The prevalence of mild cognitive impairment and its etiological subtypes in elderly Chinese journal January 2014
Current and past leisure time physical activity in relation to risk of Alzheimer's disease in older adults journal October 2019
The prevalence and incidence of dementia with Lewy bodies: a systematic review of population and clinical studies journal March 2013
Impact of illiteracy on depression symptomatology in community-dwelling older adults journal June 2014
The role of depression in the insomnia of people with subjective memory impairment, mild cognitive impairment, and dementia in a community sample of elderly individuals in South Korea journal December 2016
Association between lifestyle and cognitive impairment among women aged 65 years and over in the Republic of Korea journal September 2015
Mental health service utilization among Korean elders in Korean churches: preliminary findings from the Memory and Aging Study of Koreans in Maryland (MASK-MD) journal July 2013
One-year mortality among newly admitted older patients in a long-term care hospital in South Korea journal July 2018
Mild cognitive impairment: a concept in evolution journal March 2014
Alzheimer's disease with cerebrovascular disease: current status in the Asia-Pacific region journal March 2016
Rehabilitation of lost teeth related to maintenance of cognitive function journal February 2018
Productive Activities and Risk of Cognitive Impairment and Depression: Does the Association Vary by Gender? journal December 2019
Burden of disease due to dementia in the elderly population of Korea: present and future journal April 2013
Incidence and predictors of mild cognitive impairment (MCI) within a multi-ethnic Asian populace: a community-based longitudinal study journal August 2019
The changing prevalence and incidence of dementia over time — current evidence text January 2017
Cancer Prevention Using Machine Learning, Nudge Theory and Social Impact Bond journal January 2020
Traditional Korean East Asian Medicines and Herbal Formulations for Cognitive Impairment journal November 2013
Cognitive Stimulation as a Therapeutic Modality for Dementia: A Meta-Analysis journal January 2017