U.S. Department of Energy
Office of Scientific and Technical Information

Mining and Validating Social Media Data for COVID-19–Related Human Behaviors Between January and July 2020: Infodemiology Study

Journal Article · Journal of Medical Internet Research
DOI: https://doi.org/10.2196/27059 · OSTI ID: 1827580
 [1];  [1];  [1];  [1];  [2];  [1];  [1];  [1];  [2];  [1];  [1];  [1]
  1. Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
  2. Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Univ. of New Mexico, Albuquerque, NM (United States)
Background: Health authorities can minimize the impact of an emergent infectious disease outbreak through effective and timely risk communication, which can build trust and adherence to subsequent behavioral messaging. Monitoring the psychological impacts of an outbreak, as well as public adherence to such messaging, is also important for minimizing long-term effects of an outbreak.
Objective: We used social media data from Twitter to identify human behaviors relevant to COVID-19 transmission, as well as the perceived impacts of COVID-19 on individuals, as a first step toward real-time monitoring of public perceptions to inform public health communications.
Methods: We developed a coding schema for 6 categories and 11 subcategories, which included both a wide range of behaviors and codes focused on the impacts of the pandemic (eg, economic and mental health impacts). We used this schema to develop training data and supervised learning classifiers for classes with sufficient labels. Classifiers that performed adequately were applied to our remaining corpus, and temporal and geospatial trends were assessed. We compared the classified patterns to ground truth mobility data and actual COVID-19 confirmed cases to assess the signal achieved here.
Results: We applied our labeling schema to approximately 7200 tweets. The worst-performing classifiers had F1 scores of only 0.18 to 0.28 when trying to identify tweets about monitoring symptoms and testing. Classifiers about social distancing, however, were much stronger, with F1 scores of 0.64 to 0.66. We applied the social distancing classifiers to over 228 million tweets. We showed temporal patterns consistent with real-world events, and we showed correlations of up to –0.5 between social distancing signals on Twitter and ground truth mobility throughout the United States.
Conclusions: Behaviors discussed on Twitter are exceptionally varied. Twitter can provide useful information for parameterizing models that incorporate human behavior, as well as for informing public health communication strategies by describing awareness of and compliance with suggested behaviors.
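The two metrics quoted in the Results (F1 score for classifier quality, and a correlation between a Twitter signal and mobility) can be illustrated with a minimal sketch. This is not the authors' code: the function names, the toy labels, and the use of Pearson correlation are assumptions made here for illustration, since the abstract does not state which correlation measure or tooling was used.

```python
from math import sqrt


def f1_score(y_true, y_pred):
    """F1 for binary labels, where 1 means 'tweet belongs to the category'."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both 0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length numeric series."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical toy data, not taken from the study.
human_labels = [1, 1, 1, 0, 0, 0]      # hand-coded tweets
model_labels = [1, 1, 0, 1, 0, 0]      # classifier predictions
distancing_share = [1.0, 2.0, 3.0, 4.0]  # e.g. weekly share of distancing tweets
mobility_index = [8.0, 6.0, 4.0, 2.0]    # e.g. weekly mobility measure

f1 = f1_score(human_labels, model_labels)
r = pearson_r(distancing_share, mobility_index)
```

A negative correlation, as reported in the Results, would mean that mobility tends to fall as the share of social distancing tweets rises.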
Research Organization:
Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
USDOE Laboratory Directed Research and Development (LDRD) Program; USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
89233218CNA000001
OSTI ID:
1827580
Report Number(s):
LA-UR--21-20074
Journal Information:
Journal Name: Journal of Medical Internet Research; Journal Volume: 23; Journal Issue: 5; ISSN 1438-8871
Publisher:
JMIR Publications
Country of Publication:
United States
Language:
English
