DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Zika discourse in the Americas: A multilingual topic analysis of Twitter

Journal Article · · PLoS ONE

Article Authors Metrics Comments Media Coverage Abstract Introduction Materials and methods Results Discussion Acknowledgments References Reader Comments (0) Media Coverage (0) Figures Abstract This work examines Twitter discussion surrounding the 2015 outbreak of Zika, a virus that is most often mild but has been associated with serious birth defects and neurological syndromes. We introduce and analyze a collection of 3.9 million tweets mentioning Zika geolocated to North and South America, where the virus is most prevalent. Using a multilingual topic model, we automatically identify and extract the key topics of discussion across the dataset in English, Spanish, and Portuguese. We examine the variation in Twitter activity across time and location, finding that rises in activity tend to follow to major events, and geographic rates of Zika-related discussion are moderately correlated with Zika incidence (ρ = .398).

Research Organization:
Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
89233218CNA000001
OSTI ID:
1526952
Report Number(s):
LA-UR-18-25885
Journal Information:
PLoS ONE, Vol. 14, Issue 5; ISSN 1932-6203
Publisher:
Public Library of ScienceCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 23 works
Citation information provided by
Web of Science

References (76)

Early Assessment of Anxiety and Behavioral Response to Novel Swine-Origin Influenza A(H1N1) journal December 2009
Computer-Assisted Text Analysis for Comparative Politics journal January 2015
Early Assessment of Anxiety and Behavioral Response to Novel Swine-Origin Influenza A(H1N1) journal December 2009
Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance journal October 2015
Zika tweets and topics (2015-03-01 to 2016-10-31) dataset January 2019
A large-scale quantitative analysis of latent factors and sentiment in online doctor reviews journal November 2014
What are we ‘tweeting’ about obesity? Mapping tweets with topic modeling and Geographic Information System journal March 2013
The Effect of Population and "Structural" Biases on Social Media-based Algorithms: A Case Study in Geolocation Inference Across the Urban-Rural Spectrum conference January 2017
Possible Association Between Zika Virus Infection and Microcephaly — Brazil, 2015 journal January 2016
Virtual Zika transmission after the first U.S. case: who said what and how it spread on Twitter journal May 2018
Forecasting Zika Incidence in the 2016 Latin America Outbreak Combining Traditional Disease Surveillance with Search, Social Media, and News Report Data journal January 2017
Polylingual topic models
  • Mimno, David; Wallach, Hanna M.; Naradowsky, Jason
  • Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing Volume 2 - EMNLP '09 https://doi.org/10.3115/1699571.1699627
conference January 2009
Mass Media and the Contagion of Fear: The Case of Ebola in America journal June 2015
Applications of Topic Models journal January 2017
Cheap Translation for Cross-Lingual Named Entity Recognition conference January 2017
Disease Detection or Public Opinion Reflection? Content Analysis of Tweets, Other Social Media, and Online Newspapers During the Measles Outbreak in the Netherlands in 2013 journal January 2015
Mining multilingual topics from wikipedia conference January 2009
Zika Virus Infection With Prolonged Maternal Viremia and Fetal Brain Abnormalities journal March 2017
Possible Association Between Zika Virus Infection and Microcephaly — Brazil, 2015 journal January 2016
Forecasting Zika Incidence in the 2016 Latin America Outbreak Combining Traditional Disease Surveillance with Search, Social Media, and News Report Data journal January 2017
Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance text January 2015
Computer-Assisted Keyword and Document Set Discovery from Unstructured Text: KEYWORD AND DOCUMENT SET DISCOVERY journal April 2017
Detecting themes of public concern: A text mining analysis of the Centers for Disease Control and Prevention's Ebola live Twitter chat journal October 2015
Redundancy-Aware Topic Modeling for Patient Record Notes text January 2014
E-Cigarette Surveillance With Social Media Data: Social Bots, Emerging Topics, and Trends journal January 2017
What Are People Tweeting About Zika? An Exploratory Study Concerning Its Symptoms, Treatment, Transmission, and Prevention journal January 2017
The spread of awareness and its impact on epidemic outbreaks journal March 2009
#Healthy Selfies: Exploration of Health Topics on Instagram journal January 2018
Possible Association Between Zika Virus Infection and Microcephaly — Brazil, 2015 journal January 2016
Probabilistic topic models journal April 2012
Geographic Maldistribution of Primary Care for Children journal January 2012
Probabilistic topic models conference January 2011
Zika in Twitter: Temporal Variations of Locations, Actors, and Concepts journal January 2017
The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies journal January 2011
Monitoring Public Health Concerns Using Twitter Sentiment Classifications conference September 2013
What are we ‘tweeting’ about obesity? Mapping tweets with topic modeling and Geographic Information System journal March 2013
Redundancy-Aware Topic Modeling for Patient Record Notes journal February 2014
Redundancy-Aware Topic Modeling for Patient Record Notes journal February 2014
Global reaction to the recent outbreaks of Zika virus: Insights from a Big Data analysis journal September 2017
Risk perception and the media journal January 2000
Zika tweets and topics (2015-03-01 to 2016-10-31) dataset January 2019
Comparing Apples to Apple: The Effects of Stemmers on Topic Models journal December 2016
Empirical study of topic modeling in Twitter conference January 2010
Discovering Health Topics in Social Media Using Topic Models journal August 2014
The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies journal January 2011
Effective vaccine communication during the disneyland measles outbreak journal June 2016
Risk perception and the media journal January 2000
Comparing grounded theory and topic modeling: Extreme divergence or unlikely convergence?
  • Baumer, Eric P. S.; Mimno, David; Guha, Shion
  • Journal of the Association for Information Science and Technology, Vol. 68, Issue 6 https://doi.org/10.1002/asi.23786
journal April 2017
Geographic Maldistribution of Primary Care for Children journal December 2010
Global reaction to the recent outbreaks of Zika virus: Insights from a Big Data analysis journal September 2017
Comparing Apples to Apple: The Effects of Stemmers on Topic Models journal December 2016
Comparing grounded theory and topic modeling: Extreme divergence or unlikely convergence?
  • Baumer, Eric P. S.; Mimno, David; Guha, Shion
  • Journal of the Association for Information Science and Technology, Vol. 68, Issue 6 https://doi.org/10.1002/asi.23786
journal April 2017
Geographic Maldistribution of Primary Care for Children journal December 2010
Investigating task performance of probabilistic topic models: an empirical study of PLSA and LDA journal August 2010
Discovering Health Topics in Social Media Using Topic Models journal August 2014
Statistical machine translation journal August 2008
Twitter Improves Influenza Forecasting journal January 2014
Social media for large studies of behavior journal November 2014
Flu Gone Viral: Syndromic Surveillance of Flu on Twitter Using Temporal Topic Models conference December 2014
Group chats on Twitter conference January 2013
What makes people talk about Ebola on social media? A retrospective analysis of Twitter use journal January 2015
Dictionary-based techniques for cross-language information retrieval journal May 2005
Dictionary-based techniques for cross-language information retrieval journal May 2005
What Are People Tweeting About Zika? An Exploratory Study Concerning Its Symptoms, Treatment, Transmission, and Prevention journal January 2017
The spread of awareness and its impact on epidemic outbreaks journal March 2009
Applications of Topic Models journal January 2017
Zika in Twitter: Temporal Variations of Locations, Actors, and Concepts journal January 2017
Disease Detection or Public Opinion Reflection? Content Analysis of Tweets, Other Social Media, and Online Newspapers During the Measles Outbreak in the Netherlands in 2013 journal January 2015
A Literature Review of Zika Virus journal July 2016
Computer-Assisted Keyword and Document Set Discovery from Unstructured Text: KEYWORD AND DOCUMENT SET DISCOVERY journal April 2017
Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance journal October 2015
Machine Reading Tea Leaves: Automatically Evaluating Topic Coherence and Topic Model Quality
  • Lau, Jey Han; Newman, David; Baldwin, Timothy
  • Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics https://doi.org/10.3115/v1/E14-1056
conference January 2014
Investigating task performance of probabilistic topic models: an empirical study of PLSA and LDA journal August 2010
Zika Virus Infection with Prolonged Maternal Viremia and Fetal Brain Abnormalities journal June 2016
#Healthy Selfies: Exploration of Health Topics on Instagram journal January 2018
Diagnosing and Improving Topic Models by Analyzing Posterior Variability journal April 2018

Cited By (1)


Figures / Tables (8)