skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: STExNMF: Spatio-Temporally Exclusive Topic Discovery for Anomalous Event Detection

Conference ·
DOI:https://doi.org/10.1109/ICDM.2017.53· OSTI ID:1426578

Understanding newly emerging events or topics associated with a particular region of a given day can provide deep insight on the critical events occurring in highly evolving metropolitan cities. We propose herein a novel topic modeling approach on text documents with spatio-temporal information (e.g., when and where a document was published) such as location-based social media data to discover prevalent topics or newly emerging events with respect to an area and a time point. We consider a map view composed of regular grids or tiles with each showing topic keywords from documents of the corresponding region. To this end, we present a tilebased spatio-temporally exclusive topic modeling approach called STExNMF, based on a novel nonnegative matrix factorization (NMF) technique. STExNMF mainly works based on the two following stages: (1) first running a standard NMF of each tile to obtain general topics of the tile and (2) running a spatiotemporally exclusive NMF on a weighted residual matrix. These topics likely reveal information on newly emerging events or topics of interest within a region. We demonstrate the advantages of our approach using the geo-tagged Twitter data of New York City. We also provide quantitative comparisons in terms of the topic quality, spatio-temporal exclusiveness, topic variation, and qualitative evaluations of our method using several usage scenarios. In addition, we present a fast topic modeling technique of our model by leveraging parallel computing.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1426578
Resource Relation:
Conference: IEEE International Conference on Data Mining (ICDM) - New Orleans, Louisiana, United States of America - 11/18/2017 3:00:00 PM-11/21/2017 3:00:00 PM
Country of Publication:
United States
Language:
English

References (20)

Fast rank-2 nonnegative matrix factorization for hierarchical document clustering conference January 2013
Learning the parts of objects by non-negative matrix factorization journal October 1999
The Joint Inference of Topic Diffusion and Evolution in Social Communities conference December 2011
Combining activity-evaluation information with NMF for trust-link prediction in social media conference October 2015
L-EnsNMF: Boosted Local Topic Discovery via Ensemble of Nonnegative Matrix Factorization conference December 2016
Interest mining from user tweets conference January 2013
Applying Fourth-Order Partial Differential Equations and Contrast Enhancement to Fluorescence Microscopic Image Denoising book January 2012
Interactive Visual Discovering of Movement Patterns from Sparsely Sampled Geo-tagged Social Media Data journal January 2016
Discovering geographical topics in the twitter stream conference April 2012
An improved data stream summary: the count-min sketch and its applications journal April 2005
Simultaneous Discovery of Common and Discriminative Topics via Joint Nonnegative Matrix Factorization conference August 2015
Spatio-Temporal Topic Modeling in Mobile Social Media for Location Recommendation conference December 2013
TargetVue: Visual Analysis of Anomalous User Behaviors in Online Communication Systems journal January 2016
Nonnegative Matrix Factorization Based on Alternating Nonnegativity Constrained Least Squares and Active Set Method journal January 2008
Tiara conference July 2010
Locally discriminative topic modeling journal January 2012
PairFac conference October 2016
LPTA: A Probabilistic Model for Latent Periodic Topic Analysis conference December 2011
TopicSketch: Real-Time Bursty Topic Detection from Twitter journal August 2016
MedLDA conference June 2009