Massive Social Network Analysis: Mining Twitter for Social Good
Social networks produce an enormous quantity of data. Facebook has over 400 million active users who share over 5 billion pieces of information each month. Analyzing this vast quantity of unstructured data presents challenges for both software and hardware. We present GraphCT, a Graph Characterization Toolkit for massive graphs representing social network data. On a 128-processor Cray XMT, GraphCT estimates the betweenness centrality of an artificially generated (R-MAT) graph with 537 million vertices and 8.6 billion edges in 55 minutes. We use GraphCT to analyze public data from Twitter, a microblogging network. Twitter's message connections appear primarily tree-structured, as in a news dissemination system. Within the public data, however, are clusters of conversations. Using GraphCT, we can rank actors within these conversations and help analysts focus attention on a much smaller subset of the data.
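To illustrate what estimating betweenness centrality can involve, the following is a minimal Python sketch that runs Brandes-style shortest-path accumulation from a random sample of source vertices and scales the result up to the full vertex set. It is not GraphCT's implementation: the abstract does not describe the estimation method, so the source-sampling approach, the function name `approx_betweenness`, and its parameters are all assumptions made for illustration. The sketch assumes an unweighted graph given as an adjacency dictionary in which every neighbour also appears as a key.

```python
import random
from collections import deque


def approx_betweenness(adj, num_sources, seed=0):
    """Estimate betweenness centrality from a sample of BFS sources.

    Hypothetical helper, not part of GraphCT.
    adj: dict mapping each vertex to an iterable of neighbours (unweighted).
    Scores count ordered (source, target) pairs, so on an undirected graph
    each unordered pair contributes twice.
    """
    vertices = list(adj)
    n = len(vertices)
    k = min(num_sources, n)
    rng = random.Random(seed)
    sources = rng.sample(vertices, k)
    score = {v: 0.0 for v in vertices}

    for s in sources:
        # Forward phase: BFS from s, counting shortest paths (sigma).
        sigma = {v: 0 for v in vertices}
        dist = {v: -1 for v in vertices}
        preds = {v: [] for v in vertices}
        sigma[s] = 1
        dist[s] = 0
        order = []
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)

        # Backward phase: accumulate dependencies in reverse BFS order.
        delta = {v: 0.0 for v in vertices}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1.0 + delta[w])
            if w != s:
                score[w] += delta[w]

    # Extrapolate from the k sampled sources to all n possible sources.
    scale = n / k if k else 0.0
    return {v: score[v] * scale for v in vertices}
```

Setting `num_sources` to the number of vertices makes the sketch exact; smaller values trade accuracy for running time, which is the kind of trade-off that matters at the graph sizes quoted above.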
- Research Organization: Pacific Northwest National Laboratory (PNNL), Richland, WA (US)
- Sponsoring Organization: USDOE
- DOE Contract Number: AC05-76RL01830
- OSTI ID: 1032703
- Report Number(s): PNNL-SA-71335; 400470000
- Country of Publication: United States
- Language: English
Similar Records
State-of-the-Art of Social Media Analytics Research
Technical Report · Mon Dec 31 23:00:00 EST 2012 · OSTI ID:1077994
A Faster Parallel Algorithm and Efficient Multithreaded Implementations for Evaluating Betweenness Centrality on Massive Datasets
Conference · Fri May 29 00:00:00 EDT 2009 · OSTI ID:974005
Implementing and Evaluating Multithreaded Triad Census Algorithms on the Cray XMT
Conference · Fri May 29 00:00:00 EDT 2009 · OSTI ID:973732