skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: DICER: Data Intensive Computing Environment and Runtime for Evaluating Unprecedented Scale of Geospatial-Temporal Human Mobility Data

Conference ·

With the significant increase in sources and volume of human mobility data through commercial data vendors as well as microsimulation of cities, the scale of geospatial-temporal data to analyze and assess for mobility characterization has grown to the level of Big Data. There are mobility related commercial organizations deploying scalable computing, but often the system architecture, workflow, and intermediate processing components are not fully disclosed in relevant scope. Current research literature has a notable lack of studies demonstrating architectures and workflows for human mobility analytics that are implemented on a TeraByte scale of geospatial-temporal data. In this context, this paper presents a hyperscale-level system solution named DICER (Data Intensive Computing Environment and Runtime) for processing and analytics of geospatial-temporal data at big data scale. Although the cluster computing architecture of DICER with Apache Spark job running on Kubernetes cluster is not new, there are innovations in the workflow, hierarchical processing logic, and a wide range of intermediate preprocessing and mobility metrics calculation. We have performed case studies to validate the effectiveness of DICER system solution by performing detailed analytics and assessment of human mobility microsimulation output at three different scopes and scale, including a usecase with 16.97 TeraByte and 259.2 Billion rows of data. In addition, we have presented another case study of utilizing DICER to perform the same mobility processing and comparative analytics on large-scale commercially available geospatial-temporal data. All these case studies validate the efficiency and usefulness of DICER in computing population mobility characteristics from geospatial-temporal trajectory data at an unprecedented scale (not only just data volume, but also combination of: number of user entities, temporal frequency, spatial resolution, data duration).

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
2429834
Resource Relation:
Conference: The 25th IEEE International Conference on Mobile Data Management (IEEE MDM 2024) - Brussels, , Belgium - 6/24/2024 4:00:00 AM-6/27/2024 4:00:00 AM
Country of Publication:
United States
Language:
English

References (10)

IMPORTANT: a framework to systematically analyze the Impact of Mobility on Performance of Routing Protocols for Adhoc Networks conference January 2003
Real-Time Urban Monitoring Using Cell Phones: A Case Study in Rome journal March 2011
A Survey on Big Data Processing Frameworks for Mobility Analytics journal August 2021
Unveiling the complexity of human mobility by querying and mining massive trajectory data journal July 2011
Understanding individual human mobility patterns journal June 2008
Planet-scale human mobility measurement conference June 2010
Cellular Census: Explorations in Urban Data Collection journal July 2007
Limits of Predictability in Human Mobility journal February 2010
COBRA: A framework for the analysis of realistic mobility models conference April 2013
GeoSpark conference November 2015