Choosing the best partition of the output from a large-scale simulation

Challacombe, Chelsea Jordan; Casleton, Emily Michele

doi:10.2172/1396090

Title: Choosing the best partition of the output from a large-scale simulation

Technical Report · Tue Sep 26 00:00:00 EDT 2017

DOI:https://doi.org/10.2172/1396090· OSTI ID:1396090

Challacombe, Chelsea Jordan ^[1]; Casleton, Emily Michele ^[1]

Los Alamos National Lab. (LANL), Los Alamos, NM (United States)

Data partitioning becomes necessary when a large-scale simulation produces more data than can be feasibly stored. The goal is to partition the data, typically so that every element belongs to one and only one partition, and store summary information about the partition, either a representative value plus an estimate of the error or a distribution. Once the partitions are determined and the summary information stored, the raw data is discarded. This process can be performed in-situ; meaning while the simulation is running. When creating the partitions there are many decisions that researchers must make. For instance, how to determine once an adequate number of partitions have been created, how are the partitions created with respect to dividing the data, or how many variables should be considered simultaneously. In addition, decisions must be made for how to summarize the information within each partition. Because of the combinatorial number of possible ways to partition and summarize the data, a method of comparing the different possibilities will help guide researchers into choosing a good partitioning and summarization scheme for their application.

View Technical Report

Cite

Export

Save

Research Organization:: Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)

Sponsoring Organization:: USDOE Office of Science (SC). Advanced Scientific Computing Research (ASCR) (SC-21)

DOE Contract Number:: AC52-06NA25396

OSTI ID:: 1396090

Report Number(s):: LA-UR-17-28730

Country of Publication:: United States

Language:: English

Similar Records

Argonne National Laboratory summary site environmental report for calendar year 2007.

Technical Report · Fri May 22 00:00:00 EDT 2009 · OSTI ID:1396090

Golchert, N W

PipeSight: A High-Performance Computing Platform for Pipeline Integrity Management

Technical Report · Wed Mar 01 00:00:00 EST 2023 · OSTI ID:1396090

Spring, Daniel; Panzarella, Charles; Stenta, Aaron; +2 more

Used fuel disposition campaign international activities implementation plan.

Technical Report · Wed Jun 29 00:00:00 EDT 2011 · OSTI ID:1396090

Nutt, W M

Related Subjects

97 MATHEMATICS AND COMPUTING

Title: Choosing the best partition of the output from a large-scale simulation

Citation Formats

Similar Records

Related Subjects