Chunking of Large Multidimensional Arrays

Rotem, Doron; Otoo, Ekow J; Seshadri, Sridhar

doi:10.2172/927033

Technical Report · February 28, 2007

DOI: https://doi.org/10.2172/927033 · OSTI ID: 927033

Data-intensive scientific computations, as well as on-line analytical processing applications, are done on very large datasets that are modeled as k-dimensional arrays. The storage organization of such arrays on disks is done by partitioning the large global array into fixed-size hyper-rectangular sub-arrays called chunks or tiles that form the units of data transfer between disk and memory. Typical queries involve the retrieval of sub-arrays in a manner that accesses all chunks that overlap the query results. An important metric of the storage efficiency is the expected number of chunks retrieved over all such queries. The question that immediately arises is "what shapes of array chunks give the minimum expected number of chunks over a query workload?" In this paper we develop two probabilistic mathematical models of the problem and provide exact solutions using steepest descent and geometric programming methods. Experimental results, using synthetic workloads on real-life data sets, show that our chunking is much more efficient than the existing approximate solutions.
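The metric named in the abstract, the expected number of chunks retrieved per query, can be illustrated with a minimal sketch. This is not the report's probabilistic models or its solution method. It assumes a single fixed query shape placed uniformly at random over all valid positions in the array, and the function names `chunks_touched` and `expected_chunks` are hypothetical.

```python
def chunks_touched(query_start, query_shape, chunk_shape):
    """Count the chunks overlapped by one hyper-rectangular query.

    All arguments are tuples with one integer per dimension.
    """
    count = 1
    for start, length, chunk in zip(query_start, query_shape, chunk_shape):
        first = start // chunk                  # index of first chunk touched
        last = (start + length - 1) // chunk    # index of last chunk touched
        count *= last - first + 1
    return count


def expected_chunks(array_shape, query_shape, chunk_shape):
    """Average chunks touched over all valid positions of a fixed query shape.

    Assumption (not from the report): every valid start position is
    equally likely. Start coordinates are then independent across
    dimensions, so the expectation is a product of per-dimension averages.
    """
    result = 1.0
    for n, q, c in zip(array_shape, query_shape, chunk_shape):
        positions = n - q + 1  # number of valid start offsets in this dimension
        total = sum((s + q - 1) // c - s // c + 1 for s in range(positions))
        result *= total / positions
    return result


if __name__ == "__main__":
    # Compare chunk shapes of equal volume (64 cells) for one assumed workload.
    for shape in [(4, 16), (8, 8), (16, 4)]:
        print(shape, expected_chunks((64, 64), (16, 2), shape))
```

Under this assumed workload, chunk shapes with the same volume can give different averages, which is the effect the chunk-shape question in the abstract is about.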

Research Organization: Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

Sponsoring Organization: USDOE Director. Office of Science. Advanced Scientific Computing Research

DOE Contract Number: DE-AC02-05CH11231

OSTI ID: 927033

Report Number(s): LBNL-63230; R&D Project: 429201; BnR: KJ0101030; TRN: US200810%%206

Country of Publication: United States

Language: English

Similar Records

Optimal Chunking of Large Multidimensional Arrays for Data Warehousing

Journal Article · February 15, 2008 · INFORMATION SYSTEMS · OSTI ID:927033

Otoo, Ekow J; Rotem, Doron; +1 more

Minimizing I/O Costs of Multi-Dimensional Queries with Bitmap Indices

Conference · March 30, 2006 · OSTI ID:927033

Rotem, Doron; Stockinger, Kurt; Wu, Kesheng

Extension of 4-8 Texture Hierarchies to Large Video Processing and Visualization

Technical Report · November 30, 2007 · OSTI ID:927033

Senecal, J G; Wegner, A E

Related Subjects

99
EFFICIENCY
EXACT SOLUTIONS
MATHEMATICAL MODELS
METRICS
PROCESSING
PROGRAMMING
STORAGE
Multi-dimensional Arrays Algorithm Array Chunking
