| | |
Summary: A Query Language for Understanding
Component Interactions in Production Systems
Adam J. Oliner and Alex Aiken
Stanford University
Department of Computer Science
{oliner, aiken}@cs.stanford.edu
ABSTRACT
When something unexpected happens in a large production system,
administrators must first perform a search to isolate which compo-
nents and component interactions are likely to be involved. The
system may consist of thousands of interacting subsystems, the log-
ging instrumentation may be noisy or incomplete, and the problem
description may be vague, so this search is often the most difficult
part of understanding the system's behavior. To facilitate the search
process, we present a query language and a method for computing
these queries that makes minimal assumptions about the available
data. We evaluate our method on nearly 1.22 billion lines of sys-
tem logs from four supercomputers, two autonomous vehicles, and
a server cluster.
Categories and Subject Descriptors
|