Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

An LLNL perspective on ASCI data mining and pattern recognition requirements

Technical Report ·
DOI:https://doi.org/10.2172/9659· OSTI ID:9659

The working document has been put together by the members of the Sapphire project at LLNL. The goal of Sapphire is to apply and extend techniques from data mining and pattern recognition in order to detect automatically the areas of interest in very large data sets. The intent is to help scientists address the problem of data overload by providing them effective and efficient ways of exploring and analyzing massive data sets. One of the key areas where they expect this technology to be used is in the analysis of the output from ASCI simulations. It is expected that a simulation running on the 100 Tflop ASCI machine in the year 2004 will produce data at the rate of 12TB/hour. Given the difficulties they currently have in analyzing and visualizing a terabyte of data, it is imperative that they start planning now for ways that will make the analysis of petabyte data sets feasible. This document focuses on the relevance of data mining and pattern recognition to ASCI, discusses potential applications of these techniques in ASCI, and identifies research issues that arise as they apply the algorithms in these areas to massive data sets.

Research Organization:
Lawrence Livermore National Lab., CA (US)
Sponsoring Organization:
USDOE Office of Defense Programs (DP) (US)
DOE Contract Number:
W-7405-ENG-48
OSTI ID:
9659
Report Number(s):
UCRL-ID-132850; YN0100000; 99-ERI-010; YN0100000; 99-ERI-010
Country of Publication:
United States
Language:
English

Similar Records

LDRD 99-ERI-010 Final Report: Sapphire: Scalable Pattern Recognition for Large-Scale Scientific Data Mining
Technical Report · Tue Jan 29 23:00:00 EST 2002 · OSTI ID:15003138

On the design and implementation of a parallel, object-oriented, image processing toolkit
Conference · Thu Jun 22 00:00:00 EDT 2000 · OSTI ID:15006502

Scalable pattern recognition for large-scale scientific data mining
Technical Report · Sun Mar 22 23:00:00 EST 1998 · OSTI ID:310913