skip to main content

Title: Large-scale data mining pilot project in human genome

This whitepaper briefly describes a new, aggressive effort in large- scale data Livermore National Labs. The implications of `large- scale` will be clarified Section. In the short term, this effort will focus on several @ssion-critical questions of Genome project. We will adapt current data mining techniques to the Genome domain, to quantify the accuracy of inference results, and lay the groundwork for a more extensive effort in large-scale data mining. A major aspect of the approach is that we will be fully-staffed data warehousing effort in the human Genome area. The long term goal is strong applications- oriented research program in large-@e data mining. The tools, skill set gained will be directly applicable to a wide spectrum of tasks involving a for large spatial and multidimensional data. This includes applications in ensuring non-proliferation, stockpile stewardship, enabling Global Ecology (Materials Database Industrial Ecology), advancing the Biosciences (Human Genome Project), and supporting data for others (Battlefield Management, Health Care).
Authors:
; ;
Publication Date:
OSTI Identifier:
647050
Report Number(s):
UCRL-JC--127338; CONF-9705227--
ON: DE98051372
DOE Contract Number:
W-7405-ENG-48
Resource Type:
Conference
Resource Relation:
Conference: Workshop on research and development opportunities in Federal Information Systems, Arlington, VA (United States), 13-14 May 1997; Other Information: PBD: 1 May 1997
Research Org:
Lawrence Livermore National Lab., CA (United States)
Sponsoring Org:
USDOE, Washington, DC (United States)
Country of Publication:
United States
Language:
English
Subject:
99 MATHEMATICS, COMPUTERS, INFORMATION SCIENCE, MANAGEMENT, LAW, MISCELLANEOUS; 55 BIOLOGY AND MEDICINE, BASIC STUDIES; HUMAN FACTORS ENGINEERING; HUMAN POPULATIONS; LAWRENCE LIVERMORE NATIONAL LABORATORY; INFORMATION SYSTEMS; DATA PROCESSING; PATTERN RECOGNITION; GENETIC MAPPING; DNA SEQUENCING