Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

The Bird project: Using Big Data tools to support Search Analytics

Technical Report ·
DOI:https://doi.org/10.2172/1505412· OSTI ID:1505412
 [1];  [1]
  1. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

The Bird project explored the use of big data analytics tool to improve the findability of information within the Sandia internal network. We were able to perform query classification utilizing the supervised learning algorithms in the Apache Spark library. By relying on the distributed processing capabilities provided by the Apache Hadoop framework, we successfully processed the large query log files needed to train the models in this effort. The capabilities developed in this project are being used to enhance the effectiveness of the enterprise search engine.

Research Organization:
Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
DOE Contract Number:
AC04-94AL85000
OSTI ID:
1505412
Report Number(s):
SAND--2017-0406; 650436
Country of Publication:
United States
Language:
English

Similar Records

Deactivation and decommissioning web log analysis using big data technology - 15710
Conference · Wed Jul 01 00:00:00 EDT 2015 · OSTI ID:22824525

Offloading Calculations to Computational Storage Devices: Spark and HDFS [Slides]
Technical Report · Thu Aug 12 00:00:00 EDT 2021 · OSTI ID:1813800

A View from ORNL: Scientific Data Research Opportunities in the Big Data Age
Conference · Sun Jul 01 00:00:00 EDT 2018 · OSTI ID:1468120