The Bird project: Using Big Data tools to support Search Analytics
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
The Bird project explored the use of big data analytics tools to improve the findability of information within the Sandia internal network. We performed query classification using the supervised learning algorithms in the Apache Spark library. By relying on the distributed processing capabilities of the Apache Hadoop framework, we successfully processed the large query log files needed to train the models in this effort. The capabilities developed in this project are being used to enhance the effectiveness of the enterprise search engine.
- Research Organization:
- Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA)
- DOE Contract Number:
- AC04-94AL85000
- OSTI ID:
- 1505412
- Report Number(s):
- SAND--2017-0406; 650436
- Country of Publication:
- United States
- Language:
- English
Similar Records
Deactivation and decommissioning web log analysis using big data technology - 15710
Conference · Wed Jul 01 00:00:00 EDT 2015 · OSTI ID: 22824525
Offloading Calculations to Computational Storage Devices: Spark and HDFS [Slides]
Technical Report · Thu Aug 12 00:00:00 EDT 2021 · OSTI ID: 1813800
A View from ORNL: Scientific Data Research Opportunities in the Big Data Age
Conference · Sun Jul 01 00:00:00 EDT 2018 · OSTI ID: 1468120