skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information


; ; ; ; ;
Publication Date:
Research Org.:
Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
Sponsoring Org.:
OSTI Identifier:
Report Number(s):
DOE Contract Number:
Resource Type:
Resource Relation:
Conference: Presented at: 2017 IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Curaao, Netherlands, Dec 10 - Dec 13, 2017
Country of Publication:
United States

Citation Formats

Kailkhura, B, Ray, P, Rajan, D, Yen, A, Barnes, P, and Goldhahn, R. BYZANTINE-RESILIENT COLLABORATIVE AUTONOMOUS DETECTION. United States: N. p., 2017. Web.
Kailkhura, B, Ray, P, Rajan, D, Yen, A, Barnes, P, & Goldhahn, R. BYZANTINE-RESILIENT COLLABORATIVE AUTONOMOUS DETECTION. United States.
Kailkhura, B, Ray, P, Rajan, D, Yen, A, Barnes, P, and Goldhahn, R. 2017. "BYZANTINE-RESILIENT COLLABORATIVE AUTONOMOUS DETECTION". United States. doi:.
author = {Kailkhura, B and Ray, P and Rajan, D and Yen, A and Barnes, P and Goldhahn, R},
abstractNote = {},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = 2017,
month = 6

Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share:
  • One predominant barrier encountered in furthering research and development efforts aimed at facilitating resilient HPC applications is a substantial lack of existing reliability and performance data originating from extreme-scale computing distributions. In order to develop an understanding of how and why highly scaled HPC applications are encountering increasingly frequent performance interruptions, one must conduct extensive trending and analysis on contemporary machines and their associated programs. However, existing HPC application log files are labyrinthine documents that, even with the assistance of intelligent data mining algorithms, translate poorly to human discern. In addition, conventional log filtering, processes are limited to execution withinmore » a post-mortem, reactive time period, as the enormous size of these documents prevents efficient real time interaction. Thus, there exists a strong need within the HPC field for the provision of accurate-yet-concise real time application information. Moreover, the means of reporting this data must be sufficiently lightweight and non-intrusive, as to successfully-yet-discretely attach itself to the multiple processes running on multiple cores within tens (or in some cases, hundreds) of thousands of compute nodes. Furthermore, this information should in turn be used to facilitate the autonomous correction of application-threatening faults, suspensions, and interruptions. This paper describes a dynamic application instrumentation module (utilizing a combination of Open/SpeedShop software and custom scripting) aimed at successfully achieving these goals.« less
  • No abstract prepared.
  • Two detection methods based solely on sound and electromagnetic emissions form the arc are presented. Detection of arc induced sound signals 40 to 50 dB above background noise are observed. Detection of arc induced low radio frequency (HF) electromagnetic noise levels 20 to 60 dB above background are observed. The arc noise is randomly strongest on side A and/or B of the DIII-D rf system. The sum of these sensors correlates with tripping due to an increase in the rf reflection coefficient. The sensors are resilient to ELMs and other plasma noise.
  • Resilient control systems in critical infrastructures require increased cyber-security and state-awareness. One of the necessary conditions for achieving the desired high level of resiliency is timely reporting and understanding of the status and behavioral trends of the control system. This paper describes the design and development of a neural-network based data-fusion system for increased state-awareness of resilient control systems. The proposed system consists of a dedicated data-fusion engine for each component of the control system. Each data-fusion engine implements three-layered alarm system consisting of: (1) conventional threshold-based alarms, (2) anomalous behavior detector using self-organizing maps, and (3) prediction error basedmore » alarms using neural network based signal forecasting. The proposed system was integrated with a model of the Idaho National Laboratory Hytest facility, which is a testing facility for hybrid energy systems. Experimental results demonstrate that the implemented data fusion system provides timely plant performance monitoring and cyber-state reporting.« less