Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Learning Global Proliferation Expertise Evolution Using AI-Driven Analytics and Public Information

Journal Article · · IEEE Transactions on Nuclear Science
Detecting and anticipating global proliferation expertise and capability evolution from unstructured, noisy, and incomplete public data streams is a highly desired, but extremely challenging task. Here, in this article, we present our pioneering data-driven approach to support the non-proliferation mission to detect and explain the evolution of proliferation expertise and capability development globally from terabytes of publicly available information (PAI), focusing on our knowledge extraction pipeline and descriptive analytics. We first discuss how we fuse nine open-source data streams, including multilingual data, to convert 4 TB of unstructured data to structured knowledge and encode dynamically evolving proliferation expertise representations—content and context graphs. For this, we rely on natural language processing (NLP) and deep learning (DL) models to perform information extraction, topic modeling, and distributed text representation (aka embedding) learning. We then present interactive, usable, and explainable descriptive analytics to refine domain knowledge and present it in a human-understandable form. Finally, we introduce future work avenues that will leverage our dynamic knowledge representations and descriptive analytics to enable predictive and prescriptive inferences to achieve real-time domain understanding and contextual reasoning about global proliferation expertise and capability evolution.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States); Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA); USDOE Office of Science (SC)
Grant/Contract Number:
AC02-05CH11231; AC05-76RL01830
OSTI ID:
1902232
Alternate ID(s):
OSTI ID: 1961815
Report Number(s):
PNNL-SA-164401
Journal Information:
IEEE Transactions on Nuclear Science, Journal Name: IEEE Transactions on Nuclear Science Journal Issue: 6 Vol. 69; ISSN 0018-9499
Publisher:
IEEECopyright Statement
Country of Publication:
United States
Language:
English

References (30)

Analyzing scientific networks for nuclear capabilities assessment
  • Kas, Miray; Khadka, Alla G.; Frankenstein, William
  • Journal of the American Society for Information Science and Technology, Vol. 63, Issue 7 https://doi.org/10.1002/asi.22678
journal April 2012
Explaining and predicting human behavior and social dynamics in simulated virtual worlds: reproducibility, generalizability, and robustness of causal discovery methods journal November 2021
The automatic normalisation challenge: detailed addresses identification journal February 2013
Institution name disambiguation for research assessment journal December 2013
Collecting large-scale publication data at the level of individual researchers: a practical proposal for author name disambiguation journal March 2020
Resilient and Trustworthy Dynamic Data-driven Application Systems (DDDAS) Services for Crisis Management Environments journal January 2015
A jurisdictional maturity model for risk management, accountability and continual improvement of abandoned mine remediation programs journal March 2015
Bayesian network analysis of safety culture and organizational culture in a nuclear power plant journal March 2013
Finding community structure in very large networks journal December 2004
Content-based features predict social media influence operations journal July 2020
Science of science journal March 2018
Exploiting citation networks for large-scale author name disambiguation journal September 2014
Early detection of promoted campaigns on social media journal July 2017
Learning from Dynamic User Interaction Graphs to Forecast Diverse Social Behavior conference November 2019
On the Detection of Disinformation Campaign Activity with Network Analysis conference November 2020
The Proposition Bank: An Annotated Corpus of Semantic Roles journal March 2005
“Participant” Perceptions of Twitter Research Ethics journal January 2018
Forecasting influenza-like illness dynamics for military populations using neural networks and social media journal December 2017
Modeling and prediction of the 2019 coronavirus disease spreading in China incorporating human migration data journal October 2020
Stanza: A Python Natural Language Processing Toolkit for Many Human Languages conference January 2020
Recurrent Event Network: Autoregressive Structure Inferenceover Temporal Knowledge Graphs conference January 2020
Identifying Causal Influences on Publication Trends and Behavior: A Case Study of the Computational Linguistics Community conference January 2021
End-to-end Neural Coreference Resolution conference January 2017
Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction conference January 2018
Deep Semantic Role Labeling: What Works and What’s Next
  • He, Luheng; Lee, Kenton; Lewis, Mike
  • Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) https://doi.org/10.18653/v1/P17-1044
conference January 2017
ESTEEM: A Novel Framework for Qualitatively Evaluating and Visualizing Spatiotemporal Embeddings in Social Media conference January 2017
Big Data-Enhanced Risk Management journal July 2019
Visualization of bibliometric networks of scientific publications on the study of the human factor in the operation of nuclear power plants based on the bibliographic database Dimensions journal January 2020
Semantic Role Labeling with Pretrained Language Models for Known and Unknown Predicates conference October 2019
Construction of a Turkish proposition bank journal January 2018