Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Opportunities for retrieval and tool augmented large language models in scientific facilities

Journal Article · · npj Computational Materials

Abstract

Upgrades to advanced scientific user facilities such as next-generation x-ray light sources, nanoscience centers, and neutron facilities are revolutionizing our understanding of materials across the spectrum of the physical sciences, from life sciences to microelectronics. However, these facility and instrument upgrades come with a significant increase in complexity. Driven by more exacting scientific needs, instruments and experiments become more intricate each year. This increased operational complexity makes it ever more challenging for domain scientists to design experiments that effectively leverage the capabilities of and operate on these advanced instruments. Large language models (LLMs) can perform complex information retrieval, assist in knowledge-intensive tasks across applications, and provide guidance on tool usage. Using x-ray light sources, leadership computing, and nanoscience centers as representative examples, we describe preliminary experiments with a Context-Aware Language Model for Science (CALMS) to assist scientists with instrument operations and complex experimentation. With the ability to retrieve relevant information from facility documentation, CALMS can answer simple questions on scientific capabilities and other operational procedures. With the ability to interface with software tools and experimental hardware, CALMS can conversationally operate scientific instruments. By making information more accessible and acting on user needs, LLMs could expand and diversify scientific facilities’ users and accelerate scientific output.

Sponsoring Organization:
USDOE
Grant/Contract Number:
NONE; AC02-06CH11357
OSTI ID:
2476271
Alternate ID(s):
OSTI ID: 2574416
Journal Information:
npj Computational Materials, Journal Name: npj Computational Materials Journal Issue: 1 Vol. 10; ISSN 2057-3960
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (15)

Ingrained: An Automated Framework for Fusing Atomic‐Scale Image Simulations into Experiments journal April 2022
Large Language Models in the Workplace: A Case Study on Prompt Engineering for Job Type Classification book January 2023
ChatGPT for good? On opportunities and challenges of large language models for education journal April 2023
Scientists used ChatGPT to generate an entire paper from scratch — but is it any good? journal July 2023
Prepare for truly useful large language models journal March 2023
The future of chemistry is language journal May 2023
Augmenting large language models with chemistry tools journal May 2024
Domain-specific chatbots for science using embeddings journal October 2023
14 examples of how LLMs can transform materials science and chemistry: a reflection on a large language model hackathon journal January 2023
MaScQA: investigating materials science knowledge of large language models journal January 2024
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
Self-Instruct: Aligning Language Models with Self-Generated Instructions conference January 2023
Large Language Models Can Self-Improve conference January 2023
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
  • Reimers, Nils; Gurevych, Iryna
  • Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) https://doi.org/10.18653/v1/D19-1410
conference January 2019
Is GPT-3 all you need for low-data discovery in chemistry? preprint February 2023

Similar Records

CALMS: CONTEXT-AWARE LANGUAGE MODEL FOR SCIENCE
Software · Tue Apr 09 20:00:00 EDT 2024 · OSTI ID:code-125944

ESAC (EQ-SANS Assisting Chatbot): Application of large language models and retrieval-augmented generation for enhanced user experience at EQ-SANS
Journal Article · Wed Jun 11 00:00:00 EDT 2025 · SoftwareX · OSTI ID:2575355

Related Subjects