skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Domain-specific chatbots for science using embeddings

Journal Article · · Digital Discovery
DOI:https://doi.org/10.1039/D3DD00112A· OSTI ID:2202517
ORCiD logo [1]
  1. Center for Functional Nanomaterials, Brookhaven National Laboratory, Upton, New York 11973, USA

Large language models (LLMs) have emerged as powerful machine-learning systems capable of handling a myriad of tasks. Tuned versions of these systems have been turned into chatbots that can respond to user queries on a vast diversity of topics, providing informative and creative replies. However, their application to physical science research remains limited owing to their incomplete knowledge in these areas, contrasted with the needs of rigor and sourcing in science domains. Here, we demonstrate how existing methods and software tools can be easily combined to yield a domain-specific chatbot. The system ingests scientific documents in existing formats and uses text embedding lookup to provide the LLM with domain-specific contextual information when composing its reply. We similarly demonstrate that existing image embedding methods can be used for search and retrieval across publication figures. These results confirm that LLMs are already suitable for use by physical scientists in accelerating their research efforts.

Research Organization:
Brookhaven National Laboratory (BNL), Upton, NY (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC), Basic Energy Sciences (BES). Scientific User Facilities (SUF)
Grant/Contract Number:
SC0012704
OSTI ID:
2202517
Alternate ID(s):
OSTI ID: 2267566
Report Number(s):
BNL-225132-2023-JAAM; DDIIAI
Journal Information:
Digital Discovery, Journal Name: Digital Discovery Vol. 2 Journal Issue: 6; ISSN 2635-098X
Publisher:
Royal Society of Chemistry (RSC)Copyright Statement
Country of Publication:
United Kingdom
Language:
English

References (63)

What learning algorithm is in-context learning? Investigations with linear models preprint January 2022
Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting preprint January 2023
Sparks of Artificial General Intelligence: Early experiments with GPT-4 preprint January 2023
TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs preprint January 2023
CERMINE: automatic extraction of structured metadata from scientific literature journal July 2015
ReAct: Synergizing Reasoning and Acting in Language Models preprint January 2022
CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society preprint January 2023
QLoRA: Efficient Finetuning of Quantized LLMs preprint January 2023
On the Opportunities and Risks of Foundation Models preprint January 2021
Autonomous experimentation systems for materials development: A community perspective journal September 2021
High-Resolution Image Synthesis with Latent Diffusion Models preprint January 2021
Molecular Origin of Photovoltaic Performance in Donor- block -Acceptor All-Conjugated Block Copolymers journal October 2015
Fine-Tuning Language Models from Human Preferences preprint January 2019
Generative Artificial Intelligence: Trends and Prospects journal October 2022
Emergent analogical reasoning in large language models journal July 2023
Large Language Models are Zero-Shot Reasoners preprint January 2022
Autonomous discovery of emergent morphologies in directed self-assembly of block copolymer blends journal January 2023
Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback preprint January 2023
LoRA: Low-Rank Adaptation of Large Language Models preprint January 2021
Artificial muses: Generative Artificial Intelligence Chatbots Have Risen to Human-Level Creativity preprint January 2023
Emergent Abilities of Large Language Models preprint January 2022
Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task preprint January 2022
Lamellar and liquid crystal ordering in solvent-annealed all-conjugated block copolymers journal January 2014
Autonomous materials discovery driven by Gaussian process regression with inhomogeneous measurement noise and anisotropic kernels journal October 2020
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face preprint January 2023
ChatGPT is not all you need. A State of the Art Review of large Generative AI models preprint January 2023
Toolformer: Language Models Can Teach Themselves to Use Tools preprint January 2023
TruthfulQA: Measuring How Models Mimic Human Falsehoods preprint January 2021
Focused Transformer: Contrastive Training for Context Scaling preprint January 2023
Tree of Thoughts: Deliberate Problem Solving with Large Language Models preprint January 2023
Progress and prospects for accelerating materials science with automated and autonomous workflows journal January 2019
ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models preprint January 2023
LongNet: Scaling Transformers to 1,000,000,000 Tokens preprint January 2023
Novel photo-switching using azobenzene functional materials journal September 2006
Reflexion: Language Agents with Verbal Reinforcement Learning preprint January 2023
Hierarchical Text-Conditional Image Generation with CLIP Latents preprint January 2022
Galactica: A Large Language Model for Science preprint January 2022
Emergent autonomous scientific research capabilities of large language models preprint January 2023
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge preprint January 2018
Theory of Mind Might Have Spontaneously Emerged in Large Language Models preprint January 2023
Longformer: The Long-Document Transformer preprint January 2020
HellaSwag: Can a Machine Really Finish Your Sentence? preprint January 2019
Learning Transferable Visual Models From Natural Language Supervision preprint January 2021
Learning to Generate Novel Scientific Directions with Contextualized Literature-based Discovery preprint January 2023
Non-native three-dimensional block copolymer morphologies journal December 2016
Attention Is All You Need preprint January 2017
Progress measures for grokking via mechanistic interpretability preprint January 2023
Measuring Massive Multitask Language Understanding preprint January 2020
Large Language Models as Tool Makers preprint January 2023
Let's Verify Step by Step preprint January 2023
Voyager: An Open-Ended Embodied Agent with Large Language Models preprint January 2023
Arbitrary lattice symmetries via block copolymer nanomeshes journal June 2015
Layout-aware text extraction from full-text PDF of scientific articles journal May 2012
Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling preprint January 2023
Matplotlib: A 2D Graphics Environment journal January 2007
A survey of machine learning for big data processing journal May 2016
The Creativity of Text-to-Image Generation conference November 2022
Photomechanical Effects in Azo-Polymers Studied by Neutron Reflectometry journal December 2006
PAL: Program-aided Language Models preprint January 2022
Python for Scientific Computing journal January 2007
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models preprint January 2023
Selective directed self-assembly of coexisting morphologies using block copolymer blends journal August 2016
Autonomous x-ray scattering journal May 2023