Mitigative Strategies for Recovering From Large Language Model Trust Violations
Journal Article
·
· Journal of Cognitive Engineering and Decision Making
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
In this study, we investigated strategies to address trust issues arising from errors in large language models (LLMs). The study examined the impact of confidence scores, system capability explanations, and user feedback on trust restoration post-error. 68 participants viewed the responses of an LLM to 20 general trivia questions, with an error introduced on the third trial. Each participant was presented with one mitigation strategy. Participants rated their overall trust in the model and the reliability of the answer. Results showed an immediate drop in trust after the error; however, there were no differences across the three strategies in trust recovery. Further, all conditions had a logarithmic trend in trust recovery following error. Differences in overall trust were predicted by perceived reliability of the answer, suggesting that participants were evaluating results critically and using that to inform their trust in the model. Qualitative data supported this finding; participants expressed lasting distrust despite the LLM’s later accuracy. Results showcase the need to prioritize accuracy in LLM deployment, because early errors may irrevocably damage user trust calibration and later adoption.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE; USDOE Laboratory Directed Research and Development (LDRD) Program; USDOE Office of Science (SC), Office of Workforce Development for Teachers & Scientists (WDTS)
- Grant/Contract Number:
- AC05-76RL01830
- OSTI ID:
- 2479660
- Alternate ID(s):
- OSTI ID: 2560812
- Report Number(s):
- PNNL-SA--193711
- Journal Information:
- Journal of Cognitive Engineering and Decision Making, Journal Name: Journal of Cognitive Engineering and Decision Making Journal Issue: 1 Vol. 19; ISSN 1555-3434
- Publisher:
- Sage PublicationsCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Role of Uncertainty Quantification in the Explainability of Large Language Models for the Nuclear Industry
Trust and Public Participation in Risk Policy Issues
Conference
·
Tue Jun 17 20:00:00 EDT 2025
·
OSTI ID:3028731
Trust and Public Participation in Risk Policy Issues
Conference
·
Tue Nov 30 23:00:00 EST 1999
·
OSTI ID:15004140