Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Assessing the nature of large language models: A caution against anthropocentrism

Technical Report ·
DOI:https://doi.org/10.2172/2430355· OSTI ID:2430355
 [1]
  1. Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)

Generative AI models garnered a large amount of public attention and speculation with the release of OpenAI’s chatbot, ChatGPT in November of 2022. At least two opinion camps exist – one that is excited about the possibilities these models offer for fundamental changes to human tasks, and another that is highly concerned about the power these models seem to have – especially since the release of GPT-4, which was trained on multimodal data and has ~1.7 trillion (T) parameters. We evaluated some concerns regarding these models’ power by assessing GPT 3.5 using standard, normed, and validated cognitive and personality measures. These measures come from the tradition of psychometrics in experimental psychology and have a long history of providing valuable insights and predictive distinctions in humans. For this seedling project, we developed a battery of tests that allowed us to estimate the boundaries of some of these models’ capabilities, how stable those capabilities are over a short period of time, and how they compare to humans.

Research Organization:
Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA); USDOE Laboratory Directed Research and Development (LDRD) Program
DOE Contract Number:
NA0003525
OSTI ID:
2430355
Report Number(s):
SAND--2023-09372R
Country of Publication:
United States
Language:
English

Similar Records

Assessing the nature of large language models: A caution against anthropocentrism.
Technical Report · Mon Jul 01 00:00:00 EDT 2024 · OSTI ID:2429946

Large language model evaluation for high–performance computing software development
Journal Article · Wed Sep 04 00:00:00 EDT 2024 · Concurrency and Computation. Practice and Experience · OSTI ID:2474767

AI in Science Communication
Journal Article · Tue Aug 13 00:00:00 EDT 2024 · TBD · OSTI ID:2432387

Related Subjects