Assessing the nature of large language models: A caution against anthropocentrism

Speed, Ann

doi:10.2172/2430355

Technical Report · September 1, 2023

DOI: https://doi.org/10.2172/2430355 · OSTI ID: 2430355

Speed, Ann ^[1]

Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)

Generative AI models garnered a large amount of public attention and speculation with the release of OpenAI’s chatbot, ChatGPT, in November 2022. At least two camps of opinion exist: one that is excited about the possibilities these models offer for fundamental changes to human tasks, and another that is highly concerned about the power these models seem to have, especially since the release of GPT-4, which was trained on multimodal data and has ~1.7 trillion (T) parameters. We evaluated some concerns regarding these models’ power by assessing GPT-3.5 using standard, normed, and validated cognitive and personality measures. These measures come from the tradition of psychometrics in experimental psychology and have a long history of providing valuable insights and predictive distinctions in humans. For this seedling project, we developed a battery of tests that allowed us to estimate the boundaries of some of these models’ capabilities, how stable those capabilities are over a short period of time, and how they compare to humans.

Research Organization: Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)

Sponsoring Organization: USDOE National Nuclear Security Administration (NNSA); USDOE Laboratory Directed Research and Development (LDRD) Program

DOE Contract Number: NA0003525

OSTI ID: 2430355

Report Number(s): SAND--2023-09372R

Country of Publication: United States

Language: English

Similar Records

Assessing the nature of large language models: A caution against anthropocentrism.

Technical Report · July 1, 2024 · OSTI ID: 2429946

Large language model evaluation for high-performance computing software development

Journal Article · September 4, 2024 · Concurrency and Computation. Practice and Experience · OSTI ID: 2474767

AI in Science Communication

Journal Article · August 13, 2024 · TBD · OSTI ID: 2432387

Related Subjects

97 MATHEMATICS AND COMPUTING
