Skip to yearly menu bar Skip to main content


Oral Poster

Position: Principles of Animal Cognition to Improve LLM Evaluations

Sunayana Rane · Cyrus Kirkman · Graham Todd · Amanda Royka · Ryan Law · Erica Cartmill · Jacob Foster

East Exhibition Hall A-B #E-502
[ ] [ ]
Wed 16 Jul 4:30 p.m. PDT — 7 p.m. PDT
 
Oral presentation: Oral 4B Positions: Generative AI Evaluation
Wed 16 Jul 3:30 p.m. PDT — 4:30 p.m. PDT

Abstract:

It has become increasingly challenging to understand and evaluate LLM capabilities as these models exhibit a broader range of behaviors. In this position paper, we argue that LLM researchers should draw on the lessons from another field which has developed a rich set of experimental paradigms and design practices for probing the behavior of complex intelligent systems: animal cognition. We present five core principles of evaluation drawn from animal cognition research, and explain how they provide invaluable guidance for understanding LLM capabilities and behavior. We ground these principles in an empirical case study, and show how they can already provide a richer picture of one particular reasoning capability: transitive inference.

Lay Summary:

As LLM exhibit a broader range of behaviors, we may often wonder what they truly understand and what they don't. In this position paper, we argue that LLM researchers should draw on the lessons from another field which has developed a rich set of lessons for probing a wide variety of intelligent behavior: animal cognition. We present five core principles of evaluation drawn from animal cognition research, and explain how they provide invaluable guidance for understanding LLM capabilities and behavior. We ground these principles in an empirical case study, and show how they can already provide a richer picture of one particular reasoning capability: transitive inference.

Chat is not available.