Skip to yearly menu bar Skip to main content


Invited Talk
in
Workshop: Methods and Opportunities at Small Scale (MOSS)

The Art of Artificial Reasoning for Small Language Models

Yejin Choi

[ ]
Sat 19 Jul 2:15 p.m. PDT — 3 p.m. PDT

Abstract:

Large reasoning models such as Deepseek's R1 and OpenAI's O1/O3 have demonstrated the power of reinforcement learning to enable a new axis of scaling — test-time compute. This has catalyzed intensive research across the open-source community, generating rapid progress but also seemingly contradictory results. In this talk, I will present critical insights into the conditions under which reinforcement learning thrives or struggles, and how we can induce stronger reasoning capabilities from small language models, closing the gap against the larger counterparts in specific domains.

Chat is not available.