

Poster in Workshop: Tiny Titans: The Next Wave of On-Device Learning for Foundation Models (TTODLer-FM)

Capability Transfer from Large to Small Models with Synthetically-Generated Data

Lillian Sun · Emma Yang · Arif Dayi

[ Project Page ]
Fri 18 Jul 1 p.m. PDT — 1:45 p.m. PDT

Abstract:

We investigate the transfer of capabilities from large language models to smaller models using synthetic, LLM-generated data. Instead of relying on human-annotated data, we explore whether a large model can effectively "teach" a smaller model natural language capabilities such as summarization and question-answering through generated synthetic data. The large model acts as a teacher, generating both the training data and the evaluation metrics, while a smaller student model learns exclusively from this synthetic data. We empirically study two key tasks, summarization and question-answering, and aim to demonstrate the feasibility of a fully synthetic-data-driven pipeline for capability transfer. Our experiments show promising results on both tasks: up to a 56% performance improvement in summarization and at least on-par performance in question-answering on the synthetic capability metric. Our study highlights the potential of synthetic data as a scalable and cost-effective alternative to human annotation, paving the way for more efficient training of smaller models without sacrificing performance.
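The abstract describes a teacher-generates, student-learns pipeline. Below is a minimal sketch of what such a pipeline might look like for the summarization task, assuming the Hugging Face transformers library; the model names, prompt wording, and hyperparameters are illustrative assumptions, not the authors' actual setup.

```python
# Sketch of a synthetic-data capability-transfer pipeline (summarization).
# Teacher and student model names below are placeholders, not the paper's choices.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

teacher_name = "meta-llama/Llama-2-13b-chat-hf"  # hypothetical large teacher
student_name = "gpt2"                            # hypothetical small student

# 1) Teacher generates synthetic (document, summary) training pairs.
teacher_tok = AutoTokenizer.from_pretrained(teacher_name)
teacher = AutoModelForCausalLM.from_pretrained(teacher_name, device_map="auto")

def teacher_summarize(document: str) -> str:
    prompt = f"Summarize the following article in 2-3 sentences.\n\n{document}\n\nSummary:"
    inputs = teacher_tok(prompt, return_tensors="pt").to(teacher.device)
    out = teacher.generate(**inputs, max_new_tokens=128, do_sample=False)
    # Decode only the newly generated tokens, not the prompt.
    return teacher_tok.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)

documents = load_unlabeled_documents()  # hypothetical helper: unlabeled source documents
synthetic_pairs = [(doc, teacher_summarize(doc)) for doc in documents]

# 2) Student learns exclusively from the synthetic pairs
#    (a plain causal-LM fine-tuning loop, one example per step for brevity).
student_tok = AutoTokenizer.from_pretrained(student_name)
student = AutoModelForCausalLM.from_pretrained(student_name)
optimizer = torch.optim.AdamW(student.parameters(), lr=5e-5)

student.train()
for doc, summary in synthetic_pairs:
    text = f"Summarize: {doc}\nSummary: {summary}{student_tok.eos_token}"
    batch = student_tok(text, return_tensors="pt", truncation=True, max_length=1024)
    loss = student(**batch, labels=batch["input_ids"]).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```

In the setup the abstract describes, the teacher would also supply the evaluation metric for the synthetic capability; that judging step is omitted from the sketch above.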
