Poster
Scaling Large Motion Models with Million-Level Human Motions
Ye Wang · Sipeng Zheng · Bin Cao · Qianshan Wei · Weishuai Zeng · Qin Jin · Zongqing Lu
West Exhibition Hall B2-B3 #W-203
Inspired by advanced large language models, we aim to create large models that can truly understand and generate diverse human motions. However, progress is hindered by a lack of massive, high-quality motion data. To address this, we introduce MotionLib—a dataset with over a million human motions, much larger than existing ones, where each motion has detailed descriptions. We also develop MotionBook, a new method that compactly and accurately represents motions, preserving fine-grained details, to efficiently help models learn these motions. Using these, we train Being-M0, which shows strong performance across many activities, including those it has not seen before. Our work highlights that larger datasets and models are key to improving motion generation, paving the way for developing more versatile large models in fields like gaming and robotics.