Poster
Improving Transformer World Models for Data-Efficient RL
Antoine Dedieu · Joseph Ortiz · Xinghua Lou · Carter Wendelken · J Swaroop Guntupalli · Wolfgang Lehrach · Miguel Lazaro-Gredilla · Kevin Murphy
West Exhibition Hall B2-B3 #W-714
Learning how to rapidly learn new skills from limited data is critical for building general AI systems. As a step towards this goal, we propose a new method for training AI agents for playing 2D video games, including Craftax-classic (a version of Crafter) and Minitar (a simplified version of Atari). Our method is the first to beat human performance on Craftax-classic using limited training data. We achieve this by adding three key enhancements to existing reinforcement learning methods: a better way to preprocess images, a new technique to accurately predict the next frame of the game (as part of the agent's world model), and an improved method for learning the agent strategy from both real data (from the environment) combined with "imaginary data" (generated by the agent's model).