Poster
Enhancing Diversity In Parallel Agents: A Maximum State Entropy Exploration Story
Vincenzo De Paola · Riccardo Zamboni · Mirco Mutti · Marcello Restelli
West Exhibition Hall B2-B3 #W-601
Modern reinforcement learning systems often speed up learning by running many identical agents in parallel environments, collecting data much faster than a single agent could. But this raises an important question: could we go even further by letting these agents specialize, instead of making them all act the same?In our work, we propose a new approach where each agent explores the environment in its own unique way. We designed a method that encourages agents to behave differently, increasing the variety of the data they collect. This reduces redundancy and makes the overall dataset more informative. We also use a centralized learning technique to manage this coordination efficiently.Our results show that this method improves learning speed and quality compared to traditional identical-agent systems. It also works well with other RL strategies that benefit from diverse data. We back our findings with theoretical analysis, suggesting that smarter specialization in parallel RL could significantly advance real-world AI applications.