Spotlight Poster
Monte-Carlo Tree Search with Uncertainty Propagation via Optimal Transport
Tuan Dam · Pascal Stenger · Lukas Schneider · Joni Pajarinen · Carlo D'Eramo · Odalric-Ambrym Maillard
West Exhibition Hall B2-B3 #W-718
Imagine you're playing a complex game where the rules keep changing randomly, and you can't see all the information you need to make good decisions. Traditional computer algorithms that plan moves ahead (like those used in chess or Go) struggle in these uncertain situations because they assume the world is predictable and all information is available. Our research introduces a new planning algorithm called "Wasserstein MCTS" that's specifically designed to handle uncertainty and randomness. Instead of just tracking single "best guess" values for each possible move, our algorithm keeps track of entire probability distributions - essentially maintaining a range of possible outcomes and their likelihoods. The key innovation is how we combine information from different possible moves. We use a mathematical technique called "Wasserstein barycenters" (think of it as a sophisticated way of averaging probability distributions) that allows the algorithm to properly account for uncertainty when deciding which moves to explore. This is like having a chess player who not only considers the most likely outcome of each move, but also weighs the full range of possible results and their uncertainties. We tested our algorithm on various challenging scenarios - from navigating slippery frozen lakes where movements are unpredictable, to complex maze-like environments where important information is hidden. In these tests, our approach consistently outperformed existing methods, often by substantial margins (20-80% improvement in many cases). This research has practical implications for real-world applications like robot navigation in unpredictable environments, autonomous vehicle planning in uncertain traffic conditions, and resource management where outcomes depend on many random factors. By better handling uncertainty, our algorithm makes more robust decisions in situations where traditional planning methods might fail.