Spotlight Poster
High-Dimensional Prediction for Sequential Decision Making
Georgy Noarov · Ramya Ramalingam · Aaron Roth · Stephan Xie
West Exhibition Hall B2-B3 #W-918
Wed 16 Jul 3:30 p.m. PDT — 4:30 p.m. PDT
Consider one or more agents who must sequentially take actions, playing in a nonstationary (possibly adversarially changing) environment; each action incurs some reward in each round of the interaction. (For instance, imagine the online routing game in which people living in the same city each want to get from home to work as fast as possible each morning.) There exist a variety of online learning algorithms which can directly optimize various performance benchmarks for any one agent, e.g. guaranteeing that the agent's taken actions will give them cumulative reward at least as high as if they always played the best fixed action. However, most such algorithms have one or more of these undesirable traits: (1) don't offer simultaneous guarantees to multiple agents at once, (2) are very inefficient when the action space is very large (e.g. the number of home-work routes is exponentially sized), and (3) cannot give these performance guarantees conditionally on various relevant events (i.e. they don't offer guarantees specifically on days when it rains, or on national holidays, or on days when an agent uses a particular road, etc). We develop an algorithmic framework that can solve all these issues in a large variety of sequential settings such as the one above! It lets us issue a single coordinated vector-forecast each day that is appropriately unbiased/calibrated such that all agents --- if they trust and use our forecast --- will all get strong performance guarantees.