Poster
Feasible Action Search for Bandit Linear Programs via Thompson Sampling
Aditya Gangrade · Aldo Pacchiano · Clay Scott · Venkatesh Saligrama
West Exhibition Hall B2-B3 #W-919
Practical engineering and scientific disciplines often need to find processes that satisfy a number of objectives that lie in tension with one-another. Our work describes a new method, FAST, that allows efficient and intelligent trial-and-error to quickly find a process that achieves a nearly 'best-possible,' i.e., minimax, balance between such objectives. FAST observes the results of previous experiments to select which process to try next, i.e., it is a "sequential experimental design". While there were some prior methods that could be used to solve this task, these methods were computationally very slow, and would have taken days or years of computation to pick the right processes to try. Our method, which is based on a technique called Thompson Sampling (TS), instead reduces this time to seconds, while using nearly the fewest-possible number of experiments (in a certain technical sense). Our basic technical contribution is to allow TS-like techniques to work with many objectives, while prior understanding of TS dealt with only one objective. For this, we both developed a new algorithmic design in the form a "coupled noise", and developed new wys to mathematically analyse TS with many objectives. Since the method is general, it may serve to help practitioners in diverse fields such as manufacturing, control, and resource-allocation to quickly discover good processes that balance the many needs they must address.