Poster
Improved Online Confidence Bounds for Multinomial Logistic Bandits
Joongkyu Lee · Min-hwan Oh
West Exhibition Hall B2-B3 #W-917
Many AI systems repeatedly face the challenge of choosing from multiple options — such as which product to recommend or which ad to show — while learning from user feedback to improve over time. This paper tackles that challenge and proposes a new algorithm that achieves state-of-the-art performance in both learning efficiency and computational speed.Our method enables AI systems to make smarter, more confident decisions even under uncertainty. It introduces a more effective way to gauge how reliable the system’s knowledge is, while avoiding the heavy computations that previous methods require.As a result, our algorithm is faster, more scalable, and easier to deploy in real-world applications. It helps AI systems learn quickly, use fewer resources, and make better choices — enabling more responsive and intelligent tools for recommendation, search, and personalized services.