Poster
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel · Luca Viano · Volkan Cevher
West Exhibition Hall B2-B3 #W-704
Imitation learning is a way for artificial intelligence (AI) systems to learn new skills by watching and copying expert behavior—much like how a child learns by observing adults. However, making AI learn efficiently from demonstrations can be challenging, especially in complex environments.This paper introduces a new framework called SOAR (Soft Optimistic Actor cRitic) to improve how AI learns from experts. The key idea behind SOAR is to help the AI not just copy what it sees, but also to explore actions it’s less sure about, guided by a sense of “optimism” about what might work well. This is achieved by using multiple “critics” (advisors) within the AI that estimate how good different actions might be, and then encouraging the AI to try actions where these critics are most optimistic.The authors show that SOAR can be used as a flexible template, improving several popular imitation learning algorithms. In practical tests with simulated robots (using the MuJoCo environment), SOAR helped these algorithms learn faster and more efficiently—cutting the amount of training needed by half to reach the same level of performance.In summary, SOAR is a promising step towards making AI systems better at learning from demonstrations, allowing them to master new tasks more quickly and with less data.