Social
Agents and Safety
Ekaterina Artemova · Alexander Borodetskiy · Ksenia Peresvetova · Elizaveta Yoshida
West Ballroom D
This social brings together AI practitioners focused on agent development and AI safety to address the unique risks these agents pose, such as misuse, unintended actions, and adversarial attacks, which traditional security models often fail to mitigate. The event will explore both development-phase safeguards and post-deployment evaluation strategies, including red teaming, automated testing, monitoring, and human-in-the-loop assessments. In the first part, expert speakers will share real-world cases and technical insights into current safety challenges and solutions. In the second part, attendees will engage in open discussions to exchange ideas and propose new directions for ensuring that increasingly autonomous agents remain safe, reliable, and aligned with human values. The goal is to foster collaboration and innovation toward building trustworthy AI systems.
Schedule
Wed 7:00 p.m. - 7:15 p.m.
|
Opening remarks
|
Alexander Borodetskiy · Ilya Kocik 🔗 |
Wed 7:15 p.m. - 7:30 p.m.
|
Red teaming AI agents
(
Talk
)
>
|
Renaud de la Gueronniere 🔗 |
Wed 7:45 p.m. - 8:30 p.m.
|
AI Agents Safety Panel
(
Discussion panel
)
>
|
Sergei Tilga · Jing-Jing Li · Saurabh Jha 🔗 |
Wed 8:30 p.m. - 9:00 p.m.
|
Closing remarks & networking
(
Networking time
)
>
|
Alexander Borodetskiy · Ilya Kocik 🔗 |