ICML Social Agents and Safety

Social

Agents and Safety

Ekaterina Artemova · Alexander Borodetskiy · Ksenia Peresvetova · Elizaveta Yoshida

West Ballroom D

[ Abstract ]

Wed 16 Jul 7 p.m. PDT — 9 p.m. PDT

Abstract:

This social brings together AI practitioners focused on agent development and AI safety to address the unique risks these agents pose, such as misuse, unintended actions, and adversarial attacks, which traditional security models often fail to mitigate. The event will explore both development-phase safeguards and post-deployment evaluation strategies, including red teaming, automated testing, monitoring, and human-in-the-loop assessments. In the first part, expert speakers will share real-world cases and technical insights into current safety challenges and solutions. In the second part, attendees will engage in open discussions to exchange ideas and propose new directions for ensuring that increasingly autonomous agents remain safe, reliable, and aligned with human values. The goal is to foster collaboration and innovation toward building trustworthy AI systems.

Chat is not available.

Schedule

Wed 7:00 p.m. - 7:15 p.m.	Opening remarks	Alexander Borodetskiy · Ilya Kocik 🔗
Wed 7:15 p.m. - 7:30 p.m.	Red teaming AI agents ( Talk ) >	Renaud de la Gueronniere 🔗
Wed 7:45 p.m. - 8:30 p.m.	AI Agents Safety Panel ( Discussion panel ) >	Sergei Tilga · Jing-Jing Li · Saurabh Jha 🔗
Wed 8:30 p.m. - 9:00 p.m.	Closing remarks & networking ( Networking time ) >	Alexander Borodetskiy · Ilya Kocik 🔗