Skip to yearly menu bar Skip to main content


Social

Agents and Safety

Ekaterina Artemova · Alexander Borodetskiy · Ksenia Peresvetova · Elizaveta Yoshida

West Ballroom D
[ ]
Wed 16 Jul 7 p.m. PDT — 9 p.m. PDT

Abstract:

This social brings together AI practitioners focused on agent development and AI safety to address the unique risks these agents pose, such as misuse, unintended actions, and adversarial attacks, which traditional security models often fail to mitigate. The event will explore both development-phase safeguards and post-deployment evaluation strategies, including red teaming, automated testing, monitoring, and human-in-the-loop assessments. In the first part, expert speakers will share real-world cases and technical insights into current safety challenges and solutions. In the second part, attendees will engage in open discussions to exchange ideas and propose new directions for ensuring that increasingly autonomous agents remain safe, reliable, and aligned with human values. The goal is to foster collaboration and innovation toward building trustworthy AI systems.

Chat is not available.
Schedule