ICML Kaleidoscopic Teaming in Multi Agent Simulations

Poster
in
Workshop: Multi-Agent Systems in the Era of Foundation Models: Opportunities, Challenges and Futures

Kaleidoscopic Teaming in Multi Agent Simulations

Ninareh Mehrabi · Tharindu Kumarage · Kai-Wei Chang · Aram Galstyan · Rahul Gupta

[ Abstract ] [ Project Page ]

[ OpenReview]

Abstract:

Warning: This paper contains content that may be inappropriate or offensive.AI agents have gained significant recent attention due to their autonomous tool usage capabilities and their integration in various real-world applications. This autonomy poses novel challenges for the safety of such systems, both in single- and multi-agent scenarios. We argue that existing red teaming or safety evaluation frameworks fall short in evaluating safety risks in complex behaviors, thought processes and actions taken by agents. Moreover, they fail to consider risks in multi-agent setups where various vulnerabilities can be exposed when agents engage in complex behaviors and interactions with each other. To address this shortcoming, we introduce the term kaleidoscopic teaming which seeks to capture complex and wide range of vulnerabilities that can happen in agents both in single-agent and multi-agent scenarios. Our goal is to capture vulnerabilities in thoughts, actions, final responses, and interactions of agents with other agents. By doing so, we aim to make safety analysis more realistic and comprehensive specially for agentic use-cases. We also present a new kaleidoscopic teaming framework that generates a diverse array of scenarios modeling real-world human societies. Our framework evaluates safety of agents in both single-agent and multi-agent setups. In single-agent setup, an agent is given a scenario that it needs to complete using the tools it has access to. In multi-agent setup, multiple agents either compete against or cooperate together to complete a task in the scenario through which we capture existing safety vulnerabilities in agents. We introduce new in-context optimization techniques that can be used in our kaleidoscopic teaming framework to generate better scenarios for safety analysis. Lastly, we present appropriate metrics that can be used along with our framework to measure safety of agents. Utilizing our kaleidoscopic teaming framework, we identify opportunities in various models to improve their safety in agentic use-cases.

Chat is not available.

Poster in Workshop: Multi-Agent Systems in the Era of Foundation Models: Opportunities, Challenges and Futures

Kaleidoscopic Teaming in Multi Agent Simulations

Ninareh Mehrabi · Tharindu Kumarage · Kai-Wei Chang · Aram Galstyan · Rahul Gupta

Poster
in
Workshop: Multi-Agent Systems in the Era of Foundation Models: Opportunities, Challenges and Futures