SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems

2402.03741

Published 6/27/2024 by Oubo Ma, Yuwen Pu, Linkang Du, Yang Dai, Ruo Wang, Xiaolei Liu, Yingcai Wu, Shouling Ji

🏅

Abstract

Recent advancements in multi-agent reinforcement learning (MARL) have opened up vast application prospects, such as swarm control of drones, collaborative manipulation by robotic arms, and multi-target encirclement. However, potential security threats during the MARL deployment need more attention and thorough investigation. Recent research reveals that attackers can rapidly exploit the victim's vulnerabilities, generating adversarial policies that result in the failure of specific tasks. For instance, reducing the winning rate of a superhuman-level Go AI to around 20%. Existing studies predominantly focus on two-player competitive environments, assuming attackers possess complete global state observation. In this study, we unveil, for the first time, the capability of attackers to generate adversarial policies even when restricted to partial observations of the victims in multi-agent competitive environments. Specifically, we propose a novel black-box attack (SUB-PLAY) that incorporates the concept of constructing multiple subgames to mitigate the impact of partial observability and suggests sharing transitions among subpolicies to improve attackers' exploitative ability. Extensive evaluations demonstrate the effectiveness of SUB-PLAY under three typical partial observability limitations. Visualization results indicate that adversarial policies induce significantly different activations of the victims' policy networks. Furthermore, we evaluate three potential defenses aimed at exploring ways to mitigate security threats posed by adversarial policies, providing constructive recommendations for deploying MARL in competitive environments.

Create account to get full access

Overview

Recent advancements in multi-agent reinforcement learning (MARL) have opened up vast application prospects, such as swarm control of drones, collaborative manipulation by robotic arms, and multi-target encirclement.
However, potential security threats during MARL deployment need more attention and thorough investigation.
Recent research reveals that attackers can rapidly exploit the victim's vulnerabilities, generating adversarial policies that result in the failure of specific tasks, such as reducing the winning rate of a superhuman-level Go AI to around 20%.
Existing studies predominantly focus on two-player competitive environments, assuming attackers possess complete global state observation.

Plain English Explanation

Multi-agent reinforcement learning (MARL) is a powerful technique that enables multiple autonomous agents to work together to solve complex problems. This technology has a wide range of practical applications, from controlling swarms of drones to coordinating robotic arms to surround multiple targets.

However, the researchers behind this study have found that MARL systems can be vulnerable to attacks. Adversaries can exploit weaknesses in the MARL system to generate policies that undermine the performance of the agents, even in situations where the attackers have limited information about the system. For example, they were able to reduce the winning rate of a top-performing Go AI from superhuman levels down to just 20%.

Previous research on this topic has focused on scenarios with two players and where the attackers have full access to all the information about the system. This new study takes a closer look at what happens when the attackers have only partial information about the MARL system they're trying to undermine.

Technical Explanation

This study proposes a novel black-box attack called SUB-PLAY that allows attackers to generate adversarial policies even when they have limited information about the MARL system they're targeting. The key idea behind SUB-PLAY is to construct multiple "subgames" that the attacker can use to mitigate the impact of partial observability and improve their ability to exploit vulnerabilities in the victim's policies.

The researchers extensively evaluated the effectiveness of SUB-PLAY under three typical partial observability limitations. Their visualization results indicate that the adversarial policies generated by SUB-PLAY induce significantly different activations in the victim's policy networks, suggesting that the attack is disrupting the normal functioning of the MARL system.

Furthermore, the study evaluates three potential defenses aimed at mitigating the security threats posed by adversarial policies, providing constructive recommendations for deploying MARL in competitive environments.

Critical Analysis

The researchers have done a thorough job of exploring the security implications of MARL systems, particularly in scenarios where the attackers have limited information about the system. Their novel attack approach and extensive evaluations provide valuable insights into the vulnerabilities of MARL and the potential threats that need to be addressed.

However, the paper does not delve into the long-term robustness of the proposed defenses or the broader implications of these security threats for the real-world deployment of MARL systems. Additionally, the researchers could have explored the efficiency and scalability of their attack and defense approaches, as these factors will be crucial for practical applications.

Conclusion

This study highlights the pressing need to address security vulnerabilities in multi-agent reinforcement learning systems, particularly in competitive environments where adversaries may have limited information about the system. The researchers have proposed a novel attack approach and evaluated potential defenses, providing valuable insights for the development and deployment of robust MARL systems.

As MARL continues to advance and find new applications, it will be crucial for researchers and developers to prioritize security and work towards building more resilient multi-agent systems that can withstand malicious attacks. This study serves as an important step in that direction, paving the way for further research and innovations in this rapidly evolving field.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🏅

What is the Solution for State-Adversarial Multi-Agent Reinforcement Learning?

Songyang Han, Sanbao Su, Sihong He, Shuo Han, Haizhao Yang, Shaofeng Zou, Fei Miao

Various methods for Multi-Agent Reinforcement Learning (MARL) have been developed with the assumption that agents' policies are based on accurate state information. However, policies learned through Deep Reinforcement Learning (DRL) are susceptible to adversarial state perturbation attacks. In this work, we propose a State-Adversarial Markov Game (SAMG) and make the first attempt to investigate different solution concepts of MARL under state uncertainties. Our analysis shows that the commonly used solution concepts of optimal agent policy and robust Nash equilibrium do not always exist in SAMGs. To circumvent this difficulty, we consider a new solution concept called robust agent policy, where agents aim to maximize the worst-case expected state value. We prove the existence of robust agent policy for finite state and finite action SAMGs. Additionally, we propose a Robust Multi-Agent Adversarial Actor-Critic (RMA3C) algorithm to learn robust policies for MARL agents under state uncertainties. Our experiments demonstrate that our algorithm outperforms existing methods when faced with state perturbations and greatly improves the robustness of MARL policies. Our code is public on https://songyanghan.github.io/what_is_solution/.

4/15/2024

cs.AI cs.GT cs.MA

Behavior-Targeted Attack on Reinforcement Learning with Limited Access to Victim's Policy

Shojiro Yamabe, Kazuto Fukuchi, Ryoma Senda, Jun Sakuma

This study considers the attack on reinforcement learning agents where the adversary aims to control the victim's behavior as specified by the adversary by adding adversarial modifications to the victim's state observation. While some attack methods reported success in manipulating the victim agent's behavior, these methods often rely on environment-specific heuristics. In addition, all existing attack methods require white-box access to the victim's policy. In this study, we propose a novel method for manipulating the victim agent in the black-box (i.e., the adversary is allowed to observe the victim's state and action only) and no-box (i.e., the adversary is allowed to observe the victim's state only) setting without requiring environment-specific heuristics. Our attack method is formulated as a bi-level optimization problem that is reduced to a distribution matching problem and can be solved by an existing imitation learning algorithm in the black-box and no-box settings. Empirical evaluations on several reinforcement learning benchmarks show that our proposed method has superior attack performance to baselines.

6/7/2024

cs.LG cs.AI

Optimal Attack and Defense for Reinforcement Learning

Jeremy McMahan, Young Wu, Xiaojin Zhu, Qiaomin Xie

To ensure the usefulness of Reinforcement Learning (RL) in real systems, it is crucial to ensure they are robust to noise and adversarial attacks. In adversarial RL, an external attacker has the power to manipulate the victim agent's interaction with the environment. We study the full class of online manipulation attacks, which include (i) state attacks, (ii) observation attacks (which are a generalization of perceived-state attacks), (iii) action attacks, and (iv) reward attacks. We show the attacker's problem of designing a stealthy attack that maximizes its own expected reward, which often corresponds to minimizing the victim's value, is captured by a Markov Decision Process (MDP) that we call a meta-MDP since it is not the true environment but a higher level environment induced by the attacked interaction. We show that the attacker can derive optimal attacks by planning in polynomial time or learning with polynomial sample complexity using standard RL techniques. We argue that the optimal defense policy for the victim can be computed as the solution to a stochastic Stackelberg game, which can be further simplified into a partially-observable turn-based stochastic game (POTBSG). Neither the attacker nor the victim would benefit from deviating from their respective optimal policies, thus such solutions are truly robust. Although the defense problem is NP-hard, we show that optimal Markovian defenses can be computed (learned) in polynomial time (sample complexity) in many scenarios.

6/18/2024

cs.LG cs.CR cs.GT

🏅

Toward Evaluating Robustness of Reinforcement Learning with Adversarial Policy

Xiang Zheng, Xingjun Ma, Shengjie Wang, Xinyu Wang, Chao Shen, Cong Wang

Reinforcement learning agents are susceptible to evasion attacks during deployment. In single-agent environments, these attacks can occur through imperceptible perturbations injected into the inputs of the victim policy network. In multi-agent environments, an attacker can manipulate an adversarial opponent to influence the victim policy's observations indirectly. While adversarial policies offer a promising technique to craft such attacks, current methods are either sample-inefficient due to poor exploration strategies or require extra surrogate model training under the black-box assumption. To address these challenges, in this paper, we propose Intrinsically Motivated Adversarial Policy (IMAP) for efficient black-box adversarial policy learning in both single- and multi-agent environments. We formulate four types of adversarial intrinsic regularizers -- maximizing the adversarial state coverage, policy coverage, risk, or divergence -- to discover potential vulnerabilities of the victim policy in a principled way. We also present a novel bias-reduction method to balance the extrinsic objective and the adversarial intrinsic regularizers adaptively. Our experiments validate the effectiveness of the four types of adversarial intrinsic regularizers and the bias-reduction method in enhancing black-box adversarial policy learning across a variety of environments. Our IMAP successfully evades two types of defense methods, adversarial training and robust regularizer, decreasing the performance of the state-of-the-art robust WocaR-PPO agents by 34%-54% across four single-agent tasks. IMAP also achieves a state-of-the-art attacking success rate of 83.91% in the multi-agent game YouShallNotPass. Our code is available at url{https://github.com/x-zheng16/IMAP}.

4/29/2024

cs.LG