Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning

Read original: arXiv:2406.03978 - Published 6/18/2024 by Lin Liu, Jian Zhao, Cheng Hu, Zhengtao Cao, Youpeng Zhao, Zhenbin Ye, Meng Meng, Wenjun Wang, Zhaofeng He, Houqiang Li and 2 others
Total Score

0

Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

  • This paper introduces "Mini Honor of Kings," a lightweight environment for multi-agent reinforcement learning research.
  • The environment is designed to simulate the gameplay of the popular mobile game "Honor of Kings," allowing researchers to study complex multi-agent interactions in a controlled setting.
  • The authors demonstrate the usefulness of Mini Honor of Kings by evaluating several multi-agent reinforcement learning algorithms on the environment.

Plain English Explanation

The paper presents a new research environment called "Mini Honor of Kings" that is designed to help scientists study how multiple artificial intelligence agents can work together to accomplish tasks. The environment is based on the popular mobile game "Honor of Kings," which involves teams of players competing against each other.

By creating a simplified version of this game, the researchers have developed a testbed where they can experiment with different AI algorithms and approaches to multi-agent coordination and competition. This allows them to explore complex interactions between intelligent agents in a controlled setting, which is difficult to do in the real world.

The authors show that Mini Honor of Kings is a valuable tool for evaluating various multi-agent reinforcement learning algorithms - techniques that allow AIs to learn how to accomplish goals through trial and error. They demonstrate the usefulness of the environment by testing several of these algorithms and analyzing their performance.

Overall, Mini Honor of Kings provides researchers with a lightweight, flexible platform to advance the state-of-the-art in multi-agent AI systems, which have important applications in areas like robotics, automated transportation, and video game design.

Technical Explanation

The paper introduces "Mini Honor of Kings," a new environment designed for multi-agent reinforcement learning research. The environment is inspired by the popular mobile game "Honor of Kings," which features two teams of players competing against each other.

The authors have created a simplified version of this game that captures the core game mechanics and multi-agent interactions, while being lightweight and easily configurable for research purposes. Mini Honor of Kings includes features like a grid-based game map, hero units with unique abilities, and team-based objectives.

To demonstrate the utility of this environment, the researchers evaluate the performance of several multi-agent reinforcement learning algorithms, including MAVEN, SMAC, and MAEXP. The results show that Mini Honor of Kings can effectively differentiate the capabilities of these algorithms and serve as a useful benchmark for competitive multi-agent reinforcement learning.

Critical Analysis

The authors acknowledge that while Mini Honor of Kings captures the essence of the original "Honor of Kings" game, it is a simplified version that may not fully reflect the complexity of real-world multi-agent scenarios. The environment's grid-based structure and limited number of agents and actions could limit its ability to capture nuanced multi-agent interactions.

Additionally, the paper does not provide extensive details on the specific game mechanics, map configurations, or other environmental parameters used in the experiments. This makes it difficult to fully assess the generalizability of the results and the suitability of Mini Honor of Kings as a benchmark for different types of multi-agent problems.

Further research could explore ways to increase the environmental complexity and realism of Mini Honor of Kings, such as by incorporating more detailed agent behaviors, dynamic map generation, or additional game modes. Comparisons to other multi-agent benchmarks, such as StarCraft II or Dota 2, could also help to contextualize the strengths and limitations of this new environment.

Conclusion

The Mini Honor of Kings environment provides a valuable testbed for researchers working on multi-agent reinforcement learning algorithms. By simulating the competitive dynamics of a popular mobile game in a lightweight and configurable setting, the authors have created a useful platform for exploring complex multi-agent interactions.

The findings from the paper demonstrate the potential of Mini Honor of Kings to serve as a benchmark for evaluating the performance of different multi-agent learning techniques. As the field of multi-agent AI continues to advance, this environment could become an important tool for driving progress and expanding our understanding of how intelligent agents can effectively coordinate and compete in realistic scenarios.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning
Total Score

0

Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning

Lin Liu, Jian Zhao, Cheng Hu, Zhengtao Cao, Youpeng Zhao, Zhenbin Ye, Meng Meng, Wenjun Wang, Zhaofeng He, Houqiang Li, Xia Lin, Lanxiao Huang

Games are widely used as research environments for multi-agent reinforcement learning (MARL), but they pose three significant challenges: limited customization, high computational demands, and oversimplification. To address these issues, we introduce the first publicly available map editor for the popular mobile game Honor of Kings and design a lightweight environment, Mini Honor of Kings (Mini HoK), for researchers to conduct experiments. Mini HoK is highly efficient, allowing experiments to be run on personal PCs or laptops while still presenting sufficient challenges for existing MARL algorithms. We have tested our environment on common MARL algorithms and demonstrated that these algorithms have yet to find optimal solutions within this environment. This facilitates the dissemination and advancement of MARL methods within the research community. Additionally, we hope that more researchers will leverage the Honor of Kings map editor to develop innovative and scientifically valuable new maps. Our code and user manual are available at: https://github.com/tencent-ailab/mini-hok.

Read more

6/18/2024

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks
Total Score

0

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks

Yun Qu, Boyuan Wang, Jianzhun Shao, Yuhang Jiang, Chen Chen, Zhenbin Ye, Lin Liu, Junfeng Yang, Lin Lai, Hongyang Qin, Minwen Deng, Juchao Zhuo, Deheng Ye, Qiang Fu, Wei Yang, Guang Yang, Lanxiao Huang, Xiangyang Ji

The advancement of Offline Reinforcement Learning (RL) and Offline Multi-Agent Reinforcement Learning (MARL) critically depends on the availability of high-quality, pre-collected offline datasets that represent real-world complexities and practical applications. However, existing datasets often fall short in their simplicity and lack of realism. To address this gap, we propose Hokoff, a comprehensive set of pre-collected datasets that covers both offline RL and offline MARL, accompanied by a robust framework, to facilitate further research. This data is derived from Honor of Kings, a recognized Multiplayer Online Battle Arena (MOBA) game known for its intricate nature, closely resembling real-life situations. Utilizing this framework, we benchmark a variety of offline RL and offline MARL algorithms. We also introduce a novel baseline algorithm tailored for the inherent hierarchical action space of the game. We reveal the incompetency of current offline RL approaches in handling task complexity, generalization and multi-task learning.

Read more

8/21/2024

🏋️

Total Score

0

Massively Multiagent Minigames for Training Generalist Agents

Kyoung Whan Choe, Ryan Sullivan, Joseph Su'arez

We present Meta MMO, a collection of many-agent minigames for use as a reinforcement learning benchmark. Meta MMO is built on top of Neural MMO, a massively multiagent environment that has been the subject of two previous NeurIPS competitions. Our work expands Neural MMO with several computationally efficient minigames. We explore generalization across Meta MMO by learning to play several minigames with a single set of weights. We release the environment, baselines, and training code under the MIT license. We hope that Meta MMO will spur additional progress on Neural MMO and, more generally, will serve as a useful benchmark for many-agent generalization.

Read more

6/10/2024

Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning
Total Score

0

Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning

Yizhe Huang, Anji Liu, Fanqi Kong, Yaodong Yang, Song-Chun Zhu, Xue Feng

Despite the recent successes of multi-agent reinforcement learning (MARL) algorithms, efficiently adapting to co-players in mixed-motive environments remains a significant challenge. One feasible approach is to hierarchically model co-players' behavior based on inferring their characteristics. However, these methods often encounter difficulties in efficient reasoning and utilization of inferred information. To address these issues, we propose Hierarchical Opponent modeling and Planning (HOP), a novel multi-agent decision-making algorithm that enables few-shot adaptation to unseen policies in mixed-motive environments. HOP is hierarchically composed of two modules: an opponent modeling module that infers others' goals and learns corresponding goal-conditioned policies, and a planning module that employs Monte Carlo Tree Search (MCTS) to identify the best response. Our approach improves efficiency by updating beliefs about others' goals both across and within episodes and by using information from the opponent modeling module to guide planning. Experimental results demonstrate that in mixed-motive environments, HOP exhibits superior few-shot adaptation capabilities when interacting with various unseen agents, and excels in self-play scenarios. Furthermore, the emergence of social intelligence during our experiments underscores the potential of our approach in complex multi-agent environments.

Read more

7/15/2024