A Distributed Approach to Autonomous Intersection Management via Multi-Agent Reinforcement Learning

2405.08655

Published 5/15/2024 by Matteo Cederle, Marco Fabris, Gian Antonio Susto

A Distributed Approach to Autonomous Intersection Management via Multi-Agent Reinforcement Learning

Abstract

Autonomous intersection management (AIM) poses significant challenges due to the intricate nature of real-world traffic scenarios and the need for a highly expensive centralised server in charge of simultaneously controlling all the vehicles. This study addresses such issues by proposing a novel distributed approach to AIM utilizing multi-agent reinforcement learning (MARL). We show that by leveraging the 3D surround view technology for advanced assistance systems, autonomous vehicles can accurately navigate intersection scenarios without needing any centralised controller. The contributions of this paper thus include a MARL-based algorithm for the autonomous management of a 4-way intersection and also the introduction of a new strategy called prioritised scenario replay for improved training efficacy. We validate our approach as an innovative alternative to conventional centralised AIM techniques, ensuring the full reproducibility of our results. Specifically, experiments conducted in virtual environments using the SMARTS platform highlight its superiority over benchmarks across various metrics.

Create account to get full access

Overview

This paper proposes a distributed approach to autonomous intersection management using multi-agent reinforcement learning (MARL).
The goal is to develop a decentralized system where vehicles can navigate intersections efficiently without the need for a centralized controller.
The authors leverage MARL techniques to enable vehicles to learn optimal behavior through interaction and coordination.

Plain English Explanation

The paper describes a new way to manage traffic at intersections without a central controller. Typically, intersections are controlled by traffic lights or signs that tell vehicles when to stop and go. This can be inefficient, especially as we move towards autonomous vehicles that can communicate and coordinate with each other.

The researchers in this paper propose using multi-agent reinforcement learning to allow autonomous vehicles to learn how to navigate intersections on their own. Each vehicle is like an "agent" that learns through trial and error how to cross the intersection as quickly and safely as possible.

By having the vehicles coordinate with each other directly, without relying on a central system, the intersection can be managed in a more distributed and adaptive way. The vehicles learn to make decisions that optimize the overall flow of traffic, rather than just following a pre-programmed set of rules.

This approach has the potential to make intersections much more efficient, reducing delays and improving traffic flow. It also removes the need for traditional traffic control infrastructure like traffic lights, which could lead to cost savings. As autonomous vehicle technology advances, this type of decentralized, self-organizing intersection management could become an important part of the future of transportation.

Technical Explanation

The paper proposes a distributed approach to autonomous intersection management using multi-agent reinforcement learning (MARL). The key idea is to enable autonomous vehicles to learn optimal intersection-crossing behaviors through interaction and coordination, without relying on a centralized controller.

The authors model the intersection management problem as a multi-agent Markov decision process (MMDP), where each vehicle is an agent that aims to maximize its own reward (e.g., minimizing time to cross the intersection) while also considering the collective reward of all agents. They propose a MARL algorithm that allows the agents to learn their optimal policies through iterative interaction and feedback.

The MARL algorithm is based on the delay-aware MARL framework and incorporates control-theoretic safety constraints to ensure safe intersections. The agents learn their policies through a combination of on-policy and off-policy updates, with the goal of maximizing the overall intersection throughput.

The authors evaluate their approach through simulation experiments, comparing it to a centralized intersection management system and a decentralized rule-based system. The results demonstrate that the proposed MARL-based approach can achieve significant improvements in intersection throughput, delay, and safety compared to the baseline methods.

Critical Analysis

The paper presents a promising approach to autonomous intersection management, but there are a few potential limitations and areas for further research:

The simulation experiments are conducted in a simplified, idealized environment. More realistic scenarios, such as intersections with variable traffic patterns, complex road geometries, or mixed autonomy (autonomous and human-driven vehicles), should be considered to better understand the approach's performance in real-world conditions.
The paper does not address issues related to communication and coordination between vehicles, which could be crucial for the effectiveness of the MARL-based approach. Factors like communication delays, packet losses, and scalability with increasing numbers of vehicles should be investigated.
The safety constraints incorporated in the MARL algorithm are based on a simplified control-theoretic model. More sophisticated safety monitoring and enforcement mechanisms may be necessary to ensure robust and reliable intersection management, especially in the presence of unexpected or adversarial behaviors.
The paper does not discuss the computational and storage requirements of the MARL-based approach, which could be a concern for real-time implementation in resource-constrained vehicles. Efficient MARL algorithms may be necessary to address these challenges.

Overall, the proposed approach is a valuable contribution to the field of autonomous intersection management, but further research and validation in more realistic scenarios are needed to fully assess its practicality and scalability.

Conclusion

This paper presents a distributed approach to autonomous intersection management using multi-agent reinforcement learning (MARL). The key idea is to enable autonomous vehicles to learn optimal intersection-crossing behaviors through interaction and coordination, without relying on a centralized controller.

The proposed MARL-based approach has the potential to significantly improve intersection throughput, delay, and safety compared to traditional centralized and rule-based systems. By allowing vehicles to learn and adapt their behavior based on the collective conditions, the intersection can be managed in a more flexible and efficient manner.

As autonomous vehicle technology continues to advance, this type of decentralized, self-organizing approach to intersection management could become an important component of the future transportation system. However, further research is needed to address practical challenges, such as communication, safety, and computational requirements, to ensure the approach is viable in real-world conditions.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

A low-cost Framework for Decentralized Autonomous Intersection Management

Rugved Katole, Arpita Sinha

This paper addresses the traffic management problem for autonomous vehicles at intersections without traffic signals. In the current system, a road junction has no traffic signals when the traffic volume is low to medium. Installing infrastructure at each unsignalled crossing to coordinate autonomous cars can be formidable. We propose a novel low-cost solution strategy where the vehicles use a harmony matrix to find the best possible combination of the cars to cross the intersection without any crashes. The harmony matrix defines the connection between different vehicle maneuvers and is queried online for intersection management. We maximize the throughput of the intersection by solving a maximal clique problem formulated based on the vehicles present at the intersection. The proposed algorithm relies on the intent perceived by the autonomous vehicles. We compare our work with a communication-based strategy that uses V2I communication protocols, and through extensive simulation, we showed that our algorithm is comparable when the traffic volume is less than 500 PCUs/hr/lane.

4/1/2024

cs.RO cs.MA

Multi-Agent Reinforcement Learning with Control-Theoretic Safety Guarantees for Dynamic Network Bridging

Raffaele Galliera, Konstantinos Mitsopoulos, Niranjan Suri, Raffaele Romagnoli

Addressing complex cooperative tasks in safety-critical environments poses significant challenges for Multi-Agent Systems, especially under conditions of partial observability. This work introduces a hybrid approach that integrates Multi-Agent Reinforcement Learning with control-theoretic methods to ensure safe and efficient distributed strategies. Our contributions include a novel setpoint update algorithm that dynamically adjusts agents' positions to preserve safety conditions without compromising the mission's objectives. Through experimental validation, we demonstrate significant advantages over conventional MARL strategies, achieving comparable task performance with zero safety violations. Our findings indicate that integrating safe control with learning approaches not only enhances safety compliance but also achieves good performance in mission objectives.

4/3/2024

cs.MA cs.AI cs.LG cs.NI cs.SY eess.SY

🏅

Fully Distributed Fog Load Balancing with Multi-Agent Reinforcement Learning

Maad Ebrahim, Abdelhakim Hafid

Real-time Internet of Things (IoT) applications require real-time support to handle the ever-growing demand for computing resources to process IoT workloads. Fog Computing provides high availability of such resources in a distributed manner. However, these resources must be efficiently managed to distribute unpredictable traffic demands among heterogeneous Fog resources. This paper proposes a fully distributed load-balancing solution with Multi-Agent Reinforcement Learning (MARL) that intelligently distributes IoT workloads to optimize the waiting time while providing fair resource utilization in the Fog network. These agents use transfer learning for life-long self-adaptation to dynamic changes in the environment. By leveraging distributed decision-making, MARL agents effectively minimize the waiting time compared to a single centralized agent solution and other baselines, enhancing end-to-end execution delay. Besides performance gain, a fully distributed solution allows for a global-scale implementation where agents can work independently in small collaboration regions, leveraging nearby local resources. Furthermore, we analyze the impact of a realistic frequency to observe the state of the environment, unlike the unrealistic common assumption in the literature of having observations readily available in real-time for every required action. The findings highlight the trade-off between realism and performance using an interval-based Gossip-based multi-casting protocol against assuming real-time observation availability for every generated workload.

5/22/2024

cs.AI cs.DC cs.LG cs.MA

Multi-Task Lane-Free Driving Strategy for Connected and Automated Vehicles: A Multi-Agent Deep Reinforcement Learning Approach

Mehran Berahman, Majid Rostami-Shahrbabaki, Klaus Bogenberger

Deep reinforcement learning has shown promise in various engineering applications, including vehicular traffic control. The non-stationary nature of traffic, especially in the lane-free environment with more degrees of freedom in vehicle behaviors, poses challenges for decision-making since a wrong action might lead to a catastrophic failure. In this paper, we propose a novel driving strategy for Connected and Automated Vehicles (CAVs) based on a competitive Multi-Agent Deep Deterministic Policy Gradient approach. The developed multi-agent deep reinforcement learning algorithm creates a dynamic and non-stationary scenario, mirroring real-world traffic complexities and making trained agents more robust. The algorithm's reward function is strategically and uniquely formulated to cover multiple vehicle control tasks, including maintaining desired speeds, overtaking, collision avoidance, and merging and diverging maneuvers. Moreover, additional considerations for both lateral and longitudinal passenger comfort and safety criteria are taken into account. We employed inter-vehicle forces, known as nudging and repulsive forces, to manage the maneuvers of CAVs in a lane-free traffic environment. The proposed driving algorithm is trained and evaluated on lane-free roads using the Simulation of Urban Mobility platform. Experimental results demonstrate the algorithm's efficacy in handling different objectives, highlighting its potential to enhance safety and efficiency in autonomous driving within lane-free traffic environments.

6/24/2024

cs.RO