Solution Concepts in Hierarchical Games under Bounded Rationality with Applications to Autonomous Driving

2009.10033

Published 6/6/2024 by Atrisha Sarkar, Krzysztof Czarnecki

✅

Abstract

With autonomous vehicles (AV) set to integrate further into regular human traffic, there is an increasing consensus on treating AV motion planning as a multi-agent problem. However, the traditional game-theoretic assumption of complete rationality is too strong for human driving, and there is a need for understanding human driving as a emph{bounded rational} activity through a behavioural game-theoretic lens. To that end, we adapt four metamodels of bounded rational behaviour: three based on Quantal level-k and one based on Nash equilibrium with quantal errors. We formalize the different solution concepts that can be applied in the context of hierarchical games, a framework used in multi-agent motion planning, for the purpose of creating game theoretic models of driving behaviour. Furthermore, based on a contributed dataset of human driving at a busy urban intersection with a total of approximately 4k agents and 44k decision points, we evaluate the behaviour models on the basis of model fit to naturalistic data, as well as their predictive capacity. Our results suggest that among the behaviour models evaluated, at the level of maneuvers, modeling driving behaviour as an adaptation of the Quantal level-k model with level-0 behaviour modelled as pure rule-following provides the best fit to naturalistic driving behaviour. At the level of trajectories, bounds sampling of actions and a maxmax non-strategic models is the most accurate within the set of models in comparison. We also find a significant impact of situational factors on the performance of behaviour models.

Create account to get full access

Overview

As autonomous vehicles (AVs) become more integrated into regular traffic, there is a growing consensus that AV motion planning should be treated as a multi-agent problem.
However, the traditional game-theoretic assumption of complete rationality does not accurately capture human driving behavior, which is more accurately described as "bounded rational" activity.
The research paper adapts four different models of bounded rational behavior, including Quantal level-k and Nash equilibrium with quantal errors, to create game-theoretic models of driving behavior.
The models are evaluated using a dataset of human driving at a busy urban intersection, looking at both maneuver-level and trajectory-level performance.

Plain English Explanation

Autonomous vehicles (AVs) are becoming more common on our roads, and as they do, it's important to understand how they can interact effectively with human drivers. The provided research paper explores this by treating AV motion planning as a "multi-agent problem" - in other words, looking at how the AV can navigate safely and efficiently alongside human drivers.

The traditional approach to this has been to assume that all drivers are "completely rational" - that they always make the optimal decision. However, the researchers recognize that human driving behavior is often more "bounded" or limited in its rationality. People don't always make the absolute best choice; instead, they make decisions that are "good enough" given the situation and their own limitations.

To better model this, the researchers adapted four different models of "bounded rational" behavior. These include variations on the "Quantal level-k" model, which assumes drivers follow simple rules, as well as a "Nash equilibrium with quantal errors" model, which allows for some randomness in decision-making.

The researchers then tested these models against a large dataset of actual human driving behavior at a busy urban intersection. They looked at how well the models could predict the specific maneuvers drivers would take, as well as the detailed trajectories of their vehicles.

The results suggest that the Quantal level-k model with rule-following "level-0" behavior provides the best fit for maneuver-level prediction, while a "bounds sampling" model works best for trajectory-level prediction. The researchers also found that the performance of the different models was significantly impacted by the specific driving situation.

Technical Explanation

The paper addresses the challenge of modeling human driving behavior in the context of autonomous vehicle (AV) motion planning, which is increasingly being treated as a multi-agent problem. The researchers recognize that the traditional game-theoretic assumption of complete rationality is too strong for human driving, and they seek to understand it through a behavioral game-theoretic lens.

To this end, the researchers adapt four different metamodels of bounded rational behavior: three based on Quantal level-k and one based on Nash equilibrium with quantal errors. They formalize the solution concepts that can be applied in the context of hierarchical games, a framework used in multi-agent motion planning, to create game-theoretic models of driving behavior.

The researchers then evaluate the behavior models using a contributed dataset of human driving at a busy urban intersection. This dataset includes approximately 4,000 agents and 44,000 decision points. The models are assessed in terms of their fit to the naturalistic driving data, as well as their predictive capacity at both the maneuver level and the trajectory level.

The results suggest that, at the maneuver level, modeling driving behavior as an adaptation of the Quantal level-k model with level-0 behavior modeled as pure rule-following provides the best fit to the naturalistic data. At the trajectory level, the researchers found that a bounds sampling of actions and a max-max non-strategic model were the most accurate within the set of models evaluated.

Importantly, the researchers also found a significant impact of situational factors on the performance of the different behavior models, highlighting the need to consider context-specific factors when modeling human driving behavior.

Critical Analysis

The research presented in this paper makes a valuable contribution to the field of autonomous vehicle motion planning by incorporating more realistic models of human driving behavior. The adaptation of bounded rational behavior models, such as Quantal level-k and Nash equilibrium with quantal errors, represents an important step forward in capturing the complexity of human decision-making in driving situations.

One potential limitation of the study is the reliance on a single dataset of human driving at a specific urban intersection. While this provides a rich source of naturalistic data, it may not fully capture the diversity of driving behaviors and situations that AVs will encounter in the real world. Further validation of the models across a wider range of driving scenarios would help to strengthen the conclusions.

Additionally, the paper does not explore the potential impact of individual differences in driving styles or decision-making processes. It may be useful to investigate how factors such as age, experience, or risk tolerance influence the performance of the different behavior models, as this could inform the development of more personalized AV motion planning algorithms.

Overall, this research represents an important step towards more realistic and effective AV motion planning in mixed traffic environments. The insights gained from the evaluation of bounded rational behavior models can help to inform the design of AV systems that can better anticipate and respond to the complex decision-making of human drivers.

Conclusion

This research paper presents a significant advancement in the modeling of human driving behavior for the purpose of autonomous vehicle (AV) motion planning. By adapting four different metamodels of bounded rational behavior, the researchers have demonstrated that more realistic representations of human decision-making can be incorporated into game-theoretic frameworks for multi-agent motion planning.

The evaluation of these models against a naturalistic dataset of human driving behavior provides valuable insights into the strengths and limitations of each approach. The finding that the Quantal level-k model with rule-following "level-0" behavior performs best at the maneuver level, while a bounds sampling model is most accurate for trajectory-level prediction, highlights the importance of considering both strategic and non-strategic driving behaviors.

Furthermore, the researchers' observation that situational factors can significantly impact the performance of the different behavior models underscores the need for AV systems to be adaptive and context-aware. As autonomous vehicles become more integrated into regular traffic, this type of research will be crucial in ensuring safe and efficient interactions between AVs and human drivers.

Overall, this paper represents an important contribution to the ongoing development of autonomous vehicle technology, paving the way for motion planning algorithms that can better anticipate and respond to the complex decision-making of human road users.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Hierarchical Learned Risk-Aware Planning Framework for Human Driving Modeling

Nathan Ludlow, Yiwei Lyu, John Dolan

This paper presents a novel approach to modeling human driving behavior, designed for use in evaluating autonomous vehicle control systems in a simulation environments. Our methodology leverages a hierarchical forward-looking, risk-aware estimation framework with learned parameters to generate human-like driving trajectories, accommodating multiple driver levels determined by model parameters. This approach is grounded in multimodal trajectory prediction, using a deep neural network with LSTM-based social pooling to predict the trajectories of surrounding vehicles. These trajectories are used to compute forward-looking risk assessments along the ego vehicle's path, guiding its navigation. Our method aims to replicate human driving behaviors by learning parameters that emulate human decision-making during driving. We ensure that our model exhibits robust generalization capabilities by conducting simulations, employing real-world driving data to validate the accuracy of our approach in modeling human behavior. The results reveal that our model effectively captures human behavior, showcasing its versatility in modeling human drivers in diverse highway scenarios.

5/13/2024

cs.RO

Act Natural! Projecting Autonomous System Trajectories Into Naturalistic Behavior Sets

Hamzah I. Khan, Adam J. Thorpe, David Fridovich-Keil

Autonomous agents operating around human actors must consider how their behaviors might affect those humans, even when not directly interacting with them. To this end, it is often beneficial to be predictable and appear naturalistic. Existing methods to address this problem use human actor intent modeling or imitation learning techniques, but these approaches rarely capture all possible motivations for human behavior or require significant amounts of data. In contrast, we propose a technique for modeling naturalistic behavior as a set of convex hulls computed over a relatively small dataset of human behavior. Given this set, we design an optimization-based filter which projects arbitrary trajectories into it to make them more naturalistic for autonomous agents to execute while also satisfying dynamics constraints. We demonstrate our methods on real-world human driving data from the inD intersection dataset (Bock et al., 2020).

5/30/2024

cs.MA

🤿

Risk-anticipatory autonomous driving strategies considering vehicles' weights, based on hierarchical deep reinforcement learning

Di Chen, Hao Li, Zhicheng Jin, Huizhao Tu, Meixin Zhu

Autonomous vehicles (AVs) have the potential to prevent accidents caused by drivers errors and reduce road traffic risks. Due to the nature of heavy vehicles, whose collisions cause more serious crashes, the weights of vehicles need to be considered when making driving strategies aimed at reducing the potential risks and their consequences in the context of autonomous driving. This study develops an autonomous driving strategy based on risk anticipation, considering the weights of surrounding vehicles and using hierarchical deep reinforcement learning. A risk indicator integrating surrounding vehicles weights, based on the risk field theory, is proposed and incorporated into autonomous driving decisions. A hybrid action space is designed to allow for left lane changes, right lane changes and car-following, which enables AVs to act more freely and realistically whenever possible. To solve the above hybrid decision-making problem, a hierarchical proximal policy optimization (HPPO) algorithm with an attention mechanism (AT-HPPO) is developed, providing great advantages in maintaining stable performance with high robustness and generalization. An indicator, potential collision energy in conflicts (PCEC), is newly proposed to evaluate the performance of the developed AV driving strategy from the perspective of the consequences of potential accidents. The performance evaluation results in simulation and dataset demonstrate that our model provides driving strategies that reduce both the likelihood and consequences of potential accidents, at the same time maintaining driving efficiency. The developed method is especially meaningful for AVs driving on highways, where heavy vehicles make up a high proportion of the traffic.

5/8/2024

cs.RO cs.LG

📉

Tactical Game-theoretic Decision-making with Homotopy Class Constraints

Michael Khayyat, Alessandro Zanardi, Stefano Arrigoni, Francesco Braghin

We propose a tactical homotopy-aware decision-making framework for game-theoretic motion planning in urban environments. We model urban driving as a generalized Nash equilibrium problem and employ a mixed-integer approach to tame the combinatorial aspect of motion planning. More specifically, by utilizing homotopy classes, we partition the high-dimensional solution space into finite, well-defined subregions. Each subregion (homotopy) corresponds to a high-level tactical decision, such as the passing order between pairs of players. The proposed formulation allows to find global optimal Nash equilibria in a computationally tractable manner by solving a mixed-integer quadratic program. Each homotopy decision is represented by a binary variable that activates different sets of linear collision avoidance constraints. This extra homotopic constraint allows to find solutions in a more efficient way (on a roundabout scenario on average 5-times faster). We experimentally validate the proposed approach on scenarios taken from the rounD dataset. Simulation-based testing in receding horizon fashion demonstrates the capability of the framework in achieving globally optimal solutions while yielding a 78% average decrease in the computational time with respect to an implementation without the homotopic constraints.

6/21/2024

cs.MA cs.RO