2-Level Reinforcement Learning for Ships on Inland Waterways: Path Planning and Following

Read original: arXiv:2307.16769 - Published 8/22/2024 by Martin Waltz, Niklas Paulig, Ostap Okhrin

🏅

Overview

This paper proposes a modularized framework for controlling autonomous surface vehicles (ASVs) on inland waterways (IWs) using deep reinforcement learning (DRL).
The framework consists of two levels: a high-level local path planning (LPP) unit and a low-level path following (PF) unit, each with a DRL agent.
The LPP agent plans a path considering dynamic vessels, traffic rules, and waterway geometry, using a novel spatial-temporal recurrent neural network architecture.
The PF agent performs low-level actuator control, accounting for shallow water influences and environmental forces like winds, waves, and currents.

Plain English Explanation

The paper describes a system for controlling autonomous boats on inland waterways, using a two-part approach based on deep reinforcement learning. The first part, the local path planning unit, is responsible for planning a safe route for the boat. It takes into account the movements of other boats, the rules of the waterway, and the shape of the waterway itself. This part uses a novel neural network architecture to continuously adjust the boat's course.

The second part, the path following unit, controls the boat's actuators (like the rudder and engine) to actually follow the planned path. It accounts for factors like shallow water, wind, waves, and currents that can affect the boat's movement.

The authors show that this two-part system outperforms other methods, keeping the boat farther away from other vessels and following the planned path more accurately, while also requiring less control effort from the boat's actuators.

Technical Explanation

The local path planning (LPP) agent uses a spatial-temporal recurrent neural network architecture to continuously plan a path for the ASV that avoids collisions with dynamic vessels and obeys traffic rules. This novel approach outperforms a state-of-the-art artificial potential field method, increasing the minimum distance to other vessels by 65% on average.

The path following (PF) agent controls the boat's actuators while accounting for shallow water influences and environmental forces like winds, waves, and currents. Compared to a traditional PID controller, the PF agent yields only 61% of the mean cross-track error while significantly reducing the control effort required.

The authors validate the full framework in simulation, using real automatic identification system (AIS) data to model the behavior of other ships on the lower Elbe river in Germany.

Critical Analysis

The paper provides a comprehensive and technically sound framework for controlling ASVs on inland waterways. The use of DRL allows the system to adaptively navigate dynamic environments, which is a key advantage over traditional control methods.

However, the paper does not address the challenge of transferring the trained DRL agents from simulation to real-world deployment. This sim-to-real gap is a common issue in DRL and would need to be carefully addressed before the system could be used in practice.

Additionally, the paper focuses on a single river system as a case study. Further research would be needed to evaluate the framework's performance and generalization across a wider range of inland waterway environments and conditions.

Conclusion

This paper presents a promising modularized framework for controlling autonomous surface vehicles on inland waterways using deep reinforcement learning. The two-level approach of high-level path planning and low-level path following demonstrates significant improvements over traditional control methods.

While the technical details are sound, the authors will need to address the challenge of bridging the sim-to-real gap and validating the framework's performance across a broader range of inland waterway environments.

Overall, this research represents an important step towards developing robust and adaptive autonomous navigation systems for aquatic vehicles, with potential applications in areas like transportation, environmental monitoring, and search and rescue operations.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🏅

2-Level Reinforcement Learning for Ships on Inland Waterways: Path Planning and Following

Martin Waltz, Niklas Paulig, Ostap Okhrin

This paper proposes a realistic modularized framework for controlling autonomous surface vehicles (ASVs) on inland waterways (IWs) based on deep reinforcement learning (DRL). The framework improves operational safety and comprises two levels: a high-level local path planning (LPP) unit and a low-level path following (PF) unit, each consisting of a DRL agent. The LPP agent is responsible for planning a path under consideration of dynamic vessels, closing a gap in the current research landscape. In addition, the LPP agent adequately considers traffic rules and the geometry of the waterway. We thereby introduce a novel application of a spatial-temporal recurrent neural network architecture to continuous action spaces. The LPP agent outperforms a state-of-the-art artificial potential field (APF) method by increasing the minimum distance to other vessels by 65% on average. The PF agent performs low-level actuator control while accounting for shallow water influences and the environmental forces winds, waves, and currents. Compared with a proportional-integral-derivative (PID) controller, the PF agent yields only 61% of the mean cross-track error (MCTE) while significantly reducing control effort (CE) in terms of the required absolute rudder angle. Lastly, both agents are jointly validated in simulation, employing the lower Elbe in northern Germany as an example case and using real automatic identification system (AIS) trajectories to model the behavior of other ships.

8/22/2024

🤿

A Deep Reinforcement Learning Framework and Methodology for Reducing the Sim-to-Real Gap in ASV Navigation

Luis F W Batista (UL), Junghwan Ro, Antoine Richard, Pete Schroepfer, Seth Hutchinson, Cedric Pradalier

Despite the increasing adoption of Deep Reinforcement Learning (DRL) for Autonomous Surface Vehicles (ASVs), there still remain challenges limiting real-world deployment. In this paper, we first integrate buoyancy and hydrodynamics models into a modern Reinforcement Learning framework to reduce training time. Next, we show how system identification coupled with domain randomization improves the RL agent performance and narrows the sim-to-real gap. Real-world experiments for the task of capturing floating waste show that our approach lowers energy consumption by 13.1% while reducing task completion time by 7.4%. These findings, supported by sharing our open-source implementation, hold the potential to impact the efficiency and versatility of ASVs, contributing to environmental conservation efforts.

7/12/2024

Let Hybrid A* Path Planner Obey Traffic Rules: A Deep Reinforcement Learning-Based Planning Framework

Xibo Li, Shruti Patel, Christof Buskens

Deep reinforcement learning (DRL) allows a system to interact with its environment and take actions by training an efficient policy that maximizes self-defined rewards. In autonomous driving, it can be used as a strategy for high-level decision making, whereas low-level algorithms such as the hybrid A* path planning have proven their ability to solve the local trajectory planning problem. In this work, we combine these two methods where the DRL makes high-level decisions such as lane change commands. After obtaining the lane change command, the hybrid A* planner is able to generate a collision-free trajectory to be executed by a model predictive controller (MPC). In addition, the DRL algorithm is able to keep the lane change command consistent within a chosen time-period. Traffic rules are implemented using linear temporal logic (LTL), which is then utilized as a reward function in DRL. Furthermore, we validate the proposed method on a real system to demonstrate its feasibility from simulation to implementation on real hardware.

7/2/2024

🤿

Autonomous Navigation of Unmanned Vehicle Through Deep Reinforcement Learning

Letian Xu, Jiabei Liu, Haopeng Zhao, Tianyao Zheng, Tongzhou Jiang, Lipeng Liu

This paper explores the method of achieving autonomous navigation of unmanned vehicles through Deep Reinforcement Learning (DRL). The focus is on using the Deep Deterministic Policy Gradient (DDPG) algorithm to address issues in high-dimensional continuous action spaces. The paper details the model of a Ackermann robot and the structure and application of the DDPG algorithm. Experiments were conducted in a simulation environment to verify the feasibility of the improved algorithm. The results demonstrate that the DDPG algorithm outperforms traditional Deep Q-Network (DQN) and Double Deep Q-Network (DDQN) algorithms in path planning tasks.

7/30/2024