Autonomous Control of a Novel Closed Chain Five Bar Active Suspension via Deep Reinforcement Learning

2406.18899

Published 6/28/2024 by Nishesh Singh, Sidharth Ramesh, Abhishek Shankar, Jyotishka Duttagupta, Leander Stephen D'Souza, Sanjay Singh

cs.RO cs.AI

🤿

Abstract

Planetary exploration requires traversal in environments with rugged terrains. In addition, Mars rovers and other planetary exploration robots often carry sensitive scientific experiments and components onboard, which must be protected from mechanical harm. This paper deals with an active suspension system focused on chassis stabilisation and an efficient traversal method while encountering unavoidable obstacles. Soft Actor-Critic (SAC) was applied along with Proportional Integral Derivative (PID) control to stabilise the chassis and traverse large obstacles at low speeds. The model uses the rover's distance from surrounding obstacles, the height of the obstacle, and the chassis' orientation to actuate the control links of the suspension accurately. Simulations carried out in the Gazebo environment are used to validate the proposed active system.

Create account to get full access

Overview

Planetary exploration often involves traversing rugged terrain, which can be challenging for rovers and other robots carrying sensitive scientific equipment.
This paper proposes an active suspension system that uses Soft Actor-Critic (SAC) and Proportional Integral Derivative (PID) control to stabilize the chassis and efficiently traverse large obstacles at low speeds.
The system uses information about the rover's distance from obstacles, the height of obstacles, and the chassis' orientation to accurately control the suspension components.
Simulations in the Gazebo environment were used to validate the proposed active suspension system.

Plain English Explanation

Exploring other planets can be tricky for robots, as they often need to navigate rough, uneven terrain. These robots also carry delicate scientific instruments that need to be protected from getting damaged. This paper presents a solution to help rovers and other planetary exploration robots handle these challenges.

The researchers developed an "active suspension system" that uses a combination of advanced control techniques to keep the robot's body stable and allow it to traverse large obstacles without causing harm to the sensitive equipment onboard. The system uses information about the robot's surroundings, such as how far away obstacles are and how tall they are, as well as the orientation of the robot's body, to precisely adjust the suspension components and keep everything stable.

The researchers tested this active suspension system in computer simulations, using a realistic 3D environment called Gazebo, to see how well it would work in practice. By using these advanced control methods, the researchers were able to show that the robot could navigate tough terrain and obstacles while keeping the delicate instruments safe.

Technical Explanation

The paper proposes an active suspension system that combines Soft Actor-Critic (SAC) reinforcement learning with Proportional Integral Derivative (PID) control to stabilize the chassis of a planetary exploration rover and enable efficient traversal of large obstacles at low speeds.

The system uses information about the rover's distance from surrounding obstacles, the height of obstacles, and the orientation of the chassis to accurately actuate the control links of the suspension. This allows the suspension to adapt to the terrain and keep the sensitive scientific instruments onboard stable and protected.

The researchers conducted simulations in the Gazebo environment to validate the performance of the proposed active suspension system. Gazebo is a widely used 3D simulation platform that provides a realistic physical simulation of robots and their environments, making it well-suited for testing and evaluating new robotic control systems like the one described in this paper.

By combining advanced control techniques like SAC and PID, the active suspension system was able to effectively stabilize the rover's chassis and navigate large obstacles while maintaining a low speed to protect the onboard instruments. This is a key capability for planetary exploration robots that need to traverse challenging terrains while safeguarding delicate scientific payloads.

Critical Analysis

The paper provides a thorough evaluation of the proposed active suspension system through extensive simulations in the Gazebo environment. However, the researchers acknowledge that further testing and validation on real-world robotic platforms would be necessary to fully assess the system's performance and robustness in actual planetary exploration scenarios.

One potential limitation of the research is that the simulations were conducted at low speeds, which may not fully capture the challenges of navigating high-speed maneuvers over rugged terrain. Additionally, the paper does not explore how the active suspension system would handle more complex obstacle configurations or environmental conditions, such as loose soil, steep inclines, or variable surface friction.

Further research could also investigate the integration of this active suspension system with other robotic exploration capabilities, such as advanced perception or autonomous decision-making, to create a more comprehensive and capable planetary exploration platform.

Conclusion

This paper presents an innovative active suspension system that leverages advanced control techniques like Soft Actor-Critic and PID control to stabilize the chassis of planetary exploration rovers and enable efficient traversal of large obstacles. The simulations conducted in the Gazebo environment demonstrate the promising performance of this system in maintaining the stability of sensitive scientific instruments while navigating challenging terrain.

While further real-world testing and integration with other robotic capabilities are needed, this research represents an important step towards developing more capable and resilient planetary exploration platforms that can safely navigate the rugged environments found on other worlds.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Sim-to-real transfer of active suspension control using deep reinforcement learning

Viktor Wiberg, Erik Wallin, Arvid Falldin, Tobias Semberg, Morgan Rossander, Eddie Wadbro, Martin Servin

We explore sim-to-real transfer of deep reinforcement learning controllers for a heavy vehicle with active suspensions designed for traversing rough terrain. While related research primarily focuses on lightweight robots with electric motors and fast actuation, this study uses a forestry vehicle with a complex hydraulic driveline and slow actuation. We simulate the vehicle using multibody dynamics and apply system identification to find an appropriate set of simulation parameters. We then train policies in simulation using various techniques to mitigate the sim-to-real gap, including domain randomization, action delays, and a reward penalty to encourage smooth control. In reality, the policies trained with action delays and a penalty for erratic actions perform nearly at the same level as in simulation. In experiments on level ground, the motion trajectories closely overlap when turning to either side, as well as in a route tracking scenario. When faced with a ramp that requires active use of the suspensions, the simulated and real motions are in close alignment. This shows that the actuator model together with system identification yields a sufficiently accurate model of the actuators. We observe that policies trained without the additional action penalty exhibit fast switching or bang-bang control. These present smooth motions and high performance in simulation but transfer poorly to reality. We find that policies make marginal use of the local height map for perception, showing no indications of predictive planning. However, the strong transfer capabilities entail that further development concerning perception and performance can be largely confined to simulation.

5/1/2024

cs.RO

Passive Obstacle Aware Control to Follow Desired Velocities

Lukas Huber, Trinca Thibaud, Jean-Jacques Slotine, Aude Billard

Evaluating and updating the obstacle avoidance velocity for an autonomous robot in real-time ensures robustness against noise and disturbances. A passive damping controller can obtain the desired motion with a torque-controlled robot, which remains compliant and ensures a safe response to external perturbations. Here, we propose a novel approach for designing the passive control policy. Our algorithm complies with obstacle-free zones while transitioning to increased damping near obstacles to ensure collision avoidance. This approach ensures stability across diverse scenarios, effectively mitigating disturbances. Validation on a 7DoF robot arm demonstrates superior collision rejection capabilities compared to the baseline, underlining its practicality for real-world applications. Our obstacle-aware damping controller represents a substantial advancement in secure robot control within complex and uncertain environments.

6/21/2024

cs.RO cs.SY eess.SY

ASID: Active Exploration for System Identification in Robotic Manipulation

Marius Memmel, Andrew Wagenmaker, Chuning Zhu, Patrick Yin, Dieter Fox, Abhishek Gupta

Model-free control strategies such as reinforcement learning have shown the ability to learn control strategies without requiring an accurate model or simulator of the world. While this is appealing due to the lack of modeling requirements, such methods can be sample inefficient, making them impractical in many real-world domains. On the other hand, model-based control techniques leveraging accurate simulators can circumvent these challenges and use a large amount of cheap simulation data to learn controllers that can effectively transfer to the real world. The challenge with such model-based techniques is the requirement for an extremely accurate simulation, requiring both the specification of appropriate simulation assets and physical parameters. This requires considerable human effort to design for every environment being considered. In this work, we propose a learning system that can leverage a small amount of real-world data to autonomously refine a simulation model and then plan an accurate control strategy that can be deployed in the real world. Our approach critically relies on utilizing an initial (possibly inaccurate) simulator to design effective exploration policies that, when deployed in the real world, collect high-quality data. We demonstrate the efficacy of this paradigm in identifying articulation, mass, and other physical parameters in several challenging robotic manipulation tasks, and illustrate that only a small amount of real-world data can allow for effective sim-to-real transfer. Project website at https://weirdlabuw.github.io/asid

6/28/2024

cs.RO cs.LG cs.SY eess.SY

📈

Active Learning-based Model Predictive Coverage Control

Rahel Rickenbach, Johannes Kohler, Anna Scampicchio, Melanie N. Zeilinger, Andrea Carron

The problem of coverage control, i.e., of coordinating multiple agents to optimally cover an area, arises in various applications. However, coverage applications face two major challenges: (1) dealing with nonlinear dynamics while respecting system and safety critical constraints, and (2) performing the task in an initially unknown environment. We solve the coverage problem by using a hierarchical framework, in which references are calculated at a central server and passed to the agents' local model predictive control (MPC) tracking schemes. Furthermore, to ensure that the environment is actively explored by the agents a probabilistic exploration-exploitation trade-off is deployed. In addition, we derive a control framework that avoids the hierarchical structure by integrating the reference optimization in the MPC formulation. Active learning is then performed drawing inspiration from Upper Confidence Bound (UCB) approaches. For all developed control architectures, we guarantee closed-loop constraint satisfaction and convergence to an optimal configuration. Furthermore, all methods are tested and compared on hardware using a miniature car platform.

4/1/2024

eess.SY cs.SY