PIE: Parkour with Implicit-Explicit Learning Framework for Legged Robots

Read original: arXiv:2408.13740 - Published 9/4/2024 by Shixin Luo, Songbo Li, Ruiqi Yu, Zhicheng Wang, Jun Wu, Qiuguo Zhu

PIE: Parkour with Implicit-Explicit Learning Framework for Legged Robots

Overview

This paper proposes a new framework called PIE (Parkour with Implicit-Explicit Learning) for enabling legged robots to navigate challenging parkour environments.
The key ideas are to combine implicit (e.g., visual perception) and explicit (e.g., motion planning) learning to allow the robot to reason about and execute dynamic movements.
Experiments show the PIE framework enables a simulated legged robot to successfully navigate complex parkour courses, outperforming baselines.

Plain English Explanation

The researchers developed a new approach called PIE (Parkour with Implicit-Explicit Learning) to help legged robots navigate obstacle courses and perform dynamic parkour-style movements. The core insight is to combine two different types of learning:

Implicit Learning: This handles the robot's visual perception of the environment, allowing it to recognize obstacles, surfaces, and other relevant features.
Explicit Learning: This focuses on planning the robot's physical motions, like how to jump, land, and reorient its body.

By using both implicit and explicit learning together, the robot can reason about the environment and then execute the necessary movements to safely traverse complex parkour courses. The researchers tested this approach in simulations and found that the PIE framework enabled the robot to outperform other methods at navigating challenging parkour-style obstacle courses.

The key advantage of the PIE approach is that it allows the robot to break down the problem into perceptual and motion planning components, leveraging the strengths of each. The implicit visual learning handles the recognition of relevant environmental features, while the explicit motion planning handles the generation of the complex movements required for parkour. Integrating these two components enables the robot to tackle parkour challenges that would be difficult using a single learning approach.

Technical Explanation

The PIE framework has two main components:

Implicit Learning: This handles the robot's visual perception of the environment, using deep learning models to identify relevant visual features like obstacles, surfaces, and other landmarks. This allows the robot to build an understanding of the 3D structure of the environment.
Explicit Learning: This component focuses on planning the robot's physical motions, like how to jump, land, and reorient its body. It uses reinforcement learning techniques to train motion policies that can execute the dynamic movements required for parkour.

The key innovation is how these two components are integrated. The implicit visual perception feeds into the explicit motion planning, allowing the robot to reason about the environmental context when selecting actions. This enables the robot to plan and execute the complex, coordinated movements needed to navigate challenging parkour courses.

The researchers evaluate the PIE framework in simulation, testing the robot's ability to traverse obstacle courses of increasing difficulty. They compare the performance of the PIE approach to baselines that use only implicit or only explicit learning. The results show that the combined implicit-explicit approach of PIE significantly outperforms the baselines, demonstrating the value of this integrated framework for enabling legged robots to navigate complex parkour environments.

Critical Analysis

The PIE framework represents an interesting step forward in combining different learning approaches to enable robots to tackle complex physical challenges. By separating the perceptual and motion planning components, the researchers were able to leverage the strengths of each, resulting in more capable parkour navigation.

However, the paper does not address the potential limitations or challenges of deploying such a system in the real world. The experiments were all conducted in simulation, and it's unclear how well the approach would scale to handle the full complexity and uncertainty of a real-world environment. Additionally, the paper does not discuss the computational and hardware requirements of the PIE framework, which could be a significant practical concern for deploying on actual legged robots.

Another potential issue is the reliance on reinforcement learning for the explicit motion planning component. Reinforcement learning can be notoriously sample-inefficient and difficult to train, which could make the overall system brittle or challenging to deploy. Exploring alternative motion planning techniques, or ways to better integrate the implicit and explicit components, could be a fruitful area for future research.

Overall, the PIE framework is a promising step forward, but there are still many open challenges to be addressed before such an approach could be reliably deployed on physical legged robots navigating complex real-world environments.

Conclusion

The PIE (Parkour with Implicit-Explicit Learning) framework proposed in this paper represents an innovative approach to enabling legged robots to navigate challenging parkour-style environments. By combining implicit learning for visual perception and explicit learning for motion planning, the researchers have developed a system that can reason about the 3D structure of the environment and execute the necessary dynamic movements to traverse complex obstacle courses.

The results of the simulation experiments demonstrate the advantages of this integrated implicit-explicit learning approach, showing significant performance improvements over baselines that use only one type of learning. While there are still practical challenges to be addressed, the PIE framework is an important contribution to the field of legged robotics, pointing the way towards more capable and agile systems that can handle a wider range of real-world tasks and environments.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

PIE: Parkour with Implicit-Explicit Learning Framework for Legged Robots

Shixin Luo, Songbo Li, Ruiqi Yu, Zhicheng Wang, Jun Wu, Qiuguo Zhu

Parkour presents a highly challenging task for legged robots, requiring them to traverse various terrains with agile and smooth locomotion. This necessitates comprehensive understanding of both the robot's own state and the surrounding terrain, despite the inherent unreliability of robot perception and actuation. Current state-of-the-art methods either rely on complex pre-trained high-level terrain reconstruction modules or limit the maximum potential of robot parkour to avoid failure due to inaccurate perception. In this paper, we propose a one-stage end-to-end learning-based parkour framework: Parkour with Implicit-Explicit learning framework for legged robots (PIE) that leverages dual-level implicit-explicit estimation. With this mechanism, even a low-cost quadruped robot equipped with an unreliable egocentric depth camera can achieve exceptional performance on challenging parkour terrains using a relatively simple training process and reward function. While the training process is conducted entirely in simulation, our real-world validation demonstrates successful zero-shot deployment of our framework, showcasing superior parkour performance on harsh terrains.

9/4/2024

Humanoid Parkour Learning

Ziwen Zhuang, Shenzhe Yao, Hang Zhao

Parkour is a grand challenge for legged locomotion, even for quadruped robots, requiring active perception and various maneuvers to overcome multiple challenging obstacles. Existing methods for humanoid locomotion either optimize a trajectory for a single parkour track or train a reinforcement learning policy only to walk with a significant amount of motion references. In this work, we propose a framework for learning an end-to-end vision-based whole-body-control parkour policy for humanoid robots that overcomes multiple parkour skills without any motion prior. Using the parkour policy, the humanoid robot can jump on a 0.42m platform, leap over hurdles, 0.8m gaps, and much more. It can also run at 1.8m/s in the wild and walk robustly on different terrains. We test our policy in indoor and outdoor environments to demonstrate that it can autonomously select parkour skills while following the rotation command of the joystick. We override the arm actions and show that this framework can easily transfer to humanoid mobile manipulation tasks. Videos can be found at https://humanoid4parkour.github.io

6/18/2024

Model Predictive Parkour Control of a Monoped Hopper in Dynamically Changing Environments

Maximilian Albracht, Shivesh Kumar, Shubham Vyas, Frank Kirchner

A great advantage of legged robots is their ability to operate on particularly difficult and obstructed terrain, which demands dynamic, robust, and precise movements. The study of obstacle courses provides invaluable insights into the challenges legged robots face, offering a controlled environment to assess and enhance their capabilities. Traversing it with a one-legged hopper introduces intricate challenges, such as planning over contacts and dealing with flight phases, which necessitates a sophisticated controller. A novel model predictive parkour controller is introduced, that finds an optimal path through a real-time changing obstacle course with mixed integer motion planning. The execution of this optimized path is then achieved through a state machine employing a PD control scheme with feedforward torques, ensuring robust and accurate performance.

8/27/2024

🔄

Learning Agile Locomotion on Risky Terrains

Chong Zhang, Nikita Rudin, David Hoeller, Marco Hutter

Quadruped robots have shown remarkable mobility on various terrains through reinforcement learning. Yet, in the presence of sparse footholds and risky terrains such as stepping stones and balance beams, which require precise foot placement to avoid falls, model-based approaches are often used. In this paper, we show that end-to-end reinforcement learning can also enable the robot to traverse risky terrains with dynamic motions. To this end, our approach involves training a generalist policy for agile locomotion on disorderly and sparse stepping stones before transferring its reusable knowledge to various more challenging terrains by finetuning specialist policies from it. Given that the robot needs to rapidly adapt its velocity on these terrains, we formulate the task as a navigation task instead of the commonly used velocity tracking which constrains the robot's behavior and propose an exploration strategy to overcome sparse rewards and achieve high robustness. We validate our proposed method through simulation and real-world experiments on an ANYmal-D robot achieving peak forward velocity of >= 2.5 m/s on sparse stepping stones and narrow balance beams. Video: youtu.be/Z5X0J8OH6z4

8/12/2024