One-Shot Transfer of Long-Horizon Extrinsic Manipulation Through Contact Retargeting

Read original: arXiv:2404.07468 - Published 4/12/2024 by Albert Wu, Ruocheng Wang, Sirui Chen, Clemens Eppner, C. Karen Liu

Overview

This paper presents a method for generalizing a single demonstration of a long-horizon extrinsic manipulation task to diverse objects and environments, using a technique called "contact retargeting."
Extrinsic manipulation uses contacts with the environment to achieve objectives that are otherwise impossible with a parallel jaw gripper.
The key idea is to capture the contact requirements of each step in the demonstrated task and enforce them on the test scene, rather than copying the demonstrated motion.
This allows the robot to perform the same sequence of manipulation steps even when the object and environment differ from those in the demonstration.

Plain English Explanation

The researchers have developed a way to show a robot a complex physical task once, and then have it repeat that task with different objects in different surroundings. The key is to focus on the contacts between the robot, the object, and the environment, rather than just the robot's movements.

Imagine a robot that has been shown once how to handle an object by bracing it against its surroundings. Rather than painstakingly reprogramming every joint movement for each new object or workspace, the method captures which contacts each step of the task needs in order to start well. It then recreates those contacts in the new scene, so the robot can perform the same high-level task without starting from scratch.

This contact-based approach is particularly useful for long, multi-step manipulation tasks, where the robot needs to carefully orchestrate a sequence of interactions between itself, the object, and the environment. By distilling the task down to the contact configuration each step must start from, the demonstration can be carried over to a new scene.

Technical Explanation

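The core of the method is a "contact retargeting" framework. The authors observe that most extrinsic manipulation tasks are combinations of short-horizon primitives, each of which depends strongly on starting from a desirable contact configuration. Given a test scene and a single demonstration prescribing the primitive sequence, the method composes the state constraints implied by each primitive's contact specification, enforces them on the test scene, and finds intermediate goal states using inverse kinematics. The primitive policies then track those goals.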
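As a rough illustration of what retargeting contact requirements can look like, the sketch below works in a 2D side view. The paper's abstract describes composing state constraints from each primitive's contact specification and enforcing them on a test scene; everything else here is an assumption made for illustration, including the `Box`, `Scene`, `Primitive`, and `State` types, the "push" and "pivot" primitives, and the "table" and "wall" contact names. It is not the authors' implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Hypothetical box-shaped object, seen from the side (metres)."""
    width: float   # extent along x when lying flat
    height: float  # extent along z when lying flat


@dataclass(frozen=True)
class Scene:
    """Hypothetical test scene: a table surface and a wall face."""
    table_z: float  # height of the table surface
    wall_x: float   # x of the wall face; the object stays at x < wall_x


@dataclass(frozen=True)
class Primitive:
    """Illustrative primitive: the contacts its goal state must satisfy."""
    name: str
    contacts: frozenset    # environment features the object must touch
    rotates: bool = False  # True if the primitive tips the box by 90 degrees


@dataclass(frozen=True)
class State:
    """Position of the box centre and whether it stands on its short side."""
    x: float
    z: float
    upright: bool


def retarget(prim: Primitive, box: Box, scene: Scene, state: State) -> State:
    """Goal state for one primitive: enforce its contacts on the test scene."""
    upright = state.upright != prim.rotates
    # Extents along x and z depend on the box orientation.
    ex, ez = (box.height, box.width) if upright else (box.width, box.height)
    x, z = state.x, state.z
    if "table" in prim.contacts:
        z = scene.table_z + ez / 2  # bottom face rests on the table
    if "wall" in prim.contacts:
        x = scene.wall_x - ex / 2   # side face touches the wall
    return State(x, z, upright)


def retarget_demo(demo, box: Box, scene: Scene, start: State) -> list:
    """Chain the primitives: each goal is the next primitive's start state."""
    goals, state = [], start
    for prim in demo:
        state = retarget(prim, box, scene, state)
        goals.append(state)
    return goals


# A single demonstration only prescribes the primitive sequence.
DEMO = (
    Primitive("push", frozenset({"table", "wall"})),
    Primitive("pivot", frozenset({"table", "wall"}), rotates=True),
)

if __name__ == "__main__":
    # The same demo yields different goals for different boxes and scenes.
    for box, scene in [(Box(0.2, 0.1), Scene(0.0, 1.0)),
                       (Box(0.3, 0.05), Scene(0.4, 0.7))]:
        print(retarget_demo(DEMO, box, scene, State(0.3, scene.table_z, False)))
```

The point of the design is that the demonstration stores only the primitive sequence and its contact requirements, so the goal poses are recomputed from the geometry of whatever scene is presented at test time.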

Key innovations include:

A single library of robust, short-horizon, goal-conditioned primitive policies
A framework that composes state constraints stemming from the contact specifications of each primitive
Intermediate goal states that are found with inverse kinematics on the test scene and then tracked by the primitive policies
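The paper's abstract mentions finding intermediate goal states using inverse kinematics. As a minimal sketch of that idea, the block below solves closed-form inverse kinematics for a planar two-link arm reaching a target contact point. The two-link arm is a stand-in chosen for brevity (the paper uses a 7+1 DoF arm-gripper system), and the function names `two_link_ik` and `forward`, the link lengths, and the `flip_elbow` parameter are all assumptions for illustration.

```python
import math


def two_link_ik(x, y, l1, l2, flip_elbow=False, tol=1e-9):
    """Joint angles (q1, q2) placing the tip of a planar two-link arm at (x, y).

    Returns None when the target is out of reach. flip_elbow selects the
    second of the two mirror-image solutions.
    """
    # Law of cosines gives the cosine of the elbow angle.
    c2 = (x * x + y * y - l1 * l1 - l2 * l2) / (2.0 * l1 * l2)
    if abs(c2) > 1.0 + tol:
        return None  # target lies outside the reachable annulus
    c2 = max(-1.0, min(1.0, c2))  # clamp round-off at the workspace boundary
    s2 = math.sqrt(1.0 - c2 * c2)
    if flip_elbow:
        s2 = -s2
    q2 = math.atan2(s2, c2)
    # Shoulder angle: direction to the target minus the offset due to link 2.
    q1 = math.atan2(y, x) - math.atan2(l2 * s2, l1 + l2 * c2)
    return q1, q2


def forward(q1, q2, l1, l2):
    """Forward kinematics, used here to check an IK solution."""
    return (l1 * math.cos(q1) + l2 * math.cos(q1 + q2),
            l1 * math.sin(q1) + l2 * math.sin(q1 + q2))


if __name__ == "__main__":
    solution = two_link_ik(0.5, 0.3, l1=0.4, l2=0.3)
    print(solution, forward(*solution, 0.4, 0.3))
    print(two_link_ik(2.0, 0.0, l1=0.4, l2=0.3))  # out of reach
```

Returning `None` for an unreachable target makes infeasibility explicit, which is one simple way a caller could detect that a retargeted goal cannot be realized in a given scene.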

The paper evaluates the approach on hardware using a 7+1 DoF robotic arm-gripper system, reporting an overall success rate of 80.5% over 4 long-horizon extrinsic manipulation tasks, each with up to 4 primitives. The experiments cover 10 objects and 6 environment configurations. The authors also report that the method admits a wide range of demonstrations, and that contact retargeting is key to successfully combining primitives.

Critical Analysis

One limitation of this work is that it depends on a prepared library of primitive policies, so a task can only be expressed as a sequence of primitives from that library. Each primitive also needs a contact specification, and writing such specifications may be challenging for complex real-world objects and environments.

Additionally, it is unclear from the abstract how the method behaves when the contact constraints cannot be satisfied in the test scene, for instance when inverse kinematics finds no feasible intermediate goal state. In such cases, the demonstrated primitive sequence may not be directly applicable, and further adaptation or learning may be required.

Nonetheless, this contact-based approach represents an important step towards more flexible and generalizable robot manipulation skills. By focusing on the essential task-relevant interactions, rather than just the robot's motion, it opens up new possibilities for carrying a single demonstration over to different objects and environments.

Conclusion

This paper introduces a method for one-shot transfer of long-horizon extrinsic manipulation skills to diverse objects and environments. By retargeting the contact requirements of a single demonstration and tracking the resulting goals with a library of primitive policies, the method reuses a high-level manipulation strategy in a new scene without a new demonstration for each one.

While the approach has some limitations, it is a useful step for robot manipulation. By letting a single demonstration carry over to new objects and environments, it could help accelerate the deployment of capable robotic systems in real-world applications. Techniques like this may prove important for robotic manipulation in diverse, unstructured environments.

Related Papers

One-Shot Transfer of Long-Horizon Extrinsic Manipulation Through Contact Retargeting

Albert Wu, Ruocheng Wang, Sirui Chen, Clemens Eppner, C. Karen Liu

Extrinsic manipulation, the use of environment contacts to achieve manipulation objectives, enables strategies that are otherwise impossible with a parallel jaw gripper. However, orchestrating a long-horizon sequence of contact interactions between the robot, object, and environment is notoriously challenging due to the scene diversity, large action space, and difficult contact dynamics. We observe that most extrinsic manipulations are combinations of short-horizon primitives, each of which depends strongly on initializing from a desirable contact configuration to succeed. Therefore, we propose to generalize one extrinsic manipulation trajectory to diverse objects and environments by retargeting contact requirements. We prepare a single library of robust short-horizon, goal-conditioned primitive policies, and design a framework to compose state constraints stemming from contact specifications of each primitive. Given a test scene and a single demo prescribing the primitive sequence, our method enforces the state constraints on the test scene and finds intermediate goal states using inverse kinematics. The goals are then tracked by the primitive policies. Using a 7+1 DoF robotic arm-gripper system, we achieved an overall success rate of 80.5% on hardware over 4 long-horizon extrinsic manipulation tasks, each with up to 4 primitives. Our experiments cover 10 objects and 6 environment configurations. We further show empirically that our method admits a wide range of demonstrations, and that contact retargeting is indeed the key to successfully combining primitives for long-horizon extrinsic manipulation. Code and additional details are available at stanford-tml.github.io/extrinsic-manipulation.

4/12/2024

Learning Extrinsic Dexterity with Parameterized Manipulation Primitives

Shih-Min Yang, Martin Magnusson, Johannes A. Stork, Todor Stoyanov

Many practically relevant robot grasping problems feature a target object for which all grasps are occluded, e.g., by the environment. Single-shot grasp planning invariably fails in such scenarios. Instead, it is necessary to first manipulate the object into a configuration that affords a grasp. We solve this problem by learning a sequence of actions that utilize the environment to change the object's pose. Concretely, we employ hierarchical reinforcement learning to combine a sequence of learned parameterized manipulation primitives. By learning the low-level manipulation policies, our approach can control the object's state through exploiting interactions between the object, the gripper, and the environment. Designing such a complex behavior analytically would be infeasible under uncontrolled conditions, as an analytic approach requires accurate physical modeling of the interaction and contact dynamics. In contrast, we learn a hierarchical policy model that operates directly on depth perception data, without the need for object detection, pose estimation, or manual design of controllers. We evaluate our approach on picking box-shaped objects of various weight, shape, and friction properties from a constrained table-top workspace. Our method transfers to a real robot and is able to successfully complete the object picking task in 98% of experimental trials. Supplementary information and videos can be found at https://shihminyang.github.io/ED-PMP/.

5/10/2024

Tactile-Driven Non-Prehensile Object Manipulation via Extrinsic Contact Mode Control

Miquel Oller, Dmitry Berenson, Nima Fazeli

In this paper, we consider the problem of non-prehensile manipulation using grasped objects. This problem is a superset of many common manipulation skills including instances of tool-use (e.g., grasped spatula flipping a burger) and assembly (e.g., screwdriver tightening a screw). Here, we present an algorithmic approach for non-prehensile manipulation leveraging a gripper with highly compliant and high-resolution tactile sensors. Our approach solves for robot actions that drive object poses and forces to desired values while obeying the complex dynamics induced by the sensors as well as the constraints imposed by static equilibrium, object kinematics, and frictional contact. Our method is able to produce a variety of manipulation skills and is amenable to gradient-based optimization by exploiting differentiability within contact modes (e.g., specifications of sticking or sliding contacts). We evaluate 4 variants of controllers that attempt to realize these plans and demonstrate a number of complex skills including non-prehensile planar sliding and pivoting on a variety of object geometries. The perception and controls capabilities that drive these skills are the building blocks towards dexterous and reactive autonomy in unstructured environments.

5/29/2024

APriCoT: Action Primitives based on Contact-state Transition for In-Hand Tool Manipulation

Daichi Saito, Atsushi Kanehira, Kazuhiro Sasabuchi, Naoki Wake, Jun Takamatsu, Hideki Koike, Katsushi Ikeuchi

In-hand tool manipulation is an operation that not only manipulates a tool within the hand (i.e., in-hand manipulation) but also achieves a grasp suitable for a task after the manipulation. This study aims to achieve an in-hand tool manipulation skill through deep reinforcement learning. The difficulty of learning the skill arises because this manipulation requires (A) exploring long-term contact-state changes to achieve the desired grasp and (B) highly-varied motions depending on the contact-state transition. (A) leads to a sparsity of a reward on a successful grasp, and (B) requires an RL agent to explore widely within the state-action space to learn highly-varied actions, leading to sample inefficiency. To address these issues, this study proposes Action Primitives based on Contact-state Transition (APriCoT). APriCoT decomposes the manipulation into short-term action primitives by describing the operation as a contact-state transition based on three action representations (detach, crossover, attach). In each action primitive, fingers are required to perform short-term and similar actions. By training a policy for each primitive, we can mitigate the issues from (A) and (B). This study focuses on a fundamental operation as an example of in-hand tool manipulation: rotating an elongated object grasped with a precision grasp by half a turn to achieve the initial grasp. Experimental results demonstrated that our method succeeded in both the rotation and the achievement of the desired grasp, unlike existing studies. Additionally, it was found that the policy was robust to changes in object shape.

7/17/2024