Learning Visuotactile Skills with Two Multifingered Hands

Read original: arXiv:2404.16823 - Published 5/24/2024 by Toru Lin, Yu Zhang, Qiyang Li, Haozhi Qi, Brent Yi, Sergey Levine, Jitendra Malik

Overview

  • Researchers aim to replicate human-like dexterity, perception, and motion in a bimanual robotic system with multifingered hands and touch sensors.
  • Two key challenges are the lack of affordable teleoperation systems for dual-arm setups and the scarcity of multifingered hands with touch sensors.
  • To address these challenges, the researchers developed HATO, a low-cost teleoperation system, and adapted prosthetic hands with touch sensors for their experiments.
  • Using the data collected from their system, the researchers learned skills to complete complex manipulation tasks that require dexterity and touch feedback.

Plain English Explanation

The researchers wanted to create a robotic system that could move and interact with the world in a way similar to how humans do. Specifically, they were interested in replicating human-like dexterity, or the ability to use our fingers and hands with precision, as well as the perceptual experiences we have through touch and sight.

To achieve this, the researchers built a robotic system with two arms and multifingered hands - just like humans have. However, they faced two significant challenges. First, they didn't have an affordable and accessible way to control the robotic system remotely, which is important for collecting data and training the system. Second, they didn't have access to robotic hands that were equipped with touch sensors, which would allow the system to feel the objects it's interacting with, just like we can.

To overcome the first challenge, the researchers developed a low-cost teleoperation system called HATO, built from off-the-shelf electronics, that lets a human operator control both arms and hands remotely. For the second challenge, they adapted prosthetic hands that already include touch sensors and used those in their experiments.

With this setup, the researchers were able to collect data on how the robotic system sees and feels the world around it. They then used this data to train the system to perform long, multi-step manipulation tasks that demand high precision and are difficult to complete without dexterous fingers and touch feedback. Their results show promising progress in developing bimanual, multifingered robotic systems that can interact with the world in a more human-like way.

Technical Explanation

The researchers developed a low-cost teleoperation system called HATO that allows them to control a bimanual robotic system with multifingered hands remotely. HATO leverages off-the-shelf electronics and includes a comprehensive software suite for data collection, multimodal data processing, policy learning, and policy deployment.

To address the lack of multifingered hands with touch sensors, the researchers repurposed two prosthetic hands equipped with touch sensors and integrated them into their robotic system. This novel hardware adaptation enabled the collection of visuotactile data, which the researchers then used to train policies for completing long-horizon, high-precision manipulation tasks.
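As a rough illustration of what "visuotactile data" from teleoperated demonstrations could look like, the sketch below pairs each recorded observation with the operator's command at the same time step, which is the usual starting point for imitation learning. The `Frame` layout, the field names, and `to_training_pairs` are assumptions made for this example; this summary does not describe the actual HATO data format or policy architecture.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Frame:
    """One time step of a teleoperated demonstration (hypothetical layout)."""
    rgb: List[float]     # placeholder for camera features (a real system stores images)
    touch: List[float]   # touch-sensor readings from the two hands
    joints: List[float]  # arm and hand joint positions
    action: List[float]  # operator command recorded at this step


def to_training_pairs(episode: List[Frame]) -> List[Tuple[List[float], List[float]]]:
    """Turn one episode into (observation, action) pairs for imitation learning."""
    pairs = []
    for frame in episode:
        # Concatenate vision, touch, and joint readings into one observation vector.
        observation = frame.rgb + frame.touch + frame.joints
        pairs.append((observation, frame.action))
    return pairs


# Two made-up time steps standing in for a recorded demonstration.
episode = [
    Frame(rgb=[0.1, 0.2], touch=[0.0, 0.5, 0.0], joints=[0.3], action=[1.0, -1.0]),
    Frame(rgb=[0.2, 0.2], touch=[0.4, 0.5, 0.1], joints=[0.4], action=[0.5, 0.0]),
]
pairs = to_training_pairs(episode)
print(len(pairs), len(pairs[0][0]))
```

A learned policy would then be fit to map each observation vector to its paired action; real pipelines typically encode images with a neural network rather than flattening them as done here.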

The researchers empirically investigated the effects of dataset size, sensing modality, and visual input preprocessing on the policy learning process. Their results demonstrate the importance of touch feedback and suggest that leveraging pretrained representations can improve few-shot imitation learning performance.
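An ablation over dataset size and sensing modality can be organized as a grid of training configurations, with each configuration seeing only a subset of the sensor streams. The sketch below shows one way to set that up; the demonstration counts, the modality names, and the helper functions are hypothetical and are not taken from the paper.

```python
from itertools import product
from typing import Dict, List, Sequence

Observation = Dict[str, List[float]]

# Hypothetical ablation axes; the real values are not given in this summary.
DATASET_SIZES = [10, 50, 100]
MODALITY_SETS = [("vision",), ("vision", "touch")]


def select_modalities(obs: Observation, keep: Sequence[str]) -> List[float]:
    """Concatenate only the requested modalities, in sorted key order."""
    missing = [name for name in keep if name not in obs]
    if missing:
        raise KeyError(f"unknown modalities: {missing}")
    return [value for name in sorted(keep) for value in obs[name]]


def ablation_grid() -> List[dict]:
    """Enumerate every (dataset size, modality set) combination."""
    return [
        {"num_demos": n, "modalities": m}
        for n, m in product(DATASET_SIZES, MODALITY_SETS)
    ]


obs = {"vision": [0.1, 0.2], "touch": [0.5], "joints": [0.3]}
grid = ablation_grid()
print(len(grid), select_modalities(obs, ("vision", "touch")))
```

Each entry in `grid` would correspond to one training run, so that success rates can be compared across configurations that differ in only one factor.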

Related work has explored physics-aware iterative learning to predict saliency maps, which can help a robotic system focus on the most relevant parts of the environment during manipulation, as well as dynamic grasping of unknown objects using multifingered hands, a challenging task that requires dexterity and touch sensing.

Critical Analysis

The researchers have made significant progress in developing a bimanual robotic system with multifingered hands and touch sensors. However, their approach is still limited in several ways:

  1. The teleoperation system, while low-cost, may not be as intuitive or responsive as more advanced commercial systems, which could impact the quality of the training data.

  2. The repurposed prosthetic hands, while a clever solution, may not have the same capabilities as purpose-built robotic hands, which could limit the system's overall dexterity and manipulation abilities.

  3. The researchers focused on long-horizon, high-precision tasks, but there may be other important manipulation skills, such as rapid object grasping in cluttered environments, that their system has not yet demonstrated.

  4. The evaluation of the system's performance was primarily based on task success rates, but more detailed analysis of the quality and efficiency of the manipulation skills would provide a more comprehensive understanding of the system's capabilities.

Overall, the researchers' work represents an important step forward in the development of bimanual, multifingered robotic systems, but there is still room for improvement and further exploration of the challenges involved in replicating human-like dexterity and perception.

Conclusion

This work shows that a bimanual system with multifingered hands and touch sensing can be assembled from a low-cost teleoperation setup and repurposed prosthetic hands, a step towards replicating human-like dexterity, perception, and motion. By addressing the challenges of affordable teleoperation and the scarcity of multifingered hands with touch sensors, the researchers were able to collect visuotactile data and train policies for long-horizon, high-precision manipulation tasks.

The insights gained from this research, such as the importance of touch feedback and the potential benefits of leveraging pretrained representations, could have important implications for the broader field of robotic manipulation and interaction. As the researchers continue to refine and expand their system, it could lead to more advanced and versatile robotic platforms capable of assisting humans in a wide range of tasks, from industrial applications to assistive technologies.

Related Papers

Learning Visuotactile Skills with Two Multifingered Hands

Toru Lin, Yu Zhang, Qiyang Li, Haozhi Qi, Brent Yi, Sergey Levine, Jitendra Malik

Aiming to replicate human-like dexterity, perceptual experiences, and motion patterns, we explore learning from human demonstrations using a bimanual system with multifingered hands and visuotactile data. Two significant challenges exist: the lack of an affordable and accessible teleoperation system suitable for a dual-arm setup with multifingered hands, and the scarcity of multifingered hand hardware equipped with touch sensing. To tackle the first challenge, we develop HATO, a low-cost hands-arms teleoperation system that leverages off-the-shelf electronics, complemented with a software suite that enables efficient data collection; the comprehensive software suite also supports multimodal data processing, scalable policy learning, and smooth policy deployment. To tackle the latter challenge, we introduce a novel hardware adaptation by repurposing two prosthetic hands equipped with touch sensors for research. Using visuotactile data collected from our system, we learn skills to complete long-horizon, high-precision tasks which are difficult to achieve without multifingered dexterity and touch feedback. Furthermore, we empirically investigate the effects of dataset size, sensing modality, and visual input preprocessing on policy learning. Our results mark a promising step forward in bimanual multifingered manipulation from visuotactile data. Videos, code, and datasets can be found at https://toruowo.github.io/hato/ .

5/24/2024

MimicTouch: Leveraging Multi-modal Human Tactile Demonstrations for Contact-rich Manipulation

Kelin Yu, Yunhai Han, Qixian Wang, Vaibhav Saxena, Danfei Xu, Ye Zhao

Tactile sensing is critical to fine-grained, contact-rich manipulation tasks, such as insertion and assembly. Prior research has shown the possibility of learning tactile-guided policy from teleoperated demonstration data. However, to provide the demonstration, human users often rely on visual feedback to control the robot. This creates a gap between the sensing modality used for controlling the robot (visual) and the modality of interest (tactile). To bridge this gap, we introduce MimicTouch, a novel framework for learning policies directly from demonstrations provided by human users with their hands. The key innovations are i) a human tactile data collection system which collects multi-modal tactile dataset for learning human's tactile-guided control strategy, ii) an imitation learning-based framework for learning human's tactile-guided control strategy through such data, and iii) an online residual RL framework to bridge the embodiment gap between the human hand and the robot gripper. Through comprehensive experiments, we highlight the efficacy of utilizing human's tactile-guided control strategy to resolve contact-rich manipulation tasks. The project website is at https://sites.google.com/view/MimicTouch.

9/6/2024

Robot Synesthesia: In-Hand Manipulation with Visuotactile Sensing

Ying Yuan, Haichuan Che, Yuzhe Qin, Binghao Huang, Zhao-Heng Yin, Kang-Won Lee, Yi Wu, Soo-Chul Lim, Xiaolong Wang

Executing contact-rich manipulation tasks necessitates the fusion of tactile and visual feedback. However, the distinct nature of these modalities poses significant challenges. In this paper, we introduce a system that leverages visual and tactile sensory inputs to enable dexterous in-hand manipulation. Specifically, we propose Robot Synesthesia, a novel point cloud-based tactile representation inspired by human tactile-visual synesthesia. This approach allows for the simultaneous and seamless integration of both sensory inputs, offering richer spatial information and facilitating better reasoning about robot actions. The method, trained in a simulated environment and then deployed to a real robot, is applicable to various in-hand object rotation tasks. Comprehensive ablations are performed on how the integration of vision and touch can improve reinforcement learning and Sim2Real performance. Our project page is available at https://yingyuan0414.github.io/visuotactile/ .

8/1/2024

Integrating Visuo-tactile Sensing with Haptic Feedback for Teleoperated Robot Manipulation

Noah Becker, Erik Gattung, Kay Hansel, Tim Schneider, Yaonan Zhu, Yasuhisa Hasegawa, Jan Peters

Telerobotics enables humans to overcome spatial constraints and allows them to physically interact with the environment in remote locations. However, the sensory feedback provided by the system to the operator is often purely visual, limiting the operator's dexterity in manipulation tasks. In this work, we address this issue by equipping the robot's end-effector with high-resolution visuotactile GelSight sensors. Using low-cost MANUS-Gloves, we provide the operator with haptic feedback about forces acting at the points of contact in the form of vibration signals. We propose two different methods for estimating these forces; one based on estimating the movement of markers on the sensor surface and one deep-learning approach. Additionally, we integrate our system into a virtual-reality teleoperation pipeline in which a human operator controls both arms of a Tiago robot while receiving visual and haptic feedback. We believe that integrating haptic feedback is a crucial step for dexterous manipulation in teleoperated robotic systems.

5/1/2024