RoboDuet: A Framework Affording Mobile-Manipulation and Cross-Embodiment

Read original: arXiv:2403.17367 - Published 5/14/2024 by Guoping Pan, Qingwei Ben, Zhecheng Yuan, Guangqi Jiang, Yandong Ji, Jiangmiao Pang, Houde Liu, Huazhe Xu

Overview

  • Introduces RoboDuet, a framework that combines the mobility of legged robots with the manipulation skills of robotic arms
  • Employs two collaborative policies that realize locomotion and manipulation simultaneously and achieve whole-body control by interacting with each other
  • May enable cross-embodiment deployment, such as using different quadrupedal robots or other arms, including zero-shot exchange of the legged robot

Plain English Explanation

The RoboDuet framework allows a robot to move around and manipulate objects in its environment. It pairs a legged robot with a robotic arm, so the system can walk through different spaces and interact with objects along the way.

One of the key features of RoboDuet is cross-embodiment deployment: the trained control policies are not tied to a single machine. According to the authors, they can be used with different quadrupedal robots or other arms, and the legged robot can be exchanged zero-shot, that is, without retraining. This flexibility lets the same approach run on more than one hardware setup.

The framework draws on prior work on learning and reusing robotic skills and on the simultaneous optimization of design and control, integrating these ideas into a versatile and adaptable robotic system.

Technical Explanation

RoboDuet targets legged mobile manipulation: a legged robot that carries an arm. The authors note that existing approaches are confined to imprecise six-degrees-of-freedom (DoF) manipulation and have a limited arm workspace. RoboDuet instead employs two collaborative policies that realize locomotion and manipulation simultaneously, and it achieves whole-body control through the interaction between them.

Beyond large-range pose tracking, the authors report a second, unexpected benefit: the two-policy framework may enable cross-embodiment deployment, such as using different quadrupedal robots or other arms, including zero-shot exchange of the legged robot.
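To make the two-policy idea concrete, the following is a minimal Python sketch, not the authors' implementation. The paper's abstract says only that two collaborative policies interact to achieve whole-body control; everything else here is an assumption made for illustration: the `DuetController` class, the split into a leg policy and an arm policy, the "posture hint" that the arm policy passes to the leg policy, the joint counts, and the dummy policies that stand in for trained networks.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Sequence, Tuple

# A policy maps an observation vector to an action vector.
Policy = Callable[[Sequence[float]], List[float]]


@dataclass(frozen=True)
class DuetController:
    """Hypothetical wrapper that steps a locomotion policy and an arm policy together."""
    loco_policy: Policy   # produces leg joint targets
    arm_policy: Policy    # produces arm joint targets followed by a posture hint
    n_arm_joints: int

    def step(self, leg_obs: Sequence[float], arm_obs: Sequence[float],
             ee_target: Sequence[float]) -> Tuple[List[float], List[float]]:
        # The arm policy sees the arm state and the commanded end-effector pose.
        arm_out = self.arm_policy(list(arm_obs) + list(ee_target))
        arm_action = arm_out[: self.n_arm_joints]
        # ASSUMPTION: the remaining outputs are a hint passed to the leg policy,
        # which is how the two policies "interact" in this sketch.
        posture_hint = arm_out[self.n_arm_joints:]
        leg_action = self.loco_policy(list(leg_obs) + posture_hint)
        return leg_action, arm_action


def make_dummy_policy(out_dim: int, gain: float) -> Policy:
    """Stand-in for a trained network: every output is gain * mean(observation)."""
    def policy(obs: Sequence[float]) -> List[float]:
        mean = sum(obs) / len(obs) if obs else 0.0
        return [gain * mean] * out_dim
    return policy


# Hypothetical setup: a 6-joint arm (plus 2 hint values) on a 12-joint quadruped.
arm = make_dummy_policy(out_dim=6 + 2, gain=0.1)
robot_a = DuetController(make_dummy_policy(12, 0.5), arm, n_arm_joints=6)

# Cross-embodiment in this sketch: swap the leg policy, keep the arm policy as is.
robot_b = replace(robot_a, loco_policy=make_dummy_policy(12, 0.3))

leg_obs, arm_obs = [0.0] * 12, [0.0] * 6
ee_target = [0.4, 0.0, 0.5, 0.0, 0.0, 0.0]  # x, y, z, roll, pitch, yaw
leg_a, arm_a = robot_a.step(leg_obs, arm_obs, ee_target)
leg_b, arm_b = robot_b.step(leg_obs, arm_obs, ee_target)
```

In this sketch the arm commands are identical for both robots because only the leg policy was exchanged. That is the property a modular two-policy design makes possible in principle; whether and how well it holds on real hardware is what the paper's experiments evaluate.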

The experiments show that policies trained through RoboDuet achieve stable gaits and agile 6D end-effector pose tracking, support zero-shot exchange of legged robots, and can be deployed in the real world to perform various mobile manipulation tasks.
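The paper's abstract reports 6D end-effector pose tracking, meaning the arm's end-effector follows a target in both position and orientation. As a rough illustration of how such tracking can be scored, the sketch below computes a position error and an orientation error between a current and a target pose. This is a common generic metric, not necessarily the one the authors use; the function name `pose_error` and the unit-quaternion convention `(w, x, y, z)` are assumptions for this example.

```python
import math
from typing import Sequence, Tuple


def pose_error(pos: Sequence[float], quat: Sequence[float],
               target_pos: Sequence[float], target_quat: Sequence[float]
               ) -> Tuple[float, float]:
    """Return (position error in metres, orientation error in radians).

    Quaternions are assumed to be unit length and ordered (w, x, y, z).
    """
    # Straight-line distance between the two positions.
    pos_err = math.dist(pos, target_pos)
    # q and -q describe the same rotation, so use the absolute dot product.
    dot = abs(sum(a * b for a, b in zip(quat, target_quat)))
    # Clamp against rounding error before acos; the result is the smallest
    # rotation angle that takes one orientation to the other.
    ang_err = 2.0 * math.acos(min(1.0, dot))
    return pos_err, ang_err


# Example: the target is 0.5 m away and rotated 90 degrees about the z axis.
identity = (1.0, 0.0, 0.0, 0.0)
quarter_turn_z = (math.cos(math.pi / 4), 0.0, 0.0, math.sin(math.pi / 4))
pos_err, ang_err = pose_error((0.0, 0.0, 0.0), identity,
                              (0.3, 0.4, 0.0), quarter_turn_z)
```

Reporting the two errors separately avoids having to pick a weighting between metres and radians.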

Critical Analysis

The RoboDuet framework presents a promising approach to making robotic systems more versatile and adaptable. By coordinating a legged robot and a robotic arm under whole-body control, the system can tackle a range of mobile manipulation tasks, which expands the operational range of such robots.

However, the paper does not delve deeply into the specific challenges and limitations of cross-embodiment deployment. While the concept is intriguing, how well the policies transfer across different quadrupedal robots and arms in practice may require further investigation and experimentation.

Additionally, the paper could benefit from a more comprehensive analysis of the trade-offs and design considerations of the two-policy framework. Exploring how the interaction between the two policies influences the robot's performance and adaptability could provide valuable insights for future research.

Conclusion

The RoboDuet framework represents a step forward in the development of mobile manipulation systems. By combining legged robots with robotic arms and enabling cross-embodiment deployment, it lets trained policies carry over to different hardware and perform a variety of mobile manipulation tasks.

The insights and techniques presented in this paper may inspire further work on the locomotion and manipulation capabilities of robotic systems and on their versatility and adaptability in real-world scenarios.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

RoboDuet: A Framework Affording Mobile-Manipulation and Cross-Embodiment

Guoping Pan, Qingwei Ben, Zhecheng Yuan, Guangqi Jiang, Yandong Ji, Jiangmiao Pang, Houde Liu, Huazhe Xu

Combining the mobility of legged robots with the manipulation skills of arms has the potential to significantly expand the operational range and enhance the capabilities of robotic systems in performing various mobile manipulation tasks. Existing approaches are confined to imprecise six degrees of freedom (DoF) manipulation and possess a limited arm workspace. In this paper, we propose a novel framework, RoboDuet, which employs two collaborative policies to realize locomotion and manipulation simultaneously, achieving whole-body control through interactions between each other. Surprisingly, going beyond the large-range pose tracking, we find that the two-policy framework may enable cross-embodiment deployment such as using different quadrupedal robots or other arms. Our experiments demonstrate that the policies trained through RoboDuet can accomplish stable gaits, agile 6D end-effector pose tracking, and zero-shot exchange of legged robots, and can be deployed in the real world to perform various mobile manipulation tasks. Our project page with demo videos is at https://locomanip-duet.github.io .

5/14/2024

Wheeled Humanoid Bilateral Teleoperation with Position-Force Control Modes for Dynamic Loco-Manipulation

Amartya Purushottam, Jack Yan, Christopher Xu, Youngwoo Sim, Joao Ramos

Remote-controlled humanoid robots can revolutionize manufacturing, construction, and healthcare industries by performing complex or dangerous manual tasks traditionally done by humans. We refer to these behaviors as Dynamic Loco-Manipulation (DLM). To successfully complete these tasks, humans control the position of their bodies and contact forces at their hands. To enable similar whole-body control in humanoids, we introduce loco-manipulation retargeting strategies with switched position and force control modes in a bilateral teleoperation framework. Our proposed locomotion mappings use the pitch and yaw of the operator's torso to control robot position or acceleration. The manipulation retargeting maps the operator's arm movements to the robot's arms for joint-position or impedance control of the end-effector. A Human-Machine Interface captures the teleoperator's motion and provides haptic feedback to their torso, enhancing their awareness of the robot's interactions with the environment. In this paper, we demonstrate two forms of DLM. First, we show the robot slotting heavy boxes (5-10.5 kg), weighing up to 83% of the robot's weight, into desired positions. Second, we show human-robot collaboration for carrying an object, where the robot and teleoperator take on leader and follower roles.

7/18/2024

UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers

Huy Ha, Yihuai Gao, Zipeng Fu, Jie Tan, Shuran Song

We introduce UMI-on-Legs, a new framework that combines real-world and simulation data for quadruped manipulation systems. We scale task-centric data collection in the real world using a hand-held gripper (UMI), providing a cheap way to demonstrate task-relevant manipulation skills without a robot. Simultaneously, we scale robot-centric data in simulation by training whole-body controller for task-tracking without task simulation setups. The interface between these two policies is end-effector trajectories in the task frame, inferred by the manipulation policy and passed to the whole-body controller for tracking. We evaluate UMI-on-Legs on prehensile, non-prehensile, and dynamic manipulation tasks, and report over 70% success rate on all tasks. Lastly, we demonstrate the zero-shot cross-embodiment deployment of a pre-trained manipulation policy checkpoint from prior work, originally intended for a fixed-base robot arm, on our quadruped system. We believe this framework provides a scalable path towards learning expressive manipulation skills on dynamic robot embodiments. Please checkout our website for robot videos, code, and data: https://umi-on-legs.github.io

7/16/2024

Empowering Embodied Manipulation: A Bimanual-Mobile Robot Manipulation Dataset for Household Tasks

Tianle Zhang, Dongjiang Li, Yihang Li, Zecui Zeng, Lin Zhao, Lei Sun, Yue Chen, Xuelong Wei, Yibing Zhan, Lusong Li, Xiaodong He

The advancements in embodied AI are increasingly enabling robots to tackle complex real-world tasks, such as household manipulation. However, the deployment of robots in these environments remains constrained by the lack of comprehensive bimanual-mobile robot manipulation data that can be learned. Existing datasets predominantly focus on single-arm manipulation tasks, while the few dual-arm datasets available often lack mobility features, task diversity, comprehensive sensor data, and robust evaluation metrics; they fail to capture the intricate and dynamic nature of household manipulation tasks that bimanual-mobile robots are expected to perform. To overcome these limitations, we propose BRMData, a Bimanual-mobile Robot Manipulation Dataset specifically designed for household applications. BRMData encompasses 10 diverse household tasks, including single-arm and dual-arm tasks, as well as both tabletop and mobile manipulations, utilizing multi-view and depth-sensing data information. Moreover, BRMData features tasks of increasing difficulty, ranging from single-object to multi-object grasping, non-interactive to human-robot interactive scenarios, and rigid-object to flexible-object manipulation, closely simulating real-world household applications. Additionally, we introduce a novel Manipulation Efficiency Score (MES) metric to evaluate both the precision and efficiency of robot manipulation methods in household tasks. We thoroughly evaluate and analyze the performance of advanced robot manipulation learning methods using our BRMData, aiming to drive the development of bimanual-mobile robot manipulation technologies. The dataset is now open-sourced and available at https://embodiedrobot.github.io/.

6/7/2024