Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches and a Head-Mounted Camera

Read original: arXiv:2401.00847 - Published 5/7/2024 by Jiye Lee, Hanbyul Joo

Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches and a Head-Mounted Camera

Overview

This paper proposes a lightweight motion capture system that uses smartwatches and a head-mounted camera to capture full-body human motion.
The system is designed to be accessible and affordable, allowing for motion capture in a wide range of settings, from daily life to professional applications.
The authors demonstrate the system's performance through various experiments and show its ability to accurately capture complex human movements.

Plain English Explanation

The paper introduces a new motion capture system that uses smartwatches and a head-mounted camera to track the movements of a person's entire body. Motion capture is a technology that records the movements of a person or object, which can be used for a variety of applications, such as creating realistic animations, analyzing athletic performance, or studying human behavior.

Traditional motion capture systems often require specialized equipment, such as multiple cameras or sensors placed around a studio. This can be expensive and limit where the technology can be used. The new system proposed in this paper is designed to be more accessible and affordable, using commonly available smartwatches and a single head-mounted camera.

The authors demonstrate that their system can accurately capture complex human movements, such as dancing or exercising, in a variety of settings, from a person's home to a public space. This opens up new possibilities for using motion capture in everyday life, as well as in professional applications that require tracking human movement [like sports training or virtual reality.

Technical Explanation

The researchers developed a motion capture system that uses a head-mounted camera and smartwatches worn on each wrist to track the full-body movements of a person. The head-mounted camera captures the positions of the user's hands and body, while the smartwatches provide additional data on the movements of the user's arms and torso.

The system uses a combination of computer vision and sensor fusion techniques to integrate the data from the camera and smartwatches and reconstruct a 3D model of the user's body in motion. The authors describe the algorithms and optimization methods they used to achieve accurate and robust motion capture, even in challenging conditions such as occlusions or rapid movements.

Through a series of experiments, the researchers demonstrate the system's ability to capture a wide range of human movements, including complex activities like dance and exercise. They compare the accuracy of their system to that of a professional motion capture setup, showing that their lightweight approach can achieve comparable performance at a fraction of the cost and setup complexity.

Critical Analysis

The paper presents a promising approach to making motion capture technology more accessible and widely available. The authors acknowledge that their system has some limitations, such as the need for the user to wear specific devices and the potential for occlusions or interference from the environment.

One concern that is not addressed in the paper is the privacy implications of using a head-mounted camera to capture someone's movements. While the authors note that the system could be used in a wide range of settings, they do not discuss any safeguards or considerations around the ethical use of this technology, particularly in public spaces or without the full consent of all participants.

Additionally, the paper does not explore the potential for bias or errors in the motion capture data, which could be a concern when using the system for applications like sports training or medical assessments. Further research may be needed to understand the robustness and reliability of the system under diverse conditions and with different types of users.

Overall, the proposed motion capture system is an interesting and promising development, but the authors should consider the broader implications and potential risks of such technology as they continue to refine and expand its capabilities.

Conclusion

The paper introduces a novel motion capture system that uses smartwatches and a head-mounted camera to track full-body human movements in a lightweight and accessible way. The authors demonstrate the system's ability to accurately capture a wide range of complex movements, opening up new possibilities for using motion capture technology in a variety of settings, from daily life to professional applications.

While the system shows promise, the authors should also consider the potential privacy and ethical concerns, as well as the need for further research to ensure the robustness and reliability of the technology. As motion capture becomes more widely accessible, it will be important to address these issues to ensure that the technology is used responsibly and in a way that benefits society as a whole.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches and a Head-Mounted Camera

Jiye Lee, Hanbyul Joo

We present a lightweight and affordable motion capture method based on two smartwatches and a head-mounted camera. In contrast to the existing approaches that use six or more expert-level IMU devices, our approach is much more cost-effective and convenient. Our method can make wearable motion capture accessible to everyone everywhere, enabling 3D full-body motion capture in diverse environments. As a key idea to overcome the extreme sparsity and ambiguities of sensor inputs with different modalities, we integrate 6D head poses obtained from the head-mounted cameras for motion estimation. To enable capture in expansive indoor and outdoor scenes, we propose an algorithm to track and update floor level changes to define head poses, coupled with a multi-stage Transformer-based regression module. We also introduce novel strategies leveraging visual cues of egocentric images to further enhance the motion capture quality while reducing ambiguities. We demonstrate the performance of our method on various challenging scenarios, including complex outdoor environments and everyday motions including object interactions and social interactions among multiple individuals.

5/7/2024

Ubiquitous Robot Control Through Multimodal Motion Capture Using Smartwatch and Smartphone Data

Fabian C Weigend, Neelesh Kumar, Oya Aran, Heni Ben Amor

We present an open-source library for seamless robot control through motion capture using smartphones and smartwatches. Our library features three modes: Watch Only Mode, enabling control with a single smartwatch; Upper Arm Mode, offering heightened accuracy by incorporating the smartphone attached to the upper arm; and Pocket Mode, determining body orientation via the smartphone placed in any pocket. These modes are applied in two real-robot tasks, showcasing placement accuracy within 2 cm compared to a gold-standard motion capture system. WearMoCap stands as a suitable alternative to conventional motion capture systems, particularly in environments where ubiquity is essential. The library is available at: www.github.com/wearable-motion-capture.

6/4/2024

Motion Capture from Inertial and Vision Sensors

Xiaodong Chen, Wu Liu, Qian Bao, Xinchen Liu, Quanwei Yang, Ruoli Dai, Tao Mei

Human motion capture is the foundation for many computer vision and graphics tasks. While industrial motion capture systems with complex camera arrays or expensive wearable sensors have been widely adopted in movie and game production, consumer-affordable and easy-to-use solutions for personal applications are still far from mature. To utilize a mixture of a monocular camera and very few inertial measurement units (IMUs) for accurate multi-modal human motion capture in daily life, we contribute MINIONS in this paper, a large-scale Motion capture dataset collected from INertial and visION Sensors. MINIONS has several featured properties: 1) large scale of over five million frames and 400 minutes duration; 2) multi-modality data of IMUs signals and RGB videos labeled with joint positions, joint rotations, SMPL parameters, etc.; 3) a diverse set of 146 fine-grained single and interactive actions with textual descriptions. With the proposed MINIONS, we conduct experiments on multi-modal motion capture and explore the possibilities of consumer-affordable motion capture using a monocular camera and very few IMUs. The experiment results emphasize the unique advantages of inertial and vision sensors, showcasing the promise of consumer-affordable multi-modal motion capture and providing a valuable resource for further research and development.

7/24/2024

Real-Time Simulated Avatar from Head-Mounted Sensors

Zhengyi Luo, Jinkun Cao, Rawal Khirodkar, Alexander Winkler, Jing Huang, Kris Kitani, Weipeng Xu

We present SimXR, a method for controlling a simulated avatar from information (headset pose and cameras) obtained from AR / VR headsets. Due to the challenging viewpoint of head-mounted cameras, the human body is often clipped out of view, making traditional image-based egocentric pose estimation challenging. On the other hand, headset poses provide valuable information about overall body motion, but lack fine-grained details about the hands and feet. To synergize headset poses with cameras, we control a humanoid to track headset movement while analyzing input images to decide body movement. When body parts are seen, the movements of hands and feet will be guided by the images; when unseen, the laws of physics guide the controller to generate plausible motion. We design an end-to-end method that does not rely on any intermediate representations and learns to directly map from images and headset poses to humanoid control signals. To train our method, we also propose a large-scale synthetic dataset created using camera configurations compatible with a commercially available VR headset (Quest 2) and show promising results on real-world captures. To demonstrate the applicability of our framework, we also test it on an AR headset with a forward-facing camera.

4/26/2024