UIVNAV: Underwater Information-driven Vision-based Navigation via Imitation Learning

2309.08806

Published 4/17/2024 by Xiaomin Lin, Nare Karapetyan, Kaustubh Joshi, Tianchen Liu, Nikhil Chopra, Miao Yu, Pratap Tokekar, Yiannis Aloimonos

cs.RO

🛸

Abstract

Autonomous navigation in the underwater environment is challenging due to limited visibility, dynamic changes, and the lack of a cost-efficient accurate localization system. We introduce UIVNav, a novel end-to-end underwater navigation solution designed to drive robots over Objects of Interest (OOI) while avoiding obstacles, without relying on localization. UIVNav uses imitation learning and is inspired by the navigation strategies used by human divers who do not rely on localization. UIVNav consists of the following phases: (1) generating an intermediate representation (IR), and (2) training the navigation policy based on human-labeled IR. By training the navigation policy on IR instead of raw data, the second phase is domain-invariant -- the navigation policy does not need to be retrained if the domain or the OOI changes. We show this by deploying the same navigation policy for surveying two different OOIs, oyster and rock reefs, in two different domains, simulation, and a real pool. We compared our method with complete coverage and random walk methods which showed that our method is more efficient in gathering information for OOIs while also avoiding obstacles. The results show that UIVNav chooses to visit the areas with larger area sizes of oysters or rocks with no prior information about the environment or localization. Moreover, a robot using UIVNav compared to complete coverage method surveys on average 36% more oysters when traveling the same distances. We also demonstrate the feasibility of real-time deployment of UIVNavin pool experiments with BlueROV underwater robot for surveying a bed of oyster shells.

Create account to get full access

Overview

Autonomous navigation in underwater environments is challenging due to limited visibility, dynamic changes, and lack of accurate localization systems.
The paper introduces UIVNav, a novel end-to-end underwater navigation solution that enables robots to navigate over Objects of Interest (OOI) while avoiding obstacles, without relying on localization.
UIVNav uses imitation learning and is inspired by the navigation strategies used by human divers who do not rely on localization.
The solution consists of two phases: (1) generating an intermediate representation (IR), and (2) training the navigation policy based on human-labeled IR.

Plain English Explanation

Navigating underwater can be very difficult due to poor visibility, constantly changing environments, and the lack of reliable localization systems (like GPS on land). The researchers have developed a new approach called UIVNav that allows robots to navigate through underwater environments and find specific objects of interest (like oyster or rock reefs) without needing to know their exact location.

The key idea is to have the robot learn from how human divers navigate, who also don't rely on localization systems. The robot first creates an "intermediate representation" of the environment, which is a simplified view that captures the important information. It then uses this representation to learn a navigation policy - a set of rules for how to move around and find the objects of interest while avoiding obstacles.

This two-step approach is powerful because the navigation policy doesn't need to be retrained even if the environment changes or the target objects are different. The robot can just use the same navigation policy for different underwater domains and objects.

The researchers show that their UIVNav method is more efficient than other approaches like randomly exploring or trying to exhaustively cover the entire area. It is able to find more of the target objects while traveling the same distances. They also demonstrate that UIVNav can be deployed in real-time on an underwater robot to survey a bed of oyster shells.

Technical Explanation

UIVNav uses imitation learning and is inspired by the navigation strategies used by human divers who do not rely on localization. The solution consists of two phases:

Generating an Intermediate Representation (IR): The robot creates a simplified view of the environment that captures the important information, without needing to know its exact location.
Training the Navigation Policy: The robot learns a set of rules for how to move around and find the objects of interest while avoiding obstacles, based on the human-labeled IR.

By training the navigation policy on the IR instead of raw data, the second phase becomes "domain-invariant" - the navigation policy does not need to be retrained if the domain or the OOI changes. The researchers demonstrate this by deploying the same navigation policy for surveying two different OOIs (oyster and rock reefs) in two different domains (simulation and a real pool).

The results show that UIVNav is more efficient than complete coverage and random walk methods in gathering information about the OOIs while also avoiding obstacles. The robot using UIVNav surveys on average 36% more oysters when traveling the same distances as the complete coverage method. The researchers also demonstrate the real-time deployment of UIVNav on an underwater robot (BlueROV) for surveying a bed of oyster shells.

Critical Analysis

The paper presents a promising approach to autonomous underwater navigation, but there are a few important caveats and areas for further research:

Validation in more complex environments: The experiments were conducted in relatively simple and controlled environments (simulation and a pool). More research is needed to understand how well UIVNav would perform in real-world, dynamic underwater environments with stronger currents, more obstacles, and greater variability in the target objects.
Robustness to sensor noise and failures: The paper does not explicitly address how UIVNav would handle noisy or unreliable sensor data, which is a common challenge in underwater robotics. Evaluating the system's robustness to sensor failures and environmental disturbances would be an important next step.
Scalability to larger-scale surveys: The experiments focused on relatively small-scale surveys of oyster and rock reefs. Further work is needed to understand how well UIVNav would scale to larger-scale underwater mapping and exploration tasks.
Comparison to other navigation approaches: While the paper compares UIVNav to complete coverage and random walk methods, it would be valuable to benchmark it against other state-of-the-art underwater navigation techniques, such as those using scale-aware visual navigation, multi-view scanning, or fast traversability learning.
Integration with other underwater robotics components: To fully realize the potential of UIVNav, it would be important to explore how it can be integrated with other key underwater robotics capabilities, such as interactive and fast-adaptable routing and navigation.

Overall, the UIVNav approach is an interesting and promising step forward in autonomous underwater navigation, but further research is needed to fully understand its strengths, limitations, and potential for real-world applications.

Conclusion

The paper introduces UIVNav, a novel end-to-end underwater navigation solution that enables robots to navigate over Objects of Interest (OOI) while avoiding obstacles, without relying on localization. By using imitation learning and an intermediate representation of the environment, UIVNav is able to learn a navigation policy that is domain-invariant, meaning it can be applied to different underwater environments and target objects without retraining.

The key advantages of UIVNav are its efficiency in finding target objects while avoiding obstacles, and its ability to generalize to new domains. This suggests that the approach could be a valuable tool for a wide range of underwater exploration and mapping tasks, from surveying coral reefs to inspecting submerged infrastructure. Further research is needed to fully understand the system's capabilities and limitations, but the initial results are very promising.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

✅

Robust Perception and Navigation of Autonomous Surface Vehicles in Challenging Environments

Mingi Jeong

Research on coastal regions traditionally involves methods like manual sampling, monitoring buoys, and remote sensing, but these methods face challenges in spatially and temporally diverse regions of interest. Autonomous surface vehicles (ASVs) with artificial intelligence (AI) are being explored, and recognized by the International Maritime Organization (IMO) as vital for future ecosystem understanding. However, there is not yet a mature technology for autonomous environmental monitoring due to typically complex coastal situations: (1) many static (e.g., buoy, dock) and dynamic (e.g., boats) obstacles not compliant with the rules of the road (COLREGs); (2) uncharted or uncertain information (e.g., non-updated nautical chart); and (3) high-cost ASVs not accessible to the community and citizen science while resulting in technology illiteracy. To address the above challenges, my research involves both system and algorithmic development: (1) a robotic boat system for stable and reliable in-water monitoring, (2) maritime perception to detect and track obstacles (such as buoys, and boats), and (3) navigational decision-making with multiple-obstacle avoidance and multi-objective optimization.

5/29/2024

cs.RO

UINav: A Practical Approach to Train On-Device Automation Agents

Wei Li, Fu-Lin Hsu, Will Bishop, Folawiyo Campbell-Ajala, Max Lin, Oriana Riva

Automation systems that can autonomously drive application user interfaces to complete user tasks are of great benefit, especially when users are situationally or permanently impaired. Prior automation systems do not produce generalizable models while AI-based automation agents work reliably only in simple, hand-crafted applications or incur high computation costs. We propose UINav, a demonstration-based approach to train automation agents that fit mobile devices, yet achieving high success rates with modest numbers of demonstrations. To reduce the demonstration overhead, UINav uses a referee model that provides users with immediate feedback on tasks where the agent fails, and automatically augments human demonstrations to increase diversity in training data. Our evaluation shows that with only 10 demonstrations UINav can achieve 70% accuracy, and that with enough demonstrations it can surpass 90% accuracy.

7/1/2024

cs.HC cs.AI

Mesh-based Photorealistic and Real-time 3D Mapping for Robust Visual Perception of Autonomous Underwater Vehicle

Jungwoo Lee, Younggun Cho

This paper proposes a photorealistic real-time dense 3D mapping system that utilizes a learning-based image enhancement method and mesh-based map representation. Due to the characteristics of the underwater environment, where problems such as hazing and low contrast occur, it is hard to apply conventional simultaneous localization and mapping (SLAM) methods. Furthermore, for sensitive tasks like inspecting cracks, photorealistic mapping is very important. However, the behavior of Autonomous Underwater Vehicle (AUV) is computationally constrained. In this paper, we utilize a neural network-based image enhancement method to improve pose estimation and mapping quality and apply a sliding window-based mesh expansion method to enable lightweight, fast, and photorealistic mapping. To validate our results, we utilize real-world and indoor synthetic datasets. We performed qualitative validation with the real-world dataset and quantitative validation by modeling images from the indoor synthetic dataset as underwater scenes.

4/30/2024

cs.RO

Synergistic Reinforcement and Imitation Learning for Vision-driven Autonomous Flight of UAV Along River

Zihan Wang, Jianwen Li, Nina Mahmoudian

Vision-driven autonomous flight and obstacle avoidance of Unmanned Aerial Vehicles (UAVs) along complex riverine environments for tasks like rescue and surveillance requires a robust control policy, which is yet difficult to obtain due to the shortage of trainable riverine environment simulators. To easily verify the vision-based navigation controller performance for the river following task before real-world deployment, we developed a trainable photo-realistic dynamics-free riverine simulation environment using Unity. In this paper, we address the shortcomings that vanilla Reinforcement Learning (RL) algorithm encounters in learning a navigation policy within this partially observable, non-Markovian environment. We propose a synergistic approach that integrates RL and Imitation Learning (IL). Initially, an IL expert is trained on manually collected demonstrations, which then guides the RL policy training process. Concurrently, experiences generated by the RL agent are utilized to re-train the IL expert, enhancing its ability to generalize to unseen data. By leveraging the strengths of both RL and IL, this framework achieves a faster convergence rate and higher performance compared to pure RL, pure IL, and RL combined with static IL algorithms. The results validate the efficacy of the proposed method in terms of both task completion and efficiency. The code and trainable environments are available.

5/1/2024

cs.RO