Learning of Sea Surface Height Interpolation from Multi-variate Simulated Satellite Observations

2310.07626

Published 5/7/2024 by Theo Archambault, Arthur Filoche, Anastase Charantonis, Dominique Bereziat, Sylvie Thiria

👁️

Abstract

Satellite-based remote sensing missions have revolutionized our understanding of the Ocean state and dynamics. Among them, space-borne altimetry provides valuable Sea Surface Height (SSH) measurements, used to estimate surface geostrophic currents. Due to the sensor technology employed, important gaps occur in SSH observations. Complete SSH maps are produced using linear Optimal Interpolations (OI) such as the widely-used Data Unification and Altimeter Combination System (DUACS). On the other hand, Sea Surface Temperature (SST) products have much higher data coverage and SST is physically linked to geostrophic currents through advection. We propose a new multi-variate Observing System Simulation Experiment (OSSE) emulating 20 years of SSH and SST satellite observations. We train an Attention-Based Encoder-Decoder deep learning network (textsc{abed}) on this data, comparing two settings: one with access to ground truth during training and one without. On our OSSE, we compare ABED reconstructions when trained using either supervised or unsupervised loss functions, with or without SST information. We evaluate the SSH interpolations in terms of eddy detection. We also introduce a new way to transfer the learning from simulation to observations: supervised pre-training on our OSSE followed by unsupervised fine-tuning on satellite data. Based on real SSH observations from the Ocean Data Challenge 2021, we find that this learning strategy, combined with the use of SST, decreases the root mean squared error by 24% compared to OI.

Create account to get full access

Overview

Satellite-based remote sensing has revolutionized our understanding of the ocean
Space-borne altimetry measures Sea Surface Height (SSH), which is used to estimate surface geostrophic currents
SSH observations have gaps due to sensor technology, so complete maps are produced using linear Optimal Interpolation (OI)
Sea Surface Temperature (SST) has higher data coverage and is linked to geostrophic currents through advection
The authors propose a new Observing System Simulation Experiment (OSSE) to emulate 20 years of SSH and SST satellite observations
They train an Attention-Based Encoder-Decoder (abed) deep learning network on this data, comparing settings with and without ground truth during training
The authors evaluate the SSH interpolations in terms of eddy detection and introduce a new learning strategy to transfer knowledge from simulation to real satellite observations

Plain English Explanation

Satellites are revolutionizing our understanding of the oceans. They use altimetry (measuring the height of the sea surface) to estimate the speed of surface currents. However, the technology has some gaps in the data it collects, so scientists use a mathematical technique called Optimal Interpolation to fill in the missing information and create complete maps.

Another type of satellite data is Sea Surface Temperature (SST), which is linked to the speed of the currents. The authors of this paper have created a simulated dataset that mimics 20 years of real satellite observations of SSH and SST. They then trained a deep learning model, called an Attention-Based Encoder-Decoder (abed), to reconstruct the SSH data using this simulated dataset.

The authors compared two different ways of training the abed model - one where it had access to the "ground truth" (the real, complete data) during training, and one where it did not. They also looked at whether including the SST data helped the model perform better at reconstructing the SSH.

In addition, the authors introduced a new way to take what the model learned from the simulated data and apply it to the real satellite observations. They first did some supervised pre-training on the simulated data, and then did additional unsupervised fine-tuning on the real satellite data. This helped the model achieve a 24% reduction in error compared to the standard Optimal Interpolation approach when applied to the real SSH observations.

Technical Explanation

The authors created a multi-variate Observing System Simulation Experiment (OSSE) to emulate 20 years of satellite observations of Sea Surface Height (SSH) and Sea Surface Temperature (SST). They then trained an Attention-Based Encoder-Decoder (abed) deep learning network on this simulated data, comparing two settings: one where the model had access to ground truth during training, and one where it did not.

The abed model was trained to reconstruct the SSH data using either supervised or unsupervised loss functions, with or without the additional SST information. The authors evaluated the SSH interpolations in terms of eddy detection, which is an important oceanographic metric.

To bridge the gap between the simulated and real-world data, the authors introduced a new learning strategy. They first did supervised pre-training on the OSSE data, and then did unsupervised fine-tuning on the actual satellite observations. This combined approach, along with the use of SST data, led to a 24% reduction in root mean squared error compared to the standard Data Unification and Altimeter Combination System (duacs) when applied to real SSH observations from the Ocean Data Challenge 2021.

Critical Analysis

The paper presents a novel approach to SSH reconstruction using deep learning and multi-variate satellite data. The authors' use of a simulated OSSE dataset to train and validate their model is a reasonable approach, as it allows them to control the experimental conditions and have access to ground truth data that is not available in the real-world observations.

However, one potential limitation is the realism of the OSSE dataset. While the authors claim it emulates 20 years of satellite observations, the extent to which it accurately captures the full complexity and variability of the real ocean system is not clear. Additionally, the authors do not provide details on how the OSSE was generated or validated, which makes it difficult to assess the suitability of this dataset for training deep learning models.

Furthermore, the authors' new learning strategy of supervised pre-training followed by unsupervised fine-tuning is an interesting approach, but its effectiveness may depend heavily on the quality and representativeness of the simulated data. It would be valuable to see the authors evaluate this strategy on a wider range of real-world satellite datasets to better understand its broader applicability and robustness.

Despite these potential limitations, the authors' work demonstrates the potential of deep learning and multi-variate data fusion to improve the reconstruction of SSH from satellite observations. Their findings contribute to the ongoing efforts to better understand ocean dynamics and climate change through remote sensing.

Conclusion

This paper proposes a novel deep learning-based approach for reconstructing Sea Surface Height (SSH) from satellite observations, leveraging the complementary information in Sea Surface Temperature (SST) data. By creating a simulated Observing System Simulation Experiment (OSSE) and training an Attention-Based Encoder-Decoder (abed) model, the authors were able to achieve a 24% reduction in root mean squared error compared to standard Optimal Interpolation methods when applied to real-world satellite observations.

The authors' work demonstrates the potential of deep learning and multi-variate data fusion to improve our understanding of the ocean state and dynamics, which is crucial for monitoring and responding to climate change. The new learning strategy of supervised pre-training and unsupervised fine-tuning also presents an interesting approach for bridging the gap between simulated and real-world data, which could have broader applications in Earth system modeling and data assimilation.

Overall, this research represents an important step forward in the use of advanced machine learning techniques for satellite image time series analysis and uncertainty-aware estimation of geophysical parameters, with potential implications for a wide range of environmental monitoring and climate research applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

NeuRSS: Enhancing AUV Localization and Bathymetric Mapping with Neural Rendering for Sidescan SLAM

Yiping Xie, Jun Zhang, Nils Bore, John Folkesson

Implicit neural representations and neural render- ing have gained increasing attention for bathymetry estimation from sidescan sonar (SSS). These methods incorporate multiple observations of the same place from SSS data to constrain the elevation estimate, converging to a globally-consistent bathymetric model. However, the quality and precision of the bathymetric estimate are limited by the positioning accuracy of the autonomous underwater vehicle (AUV) equipped with the sonar. The global positioning estimate of the AUV relying on dead reckoning (DR) has an unbounded error due to the absence of a geo-reference system like GPS underwater. To address this challenge, we propose in this letter a modern and scalable framework, NeuRSS, for SSS SLAM based on DR and loop closures (LCs) over large timescales, with an elevation prior provided by the bathymetric estimate using neural rendering from SSS. This framework is an iterative procedure that improves localization and bathymetric mapping. Initially, the bathymetry estimated from SSS using the DR estimate, though crude, can provide an important elevation prior in the nonlinear least-squares (NLS) optimization that estimates the relative pose between two loop-closure vertices in a pose graph. Subsequently, the global pose estimate from the SLAM component improves the positioning estimate of the vehicle, thus improving the bathymetry estimation. We validate our localization and mapping approach on two large surveys collected with a surface vessel and an AUV, respectively. We evaluate their localization results against the ground truth and compare the bathymetry estimation against data collected with multibeam echo sounders (MBES).

5/10/2024

cs.RO

Deep Generative Data Assimilation in Multimodal Setting

Yongquan Qu, Juan Nathaniel, Shuolin Li, Pierre Gentine

Robust integration of physical knowledge and data is key to improve computational simulations, such as Earth system models. Data assimilation is crucial for achieving this goal because it provides a systematic framework to calibrate model outputs with observations, which can include remote sensing imagery and ground station measurements, with uncertainty quantification. Conventional methods, including Kalman filters and variational approaches, inherently rely on simplifying linear and Gaussian assumptions, and can be computationally expensive. Nevertheless, with the rapid adoption of data-driven methods in many areas of computational sciences, we see the potential of emulating traditional data assimilation with deep learning, especially generative models. In particular, the diffusion-based probabilistic framework has large overlaps with data assimilation principles: both allows for conditional generation of samples with a Bayesian inverse framework. These models have shown remarkable success in text-conditioned image generation or image-controlled video synthesis. Likewise, one can frame data assimilation as observation-conditioned state calibration. In this work, we propose SLAMS: Score-based Latent Assimilation in Multimodal Setting. Specifically, we assimilate in-situ weather station data and ex-situ satellite imagery to calibrate the vertical temperature profiles, globally. Through extensive ablation, we demonstrate that SLAMS is robust even in low-resolution, noisy, and sparse data settings. To our knowledge, our work is the first to apply deep generative framework for multimodal data assimilation using real-world datasets; an important step for building robust computational simulators, including the next-generation Earth system models. Our code is available at: https://github.com/yongquan-qu/SLAMS

6/14/2024

cs.CV

Deep Learning Powered Estimate of The Extrinsic Parameters on Unmanned Surface Vehicles

Yi Shen, Hao Liu, Chang Zhou, Wentao Wang, Zijun Gao, Qi Wang

Unmanned Surface Vehicles (USVs) are pivotal in marine exploration, but their sensors' accuracy is compromised by the dynamic marine environment. Traditional calibration methods fall short in these conditions. This paper introduces a deep learning architecture that predicts changes in the USV's dynamic metacenter and refines sensors' extrinsic parameters in real time using a Time-Sequence General Regression Neural Network (GRNN) with Euler angles as input. Simulation data from Unity3D ensures robust training and testing. Experimental results show that the Time-Sequence GRNN achieves the lowest mean squared error (MSE) loss, outperforming traditional neural networks. This method significantly enhances sensor calibration for USVs, promising improved data accuracy in challenging maritime conditions. Future work will refine the network and validate results with real-world data.

6/10/2024

cs.RO

👨‍🏫

Semi Supervised Heterogeneous Domain Adaptation via Disentanglement and Pseudo-Labelling

Cassio F. Dantas (EVERGREEN, INRAE), Raffaele Gaetano (EVERGREEN), Dino Ienco (EVERGREEN)

Semi-supervised domain adaptation methods leverage information from a source labelled domain with the goal of generalizing over a scarcely labelled target domain. While this setting already poses challenges due to potential distribution shifts between domains, an even more complex scenario arises when source and target data differs in modality representation (e.g. they are acquired by sensors with different characteristics). For instance, in remote sensing, images may be collected via various acquisition modes (e.g. optical or radar), different spectral characteristics (e.g. RGB or multi-spectral) and spatial resolutions. Such a setting is denoted as Semi-Supervised Heterogeneous Domain Adaptation (SSHDA) and it exhibits an even more severe distribution shift due to modality heterogeneity across domains.To cope with the challenging SSHDA setting, here we introduce SHeDD (Semi-supervised Heterogeneous Domain Adaptation via Disentanglement) an end-to-end neural framework tailored to learning a target domain classifier by leveraging both labelled and unlabelled data from heterogeneous data sources. SHeDD is designed to effectively disentangle domain-invariant representations, relevant for the downstream task, from domain-specific information, that can hinder the cross-modality transfer. Additionally, SHeDD adopts an augmentation-based consistency regularization mechanism that takes advantages of reliable pseudo-labels on the unlabelled target samples to further boost its generalization ability on the target domain. Empirical evaluations on two remote sensing benchmarks, encompassing heterogeneous data in terms of acquisition modes and spectral/spatial resolutions, demonstrate the quality of SHeDD compared to both baseline and state-of-the-art competing approaches. Our code is publicly available here: https://github.com/tanodino/SSHDA/

6/21/2024

cs.AI cs.LG