An Empirical Study of Large-Scale Data-Driven Full Waveform Inversion

Read original: arXiv:2307.15388 - Published 4/26/2024 by Peng Jin, Yinan Feng, Shihang Feng, Hanchen Wang, Yinpeng Chen, Benjamin Consolvo, Zicheng Liu, Youzuo Lin

⛏️

Overview

This paper investigates how using large, diverse datasets can improve the performance of deep learning models for solving the full waveform inversion (FWI) problem.
FWI is a technique used in geophysics to generate detailed 3D models of the Earth's subsurface structure from seismic data.
The researchers trained and evaluated deep learning FWI models on a large, synthetic dataset called OpenFWI, and compared the results to models trained on smaller subsets of the data.

Plain English Explanation

Deep learning models have shown great potential for solving complex problems in fields like geophysics. One such problem is full waveform inversion (FWI), which aims to create detailed 3D maps of the Earth's subsurface structure using seismic data.

In this study, the researchers wanted to see if providing deep learning models with larger, more diverse datasets could further boost their performance on FWI tasks. They trained and evaluated FWI models on a new, large-scale synthetic dataset called OpenFWI, and compared the results to models trained on smaller subsets of the data.

The key finding was that training on the full OpenFWI dataset led to significant improvements in the models' accuracy, as measured by metrics like mean absolute error (MAE), mean squared error (MSE), and structural similarity index (SSIM). On average, the models trained on the combined dataset showed 13-28% better performance compared to those trained on individual subsets.

The researchers also found that the model's capacity (size and complexity) needed to scale with the dataset size to achieve optimal improvements. Larger, more complex models were able to extract more value from the additional data.

Overall, this study provides empirical evidence that using large, high-quality datasets can meaningfully enhance the performance of deep learning models - even for specialized geophysical problems like FWI. This has important implications for practitioners looking to apply deep learning to challenging real-world tasks.

Technical Explanation

The researchers conducted an empirical study to investigate the impact of using a large, diverse dataset on the performance of deep learning models for the full waveform inversion (FWI) problem.

They trained and evaluated multiple FWI models on a combination of 10 2D subsets from the recently published OpenFWI dataset, which contains a total of 470,000 pairs of seismic data and velocity maps. This allowed them to compare model performance when trained on the full dataset versus individual subsets.

The experiments demonstrated that training on the combined OpenFWI dataset yielded significant improvements compared to the individual subsets. On average, the models showed a 13.03% improvement in MAE, a 7.19% improvement in MSE, and a 1.87% improvement in SSIM. In a leave-one-out generalization test, the average improvements were even larger at 28.60%, 21.55%, and 8.22%, respectively.

The researchers also found that scaling the model capacity (size and complexity) in proportion to the dataset size was important for achieving optimal performance gains. Their largest model outperformed the smallest model by an average of 20.06% in MAE, 13.39% in MSE, and 0.72% in SSIM.

Critical Analysis

The paper provides a thorough empirical validation of the impact of large, diverse datasets on deep learning models for the FWI problem. However, a few potential limitations and areas for further research are worth noting:

Synthetic Data Limitations: The OpenFWI dataset used in this study is entirely synthetic, generated using numerical simulations. While this allowed for the creation of a large, diverse dataset, it may not fully capture the complexity and variability of real-world seismic data. Further research is needed to validate these findings on real-world seismic data.
Generalization to Other Domains: The paper focuses solely on the FWI problem in geophysics. It would be valuable to investigate whether these findings around dataset size and model capacity scaling generalize to other domains where deep learning is applied, such as underwater image enhancement or reverse gradient matching.
Computational and Training Considerations: The paper does not explore the computational and training time requirements for the larger models used with the full OpenFWI dataset. In real-world applications, these factors may be important considerations alongside model performance.

Overall, this study provides compelling evidence that leveraging large, diverse datasets can significantly boost the performance of deep learning models, even for specialized geophysical problems like FWI. The insights on the importance of model capacity scaling are particularly noteworthy and warrant further investigation.

Conclusion

This paper demonstrates the substantial benefits that large, diverse datasets can provide for training deep learning models to solve the full waveform inversion (FWI) problem in geophysics. By evaluating models on the comprehensive OpenFWI dataset, the researchers were able to show average improvements of 13-28% across key performance metrics compared to models trained on smaller subsets of the data.

The findings highlight the importance of scaling model capacity (size and complexity) in proportion to dataset size to extract the maximum value from large, high-quality datasets. These insights have important implications for practitioners looking to apply deep learning to challenging real-world problems, where access to diverse, high-quality data can be a key driver of model performance.

While this study was focused on the FWI domain, the principles around dataset size and model capacity scaling likely extend to other fields as well. Further research is needed to validate these findings on real-world seismic data and across a broader range of applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

⛏️

An Empirical Study of Large-Scale Data-Driven Full Waveform Inversion

Peng Jin, Yinan Feng, Shihang Feng, Hanchen Wang, Yinpeng Chen, Benjamin Consolvo, Zicheng Liu, Youzuo Lin

This paper investigates the impact of big data on deep learning models to help solve the full waveform inversion (FWI) problem. While it is well known that big data can boost the performance of deep learning models in many tasks, its effectiveness has not been validated for FWI. To address this gap, we present an empirical study that investigates how deep learning models in FWI behave when trained on OpenFWI, a collection of large-scale, multi-structural, synthetic datasets published recently. In particular, we train and evaluate the FWI models on a combination of 10 2D subsets in OpenFWI that contain 470K pairs of seismic data and velocity maps in total. Our experiments demonstrate that training on the combined dataset yields an average improvement of 13.03% in MAE, 7.19% in MSE and 1.87% in SSIM compared to each split dataset, and an average improvement of 28.60%, 21.55% and 8.22% in the leave-one-out generalization test. We further demonstrate that model capacity needs to scale in accordance with data size for optimal improvement, where our largest model yields an average improvement of 20.06%, 13.39% and 0.72% compared to the smallest one.

4/26/2024

Elastic Full-Waveform Inversion : How the physics of problem improves data-driven techniques?

Vahid Negahdari, Seyed Reza Moghadasi, Mohammad Reza Razvan

Full-Waveform Inversion (FWI) is a nonlinear iterative seismic imaging technique that, by reducing the misfit between recorded and predicted seismic waveforms, can produce detailed estimates of subsurface geophysical properties. Nevertheless, the strong nonlinearity of FWI can trap the optimization in local minima. This issue arises due to factors such as improper initial values, the absence of low frequencies in the measurements, noise, and other related considerations. To address this challenge and with the advent of advanced machine-learning techniques, data-driven methods, such as deep learning, have attracted significantly increasing attention in the geophysical community. Furthermore, the elastic wave equation should be included in FWI to represent elastic effects accurately. The intersection of data-driven techniques and elastic scattering theories presents opportunities and challenges. In this paper, by using the knowledge of elastic scattering (Physics of problem) and integrating it with deep learning techniques, we propose methods for the solution of time-harmonic FWI to enhance accuracy compared to pure data-driven approaches. Moreover, by modifying the structure of the Variational Autoencoder, we introduce a probabilistic deep learning method based on the physics of the problem that enables us to explore the uncertainties of the solution. According to the limited availability of datasets in this field and to assess the performance and accuracy of the proposed methods, we create a comprehensive dataset close to reality and conduct a comparative analysis of the presented approaches to it.

6/11/2024

Inversion-DeepONet: A Novel DeepONet-Based Network with Encoder-Decoder for Full Waveform Inversion

Zekai Guo, Lihui Chai, Shengjun Huang, Ye Li

Full waveform inversion (FWI) plays a crucial role in the field of geophysics. There has been lots of research about applying deep learning (DL) methods to FWI. The success of DL-FWI relies significantly on the quantity and diversity of the datasets. Nevertheless, existing FWI datasets, like OpenFWI, where sources have fixed locations or identical frequencies, provide limited information and do not represent the complex real-world scene. For instance, low frequencies help in resolving larger-scale structures. High frequencies allow for a more detailed subsurface features. %A single source frequency is insufficient to describe subsurface structural properties. We consider that simultaneously using sources with different frequencies, instead of performing inversion using low frequencies data and then gradually introducing higher frequencies data, has rationale and potential advantages. Hence, we develop three enhanced datasets based on OpenFWI where each source have varying locations, frequencies or both. Moreover, we propose a novel deep operator network (DeepONet) architecture Inversion-DeepONet for FWI. We utilize convolutional neural network (CNN) to extract the features from seismic data in branch net. Source parameters, such as locations and frequencies, are fed to trunk net. Then another CNN is employed as the decoder of DeepONet to reconstruct the velocity models more effectively. Through experiments, we confirm the superior performance on accuracy and generalization ability of our network, compared with existing data-driven FWI methods.

8/16/2024

🔄

Accelerating Full Waveform Inversion By Transfer Learning

Divya Shyam Singh, Leon Herrmann, Qing Sun, Tim Burchner, Felix Dietrich, Stefan Kollmannsberger

Full waveform inversion (FWI) is a powerful tool for reconstructing material fields based on sparsely measured data obtained by wave propagation. For specific problems, discretizing the material field with a neural network (NN) improves the robustness and reconstruction quality of the corresponding optimization problem. We call this method NN-based FWI. Starting from an initial guess, the weights of the NN are iteratively updated to fit the simulated wave signals to the sparsely measured data set. For gradient-based optimization, a suitable choice of the initial guess, i.e., a suitable NN weight initialization, is crucial for fast and robust convergence. In this paper, we introduce a novel transfer learning approach to further improve NN-based FWI. This approach leverages supervised pretraining to provide a better NN weight initialization, leading to faster convergence of the subsequent optimization problem. Moreover, the inversions yield physically more meaningful local minima. The network is pretrained to predict the unknown material field using the gradient information from the first iteration of conventional FWI. In our computational experiments on two-dimensional domains, the training data set consists of reference simulations with arbitrarily positioned elliptical voids of different shapes and orientations. We compare the performance of the proposed transfer learning NN-based FWI with three other methods: conventional FWI, NN-based FWI without pretraining and conventional FWI with an initial guess predicted from the pretrained NN. Our results show that transfer learning NN-based FWI outperforms the other methods in terms of convergence speed and reconstruction quality.

8/2/2024