Propagating the prior from shallow to deep with a pre-trained velocity-model Generative Transformer network

Read original: arXiv:2408.09767 - Published 8/20/2024 by Randy Harsuko, Shijun Cheng, Tariq Alkhalifah

Propagating the prior from shallow to deep with a pre-trained velocity-model Generative Transformer network

Overview

This paper presents a novel approach for propagating the prior from a shallow pre-trained velocity-model Generative Transformer network to a deeper network.
The proposed method aims to improve the performance of deep generative models for tasks such as seismic velocity synthesis and inversion.
The key idea is to leverage the knowledge captured in a pre-trained shallow network to guide the training of a deeper model, leading to better performance and faster convergence.

Plain English Explanation

The researchers developed a new technique to [object Object]. Seismic data is used in the oil and gas industry to understand the structure of the Earth's subsurface.

The core idea is to [object Object], and use that knowledge to train a deeper (more complex) model. This helps the deeper model learn faster and perform better at tasks like [object Object] and inverting seismic data to estimate properties of the Earth's subsurface.

The key insight is that the shallow pre-trained model has already captured important patterns and relationships in the seismic data, and this information can be leveraged to guide the training of the deeper model. This helps the deeper model converge more quickly and produce more accurate results.

Technical Explanation

The researchers propose a [object Object]. The shallow pre-trained network serves as a [object Object] that has already learned relevant features and patterns from the seismic data.

To leverage this pre-trained knowledge, the researchers introduce a knowledge distillation technique. This involves training the deeper network to not only match the target seismic data, but also to mimic the output and internal representations of the shallow pre-trained network. This encourages the deeper network to learn similar features and relationships, leading to better performance and faster convergence.

The experiments demonstrate that this approach outperforms training the deeper network from scratch, as well as other transfer learning techniques. The deeper network is able to generate more accurate seismic velocity models and invert seismic data more effectively, thanks to the guidance provided by the shallow pre-trained model.

Critical Analysis

The paper presents a compelling approach for improving the performance of deep generative models for seismic data analysis tasks. However, the authors acknowledge [object Object].

One potential concern is the reliance on a pre-trained shallow model, which may not be available in all scenarios. The researchers suggest exploring ways to [object Object] to reduce this dependency.

Additionally, the paper focuses on a specific task of seismic velocity synthesis and inversion. It would be valuable to [object Object], such as full-waveform inversion or electromagnetic data processing.

Conclusion

This paper presents a novel technique for [object Object]. The proposed [object Object] enables the deeper network to learn more efficiently and produce more accurate seismic velocity models and inversion results.

This research demonstrates the potential of [object Object] to enhance the performance of deep learning algorithms in the geophysical domain, with broader implications for other fields where data-driven modeling is crucial.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Propagating the prior from shallow to deep with a pre-trained velocity-model Generative Transformer network

Randy Harsuko, Shijun Cheng, Tariq Alkhalifah

Building subsurface velocity models is essential to our goals in utilizing seismic data for Earth discovery and exploration, as well as monitoring. With the dawn of machine learning, these velocity models (or, more precisely, their distribution) can be stored accurately and efficiently in a generative model. These stored velocity model distributions can be utilized to regularize or quantify uncertainties in inverse problems, like full waveform inversion. However, most generators, like normalizing flows or diffusion models, treat the image (velocity model) uniformly, disregarding spatial dependencies and resolution changes with respect to the observation locations. To address this weakness, we introduce VelocityGPT, a novel implementation that utilizes Transformer decoders trained autoregressively to generate a velocity model from shallow subsurface to deep. Owing to the fact that seismic data are often recorded on the Earth's surface, a top-down generator can utilize the inverted information in the shallow as guidance (prior) to generating the deep. To facilitate the implementation, we use an additional network to compress the velocity model. We also inject prior information, like well or structure (represented by a migration image) to generate the velocity model. Using synthetic data, we demonstrate the effectiveness of VelocityGPT as a promising approach in generative model applications for seismic velocity model building.

8/20/2024

Controllable seismic velocity synthesis using generative diffusion models

Fu Wang, Xinquan Huang, Tariq Alkhalifah

Accurate seismic velocity estimations are vital to understanding Earth's subsurface structures, assessing natural resources, and evaluating seismic hazards. Machine learning-based inversion algorithms have shown promising performance in regional (i.e., for exploration) and global velocity estimation, while their effectiveness hinges on access to large and diverse training datasets whose distributions generally cover the target solutions. Additionally, enhancing the precision and reliability of velocity estimation also requires incorporating prior information, e.g., geological classes, well logs, and subsurface structures, but current statistical or neural network-based methods are not flexible enough to handle such multi-modal information. To address both challenges, we propose to use conditional generative diffusion models for seismic velocity synthesis, in which we readily incorporate those priors. This approach enables the generation of seismic velocities that closely match the expected target distribution, offering datasets informed by both expert knowledge and measured data to support training for data-driven geophysical methods. We demonstrate the flexibility and effectiveness of our method through training diffusion models on the OpenFWI dataset under various conditions, including class labels, well logs, reflectivity images, and the combination of these priors. The performance of the approach under out-of-distribution conditions further underscores its generalization ability, showcasing its potential to provide tailored priors for velocity inverse problems and create specific training datasets for machine learning-based geophysical applications.

8/12/2024

Generative Geostatistical Modeling from Incomplete Well and Imaged Seismic Observations with Diffusion Models

Huseyin Tuna Erdinc, Rafael Orozco, Felix J. Herrmann

In this study, we introduce a novel approach to synthesizing subsurface velocity models using diffusion generative models. Conventional methods rely on extensive, high-quality datasets, which are often inaccessible in subsurface applications. Our method leverages incomplete well and seismic observations to produce high-fidelity velocity samples without requiring fully sampled training datasets. The results demonstrate that our generative model accurately captures long-range structures, aligns with ground-truth velocity models, achieves high Structural Similarity Index (SSIM) scores, and provides meaningful uncertainty estimations. This approach facilitates realistic subsurface velocity synthesis, offering valuable inputs for full-waveform inversion and enhancing seismic-based subsurface modeling.

6/11/2024

Stochastic full waveform inversion with deep generative prior for uncertainty quantification

Yuke Xie, Herv'e Chauris, Nicolas Desassis

To obtain high-resolution images of subsurface structures from seismic data, seismic imaging techniques such as Full Waveform Inversion (FWI) serve as crucial tools. However, FWI involves solving a nonlinear and often non-unique inverse problem, presenting challenges such as local minima trapping and inadequate handling of inherent uncertainties. In addressing these challenges, we propose leveraging deep generative models as the prior distribution of geophysical parameters for stochastic Bayesian inversion. This approach integrates the adjoint state gradient for efficient back-propagation from the numerical solution of partial differential equations. Additionally, we introduce explicit and implicit variational Bayesian inference methods. The explicit method computes variational distribution density using a normalizing flow-based neural network, enabling computation of the Bayesian posterior of parameters. Conversely, the implicit method employs an inference network attached to a pretrained generative model to estimate density, incorporating an entropy estimator. Furthermore, we also experimented with the Stein Variational Gradient Descent (SVGD) method as another variational inference technique, using particles. We compare these variational Bayesian inference methods with conventional Markov chain Monte Carlo (McMC) sampling. Each method is able to quantify uncertainties and to generate seismic data-conditioned realizations of subsurface geophysical parameters. This framework provides insights into subsurface structures while accounting for inherent uncertainties.

6/10/2024