Diffusion-based Generative Image Outpainting for Recovery of FOV-Truncated CT Images

Read original: arXiv:2406.04769 - Published 6/10/2024 by Michelle Espranita Liman, Daniel Rueckert, Florian J. Fintelmann, Philip Muller

Overview

This paper presents a diffusion-based generative model for outpainting CT images that have been truncated due to a limited field-of-view (FOV).
The model is designed to recover the missing image regions outside the original FOV, allowing for more complete and accurate CT reconstructions.
The approach relies on diffusion models, a class of generative models known for producing high-quality, realistic images.

Plain English Explanation

Computed tomography (CT) is an important medical imaging technique that provides detailed information about the body's internal structures. However, sometimes the CT scanner's field-of-view (FOV) is not large enough to capture the entire region of interest, resulting in "truncated" images with missing information around the edges.

This paper introduces a new method to address this issue using a type of artificial intelligence called a "diffusion model." Diffusion models are a machine learning technique that can generate highly realistic and detailed images. In this case, the researchers trained a diffusion model to analyze the partial CT images and then "outpaint" the missing regions, essentially synthesizing the missing anatomy so that it is plausible and consistent with the visible part of the scan.

By using this diffusion-based outpainting approach, the researchers were able to recover the full, undistorted view of the CT scans, even for cases where a significant portion of the image was originally missing. This could be particularly useful for improving the quality and diagnostic value of CT scans, especially in situations where the scanner's FOV is limited due to patient size or other constraints.

Technical Explanation

The key technical innovation of this paper is the development of a diffusion-based generative model for outpainting CT images with truncated fields-of-view (FOVs). Diffusion models are a type of generative model trained by gradually corrupting data with noise (the "diffusion" process) and learning to reverse that corruption, so that at generation time a noisy input is refined step by step into a realistic output.
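
The noising half of this process can be sketched in a few lines. The summary does not state which noise schedule or formulation the authors use, so the example below assumes the standard DDPM closed form, x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * noise, with a linear schedule. The function names, the schedule endpoints, and the four-pixel toy "image" are illustrative assumptions, and the code uses only the Python standard library to stay dependency-free.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances (assumed endpoints, not from the paper)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """alpha_bar_t = product over s <= t of (1 - beta_s)."""
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def forward_diffuse(x0, t, alpha_bars, rng):
    """Sample x_t from q(x_t | x_0) in closed form for a flat list of pixels."""
    signal_scale = math.sqrt(alpha_bars[t])
    noise_scale = math.sqrt(1.0 - alpha_bars[t])
    return [signal_scale * v + noise_scale * rng.gauss(0.0, 1.0) for v in x0]


betas = linear_beta_schedule(1000)
alpha_bars = cumulative_alphas(betas)
rng = random.Random(0)

x0 = [0.5, -0.25, 1.0, 0.0]  # toy "image" with intensities scaled to about [-1, 1]
x_early = forward_diffuse(x0, 10, alpha_bars, rng)   # still close to x0
x_late = forward_diffuse(x0, 999, alpha_bars, rng)   # almost pure Gaussian noise
```

A trained diffusion model learns the reverse direction: given a noisy sample and the step index, it predicts how to remove a little of the noise, and repeating that prediction from the last step down to the first yields an image.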

The researchers adapted this diffusion-based approach to the task of extending the FOV of truncated CT images. Their model takes a partial CT scan as input and learns to progressively refine and "outpaint" the missing regions, generating a complete, high-quality image that is consistent with the observed data.
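
One common way to keep a generated image consistent with observed pixels is mask-based conditioning, where after every reverse step the pixels inside the FOV are overwritten with a suitably noised copy of the observation. The summary does not say whether the authors use this mechanism, so the sketch below is only an illustration of the idea under that assumption. `placeholder_denoise_step` is a hypothetical stand-in for a trained network, and the schedule values and pixel lists are toy data.

```python
import math
import random


def merge_with_observation(generated, known, inside_fov):
    """Keep known pixels inside the FOV; use generated pixels outside it."""
    return [k if keep else g for g, k, keep in zip(generated, known, inside_fov)]


def placeholder_denoise_step(x_t, t):
    """Hypothetical stand-in for one reverse step of a trained model."""
    return [0.9 * v for v in x_t]


def outpaint(observed, inside_fov, alpha_bars, rng):
    """Mask-conditioned reverse loop over a flat list of pixel values."""
    x = [rng.gauss(0.0, 1.0) for _ in observed]  # start from pure noise
    for t in reversed(range(len(alpha_bars))):
        x = placeholder_denoise_step(x, t)
        if t > 0:
            # Noise the observation to the level the sample has just reached.
            signal_scale = math.sqrt(alpha_bars[t - 1])
            noise_scale = math.sqrt(1.0 - alpha_bars[t - 1])
            known = [signal_scale * v + noise_scale * rng.gauss(0.0, 1.0)
                     for v in observed]
        else:
            known = list(observed)  # final step: use the clean observation
        x = merge_with_observation(x, known, inside_fov)
    return x


rng = random.Random(0)
observed = [0.2, 0.4, 0.6, 0.8]             # toy pixel values
inside_fov = [False, True, True, False]     # middle pixels were actually scanned
alpha_bars = [0.9, 0.6, 0.3, 0.1]           # toy cumulative schedule
result = outpaint(observed, inside_fov, alpha_bars, rng)
```

With this scheme the pixels inside the FOV come out exactly equal to the observed values, and only the pixels outside it are synthesized by the model.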

This builds on related work in diffusion-based MRI reconstruction and physics-informed diffusion models, leveraging the powerful image generation capabilities of diffusion models while also incorporating domain-specific knowledge about CT imaging.

The researchers evaluated their approach on CT slices that were truncated by simulating a small FOV. According to the abstract, the model reliably recovers the truncated anatomy and outperforms the previous state of the art despite being trained on 87% less data. This outpainting capability could have significant practical implications for improving the diagnostic value of CT imaging, especially in situations where the FOV is constrained.
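
The paper's abstract describes generating truncated slices by simulating a small FOV. A minimal sketch of such a simulation follows, assuming a circular FOV centred on the image and filling the outside with -1000 HU (the value of air). The FOV shape, the fill value, the helper names, and the error measure restricted to the region outside the FOV are all assumptions for illustration, not details taken from the paper.

```python
def circular_fov_mask(height, width, radius):
    """True for pixels whose centre lies within `radius` of the image centre."""
    cy, cx = (height - 1) / 2.0, (width - 1) / 2.0
    return [[(y - cy) ** 2 + (x - cx) ** 2 <= radius ** 2
             for x in range(width)]
            for y in range(height)]


def truncate_slice(ct_slice, mask, fill_value=-1000.0):
    """Replace pixels outside the FOV with `fill_value` (air, in Hounsfield units)."""
    return [[v if keep else fill_value for v, keep in zip(row, mask_row)]
            for row, mask_row in zip(ct_slice, mask)]


def mean_abs_error_outside(reference, recovered, mask):
    """Mean absolute error computed over pixels outside the FOV only."""
    diffs = [abs(r - p)
             for ref_row, rec_row, mask_row in zip(reference, recovered, mask)
             for r, p, keep in zip(ref_row, rec_row, mask_row)
             if not keep]
    return sum(diffs) / len(diffs) if diffs else 0.0


full = [[40.0] * 8 for _ in range(8)]          # toy slice of uniform intensity
mask = circular_fov_mask(8, 8, radius=3.0)
truncated = truncate_slice(full, mask)

# Error of the "do nothing" baseline, measured where anatomy was cut off.
baseline_error = mean_abs_error_outside(full, truncated, mask)
```

Because the full slice is known before truncation, an outpainted result can be scored against it, and restricting the score to the region outside the FOV avoids rewarding a method for pixels it was given for free.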

Critical Analysis

The researchers acknowledge several limitations of their approach that could be addressed in future work. For example, the model was trained and evaluated on a specific dataset of truncated CT scans, and its performance may vary for different imaging modalities or anatomical regions. Additionally, while the outpainted regions appear visually plausible, there may be subtle medical inaccuracies that could impact clinical decision-making.

Further research could explore ways to better incorporate physical constraints and domain knowledge into the diffusion model, potentially improving its medical accuracy and robustness. Integrating this outpainting approach with other CT reconstruction techniques, such as sparse-view reconstruction, could also lead to more comprehensive solutions for handling FOV-truncated CT images.

Additionally, although the abstract reports that the method outperforms the previous state of the art, it would be valuable to see broader comparisons to other potential solutions, such as inpainting methods or generative adversarial networks (GANs), to better understand the relative strengths and weaknesses of the proposed technique.

Conclusion

This paper presents a novel diffusion-based approach for outpainting truncated CT images, effectively recovering the missing regions outside the original field-of-view. By leveraging the powerful image generation capabilities of diffusion models, the researchers demonstrate a promising solution for improving the quality and diagnostic value of CT scans in situations where the scanner's FOV is limited.

While the approach shows compelling results, further research is needed to address potential limitations, such as improving medical accuracy and robustness, and exploring comparisons to alternative methods. Nonetheless, this work represents an important step forward in the development of advanced image reconstruction techniques for CT imaging, with the potential to have a meaningful impact on clinical practice.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Diffusion-based Generative Image Outpainting for Recovery of FOV-Truncated CT Images

Michelle Espranita Liman, Daniel Rueckert, Florian J. Fintelmann, Philip Muller

Field-of-view (FOV) recovery of truncated chest CT scans is crucial for accurate body composition analysis, which involves quantifying skeletal muscle and subcutaneous adipose tissue (SAT) on CT slices. This, in turn, enables disease prognostication. Here, we present a method for recovering truncated CT slices using generative image outpainting. We train a diffusion model and apply it to truncated CT slices generated by simulating a small FOV. Our model reliably recovers the truncated anatomy and outperforms the previous state-of-the-art despite being trained on 87% less data.

6/10/2024

Field-of-View Extension for Diffusion MRI via Deep Generative Models

Chenyu Gao, Shunxing Bao, Michael Kim, Nancy Newlin, Praitayini Kanakaraj, Tianyuan Yao, Gaurav Rudravaram, Yuankai Huo, Daniel Moyer, Kurt Schilling, Walter Kukull, Arthur Toga, Derek Archer, Timothy Hohman, Bennett Landman, Zhiyuan Li

Purpose: In diffusion MRI (dMRI), the volumetric and bundle analyses of whole-brain tissue microstructure and connectivity can be severely impeded by an incomplete field-of-view (FOV). This work aims to develop a method for imputing the missing slices directly from existing dMRI scans with an incomplete FOV. We hypothesize that the imputed image with complete FOV can improve the whole-brain tractography for corrupted data with incomplete FOV. Therefore, our approach provides a desirable alternative to discarding the valuable dMRI data, enabling subsequent tractography analyses that would otherwise be challenging or unattainable with corrupted data. Approach: We propose a framework based on a deep generative model that estimates the absent brain regions in dMRI scans with incomplete FOV. The model is capable of learning both the diffusion characteristics in diffusion-weighted images (DWI) and the anatomical features evident in the corresponding structural images for efficiently imputing missing slices of DWI outside of incomplete FOV. Results: For evaluating the imputed slices, on the WRAP dataset the proposed framework achieved PSNR_b0 = 22.397, SSIM_b0 = 0.905, PSNR_b1300 = 22.479, SSIM_b1300 = 0.893; on the NACC dataset it achieved PSNR_b0 = 21.304, SSIM_b0 = 0.892, PSNR_b1300 = 21.599, SSIM_b1300 = 0.877. The proposed framework improved the tractography accuracy, as demonstrated by an increased average Dice score for 72 tracts (p < 0.001) on both the WRAP and NACC datasets. Conclusions: Results suggest that the proposed framework achieved sufficient imputation performance in dMRI data with incomplete FOV for improving whole-brain tractography, thereby repairing the corrupted data. Our approach achieved more accurate whole-brain tractography results with extended and complete FOV and reduced the uncertainty when analyzing bundles associated with Alzheimer's Disease.

8/30/2024

Task-Specific Data Preparation for Deep Learning to Reconstruct Structures of Interest from Severely Truncated CBCT Data

Yixing Huang, Fuxin Fan, Ahmed Gomaa, Andreas Maier, Rainer Fietkau, Christoph Bert, Florian Putz

Cone-beam computed tomography (CBCT) is widely used in interventional surgeries and radiation oncology. Due to the limited size of flat-panel detectors, anatomical structures might be missing outside the limited field-of-view (FOV), which restricts the clinical applications of CBCT systems. Recently, deep learning methods have been proposed to extend the FOV for multi-slice CT systems. However, in mobile CBCT systems with a smaller FOV, projection data is severely truncated and it is challenging for a network to restore all missing structures outside the FOV. In some applications, only certain structures outside the FOV are of interest, e.g., ribs in needle path planning for liver/lung cancer diagnosis. Therefore, a task-specific data preparation method is proposed in this work, which automatically lets the network focus on structures of interest instead of all the structures. Our preliminary experiment shows that Pix2pixGAN with conventional training risks reconstructing false-positive and false-negative rib structures from severely truncated CBCT data, whereas Pix2pixGAN with the proposed task-specific training can reconstruct all the ribs reliably. The proposed method shows promise for extending CBCT to more clinical applications.

9/16/2024

Iterative CT Reconstruction via Latent Variable Optimization of Shallow Diffusion Models

Sho Ozaki, Shizuo Kaji, Toshikazu Imae, Kanabu Nawa, Hideomi Yamashita, Keiichi Nakagawa

Image-generative artificial intelligence (AI) has garnered significant attention in recent years. In particular, the diffusion model, a core component of generative AI, produces high-quality images with rich diversity. In this study, we proposed a novel computed tomography (CT) reconstruction method by combining the denoising diffusion probabilistic model with iterative CT reconstruction. In sharp contrast to previous studies, we optimized the fidelity loss of CT reconstruction with respect to the latent variable of the diffusion model, instead of the image and model parameters. To suppress the changes in anatomical structures produced by the diffusion model, we shallowed the diffusion and reverse processes and fixed a set of added noises in the reverse process to make it deterministic during the inference. We demonstrated the effectiveness of the proposed method through the sparse-projection CT reconstruction of 1/10 projection data. Despite the simplicity of the implementation, the proposed method has the potential to reconstruct high-quality images while preserving the patient's anatomical structures and was found to outperform existing methods, including iterative reconstruction, iterative reconstruction with total variation, and the diffusion model alone in terms of quantitative indices such as the structural similarity index and peak signal-to-noise ratio. We also explored further sparse-projection CT reconstruction using 1/20 projection data with the same trained diffusion model. As the number of iterations increased, the image quality improved to a level comparable to that of 1/10 sparse-projection CT reconstruction. In principle, this method can be widely applied not only to CT but also to other imaging modalities.

9/14/2024