XctDiff: Reconstruction of CT Images with Consistent Anatomical Structures from a Single Radiographic Projection Image

Read original: arXiv:2406.04679 - Published 6/17/2024 by Qingze Bai, Tiange Liu, Zhi Liu, Yubing Tong, Drew Torigian, Jayaram Udupa

Overview

This paper presents XctDiff, a novel method for reconstructing 3D computed tomography (CT) images from a single 2D radiographic projection image.
The key idea is to leverage a diffusion-based model to generate a 3D CT volume that is consistent with the input projection image while preserving realistic anatomical structures.
The proposed approach outperforms existing single-view CT reconstruction techniques and has applications in medical imaging, security screening, and industrial inspection.

Plain English Explanation

The paper introduces a new way to create 3D medical images from a single 2D X-ray. Typically, getting a 3D CT scan requires taking multiple X-ray images from different angles and then combining them. This process can be time-consuming and exposes patients to more radiation.

The researchers developed a method called XctDiff that can generate a 3D CT image from just one 2D X-ray. It uses a type of generative AI model called a diffusion model, which is trained to produce a 3D image that matches the 2D X-ray while keeping the anatomy realistic and internally consistent.

Compared to other single-view CT reconstruction techniques, XctDiff is able to produce higher-quality 3D images that better preserve the true structure of the body. This could have important applications in medical imaging, security screening, and industrial inspection, where getting 3D information from limited 2D data is valuable.

Technical Explanation

The key technical innovation of XctDiff is the use of a diffusion-based model to reconstruct the 3D CT volume from a single 2D radiographic projection. Diffusion models are a class of generative AI models that learn to transform noise into realistic data by following a step-by-step "diffusion" process.
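
To make the step-by-step process concrete, the following is a minimal sketch of the standard noising step used by denoising diffusion models, written with NumPy. It is not XctDiff's implementation: the linear noise schedule, the function names, and the tiny random volume are all assumptions chosen for illustration. It shows how clean data is mixed with Gaussian noise at a chosen step, and how a perfect noise estimate would undo that mixing, which is the quantity a trained network approximates.

```python
import numpy as np

def make_alpha_bar(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative signal-retention factors for an assumed linear noise schedule."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def add_noise(x0, t, alpha_bar, rng):
    """Forward process: blend clean data x0 with Gaussian noise at step t."""
    eps = rng.standard_normal(x0.shape)
    xt = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps
    return xt, eps

def estimate_x0(xt, t, eps_hat, alpha_bar):
    """Invert the forward blend given a noise estimate eps_hat."""
    return (xt - np.sqrt(1.0 - alpha_bar[t]) * eps_hat) / np.sqrt(alpha_bar[t])

rng = np.random.default_rng(0)
alpha_bar = make_alpha_bar()
volume = rng.random((8, 8, 8))  # hypothetical stand-in for a small CT volume

noisy, eps = add_noise(volume, t=500, alpha_bar=alpha_bar, rng=rng)

# With the true noise in hand, the clean volume is recovered exactly.
# A trained model only predicts the noise approximately, so in practice
# generation proceeds over many small denoising steps instead of one.
recovered = estimate_x0(noisy, 500, eps, alpha_bar)
```

In a conditional setting such as single-view reconstruction, the noise predictor would additionally receive features derived from the X-ray, so that the denoised result agrees with the input projection.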

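XctDiff takes a 2D X-ray image as input. According to the authors' abstract, it first extracts robust 3D priors from the radiograph using a progressive feature extraction strategy, and then uses those priors to guide the CT reconstruction in a latent space. Rather than transforming the X-ray itself, the generative process refines a representation of the 3D volume step by step so that the result stays consistent with the anatomical structure observed in the input projection.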

The researchers designed XctDiff's architecture and training process to address key challenges in single-view CT reconstruction, such as:

Handling the ill-posed nature of the inverse problem
Ensuring the reconstructed 3D volume is anatomically plausible
Improving robustness to noise and artifacts in the input projection
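
The ill-posedness in the first item can be illustrated with a small NumPy sketch. It assumes an idealized parallel-beam projection that simply sums attenuation values along one axis (real radiographs involve cone-beam geometry and nonlinear attenuation), and the `project` helper is hypothetical. Two different volumes produce the same 2D projection, so the projection alone cannot determine the depth of a structure.

```python
import numpy as np

def project(volume, axis=0):
    """Idealized parallel-beam projection: sum attenuation along one axis."""
    return volume.sum(axis=axis)

# Two toy volumes that differ only in the depth of a single dense voxel.
vol_a = np.zeros((4, 4, 4))
vol_a[0, 1, 2] = 1.0  # dense voxel near the front

vol_b = np.zeros((4, 4, 4))
vol_b[3, 1, 2] = 1.0  # same voxel moved to the back

same_projection = np.array_equal(project(vol_a), project(vol_b))
same_volume = np.array_equal(vol_a, vol_b)
```

Here `same_projection` is true while `same_volume` is false. This is why a learned prior over plausible anatomy is needed to choose among the many volumes that are consistent with one radiograph.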

Experiments on benchmark datasets show that XctDiff outperforms prior single-view CT reconstruction methods in terms of both quantitative metrics and perceptual quality. The model is able to faithfully recover 3D structures like bones, organs, and soft tissues from a single 2D X-ray image.
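
This summary does not say which quantitative metrics the paper reports. As an illustration of what such a metric looks like, the sketch below computes peak signal-to-noise ratio (PSNR), a widely used reconstruction-quality measure, with NumPy. The function name, the assumed intensity range of 1.0, and the toy volumes are illustrative assumptions, not values from the paper.

```python
import numpy as np

def psnr(reference, reconstruction, data_range=1.0):
    """Peak signal-to-noise ratio in dB; higher means closer to the reference."""
    mse = np.mean((reference - reconstruction) ** 2)
    if mse == 0:
        return float("inf")  # identical volumes
    return 10.0 * np.log10(data_range ** 2 / mse)

# Toy volumes with intensities assumed to lie in [0, 1].
reference = np.zeros((4, 4, 4))
close_recon = reference + 0.01  # small uniform error
far_recon = reference + 0.1     # larger uniform error

psnr_close = psnr(reference, close_recon)  # about 40 dB
psnr_far = psnr(reference, far_recon)      # about 20 dB
```

PSNR measures voxel-wise error only. Structural measures such as SSIM and perceptual comparisons are typically reported alongside it, because a blurry reconstruction can score reasonably on PSNR while missing fine anatomical detail.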

Critical Analysis

One limitation of the XctDiff approach is that it relies on having a large dataset of paired 2D X-ray and 3D CT images for training. This may not always be available, especially for specialized medical applications or rare anatomical conditions. The authors mention the potential to explore few-shot learning techniques to address this data scarcity issue.

Additionally, while XctDiff can generate 3D volumes from single-view inputs, the reconstructed images may not capture all the fine details and spatial relationships present in a true CT scan. Combining XctDiff with multi-view techniques, such as the biplanar X-ray methods listed under Related Papers, could potentially lead to even higher-quality 3D reconstructions.

It would also be valuable to investigate the robustness of XctDiff to variations in imaging modality, patient positioning, and other factors that can affect real-world X-ray data. Low-dose CT reconstruction methods may also be a relevant area to explore in the context of XctDiff's application to medical imaging.

Conclusion

The XctDiff method presented in this paper represents an important advancement in the field of single-view 3D reconstruction from 2D imaging data. By leveraging diffusion-based generative models, the approach can produce 3D CT volumes that are anatomically consistent with a single radiographic projection, without requiring multiple views.

This technology has the potential to streamline medical imaging workflows, reduce patient radiation exposure, and enable new applications in areas like security screening and industrial inspection. While further research is needed to address certain limitations, the core ideas and results demonstrated in this paper are a significant step forward in the quest to extract rich 3D information from limited 2D observations.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

XctDiff: Reconstruction of CT Images with Consistent Anatomical Structures from a Single Radiographic Projection Image

Qingze Bai, Tiange Liu, Zhi Liu, Yubing Tong, Drew Torigian, Jayaram Udupa

In this paper, we present XctDiff, an algorithm framework for reconstructing CT from a single radiograph, which decomposes the reconstruction process into two easily controllable tasks: feature extraction and CT reconstruction. Specifically, we first design a progressive feature extraction strategy that is able to extract robust 3D priors from radiographs. Then, we use the extracted prior information to guide the CT reconstruction in the latent space. Moreover, we design a homogeneous spatial codebook to improve the reconstruction quality further. The experimental results show that our proposed method achieves state-of-the-art reconstruction performance and overcomes the blurring issue. We also apply XctDiff to a self-supervised pre-training task. The effectiveness indicates that it has promising additional applications in medical image analysis. The code is available at: https://github.com/qingze-bai/XctDiff

6/17/2024

DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Xuhui Liu, Zhi Qiao, Runkun Liu, Hong Li, Juan Zhang, Xiantong Zhen, Zhen Qian, Baochang Zhang

Computed tomography (CT) is widely utilized in clinical settings because it delivers detailed 3D images of the human body. However, performing CT scans is not always feasible due to radiation exposure and limitations in certain surgical environments. As an alternative, reconstructing CT images from ultra-sparse X-rays offers a valuable solution and has gained significant interest in scientific research and medical applications. However, it presents great challenges as it is inherently an ill-posed problem, often compromised by artifacts resulting from overlapping structures in X-ray images. In this paper, we propose DiffuX2CT, which models CT reconstruction from orthogonal biplanar X-rays as a conditional diffusion process. DiffuX2CT is established with a 3D global coherence denoising model with a new, implicit conditioning mechanism. We realize the conditioning mechanism by a newly designed tri-plane decoupling generator and an implicit neural decoder. By doing so, DiffuX2CT achieves structure-controllable reconstruction, which enables 3D structural information to be recovered from 2D X-rays, therefore producing faithful textures in CT images. As an extra contribution, we collect a real-world lumbar CT dataset, called LumbarV, as a new benchmark to verify the clinical significance and performance of CT reconstruction from X-rays. Extensive experiments on this dataset and three more publicly available datasets demonstrate the effectiveness of our proposal.

7/19/2024

Diff2CT: Diffusion Learning to Reconstruct Spine CT from Biplanar X-Rays

Zhi Qiao, Xuhui Liu, Xiaopeng Wang, Runkun Liu, Xiantong Zhen, Pei Dong, Zhen Qian

Intraoperative CT imaging serves as a crucial resource for surgical guidance; however, it may not always be readily accessible or practical to implement. In scenarios where CT imaging is not an option, reconstructing CT scans from X-rays can offer a viable alternative. In this paper, we introduce an innovative method for 3D CT reconstruction utilizing biplanar X-rays. Distinct from previous research that relies on conventional image generation techniques, our approach leverages a conditional diffusion process to tackle the task of reconstruction. More precisely, we employ a diffusion-based probabilistic model trained to produce 3D CT images based on orthogonal biplanar X-rays. To improve the structural integrity of the reconstructed images, we incorporate a novel projection loss function. Experimental results validate that our proposed method surpasses existing state-of-the-art benchmarks in both visual image quality and multiple evaluative metrics. Specifically, our technique achieves a higher Structural Similarity Index (SSIM) of 0.83, a relative increase of 10%, and a lower Fréchet Inception Distance (FID) of 83.43, which represents a relative decrease of 25%.

8/22/2024

DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s)

Yun Su Jeong, Hye Bin Yoo, Il Yong Chun

Computational tomography (CT) provides high-resolution medical imaging, but it can expose patients to high radiation. X-ray scanners have low radiation exposure, but their resolutions are low. This paper proposes a new conditional diffusion model, DX2CT, that reconstructs three-dimensional (3D) CT volumes from bi or mono-planar X-ray image(s). Proposed DX2CT consists of two key components: 1) modulating feature maps extracted from two-dimensional (2D) X-ray(s) with 3D positions of CT volume using a new transformer and 2) effectively using the modulated 3D position-aware feature maps as conditions of DX2CT. In particular, the proposed transformer can provide conditions with rich information of a target CT slice to the conditional diffusion model, enabling high-quality CT reconstruction. Our experiments with the bi or mono-planar X-ray(s) benchmark datasets show that proposed DX2CT outperforms several state-of-the-art methods. Our codes and model will be available at: https://www.github.com/intyeger/DX2CT.

9/16/2024