MVMS-RCN: A Dual-Domain Unfolding CT Reconstruction with Multi-sparse-view and Multi-scale Refinement-correction

Read original: arXiv:2405.17141 - Published 5/28/2024 by Xiaohong Fan, Ke Chen, Huaming Yi, Yin Yang, Jianping Zhang

MVMS-RCN: A Dual-Domain Unfolding CT Reconstruction with Multi-sparse-view and Multi-scale Refinement-correction

Overview

Presents a novel deep learning-based approach for sparse-view CT reconstruction, called MVMS-RCN (Multi-View Multi-Scale Refinement-Correction Network)
Leverages multi-view projection and multi-scale geometric correction to improve reconstruction quality from limited projection data
Demonstrates superior performance compared to existing sparse-view CT reconstruction methods

Plain English Explanation

MVMS-RCN is a new deep learning technique for reconstructing high-quality 3D CT images from a limited number of X-ray projections. Typically, CT scans require a large number of projections taken from different angles to produce a clear 3D image. However, this can be time-consuming and expose patients to increased radiation.

The key idea behind MVMS-RCN is to combine information from multiple views (projections) and use a multi-scale approach to gradually refine and correct the reconstructed image. By leveraging the complementary information in the different views and applying corrections at multiple scales, the algorithm is able to produce high-quality 3D images even when only a small number of projections are available.

This is particularly important for applications like medical imaging, where reducing radiation exposure is critical. It also has implications for industrial CT scanning and 3D scene reconstruction from multiple views, where the number of capture views may be limited.

Technical Explanation

The MVMS-RCN approach consists of two main components:

Multi-View Projection: The network takes as input a set of sparse-view projections from different angles. By processing these multiple views simultaneously, the model can leverage the complementary information to produce a better initial reconstruction.
Multi-Scale Refinement-Correction: The reconstructed image is then iteratively refined and corrected at multiple scales. This allows the model to address both local and global artifacts, gradually improving the reconstruction quality.

The overall network architecture follows a deep unfolding strategy, where the iterative refinement process is encoded as a recurrent neural network. This allows the model to learn optimal correction steps in an end-to-end fashion, without relying on handcrafted priors or regularization terms.

The authors evaluate MVMS-RCN on several CT reconstruction benchmarks, including sparse-view and limited-angle settings. The results demonstrate that MVMS-RCN outperforms existing deep learning-based approaches, particularly in terms of preserving fine details and reducing artifacts.

Critical Analysis

The authors acknowledge that MVMS-RCN relies on the availability of multiple input views, which may not always be feasible in practice. They suggest that future work could explore ways to incorporate prior information or leverage single-view reconstruction techniques to address this limitation.

Additionally, the computational complexity of the multi-scale refinement process may limit the practical deployment of MVMS-RCN, especially for real-time applications. Further research could explore ways to optimize the network architecture or accelerate the inference process.

Overall, the MVMS-RCN approach represents a promising step forward in sparse-view CT reconstruction, demonstrating the potential of deep learning to address this challenging problem. However, as with any research, there are opportunities for continued improvement and expansion to address the remaining limitations.

Conclusion

The MVMS-RCN paper presents a novel deep learning-based method for sparse-view CT reconstruction that leverages multi-view projection and multi-scale refinement to produce high-quality 3D images from limited data. This has important implications for medical imaging, industrial CT scanning, and 3D scene reconstruction, where reducing radiation exposure or capture time is crucial.

The technical approach shows strong performance on benchmark datasets, outperforming existing deep learning methods. While the reliance on multiple input views and computational complexity are potential limitations, the core ideas behind MVMS-RCN demonstrate the power of combining multiple complementary techniques to address challenging inverse problems in imaging and reconstruction.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

MVMS-RCN: A Dual-Domain Unfolding CT Reconstruction with Multi-sparse-view and Multi-scale Refinement-correction

Xiaohong Fan, Ke Chen, Huaming Yi, Yin Yang, Jianping Zhang

X-ray Computed Tomography (CT) is one of the most important diagnostic imaging techniques in clinical applications. Sparse-view CT imaging reduces the number of projection views to a lower radiation dose and alleviates the potential risk of radiation exposure. Most existing deep learning (DL) and deep unfolding sparse-view CT reconstruction methods: 1) do not fully use the projection data; 2) do not always link their architecture designs to a mathematical theory; 3) do not flexibly deal with multi-sparse-view reconstruction assignments. This paper aims to use mathematical ideas and design optimal DL imaging algorithms for sparse-view tomography reconstructions. We propose a novel dual-domain deep unfolding unified framework that offers a great deal of flexibility for multi-sparse-view CT reconstruction with different sampling views through a single model. This framework combines the theoretical advantages of model-based methods with the superior reconstruction performance of DL-based methods, resulting in the expected generalizability of DL. We propose a refinement module that utilizes unfolding projection domain to refine full-sparse-view projection errors, as well as an image domain correction module that distills multi-scale geometric error corrections to reconstruct sparse-view CT. This provides us with a new way to explore the potential of projection information and a new perspective on designing network architectures. All parameters of our proposed framework are learnable end to end, and our method possesses the potential to be applied to plug-and-play reconstruction. Extensive experiments demonstrate that our framework is superior to other existing state-of-the-art methods. Our source codes are available at https://github.com/fanxiaohong/MVMS-RCN.

5/28/2024

📈

MSDiff: Multi-Scale Diffusion Model for Ultra-Sparse View CT Reconstruction

Pinhuang Tan, Mengxiao Geng, Jingya Lu, Liu Shi, Bin Huang, Qiegen Liu

Computed Tomography (CT) technology reduces radiation haz-ards to the human body through sparse sampling, but fewer sampling angles pose challenges for image reconstruction. Score-based generative models are widely used in sparse-view CT re-construction, performance diminishes significantly with a sharp reduction in projection angles. Therefore, we propose an ultra-sparse view CT reconstruction method utilizing multi-scale dif-fusion models (MSDiff), designed to concentrate on the global distribution of information and facilitate the reconstruction of sparse views with local image characteristics. Specifically, the proposed model ingeniously integrates information from both comprehensive sampling and selectively sparse sampling tech-niques. Through precise adjustments in diffusion model, it is capable of extracting diverse noise distribution, furthering the understanding of the overall structure of images, and aiding the fully sampled model in recovering image information more effec-tively. By leveraging the inherent correlations within the projec-tion data, we have designed an equidistant mask, enabling the model to focus its attention more effectively. Experimental re-sults demonstrated that the multi-scale model approach signifi-cantly improved the quality of image reconstruction under ultra-sparse angles, with good generalization across various datasets.

5/10/2024

C^2RV: Cross-Regional and Cross-View Learning for Sparse-View CBCT Reconstruction

Yiqun Lin, Jiewen Yang, Hualiang Wang, Xinpeng Ding, Wei Zhao, Xiaomeng Li

Cone beam computed tomography (CBCT) is an important imaging technology widely used in medical scenarios, such as diagnosis and preoperative planning. Using fewer projection views to reconstruct CT, also known as sparse-view reconstruction, can reduce ionizing radiation and further benefit interventional radiology. Compared with sparse-view reconstruction for traditional parallel/fan-beam CT, CBCT reconstruction is more challenging due to the increased dimensionality caused by the measurement process based on cone-shaped X-ray beams. As a 2D-to-3D reconstruction problem, although implicit neural representations have been introduced to enable efficient training, only local features are considered and different views are processed equally in previous works, resulting in spatial inconsistency and poor performance on complicated anatomies. To this end, we propose C^2RV by leveraging explicit multi-scale volumetric representations to enable cross-regional learning in the 3D space. Additionally, the scale-view cross-attention module is introduced to adaptively aggregate multi-scale and multi-view features. Extensive experiments demonstrate that our C^2RV achieves consistent and significant improvement over previous state-of-the-art methods on datasets with diverse anatomy.

6/7/2024

CT-SDM: A Sampling Diffusion Model for Sparse-View CT Reconstruction across All Sampling Rates

Liutao Yang, Jiahao Huang, Guang Yang, Daoqiang Zhang

Sparse views X-ray computed tomography has emerged as a contemporary technique to mitigate radiation dose. Because of the reduced number of projection views, traditional reconstruction methods can lead to severe artifacts. Recently, research studies utilizing deep learning methods has made promising progress in removing artifacts for Sparse-View Computed Tomography (SVCT). However, given the limitations on the generalization capability of deep learning models, current methods usually train models on fixed sampling rates, affecting the usability and flexibility of model deployment in real clinical settings. To address this issue, our study proposes a adaptive reconstruction method to achieve high-performance SVCT reconstruction at any sampling rate. Specifically, we design a novel imaging degradation operator in the proposed sampling diffusion model for SVCT (CT-SDM) to simulate the projection process in the sinogram domain. Thus, the CT-SDM can gradually add projection views to highly undersampled measurements to generalize the full-view sinograms. By choosing an appropriate starting point in diffusion inference, the proposed model can recover the full-view sinograms from any sampling rate with only one trained model. Experiments on several datasets have verified the effectiveness and robustness of our approach, demonstrating its superiority in reconstructing high-quality images from sparse-view CT scans across various sampling rates.

9/4/2024