TransAnaNet: Transformer-based Anatomy Change Prediction Network for Head and Neck Cancer Patient Radiotherapy

Read original: arXiv:2405.05674 - Published 5/24/2024 by Meixu Chen, Kai Wang, Michael Dohopolski, Howard Morgan, David Sher, Jing Wang


Overview

This study aims to assess the feasibility of using a vision-transformer (ViT) based neural network to predict radiotherapy (RT)-induced anatomic changes in head and neck cancer (HNC) patients.
The researchers retrospectively included 121 HNC patients treated with definitive RT/CRT and collected various imaging and clinical data for model construction and evaluation.
A UNet-style ViT network was designed to learn spatial correspondence and contextual information from embedded CT, dose, and CBCT images, as well as delineated tumor volumes.
The model estimated the deformation vector field between initial and later CBCT scans to predict anatomic changes, and the predicted CBCT was compared to the actual follow-up CBCT.

Plain English Explanation

Head and neck cancer patients often experience significant changes to their anatomy during the course of their radiotherapy treatment. Identifying these patients early can help optimize their care and make the best use of medical resources. This study explored using a type of artificial intelligence called a vision transformer to predict the anatomic changes that would occur in these patients.

The researchers collected imaging and clinical data from 121 head and neck cancer patients who had undergone radiotherapy. This included their planning CT scans, the planned radiation dose, and CBCT (cone-beam CT) scans taken at the first treatment session and at fraction 21. They also had the doctors' outlines of the primary tumor and involved lymph nodes on these scans.

The researchers then designed a neural network model inspired by the "vision transformer" architecture. This model was able to learn patterns from the embedded CT, dose, and CBCT images, as well as the tumor volume information. Using this, the model could predict how the patient's anatomy would change over the course of their radiotherapy treatment.

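The model's predictions closely matched the actual follow-up CBCT scans, with an average structural similarity (SSIM) of 0.933 and an average mean squared error of 0.009 on normalized images. They were also closer to the real follow-up scans than the planning CT, the initial CBCT, or the predictions from the comparison models. This suggests that this AI-based approach has promise for helping doctors anticipate anatomic changes and adapt radiotherapy plans accordingly.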

Technical Explanation

The researchers retrospectively included 121 HNC patients treated with definitive RT/CRT. They collected the planning CT (pCT), planned dose, CBCTs acquired at the initial treatment (CBCT01) and fraction 21 (CBCT21), and primary tumor volume (GTVp) and involved nodal volume (GTVn) delineated on both pCT and CBCTs.

A UNet-style ViT network was designed to learn spatial correspondence and contextual information from embedded CT, dose, CBCT01, GTVp, and GTVn image patches. The model estimated the deformation vector field between CBCT01 and CBCT21 as the prediction of anatomic change, and deformed CBCT01 was used as the prediction of CBCT21.
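The last step, deforming CBCT01 with the estimated deformation vector field, can be sketched as follows. This is a minimal 2D NumPy illustration, not the authors' implementation: the function name `warp_image`, the pixel-unit displacement convention, the bilinear interpolation, and the edge clamping are all assumptions made for the example, and the paper's network operates on 3D volumes.

```python
import numpy as np


def warp_image(image, dvf):
    """Warp a 2D image with a dense displacement field (bilinear sampling).

    image: (H, W) array, e.g. one CBCT01 slice (hypothetical input).
    dvf:   (2, H, W) array; dvf[0] is the row displacement and dvf[1] the
           column displacement, both in pixels (assumed convention).
    Output pixel (r, c) samples the input at (r + dvf[0, r, c], c + dvf[1, r, c]).
    Sampling positions outside the image are clamped to the border.
    """
    h, w = image.shape
    rows, cols = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")

    # Sampling coordinates, clamped to the valid range.
    r = np.clip(rows + dvf[0], 0, h - 1)
    c = np.clip(cols + dvf[1], 0, w - 1)

    # Integer corners surrounding each sampling position.
    r0 = np.floor(r).astype(int)
    c0 = np.floor(c).astype(int)
    r1 = np.minimum(r0 + 1, h - 1)
    c1 = np.minimum(c0 + 1, w - 1)

    # Fractional offsets used as bilinear weights.
    fr = r - r0
    fc = c - c0

    top = image[r0, c0] * (1 - fc) + image[r0, c1] * fc
    bottom = image[r1, c0] * (1 - fc) + image[r1, c1] * fc
    return top * (1 - fr) + bottom * fr


if __name__ == "__main__":
    cbct01_slice = np.arange(16, dtype=float).reshape(4, 4)
    dvf = np.zeros((2, 4, 4))
    dvf[1] = 1.0  # sample one pixel to the right everywhere
    predicted_slice = warp_image(cbct01_slice, dvf)
    print(predicted_slice)
```

In this sketch a zero field returns the input unchanged, so the prediction of "no anatomic change" is simply CBCT01 itself; the network's job is to estimate the non-zero field.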

The predicted image from the proposed method was more similar to the real image (CBCT21) than pCT, CBCT01, or the predicted CBCTs from the comparison models. The average MSE and SSIM between the normalized predicted CBCT and CBCT21 were 0.009 and 0.933, while the average Dice coefficients for the body, GTVp, and GTVn masks were 0.972, 0.792, and 0.821, respectively.
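For readers unfamiliar with these metrics, the sketch below shows one way to compute them with NumPy. It is illustrative only: the function names are hypothetical, and `global_ssim` evaluates the SSIM formula once over the whole image, whereas SSIM is usually averaged over local windows, so its values would not be directly comparable to the 0.933 reported above. The paper's exact normalization and SSIM settings are not given in this summary.

```python
import numpy as np


def mse(a, b):
    """Mean squared error between two images of the same shape."""
    return float(np.mean((a - b) ** 2))


def dice(mask_a, mask_b):
    """Dice coefficient between two binary masks: 2|A and B| / (|A| + |B|)."""
    a = mask_a.astype(bool)
    b = mask_b.astype(bool)
    denom = a.sum() + b.sum()
    if denom == 0:
        # Both masks empty: treated as perfect agreement (a convention, assumed here).
        return 1.0
    return float(2.0 * np.logical_and(a, b).sum() / denom)


def global_ssim(a, b, data_range=1.0):
    """Simplified SSIM computed once over the whole image (no sliding window).

    Uses the standard stabilizing constants C1 = (0.01 L)^2 and C2 = (0.03 L)^2,
    where L is the data range of the normalized images.
    """
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    mu_a, mu_b = a.mean(), b.mean()
    var_a, var_b = a.var(), b.var()
    cov = ((a - mu_a) * (b - mu_b)).mean()
    num = (2 * mu_a * mu_b + c1) * (2 * cov + c2)
    den = (mu_a ** 2 + mu_b ** 2 + c1) * (var_a + var_b + c2)
    return float(num / den)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    real = rng.random((32, 32))  # stand-in for a normalized CBCT21 slice
    pred = np.clip(real + 0.05 * rng.standard_normal((32, 32)), 0, 1)
    print("MSE:", mse(pred, real))
    print("global SSIM:", global_ssim(pred, real))
    print("Dice:", dice(pred > 0.5, real > 0.5))
```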

Critical Analysis

The paper presents a promising approach for predicting radiotherapy-induced anatomic changes in HNC patients using a vision transformer-based neural network. The model's strong performance in estimating the deformation between initial and follow-up CBCT scans, together with the high overlap between the predicted and actual body and tumor volume masks, suggests it could be a valuable tool for quantifying serial changes in medical imaging.

However, the study is limited by its retrospective nature and relatively small sample size. Further validation on a larger, prospective cohort would be needed to fully assess the model's clinical utility. Additionally, the paper does not provide much insight into the robustness of the model to factors like image quality, artifact, or anatomical variability that may be present in real-world clinical settings.

It would also be interesting to see how this ViT-based approach compares to a wider range of deep learning methods for predicting anatomic deformation in medical imaging. Exploring the model's generalizability to different cancer types or treatment modalities could also expand the potential applications of this research.

Conclusion

This study demonstrates the feasibility of using a vision transformer-based neural network to predict radiotherapy-induced anatomic changes in head and neck cancer patients. The model's strong performance in estimating deformation and in reproducing the body and tumor volume masks suggests it could be a valuable tool to assist in the decision-making of adaptive radiotherapy. Further research is needed to validate the approach in larger, prospective studies and explore its broader applicability in the field of medical imaging and oncology.


Related Papers


TransAnaNet: Transformer-based Anatomy Change Prediction Network for Head and Neck Cancer Patient Radiotherapy

Meixu Chen, Kai Wang, Michael Dohopolski, Howard Morgan, David Sher, Jing Wang

Early identification of head and neck cancer (HNC) patients who would experience significant anatomical change during radiotherapy (RT) is important to optimize patient clinical benefit and treatment resources. This study aims to assess the feasibility of using a vision-transformer (ViT) based neural network to predict RT-induced anatomic change in HNC patients. We retrospectively included 121 HNC patients treated with definitive RT/CRT. We collected the planning CT (pCT), planned dose, CBCTs acquired at the initial treatment (CBCT01) and fraction 21 (CBCT21), and primary tumor volume (GTVp) and involved nodal volume (GTVn) delineated on both pCT and CBCTs for model construction and evaluation. A UNet-style ViT network was designed to learn spatial correspondence and contextual information from embedded CT, dose, CBCT01, GTVp, and GTVn image patches. The model estimated the deformation vector field between CBCT01 and CBCT21 as the prediction of anatomic change, and deformed CBCT01 was used as the prediction of CBCT21. We also generated binary masks of GTVp, GTVn, and patient body for volumetric change evaluation. The predicted image from the proposed method yielded the best similarity to the real image (CBCT21) over pCT, CBCT01, and predicted CBCTs from other comparison models. The average MSE and SSIM between the normalized predicted CBCT and CBCT21 are 0.009 and 0.933, while the average Dice coefficients for the body mask, GTVp mask, and GTVn mask are 0.972, 0.792, and 0.821, respectively. The proposed method showed promising performance for predicting radiotherapy-induced anatomic change, which has the potential to assist in the decision-making of HNC adaptive RT.

5/24/2024


ARANet: Attention-based Residual Adversarial Network with Deep Supervision for Radiotherapy Dose Prediction of Cervical Cancer

Lu Wen, Wenxia Yin, Zhenghao Feng, Xi Wu, Deng Xiong, Yan Wang

Radiation therapy is the mainstay treatment for cervical cancer, and its ultimate goal is to ensure the planning target volume (PTV) reaches the prescribed dose while reducing dose deposition of organs-at-risk (OARs) as much as possible. To achieve these clinical requirements, the medical physicist needs to manually tweak the radiotherapy plan repeatedly in a trial-and-error manner until finding the optimal one in the clinic. However, such trial-and-error processes are quite time-consuming, and the quality of plans highly depends on the experience of the medical physicist. In this paper, we propose an end-to-end Attention-based Residual Adversarial Network with deep supervision, namely ARANet, to automatically predict the 3D dose distribution of cervical cancer. Specifically, given the computed tomography (CT) images and their corresponding segmentation masks of PTV and OARs, ARANet employs a prediction network to generate the dose maps. We also utilize a multi-scale residual attention module and deep supervision mechanism to enforce the prediction network to extract more valuable dose features while suppressing irrelevant information. Our proposed method is validated on an in-house dataset including 54 cervical cancer patients, and experimental results have demonstrated its obvious superiority compared to other state-of-the-art methods.

8/27/2024

Deep-Motion-Net: GNN-based volumetric organ shape reconstruction from single-view 2D projections

Isuru Wijesinghe, Michael Nix, Arezoo Zakeri, Alireza Hokmabadi, Bashar Al-Qaisieh, Ali Gooya, Zeike A. Taylor

We propose Deep-Motion-Net: an end-to-end graph neural network (GNN) architecture that enables 3D (volumetric) organ shape reconstruction from a single in-treatment kV planar X-ray image acquired at any arbitrary projection angle. Estimating and compensating for true anatomical motion during radiotherapy is essential for improving the delivery of planned radiation dose to target volumes while sparing organs-at-risk, and thereby improving the therapeutic ratio. Achieving this using only limited imaging available during irradiation and without the use of surrogate signals or invasive fiducial markers is attractive. The proposed model learns the mesh regression from a patient-specific template and deep features extracted from kV images at arbitrary projection angles. A 2D-CNN encoder extracts image features, and four feature pooling networks fuse these features to the 3D template organ mesh. A ResNet-based graph attention network then deforms the feature-encoded mesh. The model is trained using synthetically generated organ motion instances and corresponding kV images. The latter is generated by deforming a reference CT volume aligned with the template mesh, creating digitally reconstructed radiographs (DRRs) at required projection angles, and DRR-to-kV style transferring with a conditional CycleGAN model. The overall framework was tested quantitatively on synthetic respiratory motion scenarios and qualitatively on in-treatment images acquired over full scan series for liver cancer patients. Overall mean prediction errors for synthetic motion test datasets were 0.16$\pm$0.13 mm, 0.18$\pm$0.19 mm, 0.22$\pm$0.34 mm, and 0.12$\pm$0.11 mm. Mean peak prediction errors were 1.39 mm, 1.99 mm, 3.29 mm, and 1.16 mm.

7/10/2024

Segmentation-Free Outcome Prediction in Head and Neck Cancer: Deep Learning-based Feature Extraction from Multi-Angle Maximum Intensity Projections (MA-MIPs) of PET Images

Amirhosein Toosi, Isaac Shiri, Habib Zaidi, Arman Rahmim

We introduce an innovative, simple, effective segmentation-free approach for outcome prediction in head & neck cancer (HNC) patients. By harnessing deep learning-based feature extraction techniques and multi-angle maximum intensity projections (MA-MIPs) applied to Fluorodeoxyglucose Positron Emission Tomography (FDG-PET) volumes, our proposed method eliminates the need for manual segmentations of regions-of-interest (ROIs) such as primary tumors and involved lymph nodes. Instead, a state-of-the-art object detection model is trained to perform automatic cropping of the head and neck region on the PET volumes. A pre-trained deep convolutional neural network backbone is then utilized to extract deep features from MA-MIPs obtained from 72 multi-angle axial rotations of the cropped PET volumes. These deep features extracted from multiple projection views of the PET volumes are then aggregated and fused, and employed to perform recurrence-free survival analysis on a cohort of 489 HNC patients. The proposed approach outperforms the best performing method on the target dataset for the task of recurrence-free survival analysis. By circumventing the manual delineation of the malignancies on the FDG PET-CT images, our approach eliminates the dependency on subjective interpretations and greatly enhances the reproducibility of the proposed survival analysis method.

5/6/2024