Features Fusion for Dual-View Mammography Mass Detection

Read original: arXiv:2404.16718 - Published 4/26/2024 by Arina Varlamova, Valery Belotsky, Grigory Novikov, Anton Konushin, Evgeny Sidorov


Overview

Detecting malignant breast lesions on mammography images is crucial for early diagnosis of breast cancer.
In clinical practice, each breast is imaged from two different angles, and radiologists use both views together to locate the same lesion. Automatic detection methods, however, still struggle to fuse this information effectively.
The paper proposes a new model called MAMM-Net that can process both mammography views simultaneously and share information at both the object and feature levels.
MAMM-Net's key component is the Fusion Layer, which uses deformable attention to improve detection precision while maintaining high recall.
Experiments show that MAMM-Net outperforms the previous state-of-the-art model on the public DDSM dataset. It also adds two new capabilities: pixel-level lesion annotation and malignancy classification.

Plain English Explanation

Breast cancer is a serious disease, but early detection can greatly improve outcomes. Mammograms, or X-ray images of the breast, are a common way to screen for breast cancer. Radiologists review these images and look for suspicious areas that could be cancerous lesions.

To get a more complete picture, mammograms are usually taken from two different angles. Radiologists can then compare the images and use the combined information to locate and examine any potential lesions. However, automatically processing these two views together has been a challenge for computer-based detection systems.

The researchers developed a new model called MAMM-Net that can analyze both mammography views at the same time. MAMM-Net has a Fusion Layer that takes information from the two views and combines it using deformable attention, a mechanism that learns which parts of each image to focus on. According to the paper, this helps MAMM-Net detect lesions more accurately than the previous state-of-the-art model on the public DDSM dataset.

In addition, MAMM-Net can provide more detailed information, like marking the exact location of lesions on the image and classifying whether they are likely to be cancerous or not. This extra detail could be very useful for radiologists and patients in diagnosing and treating breast cancer early on.

Technical Explanation

The key innovation in MAMM-Net is the Fusion Layer, which is designed to effectively combine information from the two mammography views. The Fusion Layer uses a deformable attention mechanism to selectively focus on and integrate the most relevant features from each view.
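To make the idea of deformable attention across two views more concrete, here is a minimal single-head sketch in plain Python. It is not the authors' Fusion Layer: the function names, the weight matrices `w_offset` and `w_attn`, and the simplifications (one feature scale, no value or output projections, zero padding outside the map) are all assumptions made for illustration. For one query location in the first view, the sketch predicts a few sampling offsets and attention weights from the query feature, bilinearly samples the second view's feature map at those points, and returns the weighted sum.

```python
import math


def bilinear_sample(feat, x, y):
    """Bilinearly sample feat[row][col] (a list of C floats) at point (x, y).

    Neighbours that fall outside the map contribute zeros (zero padding).
    """
    h, w, c = len(feat), len(feat[0]), len(feat[0][0])
    x0, y0 = math.floor(x), math.floor(y)
    out = [0.0] * c
    for yi, wy in ((y0, 1.0 - (y - y0)), (y0 + 1, y - y0)):
        for xi, wx in ((x0, 1.0 - (x - x0)), (x0 + 1, x - x0)):
            if 0 <= yi < h and 0 <= xi < w:
                for ch in range(c):
                    out[ch] += wy * wx * feat[yi][xi][ch]
    return out


def matvec(mat, vec):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vec)) for row in mat]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [e / total for e in exps]


def deformable_cross_view_attention(query, ref_xy, other_view, w_offset, w_attn):
    """Fuse features from the other view into a single query location.

    query:      C-dim feature at the query location in the first view
    ref_xy:     (x, y) reference point in the other view's feature map
    other_view: H x W x C feature map of the second view (nested lists)
    w_offset:   (2K x C) hypothetical weights giving K (dx, dy) offsets
    w_attn:     (K x C) hypothetical weights giving K attention logits
    """
    offsets = matvec(w_offset, query)
    weights = softmax(matvec(w_attn, query))
    fused = [0.0] * len(other_view[0][0])
    for k, weight in enumerate(weights):
        dx, dy = offsets[2 * k], offsets[2 * k + 1]
        sample = bilinear_sample(other_view, ref_xy[0] + dx, ref_xy[1] + dy)
        fused = [f + weight * s for f, s in zip(fused, sample)]
    return fused


# Toy example: a 2 x 2 feature map with one channel and K = 2 sampling points.
other = [[[0.0], [1.0]], [[2.0], [3.0]]]
w_off = [[0.0], [0.0], [1.0], [1.0]]  # offsets (0, 0) and (1, 1) for query [1.0]
w_att = [[0.0], [0.0]]                # equal logits, so equal attention weights
print(deformable_cross_view_attention([1.0], (0.0, 0.0), other, w_off, w_att))
# -> [1.5]
```

The point of the design is that each query reads only a handful of learned sampling locations instead of attending to every position in the other view, which keeps the cost low on large mammography feature maps. In a trained model the offset and weight matrices would be learned; here they are fixed by hand.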

This allows MAMM-Net to share information not just at the object level (as in previous approaches), but also at the feature level. The Fusion Layer is designed so that this deeper fusion increases detection precision while keeping recall high.

In experiments on the public DDSM dataset, MAMM-Net outperformed the previous state-of-the-art model. It also demonstrated the ability to generate pixel-level lesion annotations and classify lesion malignancy, which could provide valuable additional insights for radiologists and patients.
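Since the model's stated goal is higher precision at high recall, it may help to see how these two quantities are commonly computed for box detections. The sketch below is a generic illustration, not the paper's evaluation protocol: the box format `(x1, y1, x2, y2)`, the IoU threshold of 0.5, and the greedy score-ordered matching are assumptions.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def precision_recall(predictions, ground_truth, iou_threshold=0.5):
    """Compute detection precision and recall for one image.

    predictions:  list of (score, box) pairs
    ground_truth: list of boxes
    Predictions are matched greedily in descending score order, and each
    ground-truth box can be matched at most once (an assumed convention).
    """
    matched = set()
    true_positives = 0
    for _, box in sorted(predictions, key=lambda p: p[0], reverse=True):
        best_iou, best_idx = 0.0, None
        for idx, gt_box in enumerate(ground_truth):
            if idx in matched:
                continue
            overlap = iou(box, gt_box)
            if overlap > best_iou:
                best_iou, best_idx = overlap, idx
        if best_idx is not None and best_iou >= iou_threshold:
            matched.add(best_idx)
            true_positives += 1
    precision = true_positives / len(predictions) if predictions else 0.0
    recall = true_positives / len(ground_truth) if ground_truth else 0.0
    return precision, recall


# Two true lesions, three predictions: two hits and one false positive.
truth = [(0, 0, 10, 10), (20, 20, 30, 30)]
preds = [(0.9, (0, 0, 10, 10)), (0.8, (50, 50, 60, 60)), (0.7, (21, 21, 30, 30))]
precision, recall = precision_recall(preds, truth)
print(round(precision, 3), recall)  # -> 0.667 1.0
```

In this toy case every lesion is found (recall 1.0) but one of three predictions is spurious (precision about 0.667). Reducing such false positives without losing detections is the trade-off the Fusion Layer is described as targeting.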

Critical Analysis

The paper presents a compelling approach to the important problem of automating breast cancer detection from mammograms. The researchers have thoughtfully designed MAMM-Net to effectively leverage the information contained in the two mammography views.

That said, the paper does not address some potential limitations. For example, the performance of MAMM-Net was only evaluated on the relatively small DDSM dataset. Larger-scale validation on real-world clinical data would be needed to fully assess the model's generalizability and robustness.

Additionally, the paper does not discuss how MAMM-Net's performance might vary across different demographics or breast density categories. Breast cancer risk and detection can be influenced by factors like age and breast density, so it would be important to examine the model's effectiveness across diverse populations.

Overall, the MAMM-Net approach represents an interesting and potentially impactful advance in mammography-based breast cancer detection. Further research to address these limitations would help strengthen the case for adopting this type of technology in clinical practice.

Conclusion

The paper proposes MAMM-Net, a model that fuses information from both mammography views to improve the automatic detection of breast lesions. By sharing information at both the object and feature levels, MAMM-Net outperforms the previous state-of-the-art model on the public DDSM dataset.

Beyond just detecting lesions, MAMM-Net can also provide additional clinically relevant outputs like pixel-level lesion annotations and malignancy classification. These capabilities could make MAMM-Net a valuable tool to assist radiologists in the early diagnosis and treatment of breast cancer.

While further validation is needed, the research presented in this paper represents an important step forward in leveraging advanced AI techniques to enhance breast cancer screening and save lives.



Related Papers


Features Fusion for Dual-View Mammography Mass Detection

Arina Varlamova, Valery Belotsky, Grigory Novikov, Anton Konushin, Evgeny Sidorov

Detection of malignant lesions on mammography images is extremely important for early breast cancer diagnosis. In clinical practice, images are acquired from two different angles, and radiologists can fully utilize information from both views, simultaneously locating the same lesion. However, for automatic detection approaches such information fusion remains a challenge. In this paper, we propose a new model called MAMM-Net, which allows the processing of both mammography views simultaneously by sharing information not only on an object level, as seen in existing works, but also on a feature level. MAMM-Net's key component is the Fusion Layer, based on deformable attention and designed to increase detection precision while keeping high recall. Our experiments show superior performance on the public DDSM dataset compared to the previous state-of-the-art model, while introducing new helpful features such as pixel-level lesion annotation and classification of lesion malignancy.

4/26/2024

Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model

Kun Zhao, Jakub Prokop, Javier Montalt Tordera, Sadegh Mohammadi

Mammography is crucial for breast cancer surveillance and early diagnosis. However, analyzing mammography images is a demanding task for radiologists, who often review hundreds of mammograms daily, leading to overdiagnosis and overtreatment. Computer-Aided Diagnosis (CAD) systems have been developed to assist in this process, but their capabilities, particularly in lesion segmentation, remained limited. With the contemporary advances in deep learning their performance may be improved. Recently, vision-language diffusion models emerged, demonstrating outstanding performance in image generation and transferability to various downstream tasks. We aim to harness their capabilities for breast lesion segmentation in a panoptic setting, which encompasses both semantic and instance-level predictions. Specifically, we propose leveraging pretrained features from a Stable Diffusion model as inputs to a state-of-the-art panoptic segmentation architecture, resulting in accurate delineation of individual breast lesions. To bridge the gap between natural and medical imaging domains, we incorporated a mammography-specific MAM-E diffusion model and BiomedCLIP image and text encoders into this framework. We evaluated our approach on two recently published mammography datasets, CDD-CESM and VinDr-Mammo. For the instance segmentation task, we noted 40.25 AP0.1 and 46.82 AP0.05, as well as 25.44 PQ0.1 and 26.92 PQ0.05. For the semantic segmentation task, we achieved Dice scores of 38.86 and 40.92, respectively.

7/22/2024

Multi-Attention Integrated Deep Learning Frameworks for Enhanced Breast Cancer Segmentation and Identification

Pandiyaraju V, Shravan Venkatraman, Pavan Kumar S, Santhosh Malarvannan, Kannan A

Breast cancer poses a profound threat to lives globally, claiming numerous lives each year. Therefore, timely detection is crucial for early intervention and improved chances of survival. Accurately diagnosing and classifying breast tumors using ultrasound images is a persistent challenge in medicine, demanding cutting-edge solutions for improved treatment strategies. This research introduces multiattention-enhanced deep learning (DL) frameworks designed for the classification and segmentation of breast cancer tumors from ultrasound images. A spatial channel attention mechanism is proposed for segmenting tumors from ultrasound images, utilizing a novel LinkNet DL framework with an InceptionResNet backbone. Following this, the paper proposes a deep convolutional neural network with an integrated multi-attention framework (DCNNIMAF) to classify the segmented tumor as benign, malignant, or normal. From experimental results, it is observed that the segmentation model has recorded an accuracy of 98.1%, with a minimal loss of 0.6%. It has also achieved high Intersection over Union (IoU) and Dice Coefficient scores of 96.9% and 97.2%, respectively. Similarly, the classification model has attained an accuracy of 99.2%, with a low loss of 0.31%. Furthermore, the classification framework has achieved outstanding F1-Score, precision, and recall values of 99.1%, 99.3%, and 99.1%, respectively. By offering a robust framework for early detection and accurate classification of breast cancer, this proposed work significantly advances the field of medical image analysis, potentially improving diagnostic precision and patient outcomes.

7/16/2024

Pay Less On Clinical Images: Asymmetric Multi-Modal Fusion Method For Efficient Multi-Label Skin Lesion Classification

Peng Tang, Tobias Lasser

Existing multi-modal approaches primarily focus on enhancing multi-label skin lesion classification performance through advanced fusion modules, often neglecting the associated rise in parameters. In clinical settings, both clinical and dermoscopy images are captured for diagnosis; however, dermoscopy images exhibit more crucial visual features for multi-label skin lesion classification. Motivated by this observation, we introduce a novel asymmetric multi-modal fusion method in this paper for efficient multi-label skin lesion classification. Our fusion method incorporates two innovative schemes. Firstly, we validate the effectiveness of our asymmetric fusion structure. It employs a light and simple network for clinical images and a heavier, more complex one for dermoscopy images, resulting in significant parameter savings compared to the symmetric fusion structure using two identical networks for both modalities. Secondly, in contrast to previous approaches using mutual attention modules for interaction between image modalities, we propose an asymmetric attention module. This module solely leverages clinical image information to enhance dermoscopy image features, considering clinical images as supplementary information in our pipeline. We conduct extensive experiments on the seven-point checklist dataset. Results demonstrate the generality of our proposed method for both networks and Transformer structures, showcasing its superiority over existing methods. We will make our code publicly available.

7/16/2024