Breast tumor classification based on self-supervised contrastive learning from ultrasound videos

Read original: arXiv:2408.10600 - Published 8/21/2024 by Yunxin Tang, Siyuan Tang, Jian Zhang, Hao Chen

Overview

Breast ultrasound is widely used to diagnose breast tumors.
Many deep learning-based systems have been developed to assist radiologists, but they require large amounts of labeled data, which is expensive and time-consuming to obtain.
The researchers propose a novel approach to learn representations from unlabeled breast ultrasound video clips using a triplet network and self-supervised contrastive learning.

Plain English Explanation

Breast ultrasound is a popular tool for detecting and diagnosing breast tumors. Automatic systems based on deep learning have been created to help radiologists with this task. However, these systems typically need a lot of labeled data to train, which can be costly and requires specialized medical knowledge to produce.

To address this challenge, the researchers in this paper developed a new method that can learn useful representations from unlabeled breast ultrasound video clips. They used a triplet network and a self-supervised contrastive learning technique to extract features from the videos without needing labeled data. They also designed a novel "hard triplet loss" to focus the learning on image pairs that are difficult to distinguish.

The researchers built a large pretraining dataset from the breast ultrasound videos, including anchor, positive, and negative samples. They then fine-tuned the pretrained model on a smaller dataset of labeled images for a benign/malignant classification task. Their model outperformed other state-of-the-art approaches, including three models pretrained on the popular ImageNet dataset.

Importantly, the researchers found that their pretrained model needed fewer than 100 labeled images to achieve strong performance (an AUC of 0.901), far fewer than typical deep learning models require. This suggests their approach can significantly reduce the need for costly labeled data, making it promising for automating breast ultrasound diagnosis.

Technical Explanation

The researchers adopted a triplet network architecture and a self-supervised contrastive learning approach to learn visual representations from unlabeled breast ultrasound video clips. Triplet networks are designed to learn embeddings that capture semantic similarities, by training on triplets of anchor, positive, and negative samples.
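To make the triplet idea concrete, here is a minimal sketch of the standard margin-based triplet loss on toy embeddings. The margin value, the 2-D vectors, and the function names are illustrative assumptions; the summary does not give the paper's exact formulation.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet loss: max(0, d(a, p) - d(a, n) + margin).

    The loss is zero once the negative is at least `margin` farther
    from the anchor than the positive is.
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


# Toy 2-D embeddings (hypothetical values, not taken from the paper).
anchor = [0.0, 0.0]
positive = [0.0, 1.0]  # distance 1 from the anchor
negative = [3.0, 4.0]  # distance 5 from the anchor

# Well-separated triplet: 1 - 5 + 1 is negative, so the loss clamps to 0.
easy = triplet_margin_loss(anchor, positive, negative)

# Swapping the roles puts the "positive" far away: 5 - 1 + 1 = 5.
hard = triplet_margin_loss(anchor, negative, positive)
```

Minimizing this quantity pulls the anchor and positive embeddings together while pushing the negative away, which is what lets the network learn from frames without diagnostic labels.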

To construct the pretraining dataset, the researchers collected 1,360 breast ultrasound videos from 200 patients. From these videos, they extracted 11,805 anchor images, 188,880 positive images, and dynamically generated negative samples. They also created a smaller finetuning dataset with 400 labeled images from 66 patients.
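One plausible way to draw triplets from unlabeled clips is sketched below. It assumes that a positive is another frame from the same clip as the anchor and that a negative is a frame from a different clip; the summary only says negatives are generated dynamically, so this rule, the clip names, and the frame ids are all hypothetical.

```python
import random


def sample_triplet(clips, rng):
    """Pick (anchor, positive, negative) frame ids from a dict of clip -> frames.

    Assumption: positives share the anchor's clip, negatives come from
    a different clip. The paper's actual sampling rule may differ.
    """
    anchor_clip, negative_clip = rng.sample(sorted(clips), 2)
    anchor, positive = rng.sample(clips[anchor_clip], 2)
    negative = rng.choice(clips[negative_clip])
    return anchor, positive, negative


# Hypothetical clips; each frame id starts with a letter naming its clip.
clips = {
    "video_a": ["a_000", "a_001", "a_002"],
    "video_b": ["b_000", "b_001"],
    "video_c": ["c_000", "c_001", "c_002"],
}

rng = random.Random(0)
anchor, positive, negative = sample_triplet(clips, rng)

# The reported counts imply 16 positive images per anchor image on average.
positives_per_anchor = 188_880 / 11_805
```

Drawing the negative at sampling time, rather than storing a fixed negative set, matches the description of negatives being generated dynamically.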

The key innovation was the introduction of a "hard triplet loss" function, which focuses the training on triplets where the positive and negative samples are hard to distinguish. This helps the model learn discriminative features that are particularly useful for the downstream classification task.
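The summary does not spell out the hard triplet loss itself, so the sketch below substitutes a different, widely used technique: batch-hard mining, where each anchor is paired with its farthest positive and its closest negative in the batch. It illustrates the general idea of concentrating on hard cases and is not the authors' loss. The `group_ids` (for example, a source clip id), the margin, and the 1-D embeddings are assumptions.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def batch_hard_triplet_loss(embeddings, group_ids, margin=0.5):
    """Mean triplet loss using each anchor's hardest positive and negative.

    Hardest positive: same group, largest distance to the anchor.
    Hardest negative: different group, smallest distance to the anchor.
    """
    losses = []
    for i, (anchor, gid) in enumerate(zip(embeddings, group_ids)):
        pos = [euclidean(anchor, e)
               for j, (e, g) in enumerate(zip(embeddings, group_ids))
               if g == gid and j != i]
        neg = [euclidean(anchor, e)
               for e, g in zip(embeddings, group_ids) if g != gid]
        if not pos or not neg:
            continue  # this anchor cannot form a valid triplet
        losses.append(max(0.0, max(pos) - min(neg) + margin))
    return sum(losses) / len(losses) if losses else 0.0


# Toy 1-D embeddings from two hypothetical clips, "A" and "B".
embeddings = [[0.0], [1.0], [1.5], [5.0]]
group_ids = ["A", "A", "B", "B"]

# Per-anchor losses are 0.0, 1.0, 3.5 and 0.0, so the mean is 1.125.
loss = batch_hard_triplet_loss(embeddings, group_ids)
```

Only the second and third anchors contribute here: they sit close to a frame from the other clip, which is the kind of hard-to-distinguish pair the loss is meant to emphasize.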

The pretrained network was then transferred to a benign/malignant classification task and compared to several other models, including three ImageNet-pretrained networks and a previous contrastive learning approach retrained on the researchers' datasets. Experiments showed that the proposed framework achieved an area under the receiver operating characteristic curve (AUC) of 0.952, significantly higher than the other methods.

Furthermore, the researchers found that their pretrained model required fewer than 100 labeled samples to achieve an AUC of 0.901 on the classification task, demonstrating its data efficiency compared to typical deep learning models.
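For readers unfamiliar with the metric, AUC can be computed as the fraction of (malignant, benign) pairs in which the malignant case receives the higher score. The sketch below implements that pairwise definition; the labels and scores are made-up values, not results from the paper.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a random positive outscores a random negative.

    Ties count as half a win. Labels are 1 (positive) or 0 (negative).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical classifier outputs: 1 = malignant, 0 = benign.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]

# 8 of the 9 (malignant, benign) pairs are ranked correctly, so AUC = 8/9.
auc = roc_auc(labels, scores)
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which gives context for the reported 0.952.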
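A label-efficiency study of this kind typically fine-tunes on progressively smaller labeled subsets. The sketch below shows one way to draw such subsets while preserving class proportions. The 240/160 class split and the budgets are hypothetical, since the summary reports only the total of 400 labeled images, and the paper's own subsampling procedure may differ.

```python
import random


def stratified_subset(labels, budget, rng):
    """Return sorted indices of about `budget` samples, keeping class proportions."""
    by_class = {}
    for idx, y in enumerate(labels):
        by_class.setdefault(y, []).append(idx)
    chosen = []
    for y, idxs in sorted(by_class.items()):
        # Give each class a share of the budget proportional to its size.
        k = round(budget * len(idxs) / len(labels))
        chosen.extend(rng.sample(idxs, min(k, len(idxs))))
    return sorted(chosen)


# Hypothetical labels for 400 images: 240 benign (0) and 160 malignant (1).
labels = [0] * 240 + [1] * 160

rng = random.Random(0)
subsets = {b: stratified_subset(labels, b, rng) for b in (50, 100, 200)}
```

Each entry of `subsets` could then be used to fine-tune the pretrained network and record the resulting AUC, giving a curve of performance against label budget.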

Critical Analysis

The researchers present a novel and promising approach for learning useful representations from unlabeled breast ultrasound data to aid in tumor diagnosis. The use of triplet networks and hard triplet loss is an interesting technical contribution that helps the model focus on the most informative features.

One potential limitation is that the pretraining dataset, while large in image count, draws on only 200 patients, and the fine-tuning dataset on 66. It would be important to validate the approach on more diverse datasets from different hospitals and scanners to ensure the robustness of the learned representations.

Additionally, the paper does not provide much insight into the types of features or representations learned by the model. Further analysis of the learned embeddings and their clinical relevance could help build trust and understanding in the model's decision-making process.

Finally, while the data efficiency of the approach is a key strength, the researchers could explore ways to further reduce the need for labeled data, such as through more advanced self-supervised techniques or incorporating domain-specific knowledge into the model architecture.

Conclusion

This research presents a novel self-supervised learning framework that can effectively learn visual representations from unlabeled breast ultrasound data. By utilizing a triplet network and hard triplet loss, the model is able to achieve state-of-the-art performance on a benign/malignant classification task while requiring significantly less labeled data than traditional deep learning approaches.

The potential impact of this work is significant, as it could help reduce the burden and cost of developing automated breast ultrasound diagnosis systems. By leveraging unlabeled data, the approach holds promise for broader adoption and application in clinical settings, ultimately improving patient care and outcomes.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Breast tumor classification based on self-supervised contrastive learning from ultrasound videos

Yunxin Tang, Siyuan Tang, Jian Zhang, Hao Chen

Background: Breast ultrasound is prominently used in diagnosing breast tumors. At present, many automatic systems based on deep learning have been developed to help radiologists in diagnosis. However, training such systems remains challenging because they are usually data-hungry and demand large amounts of labeled data, which need professional knowledge and are expensive. Methods: We adopted a triplet network and a self-supervised contrastive learning technique to learn representations from unlabeled breast ultrasound video clips. We further designed a new hard triplet loss to learn representations that particularly discriminate positive and negative image pairs that are hard to recognize. We also constructed a pretraining dataset from breast ultrasound videos (1,360 videos from 200 patients), which includes an anchor sample dataset with 11,805 images, a positive sample dataset with 188,880 images, and a negative sample dataset dynamically generated from video clips. Further, we constructed a finetuning dataset, including 400 images from 66 patients. We transferred the pretrained network to a downstream benign/malignant classification task and compared the performance with other state-of-the-art models, including three models pretrained on ImageNet and a previous contrastive learning model retrained on our datasets. Results and conclusion: Experiments revealed that our model achieved an area under the receiver operating characteristic curve (AUC) of 0.952, which is significantly higher than the others. Further, we assessed the dependence of our pretrained model on the number of labeled data and revealed that <100 samples were required to achieve an AUC of 0.901. The proposed framework greatly reduces the demand for labeled data and holds potential for use in automatic breast ultrasound image diagnosis.

8/21/2024

Classification of Breast Cancer Histopathology Images using a Modified Supervised Contrastive Learning Method

Matina Mahdizadeh Sani, Ali Royat, Mahdieh Soleymani Baghshah

Deep neural networks have reached remarkable achievements in medical image processing tasks, specifically classifying and detecting various diseases. However, when confronted with limited data, these networks face a critical vulnerability, often succumbing to overfitting by excessively memorizing the limited information available. This work addresses the challenge mentioned above by improving the supervised contrastive learning method to reduce the impact of false positives. Unlike most existing methods that rely predominantly on fully supervised learning, our approach leverages the advantages of self-supervised learning in conjunction with employing the available labeled data. We evaluate our method on the BreakHis dataset, which consists of breast cancer histopathology images, and demonstrate an increase in classification accuracy by 1.45% at the image level and 1.42% at the patient level compared to the state-of-the-art method. This improvement corresponds to 93.63% absolute accuracy, highlighting our approach's effectiveness in leveraging data properties to learn more appropriate representation space.

5/7/2024

Multi-Attention Integrated Deep Learning Frameworks for Enhanced Breast Cancer Segmentation and Identification

Pandiyaraju V, Shravan Venkatraman, Pavan Kumar S, Santhosh Malarvannan, Kannan A

Breast cancer poses a profound threat to lives globally, claiming numerous lives each year. Therefore, timely detection is crucial for early intervention and improved chances of survival. Accurately diagnosing and classifying breast tumors using ultrasound images is a persistent challenge in medicine, demanding cutting-edge solutions for improved treatment strategies. This research introduces multiattention-enhanced deep learning (DL) frameworks designed for the classification and segmentation of breast cancer tumors from ultrasound images. A spatial channel attention mechanism is proposed for segmenting tumors from ultrasound images, utilizing a novel LinkNet DL framework with an InceptionResNet backbone. Following this, the paper proposes a deep convolutional neural network with an integrated multi-attention framework (DCNNIMAF) to classify the segmented tumor as benign, malignant, or normal. From experimental results, it is observed that the segmentation model has recorded an accuracy of 98.1%, with a minimal loss of 0.6%. It has also achieved high Intersection over Union (IoU) and Dice Coefficient scores of 96.9% and 97.2%, respectively. Similarly, the classification model has attained an accuracy of 99.2%, with a low loss of 0.31%. Furthermore, the classification framework has achieved outstanding F1-Score, precision, and recall values of 99.1%, 99.3%, and 99.1%, respectively. By offering a robust framework for early detection and accurate classification of breast cancer, this proposed work significantly advances the field of medical image analysis, potentially improving diagnostic precision and patient outcomes.

7/16/2024

Enhancing AI Diagnostics: Autonomous Lesion Masking via Semi-Supervised Deep Learning

Ting-Ruen Wei, Michele Hell, Dang Bich Thuy Le, Aren Vierra, Ran Pang, Mahesh Patel, Young Kang, Yuling Yan

This study presents an unsupervised domain adaptation method aimed at autonomously generating image masks outlining regions of interest (ROIs) for differentiating breast lesions in breast ultrasound (US) imaging. Our semi-supervised learning approach utilizes a primitive model trained on a small public breast US dataset with true annotations. This model is then iteratively refined for the domain adaptation task, generating pseudo-masks for our private, unannotated breast US dataset. The dataset, twice the size of the public one, exhibits considerable variability in image acquisition perspectives and demographic representation, posing a domain-shift challenge. Unlike typical domain adversarial training, we employ downstream classification outcomes as a benchmark to guide the updating of pseudo-masks in subsequent iterations. We found the classification precision to be highly correlated with the completeness of the generated ROIs, which promotes the explainability of the deep learning classification model. Preliminary findings demonstrate the efficacy and reliability of this approach in streamlining the ROI annotation process, thereby enhancing the classification and localization of breast lesions for more precise and interpretable diagnoses.

4/22/2024