Compressing VAE-Based Out-of-Distribution Detectors for Embedded Deployment

Read original: arXiv:2409.00880 - Published 9/4/2024 by Aditya Bansal, Michael Yuhas, Arvind Easwaran

Compressing VAE-Based Out-of-Distribution Detectors for Embedded Deployment

Overview

Researchers develop a compressed version of a VAE-based out-of-distribution detector for deployment on embedded devices.
The work is part of the DesCartes programme and funded by the National Research Foundation, Singapore.
The goal is to enable deployment of advanced out-of-distribution detection models on resource-constrained hardware.

Plain English Explanation

Out-of-distribution (OOD) detection is the task of identifying when an input is significantly different from the data a machine learning model was trained on. This is an important capability, as it allows models to recognize when they may produce unreliable outputs. Variational Autoencoders (VAEs) are a type of deep learning model that can be effective for OOD detection.

However, VAE-based OOD detectors can be computationally expensive, making it challenging to deploy them on embedded devices with limited processing power and memory. This research aimed to develop a compressed version of a VAE-based OOD detector that could run efficiently on resource-constrained hardware.

The researchers used techniques like model pruning and quantization to reduce the size and complexity of the VAE model, without significantly impacting its OOD detection performance. This allows the compressed model to be deployed on embedded devices, enabling advanced OOD detection capabilities even in low-power settings.

Technical Explanation

The researchers start with a baseline VAE-based OOD detector, which uses the reconstruction error of the VAE as the OOD score. They then apply several model compression techniques to reduce the model size and complexity:

Model Pruning: The researchers prune the network by removing connections with small weights, reducing the number of parameters in the model.
Weight Quantization: The full-precision weights of the model are quantized to lower bit-widths (e.g., 8-bit), reducing the memory footprint.
Knowledge Distillation: A smaller "student" model is trained to mimic the behavior of the original "teacher" model, further compressing the model size.

The compressed OOD detector is evaluated on several benchmark datasets, and the researchers demonstrate that it maintains similar OOD detection performance to the original, uncompressed model while achieving significant reductions in model size and inference latency.

Critical Analysis

The researchers acknowledge that their compressed model may not achieve the same level of OOD detection performance as the original, uncompressed model. They suggest that further research could explore ways to better preserve the OOD detection capabilities during the compression process.

Additionally, the paper does not provide a detailed analysis of the trade-offs between model size, inference latency, and OOD detection performance. It would be helpful to see more information on the specific performance metrics and how they are impacted by the different compression techniques.

Finally, the researchers could have considered other model compression methods, such as low-rank factorization or neural architecture search, to explore the design space further and potentially achieve even greater compression without significant performance degradation.

Conclusion

This research demonstrates a promising approach for compressing VAE-based OOD detectors to enable their deployment on embedded devices. By leveraging techniques like model pruning and weight quantization, the researchers were able to significantly reduce the model size and inference latency while maintaining similar OOD detection performance.

The ability to deploy advanced OOD detection capabilities on resource-constrained hardware has important implications for a wide range of applications, from autonomous vehicles to edge computing devices. This work represents an important step towards bringing these advanced ML models to the real world, where they can have a tangible impact.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Compressing VAE-Based Out-of-Distribution Detectors for Embedded Deployment

Aditya Bansal, Michael Yuhas, Arvind Easwaran

Out-of-distribution (OOD) detectors can act as safety monitors in embedded cyber-physical systems by identifying samples outside a machine learning model's training distribution to prevent potentially unsafe actions. However, OOD detectors are often implemented using deep neural networks, which makes it difficult to meet real-time deadlines on embedded systems with memory and power constraints. We consider the class of variational autoencoder (VAE) based OOD detectors where OOD detection is performed in latent space, and apply quantization, pruning, and knowledge distillation. These techniques have been explored for other deep models, but no work has considered their combined effect on latent space OOD detection. While these techniques increase the VAE's test loss, this does not correspond to a proportional decrease in OOD detection performance and we leverage this to develop lean OOD detectors capable of real-time inference on embedded CPUs and GPUs. We propose a design methodology that combines all three compression techniques and yields a significant decrease in memory and execution time while maintaining AUROC for a given OOD detector. We demonstrate this methodology with two existing OOD detectors on a Jetson Nano and reduce GPU and CPU inference time by 20% and 28% respectively while keeping AUROC within 5% of the baseline.

9/4/2024

Towards Out-of-Distribution Detection in Vocoder Recognition via Latent Feature Reconstruction

Renmingyue Du, Jixun Yao, Qiuqiang Kong, Yin Cao

Advancements in synthesized speech have created a growing threat of impersonation, making it crucial to develop deepfake algorithm recognition. One significant aspect is out-of-distribution (OOD) detection, which has gained notable attention due to its important role in deepfake algorithm recognition. However, most of the current approaches for detecting OOD in deepfake algorithm recognition rely on probability-score or classified-distance, which may lead to limitations in the accuracy of the sample at the edge of the threshold. In this study, we propose a reconstruction-based detection approach that employs an autoencoder architecture to compress and reconstruct the acoustic feature extracted from a pre-trained WavLM model. Each acoustic feature belonging to a specific vocoder class is only aptly reconstructed by its corresponding decoder. When none of the decoders can satisfactorily reconstruct a feature, it is classified as an OOD sample. To enhance the distinctiveness of the reconstructed features by each decoder, we incorporate contrastive learning and an auxiliary classifier to further constrain the reconstructed feature. Experiments demonstrate that our proposed approach surpasses baseline systems by a relative margin of 10% in the evaluation dataset. Ablation studies further validate the effectiveness of both the contrastive constraint and the auxiliary classifier within our proposed approach.

6/5/2024

Continual Unsupervised Out-of-Distribution Detection

Lars Doorenbos, Raphael Sznitman, Pablo M'arquez-Neila

Deep learning models excel when the data distribution during training aligns with testing data. Yet, their performance diminishes when faced with out-of-distribution (OOD) samples, leading to great interest in the field of OOD detection. Current approaches typically assume that OOD samples originate from an unconcentrated distribution complementary to the training distribution. While this assumption is appropriate in the traditional unsupervised OOD (U-OOD) setting, it proves inadequate when considering the place of deployment of the underlying deep learning model. To better reflect this real-world scenario, we introduce the novel setting of continual U-OOD detection. To tackle this new setting, we propose a method that starts from a U-OOD detector, which is agnostic to the OOD distribution, and slowly updates during deployment to account for the actual OOD distribution. Our method uses a new U-OOD scoring function that combines the Mahalanobis distance with a nearest-neighbor approach. Furthermore, we design a confidence-scaled few-shot OOD detector that outperforms previous methods. We show our method greatly improves upon strong baselines from related fields.

6/5/2024

Fast Decision Boundary based Out-of-Distribution Detector

Litian Liu, Yao Qin

Efficient and effective Out-of-Distribution (OOD) detection is essential for the safe deployment of AI systems. Existing feature space methods, while effective, often incur significant computational overhead due to their reliance on auxiliary models built from training features. In this paper, we propose a computationally-efficient OOD detector without using auxiliary models while still leveraging the rich information embedded in the feature space. Specifically, we detect OOD samples based on their feature distances to decision boundaries. To minimize computational cost, we introduce an efficient closed-form estimation, analytically proven to tightly lower bound the distance. Based on our estimation, we discover that In-Distribution (ID) features tend to be further from decision boundaries than OOD features. Additionally, ID and OOD samples are better separated when compared at equal deviation levels from the mean of training features. By regularizing the distances to decision boundaries based on feature deviation from the mean, we develop a hyperparameter-free, auxiliary model-free OOD detector. Our method matches or surpasses the effectiveness of state-of-the-art methods in extensive experiments while incurring negligible overhead in inference latency. Overall, our approach significantly improves the efficiency-effectiveness trade-off in OOD detection. Code is available at: https://github.com/litianliu/fDBD-OOD.

6/5/2024