Dataset Quantization with Active Learning based Adaptive Sampling

Read original: arXiv:2407.07268 - Published 7/11/2024 by Zhenghao Zhao, Yuzhang Shang, Junyi Wu, Yan Yan

Dataset Quantization with Active Learning based Adaptive Sampling

Overview

This paper proposes a novel dataset quantization method that combines active learning and adaptive sampling to efficiently reduce the size of large datasets while preserving their accuracy.
The method selects a small subset of the dataset that captures the most important information, and then quantizes the remaining data to further reduce its size.
Experiments show that the proposed method outperforms existing dataset distillation and coreset selection techniques in terms of accuracy and compression ratio.

Plain English Explanation

The paper presents a way to make large datasets smaller without losing too much of their accuracy. This is important because big datasets can be slow and expensive to work with, especially for AI models.

The key idea is to use active learning to identify a small subset of the data that captures the most important information. This "coreset" is kept at full resolution, while the rest of the dataset is quantized, or compressed, using an adaptive bit-width quantization technique.

This two-step process - selecting a representative subset and then compressing the rest - allows the dataset to be significantly reduced in size while maintaining most of its accuracy. The authors show that this outperforms other methods for dataset distillation and coreset selection.

Technical Explanation

The proposed method, called ADAPT (Adaptive Data Quantization with Active Learning), consists of two main components:

Coreset Selection: The authors use an active learning-based approach to select a small subset of the dataset that captures the most important information. This "coreset" is kept at full resolution.
Adaptive Quantization: The remaining data points outside the coreset are quantized using an adaptive bit-width quantization technique. The bit-width for each dimension is determined adaptively based on the distribution of the data.

The authors evaluate ADAPT on several image classification and language modeling tasks and compare it to existing dataset distillation and coreset selection methods. They show that ADAPT achieves higher accuracy and better compression ratios compared to these baselines.

Critical Analysis

The authors provide a thorough evaluation of ADAPT, including comparisons to state-of-the-art methods and ablation studies to understand the contributions of the different components. However, there are a few potential limitations and areas for future research:

The adaptive quantization approach may not be as effective for datasets with complex, high-dimensional structures. Additional techniques may be needed to handle these cases.
The active learning-based coreset selection could be further improved by incorporating uncertainty estimates to better identify the most informative data points.
The authors only consider image classification and language modeling tasks. It would be valuable to assess the performance of ADAPT on a wider range of applications, such as medical imaging or reinforcement learning.

Overall, the ADAPT method presents a promising approach for efficiently compressing large datasets while preserving their accuracy, with potential applications in a variety of AI and machine learning domains.

Conclusion

The ADAPT method offers a novel way to reduce the size of large datasets without sacrificing too much of their accuracy. By combining active learning for coreset selection and adaptive quantization, the authors have demonstrated significant improvements over existing dataset distillation and coreset selection techniques.

This research has important implications for improving the efficiency and scalability of AI systems, which often rely on large, high-quality datasets. By reducing the size of these datasets, ADAPT can help reduce the computational and storage requirements, making AI models more accessible and deployable in a wide range of real-world applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Dataset Quantization with Active Learning based Adaptive Sampling

Zhenghao Zhao, Yuzhang Shang, Junyi Wu, Yan Yan

Deep learning has made remarkable progress recently, largely due to the availability of large, well-labeled datasets. However, the training on such datasets elevates costs and computational demands. To address this, various techniques like coreset selection, dataset distillation, and dataset quantization have been explored in the literature. Unlike traditional techniques that depend on uniform sample distributions across different classes, our research demonstrates that maintaining performance is feasible even with uneven distributions. We find that for certain classes, the variation in sample quantity has a minimal impact on performance. Inspired by this observation, an intuitive idea is to reduce the number of samples for stable classes and increase the number of samples for sensitive classes to achieve a better performance with the same sampling ratio. Then the question arises: how can we adaptively select samples from a dataset to achieve optimal performance? In this paper, we propose a novel active learning based adaptive sampling strategy, Dataset Quantization with Active Learning based Adaptive Sampling (DQAS), to optimize the sample selection. In addition, we introduce a novel pipeline for dataset quantization, utilizing feature space from the final stage of dataset quantization to generate more precise dataset bins. Our comprehensive evaluations on the multiple datasets show that our approach outperforms the state-of-the-art dataset compression methods.

7/11/2024

Efficient and Robust Quantization-aware Training via Adaptive Coreset Selection

Xijie Huang, Zechun Liu, Shih-Yang Liu, Kwang-Ting Cheng

Quantization-aware training (QAT) is a representative model compression method to reduce redundancy in weights and activations. However, most existing QAT methods require end-to-end training on the entire dataset, which suffers from long training time and high energy costs. In addition, the potential label noise in the training data undermines the robustness of QAT. We propose two metrics based on analysis of loss and gradient of quantized weights: error vector score and disagreement score, to quantify the importance of each sample during training. Guided by these two metrics, we proposed a quantization-aware Adaptive Coreset Selection (ACS) method to select the data for the current training epoch. We evaluate our method on various networks (ResNet-18, MobileNetV2, RetinaNet), datasets(CIFAR-10, CIFAR-100, ImageNet-1K, COCO), and under different quantization settings. Specifically, our method can achieve an accuracy of 68.39% of 4-bit quantized ResNet-18 on the ImageNet-1K dataset with only a 10% subset, which has an absolute gain of 4.24% compared to the baseline. Our method can also improve the robustness of QAT by removing noisy samples in the training set.

8/21/2024

🏋️

AdaQAT: Adaptive Bit-Width Quantization-Aware Training

C'edric Gernigon (TARAN), Silviu-Ioan Filip (TARAN), Olivier Sentieys (TARAN), Cl'ement Coggiola (CNES), Mickael Bruno (CNES)

Large-scale deep neural networks (DNNs) have achieved remarkable success in many application scenarios. However, high computational complexity and energy costs of modern DNNs make their deployment on edge devices challenging. Model quantization is a common approach to deal with deployment constraints, but searching for optimized bit-widths can be challenging. In this work, we present Adaptive Bit-Width Quantization Aware Training (AdaQAT), a learning-based method that automatically optimizes weight and activation signal bit-widths during training for more efficient DNN inference. We use relaxed real-valued bit-widths that are updated using a gradient descent rule, but are otherwise discretized for all quantization operations. The result is a simple and flexible QAT approach for mixed-precision uniform quantization problems. Compared to other methods that are generally designed to be run on a pretrained network, AdaQAT works well in both training from scratch and fine-tuning scenarios.Initial results on the CIFAR-10 and ImageNet datasets using ResNet20 and ResNet18 models, respectively, indicate that our method is competitive with other state-of-the-art mixed-precision quantization approaches.

4/29/2024

QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning

Haoxuan Wang, Yuzhang Shang, Zhihang Yuan, Junyi Wu, Junchi Yan, Yan Yan

The practical deployment of diffusion models still suffers from the high memory and time overhead. While quantization paves a way for compression and acceleration, existing methods unfortunately fail when the models are quantized to low-bits. In this paper, we empirically unravel three properties in quantized diffusion models that compromise the efficacy of current methods: imbalanced activation distributions, imprecise temporal information, and vulnerability to perturbations of specific modules. To alleviate the intensified low-bit quantization difficulty stemming from the distribution imbalance, we propose finetuning the quantized model to better adapt to the activation distribution. Building on this idea, we identify two critical types of quantized layers: those holding vital temporal information and those sensitive to reduced bit-width, and finetune them to mitigate performance degradation with efficiency. We empirically verify that our approach modifies the activation distribution and provides meaningful temporal information, facilitating easier and more accurate quantization. Our method is evaluated over three high-resolution image generation tasks and achieves state-of-the-art performance under various bit-width settings, as well as being the first method to generate readable images on full 4-bit (i.e. W4A4) Stable Diffusion. Code is available href{https://github.com/hatchetProject/QuEST}{here}.

9/9/2024