ApproxDARTS: Differentiable Neural Architecture Search with Approximate Multipliers

Read original: arXiv:2404.08002 - Published 4/15/2024 by Michal Pinos, Lukas Sekanina, Vojtech Mrazek

ApproxDARTS: Differentiable Neural Architecture Search with Approximate Multipliers

Overview

Introduces a novel approach called ApproxDARTS for neural architecture search that leverages approximate multipliers to reduce computational cost.
ApproxDARTS is a differentiable neural architecture search method that optimizes both the network architecture and the approximate multipliers jointly.
Evaluated on CIFAR-10, CIFAR-100, and ImageNet datasets, showing that ApproxDARTS can discover high-performing architectures with significant computational savings.

Plain English Explanation

ApproxDARTS is a new way to design efficient neural network architectures. Traditional neural architecture search (NAS) methods can be computationally expensive, as they require training many different network designs to find the best one.

ApproxDARTS takes a different approach. Instead of training full network models, it uses "approximate multipliers" - simplified math operations that are faster but slightly less accurate. By optimizing both the network architecture and the approximate multipliers at the same time, ApproxDARTS can discover high-performing architectures with significant computational savings.

This is similar to how Accel-NASBench and SpikeNAS consider hardware-specific constraints during the architecture search process. By taking the underlying hardware into account, these methods can find more efficient network designs.

ApproxDARTS goes a step further by directly incorporating approximations into the search process. This allows it to find architectures that work well with cheaper, faster math operations, leading to substantial performance improvements on real-world datasets like CIFAR-10, CIFAR-100, and ImageNet.

Technical Explanation

ApproxDARTS is a differentiable neural architecture search (DARTS) method that jointly optimizes the network architecture and the approximate multipliers used in the network operations.

The key idea is to relax the discrete search space of network architectures and approximate multipliers to a continuous representation, which can then be optimized using gradient-based methods. Specifically, ApproxDARTS learns a set of continuous architecture parameters and approximate multiplier parameters, which are used to construct the final network and hardware implementation.

The authors evaluate ApproxDARTS on image classification tasks using the CIFAR-10, CIFAR-100, and ImageNet datasets. Compared to the original DARTS method, ApproxDARTS is able to discover architectures that achieve similar or better accuracy while using significantly fewer multiply-accumulate (MAC) operations.

For example, on CIFAR-10, ApproxDARTS finds an architecture that is 2.4x more efficient (in terms of MACs) than the DARTS-discovered architecture, while maintaining a similar accuracy of around 97.5%. On ImageNet, ApproxDARTS improves the top-1 accuracy from 75.7% to 76.5% while reducing the number of MACs by 27%.

Critical Analysis

The key strength of ApproxDARTS is its ability to jointly optimize the neural network architecture and the underlying approximate multipliers, leading to substantial computational savings without sacrificing much model performance.

However, the paper does not provide a deep analysis of the limitations of this approach. For instance, it's unclear how ApproxDARTS would perform on more complex tasks or larger datasets, or how sensitive the results are to the choice of approximation methods.

Additionally, the paper does not explore the tradeoffs between the accuracy of the approximate multipliers and the overall model performance. Further research could investigate how different levels of approximation affect the discovered architectures and their generalization capabilities.

It would also be interesting to see how ApproxDARTS compares to other hardware-aware NAS methods, such as 1-Bit Quantized Chip or Exploring Quantization-Mapping Synergy, which aim to optimize for specific hardware constraints and capabilities.

Conclusion

In summary, ApproxDARTS is a promising approach for efficient neural architecture search that takes into account the computational characteristics of the underlying hardware. By jointly optimizing the network architecture and the approximate multipliers, ApproxDARTS can discover high-performing models with substantial computational savings.

This work highlights the importance of considering hardware-specific constraints and opportunities during the neural architecture design process. As AI models continue to grow in complexity, techniques like ApproxDARTS will become increasingly valuable for developing efficient and deployable AI systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

ApproxDARTS: Differentiable Neural Architecture Search with Approximate Multipliers

Michal Pinos, Lukas Sekanina, Vojtech Mrazek

Integrating the principles of approximate computing into the design of hardware-aware deep neural networks (DNN) has led to DNNs implementations showing good output quality and highly optimized hardware parameters such as low latency or inference energy. In this work, we present ApproxDARTS, a neural architecture search (NAS) method enabling the popular differentiable neural architecture search method called DARTS to exploit approximate multipliers and thus reduce the power consumption of generated neural networks. We showed on the CIFAR-10 data set that the ApproxDARTS is able to perform a complete architecture search within less than $10$ GPU hours and produce competitive convolutional neural networks (CNN) containing approximate multipliers in convolutional layers. For example, ApproxDARTS created a CNN showing an energy consumption reduction of (a) $53.84%$ in the arithmetic operations of the inference phase compared to the CNN utilizing the native $32$-bit floating-point multipliers and (b) $5.97%$ compared to the CNN utilizing the exact $8$-bit fixed-point multipliers, in both cases with a negligible accuracy drop. Moreover, the ApproxDARTS is $2.3times$ faster than a similar but evolutionary algorithm-based method called EvoApproxNAS.

4/15/2024

Exploring DNN Robustness Against Adversarial Attacks Using Approximate Multipliers

Mohammad Javad Askarizadeh, Ebrahim Farahmand, Jorge Castro-Godinez, Ali Mahani, Laura Cabrera-Quiros, Carlos Salazar-Garcia

Deep Neural Networks (DNNs) have advanced in many real-world applications, such as healthcare and autonomous driving. However, their high computational complexity and vulnerability to adversarial attacks are ongoing challenges. In this letter, approximate multipliers are used to explore DNN robustness improvement against adversarial attacks. By uniformly replacing accurate multipliers for state-of-the-art approximate ones in DNN layer models, we explore the DNNs robustness against various adversarial attacks in a feasible time. Results show up to 7% accuracy drop due to approximations when no attack is present while improving robust accuracy up to 10% when attacks applied.

4/19/2024

NASH: Neural Architecture and Accelerator Search for Multiplication-Reduced Hybrid Models

Yang Xu, Huihong Shi, Zhongfeng Wang

The significant computational cost of multiplications hinders the deployment of deep neural networks (DNNs) on edge devices. While multiplication-free models offer enhanced hardware efficiency, they typically sacrifice accuracy. As a solution, multiplication-reduced hybrid models have emerged to combine the benefits of both approaches. Particularly, prior works, i.e., NASA and NASA-F, leverage Neural Architecture Search (NAS) to construct such hybrid models, enhancing hardware efficiency while maintaining accuracy. However, they either entail costly retraining or encounter gradient conflicts, limiting both search efficiency and accuracy. Additionally, they overlook the acceleration opportunity introduced by accelerator search, yielding sub-optimal hardware performance. To overcome these limitations, we propose NASH, a Neural architecture and Accelerator Search framework for multiplication-reduced Hybrid models. Specifically, as for NAS, we propose a tailored zero-shot metric to pre-identify promising hybrid models before training, enhancing search efficiency while alleviating gradient conflicts. Regarding accelerator search, we innovatively introduce coarse-to-fine search to streamline the search process. Furthermore, we seamlessly integrate these two levels of searches to unveil NASH, obtaining the optimal model and accelerator pairing. Experiments validate our effectiveness, e.g., when compared with the state-of-the-art multiplication-based system, we can achieve $uparrow$$2.14times$ throughput and $uparrow$$2.01times$ FPS with $uparrow$$0.25%$ accuracy on CIFAR-100, and $uparrow$$1.40times$ throughput and $uparrow$$1.19times$ FPS with $uparrow$$0.56%$ accuracy on Tiny-ImageNet. Codes are available at url{https://github.com/xuyang527/NASH.}

9/10/2024

Heterogeneous Learning Rate Scheduling for Neural Architecture Search on Long-Tailed Datasets

Chenxia Tang

In this paper, we attempt to address the challenge of applying Neural Architecture Search (NAS) algorithms, specifically the Differentiable Architecture Search (DARTS), to long-tailed datasets where class distribution is highly imbalanced. We observe that traditional re-sampling and re-weighting techniques, which are effective in standard classification tasks, lead to performance degradation when combined with DARTS. To mitigate this, we propose a novel adaptive learning rate scheduling strategy tailored for the architecture parameters of DARTS when integrated with the Bilateral Branch Network (BBN) for handling imbalanced datasets. Our approach dynamically adjusts the learning rate of the architecture parameters based on the training epoch, preventing the disruption of well-trained representations in the later stages of training. Additionally, we explore the impact of branch mixing factors on the algorithm's performance. Through extensive experiments on the CIFAR-10 dataset with an artificially induced long-tailed distribution, we demonstrate that our method achieves comparable accuracy to using DARTS alone. And the experiment results suggest that re-sampling methods inherently harm the performance of the DARTS algorithm. Our findings highlight the importance of careful data augment when applying DNAS to imbalanced learning scenarios.

6/12/2024