CSCO: Connectivity Search of Convolutional Operators

Read original: arXiv:2404.17152 - Published 4/29/2024 by Tunhou Zhang, Shiyu Li, Hsin-Pai Cheng, Feng Yan, Hai Li, Yiran Chen

CSCO: Connectivity Search of Convolutional Operators

Overview

Explores a new approach called "Connectivity Search of Convolutional Operators" (CSCO) for designing effective convolutional neural network architectures
Introduces a dense connectivity design space that allows for more efficient exploration of network topologies
Demonstrates improved performance on various computer vision tasks compared to popular CNN architectures

Plain English Explanation

The paper introduces a new method called "Connectivity Search of Convolutional Operators" (CSCO) that helps design effective convolutional neural network (CNN) architectures. CNNs are a type of machine learning model widely used in computer vision tasks like image recognition.

The key idea behind CSCO is to expand the search space for CNN architectures beyond the typical "stacking" of convolutional layers. Instead, CSCO explores a "dense connectivity" design space, where each layer can receive inputs from multiple previous layers. This allows for more efficient exploration of different network topologies.

By leveraging this dense connectivity design space, the researchers show that CSCO can outperform popular CNN architectures like VGG, ResNet, and MobileNet on various computer vision tasks. The improvements demonstrate the potential of CSCO to help create more efficient and effective CNN models.

Technical Explanation

The paper introduces a new approach called "Connectivity Search of Convolutional Operators" (CSCO) for designing convolutional neural network (CNN) architectures. CSCO is motivated by the observation that the typical "stacking" of convolutional layers in popular CNN architectures may not be the most efficient way to explore the design space.

To address this, the paper proposes a "dense connectivity" design space, where each layer in the CNN can receive inputs from multiple previous layers. This allows for more flexible and efficient exploration of different network topologies. The researchers develop a search algorithm to efficiently navigate this dense connectivity design space and identify optimal network architectures.

The paper evaluates CSCO on various computer vision tasks, including image classification, object detection, and semantic segmentation. The results show that CSCO-designed CNN models outperform popular architectures like VGG, ResNet, and MobileNet in terms of accuracy and efficiency. The authors attribute these improvements to the increased flexibility and expressive power of the dense connectivity design space explored by CSCO.

Critical Analysis

The paper presents a compelling approach for designing more effective convolutional neural network architectures. The dense connectivity design space explored by CSCO is a promising direction for improving the efficiency and performance of CNNs.

However, the paper does not address the potential computational and memory overhead associated with the increased connectivity between layers in the CSCO-designed models. While the reported results show improvements in accuracy and efficiency, the tradeoffs in terms of resource requirements should be further investigated.

Additionally, the paper focuses on relatively standard computer vision tasks, such as image classification and object detection. It would be valuable to see how CSCO performs on more challenging or domain-specific tasks, as well as its applicability to other types of neural network architectures beyond CNNs.

Further research could also explore the interpretability and explainability of the CSCO-designed models, as the increased connectivity may make it more difficult to understand the internal workings and decision-making processes of the networks.

Conclusion

The paper introduces a novel approach called "Connectivity Search of Convolutional Operators" (CSCO) for designing effective convolutional neural network architectures. By exploring a dense connectivity design space, CSCO demonstrates improved performance on various computer vision tasks compared to popular CNN models like VGG, ResNet, and MobileNet.

The findings suggest that the increased flexibility and expressive power of the dense connectivity design space can lead to more efficient and effective CNN architectures. While the paper presents promising results, further research is needed to address potential computational and memory overhead, as well as to explore the broader applicability and interpretability of CSCO-designed models.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

CSCO: Connectivity Search of Convolutional Operators

Tunhou Zhang, Shiyu Li, Hsin-Pai Cheng, Feng Yan, Hai Li, Yiran Chen

Exploring dense connectivity of convolutional operators establishes critical synapses to communicate feature vectors from different levels and enriches the set of transformations on Computer Vision applications. Yet, even with heavy-machinery approaches such as Neural Architecture Search (NAS), discovering effective connectivity patterns requires tremendous efforts due to either constrained connectivity design space or a sub-optimal exploration process induced by an unconstrained search space. In this paper, we propose CSCO, a novel paradigm that fabricates effective connectivity of convolutional operators with minimal utilization of existing design motifs and further utilizes the discovered wiring to construct high-performing ConvNets. CSCO guides the exploration via a neural predictor as a surrogate of the ground-truth performance. We introduce Graph Isomorphism as data augmentation to improve sample efficiency and propose a Metropolis-Hastings Evolutionary Search (MH-ES) to evade locally optimal architectures and advance search quality. Results on ImageNet show ~0.6% performance improvement over hand-crafted and NAS-crafted dense connectivity. Our code is publicly available.

4/29/2024

Can Dense Connectivity Benefit Outlier Detection? An Odyssey with NAS

Hao Fu, Tunhou Zhang, Hai Li, Yiran Chen

Recent advances in Out-of-Distribution (OOD) Detection is the driving force behind safe and reliable deployment of Convolutional Neural Networks (CNNs) in real world applications. However, existing studies focus on OOD detection through confidence score and deep generative model-based methods, without considering the impact of DNN structures, especially dense connectivity in architecture fabrications. In addition, existing outlier detection approaches exhibit high variance in generalization performance, lacking stability and confidence in evaluating and ranking different outlier detectors. In this work, we propose a novel paradigm, Dense Connectivity Search of Outlier Detector (DCSOD), that automatically explore the dense connectivity of CNN architectures on near-OOD detection task using Neural Architecture Search (NAS). We introduce a hierarchical search space containing versatile convolution operators and dense connectivity, allowing a flexible exploration of CNN architectures with diverse connectivity patterns. To improve the quality of evaluation on OOD detection during search, we propose evolving distillation based on our multi-view feature learning explanation. Evolving distillation stabilizes training for OOD detection evaluation, thus improves the quality of search. We thoroughly examine DCSOD on CIFAR benchmarks under OOD detection protocol. Experimental results show that DCSOD achieve remarkable performance over widely used architectures and previous NAS baselines. Notably, DCSOD achieves state-of-the-art (SOTA) performance on CIFAR benchmark, with AUROC improvement of $sim$1.0%.

6/5/2024

Connectivity-Inspired Network for Context-Aware Recognition

Gianluca Carloni, Sara Colantonio

The aim of this paper is threefold. We inform the AI practitioner about the human visual system with an extensive literature review; we propose a novel biologically motivated neural network for image classification; and, finally, we present a new plug-and-play module to model context awareness. We focus on the effect of incorporating circuit motifs found in biological brains to address visual recognition. Our convolutional architecture is inspired by the connectivity of human cortical and subcortical streams, and we implement bottom-up and top-down modulations that mimic the extensive afferent and efferent connections between visual and cognitive areas. Our Contextual Attention Block is simple and effective and can be integrated with any feed-forward neural network. It infers weights that multiply the feature maps according to their causal influence on the scene, modeling the co-occurrence of different objects in the image. We place our module at different bottlenecks to infuse a hierarchical context awareness into the model. We validated our proposals through image classification experiments on benchmark data and found a consistent improvement in performance and the robustness of the produced explanations via class activation. Our code is available at https://github.com/gianlucarloni/CoCoReco.

9/9/2024

🧠

GRASP-GCN: Graph-Shape Prioritization for Neural Architecture Search under Distribution Shifts

Sofia Casarin, Oswald Lanz, Sergio Escalera

Neural Architecture Search (NAS) methods have shown to output networks that largely outperform human-designed networks. However, conventional NAS methods have mostly tackled the single dataset scenario, incuring in a large computational cost as the procedure has to be run from scratch for every new dataset. In this work, we focus on predictor-based algorithms and propose a simple and efficient way of improving their prediction performance when dealing with data distribution shifts. We exploit the Kronecker-product on the randomly wired search-space and create a small NAS benchmark composed of networks trained over four different datasets. To improve the generalization abilities, we propose GRASP-GCN, a ranking Graph Convolutional Network that takes as additional input the shape of the layers of the neural networks. GRASP-GCN is trained with the not-at-convergence accuracies, and improves the state-of-the-art of 3.3 % for Cifar-10 and increasing moreover the generalization abilities under data distribution shift.

5/14/2024