Introduction

Due to the fragility of tiny neural networks (TNNs), there is a critical need to systematically investigate how to design robust TNNs. It is well known that deep neural networks are susceptible to attacks that introduce subtle perturbations to the input data [1, 2]. To defend against such attacks, current studies [3, 4] investigate the relationship between a model's capacity and its adversarial robustness using ResNet [5] as the backbone network, and it has been shown that adding more parameters can greatly improve a model's resilience. However, despite the widespread use of TNNs in mobile applications, where they typically contain 10 K to 2 M parameters, little research concentrates on improving their robustness. Hence, the main goal of our research is to re-design the TNN architecture to enhance its robustness.

Designing the neural network architecture is a promising way to enhance robustness against adversarial examples. Previous studies [3, 4, 6, 7] have illustrated the significance of the neural architecture for adversarial robustness. Huang et al. [8] provide a comprehensive investigation of the impact of network width and depth on robustness. Liu et al. [9] employ multi-objective NAS to search for robust neural network architectures. However, most of these works neither target a desired FLOP count nor address the tiny architecture design issue. We presume that the tiny network architecture itself can confer adversarial robustness. Thus, the purpose of our work is to investigate the best trade-off architecture and present a design principle for compact, resilient network architectures.

To identify the best tiny neural network architecture in terms of clean accuracy, adversarial accuracy, and model size, we employ a multi-objective architecture search algorithm to find the best trade-off architecture design. Most existing works adopt adversarial training [2, 10, 11] to increase the robustness of the model. However, most are solely concerned with enhancing resilience, ignoring the degradation of clean accuracy. Hence, we employ a multi-objective NAS approach [12,13,14] to find the architectures that best trade off adversarial accuracy, clean accuracy, and model size. In our work, we mainly address this trade-off problem based on the ShuffleNetV2 architecture [15], the Xception block [16], the SE layer [17], the Non-Local block [18], and their variants.

The contributions of this work can be summarized as follows:

  • To find the best trade-off neural networks between adversarial accuracy, clean accuracy, and model size, we propose three novel tiny robust blocks. Due to their inherent self-attention mechanism, the layer-wise combination of these three blocks can increase robustness without substantially degrading clean performance.

  • We explore a new adversarial training paradigm for the supernet. Because the subnets heavily rely on the supernet in one-shot NAS, the adversarial performance of the subnets can be further improved using our proposed training paradigm. To this end, we examine how the width of the supernet, the perturbation range, and the number of attack steps for the supernet adversarial training affect the performance of the subnets.

  • We seamlessly integrate a multi-objective search algorithm with a one-shot NAS algorithm. After the search process, we can get the non-dominated front immediately, which makes it easier to find the best trade-off subnets. In addition, we discover that training from scratch outperforms fine-tuning for the non-dominated subnets.

  • We provide guidelines for how to design tiny robust neural networks. First, pure robust blocks and tiny robust blocks should be placed in the shallow layers, while pure tiny blocks should be placed in the deep layers. Second, wider intermediate channels should be used in the shallow layers, whereas the intermediate channels should decrease gradually in the remaining layers. Finally, we rebuild a tiny neural network following these guidelines and find that it reduces the model size while increasing both the adversarial accuracy and the clean accuracy.

Related work

Adversarial training, which uses both clean and adversarial images for training, is the most popular defensive mechanism against adversarial attacks [2]. Drawing on game theory [19], the work in Ref. [10] reformulates the min–max optimization problem of adversarial learning as a Nash equilibrium [20]. The game-theory-based optimization method [21] can effectively reduce the high computational cost without sacrificing adversarial accuracy. In addition, the work in Ref. [11] has empirically demonstrated the trade-off between adversarial accuracy and clean accuracy. Kannan et al. [22] and Zhang et al. [23] develop surrogate loss functions to reduce the disparity between clean images and their adversarial counterparts. Madry et al. [3] conclude that increasing the capacity of a neural network might enhance its performance against adversarial attacks. However, most neural network models deployed on electronic devices are quite small due to energy consumption and storage limitations. Hence, we aim to figure out which kinds of tiny neural network architectures are effective for resisting adversarial perturbations.

White-box attacks assume that the adversary knows detailed information about the targeted model, including its architecture, hyperparameters, gradients, and training data. In the following, we use \(X^{*}\) and X to denote the adversarial and clean examples, respectively, and \(\nabla _{X}\, l\) denotes the gradient of the loss function l with respect to X.

Fast Gradient Sign Method (FGSM). FGSM is a one-step, non-targeted attack that generates adversarial examples by adding a perturbation along the direction of the gradient sign at each pixel [2]. The generated adversarial examples can be calculated by

$$\begin{aligned} X^{*} = X + \epsilon \cdot \textrm{sign} \left( \nabla _{X}\,l(X, y_{true}) \right) , \end{aligned}$$
(1)

where \(\epsilon \) is a hyperparameter that controls the magnitude of the perturbation, and \(y_{true}\) is the ground-truth label.
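To make Eq. (1) concrete, the following is a minimal PyTorch sketch of FGSM; it assumes inputs normalized to [0, 1] and a `model` that returns class logits, and is an illustration rather than the exact implementation used in this work.

```python
import torch
import torch.nn.functional as F

def fgsm_attack(model, x, y_true, epsilon=8 / 255):
    """One-step FGSM (Eq. 1): move each pixel by epsilon along the gradient sign."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), y_true)
    loss.backward()
    with torch.no_grad():
        x_adv = x_adv + epsilon * x_adv.grad.sign()
        x_adv = x_adv.clamp(0.0, 1.0)  # stay in the valid image range
    return x_adv.detach()
```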

Projected Gradient Descent (PGD). The PGD attack [3], which combines random initialization with multi-step updates, is one of the strongest first-order adversarial attacks and a standard benchmark for adversarially trained models. The adversarial examples generated by the PGD attack can be expressed as

$$\begin{aligned} X_{0}^{*} = X + {\mathcal {U}} \left( -\epsilon , \epsilon \right) , \end{aligned}$$
(2)
$$\begin{aligned} X_{n+1}^{*} = \prod _{X, \epsilon } \left\{ X_{n}^{*} + \alpha \cdot \textrm{sign} \left( \nabla _{X^{*}_{n}}\,l(X_{n}^{*}, y_{true}) \right) \right\} , \end{aligned}$$
(3)

where \({\mathcal {U}}\left( -\epsilon , \epsilon \right) \) is a uniform distribution, \(X_{n}^{*}\) is the adversarial example after n steps, \(\alpha \) is the step size, and \(\prod _{X,\epsilon }(\cdot )\) denotes the projection onto the \(\epsilon \)-ball \(B(X,\epsilon )\) around X.
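A corresponding sketch of Eqs. (2) and (3), again assuming a logits-returning `model` and inputs in [0, 1]; the default epsilon and step count match the paper's evaluation setting, while the step size `alpha` is an assumed value.

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y_true, epsilon=8 / 255, alpha=2 / 255, steps=10):
    """l_inf PGD: random start (Eq. 2), then projected gradient-sign steps (Eq. 3)."""
    x_adv = (x + torch.empty_like(x).uniform_(-epsilon, epsilon)).clamp(0.0, 1.0)
    for _ in range(steps):
        x_adv = x_adv.detach().requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y_true)
        grad = torch.autograd.grad(loss, x_adv)[0]
        with torch.no_grad():
            x_adv = x_adv + alpha * grad.sign()
            x_adv = x + (x_adv - x).clamp(-epsilon, epsilon)  # project onto B(X, eps)
            x_adv = x_adv.clamp(0.0, 1.0)
    return x_adv.detach()
```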

Neural architecture search (NAS) aims to replace hand-crafted architecture design with automated design using machine learning techniques. Representative search algorithms include evolutionary algorithms [24, 25], reinforcement learning [26, 27], and gradient-based methods [28,29,30]. In one-shot NAS [31], a supernet is constructed that is capable of generating any potential architecture in the search space. The work in Ref. [31] trains the supernet only once; during the search, the fitness values of different subnets can then be retrieved through weight sharing with the supernet. However, most of these methods employ a single-objective optimization approach, which is not well suited to solving the trade-off problem [11]. To solve the multi-objective optimization problem, we adopt the elitist non-dominated sorting genetic algorithm (NSGA-II) [32] as our search algorithm. Recently, several papers have explored the impact of network width and depth on robustness [8, 9]. In addition, Huang et al. [33] propose a robust residual block and a compound scaling rule to investigate the influence of network width and depth. By contrast, TAM-NAS pays more attention to the adversarial resilience of tiny neural networks.

TAM-NAS

Our TAM-NAS approach consists of four steps. First, we design a supernet search space and uniformly sample different candidates from the supernet, so that a single supernet can represent many subnet architectures. Second, we train the candidates sampled from the supernet on adversarial examples to make them more robust against adversarial attacks. Third, we perform a multi-objective search for new subnets using NSGA-II [32], evaluating the clean accuracy, adversarial accuracy, and number of parameters of each subnet with weights cloned from the pre-trained supernet. Finally, we train from scratch or fine-tune each subnet on the first non-dominated front and evaluate its performance on the test dataset. Figure 1 shows the overall framework.

Fig. 1

The overall framework of TAM-NAS. The first step is to design a supernet search space and uniformly sample new subnet candidates; the sampling strategy is divided into a block sampling phase and a block and channel jointly sampling phase. The second step is to adversarially train the subnets sampled from the supernet. The third step is to perform a multi-objective search for the best trade-off subnets, where the fitness values of the subnets are obtained by cloning weights from the supernet. The final step is to train from scratch or fine-tune the non-dominated subnets

Problem definition

Without loss of generality, our supernet search space \({\mathcal {A}}\) can be represented by a directed acyclic graph (DAG), denoted as \({\mathcal {N}}({\mathcal {A}}, W)\), where W denotes the weights of the supernet. A subnet architecture is a subgraph \(a \in {\mathcal {A}}\), denoted as \({\mathcal {N}}(a, w)\), where w denotes the weights of the subnet. \(\Gamma ({\mathcal {A}})\) is a prior distribution over \(a \in {\mathcal {A}}\), and \({\mathcal {L}}_{adv-train}\left( \cdot \right) \) is the loss function on the adversarial training examples. The most important requirement for TAM-NAS is that the performance of subnets using weights inherited from the supernet (without extra fine-tuning or training from scratch) should be highly predictive of their final performance. In other words, the supernet weights \(W_{{\mathcal {A}}}\) should be optimized such that all subnet architectures in the search space \({\mathcal {A}}\) are optimized simultaneously. This can be expressed as

$$\begin{aligned} W_{{\mathcal {A}}} = \underset{W}{\textrm{argmin}} {\mathbb {E}}_{a \sim \Gamma ({\mathcal {A}})} \left[ {\mathcal {L}}_{adv-train}({\mathcal {N}}(a, W(a))) \right] . \end{aligned}$$
(4)
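In practice, the expectation in Eq. (4) is approximated stochastically: each training step draws one subnet from \(\Gamma ({\mathcal {A}})\) and updates only the shared weights on its path. A minimal sketch under assumptions: `supernet(x, arch)` is a hypothetical forward pass routed by the architecture encoding, and `adv_loss_fn` stands in for \({\mathcal {L}}_{adv-train}\).

```python
import random

def train_supernet_step(supernet, choice_space, batch, optimizer, adv_loss_fn):
    """One stochastic step of Eq. (4): sample a ~ Gamma(A), update W(a) only."""
    x, y = batch
    # Uniform sampling: one (block, channel) choice per layer
    arch = [random.choice(layer_choices) for layer_choices in choice_space]
    optimizer.zero_grad()
    loss = adv_loss_fn(lambda inp: supernet(inp, arch), x, y)
    loss.backward()  # gradients flow only through the sampled path
    optimizer.step()
    return loss.item()
```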

After the supernet training is finished, the next step is to find a set of Pareto-optimal subnets \(a^{*} \in {\mathcal {A}}\) with respect to our three objectives: the adversarial error, the clean error, and the model size. This can be expressed as

$$\begin{aligned} \begin{aligned}&\min \left\{ f_{1}(a^{*}), f_{2}(a^{*}), f_{3}(a^{*})\right\} , \\&\text {s.t.}\; a^{*} \in {\mathcal {A}}, \end{aligned} \end{aligned}$$
(5)

where \(f_{1}, f_{2}, f_{3}\) are the three objectives: the adversarial error, the clean error, and the model size, respectively. Figure 2 shows the pipeline of our multi-objective one-shot NAS. In practice, the three objectives cannot be minimized simultaneously, since each objective has a strong trade-off relationship with the other two. For instance, a larger model tends to have a smaller adversarial error and clean error. Our aim is to obtain a tiny model with competitive performance on both the adversarial and clean datasets.
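The Pareto-dominance relation underlying Eq. (5) can be stated as a minimal check; here each subnet is summarized by the 3-tuple (adversarial error, clean error, model size), all to be minimized. A subnet lies on the non-dominated front exactly when no other subnet dominates it.

```python
def dominates(f_a, f_b):
    """True if objective vector f_a Pareto-dominates f_b (all objectives minimized)."""
    return all(a <= b for a, b in zip(f_a, f_b)) and \
           any(a < b for a, b in zip(f_a, f_b))
```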

Fig. 2

The proposed multi-objective one-shot NAS framework. (1) Design the supernet search space and the loss function. (2) Sample different subnet architectures from the supernet under a fixed distribution and train each for a small number of epochs. (3) Perform a multi-objective search for the best trade-off subnets, obtaining the fitness values of the subnets by cloning weights from the supernet. (4) Train the non-dominated subnets

Table 1 Supernet architecture

Search space design

Since we aim to search for tiny robust neural networks, our supernet adopts one of the state-of-the-art hand-crafted tiny network architectures, ShuffleNetV2 [15], as the backbone model. Since our experiments are mainly conducted on the CIFAR-10 [34] and SVHN [35] datasets, the depth and width of the supernet differ from those of the original ShuffleNetV2, which was designed for the ImageNet dataset [36]. Table 1 shows the parameter settings of the overall supernet architecture. BN denotes a batch normalization layer; \(3\times 3\) Conv denotes a convolutional layer with kernel size 3; CB denotes a choice block selected from our pre-defined block search space; SE denotes the SE layer [17]. Moreover, we design a search space for the channel number of each choice block. In total, we provide 22 block choices and 10 channel number choices in the search space. We describe our search space in detail below.

Fig. 3

a and b Internal architectures of the tiny robust blocks. K denotes the kernel size and takes values 3, 5, or 7. The dashed line indicates the internal search space of the tiny robust blocks; if the non-local layer is removed from the main branch of (a) and (b), they become the pure tiny blocks. c Internal architectures of the pure robust blocks. The upper part of (c) shows the combination of the SE layer and the Embedded-Gaussian non-local layer; the bottom part shows the combination of the SE layer and the Gaussian non-local layer

Fig. 4

a–c How the channel selector is added to the tiny robust blocks and the pure robust blocks, respectively. The kernel size is set to 3, 5, and 7, respectively. The dashed line indicates the internal search space of the tiny robust blocks; if the non-local layer is removed from the main branch of (a) and (b), they become the pure tiny blocks

Block search spaces

In the block search space, we design three types of blocks: pure tiny blocks, pure robust blocks, and tiny robust blocks.

  1. Pure Tiny Blocks: The pure tiny blocks mainly come from ShuffleNetV2 [15]. We add a self-attention layer, the SE layer [17], to the main branch to balance accuracy and inference speed. The blocks in Fig. 3a become pure tiny blocks when the non-local layer is removed from the main branch.

  2. Pure Robust Blocks: First, we design a non-local block for image denoising, inspired by Refs. [18, 37]. We also add another self-attention layer, the SE layer [17], as the last layer of the non-local block, since it has been found [18] that the self-attention mechanism can make a neural network more robust. Figure 3c shows the two variants of the non-local block, referred to as the Embedded-Gaussian version and the Gaussian version [18], which form the internal architectures of the pure robust blocks. The non-local block is offered as a choice block only in the stride-1 layers of the supernet, because its output feature map size is not compatible with the stride-2 layers.

  3. Tiny Robust Blocks: In addition, we borrow ideas from Refs. [15, 16, 38] to design the shufflev2 and shufflev2-xception blocks. To make them more robust, we add the non-local block and the SE layer to the main branch of the original shufflev2 and shufflev2-xception blocks. Figure 3a and b show the internal architectures of the tiny robust blocks; the kernel size of the depth-wise convolutional layer is 3, 5, or 7. Furthermore, since the non-local layer adds parameters to the shufflev2 or shufflev2-xception block, we make it an optional choice, which constitutes another internal search space. When the non-local layer and the SE layer are removed from the tiny robust blocks, they work as pure tiny blocks. In Fig. 3, the dashed line indicates the internal search space of each block. The choice blocks are encoded by assigning each block an index from 0 to 21.


Channel search spaces

The channel number plays an essential role in a neural network's efficiency and computational cost [29]. Apart from the adversarial and clean accuracy, we select the total number of parameters as the third objective. We only search the intermediate channel number of each block, including the pure robust blocks, pure tiny blocks, and tiny robust blocks. Heuristically, reducing the intermediate channel number does not necessarily deteriorate the adversarial and clean accuracy, which makes it well suited for balancing the performance of a neural network against its model size. Specifically, Fig. 4 shows how we add a channel selector to the intermediate part of the pure tiny blocks, pure robust blocks, and tiny robust blocks. The channel selector ratio ranges from 0.2 to 2.0 with an interval of 0.2, and the choice channels are encoded by assigning each channel selector ratio an index from 0 to 9.
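A minimal sketch of the channel selector idea: the intermediate convolution is allocated at the maximum width (ratio 2.0), and a sampled ratio simply masks out the tail channels. This illustrates the mechanism rather than reproducing the authors' exact layer.

```python
import torch
import torch.nn as nn

class ChannelSelector(nn.Module):
    """Keeps the first k intermediate channels for a sampled ratio, zeroing the rest."""

    def __init__(self, max_channels):
        super().__init__()
        self.max_channels = max_channels  # allocated width, corresponding to ratio 2.0

    def forward(self, x, ratio):
        # ratio is drawn from {0.2, 0.4, ..., 2.0}
        k = int(round(self.max_channels * ratio / 2.0))
        mask = torch.zeros(1, self.max_channels, 1, 1, device=x.device)
        mask[:, :k] = 1.0
        return x * mask
```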

Uniform sampling

Our supernet sampling strategy is to sample choice blocks first and then jointly sample choice blocks and choice channels once the warm-up training of the supernet is completed. We find it challenging for the supernet to converge when block choices and channel choices are jointly sampled from the beginning. To speed up the sampling procedure, we build a parameter lookup table in advance. We give more details about block sampling and channel sampling below.
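The parameter table can be a plain dictionary keyed by (layer, block choice, channel choice), so that the size of any sampled subnet is a cheap sum of lookups. A sketch, where `count_params` is a hypothetical helper that instantiates one configured block and counts its parameters:

```python
def build_param_table(num_layers, block_space, channel_space, count_params):
    """Pre-compute the parameter count of every per-layer configuration."""
    return {(layer, b, c): count_params(layer, b, c)
            for layer in range(num_layers)
            for b in block_space
            for c in channel_space}

def subnet_size(arch, table):
    """arch: list of (block_id, channel_id) genes, one per layer."""
    return sum(table[(layer, b, c)] for layer, (b, c) in enumerate(arch))
```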

Block sampling

We investigate many network architectures on the CIFAR-10 and SVHN datasets, and our pilot studies suggest a tiny supernet size ranging from 1.5 to 4 M parameters. In the block sampling phase, we only search block choices for the supernet architecture and constrain the number of supernet parameters to range from 1.823 to 2.375 M. In all experiments, we train the supernet for 500 epochs in the block sampling phase and sample a new architecture every 20 epochs.

Block and channel jointly sampling

The supernet jointly searches block and channel choices after the block sampling phase; we refer to this as the block and channel jointly sampling phase. In this phase, we constrain the number of supernet parameters to range from 1.61 to 2.37 M. In all experiments, we train the supernet for 500 epochs in this phase and sample a new architecture every 20 epochs.

Adversarial training

We aim to explore the influence of the network architecture on its robustness against adversarial attacks. Therefore, we focus on white-box adversarial attacks bounded in the \(l_{\infty }\) norm. As is well known, PGD [3] adversarial training is computationally expensive and hard to converge. We follow [21, 23] and adopt the TRADES-YOPO-m-n algorithm [21] to speed up our adversarial training. This work adopts the loss function of Ref. [23], which is described as

$$\begin{aligned} \min _{\theta } {\mathbb {E}}\left\{ {\mathcal {L}}(f_{\theta }(x), y) + \max _{\left\| \eta \right\| \le \epsilon }{\mathcal {L}}(f_{\theta }(x), f_{\theta }(x + \eta ))/\lambda \right\} , \end{aligned}$$
(6)

where \({\mathcal {L}}\left( \cdot ,\cdot \right) \) denotes the cross-entropy loss function; \(f_{\theta }(x)\) denotes the output vector of the neural network parameterized by \(\theta \); y is the label-indicator vector; \(\eta \) denotes the image perturbation; and \(\lambda \) is a balancing hyperparameter. \(f_{\theta }(x + \eta )\) denotes the output vector of the neural network when the perturbation \(\eta \) has been added to its input.

This loss function reduces the gap between the model's outputs on adversarial and non-adversarial examples in the classification task, i.e., it makes the classification boundary smoother. YOPO-m-n borrows the idea of Pontryagin's Maximum Principle [39] to approximate back-propagation; one of YOPO's assumptions is that the adversarial perturbation only affects the weights of the first layer. TRADES-YOPO-m-n performs n gradient-descent updates on the weights of the first layer and iterates m times for each datum. Zhang et al. [21] state that \(m\times n\) should be larger than the number of attack iterations for TRADES-YOPO-m-n to achieve competitive results. In our experimental setting, the number of outer loops m is set to 5 and the number of inner loops n to 3 when we attack the models with ten PGD iterations [3]. The training time per epoch is 2.5 min on a training platform equipped with a single V100 GPU. The supernet adversarial training algorithm is presented in Algorithm 3.2.1.
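For reference, a sketch of the loss in Eq. (6) in PyTorch. Following TRADES [23], the boundary term is written here as a KL divergence between the outputs on x and x + η, and the inner maximization is approximated with a few PGD-style steps; the actual TRADES-YOPO-m-n training replaces this full inner loop with its cheaper first-layer approximation, which is not shown. The default step size is an assumed value.

```python
import torch
import torch.nn.functional as F

def trades_style_loss(model, x, y, epsilon=8 / 255, step_size=2 / 255,
                      inner_steps=10, lam=1.0):
    """Clean cross-entropy plus an output-discrepancy term weighted by 1/lambda."""
    # Inner maximization: find eta maximizing the discrepancy L(f(x), f(x + eta))
    logits_clean = model(x).detach()
    x_adv = x + 0.001 * torch.randn_like(x)
    for _ in range(inner_steps):
        x_adv = x_adv.detach().requires_grad_(True)
        kl = F.kl_div(F.log_softmax(model(x_adv), dim=1),
                      F.softmax(logits_clean, dim=1), reduction='batchmean')
        grad = torch.autograd.grad(kl, x_adv)[0]
        with torch.no_grad():
            x_adv = x + (x_adv + step_size * grad.sign() - x).clamp(-epsilon, epsilon)
            x_adv = x_adv.clamp(0.0, 1.0)
    # Outer minimization, Eq. (6)
    logits = model(x)
    robust_term = F.kl_div(F.log_softmax(model(x_adv), dim=1),
                           F.softmax(logits, dim=1), reduction='batchmean')
    return F.cross_entropy(logits, y) + robust_term / lam
```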

Algorithm 3.2.1: Supernet adversarial training

Multi-objective search

We use NSGA-II [32] as the multi-objective search algorithm; it is presented in Algorithm 2. The first objective is the clean accuracy, which evaluates the performance of the model on the clean training data. The second objective is the adversarial accuracy, which evaluates the performance of the model under a white-box PGD attack bounded in \(l_{\infty }\). The third objective is the number of parameters of the searched subnet. The weights of each searched subnet are cloned from the corresponding part of the supernet; therefore, no searched subnet needs to be trained during the whole search process. The first two objective values are obtained by running inference with each searched subnet, and the number of parameters is obtained quickly by looking it up in our parameter table. In the initialization of NSGA-II (Lines 1–13 in Algorithm 2), we randomly initialize the parent population \(P_{0}\), where each individual (subnet) is evaluated with three fitness values: the adversarial error, the clean error, and the number of parameters. Then, \(P_{0}\) is sorted by non-dominated sorting. In Lines 9–10 of Algorithm 2, we employ the tournament scheme and the crossover and mutation operators to generate the offspring population \(Q_{0}\). During each iteration of the optimization (Lines 14–32), a combined population \(R_{t} = P_{t} \cup Q_{t}\) is formed. \(R_{t}\) is then sorted into fronts of individuals in ascending order by non-dominated sorting, as described in Line 16. The new population \(P_{t+1}\) is filled from \(R_{t}\) by selecting the elite solutions front by front in ascending order of front number, as shown in Lines 19–23. The selection continues until the front \(F_{g}^{i}\) whose members can no longer all be accommodated, from which only a subset of the solutions is selected according to the crowding distance values in descending order. We present the details of the crossover and mutation operators next.
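A sketch of one generation of the environmental selection described above (Lines 14–32 of Algorithm 2). The helper names are placeholders: `evaluate` returns the 3-tuple (adversarial error, clean error, number of parameters) via inference with supernet-cloned weights, `make_offspring` applies tournament selection, crossover, and mutation, `front_sort` is fast non-dominated sorting, and `crowding` returns an individual's crowding distance within its front.

```python
def nsga2_generation(population, evaluate, make_offspring, front_sort, crowding,
                     pop_size):
    """Build P_{t+1} from R_t = P_t U Q_t by front rank, then crowding distance."""
    offspring = make_offspring(population)                  # Q_t
    combined = population + offspring                       # R_t
    fitness = {id(ind): evaluate(ind) for ind in combined}  # no subnet training
    fronts = front_sort(combined, fitness)                  # ascending front number
    next_pop = []
    for front in fronts:
        if len(next_pop) + len(front) <= pop_size:
            next_pop.extend(front)                          # take whole elite fronts
        else:
            # Last admitted front: prefer the most spread-out solutions
            front.sort(key=lambda ind: crowding(ind, front, fitness), reverse=True)
            next_pop.extend(front[:pop_size - len(next_pop)])
            break
    return next_pop                                         # P_{t+1}
```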

Crossover

Before crossover, our multi-objective search adopts tournament selection to choose two parents; in our experimental settings, the tournament size is 10. The crossover operator inherits and recombines blocks or channels from the two parents to generate a new subnet. To solve the channel dimension mismatch problem, we pre-allocate the weight matrix of each convolutional kernel at its maximum size, i.e., [max_output_channel, max_input_channel, kernel_size], where the maximum channel dimension is twice the original dimension. After crossover, the dimension of the weight matrix remains unchanged; however, we only keep the values of the weight matrix for the current input and output channels ([: c_out,: c_in,: ]), and the values of the other channels are forced to zero. In this way, we not only solve the channel dimension inconsistency problem but also implement channel crossover and mutation. Moreover, we must exclude the non-local block from the search space of the stride-2 layers, since the size of its output feature map cannot match the input size of the subsequent stride-1 layer.
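The weight-handling trick can be sketched as follows: every convolution kernel is pre-allocated at the maximum dimensions, and after crossover or mutation only the active sub-matrix is kept. A minimal illustration:

```python
import torch

def activate_conv_weight(full_weight, c_out, c_in):
    """Zero all but the active [:c_out, :c_in] slice of a pre-allocated kernel.

    full_weight has shape [max_output_channel, max_input_channel, k, k], where
    the maximum channel dimension is twice the original one.
    """
    active = torch.zeros_like(full_weight)
    active[:c_out, :c_in] = full_weight[:c_out, :c_in]
    return active
```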

Mutation

The mutation operator re-assigns the block or channel selector ratio of a subnet from the search space; it is triggered when a uniformly drawn random number falls below the pre-defined mutation probability. Our block encoding scheme ranges from 0 to 21, and the blocks that contain the non-local layer are encoded from 12 to 21. Most blocks can be arbitrarily mutated in each layer except in the stride-2 layers, because the non-local block cannot be placed in a stride-2 layer: the size of its output feature map cannot match the input size of the subsequent stride-1 layer. To enhance the diversity of the population without creating completely different network architectures, we set the mutation probability to 0.1, which means that each subnet gene has a 10\(\%\) chance of changing its block or channel number. A sketch of this operator is given below.
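A sketch of the mutation operator under the encoding described above, where a subnet is a list of (block ID, channel ID) genes and non-local blocks (IDs 12–21) are excluded from stride-2 layers:

```python
import random

def mutate(arch, block_space, channel_space, stride2_layers, p_mut=0.1):
    """Re-draw each gene from the search space with probability p_mut."""
    new_arch = []
    for layer, (block_id, channel_id) in enumerate(arch):
        if random.random() < p_mut:
            choices = block_space
            if layer in stride2_layers:
                # Non-local blocks cannot feed the following stride-1 layer
                choices = [b for b in block_space if b < 12]
            block_id = random.choice(choices)
        if random.random() < p_mut:
            channel_id = random.choice(channel_space)
        new_arch.append((block_id, channel_id))
    return new_arch
```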

Training from scratch or fine-tuning

After finishing the multi-objective search, we obtain a set of non-dominated architectures. Generally speaking, there are two ways to handle each searched subnet on the non-dominated front: one is to inherit the weights from the supernet and fine-tune them, and the other is to train each subnet from scratch. In both cases, we use TRADES-YOPO-m-n as the adversarial training algorithm. We examine the differences between these two approaches in the next section.

Experimental result and analysis

In this section, we introduce the experimental settings for the overall framework, including supernet training, the NSGA-II search, and training from scratch or fine-tuning. In addition, based on our search results, we provide a guideline for designing neural network architectures that defend against adversarial attacks. We conduct several ablation studies to reveal how the network architecture influences robustness against adversarial attacks. We perform extensive studies on CIFAR-10 [34] and SVHN [35] to validate the effectiveness of our overall framework. On CIFAR-10, we zero-pad each side by 4 pixels and randomly crop back to the original size. We then randomly flip the images horizontally and normalize them to \(\left[ 0,1 \right] \) for both the CIFAR-10 and SVHN datasets. To better investigate the influence of the network architecture on robustness under adversarial attacks, we assume that the adversary has complete access to the neural network, including the architecture and all parameters. This is why we focus on white-box attacks on different neural network architectures.

Experimental settings

Supernet

According to Algorithm 3.2.1, our supernet first enters the block sampling phase, and the maximum number of training epochs is set to 500. We provide 22 different blocks in the block sampling search space; Fig. 3 gives the details of the choice blocks. We use stochastic gradient descent (SGD) as the optimizer with a batch size of 512, a momentum of 0.9, and a weight decay of \(5e-4\). The initial learning rate in block sampling is set to 0.1 and is lowered by a factor of ten at epochs 200, 400, and 450. The number of supernet parameters ranges from 1.823 to 2.375 M. After that, we jointly sample the blocks and channels of each layer of the supernet. We provide ten channel selectors for each block, and Fig. 4 presents the details of the channel selector choices for the different types of blocks. We increase the number of epochs to 1000 with a batch size of 512, a momentum of 0.9, and a weight decay of \(5e-4\). Specifically, we gradually enlarge the channel selector search space every 20 epochs, which helps the supernet converge quickly; for example, the channel selector ratio is restricted to 1.8 and 2.0 before epoch 520, and one more ratio, 1.6, is added at epoch 540. The channel selector ratio search space ranges from 0.2 to 2.0 with an interval of 0.2. The initial learning rate in block and channel jointly sampling is set to 0.1 and is lowered by a factor of ten at epochs 600, 700, and 800. For the PGD attacks, the perturbation size is set to \(\epsilon = 8/255\) in the \(l_{\infty }\) norm. The outer loops m and inner loops n of our adversarial training algorithm YOPO-m-n are set to 5 and 3, respectively. \(\lambda \) in Eq. 6 is set to 1, which means that we balance the model performance on adversarial and non-adversarial examples.

Table 2 Supernet transferability with respect to the number of attack steps, in terms of accuracy on CIFAR-10 (%)
Table 3 Supernet transferability with respect to the number of attack steps, in terms of accuracy on SVHN (%)
Table 4 Supernet transferability with respect to the epsilon size, in terms of accuracy on CIFAR-10 (%)
Table 5 Supernet transferability with respect to the epsilon size, in terms of accuracy on SVHN (%)
Table 6 Supernet transferability with respect to the model size, in terms of accuracy on CIFAR-10 (%)
Table 7 Supernet transferability with respect to the model size, in terms of accuracy on SVHN (%)
Table 8 The impact of supernet width, in terms of accuracy on CIFAR-10 (%)
Table 9 The impact of supernet width, in terms of accuracy on SVHN (%)
Table 10 Difference between training from scratch and fine-tuning, in terms of accuracy

NSGA-II

In our experimental settings, the total population size is 100. \(P_{0}\), of size 50, is sorted by non-dominated sorting. In Lines 9–10 of Algorithm 2, we employ the tournament scheme and the crossover and mutation operators discussed in Sects. "Crossover" and "Mutation" to generate the offspring population \(Q_{0}\) of size 50. The mutation probability is set to 0.1, i.e., there is a 10\(\%\) chance of triggering the mutation operator, and the tournament size is set to 10. During each iteration of the optimization (Lines 14–32), a combined population \(R_{t} = P_{t} \cup Q_{t}\) of size 100 is formed. Note that the population sizes of \(P_{t+1}\) and \(Q_{t+1}\) are both 50. The number of generations is set to 20. We use the hypervolume (HV) to indicate whether the search algorithm has converged; most of our experiments indicate that the multi-objective search converges around the 18th generation.

Training from scratch

According to Fig. 1, we obtain one set of non-dominated subnet architectures. We randomly initialize each subnet's weights and set the training budget for every subnet to 100 epochs; however, we find that most subnets have converged by epoch 40. The initial learning rate of each subnet is 0.1 and is lowered by a factor of ten at epochs 20, 40, and 80. The optimizer is SGD with a batch size of 512, a momentum of 0.9, and a weight decay of \(5e-3\). We evaluate our models under white-box \(l_{\infty }\)-bounded PGD attacks with different epsilon values and numbers of steps: the evaluation epsilon ranges from 2/255 to 8/255 with an interval of 2/255, and the number of PGD attack steps ranges from 10 to 50. The hyperparameters of the fine-tuning method are the same as above. Fine-tuning initializes each subnet with the weights inherited from the supernet, while training from scratch initializes the weights of each subnet randomly.

Supernet transferability

In this section, we aim to understand whether using a weaker PGD attack for the adversarial training of the supernet considerably deteriorates the adversarial performance of the subnets. Specifically, we adjust the strength of the PGD attack by changing the number of attack steps and the epsilon size. To begin with, we establish a baseline for our best subnet on the CIFAR-10 and SVHN datasets, presented in the first rows of Tables 2 and 3. The first column denotes how the supernet is trained; for instance, the subscript of \({\textbf {S}}_{8/255}^{10}\) is the epsilon size of the PGD attack used for supernet training (8/255), and the superscript is the number of attack steps (10). The last column denotes the best adversarial accuracy of the subnet model under different attack strengths; for example, \({\textbf {P}}_{8/255}^{10}\) means the subnet is attacked by PGD with an epsilon size of 8/255 and ten attack steps. Since we focus on the network architecture under adversarial attacks, the subnet presented in the following tables is the non-dominated architecture that achieves the best adversarial performance after being trained from scratch.

Fig. 5

Difference between training from scratch and fine-tuning. The red points represent the solutions on the non-dominated front obtained by NSGA-II, where the weights of the subnets are randomly initialized. By contrast, the blue points are solutions obtained in the same way, but with the weights of the subnets inherited from the supernet

Fig. 6

The first non-dominated front among the adversarial error, clean error, and number of parameters of the subnets on CIFAR-10, before training from scratch. The size of each circle indicates the number of parameters of the subnet. The vertical axis represents the clean error of the subnets, while the horizontal axis represents the adversarial error

Fig. 7

The first non-dominated front after training from scratch on CIFAR-10. The size of each circle indicates the number of parameters of the subnet. The vertical axis represents the clean error of the subnets, while the horizontal axis represents the adversarial error. Circles of the same color in Figs. 6 and 7 denote the same network architecture

Number of attack steps

Table 2 indicates that the subnets perform better even when the supernet experiences fewer attack steps during adversarial training. The third row, \(S_{8/255}^{2}\), denotes a supernet adversarially trained under a white-box PGD attack with two steps, and the second row, \(S_{8/255}^{1}\), one trained with a single step. In comparison with \(S_{8/255}^{10}\), \(S_{8/255}^{2}\) increases the adversarial accuracy and clean accuracy by 6.4\(\%\) and 4.5\(\%\), respectively, and reduces the subnet model size by 3.6\(\%\). In addition, the gap between \(S_{8/255}^{1}\) and our baseline \(S_{8/255}^{10}\) is very small on all objectives. Therefore, we surmise that the supernet retains strong transferability even if we reduce the number of attack steps in its adversarial training. Moreover, we observe that the adversarial accuracy of the subnets is strongly affected by the epsilon size but not by the number of attack steps; for example, the differences in adversarial accuracy among \(P_{2/255}^{10}\), \(P_{2/255}^{30}\), and \(P_{2/255}^{50}\) are very small regardless of how the supernet is trained. The same conclusion holds in Table 3. Therefore, this observation not only saves considerable training time by reducing the number of attack steps but also enables the subnets to obtain a stronger adversarial defensive ability.

Epsilon size

Table 4 indicates that the supernet can improve its subnets' representation ability if it is not overloaded by the epsilon size. First, the subnet of \(S_{2/255}^{10}\) achieves the highest clean accuracy, up to 81.95\(\%\), but its adversarial accuracy under the \(P_{8/255}^{10}\), \(P_{8/255}^{30}\), and \(P_{8/255}^{50}\) attacks cannot surpass 9\(\%\). Our assumption is that this subnet has not fully developed its resilience, since its supernet cannot learn from adversarial examples generated with a larger epsilon size. This assumption is supported by the observation that the subnets of \(S_{4/255}^{10}\) and \(S_{6/255}^{10}\) hugely increase the adversarial accuracy under the \(P_{8/255}^{10}\) attack. Specifically, comparing \(S_{4/255}^{10}\) with \(S_{2/255}^{10}\), the adversarial performance of their subnets under \(P_{8/255}^{10}\), \(P_{8/255}^{30}\), and \(P_{8/255}^{50}\) increases by 146.2%, 156.4%, and 157.0%, respectively. Another assumption is that an epsilon size exceeding the supernet's workload reduces both the clean and adversarial accuracy of the subnets. We observe that the subnet of \(S_{6/255}^{10}\) obtains the best adversarial performance, whereas \(S_{8/255}^{10}\) largely weakens its subnet's performance in terms of clean accuracy as well as under the \(P_{8/255}^{10}\), \(P_{6/255}^{10}\), and \(P_{4/255}^{10}\) attacks. Specifically, comparing the subnet of \(S_{6/255}^{10}\) with that of \(S_{8/255}^{10}\), the clean accuracy increases by 7.447\(\%\), and the adversarial accuracy increases by 1.63%, 5.338%, and 8.256% under the \(P_{6/255}^{10}\), \(P_{4/255}^{10}\), and \(P_{2/255}^{10}\) attacks, respectively. Thus, when the epsilon size exceeds 6/255 in our case, the subnet performance begins to decline gradually. The same conclusion holds in Table 5.

Fig. 8

Block statistics for each layer of the top ten subnets on CIFAR-10. The horizontal axis represents the layer IDs of the subnets; for instance, 0 denotes the first layer. The vertical axis shows the two most frequently adopted blocks in each layer. For instance, block ID 15 is the most frequently used in the first layer of the top ten subnets, while block IDs 13 and 4 are the second most frequently used

Fig. 9

Channel statistics for each layer of the top ten subnets on CIFAR-10. The horizontal axis represents the layer IDs of the subnets; for instance, 0 denotes the first layer. The vertical axis shows the two most frequently used channel expansion ratios in each layer. For instance, the channel expansion ratio 1.8 is the most frequently used in the first layer of the top ten subnets, and the ratio 1.6 is the second most frequently used

Model size

Table 6 shows that the adversarial performance of a model may not be closely correlated with its model size, which is inconsistent with the conclusion in Ref. [3]. Comparing \(S_{8/255}^{2}\) with our baseline \(S_{8/255}^{10}\), the subnet model size drops by 3.751\(\%\), but the clean accuracy increases by 4.591%, and the adversarial accuracy under \(P_{8/255}^{10}\), \(P_{6/255}^{10}\), \(P_{4/255}^{10}\), and \(P_{2/255}^{10}\) increases by 2.642%, 4.864%, 5.838%, and 6.420%, respectively. Comparing the subnets of \(S_{6/255}^{10}\) and \(S_{8/255}^{10}\), the best subnet size drops by 3.640%, but the clean accuracy increases by 7.447%, and the adversarial accuracy under \(P_{6/255}^{10}\), \(P_{4/255}^{10}\), and \(P_{2/255}^{10}\) increases by 1.63%, 5.338%, and 8.256%, respectively. The same result is observed in Table 7. Therefore, we conclude that the performance of a subnet, including both the clean accuracy and the adversarial accuracy, correlates strongly with the training mode of its supernet rather than with its own model size.

Width impact of supernet

Table 8 indicates that the width of the supernet has a large impact on the defensive ability of the subnets. However, this does not mean that an arbitrarily large width improves the adversarial performance of the subnets. Specifically, we enlarge the channels of each layer of the supernet with expansion ratios from \(\left[ 1,2,4,8\right] \), while the channel selection remains the same, ranging from 0.2 to 2 with an interval of 0.2. \(2S_{8/255}^{1}\) denotes a supernet whose per-layer channel size is doubled and which is adversarially trained with a PGD attack of epsilon size 8/255 and one step. \(2S_{8/255}^{1}\) increases the adversarial accuracy by 10.9\(\%\) and the clean accuracy by 9.1\(\%\) in comparison with \(S_{8/255}^{1}\). Although we double the channel size of each layer of the supernet, so that in theory the subnet of \(2S_{8/255}^{1}\) could be twice as large as that of \(S_{8/255}^{1}\), the model size of the subnet of \(2S_{8/255}^{1}\) increases by only 24.4\(\%\), which shows that our NAS framework is very efficient. In addition, we also try to enlarge the channel size by factors of 4 and 8, but find that the supernet is then too difficult to converge. The same conclusion holds in Table 9. Therefore, it is hard to claim a linear correlation between the width of the supernet and its adversarial performance.

Table 11 Block encoding scheme and layer statistics on CIFAR-10

Training from scratch or fine-tuning

From Fig. 5, we can easily observe a huge gap between training from scratch and fine-tuning. The three axes of each graph represent the number of parameters, the adversarial error, and the clean error of each subnet, respectively. The subscript of each graph in Fig. 5 denotes the supernet training mode. The red points represent the solutions on the non-dominated front obtained by NSGA-II, where the weights of the subnets are randomly initialized; the blue points are solutions obtained in the same way, but with the subnets' weights inherited from the supernet. For example, Fig. 5a presents the distribution of subnets for training from scratch and fine-tuning, and the supernet in Fig. 5b is trained under a PGD attack with an epsilon size of 8/255 and ten steps. We can clearly observe that no matter how the supernet is trained, the subnets trained from scratch perform better on both adversarial and non-adversarial examples. Table 10 quantifies the gap between training from scratch and fine-tuning under PGD attacks with different epsilon sizes and numbers of attack steps. The first column of Table 10 indicates the training mode of the supernet. The second column shows the difference in the subnets' performance on non-adversarial examples between training from scratch and fine-tuning: the minimum average value is 9.614 for \(S_{4/255}^{10}\), and the maximum average value is 25.94 for \(S_{8/255}^{2}\). This clearly indicates that training from scratch surpasses fine-tuning by at least 20% in terms of clean accuracy. As for the adversarial performance, the minimum average difference is 5.391 for \((S_{4/255}^{10}, P_{8/255}^{10})\); in other words, the model trained from scratch surpasses the fine-tuned model by 48% in adversarial accuracy, because the average adversarial performance for \((S_{4/255}^{10}, P_{8/255}^{10})\) is 11.23. We hypothesize that the role of the supernet in our NAS framework is to find the best architecture for the subnet rather than to deliver the best weights to it. Fine-tuning is not always beneficial to training; the reason may be that the weights of each newly sampled subnet are not good enough, since each is trained for only 20 epochs. Nevertheless, NSGA-II can still find the best subnets among them during the optimization in terms of our objectives.

Subnets’ analysis

This section analyzes the top ten subnet architectures in terms of clean accuracy and adversarial accuracy, together with the corresponding supernet training methods, on CIFAR-10 and SVHN. Our aim is to gain insights from our best results and reveal rules for designing more robust tiny neural networks.

Adversarial error, clean error, and the size of neural network

Figure 6 shows the non-dominated subnets obtained by NSGA-II on the CIFAR-10 dataset; their corresponding supernets are \(2S_{8/255}^{1}\), \(S_{6/255}^{10}\), and \(S_{8/255}^{2}\), respectively. We use circles to represent the subnets, and the size of each circle indicates the subnet's size (number of parameters). In Fig. 6b and c, it can be easily observed that there is a trade-off relationship between the adversarial error, the clean error, and the size of the neural network. Figure 7 shows that the order of the non-dominated subnets changes greatly after training from scratch. To illustrate this clearly, each subnet (circle) is denoted by a different color, and circles of the same color in Figs. 6 and 7 represent the same network architecture. An important observation from Figs. 6 and 7 is that most tiny neural networks (small circles) achieve a significant reduction in both the adversarial error and the clean error after training from scratch. For instance, the point G in Fig. 7a, which has the lowest clean error and the lowest adversarial error, had larger errors before training from scratch; the same observation holds for the point F when comparing Figs. 6a and 7b. Hence, we conclude that our pipeline can effectively increase the adversarial accuracy and the clean accuracy of tiny neural networks.

Block and channel analysis

We perform another statistical analysis of which blocks and channel expansion ratios are used most frequently in the top ten subnets. Finding the best trade-off tiny neural network architectures can be viewed as a combinatorial optimization problem: each layer of the network architecture is a combination of specific blocks and channels, and the optimum network architecture arises from solving this combinatorial problem across the layers.

Table 12 Channel encoding scheme and layer statistics on CIFAR-10
Table 13 The Lego subnet experiments on CIFAR-10 (%)
Table 14 The Lego subnet experiments on SVHN (%)

Table 11 shows our block encoding scheme and the layer statistics. The upper part of Table 11 explains how each block identifier maps to a certain block; for instance, Block 0 comprises the ShuffleV2 block and the SE layer with a kernel size of 3, and the internal composition of each block can be found in Fig. 3a. The lower part of Table 11 shows the number of occurrences of each block in every layer; for example, the 15th block appears four times in the first layer (layer 0) of our top ten subnets, while the 13th and 2nd blocks appear twice. In addition, Fig. 8 shows the two most frequently adopted blocks in each layer; the data in Fig. 8 come from Table 11. In our block encoding scheme, blocks with identifiers smaller than eight are pure tiny blocks, i.e., they are meant to make the neural network tinier; identifiers between 9 and 17 denote tiny robust blocks, which enhance the robustness of the tiny blocks; and identifiers larger than 18 denote pure robust blocks, i.e., they are meant to increase the adversarial performance of the neural network. From Fig. 8, we observe the trend that the top trade-off tiny neural networks prefer robust or tiny robust blocks in the first four layers, while the last eight layers prefer pure tiny blocks. We surmise that since the PGD attack mainly introduces pixel-wise perturbations, the robust blocks can mitigate the attack effect in the first several layers, while the tiny blocks help the neural network balance clean performance against the number of parameters.

Table 12 shows our channel expansion ratio encoding scheme and the layer statistics. The upper part of Table 12 gives the mapping between the channel identifier and the channel expansion ratio. The lower part of Table 12 shows the number of occurrences of each channel expansion ratio; for example, the channel expansion ratios 1.8 and 1.6 appear six times and twice, respectively, in the first layer of the top ten subnets. Figure 9 shows the two most frequently used channel expansion ratios in each layer. From Fig. 9, we observe that the top trade-off tiny neural networks prefer larger channels in the first three layers, after which the channel number gradually declines; the top ten tiny subnets use the smallest channel numbers in the last three layers. Our explanation is that since the PGD attack mainly introduces pixel-wise perturbations, wider channels in the first several layers help mitigate the adversarial attacks, while the gradually declining channel numbers reduce the size of the neural network.

Table 15 Benchmark on CIFAR-10 (%)

Assumption verification

To verify our assumption, the most frequently used block and channel of each layer in Figs. 8 and 9 are selected as the block and channel choices, respectively, of a new model. We call this new subnet architecture Lego-Net. Table 13 shows that Lego-Net performs better than our best subnet \(2S^{1}_{8/255}\), especially in adversarial performance: Lego-Net increases the adversarial accuracy by 10.36% under \(P_{8/255}^{10}\) and the clean accuracy by 0.78%, while its size drops by 1.52%. We achieve the same result on the SVHN dataset, as shown in Table 14. To conclude, to build tiny robust neural networks, we should place pure robust or tiny robust blocks in the shallow layers and pure tiny blocks in the remaining layers; in terms of channel design, we should use wider intermediate channels in the shallow layers and gradually reduce the intermediate channels in the remaining layers.

Main comparisons

We now discuss the differences between Guo et al. [30] and our work. First, our backbone supernet uses ShuffleNetV2 [15], one of the typical tiny neural networks, while Guo et al. [30] employ ResNet [5] as the backbone of their supernet. Moreover, the authors of [30] design the connections between blocks as their search space, with only three choices (\(3\times 3\) separable convolution, identity, and zero) for the candidate blocks. By contrast, our work re-designs the block search space, i.e., we incorporate the attention mechanism into tiny blocks. We not only provide 22 block choices, covering pure tiny blocks, pure robust blocks, and tiny robust blocks, but also design a new channel search space for each block, aiming to find the optimum blocks and channels for each layer of a tiny neural network. Our work pays more attention to balancing the clean error, the adversarial error, and the neural network size when designing the search space and the supernet. Second, although the authors of [30] claim that their NAS algorithm is one-shot, they use three different computational budgets (small, medium, and large) to search for different sizes of neural networks; in other words, they search at least three times to obtain networks of different sizes. Our work is a true one-shot NAS algorithm, as the multi-objective optimization employed in our method generates diverse models with different structures in a single shot, and the balance between the clean error, the adversarial error, and the neural network size is considered within the search process itself.

From Table 15, we can clearly see that the clean accuracy of Lego-Net \(2S_{8/255}^{1}\) increases by 3.1% while the size of the neural network drops by 104.7% in relative terms. Although our Lego-Net cannot compete with RobNet-small in adversarial performance, we surmise that increasing the size of the neural network would improve robustness. Furthermore, we hypothesize that the epsilon size (the pixel perturbation range) encountered in reality is rarely as high as 8/255, and that the most common attacks are light-weight perturbations. Accordingly, when we reduce the epsilon size from 8/255 to 2/255, we find that the gap between the adversarial and clean performance of our Lego-Net is only 14%. We therefore conclude that our Lego-Net keeps a good balance between the adversarial accuracy, the clean accuracy, and the neural network size.

Conclusion

We propose a tiny adversarial multi-objective one-shot neural architecture search framework, which aims to find the best trade-off networks in terms of the adversarial error, the clean error, and the size of the neural network. Our study reveals several observations on how the adversarial training method of the supernet affects the subnets' adversarial performance. We also provide hints on how to design tiny robust neural networks based on our block and channel statistics. Furthermore, we conduct experiments to empirically verify our hypothesis that robustness can be improved without significantly reducing the clean accuracy or enlarging the neural network. However, the proposed TAM-NAS framework still has a clear limitation, i.e., the performance of the subnets heavily relies on the network architecture of the supernet. Therefore, in the future, we will develop a co-evolutionary multi-objective NAS framework in which the architecture of the supernet also evolves during the search process.