ABCNN-IDS: Attention-Based Convolutional Neural Network for Intrusion Detection in IoT Networks

Momand, Asadullah; Jan, Sana Ullah; Ramzan, Naeem

doi:10.1007/s11277-024-11260-7

ABCNN-IDS: Attention-Based Convolutional Neural Network for Intrusion Detection in IoT Networks

Research
Open access
Published: 03 July 2024

Volume 136, pages 1981–2003, (2024)
Cite this article

Download PDF

You have full access to this open access article

Wireless Personal Communications Aims and scope Submit manuscript

ABCNN-IDS: Attention-Based Convolutional Neural Network for Intrusion Detection in IoT Networks

Download PDF

Asadullah Momand¹,
Sana Ullah Jan² &
Naeem Ramzan¹

198 Accesses
Explore all metrics

Abstract

This paper proposes an attention-based convolutional neural network (ABCNN) for intrusion detection in the Internet of Things (IoT). The proposed ABCNN employs an attention mechanism that aids in the learning process for low-instance classes. On the other hand, the Convolutional Neural Network (CNN) employed in the ABCNN framework converges toward the most important parameters and effectively detects malicious activities. Furthermore, the mutual information technique is employed during the pre-processing stage to filter out the most significant features from the datasets, thereby improving the effectiveness of the ABCN model. To assess the effectiveness of the ABCNN approach, we utilized the Edge-IoTset, IoTID20, ToN_IoT, and CIC-IDS2017 datasets. The performance of the proposed architecture was assessed using various evaluation metrics, such as precision, recall, F1-score, and accuracy. Additionally, the performance of the proposed model was compared to multiple ML and DL methods to evaluate its effectiveness. The proposed model exhibited impressive performance on all the utilized datasets, achieving an average accuracy of 99.81%. Furthermore, it demonstrated excellent scores for other evaluation metrics, including 98.02% precision, 98.18% recall, and 98.08% F1-score, which outperformed other ML and DL models.

COREM2 project: a beginning to end approach for cyber intrusion detection

Article 14 March 2022

Intrusion Detection Using Attention-Based CNN-LSTM Model

A Hybrid Network Intrusion Detection Model Based on CNN-LSTM and Attention Mechanism

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The Internet of Things (IoT) envisions a connected network of various intelligent objects in our surroundings, capable of gathering, processing, and transmitting information [1]. In recent years, the IoT had a significant impact on various industries, including agriculture, medicine, transportation, automobiles, and water monitoring [2,3,4,5]. This is the era where all businesses rely on technology, everything is going digital, and as we see, the demand for IoT devices has increased significantly, escalating from 15.42 billion in 2015 to a staggering 35.8 billion in 2021[6,7,8]. IoT devices often have limited computational power and memory, making it challenging to implement robust security measures [9, 10]. As businesses deploy more IoT devices, the risk of vulnerabilities being targeted and exploited increases [11]. As shown in Fig. 1, By the year 2025, it is projected that the IoT will reach a staggering number of 75.44 billion devices, resulting in an enormous data output of 79 zettabytes [12]. The IoT has been recognized as a crucial factor in digitization for societal transformation [13, 14].

Many IoT devices gather, save, and handle sensitive data, while their diverse configuration and openness make them an attractive target for attackers [15,16,17]. Ensuring confidentiality is crucial for the successful implementation of IoT networks. To identify malicious activity, an intrusion detection system (IDS) is necessary to monitor IoT network operations [18,19,20,21]. IoT networks often involve a large number of heterogeneous devices, each with its own communication protocols and data formats [22, 23]. Traditional IDS solutions may struggle to handle the diversity and complexity of IoT network traffic, making it difficult to identify abnormal behavior specific to IoT devices [24, 25]. Numerous researchers have collaborated on IDS development, leveraging the power of ML and DL algorithms [26,27,28]. ML and DL methods find extensive applications in diverse domains including agriculture, medicine, and transportation [29,30,31,32]. DL, a subset of ML, is particularly useful for addressing problems involving high-dimensional and intricate data. Moreover, DL enables systematic training of nonlinear models on big datasets.

An imbalanced and inadequate dataset may result in low performance on the current IDS. For instance, consider a security dataset that exhibits imbalanced data, where the disparity between the high and low instances of classes is substantial. The intrusion detection model can be affected by this data imbalance, as it tends to primarily focus on the high-instance classes while disregarding or gradually learning from the low-class instances. As a result, the IoT network utilizing this model may fail to detect attacks that were underrepresented in the training data. Furthermore, a significant challenge in IDS design is feature engineering to extract the most salient attributes. To enhance the effectiveness of existing systems, it is essential to extract the most significant features. To address these issues, this paper proposes an attention-based convolutional neural network (ABCNN) for intrusion detection in IoT networks. The proposed ABCNN employs an attention mechanism that computes attention values for each input attribute. This mechanism aids in the learning process for low-instance classes. On the other hand, the Convolutional Neural Network (CNN) employed in the ABCNN framework converges toward the most important parameters and effectively detects malicious activities [33]. Furthermore, this study utilizes pre-processing techniques such as feature filtering, normalization, and stratified splitting. The mutual information technique is applied during pre-processing to filter out the most significant features from the dataset. The proposed architecture was evaluated using the Edge-IIoTset, IoTID20, ToN_IoT, and CIC-IDS2017 datasets. The performance of the proposed methods was measured using several metrics, including precision, recall, F1-score, and accuracy. The main contributions of this article are:

In this study, A novel deep learning technique attention-based convolutional neural network (ABCNN) is proposed for intrusion detection in IoT networks. The attention layer computes the attention value for each input, and the CNN is utilized to predict the network’s behavior on high-attention features.
In this study, we employed the mutual information method to select the most significant features. This method calculates the mutual information between each attribute and the target variable based on entropy.
To demonstrate the effectiveness of the proposed approach in comparison to other several ML and DL methods, a series of experiments were conducted. It is worth noting that all preprocessing steps used in the comparison of the proposed and other models were identical.

The rest of this article is organized as follows: Section 2 presents recent research on intrusion detection in IoT. Section 3 covers the mathematical modeling, overall architecture flow, and experimental methodology. In Section 4, a concise discussion of the experimental results obtained from the proposed model is provided. Finally, Section 5 presents a brief conclusion.

2 Related Work

The proliferation of IoT technology has led to a significant increase in the connectivity of smart devices to the internet. However, this interconnectedness also opens up opportunities for attackers to exploit IoT networks and carry out malicious activities. In response to this pressing issue, numerous researchers have put forth various models aimed at identifying and mitigating such malicious activities in IoT networks.

Altunay et al. [34] proposed a hybrid DL model that incorporates both CNN and long short-term memory (LSTM) for the detection of intrusions in IoT networks. They evaluated the model using the UNSW-NB15 and X-IIoTID datasets for both binary and multi-class classifications. The results section compares the effectiveness of the model with that of CNN and LSTM models and shows that the hybrid CNN and LSTM model outperforms the other models. Wu et al. [35] adopted the Geometric Graph Alignment (GGA) approach to effectively handle the variations in geometry between different domains, thus improving the transfer of intrusion knowledge. In this method, each intrusion domain was represented as a graph, with the vertices and edges corresponding to intrusion categories and their interrelationships. To assess the performance of the GGA approach, the authors employed five publicly available datasets, including NSL-KDD, UNSW-NB15, CIC-IDS2017, UNSW-BoT-IoT, and UNSW-TONIoT. Their proposed model achieved an accuracy of 71.72%, which outperformed other approaches in the comparative analysis of results.

Javadpour et al. [36] introduced a multi-agent-based model designed for the detection and prevention of cyberattacks in the Cloud Internet of Things (CIoT) environment. These agents utilize association rules to effectively identify intrusions. The performance of the multi-agent-based model was assessed using the KDD Cup 99 and NSL-KDD datasets, achieving an accuracy of 71.12%. Thakkar et al. [37] presented a bagging method based on deep neural networks (DNN) to detect intrusions in IoT networks. Their primary emphasis was on addressing the challenge of unbalanced datasets. The evaluation of their proposed bagging model involved the use of NSL-KDD, UNSW-NB15, CIC-IDS2017, and BoT-IoT datasets. Their presented approach yielded an average accuracy of 98.22% across all the datasets. Alghanam et al. [38] introduced the pigeon-inspired optimization local search (LS-PIO) method for the purpose of detecting intrusions in IoT networks. Their proposed method was evaluated using four public datasets, namely BoT-IoT, UNSW-NB15, NLS-KDD, and KDDCUP-99. The LS-PIO method achieved an average accuracy of 96.58% across all the datasets used in the evaluation.

Saba et al. [39] implemented a CNN-based approach for anomaly-based intrusion detection in IoT networks. Their proposed method was trained and evaluated using two distinct datasets: the network intrusion detection (NID) dataset and the Botnet (BoT-IoT) dataset. The CNN model achieved an average accuracy of 96.18% on both datasets. Eme et al. [40] proposed a hybrid model called BGH that utilizes a combination of bi-LSTM and gated recurrent units (GRU) to effectively detect eight known IoT network attacks. The model was trained and evaluated using two widely recognized IoT network traffic datasets: CIC-IDS-2018 and BoT-IoT. Remarkably, the BGH technique achieved an impressive average accuracy of 99.38% on both datasets.

Sharma et al. [41] adopted a deep neural network (DNN) approach for detecting anomalies in IoT networks. They employed a feature filtering technique to extract the most important features from the dataset. To evaluate the performance of their model, they utilized the UNSW-NB15 dataset, achieving 84% accuracy for imbalanced data. However, by utilizing generative adversarial networks (GANs) to balance the data, the accuracy improved significantly to 99%. El-Ghamry et al. [42] proposed a CNN-based intrusion detection system specifically designed for agriculture IoT networks. They preprocessed the data, selected relevant features, and transformed it into colored images. The authors employed CNN to analyze the images and identify malicious activities within the networks. To evaluate the effectiveness of their system, they used the NSL-KDD dataset, achieving 99% accuracy in their model’s outcomes.

A short overview of the literature is presented in Table 1. After reviewing the relevant studies, it becomes clear that numerous studies have focused mainly on a select few classes because of the highly imbalanced datasets. As a result, when dealing with a greater number of attack classes, these systems often encounter difficulties in achieving precise detection. In contrast, this paper presents a novel method known as ABCNN, which improves the effectiveness of current models for both smaller and larger sets of attack classes.

Table 1 Literature overview

ABCNN-IDS: Attention-Based Convolutional Neural Network for Intrusion Detection in IoT Networks

Abstract

Similar content being viewed by others

COREM2 project: a beginning to end approach for cyber intrusion detection

Intrusion Detection Using Attention-Based CNN-LSTM Model

A Hybrid Network Intrusion Detection Model Based on CNN-LSTM and Attention Mechanism

1 Introduction

2 Related Work

3 The Proposed Attention-Based CNN

4 Performance Evaluation

4.1 Datasets

4.2 Data Cleaning

4.3 Features Filtering

4.4 Normalization

4.5 Experimental Setup

4.6 Evaluation Measure

4.7 Proposed ABCNN Layers Comparison

4.8 Proposed ABCNN Results on Different Optimization Functions

4.9 Proposed ABCNN Results on Different Batch Sizes

4.10 Performance Comparison with Other ML and DL Methods

5 Conclusion

Data Availibility

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical Approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation