Abstract
Wireless sensor network (WSN) is used for data collection and transmission in IoT environment. Since it consists of a large number of sensor nodes, a significant amount of redundant data and outliers are generated which substantially deteriorate the network performance. Data aggregation is needed to reduce energy consumption and prolong the lifetime of WSN. In this paper a novel data aggregation scheme is proposed which is based on modified radial basis function neural network to classify the collected data at cluster head and eliminate the redundant data and outliers. Additionally, cosine similarity is used to cluster the nodes having the most similar data. The radial basis function (RBF) is adapted by Mahalanobis distance to support the outlier’s detection and analysis in the multivariate data. The data collected from the sensor node at the cluster head are processed by mahalanbis distance-based radial basis function neural network (MDRBF-NN) before transferred to the based station. Extensive computer simulation with real datasets shows that the proposed scheme consistently outperforms the existing representative data aggregation schemes in terms of data classification, outlier detection, and energy efficiency.
Similar content being viewed by others
References
Abukhalaf H, Wang J, Zhang S (2015) Outlier detection techniques for localization in wireless sensor networks: a survey. Int J Future Gener Commun Netw 8(6):99–114
Adachi K (2016) Matrix-based introduction to multivariate data analysis. Springer, Singapore
Aggarwal CC (2015) Outlier analysis. In: Data mining. Springer, Cham. https://doi.org/10.1007/978-3-319-14142-8_8
Bharti S, Pattanaik K, Pandey A (2019) Contextual outlier detection for wireless sensor networks. J Ambient Intell Human Comput 11(4):1511–1530
Biswas S, Das R, Chatterjee P (2018) Energy-efficient connected target coverage in multi-hop wireless sensor networks. In: Bhattacharyya S, Sen S, Dutta M, Biswas P, Chattopadhyay H (eds) Industry interactive innovations in science, engineering and technology, vol 11. Springer, Singapore, pp 411–421
Bo W, Han-ying H, Wen F (2007) A pseudo LEACH algorithm for wireless sensor networks. In: IMECS, pp 1366–1370
Cui X, Ma Z (2019) Dynamic heartbeat detection algorithm based on RBFNN. J Eng 2019(20):6426–6428
da Silva IN, Hernane Spatti D, Andrade Flauzino R, Liboni LHB, dos Reis Alves SF (2017) Radial basis function networks. In: Artificial neural networks. Springer, Cham, pp 117–138
De Paola A, Gaglio S, Re GL, Milazzo F, Ortolani M (2015) Adaptive distributed outlier detection for WSNs. IEEE Trans Cybern 45:902–913
De Paola A, La Cascia M, Re GL, Morana M, Ortolani M (2012) User detection through multi-sensor fusion in an ami scenario. In 2012 15th international conference on information fusion, pp 2502–2509
Du K-L, Swamy MN (2013) Neural networks and statistical learning. Springer Science & Business Media, Berlin
Fang W, Wen X, Xu J, Zhu J (2019) CSDA: a novel cluster-based secure data aggregation scheme for WSNs. Cluster Comput 22(3):5233–5244
Garrido-Varo A, Garcia-Olmo J, Fearn T (2019) A note on Mahalanobis and related distance measures in WinISI and The Unscrambler. J Near Infrared Spectrosc 27(4):253–258
Gedik B, Liu L, Philip SY (2007) ASAP: an adaptive sampling approach to data collection in sensor networks. IEEE Trans Parallel Distrib Syst 18(12):1766–1783
Haykin SS (2009) Neural networks and learning machines. Prentice-Hall, Pearson Upper Saddle River
Jadhav NH, Kashid DN, Kulkarni SR (2014) Subset selection in multiple linear regression in the presence of outlier and multicollinearity. Stat Methodol 19:44–59
Khedo K, Doomun R, Aucharuz S (2010) Reada: redundancy elimination for accurate data aggregation in wireless sensor networks. Wirel Sens Netw 2(04):300
Kokoska S, Zwillinger D (2000) CRC standard probability and statistics tables and formulae. Chapman & Hall/CRC, USA. ISBN 0-8493-0026-6
Kuna HD, García-Martinez R, Villatoro FR (2014) Outlier detection in audit logs for application systems. Inf Syst 44:22–33
Li G, Wang Y (2013) Automatic ARIMA modeling-based data aggregation scheme in wireless sensor networks. EURASIP J Wirel Commun Netw 2013:85
Lin H, Bai D, Gao D, Liu Y (2016) Maximum data collection rate routing protocol based on topology control for rechargeable wireless sensor networks. Sensors 16:1201
Liu C, Wu K, Pei J (2007) An energy-efficient data collection framework for wireless sensor networks by exploiting spatiotemporal correlation. IEEE Trans Parallel Distrib Syst 18(7):1010–1023
Murphy KP, Russell S (2002) Dynamic Bayesian networks: representation, inference and learning. PhD thesis, University of California, Berkeley
Nawaz F, Jeoti V (2020) Efficient data delivery in dense reader environment of passive sensor network. J Ambient Intell Human Comput 11:3707–3715
Ozdemir S, Xiao Y (2011) Polynomial regression based secure data aggregation for wireless sensor networks. In 2011 IEEE global telecommunications conference-GLOBECOM 2011, pp 1–5
Prabaharan G, Jayashri S (2020) An optimal mobile data gathering in small scale WSN by power saving adaptive clustering techniques. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-020-01757-x
Rafiq M, Ashraf S, Abdullah S, Mahmood T, Muhammad S (2019) The cosine similarity measures of spherical fuzzy sets and their applications in decision making. J Intell Fuzzy Syst 36(6):6059–6073
Roy N, Pallapa G, Das SK (2007) A middleware framework for ambiguous context mediation in smart healthcare application. In: Third IEEE international conference on wireless and mobile computing, networking and communications (WiMob 2007), p 72
Rutkowski L, Jaworski M, Duda P (2020) Basic concepts of data stream mining. In: Stream data mining: algorithms and their probabilistic properties, vol 56. Springer, Cham, pp 13–33
Santini S, Romer K (2006) An adaptive strategy for quality-based data reduction in wireless sensor networks. In Proceedings of the 3rd international conference on networked sensing systems (INSS 2006), pp 29–36
Singh M, Soni SK (2019) Fuzzy based novel clustering technique by exploiting spatial correlation in wireless sensor network. J Ambient Intell Human Comput 10(4):1361–1378
Srilakshmi N, Sangaiah AK (2019) Selection of machine learning techniques for network lifetime parameters and synchronization issues in wireless networks. J Inf Process Syst 15(4):833–852
Sun L-Y, Cai W, Huang X-X (2010) Data aggregation scheme using neural networks in wireless sensor networks. In: 2010 2nd international conference on future computer and communication, vol 1, p 725
Sung W-T (2009) Employed BPN to multi-sensors data fusion for environment monitoring services. In: International conference on autonomic and trusted computing, pp 149–163
Ullah I, Youn HY (2018) Statistical multipath queue-wise preemption routing for ZigBee-based WSN. Wirel Pers Commun 100:1537–1551
Ullah I, Youn HY (2019) A novel data aggregation scheme based on self-organized map for WSN. J Supercomput 75(7):3975–3996
Ullah I, Youn HY (2020) Efficient data aggregation with node clustering and extreme learning machine for WSN. J Supercomput 76:10009–10035
van Kasteren T, Krose B (2007) Bayesian activity recognition in residence for elders. In: 2007 3rd IET international conference on intelligent environments, pp 209–212
Villas LA, Boukerche A, Guidoni DL et al (2013) An energy-aware spatio-temporal correlation mechanism to perform efficient data collection in wireless sensor networks. Comput Commun 36(9):1054–1066
Wan X, Wang W, Liu J, Tong T (2014) Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range. BMC Med Res Methodol 14(1):135. https://doi.org/10.1186/1471-2288-14-135
Yadav S, Yadav RS (2019) Redundancy elimination during data aggregation in wireless sensor networks for IoT systems. In: Khare A, Tiwary U, Sethi I, Singh N (eds) Recent trends in communication, computing, and electronics. Springer, Singapore, pp 195–205
Yin Y, Liu F, Zhou X, Li Q (2015) An efficient data compression model based on spatial clustering and principal component analysis in wireless sensor networks. Sensors 15(8):19443–19465
Yu T, Akhtar AM, Wang X, Shami A (2015) Temporal and spatial correlation based distributed fault detection in wireless sensor networks. In: 2015 IEEE 28th Canadian conference on electrical and computer engineering (CCECE), pp 1351–1355
Yuan J, Chen H (2009) The optimized clustering technique based on spatial-correlation in wireless sensor networks. In: 2009 IEEE youth conference on information, computing and telecommunication. IEEE, pp 411–414
Yuan F, Zhan Y, Wang Y (2014) Data density correlation degree clustering method for data aggregation in WSN. IEEE Sens J 14(4):1089–1098
Zhang Y, Ji Q (2006) Active and dynamic information fusion for multisensor systems with dynamic Bayesian networks. IEEE Trans Syst Man Cybern Part B (Cybern) 36(2):467–472
Zheng J, Wang P, Li C (2010) Distributed data aggregation using Slepian–Wolf coding in cluster-based wireless sensor networks. IEEE Trans Veh Technol 59(5):2564–2574
Acknowledgements
This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (No. NRF-2020R1I1A3065610 and No. NRF-2018R1A6A1A03025526).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Ullah, I., Youn, H.Y. & Han, YH. An efficient data aggregation and outlier detection scheme based on radial basis function neural network for WSN. J Ambient Intell Human Comput (2021). https://doi.org/10.1007/s12652-020-02703-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12652-020-02703-7