Skip to main content

Advertisement

Log in

Feature Selection and Evolutionary Rule Learning for Big Data in Smart Building Energy Management

  • Published:
Cognitive Computation Aims and scope Submit manuscript

Abstract

Since buildings are one of the largest sources of energy consumption in most cities of the world, energy management is one of the major concerns in their design. To ameliorate this problem, buildings are becoming smarter by the incorporation of intelligent supervision and control systems. Data captured by the sensors can be interpreted and processed by rule-based computation methods of biological inspiration (such as genetic fuzzy systems, GFS) for predicting the future behavior of the building in a knowledge-based interpretable human-like manner. GFS are computational models inspired in human cognition which use evolutionary computation (inspired in the natural evolution) to automatically learn fuzzy rules which contain explicit imprecise knowledge about a system or process. This knowledge, represented using fuzzy rules that involve fuzzy linguistic variables and values, is used to perform approximate reasoning on the input values for obtaining inferred values for the output variables. In energy management of buildings, these rules allow a smart control of the system actuators to reduce the building average energy consumption. However, the large amount of data produced on a per second basis complicates the generation of accurate and interpretable models by means of traditional methods. In this paper, we present an evolutionary computation-based approach, namely a genetic fuzzy system, to build scalable and interpretable knowledge bases for predicting energy consumption in smart buildings. For accomplishing this task, we propose a cognitive computation system for multi-step prediction based on S-FRULER, a state-of-the-art scalable distributed GFS, coupled with a feature subset selection method to automatically select the most relevant features for different time steps. S-FRULER is able to learn a fuzzy rule-based system made up of Takagi-Sugeno-Kang (TSK) rules that are able to predict the output values using both linguistic imprecise knowledge (represented by fuzzy sets) and fuzzy inference. Experiments with real data on two different problems related with the energy management revealed an average improvement of 6% on accuracy with respect to S-FRULER without feature selection, and with knowledge bases with a lower number of variables.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

References

  1. Alcalá R, Alcalá-Fdez J, Herrera F. A proposal for the genetic lateral tuning of linguistic fuzzy systems and its interaction with rule selection. IEEE Trans Fuzzy Syst 2007;15(4):616–35.

    Article  Google Scholar 

  2. Alcalá R, Alcalá-Fdez J, Herrera F, Otero J. Genetic learning of accurate and compact fuzzy rule based systems based on the 2-tuples linguistic representation. Int J Approx Reason 2007;44(1):45–64.

    Article  Google Scholar 

  3. Alcalá R, Gacto MJ, Herrera F. A fast and scalable multiobjective genetic fuzzy system for linguistic fuzzy modeling in high-dimensional regression problems. IEEE Trans Fuzzy Syst 2011;19(4):666–81.

    Article  Google Scholar 

  4. Aljarah I, Ala’M AZ, Faris H, Hassonah MA, Mirjalili S, Saadeh H. Simultaneous feature selection and support vector machine optimization using the grasshopper optimization algorithm. Cogn Comput 2018; 10(3):478–495.

    Article  Google Scholar 

  5. Atli BG, Miche Y, Kalliola A, Oliver I, Holtmanns S, Lendasse A. Anomaly-based intrusion detection using extreme learning machine and aggregation of network traffic statistics in probability space. Cogn Comput 2018;10(5):848–863.

    Article  Google Scholar 

  6. Balac N, Sipes T, Wolter N, Nunes K, Sinkovits B, Karimabadi H. Large scale predictive analytics for real-time energy management. IEEE international conference on Big Data, 2013; 2013. p. 657–64.

  7. Bontempi G, Taieb S, Borgne YL. Machine learning strategies for time series forecasting. Business Intelligence. 2013;62–77.

  8. Chen T, Guestrin C. Xgboost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, CA, USA, August 13–17, 2016; 2016. p. 785–794.

  9. Cordón O, Herrera F, Hoffmann F, Magdalena L, Cordon O, Herrera F, Hoffmann F. Genetic fuzzy systems. Singapore: World Scientific Publishing Company; 2001.

    Book  Google Scholar 

  10. Ding S, Xi X, Liu Z, Qiao H, Zhang B. A novel manifold regularized online semi-supervised learning model. Cogn Comput 2018;10(1):49–61.

    Article  Google Scholar 

  11. Fan C, Xiao F, Wang S. Development of prediction models for next-day building energy consumption and peak power demand using data mining techniques. Appl Energy 2014;127:1–10.

    Article  Google Scholar 

  12. Fernández A, López V, del Jesus MJ, Herrera F. 2015. Revisiting evolutionary fuzzy systems: taxonomy, applications, new trends and challenges. Knowl-Based Syst. 80:109–121.

  13. Fernández A, del Río S, López V, Bawakid A, del Jesus MJ, Benítez JM, Herrera F. Big data with cloud computing: an insight on the computing environment, mapreduce, and programming frameworks. Wiley Interdiscip Rev: Data Min Knowl Disc 2014;4(5):380–409.

    Google Scholar 

  14. Foucquier A, Robert S, Suard F, Stéphan L, Jay A. State of the art in building modelling and energy performances prediction: a review. Renew Sust Energ Rev 2013;23:272–88.

    Article  Google Scholar 

  15. Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001;29(5):1189–1232.

    Article  Google Scholar 

  16. Guyon I, Elisseeff A. An introduction to variable and feature selection. J Mach Learn Res 2003;3:1157–82.

    Google Scholar 

  17. Herrera F. Genetic fuzzy systems: taxonomy, current research trends and prospects. Evol Intel 2008;1(1):27–46.

    Article  Google Scholar 

  18. Ishibuchi H, Nozaki K, Tanaka H, Hosaka Y, Matsuda M. Empirical study on learning in fuzzy systems by rice taste analysis. Fuzzy Sets Syst 1994;64(2):129–44.

    Article  Google Scholar 

  19. L’Heureux A, Grolinger K, ElYamany HF, Capretz MAM. Machine learning with big data: challenges and approaches. IEEE Access 2017;5:7776–97.

    Article  Google Scholar 

  20. Li C, Deng C, Zhou S, Zhao B, Huang GB. Conditional random mapping for effective elm feature representation. Cogn Comput 2018;10(5):827–847.

    Article  Google Scholar 

  21. Life-OPERE. 2016. http://www.life-opere.org/. Online; Accessed 19 Feb 2018.

  22. Lu T, Viljanen M. Prediction of indoor temperature and relative humidity using neural network models: model comparison. Neural Comput Appl 2009;18(4):345–57.

    Article  Google Scholar 

  23. Luo X, Zhu X, Lim EG, Huang Y. A semi-blind model with parameter identification for building temperature estimation. Cogn Comput 2018;10(1):105–16.

    Article  Google Scholar 

  24. Marchiori E. Class conditional nearest neighbor for large margin instance selection. IEEE Trans Pattern Anal Mach Intell 2010;32(2):364–70.

    Article  Google Scholar 

  25. Márquez AA, Márquez FA, Roldán AM, Peregrín A. An efficient adaptive fuzzy inference system for complex and high dimensional regression problems in linguistic fuzzy modelling. Knowl-Based Syst 2013; 54:42–52.

    Article  Google Scholar 

  26. Mechaqrane A, Zouak M. A comparison of linear and neural network ARX models applied to a prediction of the indoor temperature of a building. Neural Comput Appl 2004;13(1):32–7.

    Article  Google Scholar 

  27. Meteogalicia: Galician meteorological web page. http://www.meteogalicia.gal (2016). Online; Accessed 19 Feb 2018.

  28. Nian X, Sun M, Guo H, Wang H, Dai L. Observer-based stabilization control of time-delay t-s fuzzy systems via the non-uniform delay partitioning approach. Cogn Comput 2017;9(1):225–36.

    Article  Google Scholar 

  29. Parliament E, Council E. On the energy performance of buildings. Off J Eur Union 2010;153:13–35.

    Google Scholar 

  30. Ramírez-Gallego S, Fernández A, García S, Chen M, Herrera F. Big data: tutorial and guidelines on information and process fusion for analytics algorithms with mapreduce. Inf Fusion 2018;42:51–61.

    Article  Google Scholar 

  31. Reyes-Ortiz JL, Oneto L, Anguita D. Big data analytics in the cloud: spark on Hadoop vs MPI/OpenMP on Beowulf. Procedia Comput Sci 2015;53:121–30.

    Article  Google Scholar 

  32. Riza LS, Bergmeir CN, Herrera F, Benítez Sánchez JM. 2015. FRBS: Fuzzy rule-based systems for classification and regression in R. American Statistical Association.

  33. Rodríguez FJ, García A, Pardo PJ, Chávez F, Luque-Baena RM. Study and classification of plum varieties using image analysis and deep learning techniques. Progr Artif Intell 2018;7(2):119–27.

    Article  Google Scholar 

  34. Rodríguez-Fdez I, Mucientes M, Bugarín A. An instance selection algorithm for regression and its application in variance reduction. Proceedings of the IEEE international conference on fuzzy systems (FUZZ-IEEE); 2013. p. 1–8.

  35. Rodríguez-Fdez I, Mucientes M, Bugarín A. Reducing the complexity in genetic learning of accurate regression TSK rule-based systems. Proceedings of the 2015 IEEE international conference on fuzzy systems (FUZZ-IEEE). IEEE; 2015. p. 1–8.

  36. Rodríguez-Fdez I, Mucientes M, Bugarín A. FRULER: fuzzy rule learning through evolution for regression. Inf Sci 2016;354:1–18.

    Article  Google Scholar 

  37. Rodríguez-Fdez I, Mucientes M, Bugarín A. S-FRULER: scalable fuzzy rule learning through evolution for regression. Knowl-Based Syst 2016;110:255–66.

    Article  Google Scholar 

  38. Rodriguez-Mier P, Mucientes M, Bugarín A. Scalable modeling of thermal dynamics in buildings using fuzzy rules for regression. 2017 IEEE international conference on fuzzy systems (FUZZ-IEEE); 2017. p. 1–6.

  39. Ruano A, Crispim E, Conceicao E, Lúcio MM. Prediction of building’s temperature using neural networks models. Energy Build 2006;38(6):682–94.

    Article  Google Scholar 

  40. Shaikh PH, Nor NBM, Nallagownden P, Elamvazuthi I, Ibrahim T. A review on optimized control systems for building energy and comfort management of smart sustainable buildings. Renew Sust Energ Rev 2014;34: 409–29.

    Article  Google Scholar 

  41. Teodosiu C, Hohota R, Rusaouën G, Woloszyn M. Numerical prediction of indoor air humidity and its effect on indoor environment. Build Environ 2003;38(5):655–64.

    Article  Google Scholar 

  42. Thomas B, Soleimani-Mohseni M. Artificial neural network models for indoor temperature prediction: investigations in two buildings. Neural Comput Appl 2006;16(1):81–9.

    Article  Google Scholar 

  43. White T. 2012. Hadoop: the definitive guide. O’Reilly Media, Inc.

  44. Yaqoob I, Hashem IAT, Gani A, Mokhtar S, Ahmed E, Anuar NB, Vasilakos AV. Big data: from beginning to future. Int J Inf Manag 2016;36(6):1231–47.

    Article  Google Scholar 

  45. Zaharia M, Chowdhury M, Franklin MJ, Shenker S, Stoica I. Spark: cluster computing with working sets. Proceedings of the 2nd USENIX conference on hot topics in cloud computing, vol 10, p 10; 2010.

  46. Zhao HX, Magoulès F. A review on the prediction of building energy consumption. Renew Sust Energ Rev 2012;16(6):3586–92.

    Article  Google Scholar 

  47. Zhao J, Lasternas B, Lam KP, Yun R, Loftness V. Occupant behavior and schedule modeling for building energy simulation through office appliance power consumption data mining. Energy Build 2014;82:341–55.

    Article  Google Scholar 

  48. Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Stat Soc 2005;67(2):301–20.

    Article  Google Scholar 

Download references

Funding

This research was supported by the European Union LIFE programme (grant LIFE12 ENV/ES/001173). Also, the following support is acknowledged: Spanish Ministry of Economy and Competitiveness (grant TIN2017-84796-C2-1-R), Galician Ministry of Education, Culture and Universities (grants GRC2014/030 and accreditation 2016-2019, ED431G/08). These grants are co-funded by the European Regional Development Fund (ERDF/FEDER program).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pablo Rodriguez-Mier.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Additional information

Informed Consent

Informed consent was not required as no human or animals were involved.

Human and Animal Rights

This article does not contain any studies with human or animal subjects performed by any of the authors.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rodriguez-Mier, P., Mucientes, M. & Bugarín, A. Feature Selection and Evolutionary Rule Learning for Big Data in Smart Building Energy Management. Cogn Comput 11, 418–433 (2019). https://doi.org/10.1007/s12559-019-09630-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12559-019-09630-6

Keywords

Navigation