A Novel Reinforcement Learning Method for Improving Occupant Comfort via Window Opening and Closing

May, Ross; Han, Mengjie; Zhang, Xingxing

doi:10.1007/978-981-16-2778-1_10

Ross May²,
Mengjie Han² &
Xingxing Zhang³

Part of the book series: Sustainable Development Goals Series ((SDGS))

631 Accesses

Abstract

An occupant’s window opening and closing behaviour can significantly influence the level of comfort in the indoor environment. Such behaviour is, however, complex to predict and control conventionally. This chapter, therefore, proposes a novel reinforcement learning (RL) method for the advanced control of window opening and closing. The RL control aims at optimising the time point for window opening/closing through observing and learning from the environment. The theory of model-free RL control is developed with the objective of improving occupant comfort, which is applied to historical field measurement data taken from an office building in Beijing. Preliminary testing of RL control is conducted by evaluating the control method’s actions. The results show that the RL control strategy improves thermal and indoor air quality by more than 90% when compared with the actual historically observed occupant data. This methodology establishes a prototype for optimally controlling window opening and closing behaviour. It can be further extended by including more environmental parameters and more objectives such as energy consumption. The model-free characteristic of RL avoids the disadvantage of implementing inaccurate or complex models for the environment, thereby enabling a great potential in the application of intelligent control for buildings.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 249.99; Price excludes VAT (USA)

Hardcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A reinforcement learning approach for thermostat setpoint preference learning

Article 01 August 2023

Exploring the Potential of Adaptive Behavior as a Tool Intended for Comfort and Saving Energy

Towards Plug&Play Smart Thermostats for Building’s Heating/Cooling Control

Notes

1.
We use Q(S, A) to represent an approximate value function from the data and q(S,A) to represent the target of the approximation.

References

Andersen R, Fabi V, Toftum J, Corgnati SP, Olesen BW (2013) Window opening behaviour modelled from measurements in Danish dwellings. Build Environ 69:101–113. https://doi.org/10.1016/j.buildenv.2013.07.005
Article Google Scholar
ASHRAE standard 55—thermal environmental conditions for human occupancy (2017). ASHRAE Inc
Google Scholar
Bellman R (1957a) A Markovian Decision Process. Indiana Univ Mathe J 6(4):679–684. https://doi.org/10.1512/iumj.1957.6.56038
Bellman R (1957b) Dynamic programming. Princeton Univercity Press, Princeton, NJ
Google Scholar
Botvinick M, Ritter S, Wang JX, Kurth-Nelson Z, Blundell C, Hassabis D (2019) Reinforcement learning, fast and slow. Trends Cogn Sci 23(5):408–422. https://doi.org/10.1016/j.tics.2019.02.006
Article Google Scholar
Chen Y, Norford LK, Samuelson HW, Malkawi A (2018) Optimal control of HVAC and window systems for natural ventilation through reinforcement learning. Energy Build 169:195–205. https://doi.org/10.1016/j.enbuild.2018.03.051
Article Google Scholar
Chen B, Cai Z, Berges M (2019) Gnu-RL: a precocial reinforcement learning solution for building hvac control using a differentiable MPC policy. New York, NY, USA, pp 316–325.https://doi.org/10.1145/3360322.3360849
Cheng W-L, Chen Y-S, Zhang J, Lyons TJ, Pai J-L, Chang S-H (2007) Comparison of the revised air quality index with the PSI and AQI indices. Sci Total Environ 382(2–3):191–198. https://doi.org/10.1016/j.scitotenv.2007.04.036
Article Google Scholar
D’Oca S, Hong T (2014) A data-mining approach to discover patterns of window opening and closing behaviour in offices. Build Environ 82:726–739. https://doi.org/10.1016/j.buildenv.2014.10.021
Article Google Scholar
Dalamagkidis K, Kolokotsa D, Kalaitzakis K, Stavrakakis GS (2007) Reinforcement learning for energy conservation and comfort in buildings. Build Environ 42(7):2686–2698. https://doi.org/10.1016/j.buildenv.2006.07.010
Article Google Scholar
Ding X, Du W, Cerpa A (2019) OCTOPUS: deep reinforcement learning for holistic smart building control. New York, NY, USA, pp 326–335. https://doi.org/10.1145/3360322.3360857
Dussault J-M, Sourbron M, Gosselin L (2016) Reduced energy consumption and enhanced comfort with smart windows: comparison between quasi-optimal, predictive and rule-based control strategies. Energy Build 127:680–691. https://doi.org/10.1016/j.enbuild.2016.06.024
Article Google Scholar
Enescu D (2017) A review of thermal comfort models and indicators for indoor environments. Renew Sustain Energy Rev 79:1353–1379. https://doi.org/10.1016/j.rser.2017.05.175
Article Google Scholar
Fabi V, Andersen RV, Corgnati S, Olesen BW (2012) Occupants’ window opening behaviour: a literature review of factors influencing occupant behaviour and models. Build Environ 58:188–198. https://doi.org/10.1016/j.buildenv.2012.07.009
Article Google Scholar
Fabi V, Andersen RV, Corgnati SP, Olesen BW (2013) A methodology for modelling energy-related human behaviour: Application to window opening behaviour in residential buildings. Build Simul 6(4):415–427. https://doi.org/10.1007/s12273-013-0119-6
Article Google Scholar
Fazenda P, Veeramachaneni K, Lima P, O’Reilly U-M (2014) Using reinforcement learning to optimize occupant comfort and energy usage in HVAC systems. J Ambient Intell Smart Environ 6(6):675–690. https://doi.org/10.3233/AIS-140288
Article Google Scholar
Fritsch R, Kohler A, Nygård-Ferguson M, Scartezzini J-L (1990) A stochastic model of user behaviour regarding ventilation. Build Environ 25(2):173–181. https://doi.org/10.1016/0360-1323(90)90030-U
Article Google Scholar
Frontczak M, Andersen RV, Wargocki P (2012) Questionnaire survey on factors influencing comfort with indoor environmental quality in Danish housing. Build Environ 50:56–64. https://doi.org/10.1016/j.buildenv.2011.10.012
Article Google Scholar
Haldi F, Robinson D (2009) Interactions with window openings by office occupants. Build Environ 44(12):2378–2395. https://doi.org/10.1016/j.buildenv.2009.03.025
Article Google Scholar
Han M et al (2019) A review of reinforcement learning methodologies for controlling occupant comfort in buildings. Sustain Cities Soc 51:101748. https://doi.org/10.1016/j.scs.2019.101748
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Article Google Scholar
Hong T, Wang Z, Luo X, Zhang W (2020) State-of-the-art on research and applications of machine learning in the building life cycle. Energy Build 212(109831):1–15
Google Scholar
Huizenga C, Abbaszadeh S, Zagreus L, Arens EA (2006) Air quality and thermal comfort in office buildings: results of a large indoor environmental quality survey. Healthy Build Lisbon 3:393–397
Google Scholar
Jassim MS, Coskuner G (2017) Assessment of spatial variations of particulate matter (PM10 and PM2.5) in Bahrain identified by air quality index (AQI). Arab J Geosci 10(19). https://doi.org/10.1007/s12517-016-2808-9
Jeong B, Jeong J-W, Park JS (2016) Occupant behaviour regarding the manual control of windows in residential buildings. Energy Build 127:206–216. https://doi.org/10.1016/j.enbuild.2016.05.097
Article Google Scholar
Jin W, Zhang N, He J (2015) Experimental study on the influence of a ventilated window for indoor air quality and indoor thermal environment. Procedia Eng 121:217–224. https://doi.org/10.1016/j.proeng.2015.08.1058
Article Google Scholar
Kyrkilis G, Chaloulakou A, Kassomenos PA (2007) Development of an aggregate air quality Index for an urban Mediterranean agglomeration: Relation to potential health effects. Environ Int 33(5):670–676. https://doi.org/10.1016/j.envint.2007.01.010
Article Google Scholar
Li N, Li J, Fan R, Jia H (2015) Probability of occupant operation of windows during transition seasons in office buildings. Renew Energy 73:84–91. https://doi.org/10.1016/j.renene.2014.05.065
Article Google Scholar
Mandic DP, Chambers JA (2001) Recurrent neural networks for prediction: learning algorithms, architectures, and stability. John Wiley, Chichester; New York
Book Google Scholar
Mnih V et al (2013) Playing Atari with Deep Reinforcement learning. arXiv:1312.5602 [cs], Accessed: 26 Jan 2019. [Online]. Available: http://arxiv.org/abs/1312.5602
Mnih V et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533. https://doi.org/10.1038/nature14236
Article Google Scholar
Mozer MC (1998) The neural network house: An environment that adapts to its inhabitants. AAAI Spring Symp Intell Environ 58:110–114
Google Scholar
Nagy A, Kazmi H, Cheaib F, Driesen J (2018) Deep reinforcement learning for optimal control of space heating. arXiv:1805.03777
Nunes de Freitas P, Guedes MC (2015) The use of windows as environmental control in ‘Baixa Pombalina’s’ heritage buildings. Renew Energy 73:92–98. https://doi.org/10.1016/j.renene.2014.08.029
Article Google Scholar
Pan S et al (2018) A study on influential factors of occupant window-opening behaviour in an office building in China. Build Environ 133:41–50. https://doi.org/10.1016/j.buildenv.2018.02.008
Article Google Scholar
Pan S et al (2019) A model based on Gauss Distribution for predicting window behaviour in building. Build Environ 149:210–219. https://doi.org/10.1016/j.buildenv.2018.12.008
Article Google Scholar
Park JY, Dougherty T, Fritz H, Nagy Z (2019) LightLearn: an adaptive and occupant centered controller for lighting based on reinforcement learning. Build Environ 147:397–414. https://doi.org/10.1016/j.buildenv.2018.10.028
Article Google Scholar
Pascanu R, Mikolov T, Bengio Y (2013) On the difficulty of training recurrent neural networks. In: International conference on machine learning, pp 1310–1318
Google Scholar
Pu H, Luo K, Wang P, Wang S, Kang S (2017) Spatial variation of air quality index and urban driving factors linkages: evidence from Chinese cities. Environ Sci Pollut Res 24(5):4457–4468. https://doi.org/10.1007/s11356-016-8181-0
Article Google Scholar
Rijal HB, Tuohy P, Nicol F, Humphreys MA, Samuel A, Clarke J (2008) Development of an adaptive window-opening algorithm to predict the thermal comfort, energy use and overheating in buildings. J Build Perform Simul 1(1):17–30. https://doi.org/10.1080/19401490701868448
Article Google Scholar
Rijal HB, Humphreys MA, Nicol JF (2018) Development of a window opening algorithm based on adaptive thermal comfort to predict occupant behaviour in Japanese dwellings. Jpn Architectural Rev 1(3):310–321. https://doi.org/10.1002/2475-8876.12043
Article Google Scholar
Roulet C-A et al (2006) Perceived health and comfort in relation to energy use and building characteristics. Build Res Inf 34(5):467–474. https://doi.org/10.1080/09613210600822279
Article Google Scholar
Ruelens F, Claessens BJ, Vandael S, Iacovella S, Vingerhoets P, Belmans R (2014) Demand response of a heterogeneous cluster of electric water heaters using batch reinforcement learning. Wroclaw, Poland, pp 1–7
Google Scholar
Ruelens F, Iacovella S, Claessens BJ, Belmans R (2015) Learning agent for a heat-pump thermostat with a set-back strategy using model-free reinforcement learning. Energies 8:8300–8318. https://doi.org/10.3390/en8088300
Shaikh PH, Nor NBM, Nallagownden P, Elamvazuthi I, Ibrahim T (2013) Robust stochastic control model for energy and comfort management of buildings. Aust J Basic Appl Sci 7(10):137–144
Google Scholar
Shi G, Liu D, Wei Q (2017) Echo state network-based Q-learning method for optimal battery control of offices combined with renewable energy. IET Control Theory Appl 11(7):915–922
Article Google Scholar
Shi Z et al (2018) Seasonal variation of window opening behaviours in two naturally ventilated hospital wards. Build Environ 130:85–93. https://doi.org/10.1016/j.buildenv.2017.12.019
Article Google Scholar
Silver D et al (2016) Mastering the game of Go with deep neural networks and tree search. Nature 529(7587):484–489. https://doi.org/10.1038/nature16961
Article Google Scholar
Silver D et al (2017) Mastering the game of go without human knowledge. Nature 550(7676):354–359. https://doi.org/10.1038/nature24270
Article Google Scholar
Singh J (1996) Review: health, comfort and productivity in the indoor environment. Indoor and Built Environ 5(1):22–33. https://doi.org/10.1177/1420326X9600500105
Article Google Scholar
Stazi F, Naspi F, Ulpiani G, Di Perna C (2017) Indoor air quality and thermal comfort optimization in classrooms developing an automatic system for windows opening and closing. Energy Build 139:732–746. https://doi.org/10.1016/j.enbuild.2017.01.017
Article Google Scholar
Sutton RS, Barto AG (2018) Reinforcement learning: an introduction, 2nd edn. The MIT Press, Cambridge, MA
Google Scholar
Tanner RA, Henze GP (2014) Stochastic control optimization for a mixed mode building considering occupant window opening behaviour. J Build Perform Simul 7(6):427–444. https://doi.org/10.1080/19401493.2013.863384
Article Google Scholar
Wang L, Greenberg S (2015) Window operation and impacts on building energy consumption. Energy Build 92:313–321. https://doi.org/10.1016/j.enbuild.2015.01.060
Article Google Scholar
Watkins CJCH (1989) Learning from delayed rewards. Ph.D. thesis, University of Cambridge
Google Scholar
Werbos PJ (1990) Backpropagation through time: what it does and how to do it. Proc IEEE 78(10):1550–1560. https://doi.org/10.1109/5.58337
Article Google Scholar
Yun GY, Steemers K (2008) Time-dependent occupant behaviour models of window control in summer. Build Environ 43(9):1471–1482. https://doi.org/10.1016/j.buildenv.2007.08.001
Article Google Scholar
Zhang H, Arens E, Pasut W (2011) Air temperature thresholds for indoor comfort and perceived air quality. Build Res Inf 39(2):134–144. https://doi.org/10.1080/09613218.2011.552703
Article Google Scholar

Download references

Author information

Authors and Affiliations

Micro Data Analysis , Dalarna University, SE-79188, Falun, Sweden
Ross May & Mengjie Han
Department of Energy and Community Buildings, Dalarna University, SE-79188, Falun, Sweden
Xingxing Zhang

Authors

Ross May
View author publications
You can also search for this author in PubMed Google Scholar
Mengjie Han
View author publications
You can also search for this author in PubMed Google Scholar
Xingxing Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ross May .

Editor information

Editors and Affiliations

Dalarna University, Falun, Sweden
Xingxing Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

May, R., Han, M., Zhang, X. (2021). A Novel Reinforcement Learning Method for Improving Occupant Comfort via Window Opening and Closing. In: Zhang, X. (eds) Data-driven Analytics for Sustainable Buildings and Cities. Sustainable Development Goals Series. Springer, Singapore. https://doi.org/10.1007/978-981-16-2778-1_10

Download citation

DOI: https://doi.org/10.1007/978-981-16-2778-1_10
Published: 12 September 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-2777-4
Online ISBN: 978-981-16-2778-1
eBook Packages: Earth and Environmental ScienceEarth and Environmental Science (R0)

Publish with us

Policies and ethics

A Novel Reinforcement Learning Method for Improving Occupant Comfort via Window Opening and Closing

Abstract

Access this chapter

Similar content being viewed by others

A reinforcement learning approach for thermostat setpoint preference learning

Exploring the Potential of Adaptive Behavior as a Tool Intended for Comfort and Saving Energy

Towards Plug&Play Smart Thermostats for Building’s Heating/Cooling Control

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

A Novel Reinforcement Learning Method for Improving Occupant Comfort via Window Opening and Closing

Abstract

Access this chapter

Similar content being viewed by others

A reinforcement learning approach for thermostat setpoint preference learning

Exploring the Potential of Adaptive Behavior as a Tool Intended for Comfort and Saving Energy

Towards Plug&Play Smart Thermostats for Building’s Heating/Cooling Control

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation