Control trajectory optimisation and optimal control of an electric vehicle HVAC system for favourable efficiency and thermal comfort

In order to increase the driving range of battery electric vehicles, while maintaining a high level of thermal comfort inside the passenger cabin, it is necessary to design an energy management system which optimally synthesizes multiple control actions of heating, ventilation and air-conditioning (HVAC) system. To gain an insight into optimal control actions and set a control benchmark, the paper first proposes an algorithm of dynamic programming (DP)-based optimisation of HVAC control variables, which minimises the conflicting criteria of passenger thermal comfort and HVAC efficiency. Next, a hierarchical structure of thermal comfort control system is proposed, which consists of optimised low-level feedback controllers, optimisation-based control allocation algorithm that sets references for the low-level controllers, and a superimposed cabin temperature controller that commands the cooling capacity to the allocation algorithm. Finally, the overall control system is verified by simulation for cool-down scenario, and the simulation results are compared with the DP benchmark. The results show that the control system behaviour can approach the DP benchmark if the superimposed controller bandwidth is tuned along with the allocation cost function weighting coefficients, where a fast controller tuning relates to better thermal comfort while a slow tuning results in improved efficiency.


Introduction
In recent years, electric vehicles have been increasingly adopted by public due to their superior energy efficiency and low or absent emissions. The presence of ample electric energy storage in hybrid and battery electric vehicles facilitates implementation of electrically-powered auxiliary drives for improved efficiency, flexibility and performance. This is of particular interest in automotive heating, ventilation and air-conditioning (HVAC) systems, where the use of variablespeed drives, such as those of compressor and fans, improves the overall performance. Moreover, modern electric vehicles are usually equipped with redundant HVAC actuators and multiple energy flows (Zhang et al. 2018), particularly when considering implementation of efficient heat pump systems in addition to airconditioning (A/C) system (Drage et al. 2019). This all significantly extends the requirements on designing the HVAC energy management system.
The driving range of fully electric vehicles is heavily affected by heating and cooling loads. Zhang et al. (2015) report average reduction of driving range by up to 60% in cold weather and about 33% in extremely hot weather. Under these extreme weather conditions, the HVAC system can constitute up to 65% of secondary energy consumption, with the primary consumption coming from powertrain. Paffumi et al. (2019) investigated the influence of ambient temperature on driving range in laboratory conditions using WLTC driving cycle, and confirmed that at − 10 °C the range drops by 59% compared to the range obtained at + 23 °C. Thus, it is of great interest to achieve highest possible HVAC system efficiency, which would result in increased driving range, while maintaining high passenger thermal comfort. To fulfil these conflicting criteria, it is necessary to develop advanced control systems which optimally coordinate multiple actuators and energy storage units.
Setting a realistic and achievable benchmark is imperative from the perspective of complex control system design and verification. Such a benchmark can conveniently be obtained by off-line control trajectory optimisation (Rao 2014). The main advantage of dynamic programming method (DP) is finding globally optimal solution for the general case of a nonlinear, discontinuous and multi-variable system. However, the DP algorithm is rather computationally inefficient, and it can only be applied to systems with a low number of state and control variables. DP solutions have been used as control performance benchmarks in hybrid electric vehicles (Guzella and Sciaretta 2007) or extended range electric vehicles (Škugor et al. 2014). The aim is to find optimal response of control variables such as the engine or e-motor torques/speeds for minimising the fuel consumption at the end of prescribed driving cycle, while satisfying different hardware constraints including the boundary condition on final battery state-of-charge (SoC). Guermi et al. (2014) apply similar approach to a fuel cell-powered series hybrid electric vehicle, while Sinoquet et al. (2011) use DP to optimise e-motor/engine power split as a part of solving a powertrain component sizing optimisation problem. In the framework of HVAC systems, DP has been used in conventional vehicles' A/C system optimisation , where an A/C clutch command 1 3 sequence is optimised to minimise fuel consumption for a given driving cycle. Similarly,  apply DP for optimisation of energy management of A/C system that uses thermal energy storage in A/C system.
The cabin thermal comfort is usually controlled through cabin air temperature feedback control (Marshall et al. 2019). On the other hand, fuzzy-logic control of cabin thermal comfort (Farzaneh and Tootoonchi 2008) relies on the feedback information of simplified predicted mean vote (PMV), as an index that more comprehensively characterises the thermal comfort. It is demonstrated therein that both thermal comfort and energy efficiency can be improved when using the PMV feedback in comparison with cabin air temperature control. Yan et al. (2018) propose a multiinput/single-output proportional-integral (PI)-like controller that accounts for engine efficiency and a PMV-based thermal comfort criterion. It is shown therein that the fuel consumption can be reduced compared to conventional control algorithms for the same level of thermal comfort. A more advanced, optimal control system is presented in (Amini et al. 2019), which uses a model-predictive controller (MPC) for tracking the cabin air temperature reference by commanding the evaporator outlet air temperature and the blower fan air mass flow. However, MPC generally strongly relies on accurate prediction model and is computationally demanding because of applying on-line optimisation over the prediction horizon at each time step. Once the control system structure is set up, the tuneable control parameters should be optimised for favourable performance. Genetic algorithm-based parameter optimisation is widely used in vehicle design, e.g. for vehicle suspension (Gobbi 2013) or automatic transmission control parameterisation (Ranogajec et al. 2019).
HVAC control examples include (Zhang et al. 2017), where a multi-objective optimisation method is used for obtaining trade-off between the fuel consumption and the cabin cool-down time, and (Cvok et al. 2020) where a control allocation map is optimised.
This paper proposes a DP-based control trajectory optimisation method for an electric vehicle HVAC system, as well as a hierarchical/cascade control strategy that can approach the DP results. The contributions of the paper are threefold. First, the DP algorithm is aimed at optimising both thermal comfort and efficiency, where a simplified first-order cabin model and a static HVAC model are used to reduce the computational load. Secondly, a hierarchical control structure is proposed, where an instantaneous optimisation-based control allocation algorithm transforms the cooling/heating capacity commanded by the superimposed cabin air temperature controller to open-loop actions and references for inner HVAC control loops. Thirdly, combined tuning of optimal allocation cost function and superimposed cabin air temperature controller parameters is proposed, as it is recognized that both have impact on the DP-revealed trade-off between thermal comfort and efficiency. Although the presented case study is based on an A/C system model, the developed optimisation approach and particularly the hierarchical control strategy can also be applied to more complex HVAC systems, such as those utilised in advanced battery electric vehicles (Drage et al. 2019).
The remaining part of the paper is organized as follows. In support of optimisation and control strategy design studies presented, Sect. 2 outlines control-oriented HVAC and cabin models used, as well as PMV index mapping. The DP-based HVAC control trajectory optimisation is presented in Sect. 3. Design of hierarchical control system is described in Sect. 4. Section 5 presents control trajectory optimisation results and simulation-based verification of the control strategy for a cool-down scenario. Concluding remarks are given in Sect. 6. Figure 1 depicts the passenger cabin thermal system connected to the conventional HVAC system considered to operate in A/C mode. A 12th-order control-oriented model of the HVAC system has been adopted from (Jensen and Tummescheit 2002) and described in detail in (Ratković et al. 2019). The model is based on movingboundary method, which provides a good trade-off between model complexity and accuracy (Jensen and Tummescheit 2002), and has been experimentally validated against high-fidelity fixed volume models (Rasmussen and Shenoy 2012). Refrigerant properties used in the model are obtained using publicly available CoolProp tool (Bell et al. 2014), while other physical parameters such as component dimensions are estimated to represent typical vehicle A/C system. The main model variables and parameters are described in Nomenclature.

HVAC and cabin modelling
Electric motor-powered compressor, electronic expansion valve, blower fan and condenser fan are considered as electric vehicle HVAC actuators. They typically have faster dynamics compared to the slower heat exchanger dynamics, which justifies modelling them as static elements (Rasmussen and Shenoy 2012). Therefore, the control inputs fed to the HVAC model are compressor speed ω com , electronic expansion valve opening a v , and blower and condenser fan air mass flows ṁ ea and ṁ ca , respectively. The model outputs include the evaporator outlet air temperature T ea,out (i.e. the cabin inlet air temperature), the superheat temperature ΔT SH , and the coefficient of performance (COP) defined as the ratio of evaporator air-side cooling power Q ea and compressor power consumption P com , i.e. COP = Q ea /P com . The power consumptions of expansion valve, blower fan and condenser fan are not considered in COP calculation since the compressor is dominant power consumer in this system.
The considered passenger cabin model (Fayazbakhsh and Bahrami 2013) consists of two thermal masses: (1) the cabin air volume V c with temperature T c and (2) the cabin body elements with mass m b and temperature T b . The modelled thermal loads include the constant metabolic load Q met taken into account if the cabin air temperature is below 36 °C, the solar radiation load Q sol , the ambient air convection heat transfer Q ab over the outer body surface A ab characterised with a variable heat transfer coefficient α ab (v veh ), the HVAC thermal load Q HVAC that takes into account the cabin air inlet and outlet, and the convection heat transfer from body elements to cabin air Q cb over the inner body surface A cb with the heat transfer coefficient α cb .
The second-order cabin model based on the heat balance method reads (Fayazbakhsh and Bahrami 2013): where c p,ac is the cabin air specific heat capacity, c p,ea is the evaporator-side air specific heat capacity, ρ c is the cabin air density and c p,b is the body specific heat capacity.
The model (1) may be simplified to a first-order model by assuming that the body temperature dynamic is slower than the cabin air temperature dynamic, which gives: where k c scales the cabin air temperature thermal inertia to properly match the second-order model dynamics and ΔT b is the constant air-to-body temperature offset used for "tuning" the steady state accuracy.
For the single-zone modelling assumption, the mean air velocity v air inside the cabin is set to be proportional to the blower fan air mass flow ṁ ea : where k v,ea is expressed as the ratio of air density and cabin inlet vents cross-section area. Similarly, a linear relationship between the vehicle speed v veh and the condenser fan air mass flow ṁ ca is assumed: where k v,ca is constant coefficient, and ṁ ca,R is the condenser fan control input, set to zero herein.
The closed-loop dynamics of evaporator outlet air temperature control system built in the HVAC model is assumed to be by an order of magnitude faster than the cabin air temperature dynamics. Therefore, in order to enhance the computational efficiency of DP-based control trajectory optimisation, the overall HVAC model is in that case represented by static maps shown in Fig. 2 (Ratković et al. 2019). Here, the superheat temperature ΔT SH was fixed to its target value of 5 °C, based on the assumption that it was effectively controlled by the electronic expansion valve.
The cabin thermal comfort is evaluated through the Predicted Mean Vote (PMV) index, which is adjusted to take into account the cooling effect of increased air velocity (ANSI/ASHRAE Standard 55, Rev. 2017). A positive value of PMV means that the cabin environment is too hot, while a negative PMV indicates that it is too cold ( Table 1). The zero PMV suggests ideal thermal comfort, while the comfortable range according to the aforementioned standard is defined as |PMV| < 0.5. The PMV takes into account six different parameters: air temperature T air , air velocity  In order to simplify the PMV calculation, it is assumed that: (1) the mean radiant temperature T mr is equal to the mean air temperature inside the cabin T c , (2) the driver is wearing summer clothes (the clothing thermal resistance is set to 0.5 clo), and (3) the metabolic rate is set to 1.5. This gives the PMV map shown in Fig. 3a, where black circles indicate the comfortable range, i.e. |PMV| < 0.5. An excerpt of PMV map related to constant relative humidity of 44% is shown in Fig. 3b, where the black solid lines denote the boundaries of comfort range (|PMV| < 0.5). This plot indicates that in the hot conditions, i.e. when PMV > 0, the same level of thermal comfort can be achieved for growing cabin air temperatures T air if the air velocity v air is increased (and also if the humidity is reduced, Fig. 3a). Similarly, in the cold conditions, i.e. when PMV < 0, the thermal comfort can be preserved for falling cabin air temperature if the air velocity is decreased.

Control trajectory optimisation
The presented control trajectory optimisation is conducted by using a customdesigned algorithm based on the dynamic programming (DP) method (Bellman and Dreyfus 1962). The DP-based optimisation results in globally optimal solution, as it starts from the final time instant t f and calculates the optimal control inputs for all possible state variables backwards in time, while satisfying the process model.
In order to reduce the computational burden of DP optimisation, the cabin is described by the first-order model (2) with the state variable the control input vector u is reduced to two main inputs

Fig. 3
Predicted mean vote (PMV) map in dependence of three main inputs and with filled circles showing comfort range (|PMV| < 0.5) (a) and PMV map excerpt related to relative humidity RH = 44% with black lines denoting comfort range (b) and the HVAC model is described by the static maps T ea,out = T ea,out (ω com , ṁ ea , ṁ ca ) and COP = COP(ω com , ṁ ea , ṁ ca ) shown in Fig. 2. The thermal comfort output PMV is obtained by using the map shown in Fig. 3a. Trilinear interpolation is applied when evaluating the maps in Figs. 2 and 3a. The expansion valve opening a v is not contained in the control vector u, because the HVAC static maps have been obtained for the constant/target superheat temperature value. The condenser fan air mass flow ṁ ca represents a disturbance variable.
The control trajectory optimisation problem is to find the control vector u(k), which minimises the cost function at each discrete-time instant k and subject to a discrete-time counterpart of the plant dynamics model (2). The terminal condition function is set to provide that the cabin air temperature reference x R (t f ) = T cR is achieved at the end of optimisation time horizon, which is ensured by applying a sufficiently high penalisation coefficient K tf (set to 10,000, herein). The sub-integral function F(·) includes minimisation of thermal comfort criterion (PMV) and maximisation of efficiency (COP), alongside with penalisation of state-variable and control input constraint violations: where K PMV and K COP are weighting coefficients that set the trade-off between thermal comfort and efficiency, K lim is constraint violation penalisation coefficient that should be sufficiently high (set to 1000, herein), and H(a) is the Heaviside function defined as H(a) = 0 for a < 0 and H(a) = 1 for a ≥ 0. The constraints are used to contain the state-variable in the target range defined by the scenario, e.g. 20-40 °C. Note that, alternatively, the HVAC system power consumption can be used instead of COP in the cost function (7).

Control strategy design
The control strategy proposed in this paper has a two-level hierarchical structure shown below in Fig. 6. Low-level feedback controllers ensure setpoint tracking and disturbance rejection for HVAC subsystem. The high-level control subsystem controls the cabin air temperature and allocates references for low-level controllers.

Low-level control system
The evaporator outlet air temperature, i.e. the cabin inlet air temperature T ea,out , is controlled in a feedback loop to provide accurate and high-bandwidth tracking of the reference set by the high-level control system. The superheat temperature ΔT SH is regulated with respect to fixed reference ΔT SH,R = 5 °C, where the main aim of the corresponding feedback controller is to suppress disturbance influence including the one imposed by the action of outlet temperature controller. The linearized input-output HVAC model is characterised by coupled dynamics, which is described in Fig. 4a by four transfer functions linking the control inputs ω com and a v to the controlled outputs T ea,out and ΔT SH : It has been found that reasonably good control performance of superheat temperature regulation and evaporator setpoint tracking can be obtained for the given HVAC model by applying a simplified, decoupled control structure where only two main controllers G c11 (s) and G c22 (s) are used (Fig. 4a). The controllers are of proportional-integral (PI) type, and their parameters are tuned by using a search-algorithm optimisation procedure targeted to single-input single-output (SISO) linear system (Isermann 1981). The cost function to be minimised combines penalisation of closed-loop control error and control effort. Referring to the control structure shown in Fig. 4a, the cost functions for the two control loops are defined as: where r 11 and r 22 are weighting coefficients which set the trade-off between control error suppression, i.e. performance, and control effort reduction, i.e. efficiency and relative stability. Since the linearized HVAC dynamics model parameters depend on the operating point, PI controller gain scheduling maps have been obtained by repeating the optimisation procedure for multiple operating points with fixed weighting coefficients r 11 and r 22 . The analysis has showed that the most significant operating point parameters are the evaporator outlet air temperature T ea,out and the blower fan air mass flow ṁ ea , which results in two-dimensional scheduling maps for the controller proportional and integral gains. The final low-level control system structure is shown in Fig. 4b and it consists of two PI controllers with two pairs of gain-scheduling maps. The low-level control system performance is illustrated in Fig. 5 for the full, 12-th order nonlinear process model, where blue lines denote the response of control system with fixed controller gains (tuned for T ea,out = 15 °C and m ea = 0.05 kg/s), while green lines correspond to the control system with gain-scheduling applied. The evaporator air mass flow ṁ ea is kept at 0.075 kg/s, the superheat temperature reference is ΔT SH,R = 5 °C and the evaporator outlet air temperature reference with magnitude of ΔT ea,out,R = 5 °C is applied at t = 1000 s. In comparison with the fixedgain control system, the gain-scheduling control system achieves faster evaporator outlet air temperature response (Fig. 5a) and lower superheat temperature control error (Fig. 5b). The performance improvement is achieved by stronger compressor and expansion valve control efforts (Fig. 5c, d). Figure 5e, f show that optimal controller gains vary significantly, thus making the gain scheduling algorithm necessary to achieve optimal performance over a wide operating range.
It has been found that the closed-loop system performance can be further improved by taking into account the coupled dynamics of HVAC model, which are described in Fig. 4a by the cross-coupling transfer functions G 12 (s) and G 21 (s). In this case, the parameters of both PI controller were optimised simultaneously, with an option to include the cross-coupling controller terms/gains G c12 (s) and G c21 (s), as well (Fig. 4a). A multi-objective genetic algorithm was used as optimisation algorithm, because it allowed for overcoming the appearance of local optima and presenting the results in the form of Pareto frontier that enables the designer to select optimal solution based on his/her preference (Cvok et al. 2020). However, such procedure is more time consuming, especially when gain-scheduling is concerned.

High-level control system
In order to achieve favourable cabin thermal comfort while maintaining the maximum HVAC system efficiency, a supervisory high-level control system has been developed. According to the block diagram shown in Fig. 6, the high-level control system regulates the cabin air temperature T c by commanding the cooling capacity Q d . The cooling capacity Q d is then transformed within a control allocation map to low-level controller inputs/references, which in this case include the evaporator outlet air temperature and air mass flow references T ea,out,R and ṁ ea,R , respectively, while in a more general case more inputs are possible, such as the condenser air mass flow ṁ ca,R . Using the cabin air temperature T c and the cooling capacity demand Q d as inputs to the control allocation map allows for omitting the cabin dynamics model when designing the control allocation map. This significantly facilitates allocation map generation, and, more importantly, makes the allocation map independent of cabin model and related disturbances (see Fig. 1). To achieve optimal system performance, it is crucial to base the design of control allocation map on optimisation (Johansen and Fossen 2013). For the specific HVAC system and design case, control allocation is based on instantaneous, on-line optimisation. A linear search-based method is applied starting from the minimum blower fan air mass flow setpoint and corresponding evaporator outlet air temperature as Fig. 6 Overall hierarchical control system structure including cabin air temperature controller, optimal control input allocation, and inner control loops initial guesses. The on-line optimisation relies on PMV and COP maps, both of which are prepared off-line as functions of two inputs (Fig. 2). However, in more general case when using multiple control inputs (e.g. ṁ ca ), the dimension of COP map grows, which can lead to poor computational efficiency when using a linear search or may result in local optima when a more advanced, directional search approach is applied. To overcome these weaknesses, an alternative, off-line optimisation approach based on a multi-objective genetic algorithm can be applied, as presented in Cvok et al. (2020). That approach results in control input maps as functions of cabin temperature and cooling capacity demand (Fig. 6), which could be fitted by analytical models/functions, to facilitate the control strategy implementation and calibration.
At the superimposed level, a fixed-gain PI-type cabin air temperature controller G c,CAB (s) is used with an option to add a gain scheduling algorithm in more general case (Fig. 6). Since the cabin air temperature dynamics are slow, the cabin air temperature controller and the control allocation strategy can have higher sampling time than the low-level controllers (10 s vs. 0.1 s).
The optimal control allocation map is obtained by minimising the following cost function for a wide range of operating points ( Q d , T c ): where K PMV and K COP are weighting coefficients that set the trade-off between the two conflicting criteria: thermal comfort (PMV) and efficiency (COP). The control variables T ea,out,R and ṁ ea,R , are subject to the following constraints:

Results
Control trajectory optimisation and, similarly, control system simulation analysis, have been carried out for a cool-down scenario at the constant vehicle velocity v veh = 40 km/h. The objective is to bring the cabin air temperature down from its initial value, which is equal to the ambient air temperature (T c0 = T a = 40 °C), to the final cabin air temperature of T c,R = 26 °C in 10 min, i.e. t f = 600 s.

Control trajectory optimisation results
DP optimisation has been carried out by using the parameters listed in Table 2.
The results shown in Fig. 7 indicate that for the HVAC efficiency-oriented case (red line), the optimal control action maintains a modest cooling capacity. This is reflected in a relatively slow fall of cabin air temperature (Fig. 7a), relatively high evaporator outlet air temperature T ea,out (Fig. 7a), and correspondingly high evaporator air mass flow ṁ ea (Fig. 7c). Such control is beneficial for HVAC efficiency (Fig. 7f) as it enables the compressor to operate at low speeds (Fig. 7d), thus minimising its power consumption and maximising the COP. Note that the optimal behaviour for this case will change to some extent if the blower fan power consumption were accounted for in COP, as the power consumption typically grows with air mass flow increase.  For the case of PMV minimisation (blue line), the optimal control behaviour is to increase the compressor speed and also air mass flow (Fig. 7c, d) at the beginning of response, in order to lower the cabin inlet air temperature (Fig. 7a) and achieve high cooling capacity, thus bringing the thermal comfort criterion PMV (Fig. 7e) towards zero as fast as possible. This results in the lowest COP (Fig. 7f) until the thermal comfort has been achieved (PMV = 0, t ~ 200 s), while the COP increases afterwards since lower compressor speed and lower air mass flow are sufficient to maintain the PMV around zero.
In the combined cost function case (green line in Fig. 7), the optimal control action expectedly results in compromise between the previous two extreme cases related to efficiency and thermal comfort maximisation.

Control system simulation results
The control system simulation results shown in Fig. 8 have been obtained for the same sets of weighting coefficients K PMV and K COP as used in Sect. 5.1, i.e. Figure 7, but now applied to the cost function (10) used in control allocation optimisation. In all these cost function settings, the same tuning of cabin air temperature PI controller is considered (K P = 125 and K I = 0.01).
The cabin air temperature response shown in Fig. 8a (dashed lines) is very similar for all three cases due to the same PI controller used. However, the allocated control inputs, i.e. the evaporator outlet air temperature (Fig. 8a, solid lines) and the evaporator air mass flow (Fig. 8f), are dependent on the weighting coefficients K PMV and K COP . For the case of COP maximisation (red line), the compressor speed (Fig. 8c) is kept low, which results in the highest efficiency (Fig. 8e, dashed lines). The low cabin air inlet temperature (Fig. 8a, solid lines) is set to meet the high cooling capacity demand set by superimposed controller. To achieve low cabin air inlet temperature at lower compressor speeds, the evaporator air mass flow (Fig. 8f) is kept relatively low. For the case of PMV minimisation (blue line) the thermal comfort (Fig. 8e, solid lines) is achieved at the fastest rate but this results in the lowest efficiency. The results of combined cost function case (green lines) fall between previous two extreme cases. Figure 8b, d show that the performance of superheat temperature control is satisfactory, and it could be further improved by applying more complex cross-coupling control.
Comparison of DP-optimised responses in Fig. 7 and the control system results in Fig. 8 indicates qualitative differences between the two solutions. This is especially pronounced in the COP maximisation case (red lines), in which the DP keeps the cooling capacity low to slowly bring the cabin air temperature to target value, whereas the superimposed controller commands relatively high cooling capacity and brings the cabin temperature to the target value faster. This is explained by the fixed parameters of superimposed cabin air temperature controller, i.e. same cabin air temperature (and cooling capacity demand) response for all allocation weighting coefficient settings. In order to bring the control system performance closer to DP results, the superimposed cabin air temperature controller bandwidth should be tuned in correlation with allocation cost function setting, i.e. the superimposed controller should be made slower for the COP maximisation case. Figure 9 shows the comparative control system responses for three values of superimposed controller integral gains K I and the combined-criteria cost function (K PMV = 0.5, K COP = 1). The cabin air temperature response (Fig. 9a, dashed lines) is faster for higher integral gain K I , which is connected with higher cooling capacity demand effort in that case (Fig. 9b). This results in faster thermal comfort achievement but deteriorates efficiency (see Fig. 9c and also Fig. 9d). The increased cooling capacity demand is optimally satisfied with lower evaporator outlet air mass flow (Fig. 9e), which enables lower evaporator air outlet temperature (Fig. 9a, solid lines).
For the case of lowest K I (red line), the cabin air temperature response is slowest (Fig. 9a), and the cooling capacity demand effort is weakest (Fig. 9b). The weak cooling capacity demand is met by means of high evaporator air outlet temperature (Fig. 9a) and high blower fan air mass flow (Fig. 9e), which results in higher efficiency, but deteriorates the thermal comfort (Fig. 9c). Performance of moderate/ nominal superimposed controller tuning (green line) falls between previous two tunings in terms of cost function indices (Fig. 9c) and control inputs (Fig. 9a-e). The overall control system behaviour is closer to the DP results (cf. Figs. 7,9) than the previously considered case illustrated in Fig. 8. Table 3 contains performance indices related to simulation results of cooldown scenario for various combinations of control allocation weighting coefficients and cabin air temperature controller tunings. The considered indices include the energy consumption E com = ∫P com dt and the following two thermal comfort criteria: (1) the cumulative absolute value of PMV, i.e. C 1 = ∫|PMV|dt, and (2) the cumulative absolute value of those PMV values greater than a threshold (set to 0.22, herein), i.e. C 2 = ∫|PMV|dt if |PMV| > 0.22. The criterion C 2 is  b Comfort-oriented: K PMV = 1, K COP = 0; efficiency-oriented: K PMV = 0, K COP = 1 Combined cost: K PMV = 0.5, K COP = 1 considered to be more suitable for transient evaluation as it allows the PMV to slightly deviate from the ideal value of zero in steady-state conditions.
The best overall performance in terms of efficiency is achieved in the case of slow superimposed cabin air temperature controller tuning and efficiency-oriented allocation cost. In this case the energy consumption is reduced by 25% compared to the selected nominal setting (blue line in Table 3). However, the best-efficiency setting results in the worst thermal comfort: the indices C 1 and C 2 are higher by 81% and 130% compared to the nominal setting. Keeping the slow superimposed controller tuning and changing the allocation cost towards the comfort-oriented case reduces the thermal comfort indices, but it in turn significantly increases the energy consumption. The best performance in terms of thermal comfort is achieved in the case of fast superimposed controller tuning and comfort-oriented allocation cost, where the comfort index C 2 is 33% lower than in the nominal case, with only 13% more energy consumption. In the case of moderate controller tuning, the combined allocation cost appears to be a reasonable choice, as its performance falls approximately in the middle of those obtained for the two extreme cases.

Conclusion
A hierarchical thermal comfort control strategy, including an optimised HVAC control input allocation algorithm, has been developed and compared with globally optimal dynamic programming-based control trajectory optimisation results. The control strategy verification results are given for a cool-down scenario, and they show that it is possible to tune the strategy for a favourable trade-off between thermal comfort and HVAC efficiency, and achieve the behaviour that is comparable to the globally optimal solution. More specifically, along with tuning the allocation cost function weighting coefficients, the superimposed cabin air temperature controller bandwidth should be adjusted. For reducing the energy consumption, the superimposed controller should be made slower and the allocation should primarily be focused on efficiency, while for gaining the thermal comfort the superimposed controller should be tuned for fast response with the allocation adjusted for the combined comfort/efficiency setting.
Implementing the proposed control strategy in a fully electric vehicle would allow the designer to predetermine the trade-off between thermal comfort and efficiency depending on the driving conditions, or to allow the driver to adjust the trade-off based on his/her preference. A higher driving range would be achieved with slower superimposed controller and efficiency-oriented allocation at the expense of reduced thermal comfort. Therefore, such tuning may be reserved for range-critical scenarios, i.e. when the battery charge is not anticipated to be sufficient for given driving cycle. In extremely hot conditions or when pre-cooling the vehicle while being charged, the driver would opt for thermal comfort-oriented setting based on fast tuning of superimposed controller. Finally, a balanced efficiency/comfort tuning would be appropriate in most of the driving scenarios and a variety of ambient conditions. The ongoing work includes extending the control trajectory optimisation tool with (1) additional state-variables to take into account slower HVAC dynamics that that may be characteristic for HVAC system architectures used in electric vehicles, and (2) additional control variables that may improve the system performance. Similarly, optimising the allocation maps taking into account additional HVAC control inputs and corresponding tuning of the superimposed controller deserves further research. Finally, a thorough multi-objective parameter optimisation of low-level controllers taking into account the coupled HVAC dynamics should be considered as a final refinement of the overall control strategy.