1 Introduction

Realistic mathematical driver steering control models are useful tools for developing driver assistance systems such as stability control and lane-keeping assistance. Driver models allow the vehicle design space to be explored safely and at low cost.

State of the art driver models typically use optimisation-based predictive control algorithms. These require a prediction model, which in a human context is known as the internal model, and represents the human understanding of the vehicle dynamics. The internal model is often assumed to be an accurate deterministic representation of the vehicle dynamics. However, it seems likely that in many cases a driver has an inaccurate and uncertain understanding of the vehicle dynamics, particularly in unfamiliar nonlinear regimes of operation.

Recently, a new approach has been taken by the authors, using Model Predictive Control (MPC) with a Gaussian Process (GP) providing a data-driven internal prediction model of the vehicle dynamics [4]. The motivation for using a GP is that it represents more closely the experience-based learning process and prediction uncertainty of the human driver. This approach was inspired by the ‘predictive processing’ hypothesis which has gained traction in several academic communities [2].

An example of a partially learnt GP model of nonlinear lateral-yaw vehicle dynamics is shown in Fig. 1, predicting the mean and variance of the vehicle lateral velocity state at the next time step given current vehicle states (here all zero) and steering angle input (on x-axis). The learning behaviour of the driver model is apparent in Fig. 2, which compares the RMS path error with RMS handwheel velocity (akin to control effort) over the course of twelve successive avoidance manoeuvres (elk or moose test).
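The behaviour shown in Fig. 1 can be illustrated with a minimal GP regression sketch. The kernel, training data and vehicle response below are illustrative stand-ins (not the model or data of [4]): next-step lateral velocity is predicted from steering angle alone, with the other states held at zero, yielding a mean prediction and a 95% confidence interval.

```python
import numpy as np

# Minimal GP regression sketch (hypothetical data and kernel parameters):
# predict mean and variance of the next-step lateral velocity from the
# current steering angle, with all other states held at zero, as in Fig. 1.

def rbf(a, b, length=0.1, var=1.0):
    # Squared-exponential kernel on steering-angle inputs
    d = a[:, None] - b[None, :]
    return var * np.exp(-0.5 * (d / length) ** 2)

rng = np.random.default_rng(0)
delta_train = rng.uniform(-0.3, 0.3, 12)        # observed steering angles (rad)
v_next = 2.0 * np.tanh(5.0 * delta_train)       # stand-in nonlinear lateral response
v_next += 0.05 * rng.standard_normal(12)        # additive memory data noise

noise_var = 0.05 ** 2
K = rbf(delta_train, delta_train) + noise_var * np.eye(12)
delta_test = np.linspace(-0.4, 0.4, 9)
Ks = rbf(delta_test, delta_train)

alpha = np.linalg.solve(K, v_next)
mean = Ks @ alpha                                # GP mean prediction
cov = rbf(delta_test, delta_test) - Ks @ np.linalg.solve(K, Ks.T)
std = np.sqrt(np.clip(np.diag(cov), 0.0, None))  # predictive uncertainty

# 95% confidence interval as shaded in Fig. 1
lower, upper = mean - 1.96 * std, mean + 1.96 * std
```

The predictive variance grows away from the training data, which is what the variance penalty in the MPC cost function later exploits.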

Fig. 1.

Part-learnt GP model of nonlinear lateral-yaw vehicle dynamics [4]. Solid blue line is the mean prediction, light blue shaded area is the 95% confidence interval.

Fig. 2.

RMS path error against RMS handwheel velocity for a simulated driver performing twelve successive avoidance manoeuvres. The smallest circle is the first manoeuvre and the largest is the twelfth. The dashed line is a diagonal regression [4].

It is reported in [4] that the simulated learning behaviour seen in Fig. 2 is similar to that measured in some of the human test subjects that performed the manoeuvres in a real vehicle on a test track. The behaviour can be described as ‘cautious’, where control activity increases and path error decreases with successive manoeuvres. Some other drivers in the experiment displayed ‘adventurous’ behaviour, characterised by control activity and path error decreasing with successive manoeuvres.

The MPC+GP driver model architecture described in [4] is shown in Fig. 3, with additional elements introduced in the present work shown in orange and grey to be described in the next sections. The model includes additive memory data noise but no sensory measurement noise. The GP internal model generates predictions. Control actions are optimised to minimise a cost function that penalises predicted lateral and yaw deviations from a previewed target path, together with other penalties on GP internal model prediction variance and the first and second derivatives of steering angle with respect to time. Cautious and adventurous steering behaviours are generated by adjusting the penalty on GP internal model prediction variance [4].
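The cost function described above can be sketched as follows. The weights, horizon length and time step are illustrative assumptions, not the values used in [4]; the sketch only shows how the penalty terms combine.

```python
import numpy as np

# Hedged sketch of the MPC cost described above (weights are illustrative):
# penalties on previewed lateral and yaw deviations from the target path,
# on GP internal model prediction variance, and on the first and second
# time derivatives of the steering angle.

def mpc_cost(lat_err, yaw_err, gp_var, delta, dt=0.05,
             q_lat=1.0, q_yaw=0.5, q_var=0.2, q_d1=0.1, q_d2=0.05):
    d1 = np.diff(delta) / dt             # handwheel velocity
    d2 = np.diff(delta, n=2) / dt ** 2   # handwheel acceleration
    return (q_lat * np.sum(lat_err ** 2)
            + q_yaw * np.sum(yaw_err ** 2)
            + q_var * np.sum(gp_var)     # penalise uncertain predictions
            + q_d1 * np.sum(d1 ** 2)
            + q_d2 * np.sum(d2 ** 2))
```

Raising `q_var` makes the optimiser avoid regions where the GP internal model is uncertain, generating the 'cautious' behaviour; lowering it permits 'adventurous' steering.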

The aim of the work in this paper is to improve the MPC+GP driver steering model by adding sensory noise and state estimation. Based on experimental observations in the field of computational neuroscience, it is believed that human sensory measurement noise is signal-dependent, where the noise magnitude is proportional to the signal being measured [7]. In human control, there is evidence that state estimation is performed by the brain in a probabilistically optimal, Bayesian manner [3, 5], using internal model predictions to improve the accuracy of the state determination and mitigate the effects of sensory noise. In the present paper the existing MPC+GP driver model is extended to include realistic sensorimotor noise and a state estimator that uses the GP for the state prediction step. The performance of the model is then investigated.

Fig. 3.

Driver model architecture from [4] with new elements shown in orange and elements used solely for analysis shown in grey.

2 Noise Model and State Estimation

Signal-dependent noise sources are added to represent measurement noise and control (or process) noise, as shown in orange in Fig. 3. The noise is modelled as shown in Fig. 4, with constant signal to noise ratio (SNR) and a noise floor to represent sensory perception threshold.

With reference to the variables in Fig. 3, measurement variance is diagonal, with \({(\Sigma _m)}_{k,k} = \text {max}({(\boldsymbol{z}_{i,j})}_{k}^{2} {(\boldsymbol{s}_m)}_{k}^{-1}, \, {(\boldsymbol{n}_m)}_{k})\), and control variance, \(\boldsymbol{\sigma }_c = \text {max}({\delta }_{i}^{2} {s}_c^{-1}, \, n_c)\). Here \(\boldsymbol{z}_{i,j}\) is the state vector at the \(i^{\text {th}}\) timestep of the \(j^{\text {th}}\) manoeuvre, \(\boldsymbol{s}_m\) is the measurement SNR for each state dimension, \(\boldsymbol{n}_m\) is the measurement noise floor, \(\delta _{i}\) is the planned control action for the \(i^{\text {th}}\) timestep, and \(s_c\) and \(n_c\) are the control SNR and noise floor.
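A minimal sketch of this signal-dependent noise model: the per-channel variance is the signal power divided by the SNR, clipped below by a noise floor representing the sensory perception threshold. The SNR of 1.6 is taken from the text; the state values and floor are illustrative.

```python
import numpy as np

# Sketch of the signal-dependent measurement noise of Fig. 4:
# variance = max(signal_power / SNR, noise_floor).

def measurement_variance(z, snr=1.6, floor=1e-3):
    # Per-channel variance at constant SNR with a perception-threshold floor
    return np.maximum(z ** 2 / snr, floor)

z = np.array([0.0, 0.5, 2.0])    # illustrative state channel values
var = measurement_variance(z)
noisy = z + np.sqrt(var) * np.random.default_rng(1).standard_normal(3)
```

Small signals are dominated by the floor (perception threshold), while large signals carry noise proportional to their power, keeping the SNR constant.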

Fig. 4.

Measurement variance as a function of signal power for the approximation used [6].

Physiologically plausible values for SNRs and noise floors can be ascertained from the literature on sensory thresholds and just noticeable differences (JNDs), for example [3, 6, 7]. SNRs are typically in the region of unity; a value of 1.6 is used for the results in the present paper.

The State Estimation block (Fig. 3) is implemented as a Kalman Filter. The GP Internal Model predicts the next state of the vehicle given the current state estimate and control input. The Kalman Filter then combines this prediction, the noisy measurement and the variances of each to update the state estimate probabilistically. This state estimate is used as the believed current state from which future states are predicted when optimising the control action. The equations for the Kalman Filter prediction and update are implemented as follows, based on [1]:

$$\begin{aligned} \text {Prediction Step:} \quad & \hat{\boldsymbol{x}}_{k|k-1} = \text {GP Prediction}\, (\hat{\boldsymbol{x}}_{k-1|k-1} , u_{k-1}, P_{k-1|k-1}) \\ & P_{k|k-1} = \text {GP Variance}\, (\hat{\boldsymbol{x}}_{k-1|k-1} , u_{k-1}, P_{k-1|k-1}) \\ \text {Update Step:} \quad & K_k = P_{k|k-1} H^T (H P_{k|k-1} H^T + R_k)^{-1} \\ & \hat{\boldsymbol{x}}_{k|k} = \hat{\boldsymbol{x}}_{k|k-1} + K_k (\boldsymbol{y}_k - H \hat{\boldsymbol{x}}_{k|k-1}) \\ & P_{k|k} = (I - K_k H) P_{k|k-1} \end{aligned}$$

where \(\hat{\boldsymbol{x}}_{k|k-1}\) and \(P_{k|k-1}\) are the GP model predicted current state mean and covariance, \(\hat{\boldsymbol{x}}_{k-1|k-1}\) and \(P_{k-1|k-1}\) are the previous state estimate mean and covariance, and \(u_{k-1}\) is the previous control input. \(H\) is the observation matrix, \(R_k\) is the measurement noise covariance and \(\boldsymbol{y}_k\) is the noisy measurement.
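The update step above can be illustrated for a single scalar state with full observation (\(H = 1\)). Here `gp_mean` and `gp_var` stand in for the GP internal model's one-step prediction; the numerical values are purely illustrative.

```python
# Scalar Kalman update sketch (H = 1): the gain weights the measurement
# by the relative confidence of the GP prediction and the sensory data.

def kf_update(gp_mean, gp_var, y, meas_var):
    K = gp_var / (gp_var + meas_var)        # Kalman gain
    x_post = gp_mean + K * (y - gp_mean)    # corrected state estimate
    P_post = (1.0 - K) * gp_var             # reduced posterior variance
    return x_post, P_post

# A confident GP prediction (small variance) keeps the estimate close to
# the prediction despite a noisy measurement:
x, P = kf_update(gp_mean=1.0, gp_var=0.1, y=2.0, meas_var=0.9)
# x = 1.1, P = 0.09
```

As the GP learns and its predictive variance shrinks, the gain falls and the estimator leans increasingly on the internal model rather than the noisy senses.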

The Kalman filtered state estimates are not used in the memory dataset for training the GP internal model. This is because the Kalman filtering biases the data towards the errors in the learnt model, so using the filtered data significantly reduces the learning rate. Instead, the GP internal model is trained on the raw measured data, and the Kalman Filter is used only for state estimation during the control optimisation process. As the GP internal model improves and the state estimate becomes more accurate, better predictions of future states can be made, leading to better control performance.

3 Simulation Results and Discussion

In this section simulations were performed with the control noise in Fig. 3 set to zero. The measurement noise was set to give an SNR of 1.6. Simulations were initially performed with the Deterministic Internal Model block shown in grey in Fig. 3 switched in and data noise set to zero. This enabled the effect of GP learning on state estimation to be investigated separately from the GP’s effect on control optimisation. The GP was initialised with fifty data points randomly distributed within the vehicle’s operating envelope. The driver steering model was then run to perform twelve successive elk avoidance manoeuvres, with the memory dataset and the GP updated after each of the twelve manoeuvres. For the purpose of determining statistically reliable results, the sequence of twelve manoeuvres was repeated one hundred times, each time with different uncorrelated noise signals.

The performance of the state estimator in each of the twelve manoeuvres was quantified by calculating the ratio of the variance of the true state to the variance of the state estimation error. This ratio is also denoted an SNR. However, to avoid the calculated SNR being dependent on the performance of the controller in each manoeuvre, the vehicle states during each manoeuvre were not used in calculating the estimator's SNR. Instead, a set of 2000 independent randomly generated starting states with measurement noise of SNR = 1.6 was used to evaluate the performance of the state estimator, using the GP internal model from each of the twelve manoeuvres. The SNR of the estimator was also calculated for multiple timesteps beyond the starting state.
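The estimator SNR metric can be sketched as follows on synthetic data (these are not the results of [4]): the prediction error variance is an assumed stand-in for a partly learnt GP, and the estimate is the variance-weighted fusion of prediction and measurement.

```python
import numpy as np

# Sketch of the estimator SNR metric: variance of the true state divided by
# variance of the estimation error, over a batch of random starting states.

rng = np.random.default_rng(2)
x_true = rng.standard_normal(2000)             # 2000 random starting states
meas_var = 1.0 / 1.6                           # measurement SNR = 1.6
pred_var = 0.09                                # assumed GP prediction error variance

meas = x_true + np.sqrt(meas_var) * rng.standard_normal(2000)
pred = x_true + np.sqrt(pred_var) * rng.standard_normal(2000)

K = pred_var / (pred_var + meas_var)           # scalar Kalman gain
est = pred + K * (meas - pred)                 # fused state estimate

snr_meas = np.var(x_true) / np.var(meas - x_true)   # close to 1.6
snr_est = np.var(x_true) / np.var(est - x_true)     # exceeds the raw measurement SNR
```

The fused estimate's SNR exceeds the raw measurement SNR of 1.6, mirroring the improvement reported for the full estimator in Fig. 5.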

It can be seen in Fig. 5 that the SNR of the estimate of lateral velocity improves over the course of the twelve manoeuvres as the GP internal model learns the vehicle dynamics. The SNR reduces as more timesteps are advanced, converging to an SNR of around 3 for the first manoeuvre and 10 for the final manoeuvre, which is significantly improved from the measurement SNR of 1.6 and confirms the effectiveness of the estimator.

Fig. 5.

Plot of the median SNR of state estimates to true states against manoeuvre number. The SNR is calculated over one hundred repeats of twelve elk test manoeuvres on a test set of 2000 randomly generated starting states. Different lines are plotted for additional timesteps advanced from the starting states.

Fig. 6.

Conflict plot showing RMS path error against RMS handwheel velocity for the MPC+GP driver model with sensory noise and state estimation.

Another simulation was run with the prediction model switch (Fig. 3) set to use the GP internal model for control optimisation as well as for state estimation. Figure 6 shows the RMS path error against RMS handwheel velocity over twelve elk avoidance manoeuvres. The control performance and learning behaviour of the new model are similar to that of the earlier model seen in Fig. 2, and therefore similar to the measured behaviour of human test subjects reported in [4].

The work reported in this paper contributes increased realism of driver models for use in virtual vehicle development. Work is planned to add visual and vestibular sensory dynamics to the driver steering model, and to validate further the model against experiments with human test subjects.

4 Conclusion

  • There is a need for realistic mathematical driver steering control models that represent human learning behaviour.

  • Recent work by the authors has combined MPC with a GP internal model.

  • The present work adds signal-dependent sensorimotor noise and state estimation to the MPC+GP driver model.

  • Simulation results show that the state estimator with a GP internal model exhibits the expected improvement in estimation accuracy with successive manoeuvres.

  • The results also show that control performance and learning behaviour are similar to measured human behaviour reported recently.

  • Further work is planned to extend the model to include visual and vestibular sensory dynamics, and to perform more experimental validation.