Analysis of Time Series Data

Agami Reddy, T.

doi:10.1007/978-1-4419-9613-8_9

T. Agami Reddy²

3694 Accesses
5 Citations

Abstract

This chapter introduces several methods to analyze time series data in the time domain; an area rich in theoretical development and in practical applications. What constitutes time series data and some of the common trends encountered are first presented. This is followed by a description of three types of time-domain modeling and forecasting models. The first general class of models involves moving average smoothing techniques which are methods for removing rapid fluctuations in time series so that the general secular trend can be seen. The second class of models is similar to classical OLS regression models, but here the time variable appears as a regressor, thereby allowing the trend and the seasonal behavior in the data series to be captured by the model. Its strength lies in its ability to model the deterministic or structural trend of the data in a relatively simple manner. The third class of models called ARIMA models allows separating and modeling the systematic component of the model residuals from the purely random white noise element, thereby enhancing the prediction accuracy of the overall model. ARMAX models, which are extensions of the univariate ARIMA models to multivariate problems, and their ability to model dynamic systems are also discussed with illustrative examples. Finally, an overview is provided of a practical application involving control chart techniques which are extensively used for process and condition monitoring of engineered systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
A stochastic process is one which is described by a set of time indexed observations subject to probabilistic laws.
2.
Stationarity in a time series strictly requires that all statistical descriptors, such as the mean, variance, correlation coefficients of the data series be invariant in time. Due to the simplified treatment in this text, the discussion is geared primarily towards stabilizing the mean, i.e., removing the long-term and seasonal trends.
3.
See for example McClave and Benson (1988) or Montgomery and Johnson (1976).
4.
Stationarity of a stochastic process can be interpreted qualitatively as a process which is in statistical equilibrium.
5.
Strictly, this formulation should be called discrete transfer function or z-transform since it uses discrete time intervals (of one hour).
6.
Note the distinction between the number of samples (k) and the sample size (n).

References

Andrews, J. and N. Jelley, 2007. Energy Science: Principles, Technologies and Impacts, Oxford University Press, Oxford.
Google Scholar
Bloomfield, P., 1976. Fourier Analysis of Time Series: An Introduction . John Wiley, New York.
Google Scholar
Box, G.E.P. and G.M. Jenkins, 1976. Time Series Analysis: Forecasting and Control, Holden Day.
Google Scholar
Box, G.E.P and A. Luceno, 1997. A., Statistical Control by Monitoring and Feedback Adjustment, John Wiley and Sons, New York.
Google Scholar
Chatfield, C., 1989. The Analysis of Time Series: An Introduction, 4th Ed., Chapman and Hall, London.
Google Scholar
Devore J., and N. Farnum, 2005. Applied Statistics for Engineers and Scientists, 2nd Ed., Thomson Brooks/Cole, Australia.
Google Scholar
Dhar, A., T.A. Reddy and D.E. Claridge, 1999. Generalization of the Fourier series approach to model hourly energy use in commercial buildings, Journal of Solar Energy Engineering, vol. 121, p. 54, Transactions of the American Society of Mechanical Engineers, New York.
Google Scholar
Himmelblau, D.M. 1978. Fault Detection and Diagnosis in Chemical and Petrochemical Processes, Chemical Engineering Monograph 8, Elsevier.
Google Scholar
Kreider, J.K., P.S. Curtiss and A. Rabl, 2009. Heating and Cooling of Buildings, 2nd Ed., CRC Press, Boca Raton, Fl.
Google Scholar
McCain L.J. and R. McCleary, 1979. The statistical analysis of the simple interrupted time series quasi-experiment, in Quasi-experimentation: Design and analysis issues for field settings, Houghton Mifflin Company, Boston, MA.
Google Scholar
McClave, J.T. and P.G. Benson, 1988. Statistics for Business and Economics, 4th Ed., Dellen and Macmillan, London.
Google Scholar
Montgomery, D.C. and L.A. Johnson, 1976. Forecasting and Time Series Analysis, McGraw-Hill, New York, NY.
Google Scholar
Pindyck, R.S. and D.L. Rubinfeld, 1981. Econometric Models and Economic Forecasts, 2nd Edition, McGraw-Hill, New York, NY.
Google Scholar
Ruch, D.K., J.K. Kissock, and T.A. Reddy, 1999. “Prediction uncertainty of linear building energy use models with autocorrelated residuals, ASME Journal of Solar Energy Engineering, vol.121, pp.63-68,February, Transactions of the American Society of Mechanical Engineers, New York.
Google Scholar
Walpole, R.E., R.H. Myers, S.L. Myers, and K. Ye, 2007. Probability and Statistics for Engineers and Scientists, 8th Ed., Pearson Prentice Hall, Upper Saddle River, NJ.
Google Scholar
Wei, W.W.S., 1990. Time Series Analysis: Univariate and Multivariate Methods, Addison-Wesley Publishing Company, Redwood City, CA.
Google Scholar

Download references

Author information

Authors and Affiliations

The Design School and School of Sustainability, Arizona State University, PO Box 871605, 85287-1605, Tempe, AZ, USA
T. Agami Reddy

Authors

T. Agami Reddy
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to T. Agami Reddy .

Problems

Pr. 9.1

Consider the time series data given in Table 9.1 which was used to illustrate various concepts throughout this chapter. Example 9.4.2 revealed that the model was still not satisfactory since the residuals still has a distinct trend. You will investigate alternative models (such as a second order linear or an exponential) in an effort to improve the residual behavior

Perform the same types of analyses as illustrated in the text. This involves determining whether the model is more accurate when fit to the first 48 data points, whether the residuals show less pronounced patterns, and whether the forecasts of the four quarters for 1986 have become more accurate. Document your findings in a succinct manner along with your conclusions.

Pr. 9.2

Use the following time series models for forecasting purposes:

(a)
\({Z_t} = 20 + {\varepsilon _t} + 0.45{\varepsilon _{t - 1}} - 0.35{\varepsilon _{t - 2}}\). Given the latest four observations:

{17.50, 21.36, 18.24, 16.91}, compute forecasts for the next two periods
(b)
\({Z_t} = 15 + 0.86{Z_{t - 1}} - 0.32{Z_{t - 2}} + {\varepsilon _t}\). Given the latest two values of Z {32, 30), determine the next four forecasts.

Pr. 9.3.

Section 9.5.3 describes the manner in which various types of ARMA series can be synthetically generated as shown in Figs. 9.21, 9.22 and 9.23, and how one could verify different recommendations on model identification. These are useful aids for acquiring insights and confidence in the use of ARMA. You are asked to synthetically generate 50 data points using the following models and then use these data sequences to re-identify the models (because of the addition of random noise, there will be some differences in model parameters identified);

(a)
\({Z_t} = 5 + {\varepsilon _t} + 0.7{\varepsilon _{t - 1}}\;{\rm{ with }}\;N(0,0.5)\)
(b)
\({Z_t} = 5 + {\varepsilon _t} + 0.7{\varepsilon _{t - 1}}\;{\rm{ with }}\;N(0,1)\)
(c)
\({Z_t} = 20 + 0.6{Z_{t - 1}} + {\varepsilon _t}\;{\rm{ with }}\;N(0,1)\)
(d)
\({Z_t} = 20 + 0.8{Z_{t - 1}} - 0.2{Z_{t - 1}} + {\varepsilon _t}\;{\rm{ with }}\;N(0,1)\)
(e)
\({Z_t} = 20 + 0.8{Z_{t - 1}} + {\varepsilon _t} + 0.7{\varepsilon _{t - 1}}\;{\rm{ with }}\;N(0,1)\)

Pr. 9.4.

Time series analysis of sun spot frequency per year from 1770–1869

Data assembled in Table B.5 (in Appendix B) represents the so-called Wolf number of sunspots per year (n) over many years (from Montgomery and Johnson 1976 by permission of McGraw-Hill).

(a)
First plot the data and visually note underlying patterns
(b)
You will develop at least 2 alternative models using data from years 1770–1859. The models should include different trend and/or seasonal OLS models, as well as sub-classes of the ARIMA models (where the trends have been removed by OLS models or by differencing). Note that you will have to compute the ACF and PACF for model identification purposes
(c)
Evaluate these models using the expost approach where the data for years 1860–1869 are assumed to be known with certainty (as done in Example 9.6.1).

Pr. 9.5.

Time series of yearly atmospheric CO ₂ concentrations from 1979–2005

Table B.6 (refer to Appendix B) assembles data of yearly carbon-dioxide (CO₂) concentrations (in ppm) in the atmosphere and the temperature difference with respect to a base year (in °C) (from Andrews and Jelley 2007 by permission of Oxford University Press).

(a)
Plot the data both as time series as well as scatter plots and look for underlying trends
(b)
Using data from years 1979–1999, develop at least two models for the Temp. difference variable. These could be trend and /or seasonal or ARIMA type models
(c)
Repeat step (b) but for the CO₂ concentration variable
(d)
Using the same data from 1979–1999, develop a model for CO₂ where Temp.diff is one of the regressor variables
(e)
Evaluate the models developed in (c) and (d) using data from 2000–2005 assumed known (this is the expost conditional case)
(f)
Compare the above results for the exante unconditional situation. In this case, future values of temperature difference are not know, and so model developed in step (b) will be used to first predict this variable, which will then be used as an input to the model developed in step (d)
(g)
Using the final model, forecast the CO₂ concentration for 2006 along with 95% CL.

Pr. 9.6

Time series of monthly atmospheric CO ₂ concentrations from 2002–2006

Figure 9.33 represents global CO₂ levels but at monthly levels. Clearly there is both a long-term trend and a cyclic seasonal variation. The corresponding data is shown in Table B.7 (and can be found in Appendix B). You will use the first four years of data (2002–2005) to identify different moving average smoothing techniques, trend+seasonal OLS models, as well as ARIMA models as illustrated through several examples in the text. Subsequently, evaluate these models in terms of how well they predict the monthly values of the last year i.e., year 2006).

Pr. 9.7

Transfer function analysis of unsteady state heat transfer through a wall

You will use the conduction transfer function coefficients given in Example 9.6.1 to calculate the hourly heat gains (Q_cond) through the wall for a constant room temperature of 24°C and the hourly solar-air temperatures for a day given in Table 9.13 (adapted from Kreider et al. 2009). You will assume guess values to start the calculation, and repeat the diurnal calculation over as many days as needed to achieve convergence assuming the same T_solair values for successive days. This problem is conveniently solved on a spreadsheet.

Table 9.13 Data table for Problem 9.7

Full size table

Pr. 9.8

Transfer function analysis using simulated hourly loads in a commercial building

The hourly loads (total electrical, thermal cooling and thermal heating) for a large hotel in Chicago, IL have been generated for three days in August using a detailed building energy simulation program. The data shown in Table B.8 (given in Appendix B) consists of outdoor dry-bulb (T_db) and wet-bulb (T_wb) temperatures in °F as well as the internal electric loads of the building Q_int (these are the three regressor variables). The response variables are the total building electric power use (kWh) and the cooling and heating thermal loads (Btu/h).

(a)
Plot the various variables as time series plots and note underlying patterns.
(b)
Use OLS to identify a trend and seasonal model using indicator variables but with no lagged terms for Total Building electric power.
(c)
For the same response variable, evaluate whether the seasonal differencing approach, i.e., ∇₂₄Y_t = Y_t − Y_t-24 is as good as the trend and seasonal model in detrending the data series.
(d)
Identify ARMAX models for all three response variables separately by using two days for model identification and the last day for model evaluation
(e)
Report all pertinent statistics and compare the results of different models. Provide reasons as to why the particular model was selected as the best one for each of the three response variables.

Pr. 9.9

Example 9.7.1 illustrated the use of Shewhart charts for variables.

(a)
Reproduce the analysis results in order to gain confidence
(b)
Repeat the analysis but using the Cusum and EWMA (with λ = 0.4) and compare results.

Pr. 9.10

Example 9.7.1 illustrated the use of Shewhart charts for attributes variables.

(a)
Reproduce the analysis results in order to gain confidence
(b)
Repeat the analysis but using the Cusum and EWMA (with λ = 0.4) and compare results.

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Agami Reddy, T. (2011). Analysis of Time Series Data. In: Applied Data Analysis and Modeling for Energy Engineers and Scientists. Springer, Boston, MA. https://doi.org/10.1007/978-1-4419-9613-8_9

Download citation

DOI: https://doi.org/10.1007/978-1-4419-9613-8_9
Published: 29 July 2011
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-9612-1
Online ISBN: 978-1-4419-9613-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Analysis of Time Series Data

Abstract

Access this chapter

Notes

References

Author information

Authors and Affiliations

Corresponding author

Problems

Problems

Pr. 9.1

Pr. 9.2

Pr. 9.3.

Pr. 9.4.

Pr. 9.5.

Pr. 9.6

Pr. 9.7

Pr. 9.8

Pr. 9.9

Pr. 9.10

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation