# Encyclopedia of Database Systems

Living Edition
| Editors: Ling Liu, M. Tamer Özsu

# Linear Regression

Living reference work entry
DOI: https://doi.org/10.1007/978-1-4899-7993-3_542-2

## Definition

Linear regression is a classical statistical tool that models relationship between a dependent variable or regress and Y, explanatory variable or regressor X = {x 1,  … , x I } and a random term ε by fitting a linear function,
$$Y={\alpha}_{+}{\alpha}_1{x}_1+{\alpha}_2{x}_2+\dots +{\alpha}_I{x}_I+\varepsilon$$

where α 0 is the constant term, the α i s are the respective parameters of independent variable, and I is the number of parameters to be estimated in the linear regression.

## Key Points

Linear regression analysis is an important component for several tasks such as clustering, time series analysis, and information retrieval. For instance, it is a very powerful forecasting method for time series data. It helps identify the long-term movement of a certain data set based on given information and explore the dependent variable as function of time.

To solve the linear regression problem, there are various kinds of approaches available to determine suitable regression coefficients [1 , 2]. They include: (i) least-squares analysis, (ii) assessing the least-squares model, (iii) modifications of least-squares analysis, and (iiii) polynomial fitting. The primary objective is to select a straight line which minimizes the error between the real data and the line estimated to provide a best fit.

## References

1. 1.
Draper, N.R., Smith, H. Applied regression analysis Wiley Series in Probability and Statistics; 1998.Google Scholar
2. 2.
Gross J. Linear regression. Berlin: Springer; 2003.Google Scholar

## Copyright information

© Springer Science+Business Media LLC 2016

## Authors and Affiliations

1. 1.Singapore eManagement UniversitySingaporeSingapore

## Section editors and affiliations

• Xiaofang Zhou
• 1
1. 1.School of Inf. Tech. & Elec. Eng.Univ. of QueenslandBrisbaneAustralia 