Abstract
The regression-line for a set of data-points \(p_i = (x_i, y_i), 1 \le i \le N\) and \(N > 2\), lacks the rotation-property in the sense that if each \(p_i\) is rotated by an angle \(\theta \) around the origin then the regression-line does not rotate by the same angle \(\theta \) except for the special case when all \(p_i\)’s are collinear. This makes the regression-line unsuitable as a linear model of a set of data points for applications in data mining and machine learning. We present an alternative linear model that has the rotation property. In many ways, the new model is also more appealing intuitively as we show with examples. The computation of the new linear model takes the same O(N) time as that for the regression-line.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W.H.Freeman & Co. (1979)
Tan, P.-N., Steinbach, M., Karpatne, A., Kumar, V.: Introduction to Data Mining. Addison-Wesley, Boston (2018)
Abott, D.: Applied Predictive Analysis. Wiley, Hoboken (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kundu, S. (2022). A Better Linear Model Than Regression-Line for Data-Mining Applications. In: Tan, Y., Shi, Y. (eds) Data Mining and Big Data. DMBD 2022. Communications in Computer and Information Science, vol 1745. Springer, Singapore. https://doi.org/10.1007/978-981-19-8991-9_6
Download citation
DOI: https://doi.org/10.1007/978-981-19-8991-9_6
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-8990-2
Online ISBN: 978-981-19-8991-9
eBook Packages: Computer ScienceComputer Science (R0)