Relationship Between the Popularity of Key Words in the Google Browser and the Evolution of Worldwide Financial Indices

Ortells, R.; Egozcue, J. J.; Ortego, M. I.; Garola, A.

doi:10.1007/978-3-319-44811-4_10

Relationship Between the Popularity of Key Words in the Google Browser and the Evolution of Worldwide Financial Indices

R. Ortells³,
J. J. Egozcue⁴,
M. I. Ortego⁵ &
…
A. Garola⁶

Conference paper
First Online: 20 November 2016

1175 Accesses
8 Citations

Part of the book series: Springer Proceedings in Mathematics & Statistics ((PROMS,volume 187))

Abstract

The purpose of this contribution is to evaluate whether there is enough statistical basis to establish a relationship between the popularity of certain terms in the Google browser and the evolution of several worldwide economic indices the subsequent week. A linear model trying to predict the evolution of 19 financial indices from all over the world with the information of how many times a selected group of 200 key words are looked up online the previous week is proposed. The linear model that is proposed takes a compositional approach due to two reasons. First, the information contained in the values of the financial indices has a compositional nature. The strongest proof supporting this idea is that in case all values for the indices on a certain week were multiplied by a factor, the information would remain unchanged. In fact, the value for a certain index is irrelevant by itself, since it is its evolution with respect to the rest of indices that indicates whether it is performing well. Therefore, this idea suggests that the numerical values of the 19 indices for a certain week can be understood as a vector of the simplex and be analyzed accordingly. Second, the explanatory variable has to be understood as a vector of the simplex as well, for a similar reason as before. For instance, let us imagine that the number of times the words are looked up online in a certain week was multiplied by a factor. Indeed, the information contained in this vector would be exactly the same. Moreover, it seems intuitive as well how the absolute value for the number of searches is irrelevant by itself, since we will be interested in the relationships amongst variables. For the reasons we have just set, a compositional approach seems necessary in order to address the problem successfully, since both the explanatory and predicted variables present a compositional nature. In other words, despite not adding up to a constant, the components of the vectors of both the explanatory and predicted variables seem to be closely related in terms of giving information of a part of a whole, so tackling the problem through a compositional perspective seems appropriate. The analysis consists of an exploratory analysis of both response (indices) and explanatory (searches) variables and a compositional linear multiple regression between both sets of variables.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
The information on the searches of different words is provided by Google Trends, subsidiary of Google, Inc. From now on, we will refer to Google Trends simply as Google.
2.
It is arguable whether we could have used an ILR transformation, as we have done for the predicted variable. Provided that a principal component analysis will be applied, it is unnecessary to define a binary partition and an ILR transformation. Given that in the end we will work with the orthogonal base that we get from the principal component analysis, we can work directly with a standard CLR procedure.
3.
Yahoo Finance, owned by Yahoo!, Inc. Information consulted in 2013.
4.
Investing.com, owned by Fusion Media Limited. Information consulted in 2014.

References

Aitchison, J.: The statistical analysis of compositional data (with discussion). J. R. Stat. Soc. Ser. B (Stat. Methodol.) 44(2), 139–177 (1982)
Google Scholar
Aitchison, J.: The Statistical Analysis of Compositional Data (Reprinted in 2003 by The Blackburn Press), p. 416. Chapman & Hall Ltd., London (UK) (1986)
Google Scholar
Aitchison, J., Greenacre, M.: Biplots of compositional data. J. R. Stat. Soc. Ser. C (Appl. Stat.) 51(4), 375–392 (2002)
Google Scholar
Anderson, T.W., Darling, D.A.: Asymptotic theory of certain goodness-of-fit criteria based on stochastic processes. Ann. Math. Stat. 23, 193–212 (1952)
Article MathSciNet MATH Google Scholar
Arrow, K.J.: Aspects of the theory of risk bearing. The theory of risk aversion. Helsinki: Yrjo Jahnssonin Saatio, Reprinted in: Essays in the theory of risk bearing, Markham Publ. Co., Chicago, 1971 (1965)
Google Scholar
Babu, G.J., Rao, C.R.: Goodness-of-fit tests when parameters are estimated. Technometrics (Am. Stat. Assoc.) 66, 63–74 (2004)
MathSciNet MATH Google Scholar
Bordino, I., Battiston, S., Caldarelli, G., et al.: Web search queries can predict stock market volumes. PloS one 7(7):e40, 014 (2012)
Google Scholar
Challet, D., Marsili, M., Zhang, Y.C., et al.: Minority Games: Interacting Agents in Financial Markets. OUP Catalogue (2013)
Google Scholar
Choi, H., Varian, H.: Predicting the present with Google trends. Econ. Rec. 88, 2–9 (2012)
Google Scholar
Cook, R.D.: Detection of influential observations in linear regression. Technometrics (Am. Stat. Assoc.) 19, 15–18 (1977)
MATH Google Scholar
Durbin, J., Watson, G.S.: Testing for serial correlation in least squares regression. I. Biometrika 37, 409–428 (1950)
MathSciNet MATH Google Scholar
Durbin, J., Watson, G.S.: Testing for serial correlation in least squares regression. II. Biometrika 38, 159–179 (1951)
Article MathSciNet MATH Google Scholar
Efron, B., Hastie, T., Johnstone, I., et al.: Least angle regression. Ann. Stat. 32(2), 407–499 (2004)
Article MathSciNet MATH Google Scholar
Egozcue, J.J., Pawlowsky-Glahn, V.: Groups of parts and their balances in compositional data analysis. Math. Geol. 37(7), 799–832 (2005)
Article MathSciNet MATH Google Scholar
Egozcue, J.J., Pawlowsky-Glahn, V.: Compositional data and their analysis: an introduction. Geol. Soc., Lond., Spec. Publ. 264, 1–10 (2006)
Google Scholar
Egozcue, J.J., Pawlowsky-Glahn, V.: Simplicial geometry for compositional data. Geol. Soc., Lond., Spec. Publ. 264, 145–159 (2006)
Google Scholar
Egozcue, J.J., Pawlowsky-Glahn, V., Mateu-Figueras, G., et al.: Isometric logratio transformations for compositional data analysis. Math. Geol. 35(3), 279–300 (2003)
Article MathSciNet MATH Google Scholar
Ginsberg, J., et al.: Detecting influenza epidemics using search engine query data. Nature 457, 1012–1014 (2009)
Article Google Scholar
Kim, C., Storer, B.E.: Reference values for Cook’s distance. Commun. Stat. Simul. Comput. 25, 691–708 (1996)
Google Scholar
Koohang, A., Harman, K., Britz, J.: Knowledge Management: Theoretical Foundation (Chapter 6: Network Analysis and Crowds of People as Sources of New Organisational Knowledge). Informing Science Press, Santa Rosa, CA, US (2008)
Google Scholar
Martín-Fernández, J.A., Hron, K., Templ, M., et al.: Model-based replacement of rounded zeros in compositional data: classical and robust approach. Comput. Stat. Data Anal. 56, 2688–2704 (2012)
Article MATH Google Scholar
Massey, F.J.: The Kolmogorov-Smirnov test for goodness of fit. J. Am. Stat. Assoc. 46(253), 68–78 (1951)
Article MATH Google Scholar
Pawlowsky-Glahn, V., Egozcue, J.J., Tolosana-Delgado, R.: Modeling and analysis of compositional data. Wiley (2015)
Google Scholar
Pratt, J.W.: Risk aversion in the small and in the large. Econometrica 32(1–2), 122–136 (1964)
Article MATH Google Scholar
Preis, T., Moat, H.S., Stanley, H.E.: Quantifying trading behavior in financial markets using Google trends. Sci. Rep. 3, 1684 (2013)
Google Scholar
Royston, P.: Algorithm AS 181: the W test for normality. Appl. Stat. 31, 176–180 (1982)
Article Google Scholar
Surowiecki, J.: The Wisdom of Crowds: Why the many are smarter than the few and how collective wisdom shapes business. Economies. Societies and Nation, Little, Brown (2004)
Google Scholar
Surowiecki, J.: The Wisdom of Crowds. Anchor Books (2005)
Google Scholar

Download references

Acknowledgments

The present research work has been developed during the time that Robert Ortells has benefited from a grant within the program “Becas de Colaboración” of the Spanish Ministerio de Educación, for the development of the project “Correlació entre l’evolució dels mercats financers globals i la popularitat de diferents paraules al buscador Google” during 2014–2015. This project has been developed in the Applied Mathematics III Department of Universitat Politècnica de Catalunya (UPC). This research has been also partially funded by the Spanish Ministerio de Economía y Competitividad under project ‘Metrics’ Ref.MTM2012-33236, and by the Agència de Gestió d’Ajuts Universitaris i de Recerca (AGAUR) of the Generalitat de Catalunya under the project “Compositional and Spatial Analysis” (COSDA) (Ref: 2014SGR551;2014-2016).

Author information

Authors and Affiliations

The London School of Economics and Political Science (LSE), London, SE11 5TA, UK
R. Ortells
Universitat Politècnica de Catalunya (UPC), Campus Nord. C/ Jordi Girona, 1-3, Edifici C2, Despatx 211B, 08034, Barcelona, Spain
J. J. Egozcue
Universitat Politècnica de Catalunya (UPC), Campus Nord. C/ Jordi Girona, 1-3, Edifici C2, Despatx 307, 08034, Barcelona, Spain
M. I. Ortego
Universitat Politècnica de Catalunya (UPC), Campus Nord. C/ Jordi Girona, 1-3, Edifici B1, Despatx 301, 08034, Barcelona, Spain
A. Garola

Authors

R. Ortells
View author publications
You can also search for this author in PubMed Google Scholar
J. J. Egozcue
View author publications
You can also search for this author in PubMed Google Scholar
M. I. Ortego
View author publications
You can also search for this author in PubMed Google Scholar
A. Garola
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. Ortells .

Editor information

Editors and Affiliations

Department of Computer Science and Applied Mathematics, University of Girona, Girona, Spain
Josep Antoni Martín-Fernández
Department of Computer Science and Applied Mathematics, University of Girona, Girona, Spain
Santiago Thió-Henestrosa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ortells, R., Egozcue, J.J., Ortego, M.I., Garola, A. (2016). Relationship Between the Popularity of Key Words in the Google Browser and the Evolution of Worldwide Financial Indices. In: Martín-Fernández, J., Thió-Henestrosa, S. (eds) Compositional Data Analysis. CoDaWork 2015. Springer Proceedings in Mathematics & Statistics, vol 187. Springer, Cham. https://doi.org/10.1007/978-3-319-44811-4_10

Download citation

DOI: https://doi.org/10.1007/978-3-319-44811-4_10
Published: 20 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44810-7
Online ISBN: 978-3-319-44811-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics