Abstract
Recently, investor sentiment measures have become one of the more widely examined areas in behavioral finance. They are capable of both explaining and forecasting stock returns. The purpose of this paper is to present a method, based on a combination of a Naïve Bayes classifier and the n-gram probabilistic language model, which can create a sentiment index for specific stocks and indices of the New York Stock Exchange. An economic useful proxy for investor sentiment is constructed from U.S. news articles mainly provided by The New York Times. Initially, a large amount of articles for ten big companies and indices is collected and processed, in order to be able to extract a sentiment score from each one of them. Then, the classifier is trained from the positive, negative and neutral articles, so that it is possible afterwards to examine the sentiment of any unseen newspaper article, for any company or index. Subsequently, the classification task is tested and validated for its accuracy and efficiency. The widely used Baker and Wurgler sentiment index [2] is used as a comparison measure for predicting stock returns. In a sample of S&P 500 index from 2004 to 2010 on monthly basis, it is shown that the new sentiment index created has, on average, twice the predictive ability of Baker and Wurgler’s index, for the existing time frame.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Antweiler, W., Frank, M.: Is all that talk just noise? The information content of Internet stock message boards. J. Finance 59(3), 1259–1294 (2004)
Baker, M., Wurgler, J.: Investor sentiment and the cross-section of stock returns. J. Finance 61(4), 1645–1680 (2006)
Barber, B.M., Odean, T., Zhu, N.: Do retail trades move markets? Rev. Finan. Stud. 22, 151–186 (2009)
Bollen, J., Mao, H., Zeng, X.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2010)
Bram, J., Ludvigson, S.C.: Does consumer confidence forecast household expenditure? A sentiment index horse race. Econ. Policy Rev. 4(2) (1998)
Das, S., Chen, M.: Yahoo! for Amazon: sentiment extraction from small talk on the Web. Manag. Sci. 53(9), 1375–1388 (2007)
Dergiades, T.: Do investors’ sentiment dynamics affect stock returns? Evidence from the US economy. Econ. Lett. 116(3), 404–407 (2012). ISSN 0165-1765, http://dx.doi.org/10.1016/j.econlet.2012.04.018
Dickinson, B., Hu, W.: Sentiment analysis of investor opinions on twitter. Soc. Network 4, 62–71 (2015)
Fang, H., Tao, T., Zhai, Ch. X.: A formal study of information retrieval heuristics. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ‘04). ACM, New York, NY, USA, pp. 49–56 (2004). doi:http://dx.doi.org/10.1145/1008992.1009004
Garcia, D.: Sentiment during recessions. J. Finance 68(3), 1267–1299 (2013)
Iyengar, S.: Is Anyone Responsible? How Television Frames Political Issues. University of Chicago Press, Chicago (1991)
Klemola, A., Nikkinen, J., Peltomäki, J.: Investor Sentiment in the Stock Market Inferred from Google Search Volumes (2010)
McDonald, B.: Bill McDonald’s Word Lists Page, University of Notre Dame (2013). http://www3.nd.edu/~mcdonald/Word_Lists.html
Pang, B., Lee, L.: Opinion Mining and Sentiment Analysis (2008)
Plous S (1993) The Psychology of Judgment and Decision Making. McGraw-Hill. ISBN 978-0-07-050477-6
Price, V., Tewksbury, D.: News Values and Public Opinion: A Theoretical Account of Media Priming and Framing. In: Barett, G.A., Boster, F.J. (eds.) Progress in Communication Sciences: Advances in Persuasion, vol. 13, pp. 173–212. Ablex, Greenwich, CT (1997)
Price, V., Tewksbury, D., Powers, E.: Switching trains of thought: the impact of news frames on readers’ cognitive responses. Commun. Res. 24(5), 481–506 (1997)
Qiu, L., Welch, I.: Investor Sentiment Measures. Brown University and NBER (2006)
Schmeling, M.: Institutional and individual sentiment: smart money and noise trader risk? 23, 127–145 (2007)
Sehgal, V., Song, C.: SOPS: Stock Prediction using Web Sentiment. In: Seventh IEEE International Conference on Data Mining—Workshops, pp. 21–26 (2009)
Shiller, R.J.: Irrational Exuberance, 2nd edn. Princeton University Press, Princeton, New Jersey (2005)
Shleifer, A., Summers, L.H.: The noise trader approach to finance. J. Econ. Perspect. 4, 19–33 (1990)
Spärck Jones, K.: A statistical interpretation of term specificity and its application in retrieval. J. Documentation 28, 11–21 (1972). doi:10.1108/eb026526
Statman, M.: Normal investors, then and now. CFA Institute, Finan. Anal. J. 61(2), 31–37 (2005)
Tumarkin, R., Whitelaw, R.F.: News or Noise? Internet Message Board Activity and Stock Prices (2000)
Tversky, A., Kahneman, D.: The Framing of decisions and the psychology of choice. Science 211(4481), 453–458 (1981). doi:10.1126/science.7455683
Vliegenthart, R., Schuck, A.R.T., Boomgaarden, H.G., De Vreese, C.H.: News coverage and support for european integration, 1990–2006. Int. J. Public Opin. Res. 20(4), 415–436 (2008)
Wolpert, D.H.: Stacked generalization. Neural Networks 5, 241 (1992)
Acknowledgments
We would like to give special thanks to Dr. Theologos Dergiades, Academic Associate of the School of Science & Technology at the International Hellenic University, Thessaloniki, Greece, for his advice and assistance whenever it was needed.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Arvanitis, K., Bassiliades, N. (2017). Real-Time Investors’ Sentiment Analysis from Newspaper Articles. In: Hatzilygeroudis, I., Palade, V., Prentzas, J. (eds) Advances in Combining Intelligent Methods. Intelligent Systems Reference Library, vol 116 . Springer, Cham. https://doi.org/10.1007/978-3-319-46200-4_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-46200-4_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46199-1
Online ISBN: 978-3-319-46200-4
eBook Packages: EngineeringEngineering (R0)