Sentiment analysis of financial news articles using performance indicators

Regular Paper
  • 338 Downloads

Abstract

Mining financial text documents and understanding the sentiments of individual investors, institutions and markets is an important and challenging problem in the literature. Current approaches to mine sentiments from financial texts largely rely on domain-specific dictionaries. However, dictionary-based methods often fail to accurately predict the polarity of financial texts. This paper aims to improve the state-of-the-art and introduces a novel sentiment analysis approach that employs the concept of financial and non-financial performance indicators. It presents an association rule mining-based hierarchical sentiment classifier model to predict the polarity of financial texts as positive, neutral or negative. The performance of the proposed model is evaluated on a benchmark financial dataset. The model is also compared against other state-of-the-art dictionary and machine learning-based approaches and the results are found to be quite promising. The novel use of performance indicators for financial sentiment analysis offers interesting and useful insights.

Keywords

Sentiment analysis Financial news Performance indicators Text mining Machine learning Classification 

References

  1. 1.
    Agrawal R, Imieliński T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of the 1993 ACM SIGMOD International conference on management of data, pp 207–216, Washington, DC, May 26–28Google Scholar
  2. 2.
    Antweiler W, Frank MZ (2004) Is all that talk just noise? The information content of internet stock message boards. J Finance 59(3):1259–1294CrossRefGoogle Scholar
  3. 3.
    Berzal F, Cubero J-C, Sánchez D, Serrano JM (2004) Art: a hybrid classification model. Mach Learn 54(1):67–92CrossRefMATHGoogle Scholar
  4. 4.
    Bird S (2006) Nltk: the natural language toolkit. In: Proceedings of the COLING/ACL on interactive presentation sessions. Association for Computational Linguistics, pp 69–72Google Scholar
  5. 5.
    Blitzer J, Blitzer J, Dredze M, Dredze M, Pereira F, Pereira F (2007) Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. Annu Meet Assoc Comput Linguist 45(1):440Google Scholar
  6. 6.
    Cambria E, Olsher D, Rajagopal D ( 2014). Senticnet 3: a common and common-sense knowledge base for cognition-driven sentiment analysis. In: Proceedings of the twenty-eighth AAAI conference on artificial intelligence, pp 1515–1521Google Scholar
  7. 7.
    Dang Y, Zhang Y, Chen H (2010) A lexicon-enhanced method for sentiment classification: an experiment on online product reviews. IEEE Intell Syst 25(4):46–53CrossRefGoogle Scholar
  8. 8.
    Esuli A, Sebastiani F (2006) Sentiwordnet: a publicly available lexical resource for opinion mining. In: Proceedings of LREC, vol 6, pp 417–422Google Scholar
  9. 9.
    Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27(8):861–874MathSciNetCrossRefGoogle Scholar
  10. 10.
    Ferguson N, Philip D, Lam H, Guo J (2014) Media content and stock returns: the predictive power of press. Multinatl Finance J 19(1):1–31Google Scholar
  11. 11.
    Huang AH, Zang AY, Zheng R (2014) Evidence on the information content of text in analyst reports. Account Rev 89(6):2151–2180CrossRefGoogle Scholar
  12. 12.
    Ittner CD, Larcker DF (1998) Are nonfinancial measures leading indicators of financial performance? An analysis of customer satisfaction. J Account Res 36:1–35CrossRefGoogle Scholar
  13. 13.
    Kaplan RS, Norton DP (1996) Linking the balanced scorecard to strategy. Calif Manag Rev 39(1):53–79CrossRefGoogle Scholar
  14. 14.
    Kearney C, Liu S (2014) Textual sentiment in finance: a survey of methods and models. Int Rev Financ Anal 33:171–185CrossRefGoogle Scholar
  15. 15.
    Li F (2010) The information content of forward- looking statements in corporate filings—a naive bayesian machine learning approach. J Account Res 48(5):1049–1102CrossRefGoogle Scholar
  16. 16.
    Li W, Han J, Pei J (2001) CMAR: accurate and efficient classification based on multiple class-association rules. In: Proceedings 2001 IEEE international conference on data mining, pp 369–376Google Scholar
  17. 17.
    Li Q, Wang T, Li P, Liu L, Gong Q, Chen Y (2014a) The effect of news and public mood on stock movements. Inf Sci 278:826–840CrossRefGoogle Scholar
  18. 18.
    Li X, Xie H, Chen L, Wang J, Deng X (2014b) News impact on stock price return via sentiment analysis. Knowl Based Syst 69:14–23CrossRefGoogle Scholar
  19. 19.
    Li Q, Chen Y, Jiang LL, Li P, Chen H (2016) A tensor-based information framework for predicting the stock market. ACM Trans Inf Syst (TOIS) 34(2):11CrossRefGoogle Scholar
  20. 20.
    Liu M, Qu J (2012) Mining high utility itemsets without candidate generation. In: Proceedings of the 21st ACM international conference on information and knowledge management. ACM, pp 55–64Google Scholar
  21. 21.
    Liu B, Hsu W, Ma Y, Ma B (1998) Integrating classification and association rule mining. Knowl Discov Data Min 80–86Google Scholar
  22. 22.
    Loughran T, Mcdonald B (2011) When is a liability not a liability? Textual analysis, dictionaries, and 10-Ks. J Finance 66(1):35–65CrossRefGoogle Scholar
  23. 23.
    Loughran T, McDonald B (2015) The use of word lists in textual analysis. J Behav Finance 16(1):1–11CrossRefGoogle Scholar
  24. 24.
    Loughran T, McDonald B (2016) Textual analysis in accounting and finance: a survey. J Account Res 54:1187–1230CrossRefGoogle Scholar
  25. 25.
    Malo P, Sinha A, Korhonen P, Wallenius J, Takala P (2014) Good debt or bad debt: detecting semantic orientations in economic texts. J Assoc Inf Sci Technol 65(4):782–796CrossRefGoogle Scholar
  26. 26.
    Man Y, Yuanxin O, Hao S (2014) Investigating association rules for sentiment classification of web reviews. J Intell Fuzzy Syst 27(4):2055–2065Google Scholar
  27. 27.
    Meretakis D, Wüthrich B (1999) Extending naive bayes classifiers using long itemsets. In: Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 165–174Google Scholar
  28. 28.
    Moilanen K, Pulman S, Zhang Y (2010) Packed feelings and ordered sentiments: sentiment parsing with quasi-compositional polarity sequencing and compression. In: Proceedings of the 1st workshop on computational approaches to subjectivity and sentiment analysis (WASSA 2010) at the 19th European conference on artificial intelligence (ECAI 2010), pp. 36–43Google Scholar
  29. 29.
    Mo S, Y K, Liu A, Yang SY (2016) News sentiment to market impact and its feedback effect. Environ Syst Decisi 1–9Google Scholar
  30. 30.
    O’Hare N, Davy M, Bermingham A, Ferguson P, Sheridan PP, Gurrin C, Smeaton AF, OHare N. (2009) Topic-dependent sentiment analysis of financial blogs. In: International CIKM workshop on topic-sentiment analysis for mass opinion measurement, pp 9–16Google Scholar
  31. 31.
    Pang B, Lee L, Vaithyanathan S (2002) Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the conference on empirical methods in natural language processing, pp 79–86Google Scholar
  32. 32.
    Schumaker RP, Chen H (2009) Textual analysis of stock market prediction using breaking financial news: the AZFin text system. ACM Trans Inf Syst (TOIS) 27(2):12CrossRefGoogle Scholar
  33. 33.
    Stone PJ, Bales RF, Namenwirth JZ, Ogilvie DM (1962) The general inquirer: a computer system for content analysis and retrieval based on the sentence as unit of information. Comput Behav Sci 7(4):484–498CrossRefGoogle Scholar
  34. 34.
    Tetlock PC (2016) Giving content to investor sentiment: the role of media in the stock market. J Finance 62(3):1139–1168CrossRefGoogle Scholar
  35. 35.
    Tetlock PC, Saar-Tsechansky M, MacSkassy S (2008) More than words: quantifying language to measure firms’ fundamentals. J Finance 63(3):1437–1467CrossRefGoogle Scholar
  36. 36.
    Thelwall M, Buckley K, Paltoglou G (2012) Sentiment strength detection for the social web. J Am Soc Inf Sci Technol 63(1):163–173CrossRefGoogle Scholar
  37. 37.
    Turney PD (2002) Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th annual meeting of the association for computational linguistics, pp 417–424Google Scholar
  38. 38.
    Van De Kauter M, Breesch D, Hoste V (2015) Fine-grained analysis of explicit and implicit sentiment in financial news articles. Expert Syst Appl 42(11):4999–5010CrossRefGoogle Scholar
  39. 39.
    Wiebe J, Wilson T, Cardie C (2005) Annotating expressions of opinions and emotions in language. Lang Resour Eval 39(2–3):165–210CrossRefGoogle Scholar
  40. 40.
    Yang CC, Tang X, Wong Y, Wei C-P (2010) Understanding online consumer review opinions with sentiment analysis using machine learning. Pac Asia J Assoc Inf Syst 2(3):73–89Google Scholar

Copyright information

© Springer-Verlag London Ltd., part of Springer Nature 2017

Authors and Affiliations

  1. 1.Information Systems AreaIndian Institute of Management AhmedabadAhmedabadIndia

Personalised recommendations