Stock Trend Prediction by Classifying Aggregative Web Topic-Opinion

  • Li Xue
  • Yun Xiong
  • Yangyong Zhu
  • Jianfeng Wu
  • Zhiyuan Chen
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7819)


According to the Efficient Market Hypothesis(EMH) theory, the stock market is driven mainly by overall information instead of individual event. Furthermore, the information about hot topics is believed to have more impact on stork market than that about ordinary events. Inspired by these ideas, we propose a novel stock market trend prediction method by Classifying Aggregative Web Topic-Opinion(CAWTO), which predicts stocks movement trend according to the aggregative opinions on hot topics mentioned by financial corpus on the web. Several groups of experiments were carried out using the data of Shanghai Stock Exchange Composite Index(SHCOMP) and 287,686 financial articles released on SinaFinance, which prove the effectiveness of our proposed method.


Opinion Mining Aggregative Opinion Stock Prediction 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Fama, E.F.: The behavior of stock-market prices. The Journal of Busines 38(1), 34–105 (1965)CrossRefGoogle Scholar
  2. 2.
    Koppel, M., Shtrimberg, I.: Good news or bad news? Let the market decide. In: Proceedings of the AAAI Spring Symposium on Exploring Attitude and Affect in Text: Theories and Applications, pp. 86–88 (2004)Google Scholar
  3. 3.
    Sehgaland, V., Song, C.: SOPS: Stock Prediction Using Web Sentiment. In: Proceedings of the 7th IEEE International Conference on Data Mining 2007, Data Mining in Web 2.0 Environments Workshop, Omaha, U.S, pp. 21–26 (2007)Google Scholar
  4. 4.
    Wong, K.-F., Xia, Y., Xu, R., Wu, M., Li, W.: Pattern-based Opinion Mining for Stock Market Trend Prediction. International Journal of Computer Processing of Oriental Languages 21(4), 347–362 (2008)CrossRefGoogle Scholar
  5. 5.
    Wthrich, B., Cho, V., Leung, S., Peramunetilleke, D., Sankaran, K., Zhang, J., Lam, W.: Daily Prediction of Major Stock Indices from Textual WWW Data. In: Proceedings of the 4th ACM SIGKDD, NY, pp. 364–368 (1998)Google Scholar
  6. 6.
    Lavrenko, V., Schmill, M., Lawrie, D., Ogilvie, P., Jensen, D., Allan, J.: Language models for financial news recommendation. In: Proceedings of the 9th International Conference on Information and Knowledge Management (2000a)Google Scholar
  7. 7.
    Ingvaldsen, J., Gulla, V., Læreid, T., Sandal, P.: Financial News Mining: Monitoring Continuous Streams of Text. In: Proc. IEEE/WIC/ACM International Conference on Web Intelligence (2006)Google Scholar
  8. 8.
    Kloptchenko, A., Eklund, T., Back, B., Karlsson, J., Vanharanta, H., Visa, A.: Combining data and text mining techniques for analyzing financial reports. In: Proc. Eighth Americas Conference on Information Systems (2002)Google Scholar
  9. 9.
    Ahmad, K., Cheng, D., Almas, Y.: Multi-lingual sentiment analysis in financial news streams. In: Proc. of the 1st Intl. Conf. on Grid in Finance, Italy (2006)Google Scholar
  10. 10.
    Devitt, A., Ahmad, K.: Sentiment Polarity Identification in Financial News: A Cohesion-based Approach. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, Prague, pp. 984–991 (June 2007)Google Scholar
  11. 11.
    Bollen, J., Mao, H., Zeng, X.: Twitter mood predicts the stock market. Journal of Computational Science (2011)Google Scholar
  12. 12.
    Mahajan, A., Dey, L., Haque, S.M.: Mining Financial News for Major Events and Their Impacts on the Market. Presented at the WI-IAT 2008. IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, pp. 423–426 (2008)Google Scholar
  13. 13.
    AlSumait, L., Barbar, D., Domeniconi: Online LDA: Adaptive topic model for mining text streams with application on topic detection and tracking. In: Proceedings of the IEEE International Conference on Data Mining (2008)Google Scholar
  14. 14.
    Bishop, C.M.: Pattern Recognition and Machine Learning. Springer (2006)Google Scholar
  15. 15.
    Davies, D.L., Bouldin, D.: A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1:224-1:227 (1979)Google Scholar
  16. 16.
    Xue, L., Chen, M., Xiong, Y., Zhu, Y.: User Navigation Behavior Mining Using Multiple Data Domain Description. In: IEEE/WIC/ACM, International Conference on Web Intelligence and Intelligent Agent Technology (2010)Google Scholar
  17. 17.
    Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. The Journal of Machine Learning Research 3, 993–1022 (2003)zbMATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Li Xue
    • 1
  • Yun Xiong
    • 1
  • Yangyong Zhu
    • 1
  • Jianfeng Wu
    • 2
  • Zhiyuan Chen
    • 3
  1. 1.School of Computer ScienceFudan UniversityShanghaiP.R. China
  2. 2.Shanghai Stock ExchangesShanghaiP.R. China
  3. 3.Department of Information SystemsUniversity of Maryland Baltimore CountyBaltimoreUSA

Personalised recommendations