Skip to main content

Stock Market Prediction Using Keywords from Expert Articles

Part of the Advances in Intelligent Systems and Computing book series (AISC,volume 700)


The market analysis is one of the important tasks for text mining. In this situation, Web news has an important role to predict stock prices. In this paper, we propose a method to predict the Nikkei Stock Average, which is one of the most important stock market indexes. We extract viewpoints from experts’ articles for analyzing Web news. The extracted words are index words in the vector space of a machine learning technique. We also incorporate word embedding and bootstrap approaches into our method. It predicts “UP” or “DOWN” of the next day by using the articles of a day. We also evaluate our method with not only one-day prediction but also simulated trading. The experimental result shows that index words based on expert articles were effective for both one-day prediction and simulated trading.


  • Stock market prediction
  • Experts’ articles
  • Bootstrap
  • Feature extraction

This is a preview of subscription content, access via your institution.

Buying options

USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/978-3-319-72550-5_39
  • Chapter length: 9 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
USD   269.00
Price excludes VAT (USA)
  • ISBN: 978-3-319-72550-5
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book
USD   349.00
Price excludes VAT (USA)
Fig. 1.


  1. 1.

  2. 2.

  3. 3.

    Since the number of BoWs is approximately 10,000, BoK-E is approximately 10% of the BoW.

  4. 4.

  5. 5.

    Note that we never use experts’ articles for the vectorization. We just use expert’s articles for extracting index words for the vector space.

  6. 6.

    More precisely, the model predicts the next day’s “UP” or “DOWN” from the target day.

  7. 7.

    Loss-cutting is a financial word. It is an order to buy or sell stocks automatically in the case that unrealized capital losses become larger.

  8. 8.

    Short selling is a financial word and the practice of selling stocks that are not currently owned by borrowing them from a securities company.

  9. 9.

    More properly, the borrowed stocks are paid back to the securities company. The balance is the benefit of the day.

  10. 10.

    Although we evaluated other methods, such as the method with BoT-E, the BoK-E also produced the best performance in terms of this simulated trading.


  1. Bollen, J., Mao, H., Zeng, X.-J.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)

    CrossRef  Google Scholar 

  2. Lee, H., Surdeanu, M., MacCartney, B., Jurafsky, D.: On the importance of text analysis for stock price prediction. In: Proceedings of the 9th Edition of the Language Resources and Evaluation Conference (LREC), pp. 1170–1175 (2014)

    Google Scholar 

  3. Nguyen, T.H., Shirai, K.: Topic modeling based sentiment analysis on social media for stock market prediction. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, pp. 1354–1364 (2015)

    Google Scholar 

  4. Izumi, K., Goto, T., Matsui, T.: Trading tests of long-term market forecast by text mining. In: Proceedings of 2010 IEEE International Conference on Data Mining Workshops (ICDMW), pp. 935–942 (2010)

    Google Scholar 

  5. Schumaker, R.P., Chen, H.: A discrete stock price prediction engine based on financial news. Computer 43(1), 51–56 (2010)

    CrossRef  Google Scholar 

  6. Ding, X., Zhang, Y., Liu, T., Duan, J.: Using structured events to predict stock price movement: an empirical investigation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1415–1425 (2014)

    Google Scholar 

  7. Xie, B., Passonneau, R.J., Wu, L., Creamer, G.G.: Semantic frames to predict stock price movement. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 873–883 (2013)

    Google Scholar 

  8. Peng, Y., Jiang, H.: Leverage financial news to predict stock price movements using word embeddings and deep neural networks. In: Proceedings of NAACL-HLT 2016, pp. 374–379 (2016)

    Google Scholar 

  9. Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Proceedings of NIPS (2013)

    Google Scholar 

  10. Bar-Haim, R., Dinur, E., Feldman, R., Fresko, M., Goldstein, G.: Identifying and following expert investors in stock microblogs. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pp. 1310–1319 (2011)

    Google Scholar 

  11. Kudo, T., Yamamoto, K., Matsumoto, Y.: Applying conditional random fields to Japanese morphological analysis. In: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP-2004), pp. 230–237 (2004)

    Google Scholar 

  12. Kudo, T., Matsumoto, Y.: Japanese dependency analysis using cascaded chunking. In: CoNLL 2002: Proceedings of the 6th Conference on Natural Language Learning 2002 (COLING 2002 Post-Conference Workshops), pp. 63–69 (2002)

    Google Scholar 

  13. Vapnik, V.N.: Statistical Learning Theory. Wiley, New York (1999)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Kazutaka Shimada .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Verify currency and authenticity via CrossMark

Cite this paper

Ichinose, K., Shimada, K. (2018). Stock Market Prediction Using Keywords from Expert Articles. In: Ghazali, R., Deris, M., Nawi, N., Abawajy, J. (eds) Recent Advances on Soft Computing and Data Mining. SCDM 2018. Advances in Intelligent Systems and Computing, vol 700. Springer, Cham.

Download citation

  • DOI:

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-72549-9

  • Online ISBN: 978-3-319-72550-5

  • eBook Packages: EngineeringEngineering (R0)