Skip to main content
Log in

Stock market prediction with time series data and news headlines: a stacking ensemble approach

  • Research
  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

Time series forecasting models are gaining traction in many real-world domains as valuable decision support tools. Stock market analysis is a challenging domain, characterized by a complex multi-variate and time-evolving nature, with high volatility, and multiple correlations with exogenous factors. Autoregressive, machine learning, and deep learning models for temporal data have been adopted thus far to solve this task. However, they are usually limited to the analysis of a single data source or modality, and do not collectively deal with all the inherent challenges and complexities presented by stock market data. In this paper, inspired by the promising learning capabilities of hybrid ensemble methods, we propose a novel stacking ensemble approach for stock market prediction that jointly considers news headlines, multi-variate time series data, and multiple base models as predictors. By taking multiple factors into consideration, our model is able to learn historical patterns leveraging multiple data sources and models. Our experiments showcase the ability of our model to outperform popular baselines on next-day stock market trend prediction. A portfolio analysis reveals that our method is also able to yield potential gains or capital preservation capabilities when its predictions are exploited for trading decisions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Availability of data and materials

All data sources used for our experiments are public and disclosed in the paper.

Notes

  1. https://pypi.org/project/yahooquery/

  2. https://pypi.org/project/TA-Lib/

  3. https://eodhistoricaldata.com/

  4. The dataset used in our work and the implementation of the proposed approach are publicly available at: https://github.com/rcorizzo/stock-market-stacking/

  5. https://pypi.org/project/pmdarima/

  6. https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.GradientBoostingClassifier.html

  7. https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.AdaBoostClassifier.html

  8. All results are conveniently accessible at: http://www.rcorizzo.com/stock-market/stacking/

  9. Additional high-resolution portfolio analysis plots considering other stacking models are conveniently accessible at: http://www.rcorizzo.com/stock-market/stacking/

References

Download references

Acknowledgements

We acknowledge the support of NVIDIA through a donation of a Titan V GPU.

Funding

Not applicable

Author information

Authors and Affiliations

Authors

Contributions

Roberto Corizzo: Conceptualization, Methodology, Investigation, Software, Writing, Resources, Supervision – Jacob Rosen: Data Curation, Software, Visualization, Writing (Review & Editing)

Corresponding author

Correspondence to Roberto Corizzo.

Ethics declarations

Conflict of interest/Competing interests

The authors declare that there are no financial or non-financial interests directly or indirectly related to the work submitted for publication.

Ethics approval

Not applicable

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Corizzo, R., Rosen, J. Stock market prediction with time series data and news headlines: a stacking ensemble approach. J Intell Inf Syst 62, 27–56 (2024). https://doi.org/10.1007/s10844-023-00804-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-023-00804-1

Keywords

Navigation