Skip to main content

Alternative Data, Big Data, and Applications to Finance

  • Chapter
  • First Online:
Fintech with Artificial Intelligence, Big Data, and Blockchain

Abstract

Financial technology has often been touted as revolutionary for financial services. The promise of financial technology can be ascribed to a handful of key ideas: cloud computing, smart contracts on the blockchain, machine learning/AI, and finally—big and alternative data. This chapter focuses on the last concept, big and alternative data, and unpacks the meaning behind this term as well as its applications. We explore applications to various domains such as quantitative trading, macroeconomic measurement, credit scoring, corporate social responsibility, and more.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    To be more precise, there exists an investment strategy that provides statistically robust “alpha,” relative to a benchmark regression. A typical benchmark is the market returns or the Fama-French three-factor model.

  2. 2.

    \( \frac{{{\text{sales}}_{t} - {\text{sales}}_{{\left\{ {t - 4} \right\}}} }}{\sigma } \) where \( \sigma \) is the standard deviation of the prior annual sales growth \( {\text{sales}}_{t - k} - {\text{sales}}_{t - k - 4} \).

  3. 3.

    There exists a handful of studies in this vein today, but their data are subject to massive back-filling issues. LinkedIn launched in 2007, with most databases gathering this data emerging in 2013. Many of these studies on employee turnover back-fill the history, leading to many issues related to survivorship bias. Further, it is not clear that the time an employee moves is equal to when it is reported on a Web site such as LinkedIn, because employees may do so on a lag or delay.

  4. 4.

    Reuters, July 18, 2018. “UPDATE 1-Ghana’s economy seen up to 40 pct bigger after data overhaul—officials.”

References

  1. Abis S (2017) Man vs. machine: quantitative and discretionary equity management. Working Paper

    Google Scholar 

  2. Agarwal S, Qian W, Zou X (2017) Thy neighbor’s misfortune: peer effect on consumption. Available at SSRN 2780764

    Google Scholar 

  3. Agarwal AK, Hacamo I, Hu Z (2020) Information dispersion across employees and stock returns. Kelley school of business research paper 18–47

    Google Scholar 

  4. Arraiz I, Bruhn M, Ortega CR, Stucchi R (2017) Are psychometric tools a viable screening method for small and medium-size enterprise lending? Evidence from Peru. Working Paper

    Google Scholar 

  5. Baker SR, Bloom N, Davis SJ (2016) Measuring economic policy uncertainty. Q J Econ 131(4):1593–1636

    Article  Google Scholar 

  6. Baker M, Wurgler J (2006) Investor sentiment and the cross-section of stock returns. J Finan 61(4):1645–1680

    Article  Google Scholar 

  7. Baker SR, Farrokhnia RA, Meyer S, Pagel M, Yannelis C (2020) How does household spending respond to an epidemic? Consumption during the 2020 covid-19 pandemic (No. w26949). National Bureau of Economic Research

    Google Scholar 

  8. Baker SR, Bloom N, Davis SJ (2016) Measuring economic policy uncertainty. Q J Econ 2304 131(4):1593–1636

    Google Scholar 

  9. Bartik AW, Cullen ZB, Glaeser EL, Luca M Stanton CT (2020) What jobs are being done at home during the COVID-19 crisis? Evidence from firm-level surveys (No. w27422). National Bureau of Economic Research

    Google Scholar 

  10. Berg T, Burg V, Gombović A, Puri M (2020) On the rise of fintechs: credit scoring using digital footprints. Rev Finan Stud 33(7):2845–2897

    Article  Google Scholar 

  11. Björkegren D, Grissen D (2018) The potential of digital credit to bank the poor. In: AEA papers and proceedings, vol 108, pp 68–71

    Google Scholar 

  12. Black F, Scholes M (1973) The pricing of options and corporate liabilities. J Polit Econ 81(3):637–654

    Google Scholar 

  13. Bronfenbrenner M, Holzman FD (1963) Survey of inflation theory. Am Econ Rev 53(4):593–661

    Google Scholar 

  14. Campello M, Kankanhalli G, Muthukrishnan P (2020) Corporate hiring under covid-19: labor market concentration, downskilling, and income inequality (No. w27208). National Bureau of Economic Research

    Google Scholar 

  15. Castelnuovo E, Tran TD (2017) Google it up! a Google trends-based uncertainty index for the united states and Australia. Econ Lett 161:149–153

    Article  Google Scholar 

  16. Cavallo A (2013) Online and official price indexes: measuring Argentina’s inflation. J Monetary Econ 60(2):152–165

    Article  Google Scholar 

  17. Cavallo A, Rigobon R (2016) The billion prices project: using online prices for measurement and research. J Econ Perspect 30(2):151–178

    Article  Google Scholar 

  18. Charoenwong B, Han M, Wu J (2020a) Not coming home: trade and economic policy uncertainty in American supply Chain networks. SSRN Electr J 1–41. https://doi.org/10.2139/ssrn.3533827

  19. Charoenwong B, Kimura Y, Kwan A (2020b) How and why do managers use public forecasts to guide the market ? Working paper

    Google Scholar 

  20. Charoenwong B, Kimura Y, Kwan A, Tan E (2020c) Investment plans, uncertainty, and misallocation. Working paper

    Google Scholar 

  21. Charoenwong B, Kwan A, Pursiainen V (2020d) Social connections with COVID-19-affected areas increase compliance with mobility restrictions. SSRN Electr J 3574560

    Google Scholar 

  22. Chen H, De P, Hu YJ, Hwang BH (2014) Wisdom of crowds: the value of stock opinions transmitted through social media. Rev Financ Stud 27(5):1367–1403

    Article  Google Scholar 

  23. Chen W, Chen X, Hsieh CT, Song Z (2019) A forensic examination of China’s national accounts (No. w25754). National Bureau of Economic Research

    Google Scholar 

  24. Chen H, Qian W, Wen Q (2020) The impact of the COVID-19 pandemic on consumption: learning from high frequency transaction data. Available at SSRN 3568574

    Google Scholar 

  25. Chetty R, Friedman JN, Hendren N, Stepner M (2020) How did covid-19 and stabilization policies affect spending and employment? A new real-time economic tracker based on private sector data (No. w27431). National Bureau of Economic Research

    Google Scholar 

  26. Clark H, Pinkovskiy M, Sala-i-Martin X (2017) China’s GDP growth may be understated (No. w23323). National Bureau of Economic Research

    Google Scholar 

  27. Cohen L, Frazzini A (2008) Economic links and predictable returns. J Finan 63(4):1977–2011

    Article  Google Scholar 

  28. Da Z, Engelberg J, Gao P (2011) In search of attention. J Finan 66(5):1461–1499

    Article  Google Scholar 

  29. Daniel K, Moskowitz TJ (2016) Momentum crashes. J Financ Econ 122(2):221–247

    Article  Google Scholar 

  30. Deutsche Bank (2015) Macro and MicroJobEnomics. Report, pp 1–39

    Google Scholar 

  31. Ding W, Levine R, Lin C, Xie W (2020) Corporate immunity to the COVID-19 pandemic. J Finan Econ, forthcoming

    Google Scholar 

  32. Doidge C, Karolyi GA, Stulz RM (2017) The US listing gap. J Financ Econ 123(3):464–487

    Article  Google Scholar 

  33. Doyle TM (2018) Ratings that don’t rate: the subjective world of ESG ratings agencies. Report, American Council for Capital Formation

    Google Scholar 

  34. Duarte J, Siegel S, Young L (2012) Trust and credit: the role of appearance in peer-to-peer lending. Rev Finan Stud 25(8):2455–2484

    Article  Google Scholar 

  35. Fama EF, French KR (1993) Common risk factors in the returns on stocks and bonds. J Finan 33:3–56

    MATH  Google Scholar 

  36. Froot K, Kang N, Ozik G, Sadka R (2017) What do measures of real-time corporate sales say about earnings surprises and post-announcement returns? J Financ Econ 125(1):143–162

    Article  Google Scholar 

  37. Goolsbee AD, Klenow PJ (2018) Internet rising, prices falling: measuring inflation in a world of e-commerce. In: AEA Papers and Proceedings, vol 108, pp 488–492

    Google Scholar 

  38. Harvey CR, Liu Y, Zhu H (2016) … and the cross-section of expected returns. Rev Finan Stud 29(1):5–68

    Article  Google Scholar 

  39. Hassan TA, Hollander S, van Lent L, Tahoun A (2019) Firm-level political risk: measurement and effects. Q J Econ 134(4):2135–2202

    Article  Google Scholar 

  40. Henderson JV, Storeygard A, Weil DN (2012) Measuring economic growth from outer space. Am Econ Rev 102(2):994–1028

    Article  Google Scholar 

  41. Hirshleifer D, Li Y, Lourie B, Ruchti T (2019) Do trade creditors possess private information? Stock returns evidence (No. w25553). National Bureau of Economic Research

    Google Scholar 

  42. Hoberg G, Phillips G (2010) Product market synergies and competition in mergers and acquisitions: a text-based analysis. Rev Finan Stud 23(10):3773–3811

    Article  Google Scholar 

  43. Huang J (2018) The customer knows best: The investment value of consumer opinions. J Financ Econ 128(1):164–182

    Article  Google Scholar 

  44. Huang W, Karolyi A, Kwan A (2020) Paying attention to ESG: evidence from big data analytics. Working paper

    Google Scholar 

  45. Jacobs H, Müller S (2020) Anomalies across the globe: Once public, no longer existent?. J Financ Econ 135(1):213–230

    Google Scholar 

  46. Jagadeesh N, Titman S (1993) Returns to buying winners and selling losers: implications for stock market efficiency. J Finan 48(1):65–91

    Article  Google Scholar 

  47. Jha V (2019) Implementing alternative investment data process in an. Big data and machine learning in quantitative investment, p 51

    Google Scholar 

  48. Jiang H, Zhengzi Li S, Wang H (2020) Pervasive underreaction: evidence from high-frequency data. Forthcoming. J Finan Econ

    Google Scholar 

  49. Joo H, Huang Z, Cai F (2009) The impact of macroeconomic announcements on real time foreign exchange rates in emerging markets. FRB international finance discussion paper (973)

    Google Scholar 

  50. Khandani AE, Lo AW (2011) What happened to the quants in August 2007? Evidence from factors and transactions data. J Financ Markets 14(1):1–46

    Article  Google Scholar 

  51. Klinger B, Khwaja AI, Del Carpio C (2013) Enterprising psychometrics and poverty reduction, vol 860. Springer, New York, NY

    Book  Google Scholar 

  52. Kwan A (2020) Measuring knowledge work, working paper

    Google Scholar 

  53. Lee CM, Sun ST, Wang R, Zhang R (2019) Technological links and predictable returns. J Financ Econ 132(3):76–96

    Article  Google Scholar 

  54. Liu Y, Matthies B (2018) Long run risk: is it there? Available at SSRN 2592814

    Google Scholar 

  55. Manela A, Moreira A (2017) News implied volatility and disaster concerns. J Financ Econ 123(1):137–162

    Article  Google Scholar 

  56. McLean RD, Pontiff J (2016) Does academic research destroy stock return predictability? J Finan 71(1):5–32

    Article  Google Scholar 

  57. Mehra R, Prescott EC (1985) The equity premium: a puzzle. J Monetary Econ 15(2):145–161

    Article  Google Scholar 

  58. Mukherjee A, Panayotov G, Shon J (2020) Eye in the sky: private satellites and government macro data. J Financ Econ (forthcoming)

    Google Scholar 

  59. Pinkovskiy M, Sala-i-Martin X (2016) Lights, camera… income! Illuminating the national accounts-household surveys debate. Q J Econ 131(2):579–631

    Article  Google Scholar 

  60. San Pedro J, Proserpio D, Oliver N (2015) MobiScore: towards universal credit scoring from mobile phone data. In: International conference on user modeling, adaptation, and personalization (pp 195–207). Springer, Cham

    Google Scholar 

  61. Savov A (2011) Asset pricing with garbage. J Financ 66(1):177–201

    Article  MathSciNet  Google Scholar 

  62. Silver M, Heravi S (2001) Scanner data and the measurement of inflation. Econ J 111(472):383–404

    Article  Google Scholar 

  63. Todorov A, Mandisodza AN, Goren A, Hall CC (2005) Inferences of competence from faces predict election outcomes. Science 308(5728):1623–1626

    Article  Google Scholar 

  64. Veldkamp LL (2006) Information markets and the comovement of asset prices. Rev Econ Stud 73(3):823–845

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

This chapter draws on material from Dr. Alan Kwan’s course, “Big Data in Finance,” taught at the University of Hong Kong, as well as research by both Ben Charoenwong and Alan Kwan. We thank Emmett Kilduff at Eagle Alpha and Vinesh Jha at Extract Alpha for allowing us to re-print statistics from their companies’ respective whitepapers, and Chalinee Charoenwong for assistance.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alan Kwan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Charoenwong, B., Kwan, A. (2021). Alternative Data, Big Data, and Applications to Finance. In: Choi, P.M.S., Huang, S.H. (eds) Fintech with Artificial Intelligence, Big Data, and Blockchain. Blockchain Technologies. Springer, Singapore. https://doi.org/10.1007/978-981-33-6137-9_2

Download citation

  • DOI: https://doi.org/10.1007/978-981-33-6137-9_2

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-33-6136-2

  • Online ISBN: 978-981-33-6137-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics