Skip to main content
Log in

Knowledge representation framework of accounting event in corpus-based financial report text

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

As the financial report has entered the “era of annotation” (Fan and Zhang in Contemp Accout Res 29(1):38–42 2012), and the length of the unstructured information in the financial report has far exceeded the financial statements. In order to carry on the automatic analysis and processing to the text information in the financial report with the help of information technology, a combination of the information extraction and XBRL technology is needed to carry on the structured research to the text information in the Chinese financial report (Heidari and Felden in Proceedings of 10th international conference on DESRIST, LNCS 9073, 2015). Based on a self-built Chinese enterprise annual report corpus, this paper studies the characteristics and patterns of the disclosure language of accounting events in Chinese enterprises’ annual reports, which can be examplified in the “related transaction events”. This paper summarizes an expression framework of accounting event knowledge. The study is expected to lay the foundation for the study of annotated textual information extraction and automatic generation of XBRL financial reports, which shall be helpful to improve the quality of financial data and information sharing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Fan, Q.T., Zhang, X.J.: Accounting conservatism, aggregation, and information quality. Contemp. Accout. Res. 29(1), 38–42 (2012). https://doi.org/10.1111/j.1911-3846.2011.01069.x

    Article  Google Scholar 

  2. Heidari, M., Felden, C.: Impact of text mining application on financial footnotes analysis. In: Proceedings of 10th International Conference on DESRIST, LNCS 9073, pp. 463–470 (2015). https://doi.org/10.1007/978-3-319-18714-3_39

  3. Li, H.Q., Zhai, J.: Literature review of XBRL semantic research. In: International Conference on Computer Science and Intelligent Communication. HK: Atlantis, pp. 316–320. (2015). https://doi.org/10.2991/csic-15.2015.76

  4. Radzimski, M., Sanchez-Cervantes, J., Garcia-Crespo, A.: Intelligent architecture for comparative analysis of public companies using semantics and XBRL data. Int. J. Softw. Eng. Knowl. Eng. 24(5), 801–823 (2014). https://doi.org/10.1142/s0218194014500314

    Article  Google Scholar 

  5. Feng, H.Y., Wu, L.W.: Research on information disclosure of China ‘s listed companies based on XBRL financial statements. Financ. Newsl. 4, 14–17 (2011)

    Google Scholar 

  6. Ge, J.S., Liu, F.: On the nature and characteristics of financial reporting of business enterprises. Account. Res. 12, 3 (2011)

    Google Scholar 

  7. Sun, F., Yang, Z.N.: Linguistic analysis and improvement research of XBRL technical system structure. J. Account. Res. 7 (2013)

  8. Santos, I., Castro, E., Velasco, M.: XBRL formula specification in the multidimensional data model. Inf. Syst. 57, 20–37 (2016). https://doi.org/10.1016/j.is.2015.11.001

    Article  Google Scholar 

  9. Hirshleifer, D., Teoh, S.H.: Limited attention, information disclosure, and financial reporting. J. Accout. Econ. 36(1), 337–386 (2003). https://doi.org/10.1016/j.jacceco.2003.10.002

    Article  Google Scholar 

  10. Rutherford, B.A.: Genre analysis of corporate annual report narratives a corpus linguistics-based approach. J. Bus. Commun. 42(4), 349–378 (2005). https://doi.org/10.1177/0021943605279244

    Article  Google Scholar 

  11. Martin, W.: The digital divide: where we are (2012). http://ucrel.lancs.ac.uk/cfie/index.php

  12. Wei, N.X.: Corous-based and corpus-driven approaches to the study of collocation. Contemp. Ling. 2(2), 101–114 (2002)

    Google Scholar 

  13. Cui, X.L., Zhang, B.L.: Global Chinese learners corpus construction program. Lang. Appl. (2), 100–108 (2011)

  14. Simon, S., Wu, J.Y.: Design and use of Chinese sketch engine: a word collocation search interface. In: 2005 International Conference on Internet Chinese Education, pp. 19–27 (2005)

  15. Yuan, Y.L.: Matching even-template with argument structure of verbs: towards a verb-driven approach of information extraction. J. Chin. Inf. Process. 19(5), 37–43 (2005)

    Google Scholar 

  16. Zhao, Y., Qin, B., Che, W.X.: Research on Chinese event extraction. J. Chin. Inf. Process. 22(1), 3–8 (2008)

    Article  Google Scholar 

  17. Meng, L., Ding, X., Qin, B.: Financial event argument extraction based on dependency parsing and noun phrase parsing. In: The 11th Chinese National Conference on Computational Linguistics, CNCCL 2011 (2011)

  18. Li, J.L., Li, X.Q., Zhou, J.S.: Event sentence extraction in financial field. Appl. Res. Comput. 34(10), 2915–2918 (2017). https://doi.org/10.3969/j.issn.1001-3695

    Article  Google Scholar 

  19. Wang, W., Zhao, D.Y., Zhao, W.: Identification of topic sentence about key event in Chinese news. Acta Scientiarum Naturalium Universitatis Pekinensis (2011). https://doi.org/10.1109/isip.2010.112

    Article  Google Scholar 

  20. Xu, R.H., Wu, G., Li, P.F., et al.: Topic event fusion based on event framework. Appl. Res. Comput. 26(12), 4542–4545 (2009)

    Google Scholar 

  21. Antonia, K., Camilla, M., Barbro, B.: Mining textual contents of financial reports. Int. J. Digit. Account. Res. 4(7), 1–29 (2004)

    Google Scholar 

  22. Mendez, N.S., Trivio, G.: Combining semantic web technologies and computational theory of perceptions for text generation in financial analysis. IEEE Int. Conf. on Fuzzy Syst. 2012, 1–8 (2010). https://doi.org/10.1109/fuzzy.2010.5583974

    Article  Google Scholar 

  23. Chou, C., Lian, Z.: Enhancing effectiveness of business information retrieval and integration via text mining and XBRL technology. J. Contemp. Account. 12(1), 85–114 (2011). https://doi.org/10.6675/jca.2011.12.1.04

    Article  Google Scholar 

  24. Antonina, K., Tomas, E., Jonas, K., et al.: Combining data and text mining techniques for analysing financial reports: research articles. Int. J. Intell. Syst. Account. Financ. Manag. 12(1), 29–41 (2004). https://doi.org/10.1002/isaf.v12:1

    Article  Google Scholar 

  25. Apache (2009) The digital divide: where we are. https://pdfbox.apache.org/

  26. Meng, Y.: Use of parallel text in translation of financial audit reports from english to Chinese. Dissertation of Shanghai International Studies University (2014)

  27. He, B., Zhang, L.H.: Management Information Systems. TSINGHUA University Press, Beijing (2006)

    Google Scholar 

  28. Enterprise Accounting Standards Committee (2017). Guidelines for the Application of Accounting Standards for Business Enterprises. LiXin Accounting Press

  29. Sun, M.S.: On the consistency of Chinese word—word corpus. Lang. Appl. (2), 87–90 (1992)

  30. Che, W.X., Li, Z.H., Liu, T.: LTP: a Chinese language technology platform. In: Proceedings of the Coling 2010: Demonstrations. 08:13–16 (2010)

  31. Liang, M.C., Li, W.Z., Xu, J.J.: Using Corpora: A Practical Coursebook. Foreign Language Teaching and Research Press (2010)

  32. Anthony, L. (2011).The digital divide: where we are. http://www.laurenceanthony.net/software.html

  33. Hsiao, F., Gibson, E.: Processing relative clauses in Chinese. Cognition 90(1), 3–27 (2003). https://doi.org/10.1016/S0010-0277(03)00124-0

    Article  Google Scholar 

Download references

Acknowledgements

The research is supported by National Natural Science Foundation of PR China (71771104, 61402197).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhuoqian Liang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liang, Z., Pan, D. & Xu, R. Knowledge representation framework of accounting event in corpus-based financial report text. Cluster Comput 22 (Suppl 4), 9335–9346 (2019). https://doi.org/10.1007/s10586-018-2153-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-018-2153-8

Keywords

Navigation