Abstract
Professional discourse is a culturally situated construct. However, due to methodological constrains, culture as a discourse determinant revealed by analytical traditions cannot easily be generalized. By integrating culture representative word stems generated by a bag-of-word (BoW) model into a latent Dirichlet allocation (LDA) model, this paper explored a lexicon-based hybrid approach to mining corporate cultures from MD&As. Two separate corpora, the corporation culture corpus sized 801,219 tokens and the MD&A corpus with 2,199,109 tokens, were used. Key word stems drawn from the corporate culture corpus by the BoW showed a strong capacity of distinguishing implicit corporate values between texts, while the LDA model applied to the MD&A corpus succeeded in identifying three types of cultures in MD&As. Case studies confirmed the credibility of the classification. The hybrid method extends the literature by providing a quantitative tool for the qualitative study of culture in professional discourse, and shows implications for what interdiscursivity in MD&As looks like and how it works to formulate a renewable cycle in discourse realization.
Similar content being viewed by others
Data Availability
Not applicable.
Code Availability
Not applicable.
References
Audi, R., Loughran, T., & McDonald, B. (2016). Trust, but verify: MD&A language and the role of trust in corporate culture. Journal of Business Ethics, 139(3), 551–561.
Berkenkotter, C., & Huckin, T. N. (1995). Genre knowledge in disciplinary communication: Cognition/ Culture/ power. New Jersey: Lawrence Erlbaum Associates.
Bhatia, V. K. (2006). Discursive practices in disciplinary and professional contexts. Linguistics & the Human Sciences, 2(1), 5–28.
Bhatia, V. K. (2008). Genre analysis, ESP and professional practice. English for Specific Purposes, 27(2), 161–174.
Bhatia, V. K. (2010). Interdiscursivity in professional communication. Discourse & Communication, 4(1), 32–50.
Bhatia, V. K. (2017). Critical genre analysis: investigating Interdiscursive Performance in Professional Practice. London and New York: Routledge.
Bjørge, A. K., & Whittaker, S. (2015). Corporate values: a linguistic approach. International Journal of Cross Cultural Management, 15(3), 347–362.
Block, B. (1996). The language-culture connection in international business. Foreign Language Annals, 29(1), 27–36.
Blue, G. M., & Harun, M. (2003). Hospitality language as a professional skill. English for Specific Purposes, 22(1), 73–91.
Bonini, A. (2010). Critical genre analysis and professional practice: the case of public contests to select professors for brazilian public universities. Linguagem em (Dis) curso, 10(3), 485–510.
Breeze, R. (2011). Critical discourse analysis and its critics. Pragmatics, 21(4), 493–525.
Brown, R. W., & Lenneberg, E. H. (1954). A study in language and cognition. The Journal of Abnormal and Social Psychology, 49(3), 454.
Cobb, T., & Horst, M. (2004). Is there room for an academic word list in french?. In P. Bogaards, & B. Laufer (Eds.), Vocabulary in a Second Language: selection, Acquisition, and testing (pp. 15–38). Amsterdam: John Benjamins Publishing.
Coxhead, A., Stevens, L., & Tinkle, J. (2010). Why might secondary science textbooks be difficult to read? New Zealand Studies in Applied Linguistics, 16(2), 35–52.
Fairclough, N. (1995). Critical discourse analysis: the critical study of Language. London: Longman.
Fairclough, N. (2003). Analysing discourse: textual analysis for Social Research. London and New York: Routledge.
Feinerer, I., Hornik, K., & Meyer, D. (2008). Text mining infrastructure in R. Journal of Statistical Software, 25, 1–54.
Foss, D. J., & Hakes, D. T. (1978). Psycholinguistics: an introduction to the psychology of Language. New Jersey: Prentice-Hall.
Gan, J., & Qi, Y. (2021). Selection of the optimal number of topics for LDA Topic Model - taking patent policy analysis as an Example. Entropy, 23(10), 1–45.
Hájek, P., Olej, V., & Prochazka, O. (2017). Predicting Corporate Credit Ratings Using Content Analysis of Annual Reports - A Naïve Bayesian Network Approach. In S. Feuerriegel & D. Neumann (Eds.), Enterprise Applications, Markets and Services in the Finance Industry (pp. 47–61). Berlin Heidelberg: Springer.
Hamamura, T., & Xu, Y. (2015). Changes in chinese culture as examined through changes in personal pronoun usage. Journal of Cross-Cultural Psychology, 46(7), 930–941.
Haniffa, R. M., & Cooke, T. E. (2002). Culture, corporate governance and disclosure in malaysian corporations. Abacus, 38(3), 317–349.
Heath, S. B. (1983). Ways with words: Language, Life and Work in Communities and Classrooms. Cambridge: Cambridge University Press.
Hofstede, G., Hofstede, G. J., & Minkov, M. (2005). Cultures and organizations: Software of the mind (Vol. 2). New York: Mcgraw-hill.
Hofstede, G., Neuijen, B., Ohayv, D. D., & Sanders, G. (1990). Measuring organizational cultures: A qualitative and quantitative study across twenty cases. Administrative Science Quarterly, 35(2), 286–316.
Hornik, K., & Grün, B. (2011). Topicmodels: an R package for fitting topic models. Journal of Statistical Software, 40(13), 1–30.
Huang, C. Y., Liu, P. Y., & Xie, S. M. (2020). Predicting brand equity by text-analyzing annual reports. International Journal of Market Research, 62(3), 300–313.
Hudson, B. (2002). Interprofessionality in health and social care: the Achilles’ heel of partnership? Journal of Interprofessional Care, 16(1), 7–17.
Humpherys, S. L., Moffitt, K. C., Burns, M. B., Burgoon, J. K., & Felix, W. F. (2011). Identification of fraudulent financial statements using linguistic credibility analysis. Decision Support Systems, 50(3), 585–594.
Levina, N., & Orlikowski, W. J. (2009). Understanding shifting power relations within and across organizations: a critical genre analysis. Academy of Management Journal, 52(4), 672–703.
Li, K., Mai, F., Shen, R., & Yan, X. (2021). Measuring corporate culture using machine learning. The Review of Financial Studies, 34(7), 3265–3315.
Maass, A., Karasawa, M., Politi, F., & Suga, S. (2006). Do verbs and adjectives play different roles in different cultures? A cross-linguistic analysis of person representation. Journal of Personality and Social Psychology, 90(5), 734–750.
Marincioni, F. (2007). Information technologies and the sharing of disaster knowledge: the critical role of professional culture. Disasters, 31(4), 459–476.
Matveeva, I., Levow, G. A., Farahat, A., & Royer, C. H. (2007). Term representation with generalized latent semantic analysis. In N. Nicolov, G. Angelova, & R. Mitkov (Eds.), Recent advances in Natural Language Processing IV: selected Papers from RANLP (pp. 45–54). Amsterdam: John Benjamins Publishing.
Moreno, R. G., & Sznajder, H. S. (2013). Business communication across three european cultures: a contrastive analysis of british, spanish and polish email writing. Ibérica, 26, 77–98.
Moro, S., Cortez, P., & Rita, P. (2015). Business intelligence in banking: a literature analysis from 2002 to 2013 using text mining and latent dirichlet allocation. Expert Systems with Applications, 42(3), 1314–1324.
Nikolenko, S. I., Koltcov, S., & Koltsova, O. (2017). Topic modelling for qualitative studies. Journal of Information Science, 43(1), 88–102.
Noorullah, R., & Mohammed, M. (2021). Visualization and performance measure to determine number of topics in twitter data clustering using hybrid topic modeling. Journal of Intelligent & Fuzzy Systems, 41, 803–817.
Potts, A., & Baker, P. (2012). Does semantic tagging identify cultural change in british and american English? International Journal of Corpus Linguistics, 17(3), 295–324.
Qian, Y. (2020). A critical genre analysis of MD&A discourse in corporate annual reports. Discourse & Communication, 14(4), 424–437.
Qian, Y., & Sun, Y. (2021). The Correlation Between Annual Reports’ Narratives and Business Performance: A Retrospective Analysis. SAGE Open, 11(3), 1–14.
Rogerson-Revell, P. (2007). Using English for international business: a european case study. English for Specific Purposes, 26(1), 103–120.
Rosch, E. (1987). Linguistic relativity. Et Cetera, 44(3), 254–279.
Sherzer, J. (1987). Discourse-centered approach to language and culture. American Anthropologist, 89(2), 295–309.
Stubbs, M. (1997). Whorf’s children: critical comments on critical discourse analysis. In A. Ryan, & A. Wray (Eds.), Evolving models of Language (pp. 100–116). Clevedon: Multilingual Matters.
Suzuki, S. (2013). Culture change in organizational public discourse 1998–2008: Examining annual reports of Japanese and US corporations. International Journal of Intercultural Relations, 37(5), 579–593.
Tarca, A., Street, D. L., & Aerts, W. (2011). Factors affecting MD&A disclosures by SEC registrants: views of practitioners. Journal of International Accounting Auditing and Taxation, 20(1), 45–59.
Vuorela, T. (2005). How does a sales team reach goals in intercultural business negotiations? A case study. English for Specific Purposes, 24(1), 65–92.
Widdowson, H. G. (2004). Text, Context, Pretext: critical issues in discourse analysis. Oxford: Blackwell.
Yang, Y. (2013). Exploring linguistic and cultural variations in the use of hedges in English and Chinese scientific discourse. Journal of Pragmatics, 50(1), 23–36.
Zhao, W., Chen, J. J., Perkins, R., Liu, Z., Ge, W., Ding, Y., & Zou, W. (2015). A heuristic approach to determine an appropriate number of topics in topic modeling. Bmc Bioinformatics, 16(Suppl 13), doi: https://doi.org/10.1186/1471-2105-16-S13-S8.
Funding
Beijing Social Science Fund (Grant no. 21YYC013).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest/Competing interests
No conflicts of interest.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Qian, Y. Mining culture from professional discourse: a lexicon-based hybrid method. Lang Resources & Evaluation 57, 945–961 (2023). https://doi.org/10.1007/s10579-022-09630-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10579-022-09630-3