Abstract
Intelligent Technologies and research results from the field of knowledge management, have steadily and progressively improved knowledge quality, over the course of the last decades, especially in the current industrial context. The companies consider the knowledge as an important strategic resource for innovation. This paper focus on the problem of learning topic hierarchies from knowledge. The aim targeted is to respond to knowledge capitalization issues in big data context, by proposing a Knowledge capitalization system as an adaptive intelligent technique. This system acts on top of a big data platform, and runs on large scale globally to constitute a robust intelligent knowledge capitalization paradigm, with a clear separation of concerns. The architecture considers a batch processing as a preparation stage which starts by extracting hidden topics, by means of the HLDA in order to handle the complexity of multi knowledge domains, and to keep the semantic relations between knowledge entities. The hierarchical mechanism gives an effective and flexible way to store and analyses the knowledge. As a result, the time responding is obtained of high quality, and with the best precision, in comparison with the systems which uses LSA and LDA approach as a preparation stage.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Huang, T., Lan, L., Fang, X., An, P., Min, J., Wang, F.: Promises and challenges of big data computing in health sciences. Big Data Res. 2(1), 2–11 (2015)
Davenport, T.H., Patil, D.: Data scientist. Harvard Bus. Rev. 90(5), 70–76 (2012)
Griffiths, T.L., Jordan, M.I., Tenenbaum, J.B., Blei, D.M.: Hierarchical topic models and the nested Chinese restaurant process. In: Advances in Neural Information Processing Systems, pp. 17–24 (2004)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391 (1990)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Garcia, G., Roser, X.: Enhancing integrated design model–based process and engineering tool environment: towards an integration of functional analysis, operational analysis and knowledge capitalisation into co-engineering practices. In: Concurrent Engineering, 1063293X17737357 (2016)
Tung, V.W.S., Cheung, C., Law, R.: Does the listener matter? the effects of capitalization on storytellers evaluations of travel memories. J. Travel Res. 0047287517729759 (2017)
Martín-de Castro, G.: Knowledge management and innovation in knowledge-based and high-tech industrial markets: the role of openness and absorptive capacity. Ind. Mark. Manag. 47, 143–146 (2015)
Lebis, A., Lefevre, M., Luengo, V., Guin, N.: Capitalisation of analysis processes: enabling reproducibility, openness and adaptability thanks to narration. In: Proceedings of the 8th International Conference on Learning Analytics and Knowledge. ACM, pp. 245–254 (2018)
De Silva, M., Howells, J., Meyer, M.: Innovation intermediaries and collaboration: knowledge-based practices and internal value creation. Res. Policy 47(1), 70–87 (2018)
Nikolenko, S.I., Koltcov, S., Koltsova, O.: Topic modelling for qualitative studies. J. Inf. Sci. 43(1), 88–102 (2017)
Pascual, M., Miñana, E.P., Giacomello, E.: Integrating knowledge on biodiversity and ecosystem services: mind-mapping and bayesian network modelling. Ecosyst. Serv. 17, 112–122 (2016)
Bast, H., Buchhold, B., Haussmann, E., et al.: Semantic search on text and knowledge bases. Found. Trends® Inf. Retrieval 10(2–3), 119–271 (2016)
Zwicklbauer, S., Seifert, C., Granitzer, M.: Doser-a knowledge-base-agnostic framework for entity disambiguation using semantic embeddings. In: International Semantic Web Conference. Springer, pp. 182–198 (2016)
Ganea, O.E., Ganea, M., Lucchi, A., Eickhoff, C., Hofmann, T.: Probabilistic bag-of-hyperlinks model for entity linking. In: Proceedings of the 25th International Conference on World Wide Web, International World Wide Web Conferences Steering Committee, pp. 927–938 (2016)
Wang, C., Blei, D.M.: Collaborative topic modeling for recommending scientific articles. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD’11. ACM, New York, NY, USA, pp. 448–456 (2011)
Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, pp. 50–57 (1999)
Berger, J.B.M.B.J., Dawid, A., Smith, D.H.A., West, M.: Hierarchical Bayesian models for applications in information retrieval. In: Bayesian Statistics 7: Proceedings of the Seventh Valencia International Meeting, p. 25. Oxford University Press (2003)
Girolami, M., Kabán, A.: On an equivalence between PLSI and LDA. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR’03, pp. 433–434. ACM, New York, NY, USA (2003)
Blei, D.M.: Probabilistic topic models. Commun. ACM 55(4), 77–84 (2012)
Mimno, D., Wallach, H.M., Talley, E., Leenders, M., McCallum, A.: Optimizing semantic coherence in topic models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, pp. 262–272 (2011)
Blei, D.M., Griffiths, T.L., Jordan, M.I.: The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies. J. ACM (JACM) 57(2), 7 (2010)
Andrieu, C., de Freitas, N., Doucet, A., Jordan, M.I.: An introduction to MCMC for machine learning. Mach. Learn. 50(1), 5–43 (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Hirchoua, B., Ouhbi, B., Frikh, B. (2019). Topic Hierarchies for Knowledge Capitalization using Hierarchical Dirichlet Processes in Big Data Context. In: Ezziyyani, M. (eds) Advanced Intelligent Systems for Sustainable Development (AI2SD’2018). AI2SD 2018. Advances in Intelligent Systems and Computing, vol 915. Springer, Cham. https://doi.org/10.1007/978-3-030-11928-7_54
Download citation
DOI: https://doi.org/10.1007/978-3-030-11928-7_54
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-11927-0
Online ISBN: 978-3-030-11928-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)