Abstract
This paper intend to present an approach to analyse the change of word meaning based on word embedding, which is a more general method to quantize words than before. Through analysing the similar words and clustering in different period, semantic change could be detected. We analysed the trend of semantic change through density clustering method called DBSCAN. Statics and data visualization is also included to make the result more clear. Some words like ‘gay’, ‘mouse’ are traced as case to prove this approach works. At last, we also compared the context words and similar words on semantic presentation and proved the context words worked better.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Bengio, Y., Schwenk, H., Senécal, J.S., Morin, F., Gauvain, J.L.: Neural probabilistic language models. In: Holmes, D.E., Jain, L.C. (eds.) Innovations in Machine Learning, vol. 194, pp. 137–186. Springer, Heidelberg (2006). doi:10.1007/3-540-33486-6_6
Campbell, L.: Historical linguistics: an introduction. Diachronica: Int. J. Hist. Linguist. (1), 159–160 (1998)
Davies, M.: Making Google Books n-grams useful for a wide range of research on language change. Int. J. Corpus Linguist. 19(3), 401–416 (2014)
Elman, J.L.: Finding structure in time. Cogn. Sci. 14(2), 179–211 (1990)
Erk, K.: Unknown word sense detection as outlier detection. In: Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, New York, USA, 4–9 June 2006 (2006)
Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD, vol. 96, pp. 226–231 (1996)
Gulordava, K., Baroni, M.: A distributional similarity approach to the detection of semantic change in the Google Books Ngram corpus. In: GEMS 2011 Workshop on GEometrical MODELS of Natural Language Semantics, pp. 67–71 (2011)
Hamilton, W.L., Leskovec, J., Dan, J.: Diachronic word embeddings reveal statistical laws of semantic change (2016)
Hinton, G., Rumelhart, D., Williams, R.: Learning internal representations by back-propagating errors. Parallel Distrib. Process. Explor. Microstruct. Cogn. 5, 1 (1985)
Hinton, G.E., Mcclelland, J.L., Rumelhart, D.E.: Distributed representations. In: Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol. 1: Foundations (1986)
Jatowt, A., Duh, K.: A framework for analyzing semantic change of words across time. In: Digital Libraries, pp. 229–238 (2014)
Kulkarni, V., Alrfou, R., Perozzi, B., Skiena, S.: Statistically significant detection of linguistic change. In: Computer Science (2014)
McFee, B., Lanckriet, G.R.: Large-scale music similarity search with spatial trees. In: ISMIR, pp. 55–60 (2011)
Mikolov, T.: Language models for automatic speech recognition of Czech lectures. In: Proceedings of STUDENT EEICT (2008)
Mikolov, T., Kopeckỳ, J., Burget, L., Glembek, O., Černockỳ, J.H.: Neural network based language models for highly inflective languages. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2009, pp. 4725–4728. IEEE (2009)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 26, 3111–3119 (2013)
Rohrdantz, C., Hautli, A., Mayer, T., Butt, M., Keim, D.A., Plank, F.: Towards tracking semantic change by visual analytics. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers, vol. 2, pp. 305–310. Association for Computational Linguistics (2011)
Sagi, E., Kaufmann, S., Clark, B.: Semantic density analysis: comparing word meaning across time and phonetic space. In: The Workshop on Geometrical MODELS of Natural Language Semantics, pp. 104–111 (2010)
Wijaya, D.T., Yeniterzi, R.: Understanding semantic change of words over centuries. In: International Workshop on Detecting and Exploiting Cultural Diversity on the Social Web, pp. 35–40 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Liao, X., Cheng, G. (2016). Analysing the Semantic Change Based on Word Embedding. In: Lin, CY., Xue, N., Zhao, D., Huang, X., Feng, Y. (eds) Natural Language Understanding and Intelligent Applications. ICCPOL NLPCC 2016 2016. Lecture Notes in Computer Science(), vol 10102. Springer, Cham. https://doi.org/10.1007/978-3-319-50496-4_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-50496-4_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50495-7
Online ISBN: 978-3-319-50496-4
eBook Packages: Computer ScienceComputer Science (R0)