Abstract
Nowadays hashtags are an important mechanism of semantic navigation in social media. In the current research we considered the problem of building associative series of hashtags for Instagram. These series should meet two criteria: they should be relatively short and should not have wide semantic gaps between sequential hashtags. A generator of such series could be used for creating recommendations for increasing the quantity of hashtags in messages.
We gathered information from approximately 15 million messages from Instagram and built a co-occurrence network for hashtags from these messages. To build associative series we defined a formal definition of the semantic path building problem as a multicriteria optimization problem on the co-occurrence network.
We developed a combined optimization function for both criteria from the semantic path building problem. For measuring semantic similarity between hashtags, we use a metric based on vector embedding of hashtags by word2vec algorithm. Using empirical paths built by different algorithms, we graduated the parameters of the combined optimization function. The function with these parameters could be used in Dijkstra’s algorithm and in a specialized greedy algorithm for building semantic paths which look as appropriate associative series.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Halliday, K., Hasan, R.: Cohesion in English. Longman, London (1976)
Morris, J., Hirst, G.: Lexical cohesion, the thesaurus, and the structure of text. Comput. Linguist. 17(1), 21–48 (1991)
Fellbaum, C.: WordNet: An Electronic Lexical Database. Language, Speech, and Communication Series. MIT Press, Cambridge (1998)
Barzilay, R., Elhadad, M.: Using lexical chains for text summarization. In: Proceedings of the ACL Workshop on Intelligent Scalable Text Summarization, pp. 10–17, Madrid (1997)
Mikolov, T., Chen, K., Corrado, G.K., Dean, J.: Efficient estimation of word representations in vector space. CoRR, abs/1301.3781 (2013)
Sommer, C.: Shortest-path queries in static networks. ACM Comput. Surv. 46(4), 1–31 (2014). https://doi.org/10.1145/2530531
He, L., et al.: Neurally-guided semantic navigation in knowledge graph. IEEE Trans. Big Data (2018). https://doi.org/10.1109/TBDATA.2018.2805363
West, R., Pineau, J., Precup, D.: Wikispeedia: an online game for inferring semantic distances between concepts. In: IJCAI, pp. 1598–1603. Morgan Kaufmann Publishers Inc. (2009)
Neelakantan, A., Roth, B., McCallum, A.: Compositional vector space models for knowledge base completion. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, pp. 156–166, Beijing, China (2015). https://doi.org/10.3115/v1/P15-1016
Passant, A.: Measuring semantic distance on linking data and using it for resources recommendations. In: AAAI Spring Symposium: Linked Data Meets Artificial Intelligence, vol. 77 (2010)
Bringmann, K., Keusch, R., Lengler, J., Maus, Y., Molla, A.R.: Greedy routing and the algorithmic small-world phenomenon. In: Proceedings of the ACM Symposium on Principles of Distributed Computing, pp. 371–380, New York, USA (2017). https://doi.org/10.1145/3087801.3087829
Capitán, J.A., et al.: Local-based semantic navigation on a networked representation of information. PLoS ONE 7(8), 1–10 (2012). https://doi.org/10.1371/journal.pone.0043694
Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: EMNLP, pp. 1532–1543 (2014). https://doi.org/10.3115/v1/D14-1162
Goyal, P., Ferrara, E.: Graph embedding techniques, applications, and performance: a survey. Knowl. Based Syst. 89–94 (2018). https://doi.org/10.1016/j.knosys.2018.03.022
Dijkstra, E.: A note on two problems in connexion with graphs. Numer. Math. 1(1), 269–271 (1959). https://doi.org/10.1007/BF01386390
Hart, P., Nilsson, N.J., Raphael, B.: A formal basis for the heuristic determination of minimum cost paths. IEEE Trans. Syst. Sci. Cybern. SSC 4, 100–107 (1968). https://doi.org/10.1109/TSSC.1968.300136
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Makrushin, S., Blokhin, N. (2020). Developing a Network-Based Method for Building Associative Series of Hashtags. In: van der Aalst, W., et al. Analysis of Images, Social Networks and Texts. AIST 2019. Communications in Computer and Information Science, vol 1086. Springer, Cham. https://doi.org/10.1007/978-3-030-39575-9_33
Download citation
DOI: https://doi.org/10.1007/978-3-030-39575-9_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-39574-2
Online ISBN: 978-3-030-39575-9
eBook Packages: Computer ScienceComputer Science (R0)