Abstract
Today Twitter is one of the most important sources for information distribution. But finding useful and interesting tweets on a specific topic is a non-trivial task, because there are thousands of new posts every minute. In this paper, we describe our preference-based search approach on Twitter messages, which allows users to get the best possible results. For this, we introduce a new CONTAINS preference constructor to search on full-text data, use NLP techniques to handle natural language mistakes, and present experiments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ayers, J.W., et al.: Why do people use electronic nicotine delivery systems (electronic cigarettes)? A content analysis of Twitter, 2012–2015. PLoS ONE 12(3), 1–8 (2017)
Cavazos-Rehg, P., et al.: A content analysis of depression-related Tweets. Comput. Hum. Behav. 54, 351–357 (2016)
Chomicki, J., Ciaccia, P., Meneghetti, N.: Skyline queries, front and back. SIGMOD 42(3), 6–18 (2013)
Damerau, F.J.: A technique for computer detection and correction of spelling errors. ACM 7(3), 171–176 (1964)
Hristidis, V., Koudas, N., Papakonstantinou, Y.: PREFER: a system for the efficient execution of multi-parametric ranked queries. SIGMOD Rec. 30(2), 259–270 (2001)
Keeney, R.L., Raiffa, H.: Decisions with Multiple Objectives: Preferences and Value Trade-Offs. Cambridge University Press, Cambridge (1993)
Kießling, W.: Foundations of preferences in database systems. In: Proceedings of VLDB 2002, Hong Kong SAR, China, pp. 311–322. VLDB Endowment (2002)
Linckels, S., Meinel, C.: Natural language processing. In: E-Librarian Service, pp. 61–79. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-17743-9_4
Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: the Penn Treebank. Comput. Linguist. 19(2), 313–330 (1993)
Norvig, P.: English Letter Frequency Counts: Mayzner Revisited or ETAOIN SRHLDCU (2013)
Pagolu, V.S., Reddy, K.N., Panda, G., Majhi, B.: Sentiment analysis of Twitter data for predicting stock market movements. In: International Conference on Signal Processing, Communication, Power and Embedded System (SCOPES), pp. 1345–1350, October 2016
Peterson, J.L.: A note on undetected typing errors. ACM 29(7), 633–637 (1986)
Porter, M.F.: An algorithm for suffix stripping. Program 40, 211–218 (1980)
Samir, A., Lahbib, Z.: Stemming and lemmatization for information retrieval systems in Amazigh language. In: Tabii, Y., Lazaar, M., Al Achhab, M., Enneya, N. (eds.) BDCA 2018. CCIS, vol. 872, pp. 222–233. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-96292-4_18
Subramaniyaswamy, V., Logesh, R., Abejith, M., Umasankar, S., Umamakeswari, A.: Sentiment analysis of tweets for estimating criticality and security of events. J. Organ. End User Comput. 29, 51–71 (2017)
Sutton, J., et al.: Lung cancer messages on Twitter: content analysis and evaluation. J. Am. Coll. Radiol. 15, 210–217 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Rudenko, L., Haas, C., Endres, M. (2020). Analyzing Twitter Data with Preferences. In: Darmont, J., Novikov, B., Wrembel, R. (eds) New Trends in Databases and Information Systems. ADBIS 2020. Communications in Computer and Information Science, vol 1259. Springer, Cham. https://doi.org/10.1007/978-3-030-54623-6_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-54623-6_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-54622-9
Online ISBN: 978-3-030-54623-6
eBook Packages: Computer ScienceComputer Science (R0)