Analyzing Twitter Data with Preferences

Rudenko, Lena; Haas, Christian; Endres, Markus

doi:10.1007/978-3-030-54623-6_16

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1259)

Included in the following conference series:

European Conference on Advances in Databases and Information Systems

Abstract

Today, Twitter is one of the most important channels for distributing information. However, finding useful and interesting tweets on a specific topic is a non-trivial task because thousands of new posts appear every minute. In this paper, we describe our preference-based search approach for Twitter messages, which allows users to get the best possible results. To this end, we introduce a new CONTAINS preference constructor for searching full-text data, use NLP techniques to handle natural-language mistakes, and present experiments.

Author information

Authors and Affiliations

University of Augsburg, Universitätsstr. 6a, 86159, Augsburg, Germany
Lena Rudenko & Christian Haas
University of Passau, Innstr. 43, 94032, Passau, Germany
Markus Endres

Corresponding author

Correspondence to Lena Rudenko.

Editor information

Editors and Affiliations

Université Lumière Lyon 2, Lyon, France
Jérôme Darmont
National Research University Higher School of Economics, St. Petersburg, Russia
Boris Novikov
Poznań University of Technology, Poznań, Poland
Robert Wrembel

About this paper

Cite this paper

Rudenko, L., Haas, C., Endres, M. (2020). Analyzing Twitter Data with Preferences. In: Darmont, J., Novikov, B., Wrembel, R. (eds) New Trends in Databases and Information Systems. ADBIS 2020. Communications in Computer and Information Science, vol 1259. Springer, Cham. https://doi.org/10.1007/978-3-030-54623-6_16

DOI: https://doi.org/10.1007/978-3-030-54623-6_16
Published: 17 August 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-54622-9
Online ISBN: 978-3-030-54623-6
eBook Packages: Computer Science, Computer Science (R0)
