Skip to main content

Analyzing Twitter Data with Preferences

  • Conference paper
  • First Online:
New Trends in Databases and Information Systems (ADBIS 2020)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1259))

Included in the following conference series:

Abstract

Today Twitter is one of the most important sources for information distribution. But finding useful and interesting tweets on a specific topic is a non-trivial task, because there are thousands of new posts every minute. In this paper, we describe our preference-based search approach on Twitter messages, which allows users to get the best possible results. For this, we introduce a new CONTAINS preference constructor to search on full-text data, use NLP techniques to handle natural language mistakes, and present experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Tweets per second: internetlivestats.com/one-second/.

  2. 2.

    Twitter Developer Website: www.developer.twitter.com.

  3. 3.

    Statista:de.statista.com/statistik/daten/studie/541918/umfrage/anteil-der-mobilen-monatlich-aktive-nutzer-von-twitter-weltweit/.

References

  1. Ayers, J.W., et al.: Why do people use electronic nicotine delivery systems (electronic cigarettes)? A content analysis of Twitter, 2012–2015. PLoS ONE 12(3), 1–8 (2017)

    Article  Google Scholar 

  2. Cavazos-Rehg, P., et al.: A content analysis of depression-related Tweets. Comput. Hum. Behav. 54, 351–357 (2016)

    Article  Google Scholar 

  3. Chomicki, J., Ciaccia, P., Meneghetti, N.: Skyline queries, front and back. SIGMOD 42(3), 6–18 (2013)

    Article  Google Scholar 

  4. Damerau, F.J.: A technique for computer detection and correction of spelling errors. ACM 7(3), 171–176 (1964)

    Article  Google Scholar 

  5. Hristidis, V., Koudas, N., Papakonstantinou, Y.: PREFER: a system for the efficient execution of multi-parametric ranked queries. SIGMOD Rec. 30(2), 259–270 (2001)

    Article  Google Scholar 

  6. Keeney, R.L., Raiffa, H.: Decisions with Multiple Objectives: Preferences and Value Trade-Offs. Cambridge University Press, Cambridge (1993)

    Book  Google Scholar 

  7. Kießling, W.: Foundations of preferences in database systems. In: Proceedings of VLDB 2002, Hong Kong SAR, China, pp. 311–322. VLDB Endowment (2002)

    Google Scholar 

  8. Linckels, S., Meinel, C.: Natural language processing. In: E-Librarian Service, pp. 61–79. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-17743-9_4

  9. Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: the Penn Treebank. Comput. Linguist. 19(2), 313–330 (1993)

    Google Scholar 

  10. Norvig, P.: English Letter Frequency Counts: Mayzner Revisited or ETAOIN SRHLDCU (2013)

    Google Scholar 

  11. Pagolu, V.S., Reddy, K.N., Panda, G., Majhi, B.: Sentiment analysis of Twitter data for predicting stock market movements. In: International Conference on Signal Processing, Communication, Power and Embedded System (SCOPES), pp. 1345–1350, October 2016

    Google Scholar 

  12. Peterson, J.L.: A note on undetected typing errors. ACM 29(7), 633–637 (1986)

    Article  Google Scholar 

  13. Porter, M.F.: An algorithm for suffix stripping. Program 40, 211–218 (1980)

    Article  Google Scholar 

  14. Samir, A., Lahbib, Z.: Stemming and lemmatization for information retrieval systems in Amazigh language. In: Tabii, Y., Lazaar, M., Al Achhab, M., Enneya, N. (eds.) BDCA 2018. CCIS, vol. 872, pp. 222–233. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-96292-4_18

    Chapter  Google Scholar 

  15. Subramaniyaswamy, V., Logesh, R., Abejith, M., Umasankar, S., Umamakeswari, A.: Sentiment analysis of tweets for estimating criticality and security of events. J. Organ. End User Comput. 29, 51–71 (2017)

    Article  Google Scholar 

  16. Sutton, J., et al.: Lung cancer messages on Twitter: content analysis and evaluation. J. Am. Coll. Radiol. 15, 210–217 (2017)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lena Rudenko .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rudenko, L., Haas, C., Endres, M. (2020). Analyzing Twitter Data with Preferences. In: Darmont, J., Novikov, B., Wrembel, R. (eds) New Trends in Databases and Information Systems. ADBIS 2020. Communications in Computer and Information Science, vol 1259. Springer, Cham. https://doi.org/10.1007/978-3-030-54623-6_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-54623-6_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-54622-9

  • Online ISBN: 978-3-030-54623-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics