Skip to main content

Text Mining for Wellbeing: Selecting Stories Using Semantic and Pragmatic Features

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7553))

Abstract

In this article, we explore an application in an area of research called wellbeing informatics. More specifically, we consider how to build a system that could be used for searching stories that relate to the interest of the user (content relevance), and help the user in his or her developmental process by providing encouragement, useful experiences, or otherwise supportive content (emotive relevance). The first objective is covered through topic modeling applying independent component analysis and the second by using sentiment analysis. We also use style analysis to exclude stories that are inappropriate in style. We discuss linguistic theories and methodological aspects of this area, outline a hybrid methodology that can be used in selecting stories that match both the content and emotive criteria, and present the results of experiments that have been used to validate the approach.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agarwal, A., Bhattacharyya, P.: Sentiment analysis: A new approach for effective use of linguistic knowledge and exploiting similarities in a set of documents to be classified. In: Proc. of the Int. Conf. on NLP (2005)

    Google Scholar 

  2. Bingham, E., Kuusisto, J., Lagus, K.: ICA and SOM in text document analysis. In: Proceedings of the 25th ACM SIGIR Conference, pp. 361–362. ACM, New York (2002)

    Google Scholar 

  3. Bleys, J., Loetzsch, M., Spranger, M., Steels, L.: The grounded color naming game. In: Proceedings of the 18th IEEE International Symposium on Robot and Human Interactive Communication (2009)

    Google Scholar 

  4. Comon, P.: Independent component analysis—a new concept? Signal Processing 36, 287–314 (1994)

    Article  MATH  Google Scholar 

  5. Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. Journal of the American Society of Information Science 41, 391–407 (1990)

    Article  Google Scholar 

  6. Devitt, A., Ahmad, K.: Sentiment analysis in financial news: A cohesion-based approach. In: Proceedings of the Association for Computational Linguistics (ACL), pp. 984–991 (2007)

    Google Scholar 

  7. Givón, T.: Mind, code, and context: essays in pragmatics. Lawrence Erlbaum Associates (1989)

    Google Scholar 

  8. Honkela, T., Hyvärinen, A., Väyrynen, J.: WordICA - Emergence of linguistic representations for words by independent component analysis. Natural Language Engineering 16(3), 277–308 (2010)

    Article  Google Scholar 

  9. Hurst, M., Nigam, K.: Retrieving topical sentiments from online document collections. In: Document Recognition and Retrieval XI, pp. 27–34 (2004)

    Google Scholar 

  10. Hyvärinen, A., Karhunen, J., Oja, E.: Independent component analysis, vol. 26. Wiley (2001)

    Google Scholar 

  11. Jutten, C., Hérault, J.: Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture. Signal Processing 24, 1–10 (1991)

    Article  MATH  Google Scholar 

  12. Karlgren, J.: Textual stylistic variation: Choices, genres and individuals. In: Structure of Style, pp. 129–142. Springer (2010)

    Google Scholar 

  13. Munezero, M., Kakkonen, T., Montero, C.: Towards automatic detection of antisocial behavior from texts. In: Proceedings of the Workshop on Sentiment Analysis where AI meets Psychology (SAAIP 2011), pp. 20–27 (November 2011)

    Google Scholar 

  14. Ritter, H., Kohonen, T.: Self-organizing semantic maps. Biological Cybernetics 61(4), 241–254 (1989)

    Article  Google Scholar 

  15. Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment in short strength detection informal text. Journal of the American Society for Information Science and Technology 61(12), 2544–2558 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Honkela, T., Izzatdust, Z., Lagus, K. (2012). Text Mining for Wellbeing: Selecting Stories Using Semantic and Pragmatic Features. In: Villa, A.E.P., Duch, W., Érdi, P., Masulli, F., Palm, G. (eds) Artificial Neural Networks and Machine Learning – ICANN 2012. ICANN 2012. Lecture Notes in Computer Science, vol 7553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33266-1_58

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33266-1_58

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33265-4

  • Online ISBN: 978-3-642-33266-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics