Abstract
In this article, we explore an application in an area of research called wellbeing informatics. More specifically, we consider how to build a system that could be used for searching stories that relate to the interest of the user (content relevance), and help the user in his or her developmental process by providing encouragement, useful experiences, or otherwise supportive content (emotive relevance). The first objective is covered through topic modeling applying independent component analysis and the second by using sentiment analysis. We also use style analysis to exclude stories that are inappropriate in style. We discuss linguistic theories and methodological aspects of this area, outline a hybrid methodology that can be used in selecting stories that match both the content and emotive criteria, and present the results of experiments that have been used to validate the approach.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Agarwal, A., Bhattacharyya, P.: Sentiment analysis: A new approach for effective use of linguistic knowledge and exploiting similarities in a set of documents to be classified. In: Proc. of the Int. Conf. on NLP (2005)
Bingham, E., Kuusisto, J., Lagus, K.: ICA and SOM in text document analysis. In: Proceedings of the 25th ACM SIGIR Conference, pp. 361–362. ACM, New York (2002)
Bleys, J., Loetzsch, M., Spranger, M., Steels, L.: The grounded color naming game. In: Proceedings of the 18th IEEE International Symposium on Robot and Human Interactive Communication (2009)
Comon, P.: Independent component analysis—a new concept? Signal Processing 36, 287–314 (1994)
Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. Journal of the American Society of Information Science 41, 391–407 (1990)
Devitt, A., Ahmad, K.: Sentiment analysis in financial news: A cohesion-based approach. In: Proceedings of the Association for Computational Linguistics (ACL), pp. 984–991 (2007)
Givón, T.: Mind, code, and context: essays in pragmatics. Lawrence Erlbaum Associates (1989)
Honkela, T., Hyvärinen, A., Väyrynen, J.: WordICA - Emergence of linguistic representations for words by independent component analysis. Natural Language Engineering 16(3), 277–308 (2010)
Hurst, M., Nigam, K.: Retrieving topical sentiments from online document collections. In: Document Recognition and Retrieval XI, pp. 27–34 (2004)
Hyvärinen, A., Karhunen, J., Oja, E.: Independent component analysis, vol. 26. Wiley (2001)
Jutten, C., Hérault, J.: Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture. Signal Processing 24, 1–10 (1991)
Karlgren, J.: Textual stylistic variation: Choices, genres and individuals. In: Structure of Style, pp. 129–142. Springer (2010)
Munezero, M., Kakkonen, T., Montero, C.: Towards automatic detection of antisocial behavior from texts. In: Proceedings of the Workshop on Sentiment Analysis where AI meets Psychology (SAAIP 2011), pp. 20–27 (November 2011)
Ritter, H., Kohonen, T.: Self-organizing semantic maps. Biological Cybernetics 61(4), 241–254 (1989)
Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment in short strength detection informal text. Journal of the American Society for Information Science and Technology 61(12), 2544–2558 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Honkela, T., Izzatdust, Z., Lagus, K. (2012). Text Mining for Wellbeing: Selecting Stories Using Semantic and Pragmatic Features. In: Villa, A.E.P., Duch, W., Érdi, P., Masulli, F., Palm, G. (eds) Artificial Neural Networks and Machine Learning – ICANN 2012. ICANN 2012. Lecture Notes in Computer Science, vol 7553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33266-1_58
Download citation
DOI: https://doi.org/10.1007/978-3-642-33266-1_58
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33265-4
Online ISBN: 978-3-642-33266-1
eBook Packages: Computer ScienceComputer Science (R0)