Extracting Social Networks Enriched by Using Text
Forums on the Internet are an overwhelming source of knowledge considering the number of topics treated and users who participate in these discussions. This volume of data is difficult to comprehend for a person with respect for the large number of posts. Our work proposes a new formal framework for synthesizing information contained in these forums. We extract a social network that reflects reality by extracting multiple relationships between individuals (structural relationship, name and text quotation relationships). These relationships are created from the structure and the content of the discussion. Results show that discovering quotation relationships from forums is not trivial.
KeywordsSocial Network Structural Relationship Quotation Mark Egocentric Network Implicit Relationship
Unable to display preview. Download preview PDF.
- 1.Culotta, A., McCallum, A., Bekkerman, R.: Extracting social networks and contact information from email and the web (2005)Google Scholar
- 2.Fisher, D., Smith, M., Welser, H.: You are who you talk to: Detecting roles in usenet newsgroups. In: Proceedings of the 39th Annual Hawaii International Conference on System Sciences, HICSS 2006, pp. 59b–59b (2006)Google Scholar
- 3.Jin, Y., Matsuo, Y., Ishizuka, M.: Extracting social networks among various entities on the web. The Semantic Web: Research and Applications, 251–266 (2007)Google Scholar
- 4.Jin, Y., Matsuo, Y., Ishizuka, M.: Ranking Entities on the Web using Social Network Mining and Ranking Learning. Structure 1, 3 (2008)Google Scholar
- 6.Kelly, J., Fisher, D., Smith, M.: Friends, foes, and fringe: norms and structure in political discussion networks. In: Proceedings of the 2006 International Conference on Digital Government Research, pp. 21–24 (May 2006)Google Scholar
- 9.Schmid, H.: Probabilistic part-of-speech tagging using decision trees. In: International Conference of New Methods in Language Processing (1994)Google Scholar
- 10.Soukoreff, R., MacKenzie, I.: Measuring errors in text entry tasks: an application of the Levenshtein string distance statistic. In: CHI 2001 Extended Abstracts on Human Factors in Computing Systems, pp. 319–320. ACM, New York (2001)Google Scholar
- 11.Stavrianou, A.: Modeling and Mining of Web Discussions. Ph.D. thesis, Universit Lumire Lyon 2 (2010)Google Scholar
- 12.Welser, H., Gleave, E., Fisher, D., Smith, M.: Visualizing the signatures of social roles in online discussion groups. Journal of Social Structure 8(2) (2007)Google Scholar