SalienceGraph: Visualizing Salience Dynamics of Written Discourse by Using Reference Probability and PLSA
Since public involvement in the decision-making process for community development needs a lot of efforts and time, support tools for speeding up the consensus building process among stakeholders are required. This paper presents a new method for finding, tracking and visualizing participants’ concerns (topics) from the record of a public debate. For finding topics, we use the salience of a term, which is computed as its reference probability based on referential coherence in the Centering Theory. Our system first annotates a debate record or minute into Global Document Annotation (GDA) format automatically, and then computes the salience of each term from the GDA-annotated text sentence by sentence. Then, by using the Probalilistic Latent Semantic Analytsis (PLSA), our system reduces the dimensions of the vector of salience values of terms into a set of major latent topics. For tracking topics, we use the salience dynamics, which is computed as the temporal change of joint attention to the major latent topics with additional user-supplied terms. The resulting graph is called SalienceGraph. For visualizing SalienceGraph, we use 3D visualizer with GUI designed by “overview first, zoom and filter, then details on demand” principle. SalienceGraph provides more accurate trajectory of topics than conventional TF·IDF.
Keywordsdiscourse analysis visualization discourse salience PLSA
Unable to display preview. Download preview PDF.
- 1.Jeong, H., Hatori, T., Kobayashi, K.: Discourse analysis of public debates: A corpus-based approach. In: Proceedings of 2007 IEEE International Conference on Systems, Man and Cybernetics (SMC 2007), pp. 1782–1793 (2007)Google Scholar
- 2.Shneiderman, B.: Designing the User Interface: Strategies for Effective Human-Computer Interaction, 3rd edn. Pearson Addison Wesley (1998)Google Scholar
- 3.Hasida, K.: Global Document Annotation (GDA), http://i-content.org/GDA/
- 4.Kudo, T., Matsumoto, Y.: Japanese dependency analysis using cascaded chunking. In: Proceeding of the 6th conference on Natural language learning (CoNLL-2002, COLING 2002 Post-Conference Workshops), pp. 1–7 (2002)Google Scholar
- 5.Grosz, B., Joshi, A., Weinstein, S.: Centering: A Framework for Modeling the Local Coherence of Discourse. Computational Linguistics 21(2), 203–226 (1995)Google Scholar
- 8.Maekawa, K.: Corpus of Spontaneous Japanese: Its Design and Evaluation. In: Proceedings of the ISCA & SSPR 2003, pp. 7–12 (2003)Google Scholar
- 9.GSK (Gengo Shigen Kyokai): Linguistic resourse catalogue (in Japanese), http://www.gsk.or.jp/catalog.html
- 10.YodoRiver-Watershed-Committee: Minute of the Debate Session among Citizens and Committee Members (Nyu Dam) (2005) (in Japanese), http://www.yodoriver.org/kaigi/biwa/17.html#ikenkoukan
- 13.Mochihashi, D., Matsumoto, Y.: Context as filtering. In: Advances in Neural Information Processing Systems (NIPS 2005), vol. 18, pp. 907–914 (2006)Google Scholar
- 14.Kleinberg, J.: Bursty and hierarchical structure in streams. In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and DataMining, pp. 1–25 (2002)Google Scholar