Abstract
We describe a way to extract visitors’ experiences from Weblogs (blogs) and also a way to mine and visualize activities of visitors at sightseeing spots. A system using our proposed method mines association rules between locations, time periods, and types of experiences out of blog entries. Association rules between experiences are also extracted. We constructed a local information search system that enables the user to specify a location, a time period, or a type of experience in a search query and find relevant Web content. Results of experiments showed that three proposed refinement algorithms applied to a conventional text mining method raises the precision and recall of the extracted rules.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kumar, R., Novak, J., Raghavan, P., Tomkins, A.: On the bursty evolution of blogspace. In: Proceedings of the 12th International World Wide Web Conference, pp. 568–576 (2003)
Kumar, R., Novak, J., Raghavan, P., Tomkins, A.: Structure and evolution of blogspace. Communications of the ACM 47(12), 35–39 (2004)
Bar-Ilan, J.: An outsider’s view on ’topic-oriented’ blogging. In: Proceedings of the Alternate Papers Track of the 13th International World Wide Web Conference, pp. 28–34 (2004)
Nakajima, S., Tatemura, J., Hara, Y., Tanaka, K., Uemura, S.: Identifying Agitators as Important Blogger Based on Analyzing Blog Threads. In: Zhou, X., Li, J., Shen, H.T., Kitsuregawa, M., Zhang, Y. (eds.) APWeb 2006. LNCS, vol. 3841, pp. 285–296. Springer, Heidelberg (2006)
Fujimura, K., Inoue, T., Sugisaki, M.: The Eigen Algorithm for Ranking Blogs. In: Proceedings of the WWW 2005 2nd Annual Workshop on the Weblogging Ecosystem: Aggregation, Analysis and Dynamics (2005)
Okumura, M., Nanno, T., Fujiki, T., Suzuki, Y.: Text mining based on automatic collection and monitoring of Japanese weblogs. In: The 6th Web and Ontology Workshop, The Japanese Society for Artificial Intelligence (2004)
Fujiki, T., Nanno, T., Suzuki, Y., Okumura, M.: Identification of bursts in a document stream. In: First International Workshop on Knowledge Discovery in Data Streams (in conjunction with ECML/PKDD 2004), pp. 55–64 (2004)
Kleinberg, J.: Bursty and hierarchical structure in streams. In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 91–101 (2002)
Avesani, P., Cova, M., Hayes, C., Massa, P.: Proceedings of the WWW 2005 2nd Annual Workshop on the Weblogging Ecosystem: Aggregation, Analysis and Dynamics, Chiba, Japan (2005)
DC Metro Blogmap, http://www.reenhead.com/map/metroblogmap.html
nyc bloggers, http://www.nycbloggers.com/
Kurashima, T., Tezuka, T., Tanaka, K.: Blog Map of Experiences: Extracting and Geographically Mapping Visitor Experiences from Urban Blogs. In: Ngu, A.H.H., Kitsuregawa, M., Neuhold, E.J., Chung, J.-Y., Sheng, Q.Z. (eds.) WISE 2005. LNCS, vol. 3806, pp. 496–503. Springer, Heidelberg (2005)
Tezuka, T., Kurashima, T., Tanaka, K.: Toward Tighter Integration of Web Search with a Geographic Information System. In: Proceedings of the Fifteenth World Wide Web Conference (WWW 2006), Edinburgh, Scotland (2006)
The National Language Research Institute, Cases and Japanese Postpositions, The National Language Research Institute (1997)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th International Conference on Very Large Data Bases, pp. 487–499 (1994)
Bulkfeeds, http://bulkfeeds.net/
goo blog, http://blog.goo.ne.jp
Japanese Vocabulary System, http://www.ntt-tec.jp/technology/C404.html
Google Maps API, http://www.google.com/apis/maps/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kurashima, T., Tezuka, T., Tanaka, K. (2006). Mining and Visualizing Local Experiences from Blog Entries. In: Bressan, S., KĂĽng, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2006. Lecture Notes in Computer Science, vol 4080. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11827405_21
Download citation
DOI: https://doi.org/10.1007/11827405_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37871-6
Online ISBN: 978-3-540-37872-3
eBook Packages: Computer ScienceComputer Science (R0)