Extracting Concerns and Reports on Crimes in Blogs

  • Yusuke Abe
  • Takehito Utsuro
  • Yasuhide Kawada
  • Tomohiro Fukuhara
  • Noriko Kando
  • Masaharu Yoshioka
  • Hiroshi Nakagawa
  • Yoji Kiyota
  • Masatoshi Tsuchiya
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6335)

Abstract

Among other domains and topics on which some issues are frequently argued in the blogosphere, the domain of crime is one of the most seriously discussed by various kinds of bloggers. Such information on crimes in blogs is especially valuable for those who are not familiar with tips for preventing being victimized. This paper proposes a framework of extracting people’s concerns and reports on crimes in their own blogs. In this framework, we solve two tasks. In the first task, we focus on experts in crime domain and address the issue of extracting concerns on crimes such as tips for preventing being victimized. In the evaluation of this first task, we show that we successfully rank expert bloggers high in the results of blog feed retrieval. In the second task, on the other hand, we focus on victims of criminal acts, and address the issue of extracting reports on being victimized. In the evaluation of this second task, we show that we outperform blog post ranking based on the search engine API by incorporating dependency relations for identifying victims’ blog posts.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Glance, N., Hurst, M., Tomokiyo, T.: Blogpulse: Automated trend discovery for Weblogs. In: WWW 2004 Workshop on the Weblogging Ecosystem: Aggregation, Analysis and Dynamics (2004)Google Scholar
  2. 2.
    Nanno, T., Fujiki, T., Suzuki, Y., Okumura, M.: Automatically collecting, monitoring, and mining Japanese weblogs. In: WWW Alt. 2004: Proc. 13th WWW Conf. Alternate Track Papers & Posters, pp. 320–321 (2004)Google Scholar
  3. 3.
    Macdonald, C., Ounis, I., Soboroff, I.: Overview of the TREC-2007 Blog Track. In: Proc. TREC-2007 (Notebook), pp. 31–43 (2007)Google Scholar
  4. 4.
    Yangarber, R., Best, C., von Etter, P., Fuart, F., Horby, D., Steinberger, R.: Combining Information about Epidemic Threats from Multiple Sources. In: Proc. Workshop: Multi-source, Multilingual Information Extraction and Summarization, pp. 41–48 (2007)Google Scholar
  5. 5.
    Pouliquen, B., Steinberger, R., Belyaeva, J.: Multilingual Multi-document Continuously-updated Social Networks. In: Proc. Workshop: Multi-source, Multilingual Information Extraction and Summarization, pp. 25–32 (2007)Google Scholar
  6. 6.
    Yoshioka, M.: IR Interface for Contrasting Multiple News Sites. In: Prof. 4th AIRS, pp. 516–521 (2008)Google Scholar
  7. 7.
    Bautin, M., Vijayarenu, L., Skiena, S.: International Sentiment Analysis for News and Blogs. In: Proc. ICWSM, pp.19–26 (2008)Google Scholar
  8. 8.
    De Saeger, S., Torisawa, K., Kazama, J.: Looking for Trouble. In: Proc. 22nd COLING, pp. 185–192 (2008)Google Scholar
  9. 9.
    Torisawa, K., De Saeger, S., Kakizawa, Y., Kazama, J., Murata, M., Noguchi, D., Sumida, A.: TORISHIKI-KAI, an Autogenerated Web Search Directory. In: Proc. 2nd ISUC, pp.179–186 (2008)Google Scholar
  10. 10.
    Kawaba, M., Nakasaki, H., Utsuro, T., Fukuhara, T.: Cross-Lingual Blog Analysis based on Multilingual Blog Distillation from Multilingual Wikipedia Entries. In: Proc. ICWSM, pp. 200–201 (2008)Google Scholar
  11. 11.
    Nakasaki, H., Kawaba, M., Yamazaki, S., Utsuro, T., Fukuhara, T.: Visualizing Cross-Lingual/Cross-Cultural Differences in Concerns in Multilingual Blogs. In: Proc. ICWSM, pp. 270–273 (2009)Google Scholar
  12. 12.
    Evans, D.K., Ku, L.W., Seki, Y., Chen, H.H., Kando, N.: Opinion Analysis across Languages: An Overview of and Observations from the NTCIR6 Opinion Analysis Pilot Task. In: Proc. 3rd Inter. Cross-Language Information Processing Workshop (CLIP 2007), pp. 456–463 (2007)Google Scholar
  13. 13.
    Wiebe, J., Wilson, T., Cardie, C.: Annotating Expressions of Opinions and Emotions in Language. Language Resources and Evaluation 39, 165–210 (2005)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Yusuke Abe
    • 1
  • Takehito Utsuro
    • 1
  • Yasuhide Kawada
    • 2
  • Tomohiro Fukuhara
    • 3
  • Noriko Kando
    • 4
  • Masaharu Yoshioka
    • 5
  • Hiroshi Nakagawa
    • 6
  • Yoji Kiyota
    • 6
  • Masatoshi Tsuchiya
    • 7
  1. 1.University of TsukubaTsukubaJapan
  2. 2.Navix Co., Ltd.TokyoJapan
  3. 3.National Institute of Advanced Industrial Science and TechnologyTokyoJapan
  4. 4.National Institute of InformaticsTokyoJapan
  5. 5.Hokkaido UniversitySapporoJapan
  6. 6.University of TokyoTokyoJapan
  7. 7.Toyohashi University of TechnologyToyohashiJapan

Personalised recommendations