Towards High-Quality Semantic Entity Detection over Online Forums

  • Juan Du
  • Weiming Zhang
  • Peng Cai
  • Linling Ma
  • Weining Qian
  • Aoying Zhou
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6984)

Abstract

User-generated content (UGC) implies user behaviors. Mining such data helps in understanding the relationship between social media and the real world. However, UGC is usually of low quality, which makes semantic entity extraction difficult. In this paper, we propose a method for high-quality semantic entity refinement on forums that employs external resources. Experiments on real-life Chinese online forums show the effectiveness of our method.

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Juan Du
    • 1
  • Weiming Zhang
    • 1
  • Peng Cai
    • 1
  • Linling Ma
    • 1
  • Weining Qian
    • 1
  • Aoying Zhou
    • 1
  1. Institute of Massive Computing, Software Engineering Institute, East China Normal University, China
