Automatic Extraction for Product Feature Words from Comments on the Web

  • Zhichao Li
  • Min Zhang
  • Shaoping Ma
  • Bo Zhou
  • Yu Sun
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5839)


Before deciding to buy a product, many people tend to consult others’ opinions on it. Web provides a perfect platform which one can get information to find out the advantages and disadvantages of the product of his interest. How to automatically manage the numerous opinionated documents and then to give suggestions to the potential customers is becoming a research hotspot recently. Constructing a sentiment resource is one of the vital elements of opinion finding and polarity analysis tasks. For a specific domain, the sentiment resource can be regarded as a dictionary, which contains a list of product feature words and several opinion words with sentiment polarity for each feature word. This paper proposes an automatic algorithm to extraction feature words and opinion words for the sentiment resource. We mine the feature words and opinion words from the comments on the Web with both NLP technique and statistical method. Left context entropy is proposed to extract unknown feature words; Adjective rules and background corpus are taken into consideration in the algorithm. Experimental results show the effectiveness of the proposed automatic sentiment resource construction approach. The proposed method that combines NLP and statistical techniques is better than using only NLP-based technique. Although the experiment is built on mobile telephone comments in Chinese, the algorithm is domain independent.


Resource constructing product feature opinion word 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
  2. 2.
    Ye, Q., Shi, W., Li, Y.: Sentiment Classification for Movie Reviews in Chinese by Improved Semantic Oriented Approach. In: Proceedings of the 39th Annual Hawaii international Conference on System Sciences, HICSS, January 04 - 07, vol. 03, p. 53.2. IEEE Computer Society, Washington (2006)Google Scholar
  3. 3.
    Li, J., Sun, M.: Experimental Study on Sentiment Classification of Chinese Review using Machine Learning Techniques. In: Proceedings of IEEE International Conference on Natural Language Processing and Knowledge Engineering 2007, pp. 393–400 (2007)Google Scholar
  4. 4.
    Hu, M., Liu, B.: Mining Opinion Features in Customer Reviews. In: Proceedings of Nineteenth National Conference on Artificial Intelligence, San Jose, California, USA, July 2-29, pp. 755–760. AAAI Press, Menlo Park (2004)Google Scholar
  5. 5.
    Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD international Conference on Knowledge Discovery and Data Mining, KDD 2004, Seattle, WA, USA, August 22 - 25, pp. 168–177. ACM, New York (2004)CrossRefGoogle Scholar
  6. 6.
    Liu, B., Hu, M., Cheng, J.: Opinion observer: analyzing and comparing opinions on the Web. In: Proceedings of the 14th international Conference on World Wide Web, WWW 2005, Chiba, Japan, May 10-14, pp. 342–351. ACM, New York (2005)Google Scholar
  7. 7.
    Popescu, A., Etzioni, O.: Extracting product features and opinions from reviews. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, Human Language Technology Conference. Association for Computational Linguistics, Vancouver, British Columbia, Canada, October 06-08, pp. 339–346. Morristown, NJ (2005)Google Scholar
  8. 8.
    Yi, J., Niblack, W.: Sentiment Mining in WebFountain. In: Proceedings of the 21st international Conference on Data Engineering (Icde 2005), ICDE, April 05-08, vol. 00, pp. 1073–1083. IEEE Computer Society, Washington (2005)Google Scholar
  9. 9.
    Scaffidi, C., Bierhoff, K., Chang, E., Felker, M., Ng, H., Jin, C.: Red Opal: product-feature scoring from reviews. In: Proceedings of the 8th ACM Conference on Electronic Commerce, EC 2007, San Diego, California, USA, June 11-15, pp. 182–191. ACM, New York (2007)Google Scholar
  10. 10.
    Wang, B., Wang, H.: Bootstrapping both Product Properties and Opinion Words from Chinese Reviews with Cross-Training. In: Proceedings of the IEEE/WIC/ACM international Conference on Web intelligence, Web Intelligence, November 02 - 05, pp. 259–262. IEEE Computer Society, Washington (2007)Google Scholar
  11. 11.
    Su, Q., Xu, X., Guo, H., Guo, Z., Wu, X., Zhang, X., Swen, B., Su, Z.: Hidden sentiment association in chinese Web opinion mining. In: Proceeding of the 17th international Conference on World Wide Web, WWW 2008, Beijing, China, April 21 - 25, pp. 959–968. ACM, New York (2008)Google Scholar
  12. 12.
  13. 13.
    Sogou Lab Internet Vocabulary,
  14. 14.
    Luo, Z., Song, R.: An Integrated Method for Chinese Unknown Word Extraction. In: Proceedings of 3rd ACL SIGHAN Workshop on Chinese Language Processing, Barcelona, Spain, pp. 148–155 (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Zhichao Li
    • 1
  • Min Zhang
    • 1
  • Shaoping Ma
    • 1
  • Bo Zhou
    • 1
  • Yu Sun
    • 1
  1. 1.State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and TechnologyTsinghua UniversityBeijingP.R. China

Personalised recommendations