Skip to main content
Log in

Semantic-Based Unsupervised Hybrid Technique for Opinion Targets Extraction from Unstructured Reviews

  • Research Article - Computer Engineering and Computer Science
  • Published:
Arabian Journal for Science and Engineering Aims and scope Submit manuscript

Abstract

Opinion target identification is an important task of opinion mining problem. Several approaches have been employed for this task that can be broadly divided into two major categories: supervised and unsupervised. The supervised approaches require training data which need manual work and are mostly domain dependent. Unsupervised technique is most popularly used due to its two main advantages: domain independent and no need of training data. This paper presents optimization of unsupervised likelihood ratio test (LRT) technique for opinion targets identification from unstructured reviews. The LRT approach basically depends on frequently observed features and hence do not predict infrequent features (IFF). This paper exploits WordNet lexical resource to identify IFF based on semantic relatedness to the frequent features in review documents which improves the overall accuracy of features extraction algorithms. Empirical results show the validity of the proposed technique.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Al-Kabi M., et al.: Examining Web search trends across Arab countries. Arab. J. Sci. Eng. 37, 1585–1593 (2012)

    Article  Google Scholar 

  2. Pang, B.; Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2, 135 (2008)

    Google Scholar 

  3. Liu, B.: Sentiment Analysis and Subjectivity. Handbook of Natural Language Processing (2nd edn.), pp. 627–666 (2010)

  4. Feldman, R., et al.: Extracting product comparisons from discussion boards. Presented at the the 2007 Seventh IEEE International Conference on Data Mining (2007)

  5. Weiss, S.M., et al.: Text mining (2005)

  6. Bloom, K., et al.: Extracting appraisal expressions. In: Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics, pp. 308–315. Rochester, New York (2007)

  7. Ben-David, S., et al.: Analysis of representations for domain adaptation. Adv. Neural Inf. Process. Syst. 19 (2007)

  8. Lu, Y.; Zhai, C.: Opinion integration through semi-supervised topic modeling. In: 17th International World Wide Web Conference (WWW ’08), pp. 121–130. Beijing, China (2008)

  9. Kessler, J.S., et al.: The 2010 icwsm jdpa sentment corpus for the automotive domain. In: 4th International AAAI Conference on Weblogs and Social Media Data Workshop Challenge (ICWSMDWC2010), Washington, DC (2010)

  10. Lin, C.J.; Chao, P.H.: Tourism related opinion detection and tourist attraction target identification. Int. J. Comput. Linguist. Chin. Lang. Process. 15 (2010)

  11. Zhai, Z., et al.: Clustering product features for opinion mining. Presented at the The fourth ACM international conference on Web search and data mining, Hong Kong (2011)

  12. Goujon, B.: Text mining for opinion target detection. In: European Intelligence and Security Informatics Conference (EISIC), pp. 332–326 (2011)

  13. Hu, M.; Liu, B.: Mining and summarizing customer reviews. Presented at the 10th ACM SIGKDD international conference on Knowledge discovery and data mining, Seattle (2004)

  14. Agrawal, R.; Srikant, R.: Fast algorithms for mining association rules in large databases. Presented at the 20th International Conference on Very Large Data Bases (1994)

  15. Wei, C.-P., et al.: Understanding what concerns consumers: a semantic approach to product features extraction from consumer reviews. Inf. Syst. E-bus Manag. pp. 149–167 (2010)

  16. Dunning, T.:Accurate methods for the statistics of surprise and coincidence. Comput. Linguist. 19, pp. 61–74, 1993

    Google Scholar 

  17. Yi, J., et al.: Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques. In: Third IEEE International Conference on Data Mining (ICDM), pp. 427–434 (2003)

  18. Ferreira, L., et al.: A comparative study of feature extraction algorithms in customer reviews. In: 2008 IEEE International Conference on Semantic Computing, pp. 144–151 (2008)

  19. Stark, M.M.; Riesenfeld, R.F.: WordNet: an electronic lexical database. Presented at the 11th Eurographics Workshop on Rendering (1998)

  20. Kamps, J., et al.: Using WordNet to measure semantic orientation of adjectives. In: 4th International Conference on Language Resources and Evaluation, pp. 1115–1118 (2004)

  21. Resnik, P.: Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. J. Artif. Intell. Res. 11, 95–130 (1999)

    Google Scholar 

  22. Xu, X., et al.: Aspect-based extractive summarization of online reviews. Presented at the 2011 ACM Symposium on Applied Computing, TaiChung (2011)

  23. Ganesan, K.; Zhai, C.X.; Han, J.: Opinosis: a graph based approach to abstractive summarization of highly redundant opinions. In: 23rd International Conference on Computational Linguistics (COLING 2010), Beijing (2010)

  24. Lopez-Fernandez, A., et al.: Feature extraction from product reviews using feature similarity and polarity (2009)

  25. Toutanova, K.; Manning, C.D.: Enriching the knowledge sources used in a maximum entropy part-of-speech tagger. In: The Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP/VLC-2000), pp. 63–70 (2000)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Khairullah Khan.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Khan, K., Baharudin, B.B. & Khan, A. Semantic-Based Unsupervised Hybrid Technique for Opinion Targets Extraction from Unstructured Reviews. Arab J Sci Eng 39, 3681–3689 (2014). https://doi.org/10.1007/s13369-014-0990-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13369-014-0990-1

Keywords

Navigation