Cross-Domain Sentiment Analysis Employing Different Feature Selection and Classification Techniques

Conference paper
Part of the Lecture Notes in Networks and Systems book series (LNNS, volume 10)

Abstract

The paramount work of information mustering has been to find out what is the opinion of the people. Sentiment analysis is errand discerning the polarity for the given content which is dichotomized into two categories—positive and negative. Sentiment analysis operates on colossal feature sets of unique terms using bag of words (BOW) slant, in which case discrete attributes do not give factual information. This necessitates the elimination of extraneous and inconsequential terms from the feature set. Another challenging fact is most of the times, the training data might not be of the particular domain for which the perusal of test data is needed. This miscellany of challenges is unfolded by probing feature selection (FS) methods in cross-domain sentiment analysis. The boon of cross-domain and Feature Selection methods lies in significantly less computational power and time for processing. The informative features chosen are employed for training the classifier and investigating their execution for classification in terms of accuracy. Experimentation of FS methods (IG, GR, CHI, SAE) was performed on standard dataset viz. Amazon product review dataset and TripAdvisor dataset with NB, SVM, DT, and KNN classifiers. The paper works on different techniques by which cross-domain analysis vanquishes, despite the lower accuracy due to difference in domains, as better algorithmic efficient method.

Keywords

Feature selection Cross domain Machine learning Sentiment classification 

References

  1. 1.
    R. Feldman, Techniques and applications for sentiment analysis, Communications of the ACM 56 (4) (2013) 82–89.Google Scholar
  2. 2.
    Y. Dang, Y. Zhang, H. Chen, A lexicon-enhanced method for sentiment classification: An experiment on online product reviews, Intelligent Systems, IEEE 25 (4) (2010) 46–53.CrossRefGoogle Scholar
  3. 3.
    R. Xia, C. Zong, S. Li, Ensemble of feature sets and classification algorithms for sentiment classification, Information Sciences 181 (6) (2011) 1138–1152.CrossRefGoogle Scholar
  4. 4.
    B. Liu, Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, Springer, 2007Google Scholar
  5. 5.
    E. Boiy et al., Automatic Sentiment Analysis in On-line Text, ELPUB2007Google Scholar
  6. 6.
    F. Bisio, P. Gastaldo, C. Peretti, R. Zunino, E. Cambria, Data intensive review mining for sentiment classification across heterogeneous domains, in: Advances in Social Networks Analysis and Mining (ASONAM), 2013 IEEE/ACM International Conference on, IEEE, 2013, pp. 1061–1067.Google Scholar
  7. 7.
    D. Bollegala, D. Weir, J. Carroll, Cross-domain sentiment classification using a sentiment sensitive thesaurus, Knowledge and Data Engineering, IEEE Transactions on 25 (8) (2013) 1719–1731.CrossRefGoogle Scholar
  8. 8.
    P. Jambhulkar, S. Nirkhi, A survey paper on cross-domain sentiment analysis, International Journal of Advanced Research in Computer and Communication Engineering 3 (1) (2014) 5241–5245.Google Scholar
  9. 9.
    H. Wang, Y. Lu, C. Zhai, Latent aspect rating analysis on review text data: a rating regression approach, in: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, 2010, pp. 783–792.Google Scholar
  10. 10.
    A. Sharma, S. Dey, A comparative study of feature selection and machine learning techniques for sentiment analysis, Proceedings of the 2012 ACM Research in Applied Computation Symposium, 2012, pp. 1–7Google Scholar
  11. 11.
    S. Tan, J. Zhang, An empirical study of sentiment analysis for Chinese documents, Expert Systems with Applications 34 (4) (2008) 2622–2629.Google Scholar
  12. 12.
    M. Annett, G. Kondrak, A comparison of sentiment analysis techniques: Polarizing movie blogs, in: Advances in artificial intelligence, Springer, 2008, pp. 25–35.Google Scholar
  13. 13.
    B. Pang, L. Lee, S. Vaithyanathan, Thumbs up?: sentiment classification using machine learning techniques, in: Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10, Association for Computational Linguistics, 2002, pp. 79–86.Google Scholar
  14. 14.
    E. Cambria, A. Hussain, Sentic computing: Techniques, tools, and applications, Vol. 2, Springer Science & Business Media, 2012.Google Scholar
  15. 15.
    B. Pang, L. Lee, Opinion mining and sentiment analysis, Foundations and trends in information retrieval 2 (1-2) (2008) 1–135.CrossRefGoogle Scholar
  16. 16.
    G. A. Miller, Wordnet: a lexical database for english, Communications of the ACM 38 (11) (1995) 39–41.CrossRefGoogle Scholar
  17. 17.
    P. Turney, Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews, Proceedings of the 40th annual meeting of the association for computational linguistics (ACL), 2002, pp. 417–424Google Scholar
  18. 18.
    J. Han, M. Kamber, J. Pei, Data mining, southeast Asia edition: Concepts and techniques, Morgan kaufmann, 2006.Google Scholar
  19. 19.
    D. I. Morariu, R. G. Creulescu, M. Breazu, Feature selection in document classification.Google Scholar
  20. 20.
    A. Ahmad, L. Dey, A feature selection technique for classificatory analysis, Pattern Recognition Letters 26 (1) (2005) 43–56.CrossRefGoogle Scholar
  21. 21.
    B. Liu, Sentiment analysis: A multi-faceted problem, IEEE Intelligent Systems 25 (3) (2010) 76–80.Google Scholar
  22. 22.
    Wang  S., Li D., Song X., Wei Y., Li H.: A feature selection method based on improved fishers discriminant ratio for text sentiment classification. Expert Systems with Applications 38.7, 8696-8702 (2011)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  1. 1.Department of Computer Science and EngineeringNational Institute of Technology, TiruchirappalliTiruchirappalliIndia

Personalised recommendations