Discrimination of Personal Opinions from Commercial Speech Based on Subjectivity of Text

  • Yoshio Seino
  • Takahiro HayashiEmail author
Conference paper
Part of the Lecture Notes on Multidisciplinary Industrial Engineering book series (LNMUINEN)


This paper presents a method for discriminating personal opinions from commercial speech. Today many personal opinions such as complaints about a particular product can be found on the Web. Mining useful information from these opinions is important for a wide range of applications. Personal opinions on the web, however, are often contaminated with commercial speech. Commercial speech is generated by companies and individuals for the intent of making a profit. The data cleaning process that discriminates personal opinions from commercial speech is important for obtaining useful results in opinion mining. As a data cleaning method, we propose a language-independent method for discriminating personal opinions from commercial speech based on subjectivity of text. Assuming that subjective words frequently occur in personal opinions rather than commercial speech, we define the subjectivity score of each word. Estimating the total subjectivity score of a given text, the proposed method identifies whether the given text expresses personal opinions or commercial comments. From experiments using texts datasets written in different languages, we have found over 90% of personal opinions can be correctly discriminated from commercial speech by the proposed method.


Personal opinions Commercial speech Data mining Data cleaning Information filtering 


  1. 1.
    Hayashi T, Abe K, Onai R (2008) Retrieval of personal web documents by extracting subjective expressions. In: International conference on advanced information networking and applications - workshops, pp 1187–1192Google Scholar
  2. 2.
    Hayashi T, Abe K, Roy D, Onai R (2009) Discrimination of personal web pages by extracting subjective expressions. Int J Bus Intell Data Min 4(1):62–77CrossRefGoogle Scholar
  3. 3.
    Khan FH, Bashir S, Qamar U (2014) Tom: Twitter opinion mining framework using hybrid classification scheme. Decis Support Syst 57(3):245–257CrossRefGoogle Scholar
  4. 4.
    Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. In: International joint conference on artificial intelligence, pp 1137–1143Google Scholar
  5. 5.
    Rao S (2016) A survey on sentiment analysis and opinion mining. In: International conference on advances in information communication technology & computing, p 53Google Scholar
  6. 6.
    Ravi K, Ravi V (2015) A survey of opinion mining and sentiment analysis. J Knowl Based Syst 89(C):14–46CrossRefGoogle Scholar
  7. 7.
    Selvam B, Abirami S (2013) A survey on opinion mining framework. Int J Adv Res Comput Commun Eng 3(9):3544–3549Google Scholar
  8. 8.
    Vinodhini G, Chandrasekaran RM (2012) Sentiment analysis and opinion mining: a survey. Int J Adv Res Comput Sci Softw Eng 2:282–292Google Scholar
  9. 9.
    You Q (2016) Sentiment and emotion analysis for social multimedia: Methodologies and applications. In: ACM on multimedia conference, pp 1445–1449Google Scholar

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  1. 1.Graduate School of Science and TechnologyNiigata UniversityNiigata-shiJapan
  2. 2.Institute of Science and TechnologyNiigata UniversityNiigata-shiJapan

Personalised recommendations