Abstract
While data mining is well established in practice, opinion mining is still in its infancy, with issues in particular around the development of methodologies which effectively extract accurate, reliable, influential and useful information from the raw opinion data collected from informal product reviews. Current approaches adopt a single-variable approach, focusing on individual metrics—word length, the presence of keywords, or the overall semantic orientation of terms within the data—while neglecting to evaluate whether these individual artifacts are indicative of the tone of a given review. This approach has significant limitations when we move from trying to merely evaluate whether an online opinion is positive or negative, to trying to evaluate how likely it is that the opinion will influence others. Given this issue, one promising avenue would be to evaluate the general analysis approaches utilized by opinion mining algorithms and identified in the literature in terms of how accurately they reflect how people actually interpret and are influenced by electronic online reviews. Through interviewing and a follow up survey of 136 participants, the validity of the approach in terms of ascertaining the tone of a piece of text can be evaluated, as well as the identification of measurable factors within text which make a given opinionated text more or less influential in an online context, further facilitating the development of more effective multivariate opinion mining approaches. Furthermore, the identification of factors which make an online opinion text more or less persuasive helps to facilitate the development of opinion mining approaches which can evaluate how likely a review is to affect an individual’s decision making.
Similar content being viewed by others
References
Anderson, C. (2004). The long tail. Wired 12.10. Retrieved May 31, 2009 from http://www.wired.com/wired/archive/12.10/tail.html?pg=1&topic=tail.
Bickart, B., & Shindler, R. M. (2001). Internet forums as influential sources of information. Journal of Interactive Marketing, 15(3), 31–40.
Bone, P. F. (1995). Word of mouth effects on short-term and long-term product judgements. Journal of Business Research, 32(3), 213–223.
Chatterjee, P. (2001). Online reviews: do consumers use them? Advances in Consumer Research, 28(1), 129–133.
Chevalier, J. A., & Mayzlin, D. (2006). The effect of word of mouth on sales: online book reviews. Journal of Marketing Research, 43(3), 345–354.
Clemons, E. K., Gao, G., & Hitt, L. M. (2006). When online reviews meet hyper differentiation: a study of the craft beer industry. Journal of Management Information Systems, 23(2), 149–171.
Dave, K., Lawrence, S., & Pennock, D. M. (2003). Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In Proceedings of the 12th international world wide web conference (WWW2003), Budapest, Hungary. Available at http://www2003.org.
Davenport, T. H., & Harris, J. G. (2007). Competing on analytics: the new science of winning. Boston: Harvard Business School.
Dellarocas, C. (2006). Strategic manipulation of Internet opinion forums: implications for consumers and firms. Management Science, 52(10), 1577–1593.
Dellarocas, C., Awad, N., & Zhang, M. (2004). Exploring the value of online reviews to organizations: implications for revenue forecasting and planning. Working paper, MIT Sloan School of Management.
de Valck, K., van Bruggen, G. H., & Wierenga, B. (2009). Virtual communities: a marketing perspective. Decision Support Systems, 47(3), 185–203.
Engel, J. F., Blackwell, R. D., & Kollet, D. T. (1978). Consumer behaviour (3rd ed.). Hinsdale: Dryden Press.
Esuli, A., & Sebastiani, F. (2006). Determining term subjectivity and term orientation for opinion mining. In Proceedings of EACL-06, 11th conference of the European chapter of the association for computational linguistics, Trento, Italy (pp. 193–200). Association for Computational Linguistics.
Esuli, A., & Sebastiani, F. (2006). SentiWordNet: a publicly available lexical resource for opinion mining. In Proceedings of LREC-06, the 5th conference on language resources and evaluation, Genoa, Italy, LREC (pp. 417–422).
Esrock, S. L., & Leichty, G. B. (2000). Organization of corporate web pages: publics and functions. Public Relations Review, 26(3), 327–344.
Fayyad, U., & Uthurusamy, R. (1996). Data mining and knowledge discovery in databases. Communications of the ACM, 39(11), 24–26.
Geller, J., Parikh, S., & Krishnan, S. (2007). Blog mining for the fortune 500. In MLDM ’07: proceedings of the 5th international conference on machine learning and data mining in pattern recognition, Leipzig, Germany (pp. 379–391). Berlin: Springer.
Gemser, G., Van Oostrum, M., & Leenders, M. A. A. M. (2007). The impact of film reviews on the box office performance of art house versus mainstream motion pictures. Journal of Cultural Economics, 31(1), 43–63.
Gerdes, J. Jr., Stringam, B. B., & Brookshire, R. G. (2008). An integrative approach to assess qualitative and quantitative consumer feedback. Electronic Commerce Research, 8(4), 217–234.
Godes, D., & Mayzlin, D. (2004). Using online conversations to study word of mouth communication. Marketing Science, 23(4), 545–560.
Goh, T. T., & Huang, Y.-P. (2009). Monitoring youth depression risk in web 2.0. VINE, 39(3), 192–202.
Han, J., & Kamber, M. (2006). Data mining: concepts and techniques. San Francisco: Morgan Kaufmann.
Hsee, C. K., Yang, Y., Gu, Y., & Chen, J. (2009). Specification seeking: how product specifications influence consumer perception. Journal of Consumer Research, 35(6), 952–966.
Hu, M., & Liu, B. (2004). Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining, Seattle, US (pp. 168–177). New York: ACM.
Hu, N., Liu, L., & Zhang, J. J. (2008). Do online reviews affect product sales? The role of reviewer characteristics and temporal effects. Information Technology and Management, 9(3), 201–214.
Hu, W., Gong, Z., & Guo, J. (2010). Mining product features from online reviews. Paper presented at the 2010 IEEE 7th international. conference on e-business engineering (ICEBE).
Huang, M. X., Cai, F. Y., Tsang, A. S. L., & Zhou, N. (2011). Making your online voice loud: the critical role of WOM information. European Journal of Marketing, 45(7–8), 1277–1297.
Kaplan, B., & Duchon, D. (1988). Combining qualitative and quantitative methods in information systems research: a case study. Management Information Systems Quarterly, 12(4), 571–586.
Kozinets, R. V. (2002). The field behind the screen: using netnography for marketing research in online communities. Journal of Marketing Research, 39(1), 61–72.
Mahajan, V., Muller, E., & Kerin, R. A. (1984). Introduction strategy for new products with positive and negative word of mouth. Management Science, 30(12), 1389–1404.
Minichiello, V., Aroni, R., Timewell, E., & Alexander, L. (1990). In-depth interviewing: researching people. Hong Kong: Longman Cheshire Pty Limited.
Nielson, J. (2000). Why you only need to test with 5 users. Retrieved November 9, 2009 from http://www.useit.com/alertbox/20000319.html.
Peterson, R. A., Balasubramanian, S., & Bronnenberg, B. J. (1997). Exploring the implications of the Internet for consumer marketing. Journal of the Academy of Marketing Science, 25(4), 329–346.
Peterson, R. A., & Merino, M. C. (2003). Consumer information search behaviour and the internet. Psychology & Marketing, 20(2), 99–121.
Porter, A. L., & Cunningham, S. W. (2005). Tech mining: exploiting new technologies for competitive advantage. New York: Wiley.
Punch, K. F. (2006). Developing effective research proposals. London: Sage.
Radian6 (2009). Uncover influencers. Retrieved November 9, 2009 from http://www.radian6.com/applications/uncover-influencers/.
Riegner, C. (2007). Word of mouth on the web: the impact of web 2.0 on consumer purchase decisions. Journal of Advertising Research, 47(4), 436–447.
Schibrowsky, J. A., Peltier, J. W., & Nill, A. (2007). The state of Internet marketing research: a review of the literature and future research directions. European Journal of Marketing, 41(7/8), 722–733.
Stalk, G. Jr., & Lachenauer, R. (2004). Hard ball. Harvard Business Review, 82(4), 62–71.
Strauss, A., & Corbin, J. (1990). Basics of qualitative research: grounded theory procedures and techniques. Newbury Park: Sage.
Vegnaduzzo, S. (2004). Acquisition of subjective adjectives with limited resources. In Proceedings of the AAAI spring symposium on exploring attitude and affect in text: theories and applications, Stanford, US. Menlo Park: AAAI Press.
Yu, J., Zha, Z.-J., Wang, M., & Chua, T.-S. (2011). Aspect ranking: identifying important product aspects from online consumer reviews. Paper presented at the proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, vol. 1.
Zhang, J. Q., Craciun, G., & Shin, D. (2010). When does electronic word-of-mouth matter? A study of consumer product reviews. Journal of Business Research, 63(12), 1336–1341.
Zhang, K., Cheng, Y., Liao, W.-k., & Choudhary, A. (2011). Mining millions of reviews: a technique to rank products based on importance of reviews. Paper presented at the 10th international conference on entertainment computing.
Zhang, K., Narayanan, R., & Choudhary, A. (2010). Voice of the customers: mining online customer reviews for product feature-based ranking. Paper presented at the proceedings of the 3rd conference on online social networks, Boston, MA.
Zhang, R. C., & Tran, T. (2010). Helpful or unhelpful: a linear approach for ranking product reviews. Journal of Electronic Commerce Research, 11(3), 220–230.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Robinson, R., Goh, TT. & Zhang, R. Textual factors in online product reviews: a foundation for a more influential approach to opinion mining. Electron Commer Res 12, 301–330 (2012). https://doi.org/10.1007/s10660-012-9095-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10660-012-9095-7