Segmenting and Characterizing Adopters of E-Books and Paper Books Based on Amazon Book Reviews

  • Lu Guan
  • Yafei Zhang
  • Jonathan Zhu
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 669)


Online product reviews, in which consumers express their opinions about and experiences with products, are valuable both to potential buyers making informed purchase decisions and to retailers improving their products and services and adjusting their marketing strategies. One of the key challenges in mining product reviews is obtaining a “ground truth” to guide the segmentation of reviewers. We propose a behavior-to-opinion approach, in which users are first categorized by unambiguous behavioral patterns (where available) and their online reviews are then classified to reveal the distinctive characteristics of each user category. In this paper, we identify four categories of book consumers (i.e., kindle-only, print-only, print-to-kindle, and kindle-to-print) based on the long-term patterns of their review behavior. Their review posts are then clustered using word2vec and K-means, and the four categories of adopters are matched with the word topics that concern them. We find that print-only adopters show significantly different patterns on content-oriented topics compared with the other three groups. Kindle-to-print adopters pay more attention to portability, whereas print-to-kindle adopters place more emphasis on money and user experience. Taken together, our results indicate a diversity of characteristics among the four categories of book reviewers.
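
The behavioral segmentation step described in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact rule used to assign reviewers to the four categories, so the rule below is an assumption made for illustration only: a reviewer whose reviews all concern one format is labeled "<format>-only", and a reviewer with both formats is labeled by the format of their earliest review followed by the other format. The function name `segment_reviewers` and the record layout `(reviewer_id, unix_time, fmt)` are hypothetical.

```python
from collections import defaultdict


def segment_reviewers(reviews):
    """Assign each reviewer to one of four adopter categories.

    `reviews` is an iterable of (reviewer_id, unix_time, fmt) tuples,
    where fmt is either "kindle" or "print". This record layout and the
    labeling rule are illustrative assumptions, not the paper's method.
    """
    # Group each reviewer's reviews into a personal history.
    history_by_reviewer = defaultdict(list)
    for reviewer_id, unix_time, fmt in reviews:
        history_by_reviewer[reviewer_id].append((unix_time, fmt))

    labels = {}
    for reviewer_id, history in history_by_reviewer.items():
        history.sort()  # chronological order (ties broken by format name)
        formats = [fmt for _, fmt in history]
        first = formats[0]
        if len(set(formats)) == 1:
            # Only one format ever reviewed.
            labels[reviewer_id] = f"{first}-only"
        else:
            # Assumed rule: direction is earliest format -> the other one.
            other = "print" if first == "kindle" else "kindle"
            labels[reviewer_id] = f"{first}-to-{other}"
    return labels


# Toy data, invented for this example.
sample = [
    ("a", 1, "kindle"), ("a", 2, "kindle"),
    ("b", 1, "print"),
    ("c", 1, "print"), ("c", 5, "kindle"),
    ("d", 3, "print"), ("d", 2, "kindle"),
]
labels = segment_reviewers(sample)
```

Once reviewers carry such labels, their review text could be grouped by category before the word2vec and K-means step. A real analysis would likely need a stricter definition of "long-term pattern" than the first-review rule assumed here.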


Keywords: Text analytics, Behavioral patterns, E-books, Product reviews



Copyright information

© Springer Nature Singapore Pte Ltd. 2016

Authors and Affiliations

  1. Web Mining Lab, Department of Media and Communication, City University of Hong Kong, Kowloon, Hong Kong SAR, China
  2. Key Laboratory of System Control and Information Processing, Ministry of Education of China, Department of Automation, Shanghai Jiao Tong University, Shanghai, China
