Abstract
In previous studies, customer survey data were commonly adopted to perform the modelling of customer satisfaction (CS). However, it could be time-consuming to conduct surveys and obtain their data. On the other hand, respondents’ responses are quite often confined by preset questions. Nowadays, a huge number of customer online reviews on products can be found on various websites. The reviews can be extracted easily in a very short time. Customers can freely express their concerns and views of products in their online reviews. Those reviews provide a valuable source of information for manufacturers to improve their existing products and develop their new products. Previous studies have attempted to develop CS models based on survey data by using various computational intelligence techniques. However, no attempt at developing explicit CS models based on online reviews was reported in the literature. In this paper, a methodology for the modelling of CS based on customer online reviews and a multigene genetic programming-based fuzzy regression (MGGP-FR) approach is proposed. In the proposed methodology, relevant textual reviews of products are extracted from e-commerce websites. Then, opinion mining is conducted on the reviews and sentiments scores of customer concerns are derived. A MGGP-FR approach is then introduced to develop CS models based on the derived sentiment scores. A case study on developing CS models for electronic hairdryers is conducted to illustrate the proposed methodology and validate the effectiveness of MGGP-FR in the modelling of CS. The validation results show MGGP-FR outperforms the other three modelling approaches, fuzzy regression, genetic programming, and genetic programming-based fuzzy regression, in the CS modelling in terms of prediction accuracy.
Similar content being viewed by others
References
Alfrjani R, Osman T, Cosma G (2019) A hybrid semantic knowledgebase-machine learning approach for opinion mining. Data Knowl Eng 121:88–108. https://doi.org/10.1016/J.DATAK.2019.05.002
Castelli M, Manzoni L, Silva S, Vanneschi L (2011) A quantitative study of learning and generalization in genetic programming. In: Lecture notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol 6621 LNCS, pp 25–36. Springer, Berlin. https://doi.org/10.1007/978-3-642-20407-4_3
Chakraverty S, Sahoo DM, Mahato NR, Chakraverty S, Sahoo DM, Mahato NR (2019) Fuzzy numbers. In: Concepts of soft computing, pp 53–69. Springer, Singapore. https://doi.org/10.1007/978-981-13-7430-2_3
Chan KY, Ling SH (2016) A forward selection based fuzzy regression for new product development that correlates engineering characteristics with consumer preferences. J Intell Fuzzy Syst 30(3):1869–1880. https://doi.org/10.3233/IFS-151898
Chan KY, Kwong CK, Wong TC (2011a) Modelling customer satisfaction for product development using genetic programming. J Eng Des 22(1):55–68. https://doi.org/10.1080/09544820902911374
Chan KY, Kwonga CK, Dillonb TS, Funga KY (2011b) An intelligent fuzzy regression approach for affective product design that captures nonlinearity and fuzziness. J Eng Des 22(8):523–542. https://doi.org/10.1080/09544820903550924
Chan KY, Kwong CK, Dillon TS (2012) Development of product design models using fuzzy regression based genetic programming, pp 111–128. Springer, Berlin. https://doi.org/10.1007/978-3-642-27476-3_6
Chen WL, Chiang YM (2010) A study on the product design of hair dryer using neural network method. In: 3CA 2010–2010 international symposium on computer, communication, control and automation, vol 1, pp 396–399. IEEE. https://doi.org/10.1109/3CA.2010.5533792
Chen Y, Tang J, Fung RYKK, Ren Z (2004) Fuzzy regression-based mathematical programming model for quality function deployment. Int J Prod Res 42(5):1009–1027. https://doi.org/10.1080/00207540310001619623
Chen Y, Zhao Y, Qin B, Liu T (2016) Product aspect clustering by incorporating background knowledge for opinion mining. PLoS ONE 11(8):1–17. https://doi.org/10.1371/journal.pone.0159901
Chen K, Luo P, Wang H (2017a) An influence framework on product word-of-mouth (WoM) measurement. Inf Manag 54(2):228–240. https://doi.org/10.1016/j.im.2016.06.010
Chen Q, Zhang M, Xue B (2017b) Feature selection to improve generalization of genetic programming for high-dimensional symbolic regression. IEEE Trans Evol Comput 21(5):792–806. https://doi.org/10.1109/TEVC.2017.2683489
Chen Q, Xue B, Zhang M (2019) Improving generalization of genetic programming for symbolic regression with angle-driven geometric semantic operators. IEEE Trans Evol Comput 23(3):488–502. https://doi.org/10.1109/TEVC.2018.2869621
Fernández-Gavilanes M, Álvarez-López T, Juncal-Martínez J, Costa-Montenegro E, Javier González-Castaño F (2016) Unsupervised method for sentiment analysis in online texts. Expert Syst Appl 58:57–75. https://doi.org/10.1016/j.eswa.2016.03.031
Gandomi AH, Alavi AH (2012) A new multi-gene genetic programming approach to nonlinear system modeling. Part I: materials and structural engineering problems. Neural Comput Appl 21(1):171–187. https://doi.org/10.1007/s00521-011-0734-z
Groves RM (2006) Nonresponse rates and nonresponse bias in household surveys. Publ Opin Q 70(5):646–675. https://doi.org/10.1093/poq/nfl033
He L, Hoyle C, Chen W (2011) Examination of customer satisfaction surveys in choice modelling to support engineering design. J Eng Des 22(10):669–687. https://doi.org/10.1080/09544828.2010.505186
Ireland R, Liu A (2018) Application of data analytics for product design: sentiment analysis of online product reviews. CIRP J Manuf Sci Technol 23:128–144. https://doi.org/10.1016/j.cirpj.2018.06.003
Jiang H, Kwong CK, Law MC, Ip WH (2013) Development of customer satisfaction models for affective design using rough set and ANFIS approaches. Proc Comput Sci 22:104–112. https://doi.org/10.1016/j.procs.2013.09.086
Jin J, Liu Y, Ji P, Liu H (2016) Understanding big consumer opinion Undata for market-driven product design. Int J Prod Res 54(10):3019–3041. https://doi.org/10.1080/00207543.2016.1154208
Kwong CK, Chen Y, Chan KY, Luo X (2010) A generalised fuzzy least-squares regression approach to modelling relationships in QFD. J Eng Des 21(5):601–613. https://doi.org/10.1080/09544820802563234
Kwong CK, Fung KY, Jiang H, Chan KY, Siu KWM (2013) A modified dynamic evolving neural-fuzzy approach to modeling customer satisfaction for affective design. Sci World J 2013:1–11. https://doi.org/10.1155/2013/636948
Laili Y, Tao F, Zhang L (2015) A hybrid RCO for dual scheduling of cloud service and computing resource in private cloud. In: Configurable intelligent optimization algorithm, pp 257–287. Springer, Cham. https://doi.org/10.1007/978-3-319-08840-2_9
Liu B, Zhang L (2012) A survey of opinion mining and sentiment analysis. In: Mining text data, vol 9781461432, pp 415–463. Springer US, Boston, MA. https://doi.org/10.1007/978-1-4614-3223-4_13
McAllister MN (1996) Fuzzy logic with engineering applications (Timothy Ross). SIAM review, vol 38. Wiley. Retrieved from http://books.google.com/books?hl=en&lr=&id=nhz1f9j6_SMC&oi=fnd&pg=PR7&dq=ross+fuzzy&ots=vfr-0vLov5&sig=stuh2F1LrUt6ZVWre7c-76kRf6U
Mehr AD, Nourani V (2017) A Pareto-optimal moving average-multigene genetic programming model for rainfall-runoff modelling. Environ Model Softw 92:239–251. https://doi.org/10.1016/j.envsoft.2017.03.004
Mirtalaie MA, Hussain OK, Chang E, Hussain FK (2017) A decision support framework for identifying novel ideas in new product development from cross-domain analysis. Inf Syst 69:59–80. https://doi.org/10.1016/j.is.2017.04.003
Mousavi SM, Gandomi AH, Alavi AH, Vesalimahmood M (2010) Modeling of compressive strength of HPC mixes using a combined algorithm of genetic programming and orthogonal least squares. Struct Eng Mech 36(2):225–241. https://doi.org/10.12989/sem.2010.36.2.225
Naik TR, Dabhi VK (2013) Improving generalization ability of genetic programming: comparative study. J Bioinform Intell Control 2:243–252. https://doi.org/10.1166/jbic.2013.1063
Park J, Han SH (2004) A fuzzy rule-based approach to modeling affective user satisfaction towards office chair design. Int J Ind Ergon 34:31–47. https://doi.org/10.1016/j.ergon.2004.01.006
Peters G (1994) Fuzzy linear regression with fuzzy intervals. Fuzzy Sets Syst 63(1):45–55. https://doi.org/10.1016/0165-0114(94)90144-9
Poli R, Koza J (2014) Genetic programming. In: Search methodologies: introductory tutorials in optimization and decision support techniques, second edition, vol 5, pp 143–186. Springer, New York. https://doi.org/10.1007/978-1-4614-6940-7_6
Pookulangara S, Koesler K (2011) Cultural influence on consumers’ usage of social networks and its’ impact on online purchase intentions. J Retail Consum Serv 18(4):348–354. https://doi.org/10.1016/j.jretconser.2011.03.003
Rivero D, Fernandez-Blanco E, Fernandez-Lozano C, Pazos A (2019) Population subset selection for the use of a validation dataset for overfitting control in genetic programming. J Exp Theor Artif Intell. https://doi.org/10.1080/0952813x.2019.1647562
Saleena N (2018) An ensemble classification system for twitter sentiment analysis. Proc Comput Sci 132:937–946. https://doi.org/10.1016/J.PROCS.2018.05.109
Saylor RL (2015) A dynamic model predicting customer satisfaction ratings based on capital structure. ProQuest LLC, January, 99. Retrieved from https://search.proquest.com/docview/1658143957?pq-origsite=primo
Searson D, Willis M, Montague G (2007) Co-evolution of non-linear PLS model components. J Chemom 21(12):592–603. https://doi.org/10.1002/cem.1084
Searson DP, Leahy DE, Willis MJ (2011) Predicting the toxicity of chemical compounds using GPTIPS: a free genetic programming toolbox for MATLAB. In: Lecture Notes in Electrical Engineering, vol 70 LNEE, pp 83–93. https://doi.org/10.1007/978-94-007-0286-8_8
Shuang K, Zhang Z, Guo H, Loo J (2018) A sentiment information collector–extractor architecture based neural network for sentiment analysis. Inf Sci 467:549–558. https://doi.org/10.1016/j.ins.2018.08.026
Sivapragasam C, Muttil N, Muthukumar S, Arun VM (2010) Prediction of algal blooms using genetic programming. Mar Pollut Bull 60(10):1849–1855. https://doi.org/10.1016/j.marpolbul.2010.05.020
Tanaka H, Uejima S, Asai K (1982) Linear regression analysis with fuzzy model. IEEE Trans Syst Man Cybern 75(6):903–907. https://doi.org/10.1109/TSMC.1982.4308925
Tran B, Xue B, Zhang M (2019) Genetic programming for multiple-feature construction on high-dimensional classification. Pattern Recogn 93:404–417. https://doi.org/10.1016/j.patcog.2019.05.006
Wang H, Wang W (2014) Product weakness finder: an opinion-aware system through sentiment analysis. Indust Manag Data Syst 114(8):1301–1320. https://doi.org/10.1108/IMDS-05-2014-0159
Wang Y, Wang M (2018) Fine-grained opinion extraction from chinese car reviews with an integrated strategy. J Shanghai Jiaotong Univ Sci 23(5):620–626. https://doi.org/10.1007/s12204-018-1961-6
Wang W, Pan SJ, Dahlmeier D (2018) Memory networks for fine-grained opinion mining. Artif Intell 265:1–17. https://doi.org/10.1016/j.artint.2018.09.002
Wang WM, Wang JW, Li Z, Tian ZG, Tsui E (2019) Multiple affective attribute classification of online customer product reviews: a heuristic deep learning method for supporting Kansei engineering. Eng Appl Artif Intell 85:33–45. https://doi.org/10.1016/j.engappai.2019.05.015
Yang B, Liu Y, Liang Y, Tang M (2019) Exploiting user experience from online customer reviews for product design. Int J Inf Manag 46:173–186. https://doi.org/10.1016/j.ijinfomgt.2018.12.006
Zhang H, Sekhari A, Ouzrout Y, Bouras A (2016) Jointly identifying opinion mining elements and fuzzy measurement of opinion intensity to analyze product features. Eng Appl Artif Intell 47:122–139. https://doi.org/10.1016/j.engappai.2015.06.007
Zhang L, Chu X, Xue D (2019) Identification of the to-be-improved product features based on online reviews for product redesign. Int J Prod Res 57(8):2464–2479. https://doi.org/10.1080/00207543.2018.1521019
Acknowledgements
This PhD research is supported by the Research Grants Council (RGC) of Hong Kong and The Hong Kong Polytechnic University.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
All the authors in the paper have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yakubu, H., Kwong, C.K. & Lee, C.K.M. A multigene genetic programming-based fuzzy regression approach for modelling customer satisfaction based on online reviews. Soft Comput 25, 5395–5410 (2021). https://doi.org/10.1007/s00500-020-05538-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-020-05538-8