Skip to main content
Log in

Data-driven product configuration improvement and product line restructuring with text mining and multitask learning

  • Published:
Journal of Intelligent Manufacturing Aims and scope Submit manuscript

Abstract

In the era of big data, data-driven product configuration improvement and product line restructuring are two important and interrelated problems, and joint decision-making regarding these two problems needs to be tackled. There are two difficulties to achieve the joint decision-making. One is to obtain consumer choice probabilities related to improved product configurations, and the other is to obtain the best product configuration portfolio. In this study, a framework combining text mining and multitask learning is developed to deal with the difficulties. In the framework, improved product configurations are generated using online reviews and transaction data of the target product and its competitors. A one-to-many mapping from customer requirements to improved product configurations is realized by using a multitask support vector machine to obtain consumer choice probabilities for the improved product configurations. The profit maximization models considering the customer choice probabilities are then developed to obtain the best product configuration portfolio. A case study of Huawei P20 series smartphones is used to illustrate the effectiveness of the proposed methods. The results indicate that the multitask support vector machine obtained a higher prediction accuracy than two single-task learning and two other multi-task learning methods, and the proposed framework has the ability to increase the profits produced by the best product configuration portfolio.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. https://www.jd.com/

  2. https://www.tmall.com/

  3. http://www.zol.com.cn/

  4. https://github.com/fxsjy/jieba

  5. See Subsection 5.2.1 of Liu (2015) for the details of lexicon-based aspect sentiment analysis.

  6. http://www.keenage.com/html/c_index.html

References

  • Bakker, B., & Heskes, T. (2003). Task clustering and gating for Bayesian multitask learning. Journal of Machine Learning Research, 4, 83–99.

    Google Scholar 

  • Bechler, G., Steinhardt, C., Mackert, J., & Klein, R. (2021). Product line optimization in the presence of preferences for compromise alternatives. European Journal of Operational Research, 288(3), 902–917.

    Article  Google Scholar 

  • Belloni, A., Freund, R., Selove, M., & Simester, D. (2008). Optimizing product line designs: Efficient methods and comparisons. Management Science, 54(9), 1544–1552.

    Article  Google Scholar 

  • Chen, Z. Y., Fan, Z. P., & Sun, M. (2012). A hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data. European Journal of Operational Research, 223(2), 461–472.

    Article  Google Scholar 

  • Chen, Z., & Wang, L. (2009). Adaptable product configuration system based on neural network. International Journal of Production Research, 47(18), 5037–5066.

    Article  Google Scholar 

  • Choi, J., Oh, S., Yoon, J., Lee, J. M., & Coh, B. Y. (2020). Identification of time-evolving product opportunities via social media mining. Technological Forecasting and Social Change, 156, 120045.

    Article  Google Scholar 

  • Davis, J. M., Gallego, G., & Topaloglu, H. (2014). Assortment optimization under variants of the nested logit model. Operations Research, 62(2), 250–273.

    Article  Google Scholar 

  • Dou, R., Huang, R., Nan, G., & Liu, J. (2020). Less diversity but higher satisfaction: An intelligent product configuration method for type-decreased mass customization. Computers and Industrial Engineering, 142, 106336.

    Article  Google Scholar 

  • Evgeniou, T., and Pontil, M. 2004. Regularized multi--task learning. In: Proceedings of the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, Washington, UAS

  • Fang, X., Zhou, J., Zhao, H., & Chen, Y. (2020). A biclustering-based heterogeneous customer requirement determination method from customer participation in product development. Annals of Operations Research. https://doi.org/10.1007/s10479-020-03607-7

    Article  Google Scholar 

  • Farias, V. F., Jagabathula, S., & Shah, D. (2013). A nonparametric approach to modeling choice with limited data. Management Science, 59(2), 305–322.

    Article  Google Scholar 

  • Gupta, A., & Hsu, D. (2020). Parameter identification in Markov chain choice models. Theoretical Computer Science, 808, 99–107.

    Article  Google Scholar 

  • Hopp, W. J., & Xu, X. (2005). Product line selection and pricing with modularity in design. Manufacturing and Service Operations Management, 7(3), 172–187.

    Article  Google Scholar 

  • Hsu, C. W., & Lin, C. J. (2002). A comparison of methods for multiclass support vector machines. IEEE Transactions on Neural Networks, 13(2), 415–425.

    Article  Google Scholar 

  • Hu, N., Zhang, J., & Pavlou, P. A. (2009). Overcoming the J-shaped distribution of product reviews. Communications of the ACM, 52(10), 144–147.

    Article  Google Scholar 

  • Jeong, B., Yoon, J., & Lee, J. M. (2019). Social media mining for product planning: A product opportunity mining approach based on topic modeling and sentiment analysis. International Journal of Information Management, 48, 280–290.

    Article  Google Scholar 

  • Jiang, H., Kwong, C. K., Park, W. Y., & Yu, K. M. (2018). A multi-objective PSO approach of mining association rules for affective design based on online customer reviews. Journal of Engineering Design, 28(7), 1–23.

    Google Scholar 

  • Jiao, J., & Zhang, Y. (2005). Product portfolio planning with customer-engineering interaction. IIE Transactions, 37(9), 801–814.

    Article  Google Scholar 

  • Jiao, Y., & Yang, Y. (2018). A product configuration approach based on online data. Journal of Intelligent Manufacturing, 30(6), 2473–2487.

    Article  Google Scholar 

  • Jin, J., Ji, P., & Gu, R. (2016). Identifying comparative customer requirements from product online reviews for competitor analysis. Engineering Applications of Artificial Intelligence, 49, 61–73.

    Article  Google Scholar 

  • Jin, J., Liu, Y., Ji, P., & Liu, H. G. (2016). Understanding big consumer opinion data for market-driven product design. International Journal of Production Research, 54(10), 1–23.

    Article  Google Scholar 

  • Kumar, S., & Chatterjee, A. K. (2013). A profit maximizing mathematical model for pricing and selecting optimal product line. Computers and Industrial Engineering, 64(2), 545–551.

    Article  Google Scholar 

  • Lai, X., Zhang, Q., Chen, Q., Huang, Y., Mao, N., & Liu, J. (2019). The analytics of product-design requirements using dynamic internet data: Application to Chinese smartphone market. International Journal of Production Research, 57(18), 5660–5684.

    Article  Google Scholar 

  • Li, G., Rusmevichientong, P., & Topaloglu, H. (2015). The D-level nested logit model: Assortment and price optimization problems. Operations Research, 63(2), 325–342.

    Article  Google Scholar 

  • Li, Z., Tian, Z. G., Wang, J. W., & Wang, W. M. (2020). Extraction of affective responses from customer reviews: An opinion mining and machine learning approach. International Journal of Computer Integrated Manufacturing, 33(7), 670–685.

    Article  Google Scholar 

  • Liu, B. (2015). Sentiment Analysis: Mining Opinions, Sentiments, and Emotions. Cambridge University Press.

    Book  Google Scholar 

  • Luo, L. (2011). Product line design for consumer durables an integrated marketing and engineering approach. Journal of Marketing Research, 48(1), 128–139.

    Article  Google Scholar 

  • Mei, B., & Xu, Y. (2019). Multi-task least squares twin support vector machine for classification. Neurocomputing, 338(21), 26–33.

    Article  Google Scholar 

  • Mirtalaie, M. A., Hussain, O. K., Chang, E., & Hussain, F. K. (2017). A decision support framework for identifying novel ideas in new product development from cross-domain analysis. Information Systems, 69, 59–80.

    Article  Google Scholar 

  • Qi, J., Zhang, Z., Jeon, S., & Zhou, Y. (2016). Mining customer requirements from online Reviews: A product improvement perspective. Information and Management, 53(8), 951–963.

    Article  Google Scholar 

  • Seo, W., Yoon, J., Park, H., Coh, B.-Y., Lee, J.-M., & Kwon, O.-J. (2016). Product opportunity identification based on internal capabilities using text mining and association rule mining. Technological Forecasting and Social Change, 105, 94–104.

    Article  Google Scholar 

  • Sun, H., Guo, W., Shao, H., & Rong, B. (2020). Dynamical mining of ever-changing user requirements: A product design and improvement perspective. Advanced Engineering Informatics, 46, 101174.

    Article  Google Scholar 

  • Talluri, K., & Ryzin, G. J. (2004). Revenue management under a general discrete choice model of consumer behavior. Management Science, 50(1), 15–33.

    Article  Google Scholar 

  • Vithayathil, J., Dadgar, M., & Osiri, J. K. (2020). Social media use and consumer shopping preferences. International Journal of Information Management, 54, 102117.

    Article  Google Scholar 

  • Wang, W., Li, Z., Tian, Z., Wang, J., & Cheng, M. (2018). Extracting and summarizing affective features and responses from online product descriptions and reviews: A Kansei text mining approach. Engineering Applications of Artificial Intelligence, 73, 149–162.

    Article  Google Scholar 

  • Wang, Y., & Li, X. (2021). Mining product reviews for needs-based product configurator design: A transfer learning-based approach. IEEE Transactions on Industrial Informatics, 17(9), 6192–6199.

    Article  Google Scholar 

  • Wang, Y., Mo, D. Y., & Tseng, M. M. (2018). Mapping customer needs to design parameters in the front end of product design by applying deep learning. CIRP Annals-Manufacturing Technology, 67(1), 145–148.

    Article  Google Scholar 

  • Zhang, L., Chu, X., & Xue, D. (2019). Identification of the to-be-improved product features based on online reviews for product redesign. International Journal of Production Research, 57(7–8), 2464–2479.

    Article  Google Scholar 

  • Zhang, M., Fan, B., Zhang, N., Wang, W., & Fan, W. (2021). Mining product innovation ideas from online reviews. Information Processing and Management, 58(1), 102389.

    Article  Google Scholar 

  • Zhang, Y., & Yang, Q. (2018). An overview of multi-task learning. National Science Review, 5, 30–43.

    Article  Google Scholar 

  • Zhao, S., Zhang, Q., Peng, Z., & Fan, Y. (2019). Integrating customer requirements into customized product configuration design based on Kano’s model. Journal of Intelligent Manufacturing, 31(3), 597–613.

    Article  Google Scholar 

  • Zhou, F., Jiao, R. J., & Linsey, J. S. (2015). Latent customer needs elicitation by use case analogical reasoning from sentiment analysis of online product reviews. Journal of Mechanical Design, 137(7), 071401.

    Article  Google Scholar 

Download references

Funding

No funding was received to assist with the preparation of this manuscript. The authors have no relevant financial or non-financial interests to disclose.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhen-Yu Chen.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1

Like the standard support vector machine (Chen et al., 2012), the Lagrange multiplier method can be used to obtain the dual of the MT-SVM in Eqs. (7)–(9). The Lagrangian for the MT-SVM is described as:

$$ L\left( {{\mathbf{w}}_{0} ,{\mathbf{v}}_{t} ,a_{it} ,\gamma_{it} } \right) = J\left( {{\mathbf{w}}_{0} ,{\mathbf{v}}_{t} ,\xi_{it} } \right) - \mathop \sum \limits_{t = 1}^{T} \mathop \sum \limits_{i = 1}^{I} a_{it} \left( {y_{it} \left( {{\mathbf{w}}_{0} + {\mathbf{v}}_{t} } \right) \cdot {\mathbf{x}}_{it} - 1 + \xi_{it} } \right) - \mathop \sum \limits_{t = 1}^{T} \mathop \sum \limits_{i = 1}^{I} \gamma_{it} \xi_{it} , $$
(14)

where \(\alpha_{it}\) and \(\gamma_{it}\) are nonnegative Lagrange multipliers. According to the Karush–Kuhn–Tucker (KKT) condition, taking partial derivatives of \(L\left( {{\mathbf{w}}_{0} ,{\mathbf{v}}_{t} ,a_{it} ,\gamma_{it} } \right)\) with respect to the primal variables:

$$ \frac{\partial L}{{\partial w_{0} }} = 0 \Rightarrow {\mathbf{w}}_{0} = \frac{1}{{2\lambda_{2} }}\mathop \sum \limits_{t = 1}^{T} \mathop \sum \limits_{i = 1}^{I} \alpha_{it} y_{it} {\mathbf{x}}_{it} $$
(15)
$$ \frac{\partial L}{{\partial v_{t} }} = 0 \Rightarrow {\mathbf{v}}_{t} = \frac{T}{{2\lambda_{1} }}\mathop \sum \limits_{t = 1}^{T} \mathop \sum \limits_{i = 1}^{I} \alpha_{it} y_{it} {\mathbf{x}}_{it} $$
(16)
$$ \frac{\partial L}{{\partial \xi_{it} }} = 0 \Rightarrow 1 - \alpha_{it} - \gamma_{it} = 0. $$
(17)

Substituting the results in Eqs. (15)–(17) into (14) leads to the dual formulation of the MT-SVM

$$ {\text{max}}\left\{ {\mathop \sum \limits_{t = 1}^{T} \mathop \sum \limits_{i = 1}^{I} \alpha_{it} - \frac{1}{2}\mathop \sum \limits_{s = 1}^{S} \mathop \sum \limits_{i = 1}^{I} \mathop \sum \limits_{t = 1}^{T} \mathop \sum \limits_{j = 1}^{J} \alpha_{is} y_{is} \alpha_{jt} y_{jt} K_{st} \left( {{\mathbf{x}}_{is} ,{\mathbf{x}}_{jt} } \right)} \right\}, $$
(18)
$$ \begin{gathered} {\text{s}}.{\text{t}}.{\mkern 1mu} {\mkern 1mu} {\mkern 1mu} 0 \le \alpha _{{it}} \le C{\mkern 1mu} {\mkern 1mu} {\mkern 1mu} {\text{for}}{\mkern 1mu} {\mkern 1mu} {\mkern 1mu} i \in \left\{ {1,2, \cdots ,I} \right\}{\mkern 1mu} {\mkern 1mu} {\mkern 1mu} {\mkern 1mu} {\text{and}} \hfill \\ t \in \left\{ {1,2, \cdots ,T} \right\}. \hfill \\ \end{gathered} $$
(19)

Let \(\alpha_{it}^{*}\) is the optimal solution of the dual problem, the resulting classification. function of the MT-SVM is

$$ f_{t}^{*} \left( x \right) = \mathop \sum \limits_{t = 1}^{T} \mathop \sum \limits_{i = 1}^{I} \alpha_{it}^{*} y_{it} K_{st} \left( {{\mathbf{x}}_{is} ,{\mathbf{x}}} \right). $$
(20)

Appendix 2

See Fig. 4 and Tables (5, 6, 7, 8, 9).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, ZY., Liu, XL. & Yin, LP. Data-driven product configuration improvement and product line restructuring with text mining and multitask learning. J Intell Manuf 34, 2043–2059 (2023). https://doi.org/10.1007/s10845-021-01891-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10845-021-01891-z

Keywords

Navigation