1 Introduction

The rapid development of Big Data Analytics, Artificial Intelligence (AI) techniques and their associated applications have brought tremendous opportunities to enterprises (George et al. 2016; Zhan et al. 2020; Fosso Wamba and Akter 2019; Duan et al. 2019; Dwivedi et al. 2019; Akter et al. 2019, 2020), but also the need to transform Operations and process management has evolved at the same time (Matthias et al. 2017; Kache and Seuring 2017; Fosso Wamba and Queiroz 2020). The New Product Development (NPD) is one of the strategic cores of enterprise survival and development, and an important focus area in corporate strategy and decision making processes (McCarthy et al. 2006; Gonzalez-Zapatero et al. 2017; Yan and Wagner 2017). In this ever-changing environment, it is imperative for companies to develop new products meeting the demands of the highly complex, dynamic and unpredictable business environment (Nambisan 2002; Yan and Azadegan 2017), but also foster innovation in the process while targeting operational efficiency (Lam et al. 2016).

In previous decades, NPD was highly reliant on several stages such as market research, product idea creation and product design. The decision-making at each stage was deemed difficult because of the challenges in obtaining information from various involved parties (McCarthy et al. 2006) and often with a significant cost in terms of energy, financial resources and time to make decisions (Ogawa and Piller 2006). In the era of Big Data, Analytical processes and AI, the traditional decision-making process has not kept up with the times; as the evolution of technology provides cheaper and faster decision-making solutions as data are evolving and increasing in volume and formats (Bawack et al. 2019; Feki et al. 2016; Fosso Wamba et al. 2017; Queiroz and Telles 2018; Fosso Wamba et al. 2020), the NPD process should also focus on “making sense” of the data available (Lycett 2013; Wei et al. 2014). The sensemaking process of public views for NPD can utilise feedback and comments from social media and highlight the iterative role of the customer in value co-creation while exploring the sentiment throughout the whole NPD process (Zhan et al. 2018; Du et al. 2016; Fuchs and Schreier 2011; Majumdar and Bose 2019).

Social media provide a new communication path between the enterprise and the directly involved stakeholders in NPD (Chae 2015; Du et al. 2016). Social media, equipped with the latest AI techniques of text mining, could provide unpreceded decision-making opportunities (Duan et al. 2019; Dwivedi et al. 2019). Social media sensemaking processes in operations (Matthias et al. 2017) can reach real-time and common consensus and communication with consumers, opening up a new path for understanding ever-changing consumer demand and building good relationships with them (Hoyer et al. 2010). Also, the communication with customers is deemed as essential (Bahemia et al. 2017), not only for the building external links with the market but also for the support of internal product development teams. Social media applications can run through the entire process of product development (Du et al. 2016; See-To and Ngai 2018), from requirements communication, creative generation, product design, prototype testing, product release, product promotion and information feedback to product after-sales service. Consumers can be deeply involved in the NPD cycle (Hoyer et al. 2010), forming a strong relationship with the corporate products, and ultimately reflecting consumer demand for the newly launched products.

Building on a multidisciplinary, combinatory approach of the theoretical and empirical foundations borrowed from Operations and Information Management, we ground our study in the NPD and sensemaking academic fields for the analysis of social media data. We apply the sensemaking approach to explore the customer’s perspective and sentiment as they are expressed through social media throughout the stages of NPD. We collect data through an API that we developed to elicit information from Twitter for a specific product from the automotive industry. Through a sentiment analysis that is grounded on fuzzy theory for retrieving consensus from the data, we identify the impact of customer involvement (via the use of social media). This can be reflected through a variety of emotions throughout the NPD process.

Our objective is to contribute to our knowledge on NPD by providing theoretical insights and empirical findings for each stage and our aim to provide a research consensus for the analysis of social media data for NPD processes based on sensemaking patterns (Weick 1995; Weick et al. 2005) of customers’ sentiments. To achieve this, we discuss the NPD process and the associated steps where the public opinion can improve the process, and we extend each of these steps by exploring the sensemaking patterns in data retrieved from social media. Based on our findings we develop a research agenda for future research, highlighting the iterative role of the customer in the NPD process and the use of social media as a facilitator of these communications. Although previous studies have emphasized the use of marketing intelligence and consumer preferences for the development of new products, our study proposes a novel approach through the use of social media to identify latent product features using a two-level analysis. The proposed sensemaking mechanisms are used through the combination of sentiment analysis and fuzzy mathematics to provide a lens of analysing NPD stages and on a broader scope, various operations and supply chain processes.

The remainder of the paper proceeds as follows. First, we explain the emergence of social media in NPD through an extensive literature review putting social media data strategies on the core of attention alongside the sensemaking perspectives in operations management fields. Building on this, we conceptualise and explore the aspects of the NPD to underpin our investigation of the customer iterations in the whole process. Next, we develop a sensemaking approach through the analysis of social media data (collected for an example product from the automotive industry—10,210 tweets) providing the views of customers on various stages of NPD.

2 Theoretical foundations

The question motivating our study is: What is the impact of the customer sentiment in the steps of the Product Development Process? We aim to uncover insights from social media data of customer views and demystify the sentiment variations during the development process and the associated value for the overall operations.

Following Weick’s (1995) prescriptions, we adopt a sensemaking-oriented approach to the analysis of social media data. In particular, we draw on the (commonly) conceptual work in the NPD literature to develop a framing for the analysis of the NPD process in various stages. We analyse the sentiment variations of the customer opinion for a case product (from the automotive industry) for each step of the NPD process, while we explain and discuss the arising sensemaking patterns of customer views for each stage of NPD to improve the products and services. Thus, we provide an initial overview of the overarching theoretical concepts which motivated and supported this study, namely: the process of NPD, customer participation in the NPD, AI, social media and NPD and ultimately the foundations of sensemaking for social and public opinion.

2.1 Product design and development: the NPD process

NPD activities constitute a significant source of companies’ long-term profits, and sustainable competitive advantage (Van Oorschot et al. 2010). NPD includes the identification and deployment of detailed characteristics that result progressively into genuinely new and improved products (Levitt 1985). The products are comprised of multiple traits and embedded components, such as quality, cost, technology, packaging and services, and any breakthrough in one dimension can be regarded as the production of new products (Takeuchi and Nonaka 1986). Seminal research work on NPD focuses on the innovative and strategic potential of the NPD cycle and the implications for the success or failure of the product launch (Schilling and Hill 1998; Veryzer 1998) and the key role of Big Data Analytics and AI have to play in this process (Trabucchi and Buganza 2019).

Specific NPD details include changes in product structure, technical performance, or material that are superior to the previous generation, resulting in products more advanced and closer to consumer demand (Ding and Eliashberg 2003). When defining the functional requirements of a new product, critical factors need to be considered, such as technological progress, promoting business growth, increasing profits, increasing customer satisfaction, or reducing costs (Cooper et al. 2005). The company’s new products can be original products, existing products with added unique characteristics, or new brands. At the same time, companies need to analyse the characteristics of new products, from different perspectives like sales, profits, customer groups, competitors and prices to make sure which product functions should be focused on in order to achieve success in the market.

For Hoyer et al. (2010), the NPD process includes four major stages: ideation, product development, commercialisation, and post-launch. In this study we focus on the first three, leaving the post-launch phase for further research, as it can be influenced differently from after-sales feedback (Ramanathan et al. 2017).

Ideation The idea screening stage is divided into a customer and a vendor source process (Barczak 1995). The customer-sourced idea screening process is well suited to the needs of a particular user or market and accounts for a large proportion of NPD (Hoyer et al. 2010). The manufacturer develops new products according to the original product ideas proposed by the customers. Product concept development is based on market demand through continuous communication and feedback information from the market (Barczak et al. 2009) thus an overall evaluation of the competitiveness and economic viability at each stage.

Product development Product development refers to developers’ or engineers’ research to design physical products based on product concepts, so that product ideas can be transformed into useful products (McCarthy et al. 2006). The product development process is transforming a product concept into a new product entity and mainly solving the problem of whether product ideas can be transformed into commercially and technically available products (Ferioli et al. 2010; Hoyer et al. 2010). The process is completed through the design, trial production, testing and certification of new products.

Commercialisation If the new product trial sales achieve the expected results, the company will decide to commercialise the new products (Barczak et al. 2009). At this point, the company should focus on the following aspects: choose the correct time to market, specific target market and right marketing mix strategy. To truly enter the commercial circulation and complete the final commercial realisation, the product also needs to go through a complete commercial packaging process to form a commodity. This usually includes activities such as customer delivery content, product sales policies, product sales brochures, product sales quotes, product sales agreements, and product after-sales services (Durmuşoǧlu and Barczak 2011).

2.2 Customer participation in NPD

As the customers can have an iterative role in knowledge and opinion calls such as customer preferences, market demand and popular trends (Chong et al. 2016), customer involvement is deemed necessary for the NPD process (Lagrosen 2005; Tsinopoulos and Mena 2015). In order to ensure the smooth progress of the innovation process for the increasing need for more diversified and customised products, enterprises gradually realise the importance of user participation. User engagement can motivate companies to generate new ideas and encourage companies to invest more resources and capabilities into product innovation and process improvement (Poetz and Schreier 2012) supporting also customer value co-creation initiatives (Romero and Molina 2011).

Customer participation involves a relationship of sharing of resources between the customer and the enterprise in the process of enterprise NPD, providing valuable knowledge and information to the process (Chang and Taylor 2015). It has multiple manifestations such as: jointly developing new products or technologies; paying close attention to user demand for product performance, preferences on appearance types; implementing user knowledge management and better service NPD (Lin and Germain 2004). Customer engagement also could refer to the resources (capabilities, ideas, knowledge, investments) that users provide in NPD activities (Mikkola and Skjøtt-Larsen 2004).

The above definitions mainly describe the specific lines of user participation, defining the role of the user and the scope of activities involved. The involvement of customers in the NPD process is vast and could provide a competitive advantage for value co-creation (Hoyer et al. 2010), a decision-making mechanism (McCarthy et al. 2006) and a source of creativity (Stevens et al. 2003) for NPD. Combined with the above research, user participation our study is based on information resource sharing and performance improvement. The company actively invites users to participate in innovative activities such as product and technology process development, to achieve the absorption and utilisation of essential resources such as customer knowledge, information and capabilities.

2.3 AI, social media and NPD

Access to information about customers’ product needs, preferences and needs are often seen as a foundation of successful innovation in NPD (Poetz and Schreier 2012; Chang and Taylor 2015; Lam et al. 2016; Hartmann et al. 2016). According to recent surveys, most enterprises regard social media information as an important source of enterprise’s innovation (Cui et al. 2018; Chen et al. 2015), which is of considerable significance to enterprise product development. As a result, many researchers have focused on social media, applying social media data to their business processes, and integrating customers into the company’s NPD (Du et al. 2016). Users can feedback product requests or ideas on social media platforms, and these online interactions or online comment exchanges have become the focus of NPD researchers (Chan et al. 2017; Fuchs and Schreier 2011; Gu and Ye 2014).

Artificial Intelligence (AI), as a principle, existed for over 6 decades, however, the last decade the term came back to popularity (Duan et al. 2019). Nowadays, AI is mostly applied for decision making; where the common role of AI is either to support/assist the human decision makers, or to replace them (Bawack et al. 2019; Duan et al. 2019; Dwivedi et al. 2019). One of the most popular AI techniques for decision-making is through text mining techniques, obtaining insight from social media content (Bawack et al. 2019; Fosso Wamba et al. 2019). As the progress of AI enables stakeholders to obtain knowledge from social media, new requirements evolve as to demystify the knowledge and make sense of it, and the major challenge is to undertake more complex tasks that require cognitive capabilities (Duan et al. 2019).

Many scholars have fully affirmed the critical value of social media for the process transformation but also product development (Aral et al. 2013). Social media can be a powerful platform for collaboration between companies and customers, where the user-generated content can provide valuable insights for operational development (Fader and Winer 2012). Piller and Walcher (2006) propose a social network-based creative contest application that allows manufacturers to get innovative ideas and solutions from their users. Social media also enable companies to monitor the list of competitors, track trends in other companies’ news releases, identify new products, and understand potential customer needs (Fan and Gordon 2014).

The availability and use of social media have opened new ways of value creation and potential to business and process transformation (Rad et al. 2018), as well as unprecedented operational opportunities for collaboration (Kane et al. 2014). Moving beyond marketing value to innovative product development, multiple opportunities arise as social media provide new methods in marketing, information systems and social sciences research and can also go towards directions related to operations management, where the opportunities could increase for innovative processes, products or services (Leonardi 2014; Levina and Arriaga 2014).

Chae (2015) but also Mishra and Singh (2018) emphasize the potential role of social media for a wide array of supply chain practices (such as professional networks, stakeholder engagement, demand shaping, new product/service development and supply chain risk management). Singh et al. (2018) also posit that using text mining methods for social media analysis could help to improve the management of the supply chain. There is a growing body of literature that advocates that the analysis of data collected from social media, while focusing on the main concerns of consumers can provide a series of recommendations for developing a consumer-centric supply chain (Chae 2015; Lam et al. 2016; See-To and Ngai 2018; Singh et al. 2018).

2.4 Sensemaking and social media

Social media in general, and Twitter feeds in particular, are often used for expression of public opinion in the political discussion (Stieglitz and Dang-Xuan 2013), crisis management in emergencies (Fosso Wamba et al. 2019; Kumar et al. 2020; Singh et al. 2019) and information sharing for humanitarian operations (Maresh-Fuehrer and Smith 2016; Panagiotopoulos et al. 2016). Bruns and Stieglitz (2013) explain that often Twitter can support and influence various situations, spanning from isolated crises to cultural interactions, but also reviews of products and services (Rehman et al. 2016; See-To and Ngai 2018). However, the major problem identified in Twitter-based communication as a data collection method is the lack of standard metrics for comparing communicative patterns across cases (Bruns and Stieglitz 2013). Identifying and making sense of prevailing patterns within the collected data from social media, while disseminating and sharing information, could be challenging as various sources should be filtered and integrated for this process (Zeng et al. 2010). An additional challenge in the use of social media content could be around the different ways this content could be interpreted, or even misinterpreted (Stieglitz and Dang-Xuan 2013). Therefore, a rigorous analysis should be followed in order to extract the exact content specifics, and identify the various roles of the content generators, as well as their sentiment (Feldman 2013; Pang and Lee 2009).

Sensemaking is a generic term that refers to the processes of interpretation whereby individuals and groups interpret and reflect on phenomena (Weick et al. 2005). The sensemaking cycle iterates the social means of people enactment of their experiences and views, through verbal or written communication and negotiations (Light et al. 1967). In Operations and Supply Chain Management several best practices have been applied to enhance sensemaking, such as interaction with both suppliers and customers at trade shows, cross-organisational or cross-functional teams, or the use of the practice of the guest engineer at suppliers’ sites. These practices facilitate the development of common understanding of critical issues. Through the development and use of a shared vocabulary, the individual understanding and appreciation is integrated to collective cognitions that shape decision making (Grant 1996; Revilla and Knoppen 2015).

Sensemaking as a way to define patterns from social media data can also be conceived as ethnomethodology to study different ways that people construct a stable social world through everyday utterances and actions (Garfinkel 2016). In our study, we apply an interpretative scheme and hermeneutics (Myers et al. 2011; Walsham 1995) to social media data, in order to elicit information and knowledge and identify the sensemaking patterns for the analysis of customer views. Therefore, the sensemaking approach as it is presented in this study provides a methodological grounding, where the application of fuzzy mathematics assists in identifying the patterns/themes in the data which can make sense for each of the NPD stages.

3 Research design

Our study is highly motivated and influenced by the work of Chan et al. (2016) that extracted data from Facebook via the NCapture tool. For accurate content analysis, the researchers selected the NPD-related factors (such as product price, product performance) by high-frequency word sorting to establish a model for NPD evaluation. Their model logically links the features that customers care most about with NPD. There are a few studies on NPD supporting how product performance can determine market strategy, product development process and product services (Cho and Lee 2013). Our study frames the whole analysis of the NPD process for the specific product in four distinct aspects related to the style-appearance, performance, configuration and service. These four features were identified as the common sensemaking patterns (prevailing themes in the data) characterised by the high frequency within the content of collected tweets (Mirbabaie and Zapatka 2017) and review the qualities of the product under research.

Our research design draws on three major stages (Table 1), namely:

Table 1 Research design
  1. a.

    The data collection stage,

  2. b.

    The initial exploratory phase (sentiment analysis)

  3. c.

    The in-depth analysis (sensemaking approach through fuzzy mathematics).

The data collection process included the required methods to collect the twitter data, for a given product in a specific period. The exploratory phase served the purposes of an initial exploratory analysis of the collected Twitter data, identifying the polarity and sentiment of the customers’ views for each of the NPD phases. The final stage adapted the principles of sensemaking in identifying patterns in the data for each of the NPD processes through techniques of fuzzy mathematics. These stages are explained further in the following sub-sections.

4 First stage: data collection

The case product As the automotive industry is one of the main research contexts in operations and supply chain management (e.g., Lin and Zhou 2011; Kim et al. 2011; Prahinski and Benton 2004; Wagner et al. 2009; Childerhouse et al. 2003), we selected as an exemplary case a product from the automotive industry as our research context. In recent years, with the exhaustion of energy and the popularity of new energy vehicles, people are increasingly aware of the importance of new energy vehicles and are beginning to buy new energy vehicles or electric vehicles. Tesla, an American electric vehicle and energy company, is the first to be sought. Moreover, Model X is a new model of Tesla and recently has entered the global market, therefore it could base as an example case the specifics of this study. Through the sentiment analysis and the fuzzy theory for text mining of tweets, we explore the variations of sentiments of the customers about the product. As quantitative methods of text analysis will also bring basis and support the results of the sentiment analysis, we postulate that we adopt a holistic perspective about the negative and positive emotions about Tesla’s product development, or redesign stages.

Twitter data Our study is based on commentary twitter data about Tesla’s Model X characteristics. Comments and responses to a tweet are also crucial as well as the retweeting behaviour (Bruns and Stieglitz 2013; Stieglitz and Dang-Xuan 2013). The Twitter open platform can easily provide developers with rich applications and complete service platforms using third-party access (Chae 2015). Developers can access rich interface resources according to the application service process, and there are plenty of data interfaces that can be collected, including tweets, users’ information, content, relationships, and topics (Bruns and Stieglitz 2013; Mirbabaie and Zapatka 2017). There are two main types of data collection methods from Twitter: (1) Twitter Open Platform Application Programming Interface (API), which can conveniently and quickly obtain Tweets in batches and (2) Web crawler technology that can automatically extract information from web pages. Due to the freedom and freeness of the Twitter open platform, and for the purposes of this study, we chose to collect data through Twitter’s free and open-source API, Tweepy, which is based on Python and can provide a large to a rich pool of data. In “Appendix A” we provide the code for the data collection and a snapshot of the collected data.

5 Second stage (exploratory phase)

Twitter data and information are mostly in the form of text. Conducting a sentiment analysis with Tesla Model X’s commentary can provide the means to consider the users’ opinions. Our proposition is that by following the inclusion of social media in their NPD processes, Tesla could potentially match the needs of consumers as much as possible, and make rational decisions based on customer requirements.

The construction of a sentiment dictionary is a fundamental primary stage in the sentiment analysis. We used a lexicon-based approach to extract sentiment from the texts. The extent of which the sentiment dictionary could be comprehensive and accurate is primarily influenced by the results of the analysis of the emotional sentiment of the text (Burnap et al. 2015; McKenna et al. 2017). Our approach adopted the following process: We constructed a domain of emotional resources, developed the evaluation of information quaternions, and then calculated the emotional polarity of phrases and sentences (Lau et al. 2014).

A specific domain of sentiment lexicon has been constructed within a particular sentiment analysis environment. The expanded sentiment dictionary made up for the incompleteness of the basic sentiment dictionary, the disadvantages of domain emotional words and online emotional words are not included and improve the accuracy of emotional sentiment analysis. The construction of the domain sentiment dictionary comprised the selection of emotional seed words, the extension of the emotional dictionary (the field words are added to the dictionary), and the calculation of the weight of the emotional words.

5.1 Research propositions

Our research propositions were formed around five thematic clusters, related to product characteristics as these are introduced by Popescu and Etzioni (Popescu and Etzioni 2007):

Proposition 1

The customer is interested and comments on general characteristics of the product throughout the new product development lifecycle phases.

Proposition 2

The customer is interested and comments on the product’s components throughout the new product development lifecycle phases.

Proposition 3

The customer is interested and comments on the appearance-aesthetics throughout the new product development lifecycle phases.

Proposition 4

The customer is interested and comments on the product’s attributes throughout the new product development lifecycle phases.

Proposition 5

The customer is interested and comments on product-related concepts throughout the new product development lifecycle phases.

5.2 Evaluation object and evaluation methods

The five thematic clusters which formed the propositions of this research were applied as evaluation objects. The evaluation object of the tweet is the user’s opinion on a specific aspect of a certain product. In the commentary information of the tweet, the evaluation object mainly refers to the product itself, a specific attribute or characteristic of the product, and the situation of some things related to the product itself (logistics, customer service, outer packaging). The five types of product evaluation objects used for the sentiment analysis could be summarised as (Popescu and Etzioni 2007): (1) the overall product, (2) the product’s components, (3) the appearance-aesthetics, (4) the product’s attributes and (5) product-related concepts.

The product evaluation objects (Popescu and Etzioni 2007) are represented in the tweets in the following exemplary manner.

  1. 1.

    “Tesla is very good” (the overall product) where the evaluation object is “Tesla” itself

  2. 2.

    “The design of the seat is not very suitable for human factors” (the product components) where the evaluation object is the part of the car “the seat of the car”;

  3. 3.

    “The colour of this car is not good” (the appearance-aesthetics) where the evaluation object “colour” is related to the appearance of the car;

  4. 4.

    “The engine of Tesla’s engine is not good” (the product’s own attributes) where the evaluation object “engine” is the performance of the product;

  5. 5.

    “Regular maintenance is not good” (the product-related concepts) where “maintenance” is relevant to the product-related properties.

The extraction of the evaluation objects is mainly used to obtain the specific object for which the expression provides the evaluation information. In the study of the evaluation object extraction, the evaluation information corpus is pre-processed (discontinuation word processing, the lemmatisation and part-of-speech tagging), nouns or noun phrases are extracted as candidate evaluation object sets, and then the collection is performed by statistical methods. Finally, the noun words less than a certain threshold are removed, and the more frequent nouns or noun phrases are retained as the evaluation object set.

5.3 Extraction of emotional words

Our evaluation method followed a lexicon-based extraction method which is highly supported by Computational Linguistics research (Qiu et al. 2011; Taboada et al. 2011) in text mining unsupervised knowledge extraction through AI (Quan and Ren 2014; Rozenfeld and Feldman 2011). Firstly, the evaluation information was pre-processed, and then the processed words were compared with the pre-built emotional domain sentiment lexicon (adverb dictionary, negative words dictionary and turning words dictionary). If the word we found was in the dictionary, we concluded that this is an emotional word, and subsequently obtained the corresponding word information; if the word was not in the dictionary, then we evaluated the next words until the end of the whole sentence in the tweet. When emotional words were found in a sentence, the same method was used to traverse the negative dictionary, the adverb dictionary, the turning word dictionary. With this process, the corresponding word weight was obtained accordingly.

Based on the extraction of the evaluation words and the evaluation objects, a phrase-based quaternary model was then constructed. The evaluation path obtained the relevant elements of the phrase to facilitate the calculation based on the emotional polarity related rules of the evaluation information. For an evaluation message of Tesla on Twitter, the user evaluation information was segmented into phrases, and a four-tuple model of sentiment analysis was then constructed based on the phrase.

For example: “The engine performance is not good, but the GPS system is very good, and I like it very much.” The evaluation information was divided into three phrases: “The engine performance is not good”, “but the GIS system is very good” and “I like it very much”. The evaluation factors were sequentially extracted for the evaluation phase. For the subjective evaluation information that did not contain commodity attributes such as “I like it very much”, the pronoun “I” was directly removed during pre-processing, and only the emotional phrase “like very much” was retained.

5.4 Calculation of emotional values

For the sentiment analysis of tweets, the emotional tendencies of the constructed feature quads were analysed (Ilieva et al. 2018; Zhou et al. 2019). According to fuzzy statistical analysis, the common four-tuple of emotional phrases follow the four identified sets of conditions:

$$ {\text{Evaluation Object }}\left( {\text{O}} \right) + {\text{Emotional Words }}\left( {\text{W}} \right) $$

In our analysis, the emotional extremes of the quad phrase were judged by combining the weight of the evaluation object and the emotional polarity and intensity of the words in the sentiment dictionary. For example, in the phrase “This car is beautiful”, the commentary only contains the evaluation object “this car” and the evaluation of the emotional word “beautiful”. When calculating the emotional polarity of the tweet, it is only necessary to multiply the weight of the evaluation object and the weight of the emotional word. In the tweets where the evaluation sentence contained a plurality of evaluation quads, all the quads were weighted and summed to obtain the emotional polarity of the entire sentence.

$$ {\text{Evaluation Object }}\left( {\text{O}} \right) + {\text{Negative Words }}\left( {\text{N}} \right) + {\text{Emotional Words }}\left( {\text{W}} \right) $$

However, the emotional polarity of emotional words is not only related to negative words but also closely related to the number of negative words. When the quaternion performs the emotional polarity calculation, the weight of the evaluation object, the weight of the negative word, and the weight of the emotional word were multiplied to obtain the emotional extreme value of the emotional phrase. In the case where there were several negative words, the weight of the negative words was multiplied by the corresponding number of times. When the number of negative words was even, this was consistent with the polarity of the emotional words; in the case where the number was odd, this was opposite to the polarity of the emotional words. For example, for the tweets “this car is not beautiful,” and “this car is not no beautiful”, the number of negative words in these two sentences is different, and the apparent emotional polarity is inconsistent.

$$ {\text{Evaluation Object }}\left( {\text{O}} \right) + {\text{Adverb }}\left( {\text{Ad}} \right) + {\text{Emotional Words }}\left( {\text{W}} \right) $$

An additional layer in our analysis included the tweets/sentences in which the appearance of modified adverbs could potentially enhance the emotional intensity of emotional words. The emotional polarity of a phrase changes appropriately with the degree of emotional adverbs based on emotional words. For example, in the tweet “this car is pretty beautiful”, the inclusion of the degree adverb “pretty”, the apparent emotional word “beautiful” is moderated. The calculation of the polarity of the emotional words is not a simple weighted summation, but on the basis of the extreme values of the modified emotional words, the emotional intensity is strengthened. The formula we used in our analysis is:

If the emotional word (W) is positive, then

$$ {\text{Polarity}} = \left( {\text{W}} \right) + \left( {1 - \left( {\text{W}} \right)} \right)*\left( {\text{Ad}} \right) $$

If the emotional word (W) is negative, then

$$ {\text{Polarity}} = \left( {\text{W}} \right) + \left( { - 1 - \left( {\text{W}} \right)} \right)*\left( {\text{Ad}} \right) $$

Taking the previous example into account, if the value of the emotional word “beautiful” is 0.7, then the emotional phrase “pretty beautiful” is (0.7 + (1 − 0.7) * 0.7) is 0.91. Based on obtaining the weight of the emotional words, the weights of the evaluation objects were subsequently weighted and summed to obtain the emotional extreme values of the evaluation phrases.

$$ {\text{Evaluation Object }}\left( {\text{O}} \right) + {\text{Negative Words }}\left( {\text{N}} \right) + {\text{Adverb }}\left( {\text{Ad}} \right) + {\text{Emotional Words }}\left( {\text{W}} \right) $$

In tweets where the negative words and adverbs appeared at the same time, based on the multiplication of the weights of the sentiment words and the weights of the negative words, the weights of the evaluation objects were weighted and summed to obtain the emotional extremes of the evaluation phrases. The formula we used in our analysis is:

If the emotional word (W) is positive, then

$$ {\text{Polarity}} = \left( {\text{W}} \right)*\left( {\text{N}} \right) + \left( {1 - \left( {\text{W}} \right)} \right)*\left( {\text{Ad}} \right) $$

If the emotional word (W) is negative, then

$$ {\text{Polarity}} = \left( {\text{W}} \right)*\left( {\text{N}} \right) + \left( { - 1 - \left( {\text{W}} \right)} \right)*\left( {\text{Ad}} \right) $$

The above rules cover the emotional expression of most of the commentary phrases. Our analysis was built on the method proposed by Taboada et al. (2011), as a formula for calculating the emotional polarity of the quaternary (Table 2).

Table 2 Examples of calculations of quaternions of emotional values

By using the degree adverbs, negative words, sentiment lexicons, and several statistical rules introduced above, the emotional polarity and emotional extremes of most evaluation information quads were calculated. In “Appendix B” we provide the code in python for the sentiment analysis.

6 Third stage: in-depth analysis (sensemaking patterns based on fuzzy mathematics)

In the previous section, we discussed in detail the way that the sentiment analysis was performed on Tesla’s Model X for a single tweet message. For the sentiment analysis of a single tweet, the method directs to one of the five categories of the user’s emotional tendency toward the product {very good, good, average, poor, very poor}. The proposed method can be convenient for the decision-makers or the company (Tesla) to categorize and understand the polarity of the information. However, it may take a lot of time to go through the five categories to see the attitude of each tweet. For that purpose, the proposed method of a fuzzy comprehensive evaluation is used for sentiment analysis, to quantify the qualitative emotional tendency and to calculate the complete emotional value of the commodity through a detailed evaluation model. This comprehensive value can assist the company (Tesla) to decide, plan and improve the future steps or processes of NPD.

6.1 Classification of evaluation objects

The following step in our analysis of Twitter feeds was the construction of the evaluation indicator system of Tesla’s X model. The evaluation indicator system was used to summarise and reflect the users’ appreciation of the product (Zhou et al. 2019). Due to their generality, the evaluation indicators cannot cover all the evaluation objects. Hence, the evaluation objects were classified to obtain the fuzzy evaluation and the index weights by using fuzzy statistics so as to conduct a comprehensive sentiment analysis on Tesla (Vatrapu et al. 2016; Zhou et al. 2019). In order to obtain the aspects of Tesla’s X model that consumers are interested in, first we conducted a word frequency statistic of all the words that were used in the twitter comments. Our analysis shows that the words “grey”, “small”, “V-shape”, “fast”, “powerful”, “low-emission”, “big-sunroof”, “accurate”, “comfortable”, “economy”, “good service” and “clean” are the most frequently used about the X model. Based on these evaluation objectives, all the evaluation objects were classified according to the first-level indicators and the second-level indicators determined in the indicator structure model. The specific categories are shown in Table 3 below.

Table 3 Example of category for evaluating features

6.2 Construction of indicator system

The evaluation indicator system is a core and critical link in the evaluation of Tesla vehicles. In the indicator system for building a certain product, many scholars have constructed various reasonable structural models of commodity evaluation indicators, which can bring a lot of substantial help to the company’s further development. Based on the analysis of many structural models (e.g. Soukhoroukova et al. 2012; Malviya and Kant 2016; Wu et al. 2015) and combined with the evaluation information of users on Twitter, the indicator structure model for our study is shown in Fig. 1. For Tesla’s evaluation indicator structure model, the determination of indicators is a factor that most users are concerned about. However, we acknowledge that there can be evaluation models that can include more or other indicators.

Fig. 1
figure 1

The evaluation indicator system

6.3 Determination of the evaluation indicators

In the process of fuzzy evaluation of products (Malviya and Kant 2016; Wu et al. 2015), each evaluation indicator has a different influence on the product, so it is necessary to calculate the weight coefficient of the indicator reasonably. At present, there are two standard methods for calculating the weights of the fuzzy comprehensive evaluation indicators: subjective and objective weighting methods. The first is based on the subjective judgment of an individual’s comments to determine the index weight. There are some common subjective weighting methods, such as the Delphi method and the direct judgment method. The subjective weighting method is convenient and straightforward to use, but it strongly relies on people’s judgements. The objective weighting method uses statistical methods, and the weights are determined according to statistical correlations, such as principal component analysis, entropy method, etc. This method has a more accurate theoretical basis, higher credibility, and is suitable for a large amount of data, but the calculation is relatively cumbersome.

The weight determination method that we used is more in line with objective facts than the method used by the previous researchers. It can not only use large-scale evaluation information, but also make the weight determination more convincing, and can carry out incremental weight calculation. At the same time, it reduces the tediousness of double counting, saves time, and maximises the use of tweet information.

Based on the advantages and disadvantages of the above two methods, we propose a more concise and easier to understand evaluation index weight assignment method.

  • First, we count the number of pieces of comment information of the item (n)

  • Then, we obtain the evaluation object through the extraction method of the evaluation object mentioned earlier

  • Finally, the frequency of mention of each evaluation object is multiplied by the score of the previous sentiment analysis.

6.4 Example analysis

Based on the previous steps of the sentiment analysis, the secondary indicator score is equal to the average of the scores in the tweet containing corresponding objectives. Based on each secondary indicator, we can calculate a rating for each primary indicator. This is calculated by dividing the sum of all secondary indicator scores by the number of secondary indicators (Shown in Table 4).

Table 4 Primary and secondary indicators score table in Tesla Model X

An indicative example of the sentiment analysis is shown in Table 5. We provide only one example for illustration purposes, due to the large number of the collected tweets (also shown in “Appendix C”).

Table 5 Partial tweet analysis results

Style/appearance Based on our analysis, the score of the style/appearance indicator is 0.31 (0.78 in colour, 0.02 in size and 0.14 in shape) and the proportion of style/appearance is 0.28. As can be seen from the data in Table 4, the users are positively disposed toward the style/appearance of Tesla Model X. Style/appearance’s mention rate is about 28% of the sum of the four aspects. Based on the data, this part focuses on how to conduct the NPD process from two aspects, namely idea generation and idea screening.

Performance The score for the performance indicator is − 0.04 (− 0.36 in terms of speed, − 0.23 in terms of engine, 0.46 in terms of displacement) and the proportion of performance is 0.33. The performance of the Tesla Model X is the company’s first priority. On the one hand, the performance rate of mention is the highest of the four first-level indicators. On the other hand, according to the user’s tweets, the performance of Model X appears to be at a medium level. Based on this analysis, we can extrapolate that Tesla’s design department should pay attention to this aspect of the products’ characteristics.

Configuration The score for the configuration indicator is 0.32 (− 0.24 for the sunroof, 0.85 for the GPS and 0.35 in terms of seat comfort). 18% of users’ comments focus on the product’s configuration and the proportion of the configuration indicator is 0.18. According to our results, Tesla Model X’s perceived evaluation in terms of configuration is considered good. The most frequent user tweets referred to the design of the sunroof. Based on our data, the configuration of the sunroof does not appear to meet consumers’ expectations. Therefore, in the idea generation or the idea screening processes, the company should pay more attention to the redesign of the sunroof.

Service: The score of the service indicator is − 0.33 (− 0.54 in price, − 0.11 in maintenance and − 0.34 in cleanliness) and the proportion of service is 0.21. According to our results, Tesla Model X’s overall service is in a poor state, especially in terms of price. For those users who tweeted about the product’s service dimensions, price is also the most eminent issue. So, this part will focus on the price of the Tesla Model X.

7 Discussion

7.1 Implications for research

In this paper, we set off to investigate how information retrieved from social media can assist decision-making processes and customer co-creation for NPD. Our analysis identified sensemaking patterns in Twitter data, thereby explaining the NPD process and the associated steps where the social interactions from customers can have an iterative role. We contribute to the literature of NPD using digital innovations (e.g. Akter et al. 2020; Fosso Wamba et al. 2019), as we discuss the importance of digital innovations (supported by technology-Twitter) in making people’s emotions evident and present, and how people appreciate (or disapprove) the design of a product. Furthermore, we acknowledge the role of Big Data Analytics and AI in this process (Trabucchi and Buganza 2019) as we attempt to make sense of the use of Twitter data during NPD. Hence, we extend current studies (e.g. Kumar et al. 2020) in that we use this data to make sense of the proposed products during the NPD process, reinvigorating the role of customers as active participants in the process of co-creation (Irani et al. 2017). Therefore, we contribute to the research on effective customer involvement to create synergies and networks that are necessary for the creation of products that satisfy customer needs and aspirations and the strategic goals of the organisation (Romero and Molina 2011; Niesten and Stefan 2019).

By analysing Twitter data, we illustrate the usefulness of social media in the co-creation process as a sense-making device that assists managers and developers to make sense of the data available (Lycett 2013; Wei et al. 2014) and customer sentiment and decision-making opportunities (Duan et al. 2019; Dwivedi et al. 2019) throughout the whole NPD process (Zhan et al. 2018; Du et al. 2016; Fuchs and Schreier 2011; Majumdar and Bose 2019). Therefore, we argue that the NPD is inseparable from the familiarity with the law of market operation and the understanding of customer needs. It is necessary to first form market data and information into new product (technology) concepts, and then combine concepts into research and development projects. Social media is the optimal platform where the company can conduct data/sentiment collection and analysis which is a fundamental prerequisite for NPD.

7.2 Implications for practice

This study has significant implications for practitioners and managers who are involved in NPD. It illustrates how companies can harness social media to improve aspects of the design. For instance, the results of the analysis of the case show that even though customers are satisfied in terms of the style/appearance of the studied product, 28% of the customers are concerned about the current style/appearance of the product. During the idea generation stage of product design, the company could consider what aspects of Model X’s performance can be improved. The sentiment analysis can assist in the design of a brand new sunroof, but also help the company to improve the existing sunroof. In this way, social media data can help a company get ideas from consumers, but also to collect feedback related to opinion generation. For the screening phase, the results suggest that social media can play an important role in the idea screening stage. Big data analysis based on social media can perform behaviour analysis on users. In this way, companies can establish evaluation criteria in the idea screening process in NPD. Within the product development phase, a company could understand the target market conditions and define target consumers at the early stage of the new product market strategy formulation. Consumers are constantly changing their communication methods and consumption patterns in the face of different new media and platforms. Social media analytics provides real-time data to understand consumers’ social media usage habits, social content, and the motivation behind them. Finally, in terms of commercialization, product managers (e.g. the product manager of Tesla Model X) need to consider these aspects, price, maintenance and cleanliness. On one hand, when faced with the advertising-based commercial product, the manager can emphasize this part of the repricing, as well as highlight free maintenance and cleaning services. On the other hand, product managers can develop value-added services, such as high-end paid maintenance, which can improve customers’ satisfaction.

Our study calculates a score from the sentiment analysis, which is very useful for product managers, as it determines the customer satisfaction of aspects of Tesla Model X. The proportional which results from the fuzzy theory is used to determine which aspects of the product are the main objects mentioned by the user on Twitter. To be important, the weight, the product between the score and the proportion, determines the priority of each part of Tesla Model X in terms of existing and limited human resources, financial resources, and technical resources.

To sum up, our study highlights that social media can positively contribute to all the stages of NPD of a company. Social media can play three distinctive roles in a company’s NPD process. Firstly, social media can help to explore the direction of innovation in new products. In addition to well-known platforms, many unfamiliar platforms, such as professional forums or blogs, also provide a wealth of information to help companies’ NPD. Social media can also change the traditional way of exploring new ideas. NPD is often influenced by unanticipated market segments. However, traditional marketing research is difficult to reach these segments. Relying on social media makes it easy to get relevant information. A company simply needs to develop processes and systems that enable social media users to participate throughout the process. In addition, social media can assist in the commercialization of new product launches. A company can use social networking sites to achieve broad user engagement and generate innovative and interactive ways to spark customer interest in new products.

8 Conclusion

An important application area of Data Analytics and AI is to mine and analyse a large number of product reviews on social media. Its primary purpose is to discover underlying patterns to assess more accurately certain capabilities and drawbacks of products or services (Chan et al. 2016; Fader and Winer 2012). The utmost interest of a product review is to support a company through AI in identifying and obtaining the characteristics or attributes of the product, know the subjective comments of the user, extract comments and collocations, and distinguish the user’s tendency (Singh et al. 2017) so that the company can plan and tailor the production strategy analogously. The extraction of information serves many roles for NPD, like market research, collections of customer’s requests and the choice of strategy. The information extracting and sensemaking of product reviews is mostly based on data mining techniques of AI. However, since the subject of the product review is the name of the product, the focus of the product review is to extract information around the specific characteristics of the product and the corresponding emotional words (Bruns and Stieglitz 2013).

With the popularity of Internet users, social media have become an important part of people’s daily lives. The sentiment analysis and fuzzy analysis of the evaluation of products in social media can affect the process of NPD to a certain extent. Based on our study’s objectives, we carried out an in-depth research and generated insights for the improvement of the use of social media in the NPD process through a combination of sentiment analysis and fuzzy mathematics for the identification of sensemaking patterns that can assist companies in several stages of NPD.

The limitations of the study have to do with the limited generalisability because of the choice of one case in a particular context. However, the aim of the study is not about statistical generalizability (Guba and Lincoln 1994; Lincoln and Guba 1990); rather, it is about theory generalization from the case findings (Yin 2009), where theory is informed of a phenomenon and results are judged based on the appropriate reasoning used when discussing the findings and conclusions.

A potential extension of our study could be to examine other operations decision-making areas, such as sales forecasting and supply chain risk assessment. It would be interesting also to focus on various stages of product development such as concept development, business analysis, and prototyping. The sentiment analysis method that we propose subdivides the emotional vocabulary to a certain extent and obtains five evaluation grades, which have achieved certain effects. However, several limitations allow further refinement of emotion and the understanding of the emotional semantic level. Exploring new methods of identification through AI of the emotional polarity and more dimensions could extend the research scope of this study for the future. To improve the reliability of fuzzy evaluation analysis, it is necessary to increase the research on the degree of discrimination of indicators. The evaluation information sentiment analysis classification results have a low recall rate and an unstable accuracy rate. In the future research, different data are selected for comparative experiments, and the reasons for the instability of the evaluation results are found, and the classification accuracy and recall rate of each level are further improved.