Introduction

In this service economy era, enterprises’ primary goals include rapid responses to consumer requirements, shortening the time needed to launch new products or services, and improving the quality of their products (Choi et al. 2011; Chen and Cheng 2009; Walstad and Rebeck 2002). To meet these goals, more and more companies are putting efforts into discovering consumer demands with regard to their brands, products, and services. This information can then be used as a reference when designing new products or making innovations. Therefore, the efficient gathering and analysis of consumer demands can help companies to remain competitive (Kuo et al. 2009; Bei and Shang 2006; Chen et al. 2016).

With the development of new technologies, a growing number of consumers now post their opinions about products or services on blogs, web forums, and bulletin board systems (BBS), as well as in e-mails. This phenomenon is called, “electronic word-of-mouth” (eWOM) or “online word-of-mouth” (Hennig-Thurau and Walsh 2003, Pai et al. 2013a). eWOM is essentially consumer product or service appraisals, and thus also include important consumer demands. In order to understand consumer demands, enterprises are therefore investing a substantial amount of money in the collection and analysis of eWOM content. However, the tremendous amount of such information that is available online often overwhelms these collection and analysis efforts, and results in information overload (Park and Lee 2009).

A number of studies have undertaken examinations of the transmission (Chen et al. 2014; Erkan and Evans 2016), influence (Chu and Sung 2015; Kim and Johnson 2016; Wu and Lin 2017), and marketing (Nieto et al. 2014) of eWOM. Currently, however, there are very few eWOM-related studies that discuss the analysis of consumer demands through computerized eWOM. The most similar work to the current on is Archak et al. (2011), which defined the characteristics of a product and then brought up specific descriptions of the product from the eWOM content. The characteristics were then used to find consumers’ needs. Appraisal words are the sentiments, opinions or thoughts consumers have toward products. As a result, Archak et al. (2011) used POS (part of speech) tagging to find out the nouns from the online appraisals, which were then used as the appraisal words. However, not every noun could express consumers’ sentiments, opinions or thoughts. Besides, the analytical results of his research tended to be more subjective, because it required much labor used to cross reference the eWOM data, noun and appraisal words. Since most appraisals are done with sentiment words (Feng et al. 2011; Söderlund and Rosengren 2007), by applying emotion analysis techniques to eWOM, it is expected to that it will be possible to find the nouns related to sentiments. Instead of carrying out the analysis through traditional labor-intensive approaches, the current study applied many mature information technologies and method to do a consumer needs analysis using the eWOM data, nouns and appraisal words. Therefore, the analytical results of this study tend to be more objective. As such, this study is different from Archak et al. (2011) with regard to the data it analyzes.

By using appraisal words, it is expected that it will be possible to better understand consumers’ preferences and get more comprehensive results. However, few studies have been conducted to investigate methods for the semantic analysis of eWOM. One of the advantages of analyzing eWOM content is that it directly reveals consumer opinions, which can enable enterprises to monitor market changes in real time. Nevertheless, the difficulty of analyzing eWOM lies in the complexity of the semantic content. Since consumers often use “text speak” to post their opinions online, and because there are no standard writing formats and unified descriptive words for online reviews (Pai et al. 2013a), the difficulty of analysis is increased. These problems also prevent computer programs from analyzing the text, and thus extracting the meanings of these terms is even more difficult. Despite the abovementioned difficulties, a number of enterprises continuously collect and analyze eWOM, aiming to find out consumer demands. Nevertheless, traditional eWOM data collection wastes a lot of time, which usually results in delayed reactions to the market. Therefore, this study proposes a computer-based approach for the analysis of consumer demands. In order to develop a computerized eWOM analysis system, eWOM collection programs are employed to replace human labor in terms of searching and collecting eWOM content. The process of appraisal extraction and appraisal word classification is thus computerized to replace human labor. This use of information technology is expected to boost the speed of eWOM content analysis. In addition, traditional research into business administration or information technology puts emphasis on issues such as the spread, decision-making, and market-influence of eWOM. In contrast, information technology is rarely employed for the collection and analysis of eWOM. Therefore, employing information technology to analyze eWOM is an issue that deserves more attention. This study does not develop new algorithm or techniques for eWOM, but it applies well-developed information technology to develop a new application, and thus can be easily adopted by enterprises, and this indicated the value of the current work.

In summary, this study proposes a computer-based approach for the analysis of consumer demands. However, before constructing a complete system, a conceptual model or theory is suggested to be used as the foundation for developing it. As such, the system design is based on a review of the literature and is more reasonable than would otherwise be the case (Chen and Chen 2012; Chen 2011; Pai et al. 2013a; Pai et al. 2013b). Specifically, this study has the following objectives: (i) constructing a consumer demands analysis model in eWOM; (ii) designing a consumer demands analysis procedure; and (iii) developing a system for analyzing consumer demands. Finally, this study carries out an empirical test to validate the proposed method.

Consumer demands analysis model using eWOM

In order to analyze consumer demands efficiently, this section reviews traditional approaches for analyzing them and investigates the problems that they have. Approaches to the analysis of the relationship between appraisal words and consumer demands are also discussed. Finally, based on this review of the literature, details of the design of the consumer demand analysis model using eWOM are presented.

Review of traditional approaches for analyzing consumer demands

In order to gain the satisfaction and support of customers, businesses have long developed their products and services based on consumer opinions (Bai et al. 2008; Deng et al. 2010; Jiménez-Jiménez and Cegarra-Navarro 2007; Kim et al. 2004), with many enterprises and scholars using questionnaires to identify key consumer demands. The various methods that are used in the studies examined in this literature review to investigate consumer demands are shown in Fig. 1 (Kwon 2005; Grigoroudis and Siskos 2004; Park 2004), and described below.

Fig. 1
figure 1

Traditional approaches for analyzing consumer demands

Collection of evaluation factors

Very often a researcher first collects the critical factors that influence purchasing decisions by employing two methods: First, experts are interviewed to identify evaluation factors. Second, a literature review is undertaken to identify evaluation factors. Nonetheless, these two data collection methods have their limitations. Insufficient information from the experts may jeopardize the reliability of the study, and an incomprehensive literature review can also hinder reliability.

Selection of evaluation factors

After extensive data collection, the evaluation factors are organized and classified. This requires researchers to sort the data, and repetitions of similar evaluation factors are eliminated. However, this process is time-consuming and laborious.

Confirmation of consumer demand factors

The generated evaluation factors are subsequently used to design a questionnaire. By this process, the exact consumer demand factors can be identified.

The procedures outlined above require a considerable amount of work to filter and organize the collected data. However, using a computer-based approach it is possible to use automation to reduce the complexity of this process and avoid errors, and thus improve the accuracy of consumer demands analysis.

Analysis of the relationship between appraisal words and consumer demands

To attain effective analysis of consumer demands from eWOM using computers, this research focuses on examining eWOM to gain insights. eWOM comes from consumers’ actual encounters with the focal product or service, and includes valuable information on key consumer demands (Pai et al. 2013a). Consumers share their demands in eWOM using appraisal words when they have positive or negative experiences with a product or service. On the other hand, when the experience is not significantly good or bad, consumers tend not to use appraisal words to express their demands. That said, in general appraisal word indicate consumer demands. Pai et al. (2013b) carried out a SWOT analysis on products or services by examining positive or negative appraisals. Enterprises can then use the results of the SWOT analysis to improve their products or services. SWOT is a tool for market analysis (Kotler and fox 1995), and this is done to better understand consumer needs and demands from the mass market. As a result, we can figure out consumer needs from eWOM appraisals (Pai et al. 2013b).

There are many studies that use keywords to obtain predictions of the economy or consumer decision-making. For example, Choi and Varian (2012) utilized Google Trends to carry out keyword analysis. The frequencies of the analysis were used as the near-term values of economic indicators. Moreover, Preis et al. (2013) found that the frequency of keywords could be used as “early warning signs”. When a keyword is searched for more frequently, this may imply that it interests a lot of people. The frequencies of keywords can thus be used to assist investment decision-making. Similarly, when there are a lot of consumers who use a specific appraisal word to express their opinions about a product, the specific appraisal words used can present the needs and expectations of consumers. When the frequency of appraisal words rises or drops dramatically, then this can be seen as an early warning sign that businesses should pay attention to.

Generally speaking, the amount of appraisal a product or service receives tends to be steady, and a sudden shift in this would indicate a change in consumer demands. For example, certain aspects of a product would attract consumers’ attention, thus leading to more appraisals, or another product’s lack of features would not receive much interest, thus leading to a fall in the desire to discuss it. Therefore, when there is a significant change in the number of appraisals, it is likely that consumers’ demands have altered. A company’s marketing department and management could utilize this as a basis for determining consumer demands, and find out the reasons behind such changes. Statistically, the number of eWOM appraisal words is increasing every day. The sample of appraisal words collected in this study is n ≥ 30, and thus a large sample pool. Based on central limit theorem, no matter what kind of distribution a population has, the sampling distribution approaches a normal one. Therefore, this research assumes that eWOM appraisal words are normally distributed, and under a 95% confidence interval the frequencies of eWOM appraisal words would fall into the range of [μ + 2σ, μ-2σ]. When the number of appraisal words exceeds the 95% confidence interval range, it means there is a significant change in the number of appraisals.

Frequencies have been used in many studies as an indicator of future trends. For instance, Google Trends uses the frequency of searches for keywords to determine trends in various related areas. Fig 2 shows a progression graph for the keyword “KFC” (Kentucky Fried Chicken) between 2005 and 2010, and the letters A to M indicate different events during this period. As shown by the graph, the number of searches for KFC changes with each event, and this is especially true for events J and K. It could then be inferred that these two events attracted the most attention from consumers online. In this research, we focus on understanding consumer demands using appraisal words, and the consumers’ perceptions are shown by significant upswings or downswings in the use of these. In short, the frequencies of the appearance of appraisal words are used as indicators for identifying consumer demands.

Fig. 2
figure 2

Keyword trends for “KFC”

Currently, many enterprises utilize eWOM to understand the competitive situations of their markets. They also use eWOM to understand the needs of consumers. Based on their understanding of consumer needs, enterprises can then take actions to cater to these, such as by making innovations or improving their products (Peter 1999). This also indicates that eWOM contains information about consumer needs. eWOM is thus an intellectual tool that is valued by enterprises, and many enterprises have started to collect and analyze eWOM to understand the needs of consumers. Therefore, we can view the appraisals or opinions of eWOM as consumer needs. A lot of appraisals related to products or services are expressed by using sentiment words (Söderlund and Rosengren 2007). Researchers such as Pai et al. (2013a) thus used sentiment analysis to extract information. The advantage of using information technology to process eWOM is that it can obtain eWOM-related information automatically. Businesses can then follow consumers’ opinions or appraisals and respond to them more efficiently. Current information technology is able to collect and analyze eWOM automatically, so making it easier to understand consumer needs.

To effectively understand the relationship between appraisal words and consumer demands, we use one case for the purpose of demonstration. For example, eWOM authors use appraisal words, such as “delicious,” “smells good,” and “not bad,” to express their feelings about a food. However, eWOM authors’ use of such words varies due to their different demographic backgrounds, such as environment, education, and age. Different appraisal words from a variety of authors can cause difficulties in computerized content analysis. To clarify this information, appraisal words have been organized based on the following characteristics.

Synonyms

These are appraisal words that describe similar consumer demand factors. For example, for the consumer demand factor “flavor,” “concentrated” is synonymous with “nice-smelling,” while for the consumer demand factor “environment,” “beautiful” is synonymous with “clean” (Fig. 3).

Fig. 3
figure 3

Examples of synonymous appraisal words

Antonyms

Appraisal words that describe similar consumer demand factors but are opposite in meaning. For example, the appraisal words “nice-smelling” and “bad-smelling” express positive and negative flavor, respectively. Therefore, “nice-smelling” and “bad-smelling” are antonyms that describe the same consumer demand factor “flavor” (Fig. 4).

Fig. 4
figure 4

Examples of antonymous appraisal words

Multi-relational

Appraisal words can sometimes be used to describe multiple consumer demand factors. For example, “not bad” and “very good” describe both the “flavor” and “environment” consumer demands factors.

The discussion above demonstrates that appraisal words encompass the concepts that consumers wish to express, which can represent key consumer demands. The present study classified appraisal words into categories based on their characteristics, as shown in Table 1.

Table 1 Definitions of appraisal word characteristics

Design of consumer demand analysis model in eWOM

Based on the analysis of the relationships between appraisal words and consumer demands, this study designed a consumer demands analysis model. This may help enterprises to have a better understanding of consumer demands, as shown in Fig. 5. In order to solidify the theoretical foundation of the consumer demands analysis model, this study applied the eWOM management framework for service experience (Pai et al. 2013a), and DIKW hierarchy (data, information, knowledge and wisdom hierarchy) (Rowley 2007) as the framework for designing the consumer demands analysis model. The consumer demands analysis model mainly consists of eWOM collection, appraisal extraction, and appraisal word classification, as is explained in more detail below:

Fig. 5
figure 5

Consumer demands analysis model in eWOM

eWOM collection

eWOM belongs to the data level in DIKW hierarchy. The sources of eWOM were from blogs, web forums, and bulletin board systems (BBS). In order to effectively collect eWOM, this study used eWOM management framework for service experience (Pai et al. 2013a) as the framework. The consumer demands analysis model was used to design the eWOM collection stage, and the eWOM content was collected mainly from blogs, web forums, and bulletin board systems (BBS).

Appraisal extraction

eWOM includes opinions and appraisal from consumers. The consumers often use appraisal words to express their opinions and thoughts. Therefore, the appraisals are important information for eWOM data. This study applied the theory of DIKW hierarchy to extract these, with the appraisal words defined at the information level. In addition, we used the eWOM management framework for service experience (Pai et al. 2013a) to design the appraisal extraction stage in the consumer demands analysis model. The aim here is to find out important appraisal information from eWOM data.

Appraisal word classification

Classification is a common tool that is often used in knowledge management. However, each appraisal word is an independent piece of information. In order to construct the relationships among appraisal words, we applied an appraisal classification to produce the knowledge context between appraisal words. Therefore, based on DIKW hierarchy theory, this study defined appraisal knowledge at the knowledge level. In order to solidify its theoretical foundations, this study used the eWOM management framework for service experience (Pai et al. 2013a). We designed the appraisal word classification stage in the consumer demands analysis model. The primary purpose of this is to classify and categorize appraisal words. After this, the knowledge context of appraisal words is produced.

Consumer demand identification

Wisdom uses knowledge as its foundation. Through classifying, organizing, and analyzing this, new applications or values are found. Nonetheless, eWOM does not generate wisdom, but only business intelligence (BI). BI is a combination of statistics about business operations and analytical tools. It mainly provides information concerning market trends so that enterprises can carry out business planning or decision making (Watson and Wixom 2007), Hence, finding out consumer demands through eWOM can be said to be a kind of BI. BI and wisdom belong to advanced knowledge, that is, the wisdom level in the DIKW hierarchy. In order to solidify the theoretical foundation of the consumer demands analysis model, the researchers of this study referred to the eWOM management framework for service experience (Pai et al., 2013) in the designing consumer demand identification stage for the consumer demands analysis model. The primary function was to analyze appraisal knowledge so that the analytical results could serve as the basis to identify and filter consumer needs.

Methods and technologies

This study integrated the approaches shown in Fig. 5 into the design of procedure for analyzing consumer demands in eWOM. This procedure includes eWOM collection, appraisal extraction, appraisal word classification, and consumer demands identification.

Procedure for analyzing consumer demands in eWOM

Based on the design of the consumer demands analysis model in eWOM in Fig. 5, this study develops procedures for consumer demands analysis. In this way, the enterprises may be able to better understand consumer demands, as shown in Fig. 6. The proposed consumer demands analysis in the eWOM procedure consists mainly of eWOM collection, appraisal extraction, appraisal word classification and consumer demands identification, as described in the sections that follow.

Fig. 6
figure 6

The procedure for consumer demand analysis from eWOM

Fig. 7
figure 7

eWOM collection process

eWOM collection

eWOM content is collected from the Internet with content collection and identification technologies, as follows:

  • Content collection: The names of products, services, and companies are the keywords that are searched for in the web content.

  • eWOM identification: The collected web content is filtered for appraisal words.

Appraisal extraction

Semantic analysis and content extraction are conducted using the eWOM content to identify appraisal words related to the target products or services:

  • Parts of speech tagging: All of the parts of speech in the text are identified.

  • Appraisal analysis: Sentiment analysis technologies monitor unknown appraisal words.

Appraisal word classification

Appraisal words include consumer demands. Therefore, this study uses calculations of similarity to determine the relationships between appraisal words and consumer demands. The classifications for appraisal words are then used to calculate the frequencies of the appearance of appraisal words in consumer demand factors. The number of appearances are saved month-by-month.

Consumer demand identification

By analyzing the frequencies of appraisal words in relation to consumer demand factors, consumer demand identification is conducted to identify changes in these demands.

eWOM collection

In order to effectively collect eWOM content, a collection method was developed to extract specific content related to eWOM communication and to exclude non-related content (Fig. 7). The eWOM collection processes mainly include “content collection” and “eWOM identification”, as described below:

Content collection

To ensure efficient and accurate collection of related content (about specific products, services, or brands), the following processes and technologies were developed.

  • Content crawler: By searching web content using keywords (e.g., products, services, or brands), items of interest are extracted from blogs and online communities via the Google API (Application Programming Interface).

  • HTML structure analysis: Each blog and online community has a unique HTML tag format. After obtaining the web content, the HTML structure is analyzed to facilitate extraction.

  • Automatic marking: Web content is often presented in semi- or non-structured formats. Analyses of web content is generally hampered by the use of “text speak”, special symbols for certain online communities, a lack of punctuation, and other challenging characteristics. To address these problems, an automatic marking function is developed to analyze and categorize HTML language. This also punctuates the eWOM content with the aid of a rule database, thus improving the accuracy of the analysis of the parts of speech in Chinese texts (Fig. 8).

Fig. 8
figure 8

Automatic punctuation rules

  • HTML tag refinement: Tag refinement is primarily applied to eliminate advertisements and HTML language links, and thus only the important parts of the web content are retained, improving the efficiency of the follow-up procedures. This content is saved as a “*.txt” file.

eWOM identification

The collected web content might still include information not related to eWOM communications. The following filters are applied to extract only the required content.

  • Appraisal word filtering: An appraisal word database is employed to locate web content that includes the appraisal words.

  • Content filtering: Repeated content is deleted from the selected web content.

Appraisal extraction

Despite applying the above-mentioned filtering processes, the content still included some unidentified appraisal words. The following techniques were applied to determine the appraisal words that are commonly used by consumers, as shown in Fig. 9.

Fig. 9
figure 9

Appraisal extraction process

Parts of speech tagging

The Chinese Knowledge Information Processing (CKIP) system, developed by Academia Sinica in Taiwan, is used to tag parts of speech.

Appraisal analysis

Consumer opinions have been frequently analyzed with sentiment analysis technologies (He and Zhou 2011; Fan and Chang 2010; Vechtomova 2010), and a variety of methods can be used with Chinese text. For instance, Pai et al. employed the Chinese sentiment analysis method proposed by Ku et al. (2005) to develop an eWOM analytical method (Pai et al. 2013a), further developing a SWOT analytical technique (Pai et al. 2013b) using the results of eWOM analysis. This work thus uses the analytical method in Pai et al. (2013a) and Ku et al. (2005) as the basis of appraisal analysis, with the following steps applied for analysis of unknown appraisal words.

First and foremost, this study conducted a calculation of the appraisal character weight, which indicates positive or negative values, as calculated with eqs. (1) and (2).

$$ {P}_{ci}=\frac{fp_{ci}}{fp_{ci}+{fn}_{ci}} $$
(1)
$$ {N}_{ci}=\frac{fn_{ci}}{fp_{ci}+{fn}_{ci}} $$
(2)

where ci is the i th character in the appraisal character set, fp ci is the number of times appraisal character ci appears in the positive appraisal word database; fn ci is the number of times appraisal character ci appears in the negative appraisal word database; P ci is the positive weight of the appraisal character ci, and N ci is the negative weight of the appraisal character ci.

Equations (1) and (2) denote the number of times an appraisal character appears in the positive and negative appraisal word databases. Equations (3) and (4) regularize eqs. (1) and (2), respectively.

$$ {NP}_{ci}=\frac{fp_{ci}/\sum_{j=1}^n{fp}_{cj}}{fp_{ci}/\sum_{j=1}^n{fp}_{cj}+{fp}_{ci}/\sum_{j=1}^m{fn}_{cj}} $$
(3)
$$ {NN}_{ci}=1-{NP}_{ci} $$
(4)

where fp ci is the number of times appraisal character ci appears in the positive appraisal word database; fn ci is the number of times appraisal character ci appears in the negative appraisal word database; n is the total number of appraisal words in the positive appraisal word database, and m is the total number of appraisal words in the negative appraisal word database; NP ci is the positive weight of appraisal character ci in the positive appraisal word database, and NN ci is the negative weight of appraisal character ci in the negative appraisal word database. The above-mentioned formulae are applied to weigh the appraisal characters, with the results shown in Table 2.

Table 2 Results of analyzing appraisal character weights

Based on these results, we carried out a calculation of appraisal word weight. In eq. (5), S cj is the appraisal character weight of each appraisal character. NP ci and NN ci are the positive and negative weights of the appraisal characters, respectively, and are used to assess the meanings of these characters. For example, if NP ci is greater than NN ci , the appraisal character is positive, and vice versa.

$$ {S}_{ci}={NP}_{ci}-{NN}_{ci} $$
(5)
$$ {S}_w=\frac{1}{p}\times \sum_{j=1}^p{S}_{cj} $$
(6)

Equation (6) primarily evaluates whether the appraisal word is positive or negative. S w is the weight of the appraisal word, and is either positive or negative, p is the number of appraisal characters in the appraisal word, and S cj is the appraisal character weight of each appraisal character. Table 3 gives the results of this process.

Table 3 Results of analyzing appraisal word weight

Appraisal word classification

Based on the analysis of the relationships between appraisal words and consumer demands carried out above, processes were designed to analyze the relationships between appraisal words and consumer demand factors by calculating the frequencies with which the appraisal words were found in consumer demand occurrences. This study analyzed the words with similar meanings in both appraisal words and consumer demand factors, with the results used as the basis for the categorization of the former. This study found the following 21 demand factors in a review of the literature: reliability, responsiveness, competence, access, courtesy, communication, credibility, security, understanding/knowing, tangibles, food quality, food price, food innovation, food rarity, natural food, food flavor, food mouthfeel, serving size, convenience of location, restaurant environment and restaurant style (Currie et al. 2009; Liu and Jang 2009; Ziethmal and Bitner 2003; French et al. 2000; Kivela et al. 1999; Earle 1997; Lee and Ulgado 1997). This study then constructed a database containing synonyms for all of these demand factors. There were two sources for this database. The first was the manual collection and classification of synonyms. The second was a comparison between the results of appraisal word classification and the synonyms database. Through this process, the synonyms that were not in the database were collected.

The Jaccard coefficient was applied to calculate the similarity between appraisal words and the synonym corpus for the consumer demand factors. Their similarity was analyzed, and the results used as the basis for categorizing the appraisal words, using eq. (7).

$$ sim\left({a}_k,{d}_{ij}\right)=\left|\frac{a_k\cap {d}_{ij}}{a_k\cup {d}_{ij}}\right| $$
(7)

Based on eq. (7), the results of appraisal word classification are listed in Table 4. a k is the appraisal word set, a k ∈{a 1 , a 2 , a 3 ,…, a E }, and E indicates the number of appraisal word sets(k = 1 ~ Z). As for “d ij ”, i is the number of consumer demand factors, j is the ranking of the synonym in the corpus. This study uses appraisal word sets and the synonym corpus of consumer demand factors to construct a binary matrix, as shown in eq. (8).

$$ \begin{array}{l}\\ {}sim\left({a}_k,{d}_{ij}\right)=\left[\begin{array}{cccc}\hfill \mathit{\operatorname{Max}}\left[sim\left({a}_1,{d}_{1j}\right)\right]\hfill & \hfill \mathit{\operatorname{Max}}\left[sim\left({a}_1,{d}_{2j}\right)\right],\hfill & \hfill \cdots, \hfill & \hfill \mathit{\operatorname{Max}}\left[sim\left({a}_1,{di}_j\right)\right]\hfill \\ {}\hfill \mathit{\operatorname{Max}}\left[sim\left({a}_2,{d}_{1j}\right)\right],\hfill & \hfill \mathit{\operatorname{Max}}\left[sim\left({a}_2,{d}_{2j}\right)\right],\hfill & \hfill \cdots, \hfill & \hfill \mathit{\operatorname{Max}}\left[sim\left({d}_2,{d}_{ij}\right)\right]\hfill \\ {}\hfill \cdots \cdots \hfill & \hfill \cdots \cdots \hfill & \hfill \ddots \hfill & \hfill \cdots \cdots \hfill \\ {}\hfill \mathit{\operatorname{Max}}\left[sim\left({a}_k,{d}_{1j}\right)\right],\hfill & \hfill \mathit{\operatorname{Max}}\left[sim\left({a}_k,{d}_{2j}\right)\right],\hfill & \hfill \cdots, \hfill & \hfill \mathit{\operatorname{Max}}\left[sim\left({a}_k,{d}_{ij}\right)\right]\hfill \end{array}\right]\end{array} $$
(8)
Table 4 The similarity between the appraisal word set and consumer demand factors

In the binary matrix, sim(a k ,d ij ) indicates the similarity between the appraisal word set and consumer demand factors. If the value of similarity between appraisal words and consumer demand factors is high, then the value of sim(a k ,d ij ) will almost be equal to 1, while the value of r(a m ,b n ) will be near or equal to 0, as can be seen in Table 4.

In order to program the categorization of appraisal words, this study uses eq. 8 to design the algorithm for appraisal word classification, as shown in Table 5.

figure a

The frequencies at which consumer demands occur are saved through the method of appraisal word classification. As shown in Table 6, t n indicates the number in each month, and (t∈{t n ,t = 1,2,3,…,n}), DF 1 to DF 21 stand for consumer demand factors, including reliability, responsiveness, competence, access, courtesy, communication, credibility, security, understanding/ knowing, tangibles, food quality, food price, food innovation, food rarity, natural food, food flavor, food mouth feel, serving size, convenience of location, restaurant environment and restaurant style, respectively.

Table 6 The frequency of consumer demand occurrence

Consumer demands identification

Based on the results of appraisal word classification, the frequency with which appraisal words are found in consumer demand factors each month could be calculated. To translate these numbers into tangible enterprise knowledge, this research designs a consumer demands identification technique which analyzes and identifies changes in consumer demands. This technique is explained below.

As shown in Fig. 10, consumer demand in each factor includes many appraisal words. According to the central limit theorem, when the sampling number of appraisal words is n ≥ 30, the consumer demands in each factor are normally distributed. Therefore, this research assumes that the consumer demands in each factor are normally distributed, and under the 95% confidence interval the frequencies of consumer demand occurrences would fall into the range of [μ + 2σ, μ-2σ]. Therefore, when the frequency of consumer demands found in consumer demand occurrences is out of the 95% confidence interval range, then this is defined as an obvious change in consumer demand factors.

Fig. 10
figure 10

Estimation and confidence intervals

Equation (9) calculates the mean value of the frequency of consumer demand occurrences per month, where t is the time set (t ∈{t j , t = 1,2,3,…,n-1}). Equation (10) calculates the standard deviation for the frequency of consumer demand occurrences, where x t stands for the frequency of consumer demand occurrences for each month.

$$ \upmu =\frac{1}{\mathrm{n}-1}\sum_{\mathrm{t}=1}^{\mathrm{n}-1}{\mathrm{x}}_{\mathrm{t}} $$
(9)
$$ \sigma =\sqrt{\frac{1}{n-2}\sum_{t=1}^{n-1}{\left({x}_t-\mu \right)}^2} $$
(10)

Using the example, we explain the results of the calculations from eqs. (9) and (10). As summarized in Table 7, t n specifies the time set, μ stands for the mean value, σ stands for the standard deviation, and DC indicates whether consumer demands have changed. If the company is interested in whether consumer demand factors have changed in t 10 , we should first use Table 7 to find out the frequencies of appearance for DF 1  ~ DF 21 during t 10 , and if the frequencies of the appearance of consumer demand factors (DF 1  ~ DF 21 ) are larger than μ + 2σ or smaller than μ-2σ during t 10 , this indicates the consumer demands factors have changed. On the other hand, if the frequencies of the appearance of consumer demand factors (DF 1  ~ DF 21 ) are smaller than μ + 2σ or larger than μ-2σ during this time, this indicates that consumer demand factors have not changed.

Table 7 Consumer demands identification

Case study

Based on the above-mentioned analysis tools, this study develops a computer-based approach for analyzing consumer demands. This section describes an implementation of this system using C#, and a case study is used to evaluate the effectiveness of this approach.

Prototype

The research set-up includes an Intel Core2 Duo E6420–2.13G PC, running Microsoft Windows XP Professional, Internet Information Services (IIS), Microsoft SQL Server 2008, and Microsoft Visual Studio 2010.

Screenshots of the system are presented in Figs. 11-15. Fig 11 shows the user interface screen for content collection, Fig. 12 shows the semantic analysis and extraction of the content, while Fig. 13 shows the classification of the appraisal words. Fig. 14 shows the frequency results of consumer demand occurrence, and Fig. 15 presents the results of consumer demands identification.

Fig. 11
figure 11

Collection of eWOM content

Fig. 12
figure 12

Semantic analysis and extraction of eWOM content

Fig. 13
figure 13

Classification of appraisal words

Fig. 14
figure 14

Frequency results of consumer demands occurrence

Fig. 15
figure 15

Results of consumer demands identification

System evaluation from a user satisfaction perspective

In order to examine the effectiveness of the proposed mechanism and assist strategic planning managers, marketing personnel, and R&D personnel to carry out marketing strategies, this study used a user satisfaction questionnaire to conduct a system evaluation, adopting the approach from Li and Chang (2009), Chen and Chen (2012), and Pai et al. (2013b). The literature review was used as the basis of this study, and the results were used to carry out the design of user satisfaction questionnaires items. The items in the instrument were rated on a five-point Likert scale, with 1 = strongly disagree, 3 = undecided, and 5 = strongly agree.

This research was conducted using both control and experimental groups. The results of a system evaluation showed that the proposed method could be used as an effective basis for product and service planning, and that it could assist strategic planning managers, marketing personnel, as well as research and development departments. In the case of the control group, the participants evaluated the traditional questionnaire surveys on consumer needs. For the experimental group, the participants were provided access to the proposed method and were then surveyed on their opinions as to the accuracy and effectiveness of the method, given their professional knowledge and judgment. Finally, based on the surveys of the two groups and a t-test, we analyzed the level of satisfaction with these methods to determine whether any significant differences existed between the traditional approaches (marketing surveys) for analyzing consumer demands and the proposed approach. Convenience sampling was applied in this study, and the questionnaires were administered to students of executive master of business administration (EMBA) programs from three universities in southern Taiwan. The students were asked to evaluate the system used in this study. However, only one of the universities was willing to fully participate in this work, and only eighteen participants were willing to take part in the evaluation of the system. These participants who took part in the experiment all had a bachelor’s degree in business administration, and were mainly from small and medium-sized enterprises in Taiwan. The participants included strategic planning managers and marketing personnel, as well as research and development personnel. Each participant had to possess related knowledge and real-world experience of using marketing surveys and consumer demand analysis. When the participants took part in this study, they had all been working for at least 15 years, and were all company leaders or experts in their line of business. Therefore, the results derived from their opinions could be said to have expert validity.

The results of the analysis are shown in Table 8. The responses to questionnaire items 1 to 6 indicate that the method proposed in this work could be seen as effective. The results also indicated that the mechanism can be used to retrieve reliable information about a changing market and consumers. The results of the eWOM analysis on consumer demands can be seen as both objective and reliable, and so useful for market planning. This study also interviewed the participants, and they stated that one of the benefits of the method was that it provided timely and extensive consumer information, which is less likely to be obtained with traditional questionnaire surveys. The analytical results for questionnaire item 7 were not significant. The participants agreed that this study presented a new analytical method to analyze consumer demands. While verifying the application and value of this method requires long-term study, many participants thought that eWOM analysis may be what the business world wants, and that the analytical results of this study might be able to help enterprises to innovate and improve their products or services.

Table 8 Results of the performance evaluation of the proposed consumer demands analysis method

In summary, the results of analyzing the questionnaire indicate that the method proposed in this study is better than traditional questionnaire surveys. It may thus be able to offer timely and extensive analytical results that help managers make better decisions. However, many participants thought that the method still requires long-term analyses and observations to verify its reliability. A short-term study was carried out in this worked because we wanted to find out any problems with this approach and then work to address them. As such, future studies are needed to extend the work of the current project.

Conclusions and perspectives

A computer-based approach was developed in this study to analyze consumer demands, and this can help enterprises develop their product lines and improve their services. This work provides primary findings and has the following contributions to the literature. First, a user satisfaction survey was conducted to reveal the effectiveness of this system, and the results prove that it is capable of analyzing consumer demands, as well as providing marketers and managers with valuable market information. Secondly, a computer-based approach was developed in this study to improve traditional methods of analyzing consumer demands. Finally, this paper employed computer programs to analyze critical consumer demands. Computerized analysis makes it easier for enterprises to collect and analyze this kind of data, while enhancing the accuracy of the process.

Based on the computer-based approach presented in this work, this study has three suggestions for future works. First and foremost, future studies should include processes to filter out eWOM information published by enterprises in order to increase the validity of the results. Secondly, this study used a questionnaire with marketers and managers for the purpose of system evaluation and to identify potential issues for future enhancement. However, the evaluation process should be continued for at least three to five years for more objective results. Therefore, a longitudinal study is suggested to determine the feasibility of this mechanism. Finally, the system presented in this study can only determine consumer demands valued by eWOM, and is not capable of prioritizing which consumer demands deserve more improvement efforts. As a result, future studies are suggested to take a step further in designing a computer-based approach that could prioritize the items that require the most attention.