Data-driven strategies in operation management: mining user-generated content in Twitter

Saura, Jose Ramon; Ribeiro-Soriano, Domingo; Palacios-Marqués, Daniel

doi:10.1007/s10479-022-04776-3

Data-driven strategies in operation management: mining user-generated content in Twitter

Original Research
Open access
Published: 10 June 2022

Volume 333, pages 849–869, (2024)
Cite this article

Download PDF

You have full access to this open access article

Annals of Operations Research Aims and scope Submit manuscript

Data-driven strategies in operation management: mining user-generated content in Twitter

Download PDF

Jose Ramon Saura ORCID: orcid.org/0000-0002-9457-7745¹,
Domingo Ribeiro-Soriano² &
Daniel Palacios-Marqués³

Abstract

In recent years, the business ecosystem has focused on understanding new ways of automating, collecting, and analyzing data in order to improve products and business models. These actions allow operations management to improve prediction, value creation, optimization, and automatization. In this study, we develop a novel methodology based on data-mining techniques and apply it to identify insights regarding the characteristics of new business models in operations management. The data analyzed in the present study are user-generated content from Twitter. The results are validated using the methods based on Computer-Aided Text Analysis. Specifically, a sentimental analysis with TextBlob on which experiments are performed using vector classifier, multinomial naïve Bayes, logistic regression, and random forest classifier is used. Then, a Latent Dirichlet Allocation is applied to separate the sample into topics based on sentiments to calculate keyness and p-value. Finally, these results are analyzed with a textual analysis developed in Python. Based on the results, we identify 8 topics, of which 5 are positive (Automation, Data, Forecasting, Mobile accessibility and Employee experiences), 1 topic is negative (Intelligence Security), and 2 topics are neutral (Operational CRM, Digital teams). The paper concludes with a discussion of the main characteristics of the business models in the OM sector that use DDI. In addition, we formulate 26 research questions to be explored in future studies.

A survey on sentiment analysis methods, applications, and challenges

Article 07 February 2022

Artificial intelligence in E-Commerce: a bibliometric study and literature review

Article 18 March 2022

Analyzing social media data: A mixed-methods framework combining computational and qualitative text analysis

Article 02 April 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

In the last decade, the development of new technologies has boosted business models and marketing strategies globally (Lee & Carter, 2011). Accordingly, the business ecosystem has sought to better understand new ways of automating, collecting, and analyzing data. These actions are developed to make predictions and offer added value to corresponding markets (Qiu et al., 2013). In this paradigm, technologies, artificial intelligence, blockchain, cloud computing, and Internet of Things (IoT) enable companies to develop business models focused on what is known as data-driven innovation (DDI) (Trabucchi & Buganza, 2019; Adamides & Karacapilidis, 2020).

DDI has become widely used in many organizations where decisions are made based on Big Data processes (de Camargo et al., 2018; Hossain et al., 2020). The main objectives of DDI include offering better products and services, improving communication actions, and optimizing the cost of investments (Jetzek et al., 2014). Of note, business changes boosted by the collection and management of data are performed not only on the performance level (Akter & Wamba, 2016), but also on the organizational level (Saura et al., 2021).

As argued by Sorescu (2017), data innovation has been among major emerging drivers to promote new products and services and to generate new business opportunities based on the digital ecosystem. These changes on both performance and organizational levels (Grant, 1996) have led to an increase in the relevance of data innovation in operations management (OM) (Duan and Edwards, 2018; Lins et al., 2021) as a new form of business development to improve results. However, none of the previous studies has identified the main strategies and characteristics of DDI applied in OM business models. Yet, these new strategies are destined to transform the OM industry in the next decade (Thomé et al., 2016; Awan et al., 2021).

Furthermore, like other business areas, OM (Barney, 1991) has been significantly influenced by data-centric decision-making (Akter et al., 2019) and the adoption of strategies in a data-based ecosystem (Yang et al., 2007). OM focuses on the creation of goods and services (De Menezes et al., 2010). Accordingly, OM encompasses the development of plans, organization, coordination, and control of information sources and flows to produce a company’s products and services (Toktay et al., 2000) from data innovation (Tiberius et al., 2021). Therefore, OM has a direct impact on companies’ products and services (Helfat & Peteraf, 2009).

As noted by Pryke (2009) and Farahani & Rezapour (2011), OM is influenced by what is referred to as ‘four Vs’: volume, variety, variation, and visibility. As concerns volume, OM organizes and coordinates large volumes of productions, business structures, as well as the processes that improve decision-making for managing the volume of orders of products, services, and their production (Ransbotham & Kiron, 2017; Liu et al., 2020).

Importantly, using DDI, the analysis and collection of data that a company works with to generate a large volume of orders and production can be automated (Raisch & Krakowski, 2021). Likewise, DDI can be used to predict the volume of goods and services procurement (Johnson et al., 2021). In terms of variety, the actions that a company can perform vary according to each industry (Lakshmi & Bahli, 2020).

However, using DDI, companies can optimize their non-profitable actions, identify new opportunities concerning the services they offer, and measure the efficiency of these services for organizations’ performance (Davenport, 2013; Saura, 2020). Of note, according to the variation in OM, the variety of actions performed by a company based on both economic and production possibilities can be optimized with data-centric strategies (Yue et al., 2014).

In addition, applying data-driven algorithms makes it possible to identify price variations or market strategies (Breidbach & Maglio, 2020). Regarding visibility, experience in OM is usually managed based on operation processes that provide visibility to the actions performed by companies and that allow users to have contact with those companies. In this respect, Hartmann et al., (2014) reported that the use of data can improve visibility of companies and optimize both customer experience and customer journey. Furthermore, as highlighted by Araz et al., (2020), OM and the use of data-centric strategies are relevant for short-term process optimization.

As argued by Thomé et al., (2016) and Awan et al., (2021), the influence of data-centric technologies and strategies in OM should be thoroughly investigated. Awan et al., (2021) proposed to explore the main strategies that should be developed in OM using data-driven strategies, data-based business models, or data-based tools, as well as to measure the influence of technologies such as Big Data (Wamba et al., 2017) and Artificial Intelligence in OM to improve the performance of business processes. Focusing on these contributions, in this study, we bridge this gap in the literature by identifying the main data technologies of new business models based on data in OM and proposing correct uses for their development in different industries (Saura et al., 2021a).

While the main challenges and objectives of data innovation theories highlight the influence of innovation on organization culture, technological sophistication, and commercialization of new products, among others (Saura, 2021; Goel, 2022), the present study aims to understand the impact of data innovation on the development of new business models for OM by identifying its characteristics according to the sentiment of user-generated data (UGD) in Twitter. Therefore, the originality of the present investigation lies in the scarcity of previous research on the characteristics of new DDI business models in OM using Twitter as a source of data and in applying text-mining techniques.

To this end, in the present study, we also develop an original method based on DDI approaches, which reinforces the originality of the present study. More specifically, we first develop a novel method that uses data-mining techniques to identify insights about the characteristics of new busines models in OM based on Twitter-based UGD. Accordingly, methods based on Computer-Aided Text Analysis (CATA) are used to validate results. Specifically, we employ sentimental analysis with TextBlob on which experiments are performed using vector classifier, multinomial naïve Bayes (MNB), logistic regression (LG), and random forest classifier (RFC). Then, a Latent Dirichlet Allocation (LDA), a topic modeling algorithm, is applied to separate the sample in topics based on the sentiment and to calculate keyness and p values. Finally, these results are analyzed with a textual analysis developed in Python.

To reach the objectives and address the problems applying a novel method in OM research field, this study addresses the following two research questions: RQ1: What are the characteristics of business models in OM focused on DDI strategies? RQ2: What are the main sentiments in UGC about the characteristics of business models in OM that use DDI?

The remainder of this paper is structured as follows. Section 2 presents a literature review of relevant past research on OM. Section 3 describes the methodology used in the present study. The results are reported in Sect. 4. Finally, in Sect. 5, the results are discussed, while Sect. 6 draws main conclusions and outlines implications of our findings.

2 Literature review

UGD as a data source based on OM has been widely used in the literature. As indicated by Saura et al., (2021a), UGD consists of both user-generated content (UGC) and user-generated data (UGD). In addition, UGD is a valid source of data to extract insights and thereby assist organizations in their decision-making or optimizing their actions on the Internet. For instance, Yuan et al., (2018) used an algorithm to make predictions on eCommerce sales performance using topic modeling and sentiment analysis. Likewise, in a study that explored investments from the OM perspective and the influence of Big Data, Liu & Yi (2018) showed how the use of company information processed with Big Data analytics can help to optimize investment processes and ratios.

In another relevant study, Chae (2015) used the hashtag #supplychain to extract the main practices that can be applied in research, thus using UGC as a source. Similarly, Liu et al., (2019) analyzed the challenges and risk factors for mass customization in OM using as data source platforms and social networks (Twitter among them as one of the main sources of information). Based on the results, the authors argued for using new technologies for OM in fashion systems.

Furthermore, in an analysis of Twitter messages, Gruber et al., (2015) evaluated the states of crisis in OM, as well as the leadership of executives and members of the company’s top management and the actions they take on social networks. The results of this study revealed the relevance of studying Twitter-based UGC in Twitter from the management perspective on the company level.

Likewise, in a case study based on OM, Liao et al., (2021) highlighted the influence of opinion leaders in social networks and innovation in communities. The authors emphasized that Twitter is a valid tool to conduct sentiment analysis and explored its influence on the management of communities on the Internet (Bruns et al., 2014).

In another study on exposure to the impact of opinions on social networks, Rad et al., (2018) analyzed UGC linked to opinions about news. The authors explored how these opinions influence the product lifecycle and linked them to companies’ OM strategies. According to the results of this study, modeling and simulation can help to establish future mechanisms to improve both data-driven and operational strategies of a company.

Finally, Giannakis et al., (2020) used social networks to find out patterns for the development of new business products using consumer sentiment analysis. The authors highlighted the importance of using Big Data analytics (Akter et al., 2016) and artificial intelligence (AI) (Waller & Fawcett, 2013) to improve operation efficiency. A summary of previous studies briefly reviewed above is provided in Table 1.

Table 1 Previous studies

Full size table

As can be seen in Table 1, there have been diverse previous studies on social networks that sought to identify new forms and characteristics of OM. However, although the UGC has been used as a valid data source in these studies, no academic contribution to date has aimed to identify the characteristics of OM in relation to the main data-driven strategies used by companies. Therefore, by identifying the main characteristics of OM’s business models and linking them to data-centric strategies, this study covers a gap in the literature that has overlooked using a data mining technique to analyze the data from Twitter and theorizing the results for the OM industry.

3 Methodology

In recent years, data-mining techniques have come to be extensively used to extract insights and generate knowledge (Verbeke et al., 2012). Following this trend, the present study uses Twitter-based UGD to create knowledge on OM (Franco & Esteves, 2020). To validate this approach, we focus on the use of computer-aided text analysis tools under the CATA framework (Short et al., 2010). According to CATA, validation can be built using computer-aided text analysis tools applied to different industries; then the extracted insights can be used to define major theoretical constructs of interest.

Following Pollach (2012) and McKenny et al., (2018), and aiming to build theory from the data extracted using data-mining approaches, we used the following three methodologies: (1) sentiment analysis with TextBlob (Vijayarani & Janani, 2016), on which experiments with SPC, MB, LG and RFC (Hiremath & Patil, 2020) were validated; (2) LDA, which was used to separate the sample into topics of different emotional valence (Thorsrud, 2020); and (3) textual analysis approach (Krippendorff, 2018) will be applied. The use of this methodology applied to OM opens up a new line of research focused on data. The originality of our methodology will allow other researchers to apply it to identify insights and create knowledge in relation to different areas of OM and combine knowledge discovery, data mining and information sciences in the OM research field. In what follows, we provide further detail on these approaches.

3.1 Data sampling

In order to collect information related to OM and innovation in product development and business models, we collected the following two types of data: (1) tweets with the hashtags #OperationManagement and #Innovation; this was done to obtain information of innovations in OM that were subsequently linked to product development or innovation business models; and (2) tweets with the hashtags #OperationManagement and #Data; these data were collected to obtain insights in relation to product development and innovation in business models based on the use of data and its linkage to DDI (Bermingham and Smeaton, 2011; Kontopoulos et al., 2013; Kim et al., 2018).

Tweets were collected from Twitter API during 3 months, as the volume of UGC does not exceed the standard measure of 5.000 tweets per day (Mittal et al., 2021). The data collection process started on April 1, 2021, and ended on June 15, 2021. The total sample amounted to n = 43.427 tweets. After the sample filtering and debugging process, Python and Pandas libraries were used to exclude from the dataset the tweets that did not exceed 50 characters, had URLs and symbols, as well as duplicate tweets or RTs. After filtering, a total of n = 32.283 valid tweets were included in the final dataset.

3.2 Sentiment analysis with TextBlob

Sentiment analysis is a methodological approach that can classify a sample of 32.283 tweets into subsamples containing data expressing different sentiments. These data can be composed of pieces of text, documents, or other types of input (Xia et al., 2015). Sentiment analysis can be conducted using different programming languages (van Atteveldt et al., 2021).

Upon sentiment analysis, the data are divided into categories that express different emotions (Hussein, 2018). In the present study, we focused on the text; therefore, images, emoticons, videos, and other multimedia elements were discarded from the tweets. This was done because the methodological approaches were used to analyze text, rather than multimedia elements (Medhat et al., 2014; Kumar et al., 2015).

An effective tool that has been effectively used for the development of sentiment analysis in the past and that was used in the present study is TextBlob, which was built in NLTK and Patterns (Hardeniya et al., 2016). Of note, while sentiment analysis can identify positive, negative, or neutral sentiments, among its limitations and challenges when categorizing the analyzed content are connotations and irony, as algorithms can barely recognize and classify these characteristics of the language (Gao et al., 2019).

Accordingly, in order to increase the quality of our results, we used the Texblob algorithm taking into account the considerations articulated by Bermingham and Smeaton (2011). Following the indications proposed by Bermingham and Smeaton (2011), polarity was classified from − 1 to 1, while the subjectivity values ranged from 0 to 1. The algorithm was trained a total of 587 times using tweets that manually classified by researchers in the form of text inputs. This was followed by the classification of the tweets into those expressing positive, negative, and neutral sentiments. According to Cherif et al., (2016), the more trained a machine-learning algorithm is, the greater are its predictive capacity and accuracy.

To determine the number of times to train the algorithm, we followed the indications formulated in Li & Zhou (2007). Once the algorithm was trained, four validations that focused on experiments with different development technologies for sentiment analysis in TextBlob were performed. Specifically, the database was experimented and validated with the following calcification models: SVC, MNB, LR y RFC. The results were validated with the analysis of the indicators precision, recall, f1-score, and support, which were also considered in terms of macro average and weighted average.

3.3 Topic modeling

LDA is an algorithm that makes it possible to identify topics on a sample (Thorsrud, 2020). This algorithm is a probabilistic assumption that can be developed using Gibbs sampling in the Mac version. LDA enables identification of different topics in a sample based on the statistical understanding of the words’ location and repetition in a dataset. LDA classifies documents (Hagen, 2018) into topics according to their relevance. The data in this case can be blocks of text in the form of reviews, Tweets, social media posts, and so forth (Park & Oh, 2017).

Before applying sentiment analysis to the Twitter database, the topic modeling algorithm was applied to three different databases. In the first one, LDA was applied to obtain positive topics, and then the process was repeated with negative and neutral tweets. According to Chen (2017), the LDA focuses on understanding of frequency and positioning of the words within this database; the output of this analysis is the automatic grouping of words in the database. Upon topic identification, all topics in a database are labelled using 10–20 most frequent words in each specific topic. While LDA has been widely used to analyze different datasets, none of the previous studies has used this algorithm to explore data about OM and DDI.

3.4 Textual analysis

Textual analysis, which evolved from content analysis proposed by Burrows (2014), is based on the assumption that the weight and word opposition can determine the meaning of patterns not manually identified by users (Krippendorff, 2018). Textual analysis measures the relevance of keywords found in the text to understand what insights and patterns can be identified from the analyzed topics and their sentiments (Chan & Chong, 2017).

In textual analysis, an important value is the weighted percentage (WP), a variable that measures the weight of a keyword in relation to the total database. Furthermore, in content and textual analysis techniques based on NLP, additional techniques can also be used to statistically represent the results of the word that compose a database (Saura et al., 2021). One such techniques is the exploration based on n-grams, i.e. numerical factors that analyze the word prefixes to understand their meaning. Considering that the database is divided into sentiments, the study of n-grams can provide additional insights into the research results. Furthermore, as highlighted by Sidorov et al., (2014) the results can be linked to the values of mutual information, represented by keyness and p-value variables. This measure makes it possible understand the occurrence probability of an indicator, correlations among variables, and the weight in terms of the relevance of the analyzed indicators.

4 Results

4.1 TextBlob sentiment analysis results

In the present study, the following four experiments were designed to develop the sentiment analysis with TextBlob: SPC, MNB; LG and RFC. Of note, accuracy is a measure that indicates the success of the sentiment modeling. Indeed, accuracy has been widely used in previous studies that used machine-learning as a driver of the experiment (Bermingham and Smeaton, 2011). Here, the higher is the accuracy of the results, the higher is the confidence in the outcome of the research. In the present study, the highest accuracy was found in relation to a Linear SVC Sl. No. 7 (0.869218) and 8 (0.860905). As concerns random forest classifier, the highest accuracy was 0.555445. The accuracy of multinomial Naïve Bayes amounted to 0.737801, while that of logistic regression was 0.837057. Table 2 summarizes the results of the comparison of the results and the different experiments that were conducted.

Table 2 TextBlob analysis by experiment

Full size table

Furthermore, different scores of TextBlob were obtained for each of the used models. Table 3 summarizes brief scores in relation to the model used. The comparison between the results of the experiment when working with machine learning is standard in procedures working with sentiment analysis (Li & Liu, 2014). As shown in Table 3, the highest values to the set of accuracy in the results were those corresponding to linear SVC and logistic regression (0.869218 and 0.837057, respectively).

Table 3 Brief scores of TextBlob analysis

Full size table

Furthermore, Table 4 shows the classification report for each sentiment: positive, negative, and neutral. The table presents the values of accuracy, recall, f1-score and support obtained for each sentiment. As indicated by Li & Liu (2014), accuracy is a variable that represents the quality of a machine learning model to perform the requested tasks. Likewise, following Supriya et al. (2016), recall reflects the number of parameters that the machine learning model can identify in the database from the total number of inputs. Trofimovich (2016) also highlighted the importance of the f1-score indicator, which is typically used to combine precision and recall into a single value.

This is a practical approach that facilitates a comparison of two performance-centered metrics of accuracy and completeness between various solutions. Furthermore, the support metric shows the predictive ability of the model. As indicated by Bermingham and Smeaton (2011), the macro average measures the total average of the model based on the analyzed variables. Finally, weighted average also measures relativity in terms of weight.

Table 4 Classification report of machine-learning model results

Full size table

As can be seen in Table 4, the highest accuracy indicator obtained for recall was 0.80 for negative sentiments and 0.92 for neutral sentiments.

4.2 Topic-modeling results

Upon application of LDA, a total of 8 valid topics related to the development of new products and business models in OM with the use of DDI were identified in the dataset. As discussed previously, and following Hong and Davison (2010), the 20 most frequent words in each topic were used to create labels for each of the identified topics.

Then, the connections of specific words to sentiments and specific topics were measured. Of note, since LDA is an automated process, researchers have to adapt their criteria in an exploratory and manual way. Likewise, the insights were previously linked to the sentiment analysis process, since LDA was applied to the set of tweets divided according to the sentiments expressed in them. In this way, the topics were identified as positive, negative, or neutral.

Thus, of the total number of modeled topics, Table 5 presents the topics that are relevant to the research questions addressed in the present study. Furthermore, following Ambrosino et al., (2018), we evaluated the relevance of each topic using keyness values. Keyness (Gabrielatos and Marchi, 2011) is a metric that measures the relevance of the topics and is defined as the strength of the link between the topics linked to the log-likelihood score values (Rayson and Garside, 2000). Through the use of keyness (Gabrielatos, 2018), the log-likelihood of > 3.8 was statistically significant when p-value < 0.05.

Table 5 Topic modeling results

Full size table

4.3 Textual analysis results

As discussed previously, textual analysis is used to derive insights based on relevance, frequency, and position of words in a dataset. Textual analysis is the third step in the application of data-mining techniques to create knowledge and extract valid insights (Asarta & Méndez-Carbajo, 2020) to build theory and understand the object under study. In the present study, we analyzed the number of times that keywords were repeated in the data, as well as their WP, as which is an indicator of the relevance and weight of a keyword and similar ones grouped within the same node (grouping words) in the total database (Loughran & McDonald, 2016).

In structuring the results, we followed content analysis directions proposed by Ibrahim and Ahmad (2010) and the NLP conceptual framework (Bos and Markert (2005). Pandas GroupBy in Python was used (McKinney, 2012). Table 6 presents the main words identified as a result of textual analysis, similar words that compose the same node, total frequency of these words, and their WP.

Table 6 Grouped keywords by topic

Full size table

Finally, in order to understand the structure of the content, and following Sapkota et al., (2015), we used n-grams to obtain additional insights. This approach is common in linguistic studies. An n-gram model can predict the occurrence of a word with the analysis of n-1 previous words. Accordingly, a bigram model (n = 2) works predicts the occurrence of a word given only its previous word (as n – 1 = 1). Table 7 shows the unigrams and bigrams indicators. Of note, as indicated by Daneshvar & Inkpen (2018), the study of n-grams allows enables researchers to analyze strong and stable relationships between different terms. Table 7 presents the main topics identified in the methodological process along with their Rank (R), Frequency (F) of words on the left (FreqL) and on the right (FreqR). Finally, the main collocates are also presented.

Table 7 N-grams for the identified collocates

Full size table

5 Discussion

In recent years, DDI strategies have been extensively used in the OM sector. Accordingly, organization, structure, and design of DDI strategies have opened up new opportunities for business development and the approach to data-centric decision-making (Wang et al., 2018). However, as argued by Zhang et al., (2019), the influence of DDI on OM and short-term trends remains poorly understood. To bridge this gap in previous research, in the present study, we used data-mining techniques to analyze Twitter-based UGC to identify the main themes relevant for the new business models focused on OM.

Zhao et al., (2016), we identified a total of 8 topics related to OM. The most relevant topic was the positive topic “automation” (keyness = 733.01; p = 0.045). With the TA frequency of 7272 and WP of 15.12, this topic reflects the importance that process automation for the OM sector and for the improvement of the relationship with the suppliers or customers. In this respect, our results are consistent with those reported by Tsai et al., (2013).

Another positive topic “Data” (keyness = 451.73; p = 0.031), with the TA frequency of 4286 and WP of 13.38, highlights the importance of DDI in the OM sector in terms of actions and strategies that use data for the development and management processes. Similarly, Nwokeji et al. (2015) underscored the importance of data-centric approaches to improve decision-making in OM companies. Furthermore, there is also evidence that data collection and analysis may shape the future of the OM sector (Awan et al., 2021).

Next, the positive topic “forecasting” (keyness = 423.08; p = 0.029), with the TA frequency of 3710 and WP of 11.08, shows the relevance of prediction techniques and tools that can be based on artificial intelligence, as well as tactics such as machine learning or Big Data analytics. In line with our findings, Kumar et al., (2021) argued that forecasting and prediction techniques in the OM sector will help to anticipate the challenges and issues that may arise in the OM sector.

Another positive topic identified in our data analysis is “Mobile Accessibility” (keyness = 400.97; p = 0.027). With the TA frequency of 2890 and WP of 7.71, this topic highlights the mobility of the sector towards transformation to the mobile ecosystem. Recently, both the organization of tasks and the relationship and communication with customers and suppliers have moved to a mobile-friendly sphere. Likewise, Fragapane et al., (2020) showed that mobile accessibility in OM is essential to make decisions in short time that can determine the added value of one company over another (Yamin & Alharthi, 2020).

The positive topic “Employee experiences” (keyness = 361.12; p = 0.026), with the TA frequency of 2749 and a WP of 7.01, indicates that the opinions and experiences of the company’s employees should be prioritized. In line with this finding, Reid & Sanders (2019) argued on the need to promote internal communication with customers in OM so that to ensure a successful relationship between suppliers and services. In an automation and data-centric ecosystem, employees play a key role in company’s relationship with its stakeholders (Pagell & Gobeli, 2009.

In contrast to the positive topics discussed above, the negative topic “Intelligence security” (keyness = 347.84; p = 0.024), with the TA frequency of 2035 and WP of 5.19, highlights the challenges of the use of DDI and the collection, analysis and prediction of data in terms of security and privacy of processes in OM. Intelligence security reflects concerns about corporate data leaks, industry cyber-attacks, patents, and trade secret thefts, and so forth (see also Choi et al., 2020). In an increasingly automated and data-centric ecosystem, security and cybercriminal attacks are becoming more common. Therefore, companies should not only adapt their actions to sales forecasts to increase profitability, but also consider the issues that can negatively impact the automation processes, data collection, and data-centric decisions (Saura et al., 2021).

Furthermore, the neutral topic “Operational CRM” (keyness = 347.75; p = 0.024), with the TA frequency of 1987 and WP of 5.11, confirms the importance of appropriate management of data processing and online management tools. Operational CRM can organize and structure strategies and, when combined with tools such as analytical CRMs (Schniederjans et al., 2012), the use of artificial intelligence can be a key point in business development (Liu et al., 2018).

Finally, the neutral topic “Digital teams” keyness = 338.27; p = 0.021), with the TA frequency of 1980 and a WP of 4.98, reveals the importance of employees’ management and their availability to telework. Recently, COVID-19 pandemic has dramatically transformed a multitude of processes into the digital ecosystem. Therefore, according to Grover et al., (2020), the OM industry must focus on both adapting traditional processes to the digital ecosystem and training its employees to make better decisions and manage processes by teleworking using tools that work directly in the cloud.

Taken together, the results of the present study highlight that the process automation, data-centric strategies, and digitalization of traditional processes to the digital environment will shape the characteristics of the OM sector in terms of integration of DDI strategies (Metallo et al., 2021). Thus, in order to present these contributions applied to the OM industry, in Table 8, we formulate future research questions regarding the use of DDI strategies in OM in relation to the topics identified in this study.

Table 8 Future research questions in relation to DDI in OM research

Full size table

6 Conclusions

In the present study, we applied data-mining techniques (namely, sentiment analysis, LDA and textual analysis) to analyze Twitter-UGC to identify the main characteristics of the new business models focused on DDI in the OM sector. Based on the results, we identified 8 topics: 5 positive (Automation, Data, Forecasting, Mobile accessibility and Employee experiences), 1 negative (Intelligence Security), and 2 topics are neutral (Operational CRM, Digital teams). The results were discussed in terms of the main characteristics of the business models in the OM field that use DDI. We also formulated 26 research questions to be further investigated in the future.

Therefore, our first research question (RQ1: What are the characteristics of new business models in operation management focused on DDI strategies?) was answered with the insights of the positive and negative characteristics of the identified topics. In addition, in terms of the relevance of each topic, the automation processes in OM (as the new perspectives in the data-centric business models based on analysis, collection, prediction, and forecasting) reveal the trends in the OM sector when using DDI. Moreover, it is important to point out the influence mobile accessibility in the workplace, as suggested by the importance of both the working teams and employees’ experiences with the use, adoption, and enjoyment of data-centric tools. The new processes in OM focus on, among other functions, connectivity and the use of new technologies to drive risk reduction in production, result measurement, data collection, and demand prediction. The adoption of new strategies of data monitoring and analysis, as well as their optimization, are the main development paths for the new business models in OM.

Furthermore, our second research question (RQ2: What are the UGC’S main sentiments about the characteristics of new business models in operation management that use DDI?) was addressed by explaining and discussing the relevance of the identified. Of note, only one topic in our data– intelligence security—was negative. This suggests that cyber-attacks and risks of data leakage should be closely investigated in the future as one of the ways to protect industrial secrets, innovation in business models, and the prediction of future actions in OM. While DDI strategies in the OM sector provide new mechanisms to develop and promote business innovation, if these applications are not appropriately implemented, the risks linked to misuse of artificial intelligence or similar technologies could jeopardize business success in OM.

6.1 Theoretical implications

The present study is a pioneering attempt to apply data-mining analysis tools to analyze Twitter-based UGC in the OM domain. In further research, our results, presented as topics, could be analyzed as new points of knowledge, be tested with empirical methodologies that measure statical significance based on approaches focused on understanding both the adoption of DDI in OM, or be used as constructs or variables in statistical models. In addition, since the present study takes the UGC under CATA as a valid source of data to develop theory, our findings can be extended in future research to improve its accuracy and to identify additional insights in relation to DDI in OM.

Regarding the results, new theoretical investigations should focus on the use of automation actions in OM. Together with data-driven business models and prediction technologies, this focus may cause a massive change in industrial production and transformation policies in the next decade. In addition, the identification of the indicators for employees to be valued in OM must be properly classified and defined. In addition, digital teamwork has become a priority and should be more closely investigated in relation to the development of business models in OM.

Finally, our study can serve as an example for further studies aiming to explore the characteristics of business models from UGC on Twitter. Our results can be replicated using new methods applied to DDI in OM and in other industries.

6.2 Managerial implications

First of all, the results obtained can be used by OM companies as a practical guide to identify the characteristics of new DDI-centric business models. In this way, OM companies can better understand how automation, data, and the mobile ecosystem are driving innovation in OM. Furthermore, based on our findings, practitioners can better understand the relevance of each of the identified topics for the OM field. Furthermore, our results are informative for CEOs and executives in terms of exploring new trends and characteristics of the industry’s adaptation to a data-centric and digital era.

Accordingly, the identified indicators can serve as a guide to improve decision-making in OM when working with DDI. Practitioners should understand the characteristics of new business models in OM and combine practical knowledge with the adoption of technological development. It is equally important to ensure that communication with stakeholders is fluid and constant, since technological development and the adoption of DDI business models drive the development of interactive and connected models. The processes of adaptation to digital change in OM, as well as the use of new platforms for data collection and analysis or production prediction, among others, must attend to the principles, characteristics, and limits of the development capacity of new models.

6.3 Limitations and future research

The limitations of the present study are related to the number of tweets in the sample. Indeed, the more trained are the algorithms, the more reliable will be the results. Therefore, in order to achieve higher accuracy, it is necessary to continuously train the algorithms over time. In addition, despite the use of quantitative analysis approaches, the present study is exploratory, as it takes UGC as a sample. While UGC has been widely used as a valid source of data in many previous studies, the results of the present study should be compared with other data sources, such as opinion surveys or focus groups. Doing so would complement the present results and increase the quality of the identified indicators. In further research, it would be necessary to more closely focus on each of the topics identified in the present study and link them to OM both for the improvement of decision-making and the larger-scale adoption of new DDI-centric business models.

References

McKenny, A. F., Aguinis, H., Short, J. C., & Anglin, A. H. (2018). What doesn’t get measured does exist: Improving the accuracy of computer-aided text analysis. Journal of Management, 44(7), 2909–2933. https://doi.org/10.1177/0149206316657594
Article Google Scholar
Adamides, E., & Karacapilidis, N. (2020). Information technology for supporting the development and maintenance of open innovation capabilities. Journal of Innovation and Knowledge, 5(1), 29–38. https://doi.org/10.1016/j.jik.2018.07.001
Article Google Scholar
Akter, S., Bandara, R., Hani, U., Wamba, F., Foropon, S., & Papadopoulos, C., T (2019). Analytics-based decision making for service systems: A qualitative study and agenda for future research. International Journal of Information Management, 48, 85–95. https://doi.org/10.1016/j.ijinfomgt.2019.01.020
Article Google Scholar
Akter, S., & Wamba, S. F. (2016). Big data analytics in E-commerce: a systematic review and agenda for future research. Electronic Markets, 1–22. https://doi.org/10.1007/s12525-016-0219-0
Akter, S., Wamba, S. F., Gunasekaran, A., Dubey, R., & Childe, S. J. (2016). How to improve firm performance using big data analytics capability and businessstrategy alignment? International Journal of Production Economics, 182, 113–131. https://doi.org/10.1016/j.ijpe.2016.08.018
Article Google Scholar
Ambrosino, A., Cedrini, M., Davis, J. B., Fiori, S., Guerzoni, M., & Nuccio, M. (2018). What topic modeling could reveal about the evolution of economics. Journal of Economic Methodology, 25(4), 329–348. https://doi.org/10.1080/1350178X.2018.1529215
Article Google Scholar
Araz, O. M., Choi, T. M., Olson, D. L., & Salman, F. S. (2020). Role of analytics for operational risk management in the era of big data. Decision Sciences, 51(6), 1320–1346. https://doi.org/10.1111/deci.12451
Article Google Scholar
Asarta, C. J., & Méndez-Carbajo, D. (2020). Bringing down the walls in Business and Management Education: A textual analysis of economic education articles. Journal of Education for Business, 1–12. https://doi.org/10.1080/08832323.2020.1726268
Awan, U., Sroufe, R., & Shahbaz, M. (2021). Industry 4.0 and the circular economy: A literature review and recommendations for future research. Business Strategy and the Environment, 30(4), 2038–2060. https://doi.org/10.1002/bse.2731
Article Google Scholar
Barney, J. (1991). Firm resources and sustained competitive advantage. Journal of management, 17, 99–120. https://doi.org/10.1177/014920639101700108
Article Google Scholar
Bermingham, A., & Smeaton, A. (2011, November). On using Twitter to monitor political sentiment and predict election results. In Proceedings of the Workshop on Sentiment Analysis where AI meets Psychology (SAAIP 2011) (pp. 2–10)
Bos, J., & Markert, K. (2005, April). Combining shallow and deep NLP methods for recognizing textual entailment. In Proceedings of the First PASCAL Challenges Workshop on Recognising Textual Entailment, Southampton, UK (pp. 65–68)
Breidbach, C. F., & Maglio, P. (2020). Accountable algorithms? The ethical implications of data-driven business models. Journal of Service Management, 31(22), 163–185. https://doi.org/10.1108/JOSM-03-2019-0073
Article Google Scholar
Bruns, A., Weller, K., Zimmer, M., & Proferes, N. J. (2014). A topology of Twitter research: disciplines, methods, and ethics. Aslib Journal of Information Management, 66(3), 250–261. https://doi.org/10.1108/AJIM-09-2013-0083
Article Google Scholar
Chae, B. K. (2015). Insights from hashtag# supplychain and Twitter Analytics: Considering Twitter and Twitter data for supply chain practice and research. International Journal of Production Economics, 165, 247–259. https://doi.org/10.1016/j.ijpe.2014.12.037
Article Google Scholar
Chan, H. K., Lacka, E., Yee, R. W., & Lim, M. K. (2017). The role of social media data in operations and production management. International Journal of Production Research, 55(17), 5027–5036. https://doi.org/10.1080/00207543.2015.1053998
Article Google Scholar
Chan, S. W., & Chong, M. W. (2017). Sentiment analysis in financial texts. Decision Support Systems, 94, 53–64. https://doi.org/10.1016/j.dss.2016.10.006
Article Google Scholar
Chen, L. C. (2017). An effective LDA-based time topic model to improve blog search performance. Information Processing and Management, 53(6), 1299–1319. https://doi.org/10.1016/j.ipm.2017.08.001
Article MathSciNet Google Scholar
Cherif, W., Madani, A., & Kissi, M. (2016). A hybrid optimal weighting scheme and machine learning for rendering sentiments in tweets. International Journal of Intelligent Engineering Informatics, 4(3–4), 322–339. https://doi.org/10.1504/IJIEI.2016.080527
Article Google Scholar
Choi, T. M., Guo, S., & Luo, S. (2020). When blockchain meets social-media: Will the result benefit social media analytics for supply chain operations management? Transportation Research Part E: Logistics and Transportation Review, 135, 101860. https://doi.org/10.1016/j.tre.2020.101860
Article Google Scholar
Daneshvar, S., & Inkpen, D. (2018). Gender identification in twitter using n-grams and lsa. In Proceedings of the Ninth International Conference of the CLEF Association (CLEF 2018)
Davenport, T. H. (2013). Analytics 3.0. Harvard Business Review, 91, 64–72
Google Scholar
de Camargo Fiorini, P., Roman Pais Seles, B. M., Chiappetta Jabbour, C. J., Barberio Mariano, E., & de Sousa Jabbour, A. B. L. (2018). Management theory and big data literature: From a review to a research agenda. International Journal of Information Management, 43, 112–129. https://doi.org/10.1016/j.ijinfomgt.2018.07.005
Article Google Scholar
De Menezes, L. M., Wood, S., & Gelade, G. (2010). The integration of human resource and operation management practices and its link with performance: A longitudinal latent class study. Journal of operations Management, 28(6), 455–471. https://doi.org/10.1016/j.jom.2010.01.002
Article Google Scholar
Duan, Y., Cao, G., & Edwards, J. S. (2018). Understanding the impact of business analytics on innovation. European Journal of Operational Research. https://doi.org/10.1016/j.ejor.2018.06.021
Article Google Scholar
Farahani, R., & Rezapour, S. (2011). Logistics operations and management: concepts and models. Elsevier
Fragapane, G., Ivanov, D., Peron, M., Sgarbossa, F., & Strandhagen, J. O. (2020). Increasing flexibility and productivity in Industry 4.0 production networks with autonomous mobile robots and smart intralogistics. Annals of operations research, 1–19. https://doi.org/10.1007/s10479-020-03526-7
Franco, M., & Esteves, L. (2020). Inter-clustering as a network of knowledge and learning: Multiple case studies. Journal of Innovation and Knowledge, 5(1), 39–49. https://doi.org/10.1016/j.jik.2018.11.001
Article Google Scholar
Gabrielatos, C. (2018). Keyness analysis. Corpus approaches to discourse: A critical review, 225–258. In book: Taylor, C. and Marchi, A. (eds.) Corpus Approaches To Discourse: A critical review. Oxford: Routledge.Chapter: 11Publisher: Routledge
Gabrielatos, C., & Marchi, A. (2011, November). Keyness: Matching metrics to definitions. In Theoretical-methodological challenges in corpus approaches to discourse studies and some ways of addressing them
Gao, Z., Feng, A., Song, X., & Wu, X. (2019). Target-dependent sentiment classification with BERT. Ieee Access : Practical Innovations, Open Solutions, 7, 154290–154299. https://doi.org/10.1109/ACCESS.2019.2946594
Article Google Scholar
Giannakis, M., Dubey, R., Yan, S., Spanaki, K., & Papadopoulos, T. (2020). Social media and sensemaking patterns in new product development: demystifying the customer sentiment. Annals of Operations Research, 1–31. https://doi.org/10.1007/s10479-020-03775-6
Goel, P. S. (2022). From Technology to Management. Making of a Satellite Centre (pp. 79–95). Singapore: Springer. https://doi.org/10.1007/978-981-16-3480-2_9
Chapter Google Scholar
Grant, R. M. (1996). Prospering in dynamically-competitive environments: Organizational capability as knowledge integration. Organization science, 7, 375–387. https://doi.org/10.1287/orsc.7.4.375
Article Google Scholar
Grover, P., Kar, A. K., & Dwivedi, Y. K. (2020). Understanding artificial intelligence adoption in operations management: insights from the review of academic literature and social media discussions. Annals of Operations Research, 1–37. https://doi.org/10.1007/s10479-020-03683-9
Gruber, D. A., Smerek, R. E., Thomas-Hunt, M. C., & James, E. H. (2015). The real-time power of Twitter: Crisis management and leadership in an age of social media. Business Horizons, 58(2), 163–172. https://doi.org/10.1016/j.bushor.2014.10.006
Article Google Scholar
Hagen, L. (2018). Content analysis of e-petitions with topic modeling: How to train and evaluate LDA models? Information Processing and Management, 54(6), 1292–1307. https://doi.org/10.1016/j.ipm.2018.05.006
Article Google Scholar
Hardeniya, N., Perkins, J., Chopra, D., Joshi, N., & Mathur, I. (2016). Natural language processing: python and NLTK. Packt Publishing Ltd.
Hartmann, P. M., Zaki, M., Feldmann, N., & Neely, A. (2014). Big data for big business? A taxonomy of data-driven business models used by start-up firms.Cambridge Service Alliance,1–29
Helfat, C. E., & Peteraf, M. A. (2009). Understanding dynamic capabilities: progress along a developmental path. Strategic Organization, 7, 91–102. https://doi.org/10.1177/1476127008100133
Article Google Scholar
Hiremath, B. N., & Patil, M. M. (2020). Enhancing optimized personalized therapy in clinical decision support system using Natural Language Processing. Journal of King Saud University-Computer and Information Sciences. doi: https://doi.org/10.1016/j.jksuci.2020.03.006
Hong, L., & Davison, B. D. (2010, July). Empirical study of topic modeling in twitter. In Proceedings of the first workshop on social media analytics (pp. 80–88)
Hossain, M. A., Akter, S., & Yanamandram, V. (2020). Customer analytics capabilities in the big data spectrum: a systematic approach to achieve sustainable firm performance. In Technological Innovations for Sustainability and Business Growth (pp. 1–17). IGI Global. https://doi.org/10.4018/978-1-5225-9940-1.ch001
Hussein, D. M. E. D. M. (2018). A survey on sentiment analysis challenges. Journal of King Saud University-Engineering Sciences, 30(4), 330–338. https://doi.org/10.1016/j.jksues.2016.04.002
Article Google Scholar
Ibrahim, M., & Ahmad, R. (2010, May). Class diagram extraction from textual requirements using natural language processing (NLP) techniques. In 2010 Second International Conference on Computer Research and Development (pp. 200–204). IEEE
Jetzek, T., Avital, M., & Bjorn-Andersen, N. (2014). Data-driven innovation through open government data. Journal of theoretical and applied electronic commerce research, 9(2), 100–120
Article Google Scholar
Johnson, C. D., Bauer, B. C., & Niederman, F. (2021). The automation of management and business science. Academy of Management Perspectives, 35(2), 292–309. https://doi.org/10.5465/amp.2017.0159
Article Google Scholar
Kim, H., Jang, S. M., Kim, S. H., & Wan, A. (2018). Evaluating sampling methods for content analysis of Twitter data. Social Media + Society, 4(2), https://doi.org/10.1177/2056305118772836
Kontopoulos, E., Berberidis, C., Dergiades, T., & Bassiliades, N. (2013). Ontology-based sentiment analysis of twitter posts. Expert systems with applications, 40(10), 4065–4074. https://doi.org/10.1016/j.eswa.2013.01.001
Article Google Scholar
Krippendorff, K. (2018). Content analysis: An introduction to its methodology. Sage publications
Kumar, J., Singh, A. K., & Buyya, R. (2021). Self directed learning based workload forecasting model for cloud resource management. Information Sciences, 543, 345–366. https://doi.org/10.1016/j.ins.2020.07.012
Article Google Scholar
Kumar, K. A., Rajasimha, N., Reddy, M., Rajanarayana, A., & Nadgir, K. (2015). Analysis of users’ sentiments from kannada web documents. Procedia Computer Science, 54, 247–256. https://doi.org/10.1016/j.procs.2015.06.029
Article Google Scholar
Lakshmi, V., & Bahli, B. (2020). Understanding the robotization landscape transformation: A centering resonance analysis. Journal of Innovation and Knowledge, 5(1), 59–67. https://doi.org/10.1016/j.jik.2019.01.005
Article Google Scholar
Lee, K., & Carter, S. (2011). Global marketing management. Strategic Direction https://doi.org/10.1108/sd.2011.05627aae.001
Article Google Scholar
Li, G., & Liu, F. (2014). Sentiment analysis based on clustering: a framework in improving accuracy and recognizing neutral opinions. Applied intelligence, 40(3), 441–452. https://doi.org/10.1007/s10489-013-0463-3
Article MathSciNet Google Scholar
Li, M., & Zhou, Z. H. (2007). Improve computer-aided diagnosis with machine learning techniques using undiagnosed samples. IEEE Transactions on Systems Man and Cybernetics-Part A: Systems and Humans, 37(6), 1088–1098
Article CAS Google Scholar
Liao, X., Ye, G., Yu, J., et al. (2021). Identifying lead users in online user innovation communities based on supernetwork. Annals of Operation Research, 300, 515–543. https://doi.org/10.1007/s10479-021-03953-0
Article MathSciNet Google Scholar
Lins, M. G., Zotes, L. P., & Caiado, R. (2021). Critical factors for lean and innovation in services: from a systematic review to an empirical investigation. Total Quality Management and Business Excellence, 32(5–6), 606–631. https://doi.org/10.1080/14783363.2019.1624518
Article Google Scholar
Liu, N., Chow, P. S., & Zhao, H. (2020). Challenges and critical successful factors for apparel mass customization operations: recent development and case study. Annals of Operation Research, 291, 531–563. https://doi.org/10.1007/s10479-019-03149-7
Article Google Scholar
Liu, P., & Yi, S. (2018). Investment decision-making and coordination of a three-stage supply chain considering Data Company in the Big Data era.Annals of Operations Research, 270,255–27. https://doi.org/10.1007/s10479-018-2783-5
Liu, Y., Wang, W., Sun, M., Ma, B., Pang, L., Du, Y., … and … Ni, J. (2019). Polygonum multiflorum-induced liver injury: Clinical characteristics, risk factors, material basis, action mechanism and current challenges. Frontiers in pharmacology, 10, 1467
Article CAS PubMed PubMed Central Google Scholar
Loughran, T., & McDonald, B. (2016). Textual analysis in accounting and finance: A survey. Journal of Accounting Research, 54(4), 1187–1230. https://doi.org/10.1111/1475-679X.12123
Article Google Scholar
McKinney, W. (2012). Python for data analysis: Data wrangling with Pandas, NumPy, and IPython. " O’Reilly Media, Inc.“
Medhat, W., Hassan, A., & Korashy, H. (2014). Sentiment analysis algorithms and applications: A survey. Ain Shams engineering journal, 5(4), 1093–1113. https://doi.org/10.1016/j.asej.2014.04.011
Article Google Scholar
Metallo, C., Agrifoglio, R., Briganti, P., Mercurio, L., & Ferrara, M. (2021). Entrepreneurial Behaviour and New Venture Creation: the Psychoanalytic Perspective. Journal of Innovation and Knowledge, 6(1), 35–42. https://doi.org/10.1016/j.jik.2020.02.001
Article Google Scholar
Mittal, R., Ahmed, W., Mittal, A., & Aggarwal, I. (2021). Twitter users exhibited coping behaviours during the COVID-19 lockdown: an analysis of tweets using mixed methods. Information Discovery and Delivery. https://doi.org/10.1108/IDD-08-2020-0102
Article Google Scholar
Nwokeji, J. C., Clark, T., Barn, B., Kulkarni, V., & Anum, S. O. (2015, September). A data-centric approach to change management. In 2015 IEEE 19th International Enterprise Distributed Object Computing Conference (pp. 185–190). IEEE
Pagell, & Gobeli, D. (2009). How plant managers’ experiences and attitudes toward sustainability relate to operational performance. Production and Operations Management, 18(3), 278–299. https://doi.org/10.1111/j.1937-5956.2009.01050.x
Article Google Scholar
Park, J., & Oh, H. J. (2017). Comparison of topic modeling methods for analyzing research trends of archives management in korea: Focused on lda and hdp. Journal of Korean Library and Information Science Society, 48(4), 235–258
Article Google Scholar
Pollach, I. (2012). Taming textual data: The contribution of corpus linguistics to computer-aided text analysis. Organizational Research Methods, 15(2), 263–287. https://doi.org/10.1177/1094428111417451
Article Google Scholar
Pryke, S. (Ed.). (2009). Construction supply chain management: concepts and case studies (3 vol.). John Wiley and Sons
Qiu, L., Rui, H., & Whinston, A. (2013). Social network-embedded prediction markets: The effects of information acquisition and communication on predictions. Decision Support Systems, 55(4), 978–987. https://doi.org/10.1016/j.dss.2013.01.007
Article Google Scholar
Rad, A. A., Jalali, M. S., & Rahmandad, H. (2018). How exposure to different opinions impacts the life cycle of social media. Annals Of Operations Research, 268, 63–91. https://doi.org/10.1007/s10479-017-2554-8
Article MathSciNet Google Scholar
Raisch, S., & Krakowski, S. (2021). Artificial intelligence and management: The automation–augmentation paradox. Academy of Management Review, 46(1), 192–210. https://doi.org/10.5465/amr.2018.0072
Article Google Scholar
Ransbotham, S., & Kiron, D. (2017). Analytics as a Source of Business Innovation. MIT Sloan Management Review, 58, n/a-0
Rayson, P., & Garside, R. (2000, October). Comparing corpora using frequency profiling. In The workshop on comparing corpora (pp. 1–6). https://doi.org/10.3115/1117729.1117730
Reid, R. D., & Sanders, N. R. (2019). Operations management: an integrated approach. John Wiley and Sons
Sapkota, U., Bethard, S., Montes, M., & Solorio, T. (2015). Not all character n-grams are created equal: A study in authorship attribution. In Proceedings of the 2015 conference of the North American chapter of the association for computational linguistics: Human language technologies (pp. 93–102)
Saura, J. R. (2021). Using Data Sciences in Digital Marketing: Framework, methods, and performance metrics. Journal of Innovation and Knowledge. https://doi.org/10.1016/j.jik.2020.08.001
Article Google Scholar
Saura, J. R., Ribeiro-Soriano, D., & Palacios-Marqués, D. (2021). From user-generated data to data-driven innovation: A research agenda to understand user privacy in digital markets. International Journal of Information Management, Volume 60, October 2021, 102331. doi: https://doi.org/10.1016/j.ijinfomgt.2021.102331
Saura, J. R., Palacios-Marqués, D., & Iturricha-Fernández, A. (2021a). Ethical Design in Social Media: Assessing the main performance measurements of user online behavior modification. Journal of Business Research, 129, May 2021, 271–281. https://doi.org/10.1016/j.jbusres.2021.03.001
Schniederjans, M. J., Cao, Q., & Gu, C., V (2012). An operations management perspective on adopting customer-relations management (CRM) software. International Journal of Production Research, 50(14), 3974–3987. https://doi.org/10.1080/00207543.2011.613865
Article Google Scholar
Short, J. C., Broberg, J. C., Cogliser, C. C., & Brigham, K. H. (2010). Construct validation using computer-aided text analysis (CATA) an illustration using entrepreneurial orientation. Organizational Research Methods, 13(2), 320–347. https://doi.org/10.1177/1094428109335949
Article Google Scholar
Sidorov, G., Velasquez, F., Stamatatos, E., Gelbukh, A., & Chanona-Hernández, L. (2014). Syntactic n-grams as machine learning features for natural language processing. Expert Systems with Applications, 41(3), 853–860. https://doi.org/10.1016/j.eswa.2013.08.015
Article Google Scholar
Sorescu, A. (2017). Data-driven business model innovation. Journal of Product Innovation Management, 34(5), 691–696. https://doi.org/10.1111/jpim.12398
Article Google Scholar
Supriya, B. N., Kallimani, V., Prakash, S., & Akki, C. B. (2016, March). Twitter sentiment analysis using binary classification technique. In International Conference on Nature of Computation and Communication (pp. 391–396). Springer, Cham
Thomé, A. M. T., Scavarda, L. F., & Scavarda, A. J. (2016). Conducting systematic literature review in operations management. Production Planning and Control, 27(5), 408–420. https://doi.org/10.1080/09537287.2015.1129464
Article Google Scholar
Thorsrud, L. A. (2020). Words are the new numbers: A newsy coincident index of the business cycle. Journal of Business and Economic Statistics, 38(2), 393–409. https://doi.org/10.1080/07350015.2018.1506344
Article MathSciNet Google Scholar
Tiberius, V., Schwarzer, H., & Roig-Dobón, S. (2021). Radical innovations: Between established knowledge and future research opportunities. Journal of Innovation and Knowledge, 6(3), 145–153. https://doi.org/10.1016/j.jik.2020.09.001
Article Google Scholar
Toktay, L. B., Wein, L. M., & Zenios, S. A. (2000). Inventory management of remanufacturable products. Management science, 46(11), 1412–1426
Article Google Scholar
Trabucchi, D., & Buganza, T. (2019). Data-driven innovation: switching the perspective on Big Data”. European Journal of Innovation Management, 22(1), 23–40. https://doi.org/10.1108/EJIM-01-2018-0017
Article Google Scholar
Trofimovich, J. (2016). Comparison of neural network architectures for sentiment analysis of russian tweets. In Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference Dialogue (pp. 50–59)
Tsai, M. C., Lai, K. H., & Hsu, W. C. (2013). A study of the institutional forces influencing the adoption intention of RFID by suppliers. Information and Management, 50(1), 59–65. https://doi.org/10.1016/j.im.2012.05.006
Article Google Scholar
van Atteveldt, W., van der Velden, M. A., & Boukes, M. (2021). The Validity of Sentiment Analysis: Comparing Manual Annotation, Crowd-Coding, Dictionary Approaches, and Machine Learning Algorithms. Communication Methods and Measures, 15(2), 121–140. https://doi.org/10.1080/19312458.2020.1869198
Article Google Scholar
Verbeke, W., Dejaeger, K., Martens, D., Hur, J., & Baesens, B. (2012). New insights into churn prediction in the telecommunication sector: A profit driven data mining approach. European journal of operational research, 218(1), 211–229. https://doi.org/10.1016/j.ejor.2011.09.031
Article Google Scholar
Vijayarani, S., & Janani, R. (2016). Text mining: open source tokenization tools-an analysis. Advanced Computational Intelligence: An International Journal (ACII), 3(1), 37–47. https://doi.org/10.5121/acii.2016.3104
Article Google Scholar
Waller, M. A., & Fawcett, S. E. (2013). Data science, predictive analytics, and big data: a revolution that will transform supply chain design and management. Journal of Business Logistics, 34, 77–84. https://doi.org/10.1111/jbl.12010
Article Google Scholar
Wamba, S. F., Gunasekaran, A., Akter, S., Ren, S. J., Dubey, R., & Childe, S. J. (2017). Big data analytics and firm performance: Effects of dynamic capabilities. Journal of Business Research, 70, 356–365. https://doi.org/10.1016/j.jbusres.2016.08.009
Article Google Scholar
Wang, Y., Kung, L., & Byrd, T. A. (2018). Big data analytics: Understanding its capabilities and potential benefits for healthcare organizations. Technological Forecasting and Social Change, 126, 3–13. https://doi.org/10.1016/j.techfore.2015.12.019
Article Google Scholar
Xia, R., Xu, F., Zong, C., Li, Q., Qi, Y., & Li, T. (2015). Dual sentiment analysis: Considering two sides of one review. IEEE transactions on knowledge and data engineering, 27(8), 2120–2133. https://doi.org/10.1109/TKDE.2015.2407371
Article Google Scholar
Yamin, & Alharthi, S. (2020). Measuring impact of healthcare information systems in administration and operational management. International Journal of Information Technology, 12(3), 767–774. https://doi.org/10.1007/s41870-019-00329-3
Article Google Scholar
Yang, H. M., Choi, B. S., Park, H. J., Suh, M. S., & Chae, B. K. (2007). Supply chain management six sigma: a management innovation methodology at the Samsung Group. Supply Chain Management: An International Journal. https://doi.org/10.1108/13598540710737271
Article Google Scholar
Yuan, H., Xu, W., Li, Q., et al. (2018). Topic sentiment mining for sales performance prediction in e-commerce. Annals Of Operations Research, 270, 553–576. https://doi.org/10.1007/s10479-017-2421-7
Article Google Scholar
Yue, H., Guo, L., Li, R., Asaeda, H., & Fang, Y. (2014). DataClouds: Enabling community-based data-centric services over the Internet of Things. IEEE Internet of Things Journal, 1(5), 472–482. https://doi.org/10.1109/JIOT.2014.2353629
Article Google Scholar
Zhang, W. J., Wang, J. W., & Lin, Y. (2019). Integrated design and operation management for enterprise systems. 13, 2019,424–429. https://doi.org/10.1080/17517575.2019.1597169
Zhao, F., Zhu, Y., Jin, H., & Yang, L. T. (2016). A personalized hashtag recommendation approach using LDA-based topic model in microblog environment. Future Generation Computer Systems, 65, 196–206. https://doi.org/10.1016/j.future.2015.10.012
Article Google Scholar

Download references

Funding

Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature.

Author information

Authors and Affiliations

Rey Juan Carlos University, Madrid, Spain
Jose Ramon Saura
Universitat de Valencia, Valencia, Spain
Domingo Ribeiro-Soriano
Universitat Politècnica de València, Valencia, Spain
Daniel Palacios-Marqués

Authors

Jose Ramon Saura
View author publications
You can also search for this author in PubMed Google Scholar
Domingo Ribeiro-Soriano
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Palacios-Marqués
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jose Ramon Saura.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Saura, J.R., Ribeiro-Soriano, D. & Palacios-Marqués, D. Data-driven strategies in operation management: mining user-generated content in Twitter. Ann Oper Res 333, 849–869 (2024). https://doi.org/10.1007/s10479-022-04776-3

Download citation

Received: 23 July 2021
Revised: 12 January 2022
Accepted: 11 May 2022
Published: 10 June 2022
Issue Date: February 2024
DOI: https://doi.org/10.1007/s10479-022-04776-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Data-driven strategies in operation management: mining user-generated content in Twitter

Abstract

Similar content being viewed by others

A survey on sentiment analysis methods, applications, and challenges

Artificial intelligence in E-Commerce: a bibliometric study and literature review

Analyzing social media data: A mixed-methods framework combining computational and qualitative text analysis

1 Introduction

2 Literature review

3 Methodology

3.1 Data sampling

3.2 Sentiment analysis with TextBlob

3.3 Topic modeling

3.4 Textual analysis

4 Results

4.1 TextBlob sentiment analysis results

4.2 Topic-modeling results

4.3 Textual analysis results

5 Discussion

6 Conclusions

6.1 Theoretical implications

6.2 Managerial implications

6.3 Limitations and future research

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Data-driven strategies in operation management: mining user-generated content in Twitter

Abstract

Similar content being viewed by others

A survey on sentiment analysis methods, applications, and challenges

Artificial intelligence in E-Commerce: a bibliometric study and literature review

Analyzing social media data: A mixed-methods framework combining computational and qualitative text analysis

1 Introduction

2 Literature review

3 Methodology

3.1 Data sampling

3.2 Sentiment analysis with TextBlob

3.3 Topic modeling

3.4 Textual analysis

4 Results

4.1 TextBlob sentiment analysis results

4.2 Topic-modeling results

4.3 Textual analysis results

5 Discussion

6 Conclusions

6.1 Theoretical implications

6.2 Managerial implications

6.3 Limitations and future research

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation