1 Introduction and motivation

Traditionally, global corporations tend to maintain a low profile on contentious issues like immigration or racism (Smith and Korschun 2018). However, in recent times, companies seem increasingly willing to adopt a more assertive stance, actively participating in social and political discourse. This shift is notably facilitated by the widespread influence of social media platforms, which amplifies the reach and impact of corporate positions.

The media can influence public behavior through various mechanisms, including the salience effect (or exposure effect), where extensive discussions make a topic highly noticeable (McCombs and Shaw 1972; Erbring et al. 1980). Additionally, the frame effect, which concerns the tone of a discussion, plays a role in shaping perceptions (Scheufele 1999). While the impact of traditional media (TV and newspapers) on individual opinions and consumption behavior is well-documented in political science and communication literature (McCombs and Shaw 1972; Erbring et al. 1980; Entman 1993), the influence of social media on consumption has received less attention.Footnote 1 This study aims to contribute to this understanding, focusing on the 2020 engagement of the National Basketball Association (NBA) with the Black Lives Matter (BLM) movement. Specifically, we investigate the extent to which public opinion, represented by Twitter social media data, influenced the choices of sports consumers, measured through TV ratings and viewership numbers.

A considerable number of consumers now rely on social media platforms such as Facebook, Instagram, and Twitter for news and information, gradually replacing or supplementing more traditional media outlets like TV and newspapers. Among these platforms, Twitter stands out with over 330 million active monthly users, making it one of the most influential channels and a primary source for real-time information.Footnote 2 Twitter allows users to share concise and frequent messages (“tweets”), limited to 280 characters. These messages are easily searchable and can be quickly shared (retweeted) with one’s followers. As highlighted by Austmann and Vigne (2021), discussions on Twitter have the potential to reach a broader audience beyond its user base. Twitter data has been extensively utilized in various research domains (Soroka et al. 2018; Shen et al. 2019; Zhuravskaya et al. 2020). Importantly, it is the most commonly used social platform by journalists and professional players in major American sports leagues.

The Black Lives Matter (BLM) movement emerged in 2013 as a social movement protesting police brutality against black individuals in the USA. In June 2020, widespread protests erupted across the country following the death of George Floyd, catalyzing one of the most significant civil rights movements in recent American history. The National Basketball Association (NBA) has been notably vocal and supportive of the BLM cause. Upon the resumption of play in late July 2020, after a COVID-19-induced suspension, the NBA adopted a firm but contentious stance, allowing athletes to kneel during the national anthem, wear social justice messages on their jerseys instead of surnames, and display the BLM slogan on basketball courts.Footnote 3 Many prominent players and NBA Commissioner Adam Silver endorsed the Black Lives Matter protests, with Silver stating on June 1, 2020: “Together with our teams and players, we will continue our efforts to promote inclusion and bridge divides.”

While the NBA’s engagement with the BLM movement aimed to raise awareness of social issues, it faced criticism from some politicians. In September 2020, President Donald Trump tweeted: “People are tired of watching the highly political NBA. Basketball ratings are way down, and they won’t be coming back.” President Trump’s statement implied that consumers may penalize companies for political involvement. NBA Commissioner Adam Silver responded, stating: “No data BLM on-court hurts NBA ratings. There is no doubt there are some people who have become further engaged with the league. They respect their right to speak out on issues that are important to them.” This polarization was amplified by social media users. Motivated by these dynamics, our study aims to answer two key questions: To what extent did individuals alter their NBA TV consumption due to the interest generated by the league’s involvement (intensity) in the BLM movement, and was this change driven by the tone (sentiment) of messages posted on social media?

The empirical nature of our research questions means that we do not hold any preconceived expectations regarding the impact on consumers, if such an impact exists. Individuals might have chosen to: reduce their TV consumption due to disapproval of the NBA’s involvement in non-sport-related issues; increase their time spent watching NBA matches, indicating support for the league’s social stance; or remain unaffected, indicating a low elasticity of NBA TV demand to social media commentary.

To disentangle the role of social media from other concurrent factors, we initially conducted an event study, utilizing high-frequency data and incorporating a comprehensive set of control variables. Our measure of the TV audience included data on viewers and ratings for all NBA matches broadcast on national TV networks during the 2018–2019 and 2019–2020 seasons. While TV audience data is not a traditional measure of consumption, it is closely linked to revenues. Kanazawa and Funk (2001), for instance, found that higher ratings for locally televised NBA basketball games enabled teams to generate greater advertising revenues.

To gauge social media sentiment toward the NBA’s involvement in the BLM movement, we utilized XLM-RoBERTa-base (referred to as XLM-RoBERTa). This is a pre-trained natural language processing technique (Conneau et al. 2019) trained on a large corpus of English data in a self-supervised manner. A key feature of XLM-RoBERTa is its ability to comprehend the meaning of nuanced languages, including the interpretation of emoticons. Using XLM-RoBERTa, we predicted the likelihood of positive, neutral, or negative sentiments in tweets.

We constructed both an intensity (exposure) index, representing the total number of tweets on the NBA’s involvement with the BLM movement, and a sentiment (tone) index, categorizing Twitter data into positive and negative sentiment clusters. In our baseline empirical analysis, we conducted regressions using TV audience data for NBA matches as the dependent variable and included the social media intensity and sentiment indexes as explanatory variables. Our results highlight that only positively toned tweets have a significant impact on sports TV consumption. Notably, our findings suggest an average viewership increase of 6.6% during the period following the resumption of play after the COVID-related hiatus. These results remain robust when using different model functional forms and tweet sentiment classification metrics. Additionally, we exploited the granularity of Twitter data by exploring the popularity of messages and authors.

An interpretation of the asymmetric effects suggests that NBA core viewers, characterized by an inelastic demand, continue to enjoy watching basketball regardless of public opinions. Conversely, the number of marginal (non-habitual) viewers appears to increase with positive sentiment toward the NBA’s involvement, indicating an elastic demand. This group is likely to tune in due to their alignment with the social campaigns presented by the NBA, thus supporting the economic significance of our findings.

To account for potentially omitted variables, we supplemented our analysis with a difference-in-differences (DiD) approach, using the National Hockey League (NHL) as a control. The NHL serves as an ideal comparison to the NBA, sharing the following features: both leagues have the same number (82) of regular-season matches; the regular season for both leagues was interrupted and resumed at the same time; and, finally, both leagues exhibited a similar (negative) trend in TV viewers during the period under consideration (2018–2020). In line with our primary findings, there is limited evidence suggesting that the intensity of NBA involvement in the BLM movement contributed to the decline in the number of viewers and ratings. While the nature of our data does not allow inferences about changes in behavior at the individual level, the results confirm that aggregate TV sports demand is inelastic with respect to the total number of tweets referring to the NBA’s involvement in the BLM movement.

We conclude our empirical analysis by examining a proxy for traditional media intensity, measured by the number of articles appearing in newspapers, as well as TV and radio transcripts. While the Twitter-based indexes gauge the active involvement of social media users on a certain topic, motivating the focus of this paper, consumers of traditional media are passively exposed to the authors’ views. Additionally, we considered the volume of Google searches on the NBA-BLM involvement using daily Google Trends data. Consistent with the previous results, we do not find any statistically significant effect for any of the non-social-media-intensity indexes.

Our work draws inspiration from various strands of the economics, politics, and marketing literature. Firstly, we examine whether media coverage and its tone affect consumer behavior (Carroll 2003; Lerner et al. 2007; Lamla and Lein 2014; Biolsi and Lebedinsky 2021; Abdollahi 2023).Footnote 4 Secondly, we explore beliefs and consumption patterns (Angeletos and La’O 2013; Gillitzer and Prasad 2018; Benhabib and Spiegel 2019). These studies acknowledge the influence of non-fundamental factors, such as beliefs and opinions, on consumers’ spending decisions. Within this context, recent literature has found that social media is increasingly influencing beliefs and fostering activism among consumers (Bovitz et al. 2002; Hendel et al. 2017; Enikolopov et al. 2020; Zhuravskaya et al. 2020; Gorodnichenko et al. 2021). Thirdly, we explore the demand for sports, especially for TV broadcasting (Hausman and Leonard 1997; Forrest et al. 2005; Buraimo et al. 2022; Caselli et al. 2024). This literature focuses on aspects such as outcome uncertainty, team identity, and the quality of players. Very few studies have analyzed the role of other determinants.Footnote 5 Finally, this study is also linked to recent marketing literature that examines the influence of social and political activism on a firm’s performance (Scherer et al. 2014; Smith and Korschun 2018). We contribute to this by disentangling the involvement of firms from that of other confounding factors.

The paper’s structure is organized as follows. Section 2 introduces and examines the data. Section 3 outlines the results of the baseline model. Section 4 explores a series of robustness checks. Section 5 reports the findings using difference-in-differences methodology. Section 6 presents additional results using traditional media outlets. Finally, Section 7 provides concluding remarks.

2 Data description

In this section, we describe the variables employed in the empirical analysis.

2.1 TV audience

Major professional sports leagues in the USA operate as private entities and are not publicly traded on the stock market. Consequently, using share value as an indicator of a firm’s success is not applicable in this context. Instead, our approach involves gauging the NBA’s performance through two alternative indicators: TV viewership and ratings. While these measures are distinct, they are interconnected and provide insights into the television audience (Nielsen 2021). “Ratings” denote the percentage of U.S. TV households tuned in to a specific program, while “TV share” represents a percentage based on the households engaged in television viewing. On the other hand, “viewers” and “viewing figures” encompass the total number of individuals watching a program. It is important to note that TV ratings and viewing figures may not comprehensively capture all aspects of media consumption, given the evolving landscape of how people consume TV, particularly with the increasing prevalence of programs on internet-related platforms. Unfortunately, data pertaining to this aspect is not available for our analysis.Footnote 6

The TV scheduling of NBA games in the USA aligns with other major professional sports leagues, such as the National Football League and the National Hockey League. A limited number of games are nationally broadcasted, while the remainder are accessible through regional networks. Our analysis centers on the nationally broadcasted games, as detailed data on regional network broadcasting is unavailable. These nationally aired games are typically featured on cable TV channels like ABC and TNT. Consumers typically subscribe to a package, involving payment, that encompasses a bundle of TV channels providing various content such as news and movies. Once subscribed, users enjoy unlimited access to all programs broadcast by those networks. This subscription fee, from an economic standpoint, constitutes a sunk cost with an almost negligible marginal cost of consumption. This setup is advantageous for our empirical analysis because consumers can promptly adjust their viewership behavior without incurring additional costs. The decision to subscribe can be viewed as the extensive margin, while the actual consumption reflects the intensive margin. Our data and empirical setting are better suited for investigating the latter.

We collected data on all 506 NBA matches broadcast by national networks during the 2018–2019 and 2019–2020 seasons. The dataset for ratings includes information on 490 matches, encompassing details such as date and time, broadcasting network, and the game type (regular season or playoff). It is important to note that certain days featured more than one televised match. Table 1 reveals an average viewership of 2.35 million for these matches, contrasting with an average rating of 1.47. Approximately 68% of the matches were aired during prime time (after 8 pm ET), with 33% occurring in the playoffs and 20% on weekends. ESPN and TNT were the primary broadcasters for the majority of the games.Footnote 7

Table 1 Summary statistics

2.2 Social media intensity and sentiment

To measure the intensity (or exposure) and public sentiment (or tone) regarding the NBA’s involvement with BLM, we utilized data from the social media platform Twitter. Our approach involved searching for all original Twitter messages containing both the terms “NBA” and “BLM,” encompassing variations such as abbreviations or fully spelled-out forms. Specifically, our search criteria included terms like “BLM,” “\(\#\)BLM,” “Black Lives Matter,” “Black Live Matters,” and so forth.Footnote 8 Our focus was solely on original tweets, aligning with the methodology employed in the existing literature (Hatte et al. 2021). All tweet-related information was obtained from the Twitter Research Access API, and we specifically considered messages written in English or those with an undefined language, provided that hashtags were included.Footnote 9 Consequently, we created the variable Tweets NBA\(\cap \)BLM to represent the total daily count of original tweets, serving as our indicator for intensity or exposure. Illustrated in the upper panel of Fig. 1, the tweet count remained consistently minimal until the end of May 2020, after which it experienced a sharp and notable surge. Two distinct peaks were discernible: the first occurred following the resumption of NBA matches at the end of July 2020, while the second was associated with the boycott following the police shooting of a black man in Wisconsin in August of the same year.

Fig. 1
figure 1

Trends in tweets. Notes: The top panel reports the number of Tweets NBA \(\cap \)BLM, whereas the bottom panel reports the number of Tweets NBA and Tweets BLM

The variable Tweets NBA \(\cap \)BLM measures the intensity of public interest regarding the NBA’s engagement with BLM. This intensity could influence TV viewership through two contrasting channels. On one hand, some viewers might opt not to watch matches due to disagreement with the NBA’s involvement. On the other hand, viewers might increase their viewership to express support. In essence, a potential explanatory factor for sports consumption behavior lies not solely in the quantity of tweets but rather in their tone. This is particularly relevant in the current social media era where polarization on sensitive issues has become commonplace (Dawson et al. 2014; Lamla and Lein 2014). Consequently, we conducted a sentiment analysis on each message within the Tweets NBA \(\cap \)BLM variable.

A conventional objective in sentiment analysis involves determining whether the tone conveyed in a given text is positive, negative, or neutral. The advent of advanced language models like BERT and RoBERTa enables the exploration of more intricate data domains, such as texts where authors tend to express their opinions or sentiments less explicitly, or even rely on the use of emoticons (Ho et al. 2020; Hamborg et al 2021). In this study, we employ the XLM-RoBERTa language model, developed by Facebook AI (Conneau et al. 2019). This model integrates cross-lingual language models (XLMs) with RoBERTa, where RoBERTa stands for Robustly Optimized BERT Approach (Liu et al. 2019).Footnote 10 Much like BERT, RoBERTa employs a transformer-based architecture and undergoes pre-training on extensive textual data, enabling it to acquire contextual representations of words and phrases. However, in contrast to BERT, RoBERTa is trained on a more extensive data-set and follows a more efficient training procedure.Footnote 11 The NLP model utilized in this study, XLM-RoBERTa, undergoes pre-training on 2.5TB of filtered CommonCrawl data encompassing 100 languages. For tweet classification, we employed the general-purpose Python library TweetNLP (Camacho-Collados et al. 2022). TweetNLP relies on the transformer-based language models RoBERTa and XLM-R as its backbone, further pre-trained on Twitter-specific corpora. In addition to sentiment analysis, TweetNLP supports various tasks, including emotion recognition, irony detection, etc.Footnote 12 The model employed in TweetNLP assigns a positive, negative, or neutral probability to each tweet, with these probabilities summing up to one.

We categorize a tweet as positive, negative, or neutral based on the sentiment with the highest probability. Consequently, we construct the variables Tweets Positive, Tweets Negative, and Tweets Neutral by summing the daily tweets belonging to each sentiment group.Footnote 13 The top panel of Fig. 2 presents the histogram of Tweets NBA \(\cap \)BLM from June 1, 2020, categorized by sentiment type. In the bottom panel, we illustrate the average sentiment probability over time. Throughout the entire period, neutral sentiment maintains a slightly higher probability. Negative and positive sentiments exhibit comparable probabilities, indicating a high level of polarization with neither sentiment dominating the other.

Fig. 2
figure 2

Tweets and average sentiment probability. Notes: This figure illustrates the number of tweets posted from June 1, 2020, onwards. Top panel: histogram of the total number of Tweets NBA \(\cap \)BLM divided by the type of sentiment. Bottom panel: average daily probability associated with each sentiment - positive, negative, and neutral

We also collected tweets containing the word NBA but not BLM, and those containing BLM but not NBA. Two variables, labeled Tweets NBA and Tweets BLM, were constructed to represent the daily total number of tweets. These measures capture public interest in the NBA and BLM separately. In the bottom panel of Fig. 1, we depict the trends of Tweets NBA and Tweets BLM. The former follows a cyclical path, with more tweets posted at the start of the regular season and during the playoffs. In contrast, Tweets BLM peaked immediately after the death of George Floyd. A comparison between the bottom and top panels reveals that the NBA’s involvement with BLM did not perfectly align with the trends of Tweets NBA and Tweets BLM. This observation is reassuring, suggesting that NBA/BLM-related tweets follow a distinct trend.

3 Baseline model results

Our baseline model has the following form:

$$\begin{aligned} {TV\, Consumption}_{i,t} = \alpha + \beta {Tweets NBA} \cap {BLM}_{i,t-1} + \gamma X_{i,t} + \varepsilon _{i,t}, \end{aligned}$$
(1)

where i represents the match number, and t captures the daily time dimension. The variable TV Consumption denotes the number of viewers (in logs) and ratings, taken in turns.Footnote 14Tweets NBA \(\cap \)BLM reflects the NBA/BLM intensity, serving as a proxy for social media interest in the topic. This variable has been scaled by 1,000 to enhance the interpretation of the coefficients. We incorporate the lagged value, denoted as \(t-1\), under the assumption that information requires time to disseminate and reach social media users. Individuals typically need some time to process and subsequently respond to social media inputs. Additionally, due to the nature of our data, we were unable to precisely control for whether tweets were released on the same day before or after NBA matches were aired.Footnote 15

The variable set X encompasses a comprehensive array of control variables pertinent to both intensity and TV audience. Tweets NBA and Tweets BLM were detailed in the previous section. Key Dates is a binary variable with a value of one on days marked by significant events correlated with media exposure and audience impact. One such event is the restart of NBA matches in July 2020, during which players took a knee for the first time. Another key date is associated with the August 2020 boycott, where several NBA teams refused to play following the shooting of Jacob Blake in Wisconsin. Additionally, we incorporate two dummy variables capturing critical comments made by President Trump regarding the NBA’s engagement with the BLM movement. One corresponds to September 1, when he tweeted the message outlined in the introduction. The second dummy is linked to the well-known interview given to the TV cable channel Fox on August 5.Footnote 16 To account for the COVID-19 context, we introduce the dummy variable After, taking a value of one for matches played after the resumption of the NBA at the end of July 2020 (refer to Fig. 1, top panel). The inclusion of this dummy aims in interpreting the role of Tweets NBA \(\cap \)BLM during the COVID-19 period.

Using regular season matches as the reference group, we incorporate controls for different play-off stages (1st and 2nd rounds, semi-finals, and finals). Dummy variables are employed to account for matches aired during prime time and on weekends. A similar approach is applied to the three primary TV networks broadcasting the highest number of matches: ESPN, ABC, and TNT. Additionally, we introduce a linear daily trend and season-fixed effects into the model. Residuals are clustered, and p-values are computed using wild bootstrap (Cameron and Miller 2015).

Table 2 Social media intensity, sentiment and TV consumption: baseline model

The outcomes are displayed in Table 2, columns (1) and (4), corresponding to Viewers and Ratings, respectively. The point estimates do not show statistical significance at conventional levels, indicating that social media exposure did not impact TV Consumption. Following the examination of the total number of tweets, our focus now shifts to their tone (or sentiment). We consider the subsequent specification:

$$\begin{aligned} {TV Consumption}_{i,t}= & {} \alpha + \zeta {Tweets Positive}_{i,t-1} + \delta {Tweets Negative}_{i,t-1} \nonumber \\{} & {} + \gamma X_{i,t} + \varepsilon _{i,t} , \end{aligned}$$
(2)

where \(Tweets \ Positive\) and \(Tweets \ Negative\) are the variables detailed in Section 2. The findings are presented in columns (2) and (5) of Table 2. In columns (3) and (6), we replace negative tweets with \( Tweets \ Negative \& Neutral\), representing the sum of negative and neutral tweets. A discernible pattern emerges, suggesting that the tone of the message plays a crucial role. Days featuring a higher number of positively toned tweets are associated with an increase in both Viewers and Ratings, while negatively toned tweets are linked to a decrease. However, only the results for positive tweets attain statistical significance, whereas those for negative tweets do not. Quantifying the effect, the log-linear model in Column (2) indicates that for the average number of positively toned tweets, the increase in the number of viewers is 1.16%. However, for games played post the COVID-related hiatus (a more meaningful time frame) the increase in viewership amounts to 6.6%. Alternatively, if we focus only on the three days with the highest number of tweets, the effect is comparable to that of the initial round of playoffs. We deem this effect economically significant as it captures the behavior of casual (non-habitual or marginal) fans who mobilize to watch NBA matches in support of its involvement in the BLM movement.Footnote 17 Our results also align with the perspective that responses to positive and negative messages are asymmetrical (Soroka 2006; Akhtar et al. 2011). To highlight our key findings, we visually present the main coefficients in Fig. 3.

Fig. 3
figure 3

Main results. Notes: This figure illustrates the coefficients for Tweets NBA \(\cap \)BLM, Tweets Positive, and Tweets Negative as reported in Table 2’s columns (1), (2), (4), and (5). The confidence intervals are based on wild bootstrapping and are presented asymmetrically. The lower-bound confidence interval associated with Ratings is truncated at -5

The remaining variables, namely Tweets NBA and Tweets BLM, consistently lack statistical significance. The variable After reveals a decline in TV consumption upon the NBA’s resumption of play, but this effect is only significant for Ratings. Matches during the playoff period attracted a larger audience, with the Finals registering the highest value. As anticipated, the variable Prime Time exhibits a positive and significant impact.

By incorporating a comprehensive set of variables and using high-frequency data, along with lagged values, we aim to provide a robust identification strategy. It remains plausible to contend that the intensity of tweets related to NBA/BLM is endogenous concerning the TV audience. For instance, a match involving the Chicago Bulls and the Los Angeles Lakers might attract more interest - ceteris paribus - than a less prestigious one. Similarly, games played between teams in urban, multicultural areas might draw different audiences compared to those in more rural, homogeneous areas. To account for such characteristics, in Table 3, we replicate the baseline results by incorporating match-up fixed effects.Footnote 18

We present initial results for Viewers followed by those for Ratings. Our findings indicate that the overall impact of social media intensity on TV Consumption is negligible. It is the sentiment of the tweets that holds substantial importance, where tweets supporting the NBA’s engagement with BLM are linked to a larger audience. However, the incorporation of match-up fixed effects results in an absolute increase in the coefficients capturing the relationship between tweet sentiment and the number of viewers. The statistical significance is also heightened. For instance, a significant effect is observed for Tweets(-1d) Neg &Neu.

4 Robustness analyses

In this section, we provide a series of exercises that are intended to challenge the results presented in the baseline analyses.

Table 3 Social media intensity, sentiment, and TV consumption: including game FE

4.1 Alternative metrics of sentiment

The initial robustness-check exercise explores alternative approaches to weighing tweet sentiments. In the baseline model, tweets are categorized as positive, negative, or neutral based on their respective probabilities. The sum of all tweets within each sentiment category is then calculated for each day. However, in the baseline model, a tweet might be labeled as positive even if the negative sentiment has only a slightly lower probability. To address this, we introduce the variables Tweets NBA \(\cap \)BLM Positive, Tweets NBA \(\cap \)BLM Negative, and Tweets NBA \(\cap \)BLM Neutral. These variables are created by multiplying the average daily probability for each sentiment group by the total number of tweets posted in a day. Consistent with our baseline specification, we explore two models: one including only negative tweets and another combining neutral and negative tweets.

Table 4 Alternative methodologies to classify tweet sentiments

In addition, we utilize another widely-used lexicon called Afinn, which assigns sentiment scores to 2,476 English words on a scale from -5 (indicating the most negative sentiment) to +5 (indicating the most positive sentiment). Tweets with scores between -5 and 0 are classified as negative, while those with scores between 1 and 5 are considered positive. We compute a metric representing the daily count of tweets associated with each specific sentiment.Footnote 19 The results from these analyses, presented in Table 4, align with those in Table 2. The magnitude of the point estimates indicates that messages with a positive tone had a more significant impact on the TV audience.

4.2 Social media echo chamber

We will now further exploit the granularity of social media data. The primary regressor in Table 2Tweets NBA \(\cap \)BLM — represents the quantity of original tweets. However, this measure does not necessarily convey: a) the reception of these tweets by Twitter users; or b) the influence of prominent authors. To address the first concern, in columns (1) to (4) of Table 5 for Viewers — and (5) to (9) for Ratings — we introduce alternative metrics based on the popularity of the authors of the messages.

In columns (1) and (5), we introduce Tweets NBA \(\cap \)BLM Pop, which aggregates tweets, retweets, and likes. This variable is 38 times larger than Tweets NBA \(\cap \)BLM, with a notably high standard deviation.Footnote 20 In columns (2) and (6), we further break down Tweets NBA \(\cap \)BLM Pop into positive and negative tweets. Similar to the baseline model, we observe that positively toned tweets are linked to an increase in both Viewers and Ratings.

To account for the popularity of the authors of the messages, in columns (3) and (7), we multiply the daily intensity of original tweets by the average number of Twitter followers of the authors posting on that day. In columns (4) and (9), we separate this overall popularity measure into positive and negative messages. The results indicate that none of these metrics exhibits a statistically significant effect on TV Consumption.

Table 5 Social media eco chamber

Lastly, in columns (5) and (10), we explore the influence of the authors’ identity in the tweets. We introduce the variables Tweets NBA \(\cap \)BLM, Leag+ Pla and Tweets NBA \(\cap \) BLM, Fans. The former signifies the total count of tweets from accounts associated with NBA stakeholders, such as the league, NBA teams, and NBA players. This metric serves as a close approximation to the NBA’s direct support for the BLM movement, reflecting the unfiltered endorsement of the league. On the other hand, Tweets NBA \(\cap \)BLM, Fans represents tweets generated by fans/individuals who are not stakeholders.

This analysis uncovers a positive and statistically significant impact of Tweets NBA \(\cap \)BLM, Leag+ Pla on both Viewers and Ratings, whereas no discernible effect is observed for Tweets NBA \(\cap \)BLM, Fans. If anything, the active engagement of NBA stakeholders played a mobilizing role in attracting viewers, irrespective of the intensity of that engagement.Footnote 21

5 Difference-in-differences

The findings presented thus far indicate that days marked by heightened public interest in the NBA’s involvement with BLM did not result in significant changes in media consumption. To further account for potentially omitted or time-varying factors, we employ a difference-in-differences (DiD) model, using the National Hockey Association (NHL) as a comparative reference. The NHL serves as an optimal counterfactual for the NBA due to several shared similarities. Firstly, both leagues operate from October to June in non-pandemic years. Secondly, the NBA and NHL both suspended play in March 2020 and resumed in July 2020, as illustrated in the top panel of Fig. 2. Thirdly, both leagues witnessed a decline in TV viewership in the preceding seasons. Despite these resemblances, the two leagues significantly differed in their engagement with the BLM movement, as evidenced by the summary statistics in Table 1.Footnote 22 We are aware that other major leagues might be slightly more similar to the NBA in terms of demographic - including race - and political affiliation, compared to the NHL.Footnote 23 The National Football League was not considered, given the limited number of matches played and the absence of overlap in the regular season.

Fig. 4
figure 4

Trends in viewers and tweets for the NBA and NHL. Notes: This figure provides a visual representation of the temporal dynamics of Viewers (in millions) for both seasons and leagues, NBA (black) and NHL (grey). The first sub-sample encompasses the entire 2018–2019 season; the second illustrates the earlier part of the 2019–2020 season until the end of the suspension of all activities due to the COVID-19-related halt (30 July 2020); the last sub-sample (After) covers the period from the end of July to October 2020 (i.e., when both seasons resumed)

Figure 4 illustrates the temporal dynamics of Viewers for both seasons and leagues. The initial subset encompasses the entire 2018–2019 season, the second focuses on the early part of the 2019–2020 season up to the suspension of activities due to the COVID-19 pandemic; the last subset covers the period from the end of July to October 2020, when the seasons resumed. This figure illustrates that the average number of viewers was higher for the NBA compared to the NHL.Footnote 24 However, both leagues exhibit a similar pre-event trend, supporting the application of a difference-in-differences approach. To validate the visual presentation, indicating a shared trend, we estimate the following model:

$$\begin{aligned} TV\ \text { }Consumption_{l,i,t}=\alpha \, Tweets\, League\cap BLM_{l,i,t}+ \theta _{l}+\gamma X_{l,i,t}+\varepsilon _{l,i,t}, \end{aligned}$$
(3)

, where l represents the league, denoted as either NBA or NHL. Tweets NHL \(\cap \)BLM refers to tweets containing the keywords NHL and BLM. The other variables have been discussed in Section 3. Additionally, we incorporate a control variable, Tweets NHL, measuring the number of tweets with NHL excluding those with BLM. The outcomes are presented in columns (1) and (2) of Table 6. The residuals are clustered at the league and date levels, and p-values are determined using the wild bootstrap method (Cameron and Miller 2015). The findings underscore that while \(\theta \) is positive, it is not statistically significant for viewership, whereas it attains significance for Ratings.

In columns (3) and (4), we run a standard difference-in-differences model employing a dummy variable that captures the interaction between NBA and After, labeled as Int. This approach accommodates the possibility that the NBA’s engagement with BLM leads to a change in our dependent variable over the entire season.Footnote 25 Our findings indicate that the NBA did not undergo a decline in viewership (or experience reduced ratings) as a consequence of its involvement with the BLM movement.

6 Conventional media & 2SLS

In this section, we explore intensity measures derived from alternative media platforms, beginning with the use of traditional news outlets (Baker et al. 2016; Caporale et al. 2022). We include one left-leaning outlet, one centrist outlet, and one right-leaning outlet on the US political spectrum.Footnote 26 Specifically, for the left-leaning outlet, we utilize The New York Times, USA Today for the center, and Fox News for the right.Footnote 27 Similar to the Twitter data, we conducted searches for daily articles or transcripts containing the keywords “NBA” and “BLM,” including their spelled-out forms, using Nexis, a news and business research database. The variable NYT &USA &Fox was created to encompass all three sources together. The correlation coefficient between the latter and Tweets NBA \(\cap \)BLM stands at 0.65.

Table 6 Difference-in-differences

Additionally, we collected data on the broadcast transcripts from various US national television and radio news programs, obtained through Nexis. This dataset encompasses networks such as ABC, CBS, CNN, and others. Given the inclusion of Fox News in the previous measure, we opted not to incorporate it in this context.Footnote 28 We denoted this variable as News Transcripts, and it displays a correlation of 0.78 with Tweets NBA \(\cap \)BLM.

Table 7 Conventional media & 2SLS

The final metric of public interest intensity that we examine is Google Trends, developed by Google. This index has found applications across various research domains, including IT, communications, medicine, health, business, and economics (Vosen and Schmidt 2011; Choi and Varian 2012; Jun et al. 2018).Footnote 29 We gather data on internet searches related to the topic “NBA-BLM.” The correlation coefficient between Google Trends and Tweets NBA \(\cap \)BLM stands at 0.61.

The outcomes for the three indicators of conventional media exposure are presented in Table 7. No compelling evidence of a significant statistical impact on TV Consumption is identified. The coefficient for NYT &USA &Fox is negative, reflecting the prevailing negative sentiment in articles and transcripts.Footnote 30 Notably, only Google Trends displays a positive and statistically significant association with Ratings.

To further validate our findings, we employ a 2SLS regression model following a methodology employed by Chalfin and McCrary (2018) in examining the impact of police on crime in the USA. These researchers demonstrated that when two variables represent noisy measures attempting to capture the same phenomenon, it is feasible to use one as an instrument for the other. In our study, we utilize alternative measures of media exposure as instruments for Tweets NBA \(\cap \)BLM. Initially, we regress the latter on all three alternative media measures: NYT &USA &Fox, News Transcripts, and Google Trends. Although this analysis is not displayed, it indicates that News Transcripts is the most substantial predictor of Tweets NBA \(\cap \)BLM. Consequently, we conduct a just-identified 2SLS model, employing News Transcripts as an instrument, while retaining the use of NYT &USA &Fox and Google Trends as a robustness check (reported in the Appendix). The results, presented in columns (7) and (8), show the absence of an effect on TV Consumption. The instrument exhibits a high correlation with Tweets NBA \(\cap \)BLM, and the F-statistic surpasses the threshold suggested by Stock et al. (2002).Footnote 31

7 Conclusions

This paper investigates how the intensity and sentiment of social media posts about the 2020 National Basketball Association’s involvement with the Black Lives Matter movement affected the TV audience. Utilizing data from the widely used social media platform Twitter as a measure of media exposure, we find that the posting intensity is not linked to viewership and ratings. However, positively toned posts are correlated with both TV audience metrics, and the observed effect is economically significant. Conversely, the impact of negatively toned tweets is not statistically significant. Our results remain robust across different metrics for constructing social media sentiment proxies and alternative model specifications.

These findings carry significant implications. Firstly, they underscore the relevance of social media in either supporting or opposing the NBA’s involvement in social causes, revealing an asymmetry in sentiments. Our sentiment analysis highlights the polarization of opinions on this subject, indicating that the NBA’s engagement was not costly, at least in the short term. Secondly, our analysis provides additional insights into the elasticity of demand for sports concerning non-economic factors. We argue that this asymmetry may stem from a positive elasticity among casual (non-habitual) NBA viewers concerning positive sentiments toward NBA involvement. These viewers are likely to align with the NBA’s stances on civil rights and BLM. In contrast, the core NBA fan base exhibits inelastic demand and is unlikely to cease watching NBA matches.

We acknowledge the limitation of our study due to the use of national-level data, which might conceal potential heterogeneous patterns at the state and individual characteristics levels. Future research could explore the implications of social media sentiment on sports viewership, considering individual characteristics and preferences. A data-set collected through surveys could be well-suited for this purpose.