Introduction

Market prices may diverge from the fundamentals for a long time that betting against market sentiment may be too costly, i.e., there are limits to arbitrage (Shleifer and Vishny 1997). Hence, as a driver of prices, market sentiment is important for both investors and policymakers. The Bitcoin market is a sensational market for investigating the role of sentiment. No central bank or authority regulates the supply of Bitcoin, implying that investor sentiments may affect the prices more pronouncedly. Moreover, it is difficult to pinpoint the fundamentals of the Bitcoin market as opposed to that of national fiat currencies.

The main driver of Bitcoin prices is a pertinent issue, especially for investors. One of the drivers of asset prices is investor sentimentFootnote 1 and psychology (Baker and Wurgler 2006, 2007). This sentiment is more pronounced in impacting price fluctuations due to the unregulated nature of the Bitcoin market. Among the significant turmoil in the cryptocurrency markets, the coronavirus (COVID-19) pandemic proved a compelling case ever since the Bitcoin market experienced an influx of new investors (Urquhart 2021). Hence, the Bitcoin market needs to be carefully investigated, while considering the COVID-19 pandemic as a natural experiment opportunity.

Although the literature has investigated the relationship between investor psychology and cryptocurrencies, the changing dynamics due to the pandemic are overlooked as most of the studies (Chen et al. 2020; Goodell and Goutte 2021) focused on the initial phase of the COVID-19 pandemic and used a single time–frequency and a generic sentiment measure. This study employs a news and social media-based Bitcoin-specific sentiment measure provided by Thomson Reuters MarketPsych Indices (TRMI) to investigate the role of sentiment in price discovery before and during the COVID-19 pandemic. A wavelet methodology is conducted to analyze this relationship in different trading frequencies, from one week to 24 weeks. Wavelet transformation reveals the spectral characteristics of time series during different periods by providing information in frequency and time domains simultaneously. The wavelet spectrum visualizes how trends, structural changes, seasonality, predominant business cycles, and short-run dynamics in financial time series evolve throughout the observations (Wen 2002; Merrill et al. 2008). The wavelet methodology can decompose the data in different time frames at a specific frequency so that it does not require stationarity, seasonality, and structural break tests, which are necessary standard tests used in time series analysis to maintain consistent estimations (Masset 2015).

Our results reveal that, regarding the co-movement between Bitcoin sentiment and return, the wavelet coherence spectrum had significant coherency at high frequency (2–3 weeks). The coherency still existed in other frequencies (3–6, 6–9, and 9–18 weeks) for different periods. We employed phase diagrams to check the lead-lag relationship. Phase differences reveal that Bitcoin sentiment led to Bitcoin returns before the COVID-19 pandemic, and the relationship is positive. Interestingly, during the pandemic, although positive coherency existed, the lead-lag relationship became insignificant. This is probably due to the extreme uncertainty during the pandemic and the increasing volatility in Bitcoin prices. We also employed partial wavelet tools to investigate the effects of the COVID-19 pandemic because the pandemic may jointly impact both return and sentiment. We used the number of COVID-19 cases and deaths and the Equity Market Volatility Infectious Disease Tracker (EMVID) index to observe the direct relationship between sentiment and return. Our results reveal that before the pandemic, the sentiment had a positive effect on price, but during the pandemic, the lead-lag relation disappeared. Hence, in the Bitcoin market, sentiment was more pronounced even after considering the impact of the unprecedented COVID-19 pandemic. Regarding the effects of the pandemic, the causal relationship that sentiment leads to return can only be integrated as a short-term trading strategy. However, for longer-term trading strategies, Bitcoin-specific sentiment is uninformative for Bitcoin’s return. Moreover, Bitcoin-specific sentiment and its return are positively correlated at all frequencies.

This paper contributes to the literature on several grounds. First, the wavelet methodology allows us to investigate the relationship between sentiment and return for different time frequencies, whereas the literature mainly employs a single frequency. For example, Chen et al. (2020) reported linear vector autoregressive (VAR) estimation results only for daily data. Anamika et al. (2021) employed dynamic linear regression using pre-COVID-19 weekly data. Guler (2021) used monthly data and reported VAR estimation results. The contribution of this study is unique as it covers a wide range of frequency domains from 1 to 18 weeks. Hence, we can draw implications for different investment horizons and trading strategies.

Second, earlier literature either focused on a shorter COVID-19 period (Goodell and Goutte 2021; Chen et al. 2020; Guler 2021) or ignored the pandemic period (Anamika et al. 2021). However, we analyze a more extended observation period from January 1, 2016 to June 30, 2021, that is, both before and during the pandemic. This enables us to examine the effect of sentiment on Bitcoin prices more accurately, considering the unprecedented COVID-19 pandemic. Furthermore, we employ a partial wavelet coherence approach that allows us to investigate the net effect of investor sentiment on Bitcoin return by factoring out the impact of the outbreak.

Third, contrary to recent literature (Chen et al. 2020; Guler 2021), we employ a Bitcoin-specific sentiment index rather than a generic index. Utilizing a tailored sentiment index of TRMI provides a more accurate investor psychology measure. Therefore, our analysis reveals the market-specific dynamics of Bitcoin. In contrast, the generic sentiment measures mainly used in the earlier literature cannot capture market-specific psychology.

This study’s empirical findings have important implications for the portfolio configurations of Bitcoin investors and its return discovery dynamics during different periods and trading frequencies. Regarding investors’ decision-making process, the time and frequency varying causal relationship between Bitcoin and its sentiment allows investors to adopt dynamic investment strategies and adjust crypto-asset weights in their portfolios more efficiently. News and social media updates affect investors’ sentiments regarding the price discovery process, thus impacting Bitcoin prices. When making longer-term investment strategies, investors should consider that the Bitcoin market is still not mature compared with other financial markets, such as the stock and bonds markets. Considering the high volatility in Bitcoin prices, investors may be better off following longer-term trends/cycles rather than focusing on short-run sentiment dynamics.

The rest of the paper is organized as follows. The next session explains the dataset. Sect. "Data" examines the methodology used. Sect. "Methodology" presents the results and discusses them in detail, while the last section concludes.

Related literature

As there are no central authorities directly regulating the supply of Bitcoin, it is relevant and timely to investigate the role of sentiment in Bitcoin markets as it is the main cryptocurrency that directly affects the prices and returns of all other cryptocurrencies. The decentralized feature of the Bitcoin market implies that investor sentiments may affect its prices more pronouncedly. The recent empirical literature generally agrees that the cryptocurrency markets are inefficient, and the returns are driven by sentiment and news about the crypto market innovations and political decisions. Thus, our theoretical contribution is to reveal the relevancy of the sentiment in the Bitcoin markets.Footnote 2

While the recent sentiment-crypto asset nexus literature differs in relation to the sentiment index, data frequency, and methodology employed, the literature mainly suggests that investor sentiment is a significant determinant of cryptocurrency prices. Although the role of sentiment is well recognized in the literature, finding a proxy to measure sentiment can be challenging as sentiment is not directly observed. Thus, different measures have been employed in the literature:

i.publicly available market data, such as VIX (Chicago Board Options Exchange Volatility Index), EPU (Economic Policy Uncertainty Index), benchmark market return, and volatility (Jo et al. 2020; Paule-Vianez et al. 2020; Shaikh 2020).

ii.author-estimated composite indices based on market-specific data (Baker and Wurgler 2006, 2007).

iii.social media and news search analytics from traditional media, Twitter, Google Trends, and Yahoo (Tetlock 2007; Zhang et al. 2018; Khan et al. 2020).

Generally, recent literature has investigated the relationship between sentiment and Bitcoin returns with social media sentiment measures. Some researchers measured only positive sentiment; for instance, Bouri et al. (2020) analyzed the role of happiness in dynamic volatility correlation among major cryptocurrencies. Others, including Burggraf et al. (2020) and Chen et al. (2020), constructed the FEARS index as a proxy for negative sentiment and other net sentiment measures by incorporating bearish, bullish, and neutral investor behavior from social media data or surveys (Guegan and Renault 2021). Twitter feed data and Google trend analytics are commonly used social media-based sentiment measurement resources (Lamon et al. 2017; Eom et al. 2019; Lopez et al. 2019; Smuts 2019; Burggraf et al. 2020; Chen et al. 2020; Rajput et al. 2020; Subramaniam and Chakraborty 2020; Guegan and Renault 2021). Karavevicius et al. (2017), Lamon et al. (2017), Entrop et al. (2020), and Rognone et al. (2020) also included news data in their sentiment–cryptocurrency market nexus research.

Among studies that have investigated the relationship between sentiment and Bitcoin returns in the pre-COVID-19 era, Lopez et al. (2019), Valencia et al. (2019), Guegan and Entrop et al. (2020), Eom et al. (2019), Keskin and Aste (2020), Khan et al. (2020), and Renault (2021) justified the existing causality between return and sentiment, in which sentiment has predictive power for Bitcoin returns. Moreover, Karalevicius et al. (2017), Smuts (2019), Jo et al. (2020), and Rajput et al. (2020) concluded that this relationship is positive. However, comparing the impact of Bitcoin news on main currencies and Bitcoin returns, Rognone et al. (2020) indicated that Bitcoin return is irresponsive but positively related to news sentiment. In contrast, major currencies react positively and negatively to positive and negative news, respectively. Regarding negative sentiment, Burggraf et al. (2020) stated that the FEAR index has a negative impact on Bitcoin returns. Similarly, Jo et al. (2020) found that VIX and Bitcoin returns are negatively related.

Chen et al. (2020), an early study of the pandemic period, examined the relationship between Bitcoin returns, trading volume, and a new FEAR sentiment structured by the authors from search queries of Google Trends that contain “COVID-19” and “Coronavirus.” The study analyzed the relationship by employing the VAR and Granger causality methodology. The findings revealed that pandemic-related words were included. Fear sentiment can explain negative Bitcoin returns and its high trading volume during the early stages of the pandemic. Regarding this outcome, Chen et al. (2020) argued that Bitcoin should be treated as a traditional asset as it behaves like conventional assets during downturn periods. In another recent study, Guler (2021) investigated the impact of investor sentiment on Bitcoin returns and their conditional volatility by employing asymmetric conditional heteroscedasticity models. She also focused on the COVID-19 period by utilizing monthly data. She used the fear and greed index as a proxy for investor sentiment. She concluded that investor sentiment, Bitcoin returns, and volatility are positively related, and this relationship becomes more robust with the pandemic.

Anamika et al. (2021) used the Sentix index, a survey-based sentiment measure, to investigate how sentiment is connected to major cryptocurrency returns, including Bitcoin. They analyzed weekly data from September 2017 to the start of the outbreak in February 2020 and revealed that Bitcoin returns respond positively to the changes in Sentix sentiment.

Contrary to the studies mentioned above, which mainly employed a single frequency, our methodology allows us to investigate the relationship between sentiment and return for different time frequencies (Chen et al. 2020; Anamika et al. 2021; Guler 2021). Our contribution is unique in the literature as our study covers a wide range of frequency domains from 1 to 18 weeks. Hence, we can draw implications for different investment horizons and trading strategies.

Trading and hedging strategies might change concerning the investment horizon. However, few studies have assessed and compared the interrelation between sentiment and Bitcoin returns in different frequencies. For instance, Entrop et al. (2020) and Guegan and Reanult (2021) reported comparative linear model estimations for 1–15-min and 1-min to 24-h frequencies, respectively. The latter stated that the relationship between social media and Bitcoin returns tends to disappear in relatively low frequencies (not more than one-day trading).

Goodell and Goutte (2021) analyzed the co-movement between daily COVID-19-related deaths and Bitcoin prices by conducting a wavelet coherence analysis. Their data spanned from 31st December 2019 to 29th April 2020, that is, just the beginning of the pandemic. In contrast, our study covers a pandemic period much longer than a year (the unprecedented COVID-19 pandemic), enabling us to evaluate investor psychology of Bitcoin prices more accurately. Moreover, the daily death numbers used in their study are a generic proxy, whereas we employ a Bitcoin-specific sentiment measure. We also cover both before and during the pandemic periods to isolate the effect of the pandemic. Furthermore, we use a partial wavelet coherence approach, which enables us to investigate the net effect of investor sentiment on Bitcoin return by factoring out the impact of the outbreak.Footnote 3

Studies related to the pandemic mostly focus on a shorter COVID-19 period (Goodell and Goutte 2021; Chen et al. 2020; Guler 2021) or ignore the pandemic period at all (Anamika et al. 2021). However, we analyze a more extended observation period from January 1, 2016 to June 30, 2021. This enables us to examine the effect of sentiment on Bitcoin prices more accurately while considering the impact of the COVID-19 pandemic. Furthermore, we employ a partial wavelet coherence approach to investigate the net effect of investor sentiment on Bitcoin return by factoring out the impact of the outbreak. More importantly, contrary to recent literature (Chen et al. 2020; Guler 2021), we employ a Bitcoin-specific sentiment index rather than a generic index. Utilizing a tailored sentiment index of TRMI provides a more accurate investor psychology measure as generic sentiment measures cannot capture market-specific psychology. Table 1 summarizes the methods and findings of recent studies.

Table 1 Summary of literature review

Data

We used data on daily Bitcoin returns,Footnote 4 Bitcoin-specific sentiment data from TRMI, the daily number of COVID-19 cases and deaths,Footnote 5 and the daily EMVID constructed by Baker et al. (2020) in this study. As a newspaper-based index, EMVID is built by searching through a wide range of US newspapers. It focuses on disease-related keywords such as “coronavirus” and “pandemic.” Table 2 summarizes the descriptive statistics of the data. Figure 1 presents seven days moving average number of COVID-19 new cases, new deaths due to COVID-19, and the EMVID index from January 22, 2020 to June 30, 2021.

Table 2 Descriptive statistics
Fig. 1
figure 1

Panel A plots 7 days moving average number of COVID-19 new cases. Panel B shows 7 days moving average number of new deaths due to COVID-19 pandemic. Panel C shows EMVID index. All panels cover the period from 1st January 2020 to 30th June 2021

Our primary variable of interest is the Bitcoin-specific sentiment index produced by Thomson Reuters MarketPsych. TRMI covers multiple sentiment measures, from general sentiment to specific psychological variables, such as fear, stress, and urgency, specific to an asset (Peterson 2013). TRMI is published as two separate contents—news-based and social media-based sentiments. This distinction helps differentiate between professional news from the content produced by retail investors. More than 50,000 news websites regarding news-based sentiments are scanned, including leading sites such as The New York Times, The Wall Street Journal, Financial Times, and Seeking Alpha (Peterson 2013). Social media-based sentiments are produced by scanning millions of social media sites. Using advanced linguistic machine learning algorithms, TRMI is updated every minute, providing a timely measure of investor psychology. These advanced linguistic techniques enable TRMI to present multi-dimensional novel sentiment measures that cannot be captured by bipolar sentiment variables employed by earlier studies (Tetlock 2007; Da et al. 2015; Chen et al. 2020). In this study, we employ the “sentiment” index for Bitcoin to capture the generic sentiment. The sentiment index is defined as the overall positive references, net of negative references, and then normalized to take values between the range of − 1 to 1 (Peterson 2016). Following earlier studies, we employ both news-based and social media-based sentiments to capture professional and retail investor sentiments (Nooijen and Broda 2016; Entrop et al. 2019). Figure 2a depicts the time series plot of 30 days moving average of TRMI investor sentiment and Bitcoin price from January 1, 2016 to June 30, 2021.

Fig. 2
figure 2

a Investor sentiment (blue line) versus Bitcoin price (red line). Notes: Blue line depicts 30 days moving average Investor sentiment on the right axis. Red line depicts Bitcoin price data in logarithmic scale on the left axis. Observation period is from 1st January 2016 to 30th June 2021. b Time varying volatility between Bitcoin returns and TRMI Bitcoin sentiment index. Notes: Results are estimated by a dynamic conditional correlation model of DCC(1,1) model specification. Mean model (ARIMA) and volatility model (GARCH) specification of Bitcoin return and TRMI sentiment are ARIMA(0,1,0) x SGARCH(1,1) and ARIMA(1,1,1) x AVGARCH(1,1), respectively. Both mean and volatility model parameter estimations’ results and their estimation diagnostics are available upon request. Data spans from 1st January 206 to 30th June 2021

Figure 2b depicts time-varying correlations between Bitcoin returns and the Bitcoin-specific sentiment index (TRMI) from January 1, 2016 to June 30, 2021. We structure a specific multivariate generalized autoregressive conditional heteroskedasticity that decomposes the conditional covariance matrix into conditional standard deviations and correlations, called dynamic conditional correlation (DCC) (Engle 2002; Tse and Tsui 2002). According to Fig. 2b, daily Bitcoin returns and investor sentiment (TRMI) were positively correlated during the observation period, and their DCC varied from 15 and 35%. In Fig. 2b, a structural shift occurred in DCC just after the COVID-19 pandemic began in 2020q1. This structural change in conditional correlation implies that the correlation between Bitcoin return and investor sentiment shifted to a higher correlated state by the start of the pandemic.

Methodology

Wavelet transformation, which was initially used as a signal processing and visualization method in engineering fields, enables finance specialists and economists to analyze the spectral characteristics of time series during different observation periods. Superior to Fourier transformation, which reveals the frequency distribution without indicating where these frequencies occur in time, wavelet transformation simultaneously provides information on frequency and time domains. Then, its spectrum visualizes how long-run trends, structural changes, seasonality, predominant business cycles, and short-run dynamics in financial time series would evolve throughout the observation (Wen 2002; Merrill et al. 2008). As wavelet transformation can decompose the data in different time frames at a specific frequency, it does not require stationarity and/or non-trending and/or nonseasonal data assumptions to maintain consistent estimations (Masset 2015). Therefore, starting with Gençay’s (2002, 2005) introductory studies, wavelet decomposition methodology has spread fast in the financial economics literature. More recent studies that have applied wavelet transformation techniques to various issues, including those of Crowley and Hudgins (2021), Bilgili et al. (2020), Aguiar-Conraria et al. (2020), Hu et al. (2020), Bilgili et al. (2019), and Aguiar-Conraria et al. (2018).

Continuous wavelet transformation (CWT) and wavelet power

A wavelet—the so-called small wave—can be defined as a wave oscillating in a bounded time interval. It is a square differentiable and finite function of time, as described below:

$$\eta_{{\left( {a,b} \right)}} \left( t \right) = \frac{1}{\sqrt a }\eta \left( {\frac{t - b}{a}} \right), b \in {\mathbb{R}},{ }a \in {\mathbb{R}}^{ + } \;{\text{and}}\;\eta \left( t \right) \in L^{2} \left( {\mathbb{R}} \right)$$
(1)

where \(\eta_{{\left( {a,b} \right)}} \left( t \right)\) denotes a wavelet daughter with its mother wavelet \(\eta \left( . \right)\), which is a function of dilation (\(a)\), location \((b\)), and time (\(t)\). As the dilation parameter (\(a)\) scales \(\eta_{{\left( {a,b} \right)}} \left( t \right)\) to capture variations more accurately in different frequencies, the location parameter \((b\)) locates the center of the wavelet in time.\({W}\left(ax,b\right)\), representing the CWT of a given time series \(x\left( t \right)\) with respect to a selected wavelet function \(\eta_{{\left( {a,b} \right)}} \left( t \right)\), is calculated as follows:

$$W_{x} \left( {a,b} \right) = \mathop \smallint \limits_{ - \infty }^{\infty } x\left( t \right)\frac{1}{\sqrt a }\overline{{\vartheta \left( {\frac{t - b}{a}} \right)}} dt, x\left( t \right) \in L^{2} \left( {\mathbb{R}} \right)$$
(2)

where the bar indicates the complex conjugation of the mother wavelet function η(.). The mother wavelet function (η(t)) must satisfy zero frequency and unit energy conditions stated below:

$$\mathop \smallint \limits_{ - \infty }^{\infty } \eta \left( t \right)dt = 0\;{\text{and}}\;\mathop \smallint \limits_{ - \infty }^{\infty } \left| {\eta \left( t \right)} \right|^{2} dt = 1$$
(3)

These conditions imply that η(t) should have no zero-frequency element, and oscillations around the mean are expected to be 0. However, these conditions are not sufficient to analyze the CWTs of different time series (Percival and Walden 2000). Reconstructing the original times series from its CWT requires an admissibility condition for the mother wavelet function η(t) of the time series x(t), as presented below:

$$G_{\eta } = \mathop \smallint \limits_{0}^{\infty } \frac{{\left| {{\text{K}}\left( f \right)} \right|^{2} }}{f}df < \infty$$
(4)

where \(b \in {\mathbb{R}}\) and \(a > 0\). The wavelet power of \(x\left( t \right)\) can be defined as \(P_{x} \left( {a,b} \right) = W_{x} \overline{{W_{x} }}\) = \(\left| {W_{x} } \right|^{2}\). Furthermore, while admissibility and unit energy conditions hold, \(P_{x}\) can be derived as follows:

$$x\left( t \right) = \frac{1}{{G_{\eta } }}\mathop \smallint \limits_{0}^{\infty } \left[ {\mathop \smallint \limits_{ - \infty }^{\infty } W_{x} \left( {a,b} \right)\eta_{{\left( {a,b} \right)}} \left( t \right)db} \right]\frac{da}{{d^{2} }}$$
(5)
$$\left\| x \right\|^{2} = \frac{1}{{G_{\eta } }}\mathop \smallint \limits_{0}^{\infty } \left[ {\mathop \smallint \limits_{ - \infty }^{\infty } P_{x} \left( {a,b} \right)db} \right]\frac{da}{{d^{2} }}$$
(6)

where \(\left\| x \right\|^{2}\) measures the preserved energy ( ). This implies that the wavelet power spectrum \(P_{x} \left( {a,b} \right)\) indicates the energy distribution of \(x\left( t \right)\) time and frequency domains. This energy can be interpreted as the variance of \(x\left(t\right)\); hence, the wavelet power spectrum displays the variance distribution of \(x\left(t\right)\), its time, and frequency domains (Aguiar‐Conraria and Soares 2014).

Multivariate wavelet tools

Multivariate wavelet tools can be applied to time series when the interrelation between different time series is examined. They include cross wavelet power (CP), wavelet coherency, and multiple and partial wavelet coherencies. These tools reveal the frequency and time dependency structure between time series. If \(x\left( t \right)\) and \(y\left( t \right)\) are random time series, the CP between them \(\left( {{\text{W}}_{{{\text{xy}}}} \left( {{\text{a}},{\text{b}}} \right)} \right)\) can be calculated as follows:

$${\text{W}}_{{{\text{xy}}}} \left( {{\text{a}},{\text{b}}} \right) = {\text{W}}_{{\text{x}}} \left( {{\text{a}},{\text{b}}} \right)\overline{{{\text{W}}_{{\text{y}}} \left( {{\text{a}},{\text{b}}} \right)}}$$
(7)

\({\text{W}}_{{\text{x}}}\) and \({\text{W}}_{{\text{y}}}\) denote individual CWT of \(x\left( t \right)\) and \(y\left( t \right)\), respectively. Similarly, \(a\) and \(b\) denote dilation (scale) and location parameters, respectively. While Hudgins et al. (1993) defined CP as a measurement of the local covariance between two time series, wavelet coherency \((R_{xy}\)) measures the local correlation. Analogous to the linear correlation coefficient, wavelet coherency takes a value from 0 to 1 (\(1 \le R_{xy} \left( {a,b} \right) \le 0\)), and it visualizes this value by coloring areas on its spectrum. Wavelet coherency between time series \(x\left( t \right)\) and \(y\left( t \right)\) can be written as follows:

$$R_{xy} \left( {a,b} \right) = \frac{{\left| {S\left( {W_{xy} \left( {a,b} \right)} \right)} \right|}}{{\sqrt {S\left( { \left| {W_{x} } \right|^{2} } \right)S\left( { \left| {W_{y} } \right|^{2} } \right)} }}$$
(8)

where \(S\) is for the smoothing operator. A smoothing process at scale and frequency is required for cross and individual wavelet power spectra because the wavelet coherency spectrum might estimate high coherence spuriously (Liu 1994). When factoring out the possible interaction of some other variables on the interrelation between two time series in time and frequency domains, partial wavelet coherency may occur. When partial wavelet coherency occurs between \(x\left( t \right)\) and \(y\left( t \right)\), the effects of \(z\left( t \right)\) eliminated can be written as follows:

$$R_{xy\parallel z} \left( {a,b} \right) = \frac{{R_{xy} \left( {a,b} \right) - R_{xz} \left( {a,b} \right)\overline{{R_{yz} \left( {a,b} \right)}} }}{{\sqrt {\left( {1 - R_{xz}^{2} \left( {a,b} \right)} \right)\left( {1 - R_{yz}^{2} \left( {a,b} \right)} \right)} }}$$
(9)

Complex Morlet wavelet

This study employs a complex Morlet wavelet to detect cycles and variations in Bitcoin return and its sentiment while considering the number of deaths and new cases of COVID-19 to discover co-movement between them in the frequency and time domains. Morlet wavelet is widely preferred in wavelet-based financial and economic time series studies. For example, it was used in the following studies: Percival and Walden (2000), Crowley (2005), Rua and Nunes (2009), Galleati et al. (2014), Madaleno and Pinho (2014), Aguiar-Conraria et al. (2018), and Aguiar-Conraria and Soares (2021). Choosing a more appropriate wavelet mother that suits the time series’ cyclical properties well is crucial as there are major differences in their structures. Shannon, Meyer, Beta, Mexican hat, Morlet, and dozens of others are wavelet families that are widely used as mother wavelets for CWT. The complex Morlet wavelet has two main advantages. First, its complex-valued structure contains additional information about the phase structure of the signals (time series), which allows analysts to compare the location of the signals of different cycles at a specific frequency interval throughout the observation. This comparison supports the test of causal inference between the series by observing differences between the phases of signals, which reveals signals that lead and how they interact. Second, the complex Morlet mother wavelet satisfies the optimal joint frequency–time concentration (Addo et al. 2014; Gallegati et al. 2014). A complex Morlet mother wavelet, which was first introduced by Grossman and Morlet (1984), can be written as follows:

$$\eta \left( {\varepsilon ,t} \right) = \frac{1}{{\pi^{{{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 4}}\right.\kern-0pt} \!\lower0.7ex\hbox{$4$}}}} }}\left( {{\text{exp}}\left( {i\varepsilon t} \right) - {\text{exp}}\left( {\frac{{ - \varepsilon^{2} }}{2}} \right)} \right){\text{exp}}\left( {\frac{{ - t^{2} }}{2}} \right)$$
(10)

where complex Morlet wavelet \(\eta \left( {\varepsilon ,t} \right)\) is a function of the central frequency parameter \(\varepsilon\) rather than the frequency itself. However, if the location parameter is \(\varepsilon > 5\), the value of \({\text{exp}}\left( { - \varepsilon^{2} /2} \right)\) becomes negligibly small; then, Eq. 4 can be rewritten as \(\eta \left( {\varepsilon ,t} \right) = \frac{1}{{\pi^{{{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 4}}\right.\kern-0pt} \!\lower0.7ex\hbox{$4$}}}} }}{\text{ exp}}\left( {i\varepsilon t} \right){\text{ exp}}\left( {\frac{{ - t^{2} }}{2}} \right)\). Moreover, Aguiar‐Conraria and Soares (2014) stated that the central frequency parameter is commonly chosen to be six, that is, \({\upvarepsilon } = 6\), which allows a substitution from scale to the frequency (scale can be written as the reciprocal of the frequency). After this setup, a complex Morlet wavelet can be written as time \(t\) and frequency\(f\), which provides the best available information mix in both domains.

Phase difference

Complex CWT can determine the phase of a signal (time series), that is, the location of its cycles in time at a specific frequency band. Once the phase structures of two time series are computed, their difference would provide evidence of a causal relationship. The phase difference \(\gamma_{x,y}\) between time series \(x\left( t \right)\) and \(y\left( t \right)\) can be denoted as follows:

$$\gamma_{x,y} = arctan\left( {\frac{{{\Im }{ }\left( {W_{xy} \left( {a,b} \right)} \right)}}{{{\Re }\left( {W_{xy} \left( {a,b} \right)} \right)}}} \right)$$
(11)

where\({\gamma }_{x,y}\in \left[-\pi ,\pi \right]\), and \(\mathfrak{I}\left({\mathrm{W}}_{\mathrm{xy}}\right)\) and \(\mathfrak{R}\left({\mathrm{W}}_{\mathrm{xy}}\right)\) denote imaginary and real parts of a given complex CWT\({W}_{xy}(a,b)\), respectively. Phase differences formulate the direction and lead-lag structure of the relationship between \(x(t)\) and \(y(t)\) as follows: if\({\gamma }_{x,y}\in \left[\left(-\frac{\uppi }{2},0 \right); \left(\frac{\uppi }{2},\uppi \right)\right]\), then \(\mathrm{y}(\mathrm{t})\) leads \(x(t)\) and vice versa if\({\gamma }_{x,y}\in \left[\left(0,\frac{\uppi }{2} \right); \left(-\uppi ,-\frac{\uppi }{2}\right)\right]\). If\({\gamma }_{x,y}\in \left[(\frac{-\pi }{2},\frac{\pi }{2})\right]\), then \(x(t)\) and \(y(t)\) are in phase, implying a positive relationship. Conversely, if\({\gamma }_{x,y}\in \left[\left(-\frac{\uppi }{2},-\uppi , \right); \left(\frac{\uppi }{2},\uppi \right)\right]\), then \(x(t)\) and \(y(t)\) are out of phase, implying a negative relationship. Hence, an exact phase difference of either π or − π indicates an exact negative relationship. However, a phase difference of 0 indicates contemporaneous positive co-movement at a specific frequency band.

Results

Figure 3 depicts the individual wavelet power spectra of return on Bitcoin prices and change in Bitcoin-specific investor sentiment (on the right) as well as their daily time plots (on the left). The observation data span from January 1, 2016 to June 30, 2021. Wavelet power spectra separately visualize the relative variance intensities of return and change in sentiment by scaling them in relation to a range of frequency from 2 to 23 weeks. The return has significant wavelet power areas on the upper panel, mostly at a 2–3-week frequency. These areas extended their power to relatively lower frequencies of 3–6 and 6–9 weeks during 2017q3–2018q1, 2020q1–2020q2, and 2021q1–2021q2 time intervals. The latter corresponds to COVID-19 pandemic-related financial outcomes. The first interval coincided with August 2017, its all-time high-price upsurge in December 2017, and its dramatic fall in February 2018.

Fig. 3
figure 3

Left Panel (a) depicts the daily return on Bitcoin prices and change in bitcoin specific sentiment index from 1st January 2016 to 30th June 2021. Right Panel (b) shows the wavelet power spectrum of return and sentiment index data, respectively. Thick black line indicates the boundaries of the cone of influence (COI). Thin black line shades the areas with 10% significant level. These areas have been simulated after 1000 runs. The wavelet power ranges respect to jet color codes from low (blue) to high (red). Periods are depicted as weekly on the vertical axis

The wavelet power of change in sentiment on the lower right panel reveals relatively high power in shorter cycles of 2–3 weeks. On the power spectrum, significant high-power areas existed in 2017q3, 2020q1, and 2020q3 time intervals as they did in the Bitcoin return’s power spectrum. These results for Bitcoin-specific sentiment are similar to recent studies that employ CWT techniques for spectral decomposition analysis of different investor sentiment measures and various asset returns. Zhang and Li (2019), Khan et al. (2020), and Aloui et al. (2021) displayed wavelet power spectra of individual, household, and oil investor sentiment indices in the US, respectively. Their results mainly revealed that the wavelet power of sentiment indices becomes significantly high at relatively lower frequencies but during specific dates related to important events, such as the Global Financial Crisis (2008–2009) and the Tapering action (end of 2013 and early 2014).

Although the asset returns had high wavelet power again at specific dates, larger frequency bands similar to their sentiment proxies or indices such as the Bitcoin returns’ power spectrum appears on the upper right panel. Similarly, Celeste et al. (2020) and Qiao et al. (2020) concluded that the price levels and returns of other main cryptocurrencies have similar wavelet power images.

According to the wavelet power spectra (Fig. 3), Bitcoin’s returns and its specific sentiment had common high variability (high wavelet power) in 2017q3 and early 2020, when the pandemic started at higher frequencies (2–3 weeks periods). These variability measures (wavelet power) still do not provide sufficient information to explain the dynamic interrelation or causality between a single series. The wavelet coherency and phase difference analysis can explain the co-movement between two time series and/or their causality structure in both the frequency and time domains by employing CWT techniques. In Fig. 4, the right panel depicts the co-movement (coherency) between daily return (Bitcoin) and its sentiment data from January 2016 to June 2021 at a 2–23-week frequency band interval. According to the wavelet coherency spectrum (right panel), there were significant coherency areas at high frequency (2–3 weeks). There were also significant coherency areas at 3–9-week frequency bands, predominantly during 2016q4–2017q1, 2018q1–2019q1, and 2020q4. At 9–18-week frequency bands, significant coherence existed throughout 2018q3–2019q1 and 2019q4–2021q1. Significant coherency areas reveal that return and sentiment had high coherency in the short term (2–3 weeks). Co-movement can be observed at lower frequencies (2–3, 6–9, and 9–18 weeks) without disappearing at higher frequencies at specific periods, which lasts at most one year. Finally, the co-movement (coherency) between return and sentiment increased with the COVID-19 pandemic. However, it vanished after 2020q1.

Fig. 4
figure 4

On the left: wavelet coherency spectrum between return Bitcoin prices and change in Bitcoin specific sentiment index. Periods are depicted as weekly on the vertical axis. Thick black line indicates the boundaries of the cone of influence (COI). Thin black line shades the areas with 10% significant level. These areas have been simulated after 1000 runs. The wavelet coherency ranges respect to jet color codes from low (blue) to high (red). On the right: phase differences diagrams of 2–3, 3–6, 6–9- and 9–18-weeks frequency, respectively. Blue line indicates mean of phase difference between return and sentiment index throughout particular frequency intervals. Data spans from 1st January 2016 to 30th June 2021

The left panel in Fig. 4 depicts the mean of phase differences between return and sentiment separately at 2–3-, 3–6-, 6–9-, and 9–18-week frequencies from 2016 to 2021. Phase-difference diagrams help us understand the direction of causality (which series lead or lag) and co-movement (in the same direction or inversely) at specific periods. The phase differences reveal that Bitcoin-specific sentiment generally leads to Bitcoin returns and is positively related at any observation frequency\((-\pi /2,0 )\). This result is consistent with that of Burggraf et al. (2020), who examined the relationship between investor sentiment and Bitcoin for the pre-COVID-19 period and argued that sentiment has a high predictive power on Bitcoin returns. Guegan and Renault (2021) supported the same causality from Bitcoin sentiment to its return at less than 15 min trading frequency from 2018 to 2019.

Throughout the observation period, return and sentiment are positively coherent (phase difference is \(\left(-\pi /2, \pi /2\right)\)), but the direction of causation becomes uncertain at some points. In particular, the phase difference starts to change its state at lower frequencies (6–9 and 9–18 weeks) in 2020q1. These fluctuations may be due to the COVID-19 pandemic and its probable joint effects on investor sentiment and asset returns. Hence, partial wavelet tools are used to re-examine the interrelation between Bitcoin’s return and its specific sentiment, excluding the effects of the outbreak by factoring out the number of new deaths and new cases of COVID-19 as well as the EMVID index. The partial wavelet spectra (on the left) and phase difference diagrams (on the right) are depicted in Fig. 5. After the number of new deaths, the number of new cases, and the EMVID index are factored out from the coherency between sentiment and return, the partial wavelet coherency spectra in Fig. 5 indicate that (i) during the pre-COVID-19 period, the significant high coherency areas in Fig. 5 (1st and 2nd spectra) overlapped with the high coherency areas in Fig. 4 as the number of new deaths and new cases data were not available before the COVID-19 pandemic. Although the EMVID index is incorporated into the analysis for the pre-COVID-19 period, there are some minor differences in the third spectrum compared with that in Fig. 4. (ii) During the COVID-19 period, high coherency regions at 9–18 weeks became more extensive and continuous compared with those in Fig. 4.

Fig. 5
figure 5

On the left: partial wavelet coherency between return Bitcoin prices and change in Bitcoin specific sentiment index by factoring out number of new cases (1st row), number of new deaths (2nd row) and EMVID index (3rd row), respectively. Thick black line indicates the boundaries of the cone of influence (COI). Thin black line shades the areas with 10% significant level. These areas have been simulated after 1000 runs. Partial wavelet coherency ranges respect to jet color codes from low (blue) to high (red). On the right: partial phase differences diagrams of 2–3, 3–6, 6–9- and 9–18-weeks frequency, respectively. Blue line indicates mean of partial phase difference between return and sentiment index throughout frequency intervals. Data spans from 1st January 2016 to 30th June 2021

The right panel in Fig. 5 presents the mean of partial phase differences between daily sentiment and return series at 2–3-, 3–6-, 6–9-, and 9–18-week frequency bands. According to partial phase difference diagrams, in the short run (at 2–3-week and 3–6-week frequencies), the mean of partial phase differences varied between\((-\pi /2,0 )\), suggesting that sentiment led to return, and the partial phase differences were in phase. Similarly, the phase difference at the 6–9-week frequency band fluctuated between \((-\pi /2,0 )\) till 2019q4 as it did at former frequency bands. However, during 2020q2 and 2020q3, the phase difference shifted in locating in the interval\((0, \pi /2)\), indicating a change in the lead-lag relation, while return and sentiment continued to co-move in the same direction (in phase). This change in the phase difference structure suggests that Bitcoin price return led the sentiment from March to September 2020, and they co-move positively at the 6–9-week period.

The frequency in partial wavelet coherency diagrams of significant regions from 2020q1 to 2020q3 at 6–9 weeks supports this change. However, they do not provide strong evidence of a determined and persistent structural change in causality. Because these regions are small and discontinuous after 2020q3, at 9–18-week frequency, the partial phase difference diagrams depict a similar but smoother pattern relative to shorter frequency bands. Hence, they indicate a positive co-movement between sentiment and return during the observation period. Moreover, there were phase differences \(\left(-\pi /2,0\right)\) until the beginning of 2020, suggesting that sentiment leads to return. However, the phase differences were close to 0 from 2020q2 to 2020q4. Although this does not provide evidence of a significant lead-lag relationship between return and sentiment, it proposes a positive contemporaneous impact on each other, starting from 2020q2.

This study repeats the wavelet methodology using data on daily Bitcoin returns; its specific sentiment; and the VIX index, S&P500, and gold prices during the pandemic period to conduct a robustness test. The partial wavelet coherency and phase difference estimations that factor out the number of new deaths and new cases as well as the EMVID index are robust to the analyses that factor out the VIX index, S&P500, and gold. The spectrum of partial wavelet coherency between sentiment and Bitcoin returns indicates similar significant coherency areas and partial phase differences when VIX, S&P500, and gold are factored out in the analysis.Footnote 6

The wavelet coherency between Bitcoin returns and its sentiment appears to be significantly high on specific dates at lower frequencies (vertically located coherency regions) rather than during an extended period at a particular frequency (horizontally located coherency regions). These dates coincide with specific events, which cause high volatility in Bitcoin returns (as depicted in its wavelet power spectrum). This implies that specific circumstances that increase the volatility of return (wavelet power) allow investors to determine a potential co-movement between return and sentiment. Furthermore, Bitcoin-specific sentiment and its return tend to co-move in the same direction, that is, positively correlated, both in the short and long run, meaning that sentiment and return series are significantly pro-cyclical.Footnote 7 This result indicates that a higher return (sentiment) is associated with a higher sentiment (return). Hence, investors should consider that Bitcoin-specific sentiment and its return are predicted to have a positive co-movement at all frequencies.

Additionally, Bitcoin-specific sentiment led to Bitcoin returns both in the short and long run from 2016q1 to 2020q1. However, this causal dependence disappeared at lower frequencies (9–18 weeks) after 2020q2 and shifted into the opposite at the 6–9-week frequency, which implies that Bitcoin return was influenced by sentiment from 2020q2 to 2020q3. Nevertheless, the latter state of causality cannot be justified empirically as the significant regions are relatively small and intermittent. The causality between return and sentiment tended to diverge from its \((-\pi /2,0 )\) range, and it was no longer valid that sentiment led to return at relatively more extended frequencies after 2020q2. Hence, during the COVID-19 outbreak, investors should have noted that the causal relationship that states that sentiment indicates return can only be integrated into short-term trading strategies (up to 6-week frequency). However, Bitcoin-specific sentiment would be uninformative to its return discovery in a 9–18-week frequency trading strategy.Footnote 8 Investors should note that Bitcoin-specific sentiment and its return are estimated to have a positive co-movement at all frequencies.

Conclusion

This paper contributes to the literature in several aspects. First, the wavelet methodology reveals the relationship between sentiment and return in different investment horizons. Second, focusing on the unprecedented COVID-19 pandemic period allows us to investigate the changing dynamics in investor psychology. Third, we employ a partial wavelet coherence approach to explore the net effect of investor sentiment on Bitcoin return by factoring out the impact of the outbreak. Last, phase differences help us to check the direction of the causality between Bitcoin return and sentiment.

By employing a wavelet coherence method, we analyze the relationship between Bitcoin price and sentiment. Bitcoin-specific sentiment indicates high volatility at the beginning of the COVID-19 pandemic as it had done when Bitcoin prices started to rise steeply in 2017. We find that Bitcoin sentiment variability increased in the short run (2–3 weeks) during critical events. The pandemic has significantly affected Bitcoin-specific sentiment variability. Using the COVID-19 pandemic as a natural experiment, we check whether the relationship changes. The results reveal that before the pandemic, sentiment had a positive effect on Bitcoin prices. During the pandemic, the relationship was still positive, but sentiment did not affect Bitcoin prices. Then, employing a partial wavelet approach, we factor out new COVID-19 cases and deaths and the EMVID index to observe the direct relationship between Bitcoin sentiment and price. We find robust results—after factoring out, the positive coherence between sentiment and price still existed, but the lead-lag relationship disappeared. These results reveal that positive coherence existed strongly even during the pandemic, which might be due to the effect of the pandemic on markets.

Sentiment-motivated strategies were expected to perform better than the expected market return before the pandemic emerged. However, the influence of sentiment on Bitcoin returns had disappeared by the start of the outbreak at a lower frequency of 9–18 weeks. Although the positive relationship between the change in sentiment and Bitcoin returns persisted during the COVID-19 pandemic, sentiment lost its Bitcoin price discovery role in lower-frequency trading strategies. This is probably due to the excess information revealed during the pandemic, making it harder to extract information from sentiment. Sentiment-driven strategies preserve their importance at relatively higher frequency trading of up to 6 weeks. These results imply that short-term investment decisions should rely more on sentiment measures. Regarding long-term investment strategies, investors should consider that the Bitcoin market is not mature compared with other financial markets, such as the stock and bonds markets. Due to the high volatility in Bitcoin prices, investors may be better off following long-term trends/cycles rather than focusing on short-term sentiment dynamics. In particular, the lead-lag relationship during the COVID-19 pandemic period may differ (or even disappear) compared with normal times due to the unprecedented uncertainty caused by the pandemic.

From the perspective of the efficient market hypothesis, the Bitcoin market is not efficient as sentiment plays a significant price discovery role at an average of up to a one-month investment horizon. In addition, just after the pandemic started, a contemporaneous co-movement between Bitcoin return and its sentiment arose at 9–18-week trading frequency when the impact of the pandemic was excluded. The results suggest that wavelet methodology can be conducted in future research that investigates Bitcoin’s causal dependence on other types of disaggregated sentiment measures. Future studies can also perform time–frequency analysis of the relationship between other major cryptocurrencies and their specific sentiments.