Introduction

With the advent of advanced high-frequency data analysis techniques, finance scholars have increasingly delved into the intricate dynamics of financial markets, thereby uncovering novel insights into their behavior and interconnectivity. However, accessing high-frequency data poses a significant obstacle for relatively new and immature financial assets, such as cryptocurrencies. As a result, most early studies in this field resorted to employing daily data, inadvertently missing out on the advantages of utilizing data with higher frequencies (Corbet et al. 2018a). Nevertheless, as the infrastructure supporting cryptocurrency trading matured, collecting data at higher frequencies became possible, ushering in a new wave of cryptocurrency research that leveraged these more granular data (e.g., Katsiampa et al. 2022).

Contrary to low-frequency data, high-frequency data helps identify jumps in financial assets (Christensen et al. 2014), facilitates the detection of behavioral biases, accurate information about market microstructures, intraday changes, and patterns that cannot be easily captured but are concealed at daily frequency (see Hansen and Huang 2016). Using high-frequency (hourly) data helps search for arbitrage trades in both short- and long-term horizons (Sifat et al. 2019). Inefficiency in cryptocurrencies can also be better detected in high-frequency (intraday) data than in low-frequency (daily) data (Zargar and Kumar 2019). Moreover, low-frequency data, even daily data, cannot reflect the swiftly changing environments in crypto markets during trading. By contrast, high-frequency data that cover shorter time horizons can improve the precision of models and the understanding of market dynamics, i.e., monitoring price dynamics and formation, trading activity, microstructures, and investor behaviors in the crypto market. From COVID-19 onwards, the literature has evaluated cryptocurrencies at high frequencies from various perspectives (Katsiampat et al. 2022).

The results reported in cryptocurrency research papers, often developed in parallel, create discrepancies stemming from differences in observation periods and variations in the data frequencies used by scholars. Alexander and Dakos (2020) found that 80 of 124 papers published between January 2017 and March 2019 used questionable and/or non-traded data in their empirical analysis. To our knowledge, no previous studies have attempted to comprehensively explore cryptocurrency literature using high-frequency data, synthesize the available findings, and provide meaningful directions for future research in this domain. To fill this gap and enhance our understanding of the role of high-frequency cryptocurrency data in shaping the cryptocurrency literature, this study employs a bibliometric analysis.

The bibliometric technique is the most popular approach for analyzing growing literature, assessing the scholarly contributions of diverse stakeholders, and helping create thematic maps for scholarly applications. In addition, we performed content analysis. We attempt to answer the following questions: What are the prevailing publication trends in using high-frequency data in cryptography-related research? Which highly influential articles leverage high-frequency data in cryptographic studies? Does a surge in the number of publications by authors or journals lead to a proportional increase in the total number of citations and getting more attention? Does publishing in highly influential journals in this field contribute significantly to an increase in the number of citations of the authors’ articles? What are the primary research streams encompassing past and current themes in the cryptocurrency literature using high-frequency data? What are the prospective avenues of research within this field?

This literature review aimed to comprehensively explore diverse academic articles that have utilized high-frequency data to unravel the complexities of cryptocurrency trends, market volatility, and the interrelationships among different digital currencies. By synthesizing and critically analyzing the findings of these pivotal studies, we aimed to provide a nuanced understanding of the implications of high-frequency data analysis in shaping our comprehension of the ever-evolving cryptocurrency landscape.

This study contributes to the literature in several ways. First, we conducted a performance or citation analysis based on article production growth, contributing authors, institutions, countries, and key journals. We also examine the impact of increasing publication authors, and journals on their citations and attention. Further, we assess and identify the fields relevant to crypto market research and their associated links with other assets that dominate the use of high-frequency datasets. This topic is underexplored in the literature. Thus, this study aimed to fill this gap by classifying the body of knowledge, contributing to the rapidly growing cryptocurrency literature, and offering specific directions for further research on cryptocurrency scholars. Our study highlights the benefits of using high-frequency data in cryptocurrency research, and we believe that it will facilitate further research in these innovative areas.

Second, we conduct a meta-analysis of the literature through bibliometric and content analyses, employing qualitative and quantitative methods to synthesize research on the cryptocurrency market, particularly emphasizing the growing utilization of high-frequency datasets. Although several alternative approaches are available for categorizing existing papers based on their primary themes and methodologies, such as a systematic literature review (Corbet et al. 2019), bibliometric analysis is rapidly gaining prominence in finance. Therefore, our contribution builds upon previous studies that utilized alternative methodologies, including a systematic literature review (Ballis & Verousis 2022; Corbet et al. 2019) or survey method (Fang et al. 2022; Holub & Johnson 2018), by integrating both meta-analysis and bibliometric review approaches.

Finally, the novelty of our study lies in its distinction from recently published bibliometric analyses of cryptocurrencies. For example, several studies have exclusively conducted bibliometric analyses focusing on Bitcoin (Merediz-Solà & Bariviera 2019; Ramona et al. 2019; Shen et al. 2020), while others have explored cryptocurrencies from various perspectives, including their growth (García-Corral et al. 2022), economic implications (Yue et al. 2021), financial asset role (Jalal et al. 2021), adoption trends (Sousa et al. 2022), and the evolution from a technological to an economic standpoint (Jiang et al. 2021). However, our current systematic literature review and bibliometric analysis emphasize the use of high-frequency data in cryptocurrency market research. To our knowledge, no bibliometric reviews have focused on this subject.

We find that the highest number of publications is by Ahmet Sensoy (author) in Finance Research Letters (journals). Frequently publishing authors and journals are also influential, but the level of attention is not attached to the frequency of publications. Articles published in influential journals (with many citations per paper) attract more citations. The use of high-frequency data in cryptocurrency research can be divided into four major streams. The first group focuses on return prediction and the measurement of cryptocurrency volatility, which is expected given the lack of established pricing models and the high volatility in cryptocurrency markets. The second focuses on the (in)efficiency of cryptocurrencies. Market efficiency is an inherent feature of the market’s microstructure. The third examines price dynamics and bubbles in cryptocurrencies, and the fourth examines the hedging, diversification, and safe haven properties of Bitcoin.

The remainder of this paper is organized as follows: Sect. "Literature background" discusses the background literature, Sect. "Data and methodology" explains the data and methodology, Sect. "Bibliometric results" describes the bibliometric results, Sect. "Avenues of future research" identifies research hotspots in the field, Sect. "Conclusions" offers future research directions, and Sect. "Conclusions" concludes the paper.

Literature background

Cryptocurrency, initially a niche product in small online communities, has evolved into a digital medium of exchange prized for its economic efficiency and rapid transaction speeds. Its journey can be traced back to the 1980s when David Chaum (1983) introduced a groundbreaking encryption method still used for web-based security. It is the foundation for modern electronic fund transfers (Bunjaku et al. 2017). In the 1990s, several digital currencies were developed; however, many of them, such as Wei Dye’s money, failed to gain traction. A notable exception is PayPal, which has gradually expanded in influence. Notwithstanding past attempts to create virtual currencies, such as E-gold and Liberty Reserve in 1996 and 2006, the title of the first enduring and most famous blockchain-based cryptocurrency is Bitcoin, introduced by Nakamoto (2008). Before the inception of Bitcoin, the concept of cryptocurrency was virtually nonexistent, primarily because of concerns surrounding centralization and the legitimacy of digital currencies (Bakar et al. 2017).

The introduction of Bitcoin has ignited substantial excitement within financial markets, and this phenomenon poses an existential risk to certain conventional financial functions (Härdle et al. 2020). Subsequently, new cryptocurrencies such as Ethereum, Ripple, and Litecoin and stable coins such as Tether have also been introduced, potentially serving as competitors or alternatives to Bitcoin. According to Statista, the cryptocurrency landscape has experienced exponential growth, expanding from 63 in 2013 to the staggering 4,501 options available today. Among these, only the top 20 account for 90% of the worldwide market share, thus achieving global dominance (Ji et al. 2019).

From a notable paper by Nakomoto (2008), who introduced Bitcoin, many papers have been published in economics, finance, computer science, management, and other fields. Cryptocurrency has become a revolutionary payment tool that serves as an alternative to traditional financial markets due to its sage of innovative technology (blockchain) and cryptography,Footnote 1 thereby ensuring a move towards the evolution of digital finance.

These newly developed digital currencies have advantages over conventional currencies, such as decentralized transactions, auditability, and anonymity, to name just a few. Still, they also contain unique risks, such as uncontrolled network operations (Johnson et al. 2014). The recent COVID-19 outbreak has spiked the interest of retail and institutional investors in the crypto market, which can be accessed from home through fintech trading sites, such as Binance and Robinhood (Long et al. 2022). This increased accessibility to digital assets has led to enhanced intraday trading in the cryptocurrency market. Moreover, improved ecosystems such as Cosmos and Polkadot facilitate crypto investors to make cross-blockchain trades that were not previously viable.

The discussion of cryptocurrencies has become a hot topic, attracting readers from industrial and academic communities (Tschorsch and Scheuermann 2016). Initially, researchers focused on understanding blockchain technology (Zheng et al. 2018; Zyskind and Nathan 2015) and the potential of Bitcoin as a substitute for traditional currencies in economics (Böhme et al. 2015; Yermack 2015). The current literature analyzing the financial features of the crypto market shows enormous growth from the perspective of economics and finance (see Chu et al. 2021; Kayal and Rohilla 2021; Shi and Shi 2021) because this market, with billions of dollars of capitalization, serves as a complementary currency with diversification features resembling an investible asset class (Bouri et al. 2017a; Carrick 2016; Gandal et al. 2018). Moreover, with the maturity of the blockchain space, thousands of cryptocurrencies are traded worldwide (Corbet et al. 2019). However, these digital assets demonstrate some standard features, such as heavy tails, long memory, volatility clustering, and adverse leverage effects (Fung et al. 2022), and are speculative investment tools, which means they are prone to explosiveness or speculative bubbles (Bouri et al. 2019a; Shahzad et al. 2022; Cheah and Fry 2015) because of the potential to generate higher returns for investors, thereby attracting a new group of speculators.

Despite the increased accessibility of high-frequency crypto data, most studies employ low-frequency data in the crypto market and aligned research areas. This is highlighted by a survey conducted by Bariviera and Merediz-Solà (2021), which affirms that most studies consider the daily frequency as a natural data frequency, ignoring other options. Among the most relevant studies, only a few have employed high-frequency data and considered them more efficient. Besides the findings of this survey, numerous studies have explored different aspects of cryptocurrencies, predominantly Bitcoin, by using data of daily frequency such as bubbles (Cheah & Fry 2015), hedging capabilities (Dyhrberg 2016b), liquidity, and efficiency (Wei 2018), long memory interdependency (Cheah et al. 2018), structural breaks (Thies and Molnár 2018), transaction activity (Koutmos 2018), tail risk (Borri 2019) and its dependence (Nguyen et al. 2020), long memory effects (Phillip et al. 2019), and price clustering (Urquhart 2017), volatility spillover (Yi et al. 2018), and cyber-attacks, contagion and spillover within crypto market (Caporale et al. 2021).

The crypto market offers 24 h trading throughout the year. In the existing traditional financial market, investors may hold assets for a short period of a few minutes (Glantz and Kissell 2013), and the crypto market is no different. Many online exchanges offer tools and services that enable cryptocurrency traders to trade for minutes or seconds, resulting in high profits (Brauneis et al. 2022; Chu et al. 2019). Moreover, compared to other financial instruments, cryptocurrencies demonstrate higher intraday volatility (movements of 3–5% are common), and a large portion of high-frequency trades are pursued through algorithmic trading bots, which signifies the use of high-frequency data in analyzing cryptocurrencies (Chan et al. 2022; Zhang et al. 2019b).

Several studies have supported the significance of using high-frequency data. For instance, Mensi et al. (2020) suggested that high-frequency data can locate or trace fitful market activities over time. In contrast, daily prices also differ in their calculations, which may create issues. Vidal-Tomás (2021) affirms that daily prices are either closing prices at the last closing hour of a day or the weighted averages of the hourly closing prices of a day (24 h), which are statistically different. The last hour’s daily closing prices may fail to explain the daily behavior of cryptocurrencies owing to characteristic volatility, and these prices may change in the next minute. Meanwhile, weighted average daily prices may distort trader behavior, as they cause a reduction in the volatility of cryptocurrencies by averaging high-frequency (hourly) prices, leading to a persistent time series. Recently, Yarovaya and Zięba (2022) confirmed that the return and volume of cryptocurrencies are not very sensitive to variations in the sampling intervals of high-frequency data (i.e., 5, 10, 20 min, 40 min, and hourly) but to variations between weekly, daily, and intraday data.

In line with the highlighted importance of high-frequency data, the recent research is more inclined to using of high-frequency data in exploring different aspects of cryptocurrencies, such as extreme connectedness between return and volume of cryptocurrencies at higher tail risk (Chan et al. 2022), long memory in return and volatility (Arouxet et al. 2022), rough volatility (Takaishi 2020) market complexity (Drożdż et al. 2020), price clustering (Li et al. 2020; Quiroga-Garcia et al. 2022) potential profitability by using a momentum strategy (Chu et al. 2020), volatility spillover among cryptocurrencies (Sensoy et al. 2021) and exchanges such as Binance, Coinbase, etc., (Alexander et al. 2022; Ji et al. 2021).

Previous attempts have been made to aggregate and review cryptocurrency literature. Ballis and Verousis (2022) study the impact of behavioral factors on cryptocurrencies, while Corbet et al. (2019) examine the breadth of cryptocurrency-related research. Fang et al. (2022) summarize existing research articles on cryptocurrency trading, covering topics such as trading platforms, signals, strategies, and risk management and highlighting related outcomes. Holub and Johnson (2018) map research on Bitcoin and other cryptocurrencies across various disciplines, including technical fields, economics, law, public policy, finance, and accounting. However, to our knowledge, no existing studies have provided a comprehensive literature survey of recent cryptocurrency research employing high-frequency data. Therefore, in this study, we aimed to fill this gap using a bibliometric analysis and meta-review approaches.

Data and methodology

This study employed the Scopus database to search for keywords and extracted relevant articles using a literature retrieval procedure. For data extraction, we can use all platforms such as Scopus, Web of Science (Clarivate Analytics), EconLit, IDEAS/REPEC, and Google Scholar; however, it would necessitate a huge data-cleaning process to combine all datasets (Corbet et al. 2019). Compared with WoS and EconLit, Scopus has had a broader coverage of journals since 1996. Although it has the same features as REPEC, it does not include working papers. Information on citations is available in Scopus, WoS, and REPEC but not in EconLit (Ruane and Tol 2007). However, relying on Scopus compared to other databases is based on its distinct features, that is broader coverage with citation information and more than 15,000 peer-reviewed subjects (Levine-Clark and Gil 2008), which is notably 30% larger than the WoS database (Comerio and Strozzi 2019), and allows for a more subtle analysis compared to Google Scholar, which provides limited bibliometric information for conducting bibliometric analysis.

The keywords are “crypto-market” or “cryptocurrenc*” or “crypto-currenc*” or “digital currenc*” or “Bitcoin” or “BTC” or “Ethereum” or “ETH” and “high-frequency” or “high frequency” or “intra-day” or “intraday” or “hour*” or “tick data” or “5?min” or “30?min.” This search generated 344 documents written in English and published between January 2015 and July 2022. However, we limited our search to articles published or in the press by excluding book chapters, conference papers, editorials, errata, data papers, and review papers, and we got 240 articles. Finally, by further scrutinizing the articles for irrelevant terms such as high-frequency band, closing hours, peak hours, work hours, intraday snapshots, etc., we obtained a final list of 189 articles as a literature sample from 2015 to 2022. This bibliometric dataset was analyzed using a bibliometric technique. The year-wise publication trends are shown in Fig. 1. A significant increase in publications using high-frequency data was observed after 2018.

Fig. 1
figure 1

Publications trend over the years

Bibliometric analysis has gained popularity in several fields and has become a widely used approach in systematic reviews (Ruggeri et al. 2019; Patel et al. 2022). This approach can reveal unknown trends by collecting relevant information on a specific field or aspect of the subject. This can reduce subjective bias and steer authors to highly related fields to improve the quality of the literature review (Zupic and Čater 2015). Moreover, it helps researchers to comprehend a wider picture and creates links between publications from various sub-areas. Therefore, it is suitable for identifying the existing research state and future avenues (Paul and Bhukya 2021).

Bibliometric research methods use quantitative statistical analysis of a set of publications and then map in clusters of core authors, citations, and research institutions (De Bakker et al. 2005; Wang et al. 2021). It commonly focuses on citations, co-authors, co-citations, cartographic analyses, and bibliographic coupling (Zupic and Čater 2015). This study used bibliometric and content analyses to comprehensively examine the use of high-frequency data in cryptocurrency research to offer insight into existing research interests and discover streams of research on emerging topics in the field. Initially, the study employed a citation analysis (no networks) to highlight the volume of publications and their influential role in a specific topic. This step includes a citation analysis of authors, documents, and journals. Furthermore, regression analysis was performed to determine the authors’ and journals’ impact and the attention that they receive by publishing most frequently in this particular field. In addition, the authors’ impact those are publishing in journals that are found influencial this field. In the second stage, a co-authorship analysis was conducted to illustrate the network of collaborating authors in this field. In the third stage, cartographic or keyword co-occurrence analyses are performed to identify the evolution of research streams on a particular topic. Furthermore, co-citation (network) analysis was conducted to draw a scientific map showing the general topics or research streams through each cluster’s content analysis.

Several software tools are frequently used to conduct bibliometric studies easier such as HistCite, CiteSpace, CiteSpaceV, Publish or Perish, SciMAT, BibExcel, VOSviewer, Biblioshiny, Pajek, and Gephi. Considering the diversity of the features of different bibliometric tools, we decided to use two software tools: VOSviewer (van Eck and Waltman 2014) and Biblioshiny (Aria and Cuccurullo 2017). These fit our aim of examining citation counts and classifying the literature on crypto market research utilizing high-frequency data and offering the required analysis output. These software tools were chosen because of their reliability in dealing with large datasets, comprehensive ability to sort and deal with manifold information types, flexible visual impairments, expedient settings, built-in algorithms, and provision of an eclectic range of other available network and visualization analytic tools, such as Pajek and Gephi. Moreover, these tools are superior to other potential software tools for various reasons. For example, CiteSpaceV can be an alternative. Still, it is not that flexible and requires a lot of time to master its pretty complex user interface. At the same time, Histcite is confined to WoS and unable to interpret the Scopus dataset (Fahimnia et al. 2015), Bib Excel requires extra skill set and experience to conduct a modest analysis (Ampah et al. 2021), Pajek requires “. Net” dataset.

Therefore, the software tools opted for this study are easy to use without requiring profound familiarity with clustering techniques and advanced computer skills and are freely accessible to anyone (particularly aspiring scholars) to identify existing trends and gaps in scholarly research articles and publication databases. Biblioshiny has a well-ordered and instinctive interface (Moral-Muñoz et al. 2020). We adopt it for descriptive and citation analysis, including tables and graphs, while we use VOSviewer for science mapping for multiple reasons, such as data and network visualization; readers from different disciplines can easily understand the clustering and map created by it. For identifying clusters in mapping and deriving themes from them, VOSviewer is a compelling graphic interface for users who quickly observe maps (Cobo et al. 2011). We also use Microsoft Excel for frequency analysis and analyze the citation, co-citation, and keyword co-occurrence, thereby constructing a research atlas of the propensity to use high-frequency data in cryptocurrency and exploring the research flash points and frontlines in this field.

To this end, we perform three key bibliometric techniques: co-author, co-citation, and keyword co-occurrence analyses, thereby constructing a research atlas of the propensity to use high-frequency data in cryptocurrency and exploring the research flash points and frontlines in this field. A roadmap for this study is shown in Fig. 2.

Fig. 2
figure 2

Road map from data extraction to analysis

Bibliometric results

This section reports the bibliometric results: the performance analysis of authors, their affiliated institutions, nations, and publishing journals in the form of citation tables using Biblioshiny and network analysis, including co-citation, co-authorship, and keyword co-occurrence networks generated through VOSviewer.

We explain the function of VOSviewer in network analyses. It is a compelling network visualization tool for structures comprising several elements based on a distance-based vision approach. This creates networks or maps based on the data. Generated networks contain items or nodes representing objects of interest, such as authors, sources, countries, keywords, and lines that demonstrate the connection or relationship between them (every pair of items) in the form of co-citation and co-occurrence (van Eck and Waltman 2019). The strength of the individual links was designated as a positive numerical value. This relationship becomes stronger, corresponding to higher values. The sum of the times two items (e.g., authors) are quoted together indicates a link’s strength. The items were grouped into clusters where every node was ascribed to one cluster. The number of resulting clusters depends on clustering parameters, such as the smallest cluster size. This clustering analysis allowed the samples to be engraved into smaller, more homogenous groups that could be recognized and explored. However, attaining the highest homogeneity among separate clusters in our analysis was impossible. The clustering technique requires a relevant and specific algorithm to solve an optimization query or problem (for details see, Waltman et al. 2010). In doing so, VOSviewer utilizes a smart local moving algorithm (Waltman and Van Eck 2013). Therefore, network analyses were performed using VOSviewer software.

The figures or network diagrams generated through VOSviewer use clustering procedures to allot articles with higher strengths to create results (networks) displayed in different colors. As described earlier, nodes or items represent authors, documents, keywords, etc., and their size signifies highly cited or co-cited literature. Edges represent relations in which the closeness of two nodes indicates a higher strength of the items (e.g., authors) and the co-citations or relations between them. However, they are distinguished from each other based on the variation in their total strength links, weighed-degree (computation is a built-in feature of VOSviewer), or other centrality measures such as degree, betweenness, and closeness centralities (can be computed through Gephi and Pajek). Therefore, the network relationship between the item (authors, documents, etc.) clusters helps determine the strength of the connection among the clusters that create a whole cluster analysis diagram of a discipline, field, or specific topic (Li et al. 2017).

Performance analysis

This section presents a performance analysis of the authors, their affiliated institutions, countries, and journals. We also categorize the top-trending and most influential articles published in this field. Furthermore, we examined the impact of the number of authors’ and journals’ publications on their citations, attention, and publishing in influential journals.

Citation analysis

Citation analysis is a widely used method for assessing the impact of a publication (Ding and Cronin 2011) through the citation numbers that an article receives from other authors (Zupic and Čater 2015). From the 189 sample documents, 86 sources, and 395 authors, we report the top 11 authors, journals, and countries, their respective numbers of articles produced (Table 1), and their influence or higher number of citations (Table 2) on the subject. We also categorized the highly cited papers (Table 3) and top-trending papers (Table 4).

Table 1 Highly publishing authors, journals, and affiliated countries
Table 2 Highly influential authors, journals, and countries
Table 3 Highly influential papers
Table 4 Top trending papers

Table 1 shows the classification of authors, journals, and affiliated countries at the forefront of publishing high-frequency cryptocurrency market research. For instance, Ahmet Sensoy was the most prolific author in this area, with the most publications. Finance Research Letters is the most frequent publisher of high-frequency cryptocurrency market research, with the United Kingdom being the top-affiliated country.

Table 2 presents the categorization of authors, journals, and countries of affiliation associated with highly cited articles on high-frequency cryptocurrency market research. For instance, Ahmet Sensoy is this field’s most frequently cited author. Finance Research Letters is the most influential journal on this topic, with the United Kingdom being the primary country of affiliation. From this list of papers, we can infer considerable research on cryptocurrency pricing and trading, focusing on Bitcoin. Researchers are exploring topics such as price discovery, volatility forecasting, price dynamics, correlations, inefficiencies, liquidity, transaction costs, and technical trading. This shows that the cryptocurrency market is gaining significant attention from researchers and investors and that high-frequency data and trading techniques are used to gain insight.

Table 3 presents the classification of highly cited or impactful papers that use high-frequency data and cover various topics. For instance, Brandvold et al. (2015) explored price discovery on Bitcoin exchanges, which has had a major influence on cryptocurrency research using high-frequency data. Urquhart and Zhang (2019) examined the hedge and safe haven abilities of Bitcoin for currencies using intraday data, which is the second most influential paper on cryptocurrency research that relies on high-frequency data.

This list covers the top ten papers that are cited the most on the subject.

Table 4 classifies the top articles that employ high-frequency data and encompass a variety of topics based on their higher citations per year. Urquhart and Zhang (2019) are the most trending papers in cryptocurrency research that explore the investment characteristics of Bitcoin as a hedge and safe haven. Chen et al. (2020) study examining the impact of fear and uncertainty on Bitcoin price dynamics during COVID-19 is the second most widely referenced study. In explaining the impact of articles, it is interesting to note that some articles reported in Table 3, for example, Dyhrberg et al. (2018) and Nakano et al. (2018), are not reported in Table 4 because they have fewer citations per year. Whereas two articles reported in Table 4, Baur and Hoang (2021) and Naeem et al. (2021), are not reported in Table 3 because they are not highly cited, but their per-year citations are higher.

Regression analysis

We use the dataset consisting of authors and journals’ number of publications, the total citations they received for articles, and citations per article to run OLS regressions in which we examine the link between authors and journal publication count to their corresponding citations and attention, as well as influential journals’ role in getting more citations (see Table 5).

Table 5 Regressions for authors’ publication impact

Table 5 presents the regression results for the three panels. In panel A, we show that an author’s increase in the number of articles significantly increases the number of citations while reducing the citations per article. This implies that publishing may improve the author’s total citation count, but at the same time, it can adversely affect the average number of citations per article. The reduction in average citations per article may result from multiple factors, such as publishing in diverse research areas and variation in the importance of topics. For example, groundbreaking articles may get higher citations and reworking, or articles not aligned with the prevailing research trends cannot receive the same level of attention. This result was generated using the total number of authors’ citations (right panel A of Table 5) and citations per article (left panel A of Table 5) as dependent variables and the number of publications as a regressor. Similarly, in panel B, we repeat the same regressions with the journals’ number of articles as the independent variable and the journals’ total citations (right panel B of Table 5) and average citations per journal (left panel B of Table 5) as the dependent variables. We find that an increase in the journals’ number of publications enhances their citations; however, the attention and average citations per journal results are insignificant. Finally, in Panel C, we investigate the impact of publishing in highly influential journals (here, we consider highly cited journals publishing in this field) by running a regression for authors’ total citations (independent variable) and journals’ total citations. The results demonstrate that publishing in journals whose articles are most frequently cited because of publishing on similar topics, fields, or impactful content may increase authors’ citations.

Network analysis

Co-authorship analysis

Co-authorship analysis enables us to create links between authors, institutions, and countries researching a subject or topic (Van Eck and Waltman 2014). In our analysis of co-author networks using high-frequency data in crypto markets, we maintain our criteria at a minimum of five documents per author, which generates a network of 14 authors distributed into four clusters (see Fig. 3). The colors of the clusters varied, with Cluster 1 in red, Cluster 2 in green, Cluster 3 in blue, and Cluster 4 in yellow.

Fig. 3
figure 3

Co-author network of authors researching cryptocurrencies

Keywords co-occurrence analysis

Keywords: words used in a paper to reflect content. The authors mostly index keywords based on the influence of their predecessors and the connections between keywords in the article. When sufficient authors use similar keywords and accept those connections in their articles, they can be used to explore topics or hotspots in a field (Whittaker 1989). Co-word analysis reveals patterns or trends in any discipline by measuring the strength of linkages between keywords. In contrast, the co-word network, based on the frequency of their co-occurrences, reflects content similarities or words with similar meanings (Ding et al. 2001). We created a keyword co-occurrence network by selecting the authors’ keywords with a minimum of five occurrences to identify the evolution of research flashpoints and embedded relevant topics.

The keyword co-occurrence network generated five clusters (Fig. 4). The first cluster in violet focuses on assessing the volatility of cryptocurrencies, specifically Bitcoin, while the second cluster in yellow focuses on the price discovery and market efficiency of Ethereum and the stablecoin, that is, Tether. The third cluster, in red, highlights the role of the volatility and liquidity of cryptocurrencies using high-frequency data. The fourth green cluster focuses on asymmetry in cryptocurrency futures. The fifth cluster in blue shows the safe haven ability of Bitcoin and gold during COVID-19.

Fig. 4
figure 4

Keywords co-occurrence network of cryptocurrencies

Co-citation analysis

Co-citation analysis reveals the relationship between two papers if they are concurrently cited by a third (Small 1973). This shows the relationship by distributing documents with higher strength in the form of a network or clusters using clustering methods. This relationship can be used to examine the commonality, centrality, and progression of various research fields. We performed a co-citation analysis using VOSviewer and selected 20 as the minimum number of citations for an author.

This generated four clusters illustrated in different colors (see Fig. 5). Cluster 1 highlights the focus of studies on return predictions and the measurement of volatility in cryptocurrencies. Cluster 2 (green) includes studies focusing on the informational (in)efficiency of cryptocurrencies. Cluster 3 (red) represents the studies on price discovery and speculative bubbles in cryptocurrencies. Cluster 4, in blue, represents research contributing to portfolio diversification, examining the co-movement, spillover, hedge, and safe haven abilities of cryptocurrencies against other assets.

Fig. 5
figure 5

Co-citation network of authors researching cryptocurrencies

Identification of major streams of cryptocurrencies research

We performed content analysis (Potter and Levine-Donnerstein 1999) to reveal, arrange, and validate the research streams identified by the bibliometric analysis. This analysis reveals the concepts, classifications, evolution, and current research streams in cryptocurrency literature. Based on this, we identify four major cryptocurrency research streams through a co-citation network of cited authors (Fig. 5). Persson (1994) noted that citing publications constitute a research front, while cited publications form an intellectual base.

Return prediction and volatility in cryptocurrencies

This stream of research focuses on the peculiarities of return prediction and volatility forecasting in cryptocurrencies, emphasizing high-frequency data and various volatility models.

Return prediction and volatility in cryptocurrencies have been studied since the introduction of Bitcoin by Nakamoto (2008). The volatile nature of cryptocurrencies is attributed to their newness compared with equity markets (Katsiampa 2017). Various factors have been found to drive Bitcoin prices, such as searching sites (Kristoufek 2013), market forces (Ciaian et al. 2016a), economic fundamentals (Li and Wang 2017), transaction speed (Corbet et al. 2018a), market attention (Urquhart 2018), trading volume (Balcilar et al. 2017; Bouri et al. 2019b), economic policy uncertainty (Demir et al. 2018; Fang et al. 2019), cryptocurrency liquidity (Wei 2018), Google search index (Bleher and Dimpfl 2019; Nasir et al. 2019), and Twitter tweets (Karalevicius et al. 2018; Shen et al. 2019). In addition to price, these factors have been found to predict cryptocurrency returns, volume, and volatility. Various models and machine learning techniques, such as long-short-term memory neural networks (Alonso-Monsalve et al. 2020; Lahmiri and Bekiros 2019; McNally et al. 2018) and support vector machines (Akyildirim et al. 2021), have been employed to predict cryptocurrency returns. Studies show that cryptocurrency liquidity relies on specific variables and is irrelevant to global financial market variables (Brauneis et al. 2021). Studies have also analyzed crypto market liquidity using high-frequency data (e.g., Dyhrberg et al. 2018; among others).

After introducing realized volatility based on high-frequency data by Anderson et al. (2003), and the provision of a framework for its integration into lower frequency return distributions and volatilities (Andersen et al. 2003), it is used to study the stochastic properties of returns (Barndorff‐Nielsen and Shephard 2002). In doing so, several volatility-based models, i.e., ARCH (Engle 1982) and GARCH (Bollerslev 1986), jump detection methods within realized volatility by separating bi-power variation and realized variance (Andersen et al. 2007; Barndorff-Nielsen and Shephard 2003, 2004; Huang and Tauchen 2005), and up-and downside realized measures (semi-variances) entirely based on upward and downward intraday price fluctuations (Barndorff-Nielsen et al. 2006). High-frequency data can accurately assess volatility asymmetries over longer daily return horizons (Bollerslev et al. 2006). Subsequently, methods for filtering out rare and large jumps were introduced (Tauchen and Zhou 2011). However, volatility or volatility jumps can be predicted by several factors, such as geopolitical risk (Gkillas et al. 2018). Moreover, very good and bad news have diverse impacts on volatility. For example, relatively good (intra-daily) news reduces future or next-day volatility (Chen and Ghysels 2011). The prediction of Bitcoin’s realized volatility depends on historical realized volatilities and jumps (Qiu et al. 2021). For instance, Patton and Sheppard (2015) argue that future volatility highly depends on the volatility of past negative returns and that the sign of the jump matters for the impact of price jumps on volatility, such as positive (negative) jumps lower (raise) future volatility. Moreover, co-jumping behavior in the volatility of cryptocurrencies may spark future jumps (Gkillas et al. 2022).

Corsi (2009) recommends the HAR-RV model as a better method to reproduce the key empirical properties of financial returns (self-similarity, long memory, and fat tails). However, several studies (e.g., Dyhrberg 2016a; Guesmi et al. 2019; Katsiampa 2017; Yu 2019; among others) examine the return prediction and volatility of cryptocurrencies using other volatility models (e.g., Bergsli et al. 2022; Chu et al. 2017; Diebold and Yılmaz 2012, 2014; Hansen et al. 2012; among others). Studies also examine volatility cascades in cryptocurrencies against the US dollar (Gradojevic and Tsiakas 2021) and volatility spillover among cryptocurrencies (e.g., Yousaf and Ali 2020).

High-frequency data are an efficient way to predict cryptocurrency returns (Vidal-Tomás 2022). Volume and frequency are inversely proportional (Yarovaya and Zięba 2022). Thus, future research topics include detecting price manipulation in cryptocurrencies at the intraday level, using feature selection techniques to improve accuracy, analyzing the impact of hard and soft forks on intraday reversal and momentum, assessing time-based momentum strategies during price crashes and sharp recovery using high-frequency data (Wen et al. 2022), and examining causal patterns between Bitcoin and other cryptocurrencies. Additionally, the volume-return relationship of cryptocurrencies (Balcilar et al. 2017; Yarovaya and Zięba 2022) can be examined with the inclusion of seasonality and volatility effects.

Similarly, using machine learning approaches such as recurrent neural networks and long short-term memory, new research can predict Bitcoin’s intraday price dynamics and identify its drivers (e.g., online sentiment and media hype). Furthermore, we provide prospective avenues for researchers in this field. For instance, Gkillas et al. (2020) suggest examining the macrofinancial determinants of crypto market jumps, and Grobys and Huynh (2022) propose assessing the impact of jumps in stablecoins. Gkillas et al. (2022) suggest assessing the impact of cryptocurrencies and the jumping behavior of other financial assets (upside, downside, and large and small jumps). Finally, volatility forecasting requires models with different data frequencies. The in-sample and out-of-sample forecasts can be reconciled at the short-, medium-, and long-term horizons.

(In)efficiency in the cryptocurrency market

Studies have identified various factors and developed different models or approaches that can predict cryptocurrency prices, challenging the efficient market hypothesis (all information is already reflected in prices). Research on cryptocurrency price prediction has led to an investigation of the efficiency of the cryptocurrency market based on the efficient market hypothesis. Increased liquidity makes cryptocurrencies inefficient and less predictable (Brauneis and Mestel 2018).

Similar to other assets, stylized facts such as fat-tailed distributions, volatility clustering, long memory features (Bariviera 2017; Bariviera et al. 2017; Chu et al. 2015; Cont 2001; Takaishi 2018; Urquhart 2017), adaptive patterns of long memory (Khuntia and Pattanayak 2020), interactions among returns, volatility, volume, and bid-ask spread (Eross et al. 2019), and asymmetric relations with price trends are observed in cryptocurrencies (Nadarajah and Chu 2017). There is also evidence of strong asymmetric reverting behavior in Bitcoin for negative returns, whereas excessive levels of persistence are revealed for positive returns (Corbet and Katsiampa 2020). In a similar vein, studies examine and analyze the microstructure properties or their impact on cryptocurrencies (e.g., Scaillet et al. 2020), stylized facts including multifractal properties (Stavroyiannis et al. 2019) by utilizing high-frequency data and different methods such as volatility autocorrelation, multi scaling effects, etc., (Drożdż et al. 2018), including Detrended Fluctuation Analysis, and Hurst exponents (Zhang et al. 2019a), and herding behavior during the COVID-19 pandemic and other different situations (e.g., Choi et al. 2022; Jalal et al. 2020; Mandaci and Cagli 2022; Yarovaya et al. 2021). Moreover, Bitcoin may respond more to negative than positive news (Fang et al. 2019). Negative market sentiment predicts bitcoin returns, volatility, trading volumes, and jumps. For instance, bitcoin returns and volatility fall in response to news relevant to crypto fraud and cyberattacks (Rognone et al. 2020). Moreover, evidence indicates that behavioral patterns in Bitcoin-related Google searches and tweets amplify herding, whereas EPU patterns and dependence between foreign exchange and equity markets dampen herding (Philippas et al. 2020). Thus, events like the COVID-19 pandemic may steer cryptocurrency markets away from weak-form efficiency, making it possible to predict future cryptocurrency prices based on past information. Furthermore, long-range dependence on cryptocurrency price returns and their amplification during the COVID-19 period points to inefficiency (Naeem et al. 2021).

In efficient markets (Fama 1970), past information should be reflected in prices; however, Bitcoin’s anti-persistence behavior (Takaishi 2021) indicates a frequent reversal in its moving direction, and asymmetric volatility induces inefficiency (Kristoufek 2018; Phillip et al. 2018), thereby increasing the possibility of achieving abnormal profits (Caporale et al. 2018). However, Bitcoin has predictive information for the price trends of other cryptocurrencies (Wang and Ngene 2020). To study the efficiency of crypto markets, several approaches, such as the Hurst exponent (Carbone et al. 2004), multifractal analysis (Drożdż et al. 2010; Jiang et al. 2018), wavelet and detrended fluctuation analysis (Oświȩcimka et al. 2006), detrending analysis (Oświȩcimka et al. 2014), and multifractal detrended analysis (Kantelhardt et al. 2002) have been employed for various assets (Jiang et al. 2019; Wang and Wang 2021) and or cryptocurrencies (Alvarez-Ramirez et al. 2018; Aslan and Sensoy 2020; Bariviera 2017; da Silva Filho et al. 2018; Dimitrova et al. 2019; Kristoufek 2018; Takaishi and Adachi 2020; Takaishi 2018; Urquhart 2016; Zhang et al. 2018).

Along with the approaches used to study the multifractal properties of Bitcoin, efficiency in the cryptocurrency market is examined through benchmark currency, varying sampling frequency, etc. (Bariviera 2017; Chu et al. 2019; Khuntia and Pattanayak 2018; Jiang et al. 2018; Sensoy 2019; Tiwari et al. 2018; Vidal-Tomás and Ibañez 2018). Despite the various perspectives, the primary conclusion of this study is that Bitcoin is inefficient (Kristoufek 2018; Mensi et al. 2019; Nadarajah and Chu 2017; Urquhart 2016) but may achieve weak-form efficiency over time (Chu et al. 2019; Gradojevic et al. 2022). Based on evidence of increased economic value and trading anomalies during the COVID-19 pandemic, Wen et al. (2022) suggest that traders can form a profitable strategy based on historical intra-day returns, leading to an inefficient cryptocurrency market.

However, as previous studies have examined herding behavior in bitcoin futures markets (e.g., Blasco et al. 2022), investigating stylized facts and empirical results of cryptocurrency futures and spot markets is important. Similarly, herding effects and their intensity should be studied across countries and markets while considering the impact of distinct factors. Examining the macro-financial, sentiment, psychological, and uncertainty determinants of herding behavior in the cryptocurrency market can provide detailed insights. The efficiency of cryptocurrencies in different regimes (upward and downward trends and asymmetric behavior) should be assessed, and the relative efficiency of other assets with cryptocurrencies should be compared. Variations in persistence and market efficiency should be evaluated during the outbreak of COVID-19 and the Ukraine-Russia War, and the anomalies and efficiency of more and less liquid cryptocurrencies should be assessed. Furthermore, as Dong et al. (2022) suggest, anomalous returns (displaying long or short legs) differ from those identified in equity markets, and the fundamental economic mechanisms of market anomalies can be explored in future research.

Price discovery and speculative bubbles in cryptocurrencies

In the preceding section, we discussed how the EMH and asset pricing theory follow the assumption that asset prices move around or mean revert to their fundamental value. The presence of explosiveness or bubbles in asset prices indicates a significant departure from their fundamental values. Similarly, studies have attempted to predict bitcoin prices through different approaches, such as the network-based properties of blockchain or bitcoin transaction graphs (e.g., Greaves and Au 2015). However, according to Tiwari et al. (2018) and Urquhart (2016), Bitcoin’s fundamental value is zero, and individual investors drive the market such that noise traders can distort the price discovery mechanism, leading to inefficiencies. Therefore, in addition to return and volatility forecasting, price explosiveness or bubbles in the cryptocurrency market indicate inefficiency. Moreover, high-frequency data has proven to be a valuable tool for examining price formation, discovery, and speculative bubbles.

This research stream indicates that the literature on cryptocurrencies is vast, with a range of studies exploring their higher volatility than conventional currencies (e.g., Katsiampa 2017; Dwyer 2015) and internally-driven nature (e.g., Baek and Elbeck 2015), making it highly speculative (Yermack 2015). However, some argue that Bitcoin is a currency or synthetic commodity money with monetary value, not a speculative investment (Blau 2017, 2018; Selgin 2015), although it possesses both speculative and financial asset properties (Kristoufek 2015) and competes with other cryptocurrencies (Gandal and Halaburda 2016). By studying Bitcoin’s price formation mechanism, Ciaian et al. (2016b) report that speculators influence Bitcoin prices but that macroeconomic factors only affect them in the short term. Subsequently, Ciaian et al. (2018) argue that Bitcoin’s transactions and speculative demand significantly impact its price formation. Meanwhile, Hu et al. (2019) suggest that prices usually form through negotiations and strategic trading.

However, price deviation across exchanges varies (Brandvold et al. 2015), with price discovery driven by shocks (Makarov and Schoar 2020). Using data from several exchanges, Brandvold et al. (2015) were the first to examine Bitcoin’s price discovery mechanism by applying the econometric approaches of Hasbrouck (1995) and Gonzalo and Granger (1995). They argue that the exchanges with the highest trading volumes are pronounced price leaders concerning information sharing and find the United States BTC-e and Japan’s Mtgox to be price setters or leaders. Pagnottoni and Dimpfl (2019) find that price discovery largely takes place on China-based OKcoin, Baur and Dimpfl (2019), Corbet et al. (2018b) find that spot prices lead the process, and Fassas et al. (2020) and Kapar and Olmo (2019) find that Bitcoin futures are dominant. Giudici and Polinesi (2021) and Giudici and Pagnottoni (2020) find Bitstamp, Coinbase, and Bitfinex to lead and Kraken as a follower exchange. Chang and Shi (2020) found Bitcoin to be a highly dominant cryptocurrency driving price discovery at the market level.

In addition to examining the short-term effects of speculators and macroeconomic factors, researchers have examined the endogeneity of price dynamics and the potential for bubbles. For example, the endogeneity of price dynamics is an important concept for understanding the dynamics of price bubbles because it explains how investors’ expectations can drive prices. The endogeneity of price dynamics is the idea that investors’ expectations of future prices are based on current prices. This can lead to a feedback loop of rising prices resulting from increased expectations of future prices. Price behavior in price discovery and bubbles in the Bitcoin market are two sides of the debate. Thus, explosiveness distorts the price discovery process. Turanova (2017) holds that Bitcoin’s price discovery mechanism is unlikely to operate without an extreme price rise, suggesting its true value is higher than its current level. By contrast, Ché and Fry (2015) argue that the intrinsic value of a cryptocurrency is zero and that its exponential growth is a bubble. As discussed earlier, Bouoiyour and Selmi (2015), Bouoiyour et al. (2016), Corbet et al. (2018a) and Kristoufek (2013) consider Bitcoin a speculative bubble rather than a currency.

Fry and Cheah (2016) find a negative bubble in Bitcoin and Ripple, with Ripple’s spillover intensifying price drops in Bitcoin. Fry (2018) identifies bubbles in Bitcoin and Ethereum and argues that boom-bust cycles could occur without bubbles. Cheung et al. (2015) identified multiple transitory and three major bubbles in Bitcoin, with bursts corresponding to major events in the market. Bariviera et al. (2018) applied permutation information theory quantifiers to study the high-frequency price dynamics of 12 cryptocurrencies during a bubble and bust in the starting month of 2018. They found that cryptocurrencies demonstrate distinct dynamics, indicating they are not homogenous assets. Corbet et al. (2018a) reported Bitcoin to be in the bubble phase after crossing a $1000 value. Bouri et al. (2019a) examine the link between explosive periods in the cryptocurrency market and find multidirectional co-explosivity, not necessarily from older to newer or larger to smaller markets.

There is a space for future research to study the relative price discovery of stablecoins, gold-backed cryptocurrencies, and tokens. These assets can contribute to price discovery and the crypto market efficiency. Moreover, the degree of price deviation and integration between these digital currencies during the onset of specific events, such as COVID-19 and the Ukraine-Russia war, may help broaden the understanding of overall cryptocurrency market behavior. As proposed by Su et al. (2022), it is important to examine the dramatic variations in the price discovery of Bitcoin at the intraday level in the post-COVID period. The explosivity or bubbles in cryptocurrency markets (Cheung et al. 2015), co-explosivity (Bouri et al. 2019a), price explosiveness, and the role of tweets (Shahzad et al. 2022) have been examined. However, the bubble contagion or spillover between cryptocurrencies and other financial assets can also be examined. The impact of supply and demand on the boom and bust of bubbles can be analyzed, along with using supply and demand graphs to simulate boom and bust periods. The connection and coherency of pre- and post-price explosiveness in the crypto market would be another avenue for future research, as information dynamics tools involving the microstructure in the cryptocurrency market may provide early warning signals of crashes. As Vasiliauskaite et al. (2022) suggest, the price and liquidity information dynamics concerning the microstructure in times of price explosiveness can be examined in the future.

Diversification, hedge, and safe-haven assets

In times of crisis, economic and financial shocks distort return predictability and price discovery processes. Smaller cryptocurrencies may start following larger ones (Vidal-Tomás et al. 2019), herding behavior tends to increase with uncertainty (Bouri et al. 2019c), and increasing prices can create a potential market bubble that can burst anytime. Additionally, in the cryptocurrency market, cryptocurrencies portray different patterns of volatility spillover owing to the difference in the structure of return and volatility clusters among them (Sensoy et al. 2021). The highly market cap cryptocurrencies, such as Bitcoin, Ethereum, and Litecoin, act as both transmitters and receivers of volatility in the crypto market (Polat and Kabakçı Günay 2021), while low cap ones, such as Maidsafe Coin may also act as its transmitters (Yi et al. 2018). This shock transmission results in lower diversification benefits. In such differing situations, investors start adjusting their investments and opt for portfolio optimization or hedging strategies. In particular, during periods of downturn, rising demand for hedging and safe haven assets may drive up their prices, leading to inefficiency in the market. Moreover, for the cryptocurrency market, several factors, such as the price formation process and limited liquidity, may lead to unreliable safe haven assets (Smales 2019).

This research stream discusses the cryptocurrency literature, which includes cryptocurrencies in financial asset portfolios as diversifiers, hedges, and safe havens. In addition to the daily interdependencies within cryptocurrency markets (e.g., Ciaian and Rajcaniova 2018), studies have examined diversification opportunities using high-frequency data (e.g., Katsiampa et al. 2022; Sifat et al. 2019; Yarovaya and Zięba 2022). Bitcoin investment possesses the most distinctive features: remarkably high average returns and volatility and a lower correlation with other financial assets. Its inclusion, even in a small fraction, dramatically improves the portfolio’s risk-return trade-off (Brière et al. 2015) and diversifies the portfolios (Guesmi et al. 2019; Platanakis and Urquhart 2020). As a financial asset, it is a useful instrument for portfolio, risk management, and market sentiment analysis based on its value-storing and exchanging benefits (Dyhrberg 2016a). Cryptocurrencies offer diversification benefits over other financial assets in short-term investment horizons (Corbet et al. 2018c). Similarly, Guesmi et al. (2019) report that including Bitcoin, gold, emerging equity markets, and oil in a portfolio can significantly decrease portfolio risk. Bitcoin offers income hedging as well.

Bitcoin displays hedging abilities like gold against market risk (Dyhrberg 2016b). It can act as a diversifier and poor hedge for the US dollar, stock, bond, gold, oil, and commodity markets. In contrast, its safe haven properties are especially visible in extreme downward movements in Asian stock markets. Data frequency matters to Bitcoin investors as its properties vary across time horizons (Bouri et al. 2017a). It acts as a strong hedge and safe haven against fluctuations in commodity market indices (Bouri et al. 2017b). However, Klein et al. (2018) find no hedging ability against a decline in equity prices, and it can hedge against uncertainties (Bouri et al. 2017c). In intraday trading, Urquhart and Zhang (2019) find that it can diversify the risk of the Japanese yen and Australian and Canadian dollars; hedge against the euro, Swiss franc, and pound; and act as a safe haven for the Swiss franc, Canadian dollar, and pound. Shahzad et al. (2019) find a weak safe haven ability of Bitcoin, commodities, and gold against global stock markets, with varying properties across equity market indices. Shahzad et al. (2020) find distinctive safe haven and hedge properties of Bitcoin and gold in G7 stock markets. Bouri et al. (2020a) find evidence that Bitcoin, Ripple, and Stellar are safe havens for US equity indices, whereas Monero and Litecoin do so for the US equity market index. Bitcoin has the least dependence on other assets and a superior safe-haven ability compared to gold and commodities (Bouri et al. 2020b).

Recently, investor concerns over the COVID-19 outbreak have led to research on the role of Bitcoin as a safe haven asset. Corbet et al. (2020a) find that investors are more inclined to view digital currency as a safe haven. However, a few studies (e.g., Conlon and McGee 2020; Corbet et al. 2020b) suggest cryptocurrencies may even increase contagion. Unlike Tether, Conlon et al. (2020) indicate that Bitcoin and Ethereum do not act as safe havens for most equity markets. Yousaf and Ali (2020) reveal that the portfolios of Bitcoin, Litecoin, and Ethereum are costlier and more effective in hedging during the pandemic. Finally, Das et al. (2020) conclude that Bitcoin is better at hedging crude oil uncertainty than commodities and gold but not the US dollar.

It is possible to examine liquidity spillovers in cryptocurrencies to identify diversification benefits. This behavior can also be examined across assets. Diversification opportunities can be identified by examining the linkages among cryptocurrencies, meme stocks, and rare-earth stocks, considering various investment horizons. Since COVID-19 and its induced financial market turbulence have been the initial tests of the safe haven ability of Bitcoin (Kristoufek 2020), it is important to assess the safe haven property of Bitcoin by considering the differential impacts of the intensity and type of crisis. Another possible line of future research could be the assessment of the hedging effectiveness and safe haven ability of Bitcoin and other cryptocurrencies in the recovery phase of the COVID-19 pandemic and during the Russia-Ukraine War. The impact of health- and war-induced crises may provide new insights, depending on the nature of the events. Gkillas et al. (2020) suggest the inclusion of Bitcoin, gold, and oil in a single portfolio to analyze diversification benefits while studying the spillover effects of skewness, kurtosis, and jumps. Baur and Hoang (2021) find that the six largest stablecoins act as safe havens against Bitcoin, and Baumöhl and Vyrost (2020) find a few stable coins that act as safe havens against other top cryptocurrencies. There are many prospective avenues for examining the safe haven ability of stable coins and tokens (compared with unstable cryptocurrencies) against other assets. The symmetric or asymmetric multifractality and detrended correlation analysis of safe haven assets, including cryptocurrencies, stablecoins, and other financial assets, is a potential topic to explore. Since cryptocurrencies are becoming less influential, except Bitcoin, dApps, and Protocols, which are receiving much attention from investors (Katsiampa et al. 2022), upcoming research can focus on these distinctive cryptocurrencies in their market analysis.

Avenues of future research

Research opportunities opened by cryptocurrencies and Bitcoin in the scientific domain have prompted an in-depth examination of their future paths and influences. Various digital currencies, such as stablecoins, altcoins, and gold-backed cryptocurrencies, combined with the gradual progress of legalization in some countries, have led researchers to explore the financial properties of these currencies.

In this section, we offer a few future research questions that do not unconditionally fit into one of the four previously mentioned research streams: What are the causes of the cryptocurrency market crash in 2022? How do investors perceive the cryptocurrency market’s future? Is there a need to revisit pre-crash research findings empirically? How do Bitcoin’s hard and soft forks affect intraday reversal and momentum? What are the causal patterns of Bitcoin and other cryptocurrencies? What is the impact of changing regulations on the cryptocurrency market? What is the impact of an increase in decentralized finance (DeFi) on the cryptocurrency market? What is the impact of institutional investors on the cryptocurrency market? What effects do cryptocurrency derivatives have on the cryptocurrency market? What is the impact of news events on the cryptocurrency market? What are the effects of cryptocurrency mining on the cryptocurrency markets? What are the environmental impacts of cryptocurrency mining? What pricing models are used in cryptocurrency exchanges? What is the role of social media in influencing the cryptocurrency market? What are the effects of algorithmic trading on the cryptocurrency markets? What are the effects of “whales” on cryptocurrency markets? What challenges do stakeholders in the crypto market face, and how can they overcome them?

Conclusions

In recent years, the intensified use of high-frequency data has revolutionized the analysis of cryptocurrencies, which are now actively traded 24/7 across global markets. We adopted a comprehensive bibliometric technique, leveraging co-citation and keyword co-occurrence analyses of 189 pertinent papers extracted from multidisciplinary domains in the Scopus database from 2015 to 2022. Our findings illuminate the expanding literature within the cryptocurrency market and shed light on leading articles, eminent authors, prominent institutions, reputable journals, and influential countries actively contributing to this domain.

Through a meticulous analysis of a bibliometric network and a thorough content assessment of scholarly articles, we identify and delineate four primary research streams that shape the discourse within the cryptocurrency market. These include studies focused on (1) examining the intricate dynamics of return and volatility behavior, encompassing predictive models and the phenomenon of abrupt fluctuations within cryptocurrencies; (2) analyzing the inherent inefficiencies within the cryptocurrency landscape; (3) investigating the mechanisms of price discovery, including its formation, potential manipulation, and the emergence of speculative bubbles within cryptocurrency markets; and (4) evaluating the diversification benefits, hedging strategies, and safe haven properties, particularly on the flagship cryptocurrency Bitcoin.

In our analysis, we meticulously pinpointed the critical gaps and unexplored avenues within each research stream, thereby providing scholars with comprehensive insights into potential future directions for research utilizing high-frequency data in the cryptocurrency domain. It is worth noting that, as Zha et al. (2020) highlighted, the study of investor behavior in the context of cryptocurrencies necessitates a multidimensional and integrated interdisciplinary approach. Therefore, the trajectory of cryptocurrency research transcends the confines of any single discipline, necessitating collaboration across diverse academic fields and methodologies.

It is imperative to acknowledge that while this study offers significant insights into research conducted using high-frequency data on the cryptocurrency market, it is constrained by its reliance on the Scopus database and its focus on the 24/7 trading nature of cryptocurrencies. Moving forward, the scope of this study can be expanded to encompass a broader spectrum of financial assets and alternate databases, such as the Web of Science and SpringerLink, to capture a holistic view of the evolving landscape within the realm of digital financial assets. As bibliometric studies continue to serve as influential tools for highlighting the growing impact of scholarly works and journals within specific domains, our research lays a strong foundation for further exploration and advancement in the realm of cryptocurrency research and analysis.

The insights gained from our comprehensive analysis of high-frequency data in the cryptocurrency market have significant practical and policy implications for stakeholders. As the cryptocurrency landscape continues to expand and evolve, investors, financial regulators, and market participants can leverage identified research streams and trends to make informed decisions, manage risks, and develop robust strategies that capitalize on the unique properties of digital assets. Policymakers can use their findings to establish a more comprehensive regulatory framework that addresses the intricacies of the cryptocurrency market, thereby fostering transparency, stability, and investor protection. By embracing emerging research trends and addressing the highlighted gaps, stakeholders can foster a more resilient and secure cryptocurrency ecosystem aligned with the evolving demands of global financial markets. In addition, our study underscores the significance of interdisciplinary collaboration and the integration of diverse analytical tools in the study of cryptocurrency dynamics, underscoring the need for a cohesive and interdisciplinary approach that incorporates insights from fields such as economics, finance, computer science, and data analytics to ensure a holistic understanding of the multifaceted nature of the cryptocurrency market.