1 Introduction

The use of technology to provide new and improved financial services, or FinTech, has long been a central characteristic of the finance industry. Since the introduction of wire transfers and ATM machines, technology has had a large impact on how the financial sector operates. Yet, in recent years, a new FinTech revolution of unprecedented speed and magnitude has brought major innovations that have the potential to disrupt the financial intermediation sector (Philippon 2016; Thakor 2020).Footnote 1 A key characteristic of this last wave of FinTech innovations is the fact that it is coming from outside the financial sector, as technology start-ups attempt to disrupt incumbent financial institutions.

In this paper, we study whether this increased competition from entering FinTech start-ups affects the incentives to innovate of incumbent financial firms. The causal link between competition and innovation is complex. On the one hand, an increase in competition can discourage innovation by reducing the rents for innovators (Romer 1990; Aghion and Howitt 1992). On the other hand, when competition is weak and pre-innovation rents are high, firms may have less incentive to innovate (Arrow 1962; Gilbert and Newbery 1982). These two competing views are rationalized in Aghion et al. ’s (2005) Schumpeterian growth model in which competition and innovation display an inverted U-shaped relationship: at low levels of competition, the entry of new competitors stimulates innovation, while in markets with already high competition, a further increase in competition has a less positive or even a negative effect on innovation as firms with lower market power can extract fewer rents from new ideas and patents. The former effect, known as the “escape competition effect” means that, in industries where competition is weak, incumbents will try to innovate in order to escape from a situation in which competition constrains profits.

Given that the lack of competition has been an endemic problem in the financial sector, the recent FinTech revolution offers a unique experiment to test these theories (Philippon 2016). As such, our main testable hypothesis is that the threat of entry by FinTech innovators from outside the finance industry leads to an increase in innovation by incumbent financial firms.

One empirical challenge in addressing this question is the lack of an established taxonomy to define a FinTech innovation. As such, we first provide an objective, data-based classification of FinTech innovations by exploiting patent filing. We construct a novel dataset of FinTech patent applications based on patents published in the PATSTAT Global dataset over the period from 2000-2016. We apply several machine learning algorithms to the textual data of patent abstracts to classify patents into five categories reflecting key technological characteristics of FinTech innovations: (1) Data Analytics, (2) Fraud, (3) Insurance, (4) Investments and (5) Payments. Our final classification of patents is based on BERT (Bidirectional Encoder Representations from Transformers) models that are a family of machine learning techniques for natural language processing pre-trained on a large corpus of un-annotated text such as Wikipedia (Devlin et al. 2018).

We identify a sample of around 19,000 patents corresponding to these categories and match them with firm level information from various sources. We find that US-based private firms are responsible for the large majority of FinTech patents and that these firms are predominantly non-financial.Footnote 2 Start-ups account for 13% of patent applications, and payment and data analytics represent the largest categories of FinTech technologies (accounting for 62% of patent applications).

We use this sample of FinTech innovators to examine whether the rise of non-financial start-ups spurs innovation in the financial sector. A key identification challenge in testing the role of competition in innovation is the fact that innovation by both incumbents and potential entrants can respond to the same unobservable technological shocks. We mitigate this concern by building a proxy for competition from outside the financial sector that is less likely to be driven by unobservable technology shocks alone. This measure exploits the time variation in the ratio of FinTech patent applications by non-financial startups relative to the patent applications of incumbent financial firms in the same country over the previous four years. Specifically, our main identifying assumption is that, if financial incumbents are more likely to innovate in periods when the competition from start-ups is stronger (a high ratio of patent applications by startups relative to incumbents), then this increase in innovation cannot be driven by technology shocks alone (which would raise innovation by both incumbents and outsiders thereby making the ratio relatively stable over time).

We find that competitive pressures from non-financial startups increase the probability that financial incumbents become FinTech innovators, that is, apply for a FinTech patent. Specifically, a one standard deviation increase in the ratio of patent applications by non-financial start-ups relative to incumbents over the previous four years, increases the probability that a financial incumbent applies for a FinTech patent by 0.2%. This effect is stronger when we weight the number of patent applications by their relative importance, measured by the forward citations count. These results are robust to controlling for time-varying firms characteristics such as past number of patenting, size, and revenue. We also control for financial incumbents’ propensity to acquire FinTech innovators, as opposed to innovating themselves. Not surprisingly, we find that incumbent financial firms who have acquired FinTech innovators in the past are less likely to apply for FinTech patents.

Furthermore, as many of the patent filing in our sample belong to non-financial firms, we also investigate if competition from start-ups affects innovation relatively more among financial as compared to non-financial firms. We find that the total number of FinTech patent applications is higher among financial firms relative to non-financial firms when competitive pressures from start-ups are stronger. This result suggests that the rise of FinTech startups has disproportionally driven financial incumbents to innovate. Moreover, we show that competition from incumbent technology firms does not have the same effect and is not followed by an increase in FinTech patent applications by financial incumbents.

Our estimations are robust to a wide array of model specifications and alternative definitions of our main explanatory variable. We also include, across all specifications, time fixed effects to account for technology waves that spur FinTech innovations across all firms as well as firm fixed effects that allow us to obtain identification from within-firm variation over time. We further mitigate omitted variable bias by implementing an instrumental variable strategy where we instrument competition by start-ups by the beginning of sample ratio of venture capital investment to GDP in the country.

Our work is related to several branches of literature. First, we contribute to a growing literature that uses taxonomies to automatically classify patents across different categories of innovations (see, among, others Fall et al. 2003; Benzineb and Guyot 2011; Gomez and Moens 2014; Grawe et al. 2017; Li et al. 2018; Shalaby et al. 2018; Hu et al. 2018; Abdelgawad et al. 2019; Lee and Hsiang 2019; Mann and Püttmann 2021). Chen et al. (2019) were the first to propose a typology to classify a sample of 2 million patents filed with the US Patent and Trademark Office (USPTO) into seven categories of FinTech innovations: cybersecurity, mobile transactions, data analytics, blockchain, peer-to-peer, robo-advising, and the internet of things. They train and employ several families of machine learning algorithms such as support vector machines and neural networks to identify a sample of FinTech innovations by US-based firms or individuals. Chen et al. (2019) then investigate the value of these FinTech innovations by studying stock market responses around the news of patent filings. They find that the FinTech patents that are most valuable to innovators are blockchain, cybersecurity, and robo-advising. Xu et al. (2020) also train machine learning classifiers such as random forest classifiers to identify FinTech patents in a sample of patent applications to the USPTO from 2014-2018.

We extend these efforts to automatically classify FinTech innovations in several directions. For example, we provide an enhanced FinTech patent classification by training and evaluating different types of deep learning classifiers, with a focus on BERT models (Devlin et al. 2018). Specifically, while other studies that examine patent texts have used traditional machine learning approaches such as linear support vector machines or neural network models, recent research in computer science has shown that deep learning approaches outperform such methods (Abdelgawad et al. 2019). We thus aim to provide a new and improved taxonomy of FinTech innovations by using deep learning methods.

Second, we employ our classification of FinTech patents to investigate how competition drives innovation in the financial industry. The relationship between innovation and competition is of long-standing theoretical and empirical interest (Aghion et al. 2001, 2005; Aghion and Griffith 2008). A large Schumpeterian growth literature models two effects of entry on incumbent innovation. In industries with high levels of competition, entry reduces incumbents’ incentives to innovate as it decreases innovation rents and reduces the probability of surviving entry (Schumpeterian effect). In industries with low levels of competition, where leaders compete neck-to-neck, an increase in competition has a positive effect and encourages incumbents to innovate in order to acquire a lead over their rival in the sector (escape competition effect) (Aghion et al. 2014).

Empirical works have documented these two effects across different countries, time periods, and industries. Studies on the effects of competition-enhancing reforms or foreign competition find an overall positive effect (see, for example Aghion et al. 2005; Ayyagari et al. 2011; Bloom et al. 2016), but negative effects are also found (Liu et al. 2014; Dorn et al. 2020). Moreover, the effect depends on the level of technological advancement in the industry (Aghion et al. 2009; Liu et al. 2014) or type of innovation (Tang 2006). A key empirical challenge in this literature is that entry is endogenous to the innovation of incumbents. The FinTech revolution offers an interesting setting to test this relationship, as the entry of start-ups was not likely driven by innovations of large financial incumbents. For example, Cojoianu et al. (2020) study the determinants of the emergence of FinTech start-ups across 21 OECD countries and find that creating a FinTech venture is positively related to the regional productivity and new knowledge created in the information technology (IT) sector, but not that created in the financial sector.

Our work also relates to a growing literature that looks at how the dramatic growth of FinTech start-ups has shaped the financial landscape (see Aaron et al. 2017; Alt et al. 2018; Temelkov 2018; Stulz 2019; Allen et al. 2020). For example, Hornuf et al. (2021) study how banks interact with FinTech start-ups using detailed information on strategic alliances made by the 100 largest banks in Canada, France, Germany, and the UK. They show that banks are more likely to form alliances when they have a well-defined digital strategy or employ a chief digital officer (see also Li et al. 2017; Tseng and Guo 2022). Kowalewski and Pisany (2020) document the rise of FinTech start-ups across a large cross-section of countries and highlight the importance of technological advancements and university-industry collaborations in FinTech creation and activity.

The remainder of the paper is organized as follows: Section 2 presents the patent data and the machine learning techniques used to classify the FinTech patents, while we the identification strategy and methodology in Section 3. Section 4 presents the results, and Section 5 concludes.

2 Patent data and classification of FinTech innovations

In this section, we describe the process followed to create a FinTech taxonomy as well as to train the machine learning and deep learning models used to classify FinTech patents. We also provide some descriptive statistics of the identified dataset of FinTech patent applications.

2.1 Patent data

Patent data is obtained from the BvD Orbis database that sources information from PATSTAT GlobalFootnote 3. We retrieve all patents applications filed between 2000 and 2016 that belong to Classes G and H of the International Patent Classification (IPC), and cover areas related to digital computing that underlie the FinTech technologies classified in this paper. This results in a sample of 6.8 million patents. To narrow down the search, we use a text-based filtering to identify patents that are plausibly related to financial services. We obtain from Chen et al. (2019) a list of 516 financial terms based on Campbell R. Harvey’s Hypertextual Finance Glossary and the online Oxford Dictionary of Finance and Banking. We select patents that contain at least one keyword from this list in their abstract. After this filtering, we identify a potential number of 38,228 patents that are related to financial services.Footnote 4

Fig. 1
figure 1

WordCloud of the 100 most frequent financial terms. The figure shows the 100 most frequent financial terms in the set of 38,228 patents. The size of a term is proportional to the number of patents in which the term appears

The distribution of the 100 most frequent financial terms in our dataset is illustrated in Fig. 1, where the size of each term is proportional to the number of patent documents in which that term appears. The most frequent terms include payment, compliance, trading, banking, insurance, investment, money, among others.

2.2 FinTech innovation taxonomy

There is a wide range of financial products and services that fall under the FinTech umbrella. Currently, there is no comprehensive, well-accepted taxonomy to analyze this sector. Hence, we build a FinTech taxonomy by comparing the taxonomies from several academic works, industry reports and market maps (such as Mellon 2015; Levy 2015; Young and Treasury 2016; CBInsights 2017; Eckenrode and Friedman 2017; Chen et al. 2019; Haddad and Hornuf 2019).

Our taxonomy aims to capture innovations that pursue the integration of more sophisticated IT tools and data science solutions in financial products. We include five broad FinTech categories: Data Analytics, Fraud, Insurance, Investments and Payments. Applications corresponding to these categories, together with an example of a patent filing abstract in each category are shown in Table 1. Our FinTech taxonomy is aligned with that of Chen et al. (2019), although we use a broader range of financial services, such as including insurance. We also exclude some applications which are not necessarily specific to the financial sector such as blockchain or the internet-of-things.

Table 1 FinTech Taxonomy. The table presents examples of the abstracts of the patent filings for the five FinTech categories we identify
Table 2 Performance results of traditional and deep learning models. Table shows the out-of-sample performance of each machine-learning method. Accuracy is one minus the ratio of incorrect category predictions to total observations. Precision is the ratio of true positives to the sum of true positives and true negatives. Recall is the ratio of true positives to the sum of true and false positives. F1 score is the harmonic mean of Precision and Recall. Columns (1)-(7) correspond to traditional machine learning models that have been commonly employed to classify patent data. These are: Linear Support Vector Machines (Linear-SVM), Gaussian Support Vector Machines (Gaussian-SVM), Naïve Bayes (NB), k-nearest neighbour (kNN), random forest (RF), gradient boosting (GB) and Multilayer Perceptron (MLP). Column (8) is a voting classifier that aggregates predictions of the Linear-SVM, Gaussian-SVM, and neural network models and uses the prediction made by the MLP, in case of ties. The deep learning methods employed are: Convolutional Neural Networks (CNN), recurrent neural networks (RNN) and Bidirectional Encoder Representations from Transformers (BERT). The best result for each criteria and category is marked in bold

2.3 Machine learning algorithms to classify FinTech patents

To train machine learning and deep learning models to identify FinTech patents, we manually labeled a subset of our patent dataset. Specifically, we manually labeled 2,350 FinTech patents (500 patents in each of the following categories, insurance, fraud, investments and payments; and 350 patents in the category of data analytics). Furthermore, we manually labeled a subset of 1,500 non-FinTech patents. We use the 3,850 manually labeled patents to train and test different types of text classifiers.

We focus on BERT models as our main text classifier, given the success of the deep learning approaches on patent classification tasks (Li et al. 2018; Shalaby et al. 2018; Sun et al. 2019; Lee and Hsiang 2019). BERT is a language model that uses a deep bidirectional transformer encoder architecture to encode sentences and their tokens into dense vector representations (Vaswani et al. 2017; Devlin et al. 2018). BERT models are pre-trained on a large corpus of un-annotated text (such as Wikipedia) using two self-supervised learning tasks: masked word prediction and next sentence prediction. A generic BERT model can then be further pre-trained and fine-tuned for specific natural language processing tasks.

As it is common in the literature, we do not simply rely on a single approach, but rather use several different families of algorithms, which have been successfully applied to classification problems in other domains. Specifically, we compare the results obtained from BERT models with text classifiers such as CNN (Convolutional Neural Networks) and RNN (Recurrent Neural Networks) that use BERT embedding as input. Finally, to allow comparison with previous FinTech patent classifications in Chen et al. (2019) and Xu et al. (2020), we also contrast the results of the deep learning models with traditional machine learning approaches such as support vector machines (SVM), neural networks, Naïve Bayes, k-nearest neighbour (kNN), random forest, and gradient boosting. The Online Appendix presents details on the machine learning methods we employ.

To evaluate the performance of the various models that we train, we use four standard metrics: accuracy, precision, recall and F1-score.Footnote 5 We also report these metrics for each of our five FinTech categories to determine what categories are the easier or harder to identify. The performance results on the test data are presented in Table 2. The first eight columns correspond to traditional machine learning models, while the last three depict deep learning models. The best result for each method and evaluation criteria is highlighted in bold, while the best overall result in a row is marked with a star. Based on the average score for all performance criteria, we find that the best machine learning model is the MLP model, while the best deep learning model is the BERT model.

Table 3 Filtering steps in the construction of the dataset of patent applications. The table summarizes the number of patent filing selected in each step of the construction of the dataset of patent applications

We thus use the BERT model to classify our sample of patents. Table 3 summarizes the number of patent filings removed at each step of our methodology. The classifiers we employ allow us to identify a sample of 19,055 unique FinTech patent applications.

2.4 Descriptive statistics

Table 4 shows the frequencies of FinTech patent applications filed by various groups of innovators. From the sample of 19,055 patents identified by the machine learning algorithms described in the previous section, 3,166 (20%) were filed by individuals. This leaves a sample of 15,889 patent applications by firms that will be the focus of our analysis. Among these, private firms are the most important group of FinTech innovators by accounting for 68% of the applications. Non-financial firms are also responsible for a large share of innovations in FinTech, while startups account for 13% of the applications.

Furthermore, while the 15,889 patents are filed at the European and US Patent offices by firms across 58 countries, the large majority of filing (77.5%) belong to US-based firms. The geographical split is similar if we organize the data based on the priority country of the patent (first country in which the patent application was filed), as opposed to the headquarters of the inventor, with 82.1% (13,050) of applications having the US as priority country, 8.5% (1,351) Europe, and 9.4% (1,488) in the rest of the world.

Table 4 FinTech innovation by innovator type and region. This table shows the frequencies of FinTech patent filings by various types of innovators and across regions. Financial firms represent those firms belonging to the NACE Rev2 two-digit industry codes 64, 65 and 66, respectively. Start-ups are defined as firms with a founding date of no more than eight years prior to the patent filing date

Table 5 shows the distribution of innovators across the different types of technologies classified as FinTech innovations. Data analytics and Payments represent the largest share of FinTech innovations (62%). Table 5 also shows that private and non-financial firms dominate all types of FinTech technologies. Similarly, startups innovate evenly across the 5 types of technologies considered.

Table 5 FinTech innovation activity by technology category and innovator type. The table shows the frequencies of FinTech patent filing by various types of innovators for each technology category. Financial firms represent those firms belonging to the NACE Rev2 two-digit industry codes 64, 65 and 66, respectively. Start-ups are defined as firms with a founding date of no more than eight years prior to the patent filing date

Next, we show some trends in the evolution of FinTech innovation over time. Figures 2 and 3 display the 6-months moving averages of patent filing from 2000 to 2016. Figure 2 shows the evolution for financial versus non-financial firms. Non-financial firms dominate the patent filing numbers in all periods, but the pace at which applications are filed by these two groups of firms shows considerable variation over time.

Fig. 2
figure 2

FinTech innovation over time by financial versus non-financial firms. The figure shows the 6-month moving average of patent applications between 2000 and 2016 by financial firms (corresponding to NACE 2 digits codes 64, 65, and 66) versus non-financial firms

Fig. 3
figure 3

FinTech innovation over time by non-financial start-ups versus incumbent financial firms. The figure shows the 6 months moving average of patent applications between 2000 and 2016 by incumbent financial firms (corresponding to NACE 2 digits codes 64, 65 and 66) versus start-ups outside the finance industry. Start-ups are defined as firms less than eight years old at the time of the patent application

Fig. 4
figure 4

FinTech patents by technology type. The figure shows the total number of applications by year and type of FinTech innovation

Figure 3 plots the evolution of FinTech patents by company age and sector. It shows that while non-financial start-ups and financial incumbents have similar patent filing rates prior to 2007, incumbents dominate filing in the second half of the sample. Our empirical strategy exploits this time variation in patent filings across these two sectors.

Finally, Fig. 4 shows the evolution of patent applications by type of FinTech technology. It is apparent that the increase in FinTech innovations occurred across all types of technologies.

3 Identification strategy

Our empirical strategy aims to identify how competition from FinTech startups affects innovation efforts by incumbent financial firms. This strategy faces two main identification challenges. The first is the fact that entry can be endogenous to innovation. In other words, the innovations and potential productivity gains in the finance industry can drive the innovations in financial technologies by entrants. Several arguments suggests that this is not a very likely concern. First, there is ample anecdotal evidence that the most recent FinTech innovations, such as digital advisory and trading systems, artificial intelligence, machine learning, peer-to-peer lending, equity crowdfunding and mobile payment systems, have originated outside the incumbent financial industry (Philippon 2016; Vives 2019).

Second, the financial industry is rather inefficient. Philippon (2015) estimates that the unit cost of financial intermediation in the US has remained around 2% for the past 130 years. As such, innovations by financial incumbents have done little to increase productivity and translate into lower costs for the end users. Therefore, the argument that the most recent FinTech revolution is not driven by innovations already happening in the financial sector is plausible.

The second threat to identification is that innovations by both start-ups and financial incumbents are driven by the same unobservable technology shocks. This is a more plausible concern and ideally one would use an exogenous shock to identify FinTech innovation among start-ups. However, in the absence of such a shock, we address this concern by controlling for the amount of patent applications by incumbent competitors in the financial industry. The argument is that, if unobservable technology shocks are driving innovation in both incumbents and outsiders, then the rate of innovation between the two groups would be relatively stable over time. This, however, does not seem to be the case. Figure 5 shows the evolution of the share of patent applications by (non-financial) start-ups as a fraction of applications by financial incumbents. We observe a higher share of FinTech patents by start-up firms in the beginning of the sample (2000-2008), which reverses in the second part of the sample. Moreover, there is significant variation in this ratio across time, which is key to our identification strategy. We exploit this time variation as a source of competitive pressure from outside the financial sector. As such, if financial incumbents are more likely to innovate when the innovation by outsiders is higher than the one from insiders (a high ratio of patent applications by startups vs incumbents) then it is less plausible that the higher patent output by financial incumbents is driven by technology shocks alone. We can thus argue that incumbents respond to competition by innovating themselves (escape competition effect).

Fig. 5
figure 5

Ratio of patent applications by start-ups relative to incumbent financial firms (6 months moving average)

We further deal with omitted variable bias by considering not only the number of patent applications, but also their importance captured by the citation count (Akcigit and Kerr 2018). This is done by scaling our measure of competition by the future citation count of start-ups’ patents at the end of the sample. This measure of competition weighted by forward citations is meant to distinguish small incremental FinTech patents from those that bring significant innovations to the sector. As such, we expect competition to be stronger when the innovation by outsiders is more important.

Our baseline strategy is as follows:

$$\begin{aligned} \text {FinTech Innovation} _{i,t} = \alpha _i + \gamma _t + \beta _1 \text {Competition Startups}_{c, t-4 \rightarrow t-1} + \theta 'X_{i,t} + \epsilon _{i,t}, \end{aligned}$$
(1)

where \(\text {FinTech Innovation} _{i,t}\) is an indicator variable equal to one if firm i is an incumbent financial institution and has applied for at least one FinTech patent in year t. We define an incumbent financial firm as one having the 2-digit NACE Rev 2 codes of 64, 65 and 66, and is not a start-up, that is, founded more than eight years prior to the patent application date.

Our main independent variable is the ratio of FinTech patent applications by start-ups relative to financial incumbents (see also Etro 2004; Cockburn and MacGarvie 2011; Czarnitzki et al. 2014), that is computed as:

$$\begin{aligned} \text {Competition Start-ups}_{c, t-4 \rightarrow t-1} =\frac{ \sum _{\tau =t-4}^{t-1} \text {Startup Patents}_c}{\sum _{\tau =t-4}^{t-1} \text {Financial incumbents}_{-i,c}}. \end{aligned}$$
(2)

In the baseline model, we consider the sum of patents in the previous four years (excluding the current year) by firms in the same country as financial incumbent i. The sum in the denominator in (2) excludes the patent applications by incumbent i, which we add as a separate explanatory variable.Footnote 6 One concern with the specification in (2) is that the number of innovations by incumbent i is positively correlated with the sum of innovations of other financial incumbents. However, this correlation would put a downward bias on our estimations that would lessen the effect of competition by startups. Another concern is that the shock affects incumbents versus outsiders to a different extent over time. In particular, periods in which the unobservable shock drives more innovation among incumbents leads to lower competition ratios resulting in a downward bias. On the other hand, periods where the shock leads to more innovations among start-ups create an upward bias. It is not a priori clear which direction the bias would be across the entire time period, and, as a result, it is difficult to argue that the bias is consistently upwards or downwards over the entire period.

The model in Eq. (1) also controls for time varying and invariant covariates at the firm and country level. The vector \(X_{i,t}\) includes measures of the past innovation and financial performance of the firm. We control for the lagged sum of patent applications by firm i over \([t-4, t-1]\) to account for the propensity of an incumbent to produce FinTech innovations. We also include measures of firms size, such as log of Total Assets and Operating Revenue. We also control for the propensity of a firm to acquire FinTech innovators as opposed to innovating itself. This is captured by the sum of acquisitions, mergers, follow-on funding, management buyout (MBO), or joint ventures by financial incumbent i with a FinTech innovator over the period \([t-4,t-1]\). Finally, we control for time-invariant firm characteristics through firm fixed effects (\(\alpha _i\)), as well as year fixed effects that capture waves of innovation across all countries in a given year.

Table 6 Descriptive statistics. The table presents the descriptive statistics for the two main independent variables, i.e., the Competition Start-ups ratio and the Competition Start-ups (citations) ratio, as well as the firm control variables

3.1 Financial data

We match our new database of FinTech patent applications by public and private firms with several datasets containing financial data. The primary source of company data is Orbis (Bureau van Dijk) that we complement with data from Capital IQ to obtain a longer time series for a subset of listed firms. We keep in our sample countries that have at least one patent filing from a financial incumbent that yields a final dataset of 15,889 patent applications from 3,446 unique firms over the period from 2000-2016.

To control for the propensity to acquire FinTech innovators, we also collect data on the universe of acquisitions, mergers, follow-on funding, MBO, or joint ventures, where at least one of the members of the deal is a FinTech innovator identified by our machine learning algorithm. We obtained this data from Zephyr.

A summary of the financial information by types of FinTech innovators is presented in Table 6. Financial incumbents tend to be larger and have higher operating revenue than the average firm in the overall sample. They also have, on average, more patents and acquire more FinTech innovators. At the end of Table 6 we also present the descriptive statistics for our two main independent variables the Competition Start-ups ratio and the Competition Start-ups (citations) ratio, with the latter being the forward citations weighted variant of the former.

4 Results

The results from our baseline model in Eq. (1) are presented in columns (1)-(3) of Table 7, in which the dependent variable is an indicator equal to one if a financial incumbent applied for a FinTech patent in year t.

Table 7 FinTech patent applications and competition from start-ups. The dependent variable in columns (1)-(3) is an indicator variable equal to one if a financial firm applied for a FinTech patent in year t, while in columns (4)-(6) it is the total number of FinTech patent applications by a firm in year t. Financial firms correspond to the 2-digit NACE Rev 2 codes: 64, 65 and 66. Competition Start-ups is the ratio of the sum of FinTech patent applications by non-financial start-ups relative to that of financial incumbents over the period \([t-4,t-1]\). Competition Start-ups (Citations) is the Competition Start-ups ratio scaled by the number of forward citations of non-financial start-ups. Number Start-ups is the ratio of the total number of start-ups applying for a FinTech patent over the total number of incumbents during the previous four years. Finance is a dummy variable for financial firms. Past Patents is the sum of FinTech patent applications by incumbent firm i over \([t-4, t-1]\). FinTech Acquisitions is the sum of acquisitions, mergers, follow-on funding, MBO, or joint ventures by firm i of a FinTech innovator over the period \([t-4,t-1]\). Standard errors clustered at the country level are reported in parentheses

Our main explanatory variable is competition by start-ups (Column 1) measured by the ratio of patent applications by start-ups relative to those by other financial incumbents over the period \([t-4, t-1]\). We find evidence that following periods with relatively higher FinTech innovations by non-financial start-ups, financial incumbents are more likely to innovate themselves. To ease interpretation of the point estimates, we have standardised throughout all estimations the measure of Competition Start-ups. As such, the point estimate in Columns (1) suggests that a one standard deviation increase in the Competition Start-ups ratio increases the probability of being an innovating financial incumbent by 0.2%.Footnote 7

We consider next the importance of FinTech innovations by start-ups by weighting the number of patent application by their forward citations count (Column 2). Again, we find that financial incumbents are more likely to apply for FinTech patents particularly when the quality of patent applications by outsiders is higher. The point estimate in Column (2) suggests a 1.1% higher likelihood of financial incumbent innovation for a one standard deviation increase in the Competition Start-ups (Citations) variable.

Our main proxy for outside competitive pressures counts the number of patent applications. This approach, however, does not account for the fact that these applications might come from a small number of very successful innovators. Consequently, in column (3), we replace the main independent variable with an alternative one that considers the number of startup firms applying for a FinTech patent as opposed to the total number of applications in a period. Specifically, we recompute the ratio in Eq. (2) by considering the number of startup firms as opposed to the number of patent applications in the numerator. The results are very similar, suggesting that the applications counted in the Competition Start-ups measure come from different firms, making the two alternative proxies of competition almost identical in size.

In columns (4)-(6) of Table 7, we extend our baseline model by using the the total number of FinTech patent applications by a firm i in year t as the dependent variable. Since our sample is comprised of both financial and non-financial firms that are FinTech innovators, we modify the model in Eq. (1) by considering the effect of competition from start-ups on financial firms relative to non-financial firms. Specifically, the model tested is as follows:

$$\begin{aligned} \begin{aligned} \text {FinTech patents} _{i,t}&= \alpha _i + \gamma _t + \beta _1 \text {Competition Start-ups} _{c, t-4 \rightarrow t-1} \\&\quad \beta _2 \text {Competition Start-ups} _{c, t-4 \rightarrow t-1} \times \text {Finance} + \theta 'X_{i,t} + \epsilon _{i,t}, \end{aligned} \end{aligned}$$
(3)

where \(\text {FinTech patents} _{i,t} \) is now the total number of patents applications by firm i in year t, measured as \(log(1+\)Total Patents\(_{i,t}\)).Footnote 8 The coefficient of interest, \(\beta _2\), is that of the interaction between the \(\text {Competition Start-ups} _{c, t-4 \rightarrow t-1}\) ratio and a dummy equal to one for financial firms (\(\text {Finance}\)). This interaction term captures the increase in the number of patents applications by financial incumbents relative to that of non-financial firms. We control separately for \(\text {Competition Start-ups} _{c, t-4 \rightarrow t-1}\), while the \(\text {Finance}\) dummy is absorbed by firm fixed effects.

The results in column (4) of Table 7 show that this interaction term is positive and statistically significant, suggesting that competition from start-ups results in a relatively higher increase in the number of FinTech patent applications by financial firms as compared to non-financial ones. We next repeat the same exercise as in columns (2)-(3) and replace the number of patents by start-ups as the main explanatory variable with (i) its value weighted by citations (column (5)) and (ii) the number of firms as opposed to the number of patent applications (column (6)). The results are robust across all specifications and indicate that competitive pressures from start-ups push financial incumbents to innovate relatively more than non-financial firms.

All estimations in Table 7 control for an array of firm characteristics and fixed effects. We include a measure of Past Patents as the sum of patent applications by incumbent i over \([t-4, t-1]\). As expected, firms that produce FinTech innovations in the past are more likely to continue innovating. We also control for whether a financial incumbent has acquired or merged with a FinTech innovator over the past four years. Again, as expected, firms that buy FinTech innovators are less likely to apply for new patents themselves. Firm size and operating revenue are positively correlated with the probability of applying for a FinTech innovation, although the effect is not statistically significant across all specifications. Finally, all estimations include firm fixed effects, which means that we obtain identification from changes within a firm over time. We also include year fixed effects throughout to capture innovation waves that can occur in all countries in a given year.Footnote 9

Overall, the results in Table 7, point to an escape competition effect, whereby financial incumbents innovate when the competitive pressure from non-financial start-ups relative to other financial incumbents is higher. In Appendix Fig. 7, we show that this result is even stronger when we consider a longer time horizon for measuring the relative number of total patent applications by start-ups to financial incumbents. Figure 7 presents the coefficient estimates for our measures of competitive pressures in (2) over different time frames: \([t-2, t-1]\), \([t-3, t-1]\) and \([t-5, t-1]\), respectively. We find the strongest effect for the last interval, suggesting a relatively large time lag between the patent applications by competitors and financial incumbents’ FinTech innovation.

The results in Table 7 are also robust to alternative model specifications. Specifically, the model in Eq. (1) was estimated using a fixed effects linear probability model that includes both financial and non-financial firms applying for FinTech patents. As such, the average marginal effect of a covariate is a linear combination of (i) the estimated coefficient of the group of financial incumbents (which innovate at different points in time), and (ii) zero, which is the corresponding coefficient of the group of non-financial companies (Beck 2020). An alternative specification is to estimate a fixed effects logistic model, which would only employ the subset of data that has variation in the dependent variable, i.e., the group of financial incumbents that apply for FinTech patents in some (but not all) periods in the sample. We present the results employing a fixed effects logistic model in Table 8, columns (1)-(3) for all definitions of FinTech competition. The results are consistent with the baseline model and the estimates in column (1) suggest that a one standard deviation increase in Competition Start-ups increases the probability of innovating of incumbent financial firms by 2.8%.

Table 8 FinTech patents and competition: alternative model specifications. Columns (1)-(3) present the coefficient estimates of a logit model where the dependent variable is an indicator variable equal to one if a financial firm applied for a FinTech patent in year t. Columns (4)-(6) show coefficient estimates of a negative binomial model where the dependent variable is the total number of FinTech patent applications of a firm in year t. Financial firms correspond to the 2-digit NACE Rev 2 codes: 64, 65 and 66. Competition Start-ups is the ratio of the sum of FinTech patent applications by non-financial start-ups relative to that of financial incumbents during the previous four years. Competition Start-ups (Citations) is the Competition Start-ups ratio scaled by the number of forward citations of non-financial start-ups. Number Start-ups is the ratio of the total number of start-ups applying for a FinTech patent over the total number of incumbents in the previous four years. Finance is a dummy variable for financial firms. Firm controls include: the sum of patent applications and the sum of acquisitions of FinTech innovators over the last four years, as well as the log of Total Assets and Operating Revenue. Standard errors clustered at the country level are reported in parentheses

Furthermore, the dependent variable in model (1) is the log-transformation of patent count. However, while widely used, this linear transformation can produce biased results (see Cohn et al. 2021; Wooldridge 2010, page 726). We thus re-estimate Eq. (1) using a negative binomial model, which is suitable for modeling count variables, especially for over-dispersed outcome variables.Footnote 10 The results using this alternative specification are presented in Table 8, columns (4)-(6), and yield similar results as our baseline model.

Overall, the results in Tables 7 and 8 point to a strong effect of entry by non-financial start-ups on the propensity to innovate of financial firms. We investigate next whether this effect corresponds to a particular type of FinTech technology.

4.1 Type of FinTech innovation

To investigate whether the effects of start-up competition on innovation are driven by a particular FinTech technology, we re-estimate the models in (1) and (3) for each of the five FinTech technologies classified by the machine learning algorithms in Section 2. More specifically, we look at the probability that financial incumbents apply for a specific type of FinTech patents when competition from non-financial start-ups in the same type of financial service intensifies. The model estimated is thus:

$$\begin{aligned} \text {FinTech Innovation} _{i,f, t} = \alpha _i + \gamma _t + \beta \text {Competition Start-ups}_{c, f, t-4 \rightarrow t-1} + \theta 'X_{i,t} + \epsilon _{i,t}, \end{aligned}$$
(4)

where \(\text {FinTech Innovation} _{i,f,t}\) is an indicator variable equal to one if company i is an incumbent financial institution and has applied for at least one FinTech patent in category \(f \in \) {Data Analytics, Fraud, Insurance, Investments and Payments} in year t. \(\text {Competition Start-ups}_{c, f, t-4 \rightarrow t-1}\) is now the sum of patent applications by non-financial start-ups in a FinTech category f divided by the sum of patent applications by other financial incumbents in the same category over the past 4 years. We estimate Eq. (4) for each category f separately.

Fig. 6
figure 6

Competition and type of FinTech technologies. Figure a shows the point estimates of \(\beta \) in Eq. (4) for each category f separately, where \(f \in \) {Data Analytics, Fraud, Insurance, Investments and Payments}. Figure b shows the point estimates of \(\beta _2\) in Eq. (3) for each category f separately

Figure 6(a) presents the point estimates of \(\beta \) in Eq. (4). The results are not as precisely estimated given the lower number of observations within each category. We only find a statistically significant effect for the Investment technology, which corresponds to innovations related to portfolio management, lending and investing platforms. This is confirmed when considering the importance of patents applications measured by their forwards citations (lower part of figure (a)). Applications related to insurance are also driven by the competition pressure from start-ups in this case.

We also consider the effect of innovation across different types of FinTech patents on the number of patent applications by financial firms as compared to non-financial ones within each patent category. In figure (b) we present the coefficient \(\beta _2\) in Eq. (3), which is re-estimated for each type of FinTech category j. Here we find stronger evidence across other categories of FinTech applications, such as Fraud, Insurance and Investments, with all the coefficients being statistically significant when we weight the interaction term by the citation count.

The bulk of this evidence confirms that competitive threats from FinTech start-ups push financial incumbents to innovate to escape competition. This evidence is stronger across some type of FinTech innovations such as Investment technologies, in particular when we consider the magnitude of the innovation proxied by patent citations. Furthermore, the effect is stronger when we investigate the effect of competition on financial incumbents as compared to non-financial firms.

4.2 Alternative measures of competition

We check the robustness of our main results when employing alternative definitions of our measures of competitive pressures. First, we consider competition not just from non-financial start-ups headquartered in the same country as the financial incumbent, but also those that list as priority country the domicile of the financial incumbent. The priority country is the country where the patent was first filed worldwide before being extended to other countries. If FinTech start-ups register patents in the country where the financial incumbent is headquartered then they are considered to compete with the incumbent. As such, the ratio in Eq. (2) now includes the sum of patent applications that list as priority country the domicile of financial incumbent i. In many cases this corresponds to the applicant’s domestic patent office, and as such, the number of patents counted would be the same as in our baseline measure. However, this extended definition can also capture competition from Start-ups domiciled abroad, but whose FinTech innovation was patented in the financial incumbent’s domicile country.

The results using this alternative measure are presented in Table 9 Panel A and are qualitatively similar to our baseline results across all specifications. The magnitude of the effect is larger as compared to the estimates in Table 7, which is to be expected as the number of FinTech start-up innovators using this alternative definition is likely to be larger.

Table 9 Alternative measures of competition (based on Priority Country). The dependent variable in columns (1)-(3) is an indicator variable equal to one if a financial firm applied for a FinTech patent in year t, while in columns (4)-(6) it is the total number of FinTech patent applications a firm in year t. Financial firms correspond to the 2-digit NACE Rev 2 codes: 64, 65 and 66. Competition Start-ups is the ratio of the sum of FinTech patent applications by non-financial start-ups relative to that of financial incumbents during the previous four years. Competition Start-ups (Citations) is the Competition Start-ups ratio scaled by the number of forward citations of non-financial start-ups. Number Start-ups is the ratio of the total number of start-ups applying for a FinTech patent over the total number of incumbents in the previous four years. Finance is a dummy variable for financial firms. Firm controls include: the sum of patents acquisitions of FinTech innovators over the last 5 years, as well as the log of Total Assets and Operating Revenue. Standard errors clustered at the country level are reported in parentheses

We next consider an alternative proxy for the importance of FinTech innovations. In our baseline specification we count the number of FinTech patent applications as a measure of competition. We then weight these patents by importance using a measure of forward citations count. An alternative measure of importance is to consider only the number of patent applications that are eventually granted. As such, in ratio (2) we only include the applications whose final status is “patented". The results are presented in Panel B of Table 9 and support our main hypothesis although are statistically significant only when we consider the measure weighted by citations count. This suggests that the importance of the applications, as measured by the citation count, matters more than whether the applications are merely granted.

Our results are also robust to alternative definitions of Start-ups. Our baseline model follows Chen et al. (2019) and uses an 8-year threshold to define a non-financial startup. In Appendix Fig. 9 we show that our baseline results in Table 7 are robust to start-ups being defined according to 6-year age cutoff or a 10-year age cutoff.

Finally, our main measure of competition by start-ups is scaled by other financial incumbents’ patent applications in order to mitigate the role of unobservable technology shocks on both financial firms’, as well as non-financial start-ups’ innovations. However, to further mitigate any endogeneity concern, we also implement an instrumental variable model. Specifically, we instrument the innovative efforts by start-ups by the size of the venture capital (VC) investment to GDP in each country at the beginning of the sample. We obtain venture capital data for the sample of OECD countries, so our estimations are based on a subset of the data. Venture capital is an important source of financing for start-ups, in particular in high-tech sectors (Bijlsma and Zwart 2013; Janeway et al. 2021). As such, we expect this measure to be highly correlated with the future innovative effort of start-ups by reflecting the overall ease of obtaining financing in a country. The exclusion restriction condition for our IV strategy is that the VC investment at the beginning of the sample is not correlated with the future unobserved technology shocks in the reduced form model in Eq. (5) below (i.e., Cov (VC/GDP\(_i\), \(\epsilon _{i,t}\))=0). This is a sensible assumption, as the important technological waves in FinTech such as machine learning, blockchain and cloud computing were arguably difficult to anticipate in the early 2000s.

$$\begin{aligned} \text {FinTech Innovation} _{i,t}= & {} \alpha + \delta \text {Startup Patents}_{c, t-4 \rightarrow t-1} + \theta 'X_{i,t} + \gamma _t + \epsilon _{i,t} \end{aligned}$$
(5)
$$\begin{aligned} \text {Startup Patents}_{c, t-4 \rightarrow t-1}= & {} \gamma + \beta \text {VC/GDP}_i+ \varsigma _{i,t}. \end{aligned}$$
(6)
Table 10 Instrumental variable model. The dependent variable in columns (1)-(2) is an indicator variable equal to one if a financial firm applied for a FinTech patent in year t, while in columns (3)-(4) it is the total number of FinTech patent applications a firm in year t. Financial firms correspond to the 2-digit NACE Rev 2 codes: 64, 65 and 66. Startup Patents is the sum of FinTech patent applications by non-financial start-ups during the previous four years. Finance is a dummy variable for financial firms. Firm controls include: the sum of patents acquisitions of FinTech innovators over the last 5 years, as well as the log of Total Assets and Operating Revenue. Standard errors clustered at the country level are reported in parentheses

Table 10 presents both the OLS estimations of Eq. (5) (column (1)), as well as the first and second stage IV estimations (column (2)). As seen in the bottom of column (2), the measure of VC/GDP is highly correlated with \(\text {Startup Patents}_{c, t-4 \rightarrow t-1}\) and this correlation is significant at the 1% level. Moreover, the high F-statistic suggests the instrument is strong enough for identification. Note also that both OLS and IV estimation in Table 10 do not include firm fixed effects, as our instrument is not time-varying. Furthermore, in columns (3) and (4) we repeat the IV strategy for the second model that looks at the total number of patents granted. Again we find that the effect of competition by start-ups leads to a higher number of patent applications by financial incumbents as compared to non-financial firms in both the OLS and IV estimations.

Table 11 FinTech patent application and competition from tech incumbents. The dependent variable in columns (1)-(3) is an indicator variable equal to one if a financial firm applied for a FinTech patent in year t, while in columns (4)-(6) it is the total number of FinTech patent applications a firm in year t. Financial firms correspond to the 2-digit NACE Rev 2 codes: 64, 65 and 66. Competition Tech is the ratio of the sum of FinTech patent applications by Technology firms relative to that of financial incumbents during the previous four years. Technology firms correspond to the 2-digit NACE Rev 2 codes: 26, 47, 58, 61, 62, are 63, respectively. Competition Tech (Citations) is the Competition Tech ratio scaled by the number of forward citations of Technology firms. Number Tech is the ratio of the total number of technology firms applying for a FinTech patent over the total number of incumbents in the previous four years. Finance is a dummy variable for financial firms. Past Patents is the sum of FinTech patent applications by firm i over \([t-4, t-1]\). FinTech Acquisitions is the sum of acquisitions, mergers, follow-on funding, MBO, or joint ventures by financial incubator i of a FinTech innovator over the period \([t-4,t-1]\). Standard errors clustered at the country level are reported in parentheses.

4.3 Competition from technology incumbents

The descriptive statistics in Table 4 showed that a large fraction of FinTech patent applications are owned by non-financial firms. Technology firms corresponding to the 2-digit NACE Rev 2 codes: 26, 47, 49, 58, 61, 62 and 63, represent a large fraction of that. 21% of the FinTech patent application come from technology firms that are not start-ups (older than eight years at the moment of application). In this subsection we check whether the effect of competition on financial incumbents’ innovation is present when competition is coming from these incumbent technology firms and not start-ups.

As such, we re-construct the measures of competition, but considering incumbent technology companies in the numerator of the ratio in Eq. (2).

The results are presented in Table 11. Overall, we find little evidence that competition from technology incumbents spurs innovation in the financial sector. The results are overall less precise and not always positive. We only find a statistically significant effect when we consider the interaction of Competition Tech (Citations) \(\times \) Finance in column (5). In the Appendix we estimate the alternative specifications using this definition of competition from technology incumbents. Specifically, Appendix Fig. 8 considers alternative time frames, Appendix Fig. 10 looks at the type of FinTech technology, while Appendix Table 13 measures competition based on priority country. Overall, these results show weak evidence that the competitive pressures from mature technology firms are important in driving innovation in the financial industry.

5 Conclusion

Technological innovations have long shaped the financial sector and have had large effects on long-term profitability (Fuentelsaz et al. 2009; Haynes and Thompson 2000; Scott et al. 2017). Yet, the rise of FinTech innovators over the past few decades has the potential to represent the largest disruption in the financial sector so far (Gomber et al. 2017).

In this study, we provide the first test of how financial incumbents responded to the competition from non-financial FinTech start-ups. To this end, we classify a large sample of patents into five FinTech categories using machine learning classifiers. We then exploit the fluctuations in the ratio of FinTech patent applications by non-financial start-ups versus incumbents to capture competitive pressures coming from outside the financial sector.

We show that financial incumbents are more likely to innovate when they face greater competitive pressure from non-financial start-ups. This competitive pressure is even greater if the patent applications turn out to be radical innovations, as captured by their forward citation count.

These results provide support for an escape competition effect, whereby in markets with a low level of competition, the threat of entry pushes incumbents to innovate. They also hold several policy implications. In particular, given the documented inefficiencies in the financial sector (Philippon 2016), our results suggest that an increase in competition from non-financial start-ups can result in important technological innovations in the financial sector. This is important because theoretical models such as Laeven et al. (2015) show that, as economies become more technologically advanced, the well-documented link between financial development and economic growth (Peia and Roszbach 2015; Valickova et al. 2015) disappears unless financiers themselves innovate and become more efficient. Our results suggest that policies that encourage entry from technologically advanced FinTech start-ups can lead to such efficiency gains in the financial sector. As such, policies that encourage the funding of FinTech start-ups, though, for example, venture capital, can lead to technological innovations in the financial sector as well.