Introduction

Inspired by Rothschild and Stiglitz (1976), researchers have long studied the equilibrium of the insurance market under adverse selection. The traditional theoretical literature shows that, in an insurance market under adverse selection, there could exist a separating equilibrium where insurance coverage and risk occurrence are positively correlated with each other. However, recent empirical findings are more debatable. While some of the studies confirm a positive correlation between insurance coverage and risk occurrence in health insurance (Cardon and Hendel, 2001) and the annuities market (Finkelstein and Poterba, 2004), a striking number of them cannot find strong evidence for such a positive correlation, and even find a negative correlation in other insurance markets such as life insurance (Cawley and Philipson, 1999), long-term care insurance (Finkelstein and McGarry, 2006), reverse mortgage insurance (Davidoff and Welke, 2007), dental care insurance (Gronqvist, 2004), and medigap insurance (Fang et al., 2008).

De Meza and Webb (2001) provide an intriguing theory to explain why the previous literature cannot find evidence to support the existence of adverse selection. They show that adverse selection might be replaced by “advantageous selection” if both the individuals’ risk preference and precautionary actions are unobservable to the insurer. They assume that according to the risk preference there are two types of individuals: the timid and the bold. Both types of individuals can invest in self-protection to reduce the loss probability. On the other hand, insurance companies have no information regarding the risk preferences of individuals. However, they try to provide a product menu for individuals to self-select insurance contracts. They show that, under certain conditions, the market could settle on a separating equilibrium where timid individuals purchase market insurance as well as make an effort to reduce the loss probability, whereas bold individuals neither purchase market insurance nor make an effort to reduce the loss probability. Thus, their model predicts that the individuals who purchase insurance suffer a lower accident probability than those who do not. In other words, the purchase of market insurance or the choices of coverage levels are not necessarily a bad signal for the risk level. This finding induces an important empirical issue, which is to verify whether purchasing insurance is a bad or a good signal in insurance markets.

This paper provides empirical evidence supporting advantageous selection. It contributes to the literature in two ways. First, we examine advantageous selection from another angle, by directly testing the relationships between the purchase of market insurance, the investment in self-protection, and the risk probability. Second, the market we focus on is that of commercial fire insurance, which is an important line in property and casualty insurance. Moreover, most empirical papers that study asymmetric information use data based on personal insurance. We thus extend the literature to encompass commercial insurance.

In empirical studies, a negative correlation between the coverage provided by the insurance and the occurrence of the risk might not be enough to support the existence of advantageous selection. As shown by Chiappori et al. (2006), a negative correlation between the coverage provided by the insurance and the occurrence of the risk can be explained jointly by market power and privately known risk aversions. Thus it is very important to provide other and more direct evidence besides the negative correlation between the coverage of the insurance and the occurrence of the risk in order to support the existence of advantageous selection.

The theory of De Meza and Webb (2001) is able to generate at least two other empirical hypotheses: a positive relationship between the investment in self-protection and the purchase of insurance, and a negative relationship between the investment in self-protection and the occurrence of risk. De Meza and Webb (2001) indicate that individuals purchasing market insurance are more willing to make an effort. This hypothesis certainly deserves a rigorous examination, since as shown by Ehrlich and Becker (1972) market insurance and self-protection could be either substitutes or complements.Footnote 1 They also argue that individuals making an effort to reduce their loss probabilities indeed have less chance of suffering a loss. This hypothesis is not trivial. For example, there are two types of individuals: one with a loss probability of 0.01, and the other with a loss probability of 0.02. It is assumed that only high-risk individuals make an effort to reduce their loss probability and reduce it from 0.02 to 0.015. In this case, we may find that individuals making an effort to reduce their loss probabilities still have a greater chance of suffering a loss. Thus, instead of focusing on the test of the negative correlation between the coverage of the insurance and the occurrence of the risk as argued in the literature, we investigate the relationship between the insurance coverage and the risk occurrence as well as the other two hypotheses together.

There are three reasons why we do not measure the degree of risk aversion as other papers do in support of advantageous selection. First, Jullien et al. (1999) have shown that an agent with a higher degree of risk aversion will not necessarily invest more in self-protection. Second, the empirical evidence for the correlation between the degree of risk aversion and the risk type is indeed mixed. Finkelstein and McGarry (2006) find that individuals with a higher degree of risk aversion are more likely to purchase long-term care insurance and less likely to enter a nursing home. On contrary, Cohen and Einav (2007) find a positive correlation between the individual’s degree of risk aversion and loss probability in automobile insurance. Third, not only hidden risk preferences as indicated by De Meza and Webb (2001) but also hidden risk perceptions (Koufopoulos, 2005; Huang et al., 2007), hidden regret (Huang et al., 2008), and hidden patience (Sonnenholzner and Wambach, 2008) are capable of generating advantageous selection. The above papers assume that the ex ante risk probabilities for all individuals are the same but the individuals’ own characteristic (hidden information) such as risk preference, risk perception, degree of regret, or degree of patience could affect their investment decision regarding self-protection (hidden action), and further change their risk probabilities. They predict that the market could settle on a separating equilibrium where individuals with certain characteristics would expend a great amount of effort reducing their loss probabilities and purchasing more insurance, whereas other types of individuals do not expend any effort on loss prevention and purchase less insurance. Since there are many rationales for advantageous selection and they all give rise to the same predictions for all three hypotheses, focusing on the relationships that exist among loss probability, self-protection, and market insurance rather than examining the individual’s degree of risk aversion could provide direct evidence for advantageous selection.

The hypotheses developed in this paper are concerned with three endogenous decisions: the purchase of market insurance, efforts directed toward self-protection activities, and the occurrence of the accident. Since we focus on testing the advantageous selection caused by hidden information, we investigate the significance of the correlation between those three endogenous variables after controlling for observable information. In this paper, we use two methods to examine the correlation between the three endogenous variables. The first involves a two-stage method, similar to the method derived by Dionne et al. (2001). The second one is developed by Gourieroux et al. (1987).Footnote 2

Both tests suggest that firms purchasing market insurance have a greater tendency to make an effort to engage in self-protection. We also find that firms making such efforts are less likely to suffer a fire accident. Furthermore, we find that firms with commercial fire insurance have less chance of suffering a fire accident than those without such insurance. All the above three findings jointly support the view that advantageous selection could play a critical role in the commercial fire insurance market.

The remainder of this paper is organized as follows. The next section describes the data collection process and the design of the questionnaire. The subsequent section provides the summary statistics. The empirical methodology is introduced in the fourth section. The penultimate section reports the empirical results. The last section concludes the paper.

Data collection and questionnaire design

To achieve our goal, we hand collected the data since using the data sets from the insurance companies was not sufficient. As suggested by Chiappori and Salanie (1997), the records kept by insurance companies will of course provide excellent data for conducting empirical research on asymmetric information. However, there are two drawbacks to the data obtained from the insurance companies. First, insurance companies do not have information on the non-insured. Thus, it might not be appropriate to examine the behavioral differences between the insured and non-insured by only using data obtained from insurance companies.Footnote 3 The previous literature bypasses this problem by focusing on whether individuals with higher insurance coverage have a higher chance of suffering a loss than those with lower insurance coverage, since insurance companies have information on both the insured with high coverage and the insured with low coverage. Second, and in particular, insurance companies may not have information on individuals’ self-protection activities. Without data on individuals’ self-protection activities, it may be even more difficult to directly investigate our hypotheses. We therefore overcome these two problems by using survey data.

Since the heterogeneity of the data could give rise to estimation bias as mentioned by Chiappori and Salanie (2000), we restrict our sample in terms of the business locations and business types. We focus on class-A businesses, which are classified according to business type in the standard for the installation of fire safety equipment contained in the Fire Act in Taiwan based on use and occupancy.Footnote 4 The government in Taiwan classifies firms as class-A due to the belief that this type of firm has a higher probability of loss than those in other places. We further sample the firms from the two largest cities in Taiwan, Taipei and Kaohsiung, because the fire risk factors of rural or urban businesses could be quite different. For example, the response times of the fire department, materials/codes and height of building, number of neighbors, etc., could be diversified between rural and urban areas, which also cause the fire risk to be diversified between rural and urban areas.

In the questionnaires, we collected information regarding (1) market insurance, (2) records of fire accidents,Footnote 5 (3) self-protection activities, and (4) other relevant variables on which insurance companies usually collect data to underwrite commercial fire insurance, such as the locations and building characteristics of the firms, and the basic fire safety equipment installed.

Based on the purchases of market insurance in the year 2004, we separated the insured firms from the non-insured firms. To test for advantageous selection, we analyzed the differences in terms of the self-protection behavior and accident records between firms that buy the basic fire insuranceFootnote 6 and those that do not buy such coverage.

We obtained information as to whether a firm had faced a fire accident in the year 2004. The fire accident records are more reliable than claim records when it comes to estimating the probability of a fire accident.Footnote 7 The claim records kept by the insurance companies underestimate the occurrence of accidents, because the insured may not report accidents involving small amounts of claims due to either the provisions of deductibles or the experience-rating system. Thus, the accident records used in this paper provide more reliable data for estimating the probability of the risk.

Most importantly, we collected information on the self-protection activities of firms, which are made throughout the year 2004. There are two criteria for a variable used to represent a self-protection activity. The literature has defined self-protection as a non-market risk reduction activity to reduce the probability of a loss. Thus, the first criterion for a proxy of self-protection is that the activity could reduce the likelihood that a fire will occur. The second criterion is that this activity to reduce risk is unobservable to insurers. In the models of De Meza and Webb (2001), Koufopoulos (2005), and Huang et al. (2007), insurers are assumed to be unable to observe precautions directly, but they could estimate the probability of a loss when a hidden action is taken. This criterion rules out certain activities, such as the installation of fire safety equipment.

The first proxy for self-protection is the qualified operation of fire safety equipment. Insurance companies not only can observe the basic safety equipment, but they also take such equipment into account during the process of underwriting the commercial fire insurance and enable the insured to benefit from the point of view of the premium paid. However, the insurance companies cannot be certain how such equipment is maintained from that time on, since they do not follow up on whether such equipment could function well during the policy period. The fire departments are responsible for and implement fire safety inspections every half year, being able to understand how all of the fire safety equipment functions. In our survey data, with the help of certain fire departments, we collected the information regarding the qualified operation of the fire safety equipment.Footnote 8 The qualified operation of fire safety equipment is identified by the fire departments through their inspections, and gives us a chance of obtaining a good proxy for self-protection activities.

Another suitable proxy which satisfies the above two criteria is related to the operations of the self-defense fire organizations. A self-defense fire organization is an informal organization within a firm. Every member of this organization has competent knowledge regarding fire control and hazard reduction. To establish knowledge regarding fire control, all of the members need to attend relevant courses every half year. The tasks in the organization include preventing the occurrence of a fire, checking whether all of the fire safety equipments are operable, clearing the obstacles in an emergency entrance, reporting the occurrence of a fire to a fire station and to neighbors, guiding customers or employees to emergency exits, and handling fire safety equipment, etc. All of the above tasks can effectively reduce the possibility of a fire. The operation of such self-defense fire safety organizations is reviewed by the fire departments only every half year. The insurance companies in Taiwan do not collect this information while underwriting.

It should be recognized that we also collected data on all the relevant variables that insurance companies use in pricing commercial fire insurance.Footnote 9 There are two effects of these exogenous variables. First, they can be used to control the insurer’s known information. Advantageous selection is caused by a firm’s hidden risk preference or a firm’s hidden risk perception, which the insurance companies do not know. Controlling the information that insurance companies do know is essential for a proper test of advantageous selection. Second, data are obtained from a heterogeneous sample. As noticed by Chiappori and Salanie (1997), the estimator used to test for asymmetric information would be biased if a test did not appropriately control for the heterogeneity of the sample.

Basic statistics

With the help of the fire departments of the cities of Taipei and Kaohsiung, we sent out 6,249 questionnaires, and received 3,361 of them back. After eliminating 523 questionnaires with missing variables, 2,838 effective questionnaires were left.

To test our hypothesis of advantageous selection, we further excluded observations from the sample if firms had purchased commercial fire insurance due to the requirements of a loan, since in that case the theories of adverse selection and advantageous selection cannot be applied. After excluding those firms, we were left with a sample of 2,592 questionnaires for our empirical tests.

Tables 1 and 2 define variables and give descriptive statistics. The sample is well distributed in terms of business types as shown in Table 2. Within class-A places, Taiwan’s Fire Act further classifies firms into seven sub-classes. Table 2 shows that the final sample does not concentrate on any specific business. For the seven types of business, the percentages are mostly near 10 percent, and the share of the highest one is 30 percent, which includes a dining room, catering shop, coffee bar, and tearoom. Almost 87 percent of the sample is located in Taipei city. Within Taipei city, 15 percent of the sample is in the Ta-an district (with the postal area code 106), which is a shopping and commercial district in Taipei.

Table 1 Definitions of variables
Table 2 Summary statistics for the final sample

From Table 2, we can observe some basic characteristics of our final sample. About 40 percent of the firms purchased basic commercial fire insurance, while about 5 percent of them had experienced a fire accident. As for self-protection, 74.6 percent of firms had all four kinds of fire safety equipment functioning properly in the most recent half year, and 75.3 percent of the firms in our sample had operated a self-defense fire organization. Almost 80 percent of the firms were located in buildings with less than 10 floors, and over 80 percent of them had basic fire extinguishing equipment, fire alarm equipment, and an area of refuge as well as escape equipment. The percentage of firms with the basic equipment needed for a rescue in the event of a fire was a little bit lower, but still reached 74.01 percent.

Empirical methodology

To test the three hypotheses generated from the theories of advantageous selection, which include the existence of a positive relationship between market insurance and self-protection, a negative relationship between self-protection and the probability of a fire accident, and a negative relationship between market insurance and the risk type, two different methods are employed in this paper.

The first empirical model involves a two-stage method, similar to that of Dionne et al. (2001). There are three interactive endogenous variables: the purchase of the insurance, the occurrence of the fire accident, and the self-protection effort. In this model, we simultaneously estimate the probability of each endogenous variable that is conditional upon the other two endogenous variables as well as other exogenous variables. Since the non-linearity among the endogenous variables is both important and crucial to the results as suggested by Dionne et al. (2001), we employ the two-stage regression presented below.

More precisely, in the first stage, we run three probit regressions as follows:

and

where i=1,…, n denotes the firm, Prob(•) denotes a probability function, β1j, j=ins, accid, sp, is the corresponding parameter vector, and Φ is the cumulative distribution function of N(0, 1) In addition, X1i is an exogenous variable vector for firm i which is defined in note 1 of Table 1, and the variables in X1i are all dummy variables. The three endogenous dummy variables are as follows: (1) ins i =1 if firm i is insured under the basic fire insurance; ins i =0 if firm i is uninsured; (2) accid i =1 if firm i has suffered a fire accident; accid i =0 otherwise; and (3) sp i =1Footnote 10 if firm i engages in self-protection; sp i =0 otherwise. We record the estimated probabilities of the three endogenous variables, Prôb(ins i =1∣X1i), Prôb(accid i =1∣X1i) and Prôb(sp i =1∣X1i), from probit regressions (1) to (3).

Then, in the second stage, we run another three probit regressions for each of the three endogenous variables:

β2j, j=ins, accid, sp, is the parameter vector of the exogenous vector X2i, which differs from X1i and is listed in note 2 of Table 1. bj,k and bj,ek are the coefficients of the variables k and Prôb(k i =1∣X1i) on the dependent variable j, respectively, where j, k=ins, accid, sp, jk. We test the dependency among the three endogenous variables through the significance of the coefficients bins,accid and bins,sp in regression (4), baccid,ins and baccid,sp in regression (5), and bsp,accid and bsp,ins in regression (6). According to our three hypotheses, in regression (4), the coefficient bins,accid should be significantly negative, while the coefficient bins,sp should be significantly positive. In regression (5), the coefficient baccid,ins should be significantly negative, while the coefficient baccid,sp should be significantly negative. In regression (6), the coefficient bsp,accid should be significantly negative, while the coefficient bsp,ins should be significantly positive.

The second empirical method follows a reduced form approach. This method is derived by Gourieroux et al. (1987) and proposed by Chiappori and Salanie (2000). We test the conditional dependency between the three endogenous variables through the significance of the residual correlation between each pair of them. Specifically, we establish three probit regressions:

and

where Prob(•), Φ, ins i , accid i , and sp i are as defined in the previous model, X3i is the vector of control variables which is described in note 3 of Table 1, and β3j, j=ins, accid, sp, is the corresponding parameter vector.

Let ɛ i j be the independent centered normal errors with unit variance for the endogenous variable j in the above regressions. The estimated residuals ɛ i j from Eqs. (7), (8) and (9) are defined as

where j=ins, accid, sp, and φ represents the density distribution function of N(0, 1).

To test the conditional dependence, a statistic W jk , derived by Gourieroux et al. (1987), is obtained as

Under the null hypothesis of cov(ɛ i j, ɛ i k)=0, the W jk statistic is distributed asymptotically as χ2(1). If W jk is significant, we reject the hypothesis that the probability of the occurrence of the endogenous variables j and k is conditionally independent. However, the W jk statistic reveals nothing about the direction of the dependency. In order to clarify the direction of the dependence, we further calculate the conditional correlation coefficient ρ jk for and . Under the predictions made based on the theories of advantageous selection, the W jk for all j and k, jk are hypothesized to be significant and ρins,sp>0, ρins,accid<0, and ρaccid,sp<0.

Empirical results

The empirical results obtained from employing the two-stage method are listed in Tables 3 and 4. In Table 3, the proxy variable for self-protection measures the qualified operation of all four kinds of fire safety equipment (sp_qualified). From Table 3, we find that bins,accid in regression (4) and baccid,ins in regression (5) are both significantly negative. The purchase of the commercial fire insurance and the occurrence of fire accidents are significantly negatively correlated. This finding is consistent with Cawley and Philipson (1999), Dionne et al. (2001), and Finkelstein and McGarry (2006), who also find a negative correlation between insurance coverage and the occurrence of losses.

Table 3 The probit regression on the purchase of market insurance, risk probability, and investment in self-protection – using the qualified operation of all four kinds of fire safety equipment (sp_qualified) as the proxy for investment in self-protection
Table 4 The probit regression on the purchase of market insurance, the risk probability, and the investment in self-protection – using the operation of the self-defense fire organization (sp_org) as the proxy for investment in self-protection

The bins,sp in regression (4) and bsp,ins in regression (6) are both significantly positive, and a positive conditional correlation is found between the purchase of commercial fire insurance and self-protection. The findings support the view that firms that purchase such insurance in the market indeed make more efforts to engage in self-protection in accordance with the theory of advantageous selection.Footnote 11

The baccid,sp in regression (5) and bsp,accid in regression (6) are both significantly negative which means that the conditional correlation between the occurrence of a loss and self-protection is significantly negative, which confirms that a firm that never experiences a fire loss has a greater likelihood of protecting itself, and thus self-protection activities could reduce the likelihood of a fire occurring. Our finding suggests that both the quantity and quality of a firm's self-protection activities are essential in order to reduce the probability of fire accidents.

In Table 4, the proxy variable for self-protection is the operation of a self-defense fire organization (sp_org). The empirical results in Table 4 are generally consistent with those in Table 3. In Table 4, we still find that the purchase of the insurance and the occurrence of the accident are negatively correlated, the purchase of the insurance and the efforts at self-protection are positively correlated, and the occurrence of the accident and the efforts at self-protection are negatively correlated.

Table 5 reports the pair-wise residual correlations of the three endogenous variables obtained using the second method. In Panel A of Table 5, the proxy for self-protection is the qualified operation of the fire safety equipment (sp_qualified). In Panel B of Table 5, the proxy for self-protection is the self-defense fire organization (sp_org). Note that, in Table 5, the W statistic and the conditional correlation coefficient of market insurance and the occurrence of a fire accident in both panels are the same because the use of the proxy for self-protection does not change either the endogenous or the exogenous variables of these two probit regressions. Thus, no matter what the proxy of self-protection is, Eqs. (7) and (8) remain the same.

Table 5 Conditional correlations between endogenous variables – based on the residual correlation test method

From Table 5, regardless of whether we use the qualified operation of the fire safety equipment or the self-defense fire organization as a proxy variable for self-protection, we find that the results sustain the three hypotheses we put forward. In both panels, we find that the purchase of the commercial fire insurance and the occurrence of the fire accidents are significantly negatively correlated. The conditional correlation between self-protection and loss occurrence is also significantly negative.Footnote 12 The correlation coefficients between market insurance and self-protection are, however, significantly positive.

The evidence from our tests as a whole shows that advantageous selection exists in the market.Footnote 13 Firms purchasing market insurance also engage in more self-protection activities. Self-protection activities indeed reduce the chances of fire accidents. Thus, we also observe that the occurrence of fire accidents and the purchase of fire insurance counter the predictions of well-known adverse selection theories.

Conclusions and discussion

With the help of the respective fire departments of the cities of Taipei and Kaohsiung, we have hand-collected firm data which include information on market insurance and fire accidents as well as on self-protection activities. By employing these unique data, we find that: (1) firms that purchase market insurance have a greater tendency to make an effort to engage in self-protection; (2) firms that make an effort to engage in self-protection are less likely to suffer a fire accident; and (3) firms with commercial fire insurance have a lower chance of suffering a fire accident than those without such insurance. The above three findings jointly support the view that advantageous selection exists in the commercial fire insurance market in Taiwan.

Our paper contributes to the literature by providing more direct evidence to support the existence of the advantageous selection theory derived by De Meza and Webb (2001), Koufopoulos (2005), and Huang et al. (2007). Our findings are consistent with those of recent empirical studies that confirm the existence of advantageous selection in different lines of insurance.

For future studies examining advantageous selection in commercial insurance, several issues deserve more attention and could provide fruitful results. First, a multiple year data set will help to decrease the noise while estimating loss probabilities and strengthening the robustness of the empirical findings. However, when a dynamic data set is employed, the learning effect should be taken into consideration. Firms might learn about their true risk types from accidents and further change the investment in self-protection or the purchase of insurance in response. Second, the amount of variation in insurance coverage should be documented to completely control for insurance. This paper models the insurance coverage as a binary choice. We focus on whether the firms purchase fire insurance or not. Collecting information on the amount of insurance coverage could help to document whether the amount of insurance coverage varies importantly across firm or risk types. Third, firms are sophisticated in their risk management and their risk management behavior, such as issuing cat bonds, could affect the risk types. Most of our sample consists of small businesses. Small businesses may not be able to engage in such risk management. However, if researchers are interested in large businesses, it is important to collect the information regarding each firm’s risk management behavior and to further identify whether this information is hidden to insurance companies.