1 Introduction

When firms deviate from competitive behavior and instigate a cartel, they secretly conspire to raise prices or lower the quality of goods or services. Such conspiracies directly harm taxpayers, buyers, or sellers. Cartel formation remains a pervasive problem and has been considered in a range of studies: see, for instance, the Swedish asphalt cartel described in Bergman et al. (2020), collusion among seafood processors in the US (Abrantes-Metz et al. 2006), bid rigging in public procurement auctions for construction works in Japan (Ishii 2014), in Poland (Foremny et al. 2018), in Canada (Clark et al. 2018) and in the US (Porter and Zona 1993; Feinstein et al. 1985), and bid rigging for school milk contracts in Ohio (Porter and Zona 1999), Florida and Texas (Pesendorfer 2000). To enhance the fight against cartels, the OECD recommends that competition agencies promote proactive methods for uncovering conspiracies, as such methods may help to discover cartels where leniency is unlikely to be sought (OECD 2013). Answering the need for statistical tools in this context, Porter and Zona (1993), Bajari and Ye (2003), Harrington (2008), Jimenez and Perdiguero (2012), Imhof et al. (2018), Crede (2019) and Bergman et al. (2020), among others, have proposed different methods for uncovering cartels.

However, the detection of cartels might be more challenging when competitive bidders participate in markets in which a cartel is active (McAfee and McMillan 1992; Hendricks et al. 2008; Asker 2010; Bos and Harrington 2010; Conley and Decarolis 2016; Decarolis et al. 2020). When a cartel is incomplete due to competitive bidders, the statistical pattern that bid rigging produces in the distribution of bids is weakened, making the cartel more difficult to detect. Moreover, a cartel might temporarily collapse because of deserters, i.e., it is not always stable. This instability might affect the screens, rendering the statistical signals of bid rigging more challenging to detect. Finally, a cartel aware of detection methods might try to weaken the statistical pattern due to bid rigging in order to decrease the ability of such methods to predict the cartel's presence.

Thus, this paper offers an original application of a detection method based on screens to detect both incomplete and complete bid-rigging cartels. Screens are statistics derived from the distribution of bids in a tender that capture the distributional changes produced by bid rigging (see Abrantes-Metz et al. 2006; Hueschelrath and Veith 2014; Abrantes-Metz et al. 2012; Jimenez and Perdiguero 2012; Imhof et al. 2018; Imhof 2019). Our novel approach consists of calculating screens not only for all bids in a tender but also for all possible subgroups of three or four bids. We then use the screens calculated for all the subgroups in a particular tender to compute descriptive statistics of each screen, which synthesize the properties of the distribution of bids in that tender. These descriptive statistics of screens, henceforth called ’summary screens’, circumvent the distortion that competitive bidders or deserters generate in the statistical signals produced by bid rigging, rendering our suggested method robust to the presence of competitive bidders.

In our study, we combine the summary screens with machine learning as in a prediction policy problem (see Kleinberg et al. 2015). Machine learning, which has been applied in a rapidly increasing number of studies (Rabuzin and Modrusan 2019; Imhof and Wallimann 2021; García Rodríguez et al. 2020; Rodríguez et al. 2022; Silveira et al. 2022; Huber et al. 2022), aims at finding the combination of covariates that best predicts the presence or absence of bid rigging in a tender. Also related to our paper is the recent study of Uslu et al. (2021), applying machine learning to investigate trade-based manipulations of capital market instruments. Moreover, our paper is related to studies analyzing bidding strategies (see, e.g., Liu et al. 2020; Cai et al. 2019) and applying predictive models (see, e.g., Mir et al. 2020; Mirzapour et al. 2019) in other research fields. As we focus on predictive performance, we do not have to construct explicit structural models of collusion. To train and evaluate models, we focus on the random forest (see Breiman 2001) as machine learner because it provides a flexible prediction method that does not impose any parametric (e.g., linearity) assumptions when considering our large set of screens. In contrast to many other machine learners, random forests do not require tuning specific penalty terms (see the discussion in Athey and Imbens 2019) and are therefore easier to implement. This appears desirable if a competition agency applies our detection method for screening procurement markets.

Calculating screens for subgroups as in our approach is also considered in Conley and Decarolis (2016) and Chassang et al. (2022). First, Conley and Decarolis (2016) investigate subgroups to detect cartels in collusive auctions in Italy, but in contrast to our method (which considers all possible subgroups in a tender), they exploit firm-specific covariates (such as, e.g., common owner, municipality, or country) to form subgroups. Relying on firm-specific covariates could impede a broad screening activity if firm-specific data are unavailable or if the time needed to collect them without attracting the attention of potential cartel participants is lacking. Chassang et al. (2022) show that winning bids tend to be isolated in terms of value when bidders collude. To analyze the missing density of bids between the first and the second-lowest bids, they calculate a normalized margin: first, bids are normalized by the reserve price (which would be impossible in our data); second, they calculate for each normalized bid i the difference to the minimal normalized bid (other than i) in each tender. A missing density around zero for the normalized margin is incompatible with competition, especially if repeatedly observed: a competitive bidder rationally maximizing profits would be tempted to increase her or his bids upon regularly observing that other bidders submit substantially higher bids.

Two important arguments favor our approach based on machine learning and synthesized screens. First, it exclusively relies on information about bids rather than the firm-specific characteristics or cost-related variables required for econometric tests (see for instance Bajari and Ye 2003; Aryal and Gabrielli 2013). Our suggested method requires only bid summaries, which are either public or readily accessible for competition agencies and thus not as costly to acquire as firm- or cost-specific information. The necessity to gather firm-level information can, in some cases, attract the attention of the cartel, decreasing the chance of success in acting against it. Second, machine learning relies on the hypothesis that bid rigging affects the distribution of bids in a tender (an assumption also common to other methods for flagging bid-rigging cartels, such as the econometric tests suggested by Bajari and Ye (2003)) but remains agnostic about how the distribution is affected. In our case, it is sufficient to assume that bid rigging modifies the distribution of bids and that screens can capture these changes.

Our study investigates the correct classification rates of different methods in the context of incomplete cartels. We first apply a benchmark method, suggested by Imhof et al. (2018), which implements two screens with benchmarks, i.e., a rule of thumb, for classifying a tender as collusive or competitive. The second method applies machine learning using a set of screens, calculated based on all bids in a tender, so-called ’tender-based screens’, to predict collusion. Finally, the third method is the novel approach suggested in this paper, which includes summary statistics of the screens (median, mean, maximum and minimum) calculated for all possible subgroups of bids in a tender as predictors in the random forest.

We use data from Switzerland, where the incidence of collusive and competitive tenders is known. We apply our approach to two investigations of the Swiss competition commission (hereafter COMCO): See-Gaster and Strassenbau Graubünden. Both cases were characterized by well-organized bid-rigging cartels, which sometimes faced competition from outsiders. On the one hand, these competitive bidders might have tried to benefit from the umbrella effect of the cartel by bidding higher than they would have done in a competitive situation (Bos and Harrington 2010). On the other hand, too many competitive bidders could have destabilized the formation of cartels.

We find that the benchmarking approach exhibits low correct classification rates for incomplete cartels. Using tender-based screens in predictive models, we obtain correct classification rates ranging from 61 to 77% when competitive bidders are present. Applying our novel approach based on summary screens increases the performance to correct classification rates ranging between 67 and 84%. Further, we note that the performance of machine learning decreases with the proportion of competitive bids. This result confirms the finding from the investigations that cartel participants partially endogenize the presence of competitive bidders by adopting a more competitive behavior, at least in some cases.

The remainder of this study is organized as follows. Section 2 presents the bid-rigging cartels uncovered in Switzerland from which our data are drawn. Section 3 outlines the detection methods for flagging both complete and incomplete bid-rigging cartels. Section 4 presents our original application to incomplete cartels based on empirical data from the cases of See-Gaster and Strassenbau Graubünden. Section 5 concludes.

2 Bid-Rigging Cartels and Data

The Swiss Parliament revised the federal Cartel Act and introduced a sanction regime in April 2004, with an adaptation period of one year, alongside a compliance program. This legislative modification helped initiate a change in practice towards economically harmful bid-rigging cartels. At the end of 2004, COMCO began investigating the Ticino cartel, releasing its decision in 2007. The Ticino cartel dissolved without sanctions since it had ended its illegal conduct just before April 2005, making full use of the adaptation period. However, the case underscored the damage caused by a bid-rigging cartel, with a price increase of over 30% (see Imhof 2019). In 2008, COMCO decided to prioritize fighting bid rigging.

Following its decision in the Ticino case, the authority prosecuted many bid-rigging cases. Initially, COMCO rendered an important decision against bid rigging every other year. From 2015 onwards, however, COMCO rendered more decisions, emphasizing its determination to prosecute bid-rigging conspiracies. Table 1 lists COMCO’s most important decisions in bid-rigging cases and the sanctions it imposed in each case.

Table 1 Decisions of COMCO in bid-rigging cases

Overall, COMCO opens an investigation if there are reasonable grounds to assume the existence of a bid-rigging cartel. Compliance programs, whistleblowers, and procurement agencies can provide insightful information leading to the opening of an investigation. However, COMCO decided to reduce its dependence on such sources and started to develop statistical methods for detecting bid rigging based on screens (see also Imhof et al. 2018). Based on the latter method, COMCO opened an investigation of bid rigging in the region of See-Gaster in 2013.

Considering the evolution of the cases investigated by COMCO in recent years, incomplete bid-rigging cartels occur more often than well-organized complete cartels. Therefore, if COMCO wants to reduce its dependence on external sources for opening investigations, it must continue to improve its detection methods. The approach for flagging both incomplete and complete bid-rigging cartels proposed in this paper responds to that need and is likely to be of interest to competition agencies around the world.Footnote 1

In the empirical analyses, we use data from two of Switzerland’s most important cases: the See-Gaster cartel and the Graubünden asphalt cartel. After discussing procurement in Switzerland, we summarize the main aspects of the Swiss procurement data in both cases.

2.1 Procurement Data

Procurement agencies of cantons and cities in Switzerland follow the Agreement on Public Markets, which states that a procurement agency can choose among four procedures: the open, the invitation, the selection, and the discretionary procedure.Footnote 2 In the construction sector, a procurement agency generally uses either the open procedure or the procedure on invitation. The open procedure does not restrict the participation of submitting firms, in contrast to the procedure on invitation, where the procurement agency invites only a small number of firms, in general three to five, to submit a bid. This changes the nature of the competition, as the participating firms are aware of the restricted number of potential competitors.

A procurement agency announces future contracts and the deadline for submitting bids (varying according to the procedure) in an official journal. If a firm is interested in submitting, the procurement agency provides the firm with all the relevant documents or information for the contract. Firms prepare their bids for submission between the time of the announcement and the deadline. Collusive agreements, if any, between firms are typically concluded during this period.

At a pre-announced date, the procurement agency gathers the incoming bids for the contract and opens them. It officially records all the bids received on time in a bid summary or so-called official record of the bid opening and registers the firms’ names, addresses, and bids. Having registered the official record of the bid opening, the procurement agency proceeds with a detailed examination of the bids. In awarding the contract, the agency considers not only the price of the bids, but also other criteria such as quality, references and environmental or social aspects. However, as contracts are relatively homogeneous in the construction sector, especially in road construction and associated civil engineering, the price in practice remains the most important criterion for awarding the contract. Furthermore, the differences in firms’ criteria other than price are typically small. We, therefore, consider the procurement process as an almost first-price sealed-bid auction.

2.2 The Cartel in See-Gaster

COMCO opened its investigation in the region of See-Gaster mainly because of a statistical analysis based on procurement data from 2004 to 2010 provided by the canton of St. Gallen (see Imhof et al. 2018).Footnote 3 In total, eight firms participated in bid-rigging conspiracies in the region of See-Gaster, including the district of See-Gaster in the canton of St. Gallen and the districts of March and Höfe in the canton of Schwyz.Footnote 4 Cartel participants regularly met once or twice a month. In their meetings, they discussed future contracts being put out to tender and exchanged their interest in them. The contracts included road construction, asphalting and civil engineering. Before each meeting, one cartel participant sent an updated table to all the others, listing all future contracts in the region of See-Gaster. Each cartel participant had a column in which to mark a contract with one star if it was interested in obtaining the contract, or with two stars if it wished to register a very high interest.Footnote 5 When the tender procedure for a contract started, the cartel typically designated the cartel participant that should win it. The allocation mechanism was based on the interests that had been announced and on fairness in allocating contracts to participants so as to maintain cartel stability.Footnote 6 In addition, if two cartel participants had both put two stars for a specific contract, they might have formed a consortium to share the contract, while the other participants covered the consortium.Footnote 7

The cartel took decisions on contract allocation during the meetings in which it discussed the list, but it organized separate meetings to discuss the price of the bids.Footnote 8 One reason for separate meetings was that not all cartel participants were interested in fixing the price, since not all of them necessarily participated in the tender. Second, discussions about price might have taken up too much time, such that the cartel preferred the designated winner to invite the other bidders to a separate meeting to discuss the price. COMCO found some evidence that, from time to time, the cartel used the mechanism of the mean in determining the bid to be made by the designated winner,Footnote 9 which implies that the latter had to submit either its own bid or the mean of all the bids exchanged in the separate meetings. Under this mechanism, the designated winner had some incentive to announce a relatively high bid to influence the calculated mean in the separate meeting. All the other cartel participants whose announced bids were below the mean or below the winner’s bid increased their bids to cover the designated winner. As a result, they generally ensured a minimal price difference of 2–3% between the bid of the designated winner and their own bids.Footnote 10

Finally, the cartel also made decisions about contracts that were left free for competitive bidding.Footnote 11 Such decisions were partly determined by the presence of external bidders: as the number of external bidders increased, the chances of the cartel’s success decreased, and thus the incentive to collude declined. This was the case for some high-value contracts, for which more non-cartel firms were interested in bidding. Sometimes, the cartel also tried to bring external firms into the agreement.

In June 2009, the cartel ended its illegal conduct after COMCO launched house searches in the canton of Aargau, which to a certain extent explains the breakdown of the cartel. In its decision, COMCO established that the cartel had discussed more than 400 contracts in the region of See-Gaster from 2004 to 2009 with a value of 198 million CHF. COMCO also proved that the cartel had attempted to rig at least 200 contracts with a value of 67.5 million CHF.Footnote 12 In its decision, COMCO sanctioned the firms involved in bid-rigging conspiracies with more than five million CHF. Two firms applied to the leniency program, and two other firms concluded a settlement to close the case. Four firms appealed against the decision.

2.3 The Strassenbau Cartel in Graubünden

Members of the local trade association organized the cartel for road construction in the canton of Graubünden. In its decision, COMCO proved that the cartel participants met regularly in the period under investigation, from 2004 to the end of May 2010. The meetings, called “allocation meetings” or “calculation meetings”, were mainly held at the beginning of the year, since the canton and the local municipalities put most of their contracts out to tender in the spring of each year.Footnote 13 The cartel discussed contracts for road construction and asphalting tendered by the canton of Graubünden and the local municipalities. Since mountains and valleys profoundly mark the geography of Graubünden, the cartel was divided into firms operating in the north and firms operating in the south.

In the north of Graubünden, the cartel mostly organized its meetings in the office of the most important mixing plant in the canton and, to a lesser extent, in the offices of the cartel participants. The meetings included either all of the twelve to thirteen cartel participantsFootnote 14 or two different subgroups.Footnote 15 In the south, the total of six cartel participantsFootnote 16 also organized such meetings, though changing their locations.

COMCO stated in its press release that the cartel decided upon the allocation of contracts based on a contingent determination for all the cartel participants in the canton of Graubünden.Footnote 17 The cartel allocated contracts according to the interests of each firm and fixed the price of the designated winner following a specific calculation method.Footnote 18 The price of the designated winner was usually above the minimal bid announced in the respective meeting. The calculation method, therefore, contributed to raising the price.

During the period investigated, from 2004 to the end of May 2010, the cartel distributed 70% to 80% of the total value of the cantonal and communal road construction contracts. The cartel rigged approximately 650 road construction contracts with a total market volume of 190 million CHF.Footnote 19 The cartel ceased its illegal conduct in the summer of 2010, since some firms decided to stop, mainly because of increasing concerns regarding the Cartel Act.Footnote 20

2.4 Data from the Cases See-Gaster and Graubünden

We requested data on all bid summaries from the investigations of See-Gaster and Graubünden based on the Federal Act on Freedom of Information in the Administration (Freedom of Information Act, FoIA).Footnote 21 COMCO approved the request and sent us the data, referred to hereafter as the Swiss data. They contain the bids, a running number for each contract, a dummy variable for each of the anonymized cartel participants, and a dummy variable indicating whether the tender took place in the cartel period (taking the value of 1 for a cartel and 0 otherwise). Moreover, they include a categorical variable for the contract type (taking the value of 1 for contracts in road construction and asphalting, 2 for mixed contracts including road construction and civil engineering, and 3 for civil engineering contracts), as well as the anonymized date and year. The first year in our sample begins with a value of 1 and the last year ends with a value of 14. The first anonymized date equals 42, and the last 4,886. To ensure anonymization of the bids, COMCO multiplied them by a factor between 1 and 1.2. This transformation does not affect the calculation of the screens.

Table 2 Overview for the Swiss data

Table 2 provides key information on the Swiss data. In order to calculate the predictors of our empirical analysis, we consider tenders with four bids or more. In total, there are 310 tenders with complete cartels with a total value of more than 110 million CHF and 2,031 bids submitted by the cartel participants. Furthermore, there are 287 tenders with incomplete cartels with a total value of more than 114 million CHF. Cartel participants submitted 1,414 bids in these tenders and external firms 650 bids. Finally, we observe 2,398 competitive tenders with a value of roughly 1,700 million CHF and 13,925 submitted bids. In Appendix D, we present additional descriptive statistics of the Swiss data.

3 Detection Methods

This section outlines our novel approach to detecting bid rigging. We first describe the concept of a random forest, the machine learning algorithm used for training and testing predictive models for collusion (see Ho 1995; Breiman 2001). Second, we present in detail the screens that enter the algorithm as potential predictors. Third, we discuss five different predictive models applied to our data that differ in the included screens. Finally, we provide descriptive statistics for two important screens in each dataset.

3.1 Random Forest

We use the random forest as a machine learning algorithm for predicting collusive and competitive tenders. In our data, the outcome is given a value of 1 for collusive tenders, including both incomplete and complete bid-rigging cartels, and 0 for competitive tenders. Note that we intentionally do not distinguish between incomplete and complete cartels, as we aim to construct a reliable method for detecting any form of bid rigging. Tenders are therefore either collusive or competitive.

Machine learning requires the data to be randomly split into so-called training data, used to develop the predictive model, and test data, used to evaluate the model’s performance. We randomly split the data such that the training and test data consist of 75 and 25% of the observations, respectively. The random forest is a so-called ensemble method that averages over multiple decision trees to predict the outcome. Tree-based methods recursively split the predictor space of the training data (according to the values the screens might take) into a number of non-overlapping regions. Each split aims to maximize the homogeneity of the dependent variable within the newly created regions according to a goodness-of-fit criterion like the Gini coefficient. The latter measures the average gain in purity (or homogeneity) of outcome values when splitting and is popular for binary variables like our collusion dummy. Splitting continues until the decision tree reaches a specific stopping rule, e.g., a minimum number of observations in a region or a maximum number of splits. Tree-based predictions of bid rigging (1) or competition (0) are based on whether collusive or competitive tenders dominate in the region that contains the values of the screens for which the outcome is to be predicted.
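To fix ideas, the following minimal sketch in R (the statistical software used in this study) computes the Gini impurity of a candidate region for our binary collusion outcome; the function name and inputs are illustrative and not part of the original analysis.

```r
# Gini impurity of a region containing n1 collusive and n0 competitive
# tenders; for a binary outcome it equals 2p(1-p), where p is the share
# of collusive tenders. A split is chosen to maximize the impurity reduction.
gini <- function(n1, n0) {
  p <- n1 / (n1 + n0)
  2 * p * (1 - p)
}
gini(8, 2)  # 0.32: a fairly homogeneous region
gini(5, 5)  # 0.50: a maximally impure region
```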

Importantly, there exists a bias-variance trade-off in out-of-(training-)sample prediction when using such tree-based (and other machine learning) methods. More splits reduce the bias and increase the flexibility of the model specification, though at the cost of a greater variance in unseen data (such as the test sample, which is not used for training), because the regions become smaller. The issue of excessive variance can be mitigated by repeatedly drawing many subsamples from the initial training data and estimating the predictive model, i.e., the tree (or splitting) structure, in each of the newly generated samples. For this reason, we apply a random forest algorithm, which predicts the collusion outcome by a majority rule based on the individual trees. This means that the outcome is classified as collusion or competition depending on whether the majority of the trees estimated in the various subsamples predicts collusion or competition, respectively, for particular values of the screens. A further feature of the random forest is that at each splitting step in a specific subsample, only a random subset of the possible predictors (i.e., screens) is considered, reducing the correlation of tree structures across the subsamples and thus further reducing the prediction variance. In our application, we use the randomForest package by Breiman and Cutler (2018) for the statistical software R, growing 1,000 trees, to estimate the predictive models in the training data and assess their performance in the test data based on the correct classification rate.

Note that we repeat the random sample splitting into 75% training and 25% test data and assess the predictive performance in the latter 100 times. Our reported correct classification rate corresponds to the average of the correct classification rates across the 100 repetitions. This procedure is likely to entail a smaller variance in estimating the correct classification rate than relying on a single random data split.
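The following R sketch illustrates this procedure under the assumption of a data frame tenders with a binary factor collusion and one column per screen; all object names are illustrative rather than the original study’s code.

```r
library(randomForest)

set.seed(42)
rates <- replicate(100, {
  # randomly split into 75% training and 25% test data
  train_idx <- sample(nrow(tenders), size = floor(0.75 * nrow(tenders)))
  train <- tenders[train_idx, ]
  test  <- tenders[-train_idx, ]
  # grow 1,000 trees on the training data
  rf <- randomForest(collusion ~ ., data = train, ntree = 1000)
  # correct classification rate in the test data
  mean(predict(rf, newdata = test) == test$collusion)
})
mean(rates)  # average correct classification rate over the 100 repetitions
```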

3.2 Screens

Screens are statistics applied to data in order to flag anomalous outcomes indicating potential anticompetitive issues. The literature on cartel detection usually differentiates structural from behavioral screens (see Harrington 2008; OECD 2013; Froeb et al. 2014). Structural screens focus on the factors facilitating the emergence of collusive agreements and help to identify markets in which collusion is more likely. Among these factors, distinctions are made between market structure, demand-related factors, and supply-related factors (OECD 2013). In contrast, behavioral screens empirically measure the behavior of market participants and assess whether the observed behavior departs significantly from competitive behavior, flagging it as a potential issue worth scrutinizing further. Following Huber and Imhof (2019), we propose using various descriptive statistics as screens and combining them with machine learning, however with the aim of uncovering not only complete but also incomplete bid-rigging cartels.Footnote 22 We consider three classes of screens: variance, asymmetry, and uniformity.

As variance screens, we implement the coefficient of variation (CV) and the kurtosis statistic (KURTO), as suggested by Huber and Imhof (2019) and Imhof (2019). In addition, we implement the spread (SPD) of the distribution of the bids as a screen.

The coefficient of variation is widely discussed in the literature (see Abrantes-Metz et al. 2006; Esposito and Ferrero 2006; Jimenez and Perdiguero 2012; Abrantes-Metz et al. 2012; Imhof 2019) and is defined as the standard deviation divided by the arithmetic mean of all bids submitted in a tender:

$$\begin{aligned} CV_{t}=\frac{s_{t}}{\bar{b}_{t}}, \end{aligned}$$
(1)

where \(s_{t}\) is the standard deviation and \(\bar{b}_{t}\) is the mean of the bids in some tender t. The coordination and manipulation of bids by cartel participants might affect the convergence in the distribution of the bids. More precisely, we suspect that bids converge when firms in an auction form a cartel. This is the case because cover bids are somewhat higher than the bid of the designated winner and concentrate around similar values, which are considered large enough to ensure that the designated winner submits the lowest bid. For this reason, the following kurtosis statistic appears appropriate for capturing such convergence effects in cover bids:

$$\begin{aligned} KURTO_{t}=\frac{n_{t}(n_{t}+1)}{(n_{t}-1)(n_{t}-2)(n_{t}-3)}\sum _{i=1}^{n_{t}}\left( \frac{b_{it}-{\bar{b}_{t}}}{s_{t}}\right) ^{4} - \frac{3(n_{t}-1)^{2}}{(n_{t}-2)(n_{t}-3)}, \end{aligned}$$
(2)

where \(b_{it}\) denotes bid i in tender t, \(n_{t}\) the number of bids in tender t, \(s_{t}\) the standard deviation of bids, and \(\bar{b}_{t}\) the mean of bids in that tender. Put simply, the smaller the differences between the bids, the higher the kurtosis statistic, and thus the stronger the indication of a collusive situation. Furthermore, we calculate the spread using the following formula:

$$\begin{aligned} SPD_{t}=\frac{b_{max,t}-b_{min,t}}{b_{min,t}}, \end{aligned}$$
(3)

where \(b_{max,t}\) denotes the maximum bid and \(b_{min,t}\) the minimum bid in some tender t.
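As an illustration, the three variance screens can be computed per tender as in the following R sketch, where bids is the numeric vector of all bids in one tender (the function names are ours, for exposition only).

```r
cv_screen  <- function(bids) sd(bids) / mean(bids)                 # Eq. (1)

kurto_screen <- function(bids) {                                   # Eq. (2)
  n <- length(bids)                                                # requires n >= 4
  z <- (bids - mean(bids)) / sd(bids)
  n * (n + 1) / ((n - 1) * (n - 2) * (n - 3)) * sum(z^4) -
    3 * (n - 1)^2 / ((n - 2) * (n - 3))
}

spd_screen <- function(bids) (max(bids) - min(bids)) / min(bids)   # Eq. (3)
```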

As bid rigging may produce asymmetries in the distribution of bids, we implement the following cover-bidding screens as in Huber and Imhof (2019): the percentage difference (DIFFP), the skewness (SKEW), the relative distance (RD), and the normalized distance (RDNOR). In addition, we add an alternative measure for calculating the relative distance, namely the alternative relative distance (RDALT).

It seems plausible that cartel participants manipulate the difference between the lowest and second-lowest bids to ensure that the contract is awarded to the cartel’s designated winner. To analyze the difference between the two lowest bids, we use the following formula to calculate the percentage difference:

$$\begin{aligned} DIFFP_{t}=\frac{b_{2t}-b_{1t}}{b_{1t}}, \end{aligned}$$
(4)

where \(b_{1t}\) is the lowest bid and \(b_{2t}\) the second-lowest bid in some tender t. We also consider the absolute difference between the first and second-lowest bids \( D_{t}=b_{2t}-b_{1t}\) in the empirical analysis.

The manipulation of bids by cartel participants can simultaneously affect both the difference between the first and the second-lowest bid and the differences across the losing bids. Therefore, following Imhof et al. (2018), we calculate a relative distance (relative to a measure of dispersion) in a tender by dividing the difference between the first and the second-lowest bid by the standard deviation of the losing bids:

$$\begin{aligned} RD_{t}=\frac{b_{2t}-b_{1t}}{s_{losing bids,t}}, \end{aligned}$$
(5)

where \(b_{1t}\) denotes the lowest bid, \(b_{2t}\) the second-lowest bid, and \(s_{losing bids,t}\) the standard deviation calculated among the losing bids in some tender t. In terms of its predictive power, the RD was outperformed by the difference between the first and the second-lowest bid divided (or normalized) by the average of the differences between all adjacent bids (see Huber and Imhof 2019). We also consider this normalized distance in our study:

$$\begin{aligned} RDNOR_{t}=\frac{b_{2t}-b_{1t}}{\frac{(\sum _{i=1,j=i+1}^{n_{t}-1}b_{jt}-b_{it})}{n_{t}-1}}, \end{aligned}$$
(6)

where \(b_{1t}\) is the lowest bid, \(b_{2t}\) the second-lowest bid, \(n_{t}\) is the number of bids and \(b_{it}\), \(b_{jt}\) are adjacent bids (in terms of price) in tender t, with bids being arranged in increasing order.

We consider a further alternative measure for the relative distance, initially suggested by Imhof et al. (2018):

$$\begin{aligned} RDALT_{t}=\frac{b_{2t}-b_{1t}}{\frac{(\sum _{i=2,j=i+1}^{n_{t}-1}b_{jt}-b_{it})}{n_{t}-2}}, \end{aligned}$$
(7)

where \(b_{1t}\) is the lowest bid, \(b_{2t}\) the second-lowest bid, \(n_{t}\) is the number of bids and \(b_{it}\), \(b_{jt}\) are adjacent losing bids in a tender t, with bids being arranged in increasing order. In contrast to the normalized distance, the mean of the differences in the denominator is calculated using only the losing bids. Furthermore, bid manipulation might affect the symmetry of the distribution of bids. For example, because of a greater difference between the first and the second-lowest bid, we expect bid rigging to cause a more asymmetric distribution of bids. We therefore include the skewness as a screen:

$$\begin{aligned} SKEW_{t}=\frac{n_{t}}{(n_{t}-1)(n_{t}-2)}\sum _{i=1}^{n_{t}}(\frac{b_{it}-{\bar{b}_{t}}}{s_{t}})^{3}, \end{aligned}$$
(8)

where \(n_{t}\) denotes the number of the bids, \(b_{it}\) the \(i^{\text {th}}\) bid, \(s_{t}\) the standard deviation of the bids, and \(\bar{b}_{t}\) the mean of the bids in tender t.
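For completeness, the asymmetry screens of Eqs. (4) to (8) can be sketched in R as follows (again, bids denotes all bids in one tender and the function names are illustrative; sorting arranges the bids in increasing order).

```r
diffp_screen <- function(bids) {            # Eq. (4)
  b <- sort(bids)
  (b[2] - b[1]) / b[1]
}

rd_screen <- function(bids) {               # Eq. (5)
  b <- sort(bids)
  (b[2] - b[1]) / sd(b[-1])                 # sd of the losing bids
}

rdnor_screen <- function(bids) {            # Eq. (6)
  b <- sort(bids)
  (b[2] - b[1]) / mean(diff(b))             # mean gap between adjacent bids
}

rdalt_screen <- function(bids) {            # Eq. (7)
  b <- sort(bids)
  (b[2] - b[1]) / mean(diff(b[-1]))         # mean gap among losing bids only
}

skew_screen <- function(bids) {             # Eq. (8)
  n <- length(bids)
  z <- (bids - mean(bids)) / sd(bids)
  n / ((n - 1) * (n - 2)) * sum(z^3)
}
```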

Finally, we verify whether bid rigging (or competition) transforms the distribution of the bids into a less uniform distribution. More precisely, we again suspect that the higher difference between the first and the second-lowest bid influences the asymmetry, such that a cartel leads to a less uniform distribution. Therefore, we consider the nonparametric Kolmogorov–Smirnov statistic (KS):

$$\begin{aligned} D_{t}^{+}=\max _{i}\left( x_{it}-\frac{i_{t}}{n_{t}+1}\right) ,\quad D_{t}^{-}=\max _{i}\left( \frac{i_{t}}{n_{t}+1}-x_{it}\right) ,\quad KS_{t}=\max (D_{t}^{+},D_{t}^{-}), \end{aligned}$$
(9)

where \(n_t\) is the number of bids in a tender, \(i_t\) the rank of a bid and \(x_{it}\) the standardized bid for the \(i^{\text {th}}\) rank in tender t. The standardized bids \(x_{it}\) are the bids \(b_{it}\) divided by the standard deviation of bids in tender t to facilitate the comparison of tenders with different contract values. We suspect that the KS statistic generally differs between cartel and competitive periods.
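A literal R sketch of Eq. (9), under the same conventions as in the sketches above:

```r
ks_screen <- function(bids) {               # Eq. (9)
  n <- length(bids)
  x <- sort(bids) / sd(bids)                # standardized bids, increasing order
  u <- seq_len(n) / (n + 1)                 # reference ranks i/(n+1)
  max(x - u, u - x)                         # KS = max(D+, D-)
}
```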

3.3 Summary Screens

In incomplete cartels, competitive bidders distort the statistical signals produced by bid rigging in the distribution of bids in a tender. We demonstrate this in Fig. 1. Suppose we have four colluding firms. We would expect the bids to converge when firms form a cartel, which could, e.g., be detected through a reduced coefficient of variation, as exemplified in the top-left panel of Fig. 1. However, a competitive bidder might distort the statistical signal produced by the cartel by bidding (significantly) lower or higher than the cartel members, as exemplified by bidder 5 in the bottom-left and top-right panels of Fig. 1. Only if the competitor submits a bid close to the collusive bids will the signal remain (almost) unaffected, as is the case in the bottom-right panel. Such a situation can result from a competitor trying to enjoy the umbrella effect and bidding closer to the collusive bids, or from pure coincidence.

Fig. 1 The potential effect of a competitive bidder

Therefore, tender-based screens can fail to recognize bid rigging when they are calculated for all bids. We circumvent that distortion by calculating the screens not (only) for all the bids in a tender but also for all possible subgroups of three and four bids. Table 3 gives the number of possible subgroups of three or four bids, respectively, when the total number of bids in a tender varies between four and ten.

Table 3 Example of possible subgroups for three and four bids in a tender

For instance, in a tender with a total of six bids, we calculate the same screen for 15 different subgroups containing four bids and for 20 different subgroups containing three bids. In each tender, we then compute summary statistics for each screen: the mean, the median, the minimum, and the maximum of the respective screen across the various subgroups of three or four bids. We use these summary statistics, the so-called ’summary screens’, as predictors for flagging collusive and competitive tenders; they also permit comparing tenders with different numbers of bids. We subsequently exemplify the computation of such summary screens by means of the coefficient of variation for subgroups of four bids.

The mean of all coefficients of variation calculated for subgroups of four bids in each tender is:

$$\begin{aligned} MEAN4CV_{t}=\frac{1}{N_{t}}\sum _{s=1}^{N_{t}}\frac{s_{st}}{\bar{b}_{st}}, \end{aligned}$$
(10)

where s and t denote the indices for subgroup s and tender t, respectively, \(N_{t}\) is the number of all possible subgroups of four bids in tender t, and \(s_{st}\) and \(\bar{b}_{st}\) are the standard deviation and the mean of the bids in subgroup s, respectively. Likewise, the minimum and maximum of the coefficients of variation across the subgroups in a tender correspond respectively to:

$$\begin{aligned}&MIN4CV_{t}=\min _{s}\frac{s_{st}}{\bar{b}_{st}}, \end{aligned}$$
(11)
$$\begin{aligned}&MAX4CV_{t}=\max _{s}\frac{s_{st}}{\bar{b}_{st}}. \end{aligned}$$
(12)

In order to calculate the median for subgroups of four bids in each tender, define the coefficient of variation in subgroup s and tender t as \(CV_{st}=\frac{s_{st}}{\bar{b}_{st}}\) and order the coefficients so that

$$\begin{aligned} CV_{1t} \le CV_{2t}\le ... \le CV_{st} \le ... \le CV_{N_{t}t} . \end{aligned}$$

If the number of subgroups \(N_{t}\) in a tender is odd, the median of the coefficient of variation in tender t is calculated as follows:

$$\begin{aligned} MEDIAN4CV_{t}=CV_{(N_{t}+1)/2,t}. \end{aligned}$$
(13)

If the number of subgroups is even, the median corresponds to:

$$\begin{aligned} MEDIAN4CV_{t}=\frac{CV_{N_{t}/2,t}+CV_{N_{t}/2+1,t}}{2}. \end{aligned}$$
(14)

We apply these approaches to all the screens discussed above across the different tenders. Note that we do not calculate summary screens for subgroups of two bids, because some screens, such as the relative distance (RD), the alternative relative distance (RDALT), the normalized distance (RDNOR), the kurtosis statistic (KURTO), or the skewness (SKEW), cannot be calculated for them. Moreover, cartel participants usually numbered more than two in tenders characterized by incomplete cartels. We also refrain from calculating screens for subgroups of five or more bids: such summary screens only make sense for tenders with six or more bids, and restricting the sample accordingly would have reduced it too much and limited the application of our suggested method in other cases. Finally, our application of summary screens does not require the identity of bidders; we only need the bids in each tender, such that the method can be applied in many different contexts.
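A minimal R sketch of the summary screens, reusing a per-tender screen function such as cv_screen from the sketches above: combn() enumerates all choose(n, k) subgroups of k bids in a tender (e.g., the 15 subgroups of four bids and 20 subgroups of three bids in a six-bid tender from Table 3) and applies the screen to each. The bid values in the example are hypothetical.

```r
summary_screens <- function(bids, screen, k) {
  vals <- combn(bids, k, FUN = screen)          # screen value for each subgroup
  c(mean = mean(vals), median = median(vals),   # Eqs. (10) and (13)/(14)
    min  = min(vals),  max    = max(vals))      # Eqs. (11) and (12)
}

# e.g., MEAN4CV, MEDIAN4CV, MIN4CV and MAX4CV for a tender with six bids
summary_screens(c(100, 104, 105, 107, 112, 120), cv_screen, k = 4)
```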

Appendix E presents the descriptive statistics for the samples used in the empirical analyses of the Swiss data.

3.4 Model Specification

In the empirical analyses, we consider a benchmarking method and five different predictive models that vary in terms of the screens considered. For the benchmarking method, we use the benchmarks suggested by Imhof et al. (2018), developed for and applied to the Swiss construction market.Footnote 23 Model 1 only includes screens calculated for all bids in a tender (rather than summary screens). This approach relates to the one discussed by Huber and Imhof (2019). Still, it extends that study’s set of predictors by including the alternative relative distance (RDALT), the spread (SPD), and the Kolmogorov-Smirnov statistic (KS). In total, we use nine predictors and exclude all screens based on absolute bid values so as to consider only scale-invariant screens in model 1.

In contrast, model 2 exclusively includes the summary screens calculated for all possible subgroups of three bids in a tender. In total, we consider 32 of these summary screens, using all screens of model 1 except the kurtosis (KURTO), which requires at least four bids. Model 3 uses the summary screens of all screens presented above for all possible subgroups of four bids in a tender, making a total of 36 predictors (now including the kurtosis). Model 4 considers all predictors included in models 1, 2, and 3, resulting in 77 screens in total and mixing the summary screens with the tender-based screens. Finally, model 5 also includes three screens based on absolute bid values (and thus not scale-invariant) and the number of bids in a tender (NBRBIDS), producing 81 predictors in total. The motivation for including the number of bids is that it might be easier to settle an agreement with fewer rather than more bidders. Moreover, we can account for behavioral responses of bidders to fiercer competition due to an increased number of bidders (see, e.g., Vickrey 1961). The three value-based screens are the mean bid in a tender, included as a proxy for the contract value (MEANBIDS), the standard deviation of the bids in a tender (STDBIDS), and the absolute difference between the first and the second-lowest bid (D).

4 Flagging Incomplete Bid-Rigging Cartels

4.1 Application

We apply our detection method to data drawn from the cases of See-Gaster and Strassenbau Graubünden, characterized by well-organized bid-rigging cartels which, however, faced competitive outsiders from time to time. In these real cases, competitive and collusive bidders were aware of each other’s existence. Evidence from COMCO’s investigations has shown that cartel participants adopted a more competitive behavior in the presence of competitive bidders by deciding not to collude in some tenders. An agreement’s poor chance of success due to several (potential) competitive bidders motivated such decisions to bid independently for some contracts. However, in other tenders, the cartel faced only one competitive bidder and tried to include her or him in the agreement. Moreover, competitive bidders aware of the existence of the cartel might have tried, if not enrolled in the agreement, to benefit from the umbrella effect of a cartel by bidding higher than they would have in a fully competitive situation. As a consequence of the umbrella effect, the bids of competitive bidders fall nearer to the bids of collusive bidders, such that competitive bids distort the statistical pattern produced by bid rigging less (as illustrated in the bottom-right panel of Fig. 1).

We construct different samples of collusive tenders. Sample 1 includes all tenders with incomplete bid-rigging cartels and at least two cartel participants. As shown in Table 4, the average percentage of cartel participants in sample 1 amounts to 71%. Sample 2 includes tenders with incomplete bid-rigging cartels formed by at least three cartel participants. Since sample 2 excludes tenders with only two cartel participants, its average rate of cartel participants, 75%, is higher than that in sample 1. The logic is the same for samples 3 to 5. Consequently, sample 5 has the highest average percentage of cartel participants and contains the fewest competitive bidders, but at least one per tender. In addition, we construct a sample including all tenders with complete cartels.

We first investigate the performance of the predictive models for complete cartels. As shown in Table 4, the correct classification rates do not differ notably across the machine learning-based models 1 to 5, ranging from 81.3% to 83.3%. However, the correct classification rate of the benchmarking method, 61.7%, is clearly below that of models 1 to 5. In addition, it differs strongly between competitive and collusive tenders, amounting to only 33.4% in the latter case. Possible explanations for this poor performance are the reliance on only two screens, which are not necessarily the optimal predictors, and the use of benchmark values for these two screens drawn from two previous investigations, which are not necessarily optimal in the dataset under consideration. In contrast, the machine learning approaches use a more extensive set of screens and weight their importance in a data-driven way.

However, if we adjust the benchmarks of our benchmarking approach, we can achieve better prediction rates for complete cartels. In Appendix B, we depict a decision tree in Fig. 2 corresponding to the minimal cross-validation error. Our pruned tree, using as predictors only the relative distance (RD) and the coefficient of variation (CV) as in Imhof et al. (2018), shows a correct classification rate of 81.6% for complete cartels. This discrepancy illustrates the fundamental difference between a benchmark method and machine learning: benchmarks are exogenous, whereas machine learning outperforms benchmarks since it chooses the best predictors in each case. While a benchmark can still be adapted to different cases, machine learning algorithms are far more precise. Nonetheless, a benchmark method requires less information to be implemented and therefore remains a simple (first) step in flagging cartels.
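For illustration, such a pruned tree can be obtained with the rpart package, assuming training data with columns RD and CV; this is a sketch under these naming assumptions, not the exact code behind Fig. 2.

```r
library(rpart)

# grow a classification tree on the two screens of Imhof et al. (2018)
tree <- rpart(collusion ~ RD + CV, data = train, method = "class")

# prune at the complexity parameter minimizing the cross-validated error
best_cp <- tree$cptable[which.min(tree$cptable[, "xerror"]), "CP"]
pruned  <- prune(tree, cp = best_cp)
```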

Considering models 1 to 5, the correct classification rates vary between 61.2% and 84.1%, depending on the sample and the model. When the proportion of competitive bidders increases, the correct predictions generally decrease, as depicted in Table 4. This result suggests that cartel participants anticipated competitive bids and decided not to collude in particular tenders, as, for example, in the case of See-Gaster. The models with summary screens calculated for subgroups outperform model 1. Among them, models 3 and 4 slightly outperform model 2, indicating that in our case, summary screens calculated for subgroups of four bids exhibit a higher predictive power than those calculated for subgroups of three bids. The fact that we observe four cartel participants per tender in most cases likely explains this result. In contrast, summary screens calculated for subgroups of three bids may work better if we mainly observe three cartel participants per tender.

Model 5, the only one including the number of bids and value-based screens (as proxies for the contract value) as predictors, outperforms the other models, with correct classification rates 5 to 10 percentage points higher than model 1. The advantage of models 3 or 4 over model 1 varies from 3 to 5.7 percentage points. This points to a decrease in the error rate by roughly 20% or more in some cases, even in the presence of potentially strategic interactions, i.e., outsiders aware of the existence of bid-rigging cartels and trying to benefit from the umbrella effect (Bos and Harrington 2010). Therefore, competition agencies should consider summary screens for subgroups to detect both complete and incomplete bid-rigging cartels.

Moreover, note that the benchmarking method performs poorly when flagging incomplete bid-rigging cartels and does no better than tossing a coin. Specifically, for truly collusive tenders, the correct classification rates vary only between 8.7% and 14.7%.

When looking at the variable importance as reported in Table 5, we find for all models and samples that the Kolmogorov-Smirnov statistic (KS) is an important predictor. In many cases, it is among the three most important variables. This suggests that even if collusive and competitive tenders generally do not follow a uniform distribution, collusive bids are usually far less uniform than competitive bids. Therefore, the Kolmogorov-Smirnov statistic for deviations from the uniform distribution tends to exhibit notably higher values in rigged tenders than in competitive tenders.

The random forest generally picks a balanced set of screens for variance and asymmetry along with the Kolmogorov-Smirnov statistic for model 1 in all samples. Specifically, for the sample with complete cartels, we observe for models 2 to 5 that the random forest selects screens for the variance, mainly the coefficient of variation (CV) and the spread (SPD), along with the Kolmogorov-Smirnov statistic (KS). Screens for asymmetry in the distribution of bids remain unselected for models 2 and 5 when the cartel is complete. However, when cartels are incomplete, the random forest selects for models 2 to 5 screens for asymmetry in the distribution of bids, mostly the skewness (SKEW), the relative distance (RD), the percentage difference (DIFFP), and the alternative relative distance (RDALT), even though the results suggest that the screens for asymmetry are less important than the screens for variance and the Kolmogorov-Smirnov statistic (KS).

For all samples with incomplete cartels, the minima and maxima of the summary screens are the most important predictors, while the mean and median are most important for complete cartels. These results suggest that a few competitive bids disturb the statistical pattern produced by bid rigging enough to make it difficult to detect collusion using tender-based screens. In contrast, using the minimum or maximum of summary screens mitigates the distortion of competitive bids in the statistical patterns produced by bid rigging and allows us to detect both incomplete and complete bid-rigging cartels in the Swiss data with high probability.

Table 4 Correct classification rate in the Swiss data
Table 5 Important predictors for the Swiss data

4.2 Robustness Analysis

We investigate the robustness of our results by discarding the most important predictors and applying the random forest to the remaining predictors. Since model 1 uses fewer predictors than the other models, we leave out the three most important variables, while for models 2 to 5, we drop the five best predictors. Table 6 reports the difference in percentage points in the correct classification rates when keeping vs. dropping the respective predictors.
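A sketch of this robustness check in R, assuming a fitted forest rf and the train/test split from the sketch in Sect. 3.1 (variable names are again illustrative):

```r
# rank predictors by their mean decrease in Gini impurity
imp <- importance(rf)
top <- names(sort(imp[, "MeanDecreaseGini"], decreasing = TRUE))[1:5]

# re-fit the random forest without the five most important predictors
rf_reduced <- randomForest(collusion ~ ., ntree = 1000,
                           data = train[, !(names(train) %in% top)])
mean(predict(rf_reduced, newdata = test) == test$collusion)
```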

Table 6 Differences between original random forest and random forest with discarded variables

The overall correct classification rate of model 1 in samples 1, 3, and 4 when keeping all variables exceeds that obtained when dropping the three best predictors by 3.4 to 4.8 percentage points. Considering the other models and samples, we observe more or less the same predictive power when discarding the most important variables. Therefore, the remaining predictors seem to be suitable substitutes for the discarded ones: other variables become more important when the most important predictors are omitted, and the correct classification rate is hardly affected.

Furthermore, we investigate the robustness with respect to the type of contract. We subsequently consider only contracts for road construction and asphalting for both the cartel and post-cartel periods, excluding contracts for civil engineering and mixed contracts that combine civil engineering with road construction or asphalting. The reason is that certain specific characteristics of civil engineering contracts might affect the screens and, therefore, the correct classification rate. Dropping mixed contracts and contracts for civil engineering permits us to verify whether this materially affects the correct classification rate among the remaining contracts for road construction and asphalting. Table 7 reports the difference in percentage points in the correct classification rates when using all contracts vs. using contracts for road construction and asphalting only.

Table 7 Differences between original random forest and random forest using only contracts for road construction and asphalting

In samples 1 and 2, we find the correct classification rates of the random forest for road construction and asphalting contracts to be superior to those of the random forest with all types of contracts. For example, the difference in the (overall) classification rate of model 1 in samples 1 and 2 amounts to 6.2 and 2.8 percentage points, respectively. A possible explanation could be that we implicitly suppress some competitors when we keep only the road construction and asphalting contracts. For example, in sample 1, the average percentage of collusive bidders is 81%, considerably higher than with all types of contracts (71%, see Table 4). The cartel percentage is thus higher for this restricted sample of road construction and asphalting contracts alone, which explains the higher performance in samples 1 and 2. In sample 3, the situation begins to change, the correct classification rates being almost identical for both contract types. Noticeably, the differences increase again for all models in the subsequent samples, although not as strongly as before and in the opposite direction. Therefore, for an almost identical average percentage of cartel participants, the correct classification rates of the random forest for all types of contracts are slightly superior to those for road construction and asphalting.

To investigate the robustness of the correct classification rate across different machine learning algorithms, we also assess the performance of lasso regression and an ensemble method (including bagged trees, random forest, and neural networks) for all models and samples. We explain these algorithms, also outlined by Huber and Imhof (2019), in more detail in Appendix C. Table 8 reports the difference in percentage points in the correct classification rates of the random forest minus the correct classification rates of the lasso and ensemble method.

Table 8 Differences between the original random forest and the lasso and ensemble methods

Considering samples 1 and 2 in Table 8, we find that the lasso and the ensemble method slightly outperform the random forest. The maximum difference in (overall) correct classification rates across models and samples is 2.9 percentage points. While the somewhat lower rates speak against the random forest, its performance is more uniform: there is less divergence between the competitive and collusive tenders, which may be important to practitioners. For samples 3, 4, and 5, the lasso and the ensemble method in general also slightly outperform the random forest, in two cases even more markedly, with correct classification rates 4.3 to 6.7 percentage points higher for model 1 in samples 4 and 5. This implies that in samples 4 and 5 (with a high share of collusive bidders), considering summary screens does not significantly improve the predictive power of the lasso and the ensemble method, in contrast to the random forest. On the other hand, and as for samples 1 and 2, the random forest shows a more uniform performance (e.g., correct classification rates do not differ much between competitive and collusive tenders). For complete cartels, we find a similar performance in terms of (overall) correct classification rates between the random forest and the ensemble method, while the random forest slightly dominates the lasso regression. Considering imbalances in the predictive performance across competitive and collusive tenders, the random forest and the ensemble method perform more homogeneously than the lasso regression.

To conclude, in Table 8, the random forest shows a somewhat lower correct classification rate than the lasso and the ensemble method. Still, it exhibits a more homogeneous correct classification rate across both the competitive and collusive tenders. All in all, this robustness check shows the stability of our results.

5 Conclusion

In this paper, we have suggested a robust method for flagging bid rigging in tenders that is likely to be more powerful for detecting incomplete cartels than previously suggested methods. Our approach combined screens, i.e., statistics derived from the distribution of bids in a tender, with machine learning to predict the probability of collusion. As a methodological innovation, we calculated the screens for all possible subgroups of three or four bids within a tender and considered summary statistics such as the mean, median, maximum, and minimum of each screen as predictors in the machine learning algorithm. By doing so, we mitigated the issue that competitive bids may distort the statistical signals produced by bid rigging.

We applied our method to data from the investigations involving incomplete cartels in the regions See-Gaster and Graubünden in Switzerland. In terms of out-of-sample performance, machine learning using summary screens (calculated for all possible subgroups of three and four bids) as predictors outperformed the other screening methods. However, the performance of all machine learning-based methods in all models still decreased with the relative number of competitive bids in the data of the investigations involving incomplete cartels. This decrease indicates that cartel participants anticipated competition from non-cartel bidders.

Compared to tender-based screens, summary screens increased the correct classification rate by 3 to 5.7 percentage points for incomplete cartels. This implies a substantial decrease in the error rate (one minus the correct classification rate) of 22.2%, despite the threat that predictive performance might be partially compromised by competitive bidders trying to benefit from the umbrella effect, i.e., bidding closer to collusive bids (Bos and Harrington 2010). As screening by competition agencies can trigger investigations with legal consequences for potential cartel members, such a decrease in the error rate appears highly desirable. Thus, our results demonstrate the usefulness of combining machine learning with an improved set of statistical screens to reduce the distortions caused by competitive bids in incomplete cartels. Moreover, the method appears promising for detecting collusion in other industries or countries.

A limitation of our study is that we restricted ourselves to summary screens calculated per (within a) tender. On the one hand, this makes the method simple to implement on a large scale, i.e., for many tenders, in order to flag those appearing suspicious. On the other hand, competition authorities are required in a second step to identify the bidders worth investigating further, e.g., by verifying which bidders participated in multiple tenders among those flagged as suspicious. The burden of this second step might be overcome by screening methods capable of directly flagging suspicious bidders (rather than tenders) (see, e.g., Imhof and Wallimann 2021). Therefore, combining our proposed summary screens with firm-specific predictors of collusion appears to be a promising agenda for future research.