Abstract
The aim of this research is to explore and analyze the benefits of risk classification methods by using data mining techniques on premium ratemaking in nonlife insurance. We rely on generalized linear models (GLMs) framework for nonlife premiums and examine the impact of specific data mining techniques on classifications of risk in motor hull insurance in Bosnia and Herzegovina. We study this relationship in an integrated framework considering a standard risk model based on the application of Poisson GLM for claims frequency estimate. Although GLM is a widely used method to determine insurance premiums, improvements of GLM by using the data mining methods identified in this paper may solve practical challenges for the risk models. The application of the data mining method in this paper aims to improve the results in the process of nonlife insurance premium ratemaking. The improvement is reflected in the choice of predictors or risk factors that have an impact on insurance premium rates. The following data mining methods for the selection of prediction variables were investigated: forward stepwise and neural networks. We provide strong and robust evidence that the use of data mining techniques influences premium ratemaking in nonlife insurance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Antonio K, Valdez EA (2010) Statistical concepts of a priori and a posteriori risk classification. Adv Stat Anal 96(2):187–224
Coskun S (2016) Introducing credibility theory into GLMs for ratemaking on auto portfolio. Institute de Actuaries, Actuarial thesis. Centre d'Etudes Actuarielles
Denuit M, Lang S (2004) Non-life rate-making with Bayesian GAMs. Insur: Math Econ 35(3):627–647
Dionne G, Vanasse C (1988) A generalization of actuarial automobile insurance rating models: the negative binomial distribution with a regression component. ASTIN Bull 19(2):199–212
Dionne G, Vanasse C (1992) Automobile insurance ratemaking in the presence of asymmetrical information. J Appl Economet 7(2):149–165
Dugas C, Bengio Y, Chapados N, Vincent P, Denoncourt G, Fournier C (2003) Statistical learning algorithms applied to automobile insurance ratemaking. Casualty Actuar Soc Forum 1(1):179–214
Efroymson MA (1960) ‘Multiple regression analysis. In: Mathematical methods for digital computers’. Wiley, New York
Famoye F, Rothe DE (2003) Variable selection for poisson regression model. J Mod Appl Stat Methods 2(2):380–388
Flynn M, Francis LA (2009) More flexible GLMs: zero-inflated models and hybrid models. Casualty Actuar Soc E‐Forum 148–224
Francis L (2001) Neural networks demystified. Casualty actuarial society forum. pp 253–320
Frees EW, Lee G (2016) Rating endorsements using generalized linear models casualty actuarial society. Var Adv Sci Risk 10(1):51–74
Garrido J, Genest C, Schulz J (2016) Generalized linear models for dependent frequency and severity of insurance claims. Insur: Math Econ 70:205–215
Goldburd M, Khare A, Tevet D (2016) Generalized linear models for insurance rating. Casualty Actuar Soc 5, 2nd edn
Guo L (2003) Applying data mining techniques in property/casualty insurance. Casualty Actuarial Society forum. Available at: https://www.casact.org/pubs/forum/03wforum/03wf001.pdf
Haberman S, Renshaw AE (1996) Generalized linear models and actuarial science. Stat 45(4):407–436
Han J, Kamber M, Pei J (2012) Data mining concepts and techniques, 3rd edn. The Morgan Kaufmann, Burlington, USA
Harrell F (2001) Regression modeling strategies: with applications to linear models, logistic regression, and survival analysis. Chapter 5: resampling, validating, and simplifying the model. 3:88–103
Hastie T, Tibshirani R, Friedman J (2001) The elements of statistical learning. Springer, New York, USA
Hebb DO (1949) The organization of behavior. A neuropsychological theory. Wiley
Hilbe J (2007) Negative binomial regression. Cambridge: Cambridge University Press. https://doi.org/10.1017/CBO9780511811852
Hilbe JM (2014) Modeling count data. Cambridge University Press, New York
de Jong P, Heller GZ (2013) Generalized linear models for insurance data, 5th edn. Cambridge University Press, New York
Jørgensen B, Souza M (1994) Fitting tweedie's compound poisson model to insurance claims data. Scandinavian Actuarial J 1:69–93. https://doi.org/10.1080/03461238.1994.10413930
Kaas R, Goovaerts M, Dhaene J, Denuit M (2009) Modern actuarial risk theory, using R. Springer, Berlin
Kolyshkina I, Wong S, Lim S (2004) Enhancing generalised linear models with data mining. Casualty actuarial society. Discussion paper program. Available at https://www.researchgate.net/publication/253447757_Enhancing_Generalised_Linear_Models_with_Data_Mining
Kriesel D (2007) A brief introduction to neural networks
Kuha J (2004) AIC and BIC: comparisons of assumptions and performance. Socio Method Res 33(2):188–229. https://doi.org/10.1177/0049124103262065
Lowe J, Pryor L (1996) ‘Neural networks v. GLMs in pricing general insurance’, General Insurance Convention
Makov UE, Weiss J, Frees EW, Meyers GG, Derrig RA (2016) Predictive modeling for usage-based auto insurance
McCullagh P, Nelder JA (1989) Generalized linear models, 2nd edn. Chapman & Hall, London
Nelder JA, Wedderburn RWM (1972) Generalized linear models. J Roy Stat Soc 135(3):370–384
Ohlsson E, Johansson B (2010) Non-life insurance pricing with generalized linear models. Springer-Verlag, Berlin
Parodi P (2014) Pricing in general insurance, 1st edn. Chapman and Hall/CRC, New York
Pinquet J (1997) Allowance for cost of claims in bonus-malus systems. Ins: Math Econ 19:152–152. https://doi.org/10.1016/S0167-6687(97)81696-4
Renshaw AE (1994) Modeling the claims process in the presence of covariates. ASTIN Bull 24(2):265–285
SAS Institute (2002) Data mining in the insurance industry - solving business problems using SAS enterprise miner software. Available at: https://www.insurance-canada.ca/2002/10/01/data-mining-in-the-insurance-industry-solving-business-problems-using-sas-enterprise-miner-software/
Saltelli AS et al (2004) Sensitivity analysis in practice–a guide to assessing scientific models. Wiley
Shapiro AF, Jain LC (2003) Intelligent and other computational techniques in insurance. World Scientific. https://doi.org/10.1142/5441
Smyth G, Jorgensen B (2002) Fitting tweedie's compound poisson model to insurance claims data: dispersion modelling. ASTIN Bulletin. 32. https://doi.org/10.2143/AST.32.1.1020
Sumathi S, Sivanandam SN (2006) Introduction to data mining and its applications. Springer-Verlag, Berlin
Werner G, Modlin C (2010) Basic ratemaking. Casualty actuarial society, 4th edn
Yao J (2008) Clustering in ratemaking: with application in territories clustering. Casualty Actuar Soc Discuss Pap Program 170–192
Yip K, Yau, K (2005) On modelling claim frequency data in general insurance with extra zeros. Ins: Math Econ 36:153–163. https://doi.org/10.1016/j.insmatheco.2004.11.002
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Omerašević, A., Selimović, J. (2022). Risk Classification in Nonlife Insurance Premium Ratemaking. In: Terzioğlu, M.K. (eds) Advances in Econometrics, Operational Research, Data Science and Actuarial Studies. Contributions to Economics. Springer, Cham. https://doi.org/10.1007/978-3-030-85254-2_33
Download citation
DOI: https://doi.org/10.1007/978-3-030-85254-2_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-85253-5
Online ISBN: 978-3-030-85254-2
eBook Packages: Economics and FinanceEconomics and Finance (R0)