# The continuous sample of working lives: improving its representativeness

## Abstract

This paper studies the representativeness of the Continuous Sample of Working Lives (CSWL), a set of anonymized microdata containing information on individuals from Spanish Social Security records. We examine several CSWL waves (2005–2013) and show that they are not representative of the population with a pension income. We then develop a methodology to draw a large dataset from the CSWL that is much more representative of the retired population in terms of pension type, gender and age. This procedure also makes it possible for users to choose between goodness of fit and subsample size. In order to illustrate the practical significance of our methodology, the paper also contains an application in which we generate a large subsample distribution from the 2010 CSWL. The results are striking: with a very small reduction in the size of the original CSWL, we significantly reduce errors in estimating pension expenditure for 2010, with a *p* value greater than or equal to 0.999.

## Keywords

Continuous Sample of Working Lives · Public pension system · Subsample selection · Stratified sampling · Chi-square test · *p* value

## JEL Classification

C81 · H55 · J26

## 1 Introduction

Selecting a representative sample from the population is essential in quantitative research, because results obtained from a wrongly selected sample that is not properly in tune with the object of study cannot be generalized to the whole population. Moreover, a sample that is smaller than appropriate may not have enough diversity to enable significant differences or associations potentially present in the target population to be identified. It is fundamental to make the right choice and decide on the right way to deal with the dataset, always making sure that it is the best one for the purposes of the research. Hence, it is important to check that the sample selected is representative of the target population in terms of demographic characteristics as well as any others that can affect the results of the study to be conducted.^{1}

The Continuous Sample of Working Lives (CSWL) is a random sample (RS) of around 1.2 million people, i.e. 4% of the reference population. It contains administrative data on working lives, which provide the basis for this sample taken from Spanish Social Security records, and comprises anonymized microdata with detailed information on individuals. Izquierdo et al. (2009) point out that this database provides a unique dataset with very rich information about labour market histories and personal characteristics, such as nationality, date and country of birth (or province if Spanish), gender and place of residence when the individual first entered the Social Security system, along with additional information about the composition of the household and labour market variables.

The first wave covers people who had an economic relationship with the Social Security system in 2004, and provides the entire working history (employment, unemployment and retirement) of the sample population. The sample is updated every year using information from the variables selected from the Social Security system dating back to when computerized records began, and from other administrative data sources which record additional information on individuals. Apart from the details given by the institutions responsible for generating the CSWL,^{2} the data available to researchers date from 2004 to 2015.^{3}

The sample reference population^{4} is defined as individuals who have had some connection (through contributions, pensions or unemployment benefits) to the Social Security system at any time during the year of reference. This population of reference makes it possible to select people who on a particular date in that year had no relationship with Social Security, but who did at some time before or after that date in the same year. This means that the CSWL will contain details about a person who may have had various relationships with Social Security (unemployed, unaffiliated, contributor, pensioner), unlike other datasets that only show a single relationship because they refer to the situation on one specific date in the year.

Individuals are selected from the reference population each year according to whether their identification codes contain certain randomly-generated figures in the correct positions. Each year individuals who figured in the previous version of the CSWL and continue to have a relationship with Social Security remain in the selection, while new individuals are incorporated if they meet the requirement of having identification codes containing the randomly-generated figures described above. The detailed information available on the individuals selected includes work trajectories (from 1967), contribution bases (from 1980) and/or pensions (from 1996), as long as it is contained in the Social Security administrative records. Those individuals who for any reason have no connection to Social Security in a particular year will not figure in the CSWL.

The CSWL is a dataset that has been used in a considerable number of studies, especially on labour economics (Treviño et al. 2008; Benavides et al. 2010; Vall Castello 2012; Bonhomme and Hospido 2013, 2016; Solé et al. 2013; Agliari et al. 2014; Arranz and García-Serrano 2014; Barra et al. 2014 and Nagore García and van Soest 2016) and the Spanish public pension system (Antón Pérez et al. 2007; Argimón et al. 2007; Boado-Penas et al. 2008; Moral Arce et al. 2008; Vidal Meliá et al. 2009; Cairó Blanco 2010; Peinado Martínez 2011; Devesa et al. 2012; Domínguez Fabián et al. 2012; Meneu Gaya and Encinas Goenechea 2012; Vicente Merino et al. 2012; Conde Ruiz and González 2013, and Vegas Sánchez et al. 2013). Other studies give detailed descriptions of its characteristics, advantages and limitations.^{5}

Other countries such as the USA and Germany have similar databases but they use stratification. For example, the US Social Security Administration (SSA) has been compiling a Continuous Work History Sample (CWHS) since the late 1930s, which is a one-percent stratified cluster probability sample of all possible Social Security numbers. The population, or sampling frame, from which the CWHS cases are selected consists of the 1 billion possible nine-digit Social Security numbers (SSNs). These digits represent the geographical area of each number allocated, a group for the date of issue and a random serial number (Smith 1989). The numbers are thus stratified geographically by place of application for SSN and chronologically by the process of allocation of numbers within each stratum. The information source is the so-called MEF (Master Earnings File; Olsen and Hudson 2009), which is used to determine pensions for retirement, permanent disability and widowhood in the US public Social Security system (OASDI).

Similarly in Germany (Himmelreicher and Stegmann 2008), there is the so-called sample of insured persons and their insurance accounts (Versicherungskontenstichprobe, VSKT), which provides longitudinal data that have a high potential for analyses of employment biographies and pension claims in old age. These data are process-produced, contain very large samples and allow for differentiated analyses of a variety of social groups. The VSKT was initially sampled in 1983 as a stratified^{6} random sample with disproportional selection probabilities and since then has continued as a panel containing monthly information on the individuals included in the sample. It represents 1% of the contributing population.

One important branch of research in pension systems is the problem of global ageing and the sustainability of public pension systems, referred to hereafter as PPS. As mentioned above, the CSWL is one of the datasets considered for the purposes of this research in Spain. In order to analyze Spanish PPS, it is necessary to have information on relevant variables such as age and gender for each year of reference as considered in previous studies. These last two factors are essential for correctly estimating life expectancy, so any study that makes estimates of future benefits should select a sample that is representative of the population in terms of age and gender as well as type of pension.

With all this in mind, the first objective of the paper is to analyse whether all the information given by the CSWL on the benefit recipients makes up the best replica of the study population that researchers can have. To solve this question of how representative the CSWL is of the population of pensioners, it is important to carry out a statistical analysis to determine whether there are significant differences between the CSWL sample distribution and the population distribution. Even though the researcher does not have the entire population, the distributions of the population of pensioners organized by age, gender and type of pension on the last day of each year are available. Therefore it is possible to carry out a test in order to check whether the CSWL has the same distribution as the population of pensioners. However, given that the CSWL is not a stratified sample, it is advisable to check whether it covers the correct proportion of the population in each stratum to be considered in the study of the pensioner population categorized by age, gender and type of pension.

In this paper we conduct such an analysis for the CSWL waves for 2005–2013 and confirm that there is a lack of representativeness in most years. It has long been known that a stratified random sample (SR) enables a more efficient selection to be made when one of the variables of interest presents great variability, as is the case with the ages of pensioners in the different types of pension. Given these results, the first idea that comes to mind is: why not use a random sample with stratification to obtain a dataset that better represents the population of pensioners?

The answer is clear: researchers do not have all the data on the population, so they cannot extract such a sample. They would have to use a stratified random sample (SR) contained in the CSWL, but with a considerably smaller size, which would entail a loss of richness in the information on pensioners’ working lives. Hence it is important and advisable to develop other procedures in order to obtain larger subsamples with less reduction in the total number of pensioners than in the SR subsample contained in the CSWL, and at the same time to make the original CSWL more representative in terms of pension benefits.

Hence the second objective of the paper is to provide researchers with a novel methodology for the design of a dataset on pensioners by making the CSWL more representative of the population in terms of type of pension, gender, and age, and by trying to miss as few pension records as possible so as not to overlook diversity in working lives.

Subsample selection is done by finding a feasible solution to a nonlinear optimization problem (NLP)^{7} using mixed integer nonlinear programming with just one real non-negative decision variable, the constant of proportionality, *q*, in a stratified sample design with proportional allocation. Maximizing *q* is equivalent to maximizing the size of the subsample and is subject to constraints implied by the fact that the number of pensioners to be included in each cohort of the subsample has to be a natural number (non-negative integer) and that the subsample obtained must be included in the population as well as in the CSWL. The methodology applied uses a goodness of fit test—Pearson’s chi-square test—in order to make the subsample selected more representative of the population, providing *p* values close to 1. This methodology enables us to obtain quite a large dataset included in the CSWL which is much more representative of the pensioner population in terms of type of pension, gender, and age, as would be the case with an SR. In addition, this procedure enables users to choose between goodness of fit quality and subsample size.
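To make the role of the proportionality constant *q* concrete, the following is a minimal sketch of a strictly proportional allocation that must fit inside the CSWL. It is not the optimization used in the paper, which also relies on the chi-square fit; the function name `proportional_subsample` and all counts are hypothetical and serve only as an illustration.

```python
import math


def proportional_subsample(population, cswl):
    """Largest strictly proportional allocation that fits inside the CSWL.

    population, cswl: dicts mapping a stratum label to a pension count.
    Returns (q, allocation), where allocation[s] = floor(q * population[s]).
    """
    # q cannot exceed the smallest sample-to-population ratio of any stratum.
    q = min(cswl.get(s, 0) / n_pop for s, n_pop in population.items() if n_pop > 0)
    allocation = {}
    for s, n_pop in population.items():
        # Small epsilon guards against floating-point results such as 69.999...
        size = math.floor(q * n_pop + 1e-9)
        allocation[s] = min(size, cswl.get(s, 0))
    return q, allocation


# Hypothetical strata counts, for illustration only (not INSS or CSWL data).
pop_counts = {"a": 1000, "b": 2000, "c": 500}
cswl_counts = {"a": 45, "b": 70, "c": 22}
q, allocation = proportional_subsample(pop_counts, cswl_counts)
```

In this toy case the worst-covered stratum fixes *q*, so the strictly proportional subsample is noticeably smaller than the sample it is drawn from. This is the size loss that motivates relaxing strict proportionality in exchange for a controlled goodness of fit.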

Finally, in order to illustrate the gains obtained with the selection of the subsample, the methodology is applied to the CSWL for 2010,^{8} to gauge the improvement in the estimate of total pension expenditure in different cohorts taking into account age, gender and type of pension, even though the main objective is not this but to obtain the subsample itself for use in any subsequent analysis of the Spanish public pension system. Given that the lack of representativeness of the CSWL has also been found in other years, our findings suggest that the same procedure could usefully be applied to select subsamples in the other waves of the CSWL.

The structure of the rest of the paper is as follows: Sect. 2 analyses the representativeness of the CSWL for the years from 2005 to 2013 with respect to pension benefits. Section 3 sets out the distribution by type of pension, gender, and age of a hypothetical CSWL using SR sampling and the distribution of a subsample obtained by SR sampling using the original CSWL. In this section we show the importance of stratification as a sort of backtesting. We check whether stratification matters by looking at the total expenditure estimated using a stratified random sample. Section 4 details the criteria used for subsample selection and the results obtained. The paper ends with conclusions, pointers for future research and two appendices: the first shows all the tables and graphs with the estimates of the total expenditure deviations for 2010, and the second (online) extends the analysis of the goodness of fit of the CSWL to the population (INSS) for the whole period 2005–2013 and summarizes the problem statement whose solution provides the distribution of the number of pensioners in large subsamples which also represent the population better than the CSWL itself.

## 2 Analysis of the goodness of fit of the CSWL to the population (2005–2013)

In this section we analyse how well the CSWL pension data distribution fits the population distribution of pensioners at 31 December for the years 2005 to 2013^{9} by age, gender, and type of pension. The data are available in the statistical reports of the National Social Security Institute (INSS), though it is important to stress that the population from which the sample is drawn comprises all those individuals who have been registered or have received some kind of contributory pension from the Social Security system at any time during the year of reference, regardless of how long they were in that situation. It does not therefore coincide with the figure for the population at 31 December each year or indeed with the population of pensioners, but is larger. However, in our study there is a process of post-stratification of the CSWL with all pensions registered (current) at 31 December being grouped by cohorts for age, gender and type of pension as of that date. We do not add all the pensions that were registered at some time during the year but only those which were recorded as currently registered on 31 December, just as the INSS statistical report for each year does. The composition of the pensioners population is obtained from INSS (2006–2007), INSS (2008–11) and INSS (2012–2014). Pensions deregistered during the year due to the death of the recipient or because the recipient ceased to meet the requirements for receiving the pension are not considered on 31 December. The statistics on the total number of pensioners in the INSS statistical report consider only those pensions recorded as currently registered on 31 December, so a comparison between the distributions of pensioners (by type of pension, gender and age) in the CSWL as of 31 December and the INSS makes sense, because the moment in time considered is the same.

In short, the aim of this analysis is to determine whether there are any statistically significant differences in the weights of the cohorts between the sample and the population using a goodness of fit test.

To conduct the test, the CSWL must first be post-stratified once the sample records with no information on gender or date of birth have been deleted. The main theoretical reason why the population of pensioners is stratified by type of pension, gender and age is that life expectancy differs for each pensioner depending on the type of benefit received (retirement, permanent disability, widow(er)’s, orphan’s and family responsibilities),^{10} on whether the pensioner is a man or a woman, and on his or her age. So in order to make accurate forecasts when analysing the sustainability of the Spanish public pension system, in which life expectancy plays a crucial role, those differences have to be taken into account. Therefore it is very important to have a sample of individuals that adequately represents the population of pensioners, taking into account these variables.

Another practical reason is that the information available to us about the population of pensioners is the distribution of this population organized by age, gender and type of pension. Moreover, the age cohorts considered in our analysis are also given in the format in which the information is disclosed by the INSS.

Once the data on pensions from the CSWL have been post-stratified at 31 December by type of pension, gender, and age cohorts, we perform a preliminary analysis comparing the distribution obtained from the CSWL with that of the population for 2005–2013. We use the equivalent table from the statistical report of the Spanish National Social Security Institute (INSS), once those population records that provide no information on gender or date of birth have been deleted. We thus compare the number of pensions in each cohort of the sample with the same cohort of the population, to check for differences with respect to the 4% of the population that the sample should in theory represent.
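The post-stratification and the cohort-by-cohort comparison described above can be sketched as follows. This is only an illustration under stated assumptions: the record layout (`pension_type`, `gender`, `age`), the function names and the toy counts are hypothetical and do not reflect the actual CSWL file structure. Cohort labels use a plain hyphen for simplicity.

```python
from collections import Counter


def age_cohort(age):
    """Map an age in years to a five-year cohort label; 85+ is a single band."""
    if age >= 85:
        return "85 and over"
    lower = (age // 5) * 5
    return f"{lower}-{lower + 4}"


def post_stratify(records):
    """Count pensions per (pension type, gender, age cohort) stratum.

    Records with no gender or no age are dropped, mirroring the deletion
    of records without gender or date of birth described in the text.
    """
    counts = Counter()
    for rec in records:
        if rec.get("gender") is None or rec.get("age") is None:
            continue
        stratum = (rec["pension_type"], rec["gender"], age_cohort(rec["age"]))
        counts[stratum] += 1
    return counts


def sampling_ratios(sample_counts, population_counts):
    """Percentage of each non-empty population stratum found in the sample."""
    return {
        stratum: 100.0 * sample_counts.get(stratum, 0) / n_pop
        for stratum, n_pop in population_counts.items()
        if n_pop > 0
    }


# Hypothetical records and population counts, for illustration only.
records = [
    {"pension_type": "retirement", "gender": "F", "age": 67},
    {"pension_type": "retirement", "gender": "F", "age": 69},
    {"pension_type": "orphan", "gender": "M", "age": 12},
    {"pension_type": "retirement", "gender": None, "age": 70},  # dropped
]
sample = post_stratify(records)
population = {("retirement", "F", "65-69"): 50, ("orphan", "M", "10-14"): 20}
ratios = sampling_ratios(sample, population)
```

With five-year bands from 0–4 up to 80–84 plus an open-ended top band, this grouping yields the 18 age cohorts used in the analysis.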

Table 1 Percentages of pensions in the CSWL out of the total INSS population by age, 2010.

*Source* Authors’ own calculations based on the CSWL 2010 and INSS (2011)

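Age cohorts | Permanent disability, male (%) | Permanent disability, male (P. Average) | Permanent disability, female (%) | Permanent disability, female (P. Average) | Permanent disability, total (%) | Permanent disability, total (P. Average) | Retirement, male (%) | Retirement, male (P. Average) | Retirement, female (%) | Retirement, female (P. Average) | Retirement, total (%) | Retirement, total (P. Average) |
---|---|---|---|---|---|---|---|---|---|---|---|---|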

15–19 | | 884.22 | | 0.00 | | 884.22 | 0.00 | 0.00 | 0.00 | |||

20–24 | 3.47 | 535.76 | 4.13 | 592.80 | 3.59 | 547.81 | 0.00 | 0.00 | 0.00 | |||

25–29 | 4.30 | 752.98 | 3.74 | 625.79 | 4.16 | 724.17 | 0.00 | 0.00 | 0.00 | |||

30–34 | 3.78 | 773.33 | 3.61 | 684.87 | 3.73 | 748.48 | | 392.40 | 0.00 | | 392.40 | |

35–39 | 4.08 | 785.29 | 3.84 | 665.09 | 4.00 | 748.29 | | 278.76 | | 425.40 | | 352.08 |

40–44 | 4.02 | 788.74 | 4.03 | 688.59 | 4.02 | 756.41 | | 329.64 | 0.00 | | 329.64 | |

45–49 | 4.02 | 800.97 | 3.89 | 715.74 | 3.98 | 773.37 | 10.00 | 685.54 | | 0.00 | | 685.54 |

50–54 | 3.96 | 851.36 | 4.08 | 742.42 | 4.00 | 812.14 | 3.71 | 2033.79 | | 1637.85 | 4.75 | 1916.81 |

55–59 | 4.00 | 971.50 | 3.98 | 767.40 | 4.00 | 902.43 | 3.38 | 1885.94 | | 1721.92 | 3.47 | 1878.04 |

60–64 | 4.01 | 1017.31 | 4.02 | 747.39 | 4.01 | 930.23 | 4.00 | 1402.87 | 4.09 | 916.03 | 4.03 | 1268.00 |

65–69 | | 1013.24 | | 715.45 | | 913.47 | 3.85 | 1190.45 | 3.79 | 705.68 | 3.83 | 1025.15 |

70–74 | | 581.79 | | 439.33 | | 488.17 | 4.05 | 1023.05 | 4.00 | 614.31 | 4.03 | 884.88 |

75–79 | | 788.94 | 4.33 | 376.67 | 4.46 | 426.54 | 4.00 | 959.63 | 3.97 | 581.99 | 3.99 | 832.27 |

80–84 | 4.11 | 552.04 | 4.16 | 366.07 | 4.16 | 375.32 | 3.97 | 893.06 | 3.92 | 559.02 | 3.95 | 768.49 |

85 and over | 4.58 | 512.80 | 3.94 | 362.50 | 3.97 | 369.60 | 3.89 | 799.44 | 3.93 | 507.17 | 3.91 | 659.56 |

Total | 4.25 | 916.16 | 4.22 | 710.48 | 4.24 | 845.13 | 3.96 | 1041.00 | 3.92 | 619.00 | 3.95 | 891.00 |

Average age | 55 | 57 | 56 | 75 | 76 | 76 |

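Age cohorts | Widow(er)’s, male (%) | Widow(er)’s, male (P. Average) | Widow(er)’s, female (%) | Widow(er)’s, female (P. Average) | Widow(er)’s, total (%) | Widow(er)’s, total (P. Average) | Orphan’s, male (%) | Orphan’s, male (P. Average) | Orphan’s, female (%) | Orphan’s, female (P. Average) | Orphan’s, total (%) | Orphan’s, total (P. Average) |
---|---|---|---|---|---|---|---|---|---|---|---|---|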

0–4 | 0.00 | 0.00 | 0.00 | 4.58 | 265.96 | 3.75 | 262.08 | 4.17 | 264.24 | |||

5–9 | 0.00 | 0.00 | 0.00 | 4.32 | 263.20 | 4.21 | 260.01 | 4.27 | 261.68 | |||

10–14 | 0.00 | 0.00 | 0.00 | 4.28 | 263.90 | 4.37 | 268.06 | 4.32 | 265.94 | |||

15–19 | | 396.36 | | 1042.79 | | 827.31 | 3.88 | 272.37 | 4.06 | 265.82 | 3.97 | 269.08 |

20–24 | | 894.40 | | 664.19 | | 745.44 | 4.07 | 286.34 | 3.96 | 296.96 | 4.01 | 291.67 |

25–29 | | 642.69 | | 711.74 | | 678.94 | 4.39 | 313.52 | 3.52 | 279.11 | 4.03 | 301.13 |

30–34 | | 673.96 | | 653.24 | | 660.63 | 3.71 | 345.78 | 3.88 | 349.61 | 3.78 | 347.38 |

35–39 | | 616.53 | 4.52 | 648.78 | 4.69 | 643.71 | 3.76 | 347.02 | 3.93 | 367.01 | 3.83 | 355.22 |

40–44 | 4.93 | 594.51 | 4.28 | 625.15 | 4.36 | 620.95 | 3.98 | 374.65 | 4.05 | 388.96 | 4.01 | 380.56 |

45–49 | 4.39 | 564.79 | 4.14 | 627.18 | 4.17 | 618.52 | 4.06 | 410.16 | 3.92 | 424.74 | 4.00 | 416.02 |

50–54 | 3.59 | 576.48 | 3.88 | 646.28 | 3.84 | 637.47 | 4.08 | 443.42 | 4.24 | 445.60 | 4.15 | 444.38 |

55–59 | 3.51 | 570.87 | 3.93 | 638.26 | 3.88 | 630.86 | 4.13 | 466.53 | 3.86 | 475.74 | 4.01 | 470.58 |

60–64 | 3.73 | 539.97 | 3.99 | 659.33 | 3.97 | 648.91 | 3.63 | 476.60 | 4.05 | 496.56 | 3.84 | 487.30 |

65–69 | 3.70 | 462.44 | 3.95 | 634.15 | 3.94 | 622.96 | 3.65 | 488.78 | 4.56 | 498.23 | 4.15 | 494.48 |

70–74 | 4.09 | 400.04 | 4.01 | 605.55 | 4.02 | 593.48 | 3.66 | 516.53 | 4.08 | 499.55 | 3.91 | 505.97 |

75–79 | 3.94 | 383.33 | 4.01 | 588.50 | 4.01 | 577.41 | 4.29 | 526.40 | 3.82 | 534.87 | 3.98 | 531.78 |

80–84 | 4.05 | 359.75 | 3.97 | 562.96 | 3.97 | 551.40 | 3.89 | 525.99 | 4.38 | 563.58 | 4.24 | 553.89 |

85 and over | 3.70 | 333.78 | 3.97 | 515.48 | 3.95 | 504.75 | | 585.80 | 3.40 | 510.36 | 3.25 | 522.27 |

Total | 3.96 | 436.00 | 3.99 | 582.00 | 3.99 | 572.00 | 4.01 | 344.00 | 4.08 | 352.00 | 4.04 | 348.00 |

Average age | 72 | 76 | 76 | 33 | 34 | 33 |

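Age cohorts | Family responsibilities, male (%) | Family responsibilities, male (P. Average) | Family responsibilities, female (%) | Family responsibilities, female (P. Average) | Family responsibilities, total (%) | Family responsibilities, total (P. Average) | Total, male (%) | Total, male (P. Average) | Total, female (%) | Total, female (P. Average) | Total, total (%) | Total, total (P. Average) |
---|---|---|---|---|---|---|---|---|---|---|---|---|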

0–4 | | 0.00 | 0.00 | | 0.00 | 4.58 | 265.96 | 3.75 | 262.08 | 4.17 | 264.24 | |

5–9 | | 0.00 | | 0.00 | | 0.00 | 4.32 | 263.20 | 4.20 | 260.01 | 4.26 | 261.68 |

10–14 | | 0.00 | | 0.00 | | 0.00 | 4.26 | 263.90 | 4.36 | 268.06 | 4.31 | 265.94 |

15–19 | | 318.96 | 3.23 | 243.07 | 3.07 | 276.80 | 3.88 | 273.15 | 4.06 | 267.03 | 3.97 | 270.07 |

20–24 | 3.01 | 242.84 | 3.39 | 275.16 | 3.20 | 259.70 | 4.03 | 308.40 | 3.99 | 307.96 | 4.01 | 308.18 |

25–29 | 3.38 | 316.99 | 4.21 | 265.04 | 3.85 | 284.83 | 4.91 | 647.26 | 4.46 | 552.51 | 4.76 | 616.63 |

30–34 | | 237.98 | | 281.07 | 3.87 | 255.93 | 4.18 | 690.40 | 4.13 | 607.80 | 4.16 | 659.32 |

35–39 | 3.51 | 281.15 | | 253.39 | 4.98 | 261.32 | 4.08 | 710.16 | 4.08 | 611.98 | 4.08 | 669.29 |

40–44 | 4.19 | 302.81 | 4.35 | 208.14 | 4.27 | 252.69 | 4.06 | 717.12 | 4.13 | 623.99 | 4.09 | 675.10 |

45–49 | | 352.90 | 3.56 | 417.85 | 4.30 | 386.79 | 4.06 | 734.10 | 4.00 | 645.04 | 4.03 | 692.13 |

50–54 | 4.16 | 494.19 | 4.30 | 460.53 | 4.26 | 471.30 | 3.94 | 800.77 | 4.00 | 675.32 | 3.97 | 736.84 |

55–59 | 4.53 | 450.86 | 3.48 | 554.48 | 3.79 | 517.08 | 3.94 | 967.80 | 3.95 | 687.47 | 3.94 | 827.84 |

60–64 | 3.42 | 451.38 | 4.02 | 486.03 | 3.87 | 478.29 | 3.99 | 1209.63 | 4.03 | 753.09 | 4.01 | 1018.27 |

65–69 | | 490.43 | 4.15 | 530.45 | 4.34 | 521.53 | 4.03 | 1167.15 | 3.97 | 681.53 | 4.00 | 960.60 |

70–74 | 3.83 | 499.33 | 3.79 | 498.91 | 3.79 | 498.97 | 4.05 | 1007.24 | 4.01 | 609.42 | 4.03 | 822.92 |

75–79 | 4.69 | 470.51 | 4.40 | 491.46 | 4.43 | 488.97 | 4.00 | 940.06 | 3.99 | 584.00 | 3.99 | 757.29 |

80–84 | | 529.06 | 4.09 | 425.30 | 4.20 | 438.06 | 3.97 | 865.29 | 3.95 | 558.28 | 3.96 | 685.23 |

85 and over | 3.15 | 467.27 | 3.93 | 466.85 | 3.85 | 466.89 | 3.87 | 758.08 | 3.95 | 510.04 | 3.93 | 586.03 |

Total | 4.13 | 442.00 | 4.00 | 474.00 | 4.03 | 467.00 | 4.00 | 975.63 | 3.98 | 599.70 | 3.99 | 783.13 |

Average age | 61 | 71 | 69 | 70 | 73 | 72 |

Analysing these ratios, it can be concluded that in every year there are cases where the figure exceeds 5% or fails to reach 3% of the population, i.e. where it deviates from the percentage of the population (4%) represented by the CSWL, with some cohorts being considerably overrepresented in relative terms. In all the years considered the CSWL also contains age cohorts for some types of pension that present outliers, while in the population those cohorts have no pensions.

The main mismatches are found in permanent disability and widow(er)’s pensions, and to a lesser extent retirement pensions, in the case of men. The mismatches are greatest in 2005, 2006, 2008, 2009, 2010 and 2011, and less significant in 2007, 2012 and 2013, given that the number of cohorts more than one fourth away from 4% is smaller and where such differences do exist they are smaller. Hence for the years where the differences are greater, the statistical test is expected to provide results that support the existence of statistically significant differences not due exclusively to sample size.
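The screening rule used here, cohorts more than one fourth away from 4% (that is, below 3% or above 5%), can be sketched in a few lines. The function name `flag_cohorts` is hypothetical. The percentages are the 2010 male retirement values copied from the table above, restricted to the cohorts for which a percentage is reported.

```python
def flag_cohorts(ratios, target=4.0, rel_tol=0.25):
    """Return the cohorts whose sampling percentage lies outside
    target * (1 - rel_tol) .. target * (1 + rel_tol)."""
    low = target * (1.0 - rel_tol)
    high = target * (1.0 + rel_tol)
    return {cohort: r for cohort, r in ratios.items() if r < low or r > high}


# Male retirement pensions, 2010: CSWL count as a percentage of the INSS count.
retirement_male_2010 = {
    "45-49": 10.00,
    "50-54": 3.71,
    "55-59": 3.38,
    "60-64": 4.00,
    "65-69": 3.85,
    "70-74": 4.05,
    "75-79": 4.00,
    "80-84": 3.97,
    "85 and over": 3.89,
}
flagged = flag_cohorts(retirement_male_2010)
```

Only the 45–49 cohort is flagged in this column, a cohort that is very small in the population for this type of pension, which is where a fixed 4% sampling fraction is most unstable.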

Pearson’s chi-squared test \((\chi ^{2})\) is considered as a test of goodness of fit to check whether the sample follows the same distribution as the population of pensioners as of 31 December. Goodness of fit tests usually have a given hypothesis as to the theoretical distribution for the population, which they test using the data observed in the sample. In the case of the CSWL the distribution of the population of pensioners by age, gender, and type of pension is known from the statistical report of the INSS (INSS, 2006–2014).

For this test the expected frequency for each cohort needs to be calculated by gender and type of pension. That is, for a given gender and type of pension we calculate the relative frequency as the ratio of the number of pensions in each age cohort to the total pensions for the same gender and type of pension in the population.
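The expression for the statistic is not reproduced at this point in the text. As a hedged reconstruction from the description above, assuming the standard Pearson form and writing \(N_{i,j,k}\) and \(n_{i,j,k}\) for the number of pensions in age cohort *i* in the population and in the CSWL respectively (this notation, like \(E_{i,j,k}\) for the expected frequency, is assumed here), the statistic for gender *j* and pension type *k* would be:

\[
E_{i,j,k} = n_{j,k}\,\frac{N_{i,j,k}}{N_{j,k}}, \qquad
\chi^{2}_{j,k} = \sum_{i} \frac{\left(n_{i,j,k} - E_{i,j,k}\right)^{2}}{E_{i,j,k}}
\]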

In this notation, *i* is the index for the 18 cohorts into which the variable “age” has been divided; *j* is the index corresponding to “gender” (male, female); *k* is the index for the 5 types of pension (permanent disability, retirement, widow(er)’s, orphan’s and family responsibilities); \(N_{j,k}\) is the number of pension benefits in the population for pension type *k* and gender *j*; and \(n_{j,k}\) is the number of pension benefits in the CSWL for pension type *k* and gender *j*.

For large samples, as is the case with the CSWL, which covers more than 300,000 pensioners in every available year, it is very unlikely that the sample will be a perfect fit to the population, so the test statistic will show a rejection of the hypothesis that the sample and the population have the same distribution and conclude that the differences between the two distributions are statistically significant. Those differences could be magnified because of the large size of the sample. To overcome this possible error in the interpretation of the test results (pointed out by Berkson 1938; Wang 1993 and Lin et al. 2013 among others) it is important to ensure that the differences found are not due to the large size of the sample, so there is evidence not only of statistical differences but also of practical ones.

According to Wilkinson (1999), statistical significance refers to whether the effect observed is larger than would be expected by chance, i.e. whether the null hypothesis that there is no effect can be rejected. This is what is typically addressed by *p* values. Practical significance is about whether we should care, i.e. whether the effect is useful in an applied context. Two groups will almost never be exactly the same if thousands or millions of people are tested. That does not mean that every difference is of interest. Practical significance is usually assessed with effect size measures (e.g. Cohen’s d; Cohen 1988).

In statistics, an effect size is a measure of the strength of the relationship between two variables in a statistical population, or a sample-based estimate of that quantity. An effect size calculated from data is a descriptive statistic that expresses the estimated magnitude of a relationship without making any statement about whether the apparent relationship in the data reflects a true relationship in the population. Thus effect sizes complement inferential statistics such as *p* values. Sample-based effect sizes are distinguished from test statistics used in hypothesis testing in that they estimate the strength of an apparent relationship rather than assigning a significance level reflecting whether the relationship could be due to chance. The effect size does not determine the significance level or vice-versa. Given a sufficiently large sample size, a statistical comparison will always show a significant difference unless the population effect size is exactly zero.
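The dependence of the chi-squared statistic on sample size can be illustrated with a small sketch. All counts below are hypothetical and the function name `chi_square_gof` is an assumption. The same discrepancy in cohort shares (51%, 30%, 19% in the sample against 50%, 30%, 20% in the population) is evaluated at two sample sizes.

```python
def chi_square_gof(observed, population):
    """Pearson statistic comparing sample counts with population counts.

    Expected counts are the sample size times the population shares;
    every population cohort is assumed to be non-empty.
    """
    n = sum(observed)
    total = sum(population)
    stat = 0.0
    for obs, pop in zip(observed, population):
        expected = n * pop / total
        stat += (obs - expected) ** 2 / expected
    return stat


population = [50_000, 30_000, 20_000]    # hypothetical cohort sizes
small_sample = [510, 300, 190]           # n = 1,000
large_sample = [51_000, 30_000, 19_000]  # same shares, n = 100,000

stat_small = chi_square_gof(small_sample, population)
stat_large = chi_square_gof(large_sample, population)
CRITICAL_5PCT_2DF = 5.991  # chi-squared critical value, alpha = 0.05, 2 d.o.f.
```

The statistic grows in proportion to the sample size, so the smaller sample stays below the 5% critical value while the larger one exceeds it by a wide margin, even though the discrepancy in shares is identical. This is why the test result is read together with an effect size.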

*k* = the number of regrouped cohorts.

Once the hypothesis has been tested in the case of the fit of the distribution by age for each gender and type of pension, to determine whether the differences found are statistically significant, the size of the effect is estimated using Cramér’s V, the results of which enable the preliminary analysis of practical significance to be completed.

Table 2 Results of the chi-squared test for 2010.

*Source* Authors’ own calculations

Data | Permanent disability, male | Permanent disability, female | Retirement, male | Retirement, female | Widow(er)’s, male | Widow(er)’s, female | Orphan’s, male | Orphan’s, female | Family responsibilities, male | Family responsibilities, female |
---|---|---|---|---|---|---|---|---|---|---|

Sample size | 25,960 | 13,694 | 132,151 | 73,116 | 6303 | 85,497 | 5634 | 5302 | 328 | 1184 |

\(\chi ^{2}\) | 29,200 | 16,415 | 57.7 | 54.87 | 1179 | 84.1 | 16.6 | 12.8 | 12.7 | 8.8 |

Cohorts (k) | 13 | 14 | 8 | 7 | 12 | 13 | 18 | 18 | 14 | 15 |

*p* value | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.480 | 0.748 | 0.472 | 0.843 |

\(\chi _{{\upalpha } ,(\mathrm{k}-1)}^2 \) | 21.026 | 22.362 | 14.067 | 12.592 | 19.69 | 21.026 | 27.587 | 27.587 | 22.362 | 23.685 |

Reject | Yes | Yes | Yes | Yes | Yes | Yes | No | No | No | No |

Cramér’s V | 0.306 | 0.304 | 0.008 | 0.011 | 0.130 | 0.009 | 0.013 | 0.012 | 0.055 | 0.023 |

Effect size (TE) | Large | Large | Negligible | Negligible | Medium | Negligible | Negligible | Negligible | Small | Negligible |

The results of the test per type of pension and gender for 2010 are summarized in Table 2, whereas the results for the whole period can be found in the Online Appendix (Tables 1.9–1.18). The results are almost equivalent in most years: the hypothesis that the CSWL has the same distribution as the population is rejected for most pension benefits (permanent disability, retirement and widow(er)’s). However, in pension benefits for retirement the size of the effect is negligible, so the differences detected can be attributed to the large size of the sample. The causes of these differences can be attributed to the sample design (simple random sampling), to administrative errors and to a reclassification of pensioners older than 65 with permanent disability benefits, who are considered as disabled in the CSWL but as retirement pension beneficiaries in the official population statistics.
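The Cramér’s V values reported for 2010 are consistent with \(V = \sqrt{\chi^{2} / (n\,(k-1))}\), where *n* is the sample size and *k* the number of regrouped cohorts. That expression is inferred here from the tabulated figures rather than quoted from the text, so it should be read as an assumption. The sketch below recomputes V and the rejection decision for four columns of the 2010 results. The function name `cramers_v` is hypothetical.

```python
import math


def cramers_v(chi2, n, k):
    """Effect size for a goodness-of-fit test with k cohorts (assumed form)."""
    return math.sqrt(chi2 / (n * (k - 1)))


# (sample size, chi-squared, cohorts k, critical value) from the 2010 results.
results_2010 = {
    ("Permanent disability", "Male"): (25960, 29200.0, 13, 21.026),
    ("Retirement", "Male"): (132151, 57.7, 8, 14.067),
    ("Widow(er)'s", "Male"): (6303, 1179.0, 12, 19.69),
    ("Orphan's", "Male"): (5634, 16.6, 18, 27.587),
}

summary = {}
for key, (n, chi2, k, critical) in results_2010.items():
    v = cramers_v(chi2, n, k)
    reject = chi2 > critical  # reject when the statistic exceeds the critical value
    summary[key] = (round(v, 3), reject)
    print(f"{key[0]:22s} {key[1]:6s} V={v:.3f} reject={reject}")
```

Under this assumed formula the recomputed values agree with the tabulated ones to three decimals. Male retirement pensions show the pattern discussed in the text: the hypothesis is rejected, yet the effect size is negligible.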

In the versions of the CSWL for 2007, 2012, and 2013, the differences found in the distribution of pensioners receiving permanent disability benefits are due to the large size of the sample, given that the size of the effect is negligible. This does not happen in the case of pension benefits for widow(er)s. It has also been detected in those years that the code assigned to most pensioners over 65 on permanent disability benefits is changed to retirement benefits, when in the previous years they continued to be classed as receiving permanent disability benefits. This explains the better fit in 2007, 2012 and 2013. It is worth noting in particular what happens after 2007: the same coding errors appear in the pensions for permanent disability for pensioners older than 65.

Hence it is concluded that the fit is not good for some types of pension benefit: some cohorts are over- or underrepresented in the CSWL with respect to the actual population of pensioners, and contain a number of pensions clearly higher or lower than the figure expected given the proportion of the reference population represented by the CSWL (4%). The results suggest that for 2005, 2006, 2008, 2009, 2010 and 2011 the CSWL does not fit the distribution of the population well in terms of type of pension, gender and age for two types of pension benefit: permanent disability and widowhood. The mismatch is greater in the former, and the poor fit is not attributable solely to the large sample size. In the other years the null hypothesis that the CSWL has the same distribution as the population cannot be rejected once the size of the sample is taken into account, but even in those cases there is room for improvement, as will be shown in the next section.

These results must be taken into account when making forecasts on the sustainability of the Spanish public pension system using the CSWL, given that we find that for most years and some types of pension benefit it does not correctly represent the distribution by age of the contributory pensions in the system whose sustainability is to be analysed. It is concluded that the significance of the differences detected goes beyond a single year given that they are found for a considerable number of waves of the CSWL.

Wang (1993) advocates asking whether these differences are important or significant in practice too. The answer to this question differs from case to case and according to the experience of each research team, as there is no statistic for measuring significance in practice. Hence some researchers may consider an error of 1% to be important while for others it is negligible, depending on the context and goal of each study.

To estimate the significance in practice of the differences in the distribution of the number of pensions by cohorts, the total annual expenditure on pensions by cohorts could be estimated using the CSWL and compared with the estimate obtained using the population, which is known. In addition, if the estimate provided by a sample obtained using SR can be compared, it would reveal how much margin for improvement there is in these estimates and hence the significance of the differences found. We seek to answer these questions below. In particular, if one of the objectives is to use the data from the CSWL to make forecasts about the sustainability of the Spanish public pension system, it is advisable to have more representative subsamples based on using an SR, the characteristics of which are described in the following section.

## 3 Checking whether stratification matters

When deciding what sampling design is most appropriate for studying pension benefits and pension expenditure, it is important to know whether it is relevant to divide the population into levels and groups. To show the real importance of stratification for the case of the CSWL, in this section a sort of backtesting is carried out. This is a process widely used in finance, demographics and insurance among other fields. In our case we want to test our methodology on prior time periods. Instead of applying the methodology for the time period forward, in which case its effectiveness could take years to check, our procedure is applied to relevant past data in order to gauge its usefulness.

We have information on the distribution of pensioners by age, gender and type of pension, and we know the number of pensions and the mean pension expenditure in each group. In this section we therefore check whether stratification matters by looking at the total expenditure estimated using a stratified random sample. Given that we do not have the entire population, we obtain the distribution of pensions of a hypothetical stratified random sample extracted from the population and estimate the total annual pension expenditure by cohort at 31st December 2010. This is then compared with the CSWL to check whether the hypothetical sample obtained from the population by stratified random sampling (SR) improves the estimates and forecasts of pension expenditure for 2010. An improvement can be expected because we find that for most years and some types of pension benefit the CSWL does not correctly represent the distribution by age of the contributory pensions in the system.

It is clear that stratification is indeed relevant because, for example, the age of pensioners is a variable that has an important influence when it comes to forecasting expenditure on pensions. As stated above, these variables are also important in analysing the sustainability of the public pension system, where pensioners’ life expectancy plays a very important role.

One of the main goals of stratification is to give a better cross-section of the population so as to increase relative precision. There are several reasons to use stratified random sampling. Stratification ensures adequate representation of various groups of the population which may be of interest or importance. The use of stratification increases the accuracy with which a characteristic of a population can be estimated. It is achieved by dividing a heterogeneous population into sub-populations, each of which is homogeneous within itself. When there are extreme values in the population, stratification is more powerful because individual strata will be more homogeneous and separate estimates obtained from them can be combined into a precise estimate for the whole population by taking a relatively smaller sample (Singh and Chaudhary 1986).

When the size \(N_i\) of the *i*-th stratum is known, as is the case here because the number of pensioners in the population in each age cohort by gender and type of pension is known, a given sample of size *n* is allocated to the different strata in proportion to their sizes, i.e. in the *i*-th stratum:

\[ n_i = n\,\frac{N_i}{N} \]

The post-stratified estimator combines the sample means of the groups, weighting the mean of each group *i* by \(w_{i}\). It is calculated exactly like the stratified estimator \(\bar{Y}_{S}\) but is based on the results of a simple, not a stratified, random sample. The weights \(w_{i}\) are assumed to be known. When the size of the simple random sample, *n*, is large, the proportion of sampled elements that fall into a given group can be expected to be approximately equal to the proportion of elements of that group or stratum in the population, that is, \(n_i /n\approx N_i /N\). So when *n* is large, the post-stratified estimator based on a simple random sample can be expected to behave like a stratified estimator based on a proportional stratified sample.
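A minimal Python sketch of post-stratification follows. It assumes that the weights are the population shares \(w_i = N_i/N\) and uses two hypothetical strata with made-up values; the function names are illustrative only.

```python
def stratified_mean(weights, stratum_means):
    """Weighted mean: sum of w_i * ybar_i, with the weights w_i assumed known."""
    return sum(w * m for w, m in zip(weights, stratum_means))

def post_stratified_mean(sample, weights):
    """Post-stratify a simple random sample given as (stratum, value) pairs."""
    groups = {}
    for stratum, value in sample:
        groups.setdefault(stratum, []).append(value)
    labels = sorted(groups)
    means = [sum(groups[s]) / len(groups[s]) for s in labels]
    return stratified_mean([weights[s] for s in labels], means)

# Hypothetical strata "a" and "b" with population weights 0.25 and 0.75.
weights = {"a": 0.25, "b": 0.75}
# In this sample n_a / n = 0.5, so stratum "a" is over-represented.
sample = [("a", 10.0), ("a", 14.0), ("b", 20.0), ("b", 22.0)]

plain_mean = sum(v for _, v in sample) / len(sample)   # ignores the strata
ps_mean = post_stratified_mean(sample, weights)        # reweights to the known w_i
```

The plain mean is pulled towards the over-represented stratum, whereas the post-stratified mean restores the population weights. The sketch assumes every stratum appears in the sample; otherwise the weights used would no longer sum to one.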

These considerations on the data from the population are important in studying the average age and average pension of the population relative to the same data in the age cohorts or strata of the CSWL and for the subsequent post-stratification. However, taking into account that the average ages and the average pensions of the whole population are known for each type of pension, gender and age cohort, the interest is focused on whether \(n_i /n\approx N_i /N\), even though the CSWL size may be considered large. The first part of the paper questioned whether the proportions in the strata or cohorts of the CSWL are similar to the corresponding ones in the population. Table 3 presents the distribution by type of pension, gender and age that a sample should have using proportional stratified sampling if it is to be representative of the population of pensioners with a constant of proportional allocation of approximately \(q=3.992\%\). This constant of proportional allocation is the result of dividing the number of pension benefits at 31st December contained in the CSWL, \(n^{CSWL}=349,169\) (source: authors’ own calculations based on the CSWL for 2010), by the total number of pensions in the population taken from INSS (2011), \(N^{INSS}=8,747,470\). The results in Table 3 total 3 pensions more than in the CSWL because of rounding.
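The constant of proportional allocation can be checked with a short Python sketch. The two totals are those quoted in the text; the stratum sizes passed to the hypothetical helper `proportional_allocation` are invented for illustration and are not INSS figures.

```python
n_cswl = 349_169      # pensions in the 2010 CSWL at 31st December
n_inss = 8_747_470    # pensions in the population (INSS 2011)

q = n_cswl / n_inss   # constant of proportional allocation

def proportional_allocation(stratum_sizes, q):
    """Sample size per stratum, n_i = round(q * N_i)."""
    return [round(q * size) for size in stratum_sizes]

# Hypothetical population counts for three cohorts (not INSS figures).
hypothetical_strata = [1_000, 25_000, 250_000]
allocation = proportional_allocation(hypothetical_strata, q)

print(f"q = {100 * q:.3f}%")
print(allocation)
```

Because each stratum is rounded separately, the allocated sizes need not add up exactly to the overall sample size, which is the kind of rounding discrepancy mentioned for Table 3.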

To correctly estimate the dimension of the problems found in our analysis it is worth highlighting the differences in the relative importance of each type of pension: retirement, permanent disability and death. For example, in 2010 (see Table 3) retirement pensions account for 59.48% of the total number of beneficiaries. Gender really matters: for men the figure is 78.43% whereas for women it is only 41.50%. Widow(er)s account for 26.31% of the total number of pensions, but for women the figure is 47.72%, which is even higher than for retirement pensions. In the case of men the weight of this contingency is much lower at just 3.73% of the total number of beneficiaries.

Next we estimate total annual pension expenditure by cohort at 31st December 2010 for the population and for the CSWL (Table 4). To isolate the effect of the differences in the distribution of average pension amounts between the CSWL and the actual population (INSS), we estimate total expenditure for each cohort by taking the average pension published in the 2010 INSS statistical report. To obtain pension expenditure for the population (INSS), we multiply the number of pensions by the average pension for each cohort and by the coefficient (between 12 and 14) for adjusting the total amount of pensions at 31st December to recognized expenditure in financial year 2010 on each type of pension, as shown in the fourth column (No. INSS adjusted payments) in Table 4.

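To obtain pension expenditure for the CSWL, we multiply the number of pensions in each cohort of the CSWL by the average pension, by the inverse of the constant of proportional allocation (1/*q*) and by the same coefficient that adjusts the monthly pension to the pension recognized as expenditure in 2010. For the case of the SR we proceed in the same way as with the CSWL.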
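The scaling described above can be sketched in Python using the aggregate figures of the permanent disability row of Table 4, rather than the cohort-level data used in the paper. The function names and the sign convention of `pct_diff` (population minus sample, relative to the population) are assumptions inferred from the published percentages.

```python
def annual_expenditure(monthly_total, payments_per_year, scale=1.0):
    """Annual expenditure from the monthly pension bill at 31st December.

    scale is 1 for the population and 1/q for a sample drawn with constant q.
    """
    return monthly_total * payments_per_year * scale

def pct_diff(population_value, sample_value):
    """Difference as a percentage of the population figure (assumed convention)."""
    return 100.0 * (population_value - sample_value) / population_value

# Permanent disability row of Table 4 (millions of euros).
inss_estimate = annual_expenditure(799, 13.96)             # population
cswl_estimate = annual_expenditure(33.51, 13.96, 25.05)    # CSWL scaled by 1/q
reported_diff = pct_diff(11_156, 11_721)                   # published totals
```

The two estimates come out close to, but not exactly at, the published 11,156 and 11,721 because the table inputs are rounded; the percentage difference computed from the published totals rounds to the reported value.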

Table 3. Distribution of pensions at 31/12/2010 in a hypothetical sample extracted from the population using stratified random sampling.

*Source* Authors’ own calculations

Age cohorts | Permanent disability: Male | Permanent disability: Female | Permanent disability: Total | Retirement: Male | Retirement: Female | Retirement: Total | Widow(er)’s: Male | Widow(er)’s: Female | Widow(er)’s: Total
---|---|---|---|---|---|---|---|---|---
15–19 | 2 | – | 2 | – | – | – | – | – | –
20–24 | 64 | 14 | 78 | – | – | – | – | 4 | 4
25–29 | 260 | 88 | 348 | – | – | – | 3 | 31 | 34
30–34 | 657 | 269 | 926 | – | – | – | 15 | 127 | 142
35–39 | 1233 | 583 | 1816 | – | – | – | 49 | 341 | 390
40–44 | 2029 | 964 | 2993 | – | – | – | 111 | 804 | 915
45–49 | 2973 | 1472 | 4445 | 3 | 1 | 4 | 229 | 1507 | 1736
50–54 | 4111 | 2247 | 6358 | 33 | 4 | 37 | 384 | 2457 | 2841
55–59 | 5524 | 2841 | 8365 | 466 | 11 | 477 | 495 | 3578 | 4073
60–64 | 7426 | 3533 | 10,959 | 10,260 | 3849 | 14,109 | 560 | 5473 | 6033
65–69 | 70 | 32 | 102 | 33,134 | 17,436 | 50,570 | 576 | 7732 | 8308
70–74 | 4 | 16 | 20 | 28,575 | 14,782 | 43,357 | 647 | 10,565 | 11,212
75–79 | 10 | 101 | 111 | 27,946 | 14,334 | 42,280 | 925 | 15,887 | 16,812
80–84 | 17 | 330 | 347 | 19,425 | 11,708 | 31,133 | 1002 | 16,967 | 17,969
85 and over | 19 | 450 | 469 | 13,468 | 12,244 | 25,712 | 1351 | 20,038 | 21,389
Total | 24,399 | 12,940 | 37,339 | 133,310 | 74,369 | 207,679 | 6347 | 85,511 | 91,858
Total% | 6.99 | 3.71 | 10.69 | 38.18 | 21.30 | 59.48 | 1.82 | 24.49 | 26.31
Gender% | 14.35 | 7.22 | – | 78.43 | 41.50 | 0.00 | 3.73 | 47.72 | –

Age cohorts | Orphan’s: Male | Orphan’s: Female | Orphan’s: Total | Family responsibilities: Male | Family responsibilities: Female | Family responsibilities: Total | Total: Male | Total: Female | Total: Total
---|---|---|---|---|---|---|---|---|---
0–4 | 83 | 81 | 164 | – | – | – | 83 | 81 | 164
5–9 | 316 | 297 | 613 | – | 1 | 1 | 316 | 298 | 614
10–14 | 644 | 605 | 1247 | 2 | 1 | 3 | 644 | 606 | 1250
15–19 | 1203 | 1163 | 2366 | 6 | 6 | 12 | 1211 | 1169 | 2380
20–24 | 691 | 715 | 1406 | 15 | 14 | 29 | 770 | 747 | 1517
25–29 | 73 | 51 | 124 | 9 | 12 | 21 | 345 | 182 | 527
30–34 | 145 | 100 | 245 | 6 | 7 | 13 | 823 | 503 | 1326
35–39 | 251 | 166 | 417 | 5 | 7 | 12 | 1538 | 1097 | 2635
40–44 | 365 | 252 | 617 | 8 | 8 | 16 | 2513 | 2028 | 4541
45–49 | 452 | 315 | 767 | 16 | 27 | 43 | 3673 | 3322 | 6995
50–54 | 425 | 324 | 749 | 38 | 79 | 117 | 4991 | 5111 | 10,102
55–59 | 345 | 290 | 635 | 54 | 124 | 178 | 6884 | 6844 | 13,728
60–64 | 254 | 263 | 517 | 51 | 152 | 203 | 18,551 | 13,270 | 31,821
65–69 | 168 | 205 | 373 | 30 | 131 | 161 | 33,978 | 25,536 | 59,514
70–74 | 95 | 140 | 235 | 17 | 112 | 129 | 29,338 | 25,615 | 54,953
75–79 | 61 | 120 | 181 | 19 | 148 | 167 | 28,961 | 30,590 | 59,551
80–84 | 26 | 66 | 92 | 18 | 160 | 178 | 20,488 | 29,231 | 49,719
85 and over | 9 | 38 | 47 | 24 | 194 | 218 | 14,871 | 32,964 | 47,835
Total | 5606 | 5191 | 10,795 | 318 | 1183 | 1501 | 169,978 | 179,194 | 349,172
Total% | 1.61 | 1.49 | 3.09 | 0.09 | 0.34 | 0.43 | 48.68 | 51.32 | 100.00
Gender% | 3.30 | 2.90 | 0.00 | 0.19 | 0.66 | 3.30 | 100.00 | 100.00 | –

Table 4. Estimate of total expenditure on pensions from expenditure at 31st December.

*Source* (Instituto Nacional de la Seguridad Social, INSS 2011). In millions of euros. Figures have been rounded up or down as appropriate

Type of pension | INSS (month) 31-12 | Recognized expenditure | No. INSS adjusted payments | 1/q | CSWL (month) 31-12 | CSWL expenditure (year) | % diff.
---|---|---|---|---|---|---|---
Permanent disability | 799 | 11,156 | 13.96 | 25.05 | 33.51 | 11,721 | \(-\)5.06
Retirement | 4647 | 63,268 | 13.61 | 25.05 | 182.88 | 62,371 | 1.42
Widow(er)’s | 1321 | 18,142 | 13.73 | 25.05 | 52.55 | 18,072 | 0.38
Orphan’s | 95 | 1313 | 13.82 | 25.05 | 3.80 | 1317 | \(-\)0.27
Family responsibilities | 17 | 239 | 13.77 | 25.05 | 0.71 | 244 | \(-\)1.74

In Appendix 1, Tables 12–21 show the expenditure calculated per cohort using the CSWL and the hypothetical SR sample for each type of pension, together with Figs. 1–3, which are based on these tables. With them we seek to show the distortion in the estimation of pension expenditure for each type of contingency, gender and age cohort caused by using RS instead of SR. Graphically, the improvement in the estimate of total pension expenditure can be seen for all types of pension and for both men and women, as expected. Within the SR sample the fit is worse where the size of the sample is small, because elements are lost in the cohorts, and this gives rise to greater differences in the forecast of total expenditure for those cohorts.

Table 5. Improvement indicators in the estimate of total pension expenditure in % in the hypothetical SR sample with respect to the CSWL.

*Source* Authors’ own calculations

Improvement indicators | Permanent disability: Men | Permanent disability: Women | Retirement: Men | Retirement: Women | Widow(er)’s: Men | Widow(er)’s: Women | Orphan’s: Men | Orphan’s: Women | Family responsibilities: Men | Family responsibilities: Women
---|---|---|---|---|---|---|---|---|---|---
R_SRMQE | 99.95 | 99.84 | 99.92 | 99.83 | 99.41 | 99.53 | 96.11 | 93.75 | 85.58 | 97.44
R_SDAV | 99.87 | 99.57 | 99.88 | 99.77 | 99.35 | 99.53 | 95.62 | 94.31 | 82.80 | 96.28
MDC | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 0.15 | 0.03
CSWL_MDC | 6.17 | 5.18 | 0.97 | 1.35 | 1.60 | 0.11 | 0.33 | 0.20 | 1.40 | 1.50
CE \(\le \) 0.01% | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 66.67 | 82.35
CSWL_CE \(\le \) 0.01% | 80.00 | 60.00 | 55.56 | 55.56 | 57.14 | 66.67 | 27.78 | 38.89 | 61.11 | 47.06

The indicators used to measure the improvement are defined as follows:

R_SRMQE: reduction in the square root of the mean quadratic error.

R_SDAV: reduction in the sum of the differences in absolute value of estimated expenditure by cohort.

MDC: maximum difference in the estimate of expenditure by cohort as a percentage of total expenditure by gender and type of pension for the case of the SR sample.

CSWL_MDC: maximum difference in the estimate of expenditure by cohort as a percentage of total expenditure by gender and type of pension for the case of the original CSWL.

CE \(\le \) 0.01%: percentage of cohorts with an error in the estimate of expenditure of less than 0.01% of total expenditure, in absolute value, for the case of the SR sample.

CSWL_CE \(\le \) 0.01%: percentage of cohorts with an error in the estimate of expenditure of less than 0.01% of total expenditure, in absolute value, for the case of the original CSWL.
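The indicators can be made concrete with a minimal Python sketch. The definitions above are verbal, so the normalizations used here are assumptions: reductions are expressed as a percentage of the corresponding CSWL error measure, and the cohort check uses the threshold 0.01% of total expenditure. All figures are hypothetical.

```python
import math

def errors(estimated, actual):
    """Estimation error by cohort."""
    return [e - a for e, a in zip(estimated, actual)]

def rmse(errs):
    """Square root of the mean quadratic error."""
    return math.sqrt(sum(x * x for x in errs) / len(errs))

def improvement_indicators(actual, cswl_est, sample_est):
    """Indicators for one gender and pension type; inputs are expenditure by cohort."""
    total = sum(actual)
    e_cswl, e_new = errors(cswl_est, actual), errors(sample_est, actual)
    abs_cswl = [abs(x) for x in e_cswl]
    abs_new = [abs(x) for x in e_new]
    return {
        "R_SRMQE": 100 * (1 - rmse(e_new) / rmse(e_cswl)),
        "R_SDAV": 100 * (1 - sum(abs_new) / sum(abs_cswl)),
        "MDC": 100 * max(abs_new) / total,
        "CSWL_MDC": 100 * max(abs_cswl) / total,
        "CE": 100 * sum(100 * x / total <= 0.01 for x in abs_new) / len(abs_new),
        "CSWL_CE": 100 * sum(100 * x / total <= 0.01 for x in abs_cswl) / len(abs_cswl),
    }

# Hypothetical expenditure by cohort (millions of euros), not figures from the paper.
actual = [100.0, 300.0, 600.0]
cswl_est = [110.0, 290.0, 620.0]
sample_est = [100.05, 299.95, 601.0]
indicators = improvement_indicators(actual, cswl_est, sample_est)
```

In this toy case the alternative sample removes most of the error (R_SRMQE and R_SDAV above 95%), its largest cohort error is 0.1% of total expenditure against 2% for the CSWL, and two of its three cohorts fall within the 0.01% threshold.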

The SR sample obtained seems to be the best at representing the population of pensioners by age, gender and type of pension, with the larger one being better. The aim of this research is to extract a large subsample obtained from the CSWL, with a view to improving its representativeness, and to compare the reductions in the various measures of error of the estimate of total pension expenditure with those resulting from the hypothetical SR sample obtained from the population (which researchers cannot access).

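For an SR subsample to be drawn from the CSWL itself rather than from the population, the constant of proportional allocation *q* would have to be reduced.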

Table 6. Distribution of pensions at 31-12-2010 in a subsample selected from the CSWL using SR sampling.

*Source* Authors’ own calculations based on CSWL 2010

Age cohorts | Permanent disability: Male | Permanent disability: Female | Permanent disability: Total | Retirement: Male | Retirement: Female | Retirement: Total | Widow(er)’s: Male | Widow(er)’s: Female | Widow(er)’s: Total
---|---|---|---|---|---|---|---|---|---
20–24 | 16 | 4 | 20 | – | – | – | – | 1 | 1
25–29 | 65 | 22 | 87 | – | – | – | 1 | 8 | 9
30–34 | 165 | 67 | 232 | – | – | – | 4 | 32 | 36
35–39 | 309 | 146 | 455 | – | – | – | 12 | 85 | 97
40–44 | 508 | 242 | 750 | – | – | – | 28 | 201 | 229
45–49 | 745 | 369 | 1114 | 1 | – | 1 | 57 | 377 | 434
50–54 | 1030 | 563 | 1593 | 8 | 1 | 9 | 96 | 616 | 712
55–59 | 1384 | 712 | 2096 | 117 | 3 | 120 | 124 | 896 | 1020
60–64 | 1860 | 885 | 2745 | 2570 | 964 | 3534 | 140 | 1371 | 1511
65–69 | 18 | 8 | 26 | 8301 | 4368 | 12,669 | 144 | 1937 | 2081
70–74 | 1 | 4 | 5 | 7159 | 3703 | 10,862 | 162 | 2647 | 2809
75–79 | 3 | 25 | 28 | 7001 | 3591 | 10,592 | 232 | 3980 | 4212
80–84 | 4 | 83 | 87 | 4866 | 2933 | 7799 | 251 | 4251 | 4502
85 and over | 5 | 113 | 118 | 3374 | 3067 | 6441 | 339 | 5020 | 5359
Total | 6113 | 3243 | 9356 | 33,397 | 18,630 | 52,027 | 1590 | 21,422 | 23,012

Age cohorts | Orphan’s: Male | Orphan’s: Female | Orphan’s: Total | Family responsibilities: Male | Family responsibilities: Female | Family responsibilities: Total | Total: Male | Total: Female | Total: Total
---|---|---|---|---|---|---|---|---|---
0–4 | 21 | 20 | 41 | – | – | – | 21 | 20 | 41
5–9 | 79 | 74 | 153 | – | – | – | 79 | 74 | 153
10–14 | 161 | 152 | 313 | – | – | – | 161 | 152 | 313
15–19 | 301 | 291 | 592 | 1 | 2 | 3 | 302 | 293 | 595
20–24 | 173 | 179 | 352 | 4 | 4 | 8 | 193 | 188 | 381
25–29 | 18 | 13 | 31 | 2 | 3 | 5 | 86 | 46 | 132
30–34 | 36 | 25 | 61 | 1 | 2 | 3 | 206 | 126 | 332
35–39 | 63 | 42 | 105 | 1 | 2 | 3 | 385 | 275 | 660
40–44 | 91 | 63 | 154 | 2 | 2 | 4 | 629 | 508 | 1137
45–49 | 113 | 79 | 192 | 4 | 7 | 11 | 920 | 832 | 1752
50–54 | 106 | 81 | 187 | 10 | 20 | 30 | 1250 | 1281 | 2531
55–59 | 87 | 73 | 160 | 13 | 31 | 44 | 1725 | 1715 | 3440
60–64 | 64 | 66 | 130 | 13 | 38 | 51 | 4647 | 3324 | 7971
65–69 | 42 | 51 | 93 | 8 | 33 | 41 | 8513 | 6397 | 14,910
70–74 | 24 | 35 | 59 | 4 | 28 | 32 | 7350 | 6417 | 13,767
75–79 | 15 | 30 | 45 | 5 | 37 | 42 | 7256 | 7663 | 14,919
80–84 | 6 | 16 | 22 | 4 | 40 | 44 | 5131 | 7323 | 12,454
85 and over | 2 | 9 | 11 | 6 | 49 | 55 | 3726 | 8258 | 11,984
Total | 1402 | 1299 | 2701 | 78 | 298 | 376 | 42,580 | 44,892 | 87,472

Table 7. Improvement indicators in the estimate of total pension expenditure in % of the SR subsample (SRSS) with respect to the CSWL.

*Source* Authors’ own calculations

Improvement indicators | Permanent disability: M | Permanent disability: F | Retirement: M | Retirement: F | Widow(er)’s: M | Widow(er)’s: F | Orphan’s: M | Orphan’s: F | Family responsibilities: M | Family responsibilities: F
---|---|---|---|---|---|---|---|---|---|---
R_SRMQE | 99.78 | 99.52 | 99.59 | 99.57 | 97.85 | 97.77 | 80.75 | 76.98 | 41.70 | 88.12
R_SDAV | 99.36 | 98.90 | 99.36 | 99.43 | 97.80 | 97.83 | 77.59 | 79.28 | 36.37 | 83.78
MDC | 0.01 | 0.01 | 0.00 | 0.00 | 0.03 | 0.00 | 0.05 | 0.05 | 0.65 | 0.13
CSWL_MDC | 6.17 | 5.18 | 0.97 | 1.35 | 1.60 | 0.11 | 0.33 | 0.20 | 1.40 | 1.50
CE \(\le \) 0.01% | 100.00 | 93.33 | 100.00 | 100.00 | 64.29 | 100.00 | 44.44 | 44.44 | 44.44 | 52.94
CSWL_CE \(\le \) 0.01% | 80.00 | 60.00 | 55.56 | 55.56 | 57.14 | 66.67 | 27.78 | 38.89 | 61.11 | 47.06

In short, backtesting is carried out in this section to show the real importance of stratification for the case of the CSWL. We show, looking at the estimate of total expenditure, that with SR sampling a better fit can be obtained by age cohort, gender and type of pension. In order to be able to obtain an SR sample from the CSWL available and not from the population, the subsample reduces the size to 25% of the original. In the following section we apply a procedure to select large SR subsamples, thus improving the fit of the CSWL to the population of pensioners, bearing in mind that this improvement will not be as great as that obtained using the hypothetical but unfeasible SR sample from the population, but it may be at least as good as the improvement offered by the SR subsample of 87,472 pensions extracted from the CSWL using stratification. The methodology developed has the property of allowing the user to choose the relationship between the desired goodness of fit to the population and the size of the subsample.

## 4 Selection of a large subsample distribution from the CSWL: results

In this section we explain the criteria for the design of a large subsample to be selected from the 2010 CSWL data using stratification with proportional allocation, in order to improve its representativeness with respect to the number of pension benefits in the population. We explain the procedure developed for this purpose and present the distribution of pensions in a subsample selected from the 2010 CSWL with this procedure, which improves the fit with a high *p* value without losing as many records as the SR subsample contained in the CSWL. In addition, to illustrate the practical significance of our findings, we apply the subsample to the estimation of total expenditure by age, gender and type of pension and compare the results, relative to the CSWL, with those obtained using the hypothetical SR sample and the SR subsample.

The aim is to find a subsample of the CSWL that, in the more general case, fits the distribution of the pensioner population by age, gender and type of pension. It must satisfy the following requirements:

- (a) It must be more representative of the population under study. The procedure should therefore include a goodness of fit test on the distribution of the number of pensioners by age, gender and type of pension that takes into account the associated *p* values.
- (b) The total number of pensioners needs to be relatively high so as to be bigger than the number that would result from a stratified sample from the CSWL, approximately 1% of the population of pensioners. Hence the requirement is to maximize subsample size, with 1% of the pensioner population being the lower limit.
- (c) The subsample obtained must be included in the population as well as in the CSWL. These two requirements might seem obvious, but constraints have to be introduced to avoid the outliers found in the CSWL but not in the population, i.e. those cohorts for which the number of pensions is greater than zero in the CSWL but zero in the population of pensioners. It is also important to have a number of pensioners in each cohort of the subsample that is lower than or equal to that of the corresponding cohort of the population.

Maximizing *q* is equivalent to maximizing the size of the subsample. Without taking into account the integer constraints we have:

*q* is the maximization of \(\hat{q}\), the adjusted constant of proportionality, so this is what is finally considered:

Table 8. Summary of results.

*Source* Authors’ own calculations

\(p\,\mathrm{value}_{\min}\) | \(n^{SUB}\) | \(\hat{q}\%\) | \(\left( {n^{SUB}/n^{CSWL}} \right) \% \)
---|---|---|---
0.8000 | 324,986 | 3.715 | 93.074
0.9000 | 320,825 | 3.668 | 91.882
0.9500 | 317,773 | 3.633 | 91.008
0.9700 | 315,887 | 3.611 | 90.468
0.9800 | 314,555 | 3.596 | 90.087
0.9900 | 312,172 | 3.569 | 89.404
0.9990 | 302,907 | 3.463 | 86.751
0.9995 | 297,865 | 3.405 | 85.307
0.9999 | 247,457 | 2.829 | 70.870

Table 9. Distribution of pensions at 31-12-2010 in a subsample selected from the CSWL with a *p* value of \(\ge 0.999\)

Age cohorts | Permanent disability: Male | Permanent disability: Female | Permanent disability: Total | Retirement: Male | Retirement: Female | Retirement: Total | Widow(er)’s: Male | Widow(er)’s: Female | Widow(er)’s: Total
---|---|---|---|---|---|---|---|---|---
15–19 | 1 | – | 1 | – | – | – | – | – | –
20–24 | 55 | 12 | 67 | – | – | – | – | 3 | 3
25–29 | 225 | 75 | 300 | – | – | – | 2 | 26 | 28
30–34 | 570 | 233 | 803 | – | – | – | 12 | 109 | 121
35–39 | 1069 | 505 | 1574 | – | – | – | 42 | 295 | 337
40–44 | 1760 | 836 | 2596 | – | – | – | 96 | 697 | 793
45–49 | 2580 | 1277 | 3857 | 2 | – | 2 | 198 | 1307 | 1505
50–54 | 3567 | 1950 | 5517 | 28 | 3 | 31 | 332 | 2132 | 2464
55–59 | 4793 | 2465 | 7258 | 395 | 9 | 404 | 429 | 3104 | 3533
60–64 | 6443 | 3065 | 9508 | 8903 | 3339 | 12,242 | 485 | 4749 | 5234
65–69 | 60 | 27 | 87 | 28,752 | 15,130 | 43,882 | 499 | 6709 | 7208
70–74 | 3 | 14 | 17 | 24,796 | 12,827 | 37,623 | 561 | 9167 | 9728
75–79 | 8 | 87 | 95 | 24,250 | 12,438 | 36,688 | 802 | 13,786 | 14,588
80–84 | 15 | 286 | 301 | 16,856 | 10,160 | 27,016 | 869 | 14,723 | 15,592
85 and over | 16 | 390 | 406 | 11,687 | 10,624 | 22,311 | 1172 | 17,388 | 18,560
Total | 21,165 | 11,222 | 32,387 | 115,669 | 64,530 | 180,199 | 5499 | 74,195 | 79,694

Age cohorts | Orphan’s: Male | Orphan’s: Female | Orphan’s: Total | Family responsibilities: Male | Family responsibilities: Female | Family responsibilities: Total | Total: Male | Total: Female | Total: Total
---|---|---|---|---|---|---|---|---|---
0–4 | 71 | 70 | 141 | – | – | – | 71 | 70 | 141
5–9 | 274 | 257 | 531 | – | – | – | 274 | 257 | 531
10–14 | 557 | 525 | 1082 | – | – | – | 557 | 525 | 1082
15–19 | 1043 | 1009 | 2052 | 4 | 5 | 9 | 1048 | 1014 | 2062
20–24 | 599 | 620 | 1219 | 11 | 12 | 23 | 665 | 647 | 1312
25–29 | 63 | 44 | 107 | 8 | 10 | 18 | 298 | 155 | 453
30–34 | 126 | 86 | 212 | 4 | 5 | 9 | 712 | 433 | 1145
35–39 | 217 | 144 | 361 | 3 | 5 | 8 | 1331 | 949 | 2280
40–44 | 316 | 218 | 534 | 6 | 7 | 13 | 2178 | 1758 | 3936
45–49 | 392 | 273 | 665 | 13 | 23 | 36 | 3185 | 2880 | 6065
50–54 | 368 | 281 | 649 | 33 | 68 | 101 | 4328 | 4434 | 8762
55–59 | 299 | 251 | 550 | 46 | 107 | 153 | 5962 | 5936 | 11,898
60–64 | 220 | 228 | 448 | 44 | 131 | 175 | 16,095 | 11,512 | 27,607
65–69 | 145 | 177 | 322 | 26 | 113 | 139 | 29,482 | 22,156 | 51,638
70–74 | 82 | 121 | 203 | 14 | 96 | 110 | 25,456 | 22,225 | 47,681
75–79 | 53 | 104 | 157 | 16 | 128 | 144 | 25,129 | 26,543 | 51,672
80–84 | 22 | 56 | 78 | 15 | 138 | 153 | 17,777 | 25,363 | 43,140
85 and over | 6 | 32 | 38 | 19 | 168 | 187 | 12,900 | 28,602 | 41,502
Total | 4853 | 4496 | 9349 | 262 | 1016 | 1278 | 147,448 | 155,459 | 302,907

Table 8 shows the results of applying the above procedure to the CSWL. A large subsample design is generated, improving the fit to the distribution of the population of pensioners, with high *p* values of the test. There are feasible solutions with sizes ranging from 93.074% of the CSWL, associated with a minimum *p* value of 0.8, to 70.87% of the CSWL with a *p* value of 0.9999.
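The trade-off between the minimum *p* value and the size of the subsample can be illustrated with a simplified Python sketch. It is not the authors' optimization procedure: it assumes a single group of cohorts, caps a proportional target by the CSWL count in each cohort, and scans the adjusted constant \(\hat{q}\) downwards on a grid until Pearson's goodness of fit test reaches the required *p* value. All function names and counts are hypothetical.

```python
import math

def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variable with integer df >= 1."""
    y = x / 2.0
    if df % 2 == 0:
        a, tail = 1.0, math.exp(-y)
    else:
        a, tail = 0.5, math.erfc(math.sqrt(y))
    # Recurrence Q(a + 1, y) = Q(a, y) + y**a * exp(-y) / Gamma(a + 1).
    while a < df / 2.0 - 1e-9:
        if y > 0:
            tail += math.exp(a * math.log(y) - y - math.lgamma(a + 1.0))
        a += 1.0
    return tail

def gof_p_value(counts, population):
    """Pearson goodness-of-fit p value of cohort counts against the population."""
    n, total = sum(counts), sum(population)
    cells = [(c, n * size / total) for c, size in zip(counts, population) if size > 0]
    stat = sum((c - e) ** 2 / e for c, e in cells)
    return chi2_sf(stat, len(cells) - 1)

def subsample_counts(population, cswl, q_hat):
    """Proportional target per cohort, capped by what the CSWL contains."""
    return [min(c, round(q_hat * size)) for size, c in zip(population, cswl)]

def largest_subsample(population, cswl, p_min, steps=1000):
    """Scan q_hat downwards and return the first design reaching p_min."""
    q_max = max(c / size for size, c in zip(population, cswl) if size > 0)
    for k in range(steps, 0, -1):
        q_hat = q_max * k / steps
        counts = subsample_counts(population, cswl, q_hat)
        if sum(counts) > 0 and gof_p_value(counts, population) >= p_min:
            return q_hat, counts
    return None

# Hypothetical cohorts: the sample over-represents the first cohort (5% vs about 4%).
population = [1000, 2000, 3000, 4000]
cswl = [50, 70, 120, 160]
q_hat, counts = largest_subsample(population, cswl, p_min=0.9)
```

A cohort with no pensions in the population receives a target of zero, which excludes the CSWL-only outliers, and no cohort ever exceeds its CSWL count. Raising `p_min` can only lower the selected \(\hat{q}\) in this scan, which mirrors the pattern of shrinking subsample sizes in Table 8.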

It has been verified that the subsample obtained is included in the population and in the CSWL. Therefore, given the values of the goodness of fit test and the fact that the subsample is far larger than the subsample of 1% of the population obtained using stratified sampling, it meets the objective of finding bigger subsamples that are more representative than the CSWL.

As an example, Table 9 shows the theoretical distribution of the subsample that provides a goodness of fit test with a *p* value of \(\ge 0.999\).

*p* value greater than or equal to 0.999 for each type of pension, gender, and age cohort. A major reduction in errors can be seen in the estimation of pension expenditure with a very small reduction in the size of the original CSWL.

Table 10. Comparison of results of the goodness of fit test: *p* value.

*Source* Authors’ own calculations

Type of pension | CSWL | SS | SR subsample | SR sample
---|---|---|---|---
Permanent disability male | 0 | 1 | 1 | 1
Permanent disability female | 0 | 1 | 1 | 1
Retirement male | 0 | 0.9999191 | 1 | 1
Retirement female | 0 | 0.9995441 | 1 | 1
Widower’s | 0 | 1 | 1 | 1
Widow’s | 0 | 1 | 1 | 1
Orphan’s male | 0.4624223 | 1 | 1 | 1
Orphan’s female | 0.5618020 | 1 | 1 | 1
Family responsibilities male | 0.4084868 | 0.9990003 | 0.9999445 | 1
Family responsibilities female | 0.8416536 | 0.9999963 | 1 | 1
Total pensions | 0 | 1 | 1 | 1

Table 11. Improvement indicators in the estimate of total pension expenditure in % with respect to the CSWL: subsample (SS) with *p* value \(\ge \) 0.999, SR subsample (SRSS) and SR sample (SRS).

*Source* Authors’ own calculations

Indicators | Case | Permanent disability: M | Permanent disability: F | Retirement: M | Retirement: F | Widow(er)’s: M | Widow(er)’s: F | Orphan’s: M | Orphan’s: F | Family responsibilities: M | Family responsibilities: F
---|---|---|---|---|---|---|---|---|---|---|---
R_SRMQE | SS | 99.82 | 99.69 | 98.17 | 99.07 | 98.54 | 96.14 | 87.27 | 88.20 | 58.34 | 90.89
R_SRMQE | SRSS | 99.78 | 99.52 | 99.59 | 99.57 | 97.85 | 97.77 | 80.75 | 76.98 | 41.70 | 88.12
R_SRMQE | SRS | 99.95 | 99.84 | 99.92 | 99.83 | 99.41 | 99.53 | 96.11 | 93.75 | 85.58 | 97.44
R_SDAV | SS | 99.50 | 99.24 | 97.41 | 98.46 | 98.54 | 96.79 | 88.98 | 89.58 | 53.93 | 86.77
R_SDAV | SRSS | 99.36 | 98.90 | 99.36 | 99.43 | 97.80 | 97.83 | 77.59 | 79.28 | 36.37 | 83.78
R_SDAV | SRS | 99.87 | 99.57 | 99.88 | 99.77 | 99.35 | 99.53 | 95.62 | 94.31 | 82.80 | 96.28
MDC | SS | 0.01 | 0.01 | 0.01 | 0.01 | 0.02 | 0.01 | 0.06 | 0.03 | 0.63 | 0.10
MDC | SRSS | 0.01 | 0.01 | 0.00 | 0.00 | 0.03 | 0.00 | 0.05 | 0.05 | 0.65 | 0.13
MDC | SRS | 0.00 | 0.00 | 0.00 | 0.00 | 0.01 | 0.00 | 0.01 | 0.01 | 0.15 | 0.03
CE \(\le \) 0.01% | SS | 100.0 | 100.0 | 88.89 | 100.0 | 57.14 | 100.0 | 61.11 | 55.56 | 5.56 | 5.88
CE \(\le \) 0.01% | SRSS | 100.0 | 93.33 | 100.0 | 100.0 | 64.29 | 100.0 | 44.44 | 44.44 | 44.44 | 52.94
CE \(\le \) 0.01% | SRS | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 66.67 | 82.35

Table 10 shows the *p* values of the goodness of fit test for the 10 cases considered (5 types of pension by 2 genders) for the subsample obtained using this procedure (SS), compared with those for the CSWL, the subsample design selected from the CSWL using stratification (SR subsample, Table 6) and the hypothetical sample extracted from the population using stratified random sampling (SR sample, Table 3). As expected, the values for the subsample obtained by the procedure designed are lower than those obtained with the SR sample and subsample, but the differences with the latter are almost non-existent. It is therefore possible to find subsamples contained in the CSWL and in the population that have a better fit and many more observations than would be provided by a stratified random sample taken from the CSWL. Overall, the distribution of total pensions fits the population with a *p* value of 1 in Pearson’s goodness of fit test, as in the SR sample and subsample. More important for the improvement in forecasts of pension expenditure are the differences found by type of pension and gender.

Table 11 shows the improvement indicators in the estimate of total pension expenditure, with respect to the CSWL, for the subsample obtained with a *p* value of at least 0.999 as well as for the hypothetical SR sample and the SR subsample.

It can be seen that the subsample (SS) obtained by the procedure greatly reduces the errors in the estimation of total expenditure in each cohort of the CSWL considered and that in many cases it improves on the reductions given by the SR subsample (SRSS) contained in the CSWL, which was 1% of the population. Obviously it does not reach the results obtained with the best possible sample of similar size to the CSWL, which would be obtained using an SR sampling from the population.

The estimation errors in pension expenditure by contingency, gender and age cohort are much smaller than those obtained with the original CSWL both for the SR subsample, with a size of 87,472 benefits, and for the subsample obtained with a *p* value greater than 0.999, which numbers 302,907 benefits. The errors of the SR subsample are somewhat lower than those of the subsample obtained by the procedure proposed. However, the latter is about 3.46 times bigger, which is a great advantage for forecasting given the diversity in working lives needed for any subsequent analysis of the sustainability of public pension systems.

## 5 Summary, conclusions and future research

The CSWL is a set of anonymized microdata taken from Spanish Social Security records. The availability of this data has marked a turning point for studies on the Spanish public pension system, since it has given researchers access to very valuable information about individuals that enables them to examine in depth numerous aspects of the pension system that had previously been ignored. However, the CSWL is obtained using a simple random sample (RS), so its fit to the population in terms of age, gender and type of pension is worse than would be obtained using stratified random sampling (SR) with proportional allocation. In this paper we examine how representative the CSWL data is of pension benefits. The results show that it does not fit the distribution of the population of pensioners by age and gender in two types of benefit: permanent disability and, in the case of men, widower’s benefits. There is also a mismatch, to a lesser extent, in retirement pensions. These mismatches are bigger for 2005, 2006, 2008, 2009, 2010 and 2011, and smaller for 2007, 2012 and 2013 due to the correction of a coding error in permanent disability benefits for persons over 65. It is hard to understand why the error reappears after the correction in 2007.

We check the effects of this poor fit of the CSWL to the pensioner population in terms of pension benefits by comparing the annual total pension expenditure by cohort estimated from the CSWL with the figure that would have resulted from a hypothetical SR sample. Given that researchers cannot access full data on the population, they are unable to obtain an SR sample and must resort to an SR subsample smaller than the CSWL and contained within it. The problem is that the reduction in size is considerable, so richness in pension types is lost.
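The size loss mentioned above can be seen in a minimal sketch of proportional allocation inside an existing sample. Assuming the population count of each stratum is known, the largest subsample that keeps the population proportions is bounded by the stratum that is scarcest in the sample relative to the population. The function name `largest_proportional_subsample` and the toy counts are hypothetical, and the sketch ignores the random selection of individual records within each stratum.

```python
import math
from fractions import Fraction


def largest_proportional_subsample(population, sample):
    """Return per-stratum sizes of the largest subsample of `sample`
    whose strata are proportional to `population`."""
    # The common sampling fraction is limited by the scarcest stratum.
    fraction = min(Fraction(sample[s], population[s]) for s in population)
    # Rounding down guarantees that no stratum needs more records than the sample holds.
    return {s: math.floor(fraction * population[s]) for s in population}


# Hypothetical strata: population counts and counts in a simple random sample.
population = {"retirement": 4000, "disability": 2000, "widowhood": 2000}
sample = {"retirement": 60, "disability": 25, "widowhood": 15}

allocation = largest_proportional_subsample(population, sample)
kept = sum(allocation.values())
print(allocation)
print(f"kept {kept} of {sum(sample.values())} records")
```

In this toy case the under-represented stratum forces 40% of the records to be dropped, which is the kind of loss that motivates looking for larger subsamples with a slightly weaker fit.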

For the reasons indicated, it is advisable to use other procedures to select large subsamples, so that the representativeness of the original CSWL is improved with a smaller reduction in the number of pensions. With the methodology developed, large subsamples can be obtained that pass the \({\chi }^{2}\) test with *p* values near 1. It can therefore be concluded not only that it is feasible to find a large dataset selected from the CSWL that better represents the population of pensioners, but also that it is possible to draw up a procedure for extracting subsamples from the CSWL with a good fit to the population of interest by type of pension, gender and age.

Apart from the problems described, the CSWL is a powerful dataset whose size enables representative large subsamples to be extracted which emulate the whole population of pensioners. This is taken into account in formulating and solving optimization problems to obtain feasible solutions (contained in the population and in the CSWL) that more or less meet the goodness of fit requirements in terms of a *p* value chosen by the researcher, with sizes ranging from 70 to 93% of the original.

According to our findings, it can be said that studies conducted using subsamples of data on pension benefits from the CSWL obtained using SR sampling with proportional allocation taking into account the known distribution of the population should not give rise to doubt as to how well the CSWL represents the population. However, those which use data on pension benefits selected from the CSWL without checking that they are representative may produce conclusions based on age cohorts which are overrepresented or underrepresented in the subsamples with respect to their real weight in the population.

Last but not least, we would like to emphasize that this research meets the conditions for reproducibility. Boyland (2016) establishes the guidelines for a paper to be recognized as “reproducible”: the most important are that the data used in the analysis should be accessible to other researchers and that the algorithms or methods of analysis should be specified in the manuscript in sufficient detail to allow the results to be reproduced. This is our case: any research team can replicate the procedure for selecting more representative large subsamples from the CSWL.

A further line of research directly related to the results of this paper would be to consider, in addition to the number of pensions, another variable in the design and selection procedure for large subsamples, such as the amount of the pensions selected. The objective would be to find a dataset that fits the population as well as possible, taking into account not only the number of pensions but also the average pension amount per cohort of the population, the data for which is also known.

## Footnotes

- 1.
- 2.
- 3.
Researchers can request versions of the CSWL by post from the Dirección General de Ordenación de la Seguridad Social at the Spanish Ministry of Employment and Social Security (Ministerio de Empleo y Seguridad Social). A separate request must be made for each version. Requests consist of a user profile describing the project being carried out and a document accepting the CSWL’s conditions of use. These are available from MESS (2016) at the following address: http://www.seg-social.es/Internet_1/Estadistica/Est/Muestra_Continua_de_Vidas_Laborales/SolicitarM/index.htm.

- 4.
The definition of the population from which the CSWL was taken can be checked in *Ministerio de Trabajo y Asuntos Sociales* (MTAS) (2006), pp. 25–29.

- 5.
Argimón and González (2006), Durán and Sevilla (2006), Toharia Cortés et al. (2007), Durán (2007), García Segovia and Durán (2008), Patxot et al. (2009), Izquierdo et al. (2009), Lapuerta (2010), Arranz and García-Serrano (2011), López Roldán (2011), Alonso Domínguez (2012), Muñoz de Bustillo et al. (2011), Arranz et al. (2013), and Pérez-Salamero González et al. (2016).

- 6.
Stratification is by gender, nationality, insurance branch of the current account holder and age cohort. http://www.jpi-dataproject.eu/Home/Database/153?topicId=2.

- 7.
- 8.
We chose 2010 because it was the most up-to-date sample when we began our work in early 2012. We had a long learning curve. It took us nearly two years to program the necessary tests to make an assessment of its representativeness. Then we started to work with the other samples (Sect. 2) and also found a problem of representativeness. Moreover, after applying the procedure to 2010 we did the same with other years and obtained results that are not very different from those for 2010.

- 9.
We do not use the first version from 2004 because it does not give the month of birth, so it is not possible to determine actuarial age as in the subsequent versions.

- 10.
This refers to a special type of survivors’ benefits for family members. This benefit is not compatible with the beneficiary receiving other public pensions.

## References

- Agliari E, Barra A, Contucci P, Sandell R, Vernia C (2014) A stochastic approach for quantifying immigrant integration: the Spanish test case. New J Phys 16:103034. http://stacks.iop.org/1367-2630/16/i=10/a=103034
- Antón Pérez J, Braña Pino J, Muñoz de Bustillo Llorente R (2007) Edad efectiva de jubilación en España: un análisis a partir de la explotación de la Muestra Continua de Vidas Laborales de la Seguridad Social. Jornadas de Usuarios de la Muestra Continua de Vidas Laborales. Madrid, 4 & 5 October 2007. Ministerio de Trabajo y Asuntos Sociales and FEDEA
- Argimón I, González CI (2006) La Muestra Continua de Vidas Laborales de la Seguridad Social. Bol Econ Banco de Esp (May) 40–53. http://www.bde.es/f/webbde/SES/Secciones/Publicaciones/InformesBoletinesRevistas/BoletinEconomico/06/May/Fich/art3.pdf
- Argimón I, González C, Vegas R (2007) Jubilación entre los 60 y los 65 años. Algunas características. Presup Gasto Público 47:161–184
- Arranz JM, García-Serrano C, Hernanz V (2013) How do we pursue “labormetrics”? An application using the CSWL. Estad Esp 181:231–254. http://www.ine.es/ss/Satellite?L=0&c=INERevEstad_C&p=1254735226759&pagename=ProductosYServicios%2FPYSLayout&_charset_=UTF-8&cid=1259943175448&submit=Ir
- Arranz JM, García-Serrano C (2011) Are the CSWL tax data useful? Ideas for mining. Hacienda Pública Esp/Rev Econ Política 199(4):151–186
- Arranz JM, García-Serrano C (2014) The interplay of the unemployment compensation system, fixed-term contracts and rehirings: the case of Spain. Int J Manpow 35(8):1236–1259
- Barra A, Contucci P, Sandell R, Vernia C (2014) An analysis of a large dataset on immigrant integration in Spain. The statistical mechanics perspective on social action. Sci Rep, 4:4174. www.nature.com/scientificreports
- Bazaraa MS, Sherali HD, Shetty CM (2006) Nonlinear programming: theory and algorithms, 3rd edn. Wiley-Interscience, Hoboken
- Benavides F, Durán X, Martínez J, Jódar P, Boix P, Amable M (2010) Incidencia de incapacidad permanente en una cohorte de trabajadores afiliados a la seguridad social, 2004–2007. Gac Sanit 24(5):385–390
- Berkson J (1938) Some difficulties of interpretation encountered in the application of the chi-square test. J Am Stat Assoc 33(203):526–536
- Bertsekas DP (1999) Nonlinear programming, 2nd edn. Athena Scientific, Boston
- Boado-Penas MC, Valdés-Prieto S, Vidal-Meliá C (2008) The actuarial balance sheet for pay-as-you-go finance: solvency indicators for Spain and Sweden. Fisc Stud 29:89–134
- Bonhomme S, Hospido L (2013) Earnings inequality in Spain: new evidence using tax data. Appl Econ 45(30):4212–4225
- Bonhomme S, Hospido L (2016) The cycle of earnings inequality: evidence from Spanish Social Security Data. Econ J. doi: 10.1111/ecoj.12368
- Bowley AL (1926) Measurement of precision attained in sampling. Bull Int Stat Inst 22(1):6–62
- Boyland JE (2016) Reproducibility. IMA J Manag Math 27(2):107–108. doi: 10.1093/imaman/dpw003
- Cairó Blanco I (2010) An empirical analysis of retirement behaviour in Spain: partial versus full retirement. SERIEs—J Span Econ Assoc 1(3):325–356
- Cohen J (1988) Statistical power analysis for the behavioral sciences, 2nd edn. Erlbaum, Hillsdale
- Conde Ruiz JI, González CI (2013) Reforma de pensiones 2011 en España. Hacienda Pública/ Rev Public Econ 204(1):9–44
- Devesa JE, Devesa M, Domínguez I, Encinas B, Meneu R, Nagore A (2012) Equidad y sostenibilidad como objetivos ante la reforma del sistema contributivo de pensiones de jubilación. Hacienda Pública Esp/Rev Econ Pública 201:9–38
- Domínguez ÁA (2012) Labor transitions of Spanish workers: a flexicurity approach. Rev Int Organ/Int J Organ 9:121–143. http://www.raco.cat/index.php/RIO/article/view/281050/368714
- Domínguez Fabián I, Devesa Carpio M, Rosado Cebrián B, (2012) La Muestra Continua de Vidas Laborales y su potencial para analizar la solvencia del sistema de pensiones desde la perspectiva del empleo. Madrid: Ministerio de Empleo y Seguridad Social. FIPROS. [Consulted 8-3-2013]. http://www.seg-social.es/prdi00/groups/public/documents/binario/174191.pdf
- Durán A (2007) La muestra continua de vidas laborales de la seguridad social. Rev Minist Trab Asun Soc 1:231–240
- Durán A, Sevilla M (2006) Una muestra continua de vidas laborales. In: El papel de los registros administrativos en el análisis social y económico y el desarrollo del sistema estadístico. Carmen Marcos García (directora) Colección: Estudios de Hacienda Pública. Instituto de Estudios Fiscales, Madrid. pp. 241–252
- García Segovia F, Durán A (2008) Nuevos avances en la información laboral: la Muestra Continua de Vidas Laborales. Economistas 116:228–231
- Grafström A, Schelin L (2014) How to select representative samples. Scand J Stat 41:277–290
- Griva I, Nash SG, Sofer A (2009) Linear and nonlinear optimization, 2nd edn. Society for Industrial and Applied Mathematics, Philadelphia
- Hansen MH, Hurwitz WN, Madow W (1953) Sample surveys: methods and theory. Wiley, New York
- Himmelreicher RK, Stegmann M (2008) New Possibilities for socio-economic research through longitudinal data from the research data centre of the German Federal Pension Insurance (FDZ-RV). Schmollers Jahrb 128(4):647–660
- Instituto Nacional de la Seguridad Social (INSS) (2006–2007) Informes Estadísticos 2005–2006. Madrid: INSS. Secretaría de Estado de la Seguridad Social. Ministerio de Trabajo y Asuntos Sociales
- INSS (2008–2011) Informes Estadísticos 2007–2010. Madrid: INSS. Secretaría de Estado de la Seguridad Social. Ministerio de Trabajo e Inmigración
- INSS (2012–2014) Informes Estadísticos 2011–2013. Madrid: INSS. Secretaría de Estado de la Seguridad Social. Ministerio de Empleo y Seguridad Social
- Izquierdo M, Lacuesta A, Vegas R (2009) Assimilation of immigrants in Spain: a longitudinal analysis. Labour Econ 16(6):669–678
- Kruskall W, Mosteller F (1979a) Representative Sampling I. Int Stat Rev 47(1):13–24
- Kruskall W, Mosteller F (1979b) Representative sampling, II: scientific literature, excluding statistics. Int Stat Rev 47(2):111–127
- Kruskall W, Mosteller F (1979c) Representative sampling, III: the current statistical literature. Int Stat Rev 47(3):245–265
- Kruskall W, Mosteller F (1980) Representative sampling, IV: the history of the concept in statistics, 1895–1939. Int Stat Rev 48(2):169–195
- Lapuerta I (2010) Claves para el trabajo con la Muestra Continua de Vidas Laborales. DemoSoc working paper (2010-37)
- Lin M, Lucas HC, Shmueli G (2013) Research commentary: too big to fail: large samples and the p-value problem. Inf Syst Res 24(4):906–917
- López Roldán P (2011) La Muestra Continua de Vidas Laborales: posibilidades y limitaciones. Aplicación al estudio de la ocupación de la población inmigrante. Metodol Encuestas 13:7–32
- Luenberger DG (2003) Linear and nonlinear programming, 2nd edn. Kluwer Academic Publishers, Boston
- Meneu Gaya R, Encinas Goenechea B (2012) Valoración de la reforma del sistema de pensiones español de 2011 desde la óptica de la viabilidad financiero-actuarial. Un análisis a través de la CSWL. Madrid: Ministerio de Empleo y Seguridad Social. FIPROS. [Consulted 14-5-2014]. http://www.seg-social.es/prdi00/groups/public/documents/binario/174193.pdf
- Meneu Gaya R, Pérez-Salamero González JM, Ventura Marco M (1998) Fundamentos de optimización matemática en Economía. Repro-Exprés, S. L. http://roderic.uv.es/handle/10550/25951
- MESS (2016) La Muestra Continua de Vidas Laborales. Guía del contenido. Estadísticas, Presupuestos y Estudios. Estadísticas. http://www.seg-social.es/prdi00/groups/public/documents/binario/190489.pdf. Accessed 22-May-2015
- Ministerio de Trabajo y Asuntos Sociales (MTAS) (2006) La Muestra Continua de Vidas Laborales. Colección Informes y Estudios. Serie Seguridad Social, vol 26. Ministerio de Trabajo y Asuntos Sociales, Madrid
- Moral Arce I, Patxot C, Souto G (2008) La sostenibilidad del sistema de pensiones. Una aproximación a partir de la CSWL. Rev Econ Apl XVI(E–1):29–66
- Muñoz de Bustillo R, De Pedraza P, Antón JI, Rivas LA (2011) Working life and retirement pensions in Spain: the simulated impact of a parametric reform. Int Soc Secur Rev 64(1):73–93
- Nagore García A, van Soest A (2016) New job matches and their stability before and during the crisis. CentER discussion paper; vol. 2016-033. Tilburg University: Econometrics
- Olsen A, Hudson R (2009) Social security administration’s master earnings file: background information. Soc Secur Bull 69(3):29–45
- Omair A (2014) Sample size estimation and sampling techniques for selecting a representative sample. J Health Spec 2(4):142–147
- Patxot C, Souto G, Villanueva J (2009) Fostering the contributory nature of the Spanish retirement pension system: an arithmetic micro-simulation exercise using the CSWL. Presup Gasto Público 57:7–32
- Peinado Martínez P (2011) Pension system’s reform in Spain: a dynamic analysis of the effects on welfare. Serrano Pérez, F. (director). Doctoral thesis. Universidad del País Vasco. [Consulted 4-3-2014]. https://addi.ehu.es/bitstream/10810/8113/8/peinado.pdf
- Pérez-Salamero González JM, Régulez-Castillo M, Vidal-Meliá C (2016) Análisis de la representatividad de la MCVL: el caso de las prestaciones del sistema público de pensiones. Hacienda Pública Esp/Rev Public Econ 217(2/2016): 67–130
- Ramsey CA, Hewitt AD (2005) A methodology for assessing sample representativeness. Environ Forensics 6:71–75
- Ruszczynski AP (2006) Nonlinear optimization. Princeton University Press, Princeton
- Singh D, Chaudhary FS (1986) Theory and analysis of sample survey designs. Wiley Eastern Limited, Hoboken
- Smith CM (1989) The social security administration’s continuous work history sample. Soc Secur Bulletin 52:10
- Solé M, Diaz Serrano L, Rodríguez M (2013) Disparities in work, risk and health between immigrants and native-born Spaniards. Soc Sci Med 76:179–187
- Toharia Cortés L, Moreno G, Muñoz C (2007) La mejora del sistema de información estadística procedente de los registros de la seguridad social. Ministerio de Trabajo y Asuntos Sociales, Madrid
- Treviño R, Vidal E, Devolder D (2008) Factores e indicadores de vulnerabilidad en la conciliación del empleo y la familia. Ministerio de Trabajo e Inmigración, Madrid
- Tryfos P (1996) Sampling methods for applied research: text and cases. Wiley, Hoboken
- Vall Castello J (2012) Promoting employment of disabled women in Spain: evaluating a policy. Labour Econ 19:82–91
- Vegas Sánchez R, Argimón I, Botella M, González C (2013) Old age pensions and retirement in Spain. SERIEs J Span Econ Assoc 2013(4):273–307
- Vicente Merino A, Calderón Milán M, Martínez Aguado T (2012) Muchos pierden y pocos ganan: efectos de la reforma legislativa sobre el poder adquisitivo del trabajador tras la jubilación. An Inst Actuar Esp 18:77–110
- Vidal Meliá C, Boado Penas MC, Settergren O (2009) Automatic Balance Mechanisms in pay-as-you-go pension systems. The Geneva Pap Risk and Insur Issues Pract 34:287–317. doi: 10.1057/gpp.2009.2
- Wang C (1993) Sense and nonsense of statistical inference: controversy, misuse, and subtlety. Marcel Dekker, New York
- Wilkinson L, APA Task Force on Statistical Inference (1999) Statistical methods in psychology journals: guidelines and explanations. Am Psychol 54:594–604

## Copyright information

**Open Access** This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.