Index of active participation in society

The concept of active citizenship is gaining importance. Setting aside the selection of dimensions and items, the paper addresses methodological issues in aggregating indicators into a multidimensional active participation index (PI) by two methods. Policy makers and researchers can take advantage of the proposed methods: arithmetic aggregation (Method-1) with normally distributed transformed scores, or multiplicative aggregation (Method-2) without scaling and without choosing weights. Thus, Method-2 avoids the problems of rank robustness specific to selection of weights. The multiplicative aggregation proposed in Method-2 can be converted to an additive model by taking logarithms. Normal distributions of two or more groups by Method-1 are likely to give rise to lower values of the Gini coefficient, indicating equality. Avoiding major limitations of ordinal scores, both methods satisfy desired properties and allow analysis under a parametric set-up for meaningful comparisons, including testing of statistical hypotheses, identification of critical dimensions, ranking of the dimensions by elasticity, assessment of progress/decline of PI, etc. Method-2 offers a more generalized approach satisfying the time reversal test and formation of chain indices. However, a test of normality is required for Method-2, unlike Method-1, which ensures normally distributed scores.


Introduction
The concept of Active citizenship (AC) is related to civic engagement and building social capital and is defined as participation in civil society, community and political life, characterized by mutual respect and non-violence and in accordance with human rights and democracy [1].
It deals with issues relating to rights and responsibilities. Clearly, the form and content of AC vary with social values and span cultural, political and environmental activities at local, regional, national and even international levels. By this definition, it excludes participation in extremist groups promoting intolerance and violence.
In a democratic system, the quality of citizenship depends on the behavior of community members in terms of active participation in politics, society and community, along with values. Improving the economic, social and political effectiveness of a country depends heavily on active participation of its citizens. In fact, the concept of good government differs across countries. Attempts have been made to assess quality of citizenship and degree of active participation [2] with different numbers of indicators and associated dimensions. Factors of AC at the individual level and at the national level were identified using a multilevel model [3], which found higher AC in countries with high GDP, equal distribution of income and a heterogeneous religious climate.
Measurement of the Participation Index (PI) means choosing a real valued function f on the n-dimensional space corresponding to n indicators. Quality of PI depends on properties of such a function and on the measurement procedures adopted for the indicators. Setting aside the selection of dimensions and items, the paper gives two methods of aggregation that convert discrete indicator scores to continuous scores following normal distribution, for better and meaningful uses of PI, while satisfying desired properties including computation of PI for a group of countries. Problems of construction of PI and remedial actions are addressed with the work of [4] as an example.

Framework on the concept of citizen participation
There is no agreement on an operational definition of the multidimensional participation index (PI). The Active Citizenship Composite Index (ACCI) considered participation in four dimensions, viz. Political Life, Civil Society, Community Life and Values, with different numbers of sub-dimensions and indicators under each dimension [5]. A direct impact of participation in the plan-making process was found [6] on inclusion outcomes: reducing cultural barriers, mitigating poverty, increasing economic opportunities and promoting good governance, where the degree of relationship varied across different indicators of social inclusion. Ref. [7] observed that the theoretical framework of active ageing failed to identify the relevant contributing factors and barriers and needs improved conceptual and analytical clarity. Moreover, empirical investigations of active ageing depend on health, education, good finances, etc. Thus, evaluation of active ageing for individuals may be an elusive goal for a large segment of older adults.
The European Charter on the Participation of Young People in Local and Regional Life (1992, revised in 2003) is an international policy document approved by the Congress of Local and Regional Authorities of the Council of Europe (www.salto-youth.net/downloads/4-17-1510/Revised%20European%20Charter%20on%20the%20Participation%20of%20YP.pdf). The charter outlines 14 areas for involvement of young people. Similarly, the National Report on Youth Policy in Norway [8] summarizes a number of critical changes which impact the lives of young people and has significant overlaps with the Council of Europe charter.
Socio-economic inequalities may restrict participation in society and even undermine the responsiveness and representativeness of people with disadvantaged backgrounds [9,10]. Other factors influencing participation positively are increases in age, family income, education, etc. [11]. However, methodological issues like scaling, selection of weights and aggregation of indicators suffer from limitations for meaningful inter-country/inter-regional/inter-sample comparisons, classification, assessment of progress, etc. Statistical testing of equality of mean participation across time and space can be done if assumptions of the techniques, like normal distribution of the variables, are satisfied. The effect of active participation on wellbeing and quality of life has been addressed by [12][13][14].

Data
To operationalize the concept, 63 basic indicators distributed over four dimensions and a number of sub-dimensions were considered. Details are given in Table 1.
It may be observed that indicators like money donated are related to per capita GDP, which is an indicator of the Human Development Index (HDI). Similarly, Women Participation in national parliament overlaps with the Global Gender Gap Index (GGGI); Democracy may coincide with trust and perception of inequality, which are components of the Social Cohesion Index (SCI). Thus, PI is likely to have positive correlations with HDI, GGGI, and SCI.

Natures of data on selected indicators
- Membership of political parties, human rights organisations, trade union organisations and other organisations is in ratio scale, usually given in percentages. However, addition of percentages of the form $X\% = \frac{N_X}{D_X} \times 100$, where $N_X$ is the numerator count and $D_X$ the denominator, is not meaningful when $D_X \neq K D_Y$. For example, $\frac{8}{24} = 33.33\%$ and $\frac{5}{10} = 50\%$. The combined percentage is $\frac{13}{34} = 38.24\%$ and not the average of 50% and 33.33%, i.e. 41.67%.
- Participation could be at different levels like very active, active, dormant, etc. Information about participation obtained by survey using 3-point or 5-point items is ordinal and needs an appropriate method of aggregation, since the latent distance between "very active" and "active" is not the same as that between "active" and "dormant". Thus, ordinal scores emerging from 3-point or 5-point items are not equidistant and hence addition is not meaningful. $\bar{X} > \bar{Y}$ or $\bar{X} < \bar{Y}$ is meaningless for ordinal scales [15].
- Data on money donated (in ratio scale) may not be reliable.
- Voting Turnout in Parliament and Women Participation in national parliament are secondary data in percentages.
- Working in an organisation or association is a binary indicator of Yes-No type. Signing a petition, taking part in lawful demonstrations, boycotting products, contacting a politician, etc., say in the last 12 months, are counts. However, concepts of lawful demonstrations, ethical consumption, etc. vary and need to be defined properly without ambiguity.
- Non-organised help in the community may convey different meanings to different subjects. For better understanding of community organizing, [16] found it useful to discuss what organizing is not. Functions of community organizers are different from those of community developers, social service professionals, lawyers, or other fields of community engagement.
- Indicators of the dimension of Values need to be defined, since they are subjective and sensitive too. Questionnaires for such indicators need to be developed carefully, avoiding leading items/questions.
- Correlation between a pair of indicators may vary. Very high correlation between two indicators gives rise to multicollinearity and needs to be avoided.
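The point about percentages with unequal denominators can be checked numerically. The following is a minimal sketch using the two membership figures quoted in the first item of the list (8 out of 24 and 5 out of 10); the function names are illustrative and not part of the original study.

```python
def pooled_percentage(counts, totals):
    """Percentage for the combined group: total count over total denominator."""
    return 100.0 * sum(counts) / sum(totals)


def mean_of_percentages(counts, totals):
    """Simple average of the group percentages (misleading when totals differ)."""
    percentages = [100.0 * c / t for c, t in zip(counts, totals)]
    return sum(percentages) / len(percentages)


# Figures taken from the text: 8 of 24 and 5 of 10.
counts = [8, 5]
totals = [24, 10]

pooled = pooled_percentage(counts, totals)      # 13 / 34 expressed in percent
averaged = mean_of_percentages(counts, totals)  # mean of 33.33% and 50%

print(f"pooled = {pooled:.2f}%, averaged = {averaged:.2f}%")
```

The two quantities coincide only when the denominators are equal, which is why pooling counts is preferred over averaging percentages.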

Observations
The selected indicators may not be exhaustive, since active citizenship is an evolving concept. IT-related interactions and participation need to be included. Increasing use of digital technology like the internet, civic/political websites, etc., offering diverse forms of interactive participation, information sharing, etc., may reflect the relationship between citizen and state, which could also influence the effect of social networks on citizens' willingness to participate in social governance. Ref. [17] used online and offline surveys covering 15 provinces of China using questionnaires consisting of 1037 items, including 773 valid items from the online survey and 250 items from the offline survey. Here, the dependent variable "the citizens' willingness to participate in social governance" was assessed by 4-point items like "not willing at all," "willing to participate in all the available activities," "willing to participate in the activities of easy access," and "willing to participate in the activities of easy access, with self-interests involved". The independent variable was social capital, having three parts, viz. social networks, social trust, and social norms, and each was assessed by 4-point items like "how many neighbors do you greet frequently in your community" and "how many neighbors are considered to be your friends in your community". To assess "social trust" and "social norms," Likert scales were used, generating ordinal scores. A scale to assess civic competence is multidimensional due to the presence of different dimensions. Principal Component Analysis (PCA) or Factor Analysis (FA) of indicators or dimensions may result in several independent factors. Thus, finding total scores of a multidimensional scale may not be appropriate. For example, the manual of the 36-Item Short Form Health Survey questionnaire (SF-36) provides no support for calculating a total/overall score of SF-36 for an individual or a country, due to several independent factors being measured by the scale (http://www.webcitation.org/6cfeefPkf).

Arithmetic aggregation
Addition of variables in ratio scale and variables in ordinal scale is problematic and difficult to interpret. Even if each of X and Y is in ratio scale, X + Y = Z is meaningful if X and Y follow similar distributions (possibly with different parameters), so that the probability density function (pdf) of Z can be derived as the convolution of the distributions of X and Y.
Summative scores of a scale, taken as the sum of item scores, assign equal importance to the items and dimensions despite different item-total or dimension-total correlations and different factor loadings for dimensions and items as observed from PCA. Summative scores also suffer from the substitution effect (a poor score in one dimension can be compensated by higher scores in other dimensions) and may give misleading results [18,19].
A better approach could be to transform data on K-point items (K = 2, 3, 4, 5, ...) to equidistant scores, which can be further transformed to follow normal distribution with the same range of item scores [20]. Here, the sum of normally distributed item scores will also follow normal distribution, facilitating estimation of the parameters from the data.
In addition, summative scores may result in a number of tied scores and reduce the discriminating power of the scale. Quality of a scale in terms of psychometric properties like reliability, validity and discriminating value needs to be reported.

Scaling
Raw scores of the indicators in different units and in different score ranges are usually scaled before aggregation. The Min-Max function was used in [4], where the i-th indicator ($X_i$) was rescaled to $Y_i = \frac{X_i - X_{Min}}{X_{Max} - X_{Min}}$, where $0 \le Y_i \le 1$. Such scaling indicates relative performance and not absolute performance of a country [21]. It depends significantly on $X_{Max}$ and $X_{Min}$, which may be unreliable outliers. If X is in percentage, $[Max(X_i) - Min(X_i)]$ may not be meaningful. The Human Poverty Index (HPI) considers the 3rd root and 4th root of the average of figures in percentage for HPI-1 and HPI-2 respectively [22]. Min-Max transformation tends to overestimate the impact of indicators having small score ranges, changes the distribution of the transformed scores and may have an impact on the PI. Ranks of two countries may be influenced by the performance of a third country [23]. A decrease in performance of the worst performing country may increase the value of $Y_i$ even if $X_i$ for the i-th country remains unchanged. If $X_{Min}$ is changed, ranking and relative valuations may change due to the change in marginal rates of substitution [24]. The gain in Y per unit increase in X equals $\frac{1}{X_{Max} - X_{Min}}$ and therefore depends on the extreme values in the sample rather than on the country alone. For example, consider an indicator X taking values 58, 70, 80, 85, 90, 96. Clearly, $X_{Max} = 96$ and $X_{Min} = 58$. The gain in Y due to an increase of X from 80 to 90 is $\frac{10}{38} = 0.2632$, whereas the gain from increasing X from 85 to 90 is $\frac{5}{38} = 0.1316$.
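The dependence of Min-Max scores on the sample extremes can be illustrated with the indicator values quoted above. This is a minimal sketch; the modified sample in which the worst performer drops from 58 to 50 is a hypothetical scenario introduced only for illustration.

```python
def min_max(values):
    """Rescale values to [0, 1] using the sample minimum and maximum."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


# Indicator values from the text.
x = [58, 70, 80, 85, 90, 96]
y = dict(zip(x, min_max(x)))

# Gain in the scaled score when X rises from 80 to 90.
gain_80_90 = y[90] - y[80]

# Hypothetical scenario: only the worst performer changes (58 -> 50).
x_new = [50, 70, 80, 85, 90, 96]
y_new = dict(zip(x_new, min_max(x_new)))

print(f"gain 80 -> 90: {gain_80_90:.4f}")
print(f"Y for X = 70 before: {y[70]:.4f}, after worst performer drops: {y_new[70]:.4f}")
```

The country with X = 70 obtains a higher scaled score in the second sample although its own raw score is unchanged, which is the third-country effect described in the paragraph.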
A better scaling that ensures unit-free values is standardization by $Z = \frac{X - Mean(X)}{SD(X)} \sim N(0, 1)$, where $-\infty < Z_i < \infty$. Negative scores can be avoided by further transforming Z-scores to a desired score range, say 1 to 100. In PCA, original input variables are standardized to Z-scores. Other illustrative transformations for scaling express an indicator as a ratio multiplied by 100; one such ratio is less robust to the influence of outliers and is linearly related to Proportionate Normalization. Ratios can also be taken over time, where t denotes the time period, or with respect to $X_{i0}$, the value in the base period. Under the logarithmic transformation used for income, the rate of increase of $Income_X$ is different for different values of X, and $Income_X$ is not invariant under change of origin. Logarithmic transformation fails to satisfy desired properties like translation invariance and consistency in aggregation [25]. The index depends on the normalization methods applied to different indicators. Moreover, a non-linear logarithmic transformation may alter the structure of the data: $r_{Life\,expectancy,\,HDI} > r_{Life\,expectancy,\,GDP}$ [26], but the inequality was reversed on taking logarithmic transformations.
The scaling methods have advantages and disadvantages. There is no best method of scaling. Different normalization techniques resulted in different ranking lists [27]. Multi-criteria decision-making (MCDM) methods like Analytic Hierarchy Process (AHP), Data Envelopment Analysis (DEA), Benefit of the Doubt (BoD), etc. require no normalization.

Weighted sum
Weighted sum is a popular approach for aggregation, where weights ($W_i$'s) are assigned to the indicators ($X_i$'s) and the composite index (CI) is obtained as $CI = \sum W_i X_i$, where $0 < W_i < 1$ and $\sum W_i = 1$ to satisfy the convex property. Ref. [4] used weighted sums to get dimension scores and further weights for the dimensions to compute PI.
There are different methods of selecting weights. Approaches to find weights for the indicators could be normative (subjective weights), data-driven (determined objectively) or hybrid [28]. Selected weights serve as 'trade-offs' and reflect relative importance. The amount of indicator 2 to be sacrificed to gain an extra unit of indicator 1 is reflected in the ratio $\frac{W_1}{W_2}$. Such trade-offs may not be meaningful when the indicators relate to economic growth (say GDP) and to improvement in non-monetary areas like values against discrimination.
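The trade-off interpretation of weights can be made concrete. Under a linear weighted sum, raising indicator 1 by one unit while lowering indicator 2 by $W_1/W_2$ units leaves the composite index unchanged. The sketch below checks this; the weights and indicator values are hypothetical and chosen only for illustration.

```python
def composite_index(weights, indicators):
    """Weighted-sum composite index with convex weights (0 < W_i < 1, sum = 1)."""
    if abs(sum(weights) - 1.0) > 1e-9 or not all(0 < w < 1 for w in weights):
        raise ValueError("weights must lie in (0, 1) and sum to 1")
    return sum(w * x for w, x in zip(weights, indicators))


# Hypothetical weights and indicator scores (not from the paper).
weights = [0.6, 0.4]
base = [50.0, 70.0]

ci_base = composite_index(weights, base)

# Implied trade-off: units of indicator 2 given up per extra unit of indicator 1.
trade_off = weights[0] / weights[1]
shifted = [base[0] + 1.0, base[1] - trade_off]
ci_shifted = composite_index(weights, shifted)

print(f"trade-off = {trade_off:.2f}, CI before = {ci_base:.2f}, CI after = {ci_shifted:.2f}")
```

Whether such a fixed rate of exchange between, say, GDP and values against discrimination is acceptable is a substantive question that the arithmetic itself cannot settle.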
Weights of DEA are obtained by satisfying relevant constraints of the linear programming problem and deriving a single aggregate measure for each decision making unit (DMU). DEA results are sensitive to the choice of inputs and outputs. However, the best specification cannot be tested. Moreover, the number of efficient units tends to increase with an increase in the number of input and output variables [29]. AHP finds weights of criteria based on $\frac{n(n-1)}{2}$ pair-wise comparisons for n alternatives using a 1 to 9 scale, and the number of comparisons increases with n. Inconsistency may occur since the transitivity rule does not hold for all elements of the matrix of pair-wise comparisons, and ambiguity in the judgement scale may not be avoided. AHP cannot be of much use for requirements prioritization [30].
The Best-Worst Method (BWM) finds weights based on Tchebychev distance, requiring fewer pair-wise comparisons than AHP [31]. However, major issues of BWM are the lack of a threshold for the consistency ratio, ordinal consistency, and a complex calculation process, especially for large n. There could be situations where there is no unique best and/or worst criterion (as required in BWM). Situations with two or more best or worst criteria cannot be solved easily by BWM. Using a scale from 1 to 9 to determine the most important/preferred criterion (or the least important criterion) is subjective and is sensitive to the sample composition. BoD weights can be used for linear aggregation but not for non-linear/non-compensatory aggregation. The weight vector of AHP as factor loadings of the principal eigenvector has significant drawbacks [32]. For the Economic Sentiment Indicator and the environmental sustainability index, PCA and FA failed [33]. A weight vector $W = (W_1, W_2, \ldots, W_n)^T$ was proposed by [34] as a function of the correlation matrix R, where $Y = \sum_{i=1}^{n} W_i X_i$ and $i \neq j$, such that Y is equi-correlated with the $Z_i$'s. However, different methods of selecting weights may affect PI differently and may not ensure rank robustness. A CI as a weighted sum does not address the variance of the weighted sum or the correlation of the CI with the chosen indicators. Equal weights were applied to the dimensions and to the indicators within each sub-dimension to compute the Active Citizenship Composite Indicator [4]. Equal weighting with perfect substitutability may reverse country ranks [35]. No weights or equal weights are wrong, and no weighting system is above criticism [36]. Just as there is no consensus on a perfect weighting scheme, there is no ideal aggregation scheme [37]. Thus, it is desirable to construct PI without considering a weighted sum.

Arithmetic aggregation (Method-1)
Let $X_{ij}$ be the raw ordinal score of a country in the i-th item for choosing the j-th response-category, where higher score ⇔ higher participation. For a 5-point item, the weighted score is $WS = \sum \sum W_{ij} X_{ij}$, where the $W_{ij}$'s are different for different levels of the item, satisfying $W_{ij} > 0$ and $\sum_{j=1}^{5} W_{ij} = 1$. Scores of the i-th item will be equidistant and monotonic if $W_{i1}, 2W_{i2}, 3W_{i3}, 4W_{i4}$ and $5W_{i5}$ form an arithmetic progression with a positive common difference.

Observations
i) $W_{j(Final)}$ are based on empirical probabilities.
ii) $f_{ij} = 0$ for a particular level of an item can be taken as a zero value for scoring Likert items as a weighted sum.
iii) Generated scores (E) as weighted sums are continuous, equidistant and monotonic in ratio scale.
iv) Method-1 is applicable for items with different numbers of response-categories, including binary items.
Standardize the E-scores of the i-th item by $Z_{ij} = \frac{E_{ij} - Mean(E_i)}{SD(E_i)}$. Take the linear transformation of $Z_{ij}$ to P-scores given by Eq. (1). For the i-th item, $P_i \sim N(\mu_i, \sigma_i^2)$ and $1 \le P_i \le 100$, where estimates of $\mu_i$ and $\sigma_i^2$ are obtained from the data. The P-score of an item as per Eq. (1) can be obtained irrespective of the length of the scale and the width of the items.
For the indicators in ratio scales, raw scores can be standardized and transformed by Eq. (1) to follow normal distribution. The domain score of a country is taken as the sum of the normally distributed P-scores of the relevant items, which will follow normal distribution with mean $\sum_i \mu_i$ and $SD = \sqrt{\sum_i \sigma_i^2 + 2\sum_{i<j} Cov(P_i, P_j)}$. Similarly, the scale score is the sum of domain scores and also follows normal distribution.
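The scoring steps of Method-1 can be sketched as follows. The exact form of the linear transformation to P-scores is not reproduced here; the sketch assumes a linear map that sends the smallest Z-score to 1 and the largest to 100, which is one transformation consistent with the stated range $1 \le P_i \le 100$. The E-scores are hypothetical, and population (divide-by-n) moments are used for simplicity.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Standardize scores: subtract the mean and divide by the SD."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def p_scores(z, low=1.0, high=100.0):
    """ASSUMED linear map of Z-scores onto [low, high] (min -> low, max -> high)."""
    zmin, zmax = min(z), max(z)
    return [low + (high - low) * (v - zmin) / (zmax - zmin) for v in z]


def pcov(a, b):
    """Population covariance of two equally long score lists."""
    ma, mb = mean(a), mean(b)
    return sum((x - ma) * (y - mb) for x, y in zip(a, b)) / len(a)


# Hypothetical E-scores of two items for five countries.
item1 = [2.1, 3.4, 2.8, 4.0, 3.1]
item2 = [1.5, 2.9, 3.3, 3.8, 2.2]

p1 = p_scores(z_scores(item1))
p2 = p_scores(z_scores(item2))

# Domain score of each country = sum of its item P-scores.
domain = [a + b for a, b in zip(p1, p2)]

# Variance of the sum equals the sum of variances plus twice the covariance.
var_domain = pcov(domain, domain)
var_from_parts = pcov(p1, p1) + pcov(p2, p2) + 2 * pcov(p1, p2)

print(f"Var(domain) = {var_domain:.4f}, from parts = {var_from_parts:.4f}")
```

With more than two items the covariance term runs over every distinct pair of items, counted once and multiplied by two.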

Properties
1. Domain scores ($D_i$) and scale scores ($S_i$) of the i-th country are continuous and monotonically increasing, each following normal distribution. Normality ensures meaningful computation of arithmetic average, SD, correlation, etc. and facilitates statistical analysis under a parametric set-up, including unbiased estimates of the population mean ($\mu$) and population variance ($\sigma^2$), confidence interval of $\mu$, and testing of null hypotheses such as $H_0: \mu_1 = \mu_2$ or $H_0: \sigma_1^2 = \sigma_2^2$ across time and space.

2. Var ⇒ Positive correlations for most pairs of dimensions.
3. Progress of the i-th country in successive time periods can be quantified by $\frac{S_{i(t)} - S_{i(t-1)}}{S_{i(t-1)}} \times 100$, which also quantifies responsiveness of the scale and effectiveness of an implemented policy measure. Deterioration is indicated if $\frac{S_{i(t)} - S_{i(t-1)}}{S_{i(t-1)}} \times 100 < 0$. Similarly, progress for a group of countries is reflected if $S_{i(t)} > S_{i(t-1)}$. Normally distributed $S_i$ helps to test $H_0: S_t = S_{(t-1)}$ and $H_0: Progress_{(t+1)\,over\,t} = 0$. Deterioration, if any, may be probed to find the extent of deterioration in the concerned domain(s) for possible corrective actions.
4. The plot of progress/deterioration of a country at various time points can be used to compare PI from the base period.
The dimensions can be ranked in terms of $\frac{\nabla S}{\nabla D_i}$.
6. Normally distributed scores satisfy the assumptions of PCA and enable finding factorial validity as the ratio of the first eigenvalue to the sum of all eigenvalues, i.e. Factorial Validity $= \frac{\lambda_1}{\sum_i \lambda_i}$, where $\lambda_1$ is the highest eigenvalue, reflecting the main factor for which the scale was developed [38], and accounts for the proportion $\frac{\lambda_1}{\sum_i \lambda_i}$ of the total variance. Factorial validity avoids the problems of construct validity and selection of a criterion scale. The Tracy-Widom (TW) statistic can be used to test significance of the largest or other eigenvalues [39].
7. Normality helps to estimate the variance of a subclass, the scale and each item, and to estimate Cronbach's alpha for a domain/subclass at population level. Reliability of a scale ($r_{tt}$) consisting of K dimensions can be obtained as a function of dimension reliabilities, where $r_{tt(i)}$ and $S_{xi}$ denote respectively the reliability and SD of the i-th dimension [34].
It can be proved that test reliability and $Disc_{Test}$ are related by a negative non-linear relationship.
9. Quartile clustering helps in classification of a group of countries into four mutually exclusive classes $Q_1, Q_2, Q_3, Q_4$. Quartile clustering of scale scores following normal distribution may be adopted because it is simple and appealing, provides well-defined cut-off scores for the four mutually exclusive classes and assigns equal probability to each quartile/class. A similar approach may be used for classification into five classes (pentiles), ten classes (deciles), etc.
For normally distributed dimension scores, a given score $X_0$ in dimension-1 will be equivalent to a score of $Y_0$ in dimension-2 if
$\int_{-\infty}^{X_0} f(X)\,dX = \int_{-\infty}^{Y_0} g(Y)\,dY$ (7)
where $f(X)$ and $g(Y)$ denote the normal probability density functions (pdf) of transformed scores of dimension-1 and dimension-2 respectively. Eq. (7) can be solved using the Standard Normal probability table. It helps to find all combinations of $\{X_0, Y_0\}$, including cut-off scores of two scales.
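Quartile clustering and equivalent scores for normally distributed dimension scores can be sketched with the standard library. The sketch assumes that two scores are equivalent when the areas to their left under the respective normal curves are equal; the means and SDs are hypothetical.

```python
from statistics import NormalDist


def quartile_cutoffs(dist):
    """Cut-off scores separating the four equal-probability classes Q1..Q4."""
    return [dist.inv_cdf(p) for p in (0.25, 0.50, 0.75)]


def quartile_class(score, cutoffs):
    """Return 1, 2, 3 or 4 according to the quartile in which the score falls."""
    return 1 + sum(score > c for c in cutoffs)


def equivalent_score(x0, dim1, dim2):
    """ASSUMPTION: equal left-tail area under the two normal curves."""
    return dim2.inv_cdf(dim1.cdf(x0))


# Hypothetical parameters of two normally distributed dimension scores.
dim1 = NormalDist(mu=60.0, sigma=10.0)
dim2 = NormalDist(mu=45.0, sigma=5.0)

cuts = quartile_cutoffs(dim1)
classes = [quartile_class(s, cuts) for s in (50.0, 58.0, 63.0, 75.0)]
y0 = equivalent_score(70.0, dim1, dim2)

print("cut-offs:", [round(c, 2) for c in cuts])
print("classes:", classes)
print("score in dimension-2 equivalent to 70 in dimension-1:", round(y0, 2))
```

The same `inv_cdf` call with probabilities 0.2, 0.4, 0.6, 0.8 would give cut-offs for pentiles.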

Geometric aggregation (Method-2)
Let $X_{i0}$ denote the value of the i-th indicator in the base period. The unit-free ratio $\frac{X_{it}}{X_{i0}}$ indicates progress or decline of the country with respect to the i-th indicator at the t-th time period over the base period. PI for the c-th time period (current period), denoted here by $PI_{c0}$, is defined as the geometric mean of the n indicator ratios, i.e.
$PI_{c0} = \left( \prod_{i=1}^{n} \frac{X_{ic}}{X_{i0}} \right)^{1/n}$ (8)
or equivalently, avoiding the n-th root,
$PI_{c0} = \prod_{i=1}^{n} \frac{X_{ic}}{X_{i0}}$ (9)
Equation (9) can also be written in terms of dimension scores, giving Eq. (10). Here, $PI_{c0} > 1$ indicates progress and $PI_{c0} < 1$ indicates decline, assuming higher score ⇔ higher participation. The indicators for which $\frac{X_{ic}}{X_{i0}} < 1$ are critical and can be identified for deciding an appropriate action plan.
$\frac{PI_{t0}}{PI_{(t-1)0}} > 1$ implies overall improvement made by the i-th country in period t over the (t-1)-th period.
Note that each of (9) and (10) can be applied to all types of indicators, in ratio scales or in ordinal scales, considers all chosen indicators including those in percentages, and depicts overall improvement/decline in the current year over the base year by a continuous function which is symmetric in its arguments and increases monotonically. Replacing the base period vector by the vector for the previous year gives the improvement of PI on a year-to-year basis.
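The multiplicative aggregation of Method-2 can be sketched as the product of current-to-base indicator ratios, with the geometric mean as its n-th root. The indicator vectors are hypothetical, and the function names are illustrative. The sketch also checks the time reversal test (the index from base to current times the index from current to base equals 1) and the chain property for year-to-year indices.

```python
from math import isclose, prod


def ratios(current, base):
    """Unit-free ratios of current-period to base-period indicator values."""
    return [c / b for c, b in zip(current, base)]


def pi_product(current, base):
    """Participation index as the product of indicator ratios."""
    return prod(ratios(current, base))


def pi_geometric_mean(current, base):
    """Participation index as the geometric mean of indicator ratios."""
    return pi_product(current, base) ** (1.0 / len(current))


def critical_indicators(current, base):
    """Positions of indicators whose ratio is below 1 (decline)."""
    return [i for i, r in enumerate(ratios(current, base)) if r < 1.0]


# Hypothetical indicator values for a base period and two later periods.
x0 = [40.0, 25.0, 60.0]
x1 = [44.0, 24.0, 66.0]
x2 = [46.0, 27.0, 63.0]

pi_10 = pi_product(x1, x0)
gm_10 = pi_geometric_mean(x1, x0)
critical = critical_indicators(x1, x0)

# Time reversal test and chaining of year-to-year indices.
time_reversal_ok = isclose(pi_product(x1, x0) * pi_product(x0, x1), 1.0)
chain_ok = isclose(pi_product(x2, x1) * pi_product(x1, x0), pi_product(x2, x0))

print(f"PI (product) = {pi_10:.4f}, PI (geometric mean) = {gm_10:.4f}")
print("critical indicators:", critical)
print("time reversal:", time_reversal_ok, "chain:", chain_ok)
```

Both forms order periods identically, since the n-th root is a monotone function of the product.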

Properties
Each of (9) and (10) satisfies the following desired properties:
1. Independent of the order of the chosen indicators or dimensions and independent of change of scale.
2. An increase of, say, 1% in $\frac{X_{it}}{X_{i0}}$ ⇒ a 1% increase in the PI if all others are unchanged. Thus, the curve showing gain in PI against gain in an indicator ratio is linear.
8. Reliability, factorial validity and discriminating value of PI or a dimension can be found by taking logarithms on both sides of Eq. (10).
9. Facilitates estimation of the population GM, standard error of the GM and confidence interval of the GM [40]. The geometric standard deviation (GSD), denoted $S_{GM}$, is obtained through its logarithm, $\log S_{GM}$.

Discussion
Focusing on the methodology of assessing PI, the paper describes two assumption-free methods avoiding scaling. Method-2 also avoids selection of weights. Thus, Method-2 avoids the problems of rank robustness specific to selection of weights [35]. Normal distributions of two or more groups by Method-1 are likely to give rise to lower values of the Gini coefficient, indicating equality. Method-1 converts discrete and ordinal scores of k-point items ($X_i$) to continuous, monotonically increasing, equidistant scores ($E_i$) by a weighted sum, where weights for different response-categories of different items are in ratio scale. E-scores, linearly transformed to Z-scores and P-scores, can be added to indicators measured in ratio scales. Benefits of both methods include:
- Meaningful arithmetic aggregation (Method-1). Multiplicative aggregation in Method-2 can be converted to an additive model by taking logarithms.
- Normally distributed dimension scores ($D_i$) and PI satisfy many desired properties like meaningful comparisons, ranking and classification of a set of countries, estimation of population parameters and testing of statistical hypotheses. However, normality of PI has to be tested for Method-2 by, say, the Anderson-Darling test.
- Identification of critical dimensions showing deterioration with time, thus drawing the attention of policy makers for initiation of corrective actions.
- Finding the effect of a small change in the i-th dimension on PI by elasticity, the ratio of change in PI due to unit change in a dimension. Such elasticity can be used to rank the dimensions.
- Assessment of progress/decline of PI at successive time periods for monitoring the effect of policies and strategies. Statistical hypotheses on significance of progress/decline of PI at two different time periods can be tested since the ratio of two normally distributed variables follows a $\chi^2$ distribution.
- High reliability of the scale ($r_{tt}$) indicates rank robustness, which can be quantified by Spearman's rank correlation ($\rho$).
Thus, the proposed measures satisfying the desired properties are an improvement over the existing methods.

Conclusion
The paper contributes to improved scoring of PI, avoiding major limitations of ordinal scores and facilitating analysis under a parametric set-up for meaningful comparisons. Policy makers and researchers can take advantage of the proposed methods of arithmetic aggregation without weights, or multiplicative aggregation without scaling and choosing weights. Both methods satisfy desired properties. Method-2 offers a more generalized approach satisfying the time reversal test and formation of chain indices. However, a test of normality is required for Method-2, unlike Method-1, which ensures normally distributed scores. The proposed methodologies are innovative and contribute to estimation of citizen participation in their respective territories or countries. Empirical studies may be undertaken to find correlations of PIs by each proposed method and to generalize the findings.
Author contributions The single author is involved in conceptualization, methodology, writing-original draft preparation, writing-reviewing and editing.
Funding No funding was obtained for this study.

× 100, where j denotes a negative indicator/domain.
• Logarithmic transformation of an indicator: $Y_i = \log X_i$ for $X_i > 0$. For the Income component, HDI (2010) used $Income_X = \frac{\log_e X - \log_e(X_{Min})}{\log_e(X_{Max}) - \log_e(X_{Min})}$.

5. Policy makers are interested to know the elasticity of each dimension, i.e. the change in S-score per unit change in a dimension score, $\frac{\nabla S}{\nabla D_i}$.

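$\hat{\alpha} = \left( \frac{m}{m-1} \right) \left( 1 - \frac{\text{Sum of estimates of variance of items in the sub-class}}{\text{Estimate of variance of the sub-class}} \right)$, where m denotes the number of items in the sub-class.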
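The variance-ratio expression above has the form of Cronbach's alpha, which multiplies one minus that ratio by m/(m − 1) for m items. A minimal sketch follows, using sample variances as the estimates; the item scores are hypothetical and the function name is illustrative.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha from a list of item-score lists (one list per item).

    All lists hold scores of the same units (e.g. countries) in the same order.
    """
    m = len(items)
    totals = [sum(scores) for scores in zip(*items)]   # sub-class score per unit
    item_var_sum = sum(variance(scores) for scores in items)
    return (m / (m - 1)) * (1 - item_var_sum / variance(totals))


# Hypothetical scores of five units on three items of one sub-class.
items = [
    [3.0, 4.0, 2.0, 5.0, 4.0],
    [2.0, 4.0, 3.0, 5.0, 3.0],
    [3.0, 5.0, 2.0, 4.0, 4.0],
]

alpha = cronbach_alpha(items)
print(f"Cronbach's alpha = {alpha:.4f}")
```

Items that move together inflate the variance of the total relative to the sum of item variances, which pushes alpha towards 1.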

3. Avoids scaling and selection of weights, and reduces the level of substitutability between component indicators.
4. Not affected much by outliers and thus produces no bias for developed or under-developed countries.
5. Satisfaction of the time-reversal test and formation of chain indices by Method-2 may help inter-country comparison over time by tracking the path of overall progress registered by a country.
6. Facilitates construction of separate indices for each domain by focusing on indicators related to that domain, without further weights for domains.
7. Easy to find the relative importance of each indicator. The indicators for which $\frac{X_{it}}{X_{i0}} < 1$ are the critical areas requiring attention.
8. Discriminating value of a scale indicates the ability of the scale to distinguish between countries that have different degrees of the underlying construct. Discriminating value of an item ($Disc_i$) and of the test ($Disc_{Test}$) can be computed by the coefficient of variation (CV), where $Disc_i = \frac{SD \text{ of the i-th item}}{\text{Mean of the i-th item}}$. Cronbach's alpha and $Disc_{Test}$ (with m items) are related through the variances of the items, $S^2$.