I. Introduction

The age of women at which they have first child has important consequences on the demographic characteristic of the population [4]. Lower the age at first birth implies higher rate of fertility and population growth because of the shorter length of time between generations [6]. Teenage pregnancies and birth are substantial health risks both for the mother and the child [8]. Adolescent child-bearing restrict growth of mother’s educational attainment resulting in reduced economic opportunity and participation in the labour market for the mother as-well-as the household [2] [5]. There is a strong relationship between age at first birth and children ever born to women. Early childbearing results in large number of children compared to those who start late [1] [5]. Early initiation of childbearing lengthens the reproductive period and subsequently increases fertility. In developed economies, postponement of first births-due to an increase in the age at marriage, has contributed greatly to overall fertility decline. The relationship between age at first birth and fertility is studies extensively and has been well documented. This study aims to answer whether or not the age at first birth is influenced by socio-demographic factors and if yes to know the direction of such influences using frailty models. Scope: In a traditional Indian society and a demographically backward state of Uttar Pradesh with fertility rate still above three plus children there is immense families/societal pressure among newly wed couples to bear children. Not able to produce children immediately after marriage is perceived as women being in-fecund. This negatively impact family’s social standing in the community/society. It is therefore very important to revisit the socio-demographic differentials so as to study the society’s norms on marriage and child bearing. Indian government has legislated legal minimum age at marriage for boy (21 years) and girls (18 years). However, enforcement of this particular law is amongst the worst in the country. Demographically, backward states of Uttar Pradesh, Madhya Pradesh, Rajasthan and Bihar still have large proportions of girls getting married below the legal minimum. This study aims to revisit the strength of association between age at first birth and socio-demographic factors in light of frailty models. Instead of constant baseline hazard the log-logistic distribution is assumed with Inverse-Gaussian frailty.

II. Data and methods

The study utilizes the Date from India’s National Family Health Survey round three conducted by Macro International and International Institute of Population Sciences under the auspices of Ministry of Health and Family Welfare, GOI. The survey covered a wide range of data regarding demography and health of individual women and children besides collecting information on couples as well. Complete reproductive history including fertility of individual women and health status of the individual women aged 10-49 were collected. From the raw data set we have selected a cohort of married women aged 30-34 years for the current analysis as by this age each woman gets married and experiences the event of child bearing. A sample of 1641 women belongs to this cohort. The target fertility variable under study is the age of women at the time giving first birth. A set of socio-demographic variables has been selected for determining influencing factors to the women’s age at first birth. The central focus of the analysis is the frailty models with survival time data i.e., time to event data. A short description of survival analysis with parametric baseline hazard is presented below. Let,

T – Response, time to failure

xt – Row vector of covariates

[T |xtβ] – Some density f (t)

  • Survival analysis is characterized by censoring and truncation

  • It is much more convenient to think in terms of survival function S(t) = P (T > t) and hazard function h(t) = f (t)/S(t), i.e. instantaneous probability of failure given survival up to t.

  • Response is actually the triple (t0, t, d) where subject observed from (t0, t] and either failed (d = 1) or was censored (d = 0). The covariates in the model are assumed constant over (t0, t].

The proportional hazard assumes the form; h(ti) = h0(ti) exp(xi tβ) for some baseline hazard h0(t).

The cox-regression is a proportional hazard model that makes no assumption about the functional form of h0(t). Under the parametric survival model instead we assume some functional form for h0(t). For the current study we assume that the age at first birth episodes follows a log-logistic distribution which takes the form

$$ \begin{array}{l} {\rm{f(t) = b}}{{\rm{a}}^{\rm{b}}}{{\rm{t}}^{{\rm{b - 1}}}}{\rm{/[1 + (at}}{{\rm{)}}^{\rm{b}}}{{\rm{]}}^{\rm{2}}}\,\,\,\,\,\,{\rm{a,b > 0}}\\ {\rm{G(t) = 1/1 + (at}}{{\rm{)}}^{\rm{b}}}\\ {\rm{r(t) = b}}{{\rm{a}}^{\rm{b}}}{{\rm{t}}^{{\rm{b - 1}}}}{\rm{/1 + (at}}{{\rm{)}}^{\rm{b}}} \end{array} $$

Here a = exp ( - A ∝ ) and b = exp ( - B ß ) alpha and beta are the model parameters to be estimated. While A & B are the first component of the covariate row vectors and are equal to one.

The parametric specification plus covariates can only go so far in explaining the variability in observed time to failure. The excess unexplained variability is an over-dispersion which is caused either by model misspecification or omitted covariates. This led to the development of frailty models. A frailty model attempts to measure this over -dispersion by modeling it as resulting from a latent multiplicative effect on the hazard function, i.e. the hazard becomes h(t|α) = α h(t). From the proportional hazards perspective, it is easy to see how ‘α’ may correspond to an omitted covariate (or set of covariates).

$$ {\rm{h(}}{{\rm{t}}_{\rm{i}}}\left| {{{\rm{\alpha }}_{\rm{i}}}{\rm{) = }}} \right.{{\rm{\alpha }}_{\rm{i}}}{\rm{h(}}{{\rm{t}}_{\rm{i}}}{\rm{) = }}{{\rm{\alpha }}_{\rm{i}}}{{\rm{h}}_{\rm{0}}}{\rm{(}}{{\rm{t}}_{\rm{i}}}{\rm{)exp(}}{{\rm{x}}_{\rm{i}}}^{\rm{t}}{\rm{\beta )}} $$

In this study we assume distribution of alpha as Inverse Gaussian having density and survival function as

$$ {\rm{G(\alpha ) = (2\pi \theta }}{{\rm{\alpha }}^{\rm{3}}}{{\rm{)}}^{{\rm{ - 1/2}}}}{\rm{exp\{ - 1/2\theta (\alpha - 2 + 1/\theta )\} }} $$

The unconditional survival function is given by;

$$ {{\rm{S}}_{\rm{\theta }}}{\rm{(t) = exp\{ 1/\theta }}\,{\rm{(1 - [1 - 2\theta }}\,{\rm{ln\{ S(t)\} }}{{\rm{]}}^{{\rm{1/2}}}}{\rm{)\} }} $$

For further details on frailty model one can refer to Wienke, A. [7]. Description on the dependent and selected independent variables used in the analysis is provided below.

Dependent variable: Age at first birth is the dependent variable and it ranges between 12 and 34. Therefore outcome variable under the survival setting is

Yi :

= Age of women at first birth.

Covariates

X1 ::

Age at first marriage of women

X2 ::

Place of residence

X3 ::

Literacy status of women

X4 ::

Religion

X5 ::

Ever use of contraceptives

X6 ::

Caste

X7 ::

Household Wealth Index

The study analyzes the data using frailty models. Before incorporating frailty variables into the survival models, the parametric proportional hazard method of survival analysis is performed using log logistic distribution.

III. Findings and discussion

The characteristics of the sampled women aged 30-34 is presented in the table 1 below. The distribution of age at first birth is slightly skewed as is reflected in the graph below. The majority of women experience birth of first child between ages 15-24. About 72 percent of the sampled women aged 30-34 got married by age 18 the legal minimum for girls. Sampled women are predominantly Hindus (79%) and live in rural areas (58%). More than half of the sampled women (59%) are illiterate. Only about 10 percent of the sampled women had education levels beyond secondary school. Twenty three percent of the sampled women belonged to SC/ST category while 46 percent of the women belonged to other backward caste category. Thirty one percent of the women belonged to general caste category. Twenty four percent of the women never used any method of contraception while about 74 percent of the women did use any method of contraception which includes both modern as well as traditional. Forty one percent of the sampled women belonged to either poorest or poorer categories, while 43 percent were richer or richest. About 16 percent of the women belonged to middle income group (q3 quintile).

Table: 1 Socio-demographic Characteristics of the Study Sample

Table 2 below shows the mean and median survival duration of the women’s selected characteristics, those literate, marrying at higher ages, belonging to other caste (other than SC/ST & OBC) and are from high socio-economic strata are more likely to give their first birth at higher ages than their respective counterparts. The mean and the median survival duration are computed only for those variables that are used in the Cox Hazard model in table 3.

figure 1
Table 2: Mean and Median survival duration of the selected background characteristics

Table 3 below presents findings from the Cox-survival model with constant baseline hazard function. Prior running Cox model the proportionality assumption for each covariate under study was examined. The place of residence, religion and the ever use of contraception was dropped from the analysis as they do not meet the proportionality criterion. The Log-log plot is presented through graphs at Appendix. In the final run education, age at marriage, caste and the wealth index were used to fit the cox proportional hazards model. Respondents who are literate have 24 percent less likely to have age at first birth at age which illiterate women would like to have, the difference is statistically significant. Similarly, women marrying ages 15-19 are 58 percent less likely to have age at first birth at which women of ages 8-14 years have their first birth. As the age of women increases higher percentages are less likely to have age at first birth at which women of age 8-14 have. The difference is statistically significant. Caste is not statistically significant in explaining the survival difference in age at first birth. No significant difference in age at first birth was found either by wealth quintiles or the socio-economic category to which the respondent belongs.

Table 3 Findings from Cox-Survival Model with Constant Hazard Function

Table 4 below presents findings from the parametric survival model were-in the baseline hazard function for age at first birth follows the log-logistic distribution. We also try to capture unobserved heterogeneity in survival time by incorporating Inverse-Gaussian frailty. The age of women at their first birth may be affected by the factors those are not considered in the analysis. Some of these factors are unobservable.

Table 4 Findings from Parametric Survival Model with Inverse Gaussian Frailty

For example age at marriage significantly determines age at first birth. However, there are many cultural considerations that determine age at marriage such as dowry, socio-economic status of respondent’s family, individual attributes such as beautifulness etc. cannot be precisely incorporated in the model, it is therefore wise to model individual frailty. As the parametric form is assigned to the baseline hazard function we run the model incorporating full set of covariates. The coefficients are more or less in the same direction as in the case of Cox-Survival model, however a non zero shape parameter (gamma) justifies the use of parametric form for baseline hazard function. Similarly, the hypothesis that the estimate of variance is zero if rejected pointing towards unobserved heterogeneity. It is therefore parsimonious to use the log-logistic parametric form with inverse Gaussian Frailty rather than simple Cox Survival Model.

IV. Conclusion

Age at marriage and women’s literacy is the most significant factor explaining variations in age at first birth. Also The shape parameter value of 0.055 (gamma) under log-logistic baseline hazards reflects better fit than constant baseline hazards. The Inverse-Gaussian frailty with non-zero variance estimate of θ=0.11 reflects significant unobserved heterogeneity in the Age at First Birth.

Acknowledgement

Authors would like to thank MOHFW, GOI New Delhi for supporting the study