The number of senior residents in China is projected to exceed a third of the total population in 2060, approximately corresponding to 500 million [1]. Increases in the proportion of seniors require that special attention be given to their health. There is compelling evidence that an active lifestyle can contribute to health aging [2]. Health-enhancing amounts of physical activity can be accumulated through walking [3], which is one of the recommended forms of physical activity for seniors because it is versatile, affordable, and safe [4].

There is growing, albeit inconsistent, evidence that the neighborhood built environment can facilitate or hinder residents' engagement in walking for different purposes in both adults and seniors [49]. Conflicting findings have in part been attributed to using different built-environment measures with sometimes doubtful or unknown metric properties [4, 10]. To address this problem, there have been recent calls for the development of comparable and good-quality measures of the built environment [10, 11]. The built environment has been assessed using three main methods: questionnaires, systematic observations, and archival data sets layered and analyzed with Geographic Information Systems (GIS). While systematic observations and GIS databases provide objective data on neighborhood attributes, questionnaires gauge residents' perceptions of the local environment. These methods complement each other and offer somewhat different but equally important information contributing to a better understanding of walking behavior [1215].

Of all available questionnaires assessing perceived attributes of the neighborhood environment believed to impact on residents' walking, the Neighborhood Environment Walkability Scale (NEWS) [16] and its abbreviated version (NEWS-A) [17] are the most frequently used internationally [7, 1820]. Their popularity is in part due to them being the 'official' instruments of the International Physical Activity and the Environment Network, an international collaborative research initiative aimed at conducting cross-country pooled analyses of built-environment and physical activity relationships. The NEWS-A has been translated in many languages and found to be sufficiently reliable and valid in adult populations [1619, 2123].

However, the NEWS-A was developed in the USA, which raises some concerns about its applicability to geographical locations differing in environmental attributes and culture [18, 24]. Asian urban environments are much denser both in terms of population and destinations than those of countries such as the USA and Australia [18]. Additionally, they exhibit walking-related environmental idiosyncrasies (e.g., air pollution, crowdedness, diverse terrain, types of destinations) that are not captured by tools developed for less dense locations. Although the Chinese version of the NEWS-A showed good reliability and construct validity in a sample of Hong Kong adults [18], it does not include items measuring environmental attributes relevant to Asian high-density urban environments and, thus, its value in a geographically-specific context is limited.

Another limitation associated with the NEWS-A is that it was developed and tested on adult populations (20-65 years). Therefore, its suitability for older adults (65+ years) is uncertain. To gain a better understanding of how the built environment affects the physical activity levels of populations and, subsequently, devise effective environmental interventions advantageous to the majority of residents, it is also important to identify age-group differences in effects. This implies the use of measures that are both comparable (e.g., consist of common items) and age specific (i.e., gauge environmental attributes that are of particular relevance to specific age groups). However, although recent studies have used the full version or selected items of the NEWS-A with older adults [7, 20], to these authors' knowledge, no studies have yet attempted to adapt and/or conduct reliability and factorial validity tests of the original or adapted versions of the NEWS-A on this particular age group. To address the lack of questionnaires measuring universally-comparable environmental aspects related to walking as well as aspects specific to an Asian urban context and older residents (65+ years), the aim of this study was to develop a new version of the NEWS-A for Chinese senior residents (thereafter termed NEWS-CS) and test its reliability and factorial validity on a sample of Hong Kong seniors.


This study consisted of three stages: (1) development and translation; (2) pilot study including cognitive testing; (3) reliability and factorial validity testing. The study protocol was approved by the ethics committees of participating research institutions.


The original English [17] and Chinese translation [18] of the Neighborhood Environment Walkability Scale - Abbreviated (NEWS-A) were the source instruments used to develop the Neighborhood Environment Walkability Scale for Chinese Seniors (NEWS-CS). The original NEWS-A assesses perceived environmental characteristics, stemming in part from the urban planning literature, believed to influence walking and other forms of physical activity [16, 17]. It consists of 54 items. These are grouped into subscales including perceived residential density, proximity to non-residential land uses (land use mix - diversity), ease of access to non-residential uses (land use mix - access), street connectivity, infrastructure for walking and cycling, aesthetics, traffic safety and safety from crime. Four single items assess hilly streets, difficult car parking in shopping areas, absence of cul-de-sacs, and perceived major physical barriers to walking. All subscales and single items, with the exception of residential density and land use mix - diversity, are rated on a 4-point Likert scale. Residential density items use a 5-point scale, whereby ratings are weighted relative to the average residential density that a specific item represents. The weighted ratings are summed to create a perceived residential density score. Land use mix - diversity is assessed by the perceived walking proximity from home to various destinations, with responses ranging from 1- to 5-minute to >30-min walking distance. The NEWS-A has been shown to have acceptable levels of validity and reliability [1618]. A study on Hong Kong adults reported high levels of test-retest reliability of the Chinese version of the NEWS-A [18].

The version of the NEWS-CS examined in this study encompasses all but two items of the NEWS-A, in their original or slightly modified form, and 24 additional items describing features of the environment relevant to the study setting and senior residents (see Development and translation section below).

Participants and procedures

Development and translation

A multi-disciplinary panel of experts from the fields of public health (n = 3), urban planning (n = 1), Elderly Health Service (n = 4), physiotherapy (n = 1) and physical activity (n = 3) reviewed the original English and Chinese versions of the NEWS-A, and adapted them to reflect the built environment of Hong Kong and needs of older adults [25]. The panel's adaptation of the NEWS-A (thereafter, NEWS-CS) included all items of the original instrument, in their original or slightly modified form, except for one (see Table 1). In total, 23 items describing features of the environment relevant to the study setting and target population were added and 13 were modified.

Table 1 Adaptations of the NEWS-A for Chinese older adults

Specifically, the 'residential density' subscale was modified to capture the high level of residential density typical of Hong Kong. Eight types of destinations were added to the 'land use mix - diversity' subscale. The descriptors of six other destinations were modified to provide a better match to either the local environment or senior residents (Table 1). Modifications were also proposed to three original items measuring infrastructure and safety for walking. Eleven items were added to other subscales of the NEWS-A in order to capture particular features of the built environment characteristic of Hong Kong - namely, a highly developed public transport network, high levels of population density and crowdedness, terrain steepness, air pollution, indoor-places for walking, safety issues related to high-rise buildings, street hygiene concerns associated with to the presence of dogs, and aspects of social disorder (Table 1). Five newly-developed items were included to gauge aspects of the environment believed to be particularly important to senior residents (e.g., sitting facilities, road and traffic hazards, access to the entrance and exit of high-rise buildings, ability to ask for assistance) [25].

The proposed response scales for the 'residential density' and land use mix - diversity' subscales were the same as those in the original NEWS-A. In addition to the original response scale (strongly disagree, somewhat disagree, somewhat agree, strongly agree), two additional Likert-like scales were proposed for the remaining items of the NEWS-CS. Two panel members maintained that the respondents might find the word 'somewhat' confusing. Therefore, the use of a scale with the response anchors strongly disagree, disagree, agree, strongly agree was suggested. Others suggested that the response anchors disagree, somewhat disagree, somewhat agree, and agree might be more appropriate since previous research had shown that some Asian populations (China and Japan) tend to avoid endorsing extreme answers [26]. Such tendency can have a negative effect on response variability and, hence, the ability to evaluate relationships between variables [26].

The English versions of the NEWS-CS with three different response scales were translated into Chinese following the World Health Organization guidelines [27]. This included (1) forward independent translations by two bilingual Chinese native speakers familiar with the areas of study (physical activity and urban planning); (2) an assessment of the accuracy of the translations by two independent bilingual experts in survey development; (3) discussion of translation problems and consensual amendments to the forward translation by the panel of experts in collaboration with the translators; (4) a back-translation of the NEWS-CS into English by an independent translator with no knowledge of the original questionnaire. The last three steps were reiterated until equivalence between the original English version of the NEWS-CS and the back-translation of the Chinese NEWS-CS was achieved.

Pilot testing and cognitive interviews

The translated instruments were pre-tested on 50 Chinese-speaking persons aged 65 or above, able to walk without assistance and communicate verbally, with no diagnosis of cognitive impairment and residing in the community (Table 2). Participants were selected and screened for eligibility from membership lists of four Elderly Health Centres (EHCs), representing areas of Hong Kong differing in socio-economic status (SES) and transport-related walkability (see below for details) [28]. The EHCs, one in each of the 18 districts of Hong Kong, were established by the Department of Health of the Government of the Hong Kong Special Administrative Region to provide comprehensive primary care services for elderly aged 65 and over residing in Hong Kong on a membership basis. It was decided to sample participants from the EHCs because of their appropriate age, observed willingness to participate in health-related studies, the ability to pre-screen their health with the assistance of EHCs staff, and their similarity in age, SES, and health status to the general elderly population of Hong Kong [29]. EHC members differ from the general population in gender distribution (66% vs. 54% women) and education level in men only (slightly higher education level among EHC members) [29].

Table 2 Socio-demographic and area characteristics of study samples

Participants were balanced by gender and EHCs (Table 2). After signing an informed consent form, they were randomized to one of the three versions of the NEWS-CS (with different response scales) and asked to verbalize their thought process while they were providing responses to the items. They were also asked questions about the meaning, the choice of words and appropriateness of each item in the questionnaire. Additionally, they were invited to reveal any aspect of the built environment relevant to their walking that was not included in the NEWS-CS. The results of the pilot study were used to subsequently modify the NEWS-CS.

Reliability and factorial validity testing

A sample of 484 of eligible seniors (able to walk without assistance, with no diagnosis of cognitive impairment and residing in one of a priori selected locations) were recruited from membership lists of four EHCs selected based on available data on walkability and SES of their catchment areas (districts) to represent the following strata: high walkable/high SES; high walkable/low SES; low walkable/high SES; and low walkable/low SES (Table 2). This type of stratification was needed to maximize variance in the environmental variables of interest. Median household income and percentage of owner-occupiers were used as measures of area SES and were obtained from the Census and Statistics Department of Hong Kong SAR. Household density (number of households per km2), intersection density (number of intersections with three or more unique intersecting streets per km2) and commercial and service destination density (commercial and service destinations per km2) were used as measures of walkability. This information was obtained from the Census and Statistics Department and from Centamap (; Table 2).

Eight street blocks, with at least 25 EHCs members residing in the block, were randomly selected in each area (total 32 street blocks). Approximately 15 EHCs members were recruited from each street block. The NEWS-CS was interviewer-administered to consenting participants (recruitment rate: 78% of the total number of invited participants). Women [χ2(1) = 3.90; p = .048] and elders living in lower SES areas [χ2(1) = 5.73; p = .017] were slightly more likely to consent to participation. Socio-demographic information was also collected (age, gender, and educational attainment). Completion of the NEWS-CS and socio-demographic questions took, on average, ~25 minutes. All participants were asked whether they would be willing to be re-assessed two weeks after the first assessment. Three consenting participants per street block were recruited for this component of the study (n = 96). Ninety-two participants attended both interviews. The interval between the two interviews ranged from 14 to 20 days (average of 17 days).

Data analytic plan

Assessment of Likert scales (pilot study)

To examine whether the Likert scale used in the original NEWS-A would result in a potential reduction of variability in responses due to the tendency to avoid endorsing extreme responses (i.e., strongly agree), standard deviations of the scores were computed for each relevant NEWS-CS item by type of Likert scale (three types were used), and entered as data in the analyses. The items represented cases, while the three types of Likert scale represented conditions (i.e., independent variables). Given that the distribution of standard deviations was positively skewed, the Wilcoxon signed-rank test was used to test the significance of the difference between the standard deviations obtained using different Likert scales.

Test-retest reliability (main study)

Test-retest reliability of each item and subscale of the NEWS-CS was established by computing intraclass correlation coefficients. Based on previously proposed classification systems, ICC values below 0.50 were classified as poor, 0.50 to 0.75 as moderate, and above 0.75 as high levels of reliability [30]. Percentage agreement was also computed as an additional measure of test-retest reliability to assess the percentage of individuals who gave the same response on an item on both assessments. Percent agreement was also computed for subscales with restricted variability (SD < 0.5), as low levels of variability may result in very low ICC values even though the actual agreement between assessment is high. Items and subscales with moderate or low ICC values, restricted variability (SD < 0.5), but percent agreement greater than 75% were considered highly reliable. While items and scales with low ICC, but percent agreement exceeding 60%, were considered moderately reliable [31].

Factorial structure (main study)

Confirmatory factor analysis (CFA) was performed on the responses of the NEWS-CS for items that could be factor-analyzed (i.e., all items but those measuring perceived dwelling density and land use mix diversity). To account for street-block clustering effects threatening the validity of the standard errors estimates, CFAs were conducted using the within-neighborhood variance/covariance matrix, containing estimates of relationships among differences in responses between participants living in the same street-blocks [17, 21]. Multilevel CFA, simultaneously modeling within- and between-street-block covariances, could not be employed since the number of sampled street blocks (32) was too small for the number of factor-analyzable items (40). CFA was conducted using the Maximum Likelihood Estimation (MLE) method. Based on previous findings [17, 19, 21] and a preliminary classification of newly-added items, a measurement model of the NEWS-CS including the following factors and single items was estimated: access to services (4 items); street connectivity (2 items); infrastructure for walking (4 items); indoor places for walking (2 items); physical and social disorder (3 items); aesthetics (4 items); crowdedness (2 items); presence of people (2 items); traffic and road hazards (8 items); crime (3 items); hilly streets (single item); physical barriers to walking (single item); dead ends (single item); easy access to entrance of residence (single item); sitting facilities (single item); bridge or tunnel (single item). Another model determined whether the factor structure of the original NEWS-A estimated on a sample of American adults [17] held for the examined sample of Chinese older adults. This model was estimated using only factor-analyzable items common to the NEWS-A and NEWS-CS (23 items). It consisted of the following factors and single items: access to services (3 items); street connectivity (2 items); infrastructure for walking (5 items); aesthetics (4 items); traffic hazards (3 items); crime (3 items); hilly streets (single item); physical barriers to walking (single item); dead ends (single item). The models were re-specified following the iterative model-generating approach outlined by Jöreskog and Sörbom [32]. Model re-specification was based on an analysis of standardized factor loadings, standardized residual covariances, Wald tests, univariate Lagrange multiplier tests, and substantive considerations. Factor loadings with an absolute value greater than 0.30 were considered to be significant. A combination of indices of model fit, as proposed by Hu and Bentler [33], and Kline [34], was used to assess the goodness-of-fit of the measurement models. We used the standardized root mean squared residual (SRMS), the root mean square error of approximation (RMSEA), and the comparative fit index (CFI). Values indicative of good model fit are ≤0.08 for SRMR, ≤0.06 for RMSEA, and ≥0.95 for CFI [33]. We also reported the χ2 test and the Akaike information criterion (AIC) of the standard goodness-of-fit χ2 statistic including a penalty for complexity. CFAs were conducted using EQS 6.1 (Multivariate Software Inc., Encino, CA, 2004).


Assessment of Likert scales (pilot study)

One of the aims of the pilot study was to examine whether the use of different anchors with a 4-point Likert scale would result in significantly different levels of variability in responses. Standard deviations of scores on items rated using the original Likert scale (strongly agree, somewhat agree, somewhat disagree, strongly disagree) were significantly larger (M = 0.92; Median = 1.10) than those obtained using a Likert scale without the adjective 'somewhat' (M = 0.67; Median = 1.01) (signrank z = 3.33; p < .001). The Likert scale without the adjective 'strongly' yielded similar standard deviations (M = 0.96; Median = 1.12) to those of the original Likert scale (signrank z = -1.63; p = .103) but larger standard deviations than the scale without the adjective 'somewhat' (signrank z = -4.59; p < .001). Given that the original Likert scale yielded one of the largest levels of variability in responses, it was used in the main study.

Test-retest reliability (main study)

Table 3 shows the descriptive statistics and test-retest reliability indices for each item and subscale of the NEWS-CS. In general, the reliability of the subscales was moderate, with ICCs ranging from 0.52 to 0.77. The subscale 'presence of people' had a low ICC (0.37) but a high percent agreement (87%) indicative of good reliability. These discrepant results were likely due to the low variability in responses (SD = 0.3). Residential density and land use mix diversity were the subscales with the highest reliability indices. At the item level, reliability ranged from poor (e.g., distance to Western non-fast food restaurant) to excellent (e.g., some items measuring residential density) and, in the main, was moderate. Six items had very low ICCs but high percent agreement (e.g., shopping accessible via public transport). All of these items showed low variability in responses, explaining the differences in results. Items with consistently low reliability were distance to 'video/audio shop', 'Western non-fast food restaurant', 'Western coffee shop', and 'public toilet'. Exclusion of these items from the land use mix - diversity scale had minimal impact on the reliability of the subscale (Table 3).

Table 3 Descriptive statistics and test-retest reliability of the NEWS-CS (reliability subsample; N = 92)

Factorial structure (main study)

The originally proposed measurement model of the NEWS-CS met only one of the three goodness-of-fit criteria (CFI = 0.86; SRMR = 0.073; RMSEA = 0.051, 95% CI: 0.048, 0.055; χ2(701) = 1594.7; AIC = 192.7). Standardized loadings were below the criterion value of |.30| for the items 'fence separating sidewalk and traffic', 'slow traffic speed on streets', and 'speeding drivers'. Analysis of indices of poor model fit at the item level lead to several modifications of the measurement model. Firstly the item 'fence separating sidewalk and traffic' was defined as a single item as it did not significantly load on any of the factors. The items 'slow traffic speed on streets' and 'speeding drivers' formed a 'Traffic speed' factor separate from other traffic-related items. The items 'hilly streets', 'many dead ends', and 'major barriers (e.g., roadwork, freeways)' were inter-correlated and significantly loaded on an additional factor named 'Physical barriers to walking' (Table 4). Some factors were not significantly correlated and, thus, their covariance was constrained to zero (Table 5). The model fit of the re-specified model was considerably better than the original model, with two of the three indices meeting the goodness-of-fit criteria and one index meeting a less stringent criterion (CFI = 0.91; SRMR = 0.058; RMSEA = 0.041, 95% CI: 0.037, 0.045; χ2(701) = 1277.5; AIC = -124.5) [34]. All standardized loadings were significant at the 0.001 level. Moderate-to-strong positive correlations were found among 'Traffic and road hazards', 'Crowdedness', and 'Social disorder - littering' (Table 5). The factor 'Indoor places for walking' was substantially negatively related to 'Traffic and road hazards' and 'Physical barriers to walking'. A moderate negative association was also observed between 'Physical barriers to walking' and 'Access to services'.

Table 4 Measurement model of the full NEWS-CS and NEWS-CS including items common to the original NEWS-A
Table 5 Correlations between latent factors of the full NEWS-CS (above diagonal) and NEWS-CS including only items common to the NEWS-A (below diagonal)

The a priori measurement model of the NEWS-CS including only items common to the original NEWS-A met the goodness-of-fit criteria for the SRMR and RMSEA (CFI = 0.89; SRMR = 0.067; RMSEA = 0.037, 95% CI: 0.030, 0.043; χ2(217) = 358.3; AIC = -75.7). The item 'fence separating sidewalk and traffic' did not significantly load onto the factor it was supposed to measure ('Infrastructure for walking') and standardized residuals for the covariances of the items 'hilly streets', 'many dead ends', and 'major barriers (e.g., roadwork, freeways)' suggested that they were inter-correlated rather than independent (Table 4). Additionally, several latent factors were not inter-correlated. After re-defining 'fence separating sidewalk and traffic' as a single item, adding an additional factor ('Physical barriers to walking') explaining the observed inter-correlations between 'hilly streets', 'many dead ends', and 'major barriers (e.g., roadwork, freeways)', and constraining relevant inter-factor covariances to zero, the model met all goodness-of-fit criteria and yielded the lowest AIC value (CFI = 0.97; SRMR = 0.042; RMSEA = 0.021, 95% CI: 0.009, 0.029; χ2(216) = 260.7; AIC = -171.3). All standardized loadings were significant at the 0.001 level. Moderate associations were observed among 'Infrastructure for walking', 'Physical barriers to walking' and 'Access to services' and between 'Crime' and 'Infrastructure for walking' (Table 5).


The main aims of this study were (1) to adapt the original NEWS-A [17, 18], a questionnaire gauging perceived neighborhood attributes related to walking, for Chinese seniors living in Hong Kong and similar metropolises in China; and (2) to conduct reliability and factorial validity testing of the adapted instrument NEWS-CS.

Development of the NEWS-CS

Aspects of the built environment relevant to the target geographical setting [35] and age group [25] that were not covered by the original instrument were identified and added to the NEWS-CS. Location-specific attributes included types of food outlets (e.g., Chinese noodle shops) and other destinations (e.g., betting branches), elements of pedestrian infrastructure (e.g., pedestrian bridges and indoor places for walking), attributes associated with high levels of residential density (e.g., crowded streets) and steep terrain (e.g., slippery streets), and idiosyncratic forms of littering and social disorder (e.g., objects dropping from high-rise building). Similarly to studies conducted in the USA [11, 14, 35], environmental attributes identified to potentially affect walking in older residents were the presence of specific destinations (e.g., community centers and clinics), public transport infrastructure (e.g., access of shops via public transport), traffic/crime safety (e.g., parked cars blocking vision while crossing roads), and services (e.g., sitting facilities).

The original 4-point Likert scale did not require modifications as it yielded similar or higher levels of variability in responses than alternative scales. The main motivation for modifying the anchors of the Likert scale was the documented tendency of Asians to avoid extreme responses such as 'strongly agree' or 'strongly disagree', which results in restricted variability of scores [26]. No such tendency was detected in the present study. The unexpected results may be attributable to the socio-demographic characteristics of the examined sample. Namely, studies that identified low rates of extreme response styles in Asians were conducted on younger and more highly educated participants [36]. Recent findings suggest that the tendency to avoid extreme responses is negatively related to age and positively related to education [37]. This means that older respondents with a lower educational attainment would be less likely to avoid extreme responses than their younger and more highly educated counterparts.

Reliability of the NEWS-CS

In general, the subscales and single items of the NEWS-CS showed moderate to good test-retest reliability. Unacceptably low levels of reliability were observed only for four items measuring distance to destinations. Three of these items (Westerns non-fast food restaurant and cafés; audio/video shops) pertained to services of low relevance and affordability for Hong Kong Chinese seniors. The fourth unreliable item, public toilet, could also be classified as an unlikely destination respondents would walk to from home. Given that the infrequent use of services results in poorer knowledge or awareness of their location in space [38], the low reliability indices observed for the above items are explicable. It is suggested that these four types of destination be omitted from the NEWS-CS.

Overall, the reliability of the NEWS-CS in the examined sample was lower than that reported for the original NEWS or similar instruments in adults [16, 18, 23, 24]. Specifically, while test-retest reliability (based on ICC) of the NEWS-CS subscales ranged from 0.37 to 0.77, it ranged from 0.62 to 0.88 for the Australian version of the NEWS [23], from 0.58 to 0.80 for the original NEWS [16], and from 0.57 to 0.99 for the Chinese translation of the original NEWS-A [18]. The observed differences may be due to aging-related cognitive decline adversely affecting memory and, thus, response reliability [39]. Also, the longer interval between surveys used in this study (2-3 weeks) compared to a study on Chinese version of the NEWS-A for adults (7-10 days) might have contributed to lower levels of reliability [18]. To unravel the possible reasons for the observed discrepancies in reliability estimates, the test-retest reliability of the NEWS-CS should be examined in younger Hong Kong adults (20-65 years) and in a sample of seniors but using a shorter between-assessment interval.

Factorial structure of the NEWS-CS

The final measurement model of the NEWS-CS encompassed 12 conceptually distinct latent factors and four single items. The model fitted the data relatively well, although one of the indices of goodness-of-fit (CFI) did not meet a more stringent criterion of model fit [33]. The CFI is sensitive to the magnitude of correlation between variables [34]. Given that co-occurring environmental characteristics are sometimes only weakly-to-moderately correlated [10], relatively robust measurement models of environmental characteristics with acceptable values for other indices (e.g., RMSEA and SRMR) may yield lower CFI values.

Contrary to expectations, the item 'fences separating sidewalk from traffic' did not load on an 'Infrastructure for walking' factor but required re-specification as a single item. The independence of similar perceived environmental features (e.g., grass/dirt strip separating sidewalks from traffic) from other elements of pedestrian infrastructure listed in the original NEWS or NEWS-A was observed in other two studies [19, 21]. Overall, current cumulative evidence suggest that this item should be considered as a separate characteristic. In fact, safety fences or grass/dirt strips do not necessarily co-occur with sidewalks, cross-walks and other general pedestrian infrastructure. Another unexpected but not unique finding was the relative independence of items gauging traffic speed from items gauging traffic load and road hazards. Similar results were observed in Australia [19] but not in the USA [16, 21]. In general, the roads in urban areas of Hong Kong and Australian cities have lower traffic capacity than those in the USA. It is not difficult to understand how inner-city, narrow, traffic-congested roads of Hong Kong, which are frequently affected by road construction works, could be perceived as having low-speed traffic, while more peripheral, major roads could be associated with high-speed traffic. The large variability in traffic speed across urban areas of Hong Kong may be the main reason why perceived traffic speed and load/hazards did not load on a single traffic-related factor. The third and last unexpected finding regarding the factorial structure of the NEWS-CS were the relatively high inter-correlations between items measuring physical barriers to walking, presence of dead-ends, and hilly streets, which formed a new latent factor of 'Physical barriers to walking'. These items were not correlated in three earlier studies [17, 19, 21]. Unlike previously examined locations (Adelaide, Seattle, and Baltimore), Hong Kong is typified by a mixture of flat and extremely hilly terrain (elevation ranging from 0 m to 958m), where dead-ends and the presence of physical barriers such as stairs or steep slopes are a common occurrence. Thus, the observed associations between these three items reflect the peculiar hilly terrain of Hong Kong.

To examine the extent to which the original measurement model of the NEWS-A could be generalized to different geographical location, culture, and age groups, confirmatory factor analysis was conducted on the items common to the NEWS-A and NEWS-CS. Similarly to the originally proposed measurement model of the NEWS-CS (see above), to achieve a good level of fit to the data, the initial model of the NEWS-A required two modifications: one related to 'fences separating sidewalks from traffic' being a single item, the other pertaining to 'hilly streets', 'dead-ends' and 'physical barriers to walking' being underlain by a common factor ('Physical barriers to walking'). As noted above, other studies found that the presence of safety fences or dirt strips separating traffic did not consistently co-occur with other perceived environmental attributes measured by the NEWS-A. Therefore, there is considerable support for it being a characteristic that is independent of other aspects of pedestrian infrastructure. The fact that the structure of other subscales of the original NEWS-A was sufficiently valid and the inter-factor correlations were similar to those observed in other samples [17, 19, 21] speaks in favor of the generalizability of the measurement model of the NEWS-A to geographical locations, cultures and age groups markedly different from those for which the NEWS and NEWS-A were developed [21]. The observed higher inter-correlations between three types of physical barriers to walking in Hong Kong would not constitute a problem in conducting cross-location comparative analyses on the associations between perceived environmental characteristics and physical activity behavior. This is because, in such case, these three characteristics would be modeled as three separate predictors rather than a single attribute. Notably, although correlated, these attributes were not highly collinear (the correlations ranged from 0.23 to 0.43), permitting their simultaneous inclusion in regression models.

Limitations and future research

The main limitation of this study pertains to using a convenience sample of Hong Kong residents, all members of the Elderly Health Centres. However, the fact that previous investigations have found this particular group of seniors to be sufficiently representative of the general population is reassuring [29]. Another limitation regards the exclusive focus on Hong Kong. The use of the NEWS-CS in other Chinese and Asian urban locations may require further adaptations to capture environmental facilitators and deterrents of walking relevant to these locations. In this study, neighborhood was defined as an area within 10-15 minute walk from home. It remains to be seen whether this definition of neighborhood is clearly understood and corresponds to that of the respondents. Additionally, the effect of neighborhood size on the associations between the built environment and walking is unclear. There is evidence that different neighborhood geographic boundaries and sizes impact on the strength of associations [40]. Moreover, different definitions of neighborhood may be required to capture environmental effects on walking for different purposes. Thus, future instruments measuring perceptions of the local environment may need to use a different definition of neighborhoods for attributes associated with walking for recreation and those associated with walking for transport.


The main aim of this study was to develop/adapt and validate a self-report measure of neighborhood environmental characteristics related to walking appropriate for Chinese seniors. With the exception of a few destination-related items with limited relevance to the target population, the NEWS-CS showed acceptable levels of test-retest reliability and construct validity. Importantly, the factorial structure of the items common to the original instrument (NEWS-A) corresponded to that obtained in other geographical locations, cultures and age groups [17, 19, 21]. These results are encouraging as they support the feasibility of cross-country and age-group comparisons of the effect of the neighborhood environment on walking. Further validation works needed to establish the generalizability of the NEWS-CS to other Asian locations and test its criterion validity, i.e., the extent to which perceived attributes of the local environment can explain walking for different purposes in Chinese seniors.