1 Introduction

Research on housing prices based on the hedonic model has widely clarified that the properties of houses (square footage, number of bedrooms, house orientation, building age, and so on), the location, and the neighborhood characteristics undoubtedly affect the prices (Goodman, 1978; Rosen, 1974). Typically, these empirical studies state themselves as unique to the experience of a given city and attribute the uniqueness to the distinctive urban spatial structure formed by the interaction between the city’s physical characteristics and the inhabitants’ social needs (Lefebvre, 2003; Lefebvre & Nicholson-Smith, 1991; Lefebvre et al., 1996). However, different case studies in different cities can lead to differences in empirical conclusions. For example, the bid rent model based on monocentric urban form (Alonso, 1964) may have a poor fitting performance for polycentric cities (Wen & Tao, 2015). Likewise, the same kind of urban functional zones in different cities may also affect housing prices differently, such as the impact of hospitals on surrounding housing prices. If hospitals are considered as a type of valuable amenity, then it could increase the commercial values of houses in Shanghai, China Tam et al. ()Tam et al.(2019, 2019), while the negative externalities of hospitals probably depreciate the surrounding housing prices in Beijing, China (Qiao et al., 2021). Meanwhile, no significant impact has been found in Greater Kuala Lumpur, Malaysia (Teck-Hong, 2012).

Regarding the uncertainty or even the conflicting conclusions in various cities, the comparative research is imperative to explore how the differences are caused by the city’s uniqueness. Limited studies discussed the housing price difference at the city-level market is primarily driven by the city’s economic fundamentals and social demographic structure (Mirkatouli et al., 2018; Waltert & Schläpfer, 2010; Wang et al., 2017; Yi & Huang, 2014). But, as Lynch and Rodwin (1958) Lynch et al. (1958) stated, a well-designed community, a good built environment, and adequate service facilities would not automatically produce the perfect settlements in a city. Houses and their prices are also affected by the city’s organization and arrangement from a larger environment. In this vein, we raise a question that, would the urban form and the urban functional zone distribution affect the housing prices within and between cities? If yes, then to what extent do the urban form and the urban functional zones affect the spatial distribution of housing prices? The empirical evidence could tell whether the effects of urban form and urban functional zones on housing prices were underestimated in previous studies. Also, the findings could support the toolsets to formulate the urban housing policies related to planning human-friendly and affordable settlements when constructing a new town or developing the suburbs under fast urbanization in China and other regions.

Therefore, we construct three hypotheses for assessing the effects of urban forms and urban functional zones on housing prices. First, urban forms and urban functional zones are significantly different across cities. Second, housing prices vary between cities. Finally, the housing price variation is affected by differentiated urban forms and urban functional zones. This study selects 10 representative extra-large cities in China as comparative cases, namely, Shanghai, Beijing, Shenzhen, Guangzhou, Chengdu, Hangzhou, Wuhan, Nanjing, Changsha, and Xi’an (sorted by GDP in 2020). Housing estate price data are calculated from historical transactions to estimate the average price per square meter of a housing estate. Indicators of urban functional zones are identified through the points-of-interest (POI) data obtained from Gaode Map, which is the biggest map and navigation service provider in China, through the contextual semantics algorithm of the natural language process. The relevant indicators in urban form are derived from the integration of road network, building footprint, land use data, parcel data, and administrative boundaries. To estimate the exact effects of intra-city and inter-city influencing factors, we apply the hierarchical linear model (HLM), which supports the fitting of multilevel groups of observations.

The rest of this paper is organized as follows. Section 2 introduces the measurements of independent and dependent variables from previous studies and their findings on these influencing factors on housing prices. Section 3 exhibits 10 extra-large cities’ layouts and descriptive characteristics of urban forms and urban functional zones. Section 4 elaborates the steps of hypothesis validation. Section 5 presents the findings and contributions of this study.

2 Urban form and urban functional zones

The understanding and measurements of urban forms and urban functional zones are complex and mixed. And the selection of appropriate measure methods largely depends on the spatial scale of the case studies (Lynch & Rodwin, 1958). At the city scale, the discussion on urban form includes the city’s physical characteristics, such as the size, the shape, the monocentric/polycentric urban structure, the spatial configuration of the urban area (market town, central business district, or suburbs), densities, and the land use patterns (Dempsey et al., 2010). At the same level, the urban functional zone is related to the comprehensive land use for different purposes, such as residential, work, social, commuting, recreational, and administrational activities (Long et al., 2015; Tu et al., 2018). Then, the effects of urban functional zones on housing prices are measured by the distance to the transportation infrastructure (metro/bus stations), the supply of public services, and the proximity to green space, parks, hospitals, and other valuable amenities. In other words, the distribution of urban functional zones represents the relations between the residents’ social needs and the urban spatial forms, and the form of cities is conversely shaped by the needs of residents or societies. Therefore, the indicators of urban forms and urban functional zones must be separately considered and dealt with (see Lynch & Rodwin, 1958 pp. 203).

2.1 POI-based urban functional zone within cities

With the rapid urbanization and increased human activities and their interaction, different urban land parcels have gradually formed diverse divisions of urban functional zones (Lefebvre, 2003). As a proxy, urban functional zones represent a rich mixture of places to accommodate human activities, such as living, working, shopping, dining, and recreations, which are vital for sensing the daily living of citizens and capturing the evolving nature of urban functions (Crooks et al., 2015).

Considering the complexity of urban functional zones derived from human activities and interactions (Batty, 2008), the image feature detection method is the traditional and widely used way of extracting and analyzing land use patterns and urban functional zones and relies on remote sensing images with a high spatial resolution (Cao et al., 2020; Wen et al., 2015; Zhang & Du, 2015; Zhang et al., 2017; Zhong et al., 2015). However, although this method specializes in detecting and reflecting the natural properties and features of ground objects, such as land use types, it fails to capture detailed human-involved functional information and socio-economic characteristics (Liu et al., 2015). These limitations are generally overcome by the emergence of big geographic data, such as POIs (Hu et al., 2020; Jiang et al., 2015; Liu & Long, 2016; Liu et al., 2020; Qian et al., 2021; Yao et al., 2017; Yi et al., 2019; Zhai et al., 2019; Zhang et al., 2017, 2019), social media (Du et al., 2020; Gao et al., 2017; Liu et al., 2015; Qian et al., 2021; Tu et al., 2017; Zhang et al., 2019; Zhi et al., 2016; Zhou & Zhang, 2016), mobile phone and floating car trajectory data (Gao et al., 2019; Hu et al., 2021; Qi et al., 2011; Tu et al., 2017; J. Zhang et al., 2021a, 2021b; Zhong et al., 2014), due to the strength of big geographic data to accurately and large-scale capture the human activities. Numerous studies have exemplified the usability and reliability of methods integrated with such data sources in urban functional zones inference, identification, and detection, despite the requirement of meticulous data processing (Li et al., 2016; Niu & Silva, 2020). The methods, data source, and their pros and cons for urban functional zones identification are summarized in Table 1.

Table 1 Five mainstreams for urban functional zones identification

In recent studies, POI data is the most widely used data source and has four main advantages for urban functional zones identification at a fine-grained scale. First, POI data is highly accessible and available from online map service providers. Second, POI data is highly reliable due to the continuous and multi-source maintenance by commercial map suppliers and volunteer geographical information (Batty et al., 2012). Third, POI data has a fine-grained spatial resolution and a continuous temporal sequence. Lastly, the approximate global coverage of POI data supports large-scale research and global comparative studies. For instance, Jiang et al. (2015) utilized POIs obtained from Yahoo Online and employment data from census data at the aggregate level to identify the land use types at the city block level. Liu and Long (2016), proposed an automatic detection method for urban functional zones by using open resource data, POI data, and road networks from OpenStreetMap.

However, two challenges in using POIs remain (Zhai et al., 2019). First, as the description of POIs has several layers, synonymy and ambiguity are common when directly using these layers’ information. As shown in Appendix Table A1, the bottom layer of food service can be a component in residential or commercial urban functional zones, but the combined semantic signature may be different (such as accommodation service–hotel–food and beverage vs. commercial housing–industrial park–food and beverage). Second, because POI description can only represent the information in the location of the point, redundancy is commonly seen in large plots, like hospitals, parks, airports, or universities. For instance, in addition to one POI with a combined description of tourist attraction–scenic spot–park, the spatial area of a park contains many other ancillary facilities (e.g., ticket office, catering, shopping, sports facilities, and toilets). If only the spatial relationships between these POIs in a given area are considered when inferring the function of the place, redundant POIs would affect the result.

Therefore, we followed the automatic method proposed by Liu and Long (2016) and proposed a hybrid approach to urban functional zones detection to take full advantage of the POI data. The process involves two steps (Fig. 1):

•First, use POI types, which represent single land use, to assign the land use type for urban functional zones.

•Second, use the multi-layer combined descriptions of the POIs reclassified by Sense2VecFootnote 1 to infer the remaining zones with mixed land use.

Fig. 1
figure 1

Flow diagram of detecting urban functional zones

2.2 Urban form indicators at the city scale

Urban form represents the physical form of a city and comprises activities (Lynch & Rodwin, 1958). The definition and measurements of urban form in the literature are very diverse, mainly relying on the scale of the research (Batty, 2008; Batty & Longley, 1994). Huang et al. (2007) Huang et al. (2007) utilized satellite images of 77 metropolitan areas in Asia, the United States, Europe, Latin America, and Australia to quantify the five dimensions (i.e., compactness, centrality, complexity, porosity, and density) of urban form from the physical characteristics of the urban landscape. Clifton et al. (2008) Clifton et al. (2008) divided the quantitative methods of analyzing urban forms into five categories, namely, landscape ecology, economic structure, transportation, community design, and urban design, and sorted them by research scales from regional, metropolitan, sub-metropolitan, neighborhood to the smallest building block. At the city scale, the measures include the urban built-up size, density, and diversity, the urban structure, and the polycentricity. Dempsey et al. (2008) Dempsey et al. (2008) classified urban forms of different scales into five broad and interrelated elements that constitute a specific city and are encompassed by density, transport, housing, land use, and layout. The indicators of urban form at the city scale reflect the ratio of population, households, or housing units to the area of the entire city, including the city’s population density, the household density, the dwellings density, and the city area. Echenique et al. (2012) Echenique et al. (2012) proposed that urban forms (compaction, dispersed, and planned expansion) in English city regions do not have obvious advantages for sustainable development. Conversely, population growth and changes in residents’ lifestyles dominate the impacts on natural environment and resources. In comparative studies of global cities, openness and proximity are often used to represent the fragmentation or compactness of urban form (Dong et al., 2019; Schneider & Woodcock, 2008; Xu et al., 2020).

The frequently used urban form indicators for extra-large cities should be capable of indicating the outcomes that have great significance at the city or metropolitan scale and can be controlled and describe the different effects caused by the rearranged patterns at this scale (Lynch & Rodwin, 1958). In this study, we applied this criterion and utilized indicators to reflect the five dimensions of urban form, including compactness, centrality, polycentricity, dissimilarity, and density.

Compactness (Fig. 2a). Compactness does not only measure the shape of an individual developed block but also the fragmentation of the overall urban landscape (Li and Yeh, 2004; Angel et al., 2017, 2020). The estimation is based on the average comparison between the perimeters of each developed block and a circumscribed circle of the same area. The CI is calculated as follows:

$$Compactness = \frac{{\sum P_{i} /p_{i} }}{{N^{2} }} = \frac{{\sum 2\pi \sqrt {S_{i} /\pi } /p_{i} }}{{N^{2} }}$$

where \({S}_{i}\) and \({p}_{i}\) are the respective area and perimeter of developed block \(i\), \({P}_{i}\) is the perimeter of the circumscribed circle with area \({S}_{i}\), and \(N\) is the total number of blocks. Given two cities with the same amount of development, when city A is more compacted than city B, then the development of B is more evenly distributed than that of A. With a low degree of compactness, the sprawl-like development model is spreading in this city.

Fig. 2
figure 2

Schematic of assessing spatial urban form

Centrality (Fig. 2b). Centrality is the degree of proximity of a city’s developed areas to its urban central business district (CBD) (Galster et al., 2001). Centrality is a measurement of the areal size of a city. One of the most common forms of urban development is the loss of centrality, which results in increased travel distance and time as land development is far from the CBD. The centrality index increases with the decrease of the radius between the developed areas and the CBD. Conversely, the centrality index is low when a city has a great spread. The calculation formula of centrality is as follows:

$$Centrality = \frac{{\sum D_{i} /N - 1}}{R} = \frac{{\sum D_{i} /N - 1}}{{\sqrt {S/\pi } }}$$

where \({D}_{i}\) is the distance of the centroid of developed block \(i\) to the centroid of the city center, \(N\) is the total number of developed blocks, \(R\) is the radius of a circle with area \(S\), and \(S\) is the total built-up area of the city. Therefore, centrality is also sensitive to the shape of the city, that is, whether it is elongated or circular. The narrower the shape of the city is, the greater the centrality index, and vice versa.

Polycentricity (Fig. 2c). Polycentricity reflects the degree to which urban areas are characterized by a polycentric (rather than monocentric) development model. If a city’s CBD is the only dense development zone, then this city has a monocentric structure and its polycentricity index equals one. If land development is scattered in several highly developed areas and each area contains an activity cluster, which accounts for a large part of the total number of such activities in the city, then this area is defined as the sub-center of the city. A polycentric city has a high polycentricity index, which equals to the number of sub-centers of the city.

Dissimilarity of housing development (Fig. 2d). The dissimilarity index refers to the unevenness of housing development. The housing development with a high dissimilarity index indicates the houses gathered at some developed places. While a low dissimilarity index represents houses evenly developed in a city (Schwarz, 2010; Tsai, 2005).

$$Dissimilarity = \frac{{\mathop \sum \nolimits_{i = 1}^{N} \left| {X_{i} - Y_{i} } \right|}}{2}$$

where \({X}_{i}\) is the proportion of land area, \({Y}_{i}\) is the proportion of housing development indicator of block \(i\), and \(N\) is the total number of blocks in a city. A dissimilarity near 1 indicates large spatial differences of housing development between blocks in a city, while a value near 0 indicates the even distribution of housing development. In this study, we utilized residential population as the housing development indicator.

Density (Fig. 2e). Density is generally accepted as a dimension of urban form. Usually, density is calculated by comparing the footprint of buildings to the extracted built-up area. Density indicates the number of residents per square kilometer of developable area in the city. However, building density cannot distinguish between the high-density forms of high- and low-rise buildings. With the enrichment and availability of 3D information, the building area ratio (BAR) has been defined to reflect the dense 3D urban form (Long et al., 2019). The BAR is expressed as follows:

$$BAR = \frac{{\mathop \sum \nolimits_{n} A_{i} }}{S} = \frac{{\mathop \sum \nolimits_{n} P_{i} \times F_{i} }}{S}$$

where \(\sum {A}_{i}\) is the total indoor area of all the buildings within the city. \(S\) is the area of the urban built-up region. \({P}_{i}\) is the footprint of individual building i, and \({F}_{i}\) is the floor number of building i. n is the total number of buildings. A high BAR indicates a dense urban form and vice versa.

2.3 Effects of urban forms and urban functional zones on housing prices

Existing studies have extensively demonstrated the influence of house location and accessibility on the housing prices in a specific city on the basis of hedonic price models. Usually, scholars investigate the positive impact of the distribution of urban functional zones on housing prices with the point-like benefits facilities, such as educational facilities (Wen et al., 2014, 2019), sports facilities (Feng & Humphreys, 2012), parks (Wu et al., 2017), shared bikes (Qiao et al., 2021), transit-oriented-development facilities (Duncan, 2011), and comprehensive public facilities (Yuan et al., 2020). These empirical findings suggest that residents are willing to pay for good amenities. Despite the amenity effects of public facilities that bring premium to houses, the negative externalities of public facilities can also reduce the housing prices. Obnoxious public facilities are those necessary for society but not welcomed by nearby local residents because they have a negative impact on the sanitation environment of neighboring areas. Obnoxious facilities, such as hazardous industrial facilities (Grislain-Letrémy & Katossky, 2014), hazardous chemical facilities (Lee et al., 2008), resettlement houses (M. Zhang et al., 2021a, 2021b), and toxic waste sites (Kohlhase, 1991), have been shown to have negative impacts on housing prices.

However, past studies on the evaluation of the relationship between housing prices and facilities were usually based on the point-like distribution of a single type of facility, measuring the proximity to specific facilities and then exploring the impact of facility-based accessibility on housing prices. One of the shortcomings of the big data measurement of point service facilities is the multi-semantics of the data, as stated in Sect. 2.1. Meanwhile, the urban functional zone is a new representation approach to reflecting the complex and comprehensive human activities in space, taking advantage of the single point-like facilities measurement. The distribution of urban functional zones, such as business zones, working, shopping zones, parks and green spaces, hospital areas, and school catchments, presents the urban structure within a city. Thus, this study involved urban functional zones as a novel indicator to replace the single point-like facilities indicators.

In terms of the urban form’s impacts on housing prices, existing studies proposed a hypothesis that localized urban forms have impacts on the housing prices within the local area. For example, localized high density reduces the surrounding housing prices (Fesselmeyer & Seah, 2018), and increasing the compactness contributes to increased land values and may cause increased housing prices, especially in newly developed urban areas that must accommodate migrant populations (Hamidi & Ewing, 2015). However, with the increase in compactness, transportation costs have dropped faster than the housing costs have increased, resulting in a net decrease in household costs. Whereas Wassmer and Baass (2006) argued that no evidence that centralization would raise the housing prices in the urban area has been found. In view that a single indicator, such as compactness, density, or concentration, cannot fully show the comprehensive physical environment of the city (Galster et al., 2001), and the lack of comprehensive urban form indicators in the past comparative studies limited the understanding to the impact of urban form on urban housing prices. This study attempted to comprehensively measure the urban form of 10 large cities through the five dimensions described in Sect. 2.2 to bridge this research gap.

In general, the improved urban functional zones and urban forms indicators of the two sub-market levels, namely, intra- and inter-cities, help answer the research question on whether the layout of the urban physical environment can regulate the housing market. This study can help urban planners and policymakers have a better understanding of the impact of the urban physical environment on housing prices.

3 Comparative cases

3.1 Extra-large cities

This study measured the distribution of urban functional zones from the intra-city scale and the difference in urban form from the inter-city scale. These cross-level indicators were applied to estimate the impacts of urban forms and urban functional zones on housing prices between cities. We selected Shanghai, Beijing, Shenzhen, Guangzhou, Chengdu, Hangzhou, Wuhan, Nanjing, Changsha, and Xi’an (sorted by GDP), which are 10 major extra-large cities in China, as our research cases (see Fig. 3). Beijing is the capital city of China and the political and cultural center. Shanghai is China’s financial center and an international metropolis. They are the representatives of extra-large cities in the northern and eastern regions of China respectively (Fig. 3a, d). Shenzhen represents the economically developed special administrative region along the southern coast (Fig. 3j). Shenzhen is also regarded as a pioneer city in China’s reform policy, attracting numerous immigrants. The other cities are provincial capitals, representing the most thriving housing markets locally. These ten extra-large cities illustrate the diversity of urban forms and socio-economic developments in China.

Fig. 3
figure 3

Built-up area of the ten extra-large cities. (a) Beijing, (b) Xi’an, (c) Chengdu, (d) Shanghai, (e) Nanjing, (f) Hangzhou, (g) Wuhan, (h) Changsha, (i) Guangzhou, (j) Shenzhen

3.2 Data description

This study involved four datasets. One is the road network data of 2018 obtained from Tianditu, which is an official map supplier certificated by the Land and Survey Department of China (https://map.tianditu.gov.cn). The road network data was used to divide the urban area into small zones for urban functional zone detection. Another primary dataset is the 2018 POI data crawled via the Gaode Map Service (http://map.gaode.com), which is the most widely used map and navigation service provider in China. We obtained 3,603,178 POI records of multiple categories for our study areas. As the category level is upgraded to three layers, a more detailed description is provided by combining three levels of separate Chinese phrases, which have the most integrated information. In our POI dataset, the top-level category has 13 tags, and the bottom-level category has 2,168 tags (Appendix Table A1). Third, the average housing estate price per-square-meter (HEPrice) was calculated from historical transaction records from Nov. 2020 to Oct. 2021. One year’s data could smooth out the seasonal effects of cyclical fluctuation. Housing estate is referred to as Xiaoqu in Chinese. Xiaoqu is a complex of one or tens of residential buildings built by the same property developer or the municipal government. These buildings typically have the same housing qualities and shared facilities, such as gardens, swimming pools, courts, and fitness equipment. Thus, the price of housing estates enabled this study to overcome the impact of the housing structure attributes, thereafter focusing on the impact of urban forms and urban functional zones. The historical housing estate price is the “asking price”, which was obtained from Lianjia, Anjuke, and Fangtianxia, which are three national real estate platforms in China, and the housing estates were counted only when their housing transactions exceeded three during the past year. The descriptive statistics of the average per-square-meter housing estate price in extra-large cities are shown in Table 2. Figure 4 illustrates the spatial distribution of the standardized housing prices for each city. The red color represents areas with high local housing prices, and the blue color represents areas with low housing prices. Finally, 47,316 housing estates in the 10 extra-large cities have validated average prices after data cross-validation from three platforms. Lastly, the social-economic data of the cities were obtained from China’s National Statistical Yearbook 2020 (China, 2020).

Table 2 Descriptive statistics of average per-square-meter housing estate price in the ten extra-large cities
Fig. 4
figure 4

Interpolated housing prices. (a) Beijing, (b) Xi’an, (c) Chengdu, (d) Shanghai, (e) Nanjing, (f) Hangzhou, (g) Wuhan, (h) Changsha, (i) Guangzhou, (j) Shenzhen

3.3 Potential determinants

Several potential influencing factors possibly affect the HEPrice even after internalizing the housing structure attributes’ effects. According to previous literature reviews, we classified three categories of indicators to evaluate the cross-scale potential determinants of housing estate price (see Table 3).

Table 3 Variables and statistics based on three categories of spatial scales

Category 1 is the individual category, including the structure and location attributes of specific housing estates. Estate building year has a significant impact on housing price due to the bank loan policy in China. A high down payment is associated with an older house, a shorter load period, and a lower available loan amount. The increasing burden of down payment with the increased building year would intensely affect buyers’ choice and thus be reflected in the pricing strategy of sellers. Furthermore, the location attributes have a wide impact on the capability of accessing urban service opportunities and access to urban services may increase house values, as evidenced by higher prices for homes close to transit in previous studies (Duncan, 2011; Shen & Karimi, 2017).

Category 2 attempts to explore the effects of the distribution of urban functional zones in intra-city, which is an extension of the impact of housing estates on the neighborhood attributes. The proximity to urban functional zones represents the time cost and expense burden of reaching potential destinations to obtain urban services, such as medical service, educational resources, and physical exercise amenities. The proximity indicators of urban functional zones, including working zones, shopping centers, commercial business zones, parks and green spaces, 3A hospitals, and primary and middle schools, are selected as the level 1 indicators (Qiao et al., 2021; Tam et al., 2019; Wang et al., 2017; Wen et al., 2017; Zhang et al., 2023).

Category 3 covers the city-level attributes of the socio-economic and physical urban form. The physical urban form indicators include the compactness, centrality, polycentricity, dissimilarity, and density. The details are presented in Sect. 2. The socio-economic factors include the Gini coefficient of population, the Moran index of education, and the GDP per capita, (Tsai, 2005; Villar & Raya, 2015; Zhang et al., 2016). The Gini coefficient of population represents the unequal population distribution in the metropolitan area. A high Gini coefficient approaching 1 indicates a high unevenness of population. The Moran index of education ranges from -1 to 1, illustrating the clustering pattern for the low-educated population (educated in primary school or below). If the value approaches 0, then the pattern is randomly scattered, while an absolute value near 1 represents a clustered pattern. This indicator shows whether low education, as a proxy of poverty, is agglomerated. The GDP per capita indicates a city’s level of economic development. As the urban form indicator of building density is collinear with the population density, we omitted it. The final variables and statistics are shown in Table 3. The variance inflation factor (VIF) of the final variables is provided in Appendix Table A3. The VIFs of majority of variables are less than 5 and all VIFs are less than 10 indicating there is no significant multicollinearity between the final variables.

3.4 Hierarchical regression model

The traditional ordinary least squares (OLS) hedonic model has been criticized for ignoring the existence of housing market segmentation (Goodman & Thibodeau, 1998, 2003). In short, a house occupies a unique location in space, and the geographic attributes tied to its location cannot be replicated. The demand for houses in certain locations may be very inelastic, such as houses near prestigious schools (Wen et al., 2017). The irreplaceability of different housing locations shows that houses are essentially nested in spatial submarkets with different pricing premiums (Orford, 2000). Therefore, to solve the fixed effects of intra-city and inter-city submarkets, the hierarchical linear model (HLM) is introduced. In the first level, the individual housing estate attributes of proximity to urban functional zones are measured (see Table 3, “Level” column). This level reflects the housing values nested in intra-city functions. In the second level, the inter-city attributes of the urban physical forms and the city-based socio-economics of the habitants are assessed. The formulas of the two-level HLM used in this study are as follows (Goodman & Thibodeau, 2003):

$$Level_{1} :P_{{ij}} = \alpha _{{0j}} + \alpha _{{1j}} \left( {x_{{1ij}} } \right) + \alpha _{{2j}} \left( {x_{{2ij}} } \right) + \cdots + \alpha _{{pj}} \left( {x_{{pij}} } \right) + e_{{ij}} ,e_{{ij}} \in N\left( {0,\;\sigma ^{2} } \right),$$

where \({P}_{ij}\) is the logarithm-transformed HEPrice of estate i within city j; \({\alpha }_{pj}\) (p = 0, 1, 2, …, p) is an intercept and regression coefficient. \({x}_{1ij}\) is the level 1 variable that represent individual housing estate attributes. \({e}_{ij}\) is a random error.

$$Level_{2} : \alpha_{pj} = \beta_{p0} + \beta_{p1j} \left( {y_{1j} } \right) + \beta_{p2} \left( {y_{2j} } \right) + \cdots + \beta_{pq} \left( {y_{qj} } \right) + \varepsilon_{j}$$

where \({\beta }_{pq}\) (q = 0, 1, 2, …, q) is the level 2 intercept and regression coefficient, and \({y}_{qj}\) is the level 2 variable related to urban form indicators. \({\varepsilon }_{j}\) is the random error at level 2.

4 Results

4.1 Disparities of urban forms and urban functional zones in ten extra-large cities

The visualized profiles of urban forms and urban functional zones illustrate the disparities between the 10 extra-large cities in Fig. 5 (see details in Appendix Table A2). From the different dimensions of measurements, clear disparities of urban forms and urban functional zones can be witnessed. For instance, Shanghai and Beijing both represent similar short distance to urban functions which benefits from their urban form of low compactness, low centrality and their six subcenters of working zones (Fig. 5a, d). However, the urban dwelling density shows that Shanghai developed as a high-density sprawl and Beijing maintained a relatively low-density. Shenzhen, which is also a polycentric developed city with five subcenters, has the highest centrality index among the 10 cities but the lowest dissimilarity index. This result can be attributed to the limitation of the developable land resource. Shenzhen had to spread along the narrow and long administrative boundary (Fig. 3j), presenting an urban form of high-density and a polycentric sprawl. Similarly, Wuhan and Changsha’s urban built-up area are divided into parts due to the segmentation by lakes and rivers and thus formed a resource-limited sprawl pattern with a monocentric urban form (Fig. 3g, h). However, Changsha has a relatively high dissimilarity of housing development, causing this city to have a high-dissimilarity sprawl pattern compared to Wuhan, which becomes a low-centrality sprawl city. These two cities also belong to the monocentric urban form. Furthermore, the monocentric structure increases the proximity to urban functions compared to the polycentric city. Chengdu has the same compactness and centrality as Wuhan and Changsha but more subcenters. Thus, Chengdu has a shorter distance to urban functional zones with a typical polycentric sprawl with high-density and low-centrality characteristics (Fig. 3c).

Fig. 5
figure 5

Visualizing indicators of urban forms and urban functional zones between extra-large cities

Through the differences in these indicators, we can validate the first research hypothesis of this study, that is, significant differences in urban forms and functional zone distribution exist between these study cities. The disparities are formed under the influence of a series of complex factors as a result of land resources, urban planning, urban economy, and population structure.

4.2 Variation of housing prices in ten extra-large cities

Table 4 presents the analysis of variance of the HEPrice within and between the 10 cities through one-way ANOVA. The analysis compares the means of the HEPrice between the study cities and determines whether any of the housing price means are statistically, significantly different from each other. To specifically verify whether the majority of the cities exhibit a difference in HEPrice, we further performed post-hoc pairwise comparisons, that is, Fisher–Hayter pairwise comparisons, which are fit for the unequal cell sizes between groups. This comparison could help determine which specific cities differed from each other. The result shows that most of the cities have significant differences in HEPrice (Appendix Table A4), validating our second hypothesis, that is, housing prices vary between cities.

Table 4 Analysis of variance of average per-square-meter housing estate price (HEPrice) within and between extra-large cities

4.3 Effects of urban forms and urban functional zones on housing price

To evaluate the effects of urban forms and urban functional zones on the housing prices between cities, we constructed three different models based on the HLM as shown in Table 5. Model 1 only includes the building year as a baseline model, and Model 2 includes all level-1 variables. Model 3 controls both level-1 and level-2 indicators. The OLS model is a comparative model applied to the OLS method. Fitness indicators AIC and BIC show that Model 2 has a higher fit performance than baseline Model 1, and Model 3 has the highest fit performance among the models, indicating that the HLM with all variables is appropriate for this study.

Table 5 Effects of urban forms and urban functional zones on housing estate prices at multi levels

In the comparison of Models 2 and 3 in Table 5, the residential intra-class correlation reveals that the inter-city differences explain 76.5% of the variance of HEPrice. This answers exactly how much of the variance of HEPrice is contributed by the city’s uniqueness. Model 3 also validates our third hypothesis that housing price variation is partially produced by differentiated urban forms and functions. Specifically, the HEPrice would decrease significantly (− 10.996***) with the increase of compactness, while centrality is positively associated with the housing price (1.287***). Polycentricity stimulates the city-level housing price (0.180***) because more working subcenters could attract migrants, thereby increasing the demands of residential settlements (Wen & Tao, 2015). The dissimilarity index is positively associated with the city’s HEPrice (1.458***), indicating that the uneven land development of the city is associated with high housing prices on the overall level. This result supplements the evidence of the impact of polycentricity on housing prices from another dimension. Balanced development was achieved through the allocation of state investment, which often drives the construction of new satellite cities/towns (Liu et al., 2018). The density index negatively affects the HEPrice (− 0.232***), suggesting that a high dwelling density is related to a low city-level housing price. This result is inconsistent with the previous intra-city evidence of housing premiums induced by local density (Kulish et al., 2012). High-density areas at the local level are often urban CBDs where the affluent are attracted to live nearby, thereby increasing the housing prices. Conversely, for city-level density comparison, high urban density typically means a high level of house supply at the overall level. When controlling other variables, such as the GDP and the population structure, a large amount of supply would depreciate the values of individual houses if the demand is fixed.

In terms of urban functional zones, Model 3 shows that only urban functions of working zones (− 0.092***), 3A hospitals (− 0.044***), and primary and middle schools (− 0.084***) have premiums on housing prices after controlling the city’s differences. The findings are consistent with most of the previous empirical evidence that working opportunities, convenient commuting, and valuable amenities, like hospitals and schools, have capitalization effects on housing prices (Wang et al., 2017; Wen et al., 2017). Conversely, proximity to shopping centers likely inhibits the surrounding housing values (0.061***). This result is consistent with that of a previous study where large-scale and high-end shopping complexes have a significant negative impact on the housing prices in core urban areas (Zhang et al., 2020).

5 Conclusions

Although previous studies have affirmed the contribution of a city’s uniqueness to the housing market in that city, this study supplements the empirical evidence of the influence of urban forms and urban functional zones on the differentiated performance of the housing market on multiple scales (within and between cities). Our findings indicate that the difference in housing prices across cities is not only attributable to the city’s uniqueness in the social and economic environment (Mirkatouli et al., 2018; Waltert & Schläpfer, 2010; Wang et al., 2017; Yi & Huang, 2014) but also to the urban form, which is led by urban growth strategies.

For the different urban forms in the five dimensions, our results show that land use development policies for low compactness, high centrality, high polycentricity, low density, and high dissimilarity in housing development may significantly increase the average housing price of the city’s housing market. This phenomenon indicates that paying attention to the housing unaffordability within the cities due to the influences of house location should also regulate the housing market on the city level through appropriate urban planning and design. For example, when considering developing a new urban sub-center, constructing high-density communities could provide more houses for migrants and reduce the housing prices (Fesselmeyer & Seah, 2018). Our research proposes to examine the impacts of the compactness, centrality, polycentricity, and dissimilarity in housing development and the density on housing prices in the evaluation of urban development projects.

For the distribution of urban functional zones within the cities, housing prices are affected by the spatial distribution of urban functions is well studied in previous studies (Qiao et al., 2021; Tam et al., 2019; Wang et al., 2017; Wen et al., 2017). But our finding provides a unified perspective for such single-city housing price studies (Shen & Karimi, 2017). After controlling the city as an influencing factor of housing price differences, working centers, high-quality hospitals, and schools can increase the housing prices of neighboring areas.

This research has its limitations. The spatial form of a city is always more or less related to its level of economic development. The world’s most economically developed global cities (e.g., Chicago, Tokyo, Hong Kong, New York, Shanghai) are high-density cities with high housing prices. We cannot rashly infer that if a city is actively designed and built into a certain type of form, it will automatically push up or decrease local housing prices. The focus of this research is to answer whether the design and planning of the city’s physical facilities and urban functional zones’ layouts can certainly affect the housing prices if the social and economic conditions are equivalent to other cities. The findings emphasize that the moderating role of urban design and planning in social inequality caused by the housing unaffordability is feasible and our conclusions highlight a series of urban design indicators as a toolbox for planners and policymakers.