Effect of data homogenization on estimate of temperature trend: a case of Huairou station in Beijing Municipality
Daily minimum temperature (Tmin) and maximum temperature (Tmax) data from Huairou station in Beijing for 1960 to 2008 are examined and adjusted for inhomogeneities using the data of two nearby reference stations. Urban effects on the linear trends of the original and adjusted temperature series are estimated and compared. Results show that relocations of the station cause obvious discontinuities in the data series, and one of the discontinuities in Tmin, associated with the move of the station from downtown to the suburb in 1996, is highly significant. The daily Tmin and Tmax data are adjusted for the inhomogeneities. The mean annual Tmin and Tmax at Huairou station drop by 1.377°C and 0.271°C, respectively, after homogenization. The adjustments for Tmin are larger than those for Tmax, especially in winter, and the seasonal differences of the adjustments are generally more obvious for Tmin than for Tmax. Urban effects on annual mean Tmin and Tmax trends are −0.004°C/10 year and −0.035°C/10 year, respectively, for the original data, but they increase to 0.388°C/10 year and 0.096°C/10 year, respectively, for the adjusted data. The increase is more pronounced for the annual mean Tmin series. Urban contributions to the overall trends of annual mean Tmin and Tmax reach 100% and 28.8%, respectively, for the adjusted data. Our analysis shows that data homogenization for stations moved from downtowns to suburbs can lead to a significant overestimate of rising trends of surface air temperature, and this necessitates a careful evaluation and adjustment for urban biases before the data are applied in analyses of local and regional climate change.
Homogeneous time series of climate variables are defined as those that contain only climatic variation and regional trend information. It is generally recognized that long-term climatic trends can be accurately detected only from homogenized data series. However, changes in observing sites, instruments, observing schedules, observing habits, and the micro-environment around the observational grounds can create discontinuities in observational records, especially in surface air temperature (SAT) records. Inhomogeneous data may bias estimates of climatic trends and, in some circumstances, lead to inaccurate detection of regional climate change (Jones et al. 1986; Easterling and Peterson 1995a; Yan et al. 2001; Ren et al. 2005; Menne et al. 2010). Therefore, researchers commonly examine and adjust inhomogeneities before analyzing long-term SAT trends at single sites or on regional scales, combining various mathematical methods with station metadata (e.g., Jones et al. 1986; Easterling and Peterson 1995a, b; Alexandersson and Moberg 1997; Aguilar et al. 2003; Menne and Williams 2005).
The inhomogeneities caused by changes in observing schedule, instrumentation, and station location were adjusted in the SAT data of the US Historical Climatology Network (USHCN) (Karl and Williams 1987; Quayle et al. 1991; Easterling and Peterson 1995a, b; Menne and Williams 2009). Vincent (1998) and Vincent et al. (2002) adjusted the Canadian Tmin and Tmax series using a multiple linear regression method. Researchers from other countries and regions have also created their own homogeneous SAT datasets using methods such as the Easterling and Peterson method, the Standard Normal Homogeneity Test, Two-Phase Regression, the Penalized Maximal t Test, and Multiple Analysis of Series for Homogenization (MASH) (Wang et al. 2007a; Aguilar et al. 2003; Reeves et al. 2007).
Chinese researchers have also studied SAT data homogenization (Song et al. 1995; Zhai and Eskridge 1996; Liu 2000; Yan et al. 2001; Yan and Jones 2008; Li et al. 2004; Li and Dong 2009; Li and Yan 2010). For example, using three different tests for undocumented change points, Li et al. (2009) estimated the artificial discontinuities in annual mean daily Tmin and Tmax in southeastern China and found more discontinuity points in the annual mean Tmin series; Li and Yan (2010) applied the MASH method to detect and adjust inhomogeneities in the daily SAT series of 1960–2006 at Beijing station.
Granting that the adjustments are accurate and applicable for monitoring and detecting regional climate change, a few issues nevertheless remain to be solved. One is what effect the homogenization will have on the estimated long-term SAT trends at a single station or in a large region. If the SAT trends differ significantly between the original and adjusted data series, what are the underlying causes? Does the adjustment for inhomogeneities significantly recover the urban bias when the breakpoints are mostly caused by the moves of stations from urban areas to rural areas, as previously suggested by Winkler et al. (1981) and Hansen et al. (2001) for the United States and recently by Ren et al. (2010) for mainland China? Answering these questions will deepen our understanding of the systematic biases of the SAT data and their influences on the estimates of the magnitude and rate of local and regional temperature change.
In this paper, we examine the effect of data homogenization on the estimate of Tmin and Tmax trends at Huairou station of Beijing Municipality (BM). We first examine the temperature data for inhomogeneities and adjust the breakpoints identified to obtain homogenized SAT data; we then compare the linear trends of the original and adjusted SAT series and analyze the urban effects on both sets of trends using the temperature data of the same reference stations.
2 Data and methods
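[Table: Information of the weather stations used in this study. The table body was not recovered; surviving column headings are "Start time of record" and "Population in 2000 (10^3)", and one surviving entry reads "Less than 1.0".]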
Huairou is a rapidly growing small city located in the northern mountainous area of the BM, with a population of about 75 thousand in the urban area in 2000 (Fig. 1a). Huairou weather station is a typical urban station (Fig. 1b) and serves as the target site in this paper: its most recent 49 years of records are evaluated for data inhomogeneities and for urbanization effects on the SAT trends. Although Xiayunling and Shangdianzi stations are also located in mountainous areas, both are far from larger residential areas; the former lies in a valley in the southwest and the latter on a slope in the northeast, near a small village with a population of no more than a thousand (Fig. 1a). The two sites are chosen as the reference stations from 20 weather stations with records longer than 30 years in the BM. In addition to a small population in the residential areas near the stations and physiographic characteristics similar to those of the target station, the reference stations are required to have continuous observational records with as few missing values as possible. The two weather stations were previously used as reference stations in studies of the urbanization effect on the SAT trends of Beijing station (Chu and Ren 2005; Ren et al. 2007).
Inhomogeneities of SAT data can be caused by such factors as changes in instrumentation, relocation, changes in observational time, and modified statistical methods for daily averages. The introduction of automatic weather stations (AWS) to operational observations around 2004 in mainland China may to a certain extent have resulted in additional inhomogeneities in SAT records. Wang et al. (2007b) indicated, however, that although the SAT measured by AWS differs somewhat from that of manual weather stations, the overall difference is small and not significant. No change in observational time or in the statistical method for daily mean SAT occurred during the last 50 years, so these factors will not cause any detectable inhomogeneities in the SAT data. It has been recognized that the most important factor causing inhomogeneities of SAT data in mainland China is the frequent relocation of stations (Yan et al. 2001; Li et al. 2004; Ren et al. 2005).
Huairou station was relocated twice. It was first moved from the West Gate of the old town (Site 1 in Fig. 1b) to Beitumenzi (Site 2 in Fig. 1b) on the East Gate Road outside the old town on 1 August 1964. The second move occurred on 1 July 1996, from Beitumenzi to a suburban village called Liugezhang (Site 3 in Fig. 1b), about 5.5 km from the center of the old town (BMB 2009). For the two reference stations, on the other hand, the only move was that of Shangdianzi station on 1 September 1989; the horizontal distance of the move was 750 m, and the elevation of the observational grounds changed from 255 m above sea level (ASL) to 293 m ASL, an increase of 38 m.
The data are quality-controlled in the following steps: (1) if a maximum temperature (Tmax) value is lower than the minimum temperature (Tmin) value of the same day, it is registered as an unreasonable reading. There is no unreasonable record in the SAT dataset of Huairou station; (2) values lying beyond four standard deviations are marked as outliers. If outliers are detected, the reasonable records are retained, and the unreasonable ones are corrected or regarded as missing values, based on a comparison with the records of the neighboring stations. Only one outlier is found in the dataset, and it is not unreasonable; (3) missing values, which account for less than 0.25% of the total records, are filled in with the means of the same station for the reference period 1971–2000.
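The three checks can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure actually coded for this study: the function names are hypothetical, missing values are represented by None, the sample over which the standard deviation is computed is not specified in the text (here it is simply the whole input list), and the reference-period means are assumed to be supplied as a list aligned with the data.

```python
from statistics import mean, stdev


def flag_inverted(tmin, tmax):
    """Step 1: indices of days on which Tmax is lower than Tmin."""
    return [i for i, (lo, hi) in enumerate(zip(tmin, tmax))
            if lo is not None and hi is not None and hi < lo]


def flag_outliers(values, k=4.0):
    """Step 2: indices of values more than k standard deviations from the mean.

    Assumption: mean and standard deviation are taken over all non-missing
    values passed in; the text does not say which sample was used.
    """
    present = [v for v in values if v is not None]
    m, s = mean(present), stdev(present)
    return [i for i, v in enumerate(values)
            if v is not None and abs(v - m) > k * s]


def fill_missing(values, reference_means):
    """Step 3: replace missing values by the station's reference-period mean.

    `reference_means` is a hypothetical list, aligned with `values`, holding
    the 1971-2000 mean that applies to each position.
    """
    return [ref if v is None else v for v, ref in zip(values, reference_means)]
```

Flagged outliers would still need the manual comparison with neighboring stations described above before being corrected or set to missing; that judgment step is not automated here.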
Discontinuities in the annual difference series between the target station and the reference stations are detected using the moving t test. As mentioned above, Huairou station was moved in 1964 and 1996. Because the dataset starts in 1960, only a few years before the first move, the sub-series length is set to 3 years so that the discontinuities due to both relocations can be identified. Therefore, the series length is n = 49, the sub-series lengths are n1 = n2 = 3, and the significance level is α = 0.01. The metadata are used to validate the detected discontinuities, which are adjusted if they prove to be real and caused by relocation. Otherwise, the original records are kept unchanged.
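A minimal sketch of a moving t test with these settings is given below. It assumes the common pooled-variance form of the two-sample t statistic, which the text does not spell out, and the function names are hypothetical. The input would be the annual difference series (target station minus reference series). For n1 = n2 = 3 the statistic has n1 + n2 − 2 = 4 degrees of freedom, for which the two-sided critical value at α = 0.01 is 4.604.

```python
from math import sqrt
from statistics import mean, variance

# Two-sided critical t value for alpha = 0.01 and 4 degrees of freedom.
T_CRIT = 4.604


def moving_t(series, n1=3, n2=3):
    """Return {i: t}, where i indexes the first value of the later sub-series."""
    stats = {}
    for i in range(n1, len(series) - n2 + 1):
        before, after = series[i - n1:i], series[i:i + n2]
        # Pooled variance of the two sub-series (assumed form of the test).
        pooled = ((n1 - 1) * variance(before)
                  + (n2 - 1) * variance(after)) / (n1 + n2 - 2)
        if pooled == 0:
            continue  # statistic undefined when both windows are constant
        stats[i] = (mean(before) - mean(after)) / sqrt(pooled * (1 / n1 + 1 / n2))
    return stats


def candidate_breakpoints(series, n1=3, n2=3, t_crit=T_CRIT):
    """Indices at which |t| exceeds the critical value."""
    return [i for i, t in moving_t(series, n1, n2).items() if abs(t) > t_crit]
```

A candidate returned by this sketch is only a statistical signal; as stated above, it is treated as a real discontinuity only when the station metadata confirm a relocation.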
The 5-year averages of the monthly mean SAT differences between the target station and the reference series are taken as the adjustment values. If fewer than 5 years of records are available before or after a discontinuity, all the available years are used to determine the adjustment values. The adjustments for inhomogeneities are made on the basis of the daily SAT data. The daily adjustment values are obtained by linear interpolation, with the monthly mean adjustment values assigned to the mid-month days (the 15th or 14th) of neighboring months.
The sections of data after the last documented inhomogeneous points are taken as the base series, and they remain unchanged. Before the inhomogeneous points, the adjustment values are added to the original records for every day.
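The adjustment procedure described above can be sketched as follows. Several details are assumptions rather than statements from the text: the adjustment for a month is taken as the mean target-minus-reference difference after the discontinuity minus the same mean before it, the 14th is used as the mid-month day for February only, and the 29th of February is left out for brevity. All function names are hypothetical.

```python
from datetime import date


def monthly_adjustment(diff_before, diff_after):
    """Assumed form: mean(target - reference) after the break minus the mean before."""
    return sum(diff_after) / len(diff_after) - sum(diff_before) / len(diff_before)


def daily_adjustments(monthly_adj, year=2001):
    """Interpolate 12 monthly values linearly to 365 daily values.

    Monthly values are anchored on the 15th of each month (14th in February,
    an assumption); `year` is any non-leap year used only to count days.
    """
    anchors = [(date(year, m, 14 if m == 2 else 15).timetuple().tm_yday,
                monthly_adj[m - 1]) for m in range(1, 13)]
    # Wrap around the year end so December and January interpolate smoothly.
    points = ([(anchors[-1][0] - 365, anchors[-1][1])] + anchors
              + [(anchors[0][0] + 365, anchors[0][1])])
    daily = []
    for day in range(1, 366):
        for (x0, y0), (x1, y1) in zip(points, points[1:]):
            if x0 <= day <= x1:
                daily.append(y0 + (y1 - y0) * (day - x0) / (x1 - x0))
                break
    return daily


def adjust_segment(values, days_of_year, daily_adj):
    """Add the daily adjustment to each record that precedes the discontinuity."""
    return [v + daily_adj[d - 1] for v, d in zip(values, days_of_year)]
```

With two documented relocations, the same steps would be applied once per discontinuity, working backward from the unchanged base series.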
ΔTur can also be estimated by calculating the annual and monthly mean SAT differences between the urban station and the reference series and then taking the linear trend of the difference series over the period analyzed. In this paper, the annual mean difference series of Tmin and Tmax between Huairou station and the average reference series are constructed, and their linear trends for 1960–2008 are estimated by the least-squares method and tested for statistical significance with a t test.
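As an illustration of this step, the sketch below fits an ordinary least-squares line to an annual difference series and returns the trend in °C per 10 years together with the t statistic of the slope. The function name is hypothetical, and the t statistic is the standard slope-over-standard-error form with n − 2 degrees of freedom, which is assumed here to correspond to the t test mentioned above.

```python
from math import sqrt


def decadal_trend(years, diffs):
    """Least-squares trend of `diffs` against `years`.

    Returns (trend per 10 years, t statistic of the slope).
    """
    n = len(years)
    x_mean = sum(years) / n
    y_mean = sum(diffs) / n
    sxx = sum((x - x_mean) ** 2 for x in years)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(years, diffs))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    # Residual sum of squares and standard error of the slope.
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(years, diffs))
    std_err = sqrt(sse / (n - 2) / sxx)
    t_stat = slope / std_err if std_err > 0 else float("inf")
    return slope * 10, t_stat
```

The t statistic would then be compared with the critical value for n − 2 degrees of freedom at the chosen significance level to decide whether the urban effect is significant.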
Generally, ΔTur/Tu is a positive value not exceeding 100% (0 ≤ Eu ≤ 100%); the absolute value is taken because, in certain circumstances, the ratio becomes negative owing to effects other than increasing urban heat island (UHI) intensity. If Eu = 100%, the SAT trend of the urban station is entirely caused by urbanization; if Eu is more than 100%, the extra trend might have been caused by other local factors not yet identified or by data errors, and Eu is regarded as 100% in this study. As the definition implies, the urban contribution is not calculated if the urban effect is not statistically significant.
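The rules in this paragraph can be written as a small function. The formula used, Eu = |ΔTur/Tu| × 100%, is inferred from the description above (the ratio ΔTur/Tu, its absolute value, and the cap at 100%) rather than quoted from an equation, and the function name and the numbers in the usage lines are hypothetical.

```python
def urban_contribution(urban_effect, station_trend, significant):
    """Urban contribution Eu in percent, or None when it is not calculated.

    urban_effect  : trend of the urban-minus-reference difference series
    station_trend : overall trend at the urban station (same units)
    significant   : whether the urban effect is statistically significant
    """
    if not significant or station_trend == 0:
        return None  # not calculated for a non-significant urban effect
    eu = abs(urban_effect / station_trend) * 100.0
    return min(eu, 100.0)  # values above 100% are regarded as 100%


# Hypothetical values, in degrees C per 10 years, for illustration only.
example_partial = urban_contribution(0.1, 0.4, significant=True)
example_capped = urban_contribution(0.5, 0.4, significant=True)
example_skipped = urban_contribution(-0.004, 0.2, significant=False)
```

The guard against a zero station trend is an addition for numerical safety and is not part of the definition described in the text.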
3 The results
3.1 Detection and adjustment of data inhomogeneities
No discontinuity is detected in the SAT data series of the two reference stations, despite the relocation of Shangdianzi station in September 1989. This is mainly due to the relatively small change in altitude and environment. No adjustment is therefore made, and the quality-controlled data are used to produce a single, averaged reference series.
The mean and variance of annual Tmin and Tmax of Huairou station before and after adjustment during 1960–2008 (degrees Centigrade)
Urban effects on the Tmin and Tmax trends of the Huairou station (degrees Centigrade per 10 years) and the urban contribution to the overall temperature trends (percent) for the data series before and after adjustment for period 1960–2008
The annual mean Tmin and Tmax values are both reduced after the adjustments. There are two reasons for the decrease: the adjustments take the section of the data series at the present observation site as the baseline, and the adjusted sections before the last relocation are, in combination, longer than the latest section of records. Once again, the reduction of the annual mean Tmin is significantly larger than that of the annual mean Tmax.
3.2 Urban biases in adjusted and original data series
Table 3 gives the urban effects and urban contributions at Huairou station for 1960–2008 for the data series before and after the adjustments. The urban effects are −0.004°C/10 year and −0.035°C/10 year, respectively, for Tmin and Tmax before the adjustments, neither of which is statistically significant, but they increase to 0.388°C/10 year and 0.096°C/10 year, respectively, after the adjustments, both statistically significant at the 0.01 significance level. The increase in the annual mean urban warming trend is larger for the adjusted Tmin series.
Urban contributions to the overall temperature trends for Huairou station are not estimated for the original annual mean Tmin and Tmax series due to the non-significance of the urban effects, but they reach 100% and 28.8% for the adjusted annual mean Tmin and Tmax series, respectively. After the data adjustment for inhomogeneities, the positive trend of annual mean Tmin at Huairou station during 1960–2008 can be totally explained by the urban effect, and almost a third of the warming trend observed for annual mean Tmax at the station can be attributed to the urban effect.
Relocations of weather stations from downtowns to suburbs have been a common practice in mainland China during the past decades, especially for the national reference climate stations and national basic meteorological stations (Li et al. 2004; Ren et al. 2010), which have been most frequently used in analyses of regional climate change. This is mainly due to the closeness of the weather stations to the built-up areas of cities and towns and to the unprecedented urbanization over the past decades in mainland China under rapid economic growth (Ren et al. 2008). Our analysis and findings for Huairou station are therefore to some extent representative of the SAT datasets commonly used in studies for the country.
The frequent relocations of stations usually cause obvious inhomogeneities in SAT data, which require homogenization before long-term temperature trends can be analyzed. However, the adjustment may change the estimates of mean SAT trends at single stations or even on a regional scale and may lead to an overestimate of the warming rates for the stations or the regions. This phenomenon was pointed out in previous studies (Hansen et al. 2001; Menne et al. 2009; Ren et al. 2010) but has not been specifically examined. Winkler et al. (1981) found, however, that the homogeneity-adjusted SAT data depict a larger UHI intensity and UHI extent in the urban area of Minneapolis–St. Paul, Minnesota. They adjusted the data inhomogeneities induced by changes in observational time and station location. Our analysis for Huairou station shows that the higher warming rates estimated from the homogeneity-adjusted data series, compared with the original data series, mainly result from the recovery of the urban warming trends. The regained warming trends, especially for the annual mean Tmin, are caused by the enhanced urban effect near the first location of the city station, which now lies in the center of the built-up area as a result of urbanization. The overall trend and the urban effect in the annual mean Tmax series also increase after the homogenization, but the changes are much smaller.
Further investigations are needed to understand to what extent the data homogenization of the national reference climate stations and national basic meteorological stations in mainland China has affected the estimates of large-scale SAT trends. It is reasonable to assume that the effect of data homogenization on the estimates of SAT trends and urban biases for the country as a whole would be more moderate than that reported for Huairou station in this paper. It should not be overlooked, however, because a common practice is to relocate weather stations from built-up areas to suburbs or the countryside when they are regarded as less representative for monitoring the baseline climate. Such relocations result in obvious inhomogeneities in the SAT data series in mainland China, and the affected series are generally regarded as unsuitable for studies of climate change unless they are adjusted for homogeneity. If the homogenization significantly affects the SAT trends for part or even the majority of the stations in the country, the urban biases in the homogenized SAT data series have to be more carefully assessed and adjusted before the data can be confidently used in analyses of climate change.
The issue is also relevant to a few questions that puzzle researchers of climate change. One is the understanding of the different trends of Tmin and Tmax over the continents. The “asymmetry” in the increases of the Tmin and Tmax series and the resulting decline of the diurnal temperature range (DTR) have been reported for many regions (e.g., Karl et al. 1993; Xie and Cao 1996; Zhai and Pan 2003; Qian and Lin 2004; Choi et al. 2009; Zhou and Ren 2011). The changes have been related to the increase in cloud cover and precipitation worldwide and in aerosols over some regions (Dai et al. 1999; Easterling et al. 2000). However, Zhou and Ren (2009, 2011) found a larger urban contribution to the “asymmetry” in the Tmin and Tmax increases and to the decrease of the DTR in North China. The analysis based on the homogeneity-adjusted data in this paper also shows that the significant increase in annual mean Tmin at Huairou station might be completely explained by urbanization, and the increase in annual mean Tmax might be partially caused by urbanization, generally consistent with the conclusions drawn by Zhou and Ren (2009, 2011) for North China and by Zhang et al. (2011) for Beijing station.
The adjusted annual mean Tmin and Tmax drop by 1.377°C and 0.271°C, respectively, and the adjustment values for Tmin are significantly larger than those for Tmax. The relocations of Huairou station from downtown to the suburb, especially the second move in 1996, cause a more significant drop in annual mean Tmin. The drops in monthly mean Tmin are larger in winter than in summer.
The data homogenization for the station relocations from downtown to suburb at Huairou station leads to an increase in mean SAT trends, and the increase is more significant for Tmin than for Tmax. The urban effects on annual mean Tmin and Tmax trends are statistically insignificant (−0.004°C/10 year and −0.035°C/10 year, respectively) for the original data series, but they reach 0.388°C/10 year and 0.096°C/10 year, respectively, for the homogeneity-adjusted data series. The urban contributions to the overall positive SAT trends are 100% and 28.8%, respectively, for Tmin and Tmax for the homogeneity-adjusted data.
The larger effects of relocations, homogenization, and urbanization on the Tmin series than on the Tmax series explain to a large extent the "asymmetry" in daytime and nighttime SAT trends at Huairou station, and the urban effect is also a major contributor to the DTR decline implied by the "asymmetric" changes of the annual mean Tmin and Tmax in the homogeneity-adjusted data at the station.
This work is financially supported by the Ministry of Science and Technology of China (GYHY201206012). Thanks are also due to Mr. G.H. Li at the Huairou Meteorological Bureau, Beijing Municipality, for providing information on the station history.
Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.