Background & Summary

The increasing concentration of greenhouse gases in the atmosphere creates a net positive radiative forcing in the climate system. In response, the Earth’s surface warms, triggering feedbacks in the climate system1. At the same time, heat penetrates into the ocean, warming ocean waters and increasing ocean heat content (OHC)2,3,4. Ocean warming has consequences such as strengthening tropical cyclones, fuelling extreme events, and raising the global sea level5. Meanwhile, ocean warming and stronger vertical stratification decrease the vertical exchange of oxygen, leading to ocean deoxygenation and threatening marine life6,7. Even if society can slow all greenhouse gas emissions and achieve “net zero” by mid-century as targeted by the United Nations Paris Agreement, there is a lag built into the climate system, primarily as a result of ocean thermal inertia, with changes such as deep-ocean warming and sea-level rise continuing for at least several hundred years1,5.

The influence of oceanic changes on social and economic systems depends on the physical state of the ocean and the local vulnerability to changes, as well as on worldwide efforts to mitigate the impact of future climate change and to adapt to current and forthcoming changes. Thus, our knowledge of climate change, particularly of ocean warming, can be used to support societal adaptation and mitigation decisions. High-quality, bias-free in situ observations are critical for detecting statistically significant changes and monitoring climate variability. Therefore, maintaining, managing, and improving the global archive of ocean data is crucial for climate action and operational oceanography applications.

Two groups of data-processing techniques are critical to ensure a high-quality dataset: (1) data quality control (QC), which removes erroneous outliers (i.e., observations that differ significantly and erroneously from the majority of the data in the population), and (2) bias correction, which seeks to eliminate systematic errors in the data.

Many QC-ed datasets/products have been provided in the past decades by several research groups, each usually linked to a specific database, for example, the World Ocean Database (WOD)8; the Global Ocean Data Analysis Project (GLODAP) dataset9; the Met Office Hadley Centre (EN) dataset10; the Copernicus in situ ocean dataset of temperature and salinity (CORA)11; the Argo database12; the Global Temperature and Salinity Profile Programme (GTSPP)13; and the International Quality-controlled Ocean Database initiative (IQuOD)14,15. Each QC system has its strengths and limitations16. For example, many existing QC systems assume a Gaussian distribution for regional temperature distributions, which neglects the skewness of natural temperature variations. Also, most QC schemes do not implement a vertical temperature gradient check, which uses information on the profile shape and thus increases the ability of the QC scheme to identify outliers. In addition, the long-term trend of the local climatological thresholds in a warming climate has not been accounted for, leading to the exclusion (rejection) of data linked to realistic extreme events. The topographic barriers that separate different water masses are also often not taken into account in many QC systems. To resolve these issues, ref. 17 proposed a new QC system, which has been shown to be superior in identifying outliers and in minimizing the erroneous flagging of good data. This new automated QC system, namely the Chinese Academy of Sciences (CAS) Oceanography Data Center Quality Control system (CODC-QC), is applied in the new database introduced in this paper.

Systematic errors have been identified in the data collected by several instrument types, namely mechanical bathythermographs (MBTs)18,19, expendable bathythermographs (XBTs)20,21, Nansen bottle casts22, and Argo floats23. Thus, biases are common to many ocean instruments and impact data accuracy. For example, spurious decadal variability due to the bias in XBT data was found in the OHC record from 1970–2001, leading to an underestimation of the long-term ocean warming rate24. Community efforts have been made to understand the errors and improve the data quality. New bias correction schemes have been suggested for XBT, MBT, and Nansen bottle cast data19,22,25, and are implemented in the database under consideration. We underline that the often unstable, if not harsh, operating conditions at sea degrade the quality of marine data, as does the large amount of data recorded, mainly until the 1990s, with instruments of generally poor quality. Therefore, correctly evaluating and estimating the uncertainties in measurements of marine parameters is not a simple task. A significant effort has been made by researchers to understand the problem and find ways to improve the data quality.

Benefiting from the collective progress of QC and bias corrections, this study describes a newly organized ocean in situ data archive, named the Chinese Academy of Sciences Oceanography Data Center version 1 (CODC-v1) database, which is QC-ed and bias-corrected to enable wider use of accurate ocean data both for climate research and in operational applications. Additionally, the availability of CODC-v1 facilitates further inter-comparison between databases. Differences in QC, bias correction, and other data-processing procedures represent a source of uncertainty in OHC estimates as well as in other applications (e.g., reanalysis product generation) that has not yet been quantified26.

Table 1 and Table 2 list the data format and introduce the variables in the data files of CODC-v1. In addition, we provide access to the software that enables users to tailor data processing and interpolation methods to their specific purposes.

Table 1 A description of the metadata information stored in CODC-v1 (in MATLAB files).
Table 2 A description of the temperature profile data and their flags stored in CODC-v1 (in MATLAB files).

Methods

Workflow overview

Data sources

The primary data are obtained from in situ measurements available through the World Ocean Database (WOD)8, downloaded in March 2023. The instruments include XBTs27, Argo floats12, conductivity/temperature/depth sensors (CTD), MBTs, bottles, moored buoys (MRB), drifting buoys (DRB), gliders (GLD), Autonomous Pinniped data (APB)28, and others. Both delayed-mode and real-time Argo data are assembled in CODC-v1. Since this database will be updated regularly, at least on an annual basis as in ref. 29, newly available delayed-mode Argo data from the Argo Data Centre will be added and will replace the real-time data during the update process (every 1~4 months). In addition, some unique data owned by several institutes and not yet available through the WOD and GTSPP are included in CODC-v1. These unique data include some publicly available data30,31,32,33,34,35,36 and data previously archived by CODC. There are a total of 17,657,649 profiles from January 1940 to December 2023 in CODC-v1 (with 22,784 from Chinese institutes), all of which are publicly accessible.

Instrumentation types and spatial and temporal data coverage

The data distribution over time for different instruments is illustrated in Fig. 1a, with the spatial distribution of data shown in Figs. 1b and 2a. The Nansen bottle (previously labelled BOT, but grouped under OSD in this paper), invented at the end of the 19th century, has been used to measure seawater temperature at depth from 1940 to the present. There are ~2.61 million profiles within the OSD instrumentation category (ocean station data, including bottle and low-resolution CTD profiles), which are mainly distributed in the coastal regions of the Northern Hemisphere (Fig. 2).

Fig. 1

Yearly number of temperature profiles in CODC-v1. (a) Annual number of temperature profiles collected by different instruments in CODC-v1. (b) Yearly number of observations at different layers. Panel (b) was produced using profiles interpolated to the standard levels.

Fig. 2

Geographical distribution of profiles from different instruments in CODC-v1 for several decadal periods.

MBTs (invented around 1930) measured temperature in the layers down to ~300 m depth and were the main component of the database in the period 1940–196519. They have a much wider geographic coverage than the OSD data, extending over the entire Northern Hemisphere, mainly along busy shipping routes connecting continents and countries (Fig. 2).

Since the end of 1965, XBT probes have been widely deployed, first by navies and soon after by oceanographers. They constitute the most abundant component of the database during the period 1966–2001 and still play a critical role today (Figs. 1a, 2)21,27. There are different models of XBT probes, with two large families that reach depths of 450 m (shallow) and 750 m (deep), respectively. The shallow version was invented first and was mainly used until 1990 (Fig. 1b); it was progressively replaced by the deep model, which became the most widespread in the database in the period up to the 2000s (Fig. 1b)37,38. Due to their affordability and relative ease of use, XBT probes were used extensively in the global ocean in the three decades from the late 1960s to 2001, with a significant component recorded within the Ship of Opportunity Program (SOOP), which exploits commercial ships. There are approximately 2.35 million XBT profiles in CODC-v1, but spatial sampling gaps are evident, especially in the South and Southeast Pacific.

CTDs have also been widely used since the 1960s, mainly from research vessels; they typically observe ocean temperature from the sea surface down to at least 1000 m, although the maximum depth is flexible (Fig. 1b). Because of their high accuracy, CTD data are often regarded as the “gold standard” of ocean observations. During the 1990s, increased subsurface measurement coverage of the global ocean by means of high-quality hydrographic sections (mainly using CTDs) was achieved as part of the World Ocean Circulation Experiment (WOCE)39,40.

Since the beginning of this millennium, the Argo array of autonomous floats has monitored the upper 2000 m of the open ocean (Fig. 1a,b)41. After a few years of building up the array, the Argo programme achieved a homogeneous and isotropic distribution of temperature profiles, guaranteed by about 4000 floats operating simultaneously and providing ~12,000 profiles per month. The installed sensors, equivalent to those of CTDs, guarantee high quality and are subject to a continuous quality-review process. The advent of Argo, integrated with other measuring instruments, has opened a new era of the global ocean observing system, allowing for the first time global coverage of the open ocean (Fig. 2). There are currently approximately 2.70 million Argo profiles in CODC-v1.

The APB profiles were collected by programs that equipped marine animals with temperature and salinity data loggers42. There are ~2.04 million APB temperature profiles in CODC-v1 (Fig. 1a), primarily located in the regions where these animals live, i.e., high-latitude coastal regions (Fig. 2)28. The overall quality of these data is still under analysis, and potential biases in the data collected by some sensor types are suspected.

The glider (GLD), an autonomous, unmanned underwater vehicle that requires little or no human assistance during travel, provides a wealth of profile observations near coastlines (Fig. 2)43. The installed instrumentation is identical to that of CTDs and is therefore capable of providing excellent data quality. There are ~2.5 million GLD profiles in total. Although the numbers of APB and GLD profiles are comparable to that of Argo profiles, their geographical coverage is limited (Fig. 2).

Data processing

The data are processed following the workflow shown in Fig. 3. First, all data are converted to a unified internal format, facilitating the subsequent processing; duplicates are also removed at this step. Because the raw in situ data are of heterogeneous quality, a unified QC system is essential to homogenize the data and to ensure the highest possible level of consistent quality across the entire database15,17,44. This database has been QC-ed by the newly proposed CODC-QC system17 (latest version from November 2023), which includes 14 distinct quality checks, illustrated in Fig. 4. Besides basic information checks, the system compares each observation with predefined climatological ranges (e.g., global range check, seawater freezing point check, local climatological range check) and also checks the profile shape (e.g., spike check, constant value check, local vertical gradient range check) and instrument-specific aspects (e.g., XBT instrument check).
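To make the structure of such checks concrete, the sketch below shows how a global range check and an Argo-style spike check could be implemented in Python. The thresholds and function names here are illustrative assumptions, not the pre-defined values of the actual CODC-QC system (see the released source code for those).

```python
import numpy as np

def global_range_check(temp, t_min=-2.5, t_max=40.0):
    """Flag temperatures outside a plausible global range (illustrative bounds)."""
    return (temp < t_min) | (temp > t_max)

def spike_check(temp, threshold=2.0):
    """Flag points deviating sharply from both vertical neighbours
    (illustrative threshold)."""
    flags = np.zeros(temp.shape, dtype=bool)
    # Argo-style spike test: |V2 - (V1 + V3)/2| - |(V3 - V1)/2| > threshold
    deviation = np.abs(temp[1:-1] - 0.5 * (temp[:-2] + temp[2:]))
    span = 0.5 * np.abs(temp[2:] - temp[:-2])
    flags[1:-1] = deviation - span > threshold
    return flags

profile = np.array([18.2, 18.1, 17.9, 25.0, 17.5, 17.4, 17.2])
rejected = global_range_check(profile) | spike_check(profile)
print(np.flatnonzero(rejected))  # -> [3]: the 25.0 degC spike is flagged
```

In the real system, such per-profile checks are combined with the climatological and instrument-specific modules of Fig. 4 before a measurement is finally flagged.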

Fig. 3

Flowchart illustrating the main workflow, data source, and techniques used in CODC-v1.

Fig. 4

Flowchart illustrating the QC workflow in the CODC-v1 system. The blue icons indicate individual QC modules, and the red icons show the system’s pre-defined thresholds and other data processing procedures. Here, the CODC-QC system17 is used, with the source code available in the Code Availability section.

The key advantages of this QC system include: (1) it accounts for the anisotropic character of the local temperature distribution; (2) it accounts for topographic barriers separating different water masses; (3) the local climatological ranges vary with time to account for ocean warming, reducing the possibility of erroneously excluding true extreme events; (4) all QC modules are optimized with a diagnostic tool45 and then evaluated manually; (5) the CODC-QC system has been evaluated using two benchmark datasets, demonstrating its skill in removing spurious data while minimizing the percentage of mistakenly flagged good data17.
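As an illustration of points (1) and (3), the sketch below shows one plausible way to express a skewness-aware, time-varying local climatological range check in Python. The percentile bounds and the linear trend adjustment are our own illustrative assumptions and do not reproduce the exact statistics used in CODC-QC17.

```python
import numpy as np

def local_range_check(obs_temp, obs_year, clim_temps, clim_years,
                      lower_pct=0.5, upper_pct=99.5):
    """Return True if an observation falls outside the local climatological
    envelope; inputs are numpy arrays of historical local samples and their
    years. Percentiles and trend handling are illustrative only."""
    # Asymmetric percentile bounds capture the skewness of the local
    # temperature distribution (instead of a symmetric Gaussian envelope)
    lo, hi = np.percentile(clim_temps, [lower_pct, upper_pct])
    # First-order local warming trend (degC per year), used to shift the
    # envelope toward the year of the observation so that realistic
    # extremes in a warming ocean are not rejected
    trend = np.polyfit(clim_years, clim_temps, 1)[0]
    offset = trend * (obs_year - clim_years.mean())
    return not (lo + offset <= obs_temp <= hi + offset)
```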

The bias in XBT data was first reported soon after the instrument’s invention in the early 1960s; however, the global impact of this bias was not fully recognized until 200720. Since then, many studies have been devoted to understanding the error sources and proposing correction schemes21,27,46. It is now well established that the XBT bias has two components, related to the depth and temperature measurements, respectively47,48. The depth bias arises because XBT probes have no pressure sensor and the manufacturer’s fall-rate equation does not accurately describe the actual probe motion. The temperature bias can be traced back to the temperature sensor, the acquisition system, and, in general, the electrical-electronic part of the XBT system (probe plus acquisition unit). Biases in the XBT data were minimized using the correction scheme proposed in ref. 25 (hereinafter the CH14 scheme), which corrects both depth and temperature biases and depends on water temperature, probe type, and time. Further evaluations confirmed that CH14 may correct local and global biases in XBT data better than other schemes49; therefore, CH14 is currently recommended by the oceanographic community21,26,27.
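To illustrate the two-component structure of such a correction (not the actual CH14 coefficients, which vary with probe type, water temperature, and year25), the sketch below inverts the original fall-rate equation z = a·t − b·t² to recover the elapsed time, recomputes depths with corrected coefficients, and removes a pure thermal offset. The values of a and b are the widely cited manufacturer coefficients for one common probe family; the corrected values and the thermal offset are placeholders.

```python
import numpy as np

def invert_fall_rate(depth_m, a=6.472, b=0.00216):
    """Recover elapsed time (s) from recorded depth by inverting
    z = a*t - b*t**2 (manufacturer coefficients for one probe family)."""
    return (a - np.sqrt(a**2 - 4.0 * b * depth_m)) / (2.0 * b)

def xbt_correct(depth_m, temp_c, a_new=6.5, b_new=0.00215, dT=-0.05):
    """Apply placeholder depth and temperature corrections."""
    t = invert_fall_rate(depth_m)
    depth_new = a_new * t - b_new * t**2   # corrected fall-rate equation
    temp_new = temp_c + dT                 # remove a pure thermal offset
    return depth_new, temp_new
```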

MBT data dominated the global ocean database from 1940 to 1970. The MBT temperature bias depends on the instrument type, the data acquisition procedure, and the data processing procedure, for instance, temperature and pressure sensor response delays, inaccurate grid-viewer calibration, and data reading errors. A correction scheme described recently in ref. 19 (hereinafter GC20) is capable of correcting both depth and temperature biases, which show a dependence on country, depth, and time. Therefore, the time-varying bias is corrected separately for country-specific profiles, including those from the United States, the Soviet Union (Russia), Japan, Canada, and Great Britain. Data from all other countries are corrected with globally averaged correction factors (for depth and temperature separately) calculated from all MBT data, which are also time-varying (i.e., annual corrections).

Bottle data contribute a significant fraction of the data before 1990, and a recent analysis quantified the bottle temperature bias at ~0.05 °C before 1980 in the 0–700 m average22. A correction scheme recently proposed in ref. 22 (hereinafter GCB22) is applied in the CODC-v1 database in this study. Both the depth and temperature measurements of the bottle data are biased, and the biases vary with the calendar year. The temperature bias is constant with depth, while the depth bias varies with depth.
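A minimal sketch of this structure (a depth-independent temperature offset and a depth-dependent depth correction, both varying by year) is given below; the lookup values are placeholders, not the published GCB22 coefficients22.

```python
# Placeholder per-year corrections illustrating the GCB22 structure only
TEMP_OFFSET = {1975: 0.05, 1976: 0.04}     # degC, constant with depth
DEPTH_FACTOR = {1975: 0.997, 1976: 0.996}  # multiplicative, so the depth
                                           # correction grows with depth

def correct_bottle(depth_m, temp_c, year):
    """Subtract the year's temperature offset and rescale the depths."""
    return (depth_m * DEPTH_FACTOR.get(year, 1.0),
            temp_c - TEMP_OFFSET.get(year, 0.0))
```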

Data Records

QC-ed and bias-corrected data of CODC-v150 are available from the Chinese Academy of Sciences Ocean Data Repository at https://doi.org/10.12157/IOCAS.20230525.001; no registration is needed to download the dataset. The product includes one “.mat” (MATLAB format) and one “.nc” (NetCDF format) file per month, each collecting all data for that month. The data in the two formats are identical, so Table 1 and Table 2 describe only the variables in the MATLAB format. In addition, because the main data source is WOD, we have kept all WOD metadata, for example, the WOD QC flags and the WOD data ID. This consistency is essential for future comparisons of CODC-v1 with WOD, for instance, investigating the impacts of different QC and bias-correction choices on OHC estimates, ocean reanalysis product generation, and ocean prediction. Additionally, we note that the CODC database is different from the Institute of Atmospheric Physics (IAP) datasets, which are gridded datasets produced by spatial interpolation using the CODC in situ profile database as input.
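As a quick-start example, a monthly file can be read in Python as sketched below; the file name is a hypothetical illustration of the monthly naming convention, and the actual variable names are those documented in Tables 1 and 2.

```python
import xarray as xr
from scipy.io import loadmat

# NetCDF version of one (hypothetical) monthly file
ds = xr.open_dataset("CODC_v1_200501.nc")
print(ds.data_vars)  # variables as described in Tables 1-2

# The MATLAB version holds identical content
mat = loadmat("CODC_v1_200501.mat")
print(sorted(k for k in mat if not k.startswith("__")))
```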

For the entire database, 7.18% of measurements (i.e., 183,414,409 data points) are rejected by CODC-QC, with the CTD and Argo data exhibiting the lowest rejection rates. In comparison, the XBT (13.84%), glider (7.48%), and Nansen bottle (6.55%) data have the three highest rejection rates. In the near-surface layer (0–50 m), 7.41% of data were rejected, with MRB and DRB data exhibiting the highest percentages. Within 50–400 m, 5.56% of data were rejected, dominated by MBT data; within 400–1000 m, 8.14% were rejected, dominated by XBT data; within 1000–2000 m, 3.50% were rejected, dominated by APB data; and below 2000 m, 4.85% were rejected, dominated by Argo data. For Chinese data, 6.17% of measurements are rejected by CODC-QC. This rejection rate is roughly homogeneous with depth in the upper 500 m, mostly ranging from 2.0% to 2.5%, but it increases with depth from 500 m to 1,000 m.
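For users who wish to reproduce such statistics, the sketch below computes a per-instrument rejection rate from the stored flags; qc_flag and instrument are placeholder names for the flag and metadata arrays described in Tables 1 and 2, and flag value 0 is assumed here to denote accepted data.

```python
import numpy as np

def rejection_rate(qc_flag, instrument, name):
    """Percentage of flagged (non-zero) measurements for one instrument type."""
    sel = instrument == name
    return 100.0 * np.count_nonzero(qc_flag[sel]) / qc_flag[sel].size

# e.g. rejection_rate(qc_flag, instrument, "XBT") should approach 13.84
# when accumulated over the whole archive
```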

Technical Validation

The QC system has been thoroughly validated in ref. 17, where a detailed description can be found. Figure 5 provides several representative examples showing temperature profiles before and after applying the CODC-QC system. We randomly selected 3,000 temperature profiles collected by several different instruments in (a–c) July 1975, (d–f) March 1985, (g–i) October 1995, (j–l) March 2005, and (m–o) August 2022. These examples illustrate the existence of many erroneous data (e.g., spikes, extreme values, constant values) from different instruments, which have been identified and flagged in CODC-v1. Notably, some profiles look suspicious (e.g., the nearly vertically constant profiles around 13 °C in Fig. 5b,e,k,n) but still pass the CODC-QC checks. Manual investigation shows that these profiles are physically plausible: they come from the Mediterranean Sea, which is characterised by a thermal structure that is homogeneous with depth. This is an example of the validity of the QC system. Among the 14 modules, the local climatological range check and the local climatological gradient range check are the most effective in detecting outliers.

Fig. 5

Examples of 3,000 randomly selected temperature profiles before QC (left panels), after QC (middle panels), and the data outliers (right panels). The examples are chosen arbitrarily from several years: (a–c) July 1975, (d–f) March 1985, (g–i) October 1995, (j–l) March 2005, (m–o) August 2022. The locations of the selected profiles are shown in the insets of the middle panels. Different colors denote different instrument types. The abundance of profiles in panels (a,b), (d,e), and (g,h) ending at around 450 m and 750 m depth is due to the dominant XBT component (shallow and deep models, respectively).

Additional validation of the QC-ed data can be made through inspection of gridded averages and standard deviations, which represent the climatological state of the ocean temperature. An effective QC system should remove spikes and other unphysical temperature values (Fig. 6 versus Fig. S1). Figure 6 shows the gridded temperature average and standard deviation at several selected layers (5 m, 200 m, 700 m, and 1500 m). The respective spatial patterns appear well-defined and physically meaningful, without significant spurious signals. For example, the warm pool (temperature > 28 °C) in the Western Pacific, North Atlantic, and Eastern Indian Ocean, as well as the cold polar regions, are all well-defined (Fig. 6a). The warmer waters in the subtropical regions can be seen at the subsurface level (e.g., 200 m in Fig. 6c), associated with a deeper mixed layer and thermocline in these regions. Higher variability occurs in the energetic regions of the Gulf Stream, the Kuroshio, and the Antarctic Circumpolar Current all the way down to at least the 700 m level (Fig. 6b,d,f). The deep-reaching Agulhas Current is also visible down to 700 m in Fig. 6f. The gridded fields show vertically homogeneous and high temperatures in the Mediterranean Sea. The Mediterranean outflow impacts the Atlantic Ocean, causing higher temperatures and increased temperature variability in the mid-Atlantic at 1500 m (Fig. 6g,h). These physically meaningful patterns suggest the success of the QC system in removing outliers. By contrast, for the data without QC (Fig. S1), spurious increases in temperature standard deviation appear in some isolated grid boxes and along some cruise tracks, indicating the impact of erroneous data.
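For reference, a stripped-down version of the 1° binning behind Fig. 6 is sketched below; the real mapping involves additional choices (vertical interpolation to standard levels, minimum sample counts, etc.) that are omitted here.

```python
import numpy as np

def grid_mean_std(lon, lat, temp):
    """Mean and standard deviation of observations in 1-degree boxes.
    lon in [-180, 180), lat in [-90, 90], temp at one standard level."""
    ix = np.clip(((lon + 180.0) % 360.0).astype(int), 0, 359)
    iy = np.clip((lat + 90.0).astype(int), 0, 179)
    count = np.zeros((360, 180))
    s1 = np.zeros((360, 180))   # sum of temperatures per box
    s2 = np.zeros((360, 180))   # sum of squared temperatures per box
    np.add.at(count, (ix, iy), 1.0)
    np.add.at(s1, (ix, iy), temp)
    np.add.at(s2, (ix, iy), temp ** 2)
    with np.errstate(invalid="ignore", divide="ignore"):
        mean = s1 / count
        std = np.sqrt(np.maximum(s2 / count - mean ** 2, 0.0))
    return mean, std  # boxes with no data hold NaN means
```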

Fig. 6

Temperature average and standard deviation on a 1° by 1° grid using all CODC-v1 QC-ed and bias-corrected data at different layers: (a,b) 5 m, (c,d) 200 m, (e,f) 700 m, and (g,h) 1500 m. The scales differ between panels; the unit is °C.

We compared the original temperature profiles with co-located CTD profiles to assess the impact of bias corrections on XBT, MBT, and OSD temperature profiles (Fig. 7). Based on previous studies19,22,25,49, we selected a collocation radius of 150 km and a maximum time difference between observations of ~45 days. Previous studies suggested that the bias corrections are not sensitive to slight modifications of these space and time criteria19,22. The XBT bias is as large as 0.1~0.2 °C before 1980 in the global 0–700 m average and reduces to less than 0.05 °C after 1990. For the MBT data from the United States of America (the major subset of the global MBT archive), the bias is as large as 0.2 °C during the 1970s in the global 0–250 m average, and 0.05~0.1 °C from the 1940s to the late 1960s. The application of bias corrections effectively reduces the original bias, as demonstrated in Fig. 7c,d. Similar results are found for the bottle correction. For OSD data, a positive bias is characteristic during 1970–1980 (~0.05 °C), being larger in the 200–600 m layer than elsewhere. After 1980, the bias diminishes, probably owing to improved data recording techniques. The original time- and depth-varying biases are substantially reduced after applying the GCB22 corrections (Fig. 7e,f). These results illustrate the impact of the bias corrections, which substantially reduce the original biases for several instrument types and correspondingly increase the homogeneity of the entire database.
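The collocation criterion itself is straightforward to express in code; the brute-force sketch below pairs one profile with all CTD profiles within the stated 150 km and ~45 day windows (a production version would use a spatial index for speed).

```python
import numpy as np

def haversine_km(lat1, lon1, lat2, lon2, r_earth=6371.0):
    """Great-circle distance in km between points given in degrees."""
    p1, p2 = np.radians(lat1), np.radians(lat2)
    dphi = p2 - p1
    dlmb = np.radians(lon2 - lon1)
    a = np.sin(dphi / 2) ** 2 + np.cos(p1) * np.cos(p2) * np.sin(dlmb / 2) ** 2
    return 2 * r_earth * np.arcsin(np.sqrt(a))

def collocate(lat_obs, lon_obs, t_obs, lat_ctd, lon_ctd, t_ctd,
              max_km=150.0, max_days=45.0):
    """Indices of CTD profiles co-located with one observed profile;
    times are given in days."""
    near_time = np.abs(t_ctd - t_obs) <= max_days
    dist = haversine_km(lat_obs, lon_obs,
                        lat_ctd[near_time], lon_ctd[near_time])
    return np.flatnonzero(near_time)[dist <= max_km]
```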

Fig. 7

Temperature offsets of XBT (a,b) and MBT (c,d) data (both from the United States of America), and of bottle data (e,f), relative to co-located CTD data. Offsets are shown before (left) and after (right) bias corrections.

An additional evaluation has been made of the annual cycle of OHC in the upper 2000 m, to check whether the new QC system results in a better description of the OHC annual cycle. Figure 8 shows the global and hemispheric climatological annual cycles (for the period 2005–2020) of 0–2000 m OHC calculated from the CODC-v1 database. For this calculation, all CODC-v1 data from 2005 to 2020 are averaged to construct a monthly climatology using the Institute of Atmospheric Physics (IAP) mapping approach51. The OHC annual cycle was also calculated from several available gridded climatologies, including the previous IAP version51, the WAGHC product (WOCE/Argo Global Hydrographic Climatology)52 from the University of Hamburg, and two Argo-only gridded products: the Scripps Institution of Oceanography product (SCRIPPS)53 and the Barnes objective analysis Argo product (BOA)54.
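For orientation, the heat-content integral underlying Fig. 8 can be written as OHC = ρ·cp·∫T′dz over 0–2000 m; the sketch below evaluates it for a single temperature-anomaly profile with a constant reference density and heat capacity, which is a simplification of the full IAP procedure51.

```python
import numpy as np

RHO = 1030.0   # kg m-3, reference seawater density (assumed constant)
CP = 3990.0    # J kg-1 K-1, specific heat capacity (assumed constant)

def ohc_0_2000(temp_anom, depth_levels):
    """Column OHC (J m-2) from a temperature-anomaly profile (degC)
    on standard depth levels (m), integrated over the upper 2000 m."""
    mask = depth_levels <= 2000.0
    return RHO * CP * np.trapz(temp_anom[mask], depth_levels[mask])
```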

Fig. 8

Annual cycle of OHC of upper 2000 m for the global ocean (a), the Northern Hemisphere (b), and the Southern Hemisphere (c) based on the new (red) and old (blue) QC-ed CODC-v1 data. Annual cycles based on three other gridded datasets (SCRIPPS, BOA, and WAGHC) are shown for comparison.

The global OHC reaches its maximum from March to April and its minimum from August to September (Fig. 8a). The two hemispheres show opposite OHC annual variations, with peak values in March for the Southern Hemisphere and in September for the Northern Hemisphere (Fig. 8b,c). These variations are associated with the Earth’s orbital motion and with the uneven distribution of land and sea55. The opposite changes in the two hemispheres largely cancel each other when deriving the global OHC, so that data errors stand out; this is why the global-scale quantity provides a good test of data quality. Compared with the previous IAP version, in which WOD-QC flags were used for QC, a notable feature of the new CODC-v1 OHC annual cycle is its smoothness: the previous version exhibited several spurious, unphysical month-to-month variations, such as a large OHC increase from November to December. The new CODC-v1 OHC annual cycle is in better agreement with the other products, especially with the two Argo-only products, which are based on high-quality Argo observations (Fig. 8). We thus conclude that the new CODC-v1 data better represent ocean temperature variability, owing to the reduced impact of random and systematic errors. An additional investigation (Fig. S2) reveals that the remaining differences between CODC-v1 and the other products originate mainly from eddy-rich regions, including the Kuroshio, the Gulf Stream, and the Southern Ocean. Further work is needed to fully understand these differences.

In summary, the evaluation results presented here and in earlier studies confirm the high data quality of the new CODC-v1 database, achieved through the application of the new QC system and bias correction schemes. We note that the original temperature profiles are also stored in the CODC-v1 database, so that users can trace all alterations introduced into the original observations.

Usage Notes

The CODC-v1 database is available from https://english.casodc.com/data/metadata-special-detail?id=1614882932386746369, https://doi.org/10.12157/IOCAS.20230525.001, https://msdc.qdio.ac.cn, and http://www.ocean.iap.ac.cn/ (under the label Data Service – In situ observations). Argo data were collected and made freely available by the International Argo Program and the national programs that contribute to it (https://argo.ucsd.edu, https://www.ocean-ops.org). The Argo Program is part of the Global Ocean Observing System.