Automated cleaning and uncertainty attribution of archival bathymetry based on a priori knowledge
Hydrographic offices hold large valuable historic bathymetric data sets, many of which were collected using older generation survey systems that contain little or no metadata and/or uncertainty estimates. These bathymetric data sets generally contain large outlier (errant) data points to clean, yet standard practice does not include rigorous automated procedures for systematic cleaning of these historical data sets and their subsequent conversion into reusable data formats. In this paper, we propose an automated method for this task. We utilize statistically diverse threshold tests, including a robust least trimmed squared method, to clean the data. We use LOESS weighted regression residuals together with a Student-t distribution to attribute uncertainty for each retained sounding; the resulting uncertainty values compare favorably with native estimates of uncertainty from co-located data sets which we use to estimate a point-wise goodness-of-fit measure. Storing a cleansed validated data set augmented with uncertainty in a re-usable format provides the details of this analysis for subsequent users. Our test results indicate that the method significantly improves the quality of the data set while concurrently providing confidence interval estimates and point-wise goodness-of-fit estimates as referenced to current hydrographic practices.
KeywordsHydrography Bathymetry Historical data Data cleaning
We would like to thank the Office of Naval Research for sponsoring this research under the Worldwide High-Resolution Bathymetry project of the Naval Research Laboratory’s Base Program.
- Calder BR, Garder JV (2008) “U.S. Law of the sea cruise to map the foot of the slope of the Northeast U.S. Atlantic Continental Margin: Leg 6”, University of New Hampshire (UNH), Center for Coastal and Ocean Mapping (CCOM)/Joint Hydrographic Center (JHC). http://ccom.unh.edu/sites/default/files/publications/Calder_08_cruise_report_Atlantic_KNOX17RR_0.pdf. Accessed 10 Mar 2017
- Calder B, Mayer LA (2003) “Automatic processing of high-rate, high-density multibeam echosounder data”. Geochem Geophys Geosyst (American Geophysical Union) 4(6):1048. doi: 10.1029/2002GC000486.
- Doornik, JA (2011) Robust estimation using least trimmed squares. economic department, Oxford University, Oxford, United Kingdom: Economics Department, University of Oxford. http://econ.au.dk/fileadmin/site_files/filer_oekonomi/subsites/creates/Seminar_Papers/2011/ELTS.pdf
- Draper N, Smith H (1981) Applied regression analysis, 2nd edn. Wiley, New YorkGoogle Scholar
- Elmore P, Fabre D, Sawyer R, Ladner R (2012) “Uncertainty estimation of historical bathymetric data from bayesian networks”. Geochem Geophys Geosyst 13 (11). doi: 10.1029/2012GC004144
- Free Software Foundation (2015) GSL—GNU scientific library, Version 2.1. http://www.gnu.org/software/gsl/. Accessed May 15 2016
- Galassi M, Davies J, Theiler J, Gough R, Jungman G, Alken P, Booth M, Rossi F (2009) GNU scientific library reference manual, 3rd edn. Network Theory Ltd, LondonGoogle Scholar
- Hare R (1995) “Depth and position error budgets for multibeam echosounding”. Int Hydrogr Rev 72:37–69. https://journals.lib.unb.ca/index.php/ihr/article/viewFile/23178/26953
- Hare R, Eakins B, Amante C (2011) “Modelling bathymetric uncertainty”. Int Hydrogr Rev 6:31–42. https://journals.lib.unb.ca/index.php/ihr/article/view/20888. Accessed 31 May 2016
- IHO (2008) IHO standards for hydrographic surveys: special publication No. 44 Fifth edition. International Hydrographic Bureau, Monaco, p 28Google Scholar
- Jakobsson M, Mayer LA, Coakley B, Dowdeswell JA, Forbes S, Fridman B, Hodnesdal H et al. (2012) “The international bathymetric chart of THE arctic ocean (IBCAO) version 3.0.” Geophys Res Lett. doi: 10.1029/2012GL052219
- JCGM (2008) “Evaluation of measurement data—Guide to the expression of uncertainty in measurement.” http://www.bipm.org/utils/common/documents/jcgm/JCGM_100_2008_E.pdf. September. Accessed May 31 2016. http://www.bipm.org/utils/common/documents/jcgm/JCGM_100_2008_E.pdf
- Ladner R, Moseley J (2002) “Managing heterogeneous databases to support diverse product requirements”. OCEANS ‘02 MTS/IEEE. IEEE. 895–899. doi: 10.1109/OCEANS.2002.1192086
- Lawson CL, Hanson RJ (1995) Solving least squares problems. S.I.A.MGoogle Scholar
- Press WH, Teukolsky SA, Vetterling WT, Flannery BP (2007) Numerical recipes—the art of scientific computing. 3rd. Cambridge University Press, New YorkGoogle Scholar
- Smith WH, Sandwell DT (1997) “Global sea floor topography from satellite altimetry and ship depth soundings.” Science 277. doi: 10.1126/science.277.5334.1956
- Taylor BN, Kuyatt CE (1994) “NIST technical note 1297 guidelines for evaluating and expressing the uncertainty of NIST measurement results”. http://nist.gov/pml/pubs/tn1297/. September. Accessed May 31, 2016
- Tharp M, Frankel H (1986) “Mappers of the deep”. Nat Hist 95:49–62. http://www.faculty.umb.edu/anamarija.frankic/files/Oceanography%20Spring%2008/MidAtlantic%20ridge%20discovery.pdf.gz
- Weatherall P, Marks KM, Jakobsson M, Schmitt T, Tani S, Arndt JE, Rovere M, Chayes D, Ferrini V, Wigley R (2015) “A new digital bathymetric model of the world’s oceans”. Earth Space Sci. doi: 10.1002/2015EA000107
- Weber JR (1983) “Maps of the arctic basin sea floor: a history of bathymetry and its interpretation”. Arctic 36 (2):121–142. http://www.jstor.org/stable/40509566