Skip to main content

Creating Structured, Linked Geographic Data from Historical Maps: Challenges and Trends

  • Chapter
  • First Online:
Using Historical Maps in Scientific Studies

Part of the book series: SpringerBriefs in Geography ((BRIEFSGEOGRAPHY))

Abstract

Historical geographic data are essential for a variety of studies of cancer and environmental epidemiology, urbanization, and landscape ecology. However, existing data sources typically contain only contemporary information. Historical maps hold a great deal of detailed geographic information at various times in the past. Yet, finding relevant maps is difficult, and the map content is not machine-readable. This chapter presents the challenges and trends in building a map processing, modeling, linking, and publishing framework. The framework will enable querying historical map collections as a unified and structured spatiotemporal source in which individual geographic phenomena (extracted from maps) are modeled (described) with semantic descriptions and linked to other data sources (e.g., DBpedia). This framework will allow making use of historical geographic datasets from a variety of maps, efficiently, over large geographic extents. Realizing such a framework poses significant research challenges in multiple fields in computer science including digital map processing, data integration, and the Semantic Web technologies, and other disciplines such as spatial, social, and health sciences. Tackling these challenges will not only advance research in computer science and geographic information science but also present a unique opportunity for interdisciplinary research.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 16.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://ngmdb.usgs.gov/ngmdb/ngmdbhome.html.

  2. 2.

    http://ngmdb.usgs.gov/maps/TopoView/.

  3. 3.

    http://www.davidrumsey.com/.

  4. 4.

    http://OldMapsOnline.org/.

  5. 5.

    http://maps.nls.uk/.

  6. 6.

    This chapter is based on a previous vision paper presented at the 2015 ACM SIGSPATIAL Conference [Chi15] and the First Place of the Best Vision Paper Award sponsored by the Computing Research Association’s Computing Community Consortium under the CCC Blue Sky initiative.

  7. 7.

    https://www.geonames.org/.

  8. 8.

    http://wiki.dbpedia.org/.

  9. 9.

    This is just an example. By no means the author is an expert of soil contamination or growing grapefruits.

  10. 10.

    To appreciate this difficulty from experience, the reader is encouraged to explore how long it would take to find a large-scale map of 1941 Budapest.

  11. 11.

    http://ablesw.com/r2v/download.html.

  12. 12.

    https://github.com/NYPL/map-vectorizer/.

  13. 13.

    http://commons.pelagios.org/.

  14. 14.

    http://www.visionofbritain.org.uk/data/#tabgb1900.

  15. 15.

    A USGS historical topographic map with the 600 DPI (dots-per-inch) scan resolution is about 12, 000 × 12, 000 pixels.

  16. 16.

    National Science Foundation (United States).

  17. 17.

    The reader is referred to [Cla10] for a detailed introduction to GISs and GIS data formats.

  18. 18.

    https://geojson.org/.

  19. 19.

    http://www.opengeospatial.org/.

  20. 20.

    https://www.nhgis.org.

  21. 21.

    http://geovocab.org/doc/neogeo.

  22. 22.

    https://www.w3.org/RDF/.

  23. 23.

    For example, the URI, https://www.geonames.org/3020251/embrun.html, refer to the town Embrun in France. The reader is referred to the GeoNames Ontology website (http://www.geonames.org/ontology/documentation.html) for more examples about URIs and the Geo Semantic Web.

  24. 24.

    http://usc-isi-i2.github.io/karma/.

  25. 25.

    https://www.w3.org/2005/Incubator/geo/XGR-geo-ont-20071023/.

  26. 26.

    See the full tutorial here: https://github.com/usc-isi-i2/Web-Karma/wiki/Working-with-geospatial-data/.

  27. 27.

    https://recogito.pelagios.org/.

  28. 28.

    http://commons.pelagios.org/.

  29. 29.

    The reader is referred to [Tam18] for an overview of ontologies and the ontologies that describes geographic data.

References

  1. M.G. Arteaga, Historical map polygon and feature extractor, in MapInteract 2013, Proceedings of the 1st ACM SIGSPATIAL International Workshop on MapInteraction, November 5th, 2013, Orlando, Florida, USA (2013), pp. 66–71. https://doi.org/10.1145/2534931.2534932

  2. B. Barz, T.C. van Dijk, B. Spaan, J. Denzler, Putting user reputation on the map: unsupervised quality control for crowdsourced historical data, in Proceedings of the 2nd ACM SIGSPATIAL Workshop on Geospatial Humanities, GeoHumanities’18 (ACM, New York, 2018), pp. 3:1–3:6. ISBN: 978-1-4503-6032-6. https://doi.org/10.1145/3282933.3282937

  3. F. Bastani, S. He, S. Abbar, M. Alizadeh, H. Balakrishnan, S. Chawla, S. Madden, Machine-assisted map editing, in Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPATIAL ’18 (ACM, New York, 2018), pp. 23–32. ISBN: 978-1-4503-5889-7. https://doi.org/10.1145/3274895.3274927

    Book  Google Scholar 

  4. B. Budig, T.C. van Dijk, Active learning for classifying template matches in historical maps, in Discovery Science, ed. by N. Japkowicz, S. Matwin. Lecture Notes in Computer Science (Springer, Berlin, 2015), pp. 33–47. ISBN: 9783319242811, 9783319242828. https://doi.org/10.1007/978-3-319-24282-8_5

    Chapter  Google Scholar 

  5. B. Budig, T.C.V. Dijk, A. Wolff, Matching labels and markers in historical maps: an algorithm with interactive postprocessing, in ACM Transactions on Spatial Algorithms and Systems (TSAS) 2.4 (2016), pp. 13:1–13:24. ISSN: 2374-0353. https://doi.org/10.1145/2994598

    Article  Google Scholar 

  6. C.S. Beattie, 3D visualization models as a tool for reconstructing the historical landscape of the Ballona creek watershed, MA thesis, University of Southern California, 2014

    Google Scholar 

  7. C. Bizer, T. Heath, T. Berners-Lee, Linked data - the story so far. Int. J. Semant. Web Inf. Syst. 5(3), 1–22 (2009). ISSN: 1552-6283

    Article  Google Scholar 

  8. B. Budig, T.C. van Dijk, F. Feitsch, M.G. Arteaga, Polygon consensus: smart crowdsourcing for extracting building footprints from historical maps, in Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPACIAL ’16 (ACM, New York, 2016), pp. 66:1–66:4. ISBN: 978-1-4503-4589-7. https://doi.org/10.1145/2996913.2996951

  9. B. Budig, Efficient algorithms and user interaction for metadata extraction from historical maps, in Proceedings of the 2Nd ACM SIGSPATIAL PhD Workshop, SIGSPATIAL PhD ’15 (ACM, New York, 2016), pp. 4:1–4:4. ISBN: 978-1-4503-3980-3. https://doi.org/10.1145/2855680.2855841

  10. B. Budig, Extracting spatial information from historical maps: algorithms and interaction, PhD thesis, University of Würzburg, 2018

    Google Scholar 

  11. B. Budig, T.C. van Dijk, F. Kirchner, Glyph miner: a system for efficiently extracting glyphs from early prints in the context of OCR, in 2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL) (2016), pp. 31–34

    Google Scholar 

  12. Y.-Y. Chiang, P. Chioh, S. Moghaddam, A training-by-example approach for symbol spotting from raster maps, in Proceedings of the 8th International Conference on Geographic Information Science (2014), pp. 264–269

    Google Scholar 

  13. C.-C. Chen, C.A. Knoblock, C. Shahabi, Y.-Y. Chiang, S. Thakkar, Automatically and accurately conflating orthoimagery and street maps, in Proceedings of the 12th Annual ACM International Workshop on Geographic Information Systems, GIS ’04 (ACM, New York, 2004), pp. 47–56. ISBN: 9781581139792. https://doi.org/10.1145/1032222.1032231

  14. Y.-Y. Chiang, S. Moghaddam, S. Gupta, R. Fernandes, C.A. Knoblock, From map images to geographic names, in Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM, New York, 2014), pp. 581–584. ISBN: 9781450331319. https://doi.org/10.1145/2666310.2666374

  15. Y.-Y. Chiang, S. Leyk, N.H. Nazari, S. Moghaddam, The impact of graphical quality on automatic text recognition in digital maps, in Proceedings of the 27th International Cartographic Conference (2015) ISBN: 9788588783119

    Google Scholar 

  16. Y.-Y. Chiang, S. Leyk, N.H. Nazari, S. Moghaddam, T.X. Tan, Assessing the impact of graphical quality on automatic text recognition in digital maps. Comput. Geosci. 93, 21–35 (2016). ISSN: 0098-3004. https://doi.org/10.1016/j.cageo.2016.04.013

    Article  Google Scholar 

  17. Y.-Y. Chiang, Harvesting geographic features from heterogeneous raster maps, PhD thesis, Los Angeles, CA, USA: University of Southern California, 2010. ISBN: 9781124412498

    Google Scholar 

  18. Y.-Y. Chiang, Strabo: a complete system for label recognition in maps, in Proceedings of the 26th International Cartographic Conference (ICC’13) (2013), pp. 838–838. ISBN: 9781907075063

    Google Scholar 

  19. Y.-Y. Chiang, Querying historical maps as a unified, structured, and linked spatiotemporal source: vision paper, in Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems. GIS ’15 (ACM, New York, 2015), pp. 16:1–16:4. ISBN: 9781450339674. https://doi.org/10.1145/2820783.2820887

  20. Y.-Y. Chiang, Unlocking textual content from historical maps - potentials and applications, trends, and outlooks, in Recent Trends in Image Processing and Pattern Recognition, ed. by K. Santosh, M. Hangarge, V. Bevilacqua, A. Negi (Springer, Singapore, 2017), pp. 111–124. ISBN: 978-981-10-4859-3

    Chapter  Google Scholar 

  21. Y.-Y. Chiang, C.A. Knoblock, Automatic extraction of road intersection position, connectivity, and orientations from raster maps, in Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM Press, New York, 2008), pp. 1–10. ISBN: 9781605583235. https://doi.org/10.1145/1463434.1463463

  22. Y.-Y. Chiang, C.A. Knoblock, Recognizing text in raster maps. GeoInformatica 19(1), 1–27 (2014). ISSN: 1384-6175, 1573-7624. https://doi.org/10.1007/s10707-014-0203-9

    Article  Google Scholar 

  23. Y.-Y. Chiang, S. Leyk, Exploiting online gazetteer for fully automatic extraction of cartographic symbols, in Proceedings of the 27th International Cartographic Conference (2015). ISBN: 9788588783119

    Google Scholar 

  24. K.C. Clarke, Getting Started with Geographic Information Systems (Pearson, London, 2010). ISBN: 9780131494985

    Google Scholar 

  25. Y.-Y. Chiang, S. Leyk, C.A. Knoblock, Efficient and robust graphics recognition from historical maps, in Graphics Recognition. New Trends and Challenges: 9th International Workshop, GREC 2011, Seoul, Korea, September 15–16, 2011, Revised Selected Papers, vol. 7423, ed. by Y.-B. Kwon, J.-M. Ogier. Lecture Notes in Computer Science, GREC’11 (Springer, Berlin, 2013), pp. 25–35. ISBN: 9783642368233. https://doi.org/10.1007/978-3-642-36824-0_3

    Chapter  Google Scholar 

  26. Y.-Y. Chiang, S. Leyk, C.A. Knoblock, A survey of digital map processing techniques, in ACM Comput. Surv. 47(1), 1–44 (2014). ISSN: 0360-0300. https://doi.org/10.1145/2557423

    Article  Google Scholar 

  27. W. Duan, Y.-Y. Chiang, C.A. Knoblock, V. Jain, D. Feldman, J.H. Uhl, S. Leyk, Automatic alignment of geographic features in contemporary vector data and historical maps, in Proceedings of the 1st Workshop on Artificial Intelligence and Deep Learning for Geographic Knowledge Discovery (ACM, New York, 2017), pp. 45–54

    Google Scholar 

  28. W. Duan, Y. Chiang, C.A. Knoblock, S. Leyk, J. Uhl, Automatic generation of precisely delineated geographic features from georeferenced historical maps using deep learning, in Proceedings of the AutoCarto (2018)

    Google Scholar 

  29. ESRI, ESRI shapefile technical description, Tech. rep. ESRI, 1998

    Google Scholar 

  30. S. Frischknecht, A. Carosio, Raster-based methods to extract structured information from scanned topographic maps, in International Archives of Photogrammetry and Remote Sensing, vol. 32, Part 3-4W2 (1997), pp. 1–5

    Google Scholar 

  31. S. Frischknecht, E. Kanani, Automatic interpretation of scanned topographic maps: a raster-based approach, in Graphics Recognition Algorithms and Systems, vol. 1398 (Springer, Berlin, 1997), pp. 207–220. ISBN: 9783540643814. https://doi.org/10.1007/3-540-64381-8_50

    Chapter  Google Scholar 

  32. I.N. Gregory, P.S. Ell, Historical GIS: Technologies, Methodologies, and Scholarship, vol. 39. Cambridge Studies in Historical Geography (Cambridge University Press, Cambridge, 2007). ISBN: 9781139467711

    Google Scholar 

  33. B. Godfrey, H. Eveleth, An adaptable approach for generating vector features from scanned historical thematic maps using image enhancement and remote sensing techniques in a in a geographic information system. J. Map Geogr. Libr., 18–36 (2015). ISSN: 1542-0353. https://doi.org/10.1080/15420353.2014.1001107

    Article  Google Scholar 

  34. J. Gelernter, MapSearch: a protocol and prototype application to find maps, PhD thesis. Rutgers, The State University of New Jersey, 2008

    Google Scholar 

  35. D. Garijo, Y. Gil, A. Harth, Challenges for provenance analytics over geospatial data, in Provenance and Annotation of Data and Processes, vol. 8628, ed. by B. Ludäscher, B. Plale. Lecture Notes in Computer Science (Springer, Berlin, 2015), pp. 261–263

    Google Scholar 

  36. C.R. Greenwalt, M.E. Shultz, Principles of error theory and cartographic applications, Tech. rep. 1962

    Google Scholar 

  37. K. Janowicz, S. Scheider, T. Pehle, G. Hart, Geospatial semantics and linked spatiotemporal data – past, present, and future. Semantic Web 3(4), 321–332 (2012). https://doi.org/10.3233/SW-2012-0077

    Google Scholar 

  38. L. Kurashige, Rethinking anti-immigrant racism: lessons from the los angeles vote on the 1920 Alien land law. South. Calif. Q. 95(3), 265–283 (2013). ISSN: 0038-3929. https://doi.org/10.1525/scq.2013.95.3.265

    Article  Google Scholar 

  39. A. Khotanzad, E. Zink, Contour line and geographic feature extraction from USGS color topographical paper maps. IEEE Trans. Pattern Anal. Mach. Intell. 25(1), 18–31 (2003). Issn: 0162-8828. https://doi.org/10.1109/TPAMI.2003.1159943

    Article  Google Scholar 

  40. S. Leyk, R. Boesch, R. Weibel, A conceptual framework for uncertainty investigation in map-based land cover change modelling. Trans. GIS 9(3), 291–322 (2005). https://doi.org/10.1111/j.1467-9671.2005.00220.x eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1111/j.1467-9671.2005.00220.x

    Article  Google Scholar 

  41. H. Lin, Y.-Y. Chiang, An uncertainty aware method for geographic data conflation, in Proceedings of the 7th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2018 (ACM, Seattle, 2018), pp. 20–27. ISBN: 978-1-4503-6041-8. https://doi.org/10.1145/3282834.3282842

    Book  Google Scholar 

  42. H. Lin, Y.-Y. Chiang, SRC: automatic extraction of phrase-level map labels from historical maps. SIGSPATIAL Spec. 9(3), 14–15 (2018)

    Article  Google Scholar 

  43. H. Li, J. Liu, X. Zhou, Intelligent map reader: a framework for topographic map understanding with deep learning and gazetteer. IEEE Access 6, 25363–25376 (2018). ISSN: 2169-3536. https://doi.org/10.1109/ACCESS.2018.2823501

    Article  Google Scholar 

  44. J. Long, E. Shelhamer, T. Darrell, Fully convolutional networks for semantic segmentation, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2015), pp. 3431–3440

    Google Scholar 

  45. S. Manson, J. Schroeder, D. Van Riper, S. Ruggles, et al., IPUMS National Historical Geographic Information System: Version 12.0 [Database], Minneapolis: University of Minnesota (2017)

    Google Scholar 

  46. W.B. Mitchell, GIRAS: a geographic information retrieval and analysis system for handling land use and land cover data, 1059. US Govt. Print. Off. (1977)

    Google Scholar 

  47. G. Nagy, A. Samal, S. Seth, T. Fisher, et al., Reading street names from maps-technical challenges, in Proceedings of GIS/LIS (1997)

    Google Scholar 

  48. L. Page, S. Brin, R. Motwani, T. Winograd, The PageRank citation ranking: bringing order to the web, Tech. rep. Stanford InfoLab, 1999

    Google Scholar 

  49. A. Pezeshk, Feature extraction and text recognition from scanned color topographic maps, PhD thesis, Pennsylvania State University, 2011

    Google Scholar 

  50. R. Simon, E. Barker, L. Isaksen, et al., Linking early geospatial documents, one place at a time: annotation of geographic documents with recogito, in e- (2015). http://oro.open.ac.uk/43613/

    Google Scholar 

  51. R. Simon, C. Sadilek, J. Korb, M. Baldauf, B. Haslhofer, Tag clouds and old maps: annotations as linked spatiotemporal data in the cultural heritage domain, in Workshop On Linked Spatiotemporal Data, Zurich, Switzerland (2010)

    Google Scholar 

  52. R. Simon, P. Pilgerstorfer, L. Isaksen, E. Barker, Towards semi-automatic annotation of toponyms on old maps. e - Perimetron 9(3), 105–128 (2014)

    Google Scholar 

  53. K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale image recognition, in 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, Conference Track Proceedings (2015)

    Google Scholar 

  54. T. Tambassi, The Philosophy of Geo-Ontologies (Springer, Berlin, 2018)

    Book  Google Scholar 

  55. P.H.S. Torr, A. Zisserman, MLESAC: a new robust estimator with application to estimating image geometry. Comp. Vision Image Underst. 78(1), 138–156 (2000). ISSN: 1077-3142. https://doi.org/10.1006/cviu.1999.0832

    Article  Google Scholar 

  56. J.H. Uhl, S. Leyk, Y.-Y. Chiang, W. Duan, C.A. Knoblock, Extracting human settlement footprint from historical topographic map series using context-based machine learning, in IET Conference Proceedings (2017)

    Google Scholar 

  57. J.H. Uhl, S. Leyk, Y.-Y. Chiang, W. Duan, C.A. Knoblock, Spatialising uncertainty in image segmentation using weakly supervised convolutional neural networks: a case study from historical map processing. IET Image Process. 12(11), 2084–2091 (2018)

    Article  Google Scholar 

  58. J. Uhl, S. Leyk, Y.-Y. Chiang, W. Duan, C. Knoblock, Map archive mining: visual-analytical approaches to explore large historical map collections. ISPRS Int. J. Geo-Inform. 7(4), 148 (2018)

    Article  Google Scholar 

  59. J.H. Uhl, Spatio-temporal information extraction under uncertainty using multi-source data integration and machine learning: applications to human settlement modelling, PhD thesis, University of Colorado (2019)

    Google Scholar 

  60. J. Weinman, Toponym recognition in historical maps by gazetteer alignment, in Proceedings of the 12th International Conference on Document Analysis and Recognition (2013), pp. 1044–1048. https://doi.org/10.1109/ICDAR.2013.209

  61. J. Weinman, Geographic and style models for historical map alignment and toponym recognition, in 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1 (IEEE, Piscataway, 2017), pp. 957–964

    Google Scholar 

  62. R. Yu, Z. Luo, Y.-Y. Chiang, Recognizing text in historical maps using maps from multiple time periods, in 2016 23rd International Conference on Pattern Recognition (ICPR) (IEEE, Piscataway, 2016), pp. 3993–3998

    Google Scholar 

  63. H. Zhao, J. Shi, X. Qi, X. Wang, J. Jia, Pyramid scene parsing network, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017), pp. 2881–2890

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2020 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Chiang, YY., Duan, W., Leyk, S., Uhl, J.H., Knoblock, C.A. (2020). Creating Structured, Linked Geographic Data from Historical Maps: Challenges and Trends. In: Using Historical Maps in Scientific Studies. SpringerBriefs in Geography. Springer, Cham. https://doi.org/10.1007/978-3-319-66908-3_3

Download citation

Publish with us

Policies and ethics