Abstract
Historical geographic data are essential for a variety of studies of cancer and environmental epidemiology, urbanization, and landscape ecology. However, existing data sources typically contain only contemporary information. Historical maps hold a great deal of detailed geographic information at various times in the past. Yet, finding relevant maps is difficult, and the map content is not machine-readable. This chapter presents the challenges and trends in building a map processing, modeling, linking, and publishing framework. The framework will enable querying historical map collections as a unified and structured spatiotemporal source in which individual geographic phenomena (extracted from maps) are modeled (described) with semantic descriptions and linked to other data sources (e.g., DBpedia). This framework will allow making use of historical geographic datasets from a variety of maps, efficiently, over large geographic extents. Realizing such a framework poses significant research challenges in multiple fields in computer science including digital map processing, data integration, and the Semantic Web technologies, and other disciplines such as spatial, social, and health sciences. Tackling these challenges will not only advance research in computer science and geographic information science but also present a unique opportunity for interdisciplinary research.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
This chapter is based on a previous vision paper presented at the 2015 ACM SIGSPATIAL Conference [Chi15] and the First Place of the Best Vision Paper Award sponsored by the Computing Research Association’s Computing Community Consortium under the CCC Blue Sky initiative.
- 7.
- 8.
- 9.
This is just an example. By no means the author is an expert of soil contamination or growing grapefruits.
- 10.
To appreciate this difficulty from experience, the reader is encouraged to explore how long it would take to find a large-scale map of 1941 Budapest.
- 11.
- 12.
- 13.
- 14.
- 15.
A USGS historical topographic map with the 600 DPI (dots-per-inch) scan resolution is about 12, 000 × 12, 000 pixels.
- 16.
National Science Foundation (United States).
- 17.
The reader is referred to [Cla10] for a detailed introduction to GISs and GIS data formats.
- 18.
- 19.
- 20.
- 21.
- 22.
- 23.
For example, the URI, https://www.geonames.org/3020251/embrun.html, refer to the town Embrun in France. The reader is referred to the GeoNames Ontology website (http://www.geonames.org/ontology/documentation.html) for more examples about URIs and the Geo Semantic Web.
- 24.
- 25.
- 26.
See the full tutorial here: https://github.com/usc-isi-i2/Web-Karma/wiki/Working-with-geospatial-data/.
- 27.
- 28.
- 29.
The reader is referred to [Tam18] for an overview of ontologies and the ontologies that describes geographic data.
References
M.G. Arteaga, Historical map polygon and feature extractor, in MapInteract 2013, Proceedings of the 1st ACM SIGSPATIAL International Workshop on MapInteraction, November 5th, 2013, Orlando, Florida, USA (2013), pp. 66–71. https://doi.org/10.1145/2534931.2534932
B. Barz, T.C. van Dijk, B. Spaan, J. Denzler, Putting user reputation on the map: unsupervised quality control for crowdsourced historical data, in Proceedings of the 2nd ACM SIGSPATIAL Workshop on Geospatial Humanities, GeoHumanities’18 (ACM, New York, 2018), pp. 3:1–3:6. ISBN: 978-1-4503-6032-6. https://doi.org/10.1145/3282933.3282937
F. Bastani, S. He, S. Abbar, M. Alizadeh, H. Balakrishnan, S. Chawla, S. Madden, Machine-assisted map editing, in Proceedings of the 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPATIAL ’18 (ACM, New York, 2018), pp. 23–32. ISBN: 978-1-4503-5889-7. https://doi.org/10.1145/3274895.3274927
B. Budig, T.C. van Dijk, Active learning for classifying template matches in historical maps, in Discovery Science, ed. by N. Japkowicz, S. Matwin. Lecture Notes in Computer Science (Springer, Berlin, 2015), pp. 33–47. ISBN: 9783319242811, 9783319242828. https://doi.org/10.1007/978-3-319-24282-8_5
B. Budig, T.C.V. Dijk, A. Wolff, Matching labels and markers in historical maps: an algorithm with interactive postprocessing, in ACM Transactions on Spatial Algorithms and Systems (TSAS) 2.4 (2016), pp. 13:1–13:24. ISSN: 2374-0353. https://doi.org/10.1145/2994598
C.S. Beattie, 3D visualization models as a tool for reconstructing the historical landscape of the Ballona creek watershed, MA thesis, University of Southern California, 2014
C. Bizer, T. Heath, T. Berners-Lee, Linked data - the story so far. Int. J. Semant. Web Inf. Syst. 5(3), 1–22 (2009). ISSN: 1552-6283
B. Budig, T.C. van Dijk, F. Feitsch, M.G. Arteaga, Polygon consensus: smart crowdsourcing for extracting building footprints from historical maps, in Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPACIAL ’16 (ACM, New York, 2016), pp. 66:1–66:4. ISBN: 978-1-4503-4589-7. https://doi.org/10.1145/2996913.2996951
B. Budig, Efficient algorithms and user interaction for metadata extraction from historical maps, in Proceedings of the 2Nd ACM SIGSPATIAL PhD Workshop, SIGSPATIAL PhD ’15 (ACM, New York, 2016), pp. 4:1–4:4. ISBN: 978-1-4503-3980-3. https://doi.org/10.1145/2855680.2855841
B. Budig, Extracting spatial information from historical maps: algorithms and interaction, PhD thesis, University of Würzburg, 2018
B. Budig, T.C. van Dijk, F. Kirchner, Glyph miner: a system for efficiently extracting glyphs from early prints in the context of OCR, in 2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL) (2016), pp. 31–34
Y.-Y. Chiang, P. Chioh, S. Moghaddam, A training-by-example approach for symbol spotting from raster maps, in Proceedings of the 8th International Conference on Geographic Information Science (2014), pp. 264–269
C.-C. Chen, C.A. Knoblock, C. Shahabi, Y.-Y. Chiang, S. Thakkar, Automatically and accurately conflating orthoimagery and street maps, in Proceedings of the 12th Annual ACM International Workshop on Geographic Information Systems, GIS ’04 (ACM, New York, 2004), pp. 47–56. ISBN: 9781581139792. https://doi.org/10.1145/1032222.1032231
Y.-Y. Chiang, S. Moghaddam, S. Gupta, R. Fernandes, C.A. Knoblock, From map images to geographic names, in Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM, New York, 2014), pp. 581–584. ISBN: 9781450331319. https://doi.org/10.1145/2666310.2666374
Y.-Y. Chiang, S. Leyk, N.H. Nazari, S. Moghaddam, The impact of graphical quality on automatic text recognition in digital maps, in Proceedings of the 27th International Cartographic Conference (2015) ISBN: 9788588783119
Y.-Y. Chiang, S. Leyk, N.H. Nazari, S. Moghaddam, T.X. Tan, Assessing the impact of graphical quality on automatic text recognition in digital maps. Comput. Geosci. 93, 21–35 (2016). ISSN: 0098-3004. https://doi.org/10.1016/j.cageo.2016.04.013
Y.-Y. Chiang, Harvesting geographic features from heterogeneous raster maps, PhD thesis, Los Angeles, CA, USA: University of Southern California, 2010. ISBN: 9781124412498
Y.-Y. Chiang, Strabo: a complete system for label recognition in maps, in Proceedings of the 26th International Cartographic Conference (ICC’13) (2013), pp. 838–838. ISBN: 9781907075063
Y.-Y. Chiang, Querying historical maps as a unified, structured, and linked spatiotemporal source: vision paper, in Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems. GIS ’15 (ACM, New York, 2015), pp. 16:1–16:4. ISBN: 9781450339674. https://doi.org/10.1145/2820783.2820887
Y.-Y. Chiang, Unlocking textual content from historical maps - potentials and applications, trends, and outlooks, in Recent Trends in Image Processing and Pattern Recognition, ed. by K. Santosh, M. Hangarge, V. Bevilacqua, A. Negi (Springer, Singapore, 2017), pp. 111–124. ISBN: 978-981-10-4859-3
Y.-Y. Chiang, C.A. Knoblock, Automatic extraction of road intersection position, connectivity, and orientations from raster maps, in Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM Press, New York, 2008), pp. 1–10. ISBN: 9781605583235. https://doi.org/10.1145/1463434.1463463
Y.-Y. Chiang, C.A. Knoblock, Recognizing text in raster maps. GeoInformatica 19(1), 1–27 (2014). ISSN: 1384-6175, 1573-7624. https://doi.org/10.1007/s10707-014-0203-9
Y.-Y. Chiang, S. Leyk, Exploiting online gazetteer for fully automatic extraction of cartographic symbols, in Proceedings of the 27th International Cartographic Conference (2015). ISBN: 9788588783119
K.C. Clarke, Getting Started with Geographic Information Systems (Pearson, London, 2010). ISBN: 9780131494985
Y.-Y. Chiang, S. Leyk, C.A. Knoblock, Efficient and robust graphics recognition from historical maps, in Graphics Recognition. New Trends and Challenges: 9th International Workshop, GREC 2011, Seoul, Korea, September 15–16, 2011, Revised Selected Papers, vol. 7423, ed. by Y.-B. Kwon, J.-M. Ogier. Lecture Notes in Computer Science, GREC’11 (Springer, Berlin, 2013), pp. 25–35. ISBN: 9783642368233. https://doi.org/10.1007/978-3-642-36824-0_3
Y.-Y. Chiang, S. Leyk, C.A. Knoblock, A survey of digital map processing techniques, in ACM Comput. Surv. 47(1), 1–44 (2014). ISSN: 0360-0300. https://doi.org/10.1145/2557423
W. Duan, Y.-Y. Chiang, C.A. Knoblock, V. Jain, D. Feldman, J.H. Uhl, S. Leyk, Automatic alignment of geographic features in contemporary vector data and historical maps, in Proceedings of the 1st Workshop on Artificial Intelligence and Deep Learning for Geographic Knowledge Discovery (ACM, New York, 2017), pp. 45–54
W. Duan, Y. Chiang, C.A. Knoblock, S. Leyk, J. Uhl, Automatic generation of precisely delineated geographic features from georeferenced historical maps using deep learning, in Proceedings of the AutoCarto (2018)
ESRI, ESRI shapefile technical description, Tech. rep. ESRI, 1998
S. Frischknecht, A. Carosio, Raster-based methods to extract structured information from scanned topographic maps, in International Archives of Photogrammetry and Remote Sensing, vol. 32, Part 3-4W2 (1997), pp. 1–5
S. Frischknecht, E. Kanani, Automatic interpretation of scanned topographic maps: a raster-based approach, in Graphics Recognition Algorithms and Systems, vol. 1398 (Springer, Berlin, 1997), pp. 207–220. ISBN: 9783540643814. https://doi.org/10.1007/3-540-64381-8_50
I.N. Gregory, P.S. Ell, Historical GIS: Technologies, Methodologies, and Scholarship, vol. 39. Cambridge Studies in Historical Geography (Cambridge University Press, Cambridge, 2007). ISBN: 9781139467711
B. Godfrey, H. Eveleth, An adaptable approach for generating vector features from scanned historical thematic maps using image enhancement and remote sensing techniques in a in a geographic information system. J. Map Geogr. Libr., 18–36 (2015). ISSN: 1542-0353. https://doi.org/10.1080/15420353.2014.1001107
J. Gelernter, MapSearch: a protocol and prototype application to find maps, PhD thesis. Rutgers, The State University of New Jersey, 2008
D. Garijo, Y. Gil, A. Harth, Challenges for provenance analytics over geospatial data, in Provenance and Annotation of Data and Processes, vol. 8628, ed. by B. Ludäscher, B. Plale. Lecture Notes in Computer Science (Springer, Berlin, 2015), pp. 261–263
C.R. Greenwalt, M.E. Shultz, Principles of error theory and cartographic applications, Tech. rep. 1962
K. Janowicz, S. Scheider, T. Pehle, G. Hart, Geospatial semantics and linked spatiotemporal data – past, present, and future. Semantic Web 3(4), 321–332 (2012). https://doi.org/10.3233/SW-2012-0077
L. Kurashige, Rethinking anti-immigrant racism: lessons from the los angeles vote on the 1920 Alien land law. South. Calif. Q. 95(3), 265–283 (2013). ISSN: 0038-3929. https://doi.org/10.1525/scq.2013.95.3.265
A. Khotanzad, E. Zink, Contour line and geographic feature extraction from USGS color topographical paper maps. IEEE Trans. Pattern Anal. Mach. Intell. 25(1), 18–31 (2003). Issn: 0162-8828. https://doi.org/10.1109/TPAMI.2003.1159943
S. Leyk, R. Boesch, R. Weibel, A conceptual framework for uncertainty investigation in map-based land cover change modelling. Trans. GIS 9(3), 291–322 (2005). https://doi.org/10.1111/j.1467-9671.2005.00220.x eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1111/j.1467-9671.2005.00220.x
H. Lin, Y.-Y. Chiang, An uncertainty aware method for geographic data conflation, in Proceedings of the 7th ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2018 (ACM, Seattle, 2018), pp. 20–27. ISBN: 978-1-4503-6041-8. https://doi.org/10.1145/3282834.3282842
H. Lin, Y.-Y. Chiang, SRC: automatic extraction of phrase-level map labels from historical maps. SIGSPATIAL Spec. 9(3), 14–15 (2018)
H. Li, J. Liu, X. Zhou, Intelligent map reader: a framework for topographic map understanding with deep learning and gazetteer. IEEE Access 6, 25363–25376 (2018). ISSN: 2169-3536. https://doi.org/10.1109/ACCESS.2018.2823501
J. Long, E. Shelhamer, T. Darrell, Fully convolutional networks for semantic segmentation, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2015), pp. 3431–3440
S. Manson, J. Schroeder, D. Van Riper, S. Ruggles, et al., IPUMS National Historical Geographic Information System: Version 12.0 [Database], Minneapolis: University of Minnesota (2017)
W.B. Mitchell, GIRAS: a geographic information retrieval and analysis system for handling land use and land cover data, 1059. US Govt. Print. Off. (1977)
G. Nagy, A. Samal, S. Seth, T. Fisher, et al., Reading street names from maps-technical challenges, in Proceedings of GIS/LIS (1997)
L. Page, S. Brin, R. Motwani, T. Winograd, The PageRank citation ranking: bringing order to the web, Tech. rep. Stanford InfoLab, 1999
A. Pezeshk, Feature extraction and text recognition from scanned color topographic maps, PhD thesis, Pennsylvania State University, 2011
R. Simon, E. Barker, L. Isaksen, et al., Linking early geospatial documents, one place at a time: annotation of geographic documents with recogito, in e- (2015). http://oro.open.ac.uk/43613/
R. Simon, C. Sadilek, J. Korb, M. Baldauf, B. Haslhofer, Tag clouds and old maps: annotations as linked spatiotemporal data in the cultural heritage domain, in Workshop On Linked Spatiotemporal Data, Zurich, Switzerland (2010)
R. Simon, P. Pilgerstorfer, L. Isaksen, E. Barker, Towards semi-automatic annotation of toponyms on old maps. e - Perimetron 9(3), 105–128 (2014)
K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale image recognition, in 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7–9, 2015, Conference Track Proceedings (2015)
T. Tambassi, The Philosophy of Geo-Ontologies (Springer, Berlin, 2018)
P.H.S. Torr, A. Zisserman, MLESAC: a new robust estimator with application to estimating image geometry. Comp. Vision Image Underst. 78(1), 138–156 (2000). ISSN: 1077-3142. https://doi.org/10.1006/cviu.1999.0832
J.H. Uhl, S. Leyk, Y.-Y. Chiang, W. Duan, C.A. Knoblock, Extracting human settlement footprint from historical topographic map series using context-based machine learning, in IET Conference Proceedings (2017)
J.H. Uhl, S. Leyk, Y.-Y. Chiang, W. Duan, C.A. Knoblock, Spatialising uncertainty in image segmentation using weakly supervised convolutional neural networks: a case study from historical map processing. IET Image Process. 12(11), 2084–2091 (2018)
J. Uhl, S. Leyk, Y.-Y. Chiang, W. Duan, C. Knoblock, Map archive mining: visual-analytical approaches to explore large historical map collections. ISPRS Int. J. Geo-Inform. 7(4), 148 (2018)
J.H. Uhl, Spatio-temporal information extraction under uncertainty using multi-source data integration and machine learning: applications to human settlement modelling, PhD thesis, University of Colorado (2019)
J. Weinman, Toponym recognition in historical maps by gazetteer alignment, in Proceedings of the 12th International Conference on Document Analysis and Recognition (2013), pp. 1044–1048. https://doi.org/10.1109/ICDAR.2013.209
J. Weinman, Geographic and style models for historical map alignment and toponym recognition, in 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1 (IEEE, Piscataway, 2017), pp. 957–964
R. Yu, Z. Luo, Y.-Y. Chiang, Recognizing text in historical maps using maps from multiple time periods, in 2016 23rd International Conference on Pattern Recognition (ICPR) (IEEE, Piscataway, 2016), pp. 3993–3998
H. Zhao, J. Shi, X. Qi, X. Wang, J. Jia, Pyramid scene parsing network, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017), pp. 2881–2890
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2020 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Chiang, YY., Duan, W., Leyk, S., Uhl, J.H., Knoblock, C.A. (2020). Creating Structured, Linked Geographic Data from Historical Maps: Challenges and Trends. In: Using Historical Maps in Scientific Studies. SpringerBriefs in Geography. Springer, Cham. https://doi.org/10.1007/978-3-319-66908-3_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-66908-3_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66907-6
Online ISBN: 978-3-319-66908-3
eBook Packages: Earth and Environmental ScienceEarth and Environmental Science (R0)