Abstract
DALI is a practical system that exploits Linked Data to provide federated entity search and spatial exploration across hundreds of information sources containing Open and Enterprise data about cities, stored either in tabular files or in their original enterprise systems. The system lifts data into a meaningful linked structure with explicit semantics and supports novel contextual search and retrieval tasks by identifying related entities across models and data sources. We evaluate it in two pilot scenarios. In the first, data engineers bring together public and enterprise datasets about public safety. In the second, knowledge engineers and domain experts build a view of health and social care providers for vulnerable populations. We show that our approach can re-use data assets and provides better results than pure text-based approaches in finding relevant information and in satisfying specific information needs.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Lopez, V., Stephenson, M., Kotoulas, S., Tommasi, P. (2015). Data Access Linking and Integration with DALI: Building a Safety Net for an Ocean of City Data. In: Arenas, M., et al. The Semantic Web - ISWC 2015. ISWC 2015. Lecture Notes in Computer Science, vol 9367. Springer, Cham. https://doi.org/10.1007/978-3-319-25010-6_11
DOI: https://doi.org/10.1007/978-3-319-25010-6_11
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25009-0
Online ISBN: 978-3-319-25010-6
eBook Packages: Computer Science (R0)