Abstract
The majority of datasets on Open Government Data (OGD) portals are stored in comma-separated values (CSV) file. Publishing CSV data as a Linked Open Data (LOD) on the Web is an active field of research. However, there are very few effective applications have been developed with this purpose. Linked Data refer many ways for connecting and publishing structured data to data consumers, but available datasets are in CSV format. Therefore, publishing the CSV model on the webpage, it is needed to change CSV in RDF file format. Many methods and tools have been proposed for data mapping and publishing, however, most of them are not followed by the W3C recommendations rules. The contribution and goal of this paper are to develop a Semantic approach that can effectively convert CSV data into RDF data with rich semantics and release RDF data on the web using LOD principles. We utilize Semantic Web resources and W3C recommendation rules in automatic data publishing method, which enables distributed system for scalability. We apply the proposed method to existing CSVW Implementation Report-W3C and U.S Government’s application (data.gov). Our experimental results indicate that the proposed approach successfully converts CSV to RDF data and publish those RDF as LOD on the Web, with adequate performance on any sized datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ermilov, I., Auer, S., Stadler, C.: User-driven semantic mapping of tabular data. In: Proceedings of the 9th International Conference on Semantic Systems (I-SEMANTICS), Graz, Austria, pp. 105–112 (2013)
Shafranovich, Y.: Common Format and MIME Type for Comma-Separated Values (CSV) Files. IETF RFC 4180 (2005). http://tools.ietf.org/html/rfc4180. Accessed 30 Jan 2019
Tennison, J., Kellogg, G. (eds.): Model for Tabular Data and Metadata on the Web. W3C Recommendation, 17 December 2015. http://www.w3.org/TR/tabular-data-model/. Accessed 3 Dec 2018
Dadzie, A.S., Rowe, M.: Approaches to visualizing linked data: a survey. Semant. Web 2, 89–124 (2011)
Ermilov, I., Auer, S., Stadler, C.: CSV2RDF: user-driven CSV to RDF mass conversion framework. In: Proceedings of the ISEM 2013, Graz, Austria (2013)
Lassila, O., Swick, R.R.: Resource Description Framework (RDF) Model and Syntax Specification. W3C Recommendation, 22 February 1999. https://www.w3.org/TR/1999/REC-rdf-syntax-19990222/. Accessed 13 Dec 2018
Lassila, O., Swick, R.R.: Resource Description Framework (RDF) Model and Syntax. W3C Working Draft 16 February 1998. https://www.w3.org/TR/WD-rdf-syntax-971002/. Accessed 10 Nov 2018
Bizer, C., Cyganiak, R., Heath, T.: How to Publish Linked Data on the Web. http://wifo5-03.informatik.uni-mannheim.de/bizer/pub/LinkedDataTutorial/. Accessed 10 Nov 2018
Tennison, J.: Linked CSV. Unofficial Draft, 08 March 2013. Open Data Institute. http://jenit.github.io/linked-csv/. Accessed 10 Sept 2018
Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Semant. Web Inf. Syst. (IJSWIS) 5(3), 1–22 (2009)
Behkamal, B., Kahani, M., Paydar, S., Dadkhah, M., Sekhavaty, E.: Publishing Persian linked data: challenges and lessons learned. In: Proceedings of the 5th International Symposium on Telecommunications, Tehran, Iran, pp. 732–737 (2010)
Sarkar, A., Marjit, U., Biswas, U.: Linked data generation for the university data from legacy database. Int. J. Web Semant. Technol. 2(3), 21–31 (2010)
Rowe, M., Ciravegna, F.: Data.dcs: converting legacy data into linked data. In: Proceedings of the Linked Data on the Web Workshop, World Wide Web Conference, North Carolina, USA (2010)
Li, J., Zhao, Y.: A case study on linked data generation and consumption. In: Proceedings of the Linked Data on the Web (LDOW2008), Beijing, China (2008)
Maali, F., Cyganiak, R., Peristeras, V.: A publishing pipeline for Linked Government Data. In: Proceedings of the 9th Extended Semantic Web Conference (ESWC2012), Greece, pp. 778–792 (2012)
Hyland, B., Atemezing, G., VillazĂłn-Terrazas, B.: Best Practices for Publishing Linked Data. W3C Working Group Note, 09 January 2014. http://www.w3.org/TR/ld-bp/. Accessed 10 Sept 2018
Polfliet, S., Ichise, R.: Automated mapping generation for converting databases into linked data. In: Proceedings of 9th International Semantic Web Conference (ISWC2010), Shanghai, China, pp. 173–176 (2010)
Haase, P., Schmidt, M., Schwarte, A.: The information workbench as a self-service platform for linked data applications. In: Proceedings of the 2nd International Workshop on Consuming Linked Data (COLD), Bonn, Germany, pp. 119–124 (2011)
Speicher, S., Arwe, J., Malhotra, A.: Linked Data Platform 1.0. W3C Recommendation, 26 February 2015. https://www.w3.org/TR/ldp/. Accessed 11 July 2018
Man, K., Mutz, A.: Transparent Content Negotiation in HTTP. Network Working Group (1998). https://tools.ietf.org/html/rfc2295. Accessed 25 Sept 2018
Heath, T., Bizer, C.: Linked Data: Evolving the Web into a Global Data Space, vol. 1. Morgan & Claypool (2011)
Hyland, B., Atemezing, G.: Linked Data Glossary. W3C Working Group Note, 27 June 2013. https://www.w3.org/TR/ld-glossary/. Accessed 20 Oct 2018
Mahmud, S.M.H., Hossin, M.A., Jahan, H., Noori, S.R.H., Bhuiyan, T.: CSV-ANNOTATE: generate annotated tables from CSV file. In: Proceedings of the 2018 International Conference on Artificial Intelligence and Big Data (ICAIBD), Chengdu, China, pp. 71–75 (2018)
Mahmud, S.M.H., Hossin, M.A., Jahan, H., Noori, S.R.H., Hossain, M.F.: CSV2RDF: generating RDF data from CSV file using semantic web technologies. J. Theor. Appl. Inf. Technol. 96(20), 6889–6902 (2018)
Kellogg, G.: CSVW Implementation Report. W3C Document, 28 October 2015. https://w3c.github.io/csvw/tests/reports/index.html. Accessed 20 Jan 2019
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Mahmud, S.M.H., Hossin, M.A., Hasan, M.R., Jahan, H., Noori, S.R.H., Ahmed, M.R. (2020). Publishing CSV Data as Linked Data on the Web. In: Singh, P., Panigrahi, B., Suryadevara, N., Sharma, S., Singh, A. (eds) Proceedings of ICETIT 2019. Lecture Notes in Electrical Engineering, vol 605. Springer, Cham. https://doi.org/10.1007/978-3-030-30577-2_72
Download citation
DOI: https://doi.org/10.1007/978-3-030-30577-2_72
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30576-5
Online ISBN: 978-3-030-30577-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)