Skip to main content

A Survey on Efficient Management of Large RDF Graph for Semantic Web in Big Data

  • Conference paper
  • First Online:
Emerging Technologies in Data Mining and Information Security

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1300))

  • 1006 Accesses

Abstract

Semantic web expands the web principles by allowing the computer to understand and easily analyze the data on web. Presently, RDF is used as a triplet model for the semantic web. There is primary need for efficient storage, data retrieval from RDF graph of semantic web in live world application. This paper compares work done in semantic web and also discusses the various challenges involved including scalability, real-time efficient storage and query processing in graph oriented distributed database. The different approaches compared are direct relational mapping approach, entity-based perspectives with different indexing techniques for querying linked data for multilevel indexing framework and graph-based approach. This paper provides an overview of the features and techniques for storing the RDF graph and managing the metadata of data for the semantic web.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Alexaki, S., Karvounarakis, G., Christophides, V., Tolle, K., Plexousakis, D.: The ICS-forth RDF suite: managing voluminous RDF description bases. In: 2nd International Workshop on the Semantic Web, pp. 1–13 (2001)

    Google Scholar 

  2. Allemang, D., Hendler, J.A..: Semantic Web for the Working Ontologist: Effective Modeling in RDFS and OWL, 2nd edn, pp. 10. Morgan Kaufmann, San Francisco (2011)

    Google Scholar 

  3. Cure, O., Guillaume, B.: RDF Database Systems Triples Storage and SPARQL Query Processing, 1st edn, pp. 47–69. Elsevier, Waltham, MA, USA (2015)

    Google Scholar 

  4. Angles, R., Gutierrez, C..: Querying RDF data from a graph database perspective. In: Proceedings of the Second European Semantic Web Conference (ESWC), pp. 346–360 (2005)

    Google Scholar 

  5. Aluç, G., Özsu, T., Hartig, O., Daudjee, K.: Executing queries over schemaless RDF databases. In: Proceedings of the 31st International Conference on Data Engineering, pp. 807–818 (2015)

    Google Scholar 

  6. Beckett, D.: The design and implementation of the Redland RDF application framework. In: Proceedings of the 10th International Conference on World Wide Web, pp. 449–456. ACM, NY Press (2001)

    Google Scholar 

  7. Bönström, V., Schweppe, H., Hinze, A.: Storing RDF as a graph. In: Proceedings of the First Conference on Latin American Web Congress, p. 27. IEEE Computer Society, Washington, DC (2003)

    Google Scholar 

  8. Brickley, D., Guha, R.: RDF vocabulary description language 1.0: RDF schema. W3C Recommendation (2004)

    Google Scholar 

  9. Bornea, M.A., Dolby J., Srinivas K., Kementsietsidis A., Bhattacharjee B., Udrea O., Dantressangle P.: Building an efficient RDF store over a relational database. In: SIGMOD Conference, pp. 121–132, New York, NY, USA (2013)

    Google Scholar 

  10. Broekstra, J., Harmelen V., Kampman, A.: Sesame: a generic architecture for storing and querying RDF and RDF schema. In: The International Semantic Web Conference (ISWC), pp. 54–68 (2002)

    Google Scholar 

  11. Fernandez, J., Gutierrez, C., Martinez-Prieto, M.: Compact representation of large RDF data sets for publishing and exchange. In: International Semantic Web Conference, vol. 1, pp. 193–208 (2010)

    Google Scholar 

  12. Ladwig, G., Tran, T.: Linked data query processing strategies. In: International Semantic Web Conference (ISWC), vol. 1, pp. 453–469 (2010)

    Google Scholar 

  13. Arenas, M., Perez, J., Gutierrez, C.: Foundations of RDF databases. In: Reasoning Web, pp. 158–204 (2009)

    Google Scholar 

  14. Huang, J., Ren, K., Abadi, D.: Scalable SPARQL querying of large RDF graphs. In: PVLDB 4, pp. 1123–1134 (2011)

    Google Scholar 

  15. Mallidi, S., Bebee, B., Choi, D., Gupta, A., Gutmans, A., Khandelwal, A., Kiran, Y., McGaughy, B., Personick, M., Rajan, K., Rondelli, S., Ryazanov, A., Schmidt, M., Sengupta, K., Thompson, B., Vaidya, D., Wang, S.: Amazon neptune: graph data management in the cloud. In: International Semantic Web Conference, pp. 46–97, WA, USA (2018)

    Google Scholar 

  16. Cudré-Mauroux, P., Enchev, I., Groth, P.T., Fundatureanu, S., Haque, A., Harth, A., Keppmann, F.L., Sequeda, J., Miranker, P., Wylot, M.: NoSQL databases for RDF: an empirical evaluation. In: International Semantic Web Conference, vol. 2, pp. 310–325 (2013)

    Google Scholar 

  17. Hartig, O.: SPARQL for a web of linked data: semantics and computability. In: Proceedings of the 9th Extended Semantic Web Conference, pp. 8–23 (2012)

    Google Scholar 

  18. Peng, P., Özsu, T., Zou, L., Zhao, D., Chen, L.: Processing SPARQL queries over distributed RDF graphs. VLDB J. 243–268 (2016)

    Google Scholar 

  19. Lefranois, M., Zimmermann, A., Bakerally, N.: A SPARQL extension for generating RDF from heterogeneous formats, pp. 82–98. Springer, Berlin (2017)

    Google Scholar 

  20. Ozsu, T.: A survey of RDF data management systems. Front. Comput. Sci. 418–432 (2016)

    Google Scholar 

  21. Atre, M., Chaoji V., Hendler, J., Zaki, M.: Matrix “bit” loaded: a scalable lightweight join query processor for RDF data. In: Proceedings of the 19th International Conference on World Wide Web, pp. 41–50. ACM Press, New York, USA (2010)

    Google Scholar 

  22. Lei, Z., Özsu, T., Chen, L., Shen, X., Huang, R., Zhao, D.: gStore: a graph-based SPARQL query engine. VLDB J. 565–590 (2014)

    Google Scholar 

  23. Papailiou, N., Tsoumakos, D., Karras, P., Konstantinou, I., Koziris, N.: H2RDF+: an efficient data management system for big RDF graphs. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 909–912 (2014)

    Google Scholar 

  24. Weiss, C., Bernstein, A., Karras, P.: Hexastore: sextuple indexing for semantic web data management. In. Proceedings of the VLDB Endowment, pp. 1008–1019 (2008)

    Google Scholar 

  25. Castillo, R.: RDF mata view: indexing RDF data for SPARQL queries. In: 9th International Semantic Web Conference (2010)

    Google Scholar 

  26. Acharjya, D.P., Kauser Ahmed, P.: A survey on big data analytics: challenges, open research issues and tools. Int. J. Adv. Comput. Sci. Appl. (IJACSA) 7(2) (2016)

    Google Scholar 

  27. Fletcher, H., Beck, W.: Scalable indexing of RDF graphs for efficient join processing. In: Proceeding of the 18th ACM Conference on Information and Knowledge Management, pp. 1513–1516. ACM Press, New York (2009)

    Google Scholar 

  28. Harris, S., Shadbol, N., Lamb, N.: 4Store: the design and implementation of a clustered RDF store. In: Proceedings of the 5th International Workshop on Scalable Semantic Web Knowledge Base Systems (IWSSWBS), pp. 16–25 (2009)

    Google Scholar 

  29. Ladwig, G., Harth, A.: Cumulus RDF: linked data management on nested key-value stores. In: Proceedings of the 7th International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWBS) at the 10th International Semantic Web Conference (ISWC), pp. 30–42. Springer, Berlin (2011)

    Google Scholar 

  30. Zou, L., Ozsu, M.: Distancejoin: pattern match query in a large graph database. In: PVLDB, pp. 886–897 (2009)

    Google Scholar 

  31. Myung, J., Lee, G., Yeon, J.: SPARQL basic graph pattern processing with iterative MapReduce. In: Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, pp. 6–16. ACM Press, New York (2010)

    Google Scholar 

  32. Cattell, R.: Scalable SQL and NoSQL data stores. In: SIGMOD Rec., pp. 134–150, New York, NY, USA (2011)

    Google Scholar 

  33. Tsatsanifos, G., Sellis, T., Sacharidis, D.: On enhancing scalability for distributed RDF/S stores. In: Proceedings of the 14th International Conference on Extending Database Technology, pp. 141–152. ACM Press, New York (2011)

    Google Scholar 

  34. AWS Product: Amazon Neptune—Graph Oriented Distributed Database. https://aws.amazon.com/neptune/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ashutosh A. Abhangi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Abhangi, A.A., Iyer, S. (2021). A Survey on Efficient Management of Large RDF Graph for Semantic Web in Big Data. In: Hassanien, A.E., Bhattacharyya, S., Chakrabati, S., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 1300. Springer, Singapore. https://doi.org/10.1007/978-981-33-4367-2_24

Download citation

Publish with us

Policies and ethics