Skip to main content

Towards Automatically Retrieving Discoveries and Generating Ontologies

  • Conference paper
  • First Online:
Information Science and Applications

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 339))

Abstract

For the web to become intelligent, machines needs to be able to extract the nature and semantics of various concepts and the relationships between them. Most approaches focus on methods involving manually teaching the machine about different entities, their properties manually constructing an ontology. This paper discusses an approach where the necessary metadata is extracted automatically from Wikipedia, the online encyclopedia. This metadata is then used to compare documents allowing them to be clustered together so that similar documents can be identified allowing alternative knowledge to be discovered. The results show that an ontology indicating the relationships between types of documents can be automatically identified and also alternative knowledge can be discovered.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. O’Reilly, T.: What is Web 2.0: Design Patterns and Business Models for the next generation of Software, Communications and Strategies, no.1, p.17, first quarter, (2007)

    Google Scholar 

  2. Wesch, M.: Information R/Evolution, Kansas State University, www.youtube.com/watch?v=-4CV05HyAbM

    Google Scholar 

  3. Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web, in Scientific American, May 2001.

    Google Scholar 

  4. Allemang, D, Hendler, J.: RDF - the Basis of the Semantic Web, in Semantic Web for the Working Ontologist, 2nd Ed. 2011.

    Google Scholar 

  5. Gardenfors, P.: How to make the Semantic Web more Semantic, in Formal Ontology in Information Systems, FOIS 2004.

    Google Scholar 

  6. Waters, N.: Why you can’t Cite Wikipedia in my Class, Communications of the ACM, Vol. 50, No. 9, (2007)

    Google Scholar 

  7. Lih, A.: Wikipedia as Participatory Journalism: Reliable Sources? Metrics for Evaluating Collaborative Media as a News Resource, 5th International Symposium on Online Journalism, Austin Texas (2004)

    Google Scholar 

  8. Chesney, T.: An Empirical Examination of wikipedia’s Credibility, First Monday, Vol.11, No. 11 (2006)

    Google Scholar 

  9. Halavais, A., Lackoff, D.: An Analysis of Topic Coverage of Wikipedia, Journal of Computer-Mediated Communication, Vol. 13, Issue 2, (2008), 429-440

    Google Scholar 

  10. Auer, S., Lehman, J.: What have Innsbruck and Liepzig in common? Extracting Semantics from Wiki Content, The Semantic Web Research and Applications, Lecture Notes in Computer Science, Volume 4519, (2007) 503-517

    Google Scholar 

  11. Volkel, M., Krotzsch, M. Vrandecic, D. Haller, H, Studer, R.: Semantic Wikipedia, WWW ’06 Proceedings of the 15th International Conference on World Wide Web, (2006) 585-594.

    Google Scholar 

  12. Krotzsch, M. Vrandecic, D. Volkel, M.: Wikipedia and the Semantic Web, the Missing Links”, Proceedings of Wikimania 2005, Frankfurt, Germany, (2005)

    Google Scholar 

  13. Rayson, P, Garside, R.: Comparing Corpora using Frequency Profiling, in Procceedings of the Workshop on Comparing Corpora held in conjunction with the 38th annual meeting of the Association for Computational Linguistics, ACL 2000, Hong Kong, (2000).

    Google Scholar 

  14. Cosh, K., Burns, R., Daniel, T.: Content Clouds, Classifying Content in Web 2.0, Library Review, Vol. 57, Issue 9, (2008) 722-729.

    Google Scholar 

  15. Robert, P., Escoufier, Y.: A Unifying Tool for Linear Multivariate Statistical Methods: the RV-Coefficient, Journal of the Royal Statistical Society, Vol. 25, No. 3 (1976)

    Google Scholar 

  16. Cosh, K.: On Automatically Extracting Discoveries from User Generated Content, In Proceedings of CISIS 2014, the 8th International Conference on Complex, Intelligent, and Software Intensive Systems (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cosh, K. (2015). Towards Automatically Retrieving Discoveries and Generating Ontologies. In: Kim, K. (eds) Information Science and Applications. Lecture Notes in Electrical Engineering, vol 339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-46578-3_72

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-46578-3_72

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-46577-6

  • Online ISBN: 978-3-662-46578-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics