Abstract
For the web to become intelligent, machines needs to be able to extract the nature and semantics of various concepts and the relationships between them. Most approaches focus on methods involving manually teaching the machine about different entities, their properties manually constructing an ontology. This paper discusses an approach where the necessary metadata is extracted automatically from Wikipedia, the online encyclopedia. This metadata is then used to compare documents allowing them to be clustered together so that similar documents can be identified allowing alternative knowledge to be discovered. The results show that an ontology indicating the relationships between types of documents can be automatically identified and also alternative knowledge can be discovered.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
O’Reilly, T.: What is Web 2.0: Design Patterns and Business Models for the next generation of Software, Communications and Strategies, no.1, p.17, first quarter, (2007)
Wesch, M.: Information R/Evolution, Kansas State University, www.youtube.com/watch?v=-4CV05HyAbM
Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web, in Scientific American, May 2001.
Allemang, D, Hendler, J.: RDF - the Basis of the Semantic Web, in Semantic Web for the Working Ontologist, 2nd Ed. 2011.
Gardenfors, P.: How to make the Semantic Web more Semantic, in Formal Ontology in Information Systems, FOIS 2004.
Waters, N.: Why you can’t Cite Wikipedia in my Class, Communications of the ACM, Vol. 50, No. 9, (2007)
Lih, A.: Wikipedia as Participatory Journalism: Reliable Sources? Metrics for Evaluating Collaborative Media as a News Resource, 5th International Symposium on Online Journalism, Austin Texas (2004)
Chesney, T.: An Empirical Examination of wikipedia’s Credibility, First Monday, Vol.11, No. 11 (2006)
Halavais, A., Lackoff, D.: An Analysis of Topic Coverage of Wikipedia, Journal of Computer-Mediated Communication, Vol. 13, Issue 2, (2008), 429-440
Auer, S., Lehman, J.: What have Innsbruck and Liepzig in common? Extracting Semantics from Wiki Content, The Semantic Web Research and Applications, Lecture Notes in Computer Science, Volume 4519, (2007) 503-517
Volkel, M., Krotzsch, M. Vrandecic, D. Haller, H, Studer, R.: Semantic Wikipedia, WWW ’06 Proceedings of the 15th International Conference on World Wide Web, (2006) 585-594.
Krotzsch, M. Vrandecic, D. Volkel, M.: Wikipedia and the Semantic Web, the Missing Links”, Proceedings of Wikimania 2005, Frankfurt, Germany, (2005)
Rayson, P, Garside, R.: Comparing Corpora using Frequency Profiling, in Procceedings of the Workshop on Comparing Corpora held in conjunction with the 38th annual meeting of the Association for Computational Linguistics, ACL 2000, Hong Kong, (2000).
Cosh, K., Burns, R., Daniel, T.: Content Clouds, Classifying Content in Web 2.0, Library Review, Vol. 57, Issue 9, (2008) 722-729.
Robert, P., Escoufier, Y.: A Unifying Tool for Linear Multivariate Statistical Methods: the RV-Coefficient, Journal of the Royal Statistical Society, Vol. 25, No. 3 (1976)
Cosh, K.: On Automatically Extracting Discoveries from User Generated Content, In Proceedings of CISIS 2014, the 8th International Conference on Complex, Intelligent, and Software Intensive Systems (2014)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cosh, K. (2015). Towards Automatically Retrieving Discoveries and Generating Ontologies. In: Kim, K. (eds) Information Science and Applications. Lecture Notes in Electrical Engineering, vol 339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-46578-3_72
Download citation
DOI: https://doi.org/10.1007/978-3-662-46578-3_72
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-46577-6
Online ISBN: 978-3-662-46578-3
eBook Packages: EngineeringEngineering (R0)