Skip to main content

Semi-automatic Construction of Topic Ontologies

  • Conference paper
Semantics, Web and Mining (EWMF 2005, KDO 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4289))

Included in the following conference series:

Abstract

In this paper, we review two techniques for topic discovery in collections of text documents (Latent Semantic Indexing and K-Means clustering) and present how we integrated them into a system for semi-automatic topic ontology construction. The OntoGen system offers support to the user during the construction process by suggesting topics and analyzing them in real time. It suggests names for the topics in two alternative ways both based on extracting keywords from a set of documents inside the topic. The first set of descriptive keyword is extracted using document centroid vectors, while the second set of distinctive keyword is extracted from the SVM classification model dividing documents in the topic from the neighboring documents.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agirre, E., Ansa, O., Hovy, E., Martinez., D.: Enriching Very Large Ontologies Using the WWW. In: Proceedings of the Ontology Learning Workshop, The 14th European Conference on Artificial Inteligence (ECAI), Berlin, Germany (2000)

    Google Scholar 

  2. Bisson, G., Nedellec, C., Canamero, L.: Designing clustering methods for ontology building: The Mo’K workbench. In: Proceedings of the Ontology Learning Workshop, The 14th European Conference on Artificial Inteligence (ECAI), Berlin, Germany (2000)

    Google Scholar 

  3. Brank, J., Grobelnik, M., Milic-Frayling, N., Mladenic, D.: Feature selection using support vector machines. In: Proceedings of the 3rd International Conference on Data Mining Methods and Databases for Engineering, Finance, and Other Fields, Bologna, Italy (2002)

    Google Scholar 

  4. Cimiano, P., Pivk, A., Schmidt-Thieme, L., Staab, S.: Learning Taxonomic Relations from Heterogeneous Evidence. In: Proceedings of the Ontology Learning and Population Workshop, The 16th European Conference on Artificial Inteligence (ECAI), Valenci, Spain (2004)

    Google Scholar 

  5. Deerwester, S., Dumais, S., Furnas, G., Landuer, T., Harshman, R.: Indexing by Latent Semantic Analysis. Journal of the American Society of Information Science 41(6), 391–407 (1990)

    Article  Google Scholar 

  6. Douglas, B., Guha, L.R.V.: Building Large Knowledge-Based Systems. Addison Wesley, Reading (1990)

    Google Scholar 

  7. Lpez, M.F.: Overview of the methodologies for building ontologies. In: Proceedings of the Ontologies and Problem-Solving Methods Workshop, The 16th International Joint Conference on Artificial Inteligence (IJCAI), Stockholm, Sweden (1999)

    Google Scholar 

  8. Fortuna, B., Grobelnik, M., Mladenic, D.: Visualization of text document corpus. Informatica 29, 497–502 (2005)

    Google Scholar 

  9. Fortuna, B., Grobelnik, M., Mladenic, D.: Background Knowledge for Ontology Construction. In: Poster at 16th International World Wide Web Conference (WWW 2006), Edinburgh, Scotland (2006)

    Google Scholar 

  10. Grobelnik, M., Mladenic, D.: Efficient visualization of large text corpora. In: Proceedings of the 17th TELRI seminar, Dubrovnik, Croatia (2002)

    Google Scholar 

  11. Heyer, G., Läuter, M., Quasthoff, U., Wittig, T., Wolff, C.: Learning Relations using Collocations. In: Proceedings of Workshop on Ontology Learning, The 17th International Joint Conference on Artificial Inteligence (IJCAI), Seattle, USA (2001)

    Google Scholar 

  12. Jain, A.K., Murty, M.N., Flynn, P.J.: Data Clustering: A Review. ACM Computing Surveys 31(3), 264–323 (1999)

    Article  Google Scholar 

  13. Joachims, T.: Making large-scale svm learning practical. In: Scholkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods: Support Vector Machines. MIT Press, Cambridge (1998)

    Google Scholar 

  14. Leskovec, J., Grobelnik, M., Milic-Frayling, N.: Learning Semantic Graph Mapping for Document Summarization. In: Proceedings of Workshop on Knowledge Discovery and Ontologies, 15th European Conference on Machine Learning (ECML) and 8th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD), Pisa, Italy (2004)

    Google Scholar 

  15. Maedche, A., Staab, S.: Discovering conceptual relations from text. In: The 14th European Conference on Artificial Inteligence (ECAI), Berlin, Germany, pp. 321–325 (2000)

    Google Scholar 

  16. Reinberger, M.-L., Spyns, P.: Discovering Knowledge in Texts for the learning of DOGMA-inspired ontologies. In: Proceedings of the Ontology Learning and Population Workshop, The 16th European Conference on Artificial Inteligence (ECAI), Valenci, Spain (2004)

    Google Scholar 

  17. Salton, G.: Developments in Automatic Text Retrieval. Science 253, 974–979 (1991)

    Article  MathSciNet  Google Scholar 

  18. Steinbach, M., Karypis, G., Kumar, V.: A comparison of document clustering techniques. In: Proceedings of KDD Workshop on Text Mining, 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), Boston, USA (2000)

    Google Scholar 

  19. Uschold, M.: Towards a Methodology for Building Ontologies. In: Workshop on Basic Ontological Issues in Knowledge Sharing, The 14th International Joint Conference on Artificial Inteligence (IJCAI), Motnreal, Canada (1995)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fortuna, B., Mladenič, D., Grobelnik, M. (2006). Semi-automatic Construction of Topic Ontologies. In: Ackermann, M., et al. Semantics, Web and Mining. EWMF KDO 2005 2005. Lecture Notes in Computer Science(), vol 4289. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11908678_8

Download citation

  • DOI: https://doi.org/10.1007/11908678_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-47697-9

  • Online ISBN: 978-3-540-47698-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics