Studying Evolution of a Branch of Knowledge by Constructing and Analyzing Its Ontology

  • Pavel Makagonov
  • Alejandro Ruiz Figueroa
  • Alexander Gelbukh
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3999)


We propose a method for semi-automatic construction of an ontology of a given branch of science for measuring its evolution in time. The method relies on a collection of documents in the given thematic domain. We observe that the words of different levels of abstraction are located within different parts of a document: say, the title or abstract contains more general words than the body of the paper. What is more, the hierarchical structure of the documents allows us to determine the parent-child relation between words: e.g., a word that appears in the title of a paper is a candidate for a parent of the words appearing in the body of this paper; if such a relation is repeated several times, we register such a parent-child pair in our ontology. Using the papers corresponding to different years, we construct such an ontology for each year independently. Comparing such ontologies (using tree edit distance measure) for different years reveals the trends of evolution of the given branch of science.


Thematic Domain Paper Title Grey Rectangle Ontology Construction Vector Space Representation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Kuhn, T.S.: The Structure of Scientific Revolutions, 2nd edn. University of Chicago Press (1970)Google Scholar
  2. 2.
    References Book of Proceedings of Free Economic Society of Russia. (1983–2000), Vol.4. p. 756, Moscow, Russia (2000)Google Scholar
  3. 3.
    Makagonov, P., Alexandrov, P.M., Sboychakov, K.: A toolkit for development of the domain-oriented dictionaries for structuring document flows. In: Kiers, H.A., et al. (eds.) Data Analysis, Classification, and Related Methods, Studies in classification, data analysis, and knowledge organization, pp. 83–88. Springer, Heidelberg (2000)CrossRefGoogle Scholar
  4. 4.
    IRBIS Automated Library System, Russian National Public Library for Science and Technology,
  5. 5.
    Gelbukh, A., Alexandrov, M., Han, S.-Y.: Detecting Inflection Patterns in Natural Language by Minimization of Morphological Model. In: Sanfeliu, A., Martínez Trinidad, J.F., Carrasco Ochoa, J.A. (eds.) CIARP 2004. LNCS, vol. 3287, pp. 432–438. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  6. 6.
    Wagner III, J.A., Gooding, R.Z.: Effects of Societal Trends on Participation Research. Administrative Science Quarterly 32, 241–262 (1987)CrossRefGoogle Scholar
  7. 7.
    Makagonov, P., Ruiz Figueroa, A., Sboychakov, K., Gelbukh, A.: Learning a Domain Ontology from Hierarchically Structured Texts. In: Proc. of Workshop Learning and Extending Lexical Ontologies by using Machine Learning Methods at 22nd International Conference on Machine Learning, ICML 2005, Bonn, Germany (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Pavel Makagonov
    • 1
  • Alejandro Ruiz Figueroa
    • 1
  • Alexander Gelbukh
    • 2
  1. 1.Mixteca University of TechnologyOaxacaMéxico
  2. 2.Center for Computing ResearchNational Polytechnic InstituteMexico DFMéxico

Personalised recommendations