Skip to main content

Classification of Text Documents of an Electronic Archive Based on an Ontological Model

  • Conference paper
  • First Online:
Intelligent Data Engineering and Analytics

Abstract

A modern design organization has a significant electronic archive of documents in an unstructured form. Solving the problem of using the experience of previous projects to solve new problems can be based on the use of intelligent methods and algorithms for analyzing text documents of an organization in order to build a classification system for electronic archives. This work presents an ontological model of a text document as an electronic archive resource. The paper also presents an ontologically oriented classification algorithm for technical documents. In conclusion, the results of experiments confirming the effectiveness of models and algorithms in solving the problem of classifying a document archive are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Yong, W., Liming, L., Yongsheng, Q.: Improvement of big data retrieval algorithm in the intelligent archives management, pp. 487–491 (2015). https://doi.org/10.1109/icemi.2015.7494245

  2. Pandolfo, L., Pulina, L., Zielinski, M.: Towards an ontology for describing archival resources (2017)

    Google Scholar 

  3. Pandolfo, L., Pulina, L., Adorni, G.: A framework for automatic population of ontology-based digital libraries. In: AI * IA 2016 Advances in Artificial Intelligence, pp. 406–417. Springer (2016)

    Google Scholar 

  4. Kruk, S.R., McDaniel, B.: Semantic Digital Libraries. Springer (2009)

    Google Scholar 

  5. Yan, Z., Scharffe, F., Ding, Y.: Semantic search on cross-media cultural archives. In: Wegrzyn-Wolska, K.M., Szczepaniak, P.S. (eds.) Advances in Intelligent Web Mastering. Advances in Soft Computing, vol. 43, pp. 375–380. Springer, Berlin, Heidelberg (2007)

    Google Scholar 

  6. Zagorulko, Y.A.: Semantic approach to the analysis of documents based on the ontology of the subject area. Zagorulko, Y.A, Kononenko, I.S., Sidorova, E.A., Electronics Resource. Access mode: http://www.dialog-21.ru/digests/dialog2006/materials/html/SidorovaE.htm

  7. Gavrilova, T.A.: Knowledge Base of Intelligent Systems. St. Petersburg: Peter (2000)

    Google Scholar 

  8. Schneider, T., Hashemi, A., Bennett, M., Brady, M., Casanave, C., Graves, H., Grüninger, M., Guarino, N., Levenchuk, A., Lucier, E., Obrst, L., Ray, S., Sriram, R., Vizedom, A., West, M., Whetzel, T., Yim, P.: Ontology for big systems: the ontology summit 2012 communiqué. Appl. Ontol. 7, 357–371 (2012). https://doi.org/10.3233/AO-2012-0111

    Article  Google Scholar 

  9. Serrano-Guerrero, J., Olivas, J.A., de la Mata, J., Garces, P.: Physical and semantic relations to build ontologies for representing documents. In: En Liu, Y., Chen, G., Ying, M. (eds.) Fuzzy Logic, Soft Computing and Computational Intelligence (Eleventh International Fuzzy Systems Association World Congress IFSA), Beijing, China, vol. 1, pp. 503–508. Tsinghua University Press, Springer (2005)

    Google Scholar 

  10. Zagoruyko N.G.: Applied methods of data and knowledge analysis. Novosibirsk: IM SB RAS (1999)

    Google Scholar 

  11. Yarushkina, N. Moshkin, V., Filippov, A.: Development of a knowledge base based on context analysis of external information resources. In: DS-ITNT 2018. Proceedings of the International conference Information Technology and Nanotechnology, pp. 328–337. Samara, Russia (2018)

    Google Scholar 

  12. Namestnikov, A., Filippov, A., Avvakumova, V.: An ontology-based model of technical documentation fuzzy structuring. In: 2nd International Workshop on Soft Computing Applications and Knowledge Discovery. SCAKD (2016)

    Google Scholar 

  13. Filippov, A., Moshkin, V., Namestnikov, A., Guskov, G., Samokhvalov, M.: Approach to translation of RDF/OWL-ontology to the graphic knowledge base of intelligent systems. In: Proceedings of the II International Scientific and Practical Conference “Fuzzy Technologies in the Industry—FTI 2018”, pp. 44–49. Ulyanovsk (2018)

    Google Scholar 

  14. Radionova, Y.A.: A method for constructing an evaluation function that determines the effectiveness of automatic clustering algorithms. Autom. Control Process 15, 23–28 (2009)

    Google Scholar 

Download references

Acknowledgements

This paper has been approved within the framework of the federal target project “R&D for Priority Areas of the Russian Science-and-Technology Complex Development for 2014–2020”, government contract No 05.604.21.0252 on the subject “The development and research of models, methods and algorithms for classifying large semistructured data based on hybridization of semantic-ontological analysis and machine learning”.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vadim Moshkin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zarubin, A., Koval, A., Moshkin, V. (2021). Classification of Text Documents of an Electronic Archive Based on an Ontological Model. In: Satapathy, S., Zhang, YD., Bhateja, V., Majhi, R. (eds) Intelligent Data Engineering and Analytics. Advances in Intelligent Systems and Computing, vol 1177. Springer, Singapore. https://doi.org/10.1007/978-981-15-5679-1_56

Download citation

Publish with us

Policies and ethics