Skip to main content

Development of a Data Mining Subsystem for the Citeck Electronic Document Management System

  • Conference paper
  • First Online:
Proceedings of 2nd International Conference on Smart Computing and Cyber Security (SMARTCYBER 2021)


An urgent task in the implementation of electronic document management systems (EDMS) is an expansion of their functionality through personalization and taking into account the individual characteristics of the organization. The article deals with the development of a data mining subsystem using machine learning methods. As part of the study, the principles of formalizing the processes of processing incoming correspondence and organizational and administrative documents, methods of collecting and analyzing data on the work of users with various types of documents through the use of artificial neural networks, and a comprehensive assessment of improving the efficiency of the EDMS of an educational organization were studied. The scientific novelty of the research lies in the development of an algorithm and software for automating the collection and analysis of data through the use of neural networks in the EDMS. The main scientific results include formalized criteria for documents and stages of their development, the algorithm of the mining subsystem, and the developed software for the EDMS of the Lyceum №22 “Hope of Siberia”, Novosibirsk, Russia. The obtained results enabled to identify the types of documents and the stages of their development that are most demanding on the resources necessary for their implementation, which can later be used to find ways to optimally organize work on the preparation of documents of various types.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions


  1. Altuhova NF, Dzybenko AL, Loseva VV (2019) Electronic document management systems. Knorus, p 202

    Google Scholar 

  2. Istratova EE, Sin DD (2020) Choosing a data mining tool for scientific data processing. In: Challenges of the digital economy: development of a comfortable urban environment. Bryansk, pp 348–352

    Google Scholar 

  3. Istratova EE, Chernikov AA (2019) Application of electronic document management systems in the small business of the city of Novosibirsk. In: Actual problems of the world economy and management. Belarus, pp 256–257

    Google Scholar 

  4. Simonova SI (2015) Data mining for tasks CRM. Int J Open Inf Technol (electronic journal) 2. Accessed 1 Dec 2020 (in Russian)

  5. Efremova LI, Kolekina AO (2019) Choosing an electronic document management system for an enterprise. VUiT Bull (electronic journal) 1. Accessed 1 Dec 2020 (in Russian)

  6. Aprishko DV, Taran VN (2020) Electronic document management systems and their review/distance educational technologies. In: Materials of the IV all-Russian scientific and practical conference, pp 244–249. Accessed 2 Dec 2020 (in Russian)

  7. de Lange P, Nicolaescu P, Neumann AT (2020) Integrating web-based collaborative live editing and wireframing into a model-driven web engineering process. Data Sci (electronic journal) 5:240–260. Accessed 1 Dec 2020

  8. Medvedeva OV, Paramonova MG (2019) Digitalization of management and electronic document management systems. Uchenye zapiski of the Tambov branch of RosMU (electronic journal) 13. Accessed 1 Dec 2020 (in Russian)

  9. Patil T, Bhavsar AK (2021) Data science team roles and need of data science: a review of different cases: data science and intelligent applications. Lect Notes Data Eng Commun Technol (electronic journal) 52:13–22. Accessed 1 Dec 2020

  10. Bhagchandani A, Trivedi D (2021) A machine learning algorithm to predict financial investment: data science and intelligent applications. Lect Notes Data Eng Commun Technol (electronic journal) 52:261–266. Accessed 1 Dec 2020

  11. Paramonova IE (2016) Electronic document-management systems: a classification and new opportunities for a scientific technical library. Sci Tech Inf Proc (electronic journal) 43:136–143. Accessed 1 Dec 2020

  12. Ginsburg M (1999) An agent framework for intranet document management. Autonom Agents Multi-Agent Syst (electronic journal) 2:271–286. Accessed 1 Dec 2020

  13. Hillah LM, Maesano AP, De Rosa F (2017) Automation and intelligent scheduling of distributed system functional testing. Softw Tools Technol Transfer (electronic journal) 19:281–308. Accessed 2 Dec 2020

  14. Ahmad K, Sahu M, Shrivastava M (2020) An efficient image retrieval tool: query based image management system. Int J Inf Technol (electronic journal) 12:103–111. Accessed 2 Dec 2020

  15. Kleshchev AS, Chernyakhovskaya MY, Shalfeeva EA (2015) Features of the automation of intellectual activities. Autom Doc Math Linguist (electronic journal) 49:10–20. Accessed 1 Dec 2020

  16. Kolekar S, Sanjeevi S, Bormane DS (2010) The framework of an adaptive user interface for e-learning environment using artificial neural network. In: International conference on E-learning E-business, EEE 2010, Las Vegas, Nevada, USA, pp 65–69

    Google Scholar 

  17. Fernndez-Garca AJ (2019) A recommender system for component-based applications using machine learning techniques. Knowl-Based Syst 164:68–84

    Google Scholar 

  18. Okuda H, Ogata S, Matsuura S (2013) Experimental development based on mapping rule between requirements analysis model and web framework specific design model. SpringerPlus (electronic journal) 2:123–130. Accessed 6 Dec 2020

  19. Bobileva MP (2019) Management document flow: from paper to electronic. Questions of theory and practice. Termika, p 470

    Google Scholar 

  20. Obukhov A, Krasnyanskiy M, Nikolyukin M (2020) Algorithm of adaptation of electronic document management system based on machine learning technology. Prog Artif Intell (electronic journal) 9:287–303. Accessed 2 Dec 2020

  21. Ghaibi N, Dassi O, Ayed L (2018) User interface adaptation based on a business rules management system and machine learning. Enterprise Inf Syst 65–69

    Google Scholar 

  22. Kotzyba M, Gossen T, Stober S, Nurnberger A (2017) Model-based frameworks for user adapted information exploration: an overview. Compan Technol 37–56

    Google Scholar 

  23. Peng F, Lu X, Ma C, Qian Y, Lu J, Yang J (2018) Multi-level preference regression for cold-start recommendations. Int J Mach Learn Cybern 9(7):1117–1130

    Article  Google Scholar 

  24. Krivenko YuS, Minasyan AT, Razinkov AO (2018) Research of data mining technologies. In: Actual problems of management in the electronic economy. Kursk, pp 182–184

    Google Scholar 

  25. Elmabaredy A, Elkholy E, Tolba AA (2020) Web-based adaptive presentation techniques to enhance learning outcomes in higher education. RPTEL (electronic journal) 15:20–38. Accessed 1 Dec 2020

  26. Shokin YuI, Yurchenko AV (2019) On the models of organizing the storage and use of scientific data: basic principles, processes and mechanisms. Inf Manage Syst 3:45–54

    Google Scholar 

  27. Barsegian AA, Kupriyanov MS, Stepanenko II (2004) Data analysis methods and models: OLAP и DataMining, p 336

    Google Scholar 

  28. Istratova E, Sin D, Strokin K (2021) A comparative analysis of data mining analysis tools. In: Pattnaik PK, Sain M, Al-Absi AA, Kumar P (eds) Proceedings of international conference on smart computing and cyber security. SMARTCYBER 2020. Lecture notes in networks and systems, vol 149. Springer, Singapore.

Download references

Author information

Authors and Affiliations


Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Istratova, E., Sin, D. (2022). Development of a Data Mining Subsystem for the Citeck Electronic Document Management System. In: Pattnaik, P.K., Sain, M., Al-Absi, A.A. (eds) Proceedings of 2nd International Conference on Smart Computing and Cyber Security. SMARTCYBER 2021. Lecture Notes in Networks and Systems, vol 395. Springer, Singapore.

Download citation

Publish with us

Policies and ethics