Skip to main content

Ontology-Based Data Mining Workflow Construction

  • Conference paper
  • First Online:
Computational Science and Its Applications – ICCSA 2021 (ICCSA 2021)

Abstract

Currently, Data mining is applied in various domains. Many data science researchers are confused on which algorithms are suitable for the context. Hundreds of the operators/algorithms are combined within complex data mining workflows. A data mining assistant can significantly improve the efficiency of workflow construction. In our previous work, we constructed data mining ontologies based on “semantic meta mining” to support the selection of algorithms and operators for solving data mining tasks. But the use of such ontologies is still unfriendly. Strict query syntax still plagues many users. This paper proposes an interactive interface based on the reasoning mechanism to help users generate queries and build suitable data mining workflows.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hilario, M., et al.: Ontology-based meta-mining of knowledge discovery workflows. In: Meta-Learning in Computational Intelligence, pp. 273–315. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20980-2_9

  2. Panov, P., Džeroski, S., Soldatova, L.: OntoDM: an ontology of data mining. In: 2008 IEEE International Conference on Data Mining Workshops. IEEE (2008)

    Google Scholar 

  3. Panov, P., Soldatova, L.N., Džeroski, S.: Generic ontology of datatypes. Inf. Sci. 329, 900–920 (2016)

    Article  Google Scholar 

  4. Keet, C.M., et al.: The data mining optimization ontology. J. Web Semant. 32, 43–53 (2015)

    Google Scholar 

  5. Žáková, M., et al.: Automating knowledge discovery workflow composition through ontology-based planning. IEEE Trans. Autom. Sci. Eng. 8(2), 253–264 (2010)

    Google Scholar 

  6. Benali, K., Rahal, S.A.: OntoDTA: ontology-guided decision tree assistance. J. Inf. Knowl. Manag. 16(03), 1750031 (2017)

    Google Scholar 

  7. Diamantini, C., Potena, D., Storti, E.: Kddonto: an ontology for discovery and composition of KDD algorithms. In: Third Generation Data Mining: Towards Service-Oriented Knowledge Discovery (SoKD’09), pp. 13–24 (2009)

    Google Scholar 

  8. Tianxing, M., et al.: A meta-mining ontology framework for data processing. Int. J. Embedded Real-Time Commun. Syst. (IJERTCS) 12(2), 37–56 (2021)

    Google Scholar 

  9. Pan, J.Z., Thomas, E., Zhao, Y.: Completeness guaranteed approximations for OWL-DL query answering. Description Logics 477 (2009)

    Google Scholar 

  10. Proctor, M.: Drools: a rule engine for complex event processing. In: Schürr, A., Varró, D., Varró, G. (eds.) AGTIVE 2011. LNCS, vol. 7233, pp. 2–2. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-34176-2_2

    Chapter  Google Scholar 

  11. Wirth, R., Hipp, J.: CRISP-DM: towards a standard process model for data mining. In: Proceedings of the 4th International Conference on the Practical Applications of Knowledge Discovery and Data Mining, vol. 1. Springer, London (2000)

    Google Scholar 

  12. Brachman, R.J., Anand, T.: The process of knowledge discovery in databases: a first sketch. In: KDD Workshop, vol. 3 (1994)

    Google Scholar 

  13. Shafique, U., Qaiser, H.: A comparative study of data mining process models (KDD, CRISP-DM and SEMMA). Int. J. Innov. Sci. Res. 12(1), 217–222 (2014)

    Google Scholar 

  14. Tianxing, M., Stankova, E., Vodyaho, A., Zhukova, N., Shichkina, Y.: Domain-Oriented Multilevel Ontology for Adaptive Data Processing. In: Gervasi, O., et al. (eds.) ICCSA 2020. LNCS, vol. 12249, pp. 634–649. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58799-4_46

    Chapter  Google Scholar 

  15. Horridge, M., et al.: The Manchester OWL Syntax. OWLed, vol. 216 (2006)

    Google Scholar 

  16. Doukas, C., Chatziioannou, A., Maglogiannis, I.: Intelligent planning of biomedical image mining workflows. In: Proceedings of the 10th IEEE International Conference on Information Technology and Applications in Biomedicine. IEEE (2010)

    Google Scholar 

  17. Tianxing, M., et al.: A hierarchical data mining process ontology. In: 2021 28th Conference of Open Innovations Association (FRUCT). IEEE (2021)

    Google Scholar 

  18. Panov, P., Soldatova, L., Džeroski, S.: OntoDM-KDD: ontology for representing the knowledge discovery process. In: Fürnkranz, J., Hüllermeier, E., Higuchi, T. (eds.) DS 2013. LNCS (LNAI), vol. 8140, pp. 126–140. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-40897-7_9

    Chapter  Google Scholar 

  19. Noy, N.F., et al.: Protégé-2000: an open-source ontology-development and knowledge-acquisition environment. In: AMIA... Annual Symposium proceedings. AMIA Symposium, vol. 2003. American Medical Informatics Association (2003)

    Google Scholar 

  20. Liu, D., Gu, T., Xue, J.-P.: Rule engine based on improvement rete algorithm. In: The 2010 International Conference on Apperceiving Computing and Intelligence Analysis Proceeding. IEEE (2010)

    Google Scholar 

  21. Yang, P., et al.: An intelligent tumors coding method based on drools. J. New Media 2(3), 111 (2020)

    Google Scholar 

  22. Huang, G.B., et al.: Labeled faces in the wild: a database for studying face recognition in unconstrained environments. In: Workshop on Faces in'Real-Life'Images: Detection, Alignment, and Recognition (2008)

    Google Scholar 

  23. Information Artifact Ontology (IAO) web page. http://www.obofoundry.org/ontology/iao.html

  24. Glimm, B., et al.: HermiT: an OWL 2 reasoner. J. Autom. Reasoning 53(3), 245–269 (2014)

    Google Scholar 

  25. DL Query tab. https://protegewiki.stanford.edu/wiki/DLQueryTab

Download references

Acknowledgments

«The paper was prepared in Saint- Petersburg Electrotechnical University (LETI), and is supported by the Agreement № 075–11-2019–053 dated 20.11.2019 (Ministry of Science and Higher Education of the Russian Federation, in accordance with the Decree of the Government of the Russian Federation of April 9, 2010 No. 218), project «Creation of a domestic high-tech production of vehicle security systems based on a control mechanism and intelligent sensors, including millimeter radars in the 76–77 GHz range»

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tianxing, M., Lebedev, S., Vodyaho, A., Zhukova, N., Shichkina, Y.A. (2021). Ontology-Based Data Mining Workflow Construction. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2021. ICCSA 2021. Lecture Notes in Computer Science(), vol 12956. Springer, Cham. https://doi.org/10.1007/978-3-030-87010-2_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-87010-2_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-87009-6

  • Online ISBN: 978-3-030-87010-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics