Conversion of Real Data from Production Process of Automotive Company for Process Mining Analysis

Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 74)


The aim of this paper is to convert the real data from the raw format from different information systems (log files) to the format, which is suitable for process mining analysis of a production process in a large automotive company. The conversion process will start with the import from several relational databases. The motivation is to use the DISCO tool for importing real pre-processed data and to conduct process mining analysis of a production process. DISCO generates process models from imported data in a comprehensive graphical form and provides different statistical features to analyse the process. This makes it possible to examine the production process in detail, identify bottlenecks, and streamline the process. The paper firstly presents a brief introduction of a manufacturing process in a company. Secondly, it provides a description of a conversion and pre-processing of chosen real data structures for the DISCO import. Then, it briefly describes the DISCO tool and proper format of pre-processed log file, which serves as desired input data. This data will be the main source for all consecutive operations in generated process map. Finally, it provides a sample analysis description with emphasis on one production process (process map and few statistics). To conclude, the results obtained show high demands on pre-processing of real data for suitable import format into DISCO tool and vital possibilities of process mining methods to optimize a production process in an automotive company.


Process mining Data cleaning Data cleaning tools DISCO 



This paper was supported by the project of Silesian University in Opava, Czech Republic SGS/19/2016 titled “Advanced mining methods and simulation techniques in business process domain.”


  1. 1.
    Aalst, W.V.D.: Process Mining: Data Science in Action, 2nd edn. Springer, New York (2016). ISBN 978-3-662-49850-7CrossRefGoogle Scholar
  2. 2.
    Gibert, K., Sanchez-Marre, M., Izquierdo, J.: A survey on pre-processing techniques: Relevant issues in the context of environmental data mining. AI Commun. 29(6), 627–663 (2016)MathSciNetCrossRefGoogle Scholar
  3. 3.
    Zakarija, I., Skopljanac-Macina, F., Blaskovic, B.: Discovering process model from incomplete log using process mining. In: Mustra, M., Tralic, D., Zovkocihlar, B. (eds.) Proceedings of 57th International Symposium ELMAR-2015, pp. 117–120. IEEE (2015)Google Scholar
  4. 4.
    Osborne, J.W.: Best Practices in Data Cleaning: A Complete Guide to Everything You Need to do Before and After Collecting Your Data. Sage, Los Angeles (2013)CrossRefGoogle Scholar
  5. 5.
    Suriadi, S., Andrews, R., ter Hofstede, A.H.M., Wynn, M.T.: Event log imperfection patterns for process mining: Towards a systematic approach to cleaning event logs. Inf. Syst. 64, 132–150 (2017)CrossRefGoogle Scholar
  6. 6.
    Abedjan, Z., Chu, X., Deng, D., Fernandez, R.C., Ilyas, I.F., Ouzzani, M., Papotti, P., Stonebraker, M., Tang, N.: Detecting data errors: Where are we and what needs to be done? Proc. VLDB Endow. 9(12), 993–1004 (2016)CrossRefGoogle Scholar
  7. 7.
    Gunther, C.W., Rozinat, A.: Disco: Discover your processes. In: Proceeding of BPM Demos, CEUR Workshop Proceedings, Vol. 940, pp. 40–44 (2012)Google Scholar
  8. 8.
    Rozinat, A.: Disco User’s Guide (2017).
  9. 9.
    Aalst, W.V.D.: Process Mining: Discovery, Conformance and Enhancement of Business Processes. Springer, New York (2011). ISBN 978-3-642-19344-6CrossRefzbMATHGoogle Scholar

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  1. 1.Department of Business Economics and Management, School of Business Administration in KarvináSilesian University in OpavaKarvináCzech Republic

Personalised recommendations