Abstract
Process mining technology has been widely used to optimize the processes of various organizations, especially in enterprises. It facilitates cooperation between departments and prompts efficient process design and resource scheduling in the production workshop. However, as a data-driven approach, the lack of production logs hinders the development of enterprise process mining research. Therefore, we introduce a new benchmark dataset named TV-ALP to provide effective data support for the combination of process mining and workshop production. The dataset comes from the field survey experience and product operation instructions, which highly restores the operation of the production workshop and details the processing of television (TV) sets in the assembly line. We compare TV-ALP with other public datasets for detailed statistical analysis and provide benchmark performance for the remaining time prediction task. The experimental results show that TV-ALP can meet the requirements of process mining and analysis research in terms of both scale and quality. In addition, while maintaining the data commonality of public datasets, it also emphasizes the importance of role information and supports a series of role-based log studies. The complete dataset is accessible for download at https://github.com/Zzou-Sdust/TV-ALP-dataset.
Similar content being viewed by others
Data availability and access statement
The datasets generated during and/or analysed during the current study are available in the [TV-ALP-dataset] repository, [https://github.com/Zzou-Sdust/TV-ALP-dataset].
References
Van Der Aalst W (2012) Process mining: Overview and opportunities. ACM Trans Manag Inf Syst (TMIS) 3(2):1–17
Dongen B (2012) BPI Challenge 2012.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:3926db30-f712-4394-aebc-75976070e91f
Dongen B (2015) BPI Challenge 2015.4TU.ResearchData.4TU.ResearchData .https://doi.org/10.4121/uuid:31a308ef-c844-48da-948c-305d167a0ec1
Mannhardt F (2017) Hospital Billing - Event Log.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:76c46b83-c930-4798-a1c9-4be94dfeb741
Polato M (2017) misc belonging to the help desk log of an Italian Company.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:0c60edf1-6f83-4e75-9367-4c63b3e9d5bb
Dongen B (2017) BPI Challenge 2017.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:5f3067df-f10b-45da-b98b-86ae4c7a310b
Levy D (2014) Production analysis with process mining technology.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:68726926-5ac5-4fab-b873-ee76ea412399
Huang S, Liu Y, Fung C, He R, Zhao Y, Yang H, Luan Z (2020) Hitanomaly: Hierarchical transformers for anomaly detection in system log. IEEE Trans Netw Serv Manag 17(4):2064–2076
Li N, Zhao X (2023) A multi-modal dataset for gait recognition under occlusion. Appl Intell 53(2):1517–1534
Dongen B (2011) Real-life event logs - Hospital log.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:d9769f3d-0ab0-4fb8-803b-0d1120ffcf54
Dongen B (2019) BPI Challenge 2019.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:d06aff4b-79f0-45e6-8ec8-e19730c248f1
Augusto A, Conforti R, Dumas M, La Rosa M, Maggi FM, Marrella A, Mecella M, Soo A (2018) Automated discovery of process models from event logs: Review and benchmark. IEEE Trans Knowl Data Eng 31(4):686–705
Chapela-Campa D, Dumas M, Mucientes M, Lama M (2022) Efficient edge filtering of directly-follows graphs for process mining. Inf Sci 610:830–846
Bag S, Wood LC, Mangla SK, Luthra S (2020) Procurement 4.0 and its implications on business process performance in a circular economy. Resour Conserv Recycl 152:104502
Groß S, Yeshchenko A, Djurica D, Mendling J (2020) Process mining supported process redesign: Matching problems with solutions. In: Estefanía Serral Asensio, Janis Stirna (ed.) process mining supported process redesign: Matching Problems with Solutions, pp 24–33
Brunk J, Stottmeister J, Weinzierl S, Matzner M, Becker J (2020) Exploring the effect of context information on deep learning business process predictions. J Decis Syst 29(sup1):328–343
Teinemaa I, Dumas M, Rosa ML, Maggi M (2019) Outcome-oriented predictive process monitoring: Review and benchmark. ACM Trans Knowl Discov Data (TKDD) 13(2):1–57
Kim J, Comuzzi M, Dumas M, Maggi FM, Teinemaa I (2022) Encoding resource experience for predictive process monitoring. Decis Support Syst 153:113669
Sindhgatta R, Moreira C, Ouyang C, Barros A (2020) Exploring interpretable predictive models for business processes. In: International conference on business process management, pp. 257–272. Springer
Lee S, Comuzzi M, Kwon N (2022) Exploring the suitability of rule-based classification to provide interpretability in outcome-based process predictive monitoring. Algorithms 15(6):187
Chiorrini A, Diamantini C, Mircoli A, Potena D (2022) Exploiting instance graphs and graph neural networks for next activity prediction. In: International conference on process mining, pp 115–126. Springer
Heinrich K, Zschech P, Janiesch C, Bonin M (2021) Process data properties matter: Introducing gated convolutional neural networks (gcnn) and key-value-predict attention networks (kvp) for next event prediction with deep learning. Decis Support Syst 143:113494
Verenich I, Dumas M, Rosa ML, Maggi FM, Teinemaa I (2019) Survey and cross-benchmark comparison of remaining time prediction methods in business process monitoring. ACM Trans Intell Syst Technol (TIST) 10(4):1–34
Ni W, Yan M, Liu T, Zeng Q (2022) Predicting remaining execution time of business process instances via auto-encoded transition system. Intell Data Anal 26(2):543–562
Chen H, Fang X, Fang H (2022) Multi-task prediction method of business process based on bert and transfer learning. Knowl-Based Syst 254:109603
Guo P, Tang CS, Wang Y, Zhao M (2019) The impact of reimbursement policy on social welfare, revisit rate, and waiting time in a public healthcare system: Fee-for-service versus bundled payment. Manuf Serv Oper Manag 21(1):154–170
Marin-Castro HM, Tello-Leal E (2021) An end-to-end approach and tool for bpmn process discovery. Expert Syst Appl 174:114662
Wang J,Yu D, Liu C, Sun X (2019) Outcome-oriented predictive process monitoring with attention-based bidirectional lstm neural networks. In: 2019 IEEE international conference on web services (ICWS), pp 360–367. IEEE
Sadeghianasl S, Hofstede AH, Wynn MT, Suriadi S (2019) A contextual approach to detecting synonymous and polluted activity labels in process event logs. In: OTM confederated international conferences" on the move to meaningful internet systems", pp 76–94. Springer
Liu J, Xu J, Zhang R, Reiff-Marganiec S (2021) A repairing missing activities approach with succession relation for event logs. Knowl Inf Syst 63(2):477–495
Marcus D, van Dongen B (2016) BPI Challenge 2016.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:360795c8-1dd6-4a5b-a443-185001076eab
Chen W (2020) Intelligent manufacturing production line data monitoring system for industrial internet of things. Comput Commun 151:31–41
Lai Z-H, Tao W, Leu MC, Yin Z (2020) Smart augmented reality instructional system for mechanical assembly towards worker-centered intelligent manufacturing. J Manuf Syst 55:69–81
Glock CH, Grosse EH, Neumann WP, Feldman A (2021) Assistive devices for manual materials handling in warehouses: a systematic literature review. Int J Prod Res 59(11):3446–3469
Besharati Moghaddam, F.,Lopez, A.J.,Van Gheluwe, C.,De Vuyst, S.,Gautama, S.:Data-driven operator functional state classification in smart manufacturing. Appl Intell, pp 1–13(2023)
Lin Y-C, Yeh C-C, Chen W-H, Hsu K-Y (2020) Implementation criteria for intelligent systems in motor production line process management. Processes 8(5):537
Murata T (1989) Petri nets: Properties, analysis and applications. Proceedings of the IEEE 77(4):541–580
Miqueo A, Torralba M, Yagüe-Fabra JA (2022) Models to evaluate the performance of high-mix low-volume manual or semi-automatic assembly lines. Procedia CIRP 107:1461–1466
Gualtieri L, Palomba I, Merati FA, Rauch E, Vidoni R (2020) Design of human-centered collaborative assembly workstations for the improvement of operators? physical ergonomics and production efficiency: A case study. Sustainability 12(9):3606
Arlinghaus A, Bohle P, Iskra-Golec I, Jansen N, Jay S, Rotenberg L (2019) Working time society consensus statements: Evidence-based effects of shift work and non-standard working hours on workers, family and community. Ind Health 57(2):184–200
Choueiri AC, Sato DMV, Scalabrin EE, Santos EAP (2020) An extended model for remaining time prediction in manufacturing systems using process mining. J Manuf Syst 56:188–201
Leitão, J.,Pereira, D.,Gonçalves, Â.:Quality of work life and organizational performance: Workers? feelings of contributing, or not, to the organization?s productivity. Int J Environ Res Public Health 16(20):3803(2019)
Aalst W, Weijters T, Maruster L (2004) Workflow mining: Discovering process models from event logs. IEEE Trans Knowl Data Eng 16(9):1128–1142
Verbeek H, Buijs JC,Van Dongen BF, Van Der Aalst WM (2011) Xes, xesame, and prom 6. In: Information systems evolution: CAiSE Forum 2010, Hammamet, Tunisia, June 7-9, 2010, Selected Extended Papers 22, pp 60–75. Springer
Burattin A (2016) Plg2: Multiperspective process randomization with online and offline simulations. In: BPM (Demos), pp 1–6. Citeseer
Fousek, J.,Kuncova, M.,Fábry, J.,Zoltay, Z (2017) Discrete event simulation-production model in simul8. In: ECMS, pp 229–234
Shugurov IS, Mitsyuk AA (2014) Generation of a Set of Event Logs with Noise. In: Proceedings of the 8th Spring/Summer young researchers colloquium on software engineering (SYRCoSE 2014), pp 88–95
Dongen B, Borchert F (2018) BPI Challenge 2018.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:3301445f-95e8-4ff0-98a4-901f1f204972
Adams JN, van Zelst SJ, Rose T, van der Aalst WMP (2023) Explainable concept drift in process mining. Inf Syst 114:102177. https://doi.org/10.1016/j.is.2023.102177
Rinderle-Ma S, Winter K, Benzin J-V (2023) Predictive compliance monitoring in process-aware information systems: State of the art, functionalities, research directions. Inf Syst 115:102210. https://doi.org/10.1016/j.is.2023.102210
Ni W, Sun Y et al (2020) Business process remaining time prediction using bidirectional recurrent neural networks with attention. Comput Integr Manuf Syst 26(6):1564–1572
Bukhsh ZA, Saeed A, Dijkman RM (2021) Processtransformer: Predictive business process monitoring with transformernetwork. arXiv:2104.00721
Xu X, Liu C, Li T, Guo N, Ren C, Zeng Q (2022) Business process remaining time prediction: An approach based on bidirectional quasi recurrent neural network with attention. ACTA ELECTONICA SINICA 50(8):1975
Xu X, Zhang S, Li T, Guo N, Dong L, Liu C, Ren C (2022) Business process remaining time prediction method based on trajectory clustering. Comput Eng 48(11):247–256
Guo N, Liu C, Li C, Liu t, Wen L, Zeng Q (2023) Explainable feature-based hierarchical approach to predict remaining processtime.Journal of Software, pp 1–16
Acknowledgements
This work is supported by National Key R &D Program of China [2022ZD0119501]; NSFC [52374221]; Sci. & Tech. Development Fund of Shandong Province of China [ZR2022MF288, ZR2023MF097]; the Taishan Scholar Program of Shandong Province [ts20190936].
Author information
Authors and Affiliations
Contributions
Minghao Zou: Conceptualization, Methodology, Writing-original draft, Investigation. Qingtian Zeng: Conceptualization, Methodology, Writing-Reviewing, Investigation. Hua Duan: Methodology, Writing-Reviewing. Weijian Ni: Writing-Reviewing, Investigation. Shuang Chen: Data curation.
Corresponding authors
Ethics declarations
Competing of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Ethical and informed consent for data used statement
All authors support the journal’s data ethics and support the informed consent policy.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zou, M., Zeng, Q., Duan, H. et al. TV-ALP: A log dataset of television assembly line production under multi-person collaboration for process mining research. Appl Intell 54, 3990–4011 (2024). https://doi.org/10.1007/s10489-024-05347-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-024-05347-8