Skip to main content
Log in

TV-ALP: A log dataset of television assembly line production under multi-person collaboration for process mining research

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Process mining technology has been widely used to optimize the processes of various organizations, especially in enterprises. It facilitates cooperation between departments and prompts efficient process design and resource scheduling in the production workshop. However, as a data-driven approach, the lack of production logs hinders the development of enterprise process mining research. Therefore, we introduce a new benchmark dataset named TV-ALP to provide effective data support for the combination of process mining and workshop production. The dataset comes from the field survey experience and product operation instructions, which highly restores the operation of the production workshop and details the processing of television (TV) sets in the assembly line. We compare TV-ALP with other public datasets for detailed statistical analysis and provide benchmark performance for the remaining time prediction task. The experimental results show that TV-ALP can meet the requirements of process mining and analysis research in terms of both scale and quality. In addition, while maintaining the data commonality of public datasets, it also emphasizes the importance of role information and supports a series of role-based log studies. The complete dataset is accessible for download at https://github.com/Zzou-Sdust/TV-ALP-dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Algorithm 1
Fig. 7
Algorithm 2
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Data availability and access statement

The datasets generated during and/or analysed during the current study are available in the [TV-ALP-dataset] repository, [https://github.com/Zzou-Sdust/TV-ALP-dataset].

References

  1. Van Der Aalst W (2012) Process mining: Overview and opportunities. ACM Trans Manag Inf Syst (TMIS) 3(2):1–17

    Article  Google Scholar 

  2. Dongen B (2012) BPI Challenge 2012.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:3926db30-f712-4394-aebc-75976070e91f

  3. Dongen B (2015) BPI Challenge 2015.4TU.ResearchData.4TU.ResearchData .https://doi.org/10.4121/uuid:31a308ef-c844-48da-948c-305d167a0ec1

  4. Mannhardt F (2017) Hospital Billing - Event Log.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:76c46b83-c930-4798-a1c9-4be94dfeb741

  5. Polato M (2017) misc belonging to the help desk log of an Italian Company.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:0c60edf1-6f83-4e75-9367-4c63b3e9d5bb

  6. Dongen B (2017) BPI Challenge 2017.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:5f3067df-f10b-45da-b98b-86ae4c7a310b

  7. Levy D (2014) Production analysis with process mining technology.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:68726926-5ac5-4fab-b873-ee76ea412399

  8. Huang S, Liu Y, Fung C, He R, Zhao Y, Yang H, Luan Z (2020) Hitanomaly: Hierarchical transformers for anomaly detection in system log. IEEE Trans Netw Serv Manag 17(4):2064–2076

    Article  Google Scholar 

  9. Li N, Zhao X (2023) A multi-modal dataset for gait recognition under occlusion. Appl Intell 53(2):1517–1534

    Article  Google Scholar 

  10. Dongen B (2011) Real-life event logs - Hospital log.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:d9769f3d-0ab0-4fb8-803b-0d1120ffcf54

  11. Dongen B (2019) BPI Challenge 2019.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:d06aff4b-79f0-45e6-8ec8-e19730c248f1

  12. Augusto A, Conforti R, Dumas M, La Rosa M, Maggi FM, Marrella A, Mecella M, Soo A (2018) Automated discovery of process models from event logs: Review and benchmark. IEEE Trans Knowl Data Eng 31(4):686–705

    Article  Google Scholar 

  13. Chapela-Campa D, Dumas M, Mucientes M, Lama M (2022) Efficient edge filtering of directly-follows graphs for process mining. Inf Sci 610:830–846

    Article  Google Scholar 

  14. Bag S, Wood LC, Mangla SK, Luthra S (2020) Procurement 4.0 and its implications on business process performance in a circular economy. Resour Conserv Recycl 152:104502

  15. Groß S, Yeshchenko A, Djurica D, Mendling J (2020) Process mining supported process redesign: Matching problems with solutions. In: Estefanía Serral Asensio, Janis Stirna (ed.) process mining supported process redesign: Matching Problems with Solutions, pp 24–33

  16. Brunk J, Stottmeister J, Weinzierl S, Matzner M, Becker J (2020) Exploring the effect of context information on deep learning business process predictions. J Decis Syst 29(sup1):328–343

    Article  Google Scholar 

  17. Teinemaa I, Dumas M, Rosa ML, Maggi M (2019) Outcome-oriented predictive process monitoring: Review and benchmark. ACM Trans Knowl Discov Data (TKDD) 13(2):1–57

    Article  Google Scholar 

  18. Kim J, Comuzzi M, Dumas M, Maggi FM, Teinemaa I (2022) Encoding resource experience for predictive process monitoring. Decis Support Syst 153:113669

    Article  Google Scholar 

  19. Sindhgatta R, Moreira C, Ouyang C, Barros A (2020) Exploring interpretable predictive models for business processes. In: International conference on business process management, pp. 257–272. Springer

  20. Lee S, Comuzzi M, Kwon N (2022) Exploring the suitability of rule-based classification to provide interpretability in outcome-based process predictive monitoring. Algorithms 15(6):187

  21. Chiorrini A, Diamantini C, Mircoli A, Potena D (2022) Exploiting instance graphs and graph neural networks for next activity prediction. In: International conference on process mining, pp 115–126. Springer

  22. Heinrich K, Zschech P, Janiesch C, Bonin M (2021) Process data properties matter: Introducing gated convolutional neural networks (gcnn) and key-value-predict attention networks (kvp) for next event prediction with deep learning. Decis Support Syst 143:113494

    Article  Google Scholar 

  23. Verenich I, Dumas M, Rosa ML, Maggi FM, Teinemaa I (2019) Survey and cross-benchmark comparison of remaining time prediction methods in business process monitoring. ACM Trans Intell Syst Technol (TIST) 10(4):1–34

    Article  Google Scholar 

  24. Ni W, Yan M, Liu T, Zeng Q (2022) Predicting remaining execution time of business process instances via auto-encoded transition system. Intell Data Anal 26(2):543–562

    Article  Google Scholar 

  25. Chen H, Fang X, Fang H (2022) Multi-task prediction method of business process based on bert and transfer learning. Knowl-Based Syst 254:109603

    Article  Google Scholar 

  26. Guo P, Tang CS, Wang Y, Zhao M (2019) The impact of reimbursement policy on social welfare, revisit rate, and waiting time in a public healthcare system: Fee-for-service versus bundled payment. Manuf Serv Oper Manag 21(1):154–170

    Article  Google Scholar 

  27. Marin-Castro HM, Tello-Leal E (2021) An end-to-end approach and tool for bpmn process discovery. Expert Syst Appl 174:114662

    Article  Google Scholar 

  28. Wang J,Yu D, Liu C, Sun X (2019) Outcome-oriented predictive process monitoring with attention-based bidirectional lstm neural networks. In: 2019 IEEE international conference on web services (ICWS), pp 360–367. IEEE

  29. Sadeghianasl S, Hofstede AH, Wynn MT, Suriadi S (2019) A contextual approach to detecting synonymous and polluted activity labels in process event logs. In: OTM confederated international conferences" on the move to meaningful internet systems", pp 76–94. Springer

  30. Liu J, Xu J, Zhang R, Reiff-Marganiec S (2021) A repairing missing activities approach with succession relation for event logs. Knowl Inf Syst 63(2):477–495

    Article  Google Scholar 

  31. Marcus D, van Dongen B (2016) BPI Challenge 2016.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:360795c8-1dd6-4a5b-a443-185001076eab

  32. Chen W (2020) Intelligent manufacturing production line data monitoring system for industrial internet of things. Comput Commun 151:31–41

    Article  Google Scholar 

  33. Lai Z-H, Tao W, Leu MC, Yin Z (2020) Smart augmented reality instructional system for mechanical assembly towards worker-centered intelligent manufacturing. J Manuf Syst 55:69–81

    Article  Google Scholar 

  34. Glock CH, Grosse EH, Neumann WP, Feldman A (2021) Assistive devices for manual materials handling in warehouses: a systematic literature review. Int J Prod Res 59(11):3446–3469

    Article  Google Scholar 

  35. Besharati Moghaddam, F.,Lopez, A.J.,Van Gheluwe, C.,De Vuyst, S.,Gautama, S.:Data-driven operator functional state classification in smart manufacturing. Appl Intell, pp 1–13(2023)

  36. Lin Y-C, Yeh C-C, Chen W-H, Hsu K-Y (2020) Implementation criteria for intelligent systems in motor production line process management. Processes 8(5):537

    Article  Google Scholar 

  37. Murata T (1989) Petri nets: Properties, analysis and applications. Proceedings of the IEEE 77(4):541–580

    Article  Google Scholar 

  38. Miqueo A, Torralba M, Yagüe-Fabra JA (2022) Models to evaluate the performance of high-mix low-volume manual or semi-automatic assembly lines. Procedia CIRP 107:1461–1466

    Article  Google Scholar 

  39. Gualtieri L, Palomba I, Merati FA, Rauch E, Vidoni R (2020) Design of human-centered collaborative assembly workstations for the improvement of operators? physical ergonomics and production efficiency: A case study. Sustainability 12(9):3606

    Article  Google Scholar 

  40. Arlinghaus A, Bohle P, Iskra-Golec I, Jansen N, Jay S, Rotenberg L (2019) Working time society consensus statements: Evidence-based effects of shift work and non-standard working hours on workers, family and community. Ind Health 57(2):184–200

    Article  Google Scholar 

  41. Choueiri AC, Sato DMV, Scalabrin EE, Santos EAP (2020) An extended model for remaining time prediction in manufacturing systems using process mining. J Manuf Syst 56:188–201

    Article  Google Scholar 

  42. Leitão, J.,Pereira, D.,Gonçalves, Â.:Quality of work life and organizational performance: Workers? feelings of contributing, or not, to the organization?s productivity. Int J Environ Res Public Health 16(20):3803(2019)

  43. Aalst W, Weijters T, Maruster L (2004) Workflow mining: Discovering process models from event logs. IEEE Trans Knowl Data Eng 16(9):1128–1142

    Article  Google Scholar 

  44. Verbeek H, Buijs JC,Van Dongen BF, Van Der Aalst WM (2011) Xes, xesame, and prom 6. In: Information systems evolution: CAiSE Forum 2010, Hammamet, Tunisia, June 7-9, 2010, Selected Extended Papers 22, pp 60–75. Springer

  45. Burattin A (2016) Plg2: Multiperspective process randomization with online and offline simulations. In: BPM (Demos), pp 1–6. Citeseer

  46. Fousek, J.,Kuncova, M.,Fábry, J.,Zoltay, Z (2017) Discrete event simulation-production model in simul8. In: ECMS, pp 229–234

  47. Shugurov IS, Mitsyuk AA (2014) Generation of a Set of Event Logs with Noise. In: Proceedings of the 8th Spring/Summer young researchers colloquium on software engineering (SYRCoSE 2014), pp 88–95

  48. Dongen B, Borchert F (2018) BPI Challenge 2018.4TU.ResearchData.4TU.ResearchData. https://doi.org/10.4121/uuid:3301445f-95e8-4ff0-98a4-901f1f204972

  49. Adams JN, van Zelst SJ, Rose T, van der Aalst WMP (2023) Explainable concept drift in process mining. Inf Syst 114:102177. https://doi.org/10.1016/j.is.2023.102177

    Article  Google Scholar 

  50. Rinderle-Ma S, Winter K, Benzin J-V (2023) Predictive compliance monitoring in process-aware information systems: State of the art, functionalities, research directions. Inf Syst 115:102210. https://doi.org/10.1016/j.is.2023.102210

    Article  Google Scholar 

  51. Ni W, Sun Y et al (2020) Business process remaining time prediction using bidirectional recurrent neural networks with attention. Comput Integr Manuf Syst 26(6):1564–1572

  52. Bukhsh ZA, Saeed A, Dijkman RM (2021) Processtransformer: Predictive business process monitoring with transformernetwork. arXiv:2104.00721

  53. Xu X, Liu C, Li T, Guo N, Ren C, Zeng Q (2022) Business process remaining time prediction: An approach based on bidirectional quasi recurrent neural network with attention. ACTA ELECTONICA SINICA 50(8):1975

  54. Xu X, Zhang S, Li T, Guo N, Dong L, Liu C, Ren C (2022) Business process remaining time prediction method based on trajectory clustering. Comput Eng 48(11):247–256

    Google Scholar 

  55. Guo N, Liu C, Li C, Liu t, Wen L, Zeng Q (2023) Explainable feature-based hierarchical approach to predict remaining processtime.Journal of Software, pp 1–16

Download references

Acknowledgements

This work is supported by National Key R &D Program of China [2022ZD0119501]; NSFC [52374221]; Sci. & Tech. Development Fund of Shandong Province of China [ZR2022MF288, ZR2023MF097]; the Taishan Scholar Program of Shandong Province [ts20190936].

Author information

Authors and Affiliations

Authors

Contributions

Minghao Zou: Conceptualization, Methodology, Writing-original draft, Investigation. Qingtian Zeng: Conceptualization, Methodology, Writing-Reviewing, Investigation. Hua Duan: Methodology, Writing-Reviewing. Weijian Ni: Writing-Reviewing, Investigation. Shuang Chen: Data curation.

Corresponding authors

Correspondence to Qingtian Zeng or Hua Duan.

Ethics declarations

Competing of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethical and informed consent for data used statement

All authors support the journal’s data ethics and support the informed consent policy.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zou, M., Zeng, Q., Duan, H. et al. TV-ALP: A log dataset of television assembly line production under multi-person collaboration for process mining research. Appl Intell 54, 3990–4011 (2024). https://doi.org/10.1007/s10489-024-05347-8

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-024-05347-8

Keywords

Navigation