Abstract
Work measurement methods previously proposed require considerable time and effort by time study analysts because they have to measure the required time through direct observations. In this study, however, we propose a method which efficiently measures the standard times without involving human analysts by using speech recognition and digital image processing techniques. First, we implement a prototype system which can acquire the status of manufacturing cells through a speech recognition system. Second, using image processing, we suggest a method which consists of two main steps: motion representation and cycle segmentation. In the motion representation step, we first detect the motion of any object distinct from its background by differencing two consecutive images separated by a constant time interval. The images thus obtained then pass through an edge detector filter. Finally, the mean values of coordinates of significant pixels of the edge image are obtained. Through these processes, the motions of the observed worker are represented by two time series of data of worker location in horizontal and vertical axes. In the cycle segmentation step, we extract the frames which have maximum or minimum coordinates in one cycle, store them in a stack, and calculate each cycle time using these frames. In this step we also consider methods for detecting work delays due to unexpected events such as an operator’s movement out of the work area, or interruptions. To conclude, the experimental results show that the proposed method is very cost-effective and useful for measuring time standards for various work environments.
Similar content being viewed by others
References
Molnar KK, Kletke MG (1996) The impact on user performance and satisfaction of a voice-based front-end interface for a standard software tool. Int J Hum-Comput Stud 45:287–303
Nakazato S, Kudo I, Shirai K (1996) Hybrid method of data collection for evaluating speech dialogue system. IEICE Trans Info Syst E79-D(1):41–46
Dillon TW, Norcio AF (1997) User performance and acceptance of a speech-input interface in a health assessment task. Int J Hum-Comput Stud 47:591–602
Watanabe T (1999) Design of speech recognition system: problems and solutions. Syst Comput Japan 30(12):95–105
McCallum MC, Campbell JL, Richman JB, Brown JL (2004) Speech recognition and in-vehicle telematics devices: potential reductions in driver distraction. Int J Speech Technol 7:25–33
Lee C, Xu Y (1996) Interactive learning of gestures for human/robot interfaces. 1996 IEEE International Conference on Robotics and Automation, Minneapolis, MN 4:2982–2987
Freeman W, Roth M (1995) Orientation histogram for hand gesture recognition. Int’l Workshop on Automatic Face and Gesture Recognition
Toshiba Corp (1998) Toshiba’s motion processor recognizes gestures in real time basis for future generation of natural interfaces between people and computers. http://www.toshiba.com/news/980715.htm
Müller H, Michoux N, Bandon D, Geissbuhler A (2004) A review of content-based image retrieval systems in medical applications—clinical benefits and future directions. Int J Med Info 73(1):1–23
Bobick AF, Davis J (1996) Real-time recognition of activity using temporal templates. Proc IEEE Computer Society Workshop Applications on Computer Vision, Sarasota, FL, pp 39–42
Davis J (1999) Recognizing movement using motion histograms. MIT Media Lab Perceptual Computing Group Technical Report No. 487, MIT
Fejes S, Davis LS (1999) Detection of independent motion using directional motion estimation. Comput Vision Image Understand 74(2):101–120
Collins RT, Lipton A, Kanade T, Fujiyoshi H, Duggins D, Tsin Y, Tolliver D, Enomoto N, Hasegawa O (2000) A system for video surveillance and monitoring: VSAM final report. CMU-R1-TR-00-12, Technical Report, Carnegie Mellon University
Haritaoglu I, Harwood D, Davis LS (2000) W4: real-time surveillance of people and their activities. IEEE Trans Pattern Anal Mach Intell 22(8):809–830
Maggioni C, Kammerer B (1998) Gesture computer: history, design, and applications. In: Cipolla R, Pentland A (eds) Computer vision for human-machine interaction, pp 23–51, Cambridge University Press, Cambridge, MA
Koenderink JJ, van Doorn AJ (1986) Dynamic shape. Technical report, Department of Medical and Physiological Physics, State University, Utrecht, The Netherlands
Yang YM (1997) Sign language recognition using image processing. KOSEF No. 951-0906-085-2, Korea Science and Engineering Foundation
Sonka M, Hlavac V, Boyle R (1999) Image processing, analysis, and machine vision, 2nd edn. PWS, Boston
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Sim, ES., Lee, HG., Lee, JC. et al. Efficient work measurement system of manufacturing cells using speech recognition and digital image processing technology. Int J Adv Manuf Technol 29, 772–785 (2006). https://doi.org/10.1007/s00170-005-2557-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00170-005-2557-5