Skip to main content
Log in

Efficient work measurement system of manufacturing cells using speech recognition and digital image processing technology

  • Original Article
  • Published:
The International Journal of Advanced Manufacturing Technology Aims and scope Submit manuscript

Abstract

Work measurement methods previously proposed require considerable time and effort by time study analysts because they have to measure the required time through direct observations. In this study, however, we propose a method which efficiently measures the standard times without involving human analysts by using speech recognition and digital image processing techniques. First, we implement a prototype system which can acquire the status of manufacturing cells through a speech recognition system. Second, using image processing, we suggest a method which consists of two main steps: motion representation and cycle segmentation. In the motion representation step, we first detect the motion of any object distinct from its background by differencing two consecutive images separated by a constant time interval. The images thus obtained then pass through an edge detector filter. Finally, the mean values of coordinates of significant pixels of the edge image are obtained. Through these processes, the motions of the observed worker are represented by two time series of data of worker location in horizontal and vertical axes. In the cycle segmentation step, we extract the frames which have maximum or minimum coordinates in one cycle, store them in a stack, and calculate each cycle time using these frames. In this step we also consider methods for detecting work delays due to unexpected events such as an operator’s movement out of the work area, or interruptions. To conclude, the experimental results show that the proposed method is very cost-effective and useful for measuring time standards for various work environments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Molnar KK, Kletke MG (1996) The impact on user performance and satisfaction of a voice-based front-end interface for a standard software tool. Int J Hum-Comput Stud 45:287–303

    Google Scholar 

  2. Nakazato S, Kudo I, Shirai K (1996) Hybrid method of data collection for evaluating speech dialogue system. IEICE Trans Info Syst E79-D(1):41–46

  3. Dillon TW, Norcio AF (1997) User performance and acceptance of a speech-input interface in a health assessment task. Int J Hum-Comput Stud 47:591–602

    Google Scholar 

  4. Watanabe T (1999) Design of speech recognition system: problems and solutions. Syst Comput Japan 30(12):95–105

    Article  Google Scholar 

  5. McCallum MC, Campbell JL, Richman JB, Brown JL (2004) Speech recognition and in-vehicle telematics devices: potential reductions in driver distraction. Int J Speech Technol 7:25–33

    Article  Google Scholar 

  6. Lee C, Xu Y (1996) Interactive learning of gestures for human/robot interfaces. 1996 IEEE International Conference on Robotics and Automation, Minneapolis, MN 4:2982–2987

  7. Freeman W, Roth M (1995) Orientation histogram for hand gesture recognition. Int’l Workshop on Automatic Face and Gesture Recognition

  8. Toshiba Corp (1998) Toshiba’s motion processor recognizes gestures in real time basis for future generation of natural interfaces between people and computers. http://www.toshiba.com/news/980715.htm

  9. Müller H, Michoux N, Bandon D, Geissbuhler A (2004) A review of content-based image retrieval systems in medical applications—clinical benefits and future directions. Int J Med Info 73(1):1–23

    Article  Google Scholar 

  10. Bobick AF, Davis J (1996) Real-time recognition of activity using temporal templates. Proc IEEE Computer Society Workshop Applications on Computer Vision, Sarasota, FL, pp 39–42

  11. Davis J (1999) Recognizing movement using motion histograms. MIT Media Lab Perceptual Computing Group Technical Report No. 487, MIT

  12. Fejes S, Davis LS (1999) Detection of independent motion using directional motion estimation. Comput Vision Image Understand 74(2):101–120

    Article  Google Scholar 

  13. Collins RT, Lipton A, Kanade T, Fujiyoshi H, Duggins D, Tsin Y, Tolliver D, Enomoto N, Hasegawa O (2000) A system for video surveillance and monitoring: VSAM final report. CMU-R1-TR-00-12, Technical Report, Carnegie Mellon University

    Google Scholar 

  14. Haritaoglu I, Harwood D, Davis LS (2000) W4: real-time surveillance of people and their activities. IEEE Trans Pattern Anal Mach Intell 22(8):809–830

    Article  Google Scholar 

  15. Maggioni C, Kammerer B (1998) Gesture computer: history, design, and applications. In: Cipolla R, Pentland A (eds) Computer vision for human-machine interaction, pp 23–51, Cambridge University Press, Cambridge, MA

  16. Koenderink JJ, van Doorn AJ (1986) Dynamic shape. Technical report, Department of Medical and Physiological Physics, State University, Utrecht, The Netherlands

  17. Yang YM (1997) Sign language recognition using image processing. KOSEF No. 951-0906-085-2, Korea Science and Engineering Foundation

  18. Sonka M, Hlavac V, Boyle R (1999) Image processing, analysis, and machine vision, 2nd edn. PWS, Boston

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hyoung-Gon Lee.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sim, ES., Lee, HG., Lee, JC. et al. Efficient work measurement system of manufacturing cells using speech recognition and digital image processing technology. Int J Adv Manuf Technol 29, 772–785 (2006). https://doi.org/10.1007/s00170-005-2557-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00170-005-2557-5

Keywords

Navigation