A Speech Recognition Mechanism for Enabling Interactions Between End-Users and Healthcare Applications

  • Conference paper
17th International Conference on Information Technology–New Generations (ITNG 2020)

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1134)

Abstract

The constant evolution of software development technologies has provided new interaction mechanisms for improving the usability of software systems and increasing the productivity of healthcare professionals. In this context, speech recognition uses methods and technologies that allow spoken language to be captured and transcribed automatically. However, only a few studies have involved health professionals in prototyping and validating graphical user interfaces with speech recognition to support the development of healthcare applications. This paper specifies a computational solution that uses speech recognition to assist healthcare professionals in recording patients’ clinical care data. Six physicians participated in our study by carrying out prototyping activities and specifying the workflow to be followed in patient care. The software architecture is then specified, and the proposed solution, which was implemented based on the prototyping task performed by the end users, is detailed. To evaluate the proposed solution, we conducted interviews with health professionals, and the results showed a reduction in the time and effort needed to record patient information. In addition, using a quantitative approach, we investigated learnability, memorability, efficiency and satisfaction; the proposed healthcare application obtained an average usability evaluation of 88%.

Author information

Corresponding author

Correspondence to Henrique Couto.

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Couto, H., Lima, Í., Souza, R., Araújo, A., Times, V. (2020). A Speech Recognition Mechanism for Enabling Interactions Between End-Users and Healthcare Applications. In: Latifi, S. (eds) 17th International Conference on Information Technology–New Generations (ITNG 2020). Advances in Intelligent Systems and Computing, vol 1134. Springer, Cham. https://doi.org/10.1007/978-3-030-43020-7_57

  • DOI: https://doi.org/10.1007/978-3-030-43020-7_57

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-43019-1

  • Online ISBN: 978-3-030-43020-7

  • eBook Packages: Engineering, Engineering (R0)
