A Speech Recognition Mechanism for Enabling Interactions Between End-Users and Healthcare Applications

Couto, Henrique; Lima, Ítalo; Souza, Rychard; Araújo, André; Times, Valéria

doi:10.1007/978-3-030-43020-7_57

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1134)

Abstract

The constant evolution of software development technologies has provided new interaction mechanisms that improve the usability of software systems and increase the productivity of healthcare professionals. In this context, speech recognition comprises methods and technologies that automatically capture and transcribe spoken language. However, only a few studies have involved health professionals in prototyping and validating graphical user interfaces with speech recognition to support the development of healthcare applications. This paper specifies a computational solution that uses speech recognition to assist healthcare professionals in recording patients’ clinical care data. Six physicians participated in our study by carrying out prototyping activities and specifying the workflow to be followed in patient care. We then specify the software architecture and detail the proposed solution, which was implemented based on the prototyping task performed by the end users. To evaluate the proposed solution, we conducted interviews with health professionals, and the results showed a reduction in the time and effort needed to record patient information. In addition, using a quantitative approach, we investigated learnability, memorability, efficiency and satisfaction; the proposed healthcare application tool obtained an average usability rating of 88%.

Author information

Authors and Affiliations

Advanced Studies in Data Science and Software Engineering, Federal University of Alagoas, Penedo, Brazil
Henrique Couto, Ítalo Lima, Rychard Souza & André Araújo
Center for Informatics, Federal University of Pernambuco, Recife, Brazil
Valéria Times

Corresponding author

Correspondence to Henrique Couto.

Editor information

Editors and Affiliations

Department of Electrical and Computer Engineering, University of Nevada, Las Vegas, Las Vegas, NV, USA
Shahram Latifi

About this paper

Cite this paper

Couto, H., Lima, Í., Souza, R., Araújo, A., Times, V. (2020). A Speech Recognition Mechanism for Enabling Interactions Between End-Users and Healthcare Applications. In: Latifi, S. (ed.) 17th International Conference on Information Technology–New Generations (ITNG 2020). Advances in Intelligent Systems and Computing, vol 1134. Springer, Cham. https://doi.org/10.1007/978-3-030-43020-7_57

DOI: https://doi.org/10.1007/978-3-030-43020-7_57
Published: 12 May 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-43019-1
Online ISBN: 978-3-030-43020-7
eBook Packages: Engineering, Engineering (R0)
