Computational Analysis of Audio Recordings of Piano Performance for Automatic Evaluation

Kato, Norihiro; Nakamura, Eita; Mine, Kyoko; Doeda, Orie; Yamada, Masanao

doi:10.1007/978-3-031-42682-7_46

Norihiro Kato¹²,
Eita Nakamura¹³,
Kyoko Mine¹⁴,
Orie Doeda¹² &
…
Masanao Yamada¹²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14200))

Included in the following conference series:

European Conference on Technology Enhanced Learning

1445 Accesses

Abstract

We developed a computational evaluation method for piano performance with the goal of building a practice support system for beginners. We recorded students’ performances as audio data and applied several recent methods for audio-to-MIDI transcription based on deep neural networks to extract the pitch, onset time, and offset time of musical notes. To determine the correctness of the performance, we aligned the extracted MIDI data with the musical score using a hidden Markov model (HMM). We compared the audio-to-MIDI transcription methods and optimized the weight on different types of performance errors to conform to teacher’s assessment. Our experiments showed a strong correlation between the rate of performance errors obtained from the alignment and the evaluation by a teacher who listened to the performance. The results that indicate performance errors and tempo stability can be used in a practice support system that provides feedback to learners.

This work was supported by JSPS KAKENHI Grant Numbers 21K02846, 21K12187, 22H03661.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Deja, J.A.: Piano learning and improvisation through adaptive visualisation and digital augmentation. In: Companion Proceedings of the 2022 Conference on Interactive Surfaces and Spaces, pp. 41–45 (2022)
Google Scholar
Dorfman, J.: Theory and Practice of Technology-based Music Instruction. Oxford University Press, Oxford (2022)
Google Scholar
Fukuda, T., Ikemiya, Y., Itoyama, K., Yoshii, K.: “A score-informed piano tutoring system with mistake detection and score simplification” within the music education contexts. In: Proceedings of the 12th Sound and Music Computing Conference (SMC), vol. 1, pp. 105–110 (2015)
Google Scholar
Heyen, F., Ngo, Q.Q., Kurzhals, K., Sedlmair, M.: Data-driven visual reflection on music instrument practice. In: ACM CHI Conference on Human Factors in Computing Systems (2022)
Google Scholar
Kim, H., Ramoneda, P., Miron, M., Serra, X.: An overview of automatic piano performance assessment within the music education contexts. Proc. Int. Soc. Music Inf. Retrieval 1, 465–474 (2017)
Google Scholar
Kong, Q., Li, B., Song, X., Wan, Y., Wang, Y.: High-resolution piano transcription with pedals by regressing onset and offset times. IEEE/ACM Trans. Audio Speech Lang. Process. 29, 3707–3717 (2021)
Article Google Scholar
Lerch, A., Arthur, C., Pati, A., Gururani, S.: An interdisciplinary review of music performance analysis. Trans. Int. Soc. Music Inf. Retrieval 3(1), 221–245 (2021)
Article Google Scholar
Lima, H.B., Santos, C.G.R.D., Meiguins, B.S.: A survey of music visualization techniques. ACM Comput. Surv. (CSUR) 57(7), 1–29 (2022)
Article Google Scholar
Nakamura, E., Yoshii, K., Katayose, H.: Performance error detection and post-processing for fast and accurate symbolic music alignment. IN: Proceedings of the International Society for Music Information Retrieval, pp. 347–353 (2017)
Google Scholar
Shibata, K., Nakamura, E., Yoshi, K.: Non-local musical statistics as guides for audio-to-score piano transcription. Inf. Sci. 566, 262–280 (2021)
Article MathSciNet Google Scholar
Wang, W., Pan, J., Yi, H., Song, Z., Li, M.: Audio-based piano performance evaluation for beginners with convolutional neural network and attention mechanism. IEEE/ACM Trans. Audio Speech Lang. Process. 29, 1119–1133 (2021)
Article Google Scholar
Wu, C.W., Gururani, S., Pati, A., Vidwans, A.: Towards the objective assessment of music performances. In: International Conference on Music Perception and Cognition (ICMPC), pp. 99–103 (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

National Institute of Technology, Kushiro College, Kushiro, Japan
Norihiro Kato, Orie Doeda & Masanao Yamada
Kyoto University, Kyoto, Japan
Eita Nakamura
Osaka Ohtani University, Tondabayashi, Japan
Kyoko Mine

Authors

Norihiro Kato
View author publications
You can also search for this author in PubMed Google Scholar
Eita Nakamura
View author publications
You can also search for this author in PubMed Google Scholar
Kyoko Mine
View author publications
You can also search for this author in PubMed Google Scholar
Orie Doeda
View author publications
You can also search for this author in PubMed Google Scholar
Masanao Yamada
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Masanao Yamada .

Editor information

Editors and Affiliations

KTH Royal Institute of Technology, Stockholm, Sweden
Olga Viberg
Goethe University Frankfurt, Frankfurt am Main, Germany
Ioana Jivet
Universidad Carlos III de Madrid, Madrid, Spain
Pedro J. Muñoz-Merino
University of Macedonia, Thessaloniki, Greece
Maria Perifanou
CODE University of Applied Sciences, Berlin, Germany
Tina Papathoma

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kato, N., Nakamura, E., Mine, K., Doeda, O., Yamada, M. (2023). Computational Analysis of Audio Recordings of Piano Performance for Automatic Evaluation. In: Viberg, O., Jivet, I., Muñoz-Merino, P., Perifanou, M., Papathoma, T. (eds) Responsive and Sustainable Educational Futures. EC-TEL 2023. Lecture Notes in Computer Science, vol 14200. Springer, Cham. https://doi.org/10.1007/978-3-031-42682-7_46

Download citation

DOI: https://doi.org/10.1007/978-3-031-42682-7_46
Published: 28 August 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-42681-0
Online ISBN: 978-3-031-42682-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Computational Analysis of Audio Recordings of Piano Performance for Automatic Evaluation