Skip to main content

Methodologies and tools for audio digital archives


In response to the proposal of digitizing the entire back-run of several European audio archives, many research projects have been carried out in order to discover the technical issues involved in making prestigious audio documents digitally available, which are related to the A/D transfer process and supervised metadata extraction. This article gives an innovative approach to metadata extraction from such a complex source material. This article also describes the protocols defined, the processes undertaken, the results ascertained from several audio documents preservation projects and the techniques used. In addition, a number of recommendations are given for the re-recording process, aimed at minimizing the information loss and to automatically measure the unintentional alterations introduced by the A/D equipment.

This is a preview of subscription content, access via your institution.


  1. 3MCompany: High frequency bias requirements for magnetic tape recording. 3M SoundTalk Bull. 1(2), 1–4 (1968)

    Google Scholar 

  2. Adorno T.W.: Philosophy of New Music. University of Minnesota Press, Minneapolis (2006)

    Google Scholar 

  3. AES-11id-2006: AES Information Document for Preservation of Audio Recordings—Extended Term Storage Environment for Multiple Media Archives. AES (2006)

  4. AES22-1997: AES Recommended Practice for Audio Preservation and Restoration—Storage and Handling—Storage of Polyester-Base Magnetic Tape. AES (2003)

  5. AES28-1997: AES Standard for Audio Preservation and Restoration—Method for Estimating Life Expectancy of Compact Discs (CD-ROM), Based on Effects of Temperature and Relative Humidity (includes Amendment 1-2001). AES (2003)

  6. AES31-2-2006: AES standard on Network and File Transfer of Audio—Audio-File Transfer Exchange—File Format for Transferring Digital Audio Data Between Systems of Different Type and Manufacture. AES (2006)

  7. AES35-2000 AES Standard for Audio Preservation and Restoration—Method for Estimating Life Expectancy of Magneto-Optical (M-O) Disks, Based on Effects of Temperature and Relative Humidity. AES (2005)

  8. AES38-2000: Aes Standard for Audio Preservation and Restoration—Life Expectancy of Information Stored in Recordable Compact Disc Systems—Method for Estimating, Based on Effects of Temperature and Relative Humidity (2005)

  9. AES49-2005 AES Standard for Audio Preservation and Restoration—Magnetic Tape—Care and Handling Practices for Extended Usage. AES (2005)

  10. A.E. Society: Method for measurement of weighted peak flutter of sound recording and reproducing equipment, AES6-2008. AES Standard (2008)

  11. Bertram H., Cuddihy E.: Kinetics of the humid aging of magnetic recording tape. IEEE Trans. Magn. 27, 4388–4395 (1982)

    Google Scholar 

  12. Boll S.: Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust Speech Signal Process. ASSSP 27(2), 113–120 (1979)

    Article  Google Scholar 

  13. Boney, L., Tewfik, A., Hamdy, K.: Digital watermarks for audio signals. In: IEEE Proceedings Multimedia pp. 473–480 (1996)

  14. Boston, G.: Safeguarding the Documentary Heritage. A Guide to Standards, Recommended Practices and Reference Literature Related to the Preservation of Documents of All Kinds. UNESCO (1988)

  15. Brock-Nannestad, G.: The Objective Basis for the Production of High Quality Transfers from Pre-1925 Sound Recordings. In: AES Preprint n °4610 Audio Engineering Society 103rd Convention, pp. 26–29. New York (1997)

  16. Brown University Library: Center for digital initiatives (2010).

  17. Burt L.: Chemical Technology in the Edison Recording Industry. J. Audio Eng. Soc. (10-11), 712–717 (1977)

    Google Scholar 

  18. Calas, M., Fountaine, J. La conservation des documents sonores. CNRS, Paris, France (1996)

  19. Canazza S.: Noise and Representation Systems: A Comparison among Audio Restoration Algorithms. Lulu Enterprise, USA (2007)

    Google Scholar 

  20. Canazza S., Vidolin A.: Preserving electroacoustic music. J. New Music Res. 30(4), 351–363 (2001)

    Article  Google Scholar 

  21. Canazza, S., Vidolin, A.: Special issue on preserving electroacoustic music. J. New Music Res. 30(4) (2001)

  22. Cano P., Batlle E., Kalker T., Haitsma J.: A review of audio fingerprinting. J. VLSI Signal Process. 41, 271–284 (2005)

    Article  Google Scholar 

  23. Cavaglieri, S., Johnsen, O., Bapst, F.: Optical retrieval and storage of analog sound recordings. In: AES (ed.) Proceedings of AES 20th International Conference. Budapest, Hungary (2001)

  24. Cohen, E.: Preservation of audio in folk heritage collections in crisis. In: Proceedings of Council on Library and Information Resources. Washington, DC, USA (2001)

  25. Daugman J.: How iris recognition works. IEEE Transactions on circuits and systems for video technology 14(1), 21–30 (2004)

    Article  Google Scholar 

  26. Daugman J.: New methods in iris recognition. IEEE Trans Syst Man Cybern B Cybern 37(5), 1167–1175 (2007)

    Article  Google Scholar 

  27. Dixon, S., Widmer, G.: Match: a music alignment tool chest. In: Proceedings of the International Conference of Music Information Retrieval, pp. 492–497 (2005)

  28. EBU: Specification of the Broadcast Wave Format: A Format for Audio Data Files in Broadcasting—Tech 3285. EBU (1997)

  29. Ephraim, Y., Malah, D.: Speech Enhancement Using a Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator. IEEE Trans. Acoust. Speech Signal Process. 32(6), 1109–1121

  30. Esquef P.A.A., Valimaki V., Karjalainen M.: Restoration and enhancement of solo guitar recordings based on sound source modeling. J. Audio Eng. Soc. 50(4), 227–236 (2002)

    Google Scholar 

  31. Fedeyev V., Haber C.: Reconstruction of mechanically recorded sound by image processing. J. Audio Eng. Soc. 51(12), 1172–1185 (2003)

    Google Scholar 

  32. Gibson, G.: Magnetic tape deterioration: recognition, recovery and prevention (1996).

  33. Grancharov, V., Samuelsson, J., Kleijn, B.: Noise-dependent postfiltering. Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP) 1, 457–460 (2004)

  34. Grancharov, V., Samuelsson, J., Kleijn, B.: Improved Kalman filtering for speech enhancement. In: Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP) 1, 1109–1112 (2005)

  35. Grancharov V., Samuelsson J., Kleijn B.: On casual algorithms for speech enhancement. Trans. Audio Speech Lang. Process. 14(3), 273–276 (2006)

    Google Scholar 

  36. Hart, M.: Preserving our musical heritage: a musician’s outreach to audio engineers. J. Audio Eng. Soc. 49(7–8) (2001)

  37. IASA-TC 03: The Safeguarding of the Audio Heritage: Ethics, Principles and Preservation Strategy. IASA Technical Committee (2005)

  38. IASA-TC 04: Guidelines on the Production and Preservation of Digital Objects. IASA Technical Committee (2004)

  39. IFLA/UNESCO: Safeguarding our Documentary Heritage/Conservation préventive du patrimoine documentaire/Salvaguardando nuestro patrimonio documental. CD-ROM Bi-lingual: English/French/Spanish. UNESCO “Memory of the World” Programme, French Ministry of Culture and Communication (2000)

  40. Khanna S.: Vinyl compound for the phonographic industry. J. Audio Eng. Soc. 10–11, 712–717 (1977)

    Google Scholar 

  41. Knight G.: Factors relating to long term storage of magnetic tape. Phonograph. Bull. 18, 16–37 (1977)

    Google Scholar 

  42. Laurent S.: The Care of Cylinders and Discs. Technical Coordinating. Committee, Milton Keynes (1997)

    Google Scholar 

  43. Library of Congress: Metadata encoding and transmission standard (METS) (2010).

  44. Lim J., Oppenheim A.: All-pole modeling of degraded speech. IEEE Trans. Acoust. Speech Signal Process. 26(3), 197–210 (1978)

    MATH  Article  Google Scholar 

  45. Liu Y., Zhang D., Lu G., Ma W.: A survey of content-based image retrieval with high-level semantics. Pattern Recognit. 40(1), 262–282 (2007)

    MATH  Article  Google Scholar 

  46. Ma N., Bouchard M., Goubran R.A.: Speech enhancement using a masking threshold constrained Kalman filter and its heuristic implementations. IEEE Trans. Speech Audio Lang. Process. 14(1), 19–32 (2006)

    Article  Google Scholar 

  47. Manjunath B., Salembier P., Sikora T.: Introduction to MPEG-7: Multimedia Content Description Interface. Wiley, New York (2002)

    Google Scholar 

  48. Miller D.: The Science of Musical Sounds. Macmillan, New York (1922)

    Google Scholar 

  49. Miotto, R., Orio, N.: Automatic identification of music works through audio matching. In: Proceedings of 11th European Conference on Digital Libraries, pp. 124–135 (2007)

  50. Müller, M., Kurth, F., Clausen, F.: Audio matching via chroma-based statistical features. In: Proceedings of the International Conference of Music Information Retrieval, pp. 288–295 (2005)

  51. Negroponte N.: Being Digital. Vintage Books, New York (1995)

    Google Scholar 

  52. Niedźwiecki M., Cisowski K.: Adaptive scheme for elimination of broadband noise and impulsive disturbances from AR and ARMA signals. IEEE Trans. Signal Process. 44(3), 967–982 (1996)

    Google Scholar 

  53. Orcalli A.: On the methodologies of audio restoration. J. New Music Res. 30(4), 307–322 (2001)

    Article  Google Scholar 

  54. Orio, N., Snidaro, L., Canazza, S.: Semi-automatic metadata extraction from shellac and vinyl disc. In: Proceedings of Workshop on Digital Preservation Weaving Factory for Analogue Audio Collections. Firenze University Press, Firenze, Italy, pp. 38–45 (2008)

  55. Orio, N., Zattra, L.: Audio matching for the philological analysis of electro-acoustic music. In: Proceedings of the International Computer Music Conference, pp. 157–164 (2007)

  56. Paliwal, K., Basu, A.: A speech enhancement method based on Kalman filtering. Proc. IEEE Int. Conf. Acoust. Speech Signal Audio Process. vol. 12, pp. 177–180 (1987)

  57. Pickett A., Lemcoe M.: Preservation and Storage of Sound Recordings. ARSC, Washington, DC, USA (1991)

    Google Scholar 

  58. Rabiner L., Juang B.: Fundamentals of Speech Recognition. Prentice-Hall, Englewood Cliffs, NJ (1993)

    Google Scholar 

  59. Schüller D.: The ethics of preservation, restoration, and re-issues of historical sound recordings. J. Audio Eng. Soc. 39(12), 1014–1016 (1991)

    Google Scholar 

  60. Shapiro L., Stockman G.: Computer Vision. Prentice-Hall, Upper Saddle River (2001)

    Google Scholar 

  61. Smith, A.: Why digitize? In: Proceedings of Council on Library and Information Resources. Washington, DC, USA (1999)

  62. Snidaro L., Foresti G.L.: Real-time thresholding with Euler numbers. Pattern Recognit. Lett. 24(9-10), 1533–1544 (2003)

    MATH  Article  Google Scholar 

  63. Storm W.: The establishment of international re-recording standards. Phonograph. Bull. 27, 5–12 (1980)

    Google Scholar 

  64. Stotzer, S., Johnsen, O., Bapst, F., Sudan, C., Ingol, R.: Phonographic sound extraction using image and signal processing. In: Proc. ICASSP, 4, 289–292 (2004)

  65. Tsoukalas D., Mourjopoulos J., Kokkinakis G.: Speech enhancement based on audible noise suppression. IEEE Trans. Acoust. Speech Signal Process. 5(6), 497–514 (1997)

    Google Scholar 

  66. Virag N.: Single channel speech enhancement based on masking properties of the human auditory system. IEEE Trans. Acoust. Speech Signal Process. 7(2), 126–137 (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Sergio Canazza.

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Orio, N., Snidaro, L., Canazza, S. et al. Methodologies and tools for audio digital archives. Int J Digit Libr 10, 201–220 (2009).

Download citation

  • Published:

  • Issue Date:

  • DOI:


  • A/D transfer
  • Metadata
  • Digital archives
  • Historical audio documents