International Journal on Digital Libraries

, Volume 10, Issue 4, pp 201–220 | Cite as

Methodologies and tools for audio digital archives

  • Nicola Orio
  • Lauro Snidaro
  • Sergio Canazza
  • Gian Luca Foresti
Article

Abstract

In response to the proposal of digitizing the entire back-run of several European audio archives, many research projects have been carried out in order to discover the technical issues involved in making prestigious audio documents digitally available, which are related to the A/D transfer process and supervised metadata extraction. This article gives an innovative approach to metadata extraction from such a complex source material. This article also describes the protocols defined, the processes undertaken, the results ascertained from several audio documents preservation projects and the techniques used. In addition, a number of recommendations are given for the re-recording process, aimed at minimizing the information loss and to automatically measure the unintentional alterations introduced by the A/D equipment.

Keywords

A/D transfer Metadata Digital archives Historical audio documents 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    3MCompany: High frequency bias requirements for magnetic tape recording. 3M SoundTalk Bull. 1(2), 1–4 (1968)Google Scholar
  2. 2.
    Adorno T.W.: Philosophy of New Music. University of Minnesota Press, Minneapolis (2006)Google Scholar
  3. 3.
    AES-11id-2006: AES Information Document for Preservation of Audio Recordings—Extended Term Storage Environment for Multiple Media Archives. AES (2006)Google Scholar
  4. 4.
    AES22-1997: AES Recommended Practice for Audio Preservation and Restoration—Storage and Handling—Storage of Polyester-Base Magnetic Tape. AES (2003)Google Scholar
  5. 5.
    AES28-1997: AES Standard for Audio Preservation and Restoration—Method for Estimating Life Expectancy of Compact Discs (CD-ROM), Based on Effects of Temperature and Relative Humidity (includes Amendment 1-2001). AES (2003)Google Scholar
  6. 6.
    AES31-2-2006: AES standard on Network and File Transfer of Audio—Audio-File Transfer Exchange—File Format for Transferring Digital Audio Data Between Systems of Different Type and Manufacture. AES (2006)Google Scholar
  7. 7.
    AES35-2000 AES Standard for Audio Preservation and Restoration—Method for Estimating Life Expectancy of Magneto-Optical (M-O) Disks, Based on Effects of Temperature and Relative Humidity. AES (2005)Google Scholar
  8. 8.
    AES38-2000: Aes Standard for Audio Preservation and Restoration—Life Expectancy of Information Stored in Recordable Compact Disc Systems—Method for Estimating, Based on Effects of Temperature and Relative Humidity (2005)Google Scholar
  9. 9.
    AES49-2005 AES Standard for Audio Preservation and Restoration—Magnetic Tape—Care and Handling Practices for Extended Usage. AES (2005)Google Scholar
  10. 10.
    A.E. Society: Method for measurement of weighted peak flutter of sound recording and reproducing equipment, AES6-2008. AES Standard (2008)Google Scholar
  11. 11.
    Bertram H., Cuddihy E.: Kinetics of the humid aging of magnetic recording tape. IEEE Trans. Magn. 27, 4388–4395 (1982)Google Scholar
  12. 12.
    Boll S.: Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust Speech Signal Process. ASSSP 27(2), 113–120 (1979)CrossRefGoogle Scholar
  13. 13.
    Boney, L., Tewfik, A., Hamdy, K.: Digital watermarks for audio signals. In: IEEE Proceedings Multimedia pp. 473–480 (1996)Google Scholar
  14. 14.
    Boston, G.: Safeguarding the Documentary Heritage. A Guide to Standards, Recommended Practices and Reference Literature Related to the Preservation of Documents of All Kinds. UNESCO (1988)Google Scholar
  15. 15.
    Brock-Nannestad, G.: The Objective Basis for the Production of High Quality Transfers from Pre-1925 Sound Recordings. In: AES Preprint n °4610 Audio Engineering Society 103rd Convention, pp. 26–29. New York (1997)Google Scholar
  16. 16.
    Brown University Library: Center for digital initiatives (2010). http://pike.services.brown.edu/
  17. 17.
    Burt L.: Chemical Technology in the Edison Recording Industry. J. Audio Eng. Soc. (10-11), 712–717 (1977)Google Scholar
  18. 18.
    Calas, M., Fountaine, J. La conservation des documents sonores. CNRS, Paris, France (1996)Google Scholar
  19. 19.
    Canazza S.: Noise and Representation Systems: A Comparison among Audio Restoration Algorithms. Lulu Enterprise, USA (2007)Google Scholar
  20. 20.
    Canazza S., Vidolin A.: Preserving electroacoustic music. J. New Music Res. 30(4), 351–363 (2001)CrossRefGoogle Scholar
  21. 21.
    Canazza, S., Vidolin, A.: Special issue on preserving electroacoustic music. J. New Music Res. 30(4) (2001)Google Scholar
  22. 22.
    Cano P., Batlle E., Kalker T., Haitsma J.: A review of audio fingerprinting. J. VLSI Signal Process. 41, 271–284 (2005)CrossRefGoogle Scholar
  23. 23.
    Cavaglieri, S., Johnsen, O., Bapst, F.: Optical retrieval and storage of analog sound recordings. In: AES (ed.) Proceedings of AES 20th International Conference. Budapest, Hungary (2001)Google Scholar
  24. 24.
    Cohen, E.: Preservation of audio in folk heritage collections in crisis. In: Proceedings of Council on Library and Information Resources. Washington, DC, USA (2001)Google Scholar
  25. 25.
    Daugman J.: How iris recognition works. IEEE Transactions on circuits and systems for video technology 14(1), 21–30 (2004)CrossRefGoogle Scholar
  26. 26.
    Daugman J.: New methods in iris recognition. IEEE Trans Syst Man Cybern B Cybern 37(5), 1167–1175 (2007)CrossRefGoogle Scholar
  27. 27.
    Dixon, S., Widmer, G.: Match: a music alignment tool chest. In: Proceedings of the International Conference of Music Information Retrieval, pp. 492–497 (2005)Google Scholar
  28. 28.
    EBU: Specification of the Broadcast Wave Format: A Format for Audio Data Files in Broadcasting—Tech 3285. EBU (1997)Google Scholar
  29. 29.
    Ephraim, Y., Malah, D.: Speech Enhancement Using a Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator. IEEE Trans. Acoust. Speech Signal Process. 32(6), 1109–1121Google Scholar
  30. 30.
    Esquef P.A.A., Valimaki V., Karjalainen M.: Restoration and enhancement of solo guitar recordings based on sound source modeling. J. Audio Eng. Soc. 50(4), 227–236 (2002)Google Scholar
  31. 31.
    Fedeyev V., Haber C.: Reconstruction of mechanically recorded sound by image processing. J. Audio Eng. Soc. 51(12), 1172–1185 (2003)Google Scholar
  32. 32.
    Gibson, G.: Magnetic tape deterioration: recognition, recovery and prevention (1996). http://www.unesco.org/webworld/ramp/html/r9704e/r9704e11.htm
  33. 33.
    Grancharov, V., Samuelsson, J., Kleijn, B.: Noise-dependent postfiltering. Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP) 1, 457–460 (2004)Google Scholar
  34. 34.
    Grancharov, V., Samuelsson, J., Kleijn, B.: Improved Kalman filtering for speech enhancement. In: Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP) 1, 1109–1112 (2005)Google Scholar
  35. 35.
    Grancharov V., Samuelsson J., Kleijn B.: On casual algorithms for speech enhancement. Trans. Audio Speech Lang. Process. 14(3), 273–276 (2006)Google Scholar
  36. 36.
    Hart, M.: Preserving our musical heritage: a musician’s outreach to audio engineers. J. Audio Eng. Soc. 49(7–8) (2001)Google Scholar
  37. 37.
    IASA-TC 03: The Safeguarding of the Audio Heritage: Ethics, Principles and Preservation Strategy. IASA Technical Committee (2005)Google Scholar
  38. 38.
    IASA-TC 04: Guidelines on the Production and Preservation of Digital Objects. IASA Technical Committee (2004)Google Scholar
  39. 39.
    IFLA/UNESCO: Safeguarding our Documentary Heritage/Conservation préventive du patrimoine documentaire/Salvaguardando nuestro patrimonio documental. CD-ROM Bi-lingual: English/French/Spanish. UNESCO “Memory of the World” Programme, French Ministry of Culture and Communication (2000)Google Scholar
  40. 40.
    Khanna S.: Vinyl compound for the phonographic industry. J. Audio Eng. Soc. 10–11, 712–717 (1977)Google Scholar
  41. 41.
    Knight G.: Factors relating to long term storage of magnetic tape. Phonograph. Bull. 18, 16–37 (1977)Google Scholar
  42. 42.
    Laurent S.: The Care of Cylinders and Discs. Technical Coordinating. Committee, Milton Keynes (1997)Google Scholar
  43. 43.
    Library of Congress: Metadata encoding and transmission standard (METS) (2010). http://www.loc.gov/standards/mets/
  44. 44.
    Lim J., Oppenheim A.: All-pole modeling of degraded speech. IEEE Trans. Acoust. Speech Signal Process. 26(3), 197–210 (1978)MATHCrossRefGoogle Scholar
  45. 45.
    Liu Y., Zhang D., Lu G., Ma W.: A survey of content-based image retrieval with high-level semantics. Pattern Recognit. 40(1), 262–282 (2007)MATHCrossRefGoogle Scholar
  46. 46.
    Ma N., Bouchard M., Goubran R.A.: Speech enhancement using a masking threshold constrained Kalman filter and its heuristic implementations. IEEE Trans. Speech Audio Lang. Process. 14(1), 19–32 (2006)CrossRefGoogle Scholar
  47. 47.
    Manjunath B., Salembier P., Sikora T.: Introduction to MPEG-7: Multimedia Content Description Interface. Wiley, New York (2002)Google Scholar
  48. 48.
    Miller D.: The Science of Musical Sounds. Macmillan, New York (1922)Google Scholar
  49. 49.
    Miotto, R., Orio, N.: Automatic identification of music works through audio matching. In: Proceedings of 11th European Conference on Digital Libraries, pp. 124–135 (2007)Google Scholar
  50. 50.
    Müller, M., Kurth, F., Clausen, F.: Audio matching via chroma-based statistical features. In: Proceedings of the International Conference of Music Information Retrieval, pp. 288–295 (2005)Google Scholar
  51. 51.
    Negroponte N.: Being Digital. Vintage Books, New York (1995)Google Scholar
  52. 52.
    Niedźwiecki M., Cisowski K.: Adaptive scheme for elimination of broadband noise and impulsive disturbances from AR and ARMA signals. IEEE Trans. Signal Process. 44(3), 967–982 (1996)Google Scholar
  53. 53.
    Orcalli A.: On the methodologies of audio restoration. J. New Music Res. 30(4), 307–322 (2001)CrossRefGoogle Scholar
  54. 54.
    Orio, N., Snidaro, L., Canazza, S.: Semi-automatic metadata extraction from shellac and vinyl disc. In: Proceedings of Workshop on Digital Preservation Weaving Factory for Analogue Audio Collections. Firenze University Press, Firenze, Italy, pp. 38–45 (2008)Google Scholar
  55. 55.
    Orio, N., Zattra, L.: Audio matching for the philological analysis of electro-acoustic music. In: Proceedings of the International Computer Music Conference, pp. 157–164 (2007)Google Scholar
  56. 56.
    Paliwal, K., Basu, A.: A speech enhancement method based on Kalman filtering. Proc. IEEE Int. Conf. Acoust. Speech Signal Audio Process. vol. 12, pp. 177–180 (1987)Google Scholar
  57. 57.
    Pickett A., Lemcoe M.: Preservation and Storage of Sound Recordings. ARSC, Washington, DC, USA (1991)Google Scholar
  58. 58.
    Rabiner L., Juang B.: Fundamentals of Speech Recognition. Prentice-Hall, Englewood Cliffs, NJ (1993)Google Scholar
  59. 59.
    Schüller D.: The ethics of preservation, restoration, and re-issues of historical sound recordings. J. Audio Eng. Soc. 39(12), 1014–1016 (1991)Google Scholar
  60. 60.
    Shapiro L., Stockman G.: Computer Vision. Prentice-Hall, Upper Saddle River (2001)Google Scholar
  61. 61.
    Smith, A.: Why digitize? In: Proceedings of Council on Library and Information Resources. Washington, DC, USA (1999)Google Scholar
  62. 62.
    Snidaro L., Foresti G.L.: Real-time thresholding with Euler numbers. Pattern Recognit. Lett. 24(9-10), 1533–1544 (2003)MATHCrossRefGoogle Scholar
  63. 63.
    Storm W.: The establishment of international re-recording standards. Phonograph. Bull. 27, 5–12 (1980)Google Scholar
  64. 64.
    Stotzer, S., Johnsen, O., Bapst, F., Sudan, C., Ingol, R.: Phonographic sound extraction using image and signal processing. In: Proc. ICASSP, 4, 289–292 (2004)Google Scholar
  65. 65.
    Tsoukalas D., Mourjopoulos J., Kokkinakis G.: Speech enhancement based on audible noise suppression. IEEE Trans. Acoust. Speech Signal Process. 5(6), 497–514 (1997)Google Scholar
  66. 66.
    Virag N.: Single channel speech enhancement based on masking properties of the human auditory system. IEEE Trans. Acoust. Speech Signal Process. 7(2), 126–137 (1999)Google Scholar

Copyright information

© Springer-Verlag 2010

Authors and Affiliations

  • Nicola Orio
    • 1
  • Lauro Snidaro
    • 2
  • Sergio Canazza
    • 1
  • Gian Luca Foresti
    • 2
  1. 1.Department of Information EngineeringUniversity of PadovaPadovaItaly
  2. 2.Department of Mathematics and InformaticsUniversity of UdineUdineItaly

Personalised recommendations