Abstract
The pulse code modulation (PCM) method of digitizing analog signals has become a standard both in digital audio and in speech research, the focus of this paper. The solutions to some problems encountered in earlier systems at Haskins Laboratories are outlined, along with general properties of A/D conversion. Specialized features of the current Haskins Laboratories system, which has also been installed at more than a dozen other laboratories, are also detailed: the Nyquist filter response, the high-frequency preemphasis filter characteristics, the dynamic range, the timing resolution for single- and (synchronized) dual-channel signals, and the form of the digitized speech files (header information, data, and label structure). While the solutions adopted in this system are not intended to be considered a standard, the design principles involved are of interest to users and creators of other PCM systems.
Article PDF
Similar content being viewed by others
References
Baken, R. J. (1987).Clinical measurement of speech and voice. Boston: College-Hill.
Cooper, F. S., &Mattingly, I. G. (1969). A computer-controlled PCM system for the investigation of dichotic speech perception.Journal of the Acoustical Society of America,46, S115(A).
Fowler, C. A., Whalen, D. H., &Cooper, A. M. (1988). Perceived timing is produced timing: A reply to Howell.Perception & Psychophysics,43, 94–98.
Goodall, W. M. (1947). Telephony by pulse code modulation.Bell 1 System Technical Journal,26, 395–409.
Heute, U. (1988). Medium-rate speech coding-Trial of a review.Speech Communication,7, 125–149.
Howell, P. (1988). Prediction of the P-center location from the distribution of energy in the amplitude envelope: I.Perception & Psychophysics,43, 90–93.
Licklider, J. C. R., &Pollack, I. (1948). Effects of differentiation, integration and infinite peak clipping upon the intelligibility of speech.Journal of the Acoustical Society of America,25, 375–388.
Mertus, J. (1989). Standards for PCM files.Behavior Research Methods, Instruments, & Computers,21, 126–129.
Robinson, D. W., &Dadson, R. S. (1956). A redetermination of the equal-loudness relations for pure tones.British Journal of Applied Physics,7, 166–181.
Schroeder, M. R. (1968). Reference signal for signal quality studies.Journal of the Acoustical Society of America,44, 1735–1736.
Tuller, B., &Fowler, C. A. (1981). The contribution of amplitude to the perception of isochrony.Haskins Laboratories Status Report on Speech Research,SR65, 245–250). New Haven, CT: Haskins Laboratories.
Whalen, D. H. (1984). Subcategorical phonetic mismatches slow phonetic judgments.Perception & Psychophysics,35, 49–64.
Author information
Authors and Affiliations
Additional information
The writing of this article was supported by NIH Contract N01-HD-5-2910 to Haskins Laboratories. We thank Michael D’Angelo, Vincent Gulisano, Mark Tiede, Ignatius G. Mattingly, Patrick W. Nye, Tom Carrell, David B. Pisoni, and two anonymous reviewers for helpful comments. We also thank Leonard Szubowicz for the time and care spent designing and implementing the original version of the Haskins PCM software.
Rights and permissions
About this article
Cite this article
Whalen, D.H., Wiley, E.R., Rubin, P.E. et al. The Haskins Laboratories’ pulse code modulation (PCM) system. Behavior Research Methods, Instruments, & Computers 22, 550–559 (1990). https://doi.org/10.3758/BF03204440
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03204440