Improving the Performance of Acoustic Event Classification by Selecting and Combining Information Sources Using the Fuzzy Integral

  • Conference paper
Machine Learning for Multimodal Interaction (MLMI 2005)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3869)

Abstract

Acoustic events produced in meeting-room-like environments may carry information useful for perceptually aware interfaces. In this paper, we focus on the problem of combining different information sources at different structural levels for classifying human vocal-tract non-speech sounds. The Fuzzy Integral (FI) approach is used to fuse the outputs of several classification systems, and feature selection and ranking are carried out based on knowledge extracted from the Fuzzy Measure (FM). In experiments with a limited set of training data, FI-based decision-level fusion achieved much higher classification performance than the best single classifier, and it can surpass the performance obtained from feature-level integration by Support Vector Machines. Although only the fusion of audio information sources is considered in this work, the conclusions may extend to the multi-modal case.
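The decision-level fusion described above can be illustrated with a small sketch. The abstract does not say which form of the FI is used; the code below assumes the discrete Choquet integral, one common choice, together with a hand-picked fuzzy measure. The function names (choquet_integral, fuse), the scores, and the measure values are all hypothetical and are not taken from the paper.

```python
from typing import Dict, FrozenSet, List, Sequence, Tuple

# Hypothetical fuzzy measure over two classifiers (indices 0 and 1).
# It assigns a weight to every non-empty subset of classifiers and is
# monotone, with the full set mapped to 1.0. Values are illustrative only.
EXAMPLE_MEASURE: Dict[FrozenSet[int], float] = {
    frozenset({0}): 0.6,
    frozenset({1}): 0.3,
    frozenset({0, 1}): 1.0,
}


def choquet_integral(
    scores: Sequence[float], measure: Dict[FrozenSet[int], float]
) -> float:
    """Discrete Choquet integral of classifier scores w.r.t. a fuzzy measure.

    scores[i] is the score classifier i gives to one class.
    """
    # Visit classifiers from highest to lowest score.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    total = 0.0
    coalition = set()
    for rank, idx in enumerate(order):
        coalition.add(idx)
        # Score of the next classifier in the ordering (0.0 after the last).
        if rank + 1 < len(order):
            next_score = scores[order[rank + 1]]
        else:
            next_score = 0.0
        # Weight the score drop by the measure of the classifiers seen so far.
        total += (scores[idx] - next_score) * measure[frozenset(coalition)]
    return total


def fuse(
    class_scores: Sequence[Sequence[float]],
    measure: Dict[FrozenSet[int], float],
) -> Tuple[int, List[float]]:
    """Fuse per-class classifier scores and return (winning class, fused scores).

    class_scores[c][i] is the score classifier i gives to class c.
    """
    fused = [choquet_integral(s, measure) for s in class_scores]
    best = max(range(len(fused)), key=lambda c: fused[c])
    return best, fused


if __name__ == "__main__":
    # Two classes, two classifiers; the numbers are made up for illustration.
    best, fused = fuse([[0.8, 0.4], [0.3, 0.9]], EXAMPLE_MEASURE)
    print(best, fused)
```

In this toy setup classifier 1 is the most confident source for the second class, yet the fused decision favors the first class because the assumed measure gives classifier 0 more weight. With an additive measure (for example 0.5 for each single classifier), the Choquet integral reduces to a weighted average of the scores.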

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Temko, A., Macho, D., Nadeu, C. (2006). Improving the Performance of Acoustic Event Classification by Selecting and Combining Information Sources Using the Fuzzy Integral. In: Renals, S., Bengio, S. (eds) Machine Learning for Multimodal Interaction. MLMI 2005. Lecture Notes in Computer Science, vol 3869. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11677482_31

  • DOI: https://doi.org/10.1007/11677482_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32549-9

  • Online ISBN: 978-3-540-32550-5

  • eBook Packages: Computer Science, Computer Science (R0)
