Multi-class Ensemble-Based Active Learning

Körner, Christine; Wrobel, Stefan

doi:10.1007/11871842_68

Christine Körner²¹ &
Stefan Wrobel^21,22

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4212))

Included in the following conference series:

European Conference on Machine Learning

6072 Accesses
20 Citations

Abstract

Ensemble-based active learning has been proven to efficiently reduce the number of training instances and thus the cost of data acquisition. To determine the utility of a candidate training instance, the disagreement about its class value among the ensemble members is used. While the disagreement for binary classification is easily determined using margins, the adaption to multi-class problems is not straightforward and little studied in the literature. In this paper we consider four approaches to measure ensemble disagreement, including margins, uncertainty sampling and entropy, and evaluate them empirically on various ensemble strategies for active learning. We show that margins outperform the other disagreement measures on three of four active learning strategies. Our experiments also show that some active learning strategies are more sensitive to the choice of disagreement measure than others.

Download to read the full chapter text

Chapter PDF

Adaptive Active Learning with Ensemble of Learners and Multiclass Problems

Diversity Measures and Margin Criteria in Multi-class Majority Vote Ensemble

Active Learning Algorithm Using the Discrimination Function of the Base Classifiers

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Abe, N., Mamitsuka, H.: Query learning strategies using boosting and bagging. In: Proc. of ICML 1998, pp. 1–9. Morgan Kaufmann, San Francisco (1998)
Google Scholar
Muslea, I., Minton, S., Knoblock, C.A.: Selective sampling with redundant views. In: Proc. of AAAI 2000, pp. 621–626. AAAI Press / The MIT Press (2000)
Google Scholar
Melville, P., Mooney, R.: Diverse ensembles for active learning. In: Proc. of ICML 2004, pp. 584–591. ACM, New York (2004)
Google Scholar
Lewis, D.D., Catlett, J.: Heterogeneous uncertainty sampling for supervised learning. In: Proc. of ICML 1994, pp. 148–156. ACM Press, New York (1994)
Google Scholar
Lewis, D.D., Gale, W.A.: A sequential algorithm for training text classifiers. In: Proc. of SIGIR 1994, pp. 3–12. ACM / Springer (1994)
Google Scholar
Dagan, I., Engelson, S.: Committee-based sampling for training probabilistic classifiers. In: Proc. of ICML 1995, pp. 150–157. Morgan Kaufmann, San Francisco (1995)
Google Scholar
McCallum, A., Nigam, K.: Employing em and pool-based active learning for text classification. In: Proc. of ICML 1998, pp. 350–358. Morgan Kaufmann, San Francisco (1998)
Google Scholar
Melville, P., Yang, S.M., Saar-Tsechansky, M., Mooney, R.: Active learning for probability estimation using jensen-shannon divergence. In: Gama, J., Camacho, R., Brazdil, P.B., Jorge, A.M., Torgo, L. (eds.) ECML 2005. LNCS, vol. 3720, pp. 268–279. Springer, Heidelberg (2005)
Chapter Google Scholar
Seung, H.S., Opper, M., Sompolinsky, H.: Query by committee. In: Proc. of COLT 1992, pp. 287–294. ACM, New York (1992)
Chapter Google Scholar
Freund, Y., Seung, H.S., Shamir, E., Tishby, N.: Selective sampling using the query by committee algorithm. Machine Learning 28(2-3), 133–168 (1997)
Article MATH Google Scholar
Breiman, L.: Bagging predictors. Technical report 421, University of California, Berkeley (1994)
Google Scholar
Freund, Y., Schapire, R.: Experiments with a new boosting algorithm. In: Proc. of ICML 1996, pp. 148–156. Morgan Kaufmann, San Francisco (1996)
Google Scholar
Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proc. of COLT 1998, pp. 92–100. ACM, New York (1998)
Chapter Google Scholar
Nigam, K., Ghani, R.: Analyzing the effectiveness and applicability of co-training. In: Proc. of CIKM 2000, pp. 86–93. ACM, New York (2000)
Chapter Google Scholar
Muslea, I.: Active Learning with Multiple Views. PhD thesis, University of Southern California (2002)
Google Scholar
Melville, P., Mooney, R.: Constructing diverse classifier ensembles using artificial training examples. In: Proc. of IJCAI 2003, pp. 505–510. Morgan Kaufmann, San Francisco (2003)
Google Scholar
Blake, C.L., Merz, C.J.: Uci repository of machine learning databases, http://www.ics.uci.edu/~mlearn/MLRepository.html
Witten, I.H., Frank, E.: Data Mining - Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Fraunhofer Institut Intelligente Analyse- und Informationssysteme, Germany
Christine Körner & Stefan Wrobel
Dept. of Computer Science III, University of Bonn, Germany
Stefan Wrobel

Authors

Christine Körner
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Wrobel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Knowledge Engineering Group, Technische Universität Darmstadt,
Johannes Fürnkranz
Max Planck Institute for Computer Science, Saarbrücken, Germany
Tobias Scheffer
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Körner, C., Wrobel, S. (2006). Multi-class Ensemble-Based Active Learning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Machine Learning: ECML 2006. ECML 2006. Lecture Notes in Computer Science(), vol 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_68

Download citation

DOI: https://doi.org/10.1007/11871842_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Multi-class Ensemble-Based Active Learning

Abstract

Chapter PDF

Similar content being viewed by others

Adaptive Active Learning with Ensemble of Learners and Multiclass Problems

Diversity Measures and Margin Criteria in Multi-class Majority Vote Ensemble

Active Learning Algorithm Using the Discrimination Function of the Base Classifiers

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Multi-class Ensemble-Based Active Learning

Abstract

Chapter PDF

Similar content being viewed by others

Adaptive Active Learning with Ensemble of Learners and Multiclass Problems

Diversity Measures and Margin Criteria in Multi-class Majority Vote Ensemble

Active Learning Algorithm Using the Discrimination Function of the Base Classifiers

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation