Abstract
The selection of informative samples known as query selection is the most challenging task in active learning. In this article, a batch-mode active learning technique is presented by defining a novel query function. The proposed technique first divides the unlabeled samples into uniform partitions in one-dimensional feature space according to their distribution in the original feature space. Then to select the most informative samples from the unlabeled pool, one sample from each partition is selected based on an uncertainty criterion defined by exploiting SVM classifier. The number of unlabeled samples selected at each iteration of active learning is determined automatically and depends on the number of non-empty partitions generated. The effectiveness of the proposed technique is measured by comparing it with four state-of-the-art techniques exist in the literature by using four different UCI repository data sets. The experimental analysis proved that the proposed technique is robust and computationally less demanding.
Similar content being viewed by others
References
Brezhneva O, Tret’yakov AA (2011) An elementary proof of the Karush–Kuhn–Tucker theorem in normed linear spaces for problems with a finite number of inequality constraints. Optimization 60(5):613–618
Brinker K (2003) Incorporating diversity in active learning with support vector machines. ICML 3:59–66
Burges CJ (1998) A tutorial on support vector machines for pattern recognition. Data Mining Knowl Discov 2(2):121–167
Campbell C, Cristianini N, Smola A, et al (2000) Query learning with large margin classifiers. In: ICML, pp 111–118
Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2(3):27
Cheng S, Shih FY (2007) An improved incremental training algorithm for support vector machines using active query. Pattern Recogn 40(3):964–971
Cohn DA, Ghahramani Z, Jordan MI (1996) Active learning with statistical models. J Artif Intell Res 4(1):129–145
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines. Cambridge University Press, Cambridge
Demir B, Persello C, Bruzzone L (2011) Batch-mode active-learning methods for the interactive classification of remote sensing images. IEEE Tran Geosci Remote Sens 49(3):1014–1031
Freund Y, Seung HS, Shamir E, Tishby N (1997) Selective sampling using the query by committee algorithm. Mach Learn 28(2–3):133–168
Fukumizu K (2000) Statistical active learning in multilayer perceptrons. IEEE Trans Neural Netw 11(1):17–26
Huang JS, Jin R, Zhou Z-H (2010) Active learning by querying informative and representative examples. In: Lafferty JD, Williams CKI, Shawe-Taylor J, Zemel RS, Culotta A (eds) Advances in neural information processing systems, vol 23. Curran Associates, Inc., Red Hook, pp 892–900. http://papers.nips.cc/paper/4176-active-learning-by-querying-informative-and-representative-examples.pdf
Huo LZ, Tang P (2014) A batch-mode active learning algorithm using region-partitioning diversity for SVM classifier. IEEE J Sel Top Appl Earth Obs Remote Sensing 7(4):1036–1046
Jiawei H, Kamber M (2001) Data mining: concepts and techniques. Morgan Kaufmann Ltd, San Francisco, p 5
Li M, Sethi IK (2006) Confidence-based active learning. IEEE Trans Pattern Anal Mach Intell 28(8):1251–1261
Mamitsuka NAH (1998) Query learning strategies using boosting and bagging. In: Proceedings of the fifteenth international conference on machine learning (ICML’98), vol 1, Morgan Kaufmann Pub., San Francisco
Mitra P, Shankar BU, Pal SK (2004) Segmentation of multispectral remote sensing images using active support vector machines. Pattern Recogn Lett 25(9):1067–1074
Patra S, Bruzzone L (2011) A fast cluster-assumption based active-learning technique for classification of remote sensing images. IEEE Trans Geosci Remote Sens 49(5):1617–1626
Patra S, Bruzzone L (2012) A cluster-assumption based batch mode active learning technique. Pattern Recogn Lett 33(9):1042–1048
Patra S, Bruzzone L (2014) A novel SOM-SVM based active learning technique for remote sensing image classification. IEEE Trans Geosci Remote Sens 52(11):6899–6910
Rajan S, Ghosh J, Crawford MM (2008) An active learning approach to hyperspectral data classification. IEEE Trans Geosci Remote Sens 46(4):1231–1242
Ren J (2012) ANN vs. SVM: which one performs better in classification of MCCS in mammogram imaging. Knowl Based Syst 26:144–153
Schohn G, Cohn D (2000) Less is more: active learning with support vector machines. In: ICML, Citeseer, pp 839–846
Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on computational learning theory ACM, New York, pp 287–294
Shigeo A (2005) Support vector machines for pattern classification. Advances in pattern recognition. Springer, Heidelberg
Tong S, Koller D (2001) Support vector machine active learning with applications to text classification. J Mach Learn Res 2:45–66
Tuia D, Ratle F, Pacifici F, Kanevski MF, Emery WJ (2009) Active learning methods for remote sensing image classification. IEEE Trans Geosci Remote Sens 47(7):2218–2232
Vapnik V (2001) The nature of statistical learning theory, 2nd edn. Springer, New York
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interests regarding the publication of this paper.
Informed consent
Consent to submit has been received explicitly from all co-authors, as well as from the responsible authorities tacitly or explicitly at the institute/organization where the work has been carried out before the work is submitted. Research involving human participants and/or animals Our research does not include human participants or animals.
Additional information
Communicated by V. Loia.
Rights and permissions
About this article
Cite this article
Singla, A., Patra, S. A fast partition-based batch-mode active learning technique using SVM classifier. Soft Comput 22, 4627–4637 (2018). https://doi.org/10.1007/s00500-017-2645-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-017-2645-0