Exploiting Unlabeled Data in Content-Based Image Retrieval

Zhou, Zhi-Hua; Chen, Ke-Jia; Jiang, Yuan

doi:10.1007/978-3-540-30115-8_48

Zhi-Hua Zhou²²,
Ke-Jia Chen²² &
Yuan Jiang²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3201))

Included in the following conference series:

European Conference on Machine Learning

4117 Accesses
35 Citations

Abstract

In this paper, the Ssair (Semi-Supervised Active Image Retrieval) approach, which attempts to exploit unlabeled data to improve the performance of content-based image retrieval (Cbir), is proposed. This approach combines the merits of semi-supervised learning and active learning. In detail, in each round of relevance feedback, two simple learners are trained from the labeled data, i.e. images from user query and user feedback. Each learner then classifies the unlabeled images in the database and passes the most relevant/irrelevant images to the other learner. After re-training with the additional labeled data, the learners classify the images in the database again and then their classifications are merged. Images judged to be relevant with high confidence are returned as the retrieval result, while these judged with low confidence are put into the pool which is used in the next round of relevance feedback. Experiments show that semi-supervised learning and active learning mechanisms are both beneficial to Cbir.

Download to read the full chapter text

Chapter PDF

A Semi-Supervised Active Learning FSVM for Content Based Image Retrieval

Consistency-Based Semi-supervised Active Learning: Towards Minimizing Labeling Cost

A novel relevance feedback method for CBIR

Article 05 February 2018

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Abe, N., Mamitsuka, H.: Query learning strategies using boosting and bagging. In: Proceedings of the 15th International Conference on Machine Learning, Madison, WI, pp. 1–9 (1998)
Google Scholar
Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the 11th Annual Conference on Computational Learning Theory, Madison, WI, pp. 92–100 (1998)
Google Scholar
Bookstein, A.: Information retrieval: a sequential learning process. Journal of the American Society for Information Science 34, 331–342 (1983)
Article Google Scholar
Ciocca, G., Schettini, R.: A relevance feedback mechanism for content-based image retrieval. Information Processing and Management 35, 605–632 (1999)
Article Google Scholar
Goldman, S., Zhou, Y.: Enhancing supervised learning with unlabeled data. In: Proceedings of the 17th International Conference on Machine Learning, San Francisco, CA, pp. 327–334 (2000)
Google Scholar
Jaakkola, T., Haussler, D.: Exploiting generative models in discriminative classifiers. In: Kearns, M.S., Solla, S.A., Cohn, D.A. (eds.) Advances in Neural Information Processing Systems, vol. 11, pp. 487–493. MIT Press, Cambridge (1999)
Google Scholar
Joachims, T.: Transductive inference for text classification using support vector machines. In: Proceedings of the 16th International Conference on Machine Learning, Bled, Slovenia, pp. 200–209 (1999)
Google Scholar
Lewis, D.: Representation and learning in information retrieval. PhD thesis, Department of Computer Science, University of Massachusetts, Amherst, MA (1992)
Google Scholar
Lewis, D., Gale, W.: A sequential algorithm for training text classifiers. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, pp. 3–12 (1994)
Google Scholar
Mehtre, B.M., Kankanhalli, M.S., Narasimhalu, A.D., Man, G.C.: Color matching for image retrieval. Pattern Recognition Letters 16, 325–331 (1995)
Article Google Scholar
Müller, H., Müller, W., Squire, D.M., Marchand-Maillet, S., Pun, T.: Performance evaluation in content-based image retrieval: overview and proposals. Pattern Recognition Letters 22, 593–601 (2001)
Article MATH Google Scholar
Muslea, I., Minton, S., Knoblock, C.A.: Selective sampling with redundant views. In: Proceedings of the 17th National Conference on Artificial Intelligence, Austin, TX, pp. 621–626 (2000)
Google Scholar
Nigam, K., McCallum, A., Thrun, S., Mitchell, T.: Text classification from labeled and unlabeled documents using EM. Machine Learning 39, 103–134 (2000)
Article MATH Google Scholar
Rui, Y., Huang, T.S., Ortega, M., Mehrotra, S.: Relevance feedback: a power tool for interactive content-based image retrieval. IEEE Transactions on Circuits and Systems for Video Technology 8, 644–655 (1998)
Article Google Scholar
Seung, H., Opper, M., Sompolinsky, H.: Query by committee. In: Proceedings of the 5th ACM Workshop on Computational Learning Theory, Pittsburgh, PA, pp. 287–294 (1992)
Google Scholar
Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 1349–1380 (2000)
Article Google Scholar
Tong, S., Chang, E.: Support vector machine active learning for image retrieval. In: Proceedings of the 9th ACM International Conference on Multimedia, Ottawa, Canada, pp. 107–118 (2001)
Google Scholar
Wu, Y., Tian, Q., Huang, T.S.: Discriminant-EM algorithm with application to image retrieval. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, Hilton Head, SC, pp. 222–227 (2000)
Google Scholar
Zhang, C., Chen, T.: An active learning framework for content-based information retrieval. IEEE Transactions on Multimedia 4, 260–268 (2002)
Article Google Scholar

Download references

Author information

Authors and Affiliations

National Laboratory for Novel Software Technology, Nanjing University, Nanjing, 210093, China
Zhi-Hua Zhou, Ke-Jia Chen & Yuan Jiang

Authors

Zhi-Hua Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Ke-Jia Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Jiang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

INSA-Lyon, LIRIS CNRS UMR5205, F-69621, Villeurbanne, France
Jean-François Boulicaut
Dipartimento di Informatica, Università degli Studi di Bari,
Floriana Esposito
Pisa KDD Laboratory, ISTI - CNR, Area della Ricerca di Pisa, Via Giuseppe Moruzzi 1, Pisa, Italy
Fosca Giannotti
Dipartimento di Informatica, Via F. Buonarroti 2, 56127, Pisa, Italy
Dino Pedreschi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhou, ZH., Chen, KJ., Jiang, Y. (2004). Exploiting Unlabeled Data in Content-Based Image Retrieval. In: Boulicaut, JF., Esposito, F., Giannotti, F., Pedreschi, D. (eds) Machine Learning: ECML 2004. ECML 2004. Lecture Notes in Computer Science(), vol 3201. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30115-8_48

Download citation

DOI: https://doi.org/10.1007/978-3-540-30115-8_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23105-9
Online ISBN: 978-3-540-30115-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Exploiting Unlabeled Data in Content-Based Image Retrieval

Abstract

Chapter PDF

Similar content being viewed by others

A Semi-Supervised Active Learning FSVM for Content Based Image Retrieval

Consistency-Based Semi-supervised Active Learning: Towards Minimizing Labeling Cost

A novel relevance feedback method for CBIR

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Exploiting Unlabeled Data in Content-Based Image Retrieval

Abstract

Chapter PDF

Similar content being viewed by others

A Semi-Supervised Active Learning FSVM for Content Based Image Retrieval

Consistency-Based Semi-supervised Active Learning: Towards Minimizing Labeling Cost

A novel relevance feedback method for CBIR

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation