Abstract
We describe a clustering approach that emphasizes detecting coherent structures in a complex dataset, and illustrate its effectiveness with computer vision applications. By complex data, we mean that the attribute variations among the data are so extensive that clustering based on a single feature representation (descriptor) cannot faithfully divide the data into meaningful groups. The proposed method therefore assumes that the data are represented with multiple feature representations, and aims to uncover the underlying cluster structure. To that end, we associate each cluster with a boosting classifier derived from multiple kernel learning, and use the cluster-specific classifier to select features across the various descriptors so as to best separate the data of the cluster from the rest. Specifically, we integrate the multiple, correlated training tasks of the cluster-specific classifiers into the clustering procedure, and cast them as a joint constrained optimization problem. Through the optimization iterations, the cluster structure is gradually revealed by these classifiers, while their discriminant power to capture similar data progressively improves owing to better data labeling.
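The alternating idea in the abstract (labels refine the cluster-specific feature selection, which in turn refines the labels) can be illustrated with a much simpler stand-in. The sketch below is not the boosting and multiple kernel learning formulation of the paper: it swaps the cluster-specific classifiers for per-cluster descriptor weights computed from a between/within distance ratio, and the joint constrained optimization for a k-means-style reassignment. The function name cluster_with_group_weights, its parameters, and the scoring rule are all assumptions made for illustration.

```python
import numpy as np


def cluster_with_group_weights(views, k, n_iter=20, init=None, seed=0):
    """Toy alternating scheme with cluster-dependent descriptor weights.

    views: list of (n, d_m) arrays, one per descriptor (hypothetical input format).
    Returns (labels, weights), where weights[c, m] is the weight of descriptor m
    for cluster c and each row of weights sums to 1.
    """
    rng = np.random.default_rng(seed)
    n, n_views = views[0].shape[0], len(views)
    if init is None:
        labels = rng.integers(0, k, size=n)
    else:
        labels = np.asarray(init).copy()
    weights = np.full((k, n_views), 1.0 / n_views)

    for _ in range(n_iter):
        # dist[m, i, c]: squared distance from point i to the centroid of
        # cluster c, measured in descriptor m.
        dist = np.empty((n_views, n, k))
        for m, X in enumerate(views):
            for c in range(k):
                members = X[labels == c]
                # An empty cluster is re-seeded from a randomly chosen point.
                center = members.mean(axis=0) if len(members) else X[rng.integers(n)]
                dist[m, :, c] = ((X - center) ** 2).sum(axis=1)

        # Assumed scoring rule: a descriptor is favored for cluster c when the
        # members lie close to the centroid and the non-members lie far from it.
        for c in range(k):
            inside = labels == c
            if inside.all() or not inside.any():
                continue  # keep the previous weights for a degenerate cluster
            d_c = dist[:, :, c]
            within = d_c[:, inside].mean(axis=1)
            between = d_c[:, ~inside].mean(axis=1)
            score = between / (within + 1e-12)
            weights[c] = score / score.sum()

        # Reassign each point to the cluster with the smallest weighted distance.
        cost = np.einsum("cm,mic->ic", weights, dist)
        new_labels = cost.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels

    return labels, weights
```

Calling cluster_with_group_weights([X_sift, X_shape], k=3), with X_sift and X_shape standing for two hypothetical descriptor matrices over the same n samples, returns one label per sample and one weight row per cluster. The per-cluster rows are what make the selection group-dependent: two clusters may rely on different descriptors.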
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lin, YY., Liu, TL., Fuh, CS. (2010). Clustering Complex Data with Group-Dependent Feature Selection. In: Daniilidis, K., Maragos, P., Paragios, N. (eds) Computer Vision – ECCV 2010. ECCV 2010. Lecture Notes in Computer Science, vol 6316. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15567-3_7
DOI: https://doi.org/10.1007/978-3-642-15567-3_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15566-6
Online ISBN: 978-3-642-15567-3
eBook Packages: Computer Science (R0)