Abstract
Classification of spectral data has attracted growing interest in many research areas. However, this type of data usually suffers from the curse of dimensionality, which causes most statistical methods and classifiers to perform poorly. A recently proposed alternative that can help avoid this problem is the Dissimilarity Representation, in which objects are represented by their dissimilarities to representative objects of each class. This approach, however, depends on the selection of a suitable dissimilarity measure. For spectra, incorporating information on their shape can be significant for good discrimination. In this paper, we study the benefit of using a measure that takes the shape of spectra into account. We show that the shape-based measure not only leads to better classification results, but also that a certain number of objects is enough to achieve them. The experiments are conducted on three one-dimensional data sets and one two-dimensional data set.
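The idea of representing each spectrum by its dissimilarities to a set of representative objects can be illustrated with a minimal sketch. The abstract does not specify the shape-based measure used in the paper, so the measure below (Euclidean distance between first differences of the spectra) is only an assumed stand-in, and the function names `shape_dissimilarity` and `dissimilarity_representation` are hypothetical.

```python
import numpy as np


def shape_dissimilarity(a, b):
    """Assumed shape-aware measure: Euclidean distance between the
    first differences of two spectra sampled on the same grid.
    This is an illustrative choice, not the measure from the paper."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.linalg.norm(np.diff(a) - np.diff(b)))


def dissimilarity_representation(spectra, prototypes, measure):
    """Build the matrix D with D[i, j] = measure(spectra[i], prototypes[j]).
    Each row is the new feature vector of one spectrum."""
    return np.array([[measure(s, p) for p in prototypes] for s in spectra])


if __name__ == "__main__":
    # Synthetic spectra (hypothetical data, for illustration only).
    x = np.linspace(0.0, 1.0, 50)
    class_a = np.sin(2 * np.pi * x)
    class_b = np.cos(2 * np.pi * x)
    spectra = np.stack([class_a, class_a + 1.0, class_b])
    prototypes = np.stack([class_a, class_b])  # one representative per class
    D = dissimilarity_representation(spectra, prototypes, shape_dissimilarity)
    print(D.shape)
```

Because the assumed measure compares differences between neighbouring samples, a constant baseline offset between two spectra does not change their dissimilarity. Any classifier that accepts feature vectors could then be trained on the rows of `D`, whose dimensionality equals the number of prototypes rather than the number of spectral channels.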
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Porro-Muñoz, D., Duin, R.P.W., Talavera, I., Orozco-Alzate, M. (2011). A Study on the Influence of Shape in Classifying Small Spectral Data Sets. In: Pelillo, M., Hancock, E.R. (eds) Similarity-Based Pattern Recognition. SIMBAD 2011. Lecture Notes in Computer Science, vol 7005. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24471-1_22
DOI: https://doi.org/10.1007/978-3-642-24471-1_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24470-4
Online ISBN: 978-3-642-24471-1
eBook Packages: Computer Science (R0)