Transformation Invariance in Pattern Recognition – Tangent Distance and Tangent Propagation

Simard, Patrice Y.; LeCun, Yann A.; Denker, John S.; Victorri, Bernard

doi:10.1007/978-3-642-35289-8_17

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7700)

Abstract

In pattern recognition, statistical modeling, or regression, the amount of data is a critical factor affecting the performance. If the amount of data and computational resources are unlimited, even trivial algorithms will converge to the optimal solution. However, in the practical case, given limited data and other resources, satisfactory performance requires sophisticated methods to regularize the problem by introducing a priori knowledge. Invariance of the output with respect to certain transformations of the input is a typical example of such a priori knowledge. In this chapter, we introduce the concept of tangent vectors, which compactly represent the essence of these transformation invariances, and two classes of algorithms, “tangent distance” and “tangent propagation”, which make use of these invariances to improve performance.
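
To make the idea of a tangent vector concrete, here is a minimal sketch of a one-sided tangent distance, assuming a single transformation (small horizontal translation) and a finite-difference estimate of its tangent vector. The function names, the Gaussian test pattern, and the least-squares formulation are illustrative assumptions, not taken from the chapter, and tangent propagation is not covered here.

```python
import numpy as np


def gaussian_blob(size=16, cx=7.5, cy=7.5, sigma=2.0):
    """Toy smooth pattern: a 2-D Gaussian bump (hypothetical test image)."""
    ys, xs = np.mgrid[0:size, 0:size]
    return np.exp(-((xs - cx) ** 2 + (ys - cy) ** 2) / (2.0 * sigma ** 2))


def translation_tangent(image):
    """Approximate tangent vector for a small horizontal translation.

    Shifting the image right by a small amount a gives
    I(x - a) ~ I(x) - a * dI/dx, so the tangent vector is -dI/dx,
    estimated here with finite differences along the column axis.
    """
    return -np.gradient(image, axis=1)


def one_sided_tangent_distance(x, y, tangents):
    """Return min over a of ||x + T a - y||_2.

    The columns of T are the (flattened) tangent vectors of x, so
    x + T a is a linear approximation of the set of transformed
    versions of x. The minimization is an ordinary least-squares fit.
    """
    T = np.stack([t.ravel() for t in tangents], axis=1)
    diff = (y - x).ravel()
    coeffs, *_ = np.linalg.lstsq(T, diff, rcond=None)
    return float(np.linalg.norm(diff - T @ coeffs))


if __name__ == "__main__":
    x = gaussian_blob(cx=7.5)   # reference pattern
    y = gaussian_blob(cx=8.0)   # same pattern, shifted right by half a pixel
    t = translation_tangent(x)
    print("Euclidean distance:", float(np.linalg.norm(y - x)))
    print("Tangent distance:  ", one_sided_tangent_distance(x, y, [t]))
```

Because setting all coefficients to zero recovers the plain Euclidean difference, this distance can never exceed the Euclidean distance; for a pattern that differs from the reference mainly by a small shift, it should be noticeably smaller.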

Previously published in: Orr, G.B. and Müller, K.-R. (Eds.): LNCS 1524, ISBN 978-3-540-65311-0 (1998).

Author information

Authors and Affiliations

Image Processing Services Research Lab, AT&T Labs - Research, 100 Schulz Drive, Red Bank, NJ, 07701-7033, USA
Patrice Y. Simard, Yann A. LeCun & John S. Denker
CNRS, ELSAP, ENS, 1 rue Maurice Arnoux, F-92120, Montrouge, France
Bernard Victorri

Editor information

Editors and Affiliations

Dept. of Computer Science, Technische Universität Berlin, Franklinstr. 28/29, 10587, Berlin, Germany
Grégoire Montavon & Klaus-Robert Müller
Dept. of Computer Science, Willamette University, 900 State Street, 97301, Salem, OR, USA
Geneviève B. Orr

About this chapter

Cite this chapter

Simard, P.Y., LeCun, Y.A., Denker, J.S., Victorri, B. (2012). Transformation Invariance in Pattern Recognition – Tangent Distance and Tangent Propagation. In: Montavon, G., Orr, G.B., Müller, KR. (eds) Neural Networks: Tricks of the Trade. Lecture Notes in Computer Science, vol 7700. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35289-8_17

DOI: https://doi.org/10.1007/978-3-642-35289-8_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35288-1
Online ISBN: 978-3-642-35289-8
eBook Packages: Computer Science, Computer Science (R0)
