Abstract
In this paper, we employ a zero-order local deformation model to model the visual variability of video streams of American sign language (ASL) words. We discuss two possible ways of combining the model with the tangent distance used to compensate for affine global transformations. The integration of the deformation model into our recognition system improves the error rate on a database of ASL words from 22.2% to 17.2%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Simard, P., Le Cun, Y., Denker, J.: Efficient Pattern Recognition Using a New Transformation Distance. In: Advances in NIPS, pp. 50–58. Morgan Kaufmann, San Francisco (1993)
Keysers, D., Macherey, W., Ney, H., Dahmen, J.: Adaptation in Statistical Pattern Recognition Using Tangent Vectors. TPAMI 26(2), 269–274 (2004)
Keysers, D., Gollan, C., Ney, H.: Local Context in Non-linear Deformation Models for Handwritten Character Recognition. In: ICPR, Cambridge, UK, August 2004, vol. 4, pp. 511–514 (2004)
Dreuw, P., Keysers, D., Deselaers, T., Ney, H.: Gesture Recognition Using Image Comparison Methods. In: Gesture in Human-Computer Interaction and Simulation, Vannes, France (May 2005) (in press)
Zahedi, M., Keysers, D., Ney, H.: Pronunciation Clustering and Modeling of Variability for Appearance-based Sign Language Recognition. In: Gesture in Human-Computer Interaction and Simulation, Vannes, France (May 2005) (in press)
Zahedi, M., Keysers, D., Ney, H.: Appearance-based Recognition of Words in American Sign Language. In: Iberian Conf. on Pattern Recognition and Image Analysis, Estoril, Portugal (June 2005) (in press)
Rabiner, L.R.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proc. of the IEEE 77(2), 267–296 (1989)
Bowden, R., Windridge, D., Kadir, T., Zisserman, A., Brady, M.: A Linguistic Feature Vector for the Visual Interpretation of Sign Language. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 390–401. Springer, Heidelberg (2004)
Bauer, B., Hienz, H., Kraiss, K.F.: Video-Based Continuous Sign Language Recognition Using Statistical Methods. In: ICPR, Barcelona, Spain, September 2000, pp. 463–466 (2000)
Starner, T., Weaver, J., Pentland, A.: Real-Time American Sign Language Recognition Using Desk and Wearable Computer Based Video. TPAMI 20(12), 1371–1375 (1998)
Vogler, C., Metaxas, D.: Adapting Hidden Markov Models for ASL Recognition by Using Three-dimensional Computer Vision Methods. In: Proc. Int. Conf. on Systems, Man and Cybernetics, Orlando, FL, October 1997, pp. 156–161 (1997)
Keysers, D., Dahmen, J., Ney, H., Wein, B., Lehmann, T.M.: Statistical Framework for Model-based Image Retrieval in Medical Applications. Journal of Electronic Imaging 12(1), 59–68 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zahedi, M., Keysers, D., Deselaers, T., Ney, H. (2005). Combination of Tangent Distance and an Image Distortion Model for Appearance-Based Sign Language Recognition. In: Kropatsch, W.G., Sablatnig, R., Hanbury, A. (eds) Pattern Recognition. DAGM 2005. Lecture Notes in Computer Science, vol 3663. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11550518_50
Download citation
DOI: https://doi.org/10.1007/11550518_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28703-2
Online ISBN: 978-3-540-31942-9
eBook Packages: Computer ScienceComputer Science (R0)