Fast and Accurate Handwritten Character Recognition Using Approximate Nearest Neighbours Search on Large Databases

Pérez-Cortes, Juan C.; Llobet, Rafael; Arlandis, Joaquim

doi:10.1007/3-540-44522-6_79

Juan C. Pérez-Cortes⁸,
Rafael Llobet⁸ &
Joaquim Arlandis⁸

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1876))

Included in the following conference series:

Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR)

984 Accesses
7 Citations

Abstract

In this work, fast approximate nearest neighbours search algorithms are shown to provide high accuracies, similar to those of exact nearest neighbour search, at a fraction of the computational cost in an OCR task. Recent studies [26,15] have shown the power of k-nearest neighbour classifiers (k-nn) using large databases, for character recognition. In those works, the error rate is found to decrease consistently as the size of the database increases. Unfortunately, a large database implies large search times if an exhaustive search algorithm is used. This is often cited as a major problem that limits the practical value of the k- nearest neighbours classification method. The error rates and search times presented in this paper prove, however, that k-nn can be a practical technique for a character recognition task.

Download to read the full chapter text

Chapter PDF

Design and Implementation of Handwritten Digit Recognition Based on K-Nearest Neighbor Algorithm

A Fast and Efficient K-Nearest Neighbor Classifier Using a Convex Envelope

A systematic review on handwritten document analysis and recognition

Article 02 June 2023

Sanasam Inunganbi

Keywords

References

S. Arya, D.M. Mount, N.S. Netanyahu, R. Silverman, and A. Wu. An optimal algorithm for approximate nearest neighbor searching. Journal of the ACM, 45:891–923, 1998.
Article MATH MathSciNet Google Scholar
N. Beckmann, H.-P. Kriegel, R. Schneider, and B. Seeger. The r*-tree: An efficient and robust access method for points and rectangles. In ACM SIGMOD Conf. on the Management of Data 90, Atlantic City., May 1990.
Google Scholar
J.S. Beis and D.G. Lowe. Indexing without invariants in 3d object recognition. IEEE Trans. on PAMI, 21(10):1000–1015, October 1999.
Google Scholar
J.L. Bentley, B.W. Weide, and A.C. Yao. Optimal expected time algorithms for closest point problems. ACM Trans. on Math. Software, 6:563–580, 1980.
Article MATH MathSciNet Google Scholar
S. Berchtold, D.A. Keim, and H.P. Kriegel. The x-tree: An index structure for high-dimensional data. In Proc. 22nd Very Large Database Conference, Bombay, India, pages 28–39, 1996.
Google Scholar
M. Bern. Approximate closest-point queries in high dimensions. Pattern Recognition, 45:95–99, 1993.
MATH MathSciNet Google Scholar
S. Brin. Near neighbor search in large metric spaces. In Proc. 21st Inter. Conf. on Very Large Data Bases, pages 574–584, 1995.
Google Scholar
P. A. Devijver and J. Kittler. On the edited nearest neighbour rule. pages 72–80. Proceedings of the 5th International Conference on Pattern Recognition, IEEE Computer Society Press, Los Alamitos, CA, 1980.
Google Scholar
L. Devroye, L. Györfi, and G. Lugosi. A Probabilistic Theory of Pattern Recognition. Springer-Verlag., 1996.
Google Scholar
J. H. Friedman, J. L. Bentley, and R. A. Finkel. An algorithm finding best matches in logarithmic expected time. ACM Trans. Math. Software, 3:209–226, 1977.
Article MATH Google Scholar
K. Fukunaga and P. M. Narendra. A branch and bound algorithm for computing k-nearest neighbors. 24:750–753, 1975.
MATH MathSciNet Google Scholar
S. Geva and J. Sitte. Adaptive nearest neighbor pattern classification. IEEE Trans on Neural Networks, 2(2):318–322, 1991.
Article Google Scholar
P.J. Grother and G.T. Candela. Comparison of handprinted digit classifiers. In NISTIR, 1993.
Google Scholar
A. Guttman. R-trees: A dynamic index structure for spatial searching. In UCB, Elec.Res.Lab, Res.R. No.M83-64, with Stonebraker, M., 1983.
Google Scholar
T.M. Ha and H. Bunke. Off-line, handwritten numeral recognition by perturbation method. IEEE Trans. on PAMI, 19(5):535–539, May 1997.
Google Scholar
P.E. Hart. The condensed nearest neighbor rule. IEEE Trans. on Information Theory, 125:515–516, 1968.
Article Google Scholar
R. Indyk and R Motwani. Approximate nearest neighbors: Towards removing the curse of dimensionality. In 30th Symposium on Theory of Computing, 1998.
Google Scholar
B. S. Kim and S. B. Park. A fast k nearest neighbor finding algorithm based on the ordered partition. IEEE Trans. on PAMI, 8:761–766, 1986.
MATH Google Scholar
T. Kohonen. Self Organization and Associative Memory. Springer-Verlag., 1988.
Google Scholar
E. Kushilevitz, R. Ostrovsky, and Y. Rabani. Efficient search for approximate nearest neighbor in high dimensional spaces. In 30th Symposium on Theory of Computing, 1998.
Google Scholar
G. Loizou and S. J. Maybank. The nearest neighbor and the Bayes error rates. IEEE Trans. on PAMI, 9:254–262, 1987.
MATH Google Scholar
L. Miclet and M. Dabouz. Approximative fast nearest neighbor recognition. Pat tern Recognition Letters, 1:277–285, 1983.
Article Google Scholar
J.C. Perez and E. Vidal. An approximate nearest neighbours search algorithm based on the extended general spacefilling curves heuristic. In Workshop on Statistical Pattern Recognition SPR-98, Sydney, Australia., 1998.
Google Scholar
J.C. Perez and E. Vidal. The extended general spacefilling curves heuristic. In Intl. Conf. on Pattern Recognition ICPR-98, Brisbane, Australia., 1998.
Google Scholar
H. Robinson. Database Analysis and Design., 1981.
Google Scholar
S.J. Smith, Sims K. Bourgoin, M.O., and H.L. Voorhees. Handwritten character classification using nearest neighbor in large databases. IEEE Trans. on PAMI, 16(9):915–919, September 1994.
Google Scholar
R.F. Sproull. Refinements to nearest-neighbor searching in k-dimensional trees. Algorithmica, 6:579–589, 1991.
Article MATH MathSciNet Google Scholar
C. Tomasi and R. Manduchi. Stereo matching as a nearest-neighbor problem. IEEE Trans. on PAMI, 20(3):333–340, March 1998.
Google Scholar
D.L. Wilson. Asymptotic properties of nearest neighbor rules using edited data. IEEE Trans. on Systems, Man and Cybernetics, 2:408–420, 1972.
Article MATH Google Scholar
P. Yianilos. Data structures and algorithms for nearest neighbor search in general metric spaces. In 4th ACM Symp. on Discrete Algorithms, pages 311–321, 1993.
Google Scholar
P. Zakarauskas and J.M. Ozard. Complexity analysis for partitioning nearestneighbor searching algorithms. IEEE Trans. on PAMI, 18(6):663–668, June 1996.
Google Scholar

Download references

Author information

Authors and Affiliations

Instituto Tecnológico de Informática, Universidad Politécnica de Valencia, Camino de Vera, s/n, 46071, Valencia, Spain
Juan C. Pérez-Cortes, Rafael Llobet & Joaquim Arlandis

Authors

Juan C. Pérez-Cortes
View author publications
You can also search for this author in PubMed Google Scholar
Rafael Llobet
View author publications
You can also search for this author in PubMed Google Scholar
Joaquim Arlandis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of València, 46100, Burjassot (València), Spain
Francesc J. Ferri
Department of Computer Languages and Systems, University of Alicante, 03071, Alicante, Spain
José M. Iñesta
School of Computer Science and Engineering, University of New South Wales, Sydney, NSW, 2052, Australia
Adnan Amin
Institute of Information Theory and Automation, Academy of Sciences of the Czech Republic, 182 08, Prague 8, Czech Republic
Pavel Pudil

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pérez-Cortes, J.C., Llobet, R., Arlandis, J. (2000). Fast and Accurate Handwritten Character Recognition Using Approximate Nearest Neighbours Search on Large Databases. In: Ferri, F.J., Iñesta, J.M., Amin, A., Pudil, P. (eds) Advances in Pattern Recognition. SSPR /SPR 2000. Lecture Notes in Computer Science, vol 1876. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44522-6_79

Download citation

DOI: https://doi.org/10.1007/3-540-44522-6_79
Published: 21 December 2000
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67946-2
Online ISBN: 978-3-540-44522-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Fast and Accurate Handwritten Character Recognition Using Approximate Nearest Neighbours Search on Large Databases

Abstract

Chapter PDF

Similar content being viewed by others

Design and Implementation of Handwritten Digit Recognition Based on K-Nearest Neighbor Algorithm

A Fast and Efficient K-Nearest Neighbor Classifier Using a Convex Envelope

A systematic review on handwritten document analysis and recognition

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Fast and Accurate Handwritten Character Recognition Using Approximate Nearest Neighbours Search on Large Databases

Abstract

Chapter PDF

Similar content being viewed by others

Design and Implementation of Handwritten Digit Recognition Based on K-Nearest Neighbor Algorithm

A Fast and Efficient K-Nearest Neighbor Classifier Using a Convex Envelope

A systematic review on handwritten document analysis and recognition

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation