Font Distribution Observation by Network-Based Analysis

  • Conference paper
Camera-Based Document Analysis and Recognition (CBDAR 2013)

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 8357)

Abstract

Off-the-shelf Optical Character Recognition (OCR) engines perform poorly on decorative characters, which often appear in natural scenes such as signboards. A reasonable step towards camera-based OCR is to collect a large-scale font set and analyze the distribution of font samples, with the aim of building a character recognition engine that tolerates font shape variations. This paper addresses font distribution analysis using a network. A Minimum Spanning Tree (MST) is used to construct the font network with respect to the Chamfer distance. After clustering, a centrality criterion (closeness, eccentricity, or betweenness centrality) is used to extract typical font samples. The network structure allows us to observe the font shape transition between any two samples, which is useful for creating new fonts and recognizing unseen decorative characters. Moreover, unlike Principal Component Analysis (PCA), the font network visualizes the distribution by measuring the dissimilarity between samples rather than by lossy dimensionality reduction. Compared with the K-means algorithm, network-based clustering can preserve small font clusters, which generally consist of samples with unusual appearances. Experiments demonstrate that the proposed network-based analysis is an effective way to grasp the font distribution and thus provides helpful information for decorative character recognition.
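The pipeline outlined in the abstract (pairwise Chamfer distances, an MST over the samples, then a centrality score to pick a typical sample) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: glyphs are represented as small lists of edge-pixel coordinates, the Chamfer distance is taken as the average of the two directed mean nearest-point distances, the MST is built with Prim's algorithm, and the clustering step is omitted. The function names and the toy data are hypothetical.

```python
import math
from collections import defaultdict


def chamfer(a, b):
    """Symmetric Chamfer distance between two point sets (assumed form:
    average of the two directed mean nearest-point distances)."""
    def one_way(src, dst):
        return sum(min(math.dist(p, q) for q in dst) for p in src) / len(src)
    return 0.5 * (one_way(a, b) + one_way(b, a))


def minimum_spanning_tree(dist):
    """Prim's algorithm on a dense symmetric distance matrix.
    Returns a list of tree edges (i, j, weight)."""
    n = len(dist)
    # best[j] = (cheapest known weight linking j to the tree, tree endpoint)
    best = {j: (dist[0][j], 0) for j in range(1, n)}
    edges = []
    while best:
        j = min(best, key=lambda k: best[k][0])
        w, i = best.pop(j)
        edges.append((i, j, w))
        for k in best:
            if dist[j][k] < best[k][0]:
                best[k] = (dist[j][k], j)
    return edges


def closeness(edges, n):
    """Closeness centrality on the tree: (n - 1) / sum of weighted path lengths."""
    adj = defaultdict(list)
    for i, j, w in edges:
        adj[i].append((j, w))
        adj[j].append((i, w))
    scores = []
    for s in range(n):
        total, stack, seen = 0.0, [(s, 0.0)], {s}
        while stack:  # a tree has a unique path to every node
            u, d = stack.pop()
            total += d
            for v, w in adj[u]:
                if v not in seen:
                    seen.add(v)
                    stack.append((v, d + w))
        scores.append((n - 1) / total if total > 0 else 0.0)
    return scores


# Toy "glyphs": one vertical stroke placed at different x offsets (hypothetical data).
offsets = [0, 1, 2, 4, 7]
glyphs = [[(x, y) for y in range(5)] for x in offsets]

dist = [[chamfer(a, b) for b in glyphs] for a in glyphs]
edges = minimum_spanning_tree(dist)
scores = closeness(edges, len(glyphs))
typical = max(range(len(glyphs)), key=lambda i: scores[i])
print("MST edges:", edges)
print("most central sample index:", typical)
```

Walking along the tree path between two samples gives the kind of shape transition the abstract mentions; eccentricity or betweenness centrality could be substituted for closeness in the same structure.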

Acknowledgment

The authors gratefully acknowledge the support of Creation of Human-Harmonized Information Technology for Convivial Society, a CREST project organized by the Japan Science and Technology Agency (JST).

Author information

Corresponding author

Correspondence to Rong Huang.

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Nakamoto, C., Huang, R., Koizumi, S., Ishida, R., Feng, Y., Uchida, S. (2014). Font Distribution Observation by Network-Based Analysis. In: Iwamura, M., Shafait, F. (eds) Camera-Based Document Analysis and Recognition. CBDAR 2013. Lecture Notes in Computer Science, vol 8357. Springer, Cham. https://doi.org/10.1007/978-3-319-05167-3_7

  • DOI: https://doi.org/10.1007/978-3-319-05167-3_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05166-6

  • Online ISBN: 978-3-319-05167-3

  • eBook Packages: Computer Science, Computer Science (R0)
