Abstract
This paper proposes an efficient probabilistic indexing scheme called P robabilistic R etrieval-Tree(PR-Tree) to facilitate a efficient probabilistic retrieval of Chinese calligraphic manuscript images based on triple features such as contour points, character styles and number of strokes. To the best of our understanding, this is first work on probabilistic retrieval over Chinese character. Different from conventional character retrieval and indexing methods [18] which only adopts shape similarity as a query metric, our proposed indexing algorithm allows user to choose the above three kinds of features as query elements. Moreover, a probabilistic model is introduced into the character retrieval process. Comprehensive experiments are conducted to testify the effectiveness and efficiency of our proposed retrieval and indexing methods respectively.
This paper is partially supported by the Program of National Natural Science Foundation of China under Grant No. 60003047,No.60873022, No.60903053; The Program of Natural Science Foundation of Zhejiang Province under Grant No. Z1100822,No.Y1080148, No.Y1090165; The Science Fund for Young Scholars of Zhejiang Gongshang University under Grant No. G09-7.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhang, X.-Z.: Chinese Character Recognition Techniques. Tsinghua University Press, Beijing (1992)
Wu, Y.-S., Ding, X.-Q.: Chinese character recognition: the principles and the implementations. High Education Press, Beijing (1992)
Rath, T.M., Manmatha, R., Lavrenko, V.: A search engine for historical manuscript images. In: Proceedings SIGIR Conference, pp. 369–376 (2004)
Yosef, I.B., Kedem, K., Dinstein, I., Beit-Arie, M., Engel, E.: Classification of Hebrew Calligraphic Handwriting Styles: Preliminary Results. In: Proceedings of Conference, DIAL 2004, pp. 299–305 (2004)
Palmondon, R., Srihari, S.N.: On-Line and Off-Line hand-writing Recognition: A Comprehensive Survey. IEEE Trans. on Pattern Analysis and Machine Intelligence 22(1), 63–84 (2000)
Shi, B-l., Zhang, L., Wang, Y., Chen, Z-F.: Content Based Chinese Script Retrieval Through Visual Similarity Criteria. Chinese Journal of Software 12(9), 1336–1342 (2001)
Chui, H.-l., Anand, R.: A new point matching algorithm for non-rigid registration. Computer Vision and Image Understanding 89(2-3), 114–141 (2003)
Belongie, S., Malik, J., Puzicha, J.: Shape Matching and Object Recognition Using Shape Contexts. IEEE Trans. on Pattern Analysis and Machine Intelligence 24(4), 509–522 (2002)
Cohen, S., Guibas, L.: The Earth Mover’s Distance under Transformation Sets. In: Proceedings ICCV Conference Corfu, Greece, pp. 173–187 (September 1999)
Böhm, C., Berchtold, S., Keim, D.: Searching in High-dimensional Spaces: Index Structures for Improving the Performance of Multimedia Databases. ACM Computing Surveys 33(3) (2001)
Guttman, A.: R-tree: A dynamic index structure for spatial searching. In: Proceedings SIGMOD Conference, pp. 47–54 (1984)
Berchtold, S., Keim, D.A., Kriegel, H.P.: The X-tree: An index structure for high-dimensional data. In: Proceedings LDB Conference, pp. 28–37 (1996)
Weber, R., Schek, H., Blott, S.: A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In: Proceedings of VLDB Conference, pp. 194–205 (1998)
Berchtold, S., Bohm, C., Kriegel, H.P., Sander, J., Jagadish, H.V.: Independent quantization: An index compression technique for high-dimensional data spaces. In: Proceedings of ICDE Conference, pp. 577–588 (2000)
Jagadish, H.V., Ooi, B.C., Tan, K.L., Yu, C., Zhang, R.: iDistance: An Adaptive B+-tree Based Indexing Method for Nearest Neighbor Search. ACM Trans. on Database Systems 30(2), 364–397 (2005)
Frey, B.J., Dueck, D.: Clustering by Passing Messages Between Data Points. Science 315, 972–976
The CADAL Project (2010), www.cadal.zju.edu.cn
Zhuang, Y., Zhuang, Y.-T., Li, Q., Chen, L.: Interactive Indexing for Large Chinese Calligraphic Character Databases. ACM Trans. on Asian Language Information Processing (TALIP) 6(2) (2007)
Leung, H., Wong, S.T.S., Horace, H.S.Ip.: Preserving archaic Chinese calligraphy and reproducing its dynamic brush writing. IEEE Signal Processing Magazine 25(4), 49–54 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhuang, Y. (2010). Web-Based Probabilistic Retrieval of Chinese Calligraphic Character Images: An Efficiency Study. In: Luo, X., Spaniol, M., Wang, L., Li, Q., Nejdl, W., Zhang, W. (eds) Advances in Web-Based Learning – ICWL 2010. ICWL 2010. Lecture Notes in Computer Science, vol 6483. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17407-0_36
Download citation
DOI: https://doi.org/10.1007/978-3-642-17407-0_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17406-3
Online ISBN: 978-3-642-17407-0
eBook Packages: Computer ScienceComputer Science (R0)