Structure-Based Web Access Method for Ancient Chinese Characters

  • Xiaoqing Lu
  • Yingmin Tang
  • Zhi Tang
  • Yujun Gao
  • Jianguo Zhang
Part of the Communications in Computer and Information Science book series (CCIS, volume 400)


How to preserve and make use of ancient Chinese characters is not only a mission for contemporary scientists but also a technical challenge. This paper proposes a feasible solution for collecting, managing, and accessing such characters on the Internet. Its advantage lies in a unified representation for encoded and unencoded characters, which provides a visual, convenient, and efficient retrieval method that does not require new users to have any prior knowledge of ancient Chinese characters. We also design a system suitable for describing the relationships between ancient Chinese characters and contemporary ones. As a result of the implementation, a website has been established for public access to ancient Chinese characters.


Keywords: Ancient Characters, Digital Heritage, Web Access





Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Xiaoqing Lu (1)
  • Yingmin Tang (1)
  • Zhi Tang (1)
  • Yujun Gao (2, 3)
  • Jianguo Zhang (2, 4)

  1. Institute of Computer Science and Technology, Peking University, Beijing, China
  2. Beijing Founder Electronics Co., Ltd., Beijing, China
  3. Center for Chinese Font Design and Research, Beijing, China
  4. State Key Laboratory of Digital Publishing Technology, Peking University Founder Group Co., Ltd., Beijing, China
