Virtual Example Synthesis Based on PCA for Off-Line Handwritten Character Recognition
This paper proposes a method for improving off-line character classifiers learned from examples by adding virtual examples synthesized from an on-line character database. Good classifiers usually require a large database that covers enough variation in handwritten characters, but in practice collecting that much data is time-consuming and costly. We therefore train SVMs for off-line character recognition on training sets that are artificially augmented with examples derived from on-line characters.
In our method, virtual examples are synthesized from on-line characters in the following two steps: (1) applying an affine transformation to each stroke of “real” characters, and (2) applying an affine transformation to each stroke of artificial characters, which are themselves synthesized on the basis of PCA. SVM classifiers are then trained on samples containing both the artificially generated patterns and the real characters. We examine the effectiveness of the proposed method in terms of recognition rate and the number of support vectors of the SVM through experiments on handwritten Japanese Hiragana character classification.
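The two synthesis steps can be illustrated with a minimal sketch. It is not the implementation used in the paper; it rests on several assumptions that the abstract does not state: every on-line character of a class has the same number of strokes, each stroke is resampled to the same number of (x, y) points so that a character flattens to a fixed-length vector, PCA coefficients are drawn from a Gaussian whose spread follows each component's standard deviation, and the affine perturbation is a small random rotation, scaling, and shift about each stroke's centroid. All function names and parameter ranges here are hypothetical.

```python
import numpy as np

def fit_pca(X, n_components):
    """PCA via SVD. X has one flattened character per row."""
    mean = X.mean(axis=0)
    _, S, Vt = np.linalg.svd(X - mean, full_matrices=False)
    # Standard deviation of the data along each principal axis.
    std = S[:n_components] / np.sqrt(max(len(X) - 1, 1))
    return mean, Vt[:n_components], std

def synthesize(mean, components, std, rng, scale=1.0):
    """Draw random PCA coefficients and map them back to point space."""
    coeffs = rng.normal(0.0, scale * std)
    return mean + coeffs @ components

def to_strokes(vec, n_strokes):
    """Split a flattened character into a list of (points, 2) stroke arrays."""
    return list(vec.reshape(n_strokes, -1, 2))

def random_affine(rng, max_rot=0.1, max_scale=0.1, max_shift=0.02):
    """Small random rotation (radians), anisotropic scaling, and shift."""
    theta = rng.uniform(-max_rot, max_rot)
    sx, sy = 1.0 + rng.uniform(-max_scale, max_scale, size=2)
    rot = np.array([[np.cos(theta), -np.sin(theta)],
                    [np.sin(theta),  np.cos(theta)]])
    A = rot @ np.diag([sx, sy])
    t = rng.uniform(-max_shift, max_shift, size=2)
    return A, t

def perturb_strokes(strokes, rng):
    """Apply an independent affine transformation to each stroke."""
    out = []
    for s in strokes:
        A, t = random_affine(rng)
        c = s.mean(axis=0)  # transform about the stroke centroid
        out.append((s - c) @ A.T + c + t)
    return out

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy stand-in for one class: 20 characters, 2 strokes, 8 points each.
    real = rng.normal(size=(20, 2 * 8 * 2))
    mean, comps, std = fit_pca(real, n_components=5)
    # Step (1): perturb the strokes of a real character.
    virtual_from_real = perturb_strokes(to_strokes(real[0], 2), rng)
    # Step (2): synthesize an artificial character by PCA, then perturb it.
    artificial = synthesize(mean, comps, std, rng)
    virtual_from_pca = perturb_strokes(to_strokes(artificial, 2), rng)
    print(len(virtual_from_real), virtual_from_pca[0].shape)
```

In a full pipeline, the perturbed strokes would still have to be rendered as off-line images and converted to feature vectors before being added to the SVM training set; that rendering step is outside this sketch.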