Post-processing of Handwritten Pitman’s Shorthand Using Unigram and Heuristic Approaches
The computer transcription of handwritten Pitman’s shorthand has enormous potential as a means of rapid text entry to today’s handheld devices. Recognition errors caused in pattern segmentation and classification raises the incidence of ambiguous interpretation in existing systems and the paper proposes a well-established unigram technique and an efficient heuristic method to reduce ambiguity in a linguistic post processor. Heuristics applied in our transcription system are: – firstly, incorporating visual stimulus as used by human readers; secondly, applying knowledge of the most common words of Pitman shorthand; and finally, adding knowledge of collocation. An experiment using a phonetic Lexicon of 5000 entries shows the distribution of ambiguity in a shorthand lexicon due to the similarity of outlines’ and estimates the transcription accuracy of 94%.
KeywordsWord Frequency Anchor Node Human Reader Ambiguous Interpretation Statistical Language Modeling
- 2.Qiao, Y., Leedham, C.G.: Segmentation of handwritten Pitman shorthand outlines using an interactive heuristic search. In: Proc. 5th International Graphonomics Conference (ISBN 0- 9630246-0-4), pp. 157-162, Tempe, Arizona, USA, pp. 27-30 (October 1991)Google Scholar
- 4.Nagabhushan, P., Anami, B.: Dictionary Supported Generation of English Text from Pitman Shorthand Scripted Phonetic Text. In: Language engineering conference, Hyderabad, India, December 13-15, p. 33 (2002)Google Scholar