Domain–Driven Automatic Spelling Correction for Mammography Reports

Mykowiecka, Agnieszka; Marciniak, Małgorzata

doi:10.1007/3-540-33521-8_56

Agnieszka Mykowiecka³ &
Małgorzata Marciniak³

Part of the book series: Advances in Soft Computing ((AINSC,volume 35))

599 Accesses
5 Citations

Abstract

The paper presents a program for automatic spelling correction of texts from a very specific domain, which has been applied to mammography reports. We describe different types of errors and present the program of correction based on the Levenshtein distance and probability of bigrams.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

1. Brill E. and R. C. Moore. An ImprovedModel for Noisy Channel Spelling Correction, In: Proceedings of the 38th Annual Meeting of the ACL,2000, pp. 286–293.
Google Scholar
2. Cucerzan, S.and E. Brill, Spelling correction as an iterative process that exploits the collective knowledge of web users in Proceedings of EMNLP 2004.
Google Scholar
3. Busemann S. and Krieger H.-U. Resources and Techniques for Multilingual Information Extraction. In: Proceedings of LREC 2004, Lisbon, Portugal, 2004, pp. 1923–1926.
Google Scholar
4. Crowell, J., Q. Zeng, L. Ngo, E. Lacroix, A Frequency-based Technique to Improve the Spelling Suggestion Rank in Medical Queries, In: Journal of he American Medical Informatics Association, vol 11, May/;Jun, 2004, pp 179–185.
Article Google Scholar
5. Daciuk, J., Incremental Construction of Finite-State Automata and Transducers, and their Use in the Natural Language Processing, Ph.D. dissertation, Technical University of Gdańsk, Poland, 1998.
Google Scholar
6. Drożdżyński W., Krieger H.-U., Piskorski J., Schäfer U., and Xu F. Shallow Processing with Uni.cation and Typed Feature Structures - Foundations and Applications. In: German AI Journal KI-Zeitschrift, 01/;04. Gesellschaft für Informatik e.V, 2004.
Google Scholar
7. Jurafsky D., J, Martin, Speech and Language Processing. Prentice Hall, 2000.
Google Scholar
8. Kukich K. Technique for automatically correcting words in text, In: ACM Comput. Surv., Vol. 24(4), 1992, pp. 377–439.
Article Google Scholar
9. Kupść A., Marciniak M., Mykowiecka A., Piskorski J., and Podsiadły-Marczykowska T., Information Extraction from Mammogram Reports. In: KONVENS 2004, Vienna, Austria, 2004, pp. 113–116.
Google Scholar
10. Levenshtein V. I., Binary codes capable of correcting deletions, insertions, and reversals, Doklady Akademii Nauk SSSR, 163(4):845–848, 1965 (Russian). English translation in Soviet Physics Doklady, 10(8):707–710, 1966.
MathSciNet Google Scholar
11. Marciniak M., Mykowiecka A., Kupść A., and Piskorski J. Intelligent Content Extraction from Polish Medical Reports. In: Intelligent Media Technology for Communicative Intelligence, Springer, Berlin, Heidelberg, 2005, pp. 68–78
Chapter Google Scholar
12. Mykowiecka A., Kupść A., Marciniak M., Rule-based Medical Content Extraction and Classi.cation. In: Intelligent Information Processing and Web Mining. Proceedings of the IIS'05 Conference, Gdańsk, Springer, 2005, pp. 237–245.
Google Scholar
13. Piskorski J., Homola P., Marciniak M., Mykowiecka A., Przepiórkowski A., and Woliński M. Information Extraction for Polish using the SProUT Platform. In: Intelligent Information Processing and Web Mining. Proceedings of the IIS'04 Conference, Zakopane, Springer, 2004, pp. 225–236.
Google Scholar
14. Woliński M., Morfeusz - a Practical Tool for the Morphological Analysis of Polish. In this volume.
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science, Polish Academy of Sciences, Ordona 21, 01-237, Warsaw, Poland
Agnieszka Mykowiecka & Małgorzata Marciniak

Authors

Agnieszka Mykowiecka
View author publications
You can also search for this author in PubMed Google Scholar
Małgorzata Marciniak
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Polish Academy of Sciences, Institute of Computer Science, ul. Ordona 21, 01-237, Warszawa, Poland
Mieczysław A. Kłopotek , Sławomir T. Wierzchoń & Krzysztof Trojanowski , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mykowiecka, A., Marciniak, M. (2006). Domain–Driven Automatic Spelling Correction for Mammography Reports. In: Kłopotek, M.A., Wierzchoń, S.T., Trojanowski, K. (eds) Intelligent Information Processing and Web Mining. Advances in Soft Computing, vol 35. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-33521-8_56

Download citation

DOI: https://doi.org/10.1007/3-540-33521-8_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33520-7
Online ISBN: 978-3-540-33521-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics