Intelligent Content Extraction from Polish Medical Reports

  • Małgorzata Marciniak
  • Agnieszka Mykowiecka
  • Anna Kupść
  • Jakub Piskorski
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3490)


The paper presents a method for intelligent automatic processing of medical reports. First, we extract individual pieces of information using SProUT, a general-purpose Information Extraction platform. We then merge the results outside SProUT to obtain a detailed, formalised description of the reports.
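The two-step idea (extract isolated pieces of information, then merge them externally) can be illustrated with a minimal sketch. It is not the authors' implementation and does not use any SProUT API. The fragment layout (`finding_id`, `attributes`), the function name `merge_fragments`, and the sample values are all hypothetical, invented only to show what a merging step after extraction might look like.

```python
from collections import defaultdict


def merge_fragments(fragments):
    """Merge separately extracted fragments into one record per finding.

    Each fragment is assumed (hypothetically) to be a dict with a
    'finding_id' and a dict of 'attributes' produced by an extraction step.
    """
    findings = defaultdict(dict)
    for frag in fragments:
        record = findings[frag["finding_id"]]
        for key, value in frag["attributes"].items():
            if key in record and record[key] != value:
                # Keep the first value and note the disagreement
                # instead of silently overwriting it.
                record.setdefault("_conflicts", []).append((key, value))
            else:
                record[key] = value
    return dict(findings)


# Invented sample fragments, standing in for the output of an extraction step.
fragments = [
    {"finding_id": 1, "attributes": {"type": "mass", "location": "outer quadrant"}},
    {"finding_id": 1, "attributes": {"size_mm": 12}},
    {"finding_id": 2, "attributes": {"type": "calcification"}},
]

report = merge_fragments(fragments)
```

In this sketch, conflicting values are recorded under a `_conflicts` key rather than resolved. A real system would need its own policy for such cases.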


Keywords (machine-generated, not supplied by the authors): Extraction Rule, Output Structure, Grammar Rule, Outer Quadrant, Coreference Resolution





Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Małgorzata Marciniak (1)
  • Agnieszka Mykowiecka (1)
  • Anna Kupść (1)
  • Jakub Piskorski (2)
  1. Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland
  2. DFKI GmbH, Saarbrücken, Germany
