Novel Nature Inspired Techniques in Medical Information Retrieval

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6865)

Abstract

In this work we have studied, evaluated and proposed different swarm intelligence techniques for mining information from loosely structured medical textual records with no a priori knowledge. We describe the process of mining a large dataset of ~50,000–120,000 records × 20 attributes in database tables, originating from a hospital information system that has been recording for over 10 years. This paper concerns only textual attributes with free-text input, that is, 613,000 text fields in 16 attributes. Each attribute item contains ~800–1,500 characters (diagnoses, medications, etc.). The output of this task is a set of ordered/nominal attributes suitable for rule discovery mining.

Information mining from textual data becomes a very challenging task when the structure of the text record is very loose and follows no rules. The task becomes even harder when natural language is used and no a priori knowledge is available. The medical environment itself is also very specific: the natural language used in textual descriptions varies with the person creating the record; however, it is restricted by terminology (i.e. medical terms, medical standards, etc.). Moreover, the typical patient record is filled with typographical errors, duplicates and many (nonstandard) abbreviations.

Nature-inspired methods have their origin in real natural processes and play an important role in the domain of artificial intelligence. They offer fast and robust solutions to many problems, although they belong to the branch of approximate methods. The high number of individuals and the decentralized approach to task coordination in the studied species reveal a high degree of parallelism, self-organization and fault tolerance.

First, classical approaches such as basic statistical methods, word (and word sequence) frequency analysis, etc., have been used to simplify the textual data and provide an overview of the data. Then, an ant-inspired self-organizing approach has been used to automatically provide a simplified dominant structure, presenting the structure of the records in a human-readable form that can be further utilized in the mining process, as it describes the vast majority of the records.
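
The word and word-sequence frequency analysis mentioned above can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `ngram_counts`, the tokenization rule and the sample records are assumptions made purely for illustration, and real hospital records would need handling of typographical errors and abbreviations that is omitted here.

```python
import re
from collections import Counter


def ngram_counts(records, n=1):
    """Count word n-grams over a list of free-text records.

    Hypothetical helper: text is lower-cased and split on runs of
    word characters, so punctuation and letter case are ignored.
    """
    counts = Counter()
    for text in records:
        tokens = re.findall(r"\w+", text.lower())
        # Slide a window of n tokens over the record.
        counts.update(
            " ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)
        )
    return counts


# Invented sample records; they do not come from the paper's dataset.
records = [
    "Dg: diabetes mellitus, hypertension",
    "dg. diabetes mellitus; medication: insulin",
    "Hypertension, medication: insulin",
]

unigrams = ngram_counts(records, n=1)  # single-word frequencies
bigrams = ngram_counts(records, n=2)   # word-sequence frequencies
```

Calling `unigrams.most_common(5)` or `bigrams.most_common(5)` then gives a quick overview of the dominant terms and term sequences, which is the kind of summary such a preprocessing step is meant to provide.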

Note that this project is ongoing research: new data arrive irregularly from the medical facility, which justifies the need for robust and fool-proof algorithms.

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bursa, M. et al. (2011). Novel Nature Inspired Techniques in Medical Information Retrieval. In: Böhm, C., Khuri, S., Lhotská, L., Pisanti, N. (eds) Information Technology in Bio- and Medical Informatics. ITBAM 2011. Lecture Notes in Computer Science, vol 6865. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23208-4_3

  • DOI: https://doi.org/10.1007/978-3-642-23208-4_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23207-7

  • Online ISBN: 978-3-642-23208-4

  • eBook Packages: Computer Science, Computer Science (R0)
