Medicinal Property Knowledge Extraction from Herbal Documents for Supporting Question Answering System

Pechsiri, Chaveevan; Painuall, Sumran; Janviriyasopak, Uraiwan

doi:10.1007/978-3-642-28320-8_37

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7104)

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

Abstract

The aim of this paper is to automatically extract the medicinal properties of an object, especially an herb, from technical documents, to serve as a knowledge source for health-care problem solving through a question-answering system, in particular What-questions about disease treatment. The extracted medicinal property knowledge is expressed over multiple simple sentences, or EDUs (Elementary Discourse Units). Extracting this knowledge raises three problems: identifying the herbal object, identifying the medicinal property of each object, and determining the boundary of the medicinal property. We propose using NLP (Natural Language Processing) with a statistics-based approach to identify the medicinal property, and a machine learning technique, Naïve Bayes with verb features, to solve the boundary problem. The results show that medicinal property extraction achieves a precision of 86% and a recall of 77%, along with 87% correctness in boundary determination.
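The boundary step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract only states that Naïve Bayes with verb features is used, so the labels ("property", "other"), the English placeholder verbs, the Laplace smoothing, and the function names below are all assumptions made for illustration. The sketch labels each EDU by its verbs and treats the first EDU not labelled "property" as the end of the medicinal property span.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy training data: (verbs found in an EDU, label).
# The paper works on Thai herbal documents; these English verbs are placeholders.
TRAIN = [
    (["relieve"], "property"),
    (["cure"], "property"),
    (["reduce"], "property"),
    (["treat", "relieve"], "property"),
    (["grow"], "other"),
    (["plant"], "other"),
    (["harvest"], "other"),
    (["grow", "flower"], "other"),
]

def train_naive_bayes(examples):
    """Count labels and per-label verb occurrences."""
    label_counts = Counter()
    verb_counts = defaultdict(Counter)
    vocab = set()
    for verbs, label in examples:
        label_counts[label] += 1
        for verb in verbs:
            verb_counts[label][verb] += 1
            vocab.add(verb)
    return label_counts, verb_counts, vocab

def predict(model, verbs):
    """Return the label with the highest log-posterior (Laplace-smoothed)."""
    label_counts, verb_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, float("-inf")
    for label in sorted(label_counts):
        score = math.log(label_counts[label] / total)
        denom = sum(verb_counts[label].values()) + len(vocab)
        for verb in verbs:
            if verb in vocab:  # verbs never seen in training are ignored
                score += math.log((verb_counts[label][verb] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

def property_boundary(model, edu_verbs):
    """Number of consecutive EDUs, from the start, labelled 'property'."""
    count = 0
    for verbs in edu_verbs:
        if predict(model, verbs) != "property":
            break
        count += 1
    return count

if __name__ == "__main__":
    model = train_naive_bayes(TRAIN)
    edus = [["relieve"], ["cure", "reduce"], ["grow"], ["treat"]]
    print(property_boundary(model, edus))
```

With the toy data above, the first two EDUs are labelled "property" and the third is not, so the span is taken to cover two EDUs. A real system would first need the word segmentation, named entity and EDU segmentation steps on Thai text, which are outside this sketch.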

Author information

Authors and Affiliations

Dept. of Information Technology, Dhurakij Pundit University, Bangkok, Thailand
Chaveevan Pechsiri & Sumran Painuall
Eastern Industry Co., Ltd., Bangkok, Thailand
Uraiwan Janviriyasopak

Editor information

Editors and Affiliations

Faculty of Engineering and Information Technology, University of Technology Sydney, Broadway, PO Box 123, NSW 2007, Sydney, Australia
Longbing Cao
Shenzhen Institute of Advanced Technology (SIAT), Chinese Academy of Sciences, 518055, Shenzhen, China
Joshua Zhexue Huang & Jun Luo
The University of Melbourne, VIC 3010, Melbourne, Australia
James Bailey
The University of Auckland, Auckland, New Zealand
Yun Sing Koh

About this paper

Cite this paper

Pechsiri, C., Painuall, S., Janviriyasopak, U. (2012). Medicinal Property Knowledge Extraction from Herbal Documents for Supporting Question Answering System. In: Cao, L., Huang, J.Z., Bailey, J., Koh, Y.S., Luo, J. (eds) New Frontiers in Applied Data Mining. PAKDD 2011. Lecture Notes in Computer Science, vol 7104. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28320-8_37

DOI: https://doi.org/10.1007/978-3-642-28320-8_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28319-2
Online ISBN: 978-3-642-28320-8
eBook Packages: Computer Science (R0)