Effectiveness of Keyword and Semantic Relation Extraction for Knowledge Map Generation

Sornlertlamvanich, Virach; Kruengkrai, Canasai

doi:10.1007/978-3-319-31468-6_14

Effectiveness of Keyword and Semantic Relation Extraction for Knowledge Map Generation

Virach Sornlertlamvanich¹⁵ &
Canasai Kruengkrai¹⁶

Conference paper
First Online: 13 March 2016

459 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9442))

Abstract

We explore the named entity (NE) recognition and semantic relation extraction technique on the Thai cultural database. Within the limited domain and well-structured database, our proposed method can perform in an acceptable high accuracy to generate the tuples of semantic relation for expressing the essence of the record in terms of infobox and knowledge map. In this paper, we propose a semantic relation extraction approach based on simple relation templates that determine relation types and their arguments. We attempt to reduce semantic drift of the arguments by using named entity models as semantic constraints. Experimental results indicate that our approach is very promising. We successfully apply our approach to a cultural database and discover more than 18,000 relation instances with expected high accuracy.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

References

Agichtein, E., Gravano, L.: Snowball: extracting relations from large plain-text collections. In: Proceedings of ICDL, pp. 85–94 (2000)
Google Scholar
Banko, M., Cafarella, M.J., Soderl, S., Broadhead, M., Etzioni, O.: Open information extraction from the web. In: Proceedings of IJCAI, pp. 2670–2676 (2007)
Google Scholar
Charoenporn, T., Kruengkrai, C., Sornlertlamvanich, V., Isahara, H.: Acquiring semantic information in the TCL’s computational lexicon. In: Proceedings of the Fourth Workshop on Asia Language Resources (2004)
Google Scholar
Chen, H., Benson, E., Naseem, T., Barzilay, R.: In-domain relation discovery with meta-constraints via posterior regularization. In: Proceedings of ACL-HLT, pp. 530–540 (2011)
Google Scholar
Crammer, K., McDonald, R., Pereira, F.: Scalable large-margin online learning for structured classification. In: Proceedings of NIPS Workshop on Learning with Structured Outputs (2005)
Google Scholar
Hoffmann, R., Zhang, C., Weld, D.S.: Learning 5000 relational extractors. In: Proceedings of ACL (2010)
Google Scholar
Kok, S., Domingos, P.: Extracting semantic networks from text via relational clustering. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part I. LNCS (LNAI), vol. 5211, pp. 624–639. Springer, Heidelberg (2008)
Chapter Google Scholar
Kruengkrai, C., Uchimoto, K., Kazama, J., Torisawa, K., Isahara, H., Jaruskulchai, C.: A word and character-cluster hybrid model for Thai word segmentation. In: Proceedings of InterBEST: Thai Word Segmentation Workshop (2009)
Google Scholar
Kruengkrai, C., Sornlertlamvanich, V., Buranasing, W., Charoenporn, T.: Semantic relation extraction from a cultural database. In: Proceedings of The 3rd Workshop on South and Southeast Asian NLP (2012)
Google Scholar
Lin, D., Pantel, P.: Dirt-discovery of inference rules from text. In: Proceedings of KDD, pp. 323–328 (2001)
Google Scholar
Pantel, P., Pennacchiotti, M.: Espresso: leveraging generic patterns for automatically harvesting semantic relations. In: Proceedings of ACL, pp. 113–120 (2006)
Google Scholar
Poon, H., Domingos, P.: Unsupervised semantic parsing. In: Proceedings of EMNLP, pp. 1–10 (2009)
Google Scholar
Schoenmackers, S., Etzioni, O., Weld, D.S., Davis, J.: Learning first-order horn clauses from web text. In: Proceedings of EMNLP, pp. 1088–1098 (2010)
Google Scholar
Sornlertlamvanich, V., Charoenporn, T., Isahara, H.: ORCHID: Thai part-of-speech tagged corpus. Technical report TR-NECTEC-1997-001, NECTEC (1997)
Google Scholar
Theeramunkong, T., Boriboon, M., Haruechaiyasak, C., Kittiphattanabawon, N., Kosawat, K., Onsuwan, C., Siriwat, I., Suwanapong, T., Tongtep, N.: THAI-NEST: a framework for Thai named entity tagging specification and tools. In: Proceedings of CILC (2010)
Google Scholar
Theeramunkong, T., Sornlertlamvanich, V., Tanhermhong, T., Chinnan, W.: Character cluster based Thai information retrieval. In Proceedings of IRAL, pp. 75–80 (2000)
Google Scholar
Yates, A., Etzioni, O.: Unsupervised methods for determining object and relation synonyms on the web. J. Artif. Intell. Res. 34, 255–296 (2009)
MATH Google Scholar

Download references

Acknowledgement

The experiments in this paper are conducted on the Thai Cultural Database of the Ministry of Culture, developed under the central information project since November 2010.

Author information

Authors and Affiliations

Sirindhorn International Institute of Technology, Thammasat University, Bangkok, Thailand
Virach Sornlertlamvanich
Graduate School of Information Sciences, Tohoku University, Sendai, Japan
Canasai Kruengkrai

Authors

Virach Sornlertlamvanich
View author publications
You can also search for this author in PubMed Google Scholar
Canasai Kruengkrai
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Virach Sornlertlamvanich .

Editor information

Editors and Affiliations

Unit of Design, Kyoto University, Kyoto, Japan
Yohei Murakami
Kyoto University, Kyoto, Japan
Donghui Lin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sornlertlamvanich, V., Kruengkrai, C. (2016). Effectiveness of Keyword and Semantic Relation Extraction for Knowledge Map Generation. In: Murakami, Y., Lin, D. (eds) Worldwide Language Service Infrastructure. WLSI 2015. Lecture Notes in Computer Science(), vol 9442. Springer, Cham. https://doi.org/10.1007/978-3-319-31468-6_14

Download citation

DOI: https://doi.org/10.1007/978-3-319-31468-6_14
Published: 13 March 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-31467-9
Online ISBN: 978-3-319-31468-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics