Abstract
Given an entity (query), slot filling aims to find and extract the values (slot fillers) of its specific attributes (slot types) from a large-scale of document collections. Most existing work of slot filling models slot fillers separately and only considers direct relations between slot fillers and query, ignoring other slot fillers in context. In this paper we propose an unsupervised slot filler refinement approach via entity community construction to filter out the incorrect fillers collaboratively. The community-based framework mainly consists of (1) filler community generated by a point-wise mutual information-based hierarchical clustering, and (2) query community constructed by a co-occurrence graph model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
queryID: SF13_ENG_038, in KBP 2013 ESF data set.
- 2.
- 3.
per:{cause_of_death, date_of_birth, date_of_death, age, charges} and
org:{date_founded, date_dissolved, number_of_employees_members, website}.
- 4.
- 5.
With slot type per:parents.
References
Surdeanu, M.: Overview of the TAC2013 knowledge base population evaluation: English slot filling and temporal slot filling. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3), 75–174 (2009)
Ji, H., Grishman, R., Dang, H.T., Griffitt, K., Ellis, J.: Overview of the TAC 2010 knowledge base population track. In: Proceedings of the Third Text Analysis Conference (TAC) (2010)
Sammons, M., Song, Y., Wang, R., Kundu, G., Tsai, C.T., Upadhyay, S., Ancha, S., Mayhew, S., Roth, D.: Overview of UI-CCQ systems for event argument extraction, entity discovery and linking, and slot filler validation. In: Proceedings of the Seventh Text Analysis Conference (TAC) (2014)
Yu, D., Huang, H., Cassidy, T., Ji, H., Wang, C., Zhi, S., Han, J., Voss, C.R., Magdon-Ismail, M.: The wisdom of minority: unsupervised slot filling validation based on multi-dimensional truth-finding. In: Proceedings of the 25th International Conference on Computational Linguistics (COLING), pp. 1567–1578 (2014)
Rajani, N.F., Viswanathan, V., Bentor, Y., Mooney, R.J.: Stacked ensembles of information extractors for knowledge-base population. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL), pp. 177–187 (2015)
Xu, S., Zhang, C., Niu, Z., Mei, R., Chen, J., Zhang, J., Fu, H.: Bit’s slot-filling method for TAC-KBP 2013. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
Nguyen, T.H., He, Y., Pershina, M., Li, X., Grishman, R.: New York University 2014 knowledge base population systems. In: Proceedings of the Seventh Text Analysis Conference (TAC) (2014)
Białecki, A., Muir, R., Ingersoll, G., Imagination, L.: Apache Lucene 4. In: SIGIR 2012 Workshop on Open Source Information Retrieval (2012)
Angeli, G., Premkumar, M.J., Manning, C.D.: Leveraging linguistic structure for open domain information extraction. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL), pp. 344–354 (2015)
Dasgupta, S., Long, P.M.: Performance guarantees for hierarchical clustering. J. Comput. Syst. Sci. 70(4), 555–569 (2005)
Prim, R.C.: Shortest connection networks and some generalizations. Bell Labs Tech. J. 36(6), 1389–1401 (1957)
Pakhira, M.K.: A fast k-means algorithm using cluster shifting to produce compact and separate clusters (research note). Int. J. Eng.-Trans. A: Basics 28(1), 35–43 (2015)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)
Rao, C.R.: A review of canonical coordinates and an alternative to correspondence analysis using Hellinger distance. Qüestiió: quaderns d’estadÃstica i investigació operativa 19(1), 23–63 (1995)
Girvan, M., Newman, M.E.: Community structure in social and biological networks. Proc. Nat. Acad. Sci. 99(12), 7821–7826 (2002)
Clauset, A., Newman, M.E., Moore, C.: Finding community structure in very large networks. Phys. Rev. E 70(6), 066111 (2004)
Roth, B., Barth, T., Wiegand, M., Singh, M., Klakow, D.: Effective slot filling based on shallow distant supervision methods. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
Yu, D., Li, H., Cassidy, T., Li, Q., Huang, H., Chen, Z., Ji, H., Zhang, Y., Roth, D.: RPI-BLENDER TAC-KBP2013 knowledge base population system. In: Theory and Applications of Categories (2013)
Angeli, G., Chaganty, A.T., Chang, A.X., Reschke, K., Tibshirani, J., Wu, J., Bastani, O., Siilats, K., Manning, C.D.: Stanford’s 2013 KBP system. In: Proceedings of the Sixth Text Analysis Conference (TAC) (2013)
Powers, D.M.W.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. J. Mach. Learn. Technol. 2(1), 37–63 (2011)
Griffiths, T.: Gibbs Sampling in the Generative Model of Latent Dirichlet Allocation (2002)
Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proc. Nat. Acad. Sci. 101(Suppl. 1), 5228–5235 (2004)
Lewis, J., Ossowski, S., Hicks, J., Errami, M., Garner, H.R.: Text similarity: an alternative way to search medline. Bioinformatics 22(18), 2298–2304 (2006)
Acknowledgement
This research work is supported by National Natural Science Foundation of China (Grants No. 61672367, No. 61672368, No. 61703293), the Research Foundation of the Ministry of Education and China Mobile, MCM20150602 and the Science and Technology Plan of Jiangsu, SBK2015022101 and BK20151222. The authors would like to thank the anonymous reviewers for their insightful comments and suggestions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Xu, Z., Song, R., Zou, B., Hong, Y. (2018). Unsupervised Slot Filler Refinement via Entity Community Construction. In: Huang, X., Jiang, J., Zhao, D., Feng, Y., Hong, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2017. Lecture Notes in Computer Science(), vol 10619. Springer, Cham. https://doi.org/10.1007/978-3-319-73618-1_54
Download citation
DOI: https://doi.org/10.1007/978-3-319-73618-1_54
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73617-4
Online ISBN: 978-3-319-73618-1
eBook Packages: Computer ScienceComputer Science (R0)