Abstract
Relation extraction is an important part of the natural language processing field that has been receiving increasing attention due to the massive growth of the information available on the web, which makes its tasks impossible through manual means. Although for domain-specific relation extraction tasks, pattern-based methods have a long and established history as a successful approach, and they can suffer from precision and recall problems and may require a lot of manual effort. To work around these issues, this paper proposes the application of well-known metaheuristics to select patterns that maximize performance metric. This approach was applied to a binary sentence-level relation extraction problem in Portuguese language, and the results were compared using statistical tests and F1 score, reaching a significant value of 0.67 with the harmony search algorithm. The other algorithms evaluated are genetic algorithm and simulated annealing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Hearst MA (1992) Automatic acquisition of hyponyms from large text corpora. In: Coling 1992 volume 2: the 15th international conference on computational linguistics, Nantes, France
Ravichandran D, Hovy E (2002) Learning surface text patterns for a question answering system. In: Proceedings of the 40th annual meeting of the Association for Computational Linguistics, Philadelphia, United States, pp 41–47
Mandya A, Bollegala D, Coenen F, Atkinson K (2017) Frame-based semantic patterns for relation extraction. In: International conference of the Pacific Association for Computational Linguistics. Springer, Singapore, pp 51–62
Jijkoun V, Mur J, de Rijke M (2004) Information extraction for question answering: improving recall through syntactic patterns. In: Proceedings of the 20th international conference on computational linguistics, Stroudsburg, United States, pp 1284–1290
Wu F, Weld DS (2010) Open information extraction using Wikipedia. In: Proceedings of the 48th annual meeting of the Association for Computational Linguistics, Uppsala, Sweden, pp 118–127
Ferreira J, Oliveira HG, Rodrigues R (2019) NLPyPort: named entity recognition with CRF and rule-based relation extraction. IberLEF@ SEPLN, pp 468–477
Boujelben I, Jamoussi S, Hamadou AB (2014) Genetic algorithm for extracting relations between named entities. In: 6th language and technology conference, Poznanń, Poland, pp 484–488
Bianchi L, Dorigo M, Gambardella LM, Gutjahr WJ (2009) A survey on metaheuristics for stochastic combinatorial optimization. Nat Comput 8(2):239–287
Pincus M (1970) A Monte Carlo method for the approximate solution of certain types of constrained optimization problems. Oper Res 18(6):1225–1228
Khachaturyan A, Semenovsovskaya S, Vainshtein B (1981) The thermodynamic approach to the structure analysis of crystals. Acta Crystallogr Sect A Crystal Phys Diffr Theor Gen Crystallogr 37(5):742–754
Kirkpatrick S, Gelatt CD, Vecchi MP (1983) Optimization by simulated annealing. Science 220(4598):671–680
Holland JH (1975) Adaptation in natural and artificial systems: an introductory analysis with applications to biology, control, and artificial intelligence. MIT Press, Cambridge
Geem ZW, Kim JH, Loganathan GV (2001) A new heuristic optimization algorithm: harmony search. Simulation 76(2):60–68
Batista DS, Forte D, Silva R, Martins B, Silva M (2013) Extracçao de relaçoes semânticas de textos em português explorando a dbpédia e a wikipédia. Linguamatica 5(1):41–57
Shapiro SS, Wilk MB (1965) An analysis of variance test for normality (complete samples). Biometrika 52(3/4):591–611
Wilcoxon F (1992) Individual comparisons by ranking methods. In: Breakthroughs in statistics. Springer, Berlin, pp 196–202
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Manke, L.F., dos Santos Coelho, L. (2022). Metaheuristics Applied to Pattern-Based Portuguese Relation Extraction. In: Kim, J.H., Deep, K., Geem, Z.W., Sadollah, A., Yadav, A. (eds) Proceedings of 7th International Conference on Harmony Search, Soft Computing and Applications. Lecture Notes on Data Engineering and Communications Technologies, vol 140. Springer, Singapore. https://doi.org/10.1007/978-981-19-2948-9_15
Download citation
DOI: https://doi.org/10.1007/978-981-19-2948-9_15
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-2947-2
Online ISBN: 978-981-19-2948-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)