Comparison of String Distance Metrics for Lemmatisation of Named Entities in Polish
This paper presents the results of recent experiments on applying string distance metrics to the problem of named entity lemmatisation in Polish. It extends our earlier work by introducing new results for organisation names. Furthermore, the results presented here and in [2,3], which address the same topic, were used to carry out a comparative study of the average usefulness of the examined string distance metrics for lemmatising Polish named entities of various types. In particular, we focus on the lemmatisation of country names, organisation names and person names.
Keywords: named entities, lemmatisation, string distance metrics, highly inflective languages