Abstract
The world wide web is the biggest information source which people consult daily for facts and events. Studies demonstrate that 30% of the searches relate to proper names such as organizations, actors, singers, books or movie titles. However, a serious problem is posed by the high level of ambiguity where one and the same name can be shared by different individuals or even across different proper name categories. In order to provide faster and more relevant access to the requested information, current research focuses on the clustering of web pages related to the same individual. In this paper, we focus on the resolution of the web people search problem through the integration of domain information.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Javier, A., Gonzalo, J., Sekine, S.: The semeval-2007 weps evaluation: Establishing a benchmark for the web people search task. In: Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007), pp. 64–69 (2007)
Bagga, A., Baldwin, B.: Entity-based cross-document coreferencing using the vector space model. In: Proceedings of the Thirty-Sixth Annual Meeting of the Association for Computational Linguistics and Seventeenth International Conference on Computational Linguistics, pp. 79–85 (1998)
Pedersen, T., Purandare, A., Kulkarni, A.: Name discrimination by clustering similar contexts. In: Gelbukh, A. (ed.) CICLing 2005. LNCS, vol. 3406, pp. 226–237. Springer, Heidelberg (2005)
Kozareva, Z., Vázquez, S., Montoyo, A.: Multilingual name disambiguation with semantic information. In: Matoušek, V., Mautner, P. (eds.) TSD 2007. LNCS (LNAI), vol. 4629, pp. 23–30. Springer, Heidelberg (2007)
Pedersen, T., Kulkarni, A.: Unsupervised discrimination of person names in web contexts. In: Gelbukh, A. (ed.) CICLing 2007. LNCS, vol. 4394, pp. 299–310. Springer, Heidelberg (2007)
Kozareva, Z., Vázquez, S., Montoyo, A.: Discovering the underlying meanings and categories of a name through domain and semantic information. In: Proceedings of the Conference on Recent Advances in Natural Language Processing RANLP (2007)
Chen, Y., Martin, J.H.: Cu-comsem: Exploring rich features for unsupervised web personal name disambiguation. In: Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007), pp. 125–128 (2007)
Popescu, O., Magnini, B.: Irst-bp: Web people search using name entities. In: Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007), pp. 195–198 (2007)
Agirre, E., Soroa, A.: Ubc-as: A graph based unsupervised system for induction and classification. In: Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007), pp. 346–349 (2007)
Magnini, B., Cavaglia, G.: Integrating subject field codes into wordnet. In: Proceedings of LREC-2000, Second International Conference on Language Resources and Evaluation, pp. 1413–1418 (2000)
Esuli, A., Sebastiani, F.: Pageranking wordnet synsets: An application to opinion mining. In: Proceedings of ACL-2007, the 45th Annual Meeting of the Association of Computational Linguistics, pp. 424–431 (2007)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN Systems 30, 107–117 (1998)
Liu, H.: Montylingua: An end-to-end natural language processor with common sense (2004), http://web.media.mit.edu/~hugo/montylingua
Cleuziou, G., Martin, L., Vrain, C.: Poboc: an overlapping clustering algorithm. In: Application to rule-based classification and textual data, pp. 440–444 (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kozareva, Z., Moraliyski, R., Dias, G. (2008). Web People Search with Domain Ranking. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2008. Lecture Notes in Computer Science(), vol 5246. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87391-4_19
Download citation
DOI: https://doi.org/10.1007/978-3-540-87391-4_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87390-7
Online ISBN: 978-3-540-87391-4
eBook Packages: Computer ScienceComputer Science (R0)