Abstract
Contextual advertising is an important part of the Web economy today. Profit is linked to the interest that users find in the ads presented to them. The problem is for contextual advertising platforms to select the most relevant ads. Simple keyword matching techniques for matching ads to page content give poor accuracy. Problems such as homonymy, polysemy, limited intersection between content and selection keywords as well as context mismatch can significantly degrade the precision of ads selection. In this paper, we propose a method for improving the relevance of contextual ads based on “Wikipedia matching”. It is a technique that uses Wikipedia articles as “reference points” for ads selection. In our research, we worked on English language, but it is possible to port the algorithm to other languages.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Hayter, A.J.: Probability and Statistics for Engineers and Scientists. Duxbury, Belmont (2007)
Broder, A., Fontoura, M., Josifovski, V., Riedel, L.: Semantic approach to contextual advertising. In: Proc. of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, The Netherlands (2007)
Deerwester, S., Dumais, S., Furnas, G., Landauer, T., Harshman, R.: Indexing by latent semantic analysis. Journal of the American Society for Information Science 41(6), 407–491 (1988)
Jun, Z.: Comprehensive Perl Archive Network (2007), http://search.cpan.org/~jzhang/html-contentextractor-0.02/lib/html/contentextractor.pm
Manning, C.D., Schütze, H.: Foundations of statistical natural language processing. MIT Press, Cambridge (1999)
Murdock, V., Ciaramita, M., Plachouras, V.: A noisy-channel approach to contextual advertising. In: Proc. of the 1st International Workshop on Data Mining and Audience Intelligence for Advertising, San Jose, California, pp. 21–27 (2007)
Porter, M.F.: An algorithm for suffix stripping. Readings in Information Retrieval, 313–316 (1997)
Ribeiro-Neto, B., Cristo, M.: Impedance coupling in content-targeted advertising. In: Proc. of the 28th annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil, pp. 496–503 (2005)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM 18(11), 613–620 (1975)
Zhang, Y., Vogel, S.: Measuring confidence intervals for the machine translation evaluation metrics. In: Proceedings of the 10th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI 2004), pp. 4–6 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pak, A. (2011). Using Wikipedia to Improve Precision of Contextual Advertising. In: Vetulani, Z. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2009. Lecture Notes in Computer Science(), vol 6562. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20095-3_49
Download citation
DOI: https://doi.org/10.1007/978-3-642-20095-3_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20094-6
Online ISBN: 978-3-642-20095-3
eBook Packages: Computer ScienceComputer Science (R0)