Abstract
The development of network systems and the widespread access to the Internet generate the need for efficient tools of information retrieval. The globalisation of the Internet doesn’t mean the unification of the languages used there even if we observe the dominance of English. Much information is accessible in other national languages which, however, for good understanding of the text, require the use of language specific characters, diacritical marks and/or accents. The results of testing the effectiveness of retrieval to queries expressed in Polish language using words with diacritics are presented. Then the influence of Polish local characters on the number of items retrieved by search engines is analysed and the reasons for using or not using diacritics is examined.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bar-Ilan, J., Gutman, T.: How do search engines handle non-English queries? - A case study. In: WWW2003 Proc. of the Twelfth International World Wide Web Conference (2003),available at, http://www2003.org/cdrom/papers/alternate/P415/BARILAN.HTM
Brooks, T.A.: Orthography as a fundamental impediment to online information retrieval. J. ASIS 49(8), 731–741 (1998)
Can, F., Nuray, R., Sevdik, A.B.: Automatic performance evaluation of Web search engines. Inf. Proc. Man. 40(3), 495–514 (2004)
Choroś, K.:: Effectiveness of Internet search engines (in Polish). In: Multimedialne i sieciowe systemy informacyjne. Ofic. Wyd. Polit. Wroc., Wroclaw. Efektywnośé wyszukiwarek internetowych, pp. 115–123 (2002)
Clarke, S.J., Willet, P.: Estimating the recall performance of Web search engines. ASLIB Proc. 49(6), 184–189 (1997)
Craven, T.C.: Variations in use of meta tag descriptions by Web pages in different languages. Inf. Proc. Man. 40(3), 479–493 (2004)
Fricke, M.: Measuring recall. J. of Inf. Science 24(6), 409–417 (1998)
Grefenstette, G, Nioche, J.: Estimation of English and non-English Language Use on the WWW, Available at, http://arxiv.org/ftp/cs/papers/0006/0006032.pdf
Salton, G., McGill, M.J.: Introduction to Modern information Retrieval., vol. 164, p. 55. McGraw-Hill, Inc, New York (1983)
Sroka, M.: Web search engines for Polish information retrieval: Questions of search capabilities and retrieval performance. Int. Inf. Lib. Res. 32, 87–98 (2000)
Vaughan, L.: New measurements for search engine evaluation proposed and tested. Inf. Proc. Man. 40(4), 677–691 (2004)
www.global-reach.biz/globstats (Accessed on November 12, 2004)
www.searchengineshowdown.com/newsarchive/000358.shtml (Accessed on November 16, 2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Choroś, K. (2005). Testing the Effectiveness of Retrieval to Queries Using Polish Words with Diacritics. In: Szczepaniak, P.S., Kacprzyk, J., Niewiadomski, A. (eds) Advances in Web Intelligence. AWIC 2005. Lecture Notes in Computer Science(), vol 3528. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11495772_17
Download citation
DOI: https://doi.org/10.1007/11495772_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26219-0
Online ISBN: 978-3-540-31900-9
eBook Packages: Computer ScienceComputer Science (R0)