The Effects of Web Logs and the Semantic Web on Autonomous Web Agents
Search engines exploit the Web’s hyperlink structure to help infer information content. The new phenomenon of personal Web logs, or ‘blogs’, encourage more extensive annotation of Web content. If their resulting link structures bias the Web crawling applications that search engines depend upon, there are implications for another form of annotation rapidly on the rise, the Semantic Web. We conducted a Web crawl of 160 000 pages in which the link structure of the Web is compared with that of several thousand blogs. Results show that the two link structures are significantly different. We analyse the differences and infer the likely effect upon the performance of existing and future Web agents. The Semantic Web offers new opportunities to navigate the Web, but Web agents should be designed to take advantage of the emerging link structures, or their effectiveness will diminish.
Unable to display preview. Download preview PDF.