Abstract
Web monitoring systems report any changes to their target web pages by revisiting them frequently. As they operate under significant resource constraints, it is essential to minimize revisits while ensuring minimal delay and maximum coverage. Various statistical scheduling methods have been proposed to resolve this problem; however, they are static and cannot easily cope with events in the real world. This paper proposes a new scheduling method that manages unpredictable events. An MCRDR (Multiple Classification Ripple-Down Rules) document classification knowledge base was reused to detect events and to initiate a prompt web monitoring process independent of a static monitoring schedule. Our experiment demonstrates that the approach improves monitoring efficiency significantly.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Liu, L., Pu, C., Han, W.: CONQUER: a continual query system for update monitoring in the WWW. Computer Systems Science and Engineering 14(2), 99–112 (1999)
Naughton, J., et al.: The Niagara internet query system. IEEE Data Engineering Bulletin 24(2), 27–33 (2001)
Liu, L., Pu, C., Tang, W.: Continual Queries for Internet Scale Event-Driven Information Delivery. IEEE Transactions on Knowledge and Data Engineering 11(4), 610–628 (1999)
Liu, L., Pu, C., Tang, W.: WebCQ: Detecting and delivering information changes on the Web. In: CIKM 2000. ACM Press, Washington D.C (2000)
Pandey, S., Ramamritham, K., Chakrabarti, S.: Monitoring the dynamic web to respond to continuous queries. In: WWW 2003, Budapest, Hungary (2003)
Pandey, S., Dhamdhere, K., Olston, C.: WIC: A General-Purpose Algorithm for Monitoring Web Information Sources. In: 30th VLDB Conference, Toronto, Canada (2004)
Bright, L., Gal, A., Raschid, L.: Adaptive pull-based policies for wide area data delivery. ACM Transactions on Database Systems (TODS) 31(2), 631–671 (2006)
Kang, B., Compton, P., Preston, P.: Multiple Classification Ripple Down Rules: Evaluation and Possibilities. In: 9th AAAI-Sponsored Banff Knowledge Acquisition for Knowledge-Based Systems Workshop, Banff, Canada, University of Calgary (1995)
Kim, Y.S., et al.: Adaptive Web Document Classification with MCRDR. In: International Conference on Information Technology: Coding and Computing ITCC 2004, Orleans, Las Vegas, Nevada, USA (2004)
Park, S.S., Kim, Y.S., Kang, B.H.: Web Document Classification: Managing Context Change. In: IADIS International Conference WWW/Internet 2004, Madrid, Spain (2004)
Kim, Y.S., et al.: Incremental Knowledge Management of Web Community Groups on Web Portals. In: 5th International Conference on Practical Aspects of Knowledge Management, Vienna, Austria (2004)
Kim, Y.S., et al.: Knowledge Acquisition Behavior Anaysis in the Open-ended Document Classification. In: 19th ACS Australian Joint Conference on Artificial Intelligence, Hobart, Australia (2006)
Kang, B.-h., Kim, Y.S., Choi, Y.J.: Does multi-user document classification really help knowledge management? In: Orgun, M.A., Thornton, J. (eds.) AI 2007. LNCS, vol. 4830, pp. 327–336. Springer, Heidelberg (2007)
Brewington, B.E., Cybenko, G.: Keeping Up with the Changing Web. Computer 33(5), 52–58 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, Y.S., Kang, S.W., Kang, B.H., Compton, P. (2009). Using Knowledge Base for Event-Driven Scheduling of Web Monitoring Systems. In: Di Noia, T., Buccafurri, F. (eds) E-Commerce and Web Technologies. EC-Web 2009. Lecture Notes in Computer Science, vol 5692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03964-5_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-03964-5_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03963-8
Online ISBN: 978-3-642-03964-5
eBook Packages: Computer ScienceComputer Science (R0)