Abstract
This chapter describes systems that automatically classify web pages into meaningful categories. It first defines two types of web page classification: subject-based and genre-based. It then describes the state-of-the-art techniques and subsystems used to build automatic web page classification systems, including web page representation, dimensionality reduction, web page classifiers, and the evaluation of those classifiers. Such systems are essential tools for Web mining and for the future of the Semantic Web.
About this chapter
Cite this chapter
Choi, B., Yao, Z. Web Page Classification*. In: Chu, W., Young Lin, T. (eds) Foundations and Advances in Data Mining. Studies in Fuzziness and Soft Computing, vol 180. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11362197_9
DOI: https://doi.org/10.1007/11362197_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25057-9
Online ISBN: 978-3-540-32393-8
eBook Packages: Engineering (R0)