Ranking Web News Via Homepage Visual Layout and Cross-Site Voting
Reading news is one of the most popular activities when people surf the internet. Because many news sources publish independently and each has its own preferences, detecting important news without bias toward any single source can help users keep up to date with what is happening in the world. In this paper we present a novel method to identify important news in a web environment that consists of diversified online news sites. We observe that a piece of important news generally occupies a visually significant place on the homepage of some news site, and that an important news event will be reported by many news sites. To exploit these two properties, we model the relationship between homepages, news and latent events as a tripartite graph, and present an algorithm to identify important news in this model. Based on this algorithm, we implement a system, TOPSTORY, that dynamically generates homepages for users to browse important news reports. Our experimental study indicates the effectiveness of the proposed approach.
Keywords: Average Importance, Principal Eigenvector, News Source, News Site, Topic Detection
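The abstract names the two signals (visual prominence on a homepage, and coverage of the same event by many sites) but not the update rules that combine them. The sketch below is one possible reading, not the paper's algorithm: a HITS-style mutual-reinforcement iteration over a homepage/news/event tripartite graph. The function `rank_news`, the multiplicative way of combining a news item's visual evidence with its event's score, the visual-strength values, and the toy data are all assumptions made for illustration.

```python
from math import sqrt


def normalize(scores):
    """Scale a dict of non-negative scores to unit Euclidean length."""
    norm = sqrt(sum(v * v for v in scores.values()))
    return {k: (v / norm if norm else 0.0) for k, v in scores.items()}


def rank_news(visual, event_of, iterations=50):
    """Hypothetical mutual-reinforcement ranking (an assumption, not the paper's method).

    visual[(site, news)] -> visual strength in [0, 1] of `news` on the site's homepage
    event_of[news]       -> id of the latent event the news item reports
    Every news id used in `visual` must also appear in `event_of`.
    """
    sites = {site for site, _ in visual}
    site_score = {s: 1.0 for s in sites}
    news_score = {n: 1.0 for n in event_of}
    event_score = {}

    for _ in range(iterations):
        # Homepage -> news: prominent placement on a credible site adds evidence.
        raw_news = {n: 0.0 for n in event_of}
        for (site, news), strength in visual.items():
            raw_news[news] += strength * site_score[site]

        # News -> event: an event gathers the evidence of all reports about it,
        # so events covered by many sites score higher (cross-site voting).
        raw_event = {}
        for news, event in event_of.items():
            raw_event[event] = raw_event.get(event, 0.0) + raw_news[news]
        event_score = normalize(raw_event)

        # Event -> news: assumed combination rule, visual evidence times event score.
        news_score = normalize(
            {n: raw_news[n] * event_score[event_of[n]] for n in event_of}
        )

        # News -> homepage: a site gains credibility by featuring important news.
        raw_site = {s: 0.0 for s in sites}
        for (site, news), strength in visual.items():
            raw_site[site] += strength * news_score[news]
        site_score = normalize(raw_site)

    return site_score, news_score, event_score


# Toy data (made up): three sites lead with the same event, one site also
# carries a low-prominence story about a second event.
visual = {
    ("A", "a1"): 0.9,
    ("B", "b1"): 0.8,
    ("C", "c1"): 0.7,
    ("A", "a2"): 0.3,
}
event_of = {"a1": "quake", "b1": "quake", "c1": "quake", "a2": "sports"}

site_score, news_score, event_score = rank_news(visual, event_of)
top_story = max(news_score, key=news_score.get)
```

In this toy setup the event reported by three sites outranks the one reported by a single site, and the most prominently placed report of that event becomes the top story. Because the scores are renormalized on every pass, the iteration concentrates weight on the strongest site, as power-style iterations tend to do; a real system would likely need damping or a different combination rule.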