Advertisement

Ranking Web News Via Homepage Visual Layout and Cross-Site Voting

  • Jinyi Yao
  • Jue Wang
  • Zhiwei Li
  • Mingjing Li
  • Wei-Ying Ma
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3936)

Abstract

Reading news is one of the most popular activities when people surf the internet. As too many news sources provide independent news information and each has its own preference, detecting unbiased important news might be very useful for users to keep up to date with what are happening in the world. In this paper we present a novel method to identify important news in web environment which consists of diversified online news sites. We observe that a piece of important news generally occupies visually significant place in some homepage of a news site and import news event will be reported by many news sites. To explore these two properties, we model the relationship between homepages, news and latent events by a tripartite graph, and present an algorithm to identify important news in this model. Based on this algorithm, we implement a system TOPSTORY to dynamically generate homepages for users to browse important news reports. Our experimental study indicates the effectiveness of proposed approach.

Keywords

Average Importance Principal Eigenvector News Source News Site Topic Detection 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    2004 Web Usage Survey Results. Sponsored by Cerberian and SonicWall, http://www.cerberian.com/content/CerberianSonicWallSurveyResults.pdf
  2. 2.
    Broder, A.: Graph Structure in the Web. In: The Ninth International WWW Conference (2000)Google Scholar
  3. 3.
    Cai, D., Yu, S., Wen, J.-R., Ma, W.-Y.: VIPS: a Vision-based Page Segementation Algorithm. Microsoft Technical Report, MSR-TR-2003-79 (2003)Google Scholar
  4. 4.
    Allan, J., et al. (eds.): Topic Detection and Tracking. Springer, Heidelberg (2002)MATHGoogle Scholar
  5. 5.
    Allan, J., Carbonell, G., Doddington, J., Yamron, J., Yang, Y.: Topic detection and tracking pilot study: Final report. In: Proceedings of the Broadcast News Understanding and Transcription Workshop, pp. 194–218 (1998)Google Scholar
  6. 6.
    Allan, J., Lavrenko, V., Jin, H.: First story detection in TDT is hard. In: Proceedings of the Ninth International Conference on Information and Knowledge Management, pp. 374–381 (2000)Google Scholar
  7. 7.
    Kleinberg, J.M.: Authoritative Sources in a Hyperlinked Environment. Journal of the ACM 46(5), 604–622 (1999)MathSciNetCrossRefMATHGoogle Scholar
  8. 8.
    Kleinberg, J.M., Kumar, R., Raghavan, P., Rajagopalan, S., Tomkins, A.S.: The Web as a graph: measurements, models and methods. In: Proc. 5th Int. Computing and Combinatorics (1999)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Jinyi Yao
    • 1
  • Jue Wang
    • 2
  • Zhiwei Li
    • 1
  • Mingjing Li
    • 1
  • Wei-Ying Ma
    • 1
  1. 1.Microsoft Research AsiaBeijingChina
  2. 2.State Key Lab of Intelligent System and TechnologyTsinghua UniversityBeijingChina

Personalised recommendations