Construction and Analysis of Web-Based Computer Science Information Networks

  • Jiawei Han
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6743)


With the rapid development of the Web, huge amounts of information are available on the Web in the form of Web documents, structures, and links. It has been a dream of the database and Web communities to harvest information exhibited on the Web and reconcile the unstructured nature of the Web with the semi-structured schemas of the database paradigm. This is a challenging task. Even though databases are currently used to generate Web content in some sites, the schemas of these databases are rarely consistent across a domain. However, with the recent research in Web structure mining and information network analysis, major progress has been made at discovering Web hidden structures, constructing heterogeneous information networks by integration of information from structured databases and Web contents, and performing in-depth analysis for systematic harvesting of such rich information on the Web.


  1. 1.
    Ji, M., Sun, Y., Danilevsky, M., Han, J., Gao, J.: Graph regularized transductive classification on heterogeneous information networks. In: Proc. 2010 European Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2010), Barcelona, Spain (September 2010)Google Scholar
  2. 2.
    Sun, Y., Han, J., Yan, X., Yu, P.S., Wu, T.: PathSim: Meta path-based top-k similarity search in heterogeneous information networks. In: Proc. 2011 Int. Conf. on Very Large Data Based (VLDB 2011), Seattle, WA (August 2011)Google Scholar
  3. 3.
    Sun, Y., Han, J., Zhao, P., Yin, Z., Cheng, H., Wu, T.: RankClus: Integrating clustering with ranking for heterogeneous information network analysis. In: Proc. 2009 Int. Conf. on Extending Data Base Technology (EDBT 2009), Saint-Petersburg, Russia (March 2009)Google Scholar
  4. 4.
    Sun, Y., Yu, Y., Han, J.: Ranking-based clustering of heterogeneous information networks with star network schema. In: Proc. 2009 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD 2009), Paris, France (June 2009)Google Scholar
  5. 5.
    Wang, C., Han, J., Jia, Y., Tang, J., Zhang, D., Yu, Y., Guo, J.: Mining advisor-advisee relationships from research publication networks. In: Proc. 2010 ACM SIGKDD Conf. on Knowledge Discovery and Data Mining (KDD 2010), Washington D.C (July 2010)Google Scholar
  6. 6.
    Weninger, T., Danilevsky, M., Fumarola, F., Hailpern, J., Han, J., Ji, M., Johnston, T.J., Kallumadi, S., Kim, H., Li, Z., McCloskey, D., Sun, Y., TeGrotenhuis, N.E., Wang, C., Yu, X.: Winacs: Construction and analysis of web-based computer science information networks. In: Proc. of 2011 ACM SIGMOD Int. Conf. on Management of Data (SIGMOD 2011) (system demo), Athens, Greece (June 2011)Google Scholar
  7. 7.
    Weninger, T., Fumarola, F., Han, J., Malerba, D.: Mapping web pages to database records via link paths. In: Proc. 2010 ACM Int. Conf. on Information and Knowledge Management (CIKM 2010), Toronto, Canada (October 2010)Google Scholar
  8. 8.
    Weninger, T., Fumarola, F., Lin, C.X., Barber, R., Han, J., Malerba, D.: Growing parallel paths for entity-page discovery. In: Proc. of 2011 Int. World Wide Web Conf (WWW 2011), Hyderabad, India (March 2011)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Jiawei Han
    • 1
  1. 1.Department of Computer ScienceUniversity of Illinois at Urbana-ChampaignUSA

Personalised recommendations